Last seen: 1 year ago | Active since 2022

Followers: 0   Following: 0

Statistics

  • First Answer


Feeds


Question


When using the PPO and TRPO algorithms in the Reinforcement Learning Designer app to output continuous actions, the action values fall outside the specified range
%Open model
mdl='FCEV';
blk='FCEV/RL Agent';
%open_system(mdl);
%(s,a)
obsInfo = rlNumericSpec([3 1]);
obsInfo.Name = ...

1 year ago | 1 answer | 1
