Statistics
1 Question
0 Answers
RANK
293,670
of 300,553
REPUTATION
0
CONTRIBUTIONS
1 Question
0 Answers
ANSWER ACCEPTANCE
0.0%
VOTES RECEIVED
0
RANK
of 21,024
REPUTATION
N/A
AVERAGE RATING
0.00
CONTRIBUTIONS
0 Files
DOWNLOADS
0
ALL TIME DOWNLOADS
0
RANK
of 169,635
CONTRIBUTIONS
0 Problems
0 Solutions
SCORE
0
NUMBER OF BADGES
0
CONTRIBUTIONS
0 Posts
CONTRIBUTIONS
0 Public Channels
AVERAGE RATING
CONTRIBUTIONS
0 Highlights
AVERAGE NO. OF LIKES
Feeds
Question
我再使用强化学习工具箱编写SAC智能体进行训练时策略一直在上下限波动,没有很好的探索,而使用DDPG智能体和PPO智能体则是能够进行一些有效的探索,请问这是什么原因?
%main % 观测空间和动作空间定义 % numObs = 11; %观测空间维度 % numAct = 4;%动作空间维度 numObs1 = 7; %观测空间维度 numAct1 = 3;%动作空间维度 %BS, EB, ,CL a...
1 year ago | 1 answer | 0