photo


Last seen: 1 year ago Active since 2024

Followers: 0   Following: 0

Statistics

MATLAB Answers

1 Question
0 Answers

RANK
293,670
of 300,553

REPUTATION
0

CONTRIBUTIONS
1 Question
0 Answers

ANSWER ACCEPTANCE
0.0%

VOTES RECEIVED
0

RANK
 of 21,024

REPUTATION
N/A

AVERAGE RATING
0.00

CONTRIBUTIONS
0 Files

DOWNLOADS
0

ALL TIME DOWNLOADS
0

RANK

of 169,635

CONTRIBUTIONS
0 Problems
0 Solutions

SCORE
0

NUMBER OF BADGES
0

CONTRIBUTIONS
0 Posts

CONTRIBUTIONS
0 Public Channels

AVERAGE RATING

CONTRIBUTIONS
0 Highlights

AVERAGE NO. OF LIKES

Feeds

View by

Question


我再使用强化学习工具箱编写SAC智能体进行训练时策略一直在上下限波动,没有很好的探索,而使用DDPG智能体和PPO智能体则是能够进行一些有效的探索,请问这是什么原因?
%main % 观测空间和动作空间定义 % numObs = 11; %观测空间维度 % numAct = 4;%动作空间维度 numObs1 = 7; %观测空间维度 numAct1 = 3;%动作空间维度 %BS, EB, ,CL a...

1 year ago | 1 answer | 0

1

answer