Why is the DDPG episode rewards never change during the whole training process?

Question

Guoge Tan on 25 May 2020

0
Link

Direct link to this question

https://uk.mathworks.com/matlabcentral/answers/532933-why-is-the-ddpg-episode-rewards-never-change-during-the-whole-training-process

Commented: Shahriar on 29 Jun 2022

Accepted Answer: Emmanouil Tzorakoleftherakis

I'm training a DDPG agent using the Reinforcement Learning toolbox on MATLAB R2020a for a path planning problem. But as you can see, the DDPG episode rewards and average rewards never change during 5000 episodes. I used a simple neural networks with 20 neurons and three layers, the learning rate is set to 0.01, and the Gradient Threshold is 1. Then I try to set weights and bias for fully connected layers and change my reward function, but the result is the same.

I also saw at here that others have a similar problem. So any advice for my problem? Thank you.

1 Comment
Show -1 older commentsHide -1 older comments

Shahriar on 29 Jun 2022

@Guoge Tan could you solve this issue? I have a similar situation.

Sign in to comment.

Sign in to answer this question.

Answer 1

Emmanouil Tzorakoleftherakis on 26 May 2020

0
Link

Direct link to this answer

https://uk.mathworks.com/matlabcentral/answers/532933-why-is-the-ddpg-episode-rewards-never-change-during-the-whole-training-process#answer_439593

Looks like the scale between Q0 and episode reward is very different. Try unchecking "Show Episode Q0" to see of the episode reward changes. I would then simplify the critic network to make sure it outputs values in a similar scale as the episode reward.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Why is the DDPG episode rewards never change during the whole training process?

1 Comment
Show -1 older commentsHide -1 older comments

Accepted Answer

0 Comments
Show -2 older commentsHide -2 older comments

More Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

Why is the DDPG episode rewards never change during the whole training process?

1 Comment Show -1 older commentsHide -1 older comments

Accepted Answer

0 Comments Show -2 older commentsHide -2 older comments

More Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

1 Comment
Show -1 older commentsHide -1 older comments

0 Comments
Show -2 older commentsHide -2 older comments