Reinforcement Learning Toolbox Customize DQN Agent

Question

Umut Can Akdag on 19 May 2020

0
Link

Direct link to this question

https://uk.mathworks.com/matlabcentral/answers/526599-reinforcement-learning-toolbox-customize-dqn-agent

Answered: Ryan Comeau on 19 May 2020

In reinforcement learning toolbox, is it possible to customize the way that DQN Agent takes actions? For example in a snake game if the last action taken was "North" then agent should not even bother to try the action "South" and vice versa. Should I do that with Custom Agent template from scratch like here or is there any better way to do this ? For example only customizing rlDQNAgent's takeAction method instead of implementing all other things from scratch.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Ryan Comeau on 19 May 2020

0
Link

Direct link to this answer

https://uk.mathworks.com/matlabcentral/answers/526599-reinforcement-learning-toolbox-customize-dqn-agent#answer_433475

Hello,

I can see the value in what you're proposing. In some cases it would be useful to have RL agents not fail the game immediately for simple mistakes like this. That is not however the purpose of the RL agent. Failing is an important step in the learning process. What you can do however, is modify the reward function to make some failures worse than others, which will heavily discourage the rl agents from making these mistakes.

So in the snake example, hitting a wall could be a -10 reward, hitting the extended body could be -20 and hitting itself after a North than South type could be -100. This would rapidly teach the agent to not do that. You can open up the class of environment in question and look for the function called step(). This function shapes the reward and takes the action. Alter the reward function to your liking.

Hope this helps,

RC

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Reinforcement Learning Toolbox Customize DQN Agent

0 Comments
Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

Reinforcement Learning Toolbox Customize DQN Agent

0 Comments Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments