Reinforcement learning: "NextObs" vs. "LoggedState" in step function

1 view (last 30 days)
Hi,
I could not find out what the difference between "NextObs" and "LoggedSignals" is in the step function. In all scripts both are passed on from the step function.
[NextObs,Reward,IsDone,LoggedSignals] = myStepFunction(Action,LoggedSignals)
"LoggedSignals" is obviously used for the next step, but what is "NextObs" used for?
Thanks!

Answers (1)

Emmanouil Tzorakoleftherakis
Actually, NextObs is the important thing here. It represents the value of your states after you apply current action and integrating one step.
LoggedSignals is where you can log information to view later - can be left empty too.
  4 Comments
lfyx
lfyx on 1 Nov 2021
Hello, may I ask that, can the "sim" function output the LoggedSignals to the work space? Many information about the simulation action or observarion are saved in the LoggedSignals. However, the output of "sim" is the experince structure.
Maha Mosalam
Maha Mosalam on 22 Nov 2021
Hi, what about the xact role of IsDone flag it it shuld be true or false or what?

Sign in to comment.

Products


Release

R2019b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!