The simulation time set by reinforcement learning is inconsistent with the training time

4 views (last 30 days)
The simulation time set by reinforcement learning is inconsistent with the training time。

Accepted Answer

Subhajyoti
Subhajyoti on 19 Jul 2024
Hello Lin,
The inconsistency between the simulation time set by reinforcement learning and the training time is expected due to the different and independent nature of these parameters. Here's an explanation to clarify the difference:
- Simulation Time is the time required to run the simulation, where each episode represents an interaction between the Reinforcement Learning Agent and the Environment.
- Training Time is the time taken to for the model parameters to converge to the optimal parameters.
The simulation time and training time are governed by different parameters and are inherently independent of each other. The simulation time depends on the number of episodes and the duration of each episode, whereas the training time depends on the number of epochs and the computation time required to update the model parameters.
Refer to the following MathWorks Documentation to understand more about Training RL Agents and training options.
Hope the above information is helpful.

More Answers (0)

Products


Release

R2023b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!