About Learning Agents

Your definitions for the episode, steps, and iteration look good.

To be clear, an iteration is triggered to train when either the max_episodes or max_steps thresholds are crossed, whichever comes first. Most likely you would be crossing the max_steps if using the default settings.

The log is outputted every training iteration, not every 78. The averages are computed by using the mean(). See {Your-Unreal-Install-Dir}\Engine\Plugins\Experimental\LearningAgents\Content\Python\train_common.py, line 323.

I think this is explained above. You define the settings for the steps/episode threshold in the training settings. I believe the default is 1000 episodes or 10000 steps. Whichever is crossed first will trigger a training iteration.

No problem, happy to take a look at any relevant paper that is using Learning Agents (assuming you have an English translation available).

Good luck!

1 Like