About Learning Agents

Hello,

An Episode is the series of state/action pairs from start to finish for one agent in the game environment.

Steps is the number of state/action pairs the agent will encounter in a training episode. The step rate is typically controlled by calling “RunTraining” during your manager’s Tick event. If you are running your manager’s Actor and game at 60 FPS, that works out to 60 steps per second. We generally suggest ticking slower, perhaps 10 steps per second. In Learning Agents 0.1 for UE 5.3, the episode automatically terminates when this step count is reached, so set it higher if your episodes need to run longer.
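To make the tick-rate trade-off concrete, here is a rough back-of-the-envelope sketch in plain Python (not the Learning Agents API). The function name and the 300-step limit are made up for illustration; the only thing taken from above is that one step is recorded per tick and the episode ends once the step limit is reached.

```python
def episode_duration_seconds(max_steps: int, steps_per_second: float) -> float:
    """Seconds of game time before an episode reaches its step limit.

    Assumes one step is recorded per manager tick (hypothetical helper,
    not part of Learning Agents).
    """
    if steps_per_second <= 0:
        raise ValueError("steps_per_second must be positive")
    return max_steps / steps_per_second


# Hypothetical step limit of 300 steps per episode.
at_60_fps = episode_duration_seconds(300, 60)  # ticking every frame at 60 FPS
at_10_hz = episode_duration_seconds(300, 10)   # ticking at the suggested slower rate

print(f"60 steps/s: episode ends after {at_60_fps} s")
print(f"10 steps/s: episode ends after {at_10_hz} s")
```

With the same step limit, ticking at 10 steps per second gives the agent six times as much game time per episode as ticking at 60, which is one reason the limit may need raising if you tick every frame.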

Iterations is how many times the training process repeats. The full training process is 1) collect episodes of data to fill the replay buffer, and 2) sync to the Python process and run training on randomly sampled batches. An iteration is “filled” when either the Max Recorded Episodes Per Iteration or the Max Recorded Steps Per Iteration limit is met.
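The two-phase loop can be sketched roughly as below. This is a toy model in plain Python, not the actual Learning Agents implementation: all function names are hypothetical, the buffer just stores (episode, step) tuples, and the sketch assumes the limits are checked between episodes, so the buffer can end up slightly over the step limit.

```python
import random


def iteration_is_full(recorded_episodes, recorded_steps, max_episodes, max_steps):
    """An iteration is filled when EITHER limit is met (hypothetical helper)."""
    return recorded_episodes >= max_episodes or recorded_steps >= max_steps


def collect_iteration(episode_lengths, max_episodes, max_steps):
    """Phase 1: record episodes into a buffer until the iteration is filled."""
    buffer = []
    episodes = 0
    for length in episode_lengths:
        if iteration_is_full(episodes, len(buffer), max_episodes, max_steps):
            break
        # Stand-in for real state/action data: one tuple per step.
        buffer.extend((episodes, step) for step in range(length))
        episodes += 1
    return buffer, episodes


def sample_batches(buffer, batch_size, num_batches, rng):
    """Phase 2: draw randomly sampled batches from the filled buffer."""
    size = min(batch_size, len(buffer))
    return [rng.sample(buffer, size) for _ in range(num_batches)]


# Four 50-step episodes are available, but the step limit of 100 fills the
# iteration after two of them, well before the episode limit of 10.
buffer, episodes = collect_iteration([50, 50, 50, 50], max_episodes=10, max_steps=100)
batches = sample_batches(buffer, batch_size=8, num_batches=4, rng=random.Random(0))
print(episodes, len(buffer), len(batches))
```

Whichever limit is hit first ends data collection for that iteration, so short episodes tend to hit the episode limit and long episodes tend to hit the step limit.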

  1. SAC and Q-Learning are NOT currently supported, but they are something we would like to get working before Learning Agents 1.0.

Thanks,
Brendan