I’ve been reading through train_ppo.py and adding logs to it to get a grasp on what is going on with my setup. It seems to receive and create everything and then starts training. When I call StopTraining after about 8 seconds of training, during which my agent is clearly receiving commands, it looks like it pulls the experience, but none of the training seems to actually be saved:
LogLearning: Display: Training Process: Receiving Policy...
LogLearning: Display: Training Process: Receiving Critic...
LogLearning: Display: Training Process: Receiving Encoder...
LogLearning: Display: Training Process: Receiving Decoder...
LogLearning: Display: Training Process: Creating Optimizer...
LogLearning: Display: Training Process: Creating PPO Policy...
LogLearning: Display: Training Process: Opening TensorBoard...
LogLearning: Display: Training Process: Begin Training...
LogLearning: Display: Training Process: Profile| Pull Experience 2528ms
LogLearning: Display: Training Process: Done!
LogLearning: Display: Training Process: Exiting...
I never see it run any of the “push” functionality that it looks like it should. It is as if this code:
trainer.recv_experience(
    trim_episode_start,
    trim_episode_end)
never receives a response until I stop the training, even though I am calling the RunTraining function every tick. Should I be seeing the push logs while training is running?
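For reference, this is roughly the kind of timing log I mean, as a minimal sketch rather than the actual code in train_ppo.py. The `log_duration` helper and `fake_recv_experience` are hypothetical names I made up for illustration; the stand-in just sleeps and does not talk to the engine. The idea is to log once before the blocking call and once after it returns, so it is obvious whether the call is stuck waiting the whole time.

```python
import time
from contextlib import contextmanager


@contextmanager
def log_duration(label, sink=print):
    """Log when a block starts and how long it took once it returns."""
    sink(f"{label}: waiting...")
    start = time.perf_counter()
    try:
        yield
    finally:
        elapsed_ms = (time.perf_counter() - start) * 1000.0
        sink(f"{label}: returned after {elapsed_ms:.0f}ms")


def fake_recv_experience(delay_seconds):
    """Hypothetical stand-in for the blocking receive; it only sleeps."""
    time.sleep(delay_seconds)
    return "experience"


# Collect the log lines in a list so they can be inspected afterwards.
messages = []
with log_duration("Pull Experience", sink=messages.append):
    result = fake_recv_experience(0.05)
```

If the "waiting..." line shows up right after "Begin Training..." and the "returned after" line only appears once StopTraining is called, that would suggest the receive call is blocked for the entire run.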