As far as I remember, the struct observation simply concatenates the child elements, so the neural network should end up the same or very similar in your example.
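Conceptually (a plain numpy illustration of the idea, not the plugin's internal code; the example values are made up), the network would receive the same flat input vector either way:

# Illustrative only: a struct observation is effectively a concatenation of its children,
# so the policy network sees one flat vector regardless of how the children are grouped.
import numpy as np

location = np.array([0.2, -0.5, 0.0])   # hypothetical child observation
velocity = np.array([1.0, 0.0])         # hypothetical child observation

struct_obs = np.concatenate([location, velocity])   # shape (5,): same inputs as two separate observations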
Hi,
I think the course leaves out an important detail in the IL part: the trainer file name needs to be set to "train_behavior_cloning" in the trainer process settings.
Otherwise, the Python process will try to run the default PPO trainer, and the ULearningAgentsImitationTrainer will not have sent the correct JSON config.
For Google (and the next person trying to fix this), the subprocess will throw this error:
LogLearning: Display: Sending config signal…
LogLearning: Display: ImitationTrainer_8: Imitation Training Started
LogLearning: Display: ImitationTrainer_8: Sending / Receiving initial policy…
LogLearning: Display: Subprocess: Traceback (most recent call last):
LogLearning: Display: Subprocess: File "C:\Epic Games\UE_5.5\Engine\Plugins\Experimental\LearningAgents\Content\Python\train.py", line 53, in
LogLearning: Display: Subprocess: train(communicator.config, communicator)
LogLearning: Display: Subprocess: File "C:\Epic Games\UE_5.5\Engine\Plugins\Experimental\LearningAgents\Content\Python\train_ppo.py", line 30, in train
LogLearning: Display: Subprocess: ppo_config = config['PPOSettings']
LogLearning: Display: Subprocess: ~~~~~~^^^^^^^^^^^^^^^
LogLearning: Display: Subprocess: KeyError: 'PPOSettings'
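To spell out the failure mode (a minimal illustration in plain Python, not the actual plugin code; the imitation config key name is an assumption): the imitation trainer's JSON config has no "PPOSettings" entry, so the default PPO entry point fails on the lookup.

# Illustration of the failure mode above, not the real train_ppo.py.
imitation_config = {"BehaviorCloningSettings": {}}   # hypothetical key name sent by the imitation trainer

def train_ppo(config):
    ppo_config = config["PPOSettings"]   # the lookup that raises KeyError in the log above
    return ppo_config

try:
    train_ppo(imitation_config)
except KeyError as err:
    print("KeyError:", err)   # prints: KeyError: 'PPOSettings'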
Just went through the course and everything was working until I got to the imitation training. For some reason, it can't find the Python executable:
LogLearning: Error: ImitationTrainer_0: Can't find Python executable "../../../../../../Unreal/LearningToDrive/LearningToDrive/Intermediate/PipInstall/Scripts/python.exe".
BP_SportsCarManager is working fine and able to spawn the Python process. Both of them have the same settings in "Trainer Process Settings", and the trainer file name has been changed to "train_behavior_cloning" for the imitation trainer.
What could be causing this error message? And why would it happen for imitation training only?
This is using 5.5.4.
Could somebody describe the available model/s?
Thank you so much! I ran into an inexplicable error here and can't proceed.
@Deathcalibur Any notable updates to Learning Agents we should be aware of in 5.6?
The big news is we updated the Python side so that it's possible to scale out training to ~100s of game processes in parallel.
Here's the list:
- Multi-Processed Training: We can easily spawn X number of Unreal processes and have them all communicate with a single Python process to improve training throughput.
- Easy Tensorboard Installation: Simply use the new "Tensorboard" plugin - no longer any need to find the Python env and manually install it via pip.
- MLflow: Supports MLflow as an alternative to Tensorboard.
- Shared Memory on Mac
- Action Modifiers: Ability to modify the actions based on context, for example masking invalid actions (see the sketch below).
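For what action masking typically looks like in general RL code (a generic PyTorch sketch, not the Learning Agents API; the tensors are made up for illustration):

# Generic action-masking sketch: invalid actions get their logits pushed to -inf
# so the policy can never sample them.
import torch

logits = torch.tensor([1.2, 0.3, -0.7, 0.9])        # raw policy outputs for 4 discrete actions
valid = torch.tensor([True, False, True, True])      # hypothetical per-step validity flags

masked = logits.masked_fill(~valid, float("-inf"))   # invalid actions get zero probability
action = torch.distributions.Categorical(logits=masked).sample()   # never samples action 1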
I don't believe there are major breaking changes between this tutorial and 5.6. I've been pretty busy working on a project, so I haven't taken the time to post a 5.6 tutorial, which I'm sure is unfortunately confusing for some.
Thanks! Let me know if there are questions.
I'm not sure if this is helpful or not, but I follow Joseph Suarez on X. He seems to be doing a lot of research into RL and how to make it faster from the ground up. I'm curious if any of that research could be of use to the Epic team, or if you are already aware of the work. Anyhow, here is a recent article of his. Hopefully it's of some use.
Thanks for the update @Deathcalibur!
Could you provide some pointers on how to get Multi-Processed Training up and running?
Hi, I followed the tutorial, and it worked well. Then I extended it by adding custom logic for rotating at the start and stopping at the end of a spline. However, before reaching the maximum reward, the agents seem to "forget" how to rotate. I've tried many adjustments (reward tuning, observations, parameters), but nothing has helped so far.
Any help would be appreciated!
Is there a 5.6 version of the tutorial?
I can't compile the GatherAgentObservation function in 5.6.
The ObsActor seems to be the problem for me. I tried to create it from the BP nodes ("create variable", "promote to variable"), and then manually.
Anyone had it working in 5.6 ?
Thanks
I haven't updated the tutorial because I believe it should just work (could be wrong though). It's a lot of effort to update the tutorial, unfortunately, and I'm rather busy. If there are more confirmed problems, then I might try to squeeze it in.
If you post a screenshot or something, I might be able to help you figure out what's wrong.
@FlameTheory
Take a look at this. It's not really a turn-the-crank solution; it mainly gives you the command line args needed to get set up with Docker.
Scaling Out with Learning Agents Tutorial.pdf (81.8 KB)
@tomhalpin8 I follow Joseph on X. His stuff is cool but I think most of it is not super applicable to UE games.
That was very helpful thank you!
I now have this working on macOS without the need for Docker. Here are the steps I took:
- Make a new project folder
- Copy over all the files from
/Users/Shared/Epic Games/UE_5.6/Engine/Plugins/Experimental/LearningAgents/Content/Python
and /Users/Shared/Epic Games/UE_5.6/Engine/Plugins/Experimental/NNERuntimeBasicCpu/Content/Python
- Create a new virtual environment:
python3 -m venv venv
- Activate the venv:
source venv/bin/activate
- Install numpy, tensorboard, torch, torchvision, torchaudio:
pip install numpy tensorboard torch torchvision torchaudio
- Start the server with:
python3 train.py Training -l --nne-cpu-path nne_runtime_basic_cpu.py Socket 127.0.0.1:48491 output
- Make sure your UE project is set up to use sockets and external training
- Spawn as many instances as you like in new terminal windows with:
/Users/Shared/Epic\ Games/UE_5.6/Engine/Binaries/Mac/UnrealEditor -Project=/Path/To/Your/Game.uproject MapName -game -nullrhi -nosound
Yes perfect! Glad to hear it works on Mac. I do something similar on my Windows PC. I wrote a script to spawn X UE processes.
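Something roughly along these lines works as a launcher (a minimal sketch; the editor path, project path, map name, and process count are placeholders taken from the post above and would need adjusting for your setup):

# Minimal sketch of a launcher that spawns several headless game clients pointed at the
# same external trainer. All paths and the process count are assumptions for illustration.
import subprocess

EDITOR = "/Users/Shared/Epic Games/UE_5.6/Engine/Binaries/Mac/UnrealEditor"   # platform-specific
PROJECT = "/Path/To/Your/Game.uproject"
NUM_PROCESSES = 8   # how many game clients to launch

procs = [
    subprocess.Popen([EDITOR, f"-Project={PROJECT}", "MapName", "-game", "-nullrhi", "-nosound"])
    for _ in range(NUM_PROCESSES)
]

for p in procs:
    p.wait()   # keep the launcher alive until every client exits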
Thanks a lot for your answer, and my apologies for the very late reply (couldn't find the time to open UE5 during the week).
I finally found what didn't work for me when copy-pasting the Gather Agent Observation function.
When you automatically create the ObsActor variable (right click on the "set" node, or promote to variable on "get" nodes), it's created with the type "object reference", which is not compatible with all the nodes that need it.
So I had to manually change its type to "actor object reference", and then it compiled.