Hi, I’m trying to increase the complexity of the environment at the point where all the agents reset, which happens either every 5 minutes or when the agent reward exceeds a threshold (1, then 2, then 3, etc.).
I found a hacky way using a Branch so that it only runs once on Agent Reset Episode, but I was wondering if there’s a built-in function for this or if I missed something.
Also, what’s the best way to get the average summed reward, so I can use it as the value that decides when to increase the difficulty?
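
To make it concrete, the logic I’m after looks roughly like the sketch below. It is plain Python, not Blueprint or the Learning Agents API, and every name in it (`DifficultyTracker`, `record`, `tick`, `on_all_agents_reset`) is made up for illustration. It assumes the threshold for level N is simply N + 1, and that "average sum reward" means the mean of each agent’s summed reward since the last reset.

```python
class DifficultyTracker:
    """Hypothetical helper: raise difficulty at most once per full reset."""

    def __init__(self, time_limit_s=300.0):
        self.time_limit_s = time_limit_s  # 5 minutes
        self.level = 0                    # current difficulty level
        self.elapsed_s = 0.0              # time since the last difficulty increase
        self.reward_sums = {}             # agent id -> summed reward since last reset

    def record(self, agent_id, reward):
        # Accumulate each agent's reward as it is handed out.
        self.reward_sums[agent_id] = self.reward_sums.get(agent_id, 0.0) + reward

    def tick(self, dt_s):
        # Advance the timer; call once per frame or step.
        self.elapsed_s += dt_s

    def average_sum_reward(self):
        # Mean of the per-agent summed rewards (0.0 if nothing was recorded).
        if not self.reward_sums:
            return 0.0
        return sum(self.reward_sums.values()) / len(self.reward_sums)

    def on_all_agents_reset(self):
        # Call exactly once after every agent has reset.
        # Assumed thresholds: 1.0 at level 0, 2.0 at level 1, and so on.
        threshold = self.level + 1.0
        raised = (self.average_sum_reward() > threshold
                  or self.elapsed_s >= self.time_limit_s)
        if raised:
            self.level += 1
            self.elapsed_s = 0.0
        self.reward_sums.clear()  # start the next round from zero
        return raised
```

So the check would run in one place after the whole batch has reset, rather than inside each agent’s reset event, which is the part I’m currently faking with the Branch.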
Thank you. 5.4 has been amazing with the observation workflow.
Thanks,
James