I talked to my colleague who also trains some agents using the physics and his solution is to use the training settings to trim the first few samples from each episode:
He added these settings to make physics examples easier without having to muddle around with “agent pausing”.
Thanks,
Brendan