Back from a week off, so back to training. I developed a walk cycle a while ago which I will attempt to train next. I am not expecting it to learn this animation first time. I feel adding movement will require a level of tinkerage.
In the deep mimic paper they add reward for forward velocity. Currently I do not have any exta rewards outside of mimicking the animation. Will share progress once training is complete.