Course: Learning Agents (5.5)

Not yet. I imagine they are difficult to implement without making changes to the codebase.

We have successfully implemented a variant of PPO for our internal usage and the “bring your own algorithm” worked well for that.