Course: Learning Agents (5.3)

Maybe some bools linger during resets? In any case I’m omitting that completion criteria and let them still “play” even though they lost. Maybe they will learn to still do the right thing even if they can’t win.