I haven’t tried it yet, but I think you could run the normal AI with its behavior tree and use what it does as demonstrations to train the learning agents with Imitation Training. Then, after a bunch of iterations, you can switch them over to Reinforcement Training. They should pick up and keep the characteristics of the behavior tree, so long as you have enough observations and the rewards are set up to reward the right behaviors.
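To make the idea concrete, here is a rough toy sketch in plain Python. It is not tied to any engine or ML library, and every name in it (`behavior_tree_policy`, `imitation_phase`, `reinforcement_phase`, the 1D "walk to the goal" world, the reward function) is made up for illustration. The "behavior tree" is just a scripted rule, imitation is a simple count of what that rule does for each observation, and the reinforcement step is a one-step, bandit-style value update with epsilon-greedy exploration, which is about the simplest stand-in for real reinforcement training.

```python
import random

ACTIONS = (-1, 0, 1)  # move left, stay, move right

def observe(pos, goal):
    """Observation: which side the goal is on (-1 left, 0 here, 1 right)."""
    return (goal > pos) - (goal < pos)

def behavior_tree_policy(obs):
    """Hypothetical stand-in for the scripted AI: always step toward the goal."""
    return obs

def reward(pos, goal, action):
    """+1 for closing the distance (or holding the goal), -1 otherwise."""
    old, new = abs(goal - pos), abs(goal - (pos + action))
    return 1.0 if new < old or (old == 0 and new == 0) else -1.0

def imitation_phase(rng, episodes=200, steps=10):
    """Watch the scripted AI and record how often it picks each action."""
    prefs = {o: {a: 0.0 for a in ACTIONS} for o in (-1, 0, 1)}
    for _ in range(episodes):
        pos, goal = rng.randrange(10), rng.randrange(10)
        for _ in range(steps):
            obs = observe(pos, goal)
            act = behavior_tree_policy(obs)
            prefs[obs][act] += 1.0
            pos += act
    # Normalize counts to frequencies so they can seed the value table.
    for row in prefs.values():
        total = sum(row.values()) or 1.0
        for a in row:
            row[a] /= total
    return prefs

def reinforcement_phase(prefs, rng, episodes=200, steps=10, lr=0.1, eps=0.2):
    """Epsilon-greedy one-step value updates, starting from the imitated table."""
    q = {o: dict(row) for o, row in prefs.items()}
    for _ in range(episodes):
        pos, goal = rng.randrange(10), rng.randrange(10)
        for _ in range(steps):
            obs = observe(pos, goal)
            if rng.random() < eps:
                act = rng.choice(ACTIONS)  # explore
            else:
                act = max(q[obs], key=q[obs].get)  # exploit
            q[obs][act] += lr * (reward(pos, goal, act) - q[obs][act])
            pos += act
    return q

def greedy(table):
    """Best action per observation according to a preference/value table."""
    return {o: max(row, key=row.get) for o, row in table.items()}

rng = random.Random(0)
cloned = imitation_phase(rng)             # phase 1: copy the scripted AI
tuned = reinforcement_phase(cloned, rng)  # phase 2: keep training on rewards
expert = {o: behavior_tree_policy(o) for o in (-1, 0, 1)}
print(greedy(cloned) == expert, greedy(tuned) == expert)
```

In this toy the reward agrees with what the scripted rule already does, so the reinforcement phase keeps the cloned behavior instead of drifting away from it. If the reward favored something else, I would expect the agent to gradually leave the behavior tree's style behind, which is why the reward setup matters so much.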