I ran into a similar problem, same key mapping different actions in different contexts. I don’t like using a delay node either because it feels unreliable; the delay needed may vary across machines.
(I thought the entire idea of the Enhanced Input was to handle such situations.)