Try manually triggering a “warm-up” of the sound asset before it’s actually needed in the sequence. You can do this using a hidden cue that starts off-screen or is muted, just to force the engine to load it into memory early. It’s kind of a brute-force method, but surprisingly effective when timing matters.
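To make the warm-up idea concrete, here is a rough, engine-agnostic sketch in Python. `SoundBank`, `play`, and `warm_up` are made-up names for illustration, not any real engine's API, and the load cost is simulated with a short sleep. The only point is that the first touch of an asset pays the load, so touching it early and muted moves that cost away from the moment that matters.

```python
import time


class SoundBank:
    """Toy stand-in for an engine's audio cache (hypothetical, not a real engine API)."""

    def __init__(self, load_seconds=0.05):
        self.load_seconds = load_seconds  # simulated disk/decode cost per asset
        self._loaded = {}                 # asset name -> decoded data
        self.audible = []                 # sounds the player would actually hear

    def _load(self, name):
        # The first touch pays the load cost; later touches hit the cache.
        if name not in self._loaded:
            time.sleep(self.load_seconds)
            self._loaded[name] = f"<decoded {name}>"
        return self._loaded[name]

    def play(self, name, volume=1.0):
        """Play a sound and return how long the call took, in seconds."""
        start = time.perf_counter()
        self._load(name)
        if volume > 0.0:
            self.audible.append(name)
        return time.perf_counter() - start

    def warm_up(self, name):
        """Trigger the sound muted so it is resident before the real cue."""
        return self.play(name, volume=0.0)


bank = SoundBank()
cold = bank.play("door_slam")   # no warm-up: the first play pays the load
bank.warm_up("explosion")       # e.g. at the start of the sequence, muted
warm = bank.play("explosion")   # the timed cue: already in memory
print(f"cold start: {cold * 1000:.1f} ms, warmed start: {warm * 1000:.1f} ms")
```

In a real project the muted play would be your hidden cue, and whether it actually keeps the asset resident depends on how your engine caches and evicts audio, so treat this as a picture of the idea and not a recipe.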
Also, check whether the sound waves are set to Stream or Decompress on Load, since that setting affects how much of the audio is already in memory when the cue fires.