Misalignment between frames generated from two cameras

Hi,

I am rendering a scene with two camera setup (RGB and Depth) as implemented by GitHub - amrgomaaelhady/SynthoGestures: This repository has the SynthoGestures Framework. using UE’s Sequencer.

The renderer is expected to generate RGB and Depth frames corresponding to the scene, but I could see that there is spatial alignment between RGB and Depth output when I render a 30fps video.
30fps

However, as I go for higher frames, the RGB and Depth videos look synchronized when I overlay the images from by the two cameras. What could be the possible issue here? And how can I achieve perfect alignment at low frame rate? Thank you.