We once tried to use IK to solve this problem, but the result was not very good. The main problem is that the 3D skeleton position is not accurate and the length of the bones between frames may change, making it difficult to map them to a skeletal mesh for IK.
We also have made a solution using MediaPipe, usage video below,