Metahuman Face-capture & Audio Edit workflow

I am struggeling with the workflow for editing facecapture and audio using the Live Link Face App.

I am using the Take Recorder to capture the Live Link facial expressions and Microphone Audio. This results into an animation and audio file. I want to cut these capture into separate animations and audio files. To keep everything in sync I need to edit the audio and the animation at the same time.
How is this done ? Can I use the sequencer or do I need other tools. Motionbuilder cant export audio as for as I know.

You can place both animation and audio in sequencer and set in/out points to render. Audio will be rendered as a Wav file and would need to be combined with video in a video editor.

Thanks for the answer, but I need my output to be another (edited) fbx and wav file to use it in game. I am looking for a way to trim or edit the recorded animation (blendshape curves) and sound together at the same time.
It looks my sequencer cannot output to fbx ???