Best way to capture audio and play it later?

Hey all! The project I am currently working on needs to play back audio that was recorded earlier. The game runs on a dedicated server, and any client should be able to point somewhere and record speech that other clients can play later on. Where would I start with this? Creating and joining a game doesn't use the subsystem. Can I still use the subsystem's voice functions if they are only needed to get the voice data to the server via RPC, where it is saved?

I guess the audio needs to be captured as raw data somehow and then converted back and forth? I was thinking that clients could spawn audio components that hold the sound they record. So the main problem is getting the audio into the audio component somehow. Could anybody help me get started? Thanks.
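
To make the "converted back and forth" part concrete, here is roughly what I have in mind: a minimal sketch in plain C++ with no engine code. It assumes the capture step hands me float samples in the range [-1, 1] and that 16-bit little-endian PCM is an acceptable raw format to send to the server. `EncodePcm16` and `DecodePcm16` are names I made up for illustration, not existing API.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: pack float samples in [-1, 1] into
// little-endian 16-bit PCM bytes (two bytes per sample).
std::vector<std::uint8_t> EncodePcm16(const std::vector<float>& samples) {
    std::vector<std::uint8_t> bytes;
    bytes.reserve(samples.size() * 2);
    for (float s : samples) {
        // Clamp so out-of-range input cannot overflow the 16-bit range.
        const float clamped = std::max(-1.0f, std::min(1.0f, s));
        const auto value = static_cast<std::int16_t>(std::lround(clamped * 32767.0f));
        const auto raw = static_cast<std::uint16_t>(value);
        bytes.push_back(static_cast<std::uint8_t>(raw & 0xFF));  // low byte first
        bytes.push_back(static_cast<std::uint8_t>(raw >> 8));    // then high byte
    }
    return bytes;
}

// Hypothetical helper: unpack little-endian 16-bit PCM bytes back
// into float samples. A trailing odd byte is ignored.
std::vector<float> DecodePcm16(const std::vector<std::uint8_t>& bytes) {
    std::vector<float> samples;
    samples.reserve(bytes.size() / 2);
    for (std::size_t i = 0; i + 1 < bytes.size(); i += 2) {
        const auto raw = static_cast<std::uint16_t>(bytes[i] | (bytes[i + 1] << 8));
        const auto value = static_cast<std::int16_t>(raw);
        samples.push_back(static_cast<float>(value) / 32767.0f);
    }
    return samples;
}
```

The byte vector is what I would expect to send to the server and store there. How to get the decoded samples into an audio component is the part I am still missing.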