How to Use Platforms Like ElevenLabs More Efficiently in Game Development

Hello everyone,
I’m currently using AI voice services (such as ElevenLabs, Artlist, etc.), and each time I need to generate WAV or MP3 files, I have to manually update these audio files one by one into the user data table of the dialog system. This process feels incredibly tedious and boring.

For example, a text of over 500 characters (just a small unit) requires generating around 40 audio files. This means I have to continuously copy and paste from the SaaS service, repeating the process over and over. Ignoring the time spent on calculations and manually adjusting the audio files, just these steps alone already take me more than an hour.

Does anyone else have a similar issue? I’m thinking about developing some tools to make this process more efficient. (Feel free to like or leave a comment to let me know.)

Or if you have other suggestions, I’m currently importing each file into Unreal manually and assigning it to the corresponding field.