Added more information to generated audio, plus a way to save generated Sound Waves, so that users can generate audio that is packaged into the project as USoundWaves.
This gives me the opportunity to create an Editor Tool that generates and saves audio without needing a blueprint or running a simulation. It will help avoid runtime costs, since the audio can be generated before packaging, hehehehe. I will start working on it soon.
These changes were made in the ‘Convert … to Sound Wave’ functions.
Tasks that use Sound Wave generation already include these changes, but will continue to generate transient Sound Waves.
To generate Sound Waves and save them in the project’s content, use the ‘… to Audio Data’ tasks and then generate a Sound Wave by passing the Audio Data as a parameter to the modified functions.
When I use the text to speech function, I only get the second part of the sentence as the result. E.g. if I say “lorem ipsum one two three four five”, the string I get is “three four five”.
The same problem occurs if I use the button to speak (pressing the button calls the text to speech function).
I decided to work around the truncated-sentence problem by recording the audio to a wav file with the audio component and then using the wav file to text function. Unfortunately, in this case the wav file to text function does not work at all, i.e. I do not get any result.
The logic is as follows:
KeyPressed: Audio Capture Enable → Start Recording Output
KeyReleased: Audio Capture Disable → Finish Recording Output (export type wav file) → delay (1sec) → Wav file to text
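The fixed 1-second delay in the flow above assumes the export has finished by then. As a rough sketch of an alternative, one could wait until the file size stops changing before passing the file to recognition. This is plain Python for illustration only, not part of AzSpeech or Unreal; `wait_until_stable` and its parameters are hypothetical names.

```python
import os
import time

def wait_until_stable(path, timeout=5.0, interval=0.1, checks=3):
    """Return True once `path` exists, is non-empty, and its size has been
    unchanged for `checks` consecutive comparisons; False on timeout."""
    deadline = time.monotonic() + timeout
    last_size = -1
    stable = 0
    while time.monotonic() < deadline:
        try:
            size = os.path.getsize(path)
        except OSError:
            size = -1  # file does not exist yet (or is not readable)
        if size > 0 and size == last_size:
            stable += 1
            if stable >= checks:
                return True
        else:
            stable = 0  # size changed or file missing: start counting again
        last_size = size
        time.sleep(interval)
    return False
```

In Blueprint terms this would correspond to replacing the Delay node with a short retriggerable check on the exported file; how best to do that inside the engine is left open here.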
To make sure the wav file was saved correctly, I swapped the wav file to text function for the one that converts a wav file to USoundWave and plugged its output into Play Sound 2D; the wav file played back correctly. Unfortunately, the wav file to text function still does not work for me at all.
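Correct playback shows that the file is a valid WAV, but not that its format is one the recognizer accepts. The Azure Speech SDK documents 16-bit mono PCM at 8 or 16 kHz as its default input format, so inspecting the header is a cheap extra check. Below is a minimal sketch using Python's standard `wave` module. The helper names are made up for illustration, and whether AzSpeech converts other formats internally is an open assumption, not something this thread confirms.

```python
import wave

def describe_wav(path):
    """Read basic format facts from the header of a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        frames = wav.getnframes()
        return {
            "channels": wav.getnchannels(),
            "bits_per_sample": wav.getsampwidth() * 8,
            "sample_rate": rate,
            "duration_s": frames / rate if rate else 0.0,
        }

def matches_speech_default(info):
    """True if the file is 16-bit mono at 8 or 16 kHz.

    Assumption: this mirrors the Speech SDK's documented default input
    format; other formats may still work if they are converted first.
    """
    return (
        info["channels"] == 1
        and info["bits_per_sample"] == 16
        and info["sample_rate"] in (8000, 16000)
    )

# Example (the file name is a placeholder):
#   info = describe_wav("recording.wav")
#   print(info, matches_speech_default(info))
```

If the recording turns out to be, say, stereo at 48 kHz, that would be one candidate explanation worth ruling out before looking at network or SDK settings.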
Logs:
UE4.27
AzSpeech v1.4.3
PIE View:
Task: WavFileToText (155787)
ActivationTime: 6milliseconds
ActiveTime (102 second) -> Never stops
Current recognised string:
I couldn’t reproduce the issue using Text to Speech
Maybe it’s not a problem with the plugin but with the internet connection or some other setting? I’ve also noticed that the built-in editor function “AzSpeech audio generator” sometimes, though not always, cuts off the first words of my text (both in Polish and English). For example, I fed your text to the built-in AzSpeech audio generator:
"One of the problems is possibly because of the unsupported language, could you test using another one like en-US instead of en-PL?
I will continue to investigate the reason for the first issue mentioned. But could you send me the SDK logs? : )"
In the wav file I received, I can only hear “DK logs”.
If I use the generate audio button again, I get the whole speech.
UPDATE:
OK, I think I found the cause; it no longer cuts off the first part of my sentence. I disabled the microphone input device (Live Gamer Portable 2 Plus) in the Windows settings and left only Vive Pro Multimedia Audio.
But when I try to use .wav to Text or Speech to Text with en-PL as the language or candidate language, the tasks don’t recognize anything.
I use pl-PL, not en-PL.
Searching the supported languages page, I didn’t find en-PL.
As above: I use pl-PL, and that is on the supported list.
BTW.
How can I make speech to text run continuously without having to use any buttons? I’ve done something like this, but when I exit PIE, my editor crashes.