Speech Recognition Plugin - Sphinx-UE4

ShaneC · December 18, 2015, 3:37pm

I have recently made some bug fixes, and added Chinese and French languages.

Let me know your thoughts.
Thanks

n00854180t · December 18, 2015, 5:49pm

@ - Awesome work man. I like that you can add words to the dictionary - that’s going to be instrumental for making a game-specific wordset.

Not sure on the wiki issue.

Edit: Do you think you could give some pointers to where I should start in the plugin code for adding the phoneme recognition type as an option?

ShaneC · December 18, 2015, 9:39pm

Hi n00854180t,
I have read that there are some issues with phoneme recognition.

Never the less, I am adding in some ground-work into the code
This will allow a mode to be selected when initializing.
For the time being, this will be Dictionary Keyword, and Phoneme Recognition.
The modes can certainly be expanded on at a later date.
Cheers,
Shane

n00854180t · December 18, 2015, 9:49pm

You the man!

If you need someone to test phoneme recognition I’d be happy to give it a try.

As an aside, is there currently a way to pass in a SoundWave or raw sound samples rather than using the microphone? I think phoneme recognition would be especially useful for lipsyncing voice over dialogue sounds.

anonymous_user_c8558ff7 · December 20, 2015, 4:54am

Woah! Question: Let’s say I wanna give an AI an order using this; how would it be appliable for it to recognize the command: “go” I specify it via blueprints?

Ahmed_Bedier · December 20, 2015, 11:02am

so i’m intersted in this plugin, can you add added Arabic languages or it’s too early for that ?

ShaneC · December 21, 2015, 2:01am

n00854180t,
Passing in a sound wave is certainly possible. i hadn’t gone down that path yet, as I didn’t really see a useful application (and I have limited time).
You could certainly pass in a sound wave, and have it return a time-frame of when phoneme’s were spoken.

This brings me to my next point, the approximation (from my testing) is rather approximate (or all over the shop, from my testing).
You could perhaps use this information with a phoneme dictionary list, and an advanced algorithm to match the detected/matched phoneme’s, and approximate the timing of the gaps.
I am unsure how well this would work, to be perfectly honest.

Regarding the phoneme work, I have created a branch, and will check-in my work later in the week. I don’t really have the time to thoroughly test (it seems very random)
But, this will show you how it CAN work, and should provide a starting place for your development or expansion.

,
Did you look at the blueprint example?
Simply change the Word Spoken function to perform a switch statement on the incoming text, and trigger whatever you like based on the phrase you added.
Note: Simple phrases are more error prone (eg. false positives)… especially something as simple as “Go”

Andrew, there does exist an Arabic language model for Sphinx.
I went down the path of investigation of Arabic (as another person had requested) but it seems Arabic is not supported by Unreal Engine 4.
Have you tried pasting Arabic text into an FString. It simply comes up as a series of unknown characters.

I had thought of converting the Arabic dictionary file, into a Latin transliteration, but I have no idea how that could be done on a large scale.
Ideas?

n00854180t · December 21, 2015, 9:10pm

@ - awesome man, you’re a life saver.

I’ll definitely be checking out the phoneme branch and seeing where I can take it. I’d love to be able to create an auto-lipsync tool for UE4 using it, but we’ll see if it’s accurate/good enough for that

anonymous_user_c8558ff7 · December 21, 2015, 10:38pm

Hey , I did check it, I just wasn’t so sure how well it could handle multiple transliterations. I’ll see if I can manage to create a simple AI, that obeys voice commands.

ShaneC · December 21, 2015, 11:48pm

Hey ,
Hmm, multiple transliterations? I thought you wanted a particular voice phrase to trigger an in-game action?
The transliteration comment was for Andrew, regarding a request for Arabic speech

Call Init with phrases you wish to recognise.

Then have a switch on the returned phrase, to trigger different logic based on the recognised phrase.

You would probably also want other checks, so that only certain phrases perform certain actions at certain times, but that is really up to you.
Cheers

anonymous_user_ae1dd25c · January 5, 2016, 1:24pm

Can I run your program in MacBook UE4?

ShaneC · January 6, 2016, 4:56am

Hi Artigen,
At this time, the plugin does not work under Mac OS.
I would be happy to port it to Mac OS, if you were willing to pay me for my time.
Alternatively, the code is publicly available on GitHub.
All the best,
Shane

SaviorNT · January 6, 2016, 7:05am

Has this been tested yet on 4.11?

And oh my… this is exactly the plugin I have been looking for since UE4 came out \o/

Muzaheed · January 6, 2016, 8:47am

getting this error when i am trying to open it on 4.10
i tried to build from source using vs2015.
but still same

SaxonRah · January 6, 2016, 9:17am

They must have compiled with debugging and released that version and not the non debugging version. the d on the end signifys debugging.

ShaneC · January 6, 2016, 10:17am

ahh, that would be my bad. I`ll update the dll’s and let you know when I have done so.

ShaneC · January 6, 2016, 1:40pm

Okay, I have updated GitHub, and the Wiki.

Note: There’s an additional step to copy across the appropriate dll to match the version of unreal you are using.
Please let me know if you encounter any issues.

Ahmed_Bedier · January 7, 2016, 3:02pm

sorry for the delay, about converting Arabic dictionary file into a Latin transliteration i don’t think this will work, tried it before in unity but who guess it could work in unreal
Also i got agood news 4.11 - 4.12 will support Arabic so it can be easy to compelete your work without converting any thing

ShaneC · January 8, 2016, 12:15am

Oh, that’s great news. Thanks for letting me know.
I`ll let the person who e-mailed me know that Arabic is being added in 4.11-4.12.

anonymous_user_0a38404d · January 18, 2016, 8:27pm

Hi, i have a problem, when Run my project then I put “I” (for init Speech) my Microphone Input not work, Stop and Close project come back Work my microphone

Have you ever happen it ?