Rhasspy offline voice assistant toolkit

2.07 works with me!

I have first used EN, downloading went ok, after that I selected NL. Also worked correct!
Then I did an extra language DE, which downloaded fine as well :slight_smile:

1 Like

Sorry about that! I stopped storing the compiled web interface files in git, but I forgot to add the appropriate build step to create-venv.sh. It’s update in the git repo now, so please try again.

Based on the recommended settings, it looks like you should check “Apply Frontend” for computer.umdl and set the sensitivity to 0.6. You can download any of the other models too, and see how they do. The jarvis and neoya models probably won’t work, since the seem to require setting multiple sensitivity values (I need to figure out how to do this inside of Rhasspy).

Shouldn’t it be to 0.4, since I need the wake word using snowboy being detected a bit easier?

Edit:
It is working better with 0.6 now. Thanks.

1 Like

@S_n_Nguy_n, I’m happy to announce that the latest version of Rhasspy is my first attempt at supporting Vietnamese! Pick the vi profile, save, and restart Rhasspy to get started. After it downloads, make sure to re-train before trying it out.

Thanks again for the WAV files and transcriptions. I used them to create the sample sentences and test the acoustic model. Hopefully I got it working well enough to do simple voice commands. Let me know if it works; I could try the same approach for some a bunch of other languages.

If there is something that I can do to help, I would like to see Portuguese (BR) implemented.

I will try and respond to you to further improve, thank you

This is stop_words for vietnam profile, can you add it?

Got it. I’ll add this for the next update. Thanks!

I’ve got a start on a Portuguese profile, but I don’t have any WAV files for testing. If you would be willing to record yourself saying a few commands (“turn on the living room lamp”, etc.), that would be great. A few WAV files and the text that Rhasspy should detect are all I need for testing.

That’s great! I will record and send to you in a few days. I am out of the city right now. I presume 16bit 44.1kHz mono should do the trick.

That would be perfect. My initial tests with phrases “spoken” by Google Translate have not been very promising. If testing with real audio from you doesn’t work either, I have a backup method. Someone awesome put together a list of free Portuguese speech data sets, so I can just train my own model.

I’m happy to report that the latest version of Rhasspy adds support for Portuguese! Thanks to @FunkyBoT for supplying test data, and a big thanks to the people who created the Montreal Forced Aligner for supplying pre-trained acoustic models in many languages where it’s almost impossible to get good quality, free speech training data.

If anyone can supply some example WAV files with transcriptions in any of these languages, I will do my best to add it to Rhasspy.

Got an error downloading the PT profile.

Sorry, had the wrong download link in the script. Should be fixed now. You may need to use the red Re-Download button on the Settings page, or just delete the profiles/pt/download directory.

Updated to 2.10 but still getting an error:

I’d recommend deleting the download directory in your pt profile on your Hass.IO share. It pulled down the wrong file (portuguese_g2p.zip), and won’t re-download it unless it’s gone.

I had pressed the “red download button”. Will try deleting the files in the pt profile.

I’ve double-checked that everything got pushed up to Dockerhub correctly. The problem is this file, which should be about 5MB. I had accidentally used the URL for the Vietnamese file, which is about 3MB.

Training in PT are taking too long. “R” spinning forever

There is a small misspelling in “light_state = (acenda | apague) {state}”