2.07 works with me!
I have first used EN, downloading went ok, after that I selected NL. Also worked correct!
Then I did an extra language DE, which downloaded fine as well
2.07 works with me!
I have first used EN, downloading went ok, after that I selected NL. Also worked correct!
Then I did an extra language DE, which downloaded fine as well
Sorry about that! I stopped storing the compiled web interface files in git, but I forgot to add the appropriate build step to create-venv.sh
. Itâs update in the git repo now, so please try again.
Based on the recommended settings, it looks like you should check âApply Frontendâ for computer.umdl
and set the sensitivity to 0.6. You can download any of the other models too, and see how they do. The jarvis and neoya models probably wonât work, since the seem to require setting multiple sensitivity values (I need to figure out how to do this inside of Rhasspy).
Shouldnât it be to 0.4, since I need the wake word using snowboy being detected a bit easier?
Edit:
It is working better with 0.6 now. Thanks.
@S_n_Nguy_n, Iâm happy to announce that the latest version of Rhasspy is my first attempt at supporting Vietnamese! Pick the vi
profile, save, and restart Rhasspy to get started. After it downloads, make sure to re-train before trying it out.
Thanks again for the WAV files and transcriptions. I used them to create the sample sentences and test the acoustic model. Hopefully I got it working well enough to do simple voice commands. Let me know if it works; I could try the same approach for some a bunch of other languages.
If there is something that I can do to help, I would like to see Portuguese (BR) implemented.
I will try and respond to you to further improve, thank you
This is stop_words for vietnam profile, can you add it?
Got it. Iâll add this for the next update. Thanks!
Iâve got a start on a Portuguese profile, but I donât have any WAV files for testing. If you would be willing to record yourself saying a few commands (âturn on the living room lampâ, etc.), that would be great. A few WAV files and the text that Rhasspy should detect are all I need for testing.
Thatâs great! I will record and send to you in a few days. I am out of the city right now. I presume 16bit 44.1kHz mono should do the trick.
That would be perfect. My initial tests with phrases âspokenâ by Google Translate have not been very promising. If testing with real audio from you doesnât work either, I have a backup method. Someone awesome put together a list of free Portuguese speech data sets, so I can just train my own model.
Iâm happy to report that the latest version of Rhasspy adds support for Portuguese! Thanks to @FunkyBoT for supplying test data, and a big thanks to the people who created the Montreal Forced Aligner for supplying pre-trained acoustic models in many languages where itâs almost impossible to get good quality, free speech training data.
If anyone can supply some example WAV files with transcriptions in any of these languages, I will do my best to add it to Rhasspy.
Sorry, had the wrong download link in the script. Should be fixed now. You may need to use the red Re-Download button on the Settings page, or just delete the profiles/pt/download directory.
Iâd recommend deleting the download directory in your pt profile on your Hass.IO share. It pulled down the wrong file (portuguese_g2p.zip
), and wonât re-download it unless itâs gone.
I had pressed the âred download buttonâ. Will try deleting the files in the pt profile.
Iâve double-checked that everything got pushed up to Dockerhub correctly. The problem is this file, which should be about 5MB. I had accidentally used the URL for the Vietnamese file, which is about 3MB.
Training in PT are taking too long. âRâ spinning foreverâŠ
There is a small misspelling in âlight_state = (acenda | apague) {state}â