Help us make voice better in under a minute

system · October 24, 2024, 3:02pm

Give us a minute of your time and lend your voice to improve our community-driven, privacy-focused voice assistant, and together we’ll break big tech’s stranglehold on home voice control.

When you use Home Assistant Assist we’ll never take your data unwillingly to improve its functionality; that’s why we’re asking for your help today (more on that below).

Specifically, we’re trying to improve our wake word engine, which “wakes” the device to listen for more commands. Our open source microWakeWord engine is an incredible, lightweight bit of code but requires training with real voices to improve. Everyone’s voice is unique, and we need recordings of people of all genders, ages, and accents from around the world saying this wake word. So get the whole home involved, along with friends and family.

Help us today

Your voice, your choice

Privacy is at the core of everything we do, and that’s what’s driving us to build an open, local, and private voice assistant. Years ago a voice assistant arms race began between a couple of big tech companies, where they sold very cheap voice hardware, all because they wanted your data. They used that voice data to rapidly improve their voice capabilities (and who knows what else) at the expense of your privacy.

We will never use those tactics to improve Home Assistant. We do something novel and bewildering to the smart home industry - we explicitly ask for your help. For instance, Home Assistant has opt-in statistics; these really help us focus on what people are using and put resources into making the biggest impact. When you opt-in it really helps, but we totally understand if you don’t - it’s your home and your data. We’re doing the same with voice, explicitly asking for help, and would love your help to improve its wake word capability.

What big tech might consider a weakness is actually our greatest strength - privacy empowers our users, allowing them to not worry about being tracked or classified by an algorithm.

microWakeWord goes global

microWakeWord is a very lightweight wake word engine that can run on ESP32-class devices. This allows the voice assistant to listen for a specific phrase, for instance “Okay Nabu”, and ignore all other noise and speech until it hears that. Running it on-device speeds everything up significantly, cutting out the lag from streaming audio continuously to a more powerful device running wake word software. The downside to running it on-device is the amount of training required to get good results.

A few months ago, we promoted an early version of this Wake Word Collective tool in the Open Home Foundation newsletter (subscribe if you haven’t already). From that one email alone, we received over 5,800 samples covering 30 different languages. While microWakeWord used to only excel at picking up the English pronunciation of “Okay Nabu”, it’s now many times more effective at picking up different accents. Our testing shows our model trained on this data falsely rejects 5% of samples, while previous models rejected 18%, which shows your samples are making a difference!

We still need more different types of voices from different parts of the world. If you or your family use Assist and find it still struggling to wake up to hear your commands, this is the perfect opportunity to lend your voice to the project.

How it works

Before you start recording you will fill in what language you’re most comfortable speaking, allowing us to train language-specific wake word models that are better at detecting subtle differences in pronunciation. You must also read and agree to the Wake Word Collective terms. This explains, among other things, that these recordings will be made available publicly under a Creative Commons CC0 public domain dedication. We operate our research openly for the benefit of all, and that is why these will be available to the public.

Next, it explains that you only need you to say two words, “Okay, Nabu”, a couple of times. You’ll need to set down your phone, tablet, or laptop, and walk around the room, saying the wake word whenever the circle on the screen turns green. Taking samples from all around the room better reflects the real-world use of a voice assistant. Don’t worry about background noise; it’s useful for training.

I said this would only take a minute, but you’ve probably been reading this for longer, so go on and get recording.

This is a companion discussion topic for the original entry at https://www.home-assistant.io/blog/2024/10/24/wake-word-collective

baudneo · October 24, 2024, 3:41pm

Really appreciate you guys asking and being transparent.

Lakini · October 24, 2024, 4:21pm

Does this specifically improve the detection of “Okay Nabu”, or does the processing of the collected recordings allow for an “abstraction” to (potentially) improve the detection of self-chosen wakewords as well? (“This is how an “a”-sound changes when people are not talking directly to a microphone”, “This is how vowels get affected by background noise”, …)

And kind of related: is there a blogpost or a video somewhere to better understand the decision criteria that resulted in “Okay Nabu”?

dahamsta · October 24, 2024, 4:32pm

I haven’t really been keeping up with the voice assistant, but I thought the wake word was going to be configurable. I was hoping to be able to say “ok [house]”. Is that not the case?

netmikey · October 24, 2024, 4:40pm

Awesome, I’m always glad to help.

Quick feedback though: please let us listen to the recordings before submitting them. (Just one example: Sometimes my webcam mic has issues and I’d like to make sure not to transmit white noise or silence instead of what you guys were asking for.)

Nick4 · October 24, 2024, 7:35pm

Glad to help: I just donated some free time and my voice!

Mahko_Mahko · October 24, 2024, 7:38pm

Thanks for asking.

A small and easy thing to give back.

Hedda · October 24, 2024, 9:27pm

Could there be an option to contribute anonymously? I like to contribute, but …

They say AI can now clone your voice with just 3-seconds of recording.

Hedda · October 24, 2024, 9:34pm

FYI, indirectly asked the question recently when requested “Hey Assist” (same for “Home Assistant”) as additional default wake words included by default, so far only got an incomplete answer on that from Nabu Casa’s Mike Hansen here → [REQUEST] Official 'Okay Assist' and 'Hey Assist' included as wake words by default for ESPHome and Home Assistant? · Issue #11 · esphome/micro-wake-word-models · GitHub

github.com/esphome/micro-wake-word-models

[REQUEST] Official 'Okay Assist' and 'Hey Assist' included as wake words by default for ESPHome and Home Assistant?

opened 09:03AM - 17 Oct 24 UTC

Hedda

Related to the request posted here: - https://github.com/esphome/micro-wake-w…ord-models/issues/8 Please also consider including a default wake word here that includes the word 'Assist' in a combination phrase as default or optional standard wakeword pharase, or rather and "**_Okay Assist_**" and "**_Hey Assist_**" phrases. Are there any good reasons now why not included as standard wakeword phrases by default? - _**Okay Assist.**_ and - _**Hey Assist.**_ As in - "_**Okay Assist, turn on the dining room lights**_", and - "_**Hey Assist, turn off the kitchen lights**_", Here with "Assist" I am referring to Home Assistant's Assist for default conversation agent standard Assist pipeline in Home Assistant: * https://www.home-assistant.io/voice_control/ The integration is also called Assist Satellite * https://www.home-assistant.io/integrations/assist_satellite/ and so is the Assist satellite entity * https://developers.home-assistant.io/blog/2024/10/01/assist-satellite-entity/ * https://developers.home-assistant.io/docs/core/entity/assist-satellite/ * https://github.com/home-assistant/architecture/discussions/1136 * https://github.com/home-assistant/core/pulls?q=is%3A+assist_satellite%22 After all, Home Assistant's official voice assistant is called "Assist", and there are even official logos and branding for that name. ![image](https://github.com/user-attachments/assets/4d4b4b64-8382-4142-97c9-850cd2274b3f) My reasons for the request is partially for the same as in https://github.com/esphome/micro-wake-word-models/issues/8 which are that 'Nabu' / 'Nabucasa' does not have same brand recognition. Nabu as in Nabu Casa is also only the official partner to Open Home Foundation and the ESPHome and Home Assistant projects, so it does not make much sense of it being a default. Would be great with option to set simply 'Home Assistant' as default wake word as a way to show off and market for brand recognition. Even on Nabu Casa's website it says that "Assist" is the official name that the voice assistant, so makes no sence "Nabu" being default? * https://www.nabucasa.com/config/assist/

PS: Also see this other related request → Official 'Home Assistant' wake word · Issue #8 · esphome/micro-wake-word-models · GitHub

Nick4 · October 24, 2024, 10:07pm

How would ‘they’ (abusers) link that voice to the real identity behind hedda?
There are so many voices that can be used from YT vids that are more interesting then these snippets from a virtual person with a nickname.

Hedda · October 25, 2024, 4:36am

Maybe not, still, I guess it could not hurt if the data was anonymized before it was published as public domain.

sparkydave · October 25, 2024, 4:39am

Don’t forget, you are only saying 2 specific (and short) words. I don’t think that’s enough to clone you voice.

sparkydave · October 25, 2024, 4:40am

Hopefully this will improve it’s ability to understand us Aussies.

tobol · October 25, 2024, 5:53am

Sorry, but this wake word isn’t gonna succeed. Nabu isn’t brand that people want to mention dozens of times every day. I won’t put my 2c into making it one. I want some generic term like “computer”, “house”. Alternatively “Alfred” or “Jarvis” (so I can feel like a batman or iron man). I don’t want to explain to my guests what “nabu” means.

sparkydave · October 25, 2024, 6:37am

All of which is already possible.

Nick4 · October 25, 2024, 6:42am

Since you’re seem to be using HA for some time, you might know that you get a lot of freedom to set up your system the way you want.
Helping with this request will not imply that you have to use that wakeword in the future and I guess they are asking to use the same words since that’s probably the best way to train this.

Lakini · October 25, 2024, 8:33am

It is, but as long as it’s not clear if this is an effort to specifically improve this one wake word, or if this helps with setting up any kind of wake word as well, then i see it as valuable feedback when people see reasons why “okay nabu” might from a “usability” perspective not the best choice.

Hedda · October 25, 2024, 10:23am

@synesthesiam Can I suggest that you take this opportunity to also ask the same people to also say "Home Assistant", “Okey Assist”, and “Okey Jarvis” to help train such models as well so they could be included as alternative models for optional wake words than any user can chance to without too much effort?

GitHub - kahrendt/microWakeWord: A TensorFlow based wake word detection training framework using synthetic sample generation suitable for certain microcontrollers.

By popular demand, it would be awesome if you could include these too as optional standards by default:

"Home Assistant" (4 syllables)
“Okey Assist” (4 syllables)
“Okey Jarvis” (4 syllables)

As requested in repo for community Home Assistant Wake Word Collection (for openWakeWord):

Wanted Wake-Words · fwartner/home-assistant-wakewords-collection · Discussion #46 · GitHub

PS: Don’t think it is a good idea to include “Alexa” or “Siri” due to possible copyright infringement concerns.

donburch888 · October 25, 2024, 11:43am

Do you think “Alexa” and “Siri” were in response to millions of people already begging to say these names ? The brand recognition came from the names being used.

A name/word that will not be used in ordinary conversation is desirable. While talking to “computer” is popular (especially for Trekers), “Computer” or “House” would create too many false positives in many homes (especially while TV/radio news is in the background). In fact existing brand recognition would be a problem as it increases the chance of false positives.

I do agree with Hedda that Assist should provide a choice of several well-developed wake words, though I suggest a bit more discussion on which phrases would be best.

pove · October 25, 2024, 2:10pm

I tried several options in my m5 Atom Echo. Sadly, the only wake word my family remember (specially kids) is “Alexa”.

They use it in other kids houses and in my fathers in law’s.