Question after watching Year of Voice Chapter 3

At the end of the latest Year of Voice Chapter on YT (I think it was chapter 3).

A statement was made that the next chapter would reveal Wake Word Detection.

I’m guessing that means it’s pretty close to being reality. I’d really love to get an idea as to what I’m going to need to buy when the time arrives that way I can come up with at least a ballpark estimate of what I’ll need to spend.

And I also want to say Thanks for all the hard work! I am really looking forward to being able to (almost) completely switch over from using the Google speakers.

There are things that HA still won’t be able to do on it’s own (answer questions such as “what is this or that”, “where can I find X product” etc).

A couple tangent questions:

  1. One thing I currently use my Google speakers for is to stream music – typically from the tunein app as they have my preferred local radio stations. When I set up my HA Voice satellites, will they be able to handle this task?
  2. Once I have HA Voice satellites, I’d like to mute the microphones on all my Google speakers (so they won’t listen – improve privacy). If I make a request to HA Voice that needs to be handled by Google Assistant, can HA pass that request on to one of the Google speakers (preferably a specific one).

BTW… In my ideal world, I could use an old android phone and/Android tablet as satellites. I have a bunch of them lying around. This would give the added ability to run a dashboard on the same device. A phone that sits on my desk with the time shown and a few controls and it would also allow voice control.

1 Like

Anyone have any suggestions.

Hi @Doni49,

My 2 cents:

  1. I dont know abortus TuneIn, however you are able to do that with YouTube music and Spotify. You can already do that using Assist and a automation with a sentence trigger.

  2. You can patrially do that using the Google sdk integration. You can pass sentences which will be processed by Google Home. However, I dont think it’s able to do that on a specific speaker.

Regarding Google home devices. I listened to a Home Assistant podcast about year of the voice. There, they discussed satelites and hardware. Nothing was said about Google devices. However, it could be a copyright thing. Maybe some tweaker or hacker will do it, but I won’t expect an official integration or something from HA or Nabu Casa.

Regarding Google home devices. I listened to a Home Assistant podcast about year of the voice. There, they discussed satelites and hardware. Nothing was said about Google devices. However, it could be a copyright thing. Maybe some tweaker or hacker will do it, but I won’t expect an official integration or something from HA or Nabu Casa.

The only things I’m hoping for as far as Google Home devices go are the two specific things mentioned (item #1 & #2). The main part of my question was about figuring out what type of equipment will be needed to use wake words with HA.

I would like to set the microphone on each Google device to mute (so they aren’t listening EVER) and allow HA to provide instructions to them (play music, answer a question etc). Essentially they’ll only be used in cases that HA can’t do the task alone.

Getting you to an estimate, is very, very difficult, because there are so many variables in your questions, that you’d need to answer for yourself first. :smiley:

Wake word detection is not yet ready. Paulus made a statement regarding this in the last release video (2023.09.0), that the developers try to get that ready within the year of voice, but not anytime soon. He also stated, that the devices they want to use for this are sold out everywhere (ESP S3). :slight_smile: So I wouldn’t keep my hopes up for an eta next month. :slight_smile:

That brings us to your question, what do you need. That’s hard to say, because you have so many things to decide upfront. As an alternative, I’d like to give you some details about my setup, that might give you some insight on what you want for yourself.

  • I’m using the very nice and nifty m5Stack ATOM Echo eg. on my workbench. There I only need voice assist, but nothing else like playing music or any other thing like a temp sensor.
    Easy to install and very small, costs around 10 to 15 Dollar or Euro in the northern hemisphere. It’s not bigger than a 2 euro coin.
    => Ideal device for only voice assist. Giving commands and getting a feedback. Get read an audiobook or music is a no-no!
  • I got an ESPMuse lend from a friend. That is a cool toy. You’d use these, if you want to have some nice speaker, that does real good sound (I’m no sound expert, for me, knowing "MC"s from my childhood, mp3 is “very high quality” :rofl:), this is for you. :wink:
    => Ideal device if you want mainly to listen to music with this device and want the added benefit of having Assist there, too.
  • Rapsberry Pis up from Modell 2 have an audio jack. So you can run nearly every speaker you have or can get for a small buck. Squeezelite (player on the Pi) and LMS (LogitechMediaServer, available as Add-on for HA!) to control the music.
    => Ideal, if you have lying some old speakers around, or some old Pis, that need a new playing field. I can recommend the old “PS3 Eye Camera” as microphone. Great 4-mic-array and you have the added benefit, if you have enough power on your Pi, you can even run video feeds of of it. :slight_smile:
  • ESP devices can channel music HA->Wifi->ESP->BT->speaker, but that is not fully developed and seem to be not running reliable at the moment. I don’t have this in my setup, so only can say what I read about.
  • Rhasspy satellites are a nice alternative to run on a Pi. Rhasspy already supports wake word detection and can, in version 3beta, work with HA via Assist (as opposed to events).
    => at the moment it is more a testing field, but it shows some of the concepts, that will find their way into HA eventually, I mean the developer of Rhasspy is the developer of HA voice assistant… :laughing:
  • Old phones or tablets are very good for Assist. Install the companion app and you can even run it without https in a local network (opposed to a browser on the phone, who needs https). Just great, as everything is already there: mic, mostly good speakers and a touch display.
    => Ideal to get a second or third use for tablets and phones. I’m running four tablets at the moment, works great. I even switched my HA handling to a 10’’ tablet. For configuring I still use my laptop, but for the daily stuff. :slight_smile:
  • Yeah, an old landline phone works quite well. I don’t know how these adapters are called, but you can insert your line from the phone and an ehernet cable and you are good to go. If you take a look at the HA blog, there is even a guide to revieve your old walkie-talkie to use with Assist.

And there are so many more alternatives, it would really need some examples from you, where and how you’d want to use it. :slight_smile:

But now to the downside: Google or Amazon devices are closed source applications, and therefor extremly hard to hack. They both did a pretty good job holding DIY people out. I wouldn’t bother with these devices and sell them on eBay. If you want to use Google as your source for further questions, you can already do so I heard, but I can’t tell you how that works. I can tell you, that it works very intuitive with OpenAI. You can set that and OpenAI gives you a really nice feedback. For example in the morning, you can let OpenAI generate a daily briefing for you, even with news headlines. :slight_smile:

I don’t know, what you have Google Assist do for you, but I’m quite sure it won’t take long, that HA Assist can replace it. I for one haven’t used Google Home before (neither Alexa), but HA Assist can do everything I need. With the companion app even from outside (aka sitting with friends and asking HA what the weather forecast for tomorrow is works).

Hope this helps a little further! :slight_smile:

1 Like

Thanks for the very detailed response. At the end of the most recent chapter in the Voice series, Paulus said that Wake Word Detection would the topic of the next chapter. I took that as meaning Wake Word detection would be released next and they were close.

So I was hoping to get an idea of what type of equipment would be required to make that happen.

I included a couple secondary questions that I considered less important that the issue of what equipment would be required for Wake word.

As the next chapter of “Year of the voice”. :wink: But that is not the “normal” release cycle. The wake word will be the next step, but nobody knows when. :wink: :smiley:

As I said, the wake word will come, as you could see in Rhasspy3, and I’m quite sure, “Mike the voice” will work hard on this, but no ETA here. :slight_smile:

FYI… I got a notification on YouTube that Chapter 4 Year of Voice will be live on Thursday, October 12 at 2PM central time.

I plan to be watching.

2 Likes

There are a few hints that they are working on wake word detection:

But there is no stable version which you can use already, you have to wait until it’s announced on 12.10. and then wait for the official release.

Thanks. I knew that.

When I asked my original question, I was only trying to get an idea of what kind of hardware would be needed for when it is available. That way I could try to save some money to get the hardware when the time did come.

My post yesterday was to let people reading this message that may not have gotten the YT notification that the next chapter would be on YouTube next Thursday.