It is not the wake word that is the issue, but the speech-to-text conversion.
It is fairly easy to take the spoken text, compare it against a select few command lines, and allow some deviation in the comparison to get a successful hit.
If you want to replace those select few lines with any possible spoken text, the number of possibilities grows exponentially with each added word.
You would quite quickly need a supercomputer to get a decent response time, and this is the difference between a local voice assistant and the ones provided by Apple/Amazon/Google.
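To make that concrete, here is a minimal Python sketch of the "few commands plus some deviation" idea; the command list and cutoff are purely illustrative, not what HA actually uses internally:

```python
# Illustrative only: fuzzy-match a spoken sentence against a small,
# fixed command list, tolerating some deviation in the wording.
import difflib

COMMANDS = [
    "turn on the lights",
    "turn off the lights",
    "start the vacuum",
]

def match_command(spoken: str, cutoff: float = 0.7) -> str | None:
    # Return the closest known command, or None if nothing is close enough.
    hits = difflib.get_close_matches(spoken.lower(), COMMANDS, n=1, cutoff=cutoff)
    return hits[0] if hits else None

print(match_command("turn of the light"))  # -> "turn off the lights"
```

With three fixed commands this is instant; with arbitrary sentences there is no short list to compare against, which is where the heavy models and hardware come in.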
Hmm, I don’t quite understand yet.
What is the local faster-whisper then, which we can install via the Wyoming Protocol?
Can’t it do exactly that?
I understand that it needs a trigger/wake word and a limited input window to avoid data overload.
But since I can already use the local Whisper speech-to-text via the HA browser, it should also be possible to use it via the M5 Echo, no?
I looked up faster-whisper and it can do roughly what you want.
I do not know if you have the hardware for it.
The GitHub page seems to suggest an NVIDIA Tesla V100 to get faster than real time, or an Intel Xeon Gold 6226R to match real time.
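For reference, basic faster-whisper usage from Python looks roughly like this; the model size, device, and file name are placeholders (a small int8-quantized model is the usual compromise when you don’t have hardware like that):

```python
# Sketch of basic faster-whisper usage; "small" and the file are placeholders.
from faster_whisper import WhisperModel

# int8 on CPU is the common fallback when no CUDA GPU is available.
model = WhisperModel("small", device="cpu", compute_type="int8")

segments, info = model.transcribe("recording.wav")
for segment in segments:
    print(f"[{segment.start:.1f}s -> {segment.end:.1f}s] {segment.text}")
```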
Ok, I see where the confusion is coming from.
There are two ways of setting up a voice assistant in HA.
One is fully local and the other uses cloud services, Nabu Casa being one of them.
Can anyone confirm this, or provide me with some helpful tips, tutorials, etc. to get it done?
It would be so amazing if I could talk to my cloud automations and chatbots via HA…
I still don’t know of any way to capture STT sentences and pass them straight on.
This tutorial by Technithusiast got me closer, but I don’t want to use Telegram. I just want to be totally hands-free: use the M5Stack wake word to start listening, extract the STT text, and pass it on to make.com via HTTP request:
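In other words, the last step I am after boils down to something like this Python sketch (the webhook URL is a placeholder; make.com custom webhooks accept plain JSON POSTs):

```python
# Minimal sketch: forward a recognized STT sentence to a make.com webhook.
import requests

WEBHOOK_URL = "https://hook.eu1.make.com/your-webhook-id"  # placeholder URL

def forward_sentence(sentence: str) -> None:
    # POST the sentence as JSON; fail loudly if make.com rejects the call.
    resp = requests.post(WEBHOOK_URL, json={"sentence": sentence}, timeout=10)
    resp.raise_for_status()

forward_sentence("what is on my calendar today")
```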