After several weeks of on & off effort, I finally got all the right chips & the kinks out of my Voice test setup yesterday. I’m using a remote node for processing input and output, and built that node using an ESP32 WROOM, ICS43434 i2s microphone, Max98357 i2s 3W Class D Amplifier, a small speaker, and ESPHome.
This is the second most difficult task I’ve undertaken in quite a while, second only to the 1000-line ESPHome ESP32 app I wrote to control my Pellet Stove. Wiring the voice supporting hardware was challenging only because all the i2s lines are called something different on each device. I thought i2s was a “standard” in the formal sense… guess not…
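As a rough illustration of that naming mismatch, here is a minimal ESPHome sketch for this kind of node. It assumes the mic and amp each get their own i2s bus, and every GPIO number and `id` below is a placeholder, not a tested pinout. The comments show which pin name on each part lines up with which ESPHome option.

```yaml
# Sketch only: GPIO numbers and ids are placeholders, adjust to your wiring.
i2s_audio:
  - id: i2s_in                # bus for the ICS43434 microphone
    i2s_lrclk_pin: GPIO25     # mic pin labelled "WS" (word select)
    i2s_bclk_pin: GPIO26      # mic pin labelled "SCK" (bit clock)
  - id: i2s_out               # bus for the MAX98357 amplifier
    i2s_lrclk_pin: GPIO27     # amp pin labelled "LRC" / "LRCLK"
    i2s_bclk_pin: GPIO14      # amp pin labelled "BCLK"

microphone:
  - platform: i2s_audio
    id: node_mic
    i2s_audio_id: i2s_in
    adc_type: external
    pdm: false
    i2s_din_pin: GPIO33       # mic pin labelled "SD" (serial data out of the mic)

speaker:
  - platform: i2s_audio
    id: node_speaker
    i2s_audio_id: i2s_out
    dac_type: external
    i2s_dout_pin: GPIO22      # amp pin labelled "DIN"
```

One trap worth noting: on common MAX98357 breakout boards the pin marked "SD" is the shutdown/mode input, not serial data, while "SD" on the ICS43434 is the data line.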
Then there’s Piper, Whisper, OpenWakeWord, pipelines, etc. And dozens of options for all of those that all need to be set correctly.
It’s a heavy lift.
Don’t get me wrong… this is early stuff, and as a formally trained audio electronics engineer I completely and deeply understand the massive complexity involved here. Most people think that just because Amazon and Google have cute, little, privacy-invasive devices that mostly work, it should be easy to replicate that on the Home Assistant ecosystem. Hardly.
While I can say “Hey Jarvis (the LED lights once the wake word is recognized), turn on light” and it responds “Turned on light” and the light goes on, I’m seeing VERY odd behavior in the ESPHome logs, and it takes a few reboots of the ESP for everything to settle down, connect all the pipelines & APIs and be ready to accept text input. And only then does the flakiness of Assistant really show its colors. But… the light did turn on after I told it to.
We are all witnessing the birth of real independence from Amazon and Google. It’s about time. I feel that this proof of concept some of us are now replicating will spawn a massive push to simplify, streamline and improve this amazing core voice functionality of our beloved Home Assistant Platform. We will also start seeing custom i2s voice processing chips that are designed with all of us makers in mind (think Wake Words in a chip) start coming to market. Give it time.
I for one am really excited to see how this all matures and I’ll be experimenting & testing all winter long and for as long as it takes to truly unplug & replace Alexa with Home Assistant. Her days are now actually numbered. --Jeff