TTS, STT, Ada, Rhasspy, Bluetooth speaker & USB microphone

Jpsy · December 28, 2020, 7:30pm

I am searching for ways to use HA OS (formerly known as Hass.io) as a text-to-speech and speech-to-text engine, specifically with a USB microphone and a Bluetooth speaker, both directly attached to or paired with my RPi4.

Ideally the result would become some kind of Amazon Echo / Google Home replacement. But simple TTS would be a great first step too.

I spent the better part of a day looking for solutions. I have found the setup for TTS and STT. I have seen the Ada integration and Almond. And I found Rhasspy (which looks very promising). I also have a Nabu Casa account (in case that helps).

But honestly: I cannot get even the simplest step up and running. After all the fuss about Ada and Almond some months ago, it seems unbelievable that this is so complicated. I have two intense years of HA experience behind me, but this currently looks like a wall with no doors or windows to me.

Jpsy · December 28, 2020, 7:34pm

Oh yes, and I can see the USB microphone in the Supervisor's hardware list, and I also managed to pair the Bluetooth speaker through the CLI using bluetoothctl. But I have NOT managed to get any audio into or out of HA through these channels.