I am searching for ways to use HA OS (formerly AKA Hassio) as a Text-to-speech and Speech-to-text engine, i.e. with a USB microphone and a Bluetooth speaker, both directly attached / paired to my RPi4.
Ideally the result would become some kind of Amazon Echo / Google Home replacement. But simple TTS would be a great first step too.
I spent the better part of a day to look for solutions. I have found the setup for TTS and STT. I see the Ada integration and Almond. And I found Rhasspy (which looks very promising). I also have a Nabu Casa account (in case that helps).
But honestly: I do not get the simplest step up and running. After all the fuzz about Ada and Almond some months ago it seems unbelievable that this is so complicated. I look back at two intense years of HA experience. But this currently looks like a wall with no doors or windows to me.