I have a nearly perfect voice pipeline setup that does everything I want (except alarms).
I’m running HomeAssistant in docker (so anywhere you see an “add on” referenced below, they too are running as their own docker containers since I can’t add “add ons” to HA in the traditional manner since it doesn’t support it when running HA in docker).
- Linux Voice Assistant (paired with a Jabra 410 conference speakerphone/microphone) - GitHub - OHF-Voice/linux-voice-assistant: Voice satellite for Home Assistant using the ESPHome protocol
- Extended OpenAI Conversation - GitHub - jekalmin/extended_openai_conversation: Home Assistant custom component of conversation agent. It uses OpenAI to control your devices.
- Text to Speech - Faster Whisper with Parakeet - GitHub - rhasspy/wyoming-faster-whisper: Wyoming protocol server for faster whisper speech to text system
- Speech to Text - Kokoro (CPU only) - GitHub - remsky/Kokoro-FastAPI: Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
- LLM running on an AMD MI60 32gb GPU - bartowski/phi-4-GGUF:Q4_0 (via llama.cpp using llama-server)
- Custom trained (on personal voice samples) wake word (Hey Kitty Robot) - GitHub - TaterTotterson/microWakeWord-Trainer-Nvidia-Docker: Train microWakeWord for use with HomeAssistant Voice (trained using CPU only took 43 minutes)
This works remarkably well and I pulled all Alexa devices out of my house, and I can do things that I never expected to be possible. I can say things (these are all real examples, and none of them are custom sentences…the LLM just figures it out):
- “Hey kitty robot, good morning” - Response: “Good morning, would you like me to turn on the lights and feed the cats?” - (without me needing to use the wake word again): “Yes” - “I’ve turned on the kitchen and living room lights and the cats are being fed”
- “Hey kitty robot, do I need to wear a coat if I go out today?” - Response: “The current temperature is 16°, and the weather is mostly clear, so I would recommend wearing warm clothes and a coat. If you’re sensitive to the cold, consider gloves and a hat.”
- “Hey kitty robot, I’m leaving the house” - Response: “Would you like me to turn off all the lights?” - (without me needing to use the wake word) - “Yes” - Response: “I’ve turned off the living room lights, the hallway light and kitchen lights. Would you like me to do anything else?” (without me using the wake word) - “No.”
I can set timers, and they work incredibly well. But what I can’t get figured out is alarms/reminders. These were two big things we used all the time with Alexa, and I just can’t figure out a way to get this functionality within HA Voice (it’s not a big enough reason to put the Alexa spyware devices back into our house, but nonetheless it’s annoying have to use our iPhones instead of the voice assistant).
Things I’ve tried, none of which worked (for me, your mileage may vary, as it’s possible that maybe something in my pipeline is preventing them from working):
- GitHub - omaramin-2000/HA-Alarms-and-Reminders: Set alarms and reminders to ring in home assistant voice satellite using intents or from the dashboard
- GitHub - Pewidot/ha_voice_alarms
- GitHub - gurux13/hass-alarm: Wake-up alarm HASS custom integration
Those were the only “workarounds” I’ve seen mentioned after digging into this topic quite extensively.
Does anybody have something that’s working for them, or if it’s known if HA is going to be adding this kind of functionality?
