Recommended speaker setup for TTS

Hi Team

Would like to be able to communicate with ‘HA’ when going to bed - so in the bedroom be able to say ‘going to bed’ (or push a button) and then run a script ‘turning off all lights’, checking if any windows are open, if the shed is open, remind if the thrash is being pick up next morning - guess it would be easiest with a speaker to respond with TTS.

Through out the house, I have Symfonic/Ikea speakers for music (and TTS), and a few Google Home for voice commands.

Currently there is nothing in the bedroom - and the goal is not really to have music, but HA ‘dialogue’.

So, what options do fit into my setup ? These are ones that I see, what have I missed ?

Button + TTS speak

-Ikea speaker (guess there are still a few to get picked up) for TTS respons, and a button - cost around 120 euro. But no ‘dialoque’, just TTS responds to button. But solves the task.

-Self build (like this TTS Speaker) ESP with AMP and Speaker - guess cost would be 20-30 Euro. Not sure how it works with TTS, but seems like it should work.

‘Dialogue’ + TTS speak

Could be fun to be able to ask questions - and nut just rely on a ‘button’ that can do 1-2 things.

-Sones Era 100 - 200 euro

-Google Nest mini - 40 euro

-Home Assistant Voice Preview Edition 60-70 Euro (with a speaker). Seems like it is possible to attach a simple speaker to the jack plug in the PE, and be able to get TTS responds from that ?

Can some one confirm that the HA Voice PE with a jack connected speaker can play TTS ? Or is there no need for a speaker ? Can you say to PE “Go to bed’, it will then run the ’Go to bed’ script - the script has several steps (like turning of all lights, check shed, check trash), and each step have their own TTS resonds (‘lights turned of’, ‘shed door is open, ‘green house open’, ‘set out green trash’, ‘set out plastic trash’…)

Guess easiest and cheapest is to go for a Google nest.

But my heart says ‘HA voice PE’ (with a speaker?).

What are the recommendations ?

ha voice pe has a speaker, and a button, and a touch sensitive ring around the button to adjust volume, dual microphones, and an LED light ring.

I don’t remember, but according to @ Home Assistant Voice Preview Edition - Home Assistant there is an audio out. It’s not needed (speaker built in), unless I suppose you wanted higher quality audio.

see also @ https://support.nabucasa.com/hc/en-us/categories/24451727188125-Home-Assistant-Voice-Preview-Edition

I have 2 HAVPE, one for upstairs and another for downstairs areas.
I would easily recommend it over others, simply due to helping to support home assistant/nabucasa, but also consider how well google (or sonos for that matter) supports or more importantly, doesn’t support their stuff.

Any satellite on ESP32S3 is enough for you. You can make it yourself. You can buy a ready-made one. With some customization, you can experience Continued conversation beyond just executing commands or answering questions.
I also prefer a separate infrastructure for music, and use satellites only for voice interaction.

Voice Assistant esphome with Esp32-s3 - Max98357 - Inmp441 You can use a different DAC if you want a jack socket or better sound quality.

GitHub - formatBCE/Respeaker-Lite-ESPHome-integration DIY analogue of VPE

or VPE

Yes, you can, and connecting it to a speaker that has AUX IN will improve the sound. To have the Home Assistant Voice PE do two way dialogue you need not only TTS but also STT. The device itself doesn’t provide them. These services must be provided for example on your Home Assitant server, a PC on the network or the Nabu Casa subscription.

Do you have any reference for this “Hi-Fi systems”?

I’m trying to “duplicate” or “copy” the behaviour of a typical Alexa setup (I made the mistake of buying just amazon devices, now I have trouble with the AI of Amazon that never update and It’s not “native” to locally with HA). Anyways, I’m trying to configure other setup but (as I don’t have one) I don’t know how bad/well the sound of a HAVPE Is. Is it worse than an echo 5th gen? If yes, how can I “fix that” keeping the same funcionality as Amazon devices?

As I could find, the only way is using well integrated with HA speakers (sonos or denon for best quality) and use the HAPVPE just as a “microphone”. But I couldn’t realize how does that REALLY work (I mean for example with Alexa you have a integrated speaker, you can talk, and if you have music it volumes down to hear your instructions, then give you the response, and lastly comes back the original volumen again without really “interrupt” the music that behaviour should happen even without calling the assistant, even if I’m listening music and someone give an announcement to my specific speaker, or to all speakers in the house, it should have the same behaviour turn down the volume say the announcement and turn up to original, also you can play audio in all speakers at the same time and all of this without breaking the sonos/denon integration like audio groups and other funcionalities that the speakers have themselves). Have you heard about any content here that can help me? I really want not to make a mistake again on this, but I need someone with this experience to help me. Ay least with speaker/mic brands integration (I don’t know if HAVPE mic + sonos or denon spkr is the best combination, or is it other one better over there). Thanks for your help.