Audio control, conferencing and multi-room audio

Thank you very much for the pointer! It took some time, but I found it: Is there a way to stream audio from one ESPHome to another? . I think this is exactly base what I want.

For better understanding, I’ve drawn a diagram of my current goal, which I think will satisfy my needs:

Using a built-in speaker is not an option for me, so I’m planning something like a pair of Teufel Ultima 20 or similar in each room.

Since I don’t think I need syncing, I’m not sure it will help me. I think it might even be harmful for a conference scenario.

The jack is only for connecting a laptop. I hate wires, but the jack is used all the time because it’s the fastest option to plug in and unplug.

I don’t think I need studio quality (and studios require purity), but I’d like it to be on par with consumer-grade devices from Yamaha or at least Onkyo.

I think the price of the ESP32 - both initial cost and operating cost - is great. It seems like it would be much more expensive if it were based on an RPi.

Thanks! However, I need a server anyway for Home Assistant, storage, and some experiments. Therefore, it won’t incur any additional cost, except some extra power consumption for STT and graphic card.