Speaker recognition in Voice Assistant

Adding speaker recognition to Assistant Pipeline can improve usability of voice assistant.

Maybe it can be run as add-on running with Whisper.

Make speaker recognition part of Wyoming protocol.

I’d like to bring this up again. Diarization or speaker idenfitication are a really cool thing in Google Home devices, where I can work with my own calendar and todo list while my wife works with her services.

Similarly we could allow creating speech embeddings of multiple people within Home Assistant. Each speaker could be mapped to a specific Assist pipeline, which then has only the devices and services enabled, which are mapped to this person.

Similarly, specific prompts and specific MCP servers could be enabled.

Diart could be one of many helpful libraries to achieve this.

3 Likes

I second this. I’ll take just the “speaker identity” as part of the intent trigger.

Any progress on speaker recognition in whisper or home assistant?

I would like this as well