Hey all, I’ve got a bug I’m running into and looking to see if anyone else has seen something similar.
The symptoms:
When asking for the first request in “awhile” (this happens at least every few hours if no command have been issued lately):
- The wake word is detected (the sound is issued that a command has been heard)
- The command is heard clearly and STT worked properly, per the debugger in the HA UI
- According to the Assist Satellite addon-logs it also thinks it played the audio response without error
- No audio response was played other than the ding that a command was received
- The actions do also happen in the background (lights on and off etc)
- Finally, any subsequent command works fine, without any issues. Full audio response, and the logs seem the same, to my eye, anyway
The setup:
I’m running the assist microphone addon, the latest HA version, and gemini as my LLM, and openwakeword as my wake word detection for my Jabra USB speakerphone. Assist Satellite is just a local audio version of Wyoming satellite running as an addon.
Have others experienced this?
Also some logs:
Assist Satellite
INFO:root:Connected to server
INFO:root:Streaming audio
DEBUG:root:Event(type='run-pipeline', data={'start_stage': 'wake', 'end_stage': 'tts', 'restart_on_end': True, 'snd_format': {'rate': 16000, 'width': 2, 'channels': 1}}, payload=None)
DEBUG:root:Ping enabled
DEBUG:root:Wake word detected
DEBUG:root:Event(type='transcript', data={'text': 'What time is it?'}, payload=None)
INFO:root:Streaming audio
DEBUG:root:Connected to snd service
DEBUG:root:Event(type='synthesize', data={'text': 'The time is 07:11 AM', 'voice': {'name': 'AvaNeural'}}, payload=None)
Playing raw data 'stdin' : Signed 16 bit Little Endian, Rate 16000 Hz, Mono
HA Assist pipeline of a failed run
stage: done
run:
pipeline: 01hzttk4v9w4374h8ct6qncmey
language: en
conversation_id: 01JYFCK65QD0ZFJ7A4PSKC7JEF
tts_output:
token: gtl8LmXy3vqv-nYKttvnYg.wav
url: /api/tts_proxy/gtl8LmXy3vqv-nYKttvnYg.wav
mime_type: audio/x-wav
stream_response: false
events:
- type: run-start
data:
pipeline: 01hzttk4v9w4374h8ct6qncmey
language: en
conversation_id: 01JYFCK65QD0ZFJ7A4PSKC7JEF
tts_output:
token: gtl8LmXy3vqv-nYKttvnYg.wav
url: /api/tts_proxy/gtl8LmXy3vqv-nYKttvnYg.wav
mime_type: audio/x-wav
stream_response: false
timestamp: "2025-07-01T04:55:24.660946+00:00"
- type: wake_word-start
data:
entity_id: wake_word.openwakeword
metadata:
format: wav
codec: pcm
bit_rate: 16
sample_rate: 16000
channel: 1
timeout: 0
timestamp: "2025-07-01T04:55:24.661212+00:00"
- type: wake_word-end
data:
wake_word_output:
wake_word_id: ok_nabu_v0.1
wake_word_phrase: ok nabu
timestamp: 33366780
timestamp: "2025-07-01T14:11:44.432968+00:00"
- type: stt-start
data:
engine: stt.home_assistant_cloud
metadata:
language: en-US
format: wav
codec: pcm
bit_rate: 16
sample_rate: 16000
channel: 1
timestamp: "2025-07-01T14:11:44.435498+00:00"
- type: stt-vad-start
data:
timestamp: 33367310
timestamp: "2025-07-01T14:11:45.092377+00:00"
- type: stt-vad-end
data:
timestamp: 33368730
timestamp: "2025-07-01T14:11:46.235985+00:00"
- type: stt-end
data:
stt_output:
text: What time is it?
timestamp: "2025-07-01T14:11:46.410689+00:00"
- type: intent-start
data:
engine: conversation.google_generative_ai_conversation
language: en-US
intent_input: What time is it?
conversation_id: 01JYFCK65QD0ZFJ7A4PSKC7JEF
device_id: aa360a57a7160ab5499b364b59a65b7e
prefer_local_intents: true
timestamp: "2025-07-01T14:11:46.411540+00:00"
- type: intent-end
data:
processed_locally: false
intent_output:
response:
speech:
plain:
speech: The time is 07:11 AM
extra_data: null
card: {}
language: "*"
response_type: action_done
data:
targets: []
success: []
failed: []
conversation_id: 01JYFCK65QD0ZFJ7A4PSKC7JEF
continue_conversation: false
timestamp: "2025-07-01T14:11:46.424303+00:00"
- type: tts-start
data:
engine: tts.home_assistant_cloud
language: en-US
voice: AvaNeural
tts_input: The time is 07:11 AM
timestamp: "2025-07-01T14:11:46.424554+00:00"
- type: tts-end
data:
tts_output:
media_id: media-source://tts/-stream-/gtl8LmXy3vqv-nYKttvnYg.wav
token: gtl8LmXy3vqv-nYKttvnYg.wav
url: /api/tts_proxy/gtl8LmXy3vqv-nYKttvnYg.wav
mime_type: audio/x-wav
timestamp: "2025-07-01T14:11:46.431173+00:00"
- type: run-end
data: null
timestamp: "2025-07-01T14:11:46.431278+00:00"
wake_word:
entity_id: wake_word.openwakeword
metadata:
format: wav
codec: pcm
bit_rate: 16
sample_rate: 16000
channel: 1
timeout: 0
done: true
wake_word_output:
wake_word_id: ok_nabu_v0.1
wake_word_phrase: ok nabu
timestamp: 33366780
stt:
engine: stt.home_assistant_cloud
metadata:
language: en-US
format: wav
codec: pcm
bit_rate: 16
sample_rate: 16000
channel: 1
done: true
stt_output:
text: What time is it?
intent:
engine: conversation.google_generative_ai_conversation
language: en-US
intent_input: What time is it?
conversation_id: 01JYFCK65QD0ZFJ7A4PSKC7JEF
device_id: aa360a57a7160ab5499b364b59a65b7e
prefer_local_intents: true
done: true
processed_locally: false
intent_output:
response:
speech:
plain:
speech: The time is 07:11 AM
extra_data: null
card: {}
language: "*"
response_type: action_done
data:
targets: []
success: []
failed: []
conversation_id: 01JYFCK65QD0ZFJ7A4PSKC7JEF
continue_conversation: false
tts:
engine: tts.home_assistant_cloud
language: en-US
voice: AvaNeural
tts_input: The time is 07:11 AM
done: true
tts_output:
media_id: media-source://tts/-stream-/gtl8LmXy3vqv-nYKttvnYg.wav
token: gtl8LmXy3vqv-nYKttvnYg.wav
url: /api/tts_proxy/gtl8LmXy3vqv-nYKttvnYg.wav
mime_type: audio/x-wav
Anecdotally, my brother has had the same problem (haven’t confirmed with logs yet, but identical behavior) using a voice PE, so my suspicion is outside of satellite, but first I wanted to see if others have seen this.