I've got some M5Stack Atom Echos working fine with Assist. I can turn lights on and off without any issues, but the device never plays back any confirmation audio.
The ESPHome logs show:
[11:27:22][D][voice_assistant:684]: STT by VAD end
[11:27:22][D][voice_assistant:422]: State changed from STREAMING_MICROPHONE to STOP_MICROPHONE
[11:27:22][D][voice_assistant:428]: Desired state set to AWAITING_RESPONSE
[11:27:23][D][voice_assistant:422]: State changed from STOP_MICROPHONE to STOPPING_MICROPHONE
[11:27:23][D][light:036]: 'Den Atom Echo' Setting:
[11:27:23][D][light:059]: Red: 0%, Green: 0%, Blue: 100%
[11:27:23][D][light:109]: Effect: 'Fast Pulse'
[11:27:23][D][esp-idf:000]: I (269069) I2S: DMA queue destroyed
[11:27:23][D][voice_assistant:422]: State changed from STOPPING_MICROPHONE to AWAITING_RESPONSE
[11:27:23][D][voice_assistant:529]: Event Type: 4
[11:27:23][D][voice_assistant:557]: Speech recognised as: "Turn off the den lights."
[11:27:23][D][voice_assistant:529]: Event Type: 5
[11:27:23][D][voice_assistant:562]: Intent started
[11:27:34][D][voice_assistant:529]: Event Type: 6
[11:27:34][D][voice_assistant:529]: Event Type: 7
[11:27:34][D][voice_assistant:585]: Response: "Turned off light"
[11:27:34][D][light:036]: 'Den Atom Echo' Setting:
[11:27:34][D][light:051]: Brightness: 100%
[11:27:34][D][light:059]: Red: 0%, Green: 0%, Blue: 100%
[11:27:34][D][light:109]: Effect: 'None'
[11:27:34][D][voice_assistant:529]: Event Type: 8
[11:27:34][D][voice_assistant:605]: Response URL: "http:/<HOMEASSISTANTIP>/api/tts_proxy/db98156bf572727274889253f275cea21c83824c_en-ca_dba8942832_cloud.wav"
[11:27:34][D][voice_assistant:422]: State changed from AWAITING_RESPONSE to STREAMING_RESPONSE
[11:27:34][D][voice_assistant:428]: Desired state set to STREAMING_RESPONSE
[11:27:34][D][esp-idf:000]: I (280281) I2S: DMA Malloc info, datalen=blocksize=512, dma_buf_count=8
[11:27:34][D][i2s_audio.speaker:164]: Started I2S Audio Speaker
[11:27:34][D][i2s_audio.speaker:167]: Stopping I2S Audio Speaker
[11:27:34][D][i2s_audio.speaker:178]: Stopped I2S Audio Speaker
[11:27:34][D][light:036]: 'Den Atom Echo' Setting:
[11:27:34][D][light:051]: Brightness: 60%
[11:27:34][D][light:059]: Red: 100%, Green: 89%, Blue: 71%
[11:27:34][D][voice_assistant:325]: Speaker has finished outputting all audio
[11:27:34][D][voice_assistant:422]: State changed from RESPONSE_FINISHED to IDLE
It looks like the audio starts and then stops immediately, even though I can play the WAV file from a web browser without any issues.
I've got no customization in the config file:
substitutions:
  name: denm5stack
  friendly_name: Den Atom Echo
packages:
  m5stack.atom-echo-voice-assistant: github://esphome/firmware/voice-assistant/m5stack-atom-echo.yaml@main
esphome:
  name: ${name}
  name_add_mac_suffix: false
  friendly_name: ${friendly_name}
api:
  encryption:
    key: kYYaz9V9MqR1MKs9HugqRDdVGjSd5QJTrnYR7xDnqis=
wifi:
  ssid: !secret wifi_ssid
  password: !secret wifi_password
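To see more of what happens between "Started I2S Audio Speaker" and "Stopping I2S Audio Speaker", one option I could try is raising the log level. This is only a minimal sketch: it assumes a local `logger:` block merges with (and overrides) whatever the package already defines, and I haven't confirmed that `VERBOSE` surfaces any extra speaker detail on this firmware.

```yaml
# Added alongside the existing top-level keys in the device config.
# Assumption: this overrides the log level set by the imported package.
logger:
  level: VERBOSE
```

If the extra output gets too noisy, setting the level back to `DEBUG` should restore the logging shown above.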
What have I missed?