Assist with M5Stack: No audio confirmation

I’ve got some M5 Stacks working fine with Assist, I can turn off/on lights without any issues, but the device never plays back and confirmation audio

The ESPHome logs show

[11:27:22][D][voice_assistant:684]: STT by VAD end
[11:27:22][D][voice_assistant:422]: State changed from STREAMING_MICROPHONE to STOP_MICROPHONE
[11:27:22][D][voice_assistant:428]: Desired state set to AWAITING_RESPONSE
[11:27:23][D][voice_assistant:422]: State changed from STOP_MICROPHONE to STOPPING_MICROPHONE
[11:27:23][D][light:036]: 'Den Atom Echo' Setting:
[11:27:23][D][light:059]:   Red: 0%, Green: 0%, Blue: 100%
[11:27:23][D][light:109]:   Effect: 'Fast Pulse'
[11:27:23][D][esp-idf:000]: I (269069) I2S: DMA queue destroyed

[11:27:23][D][voice_assistant:422]: State changed from STOPPING_MICROPHONE to AWAITING_RESPONSE
[11:27:23][D][voice_assistant:529]: Event Type: 4
[11:27:23][D][voice_assistant:557]: Speech recognised as: "Turn off the den lights."
[11:27:23][D][voice_assistant:529]: Event Type: 5
[11:27:23][D][voice_assistant:562]: Intent started
[11:27:34][D][voice_assistant:529]: Event Type: 6
[11:27:34][D][voice_assistant:529]: Event Type: 7
[11:27:34][D][voice_assistant:585]: Response: "Turned off light"
[11:27:34][D][light:036]: 'Den Atom Echo' Setting:
[11:27:34][D][light:051]:   Brightness: 100%
[11:27:34][D][light:059]:   Red: 0%, Green: 0%, Blue: 100%
[11:27:34][D][light:109]:   Effect: 'None'
[11:27:34][D][voice_assistant:529]: Event Type: 8
[11:27:34][D][voice_assistant:605]: Response URL: "http:/<HOMEASSISTANTIP>/api/tts_proxy/db98156bf572727274889253f275cea21c83824c_en-ca_dba8942832_cloud.wav"
[11:27:34][D][voice_assistant:422]: State changed from AWAITING_RESPONSE to STREAMING_RESPONSE
[11:27:34][D][voice_assistant:428]: Desired state set to STREAMING_RESPONSE
[11:27:34][D][esp-idf:000]: I (280281) I2S: DMA Malloc info, datalen=blocksize=512, dma_buf_count=8

[11:27:34][D][i2s_audio.speaker:164]: Started I2S Audio Speaker
[11:27:34][D][i2s_audio.speaker:167]: Stopping I2S Audio Speaker
[11:27:34][D][i2s_audio.speaker:178]: Stopped I2S Audio Speaker
[11:27:34][D][light:036]: 'Den Atom Echo' Setting:
[11:27:34][D][light:051]:   Brightness: 60%
[11:27:34][D][light:059]:   Red: 100%, Green: 89%, Blue: 71%
[11:27:34][D][voice_assistant:325]: Speaker has finished outputting all audio
[11:27:34][D][voice_assistant:422]: State changed from RESPONSE_FINISHED to IDLE

It looks like the audio starts/stops immediately and I can play the wav file without issues from the web browser

I’ve got no customization on the config file

substitutions:
  name: denm5stack
  friendly_name: Den Atom Echo
packages:
  m5stack.atom-echo-voice-assistant: github://esphome/firmware/voice-assistant/m5stack-atom-echo.yaml@main
esphome:
  name: ${name}
  name_add_mac_suffix: false
  friendly_name: ${friendly_name}
api:
  encryption:
    key: kYYaz9V9MqR1MKs9HugqRDdVGjSd5QJTrnYR7xDnqis=


wifi:
  ssid: !secret wifi_ssid
  password: !secret wifi_password

What have I missed?

After some updates via ESPHome, it seems like it’s working as expected now, I’m not really sure why it broke (the encryption key has changed since my original post)

Since this was the first post I saw when I saw “no audio” when searching. I found the fix for now which was just adding the following in your esp portion of the yaml. Hope this helps anyone else stressing out about getting it to compile.

esp32:
  framework:
    type: esp-idf
    version: 4.4.8
    platform_version: 5.4.0