ESP32 S3 Box3

cad64 · May 16, 2024, 9:19am

I’m having the same issue since the update. It was working fine previously.

iophobia · May 16, 2024, 1:16pm

I also don’t get spoken responses anymore.

OverloadUT · May 16, 2024, 5:47pm

Adding my voice to those who aren’t getting spoken responses from the S3 Box3 after the 2024.5.0 update.

drewsky208 · May 16, 2024, 8:06pm

Oh happy day, well kinda (happy in that misery loves company and it wasn’t something I did). Finally a thread where someone else sees what I see. Since 2024.5.0, my S3 box 3 can now hear me but it’s silent.

o0o-sp · May 18, 2024, 9:11pm

Can you please tell me exactly where I should insert this YAML code? In the “voice_assistant” section of the esp32-s3-box-3 YAML config file?

super-qua · May 19, 2024, 8:47am

I also did not get any spoken response with 2024.5.0.
Reverting the ESPHome CLI to 2024.4.2 worked.

Edwin_D · May 19, 2024, 10:00am

Assuming spoken response is fixed, does not having a media player also mean there is no way to send TTS messages, or is that already possible? I’m not interested in music, but I am in announcements.

kicker10BOG · May 21, 2024, 3:34am

I just did the second update today and still am not getting a spoken response. Is there something I need to add to the configuration now?

smcnaught · May 21, 2024, 3:36am

Short term fix - downgrading to an older version of esp-idf:

esp32:
  board: esp32s3box
  flash_size: 16MB
  framework:
    type: esp-idf
    version: 4.4.6

chaotik_penguin · May 22, 2024, 7:36pm

I’ve been looking into how to downgrade. I’ve only installed ESPHome/voice assistant via the web browser before. Do I need to run ESPHome on my Linux laptop, compile a firmware and put it onto the esp32-s3-box-3? Just trying to find some clear steps for putting an older version on it.

Any help is appreciated.

D3SOX · May 22, 2024, 11:46pm

I also needed this, but at first my microphone didn’t work. I’m unsure which change fixed that (maybe the revert to ESPHome 2024.4.x), but I have a working version now with on-device hot word, mic, Wi-Fi, everything working, in my D3SOX/firmware repository on GitHub.

I flashed it via ESPHome Web. The first click on Configure Wi-Fi might give an error, but try again; it always worked the second time for me.

janstadt · May 24, 2024, 12:48pm

Yes, that’s correct.

youkorr · May 25, 2024, 10:04pm

Good evening. I looked at your firmware and it is really great: I have no bugs and everything works perfectly. However, I cannot get the build to compile when I add this package:
github://jaymunro/esphome_firmware/wake-word-voice-assistant/esp32-s3-box-3.yaml@continued-conversation

Whereas this one installs and compiles fine in ESPHome:
github://jaymunro/esphome_firmware/esp32-box-docks/box3-sensor-for-voice-assistant.yaml@sensor_dock

DragonQ · May 29, 2024, 1:26pm

Here is my config for my brand new ESP32-S3-BOX-3 which “works” except the wake word detection is awful. It works maybe 1 in 10 times and that’s when we’re talking directly into the mic (10cm away) quite loudly. No chance from across the room. Am I doing something wrong? Do I have a unit with a duff mic?

# NOTE: Microphone is broken when using ESPHome 2024.5.x, so use 2024.4.2 docker image for now!

# Global variables
substitutions:
  name: ha-voice-1
  friendly_name: HA Voice 1
  micro_wake_word_model: hey_jarvis

# Main package
packages:
  esphome.voice-assistant: github://esphome/firmware/wake-word-voice-assistant/esp32-s3-box-3.yaml@main

# Remove MAC from name
esphome:
  name_add_mac_suffix: false

# WiFi credentials
wifi:
  ssid: !secret wifi_ssid
  password: !secret wifi_password
  domain: .not.lan

# Bug fix for speaker not working
esp32:
  framework:
    version: 4.4.6
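
The note at the top of this config mentions using the 2024.4.2 Docker image. A minimal Compose sketch of what pinning that version could look like, assuming the official `ghcr.io/esphome/esphome` image; the host config path and the container name are placeholders, not taken from this thread:

```yaml
# docker-compose.yaml (sketch, adjust paths to your own setup)
services:
  esphome:
    container_name: esphome            # hypothetical name
    # Pin the tag instead of using "latest" so 2024.5.x is not pulled
    image: ghcr.io/esphome/esphome:2024.4.2
    volumes:
      - /path/to/esphome/config:/config   # placeholder host path
      - /etc/localtime:/etc/localtime:ro
    restart: unless-stopped
    network_mode: host                 # lets the dashboard discover devices via mDNS
```

Once a fixed release is out, changing the tag back and recreating the container should be enough to return to the current version.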

exquisitus · May 29, 2024, 10:44pm

DragonQ

I too have a brand new box with exactly the same problem, the only difference being that I have not installed ESPHome; it’s still on the factory firmware. I came here searching for possible clues and thought that perhaps installing a different firmware would give me some idea of what’s wrong with it… Now I think Espressif is just selling crap. My experience with customer service was… totally bizarre: solid denial, what you are saying can’t possibly be true, because it just can’t, and that’s that. I didn’t really expect that from Espressif, but here I am with a box’o’crap instead of a working product, looking for other, more reliable vendors…

sparkydave · May 29, 2024, 11:03pm

Perhaps rather than simply using the github package link you could try copying the code and editing the following bit (or using further substitutions), playing with the settings to see if you can get the mic working better.

voice_assistant:
  id: va
  microphone: box_mic
  speaker: box_speaker
  use_wake_word: true
  noise_suppression_level: 2
  auto_gain: 31dBFS
  volume_multiplier: 2.0
  vad_threshold: 3

Try playing with the volume multiplier, noise suppression and VAD threshold. Having said that, mine isn’t great either.

DragonQ · May 30, 2024, 8:19am

FYI, copying the code doesn’t achieve anything; you can override those values after including the package.

I have messed with the volume_multiplier and vad_threshold a bit but they don’t seem to help.
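
For anyone following along, a minimal sketch of that override approach. It assumes ESPHome’s package merging, where keys set in the local file replace the same keys coming from the package. The specific numbers are placeholders for experimentation, not recommended values:

```yaml
# Pull in the stock S3 Box 3 voice assistant config
packages:
  esphome.voice-assistant: github://esphome/firmware/wake-word-voice-assistant/esp32-s3-box-3.yaml@main

# Only the keys listed here are overridden; everything else
# (id, microphone, speaker, ...) still comes from the package.
voice_assistant:
  noise_suppression_level: 2
  auto_gain: 31dBFS
  volume_multiplier: 4.0   # placeholder value to try
  vad_threshold: 3
```

Changing one value at a time and re-flashing makes it easier to tell which setting, if any, makes a difference.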

sparkydave · May 30, 2024, 8:41am

Which is why I also said:

DragonQ · May 30, 2024, 8:58am

I do have another unit to try so if that has the same issues then I think I’ll have to sadly park this as “not even ready for early adopters”.

Maybe some updates in a few months will improve things but it does seem odd that videos exist of people using this hardware seemingly with ease.

EDIT: Good news, the second unit I bought seems to work way better than the first one, despite both running the same software (same YAML except device name). 3/3 wake words recognised so far, all from a metre away.

I will try to return the first one as faulty unless the shipping is prohibitive, in which case I’ll keep it as a device to play with.

pepe59 · May 30, 2024, 2:55pm

Mine works very well at a distance of about 5m.
Have you removed the foil from the display and microphones?