I’ve been trying for weeks to get a voice assistant running. I wanted to build one for the experience and because it’s customisable, and also because it should be cheaper than buying a pre-made one, although ironically I’ve spent way more on parts trying to get stuff to work.
I want both a push-to-talk (PTT) assistant and a wake word assistant.
Boards I’ve tried:
ESP32D Wroom
ESP32 S3 Mini
Seeed Studio ESP32S3Plus
I’ve mainly used MAX98357A amplifiers, with INMP441 and ICS43434 microphones.
I’ve tried copying and editing code from ESPHome and GitHub.
I’ve spent literal days with ChatGPT, Claude, and DeepSeek trying to get to the bottom of things.
I even bought an M5Stack Atom Echo (albeit the old Pico version and not the S3), and I’ll come to that in a moment.
I’d post the code I’ve tried, but honestly I must have tried over 30 versions at this point. As I said, I’ve spent days debugging with AI, as well as following its suggestions to buy more stuff to try.
My confusion comes from seeing loads of people online seemingly making voice assistants with boards that don’t have the PSRAM of the Seeed S3 Plus, and PTT versions with a little C3 (or C6, I can’t remember). Even following along with code on GitHub from a chap called ‘war lion’ got me nowhere: it compiles etc. but nothing ever happens.
I’ve tried to be logical about it. It must be the hardware, so I get more hardware, and still I can’t make it work. So it must be my soldering; cool, I’ll breadboard everything. So it must be my pin choices; OK, I’ll research data sheets, and I’ll quiz AI after getting it to look at the data sheets too.
Still absolutely no luck.
Even last night I was trying to get the aforementioned WarLion code to work with Claude, and in that particular case the audio isn’t getting to the speaker. (Yes, I have pipelines set up. I even set up a Home Assistant Cloud pipeline, which then managed to crash my system and stop HA working for an hour.) Claude decided that the power coming from the 5 V pin of the Seeed S3 wasn’t enough for the MAX98357A, and that I should feed external 5 V directly to the amplifier as well as to the ESP. Again though, I’ve literally never seen anybody doing that. Heck, AI tells me that ESP boards can’t run any more than one LED, but that just seems wrong.
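To sanity-check that power claim, here’s a rough back-of-envelope sketch in Python. The numbers are assumptions on my part, not measurements: a 500 mA USB 2.0 port, roughly 350 mA of Wi-Fi peak draw for the ESP32-S3, about 90% amplifier efficiency, and the 3.2 W into 4 Ω at 5 V maximum that I believe the MAX98357A datasheet quotes. The function names are made up purely for illustration.

```python
# Rough power-budget check for an ESP32-S3 plus a MAX98357A on one USB port.
# Every figure here is a ballpark assumption, not a measurement.
USB_PORT_LIMIT_A = 0.5        # assumed: standard USB 2.0 port limit
ESP32_S3_WIFI_PEAK_A = 0.35   # assumed: peak draw while transmitting over Wi-Fi
AMP_EFFICIENCY = 0.9          # assumed: Class D amplifier efficiency


def amp_supply_current(output_watts, supply_volts=5.0, efficiency=AMP_EFFICIENCY):
    """Current the amplifier pulls from the supply for a given output power."""
    return output_watts / (supply_volts * efficiency)


def budget(output_watts):
    """Return (total current in amps, whether it fits the USB port limit)."""
    total = ESP32_S3_WIFI_PEAK_A + amp_supply_current(output_watts)
    return total, total <= USB_PORT_LIMIT_A


# Quiet playback up to the assumed datasheet maximum of 3.2 W
for watts in (0.25, 0.5, 1.0, 3.2):
    total, ok = budget(watts)
    verdict = "within" if ok else "over"
    print(f"{watts} W out: about {total:.2f} A total, {verdict} the 500 mA limit")
```

If those assumptions are anywhere near right, quiet playback would fit inside a USB port’s budget and only loud playback would go over it, so power on its own probably wouldn’t explain getting no sound at all.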
Going back to the Atom Echo: I bought it and it worked. Then I tried to add an external amplifier and speaker, broke the code, and after spending days on it with AI it has never worked properly since.
I’m not going to give up. But in a world where people seem to have made voice assistants with loads of different types of ESP boards, and not just the full-size S3 dev kit with one particular chip or a ReSpeaker, what can I do to be better and actually have a chance of making something that works?
