ESP32 S3 Box Lite - Arduino framework config

Does anyone here have a working config (one that uses the Arduino framework) for the ESP32 S3 Box Lite?
The config ESPHome published on GitHub is only for the ESP-IDF framework: https://github.com/esphome/firmware/blob/1cc35128b9d3d2e7edf2dd62331a058cc27e754d/voice-assistant/esp32-s3-box-lite.yaml
That is annoying, because ESP-IDF does not support media_player and I need that for text-to-speech.
Thanks