Use the ONNX ASR add-on with the Parakeet v2 model, paired with Piper for speech synthesis.
This setup delivers near-instantaneous responses for local voice commands — the 125H is plenty powerful.
Any extra delay usually comes from the LLM, so if you use it, it is important to choose a suitable model and provider.