For long TTS responses I am now testing this alternative approach based on real-time streaming from LLM to TTS directly: Streaming LLM's responses into TTS for near-instant responses (works with HAVPE!)
2 Likes