I think it should be possible to use a different (ML) algorithm to do the intent matching. Which is reliable and doesn’t hallucinate like LLMs. Something like a decision tree, to find the most probable intent.
To also recognize 90% matching intends like:
Turn off the light now.
Nabu turn off the lights.