The entity_id of the device that captured the voice input should be part of the intent (And the intent should be emitted as an event).
This gives context to intents. For example “Turn on the lights” while spoken in the living room could turn on the lights only in the living room, while “turn on the lights” spoken in the kitchen could turn on only the lights in the kitchen.