HA Video Vision
AI-powered video analysis and facial recognition for Home Assistant cameras.
Installation
Add as custom repository in HACS:
https://github.com/LosCV29/ha-video-vision
Features
Real video analysis - sends video clips, not snapshots
Facial recognition - “Who’s at the door?” → “It’s Carlos”
Free by default - OpenRouter Nemotron model
Local option - run on your own GPU
Smart notifications - AI descriptions with snapshots
Supported Providers
| Provider | Model | Cost |
|---|---|---|
| OpenRouter | Nemotron 12B VL | FREE |
| Google Gemini | gemini-2.0-flash | Free tier |
| Local vLLM | Qwen-VL, LLaVA | Free |
Example Output
Instead of: “Motion detected on Front Porch”
You get: “A woman in a blue jacket is approaching the front door carrying a package. Identified: Mom (87% confidence)”
Works With
- PolyVoice - Voice control: “Check the driveway”
- Any RTSP camera - Reolink, Hikvision, etc.
- Frigate - Trigger on events
Links
- GitHub: GitHub - LosCV29/ha-video-vision: AI video analysis and facial recognition for Home Assistant cameras
- PolyVoice (companion): GitHub - LosCV29/polyvoice: Multi-provider voice assistant for Home Assistant with 15+ built-in functions
Feedback welcome!