Hello everyone,
I am planning to build a DIY smart video doorbell and am looking for suggestions from the community on the hardware setup, software architecture, and integrations.
Objective
The goal is to build a fully self-hosted smart video doorbell using a Raspberry Pi, with deep integration into Home Assistant and Frigate for AI detection. The system should support two-way video intercom communication through the Home Assistant mobile app on a phone or tablet.
Main Requirements
1. Integration with Home Assistant
The doorbell system should integrate completely with Home Assistant and support automations such as:
- Sending instant notifications when the doorbell button is pressed
- Opening the live video stream in the Home Assistant mobile app
- Triggering automations such as lights, recordings, or announcements
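To make the button-press requirement concrete, here is a rough sketch of how I imagine the door unit telling Home Assistant about a press through a webhook trigger. The Home Assistant URL, the webhook ID `doorbell_pressed`, and GPIO pin 17 are placeholders I made up, and I am assuming the `gpiozero` library on the Pi. Treat it as a starting point, not a tested design.

```python
import json
import urllib.request

# Placeholders (assumptions, not a real setup): adjust to your own install.
HA_URL = "http://homeassistant.local:8123"
WEBHOOK_ID = "doorbell_pressed"


def build_webhook_request(base_url: str, webhook_id: str, payload: dict) -> urllib.request.Request:
    """Build the POST request that fires a Home Assistant webhook trigger."""
    url = f"{base_url.rstrip('/')}/api/webhook/{webhook_id}"
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def watch_button(gpio_pin: int = 17) -> None:
    """Block forever and call the webhook on every press (only runs on the Pi)."""
    from signal import pause

    from gpiozero import Button  # imported here so the helper above works off the Pi

    button = Button(gpio_pin, bounce_time=0.05)

    def on_press() -> None:
        request = build_webhook_request(HA_URL, WEBHOOK_ID, {"event": "doorbell_pressed"})
        try:
            urllib.request.urlopen(request, timeout=5).close()
        except OSError as error:  # network problems should not crash the door unit
            print(f"Could not reach Home Assistant: {error}")

    button.when_pressed = on_press
    pause()
```

On the Home Assistant side, an automation with a webhook trigger using the same ID could then send the notification and start the other actions.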
2. Integration with Frigate
I plan to use Frigate NVR for AI-based object detection and event recording.
Frigate should handle:
- Person detection
- Motion events
- Snapshot and video recording
- Event triggers for Home Assistant automations
3. Two-Way Video Intercom
The system should support two-way communication between the visitor and the user.
Requirements:
- Live video stream
- Two-way voice communication
- Access through the Home Assistant mobile app
- Should work on mobile phones and tablets
- Preferably low latency
Door Unit Hardware (Planned)
For the door unit I plan to use:
- Raspberry Pi (Pi Zero 2 W or Raspberry Pi 4)
- High-quality Raspberry Pi camera module
- Microphone
- Speaker with small amplifier
- Physical doorbell button
- Status LED
- Weatherproof enclosure for outdoor use
- Wi-Fi or Ethernet connectivity
Smart & AI Features
Person Detection
Using Frigate, detect when a person approaches the door and trigger:
- Home Assistant notification
- Snapshot capture
- Video recording
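As an illustration of the person-detection trigger, this is a minimal sketch of how a message from Frigate's `frigate/events` MQTT topic could be filtered for a newly detected person. The camera name `front_door` is a placeholder of mine, and I am only relying on the `type`, `after.label`, and `after.camera` fields of the event payload.

```python
import json


def is_new_person_event(raw_payload: bytes, camera: str = "front_door") -> bool:
    """Return True if the payload is a new 'person' event from the given camera."""
    try:
        event = json.loads(raw_payload)
    except ValueError:  # covers invalid JSON and undecodable bytes
        return False
    if not isinstance(event, dict):
        return False
    after = event.get("after") or {}
    return (
        event.get("type") == "new"
        and after.get("label") == "person"
        and after.get("camera") == camera
    )
```

This function would be called from the message callback of whatever MQTT client is subscribed to `frigate/events`. I assume the Frigate integration for Home Assistant could cover the same ground without custom code.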
Automatic Light Control
When a person is detected at night, Home Assistant should automatically:
- Turn porch lights ON
- Improve camera visibility
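The night-time rule could look roughly like the sketch below. In practice I expect this to be an automation inside Home Assistant, so the Python here is only meant to pin down the logic. It assumes "night" is judged from the `sun.sun` entity being `below_horizon`, and the entity ID `light.porch` and the access token are placeholders.

```python
import json
import urllib.request


def should_turn_on_porch_light(label: str, sun_state: str) -> bool:
    """Lights go on only for a detected person while the sun is below the horizon."""
    return label == "person" and sun_state == "below_horizon"


def build_light_on_request(base_url: str, token: str, entity_id: str) -> urllib.request.Request:
    """Build a Home Assistant REST API call to the light.turn_on service."""
    url = f"{base_url.rstrip('/')}/api/services/light/turn_on"
    body = json.dumps({"entity_id": entity_id}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            "Authorization": f"Bearer {token}",  # long-lived access token (placeholder)
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

For example, `should_turn_on_porch_light("person", "below_horizon")` would allow the request for `light.porch` to be sent, while the same detection during the day would be ignored.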
Face Recognition
I would also like to add face recognition to identify known visitors such as:
- Family members
- Friends
- Regular visitors
This could trigger automations like announcements.
Alexa Announcements
Integration with Alexa smart speakers to announce events such as:
- “Someone is at the front door”
- “Doorbell was pressed”
- “Arya is at the front door” (if face recognition identifies a known person)
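To show how the announcement text could be chosen, here is a small sketch that maps an event (and an optional recognised name) to the phrases listed above. The event names `doorbell_pressed` and `person_detected` are labels I made up for illustration, and how the text actually reaches the Alexa speakers is left open.

```python
from typing import Optional


def announcement_text(event: str, recognised_name: Optional[str] = None) -> str:
    """Pick the spoken phrase for a doorbell event."""
    if event == "doorbell_pressed":
        return "Doorbell was pressed"
    if event == "person_detected":
        if recognised_name:
            # Face recognition identified a known visitor
            return f"{recognised_name} is at the front door"
        return "Someone is at the front door"
    raise ValueError(f"unknown event: {event!r}")
```

Keeping the wording in one place like this would make it easy to reuse the same text for phone notifications.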
Other AI Features (Optional)
Possible additional features:
- Package detection
- Unknown person alerts
- Visitor snapshot in notification
- Motion zone detection
- Event history
Additional Features
- Live camera stream in Home Assistant
- Doorbell press automation
- Snapshot notifications
- Video recording
- Motion alerts
- Secure local-only system without cloud dependency
Questions
- What would be the best way to implement two-way audio/video with Home Assistant?
- Which Raspberry Pi camera module would work best for this use case?
- What is the recommended approach for low-latency streaming to the Home Assistant app?
- Should AI processing run on the Raspberry Pi, or be handled by the Home Assistant / Frigate server?
- Are there any existing open-source projects implementing a similar DIY doorbell?
Planned System Architecture (Concept)
Door Unit
- Raspberry Pi
- Camera
- Microphone
- Speaker
- Doorbell button
Processing & Automation
- Home Assistant server
- Frigate NVR for AI detection
- Automations for notifications, lights, and announcements
User Interface
- Home Assistant mobile app
- Tablets for monitoring
- Alexa speakers for voice announcements
Goal
To build a reliable, privacy-focused, locally controlled smart doorbell with advanced AI features while maintaining strong integration with Home Assistant and Frigate.
Any suggestions regarding hardware choices, software stack, or implementation approach would be greatly appreciated.
Thank you!