I prefer to just separate my camera from the doorbell for a few reasons. Everyone is aware of them and looking for doorbell cameras so they can cover faces or whatever. I find it better and still very easy if you solit them up and me for example, someone pressing the doorbell triggers the camera mounted up under the roof sofet to start recording and keep recording untill the internal motion sensor stops detecting presence at the door. So, it works basically the same exact way and does the same things exect IMO the views are better when my camera is mounted up higher.
Id just get yourself a real outdoor security camera to start with because no disrespect but thise esp32-cam’s are junk and have poor picture and streaming video quality plus they have a long history of overheating and self-destructing but, to each his own, im just trying to help.
Also, have you checked out this excellent 2-way audio syatem someone else here made?? I haven’t tried it yet but, it looks awesome and people seem to like it alot.