Stream audio from cellphone microphone to speaker with esp32

Hi, I was wondering is there a way to stream the audio that my cellphone picks up and play it in a speaker connected to an esp32?

I am trying to make an intelligent doorbell and I want to add the feature to talk with the people on the door through my phone.

That is exactly what I am trying to do. I have an intercom and would like to create two way communication between my phone and whoever is on the other side of the intercom. I live in an appartment. Unfortunately that means that I have to rely on the intercom system that is already installed.

Still waiting if someone can help on the idea with many thanks