Local realtime person detection for RTSP cameras

Are you using a Coral TPU? Do you see a message about it being detected or not detected on Frigate startup?

No, I have no Coral (yet). I'm trying to use CPU only.

Just to give you a benchmark, I was using a ~2015 i7 with 640x480 (or thereabouts) substreams from 4 cameras @ 5 fps, and it used up the entire CPU. And that was with only 8 zones. With the new dynamic zones, I find it's often analyzing many more zones than that now (maybe double or more).

It's extremely CPU intensive to run. There's a reason no one does it that way: the $70 for the Coral is so much more cost effective. So I think the CPU utilization you are seeing on an old CPU is well within what could be expected, especially when it's virtualized and, by the sound of it, not set up to access all CPU resources.

How many cameras and what resolution are your camera feeds?

Just one 720p camera (4-5 later). I can assign up to, I think, 13 cores (threads?) and as much RAM as the Frigate machine needs (one core for HA, 2 for Xeoma, which I still need, and 13 are "free") until I get a Coral - it's very hard to get one here in Poland. But it's pointless when Frigate uses only one core. I think @scstraus clearly explained my problem. On the other hand, assuming (per cpubenchmark.net) that an average i7 from 2015 is twice as powerful as my Xeon, I have two of them, so if I manage to use multiple threads it could work until my Coral arrives. I think 0.5 fps is enough for me for now. In fact, I also have a second machine with an E3-1220, but because of a mess with Ubuntu there I couldn't get Frigate to run properly on it.

I doubt you have 13 actual physical cores on an old CPU like that. It sounds like you are talking about vCPUs in your VM, which will actually hurt performance if you assign more than the physical cores you have. Better to assign about the same number of vCPUs as the physical cores you have (you can count hyperthreads as CPU cores). When I used the CPU version a long time ago, it would use as many cores as I gave it, so I don't think that's your issue unless it's a config problem.

It's this CPU: https://ark.intel.com/content/www/us/en/ark/products/40200/intel-xeon-processor-e5520-8m-cache-2-26-ghz-5-86-gt-s-intel-qpi.html - two of them, in fact - I made a mistake in my previous post. So in total, 8 cores / 16 threads, I think.

EDIT:
And Proxmox says it is: 16 x Intel® Xeon® CPU E5520 @ 2.27GHz (2 Sockets)

EDIT2:
I made a short video of the problem - https://streamable.com/in1lvi
The htop is on the Proxmox host, not the VM. Have a look at what happens to the camera's timestamp when I go back inside and there is no motion. I think it could be a motion detection issue, not an object recognition one.
As I said - I know my CPU is old and not made for this. If there is no solution, I will just wait for the Coral :slight_smile:

Okay, fair enough, then you do actually have 16 threads to play with. But it may indeed be that your CPU is just too old and lacks the required features. I tried it on a 2009 Mac mini and couldn't get it to run at all on that CPU; I got missing-CPU-feature errors in the log. But if it's doing something, then it seems like you should be able to throw more cores at it and get it to work better. I'm not sure whether Frigate can use hyperthreading to run 2 separate threads per core, but you should be able to throw some more whole cores at it and get better performance.

The detection runs in a single Python process, so it is limited to a single CPU core. It is a separate process from motion detection and decoding, but a single process. That is because the Coral can only be used by a single process. If you are not using a Coral, you can run a separate container for each camera to use multiple cores for detection (just make sure they have different client IDs for MQTT). 0.5 fps is about what I would expect with a 600 ms detection speed.
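For anyone going the multi-container route, a minimal docker-compose sketch might look like the following (the image tag is the one mentioned later in this thread, but the host paths, ports, and camera names are just assumptions for illustration; each container gets its own config file defining one camera and a unique MQTT client ID):

```yaml
# Sketch only: one Frigate container per camera so each detection process
# gets its own CPU core. Host paths and port mappings are assumptions.
version: "3"
services:
  frigate_front:
    image: blakeblackshear/frigate:dev
    restart: unless-stopped
    volumes:
      # config.yml here defines only the "front" camera and mqtt client_id "frigate_front"
      - ./frigate_front:/config:ro
    ports:
      - "5000:5000"
  frigate_back:
    image: blakeblackshear/frigate:dev
    restart: unless-stopped
    volumes:
      # config.yml here defines only the "back" camera and mqtt client_id "frigate_back"
      - ./frigate_back:/config:ro
    ports:
      - "5001:5000"
```

Keep in mind that each container also decodes its own stream, so the ffmpeg CPU usage scales with the number of containers as well.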

Ah, so I stand corrected. Was I able to use all my cores because I had 4 cameras, or because something changed since the version I was on?

The original CPU version was very different and used a detection process per camera. That isn't possible with the Coral.

same here :slight_smile:

Hi Kyle / @blakeblackshear

Are you able to advise which branch the dev image relates to? The latest I can see on GitHub is 0.5.1-rc4. I run on a Pi, so I need to rebuild.

I have Reolink cameras and it sounds like this would fix my issue too. Hopefully this will allow me to undo my workaround, which was to switch from RTMP to RTSP.

I'm running the blakeblackshear/frigate:dev image.

Specifically this one: https://hub.docker.com/layers/blakeblackshear/frigate/dev/images/sha256-f4f9909c1b4e973008d3f794097f47e4bc2a4a5be4967d234b7c0099f61aca39?context=explore

Thanks, but I mean which code branch was used to build this image. I'm beginning to think this particular dev branch was not pushed to GitHub, which is why I cannot find it.

@blakeblackshear - I have one more question, or rather a feature request. I ran through the docs but couldn't find it. Is it (or will it be) possible to define two areas with two different MQTT topics? What I'm trying to achieve is turning my dumb old PTZ camera into a smart camera with (very) basic human tracking. It should be possible with HA automations: person detected in the left area, call the onvif.ptz service with pan left. Of course it won't be a perfect solution, but I think it could work.
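To illustrate the idea (purely hypothetical, since per-area topics don't exist yet - the topic name and payload below are made up), the HA side could be a simple MQTT-triggered automation calling the ONVIF integration's onvif.ptz service:

```yaml
# Hypothetical sketch: assumes Frigate published a per-area topic such as
# frigate/ptz_cam/left/person with an "ON" payload, which it does not do today.
automation:
  - alias: "Pan PTZ camera toward a person on the left"
    trigger:
      - platform: mqtt
        topic: frigate/ptz_cam/left/person   # made-up topic for illustration
        payload: "ON"                        # assumed payload
    action:
      - service: onvif.ptz                   # provided by the HA ONVIF integration
        data:
          entity_id: camera.ptz_cam          # placeholder entity
          pan: LEFT
```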

Just posting a solution I had in case someone else runs into a similar problem. My Amiccom camera kept erroring out with "Invalid data found when processing input". After a bunch of trial and error, I noticed FFmpeg was listing the resolution as 1088 instead of 1080. When I explicitly set the resolution dimensions in the config, it worked.

Also note that the example config currently has the width and height backwards.
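For reference, here is roughly what I mean (camera name and URL are placeholders, and the exact nesting may differ by Frigate version; the key point is that the height matches what FFmpeg actually reports - 1088 rather than 1080):

```yaml
# Sketch of explicitly setting frame dimensions for a camera whose stream
# reports 1920x1088 instead of 1920x1080. Name, URL, and nesting are assumptions.
cameras:
  driveway:
    ffmpeg:
      input: rtsp://user:pass@192.168.1.64:554/stream1
    width: 1920
    height: 1088   # match what ffmpeg reports, not the advertised 1080
```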

Not possible to define different areas yet, but that is coming in a future release. I have the exact same use case for PTZ. Eventually I plan to add the ability to track a subject with PTZ based on static cameras.

I get a crazy amount of false positives for people. It seems just about every cat, bird, and dog is recognized as a person, frequently at 90%+ confidence. Is the code sometimes mislabeling what it identifies? Is there a different set of pretrained models that only looks for people and vehicles? I know I could eliminate a lot of these false positives by increasing the minimum area, but I like the fact that it can recognize someone 150 feet away. Sorry for all the grief - I really do love it.

If you can capture some sample clips, I can incorporate them into my testing. One of the next things I am going to focus on is reducing false positives. I want to see how much I can reduce them without needing to train custom models.