Recorder queue reached the maximum size of

bartgrefte · October 31, 2021, 8:20am

This morning I noticed that the energy dashboard had stopped recording data at 2:51 AM. The RPi sending the energy data was still doing its thing, so the problem wasn’t there. A quick look in the HASS OS (2021.9.7) log turned up this:

“ERROR (MainThread) [homeassistant.components.recorder] The recorder queue reached the maximum size of 30000; Events are no longer being recorded”

Since I could not find anything about this error in the docs and Google didn’t help either, I did a reboot and now the energy dashboard is picking up data again.

Do I have to reboot HASS OS every time this error appears?

ricc · October 31, 2021, 8:37am

I had the exact same thing around the same time on HASS OS 2021.10.6.
Maybe it had to do with the shift from summer time to winter time?

_Mitch07 · October 31, 2021, 9:02am

Seems like more people are experiencing this problem.

github.com/home-assistant/core

High CPU usage since summer to winter time change

opened 01:13AM - 31 Oct 21 UTC

chneau

### The problem

In UK 2021/10/31, at 01:59:59, time got back to 01:00:00 (summer to winter, Daylight saving), since then (it's 01:08) home-assistant has a high CPU usage, using a core at 100%.

```
CONTAINER ID   NAME   CPU %     MEM USAGE / LIMIT     MEM %   NET I/O   BLOCK I/O        PIDS
42985e0497d4   hass   104.53%   251.7MiB / 7.658GiB   3.21%   0B / 0B   103MB / 1.77MB   15
```

Edit: memory usage seems to increase quickly: at 01:14:00

```
CONTAINER ID   NAME   CPU %     MEM USAGE / LIMIT     MEM %   NET I/O   BLOCK I/O        PIDS
42985e0497d4   hass   104.93%   703.1MiB / 7.658GiB   8.97%   0B / 0B   112MB / 1.98MB   16
```

Edit2: Switching lights work fine but it does not appear on the state history of the light.

### What version of Home Assistant Core has the issue?

core-2021.10.6

```
REPOSITORY                     TAG      IMAGE ID       CREATED       SIZE
homeassistant/home-assistant   stable   e0a45773808a   12 days ago   1.14GB
```

I could not find the exact image id on docker hub, but here is the label section of `docker inspect`

```
"io.hass.arch": "amd64",
"io.hass.base.arch": "amd64",
"io.hass.base.image": "homeassistant/amd64-base:3.14",
"io.hass.base.name": "python",
"io.hass.base.version": "2021.09.1",
"io.hass.type": "core",
"io.hass.version": "2021.10.6",
"org.opencontainers.image.authors": "The Home Assistant Authors",
"org.opencontainers.image.created": "2021-10-18 06:34:53+00:00",
"org.opencontainers.image.description": "Open-source home automation platform running on Python 3",
"org.opencontainers.image.documentation": "https://www.home-assistant.io/docs/",
"org.opencontainers.image.licenses": "Apache License 2.0",
"org.opencontainers.image.source": "https://github.com/home-assistant/core",
"org.opencontainers.image.title": "Home Assistant",
"org.opencontainers.image.url": "https://www.home-assistant.io/",
"org.opencontainers.image.version": "2021.10.6"
```

### What was the last working version of Home Assistant Core?

_No response_

### What type of installation are you running?

Home Assistant Container

### Integration causing the issue

_No response_

### Link to integration documentation on our website

_No response_

### Example YAML snippet

_No response_

### Anything in the logs that might be useful for us?

Interesting `The recorder queue reached the maximum size of 30000`

```txt
2021-10-30T09:40:10.884156228Z 2021-10-30 10:40:10 WARNING (MainThread) [homeassistant.components.websocket_api.http.connection] [139778345277952] Disconnected: Did not receive auth message within 10 seconds
2021-10-30T09:40:22.323961416Z 2021-10-30 10:40:22 WARNING (MainThread) [homeassistant.components.webhook] Received message for unregistered webhook c9fa7b5955dcce6df0ec16e14a28b23623563b96373bc5a66c0413c418093008 from 192.168.1.117
2021-10-31T01:03:30.660640416Z 2021-10-31 01:03:30 ERROR (MainThread) [homeassistant.components.recorder] The recorder queue reached the maximum size of 30000; Events are no longer being recorded
2021-10-31T01:04:57.487128770Z [cont-finish.d] executing container finish scripts...
2021-10-31T01:04:57.489476430Z [cont-finish.d] done.
```

at `2021-10-31T01:03:30.660640416Z` I restarted the container to see if it could fix the issue, it did not.

### Additional information

Maybe after 02:00:00 it will stop? Everything is working properly: light switches, the mobile phone app is working properly, the website served by the container (server:8123) is working properly. Restarting the container or restarting the PC does not solve the high CPU usage

Arakon · October 31, 2021, 9:32am

Same here, running Supervised on 2021.10.6 with MariaDB.

orbsmiv · October 31, 2021, 9:45am

Same issue here, running HA 2021.10.6 in Docker with PostgreSQL database backend. I also assumed an issue moving from British Summer Time to GMT.

jojolll · October 31, 2021, 10:00am

Oh, I’ve been trying to figure out for the last 10 minutes why I have a load average of 3 and a core that is always at 100% lol.
As in your case, the recorder logged this error at 2:03 am.

I can’t post screenshots as a new user, but I did observe:

That the Pi’s temperature rose sharply just before it stopped recording data.
That it was the core container that was consuming 100% of at least one CPU core.
That in this container it was the command python3 -m homeassistant --config /config that consumed all this CPU.
That a reboot of the core seems to solve the problem.

Mike_B · October 31, 2021, 10:06am

Yes, same here. Perhaps we have to schedule a core reboot after every daylight saving time change?

erik3 · October 31, 2021, 11:29am

Same here. The recorder crashed with the error message above, and the CPU was under constant load from then on. Rebooting core solved the issue. Amazing how something like this has been overlooked and not fixed. This can’t be the first DST change for Home Assistant.

Yes, as usual the users have to fix basic stuff in Home Assistant themselves. Same thing with the (non-existent) log rotation, which lets the log keep filling the SD card at 500 MB/day.

francisp · October 31, 2021, 11:35am

It did not happen last year, or the year before.

Tryfos · October 31, 2021, 12:24pm

Unbelievable. A simple DST change broke everything.
I have the exact same error in the logs but slightly different behavior: when I noticed the error, CPU usage wasn’t spiking; only RAM usage was very high (I keep the recorder database in RAM). I don’t recall the same thing happening last year when the time changed.

gordonpm · October 31, 2021, 12:36pm

Same here. The entire HA system crashed (running the pre-built image on a VMware VM). I had to hard-boot the VM.
Seems to be OK now. It didn’t happen last year.

grimmaldus · October 31, 2021, 1:59pm

Yeah, same here; however, I see no excess CPU usage.

Mike_B · October 31, 2021, 2:44pm

Not the only one having issues

Mike_B · October 31, 2021, 2:45pm

Erik, how do you solve this issue?

mobile.andrew.jones · October 31, 2021, 2:53pm

It looks likely, though not confirmed, that the problem was caused by a fault in the code that handles the time pattern trigger in automations. My log is filled with thousands of entries complaining that automations could not be started because they were already running (single mode). Each affected automation was started about 20 times every second between the “new” 1am and 2am. Once it reached 2am, things returned to normal.
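The “repeat hour” can be illustrated with Python’s standard `zoneinfo` module. This is only a sketch of why wall-clock times between 01:00 and 02:00 were ambiguous that night; it is not Home Assistant’s trigger code, and the `Europe/London` zone and the helper name `is_ambiguous` are assumptions chosen for illustration.

```python
from datetime import datetime, timezone
from zoneinfo import ZoneInfo  # Python 3.9+, needs system tz data or the tzdata package

LONDON = ZoneInfo("Europe/London")  # assumed zone, matching the UK report in this thread


def is_ambiguous(naive_local: datetime, tz: ZoneInfo) -> bool:
    """Return True if this wall-clock time occurs twice in tz (clocks set back)."""
    first = naive_local.replace(tzinfo=tz, fold=0)
    second = naive_local.replace(tzinfo=tz, fold=1)
    # In the repeated hour the first pass has the larger (summer time) offset.
    return first.utcoffset() > second.utcoffset()


# 01:30 on 31 October 2021 happens twice: once in BST, once in GMT.
wall_clock = datetime(2021, 10, 31, 1, 30)
first_pass = wall_clock.replace(tzinfo=LONDON, fold=0).astimezone(timezone.utc)
second_pass = wall_clock.replace(tzinfo=LONDON, fold=1).astimezone(timezone.utc)

print(is_ambiguous(wall_clock, LONDON))   # inside the repeated hour
print(first_pass.isoformat())             # first occurrence, in UTC
print(second_pass.isoformat())            # second occurrence, one hour later in UTC
print(is_ambiguous(datetime(2021, 10, 31, 3, 30), LONDON))  # outside the repeated hour
```

Any scheduler that compares local wall-clock times without tracking which of the two passes it is in can misjudge whether a trigger time is in the past or the future. Whether that is what happened here is, as said, not confirmed.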

Because the recorder now stores traces for automations, it tried to store every attempted run, and clearly it could not keep up with the requests. We all have the same error: 30,000 queued events, after which the recorder gives up and doesn’t attempt to reconnect.
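A minimal sketch of this failure mode, assuming a producer that outpaces the database writer. The class `BoundedRecorder` and its methods are hypothetical and are not Home Assistant’s recorder implementation; only the limit of 30000 comes from the log message.

```python
from collections import deque

MAX_QUEUE_SIZE = 30000  # limit quoted in the error message


class BoundedRecorder:
    """Hypothetical recorder: buffers events and gives up once the backlog is too large."""

    def __init__(self, max_size: int = MAX_QUEUE_SIZE) -> None:
        self.max_size = max_size
        self.queue = deque()
        self.recording = True

    def enqueue(self, event: str) -> bool:
        """Queue an event; return False if it was not recorded."""
        if not self.recording:
            return False
        if len(self.queue) >= self.max_size:
            # Backlog hit the limit: stop recording from now on.
            self.recording = False
            return False
        self.queue.append(event)
        return True

    def drain(self, batch: int) -> int:
        """Simulate the database writer committing up to `batch` events."""
        written = min(batch, len(self.queue))
        for _ in range(written):
            self.queue.popleft()
        return written


# Writer keeps up: the limit is never reached.
healthy = BoundedRecorder(max_size=5)  # tiny limit to keep the demo short
for i in range(8):
    healthy.enqueue(f"event-{i}")
    healthy.drain(1)
print(healthy.recording)

# Writer stalled: the sixth event trips the limit and later events are lost.
stalled = BoundedRecorder(max_size=5)
results = [stalled.enqueue(f"event-{i}") for i in range(8)]
print(results)
print(stalled.recording)
```

Nothing in this sketch ever sets `recording` back to `True`, which mirrors the observation in this thread that only a restart brought recording back.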

My log file was 127 MB.

( Same reply to @gordonpm )

finity · October 31, 2021, 4:03pm

But the db handling has been recently mucked with and long term stats were added since last year.

I’m not saying it’s the reason but it’s very possibly related.

Vorta · October 31, 2021, 5:51pm

I’ve found this thread because I have the same problem. All of my HomeAssistant graphs stopped being recorded when the clocks moved. Is there any solution for this yet?

gordonpm · October 31, 2021, 5:57pm

Rebooting HA seems to fix it for people.

Jonah1970 · October 31, 2021, 8:26pm

Same here … a reboot fixed the problem for me, but the size of my database increased dramatically when I restarted HA at 10:00.

mobile.andrew.jones · October 31, 2021, 8:52pm

If, like a lot of us, you had automations that ran 20+ times a second for the entire repeat hour (my log has complaints about an automation trying to run while it was already running, 286 thousand times), that will be a lot of additional entries in the database.
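A rough back-of-the-envelope check of these numbers, using only figures quoted in this thread. Spreading the complaints evenly over the hour, and a completely stalled writer fed by a single automation, are simplifying assumptions.

```python
REPEAT_HOUR_SECONDS = 60 * 60   # the repeated hour between the "new" 1am and 2am
ATTEMPTS_PER_SECOND = 20        # per affected automation, as reported in this thread
LOGGED_COMPLAINTS = 286_000     # "already running" log entries reported above
QUEUE_LIMIT = 30_000            # from the recorder error message

# Attempts by one automation over the whole repeated hour at 20 per second.
attempts_per_automation = ATTEMPTS_PER_SECOND * REPEAT_HOUR_SECONDS

# Average complaint rate if the 286 thousand entries were spread evenly (assumption).
avg_complaints_per_second = LOGGED_COMPLAINTS / REPEAT_HOUR_SECONDS

# Time for one automation to fill the queue if nothing is written at all (assumption).
minutes_to_fill_queue = QUEUE_LIMIT / ATTEMPTS_PER_SECOND / 60

print(attempts_per_automation)           # 72000
print(round(avg_complaints_per_second))  # 79
print(minutes_to_fill_queue)             # 25.0
```

With several automations misfiring at once, the queue would fill much faster than that, which fits the reports of the error appearing within minutes of the clock change.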