Home Assistant just stops working!

This “HA just stops working” issue is happening to me as well. It began a couple of months ago I think, and HA stops working about once a week - usually between midnight and 6 am. Because the log starts afresh on a reboot, is there any way of capturing the log so that I can see what might be causing this?

I have a 4GB RPi4 with an SSD (booting from the SSD).

I’ve noticed that on a couple of occasions at least, I’ve had to reboot twice to get a fully working system. After the first reboot, the system at first starts working, but will then lock up after an hour or two (and the RPi4 becomes very warm). A second reboot, and it’s OK (for the next week or so).

System Health

version core-2021.9.6
installation_type Home Assistant OS
dev false
hassio true
docker true
user root
virtualenv false
python_version 3.9.6
os_name Linux
os_version 5.10.17-v8
arch aarch64
timezone Europe/Amsterdam
Home Assistant Community Store
GitHub API ok
Github API Calls Remaining 4932
Installed Version 1.15.2
Stage running
Available Repositories 942
Installed Repositories 11
Home Assistant Cloud
logged_in true
subscription_expiration 19 October 2021, 2:00
relayer_connected true
remote_enabled false
remote_connected false
alexa_enabled true
google_enabled true
can_reach_cert_server ok
can_reach_cloud_auth ok
can_reach_cloud ok
Home Assistant Supervisor
host_os Home Assistant OS 6.4
update_channel stable
supervisor_version supervisor-2021.10.0
docker_version 20.10.7
disk_total 109.3 GB
disk_used 20.6 GB
healthy true
supported true
board rpi4-64
supervisor_api ok
version_api ok
installed_addons File editor (5.3.3), Samba share (9.5.1), InfluxDB (4.2.1), Terminal & SSH (9.2.1), Grafana (7.2.0), Z-Wave JS to MQTT (0.26.0), TimescaleDB (1.1.6), DSMR Reader (0.6.0)
Lovelace
dashboards 2
resources 8
views 4
mode storage

I fixed my freeze and stopping problems by installing an Ice Tower Cooler.

This is how my Pi 3B+ processor temperature looked like before installing the Ice Tower Cooler, averaging at 54 °C.

And this is how it looks like after installing the cooler, averaging at 34°C.

The cooler made a big difference in the temperature, which is about -20°C.

This is what it looks like now after installing it.

I recommend this to anyone having stability issues with their setup.

The package comes with two fans.
A transparent fan with color-cycling RGB LEDs but it is a bit noisy and uses about 40mA @5V. This comes installed on the cooler. There is an additional fan in the package which is black but it is noiseless and uses 20mA @ 5V. I replaced the transparent one with the better black one as you can see in the photo.

1 Like

I solved my HA from constantly going off. I felt mine was more network based. Noticed that if i reset the router rather than the pi4, it would come back on. I thought because my ethernet was plugged in, that my pi would automatically be using it, but it wasn’t. It was till using the wifi.

So under Supervisor> System> Host> IPAdress> Change Make sure your eth0 ipv4 has a static ip and my ipv6 is set to DHCP.

Your static ip can’t be the same as your wifi one. Then connect to this one. Since doing that my HA has stayed up. 5 days and counting. (If you want your ip the same, just take care when swapping them over. Probably test a new ethernet ip before hand)

I have the same problem with random freezes of the system. There is some temperature spike but not significant, from 35 celcius to 44 celcius before it freezes, last time it happened at around 23:00 hours, close to midnight, times before that I didn’t check. This is the third time now in almost two months.

My setup is RPI 4 8GB with boot from SSD, no SD card installed. I’ve noticed that my KNMI integration was the last thing in the logs that gave errors. I’ve uninstalled the KNMI integration, let’s see if that fixes my freezes.

Network connection is done through cable, not WiFi. I have a case for my RPI 4 with heatsink and small fan that got supplied with it.

The random freezes suck because I want to move my password vault from the Google browser to my HA local instance and connect through VPN to avoid port forward. But for the password vault the be hosted on my RPI I need 99,999% uptime of the system to be able to rely on it.

I was able to capture a CLI trace at the time of a crash. My configuration: Raspberry Pi 4B 4GB, 120GB Intenso SSD with Home Assistant (external), Raspberry Pi 4 Case, USB-C Power Supply 5V 3A.
See graph: power and memory consumption as well as temperature look normal. The temperature actually got down at the crash time:


Here is the picture from the CLI screen; it shows “rebooting in 5 seconds” at the end, but this doesn’t happen. I’m unable to inteprete the CLI log, but maybe you can see some meaningful things to identify the reason of crashing, which happens almost daily.

Thanks.

That’s a kernel crash. I cannot really tell more…

it’s a kernel crash in the dlls drivers for network / bluetooth coms

That does not look like a network driver issue, to me that looks like a bug in the CPU scheduler

Just adding a data point here. As I posted earlier in this thread, my HA would stop working randomly. Yesterday it died entirely - wouldn’t boot up again. Discovered that the culprit was either the SSD (a SanDisk M.2 SSD) or the Unitek USB housing for the SSD. One or both of them are now dead.

This may have been what has been causing the troubles earlier.

I’ve replaced both with a Samsung 2.5" SSD and a SATA/USB connector, and restored a full backup from September. Everything is now back up and running, but I do have a gap of historical data from September to yesterday…

You should consider moving your database to a legit server. I’ve had a synology (cheap ~$200) server with 6TB of storage running for 7 years. It contains my database. No issues whatsoever. Not to mention, you can use the server for other things like music, movies, cctv recordings, and personal documents.

I think every HA installation should have the Home Assistant Google Drive Backup addon. Automatically takes the backup every day and uploads it to google drive. Check it out.

1 Like

That seems like a step backwards for a system designed to be independent of the internet.

You can take a flash drive copy and bring it manually to a site of a choice, but having one of your backup offsite is a basis the commonly used “3-2-1” backup strategy (3 backups, 2 medias, 1 offsite).

For sure, having a single backup on the same machine you backup from is not a great stategy.

Awesome suggestion, there is also GitHub - ikifar2012/remote-backup-addon: Automatically create and backup HA snapshots using SCP for those (like me) that cringe when there isn’t a self-hosted option.

This is happening to me at least once a week as well. I am using a Home Assistant Blue running the latest core version and 2022.7.3. This has been happening to me for 2-3 months now.

Everything feels cool to the touch. The heat sink sits on top of a fan so I don’t think there is a cooling issue:

I’m also seeing this with Home Assistant Blue for the last few months, always running the latest versions, but usually skip vX.X.0’s
Similarly, if I disconnect my ethernet physically for ~10 seconds, and reconnect, it comes back online.
I’m able to ping my host IP fine, while it’s down, but can’t access homeassistant.local.

I’m running on Nest Routers/Mesh, with HA connected via a switch. No other device on that switch or in the network has the same issue.

Strangely, mine also seems to go down only on late Wednesday’s/Thursdays (Ireland). Last outage was a few weeks ago.

Perhaps relevant. I can see the ethernet led on the HA blue flashing, and can ping no problem, but all of my automations and zigbee are down, which knocks out my lights. So that’d suggest to me there’s something going on with the container/supervisor/core.

And in terms of usage, If anything I seem to be under utilising odroid :sweat_smile:

I’ve got my HA’s IP set to static, with a designation in Nest/Google Wifi, which has always been the case for the last year. IPv6 is disabled.
My network settings:

Do you have http entry in your config file? I don’t have one and wondering if that has anything to do with the issue.

@alexcapone I don’t have http in my configuration.yml, never have.

Since posting, I’ve had about three instances of home assistant acting strange:

  1. Speedtest.net sensor reports ~5mb download speeds (Usually >450mb)
  2. Can’t connect to homeassistant.local sometimes, but can access via Nabu Casa
  3. Disconnecting ethernet, restarting router/network, restarting HA core and/or HA supervisor, did not fix these issues the last time. Only a physical reboot worked (removing power cable for a few seconds)

I have the same problem with my rpi4 on ssd.
I think it has something to do with the bluetooth. I rly want to get this problem fixed because when im on hollydays the people in my house gettin rly pissed by this.
Havent found anything in the logs everything says that this and this is not working anymore but there is no initial thing that triggers this behavior.(at least i didnt saw it)
Did someone solved it already?

Still seems to be an issue. I have Raspby 4 4GB RAM and SSD via USB in a place which keeps just stopping knocking the whole system out. When I boot it up all log files start from scratch. Is there a way to not do that so I could see what latest log entry was before crash? History is also gone every time. At home I have a very similar set up (same Raspberry pi specs) and it never stops. Difference to stopping one is Zigbee stick (conbee vs sonoff) and a slightly different SSD hardware. My case has a cooler for the CPU but not for the SSD section. I have no answer to it yet :frowning: