This “HA just stops working” issue is happening to me as well. It began a couple of months ago I think, and HA stops working about once a week - usually between midnight and 6 am. Because the log starts afresh on a reboot, is there any way of capturing the log so that I can see what might be causing this?
I have a 4GB RPi4 with an SSD (booting from the SSD).
I’ve noticed that on a couple of occasions at least, I’ve had to reboot twice to get a fully working system. After the first reboot, the system at first starts working, but will then lock up after an hour or two (and the RPi4 becomes very warm). A second reboot, and it’s OK (for the next week or so).
I recommend this to anyone having stability issues with their setup.
The package comes with two fans.
A transparent fan with color-cycling RGB LEDs but it is a bit noisy and uses about 40mA @5V. This comes installed on the cooler. There is an additional fan in the package which is black but it is noiseless and uses 20mA @ 5V. I replaced the transparent one with the better black one as you can see in the photo.
I solved my HA from constantly going off. I felt mine was more network based. Noticed that if i reset the router rather than the pi4, it would come back on. I thought because my ethernet was plugged in, that my pi would automatically be using it, but it wasn’t. It was till using the wifi.
So under Supervisor> System> Host> IPAdress> Change Make sure your eth0 ipv4 has a static ip and my ipv6 is set to DHCP.
Your static ip can’t be the same as your wifi one. Then connect to this one. Since doing that my HA has stayed up. 5 days and counting. (If you want your ip the same, just take care when swapping them over. Probably test a new ethernet ip before hand)
I have the same problem with random freezes of the system. There is some temperature spike but not significant, from 35 celcius to 44 celcius before it freezes, last time it happened at around 23:00 hours, close to midnight, times before that I didn’t check. This is the third time now in almost two months.
My setup is RPI 4 8GB with boot from SSD, no SD card installed. I’ve noticed that my KNMI integration was the last thing in the logs that gave errors. I’ve uninstalled the KNMI integration, let’s see if that fixes my freezes.
Network connection is done through cable, not WiFi. I have a case for my RPI 4 with heatsink and small fan that got supplied with it.
The random freezes suck because I want to move my password vault from the Google browser to my HA local instance and connect through VPN to avoid port forward. But for the password vault the be hosted on my RPI I need 99,999% uptime of the system to be able to rely on it.
I was able to capture a CLI trace at the time of a crash. My configuration: Raspberry Pi 4B 4GB, 120GB Intenso SSD with Home Assistant (external), Raspberry Pi 4 Case, USB-C Power Supply 5V 3A.
See graph: power and memory consumption as well as temperature look normal. The temperature actually got down at the crash time:
Here is the picture from the CLI screen; it shows “rebooting in 5 seconds” at the end, but this doesn’t happen. I’m unable to inteprete the CLI log, but maybe you can see some meaningful things to identify the reason of crashing, which happens almost daily.
Just adding a data point here. As I posted earlier in this thread, my HA would stop working randomly. Yesterday it died entirely - wouldn’t boot up again. Discovered that the culprit was either the SSD (a SanDisk M.2 SSD) or the Unitek USB housing for the SSD. One or both of them are now dead.
This may have been what has been causing the troubles earlier.
I’ve replaced both with a Samsung 2.5" SSD and a SATA/USB connector, and restored a full backup from September. Everything is now back up and running, but I do have a gap of historical data from September to yesterday…
You should consider moving your database to a legit server. I’ve had a synology (cheap ~$200) server with 6TB of storage running for 7 years. It contains my database. No issues whatsoever. Not to mention, you can use the server for other things like music, movies, cctv recordings, and personal documents.
I think every HA installation should have the Home Assistant Google Drive Backup addon. Automatically takes the backup every day and uploads it to google drive. Check it out.
You can take a flash drive copy and bring it manually to a site of a choice, but having one of your backup offsite is a basis the commonly used “3-2-1” backup strategy (3 backups, 2 medias, 1 offsite).
For sure, having a single backup on the same machine you backup from is not a great stategy.
This is happening to me at least once a week as well. I am using a Home Assistant Blue running the latest core version and 2022.7.3. This has been happening to me for 2-3 months now.
Everything feels cool to the touch. The heat sink sits on top of a fan so I don’t think there is a cooling issue:
I’m also seeing this with Home Assistant Blue for the last few months, always running the latest versions, but usually skip vX.X.0’s
Similarly, if I disconnect my ethernet physically for ~10 seconds, and reconnect, it comes back online.
I’m able to ping my host IP fine, while it’s down, but can’t access homeassistant.local.
I’m running on Nest Routers/Mesh, with HA connected via a switch. No other device on that switch or in the network has the same issue.
Strangely, mine also seems to go down only on late Wednesday’s/Thursdays (Ireland). Last outage was a few weeks ago.
Perhaps relevant. I can see the ethernet led on the HA blue flashing, and can ping no problem, but all of my automations and zigbee are down, which knocks out my lights. So that’d suggest to me there’s something going on with the container/supervisor/core.
And in terms of usage, If anything I seem to be under utilising odroid
I’ve got my HA’s IP set to static, with a designation in Nest/Google Wifi, which has always been the case for the last year. IPv6 is disabled.
My network settings:
Can’t connect to homeassistant.local sometimes, but can access via Nabu Casa
Disconnecting ethernet, restarting router/network, restarting HA core and/or HA supervisor, did not fix these issues the last time. Only a physical reboot worked (removing power cable for a few seconds)
I have the same problem with my rpi4 on ssd.
I think it has something to do with the bluetooth. I rly want to get this problem fixed because when im on hollydays the people in my house gettin rly pissed by this.
Havent found anything in the logs everything says that this and this is not working anymore but there is no initial thing that triggers this behavior.(at least i didnt saw it)
Did someone solved it already?
Still seems to be an issue. I have Raspby 4 4GB RAM and SSD via USB in a place which keeps just stopping knocking the whole system out. When I boot it up all log files start from scratch. Is there a way to not do that so I could see what latest log entry was before crash? History is also gone every time. At home I have a very similar set up (same Raspberry pi specs) and it never stops. Difference to stopping one is Zigbee stick (conbee vs sonoff) and a slightly different SSD hardware. My case has a cooler for the CPU but not for the SSD section. I have no answer to it yet