HI All, I am using Homeassistant (core-2022.3.7, supervisor-2022.05.0) on Dockers. Its on RaspBerry Pi4 with SD card.
Since, last couple of weeks I am having a strange problem my HA hangs completely and I cannot access it. The only way is to reboot it. At hang time, I can ping the system but cannot ssh into it.
There is no way i can see the logs as what caused this issue as logs are generally kept for very small time frame and are usually erased when system is booting up.
I was thinking how to troubleshoot or diagnose this issue. This usually happens once in every 24hrs.
good points,
we dont have power outages here
Corrupted SD is possiblity.
I did tried to check using following and found no issues, Do let me know if there are any better ways to check for corruption.
time badblocks -sv /dev/mmcblk1p1 -o mmcblk1p1.log
Checking blocks 0 to 306175
Checking for bad blocks (read-only test): done
Pass completed, 0 bad blocks found. (0/0/0 errors)
i checked the power cable and it appears to be new.
Secondly, I have placed my hardware in secure location so kids cannot touch it.
Even i have to use my kasa smart switch to reboot as physically its complicated to reach
WARNING (MainThread) [homeassistant.bootstrap] Support for the running Python version 3.7.3 is deprecated and will be removed in the first release after December 7, 2020. Please upgrade Python to 3.8.0 or higher.
in your logs?
What do you mean with store logs for longer? You can go to your configuration directory → home-assistant.log and home-assistant.log.1 files
no i dont have anything like this in logs. Mostly, it adb related issue (Nvidia Shield)
yeah thanks for name of logs but I dont usually see what happened around the time when system was hung. Usually these log files contain data after when i reboot something. like 1am in the morning system was not responding hence i did a remote reboot.
hi, i am still having the same issue.
Even though now i am updated to latest version of homeassistant.
I was wondering if there is a way to scan my sd card for any errors.
I tried fsck command but it suggests file system is mounted.
Thanks
Nothing is erased. Ignore the home-assistant.log files in /config, they are irrelevant. That’s just a file where core replicates logs since the last restart for convenience.
Your real log on a system with supervisor is the system journal. It contains everything from all containers and system itself and is persistent across many restarts
I wrote a guide on how to access it. Take a look and start there, hopefully that can give you some insight into what’s going on.
thank you,
i did connected to ssh and tried to get journalctl.
There is a funny thing. I get jounal messages from Feb 14 and then May 12 today nothing in between. Do you think this is normal?
Even the message from today are after the reboot which was around 1724.
Feb 14 10:12:01 HomeAssistant4 kernel: bluetooth hci0: firmware: failed to load brcm/BCM4345C0.hcd (
Feb 14 10:12:01 HomeAssistant4 kernel: bluetooth hci0: firmware: failed to load brcm/BCM.hcd (-2)
Feb 14 10:12:01 HomeAssistant4 kernel: Bluetooth: hci0: BCM: firmware Patch file not found, tried:
Feb 14 10:12:01 HomeAssistant4 kernel: Bluetooth: hci0: BCM: 'brcm/BCM4345C0.hcd'
Feb 14 10:12:01 HomeAssistant4 kernel: Bluetooth: hci0: BCM: 'brcm/BCM.hcd'
Feb 14 10:12:01 HomeAssistant4 systemd[1]: Starting Network Time Synchronization...
Feb 14 10:12:01 HomeAssistant4 systemd[1]: Starting Raise network interfaces...
Feb 14 10:12:01 HomeAssistant4 systemd[1]: Started Update UTMP about System Boot/Shutdown.
Feb 14 10:12:01 HomeAssistant4 systemd-timesyncd[284]: System clock time unset or jumped backwards,
May 12 17:24:17 HomeAssistant4 systemd[1]: Started Network Time Synchronization.
May 12 17:24:17 HomeAssistant4 systemd[1]: Reached target System Initialization.
May 12 17:24:17 HomeAssistant4 systemd[1]: Listening on D-Bus System Message Bus Socket.
May 12 17:24:17 HomeAssistant4 systemd[1]: Starting Home Assistant OS Agent..
No that is very odd, I’ve never seen anything like that. I have tons of stuff in my journal all the time. Logs from every container as well as kernel, audit and all the system stuff go in there so I don’t see how that could be. You aren’t applying a filter there?
Although I just noticed from one of your earlier screenshots it says you are running an unsupported installation. Is the reason related to the journal by any chance? If not what is it?
no i am not applying any filters its just a barebone journalctl command.
I have also reset the journal today (i assuming it was corrupt).
If this does not work out I am going to make it ‘supported install’ on a usb drive (boot disk).
Regards