HomeAssistant doesn't work anymore, Supervisor disconnected

Hello,
exact on Christmas Eve my HA does not run anymore. It went all great the last years, but now I can’t find out, how to get the problem away.
My System runs on a RasperryPi 4. At least Once in a month I install the available updates. So this month. So the last version was 2024.11 or 2024.12 (I can’t tell by digit anymore).

Some days ago there was a Notification, that there is a critical problem or something. Checking the “more information” it said: If it occurs the first time, just press “ignore”. So did I. Then the HA restarted (it took a while longer). After that I startet the Update of HA Core. It took very long and after a while, the HA wasn’t reachable anymore:

  • HA-IP:8123 is not reachable.
  • HA-IP:4357 observer Supervisor: Disconnected
    Following logs are written hourly (last line every minute):
2024-12-27 13:00:36.114 INFO (MainThread) [supervisor.resolution.check] Starting system checks with state running
2024-12-27 13:00:36.115 INFO (MainThread) [supervisor.resolution.checks.base] Run check for trust/supervisor
2024-12-27 13:00:36.131 INFO (MainThread) [supervisor.resolution.checks.base] Run check for multiple_data_disks/system
2024-12-27 13:00:36.132 INFO (MainThread) [supervisor.resolution.checks.base] Run check for ipv4_connection_problem/system
2024-12-27 13:00:36.133 INFO (MainThread) [supervisor.resolution.checks.base] Run check for disabled_data_disk/system
2024-12-27 13:00:36.133 INFO (MainThread) [supervisor.resolution.checks.base] Run check for docker_config/system
2024-12-27 13:00:36.134 INFO (MainThread) [supervisor.resolution.checks.base] Run check for dns_server_ipv6_error/dns_server
2024-12-27 13:00:36.185 INFO (MainThread) [supervisor.resolution.checks.base] Run check for pwned/addon
2024-12-27 13:00:36.186 INFO (MainThread) [supervisor.resolution.checks.base] Run check for dns_server_failed/dns_server
2024-12-27 13:00:36.187 INFO (MainThread) [supervisor.resolution.checks.base] Run check for free_space/system
2024-12-27 13:00:36.188 INFO (MainThread) [supervisor.resolution.checks.base] Run check for security/core
2024-12-27 13:00:36.191 INFO (MainThread) [supervisor.resolution.check] System checks complete
2024-12-27 13:00:36.191 INFO (MainThread) [supervisor.resolution.evaluate] Starting system evaluation with state running
2024-12-27 13:00:36.251 ERROR (MainThread) [supervisor.resolution.evaluations.container] Corrupt docker overlayfs detect: 500 Server Error for http+docker://localhost/v1.47/containers/ceccb81960aba903f2260058651f9d6b3bf84f6e397ec50579730ea9bfb54960/json: Internal Server Error ("readlink /mnt/data/docker/overlay2/l/IRR35VG5FRXRQWX5VVUDHHSWK4: structure needs cleaning")
2024-12-27 13:00:36.422 INFO (MainThread) [supervisor.resolution.evaluate] System evaluation complete
2024-12-27 13:00:36.423 INFO (MainThread) [supervisor.resolution.fixup] Starting system autofix at state running
2024-12-27 13:00:36.424 INFO (MainThread) [supervisor.resolution.fixups.store_execute_reset] Reset corrupt Store: 5c53de3b
2024-12-27 13:00:36.838 ERROR (MainThread) [supervisor.utils] Can't remove folder /data/addons/git/5c53de3b: 
2024-12-27 13:00:36.840 INFO (MainThread) [supervisor.store.git] Loading add-on /data/addons/git/5c53de3b repository
2024-12-27 13:00:37.264 ERROR (MainThread) [supervisor.store.git] Integrity check on /data/addons/git/5c53de3b failed: Cmd('git') failed due to: exit code(-11)
  cmdline: git fsck.
2024-12-27 13:00:37.265 INFO (MainThread) [supervisor.resolution.fixup] System autofix complete
s6-supervise watchdog: warning: unable to spawn ./run (waiting 60 seconds): Exec format error

Since there have been same/similiar issues in the forum I could not find a solution for my problem.
I can’t even understand the failure (which is secondary) - I assume it has something to with “Docker” (reading the logs of the Supervisor at 2024-12-27 13:00:36.251). Primarly: How can I fix it (sep by step)?

See item #4 here: How to Troubleshoot Raspberry Pi Crashing.

Thanks for the fast response.
PC Windows Explorer says after inserting the Micro-SD-Card in the SD-Card-Adapter:

  • hassos-boot (F:): Many dtbo-files and the config.txt (for the raspi?)
  • SDHC (E:): To read this I have to do a formatting.

So I can`t reach the files of the Homeassistant?

You should follow #4 not number 5. The logs your provided indicate corrupted files

I did #4. I removed the SD Card, because I thought to read the files on it. #5 I did not read.

After inserting the SD-Card HA started again. Supervisor is healthy and no logs there. The HA UI shows on first sight everything. Tried to make a Backup via UI. It throws an error:

2024-12-28 13:20:36.356 ERROR (MainThread) [homeassistant.components.hassio] Failed to to call /backups/new/full - An error occurred while making backup, check job 'd3d23fdac9e2426fb1ba724be55350ac' or supervisor logs for details
But supervisor Log is empty.

So what now? I copied the files via Windows Explorer folders backup and config.

HomeAssistant Core Version: 2024.11.1

In Terminal after checking HA Core comes the fopllowing error:

So the system is not working correct.

#5 explains how to read your files using Linux reader.

I’m assuming you don’t have a good backup stored separately from your HA system. So, the steps would be to:

  1. recover the config directory and files from your current system
  2. Install a new HA system
  3. Copy the old configuration to the new system.

Check. So I have to go all the way from the scratch with flashing the SD-Card or is there an easy/other way?

But what is the Problem with the docker? could that problem be solved for itself?

Not that I know of.

For future reference: Disaster Recovery Planning.

1 Like