Common problem I’ve seen is that every time there’s a docker update HA throws the “unable to connect” page until the update is completed, then everything is back to normal.
This time, both servers, both operating nearly identically, quit working. The supervisor service isn’t auto starting, and even when manually started, I get the “retry” error connection page for HA.
The systemctl service shows this on one:
Dec 08 23:31:03 server1.newvalleyresources.com hassio-supervisor[34323]: 20-12-09 06:31:03 WARNING (MainThread) [supervisor.misc.tasks] Watchdog found a problem with CoreDNS plugin!
Dec 08 23:31:03 server1.newvalleyresources.com hassio-supervisor[34323]: 20-12-09 06:31:03 ERROR (MainThread) [supervisor.plugins.cli] Can't start cli plugin
Dec 08 23:31:03 server1.newvalleyresources.com hassio-supervisor[34323]: 20-12-09 06:31:03 ERROR (MainThread) [supervisor.misc.tasks] CLI watchdog reanimation failed!
Dec 08 23:31:03 server1.newvalleyresources.com hassio-supervisor[34323]: 20-12-09 06:31:03 ERROR (SyncWorker_0) [supervisor.docker] Image homeassistant/amd64-hassio-multicast not exists for hassio_multicast
Dec 08 23:31:03 server1.newvalleyresources.com hassio-supervisor[34323]: 20-12-09 06:31:03 INFO (MainThread) [supervisor.plugins.dns] Starting CoreDNS plugin
Dec 08 23:31:03 server1.newvalleyresources.com hassio-supervisor[34323]: 20-12-09 06:31:03 ERROR (MainThread) [supervisor.plugins.multicast] Can't start Multicast plugin
Dec 08 23:31:03 server1.newvalleyresources.com hassio-supervisor[34323]: 20-12-09 06:31:03 ERROR (MainThread) [supervisor.misc.tasks] Multicast watchdog reanimation failed!
Dec 08 23:31:03 server1.newvalleyresources.com hassio-supervisor[34323]: 20-12-09 06:31:03 ERROR (SyncWorker_3) [supervisor.docker] Image homeassistant/amd64-hassio-dns not exists for hassio_dns
Dec 08 23:31:03 server1.newvalleyresources.com hassio-supervisor[34323]: 20-12-09 06:31:03 ERROR (MainThread) [supervisor.plugins.dns] Can't start CoreDNS plugin
Dec 08 23:31:03 server1.newvalleyresources.com hassio-supervisor[34323]: 20-12-09 06:31:03 ERROR (MainThread) [supervisor.misc.tasks] CoreDNS watchdog reanimation failed!
And this on the other:
Dec 08 23:32:04 server2 hassio-supervisor[6064]: 20-12-09 06:32:04 ERROR (SyncWorker_4) [supervisor.docker] Image homeassistant/amd64-hassio-audio not exists for hassio_audio
Dec 08 23:32:04 server2 hassio-supervisor[6064]: 20-12-09 06:32:04 ERROR (MainThread) [supervisor.plugins.multicast] Can't start Multicast plugin
Dec 08 23:32:04 server2 hassio-supervisor[6064]: 20-12-09 06:32:04 ERROR (MainThread) [supervisor.misc.tasks] Multicast watchdog reanimation failed!
Dec 08 23:32:04 server2 hassio-supervisor[6064]: 20-12-09 06:32:04 ERROR (MainThread) [supervisor.plugins.audio] Can't start Audio plugin
Dec 08 23:32:04 server2 hassio-supervisor[6064]: 20-12-09 06:32:04 ERROR (MainThread) [supervisor.misc.tasks] PulseAudio watchdog reanimation failed!
Dec 08 23:32:04 server2 hassio-supervisor[6064]: 20-12-09 06:32:04 WARNING (MainThread) [supervisor.misc.tasks] Watchdog found a problem with CoreDNS plugin!
Dec 08 23:32:04 server2 hassio-supervisor[6064]: 20-12-09 06:32:04 INFO (MainThread) [supervisor.plugins.dns] Starting CoreDNS plugin
Dec 08 23:32:04 server2 hassio-supervisor[6064]: 20-12-09 06:32:04 ERROR (SyncWorker_5) [supervisor.docker] Image homeassistant/amd64-hassio-dns not exists for hassio_dns
Dec 08 23:32:04 server2 hassio-supervisor[6064]: 20-12-09 06:32:04 ERROR (MainThread) [supervisor.plugins.dns] Can't start CoreDNS plugin
Dec 08 23:32:04 server2 hassio-supervisor[6064]: 20-12-09 06:32:04 ERROR (MainThread) [supervisor.misc.tasks] CoreDNS watchdog reanimation failed!
Same here. 18.04. Apparently my HA instance stopped working at 2am this morning and nothing I’ve tried has gotten it to come back yet. Just the “Retry” page…
Reading my logfiles I saw that my upgrade included:
Unpacking docker-ce-cli (5:20.10.0~3-0~raspbian-buster) over (5:19.03.12~3-0~raspbian-buster)
Unpacking docker-ce (5:20.10.0~3-0~raspbian-buster) over (5:19.03.12~3-0~raspbian-buster)
Unpacking containerd.io (1.4.3-1) over (1.2.13-2)
(and more, but seems not related)
Do you know if I also have to revert containerd.io ?
Assuming we don’t know how to revert to a previous version of Docker, and assuming we have only the /usr/share/hassio/* folder structure preserved (don’t have snapshot???)…
update:
All docker images of hassio addons were gone. After clicking ‘start’ for each addon in Supervisor the exact name of the missing image was displayed, which I manually downloaded with docker image pull <name>
I got things working on the latest version of docker-ce now.
What I did is restart hass a couple of times using # hassio-supervisor start / stop / status. After a few attempts hass started. Then i restored a snapshot.
It takes some waiting in between, bus after about half an hour things are running again
Downgrading docker-ce package to 5.19 version solved the problem for me.
After that I only had to re download the docker images of the hassio addons I use.
I have been pulling my hairs since last midnight… I always blamed it to new supervisor requirements when i am on ubuntu 18.04 on Intel NUC. I even updated to Ubuntu 20.04. Now i find this, went back to docker 5.19; everything is fine. Thanks a lot guys, you are the best…
You are really great guys. Had the problems with Debian buster since yesterday and did not know how to fix this. But the downupgrade to docker 5.19 worked.