Home assistant random offline

Hello,

I recently had the RPI 2 and noticed that sometimes my instance was not available anymore (once in a few days), i also noticed i could not SSH to the host anymore. I was running HA on docker on the rPI.

I started a topic with some people and they suggested me to upgrade the RPI to rpi 4 with 4gb of ram. Also to get an SSD along with it so i could rule out the possibility of the microsd failing.

now I bought the rpi4 and a 500gb ssd, set up HA OS on it (so not using linux + docker anymore but directly installed Hassio). And I noticed the same issue. I plugged in my ssd into my pc because I could see that when I replugged my RPI then the logs would be gone but nothing special was in the logs.

here are the last lines of today:

2021-07-05 03:44:24 WARNING (Thread-9) [pychromecast.socket_client] [Living Room speaker(192.168.0.230):8009] Heartbeat timeout, resetting connection
2021-07-05 03:44:40 ERROR (Thread-9) [pychromecast.socket_client] [Living Room speaker(192.168.0.230):8009] Failed to connect to service ServiceInfo(type='host', data=('192.168.0.230', 8009)), retrying in 5.0s
2021-07-05 11:49:10 WARNING (MainThread) [homeassistant.helpers.entity] Update of sensor.desktop_1bb00jk_msi_z170a_tomahawk_ms_7970_nuvoton_nct6793d_voltages_cpu_vcore is taking over 10 seconds
2021-07-05 13:57:16 WARNING (zeroconf-Engine-242) [zeroconf] Choked at offset 13670 while unpacking b'\x13\x10\x80\x00\x00\x00\x00\x02\x00\x00\x00\x00\x06WP3543\x08_display\x04_tcp\x05local\x00\x00\xff\x00\x01\x06WP3543\x08_display\x04_tcp\x05local\x00\x00!\x00\x01\x00\x00\x00x\x00\x14\x00\x00\x00\x00\x1cR\x06WP3543\x05local\x00\xc0,\x00\x10\x00\x01\x00\x00\x11\x94\x0043container_id={DE05F4D3-8FD9-4170-8A30-CC1E6DE332B1}'
Traceback (most recent call last):
  File "/usr/local/lib/python3.8/site-packages/zeroconf/__init__.py", line 754, in __init__
    self.read_others()
  File "/usr/local/lib/python3.8/site-packages/zeroconf/__init__.py", line 825, in read_others
    domain = self.read_name()
  File "/usr/local/lib/python3.8/site-packages/zeroconf/__init__.py", line 886, in read_name
    length = self.data[off]
IndexError: index out of range
2021-07-05 16:11:41 WARNING (MainThread) [homeassistant.helpers.entity] Update of sensor.desktop_1bb00jk_msi_z170a_tomahawk_ms_7970_nuvoton_nct6793d_voltages_cpu_vcore is taking over 10 seconds
2021-07-05 16:36:02 ERROR (MainThread) [custom_components.hacs] <Plugin custom-cards/viewport-accessibility> GitHub returned 404 for https://api.github.com/repos/custom-cards/viewport-accessibility
2021-07-05 16:36:02 ERROR (MainThread) [custom_components.hacs] custom-cards/viewport-accessibility - GitHub returned 404 for https://api.github.com/repos/custom-cards/viewport-accessibility
2021-07-05 18:22:41 WARNING (MainThread) [homeassistant.helpers.entity] Update of sensor.desktop_1bb00jk_msi_z170a_tomahawk_ms_7970_nuvoton_nct6793d_voltages_cpu_vcore is taking over 10 seconds
2021-07-05 18:23:01 WARNING (MainThread) [homeassistant.components.sensor] Updating openhardwaremonitor sensor took longer than the scheduled update interval 0:00:30
2021-07-05 18:23:41 WARNING (MainThread) [homeassistant.helpers.entity] Update of sensor.desktop_1bb00jk_msi_z170a_tomahawk_ms_7970_nuvoton_nct6793d_voltages_cpu_vcore is taking over 10 seconds

Can I also somewhere on the physical ssd see the system logs? I checked all the docker containers on the SSD but I could not find any logs on there:

These are the addons I have:

These custom components:

I am not doing crazy things with my HA. I have 4 switches linked and 1 light. I run a few automations to auto turn on and off 2 of the switches each day.
I also have some phone trackers for me and my girlfriend. Aswell some data from my PC’s hardware is flowing in to HA.

Any idea what I could do? Might this be an IP Issue?

I had a static IP configured in the rpi2 (using raspi-config), however now I have HA directly installed so raspi config is not possible. I could maybe use DHCP settings of my modem to assign a static ip? But I don’t think this is the issue since the rpi2 had a static ip and had the issue already.

1 Like

UPDATE: I might have found some usefull logs here, i managed to SSH into the core of the HA using a public & private key and checked the journal of the system in the timeframe of which I think it happened. See anything strange here?
https://pastebin.com/vRWQ83fk

It happened again, i already set a static IP address but it keeps happening. Here are all the logs:
Here are all the logs:

Host: Host - Pastebin.com

Core: Core - Pastebin.com

DNS: DNS - Pastebin.com

Here is also the supervisor logs: Supervisor - Pastebin.com
and HA logs: Homeassistant - Pastebin.com

Also randomly had this issue today. Could not open the supervisor / ssh into the root (with this method). Also couldn’t restart the HA from the frontend since it said that /config/configuration.yaml was not found. Had to physically restart the HA again to get it working. I already let my HA reboot every night at 04:00, also using systemmonitor i did not see any issues regarding cpu usage

Bump, anyone that can help me i’m clueless

Happens almost daily at this point.

Happened at 11:29 today, i unplugged my rpi, put it in my linux laptop to look around the filesystem for logs,

Jul 29 11:01:10 homeassistant a489a58ce705[464]: 21-07-29 13:01:10 INFO (MainThread) [supervisor.updater] Fetching update data from https://version.home-assistant.io/stable.json
Jul 29 11:01:10 homeassistant hassos-supervisor[1296]: 21-07-29 13:01:10 INFO (MainThread) [supervisor.updater] Fetching update data from https://version.home-assistant.io/stable.json
Jul 29 11:02:02 homeassistant a489a58ce705[464]: 21-07-29 13:02:02 INFO (MainThread) [supervisor.homeassistant.api] Updated Home Assistant API token
Jul 29 11:02:02 homeassistant hassos-supervisor[1296]: 21-07-29 13:02:02 INFO (MainThread) [supervisor.homeassistant.api] Updated Home Assistant API token
Jul 29 11:03:54 homeassistant a489a58ce705[464]:
Jul 29 11:08:54 homeassistant a489a58ce705[464]:
Jul 29 11:13:55 homeassistant a489a58ce705[464]:
Jul 29 11:15:38 homeassistant 4be2a7a96361[464]: Registered MSG_REQ_POOL_USAGE
Jul 29 11:15:55 homeassistant 4be2a7a96361[464]: Registered MSG_REQ_POOL_USAGE
Jul 29 11:15:55 homeassistant 4be2a7a96361[464]: Could not find child 903 -- ignoring
Jul 29 11:18:55 homeassistant a489a58ce705[464]:
Jul 29 11:23:55 homeassistant a489a58ce705[464]:
Jul 29 11:28:56 homeassistant a489a58ce705[464]:

This was the only thing for the supervisor logs, as you can see nothing relevant. What can i do?

Similar thing is happening here!

Core version core-2021.7.4
Supervisor version supervisor-2021.06.8

And for no apparent reason, it just stops being available. One day after rebooting it… maybe a day and a half.

Where should we look for errors??

Anyone else??

Cheers!

Indeed, i’m clueless at this point and this is really frustrating. I even have an automation set to reboot HA daily at night, but it still occurs frequently

What versions are you running now.

After updating my system, it stopped happening. At all!

Running all latest versions:

Home Assistant OS 6.2
supervisor-2021.06.8
core-2021.8.7

Uh.

The one other thing I did on my setup, was /cleaning/ my router and setting it up from scratch. I used to have doubts about the network/IPs integrity - not anymore.

Hardware related? Maybe trying the exact same system snapchat on a different machine…??