I keep having an issue where my HAOS system keeps going offline for no apparent reason. What I mean by that is, I can’t connect to the UI with the companion app or web. Alerts/automations don’t happen. I can SSH to the system just fine but I can’t issue any ‘ha core’ commands without them timing out.
I have the syslog add-on so my logs are going to graylog. I did see this which must be relevant:
homeassistant systemd: Started Process Core Dump (PID 216921/UID 0).
homeassistant homeassistant: [02:26:08] INFO: Home Assistant Core finish process exit code 256
homeassistant homeassistant: [02:26:08] INFO: Home Assistant Core finish process received signal 11
I’ve run this on a bare metal HP EliteDesk 800 G2 and had this same issue. I thought maybe it was a hardware issue so I installed proxmox and HAOS on top, according to this guide. I was expecting if there was a hardware issue, proxmox would die too. Well, proxmox is just fine but I still have the HAOS issue.
This most recent time it failed, I did see a log file with a .fault extension (which is odd because in the past this file was never generated). I will attach it here. Also unique to this most recent failure is the terminal/screen of the VM had what appears to be a dump.
I have a hunch that this might be related to my zooz USB stick (using USB pass-through in proxmox) but I have no way to prove it. If it comes down to it, I can remove the stick and let it run but I have a lot of lighting automations that run on zwave. I’m hoping there is something in the logs that would help prove my hunch.
At any rate, any assistance would be greatly appreciated. I’m sure I’m not supplying critical information so please let me know if I can provide anything else of value.