HAOS hangs intermittently

For some time now my HAOS installation has been hanging. I can't access it through the UI or the companion app. The only way to get it running again is to power-cycle the Raspberry Pi. It's a large install (100+ Zigbee devices, some hundred automations) that I've been running successfully for 5+ years.

While the Lovelace UI is not accessible, I can still access the observer URL as well as the web UI for AppDaemon.
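To timestamp exactly when the UI stops answering while the observer stays up, a small probe run from another machine could log both. This is only a rough sketch: the ports (8123 for the frontend, 4357 for the observer) are the usual defaults and are assumptions about this install, and the `classify` labels are my own naming, not anything Home Assistant reports.

```python
import http.client
import urllib.error
import urllib.request
from datetime import datetime, timezone

# Assumed defaults, adjust to your install.
UI_PORT = 8123        # usual Home Assistant frontend port
OBSERVER_PORT = 4357  # usual HAOS observer port


def responds(url: str, timeout: float = 5.0) -> bool:
    """Return True if the server sends back any HTTP response within timeout."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        return True   # the server answered, just not with a 2xx status
    except (OSError, http.client.HTTPException):
        return False  # refused, timed out, unreachable, or a broken reply


def classify(ui_ok: bool, observer_ok: bool) -> str:
    """Map the two probe results to a rough, hypothetical diagnosis label."""
    if ui_ok and observer_ok:
        return "ok"
    if observer_ok:
        return "core-hung"         # OS and Supervisor answer, Core does not
    if ui_ok:
        return "observer-down"
    return "host-unreachable"      # whole machine or the network is down


def check(host: str) -> str:
    """Probe both URLs once and return a timestamped log line."""
    state = classify(
        responds(f"http://{host}:{UI_PORT}/"),
        responds(f"http://{host}:{OBSERVER_PORT}/"),
    )
    stamp = datetime.now(timezone.utc).isoformat(timespec="seconds")
    return f"{stamp} {host} {state}"
```

Calling `check("homeassistant.local")` once a minute (from cron, for example) and appending the result to a file would give timestamps to line up against the Core and host logs. The hostname here is also just an assumed default.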

This has happened 1-4 times per week for perhaps 2 months, and I've spent quite some time trying to track it down. I suspected my hardware, so I recently replaced the entire setup; it is now a Raspberry Pi 5 8GB with an M.2 HAT and a 1 TB NVMe drive. It ran really well for 4 days, and today it crashed as well.

Some facts:

  • Raspberry Pi 5 8GB
  • Pimoroni NVMe Base
  • Kingston KC3000 1TB
  • Core 2025.2.5
  • Supervisor 2025.02.4
  • Operating System 14.2
  • Frontend 20250221.0
  • SONOFF_Zigbee_3.0_USB_Dongle_Plus_V2
  • Tellstick Duo

One error I’ve found; unsure if it’s related:
/config/home-assistant.log.fault

Fatal Python error: Segmentation fault
Thread 0x0000007f44d262e8 (most recent call first):
  File "/usr/local/lib/python3.13/concurrent/futures/thread.py", line 90 in _worker
  File "/usr/local/lib/python3.13/threading.py", line 992 in run
  File "/usr/local/lib/python3.13/threading.py", line 1041 in _bootstrap_inner
  File "/usr/local/lib/python3.13/threading.py", line 1012 in _bootstrap
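If the fault file grows to contain several dumps or threads, it may help to pull out just the innermost frame of each thread. Below is a minimal sketch that assumes the file uses the standard Python `faulthandler` layout shown above ("most recent call first"); the function name `top_frames` is my own.

```python
import re

# Header line of each thread section in a faulthandler dump.
THREAD_RE = re.compile(r"^(?:Current thread|Thread) (0x[0-9a-fA-F]+)")
# One stack frame: File "<path>", line <n> in <function>
FRAME_RE = re.compile(
    r'^\s*File "(?P<path>[^"]+)", line (?P<line>\d+) in (?P<func>\S+)'
)


def top_frames(dump: str) -> list[tuple[str, str, int, str]]:
    """Return (thread_id, path, line, func) for the innermost frame per thread."""
    result = []
    thread = None
    for raw in dump.splitlines():
        header = THREAD_RE.match(raw)
        if header:
            thread = header.group(1)
            continue
        frame = FRAME_RE.match(raw)
        if frame and thread is not None:
            result.append(
                (thread, frame["path"], int(frame["line"]), frame["func"])
            )
            thread = None  # frames are listed most recent first; keep only that one
    return result


if __name__ == "__main__":
    # Path as seen from inside the Home Assistant container; adjust as needed.
    with open("/config/home-assistant.log.fault", encoding="utf-8") as fh:
        for thread_id, path, line, func in top_frames(fh.read()):
            print(f"{thread_id}: {func} ({path}:{line})")
```

For the dump above this yields a single entry pointing at `_worker` in `concurrent/futures/thread.py`, which only says the crash happened inside an executor worker thread, not which integration submitted the job.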

This looks similar to the issue described in https://community.home-assistant.io/t/never-seen-this-before-home-assistant-log-fault/849121

I’ve found other posts describing similar issues, but the suggested fixes, e.g. replacing the PSU, don’t seem to apply in my case, since I’ve already replaced all the hardware.

How do I best track this down?

Try starting here: The Home Assistant Cookbook - Index.

See the following:

How to Troubleshoot Raspberry Pi Crashing.

2024.5+: Tracking down instability issues caused by integrations.