ZHA restarting and failing as of 2024.4.3: are there any easy-to-use alternatives?

I have a SkyConnect v1.0 and use ZHA.
I just upgraded to 2024.4.3, and ZHA now restarts every few hours because of a watchdog timeout. It then fails to initialize, which makes all Zigbee devices unavailable.

This is pretty serious, because it makes my HA unusable.
Apart from rolling back to the version I used before (2024.3.3), are there any alternatives (other coordinators or something) to get Zigbee operational again?

But why is this happening? I also have SkyConnect and ZHA, and upgrading has been painless.

You really need to know before you start swapping hardware. Have you been through the diagnostic layers?

  • Power supply (important if you’re running on a Raspberry Pi).
  • Local interference (the USB cable supplied with the SkyConnect is not really long enough).
  • Network interference (has your neighbour changed their Wi-Fi channel?).
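
To make the Wi-Fi point concrete, here is a rough sketch of how you could check whether a Zigbee channel sits inside a nearby Wi-Fi channel's band. The centre-frequency formulas are the standard 2.4 GHz channel plans; the function names and the bandwidth figures (22 MHz for Wi-Fi, 2 MHz for Zigbee) are assumptions for illustration, and real-world interference also depends on signal strength and distance.

```python
def zigbee_center_mhz(channel: int) -> int:
    """Centre frequency of a 2.4 GHz Zigbee channel (11-26)."""
    if not 11 <= channel <= 26:
        raise ValueError("Zigbee 2.4 GHz channels are 11-26")
    return 2405 + 5 * (channel - 11)


def wifi_center_mhz(channel: int) -> int:
    """Centre frequency of a 2.4 GHz Wi-Fi channel (1-13)."""
    if not 1 <= channel <= 13:
        raise ValueError("this sketch only handles Wi-Fi channels 1-13")
    return 2407 + 5 * channel


def overlaps(zigbee_channel: int, wifi_channel: int,
             wifi_width_mhz: float = 22.0,
             zigbee_width_mhz: float = 2.0) -> bool:
    """True if the two channels' bands overlap (widths are assumed values)."""
    gap = abs(zigbee_center_mhz(zigbee_channel) - wifi_center_mhz(wifi_channel))
    return gap < (wifi_width_mhz + zigbee_width_mhz) / 2


if __name__ == "__main__":
    # List the Wi-Fi channels that would overlap each Zigbee channel.
    for zb in range(11, 27):
        clashing = [w for w in range(1, 14) if overlaps(zb, w)]
        print(f"Zigbee {zb}: overlaps Wi-Fi channels {clashing}")
```

Under these assumptions, a Zigbee channel such as 15 falls in the gap between Wi-Fi channels 1 and 6, while Zigbee 11 sits inside Wi-Fi channel 1.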

Even if the upgrade is to blame, you need to know why - another integration/coordinator may be just as bad.

Lots of good advice here:

@Stiltjack thanks for the link.
I will go through it, but I’m pretty sure that 2024.4.3 caused this, because that was the only change that has occurred.
In my log I see a watchdog timeout (see below).
I’m not an expert on this, and my HA is pretty much useless now because I need to restart it every few hours.
I have reported the issue already, but of course a fix will take too long.
Getting the knowledge myself will take too long as well, hence my question about an alternative to ZHA.
I guess I will downgrade now and hope this fixes it.

2024-04-21 18:59:34.834 WARNING (MainThread) [bellows.zigbee.application] Watchdog heartbeat timeout: TimeoutError()
2024-04-21 18:59:38.046 ERROR (bellows.thread_0) [bellows.uart] Lost serial connection: ConnectionResetError('Failed to transmit ASH frame after 4 retries')
2024-04-21 18:59:38.051 ERROR (MainThread) [bellows.ezsp] NCP entered failed state. Requesting APP controller restart
2024-04-21 18:59:39.062 WARNING (bellows.thread_0) [homeassistant.util.executor] Thread[SyncWorker_0] is still running at shutdown: File "/usr/local/lib/python3.12/threading.py", line 1030, in _bootstrap
    self._bootstrap_inner()
  File "/usr/local/lib/python3.12/threading.py", line 1073, in _bootstrap_inner
    self.run()
  File "/usr/local/lib/python3.12/threading.py", line 1010, in run
    self._target(*self._args, **self._kwargs)
  File "/usr/local/lib/python3.12/concurrent/futures/thread.py", line 92, in _worker
    work_item.run()
  File "/usr/local/lib/python3.12/concurrent/futures/thread.py", line 58, in run
    result = self.fn(*self.args, **self.kwargs)
  File "/usr/local/lib/python3.12/site-packages/serial/serialposix.py", line 673, in flush
    termios.tcdrain(self.fd)
2024-04-21 18:59:39.963 WARNING (bellows.thread_0) [homeassistant.util.executor] Thread[SyncWorker_0] is still running at shutdown: File "/usr/local/lib/python3.12/threading.py", line 1030, in _bootstrap
    self._bootstrap_inner()
  File "/usr/local/lib/python3.12/threading.py", line 1073, in _bootstrap_inner
    self.run()
  File "/usr/local/lib/python3.12/threading.py", line 1010, in run
    self._target(*self._args, **self._kwargs)
  File "/usr/local/lib/python3.12/concurrent/futures/thread.py", line 92, in _worker
    work_item.run()
  File "/usr/local/lib/python3.12/concurrent/futures/thread.py", line 58, in run
    result = self.fn(*self.args, **self.kwargs)
  File "/usr/local/lib/python3.12/site-packages/serial/serialposix.py", line 673, in flush
    termios.tcdrain(self.fd)
2024-04-21 18:59:55.949 ERROR (SyncWorker_22) [homeassistant] Error doing job: Task was destroyed but it is pending!
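
To see how often this happens, and whether it lines up with anything else, the bellows lines can be pulled out of the log with a few lines of Python. This is only a sketch: the regular expression is written against the line format shown in the log above, the function names are made up for illustration, and the sample is just the first lines of that excerpt.

```python
import re

# Matches lines like:
# 2024-04-21 18:59:34.834 WARNING (MainThread) [bellows.zigbee.application] message
LINE_RE = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\.\d+ "
    r"(WARNING|ERROR) \([^)]*\) \[(bellows[^\]]*)\] (.*)$"
)


def bellows_events(lines):
    """Return (timestamp, level, logger, message) for each bellows warning/error."""
    events = []
    for line in lines:
        match = LINE_RE.match(line.rstrip("\n"))
        if match:
            events.append(match.groups())
    return events


def count_watchdog_timeouts(events):
    """Count events whose message mentions the watchdog heartbeat timeout."""
    return sum(1 for _, _, _, msg in events if "Watchdog heartbeat timeout" in msg)


SAMPLE = [
    "2024-04-21 18:59:34.834 WARNING (MainThread) [bellows.zigbee.application] Watchdog heartbeat timeout: TimeoutError()",
    "2024-04-21 18:59:38.046 ERROR (bellows.thread_0) [bellows.uart] Lost serial connection: ConnectionResetError('Failed to transmit ASH frame after 4 retries')",
    "2024-04-21 18:59:38.051 ERROR (MainThread) [bellows.ezsp] NCP entered failed state. Requesting APP controller restart",
    "2024-04-21 18:59:55.949 ERROR (SyncWorker_22) [homeassistant] Error doing job: Task was destroyed but it is pending!",
]

if __name__ == "__main__":
    events = bellows_events(SAMPLE)
    for timestamp, level, logger, message in events:
        print(timestamp, level, logger, message[:60])
    print("watchdog timeouts:", count_watchdog_timeouts(events))
```

To run it against a real log, pass an open file instead of `SAMPLE`, for example `bellows_events(open("home-assistant.log"))`; the file name and location are assumptions and depend on your installation.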

2024.3.3 contained 22 different changes…

Just downgraded from 2024.4.3 to 2024.3.3 and so far so good.