WTH Why is there no standard way of detecting downtime?

I have a relatively large number of integrations and devices, some of which are flaky. Sometimes this is because the batteries in Bluetooth sensors are empty, and sometimes because the data is generated by slightly hacky systems (a USB cable that sometimes fails, a Raspberry Pi gen 1, etc.).

It’s painful that once something stops reporting data, you don’t find out unless you’re looking at a dashboard. It would be nice if there were a standardized, functional way to set up “uptime alarms” for things in Home Assistant. I’d like to receive a push notification when something is broken. I have this for all the computers in my networks, so why not for entities in Home Assistant? In some ways this would fit well with the existing “problems” framework that Home Assistant already has (can I get a push notification for those too?).

Alternatives:

  • I could create an automation for each, but that would mean creating them for at least 100 entities, which is a pain and unworkable.
  • There are logs that also report errors with integrations, but they are very technical and very non-functional. Maybe this needs to be tied in somehow?

Gotchas:

  • This might be noisy, so it needs to be opt-in and easy to silence.

I have something like this set up as an automation. Most of my devices are in groups, so I have them all added to an automation that checks whether a device has been unavailable for 30 minutes and then sends me a notification. The notification also brings me to a dashboard where I can see the status of all devices, so I know what else, if anything, is unavailable/offline.
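
For anyone who wants to see the logic of a check like this spelled out, here is a minimal sketch in plain Python. It is not Home Assistant code and does not use any Home Assistant API: the function name `find_stale_entities`, the shape of the `states` mapping, and the example entity ids are all made up for illustration. It only assumes that each entity has a current state and a timestamp of when that state last changed, and it treats "unavailable" and "unknown" as the "down" states.

```python
from datetime import datetime, timedelta, timezone

# States treated as "down" in this sketch (assumption: these two cover the
# cases you care about; adjust as needed).
DOWN_STATES = {"unavailable", "unknown"}


def find_stale_entities(states, now, threshold=timedelta(minutes=30)):
    """Return the entity ids that have been down for at least `threshold`.

    `states` is a hypothetical mapping of entity_id -> (state, last_changed),
    where last_changed is a timezone-aware datetime.
    """
    stale = []
    for entity_id, (state, last_changed) in states.items():
        if state in DOWN_STATES and now - last_changed >= threshold:
            stale.append(entity_id)
    return sorted(stale)


# Example data: one sensor down for 45 minutes, one down for only 5 minutes,
# and one that is still reporting a value.
now = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
states = {
    "sensor.bt_thermometer": ("unavailable", now - timedelta(minutes=45)),
    "sensor.pi_cpu_temp": ("unavailable", now - timedelta(minutes=5)),
    "sensor.usb_meter": ("21.5", now - timedelta(hours=2)),
}

print(find_stale_entities(states, now))  # ['sensor.bt_thermometer']
```

The 30-minute threshold is what keeps this from being noisy: a sensor that drops out briefly and comes back never shows up in the list, only the ones that stay down do.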