Also restart your router.
From HA's point of view there is no real difference between a local and an internet site.
It is most likely your router that is the issue.
The reason a restart of HA had an effect is that it gave the router a chance to clear some of its stored connection states.
I run the OVA image on Unraid, so yes, it's docker inside but not my own build.
The host machine (Unraid) hosts other services such as Teslamate and those services remain connected and fully operational.
Soft restarting HA does not address the issue; a complete VM reboot does fix it.
I do not have a "router" as such, at least not in the sense I think you're referring to.
I run a pfSense firewall and have my cable box in "modem" mode.
ISP link and DNS servers are monitored and neither has outages reported.
When HA is in this state I can still reach it via both Nabu Casa and Cloudflare tunnels, so surely only a part of it is struggling with connectivity.
If I run the "terminal" plug-in, that container can always resolve names and reach the internet.
I'm not familiar with the internal docker architecture, but if this fault recurs, which container should I grab a shell on to test?
Nice to see someone else reporting this. I thought maybe I had something wrong in my setup all of a sudden. I'm also seeing this since 2025.3.0...
I'm running the HA docker container on an Ubuntu server. I have done so for several years, but as of 2025.3.x I get timeout issues for cloud services. This includes losing Nabu Casa cloud access for me as well. I've tried all the common network tools to troubleshoot this, but I can't see anything obviously wrong from those diagnostics. Pinging Nabu Casa is no use; it seems they don't allow pinging their server. But "nc -vz eu-central-1-3.ui.nabu.casa 443" returned "eu-central-1-3.ui.nabu.casa (3.125.89.246:443) open" while the issues were present. Also, all services listed under "settings - system - repairs - system information" were shown with timeout errors. That's: HACS, AccuWeather, Hass Cloud, Easee charger, Spotify...
Reverting to pre-2025.3.x images works fine.
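The nc check above only shows that a TCP connection to port 443 could be opened; it does not tell you how long name resolution took or whether resolution works at all from inside the affected container. As a rough sketch (not anything Home Assistant ships), the two stages could be timed separately with Python, which is available in the homeassistant container. The function name `probe` and the 5 second timeout are my own illustrative choices. Note that `socket.getaddrinfo` has no timeout of its own, so a stalled resolver will make the call hang rather than fail quickly.

```python
import socket
import time


def probe(host, port=443, timeout=5.0):
    """Time DNS resolution and TCP connect separately for one host.

    Hypothetical helper for illustration; returns a dict describing
    which stage (if any) failed.
    """
    result = {"host": host, "port": port, "addresses": [],
              "dns_seconds": None, "tcp_seconds": None, "error": None}

    # Stage 1: name resolution (uses the resolver configured in this container).
    start = time.monotonic()
    try:
        infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    except socket.gaierror as exc:
        result["error"] = f"dns: {exc}"
        return result
    result["dns_seconds"] = time.monotonic() - start
    result["addresses"] = sorted({info[4][0] for info in infos})

    # Stage 2: TCP connect, roughly what `nc -vz host port` tests.
    start = time.monotonic()
    try:
        with socket.create_connection((host, port), timeout=timeout):
            result["tcp_seconds"] = time.monotonic() - start
    except OSError as exc:
        result["error"] = f"tcp: {exc}"
    return result

# Example call (performs real network access when run):
#   print(probe("eu-central-1-3.ui.nabu.casa"))
```

If the DNS stage is slow or fails while the TCP stage is fine for a literal IP address, that would point at the resolver inside the container rather than at the firewall or ISP link.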
A router is a device sitting on two or more networks and moving data between them.
Unless you have public IP addresses on all your internal devices, you have a router in the traditional sense.
I did describe part of my solution: my L3 device is a pfSense firewall connecting to my ISP through their device, which is no longer a "router".
I am trying to explain that I am not using poor firewall software provided by my ISP which needs restarting frequently to be reliable - and that which I am using is working properly, is robust and was online throughout the issue I described.
We're not here to deliver an amusing (to some) display of network pedantry so I'll leave it.
Both my tunnels worked properly throughout the issue, so it's not that the guest (or host) machine lost reachability in any total way. I suspect the docker networking within HA is misbehaving, but I don't have sufficient understanding of that architecture to look more deeply into it without some assistance.
For example, I do not know why the Cloudflare container continues to be reliable when others are not.
What specs do you have on that pfSense firewall?
A common mistake is going too low on RAM and then actually hitting OOM, which means no new connections can be made. This will typically only last a few minutes until some other connection states are cleared, but it might then occur again soon thereafter.
Right, so I am on 2025.3.3 in docker on Debian and have 0 issues with the 1 cloud connected service that I have.
But in your opening post it also seems to have trouble connecting to local IP addresses?
(might be startup problems that cleared after a while?)
I am curious though.
Soft restarting HA does not address the issue; a complete VM reboot does fix it.
1: How did you upgrade?
2: Did you restart the VM after upgrading?
3: Did the problems start immediately?
I probably won't be giving you the fix for this. But it does seem somewhat host OS related.
Perhaps the upgrade changed some firewalld rules or something, but as you rebooted, most of the useful logs are probably gone (dmesg, syslog, route table, that kind of thing).
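If the fault comes back, it may be worth saving that kind of state before rebooting. A minimal sketch, assuming you have a shell with Python somewhere on the affected system: the helper name `snapshot`, the output directory and the command list are all placeholders of mine, and some commands (dmesg in particular) may be missing or need extra privileges depending on where you run it.

```python
import subprocess
import time
from pathlib import Path

# Commands to capture; an assumption, adjust to what exists on your system.
DEFAULT_COMMANDS = {
    "route": ["ip", "route"],
    "addr": ["ip", "addr"],
    "dmesg": ["dmesg"],
}


def snapshot(out_dir, commands=DEFAULT_COMMANDS, timeout=15):
    """Run each command and save its output as <out_dir>/<stamp>-<name>.txt.

    Commands that are missing or time out are recorded in the file
    instead of aborting the whole snapshot.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    stamp = time.strftime("%Y%m%d-%H%M%S")
    written = []
    for name, argv in commands.items():
        try:
            proc = subprocess.run(argv, capture_output=True, text=True,
                                  timeout=timeout)
            body = proc.stdout + proc.stderr
        except (OSError, subprocess.TimeoutExpired) as exc:
            body = f"could not run {argv}: {exc}\n"
        path = out / f"{stamp}-{name}.txt"
        path.write_text(body)
        written.append(path)
    return written
```

Calling something like `snapshot("/config/netdebug")` while the integrations are failing, and again after the reboot, would give two sets of files to diff. The `/config/netdebug` path is only an example; any directory that survives the reboot would do.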
I press the upgrade buttons in HA when they're offered. I don't do anything special.
I don't then go and specifically restart the VM; that tends to happen only when the OS gets a release too. Again, I accept those updates when offered (did one yesterday!).
Well, not immediately: the first issue began at approx. 1am, whereas the update was done interactively while I was awake!
I did also spot the local services not responding, but I don't have any other logs to show what / why etc. Some of them are LED drivers which are powered down often, so I tend to ignore those.
Interestingly though, logs do show that HA was unable to communicate with my Plex server, which is also hosted on the same machine (Unraid hosted docker in that case, while HA is running HAOS in a VM). Those packets never left the (physical) NIC.
I just noticed that you wrote, about a day back, that you run HAOS, but I think you got us a bit confused when you wrote earlier that you run docker.
HAOS v15 seems to have issues. With RPis it seems to hit the USB drivers, and with other installations it seems to cause higher CPU and/or RAM usage.
I do not know how it affects Unraid though, but maybe the network card is wired in a way so that it is affected too.
All while the integrations continue to fail to connect.
Maybe I don't get the network stack inside HA, but I think I've shown that the homeassistant container can in fact still communicate while some parts of it still can't?
I'm having a similar issue, running HAOS bare metal on a Dell OptiPlex.
Many previously fine integrations are now failing to load, and connections to the machine seem sluggish, as does any navigation.
Error from the logs for solar forecast
Logger: metno
Source: components/met/coordinator.py:81
First occurred: 9:52:29 PM (40 occurrences)
Last logged: 10:50:04 PM
Otherwise try to start HA in fail safe mode, which disables all the custom components. Then go and disable them manually, and start HA again. Once up and running, hopefully without any sluggish behaviour both in fail safe and normal without the integration, start to enable the integrations one by one to see which one is holding it back.
You can enter the homeassistant container with
docker exec -it homeassistant /bin/bash
and use top to see how the processor and memory usage change once you enable the integrations.
I assume you have something which has been updated recently.
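If you would rather have a log than watch top by hand while enabling integrations one by one, a small Python loop can print load and available memory at intervals. This is only a sketch under the assumption of a Linux /proc filesystem; the names `parse_meminfo`, `sample` and `watch` are made up for the example. Inside a container these figures normally describe the whole host or VM rather than the single container.

```python
import os
import time


def parse_meminfo(text):
    """Turn /proc/meminfo text into a {field: number} dict.

    Most fields are reported in kB; a few (e.g. HugePages_Total) are counts.
    """
    values = {}
    for line in text.splitlines():
        name, _, rest = line.partition(":")
        parts = rest.split()
        if parts and parts[0].isdigit():
            values[name.strip()] = int(parts[0])
    return values


def sample():
    """Return (1-minute load average, MemAvailable in MiB) on Linux."""
    with open("/proc/meminfo") as fh:
        mem = parse_meminfo(fh.read())
    return os.getloadavg()[0], mem.get("MemAvailable", 0) / 1024


def watch(interval=5.0, count=12):
    """Print one timestamped sample every `interval` seconds, `count` times."""
    for _ in range(count):
        load, avail = sample()
        print(f"{time.strftime('%H:%M:%S')} load1={load:.2f} "
              f"mem_available={avail:.0f} MiB")
        time.sleep(interval)
```

Running `watch()` in one shell while enabling an integration in the UI should make it fairly obvious if one of them pushes the load up or eats the available memory.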