Network randomly dropping

I’m using a Dell Optiplex 3020M that I had lying around, running haOS directly. I plan to one day replace this, but it works fine when its up…

Trouble is, randomly, it seems to drop off the network. I cannot ping it, I cannot SSH to it or load the webpage. Only thing I seem to be able to do is forcibly power it off and back on to regain access. This happens randomly so I am not sure at what time or currently what causes it. I can go days and days without seeing the isssue, then it might happen 2 days in a row.

I’ve had a quick look in logs and cannot see anything that is obvious to me, so I was hoping someone could point me to the best place to look and how I can find the real cause?

different network cable first
different switch port

maybe then boot from a live linux stick and monitor network stability without haos

Check your disk for errors.

Forcible reboot is a harsh treatment.

Do you have other devices on your LAN? Are they stable?

Do you have a static IP address allocated on your router DHCP section for the PC?

Do you have spiders setting up camp in your nice warm power supply?

Faulty ram? Dying disk? Dying power supply? My NAS had a dying power brick and replacing it fixed all weird stability errors.

Running PiHole, Adguard, or any other DNS re-director proxy system?

How many devices (IP and Ethernet) are connecting through the router?

I’ve tried switching out port and cable already. Live Linux boot disc it is good idea and I will try that. I’d previously been using this for other things and not had any stability issues, so was looking more at HA to be fair, but of course that doesn’t mean it isn’t hardware

I’ll do that, but might switch the disk anyway. I don’t like a forcible reboot either.

Everything else on my entire network (as far as I am aware) is stable. IP is provided by DHCP rather than static in the OS because I have had to rebuild this a couple of times as I experiment. Easier to be sure it gets the same IP.

The PSU for this device is a laptop charger so I don’t think the spiders can get in, but I will check the box itself.

I’ll do some diagnostics. Again had used it before this and it was fine, but that doesn’t mean it isn’t something like this.

I use Adguard Home and Nginx Proxy Manager. Both are working for other things. I cannot get to HA direct to its IP on port 8123 either.

Well if you add a system to block and redirect DNS requests using lists curated by random people, then sometimes it messes you up.
Don’t use it unless you know Exactly what it is doing…

Hint: No one knows exactly what it is doing…

I’m pretty sure that when I created the original post, I wasn’t able to ping the IP of HA. However, its going down almost daily now and every time I can ping it. I have a keyboard connected and if I just issue the command to restart HA Core, then it comes back up.

Although I take onboard the comment about AGH blocking etc. I don’t believe this is the cause of any of my issues, but I will certainly keep this in mind as I continue to investigate. I have a VM of HA which I use when building new integrations into my environment so as not to clog up anything on the live one and that never has issues.

I installed the Scrutiny add-on and the disk is reporting issues, so my first task now I have a little bit of time is to swap that out.

Using these tools, you can see in the logging what is blocked, and for instance you can temporarily disable them to see if the blockinglists are the issue.

Fingers crossed you haven’t lost any data or your database is corrupted.

If so, hope your last good backup is restorable on a fresh rebuild on your new drive.