I am observing this behavior for days in Proxmox 7.2 and with the last version of homeAssistant(2022.7.5) I do not know why this is happening… but it is exactly as you described, just random restart of HAOS I would say, but if I check the docker containers, actually those restarts are not counted by docker… So I start to think that it can be networking… not sure…
i am running Proxmox 7.1-12 with HAS 2022.7.4 without issue. Proxmox has been super reliable. I have a 4 node CEPH cluster running. I now wont be upgrading to 7.2 for a little while me thinks.
Watching with great interest.
Hi All, in the proxmox haos community installation guide thread it is shared that the fix is to install the edge kernel. It seems to fix the issue for me as well thus we can close this thread. Will monitor and report back in that thread if I encounter issues.
So after changing a bit the network config, assigning a fixed IP address, it seems that it’s not doing those restarts anymore … I will test it more in depth but it looks much better, today no restarts at all…
Can you please provide a link on how to do this? thanks
interesting, mind sharing your changes? I have used fixed IP from the start as well.
of course I don’t mind, but unfortunately I did no magic, I have just changed from DHCP to a fixed IP, nothing else… And still the 2nd day stable…
Did you set DHCP from within HAOS? Or set the allocation by the router?
DHCP of course was configured in the router, but I just have give via HAOS some fixed IP (not a fixed DHCP IP for the MAC on the router).
I used the proxmox helper script to install the edge kernel. So far it is acting much much better. Just in case someone else comes along and reads this. I am also using DHCP with a lease. Much easier to make network changes after the fact if using DHCP vs static configuration.
Hi,
just to let you know it started to do the same once more, and it seems that the frontend is restarting every 5-10 minutes, it can be really annoying… And anything I did it would solve it so I am going now to Supervised installation, at least I will have more control with the OS, and it seems that for the time being is not doing it.
I talked too fast …
Aug 24 13:14:10 supervised kernel: [ 2551.678612] hassio: port 7(veth2c972ed) entered disabled state
Aug 24 13:14:10 supervised kernel: [ 2551.679087] veth02d46b3: renamed from eth0
Aug 24 13:14:10 supervised kernel: [ 2551.726743] hassio: port 7(veth2c972ed) entered disabled state
Aug 24 13:14:10 supervised kernel: [ 2551.727993] device veth2c972ed left promiscuous mode
Aug 24 13:14:10 supervised kernel: [ 2551.727999] hassio: port 7(veth2c972ed) entered disabled state
Aug 24 13:15:59 supervised 92d3bc96b29f[361]: [13:15:59] [172.30.32.2][6bdd6c22][ExtensionHostConnection] <358> Extension Host Process exited with code: 0, signal: null.
Aug 24 13:16:00 supervised 92d3bc96b29f[361]: [13:16:00] [172.30.32.2][894303d0][ManagementConnection] The reconnection short grace time of 5m has expired, so the connection will be disposed.
This is exactly what I got in /var/log/messages
before loosing the frontend for some time…
IT started to be really unusable, I tried the following with no success:
- Removing HTTPS, and use just HTTP
- Remove the telegram
- Use DHCP or Fixed IP
- Used HAOS or Supervised installation
- Changed the network card type from Proxmox
- Now I am trying not using Postgres for the recorder it didnt happen yet…
Edit: The postgres 14.4 installation is on a LXC running alpine…
Have you tried the PVE edge kernel?
Hi would like to update again that I’m still having the same issues of restarts. I’m not sure if there is a link that this happens if I utilise 4 cores (all) as by default the script to install uses 2.
I might want to try to migrate to esxi to see if it fixes. There is some discussion on N5105 and PVE.
I updated to the Edge kernel yesterday, and so far is going well, at least for now… but I desisted using HAOS, I went to supervised installation, so I could control myself the OS…
For total control, run Home Assistant Container. It’s the method I chose. I feel like the Supervised method is more trouble than it’s worth, now and in the future.
uhmm I thought on that but that would be already too much control for my little time but I though even in using a LXC for it but… uhmm Add-ons are just comfortable to me…
Your root cause to stability issues is actually Proxmox and problems with the N5105 series CPUs. I had similar problems with the N5095 cpu stability that started with Proxmox moving to kernerl 5.15.x, but mine resolved by updating the microcode in proxmox. I was struggling with instability issues since June timeframe, and couldnt keep things running for a day, until I was able to track down the below thread.
See this proxmox thread for details, some folks are reporting better stability with proxmox on 5.19 kernels and the latest microcode - VM freezes irregularly | Proxmox Support Forum