Help troubleshooting ESXI install

Hi,
I have switched to vmware esxi and would like to run hassos on it. I have followed the instructions on the official ‘getting started’ page and succesfully installed hassos. It is up and running, as in: the CLI shows up, and the webinterface is up and running. So far so good.

Now the trouble starts:
I have a backup (tar). Trying to restore it results in infinite loading (I let it run +5 hours). After a reboot, the CLI still comes up. Navigating in my browsers to http:[IP]:4357 says everything is healthy. However, the webinterface is unavailable.

I have also tried restoring just:

  • the HA version (disabled every other option)
  • the addons

both result in the same issue of the interface not showing up anymore.

Since I cannot access the webinterface, I don’t know how to troubleshoot from here on

Can you telnet into the web UI? That would at least tell whether the socket is accepting connections.

telnet <web ui IP> <port>

When HA is working (clean install, no attempt to recover backup:
telnet 192.168.1.237 8123
(powershell) I get a blank screen. Then hitting any key results in

 HTTP/1.0 400 Bad Request
Content-Type: text/plain; charset=utf-8
Content-Length: 44
Date: Thu, 02 Mar 2023 07:42:53 GMT
Server: Python/3.10 aiohttp/3.8.4

Bad status line 'Invalid method encountered'

Connection to host lost.

When attempting to telnet after I hit the ‘full restore’ button, and waiting 15 minutes, I get

Connecting To 192.168.1.237...Could not open connection to the host, on port 8123: Connect failed

in ESXi I see barely any CPU, disk or memory usage after clicking the restore button. HA immediately reports ‘Connection lost. Reconnecting…’

again, the CLI remains accessible. Can I use that to provide information for troubleshooting?

The blank screen means the the TCP handshake completes. So, network and web server are fine, maybe the problem being with HA Core itself.

Does telnet also work right after the connection failed message?

Tested it just now. Nope, as soon as the ‘connection lost’ message pops up, telnet fails to connect.
I’m really liking the snapshot feature of esxi btw haha. Makes it easy to experiment without having to start all over again!

Yeah, ESXi is the way to go. Leaves HyperV in the dust.

I suspect that HA is crashing, including the web server, when the restore starts. I’m not sure how to but I’d bet that there’s a way to get logs from CLI.

By the way, which installation instructions did you follow?

1 Like

I used the official ova file. I found this article but essentially only followed the very first paragraph since HA now has an ova file

1 Like