Local Nvidia supercomputer HA with AI LLM

Nvidia has just come out with some affordable supercomputers. I would LOOOVE to be able to run HA on one of those, to then be able to include numerous cameras and Frigate, as well as locally hosted large language models for AI processing…

Is that possible with any of their new models?

Aren’t they just PCs with an Nvidia GPU? If so, then they will run HA.

But will HA use the GPU?

I have a gaming machine with an Nvidia card on which I run a fully local voice assistant. The details are way over my head, but my understanding is that it was specifically written to exploit the speed of the GPU - which HA and the rest aren’t.

It’s an Nvidia Grace (ARM) CPU married directly to a Blackwell GPU that drives in excess of 5000 TOPS (100x what qualifies a machine to be a Copilot+ PC).

It’s more like a gigantic Jetson Nano. And no, I don’t expect it to run HA directly. Although I DO expect it to run Ollama and 30B models locally (Jensen says it runs Nvidia’s AI stack…). Basically, you could in theory run a foundational model at home, and if it’s capable of running a test-time-compute or test-time-training (reasoning) model… you almost have something that could theoretically run an o1-class model or comparable on your desk. :exploding_head:
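
To make that concrete, here’s a minimal sketch of what querying a locally hosted ~30B model looks like once Ollama is running. The model name is my assumption (substitute whatever you’ve pulled); the endpoint and payload are Ollama’s standard REST API:

```python
import requests

# Minimal sketch: ask a locally hosted ~30B model a question through
# Ollama's REST API. The model name is an example; pull it first with
# `ollama pull qwen2.5:32b` or substitute whatever you run.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "qwen2.5:32b",
        "prompt": "In one sentence, what is Home Assistant?",
        "stream": False,  # one JSON object instead of a token stream
    },
    timeout=300,
)
resp.raise_for_status()
print(resp.json()["response"])
```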

I know 3K is a lot of scratch but for what I think this box will do - it will be a steal.

I’ve been looking at the latest CES reveals and agree. The computing power in such a small form factor is amazing. Considering I have several GPUs I paid 2K for, a price of 3K seems reasonably cheap.

I wouldn’t use it as a dedicated HA device though :grinning:

HA won’t use the GPU, but maybe Whisper, Piper or Ollama can be persuaded to do so. HAOS does not ship with the NVIDIA driver, so that won’t work there.
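
They can be. As an example, the faster-whisper library that the Whisper add-on is built on takes a device argument, so on a proper Linux install with the NVIDIA driver you can push speech-to-text onto the card; a minimal sketch (the audio file is a placeholder):

```python
from faster_whisper import WhisperModel

# Minimal sketch: run Whisper speech-to-text on the Nvidia GPU.
# device="cuda" is what actually engages the card; it needs the NVIDIA
# driver and CUDA libraries, which is exactly what stock HAOS lacks.
model = WhisperModel("large-v1", device="cuda", compute_type="float16")

segments, _info = model.transcribe("command.wav")  # placeholder audio file
for segment in segments:
    print(segment.text)
```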

Docker or virtualized HA would be best.

HA has no use for such power. Any integration would need a separate service; it is the separate service that would use the processing power, and the HA integration simply connects to it.
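
To illustrate the split, here’s a hypothetical sketch of the service side: the heavy lifting lives in a small HTTP service on the GPU box, and HA only ever sends it a cheap request. Endpoint name and payload are made up for illustration:

```python
from flask import Flask, jsonify, request

# Hypothetical sketch of the division of labor: this service runs on the
# GPU box and owns the expensive compute; HA's integration just POSTs to
# it over the network and gets a small JSON answer back.
app = Flask(__name__)

@app.route("/describe", methods=["POST"])
def describe():
    prompt = request.json["prompt"]
    # GPU-heavy inference (LLM, vision model, ...) would happen here.
    answer = f"(model output for: {prompt!r})"  # stand-in for real inference
    return jsonify({"answer": answer})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8000)
```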

What is best for home AI: an RTX 5090, or waiting for the DIGITS box?

A 5090 will push approximately 3300 TOPS at max throughput and will cost nearly as much as a DIGITS, but you get to install it in your machine of choice and game on it. It’s not a direct comparison.

The DIGITS is pure AI, all the time, and there are still a lot of questions. A 5090? Well, if you like PUBG… it’s gonna be a banger.

HA add-ons do not support the nvidia-runtime. You would need to (and likely very much want to) run a standard Linux installation and run everything in Docker.
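
A minimal sketch of that with the Docker SDK for Python, assuming the NVIDIA Container Toolkit is installed on the host; it’s the programmatic equivalent of `docker run --gpus all -p 11434:11434 ollama/ollama`:

```python
import docker

# Minimal sketch: on a standard Linux install with the NVIDIA Container
# Toolkit, hand all GPUs to a container (here: the stock Ollama image).
client = docker.from_env()
client.containers.run(
    "ollama/ollama",
    name="ollama",
    detach=True,
    device_requests=[
        docker.types.DeviceRequest(count=-1, capabilities=[["gpu"]])
    ],
    ports={"11434/tcp": 11434},
    volumes={"ollama": {"bind": "/root/.ollama", "mode": "rw"}},
)
```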

Honestly, I’d run the box purely as suggested by Nvidia and call to it from off-box. It’s a different animal, purpose-built for what it does. I don’t think it’s going to run non-AI workloads worth a crud. OK, yeah, it can brute-force compute, but if you want this box to do what it does, you’ll want it to be a purpose-built machine.

I have a HA Yellow. Could a setup be to keep HA on the Yellow and add a DIGITS to my network with an LLM on it, then have the Yellow call to it when needed?

Absolutely! That would be supported by the Ollama integration. I have a media center HTPC with a 2080Ti, and that’s how we use it with HA — I run the LLMs there.
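
Under the hood it’s nothing exotic; HA just talks to the Ollama box over the LAN, roughly like this (hostname and model are placeholders for whatever your DIGITS ends up serving):

```python
import requests

# Rough sketch of the kind of chat call HA's Ollama integration makes
# across the LAN: the Yellow stays in charge of the home, the GPU box
# only answers prompts. "digits.local" is a placeholder hostname.
resp = requests.post(
    "http://digits.local:11434/api/chat",
    json={
        "model": "llama3.1",
        "messages": [
            {"role": "user", "content": "Suggest an evening lighting scene."}
        ],
        "stream": False,
    },
    timeout=120,
)
print(resp.json()["message"]["content"])
```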

Build ollama + open-webui now.

Add digits as another ollama endpoint to the mix when it’s ready.

Profit?
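
The first step could look like this with the Docker SDK; when DIGITS ships, it’s just a second entry in open-webui’s OLLAMA_BASE_URLS list (the env var is my assumption from open-webui’s multi-backend support; hostnames are placeholders):

```python
import docker

# Sketch: open-webui in front of two Ollama backends. Today only the
# first URL exists; when DIGITS is ready it becomes the second entry
# (semicolon-separated). Hostnames are placeholders for your machines.
client = docker.from_env()
client.containers.run(
    "ghcr.io/open-webui/open-webui:main",
    name="open-webui",
    detach=True,
    ports={"8080/tcp": 3000},
    environment={
        "OLLAMA_BASE_URLS": "http://gpu-box:11434;http://digits:11434"
    },
    volumes={"open-webui": {"bind": "/app/backend/data", "mode": "rw"}},
)
```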

Yeah. I would say that it’s more cost-effective to start with something like a used M3 Mac now.

I got an Nvidia Jetson Orin NX 16GB. Nabu Casa and Nvidia worked together to fork Piper and Whisper to GPU-based versions, using the large-v1 model for Whisper; Ollama already supports the Jetson lineup.

It’s essentially a carrier board with a proprietary socket for the module, which has an ARM CPU, GPU and RAM in one package. The carrier board is just ports, 2 SSD slots, and some MIPI connectors for cameras.

Typical time for a general question is 2 to 5 seconds depending on the response. Right now it’s poor at actually controlling HA, so I use the fallback-to-local option for local control, and that works great. Small models aren’t good at controlling smart homes compared to something like Extended OpenAI Conversation, though. Maybe someday with MCP, which HA already supports. MCP is essentially a protocol layer between your LLM and third-party APIs that “translates” everything so your LLM knows how to handle them (sketch below).

You can run HA in a Docker container on the Jetson, but it’s easier to just point HA at Whisper, Piper and openWakeWord via IP by adding them manually as Wyoming services. Here are the docker containers. I am sure the DIGITS will use the same OS, Jetson Linux.
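
To make the MCP idea concrete, here’s a minimal sketch of an MCP tool server using the official Python SDK’s FastMCP helper; the tool itself is a made-up example, a real one would call HA’s API:

```python
from mcp.server.fastmcp import FastMCP

# Minimal sketch of an MCP tool server. The LLM discovers the tool's
# name, arguments and docstring through the protocol, so it "knows how
# to handle it" without any model-specific glue code.
mcp = FastMCP("home-tools")

@mcp.tool()
def light_state(room: str) -> str:
    """Report the state of the lights in a room (made-up example)."""
    # A real server would query Home Assistant's REST API here.
    return f"The {room} lights are off."

if __name__ == "__main__":
    mcp.run()  # stdio transport by default
```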

The DIGITS will basically run like a grown-up Jetson Orin as far as I understand, so if you’re there, you’re good.

Also, Open-webui’s 0.6.0 build yesterday started supporting MCP (stdio, not SSE yet) natively. So in the very near future you’ll be able to call an open-webui endpoint that’s:

- Running a local model
- Brokering other local models
- Connected to MCP tools (one of them could be HA itself for more control options, or the HA DB for history data without having to create a billion SQL sensors)
- Running RAG and able to implement LangChain, so we can stop storing data in sensors
- Able to put the chat on the appropriate pipeline based on complexity and privacy requirements (there’s a tool for open-webui that can redirect chats based on workload need)
- Able to escalate the call to a more powerful model if necessary (DIGITS, OpenAI, etc.)
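
When that lands, calling the broker could be as simple as pointing any OpenAI-style client at open-webui; a sketch assuming its OpenAI-compatible API, with host, key and model name as placeholders:

```python
from openai import OpenAI

# Sketch: one endpoint in front of everything above (local model, MCP
# tools, RAG, escalation). Host, API key and model name are placeholders.
client = OpenAI(
    base_url="http://openwebui.local:3000/api",
    api_key="sk-your-open-webui-key",
)
reply = client.chat.completions.create(
    model="llama3.1:8b",
    messages=[{"role": "user", "content": "Summarize today's HA events."}],
)
print(reply.choices[0].message.content)
```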

Oh yeah it’s comin…