I got an Nvidia Jetson Orin NX 16GB about 2 weeks ago. Luckily, HA had worked with Nvidia to get it all working, including using the GPU for whisper/piper, plus you can add custom Docker containers alongside the HA container, as they run a special ARM version of Ubuntu. Even so, it seemed like it was going to be a huge headache. I was thinking of returning it (still might).
Then I decided to just add an entry under the Wyoming integration on my current HAOS server and point it to the IP of my Jetson and the ports needed, and it works just as well as when I was running only the Docker container on the Jetson! With that said, the models I am currently using are pretty old. There are some newer ones that are much smaller, but they don't seem to work as well. Local actions are almost instant. I will say, HA Cloud is still better at not mixing your words up, but local commands are slightly faster than HA Cloud. Right now OpenWakeWord runs on a Wyoming satellite, so HA isn't streaming anything, and I am using a USB speakerphone hooked into my Jetson from HA. I just pointed it to port 10700, and I am running the assist-satellite container on the Jetson.
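Before adding the Wyoming entries in HA, it can help to confirm the Jetson's ports are reachable from another machine on the network. Here is a minimal sketch of that check. The IP address is a placeholder, and the whisper and piper ports (10300 and 10200) are assumed to be the usual Wyoming defaults; only port 10700 for the satellite comes from my setup. It only tests that something is listening on each port and does not speak the Wyoming protocol itself.

```python
import socket


def port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


def check_services(host: str, services: dict) -> dict:
    """Map each service name to whether its port accepted a connection."""
    return {name: port_open(host, port) for name, port in services.items()}


# Placeholder address: replace with the Jetson's real IP.
JETSON_IP = "192.168.1.50"

# 10700 is the satellite port from the post; 10300 and 10200 are assumed
# defaults for whisper and piper and may differ on your setup.
SERVICES = {"whisper": 10300, "piper": 10200, "satellite": 10700}

# Example use:
#   for name, ok in check_services(JETSON_IP, SERVICES).items():
#       print(f"{name}: {'reachable' if ok else 'unreachable'}")
```

If a port shows as unreachable, the container may not be running, or its port may not be published on the Jetson's network interface.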
Still working on getting an LLM set up on it and getting HA pointed to it. These things aren't fun to work with, especially when stuff goes bad. I am not a Linux or container guru, but I know enough to see that it's "different". Out of the box, it just froze on the last step. I spent hours troubleshooting because some key prerequisites just didn't work. It has a dedicated USB-C port for plugging in another computer running Ubuntu 22.04 (VMs not recommended). One of their utilities, "jtop", a CLI resource manager that shows how much GPU, CPU, etc. is being used, just doesn't work anymore, so I can't tell how many resources it's even using. Might just wipe it and start over, especially now that I know I don't need to move everything to the Jetson. If I can just point piper and whisper to the Jetson, then I only need to run those containers. The difference between the 8GB model and the 16GB model is 70 TOPS vs 100 TOPS, so RAM really does matter. HA containers listed here
I will say, the max power mode on this thing is 25W, which is impressive, and that is what I set it to. It's on medium out of the box. With that said, it was not cheap. I think the 8GB model was about 3/4 of the price and came with the OS on an SD card instead of an NVMe drive. Also, they are apparently picky about NVMe drives, which is another reason I went with the 16GB model. Here are the full specs:
1024 NVIDIA® CUDA® cores | 32 Tensor cores | End-to-end lossless compression | Tile Caching | OpenGL® 4.6 | OpenGL ES 3.2 | Vulkan™ 1.1 | CUDA 10
Here are the Docker containers I am using. As you can see, Piper is 17GB and Whisper is 10GB alone. The newer versions are around 728MB each. I was having issues with those, but the ones below work great. Local recognition just has issues picking up specific words that HA Cloud doesn't. Take "attic": it thinks I am always saying "added" or "addict". TV noise in the background is particularly an issue. Still, running the CPU-based models on a roughly 3-year-old mini PC, which is total overkill for my HA server, takes 3 to 5 seconds for local commands. I need to compare the two properly, but the difference is obvious.
dustynv/wyoming-openwakeword latest-r36.2.0 e3f760f9cc65 7 months ago 994MB
dustynv/wyoming-assist-microphone latest-r36.2.0 0ead157124bc 7 months ago 1.08GB
dustynv/homeassistant-core latest-r36.2.0 2eb72d233ee8 7 months ago 3.24GB
dustynv/wyoming-whisper latest-r36.2.0 0869f969c10b 7 months ago 10GB
dustynv/wyoming-piper master-r36.2.0 619a537fc0bc 7 months ago 17.4GB
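For anyone sizing a drive for this, here is a quick sketch that totals the image sizes from the listing above. The sizes are copied from that listing; the helper name is just something made up for the example, and it assumes decimal units (1GB = 1000MB) as Docker reports them.

```python
# Conversion factors to gigabytes, assuming decimal units.
UNITS = {"KB": 1e-6, "MB": 1e-3, "GB": 1.0}


def size_gb(text: str) -> float:
    """Convert a size string such as '994MB' or '1.08GB' to gigabytes."""
    for unit, factor in UNITS.items():
        if text.endswith(unit):
            return float(text[: -len(unit)]) * factor
    raise ValueError(f"unrecognized size: {text!r}")


# Image sizes copied from the listing in the post.
IMAGES = {
    "dustynv/wyoming-openwakeword": "994MB",
    "dustynv/wyoming-assist-microphone": "1.08GB",
    "dustynv/homeassistant-core": "3.24GB",
    "dustynv/wyoming-whisper": "10GB",
    "dustynv/wyoming-piper": "17.4GB",
}

total = sum(size_gb(size) for size in IMAGES.values())
print(f"Total: {total:.2f} GB")
```

That comes to a bit under 33GB for the five images, with Piper and Whisper making up most of it.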