Integration with LocalAI

Local models and local API implementations (LocalAI, Functionary, llama-cpp-python) are not fully compatible with OpenAI; they are still under active development.
I have tried over 20 models and local APIs (textgen, LocalAI, …), and I am now trying Functionary with the Functionary 2.2 model.

Unfortunately, I haven’t had any success yet. Everything works much better with OpenAI, but it is not perfect either.

I’ve already moved the TTS/STT work to the GPU and the voice assistant works great, but it needs some “brains” via an LLM.

Ohh… I thought it would work with the availability of OpenAI functions in LocalAI.

is it still not compatible?

It should work, as seen here: https://youtu.be/pAKqKTkx5X4?si=VmZTxHgm05jKCUNw&t=1142

Looks like they followed this guide: Running LLM’s Locally: A Step-by-Step Guide

This then gives you a model called lunademo
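
If you want to sanity-check the endpoint outside Home Assistant first, here is a minimal sketch, assuming LocalAI is listening on http://localhost:8080/v1 and the model from the guide is registered as lunademo (adjust both if yours differ):

# Quick sanity check against a LocalAI instance, outside Home Assistant.
# Assumptions: LocalAI on http://localhost:8080/v1, model named "lunademo".
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="sk-not-needed")

response = client.chat.completions.create(
    model="lunademo",
    messages=[{"role": "user", "content": "What is the capital of France?"}],
)
print(response.choices[0].message.content)

If this also echoes the question back, the problem is on the LocalAI side rather than in the Home Assistant integration.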

But I too am getting my question back as the response. The models run correctly via AnythingLLM, though.

I’ll be back if I figure it out.

Found the issue is being tracked here: Prompt returns the statment · Issue #85 · jekalmin/extended_openai_conversation · GitHub

UPDATE: Enabling “Use Tools” within the conversation gets it working! It’s not perfect for me yet though.

Examples:

P: What is the time?

A: The current time is 2024-02-05 06:24:59.857207+00:00.

(Not human readable)

P: Turn on Kitchen Lights

A: Kitchen Lights are already on.

(They are not on)
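
As I understand it, “Use Tools” switches the integration’s requests from the legacy functions field to the newer tools format of the OpenAI chat API. A rough sketch of what such a request looks like; the endpoint, model name, and the execute_services schema below are illustrative placeholders, not the integration’s exact payload:

# Illustrative tools-style request, roughly what "Use Tools" enables.
# The base_url, model, and function schema are placeholders.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="sk-not-needed")

tools = [{
    "type": "function",
    "function": {
        "name": "execute_services",  # hypothetical schema for illustration
        "description": "Call a Home Assistant service",
        "parameters": {
            "type": "object",
            "properties": {
                "domain": {"type": "string"},
                "service": {"type": "string"},
                "entity_id": {"type": "string"},
            },
            "required": ["domain", "service", "entity_id"],
        },
    },
}]

response = client.chat.completions.create(
    model="lunademo",
    messages=[{"role": "user", "content": "Turn on the kitchen lights"}],
    tools=tools,
)
print(response.choices[0].message)

A backend that handles this properly should come back with a tool_calls entry in the message rather than a plain-text reply.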

I couldn’t find this option.
Are you saying it’s inside the Extended OpenAI Conversation integration?

Yeah, none of this is working locally yet. Maybe you can use vLLM and Functionary; my GPUs (Tesla P40) are too old and can’t run that model with vLLM.
In other words, no functions work. You can start Assist, talk to it, and it will respond and speak back, but NONE of the functions will run.
And if you activate OpenAI with the same parameters, everything works immediately.

Did you ever figure it out? I have the same problem running a 4090 GPU and LM Studio. I’ve tried a bunch of models. There must be a missing link.

No luck unfortunately :frowning: I’m still wrapping my head around this one… I’ve managed to sort of make it work with Ollama, but it can’t issue commands, so it’s back to square one.

Hi All,

I’ve been trying various permutations of models, integrations, and settings. Currently I’m trying to get the Home-LLM setup working. I’ve followed the Midori guide to install Home-3B-v3.q4_k_m.gguf on a Docker instance of LocalAI. I also installed the LLaMa Conversation integration on my Home Assistant server and followed the setup instructions given. However, this is some of the output I’m getting:

[screenshots of the output omitted]

I think the problem on the HA end is that the service being called is 'turn_off' instead of 'light.turn_off', but I’m not sure how to deal with that.
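
For comparison, Home Assistant’s own REST API expects the domain and the service as separate parts of the path, so a bare 'turn_off' on its own isn’t a valid call. A quick sketch (URL, token, and entity_id are placeholders):

# Calling a Home Assistant service directly over the REST API.
# The path carries both the domain ("light") and the service ("turn_off");
# a bare "turn_off" without a domain is not a valid service call.
# HA_URL, TOKEN, and entity_id are placeholders.
import requests

HA_URL = "http://homeassistant.local:8123"
TOKEN = "YOUR_LONG_LIVED_ACCESS_TOKEN"

resp = requests.post(
    f"{HA_URL}/api/services/light/turn_off",
    headers={"Authorization": f"Bearer {TOKEN}"},
    json={"entity_id": "light.kitchen"},
)
print(resp.status_code, resp.json())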

Any thoughts?

I have only recently started to look into this. I have the Assist pipeline running relatively smoothly with the actual OpenAI model. Is there a local AI model that runs well? I have not fully read this entire thread, but I’m quickly gathering there isn’t a model that just works.

For confirmation, enabling “Use Tools” worked for you‽

Also, what model are you using?

Assist query + Raw RESTful request:

Assist query using OpenAI Extended

8:24AM DBG Function return: { "arguments": { "message": "who is bruce wayne" } , "function": "answer"}  map[arguments:map[message:who is bruce wayne] function:answer]
8:24AM DBG nothing to do, computing a reply
8:24AM DBG Reply received from LLM: who is bruce wayne
8:24AM DBG Reply received from LLM(finetuned): who is bruce wayne
8:24AM DBG Response: {"created":1709644415,"object":"chat.completion","id":"c42d7f72-dfa4-497b-8d34-d5e5b5e0b909","model":"luna-ai-llama2-uncensored.Q8_0","choices":[{"index":0,"finish_reason":"","message":{"role":"assistant","content":"who is bruce wayne"}}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}
[192.168.7.188]:36018 200 - POST /chat/completions

As you can see, the function return seems to be populating the query as the answer :frowning:


You can now use the LocalAI All-in-One images, which come with the required models pre-configured (including for function calling): Quickstart | LocalAI documentation

Note that the CPU images aren’t great for function calling yet; you currently need a GPU.
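
If you’re unsure which model names an AIO image exposes, you can list them through the same OpenAI-compatible endpoint; a small sketch, assuming the container is published on the default port 8080:

# List the models a LocalAI All-in-One image pre-configures.
# Assumes the container is reachable on the default port 8080.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="sk-not-needed")

for model in client.models.list().data:
    print(model.id)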

Were you able to integrate it with HA, and were the commands executed properly?

We now have an Ollama integration; an Ollama add-on would also be nice.

I’m not sure if it’s helpful, but I came across the GPT4All project, which I was able to get up and running on local hardware.

However, I have no idea how to get Home Assistant to interact with it.

For people who have an Nvidia GPU, I made a guide to get the Functionary LLM working with the Extended OpenAI HACS integration.

https://community.home-assistant.io/t/ai-voice-control-for-home-assistant-fully-local/


Similar to this post, I’m curious if it would be practical to run LocalAI as an add-on using a mini PC with a Coral attached.

Can you install an add-on on a different machine than the one running HA?

Or is it that the add-on is installed on the same HA machine, but the add-on’s backend is wherever it’s programmed to be?

Asking for a friend :wink:

@kalfa I have Node-Red, Grafana, and InfluxDB installed as Docker containers on my Synology, and then I just made a custom menu item to point to their URLs. I haven’t used them much, so not sure how practical it is to do that vs. installing the add-ons on the computer running Home Assistant, but it should be possible (although more complicated).

Do you have any pointers on how to do it?

I’m not sure what you mean by “custom menu items”; it seems like you have them reachable from the HA UI, but not integrated as add-ons?

Home Assistant Add-ons - Home Assistant - Are you talking about those?

In practice they’re HA apps, so there is a specific interface they need to comply with, so that HA is aware of them and they are aware of HA.

And since they are Docker containers, and HA creates an internal Docker network to communicate with them (all plain and standard ideas), the question I have is:

  • how do I install an add-on (defined as above) on a different machine, such that HA knows about it as a full add-on and can interact with it bidirectionally?

In practice, one way to tackle the problem is extending the Docker network across multiple nodes, that is to say meshing two dockerd instances. Is that what you are talking about?

I’m not sure if there are simpler or better solutions.

I simply don’t want to reinvent the wheel, or even build a different wheel, if there is one that is standard or being standardized. I just want to deliver some projects on distributed add-ons, however it is done :wink:

Bearded Tinker has several videos on how to set up Docker containers to work with HA, such as this one.
