LLM Vision: Let Home Assistant see!

pav · November 9, 2024, 8:20am

Why is it that in the Calendar events are marked with the heading ‘Nothing seen’, but when opened contain the exact response text ? Very misleading …

valentinfrlch · November 9, 2024, 6:06pm

This is the default title if no keywords are found in the reponse. Will change next update.

pav · November 10, 2024, 9:30am

In this particular case I asked to verify and acknowledge whether any persons were to be seen, and if so to describe them. Which it did correctly - so I wonder why this would be a case of ‘no keywords found in the response’ …
Makes me wonder what ‘keywords’ are then ?

valentinfrlch · November 10, 2024, 10:15am

Right now titles of notifications and events are not generated by AI. Only the body is. The title is simply label + " seen", where label is ‘Person’, ‘Car’, etc.
For example the title is “Person seen” if the summary contains ‘person’, ‘man’, ‘woman’ or ‘individual’.

I am working on AI generated titles, but this is how it works for now.

pav · November 10, 2024, 11:56am

I’m afraid this is NOT ‘how it works for now’
Consider this : the response I got was “A man and a woman are on a porch. The woman is wearing a light-colored, sleeveless top and a skirt. The man is wearing a dark-colored shirt and light-colored shorts.”
Yet it was labeled ‘Nothing seen’
But as you’re working on a better titling system anyhow, let’s not make a fuss about it …

valentinfrlch · November 10, 2024, 12:28pm

v1.3.1 Data Analyzer

Today’s update adds a new action to seamlessly update sensors based on image/video input. Just describe what data you want to extract and select a sensor to update. You can use Helpers to create virtual sensors.
Supported sensors are number , text , boolean and select. Data types and available options for select sensors are recognized automatically.

heviiguy · November 11, 2024, 12:03am

INSTALLATION ISSUE

Okay, this is embarrasing. I’ve installed Ollama on the same Linux box on which Frigate and HA are running. The latter 2 are in docker containers. Ollama was installed directly.

I verified correct installtion by entering http://127.0.0.1:11434/ in a browser on that box. The resultant dialogue was: Ollama is running

Problem
The problem arises when, from another box on the same local network, I try to specify the Ollama server address during the integration set-up. After entering the same domain name used to access HA, the entry is not accepted.

port 11434 was selected
port 11434 has been forwarded on my router
the https option was activated

Interestingly, when I ask Ollama to confirm the port, this is what she spits out:

ollama run llava-phi3
>>> What port is the Ollama server running on this machine?
The Ollama server is currently running on the localhost at port 3001.

Can somebody please point out where I've missed something which is probably woefully quite basic?

NIUB · November 11, 2024, 9:07am

Is Llama3.2-vision supported?

NIUB · November 11, 2024, 9:13am

I tried it and it works with llama3.2-vision

valentinfrlch · November 11, 2024, 3:29pm

You don’t need to forward the port unless you want to access Ollama directly outside of your network. Since HA and Ollama both run on the same machine you can likely access Ollama from HA using http://localhost:11434.
Since HA runs in a docker container you probably need to change the network mode if you haven’t done so already. I’m not a docker expert, but I think you can achieve this with --network="host" for docker run or network_mode: "host" in docker compose respectively.

The LLM model is not aware what hardware it is running on or what ports are exposed, so you can ignore that.

heviiguy · November 11, 2024, 5:59pm

Thanks for pointing this out, Valentin. I clearly wasn’t aware of it.

Using 127.0.0.1 as the IP address enabled me to configure the integration. Now let’s see what kind of trouble I can get myself into…

NIUB · November 11, 2024, 6:03pm

hi man。 You don’t need to do so much. Just check Ollama’s FAQ for answers. I have also encountered this before.

github.com

ollama/ollama/blob/36a8372b2884c40cc5b86f6f859b012dc8125b80/docs/faq.md

# FAQ

## How can I upgrade Ollama?

Ollama on macOS and Windows will automatically download updates. Click on the taskbar or menubar item and then click "Restart to update" to apply the update. Updates can also be installed by downloading the latest version [manually](https://ollama.com/download/).

On Linux, re-run the install script:

```shell
curl -fsSL https://ollama.com/install.sh | sh
```

## How can I view the logs?

Review the [Troubleshooting](./troubleshooting.md) docs for more about using logs.

## Is my GPU compatible with Ollama?

Please refer to the [GPU docs](./gpu.md).

This file has been truncated. show original

valentinfrlch · November 11, 2024, 7:31pm

You’re welcome! If you have any questions, don’t hesitate to ask!

cknickl · November 12, 2024, 1:15am

Right now I get a notification from Home Assistant that motion was detected by a camera, then it’s updated a second or two later with the AI description.

For at least one of my cameras I’m ok just waiting on the AI version. Is there a way to suppress the first notification??

I’m using the 1.3.1 version of the template.

valentinfrlch · November 12, 2024, 3:58pm

The update should happen silently so you should only get one message. Are you on Android?

cknickl · November 12, 2024, 4:59pm

I am on iOS. The update is happening silently, which is awesome. I may just be looking at my phone too quickly and so I’m catching the initial notification. I was just hoping to almost delay the notifications by a few seconds to wait for the updated one.

tc23 · November 12, 2024, 10:05pm

For anyone who is using Node Red and calling this via a service node, if you are making changes to the prompt anything, you need to restart node red. Deploying won’t catch the updates. At least when using Ollama

ainen · November 13, 2024, 4:15am

I am now getting just a snapshot so I’d say the update is working much better. However, it now appears that the snapshot I get is for the previous event. For example, I have a camera on my back door telling me when my dog is back there so I can let her in/out. When she is ready to come in, I am notified about a dog at the back door except the snapshot is of her waiting to be let out.

edit: I just noticed that the HA integration wasn’t running the latest version. I will update that and check in later.

Walter1 · November 14, 2024, 2:31pm

Hi , I downloaded version 1.3.1 and made an APIkey in google AI-studio.
When in further install and copy the API key I get the message invalid key.
What can I be doing wrong? I’m on a free of charge plan
SOLVED → I takes a while before the key works

kmanan · November 14, 2024, 8:04pm

Hi. Looking to get RIng working with this. I have Scrypted configured to work with Ring and Scrypted captures clips. I am not a user of their paid NVR tool. I was wondering if there was a way to point to the location where scrypted stores the clips to use for this? Getting the Generic Camera thing working seems extremely complicated so was wondering if between Frigate, SCrypted, Ring MQTT HA, there was a way to use this

I think I got this working. WIll test over the weekend. Scrypted lets you get an rtsp link to the camera that can be added as a generic cam in HA. I have it. The image is being captured, need to test if Ollama processes it now.

EDIT: It works. I need to upgrade the RAM on my server that’s running Ollama but Ring → Scrypted → RTPC URL → Generic Camera works. Also, the page on the website re Ring is extremely confusing.