Gemini Pro API

User87 · January 3, 2024, 10:15pm

So far no luck with gemini-pro in 2024.1.0. I tried removing and re-configuring the integration. Do I need a new API key? Unsure what I’m doing wrong.

tronikos · January 6, 2024, 8:48am

The PRs haven’t been merged yet. Hopefully they will make it in the next monthly release.

neebski · January 12, 2024, 6:22am

@tronikos Can we pull your repo from git via HACS to get your fixes early?

Also, I’m guessing @User87 wants function calling, like the OpenAI conversation agent has, to control devices and send commands. It would be amazing not to have to pay for OpenAI credits.

tronikos · January 12, 2024, 7:25am

You can but it’s much easier to wait. It will be included in the 2024.2 release.

mkotek · January 25, 2024, 10:29pm

You mean 2024.2 of course

One question though - Gemini Pro is not directly available in my country yet. Would it be possible to use Vertex AI API with Gemini Pro as a model instead - described here: https://ai.google.dev/examples?hl=en&keywords=vertexai

tronikos · January 26, 2024, 8:28pm

Yes I meant 2024.2. Fixed.

I don’t think it will work with the Vertex AI API without significant changes, which I don’t plan on making.

mkotek · January 30, 2024, 10:58pm

It’s a pity, because the direct version works almost nowhere in Europe, whereas the Vertex AI version would work.

tronikos · February 4, 2024, 9:28am

Is it working in Europe now? According to “Google Bard update: Image generation and Gemini Pro adds more languages”, it is supposed to have been globally available since February 1.

pdobrien3 · February 7, 2024, 11:16am

I have the service working perfectly in Developer Tools > Services, but I can’t figure out how to store the response in something I can use. Any chance you could provide an example?

tronikos · February 7, 2024, 10:20pm

Here is a script that takes a text prompt, camera to take a snapshot, and media player to play back the response:

alias: Gemini Pro Vision
fields:
  prompt:
    selector:
      text:
        multiline: true
    name: prompt
  media_player:
    selector:
      entity:
        filter:
          - domain: media_player
    name: media player
  camera:
    selector:
      entity:
        filter:
          - domain: camera
    name: camera
sequence:
  - service: camera.snapshot
    data:
      entity_id: "{{ camera }}"
      filename: /media/snapshot.jpg
  - service: google_generative_ai_conversation.generate_content
    data:
      prompt: "{{ prompt }}"
      image_filename: /media/snapshot.jpg
    response_variable: content
  - service: tts.speak
    target:
      entity_id: tts.piper
    data:
      media_player_entity_id: "{{ media_player }}"
      message: "{{ content.text }}"
      cache: false
  - variables:
      content: "{{ content }}"
  - stop: end
    response_variable: content
mode: single
icon: mdi:message-image
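
A minimal sketch of how the script's response could be reused elsewhere, for example in an automation's action list. It assumes the script was saved with the entity ID `script.gemini_pro_vision` (the actual ID depends on how the script was created), and the camera, media player, and notify names are placeholders rather than anything from this thread's setup. Because the script ends with `stop:` and a `response_variable:`, the caller can capture the returned data in its own variable:

```yaml
# Hypothetical caller; all entity and service names below are assumptions.
- service: script.gemini_pro_vision
  data:
    prompt: "Briefly describe what you see."
    camera: camera.front_door
    media_player: media_player.kitchen
  # Whatever the script returns via `stop:` ends up in `result`.
  response_variable: result
- service: notify.mobile_app_iphone
  data:
    # The generated text is available under the `text` key.
    message: "{{ result.text }}"
```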

mkotek · February 8, 2024, 12:09am

No, it does not work, at least the API version - still getting an error for the location of the user.

pdobrien3 · February 9, 2024, 12:35pm

Yeah, also, I can’t use the Nest API path as an image_filename: either. I don’t fully understand response_variable: or blueprints. I think what you provided is a blueprint? This is what I came up with, as I believe I am also going to have to store the thumbnail in a response_variable: ?

- id: 'c12'
  alias: Doorbell Camera Snapshot Notification
  trigger:
    platform: device
    device_id: feb17d26775a5xxxxxxxxxxxxxxxxx
    domain: nest
    type: camera_person
  action:
    - service_template: >-
        {%- if is_state('input_boolean.home', 'off') or
               not is_state('device_tracker.iphone', 'home') and
               is_state('sensor.ipad_ssid', 'wirelessfun') -%}
              notify.mobile_app_iphone
        {% else %}
              notify.ios
        {% endif %}
      data:
        message: Person Detected at the Front Door.
        data:
          image: >-
            /api/nest/event_media/{{ trigger.event.data.device_id }}/{{ trigger.event.data.nest_event_id }}/thumbnail
      response_variable: thumbnail
    - service: google_generative_ai_conversation.generate_content
      data:
        prompt: "Very briefly describe what you see in this image from my doorbell camera. Your message needs to be short enough to fit in a phone notification. Do not describe stationary objects or buildings."
        image_filename: "{{ thumbnail }}"
      response_variable: content
    - service: tts.speak
      target:
        entity_id: tts.piper
      data:
        media_player_entity_id: media_player.google_mini
        message: "{{ content.text }}"
        cache: false
    - variables:
        content: "{{ content}}"
    - stop: end
      response_variable: content    
  mode: queued
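
One possible way around the `image_filename:` problem, sketched under assumptions: `generate_content` reads the image from a file on disk, so an API path such as the Nest thumbnail URL is unlikely to work there, and a notify action does not return anything to capture in `response_variable:`. The sketch below saves a snapshot first, as in the script earlier in the thread, and then sends the model's answer in the notification. `camera.front_door` and the snapshot path are placeholders, not names from the original automation.

```yaml
action:
  # Save a still image to disk (camera.front_door is a placeholder entity).
  - service: camera.snapshot
    target:
      entity_id: camera.front_door
    data:
      filename: /media/doorbell.jpg
  # Ask Gemini about the saved file; the answer is stored in `content`.
  - service: google_generative_ai_conversation.generate_content
    data:
      prompt: "Very briefly describe what you see in this image from my doorbell camera."
      image_filename: /media/doorbell.jpg
    response_variable: content
  # Use the generated text as the notification message.
  - service: notify.mobile_app_iphone
    data:
      message: "{{ content.text }}"
```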

mkotek · February 10, 2024, 11:04am

@tronikos I don’t know how you coded the integration, but I wonder whether it would be possible to specify the endpoint for the integration manually. It seems the European endpoint blocks the integration at the moment, but the US one should work: python - Google Generative AI API error: "User location is not supported for the API use." - Stack Overflow

DaWheelz · February 17, 2024, 9:42am

I am having the same problem…

Guy-Falkes · February 19, 2024, 10:20pm

There’s both a language AND a country list there. Even though one’s language may be on the list of supported languages, not all regions are supported. I’m Dutch and live in the Netherlands. Dutch is supported, but the region Netherlands is not.

DeltaNu1142 · February 26, 2024, 2:12pm

I’ve looked around, but I’m having trouble finding tips on configuring the prompt template:

This smart home is controlled by Home Assistant.

An overview of the areas and the devices in this smart home:
{%- for area in areas() %}
  {%- set area_info = namespace(printed=false) %}
  {%- for device in area_devices(area) -%}
    {%- if not device_attr(device, "disabled_by") and not device_attr(device, "entry_type") and device_attr(device, "name") %}
      {%- if not area_info.printed %}

{{ area_name(area) }}:
        {%- set area_info.printed = true %}
      {%- endif %}
- {{ device_attr(device, "name") }}{% if device_attr(device, "model") and (device_attr(device, "model") | string) not in (device_attr(device, "name") | string) %} ({{ device_attr(device, "model") }}){% endif %}
    {%- endif %}
  {%- endfor %}
{%- endfor %}

Answer the user's questions about the world truthfully.

If the user wants to control a device, reject the request and suggest using the Home Assistant app.

What is the syntax for supplying the integration with more than one area, entity, device, device_attr? If someone would be willing to share an example of their template, that would help me lots!

TIA

EboBH83 · March 4, 2024, 6:14am

I’ve used this very successfully to send informative, time-saving notifications. The next big thing would be to be able to use the responses as conditions.
One example would be to disarm the alarm system when no cars are in the garage, and arm it when at least one car arrives.
Has anyone figured out this part?

User87 · March 4, 2024, 3:55pm

You may need to create a helper to store the garage car count. I wouldn’t be terribly surprised if you could count the cars with Frigate alone, without using Gemini. If you did want to use Gemini, you would run the automation on a time pattern trigger and store the result in a helper, then use that helper value as a condition in another automation.
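
A rough sketch of that idea, assuming an `input_number` helper named `input_number.garage_car_count` and a camera `camera.garage` already exist. Both names, the alarm entity, the snapshot path, and the 5-minute interval are made up for illustration. The first automation refreshes the helper on a timer; the second uses the helper value as a condition.

```yaml
# Automation 1: refresh the stored count every 5 minutes.
- alias: Update garage car count
  trigger:
    - platform: time_pattern
      minutes: "/5"
  action:
    - service: camera.snapshot
      target:
        entity_id: camera.garage
      data:
        filename: /media/garage.jpg
    - service: google_generative_ai_conversation.generate_content
      data:
        prompt: "How many cars are in this image? Answer with a single number only."
        image_filename: /media/garage.jpg
      response_variable: content
    # Stop here if the reply is not a plain number, so the helper keeps its old value.
    - condition: template
      value_template: "{{ content.text | trim | is_number }}"
    - service: input_number.set_value
      target:
        entity_id: input_number.garage_car_count
      data:
        value: "{{ content.text | trim | int }}"

# Automation 2: act on the stored value.
- alias: Disarm alarm when garage is empty
  trigger:
    - platform: state
      entity_id: input_number.garage_car_count
  condition:
    - condition: numeric_state
      entity_id: input_number.garage_car_count
      below: 1
  action:
    - service: alarm_control_panel.alarm_disarm
      target:
        entity_id: alarm_control_panel.home
```

Since the model's answer is free text, checking that it is numeric before writing it to the helper seems safer than trusting it blindly, especially when the result drives an alarm.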

Andyz0x · October 1, 2024, 5:45pm

Error: Error generating content: 404 Gemini 1.0 Pro Vision has been deprecated on July 12, 2024. Consider switching to different model, for example gemini-1.5-flash.

I’m getting this error. Can someone help me?

haus · October 23, 2024, 11:58pm

I don’t know if you solved this, but according to this issue, we have to upgrade (HA Core, I think):

github.com/home-assistant/core

Unable to switch to supported model

opened 07:03PM - 20 Oct 24 UTC

closed 07:48PM - 20 Oct 24 UTC

dgarozzo

integration: google_generative_ai_conversation

### The problem

`Executed: October 20, 2024 at 2:52:22 PM
Error: Error generating content: 404 Gemini 1.0 Pro Vision has been deprecated on July 12, 2024. Consider switching to different model, for example gemini-1.5-flash.
Result:
params:
  domain: google_generative_ai_conversation
  service: generate_content
  service_data:
    image_filename: /config/www/camerasnapshots/snapshot-doorbell.jpg
    prompt: >-
      Very briefly describe what you see in this image from my doorbell camera. Your message needs to be short to fit in a phone notification. Don't describe stationary objects or buildings.
  target: {}
running_script: false`

### What version of Home Assistant Core has the issue?

core-2024.4.3

### What was the last working version of Home Assistant Core?

_No response_

### What type of installation are you running?

Home Assistant OS

### Integration causing the issue

Google Generative AI

### Link to integration documentation on our website

https://www.home-assistant.io/integrations/google_generative_ai_conversation

### Diagnostics information

_No response_

### Example YAML snippet

```yaml
service: google_generative_ai_conversation.generate_content
metadata: {}
data:
  image_filename: /config/www/camerasnapshots/snapshot-doorbell.jpg
  prompt: >-
    Very briefly describe what you see in this image from my doorbell camera. Your message needs to be short to fit in a phone notification. Don't describe stationary objects or buildings.
response_variable: google_ai_response
```

### Anything in the logs that might be useful for us?

```txt
Error: Error generating content: 404 Gemini 1.0 Pro Vision has been deprecated on July 12, 2024. Consider switching to different model, for example gemini-1.5-flash.
```

### Additional information

_No response_