PyPI - llm-gemini - Versions diffs - 0.20a1__tar.gz → 0.21__tar.gz - Mend

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (13)

{llm_gemini-0.20a1 → llm_gemini-0.21}/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: llm-gemini
-Version: 0.20a1
+Version: 0.21
 Summary: LLM plugin to access Google's Gemini family of models
 Author: Simon Willison
 License-Expression: Apache-2.0
@@ -10,7 +10,7 @@ Project-URL: Issues, https://github.com/simonw/llm-gemini/issues
 Project-URL: CI, https://github.com/simonw/llm-gemini/actions
 Description-Content-Type: text/markdown
 License-File: LICENSE
-Requires-Dist: llm>=0.26a0
+Requires-Dist: llm>=0.26
 Requires-Dist: httpx
 Requires-Dist: ijson
 Provides-Extra: test
@@ -18,6 +18,7 @@ Requires-Dist: pytest; extra == "test"
 Requires-Dist: pytest-recording; extra == "test"
 Requires-Dist: pytest-asyncio; extra == "test"
 Requires-Dist: nest-asyncio; extra == "test"
+Requires-Dist: cogapp; extra == "test"
 Dynamic: license-file
 # llm-gemini
@@ -63,25 +64,71 @@ llm models default gemini-2.0-flash
 llm "A joke about a pelican and a walrus"
 ```
-Other models are:
-- `gemini-2.5-pro-preview-05-06` - latest paid Gemini 2.5 Pro preview
-- `gemini-2.5-flash-preview-04-17` - Gemini 2.5 Flash preview
-- `gemini-2.5-pro-exp-03-25` - free experimental release of Gemini 2.5 Pro
-- `gemini-2.5-pro-preview-03-25` - paid preview of Gemini 2.5 Pro
-- `gemma-3-27b-it` - [Gemma 3](https://blog.google/technology/developers/gemma-3/) 27B
-- `gemini-2.0-pro-exp-02-05` - experimental release of Gemini 2.0 Pro
-- `gemini-2.0-flash-lite` - Gemini 2.0 Flash-Lite
-- `gemini-2.0-flash` - Gemini 2.0 Flash
-- `gemini-2.0-flash-thinking-exp-01-21` - experimental "thinking" model from January 2025
-- `gemini-2.0-flash-thinking-exp-1219` - experimental "thinking" model from December 2024
-- `learnlm-1.5-pro-experimental` - "an experimental task-specific model that has been trained to align with learning science principles" - [more details here](https://ai.google.dev/gemini-api/docs/learnlm).
-- `gemini-2.0-flash-exp` - [Gemini 2.0 Flash](https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/#gemini-2-0-flash)
-- `gemini-exp-1206` - recent experimental #3
-- `gemini-exp-1121` - recent experimental #2
-- `gemini-exp-1114` - recent experimental #1
-- `gemini-1.5-flash-8b-latest` - the least expensive
-- `gemini-1.5-flash-latest`
+## Available models
+<!-- [[[cog
+import cog
+from llm import cli
+from click.testing import CliRunner
+runner = CliRunner()
+result = runner.invoke(cli.cli, ["models", "-q", "gemini/"])
+lines = reversed(result.output.strip().split("\n"))
+to_output = []
+NOTES = {
+    "gemini/gemini-2.5-pro-preview-05-06": "Latest paid Gemini 2.5 Pro preview",
+    "gemini/gemini-2.5-flash-preview-05-20": "Gemini 2.5 Flash preview",
+    "gemini/gemini-2.5-flash-preview-04-17": "Earlier Gemini 2.5 Flash preview",
+    "gemini/gemini-2.5-pro-exp-03-25": "Free experimental release of Gemini 2.5 Pro",
+    "gemini/gemini-2.0-flash-thinking-exp-01-21": "Experimental \"thinking\" model from January 2025",
+    "gemini/gemini-1.5-flash-8b-latest": "The least expensive model",
+}
+for line in lines:
+    model_id, rest = line.split(None, 2)[1:]
+    note = NOTES.get(model_id, "")
+    to_output.append(
+        "- `{}`{}".format(
+            model_id,
+            ': {}'.format(note) if note else ""
+        )
+    )
+cog.out("\n".join(to_output))
+]]] -->
+- `gemini/gemini-2.5-flash-preview-05-20`: Gemini 2.5 Flash preview
+- `gemini/gemini-2.5-pro-preview-05-06`: Latest paid Gemini 2.5 Pro preview
+- `gemini/gemini-2.5-flash-preview-04-17`: Earlier Gemini 2.5 Flash preview
+- `gemini/gemini-2.5-pro-preview-03-25`
+- `gemini/gemini-2.5-pro-exp-03-25`: Free experimental release of Gemini 2.5 Pro
+- `gemini/gemini-2.0-flash-lite`
+- `gemini/gemini-2.0-pro-exp-02-05`
+- `gemini/gemini-2.0-flash`
+- `gemini/gemini-2.0-flash-thinking-exp-01-21`: Experimental "thinking" model from January 2025
+- `gemini/gemini-2.0-flash-thinking-exp-1219`
+- `gemini/gemma-3n-e4b-it`
+- `gemini/gemma-3-27b-it`
+- `gemini/gemma-3-12b-it`
+- `gemini/gemma-3-4b-it`
+- `gemini/gemma-3-1b-it`
+- `gemini/learnlm-1.5-pro-experimental`
+- `gemini/gemini-2.0-flash-exp`
+- `gemini/gemini-exp-1206`
+- `gemini/gemini-exp-1121`
+- `gemini/gemini-exp-1114`
+- `gemini/gemini-1.5-flash-8b-001`
+- `gemini/gemini-1.5-flash-8b-latest`: The least expensive model
+- `gemini/gemini-1.5-flash-002`
+- `gemini/gemini-1.5-pro-002`
+- `gemini/gemini-1.5-flash-001`
+- `gemini/gemini-1.5-pro-001`
+- `gemini/gemini-1.5-flash-latest`
+- `gemini/gemini-1.5-pro-latest`
+- `gemini/gemini-pro`
+<!-- [[[end]]] -->
+All of these models have aliases that omit the `gemini/` prefix, for example:
+```bash
+llm -m gemini-1.5-flash-8b-latest --schema 'name,age int,bio' 'invent a dog'
+```
 ### Images, audio and video
@@ -154,6 +201,31 @@ To chat interactively with the model, run `llm chat`:
 llm chat -m gemini-2.0-flash
 ```
+### Timeouts
+By default there is no `timeout` against the Gemini API. You can use the `timeout` option to protect against API requests that hang indefinitely.
+With the CLI tool that looks like this, to set a 1.5 second timeout:
+```bash
+llm -m gemini-2.5-flash-preview-05-20 'epic saga about mice' -o timeout 1.5
+```
+In the Python library timeouts are used like this:
+```python
+import httpx, llm
+model = llm.get_model("gemini/gemini-2.5-flash-preview-05-20")
+try:
+    response = model.prompt(
+        "epic saga about mice", timeout=1.5
+    )
+    print(response.text())
+except httpx.TimeoutException:
+    print("Timeout exceeded")
+```
+An `httpx.TimeoutException` subclass will be raised if the timeout is exceeded.
 ## Embeddings
 The plugin also adds support for the `gemini-embedding-exp-03-07` and `text-embedding-004` embedding models.
@@ -186,6 +258,21 @@ llm similar readmes -c 'upload csvs to stuff' -d embed.db
 See the [LLM embeddings documentation](https://llm.datasette.io/en/stable/embeddings/cli.html) for further details.
+## Listing all Gemini API models
+The `llm gemini models` command lists all of the models that are exposed by the Gemini API, some of which may not be available through this plugin.
+```bash
+llm gemini models
+```
+You can add a `--key X` option to use a different API key.
+To filter models by their supported generation methods use `--method` one or more times:
+```bash
+llm gemini models --method embedContent
+```
+If you provide multiple methods you will see models that support any of them.
 ## Development
 To set up this plugin locally, first checkout the code. Then create a new virtual environment:

{llm_gemini-0.20a1 → llm_gemini-0.21}/README.md RENAMED

@@ -41,25 +41,71 @@ llm models default gemini-2.0-flash
 llm "A joke about a pelican and a walrus"
 ```
-Other models are:
-- `gemini-2.5-pro-preview-05-06` - latest paid Gemini 2.5 Pro preview
-- `gemini-2.5-flash-preview-04-17` - Gemini 2.5 Flash preview
-- `gemini-2.5-pro-exp-03-25` - free experimental release of Gemini 2.5 Pro
-- `gemini-2.5-pro-preview-03-25` - paid preview of Gemini 2.5 Pro
-- `gemma-3-27b-it` - [Gemma 3](https://blog.google/technology/developers/gemma-3/) 27B
-- `gemini-2.0-pro-exp-02-05` - experimental release of Gemini 2.0 Pro
-- `gemini-2.0-flash-lite` - Gemini 2.0 Flash-Lite
-- `gemini-2.0-flash` - Gemini 2.0 Flash
-- `gemini-2.0-flash-thinking-exp-01-21` - experimental "thinking" model from January 2025
-- `gemini-2.0-flash-thinking-exp-1219` - experimental "thinking" model from December 2024
-- `learnlm-1.5-pro-experimental` - "an experimental task-specific model that has been trained to align with learning science principles" - [more details here](https://ai.google.dev/gemini-api/docs/learnlm).
-- `gemini-2.0-flash-exp` - [Gemini 2.0 Flash](https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/#gemini-2-0-flash)
-- `gemini-exp-1206` - recent experimental #3
-- `gemini-exp-1121` - recent experimental #2
-- `gemini-exp-1114` - recent experimental #1
-- `gemini-1.5-flash-8b-latest` - the least expensive
-- `gemini-1.5-flash-latest`
+## Available models
+<!-- [[[cog
+import cog
+from llm import cli
+from click.testing import CliRunner
+runner = CliRunner()
+result = runner.invoke(cli.cli, ["models", "-q", "gemini/"])
+lines = reversed(result.output.strip().split("\n"))
+to_output = []
+NOTES = {
+    "gemini/gemini-2.5-pro-preview-05-06": "Latest paid Gemini 2.5 Pro preview",
+    "gemini/gemini-2.5-flash-preview-05-20": "Gemini 2.5 Flash preview",
+    "gemini/gemini-2.5-flash-preview-04-17": "Earlier Gemini 2.5 Flash preview",
+    "gemini/gemini-2.5-pro-exp-03-25": "Free experimental release of Gemini 2.5 Pro",
+    "gemini/gemini-2.0-flash-thinking-exp-01-21": "Experimental \"thinking\" model from January 2025",
+    "gemini/gemini-1.5-flash-8b-latest": "The least expensive model",
+}
+for line in lines:
+    model_id, rest = line.split(None, 2)[1:]
+    note = NOTES.get(model_id, "")
+    to_output.append(
+        "- `{}`{}".format(
+            model_id,
+            ': {}'.format(note) if note else ""
+        )
+    )
+cog.out("\n".join(to_output))
+]]] -->
+- `gemini/gemini-2.5-flash-preview-05-20`: Gemini 2.5 Flash preview
+- `gemini/gemini-2.5-pro-preview-05-06`: Latest paid Gemini 2.5 Pro preview
+- `gemini/gemini-2.5-flash-preview-04-17`: Earlier Gemini 2.5 Flash preview
+- `gemini/gemini-2.5-pro-preview-03-25`
+- `gemini/gemini-2.5-pro-exp-03-25`: Free experimental release of Gemini 2.5 Pro
+- `gemini/gemini-2.0-flash-lite`
+- `gemini/gemini-2.0-pro-exp-02-05`
+- `gemini/gemini-2.0-flash`
+- `gemini/gemini-2.0-flash-thinking-exp-01-21`: Experimental "thinking" model from January 2025
+- `gemini/gemini-2.0-flash-thinking-exp-1219`
+- `gemini/gemma-3n-e4b-it`
+- `gemini/gemma-3-27b-it`
+- `gemini/gemma-3-12b-it`
+- `gemini/gemma-3-4b-it`
+- `gemini/gemma-3-1b-it`
+- `gemini/learnlm-1.5-pro-experimental`
+- `gemini/gemini-2.0-flash-exp`
+- `gemini/gemini-exp-1206`
+- `gemini/gemini-exp-1121`
+- `gemini/gemini-exp-1114`
+- `gemini/gemini-1.5-flash-8b-001`
+- `gemini/gemini-1.5-flash-8b-latest`: The least expensive model
+- `gemini/gemini-1.5-flash-002`
+- `gemini/gemini-1.5-pro-002`
+- `gemini/gemini-1.5-flash-001`
+- `gemini/gemini-1.5-pro-001`
+- `gemini/gemini-1.5-flash-latest`
+- `gemini/gemini-1.5-pro-latest`
+- `gemini/gemini-pro`
+<!-- [[[end]]] -->
+All of these models have aliases that omit the `gemini/` prefix, for example:
+```bash
+llm -m gemini-1.5-flash-8b-latest --schema 'name,age int,bio' 'invent a dog'
+```
 ### Images, audio and video
@@ -132,6 +178,31 @@ To chat interactively with the model, run `llm chat`:
 llm chat -m gemini-2.0-flash
 ```
+### Timeouts
+By default there is no `timeout` against the Gemini API. You can use the `timeout` option to protect against API requests that hang indefinitely.
+With the CLI tool that looks like this, to set a 1.5 second timeout:
+```bash
+llm -m gemini-2.5-flash-preview-05-20 'epic saga about mice' -o timeout 1.5
+```
+In the Python library timeouts are used like this:
+```python
+import httpx, llm
+model = llm.get_model("gemini/gemini-2.5-flash-preview-05-20")
+try:
+    response = model.prompt(
+        "epic saga about mice", timeout=1.5
+    )
+    print(response.text())
+except httpx.TimeoutException:
+    print("Timeout exceeded")
+```
+An `httpx.TimeoutException` subclass will be raised if the timeout is exceeded.
 ## Embeddings
 The plugin also adds support for the `gemini-embedding-exp-03-07` and `text-embedding-004` embedding models.
@@ -164,6 +235,21 @@ llm similar readmes -c 'upload csvs to stuff' -d embed.db
 See the [LLM embeddings documentation](https://llm.datasette.io/en/stable/embeddings/cli.html) for further details.
+## Listing all Gemini API models
+The `llm gemini models` command lists all of the models that are exposed by the Gemini API, some of which may not be available through this plugin.
+```bash
+llm gemini models
+```
+You can add a `--key X` option to use a different API key.
+To filter models by their supported generation methods use `--method` one or more times:
+```bash
+llm gemini models --method embedContent
+```
+If you provide multiple methods you will see models that support any of them.
 ## Development
 To set up this plugin locally, first checkout the code. Then create a new virtual environment:
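
As a reading aid for the cog block in the README hunk above, here is a minimal sketch of how that script appears to turn `llm models -q gemini/` output into the Markdown bullet list. The `SAMPLE_OUTPUT` text is an assumed shape for the CLI output, and `render_model_list` is a hypothetical helper; only the split-and-format logic is taken from the diff.

```python
# Assumed sample of `llm models -q gemini/` output; real output may differ.
SAMPLE_OUTPUT = """\
GeminiPro: gemini/gemini-pro (aliases: gemini-pro)
GeminiPro: gemini/gemini-1.5-flash-8b-latest (aliases: gemini-1.5-flash-8b-latest)
"""

# Subset of the NOTES mapping from the diff.
NOTES = {"gemini/gemini-1.5-flash-8b-latest": "The least expensive model"}


def render_model_list(output):
    """Turn CLI output into Markdown bullets, in reverse listing order."""
    bullets = []
    for line in reversed(output.strip().split("\n")):
        # Drop the leading class label, keep the model ID, ignore the rest.
        model_id, _rest = line.split(None, 2)[1:]
        note = NOTES.get(model_id, "")
        bullets.append(
            "- `{}`{}".format(model_id, ": {}".format(note) if note else "")
        )
    return "\n".join(bullets)


print(render_model_list(SAMPLE_OUTPUT))
```

Because the unpacking expects exactly two fields after the class label, this sketch (like the script it mirrors) relies on every listed line carrying an aliases suffix.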

{llm_gemini-0.20a1 → llm_gemini-0.21}/llm_gemini.egg-info/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: llm-gemini
-Version: 0.20a1
+Version: 0.21
 Summary: LLM plugin to access Google's Gemini family of models
 Author: Simon Willison
 License-Expression: Apache-2.0
@@ -10,7 +10,7 @@ Project-URL: Issues, https://github.com/simonw/llm-gemini/issues
 Project-URL: CI, https://github.com/simonw/llm-gemini/actions
 Description-Content-Type: text/markdown
 License-File: LICENSE
-Requires-Dist: llm>=0.26a0
+Requires-Dist: llm>=0.26
 Requires-Dist: httpx
 Requires-Dist: ijson
 Provides-Extra: test
@@ -18,6 +18,7 @@ Requires-Dist: pytest; extra == "test"
 Requires-Dist: pytest-recording; extra == "test"
 Requires-Dist: pytest-asyncio; extra == "test"
 Requires-Dist: nest-asyncio; extra == "test"
+Requires-Dist: cogapp; extra == "test"
 Dynamic: license-file
 # llm-gemini
@@ -63,25 +64,71 @@ llm models default gemini-2.0-flash
 llm "A joke about a pelican and a walrus"
 ```
-Other models are:
-- `gemini-2.5-pro-preview-05-06` - latest paid Gemini 2.5 Pro preview
-- `gemini-2.5-flash-preview-04-17` - Gemini 2.5 Flash preview
-- `gemini-2.5-pro-exp-03-25` - free experimental release of Gemini 2.5 Pro
-- `gemini-2.5-pro-preview-03-25` - paid preview of Gemini 2.5 Pro
-- `gemma-3-27b-it` - [Gemma 3](https://blog.google/technology/developers/gemma-3/) 27B
-- `gemini-2.0-pro-exp-02-05` - experimental release of Gemini 2.0 Pro
-- `gemini-2.0-flash-lite` - Gemini 2.0 Flash-Lite
-- `gemini-2.0-flash` - Gemini 2.0 Flash
-- `gemini-2.0-flash-thinking-exp-01-21` - experimental "thinking" model from January 2025
-- `gemini-2.0-flash-thinking-exp-1219` - experimental "thinking" model from December 2024
-- `learnlm-1.5-pro-experimental` - "an experimental task-specific model that has been trained to align with learning science principles" - [more details here](https://ai.google.dev/gemini-api/docs/learnlm).
-- `gemini-2.0-flash-exp` - [Gemini 2.0 Flash](https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/#gemini-2-0-flash)
-- `gemini-exp-1206` - recent experimental #3
-- `gemini-exp-1121` - recent experimental #2
-- `gemini-exp-1114` - recent experimental #1
-- `gemini-1.5-flash-8b-latest` - the least expensive
-- `gemini-1.5-flash-latest`
+## Available models
+<!-- [[[cog
+import cog
+from llm import cli
+from click.testing import CliRunner
+runner = CliRunner()
+result = runner.invoke(cli.cli, ["models", "-q", "gemini/"])
+lines = reversed(result.output.strip().split("\n"))
+to_output = []
+NOTES = {
+    "gemini/gemini-2.5-pro-preview-05-06": "Latest paid Gemini 2.5 Pro preview",
+    "gemini/gemini-2.5-flash-preview-05-20": "Gemini 2.5 Flash preview",
+    "gemini/gemini-2.5-flash-preview-04-17": "Earlier Gemini 2.5 Flash preview",
+    "gemini/gemini-2.5-pro-exp-03-25": "Free experimental release of Gemini 2.5 Pro",
+    "gemini/gemini-2.0-flash-thinking-exp-01-21": "Experimental \"thinking\" model from January 2025",
+    "gemini/gemini-1.5-flash-8b-latest": "The least expensive model",
+}
+for line in lines:
+    model_id, rest = line.split(None, 2)[1:]
+    note = NOTES.get(model_id, "")
+    to_output.append(
+        "- `{}`{}".format(
+            model_id,
+            ': {}'.format(note) if note else ""
+        )
+    )
+cog.out("\n".join(to_output))
+]]] -->
+- `gemini/gemini-2.5-flash-preview-05-20`: Gemini 2.5 Flash preview
+- `gemini/gemini-2.5-pro-preview-05-06`: Latest paid Gemini 2.5 Pro preview
+- `gemini/gemini-2.5-flash-preview-04-17`: Earlier Gemini 2.5 Flash preview
+- `gemini/gemini-2.5-pro-preview-03-25`
+- `gemini/gemini-2.5-pro-exp-03-25`: Free experimental release of Gemini 2.5 Pro
+- `gemini/gemini-2.0-flash-lite`
+- `gemini/gemini-2.0-pro-exp-02-05`
+- `gemini/gemini-2.0-flash`
+- `gemini/gemini-2.0-flash-thinking-exp-01-21`: Experimental "thinking" model from January 2025
+- `gemini/gemini-2.0-flash-thinking-exp-1219`
+- `gemini/gemma-3n-e4b-it`
+- `gemini/gemma-3-27b-it`
+- `gemini/gemma-3-12b-it`
+- `gemini/gemma-3-4b-it`
+- `gemini/gemma-3-1b-it`
+- `gemini/learnlm-1.5-pro-experimental`
+- `gemini/gemini-2.0-flash-exp`
+- `gemini/gemini-exp-1206`
+- `gemini/gemini-exp-1121`
+- `gemini/gemini-exp-1114`
+- `gemini/gemini-1.5-flash-8b-001`
+- `gemini/gemini-1.5-flash-8b-latest`: The least expensive model
+- `gemini/gemini-1.5-flash-002`
+- `gemini/gemini-1.5-pro-002`
+- `gemini/gemini-1.5-flash-001`
+- `gemini/gemini-1.5-pro-001`
+- `gemini/gemini-1.5-flash-latest`
+- `gemini/gemini-1.5-pro-latest`
+- `gemini/gemini-pro`
+<!-- [[[end]]] -->
+All of these models have aliases that omit the `gemini/` prefix, for example:
+```bash
+llm -m gemini-1.5-flash-8b-latest --schema 'name,age int,bio' 'invent a dog'
+```
 ### Images, audio and video
@@ -154,6 +201,31 @@ To chat interactively with the model, run `llm chat`:
 llm chat -m gemini-2.0-flash
 ```
+### Timeouts
+By default there is no `timeout` against the Gemini API. You can use the `timeout` option to protect against API requests that hang indefinitely.
+With the CLI tool that looks like this, to set a 1.5 second timeout:
+```bash
+llm -m gemini-2.5-flash-preview-05-20 'epic saga about mice' -o timeout 1.5
+```
+In the Python library timeouts are used like this:
+```python
+import httpx, llm
+model = llm.get_model("gemini/gemini-2.5-flash-preview-05-20")
+try:
+    response = model.prompt(
+        "epic saga about mice", timeout=1.5
+    )
+    print(response.text())
+except httpx.TimeoutException:
+    print("Timeout exceeded")
+```
+An `httpx.TimeoutException` subclass will be raised if the timeout is exceeded.
 ## Embeddings
 The plugin also adds support for the `gemini-embedding-exp-03-07` and `text-embedding-004` embedding models.
@@ -186,6 +258,21 @@ llm similar readmes -c 'upload csvs to stuff' -d embed.db
 See the [LLM embeddings documentation](https://llm.datasette.io/en/stable/embeddings/cli.html) for further details.
+## Listing all Gemini API models
+The `llm gemini models` command lists all of the models that are exposed by the Gemini API, some of which may not be available through this plugin.
+```bash
+llm gemini models
+```
+You can add a `--key X` option to use a different API key.
+To filter models by their supported generation methods use `--method` one or more times:
+```bash
+llm gemini models --method embedContent
+```
+If you provide multiple methods you will see models that support any of them.
 ## Development
 To set up this plugin locally, first checkout the code. Then create a new virtual environment:

{llm_gemini-0.20a1 → llm_gemini-0.21}/llm_gemini.egg-info/requires.txt RENAMED

@@ -1,4 +1,4 @@
-llm>=0.26a0
+llm>=0.26
 httpx
 ijson
@@ -7,3 +7,4 @@ pytest
 pytest-recording
 pytest-asyncio
 nest-asyncio
+cogapp

{llm_gemini-0.20a1 → llm_gemini-0.21}/llm_gemini.py RENAMED

@@ -40,6 +40,7 @@ GOOGLE_SEARCH_MODELS = {
     "gemini-2.5-pro-exp-03-25",
     "gemini-2.5-flash-preview-04-17",
     "gemini-2.5-pro-preview-05-06",
+    "gemini-2.5-flash-preview-05-20",
 }
 # Older Google models used google_search_retrieval instead of google_search
@@ -54,14 +55,56 @@ GOOGLE_SEARCH_MODELS_USING_SEARCH_RETRIEVAL = {
 }
 THINKING_BUDGET_MODELS = {
+    "gemini-2.0-flash-thinking-exp-01-21",
+    "gemini-2.0-flash-thinking-exp-1219",
     "gemini-2.5-flash-preview-04-17",
+    "gemini-2.5-pro-exp-03-25",
+    "gemini-2.5-pro-preview-03-25",
+    "gemini-2.5-pro-preview-05-06",
+    "gemini-2.5-flash-preview-05-20",
+}
+NO_VISION_MODELS = {"gemma-3-1b-it", "gemma-3n-e4b-it"}
+ATTACHMENT_TYPES = {
+    # Text
+    "text/plain",
+    "text/csv",
+    # PDF
+    "application/pdf",
+    # Images
+    "image/png",
+    "image/jpeg",
+    "image/webp",
+    "image/heic",
+    "image/heif",
+    # Audio
+    "audio/wav",
+    "audio/mp3",
+    "audio/aiff",
+    "audio/aac",
+    "audio/ogg",
+    "application/ogg",
+    "audio/flac",
+    "audio/mpeg",  # Treated as audio/mp3
+    # Video
+    "video/mp4",
+    "video/mpeg",
+    "video/mov",
+    "video/avi",
+    "video/x-flv",
+    "video/mpg",
+    "video/webm",
+    "video/wmv",
+    "video/3gpp",
+    "video/quicktime",
 }
 @llm.hookimpl
 def register_models(register):
     # Register both sync and async versions of each model
-    for model_id in [
+    for model_id in (
         "gemini-pro",
         "gemini-1.5-pro-latest",
         "gemini-1.5-flash-latest",
@@ -76,6 +119,12 @@ def register_models(register):
         "gemini-exp-1206",
         "gemini-2.0-flash-exp",
         "learnlm-1.5-pro-experimental",
+        # Gemma 3 models:
+        "gemma-3-1b-it",
+        "gemma-3-4b-it",
+        "gemma-3-12b-it",  # 12th March 2025
+        "gemma-3-27b-it",
+        "gemma-3n-e4b-it",  # 20th May 2025
         "gemini-2.0-flash-thinking-exp-1219",
         "gemini-2.0-flash-thinking-exp-01-21",
         # Released 5th Feb 2025:
@@ -83,8 +132,6 @@ def register_models(register):
         "gemini-2.0-pro-exp-02-05",
         # Released 25th Feb 2025:
         "gemini-2.0-flash-lite",
-        # Released 12th March 2025:
-        "gemma-3-27b-it",
         # 25th March 2025:
         "gemini-2.5-pro-exp-03-25",
         # 4th April 2025 (paid):
@@ -93,22 +140,29 @@ def register_models(register):
         "gemini-2.5-flash-preview-04-17",
         # 6th May 2025:
         "gemini-2.5-pro-preview-05-06",
-    ]:
+        # 20th May 2025:
+        "gemini-2.5-flash-preview-05-20",
+    ):
         can_google_search = model_id in GOOGLE_SEARCH_MODELS
         can_thinking_budget = model_id in THINKING_BUDGET_MODELS
+        can_vision = model_id not in NO_VISION_MODELS
+        can_schema = "flash-thinking" not in model_id and "gemma-3" not in model_id
         register(
             GeminiPro(
                 model_id,
+                can_vision=can_vision,
                 can_google_search=can_google_search,
                 can_thinking_budget=can_thinking_budget,
-                can_schema="flash-thinking" not in model_id,
+                can_schema=can_schema,
             ),
             AsyncGeminiPro(
                 model_id,
+                can_vision=can_vision,
                 can_google_search=can_google_search,
                 can_thinking_budget=can_thinking_budget,
-                can_schema="flash-thinking" not in model_id,
+                can_schema=can_schema,
             ),
+            aliases=(model_id,),
         )
@@ -150,39 +204,7 @@ class _SharedGemini:
     supports_schema = True
     supports_tools = True
-    attachment_types = (
-        # Text
-        "text/plain",
-        "text/csv",
-        # PDF
-        "application/pdf",
-        # Images
-        "image/png",
-        "image/jpeg",
-        "image/webp",
-        "image/heic",
-        "image/heif",
-        # Audio
-        "audio/wav",
-        "audio/mp3",
-        "audio/aiff",
-        "audio/aac",
-        "audio/ogg",
-        "application/ogg",
-        "audio/flac",
-        "audio/mpeg",  # Treated as audio/mp3
-        # Video
-        "video/mp4",
-        "video/mpeg",
-        "video/mov",
-        "video/avi",
-        "video/x-flv",
-        "video/mpg",
-        "video/webm",
-        "video/wmv",
-        "video/3gpp",
-        "video/quicktime",
-    )
+    attachment_types = set()
     class Options(llm.Options):
         code_execution: Optional[bool] = Field(
@@ -228,6 +250,14 @@ class _SharedGemini:
             description="Output a valid JSON object {...}",
             default=None,
         )
+        timeout: Optional[float] = Field(
+            description=(
+                "The maximum time in seconds to wait for a response. "
+                "If the model does not respond within this time, "
+                "the request will be aborted."
+            ),
+            default=None,
+        )
     class OptionsWithGoogleSearch(Options):
         google_search: Optional[bool] = Field(
@@ -243,12 +273,14 @@ class _SharedGemini:
     def __init__(
         self,
-        model_id,
+        gemini_model_id,
+        can_vision=True,
         can_google_search=False,
         can_thinking_budget=False,
         can_schema=False,
     ):
-        self.model_id = model_id
+        self.model_id = "gemini/{}".format(gemini_model_id)
+        self.gemini_model_id = gemini_model_id
         self.can_google_search = can_google_search
         self.supports_schema = can_schema
         if can_google_search:
@@ -256,6 +288,8 @@ class _SharedGemini:
         self.can_thinking_budget = can_thinking_budget
         if can_thinking_budget:
             self.Options = self.OptionsWithThinkingBudget
+        if can_vision:
+            self.attachment_types = ATTACHMENT_TYPES
     def build_messages(self, prompt, conversation):
         messages = []
@@ -453,14 +487,14 @@ class _SharedGemini:
 class GeminiPro(_SharedGemini, llm.KeyModel):
     def execute(self, prompt, stream, response, conversation, key):
-        url = f"https://generativelanguage.googleapis.com/v1beta/models/{self.model_id}:streamGenerateContent"
+        url = f"https://generativelanguage.googleapis.com/v1beta/models/{self.gemini_model_id}:streamGenerateContent"
         gathered = []
         body = self.build_request_body(prompt, conversation)
         with httpx.stream(
             "POST",
             url,
-            timeout=None,
+            timeout=prompt.options.timeout,
             headers={"x-goog-api-key": self.get_key(key)},
             json=body,
         ) as http_response:
@@ -486,7 +520,7 @@ class GeminiPro(_SharedGemini, llm.KeyModel):
 class AsyncGeminiPro(_SharedGemini, llm.AsyncKeyModel):
     async def execute(self, prompt, stream, response, conversation, key):
-        url = f"https://generativelanguage.googleapis.com/v1beta/models/{self.model_id}:streamGenerateContent"
+        url = f"https://generativelanguage.googleapis.com/v1beta/models/{self.gemini_model_id}:streamGenerateContent"
         gathered = []
         body = self.build_request_body(prompt, conversation)
@@ -494,7 +528,7 @@ class AsyncGeminiPro(_SharedGemini, llm.AsyncKeyModel):
             async with client.stream(
                 "POST",
                 url,
-                timeout=None,
+                timeout=prompt.options.timeout,
                 headers={"x-goog-api-key": self.get_key(key)},
                 json=body,
             ) as http_response:
@@ -584,8 +618,20 @@ def register_commands(cli):
     @gemini.command()
     @click.option("--key", help="API key to use")
-    def models(key):
-        "List of Gemini models pulled from their API"
+    @click.option(
+        "methods",
+        "--method",
+        multiple=True,
+        help="Filter by supported generation methods",
+    )
+    def models(key, methods):
+        """
+        List of Gemini models pulled from their API
+        Use --method to filter by supported generation methods for example:
+        llm gemini models --method generateContent --method embedContent
+        """
         key = llm.get_key(key, "gemini", "LLM_GEMINI_KEY")
         if not key:
             raise click.ClickException(
@@ -594,7 +640,16 @@ def register_commands(cli):
         url = f"https://generativelanguage.googleapis.com/v1beta/models"
         response = httpx.get(url, headers={"x-goog-api-key": key})
         response.raise_for_status()
-        click.echo(json.dumps(response.json()["models"], indent=2))
+        models = response.json()["models"]
+        if methods:
+            models = [
+                model
+                for model in models
+                if any(
+                    method in model["supportedGenerationMethods"] for method in methods
+                )
+            ]
+        click.echo(json.dumps(models, indent=2))
     @gemini.command()
     @click.option("--key", help="API key to use")
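
To summarise the `register_models` and `__init__` hunks above, the following minimal sketch reproduces how 0.21 appears to derive each model's registered ID, its alias, and two of its capability flags. The `describe_model` helper and its return dictionary are hypothetical and not part of the plugin; the `NO_VISION_MODELS` contents and the `can_schema` expression are copied from the diff.

```python
# Hypothetical helper, not part of llm-gemini: mirrors the flag logic in the diff.
NO_VISION_MODELS = {"gemma-3-1b-it", "gemma-3n-e4b-it"}


def describe_model(gemini_model_id):
    """Return the ID, alias and capability flags derived for one model."""
    return {
        # The registered model ID now carries a "gemini/" prefix...
        "model_id": "gemini/{}".format(gemini_model_id),
        # ...while the bare ID is kept as an alias and used in the API URL.
        "alias": gemini_model_id,
        # Models in NO_VISION_MODELS keep an empty attachment_types set.
        "can_vision": gemini_model_id not in NO_VISION_MODELS,
        # Schemas are disabled for flash-thinking and Gemma 3 models.
        "can_schema": (
            "flash-thinking" not in gemini_model_id
            and "gemma-3" not in gemini_model_id
        ),
    }


print(describe_model("gemma-3-1b-it"))
print(describe_model("gemini-2.0-flash"))
```

Note that the substring check `"gemma-3" not in model_id` also matches `gemma-3n-e4b-it`, so that model is registered without schema support as well.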

{llm_gemini-0.20a1 → llm_gemini-0.21}/pyproject.toml RENAMED

@@ -1,13 +1,13 @@
 [project]
 name = "llm-gemini"
-version = "0.20a1"
+version = "0.21"
 description = "LLM plugin to access Google's Gemini family of models"
 readme = "README.md"
 authors = [{name = "Simon Willison"}]
 license = "Apache-2.0"
 classifiers = []
 dependencies = [
-    "llm>=0.26a0",
+    "llm>=0.26",
     "httpx",
     "ijson"
 ]
@@ -22,7 +22,7 @@ CI = "https://github.com/simonw/llm-gemini/actions"
 gemini = "llm_gemini"
 [project.optional-dependencies]
-test = ["pytest", "pytest-recording", "pytest-asyncio", "nest-asyncio"]
+test = ["pytest", "pytest-recording", "pytest-asyncio", "nest-asyncio", "cogapp"]
 [tool.pytest.ini_options]
 asyncio_mode = "strict"

{llm_gemini-0.20a1 → llm_gemini-0.21}/tests/test_gemini.py RENAMED

@@ -232,6 +232,14 @@ def test_cli_gemini_models(tmpdir, monkeypatch):
     result2 = runner.invoke(cli, ["gemini", "models", "--key", GEMINI_API_KEY])
     assert result2.exit_code == 0
     assert "gemini-1.5-flash-latest" in result2.output
+    # And with --method
+    result3 = runner.invoke(
+        cli, ["gemini", "models", "--key", GEMINI_API_KEY, "--method", "embedContent"]
+    )
+    assert result3.exit_code == 0
+    models = json.loads(result3.output)
+    for model in models:
+        assert "embedContent" in model["supportedGenerationMethods"]
 @pytest.mark.vcr
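
The `--method` filter exercised by this test can be tried without an API key. The sketch below applies the same any-of matching as the new `models` command to a hand-written list; `SAMPLE_MODELS` and `filter_by_methods` are hypothetical names, and the sample entries are invented placeholders rather than real API output.

```python
# Hypothetical sample data shaped like the "models" array in the API response.
SAMPLE_MODELS = [
    {"name": "models/example-chat", "supportedGenerationMethods": ["generateContent"]},
    {"name": "models/example-embed", "supportedGenerationMethods": ["embedContent"]},
    {"name": "models/example-other", "supportedGenerationMethods": ["countTokens"]},
]


def filter_by_methods(models, methods):
    """Keep models that support at least one of the requested methods."""
    if not methods:
        # No --method given: return the full list unchanged.
        return models
    return [
        model
        for model in models
        if any(
            method in model["supportedGenerationMethods"] for method in methods
        )
    ]


selected = filter_by_methods(SAMPLE_MODELS, ("generateContent", "embedContent"))
print([model["name"] for model in selected])
```

Passing several methods widens the result rather than narrowing it, which matches the README statement that multiple methods show models supporting any of them.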

{llm_gemini-0.20a1 → llm_gemini-0.21}/LICENSE RENAMED

File without changes

{llm_gemini-0.20a1 → llm_gemini-0.21}/llm_gemini.egg-info/SOURCES.txt RENAMED

File without changes

{llm_gemini-0.20a1 → llm_gemini-0.21}/llm_gemini.egg-info/dependency_links.txt RENAMED

File without changes

{llm_gemini-0.20a1 → llm_gemini-0.21}/llm_gemini.egg-info/entry_points.txt RENAMED

File without changes

{llm_gemini-0.20a1 → llm_gemini-0.21}/llm_gemini.egg-info/top_level.txt RENAMED

File without changes

{llm_gemini-0.20a1 → llm_gemini-0.21}/setup.cfg RENAMED

File without changes
