PyPI - gemini-webapi - Versions diffs - 1.9.0__tar.gz → 1.10.0__tar.gz - Mend

gemini-webapi 1.9.0tar.gz → 1.10.0tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (37) hide show

{gemini_webapi-1.9.0 → gemini_webapi-1.10.0}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
-Metadata-Version: 2.2
+Metadata-Version: 2.4
 Name: gemini-webapi
-Version: 1.9.0
+Version: 1.10.0
 Summary: ✨ An elegant async Python wrapper for Google Gemini web app
 Author: UZQueen
 License:                     GNU AFFERO GENERAL PUBLIC LICENSE
@@ -679,6 +679,7 @@ License-File: LICENSE
 Requires-Dist: httpx[http2]~=0.28.1
 Requires-Dist: pydantic~=2.10.5
 Requires-Dist: loguru~=0.7.3
+Dynamic: license-file
 <p align="center">
     <img src="https://raw.githubusercontent.com/HanaokaYuzu/Gemini-API/master/assets/banner.png" width="55%" alt="Gemini Banner" align="center">
@@ -711,7 +712,7 @@ A reverse-engineered asynchronous python wrapper for [Google Gemini](https://gem
 ## Features
 - **Persistent Cookies** - Automatically refreshes cookies in background. Optimized for always-on services.
-- **ImageFx Support** - Supports retrieving images generated by ImageFx, Google's latest AI image generator.
+- **Image Generation** - Natively supports generating and modifying images with natural language.
 - **Extension Support** - Supports generating contents with [Gemini extensions](https://gemini.google.com/extensions) on, like YouTube and Gmail.
 - **Classified Outputs** - Automatically categorizes texts, web images and AI generated images in the response.
 - **Official Flavor** - Provides a simple and elegant interface inspired by [Google Generative AI](https://ai.google.dev/tutorials/python_quickstart)'s official API.
@@ -727,13 +728,12 @@ A reverse-engineered asynchronous python wrapper for [Google Gemini](https://gem
   - [Initialization](#initialization)
   - [Select language model](#select-language-model)
   - [Generate contents from text](#generate-contents-from-text)
-  - [Generate contents from image](#generate-contents-from-image)
+  - [Generate contents with files](#generate-contents-with-files)
   - [Conversations across multiple turns](#conversations-across-multiple-turns)
   - [Continue previous conversations](#continue-previous-conversations)
   - [Retrieve model's thought process](#retrieve-models-thought-process)
   - [Retrieve images in response](#retrieve-images-in-response)
-  - [Generate images with ImageFx](#generate-images-with-imagefx)
-  - [Save images to local files](#save-images-to-local-files)
+  - [Generate images with Imagen3](#generate-images-with-imagen3)
   - [Generate contents with Gemini extensions](#generate-contents-with-gemini-extensions)
   - [Check and switch to other reply candidates](#check-and-switch-to-other-reply-candidates)
   - [Control log level](#control-log-level)
@@ -867,15 +867,15 @@ asyncio.run(main())
 >
 > Simply use `print(response)` to get the same output if you just want to see the response text
-### Generate contents from image
+### Generate contents with files
-Gemini supports image recognition and generating contents from images. Optionally, you can pass images in a list of file data in `bytes` or their paths in `str` or `pathlib.Path` to `GeminiClient.generate_content` together with text prompt.
+Gemini supports file input, including images and documents. Optionally, you can pass files as a list of paths in `str` or `pathlib.Path` to `GeminiClient.generate_content` together with text prompt.
 ```python
 async def main():
     response = await client.generate_content(
-            "Describe each of these images",
-            images=["assets/banner.png", "assets/favicon.png"],
+            "Introduce the contents of these two files. Is there any connection between them?",
+            files=["assets/sample.pdf", Path("assets/banner.png")],
         )
     print(response.text)
@@ -889,9 +889,15 @@ If you want to keep conversation continuous, please use `GeminiClient.start_chat
 ```python
 async def main():
     chat = client.start_chat()
-    response1 = await chat.send_message("Briefly introduce Europe")
-    response2 = await chat.send_message("What's the population there?")
-    print(response1.text, response2.text, sep="\n\n----------------------------------\n\n")
+    response1 = await chat.send_message(
+        "Introduce the contents of these two files. Is there any connection between them?",
+        files=["assets/sample.pdf", Path("assets/banner.png")],
+    )
+    print(response1.text)
+    response2 = await chat.send_message(
+        "Use image generation tool to modify the banner with another font and design."
+    )
+    print(response2.text, response2.images, sep="\n\n----------------------------------\n\n")
 asyncio.run(main())
 ```
@@ -949,24 +955,27 @@ async def main():
 asyncio.run(main())
 ```
-### Generate images with ImageFx
+### Generate images with Imagen3
-In February 2022, Google introduced a new AI image generator called ImageFx and integrated it into Gemini. You can ask Gemini to generate images with ImageFx simply by natural language.
+You can ask Gemini to generate and modify images with Imagen3, Google's latest AI image generator, simply by natural language.
 > [!IMPORTANT]
 >
-> Google has some limitations on the image generation feature in Gemini, so its availability could be different per region/account. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/14286560) (as of February 15th, 2024):
+> Google has some limitations on the image generation feature in Gemini, so its availability could be different per region/account. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/14286560) (as of March 19th, 2025):
 >
-> > Image generation in Gemini Apps is available in most countries, except in the European Economic Area (EEA), Switzerland, and the UK. It’s only available for **English prompts**.
-> >
 > > This feature’s availability in any specific Gemini app is also limited to the supported languages and countries of that app.
 > >
 > > For now, this feature isn’t available to users under 18.
+> >
+> > To use this feature, you must be signed in to Gemini Apps.
+You can save images returned from Gemini to local by calling `Image.save()`. Optionally, you can specify the file path and file name by passing `path` and `filename` arguments to the function and skip images with invalid file names by passing `skip_invalid_filename=True`. Works for both `WebImage` and `GeneratedImage`.
 ```python
 async def main():
     response = await client.generate_content("Generate some pictures of cats")
-    for image in response.images:
+    for i, image in enumerate(response.images):
+        await image.save(path="temp/", filename=f"cat_{i}.png", verbose=True)
         print(image, "\n\n----------------------------------\n")
 asyncio.run(main())
@@ -976,32 +985,17 @@ asyncio.run(main())
 >
 > by default, when asked to send images (like the previous example), Gemini will send images fetched from web instead of generating images with AI model, unless you specifically require to "generate" images in your prompt. In this package, web images and generated images are treated differently as `WebImage` and `GeneratedImage`, and will be automatically categorized in the output.
-### Save images to local files
-You can save images returned from Gemini to local files under `/temp` by calling `Image.save()`. Optionally, you can specify the file path and file name by passing `path` and `filename` arguments to the function and skip images with invalid file names by passing `skip_invalid_filename=True`. Works for both `WebImage` and `GeneratedImage`.
-```python
-async def main():
-    response = await client.generate_content("Generate some pictures of cats")
-    for i, image in enumerate(response.images):
-        await image.save(path="temp/", filename=f"cat_{i}.png", verbose=True)
-asyncio.run(main())
-```
 ### Generate contents with Gemini extensions
 > [!IMPORTANT]
 >
-> To access Gemini extensions in API, you must activate them on the [Gemini website](https://gemini.google.com/extensions) first. Same as image generation, Google also has limitations on the availability of Gemini extensions. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/13695044) (as of February 18th, 2024):
+> To access Gemini extensions in API, you must activate them on the [Gemini website](https://gemini.google.com/extensions) first. Same as image generation, Google also has limitations on the availability of Gemini extensions. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/13695044) (as of March 19th, 2025):
 >
-> > To use extensions in Gemini Apps:
-> >
-> > Sign in with your personal Google Account that you manage on your own. Extensions, including the Google Workspace extension, are currently not available to Google Workspace accounts for school, business, or other organizations.
+> > To connect apps to Gemini, you must have Gemini Apps Activity on.
 > >
-> > Have Gemini Apps Activity on. Extensions are only available when Gemini Apps Activity is turned on.
+> > To use this feature, you must be signed in to Gemini Apps.
 > >
-> > Important: For now, extensions are available in **English, Japanese, and Korean** only.
+> > Important: If you’re under 18, Google Workspace and Maps apps currently only work with English prompts in Gemini.
 After activating extensions for your account, you can access them in your prompts either by natural language or by starting your prompt with "@" followed by the extension keyword.

{gemini_webapi-1.9.0 → gemini_webapi-1.10.0}/README.md RENAMED Viewed

@@ -29,7 +29,7 @@ A reverse-engineered asynchronous python wrapper for [Google Gemini](https://gem
 ## Features
 - **Persistent Cookies** - Automatically refreshes cookies in background. Optimized for always-on services.
-- **ImageFx Support** - Supports retrieving images generated by ImageFx, Google's latest AI image generator.
+- **Image Generation** - Natively supports generating and modifying images with natural language.
 - **Extension Support** - Supports generating contents with [Gemini extensions](https://gemini.google.com/extensions) on, like YouTube and Gmail.
 - **Classified Outputs** - Automatically categorizes texts, web images and AI generated images in the response.
 - **Official Flavor** - Provides a simple and elegant interface inspired by [Google Generative AI](https://ai.google.dev/tutorials/python_quickstart)'s official API.
@@ -45,13 +45,12 @@ A reverse-engineered asynchronous python wrapper for [Google Gemini](https://gem
   - [Initialization](#initialization)
   - [Select language model](#select-language-model)
   - [Generate contents from text](#generate-contents-from-text)
-  - [Generate contents from image](#generate-contents-from-image)
+  - [Generate contents with files](#generate-contents-with-files)
   - [Conversations across multiple turns](#conversations-across-multiple-turns)
   - [Continue previous conversations](#continue-previous-conversations)
   - [Retrieve model's thought process](#retrieve-models-thought-process)
   - [Retrieve images in response](#retrieve-images-in-response)
-  - [Generate images with ImageFx](#generate-images-with-imagefx)
-  - [Save images to local files](#save-images-to-local-files)
+  - [Generate images with Imagen3](#generate-images-with-imagen3)
   - [Generate contents with Gemini extensions](#generate-contents-with-gemini-extensions)
   - [Check and switch to other reply candidates](#check-and-switch-to-other-reply-candidates)
   - [Control log level](#control-log-level)
@@ -185,15 +184,15 @@ asyncio.run(main())
 >
 > Simply use `print(response)` to get the same output if you just want to see the response text
-### Generate contents from image
+### Generate contents with files
-Gemini supports image recognition and generating contents from images. Optionally, you can pass images in a list of file data in `bytes` or their paths in `str` or `pathlib.Path` to `GeminiClient.generate_content` together with text prompt.
+Gemini supports file input, including images and documents. Optionally, you can pass files as a list of paths in `str` or `pathlib.Path` to `GeminiClient.generate_content` together with text prompt.
 ```python
 async def main():
     response = await client.generate_content(
-            "Describe each of these images",
-            images=["assets/banner.png", "assets/favicon.png"],
+            "Introduce the contents of these two files. Is there any connection between them?",
+            files=["assets/sample.pdf", Path("assets/banner.png")],
         )
     print(response.text)
@@ -207,9 +206,15 @@ If you want to keep conversation continuous, please use `GeminiClient.start_chat
 ```python
 async def main():
     chat = client.start_chat()
-    response1 = await chat.send_message("Briefly introduce Europe")
-    response2 = await chat.send_message("What's the population there?")
-    print(response1.text, response2.text, sep="\n\n----------------------------------\n\n")
+    response1 = await chat.send_message(
+        "Introduce the contents of these two files. Is there any connection between them?",
+        files=["assets/sample.pdf", Path("assets/banner.png")],
+    )
+    print(response1.text)
+    response2 = await chat.send_message(
+        "Use image generation tool to modify the banner with another font and design."
+    )
+    print(response2.text, response2.images, sep="\n\n----------------------------------\n\n")
 asyncio.run(main())
 ```
@@ -267,24 +272,27 @@ async def main():
 asyncio.run(main())
 ```
-### Generate images with ImageFx
+### Generate images with Imagen3
-In February 2022, Google introduced a new AI image generator called ImageFx and integrated it into Gemini. You can ask Gemini to generate images with ImageFx simply by natural language.
+You can ask Gemini to generate and modify images with Imagen3, Google's latest AI image generator, simply by natural language.
 > [!IMPORTANT]
 >
-> Google has some limitations on the image generation feature in Gemini, so its availability could be different per region/account. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/14286560) (as of February 15th, 2024):
+> Google has some limitations on the image generation feature in Gemini, so its availability could be different per region/account. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/14286560) (as of March 19th, 2025):
 >
-> > Image generation in Gemini Apps is available in most countries, except in the European Economic Area (EEA), Switzerland, and the UK. It’s only available for **English prompts**.
-> >
 > > This feature’s availability in any specific Gemini app is also limited to the supported languages and countries of that app.
 > >
 > > For now, this feature isn’t available to users under 18.
+> >
+> > To use this feature, you must be signed in to Gemini Apps.
+You can save images returned from Gemini to local by calling `Image.save()`. Optionally, you can specify the file path and file name by passing `path` and `filename` arguments to the function and skip images with invalid file names by passing `skip_invalid_filename=True`. Works for both `WebImage` and `GeneratedImage`.
 ```python
 async def main():
     response = await client.generate_content("Generate some pictures of cats")
-    for image in response.images:
+    for i, image in enumerate(response.images):
+        await image.save(path="temp/", filename=f"cat_{i}.png", verbose=True)
         print(image, "\n\n----------------------------------\n")
 asyncio.run(main())
@@ -294,32 +302,17 @@ asyncio.run(main())
 >
 > by default, when asked to send images (like the previous example), Gemini will send images fetched from web instead of generating images with AI model, unless you specifically require to "generate" images in your prompt. In this package, web images and generated images are treated differently as `WebImage` and `GeneratedImage`, and will be automatically categorized in the output.
-### Save images to local files
-You can save images returned from Gemini to local files under `/temp` by calling `Image.save()`. Optionally, you can specify the file path and file name by passing `path` and `filename` arguments to the function and skip images with invalid file names by passing `skip_invalid_filename=True`. Works for both `WebImage` and `GeneratedImage`.
-```python
-async def main():
-    response = await client.generate_content("Generate some pictures of cats")
-    for i, image in enumerate(response.images):
-        await image.save(path="temp/", filename=f"cat_{i}.png", verbose=True)
-asyncio.run(main())
-```
 ### Generate contents with Gemini extensions
 > [!IMPORTANT]
 >
-> To access Gemini extensions in API, you must activate them on the [Gemini website](https://gemini.google.com/extensions) first. Same as image generation, Google also has limitations on the availability of Gemini extensions. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/13695044) (as of February 18th, 2024):
+> To access Gemini extensions in API, you must activate them on the [Gemini website](https://gemini.google.com/extensions) first. Same as image generation, Google also has limitations on the availability of Gemini extensions. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/13695044) (as of March 19th, 2025):
 >
-> > To use extensions in Gemini Apps:
-> >
-> > Sign in with your personal Google Account that you manage on your own. Extensions, including the Google Workspace extension, are currently not available to Google Workspace accounts for school, business, or other organizations.
+> > To connect apps to Gemini, you must have Gemini Apps Activity on.
 > >
-> > Have Gemini Apps Activity on. Extensions are only available when Gemini Apps Activity is turned on.
+> > To use this feature, you must be signed in to Gemini Apps.
 > >
-> > Important: For now, extensions are available in **English, Japanese, and Korean** only.
+> > Important: If you’re under 18, Google Workspace and Maps apps currently only work with English prompts in Gemini.
 After activating extensions for your account, you can access them in your prompts either by natural language or by starting your prompt with "@" followed by the extension keyword.

gemini_webapi-1.10.0/assets/sample.pdf ADDED Viewed

Binary file

{gemini_webapi-1.9.0 → gemini_webapi-1.10.0}/src/gemini_webapi/client.py RENAMED Viewed

@@ -13,6 +13,7 @@ from .exceptions import AuthError, APIError, TimeoutError, GeminiError
 from .types import WebImage, GeneratedImage, Candidate, ModelOutput
 from .utils import (
     upload_file,
+    parse_file_name,
     rotate_1psidts,
     get_access_token,
     load_browser_cookies,
@@ -263,7 +264,7 @@ class GeminiClient:
     async def generate_content(
         self,
         prompt: str,
-        images: list[bytes | str | Path] | None = None,
+        files: list[str | Path] | None = None,
         model: Model | str = Model.UNSPECIFIED,
         chat: Optional["ChatSession"] = None,
         **kwargs,
@@ -275,8 +276,8 @@ class GeminiClient:
         ----------
         prompt: `str`
             Prompt provided by user.
-        images: `list[bytes | str | Path]`, optional
-            List of image file paths or file data in bytes.
+        files: `list[str | Path]`, optional
+            List of file paths to be attached.
         model: `Model` | `str`, optional
             Specify the model to use for generation.
             Pass either a `gemini_webapi.constants.Model` enum or a model name string.
@@ -324,17 +325,17 @@ class GeminiClient:
                             None,
                             json.dumps(
                                 [
-                                    images
+                                    files
                                     and [
                                         prompt,
                                         0,
                                         None,
                                         [
                                             [
-                                                [await upload_file(image, self.proxy)],
-                                                "filename.jpg",
+                                                [await upload_file(file, self.proxy)],
+                                                parse_file_name(file),
                                             ]
-                                            for image in images
+                                            for file in files
                                         ],
                                     ]
                                     or [prompt],
@@ -361,18 +362,17 @@ class GeminiClient:
             try:
                 response_json = json.loads(response.text.split("\n")[2])
-                # Plain request
-                body = json.loads(response_json[0][2])
-                if not body[4]:
-                    # Request with thinking models
-                    body = json.loads(response_json[1][2])
-                if not body[4]:
-                    # Request with Gemini extensions enabled
-                    body = json.loads(response_json[4][2])
-                if not body[4]:
+                body = None
+                for part in response_json:
+                    try:
+                        main_part = json.loads(part[2])
+                        if main_part[4]:
+                            body = main_part
+                            break
+                    except (IndexError, TypeError, ValueError):
+                        continue
+                if not body:
                     raise Exception
             except Exception:
                 await self.close()
@@ -551,7 +551,7 @@ class ChatSession:
     async def send_message(
         self,
         prompt: str,
-        images: list[bytes | str | Path] | None = None,
+        files: list[str | Path] | None = None,
         **kwargs,
     ) -> ModelOutput:
         """
@@ -562,8 +562,8 @@ class ChatSession:
         ----------
         prompt: `str`
             Prompt provided by user.
-        images: `list[bytes | str | Path]`, optional
-            List of image file paths or file data in bytes.
+        files: `list[str | Path]`, optional
+            List of file paths to be attached.
         kwargs: `dict`, optional
             Additional arguments which will be passed to the post request.
             Refer to `httpx.AsyncClient.request` for more information.
@@ -588,7 +588,7 @@ class ChatSession:
         """
         return await self.geminiclient.generate_content(
-            prompt=prompt, images=images, model=self.model, chat=self, **kwargs
+            prompt=prompt, files=files, model=self.model, chat=self, **kwargs
         )
     def choose_candidate(self, index: int) -> ModelOutput:

{gemini_webapi-1.9.0 → gemini_webapi-1.10.0}/src/gemini_webapi/constants.py RENAMED Viewed

@@ -44,7 +44,7 @@ class Model(Enum):
         "gemini-2.0-flash-thinking-with-apps",
         {"x-goog-ext-525001261-jspb": '[null,null,null,null,"f8f8f5ea629f5d37"]'},
         False,
-    )
+    )  # Deprecated, should be removed in the future
     G_2_0_EXP_ADVANCED = (
         "gemini-2.0-exp-advanced",
         {"x-goog-ext-525001261-jspb": '[null,null,null,null,"b1e46a6037e6aa9f"]'},
@@ -54,17 +54,17 @@ class Model(Enum):
         "gemini-1.5-flash",
         {"x-goog-ext-525001261-jspb": '[null,null,null,null,"418ab5ea040b5c43"]'},
         False,
-    )
+    )  # Deprecated, should be removed in the future
     G_1_5_PRO = (
         "gemini-1.5-pro",
         {"x-goog-ext-525001261-jspb": '[null,null,null,null,"9d60dfae93c9ff1f"]'},
         True,
-    )
+    )  # Deprecated, should be removed in the future
     G_1_5_PRO_RESEARCH = (
         "gemini-1.5-pro-research",
         {"x-goog-ext-525001261-jspb": '[null,null,null,null,"e5a44cb1dae2b489"]'},
         True,
-    )
+    )  # Deprecated, should be removed in the future
     def __init__(self, name, header, advanced_only):
         self.model_name = name

{gemini_webapi-1.9.0 → gemini_webapi-1.10.0}/src/gemini_webapi/utils/__init__.py RENAMED Viewed

@@ -1,6 +1,6 @@
 from asyncio import Task
-from .upload_file import upload_file  # noqa: F401
+from .upload_file import upload_file, parse_file_name  # noqa: F401
 from .rotate_1psidts import rotate_1psidts  # noqa: F401
 from .get_access_token import get_access_token  # noqa: F401
 from .load_browser_cookies import load_browser_cookies  # noqa: F401

{gemini_webapi-1.9.0 → gemini_webapi-1.10.0}/src/gemini_webapi/utils/upload_file.py RENAMED Viewed

@@ -7,14 +7,14 @@ from ..constants import Endpoint, Headers
 @validate_call
-async def upload_file(file: bytes | str | Path, proxy: str | None = None) -> str:
+async def upload_file(file: str | Path, proxy: str | None = None) -> str:
     """
     Upload a file to Google's server and return its identifier.
     Parameters
     ----------
-    file : `bytes` | `str` | `Path`
-        File data in bytes, or path to the file to be uploaded.
+    file : `str` | `Path`
+        Path to the file to be uploaded.
     proxy: `str`, optional
         Proxy URL.
@@ -30,9 +30,8 @@ async def upload_file(file: bytes | str | Path, proxy: str | None = None) -> str
         If the upload request failed.
     """
-    if not isinstance(file, bytes):
-        with open(file, "rb") as f:
-            file = f.read()
+    with open(file, "rb") as f:
+        file = f.read()
     async with AsyncClient(http2=True, proxy=proxy) as client:
         response = await client.post(
@@ -43,3 +42,25 @@ async def upload_file(file: bytes | str | Path, proxy: str | None = None) -> str
         )
         response.raise_for_status()
         return response.text
+def parse_file_name(file: str | Path) -> str:
+    """
+    Parse the file name from the given path.
+    Parameters
+    ----------
+    file : `str` | `Path`
+        Path to the file.
+    Returns
+    -------
+    `str`
+        File name with extension.
+    """
+    file = Path(file)
+    if not file.is_file():
+        raise ValueError(f"{file} is not a valid file.")
+    return file.name

{gemini_webapi-1.9.0 → gemini_webapi-1.10.0}/src/gemini_webapi.egg-info/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
-Metadata-Version: 2.2
+Metadata-Version: 2.4
 Name: gemini-webapi
-Version: 1.9.0
+Version: 1.10.0
 Summary: ✨ An elegant async Python wrapper for Google Gemini web app
 Author: UZQueen
 License:                     GNU AFFERO GENERAL PUBLIC LICENSE
@@ -679,6 +679,7 @@ License-File: LICENSE
 Requires-Dist: httpx[http2]~=0.28.1
 Requires-Dist: pydantic~=2.10.5
 Requires-Dist: loguru~=0.7.3
+Dynamic: license-file
 <p align="center">
     <img src="https://raw.githubusercontent.com/HanaokaYuzu/Gemini-API/master/assets/banner.png" width="55%" alt="Gemini Banner" align="center">
@@ -711,7 +712,7 @@ A reverse-engineered asynchronous python wrapper for [Google Gemini](https://gem
 ## Features
 - **Persistent Cookies** - Automatically refreshes cookies in background. Optimized for always-on services.
-- **ImageFx Support** - Supports retrieving images generated by ImageFx, Google's latest AI image generator.
+- **Image Generation** - Natively supports generating and modifying images with natural language.
 - **Extension Support** - Supports generating contents with [Gemini extensions](https://gemini.google.com/extensions) on, like YouTube and Gmail.
 - **Classified Outputs** - Automatically categorizes texts, web images and AI generated images in the response.
 - **Official Flavor** - Provides a simple and elegant interface inspired by [Google Generative AI](https://ai.google.dev/tutorials/python_quickstart)'s official API.
@@ -727,13 +728,12 @@ A reverse-engineered asynchronous python wrapper for [Google Gemini](https://gem
   - [Initialization](#initialization)
   - [Select language model](#select-language-model)
   - [Generate contents from text](#generate-contents-from-text)
-  - [Generate contents from image](#generate-contents-from-image)
+  - [Generate contents with files](#generate-contents-with-files)
   - [Conversations across multiple turns](#conversations-across-multiple-turns)
   - [Continue previous conversations](#continue-previous-conversations)
   - [Retrieve model's thought process](#retrieve-models-thought-process)
   - [Retrieve images in response](#retrieve-images-in-response)
-  - [Generate images with ImageFx](#generate-images-with-imagefx)
-  - [Save images to local files](#save-images-to-local-files)
+  - [Generate images with Imagen3](#generate-images-with-imagen3)
   - [Generate contents with Gemini extensions](#generate-contents-with-gemini-extensions)
   - [Check and switch to other reply candidates](#check-and-switch-to-other-reply-candidates)
   - [Control log level](#control-log-level)
@@ -867,15 +867,15 @@ asyncio.run(main())
 >
 > Simply use `print(response)` to get the same output if you just want to see the response text
-### Generate contents from image
+### Generate contents with files
-Gemini supports image recognition and generating contents from images. Optionally, you can pass images in a list of file data in `bytes` or their paths in `str` or `pathlib.Path` to `GeminiClient.generate_content` together with text prompt.
+Gemini supports file input, including images and documents. Optionally, you can pass files as a list of paths in `str` or `pathlib.Path` to `GeminiClient.generate_content` together with text prompt.
 ```python
 async def main():
     response = await client.generate_content(
-            "Describe each of these images",
-            images=["assets/banner.png", "assets/favicon.png"],
+            "Introduce the contents of these two files. Is there any connection between them?",
+            files=["assets/sample.pdf", Path("assets/banner.png")],
         )
     print(response.text)
@@ -889,9 +889,15 @@ If you want to keep conversation continuous, please use `GeminiClient.start_chat
 ```python
 async def main():
     chat = client.start_chat()
-    response1 = await chat.send_message("Briefly introduce Europe")
-    response2 = await chat.send_message("What's the population there?")
-    print(response1.text, response2.text, sep="\n\n----------------------------------\n\n")
+    response1 = await chat.send_message(
+        "Introduce the contents of these two files. Is there any connection between them?",
+        files=["assets/sample.pdf", Path("assets/banner.png")],
+    )
+    print(response1.text)
+    response2 = await chat.send_message(
+        "Use image generation tool to modify the banner with another font and design."
+    )
+    print(response2.text, response2.images, sep="\n\n----------------------------------\n\n")
 asyncio.run(main())
 ```
@@ -949,24 +955,27 @@ async def main():
 asyncio.run(main())
 ```
-### Generate images with ImageFx
+### Generate images with Imagen3
-In February 2022, Google introduced a new AI image generator called ImageFx and integrated it into Gemini. You can ask Gemini to generate images with ImageFx simply by natural language.
+You can ask Gemini to generate and modify images with Imagen3, Google's latest AI image generator, simply by natural language.
 > [!IMPORTANT]
 >
-> Google has some limitations on the image generation feature in Gemini, so its availability could be different per region/account. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/14286560) (as of February 15th, 2024):
+> Google has some limitations on the image generation feature in Gemini, so its availability could be different per region/account. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/14286560) (as of March 19th, 2025):
 >
-> > Image generation in Gemini Apps is available in most countries, except in the European Economic Area (EEA), Switzerland, and the UK. It’s only available for **English prompts**.
-> >
 > > This feature’s availability in any specific Gemini app is also limited to the supported languages and countries of that app.
 > >
 > > For now, this feature isn’t available to users under 18.
+> >
+> > To use this feature, you must be signed in to Gemini Apps.
+You can save images returned from Gemini to local by calling `Image.save()`. Optionally, you can specify the file path and file name by passing `path` and `filename` arguments to the function and skip images with invalid file names by passing `skip_invalid_filename=True`. Works for both `WebImage` and `GeneratedImage`.
 ```python
 async def main():
     response = await client.generate_content("Generate some pictures of cats")
-    for image in response.images:
+    for i, image in enumerate(response.images):
+        await image.save(path="temp/", filename=f"cat_{i}.png", verbose=True)
         print(image, "\n\n----------------------------------\n")
 asyncio.run(main())
@@ -976,32 +985,17 @@ asyncio.run(main())
 >
 > by default, when asked to send images (like the previous example), Gemini will send images fetched from web instead of generating images with AI model, unless you specifically require to "generate" images in your prompt. In this package, web images and generated images are treated differently as `WebImage` and `GeneratedImage`, and will be automatically categorized in the output.
-### Save images to local files
-You can save images returned from Gemini to local files under `/temp` by calling `Image.save()`. Optionally, you can specify the file path and file name by passing `path` and `filename` arguments to the function and skip images with invalid file names by passing `skip_invalid_filename=True`. Works for both `WebImage` and `GeneratedImage`.
-```python
-async def main():
-    response = await client.generate_content("Generate some pictures of cats")
-    for i, image in enumerate(response.images):
-        await image.save(path="temp/", filename=f"cat_{i}.png", verbose=True)
-asyncio.run(main())
-```
 ### Generate contents with Gemini extensions
 > [!IMPORTANT]
 >
-> To access Gemini extensions in API, you must activate them on the [Gemini website](https://gemini.google.com/extensions) first. Same as image generation, Google also has limitations on the availability of Gemini extensions. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/13695044) (as of February 18th, 2024):
+> To access Gemini extensions in API, you must activate them on the [Gemini website](https://gemini.google.com/extensions) first. Same as image generation, Google also has limitations on the availability of Gemini extensions. Here's a summary copied from [official documentation](https://support.google.com/gemini/answer/13695044) (as of March 19th, 2025):
 >
-> > To use extensions in Gemini Apps:
-> >
-> > Sign in with your personal Google Account that you manage on your own. Extensions, including the Google Workspace extension, are currently not available to Google Workspace accounts for school, business, or other organizations.
+> > To connect apps to Gemini, you must have Gemini Apps Activity on.
 > >
-> > Have Gemini Apps Activity on. Extensions are only available when Gemini Apps Activity is turned on.
+> > To use this feature, you must be signed in to Gemini Apps.
 > >
-> > Important: For now, extensions are available in **English, Japanese, and Korean** only.
+> > Important: If you’re under 18, Google Workspace and Maps apps currently only work with English prompts in Gemini.
 After activating extensions for your account, you can access them in your prompts either by natural language or by starting your prompt with "@" followed by the extension keyword.

{gemini_webapi-1.9.0 → gemini_webapi-1.10.0}/src/gemini_webapi.egg-info/SOURCES.txt RENAMED Viewed

@@ -10,6 +10,7 @@ pyproject.toml
 assets/banner.png
 assets/favicon.png
 assets/logo.svg
+assets/sample.pdf
 src/gemini_webapi/__init__.py
 src/gemini_webapi/client.py
 src/gemini_webapi/constants.py

{gemini_webapi-1.9.0 → gemini_webapi-1.10.0}/tests/test_client_features.py RENAMED Viewed

@@ -32,17 +32,9 @@ class TestGeminiClient(unittest.IsolatedAsyncioTestCase):
     @logger.catch(reraise=True)
     async def test_thinking_model(self):
-        response = await self.geminiclient.generate_content(
-            "What's 1+1?", model=Model.G_2_0_FLASH_THINKING
-        )
-        logger.debug(response.thoughts)
-        logger.debug(response.text)
-    @logger.catch(reraise=True)
-    async def test_thinking_with_apps(self):
         response = await self.geminiclient.generate_content(
             "Tell me a fact about today in history and illustrate it with a youtube video",
-            model=Model.G_2_0_FLASH_THINKING_WITH_APPS,
+            model=Model.G_2_0_FLASH_THINKING,
         )
         logger.debug(response.thoughts)
         logger.debug(response.text)
@@ -61,10 +53,10 @@ class TestGeminiClient(unittest.IsolatedAsyncioTestCase):
             logger.debug(f"Model version ({model.model_name}): {response.text}")
     @logger.catch(reraise=True)
-    async def test_upload_image(self):
+    async def test_upload_files(self):
         response = await self.geminiclient.generate_content(
-            "Describe these images",
-            images=[Path("assets/banner.png"), "assets/favicon.png"],
+            "Introduce the contents of these two files. Is there any connection between them?",
+            files=["assets/sample.pdf", Path("assets/banner.png")],
         )
         logger.debug(response.text)
@@ -92,11 +84,14 @@ class TestGeminiClient(unittest.IsolatedAsyncioTestCase):
         chat = self.geminiclient.start_chat()
         response1 = await chat.send_message(
             "What's the difference between these two images?",
-            images=["assets/banner.png", "assets/favicon.png"],
+            files=["assets/banner.png", "assets/favicon.png"],
         )
         logger.debug(response1.text)
-        response2 = await chat.send_message("Tell me more.")
+        response2 = await chat.send_message(
+            "Use image generation tool to modify the banner with another font and design."
+        )
         logger.debug(response2.text)
+        logger.debug(response2.images)
     @logger.catch(reraise=True)
     async def test_send_web_image(self):
@@ -120,14 +115,6 @@ class TestGeminiClient(unittest.IsolatedAsyncioTestCase):
             self.assertTrue(image.url)
             logger.debug(image)
-    @logger.catch(reraise=True)
-    async def test_image_generation_failure(self):
-        response = await self.geminiclient.generate_content(
-            "Generate some pictures of people"
-        )
-        self.assertFalse(response.images)
-        logger.debug(response.text)
     @logger.catch(reraise=True)
     async def test_card_content(self):
         response = await self.geminiclient.generate_content("How is today's weather?")