npm - @aigne/gemini - Versions diffs - 0.11.4 → 0.11.6 - Mend

@aigne/gemini 0.11.4 → 0.11.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5) hide show

package/CHANGELOG.md +29 -0
package/README.md +120 -1
package/lib/cjs/gemini-image-model.js +2 -2
package/lib/esm/gemini-image-model.js +2 -2
package/package.json +4 -4

package/CHANGELOG.md CHANGED Viewed

@@ -1,5 +1,34 @@
 # Changelog
+## [0.11.6](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.11.5...gemini-v0.11.6) (2025-09-01)
+### Bug Fixes
+* **transport:** improve HTTP client option handling and error serialization ([#445](https://github.com/AIGNE-io/aigne-framework/issues/445)) ([d3bcdd2](https://github.com/AIGNE-io/aigne-framework/commit/d3bcdd23ab8011a7d40fc157fd61eb240494c7a5))
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/openai bumped to 0.13.7
+  * devDependencies
+    * @aigne/core bumped to 1.57.5
+    * @aigne/test-utils bumped to 0.5.43
+## [0.11.5](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.11.4...gemini-v0.11.5) (2025-08-30)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/openai bumped to 0.13.6
+  * devDependencies
+    * @aigne/core bumped to 1.57.4
+    * @aigne/test-utils bumped to 0.5.42
 ## [0.11.4](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.11.3...gemini-v0.11.4) (2025-08-30)

package/README.md CHANGED Viewed

@@ -23,13 +23,14 @@ AIGNE Gemini SDK for integrating with Google's Gemini AI models within the [AIGN
 <picture>
   <source srcset="https://raw.githubusercontent.com/AIGNE-io/aigne-framework/main/assets/aigne-gemini-dark.png" media="(prefers-color-scheme: dark)">
   <source srcset="https://raw.githubusercontent.com/AIGNE-io/aigne-framework/main/assets/aigne-gemini.png" media="(prefers-color-scheme: light)">
-  <img src="https://raw.githubusercontent.com/AIGNE-io/aigne-framework/main/aigne-gemini.png" alt="AIGNE Arch" />
+  <img src="https://raw.githubusercontent.com/AIGNE-io/aigne-framework/main/assets/aigne-gemini.png" alt="AIGNE Arch" />
 </picture>
 ## Features
 * **Google Gemini API Integration**: Direct connection to Google's Gemini API services
 * **Chat Completions**: Support for Gemini's chat completions API with all available models
+* **Image Generation**: Support for both Imagen and Gemini image generation models
 * **Multimodal Support**: Built-in support for handling both text and image inputs
 * **Function Calling**: Support for function calling capabilities
 * **Streaming Responses**: Support for streaming responses for more responsive applications
@@ -60,6 +61,8 @@ pnpm add @aigne/gemini @aigne/core
 ## Basic Usage
+### Chat Model
 ```typescript file="test/gemini-chat-model.test.ts" region="example-gemini-chat-model"
 import { GeminiChatModel } from "@aigne/gemini";
@@ -86,6 +89,38 @@ console.log(result);
   */
 ```
+### Image Generation Model
+```typescript
+import { GeminiImageModel } from "@aigne/gemini";
+const model = new GeminiImageModel({
+  apiKey: "your-api-key", // Optional if set in env variables
+  model: "imagen-4.0-generate-001", // Default Imagen model
+});
+const result = await model.invoke({
+  prompt: "A serene mountain landscape at sunset with golden light",
+  n: 1,
+});
+console.log(result);
+/* Output:
+  {
+    images: [
+      {
+        base64: "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAA..."
+      }
+    ],
+    usage: {
+      inputTokens: 0,
+      outputTokens: 0
+    },
+    model: "imagen-4.0-generate-001"
+  }
+  */
+```
 ## Streaming Responses
 ```typescript file="test/gemini-chat-model.test.ts" region="example-gemini-chat-model-streaming"
@@ -119,6 +154,90 @@ console.log(fullText); // Output: "Hello from Gemini! I'm Google's helpful AI as
 console.log(json); // { model: "gemini-1.5-flash" }
 ```
+## Image Generation Parameters
+The `GeminiImageModel` supports different parameters depending on the model type:
+### Imagen Models (e.g., `imagen-4.0-generate-001`)
+- **`prompt`** (string): The text description of the image you want to generate
+- **`n`** (number): Number of images to generate (defaults to 1)
+- **`seed`** (number): Random seed for reproducible generation
+- **`safetyFilterLevel`** (string): Safety filter level for content moderation
+- **`personGeneration`** (string): Person generation settings
+- **`outputMimeType`** (string): Output image format (e.g., "image/png", "image/jpeg")
+- **`outputGcsUri`** (string): Google Cloud Storage URI for output
+- **`outputCompressionQuality`** (number): JPEG compression quality (1-100)
+- **`negativePrompt`** (string): Description of what to exclude from the image
+- **`language`** (string): Language for the prompt
+- **`includeSafetyAttributes`** (boolean): Include safety attributes in response
+- **`includeRaiReason`** (boolean): Include RAI reasoning in response
+- **`imageSize`** (string): Size of the generated image
+- **`guidanceScale`** (number): Guidance scale for generation
+- **`aspectRatio`** (string): Aspect ratio of the image
+- **`addWatermark`** (boolean): Add watermark to generated images
+### Gemini Models (e.g., `gemini-1.5-pro`)
+- **`prompt`** (string): The text description of the image you want to generate
+- **`n`** (number): Number of images to generate (defaults to 1)
+- **`temperature`** (number): Controls randomness in generation (0.0 to 1.0)
+- **`maxOutputTokens`** (number): Maximum number of tokens in response
+- **`topP`** (number): Nucleus sampling parameter
+- **`topK`** (number): Top-k sampling parameter
+- **`safetySettings`** (array): Safety settings for content generation
+- **`seed`** (number): Random seed for reproducible generation
+- **`stopSequences`** (array): Sequences that stop generation
+- **`systemInstruction`** (string): System-level instructions
+### Advanced Image Generation Example
+```typescript
+const result = await model.invoke({
+  prompt: "A futuristic cityscape with neon lights and flying cars",
+  model: "imagen-4.0-generate-001",
+  n: 2,
+  imageSize: "1024x1024",
+  aspectRatio: "1:1",
+  guidanceScale: 7.5,
+  negativePrompt: "blurry, low quality, distorted",
+  seed: 12345,
+  includeSafetyAttributes: true,
+  outputMimeType: "image/png"
+});
+```
+## Model Options
+You can also set default options when creating the model:
+```typescript
+const model = new GeminiImageModel({
+  apiKey: "your-api-key",
+  model: "imagen-4.0-generate-001",
+  modelOptions: {
+    safetyFilterLevel: "BLOCK_MEDIUM_AND_ABOVE",
+    includeSafetyAttributes: true,
+    outputMimeType: "image/png"
+  }
+});
+```
+## Environment Variables
+Set the following environment variable for automatic API key detection:
+```bash
+export GEMINI_API_KEY="your-gemini-api-key"
+```
+## API Reference
+For complete parameter details and advanced features:
+- **Imagen Models**: Refer to [Google GenAI Models.generateImages()](https://googleapis.github.io/js-genai/release_docs/classes/models.Models.html#generateimages)
+- **Gemini Models**: Refer to [Google GenAI Models.generateContent()](https://googleapis.github.io/js-genai/release_docs/classes/models.Models.html#generatecontent)
 ## License
 Elastic-2.0

package/lib/cjs/gemini-image-model.js CHANGED Viewed

@@ -142,8 +142,8 @@ class GeminiImageModel extends core_1.ImageModel {
         });
         const allImages = (response.candidates ?? [])
             .flatMap((candidate) => candidate.content?.parts ?? [])
-            .filter((part) => part?.inlineData?.data)
-            .map((part) => ({ base64: part.inlineData.data }));
+            .map((part) => (part.inlineData?.data ? { base64: part.inlineData?.data } : null))
+            .filter(type_utils_js_1.isNonNullable);
         return {
             images: allImages,
             usage: {

package/lib/esm/gemini-image-model.js CHANGED Viewed

@@ -139,8 +139,8 @@ export class GeminiImageModel extends ImageModel {
         });
         const allImages = (response.candidates ?? [])
             .flatMap((candidate) => candidate.content?.parts ?? [])
-            .filter((part) => part?.inlineData?.data)
-            .map((part) => ({ base64: part.inlineData.data }));
+            .map((part) => (part.inlineData?.data ? { base64: part.inlineData?.data } : null))
+            .filter(isNonNullable);
         return {
             images: allImages,
             usage: {

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@aigne/gemini",
-  "version": "0.11.4",
+  "version": "0.11.6",
   "description": "AIGNE Gemini SDK for integrating with Google's Gemini AI models",
   "publishConfig": {
     "access": "public"
@@ -37,7 +37,7 @@
   "dependencies": {
     "@google/genai": "^1.15.0",
     "zod": "^3.25.67",
-    "@aigne/openai": "^0.13.5"
+    "@aigne/openai": "^0.13.7"
   },
   "devDependencies": {
     "@types/bun": "^1.2.18",
@@ -45,8 +45,8 @@
     "npm-run-all": "^4.1.5",
     "rimraf": "^6.0.1",
     "typescript": "^5.8.3",
-    "@aigne/core": "^1.57.3",
-    "@aigne/test-utils": "^0.5.41"
+    "@aigne/test-utils": "^0.5.43",
+    "@aigne/core": "^1.57.5"
   },
   "scripts": {
     "lint": "tsc --noEmit",