@aigne/gemini 0.14.16-beta.9 → 0.14.17-beta

Files changed (14)

package/CHANGELOG.md +288 -0
package/lib/cjs/gemini-chat-model.d.ts +4 -0
package/lib/cjs/gemini-chat-model.js +99 -50
package/lib/cjs/gemini-image-model.d.ts +1 -1
package/lib/cjs/gemini-image-model.js +4 -4
package/lib/cjs/gemini-video-model.js +3 -3
package/lib/dts/gemini-chat-model.d.ts +4 -0
package/lib/dts/gemini-image-model.d.ts +1 -1
package/lib/esm/gemini-chat-model.d.ts +4 -0
package/lib/esm/gemini-chat-model.js +100 -51
package/lib/esm/gemini-image-model.d.ts +1 -1
package/lib/esm/gemini-image-model.js +4 -4
package/lib/esm/gemini-video-model.js +3 -3
package/package.json +4 -4

package/CHANGELOG.md CHANGED

@@ -1,5 +1,293 @@
 # Changelog
+## [0.14.17-beta](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16...gemini-v0.14.17-beta) (2026-01-20)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.73.0-beta
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.70-beta
+## [0.14.16](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.26...gemini-v0.14.16) (2026-01-16)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0
+    * @aigne/platform-helpers bumped to 0.6.7
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69
+## [0.14.16-beta.26](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.25...gemini-v0.14.16-beta.26) (2026-01-16)
+### Features
+* add dynamic model options resolution with getter pattern ([#708](https://github.com/AIGNE-io/aigne-framework/issues/708)) ([5ed5085](https://github.com/AIGNE-io/aigne-framework/commit/5ed5085203763c70194853c56edc13acf56d81c6))
+* add modalities support for chat model ([#454](https://github.com/AIGNE-io/aigne-framework/issues/454)) ([70d1bf6](https://github.com/AIGNE-io/aigne-framework/commit/70d1bf631f4e711235d89c6df8ee210a19179b30))
+* add prompt caching for OpenAI/Gemini/Anthropic and cache token display ([#838](https://github.com/AIGNE-io/aigne-framework/issues/838)) ([46c628f](https://github.com/AIGNE-io/aigne-framework/commit/46c628f180572ea1b955d1a9888aad6145204842))
+* add reasoningEffort option for chat model ([#680](https://github.com/AIGNE-io/aigne-framework/issues/680)) ([f69d232](https://github.com/AIGNE-io/aigne-framework/commit/f69d232d714d4a3e4946bdc8c6598747c9bcbd57))
+* add thinking support to Gemini chat models ([#650](https://github.com/AIGNE-io/aigne-framework/issues/650)) ([09b828b](https://github.com/AIGNE-io/aigne-framework/commit/09b828ba668d90cc6aac68a5e8190adb146b5e45))
+* **core:** add nested getter pattern support for model options ([#796](https://github.com/AIGNE-io/aigne-framework/issues/796)) ([824b2fe](https://github.com/AIGNE-io/aigne-framework/commit/824b2fe55cb2a24620e2bb73b470532918fa2996))
+* improve image model architecture and file handling ([#527](https://github.com/AIGNE-io/aigne-framework/issues/527)) ([4db50aa](https://github.com/AIGNE-io/aigne-framework/commit/4db50aa0387a1a0f045ca11aaa61613e36ca7597))
+* **models:** support gemini 3.x thinking level and thoughtSignature ([#760](https://github.com/AIGNE-io/aigne-framework/issues/760)) ([243f2d4](https://github.com/AIGNE-io/aigne-framework/commit/243f2d457792a20ba2b87378576092e6f88e319c))
+* **model:** support video model ([#647](https://github.com/AIGNE-io/aigne-framework/issues/647)) ([de81742](https://github.com/AIGNE-io/aigne-framework/commit/de817421ef1dd3246d0d8c51ff12f0a855658f9f))
+* support custom prefer input file type ([#469](https://github.com/AIGNE-io/aigne-framework/issues/469)) ([db0161b](https://github.com/AIGNE-io/aigne-framework/commit/db0161bbac52542c771ee2f40f361636b0668075))
+* support define agent by third library & orchestrator agent refactor ([#799](https://github.com/AIGNE-io/aigne-framework/issues/799)) ([7264b11](https://github.com/AIGNE-io/aigne-framework/commit/7264b11ab6eed787e928367f09aa08d254968d40))
+### Bug Fixes
+* add prefer input file type option for image model ([#536](https://github.com/AIGNE-io/aigne-framework/issues/536)) ([3cba8a5](https://github.com/AIGNE-io/aigne-framework/commit/3cba8a5562233a1567b49b6dd5c446c0760f5c4c))
+* bump version ([696560f](https://github.com/AIGNE-io/aigne-framework/commit/696560fa2673eddcb4d00ac0523fbbbde7273cb3))
+* bump version ([70d217c](https://github.com/AIGNE-io/aigne-framework/commit/70d217c8360dd0dda7f5f17011c4e92ec836e801))
+* bump version ([af04b69](https://github.com/AIGNE-io/aigne-framework/commit/af04b6931951afa35d52065430acc7fef4b10087))
+* bump version ([ba7ad18](https://github.com/AIGNE-io/aigne-framework/commit/ba7ad184fcf32b49bf0507a3cb638d20fb00690d))
+* bump version ([93a1c10](https://github.com/AIGNE-io/aigne-framework/commit/93a1c10cf35f88eaafe91092481f5d087bd5b3a9))
+* **core:** preserve Agent Skill in session compact and support complex tool result content ([#876](https://github.com/AIGNE-io/aigne-framework/issues/876)) ([edb86ae](https://github.com/AIGNE-io/aigne-framework/commit/edb86ae2b9cfe56a8f08b276f843606e310566cf))
+* **core:** simplify token-estimator logic for remaining characters ([45d43cc](https://github.com/AIGNE-io/aigne-framework/commit/45d43ccd3afd636cfb459eea2e6551e8f9c53765))
+* correct calculate token usage for gemini model ([7fd1328](https://github.com/AIGNE-io/aigne-framework/commit/7fd13289d3d0f8e062211f7c6dd5cb56e5318c1b))
+* correct run example & doc improvements ([#707](https://github.com/AIGNE-io/aigne-framework/issues/707)) ([f98fc5d](https://github.com/AIGNE-io/aigne-framework/commit/f98fc5df28fd6ce6134128c2f0e5395c1554b740))
+* **docs:** update video mode docs ([#695](https://github.com/AIGNE-io/aigne-framework/issues/695)) ([d691001](https://github.com/AIGNE-io/aigne-framework/commit/d69100169457c16c14f2f3e2f7fcd6b2a99330f3))
+* **gemini:** handle empty responses when files are present ([#648](https://github.com/AIGNE-io/aigne-framework/issues/648)) ([f4e259c](https://github.com/AIGNE-io/aigne-framework/commit/f4e259c5e5c687c347bb5cf29cbb0b5bf4d0d4a1))
+* **gemini:** implement retry mechanism for empty responses with structured output fallback ([#638](https://github.com/AIGNE-io/aigne-framework/issues/638)) ([d33c8bb](https://github.com/AIGNE-io/aigne-framework/commit/d33c8bb9711aadddef9687d6cf472a179cd8ed9c))
+* **gemini:** include thoughts token count in output token usage ([#669](https://github.com/AIGNE-io/aigne-framework/issues/669)) ([f6ff10c](https://github.com/AIGNE-io/aigne-framework/commit/f6ff10c33b0612a0bc416842c5a5bec3850a3fe6))
+* **gemini:** properly handle thinking level for gemini 3.x models ([#763](https://github.com/AIGNE-io/aigne-framework/issues/763)) ([a5dc892](https://github.com/AIGNE-io/aigne-framework/commit/a5dc8921635811ed9ca2ff9e3e0699006f79cf22))
+* **gemini:** return reasoningEffort in model options for gemini-3 ([#765](https://github.com/AIGNE-io/aigne-framework/issues/765)) ([682bfda](https://github.com/AIGNE-io/aigne-framework/commit/682bfda353b31fd432232baa57f8e0b0838eb76d))
+* **gemini:** should include at least one user message ([#521](https://github.com/AIGNE-io/aigne-framework/issues/521)) ([eb2752e](https://github.com/AIGNE-io/aigne-framework/commit/eb2752ed7d78f59c435ecc3ccb7227e804e3781e))
+* **gemini:** use StructuredOutputError to trigger retry for missing JSON response ([#660](https://github.com/AIGNE-io/aigne-framework/issues/660)) ([e8826ed](https://github.com/AIGNE-io/aigne-framework/commit/e8826ed96db57bfcce0b577881bf0d2fd828c269))
+* improve image model parameters ([#530](https://github.com/AIGNE-io/aigne-framework/issues/530)) ([d66b5ca](https://github.com/AIGNE-io/aigne-framework/commit/d66b5ca01e14baad2712cc1a84930cdb63703232))
+* improve test coverage tracking and reporting ([#903](https://github.com/AIGNE-io/aigne-framework/issues/903)) ([031144e](https://github.com/AIGNE-io/aigne-framework/commit/031144e74f29e882cffe52ffda8f7a18c76ace7f))
+* **model:** handle large video files by uploading to Files API ([#769](https://github.com/AIGNE-io/aigne-framework/issues/769)) ([5fd7661](https://github.com/AIGNE-io/aigne-framework/commit/5fd76613bd7301cc76bde933de2095a6d86f8c7e))
+* **models:** add image parameters support for video generation ([#684](https://github.com/AIGNE-io/aigne-framework/issues/684)) ([b048b7f](https://github.com/AIGNE-io/aigne-framework/commit/b048b7f92bd7a532dbdbeb6fb5fa5499bae6b953))
+* **models:** add imageConfig to gemini image model ([#621](https://github.com/AIGNE-io/aigne-framework/issues/621)) ([252de7a](https://github.com/AIGNE-io/aigne-framework/commit/252de7a10701c4f5302c2fff977c88e5e833b7b1))
+* **models:** add mineType for transform file ([#667](https://github.com/AIGNE-io/aigne-framework/issues/667)) ([155a173](https://github.com/AIGNE-io/aigne-framework/commit/155a173e75aff1dbe870a1305455a4300942e07a))
+* **models:** aigne hub video params ([#665](https://github.com/AIGNE-io/aigne-framework/issues/665)) ([d00f836](https://github.com/AIGNE-io/aigne-framework/commit/d00f8368422d8e3707b974e1aff06714731ebb28))
+* **models:** auto retry when got emtpy response from gemini ([#636](https://github.com/AIGNE-io/aigne-framework/issues/636)) ([9367cef](https://github.com/AIGNE-io/aigne-framework/commit/9367cef49ea4c0c87b8a36b454deb2efaee6886f))
+* **models:** enhance gemini model tool use with status fields ([#634](https://github.com/AIGNE-io/aigne-framework/issues/634)) ([067b175](https://github.com/AIGNE-io/aigne-framework/commit/067b175c8e31bb5b1a6d0fc5a5cfb2d070d8d709))
+* **models:** improve message structure handling and enable auto-message options ([#657](https://github.com/AIGNE-io/aigne-framework/issues/657)) ([233d70c](https://github.com/AIGNE-io/aigne-framework/commit/233d70cb292b937200fada8434f33d957d766ad6))
+* **models:** parallel tool calls for gemini model ([#844](https://github.com/AIGNE-io/aigne-framework/issues/844)) ([adfae33](https://github.com/AIGNE-io/aigne-framework/commit/adfae337709295b594a8f5da61213535d2ef61aa))
+* **model:** transform local file to base64 before request llm ([#462](https://github.com/AIGNE-io/aigne-framework/issues/462)) ([58ef5d7](https://github.com/AIGNE-io/aigne-framework/commit/58ef5d77046c49f3c4eed15b7f0cc283cbbcd74a))
+* **model:** updated default video duration settings for AI video models ([#663](https://github.com/AIGNE-io/aigne-framework/issues/663)) ([1203941](https://github.com/AIGNE-io/aigne-framework/commit/12039411aaef77ba665e8edfb0fe6f8097c43e39))
+* should not return local path from aigne hub service ([#460](https://github.com/AIGNE-io/aigne-framework/issues/460)) ([c959717](https://github.com/AIGNE-io/aigne-framework/commit/c95971774f7e84dbeb3313f60b3e6464e2bb22e4))
+* standardize file parameter naming across models ([#534](https://github.com/AIGNE-io/aigne-framework/issues/534)) ([f159a9d](https://github.com/AIGNE-io/aigne-framework/commit/f159a9d6af21ec0e99641996b150560929845845))
+* support gemini-2.0-flash model for image model ([#429](https://github.com/AIGNE-io/aigne-framework/issues/429)) ([5a0bba1](https://github.com/AIGNE-io/aigne-framework/commit/5a0bba197cf8785384b70302f86cf702d04b7fc4))
+* support optional field sturectured output for gemini ([#468](https://github.com/AIGNE-io/aigne-framework/issues/468)) ([70c6279](https://github.com/AIGNE-io/aigne-framework/commit/70c62795039a2862e3333f26707329489bf938de))
+* **transport:** improve HTTP client option handling and error serialization ([#445](https://github.com/AIGNE-io/aigne-framework/issues/445)) ([d3bcdd2](https://github.com/AIGNE-io/aigne-framework/commit/d3bcdd23ab8011a7d40fc157fd61eb240494c7a5))
+* update deps compatibility in CommonJS environment ([#580](https://github.com/AIGNE-io/aigne-framework/issues/580)) ([a1e35d0](https://github.com/AIGNE-io/aigne-framework/commit/a1e35d016405accb51c1aeb6a544503a1c78e912))
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.25
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.25
+## [0.14.16-beta.25](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.24...gemini-v0.14.16-beta.25) (2026-01-16)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.24
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.24
+## [0.14.16-beta.24](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.23...gemini-v0.14.16-beta.24) (2026-01-15)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.23
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.23
+## [0.14.16-beta.23](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.22...gemini-v0.14.16-beta.23) (2026-01-15)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.22
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.22
+## [0.14.16-beta.22](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.21...gemini-v0.14.16-beta.22) (2026-01-15)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.21
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.21
+## [0.14.16-beta.21](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.20...gemini-v0.14.16-beta.21) (2026-01-15)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.20
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.20
+## [0.14.16-beta.20](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.19...gemini-v0.14.16-beta.20) (2026-01-14)
+### Bug Fixes
+* improve test coverage tracking and reporting ([#903](https://github.com/AIGNE-io/aigne-framework/issues/903)) ([031144e](https://github.com/AIGNE-io/aigne-framework/commit/031144e74f29e882cffe52ffda8f7a18c76ace7f))
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.19
+    * @aigne/platform-helpers bumped to 0.6.7-beta.2
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.19
+## [0.14.16-beta.19](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.18...gemini-v0.14.16-beta.19) (2026-01-13)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.18
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.18
+## [0.14.16-beta.18](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.17...gemini-v0.14.16-beta.18) (2026-01-12)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.17
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.17
+## [0.14.16-beta.17](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.16...gemini-v0.14.16-beta.17) (2026-01-12)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.16
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.16
+## [0.14.16-beta.16](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.15...gemini-v0.14.16-beta.16) (2026-01-10)
+### Bug Fixes
+* **core:** simplify token-estimator logic for remaining characters ([45d43cc](https://github.com/AIGNE-io/aigne-framework/commit/45d43ccd3afd636cfb459eea2e6551e8f9c53765))
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.15
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.15
+## [0.14.16-beta.15](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.14...gemini-v0.14.16-beta.15) (2026-01-09)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.14
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.14
+## [0.14.16-beta.14](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.13...gemini-v0.14.16-beta.14) (2026-01-08)
+### Bug Fixes
+* bump version ([696560f](https://github.com/AIGNE-io/aigne-framework/commit/696560fa2673eddcb4d00ac0523fbbbde7273cb3))
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.13
+    * @aigne/platform-helpers bumped to 0.6.7-beta.1
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.13
+## [0.14.16-beta.13](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.12...gemini-v0.14.16-beta.13) (2026-01-07)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.12
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.12
+## [0.14.16-beta.12](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.11...gemini-v0.14.16-beta.12) (2026-01-06)
+### Bug Fixes
+* **core:** preserve Agent Skill in session compact and support complex tool result content ([#876](https://github.com/AIGNE-io/aigne-framework/issues/876)) ([edb86ae](https://github.com/AIGNE-io/aigne-framework/commit/edb86ae2b9cfe56a8f08b276f843606e310566cf))
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.11
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.11
+## [0.14.16-beta.11](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.10...gemini-v0.14.16-beta.11) (2026-01-06)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.10
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.10
+## [0.14.16-beta.10](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.9...gemini-v0.14.16-beta.10) (2026-01-02)
+### Dependencies
+* The following workspace dependencies were updated
+  * dependencies
+    * @aigne/core bumped to 1.72.0-beta.9
+  * devDependencies
+    * @aigne/test-utils bumped to 0.5.69-beta.9
 ## [0.14.16-beta.9](https://github.com/AIGNE-io/aigne-framework/compare/gemini-v0.14.16-beta.8...gemini-v0.14.16-beta.9) (2025-12-31)

package/lib/cjs/gemini-chat-model.d.ts CHANGED

@@ -67,6 +67,8 @@ export declare class GeminiChatModel extends ChatModel {
             $get: string;
         } | undefined;
     }> | undefined;
+    countTokens(input: ChatModelInput): Promise<number>;
+    private contentUnionToContent;
     process(input: ChatModelInput, options: AgentInvokeOptions): PromiseOrValue<AgentProcessResult<ChatModelOutput>>;
     protected thinkingBudgetModelMap: ({
         pattern: RegExp;
@@ -104,10 +106,12 @@ export declare class GeminiChatModel extends ChatModel {
         budget?: number;
         level?: ThinkingLevel;
     };
+    private getParameters;
     private processInput;
     private buildConfig;
     private buildTools;
     private buildVideoContentParts;
     private buildContents;
+    private contentToParts;
     private ensureMessagesHasUserMessage;
 }

package/lib/cjs/gemini-chat-model.js CHANGED

@@ -61,6 +61,40 @@ class GeminiChatModel extends core_1.ChatModel {
     get modelOptions() {
         return this.options?.modelOptions;
     }
+    async countTokens(input) {
+        const { model, ...request } = await this.getParameters(input);
+        const contents = [];
+        const { systemInstruction, tools } = request.config ?? {};
+        if (systemInstruction)
+            contents.push(this.contentUnionToContent(systemInstruction));
+        if (tools?.length)
+            contents.push({ role: "system", parts: [{ text: JSON.stringify(tools) }] });
+        contents.push(...[request.contents].flat().map(this.contentUnionToContent));
+        const tokens = (await this.googleClient.models.countTokens({
+            model,
+            contents,
+        })).totalTokens;
+        if (!(0, type_utils_js_1.isNil)(tokens))
+            return tokens;
+        return super.countTokens(input);
+    }
+    contentUnionToContent(content) {
+        if (typeof content === "object" && "parts" in content) {
+            return { role: "system", parts: content.parts };
+        }
+        else if (typeof content === "string") {
+            return { role: "system", parts: [{ text: content }] };
+        }
+        else if (Array.isArray(content)) {
+            return {
+                role: "system",
+                parts: content.map((i) => (typeof i === "string" ? { text: i } : i)),
+            };
+        }
+        else {
+            return { role: "system", parts: [content] };
+        }
+    }
     process(input, options) {
         return this.processInput(input, options);
     }
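
The `contentUnionToContent` helper added in this hunk normalizes several input shapes (a content object, a string, an array of strings or parts, or a single part) into one `{ role, parts }` object before token counting. The sketch below reproduces that normalization in isolation. The `Part`, `Content`, and `ContentUnion` types are simplified, hypothetical stand-ins for the SDK's types, and `toSystemContent` is an illustrative name. Like the diff, it tags every result with the `system` role.

```typescript
// Simplified, hypothetical stand-ins for the SDK's content types.
type Part = { text: string } | { inlineData: { data: string; mimeType: string } };
type Content = { role?: string; parts: Part[] };
type ContentUnion = Content | string | Part | Array<string | Part>;

// Normalize any accepted input shape into a single Content object.
function toSystemContent(content: ContentUnion): Content {
  // A bare string becomes one text part.
  if (typeof content === "string") {
    return { role: "system", parts: [{ text: content }] };
  }
  // An array may mix strings and parts; strings are wrapped as text parts.
  if (Array.isArray(content)) {
    return {
      role: "system",
      parts: content.map((item) => (typeof item === "string" ? { text: item } : item)),
    };
  }
  // A full content object keeps its parts; its original role is replaced.
  if ("parts" in content) {
    return { role: "system", parts: content.parts };
  }
  // Anything else is treated as a single part.
  return { role: "system", parts: [content] };
}
```

The branch order differs from the diff (strings and arrays are checked first) only so that TypeScript can narrow the union cleanly. Each input shape still maps to the same result.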
@@ -135,10 +169,10 @@ class GeminiChatModel extends core_1.ChatModel {
             budget = Math.min(m.max, budget);
         return { support: true, budget };
     }
-    async *processInput(input, options) {
+    async getParameters(input) {
         const { modelOptions = {} } = input;
         const model = modelOptions.model || this.credential.model;
-        const { contents, config } = await this.buildContents(input, options);
+        const { contents, config } = await this.buildContents(input);
         const thinkingBudget = this.getThinkingBudget(model, modelOptions.reasoningEffort);
         const parameters = {
             model,
@@ -160,6 +194,10 @@ class GeminiChatModel extends core_1.ChatModel {
                 ...(await this.buildConfig(input)),
             },
         };
+        return parameters;
+    }
+    async *processInput(input, options) {
+        const parameters = await this.getParameters(input);
         const response = await this.googleClient.models.generateContentStream(parameters);
         let usage = {
             inputTokens: 0,
@@ -211,7 +249,7 @@ class GeminiChatModel extends core_1.ChatModel {
                                     },
                                 };
                                 // Preserve thought_signature for 3.x models
-                                if (part.thoughtSignature && model.includes("gemini-3")) {
+                                if (part.thoughtSignature && parameters.model.includes("gemini-3")) {
                                     toolCall.metadata = {
                                         thoughtSignature: part.thoughtSignature,
                                     };
@@ -362,8 +400,8 @@ class GeminiChatModel extends core_1.ChatModel {
                         };
         return { tools, toolConfig: { functionCallingConfig } };
     }
-    async buildVideoContentParts(media, options) {
-        const { path: filePath, mimeType: fileMimeType } = await this.transformFileType("local", media, options);
+    async buildVideoContentParts(media) {
+        const { path: filePath, mimeType: fileMimeType } = await this.transformFileType("local", media);
         if (filePath) {
             const stats = await index_js_1.nodejs.fs.stat(filePath);
             const fileSizeInBytes = stats.size;
@@ -394,7 +432,7 @@ class GeminiChatModel extends core_1.ChatModel {
             }
         }
     }
-    async buildContents(input, options) {
+    async buildContents(input) {
         const result = {
             contents: [],
         };
@@ -438,55 +476,46 @@ class GeminiChatModel extends core_1.ChatModel {
                     .find((c) => c?.id === msg.toolCallId);
                 if (!call)
                     throw new Error(`Tool call not found: ${msg.toolCallId}`);
-                const output = (0, yaml_1.parse)(msg.content);
-                const isError = "error" in output && Boolean(input.error);
-                const response = {
-                    tool: call.function.name,
+                if (!msg.content)
+                    throw new Error("Tool call must have content");
+                // parse tool result as a record
+                let toolResult;
+                {
+                    let text;
+                    if (typeof msg.content === "string")
+                        text = msg.content;
+                    else if (msg.content?.length === 1) {
+                        const first = msg.content[0];
+                        if (first?.type === "text")
+                            text = first.text;
+                    }
+                    if (text) {
+                        try {
+                            const obj = (0, yaml_1.parse)(text);
+                            if ((0, type_utils_js_1.isRecord)(obj))
+                                toolResult = obj;
+                        }
+                        catch {
+                            // ignore
+                        }
+                        if (!toolResult)
+                            toolResult = { result: text };
+                    }
+                }
+                const functionResponse = {
+                    id: msg.toolCallId,
+                    name: call.function.name,
                 };
-                // NOTE: base on the documentation of gemini api, the content should include `output` field for successful result or `error` field for failed result,
-                // and base on the actual test, add a tool field presenting the tool name can improve the LLM understanding that which tool is called.
-                if (isError) {
-                    Object.assign(response, { status: "error" }, output);
+                if (toolResult) {
+                    functionResponse.response = toolResult;
                 }
                 else {
-                    Object.assign(response, { status: "success" });
-                    if ("output" in output) {
-                        Object.assign(response, output);
-                    }
-                    else {
-                        Object.assign(response, { output });
-                    }
+                    functionResponse.parts = await this.contentToParts(msg.content);
                 }
-                content.parts = [
-                    {
-                        functionResponse: {
-                            id: msg.toolCallId,
-                            name: call.function.name,
-                            response,
-                        },
-                    },
-                ];
+                content.parts = [{ functionResponse }];
             }
-            else if (typeof msg.content === "string") {
-                content.parts = [{ text: msg.content }];
-            }
-            else if (Array.isArray(msg.content)) {
-                content.parts = await Promise.all(msg.content.map(async (item) => {
-                    switch (item.type) {
-                        case "text":
-                            return { text: item.text };
-                        case "url":
-                            return { fileData: { fileUri: item.url, mimeType: item.mimeType } };
-                        case "file": {
-                            const part = await this.buildVideoContentParts(item, options);
-                            if (part)
-                                return part;
-                            return { inlineData: { data: item.data, mimeType: item.mimeType } };
-                        }
-                        case "local":
-                            throw new Error(`Unsupported local file: ${item.path}, it should be converted to base64 at ChatModel`);
-                    }
-                }));
+            else if (msg.content) {
+                content.parts = await this.contentToParts(msg.content);
             }
             return content;
         }))).filter(type_utils_js_1.isNonNullable);
@@ -497,6 +526,26 @@ class GeminiChatModel extends core_1.ChatModel {
         }
         return result;
     }
+    async contentToParts(content) {
+        if (typeof content === "string")
+            return [{ text: content }];
+        return Promise.all(content.map(async (item) => {
+            switch (item.type) {
+                case "text":
+                    return { text: item.text };
+                case "url":
+                    return { fileData: { fileUri: item.url, mimeType: item.mimeType } };
+                case "file": {
+                    const part = await this.buildVideoContentParts(item);
+                    if (part)
+                        return part;
+                    return { inlineData: { data: item.data, mimeType: item.mimeType } };
+                }
+                case "local":
+                    throw new Error(`Unsupported local file: ${item.path}, it should be converted to base64 at ChatModel`);
+            }
+        }));
+    }
     ensureMessagesHasUserMessage(systems, contents) {
         // no messages but system messages
         if (!contents.length && systems.length) {
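
The rewritten tool-message branch in `buildContents` above extracts a single text payload from the tool result, tries to parse it into a record, and falls back to `{ result: text }` when parsing fails or yields a non-record. When no text can be extracted, the diff sends the content as `parts` instead. The sketch below shows the parsing step on its own. It is an approximation: it uses `JSON.parse` where the package calls the `yaml` parser, and the `ToolContent` type and the `parseToolResult` name are hypothetical.

```typescript
// Hypothetical, simplified content item types for illustration.
type TextItem = { type: "text"; text: string };
type UrlItem = { type: "url"; url: string };
type ToolContent = string | Array<TextItem | UrlItem>;

// True for plain objects; arrays and null are excluded.
function isRecord(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null && !Array.isArray(value);
}

// Returns a record to send as the function response, or undefined when the
// content has no single text payload and should be sent as parts instead.
function parseToolResult(content: ToolContent): Record<string, unknown> | undefined {
  let text: string | undefined;
  if (typeof content === "string") {
    text = content;
  } else if (content.length === 1) {
    const first = content[0];
    if (first?.type === "text") text = first.text;
  }
  if (!text) return undefined;
  try {
    // Assumption: JSON.parse stands in for the YAML parser used in the diff.
    const parsed: unknown = JSON.parse(text);
    if (isRecord(parsed)) return parsed;
  } catch {
    // Not parseable; fall through to the plain-text wrapper.
  }
  return { result: text };
}
```

For example, `parseToolResult('{"a":1}')` yields `{ a: 1 }`, while `parseToolResult("plain")` and `parseToolResult("[1,2]")` are wrapped as `{ result: "plain" }` and `{ result: "[1,2]" }`, because neither input parses to a record.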

package/lib/cjs/gemini-image-model.d.ts CHANGED

@@ -28,7 +28,7 @@ export declare class GeminiImageModel extends ImageModel<GeminiImageModelInput,
      * @param input The input to process
      * @returns The generated response
      */
-    process(input: GeminiImageModelInput, options: AgentInvokeOptions): Promise<ImageModelOutput>;
+    process(input: GeminiImageModelInput, _options: AgentInvokeOptions): Promise<ImageModelOutput>;
     private generateImageByImagenModel;
     private generateImageByGeminiModel;
 }

package/lib/cjs/gemini-image-model.js CHANGED

@@ -52,7 +52,7 @@ class GeminiImageModel extends core_1.ImageModel {
      * @param input The input to process
      * @returns The generated response
      */
-    async process(input, options) {
+    async process(input, _options) {
         const model = input.modelOptions?.model || this.credential.model;
         const responseFormat = input.responseFormat || "base64";
         if (responseFormat === "url") {
@@ -61,7 +61,7 @@ class GeminiImageModel extends core_1.ImageModel {
         if (model.includes("imagen")) {
             return this.generateImageByImagenModel(input);
         }
-        return this.generateImageByGeminiModel(input, options);
+        return this.generateImageByGeminiModel(input);
     }
     async generateImageByImagenModel(input) {
         const model = input.modelOptions?.model || this.credential.model;
@@ -100,7 +100,7 @@ class GeminiImageModel extends core_1.ImageModel {
             model,
         };
     }
-    async generateImageByGeminiModel(input, options) {
+    async generateImageByGeminiModel(input) {
         const model = input.modelOptions?.model || this.credential.model;
         const mergedInput = { ...this.modelOptions, ...input.modelOptions, ...input };
         const inputKeys = [
@@ -135,7 +135,7 @@ class GeminiImageModel extends core_1.ImageModel {
             "imageConfig",
         ];
         const images = await Promise.all((0, type_utils_js_1.flat)(input.image).map(async (image) => {
-            const { data, mimeType } = await this.transformFileType("file", image, options);
+            const { data, mimeType } = await this.transformFileType("file", image);
             return { inlineData: { data, mimeType } };
         }));
         const response = await this.client.models.generateContent({

package/lib/cjs/gemini-video-model.js CHANGED

@@ -88,7 +88,7 @@ class GeminiVideoModel extends core_1.VideoModel {
         if (mergedInput.personGeneration)
             config.personGeneration = mergedInput.personGeneration;
         if (mergedInput.lastFrame) {
-            config.lastFrame = await this.transformFileType("file", mergedInput.lastFrame, options).then((file) => {
+            config.lastFrame = await this.transformFileType("file", mergedInput.lastFrame).then((file) => {
                 return {
                     imageBytes: file.data,
                     mimeType: file.mimeType,
@@ -97,7 +97,7 @@ class GeminiVideoModel extends core_1.VideoModel {
         }
         if (mergedInput.referenceImages) {
             config.referenceImages = await Promise.all(mergedInput.referenceImages.map(async (image) => {
-                return await this.transformFileType("file", image, options).then((file) => {
+                return await this.transformFileType("file", image).then((file) => {
                     return {
                         image: {
                             imageBytes: file.data,
@@ -113,7 +113,7 @@ class GeminiVideoModel extends core_1.VideoModel {
             config,
         };
         if (mergedInput.image) {
-            params.image = await this.transformFileType("file", mergedInput.image, options).then((file) => {
+            params.image = await this.transformFileType("file", mergedInput.image).then((file) => {
                 return {
                     imageBytes: file.data,
                     mimeType: file.mimeType,

package/lib/dts/gemini-chat-model.d.ts CHANGED

@@ -67,6 +67,8 @@ export declare class GeminiChatModel extends ChatModel {
             $get: string;
         } | undefined;
     }> | undefined;
+    countTokens(input: ChatModelInput): Promise<number>;
+    private contentUnionToContent;
     process(input: ChatModelInput, options: AgentInvokeOptions): PromiseOrValue<AgentProcessResult<ChatModelOutput>>;
     protected thinkingBudgetModelMap: ({
         pattern: RegExp;
@@ -104,10 +106,12 @@ export declare class GeminiChatModel extends ChatModel {
         budget?: number;
         level?: ThinkingLevel;
     };
+    private getParameters;
     private processInput;
     private buildConfig;
     private buildTools;
     private buildVideoContentParts;
     private buildContents;
+    private contentToParts;
     private ensureMessagesHasUserMessage;
 }
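
Note on the new `countTokens(input): Promise<number>` declaration: the accompanying `gemini-chat-model.js` changes in this diff call the Google client's `countTokens`, return its `totalTokens` when that value is not nil, and otherwise defer to `super.countTokens(input)`. The sketch below shows only that "prefer the remote count, fall back locally" pattern. `RemoteCounter`, `estimateTokens`, and the 4-characters-per-token estimate are hypothetical stand-ins, not part of `@aigne/gemini` or `@google/genai`.

```typescript
// Hypothetical result shape for illustration; not the real @google/genai type.
interface CountTokensResult {
  totalTokens?: number | null;
}

// Hypothetical remote counter, standing in for the provider call.
type RemoteCounter = (text: string) => Promise<CountTokensResult>;

// Rough local estimate used only when the remote count is missing
// (assumption: about 4 characters per token).
function estimateTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

// Prefer the provider's count; fall back to the local estimate when the
// provider returns null or undefined.
async function countTokensWithFallback(
  text: string,
  remote: RemoteCounter,
): Promise<number> {
  const { totalTokens } = await remote(text);
  if (totalTokens !== null && totalTokens !== undefined) return totalTokens;
  return estimateTokens(text);
}
```

The nullish check (rather than a truthiness check) matters because `0` is a valid token count, which is presumably why the diff uses `isNil` instead of `!tokens`.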

package/lib/dts/gemini-image-model.d.ts CHANGED

@@ -28,7 +28,7 @@ export declare class GeminiImageModel extends ImageModel<GeminiImageModelInput,
      * @param input The input to process
      * @returns The generated response
      */
-    process(input: GeminiImageModelInput, options: AgentInvokeOptions): Promise<ImageModelOutput>;
+    process(input: GeminiImageModelInput, _options: AgentInvokeOptions): Promise<ImageModelOutput>;
     private generateImageByImagenModel;
     private generateImageByGeminiModel;
 }

package/lib/esm/gemini-chat-model.d.ts CHANGED

@@ -67,6 +67,8 @@ export declare class GeminiChatModel extends ChatModel {
             $get: string;
         } | undefined;
     }> | undefined;
+    countTokens(input: ChatModelInput): Promise<number>;
+    private contentUnionToContent;
     process(input: ChatModelInput, options: AgentInvokeOptions): PromiseOrValue<AgentProcessResult<ChatModelOutput>>;
     protected thinkingBudgetModelMap: ({
         pattern: RegExp;
@@ -104,10 +106,12 @@ export declare class GeminiChatModel extends ChatModel {
         budget?: number;
         level?: ThinkingLevel;
     };
+    private getParameters;
     private processInput;
     private buildConfig;
     private buildTools;
     private buildVideoContentParts;
     private buildContents;
+    private contentToParts;
     private ensureMessagesHasUserMessage;
 }
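
Note on the new private `contentUnionToContent` member: in the `gemini-chat-model.js` changes in this diff it normalizes a value that may be a content object, a string, an array of parts or strings, or a single part into one `{ role: "system", parts }` object, so that system instructions can be included in the token-count request. A minimal sketch of that normalization follows. The `Part`, `Content`, and `ContentUnion` types are simplified, hypothetical versions of the `@google/genai` types, and `toSystemContent` is an illustrative name.

```typescript
// Simplified, hypothetical shapes; the real @google/genai types are broader.
type Part = { text: string } | { inlineData: { data: string; mimeType: string } };
type Content = { role?: string; parts: Part[] };
type ContentUnion = Content | Part | string | Array<Part | string>;

// Collapse every accepted input form into a single system-role content object.
function toSystemContent(content: ContentUnion): Content {
  // A bare string becomes one text part.
  if (typeof content === "string") {
    return { role: "system", parts: [{ text: content }] };
  }
  // An array may mix strings and parts; strings are wrapped as text parts.
  if (Array.isArray(content)) {
    return {
      role: "system",
      parts: content.map((i) => (typeof i === "string" ? { text: i } : i)),
    };
  }
  // A content object keeps its parts but has its role overwritten.
  if ("parts" in content) {
    return { role: "system", parts: content.parts };
  }
  // Anything else is treated as a single part.
  return { role: "system", parts: [content] };
}
```

The branch order here differs from the diff, which tests for `"parts" in content` first. The result is the same because an array has no `parts` property.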

package/lib/esm/gemini-chat-model.js CHANGED

@@ -1,7 +1,7 @@
 import { agentProcessResultToObject, ChatModel, StructuredOutputError, safeParseJSON, } from "@aigne/core";
 import { logger } from "@aigne/core/utils/logger.js";
 import { mergeUsage } from "@aigne/core/utils/model-utils.js";
-import { isNonNullable } from "@aigne/core/utils/type-utils.js";
+import { isNil, isNonNullable, isRecord, } from "@aigne/core/utils/type-utils.js";
 import { nodejs } from "@aigne/platform-helpers/nodejs/index.js";
 import { v7 } from "@aigne/uuid";
 import { createPartFromUri, createUserContent, FunctionCallingConfigMode, GoogleGenAI, ThinkingLevel, } from "@google/genai";
@@ -58,6 +58,40 @@ export class GeminiChatModel extends ChatModel {
     get modelOptions() {
         return this.options?.modelOptions;
     }
+    async countTokens(input) {
+        const { model, ...request } = await this.getParameters(input);
+        const contents = [];
+        const { systemInstruction, tools } = request.config ?? {};
+        if (systemInstruction)
+            contents.push(this.contentUnionToContent(systemInstruction));
+        if (tools?.length)
+            contents.push({ role: "system", parts: [{ text: JSON.stringify(tools) }] });
+        contents.push(...[request.contents].flat().map(this.contentUnionToContent));
+        const tokens = (await this.googleClient.models.countTokens({
+            model,
+            contents,
+        })).totalTokens;
+        if (!isNil(tokens))
+            return tokens;
+        return super.countTokens(input);
+    }
+    contentUnionToContent(content) {
+        if (typeof content === "object" && "parts" in content) {
+            return { role: "system", parts: content.parts };
+        }
+        else if (typeof content === "string") {
+            return { role: "system", parts: [{ text: content }] };
+        }
+        else if (Array.isArray(content)) {
+            return {
+                role: "system",
+                parts: content.map((i) => (typeof i === "string" ? { text: i } : i)),
+            };
+        }
+        else {
+            return { role: "system", parts: [content] };
+        }
+    }
     process(input, options) {
         return this.processInput(input, options);
     }
@@ -132,10 +166,10 @@ export class GeminiChatModel extends ChatModel {
             budget = Math.min(m.max, budget);
         return { support: true, budget };
     }
-    async *processInput(input, options) {
+    async getParameters(input) {
         const { modelOptions = {} } = input;
         const model = modelOptions.model || this.credential.model;
-        const { contents, config } = await this.buildContents(input, options);
+        const { contents, config } = await this.buildContents(input);
         const thinkingBudget = this.getThinkingBudget(model, modelOptions.reasoningEffort);
         const parameters = {
             model,
@@ -157,6 +191,10 @@ export class GeminiChatModel extends ChatModel {
                 ...(await this.buildConfig(input)),
             },
         };
+        return parameters;
+    }
+    async *processInput(input, options) {
+        const parameters = await this.getParameters(input);
         const response = await this.googleClient.models.generateContentStream(parameters);
         let usage = {
             inputTokens: 0,
@@ -208,7 +246,7 @@ export class GeminiChatModel extends ChatModel {
                                     },
                                 };
                                 // Preserve thought_signature for 3.x models
-                                if (part.thoughtSignature && model.includes("gemini-3")) {
+                                if (part.thoughtSignature && parameters.model.includes("gemini-3")) {
                                     toolCall.metadata = {
                                         thoughtSignature: part.thoughtSignature,
                                     };
@@ -359,8 +397,8 @@ export class GeminiChatModel extends ChatModel {
                         };
         return { tools, toolConfig: { functionCallingConfig } };
     }
-    async buildVideoContentParts(media, options) {
-        const { path: filePath, mimeType: fileMimeType } = await this.transformFileType("local", media, options);
+    async buildVideoContentParts(media) {
+        const { path: filePath, mimeType: fileMimeType } = await this.transformFileType("local", media);
         if (filePath) {
             const stats = await nodejs.fs.stat(filePath);
             const fileSizeInBytes = stats.size;
@@ -391,7 +429,7 @@ export class GeminiChatModel extends ChatModel {
             }
         }
     }
-    async buildContents(input, options) {
+    async buildContents(input) {
         const result = {
             contents: [],
         };
@@ -435,55 +473,46 @@ export class GeminiChatModel extends ChatModel {
                     .find((c) => c?.id === msg.toolCallId);
                 if (!call)
                     throw new Error(`Tool call not found: ${msg.toolCallId}`);
-                const output = parse(msg.content);
-                const isError = "error" in output && Boolean(input.error);
-                const response = {
-                    tool: call.function.name,
+                if (!msg.content)
+                    throw new Error("Tool call must have content");
+                // parse tool result as a record
+                let toolResult;
+                {
+                    let text;
+                    if (typeof msg.content === "string")
+                        text = msg.content;
+                    else if (msg.content?.length === 1) {
+                        const first = msg.content[0];
+                        if (first?.type === "text")
+                            text = first.text;
+                    }
+                    if (text) {
+                        try {
+                            const obj = parse(text);
+                            if (isRecord(obj))
+                                toolResult = obj;
+                        }
+                        catch {
+                            // ignore
+                        }
+                        if (!toolResult)
+                            toolResult = { result: text };
+                    }
+                }
+                const functionResponse = {
+                    id: msg.toolCallId,
+                    name: call.function.name,
                 };
-                // NOTE: base on the documentation of gemini api, the content should include `output` field for successful result or `error` field for failed result,
-                // and base on the actual test, add a tool field presenting the tool name can improve the LLM understanding that which tool is called.
-                if (isError) {
-                    Object.assign(response, { status: "error" }, output);
+                if (toolResult) {
+                    functionResponse.response = toolResult;
                 }
                 else {
-                    Object.assign(response, { status: "success" });
-                    if ("output" in output) {
-                        Object.assign(response, output);
-                    }
-                    else {
-                        Object.assign(response, { output });
-                    }
+                    functionResponse.parts = await this.contentToParts(msg.content);
                 }
-                content.parts = [
-                    {
-                        functionResponse: {
-                            id: msg.toolCallId,
-                            name: call.function.name,
-                            response,
-                        },
-                    },
-                ];
+                content.parts = [{ functionResponse }];
             }
-            else if (typeof msg.content === "string") {
-                content.parts = [{ text: msg.content }];
-            }
-            else if (Array.isArray(msg.content)) {
-                content.parts = await Promise.all(msg.content.map(async (item) => {
-                    switch (item.type) {
-                        case "text":
-                            return { text: item.text };
-                        case "url":
-                            return { fileData: { fileUri: item.url, mimeType: item.mimeType } };
-                        case "file": {
-                            const part = await this.buildVideoContentParts(item, options);
-                            if (part)
-                                return part;
-                            return { inlineData: { data: item.data, mimeType: item.mimeType } };
-                        }
-                        case "local":
-                            throw new Error(`Unsupported local file: ${item.path}, it should be converted to base64 at ChatModel`);
-                    }
-                }));
+            else if (msg.content) {
+                content.parts = await this.contentToParts(msg.content);
             }
             return content;
         }))).filter(isNonNullable);
@@ -494,6 +523,26 @@ export class GeminiChatModel extends ChatModel {
         }
         return result;
     }
+    async contentToParts(content) {
+        if (typeof content === "string")
+            return [{ text: content }];
+        return Promise.all(content.map(async (item) => {
+            switch (item.type) {
+                case "text":
+                    return { text: item.text };
+                case "url":
+                    return { fileData: { fileUri: item.url, mimeType: item.mimeType } };
+                case "file": {
+                    const part = await this.buildVideoContentParts(item);
+                    if (part)
+                        return part;
+                    return { inlineData: { data: item.data, mimeType: item.mimeType } };
+                }
+                case "local":
+                    throw new Error(`Unsupported local file: ${item.path}, it should be converted to base64 at ChatModel`);
+            }
+        }));
+    }
     ensureMessagesHasUserMessage(systems, contents) {
         // no messages but system messages
         if (!contents.length && systems.length) {
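
Note on the tool-result handling changed in `buildContents` above: the new code no longer builds a `{ tool, status, output }` wrapper. It extracts a single text (either a string content or a one-element array holding a text item), tries to parse it, uses the parsed value as `functionResponse.response` when it is a record, wraps the raw text as `{ result: text }` when it is not, and falls back to `functionResponse.parts` for any other content. The sketch below reproduces only the text-to-record step. `JSON.parse` stands in for the package's `parse` import, whose origin is not visible in this diff, and `isRecord`, `ToolContent`, and `toToolResult` are hypothetical local definitions.

```typescript
// Hypothetical content shapes, loosely modeled on the diff above.
type TextItem = { type: "text"; text: string };
type UrlItem = { type: "url"; url: string };
type ToolContent = string | Array<TextItem | UrlItem>;

// Local stand-in for the isRecord helper imported from @aigne/core (assumed behavior).
function isRecord(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null && !Array.isArray(value);
}

// Returns a record suitable for functionResponse.response, or undefined when
// the content is not a single piece of text (the caller would then send parts).
function toToolResult(content: ToolContent): Record<string, unknown> | undefined {
  let text: string | undefined;
  if (typeof content === "string") {
    text = content;
  } else if (content.length === 1) {
    const first = content[0];
    if (first !== undefined && first.type === "text") text = first.text;
  }
  if (!text) return undefined;
  try {
    const obj: unknown = JSON.parse(text); // stand-in for the package's `parse`
    if (isRecord(obj)) return obj;
  } catch {
    // Not parseable: fall through and wrap the raw text.
  }
  return { result: text };
}
```

Under these assumptions, `toToolResult('{"a":1}')` yields `{ a: 1 }`, `toToolResult("plain")` yields `{ result: "plain" }`, and a single URL item yields `undefined`.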

package/lib/esm/gemini-image-model.d.ts CHANGED

@@ -28,7 +28,7 @@ export declare class GeminiImageModel extends ImageModel<GeminiImageModelInput,
      * @param input The input to process
      * @returns The generated response
      */
-    process(input: GeminiImageModelInput, options: AgentInvokeOptions): Promise<ImageModelOutput>;
+    process(input: GeminiImageModelInput, _options: AgentInvokeOptions): Promise<ImageModelOutput>;
     private generateImageByImagenModel;
     private generateImageByGeminiModel;
 }

package/lib/esm/gemini-image-model.js CHANGED

@@ -49,7 +49,7 @@ export class GeminiImageModel extends ImageModel {
      * @param input The input to process
      * @returns The generated response
      */
-    async process(input, options) {
+    async process(input, _options) {
         const model = input.modelOptions?.model || this.credential.model;
         const responseFormat = input.responseFormat || "base64";
         if (responseFormat === "url") {
@@ -58,7 +58,7 @@ export class GeminiImageModel extends ImageModel {
         if (model.includes("imagen")) {
             return this.generateImageByImagenModel(input);
         }
-        return this.generateImageByGeminiModel(input, options);
+        return this.generateImageByGeminiModel(input);
     }
     async generateImageByImagenModel(input) {
         const model = input.modelOptions?.model || this.credential.model;
@@ -97,7 +97,7 @@ export class GeminiImageModel extends ImageModel {
             model,
         };
     }
-    async generateImageByGeminiModel(input, options) {
+    async generateImageByGeminiModel(input) {
         const model = input.modelOptions?.model || this.credential.model;
         const mergedInput = { ...this.modelOptions, ...input.modelOptions, ...input };
         const inputKeys = [
@@ -132,7 +132,7 @@ export class GeminiImageModel extends ImageModel {
             "imageConfig",
         ];
         const images = await Promise.all(flat(input.image).map(async (image) => {
-            const { data, mimeType } = await this.transformFileType("file", image, options);
+            const { data, mimeType } = await this.transformFileType("file", image);
             return { inlineData: { data, mimeType } };
         }));
         const response = await this.client.models.generateContent({

package/lib/esm/gemini-video-model.js CHANGED

@@ -85,7 +85,7 @@ export class GeminiVideoModel extends VideoModel {
         if (mergedInput.personGeneration)
             config.personGeneration = mergedInput.personGeneration;
         if (mergedInput.lastFrame) {
-            config.lastFrame = await this.transformFileType("file", mergedInput.lastFrame, options).then((file) => {
+            config.lastFrame = await this.transformFileType("file", mergedInput.lastFrame).then((file) => {
                 return {
                     imageBytes: file.data,
                     mimeType: file.mimeType,
@@ -94,7 +94,7 @@ export class GeminiVideoModel extends VideoModel {
         }
         if (mergedInput.referenceImages) {
             config.referenceImages = await Promise.all(mergedInput.referenceImages.map(async (image) => {
-                return await this.transformFileType("file", image, options).then((file) => {
+                return await this.transformFileType("file", image).then((file) => {
                     return {
                         image: {
                             imageBytes: file.data,
@@ -110,7 +110,7 @@ export class GeminiVideoModel extends VideoModel {
             config,
         };
         if (mergedInput.image) {
-            params.image = await this.transformFileType("file", mergedInput.image, options).then((file) => {
+            params.image = await this.transformFileType("file", mergedInput.image).then((file) => {
                 return {
                     imageBytes: file.data,
                     mimeType: file.mimeType,

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "@aigne/gemini",
-  "version": "0.14.16-beta.9",
+  "version": "0.14.17-beta",
   "description": "AIGNE Gemini SDK for integrating with Google's Gemini AI models",
   "publishConfig": {
     "access": "public"
@@ -40,8 +40,8 @@
     "yaml": "^2.8.1",
     "zod": "^3.25.67",
     "zod-to-json-schema": "^3.24.6",
-    "@aigne/platform-helpers": "^0.6.7-beta",
-    "@aigne/core": "^1.72.0-beta.8"
+    "@aigne/core": "^1.73.0-beta",
+    "@aigne/platform-helpers": "^0.6.7"
   },
   "devDependencies": {
     "@types/bun": "^1.2.22",
@@ -49,7 +49,7 @@
     "npm-run-all": "^4.1.5",
     "rimraf": "^6.0.1",
     "typescript": "^5.9.2",
-    "@aigne/test-utils": "^0.5.69-beta.8"
+    "@aigne/test-utils": "^0.5.70-beta"
   },
   "scripts": {
     "lint": "tsc --noEmit",