npm - @mmmbuto/zai-codex-bridge - Versions diffs - 0.4.0 → 0.4.2 - Mend

@mmmbuto/zai-codex-bridge 0.4.0 → 0.4.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7) hide show

package/CHANGELOG.md ADDED Viewed

@@ -0,0 +1,39 @@
+# Changelog
+All notable changes to this project will be documented in this file.
+The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
+and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [0.4.2] - 2026-01-16
+### Changed
+- Replaced the README with expanded setup, usage, and troubleshooting guidance
+- Clarified Codex provider configuration and proxy endpoint usage
+## [0.4.1] - 2026-01-16
+### Added
+- Tool calling support (MCP/function calls) when `ALLOW_TOOLS=1`
+- Bridging for `function_call_output` items to Chat `role: tool` messages
+- Streaming support for `delta.tool_calls` with proper Responses API events
+- Non-streaming support for `msg.tool_calls` in final response
+- Tool call events: `response.output_item.added` (function_call), `response.function_call_arguments.delta`, `response.function_call_arguments.done`
+- Automated tool call test in test suite
+### Changed
+- `translateResponsesToChat()` now handles `type: function_call_output` items
+- `streamChatToResponses()` now detects and emits tool call events
+- `translateChatToResponses()` now includes `function_call` items in output array
+### Fixed
+- Tool responses (from MCP/function calls) are now correctly forwarded to upstream as `role: tool` messages
+- Function call items are now properly included in `response.completed` output array
+## [0.4.0] - Previous
+### Added
+- Initial release with Responses API to Chat Completions translation
+- Streaming support with SSE
+- Health check endpoint
+- Zero-dependency implementation

package/README.md CHANGED Viewed

@@ -1,6 +1,6 @@
-# Z.AI Codex Bridge
+# ZAI Codex Bridge
-> Local proxy that translates OpenAI Responses API format to Z.AI Chat Completions format
+> Local proxy that translates OpenAI **Responses API** ↔ Z.AI **Chat Completions** for Codex CLI
 [![npm](https://img.shields.io/npm/v/@mmmbuto/zai-codex-bridge?style=flat-square&logo=npm)](https://www.npmjs.org/package/@mmmbuto/zai-codex-bridge)
 [![node](https://img.shields.io/node/v/@mmmbuto/zai-codex-bridge?style=flat-square&logo=node.js)](https://github.com/DioNanos/zai-codex-bridge)
@@ -10,36 +10,39 @@
 ## What It Solves
-Codex uses the OpenAI **Responses API** format (with `instructions` and `input` fields), but Z.AI only supports the legacy **Chat Completions** format (with `messages` array).
+Newer **Codex CLI** versions speak the OpenAI **Responses API** (e.g. `/v1/responses`, with `instructions` + `input` + event-stream semantics).
+Some gateways/providers (including Z.AI endpoints) only expose legacy **Chat Completions** (`messages[]`).
 This proxy:
-1. Accepts Codex requests in **Responses format**
-2. Translates them to **Chat format**
+1. Accepts Codex requests in **Responses** format
+2. Translates them to **Chat Completions**
 3. Forwards to Z.AI
-4. Translates the response back to **Responses format**
+4. Translates back to **Responses** format (stream + non-stream)
 5. Returns to Codex
-**Without this proxy**, Codex fails with error from Z.AI:
+**Without this proxy**, Codex may fail (example from upstream error payloads):
 ```json
 {"error":{"code":"1214","message":"Incorrect role information"}}
 ```
+> If you’re using **codex-termux** and a gateway that doesn’t fully match the Responses API, this proxy is the recommended compatibility layer.
 ---
 ## Features
-- Transparent translation between Responses and Chat formats
+- Responses API ↔ Chat Completions translation (request + response)
 - Streaming support with SSE (Server-Sent Events)
-- Zero dependencies - uses Node.js built-ins only
-- Health checks at `/health` endpoint
-- Configurable via CLI flags and environment variables
+- Health check endpoint (`/health`)
+- Works on Linux/macOS/Windows (WSL) + Termux (ARM64)
+- **Optional tool/MCP bridging** (see “Tools / MCP” below)
+- Zero/low dependencies (Node built-ins only, unless noted by package.json)
 ---
 ## Requirements
-- **Node.js**: 18.0.0 or higher (for native `fetch`)
-- **Platform**: Linux, macOS, Windows (WSL), Termux (ARM64)
+- **Node.js**: 18+ (native `fetch`)
 - **Port**: 31415 (default, configurable)
 ---
@@ -54,28 +57,34 @@ npm install -g @mmmbuto/zai-codex-bridge
 ## Quick Start
-### 1. Start the Proxy
+### 1) Start the Proxy
 ```bash
 zai-codex-bridge
 ```
-The proxy will listen on `http://127.0.0.1:31415`
+Default listen address:
+- `http://127.0.0.1:31415`
-### 2. Configure Codex
+### 2) Configure Codex
-Add to `~/.codex/config.toml`:
+Add this provider to `~/.codex/config.toml`:
 ```toml
 [model_providers.zai_proxy]
 name = "ZAI via local proxy"
-base_url = "http://127.0.0.1:31415/v1"
+base_url = "http://127.0.0.1:31415"
 env_key = "OPENAI_API_KEY"
 wire_api = "responses"
 stream_idle_timeout_ms = 3000000
 ```
-### 3. Use with Codex
+> Notes:
+> - `base_url` is the server root. Codex will call `/v1/responses`; this proxy supports that path.
+> - We keep `env_key = "OPENAI_API_KEY"` because Codex expects that key name. You can store your Z.AI key there.
+### 3) Run Codex via the Proxy
 ```bash
 export OPENAI_API_KEY="your-zai-api-key"
@@ -84,6 +93,29 @@ codex -m "GLM-4.7" -c model_provider="zai_proxy"
 ---
+## Tools / MCP (optional)
+Codex tool-calling / MCP memory requires an additional compatibility layer:
+- Codex uses **Responses API tool events** (function call items + arguments delta/done, plus function_call_output inputs)
+- Some upstream models/providers may not emit tool calls (or may emit them in a different shape)
+This proxy can **attempt** to bridge tools when enabled:
+```bash
+export ALLOW_TOOLS=1
+```
+Important:
+- Tool support is **provider/model dependent**. If upstream never emits tool calls, the proxy can’t invent them.
+- If tools are enabled, the proxy must translate:
+  - Responses `tools` + `tool_choice` → Chat `tools` + `tool_choice`
+  - Chat `tool_calls` (stream/non-stream) → Responses function-call events
+  - Responses `function_call_output` → Chat `role=tool` messages
+(See repo changelog and docs for the exact implemented behavior.)
+---
 ## CLI Usage
 ```bash
@@ -97,7 +129,7 @@ zai-codex-bridge --port 8080
 zai-codex-bridge --log-level debug
 # Custom Z.AI endpoint
-zai-codex-bridge --zai-base-url https://custom.z.ai/v1
+zai-codex-bridge --zai-base-url https://api.z.ai/api/coding/paas/v4
 # Show help
 zai-codex-bridge --help
@@ -106,17 +138,20 @@ zai-codex-bridge --help
 ### Environment Variables
 ```bash
-export PORT=31415
 export HOST=127.0.0.1
+export PORT=31415
 export ZAI_BASE_URL=https://api.z.ai/api/coding/paas/v4
 export LOG_LEVEL=info
+# Optional
+export ALLOW_TOOLS=1
 ```
 ---
-## Auto-Starting Proxy with Codex
+## Auto-start the Proxy with Codex (recommended)
-You can create a shell function that starts the proxy automatically when needed:
+Use a shell function that starts the proxy only if needed:
 ```bash
 codex-with-zai() {
@@ -125,114 +160,107 @@ codex-with-zai() {
   local HEALTH="http://${HOST}:${PORT}/health"
   local PROXY_PID=""
-  # Start proxy only if not responding
   if ! curl -fsS "$HEALTH" >/dev/null 2>&1; then
     zai-codex-bridge --host "$HOST" --port "$PORT" >/dev/null 2>&1 &
     PROXY_PID=$!
     trap 'kill $PROXY_PID 2>/dev/null' EXIT INT TERM
-    sleep 2
+    sleep 1
   fi
-  # Run codex
-  codex -m "GLM-4.7" -c model_provider="zai_proxy" "$@"
+  codex -c model_provider="zai_proxy" "$@"
 }
 ```
 Usage:
 ```bash
-codex-with-zai
-# Proxy auto-starts, Codex runs
-# Ctrl+D exits both
+export OPENAI_API_KEY="your-zai-api-key"
+codex-with-zai -m "GLM-4.7"
 ```
 ---
 ## API Endpoints
-### `POST /responses`
-Accepts OpenAI Responses API format, translates to Chat, returns Responses format.
-### `POST /v1/responses`
-Same as `/responses` (for compatibility with Codex's path structure).
-### `GET /health`
-Health check endpoint.
+- `POST /responses` — accepts Responses API requests
+- `POST /v1/responses` — same as above (Codex default path)
+- `GET /health` — health check
 ---
-## Translation Details
+## Translation Overview
 ### Request: Responses → Chat
-```javascript
-// Input (Responses format)
+```js
+// Input (Responses)
 {
-  model: "GLM-4.7",
-  instructions: "Be helpful",
-  input: [
-    { role: "user", content: "Hello" }
-  ],
-  max_output_tokens: 1000
+  "model": "GLM-4.7",
+  "instructions": "Be helpful",
+  "input": [{ "role": "user", "content": "Hello" }],
+  "max_output_tokens": 1000
 }
-// Output (Chat format)
+// Output (Chat)
 {
-  model: "GLM-4.7",
-  messages: [
-    { role: "system", content: "Be helpful" },
-    { role: "user", content: "Hello" }
+  "model": "GLM-4.7",
+  "messages": [
+    { "role": "system", "content": "Be helpful" },
+    { "role": "user", "content": "Hello" }
   ],
-  max_tokens: 1000
+  "max_tokens": 1000
 }
 ```
-### Response: Chat → Responses
+### Response: Chat → Responses (simplified)
-```javascript
-// Input (Chat format)
+```js
+// Input (Chat)
 {
-  choices: [{
-    message: { content: "Hi there!" }
-  }],
-  usage: {
-    prompt_tokens: 10,
-    completion_tokens: 5
-  }
+  "choices": [{ "message": { "content": "Hi there!" } }],
+  "usage": { "prompt_tokens": 10, "completion_tokens": 5 }
 }
-// Output (Responses format)
+// Output (Responses - simplified)
 {
-  output: [{ value: "Hi there!", content_type: "text" }],
-  status: "completed",
-  usage: {
-    input_tokens: 10,
-    output_tokens: 5
-  }
+  "status": "completed",
+  "output": [{ "type": "message", "content": [{ "type": "output_text", "text": "Hi there!" }] }],
+  "usage": { "input_tokens": 10, "output_tokens": 5 }
 }
 ```
 ---
-## Testing
+## Troubleshooting
-```bash
-# Set your Z.AI API key
-export ZAI_API_KEY="sk-your-key"
+### 401 / “token expired or incorrect”
+- Verify the key is exported as `OPENAI_API_KEY` (or matches `env_key` in config.toml).
+- Make sure the proxy is not overwriting Authorization headers.
-# Run test suite
-npm run test:curl
-```
+### 404 on `/v1/responses`
+- Ensure `base_url` points to the proxy root (example: `http://127.0.0.1:31415`).
+- Confirm the proxy is running and `/health` returns `ok`.
+### 502 Bad Gateway
+- Proxy reached upstream but upstream failed. Enable debug:
+  ```bash
+  LOG_LEVEL=debug zai-codex-bridge
+  ```
 ---
-## Documentation
+## Versioning Policy
-Complete usage guide: [docs/guide.md](docs/guide.md)
+This repo follows **small, safe patch increments** while stabilizing provider compatibility:
+- Keep patch bumps only: `0.4.0 → 0.4.1 → 0.4.2 → ...`
+- No big jumps unless strictly necessary.
+(See `CHANGELOG.md` for details once present.)
 ---
 ## License
-MIT License - Copyright (c) 2026 Davide A. Guglielmi
+MIT License — Copyright (c) 2026 Davide A. Guglielmi
 See [LICENSE](LICENSE) for details.

package/RELEASING.md ADDED Viewed

@@ -0,0 +1,80 @@
+# Releasing
+This document describes the release process for zai-codex-bridge.
+## Version Policy
+- **Patch releases only** (0.4.0 → 0.4.1 → 0.4.2, etc.)
+- No minor or major bumps without explicit discussion
+- Always increment by +0.0.1 from current version
+## Release Steps
+### 1. Run Tests
+```bash
+# Set your API key
+export ZAI_API_KEY="sk-your-key"
+# Run test suite
+npm run test:curl
+# or
+npm test
+```
+### 2. Bump Version
+```bash
+# Use the release script (recommended)
+npm run release:patch
+# Or manually edit package.json and change:
+# "version": "0.4.0" -> "version": "0.4.1"
+```
+### 3. Update CHANGELOG.md
+Add an entry for the new version following [Keep a Changelog](https://keepachangelog.com/en/1.0.0/) format.
+### 4. Commit
+```bash
+git add package.json CHANGELOG.md
+git commit -m "chore: release v0.4.1"
+```
+### 5. Tag
+```bash
+git tag v0.4.1
+```
+### 6. Push (Optional)
+```bash
+git push
+git push --tags
+```
+### 7. Publish to npm
+```bash
+npm publish
+```
+## release:patch Script
+The `npm run release:patch` script:
+1. Verifies current version is 0.4.x
+2. Bumps patch version by +0.0.1
+3. Refuses to bump minor/major versions
+4. Updates package.json in-place
+Example:
+```bash
+$ npm run release:patch
+Current version: 0.4.0
+Bumping to: 0.4.1
+Updated package.json
+```

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@mmmbuto/zai-codex-bridge",
-  "version": "0.4.0",
+  "version": "0.4.2",
   "description": "Local proxy that translates OpenAI Responses API format to Z.AI Chat Completions format for Codex",
   "main": "src/server.js",
   "bin": {
@@ -8,7 +8,9 @@
   },
   "scripts": {
     "start": "node src/server.js",
-    "test:curl": "node scripts/test-curl.js"
+    "test": "node scripts/test-curl.js",
+    "test:curl": "node scripts/test-curl.js",
+    "release:patch": "node scripts/release-patch.js"
   },
   "keywords": [
     "codex",

package/scripts/release-patch.js ADDED Viewed

@@ -0,0 +1,60 @@
+#!/usr/bin/env node
+/**
+ * Safe patch version bumper
+ * Only allows patch releases (0.4.0 -> 0.4.1)
+ * Refuses minor/major bumps
+ */
+const fs = require('fs');
+const path = require('path');
+const PACKAGE_PATH = path.join(__dirname, '..', 'package.json');
+function bumpPatch(version) {
+  const parts = version.split('.').map(Number);
+  if (parts.length !== 3) {
+    throw new Error(`Invalid version format: ${version}`);
+  }
+  const [major, minor, patch] = parts;
+  // Only allow 0.4.x versions
+  if (major !== 0 || minor !== 4) {
+    console.error(`ERROR: Current version is ${version}`);
+    console.error('This script only supports patch releases for 0.4.x versions.');
+    console.error('For other version changes, edit package.json manually.');
+    process.exit(1);
+  }
+  const newVersion = `0.4.${patch + 1}`;
+  return newVersion;
+}
+function main() {
+  // Read package.json
+  const pkg = JSON.parse(fs.readFileSync(PACKAGE_PATH, 'utf8'));
+  const currentVersion = pkg.version;
+  console.log(`Current version: ${currentVersion}`);
+  // Bump patch
+  const newVersion = bumpPatch(currentVersion);
+  console.log(`Bumping to: ${newVersion}`);
+  // Update package.json
+  pkg.version = newVersion;
+  // Write back
+  fs.writeFileSync(PACKAGE_PATH, JSON.stringify(pkg, null, 2) + '\n');
+  console.log('Updated package.json');
+  console.log('\nNext steps:');
+  console.log('  1. Update CHANGELOG.md');
+  console.log('  2. Commit: git add package.json CHANGELOG.md && git commit -m "chore: release v' + newVersion + '"');
+  console.log('  3. Tag: git tag v' + newVersion);
+  console.log('  4. Publish: npm publish');
+}
+main();

package/scripts/test-curl.js CHANGED Viewed

@@ -135,6 +135,155 @@ async function testStreamingFormat() {
   });
 }
+async function testToolCall() {
+  console.log('\n=== Testing POST /v1/responses (Tool Call) ===\n');
+  console.log('Note: This test requires ALLOW_TOOLS=1 and upstream model support for tools.\n');
+  const payload = {
+    model: 'GLM-4.7',
+    instructions: 'You are a helpful assistant.',
+    input: [
+      {
+        role: 'user',
+        content: 'What is the weather in Tokyo? Use the get_weather tool.'
+      }
+    ],
+    tools: [
+      {
+        type: 'function',
+        function: {
+          name: 'get_weather',
+          description: 'Get the current weather for a location',
+          parameters: {
+            type: 'object',
+            properties: {
+              location: {
+                type: 'string',
+                description: 'The city and state, e.g. San Francisco, CA'
+              }
+            },
+            required: ['location']
+          }
+        }
+      }
+    ],
+    tool_choice: 'auto',
+    stream: true
+  };
+  return new Promise((resolve, reject) => {
+    const options = {
+      hostname: PROXY_HOST,
+      port: PROXY_PORT,
+      path: '/v1/responses',
+      method: 'POST',
+      headers: {
+        'Content-Type': 'application/json',
+        'Authorization': `Bearer ${ZAI_API_KEY}`
+      }
+    };
+    const req = http.request(options, (res) => {
+      console.log('Status:', res.statusCode);
+      if (res.statusCode !== 200) {
+        let body = '';
+        res.on('data', (chunk) => body += chunk);
+        res.on('end', () => {
+          console.log('Error response:', body);
+          resolve({ status: 'error', message: body });
+        });
+        return;
+      }
+      console.log('\nStreaming response:');
+      let buffer = '';
+      let foundToolCall = false;
+      let foundOutputItemAdded = false;
+      let foundFunctionCallDelta = false;
+      let foundOutputItemDone = false;
+      let foundResponseCompleted = false;
+      res.on('data', (chunk) => {
+        buffer += chunk.toString();
+        const events = buffer.split('\n\n');
+        buffer = events.pop() || '';
+        for (const evt of events) {
+          const lines = evt.split('\n');
+          for (const line of lines) {
+            if (!line.startsWith('data:')) continue;
+            const payload = line.slice(5).trim();
+            if (!payload || payload === '[DONE]') continue;
+            try {
+              const data = JSON.parse(payload);
+              const type = data.type;
+              // Look for tool call events
+              if (type === 'response.output_item.added') {
+                if (data.item?.type === 'function_call') {
+                  foundOutputItemAdded = true;
+                  console.log('[EVENT] output_item.added (function_call):', data.item?.name);
+                }
+              }
+              if (type === 'response.function_call_arguments.delta') {
+                foundFunctionCallDelta = true;
+                process.stdout.write('.');
+              }
+              if (type === 'response.output_item.done') {
+                if (data.item?.type === 'function_call') {
+                  foundOutputItemDone = true;
+                }
+              }
+              if (type === 'response.completed') {
+                foundResponseCompleted = true;
+              }
+            } catch (e) {
+              // Skip parse errors
+            }
+          }
+        }
+      });
+      res.on('end', () => {
+        console.log();
+        console.log('\n=== Tool Call Test Results ===');
+        if (!foundToolCall && !foundOutputItemAdded) {
+          console.log('SKIP: upstream did not return tool_calls');
+          console.log('This may mean:');
+          console.log('  - ALLOW_TOOLS is not enabled on the proxy');
+          console.log('  - The model does not support tool calls');
+          console.log('  - The prompt did not trigger a tool call');
+          resolve({ status: 'skipped', reason: 'no tool calls from upstream' });
+          return;
+        }
+        const passed = foundOutputItemAdded && foundFunctionCallDelta && foundOutputItemDone && foundResponseCompleted;
+        console.log('output_item.added (function_call):', foundOutputItemAdded ? 'PASS' : 'FAIL');
+        console.log('function_call_arguments.delta:', foundFunctionCallDelta ? 'PASS' : 'FAIL');
+        console.log('output_item.done (function_call):', foundOutputItemDone ? 'PASS' : 'FAIL');
+        console.log('response.completed:', foundResponseCompleted ? 'PASS' : 'FAIL');
+        console.log('\nOverall:', passed ? 'PASS' : 'FAIL');
+        resolve({ status: passed ? 'pass' : 'fail', results: { foundOutputItemAdded, foundFunctionCallDelta, foundOutputItemDone, foundResponseCompleted } });
+      });
+    });
+    req.on('error', (err) => {
+      console.error('Request error:', err.message);
+      reject(err);
+    });
+    req.write(JSON.stringify(payload, null, 2));
+    req.end();
+  });
+}
 async function main() {
   console.log('zai-codex-bridge Manual Test');
   console.log('================================');
@@ -151,7 +300,22 @@ async function main() {
     await testResponsesFormat();
     await testStreamingFormat();
+    // Tool call test (optional - depends on upstream support)
+    console.log('\n\n=== Tool Support Tests ===');
+    const toolResult = await testToolCall();
     console.log('\n=== All Tests Complete ===\n');
+    console.log('Summary:');
+    console.log('  Health: PASS');
+    console.log('  Non-streaming: PASS');
+    console.log('  Streaming: PASS');
+    if (toolResult.status === 'pass') {
+      console.log('  Tool calls: PASS');
+    } else if (toolResult.status === 'skipped') {
+      console.log('  Tool calls: SKIPPED (upstream does not support or did not return tool_calls)');
+    } else {
+      console.log('  Tool calls: FAIL or ERROR');
+    }
   } catch (error) {
     console.error('\nError:', error.message);
     process.exit(1);

package/src/server.js CHANGED Viewed

@@ -143,18 +143,41 @@ function translateResponsesToChat(request) {
         content: request.input
       });
     } else if (Array.isArray(request.input)) {
-      // Array of ResponseItem objects - filter only Message items with role
+      // Array of ResponseItem objects
       for (const item of request.input) {
+        // Handle function_call_output items (tool responses) - only if ALLOW_TOOLS
+        if (ALLOW_TOOLS && item.type === 'function_call_output') {
+          const toolMsg = {
+            role: 'tool',
+            tool_call_id: item.call_id || item.tool_call_id || '',
+            content: ''
+          };
+          // Extract content from output or content field
+          if (item.output !== undefined) {
+            toolMsg.content = typeof item.output === 'string'
+              ? item.output
+              : JSON.stringify(item.output);
+          } else if (item.content !== undefined) {
+            toolMsg.content = typeof item.content === 'string'
+              ? item.content
+              : JSON.stringify(item.content);
+          }
+          messages.push(toolMsg);
+          continue;
+        }
         // Only process items with a 'role' field (Message items)
         // Skip Reasoning, FunctionCall, LocalShellCall, etc.
         if (!item.role) continue;
         // Map non-standard roles to Z.AI-compatible roles
-        // Z.AI accepts: system, user, assistant
+        // Z.AI accepts: system, user, assistant, tool
         let role = item.role;
         if (role === 'developer') {
           role = 'user'; // Map developer to user
-        } else if (role !== 'system' && role !== 'user' && role !== 'assistant') {
+        } else if (role !== 'system' && role !== 'user' && role !== 'assistant' && role !== 'tool') {
           // Skip any other non-standard roles
           continue;
         }
@@ -238,6 +261,7 @@ function translateResponsesToChat(request) {
 /**
  * Translate Chat Completions response to Responses format
  * Handles both output_text and reasoning_text content
+ * Handles tool_calls if present (only if ALLOW_TOOLS)
  */
 function translateChatToResponses(chatResponse, responsesRequest, ids) {
   const msg = chatResponse.choices?.[0]?.message ?? {};
@@ -262,6 +286,27 @@ function translateChatToResponses(chatResponse, responsesRequest, ids) {
     content,
   };
+  // Build output array: message item + any function_call items
+  const finalOutput = [msgItem];
+  // Handle tool_calls (only if ALLOW_TOOLS)
+  if (ALLOW_TOOLS && msg.tool_calls && Array.isArray(msg.tool_calls)) {
+    for (const tc of msg.tool_calls) {
+      const callId = tc.id || `call_${randomUUID().replace(/-/g, '')}`;
+      const name = tc.function?.name || '';
+      const args = tc.function?.arguments || '';
+      finalOutput.push({
+        id: callId,
+        type: 'function_call',
+        status: 'completed',
+        call_id: callId,
+        name: name,
+        arguments: typeof args === 'string' ? args : JSON.stringify(args),
+      });
+    }
+  }
   return buildResponseObject({
     id: responseId,
     model: responsesRequest?.model || chatResponse.model || DEFAULT_MODEL,
@@ -269,7 +314,7 @@ function translateChatToResponses(chatResponse, responsesRequest, ids) {
     created_at: createdAt,
     completed_at: nowSec(),
     input: responsesRequest?.input || [],
-    output: [msgItem],
+    output: finalOutput,
     tools: responsesRequest?.tools || [],
   });
 }
@@ -400,6 +445,10 @@ async function streamChatToResponses(upstreamBody, res, responsesRequest, ids) {
   let out = '';
   let reasoning = '';
+  // Tool call tracking (only if ALLOW_TOOLS)
+  const toolCallsMap = new Map(); // index -> { callId, name, arguments, partialArgs }
+  let nextOutputIndex = 1; // After message item
   while (true) {
     const { done, value } = await reader.read();
     if (done) break;
@@ -428,6 +477,106 @@ async function streamChatToResponses(upstreamBody, res, responsesRequest, ids) {
         const delta = chunk.choices?.[0]?.delta || {};
+        // Handle tool_calls (only if ALLOW_TOOLS)
+        if (ALLOW_TOOLS && delta.tool_calls && Array.isArray(delta.tool_calls)) {
+          for (const tc of delta.tool_calls) {
+            const index = tc.index;
+            if (index == null) continue;
+            if (!toolCallsMap.has(index)) {
+              // New tool call - send output_item.added
+              const callId = tc.id || `call_${randomUUID().replace(/-/g, '')}`;
+              const name = tc.function?.name || '';
+              toolCallsMap.set(index, {
+                callId,
+                name,
+                arguments: '',
+                partialArgs: ''
+              });
+              const fnItemInProgress = {
+                id: callId,
+                type: 'function_call',
+                status: 'in_progress',
+                call_id: callId,
+                name: name,
+                arguments: '',
+              };
+              sse({
+                type: 'response.output_item.added',
+                output_index: nextOutputIndex,
+                item: fnItemInProgress,
+              });
+              if (name) {
+                sse({
+                  type: 'response.function_call_name.done',
+                  item_id: callId,
+                  output_index: nextOutputIndex,
+                  name: name,
+                });
+              }
+            }
+            const tcData = toolCallsMap.get(index);
+            // Handle name update if it comes later
+            if (tc.function?.name && !tcData.name) {
+              tcData.name = tc.function.name;
+              sse({
+                type: 'response.function_call_name.done',
+                item_id: tcData.callId,
+                output_index: OUTPUT_INDEX + index,
+                name: tcData.name,
+              });
+            }
+            // Handle arguments delta
+            if (tc.function?.arguments && typeof tc.function.arguments === 'string') {
+              tcData.partialArgs += tc.function.arguments;
+              sse({
+                type: 'response.function_call_arguments.delta',
+                item_id: tcData.callId,
+                output_index: OUTPUT_INDEX + index,
+                delta: tc.function.arguments,
+              });
+            }
+            // Check if this tool call is done (finish_reason comes later in the choice)
+            const finishReason = chunk.choices?.[0]?.finish_reason;
+            if (finishReason === 'tool_calls' || (tc.function?.arguments && tc.function.arguments.length > 0 && chunk.choices?.[0]?.delta !== null)) {
+              tcData.arguments = tcData.partialArgs;
+              sse({
+                type: 'response.function_call_arguments.done',
+                item_id: tcData.callId,
+                output_index: OUTPUT_INDEX + index,
+                arguments: tcData.arguments,
+              });
+              const fnItemDone = {
+                id: tcData.callId,
+                type: 'function_call',
+                status: 'completed',
+                call_id: tcData.callId,
+                name: tcData.name,
+                arguments: tcData.arguments,
+              };
+              sse({
+                type: 'response.output_item.done',
+                output_index: OUTPUT_INDEX + index,
+                item: fnItemDone,
+              });
+            }
+          }
+          // Skip to next iteration after handling tool_calls
+          continue;
+        }
         // NON mescolare reasoning in output_text
         if (typeof delta.reasoning_content === 'string' && delta.reasoning_content.length) {
           reasoning += delta.reasoning_content;
@@ -495,6 +644,21 @@ async function streamChatToResponses(upstreamBody, res, responsesRequest, ids) {
     item: msgItemDone,
   });
+  // Build final output array: message item + any function_call items
+  const finalOutput = [msgItemDone];
+  if (ALLOW_TOOLS && toolCallsMap.size > 0) {
+    for (const [index, tcData] of toolCallsMap.entries()) {
+      finalOutput.push({
+        id: tcData.callId,
+        type: 'function_call',
+        status: 'completed',
+        call_id: tcData.callId,
+        name: tcData.name,
+        arguments: tcData.arguments,
+      });
+    }
+  }
   const completed = buildResponseObject({
     id: responseId,
     model: responsesRequest?.model || DEFAULT_MODEL,
@@ -502,14 +666,14 @@ async function streamChatToResponses(upstreamBody, res, responsesRequest, ids) {
     created_at: createdAt,
     completed_at: nowSec(),
     input: responsesRequest?.input || [],
-    output: [msgItemDone],
+    output: finalOutput,
     tools: responsesRequest?.tools || [],
   });
   sse({ type: 'response.completed', response: completed });
   res.end();
-  log('info', `Stream completed - ${out.length} output, ${reasoning.length} reasoning`);
+  log('info', `Stream completed - ${out.length} output, ${reasoning.length} reasoning, ${toolCallsMap.size} tool_calls`);
 }
 /**