@ljoukov/llm 3.0.0 → 3.0.2

package/README.md CHANGED
@@ -9,7 +9,7 @@ Unified TypeScript wrapper over:
 
 - **OpenAI Responses API** (`openai`)
 - **Google Gemini via Vertex AI** (`@google/genai`)
-- **Fireworks chat-completions models** (`kimi-k2.5`, `glm-5`, `minimax-m2.1`)
+- **Fireworks chat-completions models** (`kimi-k2.5`, `glm-5`, `minimax-m2.1`, `gpt-oss-120b`)
 - **ChatGPT subscription models** via `chatgpt-*` model ids (reuses Codex auth store, or a token provider)
 
 Designed around a single streaming API that yields:
@@ -34,6 +34,8 @@ See Node.js docs on environment variables and dotenv files: https://nodejs.org/a
 ### OpenAI
 
 - `OPENAI_API_KEY`
+- `OPENAI_RESPONSES_WEBSOCKET_MODE` (`auto` | `off` | `only`, default: `auto`)
+- `OPENAI_BASE_URL` (optional; defaults to `https://api.openai.com/v1`)
 
 ### Gemini (Vertex AI)
 
@@ -86,12 +88,25 @@ refresh-token rotation and serves short-lived access tokens.
 - `CHATGPT_AUTH_TOKEN_PROVIDER_URL` (example: `https://chatgpt-auth.<your-domain>`)
 - `CHATGPT_AUTH_API_KEY` (shared secret; sent as `Authorization: Bearer ...` and `x-chatgpt-auth: ...`)
 - `CHATGPT_AUTH_TOKEN_PROVIDER_STORE` (`kv` or `d1`, defaults to `kv`)
+- `CHATGPT_RESPONSES_WEBSOCKET_MODE` (`auto` | `off` | `only`, default: `auto`)
 
 This repo includes a Cloudflare Workers token provider implementation in `workers/chatgpt-auth/`.
 
 If `CHATGPT_AUTH_TOKEN_PROVIDER_URL` + `CHATGPT_AUTH_API_KEY` are set, `chatgpt-*` models will fetch tokens from the
 token provider and will not read the local Codex auth store.
 
+### Responses transport
+
+For OpenAI and `chatgpt-*` model paths, this library now tries **Responses WebSocket transport first** and falls back
+to HTTP/SSE automatically when needed.
+
+- `auto` (default): try WebSocket first, then fall back to SSE
+- `off`: use SSE only
+- `only`: require WebSocket (no fallback)
+
+When fallback is triggered by an unsupported WebSocket upgrade response (for example `426`), the library keeps using
+SSE for the rest of the process to avoid repeated failing upgrade attempts.
+
 ## Usage
 
 `v2` uses OpenAI-style request fields:
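The transport section added above describes a three-mode selection policy with a process-wide "stick to SSE after a failed upgrade" rule. A minimal sketch of that policy as standalone functions; the names and shapes here are illustrative, not the library's internals:

```typescript
// Illustrative sketch (not @ljoukov/llm internals) of the documented
// auto/off/only transport policy.
type WebSocketMode = "auto" | "off" | "only";
type Transport = "websocket" | "sse";

// Process-wide flag: once a WebSocket upgrade has failed, "auto" stops
// retrying upgrades for the rest of the process.
let upgradeFailedThisProcess = false;

function resolveTransport(mode: WebSocketMode): Transport {
  if (mode === "only") return "websocket"; // require WebSocket, no fallback
  if (mode === "off") return "sse"; // SSE only
  // "auto": prefer WebSocket unless an earlier upgrade already failed
  return upgradeFailedThisProcess ? "sse" : "websocket";
}

function recordUpgradeFailure(status: number): void {
  // 426 Upgrade Required indicates the endpoint rejects WebSocket upgrades
  if (status === 426) upgradeFailedThisProcess = true;
}
```

Keeping the failure flag at module scope rather than per-request is what makes a single `426` disable further upgrade attempts for the remainder of the process.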
@@ -263,7 +278,7 @@ console.log(result.text);
 
 ### Fireworks
 
-Use Fireworks model ids directly (for example `kimi-k2.5`, `glm-5`, `minimax-m2.1`):
+Use Fireworks model ids directly (for example `kimi-k2.5`, `glm-5`, `minimax-m2.1`, `gpt-oss-120b`):
 
 ```ts
 import { generateText } from "@ljoukov/llm";
@@ -457,7 +472,7 @@ const fs = createInMemoryAgentFilesystem({
 });
 
 const result = await runAgentLoop({
-  model: "chatgpt-gpt-5.3-codex-spark",
+  model: "chatgpt-gpt-5.3-codex",
   input: "Change value from 1 to 2 using filesystem tools.",
   filesystemTool: {
     profile: "auto",
@@ -481,13 +496,13 @@ import {
 } from "@ljoukov/llm";
 
 const fs = createInMemoryAgentFilesystem({ "/repo/a.ts": "export const n = 1;\n" });
-const tools = createFilesystemToolSetForModel("chatgpt-gpt-5.3-codex-spark", {
+const tools = createFilesystemToolSetForModel("chatgpt-gpt-5.3-codex", {
   cwd: "/repo",
   fs,
 });
 
 const result = await runToolLoop({
-  model: "chatgpt-gpt-5.3-codex-spark",
+  model: "chatgpt-gpt-5.3-codex",
   input: "Update n to 2.",
   tools,
 });