ghc-proxy 0.3.1 → 0.4.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +9 -13
- package/dist/GptEncoding-DuDWxow_.mjs +1134 -0
- package/dist/GptEncoding-DuDWxow_.mjs.map +1 -0
- package/dist/cl100k_base-YsziDpoU.mjs +101373 -0
- package/dist/cl100k_base-YsziDpoU.mjs.map +1 -0
- package/dist/file-type-BNd4XJK1.mjs +4527 -0
- package/dist/file-type-BNd4XJK1.mjs.map +1 -0
- package/dist/main.mjs +46992 -1337
- package/dist/main.mjs.map +1 -1
- package/dist/o200k_base-C_Bgi80R.mjs +204724 -0
- package/dist/o200k_base-C_Bgi80R.mjs.map +1 -0
- package/dist/p50k_base-DRo0AxsG.mjs +50482 -0
- package/dist/p50k_base-DRo0AxsG.mjs.map +1 -0
- package/dist/p50k_base-teVr-d1Y.mjs +10 -0
- package/dist/p50k_base-teVr-d1Y.mjs.map +1 -0
- package/dist/p50k_edit-nucqZWIv.mjs +10 -0
- package/dist/p50k_edit-nucqZWIv.mjs.map +1 -0
- package/dist/prompt-mE5xxWUf.mjs +848 -0
- package/dist/prompt-mE5xxWUf.mjs.map +1 -0
- package/dist/r50k_base-B2MFjxES.mjs +50464 -0
- package/dist/r50k_base-B2MFjxES.mjs.map +1 -0
- package/package.json +6 -7
package/README.md
CHANGED
@@ -48,7 +48,7 @@ This is the most common use case. There are two ways to set it up:
 bunx ghc-proxy@latest start --claude-code
 ```
 
-This starts the proxy, opens an interactive model picker, and
+This starts the proxy, opens an interactive model picker, and prints a ready-to-paste environment command. Run that command in another terminal to launch Claude Code with the correct configuration.
 
 ### Option B: Permanent config (Recommended)
 
@@ -62,7 +62,6 @@ Create or edit `~/.claude/settings.json` (this applies globally to all projects)
     "ANTHROPIC_MODEL": "claude-opus-4.6",
     "ANTHROPIC_DEFAULT_SONNET_MODEL": "claude-sonnet-4.6",
     "ANTHROPIC_DEFAULT_HAIKU_MODEL": "claude-haiku-4.5",
-    "DISABLE_NON_ESSENTIAL_MODEL_CALLS": "1",
     "CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC": "1"
   },
   "permissions": {
@@ -86,7 +85,6 @@ bunx ghc-proxy@latest start --wait
 | `ANTHROPIC_MODEL` | The model Claude Code uses for primary/Opus tasks |
 | `ANTHROPIC_DEFAULT_SONNET_MODEL` | The model used for Sonnet-tier tasks |
 | `ANTHROPIC_DEFAULT_HAIKU_MODEL` | The model used for Haiku-tier (fast/cheap) tasks |
-| `DISABLE_NON_ESSENTIAL_MODEL_CALLS` | Prevents Claude Code from making extra API calls |
 | `CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC` | Disables telemetry and non-essential network traffic |
 
 > **Tip:** The model names above (e.g. `claude-opus-4.6`) are mapped to actual Copilot models by the proxy. See [Model Mapping](#model-mapping) below for details.
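Combining the variables from the table above, a minimal `~/.claude/settings.json` might look like the following sketch. The `ANTHROPIC_BASE_URL` value and port are assumptions for illustration; use the proxy address that `ghc-proxy` actually prints on startup.

```json
{
  "env": {
    "ANTHROPIC_BASE_URL": "http://localhost:4141",
    "ANTHROPIC_MODEL": "claude-opus-4.6",
    "ANTHROPIC_DEFAULT_SONNET_MODEL": "claude-sonnet-4.6",
    "ANTHROPIC_DEFAULT_HAIKU_MODEL": "claude-haiku-4.5",
    "CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC": "1"
  }
}
```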
@@ -112,7 +110,7 @@ The proxy authenticates with GitHub using the [device code OAuth flow](https://d
 
 When the Copilot token response includes `endpoints.api`, `ghc-proxy` now prefers that runtime API base automatically instead of relying only on the configured account type. This keeps enterprise/business routing aligned with the endpoint GitHub actually returned for the current token.
 
-Incoming requests hit
+Incoming requests hit an [Elysia](https://elysiajs.com/) server. `chat/completions` requests are validated, normalized into the shared planning pipeline, and then forwarded to Copilot. `responses` requests use a native Responses path with explicit compatibility policies. `messages` requests are routed per-model and can use native Anthropic passthrough, the Responses translation path, or the existing chat-completions fallback. The translator tracks exact vs lossy vs unsupported behavior explicitly; see the [Messages Routing and Translation Guide](./docs/messages-routing-and-translation.md) and the [Anthropic Translation Matrix](./docs/anthropic-translation-matrix.md) for the current support surface.
 
 ### Request Routing
 
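The per-model `messages` routing described in the new README text can be pictured as a small strategy selector. The sketch below is illustrative only; the capability flags and names are assumptions, not ghc-proxy's actual types, and the real policy lives in the routing docs linked above.

```typescript
// Hypothetical capability flags; not ghc-proxy's actual model metadata shape.
interface ModelCaps {
  nativeAnthropic: boolean; // model supports Anthropic Messages passthrough
  responsesApi: boolean;    // model supports the Responses API
}

type MessagesStrategy =
  | "anthropic-passthrough"
  | "responses-translation"
  | "chat-completions-fallback";

// Prefer the most native path available, falling back to chat completions.
function pickMessagesStrategy(caps: ModelCaps): MessagesStrategy {
  if (caps.nativeAnthropic) return "anthropic-passthrough";
  if (caps.responsesApi) return "responses-translation";
  return "chat-completions-fallback"; // existing fallback path
}
```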
@@ -186,7 +184,7 @@ bunx ghc-proxy@latest debug # Print diagnostic info (version, paths, to
 | `--github-token` | `-g` | -- | Pass a GitHub token directly (from `auth`) |
 | `--claude-code` | `-c` | `false` | Generate a Claude Code launch command |
 | `--show-token` | -- | `false` | Display tokens on auth and refresh |
-| `--proxy-env` | -- | `false` | Use `HTTP_PROXY`/`HTTPS_PROXY` from env |
+| `--proxy-env` | -- | `false` | Use `HTTP_PROXY`/`HTTPS_PROXY` from env (Node.js only; Bun reads proxy env natively) |
 | `--idle-timeout` | -- | `120` | Bun server idle timeout in seconds |
 | `--upstream-timeout` | -- | `300` | Upstream request timeout in seconds (0 to disable) |
 
@@ -228,7 +226,7 @@ When Claude Code sends a request for a model like `claude-sonnet-4.6`, the proxy
 | Prefix | Default Fallback |
 |--------|-----------------|
 | `claude-opus-*` | `claude-opus-4.6` |
-| `claude-sonnet-*` | `claude-sonnet-4.
+| `claude-sonnet-*` | `claude-sonnet-4.6` |
 | `claude-haiku-*` | `claude-haiku-4.5` |
 
 ### Customizing Fallbacks
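The prefix table above amounts to a simple prefix lookup. A minimal sketch, using the documented defaults (the function name `resolveFallback` is illustrative, not ghc-proxy's actual API):

```typescript
// Default fallbacks keyed by model-name prefix (values from the table above).
const FALLBACKS: ReadonlyArray<[prefix: string, target: string]> = [
  ["claude-opus-", "claude-opus-4.6"],
  ["claude-sonnet-", "claude-sonnet-4.6"],
  ["claude-haiku-", "claude-haiku-4.5"],
];

function resolveFallback(model: string): string {
  for (const [prefix, target] of FALLBACKS) {
    if (model.startsWith(prefix)) return target;
  }
  return model; // unrecognized models pass through unchanged
}
```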
@@ -237,7 +235,7 @@ You can override the defaults with **environment variables**:
 
 ```bash
 MODEL_FALLBACK_CLAUDE_OPUS=claude-opus-4.6
-MODEL_FALLBACK_CLAUDE_SONNET=claude-sonnet-4.
+MODEL_FALLBACK_CLAUDE_SONNET=claude-sonnet-4.6
 MODEL_FALLBACK_CLAUDE_HAIKU=claude-haiku-4.5
 ```
 
@@ -247,7 +245,7 @@ Or in the proxy's **config file** (`~/.local/share/ghc-proxy/config.json`):
 {
   "modelFallback": {
     "claudeOpus": "claude-opus-4.6",
-    "claudeSonnet": "claude-sonnet-4.
+    "claudeSonnet": "claude-sonnet-4.6",
     "claudeHaiku": "claude-haiku-4.5"
   }
 }
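The README states the resolution order as environment variable > config.json > built-in default. A sketch of that lookup for the Sonnet slot (the function name and config type are illustrative; the env var and config keys are the documented ones):

```typescript
interface GhcProxyConfig {
  modelFallback?: { claudeSonnet?: string };
}

// Priority: environment variable > config.json > built-in default.
function resolveSonnetFallback(
  env: Record<string, string | undefined>,
  config: GhcProxyConfig,
): string {
  return (
    env["MODEL_FALLBACK_CLAUDE_SONNET"] ??
    config.modelFallback?.claudeSonnet ??
    "claude-sonnet-4.6" // built-in default
  );
}
```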
@@ -255,22 +253,21 @@ Or in the proxy's **config file** (`~/.local/share/ghc-proxy/config.json`):
 
 **Priority order:** environment variable > config.json > built-in default.
 
+> **Note:** Model fallbacks only apply to the **chat completions translation path**. The native Messages and Responses API strategies pass the model ID through to Copilot as-is.
+
 ### Small-Model Routing
 
 `/v1/messages` can optionally reroute specific low-value requests to a cheaper model:
 
 - `smallModel`: the model to reroute to
 - `compactUseSmallModel`: reroute recognized compact/summarization requests
-- `warmupUseSmallModel`: reroute explicitly marked warmup/probe requests
 
-
+The switch defaults to `false`. Routing is conservative:
 
 - the target `smallModel` must exist in Copilot's model list
 - it must preserve the original model's declared endpoint support
 - tool, thinking, and vision requests are not rerouted to a model that lacks the required capabilities
 
-Warmup routing is intentionally narrow. Requests must look like explicit warmup/probe traffic; ordinary tool-free chat requests are not rerouted just because they include `anthropic-beta`.
-
 ### Responses Compatibility
 
 `/v1/responses` is designed to stay close to the OpenAI wire format while making Copilot limitations explicit:
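The conservative small-model routing checks listed above can be sketched as a guard function. Everything here is an assumption for illustration: the type shapes, field names, and function name are hypothetical, and the endpoint-support check is elided for brevity.

```typescript
// Hypothetical shapes for a request and a Copilot model-list entry.
interface RerouteRequest { usesTools: boolean; usesThinking: boolean; usesVision: boolean }
interface ModelEntry { id: string; tools: boolean; thinking: boolean; vision: boolean }

function canRerouteToSmallModel(
  req: RerouteRequest,
  smallModelId: string,
  modelList: ModelEntry[],
): boolean {
  const target = modelList.find((m) => m.id === smallModelId);
  if (!target) return false; // target must exist in Copilot's model list
  // Never reroute to a model that lacks a capability the request needs.
  if (req.usesTools && !target.tools) return false;
  if (req.usesThinking && !target.thinking) return false;
  if (req.usesVision && !target.vision) return false;
  return true;
}
```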
@@ -300,7 +297,6 @@ Example `config.json`:
 {
   "smallModel": "gpt-4.1-mini",
   "compactUseSmallModel": true,
-  "warmupUseSmallModel": false,
   "useFunctionApplyPatch": true,
   "responsesApiContextManagementModels": ["gpt-5", "gpt-5-mini"],
   "modelReasoningEfforts": {