openclaw-freerouter 2.0.0 → 2.0.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (3) hide show
  1. package/README.md +228 -105
  2. package/package.json +4 -2
  3. package/src/service.ts +5 -1
package/README.md CHANGED
@@ -1,121 +1,98 @@
1
- # FreeRouter — OpenClaw Plugin
1
+ # FreeRouter — Smart LLM Router for OpenClaw
2
2
 
3
- Smart LLM router that classifies your requests and routes them to the best model automatically. Uses a 14-dimension weighted scorer (<1ms) with configurable tier→model mapping.
3
+ **Stop overpaying for AI.** Every message to your AI assistant costs money — a simple "hello" shouldn't cost the same as "prove the Riemann hypothesis." FreeRouter automatically routes each request to the right model based on complexity.
4
4
 
5
- ## Install
5
+ ## The Problem
6
6
 
7
- ```bash
8
- # From local directory
9
- openclaw plugins install -l /path/to/freerouter-plugin
7
+ - 🔥 **Wasted money** — Running Claude Opus ($15/$75 per 1M tokens) for every message, even "what's 2+2?"
8
+ - 🤷 **No control** — Can't switch models mid-conversation without editing config files and restarting
9
+ - 📊 **Blind routing** — Your AI shows "freerouter/auto" instead of telling you which model actually answered
10
+ - 🧠 **No adaptive thinking** — OpenClaw's built-in thinking levels (`/think low`, `/think high`) are just prompt hints. Anthropic deprecated manual thinking for Opus 4.6 — it now requires the native `thinking: { type: "adaptive" }` API parameter, which OpenClaw doesn't send
11
+ - 🔧 **Complex setup** — Existing routers need separate servers, Docker, complex infra
10
12
 
11
- # Or copy to extensions
12
- openclaw plugins install /path/to/freerouter-plugin
13
- ```
13
+ ## The Solution
14
14
 
15
- ## Configure
15
+ FreeRouter is an OpenClaw plugin that:
16
+ - **Classifies every request in <1ms** using a 14-dimension weighted scorer (no LLM needed for classification)
17
+ - **Routes to the cheapest model that can handle it** — Kimi for "hello", Opus for architecture design
18
+ - **Sends native adaptive thinking** — Automatically passes `thinking: { type: "adaptive" }` to Anthropic's API for Opus 4.6, and `thinking: { type: "enabled", budget_tokens: N }` for Sonnet. No prompt hacks — real API-level thinking control that OpenClaw doesn't support natively
19
+ - **Reports the real model name** — You see `anthropic/claude-opus-4-6`, not `freerouter/auto`
20
+ - **Lets you override anytime** — Just say "use opus" in plain English
16
21
 
17
- After install, configure tiers in `openclaw.json`:
22
+ ## Install
18
23
 
19
- ```json5
20
- {
21
- // Set FreeRouter as your default model
22
- agents: {
23
- defaults: {
24
- model: { primary: "freerouter/auto" }
25
- }
26
- },
24
+ ```bash
25
+ openclaw plugins install openclaw-freerouter
26
+ ```
27
27
 
28
- // Add FreeRouter as a provider pointing to its HTTP proxy
29
- providers: {
30
- freerouter: {
31
- baseUrl: "http://127.0.0.1:18801/v1",
32
- api: "openai-completions"
33
- }
34
- },
28
+ Then run the setup wizard:
35
29
 
36
- // Plugin config
37
- plugins: {
38
- entries: {
39
- freerouter: {
40
- enabled: true,
41
- config: {
42
- port: 18801, // HTTP proxy port (0 = disabled)
43
- host: "127.0.0.1", // Bind address
44
-
45
- // Customize which models handle each tier
46
- tiers: {
47
- SIMPLE: { primary: "kimi-coding/kimi-for-coding", fallback: ["anthropic/claude-haiku-4-5"] },
48
- MEDIUM: { primary: "anthropic/claude-sonnet-4-5", fallback: ["anthropic/claude-opus-4-6"] },
49
- COMPLEX: { primary: "anthropic/claude-opus-4-6", fallback: [] },
50
- REASONING: { primary: "anthropic/claude-opus-4-6", fallback: [] }
51
- },
30
+ ```bash
31
+ openclaw freerouter setup
32
+ ```
52
33
 
53
- // Thinking/reasoning config
54
- thinking: {
55
- adaptive: ["claude-opus-4-6"],
56
- enabled: { models: ["claude-sonnet-4-5"], budget: 4096 }
57
- },
34
+ Or configure manually — see [Configuration](#configure) below.
58
35
 
59
- // Default tier for ambiguous requests
60
- defaultTier: "MEDIUM"
61
- }
62
- }
63
- }
64
- }
65
- }
66
- ```
36
+ ## Switch Models Anytime
67
37
 
68
- ## How It Works
38
+ The killer feature: **switch models using natural language, slash commands, or session locks.**
69
39
 
70
- 1. OpenClaw sends a request with model `freerouter/auto`
71
- 2. FreeRouter's HTTP proxy receives it
72
- 3. The 14-dimension classifier scores the request in <1ms
73
- 4. Routes to the best model for the task (e.g., Kimi for simple, Opus for reasoning)
74
- 5. Forwards to the real provider API
75
- 6. **Returns the actual model name** (e.g., `anthropic/claude-opus-4-6`) so OpenClaw displays what's really running
40
+ ### Just Say It (Natural Language)
76
41
 
77
- ## Tiers
42
+ No slash commands needed. Just talk:
78
43
 
79
- | Tier | Default Model | Use Case |
80
- |------|--------------|----------|
81
- | SIMPLE | kimi-coding/kimi-for-coding | Quick lookups, translations, simple Q&A |
82
- | MEDIUM | anthropic/claude-sonnet-4-5 | Code generation, creative writing, moderate complexity |
83
- | COMPLEX | anthropic/claude-opus-4-6 | Architecture design, multi-step reasoning |
84
- | REASONING | anthropic/claude-opus-4-6 | Mathematical proofs, formal logic, deep analysis |
44
+ | What you say | What happens |
45
+ |---|---|
46
+ | `use opus` | Switches to Claude Opus for this message |
47
+ | `switch to sonnet` | Switches to Claude Sonnet |
48
+ | `try kimi` | Switches to Kimi |
49
+ | `let's use opus` | Switches to Opus |
50
+ | `please use sonnet` | Switches to Sonnet |
51
+ | `can you use haiku` | Switches to Haiku |
52
+ | `use opus: explain quantum computing` | Uses Opus for this specific prompt |
53
+ | `use opus, what is 2+2?` | Same — model + prompt in one message |
54
+ | `go back to auto` | Return to automatic routing |
85
55
 
86
- ## Per-Prompt Model Override
56
+ ### Lock a Model for the Whole Session
87
57
 
88
- Force a specific model for one message:
58
+ When you know the task is important and you want Opus (or any model) for everything:
89
59
 
90
- - `/opus Explain quantum computing` Claude Opus 4.6
91
- - `/sonnet Write a poem` → Claude Sonnet 4.5
92
- - `/kimi What's 2+2?` Kimi K2.5
93
- - `/haiku Translate this` Claude Haiku 4.5
94
- - `[opus] Deep analysis...` Claude Opus 4.6
60
+ | What you say | What happens |
61
+ |---|---|
62
+ | `use opus for this session` | 🔒 Locks ALL messages to Opus |
63
+ | `switch to sonnet from now on` | 🔒 Locks to Sonnet |
64
+ | `stick with opus` | 🔒 Locks to Opus |
65
+ | `keep using sonnet` | 🔒 Locks to Sonnet |
66
+ | `/lock opus` | 🔒 Same thing, slash command |
67
+ | `/unlock` | 🔓 Back to auto-routing |
68
+ | `/lock status` | Shows current lock state |
95
69
 
96
- ## Per-Prompt Tier Override
70
+ Session locks expire after 4 hours of inactivity.
97
71
 
98
- Force a tier (uses that tier's primary model):
72
+ ### When FreeRouter Isn't Sure
99
73
 
100
- - `/simple What's 2+2?` SIMPLE tier
101
- - `/max Prove the Riemann hypothesis` → REASONING tier
102
- - `[reasoning] Analyze this code...` → REASONING tier
74
+ If your request is ambiguous, FreeRouter asks before switching:
103
75
 
104
- ## Session Lock
76
+ > **You:** "opus please"
77
+ > **FreeRouter:** 🤔 Did you want to switch to **anthropic/claude-opus-4-6**? Reply **yes** or **no**.
78
+ > **You:** "yes"
79
+ > **FreeRouter:** ✅ Confirmed!
105
80
 
106
- Lock an entire session to a specific model:
81
+ This prevents accidental switches when you're just talking *about* a model.
107
82
 
108
- - `/lock opus` 🔒 All messages use Opus until unlocked
109
- - `/lock sonnet` → 🔒 All messages use Sonnet
110
- - `/lock simple` → 🔒 Lock to SIMPLE tier's primary model
111
- - `/lock anthropic/claude-opus-4-6` → 🔒 Full model ID
112
- - `/unlock` → 🔓 Return to auto-routing
113
- - `/lock auto` → 🔓 Same as unlock
114
- - `/lock status` → Show current lock state
83
+ ### Slash Commands (Power Users)
115
84
 
116
- Session locks expire after 4 hours of inactivity.
85
+ | Command | Effect |
86
+ |---|---|
87
+ | `/opus What is 2+2?` | Per-prompt: Opus for this message only |
88
+ | `/sonnet Write a poem` | Per-prompt: Sonnet |
89
+ | `/kimi Quick answer` | Per-prompt: Kimi |
90
+ | `/haiku Translate this` | Per-prompt: Haiku |
91
+ | `/simple What is 2+2?` | Per-prompt: use SIMPLE tier model |
92
+ | `/max Prove this theorem` | Per-prompt: use REASONING tier model |
93
+ | `[opus] Deep analysis` | Bracket syntax (same as /opus) |
117
94
 
118
- ### Supported Aliases
95
+ ### Supported Model Aliases
119
96
 
120
97
  | Alias | Model |
121
98
  |-------|-------|
@@ -125,37 +102,183 @@ Session locks expire after 4 hours of inactivity.
125
102
  | `haiku`, `haiku-4`, `haiku-4.5` | anthropic/claude-haiku-4-5 |
126
103
  | `kimi`, `kimi-k2`, `k2.5` | kimi-coding/kimi-for-coding |
127
104
 
128
- ## Scoring Dimensions
105
+ ## Adaptive Thinking (Native API-Level)
106
+
107
+ This is a key reason FreeRouter exists.
108
+
109
+ **The problem:** OpenClaw's built-in `/think low|medium|high` commands are prompt-level hints — they add text like "think harder" to your prompt. This is unreliable and doesn't use Anthropic's actual thinking API. Worse, **Anthropic deprecated manual thinking (`type: "enabled"`) for Opus 4.6** — it now only supports `type: "adaptive"`, where the model decides how much to think based on the task.
110
+
111
+ **What FreeRouter does:** Sends the real `thinking` parameter directly to Anthropic's API:
112
+
113
+ | Model | Thinking Mode | What's Sent |
114
+ |---|---|---|
115
+ | Claude Opus 4.6 | **Adaptive** (always) | `thinking: { type: "adaptive" }` |
116
+ | Claude Sonnet 4.5 | Enabled with budget | `thinking: { type: "enabled", budget_tokens: 4096 }` |
117
+ | Others (Kimi, Haiku) | Off | No thinking parameter |
118
+
119
+ **Why this matters:**
120
+ - Adaptive thinking lets Opus 4.6 decide how much reasoning it needs — simple questions get quick answers, complex proofs get deep thinking chains
121
+ - You get the full benefit of Claude's extended thinking without managing budgets
122
+ - The `X-FreeRouter-Thinking` response header tells you exactly which thinking mode was used
123
+
124
+ Configure in your plugin config:
125
+ ```json5
126
+ "thinking": {
127
+ "adaptive": ["claude-opus-4-6"], // Models that use adaptive (always-on)
128
+ "enabled": {
129
+ "models": ["claude-sonnet-4-5"], // Models that get explicit thinking
130
+ "budget": 4096 // Token budget for thinking
131
+ }
132
+ }
133
+ ```
134
+
135
+ ## How Routing Works
136
+
137
+ 1. You send a message → OpenClaw forwards to FreeRouter
138
+ 2. FreeRouter's 14-dimension classifier scores the request in **<1ms** (0.035ms average)
139
+ 3. Based on the score, it picks the best tier:
140
+
141
+ | Tier | Default Model | Use Case | Cost |
142
+ |------|--------------|----------|------|
143
+ | SIMPLE | Kimi K2.5 | Quick lookups, translations, "hello" | $0.50/1M |
144
+ | MEDIUM | Claude Sonnet 4.5 | Code, creative writing, moderate tasks | $3/$15/1M |
145
+ | COMPLEX | Claude Opus 4.6 | Architecture, deep analysis | $15/$75/1M |
146
+ | REASONING | Claude Opus 4.6 | Proofs, formal logic, step-by-step | $15/$75/1M |
147
+
148
+ 4. Forwards to the real provider API (Anthropic, Kimi, OpenAI, etc.)
149
+ 5. Returns the response with the **actual model name** — not "freerouter/auto"
129
150
 
130
- The classifier evaluates 14 weighted dimensions:
151
+ ### Scoring Dimensions
152
+
153
+ The classifier evaluates 14 weighted dimensions without calling any LLM:
131
154
  - Token count, code presence, reasoning markers, technical terms
132
155
  - Creative markers, simple indicators, multi-step patterns
133
156
  - Question complexity, imperative verbs, constraints
134
157
  - Output format, references, negation, domain specificity
135
- - Agentic task indicators
158
+ - Agentic task indicators (multi-tool, multi-step workflows)
159
+
160
+ Supports multilingual classification: English, Chinese, Japanese, Russian, German, Vietnamese, Arabic, Korean, and more.
161
+
162
+ ## Configure
163
+
164
+ After install, add to your `openclaw.json`:
165
+
166
+ ```json5
167
+ {
168
+ // 1. Set FreeRouter as your default model
169
+ "agents": {
170
+ "defaults": {
171
+ "model": {
172
+ "primary": "freerouter/freerouter/auto",
173
+ "fallbacks": ["anthropic/claude-opus-4-6"]
174
+ }
175
+ }
176
+ },
177
+
178
+ // 2. Add FreeRouter as a provider
179
+ "providers": {
180
+ "freerouter": {
181
+ "baseUrl": "http://127.0.0.1:18801/v1",
182
+ "api": "openai-completions"
183
+ }
184
+ },
185
+
186
+ // 3. Plugin config
187
+ "plugins": {
188
+ "entries": {
189
+ "freerouter": {
190
+ "enabled": true,
191
+ "config": {
192
+ "port": 18801,
193
+ "host": "127.0.0.1",
194
+ "tiers": {
195
+ "SIMPLE": { "primary": "kimi-coding/kimi-for-coding", "fallback": ["anthropic/claude-haiku-4-5"] },
196
+ "MEDIUM": { "primary": "anthropic/claude-sonnet-4-5", "fallback": ["anthropic/claude-opus-4-6"] },
197
+ "COMPLEX": { "primary": "anthropic/claude-opus-4-6", "fallback": [] },
198
+ "REASONING": { "primary": "anthropic/claude-opus-4-6", "fallback": [] }
199
+ },
200
+ "thinking": {
201
+ "adaptive": ["claude-opus-4-6"],
202
+ "enabled": { "models": ["claude-sonnet-4-5"], "budget": 4096 }
203
+ },
204
+ "defaultTier": "MEDIUM"
205
+ }
206
+ }
207
+ }
208
+ }
209
+ }
210
+ ```
211
+
212
+ Then restart: `openclaw gateway restart`
213
+
214
+ ## CLI Commands
215
+
216
+ | Command | Description |
217
+ |---|---|
218
+ | `openclaw freerouter status` | Show config, tier mapping, and live stats |
219
+ | `openclaw freerouter setup` | Interactive setup wizard |
220
+ | `openclaw freerouter test` | Quick 5-query classification smoke test |
221
+ | `openclaw freerouter doctor` | Diagnose issues (port conflicts, missing config, etc.) |
222
+ | `openclaw freerouter port <n>` | Change the proxy port |
223
+ | `openclaw freerouter reset` | Show default config for recovery |
224
+
225
+ ### Chat Commands
226
+
227
+ | Command | Description |
228
+ |---|---|
229
+ | `/freerouter` | Show routing stats in chat |
230
+ | `/freerouter-doctor` | Quick health check |
136
231
 
137
232
  ## Port Conflicts
138
233
 
139
- If port 18801 is in use, change it in the plugin config:
234
+ If port 18801 is in use, change it:
140
235
 
141
236
  ```json5
142
- { plugins: { entries: { freerouter: { config: { port: 18802 } } } } }
237
+ { "plugins": { "entries": { "freerouter": { "config": { "port": 18802 } } } } }
143
238
  ```
144
239
 
145
- Set `port: 0` to disable the HTTP proxy entirely.
240
+ Set `"port": 0` to disable the HTTP proxy entirely.
146
241
 
147
- ## Commands
242
+ ## Troubleshooting
243
+
244
+ ```bash
245
+ # Check if everything is working
246
+ openclaw freerouter doctor
148
247
 
149
- - `/freerouter` Show routing stats in chat
150
- - `openclaw freerouter status` — CLI status and stats
248
+ # Quick classification test
249
+ openclaw freerouter test
250
+
251
+ # Reset to defaults
252
+ openclaw freerouter reset
253
+ ```
151
254
 
152
255
  ## Response Headers
153
256
 
154
- Every proxied response includes:
155
- - `X-FreeRouter-Model` — Actual model used
156
- - `X-FreeRouter-Tier` — Classification tier
257
+ Every proxied response includes metadata headers:
258
+ - `X-FreeRouter-Model` — Actual model used (e.g., `anthropic/claude-opus-4-6`)
259
+ - `X-FreeRouter-Tier` — Classification tier (SIMPLE/MEDIUM/COMPLEX/REASONING)
157
260
  - `X-FreeRouter-Thinking` — Thinking mode (off/adaptive/enabled)
158
- - `X-FreeRouter-Reasoning` — Classification reasoning
261
+ - `X-FreeRouter-Reasoning` — Why this model was chosen
262
+
263
+ ## API Endpoints
264
+
265
+ The HTTP proxy exposes:
266
+ - `POST /v1/chat/completions` — OpenAI-compatible chat endpoint
267
+ - `GET /v1/models` — List available models
268
+ - `GET /health` — Health check
269
+ - `GET /stats` — Routing statistics
270
+ - `GET /sessions/locks` — Active session locks
271
+ - `DELETE /sessions/locks` — Clear all session locks
272
+
273
+ ## Tests
274
+
275
+ ```bash
276
+ npm test # Runs all 212 tests
277
+ ```
278
+
279
+ - `test-freerouter.mjs` — Classification, multilingual, edge cases, performance
280
+ - `test-resilience.mjs` — Fallback chains, bad configs, recovery, port conflicts
281
+ - `test-overrides.mjs` — Model overrides, session locks, natural language switching
159
282
 
160
283
  ## License
161
284
 
package/package.json CHANGED
@@ -1,10 +1,12 @@
1
1
  {
2
2
  "name": "openclaw-freerouter",
3
- "version": "2.0.0",
3
+ "version": "2.0.2",
4
4
  "description": "Smart LLM router plugin for OpenClaw — classify requests and route to the best model using your own API keys. 14-dimension scoring, <1ms classification, per-prompt/session model switching.",
5
5
  "type": "module",
6
6
  "openclaw": {
7
- "extensions": ["./src/index.ts"]
7
+ "extensions": [
8
+ "./src/index.ts"
9
+ ]
8
10
  },
9
11
  "main": "src/index.ts",
10
12
  "scripts": {
package/src/service.ts CHANGED
@@ -624,11 +624,15 @@ export function createProxyServer(options: ProxyOptions): { server: Server; stat
624
624
  }
625
625
 
626
626
  // Determine thinking mode
627
+ // Opus 4.6 REQUIRES adaptive thinking — manual mode is deprecated by Anthropic
628
+ // Other models can use enabled(budget) for explicit thinking control
627
629
  const thinkingCfg = pluginConfig.thinking as any;
628
630
  const adaptivePatterns = thinkingCfg?.adaptive ?? ["claude-opus-4-6"];
629
631
  const enabledCfg = thinkingCfg?.enabled;
630
632
 
631
- if (adaptivePatterns.some((p: string) => routedModel.includes(p)) && (tier === "COMPLEX" || tier === "REASONING")) {
633
+ if (adaptivePatterns.some((p: string) => routedModel.includes(p))) {
634
+ // Adaptive thinking models (e.g., Opus 4.6) — always use adaptive
635
+ // Anthropic deprecated manual thinking for Opus 4.6
632
636
  thinkingMode = "adaptive";
633
637
  } else if (enabledCfg?.models?.some((p: string) => routedModel.includes(p))) {
634
638
  thinkingMode = `enabled(${enabledCfg.budget ?? 4096})`;