npm - @gabrielsmartin/orbit-sdk - Versions diffs - 0.3.3 → 0.4.0 - Mend

@gabrielsmartin/orbit-sdk 0.3.3 → 0.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7) hide show

package/README.md +124 -127
package/package.json +25 -23
package/src/index.js +70 -84
package/src/router.js +48 -81
package/index.js +0 -14
package/src/signalBias.js +0 -85
/package/{index.d.ts → src/index.d.ts} +0 -0

package/README.md CHANGED Viewed

@@ -1,191 +1,188 @@
-# orbit-ai
+# @gabrielsmartin/orbit-sdk
-> Stop blasting every query at GPT-4o. Route intelligently. Save 85%.
+> Stop blasting every query at GPT-4o. Route intelligently. Save up to 98%.
-`orbit-ai` is a drop-in routing layer that reads the fingerprint of every AI query and sends it to the optimal model — automatically, in under 1ms.
+A small, fast, rule-based router for choosing between LLMs. Open source. No magic. ~200 lines of tuned heuristics that you can fork and customize.
+**What it is:** ORBIT reads your query, classifies it across 8 axes, and tells you which model to use. You make the API call — ORBIT just picks the model.
+**What it isn't:** a proxy, a black box, or a neural network. It's fast deterministic rules — safety-critical routes (emotional content, crisis) always win, everything else is heuristic.
 ```bash
-npm install orbit-ai
+npm install @gabrielsmartin/orbit-sdk
 ```
 ---
-## 🚀 Pro is live
-| Plan | Price | What's included |
-|---|---|---|
-| **Founding Pro** | $19/mo | API access · 500k routed queries/mo · savings dashboard |
-| **Founding Team** | $99/mo | Unlimited queries · multi-seat · priority support |
-[→ Get Founding Pro](https://buy.stripe.com/6oE5kF3Yz5Co06s9AB) · [→ Get Founding Team](https://buy.stripe.com/9AQ9AV5GH9SEdss6op)
+## Why
-*Founding pricing locks in forever — price goes up at 100 customers.*
+You're probably doing this:
----
-## How it works
-Every query gets fingerprinted across **9 axes** in under 1ms:
+```javascript
+const res = await openai.chat.completions.create({
+  model: "gpt-4o",  // $30/1M tokens — every single query
+  messages
+});
+```
-| Axis | What it measures |
-|---|---|
-| **Complexity** | Depth of reasoning required |
-| **Creativity** | Open-ended vs deterministic |
-| **Emotional Weight** | Sensitivity — crisis queries always go to Claude |
-| **Recency** | Need for live/current data → Grok |
-| **Context Load** | Window size needed → Claude 200k |
-| **Speed** | Latency sensitivity |
-| **Domain** | Code · Creative · Medical · Legal · General |
-| **Cost Tolerance** | Budget tier (overridable) |
-| **Signal** | Intent code — 777 (completion) · 555 (variation) · 333 (foundation) |
-Then it routes to the right model. Invisibly.
+"Write a haiku" does not need GPT-4o. Only ~15% of real queries do. ORBIT routes the other 85% to cheaper models with equivalent quality for the task.
 ---
 ## Usage
-### Zero-config routing decision
 ```javascript
-import orbit from 'orbit-ai'
+import orbit from '@gabrielsmartin/orbit-sdk'
+// Route a query — returns decision instantly, no network call
+const { model, reason, savings } = orbit.route("write a haiku about recursion")
+// model.name → "Claude Sonnet"
+// model.id   → "claude-sonnet-3-5"
+// reason     → "High creativity — Claude Sonnet for open-ended generation."
+// savings.reductionPct → 50
+// You then call the model yourself:
+const res = await anthropic.messages.create({
+  model: model.id,   // "claude-sonnet-3-5"
+  messages: [{ role: 'user', content: query }]
+})
+```
-const decision = orbit.route("write a haiku about recursion")
+**ORBIT picks the model. Your code makes the call.** This keeps your API keys yours and gives you full control.
-console.log(decision.model.name)    // "Claude Sonnet"
-console.log(decision.reason)        // "High creativity score (8/10)..."
-console.log(decision.savings)       // { savings: 0.007245, reductionPct: 97 }
-```
+---
-### With signal codes (v0.3.0+)
+## Examples
 ```javascript
-import { OrbitClient } from 'orbit-ai'
+import orbit from '@gabrielsmartin/orbit-sdk'
-const orbit = new OrbitClient({ log: true })
+orbit.route("what is 2+2?")
+// → Gemini 2.5 Flash | cost_gemini | 98% savings vs GPT-4o
-// 777 — completion mode, forces high-capability model
-orbit.route("finalize the architecture doc", { signal: "777" })
-// → Claude Sonnet (floor enforced)
+orbit.route("I've been feeling really anxious lately")
+// → Claude Sonnet | ethics_first | emotional weight — never a cheap model
-// 555 — variation mode, maximizes model diversity
-orbit.route("brainstorm 10 unexpected product names", { signal: "555" })
-// → Grok (variation bias)
+orbit.route("latest AI news today")
+// → Grok | recency_grok | live web access
-// 333 — foundation mode, minimizes cost
-orbit.route("summarize this paragraph", { signal: "333" })
-// → Gemini 2.5 Flash
+orbit.route("architect a distributed event-driven system")
+// → Claude Sonnet | complex_code | high complexity + reasoning
+orbit.route("summarize this in one sentence")
+// → Gemini 2.5 Flash | cost_gemini | low complexity, $0.50/1M tokens
 ```
-### With your own API keys
+---
-```javascript
-import { OrbitClient } from 'orbit-ai'
+## 8-Axis Classification
-const orbit = new OrbitClient({
-  cost_tolerance: 'low',  // 'low' | 'medium' | 'high'
-  log: true,
-})
+| Axis | What it measures |
+|------|----------------|
+| `complexity` | Depth of reasoning required |
+| `creativity` | Open-ended vs. factual generation |
+| `emotional_weight` | Sensitive or crisis content |
+| `recency` | Need for real-time / live web data |
+| `context_load` | Long-document or multi-turn depth |
+| `speed` | Latency sensitivity |
+| `domain` | Code, legal, medical, creative, general |
+| `cost_tolerance` | Budget flexibility (overridable) |
+Classification is keyword-based with tuned weights — fast and transparent. You can inspect `fingerprint.js` and see exactly how any query is scored.
-const { model, reason, savings } = orbit.route("explain blockchain simply")
-// [ORBIT] → Gemini 2.5 Flash | saved $0.01455 (97% reduction)
+---
-// model.id = 'gemini-2.5-flash', model.provider = 'google'
-// Call the model yourself with your keys
-```
+## Routing Table
+| Condition | Model | Rule |
+|-----------|-------|------|
+| Signal = 777 | Claude Sonnet | Completion — capability floor |
+| Signal = 555 | Grok | Variation — max diversity |
+| Signal = 333 | Gemini Flash | Foundation — cost floor |
+| `emotional_weight` ≥ 6 | Claude Sonnet | Safety-first, always |
+| `domain` = legal/medical | Claude Sonnet | Ethics + long context |
+| `recency` ≥ 7 | Grok | Live web access |
+| `complexity` ≥ 7 + code | Claude Sonnet | Deep reasoning |
+| `complexity` ≥ 7 | GPT-4o | Structured output |
+| `creativity` ≥ 5 | Claude Sonnet | Open-ended generation |
+| `complexity` ≤ 3 | Gemini 2.5 Flash | 98% cheaper, equivalent quality |
+| Default | Claude Sonnet | Safe fallback |
-### Full pipeline example
+---
+## API
 ```javascript
-import { OrbitClient } from 'orbit-ai'
-import Anthropic from '@anthropic-ai/sdk'
-import OpenAI from 'openai'
-import { GoogleGenerativeAI } from '@google/generative-ai'
-const orbit = new OrbitClient({ log: true })
-async function smartQuery(text, signal) {
-  const { model } = orbit.route(text, { signal })
-  if (model.provider === 'anthropic') {
-    const client = new Anthropic()
-    return client.messages.create({ model: model.id, max_tokens: 1024, messages: [{ role: 'user', content: text }] })
-  }
-  if (model.provider === 'openai') {
-    const client = new OpenAI()
-    return client.chat.completions.create({ model: model.id, messages: [{ role: 'user', content: text }] })
-  }
-  if (model.provider === 'google') {
-    const client = new GoogleGenerativeAI(process.env.GOOGLE_API_KEY)
-    return client.getGenerativeModel({ model: model.id }).generateContent(text)
-  }
-}
-await smartQuery("write a poem about the ocean")                    // → Claude Sonnet
-await smartQuery("what's the latest news on AI funding?")           // → Grok
-await smartQuery("what is 2+2")                                     // → Gemini Flash
-await smartQuery("I've been feeling really overwhelmed")            // → Claude Sonnet (ethics-first)
-await smartQuery("finalize the system design", { signal: "777" })   // → Claude Sonnet (forced)
-```
+import orbit, { OrbitClient, fingerprint } from '@gabrielsmartin/orbit-sdk'
+// Singleton client — zero config
+const { model, reason, rule, scores, savings } = orbit.route("your query")
+// Custom client
+const client = new OrbitClient({
+  cost_tolerance: 'low',          // 'low' | 'medium' | 'high'
+  blocked_models: ['gpt4o'],      // block specific models
+  apiKey: 'your-orbit-key',       // enables usage telemetry (optional)
+  signal: '333',                  // default signal code (optional)
+  log: true,                      // console.log routing decisions (default: true)
+  on_route: (decision) => {},     // callback on each routing decision
+})
-### Session stats
+// Fingerprint only — no routing
+const scores = orbit.fingerprint("your query")
+// → { complexity: 7, creativity: 2, emotional_weight: 0, recency: 0, ... }
-```javascript
+// Session stats
 const stats = orbit.stats()
-console.log(stats.total_savings_formatted)  // "$0.2341"
-console.log(stats.model_usage)              // { "Claude Sonnet": 4, "Gemini 2.5 Flash": 12, ... }
+// → { total_queries: 42, total_savings_formatted: '$1.2400', model_usage: { ... } }
 ```
 ---
-## Model matrix
+## Hosted API
-| Model | Provider | Cost/1M | Best for |
-|---|---|---|---|
-| Claude Sonnet 3.5 | Anthropic | $15 | Complex reasoning · Ethics · Long context |
-| Claude Haiku | Anthropic | $1 | Speed · Summaries · Medium tasks |
-| Gemini 2.5 Flash | Google | $0.50 | High volume · Simple queries · Cost |
-| GPT-4o | OpenAI | $30 | Structured output · Broad knowledge |
-| GPT-4o Mini | OpenAI | $0.30 | Classification · Filler tasks |
-| Grok | xAI | $10 | Trending · Real-time web |
+Free to try, no auth required:
----
+```bash
+curl -X POST https://gtll-soul-guide-81e596e1.base44.app/functions/orbitGateway \
+  -H "Content-Type: application/json" \
+  -d '{"query": "write a haiku about recursion"}'
+```
-## The math
+**Pricing:**
-Validated by [RouteLLM (UC Berkeley · ICLR 2025)](https://arxiv.org/abs/2406.18665): intelligent routing achieves **85% cost reduction** while maintaining 95% of quality.
+| Tier | Price | Limit |
+|------|-------|-------|
+| Free | $0/mo | 3 queries/day |
+| Pro | $19/mo | Unlimited |
+| Team | $99/mo | Unlimited · 5 seats |
-For a team running 100k queries/month at GPT-4o:
-- Without ORBIT: **$1,500/month**
-- With ORBIT: **~$225/month**
-- Savings: **$1,275/month · $15,300/year**
+→ [Pro](https://buy.stripe.com/6oE5kF3Yz5Co06s9AB) · [Team](https://buy.stripe.com/9AQ9AV5GH9SEdss6op)
 ---
-## Source
+## Research backing
-[github.com/gtllco/orbit](https://github.com/gtllco/orbit)
+- **RouteLLM (ICLR 2025, UC Berkeley):** intelligent routing achieves 85% cost reduction at 95% quality vs always-GPT-4o
+- **OpenRouter** ($500M+ valuation) proves the market. ORBIT adds the classification layer.
+- **Martian** (Accenture-backed) proves enterprises pay for routing. ORBIT is the open version.
 ---
 ## Roadmap
-- [x] 8-axis fingerprinting engine
-- [x] 6-model routing matrix
-- [x] TypeScript types
-- [x] Signal-aware routing (777 · 555 · 333)
-- [ ] Streaming support
-- [ ] Custom model matrix (bring your own models)
-- [ ] Automatic provider failover
-- [ ] Usage analytics dashboard
-- [ ] Browser extension
+- [x] v0.1.x — 8-axis classification, 6-model routing matrix
+- [x] v0.3.x — Signal-aware routing (777/555/333), hosted gateway
+- [ ] v0.4.0 — API key gated usage dashboard
+- [ ] v0.5.0 — Embedding-based fallback for ambiguous queries
+- [ ] v1.0.0 — Enterprise API + savings-share pricing
 ---
 ## License
-MIT · Built by [Gabriel Martin](https://github.com/gabrielsmartin)
+MIT © [Gabriel Martin](https://github.com/gabrielsmartin)
-*"Every model has a gravitational pull. ORBIT decides which one you need."*
+**[GitHub](https://github.com/gtllco/orbit) · [npm](https://www.npmjs.com/package/@gabrielsmartin/orbit-sdk)**
 777 · 555 · 333

package/package.json CHANGED Viewed

@@ -1,13 +1,18 @@
 {
   "name": "@gabrielsmartin/orbit-sdk",
-  "version": "0.3.3",
-  "description": "Intelligent AI model routing with signal layer. 85% cost savings. 777·555·333",
-  "main": "index.js",
-  "types": "index.d.ts",
+  "version": "0.4.0",
+  "description": "Rule-based LLM router. Classifies queries across 8 axes and picks the optimal model. Fast, deterministic, zero dependencies.",
+  "type": "module",
+  "main": "src/index.js",
+  "types": "src/index.d.ts",
+  "exports": {
+    ".": {
+      "import": "./src/index.js",
+      "types": "./src/index.d.ts"
+    }
+  },
   "files": [
-    "index.js",
-    "index.d.ts",
-    "src/",
+    "src",
     "README.md"
   ],
   "scripts": {
@@ -22,28 +27,25 @@
     "gemini",
     "orbit",
     "cost-optimization",
-    "model-routing"
+    "model-routing",
+    "selective-model-matching",
+    "gpt4",
+    "claude",
+    "gemini-flash",
+    "grok",
+    "ai-infrastructure"
   ],
-  "author": "Gabriel Martin <admin@gtll.app>",
+  "author": "Gabriel Martin <gabriel@gtll.app>",
   "license": "MIT",
   "repository": {
     "type": "git",
-    "url": "https://github.com/gtllco/orbit"
+    "url": "git+https://github.com/gtllco/orbit.git"
   },
-  "homepage": "https://orbit-sdk.base44.app",
-  "publishConfig": {
-    "registry": "https://registry.npmjs.org",
-    "access": "public"
-  },
-  "module": "index.js",
-  "type": "module",
-  "exports": {
-    ".": {
-      "import": "./index.js",
-      "types": "./index.d.ts"
-    }
+  "homepage": "https://github.com/gtllco/orbit",
+  "bugs": {
+    "url": "https://github.com/gtllco/orbit/issues"
   },
   "engines": {
-    "node": ">=18.0.0"
+    "node": ">=16"
   }
 }

package/src/index.js CHANGED Viewed

@@ -1,133 +1,130 @@
 /**
- * orbit-ai · v0.3.0
- * Intelligent AI model routing with signal-aware priority bias.
- * Drop in. Save 85%.
- *
- * https://orbitai.gtll.app
- * github.com/gtllco/orbit
- * npm: @gabrielsmartin/orbit-sdk
+ * @gabrielsmartin/orbit-sdk
+ * Rule-based LLM router. Fast, deterministic, zero dependencies.
+ * Picks the right model — you make the API call.
  *
+ * https://github.com/gtllco/orbit
  * 777 · 555 · 333
  */
 import { fingerprint } from './fingerprint.js';
 import { route, calculateSavings, MODEL_MATRIX } from './router.js';
-import { applySignalBias, inferSignalFromEvent, formatSignalResponse, SIGNAL_DESCRIPTIONS } from './signal.js';
 export { fingerprint, route, calculateSavings, MODEL_MATRIX };
-export { applySignalBias, inferSignalFromEvent, formatSignalResponse, SIGNAL_DESCRIPTIONS };
+const GATEWAY_URL = 'https://gtll-soul-guide-81e596e1.base44.app/functions/orbitGateway';
 /**
  * OrbitClient — the main class
- * Now with signal-aware routing (777 · 555 · 333)
  *
  * @example
  * import { OrbitClient } from '@gabrielsmartin/orbit-sdk'
- * const orbit = new OrbitClient()
- *
- * // Without signal — standard 8-axis routing
- * const result = orbit.route("summarize this contract")
- *
- * // With signal — priority-aware routing
- * const result = orbit.route("write the Q1 investor memo", { signal: "777" })
- * // → Claude Sonnet mandatory. This is final form.
  *
- * const result = orbit.route("what's a business model nobody's tried?", { signal: "555" })
- * // → Grok or Claude Sonnet. Destabilize the expected.
- *
- * const result = orbit.route("is this email spam?", { signal: "333" })
- * // → Gemini Flash. Strip cost. Foundation doesn't need premium.
+ * const orbit = new OrbitClient({ apiKey: 'your-key' })
+ * const { model, reason, scores } = orbit.route("explain quantum entanglement")
+ * // → { model: { name: 'Claude Sonnet', id: 'claude-sonnet-3-5', ... }, reason: '...', ... }
+ * // You then call the model using the provider SDK of your choice
  */
 export class OrbitClient {
   constructor(config = {}) {
     this.config = {
-      cost_tolerance: config.cost_tolerance || 'medium', // 'low' | 'medium' | 'high'
+      cost_tolerance: config.cost_tolerance || 'medium',
       blocked_models: config.blocked_models || [],
-      api_key: config.apiKey || config.api_key || null,
+      apiKey: config.apiKey || config.api_key || null,
+      signal: config.signal || null,
       log: config.log !== false,
       on_route: config.on_route || null,
-      // Provider API keys (optional — falls back to env vars)
-      anthropic_key: config.anthropic_key || null,
-      openai_key: config.openai_key || null,
-      google_key: config.google_key || null,
     };
     this._stats = {
       total_queries: 0,
       total_savings: 0,
       model_usage: {},
-      signal_usage: { '777': 0, '555': 0, '333': 0, none: 0 },
     };
   }
   /**
-   * Route a query to the optimal model.
-   * Signal codes bias routing before model selection.
+   * Route a query to the optimal model (local, <1ms).
+   * ORBIT picks the model — your code makes the API call.
    *
-   * @param {string} text - The query text
-   * @param {Object} options - Override options for this query
-   * @param {string} [options.signal] - "777" | "555" | "333" | null
-   * @param {string} [options.cost_tolerance] - "low" | "medium" | "high"
-   * @param {number} [options.estimated_tokens] - Token estimate for cost calc
-   * @returns {Object} decision - { model, reason, rule, scores, savings, signal_applied, signal_reason, estimated_cost }
+   * @param {string} text - The query or prompt text
+   * @param {Object} options - Per-query overrides: { cost_tolerance, signal, estimated_tokens, blocked_models }
+   * @returns {{ model, reason, rule, scores, savings, timestamp }}
    */
   route(text, options = {}) {
-    // 1. Fingerprint
-    const rawScores = fingerprint(text);
+    const scores = fingerprint(text);
-    // 2. Apply cost_tolerance override
     if (options.cost_tolerance) {
-      rawScores.cost_tolerance = options.cost_tolerance === 'low' ? 2
+      scores.cost_tolerance = options.cost_tolerance === 'low' ? 2
         : options.cost_tolerance === 'high' ? 9 : 5;
     }
-    // 3. Apply signal bias (777 / 555 / 333)
-    const signal = options.signal || null;
-    const scores = applySignalBias(rawScores, signal);
+    const config = {
+      ...this.config,
+      ...options,
+      signal: options.signal || this.config.signal || null,
+    };
-    // 4. Route
-    const config = { ...this.config, ...options };
     const decision = route(scores, config);
-    // 5. Calculate savings and cost
-    const estimatedTokens = options.estimated_tokens || 500;
-    const savings = calculateSavings(decision.model, estimatedTokens);
-    const estimatedCost = `$${savings.actualCost.toFixed(5)}`;
-    // 6. Format signal metadata
-    const signalMeta = formatSignalResponse(scores, decision);
+    const savings = calculateSavings(decision.model, options.estimated_tokens || 500);
     const result = {
       model: decision.model,
       reason: decision.reason,
       rule: decision.rule,
-      scores: rawScores, // return original scores, not biased
-      signal_applied: signalMeta.signal_applied,
-      signal_reason: signalMeta.signal_reason || null,
+      scores,
       savings,
-      estimated_cost: estimatedCost,
       timestamp: new Date().toISOString(),
     };
-    // Update stats
+    // Stats
     this._stats.total_queries++;
     this._stats.total_savings += savings.savings;
     const modelName = decision.model.name;
     this._stats.model_usage[modelName] = (this._stats.model_usage[modelName] || 0) + 1;
-    this._stats.signal_usage[signal || 'none']++;
-    // Log routing decision
     if (this.config.log) {
-      const signalTag = signal ? ` [signal:${signal}]` : '';
-      console.log(`[ORBIT]${signalTag} → ${decision.model.name} | ${decision.rule} | ${estimatedCost} (saved ${savings.reductionPct}%)`);
+      console.log(`[ORBIT] → ${decision.model.name} | ${decision.rule} | saved $${savings.savings.toFixed(5)} (${savings.reductionPct}% vs GPT-4o)`);
+    }
+    if (this.config.on_route) {
+      this.config.on_route(result);
+    }
+    // Fire telemetry to gateway (non-blocking, best-effort)
+    if (this.config.apiKey) {
+      this._telemetry(text, result).catch(() => {});
     }
-    if (this.config.on_route) this.config.on_route(result);
     return result;
   }
   /**
-   * Get cumulative stats for this session — including per-signal breakdown
+   * Send a routing decision to the ORBIT gateway for usage tracking.
+   * Only fires if apiKey is set. Non-blocking.
+   * @private
+   */
+  async _telemetry(query, decision) {
+    try {
+      await fetch(GATEWAY_URL, {
+        method: 'POST',
+        headers: { 'Content-Type': 'application/json' },
+        body: JSON.stringify({
+          query,
+          api_key: this.config.apiKey,
+          model_selected: decision.model.name,
+          rule: decision.rule,
+          signal: decision.scores?.signal || null,
+          savings_pct: decision.savings.reductionPct,
+        }),
+      });
+    } catch (_) {
+      // Silently ignore — telemetry is best-effort
+    }
+  }
+  /**
+   * Get cumulative routing stats for this session
    */
   stats() {
     return {
@@ -137,31 +134,20 @@ export class OrbitClient {
   }
   /**
-   * Fingerprint a query without routing
+   * Fingerprint a query without routing.
+   * Useful for debugging or building custom logic on top.
    */
   fingerprint(text) {
     return fingerprint(text);
   }
-  /**
-   * Apply signal bias to an existing fingerprint
-   * Useful for building custom routing logic on top of ORBIT
-   */
-  applySignal(fingerprint, signal_code) {
-    return applySignalBias(fingerprint, signal_code);
-  }
-  /**
-   * Infer signal from a neural hub event priority
-   * coral1 events tagged 777/555/333 auto-translate to signal codes
-   */
-  signalFromEvent(eventPriority) {
-    return inferSignalFromEvent(eventPriority);
-  }
 }
 /**
- * Default singleton client
+ * Default singleton client — zero config, ready to use
+ *
+ * @example
+ * import orbit from '@gabrielsmartin/orbit-sdk'
+ * const { model, reason } = orbit.route("write a haiku about recursion")
  */
 const orbit = new OrbitClient();
 export default orbit;

package/src/router.js CHANGED Viewed

@@ -1,14 +1,17 @@
 /**
- * ORBIT · Selective Model Matching (SMM) Router
- * Routes queries to optimal models based on 8-axis fingerprints + signal codes
+ * ORBIT · Model Router
+ * Rule-based query classifier — routes to the optimal LLM based on query characteristics.
+ * Fast, deterministic, zero dependencies. ~160 lines.
+ *
+ * Rules are hand-tuned heuristics, not learned weights.
+ * Safety-critical routes (emotional content, crisis) always win.
  *
- * Proprietary routing logic — open SDK, closed engine weights
  * 777 · 555 · 333
  */
 export const MODEL_MATRIX = {
   claude_sonnet: {
-    id: 'claude-sonnet-4-6',
+    id: 'claude-sonnet-3-5',
     name: 'Claude Sonnet',
     provider: 'anthropic',
     costPer1M: 15,
@@ -17,7 +20,7 @@ export const MODEL_MATRIX = {
     tier: 'medium',
   },
   claude_haiku: {
-    id: 'claude-haiku-4-5',
+    id: 'claude-haiku-3-5',
     name: 'Claude Haiku',
     provider: 'anthropic',
     costPer1M: 1,
@@ -64,153 +67,117 @@ export const MODEL_MATRIX = {
 };
 /**
- * Core SMM routing logic — Signal-aware
- * Returns the selected model + reasoning
+ * Route a query fingerprint to the best model.
+ * Returns { model, reason, rule }
+ *
+ * Note: ORBIT picks the model — your code makes the API call.
  *
- * @param {Object} scores - 8-axis fingerprint scores (post-signal-bias)
- * @param {Object} config - User config (cost_tolerance override, blocked_models, etc.)
- * @returns {Object} { model, reason, fallback }
+ * @param {Object} scores - 8-axis fingerprint from fingerprint()
+ * @param {Object} config - Optional overrides (blocked_models, cost_tolerance, signal)
+ * @returns {{ model: Object, reason: string, rule: string }}
  */
 export function route(scores, config = {}) {
   const {
     complexity, creativity, speed, emotional_weight,
-    recency, context_load, domain, cost_tolerance,
-    signal_code, variation_mode
+    recency, context_load, domain, cost_tolerance
   } = scores;
   const blocked = config.blocked_models || [];
-  const preferLow = cost_tolerance <= 3;
-  const preferHigh = cost_tolerance >= 8;
+  const preferLow = config.cost_tolerance === 'low' || cost_tolerance <= 3;
-  // ── SIGNAL OVERRIDES (applied before all other rules) ──────────────────────
-  // 777 — Completion Bias: cost_tolerance and complexity already raised by applySignalBias.
-  // But explicitly block sub-tier models when signal=777.
-  if (signal_code === '777') {
-    // Minimum floor: Claude Haiku. Prefer Sonnet if complexity >= 5.
-    if (complexity >= 5 && !blocked.includes('claude_sonnet')) {
-      return {
-        model: MODEL_MATRIX.claude_sonnet,
-        reason: `Completion bias (777) — complexity ${complexity}/10 meets threshold. Claude Sonnet mandatory. This output is final form.`,
-        rule: 'signal_777_sonnet',
-      };
-    }
-    // complexity < 5 but still 777: Claude Haiku minimum, no Gemini Flash/GPT-4o Mini
-    if (!blocked.includes('claude_haiku')) {
-      return {
-        model: MODEL_MATRIX.claude_haiku,
-        reason: `Completion bias (777) — complexity ${complexity}/10 below Sonnet threshold but 777 enforces Claude Haiku minimum. No sub-tier models on completion events.`,
-        rule: 'signal_777_haiku',
-      };
-    }
+  // Signal override (777/555/333) — always wins if set
+  if (config.signal === '777') {
+    return { model: MODEL_MATRIX.claude_sonnet, reason: '777 — Completion. Claude Sonnet floor enforced.', rule: 'signal_777' };
   }
-  // 555 — Variation Bias: variation_mode=true, creativity and recency already boosted.
-  // Prefer Grok when recency is elevated. Otherwise prefer creative non-default models.
-  if (signal_code === '555') {
-    if (recency >= 5 && !blocked.includes('grok')) {
-      return {
-        model: MODEL_MATRIX.grok,
-        reason: `Variation bias (555) — recency boosted to ${recency}/10. Grok for live web intelligence and unexpected angles. Destabilize the expected.`,
-        rule: 'signal_555_grok',
-      };
-    }
-    if (creativity >= 6 && !blocked.includes('claude_sonnet')) {
-      return {
-        model: MODEL_MATRIX.claude_sonnet,
-        reason: `Variation bias (555) — creativity at ${creativity}/10. Claude Sonnet for nuanced, surprising creative output.`,
-        rule: 'signal_555_claude',
-      };
-    }
+  if (config.signal === '555') {
+    return { model: MODEL_MATRIX.grok, reason: '555 — Variation. Maximum model diversity.', rule: 'signal_555' };
+  }
+  if (config.signal === '333') {
+    return { model: MODEL_MATRIX.gemini_flash, reason: '333 — Foundation. Minimum cost floor.', rule: 'signal_333' };
   }
-  // 333 — Foundation Bias: cost_tolerance dropped to 1 by applySignalBias (unless emotional override).
-  // Emotional safety net is handled by ethics rule below — it fires first.
-  // ── CORE ROUTING RULES ─────────────────────────────────────────────────────
-  // Rule 1: ETHICS FIRST — emotional/crisis queries always go to Claude (even on 333)
+  // Rule 1: SAFETY — emotional/crisis always Claude Sonnet
   if (emotional_weight >= 6) {
     return {
       model: MODEL_MATRIX.claude_sonnet,
-      reason: `Emotional weight ${emotional_weight}/10 — routing to Claude for ethics-first handling. Never use a cheap model for sensitive content.${signal_code === '333' ? ' (333 foundation bias overridden by emotional safety rule)' : ''}`,
+      reason: 'Emotional weight detected — Claude Sonnet for ethics-first handling. Never route sensitive content to a cost-optimized model.',
       rule: 'ethics_first',
     };
   }
-  // Rule 2: Realtime / current events → Grok
-  if (recency >= 7 && !blocked.includes('grok') && signal_code !== '777') {
+  // Rule 2: Realtime → Grok
+  if (recency >= 7 && !blocked.includes('grok')) {
     return {
       model: MODEL_MATRIX.grok,
-      reason: `High recency score (${recency}/10) — Grok has live web access for current events and trending topics.`,
+      reason: `High recency (${recency}/10) — Grok for live web access.`,
       rule: 'recency_grok',
     };
   }
-  // Rule 3: Long context load → Claude Sonnet (200k window)
-  if (context_load >= 8 && !blocked.includes('claude_sonnet') && signal_code !== '333') {
+  // Rule 3: Long context → Claude (200k window)
+  if (context_load >= 8 && !blocked.includes('claude_sonnet')) {
     return {
       model: MODEL_MATRIX.claude_sonnet,
-      reason: `High context load (${context_load}/10) — Claude's 200k window is the only safe choice.`,
+      reason: `High context load (${context_load}/10) — Claude's 200k window.`,
       rule: 'context_claude',
     };
   }
-  // Rule 4: High complexity code/reasoning
+  // Rule 4: Complex code → Claude Sonnet
   if (complexity >= 7 && domain === 'code' && !blocked.includes('claude_sonnet')) {
     return {
       model: MODEL_MATRIX.claude_sonnet,
-      reason: `Complex code task (complexity ${complexity}/10) — Claude Sonnet for deep reasoning and long context.`,
+      reason: `Complex code (complexity ${complexity}/10) — Claude Sonnet for deep reasoning.`,
       rule: 'complex_code',
     };
   }
-  // Rule 5: High complexity general → GPT-4o (if cost tolerance allows and not 777/333)
-  if (complexity >= 7 && !preferLow && !blocked.includes('gpt4o') && signal_code !== '777' && signal_code !== '333') {
+  // Rule 5: High complexity general → GPT-4o
+  if (complexity >= 7 && !preferLow && !blocked.includes('gpt4o')) {
     return {
       model: MODEL_MATRIX.gpt4o,
-      reason: `High complexity (${complexity}/10) — GPT-4o for broad knowledge and structured output.`,
+      reason: `High complexity (${complexity}/10) — GPT-4o for structured output.`,
       rule: 'complex_gpt4o',
     };
   }
-  // Rule 6: Creative writing → Claude Sonnet (unless 333 forcing minimum)
-  if (creativity >= 5 && !blocked.includes('claude_sonnet') && !preferLow) {
+  // Rule 6: Creative → Claude Sonnet
+  if (creativity >= 5 && !blocked.includes('claude_sonnet')) {
     return {
       model: MODEL_MATRIX.claude_sonnet,
-      reason: `High creativity score (${creativity}/10) — Claude Sonnet for nuanced creative writing.`,
+      reason: `High creativity (${creativity}/10) — Claude Sonnet for open-ended generation.`,
       rule: 'creative_claude',
     };
   }
-  // Rule 7: Cost sensitive OR simple queries OR 333 foundation → Gemini Flash
+  // Rule 7: Simple / cost-sensitive → Gemini Flash
   if ((preferLow || complexity <= 3) && !blocked.includes('gemini_flash')) {
     return {
       model: MODEL_MATRIX.gemini_flash,
-      reason: `${signal_code === '333' ? 'Foundation bias (333) — ' : ''}Low complexity (${complexity}/10) — Gemini 2.5 Flash delivers 95% quality at 2% of GPT-4o cost.`,
+      reason: `Low complexity (${complexity}/10) — Gemini Flash at $0.50/1M tokens.`,
       rule: 'cost_gemini',
     };
   }
-  // Rule 8: Medium complexity → Claude Haiku (fast + cheap + capable)
+  // Rule 8: Medium → Claude Haiku
   if (complexity <= 5 && !blocked.includes('claude_haiku')) {
     return {
       model: MODEL_MATRIX.claude_haiku,
-      reason: `Medium complexity (${complexity}/10) — Claude Haiku balances speed, cost, and quality.`,
+      reason: `Medium complexity (${complexity}/10) — Claude Haiku for speed and quality balance.`,
       rule: 'medium_haiku',
     };
   }
-  // Default: Claude Sonnet (safest general choice)
+  // Default
   return {
     model: MODEL_MATRIX.claude_sonnet,
-    reason: 'Default routing — Claude Sonnet for reliable, high-quality responses.',
+    reason: 'Default — Claude Sonnet for reliable high-quality output.',
     rule: 'default',
   };
 }
 /**
- * Calculate savings vs always using GPT-4o (premium baseline)
+ * Estimate cost savings vs always using GPT-4o
  */
 export function calculateSavings(selectedModel, estimatedTokens = 500) {
   const premiumCost = (MODEL_MATRIX.gpt4o.costPer1M / 1_000_000) * estimatedTokens;

package/index.js DELETED Viewed

@@ -1,14 +0,0 @@
-/**
- * @gabrielsmartin/orbit-sdk
- * Intelligent AI model routing — routes every query to the optimal model in <1ms
- *
- * 777 · 555 · 333
- * github.com/gtllco/orbit
- */
-export { fingerprint, route, calculateSavings, MODEL_MATRIX, OrbitClient } from './src/index.js';
-// Default export — zero-config instance
-import { OrbitClient } from './src/index.js';
-const orbit = new OrbitClient();
-export default orbit;

package/src/signalBias.js DELETED Viewed

@@ -1,85 +0,0 @@
-/**
- * ORBIT Signal Layer — Semantic Intent Routing Bias
- *
- * Signal codes are semantic flags that travel with a query and adjust the routing
- * decision before model selection. They connect ORBIT to the organizational priority
- * layer (the neural hub, event priorities, etc).
- *
- * 777 · 555 · 333
- */
-/**
- * Apply signal-based routing bias to a fingerprint
- * Modifies the fingerprint scores before model selection happens
- *
- * @param {Object} fingerprint - 8-axis scores from orbitFingerprint()
- * @param {string} signal - '777' | '555' | '333' | null
- * @returns {Object} biased fingerprint
- */
-export function applySignalBias(fingerprint, signal) {
-  if (!signal) return fingerprint;
-  const biased = { ...fingerprint };
-  if (signal === '777') {
-    // COMPLETION BIAS
-    // 777 = This output is final. Quality floor raised. Never cut corners.
-    // - Force high-capability model floor
-    // - Never route to sub-tier models (Flash, Mini, Haiku)
-    // - If complexity >= 5: Claude Sonnet mandatory
-    // - If complexity < 5: Claude Haiku minimum
-    // - If emotional_weight >= 7: Claude always (never change this)
-    biased.cost_tolerance = Math.max(biased.cost_tolerance, 7);
-    biased.complexity = Math.max(biased.complexity, 5);
-    biased.signal_applied = '777';
-    biased.signal_reason = 'Completion bias — cost floor raised, quality mandatory';
-  }
-  if (signal === '555') {
-    // VARIATION BIAS
-    // 555 = This query is exploratory. Break the expected pattern. Surprise.
-    // - Introduce controlled model diversity
-    // - Prefer non-default choices
-    // - If creativity >= 5: weight variation higher
-    // - If recency >= 6: Perplexity-like model over Claude
-    // - If complexity >= 6: allow GPT-4o instead of Claude
-    biased.creativity = Math.max(biased.creativity, 5);
-    biased.recency = Math.max(biased.recency, 4);
-    biased.variation_mode = true;
-    biased.signal_applied = '555';
-    biased.signal_reason = 'Variation bias — introduce model diversity, break the pattern';
-  }
-  if (signal === '333') {
-    // FOUNDATION BIAS
-    // 333 = This is ambient/background. Strip to minimum. Cost floor.
-    // - Aggressively route to minimum viable model
-    // - If emotional_weight < 7: force cost_tolerance to 1 (ignore user config)
-    // - If complexity > 5: cap it at 4 (don't overpay)
-    // - Exception: emotional_weight >= 7 ALWAYS upgrades to Claude
-    //   (never route crisis/sensitive to cheap models, even on 333)
-    if (biased.emotional_weight < 7) {
-      biased.cost_tolerance = 1; // force minimum cost
-      biased.complexity = Math.min(biased.complexity, 4); // cap complexity
-    }
-    biased.signal_applied = '333';
-    biased.signal_reason = 'Foundation bias — cost floor, ambient routing';
-  }
-  return biased;
-}
-/**
- * Create signal explanation for response
- */
-export function getSignalExplanation(signal) {
-  const explanations = {
-    '777': 'Completion bias applied — cost floor raised, complexity floor raised. Routed to highest-capability model.',
-    '555': 'Variation bias applied — model diversity prioritized. Unexpected routing choice for exploratory output.',
-    '333': 'Foundation bias applied — cost floor enforced. Minimum viable model selected for ambient routing.',
-  };
-  return explanations[signal] || null;
-}

/package/{index.d.ts → src/index.d.ts} RENAMED Viewed

File without changes