npm - @relayplane/proxy - Versions diffs - 0.1.10 → 0.2.0 - Mend

@relayplane/proxy 0.1.10 → 0.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (43) hide show

package/README.md +112 -298
package/__tests__/server.test.ts +512 -0
package/__tests__/telemetry.test.ts +126 -0
package/dist/cli.d.ts +35 -0
package/dist/cli.d.ts.map +1 -0
package/dist/cli.js +262 -3115
package/dist/cli.js.map +1 -1
package/dist/config.d.ts +80 -0
package/dist/config.d.ts.map +1 -0
package/dist/config.js +208 -0
package/dist/config.js.map +1 -0
package/dist/index.d.ts +25 -1130
package/dist/index.d.ts.map +1 -0
package/dist/index.js +72 -3096
package/dist/index.js.map +1 -1
package/dist/server.d.ts +209 -0
package/dist/server.d.ts.map +1 -0
package/dist/server.js +1089 -0
package/dist/server.js.map +1 -0
package/dist/streaming.d.ts +80 -0
package/dist/streaming.d.ts.map +1 -0
package/dist/streaming.js +271 -0
package/dist/streaming.js.map +1 -0
package/dist/telemetry.d.ts +111 -0
package/dist/telemetry.d.ts.map +1 -0
package/dist/telemetry.js +315 -0
package/dist/telemetry.js.map +1 -0
package/package.json +21 -46
package/src/cli.ts +341 -0
package/src/config.ts +206 -0
package/src/index.ts +82 -0
package/src/server.ts +1328 -0
package/src/streaming.ts +331 -0
package/src/telemetry.ts +343 -0
package/tsconfig.json +19 -0
package/vitest.config.ts +21 -0
package/LICENSE +0 -21
package/dist/cli.d.mts +0 -1
package/dist/cli.mjs +0 -3134
package/dist/cli.mjs.map +0 -1
package/dist/index.d.mts +0 -1141
package/dist/index.mjs +0 -3039
package/dist/index.mjs.map +0 -1

package/README.md CHANGED Viewed

@@ -1,371 +1,185 @@
 # @relayplane/proxy
-**100% Local. Zero Cloud. Full Control.**
+Intelligent AI model routing proxy for cost optimization and observability.
-Intelligent AI model routing that cuts costs by 50-80% while maintaining quality.
-> **Note:** Designed for standard API key users (`ANTHROPIC_API_KEY`, `OPENAI_API_KEY`). MAX subscription OAuth is not currently supported — MAX users should continue using their provider directly.
-> ⚠️ **Cost Monitoring Required**
->
-> RelayPlane routes requests to LLM providers using your API keys. **This incurs real costs.**
->
-> - Set up billing alerts with your providers (Anthropic, OpenAI, etc.)
-> - Monitor usage through your provider's dashboard
-> - Use `/relayplane stats` or `curl localhost:3001/control/stats` to track usage
-> - Start with test requests to understand routing behavior
->
-> RelayPlane provides cost *optimization*, not cost *elimination*. You are responsible for monitoring your actual spending.
-[![CI](https://github.com/RelayPlane/proxy/actions/workflows/ci.yml/badge.svg)](https://github.com/RelayPlane/proxy/actions/workflows/ci.yml)
-[![npm version](https://img.shields.io/npm/v/@relayplane/proxy)](https://www.npmjs.com/package/@relayplane/proxy)
-[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
-## Install
-```bash
-npm install @relayplane/proxy
-```
-Or run directly:
+## Installation
 ```bash
-npx @relayplane/proxy
-```
-## CLI Commands
-```bash
-# Start the proxy server
-npx @relayplane/proxy
-# Start on custom port
-npx @relayplane/proxy --port 8080
-# View routing statistics
-npx @relayplane/proxy stats
-# View stats for last 30 days
-npx @relayplane/proxy stats --days 30
-# Show help
-npx @relayplane/proxy --help
-```
-## OpenClaw Slash Commands
-If you're using OpenClaw, these chat commands are available:
-| Command | Description |
-|---------|-------------|
-| `/relayplane stats` | Show usage statistics and cost savings |
-| `/relayplane status` | Show proxy health and configuration |
-| `/relayplane switch <mode>` | Change routing mode (auto\|cost\|fast\|quality) |
-| `/relayplane models` | List available routing models |
-Example:
-```
-/relayplane stats
-/relayplane switch cost
+npm install -g @relayplane/proxy
 ```
 ## Quick Start
-### 1. Set your API keys
 ```bash
-export ANTHROPIC_API_KEY="sk-ant-..."
-export OPENAI_API_KEY="sk-..."
-# Optional: GEMINI_API_KEY, XAI_API_KEY, MOONSHOT_API_KEY
-```
-### 2. Start the proxy
-```bash
-npx @relayplane/proxy --port 3001
-```
+# Set your API keys
+export ANTHROPIC_API_KEY=your-key
+export OPENAI_API_KEY=your-key
-### 3. Point your tools to the proxy
+# Start the proxy
+relayplane-proxy
-```bash
+# Configure your tools to use the proxy
 export ANTHROPIC_BASE_URL=http://localhost:3001
 export OPENAI_BASE_URL=http://localhost:3001
-# Now run OpenClaw, Cursor, Aider, or any tool
-openclaw
-```
-That's it. All API calls now route through RelayPlane for intelligent model selection.
-## How It Works
-```
-Your Tool (OpenClaw, Cursor, etc.)
-         │
-         ▼
-    RelayPlane Proxy
-    ├── Infers task type (code_review, analysis, etc.)
-    ├── Checks routing rules
-    ├── Selects optimal model (Haiku for simple, Opus for complex)
-    ├── Tracks outcomes (success/failure/latency)
-    └── Learns patterns → improves over time
-         │
-         ▼
-    Provider (Anthropic, OpenAI, etc.)
+# Run your AI tools (Claude Code, Cursor, Aider, etc.)
 ```
-## Learning & Adaptation
+## Features
-RelayPlane doesn't just route — it **learns from every request**:
+- **Intelligent Routing**: Routes requests to the optimal model based on task type
+- **Cost Tracking**: Tracks and reports API costs across all providers
+- **Provider Agnostic**: Works with Anthropic, OpenAI, Gemini, xAI, and more
+- **Local Learning**: Learns from your usage patterns to improve routing
+- **Privacy First**: Never sees your prompts or responses
-- **Outcome Tracking** — Records success/failure for each route decision
-- **Pattern Detection** — Identifies what works for your specific codebase
-- **Continuous Improvement** — Routing gets smarter the more you use it
-- **Local Intelligence** — All learning happens in your local SQLite DB
+## CLI Options
 ```bash
-# View your routing stats (last 7 days)
-npx @relayplane/proxy stats
+relayplane-proxy [command] [options]
-# View last 30 days
-npx @relayplane/proxy stats --days 30
+Commands:
+  (default)              Start the proxy server
+  telemetry [on|off|status]  Manage telemetry settings
+  stats                  Show usage statistics
+  config                 Show configuration
-# Query the raw data directly
-sqlite3 ~/.relayplane/data.db "SELECT model, task_type, COUNT(*) FROM runs GROUP BY model, task_type"
+Options:
+  --port <number>    Port to listen on (default: 3001)
+  --host <string>    Host to bind to (default: 127.0.0.1)
+  --offline          Disable all network calls except LLM endpoints
+  --audit            Show telemetry payloads before sending
+  -v, --verbose      Enable verbose logging
+  -h, --help         Show this help message
+  --version          Show version
 ```
-Unlike static routing rules, RelayPlane adapts to **your** usage patterns.
-## Supported Providers
+## Telemetry
-| Provider | Models | Streaming | Tools |
-|----------|--------|-----------|-------|
-| **Anthropic** | Claude 3.5 Haiku, Sonnet 4, Opus 4.5 | ✓ | ✓ |
-| **OpenAI** | GPT-4o, GPT-4o-mini, GPT-4.1, o1, o3 | ✓ | ✓ |
-| **Google** | Gemini 2.0 Flash, Gemini Pro | ✓ | ✓ |
-| **xAI** | Grok (grok-*) | ✓ | ✓ |
-| **Moonshot** | Moonshot v1 (8k, 32k, 128k) | ✓ | ✓ |
+RelayPlane collects anonymous telemetry to improve model routing. This data helps us understand usage patterns and optimize routing decisions.
-## Routing Modes
+### What We Collect (Exact Schema)
-| Model | Description |
-|-------|-------------|
-| `relayplane:auto` | Infers task type, routes to optimal model |
-| `relayplane:cost` | Prioritizes cheapest models (maximum savings) |
-| `relayplane:quality` | Uses best available model |
-Or pass through explicit models: `claude-3-5-sonnet-latest`, `gpt-4o`, etc.
-## Why RelayPlane?
-| Without RelayPlane | With RelayPlane |
-|-------------------|-----------------|
-| Pay Opus token rates for simple tasks | Route simple tasks to Haiku (1/10 the cost) |
-| Static model selection | Learns from outcomes over time |
-| Manual optimization | Automatic cost-quality balance |
-| No visibility into spend | Built-in savings tracking |
+```json
+{
+  "device_id": "anon_8f3a...",
+  "task_type": "code_review",
+  "model": "claude-3-5-haiku",
+  "tokens_in": 1847,
+  "tokens_out": 423,
+  "latency_ms": 2341,
+  "success": true,
+  "cost_usd": 0.02
+}
+```
-## Key Features
+### Field Descriptions
-- **100% Local** — All data in SQLite (`~/.relayplane/data.db`)
-- **Zero Friction** — Set 2 env vars, done
-- **Learning** — Improves routing based on outcomes
-- **Full Streaming** — SSE support for all providers
-- **Tool Calls** — Function calling across providers
+| Field | Type | Description |
+|-------|------|-------------|
+| `device_id` | string | Anonymous random ID (not fingerprintable) |
+| `task_type` | string | Inferred from token patterns, NOT prompt content |
+| `model` | string | The model that handled the request |
+| `tokens_in` | number | Input token count |
+| `tokens_out` | number | Output token count |
+| `latency_ms` | number | Request latency in milliseconds |
+| `success` | boolean | Whether the request succeeded |
+| `cost_usd` | number | Estimated cost in USD |
-## Programmatic Usage
+### Task Types
-```typescript
-import { startProxy, RelayPlane, calculateSavings } from '@relayplane/proxy';
+Task types are inferred from request characteristics (token counts, ratios, etc.) - never from prompt content:
-// Start the proxy
-await startProxy({ port: 3001, verbose: true });
+- `quick_task` - Short input/output (< 500 tokens each)
+- `code_review` - Medium-long input, medium output
+- `generation` - High output/input ratio
+- `classification` - Low output/input ratio, short output
+- `long_context` - Input > 10,000 tokens
+- `content_generation` - Output > 1,000 tokens
+- `tool_use` - Request includes tool calls
+- `general` - Default classification
-// Or use RelayPlane directly
-const relay = new RelayPlane({});
-const result = await relay.run({ prompt: 'Review this code...' });
-console.log(result.taskType); // 'code_review'
-console.log(result.model);    // 'anthropic:claude-3-5-haiku-latest'
+### What We NEVER Collect
-// Check savings
-const savings = calculateSavings(relay.store, 30);
-console.log(`Saved ${savings.savingsPercent}% this month`);
+- ❌ Your prompts
+- ❌ Model responses
+- ❌ File paths or contents
+- ❌ Anything that could identify you or your project
-relay.close();
-```
+### Verification
-## CLI Options
+You can verify exactly what data is collected:
 ```bash
-npx @relayplane/proxy [options]
-Options:
-  --port <number>    Port to listen on (default: 3001)
-  --host <string>    Host to bind to (default: 127.0.0.1)
-  -v, --verbose      Enable verbose logging
-  -h, --help         Show help
-```
-## REST API
-The proxy exposes control endpoints for stats and monitoring:
+# See telemetry payloads before they're sent
+relayplane-proxy --audit
-### `GET /control/status`
+# Disable all telemetry transmission
+relayplane-proxy --offline
-Proxy status and current configuration.
-```bash
-curl http://localhost:3001/control/status
+# View the source code
+# https://github.com/RelayPlane/proxy
 ```
-```json
-{
-  "enabled": true,
-  "mode": "cascade",
-  "modelOverrides": {}
-}
-```
+### Opt-Out
-### `GET /control/stats`
-Aggregated statistics and routing counts.
+To disable telemetry completely:
 ```bash
-curl http://localhost:3001/control/stats
+relayplane-proxy telemetry off
 ```
-```json
-{
-  "uptimeMs": 3600000,
-  "uptimeFormatted": "60m 0s",
-  "totalRequests": 142,
-  "successfulRequests": 138,
-  "failedRequests": 4,
-  "successRate": "97.2%",
-  "avgLatencyMs": 1203,
-  "escalations": 12,
-  "routingCounts": {
-    "auto": 100,
-    "cost": 30,
-    "passthrough": 12
-  },
-  "modelCounts": {
-    "anthropic/claude-3-5-haiku-latest": 98,
-    "anthropic/claude-sonnet-4-20250514": 44
-  }
-}
-```
-### `POST /control/enable` / `POST /control/disable`
-Enable or disable routing (passthrough mode when disabled).
+To re-enable:
 ```bash
-curl -X POST http://localhost:3001/control/enable
-curl -X POST http://localhost:3001/control/disable
+relayplane-proxy telemetry on
 ```
-### `POST /control/config`
-Update configuration (hot-reload, merges with existing).
+Check current status:
 ```bash
-curl -X POST http://localhost:3001/control/config \
-  -H "Content-Type: application/json" \
-  -d '{"routing": {"mode": "cascade"}}'
+relayplane-proxy telemetry status
 ```
 ## Configuration
-RelayPlane creates a config file on first run at `~/.relayplane/config.json`:
-```json
-{
-  "enabled": true,
-  "routing": {
-    "mode": "cascade",
-    "cascade": {
-      "enabled": true,
-      "models": [
-        "claude-3-haiku-20240307",
-        "claude-3-5-sonnet-20241022",
-        "claude-3-opus-20240229"
-      ],
-      "escalateOn": "uncertainty",
-      "maxEscalations": 1
-    },
-    "complexity": {
-      "enabled": true,
-      "simple": "claude-3-haiku-20240307",
-      "moderate": "claude-3-5-sonnet-20241022",
-      "complex": "claude-3-opus-20240229"
-    }
-  },
-  "reliability": {
-    "cooldowns": {
-      "enabled": true,
-      "allowedFails": 3,
-      "windowSeconds": 60,
-      "cooldownSeconds": 120
-    }
-  },
-  "modelOverrides": {}
-}
-```
-**Edit and save — changes apply instantly** (hot-reload, no restart needed).
-### Configuration Options
-| Field | Description |
-|-------|-------------|
-| `enabled` | Enable/disable routing (false = passthrough mode) |
-| `routing.mode` | `"cascade"` or `"standard"` |
-| `routing.cascade.models` | Ordered list of models to try (cheapest first) |
-| `routing.cascade.escalateOn` | When to escalate: `"uncertainty"`, `"refusal"`, or `"error"` |
-| `routing.complexity.simple/moderate/complex` | Models for each complexity level |
-| `reliability.cooldowns` | Auto-disable failing providers temporarily |
-| `modelOverrides` | Map input model names to different targets |
+Configuration is stored in `~/.relayplane/config.json`.
-### Examples
+### Set API Key (Pro Features)
-Use GPT-4o for complex tasks:
-```json
-{
-  "routing": {
-    "complexity": {
-      "complex": "gpt-4o"
-    }
-  }
-}
+```bash
+relayplane-proxy config set-key your-api-key
 ```
-Override a specific model:
-```json
-{
-  "modelOverrides": {
-    "claude-3-opus": "claude-3-5-sonnet-20241022"
-  }
-}
+### View Configuration
+```bash
+relayplane-proxy config
 ```
-## Data Storage
+## Usage Statistics
-All data stored locally at `~/.relayplane/data.db` (SQLite).
+View your usage statistics:
 ```bash
-# View recent runs
-sqlite3 ~/.relayplane/data.db "SELECT * FROM runs ORDER BY created_at DESC LIMIT 10"
-# Check routing rules
-sqlite3 ~/.relayplane/data.db "SELECT * FROM routing_rules"
+relayplane-proxy stats
 ```
-## Links
+This shows:
+- Total requests and cost
+- Success rate
+- Breakdown by model
+- Breakdown by task type
+## Environment Variables
-- [RelayPlane Proxy](https://relayplane.com/integrations/openclaw)
-- [GitHub](https://github.com/RelayPlane/proxy)
-- [RelayPlane](https://relayplane.com/)
+| Variable | Description |
+|----------|-------------|
+| `ANTHROPIC_API_KEY` | Anthropic API key |
+| `OPENAI_API_KEY` | OpenAI API key |
+| `GEMINI_API_KEY` | Google Gemini API key |
+| `XAI_API_KEY` | xAI/Grok API key |
+| `MOONSHOT_API_KEY` | Moonshot API key |
 ## License