@modelrelay/sdk 0.2.0 → 0.4.0

Files changed (6)

package/README.md CHANGED

@@ -1,6 +1,6 @@
 # ModelRelay TypeScript SDK
-Typed client for browsers and Node.js that wraps the ModelRelay API. It exchanges publishable keys for frontend tokens, starts Stripe Checkout for end users, and proxies chat completions with SSE handling.
+Typed client for Node.js that wraps the ModelRelay API for **consuming** LLM/usage endpoints. Use secret API keys or bearer tokens issued by your backend; publishable-key frontend token flows have been removed.
 ## Installation
@@ -14,21 +14,12 @@ bun add @modelrelay/sdk
 ```ts
 import { ModelRelay } from "@modelrelay/sdk";
-// Use a publishable key for client-side apps.
+// Use a secret key or bearer token from your backend.
 const mr = new ModelRelay({
-  key: "mr_pk_...",
-  endUser: { id: "user-123" } // required when using publishable keys
+  key: "mr_sk_..."
 });
-// 1. Start an end-user Stripe checkout.
-await mr.billing.checkout({
-  endUserId: "user-123",
-  planId: "pro-plan",
-  successUrl: window.location.origin + "/success",
-  cancelUrl: window.location.origin + "/cancel"
-});
-// 2. Stream chat completions (publishable keys are exchanged for a frontend token automatically).
+// Stream chat completions.
 const stream = await mr.chat.completions.create({
   model: "grok-4-1-fast-reasoning",
   messages: [{ role: "user", content: "Hello" }]
@@ -41,18 +32,9 @@ for await (const event of stream) {
 }
 ```
-### Manual frontend token exchange
-If you need to mint the token yourself (e.g., to store in your app state):
-```ts
-const token = await mr.auth.frontendToken({ userId: "user-123" });
-const chat = new ModelRelay({ token: token.token });
-```
 ### Server-side usage
-Provide a secret API key or bearer token instead of a publishable key:
+Provide a secret API key or bearer token:
 ```ts
 const mr = new ModelRelay({ key: "mr_sk_..." });
@@ -72,15 +54,115 @@ console.log(completion.content.join(""));
 - **Environments**: `environment: "production" | "staging" | "sandbox"` or override `baseUrl`.
 - **Auth**: pass a secret/publishable `key` or a bearer `token`. Publishable keys mint frontend tokens automatically.
-- **Timeouts & retries**: `timeoutMs` (default 60s, set `0` to disable) and `retry` (`{ maxAttempts, baseBackoffMs, maxBackoffMs, retryPost }` or `false` to disable).
+- **Timeouts & retries**: `connectTimeoutMs` (default 5s per attempt) and `timeoutMs` (default 60s overall; set `0` to disable). Per-call overrides available on `chat.completions.create`. `retry` config (`{ maxAttempts, baseBackoffMs, maxBackoffMs, retryPost }` or `false`) controls exponential backoff with jitter.
 - **Headers & metadata**: `defaultHeaders` are sent with every request; `defaultMetadata` merges into every chat request and can be overridden per-call via `metadata`.
 - **Client header**: set `clientHeader` to override the telemetry header (defaults to `modelrelay-ts/<version>`).
+### Timeouts & retry examples
+```ts
+// Shorten connect + request timeouts globally
+const mr = new ModelRelay({
+  key: "mr_sk_...",
+  connectTimeoutMs: 3_000,
+  timeoutMs: 20_000,
+  retry: { maxAttempts: 4, baseBackoffMs: 200, maxBackoffMs: 2_000 }
+});
+// Per-call overrides (blocking)
+await mr.chat.completions.create(
+  { model: "grok-4-1-fast-reasoning", messages: [{ role: "user", content: "Hi" }], stream: false },
+  { timeoutMs: 5_000, retry: false }
+);
+// Streaming: keep connect timeout but disable request timeout
+const stream = await mr.chat.completions.create(
+  { model: "grok-4-1-fast-reasoning", messages: [{ role: "user", content: "Hi" }] },
+  { connectTimeoutMs: 2_000 } // request timeout is already disabled for streams by default
+);
+```
+### Typed models, providers, and stop reasons
+- Models and providers use string literal unions with an `Other` escape hatch: pass `{ other: "my-provider" }` or `{ other: "custom/model-x" }` to preserve custom IDs while benefiting from autocomplete on known values (e.g., `Models.OpenAIGpt4o`, `Providers.Anthropic`).
+- Stop reasons are parsed into the `StopReason` union (e.g., `StopReasons.EndTurn`); unknown values surface as `{ other: "<raw>" }`.
+- Usage backfills `totalTokens` when providers omit it, ensuring consistent accounting.
+### Telemetry & metrics hooks
+Provide lightweight callbacks to observe latency and usage without extra deps:
+```ts
+const calls: string[] = [];
+const mr = new ModelRelay({
+  key: "mr_sk_...",
+  metrics: {
+    httpRequest: (m) => calls.push(`http ${m.context.path} ${m.status} ${m.latencyMs}ms`),
+    streamFirstToken: (m) => calls.push(`first-token ${m.latencyMs}ms`),
+    usage: (m) => calls.push(`usage ${m.usage.totalTokens}`)
+  },
+  trace: {
+    streamEvent: ({ event }) => calls.push(`event ${event.type}`),
+    requestFinish: ({ status, latencyMs }) => calls.push(`finished ${status} in ${latencyMs}`)
+  }
+});
+// Per-call overrides
+await mr.chat.completions.create(
+  { model: "echo-1", messages: [{ role: "user", content: "hi" }] },
+  { metrics: { usage: console.log }, trace: { streamEvent: console.debug } }
+);
+```
+### Error categories
+- **ConfigError**: missing key/token, invalid base URL, or request validation issues.
+- **TransportError**: network/connect/request/timeout failures (`kind` is one of `connect | timeout | request | other`), includes retry metadata when retries were attempted.
+- **APIError**: Non-2xx responses with `status`, `code`, `fields`, `requestId`, and optional `retries` metadata.
 ## API surface
-- `auth.frontendToken()` — exchange publishable keys for short-lived frontend tokens (cached until expiry).
 - `chat.completions.create(params, options?)`
   - Supports streaming (default) or blocking JSON (`stream: false`).
   - Accepts per-call `requestId`, `headers`, `metadata`, `timeoutMs`, and `retry` overrides.
-- `billing.checkout()` — start an end-user Stripe Checkout session.
 - `apiKeys.list() | create() | delete(id)` — manage API keys when using secret keys or bearer tokens.
+- `customers` — manage customers with a secret key (see below).
+## Backend Customer Management
+Use a secret key (`mr_sk_*`) to manage customers from your backend:
+```ts
+import { ModelRelay } from "@modelrelay/sdk";
+const mr = new ModelRelay({ key: "mr_sk_..." });
+// Create or update a customer (upsert by external_id)
+const customer = await mr.customers.upsert({
+  tier_id: "your-tier-uuid",
+  external_id: "github-user-12345",  // your app's user ID
+  email: "user@example.com",
+});
+// List all customers
+const customers = await mr.customers.list();
+// Get a specific customer
+const customer = await mr.customers.get("customer-uuid");
+// Create a checkout session for subscription billing
+const session = await mr.customers.createCheckoutSession("customer-uuid", {
+  success_url: "https://myapp.com/billing/success",
+  cancel_url: "https://myapp.com/billing/cancel",
+});
+// Redirect user to session.url to complete payment
+// Check subscription status
+const status = await mr.customers.getSubscription("customer-uuid");
+if (status.active) {
+  // Grant access
+}
+// Delete a customer
+await mr.customers.delete("customer-uuid");
+```
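
Note on the `retry` option added in the README diff above: the new text says `{ maxAttempts, baseBackoffMs, maxBackoffMs }` controls "exponential backoff with jitter" but does not show how a delay is derived from those three numbers. The following is a minimal standalone sketch of one common scheme (exponential growth, capped, with "full jitter"). It does not import or reproduce the SDK's implementation; `RetryConfig`, `backoffDelayMs`, and `withRetry` are hypothetical names chosen for illustration, and the exact jitter strategy used by `@modelrelay/sdk` is an assumption.

```ts
// Hypothetical shape mirroring the option names in the README; not the SDK's own type.
interface RetryConfig {
  maxAttempts: number;   // total attempts, including the first one
  baseBackoffMs: number; // upper bound of the delay after the first failure
  maxBackoffMs: number;  // ceiling applied to the exponential growth
}

// Delay to wait after failed attempt number `attempt` (1-based).
// The bound doubles per attempt, is capped at maxBackoffMs, and the actual
// delay is drawn uniformly from [0, bound) ("full jitter", assumed here).
function backoffDelayMs(
  attempt: number,
  cfg: RetryConfig,
  random: () => number = Math.random
): number {
  const exponential = cfg.baseBackoffMs * 2 ** (attempt - 1);
  const bound = Math.min(exponential, cfg.maxBackoffMs);
  return Math.floor(random() * bound);
}

// Runs `fn` up to cfg.maxAttempts times, sleeping between failures.
// Rethrows the last error if every attempt fails.
async function withRetry<T>(
  fn: () => Promise<T>,
  cfg: RetryConfig,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms))
): Promise<T> {
  let lastError: unknown;
  for (let attempt = 1; attempt <= cfg.maxAttempts; attempt++) {
    try {
      return await fn();
    } catch (err) {
      lastError = err;
      if (attempt === cfg.maxAttempts) break; // no sleep after the final attempt
      await sleep(backoffDelayMs(attempt, cfg));
    }
  }
  throw lastError;
}
```

With the values from the README example (`baseBackoffMs: 200`, `maxBackoffMs: 2_000`), the delay bounds under this scheme would be 200, 400, 800, 1600 ms and then stay at 2000 ms. The randomisation spreads retries from many clients so they do not hit the server in lockstep.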