npm - @mantyx/sdk - Versions diffs - 0.9.1 → 0.10.0 - Mend

@mantyx/sdk 0.9.1 → 0.10.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (21) hide show

package/CHANGELOG.md +8 -1
package/README.md +75 -2
package/dist/a2a-server.cjs.map +1 -1
package/dist/a2a-server.d.cts +1 -1
package/dist/a2a-server.d.ts +1 -1
package/dist/a2a-server.js +1 -1
package/dist/{chunk-AE7ZSLBH.js → chunk-XMUCELMH.js} +126 -24
package/dist/chunk-XMUCELMH.js.map +1 -0
package/dist/{client-BB6cjfsz.d.cts → client-DHwh8MPj.d.cts} +440 -3
package/dist/{client-BB6cjfsz.d.ts → client-DHwh8MPj.d.ts} +440 -3
package/dist/index.cjs +416 -24
package/dist/index.cjs.map +1 -1
package/dist/index.d.cts +3 -93
package/dist/index.d.ts +3 -93
package/dist/index.js +287 -2
package/dist/index.js.map +1 -1
package/docs/agent-runs-protocol.md +123 -113
package/docs/oauth.md +356 -0
package/docs/wire-protocol.md +1102 -0
package/package.json +1 -1
package/dist/chunk-AE7ZSLBH.js.map +0 -1

package/docs/agent-runs-protocol.md CHANGED Viewed

@@ -66,17 +66,124 @@ All SDK-facing endpoints sit under
 /api/v1/workspaces/{workspaceSlug}/...
 ```
-and are authenticated with a workspace API key with usage `developer_api`:
+and accept **either** of two bearer credentials interchangeably. The same
+header carries either, so SDKs only need one code path:
 ```
-Authorization: Bearer <api-key>
+Authorization: Bearer <credential>
 # or, equivalently:
-X-API-Key: <api-key>
+X-API-Key: <credential>
 ```
-The workspace slug in the URL must match the key's tenant. Mismatches return
-`404 not_found`. Missing/invalid keys return `401 unauthorized`. Rate limits
-follow the workspace's existing developer-API sliding-window policy.
+| Credential                | Token format    | Identifies               | Bound to                | Use when |
+| ------------------------- | --------------- | ------------------------ | ----------------------- | -------- |
+| **Workspace API key**     | `mantyx_…`      | The workspace            | One workspace, no end-user | Personal scripts, internal automations, anything the SDK caller owns end-to-end. |
+| **OAuth 2.0 access token**| `mantyx_at_…`   | An end user **and** the workspace they consented for | One workspace, one user (or one app for `client_credentials`) | "Sign in with MANTYX" apps, third-party integrations, anywhere consent + scopes matter. |
+The server resolves whichever it sees by token-prefix sniffing (see
+`packages/api/src/services/bearer-credential.ts`) — SDKs do **not** need
+separate code paths or env variables for the two flavours.
+The workspace slug in the URL must match the credential's tenant.
+Mismatches return `404 not_found` with a `hint` field pointing at the
+correct slug. Missing/invalid credentials return `401 unauthorized`.
+Rate limits follow the workspace's existing developer-API sliding-window
+policy and are tracked per-credential.
+### 2.1 Workspace API keys (machine credentials)
+A workspace admin issues an API key under **Settings → API keys** with
+**Usage = Developer API**. The key inherits two optional restrictions:
+- **Agent allowlist** (`ApiKey.agentIds`) — empty list = "every
+  non-system agent in the workspace"; otherwise only the listed agents
+  are visible to `spec.agentId` and ephemeral runs created from the key.
+- **Plan gate** — the workspace tier must include the `apiKeys` feature.
+API keys carry no granular scopes; possession of a Developer-API key is
+enough to call every route in this document.
+### 2.2 OAuth 2.0 access tokens
+OAuth tokens are a drop-in alternative for the same set of routes, with
+two differences:
+1. **Scopes are required.** Each route checks the token carries the
+   right scope via `requireScope(...)` and returns
+   `403 { "error": "insufficient_scope", "required": "runs:write" }`
+   (the value is a string for single-scope routes, an array for
+   multi-scope ones — see §2.3). The SDK is expected to surface this
+   verbatim. The agent-runs surface uses these scopes:
+   | Endpoint                                                     | Required scope |
+   | ------------------------------------------------------------ | -------------- |
+   | `GET    .../models`                                          | `models:read` |
+   | `POST   .../agent-runs`                                      | `runs:write` |
+   | `GET    .../agent-runs/{runId}`                              | `runs:read` |
+   | `GET    .../agent-runs/{runId}/stream`                       | `runs:read` |
+   | `POST   .../agent-runs/{runId}/cancel`                       | `runs:write` |
+   | `POST   .../agent-runs/{runId}/tool-results`                 | `runs:write` |
+   | `POST   .../agent-sessions`                                  | `sessions:write` |
+   | `GET    .../agent-sessions/{sessionId}`                      | `sessions:read` |
+   | `DELETE .../agent-sessions/{sessionId}`                      | `sessions:write` |
+   | `POST   .../agent-sessions/{sessionId}/messages`             | `sessions:write` |
+   | `GET    /api/oauth/userinfo`                                 | `mantyx.identity:read` |
+   For an SDK that exposes one-shot runs and sessions end-to-end, request
+   at minimum `models:read runs:read runs:write sessions:read sessions:write`,
+   and add `mantyx.identity:read` if the SDK calls
+   `/api/oauth/userinfo` to discover the workspace slug after sign-in.
+2. **Tokens are workspace-scoped.** An access token is minted for one
+   workspace (chosen by the user at consent time for public apps, or the
+   registering workspace for private apps). Calling
+   `/api/v1/workspaces/{otherSlug}/...` with such a token returns
+   `404 not_found` plus a `hint` with the correct slug.
+OAuth tokens **also** honor the per-token agent allow-list
+(`OAuthAccessToken.agentIds`) the user picked at consent time — see
+[`docs/oauth.md`](./oauth.md) for the full registration / authorization-code
++ PKCE flow. PKCE (`S256`) is mandatory and every MANTYX OAuth app is a
+confidential client, so the token endpoint requires both `client_secret`
+and `code_verifier`.
+**Token lifetimes.** Access tokens live **1 hour** (`expires_in: 3600`).
+Refresh tokens are **persistent and non-rotating**: they have no
+time-based expiry and `grant_type=refresh_token` returns the **same**
+refresh token the SDK already holds while minting a brand-new short-lived
+access token. Multiple processes may refresh concurrently using the same
+refresh token without invalidating each other. Refresh tokens stop
+working only when the application access is revoked (`/oauth/revoke`,
+`DELETE /api/oauth/grants/:id`, or app deletion).
+> **SDK guidance.** Persist the refresh token at first sign-in, treat it
+> as long-lived, and keep refreshing the access token off it on demand
+> (e.g. ~5 minutes before `expires_in` runs out, or lazily on the first
+> `401`). Do **not** rotate or replace the refresh token after each
+> refresh — the value is stable.
+A single SDK call site looks identical regardless of credential:
+```http
+POST /api/v1/workspaces/acme/agent-runs HTTP/1.1
+Authorization: Bearer mantyx_at_…   # OAuth access token
+# — or —
+Authorization: Bearer mantyx_…      # workspace API key
+Content-Type: application/json
+{ "modelId": "openai:gpt-5.5", "prompt": "...", "tools": [...] }
+```
+### 2.3 Error model for credentials
+| Status | Body shape                                                                            | When |
+| ------ | ------------------------------------------------------------------------------------- | ---- |
+| `401`  | `{ "error": "Unauthorized", "message": "API key or OAuth access token required..." }` | No `Authorization` / `X-API-Key` header. |
+| `401`  | `{ "error": "Invalid API key or OAuth access token" }`                                | Token doesn't match a row, expired, or revoked. |
+| `403`  | `{ "error": "This API key is not for the Developer API", "hint": "..." }`             | API key has wrong `usage`. |
+| `403`  | `{ "error": "Workspace API keys are not available on this plan.", "code": "api_keys_plan" }` <br> `{ "error": "OAuth applications are not available on this plan.", "code": "oauth_apps_plan" }` | Workspace tier lacks the `apiKeys` / `oauthApps` feature. |
+| `403`  | `{ "error": "insufficient_scope", "required": "runs:write" }` (or an array if a route needs multiple) | OAuth token is missing a scope a route demands. The response also sets `WWW-Authenticate: Bearer error="insufficient_scope", scope="..."`. |
+| `404`  | `{ "error": "Workspace path does not match this credential", "hint": "..." }`         | URL slug ≠ token's workspace. |
 ## 3. Models
@@ -843,21 +950,8 @@ data: <utf-8 JSON>
 // Gemini `includeThoughts`, OpenAI `reasoning_content` on reasoning models).
 { "seq": 2, "type": "thinking_delta", "data": { "text": "First, I should…" } }
-// completed assistant message (text + optional tool calls about to execute).
-// `turn` is the 0-based tool-turn index this message closes.
-// `finishReason` is the canonical lowercase stop reason normalized across
-// providers (`"end_turn"`, `"tool_use"`, `"max_tokens"`, `"refusal"`,
-// `"malformed_function_call"`, …); `null` / omitted when the provider did
-// not report one. `toolCalls` is omitted when the model called no tools.
-{ "seq": 3, "type": "assistant_message",
-  "data": {
-    "text": "...",
-    "turn": 0,
-    "finishReason": "tool_use",
-    "toolCalls": [
-      { "id": "call_abc", "name": "search", "input": { /* JSON-Schema-matching args */ } }
-    ]
-  } }
+// completed assistant message (text + any tool calls about to execute)
+{ "seq": 3, "type": "assistant_message", "data": { "text": "...", "toolCalls": [...] } }
 // server-side tool call/result (informational; SDK does not act on these)
 { "seq": 4, "type": "tool_call",   "data": { "toolUseId": "...", "name": "...", "input": {...} } }
@@ -884,70 +978,18 @@ data: <utf-8 JSON>
 // is observability so SDK clients can render "memory budget exhausted" status notes.
 { "seq": 7, "type": "tool_budget_exceeded", "data": { "tool": "recall", "maxCalls": 4, "callIndex": 5 } }
-// terminal event — exactly one of `result`, `error`, or `cancelled` lands per run.
+// terminal event
 { "seq": 8, "type": "result",    "data": { "subtype": "success", "text": "Final reply" } }
 { "seq": 8, "type": "result",    "data": { "subtype": "error_local_tool_timeout", "error": "..." } }
-{ "seq": 8, "type": "error",     "data": {
-    "error":        "Model output was truncated (stop_reason=max_tokens). …",
-    "code":         "truncation",
-    "errorClass":   "truncation",
-    "finishReason": "max_tokens",
-    "partialText":  "{\n  \"answer\":… (truncated JSON) …",
-    "retryable":    false
-} }
 { "seq": 8, "type": "cancelled", "data": {} }
 ```
-A run terminates with exactly one of `result`, `error`, or `cancelled`. The
-connection is closed by the server immediately after sending the terminal
-event. Clients should not assume any particular ordering between the
-human-readable `event:` field and the parsed `type` inside `data` — they
-are always equal, but implementations should rely on `data.type` because
-some HTTP middleware strips the `event:` line.
-**`error` event payload fields.** The runner enriches the `error` event
-with structured triage attributes when the failure carried a salvage
-path (typically truncation, upstream deadline, or max-budget-with-text):
-| Field          | Type     | Required | Notes |
-| -------------- | -------- | -------- | ----- |
-| `error`        | string   | yes      | Human-readable message (also persisted on the run row's `error` column). |
-| `code`         | string   | yes      | Legacy alias for `errorClass`. Equals `errorClass` when present; otherwise a small lowercase token (`"error"`, `"invalid_spec"`, `"worker_error"`, …) the SDK can switch on. |
-| `errorClass`   | string   | no       | Canonical category. One of `"rate_limit"`, `"overloaded"`, `"server"`, `"context_window"` (input too big), `"truncation"` (output budget exhausted), `"invalid_request"`, `"auth"`, `"timeout"`, `"local_timeout"`, `"upstream_deadline"`, `"unknown"`. New categories may land additively. |
-| `finishReason` | string \| null | no | Canonical lowercase stop reason normalized across providers (`"max_tokens"`, `"refusal"`, `"malformed_function_call"`, …). When present, mirrors the value on the last `assistant_message`. |
-| `partialText`  | string   | no       | **Best-effort raw bytes** the model emitted before the failure. For `outputSchema` runs this is likely **incomplete JSON** that will fail `JSON.parse` — see §4.5 / `docs/wire-protocol.md` §7. Also persisted on the run row's `finalText` column so the Calls UI can render it alongside a truncation banner. |
-| `retryable`    | boolean  | no       | Coarse retry hint inherited from the pipeline's error classifier. Informational; the SDK still owns the actual retry decision. |
-**Truncation contract.** When the model is mid-output and Gemini /
-Anthropic / OpenAI hit the output budget, MANTYX does **not** discard
-the bytes that already streamed. Instead:
-1. The last `assistant_message` for the turn carries the partial text
-   plus `finishReason: "max_tokens"`.
-2. The terminal SSE event is an `error` (not `result`) with
-   `errorClass: "truncation"` and `data.partialText` set to the same
-   bytes.
-3. The run row exposed by `GET /agent-runs/:runId` has
-   `{ status: "failed", finalText: "<partial text>",
-   error: "Model output was truncated …", failureReason: { errorClass:
-   "truncation", finishReason: "max_tokens" } }`.
-`partialText` is a **best-effort raw byte sequence** — for `outputSchema`
-runs it will almost always fail `JSON.parse` because the JSON object was
-not closed. SDKs should treat it as diagnostic data, never as a
-schema-conformant reply. Surfacing it (as a "truncated reply — JSON
-likely incomplete" status note) is the recommended pattern; silently
-falling back to it as the answer is not.
-**Run snapshot fields.** `GET /agent-runs/:runId` returns the run row
-with these triage-relevant columns:
-| Field           | Notes |
-| --------------- | ----- |
-| `status`        | `"queued" \| "running" \| "succeeded" \| "failed" \| "cancelled"`. |
-| `finalText`     | Final assistant text on success; same string as terminal `data.partialText` when `failureReason.errorClass === "truncation"`. Otherwise `null`. |
-| `error`         | Human-readable error message (matches terminal `error.data.error`). `null` on success / cancellation. |
-| `failureReason` | JSON object `{ errorClass, finishReason }` on `status === "failed"` runs that carried a salvage payload. Future-proof for additional triage fields. `null` otherwise. |
+A run terminates with exactly one of `result` or `cancelled`. The connection
+is closed by the server immediately after sending the terminal event. Clients
+should not assume any particular ordering between the human-readable `event:`
+field and the parsed `type` inside `data` — they are always equal, but
+implementations should rely on `data.type` because some HTTP middleware
+strips the `event:` line.
 ## 8. Local tool result
@@ -1003,32 +1045,6 @@ Common codes:
 | `run_terminal`         | 409  | Tool-result after run finished |
 | `rate_limited`         | 429  | Per-API-key sliding window |
-**Run-level error categories.** When a run terminates via the SSE `error`
-event (§7), the payload carries an `errorClass` triage category in
-addition to the human-readable `error` message. SDKs typically expose
-this as a typed field on their run-error type (TS `MantyxRunError.errorClass`,
-Python `MantyxRunError.error_class`, Go `RunError.ErrorClass`). The
-canonical set:
-| `errorClass`        | Typical cause | Has `partialText`? |
-| ------------------- | ------------- | ------------------ |
-| `rate_limit`        | Provider rate-limited the request (HTTP 429-equivalent). | No |
-| `overloaded`        | Provider returned a transient "overloaded" / 5xx. | No |
-| `server`            | Generic upstream provider error. | No |
-| `context_window`    | Input exceeded the model's context window. | No |
-| `truncation`        | Output budget exhausted mid-reply (`finishReason: "max_tokens"`). | **Yes** |
-| `invalid_request`   | Provider rejected the spec / params. | No |
-| `auth`              | BYOK credentials invalid for this run. | No |
-| `timeout`           | Generic upstream timeout (provider-side). | No |
-| `local_timeout`     | SDK didn't POST a `tool-result` within `localToolTimeoutMs`. | No |
-| `upstream_deadline` | MANTYX worker deadline exceeded waiting on the provider. | Sometimes |
-| `unknown`           | Anything else — fallback so SDKs always have a category. | No |
-The category set is **additive over the wire**: new categories may
-appear without bumping the protocol version, so SDKs should default to
-`unknown` (or simply pass the raw string through to callers) for
-unrecognized values rather than crashing.
 ## 11. Suggested client architecture
 A reference SDK should:
@@ -1084,13 +1100,7 @@ A reference SDK should:
      the event to the caller (status banner, log line, telemetry). Do
      **not** abort the run on these events; the run continues through
      `result` / `error` / `cancelled` as usual.
-   - On terminal `result` with `subtype === "success"`, resolve the call
-     with the final `text`. On a terminal `error` event, raise a typed
-     run-error that carries the new triage attributes (`errorClass`,
-     `finishReason`, `partialText`, `retryable`) so callers can render
-     "truncated reply — JSON likely incomplete" banners and short-circuit
-     retry policies. Treat `partialText` as **diagnostic** data — never
-     auto-fall-back to it as the final answer.
+   - On terminal `result`, resolve the call. On `error` subtype, throw.
 4. Re-emit assistant deltas/events as a stream/iterator for callers who care
    about live output.
 5. Treat the protocol as the contract. Implementation details such as Valkey

package/docs/oauth.md ADDED Viewed

@@ -0,0 +1,356 @@
+# OAuth 2.0 in MANTYX
+MANTYX exposes an OAuth 2.0 authorization server at **`/api/oauth/...`** that
+issues access tokens accepted on every existing API surface
+(`/api/v1`, `/api/a2a`, `/mcp`) plus a small identity endpoint
+(`/api/oauth/userinfo`).
+OAuth tokens are a **drop-in alternative to workspace API keys** — same
+HTTP contract, same `Authorization: Bearer …` header, same per-agent
+allowlist semantics. The only thing that changes is that the token
+carries **scopes**: per-route permissions you grant at consent time
+instead of the coarse `ApiKeyUsage` (`mcp` | `developer_api` | `a2a`)
+on classic workspace API keys.
+> See `architecture.md` for the full request pipeline and where the
+> bearer resolver sits in it.
+## When to use OAuth vs. an API key
+* **Personal scripts and internal tools you control end-to-end** — keep
+  using a workspace API key. It's one click to issue, one header to set.
+* **Apps that other people sign in to** — register an OAuth application
+  and run the Authorization Code + PKCE flow. End users approve specific
+  scopes for a specific workspace. Two visibility modes:
+  * **Private** — locked to the workspace that registered the app. Only
+    members of that workspace can authorize. Optionally enable
+    `client_credentials` for unattended machine-to-machine traffic.
+  * **Public** — any user can authorize the app and pick the workspace
+    they want to grant access to on the consent screen.
+## High-level flow
+```mermaid
+sequenceDiagram
+    participant App as Your app
+    participant User as End user
+    participant Web as MANTYX SPA
+    participant API as MANTYX API
+    App->>User: Open /oauth/authorize?...
+    User->>Web: Sign in (if needed) and review scopes
+    Web->>API: POST /api/oauth/authorize/decide (approve)
+    API->>App: 302 redirect_uri?code=…&state=…
+    App->>API: POST /api/oauth/token (code + verifier)
+    API->>App: { access_token, refresh_token, scope, expires_in }
+    App->>API: GET /api/v1/workspaces/{slug}/agents (Bearer token)
+    API->>App: 200 OK (scope check passes)
+```
+## Registering an application
+Open **Developer → OAuth apps** (workspace admins only). Both private and
+public apps are registered from this page; the Visibility radio decides
+which flow you get.
+Provide:
+* **Name** and **description** (shown on the consent screen).
+* **Logo URL** (optional).
+* **Visibility** — **Private** locks tokens to this workspace; **Public**
+  lets any signed-in user pick a workspace at consent time.
+* **Redirect URIs** — at least one. Allowed schemes:
+  * `https://…`
+  * `http://localhost`, `http://127.0.0.1`, or `http://[::1]` (any port,
+    any path)
+  * Custom schemes for native apps, e.g. `myapp://callback`.
+* **Allowed scopes** — only scopes you check here can be requested by
+  the application at consent time.
+* **Client secret** — every MANTYX OAuth app is a **confidential
+  client**. The `client_secret` is returned **once** on creation; the
+  `/token`, `/revoke`, and `/introspect` endpoints all require the
+  matching value. We do not support PKCE-only public-client
+  registrations — visibility (private vs. public) only controls *who*
+  can authorize the app, not whether the app keeps a secret. PKCE is
+  still mandatory on top of the secret for defense in depth (see
+  below).
+* **Allow `client_credentials` grant** *(private apps only)* — for
+  unattended machine-to-machine use; not available on public apps
+  because the token has to be bound to a single workspace at mint time.
+The `client_id` is `mantyx_oa_<id>`. Confidential client secrets are
+`mantyx_oas_<secret>`.
+OAuth applications require the **`oauthApps`** feature on the registering
+workspace's tier (mirrors the existing `apiKeys` plan check). For public
+apps the same gate is also applied to the workspace each end user picks
+at consent time, so a free workspace can't host paid features through
+a public app authorized for it.
+## Authorization Code + PKCE (browser, native, server-side)
+Every grant carries **two** client-binding factors:
+* `client_secret` — proves the registered client made the call (every
+  MANTYX OAuth app is confidential, so this is always required).
+* PKCE `code_verifier` — proves the same browser session that started
+  `/authorize` is finishing the exchange. We accept only `S256` and
+  reject any token request without a verifier.
+1. Generate a high-entropy `code_verifier` (43–128 chars, RFC 7636).
+2. Compute `code_challenge = base64url(sha256(code_verifier))` (no
+   padding).
+3. Send the user to:
+   ```text
+   GET /api/oauth/authorize
+       ?client_id=mantyx_oa_…
+       &redirect_uri=<exact registered URI>
+       &response_type=code
+       &scope=mantyx.identity:read+agents:read+runs:write
+       &state=<random per-session token>
+       &code_challenge=<S256 challenge>
+       &code_challenge_method=S256
+   ```
+   The MANTYX SPA at `/oauth/authorize` reads the same query, asks the
+   user to log in if needed, lets them pick the workspace (third-party
+   apps), pick the agent allow-list (when any of `agents:invoke`,
+   `runs:write`, `a2a:invoke`, `mcp:connect` are requested), and then
+   approves or denies.
+4. On approve we redirect to:
+   ```text
+   <redirect_uri>?code=<auth code>&state=<your state>
+   ```
+   On deny:
+   ```text
+   <redirect_uri>?error=access_denied&state=<your state>
+   ```
+5. Exchange the code:
+   ```http
+   POST /api/oauth/token
+   Content-Type: application/x-www-form-urlencoded
+   grant_type=authorization_code
+   &code=…
+   &redirect_uri=<exact same URI>
+   &client_id=mantyx_oa_…
+   &client_secret=mantyx_oas_…
+   &code_verifier=<original verifier>
+   ```
+   Response:
+   ```json
+   {
+     "access_token": "mantyx_at_…",
+     "token_type": "Bearer",
+     "expires_in": 3600,
+     "refresh_token": "mantyx_rt_…",
+     "scope": "mantyx.identity:read agents:read runs:write"
+   }
+   ```
+6. Use the access token like any other workspace bearer:
+   ```http
+   GET /api/v1/workspaces/<slug>/agents
+   Authorization: Bearer mantyx_at_…
+   ```
+   `<slug>` must be the workspace the consent was for. OAuth tokens
+   issued for workspace A return **403** on `/api/v1/workspaces/B/...`.
+7. **Token lifetimes.**
+   * Access tokens live **1 hour** (`expires_in: 3600`).
+   * Refresh tokens are **persistent and non-rotating**: they never
+     time-expire. They stop working only when the application access
+     is explicitly revoked via `/oauth/revoke` (with the refresh
+     token), `DELETE /api/oauth/grants/:id`, or deletion of the
+     OAuth application itself.
+   * Calling `grant_type=refresh_token` mints a brand-new short-lived
+     access token and **echoes back the same refresh token** the
+     client already holds. The previous access tokens are **not**
+     revoked — multiple backend workers can mint live access tokens
+     off a shared refresh without invalidating each other's chains.
+   ```http
+   POST /api/oauth/token
+   grant_type=refresh_token
+   &refresh_token=mantyx_rt_…
+   &client_id=mantyx_oa_…
+   &client_secret=mantyx_oas_…
+   &scope=runs:write              # optional narrowing (must be a subset)
+   ```
+   ```json
+   {
+     "access_token": "mantyx_at_…",
+     "token_type": "Bearer",
+     "expires_in": 3600,
+     "refresh_token": "<same value the client just sent>",
+     "scope": "runs:write"
+   }
+   ```
+   Clients should persist the refresh token once at first sign-in
+   (treat it as long-lived) and only refresh the access token from
+   it as needed.
+8. **Revoke (RFC 7009).**
+   ```http
+   POST /api/oauth/revoke
+   token=<access or refresh token>
+   &client_id=mantyx_oa_…
+   &client_secret=mantyx_oas_…
+   ```
+   Always returns `200`, even when the token is unknown — by design.
+   * Revoking an **access token** kills only that single access
+     token. Other access tokens minted from the same refresh keep
+     working until they expire (or until the refresh is revoked).
+   * Revoking a **refresh token** kills the refresh and *every* live
+     access token tied to its grant in one shot.
+## Client credentials (private workspace apps, machine-to-machine)
+Private workspace applications with `allowsClientCredentials: true` can
+request a token without a user:
+```http
+POST /api/oauth/token
+grant_type=client_credentials
+&client_id=mantyx_oa_…
+&client_secret=mantyx_oas_…
+&scope=runs:write+agents:invoke   # optional, must be a subset of allowedScopes
+```
+The token's `tenantId` is the application's owning workspace and its
+agent allow-list defaults to every non-system agent in that workspace
+(no end-user consent screen). Use this for cron jobs, internal services
+and partner integrations where there is no end user.
+## "Sign in with MANTYX"
+There is **no OIDC** today. The access token is enough:
+1. Run the auth-code + PKCE flow with `scope=mantyx.identity:read`.
+2. Call:
+   ```http
+   GET /api/oauth/userinfo
+   Authorization: Bearer mantyx_at_…
+   ```
+   Response:
+   ```json
+   {
+     "sub": "<user id>",
+     "email": "user@example.com",
+     "workspace": { "id": "…", "slug": "…", "name": "…" }
+   }
+   ```
+`/api/auth/me` continues to accept the user's web JWT as before; both
+paths can be used to bootstrap session info for "Sign in with MANTYX"
+clients.
+## Scope catalog
+Defined in `packages/api/src/oauth/scopes.ts` and mirrored by the SPA
+in `packages/web/src/lib/oauthScopes.ts`. The catalog is also expressed
+in the OpenAPI spec at `packages/api/openapi/developer-v1.yaml`.
+| Scope | Purpose |
+| --- | --- |
+| `mantyx.identity:read` | `/api/oauth/userinfo` and `/api/auth/me`. |
+| `agents:read` | `GET /api/v1/.../agents`. |
+| `agents:write` | Reserved for future agent CRUD on the Developer API. |
+| `agents:invoke` | Run an agent — required by ephemeral runs, agent sessions, A2A invoke. |
+| `sessions:read` / `sessions:write` | Ephemeral SDK agent sessions. |
+| `runs:read` / `runs:write` | Read run snapshots and SSE streams; start, cancel, submit tool-results. |
+| `models:read` | `GET /api/v1/.../models`. |
+| `tools:read` / `tools:write` | List/manage workspace tools. |
+| `schedules:read` / `schedules:write` | List/manage cron schedules and trigger them manually. |
+| `inbounds:read` / `inbounds:write` | List/manage inbound webhooks/email configs. |
+| `plugins:read` | List installed plugins for the workspace. |
+| `hive:read` / `hive:write` | Workspace Hive objects. |
+| `a2a:discovery` | `GET /api/a2a/{slug}/discovery`. |
+| `a2a:invoke` | Send Agent2Agent JSON-RPC requests (also requires `agents:invoke`). |
+| `mcp:connect` | Open MCP Streamable HTTP sessions. |
+`runs:write`, `agents:invoke`, `a2a:invoke`, and `mcp:connect` participate
+in the **agent allow-list** that the consent screen surfaces. An empty
+list expands to "every non-system agent in the workspace" at request
+time — same semantics as today's `WorkspaceApiKey.agentIds`.
+## Redirect URI rules
+* Exact-match comparison (case-sensitive scheme/host, fragment-stripped).
+  Trailing slashes are significant.
+* Loopback HTTP is allowed without TLS for localhost development.
+* Custom schemes are allowed for native apps; pick a scheme you control
+  (e.g. `com.example.myapp://callback`).
+* `redirect_uri` on `/api/oauth/token` must equal the value used at
+  `/api/oauth/authorize`.
+## Error model
+| Where | Body |
+| --- | --- |
+| `/authorize` query validation | `{ "error": "Invalid authorize request", "details": {...} }` |
+| Unknown / unauthorized client | `401 { "error": "invalid_client" }` |
+| PKCE failure, expired/used code, redirect mismatch | `400 { "error": "invalid_grant" }` |
+| Insufficient scope on a Developer API call | `403 { "error": "insufficient_scope", "required": ["..."] }` |
+| Wrong workspace in URL | `403 { "error": "wrong_workspace", "correctSlug": "..." }` |
+| Plan does not include OAuth apps | `403 { "error": "...", "code": "oauth_apps_plan" }` |
+## Token format
+* Access tokens: `mantyx_at_<32-byte url-safe random>`.
+* Refresh tokens: `mantyx_rt_<32-byte url-safe random>`.
+* Client ids: `mantyx_oa_<id>`.
+* Client secrets: `mantyx_oas_<secret>`.
+Stored as **SHA-256 with HMAC** (rate-friendly), with a 12-character
+prefix index for fast lookups (mirrors today's
+`WorkspaceApiKey.keyPrefix`).
+## Token lifetimes & lifecycle
+| Token | Lifetime | How it ends |
+| --- | --- | --- |
+| **Access token** | 1 hour (`expires_in: 3600`). | Time-expires; or revoked via `/oauth/revoke`, refresh-token revocation, grant deletion, or app deletion. |
+| **Refresh token** | **No time-based expiry** — persistent. | Revoked via `/oauth/revoke` (refresh token), `DELETE /api/oauth/grants/:id` (user "Revoke access" action), or deletion of the OAuth application. |
+| **Authorization code** | 10 minutes, single-use. | Consumed by `/oauth/token` (auth-code grant) or expires. |
+Refresh tokens are **non-rotating**. Calling
+`grant_type=refresh_token` issues a new short-lived access token but
+returns the **same refresh token** the client already holds. Multiple
+backend workers may refresh concurrently using the same shared
+refresh token without invalidating each other's chains.
+This makes refresh tokens the long-lived authorization-of-record:
+clients should persist them once at first sign-in (encrypted at rest)
+and treat the refresh value as the single trust anchor for the grant.
+## See also
+* `packages/api/src/routes/oauth.ts` — authorization server endpoints.
+* `packages/api/src/services/bearer-credential.ts` — unified resolver
+  for API keys and OAuth tokens.
+* `packages/api/src/middleware/oauth-scope.ts` — `requireScope(...)`.
+* `packages/api/openapi/developer-v1.yaml` — `securitySchemes.oauth2`
+  and per-operation scope lists.