npm - @cyanheads/mcp-ts-core - Versions diffs - 0.7.6 → 0.8.1 - Mend

@cyanheads/mcp-ts-core 0.7.6 → 0.8.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (107) hide show

package/CLAUDE.md +22 -7
package/README.md +2 -2
package/changelog/0.8.x/0.8.0.md +33 -0
package/changelog/0.8.x/0.8.1.md +17 -0
package/changelog/template.md +13 -0
package/dist/core/context.d.ts +67 -0
package/dist/core/context.d.ts.map +1 -1
package/dist/core/context.js +46 -1
package/dist/core/context.js.map +1 -1
package/dist/core/index.d.ts +2 -1
package/dist/core/index.d.ts.map +1 -1
package/dist/core/index.js +1 -0
package/dist/core/index.js.map +1 -1
package/dist/linter/rules/error-contract-rules.d.ts +45 -0
package/dist/linter/rules/error-contract-rules.d.ts.map +1 -0
package/dist/linter/rules/error-contract-rules.js +321 -0
package/dist/linter/rules/error-contract-rules.js.map +1 -0
package/dist/linter/rules/handler-body-rules.d.ts +18 -0
package/dist/linter/rules/handler-body-rules.d.ts.map +1 -0
package/dist/linter/rules/handler-body-rules.js +134 -0
package/dist/linter/rules/handler-body-rules.js.map +1 -0
package/dist/linter/rules/index.d.ts +2 -0
package/dist/linter/rules/index.d.ts.map +1 -1
package/dist/linter/rules/index.js +2 -0
package/dist/linter/rules/index.js.map +1 -1
package/dist/linter/rules/resource-rules.d.ts.map +1 -1
package/dist/linter/rules/resource-rules.js +9 -0
package/dist/linter/rules/resource-rules.js.map +1 -1
package/dist/linter/rules/source-text.d.ts +19 -0
package/dist/linter/rules/source-text.d.ts.map +1 -0
package/dist/linter/rules/source-text.js +96 -0
package/dist/linter/rules/source-text.js.map +1 -0
package/dist/linter/rules/tool-rules.d.ts.map +1 -1
package/dist/linter/rules/tool-rules.js +9 -0
package/dist/linter/rules/tool-rules.js.map +1 -1
package/dist/logs/combined.log +4 -4
package/dist/logs/error.log +4 -4
package/dist/mcp-server/apps/appBuilders.d.ts +9 -4
package/dist/mcp-server/apps/appBuilders.d.ts.map +1 -1
package/dist/mcp-server/apps/appBuilders.js +4 -0
package/dist/mcp-server/apps/appBuilders.js.map +1 -1
package/dist/mcp-server/resources/resource-registration.d.ts.map +1 -1
package/dist/mcp-server/resources/resource-registration.js +3 -2
package/dist/mcp-server/resources/resource-registration.js.map +1 -1
package/dist/mcp-server/resources/utils/resourceDefinition.d.ts +13 -5
package/dist/mcp-server/resources/utils/resourceDefinition.d.ts.map +1 -1
package/dist/mcp-server/resources/utils/resourceDefinition.js.map +1 -1
package/dist/mcp-server/resources/utils/resourceHandlerFactory.d.ts.map +1 -1
package/dist/mcp-server/resources/utils/resourceHandlerFactory.js +5 -4
package/dist/mcp-server/resources/utils/resourceHandlerFactory.js.map +1 -1
package/dist/mcp-server/tools/tool-registration.d.ts.map +1 -1
package/dist/mcp-server/tools/tool-registration.js +13 -7
package/dist/mcp-server/tools/tool-registration.js.map +1 -1
package/dist/mcp-server/tools/utils/toolDefinition.d.ts +64 -16
package/dist/mcp-server/tools/utils/toolDefinition.d.ts.map +1 -1
package/dist/mcp-server/tools/utils/toolDefinition.js +25 -11
package/dist/mcp-server/tools/utils/toolDefinition.js.map +1 -1
package/dist/mcp-server/tools/utils/toolHandlerFactory.d.ts.map +1 -1
package/dist/mcp-server/tools/utils/toolHandlerFactory.js +6 -4
package/dist/mcp-server/tools/utils/toolHandlerFactory.js.map +1 -1
package/dist/testing/index.d.ts +8 -0
package/dist/testing/index.d.ts.map +1 -1
package/dist/testing/index.js +5 -1
package/dist/testing/index.js.map +1 -1
package/dist/types-global/errors.d.ts +82 -0
package/dist/types-global/errors.d.ts.map +1 -1
package/dist/types-global/errors.js +25 -0
package/dist/types-global/errors.js.map +1 -1
package/dist/utils/formatting/index.d.ts +1 -0
package/dist/utils/formatting/index.d.ts.map +1 -1
package/dist/utils/formatting/index.js +1 -0
package/dist/utils/formatting/index.js.map +1 -1
package/dist/utils/formatting/partialResult.d.ts +145 -0
package/dist/utils/formatting/partialResult.d.ts.map +1 -0
package/dist/utils/formatting/partialResult.js +145 -0
package/dist/utils/formatting/partialResult.js.map +1 -0
package/dist/utils/index.d.ts +2 -1
package/dist/utils/index.d.ts.map +1 -1
package/dist/utils/index.js +2 -1
package/dist/utils/index.js.map +1 -1
package/dist/utils/network/httpError.d.ts +112 -0
package/dist/utils/network/httpError.d.ts.map +1 -0
package/dist/utils/network/httpError.js +153 -0
package/dist/utils/network/httpError.js.map +1 -0
package/dist/utils/network/retry.d.ts.map +1 -1
package/dist/utils/network/retry.js +0 -1
package/dist/utils/network/retry.js.map +1 -1
package/package.json +5 -4
package/scripts/split-changelog.ts +133 -0
package/skills/add-app-tool/SKILL.md +12 -0
package/skills/add-resource/SKILL.md +40 -0
package/skills/add-service/SKILL.md +54 -1
package/skills/add-test/SKILL.md +39 -0
package/skills/add-tool/SKILL.md +42 -5
package/skills/api-context/SKILL.md +75 -1
package/skills/api-errors/SKILL.md +183 -5
package/skills/api-linter/SKILL.md +223 -3
package/skills/api-testing/SKILL.md +79 -4
package/skills/api-utils/SKILL.md +4 -2
package/skills/design-mcp-server/SKILL.md +13 -10
package/skills/field-test/SKILL.md +81 -15
package/skills/maintenance/SKILL.md +5 -2
package/skills/report-issue-framework/SKILL.md +2 -2
package/skills/security-pass/SKILL.md +6 -5
package/templates/AGENTS.md +23 -8
package/templates/CLAUDE.md +23 -8
package/templates/changelog/template.md +18 -5

package/skills/api-linter/SKILL.md CHANGED Viewed

@@ -48,7 +48,11 @@ Grouped by family. Jump to any rule ID via its anchor.
 | Names | `name-required`, `name-format`, `name-unique` | [Name rules](#name-rules) |
 | Tools | `description-required`, `handler-required`, `auth-type`, `auth-scope-format`, `annotation-type`, `annotation-coherence`, `meta-ui-type`, `meta-ui-resource-uri-required`, `meta-ui-resource-uri-scheme`, `app-tool-resource-pairing` | [Tool rules](#tool-rules) |
 | Resources | `uri-template-required`, `uri-template-valid`, `resource-name-not-uri`, `template-params-align` | [Resource rules](#resource-rules) |
+| Landing | `landing-*` (23 rules — shape, tagline, logo, links, repo, envExample, connectSnippets, theme) | [Landing config rules](#landing-config-rules) |
 | Prompts | `generate-required` | [Prompt rules](#prompt-rules) |
+| Handler body | `prefer-mcp-error-in-handler`, `prefer-error-factory`, `preserve-cause-on-rethrow`, `no-stringify-upstream-error` | [Handler body rules](#handler-body-rules) |
+| Error contract (structural) | `error-contract-type`, `error-contract-empty`, `error-contract-entry-type`, `error-contract-code-type`, `error-contract-code-unknown`, `error-contract-code-unknown-error`, `error-contract-reason-required`, `error-contract-reason-format`, `error-contract-reason-unique`, `error-contract-when-required`, `error-contract-retryable-type` | [Error contract rules](#error-contract-rules) |
+| Error contract (conformance) | `error-contract-conformance`, `error-contract-prefer-fail` | [Error contract rules](#error-contract-rules) |
 | server.json | ~40 rules prefixed `server-json-*` | [server.json rules](#server-json-rules) |
 ---
@@ -196,7 +200,9 @@ Every tool, resource, and prompt definition needs a non-empty `name` string. For
 **Severity:** error
-Names must match `^[a-zA-Z0-9._-]+$` (alphanumerics, dots, hyphens, underscores). Tools conventionally use `snake_case`, resources and prompts use `kebab-case` or `snake_case`.
+**Scope:** tools only — resources and prompts are checked by `name-required` only.
+Tool names must match `^[A-Za-z0-9._-]{1,128}$` (alphanumerics, dots, hyphens, underscores; 1–128 chars). Tools conventionally use `snake_case`.
 **Fix:** rename to a valid identifier. If the legacy name is user-facing, keep `title` as the display string and use a valid `name` internally.
@@ -259,7 +265,7 @@ Every element in `auth` must be a non-empty string. Empty strings in the array a
 **Severity:** warning
-Contradictory annotation combinations. The canonical case: `readOnlyHint: true` with `destructiveHint: true` — a read-only tool cannot be destructive. `idempotentHint: true` alongside `readOnlyHint: true` is fine (explicit redundancy is allowed).
+Catches `readOnlyHint: true` with **any** explicit `destructiveHint` value (even `false`) — the destructive hint is meaningless on a read-only tool, so its presence signals authoring confusion. Drop `destructiveHint` entirely when the tool is read-only.
 ### meta-ui-type
@@ -322,7 +328,7 @@ resource('myscheme://{id}/data', {
 **Severity:** error
-Every variable in the URI template must appear as a key in the `params` schema, and vice versa. `test://{itemId}/data` with `params: z.object({ item_id: ... })` is rejected — casing mismatches count.
+Every variable in the URI template must appear as a key in the `params` schema. `test://{itemId}/data` with `params: z.object({ item_id: ... })` is rejected — casing mismatches count. The check is template → schema only; extra schema keys not referenced by the template are not flagged.
 **Fix:** rename one side so they match exactly. The error message names which variables are on which side.
@@ -391,6 +397,220 @@ Most of these are mechanical — fix the manifest field named in the diagnostic'
 ---
+## Landing config rules
+Validate the `landing` config passed to `createApp()` (the config object that drives the framework's landing page). Run only when `input.landing` is provided to `validateDefinitions`. All errors — landing config that's structurally broken would render incorrectly on the public page.
+| Rule | Severity | Catches |
+|:-----|:---------|:--------|
+| `landing-shape` | error | `landing` is not a plain object |
+| `landing-tagline-type` | error | `tagline` is present but not a string |
+| `landing-tagline-length` | error | `tagline` exceeds the max length |
+| `landing-logo-type` | error | `logo` is present but not a string |
+| `landing-logo-size` | error | `logo` is too long for inline rendering |
+| `landing-links-type` | error | `links` is present but not an array |
+| `landing-links-count` | error | `links` exceeds the max count |
+| `landing-link-shape` | error | A `links[]` entry is not a plain object |
+| `landing-link-href` | error | A link entry's `href` is missing or not a non-empty string |
+| `landing-link-label` | error | A link entry's `label` is missing or not a non-empty string |
+| `landing-repo-root-type` | error | `repoRoot` is present but not a string |
+| `landing-repo-root-shape` | error | `repoRoot` is not a recognized GitHub URL shape |
+| `landing-env-example-type` | error | `envExample` is present but not a plain object |
+| `landing-env-example-count` | error | `envExample` has too many entries |
+| `landing-env-example-key` | error | An `envExample` key is empty or invalid |
+| `landing-env-example-value` | error | An `envExample` value is not a string |
+| `landing-connect-snippets-type` | error | `connectSnippets` is present but not a plain object |
+| `landing-connect-snippets-key` | error | A `connectSnippets` key is empty |
+| `landing-connect-snippets-value` | error | A `connectSnippets` value is not a string |
+| `landing-connect-snippets-empty` | error | A `connectSnippets` value is an empty string |
+| `landing-theme-type` | error | `theme` is present but not a plain object |
+| `landing-theme-accent` | error | `theme.accent` is present but not a string |
+| `landing-theme-accent-format` | error | `theme.accent` doesn't match the expected color format |
+Diagnostic anchors for these rules are the rule ID — e.g. `skills/api-linter/SKILL.md#landing-shape`. Pass `landing` to `validateDefinitions({ landing, tools, resources, prompts })` to opt in.
+---
+## Handler body rules
+Heuristic source-text checks that scan `handler.toString()` for common error-handling anti-patterns. All warnings — false positives are possible because the rules can't see code reached through wrappers, factories assigned to variables, or service-layer throws. Each rule fires at most once per handler to keep reports quiet.
+### prefer-mcp-error-in-handler
+**Severity:** warning
+Fires when a handler contains `throw new Error(...)`. Plain `Error` doesn't carry a JSON-RPC code — the framework's auto-classifier degrades to `InternalError`, hiding the actual failure mode.
+**Fix:** use `McpError` or a factory:
+```ts
+// instead of:
+throw new Error('Item not found');
+// use:
+throw notFound('Item not found', { itemId });
+```
+### prefer-error-factory
+**Severity:** warning
+Fires when a handler builds an error via `new McpError(JsonRpcErrorCode.X, ...)` and a matching factory exists (`notFound`, `rateLimited`, `serviceUnavailable`, …). The factory form is shorter, self-documenting, and consistent with the rest of the codebase.
+**Fix:** swap the constructor for the factory the diagnostic names:
+```ts
+// instead of:
+throw new McpError(JsonRpcErrorCode.NotFound, 'Item missing');
+// use:
+throw notFound('Item missing');
+```
+### preserve-cause-on-rethrow
+**Severity:** warning
+Fires when a `catch (e)` block throws a structured `McpError` (or factory) without passing `{ cause: e }`. Dropping the cause loses the original stack trace — observability platforms and `pino-pretty` rely on it to render error chains.
+**Fix:** thread the cause through the 4th `McpError` argument or factory options:
+```ts
+try {
+  await fetchUpstream();
+} catch (e) {
+  throw serviceUnavailable('Upstream failed', { service: 'pubmed' }, { cause: e });
+}
+```
+### no-stringify-upstream-error
+**Severity:** warning
+Fires when a handler throws an error message containing `JSON.stringify(...)`. Stringifying caught or upstream errors into the message risks leaking internal stack traces, AWS internal ARNs, or third-party trace IDs to clients.
+**Fix:** sanitize first, or attach the raw blob to the error's `data` payload — never the message.
+```ts
+// instead of:
+throw new Error(`Upstream failed: ${JSON.stringify(e)}`);
+// use:
+throw serviceUnavailable('Upstream failed', { upstreamError: e }, { cause: e });
+```
+---
+## Error contract rules
+Validate the optional `errors[]` declarative contract on tool/resource definitions. Structural rules check the shape of contract entries; conformance rules cross-check the handler body against the declared codes.
+When a contract is declared, surfaced under `_meta['mcp-ts-core/errors']` in `tools/list` / `resources/list`, and the handler receives a typed `ctx.fail(reason, …)` keyed by the declared reason union. See `skills/api-errors/SKILL.md` for runtime semantics.
+### error-contract-type
+**Severity:** error
+Fires when `errors` is present but not an array. The contract must be a tuple of `ErrorContract` entries.
+### error-contract-empty
+**Severity:** warning
+Fires when `errors: []` is declared. An empty contract is a no-op — nothing to surface in `tools/list`, no reason union for `ctx.fail`, no conformance to check.
+**Fix:** drop the field, or declare actual failure modes.
+### error-contract-entry-type
+**Severity:** error
+Fires when an entry in `errors[]` isn't an object. Each entry must be `{ code, reason, when }` (and optionally `retryable`).
+### error-contract-code-type
+**Severity:** error
+Fires when an entry's `code` is missing or not a number. Use the `JsonRpcErrorCode` enum:
+```ts
+errors: [{ code: JsonRpcErrorCode.NotFound, reason: 'no_match', when: 'No items matched' }]
+```
+### error-contract-code-unknown
+**Severity:** error
+Fires when an entry's `code` is a number but not a known `JsonRpcErrorCode` value. Likely a typo or stale magic number — import the enum and use a member.
+### error-contract-code-unknown-error
+**Severity:** warning
+Fires when an entry uses `JsonRpcErrorCode.UnknownError` (-32099). That code is the auto-classifier's giveup-fallback; declaring it in a contract conveys nothing useful to clients.
+**Fix:** pick a more specific code (`InternalError`, `ServiceUnavailable`, etc.) or drop the entry.
+### error-contract-reason-required
+**Severity:** error
+Fires when an entry's `reason` is missing or empty. `reason` is the stable machine-readable identifier clients switch on; it must always be present.
+### error-contract-reason-format
+**Severity:** warning
+Fires when `reason` isn't snake_case (matched against `^[a-z][a-z0-9_]*$`). Reasons are part of the public API — treat them like API constants. `'NotFound'`, `'no-match'`, `'1bad'` all warn.
+**Fix:** rename to snake_case (`'no_match'`, `'rate_limited'`, …).
+### error-contract-reason-unique
+**Severity:** error
+Fires when two entries in the same contract share a `reason`. Reasons must be unique within a contract — they're how `ctx.fail(reason, …)` selects the entry.
+### error-contract-when-required
+**Severity:** error
+Fires when an entry's `when` field is missing or empty. `when` is the human-readable explanation surfaced to LLMs and UI clients; without it, the contract is opaque.
+### error-contract-retryable-type
+**Severity:** warning
+Fires when an entry's optional `retryable` field is present but isn't a boolean. Only `true` or `false` is meaningful — drop the field if you can't commit to either.
+### error-contract-conformance
+**Severity:** warning
+Cross-check rule. Fires when a handler throws a non-baseline code (via `JsonRpcErrorCode.X` or a factory like `notFound()`) that isn't declared in `errors[]`.
+Baseline codes (`InternalError`, `ServiceUnavailable`, `Timeout`, `ValidationError`, `SerializationError`) are auto-allowed because they bubble from anywhere — services, framework utilities, the auto-classifier — and are implicitly always-possible on any tool. Only domain-specific codes need declaring.
+**Fix:** add the missing code to `errors[]` with a stable reason, or route through `ctx.fail(reason, …)` if it maps to an existing entry.
+**Heuristic limitations:** the scan reads `handler.toString()` and only catches direct `throw new McpError(JsonRpcErrorCode.X, …)` and `throw factory(…)` patterns. Indirect throws (`const e = notFound(); throw e;`), throws from called services, and throws via runtime helpers like `httpErrorFromResponse(...)` are invisible.
+### error-contract-prefer-fail
+**Severity:** warning
+Fires when a handler throws a code that **is** declared in the contract directly (via factory or `new McpError`) instead of routing through `ctx.fail(reason, …)`. Direct throws bypass the typed helper, leaving observers without a stable `data.reason` and disconnecting the throw site from the contract entry.
+**Fix:** swap the direct throw for `ctx.fail` using the reason the diagnostic suggests:
+```ts
+// instead of:
+throw notFound('No items match');
+// use:
+throw ctx.fail('no_match', 'No items match');
+```
+The diagnostic message includes the declared reason(s) for the code so you can copy-paste.
+---
 ## Escape hatches
 ### Dynamic upstream data

package/skills/api-testing/SKILL.md CHANGED Viewed

@@ -24,6 +24,7 @@ import { createMockContext } from '@cyanheads/mcp-ts-core/testing';
 createMockContext()                                           // minimal — ctx.state operations throw without tenantId
 createMockContext({ tenantId: 'test-tenant' })               // enables ctx.state (tenant-scoped in-memory storage)
+createMockContext({ errors: myTool.errors })                 // attaches typed ctx.fail keyed by the contract reasons
 createMockContext({ sample: vi.fn().mockResolvedValue(...) }) // with MCP sampling
 createMockContext({ elicit: vi.fn().mockResolvedValue(...) }) // with elicitation
 createMockContext({ progress: true })                        // with task progress (ctx.progress populated)
@@ -41,6 +42,7 @@ createMockContext({ uri: new URL('myscheme://item/123') })   // for resource han
 interface MockContextOptions {
   auth?: AuthContext;
   elicit?: (message: string, schema: z.ZodObject<z.ZodRawShape>) => Promise<ElicitResult>;
+  errors?: readonly ErrorContract[];
   notifyResourceListChanged?: () => void;
   notifyResourceUpdated?: (uri: string) => void;
   progress?: boolean;
@@ -57,6 +59,7 @@ interface MockContextOptions {
 | _(none)_ | Minimal context — `ctx.state` operations throw without `tenantId`; `ctx.elicit`/`ctx.sample`/`ctx.progress` are `undefined` |
 | `auth` | Sets `ctx.auth` for scope-checking tests |
 | `elicit` | Assigns a function to `ctx.elicit` for testing elicitation calls |
+| `errors` | Attaches a typed `ctx.fail` against the contract — same wiring the production handler factory uses. Pass `myTool.errors` directly. |
 | `notifyResourceListChanged` | Assigns `ctx.notifyResourceListChanged` for resource notification tests |
 | `notifyResourceUpdated` | Assigns `ctx.notifyResourceUpdated` for resource update notification tests |
 | `progress` | Populates `ctx.progress` with real state-tracking implementation (see below) |
@@ -90,13 +93,13 @@ expect(progress._messages).toContain('step message');
 ### Mock logger
-`ctx.log` captures all log calls for inspection:
+`ctx.log` captures all log calls for inspection. The mock returns the typed `MockContextLogger` from `@cyanheads/mcp-ts-core/testing` — import that instead of hand-casting:
 ```ts
+import { createMockContext, type MockContextLogger } from '@cyanheads/mcp-ts-core/testing';
 const ctx = createMockContext();
-const log = ctx.log as ContextLogger & {
-  calls: Array<{ level: string; msg: string; data?: unknown }>;
-};
+const log = ctx.log as MockContextLogger;
 await myTool.handler(input, ctx);
 expect(log.calls.some(c => c.level === 'info' && c.msg.includes('Processing'))).toBe(true);
@@ -311,3 +314,75 @@ it('throws NotFound for missing resource', async () => {
 ```
 Use `.rejects.toThrow(McpError)` to assert type only. Use `.rejects.toMatchObject({ code: ... })` when the specific error code matters.
+---
+## Testing handlers with `errors[]` (typed contract)
+Tools and resources that declare an `errors[]` contract receive a typed `ctx.fail` helper at runtime. Pass the definition's own `errors` to `createMockContext` and the mock wires `fail` the same way the production handler factory does:
+```ts
+import { createMockContext } from '@cyanheads/mcp-ts-core/testing';
+import { JsonRpcErrorCode } from '@cyanheads/mcp-ts-core/errors';
+import { fetchItems } from '@/mcp-server/tools/definitions/fetch-items.tool.js';
+it('throws ctx.fail("no_match") when no items resolve', async () => {
+  const ctx = createMockContext({ errors: fetchItems.errors });
+  const input = fetchItems.input.parse({ ids: ['missing'] });
+  await expect(fetchItems.handler(input, ctx)).rejects.toMatchObject({
+    code: JsonRpcErrorCode.NotFound,
+    data: { reason: 'no_match' },
+  });
+});
+```
+For lower-level tests that need the raw `fail` helper without a full mock context (e.g. asserting the reason → code mapping), use `createFail` directly — see [Testing the handler-side `fail` plumbing](#testing-the-handler-side-fail-plumbing) below.
+### Why test `data.reason` and not just `code`?
+The contract reason is the stable machine-readable identifier — clients switch on it the same way they would on an HTTP status. A code alone (`NotFound`) doesn't disambiguate between contract entries that share a code (`'no_match'` vs `'withdrawn'` both mapping to `NotFound`). Asserting on `data.reason` locks the test to the specific contract entry.
+### `data.reason` is overridable-proof
+The framework spreads caller-supplied data first and writes `reason` last, so a handler that passes `data: { reason: 'something_else' }` cannot override the contract reason. Tests can rely on `data.reason` always equaling the contract entry's reason — write assertions that depend on it without paranoia.
+### Testing the handler-side `fail` plumbing
+To verify the definition wires `ctx.fail` correctly without exercising the full handler factory, use the `errors` array directly:
+```ts
+import { createFail } from '@cyanheads/mcp-ts-core';
+it('builds an error with the contract code and reason', () => {
+  const fail = createFail(myTool.errors!);
+  const err = fail('no_match', 'not found', { itemId: '123' });
+  expect(err.code).toBe(JsonRpcErrorCode.NotFound);
+  expect(err.data).toEqual({ reason: 'no_match', itemId: '123' });
+});
+```
+---
+## Fuzz testing
+For schema-heavy or input-validation-critical handlers, the framework ships fuzz helpers under `@cyanheads/mcp-ts-core/testing/fuzz`. They generate valid + adversarial inputs from your Zod schemas via `fast-check` and assert handler invariants (no crashes, no prototype pollution, no stack-trace leaks).
+```ts
+import { fuzzTool, fuzzResource, fuzzPrompt } from '@cyanheads/mcp-ts-core/testing/fuzz';
+it('survives fuzz testing', async () => {
+  const report = await fuzzTool(myTool, { numRuns: 100, numAdversarial: 30 });
+  expect(report.crashes).toHaveLength(0);
+  expect(report.leaks).toHaveLength(0);
+  expect(report.prototypePollution).toBe(false);
+});
+```
+| Helper | Purpose |
+|:-------|:--------|
+| `fuzzTool(def, opts)` / `fuzzResource(def, opts)` / `fuzzPrompt(def, opts)` | Drive valid + adversarial inputs through the handler. Returns a `FuzzReport`. |
+| `zodToArbitrary(schema)` | Convert a Zod schema to a `fast-check` `Arbitrary` for custom property-based tests. |
+| `adversarialArbitrary()` / `ADVERSARIAL_STRINGS` | Targeted injection sets (prototype pollution probes, control characters, oversized payloads). |
+`FuzzOptions`: `numRuns` (default 50), `numAdversarial` (default 30), `seed` (reproducibility), `timeout` (per-call ms, default 5000), `ctx` (`MockContextOptions` for stateful handlers).

package/skills/api-utils/SKILL.md CHANGED Viewed

@@ -30,7 +30,9 @@ Utility exports from `@cyanheads/mcp-ts-core/utils`. Utilities with complex APIs
 | Export | API | Notes |
 |:-------|:----|:------|
 | `fetchWithTimeout` | `(url, timeoutMs, context: RequestContext, options?: FetchWithTimeoutOptions) -> Promise<Response>` | Wraps `fetch` with `AbortController` timeout. `FetchWithTimeoutOptions` extends `RequestInit` (minus `signal`) and adds `rejectPrivateIPs?: boolean` and `signal?: AbortSignal` (external cancellation). SSRF guard (best-effort, not hard isolation): blocks RFC 1918, loopback, link-local, CGNAT, cloud metadata. DNS validation on Node; hostname-only on Workers. Manual redirect following (max 5) with per-hop SSRF check. **DNS rebinding / TOCTOU gap** — the validation lookup and `fetch`'s own resolution are independent; pair with egress controls or a DNS-pinning fetch proxy for strong isolation. |
-| `withRetry` | `<T>(fn: () => Promise<T>, options?: RetryOptions) -> Promise<T>` | Executes `fn` with exponential backoff. Retries on transient errors (`ServiceUnavailable`, `Timeout`, `RateLimited`); non-transient errors fail immediately. On exhaustion, enriches the final error with attempt count in message and `data.retryAttempts`. **Place the retry boundary around the full pipeline** (fetch + parse), not just the network call. See `docs/service-resilience.md`. `RetryOptions`: `maxRetries` (default `3`), `baseDelayMs` (default `1000`), `maxDelayMs` (default `30000`), `jitter` (default `0.25`), `operation` (log label), `context` (RequestContext), `signal` (AbortSignal), `isTransient` (custom predicate). |
+| `withRetry` | `<T>(fn: () => Promise<T>, options?: RetryOptions) -> Promise<T>` | Executes `fn` with exponential backoff. Retries on transient errors (`ServiceUnavailable`, `Timeout`, `RateLimited`); non-transient errors fail immediately. On exhaustion, enriches the final error with attempt count in message and `data.retryAttempts`. **Place the retry boundary around the full pipeline** (fetch + parse), not just the network call. `RetryOptions`: `maxRetries` (default `3`), `baseDelayMs` (default `1000`), `maxDelayMs` (default `30000`), `jitter` (default `0.25`), `operation` (log label), `context` (RequestContext), `signal` (AbortSignal), `isTransient` (custom predicate). |
+| `httpErrorFromResponse` | `(response: Response, options?: HttpErrorFromResponseOptions) -> Promise<McpError>` | Maps an HTTP `Response` to a properly classified `McpError` — full status table including 401/403/408/422/429/5xx, body capture (truncated), `retry-after` header, optional `cause`. Use this instead of hand-rolling `if (status === 429) ...` ladders. Reads the response body — `clone()` first if you need it elsewhere. `HttpErrorFromResponseOptions`: `service?` (logical name in message, e.g. `'NCBI'`), `captureBody?` (default `true`), `bodyLimit?` (default `500`), `data?` (extra fields merged into `error.data`), `cause?`, `codeOverride?` (per-status mapping override). Pairs naturally with `withRetry` — both classify codes the same way. |
+| `httpStatusToErrorCode` | `(status: number) -> JsonRpcErrorCode \| undefined` | Sync status → code lookup. Returns `undefined` for 1xx/2xx/3xx. Use when you need just the code without a `Response` object handy. |
 ---
@@ -101,7 +103,7 @@ The `utils` export includes two type guards. The full set of guards lives in the
 | Export | API | Notes |
 |:-------|:----|:------|
-| `ErrorHandler` | `.tryCatch<T>(fn, opts) -> Promise<T>` `.handleError(error, opts) -> Error` `.determineErrorCode(error) -> JsonRpcErrorCode` `.mapError(error, mappings, defaultFactory?) -> T \| Error` `.formatError(error) -> Record<string, unknown>` | Service-level error handling. `tryCatch` wraps async or sync `fn`, logs via `handleError`, and always rethrows. No `.tryCatchSync()`. Use in services, NOT in tool handlers (those throw raw `McpError`). Options: `operation`, `context`, `errorCode`, `input`, `rethrow`, `includeStack`, `critical`, `errorMapper`. |
+| `ErrorHandler` | `.tryCatch<T>(fn, opts) -> Promise<T>` `.handleError(error, opts) -> Error` `.classifyOnly(error) -> { code, message, data? }` `.determineErrorCode(error) -> JsonRpcErrorCode` `.mapError(error, mappings, defaultFactory?) -> T \| Error` `.formatError(error) -> Record<string, unknown>` | Service-level error handling. `tryCatch` wraps async or sync `fn`, logs via `handleError`, and always rethrows. No `.tryCatchSync()`. Use in services, NOT in tool handlers (those throw raw `McpError`). `tryCatch` accepts `Omit<ErrorHandlerOptions, 'rethrow'>` — required: `operation`. Optional: `context`, `errorCode`, `input`, `includeStack`, `critical`, `errorMapper`. `handleError` accepts the full `ErrorHandlerOptions` including `rethrow`. |
 ---

package/skills/design-mcp-server/SKILL.md CHANGED Viewed

@@ -83,7 +83,7 @@ The user-goal list shapes the tool surface; the operation list fills in the gaps
 | Primitive | Use when | Examples |
 |:----------|:---------|:--------|
 | **Tool** | The default. Any operation or data access an agent needs to accomplish the server's purpose. | Search, create, update, analyze, fetch-by-ID, list reference data |
-| **App Tool** | Tool whose results benefit from interactive HTML UI (data visualization, forms, rich rendering). Uses `appTool()` + paired `appResource()`. Hosts without MCP Apps support receive the text fallback from `format()`. | Dashboards, data explorers, interactive charts, form-based workflows |
+| **App Tool** | **Rare — default to a standard tool.** Only when a human will actively interact with the result in real time *and* the target client supports MCP Apps. Most clients are tool-only and most agent workflows are read-by-LLM, not viewed-by-human. App tools add an iframe + CSP, `app.ontoolresult`/`callServerTool` plumbing, host-context wiring, and a `format()` text twin that still has to be content-complete (since most clients only see that). Two surfaces to keep in sync, two failure modes per change. | Dense tabular state a human scrubs through; form-based human approval in an MCP Apps-capable client |
 | **Resource** | *Additionally* expose as a resource when the data is addressable by stable URI, read-only, and useful as injectable context. | Config, schemas, status, entity-by-ID lookups |
 | **Prompt** | Reusable message template that structures how the LLM approaches a task | Analysis framework, report template, review checklist |
 | **Neither** | Internal detail, admin-only, not useful to an LLM | Token refresh, webhook setup, migrations |
@@ -321,7 +321,9 @@ The pattern: name the shortcut for what it does (`text_search`, `name_search`),
 #### Error design
-Errors are part of the tool's interface — design them during the design phase, not as an afterthought. Two aspects: **classification** (what error code) and **messaging** (what the LLM reads).
+Errors are part of the tool's interface — design them during the design phase, not as an afterthought. Three aspects: **the contract** (which failures are public), **classification** (what error code), and **messaging** (what the LLM reads).
+**Declare a typed contract for domain failures.** When a tool has known failure modes the agent should plan around (`no_match`, `queue_full`, `vendor_down`), enumerate them as `errors: [{ reason, code, when, retryable? }]` on the definition. The framework publishes the contract under `tools/list` `_meta['mcp-ts-core/errors']` so capable clients can preview failure modes, types `ctx.fail(reason, …)` against the declared reason union (typos become TS errors), and auto-populates `_meta.error.data.reason` on responses for stable observability. Baseline codes (`InternalError`, `ServiceUnavailable`, `Timeout`, `ValidationError`, `SerializationError`) bubble from anywhere and don't need to be enumerated. See `api-errors` skill for the full pattern.
 **Classify errors by origin.** Different error sources need different codes and different recovery guidance. Map the failure modes for each tool during design:
@@ -333,7 +335,7 @@ Errors are part of the tool's interface — design them during the design phase,
 | **Auth/permissions** | Insufficient scopes, expired token | `Forbidden` / `Unauthorized` | Maybe — escalate or re-auth |
 | **Server internal** | Parse failure, missing config, unexpected state | `InternalError` | No — server-side issue |
-The framework auto-classifies many of these at runtime (HTTP status codes, JS error types, common patterns), but explicit classification in the handler gives better error messages. Use error factories (`notFound()`, `validationError()`, etc.) when you want a specific code; plain `throw new Error()` when the framework's auto-classification is good enough.
+The framework auto-classifies many of these at runtime (HTTP status codes, JS error types, common patterns), but explicit classification in the handler gives better error messages. For declared contract failures, throw via `ctx.fail('reason', …)`. For ad-hoc throws outside the contract, use error factories (`notFound()`, `validationError()`, etc.) when the code matters; plain `throw new Error()` when the framework's auto-classification is good enough.
 **Write error messages as recovery instructions.** The message is the agent's only signal for what to do next.
@@ -354,7 +356,7 @@ throw forbidden(
 throw notFound(`Paper '${id}' not found on arXiv. Verify the ID format (e.g., '2401.12345' or '2401.12345v2').`);
 ```
-**During design, list the expected failure modes for each tool.** Not every mode needs a custom message, but the common ones should have clear recovery guidance baked in. Include these in the tool's section of the design doc — they inform both the handler implementation and the error factory choices.
+**During design, list the expected failure modes for each tool** with the reason, code, and when-clause that will land in the contract. Include these in the tool's section of the design doc — they become the literal `errors: [...]` entries during scaffolding and inform recovery messaging. Not every failure needs a contract entry; baseline infrastructure errors (5xx, timeouts, validation) are fine to let bubble.
 #### Design table
@@ -367,7 +369,7 @@ Summarize each tool:
 | **Description** | Concrete capability statement. Add operational guidance (prerequisites, constraints, gotchas) when non-obvious. |
 | **Input schema** | `.describe()` on every field. Constrained types (enums, literals, regex). Explain costs/tradeoffs of parameter choices. |
 | **Output schema** | Designed for the LLM's next action. Include chaining IDs. Communicate filtering. Post-write state where useful. |
-| **Error messages** | Name what went wrong and what the LLM should do about it. Include hints for common recovery paths. |
+| **Errors** | Declare domain failure modes as a typed contract (`errors: [{ reason, code, when, retryable? }]`) so `ctx.fail` is type-checked and capable clients can preview failures via `tools/list`. Error messages name what went wrong and what the LLM should do about it. |
 | **Annotations** | `readOnlyHint`, `destructiveHint`, `idempotentHint`, `openWorldHint`. Helps clients auto-approve safely. |
 | **Auth scopes** | `tool:<snake_tool_name>:<verb>` or `resource:<kebab-resource-name>:<verb>` (e.g., `tool:inventory_search:read`, `resource:echo-app-ui:read`). Domain-led `<domain>:<verb>` (e.g., `inventory:read`) is an acceptable alternative — pick one convention per server and stay consistent. Skip for read-only or stdio-only servers. |
@@ -399,7 +401,7 @@ Skip for purely data/action-oriented servers.
 **Server-as-service.** When the server IS the source of truth (knowledge graph, in-memory task tracker, local scratchpad, embedded inference wrapper), the resilience table below doesn't apply — there's no upstream to retry. The design questions shift to state management: what's tenant-scoped vs. global, what TTLs apply, what survives a restart, what the storage backend is. Plan persistence via `ctx.state` for tenant-scoped KV (auto-namespaced by `tenantId`), or use a `StorageService` provider directly when data must cross tenants. Service init still happens in `setup()`, accessed via `getMyService()` at request time. Calls within the server are local and synchronous-ish — the API-efficiency table below also doesn't apply.
-For services wrapping external APIs, plan the resilience layer. See `docs/service-resilience.md` for full rationale.
+For services wrapping external APIs, plan the resilience layer.
 | Concern | Decision |
 |:--------|:---------|
@@ -507,9 +509,9 @@ Execute the plan using the scaffolding skills:
 1. `add-service` for each service
 2. `add-tool` for each standard tool
-3. `add-app-tool` for each MCP Apps tool (creates paired tool + UI resource)
-4. `add-resource` for each standalone resource
-5. `add-prompt` for each prompt
+3. `add-resource` for each standalone resource
+4. `add-prompt` for each prompt
+5. `add-app-tool` *only if any app tools survived the design step* (rare — see the App Tool row in Step 3)
 6. `devcheck` after each addition
 ## Checklist
@@ -529,6 +531,7 @@ Items without an `If …:` prefix apply to every design. Conditional items only
 - [ ] Output schemas designed for LLM's next action — chaining IDs, post-write state, filtering communicated
 - [ ] `format()` renders all data the LLM needs — different clients forward different surfaces (Claude Code → `structuredContent`, Claude Desktop → `content[]`); both must carry the same data, not just a count or title
 - [ ] Error messages guide recovery — name what went wrong and what to do next
+- [ ] **If a tool has known domain failure modes:** typed error contract declared (`errors: [{ reason, code, when, retryable? }]`) so `ctx.fail` is type-checked and capable clients see failures via `tools/list`
 - [ ] Annotations set correctly (`readOnlyHint`, `destructiveHint`, `idempotentHint`, `openWorldHint`)
 - [ ] Design doc written to `docs/design.md`
 - [ ] Design confirmed with user (or user pre-authorized implementation)
@@ -537,7 +540,7 @@ Items without an `If …:` prefix apply to every design. Conditional items only
 - [ ] **If state-aware procedural guidance adds value:** instruction tool considered with `nextToolSuggestions` pre-filled from diagnostics
 - [ ] **If workflow tools have destructive modes:** destructive arm guarded by `ctx.elicit` when available, with `destructiveHint` annotation as fallback for non-interactive clients
 - [ ] **If a parameter determines blast radius:** safe default set (e.g., `mode: 'preview'`, `dryRun: true`, `confirmCount` required)
-- [ ] **If interactive UI adds value to results:** MCP Apps tool identified (with `format()` text fallback for non-app hosts)
+- [ ] **App tools default to no.** If one was proposed, verified there's a real human-in-the-loop in an MCP Apps-capable client justifying the iframe/CSP/`format()`-twin maintenance cost — otherwise dropped in favor of a standard tool
 - [ ] **If the server exposes resources:** URIs use `{param}` templates, pagination planned for large lists
 - [ ] **If the server has external deps or shared state:** service layer planned (or explicitly skipped with reasoning)
 - [ ] **If services wrap external APIs:** resilience planned (retry boundary, backoff, parse classification)