npm - @jaggerxtrm/specialists - Versions diffs - 3.5.0 → 3.6.0 - Mend

@jaggerxtrm/specialists 3.5.0 → 3.6.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (37) hide show

package/README.md +12 -1
package/config/hooks/specialists-session-start.mjs +105 -0
package/config/nodes/research-multi.node.json +11 -0
package/config/nodes/research.node.json +27 -0
package/config/presets.json +26 -0
package/config/skills/specialists-creator/SKILL.md +323 -145
package/config/skills/specialists-creator/scripts/scaffold-specialist.ts +228 -0
package/config/skills/using-nodes/SKILL.md +333 -0
package/config/skills/using-specialists/SKILL.md +843 -173
package/config/specialists/debugger.specialist.json +74 -0
package/config/specialists/executor.specialist.json +117 -0
package/config/specialists/explorer.specialist.json +82 -0
package/config/specialists/memory-processor.specialist.json +65 -0
package/config/specialists/node-coordinator.specialist.json +64 -0
package/config/specialists/overthinker.specialist.json +65 -0
package/config/specialists/parallel-review.specialist.json +65 -0
package/config/specialists/planner.specialist.json +93 -0
package/config/specialists/researcher.specialist.json +65 -0
package/config/specialists/reviewer.specialist.json +60 -0
package/config/specialists/specialists-creator.specialist.json +68 -0
package/config/specialists/sync-docs.specialist.json +80 -0
package/config/specialists/test-runner.specialist.json +67 -0
package/config/specialists/xt-merge.specialist.json +60 -0
package/dist/index.js +13818 -2743
package/package.json +6 -3
package/config/specialists/debugger.specialist.yaml +0 -121
package/config/specialists/executor.specialist.yaml +0 -257
package/config/specialists/explorer.specialist.yaml +0 -85
package/config/specialists/memory-processor.specialist.yaml +0 -154
package/config/specialists/overthinker.specialist.yaml +0 -76
package/config/specialists/parallel-review.specialist.yaml +0 -75
package/config/specialists/planner.specialist.yaml +0 -94
package/config/specialists/reviewer.specialist.yaml +0 -142
package/config/specialists/specialists-creator.specialist.yaml +0 -90
package/config/specialists/sync-docs.specialist.yaml +0 -68
package/config/specialists/test-runner.specialist.yaml +0 -65
package/config/specialists/xt-merge.specialist.yaml +0 -159

package/config/skills/specialists-creator/SKILL.md CHANGED Viewed

@@ -2,10 +2,11 @@
 name: specialists-creator
 description: >
   Use this skill when creating or fixing a specialist definition. It guides the
-  agent through writing a valid `.specialist.yaml`, choosing supported models,
+  agent through writing a valid `.specialist.json`, choosing supported models,
   validating against the schema, and avoiding common specialist authoring
   mistakes.
-version: 1.0
+version: 1.1
+synced_at: 236ca5e6
 ---
 # Specialist Author Guide
@@ -16,7 +17,7 @@ version: 1.0
 ## ACTION REQUIRED BEFORE ANYTHING ELSE
-Run these commands **right now**, before reading further, before writing any YAML, before doing anything else:
+Run these commands **right now**, before reading further, before writing any JSON, before doing anything else:
 ```bash
 pi --list-models
@@ -145,45 +146,47 @@ specialists models  # confirm assignments look balanced
 ### For a new specialist (single model selection)
-> **See [⛔ MANDATORY FIRST STEP](#-mandatory-first-step--verify-models-before-writing-any-yaml) at the top of this skill.**
-> Use `pi --list-models` (not `specialists models`) to discover models, ping both before writing YAML.
+> **See [⛔ MANDATORY FIRST STEP](#-mandatory-first-step--verify-models-before-writing-any-json) at the top of this skill.**
+> Use `pi --list-models` (not `specialists models`) to discover models, ping both before mutating config.
 ```bash
 # 1. pi --list-models            — see exactly what's available on pi right now
 # 2. Pick tier + pick highest version in family
 # 3. pi --model <primary>  --print "ping"   — must return "pong"
 # 4. pi --model <fallback> --print "ping"   — must return "pong"
-# 5. Write YAML with verified model strings
+# 5. Run scaffold-specialist.ts first (pre-script already wired in specialists-creator)
+# 6. Use sp edit for field-by-field mutations
 ```
 **Rule:** Never hardcode a model without pinging it. If ping fails, try the next best in that tier.
 ---
-## Quick Start: Minimal Skeleton
+## Quick Start: Scaffold + `sp edit`
-```yaml
-specialist:
-  metadata:
-    name: my-specialist          # kebab-case, required
-    version: 1.0.0               # semver, required
-    description: "One sentence." # required
-    category: workflow           # required (free text)
+```bash
+# 1. Create/normalize the specialist JSON with all schema sections present
+node config/skills/specialists-creator/scripts/scaffold-specialist.ts config/specialists/my-specialist.specialist.json
-  execution:
-    model: anthropic/claude-sonnet-4-6  # run model setup workflow above to choose + verify
-    permission_required: READ_ONLY
+# 2. Apply a preset for common model/thinking defaults (optional but preferred)
+sp edit my-specialist --preset standard
-  prompt:
-    task_template: |
-      $prompt
+# 3. Set individual fields via dot.path (primary mutation workflow)
+sp edit my-specialist specialist.metadata.name my-specialist
+sp edit my-specialist specialist.metadata.version 1.0.0
+sp edit my-specialist specialist.execution.model anthropic/claude-sonnet-4-6
+sp edit my-specialist specialist.execution.fallback_model google-gemini-cli/gemini-3.1-pro-preview
+sp edit my-specialist specialist.execution.permission_required READ_ONLY
-      Working directory: $cwd
-```
+# 4. Use --file only for multiline prompt fields
+sp edit my-specialist specialist.prompt.system --file .tmp/system.prompt.txt
+sp edit my-specialist specialist.prompt.task_template --file .tmp/task-template.prompt.txt
-Validate before committing:
-```bash
-bun skills/specialist-author/scripts/validate-specialist.ts specialists/my-specialist.specialist.yaml
+# 5. Verify materialized JSON
+sp view my-specialist
+# 6. Validate schema
+bun skills/specialist-author/scripts/validate-specialist.ts config/specialists/my-specialist.specialist.json
 ```
 ---
@@ -214,6 +217,7 @@ bun skills/specialist-author/scripts/validate-specialist.ts specialists/my-speci
 | `stall_timeout_ms` | number | — | kill if no event for N ms |
 | `interactive` | boolean | `false` | enable multi-turn keep-alive by default |
 | `response_format` | enum | `text` | `text` \| `json` \| `markdown` |
+| `output_type` | enum | `custom` | `codegen` \| `analysis` \| `review` \| `synthesis` \| `orchestration` \| `workflow` \| `research` \| `custom` |
 | `permission_required` | enum | `READ_ONLY` | see tier table below |
 | `thinking_level` | enum | — | `off` \| `minimal` \| `low` \| `medium` \| `high` \| `xhigh` |
@@ -244,26 +248,110 @@ bun skills/specialist-author/scripts/validate-specialist.ts specialists/my-speci
 | `task_template` | string | yes | Template string with `$variable` substitution |
 | `system` | string | no | System prompt / agents.md content |
 | `skill_inherit` | string | no | Single skill folder/file injected via `pi --skill` (Agent Forge compat) |
-| `output_schema` | object | no | JSON schema for structured output |
+| `output_schema` | object | no | JSON schema for structured output — injected into system prompt by runner; post-run validation is warn-only |
 | `examples` | array | no | Few-shot examples |
+**Output contract precedence (runner-injected):** `response_format` → `output_type` → `output_schema`.
+**`response_format` behavior**
+- `text`: no report template is injected (raw behavior)
+- `json`: specialist must return one parseable JSON object
+- `markdown`: specialist must use canonical report sections when applicable:
+  - `## Summary`
+  - `## Status`
+  - `## Changes`
+  - `## Verification`
+  - `## Risks`
+  - `## Follow-ups`
+  - `## Beads`
+  - Optional: `## Architecture`, `## Acceptance Criteria`, `## Machine-readable block`
+**`output_type` (semantic archetype)**
+- `codegen`: implementation/change manifests
+- `analysis`: architecture/exploration reports
+- `review`: compliance/review verdicts
+- `synthesis`: decision summaries across multiple findings
+- `orchestration`: coordinator actions/state handoffs
+- `workflow`: procedural/operational run outputs
+- `research`: source-backed findings with confidence
+- `custom`: no built-in extension (schema still includes base contract fields in structured modes)
+**`output_schema` guidance**: Add when output must be machine-readable by downstream consumers (beads notes, feed, orchestrators). The schema is injected into the system prompt and validated post-run with warn-only behavior (never hard-fail in v1).
+**Mandatory markdown+schema rule:** if `response_format: markdown` and `output_schema` is present, the output must include `## Machine-readable block` containing exactly one JSON object in a single ` ```json ` fenced block. That JSON object is canonical and must match the schema.
+Standard schemas by specialist type (shown as the `output_schema` object value):
+executor — change manifest:
+```json
+{
+  "type": "object",
+  "properties": {
+    "status": { "enum": ["success", "partial", "failed"] },
+    "files_changed": { "type": "array", "items": { "type": "string" } },
+    "symbols_modified": { "type": "array", "items": { "type": "string" } },
+    "lint_pass": { "type": "boolean" },
+    "tests_pass": { "type": "boolean" },
+    "issues_closed": { "type": "array", "items": { "type": "string" } },
+    "follow_ups": { "type": "array", "items": { "type": "string" } }
+  }
+}
+```
+explorer — analysis report:
+```json
+{
+  "type": "object",
+  "properties": {
+    "summary": { "type": "string" },
+    "key_files": { "type": "array", "items": { "type": "string" } },
+    "architecture_notes": { "type": "string" },
+    "recommendations": { "type": "array", "items": { "type": "string" } }
+  }
+}
+```
+planner — epic result:
+```json
+{
+  "type": "object",
+  "properties": {
+    "epic_id": { "type": "string" },
+    "children": { "type": "array", "items": { "type": "string" } },
+    "test_issues": { "type": "array", "items": { "type": "string" } },
+    "first_task": { "type": "string" }
+  }
+}
+```
 ### `specialist.skills` (optional)
-```yaml
-skills:
-  paths:                          # passed as pi --skill; folder (reads SKILL.md inside) or direct file
-    - skills/my-skill/            # folder — pi loads SKILL.md from inside
-    - ~/.agents/skills/domain/    # same
-    - skills/notes.md             # direct file also accepted
-  scripts:
-    - run: ./scripts/pre-check.sh # file path OR shell command
-      phase: pre                  # "pre" or "post"
-      inject_output: true         # true = stdout available as $pre_script_output
-    - run: "bd ready"             # inline command — runs via shell
-      phase: pre
-      inject_output: true
-    - run: ./scripts/cleanup.sh
-      phase: post
+```json
+{
+  "skills": {
+    "paths": [
+      "skills/my-skill/",
+      "~/.agents/skills/domain/",
+      "skills/notes.md"
+    ],
+    "scripts": [
+      {
+        "run": "./scripts/pre-check.sh",
+        "phase": "pre",
+        "inject_output": true
+      },
+      {
+        "run": "bd ready",
+        "phase": "pre",
+        "inject_output": true
+      },
+      {
+        "run": "./scripts/cleanup.sh",
+        "phase": "post"
+      }
+    ]
+  }
+}
 ```
 `run` accepts either a **file path** (`./scripts/foo.sh`, `~/scripts/foo.sh`) or a **shell command** (`bd ready`, `git status`). Pre-run validation checks that file paths exist and shell commands are on `PATH`. Shebang typos (e.g. `pytho` instead of `python`) are caught and reported as errors before the session starts.
@@ -274,29 +362,44 @@ skills:
 Informational declarations used by pre-run validation and future tooling (e.g. `specialists doctor`).
-```yaml
-capabilities:
-  required_tools: [bash, read, grep, glob]   # pi tools this specialist needs
-  external_commands: [bd, git, gh]           # CLI binaries validated on PATH before run
+```json
+{
+  "capabilities": {
+    "required_tools": ["bash", "read", "grep", "glob"],
+    "external_commands": ["bd", "git", "gh"]
+  }
+}
 ```
 `external_commands` causes a hard failure if any binary is not found on `PATH` — the session will not start.
 ### `specialist.output_file` (optional, top-level)
-```yaml
-output_file: .specialists/my-specialist-result.md
+```json
+{
+  "output_file": ".specialists/my-specialist-result.md"
+}
 ```
 Writes the final session output to this file path after the session completes. Relative to the working directory.
 ### `specialist.communication` (optional)
-```yaml
-communication:
-  next_specialists: planner             # single specialist to chain after completion
-  # or an array:
-  next_specialists: [planner, test-runner]
+```json
+{
+  "communication": {
+    "next_specialists": "planner"
+  }
+}
+```
+Or as an array:
+```json
+{
+  "communication": {
+    "next_specialists": ["planner", "test-runner"]
+  }
+}
 ```
 `next_specialists` declares which specialist(s) should receive this specialist's output as `$previous_result`. Chaining is executed by the caller (e.g. `run_parallel` pipeline) — this field is declarative metadata.
@@ -319,16 +422,21 @@ Drives the staleness detection shown in `specialists status` and `specialists li
 | `STALE` | A watched file's mtime > `metadata.updated` |
 | `AGED` | STALE + days since `updated` > `stale_threshold_days` |
-```yaml
-specialist:
-  metadata:
-    updated: "2026-03-01"
-  validation:
-    files_to_watch:
-      - src/specialist/schema.ts
-      - src/specialist/runner.ts
-    stale_threshold_days: 30
+```json
+{
+  "specialist": {
+    "metadata": {
+      "updated": "2026-03-01"
+    },
+    "validation": {
+      "files_to_watch": [
+        "src/specialist/schema.ts",
+        "src/specialist/runner.ts"
+      ],
+      "stale_threshold_days": 30
+    }
+  }
+}
 ```
 This specialist goes STALE the moment `schema.ts` or `runner.ts` is modified after March 1st, and AGED if that condition persists for more than 30 days.
@@ -368,11 +476,15 @@ These are **always available** in `task_template` — no configuration needed:
 Files listed under `skills.paths` are read and appended to the system prompt at runtime:
-```yaml
-skills:
-  paths:
-    - skills/specialist-author/SKILL.md
-    - .claude/agents.md
+```json
+{
+  "skills": {
+    "paths": [
+      "skills/specialist-author/SKILL.md",
+      ".claude/agents.md"
+    ]
+  }
+}
 ```
 Each file is appended as:
@@ -393,15 +505,23 @@ Missing files are silently skipped (no error).
 Scripts run **locally** (not inside the agent session):
-```yaml
-skills:
-  scripts:
-    - path: scripts/gather-context.sh
-      phase: pre
-      inject_output: true    # stdout -> $pre_script_output in task_template
-    - path: scripts/notify.sh
-      phase: post
-      inject_output: false   # runs after session, output discarded
+```json
+{
+  "skills": {
+    "scripts": [
+      {
+        "run": "scripts/gather-context.sh",
+        "phase": "pre",
+        "inject_output": true
+      },
+      {
+        "run": "scripts/notify.sh",
+        "phase": "post",
+        "inject_output": false
+      }
+    ]
+  }
+}
 ```
 - `pre` scripts run before the agent session starts; use `inject_output: true` to surface their stdout.
@@ -413,62 +533,107 @@ skills:
 ## Annotated Full Example
-```yaml
-specialist:
-  metadata:
-    name: code-reviewer
-    version: 1.0.0
-    description: "Reviews a PR diff for correctness, style, and security issues."
-    category: code-quality
-    author: team@example.com
-    updated: "2026-03-22"
-    tags: [review, code-quality, security]
-  execution:
-    mode: tool
-    model: anthropic/claude-sonnet-4-6
-    fallback_model: google-gemini-cli/gemini-3.1-pro-preview
-    timeout_ms: 300000
-    stall_timeout_ms: 60000
-    interactive: true                # default keep-alive; supports resume flows
-    response_format: markdown
-    permission_required: READ_ONLY   # not READ_WRITE
-  prompt:
-    system: |
-      You are an expert code reviewer. Focus on correctness, maintainability, and security.
-      Do NOT modify any files -- output a markdown review only.
-    task_template: |
-      Review the following changes:
-      $prompt
-      $pre_script_output
-      Working directory: $cwd
-      Output a structured markdown review with sections: Summary, Issues, Suggestions.
-    skill_inherit: skills/code-review/guidelines.md
-  skills:
-    paths:
-      - skills/code-review/
-    scripts:
-      - run: scripts/get-diff.sh
-        phase: pre
-        inject_output: true
-  capabilities:
-    required_tools: [bash, read]
-    external_commands: [git]
-  communication:
-    next_specialists: [sync-docs]
-  output_file: .specialists/review.md
-  beads_integration: auto
+```json
+{
+  "specialist": {
+    "metadata": {
+      "name": "code-reviewer",
+      "version": "1.0.0",
+      "description": "Reviews a PR diff for correctness, style, and security issues.",
+      "category": "code-quality",
+      "author": "team@example.com",
+      "updated": "2026-03-22",
+      "tags": ["review", "code-quality", "security"]
+    },
+    "execution": {
+      "mode": "tool",
+      "model": "anthropic/claude-sonnet-4-6",
+      "fallback_model": "google-gemini-cli/gemini-3.1-pro-preview",
+      "timeout_ms": 300000,
+      "stall_timeout_ms": 60000,
+      "interactive": true,
+      "response_format": "markdown",
+      "permission_required": "READ_ONLY"
+    },
+    "prompt": {
+      "system": "You are an expert code reviewer. Focus on correctness, maintainability, and security.\nDo NOT modify any files -- output a markdown review only.\n",
+      "task_template": "Review the following changes:\n\n$prompt\n\n$pre_script_output\n\nWorking directory: $cwd\n\nOutput a structured markdown review with sections: Summary, Issues, Suggestions.\n",
+      "skill_inherit": "skills/code-review/guidelines.md"
+    },
+    "skills": {
+      "paths": [
+        "skills/code-review/"
+      ],
+      "scripts": [
+        {
+          "run": "scripts/get-diff.sh",
+          "phase": "pre",
+          "inject_output": true
+        }
+      ]
+    },
+    "capabilities": {
+      "required_tools": ["bash", "read"],
+      "external_commands": ["git"]
+    },
+    "communication": {
+      "next_specialists": ["sync-docs"]
+    },
+    "output_file": ".specialists/review.md",
+    "beads_integration": "auto"
+  }
+}
+```
+---
+## Context Window & Lifecycle Design
+Specialists run as long-lived Pi sessions. Context management is not optional — ignoring it causes silent quality degradation before any hard limit is hit.
+### Context rot starts before the window fills
+Quality degrades as the context grows — compressed early context causes inconsistency, missed facts, and instruction drift. Design for bounded, coherent runs rather than arbitrarily long ones.
+**Rules when authoring a specialist:**
+- Set `stall_timeout_ms` explicitly for any specialist that may idle between turns (keep-alive/interactive). Without it, a stuck session holds resources indefinitely.
+- Use `thinking_level: low` for orchestration/coordinator specialists that emit structured JSON output — thinking tokens cost context budget without improving structured output quality.
+- For research/explorer specialists: bounded scope per session + `handoff_summary` in `output_schema` > one unbounded session.
+- `interactive: true` specialists must define what "done" looks like in their system prompt — otherwise they drift.
+### Context metrics are always available
+`status.json` exposes `metrics.token_usage` (cumulative input+output tokens) and `metrics.turns` on every turn. These are written by 08zd Phase 1 and available to any caller (NodeSupervisor, orchestrator, human).
+**context_pct formula**: `(cumulative_input_tokens / model_context_window) * 100`
+Approximate context windows:
+| Model family | Window |
+|-------------|--------|
+| `claude-opus-4-6`, `claude-sonnet-4-6`, `claude-haiku-4-5` | 200k tokens |
+| `gemini-3.1-pro-preview` | 1M tokens |
+| `qwen3.5-plus`, `dashscope/qwen3.5-plus` | 128k tokens |
+| `zai/glm-5`, `zai/glm-5-turbo` | 128k tokens |
+### For Node members specifically
+NodeSupervisor injects `member_health` into the coordinator resume prompt on **every turn** — not just at warning thresholds. This is by design: the coordinator needs continuous data to make proactive rotation decisions before quality degrades.
+When authoring a specialist intended to run as a Node member:
+- Include a `handoff_summary` field in `output_schema` so context can be transferred on rotation
+- Keep system prompts concise — the NodeSupervisor will inject additional context on each resume
+- `thinking_level: low` or `off` for coordinator-class specialists; higher levels for deep analysis members
+### Design checklist for long-running specialists
+Before finalising a specialist that uses `interactive: true` or is expected to run many turns:
+```
+[ ] stall_timeout_ms set (not relying on timeout_ms alone)
+[ ] thinking_level set appropriately for the output type
+[ ] output_schema includes handoff_summary or equivalent for rotation
+[ ] system prompt has explicit termination condition ("you are done when...")
+[ ] task_template doesn't inject large static blobs that could be fetched on demand
 ```
 ---
@@ -478,15 +643,15 @@ specialist:
 | Zod Error | Cause | Fix |
 |-----------|-------|-----|
 | `Must be kebab-case` | `name` has uppercase or spaces | Use `my-specialist` not `MySpecialist` |
-| `Must be semver` | `version: "v1.0"` | Use `version: 1.0.0` (no `v` prefix) |
+| `Must be semver` | `version: "v1.0"` | Use `"version": "1.0.0"` (no `v` prefix) |
 | `Invalid enum value ... 'READ_WRITE'` | Wrong permission value | Use `READ_ONLY`, `LOW`, `MEDIUM`, or `HIGH` |
 | `Invalid enum value ... 'auto'` on permission_required | Using `auto` for permission_required | `auto` is only valid for `beads_integration` |
-| `Required` on `task_template` | `task_template` missing from `prompt:` | Add `task_template` (even if just `$prompt`) |
-| `Required` on `model` | `model` missing from `execution:` | Add a model string |
+| `Required` on `task_template` | `task_template` missing from `prompt` | Add `task_template` (even if just `"$prompt"`) |
+| `Required` on `model` | `model` missing from `execution` | Add a model string |
 | `Required` on `description` | Missing `description` in `metadata` | Add description string |
 | `Required` on `category` | Missing `category` in `metadata` | Add category string |
-| Silently ignored / no output | YAML valid but `task_template` doesn't use `$prompt` | Add `$prompt` to `task_template` |
-| `defaults` key unrecognized | Using `defaults:` top-level key | Remove it; use `--variables` at invocation or built-ins |
+| Silently ignored / no output | JSON valid but `task_template` doesn't use `$prompt` | Add `$prompt` to `task_template` |
+| `defaults` key unrecognized | Using `defaults` top-level key | Remove it; use `--variables` at invocation or built-ins |
 ---
@@ -494,11 +659,11 @@ specialist:
 Specialists are discovered from three scopes (highest priority first):
-1. **Project**: `<project-root>/specialists/*.specialist.yaml`
-2. **User**: `~/.agents/specialists/*.specialist.yaml`
+1. **Project**: `<project-root>/specialists/*.specialist.json`
+2. **User**: `~/.agents/specialists/*.specialist.json`
 3. **System**: package-bundled specialists
-Name your file `<metadata.name>.specialist.yaml`.
+Name your file `<metadata.name>.specialist.json`.
 ---
@@ -512,15 +677,28 @@ pi --list-models
 pi --model <provider>/<primary-model-id>  --print "ping"   # must return "pong"
 pi --model <provider>/<fallback-model-id> --print "ping"   # must return "pong"
-# 2. Write the YAML with the verified model
+# 2. Scaffold first (fills missing schema sections/fields)
+node config/skills/specialists-creator/scripts/scaffold-specialist.ts config/specialists/my-specialist.specialist.json
+# 3. Mutate with sp edit (dot.path + presets)
+sp edit my-specialist --preset standard
+sp edit my-specialist specialist.execution.model <provider>/<primary-model-id>
+sp edit my-specialist specialist.execution.fallback_model <provider>/<fallback-model-id>
+# 4. Use --file only for multiline prompt fields
+sp edit my-specialist specialist.prompt.system --file .tmp/system.prompt.txt
+sp edit my-specialist specialist.prompt.task_template --file .tmp/task-template.prompt.txt
+# 5. Verify rendered config
+sp view my-specialist
-# 3. Validate schema with the bundled helper
-bun skills/specialist-author/scripts/validate-specialist.ts specialists/my-specialist.specialist.yaml
+# 6. Validate schema with the bundled helper
+bun skills/specialist-author/scripts/validate-specialist.ts config/specialists/my-specialist.specialist.json
-# 4. List to confirm discovery
+# 7. List to confirm discovery
 specialists list
-# 5. Smoke test
+# 8. Smoke test
 specialists run my-specialist --prompt "ping" --no-beads
 ```