npm - gentle-pi - Versions diffs - 0.6.0 → 0.7.0 - Mend

gentle-pi 0.6.0 → 0.7.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (13) hide show

package/README.md +10 -1
package/assets/agents/jd-fix-agent.md +16 -0
package/assets/agents/jd-judge-a.md +16 -0
package/assets/agents/jd-judge-b.md +16 -0
package/docs/skill-style-guide.md +116 -0
package/extensions/gentle-ai.ts +24 -15
package/package.json +2 -1
package/prompts/skill-creation.md +20 -0
package/scripts/verify-package-files.mjs +4 -0
package/skills/judgment-day/SKILL.md +2 -2
package/skills/skill-creator/SKILL.md +86 -0
package/skills/skill-improver/SKILL.md +53 -0
package/tests/gentle-ai.test.ts +49 -0

package/README.md CHANGED Viewed

@@ -55,6 +55,7 @@ Most coding-agent sessions fail for operational reasons, not model reasons:
 | **Reviewer protection**        | Surfaces review workload risk before a task turns into an oversized PR.                                                                       |
 | **Per-agent model assignment** | Pi-native modal for assigning stronger or cheaper models to specific SDD/custom agents.                                                       |
 | **Skill discovery registry**   | Maintains `.atl/skill-registry.md` from project and user skills so review/comment/PR workflows do not silently miss the right skill.          |
+| **Skill creation workflow**    | Provides the `skill-creator`/`skill-improver` skills, `/skill-creation` prompt, and packaged style guide for LLM-first skills.                 |
 | **Delivery skills**            | Includes issue-first PRs, chained PRs, work-unit commits, cognitive docs, comment writing, and Judgment Day review.                           |
 | **Runtime safety**             | Blocks destructive shell commands, asks for confirmation for sensitive operations, and blocks direct read/write/edit access to sensitive paths. |
@@ -299,6 +300,10 @@ Behavior:
 Skill discovery is a guardrail, not a workflow router: it helps Pi load the right skill without forcing extra ceremony.
+`gentle-pi` also ships package-owned `skill-creator` and `skill-improver` skills plus the `/skill-creation` prompt for creating or updating project skills. Both skills use `docs/skill-style-guide.md` as their normative style contract. The workflow checks for duplicates, keeps `SKILL.md` concise, uses one-line trigger-rich frontmatter, and reminds maintainers to refresh the registry after skill changes.
+Packaged skills include `cognitive-doc-design`, `comment-writer`, `judgment-day`, `skill-creator`, `skill-improver`, and the other delivery/review skills under `skills/`. SDD init is installed as the packaged `sdd-init` runtime agent under `assets/agents/` and refreshed with the SDD assets.
 Delegation contract:
 - parent/orchestrator resolves project/user skills from the registry and passes matching paths under `## Skills to load before work`;
@@ -395,6 +400,7 @@ Legacy string entries are still accepted and treated as `model`-only config.
 | `/gentle-ai:install-sdd`         | Repairs missing global SDD runtime assets without overwriting files. |
 | `/gentle-ai:install-sdd --force` | Force-refreshes installed global SDD assets.                         |
 | `/skill-registry:refresh`        | Regenerates `.atl/skill-registry.md`.                               |
+| `/skill-creation`                | Creates or updates an LLM-first skill using the packaged `skill-creator` contract and style guide. |
 Package-owned global SDD runtime assets are also refreshed automatically on session start when `gentle-pi` changes. Project-local `.pi/agents` and `.pi/chains` remain manual overrides and are never overwritten by startup refresh.
@@ -431,6 +437,8 @@ Compatibility aliases:
 - `cognitive-doc-design` — documentation that reduces cognitive load.
 - `comment-writer` — concise, warm, postable collaboration comments.
 - `issue-creation` — issue workflow with checks before creation.
+- `skill-creator` — create LLM-first skills with valid frontmatter.
+- `skill-improver` — audit and upgrade existing LLM-first skills.
 ## Memory
@@ -464,7 +472,8 @@ Memory contract for SDD delegation:
 | `assets/chains/`               | SDD chains installed as global Pi runtime assets.                                                          |
 | `assets/support/`              | Strict TDD support docs for apply/verify phases.                                                           |
 | `skills/`                      | Gentle AI delivery and collaboration skills.                                                               |
-| `prompts/`                     | Gentle-prefixed prompt templates.                                                                          |
+| `prompts/`                     | Gentle-prefixed prompt templates, including `/skill-creation`.                                             |
+| `docs/skill-style-guide.md`    | Normative style guide used by the packaged skill creation/improvement skills.                              |
 ## Development

package/assets/agents/jd-fix-agent.md ADDED Viewed

@@ -0,0 +1,16 @@
+---
+name: jd-fix-agent
+description: Judgment Day surgical fix agent for confirmed findings. Can edit code and run focused tests.
+tools: read, grep, glob, edit, write, bash
+---
+You are the Judgment Day fix agent for Gentle AI.
+Apply surgical fixes for confirmed Judgment Day findings only. Preserve the original design intent, keep the patch focused, and avoid unrelated refactors.
+Rules:
+- Edit only the files needed to resolve confirmed findings.
+- Add or update focused tests when the fix changes behavior.
+- Run the relevant tests when practical and report exact results.
+- Clearly list what was fixed, what was verified, and any remaining risks.

package/assets/agents/jd-judge-a.md ADDED Viewed

@@ -0,0 +1,16 @@
+---
+name: jd-judge-a
+description: Judgment Day blind adversarial reviewer A. Read-only; reports findings and does not fix code.
+tools: read, grep, glob, bash
+---
+You are Judgment Day judge A for Gentle AI.
+Run an independent, blind adversarial review of the assigned change. Focus on correctness, regressions, missing tests, unsafe behavior, and mismatches with the user's request.
+Rules:
+- Stay read-only. Do not edit files or apply fixes.
+- Do not coordinate with judge B before producing your review.
+- Report concrete findings with file paths, evidence, severity, and suggested verification.
+- If you find no confirmed issues, say so clearly.

package/assets/agents/jd-judge-b.md ADDED Viewed

@@ -0,0 +1,16 @@
+---
+name: jd-judge-b
+description: Judgment Day blind adversarial reviewer B. Read-only; independently reports findings and does not fix code.
+tools: read, grep, glob, bash
+---
+You are Judgment Day judge B for Gentle AI.
+Run an independent, blind adversarial review of the assigned change. Challenge assumptions from a different angle than judge A, with special attention to edge cases, test gaps, integration risks, and user-visible regressions.
+Rules:
+- Stay read-only. Do not edit files or apply fixes.
+- Work independently from judge A and do not rely on judge A's conclusions.
+- Report concrete findings with file paths, evidence, severity, and suggested verification.
+- If you find no confirmed issues, say so clearly.

package/docs/skill-style-guide.md ADDED Viewed

@@ -0,0 +1,116 @@
+# Skill Style Guide
+This guide is the normative style contract for LLM-first skills shipped with or created inside `gentle-pi` projects.
+## Purpose
+A skill is a runtime instruction contract for an LLM. It should make future agent behavior more reliable by encoding reusable workflow rules, decision gates, and output expectations.
+A skill is not a tutorial, article, README, or generic checklist for humans.
+## When to create a skill
+Create or update a skill when:
+- a workflow or convention is reused across sessions;
+- project-specific constraints differ from generic best practices;
+- a decision tree helps the agent choose safely;
+- templates, schemas, or local references improve repeatability;
+- agents keep missing the same instruction without an explicit runtime contract.
+Do not create a skill for:
+- one-off tasks;
+- generic documentation;
+- rules that belong in tests, linters, or executable code;
+- broad background context without concrete execution rules.
+## Required structure
+Use this directory shape:
+```text
+skills/{skill-name}/
+├── SKILL.md
+├── assets/       # optional: templates, schemas, examples, fixtures
+└── references/   # optional: longer local docs or rationale
+```
+`SKILL.md` must use this section order:
+1. `Activation Contract`
+2. `Hard Rules`
+3. `Decision Gates`
+4. `Execution Steps`
+5. `Output Contract`
+6. `References`
+Omit optional supporting directories when they are not needed.
+## Frontmatter
+Use YAML frontmatter with this shape:
+```yaml
+---
+name: {kebab-case-skill-name}
+description: "Trigger: {phrases users or agents will say}. {What this skill does}."
+license: Apache-2.0
+metadata:
+  author: gentleman-programming
+  version: "1.0"
+---
+```
+Rules:
+- `name` must be kebab-case and match the skill directory unless there is a deliberate compatibility reason.
+- `description` must be one physical line, quoted, YAML-safe, and trigger-rich.
+- Put essential trigger words first in `description`.
+- Do not add a `Keywords` section.
+- Preserve license and metadata unless the project has a stronger local convention.
+## Writing rules
+- Write imperative runtime instructions, not explanatory prose.
+- Keep `SKILL.md` concise: target 180–450 tokens, recommended max 700, hard max 1000.
+- Prefer bullets and compact decision tables over paragraphs.
+- State when to activate the skill and when not to activate it.
+- Preserve author intent when improving an existing skill.
+- Do not invent domain policies, triggers, or constraints. Ask or mark ambiguity instead.
+- Move long examples, schemas, generated templates, and background rationale to `assets/` or `references/`.
+- References must point to local files that ship with the project or package.
+## Decision gates
+Use a table when choices matter:
+```markdown
+| Situation | Action |
+| --- | --- |
+| Missing frontmatter | Fix required fields |
+| Existing skill covers it | Update the existing skill instead |
+| Long examples needed | Move them to `assets/` |
+```
+Decision gates should prevent unsafe overreach, duplicate skills, and unnecessary ceremony.
+## Output contract
+Every skill should tell the agent what to return. Good output contracts include:
+- files created or modified;
+- commands or verification run;
+- registry refresh needed;
+- unresolved ambiguities;
+- residual risks.
+## Registry expectations
+After creating, removing, moving, or renaming project skills, refresh the skill registry when available:
+```text
+/skill-registry:refresh
+```
+The registry is an index. `SKILL.md` remains the source of truth.

package/extensions/gentle-ai.ts CHANGED Viewed

@@ -438,7 +438,18 @@ const SDD_AGENT_NAMES = [
 ] as const;
 const SDD_AGENT_NAME_SET = new Set<string>(SDD_AGENT_NAMES);
-type SddAgentName = (typeof SDD_AGENT_NAMES)[number];
+const JUDGMENT_DAY_AGENT_NAMES = [
+	"jd-judge-a",
+	"jd-judge-b",
+	"jd-fix-agent",
+] as const;
+const CORE_MODEL_AGENT_NAMES = [
+	...SDD_AGENT_NAMES,
+	...JUDGMENT_DAY_AGENT_NAMES,
+] as const;
+const CORE_MODEL_AGENT_NAME_SET = new Set<string>(CORE_MODEL_AGENT_NAMES);
 type ThinkingLevel = "off" | "minimal" | "low" | "medium" | "high" | "xhigh";
 interface AgentRoutingEntry {
 	model?: string;
@@ -998,14 +1009,7 @@ function listDiscoverableAgents(cwd: string): AgentEntry[] {
 	];
 	const byName = new Map<string, AgentEntry>();
 	for (const agent of agents) byName.set(agent.name, agent);
-	const discovered = Array.from(byName.values());
-	const sddFirst = SDD_AGENT_NAMES.map((name) =>
-		discovered.find((agent) => agent.name === name),
-	).filter((agent): agent is AgentEntry => agent !== undefined);
-	const rest = discovered
-		.filter((agent) => !SDD_AGENT_NAMES.includes(agent.name as SddAgentName))
-		.sort((left, right) => left.name.localeCompare(right.name));
-	return [...sddFirst, ...rest];
+	return orderDiscoverableAgents(Array.from(byName.values()));
 }
 async function listDiscoverableAgentsAsync(cwd: string): Promise<AgentEntry[]> {
@@ -1030,14 +1034,17 @@ async function listDiscoverableAgentsAsync(cwd: string): Promise<AgentEntry[]> {
 	}
 	const byName = new Map<string, AgentEntry>();
 	for (const agent of agents) byName.set(agent.name, agent);
-	const discovered = Array.from(byName.values());
-	const sddFirst = SDD_AGENT_NAMES.map((name) =>
-		discovered.find((agent) => agent.name === name),
+	return orderDiscoverableAgents(Array.from(byName.values()));
+}
+function orderDiscoverableAgents(agents: AgentEntry[]): AgentEntry[] {
+	const coreFirst = CORE_MODEL_AGENT_NAMES.map((name) =>
+		agents.find((agent) => agent.name === name),
 	).filter((agent): agent is AgentEntry => agent !== undefined);
-	const rest = discovered
-		.filter((agent) => !SDD_AGENT_NAMES.includes(agent.name as SddAgentName))
+	const rest = agents
+		.filter((agent) => !CORE_MODEL_AGENT_NAME_SET.has(agent.name))
 		.sort((left, right) => left.name.localeCompare(right.name));
-	return [...sddFirst, ...rest];
+	return [...coreFirst, ...rest];
 }
 function projectSettingsPath(cwd: string): string {
@@ -1904,6 +1911,8 @@ async function applyReviewGate(
 export const __testing = {
 	listAgentsFromDir,
 	listAgentsFromDirAsync,
+	listDiscoverableAgents,
+	orderDiscoverableAgents,
 	classifyGuardedCommand,
 	loadRuntimeGuardrailsConfig,
 	buildGentlePrompt,

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "gentle-pi",
-  "version": "0.6.0",
+  "version": "0.7.0",
   "description": "Turn Pi into el Gentleman: a senior-architect development harness with SDD/OpenSpec, subagents, strict TDD evidence, review guardrails, and skill discovery.",
   "license": "MIT",
   "type": "module",
@@ -24,6 +24,7 @@
   },
   "files": [
     "assets/",
+    "docs/",
     "extensions/",
     "lib/",
     "prompts/",

package/prompts/skill-creation.md ADDED Viewed

@@ -0,0 +1,20 @@
+---
+description: Create or update an LLM-first skill
+argument-hint: "<skill idea or name>"
+---
+Create or update an LLM-first skill for: $ARGUMENTS
+Use the `skill-creator` skill if it is available. If the skill is not auto-loaded, read `skills/skill-creator/SKILL.md` and `docs/skill-style-guide.md` when present before editing.
+## Process
+1. Clarify the reusable behavior, target runtime, trigger phrases, and non-goals if they are not obvious.
+2. Inspect existing skills first; update an existing skill instead of creating a duplicate.
+3. Create or update `skills/{kebab-name}/SKILL.md` with valid one-line frontmatter description and concise runtime instructions.
+4. Put templates, schemas, or examples under `assets/`; put longer supporting docs under `references/`.
+5. If the skill is part of `gentle-pi`, update `scripts/verify-package-files.mjs`.
+6. Refresh the registry with `/skill-registry:refresh` when available, or tell the user to refresh/reload.
+## Report
+Return the files changed, the selected trigger phrases, any supporting files, and whether registry/package verification remains to run.

package/scripts/verify-package-files.mjs CHANGED Viewed

@@ -25,6 +25,7 @@ const requiredPaths = [
   "assets/support/sdd-status-contract.md",
   "assets/support/strict-tdd.md",
   "assets/support/strict-tdd-verify.md",
+  "docs/skill-style-guide.md",
   "extensions/gentle-ai.ts",
   "extensions/sdd-init.ts",
   "extensions/skill-registry.ts",
@@ -33,6 +34,7 @@ const requiredPaths = [
   "prompts/gis.md",
   "prompts/gpr.md",
   "prompts/gwr.md",
+  "prompts/skill-creation.md",
   "skills/branch-pr/SKILL.md",
   "skills/chained-pr/SKILL.md",
   "skills/cognitive-doc-design/SKILL.md",
@@ -41,6 +43,8 @@ const requiredPaths = [
   "skills/issue-creation/SKILL.md",
   "skills/judgment-day/SKILL.md",
   "skills/release/SKILL.md",
+  "skills/skill-creator/SKILL.md",
+  "skills/skill-improver/SKILL.md",
   "skills/skill-registry/SKILL.md",
   "skills/work-unit-commits/SKILL.md",
 ];

package/skills/judgment-day/SKILL.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 name: judgment-day
-description: "Trigger: judgment day, dual review, adversarial review, juzgar. Run blind dual review, fix confirmed issues, then re-judge."
+description: "Trigger: judgment day, judgement day, dual review, adversarial review, juzgar. Run blind dual review, fix confirmed issues, then re-judge."
 license: Apache-2.0
 metadata:
   author: gentleman-programming
@@ -9,7 +9,7 @@ metadata:
 ## Activation Contract
-Load this skill only when the user explicitly asks for Judgment Day, dual/adversarial review, or equivalent Spanish trigger (`juzgar`, `que lo juzguen`). Review a specific target: files, feature, PR, or architecture slice.
+Load this skill only when the user explicitly asks for Judgment Day, Judgement Day, dual/adversarial review, or equivalent Spanish trigger (`juzgar`, `que lo juzguen`). Review a specific target: files, feature, PR, or architecture slice.
 ## Hard Rules

package/skills/skill-creator/SKILL.md ADDED Viewed

@@ -0,0 +1,86 @@
+---
+name: skill-creator
+description: "Trigger: /skill-creation, skill creation, skill creator, create skill, new skill. Create LLM-first skills with valid frontmatter."
+license: Apache-2.0
+metadata:
+  author: gentleman-programming
+  version: "1.0"
+---
+## Activation Contract
+Use this skill when creating or updating a reusable AI skill for Pi or another agent runtime.
+Create a skill when:
+- a workflow or convention is reused across sessions;
+- generic agent behavior needs project-specific constraints;
+- a decision tree helps the agent choose safely;
+- examples, templates, or references would make future execution more reliable.
+Do not create a skill for a one-off task, generic documentation, or rules that belong in code/tests.
+## Hard Rules
+- Follow `docs/skill-style-guide.md` as the normative source for skill structure and style.
+- A skill is an LLM runtime contract, not human-facing docs.
+- Keep `SKILL.md` concise: target 180–450 tokens, max 1000.
+- Use imperative instructions and concrete gates; avoid tutorials and background prose.
+- Frontmatter `description` must be one physical YAML-safe line and include trigger words first.
+- Do not add a `Keywords` section; put essential trigger words in `description`.
+- Put templates, schemas, and generated examples in `assets/`.
+- Put longer rationale or local doc links in `references/`.
+- After changing project skills, refresh the registry with `/skill-registry:refresh` when available.
+## Decision Gates
+| Need | Action |
+| --- | --- |
+| Small reusable behavior | Create `skills/{skill-name}/SKILL.md` only |
+| Templates, schemas, fixtures | Add `skills/{skill-name}/assets/` |
+| Longer explanation or edge cases | Add `skills/{skill-name}/references/` |
+| Existing skill covers it | Update the existing skill instead |
+| Skill affects delegation discovery | Ensure trigger words appear in `description` |
+## Execution Steps
+1. Read `docs/skill-style-guide.md` before creating or updating skills.
+2. Inspect existing skills and confirm the new skill does not duplicate one.
+3. Choose a kebab-case skill name that matches the user-facing trigger.
+4. Create or update this structure:
+```text
+skills/{skill-name}/
+├── SKILL.md
+├── assets/       # optional
+└── references/   # optional
+```
+5. Use this frontmatter shape:
+```yaml
+---
+name: {skill-name}
+description: "Trigger: {phrases users or agents will say}. {What this skill does}."
+license: Apache-2.0
+metadata:
+  author: gentleman-programming
+  version: "1.0"
+---
+```
+6. Write sections in this order: Activation Contract, Hard Rules, Decision Gates, Execution Steps, Output Contract, References.
+7. If this is a packaged `gentle-pi` skill, add it to `scripts/verify-package-files.mjs`.
+8. Refresh or document the skill registry update path.
+## Output Contract
+Return:
+- Files created or modified.
+- Whether this created a new skill or updated an existing one.
+- Any supporting `assets/` or `references/` files added.
+- Whether package verification or skill registry refresh is needed.
+## References
+- `docs/skill-style-guide.md` — normative LLM-first skill style guide.
+- `skills/skill-registry/SKILL.md` — registry refresh and indexing contract.

package/skills/skill-improver/SKILL.md ADDED Viewed

@@ -0,0 +1,53 @@
+---
+name: skill-improver
+description: "Trigger: improve skills, audit skills, refactor skills, skill quality. Audit and upgrade existing LLM-first skills."
+license: Apache-2.0
+metadata:
+  author: gentleman-programming
+  version: "1.0"
+---
+## Activation Contract
+Use this skill when auditing, refactoring, normalizing, or improving existing `SKILL.md` files. Use `skill-creator` when creating a brand-new skill from a reusable pattern.
+## Hard Rules
+- Read `docs/skill-style-guide.md` first and treat it as the normative style contract.
+- Treat `SKILL.md` as the source of truth; preserve author intent, critical rules, activation semantics, and output requirements.
+- Use `.atl/skill-registry.md` as an index of skill names, triggers, scopes, and exact paths when available.
+- Default to audit-only. Modify files only when the user explicitly asks to apply improvements.
+- Never delete meaningful content silently; move long explanation, examples, templates, or schemas into local `references/` or `assets/`.
+- Do not invent triggers, policies, or domain rules. Mark ambiguous cases for human review.
+## Decision Gates
+| Situation | Action |
+| --- | --- |
+| Missing or invalid frontmatter | Fix `name`, quoted one-line `description`, `license`, and `metadata` |
+| Skill reads like tutorial docs | Convert to runtime instructions and move background to `references/` |
+| Body exceeds budget | Preserve rules, move examples/background to supporting files |
+| Branching logic hidden in prose | Convert to a compact decision table |
+| Rules conflict or intent is unclear | Report the issue; do not rewrite that rule automatically |
+## Execution Steps
+1. Read `docs/skill-style-guide.md`.
+2. Read `.atl/skill-registry.md`; use listed paths to select skills. If missing, scan known skill directories for `*/SKILL.md`.
+3. For each selected skill, audit metadata, trigger clarity, section order, body budget, actionability, decision gates, output contract, and local references.
+4. Return an audit report grouped by skill with severity and exact proposed changes.
+5. In apply mode, edit only safe issues, preserve content, create supporting files when needed, then refresh or request `/skill-registry:refresh`.
+## Output Contract
+Return:
+- Skills audited and paths used.
+- Issues found, grouped by severity.
+- Files changed, if apply mode was requested.
+- Registry refresh recommendation when skill metadata or paths changed.
+- Ambiguities that need human review.
+## References
+- `docs/skill-style-guide.md` — normative LLM-first skill style guide.
+- `skills/skill-registry/SKILL.md` — registry refresh and indexing contract.

package/tests/gentle-ai.test.ts CHANGED Viewed

@@ -34,3 +34,52 @@ test("agent discovery skips skills directories", async (t) => {
 		["reviewer", "worker"],
 	);
 });
+test("agent model discovery prioritizes SDD and Judgment Day agents", (t) => {
+	const root = mkdtempSync(join(tmpdir(), "gentle-pi-model-agents-"));
+	t.after(() => rmSync(root, { recursive: true, force: true }));
+	writeMarkdown(join(root, "zeta.md"), "name: zeta\n");
+	writeMarkdown(join(root, "jd-fix-agent.md"), "name: jd-fix-agent\n");
+	writeMarkdown(join(root, "sdd-apply.md"), "name: sdd-apply\n");
+	writeMarkdown(join(root, "alpha.md"), "name: alpha\n");
+	writeMarkdown(join(root, "jd-judge-b.md"), "name: jd-judge-b\n");
+	writeMarkdown(join(root, "sdd-init.md"), "name: sdd-init\n");
+	writeMarkdown(join(root, "jd-judge-a.md"), "name: jd-judge-a\n");
+	const discovered = __testing.listAgentsFromDir(root, "user");
+	const ordered = __testing.orderDiscoverableAgents(discovered);
+	assert.deepEqual(
+		ordered.map((agent) => agent.name),
+		[
+			"sdd-init",
+			"sdd-apply",
+			"jd-judge-a",
+			"jd-judge-b",
+			"jd-fix-agent",
+			"alpha",
+			"zeta",
+		],
+	);
+});
+test("discoverable model agents include installed Judgment Day agents", (t) => {
+	const root = mkdtempSync(join(tmpdir(), "gentle-pi-installed-agents-"));
+	const previousHome = process.env.GENTLE_PI_AGENT_HOME;
+	process.env.GENTLE_PI_AGENT_HOME = root;
+	t.after(() => {
+		if (previousHome === undefined) delete process.env.GENTLE_PI_AGENT_HOME;
+		else process.env.GENTLE_PI_AGENT_HOME = previousHome;
+		rmSync(root, { recursive: true, force: true });
+	});
+	writeMarkdown(join(root, "agents", "jd-judge-a.md"), "name: jd-judge-a\n");
+	writeMarkdown(join(root, "agents", "jd-judge-b.md"), "name: jd-judge-b\n");
+	writeMarkdown(join(root, "agents", "jd-fix-agent.md"), "name: jd-fix-agent\n");
+	const discovered = __testing.listDiscoverableAgents(root).map((agent) => agent.name);
+	assert.deepEqual(
+		discovered.filter((name) => name.startsWith("jd-")),
+		["jd-judge-a", "jd-judge-b", "jd-fix-agent"],
+	);
+});