npm - @ikunin/sprintpilot - Versions diffs - 2.2.31 → 2.3.1 - Mend

@ikunin/sprintpilot 2.2.31 → 2.3.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (47) hide show

package/_Sprintpilot/skills/sprintpilot-sprint-progress/workflow.md ADDED Viewed

@@ -0,0 +1,169 @@
+# Sprintpilot — Sprint Progress Check
+## Purpose
+Produce a concise health check of the current sprint's autopilot
+execution. Reads the structured progress snapshot, recent halts /
+verify failures, and step-level events; layers brief judgment on top
+to highlight what (if anything) needs attention.
+## Outputs
+- A ≤15-line human-readable summary printed to chat.
+- A single recommended next action (or "nothing to do — autopilot is
+  healthy / idle").
+No file writes. No state mutations.
+## Conventions
+- `<root>` = project root (where `_bmad-output/` lives).
+- All shell-outs use `node` (no global install assumed).
+- On any error (no plan, missing ledger, etc.), degrade gracefully —
+  print what you DO know and skip the parts you don't. The user gets
+  a partial answer rather than a halt.
+---
+## Step 1 — Collect the Structured Snapshot
+<action>Run the progress CLI in JSON mode:
+```
+node _Sprintpilot/bin/autopilot.js progress --project-root <root> --json
+```
+Parse the response. Key fields:
+- `plan_present` — false → project running in sprint-status order; the
+  rest of the analysis is naturally lighter.
+- `plan_id`
+- `current_story` / `current_step`
+- `sprint_progress` — `{ total, done, pending, skipped, excluded, source }`
+- `recent_events` — last 3 `story_step_*` ledger entries.
+Don't fail if the command exits non-zero (e.g., missing project root).
+Capture stderr and treat as "unknown progress" — proceed with Step 2
+which still produces useful output.</action>
+---
+## Step 2 — Pull Recent Halt / Verify Context
+<action>Read the tail of the ledger to identify any unresolved
+halts, verify rejections, or repeated step-failure loops. Inline node
+is the lightest path:
+```
+node -e "
+const l = require('./_Sprintpilot/lib/orchestrator/action-ledger.js');
+const entries = l.read({projectRoot: process.cwd()}, {limit: 40});
+const interesting = entries.filter(e =>
+  e.kind === 'halt' ||
+  e.kind === 'verify_rejected' ||
+  e.kind === 'plan_exhausted' ||
+  e.kind === 'plan_reorder_rejected' ||
+  e.kind === 'auto_derive_emitted' ||
+  e.kind === 'plan_migrated'
+);
+process.stdout.write(JSON.stringify(interesting));
+"
+```
+Look for:
+- **`halt` with reason in {`autopilot_lock_held`, `worktree_orphans_detected`,
+  `plan_exhausted`, `user_pause`, `user_replan_sprint`, `user_abort_sprint`}**
+  — autopilot is stopped and needs user attention.
+- **`verify_rejected` with `consecutive >= 3`** — autopilot is stuck in
+  a retry loop; the LLM may need to re-read the failing artifact.
+- **`plan_reorder_rejected`** — a recent reorder violated the DAG;
+  the user has unresolved input pending.
+- **Repeated `story_step_started` for the same story+phase without
+  matching `story_step_completed`** — phase entered but never finished;
+  could indicate a wedged session.
+Don't fail if the ledger is empty (greenfield project, never run).
+Just note "no execution history yet" and proceed.</action>
+---
+## Step 3 — Synthesize the Report
+<action>Render a single brief block to chat following this template
+(omit any line that doesn't apply):
+```
+Sprint progress
+  Plan:     <plan_id> — <done>/<total> done (<pending> pending,
+            <skipped> skipped, <excluded> excluded)
+  Bar:      [=====     ] <pct>%
+  Tracker:  <linked>/<total> stories linked to <provider> (<project_key>)   ← only when issue_tracker set
+  Current:  <story_key> [<issue_id>] (step: <phase>)   OR   "idle"
+            ↑ issue_id bracket only when set on this story
+  Recent:   <kind> <story> [<issue_id>] / <phase> (<elapsed>s ago)
+            <kind> <story> [<issue_id>] / <phase> (<elapsed>s ago)
+Health:    <one of: HEALTHY | STALLED | NEEDS-INPUT | EXHAUSTED | NO-PLAN>
+Reason:    <one short sentence>
+Suggest:   <one concrete next action OR "continue running">
+```
+The `autopilot progress --json` response carries the lookup data:
+- `current_issue_id` — the issue_id of the currently-running story (or null).
+- `issue_tracking` — `{provider, project_key, base_url, total, linked, coverage}`
+  when an issue_tracker is configured; null otherwise (omit the Tracker
+  line entirely when null — don't surface zeros as noise).
+- Each `recent_events[]` entry carries an `issue_id` field (or null).
+Always include the `[<issue_id>]` bracket when the field is non-null;
+omit it when null. Don't write "[no issue]" or similar — silence
+communicates "not tracked" cleanly.
+**Health classification:**
+| Signal | Health | Suggest |
+|---|---|---|
+| `plan_present=false` AND no halts in last 40 | NO-PLAN | "Continue in sprint-status order, or run /sprintpilot-plan-sprint to enable dependency-aware ordering." |
+| Most-recent halt is `plan_exhausted` | EXHAUSTED | "Run /sprintpilot-plan-sprint to add more stories, or `autopilot start --no-auto-plan` to continue in sprint-status order." |
+| Most-recent halt is `user_pause` | NEEDS-INPUT | "Resume with `autopilot start`." |
+| Most-recent halt is `user_replan_sprint` | NEEDS-INPUT | "Next `autopilot start` will invoke /sprintpilot-plan-sprint." |
+| `verify_rejected` with `consecutive >= 3` in last 5 entries | STALLED | "Inspect the failing artifact named in `verify_result.issues`; consider `user_input { kind: 'force_continue' }` only if you've manually resolved the issue." |
+| `plan_reorder_rejected` more recent than any subsequent reorder_queue | NEEDS-INPUT | "Reorder violations exist; revise the order to respect the DAG before sending another reorder_queue." |
+| No halts, current_story present, current_step is a valid phase | HEALTHY | "Continue running; nothing requires attention." |
+| No halts, no current_story, sprint_progress.pending > 0 | HEALTHY | "Run `autopilot start` to pick up the next pending story." |
+| No halts, no current_story, sprint_progress.pending == 0 | HEALTHY | "Sprint complete; consider running the bmad-retrospective skill if not already done." |
+Default to HEALTHY when classification is ambiguous — don't manufacture
+alarm.</action>
+<action>If the user invoked the skill with a story name argument
+(e.g., `/sprintpilot-sprint-progress 1-3-add-auth`), also call:
+```
+node _Sprintpilot/bin/autopilot.js progress --project-root <root> --story <story-key> --json
+```
+And append a one-block "Story detail" section showing that story's
+plan entry. When `issue_id` is non-null, prominently display it on its
+own line (it's the primary cross-reference back to the user's issue
+tracker):
+```
+Story: <story_key>
+  Issue:        <issue_id>       ← omit line when null
+  Epic:         <epic>
+  Plan status:  <plan_status>
+  BMad status:  <bmad_status>
+  Priority:     <priority>
+  Current step: <current_step>   ← omit when not running
+  Completed:    <completed_at>   ← omit when not done
+```
+Do not repeat the full sprint summary in this mode — just the focused
+story block.</action>
+---
+## Failure modes
+| Symptom | Recovery |
+|---|---|
+| `autopilot progress` exits non-zero (missing project root, etc.) | Capture stderr; print "Progress CLI unavailable: <stderr first line>"; still attempt Step 2. |
+| Ledger file missing | Print "No execution history yet — sprint hasn't started"; skip Step 2 analysis; suggest `autopilot start`. |
+| Plan file corrupt | Print "sprint-plan.yaml unreadable (run `node _Sprintpilot/scripts/sprint-plan.js read --project-root .` to inspect)"; do NOT auto-archive — that's user's call. |
+| Recent ledger has 0 entries | Note "Ledger is empty — autopilot hasn't run yet or was reset"; skip halt analysis. |

package/lib/commands/install.js CHANGED Viewed

@@ -748,7 +748,15 @@ async function evictV1Installation(projectRoot, { dryRun, migrateV1, yes }) {
 // Regex-based so we don't add a YAML parser dep for two scalar fields.
 // Unrecognized / unreadable files fall back to bundled defaults.
 async function readExistingAutopilotConfig(projectRoot, v1Snapshot) {
-  const out = { sessionStoryLimit: null, retrospectiveMode: null };
+  const out = {
+    sessionStoryLimit: null,
+    retrospectiveMode: null,
+    // v2.3.0 additions. null means "not set in user config" → use the bundled
+    // default. autoInferDependencies is read only to surface a deprecation
+    // notice on upgrade — we never write it back.
+    autoPlanOnStart: null,
+    autoInferDependencies: null,
+  };
   let raw = null;
   // Precedence order:
@@ -809,6 +817,22 @@ async function readExistingAutopilotConfig(projectRoot, v1Snapshot) {
   if (modeMatch && RETROSPECTIVE_MODES.includes(modeMatch[1])) {
     out.retrospectiveMode = modeMatch[1];
   }
+  // v2.3.0 — `auto_plan_on_start: true|false`. Bool; bundled default is false.
+  const planMatch = raw.match(
+    new RegExp(`^[ \\t]*auto_plan_on_start:[ \\t]*(true|false)${commentTail}`, 'm'),
+  );
+  if (planMatch) {
+    out.autoPlanOnStart = planMatch[1] === 'true';
+  }
+  // Legacy `auto_infer_dependencies: true|false` — read so the installer can
+  // surface a deprecation notice when the user is upgrading from v2.2.x with
+  // the flag set to true (it's now a no-op). Never written back.
+  const inferMatch = raw.match(
+    new RegExp(`^[ \\t]*auto_infer_dependencies:[ \\t]*(true|false)${commentTail}`, 'm'),
+  );
+  if (inferMatch) {
+    out.autoInferDependencies = inferMatch[1] === 'true';
+  }
   return out;
 }
@@ -859,7 +883,10 @@ function applyScalar(source, key, value) {
   return `${trimmed}  ${key}: ${value}\n`;
 }
-async function patchAutopilotConfig(projectRoot, { sessionStoryLimit, retrospectiveMode }) {
+async function patchAutopilotConfig(
+  projectRoot,
+  { sessionStoryLimit, retrospectiveMode, autoPlanOnStart },
+) {
   const file = path.join(
     projectRoot,
     PROJECT_ADDON_DIR_NAME,
@@ -871,6 +898,12 @@ async function patchAutopilotConfig(projectRoot, { sessionStoryLimit, retrospect
   const original = await fs.readFile(file, 'utf8');
   let updated = applyScalar(original, 'session_story_limit', sessionStoryLimit);
   updated = applyScalar(updated, 'retrospective_mode', retrospectiveMode);
+  // v2.3.0 — auto_plan_on_start is a boolean. applyScalar handles literal
+  // values (true/false) the same way as numbers; we just need to pass the
+  // unquoted lowercase string for booleans.
+  if (autoPlanOnStart !== undefined && autoPlanOnStart !== null) {
+    updated = applyScalar(updated, 'auto_plan_on_start', autoPlanOnStart ? 'true' : 'false');
+  }
   if (updated !== original) {
     await writeAtomic(file, updated);
   }
@@ -915,6 +948,46 @@ async function readExistingComplexityProfile(projectRoot, v1Snapshot) {
   return m[1];
 }
+// v2.3.0 — post-install hygiene check. Cross-reference the project's
+// _Sprintpilot/manifest.yaml `installed_skills` against what actually
+// landed under `_Sprintpilot/skills/<name>/SKILL.md`. Catches the
+// classic "added skill to manifest but forgot to ship the files" bug
+// at install time rather than at first invocation.
+//
+// Returns { missing: string[] } — empty array means everything is
+// wired correctly. The caller chooses how to surface mismatches
+// (warning vs fail). We never fail the install on a mismatch; it's
+// hygiene, not correctness — the skill just won't appear under the
+// host tool's /-command if it's missing on disk.
+async function verifySkillManifest(projectRoot) {
+  const manifestPath = path.join(projectRoot, PROJECT_ADDON_DIR_NAME, 'manifest.yaml');
+  if (!(await fs.pathExists(manifestPath))) {
+    return { missing: [] };
+  }
+  let raw;
+  try {
+    raw = await fs.readFile(manifestPath, 'utf8');
+  } catch {
+    return { missing: [] };
+  }
+  // Parse the YAML list under `installed_skills:` via regex — bullet
+  // lines starting with `-` at consistent indent. Cheap; no YAML dep.
+  const skillNames = [];
+  const installedMatch = raw.match(/^[ \t]*installed_skills:\s*\n((?:[ \t]+- [^\n]+\n?)+)/m);
+  if (installedMatch) {
+    for (const line of installedMatch[1].split(/\n/)) {
+      const m = line.match(/^[ \t]+-\s+([A-Za-z0-9._-]+)/);
+      if (m) skillNames.push(m[1]);
+    }
+  }
+  const missing = [];
+  for (const name of skillNames) {
+    const skillFile = path.join(projectRoot, PROJECT_ADDON_DIR_NAME, 'skills', name, 'SKILL.md');
+    if (!(await fs.pathExists(skillFile))) missing.push(name);
+  }
+  return { missing };
+}
 async function patchComplexityProfile(projectRoot, profile) {
   const file = path.join(
     projectRoot,
@@ -1008,25 +1081,39 @@ async function resolveAutopilotSettings({ projectRoot, yes, dryRun, v1Snapshot }
   const existing = await readExistingAutopilotConfig(projectRoot, v1Snapshot);
   const defaultLimit = existing.sessionStoryLimit ?? DEFAULT_SESSION_STORY_LIMIT;
   const defaultMode = existing.retrospectiveMode ?? DEFAULT_RETROSPECTIVE_MODE;
+  // v2.3.0 — opt-in default false; preserve existing user choice on upgrade.
+  const defaultAutoPlan = existing.autoPlanOnStart ?? false;
   if (yes) {
-    if (existing.sessionStoryLimit != null || existing.retrospectiveMode != null) {
+    if (
+      existing.sessionStoryLimit != null ||
+      existing.retrospectiveMode != null ||
+      existing.autoPlanOnStart != null
+    ) {
       console.log(
         pc.dim(
-          `Preserving autopilot config: session_story_limit=${defaultLimit}, retrospective_mode=${defaultMode}`,
+          `Preserving autopilot config: session_story_limit=${defaultLimit}, retrospective_mode=${defaultMode}, auto_plan_on_start=${defaultAutoPlan}`,
         ),
       );
     }
-    return { sessionStoryLimit: defaultLimit, retrospectiveMode: defaultMode };
+    return {
+      sessionStoryLimit: defaultLimit,
+      retrospectiveMode: defaultMode,
+      autoPlanOnStart: defaultAutoPlan,
+    };
   }
   if (dryRun) {
     console.log(
       pc.dim(
-        `[DRY RUN] Would prompt for autopilot config (current: session_story_limit=${defaultLimit}, retrospective_mode=${defaultMode})`,
+        `[DRY RUN] Would prompt for autopilot config (current: session_story_limit=${defaultLimit}, retrospective_mode=${defaultMode}, auto_plan_on_start=${defaultAutoPlan})`,
       ),
     );
-    return { sessionStoryLimit: defaultLimit, retrospectiveMode: defaultMode };
+    return {
+      sessionStoryLimit: defaultLimit,
+      retrospectiveMode: defaultMode,
+      autoPlanOnStart: defaultAutoPlan,
+    };
   }
   const limitRaw = await prompts.text({
@@ -1065,7 +1152,17 @@ async function resolveAutopilotSettings({ projectRoot, yes, dryRun, v1Snapshot }
     initialValue: defaultMode,
   });
-  return { sessionStoryLimit, retrospectiveMode };
+  // v2.3.0 — single yes/no prompt for the new plan workflow. Default false:
+  // `autopilot start` runs in sprint-status order until the user explicitly
+  // invokes /sprintpilot-plan-sprint, which is always available regardless.
+  // Set this true to auto-trigger the planning skill on greenfield projects.
+  const autoPlanOnStart = await prompts.confirm({
+    message:
+      'Auto-build a sprint plan on first `autopilot start`? (v2.3.0; runs /sprintpilot-plan-sprint to infer dependencies. You can always invoke the skill manually regardless of this setting.)',
+    initialValue: defaultAutoPlan,
+  });
+  return { sessionStoryLimit, retrospectiveMode, autoPlanOnStart };
 }
 async function runInteractiveToolPicker(detected) {
@@ -1194,7 +1291,7 @@ async function runInstall(options = {}) {
   // runtime copy — they're NOT threaded through `renderString`, because
   // workflow.md's `{{session_story_limit}}` / `{{retrospective_mode}}`
   // variable references would collide with single-brace token matching.
-  const { sessionStoryLimit, retrospectiveMode } = await resolveAutopilotSettings({
+  const { sessionStoryLimit, retrospectiveMode, autoPlanOnStart } = await resolveAutopilotSettings({
     projectRoot,
     yes,
     dryRun,
@@ -1439,7 +1536,11 @@ async function runInstall(options = {}) {
     //     wrote the bundled default config) AND after the v1 snapshot
     //     reapply (which might have restored an older config.yaml without
     //     `retrospective_mode`). The user's prompted values always win.
-    await patchAutopilotConfig(projectRoot, { sessionStoryLimit, retrospectiveMode });
+    await patchAutopilotConfig(projectRoot, {
+      sessionStoryLimit,
+      retrospectiveMode,
+      autoPlanOnStart,
+    });
     // 6c. Persist the complexity_profile. Separate from patchAutopilotConfig
     //     so the existing upgrade test coverage (readExistingAutopilotConfig /
@@ -1501,6 +1602,56 @@ async function runInstall(options = {}) {
   console.log(
     `Total skills installed: ${totalInstalled} (${skillCount} skills x ${selectedTools.length} tools)`,
   );
+  // v2.3.0 — post-install hygiene: warn if any skill in manifest.yaml
+  // doesn't have a SKILL.md on disk. Non-blocking; surfaces packaging
+  // bugs without failing the install.
+  try {
+    const verify = await verifySkillManifest(projectRoot);
+    if (verify.missing.length > 0) {
+      console.log('');
+      console.log(
+        pc.yellow(`  WARN: manifest references skills missing from disk: ${verify.missing.join(', ')}`),
+      );
+      console.log(pc.yellow('  These won\'t appear under your host tool\'s / menu until the SKILL.md files are present.'));
+    }
+  } catch {
+    // Self-check failure is non-fatal — never block install on hygiene.
+  }
+  // v2.3.0 upgrade notes — surfaced only when the relevant signals are
+  // actually present. Greenfield installs see nothing; upgraders from
+  // v2.2.x see migration + deprecation notices.
+  const v23Notes = [];
+  const legacyDepsPath = path.join(projectRoot, '_Sprintpilot', 'sprints', 'dependencies.yaml');
+  if (await fs.pathExists(legacyDepsPath)) {
+    v23Notes.push(
+      'Legacy file detected: _Sprintpilot/sprints/dependencies.yaml',
+      '  Auto-migrated to sprint-plan.yaml on the first `autopilot start`.',
+      '  Run now: node _Sprintpilot/scripts/infer-dependencies.js migrate',
+    );
+  }
+  // Re-read so we can show the deprecation notice without threading state
+  // through every helper. Cheap (one regex scan); only happens once per install.
+  try {
+    const existingForNotes = await readExistingAutopilotConfig(projectRoot, v1ConfigSnapshot);
+    if (existingForNotes.autoInferDependencies === true) {
+      if (v23Notes.length > 0) v23Notes.push('');
+      v23Notes.push(
+        'Deprecated: autopilot.auto_infer_dependencies = true in your config.',
+        '  This flag is a no-op in v2.3.0 — superseded by auto_plan_on_start (default false).',
+        '  Safe to remove from config.yaml; the new /sprintpilot-plan-sprint workflow',
+        '  handles inference manually or on opt-in auto-trigger.',
+      );
+    }
+  } catch {
+    // Config re-read failure is non-fatal — skip the deprecation notice.
+  }
+  if (v23Notes.length > 0) {
+    console.log('');
+    console.log(pc.cyan('v2.3.0 upgrade notes:'));
+    for (const line of v23Notes) console.log('  ' + line);
+  }
   console.log('');
   console.log('Skills:');
   for (const skill of allSkills) console.log(`  - ${skill}`);
@@ -1513,6 +1664,13 @@ async function runInstall(options = {}) {
   console.log('  /sprint-autopilot-off  Disengage and show status');
   console.log('  /bmad-help             Orientation and next-step guidance (from BMad Method)');
   console.log('');
+  console.log('First steps for a new sprint:');
+  console.log('  1. BMad sprint planning:        /bmad-sprint-planning');
+  console.log('  2. (optional) Sprint plan:      /sprintpilot-plan-sprint');
+  console.log('  3. (optional) Inspect DAG:      /sprintpilot-dependency-graph mermaid');
+  console.log('  4. Start autopilot:             /sprint-autopilot-on');
+  console.log('  5. Check live progress:         /sprintpilot-sprint-progress');
+  console.log('');
   console.log('Configuration (edit these files to customize behavior):');
   console.log('');
   console.log('  _Sprintpilot/modules/git/config.yaml');
@@ -1542,15 +1700,30 @@ async function runInstall(options = {}) {
   console.log(
     `    ${apKey('autopilot.retrospective_mode')}${apVal(retrospectiveMode)} Epic-end retrospective: auto (inline) | stop (pause) | skip (not recommended)`,
   );
+  console.log(
+    `    ${apKey('autopilot.auto_plan_on_start')}${apVal(String(autoPlanOnStart))} Auto-build sprint plan on first start (v2.3.0; default off)`,
+  );
+  console.log('');
+  console.log('Sprint planning + progress (v2.3.0):');
+  console.log('  /sprintpilot-plan-sprint       Build dependency-aware sprint plan');
+  console.log('  /sprintpilot-sprint-progress   Concise health-check of autopilot execution');
+  console.log('  /sprintpilot-dependency-graph  Render DAG (mermaid / graphviz / text / layers / json)');
+  console.log('');
+  console.log('CLI utilities:');
+  console.log('  autopilot progress             Live status (--json / --story <key>)');
+  console.log('  autopilot start --no-auto-plan Skip auto-planning for one session');
   console.log('');
   console.log('Multi-agent skills — run parallel subagents for faster analysis:');
-  console.log('  /sprintpilot-code-review       Parallel 3-layer adversarial review');
   console.log('  /sprintpilot-codebase-map      5-stream brownfield codebase analysis');
   console.log('  /sprintpilot-assess            Tech debt and dependency audit');
   console.log('  /sprintpilot-reverse-architect Extract architecture from existing code');
   console.log('  /sprintpilot-migrate           Legacy migration planning');
   console.log('  /sprintpilot-research          Parallel web research');
-  console.log('  /sprintpilot-party-mode        Multi-persona agent discussions');
+  console.log('');
+  console.log('Documentation:');
+  console.log('  Sprint planning walkthrough:   docs/USAGE.md');
+  console.log('  Configuration reference:       docs/CONFIGURATION.md');
+  console.log('  Architecture deep-dive:        docs/ARCHITECTURE.md');
   const latestVersion = await latestVersionPromise;
   if (latestVersion && addonVersion && compareVersions(addonVersion, latestVersion) === 'behind') {
@@ -1586,5 +1759,6 @@ module.exports = {
     KEY_RENAMES,
     snapshotUserOwnedFiles,
     applyUserOwnedFiles,
+    verifySkillManifest,
   },
 };

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@ikunin/sprintpilot",
-  "version": "2.2.31",
+  "version": "2.3.1",
   "description": "Sprintpilot — autopilot and multi-agent addon for BMad Method v6: git workflow, parallel agents, autonomous story execution",
   "license": "Apache-2.0",
   "repository": {

package/_Sprintpilot/skills/sprintpilot-code-review/SKILL.md DELETED Viewed

@@ -1,6 +0,0 @@
----
-name: sprintpilot-code-review
-description: 'Parallel 3-layer code review via subagents. Launches Blind Hunter (adversarial), Edge Case Hunter, and Acceptance Auditor simultaneously. Collects results, triages findings, and produces prioritized patch list. Use instead of stock bmad-code-review for deeper, faster reviews.'
----
-Follow the instructions in ./workflow.md.

package/_Sprintpilot/skills/sprintpilot-code-review/agents/acceptance-auditor.md DELETED Viewed

@@ -1,51 +0,0 @@
-# Acceptance Auditor — Code Review Agent
-You are a QA auditor verifying that the implementation satisfies the story's acceptance criteria. You have the diff, the story file, and project access.
-## Rules
-- Every acceptance criterion (AC) must be explicitly verified against the code.
-- If an AC is NOT covered by the implementation, flag it as MISSING.
-- If an AC is partially covered, flag what's missing.
-- If the implementation does something NOT in the ACs, note it as EXTRA (not necessarily bad, but worth flagging).
-- Cap your response at 2000 tokens.
-## What to Check
-For each acceptance criterion in the story:
-1. **Implemented?** — Is there code that addresses this criterion?
-2. **Tested?** — Is there a test that verifies this criterion?
-3. **Correct?** — Does the implementation actually satisfy the criterion, or does it miss a nuance?
-Also check:
-4. **Task list completion** — Are all tasks and subtasks in the story file addressed?
-5. **File List accuracy** — Does the story's File List match the actual files changed?
-6. **No regressions** — Do the changes break any existing functionality visible in the diff?
-## Output Format
-```
-## AC Verification
-| AC | Status | Evidence | Notes |
-|----|--------|----------|-------|
-| AC-1: <text> | PASS/FAIL/PARTIAL | file:line | ... |
-| AC-2: <text> | PASS/FAIL/PARTIAL | file:line | ... |
-## Issues Found
-1. [SEVERITY] AC-N not satisfied — file:line
-   What's missing: ...
-   Suggested fix: ...
-2. ...
-## Extra (not in ACs)
-- <description of extra behavior>
-```
-If all ACs pass, say "All acceptance criteria verified" with the evidence table.
-## Story and Diff
-The story file content and diff follow below. Review them now.

package/_Sprintpilot/skills/sprintpilot-code-review/agents/blind-hunter.md DELETED Viewed

@@ -1,39 +0,0 @@
-# Blind Hunter — Adversarial Code Review Agent
-You are a ruthless code reviewer. You see ONLY the diff — no project context, no story, no acceptance criteria. Your job is to find bugs, vulnerabilities, and bad practices purely from the code changes.
-## Rules
-- You have NO project context. Do not ask for it. Review only what you see.
-- Be specific: cite exact file paths and line numbers.
-- Focus on things that will break in production, not style preferences.
-- Cap your response at 2000 tokens. Be concise.
-## What to Look For
-1. **Bugs**: null/undefined access, off-by-one, race conditions, resource leaks, incorrect logic
-2. **Security**: injection (SQL, XSS, command), auth bypass, exposed secrets, insecure defaults
-3. **Error handling**: swallowed exceptions, missing error paths, unchecked return values
-4. **Performance**: O(n²) in hot paths, unbounded allocations, missing pagination, N+1 queries
-5. **Type safety**: unchecked casts, any/unknown abuse, missing validation at boundaries
-## Output Format
-Return findings as a numbered list:
-```
-1. [SEVERITY] file:line — Title
-   Description of the issue.
-   Suggested fix: ...
-2. [SEVERITY] file:line — Title
-   ...
-```
-Severity: CRITICAL, HIGH, MEDIUM, LOW
-If the diff looks clean, say "No issues found" — do not manufacture findings.
-## Diff to Review
-The diff follows below. Review it now.

package/_Sprintpilot/skills/sprintpilot-code-review/agents/edge-case-hunter.md DELETED Viewed

@@ -1,46 +0,0 @@
-# Edge Case Hunter — Code Review Agent
-You are a methodical edge case analyst. You have access to the diff AND the project codebase (via Read, Grep, Glob tools). Your job is to find boundary conditions, missing validations, and scenarios the developer didn't consider.
-## Rules
-- Use Read/Grep/Glob to understand how changed code interacts with the rest of the codebase.
-- Think about inputs at the extremes: empty, null, max length, unicode, concurrent access, negative numbers.
-- Focus on cases that the tests probably DON'T cover.
-- Cap your response at 2000 tokens. Be concise.
-## What to Look For
-1. **Boundary conditions**: empty arrays, zero-length strings, max int, negative values
-2. **Missing validation**: user input not sanitized, API responses not checked, file paths not validated
-3. **State issues**: stale state after error, partial updates without rollback, cache invalidation gaps
-4. **Concurrency**: shared mutable state, missing locks, TOCTOU races
-5. **Integration boundaries**: API contract mismatches, schema drift, timezone handling, encoding issues
-6. **Error propagation**: errors swallowed at boundaries, misleading error messages, partial failure states
-## Method
-For each changed file in the diff:
-1. Read the full file (not just the diff) to understand context
-2. Grep for callers of changed functions to assess blast radius
-3. Think: "What input would make this fail?"
-4. Think: "What happens if the thing this calls fails?"
-## Output Format
-```
-1. [SEVERITY] file:line — Edge Case Title
-   Scenario: When <condition>, then <what goes wrong>
-   Impact: <what breaks>
-   Suggested fix: ...
-2. ...
-```
-Severity: CRITICAL, HIGH, MEDIUM, LOW
-If no edge cases found, say "No edge cases identified" — do not manufacture findings.
-## Diff to Review
-The diff follows below. Review it now, then explore the codebase as needed.