npm - @torka/claude-workflows - Versions diffs - 0.12.0 → 0.13.2 - Mend

@torka/claude-workflows 0.12.0 → 0.13.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (39) hide show

package/.claude-plugin/plugin.json +8 -0
package/README.md +52 -5
package/bmad-workflows/bmm/workflows/4-implementation/implement-epic-with-subagents/steps/step-01b-continue.md +9 -2
package/bmad-workflows/bmm/workflows/4-implementation/implement-epic-with-subagents/steps/step-02-orchestrate.md +108 -2
package/bmad-workflows/bmm/workflows/4-implementation/implement-epic-with-subagents/steps/step-03-complete.md +35 -1
package/commands/deep-audit.md +530 -0
package/commands/dev-story-backend.md +12 -11
package/commands/dev-story-fullstack.md +6 -2
package/commands/dev-story-ui.md +4 -4
package/commands/github-pr-resolve.md +132 -24
package/commands/plan-parallelization.md +20 -3
package/examples/settings.local.example.json +17 -0
package/install.js +14 -3
package/package.json +1 -1
package/skills/deep-audit/INSPIRATIONS.md +26 -0
package/skills/deep-audit/SKILL.md +313 -0
package/skills/deep-audit/agents/api-contract-reviewer.md +42 -0
package/skills/deep-audit/agents/architecture-and-complexity.md +59 -0
package/skills/deep-audit/agents/code-health.md +55 -0
package/skills/deep-audit/agents/data-layer-reviewer.md +50 -0
package/skills/deep-audit/agents/documentation-health.md +44 -0
package/skills/deep-audit/agents/performance-profiler.md +42 -0
package/skills/deep-audit/agents/refactoring-planner.md +161 -0
package/skills/deep-audit/agents/security-and-error-handling.md +56 -0
package/skills/deep-audit/agents/seo-accessibility-auditor.md +53 -0
package/skills/deep-audit/agents/test-strategy-analyst.md +61 -0
package/skills/deep-audit/agents/type-design-analyzer.md +49 -0
package/skills/deep-audit/templates/report-template.md +152 -0
package/skills/designer-founder/SKILL.md +8 -7
package/skills/designer-founder/steps/step-01-context.md +94 -45
package/skills/designer-founder/steps/step-02-scope.md +6 -23
package/skills/designer-founder/steps/step-03-design.md +29 -58
package/skills/designer-founder/steps/step-04-artifacts.md +137 -113
package/skills/designer-founder/steps/step-05-epic-linking.md +81 -53
package/skills/designer-founder/steps/step-06-validate.md +181 -0
package/skills/designer-founder/templates/component-strategy.md +4 -0
package/skills/designer-founder/tools/magicpatterns.md +52 -19
package/skills/designer-founder/tools/stitch.md +97 -67
package/uninstall.js +24 -0

package/commands/deep-audit.md ADDED Viewed

@@ -0,0 +1,530 @@
+---
+description: 'Multi-agent codebase audit across security, architecture, error handling, and more'
+---
+# /deep-audit
+Comprehensive multi-agent codebase audit. Spawns parallel review agents across multiple dimensions and produces a consolidated report.
+**Usage:**
+```
+/deep-audit                              # Quick mode + auto refactoring plan
+/deep-audit --full                       # Full mode + auto refactoring plan
+/deep-audit --review-before-plan         # Pause after findings, ask before plan
+/deep-audit --pr 42                      # Audit a specific PR diff
+/deep-audit --since abc123f              # Audit changes since a commit hash
+/deep-audit --since 2025-01-15           # Audit changes since a date
+/deep-audit --full --pr 42               # Full mode on a specific PR
+/deep-audit --agent security-and-error-handling     # Run only one agent
+/deep-audit --agent performance-profiler --pr 42    # Single agent on a PR
+```
+<workflow CRITICAL="TRUE">
+IT IS CRITICAL THAT YOU FOLLOW THIS WORKFLOW EXACTLY.
+## Phase 0: Argument Parsing
+Parse `$ARGUMENTS` to determine:
+1. **Mode**: Check for `--full` flag
+   - If `--full` present → `mode = "full"` (10 agents)
+   - Otherwise → `mode = "quick"` (3 agents)
+2. **Single Agent**: Check for `--agent <name>` flag
+   - If `--agent <name>` present → `single_agent = "<name>"`
+   - Otherwise → `single_agent = null`
+   - If both `--full` and `--agent` are present: warn that `--full` is ignored in single-agent mode, set `mode = "single"`, ignore `--full`
+   - If `--agent` is present without `--full`: set `mode = "single"`
+3. **Review Before Plan**: Check for `--review-before-plan` flag
+   - If `--review-before-plan` present → `review_before_plan = true`
+   - Otherwise → `review_before_plan = false`
+4. **Scope**: Check for scope flags (mutually exclusive)
+   - `--pr <number>` → `scope = "pr"`, `scope_value = <number>`
+   - `--since <value>` → detect format:
+     - If matches date pattern (YYYY-MM-DD) → `scope = "since-date"`, `scope_value = <date>`
+     - Otherwise → `scope = "since-commit"`, `scope_value = <hash>`
+   - No scope flag → `scope = "full-project"`
+If conflicting scope flags are provided, warn the user and use the first one.
+---
+## Phase 1: Scope Resolution & Resume Detection
+### Step 1: Record current state
+Run these commands to capture the current project context:
+- `git rev-parse HEAD` → `current_commit`
+- `git rev-parse --show-toplevel` → `project_root`
+- Detect project stack by scanning for key files:
+  - `package.json` → Node.js/JavaScript
+  - `tsconfig.json` → TypeScript
+  - `next.config.*` → Next.js
+  - `requirements.txt` / `pyproject.toml` → Python
+  - `go.mod` → Go
+  - `Cargo.toml` → Rust
+  - `pom.xml` / `build.gradle` → Java
+  - Store as `detected_stack` (comma-separated list)
+### Step 2: Check for resume
+Check if `_bmad-output/deep-audit/state.json` exists:
+- **If exists AND `status` != "completed"**:
+  - Read `state.json` → `previous_state`
+  - If `previous_state.start_commit` == `current_commit`:
+    - **AUTO-RESUME**: Set `resume = true`
+    - Identify which agents are still pending from `previous_state.agents`
+    - Print: `Resuming interrupted audit (same commit). X of Y agents remaining.`
+  - If commits differ:
+    - **FRESH START**: Set `resume = false`
+    - Print: `Previous audit was on a different commit. Starting fresh.`
+- **If no state.json OR status == "completed"**:
+  - **FRESH START**: Set `resume = false`
+### Step 3: Build scope context
+Generate the `scope_context` string that will be injected into every agent prompt:
+**For `full-project`:**
+```
+SCOPE: Full project audit
+PROJECT ROOT: <project_root>
+STACK: <detected_stack>
+INSTRUCTIONS: Review the entire codebase. Focus on source files, configuration, and infrastructure. Skip node_modules, dist, build, .git, and vendor directories.
+```
+**For `pr`:**
+- Run: `gh pr diff <number> --name-only` → file list
+- Run: `gh pr diff <number>` → full diff
+- Build scope_context with the diff and file list:
+```
+SCOPE: PR #<number> audit
+PROJECT ROOT: <project_root>
+STACK: <detected_stack>
+CHANGED FILES:
+<file list>
+DIFF:
+<full diff>
+INSTRUCTIONS: Focus your review on the changed files and their immediate dependencies. For architectural review, also consider how changes fit into the broader codebase.
+```
+**For `since-commit` or `since-date`:**
+- For commit: `git diff <hash>...HEAD --name-only` and `git diff <hash>...HEAD`
+- For date: `git log --since="<date>" --format="%H" | tail -1` → oldest_hash, then `git diff <oldest_hash>...HEAD --name-only` and `git diff <oldest_hash>...HEAD`
+- Build scope_context similarly to PR scope.
+Store `scope_context` for injection into agent prompts.
+---
+## Phase 2: Load Agent Definitions
+Read the SKILL.md file at the path relative to this command:
+```
+skills/deep-audit/SKILL.md
+```
+From SKILL.md, build the complete agent roster (used for both mode selection and `--agent` validation):
+**Quick mode agents:**
+```
+quick_agents = [
+  { file: "security-and-error-handling.md", model: "opus",   dimensions: ["Security", "Error Handling"] },
+  { file: "architecture-and-complexity.md", model: "opus",   dimensions: ["Architecture", "Simplification"] },
+  { file: "code-health.md",                model: "sonnet", dimensions: ["AI Slop Detection", "Dependency Health"] }
+]
+```
+**Full mode additional agents:**
+```
+full_agents = [
+  { file: "performance-profiler.md",     model: "sonnet", dimensions: ["Performance"] },
+  { file: "test-strategy-analyst.md",    model: "opus",   dimensions: ["Test Coverage", "Test Efficiency"] },
+  { file: "type-design-analyzer.md",     model: "sonnet", dimensions: ["Type Design"] },
+  { file: "data-layer-reviewer.md",      model: "opus",   dimensions: ["Data Layer & Database"] },
+  { file: "api-contract-reviewer.md",    model: "sonnet", dimensions: ["API Contracts & Interface Consistency"] },
+  { file: "seo-accessibility-auditor.md", model: "sonnet", dimensions: ["SEO & Accessibility"] },
+  { file: "documentation-health.md",    model: "sonnet", dimensions: ["Documentation Health"] }
+]
+```
+**Build the active agent list based on mode:**
+- **If `mode = "single"`**: Search both `quick_agents` and `full_agents` for an agent whose filename (without `.md`) matches `single_agent`. If found → `agents = [matched_agent]`. If NOT found → print the error below and **STOP** (do not proceed):
+  ```
+  Unknown agent: "<single_agent>"
+  Available agents:
+    Quick mode: security-and-error-handling, architecture-and-complexity, code-health
+    Full mode:  performance-profiler, test-strategy-analyst, type-design-analyzer,
+                data-layer-reviewer, api-contract-reviewer, seo-accessibility-auditor,
+                documentation-health
+  ```
+- **If `mode = "quick"`**: `agents = quick_agents`
+- **If `mode = "full"`**: `agents = quick_agents + full_agents`
+**Refactoring planner** — added when mode is NOT "single" (runs separately in Phase 6, NOT in Phase 4):
+```
+planner_agent = { file: "refactoring-planner.md", model: "opus", dimensions: ["Refactoring"] }
+```
+Include this agent in `state.agents` for tracking when mode is not "single", but do NOT spawn it in Phase 4.
+If resuming: filter `agents` to only those with status "pending" in `previous_state.agents`.
+---
+## Phase 3: Initialize State
+Create `_bmad-output/deep-audit/` directory if it doesn't exist.
+Write `_bmad-output/deep-audit/state.json`:
+```json
+{
+  "status": "in_progress",
+  "mode": "<quick|full|single>",
+  "scope": "<scope type>",
+  "scope_value": "<scope value or null>",
+  "review_before_plan": false,
+  "start_commit": "<current_commit>",
+  "start_time": "<ISO timestamp>",
+  "detected_stack": "<detected_stack>",
+  "agents": {
+    "<agent-file-name>": {
+      "status": "pending",
+      "model": "<model>",
+      "dimensions": ["<dim1>", "<dim2>"],
+      "started_at": null,
+      "completed_at": null,
+      "findings_count": 0,
+      "raw_output": null
+    }
+  },
+  "findings": [],
+  "refactoring_plan": null,
+  "report_path": null
+}
+```
+Note: The `agents` object includes the `refactoring-planner` entry with `status: "pending"`. It is tracked like all agents but spawned in Phase 6, not Phase 4.
+If resuming, merge pending agent statuses into the existing state (keep completed agents' data intact).
+Print status:
+```
+Deep Audit — <mode> mode
+Scope: <scope description>
+Stack: <detected_stack>
+Agent(s): <count> (<list of agent names>)
+Commit: <short hash>
+```
+---
+## Phase 4: Spawn Agents (Batched Parallel)
+Spawn agents in batches of up to 5 at a time using the Task tool.
+For each agent in the current batch:
+1. Read the agent prompt file from `skills/deep-audit/agents/<agent-file>`
+2. Construct the full prompt by combining:
+   - The agent prompt file content
+   - The scope_context from Phase 1
+   - A reminder to follow SKILL.md output format exactly
+3. Spawn via Task tool:
+   ```
+   Tool: Task
+   subagent_type: general-purpose
+   model: <agent.model>
+   description: "deep-audit: <agent-name>"
+   prompt: |
+     <agent prompt content>
+     ---
+     ## Scope Context (injected by orchestrator)
+     <scope_context>
+     ---
+     ## Output Format Reminder
+     You MUST produce output using the exact format defined above:
+     - === FINDING === blocks for each finding (confidence >= 80 only)
+     - === DIMENSION SUMMARY === blocks for each dimension you cover
+     Produce NO other output besides these blocks.
+   ```
+4. After each batch completes, for each agent response:
+   - Parse `=== FINDING ===` blocks: extract agent, severity, confidence, file, line, dimension, title, description, suggestion
+   - Parse `=== DIMENSION SUMMARY ===` blocks: extract dimension, score, p1_count, p2_count, p3_count, assessment
+   - Store parsed findings in state.json `findings` array
+   - Store raw output in agent's `raw_output` field
+   - Update agent status to "completed" with timestamp and findings_count
+   - Write updated state.json to disk
+5. Print progress after each batch:
+   ```
+   Batch N complete: <agents in batch>
+   Findings so far: X P1, Y P2, Z P3
+   Remaining agents: <count>
+   ```
+If any agent fails (Task tool returns error):
+- Log the error in the agent's state
+- Set agent status to "failed"
+- Continue with remaining agents (do not abort the audit)
+- Report failed agents in the final summary
+---
+## Phase 5: Deduplicate Findings
+After all agents complete:
+1. Sort all findings by severity (P1 → P2 → P3), then by file path, then by line number
+2. Deduplicate: merge findings that share ALL of these properties:
+   - Same file (exact path match)
+   - Same or overlapping line range (within 5 lines)
+   - Similar title (>70% word overlap)
+   - If merged, keep the higher severity and higher confidence, combine descriptions
+3. Assign sequential IDs: F-001, F-002, F-003, ...
+4. Store deduplicated findings back in state.json
+Print dedup results:
+```
+Deduplication: <original count> findings → <deduped count> findings (<removed count> duplicates merged)
+```
+---
+## Phase 6: Refactoring Planner
+Skip this phase if `mode = "single"` (single-agent audits don't warrant cross-cutting refactoring plans). There is no planner agent in state.json to update.
+Also skip this phase if the deduplicated findings count is 0. Set the planner agent status to "skipped" in state.json and continue.
+### Step 1: Confirm (if --review-before-plan)
+If `review_before_plan` is true:
+1. Print a findings summary to the user:
+   ```
+   FINDINGS SUMMARY: X total (Y critical, Z important, W minor)
+   Top findings:
+   1. F-001: <title> (P1)
+   2. F-002: <title> (P1)
+   3. F-003: <title> (P2)
+   ```
+2. Ask the user: **"Generate refactoring plan from these findings? (Y/n)"**
+3. If the user says no → set planner agent status to "skipped" in state.json, skip to Phase 7.
+If `review_before_plan` is false, proceed directly to Step 2.
+### Step 2: Generate plan
+1. Serialize all deduplicated findings from Phase 5 into a single text block using the `=== FINDING ===` format. Include the assigned `id` field (F-001, etc.) so the planner can reference them.
+2. Read the agent prompt from `skills/deep-audit/agents/refactoring-planner.md`
+3. Spawn via Task tool (same pattern as Phase 4):
+   ```
+   Tool: Task
+   subagent_type: general-purpose
+   model: opus
+   description: "deep-audit: refactoring-planner"
+   prompt: |
+     <agent prompt content>
+     ---
+     ## Input Findings (injected by orchestrator)
+     <serialized findings payload>
+     ---
+     ## Output Format Reminder
+     You MUST produce output using the exact format defined above:
+     - === THEME === blocks for each refactoring theme
+     - Exactly one === EXECUTION ORDER === block at the end
+     Produce NO other output besides these blocks.
+   ```
+4. Parse the response:
+   - Extract all `=== THEME ===` blocks: id, name, effort, risk, finding_ids, dependencies, coverage_gate, blast_radius, warnings, phase, summary, steps, files, tests_before, tests_after
+   - Extract the single `=== EXECUTION ORDER ===` block: phase_1 through phase_4, quick_wins, total_effort, summary
+5. Store parsed data in `state.refactoring_plan`:
+   ```json
+   {
+     "themes": [ ...parsed theme objects... ],
+     "execution_order": { ...parsed execution order... }
+   }
+   ```
+6. Update planner agent status in `state.agents` to "completed" with timestamps and raw_output.
+7. Write updated state.json to disk.
+If the planner agent fails (Task tool returns error):
+- Set planner agent status to "failed" in state.json
+- Log a warning but continue to Phase 7 (the report generates without the roadmap section)
+Print progress:
+```
+Refactoring Planner: <theme_count> themes (<quick_win_count> quick wins), total effort: <total_effort>
+```
+---
+## Phase 7: Generate Report
+Read the report template from:
+```
+skills/deep-audit/templates/report-template.md
+```
+Fill in the template:
+1. **Header fields**: date, mode, scope description, agent count, detected stack, duration (now - start_time), commit hash
+2. **Scorecard**: Build from DIMENSION SUMMARY data
+   - For each dimension, fill in: score, P1/P2/P3 counts, assessment
+   - Calculate overall health score using the weighted formula from SKILL.md:
+     - `weighted_sum = sum(score * weight for each dimension)`
+     - `total_weight = sum(weights for audited dimensions)`
+     - `overall_score = weighted_sum / total_weight` (round to 1 decimal)
+   - Map overall score to label: 9-10 "Excellent", 7-8 "Good", 5-6 "Adequate", 3-4 "Concerning", 1-2 "Critical"
+3. **Findings sections**: Group deduplicated findings by severity
+   - For each finding, render:
+     ```
+     #### F-NNN: <title> (<severity>)
+     | | |
+     |---|---|
+     | **File** | `<file>:<line>` |
+     | **Dimension** | <dimension> |
+     | **Confidence** | <confidence>% |
+     | **Agent** | <agent> |
+     <description>
+     **Suggestion:** <suggestion>
+     ---
+     ```
+4. **Action Plan**: Select the top 5 findings (by severity, then confidence) and format as a numbered action list with brief description of what to fix and why.
+5. **Refactoring Roadmap** (only if `state.refactoring_plan` is not null):
+   - Fill the `{{#IF_REFACTOR_PLAN}}` conditional block
+   - Set `{{THEME_COUNT}}`, `{{QUICK_WIN_COUNT}}`, `{{TOTAL_EFFORT}}`, `{{EXECUTION_SUMMARY}}`
+   - Render `{{QUICK_WIN_ITEMS}}`: themes flagged as quick wins
+   - Render `{{PHASE_1_THEMES}}` through `{{PHASE_4_THEMES}}`: themes grouped by phase
+   - For each theme, render using the Theme Detail Template (see report-template.md)
+6. **Statistics**: Total findings, per-severity counts, agent count, dimension count, per-agent breakdown table.
+### Write the report
+Determine the output path:
+- Base: `_bmad-output/deep-audit/deep-audit-<YYYY-MM-DD>.md`
+- If file exists, append suffix: `-2`, `-3`, etc.
+Write the filled report to the output path.
+Update state.json with `report_path`.
+---
+## Phase 8: Finalize State
+Update `_bmad-output/deep-audit/state.json`:
+- Set `status = "completed"`
+- Set `end_time` to current ISO timestamp
+- Set `report_path` to the report file path
+---
+## Phase 9: Present Summary
+Print a concise summary to the user:
+```
+═══════════════════════════════════════════════════
+  DEEP AUDIT COMPLETE
+═══════════════════════════════════════════════════
+Mode: <quick|full|single> (<agent_count> agent(s))
+Scope: <scope description>
+Duration: <duration>
+SCORECARD
+─────────────────────────────────────────────────
+<dimension>          <score>/10  (<P1> P1, <P2> P2, <P3> P3)
+<dimension>          <score>/10  (<P1> P1, <P2> P2, <P3> P3)
+...
+─────────────────────────────────────────────────
+Overall Health:      <overall>/10 — <label>
+FINDINGS: <total> total (<P1_count> critical, <P2_count> important, <P3_count> minor)
+TOP ACTIONS
+1. <action 1>
+2. <action 2>
+3. <action 3>
+Report: <report_path>
+State:  _bmad-output/deep-audit/state.json
+═══════════════════════════════════════════════════
+```
+If `state.refactoring_plan` is not null, add after TOP ACTIONS and before Report:
+```
+REFACTORING ROADMAP
+─────────────────────────────────────────────────
+<theme_count> themes across 4 phases | <quick_win_count> quick wins
+Total effort: <total_effort>
+QUICK WINS (do these now)
+1. T-NNN: <theme name> (<effort>, <risk> risk)
+2. T-NNN: <theme name> (<effort>, <risk> risk)
+...
+```
+If any agents failed (including the planner), add a section:
+```
+WARNINGS
+- Agent <name> failed: <error summary>
+```
+</workflow>
+## Error Recovery
+If the workflow fails at any point:
+1. **State is preserved**: `state.json` tracks which agents completed. Re-running `/deep-audit` with the same commit will auto-resume from where it left off.
+2. **Manual recovery**: Read `_bmad-output/deep-audit/state.json` to see:
+   - Which agents completed successfully (their findings are preserved)
+   - Which agents were pending when the failure occurred
+   - The raw output from each completed agent
+3. **Force fresh start**: Delete `_bmad-output/deep-audit/state.json` and re-run.
+## Safety Rules
+- NEVER modify source code. This is a read-only audit.
+- NEVER execute project code or tests. Only use git, gh, and file reading tools.
+- NEVER share audit findings outside the local report file.
+- If a scope flag points to a non-existent PR or commit, inform the user and stop.
+- If the project has no source files (empty repo), inform the user and stop.

package/commands/dev-story-backend.md CHANGED Viewed

@@ -189,8 +189,8 @@ Dev Agent Record update:
    - Extract functions if needed
    - Ensure consistent style
-2. After EACH refactor, run tests:
-   npm test
+2. After EACH refactor, run targeted tests:
+   npm test -- --filter "{test-file}"
    - Tests MUST stay green
    - IF tests fail: revert last change, try different approach
@@ -198,19 +198,20 @@ Dev Agent Record update:
 3. Log: "REFACTOR: Code improved, tests still green"
 ```
-### 6.4 Verify Full Suite
+### 6.4 Task Completion Check
 ```
-1. Run complete test suite:
-   npm test
+IMPORTANT: Do NOT run the full test suite after each individual task.
+The full suite runs ONCE at Step 8.1 (Final Verification) after ALL tasks are complete.
-2. Ensure no regressions:
-   - All existing tests pass
-   - New tests pass
+1. Verify targeted tests for this task pass:
+   npm test -- --filter "{test-file}"
-3. IF any failures:
+2. IF targeted tests fail:
    - Fix immediately
    - Do not proceed with failing tests
+3. Proceed to next task (or Step 7 validation gates)
 ```
 ---
@@ -224,8 +225,8 @@ Dev Agent Record update:
 ```
 - [ ] Tests exist for the functionality (written FIRST)
 - [ ] TDD cycle completed (red → green → refactor)
-- [ ] All tests pass 100%
-- [ ] No regressions (full suite passes)
+- [ ] All targeted tests pass 100%
+- [ ] No regressions in targeted tests (full suite verified at Step 8.1)
 - [ ] Acceptance criteria for this task satisfied
 ```

package/commands/dev-story-fullstack.md CHANGED Viewed

@@ -217,7 +217,9 @@ IF task type is UI:
    - E2E tests for user flows
    - Unit tests for component logic
-6. Run test suite and verify all pass
+6. Run targeted tests for affected test files:
+   npm test -- --filter "{test-file}"
+   (The full suite runs ONCE at Step 8.1 Final Verification after ALL tasks are complete.)
 ```
 ### 6.3 Backend Task Execution (TDD)
@@ -240,7 +242,9 @@ IF task type is BACKEND:
    - Remove duplication
    - Improve naming
-4. Run full test suite and verify all pass
+4. Run targeted tests for affected test files:
+   npm test -- --filter "{test-file}"
+   (The full suite runs ONCE at Step 8.1 Final Verification after ALL tasks are complete.)
 ```
 ### 6.4 Track Task Type Counts

package/commands/dev-story-ui.md CHANGED Viewed

@@ -300,11 +300,11 @@ Now that UI is visually correct, add tests:
    - Utility functions
    - Location: co-located with component (Component.test.tsx)
-3. Run full test suite:
-   npm test
-   npm run test:e2e (if available)
+3. Run targeted tests for the tests you just wrote:
+   npm test -- --filter "{test-file}"
+   (The full suite runs ONCE at Step 11.1 Final Verification after ALL tasks are complete.)
-4. Ensure all tests pass before marking task complete
+4. Ensure targeted tests pass before marking task complete
 ```
 ---