npm - forge-workflow - Versions diffs - 0.0.1 - Mend

forge-workflow 0.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (105) hide show

package/.claude/commands/dev.md +314 -0
package/.claude/commands/plan.md +389 -0
package/.claude/commands/premerge.md +179 -0
package/.claude/commands/research.md +42 -0
package/.claude/commands/review.md +442 -0
package/.claude/commands/rollback.md +721 -0
package/.claude/commands/ship.md +134 -0
package/.claude/commands/sonarcloud.md +152 -0
package/.claude/commands/status.md +77 -0
package/.claude/commands/validate.md +237 -0
package/.claude/commands/verify.md +221 -0
package/.claude/rules/greptile-review-process.md +285 -0
package/.claude/rules/workflow.md +105 -0
package/.claude/scripts/greptile-resolve.sh +526 -0
package/.claude/scripts/load-env.sh +32 -0
package/.forge/hooks/check-tdd.js +240 -0
package/.github/PLUGIN_TEMPLATE.json +32 -0
package/.mcp.json.example +12 -0
package/AGENTS.md +169 -0
package/CLAUDE.md +99 -0
package/LICENSE +21 -0
package/README.md +414 -0
package/bin/forge-cmd.js +313 -0
package/bin/forge-validate.js +303 -0
package/bin/forge.js +4228 -0
package/docs/AGENT_INSTALL_PROMPT.md +342 -0
package/docs/ENHANCED_ONBOARDING.md +602 -0
package/docs/EXAMPLES.md +482 -0
package/docs/GREPTILE_SETUP.md +400 -0
package/docs/MANUAL_REVIEW_GUIDE.md +106 -0
package/docs/ROADMAP.md +359 -0
package/docs/SETUP.md +632 -0
package/docs/TOOLCHAIN.md +849 -0
package/docs/VALIDATION.md +363 -0
package/docs/WORKFLOW.md +400 -0
package/docs/planning/PROGRESS.md +396 -0
package/docs/plans/.gitkeep +0 -0
package/docs/plans/2026-02-27-forge-test-suite-v2-decisions.md +21 -0
package/docs/plans/2026-02-27-forge-test-suite-v2-design.md +362 -0
package/docs/plans/2026-02-27-forge-test-suite-v2-tasks.md +343 -0
package/docs/plans/2026-03-02-superpowers-gaps-decisions.md +26 -0
package/docs/plans/2026-03-02-superpowers-gaps-design.md +239 -0
package/docs/plans/2026-03-02-superpowers-gaps-tasks.md +260 -0
package/docs/plans/2026-03-04-agent-command-parity-design.md +163 -0
package/docs/plans/2026-03-04-verify-worktree-cleanup-decisions.md +7 -0
package/docs/plans/2026-03-04-verify-worktree-cleanup-design.md +165 -0
package/docs/plans/2026-03-05-forge-uto-decisions.md +6 -0
package/docs/plans/2026-03-05-forge-uto-design.md +116 -0
package/docs/plans/2026-03-05-forge-uto-tasks.md +244 -0
package/docs/plans/2026-03-10-command-creator-and-eval-decisions.md +52 -0
package/docs/plans/2026-03-10-command-creator-and-eval-design.md +350 -0
package/docs/plans/2026-03-10-command-creator-and-eval-tasks.md +426 -0
package/docs/plans/2026-03-10-stale-workflow-refs-decisions.md +8 -0
package/docs/plans/2026-03-10-stale-workflow-refs-design.md +80 -0
package/docs/plans/2026-03-10-stale-workflow-refs-tasks.md +90 -0
package/docs/plans/2026-03-14-beads-plan-context-decisions.md +9 -0
package/docs/plans/2026-03-14-beads-plan-context-design.md +171 -0
package/docs/plans/2026-03-14-beads-plan-context-tasks.md +160 -0
package/docs/plans/2026-03-14-skill-eval-loop-decisions.md +33 -0
package/docs/plans/2026-03-14-skill-eval-loop-design.md +118 -0
package/docs/plans/2026-03-14-skill-eval-loop-results.md +78 -0
package/docs/plans/2026-03-14-skill-eval-loop-tasks.md +160 -0
package/docs/plans/2026-03-15-agent-command-parity-v2-decisions.md +11 -0
package/docs/plans/2026-03-15-agent-command-parity-v2-design.md +145 -0
package/docs/plans/2026-03-15-agent-command-parity-v2-tasks.md +211 -0
package/docs/research/TEMPLATE.md +292 -0
package/docs/research/advanced-testing.md +297 -0
package/docs/research/agent-permissions.md +167 -0
package/docs/research/dependency-chain.md +328 -0
package/docs/research/forge-workflow-v2.md +550 -0
package/docs/research/plugin-architecture.md +772 -0
package/docs/research/pr4-cli-automation.md +326 -0
package/docs/research/premerge-verify-restructure.md +205 -0
package/docs/research/skills-restructure.md +508 -0
package/docs/research/sonarcloud-perfection-plan.md +166 -0
package/docs/research/sonarcloud-quality-gate.md +184 -0
package/docs/research/superpowers-integration.md +403 -0
package/docs/research/superpowers.md +319 -0
package/docs/research/test-environment.md +519 -0
package/install.sh +1062 -0
package/lefthook.yml +39 -0
package/lib/agents/README.md +198 -0
package/lib/agents/claude.plugin.json +28 -0
package/lib/agents/cline.plugin.json +22 -0
package/lib/agents/codex.plugin.json +19 -0
package/lib/agents/copilot.plugin.json +24 -0
package/lib/agents/cursor.plugin.json +25 -0
package/lib/agents/kilocode.plugin.json +22 -0
package/lib/agents/opencode.plugin.json +20 -0
package/lib/agents/roo.plugin.json +23 -0
package/lib/agents-config.js +2112 -0
package/lib/commands/dev.js +513 -0
package/lib/commands/plan.js +696 -0
package/lib/commands/recommend.js +119 -0
package/lib/commands/ship.js +377 -0
package/lib/commands/status.js +378 -0
package/lib/commands/validate.js +602 -0
package/lib/context-merge.js +359 -0
package/lib/plugin-catalog.js +360 -0
package/lib/plugin-manager.js +166 -0
package/lib/plugin-recommender.js +141 -0
package/lib/project-discovery.js +491 -0
package/lib/setup.js +118 -0
package/lib/workflow-profiles.js +203 -0
package/package.json +115 -0

package/.claude/commands/dev.md ADDED Viewed

@@ -0,0 +1,314 @@
+---
+description: Subagent-driven TDD implementation per task from /plan task list
+---
+Implement each task from the /plan task list using a subagent-driven loop: implementer → spec compliance reviewer → code quality reviewer per task.
+# Dev
+This command reads the task list created by `/plan` and implements each task using a three-stage subagent loop. TDD is enforced inside each implementer subagent.
+## Usage
+```bash
+/dev
+```
+---
+## Setup
+### Step 1: Load context
+```bash
+# Find task list and design doc
+ls docs/plans/
+```
+Read:
+- **Task list**: `docs/plans/YYYY-MM-DD-<slug>-tasks.md` — extract ALL task text upfront
+- **Design doc**: `docs/plans/YYYY-MM-DD-<slug>-design.md` — including ambiguity policy section
+### Step 2: Create decisions log
+Create an empty decisions log at the start of every /dev session:
+```bash
+# docs/plans/YYYY-MM-DD-<slug>-decisions.md
+```
+Format for each entry:
+```
+## Decision N
+**Date**: YYYY-MM-DD
+**Task**: Task N — <title>
+**Gap**: [what the spec didn't cover]
+**Score**: [filled checklist total]
+**Route**: PROCEED / SPEC-REVIEWER / BLOCKED
+**Choice made**: [if PROCEED: what was decided and why]
+**Status**: RESOLVED / PENDING-DEVELOPER-INPUT
+```
+### Step 3: Pre-flight checks
+```
+<HARD-GATE: /dev start>
+Do NOT write any code until ALL confirmed:
+1. git branch --show-current output is NOT main or master
+2. git worktree list shows the worktree path for this feature
+3. Task list file confirmed to exist (use Read tool — do not assume)
+4. Decisions log file created
+</HARD-GATE>
+```
+---
+## Per-Task Loop
+Repeat for each task in the task list, in order:
+### Step A: Dispatch implementer subagent
+Provide the subagent with:
+- **Full task text** (copy the complete task content — do NOT send just the file path)
+- **Relevant design doc sections** for this task
+- **Recent git log** showing what has already been implemented
+The implementer subagent:
+1. Asks clarifying questions before writing any code
+2. Implements using RED-GREEN-REFACTOR
+3. Self-reviews for correctness
+4. Commits with a descriptive message
+```
+<HARD-GATE: TDD enforcement (inside implementer subagent)>
+Do NOT write any production code until:
+1. A FAILING test exists for that code
+2. The test has been run and output shows it FAILING
+3. The failure reason matches the expected missing behavior
+If code was written before its test: delete it. Start with the test.
+"The test would obviously fail" is not evidence. Run it and show the output.
+</HARD-GATE>
+```
+---
+### Step B: Decision gate (when implementer hits a spec gap)
+If the implementer encounters something not specified in the design doc, STOP and fill this checklist BEFORE deciding how to proceed:
+```
+Gap: [describe exactly what the spec doesn't cover]
+Score each dimension (0=No / 1=Possibly / 2=Yes):
+[ ] 1. Files affected beyond the current task?
+[ ] 2. Changes a function signature or public export?
+[ ] 3. Changes a shared module used by other tasks?
+[ ] 4. Changes or touches persistent data or schema?
+[ ] 5. Changes user-visible behavior not discussed in design doc?
+[ ] 6. Affects auth, permissions, or data exposure?
+[ ] 7. Hard to reverse without cascading changes to other files?
+TOTAL: ___ / 14
+Mandatory overrides — any of these = automatically BLOCKED:
+[ ] Security dimension (6) scored 2
+[ ] Schema migration or data model change
+[ ] Removes or changes an existing public API endpoint
+[ ] Affects a task that is already implemented and committed
+```
+**Score routing**:
+- **0-3**: PROCEED — make the decision, document in decisions log with full reasoning
+- **4-7**: SPEC-REVIEWER — route this decision to spec reviewer. Continue other independent tasks while waiting
+- **8+, or any mandatory override triggered**: BLOCKED — document in decisions log with Status=PENDING-DEVELOPER-INPUT. Complete all other independent tasks first. Surface to developer at /dev exit
+Log the decision entry before continuing.
+---
+### Step C: Spec compliance review
+After the implementer finishes the task, dispatch a **spec compliance reviewer** subagent.
+Provide:
+- Full task text (what was supposed to be implemented)
+- Relevant design doc sections
+- `git diff` for this task's commits
+Reviewer checks:
+- All requirements from the task text are implemented
+- Nothing extra was added beyond task scope
+- Edge cases documented in design doc are handled
+- TDD evidence: test exists, test was run failing, then passing
+If spec issues found: implementer fixes → re-review → repeat until ✅
+```
+<HARD-GATE: spec before quality>
+Do NOT dispatch code quality reviewer until spec compliance reviewer returns ✅ for this task.
+Running quality review before spec compliance is the wrong order.
+</HARD-GATE>
+```
+---
+### Step D: Code quality review
+After spec ✅, dispatch a **code quality reviewer** subagent.
+Provide:
+- git SHAs for this task's commits
+- The changed code (`git diff`)
+Reviewer checks:
+- Naming: clear, descriptive, consistent with codebase conventions
+- Structure: functions not too long, proper separation of concerns
+- Duplication: no copy-paste that could be extracted
+- Test coverage: tests cover happy path and at least one error path
+- No magic numbers, no commented-out code, no TODO without a Beads issue
+If quality issues found: implementer fixes → re-review → repeat until ✅
+---
+### Step E: Task completion
+```
+<HARD-GATE: task completion>
+NO COMPLETION CLAIMS WITHOUT FRESH VERIFICATION EVIDENCE.
+Do NOT mark task complete or move to next task until ALL confirmed in this session:
+1. Spec compliance reviewer returned ✅
+2. Code quality reviewer returned ✅
+3. Identify what command proves this task is done (e.g. `bun test`, a CLI invocation, a script run).
+4. Run it fresh — show the actual output. "Last run was fine" is not evidence.
+5. Tests run fresh — actual output shows passing.
+6. Implementer has committed (git log shows the commit).
+7. `bash scripts/beads-context.sh update-progress <id> <task-num> <total> "<title>" <commit-sha> <test-count> <gate-count>` ran successfully (exit code 0). If it fails: STOP. Show error. Do not proceed to next task.
+Forbidden phrases (these are not evidence):
+- "should pass"
+- "looks good"
+- "seems to work"
+</HARD-GATE>
+```
+Mark task complete. Move to next task.
+---
+## /dev Completion
+After all tasks are complete (or BLOCKED):
+### Final code review
+Dispatch a final code reviewer for the full implementation:
+- Overall coherence: does the feature hang together as a whole?
+- Cross-task consistency: naming, patterns, style consistent across all tasks?
+- Integration: do all the pieces connect correctly?
+### Surface BLOCKED decisions
+If any decisions have Status=PENDING-DEVELOPER-INPUT:
+```
+⏸️  /dev blocked — developer input needed
+The following decisions were deferred during implementation:
+Decision 1: [gap description]
+  Task: Task N — <title>
+  Score: 11/14 (mandatory override: schema change)
+  Options considered: [A] vs [B]
+  Recommendation: [A] because [reason]
+  Blocked tasks: Task 6, Task 7 (depend on this decision)
+Decision 2: ...
+Please review and respond. After decisions are resolved, the implementer
+will complete the blocked tasks and re-run spec + quality review.
+```
+Wait for developer input. After decisions resolved: implement blocked tasks → spec review → quality review → complete.
+### /dev exit gate
+```
+<HARD-GATE: /dev exit>
+Do NOT declare /dev complete until:
+1. All tasks are marked complete OR have BLOCKED status with PENDING-DEVELOPER-INPUT
+2. BLOCKED decisions have been surfaced to developer and are awaiting input
+3. Final code reviewer has approved (or issues fixed and re-reviewed)
+4. All decisions in decisions log have Status of RESOLVED or PENDING-DEVELOPER-INPUT
+5. No unresolved spec or quality issues remain
+</HARD-GATE>
+```
+### Beads update
+```bash
+bash scripts/beads-context.sh stage-transition <id> dev validate
+```
+---
+## Decision Gate Calibration
+The frequency of decision gates is a **plan quality metric**:
+- **0 gates fired**: Excellent — Phase 1 Q&A covered all cases
+- **1-2 gates fired**: Good — minor gaps, normal
+- **3-5 gates fired**: Plan was incomplete — note for Phase 1 improvement next feature
+- **5+ gates fired**: Phase 1 Q&A was insufficient — the ambiguity policy field needed to be more specific
+Document the gate count in the final commit message.
+---
+## Example Output (all tasks complete)
+```
+✓ Task 1: Types and interfaces — COMPLETE
+  Spec: ✅  Quality: ✅  Tests: 4/4 passing  Commit: abc1234
+  Decision gates: 0
+✓ Task 2: Validation logic — COMPLETE
+  Spec: ✅  Quality: ✅  Tests: 8/8 passing  Commit: def5678
+  Decision gates: 1 (PROCEED, score 2 — documented in decisions log)
+✓ Task 3: API endpoint — COMPLETE
+  Spec: ✅  Quality: ✅  Tests: 6/6 passing  Commit: ghi9012
+  Decision gates: 0
+✓ Final code review: ✅ (coherent, consistent, correctly integrated)
+✓ Decisions log: docs/plans/2026-02-26-stripe-billing-decisions.md
+  - Decision 1: RESOLVED (score 2, proceeded with conservative choice)
+  - Decision gates fired: 1 (plan quality: Good)
+✓ Beads updated: forge-xyz → implementation complete
+Ready for /validate
+```
+## Integration with Workflow
+```
+Utility: /status     → Understand current context before starting
+Stage 1: /plan       → Design intent → research → branch + worktree + task list
+Stage 2: /dev        → Implement each task with subagent-driven TDD (you are here)
+Stage 3: /validate      → Type check, lint, tests, security — all fresh output
+Stage 4: /ship       → Push + create PR
+Stage 5: /review     → Address GitHub Actions, Greptile, SonarCloud
+Stage 6: /premerge   → Update docs, hand off PR to user
+Stage 7: /verify     → Post-merge CI check on main
+```
+## Tips
+- **Send full task text to subagents**: Never send the file path — copy the complete task text directly into the subagent prompt
+- **TDD lives inside the implementer**: The implementer subagent is responsible for RED-GREEN-REFACTOR, not the orchestrating /dev session
+- **Spec before quality — always**: A task that passes quality review but fails spec compliance has still failed
+- **Decision gates are rare with a good plan**: If gates fire frequently, the Phase 1 Q&A needs more depth next time
+- **BLOCKED ≠ failed**: Surfacing a blocked decision with documentation and a recommendation is the correct behavior

package/.claude/commands/plan.md ADDED Viewed

@@ -0,0 +1,389 @@
+---
+description: Design intent → research → branch + worktree + task list
+---
+Plan a feature from scratch: brainstorm design intent, research technical approach, then set up branch, worktree, and a complete task list ready for /dev.
+# Plan
+This command runs in **3 phases**. Each phase ends with a HARD-GATE. Do not skip phases.
+---
+```
+<HARD-GATE: /plan entry — worktree isolation>
+Before ANY planning work begins:
+1. Run: git branch --show-current
+2. If the current branch is NOT master/main:
+   - STOP. Do not begin Phase 1.
+   - Tell the user: "You are on '<branch>'. Planning must start from a clean worktree on master.
+     Run: git checkout master — then re-run /plan."
+3. If on master, create the worktree NOW before asking any questions:
+   a. git worktree add -b feat/<slug> .worktrees/<slug>
+   b. cd .worktrees/<slug>
+4. Confirm: "Working in isolated worktree: .worktrees/<slug> (branch: feat/<slug>)"
+5. ONLY THEN begin Phase 1.
+Rationale: Planning commits (design docs, task lists) belong only to this feature's branch.
+If planning runs in the main directory on a non-master branch, those commits contaminate
+whatever branch is currently checked out. The worktree ensures zero cross-contamination
+between parallel features or sessions.
+</HARD-GATE>
+```
+---
+## Usage
+```bash
+/plan <feature-slug>
+/plan <feature-slug> --strategic   # Major architecture change: creates design doc PR before Phase 2
+/plan <feature-slug> --continue    # After --strategic PR is merged: run Phase 2 + 3
+```
+---
+## Phase 1: Design Intent (Brainstorming)
+**Goal**: Capture WHAT to build — purpose, constraints, success criteria, edge cases, approach.
+### Step 1: Explore project context
+Before asking any questions, read relevant files:
+- Recent commits related to this area
+- Existing code in affected modules
+- Any related docs, tests, or prior research
+### Step 2: Ask clarifying questions — one at a time
+Ask each question in sequence. Wait for user response. Use multiple choice where possible.
+Questions to cover (adapt to feature, don't ask mechanical copies):
+1. **Purpose** — What problem does this solve? Who benefits?
+2. **Constraints** — What must this NOT do? What are the hard limits?
+3. **Success criteria** — How will we know it's done? What is the minimum viable result?
+4. **Edge cases** — What happens when [key dependency] fails / [input] is missing / [state] is ambiguous?
+5. **Technical preferences** — Library A or B? Pattern X or Y? (when real options exist)
+6. **Ambiguity policy** — If a spec gap is found mid-dev, should the agent: (a) make a reasonable choice and document it, or (b) pause and wait for input?
+### Step 3: Propose approaches
+Propose 2-3 concrete approaches with:
+- Trade-offs (speed vs safety, complexity vs flexibility)
+- A clear recommendation with reasoning
+- Get user approval on the chosen approach
+### Step 4: Write design doc
+Save to `docs/plans/YYYY-MM-DD-<slug>-design.md` with these sections:
+- **Feature**: slug, date, status
+- **Purpose**: what problem it solves
+- **Success criteria**: measurable, specific
+- **Out of scope**: explicit boundaries
+- **Approach selected**: which option and why
+- **Constraints**: hard limits
+- **Edge cases**: decisions made during Q&A
+- **Ambiguity policy**: agent's fallback when spec gaps arise mid-dev
+Commit the design doc:
+```bash
+git add docs/plans/YYYY-MM-DD-<slug>-design.md
+git commit -m "docs: add design doc for <slug>"
+```
+---
+**--strategic flag** (for major architecture changes):
+After committing the design doc, push to a proposal branch and open PR:
+```bash
+git checkout -b feat/<slug>-proposal
+git push -u origin feat/<slug>-proposal
+gh pr create --title "Design: <feature-name>" \
+  --body "Design doc for review. See docs/plans/YYYY-MM-DD-<slug>-design.md"
+```
+**STOP here.** Present the PR URL. Wait for the user to merge the proposal PR.
+After merge, run `/plan <slug> --continue` to proceed to Phase 2 + 3.
+---
+```
+<HARD-GATE: Phase 1 exit>
+Do NOT begin Phase 2 (web research) until:
+1. User has approved the design in this session
+2. Design doc exists at docs/plans/YYYY-MM-DD-<slug>-design.md
+3. Design doc includes: success criteria, edge cases, out-of-scope, ambiguity policy
+4. Design doc is committed to git
+</HARD-GATE>
+```
+---
+## Phase 2: Technical Research
+**Goal**: Find HOW to build it — best practices, known issues, security risks, TDD scenarios.
+Run these in parallel:
+### Web research (parallel-deep-research skill)
+```
+Skill("parallel-deep-research")
+```
+Search for:
+- "[tech stack] [feature] best practices [year]"
+- "[library/framework] [feature] implementation patterns"
+- "Known issues / gotchas with [approach selected]"
+### OWASP Top 10 analysis
+For this feature's risk surface, document each relevant OWASP category:
+- What the risk is
+- Whether it applies to this feature
+- What mitigation will be implemented
+### Codebase exploration (Explore agent)
+- Similar existing patterns to reuse
+- Files this feature will affect
+- Existing test infrastructure to leverage
+### DRY check (mandatory — use actual search tools)
+Before finalizing the approach, run Grep/Glob/Read searches for existing implementations of the planned function or pattern. Do not rely on memory or assumptions — execute the searches.
+```
+Grep(searchTerm)   # e.g., the function or concept name
+Glob("**/*.js")    # narrow to affected file types if needed
+Read(matchedFile)  # inspect any match in context
+```
+If a match is found:
+- Update the design doc's "Approach selected" section to say "extend existing [file/function]" — not "create new".
+- Note the existing file path and line number in the design doc.
+If no match is found: proceed. The DRY gate is cleared.
+### Blast-radius search (mandatory for remove/rename/replace features)
+If this feature involves **removing**, **renaming**, or **replacing** a concept, tool, or dependency:
+1. Grep the ENTIRE codebase for the thing being removed/renamed:
+   ```
+   Grep("<thing-being-removed>")     # exact name
+   Grep("<thing-being-removed>", -i)  # case-insensitive variant
+   Glob("**/*<thing>*")              # files named after it
+   ```
+2. For EVERY match found:
+   - Note the file path and line number in the design doc
+   - Add a cleanup task to the task list (Phase 3)
+   - Flag matches in unexpected packages or config files explicitly
+3. Common hiding spots to check:
+   - `package.json` (scripts, dependencies, description)
+   - `install.sh` / setup scripts
+   - CI/CD workflows (`.github/workflows/`)
+   - Agent config files (`lib/agents/`, `.cursorrules`, etc.)
+   - Documentation (`docs/`, `README.md`, `AGENTS.md`)
+   - Import statements and require() calls
+If no removal/rename is involved, this section is skipped.
+### TDD test scenarios
+Identify at minimum 3 test scenarios:
+- Happy path
+- Error / failure path
+- Edge case from Phase 1
+Append all research findings to the design doc under a `## Technical Research` section (not a separate file).
+---
+```
+<HARD-GATE: Phase 2 exit>
+Do NOT begin Phase 3 (setup) until:
+1. OWASP analysis is documented in design doc
+2. At least 3 TDD test scenarios are identified
+3. Approach selection is confirmed (which library/pattern to use)
+4. If feature involves removal/rename: blast-radius search completed, all references added to task list
+</HARD-GATE>
+```
+---
+## Phase 3: Setup + Task List
+**Goal**: Create branch, worktree, Beads issue, and a complete task list ready for /dev.
+### Step 1: Beads issue
+```bash
+bd create --title="<feature-name>" --type=feature
+bd update <id> --status=in_progress
+```
+### Step 2: Branch + worktree
+**ALWAYS branch from master, never from the current branch.** If the working directory is on any branch other than master, the new feature branch would inherit all unmerged changes from that branch — contaminating the new feature's history.
+**Note**: If the Entry HARD-GATE already created the branch and worktree (and you are already inside `.worktrees/<slug>`), skip Steps 2b–2d — they are already done.
+```bash
+# Step 2a: Check if branch and worktree were already created by Entry HARD-GATE
+CURRENT=$(git branch --show-current)
+if [ "$CURRENT" = "feat/<slug>" ]; then
+  echo "✓ Branch feat/<slug> already exists (Entry HARD-GATE created it) — skipping 2b–2d"
+else
+  # Step 2b: Verify .worktrees/ is gitignored — add if missing
+  git check-ignore -v .worktrees/ || echo ".worktrees/" >> .gitignore
+  # Step 2c: Create branch + worktree in one command (from master)
+  # Using -b with worktree add avoids "branch already checked out" error
+  git checkout master
+  git worktree add -b feat/<slug> .worktrees/<slug>
+  cd .worktrees/<slug>
+fi
+```
+**Why this matters**: Multiple parallel features or sessions each get their own isolated worktree. Changes to one feature never bleed into another. The main working directory can stay on any branch without affecting new feature branches.
+### Step 3: Project setup in worktree
+Auto-detect and run install:
+```bash
+# e.g., bun install / npm install / pip install -r requirements.txt
+```
+### Step 4: Baseline test run
+```bash
+# Run full test suite in worktree
+bun test   # or project test command
+```
+If tests fail: report which tests are failing and ask user whether to investigate or proceed anyway. Do not silently proceed past failing baseline tests.
+### Step 5: Task list creation
+Read the design doc. Break implementation into granular tasks.
+**Task format** (each task MUST have ALL of these):
+```
+Task N: <descriptive title>
+File(s): <exact file paths>
+What to implement: <complete description — not "add feature X", but what specifically>
+TDD steps:
+  1. Write test: <test file path, what assertion, what input/output>
+  2. Run test: confirm it fails with [specific expected error message]
+  3. Implement: <exact function/class/component to write>
+  4. Run test: confirm it passes
+  5. Commit: `<type>: <message>`
+Expected output: <what running the test/code produces when done>
+```
+**Ordering rules**:
+- Foundational/shared modules FIRST (types, utils, constants)
+- Feature logic SECOND
+- Integration/wiring THIRD
+- Uncertain/ambiguous tasks LAST (so they can be deferred if blocked)
+**YAGNI filter** (after initial task draft, before saving):
+For each task, confirm it maps to a specific requirement, success criterion, or edge case in the design doc. Run `applyYAGNIFilter({ task, designDoc })` for each task.
+- Tasks that match → keep as-is.
+- Tasks with no anchor → flagged as "potential scope creep". Present flagged tasks to the user: "These tasks have no anchor in the design doc. Keep (specify which requirement it serves) or remove?"
+- If ALL tasks are flagged → return `allFlagged: true` and tell the user: "Design doc doesn't cover all tasks — needs amendment." Do not save the task list until the design doc is updated or tasks are removed.
+**Before finalizing**: flag any tasks that touch areas not fully specified in the design doc. Present flagged tasks to user for quick clarification before saving.
+Save to `docs/plans/YYYY-MM-DD-<slug>-tasks.md`.
+### Step 5b: Beads context
+After saving the task list, attach design context and acceptance criteria to the Beads issue so downstream stages (`/dev`, `/validate`, `/review`) can retrieve it without re-reading the design doc.
+```bash
+# Link design metadata (task count + task file path) to the Beads issue
+bash scripts/beads-context.sh set-design <id> <task-count> docs/plans/YYYY-MM-DD-<slug>-tasks.md
+# Record the success criteria from the design doc on the issue
+bash scripts/beads-context.sh set-acceptance <id> "<success-criteria from design doc>"
+```
+Both commands must exit with code 0. If either fails, investigate (wrong issue ID? missing script?) before continuing.
+### Step 6: User review
+Present the full task list. Allow the user to reorder, split, or remove tasks.
+---
+```
+<HARD-GATE: /plan exit>
+Do NOT proceed to /dev until ALL are confirmed:
+1. git branch --show-current output shows feat/<slug>
+2. git worktree list shows .worktrees/<slug>
+3. Baseline tests ran — either passing OR user confirmed to proceed past failures
+4. Beads issue is created with status=in_progress
+5. Task list exists at docs/plans/YYYY-MM-DD-<slug>-tasks.md
+6. User has confirmed task list is correct
+7. `beads-context.sh set-design` ran successfully (exit code 0)
+8. `beads-context.sh set-acceptance` ran successfully (exit code 0)
+</HARD-GATE>
+```
+After all HARD-GATE items pass, record the stage transition on the Beads issue:
+```bash
+bash scripts/beads-context.sh stage-transition <id> plan dev
+```
+---
+## Example Output (Phase 3 complete)
+```
+✓ Phase 1: Design intent captured
+  - Design doc: docs/plans/2026-02-26-stripe-billing-design.md
+  - Approach: Stripe SDK v4 (selected over v3)
+  - Ambiguity policy: Make conservative choice + document in decisions log
+✓ Phase 2: Technical research complete
+  - OWASP Top 10: 3 risks identified, 3 mitigations planned
+  - TDD scenarios: 5 identified
+  - Sources: 8 references
+✓ Phase 3: Setup complete
+  - Beads: forge-xyz (in_progress)
+  - Branch: feat/stripe-billing
+  - Worktree: .worktrees/stripe-billing (baseline: 24/24 tests passing)
+  - Task list: docs/plans/2026-02-26-stripe-billing-tasks.md (8 tasks)
+⏸️  Task list ready for review. Confirm to proceed.
+After confirming, run: /dev
+```
+## Integration with Workflow
+```
+Utility: /status     → Understand current context before starting
+Stage 1: /plan       → Design intent → research → branch + worktree + task list (you are here)
+Stage 2: /dev        → Implement each task with subagent-driven TDD
+Stage 3: /validate      → Type check, lint, tests, security — all fresh output
+Stage 4: /ship       → Push + create PR
+Stage 5: /review     → Address GitHub Actions, Greptile, SonarCloud
+Stage 6: /premerge   → Update docs, hand off PR to user
+Stage 7: /verify     → Post-merge CI check on main
+```
+## Tips
+- **Phase 1 quality = /dev autonomy**: Every ambiguity resolved in Phase 1 is a decision gate that won't fire during /dev
+- **One question at a time**: Don't dump all questions at once — dialogue produces better design decisions than a questionnaire
+- **Task granularity**: Target 2-5 minutes per task. If a task takes longer, split it
+- **Uncertain tasks go last**: Anything ambiguous at the end of the task list can be deferred if blocked without stopping other work
+- **Baseline failures matter**: Pre-existing test failures hide regressions. Fix or explicitly document them before /dev starts