npm - @howlil/ez-agents - Versions diffs - 2.0.0 → 2.0.1 - Mend

@howlil/ez-agents 2.0.0 → 2.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (106) hide show

package/LICENSE +21 -21
package/README.md +93 -93
package/agents/ez-plan-checker.md +2 -2
package/agents/ez-research-synthesizer.md +1 -1
package/agents/ez-ui-researcher.md +1 -1
package/agents/ez-verifier.md +1 -1
package/bin/install.js +132 -132
package/get-shit-done/bin/lib/assistant-adapter.cjs +205 -205
package/get-shit-done/bin/lib/audit-exec.cjs +150 -150
package/get-shit-done/bin/lib/auth.cjs +175 -175
package/get-shit-done/bin/lib/circuit-breaker.cjs +118 -118
package/get-shit-done/bin/lib/commands.cjs +666 -666
package/get-shit-done/bin/lib/config.cjs +183 -183
package/get-shit-done/bin/lib/core.cjs +495 -495
package/get-shit-done/bin/lib/file-lock.cjs +236 -236
package/get-shit-done/bin/lib/frontmatter.cjs +299 -299
package/get-shit-done/bin/lib/fs-utils.cjs +153 -153
package/get-shit-done/bin/lib/git-utils.cjs +203 -203
package/get-shit-done/bin/lib/health-check.cjs +163 -163
package/get-shit-done/bin/lib/index.cjs +113 -113
package/get-shit-done/bin/lib/init.cjs +710 -710
package/get-shit-done/bin/lib/logger.cjs +117 -117
package/get-shit-done/bin/lib/milestone.cjs +241 -241
package/get-shit-done/bin/lib/model-provider.cjs +146 -146
package/get-shit-done/bin/lib/phase.cjs +908 -908
package/get-shit-done/bin/lib/retry.cjs +119 -119
package/get-shit-done/bin/lib/roadmap.cjs +305 -305
package/get-shit-done/bin/lib/safe-exec.cjs +128 -128
package/get-shit-done/bin/lib/safe-path.cjs +130 -130
package/get-shit-done/bin/lib/state.cjs +721 -721
package/get-shit-done/bin/lib/temp-file.cjs +239 -239
package/get-shit-done/bin/lib/template.cjs +222 -222
package/get-shit-done/bin/lib/test-file-lock.cjs +112 -112
package/get-shit-done/bin/lib/test-graceful.cjs +93 -93
package/get-shit-done/bin/lib/test-logger.cjs +60 -60
package/get-shit-done/bin/lib/test-safe-exec.cjs +38 -38
package/get-shit-done/bin/lib/test-safe-path.cjs +33 -33
package/get-shit-done/bin/lib/test-temp-file.cjs +125 -125
package/get-shit-done/bin/lib/timeout-exec.cjs +62 -62
package/get-shit-done/bin/lib/verify.cjs +820 -820
package/get-shit-done/references/checkpoints.md +776 -776
package/get-shit-done/references/questioning.md +162 -162
package/get-shit-done/references/tdd.md +263 -263
package/get-shit-done/templates/codebase/concerns.md +310 -310
package/get-shit-done/templates/codebase/conventions.md +307 -307
package/get-shit-done/templates/codebase/integrations.md +280 -280
package/get-shit-done/templates/codebase/stack.md +186 -186
package/get-shit-done/templates/codebase/testing.md +480 -480
package/get-shit-done/templates/config.json +37 -37
package/get-shit-done/templates/continue-here.md +78 -78
package/get-shit-done/templates/milestone-archive.md +123 -123
package/get-shit-done/templates/milestone.md +115 -115
package/get-shit-done/templates/requirements.md +231 -231
package/get-shit-done/templates/research-project/ARCHITECTURE.md +204 -204
package/get-shit-done/templates/research-project/FEATURES.md +147 -147
package/get-shit-done/templates/research-project/PITFALLS.md +200 -200
package/get-shit-done/templates/research-project/STACK.md +120 -120
package/get-shit-done/templates/research-project/SUMMARY.md +170 -170
package/get-shit-done/templates/retrospective.md +54 -54
package/get-shit-done/templates/roadmap.md +202 -202
package/get-shit-done/templates/summary-minimal.md +41 -41
package/get-shit-done/templates/summary-standard.md +48 -48
package/get-shit-done/templates/summary.md +248 -248
package/get-shit-done/templates/user-setup.md +311 -311
package/get-shit-done/templates/verification-report.md +322 -322
package/get-shit-done/workflows/add-phase.md +112 -112
package/get-shit-done/workflows/add-tests.md +351 -351
package/get-shit-done/workflows/add-todo.md +158 -158
package/get-shit-done/workflows/audit-milestone.md +332 -332
package/get-shit-done/workflows/autonomous.md +743 -743
package/get-shit-done/workflows/check-todos.md +177 -177
package/get-shit-done/workflows/cleanup.md +152 -152
package/get-shit-done/workflows/complete-milestone.md +766 -766
package/get-shit-done/workflows/diagnose-issues.md +219 -219
package/get-shit-done/workflows/discovery-phase.md +289 -289
package/get-shit-done/workflows/discuss-phase.md +762 -762
package/get-shit-done/workflows/execute-phase.md +468 -468
package/get-shit-done/workflows/execute-plan.md +483 -483
package/get-shit-done/workflows/health.md +159 -159
package/get-shit-done/workflows/help.md +492 -492
package/get-shit-done/workflows/insert-phase.md +130 -130
package/get-shit-done/workflows/list-phase-assumptions.md +178 -178
package/get-shit-done/workflows/map-codebase.md +316 -316
package/get-shit-done/workflows/new-milestone.md +384 -384
package/get-shit-done/workflows/new-project.md +1111 -1111
package/get-shit-done/workflows/node-repair.md +92 -92
package/get-shit-done/workflows/pause-work.md +122 -122
package/get-shit-done/workflows/plan-milestone-gaps.md +274 -274
package/get-shit-done/workflows/plan-phase.md +651 -651
package/get-shit-done/workflows/progress.md +382 -382
package/get-shit-done/workflows/quick.md +610 -610
package/get-shit-done/workflows/remove-phase.md +155 -155
package/get-shit-done/workflows/research-phase.md +74 -74
package/get-shit-done/workflows/resume-project.md +307 -307
package/get-shit-done/workflows/set-profile.md +81 -81
package/get-shit-done/workflows/settings.md +242 -242
package/get-shit-done/workflows/stats.md +57 -57
package/get-shit-done/workflows/transition.md +544 -544
package/get-shit-done/workflows/ui-phase.md +290 -290
package/get-shit-done/workflows/ui-review.md +157 -157
package/get-shit-done/workflows/update.md +320 -320
package/get-shit-done/workflows/validate-phase.md +167 -167
package/get-shit-done/workflows/verify-phase.md +243 -243
package/package.json +1 -1
package/scripts/build-hooks.js +43 -43
package/scripts/run-tests.cjs +29 -29

package/get-shit-done/workflows/add-tests.md CHANGED Viewed

@@ -1,351 +1,351 @@
-<purpose>
-Generate unit and E2E tests for a completed phase based on its SUMMARY.md, CONTEXT.md, and implementation. Classifies each changed file into TDD (unit), E2E (browser), or Skip categories, presents a test plan for user approval, then generates tests following RED-GREEN conventions.
-Users currently hand-craft `/ez:quick` prompts for test generation after each phase. This workflow standardizes the process with proper classification, quality gates, and gap reporting.
-</purpose>
-<required_reading>
-Read all files referenced by the invoking prompt's execution_context before starting.
-</required_reading>
-<process>
-<step name="parse_arguments">
-Parse `$ARGUMENTS` for:
-- Phase number (integer, decimal, or letter-suffix) → store as `$PHASE_ARG`
-- Remaining text after phase number → store as `$EXTRA_INSTRUCTIONS` (optional)
-Example: `/ez:add-tests 12 focus on edge cases` → `$PHASE_ARG=12`, `$EXTRA_INSTRUCTIONS="focus on edge cases"`
-If no phase argument provided:
-```
-ERROR: Phase number required
-Usage: /ez:add-tests <phase> [additional instructions]
-Example: /ez:add-tests 12
-Example: /ez:add-tests 12 focus on edge cases in the pricing module
-```
-Exit.
-</step>
-<step name="init_context">
-Load phase operation context:
-```bash
-INIT=$(node "$HOME/.claude/ez-agents/bin/ez-tools.cjs" init phase-op "${PHASE_ARG}")
-if [[ "$INIT" == @file:* ]]; then INIT=$(cat "${INIT#@file:}"); fi
-```
-Extract from init JSON: `phase_dir`, `phase_number`, `phase_name`.
-Verify the phase directory exists. If not:
-```
-ERROR: Phase directory not found for phase ${PHASE_ARG}
-Ensure the phase exists in .planning/phases/
-```
-Exit.
-Read the phase artifacts (in order of priority):
-1. `${phase_dir}/*-SUMMARY.md` — what was implemented, files changed
-2. `${phase_dir}/CONTEXT.md` — acceptance criteria, decisions
-3. `${phase_dir}/*-VERIFICATION.md` — user-verified scenarios (if UAT was done)
-If no SUMMARY.md exists:
-```
-ERROR: No SUMMARY.md found for phase ${PHASE_ARG}
-This command works on completed phases. Run /ez:execute-phase first.
-```
-Exit.
-Present banner:
-```
-━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
- GSD ► ADD TESTS — Phase ${phase_number}: ${phase_name}
-━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
-```
-</step>
-<step name="analyze_implementation">
-Extract the list of files modified by the phase from SUMMARY.md ("Files Changed" or equivalent section).
-For each file, classify into one of three categories:
-| Category | Criteria | Test Type |
-|----------|----------|-----------|
-| **TDD** | Pure functions where `expect(fn(input)).toBe(output)` is writable | Unit tests |
-| **E2E** | UI behavior verifiable by browser automation | Playwright/E2E tests |
-| **Skip** | Not meaningfully testable or already covered | None |
-**TDD classification — apply when:**
-- Business logic: calculations, pricing, tax rules, validation
-- Data transformations: mapping, filtering, aggregation, formatting
-- Parsers: CSV, JSON, XML, custom format parsing
-- Validators: input validation, schema validation, business rules
-- State machines: status transitions, workflow steps
-- Utilities: string manipulation, date handling, number formatting
-**E2E classification — apply when:**
-- Keyboard shortcuts: key bindings, modifier keys, chord sequences
-- Navigation: page transitions, routing, breadcrumbs, back/forward
-- Form interactions: submit, validation errors, field focus, autocomplete
-- Selection: row selection, multi-select, shift-click ranges
-- Drag and drop: reordering, moving between containers
-- Modal dialogs: open, close, confirm, cancel
-- Data grids: sorting, filtering, inline editing, column resize
-**Skip classification — apply when:**
-- UI layout/styling: CSS classes, visual appearance, responsive breakpoints
-- Configuration: config files, environment variables, feature flags
-- Glue code: dependency injection setup, middleware registration, routing tables
-- Migrations: database migrations, schema changes
-- Simple CRUD: basic create/read/update/delete with no business logic
-- Type definitions: records, DTOs, interfaces with no logic
-Read each file to verify classification. Don't classify based on filename alone.
-</step>
-<step name="present_classification">
-Present the classification to the user for confirmation before proceeding:
-```
-AskUserQuestion(
-  header: "Test Classification",
-  question: |
-    ## Files classified for testing
-    ### TDD (Unit Tests) — {N} files
-    {list of files with brief reason}
-    ### E2E (Browser Tests) — {M} files
-    {list of files with brief reason}
-    ### Skip — {K} files
-    {list of files with brief reason}
-    {if $EXTRA_INSTRUCTIONS: "Additional instructions: ${EXTRA_INSTRUCTIONS}"}
-    How would you like to proceed?
-  options:
-    - "Approve and generate test plan"
-    - "Adjust classification (I'll specify changes)"
-    - "Cancel"
-)
-```
-If user selects "Adjust classification": apply their changes and re-present.
-If user selects "Cancel": exit gracefully.
-</step>
-<step name="discover_test_structure">
-Before generating the test plan, discover the project's existing test structure:
-```bash
-# Find existing test directories
-find . -type d -name "*test*" -o -name "*spec*" -o -name "*__tests__*" 2>/dev/null | head -20
-# Find existing test files for convention matching
-find . -type f \( -name "*.test.*" -o -name "*.spec.*" -o -name "*Tests.fs" -o -name "*Test.fs" \) 2>/dev/null | head -20
-# Check for test runners
-ls package.json *.sln 2>/dev/null
-```
-Identify:
-- Test directory structure (where unit tests live, where E2E tests live)
-- Naming conventions (`.test.ts`, `.spec.ts`, `*Tests.fs`, etc.)
-- Test runner commands (how to execute unit tests, how to execute E2E tests)
-- Test framework (xUnit, NUnit, Jest, Playwright, etc.)
-If test structure is ambiguous, ask the user:
-```
-AskUserQuestion(
-  header: "Test Structure",
-  question: "I found multiple test locations. Where should I create tests?",
-  options: [list discovered locations]
-)
-```
-</step>
-<step name="generate_test_plan">
-For each approved file, create a detailed test plan.
-**For TDD files**, plan tests following RED-GREEN-REFACTOR:
-1. Identify testable functions/methods in the file
-2. For each function: list input scenarios, expected outputs, edge cases
-3. Note: since code already exists, tests may pass immediately — that's OK, but verify they test the RIGHT behavior
-**For E2E files**, plan tests following RED-GREEN gates:
-1. Identify user scenarios from CONTEXT.md/VERIFICATION.md
-2. For each scenario: describe the user action, expected outcome, assertions
-3. Note: RED gate means confirming the test would fail if the feature were broken
-Present the complete test plan:
-```
-AskUserQuestion(
-  header: "Test Plan",
-  question: |
-    ## Test Generation Plan
-    ### Unit Tests ({N} tests across {M} files)
-    {for each file: test file path, list of test cases}
-    ### E2E Tests ({P} tests across {Q} files)
-    {for each file: test file path, list of test scenarios}
-    ### Test Commands
-    - Unit: {discovered test command}
-    - E2E: {discovered e2e command}
-    Ready to generate?
-  options:
-    - "Generate all"
-    - "Cherry-pick (I'll specify which)"
-    - "Adjust plan"
-)
-```
-If "Cherry-pick": ask user which tests to include.
-If "Adjust plan": apply changes and re-present.
-</step>
-<step name="execute_tdd_generation">
-For each approved TDD test:
-1. **Create test file** following discovered project conventions (directory, naming, imports)
-2. **Write test** with clear arrange/act/assert structure:
-   ```
-   // Arrange — set up inputs and expected outputs
-   // Act — call the function under test
-   // Assert — verify the output matches expectations
-   ```
-3. **Run the test**:
-   ```bash
-   {discovered test command}
-   ```
-4. **Evaluate result:**
-   - **Test passes**: Good — the implementation satisfies the test. Verify the test checks meaningful behavior (not just that it compiles).
-   - **Test fails with assertion error**: This may be a genuine bug discovered by the test. Flag it:
-     ```
-     ⚠️ Potential bug found: {test name}
-     Expected: {expected}
-     Actual: {actual}
-     File: {implementation file}
-     ```
-     Do NOT fix the implementation — this is a test-generation command, not a fix command. Record the finding.
-   - **Test fails with error (import, syntax, etc.)**: This is a test error. Fix the test and re-run.
-</step>
-<step name="execute_e2e_generation">
-For each approved E2E test:
-1. **Check for existing tests** covering the same scenario:
-   ```bash
-   grep -r "{scenario keyword}" {e2e test directory} 2>/dev/null
-   ```
-   If found, extend rather than duplicate.
-2. **Create test file** targeting the user scenario from CONTEXT.md/VERIFICATION.md
-3. **Run the E2E test**:
-   ```bash
-   {discovered e2e command}
-   ```
-4. **Evaluate result:**
-   - **GREEN (passes)**: Record success
-   - **RED (fails)**: Determine if it's a test issue or a genuine application bug. Flag bugs:
-     ```
-     ⚠️ E2E failure: {test name}
-     Scenario: {description}
-     Error: {error message}
-     ```
-   - **Cannot run**: Report blocker. Do NOT mark as complete.
-     ```
-     🛑 E2E blocker: {reason tests cannot run}
-     ```
-**No-skip rule:** If E2E tests cannot execute (missing dependencies, environment issues), report the blocker and mark the test as incomplete. Never mark success without actually running the test.
-</step>
-<step name="summary_and_commit">
-Create a test coverage report and present to user:
-```
-━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
- GSD ► TEST GENERATION COMPLETE
-━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
-## Results
-| Category | Generated | Passing | Failing | Blocked |
-|----------|-----------|---------|---------|---------|
-| Unit     | {N}       | {n1}    | {n2}    | {n3}    |
-| E2E      | {M}       | {m1}    | {m2}    | {m3}    |
-## Files Created/Modified
-{list of test files with paths}
-## Coverage Gaps
-{areas that couldn't be tested and why}
-## Bugs Discovered
-{any assertion failures that indicate implementation bugs}
-```
-Record test generation in project state:
-```bash
-node "$HOME/.claude/ez-agents/bin/ez-tools.cjs" state-snapshot
-```
-If there are passing tests to commit:
-```bash
-git add {test files}
-git commit -m "test(phase-${phase_number}): add unit and E2E tests from add-tests command"
-```
-Present next steps:
-```
----
-## ▶ Next Up
-{if bugs discovered:}
-**Fix discovered bugs:** `/ez:quick fix the {N} test failures discovered in phase ${phase_number}`
-{if blocked tests:}
-**Resolve test blockers:** {description of what's needed}
-{otherwise:}
-**All tests passing!** Phase ${phase_number} is fully tested.
----
-**Also available:**
-- `/ez:add-tests {next_phase}` — test another phase
-- `/ez:verify-work {phase_number}` — run UAT verification
----
-```
-</step>
-</process>
-<success_criteria>
-- [ ] Phase artifacts loaded (SUMMARY.md, CONTEXT.md, optionally VERIFICATION.md)
-- [ ] All changed files classified into TDD/E2E/Skip categories
-- [ ] Classification presented to user and approved
-- [ ] Project test structure discovered (directories, conventions, runners)
-- [ ] Test plan presented to user and approved
-- [ ] TDD tests generated with arrange/act/assert structure
-- [ ] E2E tests generated targeting user scenarios
-- [ ] All tests executed — no untested tests marked as passing
-- [ ] Bugs discovered by tests flagged (not fixed)
-- [ ] Test files committed with proper message
-- [ ] Coverage gaps documented
-- [ ] Next steps presented to user
-</success_criteria>
+<purpose>
+Generate unit and E2E tests for a completed phase based on its SUMMARY.md, CONTEXT.md, and implementation. Classifies each changed file into TDD (unit), E2E (browser), or Skip categories, presents a test plan for user approval, then generates tests following RED-GREEN conventions.
+Users currently hand-craft `/ez:quick` prompts for test generation after each phase. This workflow standardizes the process with proper classification, quality gates, and gap reporting.
+</purpose>
+<required_reading>
+Read all files referenced by the invoking prompt's execution_context before starting.
+</required_reading>
+<process>
+<step name="parse_arguments">
+Parse `$ARGUMENTS` for:
+- Phase number (integer, decimal, or letter-suffix) → store as `$PHASE_ARG`
+- Remaining text after phase number → store as `$EXTRA_INSTRUCTIONS` (optional)
+Example: `/ez:add-tests 12 focus on edge cases` → `$PHASE_ARG=12`, `$EXTRA_INSTRUCTIONS="focus on edge cases"`
+If no phase argument provided:
+```
+ERROR: Phase number required
+Usage: /ez:add-tests <phase> [additional instructions]
+Example: /ez:add-tests 12
+Example: /ez:add-tests 12 focus on edge cases in the pricing module
+```
+Exit.
+</step>
+<step name="init_context">
+Load phase operation context:
+```bash
+INIT=$(node "$HOME/.claude/ez-agents/bin/ez-tools.cjs" init phase-op "${PHASE_ARG}")
+if [[ "$INIT" == @file:* ]]; then INIT=$(cat "${INIT#@file:}"); fi
+```
+Extract from init JSON: `phase_dir`, `phase_number`, `phase_name`.
+Verify the phase directory exists. If not:
+```
+ERROR: Phase directory not found for phase ${PHASE_ARG}
+Ensure the phase exists in .planning/phases/
+```
+Exit.
+Read the phase artifacts (in order of priority):
+1. `${phase_dir}/*-SUMMARY.md` — what was implemented, files changed
+2. `${phase_dir}/CONTEXT.md` — acceptance criteria, decisions
+3. `${phase_dir}/*-VERIFICATION.md` — user-verified scenarios (if UAT was done)
+If no SUMMARY.md exists:
+```
+ERROR: No SUMMARY.md found for phase ${PHASE_ARG}
+This command works on completed phases. Run /ez:execute-phase first.
+```
+Exit.
+Present banner:
+```
+━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
+ GSD ► ADD TESTS — Phase ${phase_number}: ${phase_name}
+━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
+```
+</step>
+<step name="analyze_implementation">
+Extract the list of files modified by the phase from SUMMARY.md ("Files Changed" or equivalent section).
+For each file, classify into one of three categories:
+| Category | Criteria | Test Type |
+|----------|----------|-----------|
+| **TDD** | Pure functions where `expect(fn(input)).toBe(output)` is writable | Unit tests |
+| **E2E** | UI behavior verifiable by browser automation | Playwright/E2E tests |
+| **Skip** | Not meaningfully testable or already covered | None |
+**TDD classification — apply when:**
+- Business logic: calculations, pricing, tax rules, validation
+- Data transformations: mapping, filtering, aggregation, formatting
+- Parsers: CSV, JSON, XML, custom format parsing
+- Validators: input validation, schema validation, business rules
+- State machines: status transitions, workflow steps
+- Utilities: string manipulation, date handling, number formatting
+**E2E classification — apply when:**
+- Keyboard shortcuts: key bindings, modifier keys, chord sequences
+- Navigation: page transitions, routing, breadcrumbs, back/forward
+- Form interactions: submit, validation errors, field focus, autocomplete
+- Selection: row selection, multi-select, shift-click ranges
+- Drag and drop: reordering, moving between containers
+- Modal dialogs: open, close, confirm, cancel
+- Data grids: sorting, filtering, inline editing, column resize
+**Skip classification — apply when:**
+- UI layout/styling: CSS classes, visual appearance, responsive breakpoints
+- Configuration: config files, environment variables, feature flags
+- Glue code: dependency injection setup, middleware registration, routing tables
+- Migrations: database migrations, schema changes
+- Simple CRUD: basic create/read/update/delete with no business logic
+- Type definitions: records, DTOs, interfaces with no logic
+Read each file to verify classification. Don't classify based on filename alone.
+</step>
+<step name="present_classification">
+Present the classification to the user for confirmation before proceeding:
+```
+AskUserQuestion(
+  header: "Test Classification",
+  question: |
+    ## Files classified for testing
+    ### TDD (Unit Tests) — {N} files
+    {list of files with brief reason}
+    ### E2E (Browser Tests) — {M} files
+    {list of files with brief reason}
+    ### Skip — {K} files
+    {list of files with brief reason}
+    {if $EXTRA_INSTRUCTIONS: "Additional instructions: ${EXTRA_INSTRUCTIONS}"}
+    How would you like to proceed?
+  options:
+    - "Approve and generate test plan"
+    - "Adjust classification (I'll specify changes)"
+    - "Cancel"
+)
+```
+If user selects "Adjust classification": apply their changes and re-present.
+If user selects "Cancel": exit gracefully.
+</step>
+<step name="discover_test_structure">
+Before generating the test plan, discover the project's existing test structure:
+```bash
+# Find existing test directories
+find . -type d -name "*test*" -o -name "*spec*" -o -name "*__tests__*" 2>/dev/null | head -20
+# Find existing test files for convention matching
+find . -type f \( -name "*.test.*" -o -name "*.spec.*" -o -name "*Tests.fs" -o -name "*Test.fs" \) 2>/dev/null | head -20
+# Check for test runners
+ls package.json *.sln 2>/dev/null
+```
+Identify:
+- Test directory structure (where unit tests live, where E2E tests live)
+- Naming conventions (`.test.ts`, `.spec.ts`, `*Tests.fs`, etc.)
+- Test runner commands (how to execute unit tests, how to execute E2E tests)
+- Test framework (xUnit, NUnit, Jest, Playwright, etc.)
+If test structure is ambiguous, ask the user:
+```
+AskUserQuestion(
+  header: "Test Structure",
+  question: "I found multiple test locations. Where should I create tests?",
+  options: [list discovered locations]
+)
+```
+</step>
+<step name="generate_test_plan">
+For each approved file, create a detailed test plan.
+**For TDD files**, plan tests following RED-GREEN-REFACTOR:
+1. Identify testable functions/methods in the file
+2. For each function: list input scenarios, expected outputs, edge cases
+3. Note: since code already exists, tests may pass immediately — that's OK, but verify they test the RIGHT behavior
+**For E2E files**, plan tests following RED-GREEN gates:
+1. Identify user scenarios from CONTEXT.md/VERIFICATION.md
+2. For each scenario: describe the user action, expected outcome, assertions
+3. Note: RED gate means confirming the test would fail if the feature were broken
+Present the complete test plan:
+```
+AskUserQuestion(
+  header: "Test Plan",
+  question: |
+    ## Test Generation Plan
+    ### Unit Tests ({N} tests across {M} files)
+    {for each file: test file path, list of test cases}
+    ### E2E Tests ({P} tests across {Q} files)
+    {for each file: test file path, list of test scenarios}
+    ### Test Commands
+    - Unit: {discovered test command}
+    - E2E: {discovered e2e command}
+    Ready to generate?
+  options:
+    - "Generate all"
+    - "Cherry-pick (I'll specify which)"
+    - "Adjust plan"
+)
+```
+If "Cherry-pick": ask user which tests to include.
+If "Adjust plan": apply changes and re-present.
+</step>
+<step name="execute_tdd_generation">
+For each approved TDD test:
+1. **Create test file** following discovered project conventions (directory, naming, imports)
+2. **Write test** with clear arrange/act/assert structure:
+   ```
+   // Arrange — set up inputs and expected outputs
+   // Act — call the function under test
+   // Assert — verify the output matches expectations
+   ```
+3. **Run the test**:
+   ```bash
+   {discovered test command}
+   ```
+4. **Evaluate result:**
+   - **Test passes**: Good — the implementation satisfies the test. Verify the test checks meaningful behavior (not just that it compiles).
+   - **Test fails with assertion error**: This may be a genuine bug discovered by the test. Flag it:
+     ```
+     ⚠️ Potential bug found: {test name}
+     Expected: {expected}
+     Actual: {actual}
+     File: {implementation file}
+     ```
+     Do NOT fix the implementation — this is a test-generation command, not a fix command. Record the finding.
+   - **Test fails with error (import, syntax, etc.)**: This is a test error. Fix the test and re-run.
+</step>
+<step name="execute_e2e_generation">
+For each approved E2E test:
+1. **Check for existing tests** covering the same scenario:
+   ```bash
+   grep -r "{scenario keyword}" {e2e test directory} 2>/dev/null
+   ```
+   If found, extend rather than duplicate.
+2. **Create test file** targeting the user scenario from CONTEXT.md/VERIFICATION.md
+3. **Run the E2E test**:
+   ```bash
+   {discovered e2e command}
+   ```
+4. **Evaluate result:**
+   - **GREEN (passes)**: Record success
+   - **RED (fails)**: Determine if it's a test issue or a genuine application bug. Flag bugs:
+     ```
+     ⚠️ E2E failure: {test name}
+     Scenario: {description}
+     Error: {error message}
+     ```
+   - **Cannot run**: Report blocker. Do NOT mark as complete.
+     ```
+     🛑 E2E blocker: {reason tests cannot run}
+     ```
+**No-skip rule:** If E2E tests cannot execute (missing dependencies, environment issues), report the blocker and mark the test as incomplete. Never mark success without actually running the test.
+</step>
+<step name="summary_and_commit">
+Create a test coverage report and present to user:
+```
+━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
+ GSD ► TEST GENERATION COMPLETE
+━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
+## Results
+| Category | Generated | Passing | Failing | Blocked |
+|----------|-----------|---------|---------|---------|
+| Unit     | {N}       | {n1}    | {n2}    | {n3}    |
+| E2E      | {M}       | {m1}    | {m2}    | {m3}    |
+## Files Created/Modified
+{list of test files with paths}
+## Coverage Gaps
+{areas that couldn't be tested and why}
+## Bugs Discovered
+{any assertion failures that indicate implementation bugs}
+```
+Record test generation in project state:
+```bash
+node "$HOME/.claude/ez-agents/bin/ez-tools.cjs" state-snapshot
+```
+If there are passing tests to commit:
+```bash
+git add {test files}
+git commit -m "test(phase-${phase_number}): add unit and E2E tests from add-tests command"
+```
+Present next steps:
+```
+---
+## ▶ Next Up
+{if bugs discovered:}
+**Fix discovered bugs:** `/ez:quick fix the {N} test failures discovered in phase ${phase_number}`
+{if blocked tests:}
+**Resolve test blockers:** {description of what's needed}
+{otherwise:}
+**All tests passing!** Phase ${phase_number} is fully tested.
+---
+**Also available:**
+- `/ez:add-tests {next_phase}` — test another phase
+- `/ez:verify-work {phase_number}` — run UAT verification
+---
+```
+</step>
+</process>
+<success_criteria>
+- [ ] Phase artifacts loaded (SUMMARY.md, CONTEXT.md, optionally VERIFICATION.md)
+- [ ] All changed files classified into TDD/E2E/Skip categories
+- [ ] Classification presented to user and approved
+- [ ] Project test structure discovered (directories, conventions, runners)
+- [ ] Test plan presented to user and approved
+- [ ] TDD tests generated with arrange/act/assert structure
+- [ ] E2E tests generated targeting user scenarios
+- [ ] All tests executed — no untested tests marked as passing
+- [ ] Bugs discovered by tests flagged (not fixed)
+- [ ] Test files committed with proper message
+- [ ] Coverage gaps documented
+- [ ] Next steps presented to user
+</success_criteria>