@undeemed/get-shit-done-codex 1.20.8 → 1.20.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -0,0 +1,350 @@
+ <purpose>
+ Generate unit and E2E tests for a completed phase based on its SUMMARY.md, CONTEXT.md, and implementation. Classifies each changed file into TDD (unit), E2E (browser), or Skip categories, presents a test plan for user approval, then generates tests following RED-GREEN conventions.
+
+ Users currently hand-craft `/gsd:quick` prompts for test generation after each phase. This workflow standardizes the process with proper classification, quality gates, and gap reporting.
+ </purpose>
+
+ <required_reading>
+ Read all files referenced by the invoking prompt's execution_context before starting.
+ </required_reading>
+
+ <process>
+
+ <step name="parse_arguments">
+ Parse `$ARGUMENTS` for:
+ - Phase number (integer, decimal, or letter-suffix) → store as `$PHASE_ARG`
+ - Remaining text after the phase number → store as `$EXTRA_INSTRUCTIONS` (optional)
+
+ Example: `/gsd:add-tests 12 focus on edge cases` → `$PHASE_ARG=12`, `$EXTRA_INSTRUCTIONS="focus on edge cases"`
+
+ If no phase argument is provided:
+
+ ```
+ ERROR: Phase number required
+ Usage: /gsd:add-tests <phase> [additional instructions]
+ Example: /gsd:add-tests 12
+ Example: /gsd:add-tests 12 focus on edge cases in the pricing module
+ ```
+
+ Exit.
+ </step>
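The split above can be sketched in plain shell. The `ARGUMENTS` value and the phase-number pattern are assumptions for illustration (here "letter-suffix" is taken to mean a single trailing lowercase letter, as in `12a`):

```shell
# Minimal sketch, not the implementation: split $ARGUMENTS into
# $PHASE_ARG (first token) and $EXTRA_INSTRUCTIONS (the rest).
ARGUMENTS="12 focus on edge cases"

PHASE_ARG="${ARGUMENTS%% *}"                  # first whitespace-delimited token
EXTRA_INSTRUCTIONS="${ARGUMENTS#"$PHASE_ARG"}"
EXTRA_INSTRUCTIONS="${EXTRA_INSTRUCTIONS# }"  # trim the separating space

# Accept integers (12), decimals (12.5), or a letter suffix (12a)
if ! printf '%s' "$PHASE_ARG" | grep -Eq '^[0-9]+(\.[0-9]+)?[a-z]?$'; then
  echo "ERROR: Phase number required" >&2
  exit 1
fi
```

With no trailing text (`ARGUMENTS="12"`), `EXTRA_INSTRUCTIONS` simply comes out empty.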
+
+ <step name="init_context">
+ Load phase operation context:
+
+ ```bash
+ INIT=$(node ~/.claude/get-shit-done/bin/gsd-tools.cjs init phase-op "${PHASE_ARG}")
+ ```
+
+ Extract from init JSON: `phase_dir`, `phase_number`, `phase_name`.
+
+ Verify the phase directory exists. If not:
+ ```
+ ERROR: Phase directory not found for phase ${PHASE_ARG}
+ Ensure the phase exists in .planning/phases/
+ ```
+ Exit.
+
+ Read the phase artifacts (in order of priority):
+ 1. `${phase_dir}/*-SUMMARY.md` — what was implemented, files changed
+ 2. `${phase_dir}/CONTEXT.md` — acceptance criteria, decisions
+ 3. `${phase_dir}/*-VERIFICATION.md` — user-verified scenarios (if UAT was done)
+
+ If no SUMMARY.md exists:
+ ```
+ ERROR: No SUMMARY.md found for phase ${PHASE_ARG}
+ This command works on completed phases. Run /gsd:execute-phase first.
+ ```
+ Exit.
+
+ Present banner:
+ ```
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
+ GSD ► ADD TESTS — Phase ${phase_number}: ${phase_name}
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
+ ```
+ </step>
+
+ <step name="analyze_implementation">
+ Extract the list of files modified by the phase from SUMMARY.md ("Files Changed" or equivalent section).
+
+ For each file, classify into one of three categories:
+
+ | Category | Criteria | Test Type |
+ |----------|----------|-----------|
+ | **TDD** | Pure functions where `expect(fn(input)).toBe(output)` is writable | Unit tests |
+ | **E2E** | UI behavior verifiable by browser automation | Playwright/E2E tests |
+ | **Skip** | Not meaningfully testable or already covered | None |
+
+ **TDD classification — apply when:**
+ - Business logic: calculations, pricing, tax rules, validation
+ - Data transformations: mapping, filtering, aggregation, formatting
+ - Parsers: CSV, JSON, XML, custom format parsing
+ - Validators: input validation, schema validation, business rules
+ - State machines: status transitions, workflow steps
+ - Utilities: string manipulation, date handling, number formatting
+
+ **E2E classification — apply when:**
+ - Keyboard shortcuts: key bindings, modifier keys, chord sequences
+ - Navigation: page transitions, routing, breadcrumbs, back/forward
+ - Form interactions: submit, validation errors, field focus, autocomplete
+ - Selection: row selection, multi-select, shift-click ranges
+ - Drag and drop: reordering, moving between containers
+ - Modal dialogs: open, close, confirm, cancel
+ - Data grids: sorting, filtering, inline editing, column resize
+
+ **Skip classification — apply when:**
+ - UI layout/styling: CSS classes, visual appearance, responsive breakpoints
+ - Configuration: config files, environment variables, feature flags
+ - Glue code: dependency injection setup, middleware registration, routing tables
+ - Migrations: database migrations, schema changes
+ - Simple CRUD: basic create/read/update/delete with no business logic
+ - Type definitions: records, DTOs, interfaces with no logic
+
+ Read each file to verify classification. Don't classify based on filename alone.
+ </step>
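As a rough illustration only, a filename-based first pass might seed the triage before each file is read. The path patterns below are invented for the example; the step above makes the final call from file contents, never from names:

```shell
# Hypothetical filename triage: a seed for the real classification,
# which must read each file's contents before deciding.
classify_guess() {
  case "$1" in
    *.css|*.scss|*config*|*migration*|*.d.ts) echo "Skip" ;;
    *component*|*page*|*e2e*)                 echo "E2E"  ;;
    *)                                        echo "TDD"  ;;
  esac
}

classify_guess "src/pricing/tax.ts"
classify_guess "src/checkout/checkout.component.ts"
classify_guess "db/migrations/0042_add_index.sql"
```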
+
+ <step name="present_classification">
+ Present the classification to the user for confirmation before proceeding:
+
+ ```
+ AskUserQuestion(
+ header: "Test Classification",
+ question: |
+ ## Files classified for testing
+
+ ### TDD (Unit Tests) — {N} files
+ {list of files with brief reason}
+
+ ### E2E (Browser Tests) — {M} files
+ {list of files with brief reason}
+
+ ### Skip — {K} files
+ {list of files with brief reason}
+
+ {if $EXTRA_INSTRUCTIONS: "Additional instructions: ${EXTRA_INSTRUCTIONS}"}
+
+ How would you like to proceed?
+ options:
+ - "Approve and generate test plan"
+ - "Adjust classification (I'll specify changes)"
+ - "Cancel"
+ )
+ ```
+
+ If user selects "Adjust classification": apply their changes and re-present.
+ If user selects "Cancel": exit gracefully.
+ </step>
+
+ <step name="discover_test_structure">
+ Before generating the test plan, discover the project's existing test structure:
+
+ ```bash
+ # Find existing test directories
+ find . -type d \( -name "*test*" -o -name "*spec*" -o -name "*__tests__*" \) 2>/dev/null | head -20
+ # Find existing test files for convention matching
+ find . -type f \( -name "*.test.*" -o -name "*.spec.*" -o -name "*Tests.fs" -o -name "*Test.fs" \) 2>/dev/null | head -20
+ # Check for test runners
+ ls package.json *.sln 2>/dev/null
+ ```
+
+ Identify:
+ - Test directory structure (where unit tests live, where E2E tests live)
+ - Naming conventions (`.test.ts`, `.spec.ts`, `*Tests.fs`, etc.)
+ - Test runner commands (how to execute unit tests, how to execute E2E tests)
+ - Test framework (xUnit, NUnit, Jest, Playwright, etc.)
+
+ If test structure is ambiguous, ask the user:
+ ```
+ AskUserQuestion(
+ header: "Test Structure",
+ question: "I found multiple test locations. Where should I create tests?",
+ options: [list discovered locations]
+ )
+ ```
+ </step>
+
+ <step name="generate_test_plan">
+ For each approved file, create a detailed test plan.
+
+ **For TDD files**, plan tests following RED-GREEN-REFACTOR:
+ 1. Identify testable functions/methods in the file
+ 2. For each function: list input scenarios, expected outputs, edge cases
+ 3. Note: since code already exists, tests may pass immediately — that's OK, but verify they test the RIGHT behavior
+
+ **For E2E files**, plan tests following RED-GREEN gates:
+ 1. Identify user scenarios from CONTEXT.md/VERIFICATION.md
+ 2. For each scenario: describe the user action, expected outcome, assertions
+ 3. Note: RED gate means confirming the test would fail if the feature were broken
+
+ Present the complete test plan:
+
+ ```
+ AskUserQuestion(
+ header: "Test Plan",
+ question: |
+ ## Test Generation Plan
+
+ ### Unit Tests ({N} tests across {M} files)
+ {for each file: test file path, list of test cases}
+
+ ### E2E Tests ({P} tests across {Q} files)
+ {for each file: test file path, list of test scenarios}
+
+ ### Test Commands
+ - Unit: {discovered test command}
+ - E2E: {discovered e2e command}
+
+ Ready to generate?
+ options:
+ - "Generate all"
+ - "Cherry-pick (I'll specify which)"
+ - "Adjust plan"
+ )
+ ```
+
+ If "Cherry-pick": ask user which tests to include.
+ If "Adjust plan": apply changes and re-present.
+ </step>
+
+ <step name="execute_tdd_generation">
+ For each approved TDD test:
+
+ 1. **Create test file** following discovered project conventions (directory, naming, imports)
+
+ 2. **Write test** with clear arrange/act/assert structure:
+ ```
+ // Arrange — set up inputs and expected outputs
+ // Act — call the function under test
+ // Assert — verify the output matches expectations
+ ```
+
+ 3. **Run the test**:
+ ```bash
+ {discovered test command}
+ ```
+
+ 4. **Evaluate result:**
+ - **Test passes**: Good — the implementation satisfies the test. Verify the test checks meaningful behavior (not just that it compiles).
+ - **Test fails with assertion error**: This may be a genuine bug discovered by the test. Flag it:
+ ```
+ ⚠️ Potential bug found: {test name}
+ Expected: {expected}
+ Actual: {actual}
+ File: {implementation file}
+ ```
+ Do NOT fix the implementation — this is a test-generation command, not a fix command. Record the finding.
+ - **Test fails with error (import, syntax, etc.)**: This is a test error. Fix the test and re-run.
+ </step>
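The arrange/act/assert shape can be sketched as a plain shell test. The `add_vat` helper is made up for the example; real generated tests use whatever framework discovery found (Jest, xUnit, etc.):

```shell
# Hypothetical function under test: stands in for real project code.
add_vat() { echo $(( $1 * 120 / 100 )); }

test_add_vat_applies_20_percent() {
  # Arrange — set up inputs and expected outputs
  input=100
  expected=120
  # Act — call the function under test
  actual=$(add_vat "$input")
  # Assert — verify the output matches expectations
  [ "$actual" -eq "$expected" ] || { echo "FAIL: expected $expected, got $actual"; return 1; }
  echo "PASS: add_vat applies 20% VAT"
}

test_add_vat_applies_20_percent
```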
239
+
240
+ <step name="execute_e2e_generation">
241
+ For each approved E2E test:
242
+
243
+ 1. **Check for existing tests** covering the same scenario:
244
+ ```bash
245
+ grep -r "{scenario keyword}" {e2e test directory} 2>/dev/null
246
+ ```
247
+ If found, extend rather than duplicate.
248
+
249
+ 2. **Create test file** targeting the user scenario from CONTEXT.md/VERIFICATION.md
250
+
251
+ 3. **Run the E2E test**:
252
+ ```bash
253
+ {discovered e2e command}
254
+ ```
255
+
256
+ 4. **Evaluate result:**
257
+ - **GREEN (passes)**: Record success
258
+ - **RED (fails)**: Determine if it's a test issue or a genuine application bug. Flag bugs:
259
+ ```
260
+ ⚠️ E2E failure: {test name}
261
+ Scenario: {description}
262
+ Error: {error message}
263
+ ```
264
+ - **Cannot run**: Report blocker. Do NOT mark as complete.
265
+ ```
266
+ 🛑 E2E blocker: {reason tests cannot run}
267
+ ```
268
+
269
+ **No-skip rule:** If E2E tests cannot execute (missing dependencies, environment issues), report the blocker and mark the test as incomplete. Never mark success without actually running the test.
270
+ </step>
271
+
272
+ <step name="summary_and_commit">
273
+ Create a test coverage report and present to user:
274
+
275
+ ```
276
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
277
+ GSD ► TEST GENERATION COMPLETE
278
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
279
+
280
+ ## Results
281
+
282
+ | Category | Generated | Passing | Failing | Blocked |
283
+ |----------|-----------|---------|---------|---------|
284
+ | Unit | {N} | {n1} | {n2} | {n3} |
285
+ | E2E | {M} | {m1} | {m2} | {m3} |
286
+
287
+ ## Files Created/Modified
288
+ {list of test files with paths}
289
+
290
+ ## Coverage Gaps
291
+ {areas that couldn't be tested and why}
292
+
293
+ ## Bugs Discovered
294
+ {any assertion failures that indicate implementation bugs}
295
+ ```
296
+
297
+ Record test generation in project state:
298
+ ```bash
299
+ node ~/.claude/get-shit-done/bin/gsd-tools.cjs state-snapshot
300
+ ```
301
+
302
+ If there are passing tests to commit:
303
+
304
+ ```bash
305
+ git add {test files}
306
+ git commit -m "test(phase-${phase_number}): add unit and E2E tests from add-tests command"
307
+ ```
308
+
309
+ Present next steps:
310
+
311
+ ```
312
+ ---
313
+
314
+ ## ▶ Next Up
315
+
316
+ {if bugs discovered:}
317
+ **Fix discovered bugs:** `/gsd:quick fix the {N} test failures discovered in phase ${phase_number}`
318
+
319
+ {if blocked tests:}
320
+ **Resolve test blockers:** {description of what's needed}
321
+
322
+ {otherwise:}
323
+ **All tests passing!** Phase ${phase_number} is fully tested.
324
+
325
+ ---
326
+
327
+ **Also available:**
328
+ - `/gsd:add-tests {next_phase}` — test another phase
329
+ - `/gsd:verify-work {phase_number}` — run UAT verification
330
+
331
+ ---
332
+ ```
333
+ </step>
334
+
335
+ </process>
336
+
337
+ <success_criteria>
338
+ - [ ] Phase artifacts loaded (SUMMARY.md, CONTEXT.md, optionally VERIFICATION.md)
339
+ - [ ] All changed files classified into TDD/E2E/Skip categories
340
+ - [ ] Classification presented to user and approved
341
+ - [ ] Project test structure discovered (directories, conventions, runners)
342
+ - [ ] Test plan presented to user and approved
343
+ - [ ] TDD tests generated with arrange/act/assert structure
344
+ - [ ] E2E tests generated targeting user scenarios
345
+ - [ ] All tests executed — no untested tests marked as passing
346
+ - [ ] Bugs discovered by tests flagged (not fixed)
347
+ - [ ] Test files committed with proper message
348
+ - [ ] Coverage gaps documented
349
+ - [ ] Next steps presented to user
350
+ </success_criteria>
@@ -438,6 +438,67 @@ rm .planning/REQUIREMENTS.md
 
 </step>
 
+ <step name="write_retrospective">
+
+ **Append to the living retrospective:**
+
+ Check for an existing retrospective:
+ ```bash
+ ls .planning/RETROSPECTIVE.md 2>/dev/null
+ ```
+
+ **If it exists:** Read the file and append the new milestone section before the "## Cross-Milestone Trends" section.
+
+ **If it doesn't exist:** Create it from the template at `~/.claude/get-shit-done/templates/retrospective.md`.
+
+ **Gather retrospective data:**
+
+ 1. From SUMMARY.md files: Extract key deliverables, one-liners, tech decisions
+ 2. From VERIFICATION.md files: Extract verification scores, gaps found
+ 3. From UAT.md files: Extract test results, issues found
+ 4. From git log: Count commits, calculate timeline
+ 5. From the milestone work: Reflect on what worked and what didn't
+
+ **Write the milestone section:**
+
+ ```markdown
+ ## Milestone: v{version} — {name}
+
+ **Shipped:** {date}
+ **Phases:** {phase_count} | **Plans:** {plan_count}
+
+ ### What Was Built
+ {Extract from SUMMARY.md one-liners}
+
+ ### What Worked
+ {Patterns that led to smooth execution}
+
+ ### What Was Inefficient
+ {Missed opportunities, rework, bottlenecks}
+
+ ### Patterns Established
+ {New conventions discovered during this milestone}
+
+ ### Key Lessons
+ {Specific, actionable takeaways}
+
+ ### Cost Observations
+ - Model mix: {X}% opus, {Y}% sonnet, {Z}% haiku
+ - Sessions: {count}
+ - Notable: {efficiency observation}
+ ```
+
+ **Update cross-milestone trends:**
+
+ If the "## Cross-Milestone Trends" section exists, update the tables with new data from this milestone.
+
+ **Commit:**
+ ```bash
+ node ~/.claude/get-shit-done/bin/gsd-tools.cjs commit "docs: update retrospective for v${VERSION}" --files .planning/RETROSPECTIVE.md
+ ```
+
+ </step>
+
 <step name="update_state">
 
 Most STATE.md updates were handled by `milestone complete`, but verify and update remaining fields:
@@ -695,6 +756,8 @@ Milestone completion is successful when:
 - [ ] Requirements completion checked against REQUIREMENTS.md traceability table
 - [ ] Incomplete requirements surfaced with proceed/audit/abort options
 - [ ] Known gaps recorded in MILESTONES.md if user proceeded with incomplete requirements
+ - [ ] RETROSPECTIVE.md updated with milestone section
+ - [ ] Cross-milestone trends updated
 - [ ] User knows next step (/gsd:new-milestone)
 
 </success_criteria>
@@ -107,6 +107,8 @@ Phase: "API documentation"
 
 <process>
 
+ **Express path available:** If you already have a PRD or acceptance criteria document, use `/gsd:plan-phase {phase} --prd path/to/prd.md` to skip this discussion and go straight to planning.
+
 <step name="initialize" priority="first">
 Phase number from argument (required).
 
@@ -99,6 +99,8 @@ Create detailed execution plan for a specific phase.
 Usage: `/gsd:plan-phase 1`
 Result: Creates `.planning/phases/01-foundation/01-01-PLAN.md`
 
+ **PRD Express Path:** Pass `--prd path/to/requirements.md` to skip discuss-phase entirely. Your PRD becomes locked decisions in CONTEXT.md. Useful when you already have clear acceptance criteria.
+
 ### Execution
 
 **`/gsd:execute-phase <phase-number>`**
@@ -351,6 +353,7 @@ Usage: `/gsd:join-discord`
 ├── PROJECT.md # Project vision
 ├── ROADMAP.md # Current phase breakdown
 ├── STATE.md # Project memory & context
+ ├── RETROSPECTIVE.md # Living retrospective (updated per milestone)
 ├── config.json # Workflow mode & gates
 ├── todos/ # Captured ideas and tasks
 │ ├── pending/ # Todos waiting to be worked on
package/package.json CHANGED
@@ -1,6 +1,6 @@
 {
 "name": "@undeemed/get-shit-done-codex",
- "version": "1.20.8",
+ "version": "1.20.10",
 "description": "A meta-prompting, context engineering and spec-driven development system for OpenAI Codex (CLI and Desktop). Fork of get-shit-done by TÂCHES, adapted for Codex by undeemed.",
 "bin": {
 "get-shit-done-codex": "bin/install.js"
@@ -9,6 +9,7 @@
 "access": "public"
 },
 "files": [
+ "README.md",
 "bin",
 "commands",
 "get-shit-done",