npm - @undeemed/get-shit-done-codex - Versions diffs - 1.23.2 → 1.24.2 - Mend

@undeemed/get-shit-done-codex 1.23.2 → 1.24.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (46) hide show

package/README.md +51 -5
package/agents/gsd-debugger.md +8 -56
package/agents/gsd-planner.md +2 -118
package/agents/gsd-project-researcher.md +0 -3
package/agents/gsd-research-synthesizer.md +0 -3
package/bin/install.js +267 -5
package/commands/gsd/add-phase.md +2 -6
package/commands/gsd/add-todo.md +1 -6
package/commands/gsd/check-todos.md +2 -6
package/commands/gsd/debug.md +1 -6
package/commands/gsd/discuss-phase.md +16 -9
package/commands/gsd/execute-phase.md +2 -1
package/commands/gsd/new-milestone.md +8 -1
package/commands/gsd/pause-work.md +1 -4
package/commands/gsd/plan-phase.md +1 -2
package/commands/gsd/research-phase.md +15 -17
package/commands/gsd/verify-work.md +2 -1
package/get-shit-done/bin/gsd-tools.cjs +4951 -121
package/get-shit-done/bin/lib/commands.cjs +4 -9
package/get-shit-done/bin/lib/core.cjs +102 -23
package/get-shit-done/bin/lib/init.cjs +11 -11
package/get-shit-done/bin/lib/milestone.cjs +54 -3
package/get-shit-done/bin/lib/phase.cjs +40 -10
package/get-shit-done/bin/lib/state.cjs +86 -33
package/get-shit-done/references/checkpoints.md +0 -1
package/get-shit-done/references/model-profile-resolution.md +13 -6
package/get-shit-done/references/model-profiles.md +60 -51
package/get-shit-done/templates/context.md +14 -0
package/get-shit-done/templates/phase-prompt.md +0 -2
package/get-shit-done/workflows/audit-milestone.md +8 -63
package/get-shit-done/workflows/diagnose-issues.md +1 -1
package/get-shit-done/workflows/execute-phase.md +9 -54
package/get-shit-done/workflows/execute-plan.md +13 -17
package/get-shit-done/workflows/help.md +3 -3
package/get-shit-done/workflows/map-codebase.md +44 -32
package/get-shit-done/workflows/new-milestone.md +7 -16
package/get-shit-done/workflows/new-project.md +80 -49
package/get-shit-done/workflows/progress.md +26 -14
package/get-shit-done/workflows/quick.md +15 -24
package/get-shit-done/workflows/set-profile.md +12 -8
package/get-shit-done/workflows/settings.md +14 -21
package/get-shit-done/workflows/transition.md +0 -5
package/get-shit-done/workflows/verify-work.md +12 -11
package/hooks/dist/gsd-context-monitor.js +1 -1
package/package.json +3 -2
package/scripts/run-tests.cjs +43 -0

package/get-shit-done/workflows/execute-phase.md CHANGED Viewed

@@ -106,7 +106,7 @@ Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`
      prompt="
        <objective>
        Execute plan {plan_number} of phase {phase_number}-{phase_name}.
-       Commit each task atomically. Create SUMMARY.md. Update STATE.md and ROADMAP.md.
+       Commit each task atomically. Create SUMMARY.md. Update STATE.md.
        </objective>
        <execution_context>
@@ -118,11 +118,9 @@ Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`
        <files_to_read>
        Read these files at execution start using the Read tool:
-       - {phase_dir}/{plan_file} (Plan)
-       - .planning/STATE.md (State)
-       - .planning/config.json (Config, if exists)
-       - ./CODEX.md (Project instructions, if exists — follow project-specific guidelines and coding conventions)
-       - .agents/skills/ (Project skills, if exists — list skills, read SKILL.md for each, follow relevant rules during implementation)
+       - Plan: {phase_dir}/{plan_file}
+       - State: .planning/STATE.md
+       - Config: .planning/config.json (if exists)
        </files_to_read>
        <success_criteria>
@@ -130,7 +128,6 @@ Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`
        - [ ] Each task committed individually
        - [ ] SUMMARY.md created in plan directory
        - [ ] STATE.md updated with position and decisions
-       - [ ] ROADMAP.md updated with plan progress (via `roadmap update-plan-progress`)
        </success_criteria>
      "
    )
@@ -165,7 +162,7 @@ Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`
 5. **Handle failures:**
-   **Known Codex Code bug (classifyHandoffIfNeeded):** If an agent reports "failed" with error containing `classifyHandoffIfNeeded is not defined`, this is a Codex Code runtime bug — not a GSD or agent issue. The error fires in the completion handler AFTER all tool calls finish. In this case: run the same spot-checks as step 4 (SUMMARY.md exists, git commits present, no Self-Check: FAILED). If spot-checks PASS → treat as **successful**. If spot-checks FAIL → treat as real failure below.
+   **Known Codex CLI bug (classifyHandoffIfNeeded):** If an agent reports "failed" with error containing `classifyHandoffIfNeeded is not defined`, this is a Codex CLI runtime bug — not a GSD or agent issue. The error fires in the completion handler AFTER all tool calls finish. In this case: run the same spot-checks as step 4 (SUMMARY.md exists, git commits present, no Self-Check: FAILED). If spot-checks PASS → treat as **successful**. If spot-checks FAIL → treat as real failure below.
    For real failures: report which plan failed → ask "Continue?" or "Stop?" → if continue, dependent plans may also fail. If stop, partial completion report.
@@ -177,19 +174,7 @@ Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`
 <step name="checkpoint_handling">
 Plans with `autonomous: false` require user interaction.
-**Auto-mode checkpoint handling:**
-Read auto-advance config:
-```bash
-AUTO_CFG=$(node ~/.codex/get-shit-done/bin/gsd-tools.cjs config-get workflow.auto_advance 2>/dev/null || echo "false")
-```
-When executor returns a checkpoint AND `AUTO_CFG` is `"true"`:
-- **human-verify** → Auto-spawn continuation agent with `{user_response}` = `"approved"`. Log `⚡ Auto-approved checkpoint`.
-- **decision** → Auto-spawn continuation agent with `{user_response}` = first option from checkpoint details. Log `⚡ Auto-selected: [option]`.
-- **human-action** → Present to user (existing behavior below). Auth gates cannot be automated.
-**Standard flow (not auto-mode, or human-action type):**
+**Flow:**
 1. Spawn agent for checkpoint plan
 2. Agent runs until checkpoint task or auth gate → returns structured state
@@ -294,19 +279,12 @@ node ~/.codex/get-shit-done/bin/gsd-tools.cjs commit "docs(phase-${PARENT_PHASE}
 <step name="verify_phase_goal">
 Verify phase achieved its GOAL, not just completed tasks.
-```bash
-PHASE_REQ_IDS=$(node ~/.codex/get-shit-done/bin/gsd-tools.cjs roadmap get-phase "${PHASE_NUMBER}" | jq -r '.section' | grep -i "Requirements:" | sed 's/.*Requirements:\*\*\s*//' | sed 's/[\[\]]//g')
-```
 ```
 Task(
   prompt="Verify phase {phase_number} goal achievement.
 Phase directory: {phase_dir}
 Phase goal: {goal from ROADMAP.md}
-Phase requirement IDs: {phase_req_ids}
-Check must_haves against actual codebase.
-Cross-reference requirement IDs from PLAN frontmatter against REQUIREMENTS.md — every ID MUST be accounted for.
-Create VERIFICATION.md.",
+Check must_haves against actual codebase. Create VERIFICATION.md.",
   subagent_type="gsd-verifier",
   model="{verifier_model}"
 )
@@ -375,7 +353,7 @@ The CLI handles:
 Extract from result: `next_phase`, `next_phase_name`, `is_last_phase`.
 ```bash
-node ~/.codex/get-shit-done/bin/gsd-tools.cjs commit "docs(phase-{X}): complete phase execution" --files .planning/ROADMAP.md .planning/STATE.md .planning/REQUIREMENTS.md {phase_dir}/*-VERIFICATION.md
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs commit "docs(phase-{X}): complete phase execution" --files .planning/ROADMAP.md .planning/STATE.md .planning/REQUIREMENTS.md .planning/phases/{phase_dir}/*-VERIFICATION.md
 ```
 </step>
@@ -383,29 +361,6 @@ node ~/.codex/get-shit-done/bin/gsd-tools.cjs commit "docs(phase-{X}): complete
 **Exception:** If `gaps_found`, the `verify_phase_goal` step already presents the gap-closure path (`$gsd-plan-phase {X} --gaps`). No additional routing needed — skip auto-advance.
-**No-transition check (spawned by auto-advance chain):**
-Parse `--no-transition` flag from $ARGUMENTS.
-**If `--no-transition` flag present:**
-Execute-phase was spawned by plan-phase's auto-advance. Do NOT run transition.md.
-After verification passes and roadmap is updated, return completion status to parent:
-```
-## PHASE COMPLETE
-Phase: ${PHASE_NUMBER} - ${PHASE_NAME}
-Plans: ${completed_count}/${total_count}
-Verification: {Passed | Gaps Found}
-[Include aggregate_results output]
-```
-STOP. Do not proceed to auto-advance or transition.
-**If `--no-transition` flag is NOT present:**
 **Auto-advance detection:**
 1. Parse `--auto` flag from $ARGUMENTS
@@ -439,7 +394,7 @@ Orchestrator: ~10-15% context. Subagents: fresh 200k each. No polling (Task bloc
 </context_efficiency>
 <failure_handling>
-- **classifyHandoffIfNeeded false failure:** Agent reports "failed" but error is `classifyHandoffIfNeeded is not defined` → Codex Code bug, not GSD. Spot-check (SUMMARY exists, commits present) → if pass, treat as success
+- **classifyHandoffIfNeeded false failure:** Agent reports "failed" but error is `classifyHandoffIfNeeded is not defined` → Codex CLI bug, not GSD. Spot-check (SUMMARY exists, commits present) → if pass, treat as success
 - **Agent fails mid-plan:** Missing SUMMARY.md → report, ask user how to proceed
 - **Dependency chain breaks:** Wave 1 fails → Wave 2 dependents likely fail → user chooses attempt or skip
 - **All agents in wave fail:** Systemic issue → stop, report for investigation

package/get-shit-done/workflows/execute-plan.md CHANGED Viewed

@@ -12,13 +12,19 @@ Read config.json for planning behavior settings.
 <process>
 <step name="init_context" priority="first">
-Load execution context (paths only to minimize orchestrator context):
+Load execution context (uses `init execute-phase` for full context, including file contents):
 ```bash
-INIT=$(node ~/.codex/get-shit-done/bin/gsd-tools.cjs init execute-phase "${PHASE}")
+INIT=$(node ~/.codex/get-shit-done/bin/gsd-tools.cjs init execute-phase "${PHASE}" --include state,config)
 ```
-Extract from init JSON: `executor_model`, `commit_docs`, `phase_dir`, `phase_number`, `plans`, `summaries`, `incomplete_plans`, `state_path`, `config_path`.
+Extract from init JSON: `executor_model`, `commit_docs`, `phase_dir`, `phase_number`, `plans`, `summaries`, `incomplete_plans`.
+**File contents (from --include):** `state_content`, `config_content`. Access with:
+```bash
+STATE_CONTENT=$(echo "$INIT" | jq -r '.state_content // empty')
+CONFIG_CONTENT=$(echo "$INIT" | jq -r '.config_content // empty')
+```
 If `.planning/` missing: error.
 </step>
@@ -34,7 +40,7 @@ Find first PLAN without matching SUMMARY. Decimal phases supported (`01.1-hotfix
 ```bash
 PHASE=$(echo "$PLAN_PATH" | grep -oE '[0-9]+(\.[0-9]+)?-[0-9]+')
-# config settings can be fetched via gsd-tools config-get if needed
+# config_content already loaded via --include config in init_context
 ```
 <if mode="yolo">
@@ -106,7 +112,7 @@ Pattern B only (verify-only checkpoints). Skip for A/C.
    - Check `git log --oneline --all --grep="{phase}-{plan}"` returns ≥1 commit
    - Append `## Self-Check: PASSED` or `## Self-Check: FAILED` to SUMMARY
-   **Known Codex Code bug (classifyHandoffIfNeeded):** If any segment agent reports "failed" with `classifyHandoffIfNeeded is not defined`, this is a Codex Code runtime bug — not a real failure. Run spot-checks; if they pass, treat as successful.
+   **Known Codex CLI bug (classifyHandoffIfNeeded):** If any segment agent reports "failed" with `classifyHandoffIfNeeded is not defined`, this is a Codex CLI runtime bug — not a real failure. Run spot-checks; if they pass, treat as successful.
@@ -316,7 +322,7 @@ If user_setup exists: create `{phase}-USER-SETUP.md` using template `~/.codex/ge
 <step name="create_summary">
 Create `{phase}-{plan}-SUMMARY.md` at `.planning/phases/XX-name/`. Use `~/.codex/get-shit-done/templates/summary.md`.
-**Frontmatter:** phase, plan, subsystem, tags | requires/provides/affects | tech-stack.added/patterns | key-files.created/modified | key-decisions | requirements-completed (**MUST** copy `requirements` array from PLAN.md frontmatter verbatim) | duration ($DURATION), completed ($PLAN_END_TIME date).
+**Frontmatter:** phase, plan, subsystem, tags | requires/provides/affects | tech-stack.added/patterns | key-files.created/modified | key-decisions | duration ($DURATION), completed ($PLAN_END_TIME date).
 Title: `# Phase [X] Plan [Y]: [Name] Summary`
@@ -380,21 +386,11 @@ node ~/.codex/get-shit-done/bin/gsd-tools.cjs roadmap update-plan-progress "${PH
 Counts PLAN vs SUMMARY files on disk. Updates progress table row with correct count and status (`In Progress` or `Complete` with date).
 </step>
-<step name="update_requirements">
-Mark completed requirements from the PLAN.md frontmatter `requirements:` field:
-```bash
-node ~/.codex/get-shit-done/bin/gsd-tools.cjs requirements mark-complete ${REQ_IDS}
-```
-Extract requirement IDs from the plan's frontmatter (e.g., `requirements: [AUTH-01, AUTH-02]`). If no requirements field, skip.
-</step>
 <step name="git_commit_metadata">
 Task code already committed per-task. Commit plan metadata:
 ```bash
-node ~/.codex/get-shit-done/bin/gsd-tools.cjs commit "docs({phase}-{plan}): complete [plan-name] plan" --files .planning/phases/XX-name/{phase}-{plan}-SUMMARY.md .planning/STATE.md .planning/ROADMAP.md .planning/REQUIREMENTS.md
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs commit "docs({phase}-{plan}): complete [plan-name] plan" --files .planning/phases/XX-name/{phase}-{plan}-SUMMARY.md .planning/STATE.md .planning/ROADMAP.md
 ```
 </step>

package/get-shit-done/workflows/help.md CHANGED Viewed

@@ -309,9 +309,9 @@ Usage: `$gsd-settings`
 **`$gsd-set-profile <profile>`**
 Quick switch model profile for GSD agents.
-- `quality` — Opus everywhere except verification
-- `balanced` — Opus for planning, Sonnet for execution (default)
-- `budget` — Sonnet for writing, Haiku for research/verification
+- `quality` — xhigh thinking for decision-makers, high for analysis agents
+- `balanced` — xhigh thinking for planner/debugger, high/medium for others (default)
+- `budget` — minimal thinking — high for planner/debugger, medium everywhere else
 Usage: `$gsd-set-profile budget`

package/get-shit-done/workflows/map-codebase.md CHANGED Viewed

@@ -90,13 +90,17 @@ Use Task tool with `subagent_type="gsd-codebase-mapper"`, `model="{mapper_model}
 **Agent 1: Tech Focus**
+Task tool parameters:
 ```
-Task(
-  subagent_type="gsd-codebase-mapper",
-  model="{mapper_model}",
-  run_in_background=true,
-  description="Map codebase tech stack",
-  prompt="Focus: tech
+subagent_type: "gsd-codebase-mapper"
+model: "{mapper_model}"
+run_in_background: true
+description: "Map codebase tech stack"
+```
+Prompt:
+```
+Focus: tech
 Analyze this codebase for technology stack and external integrations.
@@ -104,19 +108,22 @@ Write these documents to .planning/codebase/:
 - STACK.md - Languages, runtime, frameworks, dependencies, configuration
 - INTEGRATIONS.md - External APIs, databases, auth providers, webhooks
-Explore thoroughly. Write documents directly using templates. Return confirmation only."
-)
+Explore thoroughly. Write documents directly using templates. Return confirmation only.
 ```
 **Agent 2: Architecture Focus**
+Task tool parameters:
+```
+subagent_type: "gsd-codebase-mapper"
+model: "{mapper_model}"
+run_in_background: true
+description: "Map codebase architecture"
+```
+Prompt:
 ```
-Task(
-  subagent_type="gsd-codebase-mapper",
-  model="{mapper_model}",
-  run_in_background=true,
-  description="Map codebase architecture",
-  prompt="Focus: arch
+Focus: arch
 Analyze this codebase architecture and directory structure.
@@ -124,19 +131,22 @@ Write these documents to .planning/codebase/:
 - ARCHITECTURE.md - Pattern, layers, data flow, abstractions, entry points
 - STRUCTURE.md - Directory layout, key locations, naming conventions
-Explore thoroughly. Write documents directly using templates. Return confirmation only."
-)
+Explore thoroughly. Write documents directly using templates. Return confirmation only.
 ```
 **Agent 3: Quality Focus**
+Task tool parameters:
 ```
-Task(
-  subagent_type="gsd-codebase-mapper",
-  model="{mapper_model}",
-  run_in_background=true,
-  description="Map codebase conventions",
-  prompt="Focus: quality
+subagent_type: "gsd-codebase-mapper"
+model: "{mapper_model}"
+run_in_background: true
+description: "Map codebase conventions"
+```
+Prompt:
+```
+Focus: quality
 Analyze this codebase for coding conventions and testing patterns.
@@ -144,27 +154,29 @@ Write these documents to .planning/codebase/:
 - CONVENTIONS.md - Code style, naming, patterns, error handling
 - TESTING.md - Framework, structure, mocking, coverage
-Explore thoroughly. Write documents directly using templates. Return confirmation only."
-)
+Explore thoroughly. Write documents directly using templates. Return confirmation only.
 ```
 **Agent 4: Concerns Focus**
+Task tool parameters:
+```
+subagent_type: "gsd-codebase-mapper"
+model: "{mapper_model}"
+run_in_background: true
+description: "Map codebase concerns"
+```
+Prompt:
 ```
-Task(
-  subagent_type="gsd-codebase-mapper",
-  model="{mapper_model}",
-  run_in_background=true,
-  description="Map codebase concerns",
-  prompt="Focus: concerns
+Focus: concerns
 Analyze this codebase for technical debt, known issues, and areas of concern.
 Write this document to .planning/codebase/:
 - CONCERNS.md - Tech debt, bugs, security, performance, fragile areas
-Explore thoroughly. Write document directly using template. Return confirmation only."
-)
+Explore thoroughly. Write document directly using template. Return confirmation only.
 ```
 Continue to collect_confirmations.

package/get-shit-done/workflows/new-milestone.md CHANGED Viewed

@@ -128,9 +128,7 @@ Focus ONLY on what's needed for the NEW features.
 <question>{QUESTION}</question>
-<files_to_read>
-- .planning/PROJECT.md (Project context)
-</files_to_read>
+<project_context>[PROJECT.md summary]</project_context>
 <downstream_consumer>{CONSUMER}</downstream_consumer>
@@ -159,12 +157,7 @@ After all 4 complete, spawn synthesizer:
 Task(prompt="
 Synthesize research outputs into SUMMARY.md.
-<files_to_read>
-- .planning/research/STACK.md
-- .planning/research/FEATURES.md
-- .planning/research/ARCHITECTURE.md
-- .planning/research/PITFALLS.md
-</files_to_read>
+Read: .planning/research/STACK.md, FEATURES.md, ARCHITECTURE.md, PITFALLS.md
 Write to: .planning/research/SUMMARY.md
 Use template: ~/.codex/get-shit-done/templates/research-project/SUMMARY.md
@@ -271,13 +264,11 @@ node ~/.codex/get-shit-done/bin/gsd-tools.cjs commit "docs: define milestone v[X
 ```
 Task(prompt="
 <planning_context>
-<files_to_read>
-- .planning/PROJECT.md
-- .planning/REQUIREMENTS.md
-- .planning/research/SUMMARY.md (if exists)
-- .planning/config.json
-- .planning/MILESTONES.md
-</files_to_read>
+@.planning/PROJECT.md
+@.planning/REQUIREMENTS.md
+@.planning/research/SUMMARY.md (if exists)
+@.planning/config.json
+@.planning/MILESTONES.md
 </planning_context>
 <instructions>