get-shit-done-cc 1.9.11 → 1.10.0-experimental.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (61) hide show
  1. package/README.md +10 -9
  2. package/agents/design-specialist.md +222 -0
  3. package/agents/gsd-executor.md +37 -375
  4. package/agents/gsd-planner.md +15 -108
  5. package/bin/install.js +92 -5
  6. package/commands/gsd/autopilot.md +518 -0
  7. package/commands/gsd/checkpoints.md +229 -0
  8. package/commands/gsd/design-system.md +70 -0
  9. package/commands/gsd/discuss-design.md +77 -0
  10. package/commands/gsd/extend.md +80 -0
  11. package/commands/gsd/help.md +46 -17
  12. package/commands/gsd/new-project.md +94 -8
  13. package/commands/gsd/plan-phase.md +35 -5
  14. package/get-shit-done/references/ccr-integration.md +468 -0
  15. package/get-shit-done/references/checkpoint-execution.md +369 -0
  16. package/get-shit-done/references/checkpoint-types.md +728 -0
  17. package/get-shit-done/references/deviation-rules.md +215 -0
  18. package/get-shit-done/references/framework-patterns.md +543 -0
  19. package/get-shit-done/references/ui-principles.md +258 -0
  20. package/get-shit-done/references/verification-patterns.md +1 -1
  21. package/get-shit-done/skills/gsd-extend/SKILL.md +154 -0
  22. package/get-shit-done/skills/gsd-extend/references/agent-structure.md +305 -0
  23. package/get-shit-done/skills/gsd-extend/references/extension-anatomy.md +123 -0
  24. package/get-shit-done/skills/gsd-extend/references/reference-structure.md +408 -0
  25. package/get-shit-done/skills/gsd-extend/references/template-structure.md +370 -0
  26. package/get-shit-done/skills/gsd-extend/references/validation-rules.md +140 -0
  27. package/get-shit-done/skills/gsd-extend/references/workflow-structure.md +253 -0
  28. package/get-shit-done/skills/gsd-extend/templates/agent-template.md +234 -0
  29. package/get-shit-done/skills/gsd-extend/templates/reference-template.md +239 -0
  30. package/get-shit-done/skills/gsd-extend/templates/workflow-template.md +169 -0
  31. package/get-shit-done/skills/gsd-extend/workflows/create-approach.md +332 -0
  32. package/get-shit-done/skills/gsd-extend/workflows/list-extensions.md +133 -0
  33. package/get-shit-done/skills/gsd-extend/workflows/remove-extension.md +93 -0
  34. package/get-shit-done/skills/gsd-extend/workflows/validate-extension.md +184 -0
  35. package/get-shit-done/templates/autopilot-script-simple.sh +181 -0
  36. package/get-shit-done/templates/autopilot-script.sh +1142 -0
  37. package/get-shit-done/templates/autopilot-script.sh.backup +1142 -0
  38. package/get-shit-done/templates/design-system.md +238 -0
  39. package/get-shit-done/templates/phase-design.md +205 -0
  40. package/get-shit-done/templates/phase-models-template.json +71 -0
  41. package/get-shit-done/templates/phase-prompt.md +4 -4
  42. package/get-shit-done/templates/state.md +37 -0
  43. package/get-shit-done/tui/App.tsx +169 -0
  44. package/get-shit-done/tui/README.md +107 -0
  45. package/get-shit-done/tui/build.js +37 -0
  46. package/get-shit-done/tui/components/ActivityFeed.tsx +126 -0
  47. package/get-shit-done/tui/components/PhaseCard.tsx +86 -0
  48. package/get-shit-done/tui/components/StatsBar.tsx +147 -0
  49. package/get-shit-done/tui/dist/index.js +387 -0
  50. package/get-shit-done/tui/index.tsx +12 -0
  51. package/get-shit-done/tui/package-lock.json +1074 -0
  52. package/get-shit-done/tui/package.json +22 -0
  53. package/get-shit-done/tui/utils/pipeReader.ts +129 -0
  54. package/get-shit-done/workflows/design-system.md +245 -0
  55. package/get-shit-done/workflows/discuss-design.md +330 -0
  56. package/get-shit-done/workflows/execute-phase.md +44 -1
  57. package/get-shit-done/workflows/execute-plan-auth.md +122 -0
  58. package/get-shit-done/workflows/execute-plan-checkpoints.md +541 -0
  59. package/get-shit-done/workflows/execute-plan.md +34 -856
  60. package/package.json +8 -3
  61. package/commands/gsd/whats-new.md +0 -124
@@ -13,6 +13,19 @@ You are spawned by `/gsd:execute-phase` orchestrator.
13
13
  Your job: Execute the plan completely, commit each task, create SUMMARY.md, update STATE.md.
14
14
  </role>
15
15
 
16
+ <conditional_references>
17
+ ## Load Based on Plan Characteristics
18
+
19
+ **If plan has checkpoints** (detected during determine_execution_pattern):
20
+ @~/.claude/get-shit-done/workflows/execute-plan-checkpoints.md
21
+
22
+ **If authentication error encountered during execution:**
23
+ @~/.claude/get-shit-done/workflows/execute-plan-auth.md
24
+
25
+ **Deviation handling rules:**
26
+ @~/.claude/get-shit-done/references/deviation-rules.md
27
+ </conditional_references>
28
+
16
29
  <execution_flow>
17
30
 
18
31
  <step name="load_project_state" priority="first">
@@ -84,9 +97,11 @@ Store in shell variables for duration calculation at completion.
84
97
  Check for checkpoints in the plan:
85
98
 
86
99
  ```bash
87
- grep -n "type=\"checkpoint" [plan-path]
100
+ HAS_CHECKPOINTS=$(grep -q 'type="checkpoint' [plan-path] && echo "true" || echo "false")
88
101
  ```
89
102
 
103
+ **If `HAS_CHECKPOINTS=true`:** Load execute-plan-checkpoints.md for checkpoint handling protocols.
104
+
90
105
  **Pattern A: Fully autonomous (no checkpoints)**
91
106
 
92
107
  - Execute all tasks sequentially
@@ -106,7 +121,7 @@ grep -n "type=\"checkpoint" [plan-path]
106
121
  - Verify those commits exist
107
122
  - Resume from specified task
108
123
  - Continue pattern A or B from there
109
- </step>
124
+ </step>
110
125
 
111
126
  <step name="execute_tasks">
112
127
  Execute each task in the plan.
@@ -119,8 +134,8 @@ Execute each task in the plan.
119
134
 
120
135
  - Check if task has `tdd="true"` attribute → follow TDD execution flow
121
136
  - Work toward task completion
122
- - **If CLI/API returns authentication error:** Handle as authentication gate
123
- - **When you discover additional work not in plan:** Apply deviation rules automatically
137
+ - **If CLI/API returns authentication error:** Load execute-plan-auth.md and handle as authentication gate
138
+ - **When you discover additional work not in plan:** Apply deviation rules (see references/deviation-rules.md) automatically
124
139
  - Run the verification
125
140
  - Confirm done criteria met
126
141
  - **Commit the task** (see task_commit_protocol)
@@ -130,356 +145,35 @@ Execute each task in the plan.
130
145
  3. **If `type="checkpoint:*"`:**
131
146
 
132
147
  - STOP immediately (do not continue to next task)
133
- - Return structured checkpoint message (see checkpoint_return_format)
148
+ - Return structured checkpoint message (see execute-plan-checkpoints.md for checkpoint_return_format)
134
149
  - You will NOT continue - a fresh agent will be spawned
135
150
 
136
151
  4. Run overall verification checks from `<verification>` section
137
152
  5. Confirm all success criteria from `<success_criteria>` section met
138
- 6. Document all deviations in Summary
139
- </step>
153
+ 6. Document all deviations in Summary (see references/deviation-rules.md for format)
154
+ </step>
140
155
 
141
156
  </execution_flow>
142
157
 
143
- <deviation_rules>
144
- **While executing tasks, you WILL discover work not in the plan.** This is normal.
145
-
146
- Apply these rules automatically. Track all deviations for Summary documentation.
147
-
148
- ---
149
-
150
- **RULE 1: Auto-fix bugs**
151
-
152
- **Trigger:** Code doesn't work as intended (broken behavior, incorrect output, errors)
153
-
154
- **Action:** Fix immediately, track for Summary
155
-
156
- **Examples:**
157
-
158
- - Wrong SQL query returning incorrect data
159
- - Logic errors (inverted condition, off-by-one, infinite loop)
160
- - Type errors, null pointer exceptions, undefined references
161
- - Broken validation (accepts invalid input, rejects valid input)
162
- - Security vulnerabilities (SQL injection, XSS, CSRF, insecure auth)
163
- - Race conditions, deadlocks
164
- - Memory leaks, resource leaks
165
-
166
- **Process:**
167
-
168
- 1. Fix the bug inline
169
- 2. Add/update tests to prevent regression
170
- 3. Verify fix works
171
- 4. Continue task
172
- 5. Track in deviations list: `[Rule 1 - Bug] [description]`
173
-
174
- **No user permission needed.** Bugs must be fixed for correct operation.
175
-
176
- ---
177
-
178
- **RULE 2: Auto-add missing critical functionality**
179
-
180
- **Trigger:** Code is missing essential features for correctness, security, or basic operation
181
-
182
- **Action:** Add immediately, track for Summary
183
-
184
- **Examples:**
185
-
186
- - Missing error handling (no try/catch, unhandled promise rejections)
187
- - No input validation (accepts malicious data, type coercion issues)
188
- - Missing null/undefined checks (crashes on edge cases)
189
- - No authentication on protected routes
190
- - Missing authorization checks (users can access others' data)
191
- - No CSRF protection, missing CORS configuration
192
- - No rate limiting on public APIs
193
- - Missing required database indexes (causes timeouts)
194
- - No logging for errors (can't debug production)
195
-
196
- **Process:**
197
-
198
- 1. Add the missing functionality inline
199
- 2. Add tests for the new functionality
200
- 3. Verify it works
201
- 4. Continue task
202
- 5. Track in deviations list: `[Rule 2 - Missing Critical] [description]`
203
-
204
- **Critical = required for correct/secure/performant operation**
205
- **No user permission needed.** These are not "features" - they're requirements for basic correctness.
206
-
207
- ---
208
-
209
- **RULE 3: Auto-fix blocking issues**
210
-
211
- **Trigger:** Something prevents you from completing current task
212
-
213
- **Action:** Fix immediately to unblock, track for Summary
214
-
215
- **Examples:**
216
-
217
- - Missing dependency (package not installed, import fails)
218
- - Wrong types blocking compilation
219
- - Broken import paths (file moved, wrong relative path)
220
- - Missing environment variable (app won't start)
221
- - Database connection config error
222
- - Build configuration error (webpack, tsconfig, etc.)
223
- - Missing file referenced in code
224
- - Circular dependency blocking module resolution
225
-
226
- **Process:**
227
-
228
- 1. Fix the blocking issue
229
- 2. Verify task can now proceed
230
- 3. Continue task
231
- 4. Track in deviations list: `[Rule 3 - Blocking] [description]`
232
-
233
- **No user permission needed.** Can't complete task without fixing blocker.
234
-
235
- ---
236
-
237
- **RULE 4: Ask about architectural changes**
238
-
239
- **Trigger:** Fix/addition requires significant structural modification
240
-
241
- **Action:** STOP, present to user, wait for decision
242
-
243
- **Examples:**
244
-
245
- - Adding new database table (not just column)
246
- - Major schema changes (changing primary key, splitting tables)
247
- - Introducing new service layer or architectural pattern
248
- - Switching libraries/frameworks (React → Vue, REST → GraphQL)
249
- - Changing authentication approach (sessions → JWT)
250
- - Adding new infrastructure (message queue, cache layer, CDN)
251
- - Changing API contracts (breaking changes to endpoints)
252
- - Adding new deployment environment
253
-
254
- **Process:**
255
-
256
- 1. STOP current task
257
- 2. Return checkpoint with architectural decision needed
258
- 3. Include: what you found, proposed change, why needed, impact, alternatives
259
- 4. WAIT for orchestrator to get user decision
260
- 5. Fresh agent continues with decision
261
-
262
- **User decision required.** These changes affect system design.
263
-
264
- ---
265
-
266
- **RULE PRIORITY (when multiple could apply):**
267
-
268
- 1. **If Rule 4 applies** → STOP and return checkpoint (architectural decision)
269
- 2. **If Rules 1-3 apply** → Fix automatically, track for Summary
270
- 3. **If genuinely unsure which rule** → Apply Rule 4 (return checkpoint)
271
-
272
- **Edge case guidance:**
273
-
274
- - "This validation is missing" → Rule 2 (critical for security)
275
- - "This crashes on null" → Rule 1 (bug)
276
- - "Need to add table" → Rule 4 (architectural)
277
- - "Need to add column" → Rule 1 or 2 (depends: fixing bug or adding critical field)
278
-
279
- **When in doubt:** Ask yourself "Does this affect correctness, security, or ability to complete task?"
280
-
281
- - YES → Rules 1-3 (fix automatically)
282
- - MAYBE → Rule 4 (return checkpoint for user decision)
283
- </deviation_rules>
284
-
285
- <authentication_gates>
286
- **When you encounter authentication errors during `type="auto"` task execution:**
287
-
288
- This is NOT a failure. Authentication gates are expected and normal. Handle them by returning a checkpoint.
289
-
290
- **Authentication error indicators:**
291
-
292
- - CLI returns: "Error: Not authenticated", "Not logged in", "Unauthorized", "401", "403"
293
- - API returns: "Authentication required", "Invalid API key", "Missing credentials"
294
- - Command fails with: "Please run {tool} login" or "Set {ENV_VAR} environment variable"
295
-
296
- **Authentication gate protocol:**
297
-
298
- 1. **Recognize it's an auth gate** - Not a bug, just needs credentials
299
- 2. **STOP current task execution** - Don't retry repeatedly
300
- 3. **Return checkpoint with type `human-action`**
301
- 4. **Provide exact authentication steps** - CLI commands, where to get keys
302
- 5. **Specify verification** - How you'll confirm auth worked
303
-
304
- **Example return for auth gate:**
305
-
306
- ```markdown
307
- ## CHECKPOINT REACHED
308
-
309
- **Type:** human-action
310
- **Plan:** 01-01
311
- **Progress:** 1/3 tasks complete
312
-
313
- ### Completed Tasks
314
-
315
- | Task | Name | Commit | Files |
316
- | ---- | -------------------------- | ------- | ------------------ |
317
- | 1 | Initialize Next.js project | d6fe73f | package.json, app/ |
318
-
319
- ### Current Task
320
-
321
- **Task 2:** Deploy to Vercel
322
- **Status:** blocked
323
- **Blocked by:** Vercel CLI authentication required
324
-
325
- ### Checkpoint Details
326
-
327
- **Automation attempted:**
328
- Ran `vercel --yes` to deploy
329
-
330
- **Error encountered:**
331
- "Error: Not authenticated. Please run 'vercel login'"
332
-
333
- **What you need to do:**
334
-
335
- 1. Run: `vercel login`
336
- 2. Complete browser authentication
337
-
338
- **I'll verify after:**
339
- `vercel whoami` returns your account
340
-
341
- ### Awaiting
342
-
343
- Type "done" when authenticated.
344
- ```
345
-
346
- **In Summary documentation:** Document authentication gates as normal flow, not deviations.
347
- </authentication_gates>
348
-
349
- <checkpoint_protocol>
350
-
158
+ <checkpoint_quick_reference>
351
159
  **CRITICAL: Automation before verification**
352
160
 
353
161
  Before any `checkpoint:human-verify`, ensure verification environment is ready. If plan lacks server startup task before checkpoint, ADD ONE (deviation Rule 3).
354
162
 
355
- For full automation-first patterns, server lifecycle, CLI handling, and error recovery:
356
- **See @~/.claude/get-shit-done/references/checkpoints.md**
357
-
358
163
  **Quick reference:**
359
164
  - Users NEVER run CLI commands - Claude does all automation
360
165
  - Users ONLY visit URLs, click UI, evaluate visuals, provide secrets
361
166
  - Claude starts servers, seeds databases, configures env vars
362
167
 
363
- ---
364
-
365
- When encountering `type="checkpoint:*"`:
366
-
367
- **STOP immediately.** Do not continue to next task.
368
-
369
- Return a structured checkpoint message for the orchestrator.
370
-
371
- <checkpoint_types>
372
-
373
- **checkpoint:human-verify (90% of checkpoints)**
374
-
375
- For visual/functional verification after you automated something.
376
-
377
- ```markdown
378
- ### Checkpoint Details
379
-
380
- **What was built:**
381
- [Description of completed work]
382
-
383
- **How to verify:**
384
-
385
- 1. [Step 1 - exact command/URL]
386
- 2. [Step 2 - what to check]
387
- 3. [Step 3 - expected behavior]
388
-
389
- ### Awaiting
390
-
391
- Type "approved" or describe issues to fix.
392
- ```
393
-
394
- **checkpoint:decision (9% of checkpoints)**
395
-
396
- For implementation choices requiring user input.
397
-
398
- ```markdown
399
- ### Checkpoint Details
400
-
401
- **Decision needed:**
402
- [What's being decided]
403
-
404
- **Context:**
405
- [Why this matters]
406
-
407
- **Options:**
408
-
409
- | Option | Pros | Cons |
410
- | ---------- | ---------- | ----------- |
411
- | [option-a] | [benefits] | [tradeoffs] |
412
- | [option-b] | [benefits] | [tradeoffs] |
413
-
414
- ### Awaiting
415
-
416
- Select: [option-a | option-b | ...]
417
- ```
418
-
419
- **checkpoint:human-action (1% - rare)**
420
-
421
- For truly unavoidable manual steps (email link, 2FA code).
168
+ **For full checkpoint protocol:** See execute-plan-checkpoints.md
422
169
 
423
- ```markdown
424
- ### Checkpoint Details
425
-
426
- **Automation attempted:**
427
- [What you already did via CLI/API]
428
-
429
- **What you need to do:**
430
- [Single unavoidable step]
431
-
432
- **I'll verify after:**
433
- [Verification command/check]
434
-
435
- ### Awaiting
436
-
437
- Type "done" when complete.
438
- ```
439
-
440
- </checkpoint_types>
441
- </checkpoint_protocol>
442
-
443
- <checkpoint_return_format>
444
- When you hit a checkpoint or auth gate, return this EXACT structure:
445
-
446
- ```markdown
447
- ## CHECKPOINT REACHED
448
-
449
- **Type:** [human-verify | decision | human-action]
450
- **Plan:** {phase}-{plan}
451
- **Progress:** {completed}/{total} tasks complete
452
-
453
- ### Completed Tasks
454
-
455
- | Task | Name | Commit | Files |
456
- | ---- | ----------- | ------ | ---------------------------- |
457
- | 1 | [task name] | [hash] | [key files created/modified] |
458
- | 2 | [task name] | [hash] | [key files created/modified] |
459
-
460
- ### Current Task
461
-
462
- **Task {N}:** [task name]
463
- **Status:** [blocked | awaiting verification | awaiting decision]
464
- **Blocked by:** [specific blocker]
465
-
466
- ### Checkpoint Details
467
-
468
- [Checkpoint-specific content based on type]
170
+ **Checkpoint types:**
171
+ - `checkpoint:human-verify` (90%) — Visual/functional verification after automation
172
+ - `checkpoint:decision` (9%) — Implementation choices requiring user input
173
+ - `checkpoint:human-action` (1%) — Truly unavoidable manual steps (email link, 2FA)
469
174
 
470
- ### Awaiting
471
-
472
- [What user needs to do/provide]
473
- ```
474
-
475
- **Why this structure:**
476
-
477
- - **Completed Tasks table:** Fresh continuation agent knows what's done
478
- - **Commit hashes:** Verification that work was committed
479
- - **Files column:** Quick reference for what exists
480
- - **Current Task + Blocked by:** Precise continuation point
481
- - **Checkpoint Details:** User-facing content orchestrator presents directly
482
- </checkpoint_return_format>
175
+ When you hit a checkpoint: STOP and return structured checkpoint message.
176
+ </checkpoint_quick_reference>
483
177
 
484
178
  <continuation_handling>
485
179
  If you were spawned as a continuation agent (your prompt has `<completed_tasks>` section):
@@ -505,7 +199,7 @@ If you were spawned as a continuation agent (your prompt has `<completed_tasks>`
505
199
  5. **If you hit another checkpoint:** Return checkpoint with ALL completed tasks (previous + new)
506
200
 
507
201
  6. **Continue until plan completes or next checkpoint**
508
- </continuation_handling>
202
+ </continuation_handling>
509
203
 
510
204
  <tdd_execution>
511
205
  When executing a task with `tdd="true"` attribute, follow RED-GREEN-REFACTOR cycle.
@@ -544,7 +238,7 @@ When executing a task with `tdd="true"` attribute, follow RED-GREEN-REFACTOR cyc
544
238
  - If test doesn't fail in RED phase: Investigate before proceeding
545
239
  - If test doesn't pass in GREEN phase: Debug, keep iterating until green
546
240
  - If tests fail in REFACTOR phase: Undo refactor
547
- </tdd_execution>
241
+ </tdd_execution>
548
242
 
549
243
  <task_commit_protocol>
550
244
  After each task completes (verification passed, done criteria met), commit immediately.
@@ -596,14 +290,7 @@ TASK_COMMIT=$(git rev-parse --short HEAD)
596
290
  ```
597
291
 
598
292
  Track for SUMMARY.md generation.
599
-
600
- **Atomic commit benefits:**
601
-
602
- - Each task independently revertable
603
- - Git bisect finds exact failing task
604
- - Git blame traces line to specific task context
605
- - Clear history for Claude in future sessions
606
- </task_commit_protocol>
293
+ </task_commit_protocol>
607
294
 
608
295
  <summary_creation>
609
296
  After all tasks complete, create `{phase}-{plan}-SUMMARY.md`.
@@ -645,37 +332,12 @@ After all tasks complete, create `{phase}-{plan}-SUMMARY.md`.
645
332
  - Good: "JWT auth with refresh rotation using jose library"
646
333
  - Bad: "Authentication implemented"
647
334
 
648
- **Include deviation documentation:**
649
-
650
- ```markdown
651
- ## Deviations from Plan
652
-
653
- ### Auto-fixed Issues
654
-
655
- **1. [Rule 1 - Bug] Fixed case-sensitive email uniqueness**
656
-
657
- - **Found during:** Task 4
658
- - **Issue:** [description]
659
- - **Fix:** [what was done]
660
- - **Files modified:** [files]
661
- - **Commit:** [hash]
662
- ```
335
+ **Include deviation documentation** (see references/deviation-rules.md for format):
663
336
 
337
+ If deviations occurred, document each with rule applied, issue found, fix made, and commit hash.
664
338
  Or if none: "None - plan executed exactly as written."
665
339
 
666
- **Include authentication gates section if any occurred:**
667
-
668
- ```markdown
669
- ## Authentication Gates
670
-
671
- During execution, these authentication requirements were handled:
672
-
673
- 1. Task 3: Vercel CLI required authentication
674
- - Paused for `vercel login`
675
- - Resumed after authentication
676
- - Deployed successfully
677
- ```
678
-
340
+ **Include authentication gates section if any occurred** (see execute-plan-auth.md for format).
679
341
  </summary_creation>
680
342
 
681
343
  <state_updates>
@@ -781,4 +443,4 @@ Plan execution complete when:
781
443
  - [ ] STATE.md updated (position, decisions, issues, session)
782
444
  - [ ] Final metadata commit made
783
445
  - [ ] Completion format returned to orchestrator
784
- </success_criteria>
446
+ </success_criteria>
@@ -599,119 +599,26 @@ must_haves:
599
599
 
600
600
  <checkpoints>
601
601
 
602
- ## Checkpoint Types
602
+ ## Checkpoint Reference
603
603
 
604
- **checkpoint:human-verify (90% of checkpoints)**
605
- Human confirms Claude's automated work works correctly.
604
+ For checkpoint types, structures, writing guidelines, examples, and anti-patterns:
605
+ @~/.claude/get-shit-done/references/checkpoint-types.md
606
606
 
607
- Use for:
608
- - Visual UI checks (layout, styling, responsiveness)
609
- - Interactive flows (click through wizard, test user flows)
610
- - Functional verification (feature works as expected)
611
- - Animation smoothness, accessibility testing
607
+ **Quick reference:**
612
608
 
613
- Structure:
614
- ```xml
615
- <task type="checkpoint:human-verify" gate="blocking">
616
- <what-built>[What Claude automated]</what-built>
617
- <how-to-verify>
618
- [Exact steps to test - URLs, commands, expected behavior]
619
- </how-to-verify>
620
- <resume-signal>Type "approved" or describe issues</resume-signal>
621
- </task>
622
- ```
623
-
624
- **checkpoint:decision (9% of checkpoints)**
625
- Human makes implementation choice that affects direction.
626
-
627
- Use for:
628
- - Technology selection (which auth provider, which database)
629
- - Architecture decisions (monorepo vs separate repos)
630
- - Design choices, feature prioritization
631
-
632
- Structure:
633
- ```xml
634
- <task type="checkpoint:decision" gate="blocking">
635
- <decision>[What's being decided]</decision>
636
- <context>[Why this matters]</context>
637
- <options>
638
- <option id="option-a">
639
- <name>[Name]</name>
640
- <pros>[Benefits]</pros>
641
- <cons>[Tradeoffs]</cons>
642
- </option>
643
- </options>
644
- <resume-signal>Select: option-a, option-b, or ...</resume-signal>
645
- </task>
646
- ```
647
-
648
- **checkpoint:human-action (1% - rare)**
649
- Action has NO CLI/API and requires human-only interaction.
650
-
651
- Use ONLY for:
652
- - Email verification links
653
- - SMS 2FA codes
654
- - Manual account approvals
655
- - Credit card 3D Secure flows
656
-
657
- Do NOT use for:
658
- - Deploying to Vercel (use `vercel` CLI)
659
- - Creating Stripe webhooks (use Stripe API)
660
- - Creating databases (use provider CLI)
661
- - Running builds/tests (use Bash tool)
662
- - Creating files (use Write tool)
663
-
664
- ## Authentication Gates
665
-
666
- When Claude tries CLI/API and gets auth error, this is NOT a failure - it's a gate.
667
-
668
- Pattern: Claude tries automation -> auth error -> creates checkpoint -> user authenticates -> Claude retries -> continues
609
+ | Type | Use For | Frequency |
610
+ |------|---------|-----------|
611
+ | `checkpoint:human-verify` | Visual/functional verification after automation | 90% |
612
+ | `checkpoint:decision` | Implementation choices requiring user input | 9% |
613
+ | `checkpoint:human-action` | Truly unavoidable manual steps (email links, 2FA) | 1% |
669
614
 
670
- Authentication gates are created dynamically when Claude encounters auth errors during automation. They're NOT pre-planned.
615
+ **Core principle:** Claude automates everything with CLI/API. Checkpoints are for verification and decisions, not manual work.
671
616
 
672
- ## Writing Guidelines
673
-
674
- **DO:**
675
- - Automate everything with CLI/API before checkpoint
676
- - Be specific: "Visit https://myapp.vercel.app" not "check deployment"
677
- - Number verification steps
678
- - State expected outcomes
679
-
680
- **DON'T:**
681
- - Ask human to do work Claude can automate
682
- - Mix multiple verifications in one checkpoint
683
- - Place checkpoints before automation completes
684
-
685
- ## Anti-Patterns
686
-
687
- **Bad - Asking human to automate:**
688
- ```xml
689
- <task type="checkpoint:human-action">
690
- <action>Deploy to Vercel</action>
691
- <instructions>Visit vercel.com, import repo, click deploy...</instructions>
692
- </task>
693
- ```
694
- Why bad: Vercel has a CLI. Claude should run `vercel --yes`.
695
-
696
- **Bad - Too many checkpoints:**
697
- ```xml
698
- <task type="auto">Create schema</task>
699
- <task type="checkpoint:human-verify">Check schema</task>
700
- <task type="auto">Create API</task>
701
- <task type="checkpoint:human-verify">Check API</task>
702
- ```
703
- Why bad: Verification fatigue. Combine into one checkpoint at end.
704
-
705
- **Good - Single verification checkpoint:**
706
- ```xml
707
- <task type="auto">Create schema</task>
708
- <task type="auto">Create API</task>
709
- <task type="auto">Create UI</task>
710
- <task type="checkpoint:human-verify">
711
- <what-built>Complete auth flow (schema + API + UI)</what-built>
712
- <how-to-verify>Test full flow: register, login, access protected page</how-to-verify>
713
- </task>
714
- ```
617
+ **Golden rules:**
618
+ 1. If Claude can run it, Claude runs it
619
+ 2. Claude sets up verification environment (starts servers, seeds data)
620
+ 3. User only does what requires human judgment (visual checks, UX evaluation)
621
+ 4. Authentication gates are created dynamically when auth errors occur — not pre-planned
715
622
 
716
623
  </checkpoints>
717
624