get-shit-done-cc 1.3.10 → 1.3.11

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,286 +1,110 @@
1
1
  <overview>
2
- Plans execute autonomously. Checkpoints formalize the interaction points where human verification or decisions are needed.
2
+ Plans execute autonomously. Checkpoints formalize interaction points where human verification or decisions are needed.
3
3
 
4
4
  **Core principle:** Claude automates everything with CLI/API. Checkpoints are for verification and decisions, not manual work.
5
5
  </overview>
6
6
 
7
7
  <checkpoint_types>
8
8
 
9
- <type name="human-verify">
10
- ## checkpoint:human-verify (Most Common)
9
+ ## checkpoint:human-verify (90% of checkpoints)
11
10
 
12
11
  **When:** Claude completed automated work, human confirms it works correctly.
13
12
 
14
- **Use for:**
15
- - Visual UI checks (layout, styling, responsiveness)
16
- - Interactive flows (click through wizard, test user flows)
17
- - Functional verification (feature works as expected)
18
- - Audio/video playback quality
19
- - Animation smoothness
20
- - Accessibility testing
13
+ **Use for:** Visual UI checks, interactive flows, functional verification, audio/video quality, animation smoothness, accessibility testing.
21
14
 
22
15
  **Structure:**
23
16
  ```xml
24
17
  <task type="checkpoint:human-verify" gate="blocking">
25
- <what-built>[What Claude automated and deployed/built]</what-built>
26
- <how-to-verify>
27
- [Exact steps to test - URLs, commands, expected behavior]
28
- </how-to-verify>
29
- <resume-signal>[How to continue - "approved", "yes", or describe issues]</resume-signal>
18
+ <what-built>[What Claude automated]</what-built>
19
+ <how-to-verify>[Numbered steps - URLs, commands, expected behavior]</how-to-verify>
20
+ <resume-signal>[How to continue - "approved" or describe issues]</resume-signal>
30
21
  </task>
31
22
  ```
32
23
 
33
- **Key elements:**
34
- - `<what-built>`: What Claude automated (deployed, built, configured)
35
- - `<how-to-verify>`: Exact steps to confirm it works (numbered, specific)
36
- - `<resume-signal>`: Clear indication of how to continue
37
-
38
- **Example: Vercel Deployment**
24
+ **Example:**
39
25
  ```xml
40
26
  <task type="auto">
41
27
  <name>Deploy to Vercel</name>
42
- <files>.vercel/, vercel.json</files>
43
- <action>Run `vercel --yes` to create project and deploy. Capture deployment URL from output.</action>
28
+ <action>Run `vercel --yes` to deploy. Capture URL.</action>
44
29
  <verify>vercel ls shows deployment, curl {url} returns 200</verify>
45
- <done>App deployed, URL captured</done>
46
- </task>
47
-
48
- <task type="checkpoint:human-verify" gate="blocking">
49
- <what-built>Deployed to Vercel at https://myapp-abc123.vercel.app</what-built>
50
- <how-to-verify>
51
- Visit https://myapp-abc123.vercel.app and confirm:
52
- - Homepage loads without errors
53
- - Login form is visible
54
- - No console errors in browser DevTools
55
- </how-to-verify>
56
- <resume-signal>Type "approved" to continue, or describe issues to fix</resume-signal>
57
- </task>
58
- ```
59
-
60
- **Example: UI Component**
61
- ```xml
62
- <task type="auto">
63
- <name>Build responsive dashboard layout</name>
64
- <files>src/components/Dashboard.tsx, src/app/dashboard/page.tsx</files>
65
- <action>Create dashboard with sidebar, header, and content area. Use Tailwind responsive classes for mobile.</action>
66
- <verify>npm run build succeeds, no TypeScript errors</verify>
67
- <done>Dashboard component builds without errors</done>
68
- </task>
69
-
70
- <task type="checkpoint:human-verify" gate="blocking">
71
- <what-built>Responsive dashboard layout at /dashboard</what-built>
72
- <how-to-verify>
73
- 1. Run: npm run dev
74
- 2. Visit: http://localhost:3000/dashboard
75
- 3. Desktop (>1024px): Verify sidebar left, content right, header top
76
- 4. Tablet (768px): Verify sidebar collapses to hamburger
77
- 5. Mobile (375px): Verify single column, bottom nav
78
- 6. Check: No layout shift, no horizontal scroll
79
- </how-to-verify>
80
- <resume-signal>Type "approved" or describe layout issues</resume-signal>
81
- </task>
82
- ```
83
-
84
- **Example: Xcode Build**
85
- ```xml
86
- <task type="auto">
87
- <name>Build macOS app with Xcode</name>
88
- <files>App.xcodeproj, Sources/</files>
89
- <action>Run `xcodebuild -project App.xcodeproj -scheme App build`. Check for compilation errors in output.</action>
90
- <verify>Build output contains "BUILD SUCCEEDED", no errors</verify>
91
- <done>App builds successfully</done>
92
30
  </task>
93
31
 
94
32
  <task type="checkpoint:human-verify" gate="blocking">
95
- <what-built>Built macOS app at DerivedData/Build/Products/Debug/App.app</what-built>
33
+ <what-built>Deployed to https://myapp.vercel.app</what-built>
96
34
  <how-to-verify>
97
- Open App.app and test:
98
- - App launches without crashes
99
- - Menu bar icon appears
100
- - Preferences window opens correctly
101
- - No visual glitches or layout issues
35
+ Visit URL and confirm:
36
+ 1. Homepage loads without errors
37
+ 2. All images/assets load
38
+ 3. No console errors
102
39
  </how-to-verify>
103
40
  <resume-signal>Type "approved" or describe issues</resume-signal>
104
41
  </task>
105
42
  ```
106
- </type>
107
43
 
108
- <type name="decision">
109
- ## checkpoint:decision
44
+ ## checkpoint:decision (9% of checkpoints)
110
45
 
111
46
  **When:** Human must make choice that affects implementation direction.
112
47
 
113
- **Use for:**
114
- - Technology selection (which auth provider, which database)
115
- - Architecture decisions (monorepo vs separate repos)
116
- - Design choices (color scheme, layout approach)
117
- - Feature prioritization (which variant to build)
118
- - Data model decisions (schema structure)
48
+ **Use for:** Technology selection, architecture decisions, design choices, feature prioritization.
119
49
 
120
50
  **Structure:**
121
51
  ```xml
122
52
  <task type="checkpoint:decision" gate="blocking">
123
53
  <decision>[What's being decided]</decision>
124
- <context>[Why this decision matters]</context>
54
+ <context>[Why this matters]</context>
125
55
  <options>
126
- <option id="option-a">
127
- <name>[Option name]</name>
128
- <pros>[Benefits]</pros>
129
- <cons>[Tradeoffs]</cons>
130
- </option>
131
- <option id="option-b">
132
- <name>[Option name]</name>
133
- <pros>[Benefits]</pros>
134
- <cons>[Tradeoffs]</cons>
135
- </option>
56
+ <option id="option-a"><name>[Name]</name><pros>[Benefits]</pros><cons>[Tradeoffs]</cons></option>
57
+ <option id="option-b"><name>[Name]</name><pros>[Benefits]</pros><cons>[Tradeoffs]</cons></option>
136
58
  </options>
137
59
  <resume-signal>[How to indicate choice]</resume-signal>
138
60
  </task>
139
61
  ```
140
62
 
141
- **Key elements:**
142
- - `<decision>`: What's being decided
143
- - `<context>`: Why this matters
144
- - `<options>`: Each option with balanced pros/cons (not prescriptive)
145
- - `<resume-signal>`: How to indicate choice
146
-
147
- **Example: Auth Provider Selection**
63
+ **Example:**
148
64
  ```xml
149
65
  <task type="checkpoint:decision" gate="blocking">
150
66
  <decision>Select authentication provider</decision>
151
- <context>
152
- Need user authentication for the app. Three solid options with different tradeoffs.
153
- </context>
67
+ <context>Need user auth. Three options with different tradeoffs.</context>
154
68
  <options>
155
- <option id="supabase">
156
- <name>Supabase Auth</name>
157
- <pros>Built-in with Supabase DB we're using, generous free tier, row-level security integration</pros>
158
- <cons>Less customizable UI, tied to Supabase ecosystem</cons>
159
- </option>
160
- <option id="clerk">
161
- <name>Clerk</name>
162
- <pros>Beautiful pre-built UI, best developer experience, excellent docs</pros>
163
- <cons>Paid after 10k MAU, vendor lock-in</cons>
164
- </option>
165
- <option id="nextauth">
166
- <name>NextAuth.js</name>
167
- <pros>Free, self-hosted, maximum control, widely adopted</pros>
168
- <cons>More setup work, you manage security updates, UI is DIY</cons>
169
- </option>
69
+ <option id="supabase"><name>Supabase Auth</name><pros>Built-in with DB, free tier, RLS integration</pros><cons>Less customizable, ecosystem lock-in</cons></option>
70
+ <option id="clerk"><name>Clerk</name><pros>Beautiful UI, best DX</pros><cons>Paid after 10k MAU</cons></option>
71
+ <option id="nextauth"><name>NextAuth.js</name><pros>Free, self-hosted, max control</pros><cons>More setup, DIY security</cons></option>
170
72
  </options>
171
73
  <resume-signal>Select: supabase, clerk, or nextauth</resume-signal>
172
74
  </task>
173
75
  ```
174
- </type>
175
76
 
176
- <type name="human-action">
177
- ## checkpoint:human-action (Rare)
77
+ ## checkpoint:human-action (1% - rare)
178
78
 
179
- **When:** Action has NO CLI/API and requires human-only interaction, OR Claude hit an authentication gate during automation.
79
+ **When:** Action has NO CLI/API and requires human-only interaction.
180
80
 
181
- **Use ONLY for:**
182
- - **Authentication gates** - Claude tried to use CLI/API but needs credentials to continue (this is NOT a failure)
183
- - Email verification links (account creation requires clicking email)
184
- - SMS 2FA codes (phone verification)
185
- - Manual account approvals (platform requires human review before API access)
186
- - Credit card 3D Secure flows (web-based payment authorization)
187
- - OAuth app approvals (some platforms require web-based approval)
81
+ **Use ONLY for:** Email verification links, SMS 2FA codes, manual account approvals, 3D Secure payment flows, OAuth app approvals.
188
82
 
189
- **Do NOT use for pre-planned manual work:**
190
- - Manually deploying to Vercel (use `vercel` CLI - auth gate if needed)
191
- - Manually creating Stripe webhooks (use Stripe API - auth gate if needed)
192
- - Manually creating databases (use provider CLI - auth gate if needed)
193
- - Running builds/tests manually (use Bash tool)
194
- - Creating files manually (use Write tool)
83
+ **Do NOT use for:** Deployments (use CLI), creating resources (use CLI/API), builds/tests (use Bash), file operations (use Write/Edit).
195
84
 
196
85
  **Structure:**
197
86
  ```xml
198
87
  <task type="checkpoint:human-action" gate="blocking">
199
- <action>[What human must do - Claude already did everything automatable]</action>
200
- <instructions>
201
- [What Claude already automated]
202
- [The ONE thing requiring human action]
203
- </instructions>
204
- <verification>[What Claude can check afterward]</verification>
88
+ <action>[Unavoidable manual step]</action>
89
+ <instructions>[What Claude automated] [ONE thing requiring human action]</instructions>
90
+ <verification>[What Claude checks afterward]</verification>
205
91
  <resume-signal>[How to continue]</resume-signal>
206
92
  </task>
207
93
  ```
208
94
 
209
- **Key principle:** Claude automates EVERYTHING possible first, only asks human for the truly unavoidable manual step.
210
-
211
- **Example: Email Verification**
95
+ **Example (email verification):**
212
96
  ```xml
213
- <task type="auto">
214
- <name>Create SendGrid account via API</name>
215
- <action>Use SendGrid API to create subuser account with provided email. Request verification email.</action>
216
- <verify>API returns 201, account created</verify>
217
- <done>Account created, verification email sent</done>
218
- </task>
219
-
220
97
  <task type="checkpoint:human-action" gate="blocking">
221
98
  <action>Complete email verification for SendGrid account</action>
222
99
  <instructions>
223
100
  I created the account and requested verification email.
224
- Check your inbox for SendGrid verification link and click it.
101
+ Check your inbox for verification link and click it.
225
102
  </instructions>
226
103
  <verification>SendGrid API key works: curl test succeeds</verification>
227
- <resume-signal>Type "done" when email verified</resume-signal>
104
+ <resume-signal>Type "done" when verified</resume-signal>
228
105
  </task>
229
106
  ```
230
107
 
231
- **Example: Credit Card 3D Secure**
232
- ```xml
233
- <task type="auto">
234
- <name>Create Stripe payment intent</name>
235
- <action>Use Stripe API to create payment intent for $99. Generate checkout URL.</action>
236
- <verify>Stripe API returns payment intent ID and URL</verify>
237
- <done>Payment intent created</done>
238
- </task>
239
-
240
- <task type="checkpoint:human-action" gate="blocking">
241
- <action>Complete 3D Secure authentication</action>
242
- <instructions>
243
- I created the payment intent: https://checkout.stripe.com/pay/cs_test_abc123
244
- Visit that URL and complete the 3D Secure verification flow with your test card.
245
- </instructions>
246
- <verification>Stripe webhook receives payment_intent.succeeded event</verification>
247
- <resume-signal>Type "done" when payment completes</resume-signal>
248
- </task>
249
- ```
250
-
251
- **Example: Authentication Gate (Dynamic Checkpoint)**
252
- ```xml
253
- <task type="auto">
254
- <name>Deploy to Vercel</name>
255
- <files>.vercel/, vercel.json</files>
256
- <action>Run `vercel --yes` to deploy</action>
257
- <verify>vercel ls shows deployment, curl returns 200</verify>
258
- </task>
259
-
260
- <!-- If vercel returns "Error: Not authenticated", Claude creates checkpoint on the fly -->
261
-
262
- <task type="checkpoint:human-action" gate="blocking">
263
- <action>Authenticate Vercel CLI so I can continue deployment</action>
264
- <instructions>
265
- I tried to deploy but got authentication error.
266
- Run: vercel login
267
- This will open your browser - complete the authentication flow.
268
- </instructions>
269
- <verification>vercel whoami returns your account email</verification>
270
- <resume-signal>Type "done" when authenticated</resume-signal>
271
- </task>
272
-
273
- <!-- After authentication, Claude retries the deployment -->
274
-
275
- <task type="auto">
276
- <name>Retry Vercel deployment</name>
277
- <action>Run `vercel --yes` (now authenticated)</action>
278
- <verify>vercel ls shows deployment, curl returns 200</verify>
279
- </task>
280
- ```
281
-
282
- **Key distinction:** Authentication gates are created dynamically when Claude encounters auth errors during automation. They're NOT pre-planned - Claude tries to automate first, only asks for credentials when blocked.
283
- </type>
284
108
  </checkpoint_types>
285
109
 
286
110
  <execution_protocol>
@@ -297,298 +121,167 @@ CHECKPOINT: [Type]
297
121
 
298
122
  Task [X] of [Y]: [Name]
299
123
 
300
- [Display checkpoint-specific content]
124
+ [Checkpoint-specific content]
301
125
 
302
126
  [Resume signal instruction]
303
127
  ════════════════════════════════════════
304
128
  ```
305
129
 
306
130
  3. **Wait for user response** - do not hallucinate completion
307
- 4. **Verify if possible** - check files, run tests, whatever is specified
308
- 5. **Resume execution** - continue to next task only after confirmation
131
+ 4. **Verify if possible** - check files, run tests
132
+ 5. **Resume execution** - continue only after confirmation
309
133
 
310
- **For checkpoint:human-verify:**
311
- ```
312
- ════════════════════════════════════════
313
- CHECKPOINT: Verification Required
314
- ════════════════════════════════════════
134
+ </execution_protocol>
315
135
 
316
- Task 5 of 8: Responsive dashboard layout
136
+ <authentication_gates>
317
137
 
318
- I built: Responsive dashboard at /dashboard
138
+ **Critical:** When Claude tries CLI/API and gets auth error, this is NOT a failure - it's a gate requiring human input to unblock automation.
319
139
 
320
- How to verify:
321
- 1. Run: npm run dev
322
- 2. Visit: http://localhost:3000/dashboard
323
- 3. Test: Resize browser window to mobile/tablet/desktop
324
- 4. Confirm: No layout shift, proper responsive behavior
140
+ **Pattern:** Claude tries automation → auth error → creates checkpoint → you authenticate → Claude retries → continues
325
141
 
326
- Type "approved" to continue, or describe issues.
327
- ════════════════════════════════════════
328
- ```
142
+ **Gate protocol:**
143
+ 1. Recognize it's not a failure - missing auth is expected
144
+ 2. Stop current task - don't retry repeatedly
145
+ 3. Create checkpoint:human-action dynamically
146
+ 4. Provide exact authentication steps
147
+ 5. Verify authentication works
148
+ 6. Retry the original task
149
+ 7. Continue normally
329
150
 
330
- **For checkpoint:decision:**
331
- ```
332
- ════════════════════════════════════════
333
- CHECKPOINT: Decision Required
334
- ════════════════════════════════════════
151
+ **Example (Vercel auth gate):**
152
+ ```xml
153
+ <!-- Claude tries to deploy -->
154
+ <task type="auto">
155
+ <name>Deploy to Vercel</name>
156
+ <action>Run `vercel --yes` to deploy</action>
157
+ </task>
335
158
 
336
- Task 2 of 6: Select authentication provider
159
+ <!-- If vercel returns "Error: Not authenticated" -->
160
+ <task type="checkpoint:human-action" gate="blocking">
161
+ <action>Authenticate Vercel CLI so I can continue</action>
162
+ <instructions>
163
+ I tried to deploy but got authentication error.
164
+ Run: vercel login (opens browser)
165
+ </instructions>
166
+ <verification>vercel whoami returns your account</verification>
167
+ <resume-signal>Type "done" when authenticated</resume-signal>
168
+ </task>
337
169
 
338
- Decision: Which auth provider should we use?
170
+ <!-- After auth, Claude retries automatically -->
171
+ <task type="auto">
172
+ <name>Retry deployment</name>
173
+ <action>Run `vercel --yes` (now authenticated)</action>
174
+ </task>
175
+ ```
339
176
 
340
- Context: Need user authentication. Three options with different tradeoffs.
177
+ **Key distinction:**
178
+ - Pre-planned checkpoint: "I need you to do X" (wrong - Claude should automate)
179
+ - Auth gate: "I tried to automate X but need credentials" (correct - unblocks automation)
341
180
 
342
- Options:
343
- 1. supabase - Built-in with our DB, free tier
344
- 2. clerk - Best DX, paid after 10k users
345
- 3. nextauth - Self-hosted, maximum control
181
+ </authentication_gates>
346
182
 
347
- Select: supabase, clerk, or nextauth
348
- ════════════════════════════════════════
349
- ```
350
- </execution_protocol>
183
+ <automation_reference>
351
184
 
352
- <writing_guidelines>
185
+ **The rule:** If it has CLI/API, Claude does it. Never ask human to perform automatable work.
353
186
 
354
- **DO:**
355
- - Automate everything with CLI/API before checkpoint
356
- - Be specific: "Visit https://myapp.vercel.app" not "check deployment"
357
- - Number verification steps: easier to follow
358
- - State expected outcomes: "You should see X"
359
- - Provide context: why this checkpoint exists
360
- - Make verification executable: clear, testable steps
187
+ | Service | CLI/API | Key Commands | Auth Gate |
188
+ |---------|---------|--------------|-----------|
189
+ | Vercel | `vercel` | `--yes`, `env add`, `--prod`, `ls` | `vercel login` |
190
+ | Railway | `railway` | `init`, `up`, `variables set` | `railway login` |
191
+ | Fly | `fly` | `launch`, `deploy`, `secrets set` | `fly auth login` |
192
+ | Stripe | `stripe` + API | `listen`, `trigger`, API calls | API key in .env |
193
+ | Supabase | `supabase` | `init`, `link`, `db push`, `gen types` | `supabase login` |
194
+ | Upstash | `upstash` | `redis create`, `redis get` | `upstash auth login` |
195
+ | PlanetScale | `pscale` | `database create`, `branch create` | `pscale auth login` |
196
+ | GitHub | `gh` | `repo create`, `pr create`, `secret set` | `gh auth login` |
197
+ | Node | `npm`/`pnpm` | `install`, `run build`, `test` | N/A |
198
+ | Xcode | `xcodebuild` | `-project`, `-scheme`, `build`, `test` | N/A |
361
199
 
362
- **DON'T:**
363
- - Ask human to do work Claude can automate (deploy, create resources, run builds)
364
- - Assume knowledge: "Configure the usual settings" ❌
365
- - Skip steps: "Set up database" ❌ (too vague)
366
- - Mix multiple verifications in one checkpoint (split them)
367
- - Make verification impossible (Claude can't check visual appearance without user confirmation)
368
- </writing_guidelines>
369
-
370
- <usage_guidelines>
371
-
372
- **Use checkpoint:human-verify for:**
373
- - Visual verification (UI, layouts, animations)
374
- - Interactive testing (click flows, user journeys)
375
- - Quality checks (audio/video playback, animation smoothness)
376
- - Confirming deployed apps are accessible
377
-
378
- **Use checkpoint:decision for:**
379
- - Technology selection (auth providers, databases, frameworks)
380
- - Architecture choices (monorepo, deployment strategy)
381
- - Design decisions (color schemes, layout approaches)
382
- - Feature prioritization
383
-
384
- **Use checkpoint:human-action for:**
385
- - Email verification links (no API)
386
- - SMS 2FA codes (no API)
387
- - Manual approvals with no automation
388
- - 3D Secure payment flows
389
-
390
- **Don't use checkpoints for:**
391
- - Things Claude can verify programmatically (tests pass, build succeeds)
392
- - File operations (Claude can read files to verify)
393
- - Code correctness (use tests and static analysis)
394
- - Anything automatable via CLI/API
395
- </usage_guidelines>
396
-
397
- <placement_guidelines>
398
-
399
- Place checkpoints:
400
- - **After automation completes** - not before Claude does the work
401
- - **After UI buildout** - before declaring phase complete
402
- - **Before dependent work** - decisions before implementation
403
- - **At integration points** - after configuring external services
404
-
405
- Bad placement:
406
- - Before Claude automates (asking human to do automatable work) ❌
407
- - Too frequent (every other task is a checkpoint) ❌
408
- - Too late (checkpoint is last task, but earlier tasks needed its result) ❌
409
- </placement_guidelines>
410
-
411
- <examples>
412
-
413
- ### Example 1: Deployment Flow (Correct)
200
+ **Env files:** Use Write/Edit tools. Never ask human to create .env manually.
414
201
 
415
- ```xml
416
- <!-- Claude automates everything -->
417
- <task type="auto">
418
- <name>Deploy to Vercel</name>
419
- <files>.vercel/, vercel.json, package.json</files>
420
- <action>
421
- 1. Run `vercel --yes` to create project and deploy
422
- 2. Capture deployment URL from output
423
- 3. Set environment variables with `vercel env add`
424
- 4. Trigger production deployment with `vercel --prod`
425
- </action>
426
- <verify>
427
- - vercel ls shows deployment
428
- - curl {url} returns 200
429
- - Environment variables set correctly
430
- </verify>
431
- <done>App deployed to production, URL captured</done>
432
- </task>
202
+ **Quick reference:**
433
203
 
434
- <!-- Human verifies visual/functional correctness -->
435
- <task type="checkpoint:human-verify" gate="blocking">
436
- <what-built>Deployed to https://myapp.vercel.app</what-built>
437
- <how-to-verify>
438
- Visit https://myapp.vercel.app and confirm:
439
- - Homepage loads correctly
440
- - All images/assets load
441
- - Navigation works
442
- - No console errors
443
- </how-to-verify>
444
- <resume-signal>Type "approved" or describe issues</resume-signal>
445
- </task>
446
- ```
204
+ | Action | Automatable? | Claude does it? |
205
+ |--------|--------------|-----------------|
206
+ | Deploy to Vercel | Yes (`vercel`) | YES |
207
+ | Create Stripe webhook | Yes (API) | YES |
208
+ | Write .env file | Yes (Write tool) | YES |
209
+ | Create Upstash DB | Yes (`upstash`) | YES |
210
+ | Run tests | Yes (`npm test`) | YES |
211
+ | Click email verification link | No | NO |
212
+ | Enter credit card with 3DS | No | NO |
447
213
 
448
- ### Example 2: Database Setup (Correct)
214
+ </automation_reference>
449
215
 
450
- ```xml
451
- <!-- Claude automates everything -->
452
- <task type="auto">
453
- <name>Create Upstash Redis database</name>
454
- <files>.env</files>
455
- <action>
456
- 1. Run `upstash redis create myapp-cache --region us-east-1`
457
- 2. Capture connection URL from output
458
- 3. Write to .env: UPSTASH_REDIS_URL={url}
459
- 4. Verify connection with test command
460
- </action>
461
- <verify>
462
- - upstash redis list shows database
463
- - .env contains UPSTASH_REDIS_URL
464
- - Test connection succeeds
465
- </verify>
466
- <done>Redis database created and configured</done>
467
- </task>
216
+ <guidelines>
468
217
 
469
- <!-- NO CHECKPOINT NEEDED - Claude automated everything and verified programmatically -->
470
- ```
218
+ **DO:**
219
+ - Automate everything with CLI/API before checkpoint
220
+ - Be specific: "Visit https://myapp.vercel.app" not "check deployment"
221
+ - Number verification steps
222
+ - State expected outcomes
223
+ - Make verification executable
471
224
 
472
- ### Example 3: Stripe Webhooks (Correct)
225
+ **DON'T:**
226
+ - Ask human to do work Claude can automate
227
+ - Assume knowledge: "Configure the usual settings"
228
+ - Mix multiple verifications in one checkpoint
229
+ - Use checkpoints too frequently (verification fatigue)
473
230
 
474
- ```xml
475
- <!-- Claude automates everything -->
476
- <task type="auto">
477
- <name>Configure Stripe webhooks</name>
478
- <files>.env, src/app/api/webhooks/route.ts</files>
479
- <action>
480
- 1. Use Stripe API to create webhook endpoint pointing to /api/webhooks
481
- 2. Subscribe to events: payment_intent.succeeded, customer.subscription.updated
482
- 3. Save webhook signing secret to .env
483
- 4. Implement webhook handler in route.ts
484
- </action>
485
- <verify>
486
- - Stripe API returns webhook endpoint ID
487
- - .env contains STRIPE_WEBHOOK_SECRET
488
- - curl webhook endpoint returns 200
489
- </verify>
490
- <done>Stripe webhooks configured and handler implemented</done>
491
- </task>
231
+ **Placement:**
232
+ - After automation completes (not before)
233
+ - After UI buildout
234
+ - Before dependent work (decisions)
235
+ - At integration points
492
236
 
493
- <!-- Human verifies in Stripe dashboard -->
494
- <task type="checkpoint:human-verify" gate="blocking">
495
- <what-built>Stripe webhook configured via API</what-built>
496
- <how-to-verify>
497
- Visit Stripe Dashboard > Developers > Webhooks
498
- Confirm: Endpoint shows https://myapp.com/api/webhooks with correct events
499
- </how-to-verify>
500
- <resume-signal>Type "yes" if correct</resume-signal>
501
- </task>
502
- ```
503
- </examples>
237
+ </guidelines>
504
238
 
505
239
  <anti_patterns>
506
240
 
507
- ### ❌ BAD: Asking human to automate
508
-
241
+ **BAD: Asking human to automate**
509
242
  ```xml
510
- <task type="checkpoint:human-action" gate="blocking">
243
+ <task type="checkpoint:human-action">
511
244
  <action>Deploy to Vercel</action>
512
- <instructions>
513
- 1. Visit vercel.com/new
514
- 2. Import Git repository
515
- 3. Click Deploy
516
- 4. Copy deployment URL
517
- </instructions>
518
- <verification>Deployment exists</verification>
519
- <resume-signal>Paste URL</resume-signal>
245
+ <instructions>Visit vercel.com/new, import repo, click Deploy</instructions>
520
246
  </task>
521
247
  ```
248
+ Why bad: Vercel has CLI. Use `vercel --yes`.
522
249
 
523
- **Why bad:** Vercel has a CLI. Claude should run `vercel --yes`.
524
-
525
- ### ✅ GOOD: Claude automates, human verifies
526
-
527
- ```xml
528
- <task type="auto">
529
- <name>Deploy to Vercel</name>
530
- <action>Run `vercel --yes`. Capture URL.</action>
531
- <verify>vercel ls shows deployment, curl returns 200</verify>
532
- </task>
533
-
534
- <task type="checkpoint:human-verify">
535
- <what-built>Deployed to {url}</what-built>
536
- <how-to-verify>Visit {url}, check homepage loads</how-to-verify>
537
- <resume-signal>Type "approved"</resume-signal>
538
- </task>
539
- ```
540
-
541
- ### ❌ BAD: Too many checkpoints
542
-
250
+ **BAD: Too many checkpoints**
543
251
  ```xml
544
252
  <task type="auto">Create schema</task>
545
253
  <task type="checkpoint:human-verify">Check schema</task>
546
- <task type="auto">Create API route</task>
254
+ <task type="auto">Create API</task>
547
255
  <task type="checkpoint:human-verify">Check API</task>
548
- <task type="auto">Create UI form</task>
549
- <task type="checkpoint:human-verify">Check form</task>
550
256
  ```
257
+ Why bad: Verification fatigue. Combine into one checkpoint at end.
551
258
 
552
- **Why bad:** Verification fatigue. Combine into one checkpoint at end.
553
-
554
- ### ✅ GOOD: Single verification checkpoint
555
-
259
+ **GOOD: Claude automates, human verifies once**
556
260
  ```xml
557
261
  <task type="auto">Create schema</task>
558
- <task type="auto">Create API route</task>
559
- <task type="auto">Create UI form</task>
262
+ <task type="auto">Create API</task>
263
+ <task type="auto">Create UI</task>
560
264
 
561
265
  <task type="checkpoint:human-verify">
562
- <what-built>Complete auth flow (schema + API + UI)</what-built>
266
+ <what-built>Complete auth flow</what-built>
563
267
  <how-to-verify>Test full flow: register, login, access protected page</how-to-verify>
564
- <resume-signal>Type "approved"</resume-signal>
565
- </task>
566
- ```
567
-
568
- ### ❌ BAD: Asking for automatable file operations
569
-
570
- ```xml
571
- <task type="checkpoint:human-action">
572
- <action>Create .env file</action>
573
- <instructions>
574
- 1. Create .env in project root
575
- 2. Add: DATABASE_URL=...
576
- 3. Add: STRIPE_KEY=...
577
- </instructions>
578
268
  </task>
579
269
  ```
580
270
 
581
- **Why bad:** Claude has Write tool. This should be `type="auto"`.
582
271
  </anti_patterns>
583
272
 
584
273
  <summary>
585
274
 
586
- Checkpoints formalize human-in-the-loop points. Use them when Claude cannot complete a task autonomously OR when human verification is required for correctness.
587
-
588
275
  **The golden rule:** If Claude CAN automate it, Claude MUST automate it.
589
276
 
590
277
  **Checkpoint priority:**
591
- 1. **checkpoint:human-verify** (90% of checkpoints) - Claude automated everything, human confirms visual/functional correctness
592
- 2. **checkpoint:decision** (9% of checkpoints) - Human makes architectural/technology choices
593
- 3. **checkpoint:human-action** (1% of checkpoints) - Truly unavoidable manual steps with no API/CLI
278
+ 1. **checkpoint:human-verify** (90%) - Claude automated, human confirms visual/functional correctness
279
+ 2. **checkpoint:decision** (9%) - Human makes architectural/technology choices
280
+ 3. **checkpoint:human-action** (1%) - Truly unavoidable manual steps with no API/CLI
281
+
282
+ **When NOT to use checkpoints:**
283
+ - Things Claude can verify programmatically (tests, builds)
284
+ - File operations (Claude can read/write)
285
+ - Anything with CLI/API available
286
+
594
287
  </summary>