@undeemed/get-shit-done-codex 1.6.12 → 1.20.3
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +40 -7
- package/agents/gsd-codebase-mapper.md +761 -0
- package/agents/gsd-debugger.md +1198 -0
- package/agents/gsd-executor.md +419 -0
- package/agents/gsd-integration-checker.md +423 -0
- package/agents/gsd-phase-researcher.md +469 -0
- package/agents/gsd-plan-checker.md +622 -0
- package/agents/gsd-planner.md +1159 -0
- package/agents/gsd-project-researcher.md +618 -0
- package/agents/gsd-research-synthesizer.md +236 -0
- package/agents/gsd-roadmapper.md +639 -0
- package/agents/gsd-verifier.md +541 -0
- package/bin/install.js +108 -102
- package/commands/gsd/add-phase.md +17 -185
- package/commands/gsd/add-todo.md +23 -163
- package/commands/gsd/audit-milestone.md +3 -219
- package/commands/gsd/check-todos.md +20 -196
- package/commands/gsd/cleanup.md +18 -0
- package/commands/gsd/complete-milestone.md +2 -2
- package/commands/gsd/debug.md +13 -0
- package/commands/gsd/discuss-phase.md +13 -6
- package/commands/gsd/execute-phase.md +4 -266
- package/commands/gsd/health.md +22 -0
- package/commands/gsd/help.md +8 -369
- package/commands/gsd/insert-phase.md +9 -203
- package/commands/gsd/join-discord.md +18 -0
- package/commands/gsd/list-phase-assumptions.md +4 -4
- package/commands/gsd/map-codebase.md +1 -1
- package/commands/gsd/new-milestone.md +16 -682
- package/commands/gsd/new-project.md +12 -866
- package/commands/gsd/new-project.md.bak +1041 -0
- package/commands/gsd/pause-work.md +17 -105
- package/commands/gsd/plan-milestone-gaps.md +3 -247
- package/commands/gsd/plan-phase.md +13 -444
- package/commands/gsd/progress.md +5 -337
- package/commands/gsd/quick.md +40 -0
- package/commands/gsd/reapply-patches.md +110 -0
- package/commands/gsd/remove-phase.md +9 -315
- package/commands/gsd/research-phase.md +27 -20
- package/commands/gsd/resume-work.md +2 -2
- package/commands/gsd/set-profile.md +34 -0
- package/commands/gsd/settings.md +36 -0
- package/commands/gsd/update.md +25 -160
- package/commands/gsd/verify-work.md +6 -186
- package/get-shit-done/bin/gsd-tools.cjs +5243 -0
- package/get-shit-done/bin/gsd-tools.test.cjs +2273 -0
- package/get-shit-done/references/checkpoints.md +270 -283
- package/get-shit-done/references/decimal-phase-calculation.md +65 -0
- package/get-shit-done/references/git-integration.md +7 -13
- package/get-shit-done/references/git-planning-commit.md +38 -0
- package/get-shit-done/references/model-profile-resolution.md +34 -0
- package/get-shit-done/references/model-profiles.md +92 -0
- package/get-shit-done/references/phase-argument-parsing.md +61 -0
- package/get-shit-done/references/planning-config.md +196 -0
- package/get-shit-done/references/questioning.md +5 -1
- package/get-shit-done/references/verification-patterns.md +17 -0
- package/get-shit-done/templates/DEBUG.md +4 -4
- package/get-shit-done/templates/UAT.md +1 -1
- package/get-shit-done/templates/codebase/architecture.md +1 -1
- package/get-shit-done/templates/codebase/concerns.md +1 -1
- package/get-shit-done/templates/codebase/conventions.md +1 -1
- package/get-shit-done/templates/codebase/structure.md +9 -9
- package/get-shit-done/templates/config.json +10 -0
- package/get-shit-done/templates/context.md +7 -15
- package/get-shit-done/templates/continue-here.md +1 -1
- package/get-shit-done/templates/phase-prompt.md +32 -41
- package/get-shit-done/templates/planner-subagent-prompt.md +4 -4
- package/get-shit-done/templates/project.md +1 -1
- package/get-shit-done/templates/research-project/ARCHITECTURE.md +1 -1
- package/get-shit-done/templates/research.md +27 -4
- package/get-shit-done/templates/state.md +1 -31
- package/get-shit-done/templates/summary-complex.md +59 -0
- package/get-shit-done/templates/summary-minimal.md +41 -0
- package/get-shit-done/templates/summary-standard.md +48 -0
- package/get-shit-done/templates/summary.md +5 -28
- package/get-shit-done/templates/user-setup.md +8 -20
- package/get-shit-done/templates/verification-report.md +3 -3
- package/get-shit-done/workflows/add-phase.md +111 -0
- package/get-shit-done/workflows/add-todo.md +157 -0
- package/get-shit-done/workflows/audit-milestone.md +242 -0
- package/get-shit-done/workflows/check-todos.md +176 -0
- package/get-shit-done/workflows/cleanup.md +152 -0
- package/get-shit-done/workflows/complete-milestone.md +225 -301
- package/get-shit-done/workflows/diagnose-issues.md +3 -17
- package/get-shit-done/workflows/discovery-phase.md +11 -15
- package/get-shit-done/workflows/discuss-phase.md +105 -42
- package/get-shit-done/workflows/execute-phase.md +205 -349
- package/get-shit-done/workflows/execute-plan.md +179 -1569
- package/get-shit-done/workflows/health.md +156 -0
- package/get-shit-done/workflows/help.md +486 -0
- package/get-shit-done/workflows/insert-phase.md +129 -0
- package/get-shit-done/workflows/list-phase-assumptions.md +9 -9
- package/get-shit-done/workflows/map-codebase.md +56 -18
- package/get-shit-done/workflows/new-milestone.md +373 -0
- package/get-shit-done/workflows/new-project.md +1113 -0
- package/get-shit-done/workflows/pause-work.md +122 -0
- package/get-shit-done/workflows/plan-milestone-gaps.md +256 -0
- package/get-shit-done/workflows/plan-phase.md +448 -0
- package/get-shit-done/workflows/progress.md +393 -0
- package/get-shit-done/workflows/quick.md +444 -0
- package/get-shit-done/workflows/remove-phase.md +154 -0
- package/get-shit-done/workflows/research-phase.md +74 -0
- package/get-shit-done/workflows/resume-project.md +18 -23
- package/get-shit-done/workflows/set-profile.md +80 -0
- package/get-shit-done/workflows/settings.md +200 -0
- package/get-shit-done/workflows/transition.md +78 -103
- package/get-shit-done/workflows/update.md +214 -0
- package/get-shit-done/workflows/verify-phase.md +109 -496
- package/get-shit-done/workflows/verify-work.md +22 -15
- package/hooks/dist/gsd-check-update.js +66 -0
- package/hooks/dist/gsd-statusline.js +91 -0
- package/package.json +19 -3
- package/scripts/build-hooks.js +42 -0
- package/commands/gsd/whats-new.md +0 -124
|
@@ -1,7 +1,13 @@
|
|
|
1
1
|
<overview>
|
|
2
|
-
Plans execute autonomously. Checkpoints formalize
|
|
2
|
+
Plans execute autonomously. Checkpoints formalize interaction points where human verification or decisions are needed.
|
|
3
3
|
|
|
4
|
-
**Core principle:**
|
|
4
|
+
**Core principle:** Codex automates everything with CLI/API. Checkpoints are for verification and decisions, not manual work.
|
|
5
|
+
|
|
6
|
+
**Golden rules:**
|
|
7
|
+
1. **If Codex can run it, Codex runs it** - Never ask user to execute CLI commands, start servers, or run builds
|
|
8
|
+
2. **Codex sets up the verification environment** - Start dev servers, seed databases, configure env vars
|
|
9
|
+
3. **User only does what requires human judgment** - Visual checks, UX evaluation, "does this feel right?"
|
|
10
|
+
4. **Secrets come from user, automation comes from Codex** - Ask for API keys, then Codex uses them via CLI
|
|
5
11
|
</overview>
|
|
6
12
|
|
|
7
13
|
<checkpoint_types>
|
|
@@ -9,7 +15,7 @@ Plans execute autonomously. Checkpoints formalize the interaction points where h
|
|
|
9
15
|
<type name="human-verify">
|
|
10
16
|
## checkpoint:human-verify (Most Common - 90%)
|
|
11
17
|
|
|
12
|
-
**When:**
|
|
18
|
+
**When:** Codex completed automated work, human confirms it works correctly.
|
|
13
19
|
|
|
14
20
|
**Use for:**
|
|
15
21
|
- Visual UI checks (layout, styling, responsiveness)
|
|
@@ -22,7 +28,7 @@ Plans execute autonomously. Checkpoints formalize the interaction points where h
|
|
|
22
28
|
**Structure:**
|
|
23
29
|
```xml
|
|
24
30
|
<task type="checkpoint:human-verify" gate="blocking">
|
|
25
|
-
<what-built>[What
|
|
31
|
+
<what-built>[What Codex automated and deployed/built]</what-built>
|
|
26
32
|
<how-to-verify>
|
|
27
33
|
[Exact steps to test - URLs, commands, expected behavior]
|
|
28
34
|
</how-to-verify>
|
|
@@ -30,34 +36,7 @@ Plans execute autonomously. Checkpoints formalize the interaction points where h
|
|
|
30
36
|
</task>
|
|
31
37
|
```
|
|
32
38
|
|
|
33
|
-
**
|
|
34
|
-
- `<what-built>`: What Claude automated (deployed, built, configured)
|
|
35
|
-
- `<how-to-verify>`: Exact steps to confirm it works (numbered, specific)
|
|
36
|
-
- `<resume-signal>`: Clear indication of how to continue
|
|
37
|
-
|
|
38
|
-
**Example: Vercel Deployment**
|
|
39
|
-
```xml
|
|
40
|
-
<task type="auto">
|
|
41
|
-
<name>Deploy to Vercel</name>
|
|
42
|
-
<files>.vercel/, vercel.json</files>
|
|
43
|
-
<action>Run `vercel --yes` to create project and deploy. Capture deployment URL from output.</action>
|
|
44
|
-
<verify>vercel ls shows deployment, curl {url} returns 200</verify>
|
|
45
|
-
<done>App deployed, URL captured</done>
|
|
46
|
-
</task>
|
|
47
|
-
|
|
48
|
-
<task type="checkpoint:human-verify" gate="blocking">
|
|
49
|
-
<what-built>Deployed to Vercel at https://myapp-abc123.vercel.app</what-built>
|
|
50
|
-
<how-to-verify>
|
|
51
|
-
Visit https://myapp-abc123.vercel.app and confirm:
|
|
52
|
-
- Homepage loads without errors
|
|
53
|
-
- Login form is visible
|
|
54
|
-
- No console errors in browser DevTools
|
|
55
|
-
</how-to-verify>
|
|
56
|
-
<resume-signal>Type "approved" to continue, or describe issues to fix</resume-signal>
|
|
57
|
-
</task>
|
|
58
|
-
```
|
|
59
|
-
|
|
60
|
-
**Example: UI Component**
|
|
39
|
+
**Example: UI Component (shows key pattern: Codex starts server BEFORE checkpoint)**
|
|
61
40
|
```xml
|
|
62
41
|
<task type="auto">
|
|
63
42
|
<name>Build responsive dashboard layout</name>
|
|
@@ -67,15 +46,21 @@ Plans execute autonomously. Checkpoints formalize the interaction points where h
|
|
|
67
46
|
<done>Dashboard component builds without errors</done>
|
|
68
47
|
</task>
|
|
69
48
|
|
|
49
|
+
<task type="auto">
|
|
50
|
+
<name>Start dev server for verification</name>
|
|
51
|
+
<action>Run `npm run dev` in background, wait for "ready" message, capture port</action>
|
|
52
|
+
<verify>curl http://localhost:3000 returns 200</verify>
|
|
53
|
+
<done>Dev server running at http://localhost:3000</done>
|
|
54
|
+
</task>
|
|
55
|
+
|
|
70
56
|
<task type="checkpoint:human-verify" gate="blocking">
|
|
71
|
-
<what-built>Responsive dashboard layout at
|
|
57
|
+
<what-built>Responsive dashboard layout - dev server running at http://localhost:3000</what-built>
|
|
72
58
|
<how-to-verify>
|
|
73
|
-
|
|
74
|
-
|
|
75
|
-
|
|
76
|
-
|
|
77
|
-
|
|
78
|
-
6. Check: No layout shift, no horizontal scroll
|
|
59
|
+
Visit http://localhost:3000/dashboard and verify:
|
|
60
|
+
1. Desktop (>1024px): Sidebar left, content right, header top
|
|
61
|
+
2. Tablet (768px): Sidebar collapses to hamburger menu
|
|
62
|
+
3. Mobile (375px): Single column layout, bottom nav appears
|
|
63
|
+
4. No layout shift or horizontal scroll at any size
|
|
79
64
|
</how-to-verify>
|
|
80
65
|
<resume-signal>Type "approved" or describe layout issues</resume-signal>
|
|
81
66
|
</task>
|
|
@@ -138,12 +123,6 @@ Plans execute autonomously. Checkpoints formalize the interaction points where h
|
|
|
138
123
|
</task>
|
|
139
124
|
```
|
|
140
125
|
|
|
141
|
-
**Key elements:**
|
|
142
|
-
- `<decision>`: What's being decided
|
|
143
|
-
- `<context>`: Why this matters
|
|
144
|
-
- `<options>`: Each option with balanced pros/cons (not prescriptive)
|
|
145
|
-
- `<resume-signal>`: How to indicate choice
|
|
146
|
-
|
|
147
126
|
**Example: Auth Provider Selection**
|
|
148
127
|
```xml
|
|
149
128
|
<task type="checkpoint:decision" gate="blocking">
|
|
@@ -205,38 +184,35 @@ Plans execute autonomously. Checkpoints formalize the interaction points where h
|
|
|
205
184
|
<type name="human-action">
|
|
206
185
|
## checkpoint:human-action (1% - Rare)
|
|
207
186
|
|
|
208
|
-
**When:** Action has NO CLI/API and requires human-only interaction, OR
|
|
187
|
+
**When:** Action has NO CLI/API and requires human-only interaction, OR Codex hit an authentication gate during automation.
|
|
209
188
|
|
|
210
189
|
**Use ONLY for:**
|
|
211
|
-
- **Authentication gates** -
|
|
212
|
-
- Email verification links (
|
|
190
|
+
- **Authentication gates** - Codex tried CLI/API but needs credentials (this is NOT a failure)
|
|
191
|
+
- Email verification links (clicking email)
|
|
213
192
|
- SMS 2FA codes (phone verification)
|
|
214
|
-
- Manual account approvals (platform requires human review
|
|
193
|
+
- Manual account approvals (platform requires human review)
|
|
215
194
|
- Credit card 3D Secure flows (web-based payment authorization)
|
|
216
|
-
- OAuth app approvals (
|
|
195
|
+
- OAuth app approvals (web-based approval)
|
|
217
196
|
|
|
218
197
|
**Do NOT use for pre-planned manual work:**
|
|
219
|
-
-
|
|
220
|
-
-
|
|
221
|
-
-
|
|
222
|
-
-
|
|
223
|
-
- Creating files manually (use Write tool)
|
|
198
|
+
- Deploying (use CLI - auth gate if needed)
|
|
199
|
+
- Creating webhooks/databases (use API/CLI - auth gate if needed)
|
|
200
|
+
- Running builds/tests (use Bash tool)
|
|
201
|
+
- Creating files (use Write tool)
|
|
224
202
|
|
|
225
203
|
**Structure:**
|
|
226
204
|
```xml
|
|
227
205
|
<task type="checkpoint:human-action" gate="blocking">
|
|
228
|
-
<action>[What human must do -
|
|
206
|
+
<action>[What human must do - Codex already did everything automatable]</action>
|
|
229
207
|
<instructions>
|
|
230
|
-
[What
|
|
208
|
+
[What Codex already automated]
|
|
231
209
|
[The ONE thing requiring human action]
|
|
232
210
|
</instructions>
|
|
233
|
-
<verification>[What
|
|
211
|
+
<verification>[What Codex can check afterward]</verification>
|
|
234
212
|
<resume-signal>[How to continue]</resume-signal>
|
|
235
213
|
</task>
|
|
236
214
|
```
|
|
237
215
|
|
|
238
|
-
**Key principle:** Claude automates EVERYTHING possible first, only asks human for the truly unavoidable manual step.
|
|
239
|
-
|
|
240
216
|
**Example: Email Verification**
|
|
241
217
|
```xml
|
|
242
218
|
<task type="auto">
|
|
@@ -257,26 +233,6 @@ Plans execute autonomously. Checkpoints formalize the interaction points where h
|
|
|
257
233
|
</task>
|
|
258
234
|
```
|
|
259
235
|
|
|
260
|
-
**Example: Credit Card 3D Secure**
|
|
261
|
-
```xml
|
|
262
|
-
<task type="auto">
|
|
263
|
-
<name>Create Stripe payment intent</name>
|
|
264
|
-
<action>Use Stripe API to create payment intent for $99. Generate checkout URL.</action>
|
|
265
|
-
<verify>Stripe API returns payment intent ID and URL</verify>
|
|
266
|
-
<done>Payment intent created</done>
|
|
267
|
-
</task>
|
|
268
|
-
|
|
269
|
-
<task type="checkpoint:human-action" gate="blocking">
|
|
270
|
-
<action>Complete 3D Secure authentication</action>
|
|
271
|
-
<instructions>
|
|
272
|
-
I created the payment intent: https://checkout.stripe.com/pay/cs_test_abc123
|
|
273
|
-
Visit that URL and complete the 3D Secure verification flow with your test card.
|
|
274
|
-
</instructions>
|
|
275
|
-
<verification>Stripe webhook receives payment_intent.succeeded event</verification>
|
|
276
|
-
<resume-signal>Type "done" when payment completes</resume-signal>
|
|
277
|
-
</task>
|
|
278
|
-
```
|
|
279
|
-
|
|
280
236
|
**Example: Authentication Gate (Dynamic Checkpoint)**
|
|
281
237
|
```xml
|
|
282
238
|
<task type="auto">
|
|
@@ -286,7 +242,7 @@ Plans execute autonomously. Checkpoints formalize the interaction points where h
|
|
|
286
242
|
<verify>vercel ls shows deployment, curl returns 200</verify>
|
|
287
243
|
</task>
|
|
288
244
|
|
|
289
|
-
<!-- If vercel returns "Error: Not authenticated",
|
|
245
|
+
<!-- If vercel returns "Error: Not authenticated", Codex creates checkpoint on the fly -->
|
|
290
246
|
|
|
291
247
|
<task type="checkpoint:human-action" gate="blocking">
|
|
292
248
|
<action>Authenticate Vercel CLI so I can continue deployment</action>
|
|
@@ -299,7 +255,7 @@ Plans execute autonomously. Checkpoints formalize the interaction points where h
|
|
|
299
255
|
<resume-signal>Type "done" when authenticated</resume-signal>
|
|
300
256
|
</task>
|
|
301
257
|
|
|
302
|
-
<!-- After authentication,
|
|
258
|
+
<!-- After authentication, Codex retries the deployment -->
|
|
303
259
|
|
|
304
260
|
<task type="auto">
|
|
305
261
|
<name>Retry Vercel deployment</name>
|
|
@@ -308,13 +264,13 @@ Plans execute autonomously. Checkpoints formalize the interaction points where h
|
|
|
308
264
|
</task>
|
|
309
265
|
```
|
|
310
266
|
|
|
311
|
-
**Key distinction:**
|
|
267
|
+
**Key distinction:** Auth gates are created dynamically when Codex encounters auth errors. NOT pre-planned — Codex automates first, asks for credentials only when blocked.
|
|
312
268
|
</type>
|
|
313
269
|
</checkpoint_types>
|
|
314
270
|
|
|
315
271
|
<execution_protocol>
|
|
316
272
|
|
|
317
|
-
When
|
|
273
|
+
When Codex encounters `type="checkpoint:*"`:
|
|
318
274
|
|
|
319
275
|
1. **Stop immediately** - do not proceed to next task
|
|
320
276
|
2. **Display checkpoint clearly** using the format below
|
|
@@ -334,11 +290,10 @@ Task: Responsive dashboard layout
|
|
|
334
290
|
Built: Responsive dashboard at /dashboard
|
|
335
291
|
|
|
336
292
|
How to verify:
|
|
337
|
-
1.
|
|
338
|
-
2.
|
|
339
|
-
3.
|
|
340
|
-
4.
|
|
341
|
-
5. Mobile (375px): Sidebar hidden, hamburger menu appears
|
|
293
|
+
1. Visit: http://localhost:3000/dashboard
|
|
294
|
+
2. Desktop (>1024px): Sidebar visible, content fills remaining space
|
|
295
|
+
3. Tablet (768px): Sidebar collapses to icons
|
|
296
|
+
4. Mobile (375px): Sidebar hidden, hamburger menu appears
|
|
342
297
|
|
|
343
298
|
────────────────────────────────────────────────────────
|
|
344
299
|
→ YOUR ACTION: Type "approved" or describe issues
|
|
@@ -403,9 +358,9 @@ I'll verify: vercel whoami returns your account
|
|
|
403
358
|
|
|
404
359
|
<authentication_gates>
|
|
405
360
|
|
|
406
|
-
**
|
|
361
|
+
**Auth gate = Codex tried CLI/API, got auth error.** Not a failure — a gate requiring human input to unblock.
|
|
407
362
|
|
|
408
|
-
**Pattern:**
|
|
363
|
+
**Pattern:** Codex tries automation → auth error → creates checkpoint:human-action → user authenticates → Codex retries → continues
|
|
409
364
|
|
|
410
365
|
**Gate protocol:**
|
|
411
366
|
1. Recognize it's not a failure - missing auth is expected
|
|
@@ -416,55 +371,17 @@ I'll verify: vercel whoami returns your account
|
|
|
416
371
|
6. Retry the original task
|
|
417
372
|
7. Continue normally
|
|
418
373
|
|
|
419
|
-
**Example execution flow (Vercel auth gate):**
|
|
420
|
-
|
|
421
|
-
```
|
|
422
|
-
Claude: Running `vercel --yes` to deploy...
|
|
423
|
-
|
|
424
|
-
Error: Not authenticated. Please run 'vercel login'
|
|
425
|
-
|
|
426
|
-
╔═══════════════════════════════════════════════════════╗
|
|
427
|
-
║ CHECKPOINT: Action Required ║
|
|
428
|
-
╚═══════════════════════════════════════════════════════╝
|
|
429
|
-
|
|
430
|
-
Progress: 2/8 tasks complete
|
|
431
|
-
Task: Deploy to Vercel
|
|
432
|
-
|
|
433
|
-
Attempted: vercel --yes
|
|
434
|
-
Error: Not authenticated
|
|
435
|
-
|
|
436
|
-
What you need to do:
|
|
437
|
-
1. Run: vercel login
|
|
438
|
-
2. Complete browser authentication
|
|
439
|
-
|
|
440
|
-
I'll verify: vercel whoami returns your account
|
|
441
|
-
|
|
442
|
-
────────────────────────────────────────────────────────
|
|
443
|
-
→ YOUR ACTION: Type "done" when authenticated
|
|
444
|
-
────────────────────────────────────────────────────────
|
|
445
|
-
|
|
446
|
-
User: done
|
|
447
|
-
|
|
448
|
-
Claude: Verifying authentication...
|
|
449
|
-
Running: vercel whoami
|
|
450
|
-
✓ Authenticated as: user@example.com
|
|
451
|
-
|
|
452
|
-
Retrying deployment...
|
|
453
|
-
Running: vercel --yes
|
|
454
|
-
✓ Deployed to: https://myapp-abc123.vercel.app
|
|
455
|
-
|
|
456
|
-
Task 3 complete. Continuing to task 4...
|
|
457
|
-
```
|
|
458
|
-
|
|
459
374
|
**Key distinction:**
|
|
460
|
-
- Pre-planned checkpoint: "I need you to do X" (wrong -
|
|
375
|
+
- Pre-planned checkpoint: "I need you to do X" (wrong - Codex should automate)
|
|
461
376
|
- Auth gate: "I tried to automate X but need credentials" (correct - unblocks automation)
|
|
462
377
|
|
|
463
378
|
</authentication_gates>
|
|
464
379
|
|
|
465
380
|
<automation_reference>
|
|
466
381
|
|
|
467
|
-
**The rule:** If it has CLI/API,
|
|
382
|
+
**The rule:** If it has CLI/API, Codex does it. Never ask human to perform automatable work.
|
|
383
|
+
|
|
384
|
+
## Service CLI Reference
|
|
468
385
|
|
|
469
386
|
| Service | CLI/API | Key Commands | Auth Gate |
|
|
470
387
|
|---------|---------|--------------|-----------|
|
|
@@ -476,24 +393,141 @@ Task 3 complete. Continuing to task 4...
|
|
|
476
393
|
| Upstash | `upstash` | `redis create`, `redis get` | `upstash auth login` |
|
|
477
394
|
| PlanetScale | `pscale` | `database create`, `branch create` | `pscale auth login` |
|
|
478
395
|
| GitHub | `gh` | `repo create`, `pr create`, `secret set` | `gh auth login` |
|
|
479
|
-
| Node | `npm`/`pnpm` | `install`, `run build`, `test` | N/A |
|
|
396
|
+
| Node | `npm`/`pnpm` | `install`, `run build`, `test`, `run dev` | N/A |
|
|
480
397
|
| Xcode | `xcodebuild` | `-project`, `-scheme`, `build`, `test` | N/A |
|
|
481
|
-
| Convex | `npx convex` | `dev`, `deploy`, `
|
|
398
|
+
| Convex | `npx convex` | `dev`, `deploy`, `env set`, `env get` | `npx convex login` |
|
|
399
|
+
|
|
400
|
+
## Environment Variable Automation
|
|
482
401
|
|
|
483
402
|
**Env files:** Use Write/Edit tools. Never ask human to create .env manually.
|
|
484
403
|
|
|
485
|
-
**
|
|
404
|
+
**Dashboard env vars via CLI:**
|
|
405
|
+
|
|
406
|
+
| Platform | CLI Command | Example |
|
|
407
|
+
|----------|-------------|---------|
|
|
408
|
+
| Convex | `npx convex env set` | `npx convex env set OPENAI_API_KEY sk-...` |
|
|
409
|
+
| Vercel | `vercel env add` | `vercel env add STRIPE_KEY production` |
|
|
410
|
+
| Railway | `railway variables set` | `railway variables set API_KEY=value` |
|
|
411
|
+
| Fly | `fly secrets set` | `fly secrets set DATABASE_URL=...` |
|
|
412
|
+
| Supabase | `supabase secrets set` | `supabase secrets set MY_SECRET=value` |
|
|
413
|
+
|
|
414
|
+
**Secret collection pattern:**
|
|
415
|
+
```xml
|
|
416
|
+
<!-- WRONG: Asking user to add env vars in dashboard -->
|
|
417
|
+
<task type="checkpoint:human-action">
|
|
418
|
+
<action>Add OPENAI_API_KEY to Convex dashboard</action>
|
|
419
|
+
<instructions>Go to dashboard.convex.dev → Settings → Environment Variables → Add</instructions>
|
|
420
|
+
</task>
|
|
421
|
+
|
|
422
|
+
<!-- RIGHT: Codex asks for value, then adds via CLI -->
|
|
423
|
+
<task type="checkpoint:human-action">
|
|
424
|
+
<action>Provide your OpenAI API key</action>
|
|
425
|
+
<instructions>
|
|
426
|
+
I need your OpenAI API key for Convex backend.
|
|
427
|
+
Get it from: https://platform.openai.com/api-keys
|
|
428
|
+
Paste the key (starts with sk-)
|
|
429
|
+
</instructions>
|
|
430
|
+
<verification>I'll add it via `npx convex env set` and verify</verification>
|
|
431
|
+
<resume-signal>Paste your API key</resume-signal>
|
|
432
|
+
</task>
|
|
433
|
+
|
|
434
|
+
<task type="auto">
|
|
435
|
+
<name>Configure OpenAI key in Convex</name>
|
|
436
|
+
<action>Run `npx convex env set OPENAI_API_KEY {user-provided-key}`</action>
|
|
437
|
+
<verify>`npx convex env get OPENAI_API_KEY` returns the key (masked)</verify>
|
|
438
|
+
</task>
|
|
439
|
+
```
|
|
440
|
+
|
|
441
|
+
## Dev Server Automation
|
|
442
|
+
|
|
443
|
+
| Framework | Start Command | Ready Signal | Default URL |
|
|
444
|
+
|-----------|---------------|--------------|-------------|
|
|
445
|
+
| Next.js | `npm run dev` | "Ready in" or "started server" | http://localhost:3000 |
|
|
446
|
+
| Vite | `npm run dev` | "ready in" | http://localhost:5173 |
|
|
447
|
+
| Convex | `npx convex dev` | "Convex functions ready" | N/A (backend only) |
|
|
448
|
+
| Express | `npm start` | "listening on port" | http://localhost:3000 |
|
|
449
|
+
| Django | `python manage.py runserver` | "Starting development server" | http://localhost:8000 |
|
|
450
|
+
|
|
451
|
+
**Server lifecycle:**
|
|
452
|
+
```bash
|
|
453
|
+
# Run in background, capture PID
|
|
454
|
+
npm run dev &
|
|
455
|
+
DEV_SERVER_PID=$!
|
|
456
|
+
|
|
457
|
+
# Wait for ready (max 30s)
|
|
458
|
+
timeout 30 bash -c 'until curl -s localhost:3000 > /dev/null 2>&1; do sleep 1; done'
|
|
459
|
+
```
|
|
460
|
+
|
|
461
|
+
**Port conflicts:** Kill stale process (`lsof -ti:3000 | xargs kill`) or use alternate port (`--port 3001`).
|
|
462
|
+
|
|
463
|
+
**Server stays running** through checkpoints. Only kill when plan complete, switching to production, or port needed for different service.
|
|
464
|
+
|
|
465
|
+
## CLI Installation Handling
|
|
466
|
+
|
|
467
|
+
| CLI | Auto-install? | Command |
|
|
468
|
+
|-----|---------------|---------|
|
|
469
|
+
| npm/pnpm/yarn | No - ask user | User chooses package manager |
|
|
470
|
+
| vercel | Yes | `npm i -g vercel` |
|
|
471
|
+
| gh (GitHub) | Yes | `brew install gh` (macOS) or `apt install gh` (Linux) |
|
|
472
|
+
| stripe | Yes | `npm i -g stripe` |
|
|
473
|
+
| supabase | Yes | `npm i -g supabase` |
|
|
474
|
+
| convex | No - use npx | `npx convex` (no install needed) |
|
|
475
|
+
| fly | Yes | `brew install flyctl` or curl installer |
|
|
476
|
+
| railway | Yes | `npm i -g @railway/cli` |
|
|
477
|
+
|
|
478
|
+
**Protocol:** Try command → "command not found" → auto-installable? → yes: install silently, retry → no: checkpoint asking user to install.
|
|
479
|
+
|
|
480
|
+
## Pre-Checkpoint Automation Failures
|
|
481
|
+
|
|
482
|
+
| Failure | Response |
|
|
483
|
+
|---------|----------|
|
|
484
|
+
| Server won't start | Check error, fix issue, retry (don't proceed to checkpoint) |
|
|
485
|
+
| Port in use | Kill stale process or use alternate port |
|
|
486
|
+
| Missing dependency | Run `npm install`, retry |
|
|
487
|
+
| Build error | Fix the error first (bug, not checkpoint issue) |
|
|
488
|
+
| Auth error | Create auth gate checkpoint |
|
|
489
|
+
| Network timeout | Retry with backoff, then checkpoint if persistent |
|
|
490
|
+
|
|
491
|
+
**Never present a checkpoint with broken verification environment.** If `curl localhost:3000` fails, don't ask user to "visit localhost:3000".
|
|
492
|
+
|
|
493
|
+
```xml
|
|
494
|
+
<!-- WRONG: Checkpoint with broken environment -->
|
|
495
|
+
<task type="checkpoint:human-verify">
|
|
496
|
+
<what-built>Dashboard (server failed to start)</what-built>
|
|
497
|
+
<how-to-verify>Visit http://localhost:3000...</how-to-verify>
|
|
498
|
+
</task>
|
|
499
|
+
|
|
500
|
+
<!-- RIGHT: Fix first, then checkpoint -->
|
|
501
|
+
<task type="auto">
|
|
502
|
+
<name>Fix server startup issue</name>
|
|
503
|
+
<action>Investigate error, fix root cause, restart server</action>
|
|
504
|
+
<verify>curl http://localhost:3000 returns 200</verify>
|
|
505
|
+
</task>
|
|
506
|
+
|
|
507
|
+
<task type="checkpoint:human-verify">
|
|
508
|
+
<what-built>Dashboard - server running at http://localhost:3000</what-built>
|
|
509
|
+
<how-to-verify>Visit http://localhost:3000/dashboard...</how-to-verify>
|
|
510
|
+
</task>
|
|
511
|
+
```
|
|
512
|
+
|
|
513
|
+
## Automatable Quick Reference
|
|
486
514
|
|
|
487
|
-
| Action | Automatable? |
|
|
515
|
+
| Action | Automatable? | Codex does it? |
|
|
488
516
|
|--------|--------------|-----------------|
|
|
489
517
|
| Deploy to Vercel | Yes (`vercel`) | YES |
|
|
490
518
|
| Create Stripe webhook | Yes (API) | YES |
|
|
491
519
|
| Write .env file | Yes (Write tool) | YES |
|
|
492
520
|
| Create Upstash DB | Yes (`upstash`) | YES |
|
|
493
521
|
| Run tests | Yes (`npm test`) | YES |
|
|
522
|
+
| Start dev server | Yes (`npm run dev`) | YES |
|
|
523
|
+
| Add env vars to Convex | Yes (`npx convex env set`) | YES |
|
|
524
|
+
| Add env vars to Vercel | Yes (`vercel env add`) | YES |
|
|
525
|
+
| Seed database | Yes (CLI/API) | YES |
|
|
494
526
|
| Click email verification link | No | NO |
|
|
495
527
|
| Enter credit card with 3DS | No | NO |
|
|
496
528
|
| Complete OAuth in browser | No | NO |
|
|
529
|
+
| Visually verify UI looks correct | No | NO |
|
|
530
|
+
| Test interactive user flows | No | NO |
|
|
497
531
|
|
|
498
532
|
</automation_reference>
|
|
499
533
|
|
|
@@ -502,71 +536,30 @@ Task 3 complete. Continuing to task 4...
|
|
|
502
536
|
**DO:**
|
|
503
537
|
- Automate everything with CLI/API before checkpoint
|
|
504
538
|
- Be specific: "Visit https://myapp.vercel.app" not "check deployment"
|
|
505
|
-
- Number verification steps
|
|
539
|
+
- Number verification steps
|
|
506
540
|
- State expected outcomes: "You should see X"
|
|
507
541
|
- Provide context: why this checkpoint exists
|
|
508
|
-
- Make verification executable: clear, testable steps
|
|
509
542
|
|
|
510
543
|
**DON'T:**
|
|
511
|
-
- Ask human to do work
|
|
544
|
+
- Ask human to do work Codex can automate ❌
|
|
512
545
|
- Assume knowledge: "Configure the usual settings" ❌
|
|
513
|
-
- Skip steps: "Set up database"
|
|
514
|
-
- Mix multiple verifications in one checkpoint
|
|
515
|
-
- Make verification impossible (Claude can't check visual appearance without user confirmation)
|
|
546
|
+
- Skip steps: "Set up database" (too vague) ❌
|
|
547
|
+
- Mix multiple verifications in one checkpoint ❌
|
|
516
548
|
|
|
517
549
|
**Placement:**
|
|
518
|
-
- **After automation completes** - not before
|
|
550
|
+
- **After automation completes** - not before Codex does the work
|
|
519
551
|
- **After UI buildout** - before declaring phase complete
|
|
520
552
|
- **Before dependent work** - decisions before implementation
|
|
521
553
|
- **At integration points** - after configuring external services
|
|
522
554
|
|
|
523
|
-
**Bad placement:**
|
|
524
|
-
- Before Claude automates (asking human to do automatable work) ❌
|
|
525
|
-
- Too frequent (every other task is a checkpoint) ❌
|
|
526
|
-
- Too late (checkpoint is last task, but earlier tasks needed its result) ❌
|
|
555
|
+
**Bad placement:** Before automation ❌ | Too frequent ❌ | Too late (dependent tasks already needed the result) ❌
|
|
527
556
|
</writing_guidelines>
|
|
528
557
|
|
|
529
558
|
<examples>
|
|
530
559
|
|
|
531
|
-
### Example 1:
|
|
532
|
-
|
|
533
|
-
```xml
|
|
534
|
-
<!-- Claude automates everything -->
|
|
535
|
-
<task type="auto">
|
|
536
|
-
<name>Deploy to Vercel</name>
|
|
537
|
-
<files>.vercel/, vercel.json, package.json</files>
|
|
538
|
-
<action>
|
|
539
|
-
1. Run `vercel --yes` to create project and deploy
|
|
540
|
-
2. Capture deployment URL from output
|
|
541
|
-
3. Set environment variables with `vercel env add`
|
|
542
|
-
4. Trigger production deployment with `vercel --prod`
|
|
543
|
-
</action>
|
|
544
|
-
<verify>
|
|
545
|
-
- vercel ls shows deployment
|
|
546
|
-
- curl {url} returns 200
|
|
547
|
-
- Environment variables set correctly
|
|
548
|
-
</verify>
|
|
549
|
-
<done>App deployed to production, URL captured</done>
|
|
550
|
-
</task>
|
|
551
|
-
|
|
552
|
-
<!-- Human verifies visual/functional correctness -->
|
|
553
|
-
<task type="checkpoint:human-verify" gate="blocking">
|
|
554
|
-
<what-built>Deployed to https://myapp.vercel.app</what-built>
|
|
555
|
-
<how-to-verify>
|
|
556
|
-
Visit https://myapp.vercel.app and confirm:
|
|
557
|
-
- Homepage loads correctly
|
|
558
|
-
- All images/assets load
|
|
559
|
-
- Navigation works
|
|
560
|
-
- No console errors
|
|
561
|
-
</how-to-verify>
|
|
562
|
-
<resume-signal>Type "approved" or describe issues</resume-signal>
|
|
563
|
-
</task>
|
|
564
|
-
```
|
|
565
|
-
|
|
566
|
-
### Example 2: Database Setup (No Checkpoint Needed)
|
|
560
|
+
### Example 1: Database Setup (No Checkpoint Needed)
|
|
567
561
|
|
|
568
562
|
```xml
|
|
569
|
-
<!-- Claude automates everything -->
|
|
570
563
|
<task type="auto">
|
|
571
564
|
<name>Create Upstash Redis database</name>
|
|
572
565
|
<files>.env</files>
|
|
@@ -584,42 +577,10 @@ Task 3 complete. Continuing to task 4...
|
|
|
584
577
|
<done>Redis database created and configured</done>
|
|
585
578
|
</task>
|
|
586
579
|
|
|
587
|
-
<!-- NO CHECKPOINT NEEDED -
|
|
580
|
+
<!-- NO CHECKPOINT NEEDED - Codex automated everything and verified programmatically -->
|
|
588
581
|
```
|
|
589
582
|
|
|
590
|
-
### Example
|
|
591
|
-
|
|
592
|
-
```xml
|
|
593
|
-
<!-- Claude automates everything -->
|
|
594
|
-
<task type="auto">
|
|
595
|
-
<name>Configure Stripe webhooks</name>
|
|
596
|
-
<files>.env, src/app/api/webhooks/route.ts</files>
|
|
597
|
-
<action>
|
|
598
|
-
1. Use Stripe API to create webhook endpoint pointing to /api/webhooks
|
|
599
|
-
2. Subscribe to events: payment_intent.succeeded, customer.subscription.updated
|
|
600
|
-
3. Save webhook signing secret to .env
|
|
601
|
-
4. Implement webhook handler in route.ts
|
|
602
|
-
</action>
|
|
603
|
-
<verify>
|
|
604
|
-
- Stripe API returns webhook endpoint ID
|
|
605
|
-
- .env contains STRIPE_WEBHOOK_SECRET
|
|
606
|
-
- curl webhook endpoint returns 200
|
|
607
|
-
</verify>
|
|
608
|
-
<done>Stripe webhooks configured and handler implemented</done>
|
|
609
|
-
</task>
|
|
610
|
-
|
|
611
|
-
<!-- Human verifies in Stripe dashboard -->
|
|
612
|
-
<task type="checkpoint:human-verify" gate="blocking">
|
|
613
|
-
<what-built>Stripe webhook configured via API</what-built>
|
|
614
|
-
<how-to-verify>
|
|
615
|
-
Visit Stripe Dashboard > Developers > Webhooks
|
|
616
|
-
Confirm: Endpoint shows https://myapp.com/api/webhooks with correct events
|
|
617
|
-
</how-to-verify>
|
|
618
|
-
<resume-signal>Type "yes" if correct</resume-signal>
|
|
619
|
-
</task>
|
|
620
|
-
```
|
|
621
|
-
|
|
622
|
-
### Example 4: Full Auth Flow Verification (Correct)
|
|
583
|
+
### Example 2: Full Auth Flow (Single checkpoint at end)
|
|
623
584
|
|
|
624
585
|
```xml
|
|
625
586
|
<task type="auto">
|
|
@@ -643,17 +604,23 @@ Task 3 complete. Continuing to task 4...
|
|
|
643
604
|
<verify>npm run build succeeds</verify>
|
|
644
605
|
</task>
|
|
645
606
|
|
|
607
|
+
<task type="auto">
|
|
608
|
+
<name>Start dev server for auth testing</name>
|
|
609
|
+
<action>Run `npm run dev` in background, wait for ready signal</action>
|
|
610
|
+
<verify>curl http://localhost:3000 returns 200</verify>
|
|
611
|
+
<done>Dev server running at http://localhost:3000</done>
|
|
612
|
+
</task>
|
|
613
|
+
|
|
646
614
|
<!-- ONE checkpoint at end verifies the complete flow -->
|
|
647
615
|
<task type="checkpoint:human-verify" gate="blocking">
|
|
648
|
-
<what-built>Complete authentication flow
|
|
616
|
+
<what-built>Complete authentication flow - dev server running at http://localhost:3000</what-built>
|
|
649
617
|
<how-to-verify>
|
|
650
|
-
1.
|
|
651
|
-
2.
|
|
652
|
-
3.
|
|
653
|
-
4.
|
|
654
|
-
5.
|
|
655
|
-
6.
|
|
656
|
-
7. Click logout: Session cleared
|
|
618
|
+
1. Visit: http://localhost:3000/login
|
|
619
|
+
2. Click "Sign in with GitHub"
|
|
620
|
+
3. Complete GitHub OAuth flow
|
|
621
|
+
4. Verify: Redirected to /dashboard, user name displayed
|
|
622
|
+
5. Refresh page: Session persists
|
|
623
|
+
6. Click logout: Session cleared
|
|
657
624
|
</how-to-verify>
|
|
658
625
|
<resume-signal>Type "approved" or describe issues</resume-signal>
|
|
659
626
|
</task>
|
|
@@ -662,27 +629,50 @@ Task 3 complete. Continuing to task 4...
|
|
|
662
629
|
|
|
663
630
|
<anti_patterns>
|
|
664
631
|
|
|
665
|
-
### ❌ BAD: Asking
|
|
632
|
+
### ❌ BAD: Asking user to start dev server
|
|
666
633
|
|
|
667
634
|
```xml
|
|
668
|
-
<task type="checkpoint:human-
|
|
669
|
-
<
|
|
670
|
-
<
|
|
671
|
-
1.
|
|
672
|
-
2.
|
|
673
|
-
3.
|
|
674
|
-
|
|
675
|
-
</instructions>
|
|
676
|
-
<verification>Deployment exists</verification>
|
|
677
|
-
<resume-signal>Paste URL</resume-signal>
|
|
635
|
+
<task type="checkpoint:human-verify" gate="blocking">
|
|
636
|
+
<what-built>Dashboard component</what-built>
|
|
637
|
+
<how-to-verify>
|
|
638
|
+
1. Run: npm run dev
|
|
639
|
+
2. Visit: http://localhost:3000/dashboard
|
|
640
|
+
3. Check layout is correct
|
|
641
|
+
</how-to-verify>
|
|
678
642
|
</task>
|
|
679
643
|
```
|
|
680
644
|
|
|
681
|
-
**Why bad:**
|
|
645
|
+
**Why bad:** Codex can run `npm run dev`. User should only visit URLs, not execute commands.
|
|
646
|
+
|
|
647
|
+
### ✅ GOOD: Codex starts server, user visits
|
|
648
|
+
|
|
649
|
+
```xml
|
|
650
|
+
<task type="auto">
|
|
651
|
+
<name>Start dev server</name>
|
|
652
|
+
<action>Run `npm run dev` in background</action>
|
|
653
|
+
<verify>curl localhost:3000 returns 200</verify>
|
|
654
|
+
</task>
|
|
655
|
+
|
|
656
|
+
<task type="checkpoint:human-verify" gate="blocking">
|
|
657
|
+
<what-built>Dashboard at http://localhost:3000/dashboard (server running)</what-built>
|
|
658
|
+
<how-to-verify>
|
|
659
|
+
Visit http://localhost:3000/dashboard and verify:
|
|
660
|
+
1. Layout matches design
|
|
661
|
+
2. No console errors
|
|
662
|
+
</how-to-verify>
|
|
663
|
+
</task>
|
|
664
|
+
```
|
|
682
665
|
|
|
683
|
-
### ✅ GOOD:
|
|
666
|
+
### ❌ BAD: Asking human to deploy / ✅ GOOD: Codex automates
|
|
684
667
|
|
|
685
668
|
```xml
|
|
669
|
+
<!-- BAD: Asking user to deploy via dashboard -->
|
|
670
|
+
<task type="checkpoint:human-action" gate="blocking">
|
|
671
|
+
<action>Deploy to Vercel</action>
|
|
672
|
+
<instructions>Visit vercel.com/new → Import repo → Click Deploy → Copy URL</instructions>
|
|
673
|
+
</task>
|
|
674
|
+
|
|
675
|
+
<!-- GOOD: Codex deploys, user verifies -->
|
|
686
676
|
<task type="auto">
|
|
687
677
|
<name>Deploy to Vercel</name>
|
|
688
678
|
<action>Run `vercel --yes`. Capture URL.</action>
|
|
@@ -696,22 +686,18 @@ Task 3 complete. Continuing to task 4...
|
|
|
696
686
|
</task>
|
|
697
687
|
```
|
|
698
688
|
|
|
699
|
-
### ❌ BAD: Too many checkpoints
|
|
689
|
+
### ❌ BAD: Too many checkpoints / ✅ GOOD: Single checkpoint
|
|
700
690
|
|
|
701
691
|
```xml
|
|
692
|
+
<!-- BAD: Checkpoint after every task -->
|
|
702
693
|
<task type="auto">Create schema</task>
|
|
703
694
|
<task type="checkpoint:human-verify">Check schema</task>
|
|
704
695
|
<task type="auto">Create API route</task>
|
|
705
696
|
<task type="checkpoint:human-verify">Check API</task>
|
|
706
697
|
<task type="auto">Create UI form</task>
|
|
707
698
|
<task type="checkpoint:human-verify">Check form</task>
|
|
708
|
-
```
|
|
709
|
-
|
|
710
|
-
**Why bad:** Verification fatigue. Combine into one checkpoint at end.
|
|
711
|
-
|
|
712
|
-
### ✅ GOOD: Single verification checkpoint
|
|
713
699
|
|
|
714
|
-
|
|
700
|
+
<!-- GOOD: One checkpoint at end -->
|
|
715
701
|
<task type="auto">Create schema</task>
|
|
716
702
|
<task type="auto">Create API route</task>
|
|
717
703
|
<task type="auto">Create UI form</task>
|
|
@@ -723,66 +709,67 @@ Task 3 complete. Continuing to task 4...
|
|
|
723
709
|
</task>
|
|
724
710
|
```
|
|
725
711
|
|
|
726
|
-
### ❌ BAD:
|
|
712
|
+
### ❌ BAD: Vague verification / ✅ GOOD: Specific steps
|
|
727
713
|
|
|
728
714
|
```xml
|
|
729
|
-
|
|
730
|
-
|
|
731
|
-
<
|
|
732
|
-
|
|
733
|
-
2. Add: DATABASE_URL=...
|
|
734
|
-
3. Add: STRIPE_KEY=...
|
|
735
|
-
</instructions>
|
|
715
|
+
<!-- BAD -->
|
|
716
|
+
<task type="checkpoint:human-verify">
|
|
717
|
+
<what-built>Dashboard</what-built>
|
|
718
|
+
<how-to-verify>Check it works</how-to-verify>
|
|
736
719
|
</task>
|
|
737
|
-
```
|
|
738
720
|
|
|
739
|
-
|
|
721
|
+
<!-- GOOD -->
|
|
722
|
+
<task type="checkpoint:human-verify">
|
|
723
|
+
<what-built>Responsive dashboard - server running at http://localhost:3000</what-built>
|
|
724
|
+
<how-to-verify>
|
|
725
|
+
Visit http://localhost:3000/dashboard and verify:
|
|
726
|
+
1. Desktop (>1024px): Sidebar visible, content area fills remaining space
|
|
727
|
+
2. Tablet (768px): Sidebar collapses to icons
|
|
728
|
+
3. Mobile (375px): Sidebar hidden, hamburger menu in header
|
|
729
|
+
4. No horizontal scroll at any size
|
|
730
|
+
</how-to-verify>
|
|
731
|
+
<resume-signal>Type "approved" or describe layout issues</resume-signal>
|
|
732
|
+
</task>
|
|
733
|
+
```
|
|
740
734
|
|
|
741
|
-
### ❌ BAD:
|
|
735
|
+
### ❌ BAD: Asking user to run CLI commands
|
|
742
736
|
|
|
743
737
|
```xml
|
|
744
|
-
<task type="checkpoint:human-
|
|
745
|
-
<
|
|
746
|
-
<
|
|
747
|
-
<resume-signal>Continue</resume-signal>
|
|
738
|
+
<task type="checkpoint:human-action">
|
|
739
|
+
<action>Run database migrations</action>
|
|
740
|
+
<instructions>Run: npx prisma migrate deploy && npx prisma db seed</instructions>
|
|
748
741
|
</task>
|
|
749
742
|
```
|
|
750
743
|
|
|
751
|
-
**Why bad:**
|
|
744
|
+
**Why bad:** Codex can run these commands. User should never execute CLI commands.
|
|
752
745
|
|
|
753
|
-
###
|
|
746
|
+
### ❌ BAD: Asking user to copy values between services
|
|
754
747
|
|
|
755
748
|
```xml
|
|
756
|
-
<task type="checkpoint:human-
|
|
757
|
-
<
|
|
758
|
-
<
|
|
759
|
-
1. Run: npm run dev
|
|
760
|
-
2. Visit: http://localhost:3000/dashboard
|
|
761
|
-
3. Desktop (>1024px): Sidebar visible, content area fills remaining space
|
|
762
|
-
4. Tablet (768px): Sidebar collapses to icons
|
|
763
|
-
5. Mobile (375px): Sidebar hidden, hamburger menu in header
|
|
764
|
-
6. Check: No horizontal scroll at any size
|
|
765
|
-
</how-to-verify>
|
|
766
|
-
<resume-signal>Type "approved" or describe layout issues</resume-signal>
|
|
749
|
+
<task type="checkpoint:human-action">
|
|
750
|
+
<action>Configure webhook URL in Stripe</action>
|
|
751
|
+
<instructions>Copy deployment URL → Stripe Dashboard → Webhooks → Add endpoint → Copy secret → Add to .env</instructions>
|
|
767
752
|
</task>
|
|
768
753
|
```
|
|
769
754
|
|
|
755
|
+
**Why bad:** Stripe has an API. Codex should create the webhook via API and write to .env directly.
|
|
756
|
+
|
|
770
757
|
</anti_patterns>
|
|
771
758
|
|
|
772
759
|
<summary>
|
|
773
760
|
|
|
774
|
-
Checkpoints formalize human-in-the-loop points
|
|
761
|
+
Checkpoints formalize human-in-the-loop points for verification and decisions, not manual work.
|
|
775
762
|
|
|
776
|
-
**The golden rule:** If
|
|
763
|
+
**The golden rule:** If Codex CAN automate it, Codex MUST automate it.
|
|
777
764
|
|
|
778
765
|
**Checkpoint priority:**
|
|
779
|
-
1. **checkpoint:human-verify** (90%
|
|
780
|
-
2. **checkpoint:decision** (9%
|
|
781
|
-
3. **checkpoint:human-action** (1%
|
|
766
|
+
1. **checkpoint:human-verify** (90%) - Codex automated everything, human confirms visual/functional correctness
|
|
767
|
+
2. **checkpoint:decision** (9%) - Human makes architectural/technology choices
|
|
768
|
+
3. **checkpoint:human-action** (1%) - Truly unavoidable manual steps with no API/CLI
|
|
782
769
|
|
|
783
770
|
**When NOT to use checkpoints:**
|
|
784
|
-
- Things
|
|
785
|
-
- File operations (
|
|
786
|
-
- Code correctness (
|
|
771
|
+
- Things Codex can verify programmatically (tests, builds)
|
|
772
|
+
- File operations (Codex can read files)
|
|
773
|
+
- Code correctness (tests and static analysis)
|
|
787
774
|
- Anything automatable via CLI/API
|
|
788
775
|
</summary>
|