ariadna 1.3.0 → 2.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (149) hide show
  1. checksums.yaml +4 -4
  2. data/ariadna.gemspec +0 -1
  3. data/data/agents/ariadna-codebase-mapper.md +34 -722
  4. data/data/agents/ariadna-debugger.md +44 -1139
  5. data/data/agents/ariadna-executor.md +75 -396
  6. data/data/agents/ariadna-planner.md +78 -1215
  7. data/data/agents/ariadna-roadmapper.md +55 -582
  8. data/data/agents/ariadna-verifier.md +60 -702
  9. data/data/ariadna/templates/config.json +8 -33
  10. data/data/ariadna/workflows/debug.md +28 -0
  11. data/data/ariadna/workflows/execute-phase.md +31 -513
  12. data/data/ariadna/workflows/map-codebase.md +20 -319
  13. data/data/ariadna/workflows/new-milestone.md +20 -365
  14. data/data/ariadna/workflows/new-project.md +19 -880
  15. data/data/ariadna/workflows/plan-phase.md +24 -443
  16. data/data/ariadna/workflows/progress.md +20 -376
  17. data/data/ariadna/workflows/quick.md +19 -221
  18. data/data/ariadna/workflows/roadmap-ops.md +28 -0
  19. data/data/ariadna/workflows/verify-work.md +23 -560
  20. data/data/commands/ariadna/add-phase.md +11 -22
  21. data/data/commands/ariadna/debug.md +11 -143
  22. data/data/commands/ariadna/execute-phase.md +12 -30
  23. data/data/commands/ariadna/insert-phase.md +7 -14
  24. data/data/commands/ariadna/map-codebase.md +16 -49
  25. data/data/commands/ariadna/new-milestone.md +12 -25
  26. data/data/commands/ariadna/new-project.md +22 -26
  27. data/data/commands/ariadna/plan-phase.md +13 -22
  28. data/data/commands/ariadna/progress.md +16 -6
  29. data/data/commands/ariadna/quick.md +9 -11
  30. data/data/commands/ariadna/remove-phase.md +9 -12
  31. data/data/commands/ariadna/verify-work.md +14 -19
  32. data/data/skills/rails-backend/API.md +138 -0
  33. data/data/skills/rails-backend/CONTROLLERS.md +154 -0
  34. data/data/skills/rails-backend/JOBS.md +132 -0
  35. data/data/skills/rails-backend/MODELS.md +213 -0
  36. data/data/skills/rails-backend/SKILL.md +169 -0
  37. data/data/skills/rails-frontend/ASSETS.md +154 -0
  38. data/data/skills/rails-frontend/COMPONENTS.md +253 -0
  39. data/data/skills/rails-frontend/SKILL.md +187 -0
  40. data/data/skills/rails-frontend/VIEWS.md +168 -0
  41. data/data/skills/rails-performance/PROFILING.md +106 -0
  42. data/data/skills/rails-performance/SKILL.md +217 -0
  43. data/data/skills/rails-security/AUDIT.md +118 -0
  44. data/data/skills/rails-security/SKILL.md +422 -0
  45. data/data/skills/rails-testing/FIXTURES.md +78 -0
  46. data/data/skills/rails-testing/SKILL.md +160 -0
  47. data/data/skills/rails-testing/SYSTEM-TESTS.md +73 -0
  48. data/lib/ariadna/installer.rb +11 -15
  49. data/lib/ariadna/tools/cli.rb +0 -12
  50. data/lib/ariadna/tools/config_manager.rb +10 -72
  51. data/lib/ariadna/tools/frontmatter.rb +23 -1
  52. data/lib/ariadna/tools/init.rb +201 -401
  53. data/lib/ariadna/tools/model_profiles.rb +6 -14
  54. data/lib/ariadna/tools/phase_manager.rb +1 -10
  55. data/lib/ariadna/tools/state_manager.rb +170 -451
  56. data/lib/ariadna/tools/template_filler.rb +4 -12
  57. data/lib/ariadna/tools/verification.rb +21 -399
  58. data/lib/ariadna/uninstaller.rb +9 -0
  59. data/lib/ariadna/version.rb +1 -1
  60. data/lib/ariadna.rb +1 -0
  61. metadata +20 -91
  62. data/data/agents/ariadna-backend-executor.md +0 -261
  63. data/data/agents/ariadna-frontend-executor.md +0 -259
  64. data/data/agents/ariadna-integration-checker.md +0 -418
  65. data/data/agents/ariadna-phase-researcher.md +0 -469
  66. data/data/agents/ariadna-plan-checker.md +0 -622
  67. data/data/agents/ariadna-project-researcher.md +0 -618
  68. data/data/agents/ariadna-research-synthesizer.md +0 -236
  69. data/data/agents/ariadna-test-executor.md +0 -266
  70. data/data/ariadna/references/checkpoints.md +0 -772
  71. data/data/ariadna/references/continuation-format.md +0 -249
  72. data/data/ariadna/references/decimal-phase-calculation.md +0 -65
  73. data/data/ariadna/references/git-integration.md +0 -248
  74. data/data/ariadna/references/git-planning-commit.md +0 -38
  75. data/data/ariadna/references/model-profile-resolution.md +0 -32
  76. data/data/ariadna/references/model-profiles.md +0 -73
  77. data/data/ariadna/references/phase-argument-parsing.md +0 -61
  78. data/data/ariadna/references/planning-config.md +0 -194
  79. data/data/ariadna/references/questioning.md +0 -153
  80. data/data/ariadna/references/rails-conventions.md +0 -416
  81. data/data/ariadna/references/tdd.md +0 -267
  82. data/data/ariadna/references/ui-brand.md +0 -160
  83. data/data/ariadna/references/verification-patterns.md +0 -853
  84. data/data/ariadna/templates/codebase/architecture.md +0 -481
  85. data/data/ariadna/templates/codebase/concerns.md +0 -380
  86. data/data/ariadna/templates/codebase/conventions.md +0 -434
  87. data/data/ariadna/templates/codebase/integrations.md +0 -328
  88. data/data/ariadna/templates/codebase/stack.md +0 -189
  89. data/data/ariadna/templates/codebase/structure.md +0 -418
  90. data/data/ariadna/templates/codebase/testing.md +0 -606
  91. data/data/ariadna/templates/context.md +0 -283
  92. data/data/ariadna/templates/continue-here.md +0 -78
  93. data/data/ariadna/templates/debug-subagent-prompt.md +0 -91
  94. data/data/ariadna/templates/phase-prompt.md +0 -609
  95. data/data/ariadna/templates/planner-subagent-prompt.md +0 -117
  96. data/data/ariadna/templates/research-project/ARCHITECTURE.md +0 -439
  97. data/data/ariadna/templates/research-project/FEATURES.md +0 -168
  98. data/data/ariadna/templates/research-project/PITFALLS.md +0 -406
  99. data/data/ariadna/templates/research-project/STACK.md +0 -251
  100. data/data/ariadna/templates/research-project/SUMMARY.md +0 -247
  101. data/data/ariadna/templates/state.md +0 -176
  102. data/data/ariadna/templates/summary-complex.md +0 -59
  103. data/data/ariadna/templates/summary-minimal.md +0 -41
  104. data/data/ariadna/templates/summary-standard.md +0 -48
  105. data/data/ariadna/templates/user-setup.md +0 -310
  106. data/data/ariadna/workflows/add-phase.md +0 -111
  107. data/data/ariadna/workflows/add-todo.md +0 -157
  108. data/data/ariadna/workflows/audit-milestone.md +0 -241
  109. data/data/ariadna/workflows/check-todos.md +0 -176
  110. data/data/ariadna/workflows/complete-milestone.md +0 -644
  111. data/data/ariadna/workflows/diagnose-issues.md +0 -219
  112. data/data/ariadna/workflows/discovery-phase.md +0 -289
  113. data/data/ariadna/workflows/discuss-phase.md +0 -408
  114. data/data/ariadna/workflows/execute-plan.md +0 -448
  115. data/data/ariadna/workflows/help.md +0 -470
  116. data/data/ariadna/workflows/insert-phase.md +0 -129
  117. data/data/ariadna/workflows/list-phase-assumptions.md +0 -178
  118. data/data/ariadna/workflows/pause-work.md +0 -122
  119. data/data/ariadna/workflows/plan-milestone-gaps.md +0 -256
  120. data/data/ariadna/workflows/remove-phase.md +0 -154
  121. data/data/ariadna/workflows/research-phase.md +0 -74
  122. data/data/ariadna/workflows/resume-project.md +0 -306
  123. data/data/ariadna/workflows/set-profile.md +0 -80
  124. data/data/ariadna/workflows/settings.md +0 -145
  125. data/data/ariadna/workflows/transition.md +0 -493
  126. data/data/ariadna/workflows/update.md +0 -212
  127. data/data/ariadna/workflows/verify-phase.md +0 -226
  128. data/data/commands/ariadna/add-todo.md +0 -42
  129. data/data/commands/ariadna/audit-milestone.md +0 -42
  130. data/data/commands/ariadna/check-todos.md +0 -41
  131. data/data/commands/ariadna/complete-milestone.md +0 -136
  132. data/data/commands/ariadna/discuss-phase.md +0 -86
  133. data/data/commands/ariadna/help.md +0 -22
  134. data/data/commands/ariadna/list-phase-assumptions.md +0 -50
  135. data/data/commands/ariadna/pause-work.md +0 -35
  136. data/data/commands/ariadna/plan-milestone-gaps.md +0 -40
  137. data/data/commands/ariadna/reapply-patches.md +0 -110
  138. data/data/commands/ariadna/research-phase.md +0 -187
  139. data/data/commands/ariadna/resume-work.md +0 -40
  140. data/data/commands/ariadna/set-profile.md +0 -34
  141. data/data/commands/ariadna/settings.md +0 -36
  142. data/data/commands/ariadna/update.md +0 -37
  143. data/data/guides/backend.md +0 -3069
  144. data/data/guides/frontend.md +0 -1479
  145. data/data/guides/performance.md +0 -1193
  146. data/data/guides/security.md +0 -1522
  147. data/data/guides/style-guide.md +0 -1091
  148. data/data/guides/testing.md +0 -504
  149. data/data/templates.md +0 -94
@@ -1,772 +0,0 @@
1
- <overview>
2
- Plans execute autonomously. Checkpoints formalize interaction points where human verification or decisions are needed.
3
-
4
- **Core principle:** Claude automates everything with CLI/API. Checkpoints are for verification and decisions, not manual work.
5
-
6
- **Golden rules:**
7
- 1. **If Claude can run it, Claude runs it** - Never ask user to execute CLI commands, start servers, or run builds
8
- 2. **Claude sets up the verification environment** - Start dev servers, seed databases, configure env vars
9
- 3. **User only does what requires human judgment** - Visual checks, UX evaluation, "does this feel right?"
10
- 4. **Secrets come from user, automation comes from Claude** - Ask for API keys, then Claude uses them via CLI
11
- </overview>
12
-
13
- <checkpoint_types>
14
-
15
- <type name="human-verify">
16
- ## checkpoint:human-verify (Most Common - 90%)
17
-
18
- **When:** Claude completed automated work, human confirms it works correctly.
19
-
20
- **Use for:**
21
- - Visual UI checks (layout, styling, responsiveness)
22
- - Interactive flows (click through wizard, test user flows)
23
- - Functional verification (feature works as expected)
24
- - Audio/video playback quality
25
- - Animation smoothness
26
- - Accessibility testing
27
-
28
- **Structure:**
29
- ```xml
30
- <task type="checkpoint:human-verify" gate="blocking">
31
- <what-built>[What Claude automated and deployed/built]</what-built>
32
- <how-to-verify>
33
- [Exact steps to test - URLs, commands, expected behavior]
34
- </how-to-verify>
35
- <resume-signal>[How to continue - "approved", "yes", or describe issues]</resume-signal>
36
- </task>
37
- ```
38
-
39
- **Example: UI Component (shows key pattern: Claude starts server BEFORE checkpoint)**
40
- ```xml
41
- <task type="auto">
42
- <name>Build responsive dashboard layout</name>
43
- <files>app/views/dashboard/index.html.erb, app/controllers/dashboard_controller.rb</files>
44
- <action>Create dashboard with sidebar, header, and content area. Use responsive CSS with clamp() and media queries for mobile.</action>
45
- <verify>bin/rails test succeeds, no errors</verify>
46
- <done>Dashboard component builds without errors</done>
47
- </task>
48
-
49
- <task type="auto">
50
- <name>Start dev server for verification</name>
51
- <action>Run `bin/dev` in background, wait for "Listening on" message, capture port</action>
52
- <verify>curl http://localhost:3000 returns 200</verify>
53
- <done>Dev server running at http://localhost:3000</done>
54
- </task>
55
-
56
- <task type="checkpoint:human-verify" gate="blocking">
57
- <what-built>Responsive dashboard layout - dev server running at http://localhost:3000</what-built>
58
- <how-to-verify>
59
- Visit http://localhost:3000/dashboard and verify:
60
- 1. Desktop (>1024px): Sidebar left, content right, header top
61
- 2. Tablet (768px): Sidebar collapses to hamburger menu
62
- 3. Mobile (375px): Single column layout, bottom nav appears
63
- 4. No layout shift or horizontal scroll at any size
64
- </how-to-verify>
65
- <resume-signal>Type "approved" or describe layout issues</resume-signal>
66
- </task>
67
- ```
68
-
69
- **Example: Xcode Build**
70
- ```xml
71
- <task type="auto">
72
- <name>Build macOS app with Xcode</name>
73
- <files>App.xcodeproj, Sources/</files>
74
- <action>Run `xcodebuild -project App.xcodeproj -scheme App build`. Check for compilation errors in output.</action>
75
- <verify>Build output contains "BUILD SUCCEEDED", no errors</verify>
76
- <done>App builds successfully</done>
77
- </task>
78
-
79
- <task type="checkpoint:human-verify" gate="blocking">
80
- <what-built>Built macOS app at DerivedData/Build/Products/Debug/App.app</what-built>
81
- <how-to-verify>
82
- Open App.app and test:
83
- - App launches without crashes
84
- - Menu bar icon appears
85
- - Preferences window opens correctly
86
- - No visual glitches or layout issues
87
- </how-to-verify>
88
- <resume-signal>Type "approved" or describe issues</resume-signal>
89
- </task>
90
- ```
91
- </type>
92
-
93
- <type name="decision">
94
- ## checkpoint:decision (9%)
95
-
96
- **When:** Human must make choice that affects implementation direction.
97
-
98
- **Use for:**
99
- - Technology selection (which auth provider, which database)
100
- - Architecture decisions (monorepo vs separate repos)
101
- - Design choices (color scheme, layout approach)
102
- - Feature prioritization (which variant to build)
103
- - Data model decisions (schema structure)
104
-
105
- **Structure:**
106
- ```xml
107
- <task type="checkpoint:decision" gate="blocking">
108
- <decision>[What's being decided]</decision>
109
- <context>[Why this decision matters]</context>
110
- <options>
111
- <option id="option-a">
112
- <name>[Option name]</name>
113
- <pros>[Benefits]</pros>
114
- <cons>[Tradeoffs]</cons>
115
- </option>
116
- <option id="option-b">
117
- <name>[Option name]</name>
118
- <pros>[Benefits]</pros>
119
- <cons>[Tradeoffs]</cons>
120
- </option>
121
- </options>
122
- <resume-signal>[How to indicate choice]</resume-signal>
123
- </task>
124
- ```
125
-
126
- **Example: Auth Provider Selection**
127
- Note: Only present this checkpoint if the user explicitly asks to evaluate auth gems. For new projects, default to `has_secure_password` (Rails 8 auth generator) without asking.
128
- ```xml
129
- <task type="checkpoint:decision" gate="blocking">
130
- <decision>Select authentication approach</decision>
131
- <context>
132
- Need user authentication for the app. Rails built-in auth is the recommended default.
133
- Only consider external gems if the user explicitly requests them.
134
- </context>
135
- <options>
136
- <option id="has_secure_password">
137
- <name>has_secure_password (built-in) — Recommended</name>
138
- <pros>No dependencies, full control, simple and lightweight, easy to understand, Rails 8 auth generator scaffolds everything</pros>
139
- <cons>More manual setup for advanced features (OAuth, 2FA)</cons>
140
- </option>
141
- <option id="devise">
142
- <name>Devise (only if explicitly requested)</name>
143
- <pros>Full-featured (registration, password reset, OAuth), well-maintained</pros>
144
- <cons>Heavy dependency, opinionated, can be hard to customize deeply</cons>
145
- </option>
146
- <option id="rodauth">
147
- <name>Rodauth (only if explicitly requested)</name>
148
- <pros>Security-focused, modular features, database-backed configuration, excellent 2FA</pros>
149
- <cons>Smaller community, different conventions, steeper learning curve</cons>
150
- </option>
151
- </options>
152
- <resume-signal>Select: has_secure_password (default), devise, or rodauth</resume-signal>
153
- </task>
154
- ```
155
-
156
- **Example: Database Selection**
157
- ```xml
158
- <task type="checkpoint:decision" gate="blocking">
159
- <decision>Select database for user data</decision>
160
- <context>
161
- App needs persistent storage for users, sessions, and user-generated content.
162
- Expected scale: 10k users, 1M records first year.
163
- </context>
164
- <options>
165
- <option id="postgresql">
166
- <name>PostgreSQL</name>
167
- <pros>Full SQL, excellent Rails support, JSONB columns, advanced indexing, industry standard</pros>
168
- <cons>More setup than SQLite, requires running server</cons>
169
- </option>
170
- <option id="sqlite">
171
- <name>SQLite (with Litestack)</name>
172
- <pros>Zero config, built into Rails 8 default, excellent for single-server deploys, fast</pros>
173
- <cons>Single-writer limitation, not ideal for horizontal scaling</cons>
174
- </option>
175
- <option id="mysql">
176
- <name>MySQL</name>
177
- <pros>Widely deployed, good performance, familiar to many teams</pros>
178
- <cons>Fewer advanced features than Postgres, less common in Rails ecosystem</cons>
179
- </option>
180
- </options>
181
- <resume-signal>Select: postgresql, sqlite, or mysql</resume-signal>
182
- </task>
183
- ```
184
- </type>
185
-
186
- <type name="human-action">
187
- ## checkpoint:human-action (1% - Rare)
188
-
189
- **When:** Action has NO CLI/API and requires human-only interaction, OR Claude hit an authentication gate during automation.
190
-
191
- **Use ONLY for:**
192
- - **Authentication gates** - Claude tried CLI/API but needs credentials (this is NOT a failure)
193
- - Email verification links (clicking email)
194
- - SMS 2FA codes (phone verification)
195
- - Manual account approvals (platform requires human review)
196
- - Credit card 3D Secure flows (web-based payment authorization)
197
- - OAuth app approvals (web-based approval)
198
-
199
- **Do NOT use for pre-planned manual work:**
200
- - Deploying (use CLI - auth gate if needed)
201
- - Creating webhooks/databases (use API/CLI - auth gate if needed)
202
- - Running builds/tests (use Bash tool)
203
- - Creating files (use Write tool)
204
-
205
- **Structure:**
206
- ```xml
207
- <task type="checkpoint:human-action" gate="blocking">
208
- <action>[What human must do - Claude already did everything automatable]</action>
209
- <instructions>
210
- [What Claude already automated]
211
- [The ONE thing requiring human action]
212
- </instructions>
213
- <verification>[What Claude can check afterward]</verification>
214
- <resume-signal>[How to continue]</resume-signal>
215
- </task>
216
- ```
217
-
218
- **Example: Email Verification**
219
- ```xml
220
- <task type="auto">
221
- <name>Create SendGrid account via API</name>
222
- <action>Use SendGrid API to create subuser account with provided email. Request verification email.</action>
223
- <verify>API returns 201, account created</verify>
224
- <done>Account created, verification email sent</done>
225
- </task>
226
-
227
- <task type="checkpoint:human-action" gate="blocking">
228
- <action>Complete email verification for SendGrid account</action>
229
- <instructions>
230
- I created the account and requested verification email.
231
- Check your inbox for SendGrid verification link and click it.
232
- </instructions>
233
- <verification>SendGrid API key works: curl test succeeds</verification>
234
- <resume-signal>Type "done" when email verified</resume-signal>
235
- </task>
236
- ```
237
-
238
- **Example: Authentication Gate (Dynamic Checkpoint)**
239
- ```xml
240
- <task type="auto">
241
- <name>Deploy with Kamal</name>
242
- <files>config/deploy.yml, Dockerfile</files>
243
- <action>Run `kamal deploy` to deploy</action>
244
- <verify>kamal app details shows running, curl returns 200</verify>
245
- </task>
246
-
247
- <!-- If kamal returns "SSH connection refused", Claude creates checkpoint on the fly -->
248
-
249
- <task type="checkpoint:human-action" gate="blocking">
250
- <action>Configure SSH access so I can continue deployment</action>
251
- <instructions>
252
- I tried to deploy but got SSH connection error.
253
- Add your SSH key to the server: ssh-copy-id root@your-server-ip
254
- Or configure access in config/deploy.yml under `ssh`.
255
- </instructions>
256
- <verification>ssh root@your-server-ip exit connects successfully</verification>
257
- <resume-signal>Type "done" when SSH access configured</resume-signal>
258
- </task>
259
-
260
- <!-- After SSH access configured, Claude retries the deployment -->
261
-
262
- <task type="auto">
263
- <name>Retry Kamal deployment</name>
264
- <action>Run `kamal deploy` (now with SSH access)</action>
265
- <verify>kamal app details shows running, curl returns 200</verify>
266
- </task>
267
- ```
268
-
269
- **Key distinction:** Auth gates are created dynamically when Claude encounters auth errors. NOT pre-planned — Claude automates first, asks for credentials only when blocked.
270
- </type>
271
- </checkpoint_types>
272
-
273
- <execution_protocol>
274
-
275
- When Claude encounters `type="checkpoint:*"`:
276
-
277
- 1. **Stop immediately** - do not proceed to next task
278
- 2. **Display checkpoint clearly** using the format below
279
- 3. **Wait for user response** - do not hallucinate completion
280
- 4. **Verify if possible** - check files, run tests, whatever is specified
281
- 5. **Resume execution** - continue to next task only after confirmation
282
-
283
- **For checkpoint:human-verify:**
284
- ```
285
- ╔═══════════════════════════════════════════════════════╗
286
- ║ CHECKPOINT: Verification Required ║
287
- ╚═══════════════════════════════════════════════════════╝
288
-
289
- Progress: 5/8 tasks complete
290
- Task: Responsive dashboard layout
291
-
292
- Built: Responsive dashboard at /dashboard
293
-
294
- How to verify:
295
- 1. Visit: http://localhost:3000/dashboard
296
- 2. Desktop (>1024px): Sidebar visible, content fills remaining space
297
- 3. Tablet (768px): Sidebar collapses to icons
298
- 4. Mobile (375px): Sidebar hidden, hamburger menu appears
299
-
300
- ────────────────────────────────────────────────────────
301
- → YOUR ACTION: Type "approved" or describe issues
302
- ────────────────────────────────────────────────────────
303
- ```
304
-
305
- **For checkpoint:decision:**
306
- ```
307
- ╔═══════════════════════════════════════════════════════╗
308
- ║ CHECKPOINT: Decision Required ║
309
- ╚═══════════════════════════════════════════════════════╝
310
-
311
- Progress: 2/6 tasks complete
312
- Task: Select authentication provider
313
-
314
- Decision: Which auth approach should we use?
315
-
316
- Context: Need user authentication. Three options with different tradeoffs.
317
-
318
- Options:
319
- 1. has_secure_password - Built-in Rails (Recommended)
320
- Pros: No dependencies, full control, simple, Rails 8 auth generator scaffolds everything
321
- Cons: More manual setup for advanced features (OAuth, 2FA)
322
-
323
- 2. devise - Full-featured auth gem (only if explicitly requested)
324
- Pros: Registration, password reset, OAuth support, well-maintained
325
- Cons: Heavy dependency, opinionated, hard to customize deeply
326
-
327
- 3. rodauth - Security-focused, modular (only if explicitly requested)
328
- Pros: Excellent 2FA, database-backed config, modular features
329
- Cons: Smaller community, different conventions, steeper learning curve
330
-
331
- ────────────────────────────────────────────────────────
332
- → YOUR ACTION: Select has_secure_password (default), devise, or rodauth
333
- ────────────────────────────────────────────────────────
334
- ```
335
-
336
- **For checkpoint:human-action:**
337
- ```
338
- ╔═══════════════════════════════════════════════════════╗
339
- ║ CHECKPOINT: Action Required ║
340
- ╚═══════════════════════════════════════════════════════╝
341
-
342
- Progress: 3/8 tasks complete
343
- Task: Deploy with Kamal
344
-
345
- Attempted: kamal deploy
346
- Error: SSH connection refused to root@your-server-ip
347
-
348
- What you need to do:
349
- 1. Run: ssh-copy-id root@your-server-ip
350
- 2. Or configure SSH access in config/deploy.yml
351
- 3. Return here when done
352
-
353
- I'll verify: ssh root@your-server-ip exit connects successfully
354
-
355
- ────────────────────────────────────────────────────────
356
- → YOUR ACTION: Type "done" when authenticated
357
- ────────────────────────────────────────────────────────
358
- ```
359
- </execution_protocol>
360
-
361
- <authentication_gates>
362
-
363
- **Auth gate = Claude tried CLI/API, got auth error.** Not a failure — a gate requiring human input to unblock.
364
-
365
- **Pattern:** Claude tries automation → auth error → creates checkpoint:human-action → user authenticates → Claude retries → continues
366
-
367
- **Gate protocol:**
368
- 1. Recognize it's not a failure - missing auth is expected
369
- 2. Stop current task - don't retry repeatedly
370
- 3. Create checkpoint:human-action dynamically
371
- 4. Provide exact authentication steps
372
- 5. Verify authentication works
373
- 6. Retry the original task
374
- 7. Continue normally
375
-
376
- **Key distinction:**
377
- - Pre-planned checkpoint: "I need you to do X" (wrong - Claude should automate)
378
- - Auth gate: "I tried to automate X but need credentials" (correct - unblocks automation)
379
-
380
- </authentication_gates>
381
-
382
- <automation_reference>
383
-
384
- **The rule:** If it has CLI/API, Claude does it. Never ask human to perform automatable work.
385
-
386
- ## Service CLI Reference
387
-
388
- | Service | CLI/API | Key Commands | Auth Gate |
389
- |---------|---------|--------------|-----------|
390
- | Heroku | `heroku` | `create`, `config:set`, `ps`, `logs` | `heroku login` |
391
- | Railway | `railway` | `init`, `up`, `variables set` | `railway login` |
392
- | Fly | `fly` | `launch`, `deploy`, `secrets set` | `fly auth login` |
393
- | Stripe | `stripe` + API | `listen`, `trigger`, API calls | API key in .env |
394
- | Supabase | `supabase` | `init`, `link`, `db push`, `gen types` | `supabase login` |
395
- | PlanetScale | `pscale` | `database create`, `branch create` | `pscale auth login` |
396
- | GitHub | `gh` | `repo create`, `pr create`, `secret set` | `gh auth login` |
397
- | Ruby/Rails | `bundle`/`rails` | `install`, `assets:precompile`, `test`, `server` | N/A |
398
- | Xcode | `xcodebuild` | `-project`, `-scheme`, `build`, `test` | N/A |
399
- | Kamal | `kamal` | `setup`, `deploy`, `env push` | N/A |
400
-
401
- ## Environment Variable Automation
402
-
403
- **Env files:** Use Write/Edit tools. Never ask human to create .env manually.
404
-
405
- **Dashboard env vars via CLI:**
406
-
407
- | Platform | CLI Command | Example |
408
- |----------|-------------|---------|
409
- | Rails credentials | `bin/rails credentials:edit` | `bin/rails credentials:edit --environment production` |
410
- | Kamal | `kamal env push` | `kamal env push` (reads from `.env` files) |
411
- | Fly | `fly secrets set` | `fly secrets set DATABASE_URL=...` |
412
- | Railway | `railway variables set` | `railway variables set API_KEY=value` |
413
- | Heroku | `heroku config:set` | `heroku config:set STRIPE_KEY=value` |
414
-
415
- **Secret collection pattern:**
416
- ```xml
417
- <!-- WRONG: Asking user to add env vars in dashboard -->
418
- <task type="checkpoint:human-action">
419
- <action>Add STRIPE_SECRET_KEY to Heroku dashboard</action>
420
- <instructions>Go to heroku.com → Settings → Config Vars → Add</instructions>
421
- </task>
422
-
423
- <!-- RIGHT: Claude asks for value, then adds via CLI -->
424
- <task type="checkpoint:human-action">
425
- <action>Provide your Stripe secret key</action>
426
- <instructions>
427
- I need your Stripe secret key for payment processing.
428
- Get it from: https://dashboard.stripe.com/apikeys
429
- Paste the key (starts with sk_)
430
- </instructions>
431
- <verification>I'll add it via `heroku config:set` and verify</verification>
432
- <resume-signal>Paste your API key</resume-signal>
433
- </task>
434
-
435
- <task type="auto">
436
- <name>Configure Stripe key on Heroku</name>
437
- <action>Run `heroku config:set STRIPE_SECRET_KEY={user-provided-key}`</action>
438
- <verify>`heroku config:get STRIPE_SECRET_KEY` returns the key (masked)</verify>
439
- </task>
440
- ```
441
-
442
- ## Dev Server Automation
443
-
444
- | Framework | Start Command | Ready Signal | Default URL |
445
- |-----------|---------------|--------------|-------------|
446
- | Rails | `bin/rails server` | "Listening on" | http://localhost:3000 |
447
- | Rails (with Procfile) | `bin/dev` | "Listening on" | http://localhost:3000 |
448
- | Django | `python manage.py runserver` | "Starting development server" | http://localhost:8000 |
449
-
450
- **Server lifecycle:**
451
- ```bash
452
- # Run in background, capture PID
453
- bin/dev &
454
- DEV_SERVER_PID=$!
455
-
456
- # Wait for ready (max 30s)
457
- timeout 30 bash -c 'until curl -s localhost:3000 > /dev/null 2>&1; do sleep 1; done'
458
- ```
459
-
460
- **Port conflicts:** Kill stale process (`lsof -ti:3000 | xargs kill`) or use alternate port (`--port 3001`).
461
-
462
- **Server stays running** through checkpoints. Only kill when plan complete, switching to production, or port needed for different service.
463
-
464
- ## CLI Installation Handling
465
-
466
- | CLI | Auto-install? | Command |
467
- |-----|---------------|---------|
468
- | bundle | No - ask user | User installs Ruby/Bundler |
469
- | gh (GitHub) | Yes | `brew install gh` (macOS) or `apt install gh` (Linux) |
470
- | heroku | Yes | `brew tap heroku/brew && brew install heroku` |
471
- | fly | Yes | `brew install flyctl` or curl installer |
472
- | stripe | Yes | `brew install stripe/stripe-cli/stripe` |
473
- | redis | Yes | `brew install redis` (macOS) or `apt install redis-server` (Linux) |
474
- | postgresql | Yes | `brew install postgresql` (macOS) or `apt install postgresql` (Linux) |
475
-
476
- **Protocol:** Try command → "command not found" → auto-installable? → yes: install silently, retry → no: checkpoint asking user to install.
477
-
478
- ## Pre-Checkpoint Automation Failures
479
-
480
- | Failure | Response |
481
- |---------|----------|
482
- | Server won't start | Check error, fix issue, retry (don't proceed to checkpoint) |
483
- | Port in use | Kill stale process or use alternate port |
484
- | Missing dependency | Run `bundle install`, retry |
485
- | Build error | Fix the error first (bug, not checkpoint issue) |
486
- | Auth error | Create auth gate checkpoint |
487
- | Network timeout | Retry with backoff, then checkpoint if persistent |
488
-
489
- **Never present a checkpoint with broken verification environment.** If `curl localhost:3000` fails, don't ask user to "visit localhost:3000".
490
-
491
- ```xml
492
- <!-- WRONG: Checkpoint with broken environment -->
493
- <task type="checkpoint:human-verify">
494
- <what-built>Dashboard (server failed to start)</what-built>
495
- <how-to-verify>Visit http://localhost:3000...</how-to-verify>
496
- </task>
497
-
498
- <!-- RIGHT: Fix first, then checkpoint -->
499
- <task type="auto">
500
- <name>Fix server startup issue</name>
501
- <action>Investigate error, fix root cause, restart server</action>
502
- <verify>curl http://localhost:3000 returns 200</verify>
503
- </task>
504
-
505
- <task type="checkpoint:human-verify">
506
- <what-built>Dashboard - server running at http://localhost:3000</what-built>
507
- <how-to-verify>Visit http://localhost:3000/dashboard...</how-to-verify>
508
- </task>
509
- ```
510
-
511
- ## Automatable Quick Reference
512
-
513
- | Action | Automatable? | Claude does it? |
514
- |--------|--------------|-----------------|
515
- | Deploy to Fly.io | Yes (`fly deploy`) | YES |
516
- | Create Stripe webhook | Yes (API) | YES |
517
- | Write .env file | Yes (Write tool) | YES |
518
- | Run tests | Yes (`bundle exec rake test`) | YES |
519
- | Start dev server | Yes (`bin/dev`) | YES |
520
- | Add env vars to Heroku | Yes (`heroku config:set`) | YES |
521
- | Seed database | Yes (CLI/API) | YES |
522
- | Click email verification link | No | NO |
523
- | Enter credit card with 3DS | No | NO |
524
- | Complete OAuth in browser | No | NO |
525
- | Visually verify UI looks correct | No | NO |
526
- | Test interactive user flows | No | NO |
527
-
528
- </automation_reference>
529
-
530
- <writing_guidelines>
531
-
532
- **DO:**
533
- - Automate everything with CLI/API before checkpoint
534
- - Be specific: "Visit https://myapp.fly.dev" not "check deployment"
535
- - Number verification steps
536
- - State expected outcomes: "You should see X"
537
- - Provide context: why this checkpoint exists
538
-
539
- **DON'T:**
540
- - Ask human to do work Claude can automate ❌
541
- - Assume knowledge: "Configure the usual settings" ❌
542
- - Skip steps: "Set up database" (too vague) ❌
543
- - Mix multiple verifications in one checkpoint ❌
544
-
545
- **Placement:**
546
- - **After automation completes** - not before Claude does the work
547
- - **After UI buildout** - before declaring phase complete
548
- - **Before dependent work** - decisions before implementation
549
- - **At integration points** - after configuring external services
550
-
551
- **Bad placement:** Before automation ❌ | Too frequent ❌ | Too late (dependent tasks already needed the result) ❌
552
- </writing_guidelines>
553
-
554
- <examples>
555
-
556
- ### Example 1: Background Job Setup (No Checkpoint Needed)
557
-
558
- ```xml
559
- <task type="auto">
560
- <name>Configure Solid Queue for background jobs</name>
561
- <files>config/queue.yml, config/recurring.yml, Gemfile</files>
562
- <action>
563
- 1. Add `solid_queue` to Gemfile and run `bundle install`
564
- 2. Run `bin/rails solid_queue:install` to generate config
565
- 3. Configure queues in `config/queue.yml`
566
- 4. Set `config.active_job.queue_adapter = :solid_queue` in production.rb
567
- 5. Run `bin/rails db:migrate` for Solid Queue tables
568
- </action>
569
- <verify>
570
- - bin/rails runner "SolidQueue::Job.count" returns 0
571
- - config/queue.yml exists with valid configuration
572
- - Database tables created successfully
573
- </verify>
574
- <done>Solid Queue configured and ready for background jobs</done>
575
- </task>
576
-
577
- <!-- NO CHECKPOINT NEEDED - Claude automated everything and verified programmatically -->
578
- ```
579
-
580
- ### Example 2: Full Auth Flow (Single checkpoint at end)
581
-
582
- ```xml
583
- <task type="auto">
584
- <name>Create user model and migration</name>
585
- <files>app/models/user.rb, db/migrate/xxx_create_users.rb</files>
586
- <action>Generate User model with has_secure_password (Rails 8 auth generator), run migration</action>
587
- <verify>bin/rails db:migrate succeeds, User.count returns 0</verify>
588
- </task>
589
-
590
- <task type="auto">
591
- <name>Create sessions controller and routes</name>
592
- <files>app/controllers/sessions_controller.rb, config/routes.rb</files>
593
- <action>Set up login/logout actions with session management</action>
594
- <verify>bin/rails routes | grep session shows expected routes</verify>
595
- </task>
596
-
597
- <task type="auto">
598
- <name>Create login view</name>
599
- <files>app/views/sessions/new.html.erb</files>
600
- <action>Create login page with email/password form</action>
601
- <verify>bin/rails test succeeds</verify>
602
- </task>
603
-
604
- <task type="auto">
605
- <name>Start dev server for auth testing</name>
606
- <action>Run `bin/dev` in background, wait for ready signal</action>
607
- <verify>curl http://localhost:3000 returns 200</verify>
608
- <done>Dev server running at http://localhost:3000</done>
609
- </task>
610
-
611
- <!-- ONE checkpoint at end verifies the complete flow -->
612
- <task type="checkpoint:human-verify" gate="blocking">
613
- <what-built>Complete authentication flow - dev server running at http://localhost:3000</what-built>
614
- <how-to-verify>
615
- 1. Visit: http://localhost:3000/login
616
- 2. Enter email and password
617
- 3. Click "Sign in"
618
- 4. Verify: Redirected to /dashboard, user name displayed
619
- 5. Refresh page: Session persists
620
- 6. Click logout: Session cleared
621
- </how-to-verify>
622
- <resume-signal>Type "approved" or describe issues</resume-signal>
623
- </task>
624
- ```
625
- </examples>
626
-
627
- <anti_patterns>
628
-
629
- ### ❌ BAD: Asking user to start dev server
630
-
631
- ```xml
632
- <task type="checkpoint:human-verify" gate="blocking">
633
- <what-built>Dashboard component</what-built>
634
- <how-to-verify>
635
- 1. Run: bin/dev
636
- 2. Visit: http://localhost:3000/dashboard
637
- 3. Check layout is correct
638
- </how-to-verify>
639
- </task>
640
- ```
641
-
642
- **Why bad:** Claude can run `bin/dev`. User should only visit URLs, not execute commands.
643
-
644
- ### ✅ GOOD: Claude starts server, user visits
645
-
646
- ```xml
647
- <task type="auto">
648
- <name>Start dev server</name>
649
- <action>Run `bin/dev` in background</action>
650
- <verify>curl localhost:3000 returns 200</verify>
651
- </task>
652
-
653
- <task type="checkpoint:human-verify" gate="blocking">
654
- <what-built>Dashboard at http://localhost:3000/dashboard (server running)</what-built>
655
- <how-to-verify>
656
- Visit http://localhost:3000/dashboard and verify:
657
- 1. Layout matches design
658
- 2. No console errors
659
- </how-to-verify>
660
- </task>
661
- ```
662
-
663
- ### ❌ BAD: Asking human to deploy / ✅ GOOD: Claude automates
664
-
665
- ```xml
666
- <!-- BAD: Asking user to deploy via dashboard -->
667
- <task type="checkpoint:human-action" gate="blocking">
668
- <action>Deploy to production</action>
669
- <instructions>Visit hosting dashboard → Create app → Deploy → Copy URL</instructions>
670
- </task>
671
-
672
- <!-- GOOD: Claude deploys, user verifies -->
673
- <task type="auto">
674
- <name>Deploy with Kamal</name>
675
- <action>Run `kamal deploy`. Capture URL.</action>
676
- <verify>kamal app details shows running, curl returns 200</verify>
677
- </task>
678
-
679
- <task type="checkpoint:human-verify">
680
- <what-built>Deployed to {url}</what-built>
681
- <how-to-verify>Visit {url}, check homepage loads</how-to-verify>
682
- <resume-signal>Type "approved"</resume-signal>
683
- </task>
684
- ```
685
-
686
- ### ❌ BAD: Too many checkpoints / ✅ GOOD: Single checkpoint
687
-
688
- ```xml
689
- <!-- BAD: Checkpoint after every task -->
690
- <task type="auto">Create model and migration</task>
691
- <task type="checkpoint:human-verify">Check schema</task>
692
- <task type="auto">Create controller</task>
693
- <task type="checkpoint:human-verify">Check controller</task>
694
- <task type="auto">Create views</task>
695
- <task type="checkpoint:human-verify">Check views</task>
696
-
697
- <!-- GOOD: One checkpoint at end -->
698
- <task type="auto">Create model and migration</task>
699
- <task type="auto">Create controller</task>
700
- <task type="auto">Create views</task>
701
-
702
- <task type="checkpoint:human-verify">
703
- <what-built>Complete auth flow (model + controller + views)</what-built>
704
- <how-to-verify>Test full flow: register, login, access protected page</how-to-verify>
705
- <resume-signal>Type "approved"</resume-signal>
706
- </task>
707
- ```
708
-
709
- ### ❌ BAD: Vague verification / ✅ GOOD: Specific steps
710
-
711
- ```xml
712
- <!-- BAD -->
713
- <task type="checkpoint:human-verify">
714
- <what-built>Dashboard</what-built>
715
- <how-to-verify>Check it works</how-to-verify>
716
- </task>
717
-
718
- <!-- GOOD -->
719
- <task type="checkpoint:human-verify">
720
- <what-built>Responsive dashboard - server running at http://localhost:3000</what-built>
721
- <how-to-verify>
722
- Visit http://localhost:3000/dashboard and verify:
723
- 1. Desktop (>1024px): Sidebar visible, content area fills remaining space
724
- 2. Tablet (768px): Sidebar collapses to icons
725
- 3. Mobile (375px): Sidebar hidden, hamburger menu in header
726
- 4. No horizontal scroll at any size
727
- </how-to-verify>
728
- <resume-signal>Type "approved" or describe layout issues</resume-signal>
729
- </task>
730
- ```
731
-
732
- ### ❌ BAD: Asking user to run CLI commands
733
-
734
- ```xml
735
- <task type="checkpoint:human-action">
736
- <action>Run database migrations</action>
737
- <instructions>Run: bin/rails db:migrate && bin/rails db:seed</instructions>
738
- </task>
739
- ```
740
-
741
- **Why bad:** Claude can run these commands. User should never execute CLI commands.
742
-
743
- ### ❌ BAD: Asking user to copy values between services
744
-
745
- ```xml
746
- <task type="checkpoint:human-action">
747
- <action>Configure webhook URL in Stripe</action>
748
- <instructions>Copy deployment URL → Stripe Dashboard → Webhooks → Add endpoint → Copy secret → Add to .env</instructions>
749
- </task>
750
- ```
751
-
752
- **Why bad:** Stripe has an API. Claude should create the webhook via API and write to .env directly.
753
-
754
- </anti_patterns>
755
-
756
- <summary>
757
-
758
- Checkpoints formalize human-in-the-loop points for verification and decisions, not manual work.
759
-
760
- **The golden rule:** If Claude CAN automate it, Claude MUST automate it.
761
-
762
- **Checkpoint priority:**
763
- 1. **checkpoint:human-verify** (90%) - Claude automated everything, human confirms visual/functional correctness
764
- 2. **checkpoint:decision** (9%) - Human makes architectural/technology choices
765
- 3. **checkpoint:human-action** (1%) - Truly unavoidable manual steps with no API/CLI
766
-
767
- **When NOT to use checkpoints:**
768
- - Things Claude can verify programmatically (tests, builds)
769
- - File operations (Claude can read files)
770
- - Code correctness (tests and static analysis)
771
- - Anything automatable via CLI/API
772
- </summary>