@ritualai/cli 0.3.3 → 0.5.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -51,60 +51,161 @@ When **not** to use:
51
51
  - An exploration already exists with recommendations — fetch directly via `get_exploration` + `get_recommendations`
52
52
  - The user wants to *implement* a feature from existing recommendations — use `/ritual-builder-spec` (from `@ritual-ai/cli`)
53
53
 
54
- ### Workflow — 7 phases
54
+ ### Workflow — 9 phases (Phase 8 happens after the engineer ships the work)
55
55
 
56
56
  Each phase has explicit **[USER PAUSE]** points — never skip them.
57
57
 
58
58
  #### Phase 1 — Pick a workspace
59
59
 
60
- Call `mcp__ritual__list_workspaces`. Present as a numbered list (id, name).
60
+ Resolution order:
61
61
 
62
- If the user only has one project workspace, confirm it directly: "I'll create this in *Acme Engineering*. Sound right?" — but still wait for **[USER PAUSE]**.
62
+ 1. **Project-bound workspace (preferred).** Check for a `.ritual/config.json` at the project root (you can use the Read tool — the file is a small JSON with `workspaceId` + `workspaceName`). If it exists, that's the workspace this repo is bound to. Surface it to the user: "Using workspace **`{workspaceName}`** (bound to this repo). Sound right?" — wait for **[USER PAUSE]**.
63
+ 2. **List existing project workspaces.** If no `.ritual/config.json`, call `mcp__ritual__list_workspaces` — this returns project-type workspaces (the General workspace is excluded by default; agents never use it). Present as a numbered list (id, name). **[USER PAUSE]** for selection.
64
+ 3. **Create a new one if none exist or user wants a fresh one.** Call `mcp__ritual__create_workspace` with a name — convention is to name it after the repo (basename of cwd, or origin remote). Confirm the name with the user first. **[USER PAUSE]**
63
65
 
64
- Store `workspace_id`.
66
+ Store `workspace_id` for the rest of the flow.
67
+
68
+ If you created a new workspace, suggest the user run `ritual init` to persist the binding into `.ritual/config.json` for next time (or write it yourself if the user is in a Claude Code session that can edit files).
69
+
70
+ #### Phase 1.5 — Code reconnaissance
71
+
72
+ **Skip only if the user explicitly asks ("just generate, don't read the code") OR if you're operating outside a codebase context.**
73
+
74
+ Before generating considerations, do a structured scan of the codebase so the sub-problems land specific to *this* code, not generic. The legacy system shipped considerations that read like a textbook table of contents because it skipped this step; the difference between "Conversion Timing, Trust Signals, Recovery Paths" (generic) and "Leverage `apps/checkout/session.py` anonymous-checkout path vs. adding a new step in `CheckoutSessionMixin`" (codebase-specific) is whether you did Phase 1.5.
75
+
76
+ Steps:
77
+
78
+ 1. **Read the README + top-level project structure.** Use `ls` / Glob to see top-level files (5–10 lines). Identify the language, framework, key directories.
79
+
80
+ 2. **Glob for relevance.** Derive patterns from the user's problem. Examples:
81
+ - User says "auth flow" → `**/auth/**`, `**/login*`, `**/user*`, `**/session*`
82
+ - User says "checkout" → `**/checkout/**`, `**/cart/**`, `**/order/**`, `**/payment*`
83
+ - User says "notifications" → `**/notif*`, `**/email/**`, `**/sms/**`, `**/push/**`
84
+ Cap at ~15 hits per pattern.
85
+
86
+ 3. **Skim 3–5 most-relevant files.** For each, read the first ~100 lines + scan for class/function names. Triangulate: "is the behavior we care about actually here, or does it call into somewhere else?"
87
+
88
+ 4. **Build a recon summary** — 5–10 lines, concrete file paths, key abstractions, observed constraints. Examples of GOOD summaries:
89
+
90
+ > Codebase recon (django-oscar, Django 5.0):
91
+ > - Booking flow lives in `apps/checkout/views.py` (`CheckoutSessionMixin` orchestrates the step chain)
92
+ > - Anonymous checkout already supported via `apps/checkout/session.py:CheckoutSessionData`
93
+ > - Account creation entry point: `apps/customer/forms.py:RegisterUserForm`
94
+ > - Strategy classes (`apps/partner/strategy.py`) are the conventional pluggability pattern
95
+ > - User-model split: `auth.User` is the Django auth user; `apps/customer/models.py:Customer` is the order-attached profile — they can be linked or anonymous
96
+
97
+ 5. **Surface to the user as a [USER PAUSE]**:
98
+
99
+ > Reading the codebase I see:
100
+ > <recon summary>
101
+ >
102
+ > Anything I'm missing about design intent or constraints I won't find by reading the code (e.g. business decisions, prod incidents, in-flight migrations)?
103
+
104
+ Wait for confirmation or additional context. The user's reply (if any) becomes part of the input to Phase 2.
105
+
106
+ 6. **Compose the augmented `raw_input` for Phase 2.** Concatenate:
107
+ - The user's original problem (verbatim, top)
108
+ - A `--- Codebase context ---` section with the recon summary
109
+ - The user's reply from step 5 if non-empty
110
+ - Phase 1.5 IS the difference between "generic considerations" and "considerations that name actual files and reference existing patterns". Don't skip the recon step. Don't compress the summary to one line.
111
+
112
+ 7. **Collect the `sources` array.** The file paths you read in step 3 — exactly as they appear in the repo (e.g. `"apps/checkout/views.py"`, NOT `"./apps/checkout/views.py"` or absolute paths). This list is passed alongside `raw_input` to `generate_considerations` and `generate_problem_statement` so the API can auto-inject **prior knowledge-graph context** for overlapping files. Keep the list focused — only files you actually read and consider load-bearing for this problem. 5–10 is the sweet spot; >20 dilutes the KG signal.
113
+
114
+ **[USER PAUSE]** confirmation of recon is required before proceeding.
65
115
 
66
116
  #### Phase 2 — Generate sub-problems
67
117
 
118
+ ##### 2.1 First draft
119
+
68
120
  Call `mcp__ritual__generate_considerations` with:
69
121
  - `workspace_id`
70
- - `raw_input` (the problem from the slash-command argument or chat)
122
+ - `raw_input` (the user's problem + the Phase 1.5 codebase recon, concatenated as described above)
71
123
  - `template_id` (omit unless the user specified one)
124
+ - `sources` (the file path list from Phase 1.5 step 7 — file-path strings only, e.g. `["apps/checkout/views.py", ...]`)
72
125
 
73
- LLM call, ~5-10s. Returns 5-6 sub-problems — different framing axes the agent should investigate.
126
+ LLM call, ~510s. Returns 56 sub-problems — different framing axes the agent should investigate. Track each one as `{ text, version: 1 }` in your working memory.
127
+
128
+ **If the response includes `kg_context_used` with `implementationCount > 0`:** surface this to the user BEFORE presenting the considerations. It's the visible signal that prior team decisions shaped this draft.
129
+
130
+ > Reading the codebase I overlapped with 3 prior Ritual explorations on these files:
131
+ > - **"Anonymous checkout opt-in"** (shipped 2026-04-12) — 2 decisions, 1 deferral
132
+ > - **"Payment-method routing"** (shipped 2026-03-22) — 4 decisions
133
+ > - **"Session-data persistence"** (shipped 2026-02-08) — 1 decision
134
+ >
135
+ > I factored those into the sub-problems below.
136
+
137
+ If `implementationCount === 0`: don't mention the KG check (silent — would just be noise on a cold KG).
74
138
 
75
139
  **[USER PAUSE]** Present as a numbered list and ask which to include:
76
140
 
77
- > Sub-problems the system identified:
141
+ > Sub-problems the system identified (v1):
78
142
  >
79
143
  > 1. {sub-problem 1}
80
144
  > 2. {sub-problem 2}
81
145
  > ...
82
146
  >
83
- > Which should we factor into the scope? Pick any subset, or "all".
147
+ > Which should we factor into the scope? Pick any subset, "all", or ask for a different framing (e.g. "make them more technical", "drop the measurement angle", "focus on enterprise").
148
+
149
+ ##### 2.2 Iteration loop
84
150
 
85
- Store the picked sub-problems (text strings — they'll go into `considerations[]` next call).
151
+ If the user asks for a refinement (anything that isn't "all" / specific picks / "these are good"):
152
+
153
+ Call `mcp__ritual__refine_considerations` with:
154
+ - `workspace_id`, `raw_input`, `template_id`, `sources` — unchanged from the generate call. Critical: pass the SAME `sources` array each iteration so the KG-injected priorContext stays consistent.
155
+ - `change_prompt`: the user's request verbatim
156
+ - `selected`: items from prior versions the user kept (track `{ text, from_version }`, send just `text`)
157
+ - `dismissed`: items the user explicitly rejected
158
+ - `session_id`: omitted on the first refinement; pass the `session_id` from the previous refine response on subsequent ones to chain context
159
+
160
+ Track the new items as `{ text, version: N+1 }`. Present **alongside** the prior versions, not replacing them — the user can mix selections across versions.
161
+
162
+ Loop until the user says "these are good" or picks a subset.
163
+
164
+ **Critical**: never re-call `generate_considerations` for a refinement. That endpoint is stateless and re-rolls a fresh seed; you'll lose what the user just told you. The whole point of `refine_*` is the LLM sees the iteration context.
165
+
166
+ Store the final picked sub-problems for Phase 3 — they go into `considerations[]`.
86
167
 
87
168
  #### Phase 3 — Generate scope
88
169
 
170
+ ##### 3.1 First draft
171
+
89
172
  Call `mcp__ritual__generate_problem_statement` with:
90
173
  - `workspace_id`
91
- - `raw_input`
92
- - `considerations` (the picks from Phase 2 — these are the sub-problems we just discussed)
174
+ - `raw_input` (same augmented version from Phase 2)
175
+ - `considerations` (the picks from Phase 2)
93
176
  - `template_id` (same as Phase 2 if used)
177
+ - `sources` (the same file-path list passed to generate_considerations — keeps the KG anchor consistent)
94
178
 
95
- Returns a polished "How might we ..." style scope (typically <800 chars) plus optional follow-up questions and quality scores.
179
+ Returns a polished "How might we ..." style scope (typically <800 chars) plus optional follow-up questions and quality scores. Treat this as **v1** of the problem statement.
180
+
181
+ If the response includes `kg_context_used` with `implementationCount > 0`, prepend a note to the scope presentation:
182
+
183
+ > *(Grounded in {N} prior implementation{s}: {top match name}, …)*
96
184
 
97
185
  **[USER PAUSE]** Present and ask:
98
186
 
99
- > Here's the scope:
187
+ > Here's the scope (v1):
100
188
  >
101
189
  > > **{generated scope}**
102
190
  >
103
- > Looks good? Want to tighten / broaden / change the audience? Or "ship it" to lock it in.
191
+ > Looks good? Want to tighten / broaden / change the audience / focus on a specific tier? Or "ship it" to lock it in.
192
+
193
+ ##### 3.2 Iteration loop
194
+
195
+ If the user asks for a refinement:
196
+
197
+ Call `mcp__ritual__refine_problem_statement` with:
198
+ - `workspace_id`, `raw_input`, `considerations`, `template_id`, `sources` — unchanged. (Same `sources` as the original generate call — keeps the KG anchor stable.)
199
+ - `previous_problem_statement`: the FULL TEXT of the current best draft (the v1 you just showed)
200
+ - `change_prompt`: the user's request verbatim ("tighten and drop the audience clause")
201
+ - `version`: optional label like `"v2"` — purely for telemetry
202
+ - `session_id`: omitted on the first refinement; chain on subsequent ones
104
203
 
105
- If refinement requested: regenerate with the refinement appended to `raw_input`. Iterate until accepted.
204
+ The returned text becomes `v2`. Show it. The user can iterate v2 → v3 → ... by calling refine again with `previous_problem_statement` set to the latest draft.
106
205
 
107
- Store the final scope as `problem_statement` for the next call.
206
+ **Critical**: each refinement's `previous_problem_statement` is the LATEST draft, not the original v1. Otherwise the LLM keeps refining the same starting point and the user can't compose multiple refinements ("tighter, AND drop the audience clause, AND make it past-tense").
207
+
208
+ When the user accepts ("ship it" / "looks good"), store the final text as `problem_statement` for Phase 4.
108
209
 
109
210
  #### Phase 4 — Create the exploration
110
211
 
@@ -218,6 +319,32 @@ Show the final state:
218
319
  >
219
320
  > View at: https://dev.ritualapp.cloud/e/{exploration_id}
220
321
 
322
+ #### Phase 8 — After shipping: close the loop with `sync_implementation`
323
+
324
+ This phase happens *outside* the `/ritual build` chat — after the engineer has taken the accepted recommendations into their coding work and the implementation has actually landed (PR merged, etc.). The agent in that follow-up session should call `mcp__ritual__sync_implementation` to register what shipped, which decisions were made, and what was deliberately deferred.
325
+
326
+ When sync_implementation succeeds, the response now includes:
327
+
328
+ - `decisions: [{ decisionId, area, choice }, ...]` — IDs of every architectural decision logged
329
+ - `deferrals: [{ deferralId, rbId, severity }, ...]` — IDs of every deferral
330
+ - `decisionsCount`, `deferralsCount` — totals for the summary line
331
+ - `webUrl` — clickable link to the exploration's implementation record in the web UI
332
+
333
+ **Surface ALL of this to the user**, not just "ok logged." This is the visible signal that the loop closed. Format:
334
+
335
+ > ✓ Logged implementation for **{exploration name}**
336
+ > - {decisionsCount} decision{s} registered: {first 2 decisions, e.g. "auth: OAuth not SAML; data-model: tenant-scoped indexes"}
337
+ > - {deferralsCount} deferral{s} registered: {first 1, e.g. "[major] Rate-limit per-tenant — out of scope for v1"}
338
+ > - View: {webUrl}
339
+ >
340
+ > Future `/ritual build` calls touching `{first 2 of filesChanged}` will now see this implementation in their priorContext block.
341
+
342
+ The closing sentence is the most important one: it tells the user **what just happened in the system** in product-level terms. Without it, sync_implementation feels like a write-only black hole. With it, the user understands they just contributed to the workspace's memory.
343
+
344
+ If they want to check the state at any time, point them at:
345
+
346
+ > `ritual graph status` (in their CLI) — shows the workspace's current KG counts + recent implementations.
347
+
221
348
  ### Failure modes & recovery
222
349
 
223
350
  **Discovery generation hangs (>5 min polling without `ready: true`)**: ask the user — wait longer? retry (`suggest_discovery_questions` again, new task)? or skip discovery entirely (proceed to Phase 6 without picked questions)?
@@ -51,60 +51,161 @@ When **not** to use:
51
51
  - An exploration already exists with recommendations — fetch directly via `get_exploration` + `get_recommendations`
52
52
  - The user wants to *implement* a feature from existing recommendations — use `/ritual-builder-spec` (from `@ritual-ai/cli`)
53
53
 
54
- ### Workflow — 7 phases
54
+ ### Workflow — 9 phases (Phase 8 happens after the engineer ships the work)
55
55
 
56
56
  Each phase has explicit **[USER PAUSE]** points — never skip them.
57
57
 
58
58
  #### Phase 1 — Pick a workspace
59
59
 
60
- Call `mcp__ritual__list_workspaces`. Present as a numbered list (id, name).
60
+ Resolution order:
61
61
 
62
- If the user only has one project workspace, confirm it directly: "I'll create this in *Acme Engineering*. Sound right?" — but still wait for **[USER PAUSE]**.
62
+ 1. **Project-bound workspace (preferred).** Check for a `.ritual/config.json` at the project root (you can use the Read tool — the file is a small JSON with `workspaceId` + `workspaceName`). If it exists, that's the workspace this repo is bound to. Surface it to the user: "Using workspace **`{workspaceName}`** (bound to this repo). Sound right?" — wait for **[USER PAUSE]**.
63
+ 2. **List existing project workspaces.** If no `.ritual/config.json`, call `mcp__ritual__list_workspaces` — this returns project-type workspaces (the General workspace is excluded by default; agents never use it). Present as a numbered list (id, name). **[USER PAUSE]** for selection.
64
+ 3. **Create a new one if none exist or user wants a fresh one.** Call `mcp__ritual__create_workspace` with a name — convention is to name it after the repo (basename of cwd, or origin remote). Confirm the name with the user first. **[USER PAUSE]**
63
65
 
64
- Store `workspace_id`.
66
+ Store `workspace_id` for the rest of the flow.
67
+
68
+ If you created a new workspace, suggest the user run `ritual init` to persist the binding into `.ritual/config.json` for next time (or write it yourself if the user is in a Claude Code session that can edit files).
69
+
70
+ #### Phase 1.5 — Code reconnaissance
71
+
72
+ **Skip only if the user explicitly asks ("just generate, don't read the code") OR if you're operating outside a codebase context.**
73
+
74
+ Before generating considerations, do a structured scan of the codebase so the sub-problems land specific to *this* code, not generic. The legacy system shipped considerations that read like a textbook table of contents because it skipped this step; the difference between "Conversion Timing, Trust Signals, Recovery Paths" (generic) and "Leverage `apps/checkout/session.py` anonymous-checkout path vs. adding a new step in `CheckoutSessionMixin`" (codebase-specific) is whether you did Phase 1.5.
75
+
76
+ Steps:
77
+
78
+ 1. **Read the README + top-level project structure.** Use `ls` / Glob to see top-level files (5–10 lines). Identify the language, framework, key directories.
79
+
80
+ 2. **Glob for relevance.** Derive patterns from the user's problem. Examples:
81
+ - User says "auth flow" → `**/auth/**`, `**/login*`, `**/user*`, `**/session*`
82
+ - User says "checkout" → `**/checkout/**`, `**/cart/**`, `**/order/**`, `**/payment*`
83
+ - User says "notifications" → `**/notif*`, `**/email/**`, `**/sms/**`, `**/push/**`
84
+ Cap at ~15 hits per pattern.
85
+
86
+ 3. **Skim 3–5 most-relevant files.** For each, read the first ~100 lines + scan for class/function names. Triangulate: "is the behavior we care about actually here, or does it call into somewhere else?"
87
+
88
+ 4. **Build a recon summary** — 5–10 lines, concrete file paths, key abstractions, observed constraints. Examples of GOOD summaries:
89
+
90
+ > Codebase recon (django-oscar, Django 5.0):
91
+ > - Booking flow lives in `apps/checkout/views.py` (`CheckoutSessionMixin` orchestrates the step chain)
92
+ > - Anonymous checkout already supported via `apps/checkout/session.py:CheckoutSessionData`
93
+ > - Account creation entry point: `apps/customer/forms.py:RegisterUserForm`
94
+ > - Strategy classes (`apps/partner/strategy.py`) are the conventional pluggability pattern
95
+ > - User-model split: `auth.User` is the Django auth user; `apps/customer/models.py:Customer` is the order-attached profile — they can be linked or anonymous
96
+
97
+ 5. **Surface to the user as a [USER PAUSE]**:
98
+
99
+ > Reading the codebase I see:
100
+ > <recon summary>
101
+ >
102
+ > Anything I'm missing about design intent or constraints I won't find by reading the code (e.g. business decisions, prod incidents, in-flight migrations)?
103
+
104
+ Wait for confirmation or additional context. The user's reply (if any) becomes part of the input to Phase 2.
105
+
106
+ 6. **Compose the augmented `raw_input` for Phase 2.** Concatenate:
107
+ - The user's original problem (verbatim, top)
108
+ - A `--- Codebase context ---` section with the recon summary
109
+ - The user's reply from step 5 if non-empty
110
+ - Phase 1.5 IS the difference between "generic considerations" and "considerations that name actual files and reference existing patterns". Don't skip the recon step. Don't compress the summary to one line.
111
+
112
+ 7. **Collect the `sources` array.** The file paths you read in step 3 — exactly as they appear in the repo (e.g. `"apps/checkout/views.py"`, NOT `"./apps/checkout/views.py"` or absolute paths). This list is passed alongside `raw_input` to `generate_considerations` and `generate_problem_statement` so the API can auto-inject **prior knowledge-graph context** for overlapping files. Keep the list focused — only files you actually read and consider load-bearing for this problem. 5–10 is the sweet spot; >20 dilutes the KG signal.
113
+
114
+ **[USER PAUSE]** confirmation of recon is required before proceeding.
65
115
 
66
116
  #### Phase 2 — Generate sub-problems
67
117
 
118
+ ##### 2.1 First draft
119
+
68
120
  Call `mcp__ritual__generate_considerations` with:
69
121
  - `workspace_id`
70
- - `raw_input` (the problem from the slash-command argument or chat)
122
+ - `raw_input` (the user's problem + the Phase 1.5 codebase recon, concatenated as described above)
71
123
  - `template_id` (omit unless the user specified one)
124
+ - `sources` (the file path list from Phase 1.5 step 7 — file-path strings only, e.g. `["apps/checkout/views.py", ...]`)
72
125
 
73
- LLM call, ~5-10s. Returns 5-6 sub-problems — different framing axes the agent should investigate.
126
+ LLM call, ~510s. Returns 56 sub-problems — different framing axes the agent should investigate. Track each one as `{ text, version: 1 }` in your working memory.
127
+
128
+ **If the response includes `kg_context_used` with `implementationCount > 0`:** surface this to the user BEFORE presenting the considerations. It's the visible signal that prior team decisions shaped this draft.
129
+
130
+ > Reading the codebase I overlapped with 3 prior Ritual explorations on these files:
131
+ > - **"Anonymous checkout opt-in"** (shipped 2026-04-12) — 2 decisions, 1 deferral
132
+ > - **"Payment-method routing"** (shipped 2026-03-22) — 4 decisions
133
+ > - **"Session-data persistence"** (shipped 2026-02-08) — 1 decision
134
+ >
135
+ > I factored those into the sub-problems below.
136
+
137
+ If `implementationCount === 0`: don't mention the KG check (silent — would just be noise on a cold KG).
74
138
 
75
139
  **[USER PAUSE]** Present as a numbered list and ask which to include:
76
140
 
77
- > Sub-problems the system identified:
141
+ > Sub-problems the system identified (v1):
78
142
  >
79
143
  > 1. {sub-problem 1}
80
144
  > 2. {sub-problem 2}
81
145
  > ...
82
146
  >
83
- > Which should we factor into the scope? Pick any subset, or "all".
147
+ > Which should we factor into the scope? Pick any subset, "all", or ask for a different framing (e.g. "make them more technical", "drop the measurement angle", "focus on enterprise").
148
+
149
+ ##### 2.2 Iteration loop
84
150
 
85
- Store the picked sub-problems (text strings — they'll go into `considerations[]` next call).
151
+ If the user asks for a refinement (anything that isn't "all" / specific picks / "these are good"):
152
+
153
+ Call `mcp__ritual__refine_considerations` with:
154
+ - `workspace_id`, `raw_input`, `template_id`, `sources` — unchanged from the generate call. Critical: pass the SAME `sources` array each iteration so the KG-injected priorContext stays consistent.
155
+ - `change_prompt`: the user's request verbatim
156
+ - `selected`: items from prior versions the user kept (track `{ text, from_version }`, send just `text`)
157
+ - `dismissed`: items the user explicitly rejected
158
+ - `session_id`: omitted on the first refinement; pass the `session_id` from the previous refine response on subsequent ones to chain context
159
+
160
+ Track the new items as `{ text, version: N+1 }`. Present **alongside** the prior versions, not replacing them — the user can mix selections across versions.
161
+
162
+ Loop until the user says "these are good" or picks a subset.
163
+
164
+ **Critical**: never re-call `generate_considerations` for a refinement. That endpoint is stateless and re-rolls a fresh seed; you'll lose what the user just told you. The whole point of `refine_*` is the LLM sees the iteration context.
165
+
166
+ Store the final picked sub-problems for Phase 3 — they go into `considerations[]`.
86
167
 
87
168
  #### Phase 3 — Generate scope
88
169
 
170
+ ##### 3.1 First draft
171
+
89
172
  Call `mcp__ritual__generate_problem_statement` with:
90
173
  - `workspace_id`
91
- - `raw_input`
92
- - `considerations` (the picks from Phase 2 — these are the sub-problems we just discussed)
174
+ - `raw_input` (same augmented version from Phase 2)
175
+ - `considerations` (the picks from Phase 2)
93
176
  - `template_id` (same as Phase 2 if used)
177
+ - `sources` (the same file-path list passed to generate_considerations — keeps the KG anchor consistent)
94
178
 
95
- Returns a polished "How might we ..." style scope (typically <800 chars) plus optional follow-up questions and quality scores.
179
+ Returns a polished "How might we ..." style scope (typically <800 chars) plus optional follow-up questions and quality scores. Treat this as **v1** of the problem statement.
180
+
181
+ If the response includes `kg_context_used` with `implementationCount > 0`, prepend a note to the scope presentation:
182
+
183
+ > *(Grounded in {N} prior implementation{s}: {top match name}, …)*
96
184
 
97
185
  **[USER PAUSE]** Present and ask:
98
186
 
99
- > Here's the scope:
187
+ > Here's the scope (v1):
100
188
  >
101
189
  > > **{generated scope}**
102
190
  >
103
- > Looks good? Want to tighten / broaden / change the audience? Or "ship it" to lock it in.
191
+ > Looks good? Want to tighten / broaden / change the audience / focus on a specific tier? Or "ship it" to lock it in.
192
+
193
+ ##### 3.2 Iteration loop
194
+
195
+ If the user asks for a refinement:
196
+
197
+ Call `mcp__ritual__refine_problem_statement` with:
198
+ - `workspace_id`, `raw_input`, `considerations`, `template_id`, `sources` — unchanged. (Same `sources` as the original generate call — keeps the KG anchor stable.)
199
+ - `previous_problem_statement`: the FULL TEXT of the current best draft (the v1 you just showed)
200
+ - `change_prompt`: the user's request verbatim ("tighten and drop the audience clause")
201
+ - `version`: optional label like `"v2"` — purely for telemetry
202
+ - `session_id`: omitted on the first refinement; chain on subsequent ones
104
203
 
105
- If refinement requested: regenerate with the refinement appended to `raw_input`. Iterate until accepted.
204
+ The returned text becomes `v2`. Show it. The user can iterate v2 → v3 → ... by calling refine again with `previous_problem_statement` set to the latest draft.
106
205
 
107
- Store the final scope as `problem_statement` for the next call.
206
+ **Critical**: each refinement's `previous_problem_statement` is the LATEST draft, not the original v1. Otherwise the LLM keeps refining the same starting point and the user can't compose multiple refinements ("tighter, AND drop the audience clause, AND make it past-tense").
207
+
208
+ When the user accepts ("ship it" / "looks good"), store the final text as `problem_statement` for Phase 4.
108
209
 
109
210
  #### Phase 4 — Create the exploration
110
211
 
@@ -218,6 +319,32 @@ Show the final state:
218
319
  >
219
320
  > View at: https://dev.ritualapp.cloud/e/{exploration_id}
220
321
 
322
+ #### Phase 8 — After shipping: close the loop with `sync_implementation`
323
+
324
+ This phase happens *outside* the `/ritual build` chat — after the engineer has taken the accepted recommendations into their coding work and the implementation has actually landed (PR merged, etc.). The agent in that follow-up session should call `mcp__ritual__sync_implementation` to register what shipped, which decisions were made, and what was deliberately deferred.
325
+
326
+ When sync_implementation succeeds, the response now includes:
327
+
328
+ - `decisions: [{ decisionId, area, choice }, ...]` — IDs of every architectural decision logged
329
+ - `deferrals: [{ deferralId, rbId, severity }, ...]` — IDs of every deferral
330
+ - `decisionsCount`, `deferralsCount` — totals for the summary line
331
+ - `webUrl` — clickable link to the exploration's implementation record in the web UI
332
+
333
+ **Surface ALL of this to the user**, not just "ok logged." This is the visible signal that the loop closed. Format:
334
+
335
+ > ✓ Logged implementation for **{exploration name}**
336
+ > - {decisionsCount} decision{s} registered: {first 2 decisions, e.g. "auth: OAuth not SAML; data-model: tenant-scoped indexes"}
337
+ > - {deferralsCount} deferral{s} registered: {first 1, e.g. "[major] Rate-limit per-tenant — out of scope for v1"}
338
+ > - View: {webUrl}
339
+ >
340
+ > Future `/ritual build` calls touching `{first 2 of filesChanged}` will now see this implementation in their priorContext block.
341
+
342
+ The closing sentence is the most important one: it tells the user **what just happened in the system** in product-level terms. Without it, sync_implementation feels like a write-only black hole. With it, the user understands they just contributed to the workspace's memory.
343
+
344
+ If they want to check the state at any time, point them at:
345
+
346
+ > `ritual graph status` (in their CLI) — shows the workspace's current KG counts + recent implementations.
347
+
221
348
  ### Failure modes & recovery
222
349
 
223
350
  **Discovery generation hangs (>5 min polling without `ready: true`)**: ask the user — wait longer? retry (`suggest_discovery_questions` again, new task)? or skip discovery entirely (proceed to Phase 6 without picked questions)?
@@ -51,60 +51,161 @@ When **not** to use:
51
51
  - An exploration already exists with recommendations — fetch directly via `get_exploration` + `get_recommendations`
52
52
  - The user wants to *implement* a feature from existing recommendations — use `/ritual-builder-spec` (from `@ritual-ai/cli`)
53
53
 
54
- ### Workflow — 7 phases
54
+ ### Workflow — 9 phases (Phase 8 happens after the engineer ships the work)
55
55
 
56
56
  Each phase has explicit **[USER PAUSE]** points — never skip them.
57
57
 
58
58
  #### Phase 1 — Pick a workspace
59
59
 
60
- Call `mcp__ritual__list_workspaces`. Present as a numbered list (id, name).
60
+ Resolution order:
61
61
 
62
- If the user only has one project workspace, confirm it directly: "I'll create this in *Acme Engineering*. Sound right?" — but still wait for **[USER PAUSE]**.
62
+ 1. **Project-bound workspace (preferred).** Check for a `.ritual/config.json` at the project root (you can use the Read tool — the file is a small JSON with `workspaceId` + `workspaceName`). If it exists, that's the workspace this repo is bound to. Surface it to the user: "Using workspace **`{workspaceName}`** (bound to this repo). Sound right?" — wait for **[USER PAUSE]**.
63
+ 2. **List existing project workspaces.** If no `.ritual/config.json`, call `mcp__ritual__list_workspaces` — this returns project-type workspaces (the General workspace is excluded by default; agents never use it). Present as a numbered list (id, name). **[USER PAUSE]** for selection.
64
+ 3. **Create a new one if none exist or user wants a fresh one.** Call `mcp__ritual__create_workspace` with a name — convention is to name it after the repo (basename of cwd, or origin remote). Confirm the name with the user first. **[USER PAUSE]**
63
65
 
64
- Store `workspace_id`.
66
+ Store `workspace_id` for the rest of the flow.
67
+
68
+ If you created a new workspace, suggest the user run `ritual init` to persist the binding into `.ritual/config.json` for next time (or write it yourself if the user is in a Claude Code session that can edit files).
69
+
70
+ #### Phase 1.5 — Code reconnaissance
71
+
72
+ **Skip only if the user explicitly asks ("just generate, don't read the code") OR if you're operating outside a codebase context.**
73
+
74
+ Before generating considerations, do a structured scan of the codebase so the sub-problems land specific to *this* code, not generic. The legacy system shipped considerations that read like a textbook table of contents because it skipped this step; the difference between "Conversion Timing, Trust Signals, Recovery Paths" (generic) and "Leverage `apps/checkout/session.py` anonymous-checkout path vs. adding a new step in `CheckoutSessionMixin`" (codebase-specific) is whether you did Phase 1.5.
75
+
76
+ Steps:
77
+
78
+ 1. **Read the README + top-level project structure.** Use `ls` / Glob to see top-level files (5–10 lines). Identify the language, framework, key directories.
79
+
80
+ 2. **Glob for relevance.** Derive patterns from the user's problem. Examples:
81
+ - User says "auth flow" → `**/auth/**`, `**/login*`, `**/user*`, `**/session*`
82
+ - User says "checkout" → `**/checkout/**`, `**/cart/**`, `**/order/**`, `**/payment*`
83
+ - User says "notifications" → `**/notif*`, `**/email/**`, `**/sms/**`, `**/push/**`
84
+ Cap at ~15 hits per pattern.
85
+
86
+ 3. **Skim 3–5 most-relevant files.** For each, read the first ~100 lines + scan for class/function names. Triangulate: "is the behavior we care about actually here, or does it call into somewhere else?"
87
+
88
+ 4. **Build a recon summary** — 5–10 lines, concrete file paths, key abstractions, observed constraints. Examples of GOOD summaries:
89
+
90
+ > Codebase recon (django-oscar, Django 5.0):
91
+ > - Booking flow lives in `apps/checkout/views.py` (`CheckoutSessionMixin` orchestrates the step chain)
92
+ > - Anonymous checkout already supported via `apps/checkout/session.py:CheckoutSessionData`
93
+ > - Account creation entry point: `apps/customer/forms.py:RegisterUserForm`
94
+ > - Strategy classes (`apps/partner/strategy.py`) are the conventional pluggability pattern
95
+ > - User-model split: `auth.User` is the Django auth user; `apps/customer/models.py:Customer` is the order-attached profile — they can be linked or anonymous
96
+
97
+ 5. **Surface to the user as a [USER PAUSE]**:
98
+
99
+ > Reading the codebase I see:
100
+ > <recon summary>
101
+ >
102
+ > Anything I'm missing about design intent or constraints I won't find by reading the code (e.g. business decisions, prod incidents, in-flight migrations)?
103
+
104
+ Wait for confirmation or additional context. The user's reply (if any) becomes part of the input to Phase 2.
105
+
106
+ 6. **Compose the augmented `raw_input` for Phase 2.** Concatenate:
107
+ - The user's original problem (verbatim, top)
108
+ - A `--- Codebase context ---` section with the recon summary
109
+ - The user's reply from step 5 if non-empty
110
+ - Phase 1.5 IS the difference between "generic considerations" and "considerations that name actual files and reference existing patterns". Don't skip the recon step. Don't compress the summary to one line.
111
+
112
+ 7. **Collect the `sources` array.** The file paths you read in step 3 — exactly as they appear in the repo (e.g. `"apps/checkout/views.py"`, NOT `"./apps/checkout/views.py"` or absolute paths). This list is passed alongside `raw_input` to `generate_considerations` and `generate_problem_statement` so the API can auto-inject **prior knowledge-graph context** for overlapping files. Keep the list focused — only files you actually read and consider load-bearing for this problem. 5–10 is the sweet spot; >20 dilutes the KG signal.
113
+
114
+ **[USER PAUSE]** confirmation of recon is required before proceeding.
65
115
 
66
116
  #### Phase 2 — Generate sub-problems
67
117
 
118
+ ##### 2.1 First draft
119
+
68
120
  Call `mcp__ritual__generate_considerations` with:
69
121
  - `workspace_id`
70
- - `raw_input` (the problem from the slash-command argument or chat)
122
+ - `raw_input` (the user's problem + the Phase 1.5 codebase recon, concatenated as described above)
71
123
  - `template_id` (omit unless the user specified one)
124
+ - `sources` (the file path list from Phase 1.5 step 7 — file-path strings only, e.g. `["apps/checkout/views.py", ...]`)
72
125
 
73
- LLM call, ~5-10s. Returns 5-6 sub-problems — different framing axes the agent should investigate.
126
+ LLM call, ~510s. Returns 56 sub-problems — different framing axes the agent should investigate. Track each one as `{ text, version: 1 }` in your working memory.
127
+
128
+ **If the response includes `kg_context_used` with `implementationCount > 0`:** surface this to the user BEFORE presenting the considerations. It's the visible signal that prior team decisions shaped this draft.
129
+
130
+ > Reading the codebase I overlapped with 3 prior Ritual explorations on these files:
131
+ > - **"Anonymous checkout opt-in"** (shipped 2026-04-12) — 2 decisions, 1 deferral
132
+ > - **"Payment-method routing"** (shipped 2026-03-22) — 4 decisions
133
+ > - **"Session-data persistence"** (shipped 2026-02-08) — 1 decision
134
+ >
135
+ > I factored those into the sub-problems below.
136
+
137
+ If `implementationCount === 0`: don't mention the KG check (silent — would just be noise on a cold KG).
74
138
 
75
139
  **[USER PAUSE]** Present as a numbered list and ask which to include:
76
140
 
77
- > Sub-problems the system identified:
141
+ > Sub-problems the system identified (v1):
78
142
  >
79
143
  > 1. {sub-problem 1}
80
144
  > 2. {sub-problem 2}
81
145
  > ...
82
146
  >
83
- > Which should we factor into the scope? Pick any subset, or "all".
147
+ > Which should we factor into the scope? Pick any subset, "all", or ask for a different framing (e.g. "make them more technical", "drop the measurement angle", "focus on enterprise").
148
+
149
+ ##### 2.2 Iteration loop
84
150
 
85
- Store the picked sub-problems (text strings — they'll go into `considerations[]` next call).
151
+ If the user asks for a refinement (anything that isn't "all" / specific picks / "these are good"):
152
+
153
+ Call `mcp__ritual__refine_considerations` with:
154
+ - `workspace_id`, `raw_input`, `template_id`, `sources` — unchanged from the generate call. Critical: pass the SAME `sources` array each iteration so the KG-injected priorContext stays consistent.
155
+ - `change_prompt`: the user's request verbatim
156
+ - `selected`: items from prior versions the user kept (track `{ text, from_version }`, send just `text`)
157
+ - `dismissed`: items the user explicitly rejected
158
+ - `session_id`: omitted on the first refinement; pass the `session_id` from the previous refine response on subsequent ones to chain context
159
+
160
+ Track the new items as `{ text, version: N+1 }`. Present **alongside** the prior versions, not replacing them — the user can mix selections across versions.
161
+
162
+ Loop until the user says "these are good" or picks a subset.
163
+
164
+ **Critical**: never re-call `generate_considerations` for a refinement. That endpoint is stateless and re-rolls a fresh seed; you'll lose what the user just told you. The whole point of `refine_*` is the LLM sees the iteration context.
165
+
166
+ Store the final picked sub-problems for Phase 3 — they go into `considerations[]`.
86
167
 
87
168
  #### Phase 3 — Generate scope
88
169
 
170
+ ##### 3.1 First draft
171
+
89
172
  Call `mcp__ritual__generate_problem_statement` with:
90
173
  - `workspace_id`
91
- - `raw_input`
92
- - `considerations` (the picks from Phase 2 — these are the sub-problems we just discussed)
174
+ - `raw_input` (same augmented version from Phase 2)
175
+ - `considerations` (the picks from Phase 2)
93
176
  - `template_id` (same as Phase 2 if used)
177
+ - `sources` (the same file-path list passed to generate_considerations — keeps the KG anchor consistent)
94
178
 
95
- Returns a polished "How might we ..." style scope (typically <800 chars) plus optional follow-up questions and quality scores.
179
+ Returns a polished "How might we ..." style scope (typically <800 chars) plus optional follow-up questions and quality scores. Treat this as **v1** of the problem statement.
180
+
181
+ If the response includes `kg_context_used` with `implementationCount > 0`, prepend a note to the scope presentation:
182
+
183
+ > *(Grounded in {N} prior implementation{s}: {top match name}, …)*
96
184
 
97
185
  **[USER PAUSE]** Present and ask:
98
186
 
99
- > Here's the scope:
187
+ > Here's the scope (v1):
100
188
  >
101
189
  > > **{generated scope}**
102
190
  >
103
- > Looks good? Want to tighten / broaden / change the audience? Or "ship it" to lock it in.
191
+ > Looks good? Want to tighten / broaden / change the audience / focus on a specific tier? Or "ship it" to lock it in.
192
+
193
+ ##### 3.2 Iteration loop
194
+
195
+ If the user asks for a refinement:
196
+
197
+ Call `mcp__ritual__refine_problem_statement` with:
198
+ - `workspace_id`, `raw_input`, `considerations`, `template_id`, `sources` — unchanged. (Same `sources` as the original generate call — keeps the KG anchor stable.)
199
+ - `previous_problem_statement`: the FULL TEXT of the current best draft (the v1 you just showed)
200
+ - `change_prompt`: the user's request verbatim ("tighten and drop the audience clause")
201
+ - `version`: optional label like `"v2"` — purely for telemetry
202
+ - `session_id`: omitted on the first refinement; chain on subsequent ones
104
203
 
105
- If refinement requested: regenerate with the refinement appended to `raw_input`. Iterate until accepted.
204
+ The returned text becomes `v2`. Show it. The user can iterate v2 → v3 → ... by calling refine again with `previous_problem_statement` set to the latest draft.
106
205
 
107
- Store the final scope as `problem_statement` for the next call.
206
+ **Critical**: each refinement's `previous_problem_statement` is the LATEST draft, not the original v1. Otherwise the LLM keeps refining the same starting point and the user can't compose multiple refinements ("tighter, AND drop the audience clause, AND make it past-tense").
207
+
208
+ When the user accepts ("ship it" / "looks good"), store the final text as `problem_statement` for Phase 4.
108
209
 
109
210
  #### Phase 4 — Create the exploration
110
211
 
@@ -218,6 +319,32 @@ Show the final state:
218
319
  >
219
320
  > View at: https://dev.ritualapp.cloud/e/{exploration_id}
220
321
 
322
+ #### Phase 8 — After shipping: close the loop with `sync_implementation`
323
+
324
+ This phase happens *outside* the `/ritual build` chat — after the engineer has taken the accepted recommendations into their coding work and the implementation has actually landed (PR merged, etc.). The agent in that follow-up session should call `mcp__ritual__sync_implementation` to register what shipped, which decisions were made, and what was deliberately deferred.
325
+
326
+ When sync_implementation succeeds, the response now includes:
327
+
328
+ - `decisions: [{ decisionId, area, choice }, ...]` — IDs of every architectural decision logged
329
+ - `deferrals: [{ deferralId, rbId, severity }, ...]` — IDs of every deferral
330
+ - `decisionsCount`, `deferralsCount` — totals for the summary line
331
+ - `webUrl` — clickable link to the exploration's implementation record in the web UI
332
+
333
+ **Surface ALL of this to the user**, not just "ok logged." This is the visible signal that the loop closed. Format:
334
+
335
+ > ✓ Logged implementation for **{exploration name}**
336
+ > - {decisionsCount} decision{s} registered: {first 2 decisions, e.g. "auth: OAuth not SAML; data-model: tenant-scoped indexes"}
337
+ > - {deferralsCount} deferral{s} registered: {first 1, e.g. "[major] Rate-limit per-tenant — out of scope for v1"}
338
+ > - View: {webUrl}
339
+ >
340
+ > Future `/ritual build` calls touching `{first 2 of filesChanged}` will now see this implementation in their priorContext block.
341
+
342
+ The closing sentence is the most important one: it tells the user **what just happened in the system** in product-level terms. Without it, sync_implementation feels like a write-only black hole. With it, the user understands they just contributed to the workspace's memory.
343
+
344
+ If they want to check the state at any time, point them at:
345
+
346
+ > `ritual graph status` (in their CLI) — shows the workspace's current KG counts + recent implementations.
347
+
221
348
  ### Failure modes & recovery
222
349
 
223
350
  **Discovery generation hangs (>5 min polling without `ready: true`)**: ask the user — wait longer? retry (`suggest_discovery_questions` again, new task)? or skip discovery entirely (proceed to Phase 6 without picked questions)?