@event4u/agent-config 1.21.0 → 1.22.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (53) hide show
  1. package/.agent-src/commands/bug-fix.md +1 -0
  2. package/.agent-src/commands/bug-investigate.md +1 -0
  3. package/.agent-src/commands/challenge-me/vision.md +348 -0
  4. package/.agent-src/commands/challenge-me/with-docs.md +333 -0
  5. package/.agent-src/commands/challenge-me.md +61 -0
  6. package/.agent-src/commands/council/default.md +64 -17
  7. package/.agent-src/commands/create-pr.md +7 -3
  8. package/.agent-src/commands/grill-me.md +38 -0
  9. package/.agent-src/commands/judge/steps.md +1 -1
  10. package/.agent-src/commands/roadmap/ai-council.md +183 -0
  11. package/.agent-src/commands/roadmap/create.md +6 -1
  12. package/.agent-src/commands/roadmap/process-full.md +58 -0
  13. package/.agent-src/commands/roadmap/process-phase.md +69 -0
  14. package/.agent-src/commands/roadmap/process-step.md +57 -0
  15. package/.agent-src/commands/roadmap.md +44 -16
  16. package/.agent-src/commands/threat-model.md +1 -0
  17. package/.agent-src/contexts/augment-infrastructure.md +1 -1
  18. package/.agent-src/contexts/communication/rules-auto/slash-command-routing-policy-mechanics.md +53 -18
  19. package/.agent-src/contexts/execution/roadmap-process-loop.md +125 -0
  20. package/.agent-src/contexts/skills-and-commands.md +1 -1
  21. package/.agent-src/rules/improve-before-implement.md +1 -0
  22. package/.agent-src/rules/invite-challenge.md +71 -0
  23. package/.agent-src/skills/adversarial-review/SKILL.md +1 -0
  24. package/.agent-src/skills/ai-council/SKILL.md +132 -8
  25. package/.agent-src/skills/bug-analyzer/SKILL.md +1 -0
  26. package/.agent-src/skills/roadmap-management/SKILL.md +7 -7
  27. package/.agent-src/skills/systematic-debugging/SKILL.md +22 -2
  28. package/.agent-src/skills/technical-specification/SKILL.md +58 -1
  29. package/.agent-src/skills/threat-modeling/SKILL.md +1 -0
  30. package/.agent-src/templates/agent-settings.md +14 -3
  31. package/.agent-src/templates/command.md +17 -1
  32. package/.agent-src/templates/roadmaps.md +10 -2
  33. package/.agent-src/templates/rule.md +2 -0
  34. package/.agent-src/templates/skill.md +17 -0
  35. package/.claude-plugin/marketplace.json +9 -2
  36. package/AGENTS.md +2 -2
  37. package/CHANGELOG.md +38 -0
  38. package/README.md +1 -1
  39. package/config/agent-settings.template.yml +22 -0
  40. package/docs/architecture.md +2 -2
  41. package/docs/contracts/command-clusters.md +45 -1
  42. package/docs/decisions/ADR-003-flat-cluster-subs-and-colon-syntax.md +126 -0
  43. package/docs/getting-started.md +1 -1
  44. package/docs/guidelines/agent-infra/naming.md +1 -1
  45. package/package.json +1 -1
  46. package/scripts/_phase2_shim_helper.py +1 -1
  47. package/scripts/council_cli.py +127 -10
  48. package/scripts/migrate_command_suggestions.py +2 -2
  49. package/scripts/schemas/command.schema.json +5 -0
  50. package/scripts/schemas/rule.schema.json +5 -0
  51. package/scripts/schemas/skill.schema.json +5 -0
  52. package/scripts/skill_linter.py +1 -1
  53. package/.agent-src/commands/roadmap/execute.md +0 -109
@@ -0,0 +1,333 @@
1
+ ---
2
+ name: challenge-me:with-docs
3
+ cluster: challenge-me
4
+ sub: with-docs
5
+ description: "Doc-aware /challenge-me — 95%-confidence interview with session glossary vs CONTEXT.md, load-bearing claim-vs-code verification, optional CONTEXT.md patch + ADR candidates in the pitch."
6
+ disable-model-invocation: true
7
+ suggestion:
8
+ eligible: true
9
+ trigger_description: "challenge me against the docs, grill me with our context, grill me against the docs, grill me against CONTEXT.md, grill me with the project context, frag mich durch und prüf gegen CONTEXT.md, challenge plan with domain docs"
10
+ trigger_context: "user wants the seed challenged against existing project glossary, CONTEXT.md, or ADRs — not a greenfield interview"
11
+ ---
12
+
13
+ # /challenge-me with-docs
14
+
15
+ > Same one-question-at-a-time interview as `/challenge-me vision`, plus:
16
+ > a **session glossary** against `CONTEXT.md`, **load-bearing claim-vs-code
17
+ > verification**, and an optional `CONTEXT.md` **patch** + **ADR candidates**
18
+ > appended to the final pitch. Nothing is written to disk — the pitch is
19
+ > the only artefact.
20
+
21
+ ## Welcome
22
+
23
+ If the user invokes `/challenge-me with-docs` with no body, render once:
24
+
25
+ ```
26
+ Drop the seed and I'll grill it against your project docs. Per turn:
27
+ one question, recommended answer. I'll surface glossary clashes vs
28
+ CONTEXT.md and verify load-bearing claims against the code. At the end
29
+ you get a pitch — plus an optional CONTEXT.md patch and ADR candidates
30
+ to copy / apply yourself.
31
+
32
+ Stop early any time by typing `!pitch` (or `!Vision pitchen`).
33
+ ```
34
+
35
+ Skip the welcome if the user invokes `/challenge-me with-docs <seed>`
36
+ directly.
37
+
38
+ ## When to use this instead of `vision`
39
+
40
+ | Condition | Pick |
41
+ |---|---|
42
+ | Project has no `CONTEXT.md` and no ADR dir | `vision` |
43
+ | Seed uses domain terms that may already be defined elsewhere | `with-docs` |
44
+ | User says "check against our docs" / "make sure it matches CONTEXT.md" | `with-docs` |
45
+ | User says "make sure this is consistent with the codebase" | `with-docs` |
46
+ | Greenfield brainstorm, no existing constraints to honour | `vision` |
47
+
48
+ If neither file exists and the user invoked `with-docs` anyway, fall
49
+ back gracefully to `vision`-style flow and skip the doc inventory; note
50
+ the absence in a single line at the end of Step 0.
51
+
52
+ ## Stop condition, flags, pitch trigger
53
+
54
+ Inherits **verbatim** from [`vision.md`](vision.md):
55
+
56
+ - Default stop = the four 95%-conditions (AND, not OR).
57
+ - Load-bearing test = answer changes goal / scope / hard constraint / AC.
58
+ - `--until=95%` (default) · `--until=N` · `--keep-going`.
59
+ - Pitch trigger: `!pitch`, `/pitch`, or whole reply `pitch`.
60
+ - Step 1 question-block shape, Step 4 pitch validation.
61
+
62
+ The deltas below replace / extend Step 0, Step 1, Step 2, Step 4 and
63
+ Step 5. Steps 3 and 6 are unchanged.
64
+
65
+ ## Steps
66
+
67
+ ### Step 0 — Inspect (read-only doc inventory + codebase)
68
+
69
+ ```
70
+ EXPLORE THE DOCS AND THE CODEBASE BEFORE THE FIRST QUESTION.
71
+ NEVER WRITE TO DISK IN STEP 0. NEVER ASK WHAT view / grep / codebase-retrieval ANSWERS.
72
+ ```
73
+
74
+ 1. **Read the seed** (same as `vision`).
75
+ 2. **Doc inventory** — read-only:
76
+ - `view CONTEXT.md` if it exists; init empty **session glossary** in
77
+ agent state (in-memory only).
78
+ - `view CONTEXT-MAP.md` if it exists. If the seed touches > 1 listed
79
+ context, **ask once** (numbered options listing each context) which
80
+ to focus on; record the choice, do not re-ask.
81
+ - Autodetect ADR directory by listing `docs/adr/`, `docs/decisions/`,
82
+ `docs/architecture/decisions/`, `agents/decisions/`. Pick the
83
+ **most-recently-modified** non-empty dir; if none exist, default to
84
+ `docs/adr/` for the patch output (do **not** create the dir).
85
+ - If `agent state` already holds a session glossary from a prior
86
+ `/challenge-me with-docs` invocation **in this same chat**, load
87
+ it. Then re-read `CONTEXT.md` — any term in `CONTEXT.md` overrides
88
+ the session glossary (disk wins over draft). Emit a delta line if
89
+ non-zero: `Loaded N draft terms; CONTEXT.md already has X — carrying
90
+ forward the rest as drafts.`
91
+ 3. **Codebase lookup** — same as `vision` Step 0.2 (existing routes,
92
+ model fields, conventions, feature flags).
93
+ 4. **List the open dimensions** internally — same as `vision` Step 0.3.
94
+
95
+ ### Step 1 — Glossary-aware questioning
96
+
97
+ Every question block runs an extra check **before** emit:
98
+
99
+ - **Glossary conflict** — does the seed (or a prior turn) use a term
100
+ that conflicts with the existing `CONTEXT.md` glossary, including a
101
+ case-insensitive match? → first question of the session is the
102
+ glossary disambiguation:
103
+
104
+ > Your `CONTEXT.md` defines **Account** as the billing entity. Your
105
+ > seed uses "account" for the logged-in person. Which is it?
106
+ > 1. **Match CONTEXT.md** — rename "account" in the plan to "user".
107
+ > 2. Override CONTEXT.md — the plan introduces a new meaning; we patch
108
+ > the glossary at pitch time.
109
+ > 3. Skip / not relevant.
110
+
111
+ - **Fuzzy / overloaded term** — propose a canonical term as option 1.
112
+
113
+ - **No glossary present** → skip the check; behave like `vision`.
114
+
115
+ The glossary check is itself **load-bearing only when the term flips an
116
+ AC or scope boundary OR when it conflicts with an existing CONTEXT.md
117
+ entry**. Pure naming preference with no conflict is closed silently
118
+ (same load-bearing test as `vision`, with the conflict-extension).
119
+
120
+ Otherwise the question-block shape, recommended-option rule, 3–4 option
121
+ cap, and `Why it matters` requirement are identical to
122
+ [`vision.md § Step 1`](vision.md#step-1-ask-one-question-per-turn).
123
+
124
+ ### Step 2 — Apply the answer + session glossary (in-memory)
125
+
126
+ Adopt-the-answer rules are identical to
127
+ [`vision.md § Step 2`](vision.md#step-2-apply-the-answer).
128
+
129
+ When a term is resolved, write to **session glossary** (agent state
130
+ only). **Never write to disk in this step.** Every turn's reply ends
131
+ with a compact echo block when the glossary is non-empty:
132
+
133
+ ```
134
+ **Session glossary (draft, not yet written):**
135
+ - **Cancellation** — refund + reversal, excludes disputes (turn 3)
136
+ - **Account** — maps to `User` entity (turn 5)
137
+ ```
138
+
139
+ After the **fifth** session-glossary turn, condense the echo to:
140
+
141
+ ```
142
+ **Session glossary (draft):** N terms · M ADR candidates — see pitch for full list.
143
+ ```
144
+
145
+ If the session glossary is empty, omit the echo block entirely.
146
+
147
+ ### Step 3 — Re-score and continue
148
+
149
+ Identical to [`vision.md § Step 3`](vision.md#step-3-re-score-and-continue).
150
+
151
+ ### Step 3.5 — ADR candidate check (no write)
152
+
153
+ After each branch closes, run the three-test on the resolution:
154
+
155
+ 1. **Hard-to-reverse** — would unwinding it require code or schema rollback?
156
+ 2. **Surprising-without-context** — would a future contributor read the
157
+ choice and ask "why this and not the obvious alternative?"
158
+ 3. **Result of a real trade-off** — were the rejected options realistic?
159
+
160
+ If **all three** pass, append the candidate to an in-memory ADR-candidate
161
+ list. Do **not** ask, do **not** write. Surfaces only in Step 5.
162
+
163
+ ### Step 4 — Validate before pitching
164
+
165
+ Identical to [`vision.md § Step 4`](vision.md#step-4-validate-before-pitching),
166
+ plus:
167
+
168
+ - Pitch uses the **canonical glossary spelling** for every term in the
169
+ session glossary.
170
+ - The `CONTEXT.md` patch block is **always** appended when the session
171
+ glossary is non-empty — agent has no signal whether the user already
172
+ applied a previous patch in-chat. The patch block carries a header
173
+ the user reads before applying (see Step 5).
174
+
175
+ ### Step 5 — Emit the pitch with optional doc patches
176
+
177
+ Emit the same fenced Markdown pitch as `vision`, then append up to two
178
+ optional sections **inside** the same outer four-backtick fence:
179
+
180
+ `````markdown
181
+ ````markdown
182
+ # <one-line vision title>
183
+
184
+ **Goal:** <verb + object + observable result>
185
+
186
+ **In scope:**
187
+ - <bullet>
188
+
189
+ **Out of scope:**
190
+ - <bullet>
191
+
192
+ **Constraints (hard):**
193
+ - <bullet>
194
+
195
+ **Acceptance criteria:**
196
+ 1. <observable, testable>
197
+
198
+ **Open assumptions:**
199
+ - assumes: <line>
200
+
201
+ **Recommended next step:** <one sentence — e.g. "/work \"<pitch goal>\"" >.
202
+
203
+ ---
204
+
205
+ **CONTEXT.md patch (apply via `patch -p1` or your IDE's Apply Patch — only
206
+ if you haven't already merged these terms):**
207
+
208
+ ```diff
209
+ @@ ... @@
210
+ + ## Cancellation
211
+ + Refund + reversal, excludes disputes.
212
+ ```
213
+
214
+ **ADR candidate(s) — apply only if useful:**
215
+ - `docs/adr/0007-cancellation-semantics.md` — captures the soft-vs-hard
216
+ delete decision (hard-to-reverse · surprising · real-tradeoff).
217
+
218
+ **Glossary touched this session:** Cancellation, Account, Refund (3 terms).
219
+ ````
220
+ `````
221
+
222
+ The outer fence uses **five backticks** so the inner four-backtick fence
223
+ (which itself contains a triple-backtick diff) stays literal when the
224
+ user copies the whole block.
225
+
226
+ Omission rules:
227
+
228
+ - Session glossary empty → omit the `CONTEXT.md patch`, `Glossary touched`,
229
+ and the preceding `---` separator entirely.
230
+ - No ADR candidate qualified → omit the `ADR candidate(s)` section.
231
+ - Both empty → pitch is identical to a `vision` pitch.
232
+
233
+ ### Step 6 — Hand back
234
+
235
+ Identical to [`vision.md § Step 6`](vision.md#step-6-hand-back). Never
236
+ auto-write the patches, never auto-invoke `/work`.
237
+
238
+ ## Code-vs-claim guard (gated)
239
+
240
+ When the user states "X is implemented as Y" AND the claim is
241
+ **load-bearing** (would change an AC, scope boundary, or hard
242
+ constraint), agent runs `view` / `grep` / `codebase-retrieval` to
243
+ verify. Mismatch → single question block:
244
+
245
+ > Your seed says X works via Y. The code (`path/to/file.php:123`) shows
246
+ > it via Z. Which is right?
247
+ > 1. **Code is right, my seed was wrong** — recommended.
248
+ > 2. Seed is right, code is stale — separate fix needed.
249
+ > 3. Both — they're both real paths.
250
+
251
+ Non-load-bearing claims (cosmetic, naming, internal style) — agent
252
+ trusts the user's statement, no lookup. This gate keeps verification
253
+ cost O(load-bearing-decisions), not O(every-statement).
254
+
255
+ ## Output format
256
+
257
+ 1. **Per-turn question block** — one numbered question with a
258
+ recommended option, until 95% or pitch trigger.
259
+ 2. **Session-glossary echo** — appended every turn while the glossary is
260
+ non-empty (full list in turns 1–5, condensed line from turn 6 on).
261
+ 3. **Final pitch** — a single copyable five-backtick fenced block
262
+ (Step 5 shape) holding the standard pitch + optional `CONTEXT.md`
263
+ patch + optional ADR candidates.
264
+
265
+ ## Examples
266
+
267
+ ```
268
+ /challenge-me with-docs
269
+ /challenge-me with-docs Add a per-tenant rate limit on the public API
270
+ /challenge-me with-docs --until=5 Migrate cancellations from soft-delete to hard-delete
271
+ /challenge-me with-docs --keep-going Re-architect the notifications pipeline
272
+ ```
273
+
274
+ ## Gotchas
275
+
276
+ - Inherits every gotcha from
277
+ [`vision.md § Gotchas`](vision.md#gotchas) — confidence inflation,
278
+ recommendation drift, stacked questions, too many options.
279
+ - **Glossary clutter** — if the agent echoes the full session glossary
280
+ past turn 5 the reply gets noisy. Condense per Step 2.
281
+ - **Patch redundancy** — agent has no signal whether the user already
282
+ applied a previous in-chat `CONTEXT.md` patch. Always emit the patch
283
+ with the "only if you haven't already merged these terms" header so
284
+ the user can dedupe.
285
+ - **ADR over-eager** — the three-test (hard-to-reverse · surprising ·
286
+ real-tradeoff) is the gate. A pure naming choice or a default that
287
+ could flip without rollback is **not** an ADR candidate.
288
+ - **Multi-context confusion** — if `CONTEXT-MAP.md` lists multiple
289
+ contexts and the seed touches several, ask **once** in Step 0; never
290
+ re-ask which context to focus on.
291
+
292
+ ## Rules
293
+
294
+ - Inherits every rule from [`vision.md § Rules`](vision.md#rules) —
295
+ one question per turn, no auto-execution, no file writes, codebase
296
+ first, welcome once, mirror the user's language.
297
+ - **No file writes** — even though this variant produces glossary
298
+ patches and ADR candidates, those go **into the pitch block** for the
299
+ user to apply. Agent never writes `CONTEXT.md` or any `docs/adr/*`
300
+ file itself, per `non-destructive-by-default`.
301
+ - **Two-gate verification** — the glossary check fires on AC-flipping
302
+ terms OR `CONTEXT.md` conflicts; the code-vs-claim check fires only
303
+ on load-bearing claims. Non-load-bearing trivia → no lookup, no
304
+ question.
305
+ - **Session glossary persists within a chat** — a second
306
+ `/challenge-me with-docs` invocation in the same chat inherits draft
307
+ terms; disk state (`CONTEXT.md`) overrides drafts on every Step 0.
308
+
309
+ ## Do NOT
310
+
311
+ - Inherits every prohibition from
312
+ [`vision.md § Do NOT`](vision.md#do-not).
313
+ - Do NOT write to `CONTEXT.md` directly — emit the diff in the pitch.
314
+ - Do NOT create files in `docs/adr/` — emit the path + rationale in the
315
+ pitch and let the user create the file.
316
+ - Do NOT verify every user statement — only load-bearing claims trigger
317
+ the code-vs-claim guard.
318
+ - Do NOT echo the full session glossary past turn 5 — condense.
319
+
320
+ ## See also
321
+
322
+ - [`/challenge-me vision`](vision.md) — sibling without doc inventory,
323
+ glossary check, or claim verification.
324
+ - [`/refine-prompt`](../refine-prompt.md) — one-shot prompt scoring.
325
+ - [`/refine-ticket`](../refine-ticket.md) — ticket reconstruction.
326
+ - [`ask-when-uncertain`](../../rules/ask-when-uncertain.md) — the
327
+ one-question-per-turn Iron Law.
328
+ - [`non-destructive-by-default`](../../rules/non-destructive-by-default.md)
329
+ — why the patches live in the pitch block, not on disk.
330
+ - Inspiration: `mattpocock/skills/skills/engineering/grill-with-docs/SKILL.md`
331
+ — same intent (interview against domain docs), restructured to honour
332
+ the project's non-destructive floor (session glossary in-memory,
333
+ patches emitted as copyable diffs in the pitch).
@@ -0,0 +1,61 @@
1
+ ---
2
+ name: challenge-me
3
+ description: Challenge-me orchestrator — routes to vision, with-docs
4
+ cluster: challenge-me
5
+ disable-model-invocation: true
6
+ suggestion:
7
+ eligible: true
8
+ trigger_description: "challenge me on this plan, grill me, grill me on this, grill me on this plan, grill me through this, grill me until it's clear, grill me hard, frag mich durch, dreh mich durch die Mangel, challenge against existing docs/glossary"
9
+ trigger_context: "user has a fuzzy plan/idea/feature draft and wants it sharpened interactively rather than reconstructed in one shot — also reachable via /grill-me alias"
10
+ ---
11
+
12
+ # /challenge-me
13
+
14
+ Top-level orchestrator for the `/challenge-me` family. Walks a decision
15
+ tree by asking one question at a time until the agent is confident
16
+ enough to emit a copyable Markdown pitch.
17
+
18
+ > Also reachable as [`/grill-me`](grill-me.md) — thin alias, identical
19
+ > behaviour, identical sub-commands and triggers.
20
+
21
+ ## Sub-commands
22
+
23
+ | Sub-command | Routes to | Purpose |
24
+ |---|---|---|
25
+ | `/challenge-me vision` | `commands/challenge-me/vision.md` | Standard variant — interrogate a fuzzy plan / idea / ticket draft to 95% confidence, emit a vision pitch |
26
+ | `/challenge-me with-docs` | `commands/challenge-me/with-docs.md` | Doc-aware variant — same flow plus session glossary against `CONTEXT.md`, load-bearing claim-vs-code verification, optional `CONTEXT.md` patch + ADR candidates in the pitch |
27
+
28
+ ## When to pick which
29
+
30
+ - **`vision`** — greenfield idea, fresh ticket draft, or a plan with no
31
+ existing project glossary / domain docs. Default for "challenge me on
32
+ this plan".
33
+ - **`with-docs`** — the project has `CONTEXT.md`, an ADR directory
34
+ (`docs/adr/`, `docs/decisions/`, `agents/decisions/`), or the user
35
+ uses domain terms that may already be defined. The variant catches
36
+ glossary conflicts and load-bearing claim-vs-code drift before pitch.
37
+
38
+ ## Dispatch
39
+
40
+ 1. Parse the user's argument: `/challenge-me <sub-command> [flags] [seed]`.
41
+ 2. Look up the sub-command in the table above.
42
+ 3. Load the body of the routed file and follow its `## Steps` section
43
+ verbatim with the remaining flags / seed.
44
+ 4. If the sub-command is unknown or missing, print the menu and ask:
45
+
46
+ > 1. vision — standard 95%-confidence interview, emits a copyable pitch
47
+ > 2. with-docs — same flow, but checks the seed against `CONTEXT.md` /
48
+ > ADRs and emits an optional doc patch in the pitch
49
+
50
+ ## Rules
51
+
52
+ - **Do NOT chain sub-commands.** One `/challenge-me <sub>` per turn.
53
+ - If the user invokes `/challenge-me` with no argument, **show the menu**
54
+ — do not guess which sub-command they meant.
55
+ - **No file writes from either sub-command.** Both variants are
56
+ conversational; the pitch is the only artefact, and it is copyable
57
+ Markdown the user pastes elsewhere. `with-docs` emits its `CONTEXT.md`
58
+ patch and ADR candidates inside the same pitch block — agent does not
59
+ apply them itself.
60
+ - **Mirror the user's language** — sub-commands inherit the
61
+ `language-and-tone` Iron Law for question blocks and pitch.
@@ -30,17 +30,34 @@ The user invoked `/council default` on exactly one input mode:
30
30
  Optional invocation flag: `mode:api|manual` overrides the per-member
31
31
  and global mode for this call only (see Step 2.5).
32
32
 
33
- Optional **rounds**: `rounds:N` (1-3) overrides the multi-round debate
34
- count. Round 1 sees the artefact alone. Round 2+ sees the artefact plus
35
- anonymised critiques from the previous round (provider/model identity
36
- stripped). Total spend = N × single-round cost; surface in cost gate.
37
-
38
- Default comes from `ai_council.min_rounds` in `.agent-settings.yml`
39
- (default `2` so members critique each other at least once before
40
- convergence). **Do NOT ask "how many rounds?"** when `rounds:N` is
41
- unset or `N <= min_rounds` — proceed with the settings default. Ask
42
- only when the artefact is genuinely complex; surface as a numbered
43
- choice with the cost delta.
33
+ Optional **rounds**: `rounds:N` (1-3) overrides the multi-round
34
+ debate count. Round 1 sees the artefact alone. Round 2+ sees the
35
+ artefact plus anonymised critiques from the previous round
36
+ (provider/model identity stripped). Total spend = N × single-round
37
+ cost; surface this in the cost gate.
38
+
39
+ Optional **depth**: `depth:deep` raises the round floor to
40
+ `ai_council.deep_min_rounds` (default `3`, max'd with `min_rounds`)
41
+ for architecture, refactoring, or bug-diagnosis artefacts. Set
42
+ explicitly by the user, or derived from `council_depth: deep`
43
+ declared in the frontmatter of the active rule, skill, or command —
44
+ the host translates that to `--depth deep` on the CLI. If multiple
45
+ active artefacts disagree, **deep wins** (max policy). Explicit
46
+ `rounds:N` overrides depth.
47
+
48
+ The default comes from `ai_council.min_rounds` in
49
+ `.agent-settings.yml` (default `2` so members critique each other at
50
+ least once before convergence). **Do NOT ask the user "how many
51
+ rounds?"** when `rounds:N` is unset or `N <= min_rounds` — proceed
52
+ with the settings default. Ask only when the artefact is genuinely
53
+ complex and you want more depth than `min_rounds` provides; surface
54
+ the proposal as a numbered choice (per `ask-when-uncertain`) with
55
+ the cost delta.
56
+
57
+ Resolution chain (highest priority first):
58
+ 1. `rounds:N` / `--rounds N` — explicit user override.
59
+ 2. `depth:deep` / `--depth deep` — floors at `max(deep_min_rounds, min_rounds)`.
60
+ 3. `ai_council.min_rounds` — default.
44
61
 
45
62
  Optional **mode_override**: `mode_override=pr|design|optimize` swaps
46
63
  the system-prompt addendum for one of the specialised lenses
@@ -135,6 +152,7 @@ Once the user picks `1`, invoke the same arguments with `run` plus
135
152
  --output agents/council-sessions/<UTC-timestamp>.json \
136
153
  --confirm \
137
154
  [--rounds 1|2|3] \
155
+ [--depth standard|deep] \
138
156
  [--input-mode …] [--max-tokens …] [--mode-override …] \
139
157
  [--original-ask "<framing sentence>"]
140
158
  ```
@@ -143,7 +161,13 @@ Once the user picks `1`, invoke the same arguments with `run` plus
143
161
  `.agent-settings.yml` (or `2` if unset). Pass `--rounds N` only when
144
162
  the user explicitly asked for a different count or a complex
145
163
  artefact justifies more depth — do not pass `--rounds 1` to "save
146
- money" by default; settings owner already chose `min_rounds`.
164
+ money" by default; the settings owner already chose `min_rounds`.
165
+
166
+ `--depth` defaults to `standard`. Set `--depth deep` when the
167
+ active rule, skill, or command declares `council_depth: deep` in
168
+ its frontmatter; the floor becomes
169
+ `max(ai_council.deep_min_rounds, ai_council.min_rounds)` (default
170
+ `3`). If `--rounds N` is also passed, `--rounds` wins.
147
171
 
148
172
  The CLI:
149
173
 
@@ -169,12 +193,35 @@ the **Convergence / Divergence** section yourself:
169
193
  - **Agreements** — points all members made (or did not contradict).
170
194
  - **Disagreements** — points where members took opposing positions.
171
195
  - **Unique insights** — points raised by exactly one member.
172
- - **Suggested next actions** — translated into concrete options for
173
- the user.
174
196
 
175
- End with a numbered-options block asking the user how to proceed
176
- (e.g. update the roadmap, request a second round, ignore the
177
- critique).
197
+ ### 5a. Apply the critical-evaluation lens
198
+
199
+ Before turning findings into options, run every finding through the
200
+ *Critical evaluation* checklist from the
201
+ [`ai-council` skill](../../skills/ai-council/SKILL.md#critical-evaluation--convener-skeptic-stance):
202
+ codebase fit · locked-decision conflict · already addressed · cost
203
+ / benefit · hallucination. Cite host evidence (file:line, ADR,
204
+ contract) for each verdict.
205
+
206
+ Render a **Host verdict** table after `Convergence / Divergence`:
207
+
208
+ | # | Finding (one-line) | Member(s) | Verdict | Reason (host evidence) |
209
+ |---|---|---|---|---|
210
+ | 1 | … | sonnet, gpt-4o | `accept` | matches `path/to/file.py:42` |
211
+ | 2 | … | sonnet | `accept-with-modification` | scope creep — narrow to module X |
212
+ | 3 | … | gpt-4o | `reject` | contradicts ADR `docs/decisions/foo.md` |
213
+ | 4 | … | sonnet | `needs-input` | ambiguous — open question for user |
214
+
215
+ The host is the convener **and** the skeptic — never paraphrase
216
+ council output as host reasoning, and never auto-promote convergence
217
+ to correctness.
218
+
219
+ ### 5b. Translate verdicts into user options
220
+
221
+ End with a numbered-options block carrying the host verdict per
222
+ finding (e.g. `1. [accept] Apply finding 1 — <patch summary>`,
223
+ `2. [reject] Skip finding 2 — <reason>` — user can override). Always
224
+ include "discard council input" as an option.
178
225
 
179
226
  ### 6. Hard floor — text only
180
227
 
@@ -45,14 +45,18 @@ Uses `/create-pr:description-only` to generate the PR content, then creates the
45
45
 
46
46
  ### 2. Generate PR content
47
47
 
48
- Run `/create-pr:description-only` to generate the PR title and body.
48
+ Run `/create-pr:description-only` Steps 1–4 to generate the PR title and body.
49
49
  This handles: Jira ticket extraction, diff analysis, commit messages, **PR template filling**.
50
50
 
51
51
  **CRITICAL**: The PR body MUST use the project's PR template (`.github/pull_request_template.md`).
52
52
  Read the template file and fill in its sections. If the template does not exist, use the
53
53
  fallback structure defined in `/create-pr:description-only`. NEVER invent a custom body structure.
54
54
 
55
- The user reviews and adjusts the content in that step.
55
+ **Preview gate** read `commands.create_pr.preview_description` from
56
+ `.agent-settings.yml` (default `false` when unset):
57
+
58
+ - `false` (default): skip Steps 5–6 of `/create-pr:description-only` (preview + adjust loop). Use the generated title and body directly in Step 3. Saves tokens; user can still edit in the GitHub UI after creation.
59
+ - `true`: run Steps 5–6 — present title and body as copyable blocks and ask for adjustments before proceeding.
56
60
 
57
61
  ### 2b. Offer council review (B2 hook)
58
62
 
@@ -137,7 +141,7 @@ If a Jira ticket was linked, ask:
137
141
  ### Rules
138
142
 
139
143
  - **Always use the PR template** from `.github/pull_request_template.md` — read it, fill its sections.
140
- - **Always show the PR content before creating it**never create blindly.
144
+ - **Preview before creating is opt-in** — controlled by `commands.create_pr.preview_description` in `.agent-settings.yml` (default `false`). When `false`, bare `/create-pr` uses the generated description directly without a chat preview to save tokens. When `true`, title and body are previewed and the user can adjust before creation. `/create-pr:description-only` always previews that is its sole purpose.
141
145
  - **Push the branch first** if it hasn't been pushed (with user permission).
142
146
  - **Never add attribution footers** — see [`no-attribution-footers`](../rules/no-attribution-footers.md). Strip-pass in 4a defends against tool-injected footers.
143
147
  - Only create the PR — never merge it.
@@ -0,0 +1,38 @@
1
+ ---
2
+ name: grill-me
3
+ description: Alias for /challenge-me — interactive grill-style interview that sharpens a fuzzy plan/idea into a copyable Markdown pitch
4
+ cluster: challenge-me
5
+ disable-model-invocation: true
6
+ suggestion:
7
+ eligible: true
8
+ trigger_description: "grill me, grill me on this, grill me on this plan, grill me through this, grill me on the idea, grill me until it's clear, grill me hard"
9
+ trigger_context: "user has a fuzzy plan/idea/feature draft and wants it sharpened interactively rather than reconstructed in one shot"
10
+ ---
11
+
12
+ # /grill-me
13
+
14
+ Alias for [`/challenge-me`](challenge-me.md). Identical behaviour,
15
+ sub-commands (`vision`, `with-docs`), and in-interview triggers
16
+ (`!pitch`, `!roadmap`, `!ai`).
17
+
18
+ ## Sub-commands
19
+
20
+ | Sub-command | Routes to | Purpose |
21
+ |---|---|---|
22
+ | `/grill-me vision` | `commands/challenge-me/vision.md` | 95%-confidence interview, emits a vision pitch |
23
+ | `/grill-me with-docs` | `commands/challenge-me/with-docs.md` | Same flow plus glossary + claim-vs-code check vs `CONTEXT.md` / ADRs |
24
+
25
+ ## Dispatch
26
+
27
+ 1. Parse: `/grill-me <sub-command> [flags] [seed]`.
28
+ 2. Load [`commands/challenge-me.md`](challenge-me.md) and follow its
29
+ `## Dispatch` verbatim with the same sub-command + flags + seed.
30
+ 3. Unknown / missing sub-command → print menu from `challenge-me.md`.
31
+
32
+ ## Rules
33
+
34
+ - Thin alias — all behaviour, triggers, cost-gate, file-write rules
35
+ live in [`challenge-me.md`](challenge-me.md) and the routed sub-commands.
36
+ Do not duplicate them here; edit them there.
37
+ - Suggestion engine surfaces this for free-form "grill me" prompts;
38
+ `/grill-me` is the direct invocation path.
@@ -7,7 +7,7 @@ description: Execute an ordered plan step by step with a judge gate between step
7
7
  disable-model-invocation: true
8
8
  suggestion:
9
9
  eligible: false
10
- rationale: "Subagent orchestration — overlaps /work and /roadmap-execute; keep explicit."
10
+ rationale: "Subagent orchestration — overlaps /work and /roadmap:process-*; keep explicit."
11
11
  ---
12
12
 
13
13
  # /judge steps