@event4u/agent-config 1.21.0 → 1.22.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.agent-src/commands/bug-fix.md +1 -0
- package/.agent-src/commands/bug-investigate.md +1 -0
- package/.agent-src/commands/challenge-me/vision.md +348 -0
- package/.agent-src/commands/challenge-me/with-docs.md +333 -0
- package/.agent-src/commands/challenge-me.md +61 -0
- package/.agent-src/commands/council/default.md +64 -17
- package/.agent-src/commands/create-pr.md +7 -3
- package/.agent-src/commands/grill-me.md +38 -0
- package/.agent-src/commands/judge/steps.md +1 -1
- package/.agent-src/commands/roadmap/ai-council.md +183 -0
- package/.agent-src/commands/roadmap/create.md +6 -1
- package/.agent-src/commands/roadmap/process-full.md +58 -0
- package/.agent-src/commands/roadmap/process-phase.md +69 -0
- package/.agent-src/commands/roadmap/process-step.md +57 -0
- package/.agent-src/commands/roadmap.md +44 -16
- package/.agent-src/commands/threat-model.md +1 -0
- package/.agent-src/contexts/augment-infrastructure.md +1 -1
- package/.agent-src/contexts/communication/rules-auto/slash-command-routing-policy-mechanics.md +53 -18
- package/.agent-src/contexts/execution/roadmap-process-loop.md +125 -0
- package/.agent-src/contexts/skills-and-commands.md +1 -1
- package/.agent-src/rules/improve-before-implement.md +1 -0
- package/.agent-src/rules/invite-challenge.md +71 -0
- package/.agent-src/skills/adversarial-review/SKILL.md +1 -0
- package/.agent-src/skills/ai-council/SKILL.md +132 -8
- package/.agent-src/skills/bug-analyzer/SKILL.md +1 -0
- package/.agent-src/skills/roadmap-management/SKILL.md +7 -7
- package/.agent-src/skills/systematic-debugging/SKILL.md +22 -2
- package/.agent-src/skills/technical-specification/SKILL.md +58 -1
- package/.agent-src/skills/threat-modeling/SKILL.md +1 -0
- package/.agent-src/templates/agent-settings.md +14 -3
- package/.agent-src/templates/command.md +17 -1
- package/.agent-src/templates/roadmaps.md +10 -2
- package/.agent-src/templates/rule.md +2 -0
- package/.agent-src/templates/skill.md +17 -0
- package/.claude-plugin/marketplace.json +9 -2
- package/AGENTS.md +2 -2
- package/CHANGELOG.md +38 -0
- package/README.md +1 -1
- package/config/agent-settings.template.yml +22 -0
- package/docs/architecture.md +2 -2
- package/docs/contracts/command-clusters.md +45 -1
- package/docs/decisions/ADR-003-flat-cluster-subs-and-colon-syntax.md +126 -0
- package/docs/getting-started.md +1 -1
- package/docs/guidelines/agent-infra/naming.md +1 -1
- package/package.json +1 -1
- package/scripts/_phase2_shim_helper.py +1 -1
- package/scripts/council_cli.py +127 -10
- package/scripts/migrate_command_suggestions.py +2 -2
- package/scripts/schemas/command.schema.json +5 -0
- package/scripts/schemas/rule.schema.json +5 -0
- package/scripts/schemas/skill.schema.json +5 -0
- package/scripts/skill_linter.py +1 -1
- package/.agent-src/commands/roadmap/execute.md +0 -109
|
@@ -0,0 +1,333 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: challenge-me:with-docs
|
|
3
|
+
cluster: challenge-me
|
|
4
|
+
sub: with-docs
|
|
5
|
+
description: "Doc-aware /challenge-me — 95%-confidence interview with session glossary vs CONTEXT.md, load-bearing claim-vs-code verification, optional CONTEXT.md patch + ADR candidates in the pitch."
|
|
6
|
+
disable-model-invocation: true
|
|
7
|
+
suggestion:
|
|
8
|
+
eligible: true
|
|
9
|
+
trigger_description: "challenge me against the docs, grill me with our context, grill me against the docs, grill me against CONTEXT.md, grill me with the project context, frag mich durch und prüf gegen CONTEXT.md, challenge plan with domain docs"
|
|
10
|
+
trigger_context: "user wants the seed challenged against existing project glossary, CONTEXT.md, or ADRs — not a greenfield interview"
|
|
11
|
+
---
|
|
12
|
+
|
|
13
|
+
# /challenge-me with-docs
|
|
14
|
+
|
|
15
|
+
> Same one-question-at-a-time interview as `/challenge-me vision`, plus:
|
|
16
|
+
> a **session glossary** against `CONTEXT.md`, **load-bearing claim-vs-code
|
|
17
|
+
> verification**, and an optional `CONTEXT.md` **patch** + **ADR candidates**
|
|
18
|
+
> appended to the final pitch. Nothing is written to disk — the pitch is
|
|
19
|
+
> the only artefact.
|
|
20
|
+
|
|
21
|
+
## Welcome
|
|
22
|
+
|
|
23
|
+
If the user invokes `/challenge-me with-docs` with no body, render once:
|
|
24
|
+
|
|
25
|
+
```
|
|
26
|
+
Drop the seed and I'll grill it against your project docs. Per turn:
|
|
27
|
+
one question, recommended answer. I'll surface glossary clashes vs
|
|
28
|
+
CONTEXT.md and verify load-bearing claims against the code. At the end
|
|
29
|
+
you get a pitch — plus an optional CONTEXT.md patch and ADR candidates
|
|
30
|
+
to copy / apply yourself.
|
|
31
|
+
|
|
32
|
+
Stop early any time by typing `!pitch` (or `!Vision pitchen`).
|
|
33
|
+
```
|
|
34
|
+
|
|
35
|
+
Skip the welcome if the user invokes `/challenge-me with-docs <seed>`
|
|
36
|
+
directly.
|
|
37
|
+
|
|
38
|
+
## When to use this instead of `vision`
|
|
39
|
+
|
|
40
|
+
| Condition | Pick |
|
|
41
|
+
|---|---|
|
|
42
|
+
| Project has no `CONTEXT.md` and no ADR dir | `vision` |
|
|
43
|
+
| Seed uses domain terms that may already be defined elsewhere | `with-docs` |
|
|
44
|
+
| User says "check against our docs" / "make sure it matches CONTEXT.md" | `with-docs` |
|
|
45
|
+
| User says "make sure this is consistent with the codebase" | `with-docs` |
|
|
46
|
+
| Greenfield brainstorm, no existing constraints to honour | `vision` |
|
|
47
|
+
|
|
48
|
+
If neither file exists and the user invoked `with-docs` anyway, fall
|
|
49
|
+
back gracefully to `vision`-style flow and skip the doc inventory; note
|
|
50
|
+
the absence in a single line at the end of Step 0.
|
|
51
|
+
|
|
52
|
+
## Stop condition, flags, pitch trigger
|
|
53
|
+
|
|
54
|
+
Inherits **verbatim** from [`vision.md`](vision.md):
|
|
55
|
+
|
|
56
|
+
- Default stop = the four 95%-conditions (AND, not OR).
|
|
57
|
+
- Load-bearing test = answer changes goal / scope / hard constraint / AC.
|
|
58
|
+
- `--until=95%` (default) · `--until=N` · `--keep-going`.
|
|
59
|
+
- Pitch trigger: `!pitch`, `/pitch`, or whole reply `pitch`.
|
|
60
|
+
- Step 1 question-block shape, Step 4 pitch validation.
|
|
61
|
+
|
|
62
|
+
The deltas below replace / extend Step 0, Step 1, Step 2, Step 4 and
|
|
63
|
+
Step 5. Steps 3 and 6 are unchanged.
|
|
64
|
+
|
|
65
|
+
## Steps
|
|
66
|
+
|
|
67
|
+
### Step 0 — Inspect (read-only doc inventory + codebase)
|
|
68
|
+
|
|
69
|
+
```
|
|
70
|
+
EXPLORE THE DOCS AND THE CODEBASE BEFORE THE FIRST QUESTION.
|
|
71
|
+
NEVER WRITE TO DISK IN STEP 0. NEVER ASK WHAT view / grep / codebase-retrieval ANSWERS.
|
|
72
|
+
```
|
|
73
|
+
|
|
74
|
+
1. **Read the seed** (same as `vision`).
|
|
75
|
+
2. **Doc inventory** — read-only:
|
|
76
|
+
- `view CONTEXT.md` if it exists; init empty **session glossary** in
|
|
77
|
+
agent state (in-memory only).
|
|
78
|
+
- `view CONTEXT-MAP.md` if it exists. If the seed touches > 1 listed
|
|
79
|
+
context, **ask once** (numbered options listing each context) which
|
|
80
|
+
to focus on; record the choice, do not re-ask.
|
|
81
|
+
- Autodetect ADR directory by listing `docs/adr/`, `docs/decisions/`,
|
|
82
|
+
`docs/architecture/decisions/`, `agents/decisions/`. Pick the
|
|
83
|
+
**most-recently-modified** non-empty dir; if none exist, default to
|
|
84
|
+
`docs/adr/` for the patch output (do **not** create the dir).
|
|
85
|
+
- If `agent state` already holds a session glossary from a prior
|
|
86
|
+
`/challenge-me with-docs` invocation **in this same chat**, load
|
|
87
|
+
it. Then re-read `CONTEXT.md` — any term in `CONTEXT.md` overrides
|
|
88
|
+
the session glossary (disk wins over draft). Emit a delta line if
|
|
89
|
+
non-zero: `Loaded N draft terms; CONTEXT.md already has X — carrying
|
|
90
|
+
forward the rest as drafts.`
|
|
91
|
+
3. **Codebase lookup** — same as `vision` Step 0.2 (existing routes,
|
|
92
|
+
model fields, conventions, feature flags).
|
|
93
|
+
4. **List the open dimensions** internally — same as `vision` Step 0.3.
|
|
94
|
+
|
|
95
|
+
### Step 1 — Glossary-aware questioning
|
|
96
|
+
|
|
97
|
+
Every question block runs an extra check **before** emit:
|
|
98
|
+
|
|
99
|
+
- **Glossary conflict** — does the seed (or a prior turn) use a term
|
|
100
|
+
that conflicts with the existing `CONTEXT.md` glossary, including a
|
|
101
|
+
case-insensitive match? → first question of the session is the
|
|
102
|
+
glossary disambiguation:
|
|
103
|
+
|
|
104
|
+
> Your `CONTEXT.md` defines **Account** as the billing entity. Your
|
|
105
|
+
> seed uses "account" for the logged-in person. Which is it?
|
|
106
|
+
> 1. **Match CONTEXT.md** — rename "account" in the plan to "user".
|
|
107
|
+
> 2. Override CONTEXT.md — the plan introduces a new meaning; we patch
|
|
108
|
+
> the glossary at pitch time.
|
|
109
|
+
> 3. Skip / not relevant.
|
|
110
|
+
|
|
111
|
+
- **Fuzzy / overloaded term** — propose a canonical term as option 1.
|
|
112
|
+
|
|
113
|
+
- **No glossary present** → skip the check; behave like `vision`.
|
|
114
|
+
|
|
115
|
+
The glossary check is itself **load-bearing only when the term flips an
|
|
116
|
+
AC or scope boundary OR when it conflicts with an existing CONTEXT.md
|
|
117
|
+
entry**. Pure naming preference with no conflict is closed silently
|
|
118
|
+
(same load-bearing test as `vision`, with the conflict-extension).
|
|
119
|
+
|
|
120
|
+
Otherwise the question-block shape, recommended-option rule, 3–4 option
|
|
121
|
+
cap, and `Why it matters` requirement are identical to
|
|
122
|
+
[`vision.md § Step 1`](vision.md#step-1-ask-one-question-per-turn).
|
|
123
|
+
|
|
124
|
+
### Step 2 — Apply the answer + session glossary (in-memory)
|
|
125
|
+
|
|
126
|
+
Adopt-the-answer rules are identical to
|
|
127
|
+
[`vision.md § Step 2`](vision.md#step-2-apply-the-answer).
|
|
128
|
+
|
|
129
|
+
When a term is resolved, write to **session glossary** (agent state
|
|
130
|
+
only). **Never write to disk in this step.** Every turn's reply ends
|
|
131
|
+
with a compact echo block when the glossary is non-empty:
|
|
132
|
+
|
|
133
|
+
```
|
|
134
|
+
**Session glossary (draft, not yet written):**
|
|
135
|
+
- **Cancellation** — refund + reversal, excludes disputes (turn 3)
|
|
136
|
+
- **Account** — maps to `User` entity (turn 5)
|
|
137
|
+
```
|
|
138
|
+
|
|
139
|
+
After the **fifth** session-glossary turn, condense the echo to:
|
|
140
|
+
|
|
141
|
+
```
|
|
142
|
+
**Session glossary (draft):** N terms · M ADR candidates — see pitch for full list.
|
|
143
|
+
```
|
|
144
|
+
|
|
145
|
+
If the session glossary is empty, omit the echo block entirely.
|
|
146
|
+
|
|
147
|
+
### Step 3 — Re-score and continue
|
|
148
|
+
|
|
149
|
+
Identical to [`vision.md § Step 3`](vision.md#step-3-re-score-and-continue).
|
|
150
|
+
|
|
151
|
+
### Step 3.5 — ADR candidate check (no write)
|
|
152
|
+
|
|
153
|
+
After each branch closes, run the three-test on the resolution:
|
|
154
|
+
|
|
155
|
+
1. **Hard-to-reverse** — would unwinding it require code or schema rollback?
|
|
156
|
+
2. **Surprising-without-context** — would a future contributor read the
|
|
157
|
+
choice and ask "why this and not the obvious alternative?"
|
|
158
|
+
3. **Result of a real trade-off** — were the rejected options realistic?
|
|
159
|
+
|
|
160
|
+
If **all three** pass, append the candidate to an in-memory ADR-candidate
|
|
161
|
+
list. Do **not** ask, do **not** write. Surfaces only in Step 5.
|
|
162
|
+
|
|
163
|
+
### Step 4 — Validate before pitching
|
|
164
|
+
|
|
165
|
+
Identical to [`vision.md § Step 4`](vision.md#step-4-validate-before-pitching),
|
|
166
|
+
plus:
|
|
167
|
+
|
|
168
|
+
- Pitch uses the **canonical glossary spelling** for every term in the
|
|
169
|
+
session glossary.
|
|
170
|
+
- The `CONTEXT.md` patch block is **always** appended when the session
|
|
171
|
+
glossary is non-empty — agent has no signal whether the user already
|
|
172
|
+
applied a previous patch in-chat. The patch block carries a header
|
|
173
|
+
the user reads before applying (see Step 5).
|
|
174
|
+
|
|
175
|
+
### Step 5 — Emit the pitch with optional doc patches
|
|
176
|
+
|
|
177
|
+
Emit the same fenced Markdown pitch as `vision`, then append up to two
|
|
178
|
+
optional sections **inside** the same outer four-backtick fence:
|
|
179
|
+
|
|
180
|
+
`````markdown
|
|
181
|
+
````markdown
|
|
182
|
+
# <one-line vision title>
|
|
183
|
+
|
|
184
|
+
**Goal:** <verb + object + observable result>
|
|
185
|
+
|
|
186
|
+
**In scope:**
|
|
187
|
+
- <bullet>
|
|
188
|
+
|
|
189
|
+
**Out of scope:**
|
|
190
|
+
- <bullet>
|
|
191
|
+
|
|
192
|
+
**Constraints (hard):**
|
|
193
|
+
- <bullet>
|
|
194
|
+
|
|
195
|
+
**Acceptance criteria:**
|
|
196
|
+
1. <observable, testable>
|
|
197
|
+
|
|
198
|
+
**Open assumptions:**
|
|
199
|
+
- assumes: <line>
|
|
200
|
+
|
|
201
|
+
**Recommended next step:** <one sentence — e.g. "/work \"<pitch goal>\"" >.
|
|
202
|
+
|
|
203
|
+
---
|
|
204
|
+
|
|
205
|
+
**CONTEXT.md patch (apply via `patch -p1` or your IDE's Apply Patch — only
|
|
206
|
+
if you haven't already merged these terms):**
|
|
207
|
+
|
|
208
|
+
```diff
|
|
209
|
+
@@ ... @@
|
|
210
|
+
+ ## Cancellation
|
|
211
|
+
+ Refund + reversal, excludes disputes.
|
|
212
|
+
```
|
|
213
|
+
|
|
214
|
+
**ADR candidate(s) — apply only if useful:**
|
|
215
|
+
- `docs/adr/0007-cancellation-semantics.md` — captures the soft-vs-hard
|
|
216
|
+
delete decision (hard-to-reverse · surprising · real-tradeoff).
|
|
217
|
+
|
|
218
|
+
**Glossary touched this session:** Cancellation, Account, Refund (3 terms).
|
|
219
|
+
````
|
|
220
|
+
`````
|
|
221
|
+
|
|
222
|
+
The outer fence uses **five backticks** so the inner four-backtick fence
|
|
223
|
+
(which itself contains a triple-backtick diff) stays literal when the
|
|
224
|
+
user copies the whole block.
|
|
225
|
+
|
|
226
|
+
Omission rules:
|
|
227
|
+
|
|
228
|
+
- Session glossary empty → omit the `CONTEXT.md patch`, `Glossary touched`,
|
|
229
|
+
and the preceding `---` separator entirely.
|
|
230
|
+
- No ADR candidate qualified → omit the `ADR candidate(s)` section.
|
|
231
|
+
- Both empty → pitch is identical to a `vision` pitch.
|
|
232
|
+
|
|
233
|
+
### Step 6 — Hand back
|
|
234
|
+
|
|
235
|
+
Identical to [`vision.md § Step 6`](vision.md#step-6-hand-back). Never
|
|
236
|
+
auto-write the patches, never auto-invoke `/work`.
|
|
237
|
+
|
|
238
|
+
## Code-vs-claim guard (gated)
|
|
239
|
+
|
|
240
|
+
When the user states "X is implemented as Y" AND the claim is
|
|
241
|
+
**load-bearing** (would change an AC, scope boundary, or hard
|
|
242
|
+
constraint), agent runs `view` / `grep` / `codebase-retrieval` to
|
|
243
|
+
verify. Mismatch → single question block:
|
|
244
|
+
|
|
245
|
+
> Your seed says X works via Y. The code (`path/to/file.php:123`) shows
|
|
246
|
+
> it via Z. Which is right?
|
|
247
|
+
> 1. **Code is right, my seed was wrong** — recommended.
|
|
248
|
+
> 2. Seed is right, code is stale — separate fix needed.
|
|
249
|
+
> 3. Both — they're both real paths.
|
|
250
|
+
|
|
251
|
+
Non-load-bearing claims (cosmetic, naming, internal style) — agent
|
|
252
|
+
trusts the user's statement, no lookup. This gate keeps verification
|
|
253
|
+
cost O(load-bearing-decisions), not O(every-statement).
|
|
254
|
+
|
|
255
|
+
## Output format
|
|
256
|
+
|
|
257
|
+
1. **Per-turn question block** — one numbered question with a
|
|
258
|
+
recommended option, until 95% or pitch trigger.
|
|
259
|
+
2. **Session-glossary echo** — appended every turn while the glossary is
|
|
260
|
+
non-empty (full list in turns 1–5, condensed line from turn 6 on).
|
|
261
|
+
3. **Final pitch** — a single copyable five-backtick fenced block
|
|
262
|
+
(Step 5 shape) holding the standard pitch + optional `CONTEXT.md`
|
|
263
|
+
patch + optional ADR candidates.
|
|
264
|
+
|
|
265
|
+
## Examples
|
|
266
|
+
|
|
267
|
+
```
|
|
268
|
+
/challenge-me with-docs
|
|
269
|
+
/challenge-me with-docs Add a per-tenant rate limit on the public API
|
|
270
|
+
/challenge-me with-docs --until=5 Migrate cancellations from soft-delete to hard-delete
|
|
271
|
+
/challenge-me with-docs --keep-going Re-architect the notifications pipeline
|
|
272
|
+
```
|
|
273
|
+
|
|
274
|
+
## Gotchas
|
|
275
|
+
|
|
276
|
+
- Inherits every gotcha from
|
|
277
|
+
[`vision.md § Gotchas`](vision.md#gotchas) — confidence inflation,
|
|
278
|
+
recommendation drift, stacked questions, too many options.
|
|
279
|
+
- **Glossary clutter** — if the agent echoes the full session glossary
|
|
280
|
+
past turn 5 the reply gets noisy. Condense per Step 2.
|
|
281
|
+
- **Patch redundancy** — agent has no signal whether the user already
|
|
282
|
+
applied a previous in-chat `CONTEXT.md` patch. Always emit the patch
|
|
283
|
+
with the "only if you haven't already merged these terms" header so
|
|
284
|
+
the user can dedupe.
|
|
285
|
+
- **ADR over-eager** — the three-test (hard-to-reverse · surprising ·
|
|
286
|
+
real-tradeoff) is the gate. A pure naming choice or a default that
|
|
287
|
+
could flip without rollback is **not** an ADR candidate.
|
|
288
|
+
- **Multi-context confusion** — if `CONTEXT-MAP.md` lists multiple
|
|
289
|
+
contexts and the seed touches several, ask **once** in Step 0; never
|
|
290
|
+
re-ask which context to focus on.
|
|
291
|
+
|
|
292
|
+
## Rules
|
|
293
|
+
|
|
294
|
+
- Inherits every rule from [`vision.md § Rules`](vision.md#rules) —
|
|
295
|
+
one question per turn, no auto-execution, no file writes, codebase
|
|
296
|
+
first, welcome once, mirror the user's language.
|
|
297
|
+
- **No file writes** — even though this variant produces glossary
|
|
298
|
+
patches and ADR candidates, those go **into the pitch block** for the
|
|
299
|
+
user to apply. Agent never writes `CONTEXT.md` or any `docs/adr/*`
|
|
300
|
+
file itself, per `non-destructive-by-default`.
|
|
301
|
+
- **Two-gate verification** — the glossary check fires on AC-flipping
|
|
302
|
+
terms OR `CONTEXT.md` conflicts; the code-vs-claim check fires only
|
|
303
|
+
on load-bearing claims. Non-load-bearing trivia → no lookup, no
|
|
304
|
+
question.
|
|
305
|
+
- **Session glossary persists within a chat** — a second
|
|
306
|
+
`/challenge-me with-docs` invocation in the same chat inherits draft
|
|
307
|
+
terms; disk state (`CONTEXT.md`) overrides drafts on every Step 0.
|
|
308
|
+
|
|
309
|
+
## Do NOT
|
|
310
|
+
|
|
311
|
+
- Inherits every prohibition from
|
|
312
|
+
[`vision.md § Do NOT`](vision.md#do-not).
|
|
313
|
+
- Do NOT write to `CONTEXT.md` directly — emit the diff in the pitch.
|
|
314
|
+
- Do NOT create files in `docs/adr/` — emit the path + rationale in the
|
|
315
|
+
pitch and let the user create the file.
|
|
316
|
+
- Do NOT verify every user statement — only load-bearing claims trigger
|
|
317
|
+
the code-vs-claim guard.
|
|
318
|
+
- Do NOT echo the full session glossary past turn 5 — condense.
|
|
319
|
+
|
|
320
|
+
## See also
|
|
321
|
+
|
|
322
|
+
- [`/challenge-me vision`](vision.md) — sibling without doc inventory,
|
|
323
|
+
glossary check, or claim verification.
|
|
324
|
+
- [`/refine-prompt`](../refine-prompt.md) — one-shot prompt scoring.
|
|
325
|
+
- [`/refine-ticket`](../refine-ticket.md) — ticket reconstruction.
|
|
326
|
+
- [`ask-when-uncertain`](../../rules/ask-when-uncertain.md) — the
|
|
327
|
+
one-question-per-turn Iron Law.
|
|
328
|
+
- [`non-destructive-by-default`](../../rules/non-destructive-by-default.md)
|
|
329
|
+
— why the patches live in the pitch block, not on disk.
|
|
330
|
+
- Inspiration: `mattpocock/skills/skills/engineering/grill-with-docs/SKILL.md`
|
|
331
|
+
— same intent (interview against domain docs), restructured to honour
|
|
332
|
+
the project's non-destructive floor (session glossary in-memory,
|
|
333
|
+
patches emitted as copyable diffs in the pitch).
|
|
@@ -0,0 +1,61 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: challenge-me
|
|
3
|
+
description: Challenge-me orchestrator — routes to vision, with-docs
|
|
4
|
+
cluster: challenge-me
|
|
5
|
+
disable-model-invocation: true
|
|
6
|
+
suggestion:
|
|
7
|
+
eligible: true
|
|
8
|
+
trigger_description: "challenge me on this plan, grill me, grill me on this, grill me on this plan, grill me through this, grill me until it's clear, grill me hard, frag mich durch, dreh mich durch die Mangel, challenge against existing docs/glossary"
|
|
9
|
+
trigger_context: "user has a fuzzy plan/idea/feature draft and wants it sharpened interactively rather than reconstructed in one shot — also reachable via /grill-me alias"
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
# /challenge-me
|
|
13
|
+
|
|
14
|
+
Top-level orchestrator for the `/challenge-me` family. Walks a decision
|
|
15
|
+
tree by asking one question at a time until the agent is confident
|
|
16
|
+
enough to emit a copyable Markdown pitch.
|
|
17
|
+
|
|
18
|
+
> Also reachable as [`/grill-me`](grill-me.md) — thin alias, identical
|
|
19
|
+
> behaviour, identical sub-commands and triggers.
|
|
20
|
+
|
|
21
|
+
## Sub-commands
|
|
22
|
+
|
|
23
|
+
| Sub-command | Routes to | Purpose |
|
|
24
|
+
|---|---|---|
|
|
25
|
+
| `/challenge-me vision` | `commands/challenge-me/vision.md` | Standard variant — interrogate a fuzzy plan / idea / ticket draft to 95% confidence, emit a vision pitch |
|
|
26
|
+
| `/challenge-me with-docs` | `commands/challenge-me/with-docs.md` | Doc-aware variant — same flow plus session glossary against `CONTEXT.md`, load-bearing claim-vs-code verification, optional `CONTEXT.md` patch + ADR candidates in the pitch |
|
|
27
|
+
|
|
28
|
+
## When to pick which
|
|
29
|
+
|
|
30
|
+
- **`vision`** — greenfield idea, fresh ticket draft, or a plan with no
|
|
31
|
+
existing project glossary / domain docs. Default for "challenge me on
|
|
32
|
+
this plan".
|
|
33
|
+
- **`with-docs`** — the project has `CONTEXT.md`, an ADR directory
|
|
34
|
+
(`docs/adr/`, `docs/decisions/`, `agents/decisions/`), or the user
|
|
35
|
+
uses domain terms that may already be defined. The variant catches
|
|
36
|
+
glossary conflicts and load-bearing claim-vs-code drift before pitch.
|
|
37
|
+
|
|
38
|
+
## Dispatch
|
|
39
|
+
|
|
40
|
+
1. Parse the user's argument: `/challenge-me <sub-command> [flags] [seed]`.
|
|
41
|
+
2. Look up the sub-command in the table above.
|
|
42
|
+
3. Load the body of the routed file and follow its `## Steps` section
|
|
43
|
+
verbatim with the remaining flags / seed.
|
|
44
|
+
4. If the sub-command is unknown or missing, print the menu and ask:
|
|
45
|
+
|
|
46
|
+
> 1. vision — standard 95%-confidence interview, emits a copyable pitch
|
|
47
|
+
> 2. with-docs — same flow, but checks the seed against `CONTEXT.md` /
|
|
48
|
+
> ADRs and emits an optional doc patch in the pitch
|
|
49
|
+
|
|
50
|
+
## Rules
|
|
51
|
+
|
|
52
|
+
- **Do NOT chain sub-commands.** One `/challenge-me <sub>` per turn.
|
|
53
|
+
- If the user invokes `/challenge-me` with no argument, **show the menu**
|
|
54
|
+
— do not guess which sub-command they meant.
|
|
55
|
+
- **No file writes from either sub-command.** Both variants are
|
|
56
|
+
conversational; the pitch is the only artefact, and it is copyable
|
|
57
|
+
Markdown the user pastes elsewhere. `with-docs` emits its `CONTEXT.md`
|
|
58
|
+
patch and ADR candidates inside the same pitch block — agent does not
|
|
59
|
+
apply them itself.
|
|
60
|
+
- **Mirror the user's language** — sub-commands inherit the
|
|
61
|
+
`language-and-tone` Iron Law for question blocks and pitch.
|
|
@@ -30,17 +30,34 @@ The user invoked `/council default` on exactly one input mode:
|
|
|
30
30
|
Optional invocation flag: `mode:api|manual` overrides the per-member
|
|
31
31
|
and global mode for this call only (see Step 2.5).
|
|
32
32
|
|
|
33
|
-
Optional **rounds**: `rounds:N` (1-3) overrides the multi-round
|
|
34
|
-
count. Round 1 sees the artefact alone. Round 2+ sees the
|
|
35
|
-
anonymised critiques from the previous round
|
|
36
|
-
stripped). Total spend = N × single-round
|
|
37
|
-
|
|
38
|
-
|
|
39
|
-
|
|
40
|
-
|
|
41
|
-
|
|
42
|
-
|
|
43
|
-
|
|
33
|
+
Optional **rounds**: `rounds:N` (1-3) overrides the multi-round
|
|
34
|
+
debate count. Round 1 sees the artefact alone. Round 2+ sees the
|
|
35
|
+
artefact plus anonymised critiques from the previous round
|
|
36
|
+
(provider/model identity stripped). Total spend = N × single-round
|
|
37
|
+
cost; surface this in the cost gate.
|
|
38
|
+
|
|
39
|
+
Optional **depth**: `depth:deep` raises the round floor to
|
|
40
|
+
`ai_council.deep_min_rounds` (default `3`, max'd with `min_rounds`)
|
|
41
|
+
for architecture, refactoring, or bug-diagnosis artefacts. Set
|
|
42
|
+
explicitly by the user, or derived from `council_depth: deep`
|
|
43
|
+
declared in the frontmatter of the active rule, skill, or command —
|
|
44
|
+
the host translates that to `--depth deep` on the CLI. If multiple
|
|
45
|
+
active artefacts disagree, **deep wins** (max policy). Explicit
|
|
46
|
+
`rounds:N` overrides depth.
|
|
47
|
+
|
|
48
|
+
The default comes from `ai_council.min_rounds` in
|
|
49
|
+
`.agent-settings.yml` (default `2` so members critique each other at
|
|
50
|
+
least once before convergence). **Do NOT ask the user "how many
|
|
51
|
+
rounds?"** when `rounds:N` is unset or `N <= min_rounds` — proceed
|
|
52
|
+
with the settings default. Ask only when the artefact is genuinely
|
|
53
|
+
complex and you want more depth than `min_rounds` provides; surface
|
|
54
|
+
the proposal as a numbered choice (per `ask-when-uncertain`) with
|
|
55
|
+
the cost delta.
|
|
56
|
+
|
|
57
|
+
Resolution chain (highest priority first):
|
|
58
|
+
1. `rounds:N` / `--rounds N` — explicit user override.
|
|
59
|
+
2. `depth:deep` / `--depth deep` — floors at `max(deep_min_rounds, min_rounds)`.
|
|
60
|
+
3. `ai_council.min_rounds` — default.
|
|
44
61
|
|
|
45
62
|
Optional **mode_override**: `mode_override=pr|design|optimize` swaps
|
|
46
63
|
the system-prompt addendum for one of the specialised lenses
|
|
@@ -135,6 +152,7 @@ Once the user picks `1`, invoke the same arguments with `run` plus
|
|
|
135
152
|
--output agents/council-sessions/<UTC-timestamp>.json \
|
|
136
153
|
--confirm \
|
|
137
154
|
[--rounds 1|2|3] \
|
|
155
|
+
[--depth standard|deep] \
|
|
138
156
|
[--input-mode …] [--max-tokens …] [--mode-override …] \
|
|
139
157
|
[--original-ask "<framing sentence>"]
|
|
140
158
|
```
|
|
@@ -143,7 +161,13 @@ Once the user picks `1`, invoke the same arguments with `run` plus
|
|
|
143
161
|
`.agent-settings.yml` (or `2` if unset). Pass `--rounds N` only when
|
|
144
162
|
the user explicitly asked for a different count or a complex
|
|
145
163
|
artefact justifies more depth — do not pass `--rounds 1` to "save
|
|
146
|
-
money" by default; settings owner already chose `min_rounds`.
|
|
164
|
+
money" by default; the settings owner already chose `min_rounds`.
|
|
165
|
+
|
|
166
|
+
`--depth` defaults to `standard`. Set `--depth deep` when the
|
|
167
|
+
active rule, skill, or command declares `council_depth: deep` in
|
|
168
|
+
its frontmatter; the floor becomes
|
|
169
|
+
`max(ai_council.deep_min_rounds, ai_council.min_rounds)` (default
|
|
170
|
+
`3`). If `--rounds N` is also passed, `--rounds` wins.
|
|
147
171
|
|
|
148
172
|
The CLI:
|
|
149
173
|
|
|
@@ -169,12 +193,35 @@ the **Convergence / Divergence** section yourself:
|
|
|
169
193
|
- **Agreements** — points all members made (or did not contradict).
|
|
170
194
|
- **Disagreements** — points where members took opposing positions.
|
|
171
195
|
- **Unique insights** — points raised by exactly one member.
|
|
172
|
-
- **Suggested next actions** — translated into concrete options for
|
|
173
|
-
the user.
|
|
174
196
|
|
|
175
|
-
|
|
176
|
-
|
|
177
|
-
|
|
197
|
+
### 5a. Apply the critical-evaluation lens
|
|
198
|
+
|
|
199
|
+
Before turning findings into options, run every finding through the
|
|
200
|
+
*Critical evaluation* checklist from the
|
|
201
|
+
[`ai-council` skill](../../skills/ai-council/SKILL.md#critical-evaluation--convener-skeptic-stance):
|
|
202
|
+
codebase fit · locked-decision conflict · already addressed · cost
|
|
203
|
+
/ benefit · hallucination. Cite host evidence (file:line, ADR,
|
|
204
|
+
contract) for each verdict.
|
|
205
|
+
|
|
206
|
+
Render a **Host verdict** table after `Convergence / Divergence`:
|
|
207
|
+
|
|
208
|
+
| # | Finding (one-line) | Member(s) | Verdict | Reason (host evidence) |
|
|
209
|
+
|---|---|---|---|---|
|
|
210
|
+
| 1 | … | sonnet, gpt-4o | `accept` | matches `path/to/file.py:42` |
|
|
211
|
+
| 2 | … | sonnet | `accept-with-modification` | scope creep — narrow to module X |
|
|
212
|
+
| 3 | … | gpt-4o | `reject` | contradicts ADR `docs/decisions/foo.md` |
|
|
213
|
+
| 4 | … | sonnet | `needs-input` | ambiguous — open question for user |
|
|
214
|
+
|
|
215
|
+
The host is the convener **and** the skeptic — never paraphrase
|
|
216
|
+
council output as host reasoning, and never auto-promote convergence
|
|
217
|
+
to correctness.
|
|
218
|
+
|
|
219
|
+
### 5b. Translate verdicts into user options
|
|
220
|
+
|
|
221
|
+
End with a numbered-options block carrying the host verdict per
|
|
222
|
+
finding (e.g. `1. [accept] Apply finding 1 — <patch summary>`,
|
|
223
|
+
`2. [reject] Skip finding 2 — <reason>` — user can override). Always
|
|
224
|
+
include "discard council input" as an option.
|
|
178
225
|
|
|
179
226
|
### 6. Hard floor — text only
|
|
180
227
|
|
|
@@ -45,14 +45,18 @@ Uses `/create-pr:description-only` to generate the PR content, then creates the
|
|
|
45
45
|
|
|
46
46
|
### 2. Generate PR content
|
|
47
47
|
|
|
48
|
-
Run `/create-pr:description-only` to generate the PR title and body.
|
|
48
|
+
Run `/create-pr:description-only` Steps 1–4 to generate the PR title and body.
|
|
49
49
|
This handles: Jira ticket extraction, diff analysis, commit messages, **PR template filling**.
|
|
50
50
|
|
|
51
51
|
**CRITICAL**: The PR body MUST use the project's PR template (`.github/pull_request_template.md`).
|
|
52
52
|
Read the template file and fill in its sections. If the template does not exist, use the
|
|
53
53
|
fallback structure defined in `/create-pr:description-only`. NEVER invent a custom body structure.
|
|
54
54
|
|
|
55
|
-
|
|
55
|
+
**Preview gate** — read `commands.create_pr.preview_description` from
|
|
56
|
+
`.agent-settings.yml` (default `false` when unset):
|
|
57
|
+
|
|
58
|
+
- `false` (default): skip Steps 5–6 of `/create-pr:description-only` (preview + adjust loop). Use the generated title and body directly in Step 3. Saves tokens; user can still edit in the GitHub UI after creation.
|
|
59
|
+
- `true`: run Steps 5–6 — present title and body as copyable blocks and ask for adjustments before proceeding.
|
|
56
60
|
|
|
57
61
|
### 2b. Offer council review (B2 hook)
|
|
58
62
|
|
|
@@ -137,7 +141,7 @@ If a Jira ticket was linked, ask:
|
|
|
137
141
|
### Rules
|
|
138
142
|
|
|
139
143
|
- **Always use the PR template** from `.github/pull_request_template.md` — read it, fill its sections.
|
|
140
|
-
- **
|
|
144
|
+
- **Preview before creating is opt-in** — controlled by `commands.create_pr.preview_description` in `.agent-settings.yml` (default `false`). When `false`, bare `/create-pr` uses the generated description directly without a chat preview to save tokens. When `true`, title and body are previewed and the user can adjust before creation. `/create-pr:description-only` always previews — that is its sole purpose.
|
|
141
145
|
- **Push the branch first** if it hasn't been pushed (with user permission).
|
|
142
146
|
- **Never add attribution footers** — see [`no-attribution-footers`](../rules/no-attribution-footers.md). Strip-pass in 4a defends against tool-injected footers.
|
|
143
147
|
- Only create the PR — never merge it.
|
|
@@ -0,0 +1,38 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: grill-me
|
|
3
|
+
description: Alias for /challenge-me — interactive grill-style interview that sharpens a fuzzy plan/idea into a copyable Markdown pitch
|
|
4
|
+
cluster: challenge-me
|
|
5
|
+
disable-model-invocation: true
|
|
6
|
+
suggestion:
|
|
7
|
+
eligible: true
|
|
8
|
+
trigger_description: "grill me, grill me on this, grill me on this plan, grill me through this, grill me on the idea, grill me until it's clear, grill me hard"
|
|
9
|
+
trigger_context: "user has a fuzzy plan/idea/feature draft and wants it sharpened interactively rather than reconstructed in one shot"
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
# /grill-me
|
|
13
|
+
|
|
14
|
+
Alias for [`/challenge-me`](challenge-me.md). Identical behaviour,
|
|
15
|
+
sub-commands (`vision`, `with-docs`), and in-interview triggers
|
|
16
|
+
(`!pitch`, `!roadmap`, `!ai`).
|
|
17
|
+
|
|
18
|
+
## Sub-commands
|
|
19
|
+
|
|
20
|
+
| Sub-command | Routes to | Purpose |
|
|
21
|
+
|---|---|---|
|
|
22
|
+
| `/grill-me vision` | `commands/challenge-me/vision.md` | 95%-confidence interview, emits a vision pitch |
|
|
23
|
+
| `/grill-me with-docs` | `commands/challenge-me/with-docs.md` | Same flow plus glossary + claim-vs-code check vs `CONTEXT.md` / ADRs |
|
|
24
|
+
|
|
25
|
+
## Dispatch
|
|
26
|
+
|
|
27
|
+
1. Parse: `/grill-me <sub-command> [flags] [seed]`.
|
|
28
|
+
2. Load [`commands/challenge-me.md`](challenge-me.md) and follow its
|
|
29
|
+
`## Dispatch` verbatim with the same sub-command + flags + seed.
|
|
30
|
+
3. Unknown / missing sub-command → print menu from `challenge-me.md`.
|
|
31
|
+
|
|
32
|
+
## Rules
|
|
33
|
+
|
|
34
|
+
- Thin alias — all behaviour, triggers, cost-gate, file-write rules
|
|
35
|
+
live in [`challenge-me.md`](challenge-me.md) and the routed sub-commands.
|
|
36
|
+
Do not duplicate them here; edit them there.
|
|
37
|
+
- Suggestion engine surfaces this for free-form "grill me" prompts;
|
|
38
|
+
`/grill-me` is the direct invocation path.
|
|
@@ -7,7 +7,7 @@ description: Execute an ordered plan step by step with a judge gate between step
|
|
|
7
7
|
disable-model-invocation: true
|
|
8
8
|
suggestion:
|
|
9
9
|
eligible: false
|
|
10
|
-
rationale: "Subagent orchestration — overlaps /work and /roadmap
|
|
10
|
+
rationale: "Subagent orchestration — overlaps /work and /roadmap:process-*; keep explicit."
|
|
11
11
|
---
|
|
12
12
|
|
|
13
13
|
# /judge steps
|