universal-dev-standards 5.7.2 → 5.8.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bundled/ai/standards/acceptance-criteria-traceability.ai.yaml +49 -3
- package/bundled/ai/standards/behavior-snapshot.ai.yaml +168 -0
- package/bundled/ai/standards/feature-manifest-standard.ai.yaml +146 -0
- package/bundled/core/acceptance-criteria-traceability.md +41 -6
- package/bundled/core/behavior-snapshot.md +239 -0
- package/bundled/core/feature-manifest-standard.md +213 -0
- package/bundled/locales/zh-CN/CHANGELOG.md +17 -3
- package/bundled/locales/zh-CN/README.md +1 -1
- package/bundled/locales/zh-TW/CHANGELOG.md +17 -3
- package/bundled/locales/zh-TW/README.md +1 -1
- package/bundled/skills/brainstorm-assistant/SKILL.md +183 -30
- package/bundled/skills/brainstorm-assistant/guide.md +607 -117
- package/package.json +1 -1
- package/src/commands/update.js +12 -16
- package/standards-registry.json +29 -5
|
@@ -10,8 +10,8 @@ description: |
|
|
|
10
10
|
|
|
11
11
|
> **Language**: English | [繁體中文](../../locales/zh-TW/skills/brainstorm-assistant/guide.md)
|
|
12
12
|
|
|
13
|
-
**Version**:
|
|
14
|
-
**Last Updated**: 2026-
|
|
13
|
+
**Version**: 2.0.0
|
|
14
|
+
**Last Updated**: 2026-05-09
|
|
15
15
|
**Applicability**: All software projects
|
|
16
16
|
**Scope**: universal
|
|
17
17
|
**Type**: Utility Skill (no core standard)
|
|
@@ -31,31 +31,133 @@ This skill fills the ideation gap in the UDS workflow:
|
|
|
31
31
|
```
|
|
32
32
|
/brainstorm → /requirement → /sdd → Implementation
|
|
33
33
|
▲ ▲ ▲
|
|
34
|
-
(
|
|
34
|
+
(this) Existing Existing
|
|
35
35
|
```
|
|
36
36
|
|
|
37
37
|
---
|
|
38
38
|
|
|
39
|
+
## Research Foundations | 認知科學依據
|
|
40
|
+
|
|
41
|
+
The v2.0 workflow is grounded in three research findings that challenge
|
|
42
|
+
assumptions in Osborn's classic brainstorming rules:
|
|
43
|
+
|
|
44
|
+
v2.0 流程基於三項研究發現,這些發現挑戰了 Osborn 經典腦力激盪規則中的假設:
|
|
45
|
+
|
|
46
|
+
### 1. Independent thinking before merging (Nominal Group Technique)
|
|
47
|
+
|
|
48
|
+
Groups where members first generate ideas independently — then combine — consistently
|
|
49
|
+
outperform interacting groups in both quantity and quality. The mechanism is
|
|
50
|
+
**production blocking**: while listening to others (or reading AI output), your
|
|
51
|
+
own thought stream is interrupted.
|
|
52
|
+
|
|
53
|
+
**Application in this skill:** Phase 0 PRE-FLIGHT collects three user ideas
|
|
54
|
+
before the AI generates anything, preventing AI-first anchoring.
|
|
55
|
+
|
|
56
|
+
**在本 Skill 的應用:** Phase 0 PRE-FLIGHT 在 AI 生成任何內容前收集使用者的三個想法,
|
|
57
|
+
防止 AI 先說話導致的錨定效應。
|
|
58
|
+
|
|
59
|
+
### 2. Creative ideas appear in the second half of divergence (Nijstad et al.)
|
|
60
|
+
|
|
61
|
+
Studies consistently show that the first 3–5 ideas in any brainstorming session
|
|
62
|
+
are almost always the most familiar and obvious. Truly creative ideas emerge
|
|
63
|
+
after the "obvious answer zone" is exhausted — typically after idea 7 or 8.
|
|
64
|
+
|
|
65
|
+
**Application in this skill:** The 10-idea minimum gate and semantic batching in
|
|
66
|
+
Phase 2 force users past this threshold before evaluation begins.
|
|
67
|
+
|
|
68
|
+
**在本 Skill 的應用:** Phase 2 的 10 個想法最低門檻和語義批次化,強制使用者在開始評估
|
|
69
|
+
前突破「顯而易見答案區」。
|
|
70
|
+
|
|
71
|
+
### 3. Debate produces more and better ideas than "no criticism" rules (Nemeth, 1995)
|
|
72
|
+
|
|
73
|
+
Charlan Nemeth's research directly challenges Osborn's core rule of deferring
|
|
74
|
+
all judgment. Groups instructed to debate and criticise generated more ideas,
|
|
75
|
+
of higher quality, than groups following traditional "no criticism" brainstorming
|
|
76
|
+
rules. The mechanism: criticism forces explicit defence of assumptions, surfacing
|
|
77
|
+
hidden weaknesses before commitment.
|
|
78
|
+
|
|
79
|
+
**Application in this skill:** The Rebuttal Round in Phase 3 introduces
|
|
80
|
+
structured debate on the top-ranked ideas before final selection.
|
|
81
|
+
|
|
82
|
+
**在本 Skill 的應用:** Phase 3 的反駁輪在最終選擇前對排名最高的想法引入結構化辯論。
|
|
83
|
+
|
|
84
|
+
---
|
|
85
|
+
|
|
39
86
|
## Quick Reference
|
|
40
87
|
|
|
41
88
|
### Workflow Overview
|
|
42
89
|
|
|
43
90
|
```
|
|
44
|
-
┌────────────┐
|
|
45
|
-
│ FRAME
|
|
46
|
-
│ Define the │
|
|
47
|
-
│ problem │
|
|
48
|
-
|
|
91
|
+
┌─────────────┐ ┌────────────┐ ┌─────────────────┐ ┌─────────────┐ ┌────────────┐
|
|
92
|
+
│ PRE-FLIGHT │─▶│ FRAME │─▶│ DIVERGE │─▶│ CONVERGE │─▶│ OUTPUT │
|
|
93
|
+
│ (Phase 0) │ │ Define the │ │ Batch 1 (1-5) │ │ Score + │ │ Brainstorm │
|
|
94
|
+
│ User writes │ │ problem │ │ Batch 2 (6-10+) │ │ Rebuttal │ │ Report │
|
|
95
|
+
│ 3 ideas │ │ │ │ Gate: ≥10 ideas │ │ Round │ │ │
|
|
96
|
+
└─────────────┘ └────────────┘ └─────────────────┘ └─────────────┘ └────────────┘
|
|
49
97
|
```
|
|
50
98
|
|
|
51
99
|
### Phase Summary
|
|
52
100
|
|
|
53
|
-
| Phase | Goal | Key
|
|
54
|
-
|
|
55
|
-
| **
|
|
56
|
-
| **
|
|
57
|
-
| **
|
|
58
|
-
| **
|
|
101
|
+
| Phase | Goal | Key Mechanism | Time |
|
|
102
|
+
|-------|------|---------------|------|
|
|
103
|
+
| **PRE-FLIGHT** | Prevent AI anchoring | User writes 3 ideas first | 3–5 min |
|
|
104
|
+
| **FRAME** | Define problem clearly | 5 Whys, HMW | 10–15 min |
|
|
105
|
+
| **DIVERGE** | Generate ≥10 diverse ideas | Batching + semantic gate | 15–25 min |
|
|
106
|
+
| **CONVERGE** | Select battle-tested ideas | Scoring + Rebuttal Round | 15–20 min |
|
|
107
|
+
| **OUTPUT** | Actionable report | Brainstorm Report template | 5–10 min |
|
|
108
|
+
|
|
109
|
+
---
|
|
110
|
+
|
|
111
|
+
## Phase 0: PRE-FLIGHT | 防止 AI 錨定
|
|
112
|
+
|
|
113
|
+
> Goal: Establish independent thinking before any AI content is generated.
|
|
114
|
+
>
|
|
115
|
+
> 目標:在任何 AI 內容生成之前,建立使用者的獨立思考框架。
|
|
116
|
+
|
|
117
|
+
### Why this matters
|
|
118
|
+
|
|
119
|
+
The single highest-leverage change in v2.0. Research shows that once a person
|
|
120
|
+
sees any AI-generated framing, subsequent ideas cluster within that semantic
|
|
121
|
+
space. Pre-flight creates an "intellectual immune system" against this bias.
|
|
122
|
+
|
|
123
|
+
v2.0 中槓桿效應最高的改動。研究顯示,一旦看到任何 AI 生成的框架,後續想法就會在該語義
|
|
124
|
+
空間內聚集。Pre-flight 為這種偏見創造了「智識免疫系統」。
|
|
125
|
+
|
|
126
|
+
### Prompt the user to provide
|
|
127
|
+
|
|
128
|
+
```
|
|
129
|
+
Before we start brainstorming, please take 2–3 minutes to write:
|
|
130
|
+
|
|
131
|
+
1. Problem (one sentence): What is the core problem you want to solve?
|
|
132
|
+
2. Your initial ideas (3, any quality):
|
|
133
|
+
- Idea A:
|
|
134
|
+
- Idea B:
|
|
135
|
+
- Idea C:
|
|
136
|
+
3. What I do NOT want (solution type to avoid, or N/A):
|
|
137
|
+
|
|
138
|
+
Submit when ready. The AI will read these before generating anything.
|
|
139
|
+
```
|
|
140
|
+
|
|
141
|
+
### AI behaviour after receiving Pre-flight input
|
|
142
|
+
|
|
143
|
+
1. Acknowledge the user's ideas without evaluating them
|
|
144
|
+
2. Proceed to FRAME
|
|
145
|
+
3. In DIVERGE Batch 1, explicitly explore directions the user did not mention
|
|
146
|
+
4. If the user declared an unwanted solution type, exclude that type from all
|
|
147
|
+
generated ideas throughout the session
|
|
148
|
+
|
|
149
|
+
### Skipping Pre-flight
|
|
150
|
+
|
|
151
|
+
Use `--skip-preflight` to bypass. A one-line warning is displayed:
|
|
152
|
+
|
|
153
|
+
```
|
|
154
|
+
⚠ Skipping Pre-flight may cause AI anchoring
|
|
155
|
+
```
|
|
156
|
+
|
|
157
|
+
The session continues immediately to FRAME. Pre-flight skip is appropriate when:
|
|
158
|
+
- The user has already written extensive notes elsewhere
|
|
159
|
+
- This is a repeat session on a well-understood problem
|
|
160
|
+
- Time is severely constrained (use `--quick` instead when possible)
|
|
59
161
|
|
|
60
162
|
---
|
|
61
163
|
|
|
@@ -158,21 +260,63 @@ This grounds ideation in reality and prevents proposing ideas that conflict with
|
|
|
158
260
|
|
|
159
261
|
---
|
|
160
262
|
|
|
161
|
-
## Phase 2: DIVERGE |
|
|
263
|
+
## Phase 2: DIVERGE | 發散思考(v2.0 升級版)
|
|
162
264
|
|
|
163
|
-
> Goal: Generate
|
|
265
|
+
> Goal: Generate at least 10 ideas across two semantic batches before evaluating any.
|
|
164
266
|
>
|
|
165
|
-
>
|
|
267
|
+
> 目標:在評估之前,跨越兩個語義批次產生至少 10 個想法。
|
|
268
|
+
|
|
269
|
+
### The 10-Idea Gate
|
|
270
|
+
|
|
271
|
+
**Research basis:** Nijstad et al. show that the most creative ideas appear in
|
|
272
|
+
the second half of a divergence session. Stopping at 3–5 ideas almost always
|
|
273
|
+
means stopping in the "obvious answer zone."
|
|
274
|
+
|
|
275
|
+
The "Enter CONVERGE" option is hidden until 10 ideas are generated. Below 10,
|
|
276
|
+
the status shows `Continue diverging (N/10)`.
|
|
277
|
+
|
|
278
|
+
### Batch 1 — Intuition Batch (ideas 1–5) | 直覺批
|
|
279
|
+
|
|
280
|
+
Generate fast, unfiltered ideas. Do not evaluate at this stage.
|
|
281
|
+
|
|
282
|
+
Rules:
|
|
283
|
+
- Speed over depth
|
|
284
|
+
- No idea is wrong
|
|
285
|
+
- Label the batch "Intuition Batch — fast, unfiltered"
|
|
286
|
+
- Display `✓ Intuition batch complete` after idea 5
|
|
287
|
+
|
|
288
|
+
### Batch 2 — Extension Batch (ideas 6–10) | 延伸批
|
|
289
|
+
|
|
290
|
+
Generate ideas that cross the semantic boundary of Batch 1.
|
|
291
|
+
|
|
292
|
+
Display before starting:
|
|
293
|
+
```
|
|
294
|
+
Extension Batch: ideas must cross the semantic boundary of Batch 1.
|
|
295
|
+
If your next idea is in the same theme category as a Batch 1 idea,
|
|
296
|
+
try a different angle first.
|
|
297
|
+
```
|
|
166
298
|
|
|
167
|
-
|
|
168
|
-
|
|
169
|
-
|
|
170
|
-
|
|
171
|
-
4. Encourage wild ideas — they often lead to practical breakthroughs
|
|
299
|
+
Semantic overlap detection (non-blocking):
|
|
300
|
+
- If a proposed idea shares a theme type with any Batch 1 idea, flag:
|
|
301
|
+
`⚠ Semantic overlap — try a different direction`
|
|
302
|
+
- The user may still submit the idea; the flag is advisory only
|
|
172
303
|
|
|
173
|
-
###
|
|
304
|
+
### Continuing past 10
|
|
174
305
|
|
|
175
|
-
|
|
306
|
+
Users may continue beyond 10 ideas without limit. No upper gate exists.
|
|
307
|
+
After 10, the "Enter CONVERGE" option appears alongside "Continue diverging".
|
|
308
|
+
|
|
309
|
+
### Techniques
|
|
310
|
+
|
|
311
|
+
| Technique | When to Use | 使用時機 |
|
|
312
|
+
|-----------|-------------|----------|
|
|
313
|
+
| **HMW Questions** | Default starting point | 預設起點 |
|
|
314
|
+
| **SCAMPER** | Improving existing features | 改善現有功能 |
|
|
315
|
+
| **Six Thinking Hats** | Need multiple perspectives | 需要多角度思考 |
|
|
316
|
+
|
|
317
|
+
#### Technique A: HMW Brainstorming (Default)
|
|
318
|
+
|
|
319
|
+
For each HMW question, generate 3–5 solution ideas.
|
|
176
320
|
|
|
177
321
|
**Template:**
|
|
178
322
|
|
|
@@ -187,7 +331,7 @@ Ideas:
|
|
|
187
331
|
5. [Idea] — [Brief explanation]
|
|
188
332
|
```
|
|
189
333
|
|
|
190
|
-
|
|
334
|
+
#### Technique B: SCAMPER
|
|
191
335
|
|
|
192
336
|
Apply 7 creative prompts to an existing feature or process. Best for improving what already exists.
|
|
193
337
|
|
|
@@ -215,7 +359,7 @@ E - Eliminate: [idea]
|
|
|
215
359
|
R - Reverse: [idea]
|
|
216
360
|
```
|
|
217
361
|
|
|
218
|
-
|
|
362
|
+
#### Technique C: Six Thinking Hats
|
|
219
363
|
|
|
220
364
|
Examine the problem from 6 distinct perspectives. Best when you need comprehensive analysis.
|
|
221
365
|
|
|
@@ -228,41 +372,17 @@ Examine the problem from 6 distinct perspectives. Best when you need comprehensi
|
|
|
228
372
|
| 5 | Green | Creativity | What new ideas emerge? What if we...? |
|
|
229
373
|
| 6 | Blue | Process & Summary | What's the big picture? What's our next step? |
|
|
230
374
|
|
|
231
|
-
**Template:**
|
|
232
|
-
|
|
233
|
-
```
|
|
234
|
-
Topic: [topic]
|
|
235
|
-
|
|
236
|
-
White Hat (Facts):
|
|
237
|
-
- [fact/data point]
|
|
238
|
-
|
|
239
|
-
Red Hat (Feelings):
|
|
240
|
-
- [intuition/emotion]
|
|
241
|
-
|
|
242
|
-
Black Hat (Risks):
|
|
243
|
-
- [risk/concern]
|
|
244
|
-
|
|
245
|
-
Yellow Hat (Benefits):
|
|
246
|
-
- [opportunity/benefit]
|
|
247
|
-
|
|
248
|
-
Green Hat (Ideas):
|
|
249
|
-
- [creative idea]
|
|
250
|
-
|
|
251
|
-
Blue Hat (Summary):
|
|
252
|
-
- [synthesis and next step]
|
|
253
|
-
```
|
|
254
|
-
|
|
255
375
|
---
|
|
256
376
|
|
|
257
|
-
## Phase 3: CONVERGE |
|
|
377
|
+
## Phase 3: CONVERGE | 反駁收斂(v2.0 升級版)
|
|
258
378
|
|
|
259
|
-
> Goal:
|
|
379
|
+
> Goal: Select ideas that survive both positive scoring AND structured debate.
|
|
260
380
|
>
|
|
261
|
-
>
|
|
381
|
+
> 目標:選出同時通過正向評分和結構化辯論的想法。
|
|
262
382
|
|
|
263
|
-
### Evaluation Matrix
|
|
383
|
+
### Step 3a: Evaluation Matrix | 評分
|
|
264
384
|
|
|
265
|
-
Score each idea on 4 criteria (1
|
|
385
|
+
Score each idea on 4 criteria (1–5 scale):
|
|
266
386
|
|
|
267
387
|
| Criterion | Weight | Score Guide |
|
|
268
388
|
|-----------|--------|-------------|
|
|
@@ -286,14 +406,50 @@ Score = (Feasibility × 0.3) + (Impact × 0.3) + (Effort × 0.2) + (Alignment ×
|
|
|
286
406
|
| 3 | Progressive form | 5 | 4 | 4 | 4 | **4.3** |
|
|
287
407
|
| 4 | Guest checkout | 5 | 3 | 5 | 3 | **4.0** |
|
|
288
408
|
|
|
289
|
-
###
|
|
409
|
+
### Step 3b: Rebuttal Round | 反駁輪
|
|
410
|
+
|
|
411
|
+
**Research basis (Nemeth, 1995):** Groups allowed to debate produce more and
|
|
412
|
+
better ideas than groups following "no-criticism" rules. The mechanism is that
|
|
413
|
+
criticism forces explicit defence of assumptions, which surfaces hidden
|
|
414
|
+
weaknesses before commitment.
|
|
415
|
+
|
|
416
|
+
**For each of the top 3 ideas**, the AI presents **2 specific counterarguments.**
|
|
417
|
+
|
|
418
|
+
Format for each counterargument:
|
|
419
|
+
```
|
|
420
|
+
"This idea will fail in [specific context] because [specific reason]."
|
|
421
|
+
```
|
|
422
|
+
|
|
423
|
+
**NOT acceptable** (too vague):
|
|
424
|
+
- "This might be difficult to implement."
|
|
425
|
+
- "There could be edge cases."
|
|
426
|
+
|
|
427
|
+
**Acceptable** (specific failure condition):
|
|
428
|
+
- "This idea will fail for enterprise customers because their IT policy
|
|
429
|
+
prohibits storing OAuth tokens in browser localStorage."
|
|
430
|
+
- "This idea will fail during peak traffic because the synchronous
|
|
431
|
+
API call blocks the render thread, causing visible jank at 500ms+."
|
|
432
|
+
|
|
433
|
+
### User response options
|
|
434
|
+
|
|
435
|
+
The user MUST select one of three options per counterargument before the session advances:
|
|
436
|
+
|
|
437
|
+
| Option | Action |
|
|
438
|
+
|--------|--------|
|
|
439
|
+
| (a) Accept criticism | Provide a modified version of the idea that addresses the failure |
|
|
440
|
+
| (b) Disagree | Provide a specific reason why the counterargument does not apply |
|
|
441
|
+
| (c) Criticism valid | Remove the idea from the ranking |
|
|
290
442
|
|
|
291
|
-
|
|
443
|
+
### Rebuttal outcome in report
|
|
292
444
|
|
|
293
|
-
|
|
294
|
-
|
|
295
|
-
|
|
296
|
-
|
|
445
|
+
Each idea that remains after the rebuttal round receives a badge in the final report:
|
|
446
|
+
|
|
447
|
+
```
|
|
448
|
+
✓ Passed rebuttal — [one-line summary of user's response]
|
|
449
|
+
```
|
|
450
|
+
|
|
451
|
+
**Skipping:** `--no-rebuttal` skips the rebuttal round. The report section is
|
|
452
|
+
marked "Rebuttal: skipped".
|
|
297
453
|
|
|
298
454
|
---
|
|
299
455
|
|
|
@@ -311,6 +467,8 @@ When the evaluation matrix feels too heavy, use dot voting:
|
|
|
311
467
|
**Date**: YYYY-MM-DD
|
|
312
468
|
**Participants**: [human, AI assistant]
|
|
313
469
|
**Techniques Used**: [HMW, SCAMPER, etc.]
|
|
470
|
+
**Pre-flight**: [Completed / Skipped]
|
|
471
|
+
**Rebuttal Round**: [Completed / Skipped]
|
|
314
472
|
|
|
315
473
|
## Problem Statement
|
|
316
474
|
|
|
@@ -324,36 +482,37 @@ When the evaluation matrix feels too heavy, use dot voting:
|
|
|
324
482
|
|
|
325
483
|
## Ideas Generated
|
|
326
484
|
|
|
327
|
-
| # | Idea | Source Technique | Feasibility | Impact | Effort | Alignment | Score |
|
|
328
|
-
|
|
329
|
-
| 1 | ... | SCAMPER-R | 4 | 5 | 3 | 5 | 4.3 |
|
|
330
|
-
| 2 | ... | HMW | 3 | 4 | 2 | 4 | 3.3 |
|
|
331
|
-
| 3 | ... | Six Hats-Green | 5 | 4 | 4 | 4 | 4.3 |
|
|
485
|
+
| # | Idea | Batch | Source Technique | Feasibility | Impact | Effort | Alignment | Score |
|
|
486
|
+
|---|------|-------|-----------------|-------------|--------|--------|-----------|-------|
|
|
487
|
+
| 1 | ... | B1 | SCAMPER-R | 4 | 5 | 3 | 5 | 4.3 |
|
|
488
|
+
| 2 | ... | B2 | HMW | 3 | 4 | 2 | 4 | 3.3 |
|
|
489
|
+
| 3 | ... | B2 | Six Hats-Green | 5 | 4 | 4 | 4 | 4.3 |
|
|
332
490
|
|
|
333
491
|
## Top 3 Recommendations
|
|
334
492
|
|
|
335
|
-
### 1. [Idea Name] (Score: X.X)
|
|
493
|
+
### 1. [Idea Name] (Score: X.X) ✓ Passed rebuttal
|
|
336
494
|
- **Why**: [Reasoning]
|
|
337
495
|
- **Key Benefit**: [Primary value]
|
|
338
|
-
- **
|
|
496
|
+
- **Rebuttal response**: [One-line summary of how user addressed the challenge]
|
|
339
497
|
- **Estimated Scope**: [Small / Medium / Large]
|
|
340
498
|
|
|
341
|
-
### 2. [Idea Name] (Score: X.X)
|
|
499
|
+
### 2. [Idea Name] (Score: X.X) ✓ Passed rebuttal
|
|
342
500
|
- **Why**: [Reasoning]
|
|
343
501
|
- **Key Benefit**: [Primary value]
|
|
344
|
-
- **
|
|
502
|
+
- **Rebuttal response**: [One-line summary]
|
|
345
503
|
- **Estimated Scope**: [Small / Medium / Large]
|
|
346
504
|
|
|
347
|
-
### 3. [Idea Name] (Score: X.X)
|
|
505
|
+
### 3. [Idea Name] (Score: X.X) ✓ Passed rebuttal
|
|
348
506
|
- **Why**: [Reasoning]
|
|
349
507
|
- **Key Benefit**: [Primary value]
|
|
350
|
-
- **
|
|
508
|
+
- **Rebuttal response**: [One-line summary]
|
|
351
509
|
- **Estimated Scope**: [Small / Medium / Large]
|
|
352
510
|
|
|
353
511
|
## Discarded Ideas (with reasons)
|
|
354
512
|
|
|
355
513
|
| Idea | Reason for Discarding |
|
|
356
514
|
|------|-----------------------|
|
|
515
|
+
| ... | Removed during rebuttal round (counterargument accepted) |
|
|
357
516
|
| ... | Low feasibility (score: 1/5) |
|
|
358
517
|
|
|
359
518
|
## Next Steps
|
|
@@ -365,6 +524,33 @@ When the evaluation matrix feels too heavy, use dot voting:
|
|
|
365
524
|
|
|
366
525
|
---
|
|
367
526
|
|
|
527
|
+
## Flags Reference | 旗標參考
|
|
528
|
+
|
|
529
|
+
| Flag | Phase affected | Behaviour |
|
|
530
|
+
|------|---------------|-----------|
|
|
531
|
+
| `--skip-preflight` | Phase 0 | Bypass Pre-flight; display one-line anchoring warning |
|
|
532
|
+
| `--no-rebuttal` | Phase 3 | Skip rebuttal round; mark report section "Rebuttal: skipped" |
|
|
533
|
+
| `--quick` | All | 3-idea fast mode; Phase 0, 10-idea gate, and rebuttal all exempt |
|
|
534
|
+
| `--technique scamper` | Phase 2 | Force SCAMPER as primary divergence technique |
|
|
535
|
+
|
|
536
|
+
### Quick Mode (`--quick`)
|
|
537
|
+
|
|
538
|
+
Delivers results in under 5 minutes. Output is 20 lines maximum.
|
|
539
|
+
|
|
540
|
+
```
|
|
541
|
+
1 HMW question → 3 ideas → 1 recommendation → next steps
|
|
542
|
+
```
|
|
543
|
+
|
|
544
|
+
All cognitive-science gates (Pre-flight, 10-idea minimum, Rebuttal Round) are
|
|
545
|
+
exempt in quick mode. Quick mode is appropriate for:
|
|
546
|
+
- Mid-coding-session decisions
|
|
547
|
+
- Re-scoping an already-understood problem
|
|
548
|
+
- Initial orientation before a full session
|
|
549
|
+
|
|
550
|
+
Always offer to expand: "Would you like to run a full brainstorming session?"
|
|
551
|
+
|
|
552
|
+
---
|
|
553
|
+
|
|
368
554
|
## Integration with UDS Workflow
|
|
369
555
|
|
|
370
556
|
The Brainstorm Report maps directly to downstream tools:
|
|
@@ -386,7 +572,7 @@ The Brainstorm Report maps directly to downstream tools:
|
|
|
386
572
|
| Problem Statement | Summary / Motivation |
|
|
387
573
|
| Top Recommendation | Proposed Solution |
|
|
388
574
|
| Evaluation Matrix | Trade-offs / Alternatives Considered |
|
|
389
|
-
|
|
|
575
|
+
| Rebuttal responses | Risks section |
|
|
390
576
|
| Estimated Scope | Scope section |
|
|
391
577
|
|
|
392
578
|
---
|
|
@@ -402,61 +588,359 @@ When invoked in a project directory, the brainstorm assistant will:
|
|
|
402
588
|
|
|
403
589
|
---
|
|
404
590
|
|
|
405
|
-
## Example Walkthrough
|
|
591
|
+
## Example Walkthrough (v2.0)
|
|
406
592
|
|
|
407
593
|
### Scenario: "We need to improve user retention"
|
|
408
594
|
|
|
595
|
+
**PRE-FLIGHT (user submits):**
|
|
596
|
+
```
|
|
597
|
+
Problem: 30-day retention is 15%, well below industry average of 40%
|
|
598
|
+
My ideas:
|
|
599
|
+
A: Send re-engagement emails after 7 days of inactivity
|
|
600
|
+
B: Add an achievement / gamification system
|
|
601
|
+
C: Show users a "what's new" summary on login
|
|
602
|
+
Do NOT want: Solutions requiring backend ML models (too slow to ship)
|
|
603
|
+
```
|
|
604
|
+
|
|
409
605
|
**FRAME:**
|
|
410
606
|
```
|
|
411
607
|
5 Whys:
|
|
412
|
-
Problem:
|
|
608
|
+
Problem: Retention at 15%
|
|
609
|
+
Why 1: Users stop using the app after initial signup → don't discover key features
|
|
610
|
+
Why 2: Why don't they discover features? → onboarding shows only basic setup
|
|
611
|
+
Why 3: Why basic setup only? → one-time wizard design assumption
|
|
612
|
+
Why 4: Why wizard? → team assumed users would explore independently
|
|
613
|
+
Why 5: Why does that assumption fail? → 20+ features, no progressive disclosure
|
|
614
|
+
|
|
615
|
+
Root Cause: No progressive onboarding — users see everything or nothing
|
|
616
|
+
|
|
617
|
+
HMW Questions:
|
|
618
|
+
1. How might we guide users to discover features at the right moment?
|
|
619
|
+
2. How might we make feature discovery feel natural, not forced?
|
|
620
|
+
3. How might we celebrate milestones to build engagement habits?
|
|
621
|
+
```
|
|
413
622
|
|
|
414
|
-
|
|
415
|
-
|
|
623
|
+
**DIVERGE — Batch 1 (Intuition):**
|
|
624
|
+
```
|
|
625
|
+
AI note: Exploring directions beyond the user's A/B/C ideas
|
|
626
|
+
|
|
627
|
+
1. Contextual tooltips triggered by user behaviour [HMW-1]
|
|
628
|
+
2. Progressive checklist replacing one-time wizard [SCAMPER-S]
|
|
629
|
+
3. First-task onboarding (skip setup, do real work first) [SCAMPER-C]
|
|
630
|
+
4. Peer-mentor matching for new users [SCAMPER-R]
|
|
631
|
+
5. Habit streak tracker (daily login reward) [HMW-3]
|
|
632
|
+
✓ Intuition batch complete
|
|
633
|
+
```
|
|
416
634
|
|
|
417
|
-
|
|
418
|
-
|
|
635
|
+
**DIVERGE — Batch 2 (Extension — cross semantic boundary):**
|
|
636
|
+
```
|
|
637
|
+
Extension Batch: ideas must cross the semantic boundary of Batch 1
|
|
419
638
|
|
|
420
|
-
|
|
421
|
-
|
|
639
|
+
6. API-driven onboarding: detect user's data and auto-populate examples [HMW-1]
|
|
640
|
+
7. Reverse onboarding: show power-user workflow first, simplify on request [SCAMPER-R]
|
|
641
|
+
8. Social proof: show "your peers use feature X 3× per week" [HMW-2]
|
|
642
|
+
9. Feature unlock gates: earn access to advanced features via usage [HMW-3]
|
|
643
|
+
10. Cohort-based pacing: group users by signup week, send same tips together [HMW-1]
|
|
644
|
+
```
|
|
422
645
|
|
|
423
|
-
|
|
424
|
-
|
|
646
|
+
**CONVERGE — Scoring:**
|
|
647
|
+
```
|
|
648
|
+
| # | Idea | Feasibility | Impact | Effort | Align | Score |
|
|
649
|
+
|---|-------------------------------|-------------|--------|--------|-------|-------|
|
|
650
|
+
| 2 | Progressive checklist | 5 | 5 | 4 | 5 | 4.8 |
|
|
651
|
+
| 6 | API-driven onboarding | 4 | 5 | 3 | 5 | 4.3 |
|
|
652
|
+
| 1 | Contextual tooltips | 4 | 4 | 3 | 5 | 4.0 |
|
|
653
|
+
```
|
|
425
654
|
|
|
426
|
-
|
|
427
|
-
|
|
655
|
+
**CONVERGE — Rebuttal Round:**
|
|
656
|
+
```
|
|
657
|
+
Idea #2: Progressive checklist
|
|
428
658
|
|
|
429
|
-
|
|
659
|
+
Counterargument 1: "This idea will fail for returning users who re-install the
|
|
660
|
+
app because the checklist state is lost if tied to the device, causing
|
|
661
|
+
experienced users to re-do beginner tasks and feel patronised."
|
|
430
662
|
|
|
431
|
-
|
|
432
|
-
|
|
433
|
-
|
|
434
|
-
|
|
663
|
+
User response (b — disagree): "Checklist state is stored server-side tied to
|
|
664
|
+
user ID, so returning users resume from where they left off."
|
|
665
|
+
|
|
666
|
+
Counterargument 2: "This idea will fail for power users who feel checklists are
|
|
667
|
+
infantilising and will disable them immediately if there's no way to opt out."
|
|
668
|
+
|
|
669
|
+
User response (a — accept): Modified → Add a one-click 'I know this already'
|
|
670
|
+
dismiss on each checklist item, with a 'hide checklist' option in settings.
|
|
671
|
+
|
|
672
|
+
→ ✓ Passed rebuttal
|
|
673
|
+
```
|
|
674
|
+
|
|
675
|
+
**OUTPUT:** Top recommendation is "Progressive checklist (modified)" → proceed to `/requirement`.
|
|
676
|
+
|
|
677
|
+
---
|
|
678
|
+
|
|
679
|
+
## Mode Selection Guide | 模式選擇指引
|
|
680
|
+
|
|
681
|
+
The mode selection table in SKILL.md uses objective triggers to remove the
|
|
682
|
+
"which mode should I use?" decision overhead. This section explains the
|
|
683
|
+
rationale behind each rule.
|
|
684
|
+
|
|
685
|
+
SKILL.md 的模式選擇表使用客觀觸發條件,消除「我應該用哪個模式?」的決策負擔。本節說明各規則的設計理由。
|
|
686
|
+
|
|
687
|
+
### Why objective triggers instead of subjective diagnosis
|
|
688
|
+
|
|
689
|
+
The v2.0 brainstorm rebuttal session itself surfaced this problem: if users must
|
|
690
|
+
first diagnose "is my problem strategic or execution-type?", that meta-decision
|
|
691
|
+
consumes cognitive resources before the session even starts. Objective triggers
|
|
692
|
+
(word count, presence of a flag, existence of a spec) eliminate this.
|
|
693
|
+
|
|
694
|
+
### Trigger calibration
|
|
695
|
+
|
|
696
|
+
The `< 20 words` threshold is a starting heuristic, not a permanent rule.
|
|
697
|
+
After 5–10 sessions, review whether short inputs consistently led to
|
|
698
|
+
under-explored problems or whether they were legitimately simple. Adjust
|
|
699
|
+
the threshold based on observation, not intuition.
|
|
700
|
+
|
|
701
|
+
---
|
|
702
|
+
|
|
703
|
+
## Self-Evaluation Framework | 自我評估框架
|
|
704
|
+
|
|
705
|
+
Use these three metrics after every brainstorming session to build an
|
|
706
|
+
empirical record of quality over time. Do NOT evaluate v2.0 vs v1.0 based
|
|
707
|
+
on a single session — draw conclusions only after collecting at least 3
|
|
708
|
+
comparable sessions.
|
|
709
|
+
|
|
710
|
+
每次腦力激盪結束後使用這三個指標,建立長期品質紀錄。不要以單次工作階段評估 v2.0 vs v1.0,至少收集 3 次可比較的工作階段後再下結論。
|
|
711
|
+
|
|
712
|
+
### The Three Metrics | 三個指標
|
|
713
|
+
|
|
714
|
+
#### 1. Adoption Rate | 採用率
|
|
715
|
+
**Question:** Of all ideas generated in this session, how many will you actually use or investigate further?
|
|
716
|
+
|
|
717
|
+
**Scale:**
|
|
718
|
+
- 5 = 3+ ideas directly actionable
|
|
719
|
+
- 4 = 2 ideas actionable, 1+ worth exploring
|
|
720
|
+
- 3 = 1 idea actionable
|
|
721
|
+
- 2 = No idea directly actionable, but useful frames emerged
|
|
722
|
+
- 1 = Session produced nothing useful
|
|
723
|
+
|
|
724
|
+
**Why this matters:** Adoption rate is the closest proxy to "did the brainstorming solve the right problem?" It corrects for sessions that feel productive but produce ideas that are never revisited.
|
|
725
|
+
|
|
726
|
+
#### 2. Diversity | 語義多樣性
|
|
727
|
+
**Question:** Were the Extension Batch ideas (6–10) noticeably different in theme and approach from the Intuition Batch ideas (1–5)?
|
|
728
|
+
|
|
729
|
+
**Scale:**
|
|
730
|
+
- 5 = Extension Batch explored completely different problem dimensions
|
|
731
|
+
- 4 = Extension Batch had 3+ ideas that clearly crossed semantic boundaries
|
|
732
|
+
- 3 = Some extension, but most ideas were variations on Batch 1 themes
|
|
733
|
+
- 2 = Extension Batch was effectively a continuation of Batch 1
|
|
734
|
+
- 1 = No meaningful semantic difference between batches
|
|
735
|
+
|
|
736
|
+
**Why this matters:** Diversity measures whether the 10-idea gate actually pushed past the "obvious answer zone." If diversity scores are consistently low, the semantic boundary instruction in Batch 2 may need strengthening.
|
|
737
|
+
|
|
738
|
+
#### 3. Cognitive Load | 認知負擔
|
|
739
|
+
**Question:** How mentally taxing was this session? (Higher score = lower burden)
|
|
740
|
+
|
|
741
|
+
**Scale:**
|
|
742
|
+
- 5 = Session felt effortless and generative
|
|
743
|
+
- 4 = Some friction but overall productive
|
|
744
|
+
- 3 = Moderate effort, a few frustrating moments
|
|
745
|
+
- 2 = Session felt like work throughout
|
|
746
|
+
- 1 = Exhausting; would avoid repeating this format
|
|
747
|
+
|
|
748
|
+
**Why this matters:** A brainstorming method that consistently scores 1–2 on cognitive load will be abandoned in favour of informal thinking, regardless of quality improvements. Target: cognitive load ≥ 3 while adoption rate and diversity are also improving.
|
|
749
|
+
|
|
750
|
+
### Session Log Template | 工作階段記錄模板
|
|
751
|
+
|
|
752
|
+
```
|
|
753
|
+
Date: YYYY-MM-DD
|
|
754
|
+
Topic: [one sentence]
|
|
755
|
+
Mode: [Full / Quick / No-Rebuttal / Skip-Preflight]
|
|
756
|
+
Duration: [minutes]
|
|
757
|
+
|
|
758
|
+
Adoption Rate: /5 — [reason]
|
|
759
|
+
Diversity: /5 — [reason]
|
|
760
|
+
Cognitive Load: /5 — [reason]
|
|
761
|
+
|
|
762
|
+
Notable observation:
|
|
763
|
+
[One sentence on what worked or what felt wrong]
|
|
435
764
|
```
|
|
436
765
|
|
|
437
|
-
|
|
766
|
+
### Interpreting Trends | 趨勢判讀
|
|
767
|
+
|
|
768
|
+
After 3+ sessions, look for these patterns:
|
|
769
|
+
|
|
770
|
+
| Pattern | Interpretation | Action |
|
|
771
|
+
|---------|---------------|--------|
|
|
772
|
+
| Adoption Rate consistently ≤ 2 | Problem framing failing in FRAME, not technique | Spend more time on 5 Whys before diverging |
|
|
773
|
+
| Diversity consistently ≤ 2 | 10-idea gate not producing semantic breadth | Enforce explicit topic-change before Batch 2 |
|
|
774
|
+
| Cognitive Load consistently ≤ 2 | Process overhead too high for problem complexity | Switch to `--no-rebuttal` or `--quick` for lower-stakes problems |
|
|
775
|
+
| All three metrics ≥ 4 | v2.0 working well for this problem type | No change needed |
|
|
776
|
+
| Adoption Rate ↑ but Cognitive Load ↓ over sessions | Habituation — the new flow is becoming natural | Continue; occasional `--quick` refreshes |
|
|
777
|
+
|
|
778
|
+
---
|
|
779
|
+
|
|
780
|
+
## A/B Experiment Protocol | A/B 實驗協議
|
|
781
|
+
|
|
782
|
+
Use this protocol to validate whether v2.0 genuinely outperforms v1.0 for
|
|
783
|
+
your specific problem types, rather than relying on the research assumptions
|
|
784
|
+
alone.
|
|
785
|
+
|
|
786
|
+
使用此協議驗證 v2.0 是否真的對你的特定問題類型優於 v1.0,而非單純依賴研究假設。
|
|
787
|
+
|
|
788
|
+
### Protocol Design
|
|
789
|
+
|
|
790
|
+
**Duration:** 3 paired sessions (minimum)
|
|
791
|
+
**Method:** Same category of problem, alternating method
|
|
792
|
+
|
|
793
|
+
**Session pairing:**
|
|
794
|
+
```
|
|
795
|
+
Session A1: Problem of type X → v1.0 (AI generates first, no Pre-flight)
|
|
796
|
+
Session B1: Problem of type X → v2.0 (Pre-flight + full flow)
|
|
797
|
+
[one week gap]
|
|
798
|
+
Session A2: Problem of type Y → v2.0
|
|
799
|
+
Session B2: Problem of type Y → v1.0
|
|
800
|
+
[one week gap]
|
|
801
|
+
Session A3: Problem of type X → v2.0
|
|
802
|
+
Session B3: Problem of type X → v1.0
|
|
803
|
+
```
|
|
804
|
+
|
|
805
|
+
Alternating reduces order effects (learning from session A affecting session B).
|
|
806
|
+
|
|
807
|
+
**Critical: evaluate each session immediately after completion.** Do not wait — memory of cognitive load fades fastest.
|
|
808
|
+
|
|
809
|
+
### What to Measure
|
|
810
|
+
|
|
811
|
+
For each session, record:
|
|
812
|
+
|
|
813
|
+
| Measure | v1.0 session | v2.0 session |
|
|
814
|
+
|---------|-------------|-------------|
|
|
815
|
+
| Adoption Rate (1–5) | | |
|
|
816
|
+
| Diversity (1–5) | | |
|
|
817
|
+
| Cognitive Load (1–5) | | |
|
|
818
|
+
| Time to complete (min) | | |
|
|
819
|
+
| Ideas generated (count) | | |
|
|
820
|
+
| Ideas in Batch 2 that surprised you | N/A | |
|
|
821
|
+
|
|
822
|
+
### Interpreting Results
|
|
823
|
+
|
|
824
|
+
- **v2.0 wins** if Adoption Rate and Diversity are both higher, and Cognitive
|
|
825
|
+
Load difference is ≤ 1 point
|
|
826
|
+
- **v1.0 wins** if v2.0 Cognitive Load is ≥ 2 points lower *and* Adoption Rate
|
|
827
|
+
difference is < 1 point
|
|
828
|
+
- **Situational** if results differ by problem type → implement full situation
|
|
829
|
+
routing (see Mode Selection section)
|
|
830
|
+
|
|
831
|
+
### Key Hypothesis to Validate
|
|
832
|
+
|
|
833
|
+
The three research assumptions underlying v2.0 have different levels of
|
|
834
|
+
external validity risk in AI-assisted solo contexts (see Research Validity
|
|
835
|
+
Caveats section below). The A/B protocol should specifically check:
|
|
836
|
+
|
|
837
|
+
1. **Pre-flight hypothesis:** After writing 3 ideas yourself, does the AI's
|
|
838
|
+
first batch explore genuinely different territory? Track explicitly.
|
|
839
|
+
2. **10-idea gate hypothesis:** Are ideas 7–10 consistently more diverse or
|
|
840
|
+
useful than ideas 1–3? Review the session log after each run.
|
|
841
|
+
3. **Rebuttal hypothesis:** After the rebuttal round, did you actually modify
|
|
842
|
+
or discard any ideas? If never, the rebuttal round may not be adding value
|
|
843
|
+
for your problem types.
|
|
844
|
+
|
|
845
|
+
---
|
|
846
|
+
|
|
847
|
+
## Research Validity Caveats | 研究效度說明
|
|
848
|
+
|
|
849
|
+
v2.0 is grounded in three research findings. Each has a different level of
|
|
850
|
+
external validity risk when applied to AI-assisted single-user brainstorming.
|
|
851
|
+
Understand the limitations before treating the research as settled fact.
|
|
852
|
+
|
|
853
|
+
v2.0 基於三項研究發現。每項在應用於 AI 輔助單人腦力激盪時,有不同程度的外部效度風險。在將研究視為確定事實之前,請理解其局限性。
|
|
854
|
+
|
|
855
|
+
### Assumption 1: AI output anchors thinking (NGT basis)
|
|
856
|
+
|
|
857
|
+
**Original finding:** Nominal Group Technique shows that individuals brainstorming
|
|
858
|
+
separately then merging outperform interacting groups. Mechanism: production
|
|
859
|
+
blocking and conformity pressure in groups.
|
|
860
|
+
|
|
861
|
+
**Application to v2.0:** Pre-flight assumes AI output anchors your thinking the
|
|
862
|
+
same way a dominant human voice does in a group.
|
|
863
|
+
|
|
864
|
+
**External validity risk: MEDIUM**
|
|
865
|
+
|
|
866
|
+
The anchoring mechanism may differ. Human social anchoring involves
|
|
867
|
+
conformity pressure and real-time interruption. AI output is static text you
|
|
868
|
+
can choose to ignore. The question is whether *reading* AI output before writing
|
|
869
|
+
your own ideas meaningfully narrows your semantic exploration.
|
|
870
|
+
|
|
871
|
+
**How to validate:** In your A/B experiment, after v2.0 sessions, note whether
|
|
872
|
+
the AI's first batch was genuinely different from your Pre-flight ideas. If yes,
|
|
873
|
+
Pre-flight is working. If the AI frequently produces ideas similar to yours
|
|
874
|
+
anyway, Pre-flight's value may be in forcing you to articulate your starting
|
|
875
|
+
point — a different but still valid benefit.
|
|
876
|
+
|
|
877
|
+
### Assumption 2: Best ideas appear in the second half (Nijstad basis)
|
|
878
|
+
|
|
879
|
+
**Original finding:** Nijstad et al. show that early ideas in brainstorming
|
|
880
|
+
sessions are the most conventional. Creative ideas emerge after the obvious
|
|
881
|
+
zone is exhausted.
|
|
882
|
+
|
|
883
|
+
**Application to v2.0:** The 10-idea minimum gate and semantic batching assume
|
|
884
|
+
this temporal pattern holds in AI-assisted contexts.
|
|
885
|
+
|
|
886
|
+
**External validity risk: LOW**
|
|
887
|
+
|
|
888
|
+
This finding is about cognitive pattern (exhausting obvious associations first),
|
|
889
|
+
not about group dynamics. It is more likely to transfer to AI-assisted solo
|
|
890
|
+
contexts because the underlying mechanism (semantic network traversal) applies
|
|
891
|
+
regardless of whether a human or AI is generating ideas.
|
|
892
|
+
|
|
893
|
+
**How to validate:** After each session, review ideas 1–5 vs ideas 7–10. Are
|
|
894
|
+
the later ideas consistently less obvious? If yes, the gate is earning its keep.
|
|
895
|
+
|
|
896
|
+
### Assumption 3: Debate produces better ideas (Nemeth basis)
|
|
897
|
+
|
|
898
|
+
**Original finding:** Nemeth (1995) shows that groups instructed to debate
|
|
899
|
+
produce more and better ideas than groups following "no criticism" rules.
|
|
900
|
+
|
|
901
|
+
**Application to v2.0:** The Rebuttal Round assumes that structured debate with
|
|
902
|
+
AI counterarguments improves idea quality.
|
|
903
|
+
|
|
904
|
+
**External validity risk: HIGH**
|
|
905
|
+
|
|
906
|
+
This is the most tenuous transfer. Nemeth's finding is about *genuine
|
|
907
|
+
disagreement between people with different perspectives and stakes*. An AI
|
|
908
|
+
playing "devil's advocate" on demand may produce counterarguments that are
|
|
909
|
+
formally correct but lack the authentic dissent that drives Nemeth's effect.
|
|
910
|
+
The counterarguments may feel adversarial without being genuinely revelatory.
|
|
911
|
+
|
|
912
|
+
**How to validate:** After each rebuttal round, honestly assess: did the
|
|
913
|
+
counterarguments surface something you had genuinely not considered? If the
|
|
914
|
+
answer is consistently "no, I had already thought of this", the rebuttal round
|
|
915
|
+
may be providing false confidence rather than genuine challenge. Consider
|
|
916
|
+
switching to `--no-rebuttal` for well-understood problem domains.
|
|
917
|
+
|
|
918
|
+
---
|
|
919
|
+
|
|
920
|
+
## Gradual Adoption Protocol | 漸進採用協議
|
|
921
|
+
|
|
922
|
+
If the full v2.0 flow feels overwhelming, adopt the three changes in sequence
|
|
923
|
+
rather than all at once. Spend two weeks on each phase before adding the next.
|
|
924
|
+
|
|
925
|
+
如果完整 v2.0 流程感覺負擔過重,按順序逐步採用三個改動,而非一次全部導入。每個階段使用兩週再加入下一個。
|
|
926
|
+
|
|
927
|
+
### Phase 1: Pre-flight only (weeks 1–2)
|
|
928
|
+
|
|
929
|
+
Run `/brainstorm --no-rebuttal [topic]` with Pre-flight enabled but skip
|
|
930
|
+
rebuttal. Focus on: does writing 3 ideas first change what the AI produces?
|
|
931
|
+
|
|
932
|
+
### Phase 2: Add the 10-idea gate (weeks 3–4)
|
|
438
933
|
|
|
439
|
-
|
|
440
|
-
|
|
441
|
-
| 1 | Contextual tooltips triggered by user behavior | HMW-1 |
|
|
442
|
-
| 2 | Weekly "Did you know?" email with one feature | HMW-1 |
|
|
443
|
-
| 3 | Achievement system with unlock badges | HMW-3 |
|
|
444
|
-
| 4 | Replace wizard with progressive checklist | SCAMPER-S |
|
|
445
|
-
| 5 | Combine onboarding with first real task | SCAMPER-C |
|
|
446
|
-
| 6 | Adapt Duolingo's streak system | SCAMPER-A |
|
|
447
|
-
| 7 | Minimize onboarding to 1 question: "What's your goal?" | SCAMPER-M |
|
|
448
|
-
| 8 | Eliminate signup wall, let users try first | SCAMPER-E |
|
|
449
|
-
| 9 | Reverse: let power users mentor new users | SCAMPER-R |
|
|
934
|
+
Continue with Pre-flight. Now enforce the 10-idea minimum. Focus on: are ideas
|
|
935
|
+
7–10 better than ideas 1–5?
|
|
450
936
|
|
|
451
|
-
|
|
937
|
+
### Phase 3: Add the Rebuttal Round (weeks 5–6)
|
|
452
938
|
|
|
453
|
-
|
|
454
|
-
|
|
455
|
-
| 7 | Goal-based onboarding | 5 | 5 | 4 | 5 | **4.8** |
|
|
456
|
-
| 1 | Contextual tooltips | 4 | 4 | 3 | 5 | **4.0** |
|
|
457
|
-
| 5 | Onboarding via real task | 3 | 5 | 2 | 5 | **3.8** |
|
|
939
|
+
Run the full v2.0 flow. Focus on: do the counterarguments surface issues you
|
|
940
|
+
had not considered?
|
|
458
941
|
|
|
459
|
-
|
|
942
|
+
After 6 weeks, review session logs and the A/B experiment data to calibrate
|
|
943
|
+
which combination works best for your problem types.
|
|
460
944
|
|
|
461
945
|
---
|
|
462
946
|
|
|
@@ -464,19 +948,23 @@ HMW Questions:
|
|
|
464
948
|
|
|
465
949
|
### Do's
|
|
466
950
|
|
|
467
|
-
-
|
|
468
|
-
|
|
469
|
-
-
|
|
470
|
-
-
|
|
471
|
-
-
|
|
951
|
+
- Complete Pre-flight before starting — it takes 3 minutes and dramatically
|
|
952
|
+
improves idea diversity
|
|
953
|
+
- Run all the way to 10+ ideas — the best ideas appear late
|
|
954
|
+
- Take the rebuttal round seriously — vague defences are a warning sign
|
|
955
|
+
- Record three evaluation metrics after every session
|
|
956
|
+
- Use `--quick` for time-constrained situations, full mode for important decisions
|
|
957
|
+
- Review session log trends after every 3 sessions
|
|
472
958
|
|
|
473
959
|
### Don'ts
|
|
474
960
|
|
|
961
|
+
- Don't read the AI output before writing your Pre-flight ideas
|
|
475
962
|
- Don't evaluate during DIVERGE phase
|
|
476
|
-
- Don't
|
|
963
|
+
- Don't stop at 5 ideas — push through the "obvious answer zone"
|
|
964
|
+
- Don't accept vague AI counterarguments ("this might be hard") — insist on
|
|
965
|
+
specific failure conditions
|
|
477
966
|
- Don't skip the 5 Whys — surface-level problems lead to surface-level solutions
|
|
478
|
-
- Don't
|
|
479
|
-
- Don't force all ideas through the same technique
|
|
967
|
+
- Don't draw conclusions from a single session — wait for 3+ data points
|
|
480
968
|
|
|
481
969
|
---
|
|
482
970
|
|
|
@@ -494,6 +982,8 @@ HMW Questions:
|
|
|
494
982
|
|
|
495
983
|
| Version | Date | Changes |
|
|
496
984
|
|---------|------|---------|
|
|
985
|
+
| 2.1.0 | 2026-05-09 | XSPEC-196 Phase 2: Mode Selection objective routing table; Self-Evaluation Framework (3 metrics + session log template + trend interpretation); A/B Experiment Protocol; Research Validity Caveats (3 assumptions with external validity risk ratings); Gradual Adoption Protocol (3-phase, 6-week plan) |
|
|
986
|
+
| 2.0.0 | 2026-05-09 | XSPEC-196: Phase 0 Pre-flight (anti-anchoring), Rebuttal Round (Nemeth Protocol), 10-idea minimum gate + semantic batching; research foundations section added |
|
|
497
987
|
| 1.0.0 | 2026-02-12 | Initial release |
|
|
498
988
|
|
|
499
989
|
---
|