mindsystem-cc 3.17.1 → 3.18.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (37) hide show
  1. package/agents/ms-consolidator.md +4 -4
  2. package/agents/ms-debugger.md +3 -3
  3. package/agents/ms-designer.md +33 -70
  4. package/agents/ms-executor.md +7 -6
  5. package/agents/ms-plan-writer.md +52 -26
  6. package/agents/ms-researcher.md +13 -13
  7. package/commands/ms/check-phase.md +1 -1
  8. package/commands/ms/complete-milestone.md +47 -54
  9. package/commands/ms/design-phase.md +33 -30
  10. package/commands/ms/review-design.md +106 -395
  11. package/mindsystem/references/principles.md +3 -3
  12. package/mindsystem/references/routing/next-phase-routing.md +1 -1
  13. package/mindsystem/references/scope-estimation.md +22 -35
  14. package/mindsystem/templates/design-iteration.md +13 -13
  15. package/mindsystem/templates/design.md +145 -327
  16. package/mindsystem/templates/knowledge.md +1 -1
  17. package/mindsystem/templates/milestone-archive.md +3 -3
  18. package/mindsystem/templates/phase-prompt.md +6 -7
  19. package/mindsystem/templates/research-subagent-prompt.md +2 -2
  20. package/mindsystem/templates/research.md +7 -7
  21. package/mindsystem/templates/roadmap.md +1 -1
  22. package/mindsystem/templates/verification-report.md +1 -1
  23. package/mindsystem/workflows/complete-milestone.md +52 -227
  24. package/mindsystem/workflows/discuss-phase.md +3 -3
  25. package/mindsystem/workflows/execute-plan.md +1 -1
  26. package/mindsystem/workflows/plan-phase.md +22 -50
  27. package/mindsystem/workflows/verify-phase.md +1 -1
  28. package/package.json +1 -1
  29. package/scripts/archive-milestone-files.sh +68 -0
  30. package/scripts/archive-milestone-phases.sh +138 -0
  31. package/scripts/gather-milestone-stats.sh +179 -0
  32. package/scripts/ms-lookup/ms_lookup/backends/context7.py +17 -5
  33. package/scripts/ms-lookup/ms_lookup/backends/perplexity.py +17 -3
  34. package/scripts/ms-lookup-wrapper.sh +1 -1
  35. package/scripts/scan-planning-context.py +186 -36
  36. package/scripts/validate-execution-order.sh +4 -5
  37. package/scripts/cleanup-phase-artifacts.sh +0 -68
@@ -1,7 +1,7 @@
1
1
  ---
2
2
  name: ms:review-design
3
- description: Audit and improve design of already-implemented features using quality-forcing principles
4
- argument-hint: "[phase, file path, feature name, or description]"
3
+ description: Review a screen's design like a designer friend would — focused, practical, high-impact
4
+ argument-hint: "[screenshot path, file path, feature name, or description]"
5
5
  allowed-tools:
6
6
  - Read
7
7
  - Write
@@ -9,27 +9,23 @@ allowed-tools:
9
9
  - Bash
10
10
  - Glob
11
11
  - Grep
12
+ - Skill
12
13
  - AskUserQuestion
13
14
  ---
14
15
 
15
16
  <objective>
16
- Review and improve the design of already-implemented features, applying the same quality-forcing principles used in `/ms:design-phase`.
17
+ Review the design of an existing screen or flow the way a designer friend would: understand what the screen is trying to do, simulate the user experience, and suggest the 3-5 highest-impact improvements.
18
+
19
+ Not a comprehensive audit. Not a redesign. A focused review that finds what matters most and fixes it.
17
20
 
18
21
  **When to use:**
19
- - Features implemented before Mindsystem was added (no DESIGN.md exists)
20
- - Features where `/ms:design-phase` was skipped
21
- - Periodic design quality audits on existing code
22
- - After receiving user feedback that UI "feels off"
23
-
24
- **What this does:**
25
- 1. Identifies target code from arguments
26
- 2. Analyzes against design quality criteria
27
- 3. Presents improvements with benefits and trade-offs
28
- 4. Applies user-approved changes
29
- 5. Runs platform-specific verification
30
- 6. Creates DESIGN-REVIEW.md report
31
-
32
- **Not a replacement for:** `/ms:design-phase` — use that BEFORE implementing new features.
22
+ - A screen "feels off" but you can't pinpoint why
23
+ - After implementing a feature, before shipping
24
+ - Periodic quality pass on existing screens
25
+
26
+ **Input modes:**
27
+ - **Screenshot + code** (best): Provide a screenshot path. Visual input lets you reason about hierarchy, whitespace, and flow directly instead of reconstructing from code.
28
+ - **Code only** (good): Reconstruct the screen mentally from the widget/component tree and reason about the user experience from there.
33
29
  </objective>
34
30
 
35
31
  <context>
@@ -42,447 +38,162 @@ $ARGUMENTS
42
38
 
43
39
  <process>
44
40
 
45
- ## Phase 1: Identify Target Code
46
-
47
- ### Step 1.1: Parse Arguments
41
+ ## Step 1: Identify Target
48
42
 
49
- Analyze `$ARGUMENTS` to determine what code to review:
43
+ Parse `$ARGUMENTS`:
50
44
 
51
- - **Phase number** (e.g., `4`, `04`) → Find phase directory, read SUMMARY.md files for implemented code
52
- - **File path** (e.g., `lib/features/home/home_screen.dart`) → Read and analyze that file
53
- - **Feature/area name** (e.g., `home feature`, `authentication`) → Search for relevant files in that area
54
- - **Description** (e.g., `the code I just wrote`) Check recent git changes or conversation context
55
- - **Empty or unclear** → Use AskUserQuestion:
45
+ - **Screenshot path** (image file) → Read the image. Identify and read the corresponding code files.
46
+ - **File path** → Read that file and related files in the same feature (widgets, providers, models)
47
+ - **Feature/area name** → Search for relevant files
48
+ - **Phase number** Find phase directory, read SUMMARY.md for implemented files
49
+ - **Description** → Check recent git changes or conversation context
50
+ - **Empty or unclear** → AskUserQuestion:
56
51
 
57
52
  ```
58
- Question: "What code should I review for design improvements?"
53
+ Question: "What should I review?"
59
54
  Options:
60
- - "Uncommitted changes" - Review files with uncommitted modifications
61
- - "Specific file" - I'll provide a file path
62
- - "Recent feature work" - Review files from recent commits
63
- - "Mindsystem phase" - Review all code from a specific phase
64
- ```
65
-
66
- ### Step 1.2: Resolve Phase Directory (if phase-based)
67
-
68
- ```bash
69
- # Find phase directory
70
- PHASE_DIR=$(ls -d .planning/phases/${PHASE_ARG}-* 2>/dev/null | head -1)
71
-
72
- # Find SUMMARY files to identify implemented code
73
- ls "$PHASE_DIR"/*-SUMMARY.md 2>/dev/null
55
+ - "I'll share a screenshot" - Paste or provide a path to a screenshot
56
+ - "Specific screen" - I'll provide a file path or feature name
57
+ - "Recent work" - Review files from recent commits
74
58
  ```
75
59
 
76
- Parse SUMMARY.md files to extract:
77
- - Files created/modified
78
- - Features implemented
79
- - Components added
80
-
81
- ### Step 1.3: Gather Target Files
82
-
83
- Based on scope:
84
- - Read target file(s)
85
- - For features, also read related files (widgets, providers, models)
86
- - For phases, collect all files mentioned in SUMMARY.md
87
-
88
- Store file list for analysis.
89
-
90
- ## Phase 2: Load Context Chain
91
-
92
- ### Step 2.1: Mandatory Context
60
+ If no screenshot was provided, ask once:
93
61
 
94
- ```bash
95
- # Load PROJECT.md for product context
96
- cat .planning/PROJECT.md 2>/dev/null
97
-
98
- # Load ROADMAP.md for phase requirements (if phase-based)
99
- grep -A30 "Phase ${PHASE_ARG}:" .planning/ROADMAP.md 2>/dev/null
100
62
  ```
101
-
102
- Extract from PROJECT.md:
103
- - What This Is (product type determines commercial benchmark)
104
- - Core Value (design must serve this)
105
- - Context (target audience)
106
- - Constraints (platform, technical limits)
107
-
108
- ### Step 2.2: Check for Existing DESIGN.md
109
-
110
- ```bash
111
- # Check for existing design
112
- ls .planning/phases/${PHASE_ARG}-*/*-DESIGN.md 2>/dev/null
63
+ Question: "Do you have a screenshot? It significantly improves review quality."
64
+ Options:
65
+ - "Yes, let me share it" - Paste or provide a path
66
+ - "No, review from code" - Analyze implementation files directly
113
67
  ```
114
68
 
115
- **If exists:** Load as baseline for comparison
116
- **If not exists:** Flag for retroactive creation
117
-
118
- ### Step 2.3: Optional Context
119
-
120
- **Project UI skill (if exists):**
121
-
122
- Check your available skills for one that provides domain expertise relevant to this project's UI implementation patterns. If found, invoke it via the Skill tool and use as authoritative source for existing patterns.
123
-
124
- **Codebase analysis:**
125
- - Detect platform (Flutter, React, etc.)
126
- - Find existing component/theme files
127
- - Document discovered patterns
128
-
129
- ## Phase 3: Analyze for Design Improvements
130
-
131
- Review target code against these quality dimensions:
132
-
133
- ### 3.1: Visual Quality
69
+ Read all relevant code files for the target screen/flow.
134
70
 
135
- Apply quality-forcing patterns from `ai-driven-ui-design-system.md`:
71
+ ## Step 2: Understand the Screen's Job
136
72
 
137
- **Commercial benchmark check:**
138
- > "Does this look like a commercial $50-200 [product type] — intentional decisions, not defaults?"
73
+ Before analyzing, establish what this screen is trying to accomplish.
139
74
 
140
- **Anti-pattern detection:**
141
- - Generic dark gray with blue accents (unless specifically requested)
142
- - Default spacing with no intentional rhythm
143
- - Controls that look like styled HTML inputs
144
- - Typography using only system fonts without spacing compensation
145
- - Elements that appear positioned without thought
75
+ **Load context:**
76
+ - Read `.planning/PROJECT.md` for product type, core value, target audience
77
+ - If phase-based, grep `.planning/ROADMAP.md` for phase requirements
78
+ - Glob for existing DESIGN.md in `.planning/phases/` for design intent
79
+ - Check available skills for one matching `*-ui` or `*-design` in name or description — if found, invoke via Skill tool for existing patterns
146
80
 
147
- **Check for:**
148
- - Color palette character (distinctive vs generic)
149
- - Intentional spacing (consistent scale vs arbitrary)
150
- - Visual hierarchy (size/weight/contrast differentiation)
151
- - Refined controls (proper states, sizing, polish)
81
+ **State the screen's job in one sentence:**
82
+ > "This screen helps the user [accomplish X]."
152
83
 
153
- ### 3.2: UX Flows
84
+ This sentence filters every subsequent suggestion. An improvement that doesn't make the screen better at its job doesn't make the cut.
154
85
 
155
- **Check for:**
156
- - Complete user journeys (entry → action → feedback → completion)
157
- - Loading states (skeleton, spinner, or progressive)
158
- - Error states (clear messages, recovery path)
159
- - Empty states (helpful guidance when no data)
160
- - Edge cases (offline, timeout, validation)
86
+ ## Step 3: Experience Simulation
161
87
 
162
- ### 3.3: Platform Compliance
88
+ Walk through the screen as a user. This is the core analysis — a simulation, not a checklist. Do not organize findings by design categories.
163
89
 
164
- **Flutter/Mobile:**
165
- - Touch targets 48dp (Android) / 44pt (iOS)
166
- - Safe area insets respected
167
- - Text sizes 16sp for body text
168
- - Contrast ratios meet WCAG AA (4.5:1 body, 3:1 large)
90
+ **If screenshot available**, look at it and answer:
91
+ 1. What draws my eye first? Is that the most important element on this screen?
92
+ 2. Can I immediately tell what this screen wants me to do?
93
+ 3. Where does my attention flow after the first element? Is the reading order clear?
94
+ 4. What feels cluttered, or where are elements competing for attention that shouldn't be?
95
+ 5. Is there enough breathing room around important content?
96
+ 6. Does the text help me understand what to do, or is it vague/jargon-heavy?
97
+ 7. Are unrelated elements sitting close enough to feel grouped, or related elements spread far enough apart to feel disconnected?
98
+ 8. Does this feel like a native app screen, or like a web form wearing a native shell?
99
+ 9. If I'm a new user, would I know how to complete the primary action?
169
100
 
170
- **Web:**
171
- - Touch targets 44px for interactive elements
172
- - Keyboard navigation functional
173
- - Focus states visible
174
- - Responsive breakpoints defined
101
+ **If code only**, reconstruct the screen from the widget/component tree, then answer the same questions. Pay special attention to:
102
+ - Widget nesting depth as proxy for visual complexity
103
+ - Padding/margin values for whitespace
104
+ - String literals for copy quality
105
+ - Button/action placement for flow
106
+ - Conditional rendering for state completeness (loading, empty, error)
175
107
 
176
- ### 3.4: Code Organization
108
+ ## Step 4: Top 3-5 Improvements
177
109
 
178
- **Flutter-specific (from simplify-flutter patterns):**
179
- - Large `build()` methods → extract to local variables or builder methods
180
- - Complex subtrees → separate widget files
181
- - Scattered boolean flags → sealed class variants
182
- - Manual try-catch → `AsyncValue.guard()`
183
- - Mutation patterns → immutable methods
110
+ From the simulation, select only the 3-5 changes with the highest user impact.
184
111
 
185
- **General:**
186
- - Consistent naming conventions
187
- - Related logic grouped together
188
- - Clear separation of concerns
112
+ **Filtering question:** "If I could only change ONE thing, what would make the biggest difference to a user?" Then the next, then the next. Stop at 5.
189
113
 
190
- ### 3.5: Pattern Alignment
114
+ **Priority order:**
115
+ 1. **Flow friction** — user can't complete their primary task intuitively
116
+ 2. **Hierarchy/attention** — wrong thing dominates, or user can't find what they need
117
+ 3. **Density/whitespace** — screen feels cramped, overwhelming, or unbalanced
118
+ 4. **Copy/labeling** — user doesn't understand what something means or what to do next
119
+ 5. **Missing states** — user hits a dead end (no loading feedback, unhelpful empty state, cryptic error)
191
120
 
192
- If project UI skill or codebase patterns exist:
193
- - Check for consistent color usage
194
- - Check for consistent component patterns
195
- - Check for consistent spacing scale
196
- - Identify deviations that should be aligned
197
-
198
- ## Phase 4: Create Retroactive DESIGN.md (If Missing)
199
-
200
- **If no existing DESIGN.md:**
201
-
202
- Document current implementation patterns:
203
-
204
- ```bash
205
- mkdir -p "$PHASE_DIR"
206
- ```
207
-
208
- Create `{phase}-DESIGN.md` following the standard template:
209
-
210
- 1. **Visual Identity** — Extract philosophy from implemented code
211
- 2. **Screen Layouts** — Create ASCII wireframes from actual screens
212
- 3. **Component Specifications** — Document existing component patterns
213
- 4. **UX Flows** — Map current user journeys
214
- 5. **Design System Decisions** — Extract colors, typography, spacing with rationale
215
- 6. **Platform-Specific Notes** — Document current responsive/platform handling
216
- 7. **Verification Criteria** — Define observable behaviors
217
-
218
- Write to: `.planning/phases/{phase}-{slug}/{phase}-DESIGN.md`
219
-
220
- ## Phase 5: Present Improvements
221
-
222
- For each identified improvement, document:
223
-
224
- ### Improvement Format
121
+ **Frame each as a user experience story:**
225
122
 
226
123
  ```markdown
227
- ### [Category]: [Brief Description]
228
-
229
- **Current state:**
230
- [What exists now]
124
+ ### 1. [Brief description of what changes]
231
125
 
232
- **Proposed change:**
233
- [What should change]
126
+ **What a user experiences:** [The friction or confusion from the user's perspective]
234
127
 
235
- **Benefits:**
236
- - [Benefit 1]
237
- - [Benefit 2]
128
+ **Suggested fix:** [Specific, implementable change — layout shift, text rewrite, element repositioning, spacing adjustment, visual weight change]
238
129
 
239
- **Trade-offs:**
240
- - [Any functionality lost or changed]
241
- - [Migration effort if significant]
242
- - [Risk factors]
243
-
244
- **Affected files:**
245
- - [file1.dart]
246
- - [file2.dart]
247
-
248
- **Effort:** [low | medium | high]
130
+ **Files:** `path/to/file` (lines ~N-M)
249
131
  ```
250
132
 
251
- ### Categories
252
-
253
- Group improvements by:
254
- 1. **Visual Quality** — Color, spacing, typography, hierarchy
255
- 2. **UX Completeness** — States, flows, error handling
256
- 3. **Platform Compliance** — Touch targets, accessibility
257
- 4. **Code Organization** — Widget extraction, pattern consistency
258
- 5. **Pattern Alignment** — Consistency with existing design system
133
+ Good framing: "The primary action button competes with 3 secondary actions at equal visual weight — a user has to read all four to find what they came here to do."
259
134
 
260
- ### Present to User
135
+ Bad framing: "Button color should be changed from gray to blue for better contrast."
261
136
 
262
- Display all improvements with summary:
137
+ ## Step 5: Present and Confirm
263
138
 
264
139
  ```markdown
265
- ## Design Review: [Scope]
140
+ ## Design Review: [Screen/Feature Name]
266
141
 
267
- Found **N improvements** across M files.
142
+ **Screen's job:** [one-sentence job]
268
143
 
269
- ### Summary
144
+ **Top [N] improvements** (highest user impact first):
270
145
 
271
- | Category | Count | Effort |
272
- |----------|-------|--------|
273
- | Visual Quality | X | [avg effort] |
274
- | UX Completeness | X | [avg effort] |
275
- | Platform Compliance | X | [avg effort] |
276
- | Code Organization | X | [avg effort] |
277
- | Pattern Alignment | X | [avg effort] |
278
-
279
- ### Improvements
280
-
281
- [List all improvements with full details]
146
+ [improvements from Step 4]
282
147
  ```
283
148
 
284
- ### User Selection
285
-
286
- Use AskUserQuestion to confirm scope:
149
+ Use AskUserQuestion:
287
150
 
288
151
  ```
289
152
  Question: "Which improvements should I apply?"
290
153
  Options:
291
- - "All improvements" - Apply everything identified
292
- - "Visual + UX only" - Focus on user-facing changes
293
- - "Platform compliance only" - Fix accessibility/sizing issues
294
- - "Let me select" - I'll specify which ones
154
+ - "Apply all" - Implement all suggested changes
155
+ - "Let me pick" - I'll specify which ones
156
+ - "Report only" - No code changes needed
295
157
  ```
296
158
 
297
- If "Let me select": Present numbered list, ask for comma-separated numbers.
298
-
299
- ## Phase 6: Apply Approved Changes
159
+ ## Step 6: Apply Approved Changes
300
160
 
301
161
  For each approved improvement:
162
+ 1. Read current file state
163
+ 2. Apply the change via Edit tool
164
+ 3. Log what changed
302
165
 
303
- 1. **Read current file state**
304
- 2. **Apply the change** using Edit tool
305
- 3. **Log what was changed** for summary
306
-
307
- **Apply principles:**
308
- - One logical change at a time
309
- - Preserve all public APIs
310
- - Maintain existing test coverage expectations
311
- - Don't introduce new dependencies without flagging
312
-
313
- ## Phase 7: Run Verification
314
-
315
- ### Step 7.1: Detect Platform
316
-
317
- ```bash
318
- # Flutter
319
- if [ -f "pubspec.yaml" ]; then
320
- echo "Platform: Flutter"
321
- fi
166
+ One logical change at a time. Preserve public APIs and test expectations.
322
167
 
323
- # Web (Node/npm)
324
- if [ -f "package.json" ]; then
325
- echo "Platform: Web"
326
- fi
327
- ```
328
-
329
- ### Step 7.2: Run Platform-Specific Checks
330
-
331
- **Flutter:**
332
- ```bash
333
- flutter analyze
334
- flutter test
335
- ```
336
-
337
- **Web (React/Next/etc.):**
338
- ```bash
339
- npm run lint 2>/dev/null || npx eslint . 2>/dev/null
340
- npm test 2>/dev/null || echo "No test script"
341
- ```
168
+ ## Step 7: Verify and Summarize
342
169
 
343
- ### Step 7.3: Handle Failures
170
+ Run platform-appropriate static analysis and tests. If failures occur, identify which change caused it and offer to revert.
344
171
 
345
- If verification fails:
346
- 1. Identify which change caused the failure
347
- 2. Offer to revert that specific change
348
- 3. Re-run verification
349
-
350
- ## Phase 8: Write Design Review Report
351
-
352
- Write to: `.planning/phases/{phase}-{slug}/{phase}-DESIGN-REVIEW.md`
172
+ Present:
353
173
 
354
174
  ```markdown
355
- ---
356
- status: complete
357
- phase: XX-name
358
- reviewed: [ISO timestamp]
359
- scope: [files/features reviewed]
360
- ---
361
-
362
- # Design Review: Phase [X] - [Name]
363
-
364
- **Reviewed:** [date]
365
- **Scope:** [what was reviewed]
366
- **Platform:** [detected platform]
367
-
368
- ## Review Summary
369
-
370
- | Category | Found | Applied | Skipped |
371
- |----------|-------|---------|---------|
372
- | Visual Quality | X | X | X |
373
- | UX Completeness | X | X | X |
374
- | Platform Compliance | X | X | X |
375
- | Code Organization | X | X | X |
376
- | Pattern Alignment | X | X | X |
377
-
378
- **Total:** X improvements identified, Y applied, Z skipped
379
-
380
- ## Changes Applied
381
-
382
- ### 1. [Category]: [Description]
383
-
384
- **File:** `path/to/file`
385
- **Change:** [what was changed]
386
- **Rationale:** [why this improves design]
387
-
388
- ### 2. [Next change...]
389
-
390
- ## Improvements Skipped
391
-
392
- [If any were skipped by user choice]
393
-
394
- ### 1. [Category]: [Description]
395
-
396
- **Reason skipped:** [user's reason or "Not selected"]
397
-
398
- ## Verification Results
175
+ ## Review Complete
399
176
 
400
- - **flutter analyze:** [pass/fail]
401
- - **flutter test:** [pass/fail]
177
+ **Screen:** [name]
178
+ **Applied:** N improvements
402
179
 
403
- [If failures, document what was reverted]
180
+ ### Changes
181
+ - `path/to/file`: [what changed]
404
182
 
405
- ## Remaining Recommendations
406
-
407
- [Any improvements that couldn't be auto-applied or need manual attention]
408
-
409
- ## Retroactive Documentation
410
-
411
- [If DESIGN.md was created]
412
- - Created: `.planning/phases/{phase}-{slug}/{phase}-DESIGN.md`
413
- - Documents current implementation patterns for future reference
414
-
415
- ---
416
-
417
- *Review completed: [timestamp]*
183
+ ### Next
184
+ Run the app and check this screen — [describe the expected visual difference in one sentence].
418
185
  ```
419
186
 
420
- ## Phase 9: Present Results
187
+ ## Step 8: Update Last Command
421
188
 
422
- ```markdown
423
- ## Design Review Complete
424
-
425
- **Scope:** [what was reviewed]
426
- **Improvements:** X applied, Y skipped
427
-
428
- ### Verification
429
- - flutter analyze: [pass/fail]
430
- - flutter test: [pass/fail]
431
-
432
- ### Files Modified
433
- - [list of files changed]
434
-
435
- ### Report
436
- `.planning/phases/{phase}-{slug}/{phase}-DESIGN-REVIEW.md`
437
-
438
- ---
439
-
440
- ## Next Steps
441
-
442
- - Review changes in editor
443
- - Run app to verify visual changes
444
- - Commit when satisfied: `git add . && git commit -m "design: apply review improvements"`
445
- ```
446
-
447
- ## Phase 10: Update Last Command
448
-
449
- Update `.planning/STATE.md` Last Command field:
450
- - Find line starting with `Last Command:` in Current Position section
451
- - Replace with: `Last Command: ms:review-design $ARGUMENTS | YYYY-MM-DD HH:MM`
452
- - If line doesn't exist, add it after `Status:` line
189
+ Run: `sed -i '' "s/^Last Command:.*/Last Command: ms:review-design $ARGUMENTS | $(date '+%Y-%m-%d %H:%M')/" .planning/STATE.md`
453
190
 
454
191
  </process>
455
192
 
456
- <quality_forcing>
457
- Apply these patterns throughout the review:
458
-
459
- **Commercial Benchmark:**
460
- > "This must look like a commercial $50-200 [product type] — intentional decisions, not defaults."
461
-
462
- **Pre-emptive Criticism:**
463
- > "Assume someone will say 'This looks like generic AI output.' Does the current design prove them wrong?"
464
-
465
- **Accountability Check:**
466
- > "Could a professional designer look at this and say it's skilled work?"
467
-
468
- **Explicit Anti-Patterns:**
469
- - Generic dark gray + blue accents
470
- - Default spacing with no rhythm
471
- - Controls that look like styled HTML inputs
472
- - Same-sized everything (no hierarchy)
473
- - Centered everything (lazy layout)
474
- </quality_forcing>
475
-
476
193
  <success_criteria>
477
- - [ ] Target code scope clarified (via arguments or AskUserQuestion)
478
- - [ ] Context chain loaded (PROJECT.md, DESIGN.md if exists, project UI skill)
479
- - [ ] Code analyzed across all quality dimensions
480
- - [ ] Retroactive DESIGN.md created if missing
481
- - [ ] Improvements presented with benefits AND trade-offs
482
- - [ ] User selected which improvements to apply
483
- - [ ] Approved changes applied to code
484
- - [ ] Platform-specific verification run (analyze/test)
485
- - [ ] Failures handled (revert problematic changes)
486
- - [ ] DESIGN-REVIEW.md report written to phase directory
487
- - [ ] Clear summary provided with next steps
194
+ - [ ] Improvements framed as user experience stories, not property changes
195
+ - [ ] Screen's job identified in one sentence every suggestion filtered through it
196
+ - [ ] Experience simulation performed (screenshot or code-based walkthrough)
197
+ - [ ] Maximum 5 improvements, prioritized by user impact
198
+ - [ ] Approved changes applied and verified
488
199
  </success_criteria>
@@ -33,10 +33,10 @@ Plans must complete within reasonable context usage.
33
33
  - 50-70% context: Degrading quality
34
34
  - 70%+ context: Poor quality
35
35
 
36
- **Solution:** Aggressive atomicity - split into small, focused plans.
37
- - 2-3 tasks per plan maximum
36
+ **Solution:** Budget-aware consolidation - prefer larger plans, calibrate from real data.
37
+ - Budget-based grouping (sum of weights within 45%)
38
38
  - Each plan independently executable
39
- - Better to have many small plans than few large ones
39
+ - Consolidate plans under ~10% with related work
40
40
  </scope_control>
41
41
 
42
42
  <claude_automates>
@@ -66,9 +66,9 @@ ELSE:
66
66
 
67
67
  | Pre-work | Status | Topics/Focus |
68
68
  |----------|--------|--------------|
69
- | Research | Likely | {research topics} |
70
69
  | Discuss | Likely | {discuss topics} |
71
70
  | Design | Likely | {design focus} |
71
+ | Research | Likely | {research topics} |
72
72
 
73
73
  {If all Unlikely: "No pre-work flagged — ready to plan directly."}
74
74