@exaudeus/workrail 3.27.0 → 3.29.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (160) hide show
  1. package/dist/console/assets/{index-FtTaDku8.js → index-BZ6HkxGf.js} +1 -1
  2. package/dist/console/index.html +1 -1
  3. package/dist/manifest.json +3 -3
  4. package/docs/README.md +57 -0
  5. package/docs/adrs/001-hybrid-storage-backend.md +38 -0
  6. package/docs/adrs/002-four-layer-context-classification.md +38 -0
  7. package/docs/adrs/003-checkpoint-trigger-strategy.md +35 -0
  8. package/docs/adrs/004-opt-in-encryption-strategy.md +36 -0
  9. package/docs/adrs/005-agent-first-workflow-execution-tokens.md +105 -0
  10. package/docs/adrs/006-append-only-session-run-event-log.md +76 -0
  11. package/docs/adrs/007-resume-and-checkpoint-only-sessions.md +51 -0
  12. package/docs/adrs/008-blocked-nodes-architectural-upgrade.md +178 -0
  13. package/docs/adrs/009-bridge-mode-single-instance-mcp.md +195 -0
  14. package/docs/adrs/010-release-pipeline.md +89 -0
  15. package/docs/architecture/README.md +7 -0
  16. package/docs/architecture/refactor-audit.md +364 -0
  17. package/docs/authoring-v2.md +527 -0
  18. package/docs/authoring.md +873 -0
  19. package/docs/changelog-recent.md +201 -0
  20. package/docs/configuration.md +505 -0
  21. package/docs/ctc-mcp-proposal.md +518 -0
  22. package/docs/design/README.md +22 -0
  23. package/docs/design/agent-cascade-protocol.md +96 -0
  24. package/docs/design/autonomous-console-design-candidates.md +253 -0
  25. package/docs/design/autonomous-console-design-review.md +111 -0
  26. package/docs/design/autonomous-platform-mvp-discovery.md +525 -0
  27. package/docs/design/claude-code-source-deep-dive.md +713 -0
  28. package/docs/design/console-cyberpunk-ui-discovery.md +504 -0
  29. package/docs/design/console-execution-trace-candidates-final.md +160 -0
  30. package/docs/design/console-execution-trace-candidates.md +211 -0
  31. package/docs/design/console-execution-trace-design-candidates-v2.md +113 -0
  32. package/docs/design/console-execution-trace-design-review.md +74 -0
  33. package/docs/design/console-execution-trace-discovery.md +394 -0
  34. package/docs/design/console-execution-trace-final-review.md +77 -0
  35. package/docs/design/console-execution-trace-review.md +92 -0
  36. package/docs/design/console-performance-discovery.md +415 -0
  37. package/docs/design/console-ui-backlog.md +280 -0
  38. package/docs/design/daemon-architecture-discovery.md +853 -0
  39. package/docs/design/daemon-design-candidates.md +318 -0
  40. package/docs/design/daemon-design-review-findings.md +119 -0
  41. package/docs/design/daemon-engine-design-candidates.md +210 -0
  42. package/docs/design/daemon-engine-design-review.md +131 -0
  43. package/docs/design/daemon-execution-engine-discovery.md +280 -0
  44. package/docs/design/daemon-gap-analysis.md +554 -0
  45. package/docs/design/daemon-owns-console-plan.md +168 -0
  46. package/docs/design/daemon-owns-console-review.md +91 -0
  47. package/docs/design/daemon-owns-console.md +195 -0
  48. package/docs/design/data-model-erd.md +11 -0
  49. package/docs/design/design-candidates-consolidate-dev-staleness.md +98 -0
  50. package/docs/design/design-candidates-walk-cache-depth-limit.md +80 -0
  51. package/docs/design/design-review-consolidate-dev-staleness.md +54 -0
  52. package/docs/design/design-review-walk-cache-depth-limit.md +48 -0
  53. package/docs/design/implementation-plan-consolidate-dev-staleness.md +142 -0
  54. package/docs/design/implementation-plan-walk-cache-depth-limit.md +141 -0
  55. package/docs/design/layer3b-ghost-nodes-design-candidates.md +229 -0
  56. package/docs/design/layer3b-ghost-nodes-design-review.md +93 -0
  57. package/docs/design/layer3b-ghost-nodes-implementation-plan.md +219 -0
  58. package/docs/design/list-workflows-latency-fix-plan.md +128 -0
  59. package/docs/design/list-workflows-latency-fix-review.md +55 -0
  60. package/docs/design/list-workflows-latency-fix.md +109 -0
  61. package/docs/design/native-context-management-api.md +11 -0
  62. package/docs/design/performance-sweep-2026-04.md +96 -0
  63. package/docs/design/routines-guide.md +219 -0
  64. package/docs/design/sequence-diagrams.md +11 -0
  65. package/docs/design/subagent-design-principles.md +220 -0
  66. package/docs/design/temporal-patterns-design-candidates.md +312 -0
  67. package/docs/design/temporal-patterns-design-review-findings.md +163 -0
  68. package/docs/design/test-isolation-from-config-file.md +335 -0
  69. package/docs/design/v2-core-design-locks.md +2746 -0
  70. package/docs/design/v2-lock-registry.json +734 -0
  71. package/docs/design/workflow-authoring-v2.md +1044 -0
  72. package/docs/design/workflow-docs-spec.md +218 -0
  73. package/docs/design/workflow-extension-points.md +687 -0
  74. package/docs/design/workrail-auto-trigger-system.md +359 -0
  75. package/docs/design/workrail-config-file-discovery.md +513 -0
  76. package/docs/docker.md +110 -0
  77. package/docs/generated/v2-lock-closure-plan.md +26 -0
  78. package/docs/generated/v2-lock-coverage.json +797 -0
  79. package/docs/generated/v2-lock-coverage.md +177 -0
  80. package/docs/ideas/backlog.md +3927 -0
  81. package/docs/ideas/design-candidates-mcp-resilience.md +208 -0
  82. package/docs/ideas/design-review-findings-mcp-resilience.md +119 -0
  83. package/docs/ideas/implementation_plan.md +249 -0
  84. package/docs/ideas/third-party-workflow-setup-design-thinking.md +1948 -0
  85. package/docs/implementation/02-architecture.md +316 -0
  86. package/docs/implementation/04-testing-strategy.md +124 -0
  87. package/docs/implementation/09-simple-workflow-guide.md +835 -0
  88. package/docs/implementation/13-advanced-validation-guide.md +874 -0
  89. package/docs/implementation/README.md +21 -0
  90. package/docs/integrations/claude-code.md +300 -0
  91. package/docs/integrations/firebender.md +315 -0
  92. package/docs/migration/v0.1.0.md +147 -0
  93. package/docs/naming-conventions.md +45 -0
  94. package/docs/planning/README.md +104 -0
  95. package/docs/planning/github-ticketing-playbook.md +195 -0
  96. package/docs/plans/README.md +24 -0
  97. package/docs/plans/agent-managed-ticketing-design.md +605 -0
  98. package/docs/plans/agentic-orchestration-roadmap.md +112 -0
  99. package/docs/plans/assessment-gates-engine-handoff.md +536 -0
  100. package/docs/plans/content-coherence-and-references.md +151 -0
  101. package/docs/plans/library-extraction-plan.md +340 -0
  102. package/docs/plans/mr-review-workflow-redesign.md +1451 -0
  103. package/docs/plans/native-context-management-epic.md +11 -0
  104. package/docs/plans/perf-fixes-design-candidates.md +225 -0
  105. package/docs/plans/perf-fixes-design-review-findings.md +61 -0
  106. package/docs/plans/perf-fixes-new-issues-candidates.md +264 -0
  107. package/docs/plans/perf-fixes-new-issues-review.md +110 -0
  108. package/docs/plans/prompt-fragments.md +53 -0
  109. package/docs/plans/ui-ux-workflow-design-candidates.md +120 -0
  110. package/docs/plans/ui-ux-workflow-discovery.md +100 -0
  111. package/docs/plans/ui-ux-workflow-review.md +48 -0
  112. package/docs/plans/v2-followup-enhancements.md +587 -0
  113. package/docs/plans/workflow-categories-candidates.md +105 -0
  114. package/docs/plans/workflow-categories-discovery.md +110 -0
  115. package/docs/plans/workflow-categories-review.md +51 -0
  116. package/docs/plans/workflow-discovery-model-candidates.md +94 -0
  117. package/docs/plans/workflow-discovery-model-discovery.md +74 -0
  118. package/docs/plans/workflow-discovery-model-review.md +48 -0
  119. package/docs/plans/workflow-source-setup-phase-1.md +245 -0
  120. package/docs/plans/workflow-source-setup-phase-2.md +361 -0
  121. package/docs/plans/workflow-staleness-detection-candidates.md +104 -0
  122. package/docs/plans/workflow-staleness-detection-review.md +58 -0
  123. package/docs/plans/workflow-staleness-detection.md +80 -0
  124. package/docs/plans/workflow-v2-design.md +69 -0
  125. package/docs/plans/workflow-v2-roadmap.md +74 -0
  126. package/docs/plans/workflow-validation-design.md +98 -0
  127. package/docs/plans/workflow-validation-roadmap.md +108 -0
  128. package/docs/plans/workrail-platform-vision.md +420 -0
  129. package/docs/reference/agent-context-cleaner-snippet.md +94 -0
  130. package/docs/reference/agent-context-guidance.md +140 -0
  131. package/docs/reference/context-optimization.md +284 -0
  132. package/docs/reference/example-workflow-repository-template/.github/workflows/validate.yml +125 -0
  133. package/docs/reference/example-workflow-repository-template/README.md +268 -0
  134. package/docs/reference/example-workflow-repository-template/workflows/example-workflow.json +80 -0
  135. package/docs/reference/external-workflow-repositories.md +916 -0
  136. package/docs/reference/feature-flags-architecture.md +472 -0
  137. package/docs/reference/feature-flags.md +349 -0
  138. package/docs/reference/god-tier-workflow-validation.md +272 -0
  139. package/docs/reference/loop-optimization.md +209 -0
  140. package/docs/reference/loop-validation.md +176 -0
  141. package/docs/reference/loops.md +465 -0
  142. package/docs/reference/mcp-platform-constraints.md +59 -0
  143. package/docs/reference/recovery.md +88 -0
  144. package/docs/reference/releases.md +177 -0
  145. package/docs/reference/troubleshooting.md +105 -0
  146. package/docs/reference/workflow-execution-contract.md +998 -0
  147. package/docs/roadmap/README.md +22 -0
  148. package/docs/roadmap/legacy-planning-status.md +103 -0
  149. package/docs/roadmap/now-next-later.md +70 -0
  150. package/docs/roadmap/open-work-inventory.md +389 -0
  151. package/docs/tickets/README.md +39 -0
  152. package/docs/tickets/next-up.md +76 -0
  153. package/docs/workflow-management.md +317 -0
  154. package/docs/workflow-templates.md +423 -0
  155. package/docs/workflow-validation.md +184 -0
  156. package/docs/workflows.md +254 -0
  157. package/package.json +3 -1
  158. package/spec/authoring-spec.json +61 -16
  159. package/workflows/workflow-for-workflows.json +252 -93
  160. package/workflows/workflow-for-workflows.v2.json +188 -77
@@ -0,0 +1,53 @@
1
+ # Prompt Fragments
2
+
3
+ This is the **canonical summary doc** for the conditional prompt fragments feature.
4
+
5
+ Use it for:
6
+
7
+ - what the feature is
8
+ - the final design stance
9
+ - shipped status and accepted tensions
10
+
11
+ ## What it is
12
+
13
+ `promptFragments` lets a workflow step declare small conditional prompt additions that are evaluated at render time against session context.
14
+
15
+ ## Final design stance
16
+
17
+ - fragments are **runtime-only**
18
+ - they do **not** participate in the compiled workflow hash
19
+ - they are **additive only**
20
+ - they are assembled at the existing prompt-rendering seam
21
+ - steps without `promptFragments` behave unchanged
22
+
23
+ ## Why the final design won
24
+
25
+ The chosen design kept fragment evaluation at render time instead of pushing it into compilation or scattering new context-plumbing across call sites.
26
+
27
+ That preserved:
28
+
29
+ - backward compatibility
30
+ - deterministic compiled hashes
31
+ - a single prompt-construction seam
32
+
33
+ ## Shipped status
34
+
35
+ This feature is effectively **done**.
36
+
37
+ Delivered work includes:
38
+
39
+ - workflow/schema support for `promptFragments`
40
+ - render-time fragment assembly
41
+ - `in` operator support for conditions
42
+ - validation rejecting `{{wr.*}}` tokens in fragment text
43
+ - focused tests covering rendering and validation behavior
44
+
45
+ ## Accepted tensions
46
+
47
+ - fragment provenance is not separately persisted as its own runtime artifact
48
+ - context projection during rendering is accepted as part of the existing prompt-rendering seam
49
+ - there is no heavy lifecycle/integration harness dedicated solely to fragments
50
+
51
+ ## Source history
52
+
53
+ The earlier design/review/verification docs for this feature have been removed (they were superseded stubs). This file plus the shipped code/tests is the canonical view.
@@ -0,0 +1,120 @@
1
+ # UI/UX Workflow Design Candidates
2
+
3
+ ## Problem Understanding
4
+
5
+ **Core claim**: AI agents fail at UI/UX design not because they lack knowledge but because they skip the process that converts knowledge into good design. They anchor on solutions before understanding users, produce single outputs instead of alternatives, violate UX laws they can accurately recite, and ignore edge cases, accessibility, and content entirely.
6
+
7
+ **The 6 failure categories:**
8
+ 1. **Process failures** — skips empathy, jumps to single solution, happy-path only, no iteration
9
+ 2. **UX law violations** — knows Hick/Miller/Jakob/Fitts/Peak-End/Tesler but doesn't apply them
10
+ 3. **Content/communication failures** — designs layout, ignores microcopy, no error states, no onboarding
11
+ 4. **Accessibility failures** — color contrast, keyboard nav, touch targets, screen readers ignored
12
+ 5. **Context blindness** — no design system, platform conventions, brand, user research knowledge
13
+ 6. **Inherent limitations** — can't see/render designs, can't test with users
14
+
15
+ **Core tension**: Agents CAN recite all UX principles. The failure is enforcement and process, not knowledge.
16
+
17
+ **What makes this hard**: A checklist prompt doesn't work — an agent can "check" each item while still fundamentally designing the wrong thing. The forcing function must be structural: impossible to advance past Problem Framing without user goals explicitly stated and confirmed.
18
+
19
+ ## Philosophy Constraints
20
+
21
+ - **Make illegal states unrepresentable**: Solution proposals before user framing are an illegal state
22
+ - **Validate at boundaries**: User goals/problem definition is the boundary; validate before design begins
23
+ - **Structured freedom**: Constrain outcomes (user framing required, alternatives required), not cognition steps
24
+ - **Anti-lazy wording**: 'design a UI for X' is insufficient input; workflow must force specificity
25
+ - **YAGNI**: Don't add phases that don't address a real failure mode
26
+
27
+ ## Candidates
28
+
29
+ ### A: Minimal — Problem-First Gate
30
+
31
+ **Summary**: 3-phase workflow enforcing exactly one invariant: no solution proposals until user goals and constraints are explicitly stated and confirmed (requireConfirmation).
32
+
33
+ - **Tensions resolved**: premature convergence
34
+ - **Tensions accepted**: UX law violations, edge cases, accessibility all left to agent judgment
35
+ - **Failure mode**: agent states user goals then proposes a single solution anyway
36
+ - **Repo pattern**: similar to bug-investigation hypothesis-first gate
37
+ - **Gains**: minimal ceremony, fast, blocks root cause
38
+ - **Losses**: only addresses 1 of 6 failure categories
39
+ - **Scope**: too narrow
40
+ - **Philosophy**: honors validate-at-boundaries; conflicts with make-illegal-states-unrepresentable
41
+
42
+ ---
43
+
44
+ ### B: Process Enforcement — Adapted Production Readiness Audit (RECOMMENDED for creation)
45
+
46
+ **Summary**: 6-phase creation workflow with hypothesis-first, neutral context packet, parallel reviewer families per failure category, synthesis loop, and design spec handoff.
47
+
48
+ **Phase structure:**
49
+ - Phase 0: Problem framing + user goals + constraints (requireConfirmation always)
50
+ - Phase 1: State design hypothesis — what does the agent currently believe is the right direction?
51
+ - Phase 2: Freeze context packet + declare reviewer families based on declared concerns
52
+ - Phase 3: Parallel reviewer bundle (IA reviewer, UX laws reviewer, accessibility reviewer, edge-cases reviewer, content/microcopy reviewer)
53
+ - Phase 4: Synthesis + contradiction loop
54
+ - Phase 5: Design spec handoff with per-dimension findings
55
+
56
+ **Complexity branching**: Simple (single-screen, minor change) → skip Phase 1-3, go direct. Standard/Complex → full pipeline.
57
+
58
+ - **Tensions resolved**: all 6 failure categories; forces alternatives at hypothesis stage; evidence-based per-dimension findings
59
+ - **Tensions accepted**: inherent visual limitations; spec not mockup
60
+ - **Failure mode**: reviewer families produce generic UX advice not tied to actual design context
61
+ - **Repo pattern**: directly adapts `production-readiness-audit.json` structure; auditComplexity branching from `adaptive-ticket-creation.json`
62
+ - **Gains**: comprehensive, structured freedom, all failure categories covered
63
+ - **Losses**: heavier than minimal for simple tasks (mitigated by Simple fast path)
64
+ - **Scope**: best-fit for feature-level and screen-level design work
65
+ - **Philosophy**: all principles satisfied
66
+
67
+ ---
68
+
69
+ ### C: Alternatives-First — Double Diamond
70
+
71
+ **Summary**: Forces divergence (3 fundamentally different design directions with explicit IA sketches) before any convergence, with adversarial challenge before the agent recommends one.
72
+
73
+ - **Tensions resolved**: single-solution anchoring; forces genuine exploration
74
+ - **Tensions accepted**: UX laws/accessibility not explicitly enforced
75
+ - **Failure mode**: 3 directions are superficially different (same IA, different metaphors)
76
+ - **Repo pattern**: adapts `architecture-scalability-audit.json` dimension-declaration
77
+ - **Gains**: best for exploring solution space; documents tradeoffs
78
+ - **Losses**: lighter on UX law enforcement; accessibility second-class
79
+ - **Scope**: best as a mechanism within B rather than a standalone workflow
80
+ - **Philosophy**: honors structured freedom; YAGNI tension (adds divergence without enforcing quality)
81
+
82
+ ---
83
+
84
+ ### D: UX Audit — Review Against Principles (RECOMMENDED as companion)
85
+
86
+ **Summary**: Review workflow for existing designs. User provides a design description/spec; agent audits it against explicit UX dimensions with per-dimension verdicts and evidence.
87
+
88
+ **Dimensions**: information architecture, Hick/Miller/Jakob laws, Fitts + touch targets, Peak-End + emotional journey, accessibility (WCAG checklist), edge cases (empty/error/loading/first-use), content + microcopy.
89
+
90
+ - **Tensions resolved**: turns agent UX knowledge into structured application; fully evidence-based
91
+ - **Tensions accepted**: doesn't help with design-from-scratch; requires existing design as input
92
+ - **Failure mode**: agent audits what's in the spec but misses implicit design assumptions not stated
93
+ - **Repo pattern**: directly adapts `architecture-scalability-audit.json`
94
+ - **Gains**: actionable per-dimension findings with references; complements B
95
+ - **Losses**: review only, not creation
96
+ - **Scope**: best-fit as standalone for design review; or used after B to audit the output
97
+ - **Philosophy**: all principles satisfied; mirrors architecture-scalability-audit exactly
98
+
99
+ ## Comparison and Recommendation
100
+
101
+ **Recommendation: Build B and D as two separate workflows that naturally compose.**
102
+
103
+ B (creation) and D (audit) serve different moments: B for designing from scratch, D for auditing before implementation. Together they cover the full design process. Neither alone is sufficient.
104
+
105
+ C's divergence mechanism (3 design directions) should be incorporated as a step within B's hypothesis phase, not as a separate workflow.
106
+
107
+ A is too narrow — prevents the worst failure but leaves 5 of 6 failure categories entirely to agent judgment.
108
+
109
+ ## Self-Critique
110
+
111
+ **Strongest counter-argument**: B is too heavy for everyday use. Adding a button to a screen shouldn't require 6 phases and reviewer families. Counter: the Simple fast path (auditComplexity branching) addresses this — simple changes skip everything except problem framing and spec.
112
+
113
+ **Pivot condition**: If agents produce hollow reviewer findings (generic advice not tied to the actual design context), the reviewer-family approach needs replacement with a simpler rubric where the agent self-scores against explicit criteria rather than running independent reviewers.
114
+
115
+ ## Open Questions for Main Agent
116
+
117
+ 1. Should D be a standalone workflow or a phase within B? (Standalone seems cleaner — used at a different moment in the process.)
118
+ 2. What's the right output format for B? Markdown design spec? JSON with structured fields? Decision records?
119
+ 3. Should B include a visual vocabulary section — letting the agent describe layout in structured prose (grid, card, list, etc.) as a partial substitute for visual mockups?
120
+ 4. Is there a third workflow worth building: a **Design System Audit** that checks whether a proposed design is consistent with an existing component library?
@@ -0,0 +1,100 @@
1
+ # UI/UX Design Workflow Discovery
2
+
3
+ ## Context / Ask
4
+
5
+ Map the specific ways AI agents fail at UI/UX design work and identify which failures are addressable through workflow structure. Goal: inform the design of a WorkRail workflow for UI/UX design.
6
+
7
+ ## Path Recommendation
8
+
9
+ `full_spectrum` — need both landscape (catalog of AI failure modes with evidence from UX principles) and reframing (which failures a workflow can actually address vs. which are inherent AI limitations).
10
+
11
+ ## Constraints / Anti-goals
12
+
13
+ - Not building a visual design tool — agents can't render or see designs
14
+ - Not replacing the human designer — augmenting and structuring agent-assisted design
15
+ - Must produce something an agent can actually execute, not just describe
16
+
17
+ ## Landscape Packet
18
+
19
+ *(populated in Phase 1)*
20
+
21
+ ## Problem Frame Packet
22
+
23
+ *(populated in Phase 1d)*
24
+
25
+ ## Candidate Directions
26
+
27
+ *(populated in Phase 3)*
28
+
29
+ ## Challenge Notes
30
+
31
+ *(populated in Phase 4)*
32
+
33
+ ## Resolution Notes
34
+
35
+ *(populated in Phase 5)*
36
+
37
+ ## Decision Log
38
+
39
+ *(populated as decisions are made)*
40
+
41
+ ## Final Summary
42
+
43
+ *(populated at end)*
44
+
45
+ ## Candidate Directions
46
+
47
+ See `docs/plans/ui-ux-workflow-design-candidates.md` for full candidate analysis.
48
+
49
+ **Recommendation: Two composing workflows**
50
+
51
+ ### Workflow B: UI/UX Design Creation Workflow
52
+ For designing UI/UX from scratch. Adapted from `production-readiness-audit.json`.
53
+
54
+ **Phase structure:**
55
+ - Phase 0: Problem framing, user goals, constraints, existing design context (requireConfirmation always)
56
+ - Phase 1: State 2-3 design directions with IA sketches (requireConfirmation — forces alternatives before convergence)
57
+ - Phase 2: Freeze context packet + select reviewer families based on declared concerns
58
+ - Phase 3: Parallel reviewer bundle (IA, UX laws, accessibility, edge cases, content/microcopy)
59
+ - Phase 4: Synthesis + contradiction loop
60
+ - Phase 5: Design spec handoff
61
+
62
+ **Complexity branching**: Simple (single component, no new flows) skips Phases 1-3.
63
+
64
+ ### Workflow D: UI/UX Design Audit Workflow
65
+ For reviewing an existing design description/spec before implementation. Adapted from `architecture-scalability-audit.json`.
66
+
67
+ User provides design description; agent audits against declared dimensions: information architecture, Hick/Miller/Jakob/Fitts laws, accessibility (WCAG), edge cases (empty/error/loading/first-use), content/microcopy, visual hierarchy.
68
+
69
+ **Composability**: Run B to create the design spec, then D to audit it before handing to implementation.
70
+
71
+ ## Decision Log
72
+
73
+ - A (Minimal gate) rejected: only addresses 1 of 6 failure categories
74
+ - C (Alternatives-first) incorporated as required step in B Phase 1, not standalone
75
+ - B + D selected: B for creation, D for audit, different moments in design process
76
+ - Challenge: reviewer generic findings — mitigated by context packet constraint
77
+ - Challenge: too heavy — mitigated by Simple fast path
78
+
79
+ ## Final Summary
80
+
81
+ **The core insight**: AI agents don't fail at UI/UX because they lack knowledge — they fail because nothing forces them through the process that converts knowledge into good design. The workflow's job is to make the right process structurally unavoidable.
82
+
83
+ **What agents are bad at (6 categories):**
84
+ 1. Process — premature convergence on single solution, happy-path only
85
+ 2. UX law application — knows Hick/Miller/Jakob/Peak-End but skips them
86
+ 3. Content — designs layout, ignores microcopy and error states
87
+ 4. Accessibility — all 17 WCAG categories routinely missed
88
+ 5. Context — doesn't know design system, platform, brand
89
+ 6. Inherent — can't see/render, can't test with users
90
+
91
+ **What a workflow can fix (categories 1-4):**
92
+ Structural enforcement through phases, reviewer families, and requireConfirmation gates.
93
+
94
+ **What requires human input (categories 5-6):**
95
+ Design system, platform conventions, brand personality, real user research, visual review.
96
+
97
+ **Residual risks:**
98
+ - Reviewer families may produce generic findings without context packet constraint
99
+ - Simple path may be overused — needs explicit criteria
100
+ - Output is a spec, not a mockup — acceptable for developer-AI teams
@@ -0,0 +1,48 @@
1
+ # UI/UX Workflow Design Review Findings
2
+
3
+ ## Tradeoff Review
4
+
5
+ | Tradeoff | Verdict | Fails if |
6
+ |---|---|---|
7
+ | Spec not mockup | Acceptable | Team needs visual approval before implementation — add external review checkpoint |
8
+ | Context blindness | Acceptable | Agent proceeds without surfacing missing context — Phase 0 requireConfirmation handles this |
9
+ | Simple path overuse | Acceptable | Agent misclassifies complex work as Simple — requireConfirmation on classification |
10
+
11
+ ## Failure Mode Review
12
+
13
+ | Failure Mode | Risk | Coverage | Missing Mitigation |
14
+ |---|---|---|---|
15
+ | Generic reviewer findings | **High** | Partially | Reviewers must cite specific elements from the design description using frozen context packet |
16
+ | Simple path overuse | Medium | Partially | requireConfirmation on complexity classification with explicit Simple criteria |
17
+ | Wrong output format | Low | N/A | Not a workflow problem — addressed by how the spec is structured |
18
+
19
+ **Highest risk: generic reviewer findings.** If reviewers produce platitudes ("consider grouping related items") instead of specific findings, the workflow produces the appearance of rigor without substance.
20
+
21
+ ## Runner-Up / Simpler Alternative Review
22
+
23
+ - C (Alternatives-First) already incorporated as required step in B's hypothesis phase
24
+ - Simpler variant (remove context packet phase) rejected — reviewers lose consistency without shared context
25
+ - No hybrid opportunities
26
+
27
+ ## Philosophy Alignment
28
+
29
+ All principles satisfied: make-illegal-states-unrepresentable, validate-at-boundaries, structured-freedom, errors-are-data, YAGNI (via Simple fast path). Minor acceptable tension on YAGNI resolved by Simple path.
30
+
31
+ ## Findings
32
+
33
+ **Orange — Reviewer finding quality gate missing**
34
+ Both B and D need explicit constraints that reviewer families must cite specific elements from the design description or context packet. Generic UX advice without reference to the actual design is a failing output, not a valid finding. Add to metaGuidance or reviewer instructions.
35
+
36
+ **Yellow — Simple complexity criteria need explicit definition**
37
+ "Simple" must be defined concretely in Phase 0: single component modification, no IA changes, no new user flows, no new interaction patterns. Without this, agents will self-classify to avoid rigor.
38
+
39
+ ## Recommended Revisions
40
+
41
+ 1. Add to reviewer family instructions: "Each finding must cite a specific element, component, or decision from the frozen context packet. Generic UX principles without specific reference are not findings."
42
+ 2. Define Simple explicitly: single component, no new flows, no IA changes, no new interaction patterns — everything else is Standard or Complex.
43
+ 3. Consider a `designVocabulary` section in the context packet: let the agent describe layout using structured prose terms (grid, card, list, modal, drawer, etc.) as a partial substitute for visual mockups.
44
+
45
+ ## Residual Concerns
46
+
47
+ - The audit workflow (D) is most useful after B produces a spec — the workflow catalog should make this composability explicit in descriptions
48
+ - Neither B nor D addresses the design system consistency problem (does this design match existing components?) — potential third workflow or reviewer family