@exaudeus/workrail 3.27.0 → 3.29.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/console/assets/{index-FtTaDku8.js → index-BZ6HkxGf.js} +1 -1
- package/dist/console/index.html +1 -1
- package/dist/manifest.json +3 -3
- package/docs/README.md +57 -0
- package/docs/adrs/001-hybrid-storage-backend.md +38 -0
- package/docs/adrs/002-four-layer-context-classification.md +38 -0
- package/docs/adrs/003-checkpoint-trigger-strategy.md +35 -0
- package/docs/adrs/004-opt-in-encryption-strategy.md +36 -0
- package/docs/adrs/005-agent-first-workflow-execution-tokens.md +105 -0
- package/docs/adrs/006-append-only-session-run-event-log.md +76 -0
- package/docs/adrs/007-resume-and-checkpoint-only-sessions.md +51 -0
- package/docs/adrs/008-blocked-nodes-architectural-upgrade.md +178 -0
- package/docs/adrs/009-bridge-mode-single-instance-mcp.md +195 -0
- package/docs/adrs/010-release-pipeline.md +89 -0
- package/docs/architecture/README.md +7 -0
- package/docs/architecture/refactor-audit.md +364 -0
- package/docs/authoring-v2.md +527 -0
- package/docs/authoring.md +873 -0
- package/docs/changelog-recent.md +201 -0
- package/docs/configuration.md +505 -0
- package/docs/ctc-mcp-proposal.md +518 -0
- package/docs/design/README.md +22 -0
- package/docs/design/agent-cascade-protocol.md +96 -0
- package/docs/design/autonomous-console-design-candidates.md +253 -0
- package/docs/design/autonomous-console-design-review.md +111 -0
- package/docs/design/autonomous-platform-mvp-discovery.md +525 -0
- package/docs/design/claude-code-source-deep-dive.md +713 -0
- package/docs/design/console-cyberpunk-ui-discovery.md +504 -0
- package/docs/design/console-execution-trace-candidates-final.md +160 -0
- package/docs/design/console-execution-trace-candidates.md +211 -0
- package/docs/design/console-execution-trace-design-candidates-v2.md +113 -0
- package/docs/design/console-execution-trace-design-review.md +74 -0
- package/docs/design/console-execution-trace-discovery.md +394 -0
- package/docs/design/console-execution-trace-final-review.md +77 -0
- package/docs/design/console-execution-trace-review.md +92 -0
- package/docs/design/console-performance-discovery.md +415 -0
- package/docs/design/console-ui-backlog.md +280 -0
- package/docs/design/daemon-architecture-discovery.md +853 -0
- package/docs/design/daemon-design-candidates.md +318 -0
- package/docs/design/daemon-design-review-findings.md +119 -0
- package/docs/design/daemon-engine-design-candidates.md +210 -0
- package/docs/design/daemon-engine-design-review.md +131 -0
- package/docs/design/daemon-execution-engine-discovery.md +280 -0
- package/docs/design/daemon-gap-analysis.md +554 -0
- package/docs/design/daemon-owns-console-plan.md +168 -0
- package/docs/design/daemon-owns-console-review.md +91 -0
- package/docs/design/daemon-owns-console.md +195 -0
- package/docs/design/data-model-erd.md +11 -0
- package/docs/design/design-candidates-consolidate-dev-staleness.md +98 -0
- package/docs/design/design-candidates-walk-cache-depth-limit.md +80 -0
- package/docs/design/design-review-consolidate-dev-staleness.md +54 -0
- package/docs/design/design-review-walk-cache-depth-limit.md +48 -0
- package/docs/design/implementation-plan-consolidate-dev-staleness.md +142 -0
- package/docs/design/implementation-plan-walk-cache-depth-limit.md +141 -0
- package/docs/design/layer3b-ghost-nodes-design-candidates.md +229 -0
- package/docs/design/layer3b-ghost-nodes-design-review.md +93 -0
- package/docs/design/layer3b-ghost-nodes-implementation-plan.md +219 -0
- package/docs/design/list-workflows-latency-fix-plan.md +128 -0
- package/docs/design/list-workflows-latency-fix-review.md +55 -0
- package/docs/design/list-workflows-latency-fix.md +109 -0
- package/docs/design/native-context-management-api.md +11 -0
- package/docs/design/performance-sweep-2026-04.md +96 -0
- package/docs/design/routines-guide.md +219 -0
- package/docs/design/sequence-diagrams.md +11 -0
- package/docs/design/subagent-design-principles.md +220 -0
- package/docs/design/temporal-patterns-design-candidates.md +312 -0
- package/docs/design/temporal-patterns-design-review-findings.md +163 -0
- package/docs/design/test-isolation-from-config-file.md +335 -0
- package/docs/design/v2-core-design-locks.md +2746 -0
- package/docs/design/v2-lock-registry.json +734 -0
- package/docs/design/workflow-authoring-v2.md +1044 -0
- package/docs/design/workflow-docs-spec.md +218 -0
- package/docs/design/workflow-extension-points.md +687 -0
- package/docs/design/workrail-auto-trigger-system.md +359 -0
- package/docs/design/workrail-config-file-discovery.md +513 -0
- package/docs/docker.md +110 -0
- package/docs/generated/v2-lock-closure-plan.md +26 -0
- package/docs/generated/v2-lock-coverage.json +797 -0
- package/docs/generated/v2-lock-coverage.md +177 -0
- package/docs/ideas/backlog.md +3927 -0
- package/docs/ideas/design-candidates-mcp-resilience.md +208 -0
- package/docs/ideas/design-review-findings-mcp-resilience.md +119 -0
- package/docs/ideas/implementation_plan.md +249 -0
- package/docs/ideas/third-party-workflow-setup-design-thinking.md +1948 -0
- package/docs/implementation/02-architecture.md +316 -0
- package/docs/implementation/04-testing-strategy.md +124 -0
- package/docs/implementation/09-simple-workflow-guide.md +835 -0
- package/docs/implementation/13-advanced-validation-guide.md +874 -0
- package/docs/implementation/README.md +21 -0
- package/docs/integrations/claude-code.md +300 -0
- package/docs/integrations/firebender.md +315 -0
- package/docs/migration/v0.1.0.md +147 -0
- package/docs/naming-conventions.md +45 -0
- package/docs/planning/README.md +104 -0
- package/docs/planning/github-ticketing-playbook.md +195 -0
- package/docs/plans/README.md +24 -0
- package/docs/plans/agent-managed-ticketing-design.md +605 -0
- package/docs/plans/agentic-orchestration-roadmap.md +112 -0
- package/docs/plans/assessment-gates-engine-handoff.md +536 -0
- package/docs/plans/content-coherence-and-references.md +151 -0
- package/docs/plans/library-extraction-plan.md +340 -0
- package/docs/plans/mr-review-workflow-redesign.md +1451 -0
- package/docs/plans/native-context-management-epic.md +11 -0
- package/docs/plans/perf-fixes-design-candidates.md +225 -0
- package/docs/plans/perf-fixes-design-review-findings.md +61 -0
- package/docs/plans/perf-fixes-new-issues-candidates.md +264 -0
- package/docs/plans/perf-fixes-new-issues-review.md +110 -0
- package/docs/plans/prompt-fragments.md +53 -0
- package/docs/plans/ui-ux-workflow-design-candidates.md +120 -0
- package/docs/plans/ui-ux-workflow-discovery.md +100 -0
- package/docs/plans/ui-ux-workflow-review.md +48 -0
- package/docs/plans/v2-followup-enhancements.md +587 -0
- package/docs/plans/workflow-categories-candidates.md +105 -0
- package/docs/plans/workflow-categories-discovery.md +110 -0
- package/docs/plans/workflow-categories-review.md +51 -0
- package/docs/plans/workflow-discovery-model-candidates.md +94 -0
- package/docs/plans/workflow-discovery-model-discovery.md +74 -0
- package/docs/plans/workflow-discovery-model-review.md +48 -0
- package/docs/plans/workflow-source-setup-phase-1.md +245 -0
- package/docs/plans/workflow-source-setup-phase-2.md +361 -0
- package/docs/plans/workflow-staleness-detection-candidates.md +104 -0
- package/docs/plans/workflow-staleness-detection-review.md +58 -0
- package/docs/plans/workflow-staleness-detection.md +80 -0
- package/docs/plans/workflow-v2-design.md +69 -0
- package/docs/plans/workflow-v2-roadmap.md +74 -0
- package/docs/plans/workflow-validation-design.md +98 -0
- package/docs/plans/workflow-validation-roadmap.md +108 -0
- package/docs/plans/workrail-platform-vision.md +420 -0
- package/docs/reference/agent-context-cleaner-snippet.md +94 -0
- package/docs/reference/agent-context-guidance.md +140 -0
- package/docs/reference/context-optimization.md +284 -0
- package/docs/reference/example-workflow-repository-template/.github/workflows/validate.yml +125 -0
- package/docs/reference/example-workflow-repository-template/README.md +268 -0
- package/docs/reference/example-workflow-repository-template/workflows/example-workflow.json +80 -0
- package/docs/reference/external-workflow-repositories.md +916 -0
- package/docs/reference/feature-flags-architecture.md +472 -0
- package/docs/reference/feature-flags.md +349 -0
- package/docs/reference/god-tier-workflow-validation.md +272 -0
- package/docs/reference/loop-optimization.md +209 -0
- package/docs/reference/loop-validation.md +176 -0
- package/docs/reference/loops.md +465 -0
- package/docs/reference/mcp-platform-constraints.md +59 -0
- package/docs/reference/recovery.md +88 -0
- package/docs/reference/releases.md +177 -0
- package/docs/reference/troubleshooting.md +105 -0
- package/docs/reference/workflow-execution-contract.md +998 -0
- package/docs/roadmap/README.md +22 -0
- package/docs/roadmap/legacy-planning-status.md +103 -0
- package/docs/roadmap/now-next-later.md +70 -0
- package/docs/roadmap/open-work-inventory.md +389 -0
- package/docs/tickets/README.md +39 -0
- package/docs/tickets/next-up.md +76 -0
- package/docs/workflow-management.md +317 -0
- package/docs/workflow-templates.md +423 -0
- package/docs/workflow-validation.md +184 -0
- package/docs/workflows.md +254 -0
- package/package.json +3 -1
- package/spec/authoring-spec.json +61 -16
- package/workflows/workflow-for-workflows.json +252 -93
- package/workflows/workflow-for-workflows.v2.json +188 -77
|
@@ -0,0 +1,53 @@
|
|
|
1
|
+
# Prompt Fragments
|
|
2
|
+
|
|
3
|
+
This is the **canonical summary doc** for the conditional prompt fragments feature.
|
|
4
|
+
|
|
5
|
+
Use it for:
|
|
6
|
+
|
|
7
|
+
- what the feature is
|
|
8
|
+
- the final design stance
|
|
9
|
+
- shipped status and accepted tensions
|
|
10
|
+
|
|
11
|
+
## What it is
|
|
12
|
+
|
|
13
|
+
`promptFragments` lets a workflow step declare small conditional prompt additions that are evaluated at render time against session context.
|
|
14
|
+
|
|
15
|
+
## Final design stance
|
|
16
|
+
|
|
17
|
+
- fragments are **runtime-only**
|
|
18
|
+
- they do **not** participate in the compiled workflow hash
|
|
19
|
+
- they are **additive only**
|
|
20
|
+
- they are assembled at the existing prompt-rendering seam
|
|
21
|
+
- steps without `promptFragments` behave unchanged
|
|
22
|
+
|
|
23
|
+
## Why the final design won
|
|
24
|
+
|
|
25
|
+
The chosen design kept fragment evaluation at render time instead of pushing it into compilation or scattering new context-plumbing across call sites.
|
|
26
|
+
|
|
27
|
+
That preserved:
|
|
28
|
+
|
|
29
|
+
- backward compatibility
|
|
30
|
+
- deterministic compiled hashes
|
|
31
|
+
- a single prompt-construction seam
|
|
32
|
+
|
|
33
|
+
## Shipped status
|
|
34
|
+
|
|
35
|
+
This feature is effectively **done**.
|
|
36
|
+
|
|
37
|
+
Delivered work includes:
|
|
38
|
+
|
|
39
|
+
- workflow/schema support for `promptFragments`
|
|
40
|
+
- render-time fragment assembly
|
|
41
|
+
- `in` operator support for conditions
|
|
42
|
+
- validation rejecting `{{wr.*}}` tokens in fragment text
|
|
43
|
+
- focused tests covering rendering and validation behavior
|
|
44
|
+
|
|
45
|
+
## Accepted tensions
|
|
46
|
+
|
|
47
|
+
- fragment provenance is not separately persisted as its own runtime artifact
|
|
48
|
+
- context projection during rendering is accepted as part of the existing prompt-rendering seam
|
|
49
|
+
- there is no heavy lifecycle/integration harness dedicated solely to fragments
|
|
50
|
+
|
|
51
|
+
## Source history
|
|
52
|
+
|
|
53
|
+
The earlier design/review/verification docs for this feature have been removed (they were superseded stubs). This file plus the shipped code/tests is the canonical view.
|
|
@@ -0,0 +1,120 @@
|
|
|
1
|
+
# UI/UX Workflow Design Candidates
|
|
2
|
+
|
|
3
|
+
## Problem Understanding
|
|
4
|
+
|
|
5
|
+
**Core claim**: AI agents fail at UI/UX design not because they lack knowledge but because they skip the process that converts knowledge into good design. They anchor on solutions before understanding users, produce single outputs instead of alternatives, violate UX laws they can accurately recite, and ignore edge cases, accessibility, and content entirely.
|
|
6
|
+
|
|
7
|
+
**The 6 failure categories:**
|
|
8
|
+
1. **Process failures** — skips empathy, jumps to single solution, happy-path only, no iteration
|
|
9
|
+
2. **UX law violations** — knows Hick/Miller/Jakob/Fitts/Peak-End/Tesler but doesn't apply them
|
|
10
|
+
3. **Content/communication failures** — designs layout, ignores microcopy, no error states, no onboarding
|
|
11
|
+
4. **Accessibility failures** — color contrast, keyboard nav, touch targets, screen readers ignored
|
|
12
|
+
5. **Context blindness** — no design system, platform conventions, brand, user research knowledge
|
|
13
|
+
6. **Inherent limitations** — can't see/render designs, can't test with users
|
|
14
|
+
|
|
15
|
+
**Core tension**: Agents CAN recite all UX principles. The failure is enforcement and process, not knowledge.
|
|
16
|
+
|
|
17
|
+
**What makes this hard**: A checklist prompt doesn't work — an agent can "check" each item while still fundamentally designing the wrong thing. The forcing function must be structural: impossible to advance past Problem Framing without user goals explicitly stated and confirmed.
|
|
18
|
+
|
|
19
|
+
## Philosophy Constraints
|
|
20
|
+
|
|
21
|
+
- **Make illegal states unrepresentable**: Solution proposals before user framing are an illegal state
|
|
22
|
+
- **Validate at boundaries**: User goals/problem definition is the boundary; validate before design begins
|
|
23
|
+
- **Structured freedom**: Constrain outcomes (user framing required, alternatives required), not cognition steps
|
|
24
|
+
- **Anti-lazy wording**: 'design a UI for X' is insufficient input; workflow must force specificity
|
|
25
|
+
- **YAGNI**: Don't add phases that don't address a real failure mode
|
|
26
|
+
|
|
27
|
+
## Candidates
|
|
28
|
+
|
|
29
|
+
### A: Minimal — Problem-First Gate
|
|
30
|
+
|
|
31
|
+
**Summary**: 3-phase workflow enforcing exactly one invariant: no solution proposals until user goals and constraints are explicitly stated and confirmed (requireConfirmation).
|
|
32
|
+
|
|
33
|
+
- **Tensions resolved**: premature convergence
|
|
34
|
+
- **Tensions accepted**: UX law violations, edge cases, accessibility all left to agent judgment
|
|
35
|
+
- **Failure mode**: agent states user goals then proposes a single solution anyway
|
|
36
|
+
- **Repo pattern**: similar to bug-investigation hypothesis-first gate
|
|
37
|
+
- **Gains**: minimal ceremony, fast, blocks root cause
|
|
38
|
+
- **Losses**: only addresses 1 of 6 failure categories
|
|
39
|
+
- **Scope**: too narrow
|
|
40
|
+
- **Philosophy**: honors validate-at-boundaries; conflicts with make-illegal-states-unrepresentable
|
|
41
|
+
|
|
42
|
+
---
|
|
43
|
+
|
|
44
|
+
### B: Process Enforcement — Adapted Production Readiness Audit (RECOMMENDED for creation)
|
|
45
|
+
|
|
46
|
+
**Summary**: 6-phase creation workflow with hypothesis-first, neutral context packet, parallel reviewer families per failure category, synthesis loop, and design spec handoff.
|
|
47
|
+
|
|
48
|
+
**Phase structure:**
|
|
49
|
+
- Phase 0: Problem framing + user goals + constraints (requireConfirmation always)
|
|
50
|
+
- Phase 1: State design hypothesis — what does the agent currently believe is the right direction?
|
|
51
|
+
- Phase 2: Freeze context packet + declare reviewer families based on declared concerns
|
|
52
|
+
- Phase 3: Parallel reviewer bundle (IA reviewer, UX laws reviewer, accessibility reviewer, edge-cases reviewer, content/microcopy reviewer)
|
|
53
|
+
- Phase 4: Synthesis + contradiction loop
|
|
54
|
+
- Phase 5: Design spec handoff with per-dimension findings
|
|
55
|
+
|
|
56
|
+
**Complexity branching**: Simple (single-screen, minor change) → skip Phase 1-3, go direct. Standard/Complex → full pipeline.
|
|
57
|
+
|
|
58
|
+
- **Tensions resolved**: all 6 failure categories; forces alternatives at hypothesis stage; evidence-based per-dimension findings
|
|
59
|
+
- **Tensions accepted**: inherent visual limitations; spec not mockup
|
|
60
|
+
- **Failure mode**: reviewer families produce generic UX advice not tied to actual design context
|
|
61
|
+
- **Repo pattern**: directly adapts `production-readiness-audit.json` structure; auditComplexity branching from `adaptive-ticket-creation.json`
|
|
62
|
+
- **Gains**: comprehensive, structured freedom, all failure categories covered
|
|
63
|
+
- **Losses**: heavier than minimal for simple tasks (mitigated by Simple fast path)
|
|
64
|
+
- **Scope**: best-fit for feature-level and screen-level design work
|
|
65
|
+
- **Philosophy**: all principles satisfied
|
|
66
|
+
|
|
67
|
+
---
|
|
68
|
+
|
|
69
|
+
### C: Alternatives-First — Double Diamond
|
|
70
|
+
|
|
71
|
+
**Summary**: Forces divergence (3 fundamentally different design directions with explicit IA sketches) before any convergence, with adversarial challenge before the agent recommends one.
|
|
72
|
+
|
|
73
|
+
- **Tensions resolved**: single-solution anchoring; forces genuine exploration
|
|
74
|
+
- **Tensions accepted**: UX laws/accessibility not explicitly enforced
|
|
75
|
+
- **Failure mode**: 3 directions are superficially different (same IA, different metaphors)
|
|
76
|
+
- **Repo pattern**: adapts `architecture-scalability-audit.json` dimension-declaration
|
|
77
|
+
- **Gains**: best for exploring solution space; documents tradeoffs
|
|
78
|
+
- **Losses**: lighter on UX law enforcement; accessibility second-class
|
|
79
|
+
- **Scope**: best as a mechanism within B rather than a standalone workflow
|
|
80
|
+
- **Philosophy**: honors structured freedom; YAGNI tension (adds divergence without enforcing quality)
|
|
81
|
+
|
|
82
|
+
---
|
|
83
|
+
|
|
84
|
+
### D: UX Audit — Review Against Principles (RECOMMENDED as companion)
|
|
85
|
+
|
|
86
|
+
**Summary**: Review workflow for existing designs. User provides a design description/spec; agent audits it against explicit UX dimensions with per-dimension verdicts and evidence.
|
|
87
|
+
|
|
88
|
+
**Dimensions**: information architecture, Hick/Miller/Jakob laws, Fitts + touch targets, Peak-End + emotional journey, accessibility (WCAG checklist), edge cases (empty/error/loading/first-use), content + microcopy.
|
|
89
|
+
|
|
90
|
+
- **Tensions resolved**: turns agent UX knowledge into structured application; fully evidence-based
|
|
91
|
+
- **Tensions accepted**: doesn't help with design-from-scratch; requires existing design as input
|
|
92
|
+
- **Failure mode**: agent audits what's in the spec but misses implicit design assumptions not stated
|
|
93
|
+
- **Repo pattern**: directly adapts `architecture-scalability-audit.json`
|
|
94
|
+
- **Gains**: actionable per-dimension findings with references; complements B
|
|
95
|
+
- **Losses**: review only, not creation
|
|
96
|
+
- **Scope**: best-fit as standalone for design review; or used after B to audit the output
|
|
97
|
+
- **Philosophy**: all principles satisfied; mirrors architecture-scalability-audit exactly
|
|
98
|
+
|
|
99
|
+
## Comparison and Recommendation
|
|
100
|
+
|
|
101
|
+
**Recommendation: Build B and D as two separate workflows that naturally compose.**
|
|
102
|
+
|
|
103
|
+
B (creation) and D (audit) serve different moments: B for designing from scratch, D for auditing before implementation. Together they cover the full design process. Neither alone is sufficient.
|
|
104
|
+
|
|
105
|
+
C's divergence mechanism (3 design directions) should be incorporated as a step within B's hypothesis phase, not as a separate workflow.
|
|
106
|
+
|
|
107
|
+
A is too narrow — prevents the worst failure but leaves 5 of 6 failure categories entirely to agent judgment.
|
|
108
|
+
|
|
109
|
+
## Self-Critique
|
|
110
|
+
|
|
111
|
+
**Strongest counter-argument**: B is too heavy for everyday use. Adding a button to a screen shouldn't require 6 phases and reviewer families. Counter: the Simple fast path (auditComplexity branching) addresses this — simple changes skip everything except problem framing and spec.
|
|
112
|
+
|
|
113
|
+
**Pivot condition**: If agents produce hollow reviewer findings (generic advice not tied to the actual design context), the reviewer-family approach needs replacement with a simpler rubric where the agent self-scores against explicit criteria rather than running independent reviewers.
|
|
114
|
+
|
|
115
|
+
## Open Questions for Main Agent
|
|
116
|
+
|
|
117
|
+
1. Should D be a standalone workflow or a phase within B? (Standalone seems cleaner — used at a different moment in the process.)
|
|
118
|
+
2. What's the right output format for B? Markdown design spec? JSON with structured fields? Decision records?
|
|
119
|
+
3. Should B include a visual vocabulary section — letting the agent describe layout in structured prose (grid, card, list, etc.) as a partial substitute for visual mockups?
|
|
120
|
+
4. Is there a third workflow worth building: a **Design System Audit** that checks whether a proposed design is consistent with an existing component library?
|
|
@@ -0,0 +1,100 @@
|
|
|
1
|
+
# UI/UX Design Workflow Discovery
|
|
2
|
+
|
|
3
|
+
## Context / Ask
|
|
4
|
+
|
|
5
|
+
Map the specific ways AI agents fail at UI/UX design work and identify which failures are addressable through workflow structure. Goal: inform the design of a WorkRail workflow for UI/UX design.
|
|
6
|
+
|
|
7
|
+
## Path Recommendation
|
|
8
|
+
|
|
9
|
+
`full_spectrum` — need both landscape (catalog of AI failure modes with evidence from UX principles) and reframing (which failures a workflow can actually address vs. which are inherent AI limitations).
|
|
10
|
+
|
|
11
|
+
## Constraints / Anti-goals
|
|
12
|
+
|
|
13
|
+
- Not building a visual design tool — agents can't render or see designs
|
|
14
|
+
- Not replacing the human designer — augmenting and structuring agent-assisted design
|
|
15
|
+
- Must produce something an agent can actually execute, not just describe
|
|
16
|
+
|
|
17
|
+
## Landscape Packet
|
|
18
|
+
|
|
19
|
+
*(populated in Phase 1)*
|
|
20
|
+
|
|
21
|
+
## Problem Frame Packet
|
|
22
|
+
|
|
23
|
+
*(populated in Phase 1d)*
|
|
24
|
+
|
|
25
|
+
## Candidate Directions
|
|
26
|
+
|
|
27
|
+
*(populated in Phase 3)*
|
|
28
|
+
|
|
29
|
+
## Challenge Notes
|
|
30
|
+
|
|
31
|
+
*(populated in Phase 4)*
|
|
32
|
+
|
|
33
|
+
## Resolution Notes
|
|
34
|
+
|
|
35
|
+
*(populated in Phase 5)*
|
|
36
|
+
|
|
37
|
+
## Decision Log
|
|
38
|
+
|
|
39
|
+
*(populated as decisions are made)*
|
|
40
|
+
|
|
41
|
+
## Final Summary
|
|
42
|
+
|
|
43
|
+
*(populated at end)*
|
|
44
|
+
|
|
45
|
+
## Candidate Directions
|
|
46
|
+
|
|
47
|
+
See `docs/plans/ui-ux-workflow-design-candidates.md` for full candidate analysis.
|
|
48
|
+
|
|
49
|
+
**Recommendation: Two composing workflows**
|
|
50
|
+
|
|
51
|
+
### Workflow B: UI/UX Design Creation Workflow
|
|
52
|
+
For designing UI/UX from scratch. Adapted from `production-readiness-audit.json`.
|
|
53
|
+
|
|
54
|
+
**Phase structure:**
|
|
55
|
+
- Phase 0: Problem framing, user goals, constraints, existing design context (requireConfirmation always)
|
|
56
|
+
- Phase 1: State 2-3 design directions with IA sketches (requireConfirmation — forces alternatives before convergence)
|
|
57
|
+
- Phase 2: Freeze context packet + select reviewer families based on declared concerns
|
|
58
|
+
- Phase 3: Parallel reviewer bundle (IA, UX laws, accessibility, edge cases, content/microcopy)
|
|
59
|
+
- Phase 4: Synthesis + contradiction loop
|
|
60
|
+
- Phase 5: Design spec handoff
|
|
61
|
+
|
|
62
|
+
**Complexity branching**: Simple (single component, no new flows) skips Phases 1-3.
|
|
63
|
+
|
|
64
|
+
### Workflow D: UI/UX Design Audit Workflow
|
|
65
|
+
For reviewing an existing design description/spec before implementation. Adapted from `architecture-scalability-audit.json`.
|
|
66
|
+
|
|
67
|
+
User provides design description; agent audits against declared dimensions: information architecture, Hick/Miller/Jakob/Fitts laws, accessibility (WCAG), edge cases (empty/error/loading/first-use), content/microcopy, visual hierarchy.
|
|
68
|
+
|
|
69
|
+
**Composability**: Run B to create the design spec, then D to audit it before handing to implementation.
|
|
70
|
+
|
|
71
|
+
## Decision Log
|
|
72
|
+
|
|
73
|
+
- A (Minimal gate) rejected: only addresses 1 of 6 failure categories
|
|
74
|
+
- C (Alternatives-first) incorporated as required step in B Phase 1, not standalone
|
|
75
|
+
- B + D selected: B for creation, D for audit, different moments in design process
|
|
76
|
+
- Challenge: reviewer generic findings — mitigated by context packet constraint
|
|
77
|
+
- Challenge: too heavy — mitigated by Simple fast path
|
|
78
|
+
|
|
79
|
+
## Final Summary
|
|
80
|
+
|
|
81
|
+
**The core insight**: AI agents don't fail at UI/UX because they lack knowledge — they fail because nothing forces them through the process that converts knowledge into good design. The workflow's job is to make the right process structurally unavoidable.
|
|
82
|
+
|
|
83
|
+
**What agents are bad at (6 categories):**
|
|
84
|
+
1. Process — premature convergence on single solution, happy-path only
|
|
85
|
+
2. UX law application — knows Hick/Miller/Jakob/Peak-End but skips them
|
|
86
|
+
3. Content — designs layout, ignores microcopy and error states
|
|
87
|
+
4. Accessibility — all 17 WCAG categories routinely missed
|
|
88
|
+
5. Context — doesn't know design system, platform, brand
|
|
89
|
+
6. Inherent — can't see/render, can't test with users
|
|
90
|
+
|
|
91
|
+
**What a workflow can fix (categories 1-4):**
|
|
92
|
+
Structural enforcement through phases, reviewer families, and requireConfirmation gates.
|
|
93
|
+
|
|
94
|
+
**What requires human input (categories 5-6):**
|
|
95
|
+
Design system, platform conventions, brand personality, real user research, visual review.
|
|
96
|
+
|
|
97
|
+
**Residual risks:**
|
|
98
|
+
- Reviewer families may produce generic findings without context packet constraint
|
|
99
|
+
- Simple path may be overused — needs explicit criteria
|
|
100
|
+
- Output is a spec, not a mockup — acceptable for developer-AI teams
|
|
@@ -0,0 +1,48 @@
|
|
|
1
|
+
# UI/UX Workflow Design Review Findings
|
|
2
|
+
|
|
3
|
+
## Tradeoff Review
|
|
4
|
+
|
|
5
|
+
| Tradeoff | Verdict | Fails if |
|
|
6
|
+
|---|---|---|
|
|
7
|
+
| Spec not mockup | Acceptable | Team needs visual approval before implementation — add external review checkpoint |
|
|
8
|
+
| Context blindness | Acceptable | Agent proceeds without surfacing missing context — Phase 0 requireConfirmation handles this |
|
|
9
|
+
| Simple path overuse | Acceptable | Agent misclassifies complex work as Simple — requireConfirmation on classification |
|
|
10
|
+
|
|
11
|
+
## Failure Mode Review
|
|
12
|
+
|
|
13
|
+
| Failure Mode | Risk | Coverage | Missing Mitigation |
|
|
14
|
+
|---|---|---|---|
|
|
15
|
+
| Generic reviewer findings | **High** | Partially | Reviewers must cite specific elements from the design description using frozen context packet |
|
|
16
|
+
| Simple path overuse | Medium | Partially | requireConfirmation on complexity classification with explicit Simple criteria |
|
|
17
|
+
| Wrong output format | Low | N/A | Not a workflow problem — addressed by how the spec is structured |
|
|
18
|
+
|
|
19
|
+
**Highest risk: generic reviewer findings.** If reviewers produce platitudes ("consider grouping related items") instead of specific findings, the workflow produces the appearance of rigor without substance.
|
|
20
|
+
|
|
21
|
+
## Runner-Up / Simpler Alternative Review
|
|
22
|
+
|
|
23
|
+
- C (Alternatives-First) already incorporated as required step in B's hypothesis phase
|
|
24
|
+
- Simpler variant (remove context packet phase) rejected — reviewers lose consistency without shared context
|
|
25
|
+
- No hybrid opportunities
|
|
26
|
+
|
|
27
|
+
## Philosophy Alignment
|
|
28
|
+
|
|
29
|
+
All principles satisfied: make-illegal-states-unrepresentable, validate-at-boundaries, structured-freedom, errors-are-data, YAGNI (via Simple fast path). Minor acceptable tension on YAGNI resolved by Simple path.
|
|
30
|
+
|
|
31
|
+
## Findings
|
|
32
|
+
|
|
33
|
+
**Orange — Reviewer finding quality gate missing**
|
|
34
|
+
Both B and D need explicit constraints that reviewer families must cite specific elements from the design description or context packet. Generic UX advice without reference to the actual design is a failing output, not a valid finding. Add to metaGuidance or reviewer instructions.
|
|
35
|
+
|
|
36
|
+
**Yellow — Simple complexity criteria need explicit definition**
|
|
37
|
+
"Simple" must be defined concretely in Phase 0: single component modification, no IA changes, no new user flows, no new interaction patterns. Without this, agents will self-classify to avoid rigor.
|
|
38
|
+
|
|
39
|
+
## Recommended Revisions
|
|
40
|
+
|
|
41
|
+
1. Add to reviewer family instructions: "Each finding must cite a specific element, component, or decision from the frozen context packet. Generic UX principles without specific reference are not findings."
|
|
42
|
+
2. Define Simple explicitly: single component, no new flows, no IA changes, no new interaction patterns — everything else is Standard or Complex.
|
|
43
|
+
3. Consider a `designVocabulary` section in the context packet: let the agent describe layout using structured prose terms (grid, card, list, modal, drawer, etc.) as a partial substitute for visual mockups.
|
|
44
|
+
|
|
45
|
+
## Residual Concerns
|
|
46
|
+
|
|
47
|
+
- The audit workflow (D) is most useful after B produces a spec — the workflow catalog should make this composability explicit in descriptions
|
|
48
|
+
- Neither B nor D addresses the design system consistency problem (does this design match existing components?) — potential third workflow or reviewer family
|