picasso-skill 2.3.1 → 2.4.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/agents/picasso.md +4 -2
- package/package.json +1 -1
- package/references/ux-evaluation.md +211 -0
package/agents/picasso.md
CHANGED
|
@@ -49,9 +49,10 @@ Before showing anything or asking anything:
|
|
|
49
49
|
|
|
50
50
|
1. **Read the codebase** -- understand what the app does, the tech stack, existing design patterns, current colors/fonts/layout
|
|
51
51
|
2. **Identify the product type** -- SaaS dashboard, marketing site, e-commerce, portfolio, internal tool, mobile app
|
|
52
|
-
3. **
|
|
52
|
+
3. **Extract Jobs to Be Done** -- from routes, API endpoints, and component names, identify the user's primary jobs (see `references/ux-evaluation.md` Section 2). What triggers bring users here? What outcome are they after? What context are they in (rushed? focused? mobile?)?
|
|
53
53
|
4. **Study 2-3 real competitors** in the same space -- what do actual products in this industry look like?
|
|
54
54
|
5. **Load `references/style-presets.md`** -- find the 8-12 presets most relevant to this product type
|
|
55
|
+
6. **Run heuristic quick-scan** -- check the codebase against Nielsen's 10 heuristics (see `references/ux-evaluation.md` Section 1) to identify the biggest UX gaps. This informs which design directions to generate.
|
|
55
56
|
|
|
56
57
|
This step is silent. Do not ask the user anything. Just gather context.
|
|
57
58
|
|
|
@@ -315,6 +316,7 @@ skills/picasso/references/design-system.md # DESIGN.md, theming, token
|
|
|
315
316
|
skills/picasso/references/generative-art.md # p5.js, SVG, canvas
|
|
316
317
|
skills/picasso/references/component-patterns.md # Naming, taxonomy, state matrix
|
|
317
318
|
skills/picasso/references/ux-psychology.md # Gestalt, Fitts's Law, heuristics
|
|
319
|
+
skills/picasso/references/ux-evaluation.md # Nielsen's 10 heuristics, JTBD, state machines, prompt enhancement
|
|
318
320
|
skills/picasso/references/ux-writing.md # Error messages, microcopy, CTAs
|
|
319
321
|
skills/picasso/references/data-visualization.md # Chart matrix, dashboards, Tufte
|
|
320
322
|
skills/picasso/references/conversion-design.md # Landing pages, CTAs, pricing
|
|
@@ -861,7 +863,7 @@ Run a comprehensive scoring algorithm:
|
|
|
861
863
|
1. **Typography (0-15 pts)**: font choice (not banned default: 3), type scale consistency (3), max-width on text (3), line-height correctness (3), letter-spacing on caps (3)
|
|
862
864
|
2. **Color (0-15 pts)**: no pure black/gray (3), OKLCH or HSL usage (3), tinted neutrals (3), 60-30-10 rule (3), semantic colors exist (3)
|
|
863
865
|
3. **Spacing (0-10 pts)**: consistent scale (5), Gestalt grouping (5)
|
|
864
|
-
4. **
|
|
866
|
+
4. **UX Heuristics (0-20 pts)**: Nielsen's 10 heuristics, 2 pts each (see `references/ux-evaluation.md` Section 5). Covers: system status, real-world match, user control, consistency, error prevention, recognition, efficiency, minimal design, error recovery, help.
|
|
865
867
|
5. **Motion (0-10 pts)**: no transition:all (3), stagger pattern (3), reduced-motion support (2), no bounce easing (2)
|
|
866
868
|
6. **Responsive (0-10 pts)**: works at 375px (5), touch targets (3), no horizontal scroll (2)
|
|
867
869
|
7. **Performance (0-10 pts)**: Lighthouse perf score mapped (0-100 -> 0-10)
|
package/package.json
CHANGED
|
@@ -0,0 +1,211 @@
|
|
|
1
|
+
# UX Evaluation Reference
|
|
2
|
+
|
|
3
|
+
Structured frameworks for evaluating interface quality. Use these during /score, /roast, /audit, and the visual discovery crawl phase.
|
|
4
|
+
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
## 1. Nielsen's 10 Usability Heuristics (Evaluation Checklist)
|
|
8
|
+
|
|
9
|
+
For each heuristic, check the listed indicators. Score pass/fail for each.
|
|
10
|
+
|
|
11
|
+
### H1: Visibility of System Status
|
|
12
|
+
The system should always keep users informed about what is going on.
|
|
13
|
+
- [ ] Loading states exist for async actions (skeletons, spinners, progress bars)
|
|
14
|
+
- [ ] Form submission shows pending/success/error feedback
|
|
15
|
+
- [ ] Current page/section is highlighted in navigation
|
|
16
|
+
- [ ] Active filters/sorts are visually indicated
|
|
17
|
+
- [ ] Upload progress is shown
|
|
18
|
+
- **Check in code:** grep for loading states, skeleton components, progress indicators
|
|
19
|
+
- **Check in screenshot:** is the current nav item highlighted? Are there loading indicators?
|
|
20
|
+
|
|
21
|
+
### H2: Match Between System and Real World
|
|
22
|
+
Use language and concepts familiar to the user, not system-oriented terms.
|
|
23
|
+
- [ ] Button labels use verbs the user understands ("Save changes" not "Submit")
|
|
24
|
+
- [ ] Error messages explain the problem in plain language
|
|
25
|
+
- [ ] Navigation labels match user mental models
|
|
26
|
+
- [ ] Icons are conventional (trash = delete, pencil = edit, plus = add)
|
|
27
|
+
- **Check in code:** grep for generic labels ("Submit", "Click here", "Data")
|
|
28
|
+
|
|
29
|
+
### H3: User Control and Freedom
|
|
30
|
+
Users need a clear emergency exit when they make mistakes.
|
|
31
|
+
- [ ] Modals have close buttons AND escape key support
|
|
32
|
+
- [ ] Destructive actions have confirmation OR undo
|
|
33
|
+
- [ ] Multi-step flows have back navigation
|
|
34
|
+
- [ ] Users can cancel in-progress operations
|
|
35
|
+
- **Check in code:** grep for confirm() dialogs, undo patterns, modal close handlers
|
|
36
|
+
|
|
37
|
+
### H4: Consistency and Standards
|
|
38
|
+
Follow platform conventions. Same action = same result everywhere.
|
|
39
|
+
- [ ] Primary buttons look the same across all pages
|
|
40
|
+
- [ ] Same icon means the same thing everywhere
|
|
41
|
+
- [ ] Spacing and typography follow a consistent scale
|
|
42
|
+
- [ ] Color meanings are consistent (red = error, green = success)
|
|
43
|
+
- **Check in code:** grep for hardcoded colors, inconsistent button styles
|
|
44
|
+
|
|
45
|
+
### H5: Error Prevention
|
|
46
|
+
Prevent problems from occurring in the first place.
|
|
47
|
+
- [ ] Required fields are marked before submission
|
|
48
|
+
- [ ] Date inputs use pickers (not free text)
|
|
49
|
+
- [ ] Destructive buttons are visually distinct (red/outlined, not primary)
|
|
50
|
+
- [ ] Inline validation catches errors before form submission
|
|
51
|
+
- **Check in code:** grep for required fields, inline validation, input types
|
|
52
|
+
|
|
53
|
+
### H6: Recognition Rather Than Recall
|
|
54
|
+
Minimize memory load. Make options visible.
|
|
55
|
+
- [ ] Navigation is always visible (not hidden behind hamburger on desktop)
|
|
56
|
+
- [ ] Search results show context around matches
|
|
57
|
+
- [ ] Forms show labels (not placeholder-only)
|
|
58
|
+
- [ ] Recent items, favorites, or shortcuts are available
|
|
59
|
+
- **Check in screenshot:** are labels visible? Is navigation persistent?
|
|
60
|
+
|
|
61
|
+
### H7: Flexibility and Efficiency of Use
|
|
62
|
+
Allow experts to speed up their workflow.
|
|
63
|
+
- [ ] Keyboard shortcuts exist for frequent actions
|
|
64
|
+
- [ ] Bulk operations are available for lists
|
|
65
|
+
- [ ] Command palette or search exists (Cmd+K)
|
|
66
|
+
- [ ] Default values are intelligent
|
|
67
|
+
- **Check in code:** grep for keyboard event listeners, bulk action patterns
|
|
68
|
+
|
|
69
|
+
### H8: Aesthetic and Minimalist Design
|
|
70
|
+
Every extra element competes with relevant information.
|
|
71
|
+
- [ ] No decorative elements that don't serve a purpose
|
|
72
|
+
- [ ] Information hierarchy is clear (most important = most prominent)
|
|
73
|
+
- [ ] White space is used to group related elements
|
|
74
|
+
- [ ] No more than 3-4 colors for data categories
|
|
75
|
+
- **Check in screenshot:** squint test -- does hierarchy still read?
|
|
76
|
+
|
|
77
|
+
### H9: Help Users Recognize, Diagnose, and Recover from Errors
|
|
78
|
+
Error messages should be in plain language, indicate the problem, and suggest a fix.
|
|
79
|
+
- [ ] Error messages follow: what happened + why + how to fix
|
|
80
|
+
- [ ] Form errors appear next to the relevant field
|
|
81
|
+
- [ ] API errors don't show raw technical messages to users
|
|
82
|
+
- [ ] Empty states guide the user on what to do next
|
|
83
|
+
- **Check in code:** grep for error handling, error messages, empty states
|
|
84
|
+
|
|
85
|
+
### H10: Help and Documentation
|
|
86
|
+
Even though a system should be usable without docs, help should be available.
|
|
87
|
+
- [ ] Tooltips explain non-obvious UI elements
|
|
88
|
+
- [ ] Onboarding exists for first-time users
|
|
89
|
+
- [ ] Complex features have inline help or documentation links
|
|
90
|
+
- [ ] Keyboard shortcuts are discoverable
|
|
91
|
+
- **Check in code:** grep for tooltip components, help text, onboarding flows
|
|
92
|
+
|
|
93
|
+
---
|
|
94
|
+
|
|
95
|
+
## 2. Jobs to Be Done (JTBD) Framework
|
|
96
|
+
|
|
97
|
+
Use JTBD to understand WHY users interact with the app, not just WHAT they do. This informs design decisions during the crawl phase.
|
|
98
|
+
|
|
99
|
+
### Extracting JTBD from Code
|
|
100
|
+
|
|
101
|
+
Analyze the codebase to identify user jobs:
|
|
102
|
+
|
|
103
|
+
1. **Route structure** reveals user tasks:
|
|
104
|
+
- `/dashboard` = "When I start my day, I want to see what needs attention"
|
|
105
|
+
- `/clients/[id]` = "When I work on a client, I want all their info in one place"
|
|
106
|
+
- `/billing` = "When I need to invoice, I want to track time and generate bills"
|
|
107
|
+
- `/analyze` = "When I receive a contract, I want to understand the risks"
|
|
108
|
+
|
|
109
|
+
2. **API endpoints** reveal user actions:
|
|
110
|
+
- POST /api/clients = "I want to onboard a new client"
|
|
111
|
+
- POST /api/analyze = "I want AI to review this document"
|
|
112
|
+
- GET /api/dashboard = "I want a summary of my practice"
|
|
113
|
+
|
|
114
|
+
3. **Component names** reveal UI functions:
|
|
115
|
+
- `<ClientForm>` = data entry job
|
|
116
|
+
- `<TimerWidget>` = time tracking job
|
|
117
|
+
- `<RedlineView>` = document review job
|
|
118
|
+
|
|
119
|
+
### Using JTBD to Inform Design
|
|
120
|
+
|
|
121
|
+
For each identified job, ask:
|
|
122
|
+
- **What's the trigger?** When does the user need to do this?
|
|
123
|
+
- **What's the desired outcome?** What does success look like?
|
|
124
|
+
- **What's the anxiety?** What could go wrong?
|
|
125
|
+
- **What's the context?** Where/when do they do this? (mobile? desktop? in a meeting?)
|
|
126
|
+
|
|
127
|
+
Design decisions should optimize for the job:
|
|
128
|
+
- High-frequency jobs need the fastest path (fewest clicks, most prominent placement)
|
|
129
|
+
- High-stakes jobs need the most clarity (larger text, explicit confirmation, clear feedback)
|
|
130
|
+
- Time-pressured jobs need efficiency (keyboard shortcuts, bulk actions, smart defaults)
|
|
131
|
+
|
|
132
|
+
---
|
|
133
|
+
|
|
134
|
+
## 3. Prompt Enhancement
|
|
135
|
+
|
|
136
|
+
When a user gives a vague design request, enhance it before proceeding.
|
|
137
|
+
|
|
138
|
+
### Vague-to-Specific Mapping
|
|
139
|
+
|
|
140
|
+
| User Says | What They Mean | What to Do |
|
|
141
|
+
|-----------|---------------|------------|
|
|
142
|
+
| "Make it look good" | It looks amateur, fix the obvious issues | Run /audit, fix critical+high |
|
|
143
|
+
| "Make it modern" | It looks dated, update the aesthetic | Check font (is it Arial?), colors (pure gray?), radius (sharp corners?) |
|
|
144
|
+
| "Make it clean" | Too much visual noise, simplify | Remove decorative elements, increase whitespace, reduce color count |
|
|
145
|
+
| "Make it pop" | Not enough visual hierarchy, too flat | Increase contrast, add depth, strengthen heading sizes |
|
|
146
|
+
| "Make it professional" | It looks like a student project | Fix typography scale, add consistent spacing, tighten color palette |
|
|
147
|
+
| "I don't know what I want" | They need visual discovery | Generate the 10-20 sample gallery and let them react |
|
|
148
|
+
|
|
149
|
+
### Enhancement Process
|
|
150
|
+
|
|
151
|
+
1. Identify the complaint (what's wrong) vs. the goal (what they want)
|
|
152
|
+
2. Map to specific design properties (typography, color, spacing, layout, motion)
|
|
153
|
+
3. Propose concrete changes with before/after preview
|
|
154
|
+
4. Never ask "what do you mean by modern?" -- instead, show 3 interpretations and ask which fits
|
|
155
|
+
|
|
156
|
+
---
|
|
157
|
+
|
|
158
|
+
## 4. State Machine for Interactive Components
|
|
159
|
+
|
|
160
|
+
Map all states for each interactive element. Missing states are the #1 source of unpolished UI.
|
|
161
|
+
|
|
162
|
+
### The 8 States
|
|
163
|
+
|
|
164
|
+
Every interactive element should define:
|
|
165
|
+
|
|
166
|
+
| State | Visual Treatment | Trigger |
|
|
167
|
+
|-------|-----------------|---------|
|
|
168
|
+
| **Default** | Base appearance | Page load |
|
|
169
|
+
| **Hover** | Subtle background/border change | Mouse enters |
|
|
170
|
+
| **Focus** | Visible ring/outline (2px+ solid) | Tab navigation |
|
|
171
|
+
| **Active/Pressed** | Scale down slightly (0.97-0.98) | Mouse down |
|
|
172
|
+
| **Disabled** | Reduced opacity (0.5), no pointer | Programmatic |
|
|
173
|
+
| **Loading** | Spinner or pulse, disabled interaction | Async action |
|
|
174
|
+
| **Error** | Red border/text, error message | Validation fail |
|
|
175
|
+
| **Success** | Green indicator, confirmation | Action complete |
|
|
176
|
+
|
|
177
|
+
### Audit Checklist
|
|
178
|
+
|
|
179
|
+
For each component type, verify states exist:
|
|
180
|
+
|
|
181
|
+
| Component | States to Check |
|
|
182
|
+
|-----------|----------------|
|
|
183
|
+
| Button | default, hover, focus, active, disabled, loading |
|
|
184
|
+
| Input | default, hover, focus, filled, error, disabled |
|
|
185
|
+
| Card (clickable) | default, hover, focus, active |
|
|
186
|
+
| Link | default, hover, focus, visited |
|
|
187
|
+
| Toggle | off, on, hover, focus, disabled |
|
|
188
|
+
| Select | default, hover, focus, open, selected, error |
|
|
189
|
+
| Modal | enter, exit, backdrop |
|
|
190
|
+
|
|
191
|
+
---
|
|
192
|
+
|
|
193
|
+
## 5. Scoring with Heuristics
|
|
194
|
+
|
|
195
|
+
When running /score, add heuristic evaluation points:
|
|
196
|
+
|
|
197
|
+
```
|
|
198
|
+
Heuristic Evaluation (0-20 pts):
|
|
199
|
+
H1 System status: /2 (loading states, feedback)
|
|
200
|
+
H2 Real world match: /2 (language, icons)
|
|
201
|
+
H3 User control: /2 (undo, escape, back)
|
|
202
|
+
H4 Consistency: /2 (styles, patterns)
|
|
203
|
+
H5 Error prevention: /2 (validation, confirmation)
|
|
204
|
+
H6 Recognition: /2 (labels, navigation)
|
|
205
|
+
H7 Efficiency: /2 (shortcuts, bulk ops)
|
|
206
|
+
H8 Minimal design: /2 (hierarchy, whitespace)
|
|
207
|
+
H9 Error recovery: /2 (messages, guidance)
|
|
208
|
+
H10 Help: /2 (tooltips, onboarding)
|
|
209
|
+
```
|
|
210
|
+
|
|
211
|
+
This replaces the ad-hoc accessibility scoring with a structured UX evaluation.
|