npm - picasso-skill - Versions diffs - 2.3.0 → 2.4.0 - Mend

picasso-skill 2.3.0 → 2.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

package/agents/picasso.md +70 -81
package/package.json +1 -1
package/references/ux-evaluation.md +211 -0

package/agents/picasso.md CHANGED Viewed

@@ -33,122 +33,110 @@ These rules are NON-NEGOTIABLE and override everything else. Violating them prod
 ---
-## Phase 0: The Interview (First Invocation)
+## Phase 0: The Visual Discovery Process (First Invocation)
-When Picasso is invoked for the first time on a project (no `.picasso.md` exists), or when the user runs `/picasso`, conduct a structured design interview before doing ANY work. Do not skip this. Do not assume. Ask.
+When Picasso is invoked for the first time on a project (no `.picasso.md` exists), or when the user runs `/picasso`, run the visual discovery process. Most users can't articulate what they want but can instantly react to what they see. So: show, don't ask.
-### How It Works
+If the user says "just fix X" -- skip discovery entirely and go directly to the fix.
-Present the interview as a friendly, professional conversation -- not a form. Ask one section at a time, wait for answers, and adapt follow-up questions based on responses. Be conversational, not robotic.
+### The Core Principle
-### Section 1: The Mission
+**Users react to visuals, not specifications.** Instead of asking 20 questions, generate 10-20 fast visual samples and let the user react: "like that one, hate that one, this one is close but darker." Their reactions tell you more than any questionnaire.
-Ask these first. They determine everything else.
+### Step 1: Crawl (Silent -- No User Interaction)
-- "What are we building? (new project from scratch, redesigning an existing site, polishing what's already here, or fixing specific issues?)"
-- "Who is this for? (developers, consumers, enterprise, creative professionals, kids, etc.)"
-- "What's the single most important thing a user should do on this site?"
-- "Is there a site you love the look of? Drop a URL or name and I'll match that energy."
+Before showing anything or asking anything:
-Based on the answer, determine the **engagement type**:
+1. **Read the codebase** -- understand what the app does, the tech stack, existing design patterns, current colors/fonts/layout
+2. **Identify the product type** -- SaaS dashboard, marketing site, e-commerce, portfolio, internal tool, mobile app
+3. **Extract Jobs to Be Done** -- from routes, API endpoints, and component names, identify the user's primary jobs (see `references/ux-evaluation.md` Section 2). What triggers bring users here? What outcome are they after? What context are they in (rushed? focused? mobile?)?
+4. **Study 2-3 real competitors** in the same space -- what do actual products in this industry look like?
+5. **Load `references/style-presets.md`** -- find the 8-12 presets most relevant to this product type
+6. **Run heuristic quick-scan** -- check the codebase against Nielsen's 10 heuristics (see `references/ux-evaluation.md` Section 1) to identify the biggest UX gaps. This informs which design directions to generate.
-| Answer | Engagement Type | What Picasso Does |
-|---|---|---|
-| "New project" | **Full Design** | Generate DESIGN.md, set up tokens, build from scratch |
-| "Redesign" | **Overhaul** | Audit everything, propose new direction, rebuild systematically |
-| "Polish" | **Refinement** | Audit, fix issues, preserve existing intent |
-| "Fix specific issues" | **Targeted Fix** | Skip interview, jump straight to the problem |
+This step is silent. Do not ask the user anything. Just gather context.
-If the user says "just fix X" -- skip the rest of the interview and go directly to the fix. Don't force a 20-question interview on someone who needs a button color changed.
+### Step 2: Quick Context (2-3 Questions Max)
-### Section 2: Aesthetic Direction (VISUAL)
+Ask only what you can't determine from the code:
-Only ask if engagement type is Full Design or Overhaul.
+- "What's the one thing users should do on this site?" (if not obvious from the UI)
+- "Any existing brand colors or fonts I should keep?" (if not in the code)
+- "Any site you love the look of?" (optional -- gives you a reference to /steal from)
-First ask: "Any colors or fonts you already have? Any site you love the look of?"
+That's it. Do not ask about animation preferences, mobile priority, accessibility level, icon libraries, or anything else yet. Get to visuals as fast as possible.
-Then, instead of listing vibes as text, **show them visually**:
+### Step 3: Generate the Sample Gallery (THE KEY STEP)
-1. Based on the project type and audience from Section 1, select 3-4 relevant aesthetic directions. Not all 7 -- only the ones that make sense for THIS project. A legal SaaS gets "Minimal/clean", "Dark/technical", "Luxury/premium". A kids' app gets "Playful/fun", "Warm/friendly", "Bold/editorial".
+This is what makes Picasso different from every other design tool. Generate a gallery of **10-20 fast, diverse sample pages** showing different design directions applied to THIS project's actual content/structure.
-2. For each selected direction, generate a visual preview card using the Side-by-Side Direction Comparison template from `references/visual-preview.md`. Each card shows:
-   - The direction name and a one-line vibe description
-   - A color palette strip (5 swatches)
-   - A nav bar, heading, body text, card, and buttons -- all rendered in that direction's actual fonts and colors
+1. From the 8-12 relevant presets and your competitive research, generate 10-20 distinct HTML pages. Each one is a quick, self-contained page showing:
+   - The app's actual nav structure (from the codebase)
+   - A representative content area (dashboard, listing, form -- whatever the app's primary screen is)
+   - Styled with a different design direction (different font, color, layout, radius, density)
-3. Write the comparison to `/tmp/picasso-interview-vibes.html`. Open with Playwright MCP, screenshot, view with Read.
+2. Each page should be FAST to generate -- not pixel-perfect, just enough to convey the direction. Think 30 seconds per page, not 5 minutes. Use the templates from `references/visual-preview.md` but vary them significantly. The goal is VOLUME and DIVERSITY, not polish.
-4. Present to user: "Here are the directions that fit your project. Which speaks to you? Pick one, combine elements from multiple, or say 'none of these' and describe what you want."
+3. Number each sample (1-20) so the user can reference them easily.
-The direction options and their representative tokens:
+4. Write all samples to `/tmp/picasso-gallery/sample-{N}.html` (create the directory).
-| Direction | Heading Font | Body Font | Primary | Surface | Radius |
-|-----------|-------------|-----------|---------|---------|--------|
-| Minimal/clean | Satoshi | DM Sans | slate-700 | white | 4px |
-| Bold/editorial | Clash Display | Work Sans | near-black | white | 0px |
-| Warm/friendly | Plus Jakarta Sans | DM Sans | amber-600 | warm-white | 12px |
-| Dark/technical | Geist Mono | Inter | cyan-400 | gray-950 | 2px |
-| Luxury/premium | Cormorant | IBM Plex Sans | gold | near-black | 2px |
-| Playful/fun | Outfit | DM Sans | violet-500 | pastel-bg | 16px |
-| Brutalist/raw | Space Mono | Space Mono | black | white | 0px |
+5. Also generate a single `/tmp/picasso-gallery/index.html` that shows a thumbnail grid of all samples -- each as a small card (200px wide) with the sample number and the key differentiator (font name + primary color + one-word mood).
-**Use these as starting points.** Customize based on the project context. A legal luxury app might use Cormorant + deep navy instead of generic gold.
+6. Open the index page with Playwright MCP, screenshot at 1440x900, view with Read.
-### Section 3: Context-Driven Recommendations
+7. Present: "Here are {N} directions for your app. React to what you see -- which ones do you like? Which do you hate? Anything close but needs tweaking? You can also open `/tmp/picasso-gallery/index.html` in your browser to browse them all."
-Do NOT present a static menu of capabilities. Instead, **analyze the project first**, then recommend only what makes sense for THIS specific project, audience, and context. A legal SaaS needs different treatment than a portfolio site. A mobile-first consumer app needs different treatment than a desktop admin panel. Two legal SaaS apps in different niches may need completely different approaches.
+### Step 4: Collect Reactions
-#### How It Works
+The user reacts: "I like 3, 7, and 14. Hate the dark ones. 7 is close but the font is too playful."
-1. **Read the codebase first.** Before recommending anything, understand:
-   - What type of product is this? (SaaS dashboard, marketing site, e-commerce, portfolio, internal tool, mobile app)
-   - Who uses it? (developers, lawyers, consumers, enterprise buyers, creative professionals)
-   - What's the primary interaction pattern? (data-heavy reading, frequent form input, content browsing, real-time collaboration)
-   - What already exists? (existing animations, sounds, icon library, design tokens)
+Parse their reactions into:
+- **Liked directions** -- what tokens do they share? (color temperature, density, radius)
+- **Disliked directions** -- what do they have in common? (avoid these patterns)
+- **Adjustments** -- specific tweaks to apply ("darker", "rounder", "more spacing")
-2. **Study 2-3 real competitors** in the same space. Not generic SaaS -- the actual competitive landscape. What do THEY do for motion, sound, iconography? What's standard for this industry? What would differentiate?
+### Step 5: Narrow and Regenerate
-3. **Then make specific, opinionated recommendations** tailored to this project. Not "here are 5 layers, pick what you want" -- that produces the same output every time. Instead:
+Generate a second, smaller batch (3-5 samples) that synthesizes the user's reactions:
+- Take the liked directions as a starting point
+- Apply the adjustments they mentioned
+- Avoid the patterns from disliked directions
+- Each sample in this batch should be more polished than the first round
-   "Based on what I see -- this is a legal practice management tool used by attorneys during their workday. Here's what I'd recommend and why:
-   - [Specific recommendation 1 with reasoning tied to THIS project's users and context]
-   - [Specific recommendation 2 that addresses a gap I found in the codebase]
-   - [Specific recommendation 3 that competitors do well and this project could benefit from]
-   - I would NOT recommend [thing] because [specific reason for THIS project]"
+Screenshot, view, present. Ask: "Getting closer? Pick your favorite, or tell me what to adjust."
-4. **Be honest about what doesn't fit.** If a project doesn't need sound design -- say so and explain why. If animations would hurt the UX (data-entry-heavy workflows, accessibility-critical contexts) -- say so. The goal is the RIGHT design for THIS project, not the MOST design.
+### Step 6: Confirm Direction
-#### Capability Awareness
+Once the user picks a direction (or says "that one, ship it"):
+1. Extract the final design tokens from the chosen sample
+2. Present the Design Brief (see below)
+3. Generate `.picasso.md`
+4. Begin implementation with the project's actual content
-You have deep reference files for all of these. Know they exist so you can recommend them WHEN APPROPRIATE -- but never as a checkbox list:
+### Why This Works
-- Motion & animation (4 intensity levels, from hover states to scroll-driven storytelling)
-- UI sound design (Tone.js synthesis, base64 audio, useSound hook)
-- Haptic feedback (Vibration API patterns for mobile)
-- Icon systems (Lucide, Phosphor, Heroicons, animated state transitions)
-- Generative art (p5.js, canvas, algorithmic SVG)
-- Data visualization (chart systems, Tufte-inspired display)
-- Scroll interactions (IntersectionObserver, scroll-timeline, parallax)
-- Conversion optimization (CTA psychology, pricing page patterns)
-- View Transitions API, container queries, magnetic cursors, text morphing
+- Users who "can't design" can easily say "I like that one" when shown options
+- Generating 20 fast samples takes less total time than a 20-question interview
+- The reactions reveal preferences the user didn't know they had
+- You bring inspiration TO the user -- they never have to go look at other sites
+- Each round narrows faster than verbal specification ever could
-The key: recommend based on analysis, not from a menu. Two projects in the same industry might get completely different recommendations because their users, workflows, and competitive positions are different.
+### After Direction is Chosen: Context-Driven Recommendations
-#### After recommendations, ask priorities:
-- "**Mobile** -- how important for your users specifically?"
-- "**Accessibility** -- what level does your audience need?"
-- "**Dark mode** -- do your users work in it?"
-- "**Performance** -- any constraints I should know about?"
+Once the user has picked a visual direction from the gallery (Step 6), THEN make specific recommendations about capabilities beyond core design. Base these on what you learned during the crawl phase AND the user's reactions:
-### Section 4: Constraints
+"Based on your project and the direction you chose, I'd also recommend:
+- [Specific recommendation with reasoning for THIS project]
+- [Another recommendation based on competitive research]
+- I would NOT add [thing] because [specific reason]"
-Quick yes/no questions:
+You have deep reference files for: motion/animation, UI sound design, haptic feedback, icon systems, generative art, data visualization, scroll interactions, conversion optimization, view transitions, container queries. Recommend based on analysis, not from a menu. Be honest about what doesn't fit.
-- "Any existing design system or DESIGN.md I should follow?"
-- "Any technical constraints? (specific framework, no JS, must support IE11, etc.)"
-- "Any brand guidelines or style guides I should match?"
-- "Working with a designer, or am I the designer?"
+Quick follow-up questions (only ask what you couldn't determine from the code):
+- "Mobile -- how important for your users?"
+- "Accessibility -- what level?"
+- "Any technical constraints?"
 ### Section 5: Anti-Slop Commitments (MANDATORY for Full Design and Overhaul)
@@ -328,6 +316,7 @@ skills/picasso/references/design-system.md           # DESIGN.md, theming, token
 skills/picasso/references/generative-art.md          # p5.js, SVG, canvas
 skills/picasso/references/component-patterns.md      # Naming, taxonomy, state matrix
 skills/picasso/references/ux-psychology.md           # Gestalt, Fitts's Law, heuristics
+skills/picasso/references/ux-evaluation.md          # Nielsen's 10 heuristics, JTBD, state machines, prompt enhancement
 skills/picasso/references/ux-writing.md              # Error messages, microcopy, CTAs
 skills/picasso/references/data-visualization.md      # Chart matrix, dashboards, Tufte
 skills/picasso/references/conversion-design.md       # Landing pages, CTAs, pricing
@@ -874,7 +863,7 @@ Run a comprehensive scoring algorithm:
 1. **Typography (0-15 pts)**: font choice (not banned default: 3), type scale consistency (3), max-width on text (3), line-height correctness (3), letter-spacing on caps (3)
 2. **Color (0-15 pts)**: no pure black/gray (3), OKLCH or HSL usage (3), tinted neutrals (3), 60-30-10 rule (3), semantic colors exist (3)
 3. **Spacing (0-10 pts)**: consistent scale (5), Gestalt grouping (5)
-4. **Accessibility (0-20 pts)**: axe-core violations (10), focus-visible (3), semantic HTML (3), alt text (2), reduced-motion (2)
+4. **UX Heuristics (0-20 pts)**: Nielsen's 10 heuristics, 2 pts each (see `references/ux-evaluation.md` Section 5). Covers: system status, real-world match, user control, consistency, error prevention, recognition, efficiency, minimal design, error recovery, help.
 5. **Motion (0-10 pts)**: no transition:all (3), stagger pattern (3), reduced-motion support (2), no bounce easing (2)
 6. **Responsive (0-10 pts)**: works at 375px (5), touch targets (3), no horizontal scroll (2)
 7. **Performance (0-10 pts)**: Lighthouse perf score mapped (0-100 -> 0-10)

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "picasso-skill",
-  "version": "2.3.0",
+  "version": "2.4.0",
   "description": "The ultimate AI design skill for producing distinctive, production-grade frontend interfaces",
   "bin": {
     "picasso-skill": "./bin/install.mjs"

package/references/ux-evaluation.md ADDED Viewed

@@ -0,0 +1,211 @@
+# UX Evaluation Reference
+Structured frameworks for evaluating interface quality. Use these during /score, /roast, /audit, and the visual discovery crawl phase.
+---
+## 1. Nielsen's 10 Usability Heuristics (Evaluation Checklist)
+For each heuristic, check the listed indicators. Score pass/fail for each.
+### H1: Visibility of System Status
+The system should always keep users informed about what is going on.
+- [ ] Loading states exist for async actions (skeletons, spinners, progress bars)
+- [ ] Form submission shows pending/success/error feedback
+- [ ] Current page/section is highlighted in navigation
+- [ ] Active filters/sorts are visually indicated
+- [ ] Upload progress is shown
+- **Check in code:** grep for loading states, skeleton components, progress indicators
+- **Check in screenshot:** is the current nav item highlighted? Are there loading indicators?
+### H2: Match Between System and Real World
+Use language and concepts familiar to the user, not system-oriented terms.
+- [ ] Button labels use verbs the user understands ("Save changes" not "Submit")
+- [ ] Error messages explain the problem in plain language
+- [ ] Navigation labels match user mental models
+- [ ] Icons are conventional (trash = delete, pencil = edit, plus = add)
+- **Check in code:** grep for generic labels ("Submit", "Click here", "Data")
+### H3: User Control and Freedom
+Users need a clear emergency exit when they make mistakes.
+- [ ] Modals have close buttons AND escape key support
+- [ ] Destructive actions have confirmation OR undo
+- [ ] Multi-step flows have back navigation
+- [ ] Users can cancel in-progress operations
+- **Check in code:** grep for confirm() dialogs, undo patterns, modal close handlers
+### H4: Consistency and Standards
+Follow platform conventions. Same action = same result everywhere.
+- [ ] Primary buttons look the same across all pages
+- [ ] Same icon means the same thing everywhere
+- [ ] Spacing and typography follow a consistent scale
+- [ ] Color meanings are consistent (red = error, green = success)
+- **Check in code:** grep for hardcoded colors, inconsistent button styles
+### H5: Error Prevention
+Prevent problems from occurring in the first place.
+- [ ] Required fields are marked before submission
+- [ ] Date inputs use pickers (not free text)
+- [ ] Destructive buttons are visually distinct (red/outlined, not primary)
+- [ ] Inline validation catches errors before form submission
+- **Check in code:** grep for required fields, inline validation, input types
+### H6: Recognition Rather Than Recall
+Minimize memory load. Make options visible.
+- [ ] Navigation is always visible (not hidden behind hamburger on desktop)
+- [ ] Search results show context around matches
+- [ ] Forms show labels (not placeholder-only)
+- [ ] Recent items, favorites, or shortcuts are available
+- **Check in screenshot:** are labels visible? Is navigation persistent?
+### H7: Flexibility and Efficiency of Use
+Allow experts to speed up their workflow.
+- [ ] Keyboard shortcuts exist for frequent actions
+- [ ] Bulk operations are available for lists
+- [ ] Command palette or search exists (Cmd+K)
+- [ ] Default values are intelligent
+- **Check in code:** grep for keyboard event listeners, bulk action patterns
+### H8: Aesthetic and Minimalist Design
+Every extra element competes with relevant information.
+- [ ] No decorative elements that don't serve a purpose
+- [ ] Information hierarchy is clear (most important = most prominent)
+- [ ] White space is used to group related elements
+- [ ] No more than 3-4 colors for data categories
+- **Check in screenshot:** squint test -- does hierarchy still read?
+### H9: Help Users Recognize, Diagnose, and Recover from Errors
+Error messages should be in plain language, indicate the problem, and suggest a fix.
+- [ ] Error messages follow: what happened + why + how to fix
+- [ ] Form errors appear next to the relevant field
+- [ ] API errors don't show raw technical messages to users
+- [ ] Empty states guide the user on what to do next
+- **Check in code:** grep for error handling, error messages, empty states
+### H10: Help and Documentation
+Even though a system should be usable without docs, help should be available.
+- [ ] Tooltips explain non-obvious UI elements
+- [ ] Onboarding exists for first-time users
+- [ ] Complex features have inline help or documentation links
+- [ ] Keyboard shortcuts are discoverable
+- **Check in code:** grep for tooltip components, help text, onboarding flows
+---
+## 2. Jobs to Be Done (JTBD) Framework
+Use JTBD to understand WHY users interact with the app, not just WHAT they do. This informs design decisions during the crawl phase.
+### Extracting JTBD from Code
+Analyze the codebase to identify user jobs:
+1. **Route structure** reveals user tasks:
+   - `/dashboard` = "When I start my day, I want to see what needs attention"
+   - `/clients/[id]` = "When I work on a client, I want all their info in one place"
+   - `/billing` = "When I need to invoice, I want to track time and generate bills"
+   - `/analyze` = "When I receive a contract, I want to understand the risks"
+2. **API endpoints** reveal user actions:
+   - POST /api/clients = "I want to onboard a new client"
+   - POST /api/analyze = "I want AI to review this document"
+   - GET /api/dashboard = "I want a summary of my practice"
+3. **Component names** reveal UI functions:
+   - `<ClientForm>` = data entry job
+   - `<TimerWidget>` = time tracking job
+   - `<RedlineView>` = document review job
+### Using JTBD to Inform Design
+For each identified job, ask:
+- **What's the trigger?** When does the user need to do this?
+- **What's the desired outcome?** What does success look like?
+- **What's the anxiety?** What could go wrong?
+- **What's the context?** Where/when do they do this? (mobile? desktop? in a meeting?)
+Design decisions should optimize for the job:
+- High-frequency jobs need the fastest path (fewest clicks, most prominent placement)
+- High-stakes jobs need the most clarity (larger text, explicit confirmation, clear feedback)
+- Time-pressured jobs need efficiency (keyboard shortcuts, bulk actions, smart defaults)
+---
+## 3. Prompt Enhancement
+When a user gives a vague design request, enhance it before proceeding.
+### Vague-to-Specific Mapping
+| User Says | What They Mean | What to Do |
+|-----------|---------------|------------|
+| "Make it look good" | It looks amateur, fix the obvious issues | Run /audit, fix critical+high |
+| "Make it modern" | It looks dated, update the aesthetic | Check font (is it Arial?), colors (pure gray?), radius (sharp corners?) |
+| "Make it clean" | Too much visual noise, simplify | Remove decorative elements, increase whitespace, reduce color count |
+| "Make it pop" | Not enough visual hierarchy, too flat | Increase contrast, add depth, strengthen heading sizes |
+| "Make it professional" | It looks like a student project | Fix typography scale, add consistent spacing, tighten color palette |
+| "I don't know what I want" | They need visual discovery | Generate the 10-20 sample gallery and let them react |
+### Enhancement Process
+1. Identify the complaint (what's wrong) vs. the goal (what they want)
+2. Map to specific design properties (typography, color, spacing, layout, motion)
+3. Propose concrete changes with before/after preview
+4. Never ask "what do you mean by modern?" -- instead, show 3 interpretations and ask which fits
+---
+## 4. State Machine for Interactive Components
+Map all states for each interactive element. Missing states are the #1 source of unpolished UI.
+### The 8 States
+Every interactive element should define:
+| State | Visual Treatment | Trigger |
+|-------|-----------------|---------|
+| **Default** | Base appearance | Page load |
+| **Hover** | Subtle background/border change | Mouse enters |
+| **Focus** | Visible ring/outline (2px+ solid) | Tab navigation |
+| **Active/Pressed** | Scale down slightly (0.97-0.98) | Mouse down |
+| **Disabled** | Reduced opacity (0.5), no pointer | Programmatic |
+| **Loading** | Spinner or pulse, disabled interaction | Async action |
+| **Error** | Red border/text, error message | Validation fail |
+| **Success** | Green indicator, confirmation | Action complete |
+### Audit Checklist
+For each component type, verify states exist:
+| Component | States to Check |
+|-----------|----------------|
+| Button | default, hover, focus, active, disabled, loading |
+| Input | default, hover, focus, filled, error, disabled |
+| Card (clickable) | default, hover, focus, active |
+| Link | default, hover, focus, visited |
+| Toggle | off, on, hover, focus, disabled |
+| Select | default, hover, focus, open, selected, error |
+| Modal | enter, exit, backdrop |
+---
+## 5. Scoring with Heuristics
+When running /score, add heuristic evaluation points:
+```
+Heuristic Evaluation (0-20 pts):
+  H1 System status:    /2  (loading states, feedback)
+  H2 Real world match: /2  (language, icons)
+  H3 User control:     /2  (undo, escape, back)
+  H4 Consistency:      /2  (styles, patterns)
+  H5 Error prevention: /2  (validation, confirmation)
+  H6 Recognition:      /2  (labels, navigation)
+  H7 Efficiency:       /2  (shortcuts, bulk ops)
+  H8 Minimal design:   /2  (hierarchy, whitespace)
+  H9 Error recovery:   /2  (messages, guidance)
+  H10 Help:            /2  (tooltips, onboarding)
+```
+This replaces the ad-hoc accessibility scoring with a structured UX evaluation.