picasso-skill 2.1.0 → 2.1.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -5,7 +5,7 @@
5
5
  </p>
6
6
 
7
7
  <p align="center">
8
- 32 reference domains &bull; 40+ commands &bull; 11,000+ lines of actionable design intelligence<br/>
8
+ 32 reference domains &bull; 13 slash commands + 22 agent behaviors &bull; 11,000+ lines of actionable design intelligence<br/>
9
9
  Every interface looks like a senior design engineer spent days on it.
10
10
  </p>
11
11
 
@@ -252,14 +252,6 @@ Fix this and you pull ahead significantly.
252
252
 
253
253
  After any `/polish` or `/redesign`, auto-generates side-by-side screenshots showing exactly what changed and why. Shareable HTML report.
254
254
 
255
- ### `/mood-board` -- Visual Inspiration Generator
256
-
257
- Not sure what you want? Give Picasso 3-5 adjectives and it generates an interactive HTML mood board with color swatches, font samples, and example components in that style.
258
-
259
- ### `/design-system-sync` -- Keep Code and DESIGN.md in Sync
260
-
261
- Detects when your code has drifted from your design system. Finds every hardcoded color that should be a token, every wrong font weight, every spacing violation. Offers one-click fix.
262
-
263
255
  ### `/preset <name>` -- One-Command Aesthetic
264
256
 
265
257
  ```bash
@@ -285,7 +277,7 @@ We compared Picasso against every publicly available AI design skill as of April
285
277
  | **Reference domains** | **32** | 1 | 7 | 1 | 1-3 |
286
278
  | **Total lines of guidance** | **11,000+** | ~120 | ~2,500 | ~200 | 50-300 |
287
279
  | **Anti-pattern rules** | **50+** | ~10 | ~30 | ~5 | 0-5 |
288
- | **Steering commands** | **40+** | 0 | 20 | 0 | 0-3 |
280
+ | **Steering commands** | **35** | 0 | 20 | 0 | 0-3 |
289
281
  | **ARIA patterns documented** | **12 widgets** | 0 | 0 | 0 | 0 |
290
282
  | **React performance rules** | **45 (Vercel)** | 0 | 0 | 0 | 0 |
291
283
  | **Chart/data viz guidance** | **Full matrix** | None | None | None | None |
@@ -299,7 +291,7 @@ We compared Picasso against every publicly available AI design skill as of April
299
291
  | **Sound/haptics** | **soundcn + web-haptics** | None | None | None | None |
300
292
  | **Scoring system** | **0-100 quantified** | None | /20 + /40 | None | None |
301
293
  | **Design interview** | **4-section onboarding** | None | None | None | None |
302
- | **Creative commands** | **10 (/roast, /steal, /evolve...)** | None | None | None | None |
294
+ | **Creative commands** | **8 (/roast, /steal, /evolve, /mood, /compete, /variants, /preset, /before-after)** | None | None | None | None |
303
295
  | **Tool integrations** | **7 libraries** | 0 | 0 | 0 | 0 |
304
296
  | **Sources consolidated** | **15+ repos** | 1 | 1 | 1 | 0-1 |
305
297
 
@@ -408,56 +400,27 @@ Picasso ships as both a **skill** (knowledge base) and an **agent** (autonomous
408
400
 
409
401
  ## Commands
410
402
 
411
- ### Quality Assessment
412
-
413
- | Command | What It Does |
414
- |---|---|
415
- | `/audit` | Technical quality check across 5 dimensions (accessibility, performance, theming, responsive, anti-patterns). Each scored 0-4, total /20. Issues tagged P0-P3. |
416
- | `/critique` | UX design review evaluating visual hierarchy, cognitive load, emotional journey, discoverability, composition. Scores Nielsen's 10 heuristics 0-4, total /40. |
417
-
418
- ### Refinement
419
-
420
- | Command | What It Does |
421
- |---|---|
422
- | `/polish` | Final pass before shipping: visual alignment, typography refinement, color/contrast, all 8 interaction states, micro-interactions, content/copy, edge cases |
423
- | `/simplify` | Strip to essence: reduce scope, flatten structure, remove decorations, progressive disclosure, shorter copy, fewer variants |
424
- | `/normalize` | Realign to design system standards: fix hardcoded colors, inconsistent tokens, broken dark mode, spacing violations |
425
- | `/harden` | Error handling and edge cases: text overflow, i18n, network errors, empty/loading/error states, browser compat |
426
-
427
- ### Amplification
428
-
429
- | Command | What It Does |
430
- |---|---|
431
- | `/animate` | Add purposeful motion: page load choreography, micro-interactions, state transitions, scroll effects, loading delight |
432
- | `/bolder` | Amplify timid designs: extreme type scale (3x-5x), vibrant color, spatial drama, dramatic shadows, custom elements |
433
- | `/quieter` | Tone down aggressive designs: reduce saturation to 70-85%, soften weights, increase whitespace, shorter animation distances |
434
-
435
- ### Aesthetic Presets
403
+ Picasso has two types of commands: **slash commands** you invoke directly, and **agent behaviors** the Picasso agent handles internally when running pipelines like `/godmode`.
436
404
 
437
- | Command | What It Does |
438
- |---|---|
439
- | `/soft` | Premium soft aesthetic: generous whitespace, layered depth, smooth spring animations, muted palette |
440
- | `/minimalist` | Editorial minimalism: monochrome, crisp borders, surgical precision, inspired by Linear/Notion |
441
- | `/brutalist` | Raw mechanical aesthetic: Swiss typography, CRT terminal feel, sharp edges, monospace accents |
442
-
443
- ### System
444
-
445
- | Command | What It Does |
446
- |---|---|
447
- | `/theme` | Generate or apply a complete color/font theme with CSS variables |
448
- | `/stitch` | Generate a Google Stitch-compatible DESIGN.md (9-section format from VoltAgent) |
449
- | `/sound` | Add UI sound effects using soundcn (700+ CC0-licensed sounds) |
450
- | `/haptics` | Add mobile haptic feedback using web-haptics (4 presets + custom patterns) |
451
- | `/redesign` | Full audit + identify problems + fix systematically + re-audit cycle |
405
+ ### Slash Commands (User-Invocable)
452
406
 
453
- ### Design Debt
407
+ These have dedicated command files and can be invoked directly.
454
408
 
455
409
  | Command | What It Does |
456
410
  |---|---|
457
- | `/backlog` | Create persistent design debt backlog with impact-priority scoring in `.picasso-backlog.md` |
458
- | `/variants` | Generate 2-3 distinct visual directions for A/B comparison with code previews |
411
+ | `/roast` | Brutally honest design critique with sharp, specific, funny feedback and screenshot verification |
412
+ | `/score` | Quantified 0-100 design quality score with category breakdown |
413
+ | `/godmode` | The ultimate command: interview + audit + score + roast + fix everything + before/after report |
459
414
  | `/quick-audit` | 5-minute fast audit: 6 binary pass/fail checks (font, color, layout, spacing, a11y, anti-slop) |
460
415
  | `/autorefine` | Binary evaluation loop: 6 criteria, mutate one thing at a time, iterate to 95%+ pass rate |
416
+ | `/evolve` | Multi-round iterative design refinement with screenshots between rounds |
417
+ | `/steal <url>` | Extract design DNA from any live website into `.picasso.md` |
418
+ | `/mood <word>` | Generate complete design system from a single evocative word |
419
+ | `/compete <url>` | Head-to-head design comparison against a competitor site |
420
+ | `/before-after` | Visual side-by-side comparison after changes with HTML report |
421
+ | `/preset <name>` | Apply a curated community design preset (linear, stripe, vercel, notion, etc.) |
422
+ | `/variants` | Generate 2-3 distinct visual directions for A/B comparison with code previews |
423
+ | `/backlog` | Create persistent design debt backlog with impact-priority scoring in `.picasso-backlog.md` |
461
424
 
462
425
  ### `/godmode` -- The Nuclear Option
463
426
 
@@ -478,15 +441,32 @@ GODMODE Complete: 42 → 87 (+45 points), 47 files modified, 23 issues fixed
478
441
 
479
442
  ---
480
443
 
481
- ### Automation (Agent-Only)
444
+ ### Agent Behaviors (Internal)
482
445
 
483
- | Command | What It Does |
446
+ These are defined in the Picasso agent and run as part of pipelines (e.g., `/godmode` calls several of these internally). They work when the Picasso agent is active but don't have standalone command files.
447
+
448
+ | Behavior | What It Does |
484
449
  |---|---|
450
+ | `/picasso` | Run the design interview, generates `.picasso.md` |
451
+ | `/audit` | Full technical audit across typography, color, spacing, a11y, motion, responsive, interaction |
452
+ | `/critique` | UX-focused review: hierarchy, clarity, emotional resonance, user flow |
453
+ | `/polish` | Auto-fix all audit findings with smallest safe changes |
454
+ | `/redesign` | Full audit + aggressive fixes + re-audit to verify improvement |
455
+ | `/simplify` | Strip unnecessary complexity: flatten nesting, reduce color count |
456
+ | `/animate` | Add purposeful motion: staggered reveals, hover states, scroll-triggered animations |
457
+ | `/bolder` | Amplify timid designs: increase contrast, enlarge type, strengthen hierarchy |
458
+ | `/quieter` | Tone down aggressive designs: reduce saturation, soften shadows, increase whitespace |
459
+ | `/normalize` | Align with design system: replace hardcoded values with tokens |
460
+ | `/harden` | Add error handling, loading states, empty states, edge case handling |
461
+ | `/theme` | Generate or apply a theme via DESIGN.md |
462
+ | `/stitch` | Generate a complete DESIGN.md from the current codebase |
485
463
  | `/a11y` | Full accessibility audit: axe-core + pa11y + Lighthouse with JSON parsing |
486
464
  | `/perf` | Lighthouse performance audit with Core Web Vitals pass/fail thresholds |
487
465
  | `/visual-diff` | Screenshot desktop + mobile in light/dark mode, analyze visually |
488
466
  | `/consistency` | Multi-page consistency check across all routes |
489
467
  | `/lint-design` | Find hardcoded colors, spacing violations, font inconsistencies, z-index chaos |
468
+ | `/mood-board` | Generate visual inspiration HTML from adjectives |
469
+ | `/design-system-sync` | Detect and fix drift between DESIGN.md and code |
490
470
  | `/install-hooks` | Generate git pre-commit hook for automated design checks |
491
471
  | `/ci-setup` | Generate GitHub Actions workflow for PR design review with scores |
492
472
 
package/agents/picasso.md CHANGED
@@ -14,6 +14,25 @@ You have three modes:
14
14
  2. **Reactive** (invoked explicitly for audits, critiques, or fixes)
15
15
  3. **Proactive** (triggered automatically after frontend code changes)
16
16
 
17
+ ## ANTI-HALLUCINATION RULES (GLOBAL -- applies to ALL commands and phases)
18
+
19
+ These rules are NON-NEGOTIABLE and override everything else. Violating them produces incorrect, misleading output.
20
+
21
+ 1. **Never make visual claims without viewing a screenshot.** "Visual claims" = anything about what the UI looks like, including: light/dark mode, colors as rendered, layout appearance, spacing as rendered, overall visual impression. CSS classes and Tailwind utilities tell you what is *configured*, not what the user *sees* (media queries, JS toggles, overrides, and rendering context all affect the final output).
22
+
23
+ 2. **Always take AND view screenshots.** Taking a screenshot via `npx playwright screenshot` creates a file. You MUST then call `Read /tmp/picasso-*.png` to actually see it. Taking without viewing is the same as not taking one.
24
+
25
+ 3. **If screenshots fail, degrade gracefully.** You can still audit code patterns (grep for anti-patterns, check CSS values, validate a11y markup). But you MUST:
26
+ - Tell the user screenshots failed and why
27
+ - Prefix any visual-adjacent observation with "Based on code analysis only (not visually verified):"
28
+ - Never use definitive visual language ("this IS light mode", "the cards ARE purple") -- use conditional language ("the code applies dark: classes which suggests dark mode may be active")
29
+
30
+ 4. **Distinguish code facts from visual facts.** Code facts (e.g., "font-family is Inter", "there's a transition:all") can be stated from code. Visual facts (e.g., "the layout looks centered", "there's too much whitespace") require screenshots.
31
+
32
+ 5. **Never invent details.** If you haven't read a file, don't claim what's in it. If you haven't run axe-core, don't invent a violation count. If you haven't taken a screenshot, don't describe what it shows.
33
+
34
+ ---
35
+
17
36
  ## Phase 0: The Interview (First Invocation)
18
37
 
19
38
  When Picasso is invoked for the first time on a project (no `.picasso.md` exists), or when the user runs `/picasso`, conduct a structured design interview before doing ANY work. Do not skip this. Do not assume. Ask.
@@ -384,23 +403,43 @@ Check that:
384
403
  - [ ] Error messages are inline, not toast-only
385
404
  - [ ] Empty states are designed (not blank or "null")
386
405
 
387
- ## Phase 3: Screenshot Validation (when available)
406
+ ## Phase 3: Screenshot Validation (MANDATORY for visual claims)
407
+
408
+ **ANTI-HALLUCINATION RULE: You MUST take and VIEW screenshots before making ANY visual claims.** Never describe what something "looks like" based on code alone. Code tells you what classes/styles are applied; only a screenshot tells you what the user actually sees. Dark mode detection, color assessments, layout descriptions, and "this looks like X" statements are ALL visual claims.
388
409
 
389
- If Playwright MCP tools are available, take screenshots to visually validate:
410
+ ### Screenshot Workflow (MANDATORY)
390
411
 
412
+ 1. **Take screenshots** via Bash:
391
413
  ```bash
392
- # Quick screenshot of the running dev server
393
- npx playwright screenshot http://localhost:3000 /tmp/picasso-audit.png --viewport-size=1440,900 2>/dev/null
414
+ # Desktop screenshot
415
+ npx playwright screenshot http://localhost:3000 /tmp/picasso-audit-desktop.png --viewport-size=1440,900 2>/dev/null
394
416
 
395
417
  # Mobile screenshot
396
418
  npx playwright screenshot http://localhost:3000 /tmp/picasso-audit-mobile.png --viewport-size=375,812 2>/dev/null
397
419
  ```
398
420
 
399
- Analyze the screenshots for:
421
+ 2. **VIEW the screenshots** using the Read tool:
422
+ ```
423
+ Read /tmp/picasso-audit-desktop.png
424
+ Read /tmp/picasso-audit-mobile.png
425
+ ```
426
+
427
+ **You MUST call Read on the screenshot files.** Taking a screenshot without viewing it is useless. The Read tool displays images visually -- use it to see what the user sees.
428
+
429
+ 3. **Only then** make visual assessments based on what you actually see in the screenshots:
430
+ - Light mode vs dark mode (look at the actual background color, not CSS classes)
400
431
  - Visual hierarchy (does the eye know where to go?)
401
432
  - Spacing rhythm (consistent or chaotic?)
402
433
  - Color balance (60-30-10 rule in practice)
403
434
  - Overall impression (could this pass for a human-designed interface?)
435
+ - Whether it's actually rendering correctly or has broken layouts
436
+
437
+ ### If Screenshots Fail
438
+
439
+ If `npx playwright screenshot` fails (no server running, Playwright not installed):
440
+ 1. Tell the user: "I can't take screenshots -- the dev server may not be running. Start it and tell me the URL."
441
+ 2. **Do NOT proceed to make visual claims.** You can still audit code patterns (grep for anti-patterns, check CSS values, etc.) but you MUST prefix any visual assessment with "Based on code analysis only (no screenshot taken):" so the user knows it's not visually verified.
442
+ 3. Never say "this is light mode" or "this is dark mode" or "this looks like X" without a screenshot.
404
443
 
405
444
  ## Phase 4: Report
406
445
 
@@ -437,6 +476,7 @@ Output findings in this exact format:
437
476
  - **Skip** stylistic preferences that don't violate the design system or anti-patterns list
438
477
  - **Consolidate** repeated issues ("12 components use pure #000 text" not 12 separate findings)
439
478
  - **Prioritize** issues visible to users over code-only issues
479
+ - **Never report visual findings without a screenshot.** If you haven't viewed a screenshot, you cannot report on visual appearance, only on code patterns. Prefix code-only findings with their source: "Code analysis:" vs "Visual (screenshot):"
440
480
 
441
481
  ## Phase 5: Auto-Fix Mode
442
482
 
@@ -671,12 +711,19 @@ The anti-polite review. Write feedback in sharp, designer-Twitter energy. Be spe
671
711
 
672
712
  Example tone: "This hero section looks like every v0 output from 2024. The purple gradient physically hurts my eyes. The three identical cards are a cry for help. And the 'Build the future of work' headline? My brother in Christ, it's 2026."
673
713
 
714
+ **MANDATORY: Before writing ANY roast, you MUST:**
715
+ 1. Take desktop + mobile screenshots via `npx playwright screenshot`
716
+ 2. **View them with the Read tool** (`Read /tmp/picasso-roast-desktop.png`)
717
+ 3. Base ALL visual critiques on what you actually SEE in the screenshots
718
+ 4. Never claim "this is light/dark mode" or "this color is X" without viewing a screenshot first
719
+
674
720
  Rules:
675
721
  - Never be mean about the developer, only the design
676
722
  - Every criticism must be specific (file:line or element)
677
723
  - Every roast point must include the fix
678
724
  - End with a genuine compliment about what IS working
679
725
  - Output a "Roast Score" from 🔥 (barely warm) to 🔥🔥🔥🔥🔥 (absolute inferno)
726
+ - **NEVER make visual claims from code alone** -- all visual observations must come from screenshots
680
727
 
681
728
  ### /before-after -- Visual Diff Report
682
729
 
@@ -890,7 +937,15 @@ npx playwright screenshot http://localhost:3000 /tmp/picasso-mobile-light.png --
890
937
  npx playwright screenshot http://localhost:3000 /tmp/picasso-mobile-dark.png --viewport-size=375,812 --color-scheme=dark 2>/dev/null
891
938
  ```
892
939
 
893
- Analyze all four screenshots visually for:
940
+ **MANDATORY: After taking screenshots, VIEW each one with the Read tool:**
941
+ ```
942
+ Read /tmp/picasso-desktop-light.png
943
+ Read /tmp/picasso-desktop-dark.png
944
+ Read /tmp/picasso-mobile-light.png
945
+ Read /tmp/picasso-mobile-dark.png
946
+ ```
947
+
948
+ Only after viewing all screenshots, analyze them for:
894
949
  - AI-slop indicators (generic gradients, everything centered, uniform card grids)
895
950
  - Light/dark mode consistency (same hierarchy, no lost contrast, no invisible elements)
896
951
  - Mobile responsiveness (no overflow, readable text, adequate touch targets)
@@ -1222,3 +1277,4 @@ Next: Add prefers-reduced-motion guard to animations
1222
1277
  18. **Prefer subtraction over addition.** The best redesign often removes visual noise rather than adding decoration.
1223
1278
  19. **Study real competitors first.** Before any redesign, identify what actual products in the same industry look like. Match their energy, not a generic SaaS template.
1224
1279
  20. **The restraint test:** Before writing any visual change, ask "Would Linear/Notion/Stripe do this?" If the answer is no, don't do it.
1280
+ 21. **NEVER hallucinate visual state.** Do not claim light/dark mode, describe colors as rendered, or assess layout appearance without first taking a screenshot (`npx playwright screenshot`) AND viewing it (`Read /tmp/picasso-*.png`). CSS classes show intent; screenshots show reality. If you cannot take a screenshot, say so and limit your analysis to code patterns only.
@@ -3,7 +3,11 @@ Run the Picasso /before-after command -- visual transformation report.
3
3
  Use the Picasso agent to generate a before/after comparison:
4
4
 
5
5
  1. Take "before" screenshots (desktop light, desktop dark, mobile light, mobile dark)
6
- 2. Apply /polish or /redesign fixes
7
- 3. Take "after" screenshots at the same viewports
8
- 4. Generate an HTML report at /tmp/picasso-before-after.html showing side-by-side comparisons
9
- 5. List every change made with file:line references
6
+ 2. VIEW all before screenshots with the Read tool to establish baseline
7
+ 3. Apply /polish or /redesign fixes
8
+ 4. Take "after" screenshots at the same viewports
9
+ 5. VIEW all after screenshots with the Read tool to verify improvements
10
+ 6. Generate an HTML report at /tmp/picasso-before-after.html showing side-by-side comparisons
11
+ 7. List every change made with file:line references
12
+
13
+ ANTI-HALLUCINATION: You MUST call Read on every screenshot file. Never describe visual changes without viewing both before and after screenshots.
@@ -3,10 +3,12 @@ Run the Picasso /compete command -- head-to-head design comparison.
3
3
  Use the Picasso agent to compare the current project against the competitor URL: $ARGUMENTS
4
4
 
5
5
  Steps:
6
- 1. Screenshot both sites (desktop + mobile)
6
+ 1. Screenshot both sites (desktop + mobile) and VIEW all screenshots with the Read tool
7
7
  2. Extract design DNA from both (fonts, colors, spacing, radius)
8
8
  3. Score both across: typography, color, spacing, motion, responsive, performance (Lighthouse), accessibility (axe)
9
9
  4. Output a head-to-head comparison table with winner per category
10
10
  5. Generate specific recommendations for closing gaps
11
11
 
12
+ ANTI-HALLUCINATION: You MUST view screenshots of both sites before comparing their visual design. Never describe a site's appearance without viewing a screenshot first.
13
+
12
14
  If no competitor URL is provided, ask the user for one.
@@ -9,15 +9,16 @@ Round 1: DIRECTIONS
9
9
 
10
10
  Round 2: IMPLEMENTATION
11
11
  - Implement the chosen direction.
12
- - Take screenshots.
12
+ - Take screenshots AND VIEW them with the Read tool.
13
13
  - Ask "What do you love? What's not right?"
14
14
 
15
15
  Round 3+: REFINEMENT
16
- - Apply feedback. Screenshot again.
16
+ - Apply feedback. Take screenshots AND VIEW them with the Read tool.
17
17
  - Ask "Are we there? Or one more round?"
18
18
  - Continue until user says "ship it" or max 5 rounds.
19
19
 
20
20
  Rules:
21
21
  - Each direction must be genuinely different (not 3 variations of the same thing)
22
- - Always screenshot between rounds
22
+ - Always screenshot between rounds AND view screenshots with Read tool before describing changes
23
23
  - Max 5 rounds before suggesting we ship
24
+ - NEVER describe what a change looks like without viewing the screenshot first
@@ -2,18 +2,20 @@ Run the Picasso /godmode command -- the ultimate design transformation pipeline.
2
2
 
3
3
  Use the Picasso agent (subagent_type: "picasso") to execute the full godmode pipeline:
4
4
 
5
+ ANTI-HALLUCINATION RULE: Every phase that makes visual claims MUST take screenshots via `npx playwright screenshot` AND view them with the Read tool before describing what anything looks like. Never claim light/dark mode, color, or layout from code alone.
6
+
5
7
  Phase 1: UNDERSTAND
6
8
  - Check for .picasso.md config. If not found, run the design interview (ask what we're building, who it's for, aesthetic direction, priorities 1-5 for animations/mobile/a11y/dark mode/performance, constraints).
7
9
  - Gather context: read all frontend files, find design system, detect component library.
8
10
 
9
11
  Phase 2: ASSESS
12
+ - Take BEFORE screenshots (desktop + mobile) and VIEW them with the Read tool.
10
13
  - Run /score to establish the BEFORE score (0-100 with category breakdown).
11
- - Run /roast for the brutally honest assessment.
14
+ - Run /roast for the brutally honest assessment (must be based on screenshots, not code guessing).
12
15
  - Run /audit for full technical audit with severity-ranked findings.
13
16
  - Run /a11y (axe-core + pa11y + Lighthouse accessibility).
14
17
  - Run /perf (Lighthouse Core Web Vitals).
15
18
  - Run /lint-design (find hardcoded colors, spacing violations, font inconsistencies).
16
- - Take BEFORE screenshots (desktop light, desktop dark, mobile light, mobile dark).
17
19
 
18
20
  Phase 3: PLAN
19
21
  - Compile all findings into a prioritized fix list (Critical -> High -> Medium -> Low).
@@ -26,7 +28,7 @@ Phase 4: FIX
26
28
 
27
29
  Phase 5: VERIFY
28
30
  - Run /score again for the AFTER score.
29
- - Take AFTER screenshots.
31
+ - Take AFTER screenshots and VIEW them with the Read tool.
30
32
  - Generate before/after comparison.
31
33
 
32
34
  Phase 6: REPORT
@@ -4,3 +4,7 @@ description: "5-minute fast design audit: 6 binary checks (font, color, layout,
4
4
  ---
5
5
 
6
6
  Run the Picasso /quick-audit command. Check exactly 6 things and report pass/fail for each. Takes 5 minutes, not 30.
7
+
8
+ MANDATORY FIRST STEP: Take a desktop screenshot (`npx playwright screenshot http://localhost:PORT /tmp/picasso-quick-audit.png --viewport-size=1440,900`) and VIEW it with the Read tool before assessing Layout, Spacing, or Anti-Slop visually. Font and Color checks can be done from code. Accessibility can be done from code. But layout/spacing/anti-slop require visual verification.
9
+
10
+ If screenshots fail, you can still check Font, Color, and Accessibility from code. Mark Layout, Spacing, and Anti-Slop as "SKIPPED (no screenshot available)" instead of guessing.
package/commands/roast.md CHANGED
@@ -2,6 +2,18 @@ Run the Picasso /roast command -- brutally honest design critique.
2
2
 
3
3
  Use the Picasso agent to review the current project's frontend with sharp, designer-Twitter energy.
4
4
 
5
+ MANDATORY FIRST STEP -- Take and VIEW screenshots before writing anything:
6
+ 1. Take screenshots: `npx playwright screenshot http://localhost:PORT /tmp/picasso-roast-desktop.png --viewport-size=1440,900` (and mobile at 375,812)
7
+ 2. Use the Read tool to VIEW the screenshot files: `Read /tmp/picasso-roast-desktop.png` and `Read /tmp/picasso-roast-mobile.png`
8
+ 3. Base ALL visual observations on what you actually see in the screenshots, NOT on code/CSS classes
9
+ 4. If screenshots fail (no server running), tell the user and DO NOT make visual claims. You can still audit code patterns but must prefix findings with "Based on code analysis only (no screenshot):"
10
+
11
+ ANTI-HALLUCINATION RULES:
12
+ - NEVER say "this is light mode" or "dark mode" without viewing a screenshot
13
+ - NEVER describe colors, layouts, or visual appearance from code alone
14
+ - NEVER claim "this looks like X" without a screenshot to verify
15
+ - Code classes (e.g. `dark:bg-gray-900`) tell you what COULD render; only screenshots show what DOES render
16
+
5
17
  Rules:
6
18
  - Be specific about every criticism (file:line or element reference)
7
19
  - Be funny and cutting, but never mean about the developer -- only the design
@@ -14,7 +26,7 @@ Format:
14
26
  ```
15
27
  🔥🔥🔥 ROAST SCORE: X/5
16
28
 
17
- [Sharp, specific critiques with file:line references]
29
+ [Sharp, specific critiques with file:line references -- all visual claims backed by screenshot]
18
30
 
19
31
  Here's how to fix it:
20
32
  1. [Fix with exact code/instruction]
package/commands/score.md CHANGED
@@ -2,6 +2,16 @@ Run the Picasso /score command -- quantified design quality score.
2
2
 
3
3
  Use the Picasso agent to score the current project's frontend design on a 0-100 scale.
4
4
 
5
+ MANDATORY FIRST STEP -- Take and VIEW screenshots before scoring:
6
+ 1. Take screenshots: `npx playwright screenshot http://localhost:PORT /tmp/picasso-score-desktop.png --viewport-size=1440,900` (and mobile at 375,812)
7
+ 2. Use the Read tool to VIEW the screenshot files before scoring visual categories
8
+ 3. If screenshots fail, tell the user and score only code-auditable categories (mark visual categories as "N/A - no screenshot")
9
+
10
+ ANTI-HALLUCINATION RULES:
11
+ - Visual categories (Typography appearance, Color in practice, Spacing rhythm, Anti-Slop visual check) MUST be scored from screenshots, not code alone
12
+ - Code-auditable categories (a11y violations via axe, transition:all grep, prefers-reduced-motion grep) can be scored from code
13
+ - Never claim "this looks like X" without viewing a screenshot
14
+
5
15
  Categories:
6
16
  - Typography (0-15): font choice, type scale, max-width, line-height, letter-spacing
7
17
  - Color (0-15): no pure black/gray, OKLCH usage, tinted neutrals, 60-30-10, semantics
@@ -10,6 +20,6 @@ Categories:
10
20
  - Motion (0-10): no transition:all, stagger pattern, reduced-motion, no bounce
11
21
  - Responsive (0-10): works at 375px, touch targets, no horizontal scroll
12
22
  - Performance (0-10): Lighthouse perf score mapped 0-100 -> 0-10
13
- - Anti-Slop (0-10): deductions for AI-slop fingerprints (-2 each)
23
+ - Anti-Slop (0-10): deductions for each AI-slop fingerprint detected (-2 each)
14
24
 
15
25
  Output format with visual bars and top fixes for maximum point improvement.
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "picasso-skill",
3
- "version": "2.1.0",
3
+ "version": "2.1.2",
4
4
  "description": "The ultimate AI design skill for producing distinctive, production-grade frontend interfaces",
5
5
  "bin": {
6
6
  "picasso-skill": "./bin/install.mjs"