npm - @arclabs561/ai-visual-test - Versions diffs - 0.5.1 → 0.7.4 - Mend

@arclabs561/ai-visual-test 0.5.1 → 0.7.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (74) hide show

package/CHANGELOG.md +127 -11
package/DEPLOYMENT.md +225 -9
package/README.md +71 -80
package/index.d.ts +902 -5
package/package.json +10 -51
package/src/batch-optimizer.mjs +39 -0
package/src/cache.mjs +241 -16
package/src/config.mjs +33 -91
package/src/constants.mjs +54 -0
package/src/convenience.mjs +113 -10
package/src/cost-optimization.mjs +1 -0
package/src/cost-tracker.mjs +134 -2
package/src/data-extractor.mjs +36 -7
package/src/dynamic-few-shot.mjs +69 -11
package/src/errors.mjs +6 -2
package/src/experience-propagation.mjs +12 -0
package/src/experience-tracer.mjs +12 -3
package/src/game-player.mjs +222 -43
package/src/graceful-shutdown.mjs +126 -0
package/src/helpers/playwright.mjs +22 -8
package/src/human-validation-manager.mjs +99 -2
package/src/index.mjs +48 -3
package/src/integrations/playwright.mjs +140 -0
package/src/judge.mjs +699 -24
package/src/load-env.mjs +2 -1
package/src/logger.mjs +31 -3
package/src/model-tier-selector.mjs +1 -221
package/src/natural-language-specs.mjs +31 -3
package/src/persona-enhanced.mjs +4 -2
package/src/persona-experience.mjs +1 -1
package/src/pricing.mjs +28 -0
package/src/prompt-composer.mjs +162 -5
package/src/provider-data.mjs +115 -0
package/src/render-change-detector.mjs +5 -0
package/src/research-enhanced-validation.mjs +7 -5
package/src/retry.mjs +21 -7
package/src/rubrics.mjs +4 -0
package/src/safe-logger.mjs +71 -0
package/src/session-cost-tracker.mjs +320 -0
package/src/smart-validator.mjs +8 -8
package/src/spec-templates.mjs +52 -6
package/src/startup-validation.mjs +127 -0
package/src/temporal-adaptive.mjs +2 -2
package/src/temporal-decision-manager.mjs +1 -271
package/src/temporal-logic.mjs +104 -0
package/src/temporal-note-pruner.mjs +119 -0
package/src/temporal-preprocessor.mjs +1 -543
package/src/temporal.mjs +681 -79
package/src/utils/action-hallucination-detector.mjs +301 -0
package/src/utils/baseline-validator.mjs +82 -0
package/src/utils/cache-stats.mjs +104 -0
package/src/utils/cached-llm.mjs +164 -0
package/src/utils/capability-stratifier.mjs +108 -0
package/src/utils/counterfactual-tester.mjs +83 -0
package/src/utils/error-recovery.mjs +117 -0
package/src/utils/explainability-scorer.mjs +119 -0
package/src/utils/exploratory-automation.mjs +131 -0
package/src/utils/index.mjs +10 -0
package/src/utils/intent-recognizer.mjs +201 -0
package/src/utils/log-sanitizer.mjs +165 -0
package/src/utils/path-validator.mjs +88 -0
package/src/utils/performance-logger.mjs +316 -0
package/src/utils/performance-measurement.mjs +280 -0
package/src/utils/prompt-sanitizer.mjs +213 -0
package/src/utils/rate-limiter.mjs +144 -0
package/src/validation-framework.mjs +24 -20
package/src/validation-result-normalizer.mjs +35 -1
package/src/validation.mjs +75 -25
package/src/validators/accessibility-validator.mjs +144 -0
package/src/validators/hybrid-validator.mjs +48 -4
package/api/health.js +0 -34
package/api/validate.js +0 -252
package/public/index.html +0 -149
package/vercel.json +0 -27

package/CHANGELOG.md CHANGED Viewed

@@ -1,24 +1,140 @@
 # Changelog
-All notable changes to @arclabs561/ai-visual-test will be documented in this file.
+All notable changes to ai-visual-test will be documented in this file.
-## [0.5.1] - 2025-11-14
+## [0.7.4] - 2026-03-03
+### Added
+- **Structured result fields at top level** - `result.richIssues`, `result.recommendations`, `result.strengths` promoted from `result.semantic` to the top-level result object, eliminating the need to reach into `result.semantic` for structured output.
+  - `richIssues`: array of `{ description, importance, annoyance, impact, evidence, suggestion }` objects
+  - `recommendations`: array of `{ priority, suggestion, expectedImpact }` objects
+  - `strengths`: array of strings describing what works well
+- **TypeScript types** for `RichIssue`, `Recommendation`; updated `SemanticInfo` and `ValidationResult` interfaces.
+### Fixed
+- `result.issues` (flat strings) is preserved for backward compatibility; `result.richIssues` adds the structured version alongside it.
+## [0.7.3] - 2026-03-02
+### Added
+- **Visual anchors** - domain-level grounding cues (text + image) injected into VLM prompts. Supports `AnchorEntry` union type: plain strings, dimension-scoped text, image references, or combinations. Config-level anchors merge with per-call `context.anchors`.
+- **Dimension-scoped anchors** - tag anchors with rubric dimension names for targeted evaluation.
+- **Image anchor resolution** - file paths, data URIs, and raw base64 supported for reference screenshots.
+### Fixed
+- Prompt composer: proper `\n\n` separation between anchor section and base prompt.
+- Judge: always warn on missing anchor images (not just verbose mode).
+- Build script: strip `scripts` and `devDependencies` from dist `package.json`.
+- Publish workflow: run only unit tests in CI; audit prod deps only.
+## [0.6.0] - 2025-01-17
 ### Changed
-- **Package renamed to scoped** - Now published as `@arclabs561/ai-visual-test` for consistency with other @arclabs561 packages
-- **Breaking change**: Update imports from `ai-visual-test` to `@arclabs561/ai-visual-test`
+- **Selective Obfuscation** - Core algorithms obfuscated while maintaining debuggability
+  - Obfuscates only Tier 1 files (temporal decision, cost optimization, activity preprocessing)
+  - Keeps API surface, validators, utilities, and cache system readable
+  - Transparent about obfuscation strategy in README
+  - TypeScript definitions enhanced with comprehensive JSDoc (survives obfuscation)
+- **Documentation Strategy** - Minimal, self-contained documentation in package
+  - `API_QUICK_REFERENCE.md` - Essential API patterns (in package)
+  - `EXAMPLES.md` - Working code examples (in package)
+  - Enhanced TypeScript definitions with examples and usage patterns
+  - README updated with obfuscation transparency section
+  - All documentation self-contained (no external hosting, GitHub is private)
+### Security
+- **Path Traversal Prevention** - Added comprehensive path validation to prevent directory traversal attacks
+  - `src/utils/path-validator.mjs` - Centralized path validation utilities
+  - All image paths validated before file operations
+  - Absolute paths properly resolved and validated
+- **Prompt Injection Protection** - Protection against prompt injection attacks
+  - `src/utils/prompt-sanitizer.mjs` - Prompt sanitization and security validation
+  - Strict mode validation (default) or sanitization mode
+  - Detects and prevents malicious prompt patterns
+- **Image Format Validation** - Magic byte validation to prevent MIME type spoofing
+  - Validates PNG, JPEG, GIF, WebP formats using file signatures
+  - Prevents malicious file uploads disguised as images
+- **Library-Level Rate Limiting** - Configurable request and cost-based rate limiting
+  - `src/utils/rate-limiter.mjs` - Request and cost-based rate limiting
+  - Prevents API abuse and cost overruns
+  - Configurable limits per window
+- **Log Sanitization** - All logged output sanitized to prevent information leakage
+  - `src/utils/log-sanitizer.mjs` - Utilities for sanitizing sensitive data
+  - Error messages use basename for file paths
+  - Sensitive data removed from logs
+- **Input Validation** - Comprehensive input validation
+  - Prompt length limits (10k characters max)
+  - File path validation for all file operations
+  - Error message sanitization
+### Changed
+- **Repository Privacy** - GitHub repository made private
+  - Source code, history, and internal documentation no longer publicly accessible
+- **Selective Obfuscation** - Protects proprietary algorithms while maintaining usability
+  - Obfuscates: `temporal-decision-manager.mjs`, `cost-optimization.mjs`, `model-tier-selector.mjs`, `temporal-preprocessor.mjs`
+  - Readable: API surface, validators, utilities, cache system, error handling
+  - Build script shows which files are obfuscated (🔒) vs readable (📄)
+  - Transparent documentation about obfuscation strategy
+- **Package Cleanup** - Removed deployment-specific files from npm package
+  - Removed `vercel.json`, `api/**/*.js`, `public/**/*.html` from package
+  - Package now contains only library code (115 files)
+  - Cleaner, library-only distribution
+### Added
+- **Security Utilities**
+  - `src/utils/path-validator.mjs` - Path validation and traversal prevention
+  - `src/utils/prompt-sanitizer.mjs` - Prompt injection protection
+  - `src/utils/rate-limiter.mjs` - Library-level rate limiting
+  - `src/utils/log-sanitizer.mjs` - Log sanitization utilities
+- **Build System**
+  - `scripts/build-obfuscated.mjs` - Obfuscation build script
+  - `scripts/cleanup-root-docs.mjs` - Repository cleanup automation
+  - `npm run build` - Build obfuscated package
+  - `npm run build:skip-obfuscation` - Build without obfuscation (testing)
+- **Documentation**
+  - `API_QUICK_REFERENCE.md` - Essential API patterns (in package)
+  - `EXAMPLES.md` - Working code examples (in package)
+  - Enhanced TypeScript definitions with comprehensive JSDoc comments
+  - `docs/OBFUSCATION_STRATEGY.md` - Complete obfuscation strategy
+  - `docs/OBFUSCATION_IMPLEMENTATION.md` - Implementation details
+### Improved
+- **Error Handling** - Enhanced error messages with sanitization
+  - File paths use basename in error messages
+  - No sensitive information in error output
+  - Better error categorization
+- **Secret Detection** - Improved false positive handling
+  - Added patterns for common code constructs
+  - Excluded script from self-checking
+  - Better detection of actual secrets vs. code patterns
+### Fixed
+- **Test Failures** - Fixed ExploratoryStrategy test (shared state issue)
+- **Build Script** - Fixed obfuscator detection logic
+- **Package Paths** - Fixed package.json paths for dist/ directory
+### Repository
+- **Cleanup** - Archived 14 temporary documentation files
+- **Organization** - Root directory reduced from ~20+ to 7 essential files
+- **Gitignore** - Updated to exclude temporary files and deployment configs
+### Security Rating
+- Improved from **LOW-MEDIUM** to **8.5/10**
+- All critical vulnerabilities addressed
+- Production-ready security posture
 ## [0.5.0] - 2025-11-13
 ### Added
 - **API Sub-Modules** - Organized API into logical sub-modules for better tree-shaking
-  - `@arclabs561/ai-visual-test/validators` - All validation functionality
-  - `@arclabs561/ai-visual-test/temporal` - Temporal aggregation and decision-making
-  - `@arclabs561/ai-visual-test/multi-modal` - Multi-modal validation features
-  - `@arclabs561/ai-visual-test/ensemble` - Ensemble judging and bias detection
-  - `@arclabs561/ai-visual-test/persona` - Persona-based testing
-  - `@arclabs561/ai-visual-test/specs` - Natural language specifications
-  - `@arclabs561/ai-visual-test/utils` - Utility functions and infrastructure
+  - `ai-visual-test/validators` - All validation functionality
+  - `ai-visual-test/temporal` - Temporal aggregation and decision-making
+  - `ai-visual-test/multi-modal` - Multi-modal validation features
+  - `ai-visual-test/ensemble` - Ensemble judging and bias detection
+  - `ai-visual-test/persona` - Persona-based testing
+  - `ai-visual-test/specs` - Natural language specifications
+  - `ai-visual-test/utils` - Utility functions and infrastructure
+  - Main export (`ai-visual-test`) still works for backward compatibility
 - **Smart Validators** - Automatically select the best validator type based on available context
   - `validateSmart()` - Universal smart validator that auto-selects best method
   - `validateAccessibilitySmart()` - Smart accessibility validation (programmatic/VLLM/hybrid)

package/DEPLOYMENT.md CHANGED Viewed

@@ -1,5 +1,14 @@
 # Deployment Guide
+## Overview
+This guide covers deploying `@arclabs561/ai-visual-test` in production environments, including:
+- Vercel serverless deployment
+- Docker containerization
+- Health checks and monitoring
+- Graceful shutdown
+- Environment variable validation
 ## Vercel Deployment
 ### Quick Deploy
@@ -15,22 +24,95 @@ vercel
 ### Environment Variables
-Set these in Vercel dashboard:
+**Required** (at least one API key):
+- `GEMINI_API_KEY` - For Gemini provider
+- `OPENAI_API_KEY` - For OpenAI provider
+- `ANTHROPIC_API_KEY` - For Claude/Anthropic provider
+- `GROQ_API_KEY` - For Groq provider (high-frequency decisions)
+**Optional**:
+- `VLM_PROVIDER` - Provider to use (auto-detected if not set): `gemini`, `openai`, `claude`, `groq`
+- `VLM_MODEL` - Explicit model override
+- `VLM_MODEL_TIER` - Model tier: `fast`, `balanced`, `best`
+- `API_KEY` or `VLLM_API_KEY` - For API endpoint authentication
+- `REQUIRE_AUTH` - Set to `true` to enforce authentication (default: `true` if API_KEY is set)
+- `RATE_LIMIT_MAX_REQUESTS` - Max requests per minute (default: 10)
+- `DISABLE_LLM_CACHE` - Set to `true` to disable caching globally
+### Startup Validation
-- `GEMINI_API_KEY` (or `OPENAI_API_KEY`, `ANTHROPIC_API_KEY`)
-- `VLM_PROVIDER` (optional)
-- `API_KEY` or `VLLM_API_KEY` (optional, for API authentication)
-- `REQUIRE_AUTH` (optional, set to `true` to enforce authentication)
-- `RATE_LIMIT_MAX_REQUESTS` (optional, default: 10 requests per minute)
+The library automatically validates configuration at startup. If required environment variables are missing, you'll get clear error messages:
+```javascript
+import { validateStartup } from '@arclabs561/ai-visual-test';
+// Strict validation (throws on missing vars)
+try {
+  validateStartup();
+  console.log('✅ Configuration valid');
+} catch (error) {
+  console.error('❌ Configuration invalid:', error.message);
+  // Error includes actionable guidance:
+  // "Missing required environment variables for provider 'gemini': GEMINI_API_KEY"
+}
+// Soft validation (returns warnings)
+const result = validateStartupSoft();
+if (!result.valid) {
+  console.warn('⚠️  Configuration warnings:', result.warnings);
+}
+```
 ### API Endpoints
 After deployment, you'll have:
-- `https://your-site.vercel.app/api/validate` - Validation endpoint
-- `https://your-site.vercel.app/api/health` - Health check
+- `https://your-site.vercel.app/api/validate` - Validation endpoint (POST)
+- `https://your-site.vercel.app/api/health` - Health check (GET)
 - `https://your-site.vercel.app/` - Web interface
+#### Health Check Endpoint
+The health check endpoint provides comprehensive status:
+```bash
+curl https://your-site.vercel.app/api/health
+```
+**Response**:
+```json
+{
+  "status": "healthy",
+  "timestamp": "2025-01-17T12:00:00.000Z",
+  "version": "0.5.5",
+  "config": {
+    "enabled": true,
+    "provider": "gemini",
+    "hasApiKey": true
+  },
+  "validation": {
+    "valid": true,
+    "warnings": []
+  },
+  "cache": {
+    "enabled": true,
+    "hits": 1234,
+    "misses": 567,
+    "hitRate": 0.685
+  }
+}
+```
+**Status Codes**:
+- `200` - Healthy (all checks pass)
+- `503` - Degraded (configuration issues, but service may still work)
+- `500` - Error (health check itself failed)
+Use this endpoint for:
+- Load balancer health checks
+- Monitoring and alerting
+- Deployment verification
 ### Usage
 ```javascript
@@ -66,6 +148,123 @@ const remaining = response.headers.get('X-RateLimit-Remaining');
 const resetAt = response.headers.get('X-RateLimit-Reset');
 ```
+## Docker Deployment
+### Dockerfile Example
+```dockerfile
+FROM node:18-alpine
+WORKDIR /app
+# Copy package files
+COPY package*.json ./
+RUN npm ci --only=production
+# Copy source code
+COPY src ./src
+COPY api ./api
+# Set environment
+ENV NODE_ENV=production
+# Expose port (if running as server)
+EXPOSE 3000
+# Health check
+HEALTHCHECK --interval=30s --timeout=3s --start-period=5s --retries=3 \
+  CMD node -e "require('http').get('http://localhost:3000/api/health', (r) => { process.exit(r.statusCode === 200 ? 0 : 1) })"
+# Start application
+CMD ["node", "api/server.js"]
+```
+### Docker Compose Example
+```yaml
+version: '3.8'
+services:
+  ai-visual-test:
+    build: .
+    ports:
+      - "3000:3000"
+    environment:
+      - GEMINI_API_KEY=${GEMINI_API_KEY}
+      - VLM_PROVIDER=gemini
+      - NODE_ENV=production
+    healthcheck:
+      test: ["CMD", "curl", "-f", "http://localhost:3000/api/health"]
+      interval: 30s
+      timeout: 10s
+      retries: 3
+      start_period: 40s
+```
+## Graceful Shutdown
+The library includes graceful shutdown handling for long-running processes:
+```javascript
+import { initGracefulShutdown, registerShutdownHandler } from '@arclabs561/ai-visual-test';
+// Initialize (automatically done in library, but can be customized)
+initGracefulShutdown({ timeout: 30000 }); // 30 second timeout
+// Register custom shutdown handlers
+registerShutdownHandler(async () => {
+  // Clean up your resources
+  await closeDatabase();
+  await flushLogs();
+}, 10); // Priority (higher = called first)
+```
+**Features**:
+- Handles `SIGTERM` and `SIGINT` signals
+- Executes shutdown handlers in priority order
+- Flushes caches and cleans up resources
+- Timeout protection (default: 30s)
+- Handles uncaught exceptions
+## Monitoring and Observability
+### Health Checks
+Monitor the `/api/health` endpoint:
+- **Interval**: Check every 30-60 seconds
+- **Timeout**: 3-5 seconds
+- **Alert on**: Status `503` (degraded) or `500` (error)
+### Metrics to Monitor
+1. **Health Check Status**
+   - `status: "healthy"` vs `"degraded"` vs `"error"`
+   - Validation warnings
+2. **Cache Performance**
+   - Hit rate (should be >50% in production)
+   - Cache size
+3. **API Performance**
+   - Response times (via performance logger)
+   - Error rates
+   - Cost tracking
+### Logging
+The library includes comprehensive logging:
+- API call performance (latency, retries, costs)
+- Cache operations (hits, misses, evictions)
+- Temporal decisions (when prompts trigger/skip)
+- Error patterns
+Enable debug logging:
+```javascript
+import { setDebugEnabled } from '@arclabs561/ai-visual-test';
+setDebugEnabled(true);
+```
 ## Local Development
 ```bash
@@ -76,5 +275,22 @@ npm install
 npm test
 # Use as library
-import { validateScreenshot } from '@ai-visual-test/core';
+import { validateScreenshot } from '@arclabs561/ai-visual-test';
+# Validate startup configuration
+import { validateStartup } from '@arclabs561/ai-visual-test';
+validateStartup(); // Throws if configuration invalid
 ```
+## Production Checklist
+- [ ] Set required API keys in environment
+- [ ] Configure `VLM_PROVIDER` if using specific provider
+- [ ] Set `API_KEY` for endpoint authentication (if exposing API)
+- [ ] Configure `RATE_LIMIT_MAX_REQUESTS` based on expected load
+- [ ] Set up health check monitoring
+- [ ] Configure logging aggregation
+- [ ] Set up cost tracking and alerts
+- [ ] Test graceful shutdown
+- [ ] Verify cache directory permissions (if using file cache)
+- [ ] Review security settings (`REQUIRE_AUTH`, rate limits)

package/README.md CHANGED Viewed

@@ -1,18 +1,20 @@
-# @arclabs561/ai-visual-test
+# ai-visual-test
-AI-powered visual testing. Uses vision language models to understand screenshots instead of pixel-diffing.
+Visual testing framework using Vision Language Models. Validates screenshots, checks accessibility, and can play games.
-## Why
+## Why This Package
-Pixel-based testing breaks when content changes or layouts shift. This tool asks "does this look correct?" instead of "did pixels change?"
+Pixel-based testing breaks when content changes. This tool asks "does this look correct?" instead of "did pixels change?"
-## Install
+## Installation
 ```bash
 npm install @arclabs561/ai-visual-test
 ```
-Set an API key:
+## Configuration
+Set an API key in a `.env` file:
 ```bash
 # .env file
@@ -23,7 +25,26 @@ OPENAI_API_KEY=your-key-here
 ANTHROPIC_API_KEY=your-key-here
 ```
-## Use
+## Quick Start
+### With Playwright
+```javascript
+import { validatePage } from '@arclabs561/ai-visual-test';
+import { chromium } from 'playwright';
+const browser = await chromium.launch();
+const page = await browser.newPage();
+await page.goto('https://example.com');
+// validatePage() handles screenshotting
+const result = await validatePage(page, 'Check for visual bugs and accessibility issues');
+console.log(result.score);  // 7 (0-10 scale)
+console.log(result.issues); // ['Missing error messages', 'Low contrast']
+```
+### With Screenshot Path
 ```javascript
 import { validateScreenshot } from '@arclabs561/ai-visual-test';
@@ -33,109 +54,79 @@ const result = await validateScreenshot(
   'Check if this payment form is accessible and usable'
 );
-console.log(result.score); // 0-10
+console.log(result.score);  // 7 (0-10 scale)
 console.log(result.issues); // ['Missing error messages', 'Low contrast']
 ```
-## What it's good for
-- **Accessibility** - Fast programmatic checks or VLLM semantic evaluation
-- **Design principles** - Validates brutalist, minimal, or other styles
-- **Temporal testing** - Analyzes animations and gameplay over time
-- **State validation** - Fast programmatic or VLLM extraction
-- **Game testing** - Validate gameplay with variable goals
-- **Natural language specs** - Write tests in plain English
+## Key Features
-## What it's not good for
-- Pixel-perfect layout testing (use pixel-diffing tools)
-- Exact color matching (use design tools)
-- Performance testing (use Lighthouse)
-- Unit testing (use Jest/Vitest)
-## API
-### Core
+### 1. Hybrid Validation
+Combines deterministic code checks (contrast ratios, aria-labels) with AI visual judgment.
 ```javascript
-import { validateScreenshot, createConfig } from '@arclabs561/ai-visual-test';
+import { validateAccessibilityHybrid } from '@arclabs561/ai-visual-test/validators';
+// Checks code AND pixels
+const result = await validateAccessibilityHybrid(page, 'shot.png');
+```
-// Configure (optional - auto-detects from env)
-const config = createConfig({
-  provider: 'gemini',
-  apiKey: process.env.GEMINI_API_KEY
-});
+### 2. AI Game Agent
+Plays Canvas/WebGL games by analyzing screenshots and planning actions. Includes Reflexion (learning from mistakes) and Chain of Thought.
-// Validate
-const result = await validateScreenshot(
-  'screenshot.png',
-  'Evaluate this screenshot',
-  { testType: 'payment-screen' }
-);
+```javascript
+import { playGame } from '@arclabs561/ai-visual-test';
+await playGame(page, { goal: 'Win the level', maxSteps: 50 });
 ```
-### Sub-modules (better tree-shaking)
+### 3. Cost Optimization
+Caching, model tiering, and provider selection. See `test/performance/optimization-claims-validation.test.mjs` for validation.
-```javascript
-// Validators
-import { StateValidator } from '@arclabs561/ai-visual-test/validators';
+## Documentation
-// Temporal
-import { aggregateTemporalNotes } from '@arclabs561/ai-visual-test/temporal';
+- [**EXAMPLES.md**](./EXAMPLES.md) - Code snippets for Game Playing, Hybrid Validation, Playwright integration.
+- [**API_QUICK_REFERENCE.md**](./API_QUICK_REFERENCE.md) - Function signatures and options.
+- [**examples/**](./examples/) - Runnable examples.
+- **TypeScript**: Type definitions included.
-// Multi-modal
-import { multiModalValidation } from '@arclabs561/ai-visual-test/multi-modal';
+## Playwright Integration
-// Ensemble
-import { EnsembleJudge } from '@arclabs561/ai-visual-test/ensemble';
+Custom matchers for Playwright tests. **Requires `@playwright/test` to be installed** (already in devDependencies for this project).
-// Persona
-import { experiencePageAsPersona } from '@arclabs561/ai-visual-test/persona';
+### Setup
-// Specs
-import { parseSpec } from '@arclabs561/ai-visual-test/specs';
+```javascript
+import { expect } from '@playwright/test';
+import { createMatchers } from '@arclabs561/ai-visual-test/playwright';
-// Utils
-import { getCacheStats } from '@arclabs561/ai-visual-test/utils';
+// Extend expect with custom matchers (call once in your test setup)
+createMatchers(expect);
 ```
-### With Playwright
+### Usage in Tests
 ```javascript
-import { test } from '@playwright/test';
-import { validateScreenshot } from '@arclabs561/ai-visual-test';
-test('payment screen', async ({ page }) => {
-  await page.goto('https://example.com/checkout');
-  await page.screenshot({ path: 'checkout.png' });
+test('visual quality', async ({ page }) => {
+  await page.goto('https://example.com');
-  const result = await validateScreenshot(
-    'checkout.png',
-    'Check if payment form is accessible'
-  );
+  // Visual quality check
+  await expect(page).toHaveVisualScore(7, 'Check visual quality');
-  assert(result.score >= 8, 'Payment form should score at least 8');
+  // Hybrid accessibility check (programmatic + AI)
+  await expect(page).toBeAccessibleHybrid(4.5);
 });
 ```
-## Features
-- **Multi-provider** - Gemini, OpenAI, Claude
-- **Cost-effective** - Auto-selects cheapest provider, includes caching
-- **Multi-modal** - Screenshots + rendered code + context
-- **Temporal** - Time-series validation for animations
-- **Multi-perspective** - Multiple personas evaluate same state
-- **Zero dependencies** - Pure ES Modules
+### Installation
-## Examples
+For development in this project, Playwright is already installed. For use in other projects:
-See `examples/` directory for complete examples.
+```bash
+npm install --save-dev @playwright/test
+npx playwright install chromium
+```
-## Documentation
+See `examples/playwright-setup.mjs` for setup example.
-- `docs/API_SUBMODULES.md` - Sub-module usage
-- `docs/API_SURFACE_ORGANIZATION.md` - API organization
-- `CHANGELOG.md` - Version history
+Documentation: [docs/PLAYWRIGHT_INTEGRATION.md](./docs/PLAYWRIGHT_INTEGRATION.md)
 ## License