npm - sentienceapi - Versions diffs - 0.90.1 - Mend

sentienceapi 0.90.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (107) hide show

package/LICENSE.md +43 -0
package/README.md +946 -0
package/dist/actions.d.ts +54 -0
package/dist/actions.d.ts.map +1 -0
package/dist/actions.js +349 -0
package/dist/actions.js.map +1 -0
package/dist/agent.d.ts +157 -0
package/dist/agent.d.ts.map +1 -0
package/dist/agent.js +437 -0
package/dist/agent.js.map +1 -0
package/dist/browser.d.ts +46 -0
package/dist/browser.d.ts.map +1 -0
package/dist/browser.js +622 -0
package/dist/browser.js.map +1 -0
package/dist/cli.d.ts +5 -0
package/dist/cli.d.ts.map +1 -0
package/dist/cli.js +174 -0
package/dist/cli.js.map +1 -0
package/dist/conversational-agent.d.ts +123 -0
package/dist/conversational-agent.d.ts.map +1 -0
package/dist/conversational-agent.js +327 -0
package/dist/conversational-agent.js.map +1 -0
package/dist/expect.d.ts +16 -0
package/dist/expect.d.ts.map +1 -0
package/dist/expect.js +66 -0
package/dist/expect.js.map +1 -0
package/dist/generator.d.ts +16 -0
package/dist/generator.d.ts.map +1 -0
package/dist/generator.js +205 -0
package/dist/generator.js.map +1 -0
package/dist/index.d.ts +21 -0
package/dist/index.d.ts.map +1 -0
package/dist/index.js +70 -0
package/dist/index.js.map +1 -0
package/dist/inspector.d.ts +13 -0
package/dist/inspector.d.ts.map +1 -0
package/dist/inspector.js +147 -0
package/dist/inspector.js.map +1 -0
package/dist/llm-provider.d.ts +60 -0
package/dist/llm-provider.d.ts.map +1 -0
package/dist/llm-provider.js +106 -0
package/dist/llm-provider.js.map +1 -0
package/dist/query.d.ts +8 -0
package/dist/query.d.ts.map +1 -0
package/dist/query.js +337 -0
package/dist/query.js.map +1 -0
package/dist/read.d.ts +40 -0
package/dist/read.d.ts.map +1 -0
package/dist/read.js +86 -0
package/dist/read.js.map +1 -0
package/dist/recorder.d.ts +44 -0
package/dist/recorder.d.ts.map +1 -0
package/dist/recorder.js +256 -0
package/dist/recorder.js.map +1 -0
package/dist/screenshot.d.ts +17 -0
package/dist/screenshot.d.ts.map +1 -0
package/dist/screenshot.js +37 -0
package/dist/screenshot.js.map +1 -0
package/dist/snapshot.d.ts +23 -0
package/dist/snapshot.d.ts.map +1 -0
package/dist/snapshot.js +187 -0
package/dist/snapshot.js.map +1 -0
package/dist/tracing/cloud-sink.d.ts +74 -0
package/dist/tracing/cloud-sink.d.ts.map +1 -0
package/dist/tracing/cloud-sink.js +262 -0
package/dist/tracing/cloud-sink.js.map +1 -0
package/dist/tracing/index.d.ts +12 -0
package/dist/tracing/index.d.ts.map +1 -0
package/dist/tracing/index.js +28 -0
package/dist/tracing/index.js.map +1 -0
package/dist/tracing/jsonl-sink.d.ts +41 -0
package/dist/tracing/jsonl-sink.d.ts.map +1 -0
package/dist/tracing/jsonl-sink.js +168 -0
package/dist/tracing/jsonl-sink.js.map +1 -0
package/dist/tracing/sink.d.ts +24 -0
package/dist/tracing/sink.d.ts.map +1 -0
package/dist/tracing/sink.js +15 -0
package/dist/tracing/sink.js.map +1 -0
package/dist/tracing/tracer-factory.d.ts +57 -0
package/dist/tracing/tracer-factory.d.ts.map +1 -0
package/dist/tracing/tracer-factory.js +274 -0
package/dist/tracing/tracer-factory.js.map +1 -0
package/dist/tracing/tracer.d.ts +74 -0
package/dist/tracing/tracer.d.ts.map +1 -0
package/dist/tracing/tracer.js +131 -0
package/dist/tracing/tracer.js.map +1 -0
package/dist/tracing/types.d.ts +63 -0
package/dist/tracing/types.d.ts.map +1 -0
package/dist/tracing/types.js +8 -0
package/dist/tracing/types.js.map +1 -0
package/dist/types.d.ts +110 -0
package/dist/types.d.ts.map +1 -0
package/dist/types.js +6 -0
package/dist/types.js.map +1 -0
package/dist/utils.d.ts +29 -0
package/dist/utils.d.ts.map +1 -0
package/dist/utils.js +74 -0
package/dist/utils.js.map +1 -0
package/dist/wait.d.ts +20 -0
package/dist/wait.d.ts.map +1 -0
package/dist/wait.js +63 -0
package/dist/wait.js.map +1 -0
package/package.json +72 -0
package/spec/README.md +72 -0
package/spec/SNAPSHOT_V1.md +208 -0
package/spec/sdk-types.md +259 -0
package/spec/snapshot.schema.json +148 -0

package/README.md ADDED Viewed

@@ -0,0 +1,946 @@
+# Sentience TypeScript SDK
+The SDK is open under ELv2; the core semantic geometry and reliability logic runs in Sentience-hosted services.
+## Installation
+```bash
+# Install from npm
+npm install sentienceapi
+# Install Playwright browsers (required)
+npx playwright install chromium
+```
+**For local development:**
+```bash
+npm install
+npm run build
+```
+## Quick Start: Choose Your Abstraction Level
+Sentience SDK offers **4 levels of abstraction** - choose based on your needs:
+### 💬 Level 4: Conversational Agent (Highest Abstraction) - **NEW in v0.3.0**
+Complete automation with natural conversation. Just describe what you want, and the agent plans and executes everything:
+```typescript
+import { SentienceBrowser, ConversationalAgent, OpenAIProvider } from 'sentienceapi';
+const browser = await SentienceBrowser.create({ apiKey: process.env.SENTIENCE_API_KEY });
+const llm = new OpenAIProvider(process.env.OPENAI_API_KEY!, 'gpt-4o');
+const agent = new ConversationalAgent({ llmProvider: llm, browser });
+// Navigate to starting page
+await browser.getPage().goto('https://amazon.com');
+// ONE command does it all - automatic planning and execution!
+const response = await agent.execute(
+  "Search for 'wireless mouse' and tell me the price of the top result"
+);
+console.log(response); // "I found the top result for wireless mouse on Amazon. It's priced at $24.99..."
+// Follow-up questions maintain context
+const followUp = await agent.chat("Add it to cart");
+console.log(followUp);
+await browser.close();
+```
+**When to use:** Complex multi-step tasks, conversational interfaces, maximum convenience
+**Code reduction:** 99% less code - describe goals in natural language
+**Requirements:** OpenAI or Anthropic API key
+### 🤖 Level 3: Agent (Natural Language Commands) - **Recommended for Most Users**
+Zero coding knowledge needed. Just write what you want in plain English:
+```typescript
+import { SentienceBrowser, SentienceAgent, OpenAIProvider } from 'sentienceapi';
+const browser = await SentienceBrowser.create({ apiKey: process.env.SENTIENCE_API_KEY });
+const llm = new OpenAIProvider(process.env.OPENAI_API_KEY!, 'gpt-4o-mini');
+const agent = new SentienceAgent(browser, llm);
+await browser.getPage().goto('https://www.amazon.com');
+// Just natural language commands - agent handles everything!
+await agent.act('Click the search box');
+await agent.act("Type 'wireless mouse' into the search field");
+await agent.act('Press Enter key');
+await agent.act('Click the first product result');
+// Automatic token tracking
+console.log(`Tokens used: ${agent.getTokenStats().totalTokens}`);
+await browser.close();
+```
+**When to use:** Quick automation, non-technical users, rapid prototyping
+**Code reduction:** 95-98% less code vs manual approach
+**Requirements:** OpenAI API key (or Anthropic for Claude)
+### 🔧 Level 2: Direct SDK (Technical Control)
+Full control with semantic selectors. For technical users who want precision:
+```typescript
+import { SentienceBrowser, snapshot, find, click, typeText, press } from 'sentienceapi';
+const browser = await SentienceBrowser.create({ apiKey: process.env.SENTIENCE_API_KEY });
+await browser.getPage().goto('https://www.amazon.com');
+// Get semantic snapshot
+const snap = await snapshot(browser);
+// Find elements using query DSL
+const searchBox = find(snap, 'role=textbox text~"search"');
+await click(browser, searchBox!.id);
+// Type and submit
+await typeText(browser, searchBox!.id, 'wireless mouse');
+await press(browser, 'Enter');
+await browser.close();
+```
+**When to use:** Need precise control, debugging, custom workflows
+**Code reduction:** Still 80% less code vs raw Playwright
+**Requirements:** Only Sentience API key
+### ⚙️ Level 1: Raw Playwright (Maximum Control)
+For when you need complete low-level control (rare):
+```typescript
+import { chromium } from 'playwright';
+const browser = await chromium.launch();
+const page = await browser.newPage();
+await page.goto('https://www.amazon.com');
+await page.fill('#twotabsearchtextbox', 'wireless mouse');
+await page.press('#twotabsearchtextbox', 'Enter');
+await browser.close();
+```
+**When to use:** Very specific edge cases, custom browser configs
+**Tradeoffs:** No semantic intelligence, brittle selectors, more code
+---
+## Agent Execution Tracing (NEW in v0.3.1)
+Record complete agent execution traces for debugging, analysis, and replay. Traces capture every step, snapshot, LLM decision, and action in a structured JSONL format.
+### Quick Start: Agent with Tracing
+```typescript
+import {
+  SentienceBrowser,
+  SentienceAgent,
+  OpenAIProvider,
+  Tracer,
+  JsonlTraceSink
+} from 'sentienceapi';
+import { randomUUID } from 'crypto';
+const browser = await SentienceBrowser.create({ apiKey: process.env.SENTIENCE_API_KEY });
+const llm = new OpenAIProvider(process.env.OPENAI_API_KEY!, 'gpt-4o');
+// Create a tracer
+const runId = randomUUID();
+const sink = new JsonlTraceSink(`traces/${runId}.jsonl`);
+const tracer = new Tracer(runId, sink);
+// Create agent with tracer
+const agent = new SentienceAgent(browser, llm, 50, true, tracer);
+// Emit run_start
+tracer.emitRunStart('SentienceAgent', 'gpt-4o');
+try {
+  await browser.getPage().goto('https://google.com');
+  // Every action is automatically traced!
+  await agent.act('Click the search box');
+  await agent.act("Type 'sentience ai' into the search field");
+  await agent.act('Press Enter');
+  tracer.emitRunEnd(3);
+} finally {
+  // Flush trace to disk
+  await agent.closeTracer();
+  await browser.close();
+}
+console.log(`✅ Trace saved to: traces/${runId}.jsonl`);
+```
+### What Gets Traced
+Each agent action generates multiple events:
+1. **step_start** - Before action execution (goal, URL, attempt)
+2. **snapshot** - Page state with all interactive elements
+3. **llm_response** - LLM decision (model, tokens, response)
+4. **action** - Executed action (type, element ID, success)
+5. **error** - Any failures (error message, retry attempt)
+**Example trace output:**
+```jsonl
+{"v":1,"type":"run_start","ts":"2025-12-26T10:00:00.000Z","run_id":"abc-123","seq":1,"data":{"agent":"SentienceAgent","llm_model":"gpt-4o"}}
+{"v":1,"type":"step_start","ts":"2025-12-26T10:00:01.000Z","run_id":"abc-123","seq":2,"step_id":"step-1","data":{"step_index":1,"goal":"Click the search box","attempt":0,"url":"https://google.com"}}
+{"v":1,"type":"snapshot","ts":"2025-12-26T10:00:01.500Z","run_id":"abc-123","seq":3,"step_id":"step-1","data":{"url":"https://google.com","elements":[...]}}
+{"v":1,"type":"llm_response","ts":"2025-12-26T10:00:02.000Z","run_id":"abc-123","seq":4,"step_id":"step-1","data":{"model":"gpt-4o","prompt_tokens":250,"completion_tokens":10,"response_text":"CLICK(42)"}}
+{"v":1,"type":"action","ts":"2025-12-26T10:00:02.500Z","run_id":"abc-123","seq":5,"step_id":"step-1","data":{"action_type":"click","element_id":42,"success":true}}
+{"v":1,"type":"run_end","ts":"2025-12-26T10:00:03.000Z","run_id":"abc-123","seq":6,"data":{"steps":1}}
+```
+### Reading and Analyzing Traces
+```typescript
+import * as fs from 'fs';
+// Read trace file
+const content = fs.readFileSync(`traces/${runId}.jsonl`, 'utf-8');
+const events = content.trim().split('\n').map(JSON.parse);
+console.log(`Total events: ${events.length}`);
+// Analyze events
+events.forEach(event => {
+  console.log(`[${event.seq}] ${event.type} - ${event.ts}`);
+});
+// Filter by type
+const actions = events.filter(e => e.type === 'action');
+console.log(`Actions taken: ${actions.length}`);
+// Get token usage
+const llmEvents = events.filter(e => e.type === 'llm_response');
+const totalTokens = llmEvents.reduce((sum, e) => sum + (e.data.prompt_tokens || 0) + (e.data.completion_tokens || 0), 0);
+console.log(`Total tokens: ${totalTokens}`);
+```
+### Tracing Without Agent (Manual)
+You can also use the tracer directly for custom workflows:
+```typescript
+import { Tracer, JsonlTraceSink } from 'sentienceapi';
+import { randomUUID } from 'crypto';
+const runId = randomUUID();
+const sink = new JsonlTraceSink(`traces/${runId}.jsonl`);
+const tracer = new Tracer(runId, sink);
+// Emit custom events
+tracer.emit('custom_event', {
+  message: 'Something happened',
+  details: { foo: 'bar' }
+});
+// Use convenience methods
+tracer.emitRunStart('MyAgent', 'gpt-4o');
+tracer.emitStepStart('step-1', 1, 'Do something');
+tracer.emitError('step-1', 'Something went wrong');
+tracer.emitRunEnd(1);
+// Flush to disk
+await tracer.close();
+```
+### Schema Compatibility
+Traces are **100% compatible** with Python SDK traces - use the same tools to analyze traces from both TypeScript and Python agents!
+**See full example:** [examples/agent-with-tracing.ts](examples/agent-with-tracing.ts)
+---
+## Agent Layer Examples
+### Google Search (6 lines of code)
+```typescript
+import { SentienceBrowser, SentienceAgent, OpenAIProvider } from 'sentienceapi';
+const browser = await SentienceBrowser.create({ apiKey: apiKey });
+const llm = new OpenAIProvider(openaiKey, 'gpt-4o-mini');
+const agent = new SentienceAgent(browser, llm);
+await browser.getPage().goto('https://www.google.com');
+await agent.act('Click the search box');
+await agent.act("Type 'mechanical keyboards' into the search field");
+await agent.act('Press Enter key');
+await agent.act('Click the first non-ad search result');
+await browser.close();
+```
+**See full example:** [examples/agent-google-search.ts](examples/agent-google-search.ts)
+### Using Anthropic Claude Instead of GPT
+```typescript
+import { SentienceAgent, AnthropicProvider } from 'sentienceapi';
+// Swap OpenAI for Anthropic - same API!
+const llm = new AnthropicProvider(
+  process.env.ANTHROPIC_API_KEY!,
+  'claude-3-5-sonnet-20241022'
+);
+const agent = new SentienceAgent(browser, llm);
+await agent.act('Click the search button'); // Works exactly the same
+```
+**BYOB (Bring Your Own Brain):** OpenAI, Anthropic, or implement `LLMProvider` for any model.
+**See full example:** [examples/agent-with-anthropic.ts](examples/agent-with-anthropic.ts)
+### Amazon Shopping (98% code reduction)
+**Before (manual approach):** 350 lines
+**After (agent layer):** 6 lines
+```typescript
+await agent.act('Click the search box');
+await agent.act("Type 'wireless mouse' into the search field");
+await agent.act('Press Enter key');
+await agent.act('Click the first visible product in the search results');
+await agent.act("Click the 'Add to Cart' button");
+```
+**See full example:** [examples/agent-amazon-shopping.ts](examples/agent-amazon-shopping.ts)
+---
+## Installation for Agent Layer
+```bash
+# Install core SDK
+npm install sentienceapi
+# Install LLM provider (choose one or both)
+npm install openai              # For GPT-4, GPT-4o, GPT-4o-mini
+npm install @anthropic-ai/sdk   # For Claude 3.5 Sonnet
+# Set API keys
+export SENTIENCE_API_KEY="your-sentience-key"
+export OPENAI_API_KEY="your-openai-key"        # OR
+export ANTHROPIC_API_KEY="your-anthropic-key"
+```
+---
+## Direct SDK Quick Start
+```typescript
+import { SentienceBrowser, snapshot, find, click } from './src';
+async function main() {
+  const browser = new SentienceBrowser();
+  try {
+    await browser.start();
+    await browser.goto('https://example.com');
+    await browser.getPage().waitForLoadState('networkidle');
+    // Take snapshot - captures all interactive elements
+    const snap = await snapshot(browser);
+    console.log(`Found ${snap.elements.length} elements`);
+    // Find and click a link using semantic selectors
+    const link = find(snap, 'role=link text~"More information"');
+    if (link) {
+      const result = await click(browser, link.id);
+      console.log(`Click success: ${result.success}`);
+    }
+  } finally {
+    await browser.close();
+  }
+}
+main();
+```
+## Real-World Example: Amazon Shopping Bot
+This example demonstrates navigating Amazon, finding products, and adding items to cart:
+```typescript
+import { SentienceBrowser, snapshot, find, click } from './src';
+async function main() {
+  const browser = new SentienceBrowser(undefined, undefined, false);
+  try {
+    await browser.start();
+    // Navigate to Amazon Best Sellers
+    await browser.goto('https://www.amazon.com/gp/bestsellers/');
+    await browser.getPage().waitForLoadState('networkidle');
+    await new Promise(resolve => setTimeout(resolve, 2000));
+    // Take snapshot and find products
+    const snap = await snapshot(browser);
+    console.log(`Found ${snap.elements.length} elements`);
+    // Find first product in viewport using spatial filtering
+    const products = snap.elements
+      .filter(el =>
+        el.role === 'link' &&
+        el.visual_cues.is_clickable &&
+        el.in_viewport &&
+        !el.is_occluded &&
+        el.bbox.y < 600  // First row
+      );
+    if (products.length > 0) {
+      // Sort by position (left to right, top to bottom)
+      products.sort((a, b) => a.bbox.y - b.bbox.y || a.bbox.x - b.bbox.x);
+      const firstProduct = products[0];
+      console.log(`Clicking: ${firstProduct.text}`);
+      const result = await click(browser, firstProduct.id);
+      // Wait for product page
+      await browser.getPage().waitForLoadState('networkidle');
+      await new Promise(resolve => setTimeout(resolve, 2000));
+      // Find and click "Add to Cart" button
+      const productSnap = await snapshot(browser);
+      const addToCart = find(productSnap, 'role=button text~"add to cart"');
+      if (addToCart) {
+        const cartResult = await click(browser, addToCart.id);
+        console.log(`Added to cart: ${cartResult.success}`);
+      }
+    }
+  } finally {
+    await browser.close();
+  }
+}
+main();
+```
+**See the complete tutorial**: [Amazon Shopping Guide](../docs/AMAZON_SHOPPING_GUIDE.md)
+## Running Examples
+**⚠️ Important**: You cannot use `node` directly to run TypeScript files. Use one of these methods:
+### Option 1: Using npm scripts (recommended)
+```bash
+npm run example:hello
+npm run example:basic
+npm run example:query
+npm run example:wait
+```
+### Option 2: Using ts-node directly
+```bash
+npx ts-node examples/hello.ts
+# or if ts-node is installed globally:
+ts-node examples/hello.ts
+```
+### Option 3: Compile then run
+```bash
+npm run build
+# Then use compiled JavaScript from dist/
+```
+## Core Features
+### Browser Control
+- **`SentienceBrowser`** - Playwright browser with Sentience extension pre-loaded
+- **`browser.goto(url)`** - Navigate with automatic extension readiness checks
+- Automatic bot evasion and stealth mode
+- Configurable headless/headed mode
+### Snapshot - Intelligent Page Analysis
+- **`snapshot(browser, options?)`** - Capture page state with AI-ranked elements
+- Returns semantic elements with roles, text, importance scores, and bounding boxes
+- Optional screenshot capture (PNG/JPEG)
+- TypeScript types for type safety
+**Example:**
+```typescript
+const snap = await snapshot(browser, { screenshot: true });
+// Access structured data
+console.log(`URL: ${snap.url}`);
+console.log(`Viewport: ${snap.viewport.width}x${snap.viewport.height}`);
+console.log(`Elements: ${snap.elements.length}`);
+// Iterate over elements
+for (const element of snap.elements) {
+  console.log(`${element.role}: ${element.text} (importance: ${element.importance})`);
+}
+```
+### Query Engine - Semantic Element Selection
+- **`query(snapshot, selector)`** - Find all matching elements
+- **`find(snapshot, selector)`** - Find single best match (by importance)
+- Powerful query DSL with multiple operators
+**Query Examples:**
+```typescript
+// Find by role and text
+const button = find(snap, 'role=button text="Sign in"');
+// Substring match (case-insensitive)
+const link = find(snap, 'role=link text~"more info"');
+// Spatial filtering
+const topLeft = find(snap, 'bbox.x<=100 bbox.y<=200');
+// Multiple conditions (AND logic)
+const primaryBtn = find(snap, 'role=button clickable=true visible=true importance>800');
+// Prefix/suffix matching
+const startsWith = find(snap, 'text^="Add"');
+const endsWith = find(snap, 'text$="Cart"');
+// Numeric comparisons
+const important = query(snap, 'importance>=700');
+const firstRow = query(snap, 'bbox.y<600');
+```
+**📖 [Complete Query DSL Guide](docs/QUERY_DSL.md)** - All operators, fields, and advanced patterns
+### Actions - Interact with Elements
+- **`click(browser, elementId)`** - Click element by ID
+- **`clickRect(browser, rect)`** - Click at center of rectangle (coordinate-based)
+- **`typeText(browser, elementId, text)`** - Type into input fields
+- **`press(browser, key)`** - Press keyboard keys (Enter, Escape, Tab, etc.)
+All actions return `ActionResult` with success status, timing, and outcome:
+```typescript
+const result = await click(browser, element.id);
+console.log(`Success: ${result.success}`);
+console.log(`Outcome: ${result.outcome}`);  // "navigated", "dom_updated", "error"
+console.log(`Duration: ${result.duration_ms}ms`);
+console.log(`URL changed: ${result.url_changed}`);
+```
+**Coordinate-based clicking:**
+```typescript
+import { clickRect } from './src';
+// Click at center of rectangle (x, y, width, height)
+await clickRect(browser, { x: 100, y: 200, w: 50, h: 30 });
+// With visual highlight (default: red border for 2 seconds)
+await clickRect(browser, { x: 100, y: 200, w: 50, h: 30 }, true, 2.0);
+// Using element's bounding box
+const snap = await snapshot(browser);
+const element = find(snap, 'role=button');
+if (element) {
+  await clickRect(browser, {
+    x: element.bbox.x,
+    y: element.bbox.y,
+    w: element.bbox.width,
+    h: element.bbox.height
+  });
+}
+```
+### Wait & Assertions
+- **`waitFor(browser, selector, timeout?, interval?, useApi?)`** - Wait for element to appear
+- **`expect(browser, selector)`** - Assertion helper with fluent API
+**Examples:**
+```typescript
+// Wait for element (auto-detects optimal interval based on API usage)
+const result = await waitFor(browser, 'role=button text="Submit"', 10000);
+if (result.found) {
+  console.log(`Found after ${result.duration_ms}ms`);
+}
+// Use local extension with fast polling (250ms interval)
+const result = await waitFor(browser, 'role=button', 5000, undefined, false);
+// Use remote API with network-friendly polling (1500ms interval)
+const result = await waitFor(browser, 'role=button', 5000, undefined, true);
+// Custom interval override
+const result = await waitFor(browser, 'role=button', 5000, 500, false);
+// Semantic wait conditions
+await waitFor(browser, 'clickable=true', 5000);  // Wait for clickable element
+await waitFor(browser, 'importance>100', 5000);  // Wait for important element
+await waitFor(browser, 'role=link visible=true', 5000);  // Wait for visible link
+// Assertions
+await expect(browser, 'role=button text="Submit"').toExist(5000);
+await expect(browser, 'role=heading').toBeVisible();
+await expect(browser, 'role=button').toHaveText('Submit');
+await expect(browser, 'role=link').toHaveCount(10);
+```
+### Content Reading
+- **`read(browser, options?)`** - Extract page content
+  - `format: "text"` - Plain text extraction
+  - `format: "markdown"` - High-quality markdown conversion (uses Turndown)
+  - `format: "raw"` - Cleaned HTML (default)
+**Example:**
+```typescript
+import { read } from './src';
+// Get markdown content
+const result = await read(browser, { format: 'markdown' });
+console.log(result.content);  // Markdown text
+// Get plain text
+const result = await read(browser, { format: 'text' });
+console.log(result.content);  // Plain text
+```
+### Screenshots
+- **`screenshot(browser, options?)`** - Standalone screenshot capture
+  - Returns base64-encoded data URL
+  - PNG or JPEG format
+  - Quality control for JPEG (1-100)
+**Example:**
+```typescript
+import { screenshot } from './src';
+import { writeFileSync } from 'fs';
+// Capture PNG screenshot
+const dataUrl = await screenshot(browser, { format: 'png' });
+// Save to file
+const base64Data = dataUrl.split(',')[1];
+const imageData = Buffer.from(base64Data, 'base64');
+writeFileSync('screenshot.png', imageData);
+// JPEG with quality control (smaller file size)
+const dataUrl = await screenshot(browser, { format: 'jpeg', quality: 85 });
+```
+## Element Properties
+Elements returned by `snapshot()` have the following properties:
+```typescript
+element.id              // Unique identifier for interactions
+element.role            // ARIA role (button, link, textbox, heading, etc.)
+element.text            // Visible text content
+element.importance      // AI importance score (0-1000)
+element.bbox            // Bounding box (x, y, width, height)
+element.visual_cues     // Visual analysis (is_primary, is_clickable, background_color)
+element.in_viewport     // Is element visible in current viewport?
+element.is_occluded     // Is element covered by other elements?
+element.z_index         // CSS stacking order
+```
+## Query DSL Reference
+### Basic Operators
+| Operator | Description | Example |
+|----------|-------------|---------|
+| `=` | Exact match | `role=button` |
+| `!=` | Exclusion | `role!=link` |
+| `~` | Substring (case-insensitive) | `text~"sign in"` |
+| `^=` | Prefix match | `text^="Add"` |
+| `$=` | Suffix match | `text$="Cart"` |
+| `>`, `>=` | Greater than | `importance>500` |
+| `<`, `<=` | Less than | `bbox.y<600` |
+### Supported Fields
+- **Role**: `role=button|link|textbox|heading|...`
+- **Text**: `text`, `text~`, `text^=`, `text$=`
+- **Visibility**: `clickable=true|false`, `visible=true|false`
+- **Importance**: `importance`, `importance>=N`, `importance<N`
+- **Position**: `bbox.x`, `bbox.y`, `bbox.width`, `bbox.height`
+- **Layering**: `z_index`
+## Examples
+See the `examples/` directory for complete working examples:
+### Agent Layer (Level 3 - Natural Language)
+- **`agent-google-search.ts`** - Google search automation with natural language commands
+- **`agent-amazon-shopping.ts`** - Amazon shopping bot (6 lines vs 350 lines manual code)
+- **`agent-with-anthropic.ts`** - Using Anthropic Claude instead of OpenAI GPT
+### Direct SDK (Level 2 - Technical Control)
+- **`hello.ts`** - Extension bridge verification
+- **`basic-agent.ts`** - Basic snapshot and element inspection
+- **`query-demo.ts`** - Query engine demonstrations
+- **`wait-and-click.ts`** - Waiting for elements and performing actions
+- **`read-markdown.ts`** - Content extraction and markdown conversion
+## Testing
+```bash
+# Run all tests
+npm test
+# Run with coverage
+npm run test:coverage
+# Run specific test file
+npm test -- snapshot.test.ts
+```
+## Configuration
+### Viewport Size
+Default viewport is **1280x800** pixels. You can customize it using Playwright's API:
+```typescript
+const browser = new SentienceBrowser();
+await browser.start();
+// Set custom viewport before navigating
+await browser.getPage().setViewportSize({ width: 1920, height: 1080 });
+await browser.goto('https://example.com');
+```
+### Headless Mode
+```typescript
+// Headed mode (shows browser window)
+const browser = new SentienceBrowser(undefined, undefined, false);
+// Headless mode
+const browser = new SentienceBrowser(undefined, undefined, true);
+// Auto-detect based on environment (default)
+const browser = new SentienceBrowser();  // headless=true if CI=true, else false
+```
+### Residential Proxy Support
+For users running from datacenters (AWS, DigitalOcean, etc.), you can configure a residential proxy to prevent IP-based detection by Cloudflare, Akamai, and other anti-bot services.
+**Supported Formats:**
+- HTTP: `http://username:password@host:port`
+- HTTPS: `https://username:password@host:port`
+- SOCKS5: `socks5://username:password@host:port`
+**Usage:**
+```typescript
+// Via constructor parameter
+const browser = new SentienceBrowser(
+  undefined,
+  undefined,
+  false,
+  'http://username:password@residential-proxy.com:8000'
+);
+await browser.start();
+// Via environment variable
+process.env.SENTIENCE_PROXY = 'http://username:password@proxy.com:8000';
+const browser = new SentienceBrowser();
+await browser.start();
+// With agent
+import { SentienceAgent, OpenAIProvider } from 'sentienceapi';
+const browser = new SentienceBrowser(
+  'your-api-key',
+  undefined,
+  false,
+  'http://user:pass@proxy.com:8000'
+);
+await browser.start();
+const agent = new SentienceAgent(browser, new OpenAIProvider('openai-key'));
+await agent.act('Navigate to example.com');
+```
+**WebRTC Protection:**
+The SDK automatically adds WebRTC leak protection flags when a proxy is configured, preventing your real datacenter IP from being exposed via WebRTC even when using proxies.
+**HTTPS Certificate Handling:**
+The SDK automatically ignores HTTPS certificate errors when a proxy is configured, as residential proxies often use self-signed certificates for SSL interception. This ensures seamless navigation to HTTPS sites through the proxy.
+**Example:**
+```bash
+# Run with proxy via environment variable
+SENTIENCE_PROXY=http://user:pass@proxy.com:8000 npm run example:proxy
+# Or via command line argument
+ts-node examples/proxy-example.ts --proxy=http://user:pass@proxy.com:8000
+```
+**Note:** The proxy is configured at the browser level, so all traffic (including the Chrome extension) routes through the proxy. No changes to the extension are required.
+### Authentication Session Injection
+Inject pre-recorded authentication sessions (cookies + localStorage) to start your agent already logged in, bypassing login screens, 2FA, and CAPTCHAs. This saves tokens and reduces costs by eliminating login steps.
+```typescript
+// Workflow 1: Inject pre-recorded session from file
+import { SentienceBrowser, saveStorageState } from 'sentienceapi';
+// Save session after manual login
+const browser = new SentienceBrowser();
+await browser.start();
+await browser.getPage().goto('https://example.com');
+// ... log in manually ...
+await saveStorageState(browser.getContext(), 'auth.json');
+// Use saved session in future runs
+const browser2 = new SentienceBrowser(
+  undefined, // apiKey
+  undefined, // apiUrl
+  false,     // headless
+  undefined,  // proxy
+  undefined,  // userDataDir
+  'auth.json' // storageState - inject saved session
+);
+await browser2.start();
+// Agent starts already logged in!
+// Workflow 2: Persistent sessions (cookies persist across runs)
+const browser3 = new SentienceBrowser(
+  undefined,      // apiKey
+  undefined,      // apiUrl
+  false,          // headless
+  undefined,      // proxy
+  './chrome_profile', // userDataDir - persist cookies
+  undefined       // storageState
+);
+await browser3.start();
+// First run: Log in
+// Second run: Already logged in (cookies persist automatically)
+```
+**Benefits:**
+- Bypass login screens and CAPTCHAs with valid sessions
+- Save 5-10 agent steps and hundreds of tokens per run
+- Maintain stateful sessions for accessing authenticated pages
+- Act as authenticated users (e.g., "Go to my Orders page")
+See `examples/auth-injection-agent.ts` for complete examples.
+## Best Practices
+### 1. Wait for Dynamic Content
+```typescript
+await browser.goto('https://example.com');
+await browser.getPage().waitForLoadState('networkidle');
+await new Promise(resolve => setTimeout(resolve, 1000));  // Extra buffer
+```
+### 2. Use Multiple Strategies for Finding Elements
+```typescript
+// Try exact match first
+let btn = find(snap, 'role=button text="Add to Cart"');
+// Fallback to fuzzy match
+if (!btn) {
+  btn = find(snap, 'role=button text~"cart"');
+}
+```
+### 3. Check Element Visibility Before Clicking
+```typescript
+if (element.in_viewport && !element.is_occluded) {
+  await click(browser, element.id);
+}
+```
+### 4. Handle Navigation
+```typescript
+const result = await click(browser, linkId);
+if (result.url_changed) {
+  await browser.getPage().waitForLoadState('networkidle');
+}
+```
+### 5. Use Screenshots Sparingly
+```typescript
+// Fast - no screenshot (only element data)
+const snap = await snapshot(browser);
+// Slower - with screenshot (for debugging/verification)
+const snap = await snapshot(browser, { screenshot: true });
+```
+### 6. Always Close Browser
+```typescript
+const browser = new SentienceBrowser();
+try {
+  await browser.start();
+  // ... your automation code
+} finally {
+  await browser.close();  // Always clean up
+}
+```
+## Troubleshooting
+### "Extension failed to load"
+**Solution:** Build the extension first:
+```bash
+cd sentience-chrome
+./build.sh
+```
+### "Cannot use import statement outside a module"
+**Solution:** Don't use `node` directly. Use `ts-node` or npm scripts:
+```bash
+npx ts-node examples/hello.ts
+# or
+npm run example:hello
+```
+### "Element not found"
+**Solutions:**
+- Ensure page is loaded: `await browser.getPage().waitForLoadState('networkidle')`
+- Use `waitFor()`: `await waitFor(browser, 'role=button', 10000)`
+- Debug elements: `console.log(snap.elements.map(el => el.text))`
+### Button not clickable
+**Solutions:**
+- Check visibility: `element.in_viewport && !element.is_occluded`
+- Scroll to element: `await browser.getPage().evaluate(\`window.sentience_registry[${element.id}].scrollIntoView()\`)`
+## Documentation
+- **📖 [Amazon Shopping Guide](../docs/AMAZON_SHOPPING_GUIDE.md)** - Complete tutorial with real-world example
+- **📖 [Query DSL Guide](docs/QUERY_DSL.md)** - Advanced query patterns and operators
+- **📄 [API Contract](../spec/SNAPSHOT_V1.md)** - Snapshot API specification
+- **📄 [Type Definitions](../spec/sdk-types.md)** - TypeScript/Python type definitions
+## License
+📜 **License**
+This SDK is licensed under the **Elastic License 2.0 (ELv2)**.
+The Elastic License 2.0 allows you to use, modify, and distribute this SDK for internal, research, and non-competitive purposes. It **does not permit offering this SDK or a derivative as a hosted or managed service**, nor using it to build a competing product or service.
+### Important Notes
+- This SDK is a **client-side library** that communicates with proprietary Sentience services and browser components.
+- The Sentience backend services (including semantic geometry grounding, ranking, visual cues, and trace processing) are **not open source** and are governed by Sentience’s Terms of Service.
+- Use of this SDK does **not** grant rights to operate, replicate, or reimplement Sentience’s hosted services.
+For commercial usage, hosted offerings, or enterprise deployments, please contact Sentience to obtain a commercial license.
+See the full license text in [`LICENSE`](./LICENSE.md).