rehydra 0.3.4 → 0.4.0 (npm)

Files changed (76)

package/README.md +170 -760
package/dist/index.d.ts +2 -0
package/dist/index.d.ts.map +1 -1
package/dist/index.js +4 -0
package/dist/index.js.map +1 -1
package/dist/proxy/index.d.ts +12 -0
package/dist/proxy/index.d.ts.map +1 -0
package/dist/proxy/index.js +11 -0
package/dist/proxy/index.js.map +1 -0
package/dist/proxy/providers/anthropic.d.ts +17 -0
package/dist/proxy/providers/anthropic.d.ts.map +1 -0
package/dist/proxy/providers/anthropic.js +117 -0
package/dist/proxy/providers/anthropic.js.map +1 -0
package/dist/proxy/providers/index.d.ts +19 -0
package/dist/proxy/providers/index.d.ts.map +1 -0
package/dist/proxy/providers/index.js +40 -0
package/dist/proxy/providers/index.js.map +1 -0
package/dist/proxy/providers/openai.d.ts +17 -0
package/dist/proxy/providers/openai.d.ts.map +1 -0
package/dist/proxy/providers/openai.js +92 -0
package/dist/proxy/providers/openai.js.map +1 -0
package/dist/proxy/providers/types.d.ts +29 -0
package/dist/proxy/providers/types.d.ts.map +1 -0
package/dist/proxy/providers/types.js +6 -0
package/dist/proxy/providers/types.js.map +1 -0
package/dist/proxy/proxy-server.d.ts +53 -0
package/dist/proxy/proxy-server.d.ts.map +1 -0
package/dist/proxy/proxy-server.js +146 -0
package/dist/proxy/proxy-server.js.map +1 -0
package/dist/proxy/rehydra-fetch.d.ts +35 -0
package/dist/proxy/rehydra-fetch.d.ts.map +1 -0
package/dist/proxy/rehydra-fetch.js +217 -0
package/dist/proxy/rehydra-fetch.js.map +1 -0
package/dist/proxy/rehydra-proxy.d.ts +40 -0
package/dist/proxy/rehydra-proxy.d.ts.map +1 -0
package/dist/proxy/rehydra-proxy.js +82 -0
package/dist/proxy/rehydra-proxy.js.map +1 -0
package/dist/proxy/sse-parser.d.ts +59 -0
package/dist/proxy/sse-parser.d.ts.map +1 -0
package/dist/proxy/sse-parser.js +112 -0
package/dist/proxy/sse-parser.js.map +1 -0
package/dist/proxy/types.d.ts +49 -0
package/dist/proxy/types.d.ts.map +1 -0
package/dist/proxy/types.js +5 -0
package/dist/proxy/types.js.map +1 -0
package/dist/proxy/wrap-client.d.ts +47 -0
package/dist/proxy/wrap-client.d.ts.map +1 -0
package/dist/proxy/wrap-client.js +70 -0
package/dist/proxy/wrap-client.js.map +1 -0
package/dist/storage/session.d.ts +3 -0
package/dist/storage/session.d.ts.map +1 -1
package/dist/storage/session.js +16 -0
package/dist/storage/session.js.map +1 -1
package/dist/storage/types.d.ts +16 -0
package/dist/storage/types.d.ts.map +1 -1
package/dist/streaming/anonymizer-stream.d.ts +63 -0
package/dist/streaming/anonymizer-stream.d.ts.map +1 -0
package/dist/streaming/anonymizer-stream.js +184 -0
package/dist/streaming/anonymizer-stream.js.map +1 -0
package/dist/streaming/index.d.ts +9 -0
package/dist/streaming/index.d.ts.map +1 -0
package/dist/streaming/index.js +8 -0
package/dist/streaming/index.js.map +1 -0
package/dist/streaming/sentence-buffer.d.ts +78 -0
package/dist/streaming/sentence-buffer.d.ts.map +1 -0
package/dist/streaming/sentence-buffer.js +238 -0
package/dist/streaming/sentence-buffer.js.map +1 -0
package/dist/streaming/stream-factory.d.ts +38 -0
package/dist/streaming/stream-factory.d.ts.map +1 -0
package/dist/streaming/stream-factory.js +69 -0
package/dist/streaming/stream-factory.js.map +1 -0
package/dist/streaming/types.d.ts +121 -0
package/dist/streaming/types.d.ts.map +1 -0
package/dist/streaming/types.js +5 -0
package/dist/streaming/types.js.map +1 -0
package/package.json +18 -1

package/README.md CHANGED

@@ -6,898 +6,308 @@
 ![Issues](https://img.shields.io/github/issues/rehydra-ai/rehydra)
 [![codecov](https://codecov.io/github/rehydra-ai/rehydra/graph/badge.svg?token=WX5RI0ZZJG)](https://codecov.io/github/rehydra-ai/rehydra)
-On-device PII anonymization module for high-privacy AI workflows. Detects and replaces Personally Identifiable Information (PII) with semantically valuable placeholder tags while maintaining an encrypted mapping for rehydration.
-```bash
-npm install rehydra
-```
-**Works in Node.js, Bun, and browsers**
-## Features
-- **Structured PII Detection**: Regex-based detection for emails, phones, IBANs, credit cards, IPs, URLs
-- **Soft PII Detection**: ONNX-powered NER model for names, organizations, locations (auto-downloads on first use if enabled)
-- **Semantic Enrichment**: AI/MT-friendly tags with gender/location attributes
-- **Secure PII Mapping**: AES-256-GCM encrypted storage of original PII values
-- **Cross-Platform**: Works identically in Node.js, Bun, and browsers
-- **Configurable Policies**: Customizable detection rules, thresholds, and allowlists
-- **Validation & Leak Scanning**: Built-in validation and optional leak detection
-## Installation
-### Node.js
+On-device PII anonymization for AI workflows. Detects names, emails, phones, IBANs, and more — replaces them with encrypted placeholder tags — and rehydrates them back after processing.
 ```bash
 npm install rehydra
 ```
-For bun support see [Bun Support](#bun-support)
-### Browser (with bundler)
-```bash
-npm install rehydra onnxruntime-web
-```
-### Browser (without bundler)
-```html
-<script type="module">
-  // Import directly from your dist folder or CDN
-  import { createAnonymizer } from './node_modules/rehydra/dist/index.js';
-  // onnxruntime-web is automatically loaded from CDN when needed
-</script>
-```
+**Works in Node.js, Bun, and browsers.** No data leaves your machine.
 ## Quick Start
-### Full pipeline (Anonymize → LLM → Rehydrate)
-The full workflow for privacy-preserving LLM workflows:
 ```typescript
-import {
-  createAnonymizer,
-  decryptPIIMap,
-  rehydrate,
-  InMemoryKeyProvider
-} from 'rehydra';
+import { createAnonymizer, decryptPIIMap, rehydrate, InMemoryKeyProvider } from 'rehydra';
-// 1. Create a key provider (required to decrypt later)
 const keyProvider = new InMemoryKeyProvider();
-// 2. Create anonymizer with key provider
 const anonymizer = createAnonymizer({
-  ner: { mode: 'quantized' },
-  semantic: { enabled: true },
-  keyProvider: keyProvider
+  ner: { mode: 'quantized' },  // ~280 MB model, auto-downloads on first use
+  keyProvider,
 });
-await anonymizer.initialize();
-// 3. Anonymize before translation
-const original = 'Hello John Smith from Acme Corp in Berlin!';
-const result = await anonymizer.anonymize(original);
+const result = await anonymizer.anonymize(
+  'Email john.smith@acme-corp.com or call John at +41 79 123 45 67'
+);
 console.log(result.anonymizedText);
-// "Hello <PII type="PERSON" gender="male" id="1"/> from <PII type="ORG" id="2"/> in <PII type="LOCATION" scope="city" id="3"/>!"
-// 4. Translate (or do other AI workloads that preserve placeholders)
-const translated = await yourAIWorkflow(result.anonymizedText, { from: 'en', to: 'de' });
-// "Hallo <PII type="PERSON" gender="male" id="1"/> von <PII type="ORG" id="2"/> in <PII type="LOCATION" scope="city" id="3"/>!"
+// "Email <PII type="EMAIL" id="1"/> or call <PII type="PERSON" id="2"/> at <PII type="PHONE" id="3"/>"
-// 5. Decrypt the PII map using the same key
-const encryptionKey = await keyProvider.getKey();
-const piiMap = await decryptPIIMap(result.piiMap, encryptionKey);
-// 6. Rehydrate - replace placeholders with original values
-const rehydrated = rehydrate(translated, piiMap);
-// "Hallo John Smith von Acme Corp in Berlin!"
+// Rehydrate after translation or other processing
+const key = await keyProvider.getKey();
+const piiMap = await decryptPIIMap(result.piiMap!, key);
+const original = rehydrate(result.anonymizedText, piiMap);
+// "Email john.smith@acme-corp.com or call John at +41 79 123 45 67"
-// 7. Clean up
 await anonymizer.dispose();
 ```
-### Regex-Only Mode (No Downloads Required)
-For structured PII like emails, phones, IBANs, credit cards:
-```typescript
-import { anonymizeRegexOnly } from 'rehydra';
-const result = await anonymizeRegexOnly(
-  'Contact john@example.com or call +49 30 123456. IBAN: DE89370400440532013000'
-);
-console.log(result.anonymizedText);
-// "Contact <PII type="EMAIL" id="1"/> or call <PII type="PHONE" id="2"/>. IBAN: <PII type="IBAN" id="3"/>"
-```
+## LLM Proxy
-### Full Mode with NER (Detects Names, Organizations, Locations)
+Drop-in middleware that anonymizes prompts before they leave your machine and rehydrates responses. Works with OpenAI, Anthropic, and any OpenAI-compatible API.
-The NER model is automatically downloaded on first use (~280 MB for quantized):
+### Wrap any fetch-based client
 ```typescript
-import { createAnonymizer } from 'rehydra';
-const anonymizer = createAnonymizer({
-  ner: {
-    mode: 'quantized',  // or 'standard' for full model (~1.1 GB)
-    onStatus: (status) => console.log(status),
-  }
+import OpenAI from 'openai';
+import { createRehydraFetch, InMemoryKeyProvider, InMemoryPIIStorageProvider } from 'rehydra';
+const openai = new OpenAI({
+  fetch: createRehydraFetch({
+    anonymizer: { ner: { mode: 'quantized' } },
+    keyProvider: new InMemoryKeyProvider(),
+    piiStorageProvider: new InMemoryPIIStorageProvider(),
+  }),
 });
-await anonymizer.initialize();  // Downloads model if needed
-const result = await anonymizer.anonymize(
-  'Hello John Smith from Acme Corp in Berlin!'
-);
-console.log(result.anonymizedText);
-// "Hello <PII type="PERSON" id="1"/> from <PII type="ORG" id="2"/> in <PII type="LOCATION" id="3"/>!"
-// Clean up when done
-await anonymizer.dispose();
+// PII is anonymized before leaving your machine, response is rehydrated automatically
+const response = await openai.chat.completions.create({
+  model: 'gpt-4o',
+  messages: [{ role: 'user', content: 'Draft a reply to john@example.com about the meeting' }],
+});
 ```
-### With Semantic Enrichment
-Add gender and location scope for better machine translation:
+### Or use wrapLLMClient for even less code
 ```typescript
-import { createAnonymizer } from 'rehydra';
+import OpenAI from 'openai';
+import { wrapLLMClient, InMemoryKeyProvider, InMemoryPIIStorageProvider } from 'rehydra';
-const anonymizer = createAnonymizer({
-  ner: { mode: 'quantized' },
-  semantic: {
-    enabled: true,  // Downloads ~12 MB of semantic data on first use
-    onStatus: (status) => console.log(status),
-  }
+const openai = wrapLLMClient(new OpenAI(), {
+  keyProvider: new InMemoryKeyProvider(),
+  piiStorageProvider: new InMemoryPIIStorageProvider(),
 });
-await anonymizer.initialize();
-const result = await anonymizer.anonymize(
-  'Hello Maria Schmidt from Berlin!'
-);
-console.log(result.anonymizedText);
-// "Hello <PII type="PERSON" gender="female" id="1"/> from <PII type="LOCATION" scope="city" id="2"/>!"
 ```
-## API Reference
-Full documentation on [https://docs.rehydra.ai](https://docs.rehydra.ai).
+### Standalone proxy server
-### Configuration Options
+Point any LLM client at a local proxy — zero code changes needed:
 ```typescript
-import { createAnonymizer, InMemoryKeyProvider } from 'rehydra';
+import { createRehydraProxyServer, InMemoryKeyProvider, InMemoryPIIStorageProvider } from 'rehydra';
-const anonymizer = createAnonymizer({
-  // NER configuration
-  ner: {
-    mode: 'quantized',              // 'standard' | 'quantized' | 'disabled' | 'custom'
-    backend: 'local',               // 'local' (default) | 'inference-server'
-    autoDownload: true,             // Auto-download model if not present
-    onStatus: (status) => {},       // Status messages callback
-    onDownloadProgress: (progress) => {
-      console.log(`${progress.file}: ${progress.percent}%`);
-    },
-    // For 'inference-server' backend:
-    inferenceServerUrl: 'http://localhost:8080',
-    // For 'custom' mode only:
-    modelPath: './my-model.onnx',
-    vocabPath: './vocab.txt',
-  },
-  // Semantic enrichment (adds gender/scope attributes)
-  semantic: {
-    enabled: true,                  // Enable MT-friendly attributes
-    autoDownload: true,             // Auto-download semantic data (~12 MB)
-    onStatus: (status) => {},
-    onDownloadProgress: (progress) => {},
-  },
-  // Encryption key provider
+const proxy = await createRehydraProxyServer({
+  port: 8080,
+  upstream: 'https://api.openai.com',
   keyProvider: new InMemoryKeyProvider(),
-  // Custom policy (optional)
-  defaultPolicy: { /* see Policy section */ },
+  piiStorageProvider: new InMemoryPIIStorageProvider(),
 });
-await anonymizer.initialize();
+// Point your client at the proxy
+const openai = new OpenAI({ baseURL: 'http://localhost:8080/v1' });
 ```
-### NER Modes
-| Mode | Description | Size | Auto-Download |
-|------|-------------|------|---------------|
-| `'disabled'` | No NER, regex only | 0 | N/A |
-| `'quantized'` | Smaller model, ~95% accuracy | ~280 MB | Yes |
-| `'standard'` | Full model, best accuracy | ~1.1 GB | Yes |
-| `'custom'` | Your own ONNX model | Varies | No |
+Supports non-streaming and streaming (SSE) responses for both OpenAI and Anthropic APIs.
-### ONNX Session Options
+## Streaming
-Fine-tune ONNX Runtime performance with session options:
+Process text chunk-by-chunk with constant memory. Works as a Node.js Transform stream.
 ```typescript
-const anonymizer = createAnonymizer({
-  ner: {
-    mode: 'quantized',
-    sessionOptions: {
-      // Graph optimization level: 'disabled' | 'basic' | 'extended' | 'all'
-      graphOptimizationLevel: 'all',  // default
-      // Threading (Node.js only)
-      intraOpNumThreads: 4,   // threads within operators
-      interOpNumThreads: 1,   // threads between operators
-      // Memory optimization
-      enableCpuMemArena: true,
-      enableMemPattern: true,
-    }
-  }
-});
-```
-#### Execution Providers
-By default, Rehydra uses:
-- **Node.js**: CPU (fastest for quantized models)
-- **Browsers**: CPU (WASM)
+import { createReadStream, createWriteStream } from 'fs';
+import { createAnonymizerStream, InMemoryKeyProvider } from 'rehydra';
-> For NVIDIA GPU acceleration with CUDA/TensorRT, use the inference server backend (see [GPU Acceleration](#gpu-acceleration-enterprise)).
-### GPU Acceleration (Enterprise)
-For high-throughput production deployments, Rehydra supports GPU-accelerated inference via a dedicated inference server. This is useful for large documents.
-```typescript
-const anonymizer = createAnonymizer({
-  ner: {
-    backend: 'inference-server',
-    inferenceServerUrl: 'http://localhost:8080',
-  }
+const stream = await createAnonymizerStream({
+  anonymizer: { ner: { mode: 'quantized' } },
+  keyProvider: new InMemoryKeyProvider(),
+  sessionId: 'batch-job-001',
+  piiStorageProvider: storage,
 });
-await anonymizer.initialize();
+createReadStream('input.txt').pipe(stream).pipe(createWriteStream('anonymized.txt'));
 ```
-**Performance Comparison:**
-| Text Size | CPU (local) | GPU (server) |
-|-----------|-------------|--------------|
-| Short (~40 chars) | 4.3ms | 62ms |
-| Medium (~500 chars) | 26ms | 73ms |
-| Long (~2000 chars) | 93ms | 117ms |
-| Entity-dense | 13ms | 68ms |
-Local CPU faster for most use cases due to network overhead. GPU is beneficial for batch processing and large documents.
-**Backend Options:**
-| Backend | Description | Latency (2K chars) |
-|---------|-------------|-------------------|
-| `'local'` | CPU inference (default) | ~4,300ms |
-| `'inference-server'` | GPU server (enterprise) | ~117ms |
-### Main Functions
-#### `createAnonymizer(config?)`
+### Low-latency mode for LLM token streams
-Creates a reusable anonymizer instance:
+Regex-only, smaller buffers, flushes aggressively — designed for real-time token streams:
 ```typescript
-const anonymizer = createAnonymizer({
-  ner: { mode: 'quantized' }
+const stream = await createAnonymizerStream({
+  buffer: { lowLatency: true },
 });
-await anonymizer.initialize();
-const result = await anonymizer.anonymize('text');
-await anonymizer.dispose();
-```
-#### `anonymize(text, locale?, policy?)`
-One-off anonymization (regex-only by default):
-```typescript
-import { anonymize } from 'rehydra';
-const result = await anonymize('Contact test@example.com');
-```
-#### `anonymizeWithNER(text, nerConfig, policy?)`
-One-off anonymization with NER:
-```typescript
-import { anonymizeWithNER } from 'rehydra';
-const result = await anonymizeWithNER(
-  'Hello John Smith',
-  { mode: 'quantized' }
-);
-```
-#### `anonymizeRegexOnly(text, policy?)`
-Fast regex-only anonymization:
-```typescript
-import { anonymizeRegexOnly } from 'rehydra';
-const result = await anonymizeRegexOnly('Card: 4111111111111111');
-```
-### Rehydration Functions
-#### `decryptPIIMap(encryptedMap, key)`
-Decrypts the PII map for rehydration:
-```typescript
-import { decryptPIIMap } from 'rehydra';
-const piiMap = await decryptPIIMap(result.piiMap, encryptionKey);
-// Returns Map<string, string> where key is "PERSON:1" and value is "John Smith"
-```
-#### `rehydrate(text, piiMap)`
-Replaces placeholders with original values:
-```typescript
-import { rehydrate } from 'rehydra';
-const original = rehydrate(translatedText, piiMap);
-```
-### Result Structure
-```typescript
-interface AnonymizationResult {
-  // Text with PII replaced by placeholder tags
-  anonymizedText: string;
-  // Detected entities (without original text for safety)
-  entities: Array<{
-    type: PIIType;
-    id: number;
-    start: number;
-    end: number;
-    confidence: number;
-    source: 'REGEX' | 'NER';
-  }>;
-  // Encrypted PII mapping (for later rehydration)
-  piiMap: {
-    ciphertext: string;  // Base64
-    iv: string;          // Base64
-    authTag: string;     // Base64
-  };
-  // Processing statistics
-  stats: {
-    countsByType: Record<PIIType, number>;
-    totalEntities: number;
-    processingTimeMs: number;
-    modelVersion: string;
-    leakScanPassed?: boolean;
-  };
-}
-```
-## Supported PII Types
-| Type | Description | Detection | Semantic Attributes |
-|------|-------------|-----------|---------------------|
-| `EMAIL` | Email addresses | Regex | - |
-| `PHONE` | Phone numbers (international) | Regex | - |
-| `IBAN` | International Bank Account Numbers | Regex + Checksum | - |
-| `BIC_SWIFT` | Bank Identifier Codes | Regex | - |
-| `CREDIT_CARD` | Credit card numbers | Regex + Luhn | - |
-| `IP_ADDRESS` | IPv4 and IPv6 addresses | Regex | - |
-| `URL` | Web URLs | Regex | - |
-| `CASE_ID` | Case/ticket numbers | Regex (configurable) | - |
-| `CUSTOMER_ID` | Customer identifiers | Regex (configurable) | - |
-| `PERSON` | Person names | NER | `gender` (male/female/neutral) |
-| `ORG` | Organization names | NER | - |
-| `LOCATION` | Location/place names | NER | `scope` (city/country/region) |
-| `ADDRESS` | Physical addresses | NER | - |
-| `DATE_OF_BIRTH` | Dates of birth | NER | - |
-## Configuration
-### Anonymization Policy
-```typescript
-import { createAnonymizer, PIIType } from 'rehydra';
-const anonymizer = createAnonymizer({
-  ner: { mode: 'quantized' },
-  defaultPolicy: {
-    // Which PII types to detect
-    enabledTypes: new Set([PIIType.EMAIL, PIIType.PHONE, PIIType.PERSON]),
-    // Confidence thresholds per type (0.0 - 1.0)
-    confidenceThresholds: new Map([
-      [PIIType.PERSON, 0.8],
-      [PIIType.EMAIL, 0.5],
-    ]),
-    // Terms to never treat as PII
-    allowlistTerms: new Set(['Customer Service', 'Help Desk']),
-    // Enable semantic enrichment (gender/scope)
-    enableSemanticMasking: true,
-    // Enable leak scanning on output
-    enableLeakScan: true,
-  },
+llmTokenStream.pipe(stream).on('data', (chunk) => {
+  ws.send(chunk.toString());
 });
 ```
-### Custom Recognizers
-Add domain-specific patterns:
+### Stream from a session
 ```typescript
-import { createCustomIdRecognizer, PIIType, createAnonymizer } from 'rehydra';
-const customRecognizer = createCustomIdRecognizer([
-  {
-    name: 'Order Number',
-    pattern: /\bORD-[A-Z0-9]{8}\b/g,
-    type: PIIType.CASE_ID,
-  },
-]);
-const anonymizer = createAnonymizer();
-anonymizer.getRegistry().register(customRecognizer);
+const session = anonymizer.session('chat-123');
+const stream = await session.createStream();
+input.pipe(stream).pipe(output);
 ```
-## Data & Model Storage
-Models and semantic data are cached locally for offline use.
+## Sessions
-### Node.js Cache Locations
-| Data | macOS | Linux | Windows |
-|------|-------|-------|---------|
-| NER Models | `~/Library/Caches/rehydra/models/` | `~/.cache/rehydra/models/` | `%LOCALAPPDATA%/rehydra/models/` |
-| Semantic Data | `~/Library/Caches/rehydra/semantic-data/` | `~/.cache/rehydra/semantic-data/` | `%LOCALAPPDATA%/rehydra/semantic-data/` |
-### Browser Cache
-In browsers, data is stored using:
-- **IndexedDB**: For semantic data and smaller files
-- **Origin Private File System (OPFS)**: For large model files (~280 MB)
-Data persists across page reloads and browser sessions.
-### Manual Data Management
+For multi-message conversations where PII IDs need to stay consistent and PII maps need to persist:
 ```typescript
-import {
-  // Model management
-  isModelDownloaded,
-  downloadModel,
-  clearModelCache,
-  listDownloadedModels,
-  // Semantic data management
-  isSemanticDataDownloaded,
-  downloadSemanticData,
-  clearSemanticDataCache,
+import {
+  createAnonymizer,
+  InMemoryKeyProvider,
+  SQLitePIIStorageProvider,  // or InMemoryPIIStorageProvider, IndexedDBPIIStorageProvider
 } from 'rehydra';
-// Check if model is downloaded
-const hasModel = await isModelDownloaded('quantized');
-// Manually download model with progress
-await downloadModel('quantized', (progress) => {
-  console.log(`${progress.file}: ${progress.percent}%`);
+const anonymizer = createAnonymizer({
+  ner: { mode: 'quantized' },
+  keyProvider: new InMemoryKeyProvider(),
+  piiStorageProvider: new SQLitePIIStorageProvider('./pii.db'),
 });
-// Check semantic data
-const hasSemanticData = await isSemanticDataDownloaded();
-// List downloaded models
-const models = await listDownloadedModels();
-// Clear caches
-await clearModelCache('quantized');  // or clearModelCache() for all
-await clearSemanticDataCache();
-```
-## Encryption & Security
-The PII map is encrypted using **AES-256-GCM** via the Web Crypto API (works in both Node.js and browsers).
+const session = anonymizer.session('chat-123');
-### Key Providers
+// Message 1
+await session.anonymize('Contact me at user@example.com');
+// → "Contact me at <PII type="EMAIL" id="1"/>"
-```typescript
-import {
-  InMemoryKeyProvider,    // For development/testing
-  ConfigKeyProvider,      // For production with pre-configured key
-  KeyProvider,            // Interface for custom implementations
-  generateKey,
-} from 'rehydra';
+// Message 2 — same email gets the same ID
+await session.anonymize('CC: user@example.com and admin@example.com');
+// → "CC: <PII type="EMAIL" id="1"/> and <PII type="EMAIL" id="2"/>"
-// Development: In-memory key (generates random key, lost on page refresh)
-const devKeyProvider = new InMemoryKeyProvider();
-// Production: Pre-configured key
-// Generate key: openssl rand -base64 32
-const keyBase64 = process.env.PII_ENCRYPTION_KEY;  // or read from config
-const prodKeyProvider = new ConfigKeyProvider(keyBase64);
-// Custom: Implement KeyProvider interface
-class SecureKeyProvider implements KeyProvider {
-  async getKey(): Promise<Uint8Array> {
-    // Retrieve from secure storage, HSM, keychain, etc.
-    return await getKeyFromSecureStorage();
-  }
-}
+// Rehydrate any message — auto-loads the PII map from storage
+const original = await session.rehydrate(translatedText);
 ```
-### Security Best Practices
-- **Never log the raw PII map** - Always use encrypted storage
-- **Persist the encryption key securely** - Use platform keystores (iOS Keychain, Android Keystore, etc.)
-- **Rotate keys** - Implement key rotation for long-running applications
-- **Enable leak scanning** - Catch any missed PII in output
-## PII Map Storage
-For applications that need to persist encrypted PII maps (e.g., chat applications where you need to rehydrate later), use sessions with built-in storage providers.
 ### Storage Providers
-| Provider | Environment | Persistence | Use Case |
-|----------|-------------|-------------|----------|
-| `InMemoryPIIStorageProvider` | All | None (lost on restart) | Development, testing |
-| `SQLitePIIStorageProvider` | Node.js, Bun only* | File-based | Server-side applications |
-| `IndexedDBPIIStorageProvider` | Browser | Browser storage | Client-side applications |
+| Provider | Environment | Persistence |
+|----------|-------------|-------------|
+| `InMemoryPIIStorageProvider` | All | None (lost on restart) |
+| `SQLitePIIStorageProvider` | Node.js, Bun | File-based (`better-sqlite3` on Node, `bun:sqlite` on Bun) |
+| `IndexedDBPIIStorageProvider` | Browser | Browser storage |
+## Supported PII Types
-*\*Not available in browser builds. Use `IndexedDBPIIStorageProvider` for browser applications.*
+| Type | Detection | Notes |
+|------|-----------|-------|
+| `PERSON` | NER | Names, with optional `gender` attribute |
+| `ORG` | NER | Organization names |
+| `LOCATION` | NER | Places, with optional `scope` attribute (city/country/region) |
+| `ADDRESS` | NER | Physical addresses |
+| `DATE_OF_BIRTH` | NER | Dates of birth |
+| `EMAIL` | Regex | Email addresses |
+| `PHONE` | Regex | International phone numbers |
+| `IBAN` | Regex + checksum | International Bank Account Numbers |
+| `BIC_SWIFT` | Regex | Bank Identifier Codes |
+| `CREDIT_CARD` | Regex + Luhn | Credit card numbers |
+| `IP_ADDRESS` | Regex | IPv4 and IPv6 |
+| `URL` | Regex | Web URLs |
+| `CASE_ID` | Regex | Configurable case/ticket patterns |
+| `CUSTOMER_ID` | Regex | Configurable customer ID patterns |
-### Important: Storage Only Works with Sessions
+## Configuration
-> **Note:** The `piiStorageProvider` is only used when you call `anonymizer.session()`.
-> Calling `anonymizer.anonymize()` directly does NOT save to storage - the encrypted PII map
-> is only returned in the result for you to handle manually.
+### NER Modes
-```typescript
-// ❌ Storage NOT used - you must handle the PII map yourself
-const result = await anonymizer.anonymize('Hello John!');
-// result.piiMap is returned but NOT saved to storage
-// ✅ Storage IS used - auto-saves and auto-loads
-const session = anonymizer.session('conversation-123');
-const result = await session.anonymize('Hello John!');
-// result.piiMap is automatically saved to storage
-```
+| Mode | Size | Description |
+|------|------|-------------|
+| `'disabled'` | 0 | Regex only — no model download |
+| `'quantized'` | ~280 MB | Recommended — good accuracy, smaller download |
+| `'standard'` | ~1.1 GB | Best accuracy |
+| `'custom'` | Varies | Bring your own ONNX model |
-### Example: Without Storage (Simple One-Off Usage)
+### Semantic Enrichment
-For simple use cases where you don't need persistence:
+Adds gender/scope attributes for better machine translation:
 ```typescript
-import { createAnonymizer, decryptPIIMap, rehydrate, InMemoryKeyProvider } from 'rehydra';
-const keyProvider = new InMemoryKeyProvider();
 const anonymizer = createAnonymizer({
   ner: { mode: 'quantized' },
-  keyProvider,
+  semantic: { enabled: true },  // Downloads ~12 MB of name/location data
 });
-await anonymizer.initialize();
-// Anonymize
-const result = await anonymizer.anonymize('Hello John Smith!');
-// Translate (or other processing)
-const translated = await translateAPI(result.anonymizedText);
-// Rehydrate manually using the returned PII map
-const key = await keyProvider.getKey();
-const piiMap = await decryptPIIMap(result.piiMap, key);
-const original = rehydrate(translated, piiMap);
+// "Hello <PII type="PERSON" gender="female" id="1"/> from <PII type="LOCATION" scope="city" id="2"/>!"
 ```
-### Example: With Storage (Persistent Sessions)
-For applications that need to persist PII maps across requests/restarts:
+### Anonymization Policy
 ```typescript
-import {
-  createAnonymizer,
-  InMemoryKeyProvider,
-  SQLitePIIStorageProvider,
-} from 'rehydra';
-// 1. Setup storage (once at app start)
-const storage = new SQLitePIIStorageProvider('./pii-maps.db');
-await storage.initialize();
-// 2. Create anonymizer with storage and key provider
 const anonymizer = createAnonymizer({
   ner: { mode: 'quantized' },
-  keyProvider: new InMemoryKeyProvider(),
-  piiStorageProvider: storage,
+  defaultPolicy: {
+    enabledTypes: new Set([PIIType.EMAIL, PIIType.PHONE, PIIType.PERSON]),
+    confidenceThresholds: new Map([[PIIType.PERSON, 0.8]]),
+    allowlistTerms: new Set(['Customer Service']),
+    enableLeakScan: true,
+  },
 });
-await anonymizer.initialize();
-// 3. Create a session for each conversation
-const session = anonymizer.session('conversation-123');
-// 4. Anonymize - auto-saves to storage
-const result = await session.anonymize('Hello John Smith from Acme Corp!');
-console.log(result.anonymizedText);
-// "Hello <PII type="PERSON" id="1"/> from <PII type="ORG" id="1"/>!"
-// 5. Later (even after app restart): rehydrate - auto-loads and decrypts
-const translated = await translateAPI(result.anonymizedText);
-const original = await session.rehydrate(translated);
-console.log(original);
-// "Hello John Smith from Acme Corp!"
-// 6. Optional: check existence or delete
-await session.exists();  // true
-await session.delete();  // removes from storage
 ```
-### Example: Multiple Conversations
-Each session ID maps to a separate stored PII map:
+### Anonymization Modes
 ```typescript
-// Different chat sessions
-const chat1 = anonymizer.session('user-alice-chat');
-const chat2 = anonymizer.session('user-bob-chat');
-await chat1.anonymize('Alice: Contact me at alice@example.com');
-await chat2.anonymize('Bob: My number is +49 30 123456');
+// Pseudonymize (default): reversible, returns encrypted PII map
+const anonymizer = createAnonymizer({ mode: 'pseudonymize' });
-// Each session has independent storage
-await chat1.rehydrate(translatedText1);  // Uses Alice's PII map
-await chat2.rehydrate(translatedText2);  // Uses Bob's PII map
+// Anonymize: irreversible, no PII map returned
+const anonymizer = createAnonymizer({ mode: 'anonymize' });
 ```
-### Multi-Message Conversations
-Within a session, entity IDs are consistent across multiple `anonymize()` calls:
-```typescript
-const session = anonymizer.session('chat-123');
-// Message 1: User provides contact info
-const msg1 = await session.anonymize('Contact me at user@example.com');
-// → "Contact me at <PII type="EMAIL" id="1"/>"
-// Message 2: References same email + new one
-const msg2 = await session.anonymize('CC: user@example.com and admin@example.com');
-// → "CC: <PII type="EMAIL" id="1"/> and <PII type="EMAIL" id="2"/>"
-//        ↑ Same ID (reused)                ↑ New ID
-// Message 3: No PII
-await session.anonymize('Please translate to German');
-// Previous PII preserved
-// All messages can be rehydrated correctly
-await session.rehydrate(msg1.anonymizedText); // ✓
-await session.rehydrate(msg2.anonymizedText); // ✓
-```
-This ensures that follow-up messages referencing the same PII produce consistent placeholders, and rehydration works correctly across the entire conversation.
-### SQLite Provider (Node.js + Bun only)
-The SQLite provider works on both Node.js and Bun with automatic runtime detection.
-> **Note:** `SQLitePIIStorageProvider` is **not available in browser builds**. When bundling for browser with Vite/webpack, use `IndexedDBPIIStorageProvider` instead. The browser-safe build automatically excludes SQLite to avoid bundling Node.js dependencies.
+### Custom Recognizers
 ```typescript
-// Node.js / Bun only
-import { SQLitePIIStorageProvider } from 'rehydra';
-// Or explicitly: import { SQLitePIIStorageProvider } from 'rehydra/storage/sqlite';
+import { createCustomIdRecognizer, PIIType } from 'rehydra';
-// File-based database
-const storage = new SQLitePIIStorageProvider('./data/pii-maps.db');
-await storage.initialize();
+const recognizer = createCustomIdRecognizer([{
+  name: 'Order Number',
+  pattern: /\bORD-[A-Z0-9]{8}\b/g,
+  type: PIIType.CASE_ID,
+}]);
-// Or in-memory for testing
-const testStorage = new SQLitePIIStorageProvider(':memory:');
-await testStorage.initialize();
+anonymizer.getRegistry().register(recognizer);
 ```
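As a rough illustration of what the `pattern` in the recognizer definition above matches, the sketch below applies the same regular expression to a plain string using only standard JavaScript. The helper `findOrderNumbers` and the sample text are hypothetical and are not part of rehydra; how the library applies the pattern internally is not shown in this diff.

```typescript
// Same regular expression as in the recognizer definition above.
// \b anchors on word boundaries; {8} requires exactly eight [A-Z0-9] characters.
const orderPattern = /\bORD-[A-Z0-9]{8}\b/g;

// Hypothetical helper (not a rehydra API): returns every order number found in the text.
function findOrderNumbers(text: string): string[] {
  // With the global flag, String.prototype.match returns all matches, or null if none.
  return text.match(orderPattern) ?? [];
}

// Only the first candidate has the "ORD-" prefix in upper case plus exactly eight characters.
const hits = findOrderNumbers('Refund ORD-12AB34CD, not ORD-123 or ord-12ab34cd.');
console.log(hits);
```

Testing a pattern this way before registering it helps catch over-matching, since anything the expression matches would presumably be treated as PII of the configured type.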
-**Dependencies:**
-- **Bun**: Uses built-in `bun:sqlite` (no additional install needed)
-- **Node.js**: Requires `better-sqlite3`:
+### GPU Acceleration
-```bash
-npm install better-sqlite3
-```
-### IndexedDB Provider (Browser)
+For high-throughput batch processing, use a remote inference server with a GPU:
 ```typescript
-import {
-  createAnonymizer,
-  InMemoryKeyProvider,
-  IndexedDBPIIStorageProvider,
-} from 'rehydra';
-// Custom database name (defaults to 'rehydra-pii-storage')
-const storage = new IndexedDBPIIStorageProvider('my-app-pii');
 const anonymizer = createAnonymizer({
-  ner: { mode: 'quantized' },
-  keyProvider: new InMemoryKeyProvider(),
-  piiStorageProvider: storage,
+  ner: {
+    backend: 'inference-server',
+    inferenceServerUrl: 'http://localhost:8080',
+  },
 });
-await anonymizer.initialize();
-// Use sessions as usual
-const session = anonymizer.session('browser-chat-123');
-const result = await session.anonymize('Hello John!');
-const original = await session.rehydrate(result.anonymizedText);
 ```
-### Session Interface
+## Encryption
-The session object provides these methods:
+PII maps are encrypted with **AES-256-GCM** via the Web Crypto API.
 ```typescript
-interface AnonymizerSession {
-  readonly sessionId: string;
-  anonymize(text: string, locale?: string, policy?: Partial<AnonymizationPolicy>): Promise<AnonymizationResult>;
-  rehydrate(text: string): Promise<string>;
-  load(): Promise<StoredPIIMap | null>;
-  delete(): Promise<boolean>;
-  exists(): Promise<boolean>;
-}
-```
-### Data Retention
-**Entries persist forever by default.** Use `cleanup()` on the storage provider to remove old entries:
-```typescript
-// Delete entries older than 7 days
-const count = await storage.cleanup(new Date(Date.now() - 7 * 24 * 60 * 60 * 1000));
-// Or delete specific sessions
-await session.delete();
-// List all stored sessions
-const sessionIds = await storage.list();
-```
-## Browser Usage
-The library works seamlessly in browsers without any special configuration.
-### Browser Notes
-- **First-use downloads**: NER model (~280 MB) and semantic data (~12 MB) are downloaded on first use
-- **ONNX runtime**: Automatically loaded from CDN if not bundled
-- **Offline support**: After initial download, everything works offline
-- **Storage**: Uses IndexedDB and OPFS - data persists across sessions
-### Bundler Support (Vite, webpack, esbuild)
-The package uses [conditional exports](https://nodejs.org/api/packages.html#conditional-exports) to automatically provide a browser-safe build when bundling for the web. This means:
-- **Automatic**: Vite, webpack, esbuild, and other modern bundlers will automatically use `dist/browser.js`
-- **No Node.js modules**: The browser build excludes `SQLitePIIStorageProvider` and other Node.js-specific code
-- **Tree-shakable**: Only the code you use is included in your bundle
-```json
-// package.json exports (simplified)
-{
-  "exports": {
-    ".": {
-      "browser": "./dist/browser.js",
-      "node": "./dist/index.js",
-      "default": "./dist/index.js"
-    }
-  }
-}
-```
-## Bun Support
-This library works with [Bun](https://bun.sh). Since `onnxruntime-node` is a native Node.js addon, Bun uses `onnxruntime-web`:
+// Development: random key, lost on restart
+const keyProvider = new InMemoryKeyProvider();
-```bash
-bun add rehydra onnxruntime-web
+// Production: persistent key (generate with: openssl rand -base64 32)
+const keyProvider = new ConfigKeyProvider(process.env.PII_ENCRYPTION_KEY!);
 ```
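AES-256 uses a 256-bit (32-byte) key, which is why the comment above generates the key with `openssl rand -base64 32`. A minimal sketch of a startup check, assuming the key is passed around as that base64 string, might look like the following. `isValidAes256Key` is a hypothetical helper, not a rehydra API, and whether `ConfigKeyProvider` performs its own validation is not shown in this diff.

```typescript
import { Buffer } from 'node:buffer';

// AES-256 keys are exactly 256 bits, i.e. 32 bytes.
const AES_256_KEY_BYTES = 32;

// Hypothetical helper (not part of rehydra): decodes a base64 string
// and checks that it yields exactly 32 bytes of key material.
function isValidAes256Key(base64Key: string): boolean {
  return Buffer.from(base64Key, 'base64').length === AES_256_KEY_BYTES;
}

// Stand-in for a real key: 32 fixed bytes, base64-encoded. Never use a fixed key in production.
const sampleKey = Buffer.alloc(AES_256_KEY_BYTES, 7).toString('base64');

const okKey = isValidAes256Key(sampleKey);     // 32 bytes decode, so the check passes
const shortKey = isValidAes256Key('c2hvcnQ='); // decodes to the 5 bytes of "short", so it fails
console.log(okKey, shortKey);
```

Running a check like this before constructing the key provider surfaces a truncated or missing `PII_ENCRYPTION_KEY` at startup rather than at the first encryption call.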
-Usage is identical - the library auto-detects the runtime.
-## Performance
-Benchmarks on Apple M-series (CPU) and NVIDIA T4 (GPU). Run `npm run benchmark:compare` to measure on your hardware.
-### Backend Comparison
-| Backend | Short (~40 chars) | Medium (~500 chars) | Long (~2K chars) | Entity-dense |
-|---------|-------------------|---------------------|------------------|--------------|
-| **Regex-only** | 0.38 ms | 0.50 ms | 0.91 ms | 0.35 ms |
-| **NER CPU** | 4.3 ms | 26 ms | 93 ms | 13 ms |
-| **NER GPU** | 62 ms | 73 ms | 117 ms | 68 ms |
-Local CPU inference is faster than GPU for typical workloads due to network overhead. GPU servers are beneficial for high-throughput batch processing where many requests can be parallelized.
-### Throughput (ops/sec)
-| Backend | Short | Medium | Long |
-|---------|-------|--------|------|
-| **Regex-only** | ~2,640 | ~2,017 | ~1,096 |
-| **NER CPU** | ~234 | ~38 | ~11 |
-| **NER GPU** | ~16 | ~14 | ~9 |
-### Model Downloads
-| Model | Size | First-Use Download |
-|-------|------|-------------------|
-| Quantized NER | ~265 MB | ~30s on fast connection |
-| Standard NER | ~1.1 GB | ~2min on fast connection |
-| Semantic Data | ~12 MB | ~5s on fast connection |
-### Recommendations
-| Use Case | Recommended Backend |
-|----------|---------------------|
-| Structured PII only (email, phone, IBAN) | Regex-only |
-| General use with name/org/location detection | **NER CPU (default)** |
-| High-throughput batch processing (1000s of docs) | NER GPU |
-| Privacy-sensitive / zero-knowledge required | NER CPU (data never leaves device) |
-> **Note:** Local CPU inference now outperforms GPU for most use cases due to network overhead elimination. The trie-based tokenizer provides O(token_length) lookups instead of O(vocab_size), making local inference practical for production use.
-## Requirements
+## Platform Support
 | Environment | Version | Notes |
 |-------------|---------|-------|
 | Node.js | >= 18.0.0 | Uses native `onnxruntime-node` |
-| Bun | >= 1.0.0 | Requires `onnxruntime-web` |
-| Browsers | Chrome 86+, Firefox 89+, Safari 15.4+, Edge 86+ | Uses OPFS for model storage |
+| Bun | >= 1.0.0 | Install `onnxruntime-web`: `bun add rehydra onnxruntime-web` |
+| Browsers | Chrome 86+, Firefox 89+, Safari 15.4+ | Uses OPFS for model storage |
-## Development
+The browser build (`rehydra/browser`) automatically excludes Node.js dependencies. Modern bundlers (Vite, webpack, esbuild) select the right entry point via conditional exports.
-```bash
-# Install dependencies
-npm install
-# Run tests
-npm test
-# Build
-npm run build
-# Lint
-npm run lint
-```
-### Building Custom Models
-For development or custom models:
+## Development
 ```bash
-# Requires Python 3.8+
-npm run setup:ner              # Standard model
-npm run setup:ner:quantized    # Quantized model
+npm install              # Install dependencies
+npm run build            # Compile TypeScript
+npm test                 # Run tests (watch mode)
+npm run test:run         # Run tests once
+npm run lint             # ESLint
+npm run setup:ner        # Pre-download NER model (~280 MB)
+npm run benchmark        # Run benchmarks
+# Integration tests (the proxy tests require API keys)
+npm run test:streaming                                      # No API key needed
+OPENAI_API_KEY=... npm run test:proxy:openai -- --ner       # OpenAI with NER
+ANTHROPIC_API_KEY=... npm run test:proxy:anthropic -- --ner # Anthropic with NER
 ```
 ## License