npm - ai-sdk-guardrails - Versions diffs - 4.0.0 → 5.0.1 - Mend

ai-sdk-guardrails 4.0.0 → 5.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2) hide show

package/README.md +472 -735
package/package.json +27 -23

package/README.md CHANGED Viewed

@@ -1,895 +1,632 @@
 # AI SDK Guardrails
-Middleware for the Vercel AI SDK that adds safety, quality control, and cost management to your AI applications by intercepting prompts and responses.
+**Safety and quality controls for Vercel AI SDK**
-Block harmful inputs, filter low-quality outputs, and gain observability, all in just a few lines of code.
+Add guardrails to your AI applications in one line of code. Block PII, prevent prompt injection, enforce output quality - while keeping your existing telemetry and observability stack intact.
-![Guardrails Demo](./media/guardrail-example.gif)
-## ⚡ TL;DR
-Quickly add input and output validation to any AI SDK-compatible model.
+[![npm version](https://img.shields.io/npm/v/ai-sdk-guardrails.svg?logo=npm&label=npm)](https://www.npmjs.com/package/ai-sdk-guardrails)
+[![downloads](https://img.shields.io/npm/dw/ai-sdk-guardrails.svg?label=downloads)](https://www.npmjs.com/package/ai-sdk-guardrails)
+[![bundle size](https://img.shields.io/bundlephobia/minzip/ai-sdk-guardrails.svg?label=minzipped)](https://bundlephobia.com/package/ai-sdk-guardrails)
+[![license](https://img.shields.io/npm/l/ai-sdk-guardrails.svg?label=license)](./LICENSE)
+![types](https://img.shields.io/badge/TypeScript-Ready-3178C6?logo=typescript&logoColor=white)
-```typescript
-import { openai } from '@ai-sdk/openai';
-import { generateText } from 'ai';
-import {
-  wrapWithGuardrails,
-  defineInputGuardrail,
-  defineOutputGuardrail,
-} from 'ai-sdk-guardrails';
+![Guardrails Demo](./media/guardrail-example.gif)
-// 1. Define your guardrails
-const inputGuard = defineInputGuardrail({
-  name: 'length-check',
-  execute: async ({ prompt }) =>
-    prompt.length > 100
-      ? { tripwireTriggered: true, message: 'Input too long' }
-      : { tripwireTriggered: false },
-});
+## Drop-in Guardrails for any AI model
-const outputGuard = defineOutputGuardrail({
-  name: 'quality-check',
-  execute: async ({ result }) =>
-    result.text.length < 10
-      ? { tripwireTriggered: true, message: 'Response too short' }
-      : { tripwireTriggered: false },
-});
+```ts
+import { withGuardrails, piiDetector } from 'ai-sdk-guardrails';
+const model = openai('gpt-4o'); // or any other AI model
-// 2. Wrap your model
-const guardedModel = wrapWithGuardrails(openai('gpt-4o'), {
-  inputGuardrails: [inputGuard],
-  outputGuardrails: [outputGuard],
+// Everything else stays the same
+const safeModel = withGuardrails(model, {
+  inputGuardrails: [piiDetector()],
 });
-// 3. Use it! Guardrails will run automatically.
-const { text } = await generateText({
-  model: guardedModel,
-  prompt: 'A prompt that is definitely not too long.',
-});
+// Your existing code, telemetry, and logging still works
+await generateText({ model: safeModel, prompt: '...' });
 ```
-## How It Works
+**That's it.** Your AI now blocks PII automatically.
-### Without Guardrails (Inefficient, Poor Quality)
+## Installation
-```mermaid
-flowchart LR
-    A[User Input<br/>'hello'] --> B[AI Model] --> C[Response<br/>⚠️ Wastes resources<br/>😞 Often useless]
+```bash
+npm install ai-sdk-guardrails
 ```
-### With Input Guardrails (Save Resources)
+## Why Guardrails Matter
-```mermaid
-flowchart LR
-    A[User Input<br/>'hello'] --> B[Input Guardrails] --> C[❌ STOPPED<br/>✅ No API call made]
-```
+Real problems that guardrails solve:
-### With Output Guardrails (Ensure Quality)
+❌ **Without guardrails:**
-```mermaid
-flowchart LR
-    A[AI Response<br/>'Here's my SSN: 123-45-6789'] --> B[Output Guardrails] --> C[❌ BLOCKED<br/>🛡️ Privacy protected]
+```ts
+// User: "My email is john@company.com, help me..."
+// → Sends PII to model → Compliance violation → $$$
 ```
-### Complete Protection
+✅ **With guardrails:**
-```mermaid
-flowchart LR
-    A[User Input] --> B[Input Guardrails] --> C[AI Model] --> D[Output Guardrails] --> E[Clean Response]
+```ts
+const model = withGuardrails(baseModel, {
+  inputGuardrails: [piiDetector()], // Blocks before API call
+});
+// → Request blocked → No PII leak → No cost → Compliant
 ```
-That's it! Input guardrails optimize resource usage by stopping inefficient requests. Output guardrails ensure quality by filtering responses.
+Common use cases:
-## 📦 Installation
+- 🛡️ **Compliance**: Block PII before it reaches your model
+- 💰 **Cost control**: Stop bad requests before they cost money
+- 🔒 **Security**: Prevent prompt injection and data exfiltration
+- ✅ **Quality**: Enforce minimum response standards
+- 🔧 **Production**: Works with your existing observability tools
-```bash
-npm install ai-sdk-guardrails
+## Copy-Paste Examples
-# or
+### Basic Protection (Most Common)
-yarn add ai-sdk-guardrails
+```ts
+import { generateText } from 'ai';
+import { openai } from '@ai-sdk/openai';
+import {
+  withGuardrails,
+  piiDetector,
+  promptInjectionDetector,
+} from 'ai-sdk-guardrails';
-# or
+const model = withGuardrails(openai('gpt-4o'), {
+  inputGuardrails: [piiDetector(), promptInjectionDetector()],
+});
-pnpm add ai-sdk-guardrails
+// Use exactly like before - nothing else changes
+const { text } = await generateText({
+  model,
+  prompt: 'Write a friendly email',
+});
 ```
-## 🔄 Migration Guide
+### Input + Output Protection
-For breaking changes from v3 to v4 (including the new analytics-rich callbacks), see [v3-v4-MIGRATION.md](./v3-v4-MIGRATION.md).
+```ts
+import {
+  withGuardrails,
+  piiDetector,
+  sensitiveDataFilter,
+  minLengthRequirement,
+} from 'ai-sdk-guardrails';
-## 🚀 Quick Start
+const model = withGuardrails(openai('gpt-4o'), {
+  inputGuardrails: [piiDetector()], // Block PII in prompts
+  outputGuardrails: [
+    sensitiveDataFilter(), // Remove secrets from responses
+    minLengthRequirement(100), // Enforce quality standards
+  ],
+});
+```
-Add smart validation to your AI applications in just 3 steps:
+### Works With Streaming
-### 1. Prevent Unnecessary AI Calls
+```ts
+import { streamText } from 'ai';
-```typescript
-import { generateText } from 'ai';
-import { openai } from '@ai-sdk/openai';
-import {
-  wrapWithInputGuardrails,
-  defineInputGuardrail,
-} from 'ai-sdk-guardrails';
-import { extractTextContent } from 'ai-sdk-guardrails/guardrails/input';
-// Block inefficient requests before calling the AI model
-const lengthGuard = defineInputGuardrail({
-  name: 'blocked-keywords',
-  execute: async (context) => {
-    const { prompt } = extractTextContent(context);
-    const blockedWords = ['spam', 'test', 'hello'];
-    const foundWord = blockedWords.find((word) =>
-      prompt.toLowerCase().includes(word.toLowerCase()),
-    );
-    if (foundWord) {
-      return {
-        tripwireTriggered: true,
-        message: `Blocked keyword detected: ${foundWord}`,
-        severity: 'medium',
-      };
-    }
-    return { tripwireTriggered: false };
-  },
+const model = withGuardrails(openai('gpt-4o'), {
+  outputGuardrails: [minLengthRequirement(100)],
 });
-const optimizedModel = wrapWithInputGuardrails(openai('gpt-4'), {
-  inputGuardrails: [lengthGuard],
+// Streaming just works - guardrails run after stream completes
+const { textStream } = await streamText({ model, prompt: '...' });
+for await (const chunk of textStream) {
+  process.stdout.write(chunk);
+}
+```
+### Production Setup (With Error Handling)
+```ts
+import { isGuardrailsError } from 'ai-sdk-guardrails';
+const model = withGuardrails(openai('gpt-4o'), {
+  inputGuardrails: [piiDetector(), promptInjectionDetector()],
+  outputGuardrails: [sensitiveDataFilter()],
+  throwOnBlocked: true, // Throw errors instead of silent blocking
 });
-// This would normally waste an API call for a useless response
 try {
-  const result = await generateText({
-    model: optimizedModel,
-    prompt: 'hello', // ❌ Blocked - prevents unnecessary API call
-  });
+  const { text } = await generateText({ model, prompt: '...' });
+  console.log(text);
 } catch (error) {
-  console.log('Blocked request, saved money!');
+  if (isGuardrailsError(error)) {
+    console.error('Blocked by guardrail:', error.message);
+    // Show user-friendly message
+  }
 }
-// This generates valuable content
-const goodResult = await generateText({
-  model: optimizedModel,
-  prompt: 'Write a product description for our new software', // ✅ This creates value
-});
 ```
-### 2. Ensure Quality Output
+## How It Works
-```typescript
-import {
-  wrapWithOutputGuardrails,
-  defineOutputGuardrail,
-} from 'ai-sdk-guardrails';
-import { extractContent } from 'ai-sdk-guardrails/guardrails/output';
+Guardrails run **in parallel** with your AI calls as middleware:
-const qualityGuard = defineOutputGuardrail({
-  name: 'sensitive-info-detector',
-  execute: async (context) => {
-    const { text } = extractContent(context.result);
-    // Simple sensitive info patterns
-    const sensitivePatterns = [
-      /\b\d{3}-\d{2}-\d{4}\b/, // SSN
-      /\b[\w\.-]+@[\w\.-]+\.\w+\b/, // Email
-      /\b\d{3}-\d{3}-\d{4}\b/, // Phone
-    ];
-    const foundPattern = sensitivePatterns.find((pattern) =>
-      pattern.test(text),
-    );
-    if (foundPattern) {
-      return {
-        tripwireTriggered: true,
-        message: 'Sensitive information detected in response',
-        severity: 'high',
-      };
-    }
-    return { tripwireTriggered: false };
-  },
-});
+```mermaid
+flowchart LR
+  A[Input] --> B[Input Guardrails]
+  B -->|✅ Clean| C[AI Model]
+  B -->|❌ Blocked| X[No API Call]
+  C --> D[Output Guardrails]
+  D -->|✅ Clean| E[Response]
+  D -->|❌ Blocked| R[Retry/Replace/Block]
+```
-const qualityModel = wrapWithOutputGuardrails(openai('gpt-4'), {
-  outputGuardrails: [qualityGuard],
-  onOutputBlocked: (executionSummary) => {
-    console.log(
-      'Prevented sensitive data leak:',
-      executionSummary.blockedResults[0]?.message,
-    );
-    // Access comprehensive analytics (New in v4.0.0)
-    console.log(
-      `Blocked ${executionSummary.stats.blocked} of ${executionSummary.guardrailsExecuted} guardrails`,
-    );
-  },
-});
+**Three-step workflow:**
-const result = await generateText({
-  model: qualityModel,
-  prompt: 'Create a user profile example',
-});
-// Automatically blocks responses containing emails, phone numbers, or SSNs
-```
+1. **Receive**: Input or output arrives
+2. **Check**: Guardrails run (PII detection, validation, etc.)
+3. **Decide**: Pass through, block, or retry
-### 3. Custom Business Logic
+**Key benefit**: Non-invasive. Your existing telemetry, logging, and observability tools keep working because guardrails are just middleware.
-```typescript
-const businessHoursGuard = defineInputGuardrail({
-  name: 'business-hours-only',
-  execute: async () => {
-    const hour = new Date().getUTCHours();
-    // Only allow requests between 9 AM and 5 PM UTC
-    if (hour < 9 || hour > 17) {
-      return {
-        tripwireTriggered: true,
-        message:
-          'Requests are only permitted during business hours (9:00-17:00 UTC).',
-        severity: 'low',
-      };
-    }
-    return { tripwireTriggered: false };
-  },
-});
+## Built-in Guardrails
-const smartEducationModel = wrapWithInputGuardrails(openai('gpt-4'), {
-  inputGuardrails: [businessHoursGuard],
-});
-```
+### Input Guardrails (Run Before Model)
-### 4. Type-Safe Metadata (TypeScript)
+| Guardrail                   | Purpose                          | Example             |
+| --------------------------- | -------------------------------- | ------------------- |
+| `piiDetector()`             | Block emails, phones, SSNs       | Compliance, privacy |
+| `promptInjectionDetector()` | Detect injection attempts        | Security            |
+| `blockedKeywords()`         | Block specific terms             | Content policy      |
+| `inputLengthLimit()`        | Enforce max input length         | Cost control        |
+| `rateLimiting()`            | Per-user rate limits             | Abuse prevention    |
+| `profanityFilter()`         | Block offensive language         | Content moderation  |
+| `toxicityDetector()`        | Detect toxic content             | Safety              |
+| `allowedToolsGuardrail()`   | Restrict which tools can be used | Tool security       |
-The library automatically infers metadata types from your guardrail definitions - no manual type annotations needed!
+### Output Guardrails (Run After Model)
-```typescript
-// Define metadata interface for your guardrail
-interface PIIMetadata extends Record<string, unknown> {
-  detectedTypes: Array<{ type: string; description: string }>;
-  count: number;
-}
+| Guardrail                 | Purpose                     | Example                   |
+| ------------------------- | --------------------------- | ------------------------- |
+| `sensitiveDataFilter()`   | Remove secrets, API keys    | Security                  |
+| `minLengthRequirement()`  | Enforce minimum length      | Quality control           |
+| `outputLengthLimit()`     | Enforce maximum length      | Cost/UX control           |
+| `toxicityFilter()`        | Block toxic responses       | Safety                    |
+| `jsonValidation()`        | Validate JSON structure     | Structured output         |
+| `schemaValidation()`      | Validate against Zod schema | Type safety               |
+| `confidenceThreshold()`   | Require minimum confidence  | Quality                   |
+| `hallucinationDetector()` | Detect uncertain claims     | Accuracy                  |
+| `secretRedaction()`       | Redact secrets from output  | Security                  |
+| `mcpSecurityGuardrail()`  | MCP tool security           | Prevent data exfiltration |
-// Create guardrail with typed metadata
-const piiDetectionGuardrail = defineInputGuardrail({
-  name: 'pii-detection',
-  execute: async (context) => {
-    const { prompt } = extractTextContent(context);
-    const patterns = [
-      {
-        name: 'SSN',
-        regex: /\b\d{3}-\d{2}-\d{4}\b/,
-        description: 'Social Security Number',
-      },
-      {
-        name: 'Email',
-        regex: /\b[\w\.-]+@[\w\.-]+\.\w+\b/,
-        description: 'Email address',
-      },
-    ];
-    const detected = patterns.filter((p) => p.regex.test(prompt));
-    if (detected.length > 0) {
-      // TypeScript knows this metadata matches PIIMetadata
-      const metadata: PIIMetadata = {
-        detectedTypes: detected.map((p) => ({
-          type: p.name,
-          description: p.description,
-        })),
-        count: detected.length,
-      };
-      return {
-        tripwireTriggered: true,
-        message: `PII detected: ${detected.map((p) => p.name).join(', ')}`,
-        severity: 'high',
-        metadata, // Type is automatically inferred!
-      };
-    }
-    return { tripwireTriggered: false };
-  },
-});
+### MCP Security Guardrails
-// Use the guardrail - types flow through automatically!
-const protectedModel = wrapWithInputGuardrails(model, [piiDetectionGuardrail], {
-  onInputBlocked: (summary) => {
-    // TypeScript knows the metadata type - no casting needed!
-    const metadata = summary.blockedResults[0]?.metadata;
-    if (metadata?.detectedTypes) {
-      // Full type safety and autocomplete for metadata.detectedTypes
-      for (const type of metadata.detectedTypes) {
-        console.log(`Detected: ${type.type} - ${type.description}`);
-      }
-    }
-  },
+Protect against prompt injection and data exfiltration when using Model Context Protocol (MCP) tools:
+```ts
+import { mcpSecurityGuardrail, mcpResponseSanitizer } from 'ai-sdk-guardrails';
+const model = withGuardrails(openai('gpt-4o'), {
+  outputGuardrails: [
+    mcpSecurityGuardrail({
+      detectExfiltration: true, // Detect data exfiltration attempts
+      scanEncodedContent: true, // Scan base64/hex encoded content
+      allowedDomains: ['api.company.com'], // Domain allowlist
+      maxContentSize: 51200, // 50KB limit
+      injectionThreshold: 0.7, // Sensitivity (lower = stricter)
+    }),
+    mcpResponseSanitizer(), // Clean malicious content vs blocking
+  ],
 });
 ```
-**That's it!** Your AI application now optimizes resource usage, ensures quality, prevents inappropriate responses, and provides full type safety automatically.
-## ✨ Features
-- 🛡️ **Input & Output Guardrails**: Enforce custom safety, compliance, and quality policies on both prompts and LLM responses.
-- 💰 **Cost Control**: Block invalid or wasteful prompts before they are sent to your LLM provider, saving you money.
-- 🎯 **Quality Improvement**: Automatically filter, flag, or retry low-quality or irrelevant model outputs.
-- 🔒 **Security Protection**: Built-in defenses against prompt injection, jailbreak attempts, PII leakage, secret exposure, and tool call validation.
-- 🏛️ **Compliance & Governance**: Enforce regulatory guidelines and business rules for enterprise applications with jurisdiction-specific compliance.
-- 🔄 **Streaming Support**: Works seamlessly with both streaming (streamText) and standard (generateText) API responses with real-time content monitoring.
-- 📊 **Observability Hooks**: Built-in callbacks (onInputBlocked, onOutputBlocked, etc.) for logging and monitoring with comprehensive execution analytics.
-- ⚙️ **Configurable Execution**: Run guardrails in parallel or sequentially and set custom timeouts.
-- 🚀 **AI SDK Native**: Designed from the ground up to integrate cleanly with AI SDK middleware patterns.
-- 🧠 **AI-Powered Verification**: LLM-as-judge capabilities for hallucination detection and quality assessment.
-- 🌍 **Global Compliance**: Support for multiple jurisdictions (US, EU, UK, CA, AU, JP, CN, IN) with region-specific policies.
-- 📝 **Content Protection**: Copyright and IP protection with originality scoring and verbatim passage detection.
-- 🔐 **Data Integrity**: Comprehensive table validation, SQL code safety, and schema enforcement.
-- 🌐 **Network Security**: Domain allowlisting, URL sanitization, and external access controls.
-- 🔒 **Privacy & Memory**: PII redaction, memory minimization, and secure logging practices.
-- 🛡️ **Safety & Escalation**: Toxicity de-escalation, human review workflows, and streaming early termination.
-## 📚 API Overview
-| Function                     | Description                                                                   |
-| ---------------------------- | ----------------------------------------------------------------------------- |
-| `defineInputGuardrail()`     | Creates a guardrail to validate, inspect, or block prompts.                   |
-| `defineOutputGuardrail()`    | Creates a guardrail to validate, filter, or re-route LLM outputs.             |
-| `wrapWithGuardrails()`       | ⭐ **Recommended** - The easiest way to add both input and output guardrails. |
-| `wrapWithInputGuardrails()`  | Attaches input-only guardrails to a model.                                    |
-| `wrapWithOutputGuardrails()` | Attaches output-only guardrails to a model.                                   |
-| `isGuardrailsError()`, etc.  | Error handling utilities and structured error types.                          |
-## 🧠 Design Philosophy
-- ✅ **Helper-First**: Simple, chainable utility functions provide a great developer experience for fast adoption.
-- 🧩 **Composable**: Multiple guardrails can be chained together and will run in your specified order (or in parallel).
-- 🧾 **Type-Safe**: Full TypeScript support with automatic type inference for guardrail metadata - no manual type annotations needed!
-- 🧪 **Sensible Defaults**: Get started quickly with zero-config default behaviors that can be easily overridden.
-## Architecture Overview
-The library leverages the Vercel AI SDK's middleware architecture to provide composable guardrails that integrate seamlessly with your existing AI applications:
+**Attack vectors prevented:**
-```mermaid
-graph TB
-    subgraph "Your Application"
-        App[Your App Code]
-        Config[Guardrail Configuration]
-    end
-    subgraph "AI SDK Guardrails Middleware"
-        InputMW[Input Guardrails Middleware]
-        OutputMW[Output Guardrails Middleware]
-        subgraph "Input Guardrails Layer"
-            Length[Length Validation]
-            Spam[Spam Detection]
-            PII[PII Detection]
-            Business[Business Rules]
-            Custom1[Custom Guards]
-        end
-        subgraph "Output Guardrails Layer"
-            Quality[Quality Assurance]
-            Sensitive[Sensitive Info Filter]
-            Professional[Professional Tone]
-            Factual[Factual Validation]
-            Custom2[Custom Guards]
-        end
-    end
-    subgraph "AI SDK Core"
-        Wrapper[wrapLanguageModel]
-        Generator[generateText/Object/Stream]
-    end
-    subgraph "External Services"
-        AI[AI Model Provider]
-        Log[Logging & Telemetry]
-    end
-    App --> Config
-    Config --> InputMW
-    InputMW --> Length
-    InputMW --> Spam
-    InputMW --> PII
-    InputMW --> Business
-    InputMW --> Custom1
-    InputMW -->|Valid Request| Wrapper
-    InputMW -->|Blocked Request| Log
-    Wrapper --> Generator
-    Generator --> AI
-    AI --> OutputMW
-    OutputMW --> Quality
-    OutputMW --> Sensitive
-    OutputMW --> Professional
-    OutputMW --> Factual
-    OutputMW --> Custom2
-    OutputMW -->|Clean Response| App
-    OutputMW -->|Quality Issues| Log
-    style InputMW fill:#e1f5fe
-    style OutputMW fill:#f3e5f5
-    style AI fill:#fff3e0
-    style App fill:#e8f5e8
-```
+- ✅ Direct prompt injection
+- ✅ Tool response poisoning
+- ✅ Data exfiltration via URLs
+- ✅ Encoded attacks (base64/hex)
+- ✅ Cascading exploits
+- ✅ Context poisoning
-## 🍳 Recipes & Use Cases
+See [MCP Security documentation](#mcp-security-guardrails-advanced) for full details.
-Guardrails can enforce any custom logic. Here are a few common patterns.
+## Advanced Features
-### Rate Limiting
+### Custom Guardrails
-Pass a userId in the metadata of your generateText call to enforce per-user rate limits.
+Create domain-specific guardrails:
-```typescript
-const rateLimitGuard = defineInputGuardrail({
-  name: 'user-rate-limit',
-  execute: async ({ metadata }) => {
-    const userId = metadata?.userId ?? 'anonymous';
-    const allowed = await checkRateLimit(userId); // Your rate-limiting logic
+```ts
+import { defineInputGuardrail, defineOutputGuardrail } from 'ai-sdk-guardrails';
+import { extractContent } from 'ai-sdk-guardrails/guardrails/output';
-    return allowed
+// Custom input guardrail
+const businessHours = defineInputGuardrail({
+  name: 'business-hours',
+  execute: async () => {
+    const hour = new Date().getHours();
+    return hour >= 9 && hour <= 17
       ? { tripwireTriggered: false }
-      : {
-          tripwireTriggered: true,
-          message: `Rate limit exceeded for user: ${userId}`,
-        };
+      : { tripwireTriggered: true, message: 'Outside business hours' };
   },
 });
-```
-### LLM-as-Judge for Quality Scoring
-Use a cheaper, faster model to "judge" the output of a more powerful one.
-```typescript
-const qualityJudge = defineOutputGuardrail({
-  name: 'llm-quality-judge',
+// Custom output guardrail
+const minQuality = defineOutputGuardrail({
+  name: 'min-quality',
   execute: async ({ result }) => {
-    // Use a cheap model to score the primary model's output
-    const judgement = await generateText({
-      model: openai('gpt-3.5-turbo'),
-      prompt: `Is the following response helpful and safe? Answer YES or NO. \n\nResponse: "${result.text}"`,
-    });
-    const isSafe = judgement.text.includes('YES');
-    return isSafe
+    const { text } = extractContent(result);
+    return text.length >= 100
       ? { tripwireTriggered: false }
-      : {
-          tripwireTriggered: true,
-          message: `Output failed LLM-as-judge quality check.`,
-          metadata: { originalText: result.text },
-        };
+      : { tripwireTriggered: true, message: 'Response too short' };
   },
 });
-```
-### Advanced Input Validation
-```typescript
-import { extractTextContent } from 'ai-sdk-guardrails/guardrails/input';
-const comprehensiveInputGuard = defineInputGuardrail({
-  name: 'comprehensive-input-validation',
-  execute: async (context) => {
-    const { prompt } = extractTextContent(context);
-    // Length validation
-    if (prompt.length < 10) {
-      return {
-        tripwireTriggered: true,
-        message: 'Input too short - likely to produce low-value response',
-        severity: 'medium',
-        suggestion: 'Please provide more detailed input for better results',
-      };
-    }
-    if (prompt.length > 4000) {
-      return {
-        tripwireTriggered: true,
-        message: 'Input too long - may exceed token limits',
-        severity: 'high',
-        suggestion: 'Break your request into smaller, focused parts',
-      };
-    }
-    // Content quality checks
-    const spamPatterns = [
-      /^(.)\1{10,}$/, // Repeated characters
-      /^(test|hello|hi|hey)$/i, // Common spam words
-    ];
-    const foundSpam = spamPatterns.find((pattern) => pattern.test(prompt));
-    if (foundSpam) {
-      return {
-        tripwireTriggered: true,
-        message: 'Low-quality input detected',
-        severity: 'high',
-      };
-    }
-    return { tripwireTriggered: false };
-  },
+const model = withGuardrails(openai('gpt-4o'), {
+  inputGuardrails: [businessHours],
+  outputGuardrails: [minQuality],
 });
 ```
-### Professional Output Quality Control
+### Auto-Retry on Failures
-```typescript
-import { extractContent } from 'ai-sdk-guardrails/guardrails/output';
+Automatically retry when output doesn't meet requirements:
-const professionalQualityGuard = defineOutputGuardrail({
-  name: 'professional-quality-control',
-  execute: async (context) => {
-    const { text } = extractContent(context.result);
-    const qualityIssues = [];
-    // Check for unprofessional language
-    const unprofessionalTerms = ['lol', 'wtf', 'omg', 'ur', 'u r'];
-    const hasUnprofessional = unprofessionalTerms.some((term) =>
-      text.toLowerCase().includes(term),
-    );
-    if (hasUnprofessional) {
-      qualityIssues.push('Contains unprofessional language');
-    }
-    // Check for placeholder text
-    const placeholders = ['[insert', '[add', '[your', 'TODO:', 'FIXME:'];
-    const hasPlaceholders = placeholders.some((placeholder) =>
-      text.includes(placeholder),
-    );
-    if (hasPlaceholders) {
-      qualityIssues.push('Contains placeholder text - incomplete response');
-    }
-    // Check for excessive repetition
-    const sentences = text.split(/[.!?]+/).filter((s) => s.trim());
-    const uniqueSentences = new Set(
-      sentences.map((s) => s.trim().toLowerCase()),
-    );
-    const repetitionRatio = uniqueSentences.size / sentences.length;
-    if (sentences.length > 3 && repetitionRatio < 0.6) {
-      qualityIssues.push('Excessive repetition detected');
-    }
-    if (qualityIssues.length > 0) {
-      return {
-        tripwireTriggered: true,
-        message: `Quality issues found: ${qualityIssues.join(', ')}`,
-        severity: 'medium',
-        suggestion: 'Request a more professional, complete response',
-        metadata: {
-          issues: qualityIssues,
-          quality_score: repetitionRatio,
-        },
-      };
-    }
-    return { tripwireTriggered: false };
+```ts
+import {
+  wrapWithOutputGuardrails,
+  minLengthRequirement,
+} from 'ai-sdk-guardrails';
+const model = wrapWithOutputGuardrails(
+  openai('gpt-4o'),
+  [minLengthRequirement(100)],
+  {
+    retry: {
+      maxRetries: 2,
+      buildRetryParams: ({ lastParams }) => ({
+        ...lastParams,
+        // Increase max tokens on retry
+        maxOutputTokens: (lastParams.maxOutputTokens ?? 400) + 200,
+        // Add context about the failure
+        prompt: [
+          ...lastParams.prompt,
+          {
+            role: 'user',
+            content: 'Please provide a more detailed response.',
+          },
+        ],
+      }),
+    },
   },
-});
+);
 ```
-## 🔄 Streaming Support
+### Reusable Configurations
-Guardrails work with streams out-of-the-box. By default, output guardrails run after the complete response has been streamed (buffer mode).
+Create reusable guardrail sets:
-```typescript
-import { streamText } from 'ai';
+```ts
+import {
+  createGuardrails,
+  piiDetector,
+  sensitiveDataFilter,
+} from 'ai-sdk-guardrails';
-const guardedModel = wrapWithGuardrails(openai('gpt-4o'), {
-  outputGuardrails: [qualityJudge],
+// Define once
+const productionGuards = createGuardrails({
+  inputGuardrails: [piiDetector()],
+  outputGuardrails: [sensitiveDataFilter()],
+  throwOnBlocked: true,
 });
-const { textStream } = await streamText({
-  model: guardedModel,
-  prompt: 'Tell me a short story about a robot.',
-});
+// Apply to multiple models
+const gpt4 = productionGuards(openai('gpt-4o'));
+const claude = productionGuards(anthropic('claude-3-sonnet'));
+```
-// Stream the response to the client
-for await (const delta of textStream) {
-  process.stdout.write(delta);
-}
+### Streaming Modes
+Control when guardrails run during streaming:
-// The qualityJudge guardrail will run after the stream is complete.
+```ts
+const model = withGuardrails(openai('gpt-4o'), {
+  outputGuardrails: [minLengthRequirement(100)],
+  streamMode: 'progressive', // Run guardrails as tokens arrive
+  replaceOnBlocked: true, // Replace blocked output with fallback
+});
 ```
-### Progressive Streaming (opt-in)
+- `buffer` (default): Wait for stream to complete, then check
+- `progressive`: Check guardrails as tokens arrive (early termination)
-For early blocking, enable progressive evaluation:
+### Agent Support
+Guardrails work with AI SDK Agents:
 ```ts
-const guardedModel = wrapWithGuardrails(openai('gpt-4o'), {
-  outputGuardrails: [qualityJudge],
-  // Evaluate on the fly and stop early when blocked
-  streamMode: 'progressive',
-  // Replace blocked output with a placeholder (default: true)
-  replaceOnBlocked: true,
-});
+import { withAgentGuardrails } from 'ai-sdk-guardrails';
+import { tool } from 'ai';
+const agent = withAgentGuardrails(
+  {
+    model: openai('gpt-4o'),
+    tools: { search: searchTool },
+    system: 'You are a helpful assistant.',
+  },
+  {
+    inputGuardrails: [piiDetector()],
+    outputGuardrails: [sensitiveDataFilter()],
+    toolGuardrails: [
+      toolEgressPolicy({
+        allowedHosts: ['api.company.com'],
+        scanForUrls: true,
+      }),
+    ],
+  },
+);
+const result = await agent.generate({ prompt: '...' });
 ```
-In progressive mode, guardrails evaluate text as it arrives. If blocked:
+## MCP Security Guardrails (Advanced)
+**Production-Ready**: Protect against the ["lethal trifecta" vulnerability](https://simonwillison.net/2025/Jun/16/the-lethal-trifecta/) when using Model Context Protocol (MCP) tools.
-- with `throwOnBlocked: true`, the stream errors.
-- with `replaceOnBlocked: true`, a placeholder message is streamed and the stream ends.
-- otherwise, the original chunks continue (with a callback via `onOutputBlocked`).
+### The Problem
-Note: Progressive mode runs guardrails more frequently and may increase overhead for long streams.
+AI agents with MCP tools are vulnerable when they have:
-### Configuration Highlights
+1. **Access to private data** (through tools)
+2. **Process untrusted content** (from tool responses)
+3. **Can communicate externally** (make web requests)
-- `replaceOnBlocked` (output): defaults to `true` for safer behavior.
-- `executionOptions.logLevel`: defaults to `'warn'` (respects `'none' | 'error' | 'warn' | 'info' | 'debug'`).
-- `onInputBlocked` / `onOutputBlocked`: receive a `GuardrailExecutionSummary` with analytics.
+Malicious tool responses can contain hidden instructions that trick the AI into exfiltrating sensitive data.
-### Cancellation Support
+### Production Configuration
-Guardrails can receive an `AbortSignal` and should abort work on timeout or caller-initiated cancel:
+Full configurability with sensible defaults:
 ```ts
-const guard = defineInputGuardrail({
-  name: 'long-check',
-  async execute(context, { signal }) {
-    await doWork({ signal }); // Pass signal to your async ops
-    return { tripwireTriggered: false };
-  },
+import {
+  withGuardrails,
+  promptInjectionDetector,
+  mcpSecurityGuardrail,
+  mcpResponseSanitizer,
+  toolEgressPolicy,
+} from 'ai-sdk-guardrails';
+// Conservative production setup (high security)
+const secureModel = withGuardrails(openai('gpt-4o'), {
+  inputGuardrails: [
+    promptInjectionDetector({ threshold: 0.6, includeExamples: true }),
+  ],
+  outputGuardrails: [
+    mcpSecurityGuardrail({
+      injectionThreshold: 0.5, // Lower = more sensitive
+      maxSuspiciousUrls: 0, // Zero tolerance
+      maxContentSize: 25600, // 25KB limit
+      minEncodedLength: 15, // Detect shorter encoded attacks
+      encodedInjectionThreshold: 0.2, // Combined threshold
+      highRiskThreshold: 0.3, // High-risk cascade blocking
+      authorityThreshold: 0.5, // Authority manipulation detection
+      allowedDomains: ['api.company.com', 'trusted-partner.com'],
+      customSuspiciousDomains: ['evil.com'],
+      blockCascadingCalls: true,
+      scanEncodedContent: true,
+      detectExfiltration: true,
+    }),
+    mcpResponseSanitizer(), // Clean vs block
+    toolEgressPolicy({
+      allowedHosts: ['api.company.com'],
+      blockedHosts: ['webhook.site', 'requestcatcher.com'],
+      scanForUrls: true,
+    }),
+  ],
 });
+```
+### Environment-Based Configuration
-// Timeouts are enforced by guardrail execution; if it times out, you'll get a GuardrailTimeoutError.
+```ts
+function getSecurityConfig(env: 'production' | 'staging' | 'development') {
+  const configs = {
+    production: {
+      injectionThreshold: 0.5, // High security
+      maxContentSize: 25600, // 25KB
+      authorityThreshold: 0.5,
+    },
+    staging: {
+      injectionThreshold: 0.7, // Balanced
+      maxContentSize: 51200, // 50KB
+      authorityThreshold: 0.7,
+    },
+    development: {
+      injectionThreshold: 0.8, // Permissive
+      maxContentSize: 102400, // 100KB
+      authorityThreshold: 0.8,
+    },
+  };
+  return configs[env];
+}
+const model = withGuardrails(openai('gpt-4o'), {
+  outputGuardrails: [mcpSecurityGuardrail(getSecurityConfig('production'))],
+});
 ```
-## 🛠️ Error Handling
+### Configuration Options
-When `throwOnBlocked: true` (the default), you can catch structured errors to handle blocks gracefully.
+| Option                      | Default | Description                                      |
+| --------------------------- | ------- | ------------------------------------------------ |
+| `injectionThreshold`        | 0.7     | Prompt injection confidence threshold (0-1)      |
+| `maxSuspiciousUrls`         | 0       | Max allowed suspicious URLs (0 = zero tolerance) |
+| `maxContentSize`            | 51200   | Max content size in bytes (50KB default)         |
+| `minEncodedLength`          | 20      | Min encoded content length to analyze            |
+| `encodedInjectionThreshold` | 0.3     | Combined encoded + injection threshold           |
+| `authorityThreshold`        | 0.7     | Authority manipulation detection sensitivity     |
+| `allowedDomains`            | []      | Allowed domains for URL construction             |
+| `customSuspiciousDomains`   | []      | Additional suspicious domain patterns            |
-```typescript
-import { generateText } from 'ai';
-import { isGuardrailsError } from 'ai-sdk-guardrails';
+See complete examples:
+- [Production MCP Configuration](./examples/44-production-mcp-config.ts)
+- [MCP Security Test Suite](./examples/41-mcp-security-test.ts)
+- [Enhanced Security Testing](./examples/43-enhanced-mcp-security-test.ts)
+## Error Handling
+### Throw Errors on Block
+```ts
+const model = withGuardrails(openai('gpt-4o'), {
+  inputGuardrails: [piiDetector()],
+  throwOnBlocked: true, // Throw errors instead of silent blocking
+});
 try {
-  const result = await generateText({
-    model: guardedModel,
-    prompt: 'A prompt that might be blocked...',
-  });
+  const { text } = await generateText({ model, prompt: '...' });
 } catch (error) {
   if (isGuardrailsError(error)) {
-    // Error was thrown by one of our guardrails
-    console.error('Guardrail check failed:', error.message);
-    console.error('Triggered Guards:', error.results);
-  } else {
-    // Some other error occurred
-    console.error('An unexpected error occurred:', error);
+    console.error('Blocked:', error.message);
+    // error.results gives details per guardrail
   }
 }
 ```
-### User-Friendly Error Messages
-Transform technical guardrail messages into user-friendly guidance:
+### Error Types
-```typescript
-function createUserFriendlyMessage(guardrailResult): string {
-  const guardrailName = guardrailResult.context?.guardrailName;
+- `GuardrailsInputError` - Input guardrail blocked
+- `GuardrailsOutputError` - Output guardrail blocked
+- `GuardrailExecutionError` - Guardrail threw an error
+- `GuardrailTimeoutError` - Guardrail exceeded timeout
+- `GuardrailConfigurationError` - Invalid configuration
-  switch (guardrailName) {
-    case 'content-length-limit':
-      return 'Your message is too long. Please keep it under 500 characters for the best response.';
+## API Reference
-    case 'blocked-keywords':
-      return "I can't help with that topic. Try asking about something else I can assist with.";
+### Primary Functions
-    case 'user-rate-limit':
-      return "You're sending requests too quickly. Please wait a moment before trying again.";
+| Function                  | Purpose                                  |
+| ------------------------- | ---------------------------------------- |
+| `withGuardrails`          | Wrap model with guardrails (main API)    |
+| `createGuardrails`        | Create reusable guardrail configurations |
+| `withAgentGuardrails`     | Wrap AI SDK Agents with guardrails       |
+| `defineInputGuardrail`    | Create custom input guardrail            |
+| `defineOutputGuardrail`   | Create custom output guardrail           |
+| `executeInputGuardrails`  | Run input guardrails programmatically    |
+| `executeOutputGuardrails` | Run output guardrails programmatically   |
-    default:
-      return (
-        guardrailResult.suggestion ||
-        'Please refine your request and try again.'
-      );
-  }
-}
-```
+### Error Utilities
-## Complete AI SDK Integration
+| Function            | Purpose                              |
+| ------------------- | ------------------------------------ |
+| `isGuardrailsError` | Check if error is from guardrails    |
+| `extractErrorInfo`  | Extract structured error information |
-The library seamlessly integrates with all AI SDK functions:
+### Retry Utilities
-```typescript
-// Create your production-ready model once
-const productionModel = wrapWithGuardrails(openai('gpt-4'), {
-  inputGuardrails: [lengthGuard, spamGuard, rateLimitGuard],
-  outputGuardrails: [qualityGuard, sensitiveInfoGuard],
-  throwOnBlocked: false,
-  onInputBlocked: (executionSummary) => {
-    console.log('Input blocked:', executionSummary.blockedResults[0]?.message);
+| Function                     | Purpose                           |
+| ---------------------------- | --------------------------------- |
+| `retry`                      | Standalone retry utility          |
+| `exponentialBackoff`         | Exponential backoff strategy      |
+| `linearBackoff`              | Linear backoff strategy           |
+| `jitteredExponentialBackoff` | Jittered exponential backoff      |
+| `backoffPresets`             | Pre-configured backoff strategies |
-    // Enhanced analytics available in v4.0.0
-    console.log(`Execution time: ${executionSummary.totalExecutionTime}ms`);
-    console.log(
-      `Guardrails: ${executionSummary.stats.blocked} blocked, ${executionSummary.stats.passed} passed`,
-    );
-  },
-  onOutputBlocked: (executionSummary) => {
-    console.log(
-      'Output filtered:',
-      executionSummary.blockedResults[0]?.message,
-    );
-    // Track comprehensive metrics
-    analytics.track('output_blocked', {
-      severity: executionSummary.blockedResults[0]?.severity,
-      totalGuardrails: executionSummary.guardrailsExecuted,
-      executionTime: executionSummary.totalExecutionTime,
-    });
-  },
-});
+See source for all built-in guardrails:
-// Use with any AI SDK function
-const textResult = await generateText({
-  model: productionModel,
-  prompt: 'Write a professional email response',
-});
-const objectResult = await generateObject({
-  model: productionModel,
-  prompt: 'Create a user profile',
-  schema: userProfileSchema,
-});
-const textStream = await streamText({
-  model: productionModel,
-  prompt: 'Explain our product features',
-});
-```
+- Input helpers: [`./src/guardrails/input.ts`](./src/guardrails/input.ts)
+- Output helpers: [`./src/guardrails/output.ts`](./src/guardrails/output.ts)
+- Tool helpers: [`./src/guardrails/tools.ts`](./src/guardrails/tools.ts)
+- MCP security: [`./src/guardrails/mcp-security.ts`](./src/guardrails/mcp-security.ts)
 ## Examples
-Explore **30 comprehensive examples** that demonstrate practical performance optimization, security protection, quality assurance, and enterprise-grade safety patterns:
+Browse 48+ runnable examples: [examples/README.md](./examples/README.md)
-### Core Foundation Examples
+### Quick Starts
-- **[Input Length Limits](examples/01-input-length-limit.ts)** - Foundation patterns for input validation
-- **[Blocked Keywords](examples/02-blocked-keywords.ts)** - Block prompts with specific keywords and content filtering
-- **[Output Length Check](examples/04-output-length-check.ts)** - Ensure minimum output length and quality control
-- **[Quality Assessment](examples/06-quality-assessment.ts)** - Assess response quality and content analysis
-- **[Combined Protection](examples/07-combined-protection.ts)** - Simple input/output validation for efficiency and quality
-- **[Simple Combined Protection](examples/07a-simple-combined-protection.ts)** - Simplified combined guardrails example
-- **[Blocking vs Warning](examples/08-blocking-vs-warning.ts)** - Compare blocking and warning modes with error handling
+| Example                    | Description                     | File                                                                              |
+| -------------------------- | ------------------------------- | --------------------------------------------------------------------------------- |
+| Simple combined protection | Minimal input and output setup  | [07a-simple-combined-protection.ts](./examples/07a-simple-combined-protection.ts) |
+| Auto retry on output       | Retry until output meets a rule | [32-auto-retry-output.ts](./examples/32-auto-retry-output.ts)                     |
+| LLM judge auto-retry       | Judge feedback drives retry     | [35-judge-auto-retry.ts](./examples/35-judge-auto-retry.ts)                       |
+| Weather assistant          | End-to-end input/output + retry | [33-blog-post-weather-assistant.ts](./examples/33-blog-post-weather-assistant.ts) |
-### Security & Protection Examples
+### Input Safety
-- **[PII Detection](examples/03-pii-detection.ts)** - Detect and block personal information in inputs
-- **[Sensitive Output Filter](examples/05-sensitive-output-filter.ts)** - Filter sensitive data from responses
-- **[Prompt Injection Detection](examples/16-prompt-injection-detection.ts)** - Comprehensive prompt injection detection with pattern matching and heuristic scoring
-- **[Tool Call Validation](examples/17-tool-call-validation.ts)** - Tool call validation with security patterns and dangerous operation detection
-- **[Basic Tool Allowlist](examples/17a-basic-tool-allowlist.ts)** - Basic tool allowlisting for secure tool usage
-- **[Tool Parameter Validation](examples/17b-tool-parameter-validation.ts)** - Validate tool parameters for security
-- **[Secret Leakage Scan](examples/18-secret-leakage-scan.ts)** - Secret leakage scanning with automatic redaction and entropy calculation
-- **[Jailbreak Detection](examples/30-jailbreak-detection.ts)** - Jailbreak detection with safe response templates and pattern recognition
+| Example            | Description                         | File                                                            |
+| ------------------ | ----------------------------------- | --------------------------------------------------------------- |
+| Input length limit | Enforce max input length            | [01-input-length-limit.ts](./examples/01-input-length-limit.ts) |
+| Blocked keywords   | Block specific terms                | [02-blocked-keywords.ts](./examples/02-blocked-keywords.ts)     |
+| PII detection      | Detect PII before calling the model | [03-pii-detection.ts](./examples/03-pii-detection.ts)           |
+| Rate limiting      | Simple per-user rate limit          | [13-rate-limiting.ts](./examples/13-rate-limiting.ts)           |
-### Content Quality & Validation Examples
+### Output Safety
-- **[Autoevals Guardrails](examples/31-autoevals-guardrails.ts)** - AI-powered quality evaluation using Autoevals library for factuality checking
-- **[Business Logic](examples/14-business-logic.ts)** - Custom business rules, work hours, and professional standards
-- **[LLM-as-Judge](examples/15-llm-as-judge.ts)** - AI-powered quality evaluation and scoring
-- **[Simple Quality Judge](examples/15a-simple-quality-judge.ts)** - Simplified quality assessment example
-- **[Hallucination Detection](examples/19-hallucination-detection.ts)** - Hallucination detection with LLM-as-judge verification and fact-checking
-- **[Response Consistency](examples/22-response-consistency.ts)** - Response consistency validation and coherence checking
+| Example                 | Description                         | File                                                                      |
+| ----------------------- | ----------------------------------- | ------------------------------------------------------------------------- |
+| Output length check     | Require min/max output length       | [04-output-length-check.ts](./examples/04-output-length-check.ts)         |
+| Sensitive output filter | Filter secrets and PII in responses | [05-sensitive-output-filter.ts](./examples/05-sensitive-output-filter.ts) |
+| Hallucination detection | Flag uncertain factual claims       | [19-hallucination-detection.ts](./examples/19-hallucination-detection.ts) |
-### Compliance & Regulation Examples
+### Streaming
-- **[Regulated Advice Compliance](examples/21-regulated-advice-compliance.ts)** - Regulated advice compliance with jurisdiction-specific rules (US, EU, UK, CA, AU, JP, CN, IN)
-- **[Role Hierarchy Enforcement](examples/23-role-hierarchy-enforcement.ts)** - Role hierarchy enforcement with multi-layered violation detection
+| Example           | Description                        | File                                                                              |
+| ----------------- | ---------------------------------- | --------------------------------------------------------------------------------- |
+| Streaming limits  | Apply limits in buffered streaming | [11-streaming-limits.ts](./examples/11-streaming-limits.ts)                       |
+| Streaming quality | Quality checks with streaming      | [12-streaming-quality.ts](./examples/12-streaming-quality.ts)                     |
+| Early termination | Stop streams early when blocked    | [28-streaming-early-termination.ts](./examples/28-streaming-early-termination.ts) |
-### Data Integrity & Code Safety Examples
+### Advanced
-- **[Schema Validation](examples/09-schema-validation.ts)** - Schema validation and structured output quality
-- **[Object Content Filter](examples/10-object-content-filter.ts)** - Filter inappropriate content in generated objects
-- **[SQL Code Safety](examples/24-sql-code-safety.ts)** - SQL code safety with dangerous operation blocking and injection detection
+| Example                    | Description                   | File                                                                            |
+| -------------------------- | ----------------------------- | ------------------------------------------------------------------------------- |
+| Simple quality judge       | Cheaper model judges quality  | [15a-simple-quality-judge.ts](./examples/15a-simple-quality-judge.ts)           |
+| Secret leakage scan        | Scan responses for secrets    | [18-secret-leakage-scan.ts](./examples/18-secret-leakage-scan.ts)               |
+| SQL code safety            | Basic SQL safety checks       | [24-sql-code-safety.ts](./examples/24-sql-code-safety.ts)                       |
+| Role hierarchy enforcement | Enforce role rules in prompts | [23-role-hierarchy-enforcement.ts](./examples/23-role-hierarchy-enforcement.ts) |
-### Network & External Access Examples
+## Migration from v3.x
-- **[Domain Allowlisting](examples/25-browsing-domain-allowlist.ts)** - Domain allowlisting with URL sanitization and security validation
+API naming has been improved in v4.x (old names still work but are deprecated):
-### Privacy & Memory Management Examples
+```ts
+// Before (v3.x - still works but deprecated)
+import { wrapWithGuardrails, InputBlockedError } from 'ai-sdk-guardrails';
+const model = wrapWithGuardrails(openai('gpt-4o'), { ... });
-- **[Memory Minimization](examples/26-memory-minimization.ts)** - Memory minimization with PII redaction and multiple redaction strategies
-- **[Logging Redaction](examples/27-logging-redaction.ts)** - Logging redaction with secure logging practices and compliance frameworks
+// After (v4.x - recommended)
+import { withGuardrails, GuardrailsInputError } from 'ai-sdk-guardrails';
+const model = withGuardrails(openai('gpt-4o'), { ... });
+```
-### Safety & Escalation Examples
+Changes:
-- **[Human Review Escalation](examples/20-human-review-escalation.ts)** - Human review escalation with content flagging, review routing, and quality control workflows
-- **[Toxicity & Harassment De-escalation](examples/29-toxicity-harassment-deescalation.ts)** - Toxicity and harassment de-escalation with safe response generation and user escalation tracking
+- `wrapWithGuardrails` → `withGuardrails`
+- `wrapAgentWithGuardrails` → `withAgentGuardrails`
+- `InputBlockedError` → `GuardrailsInputError`
+- `OutputBlockedError` → `GuardrailsOutputError`
-### Streaming Examples
+## Compatibility
-- **[Streaming Limits](examples/11-streaming-limits.ts)** - Apply guardrails to streaming responses with real-time validation
-- **[Streaming Quality](examples/12-streaming-quality.ts)** - Real-time quality monitoring for streams
-- **[Streaming Early Termination](examples/28-streaming-early-termination.ts)** - Streaming early termination with real-time content monitoring and session state management
+- **Runtime**: Node.js 18+ recommended
+- **AI SDK**: Compatible with AI SDK 5.x (`ai@^5`)
+- **TypeScript**: Full type safety with TypeScript 5+
+- **Works with any model**: OpenAI, Anthropic, Mistral, Groq, etc.
-### Resource Management Examples
+## Why This Library?
-- **[Rate Limiting](examples/13-rate-limiting.ts)** - Smart rate limiting that prevents resource overuse
+**Non-invasive**: Guardrails are middleware. Your existing code, telemetry (Langfuse, Helicone), and logging stay intact.
-### Running Examples
+**Production-ready**: Used in production by teams who need compliance, security, and cost control without rebuilding their infrastructure.
-```bash
-# Install dependencies
-pnpm install
-# Run core foundation examples
-tsx examples/01-input-length-limit.ts      # Basic input validation
-tsx examples/02-blocked-keywords.ts        # Keyword blocking
-tsx examples/04-output-length-check.ts     # Output length validation
-tsx examples/06-quality-assessment.ts      # Quality assessment
-tsx examples/07-combined-protection.ts     # Combined input/output protection
-tsx examples/07a-simple-combined-protection.ts # Simplified combined protection
-tsx examples/08-blocking-vs-warning.ts     # Blocking vs warning modes
-# Run security examples
-tsx examples/03-pii-detection.ts           # PII protection
-tsx examples/05-sensitive-output-filter.ts # Sensitive output filtering
-tsx examples/16-prompt-injection-detection.ts # Prompt injection protection
-tsx examples/17-tool-call-validation.ts    # Tool call validation
-tsx examples/17a-basic-tool-allowlist.ts   # Basic tool allowlisting
-tsx examples/17b-tool-parameter-validation.ts # Tool parameter validation
-tsx examples/18-secret-leakage-scan.ts     # Secret leakage prevention
-tsx examples/30-jailbreak-detection.ts     # Jailbreak detection
-# Run content quality examples
-tsx examples/31-autoevals-guardrails.ts    # AI-powered quality evaluation with Autoevals
-tsx examples/14-business-logic.ts          # Business-specific rules
-tsx examples/15-llm-as-judge.ts            # AI-powered quality control
-tsx examples/15a-simple-quality-judge.ts   # Simplified quality assessment
-tsx examples/19-hallucination-detection.ts # Hallucination detection
-tsx examples/22-response-consistency.ts    # Response consistency
-# Run compliance examples
-tsx examples/21-regulated-advice-compliance.ts # Regulatory compliance
-tsx examples/23-role-hierarchy-enforcement.ts # Role hierarchy enforcement
-# Run data integrity examples
-tsx examples/09-schema-validation.ts       # Schema validation
-tsx examples/10-object-content-filter.ts   # Object content filtering
-tsx examples/24-sql-code-safety.ts         # SQL code safety
-# Run network security examples
-tsx examples/25-browsing-domain-allowlist.ts # Domain allowlisting
-# Run privacy examples
-tsx examples/26-memory-minimization.ts     # Memory minimization
-tsx examples/27-logging-redaction.ts       # Logging redaction
-# Run safety examples
-tsx examples/20-human-review-escalation.ts # Human review escalation
-tsx examples/29-toxicity-harassment-deescalation.ts # Toxicity de-escalation
-# Run streaming examples
-tsx examples/11-streaming-limits.ts        # Streaming limits
-tsx examples/12-streaming-quality.ts       # Streaming quality monitoring
-tsx examples/28-streaming-early-termination.ts # Streaming early termination
-# Run resource management examples
-tsx examples/13-rate-limiting.ts           # Rate limiting
-```
+**Developer experience**: One line to add safety. Progressive complexity - start simple, add advanced features when needed.
+**Type-safe**: Rich TypeScript types and inference throughout.
-## 🤝 Contributing
+## Contributing
-Contributions of all sizes are welcome! Please open issues and pull requests on [GitHub](https://github.com/jagreehal/ai-sdk-guardrails).
+Issues and PRs are welcome.
-## 📄 License
+## License
-MIT © [Jag Reehal](https://github.com/jagreehal) – See LICENSE for full details.
+MIT © Jag Reehal. See [LICENSE](./LICENSE) for details.