dev-ai-sdk 0.0.2 → 0.0.4
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +474 -323
- package/dist/client.d.ts +5 -2
- package/dist/client.d.ts.map +1 -1
- package/dist/client.js +77 -3
- package/dist/client.js.map +1 -1
- package/dist/core/config.d.ts +3 -0
- package/dist/core/config.d.ts.map +1 -1
- package/dist/core/council.d.ts +2 -0
- package/dist/core/council.d.ts.map +1 -0
- package/dist/core/council.js +9 -0
- package/dist/core/council.js.map +1 -0
- package/dist/core/error.d.ts +4 -1
- package/dist/core/error.d.ts.map +1 -1
- package/dist/core/error.js +12 -1
- package/dist/core/error.js.map +1 -1
- package/dist/core/fallbackEngine.js +3 -3
- package/dist/core/fallbackEngine.js.map +1 -1
- package/dist/core/validate.d.ts.map +1 -1
- package/dist/core/validate.js +32 -18
- package/dist/core/validate.js.map +1 -1
- package/dist/index.d.ts +2 -2
- package/dist/index.d.ts.map +1 -1
- package/dist/index.js +1 -0
- package/dist/index.js.map +1 -1
- package/dist/providers/anthropic-core.d.ts +1 -0
- package/dist/providers/anthropic-core.d.ts.map +1 -0
- package/dist/providers/anthropic-core.js +2 -0
- package/dist/providers/anthropic-core.js.map +1 -0
- package/dist/providers/anthropic.d.ts +3 -0
- package/dist/providers/anthropic.d.ts.map +1 -0
- package/dist/providers/anthropic.js +44 -0
- package/dist/providers/anthropic.js.map +1 -0
- package/dist/providers/deepseek-stream.d.ts +2 -2
- package/dist/providers/deepseek-stream.d.ts.map +1 -1
- package/dist/providers/deepseek-stream.js +18 -11
- package/dist/providers/deepseek-stream.js.map +1 -1
- package/dist/providers/deepseek.js +2 -2
- package/dist/providers/deepseek.js.map +1 -1
- package/dist/providers/google-core.js +2 -2
- package/dist/providers/google-core.js.map +1 -1
- package/dist/providers/google-stream.d.ts +2 -2
- package/dist/providers/google-stream.d.ts.map +1 -1
- package/dist/providers/google-stream.js +88 -5
- package/dist/providers/google-stream.js.map +1 -1
- package/dist/providers/google.d.ts +2 -2
- package/dist/providers/google.d.ts.map +1 -1
- package/dist/providers/mistral-stream.d.ts +2 -2
- package/dist/providers/mistral-stream.d.ts.map +1 -1
- package/dist/providers/mistral-stream.js +18 -8
- package/dist/providers/mistral-stream.js.map +1 -1
- package/dist/providers/mistral.js +2 -2
- package/dist/providers/mistral.js.map +1 -1
- package/dist/providers/openai-stream.d.ts +2 -2
- package/dist/providers/openai-stream.d.ts.map +1 -1
- package/dist/providers/openai-stream.js +10 -5
- package/dist/providers/openai-stream.js.map +1 -1
- package/dist/providers/openai.js +2 -2
- package/dist/providers/openai.js.map +1 -1
- package/dist/test.d.ts +2 -0
- package/dist/test.d.ts.map +1 -0
- package/dist/test.js +24 -0
- package/dist/test.js.map +1 -0
- package/dist/types/error.types.d.ts +7 -0
- package/dist/types/error.types.d.ts.map +1 -0
- package/dist/types/error.types.js +2 -0
- package/dist/types/error.types.js.map +1 -0
- package/dist/types/types.d.ts +39 -0
- package/dist/types/types.d.ts.map +1 -1
- package/package.json +3 -3
package/README.md
CHANGED

# dev-ai-sdk

**A unified TypeScript SDK for using multiple AI providers with one simple interface.**

Stop juggling different API docs and client libraries. `dev-ai-sdk` lets you switch between OpenAI, Google Gemini, DeepSeek, Mistral, and Anthropic Claude with zero code changes. Supports streaming, automatic fallback, and multi-model LLM councils.

---

## What It Does

Write once, run anywhere. This SDK provides a consistent interface for text generation across multiple LLM providers:

- **OpenAI** (GPT models via Chat Completions API)
- **Google Gemini** (Gemini models)
- **DeepSeek** (DeepSeek chat models)
- **Mistral** (Mistral models)
- **Anthropic Claude** (Claude 3/3.5 models)

Switch providers, change models, or even combine multiple providers — your code stays the same. Bonus features: streaming, automatic fallback to other providers, and LLM councils for multi-model decision making.

---

## Quick Start

### Installation

```bash
npm install dev-ai-sdk
```

### 5-Minute Example

```ts
import { genChat } from 'dev-ai-sdk';

// 1. Create a client with your API keys
const ai = new genChat({
  openai: {
    apiKey: process.env.OPENAI_API_KEY,
  },
});

// 2. Generate text
const result = await ai.generate({
  openai: {
    model: 'gpt-4o-mini',
    prompt: 'What is the capital of France?',
  },
});

// 3. Use the result
console.log(result.data);     // "The capital of France is Paris."
console.log(result.provider); // "openai"
console.log(result.model);    // "gpt-4o-mini"
```

That's it. No complex setup, no provider-specific boilerplate.

---

## Features

✅ **Single Interface** – Same code works across 5 major LLM providers
✅ **Type-Safe** – Full TypeScript support with proper types
✅ **Minimal** – Tiny, lightweight package (15KB gzipped)
✅ **Streaming** – Built-in streaming support for all providers
✅ **Automatic Fallback** – If a provider fails, automatically try others
✅ **LLM Council** – Run multiple models in parallel, have a judge synthesize the best answer
✅ **Error Handling** – Unified error handling across all providers
✅ **No Dependencies** – Only `dotenv` for environment variables

---

## Usage Guide

### Initialize the Client

```ts
import { genChat } from 'dev-ai-sdk';

const ai = new genChat({
  openai: {
    apiKey: process.env.OPENAI_API_KEY,
  },
  google: {
    apiKey: process.env.GOOGLE_API_KEY,
  },
  deepseek: {
    apiKey: process.env.DEEPSEEK_API_KEY,
  },
  mistral: {
    apiKey: process.env.MISTRAL_API_KEY,
  },
  anthropic: {
    apiKey: process.env.ANTHROPIC_API_KEY,
  },
});
```

You don't need to configure all providers — just the ones you use.

---

### Basic Text Generation

#### OpenAI

```ts
const result = await ai.generate({
  openai: {
    model: 'gpt-4o-mini',
    prompt: 'Explain quantum computing in one sentence.',
    temperature: 0.7,
    maxTokens: 100,
  },
});

console.log(result.data); // The AI's response
```

#### Google Gemini

```ts
const result = await ai.generate({
  google: {
    model: 'gemini-2.5-flash-lite',
    prompt: 'What are the three laws of robotics?',
    temperature: 0.5,
    maxTokens: 200,
  },
});

console.log(result.data);
```

#### DeepSeek

```ts
const result = await ai.generate({
  deepseek: {
    model: 'deepseek-chat',
    prompt: 'Explain machine learning like I\'m 5.',
    temperature: 0.6,
    maxTokens: 150,
  },
});

console.log(result.data);
```

#### Mistral

```ts
const result = await ai.generate({
  mistral: {
    model: 'mistral-small-latest',
    prompt: 'Tell me a joke about programming.',
    temperature: 0.8,
    maxTokens: 100,
  },
});

console.log(result.data);
```

#### Anthropic Claude

```ts
const result = await ai.generate({
  anthropic: {
    model: 'claude-3-5-sonnet-20241022',
    prompt: 'What is the meaning of life?',
    temperature: 0.7,
    maxTokens: 150,
  },
});

console.log(result.data);
```

---

### Streaming Responses

Get real-time responses for long outputs. All providers return a unified `StreamOutput` format:

```ts
import { genChat, type StreamOutput } from 'dev-ai-sdk';

const stream = await ai.generate({
  google: {
    model: 'gemini-2.5-flash',
    prompt: 'Write a 500-word essay on AI ethics.',
    stream: true,
  },
});

// Check if result is a stream
if (Symbol.asyncIterator in Object(stream)) {
  // Loop through streaming chunks - same pattern for all 4 providers
  for await (const chunk of stream as AsyncIterable<StreamOutput>) {
    // chunk is a StreamOutput with unified structure:
    // - chunk.text: the streamed text content
    // - chunk.done: boolean indicating if stream is complete
    // - chunk.provider: 'google' | 'openai' | 'deepseek' | 'mistral'
    // - chunk.tokens?: { prompt?, completion?, total? } (if available from provider)
    // - chunk.raw: raw provider event for advanced use

    process.stdout.write(chunk.text);

    // Show metadata when stream is done
    if (chunk.done) {
      console.log('\nStream completed');
      console.log(`Provider: ${chunk.provider}`);
      if (chunk.tokens) {
        console.log(`Tokens used: ${chunk.tokens.total}`);
      }
    }
  }
}
```

**Why `StreamOutput`?**

- **Unified API** – Same code works for all 5 providers
- **Consistent fields** – Always access `chunk.text`, never worry about provider-specific paths
- **Access to metadata** – Token counts, completion status, and provider name
- **Raw access** – `chunk.raw` gives you the full provider event if you need it

---

## Automatic Fallback

If a provider fails, automatically retry with other configured providers:

```ts
const ai = new genChat({
  openai: { apiKey: process.env.OPENAI_API_KEY },
  google: { apiKey: process.env.GOOGLE_API_KEY },
  fallback: true, // Enable automatic fallback
});

// Try OpenAI first; if it fails, automatically try Google
const result = await ai.generate({
  openai: {
    model: 'gpt-4o-mini',
    prompt: 'What is 2+2?',
  },
});

console.log(result.provider); // "openai" or "google" depending on which succeeded
console.log(result.data);
```

**How Fallback Works:**

1. First, attempt the configured provider (e.g., OpenAI)
2. If it fails with a retryable error (network, timeout, rate limit), try the next provider
3. Each fallback provider uses a sensible default model for that provider (e.g., `gemini-2.5-flash-lite` for Google)
4. If all providers fail, throw an error
5. **Note:** Streaming calls (`stream: true`) do not trigger fallback; only non-streaming calls can fall back

**Limitations:**

- Fallback is disabled for streaming responses
- Only retryable errors trigger fallback (not validation/config errors)
- Each fallback attempt uses provider-specific default models
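
Because an exhausted fallback chain still ends in a thrown error, it is worth wrapping fallback-enabled calls in `try`/`catch`. A minimal sketch, assuming the final failure surfaces as the `SDKError` described under Error Handling below:

```ts
import { SDKError } from 'dev-ai-sdk';

// Uses the fallback-enabled `ai` client from the example above.
try {
  const result = await ai.generate({
    openai: {
      model: 'gpt-4o-mini',
      prompt: 'What is 2+2?',
    },
  });
  console.log(`Answered by ${result.provider}:`, result.data);
} catch (err) {
  // Reached only after the primary provider and every fallback provider failed
  if (err instanceof SDKError) {
    console.error(`All providers failed: ${err.message}`);
  } else {
    throw err;
  }
}
```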

---

## LLM Council

Run the same prompt across multiple models and have a judge synthesize the best answer:

```ts
import { genChat, type CouncilDecision } from 'dev-ai-sdk';

const ai = new genChat({
  openai: { apiKey: process.env.OPENAI_API_KEY },
  google: { apiKey: process.env.GOOGLE_API_KEY },
  mistral: { apiKey: process.env.MISTRAL_API_KEY },
  anthropic: { apiKey: process.env.ANTHROPIC_API_KEY },
});

// Run same prompt across 3 models, judge with OpenAI
const decision = await ai.councilGenerate({
  members: [
    {
      google: { model: 'gemini-2.5-flash-lite' },
    },
    {
      mistral: { model: 'mistral-small-latest' },
    },
    {
      anthropic: { model: 'claude-3-5-sonnet-20241022' },
    },
  ],
  judge: {
    openai: { model: 'gpt-4o-mini' },
  },
  prompt: 'What are the top 3 programming languages for 2025 and why?',
  system: 'You are an expert in technology trends.',
});

console.log(decision.finalAnswer);     // Judge's synthesis of all member responses
console.log(decision.memberResponses); // All individual model outputs
console.log(decision.reasoning);       // Judge's reasoning for the final answer
```

**Council Response Structure:**

```ts
type CouncilDecision = {
  finalAnswer: string; // Judge's final synthesized answer
  memberResponses: {
    [key: string]: string; // Each member's response by provider name
  };
  reasoning: string; // Judge's reasoning
  judge: {
    provider: string; // Judge provider (e.g., "openai")
    model: string;    // Judge model
  };
  members: {
    provider: string; // Member provider
    model: string;    // Member model
  }[];
}
```

**Benefits:**

- **Better decisions** – Multiple perspectives on complex problems
- **Reduced bias** – Different models have different strengths
- **Unified response** – Single final answer instead of multiple conflicting outputs
- **Transparent reasoning** – Judge explains why it chose certain ideas
- **Parallel execution** – All member calls run in parallel for speed
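
When you want to compare the judge's synthesis with each member's raw answer, the `CouncilDecision` fields shown above are enough. A small sketch:

```ts
// Print the judge's final answer, then each council member's own response
console.log(`Judge (${decision.judge.provider}/${decision.judge.model}):`);
console.log(decision.finalAnswer);

for (const [provider, answer] of Object.entries(decision.memberResponses)) {
  console.log(`\n--- ${provider} ---`);
  console.log(answer);
}
```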

---

### System Prompts

Give the AI context and instructions:

```ts
const result = await ai.generate({
  openai: {
    model: 'gpt-4o-mini',
    system: 'You are a helpful coding assistant. Always provide code examples.',
    prompt: 'How do I sort an array in JavaScript?',
  },
});

console.log(result.data);
```

---

### Temperature & Max Tokens

Control response behavior:

```ts
const result = await ai.generate({
  openai: {
    model: 'gpt-4o-mini',
    prompt: 'Generate a creative story title.',
    temperature: 0.9, // Higher = more creative/random (0-1)
    maxTokens: 50,    // Limit response length
  },
});

console.log(result.data);
```

---

### Get Raw API Responses

Sometimes you need the full provider response:

```ts
const result = await ai.generate({
  google: {
    model: 'gemini-2.5-flash-lite',
    prompt: 'What is 2+2?',
    raw: true,
  },
});

console.log(result.raw);  // Full Google API response
console.log(result.data); // Just the text
```

---

## Configuration Reference

### Response Object

Every call returns this shape (for non-streaming):

```ts
{
  data: string;     // The AI's text response
  provider: string; // Which provider was used (e.g., "openai")
  model: string;    // Which model was used (e.g., "gpt-4o-mini")
  raw?: any;        // (Optional) Full raw API response if raw: true
}
```

### Request Parameters

All providers support:

| Parameter | Type | Required | Default | Description |
|-----------|------|----------|---------|-------------|
| `model` | string | ✅ | — | Model name (e.g., `gpt-4o-mini`, `gemini-2.5-flash-lite`) |
| `prompt` | string | ✅ | — | Your question or instruction |
| `system` | string | ❌ | — | System context/role for the AI |
| `temperature` | number | ❌ | 1 | Randomness (0 = deterministic, 2 = very creative) |
| `maxTokens` | number | ❌ | — | Max response length in tokens |
| `stream` | boolean | ❌ | false | Stream responses in real-time |
| `raw` | boolean | ❌ | false | Include full provider response |
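
For reference, a single call that sets every parameter in the table (the prompt and values are illustrative):

```ts
const result = await ai.generate({
  openai: {
    model: 'gpt-4o-mini',                               // required
    prompt: 'List three uses of TypeScript generics.',  // required
    system: 'You are a concise technical writer.',      // optional role/context
    temperature: 0.3,                                   // optional, lower = more deterministic
    maxTokens: 120,                                     // optional response-length cap
    stream: false,                                      // optional, false returns the response object above
    raw: false,                                         // optional, true attaches the raw provider JSON
  },
});
```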

---

## StreamOutput Type Reference

All streaming responses return a unified `StreamOutput` type, regardless of provider:

```ts
type StreamOutput = {
  text: string;  // The streamed text chunk
  done: boolean; // True when stream is complete
  tokens?: {
    prompt?: number;     // Prompt tokens (if available)
    completion?: number; // Completion tokens (if available)
    total?: number;      // Total tokens (if available)
  };
  raw: any;         // Raw provider event object
  provider: string; // 'google' | 'openai' | 'deepseek' | 'mistral' | 'anthropic'
}
```

**Example:**

```ts
const stream = await ai.generate({
  google: {
    model: 'gemini-2.5-flash',
    prompt: 'Hello!',
    stream: true,
  },
});

if (Symbol.asyncIterator in Object(stream)) {
  for await (const chunk of stream as AsyncIterable<StreamOutput>) {
    console.log(chunk.text);          // "Hello" or similar
    console.log(chunk.done);          // false, then true at end
    console.log(chunk.provider);      // "google"
    console.log(chunk.tokens?.total); // 42 (if available)
    console.log(chunk.raw);           // Full Gemini event object
  }
}
```

**Key Benefits:**

- ✅ Same interface for all 5 providers
- ✅ Always access `chunk.text` for content
- ✅ Always access `chunk.done` to detect completion
- ✅ Token info included when provider supports it
- ✅ `chunk.raw` for provider-specific advanced use cases
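
Because every chunk exposes the same `text` and `done` fields, a small helper can buffer a whole stream into one string when you want streaming transport but a single result. This helper is not part of the SDK, just a sketch built on the type above:

```ts
// Accumulate a streamed response into a single string
async function collectStream(stream: AsyncIterable<StreamOutput>): Promise<string> {
  let full = '';
  for await (const chunk of stream) {
    full += chunk.text;    // same field for every provider
    if (chunk.done) break; // final chunk signals completion
  }
  return full;
}

const stream = await ai.generate({
  google: { model: 'gemini-2.5-flash', prompt: 'Hello!', stream: true },
});

if (Symbol.asyncIterator in Object(stream)) {
  const text = await collectStream(stream as AsyncIterable<StreamOutput>);
  console.log(text);
}
```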

---

## Error Handling

All errors are `SDKError` exceptions:

```ts
import { SDKError } from 'dev-ai-sdk';

try {
  const result = await ai.generate({
    openai: {
      model: 'gpt-4o-mini',
      prompt: '', // Invalid: empty prompt
    },
  });
} catch (err) {
  if (err instanceof SDKError) {
    console.error(`Error from ${err.provider}: ${err.message}`);
  } else {
    console.error('Unexpected error:', err);
  }
}
```

Common errors:

- **Missing API key** – Configure all providers you use
- **Invalid model name** – Check provider documentation for valid models
- **Empty prompt** – Prompt must be a non-empty string
- **Invalid request** – Only pass one provider per request (not multiple)

---

## Environment Setup

Create a `.env` file with your API keys:

```bash
# .env
OPENAI_API_KEY=sk-...
GOOGLE_API_KEY=AIza...
DEEPSEEK_API_KEY=sk-...
MISTRAL_API_KEY=...
ANTHROPIC_API_KEY=sk-ant-...
```

Then load it in your code:

```ts
import 'dotenv/config';

const ai = new genChat({
  openai: { apiKey: process.env.OPENAI_API_KEY! },
});
```

---

## Common Patterns

### Try Multiple Providers

Switch providers without changing your code:

```ts
const provider = process.env.AI_PROVIDER || 'openai';

const result = await ai.generate({
  [provider]: {
    model: getModelForProvider(provider),
    prompt: 'Hello, AI!',
  },
});
```
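
The `getModelForProvider` helper used above is not part of the SDK. One possible implementation is a plain lookup table of default models per provider key (the model names below are examples only, taken from earlier sections):

```ts
// Hypothetical helper: map a provider key to a default model name.
function getModelForProvider(provider: string): string {
  const defaults: Record<string, string> = {
    openai: 'gpt-4o-mini',
    google: 'gemini-2.5-flash-lite',
    deepseek: 'deepseek-chat',
    mistral: 'mistral-small-latest',
    anthropic: 'claude-3-5-sonnet-20241022',
  };
  const model = defaults[provider];
  if (!model) throw new Error(`Unsupported provider: ${provider}`);
  return model;
}
```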

### Fallback to Cheaper Model

```ts
try {
  const result = await ai.generate({
    openai: {
      model: 'gpt-4o', // Expensive
      prompt: 'Complex question...',
    },
  });
} catch {
  // Fall back to cheaper model
  const result = await ai.generate({
    openai: {
      model: 'gpt-4o-mini', // Cheaper
      prompt: 'Complex question...',
    },
  });
}
```

### Streaming with Real-Time Updates

A practical example combining streaming with unified `StreamOutput`:

```ts
import { genChat, type StreamOutput } from 'dev-ai-sdk';

const ai = new genChat({
  google: { apiKey: process.env.GOOGLE_API_KEY! },
});

const stream = await ai.generate({
  google: {
    model: 'gemini-2.5-flash',
    prompt: 'Write a haiku about programming...',
    stream: true,
  },
});

if (Symbol.asyncIterator in Object(stream)) {
  for await (const chunk of stream as AsyncIterable<StreamOutput>) {
    // Unified interface - works the same for all 4 providers
    process.stdout.write(chunk.text);

    if (chunk.done) {
      console.log('\n');
      console.log(`Completed from ${chunk.provider}`);
      if (chunk.tokens?.total) {
        console.log(`Used ${chunk.tokens.total} tokens`);
      }
    }
  }
}
```

---

## Limitations

This is v0.0.4 — early but functional. Currently:

- Single-turn text generation (no multi-turn conversation history yet)
- Streaming returns unified `StreamOutput` objects (consistent across all providers)
- Fallback limited to non-streaming calls only
- LLM Council judge runs sequentially after all members complete
- No function calling / tool use yet
- No JSON mode / structured output yet

---

## What's Next

Future versions will include:

- Multi-turn conversation management
- Structured output helpers
- Function calling across providers
- Automatic model selection based on task complexity
- Rate limiting & caching
- React/Next.js hooks
- More providers (Azure, Cohere, Ollama, etc.)

---

## Support

- **GitHub**: https://github.com/shujanislam/dev-ai-sdk
- **Issues**: https://github.com/shujanislam/dev-ai-sdk/issues
- **Author**: Shujan Islam

---

## License

MIT — Use freely in your projects.