@context-chef/ai-sdk-middleware 0.1.0 → 0.1.2

Files changed (2)

package/README.md +30 -6
package/package.json +1 -1

package/README.md CHANGED

@@ -1,13 +1,17 @@
 # @context-chef/ai-sdk-middleware
 [![npm version](https://img.shields.io/npm/v/@context-chef/ai-sdk-middleware.svg)](https://www.npmjs.com/package/@context-chef/ai-sdk-middleware)
+[![npm downloads](https://img.shields.io/npm/dm/@context-chef/ai-sdk-middleware.svg)](https://www.npmjs.com/package/@context-chef/ai-sdk-middleware)
+[![License](https://img.shields.io/npm/l/@context-chef/ai-sdk-middleware.svg)](https://github.com/MyPrototypeWhat/context-chef/blob/main/LICENSE)
+[![TypeScript](https://img.shields.io/badge/TypeScript-5.9-blue.svg)](https://www.typescriptlang.org/)
+[![AI SDK](https://img.shields.io/badge/AI%20SDK-v6-black.svg)](https://ai-sdk.dev)
-[Vercel AI SDK](https://sdk.vercel.ai) middleware powered by [context-chef](https://github.com/MyPrototypeWhat/context-chef). Transparent history compression, tool result truncation, and token budget management — zero code changes required.
+[Vercel AI SDK](https://ai-sdk.dev) middleware powered by [context-chef](https://github.com/MyPrototypeWhat/context-chef). Transparent history compression, tool result truncation, and token budget management — zero code changes required.
 ## Installation
 ```bash
-npm install @context-chef/ai-sdk-middleware @context-chef/core ai
+npm install @context-chef/ai-sdk-middleware ai
 ```
 ## Quick Start
@@ -19,9 +23,11 @@ import { generateText } from 'ai';
 const model = withContextChef(openai('gpt-4o'), {
   contextWindow: 128_000,
+  compress: { model: openai('gpt-4o-mini') },
+  truncate: { threshold: 5000, headChars: 500, tailChars: 1000 },
 });
-// Use exactly like normal — everything below is unchanged
+// Everything below stays exactly the same — works with generateText and streamText
 const result = await generateText({
   model,
   messages: conversationHistory,
@@ -29,7 +35,7 @@ const result = await generateText({
 });
 ```
-That's it. History compression and token budget tracking happen automatically behind the scenes.
+That's it. History compression, tool result truncation, and token budget tracking happen automatically behind the scenes.
 ## Features
@@ -72,6 +78,22 @@ const model = withContextChef(openai('gpt-4o'), {
 });
 ```
+Optionally persist the original content via a storage adapter so the LLM can retrieve it later via a `context://vfs/` URI:
+```typescript
+import { FileSystemAdapter } from '@context-chef/core';
+const model = withContextChef(openai('gpt-4o'), {
+  contextWindow: 128_000,
+  truncate: {
+    threshold: 5000,
+    headChars: 500,
+    tailChars: 1000,
+    storage: new FileSystemAdapter('.context_vfs'), // or your own DB adapter
+  },
+});
+```
 ### Token Budget Tracking
 The middleware automatically extracts token usage from `generateText` and `streamText` responses and feeds it back to the compression engine. No manual `reportTokenUsage()` calls needed.
@@ -100,6 +122,7 @@ const wrappedModel = withContextChef(model, options);
 | `truncate.threshold` | `number` | Yes (if truncate) | Character count to trigger truncation |
 | `truncate.headChars` | `number` | No | Characters to preserve from start (default: `0`) |
 | `truncate.tailChars` | `number` | No | Characters to preserve from end (default: `1000`) |
+| `truncate.storage` | `VFSStorageAdapter` | No | Storage adapter to persist original content before truncation |
 | `tokenizer` | `(msgs) => number` | No | Custom tokenizer for precise counting |
 | `onCompress` | `(summary, count) => void` | No | Hook called after compression |
@@ -132,11 +155,12 @@ const aiSdkPrompt = toAISDK(irMessages);
 ## How It Works
 ```
-generateText({ model: wrappedModel, messages })
+generateText / streamText ({ model: wrappedModel, messages })
   |
   v
 transformParams (before LLM call)
   1. Truncate large tool results (if configured)
+     - Optionally persist originals to storage adapter
   2. Convert AI SDK messages -> context-chef IR
   3. Run Janitor compression (if over token budget)
   4. Convert back to AI SDK messages
@@ -157,7 +181,7 @@ The middleware is **stateful** — it tracks token usage across calls to know wh
 ## Need More Control?
-The middleware covers the most common use case: transparent compression and truncation. For advanced features like dynamic state injection, tool namespaces, memory, or snapshot/restore, use [`@context-chef/core`](../core) directly.
+The middleware covers the most common use case: transparent compression and truncation. For advanced features like dynamic state injection, tool namespaces, memory, or snapshot/restore, use [`@context-chef/core`](https://www.npmjs.com/package/@context-chef/core) directly.
 ## License
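
The README changes above document `truncate.threshold`, `truncate.headChars` (default `0`), and `truncate.tailChars` (default `1000`). As a rough illustration of what head/tail truncation with those options could look like, here is a minimal sketch. It is not the package's implementation: the function name `truncateHeadTail`, the `TruncateOptions` type, the placeholder marker text, and the choice to truncate only when the length strictly exceeds the threshold are all assumptions made for this example.

```typescript
// Hypothetical options type mirroring the documented truncate.* fields.
interface TruncateOptions {
  threshold: number;  // character count that triggers truncation
  headChars?: number; // characters kept from the start (documented default: 0)
  tailChars?: number; // characters kept from the end (documented default: 1000)
}

// Hypothetical helper: keeps the head and tail of an oversized tool result
// and replaces the middle with a short marker.
function truncateHeadTail(text: string, opts: TruncateOptions): string {
  const { threshold, headChars = 0, tailChars = 1000 } = opts;

  // Assumption: content at or below the threshold passes through untouched.
  if (text.length <= threshold) return text;

  // If head + tail would cover the whole text, there is nothing to drop.
  if (headChars + tailChars >= text.length) return text;

  const head = text.slice(0, headChars);
  // slice(-0) would return the whole string, so handle tailChars === 0 explicitly.
  const tail = tailChars > 0 ? text.slice(text.length - tailChars) : '';
  const omitted = text.length - headChars - tailChars;

  // The marker format is illustrative only.
  return `${head}\n[... ${omitted} characters truncated ...]\n${tail}`;
}

// Usage with the values shown in the Quick Start diff.
const toolResult = 'a'.repeat(500) + 'b'.repeat(5000) + 'c'.repeat(1000);
const shortened = truncateHeadTail(toolResult, {
  threshold: 5000,
  headChars: 500,
  tailChars: 1000,
});
console.log(shortened.length < toolResult.length);
```

Keeping more of the tail than the head by default fits tool output such as logs, where the most recent lines tend to matter most; that reading of the defaults is an interpretation, not something the diff states.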

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "@context-chef/ai-sdk-middleware",
-  "version": "0.1.0",
+  "version": "0.1.2",
   "type": "module",
   "main": "./dist/index.cjs",
   "module": "./dist/index.mjs",