ai-retry 1.0.0-beta.1 → 1.0.0-beta.2
- package/README.md +134 -15
- package/dist/index.d.mts +2 -2
- package/dist/index.mjs +90 -55
- package/dist/retryables/index.d.mts +1 -1
- package/dist/{types-D7G-2JLh.d.mts → types-JvcFQz93.d.mts} +63 -24
- package/package.json +18 -18
package/README.md
CHANGED
@@ -176,6 +176,58 @@ const retryableModel = createRetryable({

In this example, if the base model fails with code 429 or a service overloaded error, it will retry with `gpt-4-mini` on Azure. In any other error case, it will fallback to `claude-3-haiku-20240307` on Anthropic. If the order would be reversed, the static retryable would catch all errors first, and the dynamic retryable would never be reached.

+#### Errors vs Results
+
+Dynamic retryables can be further divided based on what triggers them:
+
+- **Error-based retryables** handle API errors where the request throws an error (e.g., timeouts, rate limits, service unavailable, etc.)
+- **Result-based retryables** handle successful responses that still need retrying (e.g., content filtering, guardrails, etc.)
+
+Both types of retryables have the same interface and receive the current attempt as context. You can use the `isErrorAttempt` and `isResultAttempt` type guards to check the type of the current attempt.
+
+```typescript
+import { generateText } from 'ai';
+import { createRetryable, isErrorAttempt, isResultAttempt } from 'ai-retry';
+import type { Retryable } from 'ai-retry';
+
+// Error-based retryable: handles thrown errors (e.g., timeouts, rate limits)
+const errorBasedRetry: Retryable = (context) => {
+  if (isErrorAttempt(context.current)) {
+    const { error } = context.current;
+    // The request threw an error - e.g., network timeout, 429 rate limit
+    console.log('Request failed with error:', error);
+    return { model: anthropic('claude-3-haiku-20240307') };
+  }
+  return undefined;
+};
+
+// Result-based retryable: handles successful responses that need retrying
+const resultBasedRetry: Retryable = (context) => {
+  if (isResultAttempt(context.current)) {
+    const { result } = context.current;
+    // The request succeeded, but the response indicates a problem
+    if (result.finishReason === 'content-filter') {
+      console.log('Content was filtered, trying different model');
+      return { model: openai('gpt-4') };
+    }
+  }
+  return undefined;
+};
+
+const retryableModel = createRetryable({
+  model: azure('gpt-4-mini'),
+  retries: [
+    // Error-based: catches thrown errors like timeouts, rate limits, etc.
+    errorBasedRetry,
+
+    // Result-based: catches successful responses that need retrying
+    resultBasedRetry,
+  ],
+});
+```
+
+Result-based retryables are only available for generate calls like `generateText` and `generateObject`. They are not available for streaming calls like `streamText` and `streamObject`.
+
#### Fallbacks

If you don't need precise error matching with custom logic and just want to fallback to different models on any error, you can simply provide a list of models.
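Editor's note (not part of the diff): since both guards narrow the same `context.current` value, the two retryables above can also be folded into one function. A minimal sketch reusing only the `isErrorAttempt`/`isResultAttempt` guards and provider helpers shown in this hunk; provider credentials are assumed to be configured elsewhere:

```typescript
import { isErrorAttempt, isResultAttempt } from 'ai-retry';
import type { Retryable } from 'ai-retry';
import { anthropic } from '@ai-sdk/anthropic';
import { openai } from '@ai-sdk/openai';

// One retryable covering both trigger kinds from the example above.
const errorOrResultRetry: Retryable = (context) => {
  if (isErrorAttempt(context.current)) {
    // Thrown errors (timeouts, 429s, overload) -> switch provider.
    return { model: anthropic('claude-3-haiku-20240307') };
  }
  if (isResultAttempt(context.current) && context.current.result.finishReason === 'content-filter') {
    // Successful response that was content-filtered -> try a different model.
    return { model: openai('gpt-4') };
  }
  // Anything else: do not retry.
  return undefined;
};
```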
@@ -362,11 +414,23 @@ Handle service overload errors (status code 529) by switching to a provider.
import { serviceOverloaded } from 'ai-retry/retryables';

const retryableModel = createRetryable({
-  model:
+  model: anthropic('claude-sonnet-4-0'),
  retries: [
-
+    // Retry with delay and exponential backoff
+    serviceOverloaded(anthropic('claude-sonnet-4-0'), {
+      delay: 5_000,
+      backoffFactor: 2,
+      maxAttempts: 5,
+    }),
+    // Or switch to a different provider
+    serviceOverloaded(openai('gpt-4')),
  ],
});
+
+const result = streamText({
+  model: retryableModel,
+  prompt: 'Write a story about a robot...',
+});
```

#### Service Unavailable
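Editor's note (not part of the diff): with the `delay: 5_000` and `backoffFactor: 2` values used above, the wait before each repeated attempt of the same model grows as `delay × backoffFactor^attempt`, the formula this README documents for `backoffFactor`. A small sketch of that arithmetic; the helper name here is illustrative, not the package's internal implementation:

```typescript
// delay × backoffFactor^attempt, where `attempt` counts previous tries of the same model
const backoffDelay = (delay: number, backoffFactor: number, attempt: number): number =>
  delay * backoffFactor ** attempt;

// With delay: 5_000 and backoffFactor: 2 from the example above:
backoffDelay(5_000, 2, 0); // 1st retry waits  5_000 ms
backoffDelay(5_000, 2, 1); // 2nd retry waits 10_000 ms
backoffDelay(5_000, 2, 2); // 3rd retry waits 20_000 ms
```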
@@ -575,6 +639,58 @@ const result = await generateText({

The retry's `providerOptions` will completely replace the original ones during retry attempts. This works for all model types (language and embedding) and all operations (generate, stream, embed).

+#### Call Options
+
+You can override various call options when retrying requests. This is useful for adjusting parameters like temperature, max tokens, or even the prompt itself for retry attempts. Call options are specified in the `options` field of the retry object.
+
+```typescript
+const retryableModel = createRetryable({
+  model: openai('gpt-4'),
+  retries: [
+    {
+      model: anthropic('claude-3-haiku'),
+      options: {
+        // Override generation parameters for more deterministic output
+        temperature: 0.3,
+        topP: 0.9,
+        maxOutputTokens: 500,
+        // Set a seed for reproducibility
+        seed: 42,
+      },
+    },
+  ],
+});
+```
+
+The following options can be overridden:
+
+> [!NOTE]
+> Override options completely replace the original values (they are not merged). If you don't specify an option, the original value from the request is used.
+
+##### Language Model Options
+
+| Option | Description |
+|--------|-------------|
+| [`prompt`](https://ai-sdk.dev/docs/reference/ai-sdk-core/generate-text#prompt) | Override the entire prompt for the retry |
+| [`temperature`](https://ai-sdk.dev/docs/reference/ai-sdk-core/generate-text#temperature) | Temperature setting for controlling randomness |
+| [`topP`](https://ai-sdk.dev/docs/reference/ai-sdk-core/generate-text#topp) | Nucleus sampling parameter |
+| [`topK`](https://ai-sdk.dev/docs/reference/ai-sdk-core/generate-text#topk) | Top-K sampling parameter |
+| [`maxOutputTokens`](https://ai-sdk.dev/docs/reference/ai-sdk-core/generate-text#max-output-tokens) | Maximum number of tokens to generate |
+| [`seed`](https://ai-sdk.dev/docs/reference/ai-sdk-core/generate-text#seed) | Random seed for deterministic generation |
+| [`stopSequences`](https://ai-sdk.dev/docs/reference/ai-sdk-types/generate-text#stopsequences) | Stop sequences to end generation |
+| [`presencePenalty`](https://ai-sdk.dev/docs/reference/ai-sdk-core/generate-text#presencepenalty) | Presence penalty for reducing repetition |
+| [`frequencyPenalty`](https://ai-sdk.dev/docs/reference/ai-sdk-core/generate-text#frequencypenalty) | Frequency penalty for reducing repetition |
+| [`headers`](https://ai-sdk.dev/docs/reference/ai-sdk-core/generate-text#headers) | Additional HTTP headers |
+| [`providerOptions`](https://ai-sdk.dev/docs/reference/ai-sdk-types/generate-text#provideroptions) | Provider-specific options |
+
+##### Embedding Model Options
+
+| Option | Description |
+|--------|-------------|
+| [`values`](https://ai-sdk.dev/docs/reference/ai-sdk-core/embed#values) | Override the values to embed |
+| [`headers`](https://ai-sdk.dev/docs/reference/ai-sdk-core/embed#headers) | Additional HTTP headers |
+| [`providerOptions`](https://ai-sdk.dev/docs/reference/ai-sdk-core/embed#provideroptions) | Provider-specific options |
+
#### Logging

You can use the following callbacks to log retry attempts and errors:
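Editor's note (not part of the diff): because unspecified options fall through to the original request (per the NOTE above), a retry entry only needs to list what it changes. A minimal sketch that only tightens the output budget and tags the fallback attempt with a header, leaving temperature, prompt, and the remaining options untouched; providers are assumed to be configured as in the earlier examples:

```typescript
const retryableModel = createRetryable({
  model: openai('gpt-4'),
  retries: [
    {
      model: anthropic('claude-3-haiku'),
      options: {
        // Only these two fields are replaced; every other call option
        // keeps the value from the original request.
        maxOutputTokens: 256,
        headers: { 'x-retry-reason': 'fallback' },
      },
    },
  ],
});
```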
@@ -641,7 +757,7 @@ type Retryable = (

#### `Retry`

-A `Retry` specifies the model to retry and optional settings
+A `Retry` specifies the model to retry and optional settings. The available options depend on the model type (language model or embedding model).

```typescript
interface Retry {
@@ -650,18 +766,11 @@ interface Retry {
  delay?: number; // Delay in milliseconds before retrying
  backoffFactor?: number; // Multiplier for exponential backoff
  timeout?: number; // Timeout in milliseconds for the retry attempt
-  providerOptions?: ProviderOptions; //
+  providerOptions?: ProviderOptions; // @deprecated - use options.providerOptions instead
+  options?: LanguageModelV2CallOptions | EmbeddingModelV2CallOptions; // Call options to override for this retry
}
```

-**Options:**
-- `model`: The model to use for the retry attempt.
-- `maxAttempts`: Maximum number of times this model can be retried. Default is 1.
-- `delay`: Delay in milliseconds to wait before retrying. The delay respects abort signals from the request.
-- `backoffFactor`: Multiplier for exponential backoff (`delay × backoffFactor^attempt`). If not provided, uses fixed delay.
-- `timeout`: Timeout in milliseconds for creating a fresh `AbortSignal.timeout()` for the retry attempt. This replaces any existing abort signal.
-- `providerOptions`: Provider-specific options that override the original request's provider options during retry attempts.
-
#### `RetryContext`

The `RetryContext` object contains information about the current attempt and all previous attempts.
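Editor's note (not part of the diff): given the deprecation marked above, configurations that set `providerOptions` directly on a retry can move the same object under `options`. A before/after sketch, assuming the `anthropic` provider from the earlier examples; the provider-specific keys are placeholders:

```typescript
// Before (still accepted, but deprecated):
const legacyRetry = {
  model: anthropic('claude-3-haiku'),
  providerOptions: { anthropic: { /* provider-specific settings */ } },
};

// After: the same object nested under `options`. If both are present,
// `options.providerOptions` takes precedence over the top-level field.
const migratedRetry = {
  model: anthropic('claude-3-haiku'),
  options: {
    providerOptions: { anthropic: { /* provider-specific settings */ } },
  },
};
```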
@@ -675,13 +784,23 @@ interface RetryContext {

#### `RetryAttempt`

-A `RetryAttempt` represents a single attempt with a specific model, which can be either an error or a successful result that triggered a retry.
+A `RetryAttempt` represents a single attempt with a specific model, which can be either an error or a successful result that triggered a retry. Each attempt includes the call options that were used for that specific attempt. For retry attempts, this will reflect any overridden options from the retry configuration.

```typescript
// For both language and embedding models
type RetryAttempt =
-  | {
-
+  | {
+      type: 'error';
+      error: unknown;
+      model: LanguageModelV2 | EmbeddingModelV2;
+      options: LanguageModelV2CallOptions | EmbeddingModelV2CallOptions;
+    }
+  | {
+      type: 'result';
+      result: LanguageModelV2Generate;
+      model: LanguageModelV2;
+      options: LanguageModelV2CallOptions;
+    };

// Note: Result-based retries only apply to language models, not embedding models

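Editor's note (not part of the diff): since each attempt now carries the call options it was made with, a dynamic retryable can base its decision on them. A minimal sketch, assuming the `openai` provider and the `isErrorAttempt` guard from the earlier examples:

```typescript
const budgetAwareRetry: Retryable = (context) => {
  if (isErrorAttempt(context.current)) {
    const { options } = context.current;
    // Only fall back when the failed attempt asked for a large output budget,
    // and shrink that budget for the retry.
    if ('maxOutputTokens' in options && (options.maxOutputTokens ?? 0) > 1_000) {
      return {
        model: openai('gpt-4'),
        options: { maxOutputTokens: 500 },
      };
    }
  }
  return undefined;
};
```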
package/dist/index.d.mts
CHANGED
@@ -1,4 +1,4 @@
-import { S as
+import { C as RetryableModelOptions, S as Retryable, _ as Retry, a as GatewayLanguageModelId, b as RetryErrorAttempt, c as LanguageModelGenerate, d as LanguageModelStreamPart, f as ProviderOptions, g as Retries, h as ResolvedModel, i as EmbeddingModelRetryCallOptions, l as LanguageModelRetryCallOptions, m as ResolvableModel, n as EmbeddingModelCallOptions, o as LanguageModel, p as ResolvableLanguageModel, r as EmbeddingModelEmbed, s as LanguageModelCallOptions, t as EmbeddingModel, u as LanguageModelStream, v as RetryAttempt, w as RetryableOptions, x as RetryResultAttempt, y as RetryContext } from "./types-JvcFQz93.mjs";
import * as _ai_sdk_provider0 from "@ai-sdk/provider";

//#region src/create-retryable-model.d.ts
@@ -63,4 +63,4 @@ declare const isStreamContentPart: (part: LanguageModelStreamPart) => part is _a
  rawValue: unknown;
};
//#endregion
-export { EmbeddingModel, EmbeddingModelCallOptions, EmbeddingModelEmbed, GatewayLanguageModelId, LanguageModel, LanguageModelCallOptions, LanguageModelGenerate, LanguageModelStream, LanguageModelStreamPart, ProviderOptions, ResolvableLanguageModel, ResolvableModel, ResolvedModel, Retries, Retry, RetryAttempt, RetryContext, RetryErrorAttempt, RetryResultAttempt, Retryable, RetryableModelOptions, RetryableOptions, createRetryable, getModelKey, isEmbeddingModel, isErrorAttempt, isGenerateResult, isLanguageModel, isModel, isObject, isResultAttempt, isStreamContentPart, isStreamResult, isString };
+export { EmbeddingModel, EmbeddingModelCallOptions, EmbeddingModelEmbed, EmbeddingModelRetryCallOptions, GatewayLanguageModelId, LanguageModel, LanguageModelCallOptions, LanguageModelGenerate, LanguageModelRetryCallOptions, LanguageModelStream, LanguageModelStreamPart, ProviderOptions, ResolvableLanguageModel, ResolvableModel, ResolvedModel, Retries, Retry, RetryAttempt, RetryContext, RetryErrorAttempt, RetryResultAttempt, Retryable, RetryableModelOptions, RetryableOptions, createRetryable, getModelKey, isEmbeddingModel, isErrorAttempt, isGenerateResult, isLanguageModel, isModel, isObject, isResultAttempt, isStreamContentPart, isStreamResult, isString };
package/dist/index.mjs
CHANGED
@@ -130,6 +130,19 @@ var RetryableEmbeddingModel = class {
    return typeof this.options.disabled === "function" ? this.options.disabled() : this.options.disabled;
  }
  /**
+  * Get the retry call options overrides from a retry configuration.
+  */
+  getRetryCallOptions(callOptions, currentRetry) {
+    const retryOptions = currentRetry?.options ?? {};
+    return {
+      ...callOptions,
+      values: retryOptions.values ?? callOptions.values,
+      headers: retryOptions.headers ?? callOptions.headers,
+      providerOptions: retryOptions.providerOptions ?? currentRetry?.providerOptions ?? callOptions.providerOptions,
+      abortSignal: currentRetry?.timeout ? AbortSignal.timeout(currentRetry.timeout) : callOptions.abortSignal
+    };
+  }
+  /**
  * Execute a function with retry logic for handling errors
  */
  async withRetry(input) {
@@ -160,13 +173,17 @@ var RetryableEmbeddingModel = class {
        };
        this.options.onRetry?.(context);
      }
+      /**
+      * Get the retry call options overrides for this attempt
+      */
+      const retryCallOptions = this.getRetryCallOptions(input.callOptions, currentRetry);
      try {
        return {
-          result: await input.fn(
+          result: await input.fn(retryCallOptions),
          attempts
        };
      } catch (error) {
-        const { retryModel, attempt } = await this.handleError(error, attempts);
+        const { retryModel, attempt } = await this.handleError(error, attempts, retryCallOptions);
        attempts.push(attempt);
        if (retryModel.delay) {
          /**
@@ -178,7 +195,7 @@ var RetryableEmbeddingModel = class {
          * - Attempt 3: 4000ms
          */
          const modelAttemptsCount = countModelAttempts(retryModel.model, attempts);
-          await delay(calculateExponentialBackoff(retryModel.delay, retryModel.backoffFactor, modelAttemptsCount), { abortSignal:
+          await delay(calculateExponentialBackoff(retryModel.delay, retryModel.backoffFactor, modelAttemptsCount), { abortSignal: retryCallOptions.abortSignal });
        }
        this.currentModel = retryModel.model;
        currentRetry = retryModel;
@@ -188,11 +205,12 @@ var RetryableEmbeddingModel = class {
  /**
  * Handle an error and determine if a retry is needed
  */
-  async handleError(error, attempts) {
+  async handleError(error, attempts, callOptions) {
    const errorAttempt = {
      type: "error",
      error,
-      model: this.currentModel
+      model: this.currentModel,
+      options: callOptions
    };
    /**
    * Save the current attempt
@@ -217,7 +235,7 @@ var RetryableEmbeddingModel = class {
      attempt: errorAttempt
    };
  }
-  async doEmbed(
+  async doEmbed(callOptions) {
    /**
    * Always start with the original model
    */
@@ -225,17 +243,12 @@ var RetryableEmbeddingModel = class {
    /**
    * If retries are disabled, bypass retry machinery entirely
    */
-    if (this.isDisabled()) return this.currentModel.doEmbed(
+    if (this.isDisabled()) return this.currentModel.doEmbed(callOptions);
    const { result } = await this.withRetry({
-      fn: async (
-
-        ...options,
-        providerOptions: currentRetry?.providerOptions ?? options.providerOptions,
-        abortSignal: currentRetry?.timeout ? AbortSignal.timeout(currentRetry.timeout) : options.abortSignal
-      };
-        return this.currentModel.doEmbed(callOptions);
+      fn: async (retryCallOptions) => {
+        return this.currentModel.doEmbed(retryCallOptions);
      },
-
+      callOptions
    });
    return result;
  }
@@ -270,6 +283,27 @@ var RetryableLanguageModel = class {
    return typeof this.options.disabled === "function" ? this.options.disabled() : this.options.disabled;
  }
  /**
+  * Get the retry call options overrides from a retry configuration.
+  */
+  getRetryCallOptions(callOptions, currentRetry) {
+    const retryOptions = currentRetry?.options ?? {};
+    return {
+      ...callOptions,
+      prompt: retryOptions.prompt ?? callOptions.prompt,
+      maxOutputTokens: retryOptions.maxOutputTokens ?? callOptions.maxOutputTokens,
+      temperature: retryOptions.temperature ?? callOptions.temperature,
+      stopSequences: retryOptions.stopSequences ?? callOptions.stopSequences,
+      topP: retryOptions.topP ?? callOptions.topP,
+      topK: retryOptions.topK ?? callOptions.topK,
+      presencePenalty: retryOptions.presencePenalty ?? callOptions.presencePenalty,
+      frequencyPenalty: retryOptions.frequencyPenalty ?? callOptions.frequencyPenalty,
+      seed: retryOptions.seed ?? callOptions.seed,
+      headers: retryOptions.headers ?? callOptions.headers,
+      providerOptions: retryOptions.providerOptions ?? currentRetry?.providerOptions ?? callOptions.providerOptions,
+      abortSignal: currentRetry?.timeout ? AbortSignal.timeout(currentRetry.timeout) : callOptions.abortSignal
+    };
+  }
+  /**
  * Execute a function with retry logic for handling errors
  */
  async withRetry(input) {
@@ -280,7 +314,7 @@ var RetryableLanguageModel = class {
    /**
    * Track current retry configuration.
    */
-    let currentRetry;
+    let currentRetry = input.currentRetry;
    while (true) {
      /**
      * The previous attempt that triggered a retry, or undefined if this is the first attempt
@@ -300,16 +334,20 @@ var RetryableLanguageModel = class {
        };
        this.options.onRetry?.(context);
      }
+      /**
+      * Get the retry call options overrides for this attempt
+      */
+      const retryCallOptions = this.getRetryCallOptions(input.callOptions, currentRetry);
      try {
        /**
        * Call the function that may need to be retried
        */
-        const result = await input.fn(
+        const result = await input.fn(retryCallOptions);
        /**
        * Check if the result should trigger a retry (only for generate results, not streams)
        */
        if (isGenerateResult(result)) {
-          const { retryModel, attempt } = await this.handleResult(result, attempts);
+          const { retryModel, attempt } = await this.handleResult(result, attempts, retryCallOptions);
          attempts.push(attempt);
          if (retryModel) {
            if (retryModel.delay) {
@@ -322,7 +360,7 @@ var RetryableLanguageModel = class {
              * - Attempt 3: 4000ms
              */
              const modelAttemptsCount = countModelAttempts(retryModel.model, attempts);
-              await delay(calculateExponentialBackoff(retryModel.delay, retryModel.backoffFactor, modelAttemptsCount), { abortSignal:
+              await delay(calculateExponentialBackoff(retryModel.delay, retryModel.backoffFactor, modelAttemptsCount), { abortSignal: retryCallOptions.abortSignal });
            }
            this.currentModel = retryModel.model;
            currentRetry = retryModel;
@@ -337,7 +375,7 @@ var RetryableLanguageModel = class {
          attempts
        };
      } catch (error) {
-        const { retryModel, attempt } = await this.handleError(error, attempts);
+        const { retryModel, attempt } = await this.handleError(error, attempts, retryCallOptions);
        attempts.push(attempt);
        if (retryModel.delay) {
          /**
@@ -345,7 +383,7 @@ var RetryableLanguageModel = class {
          * The delay grows exponentially: baseDelay * backoffFactor^attempts
          */
          const modelAttemptsCount = countModelAttempts(retryModel.model, attempts);
-          await delay(calculateExponentialBackoff(retryModel.delay, retryModel.backoffFactor, modelAttemptsCount), { abortSignal:
+          await delay(calculateExponentialBackoff(retryModel.delay, retryModel.backoffFactor, modelAttemptsCount), { abortSignal: retryCallOptions.abortSignal });
        }
        this.currentModel = retryModel.model;
        currentRetry = retryModel;
@@ -355,11 +393,12 @@ var RetryableLanguageModel = class {
  /**
  * Handle a successful result and determine if a retry is needed
  */
-  async handleResult(result, attempts) {
+  async handleResult(result, attempts, callOptions) {
    const resultAttempt = {
      type: "result",
      result,
-      model: this.currentModel
+      model: this.currentModel,
+      options: callOptions
    };
    const context = {
      current: resultAttempt,
@@ -373,11 +412,12 @@ var RetryableLanguageModel = class {
  /**
  * Handle an error and determine if a retry is needed
  */
-  async handleError(error, attempts) {
+  async handleError(error, attempts, callOptions) {
    const errorAttempt = {
      type: "error",
      error,
-      model: this.currentModel
+      model: this.currentModel,
+      options: callOptions
    };
    /**
    * Save the current attempt
@@ -402,7 +442,7 @@ var RetryableLanguageModel = class {
      attempt: errorAttempt
    };
  }
-  async doGenerate(
+  async doGenerate(callOptions) {
    /**
    * Always start with the original model
    */
@@ -410,21 +450,16 @@ var RetryableLanguageModel = class {
    /**
    * If retries are disabled, bypass retry machinery entirely
    */
-    if (this.isDisabled()) return this.currentModel.doGenerate(
+    if (this.isDisabled()) return this.currentModel.doGenerate(callOptions);
    const { result } = await this.withRetry({
-      fn: async (
-
-        ...options,
-        providerOptions: currentRetry?.providerOptions ?? options.providerOptions,
-        abortSignal: currentRetry?.timeout ? AbortSignal.timeout(currentRetry.timeout) : options.abortSignal
-      };
-        return this.currentModel.doGenerate(callOptions);
+      fn: async (retryCallOptions) => {
+        return this.currentModel.doGenerate(retryCallOptions);
      },
-
+      callOptions
    });
    return result;
  }
-  async doStream(
+  async doStream(callOptions) {
    /**
    * Always start with the original model
    */
@@ -432,22 +467,21 @@ var RetryableLanguageModel = class {
    /**
    * If retries are disabled, bypass retry machinery entirely
    */
-    if (this.isDisabled()) return this.currentModel.doStream(
+    if (this.isDisabled()) return this.currentModel.doStream(callOptions);
    /**
    * Perform the initial call to doStream with retry logic to handle errors before any data is streamed.
    */
    let { result, attempts } = await this.withRetry({
-      fn: async (
-
-        ...options,
-        providerOptions: currentRetry?.providerOptions ?? options.providerOptions,
-        abortSignal: currentRetry?.timeout ? AbortSignal.timeout(currentRetry.timeout) : options.abortSignal
-      };
-        return this.currentModel.doStream(callOptions);
+      fn: async (retryCallOptions) => {
+        return this.currentModel.doStream(retryCallOptions);
      },
-
+      callOptions
    });
    /**
+    * Track the current retry model for computing call options in the stream handler
+    */
+    let currentRetry;
+    /**
    * Wrap the original stream to handle retries if an error occurs during streaming.
    */
    const retryableStream = new ReadableStream({ start: async (controller) => {
@@ -477,11 +511,15 @@ var RetryableLanguageModel = class {
          controller.close();
          break;
        } catch (error) {
+          /**
+          * Get the retry call options for the failed attempt
+          */
+          const retryCallOptions = this.getRetryCallOptions(callOptions, currentRetry);
          /**
          * Check if the error from the stream can be retried.
          * Otherwise it will rethrow the error.
          */
-          const { retryModel, attempt } = await this.handleError(error, attempts);
+          const { retryModel, attempt } = await this.handleError(error, attempts, retryCallOptions);
          /**
          * Save the attempt
          */
@@ -492,24 +530,21 @@ var RetryableLanguageModel = class {
            * The delay grows exponentially: baseDelay * backoffFactor^attempts
            */
            const modelAttemptsCount = countModelAttempts(retryModel.model, attempts);
-            await delay(calculateExponentialBackoff(retryModel.delay, retryModel.backoffFactor, modelAttemptsCount), { abortSignal:
+            await delay(calculateExponentialBackoff(retryModel.delay, retryModel.backoffFactor, modelAttemptsCount), { abortSignal: retryCallOptions.abortSignal });
          }
          this.currentModel = retryModel.model;
+          currentRetry = retryModel;
          /**
          * Retry the request by calling doStream again.
          * This will create a new stream.
          */
          const retriedResult = await this.withRetry({
-            fn: async () => {
-
-              ...options,
-              providerOptions: retryModel.providerOptions ?? options.providerOptions,
-              abortSignal: retryModel.timeout ? AbortSignal.timeout(retryModel.timeout) : options.abortSignal
-            };
-              return this.currentModel.doStream(callOptions);
+            fn: async (retryCallOptions$1) => {
+              return this.currentModel.doStream(retryCallOptions$1);
            },
+            callOptions,
            attempts,
-
+            currentRetry
          });
          /**
          * Cancel the previous reader and stream if we are retrying
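Editor's note (not part of the diff): the two `getRetryCallOptions` helpers added above implement the README's "replace, don't merge" rule: each overridable field is taken from the retry's `options` when present, otherwise from the original call options, via nullish coalescing, with the deprecated top-level `providerOptions` as a middle fallback. A condensed sketch of that behaviour (illustrative only, not the package's code; the real helpers only consider the documented whitelist of fields):

```typescript
// Per-field `retry ?? original` selection; values are replaced, never deep-merged.
function resolveRetryCallOptions<T extends Record<string, unknown>>(
  callOptions: T,
  retryOptions: Partial<T>,
): T {
  const overrides = Object.fromEntries(
    Object.entries(retryOptions).filter(([, value]) => value !== undefined),
  );
  return { ...callOptions, ...overrides } as T;
}

// Only `temperature` is overridden; `topP` keeps its original value.
resolveRetryCallOptions(
  { temperature: 1, topP: 0.95 },
  { temperature: 0.3 },
); // -> { temperature: 0.3, topP: 0.95 }
```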
package/dist/retryables/index.d.mts
CHANGED
@@ -1,4 +1,4 @@
-import { S as
+import { S as Retryable, p as ResolvableLanguageModel, t as EmbeddingModel, w as RetryableOptions } from "../types-JvcFQz93.mjs";

//#region src/retryables/content-filter-triggered.d.ts

package/dist/{types-D7G-2JLh.d.mts → types-JvcFQz93.d.mts}
CHANGED
@@ -4,7 +4,7 @@ import { EmbeddingModelV3, LanguageModelV3, LanguageModelV3CallOptions, Language
//#region src/types.d.ts
type Literals<T> = T extends string ? string extends T ? never : T : never;
type LanguageModel = LanguageModelV3;
-type EmbeddingModel
+type EmbeddingModel = EmbeddingModelV3;
type LanguageModelCallOptions = LanguageModelV3CallOptions;
type LanguageModelStreamPart = LanguageModelV3StreamPart;
type ProviderOptions = SharedV3ProviderOptions;
@@ -13,28 +13,13 @@ type ResolvableLanguageModel = LanguageModel | Literals<GatewayLanguageModelId>;
type ResolvableModel<MODEL extends LanguageModel | EmbeddingModel> = MODEL extends LanguageModel ? ResolvableLanguageModel : EmbeddingModel;
type ResolvedModel<MODEL extends ResolvableLanguageModel | EmbeddingModel> = MODEL extends ResolvableLanguageModel ? LanguageModel : EmbeddingModel;
/**
-*
+* Call options that can be overridden during retry for language models.
*/
-
-  model: MODEL;
-  retries: Retries<MODEL>;
-  disabled?: boolean | (() => boolean);
-  onError?: (context: RetryContext<MODEL>) => void;
-  onRetry?: (context: RetryContext<MODEL>) => void;
-}
+type LanguageModelRetryCallOptions = Partial<Pick<LanguageModelCallOptions, 'prompt' | 'maxOutputTokens' | 'temperature' | 'stopSequences' | 'topP' | 'topK' | 'presencePenalty' | 'frequencyPenalty' | 'seed' | 'headers' | 'providerOptions'>>;
/**
-*
+* Call options that can be overridden during retry for embedding models.
*/
-type
-/**
-* Current attempt that caused the retry
-*/
-  current: RetryAttempt<ResolvedModel<MODEL>>;
-/**
-* All attempts made so far, including the current one
-*/
-  attempts: Array<RetryAttempt<ResolvedModel<MODEL>>>;
-};
+type EmbeddingModelRetryCallOptions = Partial<Pick<EmbeddingModelCallOptions, 'values' | 'headers' | 'providerOptions'>>;
/**
* A retry attempt with an error
*/
@@ -43,6 +28,10 @@ type RetryErrorAttempt<MODEL extends LanguageModel | EmbeddingModel> = {
  error: unknown;
  result?: undefined;
  model: MODEL;
+  /**
+  * The call options used for this attempt.
+  */
+  options: MODEL extends LanguageModel ? LanguageModelCallOptions : EmbeddingModelCallOptions;
};
/**
* A retry attempt with a successful result
@@ -52,11 +41,38 @@ type RetryResultAttempt = {
  result: LanguageModelGenerate;
  error?: undefined;
  model: LanguageModel;
+  /**
+  * The call options used for this attempt.
+  */
+  options: LanguageModelCallOptions;
};
/**
* A retry attempt with either an error or a result and the model used
*/
type RetryAttempt<MODEL extends LanguageModel | EmbeddingModel> = RetryErrorAttempt<MODEL> | RetryResultAttempt;
+/**
+* The context provided to Retryables with the current attempt and all previous attempts.
+*/
+type RetryContext<MODEL extends ResolvableLanguageModel | EmbeddingModel> = {
+  /**
+  * Current attempt that caused the retry
+  */
+  current: RetryAttempt<ResolvedModel<MODEL>>;
+  /**
+  * All attempts made so far, including the current one
+  */
+  attempts: Array<RetryAttempt<ResolvedModel<MODEL>>>;
+};
+/**
+* Options for creating a retryable model.
+*/
+interface RetryableModelOptions<MODEL extends LanguageModel | EmbeddingModel> {
+  model: MODEL;
+  retries: Retries<MODEL>;
+  disabled?: boolean | (() => boolean);
+  onError?: (context: RetryContext<MODEL>) => void;
+  onRetry?: (context: RetryContext<MODEL>) => void;
+}
/**
* A model to retry with and the maximum number of attempts for that model.
*
@@ -70,11 +86,34 @@ type RetryAttempt<MODEL extends LanguageModel | EmbeddingModel> = RetryErrorAtte
*/
type Retry<MODEL extends ResolvableLanguageModel | EmbeddingModel> = {
  model: MODEL;
+  /**
+  * Maximum number of attempts for this model.
+  */
  maxAttempts?: number;
+  /**
+  * Delay in milliseconds before retrying.
+  */
  delay?: number;
+  /**
+  * Factor to multiply the delay by for exponential backoff.
+  */
  backoffFactor?: number;
-
+  /**
+  * Timeout in milliseconds for the retry request.
+  * Creates a new AbortSignal with this timeout.
+  */
  timeout?: number;
+  /**
+  * Call options to override for this retry.
+  */
+  options?: MODEL extends LanguageModel ? Partial<LanguageModelRetryCallOptions> : Partial<EmbeddingModelRetryCallOptions>;
+  /**
+  * @deprecated Use `options.providerOptions` instead.
+  * Provider options to override for this retry.
+  * If both `providerOptions` and `options.providerOptions` are set,
+  * `options.providerOptions` takes precedence.
+  */
+  providerOptions?: SharedV3ProviderOptions;
};
/**
* A function that determines whether to retry with a different model based on the current attempt and all previous attempts.
@@ -84,7 +123,7 @@ type Retries<MODEL extends LanguageModel | EmbeddingModel> = Array<Retryable<Res
type RetryableOptions<MODEL extends ResolvableLanguageModel | EmbeddingModel> = Partial<Omit<Retry<MODEL>, 'model'>>;
type LanguageModelGenerate = Awaited<ReturnType<LanguageModel['doGenerate']>>;
type LanguageModelStream = Awaited<ReturnType<LanguageModel['doStream']>>;
-type EmbeddingModelCallOptions
-type EmbeddingModelEmbed
+type EmbeddingModelCallOptions = Parameters<EmbeddingModel['doEmbed']>[0];
+type EmbeddingModelEmbed = Awaited<ReturnType<EmbeddingModel['doEmbed']>>;
//#endregion
-export {
+export { RetryableModelOptions as C, Retryable as S, Retry as _, GatewayLanguageModelId as a, RetryErrorAttempt as b, LanguageModelGenerate as c, LanguageModelStreamPart as d, ProviderOptions as f, Retries as g, ResolvedModel as h, EmbeddingModelRetryCallOptions as i, LanguageModelRetryCallOptions as l, ResolvableModel as m, EmbeddingModelCallOptions as n, LanguageModel as o, ResolvableLanguageModel as p, EmbeddingModelEmbed as r, LanguageModelCallOptions as s, EmbeddingModel as t, LanguageModelStream as u, RetryAttempt as v, RetryableOptions as w, RetryResultAttempt as x, RetryContext as y };
package/package.json
CHANGED
@@ -1,6 +1,6 @@
{
  "name": "ai-retry",
-  "version": "1.0.0-beta.
+  "version": "1.0.0-beta.2",
  "description": "AI SDK Retry",
  "main": "./dist/index.mjs",
  "module": "./dist/index.mjs",
@@ -33,30 +33,30 @@
    "ai": "6.x"
  },
  "devDependencies": {
-    "@ai-sdk/anthropic": "3.0.0-beta.
-    "@ai-sdk/azure": "3.0.0-beta.
-    "@ai-sdk/gateway": "2.0.
-    "@ai-sdk/groq": "3.0.0-beta.
-    "@ai-sdk/openai": "3.0.0-beta.
+    "@ai-sdk/anthropic": "3.0.0-beta.80",
+    "@ai-sdk/azure": "3.0.0-beta.91",
+    "@ai-sdk/gateway": "2.0.18",
+    "@ai-sdk/groq": "3.0.0-beta.48",
+    "@ai-sdk/openai": "3.0.0-beta.89",
    "@ai-sdk/test-server": "1.0.0-beta.1",
    "@arethetypeswrong/cli": "^0.18.2",
-    "@biomejs/biome": "^2.3.
+    "@biomejs/biome": "^2.3.8",
    "@total-typescript/tsconfig": "^1.0.4",
-    "@types/node": "^24.10.
-    "ai": "6.0.0-beta.
+    "@types/node": "^24.10.2",
+    "ai": "6.0.0-beta.139",
    "husky": "^9.1.7",
-    "msw": "^2.12.
-    "pkg-pr-new": "^0.0.
-    "publint": "^0.3.
-    "tsdown": "^0.
-    "tsx": "^4.
-    "typescript": "^5.9.
-    "vitest": "^4.0.
+    "msw": "^2.12.4",
+    "pkg-pr-new": "^0.0.62",
+    "publint": "^0.3.16",
+    "tsdown": "^0.17.2",
+    "tsx": "^4.21.0",
+    "typescript": "^5.9.3",
+    "vitest": "^4.0.15",
    "zod": "^4.1.13"
  },
  "dependencies": {
-    "@ai-sdk/provider": "3.0.0-beta.
-    "@ai-sdk/provider-utils": "4.0.0-beta.
+    "@ai-sdk/provider": "3.0.0-beta.26",
+    "@ai-sdk/provider-utils": "4.0.0-beta.45"
  },
  "scripts": {
    "build": "tsdown",