npm - @ai-sdk/xai - Versions diffs - 4.0.0-beta.3 → 4.0.0-beta.30 - Mend

@ai-sdk/xai 4.0.0-beta.3 → 4.0.0-beta.30

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (34) hide show

package/CHANGELOG.md +221 -8
package/README.md +2 -0
package/dist/index.d.mts +132 -50
package/dist/index.d.ts +132 -50
package/dist/index.js +421 -94
package/dist/index.js.map +1 -1
package/dist/index.mjs +433 -88
package/dist/index.mjs.map +1 -1
package/docs/01-xai.mdx +173 -381
package/package.json +4 -6
package/src/convert-to-xai-chat-messages.ts +22 -6
package/src/convert-xai-chat-usage.ts +2 -2
package/src/files/xai-files-api.ts +16 -0
package/src/files/xai-files-options.ts +15 -0
package/src/files/xai-files.ts +93 -0
package/src/index.ts +1 -0
package/src/map-xai-finish-reason.ts +2 -2
package/src/responses/convert-to-xai-responses-input.ts +21 -7
package/src/responses/convert-xai-responses-usage.ts +2 -2
package/src/responses/map-xai-responses-finish-reason.ts +3 -2
package/src/responses/xai-responses-api.ts +31 -1
package/src/responses/xai-responses-language-model.ts +104 -32
package/src/responses/xai-responses-options.ts +4 -0
package/src/responses/xai-responses-prepare-tools.ts +6 -6
package/src/xai-chat-language-model.ts +41 -22
package/src/xai-chat-options.ts +3 -6
package/src/xai-chat-prompt.ts +2 -1
package/src/xai-image-model.ts +35 -12
package/src/xai-image-options.ts +2 -0
package/src/xai-image-settings.ts +0 -2
package/src/xai-prepare-tools.ts +6 -6
package/src/xai-provider.ts +34 -21
package/src/xai-video-model.ts +127 -20
package/src/xai-video-options.ts +136 -14

package/docs/01-xai.mdx CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 title: xAI Grok
-description: Learn how to use xAI Grok.
+description: Learn how to use xAI Grok and Imagine.
 ---
 # xAI Grok Provider
@@ -73,13 +73,13 @@ You can use the following optional settings to customize the xAI provider instan
 ## Language Models
 You can create [xAI models](https://console.x.ai) using a provider instance. The
-first argument is the model id, e.g. `grok-3`.
+first argument is the model id, e.g. `grok-4.20-non-reasoning`.
 ```ts
-const model = xai('grok-3');
+const model = xai('grok-4.20-non-reasoning');
 ```
-By default, `xai(modelId)` uses the Chat API. To use the Responses API with server-side agentic tools, explicitly use `xai.responses(modelId)`.
+By default, `xai(modelId)` uses the Responses API. To use the [Chat Completions API](https://docs.x.ai/docs/api-reference#chat-completions) (legacy), use `xai.chat(modelId)`.
 ### Example
@@ -90,7 +90,7 @@ import { xai } from '@ai-sdk/xai';
 import { generateText } from 'ai';
 const { text } = await generateText({
-  model: xai('grok-3'),
+  model: xai('grok-4.20-non-reasoning'),
   prompt: 'Write a vegetarian lasagna recipe for 4 people.',
 });
 ```
@@ -99,50 +99,12 @@ xAI language models can also be used in the `streamText` function
 and support structured data generation with [`Output`](/docs/reference/ai-sdk-core/output)
 (see [AI SDK Core](/docs/ai-sdk-core)).
-### Provider Options
-xAI chat models support additional provider options that are not part of
-the [standard call settings](/docs/ai-sdk-core/settings). You can pass them in the `providerOptions` argument:
-```ts
-import { xai, type XaiLanguageModelChatOptions } from '@ai-sdk/xai';
-const model = xai('grok-3-mini');
-await generateText({
-  model,
-  providerOptions: {
-    xai: {
-      reasoningEffort: 'high',
-    } satisfies XaiLanguageModelChatOptions,
-  },
-});
-```
-The following optional provider options are available for xAI chat models:
-- **reasoningEffort** _'low' | 'high'_
-  Reasoning effort for reasoning models.
-- **logprobs** _boolean_
-  Return log probabilities for output tokens.
-- **topLogprobs** _number_
-  Number of most likely tokens to return per token position (0-8). When set, `logprobs` is automatically enabled.
-- **parallel_function_calling** _boolean_
-  Whether to enable parallel function calling during tool use. When true, the model can call multiple functions in parallel. When false, the model will call functions sequentially. Defaults to `true`.
 ## Responses API (Agentic Tools)
-You can use the xAI Responses API with the `xai.responses(modelId)` factory method for server-side agentic tool calling. This enables the model to autonomously orchestrate tool calls and research on xAI's servers.
+The xAI Responses API is the default when using `xai(modelId)`. You can also use `xai.responses(modelId)` explicitly. This enables the model to autonomously orchestrate tool calls and research on xAI's servers.
 ```ts
-const model = xai.responses('grok-4-fast-non-reasoning');
+const model = xai.responses('grok-4.20-non-reasoning');
 ```
 The Responses API provides server-side tools that the model can autonomously execute during its reasoning process:
@@ -164,7 +126,7 @@ import { xai } from '@ai-sdk/xai';
 import { generateText } from 'ai';
 const { text } = await generateText({
-  model: xai.responses('grok-2-vision-1212'),
+  model: xai.responses('grok-3'),
   messages: [
     {
       role: 'user',
@@ -186,7 +148,7 @@ import { xai } from '@ai-sdk/xai';
 import { generateText } from 'ai';
 const { text, sources } = await generateText({
-  model: xai.responses('grok-4-fast-non-reasoning'),
+  model: xai.responses('grok-4.20-non-reasoning'),
   prompt: 'What are the latest developments in AI?',
   tools: {
     web_search: xai.tools.webSearch({
@@ -220,7 +182,7 @@ The X search tool enables searching X (Twitter) for posts, with filtering by han
 ```ts
 const { text, sources } = await generateText({
-  model: xai.responses('grok-4-fast-non-reasoning'),
+  model: xai.responses('grok-4.20-non-reasoning'),
   prompt: 'What are people saying about AI on X this week?',
   tools: {
     x_search: xai.tools.xSearch({
@@ -266,7 +228,7 @@ The code execution tool enables the model to write and execute Python code for c
 ```ts
 const { text } = await generateText({
-  model: xai.responses('grok-4-fast-non-reasoning'),
+  model: xai.responses('grok-4.20-non-reasoning'),
   prompt:
     'Calculate the compound interest for $10,000 at 5% annually for 10 years',
   tools: {
@@ -281,7 +243,7 @@ The view image tool enables the model to view and analyze images:
 ```ts
 const { text } = await generateText({
-  model: xai.responses('grok-4-fast-non-reasoning'),
+  model: xai.responses('grok-4.20-non-reasoning'),
   prompt: 'Describe what you see in the image',
   tools: {
     view_image: xai.tools.viewImage(),
@@ -295,7 +257,7 @@ The view X video tool enables the model to view and analyze videos from X (Twitt
 ```ts
 const { text } = await generateText({
-  model: xai.responses('grok-4-fast-non-reasoning'),
+  model: xai.responses('grok-4.20-non-reasoning'),
   prompt: 'Summarize the content of this X video',
   tools: {
     view_x_video: xai.tools.viewXVideo(),
@@ -309,7 +271,7 @@ The MCP server tool enables the model to connect to remote [Model Context Protoc
 ```ts
 const { text } = await generateText({
-  model: xai.responses('grok-4-fast-non-reasoning'),
+  model: xai.responses('grok-4.20-non-reasoning'),
   prompt: 'Use the weather tool to check conditions in San Francisco',
   tools: {
     weather_server: xai.tools.mcpServer({
@@ -357,7 +319,7 @@ import { xai, type XaiLanguageModelResponsesOptions } from '@ai-sdk/xai';
 import { streamText } from 'ai';
 const result = streamText({
-  model: xai.responses('grok-4-1-fast-reasoning'),
+  model: xai.responses('grok-4.20-reasoning'),
   prompt: 'What documents do you have access to?',
   tools: {
     file_search: xai.tools.fileSearch({
@@ -390,7 +352,7 @@ const result = streamText({
   Include file search results in the response. When set to `['file_search_call.results']`, the response will contain the actual search results with file content and scores.
 <Note>
-  File search requires grok-4 family models and the Responses API. Vector stores
+  File search requires grok-4 family models (including grok-4.20) and the Responses API. Vector stores
   can be created using the [xAI
   API](https://docs.x.ai/docs/guides/using-collections/api).
 </Note>
@@ -404,7 +366,7 @@ import { xai } from '@ai-sdk/xai';
 import { streamText } from 'ai';
 const { fullStream } = streamText({
-  model: xai.responses('grok-4-fast-non-reasoning'),
+  model: xai.responses('grok-4.20-non-reasoning'),
   prompt: 'Research AI safety developments and calculate risk metrics',
   tools: {
     web_search: xai.tools.webSearch(),
@@ -438,7 +400,7 @@ import { xai, type XaiLanguageModelResponsesOptions } from '@ai-sdk/xai';
 import { generateText } from 'ai';
 const result = await generateText({
-  model: xai.responses('grok-4-fast-non-reasoning'),
+  model: xai.responses('grok-4.20-non-reasoning'),
   providerOptions: {
     xai: {
       reasoningEffort: 'high',
@@ -479,311 +441,20 @@ The following provider options are available:
   tools with client-side function tools in the same request.
 </Note>
-## Live Search
-xAI models support Live Search functionality, allowing them to query real-time data from various sources and include it in responses with citations.
-### Basic Search
-To enable search, specify `searchParameters` with a search mode:
-```ts
-import { xai, type XaiLanguageModelChatOptions } from '@ai-sdk/xai';
-import { generateText } from 'ai';
-const { text, sources } = await generateText({
-  model: xai('grok-3-latest'),
-  prompt: 'What are the latest developments in AI?',
-  providerOptions: {
-    xai: {
-      searchParameters: {
-        mode: 'auto', // 'auto', 'on', or 'off'
-        returnCitations: true,
-        maxSearchResults: 5,
-      },
-    } satisfies XaiLanguageModelChatOptions,
-  },
-});
-console.log(text);
-console.log('Sources:', sources);
-```
-### Search Parameters
-The following search parameters are available:
-- **mode** _'auto' | 'on' | 'off'_
-  Search mode preference:
-  - `'auto'` (default): Model decides whether to search
-  - `'on'`: Always enables search
-  - `'off'`: Disables search completely
-- **returnCitations** _boolean_
-  Whether to return citations in the response. Defaults to `true`.
-- **fromDate** _string_
-  Start date for search data in ISO8601 format (`YYYY-MM-DD`).
-- **toDate** _string_
-  End date for search data in ISO8601 format (`YYYY-MM-DD`).
-- **maxSearchResults** _number_
-  Maximum number of search results to consider. Defaults to 20, max 50.
-- **sources** _Array&lt;SearchSource&gt;_
-  Data sources to search from. Defaults to `["web", "x"]` if not specified.
-### Search Sources
-You can specify different types of data sources for search:
-#### Web Search
-```ts
-import { xai, type XaiLanguageModelChatOptions } from '@ai-sdk/xai';
-const result = await generateText({
-  model: xai('grok-3-latest'),
-  prompt: 'Best ski resorts in Switzerland',
-  providerOptions: {
-    xai: {
-      searchParameters: {
-        mode: 'on',
-        sources: [
-          {
-            type: 'web',
-            country: 'CH', // ISO alpha-2 country code
-            allowedWebsites: ['ski.com', 'snow-forecast.com'],
-            safeSearch: true,
-          },
-        ],
-      },
-    } satisfies XaiLanguageModelChatOptions,
-  },
-});
-```
-#### Web source parameters
-- **country** _string_: ISO alpha-2 country code
-- **allowedWebsites** _string[]_: Max 5 allowed websites
-- **excludedWebsites** _string[]_: Max 5 excluded websites
-- **safeSearch** _boolean_: Enable safe search (default: true)
-#### X (Twitter) Search
-```ts
-import { xai, type XaiLanguageModelChatOptions } from '@ai-sdk/xai';
-const result = await generateText({
-  model: xai('grok-3-latest'),
-  prompt: 'Latest updates on Grok AI',
-  providerOptions: {
-    xai: {
-      searchParameters: {
-        mode: 'on',
-        sources: [
-          {
-            type: 'x',
-            includedXHandles: ['grok', 'xai'],
-            excludedXHandles: ['openai'],
-            postFavoriteCount: 10,
-            postViewCount: 100,
-          },
-        ],
-      },
-    } satisfies XaiLanguageModelChatOptions,
-  },
-});
-```
-#### X source parameters
-- **includedXHandles** _string[]_: Array of X handles to search (without @ symbol)
-- **excludedXHandles** _string[]_: Array of X handles to exclude from search (without @ symbol)
-- **postFavoriteCount** _number_: Minimum favorite count of the X posts to consider.
-- **postViewCount** _number_: Minimum view count of the X posts to consider.
-#### News Search
-```ts
-import { xai, type XaiLanguageModelChatOptions } from '@ai-sdk/xai';
-const result = await generateText({
-  model: xai('grok-3-latest'),
-  prompt: 'Recent tech industry news',
-  providerOptions: {
-    xai: {
-      searchParameters: {
-        mode: 'on',
-        sources: [
-          {
-            type: 'news',
-            country: 'US',
-            excludedWebsites: ['tabloid.com'],
-            safeSearch: true,
-          },
-        ],
-      },
-    } satisfies XaiLanguageModelChatOptions,
-  },
-});
-```
-#### News source parameters
-- **country** _string_: ISO alpha-2 country code
-- **excludedWebsites** _string[]_: Max 5 excluded websites
-- **safeSearch** _boolean_: Enable safe search (default: true)
-#### RSS Feed Search
-```ts
-import { xai, type XaiLanguageModelChatOptions } from '@ai-sdk/xai';
-const result = await generateText({
-  model: xai('grok-3-latest'),
-  prompt: 'Latest status updates',
-  providerOptions: {
-    xai: {
-      searchParameters: {
-        mode: 'on',
-        sources: [
-          {
-            type: 'rss',
-            links: ['https://status.x.ai/feed.xml'],
-          },
-        ],
-      },
-    } satisfies XaiLanguageModelChatOptions,
-  },
-});
-```
-#### RSS source parameters
-- **links** _string[]_: Array of RSS feed URLs (max 1 currently supported)
-### Multiple Sources
-You can combine multiple data sources in a single search:
-```ts
-import { xai, type XaiLanguageModelChatOptions } from '@ai-sdk/xai';
-const result = await generateText({
-  model: xai('grok-3-latest'),
-  prompt: 'Comprehensive overview of recent AI breakthroughs',
-  providerOptions: {
-    xai: {
-      searchParameters: {
-        mode: 'on',
-        returnCitations: true,
-        maxSearchResults: 15,
-        sources: [
-          {
-            type: 'web',
-            allowedWebsites: ['arxiv.org', 'openai.com'],
-          },
-          {
-            type: 'news',
-            country: 'US',
-          },
-          {
-            type: 'x',
-            includedXHandles: ['openai', 'deepmind'],
-          },
-        ],
-      },
-    } satisfies XaiLanguageModelChatOptions,
-  },
-});
-```
-### Sources and Citations
-When search is enabled with `returnCitations: true`, the response includes sources that were used to generate the answer:
-```ts
-import { xai, type XaiLanguageModelChatOptions } from '@ai-sdk/xai';
-const { text, sources } = await generateText({
-  model: xai('grok-3-latest'),
-  prompt: 'What are the latest developments in AI?',
-  providerOptions: {
-    xai: {
-      searchParameters: {
-        mode: 'auto',
-        returnCitations: true,
-      },
-    } satisfies XaiLanguageModelChatOptions,
-  },
-});
-// Access the sources used
-for (const source of sources) {
-  if (source.sourceType === 'url') {
-    console.log('Source:', source.url);
-  }
-}
-```
-### Streaming with Search
-Live Search works with streaming responses. Citations are included when the stream completes:
-```ts
-import { xai, type XaiLanguageModelChatOptions } from '@ai-sdk/xai';
-import { streamText } from 'ai';
-const result = streamText({
-  model: xai('grok-3-latest'),
-  prompt: 'What has happened in tech recently?',
-  providerOptions: {
-    xai: {
-      searchParameters: {
-        mode: 'auto',
-        returnCitations: true,
-      },
-    } satisfies XaiLanguageModelChatOptions,
-  },
-});
-for await (const textPart of result.textStream) {
-  process.stdout.write(textPart);
-}
-console.log('Sources:', await result.sources);
-```
 ## Model Capabilities
 | Model                         | Image Input         | Object Generation   | Tool Usage          | Tool Streaming      | Reasoning           |
 | ----------------------------- | ------------------- | ------------------- | ------------------- | ------------------- | ------------------- |
-| `grok-4-1`                    | <Cross size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
+| `grok-4.20-reasoning`         | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
+| `grok-4.20-non-reasoning`     | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
 | `grok-4-1-fast-reasoning`     | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
 | `grok-4-1-fast-non-reasoning` | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
-| `grok-4-fast-non-reasoning`   | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
+| `grok-4-1`                    | <Cross size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
 | `grok-4-fast-reasoning`       | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
+| `grok-4-fast-non-reasoning`   | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
 | `grok-code-fast-1`            | <Cross size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
-| `grok-4`                      | <Cross size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
-| `grok-4-0709`                 | <Cross size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
-| `grok-4-latest`               | <Cross size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
 | `grok-3`                      | <Cross size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
-| `grok-3-latest`               | <Cross size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
 | `grok-3-mini`                 | <Cross size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
-| `grok-3-mini-latest`          | <Cross size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
-| `grok-2-vision`               | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
-| `grok-2-vision-latest`        | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
-| `grok-2-vision-1212`          | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Cross size={18} /> |
 <Note>
   The table above lists popular models. Please see the [xAI
@@ -800,7 +471,7 @@ import { xai } from '@ai-sdk/xai';
 import { generateImage } from 'ai';
 const { image } = await generateImage({
-  model: xai.image('grok-2-image'),
+  model: xai.image('grok-imagine-image'),
   prompt: 'A futuristic cityscape at sunset',
 });
 ```
@@ -813,7 +484,7 @@ const { image } = await generateImage({
 ### Image Editing
-xAI supports image editing through the `grok-2-image` and `grok-imagine-image` models. Pass input images via `prompt.images` to transform or edit existing images.
+xAI supports image editing through the `grok-imagine-image` model. Pass input images via `prompt.images` to transform or edit existing images.
 <Note>
   xAI image editing does not support masks. Editing is prompt-driven - describe
@@ -832,7 +503,7 @@ import { readFileSync } from 'fs';
 const imageBuffer = readFileSync('./input-image.png');
 const { images } = await generateImage({
-  model: xai.image('grok-2-image'),
+  model: xai.image('grok-imagine-image'),
   prompt: {
     text: 'Turn the cat into a golden retriever dog',
     images: [imageBuffer],
@@ -842,7 +513,7 @@ const { images } = await generateImage({
 #### Multi-Image Editing
-Combine or reference multiple input images (up to 3) in the prompt:
+Combine or reference multiple input images in the prompt:
 ```ts
 import { xai } from '@ai-sdk/xai';
@@ -869,7 +540,7 @@ Apply artistic styles to an image:
 const imageBuffer = readFileSync('./input-image.png');
 const { images } = await generateImage({
-  model: xai.image('grok-2-image'),
+  model: xai.image('grok-imagine-image'),
   prompt: {
     text: 'Transform this into a watercolor painting style',
     images: [imageBuffer],
@@ -880,38 +551,53 @@ const { images } = await generateImage({
 <Note>
   Input images can be provided as `Buffer`, `ArrayBuffer`, `Uint8Array`, or
-  base64-encoded strings. Up to 3 input images are supported per request.
+  base64-encoded strings.
 </Note>
-### Model-specific options
+### Image Provider Options
-You can customize the image generation behavior with model-specific settings:
+You can customize the image generation behavior with provider-specific settings via `providerOptions.xai`:
 ```ts
-import { xai } from '@ai-sdk/xai';
+import { xai, type XaiImageModelOptions } from '@ai-sdk/xai';
 import { generateImage } from 'ai';
 const { images } = await generateImage({
-  model: xai.image('grok-2-image'),
+  model: xai.image('grok-imagine-image-pro'),
   prompt: 'A futuristic cityscape at sunset',
   aspectRatio: '16:9',
-  n: 2,
+  providerOptions: {
+    xai: {
+      resolution: '2k',
+      quality: 'high',
+    } satisfies XaiImageModelOptions,
+  },
 });
 ```
-### Model Capabilities
+- **resolution** _'1k' | '2k'_
+  Output resolution. `1k` produces ~1024×1024 images, `2k` produces ~2048×2048
+  images (actual dimensions vary based on aspect ratio). Available for
+  `grok-imagine-image-pro`.
+- **quality** _'low' | 'medium' | 'high'_
+  Image quality level. Higher quality may increase generation time.
+### Image Model Capabilities
-| Model                | Aspect Ratios                                                                                               | Image Editing       |
-| -------------------- | ----------------------------------------------------------------------------------------------------------- | ------------------- |
-| `grok-2-image`       | `1:1`, `16:9`, `9:16`, `4:3`, `3:4`, `3:2`, `2:3`, `2:1`, `1:2`, `19.5:9`, `9:19.5`, `20:9`, `9:20`, `auto` | <Check size={18} /> |
-| `grok-imagine-image` | `1:1`, `16:9`, `9:16`, `4:3`, `3:4`, `3:2`, `2:3`, `2:1`, `1:2`, `19.5:9`, `9:19.5`, `20:9`, `9:20`, `auto` | <Check size={18} /> |
+| Model                    | Resolution   | Aspect Ratios                                                                                               | Image Editing       |
+| ------------------------ | ------------ | ----------------------------------------------------------------------------------------------------------- | ------------------- |
+| `grok-imagine-image-pro` | `1k`, `2k`   | `1:1`, `16:9`, `9:16`, `4:3`, `3:4`, `3:2`, `2:3`, `2:1`, `1:2`, `19.5:9`, `9:19.5`, `20:9`, `9:20`, `auto` | <Check size={18} /> |
+| `grok-imagine-image`     | `1k`         | `1:1`, `16:9`, `9:16`, `4:3`, `3:4`, `3:2`, `2:3`, `2:1`, `1:2`, `19.5:9`, `9:19.5`, `20:9`, `9:20`, `auto` | <Check size={18} /> |
 ## Video Models
 You can create xAI video models using the `.video()` factory method.
 For more on video generation with the AI SDK see [generateVideo()](/docs/reference/ai-sdk-core/generate-video).
-This provider supports three video generation modes: text-to-video, image-to-video, and video editing.
+This provider supports standard video generation from text prompts or image input, plus explicit video editing, video extension, and reference-to-video (R2V) operations.
 ### Text-to-Video
@@ -921,7 +607,7 @@ Generate videos from text prompts:
 import { xai, type XaiVideoModelOptions } from '@ai-sdk/xai';
 import { experimental_generateVideo as generateVideo } from 'ai';
-const { videos } = await generateVideo({
+const { video } = await generateVideo({
   model: xai.video('grok-imagine-video'),
   prompt: 'A chicken flying into the sunset in the style of 90s anime.',
   aspectRatio: '16:9',
@@ -934,15 +620,15 @@ const { videos } = await generateVideo({
 });
 ```
-### Image-to-Video
+### Generation with Image Input
-Generate videos using an image as the starting frame with an optional text prompt:
+Generate videos using an image as the starting frame with an optional text prompt. This uses the standard generation path rather than a separate provider mode:
 ```ts
 import { xai, type XaiVideoModelOptions } from '@ai-sdk/xai';
 import { experimental_generateVideo as generateVideo } from 'ai';
-const { videos } = await generateVideo({
+const { video } = await generateVideo({
   model: xai.video('grok-imagine-video'),
   prompt: {
     image: 'https://example.com/start-frame.png',
@@ -965,11 +651,12 @@ Edit an existing video using a text prompt by providing a source video URL via p
 import { xai, type XaiVideoModelOptions } from '@ai-sdk/xai';
 import { experimental_generateVideo as generateVideo } from 'ai';
-const { videos } = await generateVideo({
+const { video } = await generateVideo({
   model: xai.video('grok-imagine-video'),
   prompt: 'Give the person sunglasses and a hat',
   providerOptions: {
     xai: {
+      mode: 'edit-video',
       videoUrl: 'https://example.com/source-video.mp4',
       pollTimeoutMs: 600000, // 10 minutes
     } satisfies XaiVideoModelOptions,
@@ -995,6 +682,7 @@ import { experimental_generateVideo as generateVideo } from 'ai';
 const providerOptions = {
   xai: {
+    mode: 'edit-video',
     videoUrl: 'https://example.com/source-video.mp4',
     pollTimeoutMs: 600000,
   } satisfies XaiVideoModelOptions,
@@ -1016,19 +704,99 @@ const [withSunglasses, withScarf] = await Promise.all([
     model: xai.video('grok-imagine-video'),
     prompt: 'Add sunglasses',
     providerOptions: {
-      xai: { videoUrl: step1VideoUrl, pollTimeoutMs: 600000 },
+      xai: { mode: 'edit-video', videoUrl: step1VideoUrl, pollTimeoutMs: 600000 },
     },
   }),
   generateVideo({
     model: xai.video('grok-imagine-video'),
     prompt: 'Add a scarf',
     providerOptions: {
-      xai: { videoUrl: step1VideoUrl, pollTimeoutMs: 600000 },
+      xai: { mode: 'edit-video', videoUrl: step1VideoUrl, pollTimeoutMs: 600000 },
     },
   }),
 ]);
 ```
+### Video Extension
+Extend an existing video from its last frame. The `duration` controls the length of the extension only, not the total output. The output inherits `aspectRatio` and `resolution` from the source video.
+```ts
+import { xai, type XaiVideoModelOptions } from '@ai-sdk/xai';
+import { experimental_generateVideo as generateVideo } from 'ai';
+// Step 1: Generate a source video
+const source = await generateVideo({
+  model: xai.video('grok-imagine-video'),
+  prompt: 'A cat sitting on a sunlit windowsill, tail gently swishing.',
+  duration: 5,
+  aspectRatio: '16:9',
+  providerOptions: {
+    xai: {
+      pollTimeoutMs: 600000,
+    } satisfies XaiVideoModelOptions,
+  },
+});
+const sourceUrl = source.providerMetadata?.xai?.videoUrl as string;
+// Step 2: Extend the video with a new scene
+const extended = await generateVideo({
+  model: xai.video('grok-imagine-video'),
+  prompt: 'The cat turns its head, notices a butterfly, and leaps off.',
+  duration: 6,
+  providerOptions: {
+    xai: {
+      mode: 'extend-video',
+      videoUrl: sourceUrl,
+      pollTimeoutMs: 600000,
+    } satisfies XaiVideoModelOptions,
+  },
+});
+```
+<Note>
+  Video extension does not support custom `aspectRatio` or `resolution` — the
+  output inherits those from the source video. `duration` is supported and
+  controls how long the extension is (not the total video length).
+</Note>
+### Reference-to-Video (R2V)
+Provide reference images to guide the video's style and content. Unlike image-to-video, reference images are not used as the first frame — the model incorporates their visual elements into the generated video. Each reference image can be a public HTTPS URL or a base64 data URI.
+```ts
+import { xai, type XaiVideoModelOptions } from '@ai-sdk/xai';
+import { experimental_generateVideo as generateVideo } from 'ai';
+const { video } = await generateVideo({
+  model: xai.video('grok-imagine-video'),
+  prompt:
+    'The comic cat from <IMAGE_1> and the comic dog from <IMAGE_2> ' +
+    'are having a playful chase through a sunlit park. ' +
+    'Cinematic slow-motion, warm afternoon light.',
+  duration: 8,
+  aspectRatio: '16:9',
+  providerOptions: {
+    xai: {
+      mode: 'reference-to-video',
+      referenceImageUrls: [
+        'https://example.com/comic-cat.png',
+        'https://example.com/comic-dog.png',
+      ],
+      pollTimeoutMs: 600000,
+    } satisfies XaiVideoModelOptions,
+  },
+});
+```
+Use `<IMAGE_1>`, `<IMAGE_2>`, etc. in your prompt to reference specific images. Up to 7 reference images are supported per request.
+<Note>
+  Reference-to-video supports `duration`, `aspectRatio`, and `resolution`. Use
+  `mode` to select the operation — each mode is mutually exclusive.
+</Note>
 ### Video Provider Options
 The following provider options are available via `providerOptions.xai`.
@@ -1048,10 +816,27 @@ You can validate the provider options using the `XaiVideoModelOptions` type.
   `1280x720` maps to `720p` and `854x480` maps to `480p`.
   Use this provider option to pass the native format directly.
+- **mode** _'edit-video' | 'extend-video' | 'reference-to-video'_
+  Selects the explicit video operation. Each mode is mutually exclusive:
+  - `'edit-video'` — edit an existing video (requires `videoUrl`)
+  - `'extend-video'` — extend a video from its last frame (requires `videoUrl`)
+  - `'reference-to-video'` — generate from reference images (requires `referenceImageUrls`)
+  When omitted, standard generation is used. Legacy inputs are still auto-detected from fields for backward compatibility.
 - **videoUrl** _string_
-  URL of a source video for video editing. When provided, the prompt is used
-  to describe the desired edits to the video.
+  URL of a source video. Used with `mode: 'edit-video'` for video editing
+  and `mode: 'extend-video'` for video extension.
+- **referenceImageUrls** _string[]_
+  Array of reference image URLs (1–7 images) or base64 data URIs for
+  reference-to-video (R2V) generation. The model incorporates visual
+  elements from these images without using them as the first frame. Use
+  `<IMAGE_1>`, `<IMAGE_2>`, etc. in the prompt to reference specific
+  images. Used with `mode: 'reference-to-video'`.
 <Note>
   Video generation is an asynchronous process that can take several minutes.
@@ -1071,14 +856,21 @@ desired ratio.
 For **video editing**, the output matches the input video's aspect ratio and
 resolution. Custom `duration`, `aspectRatio`, and `resolution` are not
-supported - the output resolution is capped at 720p (e.g., a 1080p input
+supported — the output resolution is capped at 720p (e.g., a 1080p input
 will be downsized to 720p).
+For **video extension**, the output inherits `aspectRatio` and `resolution`
+from the source video. `duration` is supported and controls only the
+extension length.
+For **reference-to-video (R2V)**, you can specify `duration`, `aspectRatio`,
+and `resolution` just like text-to-video.
 ### Video Model Capabilities
-| Model                | Duration | Aspect Ratios                                     | Resolution     | Image-to-Video      | Video Editing       |
-| -------------------- | -------- | ------------------------------------------------- | -------------- | ------------------- | ------------------- |
-| `grok-imagine-video` | 1–15s    | `1:1`, `16:9`, `9:16`, `4:3`, `3:4`, `3:2`, `2:3` | `480p`, `720p` | <Check size={18} /> | <Check size={18} /> |
+| Model                | Duration | Aspect Ratios                                     | Resolution     | Image-to-Video      | Editing             | Extension           | R2V                 |
+| -------------------- | -------- | ------------------------------------------------- | -------------- | ------------------- | ------------------- | ------------------- | ------------------- |
+| `grok-imagine-video` | 1–15s    | `1:1`, `16:9`, `9:16`, `4:3`, `3:4`, `3:2`, `2:3` | `480p`, `720p` | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
 <Note>
   You can also pass any available provider model ID as a string if needed.