npm - @ai-sdk/google-vertex - Versions diffs - 5.0.0-beta.10 → 5.0.0-beta.108 - Mend

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (80)

package/CHANGELOG.md +930 -8
package/README.md +65 -2
package/dist/anthropic/edge/index.d.ts +18 -151
package/dist/anthropic/edge/index.js +88 -70
package/dist/anthropic/edge/index.js.map +1 -1
package/dist/anthropic/index.d.ts +18 -151
package/dist/anthropic/index.js +88 -81
package/dist/anthropic/index.js.map +1 -1
package/dist/edge/index.d.ts +58 -29
package/dist/edge/index.js +581 -278
package/dist/edge/index.js.map +1 -1
package/dist/index.d.ts +70 -32
package/dist/index.js +590 -296
package/dist/index.js.map +1 -1
package/dist/maas/edge/index.d.ts +76 -0
package/dist/maas/edge/index.js +209 -0
package/dist/maas/edge/index.js.map +1 -0
package/dist/maas/index.d.ts +60 -0
package/dist/maas/index.js +109 -0
package/dist/maas/index.js.map +1 -0
package/dist/xai/edge/index.d.ts +92 -0
package/dist/xai/edge/index.js +259 -0
package/dist/xai/edge/index.js.map +1 -0
package/dist/xai/index.d.ts +76 -0
package/dist/xai/index.js +159 -0
package/dist/xai/index.js.map +1 -0
package/docs/16-google-vertex.mdx +640 -102
package/maas/edge.d.ts +1 -0
package/maas/index.d.ts +1 -0
package/package.json +49 -24
package/src/anthropic/edge/google-vertex-anthropic-provider-edge.ts +17 -13
package/src/anthropic/edge/index.ts +6 -2
package/src/anthropic/{google-vertex-anthropic-messages-options.ts → google-vertex-anthropic-options.ts} +4 -1
package/src/anthropic/google-vertex-anthropic-provider-node.ts +18 -13
package/src/anthropic/google-vertex-anthropic-provider.ts +68 -19
package/src/anthropic/index.ts +6 -2
package/src/edge/google-vertex-provider-edge.ts +10 -12
package/src/edge/index.ts +8 -1
package/src/google-vertex-auth-google-auth-library.ts +13 -26
package/src/google-vertex-config.ts +2 -2
package/src/{google-vertex-embedding-options.ts → google-vertex-embedding-model-options.ts} +1 -0
package/src/google-vertex-embedding-model.ts +35 -10
package/src/google-vertex-image-model-options.ts +74 -0
package/src/google-vertex-image-model.ts +106 -133
package/src/google-vertex-options.ts +1 -1
package/src/google-vertex-provider-base.ts +311 -0
package/src/google-vertex-provider.ts +43 -233
package/src/google-vertex-speech-model-options.ts +11 -0
package/src/google-vertex-transcription-model-options.ts +46 -0
package/src/google-vertex-transcription-model.ts +231 -0
package/src/google-vertex-video-model-options.ts +49 -0
package/src/google-vertex-video-model.ts +39 -75
package/src/index.ts +20 -5
package/src/maas/edge/google-vertex-maas-provider-edge.ts +64 -0
package/src/maas/edge/index.ts +13 -0
package/src/maas/google-vertex-maas-options.ts +15 -0
package/src/maas/google-vertex-maas-provider-node.ts +65 -0
package/src/maas/google-vertex-maas-provider.ts +122 -0
package/src/maas/index.ts +13 -0
package/src/xai/edge/google-vertex-xai-provider-edge.ts +61 -0
package/src/xai/edge/index.ts +9 -0
package/src/xai/google-vertex-xai-options.ts +7 -0
package/src/xai/google-vertex-xai-provider-node.ts +62 -0
package/src/xai/google-vertex-xai-provider.ts +212 -0
package/src/xai/index.ts +9 -0
package/xai/edge.d.ts +1 -0
package/xai/index.d.ts +1 -0
package/dist/anthropic/edge/index.d.mts +0 -231
package/dist/anthropic/edge/index.mjs +0 -259
package/dist/anthropic/edge/index.mjs.map +0 -1
package/dist/anthropic/index.d.mts +0 -215
package/dist/anthropic/index.mjs +0 -164
package/dist/anthropic/index.mjs.map +0 -1
package/dist/edge/index.d.mts +0 -160
package/dist/edge/index.mjs +0 -1049
package/dist/edge/index.mjs.map +0 -1
package/dist/index.d.mts +0 -219
package/dist/index.mjs +0 -960
package/dist/index.mjs.map +0 -1
package/src/google-vertex-provider-node.ts +0 -49

package/docs/16-google-vertex.mdx CHANGED

@@ -5,19 +5,20 @@ description: Learn how to use the Google Vertex AI provider.
 # Google Vertex Provider
-The Google Vertex provider for the [AI SDK](/docs) contains language model support for the [Google Vertex AI](https://cloud.google.com/vertex-ai) APIs. This includes support for [Google's Gemini models](https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models) and [Anthropic's Claude partner models](https://cloud.google.com/vertex-ai/generative-ai/docs/partner-models/use-claude).
+The Google Vertex provider for the [AI SDK](/docs) contains language model support for the [Google Vertex AI](https://cloud.google.com/vertex-ai) APIs. This includes support for [Google's Gemini models](https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models), [Anthropic's Claude partner models](https://cloud.google.com/vertex-ai/generative-ai/docs/partner-models/use-claude), [xAI's Grok partner models](https://cloud.google.com/vertex-ai/generative-ai/docs/partner-models/grok), and [MaaS (Model as a Service) open models](https://cloud.google.com/vertex-ai/generative-ai/docs/maas/use-open-models).
 <Note>
   The Google Vertex provider is compatible with both Node.js and Edge runtimes.
   The Edge runtime is supported through the `@ai-sdk/google-vertex/edge`
   sub-module. More details can be found in the [Google Vertex Edge
-  Runtime](#google-vertex-edge-runtime) and [Google Vertex Anthropic Edge
-  Runtime](#google-vertex-anthropic-edge-runtime) sections below.
+  Runtime](#google-vertex-edge-runtime), [Google Vertex Anthropic Edge
+  Runtime](#google-vertex-anthropic-edge-runtime), and [Google Vertex MaaS Edge
+  Runtime](#google-vertex-maas-edge-runtime) sections below.
 </Note>
 ## Setup
-The Google Vertex and Google Vertex Anthropic providers are both available in the `@ai-sdk/google-vertex` module. You can install it with
+The Google Vertex, Google Vertex Anthropic, Google Vertex xAI, and Google Vertex MaaS providers are available in the `@ai-sdk/google-vertex` module. You can install it with
 <Tabs items={['pnpm', 'npm', 'yarn', 'bun']}>
   <Tab>
@@ -44,18 +45,18 @@ The Google Vertex provider instance is used to create model instances that call
 ### Provider Instance
-You can import the default provider instance `vertex` from `@ai-sdk/google-vertex`:
+You can import the default provider instance `googleVertex` from `@ai-sdk/google-vertex`:
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 ```
-If you need a customized setup, you can import `createVertex` from `@ai-sdk/google-vertex` and create a provider instance with your settings:
+If you need a customized setup, you can import `createGoogleVertex` from `@ai-sdk/google-vertex` and create a provider instance with your settings:
 ```ts
-import { createVertex } from '@ai-sdk/google-vertex';
+import { createGoogleVertex } from '@ai-sdk/google-vertex';
-const vertex = createVertex({
+const googleVertex = createGoogleVertex({
   project: 'my-project', // optional
   location: 'us-central1', // optional
 });
@@ -67,12 +68,12 @@ Google Vertex supports multiple authentication methods depending on your runtime
 The Node.js runtime is the default runtime supported by the AI SDK. It supports all standard Google Cloud authentication options through the [`google-auth-library`](https://github.com/googleapis/google-auth-library-nodejs?tab=readme-ov-file#ways-to-authenticate). Typical use involves setting a path to a json credentials file in the `GOOGLE_APPLICATION_CREDENTIALS` environment variable. The credentials file can be obtained from the [Google Cloud Console](https://console.cloud.google.com/apis/credentials).
-If you want to customize the Google authentication options you can pass them as options to the `createVertex` function, for example:
+If you want to customize the Google authentication options you can pass them as options to the `createGoogleVertex` function, for example:
 ```ts
-import { createVertex } from '@ai-sdk/google-vertex';
+import { createGoogleVertex } from '@ai-sdk/google-vertex';
-const vertex = createVertex({
+const googleVertex = createGoogleVertex({
   googleAuthOptions: {
     credentials: {
       client_email: 'my-email',
@@ -99,7 +100,6 @@ You can use the following optional settings to customize the provider instance:
 - **googleAuthOptions** _object_
   Optional. The Authentication options used by the [Google Auth Library](https://github.com/googleapis/google-auth-library-nodejs/). See also the [GoogleAuthOptions](https://github.com/googleapis/google-auth-library-nodejs/blob/08978822e1b7b5961f0e355df51d738e012be392/src/auth/googleauth.ts#L87C18-L87C35) interface.
   - **authClient** _object_
     An `AuthClient` to use.
@@ -127,7 +127,6 @@ You can use the following optional settings to customize the provider instance:
 - **headers** _Resolvable&lt;Record&lt;string, string | undefined&gt;&gt;_
   Headers to include in the requests. Can be provided in multiple formats:
   - A record of header key-value pairs: `Record<string, string | undefined>`
   - A function that returns headers: `() => Record<string, string | undefined>`
   - An async function that returns headers: `async () => Record<string, string | undefined>`
@@ -155,10 +154,10 @@ For example, direct file system access is not available, and many Node.js-specif
 The Edge runtime version of the Google Vertex provider supports Google's [Application Default Credentials](https://github.com/googleapis/google-auth-library-nodejs?tab=readme-ov-file#application-default-credentials) through environment variables. The values can be obtained from a json credentials file from the [Google Cloud Console](https://console.cloud.google.com/apis/credentials).
-You can import the default provider instance `vertex` from `@ai-sdk/google-vertex/edge`:
+You can import the default provider instance `googleVertex` from `@ai-sdk/google-vertex/edge`:
 ```ts
-import { vertex } from '@ai-sdk/google-vertex/edge';
+import { googleVertex } from '@ai-sdk/google-vertex/edge';
 ```
 <Note>
@@ -167,12 +166,12 @@ import { vertex } from '@ai-sdk/google-vertex/edge';
   `@ai-sdk/google-vertex/edge` to differentiate it from the Node.js provider.
 </Note>
-If you need a customized setup, you can import `createVertex` from `@ai-sdk/google-vertex/edge` and create a provider instance with your settings:
+If you need a customized setup, you can import `createGoogleVertex` from `@ai-sdk/google-vertex/edge` and create a provider instance with your settings:
 ```ts
-import { createVertex } from '@ai-sdk/google-vertex/edge';
+import { createGoogleVertex } from '@ai-sdk/google-vertex/edge';
-const vertex = createVertex({
+const googleVertex = createGoogleVertex({
   project: 'my-project', // optional
   location: 'us-central1', // optional
 });
@@ -203,7 +202,6 @@ You can use the following optional settings to customize the provider instance:
 - **googleCredentials** _object_
   Optional. The credentials used by the Edge provider for authentication. These credentials are typically set through environment variables and are derived from a service account JSON file.
   - **clientEmail** _string_
     The client email from the service account JSON file. Defaults to the contents of the `GOOGLE_CLIENT_EMAIL` environment variable.
@@ -216,7 +214,6 @@ You can use the following optional settings to customize the provider instance:
 - **headers** _Resolvable&lt;Record&lt;string, string | undefined&gt;&gt;_
   Headers to include in the requests. Can be provided in multiple formats:
   - A record of header key-value pairs: `Record<string, string | undefined>`
   - A function that returns headers: `() => Record<string, string | undefined>`
   - An async function that returns headers: `async () => Record<string, string | undefined>`
@@ -234,9 +231,9 @@ You can use the following optional settings to customize the provider instance:
 Express mode provides a simplified authentication method using an API key instead of OAuth or service account credentials. When using express mode, the `project` and `location` settings are not required.
 ```ts
-import { createVertex } from '@ai-sdk/google-vertex';
+import { createGoogleVertex } from '@ai-sdk/google-vertex';
-const vertex = createVertex({
+const googleVertex = createGoogleVertex({
   apiKey: process.env.GOOGLE_VERTEX_API_KEY,
 });
 ```
@@ -254,7 +251,7 @@ You can create models that call the Vertex API using the provider instance.
 The first argument is the model id, e.g. `gemini-2.5-pro`.
 ```ts
-const model = vertex('gemini-2.5-pro');
+const model = googleVertex('gemini-2.5-pro');
 ```
 <Note>
@@ -268,10 +265,10 @@ of the [standard call settings](/docs/ai-sdk-core/settings). You can pass them a
 an options argument:
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { type GoogleLanguageModelOptions } from '@ai-sdk/google';
-const model = vertex('gemini-2.5-pro');
+const model = googleVertex('gemini-2.5-pro');
 await generateText({
   model,
@@ -309,11 +306,9 @@ The following optional provider options are available for Google Vertex models:
 - **safetySettings** _Array\<\{ category: string; threshold: string \}\>_
   Optional. Safety settings for the model.
   - **category** _string_
     The category of the safety setting. Can be one of the following:
     - `HARM_CATEGORY_UNSPECIFIED`
     - `HARM_CATEGORY_HATE_SPEECH`
     - `HARM_CATEGORY_DANGEROUS_CONTENT`
@@ -324,7 +319,6 @@ The following optional provider options are available for Google Vertex models:
   - **threshold** _string_
     The threshold of the safety setting. Can be one of the following:
     - `HARM_BLOCK_THRESHOLD_UNSPECIFIED`
     - `BLOCK_LOW_AND_ABOVE`
     - `BLOCK_MEDIUM_AND_ABOVE`
@@ -344,14 +338,52 @@ The following optional provider options are available for Google Vertex models:
   Consult [Google's Documentation](https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/add-labels-to-api-calls) for usage details.
+- **streamFunctionCallArguments** _boolean_
+  Optional. When set to true, function call arguments will be streamed
+  incrementally in streaming responses. This enables `tool-input-delta` events
+  to arrive as the model generates function call arguments, reducing perceived
+  latency for tool calls. Defaults to `false`. Only supported on the Vertex AI API (not the Gemini API) with Gemini 3+ models.
+  Consult [Google's Documentation](https://docs.cloud.google.com/vertex-ai/generative-ai/docs/multimodal/function-calling#streaming-fc) for details.
+- **sharedRequestType** _'priority' | 'flex' | 'standard'_
+  Optional. Selects a pay-as-you-go (PayGo) tier by setting the
+  `X-Vertex-AI-LLM-Shared-Request-Type` request header. Use `'priority'` for
+  consistent low-latency performance at a premium, or `'flex'` for a 50%
+  discount with longer expected latency. Both are supported only on the
+  `global` endpoint and on a subset of Gemini models.
+  By default — with Provisioned Throughput allocated and `requestType` unset
+  — the request consumes Provisioned Throughput quota first and only falls
+  back to the chosen shared tier if PT capacity is exhausted. To bypass
+  Provisioned Throughput entirely, also set `requestType: 'shared'`.
+  The served tier is reported back on
+  `result.providerMetadata.googleVertex.usageMetadata.trafficType` as
+  `ON_DEMAND_PRIORITY`, `ON_DEMAND_FLEX`, or (if downgraded under load) plain
+  `ON_DEMAND`.
+  See [Priority PayGo](https://docs.cloud.google.com/gemini-enterprise-agent-platform/models/priority-paygo)
+  and [Flex PayGo](https://docs.cloud.google.com/gemini-enterprise-agent-platform/models/flex-paygo)
+  for supported models, ramp limits, and downgrade behavior.
+- **requestType** _'shared'_
+  Optional. Sets the `X-Vertex-AI-LLM-Request-Type` request header. Combine
+  with `sharedRequestType` to skip Provisioned Throughput entirely and route
+  the request through shared PayGo capacity. See
+  [Priority PayGo](https://docs.cloud.google.com/gemini-enterprise-agent-platform/models/priority-paygo).
 You can use Google Vertex language models to generate text with the `generateText` function:
 ```ts highlight="1,4"
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateText } from 'ai';
 const { text } = await generateText({
-  model: vertex('gemini-2.5-pro'),
+  model: googleVertex('gemini-2.5-pro'),
   prompt: 'Write a vegetarian lasagna recipe for 4 people.',
 });
 ```
@@ -366,12 +398,12 @@ With [Code Execution](https://cloud.google.com/vertex-ai/generative-ai/docs/mult
 You can enable code execution by adding the `code_execution` tool to your request.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateText } from 'ai';
 const result = await generateText({
-  model: vertex('gemini-2.5-pro'),
-  tools: { code_execution: vertex.tools.codeExecution({}) },
+  model: googleVertex('gemini-2.5-pro'),
+  tools: { code_execution: googleVertex.tools.codeExecution({}) },
   prompt:
     'Use python to calculate 20th fibonacci number. Then find the nearest palindrome to it.',
 });
@@ -384,12 +416,12 @@ The response will contain `tool-call` and `tool-result` parts for the executed c
 URL Context allows Gemini models to retrieve and analyze content from URLs. Supported models: Gemini 2.5 Flash-Lite, 2.5 Pro, 2.5 Flash, 2.0 Flash.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateText } from 'ai';
 const result = await generateText({
-  model: vertex('gemini-2.5-pro'),
-  tools: { url_context: vertex.tools.urlContext({}) },
+  model: googleVertex('gemini-2.5-pro'),
+  tools: { url_context: googleVertex.tools.urlContext({}) },
   prompt: 'What are the key points from https://example.com/article?',
 });
 ```
@@ -399,12 +431,12 @@ const result = await generateText({
 Google Search enables Gemini models to access real-time web information. Supported models: Gemini 2.5 Flash-Lite, 2.5 Flash, 2.0 Flash, 2.5 Pro.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateText } from 'ai';
 const result = await generateText({
-  model: vertex('gemini-2.5-pro'),
-  tools: { google_search: vertex.tools.googleSearch({}) },
+  model: googleVertex('gemini-2.5-pro'),
+  tools: { google_search: googleVertex.tools.googleSearch({}) },
   prompt: 'What are the latest developments in AI?',
 });
 ```
@@ -414,13 +446,13 @@ const result = await generateText({
 [Enterprise Web Search](https://cloud.google.com/vertex-ai/generative-ai/docs/grounding/web-grounding-enterprise) provides grounding using a compliance-focused web index designed for highly-regulated industries such as finance, healthcare, and the public sector. Unlike standard Google Search grounding, Enterprise Web Search does not log customer data and supports VPC service controls. Supported models: Gemini 2.0 and newer.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateText } from 'ai';
 const result = await generateText({
-  model: vertex('gemini-2.5-flash'),
+  model: googleVertex('gemini-2.5-flash'),
   tools: {
-    enterprise_web_search: vertex.tools.enterpriseWebSearch({}),
+    enterprise_web_search: googleVertex.tools.enterpriseWebSearch({}),
   },
   prompt: 'What are the latest FDA regulations for clinical trials?',
 });
@@ -431,14 +463,14 @@ const result = await generateText({
 Google Maps grounding enables Gemini models to access Google Maps data for location-aware responses. Supported models: Gemini 2.5 Flash-Lite, 2.5 Flash, 2.0 Flash, 2.5 Pro, 3.0 Pro.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { type GoogleLanguageModelOptions } from '@ai-sdk/google';
 import { generateText } from 'ai';
 const result = await generateText({
-  model: vertex('gemini-2.5-flash'),
+  model: googleVertex('gemini-2.5-flash'),
   tools: {
-    google_maps: vertex.tools.googleMaps({}),
+    google_maps: googleVertex.tools.googleMaps({}),
   },
   providerOptions: {
     vertex: {
@@ -453,6 +485,59 @@ const result = await generateText({
 The optional `retrievalConfig.latLng` provider option provides location context for queries about nearby places. This configuration applies to any grounding tools that support location context.
+#### Streaming Function Call Arguments
+For Gemini 3 Pro and later models on Vertex AI, you can stream function call
+arguments as they are generated by setting `streamFunctionCallArguments` to
+`true`. This reduces perceived latency when functions need to be called, as
+`tool-input-delta` events arrive incrementally instead of waiting for the
+complete arguments. This option defaults to `false`.
+```ts
+import { googleVertex } from '@ai-sdk/google-vertex';
+import { type GoogleLanguageModelOptions } from '@ai-sdk/google';
+import { streamText } from 'ai';
+import { z } from 'zod';
+const result = streamText({
+  model: googleVertex('gemini-3.1-pro-preview'),
+  prompt: 'What is the weather in Boston and San Francisco?',
+  tools: {
+    getWeather: {
+      description: 'Get the current weather in a given location',
+      inputSchema: z.object({
+        location: z.string().describe('City name'),
+      }),
+    },
+  },
+  providerOptions: {
+    vertex: {
+      streamFunctionCallArguments: true,
+    } satisfies GoogleLanguageModelOptions,
+  },
+});
+for await (const part of result.stream) {
+  switch (part.type) {
+    case 'tool-input-start':
+      console.log(`Tool call started: ${part.toolName}`);
+      break;
+    case 'tool-input-delta':
+      process.stdout.write(part.delta);
+      break;
+    case 'tool-call':
+      console.log(`Tool call complete: ${part.toolName}`, part.input);
+      break;
+  }
+}
+```
+<Note>
+  This feature is only available on the Vertex AI API. It is not supported on
+  the Gemini API. When used with the Google provider, a warning will be emitted
+  and the option will be ignored.
+</Note>
 #### Reasoning (Thinking Tokens)
 Google Vertex AI, through its support for Gemini models, can also emit "thinking" tokens, representing the model's reasoning process. The AI SDK exposes these as reasoning information.
@@ -460,13 +545,13 @@ Google Vertex AI, through its support for Gemini models, can also emit "thinking
 To enable thinking tokens for compatible Gemini models via Vertex, set `includeThoughts: true` in the `thinkingConfig` provider option. These options are passed through `providerOptions.vertex`:
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { type GoogleLanguageModelOptions } from '@ai-sdk/google';
 import { generateText, streamText } from 'ai';
 // For generateText:
 const { text, reasoningText, reasoning } = await generateText({
-  model: vertex('gemini-2.0-flash-001'), // Or other supported model via Vertex
+  model: googleVertex('gemini-2.0-flash-001'), // Or other supported model via Vertex
   providerOptions: {
     vertex: {
       thinkingConfig: {
@@ -484,7 +569,7 @@ console.log('Final Text:', text);
 // For streamText:
 const result = streamText({
-  model: vertex('gemini-2.0-flash-001'), // Or other supported model via Vertex
+  model: googleVertex('gemini-2.0-flash-001'), // Or other supported model via Vertex
   providerOptions: {
     vertex: {
       thinkingConfig: {
@@ -496,7 +581,7 @@ const result = streamText({
   prompt: 'Explain quantum computing in simple terms.',
 });
-for await (const part of result.fullStream) {
+for await (const part of result.stream) {
   if (part.type === 'reasoning') {
     process.stdout.write(`THOUGHT: ${part.textDelta}\n`);
   } else if (part.type === 'text-delta') {
@@ -521,11 +606,11 @@ When `includeThoughts` is true, parts of the API response marked with `thought:
 The Google Vertex provider supports file inputs, e.g. PDF files.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateText } from 'ai';
 const { text } = await generateText({
-  model: vertex('gemini-2.5-pro'),
+  model: googleVertex('gemini-2.5-pro'),
   messages: [
     {
       role: 'user',
@@ -560,7 +645,7 @@ Google Vertex AI supports both explicit and implicit caching to help reduce cost
 #### Implicit Caching
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateText } from 'ai';
 // Structure prompts with consistent content at the beginning
@@ -568,13 +653,13 @@ const baseContext =
   'You are a cooking assistant with expertise in Italian cuisine. Here are 1000 lasagna recipes for reference...';
 const { text: veggieLasagna } = await generateText({
-  model: vertex('gemini-2.5-pro'),
+  model: googleVertex('gemini-2.5-pro'),
   prompt: `${baseContext}\n\nWrite a vegetarian lasagna recipe for 4 people.`,
 });
 // Second request with same prefix - eligible for cache hit
 const { text: meatLasagna, providerMetadata } = await generateText({
-  model: vertex('gemini-2.5-pro'),
+  model: googleVertex('gemini-2.5-pro'),
   prompt: `${baseContext}\n\nWrite a meat lasagna recipe for 12 people.`,
 });
@@ -632,12 +717,12 @@ console.log('Cache created:', cache.name);
 Then use the cache with the AI SDK:
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { type GoogleLanguageModelOptions } from '@ai-sdk/google';
 import { generateText } from 'ai';
 const { text: veggieLasagnaRecipe } = await generateText({
-  model: vertex('gemini-2.5-pro'),
+  model: googleVertex('gemini-2.5-pro'),
   prompt: 'Write a vegetarian lasagna recipe for 4 people.',
   providerOptions: {
     vertex: {
@@ -647,7 +732,7 @@ const { text: veggieLasagnaRecipe } = await generateText({
 });
 const { text: meatLasagnaRecipe } = await generateText({
-  model: vertex('gemini-2.5-pro'),
+  model: googleVertex('gemini-2.5-pro'),
   prompt: 'Write a meat lasagna recipe for 12 people.',
   providerOptions: {
     vertex: {
@@ -716,12 +801,12 @@ By default, structured outputs are enabled (and for tool calling they are requir
 You can disable structured outputs for object generation as a workaround:
 ```ts highlight="7,12"
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { type GoogleLanguageModelOptions } from '@ai-sdk/google';
 import { generateText, Output } from 'ai';
 const result = await generateText({
-  model: vertex('gemini-2.5-pro'),
+  model: googleVertex('gemini-2.5-pro'),
   providerOptions: {
     vertex: {
       structuredOutputs: false,
@@ -756,6 +841,7 @@ The following Zod features are known to not work with Google Vertex:
 | Model                  | Image Input         | Object Generation   | Tool Usage          | Tool Streaming      |
 | ---------------------- | ------------------- | ------------------- | ------------------- | ------------------- |
+| `gemini-3.5-flash`     | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
 | `gemini-3-pro-preview` | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
 | `gemini-2.5-pro`       | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
 | `gemini-2.5-flash`     | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> | <Check size={18} /> |
@@ -773,19 +859,19 @@ The following Zod features are known to not work with Google Vertex:
 You can create models that call the Google Vertex AI embeddings API using the `.embeddingModel()` factory method:
 ```ts
-const model = vertex.embeddingModel('text-embedding-005');
+const model = googleVertex.embeddingModel('text-embedding-005');
 ```
 Google Vertex AI embedding models support additional settings. You can pass them as an options argument:
 ```ts
 import {
-  vertex,
+  googleVertex,
   type GoogleVertexEmbeddingModelOptions,
 } from '@ai-sdk/google-vertex';
 import { embed } from 'ai';
-const model = vertex.embeddingModel('text-embedding-005');
+const model = googleVertex.embeddingModel('text-embedding-005');
 const { embedding } = await embed({
   model,
@@ -809,7 +895,6 @@ The following optional provider options are available for Google Vertex AI embed
 - **taskType**: _string_
   Optional. Specifies the task type for generating embeddings. Supported task types include:
   - `SEMANTIC_SIMILARITY`: Optimized for text similarity.
   - `CLASSIFICATION`: Optimized for text classification.
   - `CLUSTERING`: Optimized for clustering texts based on similarity.
@@ -832,6 +917,7 @@ The following optional provider options are available for Google Vertex AI embed
 | Model                        | Max Values Per Call | Parallel Calls      | Multimodal          |
 | ---------------------------- | ------------------- | ------------------- | ------------------- |
 | `text-embedding-005`         | 2048                | <Check size={18} /> | <Cross size={18} /> |
+| `gemini-embedding-2`         | 2048                | <Check size={18} /> | <Check size={18} /> |
 | `gemini-embedding-2-preview` | 2048                | <Check size={18} /> | <Check size={18} /> |
 <Note>
@@ -848,11 +934,11 @@ You can create image models using the `.image()` factory method. The Google Vert
 [Imagen models](https://cloud.google.com/vertex-ai/generative-ai/docs/image/generate-images) generate images using the Imagen on Vertex AI API.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateImage } from 'ai';
 const { image } = await generateImage({
-  model: vertex.image('imagen-4.0-generate-001'),
+  model: googleVertex.image('imagen-4.0-generate-001'),
   prompt: 'A futuristic cityscape at sunset',
   aspectRatio: '16:9',
 });
@@ -861,12 +947,12 @@ const { image } = await generateImage({
 Further configuration can be done using Google Vertex provider options. You can validate the provider options using the `GoogleVertexImageModelOptions` type.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { GoogleVertexImageModelOptions } from '@ai-sdk/google-vertex';
 import { generateImage } from 'ai';
 const { image } = await generateImage({
-  model: vertex.image('imagen-4.0-generate-001'),
+  model: googleVertex.image('imagen-4.0-generate-001'),
   providerOptions: {
     vertex: {
       negativePrompt: 'pixelated, blurry, low-quality',
@@ -901,12 +987,12 @@ The following provider options are available:
 Additional information about the images can be retrieved using Google Vertex meta data.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { GoogleVertexImageModelOptions } from '@ai-sdk/google-vertex';
 import { generateImage } from 'ai';
 const { image, providerMetadata } = await generateImage({
-  model: vertex.image('imagen-4.0-generate-001'),
+  model: googleVertex.image('imagen-4.0-generate-001'),
   prompt: 'A futuristic cityscape at sunset',
   aspectRatio: '16:9',
 });
@@ -930,7 +1016,10 @@ Google Vertex Imagen models support image editing through inpainting, outpaintin
 Insert or replace objects in specific areas using a mask:
 ```ts
-import { vertex, GoogleVertexImageModelOptions } from '@ai-sdk/google-vertex';
+import {
+  googleVertex,
+  GoogleVertexImageModelOptions,
+} from '@ai-sdk/google-vertex';
 import { generateImage } from 'ai';
 import fs from 'fs';
@@ -938,7 +1027,7 @@ const image = fs.readFileSync('./input-image.png');
 const mask = fs.readFileSync('./mask.png'); // White = edit area
 const { images } = await generateImage({
-  model: vertex.image('imagen-3.0-capability-001'),
+  model: googleVertex.image('imagen-3.0-capability-001'),
   prompt: {
     text: 'A sunlit indoor lounge area with a pool containing a flamingo',
     images: [image],
@@ -962,7 +1051,10 @@ const { images } = await generateImage({
 Extend an image beyond its original boundaries:
 ```ts
-import { vertex, GoogleVertexImageModelOptions } from '@ai-sdk/google-vertex';
+import {
+  googleVertex,
+  GoogleVertexImageModelOptions,
+} from '@ai-sdk/google-vertex';
 import { generateImage } from 'ai';
 import fs from 'fs';
@@ -970,7 +1062,7 @@ const image = fs.readFileSync('./input-image.png');
 const mask = fs.readFileSync('./outpaint-mask.png'); // White = extend area
 const { images } = await generateImage({
-  model: vertex.image('imagen-3.0-capability-001'),
+  model: googleVertex.image('imagen-3.0-capability-001'),
   prompt: {
     text: 'Extend the scene with more of the forest background',
     images: [image],
@@ -993,7 +1085,6 @@ const { images } = await generateImage({
 The following options are available under `providerOptions.vertex.edit`:
 - **mode** - The edit mode to use:
   - `EDIT_MODE_INPAINT_INSERTION` - Insert objects into masked areas
   - `EDIT_MODE_INPAINT_REMOVAL` - Remove objects from masked areas
   - `EDIT_MODE_OUTPAINT` - Extend image beyond boundaries
@@ -1004,7 +1095,6 @@ The following options are available under `providerOptions.vertex.edit`:
 - **baseSteps** _number_ - Number of sampling steps (35-75). Higher values improve quality but slow generation.
 - **maskMode** - How to interpret the mask:
   - `MASK_MODE_USER_PROVIDED` - Use the provided mask directly
   - `MASK_MODE_DEFAULT` - Default mask mode
   - `MASK_MODE_DETECTION_BOX` - Mask from detected bounding boxes
@@ -1035,11 +1125,11 @@ The following options are available under `providerOptions.vertex.edit`:
 [Gemini image models](https://cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/2-5-flash-image) (e.g. `gemini-2.5-flash-image`) are multimodal output language models that can be used with `generateImage()` for a simpler image generation experience. Internally, the provider calls the language model API with `responseModalities: ['IMAGE']`.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateImage } from 'ai';
 const { image } = await generateImage({
-  model: vertex.image('gemini-2.5-flash-image'),
+  model: googleVertex.image('gemini-2.5-flash-image'),
   prompt: 'A photorealistic image of a cat wearing a wizard hat',
   aspectRatio: '1:1',
 });
@@ -1048,14 +1138,14 @@ const { image } = await generateImage({
 Gemini image models also support image editing by providing input images:
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateImage } from 'ai';
 import fs from 'node:fs';
 const sourceImage = fs.readFileSync('./cat.png');
 const { image } = await generateImage({
-  model: vertex.image('gemini-2.5-flash-image'),
+  model: googleVertex.image('gemini-2.5-flash-image'),
   prompt: {
     text: 'Add a small wizard hat to this cat',
     images: [sourceImage],
@@ -1066,11 +1156,11 @@ const { image } = await generateImage({
 You can also use URLs (including `gs://` Cloud Storage URIs) for input images:
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { generateImage } from 'ai';
 const { image } = await generateImage({
-  model: vertex.image('gemini-2.5-flash-image'),
+  model: googleVertex.image('gemini-2.5-flash-image'),
   prompt: {
     text: 'Add a small wizard hat to this cat',
     images: ['https://example.com/cat.png'],
@@ -1111,11 +1201,11 @@ You can create [Veo](https://cloud.google.com/vertex-ai/generative-ai/docs/video
 using the `.video()` factory method. For more on video generation with the AI SDK see [generateVideo()](/docs/reference/ai-sdk-core/generate-video).
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { experimental_generateVideo as generateVideo } from 'ai';
 const { video } = await generateVideo({
-  model: vertex.video('veo-3.1-generate-001'),
+  model: googleVertex.video('veo-3.1-generate-001'),
   prompt:
     'A pangolin curled on a mossy stone in a glowing bioluminescent forest',
   aspectRatio: '16:9',
@@ -1125,11 +1215,11 @@ const { video } = await generateVideo({
 You can configure resolution and duration:
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { experimental_generateVideo as generateVideo } from 'ai';
 const { video } = await generateVideo({
-  model: vertex.video('veo-3.1-generate-001'),
+  model: googleVertex.video('veo-3.1-generate-001'),
   prompt: 'A serene mountain landscape at sunset',
   aspectRatio: '16:9',
   resolution: '1920x1080',
@@ -1142,12 +1232,12 @@ const { video } = await generateVideo({
 Further configuration can be done using Google Vertex provider options. You can validate the provider options using the `GoogleVertexVideoModelOptions` type.
 ```ts
-import { vertex } from '@ai-sdk/google-vertex';
+import { googleVertex } from '@ai-sdk/google-vertex';
 import { GoogleVertexVideoModelOptions } from '@ai-sdk/google-vertex';
 import { experimental_generateVideo as generateVideo } from 'ai';
 const { video } = await generateVideo({
-  model: vertex.video('veo-3.1-generate-001'),
+  model: googleVertex.video('veo-3.1-generate-001'),
   prompt: 'A serene mountain landscape at sunset',
   aspectRatio: '16:9',
   providerOptions: {
@@ -1210,6 +1300,118 @@ The following provider options are available:
   model ID as a string if needed.
 </Note>
+### Speech Models
+You can create [Gemini text-to-speech](https://docs.cloud.google.com/text-to-speech/docs/gemini-tts)
+models that call the Vertex AI API using the `.speech()` factory method. For more on speech
+generation with the AI SDK see [generateSpeech()](/docs/reference/ai-sdk-core/generate-speech).
+```ts
+import { googleVertex } from '@ai-sdk/google-vertex';
+import { generateSpeech } from 'ai';
+const result = await generateSpeech({
+  model: googleVertex.speech('gemini-2.5-flash-tts'),
+  text: 'Hello, world!',
+  voice: 'Kore', // Gemini voice name
+});
+```
+The `voice` argument accepts one of Gemini's [30 prebuilt voices](https://ai.google.dev/gemini-api/docs/speech-generation#voices)
+(e.g. `Kore`, `Puck`, `Zephyr`); it defaults to `Kore`. Multi-speaker dialogue is available via
+`providerOptions.googleVertex.multiSpeakerVoiceConfig`.
+By default the audio is returned as a playable WAV (Gemini returns raw PCM, which the provider
+wraps). Set `outputFormat: 'pcm'` for the raw signed 16-bit little-endian mono bytes; the sample
+rate is reported in `result.providerMetadata.google.sampleRate`.
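The PCM-to-WAV wrapping described above can be sketched as follows. This is a minimal illustration, not the provider's actual implementation: `wrapPcmInWav` is a hypothetical helper, and the sample rate must be supplied by the caller (for example from the provider metadata mentioned above). It assumes signed 16-bit little-endian mono input, as stated in the text.

```typescript
// Hypothetical helper (not part of @ai-sdk/google-vertex): prepends a minimal
// 44-byte RIFF/WAVE header to raw signed 16-bit little-endian mono PCM.
function wrapPcmInWav(pcm: Uint8Array, sampleRate: number): Uint8Array {
  const numChannels = 1; // mono, per the description above
  const bitsPerSample = 16;
  const blockAlign = (numChannels * bitsPerSample) / 8; // bytes per sample frame
  const byteRate = sampleRate * blockAlign;

  const wav = new Uint8Array(44 + pcm.length);
  const view = new DataView(wav.buffer);
  const writeAscii = (offset: number, text: string): void => {
    for (let i = 0; i < text.length; i++) {
      view.setUint8(offset + i, text.charCodeAt(i));
    }
  };

  writeAscii(0, 'RIFF');
  view.setUint32(4, 36 + pcm.length, true); // file size minus the first 8 bytes
  writeAscii(8, 'WAVE');
  writeAscii(12, 'fmt ');
  view.setUint32(16, 16, true); // size of the fmt chunk
  view.setUint16(20, 1, true); // audio format 1 = linear PCM
  view.setUint16(22, numChannels, true);
  view.setUint32(24, sampleRate, true);
  view.setUint32(28, byteRate, true);
  view.setUint16(32, blockAlign, true);
  view.setUint16(34, bitsPerSample, true);
  writeAscii(36, 'data');
  view.setUint32(40, pcm.length, true); // size of the sample data
  wav.set(pcm, 44);
  return wav;
}
```

A helper like this is only needed when you request `outputFormat: 'pcm'` and later want a playable file; with the default output format the provider already returns WAV.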
+#### Speech Model Capabilities
+| Model                               | Multi-speaker       | Style via instructions |
+| ----------------------------------- | ------------------- | ---------------------- |
+| `gemini-2.5-flash-tts`              | <Check size={18} /> | <Check size={18} />    |
+| `gemini-2.5-pro-tts`                | <Check size={18} /> | <Check size={18} />    |
+| `gemini-2.5-flash-lite-preview-tts` | <Check size={18} /> | <Check size={18} />    |
+| `gemini-3.1-flash-tts-preview`      | <Check size={18} /> | <Check size={18} />    |
+### Transcription Models
+You can transcribe audio with Google Cloud Speech-to-Text models using the
+`.transcription()` factory method together with
+[`transcribe()`](/docs/reference/ai-sdk-core/transcribe).
+```ts
+import { googleVertex } from '@ai-sdk/google-vertex';
+import { transcribe } from 'ai';
+import { readFile } from 'fs/promises';
+const result = await transcribe({
+  model: googleVertex.transcription('chirp_2'),
+  audio: await readFile('audio.wav'),
+});
+```
+The provider supports [Chirp](https://docs.cloud.google.com/speech-to-text/docs/models/chirp-3)
+models `chirp_2` and `chirp_3`, plus `telephony` for phone-call audio.
+Speech-to-Text uses standard Google Cloud credentials (OAuth, Application Default
+Credentials, or a service account) and calls the Cloud Speech-to-Text API.
+Express Mode API keys are not supported for transcription models. Set
+`GOOGLE_VERTEX_LOCATION` (or `providerOptions.googleVertex.region`) to a
+Speech-to-Text region. For Chirp, `chirp_2` is available in `us-central1`,
+`europe-west4`, and `asia-southeast1`; `chirp_3` in the `us` and `eu`
+multi-regions. Chirp is not available in the `global` Speech-to-Text location,
+and these regions differ from Vertex AI regions. `telephony` availability
+depends on the selected Speech-to-Text region and language.
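The region constraints above can be checked before sending a request. The sketch below is illustrative only: `chirpRegions` and `isKnownChirpRegion` are hypothetical names, and the region lists are copied from the paragraph above, so they may lag behind Google's current availability.

```typescript
// Region availability as stated in the paragraph above (assumption: Google may
// change these lists, so treat this table as a snapshot).
const chirpRegions: Record<string, readonly string[]> = {
  chirp_2: ['us-central1', 'europe-west4', 'asia-southeast1'],
  chirp_3: ['us', 'eu'],
};

// Hypothetical helper, not part of the provider. Returns true or false for
// Chirp models, and undefined for models without a fixed list here
// (e.g. `telephony`, whose availability depends on region and language).
function isKnownChirpRegion(
  modelId: string,
  region: string,
): boolean | undefined {
  const regions = chirpRegions[modelId];
  if (regions === undefined) {
    return undefined;
  }
  return regions.indexOf(region) !== -1;
}
```

For example, `isKnownChirpRegion('chirp_3', 'global')` returns `false`, which matches the note that Chirp is not available in the `global` Speech-to-Text location.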
+The synchronous API transcribes audio up to one minute or 10 MB, whichever is
+reached first. The spoken language is auto-detected by default; pass
+`languageCodes` to restrict it. For `telephony`, pass a supported language code
+such as `['en-US']`.
+```ts
+const result = await transcribe({
+  model: googleVertex.transcription('chirp_3'),
+  audio: await readFile('audio.wav'),
+  providerOptions: {
+    googleVertex: {
+      region: 'us',
+      languageCodes: ['en-US'],
+    },
+  },
+});
+```
+The following provider options are available:
+- **languageCodes** _string[]_
+  BCP-47 language codes to recognize, or `['auto']` to detect the spoken
+  language. Defaults to `['auto']`. Multiple explicit language codes require a
+  multi-region Speech-to-Text endpoint such as `us` or `eu`.
+- **enableAutomaticPunctuation** _boolean_
+  Whether to add punctuation to the transcript. Defaults to `true`.
+- **enableWordTimeOffsets** _boolean_
+  Whether to include word-level timestamps in `result.segments`. Defaults to
+  `true`. Google notes that enabling word-level timestamps can reduce
+  transcription quality and speed.
+- **region** _string_
+  The Speech-to-Text region for the request. Defaults to the provider
+  `location`.
+#### Transcription Model Capabilities
+| Model       | Word timestamps                                           | Language detection                                                             |
+| ----------- | --------------------------------------------------------- | ------------------------------------------------------------------------------ |
+| `chirp_2`   | Available with a potential quality and speed tradeoff     | Auto detection with `['auto']`                                                 |
+| `chirp_3`   | Available with a potential transcription quality tradeoff | Auto detection with `['auto']`                                                 |
+| `telephony` | Available                                                 | Explicit supported language codes, with alternative language detection support |
 ## Google Vertex Anthropic Provider Usage
 The Google Vertex Anthropic provider for the [AI SDK](/docs) offers support for Anthropic's Claude models through the Google Vertex AI APIs. This section provides details on how to set up and use the Google Vertex Anthropic provider.
@@ -1267,7 +1469,6 @@ You can use the following optional settings to customize the Google Vertex Anthr
 - **googleAuthOptions** _object_
   Optional. The Authentication options used by the [Google Auth Library](https://github.com/googleapis/google-auth-library-nodejs/). See also the [GoogleAuthOptions](https://github.com/googleapis/google-auth-library-nodejs/blob/08978822e1b7b5961f0e355df51d738e012be392/src/auth/googleauth.ts#L87C18-L87C35) interface.
   - **authClient** _object_
     An `AuthClient` to use.
@@ -1295,7 +1496,6 @@ You can use the following optional settings to customize the Google Vertex Anthr
 - **headers** _Resolvable&lt;Record&lt;string, string | undefined&gt;&gt;_
   Headers to include in the requests. Can be provided in multiple formats:
   - A record of header key-value pairs: `Record<string, string | undefined>`
   - A function that returns headers: `() => Record<string, string | undefined>`
   - An async function that returns headers: `async () => Record<string, string | undefined>`
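The three accepted header formats can be illustrated with a small sketch. The `HeaderRecord` and `ResolvableHeaders` types and the `resolveHeaders` function below are local stand-ins written for this example, not the SDK's own definitions, and the header names are made up.

```typescript
// Local stand-in types for illustration (assumption: they only model the
// three formats listed above).
type HeaderRecord = Record<string, string | undefined>;
type ResolvableHeaders =
  | HeaderRecord
  | (() => HeaderRecord)
  | (() => Promise<HeaderRecord>);

// Hypothetical resolver: normalizes any of the three formats to a record.
async function resolveHeaders(value: ResolvableHeaders): Promise<HeaderRecord> {
  return typeof value === 'function' ? await value() : value;
}

// The same option written in each format (example header names only).
const staticHeaders: ResolvableHeaders = { 'x-example-source': 'docs' };
const lazyHeaders: ResolvableHeaders = () => ({ 'x-example-source': 'docs' });
const asyncHeaders: ResolvableHeaders = async () => ({
  'x-example-source': 'docs',
});
```

The function forms are useful when a value must be computed per request, such as a short-lived token.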
@@ -1358,7 +1558,6 @@ You can use the following optional settings to customize the provider instance:
 - **googleCredentials** _object_
   Optional. The credentials used by the Edge provider for authentication. These credentials are typically set through environment variables and are derived from a service account JSON file.
   - **clientEmail** _string_
     The client email from the service account JSON file. Defaults to the contents of the `GOOGLE_CLIENT_EMAIL` environment variable.
@@ -1371,7 +1570,6 @@ You can use the following optional settings to customize the provider instance:
 - **headers** _Resolvable&lt;Record&lt;string, string | undefined&gt;&gt;_
   Headers to include in the requests. Can be provided in multiple formats:
   - A record of header key-value pairs: `Record<string, string | undefined>`
   - A function that returns headers: `() => Record<string, string | undefined>`
   - An async function that returns headers: `async () => Record<string, string | undefined>`
@@ -1429,6 +1627,11 @@ The following optional provider options are available for Anthropic models:
   Optional. See [Reasoning section](#reasoning) for more details.
+- `metadata` _object_
+  Optional. Metadata to include with the request. See the [Anthropic API documentation](https://platform.claude.com/docs/en/api/messages/create) for details.
+  - `userId` _string_ - An external identifier for the end-user.
 ### Reasoning
 Anthropic has reasoning support for the `claude-3-7-sonnet@20250219` model.
@@ -1469,13 +1672,12 @@ on how to integrate reasoning into your chatbot.
 In the messages and message parts, you can use the `providerOptions` property to set cache control breakpoints.
 You need to set the `anthropic` property in the `providerOptions` object to `{ cacheControl: { type: 'ephemeral' } }` to set a cache control breakpoint.
-The cache creation input tokens are then returned in the `providerMetadata` object
-for `generateText`, again under the `anthropic` property.
-When you use `streamText`, the response contains a promise
-that resolves to the metadata. Alternatively you can receive it in the
-`onFinish` callback.
+Cache read and cache write (creation) token counts are returned on the standard
+`usage` object for both `generateText` and `streamText`. You can access them at
+`result.usage.inputTokenDetails.cacheReadTokens` and
+`result.usage.inputTokenDetails.cacheWriteTokens`.
-```ts highlight="8,18-20,29-30"
+```ts highlight="8,16-18,27-31"
 import { vertexAnthropic } from '@ai-sdk/google-vertex/anthropic';
 import { generateText } from 'ai';
@@ -1502,13 +1704,19 @@ const result = await generateText({
 });
 console.log(result.text);
-console.log(result.providerMetadata?.anthropic);
-// e.g. { cacheCreationInputTokens: 2118, cacheReadInputTokens: 0 }
+console.log(
+  'Cache read tokens:',
+  result.usage.inputTokenDetails.cacheReadTokens,
+);
+console.log(
+  'Cache write tokens:',
+  result.usage.inputTokenDetails.cacheWriteTokens,
+);
 ```
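If you log these counts often, a small helper can make the output easier to scan. This is a sketch under assumptions: `CacheUsage` is a local type that models only the two fields named above, and it treats missing values as zero, which may not match how every model reports usage.

```typescript
// Local type for illustration; only the fields referenced above are modeled.
interface CacheUsage {
  inputTokenDetails?: {
    cacheReadTokens?: number;
    cacheWriteTokens?: number;
  };
}

// Hypothetical helper: summarizes cache activity for one response.
function summarizeCacheUsage(usage: CacheUsage): {
  read: number;
  write: number;
  cacheHit: boolean;
} {
  const details = usage.inputTokenDetails;
  const read = details && details.cacheReadTokens ? details.cacheReadTokens : 0;
  const write =
    details && details.cacheWriteTokens ? details.cacheWriteTokens : 0;
  // A non-zero read count means part of the prompt was served from the cache.
  return { read, write, cacheHit: read > 0 };
}
```

On a first request you would typically expect a non-zero `write` and a zero `read`; a later request that reuses the same cached prefix should report the reverse.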
 You can also use cache control on system messages by providing multiple system messages at the head of your messages array:
-```ts highlight="3,9-11"
+```ts highlight="3,7-9"
 const result = await generateText({
   model: vertexAnthropic('claude-3-5-sonnet-20240620'),
   messages: [
@@ -1548,6 +1756,12 @@ Google Vertex Anthropic supports a subset of Anthropic's built-in tools. The fol
   `@ai-sdk/anthropic` provider if you need access to all Anthropic tools.
 </Note>
+<Note>
+  Google Vertex Anthropic does not support strict mode on tool definitions.
+  Setting `strict: true` on a tool will be ignored and a warning will be
+  emitted.
+</Note>
 For more background on Anthropic tools, see [Anthropic's documentation](https://platform.claude.com/docs/en/agents-and-tools/tool-use/overview).
 #### Bash Tool
@@ -1640,7 +1854,7 @@ const computerTool = vertexAnthropic.tools.computer_20241022({
   toModelOutput({ output }) {
     return typeof output === 'string'
       ? [{ type: 'text', text: output }]
-      : [{ type: 'image', data: output.data, mediaType: 'image/png' }];
+      : [{ type: 'file-data', data: output.data, mediaType: 'image/png' }];
   },
 });
 ```
@@ -1704,3 +1918,327 @@ See also [Anthropic Model Comparison](https://docs.anthropic.com/en/docs/about-c
   The table above lists popular models. You can also pass any available provider
   model ID as a string if needed.
 </Note>
+## Google Vertex xAI Provider Usage
+The Google Vertex xAI provider offers support for xAI's Grok partner models through the Google Vertex AI OpenAI-compatible Chat Completions API.
+For more information, see the [Vertex AI Grok documentation](https://cloud.google.com/vertex-ai/generative-ai/docs/partner-models/grok).
+### Provider Instance
+You can import the default provider instance `googleVertexXai` from `@ai-sdk/google-vertex/xai`:
+```typescript
+import { googleVertexXai } from '@ai-sdk/google-vertex/xai';
+```
+If you need a customized setup, you can import `createGoogleVertexXai` from `@ai-sdk/google-vertex/xai` and create a provider instance with your settings:
+```typescript
+import { createGoogleVertexXai } from '@ai-sdk/google-vertex/xai';
+const googleVertexXai = createGoogleVertexXai({
+  project: 'my-project', // optional
+  location: 'global', // optional, defaults to 'global'
+});
+```
+#### Node.js Runtime
+For Node.js environments, the Google Vertex xAI provider supports all standard Google Cloud authentication options through the `google-auth-library`:
+```typescript
+import { createGoogleVertexXai } from '@ai-sdk/google-vertex/xai';
+const googleVertexXai = createGoogleVertexXai({
+  googleAuthOptions: {
+    credentials: {
+      client_email: 'my-email',
+      private_key: 'my-private-key',
+    },
+  },
+});
+```
+##### Optional Provider Settings
+- **project** _string_
+  The Google Cloud project ID. Defaults to the `GOOGLE_VERTEX_PROJECT` environment variable.
+- **location** _string_
+  The Google Cloud location. Grok models are available on the global endpoint. Defaults to the `GOOGLE_VERTEX_LOCATION` environment variable. If not set, defaults to `global`.
+- **googleAuthOptions** _object_
+  Optional. The Authentication options used by the [Google Auth Library](https://github.com/googleapis/google-auth-library-nodejs/).
+- **headers** _Resolvable&lt;Record&lt;string, string | undefined&gt;&gt;_
+  Headers to include in requests.
+- **fetch** _(input: RequestInfo, init?: RequestInit) => Promise&lt;Response&gt;_
+  Custom [fetch](https://developer.mozilla.org/en-US/docs/Web/API/fetch) implementation.
+<a id="google-vertex-xai-edge-runtime"></a>
+#### Edge Runtime
+For Edge runtimes, import from `@ai-sdk/google-vertex/xai/edge`:
+```typescript
+import { googleVertexXai } from '@ai-sdk/google-vertex/xai/edge';
+```
+```typescript
+import { createGoogleVertexXai } from '@ai-sdk/google-vertex/xai/edge';
+const googleVertexXai = createGoogleVertexXai({
+  project: 'my-project',
+  location: 'global',
+});
+```
+For Edge runtime authentication, set these environment variables:
+- `GOOGLE_CLIENT_EMAIL`
+- `GOOGLE_PRIVATE_KEY`
+- `GOOGLE_PRIVATE_KEY_ID` (optional)
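A startup check for these variables can surface configuration mistakes early. The helper below is a hypothetical sketch, not part of the provider; it takes the environment as a plain record so it does not assume how your Edge runtime exposes environment variables.

```typescript
// Hypothetical helper: returns the names of required Edge credentials that
// are missing or empty. GOOGLE_PRIVATE_KEY_ID is optional, so it is not checked.
function missingEdgeCredentials(
  env: Record<string, string | undefined>,
): string[] {
  const required = ['GOOGLE_CLIENT_EMAIL', 'GOOGLE_PRIVATE_KEY'];
  return required.filter(name => !env[name]);
}
```

In a runtime that exposes `process.env`, you could call `missingEdgeCredentials(process.env)` and fail fast when the returned array is not empty.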
+### Language Models
+You can create models using the provider instance. The first argument is the model ID:
+```ts
+import { googleVertexXai } from '@ai-sdk/google-vertex/xai';
+import { generateText } from 'ai';
+const { text } = await generateText({
+  model: googleVertexXai('xai/grok-4.1-fast-reasoning'),
+  prompt: 'Invent a new holiday and describe its traditions.',
+});
+```
+Streaming is also supported:
+```ts
+import { googleVertexXai } from '@ai-sdk/google-vertex/xai';
+import { streamText } from 'ai';
+const result = streamText({
+  model: googleVertexXai('xai/grok-4.1-fast-reasoning'),
+  prompt: 'Invent a new holiday and describe its traditions.',
+});
+for await (const textPart of result.textStream) {
+  process.stdout.write(textPart);
+}
+```
+### Function Calling
+Grok models on Vertex support OpenAI-compatible function calling. You can use AI SDK tools as usual:
+```ts
+import { googleVertexXai } from '@ai-sdk/google-vertex/xai';
+import { generateText, tool } from 'ai';
+import { z } from 'zod';
+const result = await generateText({
+  model: googleVertexXai('xai/grok-4.1-fast-reasoning'),
+  tools: {
+    weather: tool({
+      description: 'Get the weather in a city',
+      inputSchema: z.object({ city: z.string() }),
+      execute: async ({ city }) => `The weather in ${city} is sunny.`,
+    }),
+  },
+  prompt: 'What is the weather in San Francisco?',
+});
+```
+### Structured Outputs
+Grok models on Vertex support JSON mode and schema-backed structured outputs:
+```ts
+import { googleVertexXai } from '@ai-sdk/google-vertex/xai';
+import { generateText, Output } from 'ai';
+import { z } from 'zod';
+const result = await generateText({
+  model: googleVertexXai('xai/grok-4.1-fast-reasoning'),
+  output: Output.object({
+    schema: z.object({
+      name: z.string(),
+      date: z.string(),
+      participants: z.array(z.string()),
+    }),
+  }),
+  prompt: 'Alice and Bob are going to a science fair on Friday.',
+});
+```
+### Available Models
+The following models are available through the Google Vertex xAI provider. You can also pass any valid model ID as a string.
+| Model ID                          | Reasoning |
+| --------------------------------- | --------- |
+| `xai/grok-4.20-reasoning`         | Yes       |
+| `xai/grok-4.20-non-reasoning`     | No        |
+| `xai/grok-4.1-fast-reasoning`     | Yes       |
+| `xai/grok-4.1-fast-non-reasoning` | No        |
+<Note>
+  Grok reasoning models on Vertex report reasoning token counts in usage
+  metadata. They do not support the `reasoning_effort` request parameter.
+</Note>
+## Google Vertex MaaS Provider Usage
+The Google Vertex MaaS (Model as a Service) provider offers access to partner and open models hosted on Vertex AI through an OpenAI-compatible Chat Completions API. This includes models from DeepSeek, Qwen, Meta, MiniMax, Moonshot, and OpenAI.
+For more information, see the [Vertex AI MaaS documentation](https://cloud.google.com/vertex-ai/generative-ai/docs/maas/use-open-models).
+### Provider Instance
+You can import the default provider instance `vertexMaas` from `@ai-sdk/google-vertex/maas`:
+```typescript
+import { vertexMaas } from '@ai-sdk/google-vertex/maas';
+```
+If you need a customized setup, you can import `createVertexMaas` from `@ai-sdk/google-vertex/maas` and create a provider instance with your settings:
+```typescript
+import { createVertexMaas } from '@ai-sdk/google-vertex/maas';
+const vertexMaas = createVertexMaas({
+  project: 'my-project', // optional
+  location: 'us-east5', // optional, defaults to 'global'
+});
+```
+#### Node.js Runtime
+For Node.js environments, the Google Vertex MaaS provider supports all standard Google Cloud authentication options through the `google-auth-library`:
+```typescript
+import { createVertexMaas } from '@ai-sdk/google-vertex/maas';
+const vertexMaas = createVertexMaas({
+  googleAuthOptions: {
+    credentials: {
+      client_email: 'my-email',
+      private_key: 'my-private-key',
+    },
+  },
+});
+```
+##### Optional Provider Settings
+- **project** _string_
+  The Google Cloud project ID. Defaults to the `GOOGLE_VERTEX_PROJECT` environment variable.
+- **location** _string_
+  The Google Cloud location, e.g. `us-east5` or `global`. Defaults to the `GOOGLE_VERTEX_LOCATION` environment variable. If not set, defaults to `global`.
+- **googleAuthOptions** _object_
+  Optional. The Authentication options used by the [Google Auth Library](https://github.com/googleapis/google-auth-library-nodejs/).
+- **headers** _Resolvable&lt;Record&lt;string, string | undefined&gt;&gt;_
+  Headers to include in requests.
+- **fetch** _(input: RequestInfo, init?: RequestInit) => Promise&lt;Response&gt;_
+  Custom [fetch](https://developer.mozilla.org/en-US/docs/Web/API/fetch) implementation.
+<a id="google-vertex-maas-edge-runtime"></a>
+#### Edge Runtime
+For Edge runtimes, import from `@ai-sdk/google-vertex/maas/edge`:
+```typescript
+import { vertexMaas } from '@ai-sdk/google-vertex/maas/edge';
+```
+```typescript
+import { createVertexMaas } from '@ai-sdk/google-vertex/maas/edge';
+const vertexMaas = createVertexMaas({
+  project: 'my-project',
+  location: 'us-east5',
+});
+```
+For Edge runtime authentication, set these environment variables:
+- `GOOGLE_CLIENT_EMAIL`
+- `GOOGLE_PRIVATE_KEY`
+- `GOOGLE_PRIVATE_KEY_ID` (optional)
+### Language Models
+You can create models using the provider instance. The first argument is the model ID:
+```ts
+import { vertexMaas } from '@ai-sdk/google-vertex/maas';
+import { generateText } from 'ai';
+const { text } = await generateText({
+  model: vertexMaas('deepseek-ai/deepseek-v3.2-maas'),
+  prompt: 'Invent a new holiday and describe its traditions.',
+});
+```
+Streaming is also supported:
+```ts
+import { vertexMaas } from '@ai-sdk/google-vertex/maas';
+import { streamText } from 'ai';
+const result = streamText({
+  model: vertexMaas('deepseek-ai/deepseek-v3.2-maas'),
+  prompt: 'Invent a new holiday and describe its traditions.',
+});
+for await (const textPart of result.textStream) {
+  process.stdout.write(textPart);
+}
+```
+### Available Models
+The following models are available through the MaaS provider. You can also pass any valid model ID as a string.
+| Model ID                                       | Provider |
+| ---------------------------------------------- | -------- |
+| `deepseek-ai/deepseek-r1-0528-maas`            | DeepSeek |
+| `deepseek-ai/deepseek-v3.1-maas`               | DeepSeek |
+| `deepseek-ai/deepseek-v3.2-maas`               | DeepSeek |
+| `openai/gpt-oss-120b-maas`                     | OpenAI   |
+| `openai/gpt-oss-20b-maas`                      | OpenAI   |
+| `meta/llama-4-maverick-17b-128e-instruct-maas` | Meta     |
+| `meta/llama-4-scout-17b-16e-instruct-maas`     | Meta     |
+| `minimax/minimax-m2-maas`                      | MiniMax  |
+| `qwen/qwen3-coder-480b-a35b-instruct-maas`     | Qwen     |
+| `qwen/qwen3-next-80b-a3b-instruct-maas`        | Qwen     |
+| `qwen/qwen3-next-80b-a3b-thinking-maas`        | Qwen     |
+| `moonshotai/kimi-k2-thinking-maas`             | Moonshot |
+<Note>
+  Model availability depends on your Google Cloud project and region. Check the
+  [Vertex AI Model
+  Garden](https://console.cloud.google.com/vertex-ai/model-garden) for the
+  latest available models.
+</Note>