npm - @mastra/mcp-docs-server - Versions diffs - 1.1.25-alpha.7 → 1.1.25 - Mend

@mastra/mcp-docs-server 1.1.25-alpha.7 → 1.1.25

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (33) hide show

package/.docs/docs/agents/processors.md +33 -0
package/.docs/docs/mcp/overview.md +17 -0
package/.docs/docs/memory/message-history.md +6 -0
package/.docs/docs/memory/observational-memory.md +6 -0
package/.docs/docs/voice/overview.md +8 -5
package/.docs/guides/build-your-ui/ai-sdk-ui.md +32 -0
package/.docs/guides/deployment/mastra-platform.md +2 -0
package/.docs/models/gateways/openrouter.md +2 -8
package/.docs/models/gateways/vercel.md +1 -1
package/.docs/models/index.md +1 -1
package/.docs/models/providers/anthropic.md +12 -2
package/.docs/models/providers/fireworks-ai.md +2 -1
package/.docs/models/providers/google.md +5 -1
package/.docs/models/providers/inception.md +9 -11
package/.docs/models/providers/kilo.md +2 -2
package/.docs/models/providers/nano-gpt.md +1 -2
package/.docs/models/providers/nvidia.md +2 -1
package/.docs/models/providers/openai.md +4 -0
package/.docs/models/providers/opencode-go.md +4 -2
package/.docs/models/providers/opencode.md +4 -2
package/.docs/models/providers/poe.md +5 -16
package/.docs/models/providers/xai.md +4 -0
package/.docs/reference/ai-sdk/handle-chat-stream.md +17 -0
package/.docs/reference/ai-sdk/to-ai-sdk-stream.md +15 -0
package/.docs/reference/cli/mastra.md +100 -1
package/.docs/reference/index.md +1 -0
package/.docs/reference/processors/prefill-error-handler.md +70 -0
package/.docs/reference/processors/processor-interface.md +110 -12
package/.docs/reference/tools/create-tool.md +30 -0
package/.docs/reference/tools/mcp-client.md +45 -0
package/.docs/reference/voice/sarvam.md +29 -23
package/CHANGELOG.md +16 -0
package/package.json +6 -6

package/.docs/docs/agents/processors.md CHANGED Viewed

@@ -11,6 +11,8 @@ You can use individual [`Processor`](https://mastra.ai/reference/processors/proc
 Some processors implement both input and output logic and can be used in either array depending on where the transformation should occur.
+Some built-in processors also persist hidden system reminder messages using `<system-reminder>...</system-reminder>` text plus `metadata.systemReminder`. These reminders stay available in raw memory history and retry/prompt reconstruction paths, but standard UI-facing message conversions and default memory recall hide them unless you explicitly opt in.
 ## When to use processors
 Use processors to:
@@ -536,6 +538,37 @@ The retry mechanism:
 - Tracks retry count via the `retryCount` parameter
 - Respects `maxProcessorRetries` limit on the agent
+## API error handling
+The `processAPIError` method handles LLM API rejections — errors where the API rejects the request (such as 400 or 422 status codes) rather than network or server failures. This lets you modify the request and retry when the API rejects the message format.
+```typescript
+import { APICallError } from '@ai-sdk/provider'
+import type { Processor, ProcessAPIErrorArgs, ProcessAPIErrorResult } from '@mastra/core/processors'
+export class ContextLengthHandler implements Processor {
+  id = 'context-length-handler'
+  processAPIError({
+    error,
+    messageList,
+    retryCount,
+  }: ProcessAPIErrorArgs): ProcessAPIErrorResult | void {
+    if (retryCount > 0) return
+    if (APICallError.isInstance(error) && error.message.includes('context length exceeded')) {
+      const messages = messageList.get.all.db()
+      if (messages.length > 4) {
+        messageList.removeByIds([messages[1]!.id, messages[2]!.id])
+        return { retry: true }
+      }
+    }
+  }
+}
+```
+Mastra includes a built-in [`PrefillErrorHandler`](https://mastra.ai/reference/processors/prefill-error-handler) that automatically handles the Anthropic "assistant message prefill" error. This processor is auto-injected and requires no configuration.
 ## Related documentation
 - [Guardrails](https://mastra.ai/docs/agents/guardrails): Security and validation processors

package/.docs/docs/mcp/overview.md CHANGED Viewed

@@ -89,6 +89,23 @@ export const testAgent = new Agent({
 > **Info:** Visit [Agent Class](https://mastra.ai/reference/agents/agent) for a full list of configuration options.
+## Tool approval
+You can require human approval before MCP tools are executed by setting `requireToolApproval` on a server definition. This integrates with the existing [human-in-the-loop](https://mastra.ai/docs/workflows/human-in-the-loop) approval flow.
+```typescript
+export const mcp = new MCPClient({
+  servers: {
+    github: {
+      url: new URL('http://localhost:3000/mcp'),
+      requireToolApproval: true,
+    },
+  },
+})
+```
+You can also pass a function to decide dynamically per-call. See the [MCPClient reference](https://mastra.ai/reference/tools/mcp-client) for the full API.
 ## Configuring `MCPServer`
 To expose agents, tools, and workflows from your Mastra application to external systems over HTTP(S) use the `MCPServer` class. This makes them accessible to any system or agent that supports the protocol.

package/.docs/docs/memory/message-history.md CHANGED Viewed

@@ -6,6 +6,12 @@ You can also retrieve message history to display past conversations in your UI.
 > **Info:** Each message belongs to a thread (the conversation) and a resource (the user or entity it's associated with). See [Threads and resources](https://mastra.ai/docs/memory/storage) for more detail.
+> **Warning:** When you use memory with a client application, send **only the new message** from the client instead of the full conversation history.
+>
+> Sending the full history is redundant because Mastra loads messages from storage, and it can cause message ordering bugs when client-side timestamps conflict with stored timestamps.
+>
+> For an AI SDK example, see [Using Mastra Memory](https://mastra.ai/guides/build-your-ui/ai-sdk-ui).
 ## Getting started
 Install the Mastra memory module along with a [storage adapter](https://mastra.ai/docs/memory/storage) for your database. The examples below use `@mastra/libsql`, which stores data locally in a `mastra.db` file.

package/.docs/docs/memory/observational-memory.md CHANGED Viewed

@@ -38,6 +38,12 @@ const memory = new Memory({
 See [configuration options](https://mastra.ai/reference/memory/observational-memory) for full API details.
+> **Warning:** When you use OM with a client application, send **only the new message** from the client instead of the full conversation history.
+>
+> Observational memory still relies on stored conversation history. Sending the full history is redundant and can cause message ordering bugs when client-side timestamps conflict with stored timestamps.
+>
+> For an AI SDK example, see [Using Mastra Memory](https://mastra.ai/guides/build-your-ui/ai-sdk-ui).
 > **Note:** OM currently only supports `@mastra/pg`, `@mastra/libsql`, and `@mastra/mongodb` storage adapters. It uses background agents for managing memory. When using `observationalMemory: true`, the default model is `google/gemini-2.5-flash`. When passing a config object, a `model` must be explicitly set.
 ## Benefits

package/.docs/docs/voice/overview.md CHANGED Viewed

@@ -265,7 +265,7 @@ const { text } = await voiceAgent.generate('What color is the sky?')
 // Convert text to speech to an Audio Stream
 const audioStream = await voiceAgent.voice.speak(text, {
-  speaker: 'default', // Optional: specify a speaker
+  speaker: 'shubh', // Optional: specify a bulbul:v3 speaker
 })
 playAudio(audioStream)
@@ -760,12 +760,15 @@ Visit the [Speechify Voice Reference](https://mastra.ai/reference/voice/speechif
 // Sarvam Voice Configuration
 const voice = new SarvamVoice({
   speechModel: {
-    name: 'sarvam-voice', // Example model name
+    model: 'bulbul:v3', // TTS model (bulbul:v2 or bulbul:v3)
+    apiKey: process.env.SARVAM_API_KEY,
+    language: 'en-IN', // BCP-47 language code
+  },
+  listeningModel: {
+    model: 'saarika:v2.5', // STT model (saarika:v2.5 or saaras:v3)
     apiKey: process.env.SARVAM_API_KEY,
-    language: 'en-IN', // Language code
-    style: 'conversational', // Style setting
   },
-  // Sarvam may not have a separate listening model
+  speaker: 'shubh', // Default bulbul:v3 speaker
 })
 ```

package/.docs/guides/build-your-ui/ai-sdk-ui.md CHANGED Viewed

@@ -238,6 +238,38 @@ export default function Chat() {
 Use [`prepareSendMessagesRequest`](https://ai-sdk.dev/docs/reference/ai-sdk-ui/use-chat#transport.default-chat-transport.prepare-send-messages-request) to customize the request sent to the chat route, for example to pass additional configuration to the agent.
+## Using Mastra Memory
+When your agent has [memory](https://mastra.ai/docs/memory/overview) configured, Mastra loads conversation history from storage on the server. Send only the new message from the client instead of the full conversation history.
+Sending the full history is redundant and can cause message ordering bugs because client-side timestamps can conflict with the timestamps stored in your database.
+```typescript
+import { useChat } from '@ai-sdk/react'
+import { DefaultChatTransport } from 'ai'
+const { messages, sendMessage } = useChat({
+  transport: new DefaultChatTransport({
+    api: 'http://localhost:4111/chat/weatherAgent',
+    prepareSendMessagesRequest({ messages }) {
+      return {
+        body: {
+          messages: [messages[messages.length - 1]],
+          threadId: 'user-thread-123',
+          resourceId: 'user-123',
+        },
+      }
+    },
+  }),
+})
+```
+Set `threadId` and `resourceId` from your app's own state, such as URL params, auth context, or your database.
+See [Message history](https://mastra.ai/docs/memory/message-history) for more on how Mastra memory loads and stores messages.
+[`chatRoute()`](https://mastra.ai/reference/ai-sdk/chat-route) and [`handleChatStream()`](https://mastra.ai/reference/ai-sdk/handle-chat-stream) already work with memory. Configure the client to send only the new message and include the thread and resource identifiers.
 ### `useCompletion()`
 The `useCompletion()` hook handles single-turn completions between your frontend and a Mastra agent, allowing you to send a prompt and receive a streamed response over HTTP.

package/.docs/guides/deployment/mastra-platform.md CHANGED Viewed

@@ -182,6 +182,8 @@ The CLI reads `organizationId` and `projectId` from `.mastra-project.json` by de
 ## Related
 - [CLI reference: `mastra server deploy`](https://mastra.ai/reference/cli/mastra)
+- [CLI reference: `mastra server pause`](https://mastra.ai/reference/cli/mastra)
+- [CLI reference: `mastra server restart`](https://mastra.ai/reference/cli/mastra)
 - [CLI reference: `mastra studio deploy`](https://mastra.ai/reference/cli/mastra)
 - [CLI reference: `mastra auth tokens`](https://mastra.ai/reference/cli/mastra)
 - [Mastra platform overview](https://mastra.ai/docs/mastra-platform/overview)

package/.docs/models/gateways/openrouter.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # ![OpenRouter logo](https://models.dev/logos/openrouter.svg)OpenRouter
-OpenRouter aggregates models from multiple providers with enhanced features like rate limiting and failover. Access 176 models through Mastra's model router.
+OpenRouter aggregates models from multiple providers with enhanced features like rate limiting and failover. Access 170 models through Mastra's model router.
 Learn more in the [OpenRouter documentation](https://openrouter.ai/models).
@@ -46,7 +46,6 @@ ANTHROPIC_API_KEY=ant-...
 | `anthropic/claude-sonnet-4.6`                                   |
 | `arcee-ai/trinity-large-preview:free`                           |
 | `arcee-ai/trinity-large-thinking`                               |
-| `arcee-ai/trinity-mini:free`                                    |
 | `black-forest-labs/flux.2-flex`                                 |
 | `black-forest-labs/flux.2-klein-4b`                             |
 | `black-forest-labs/flux.2-max`                                  |
@@ -88,9 +87,8 @@ ANTHROPIC_API_KEY=ant-...
 | `google/gemma-4-26b-a4b-it:free`                                |
 | `google/gemma-4-31b-it`                                         |
 | `google/gemma-4-31b-it:free`                                    |
-| `inception/mercury`                                             |
 | `inception/mercury-2`                                           |
-| `inception/mercury-coder`                                       |
+| `inception/mercury-edit-2`                                      |
 | `liquid/lfm-2.5-1.2b-instruct:free`                             |
 | `liquid/lfm-2.5-1.2b-thinking:free`                             |
 | `meta-llama/llama-3.2-11b-vision-instruct`                      |
@@ -117,7 +115,6 @@ ANTHROPIC_API_KEY=ant-...
 | `moonshotai/kimi-k2-0905`                                       |
 | `moonshotai/kimi-k2-0905:exacto`                                |
 | `moonshotai/kimi-k2-thinking`                                   |
-| `moonshotai/kimi-k2:free`                                       |
 | `moonshotai/kimi-k2.5`                                          |
 | `nousresearch/hermes-3-llama-3.1-405b:free`                     |
 | `nousresearch/hermes-4-405b`                                    |
@@ -168,15 +165,12 @@ ANTHROPIC_API_KEY=ant-...
 | `qwen/qwen3-235b-a22b-thinking-2507`                            |
 | `qwen/qwen3-30b-a3b-instruct-2507`                              |
 | `qwen/qwen3-30b-a3b-thinking-2507`                              |
-| `qwen/qwen3-4b:free`                                            |
 | `qwen/qwen3-coder`                                              |
 | `qwen/qwen3-coder-30b-a3b-instruct`                             |
 | `qwen/qwen3-coder-flash`                                        |
 | `qwen/qwen3-coder:exacto`                                       |
-| `qwen/qwen3-coder:free`                                         |
 | `qwen/qwen3-max`                                                |
 | `qwen/qwen3-next-80b-a3b-instruct`                              |
-| `qwen/qwen3-next-80b-a3b-instruct:free`                         |
 | `qwen/qwen3-next-80b-a3b-thinking`                              |
 | `qwen/qwen3.5-397b-a17b`                                        |
 | `qwen/qwen3.5-flash-02-23`                                      |

package/.docs/models/gateways/vercel.md CHANGED Viewed

@@ -119,7 +119,7 @@ ANTHROPIC_API_KEY=ant-...
 | `google/text-embedding-005`                    |
 | `google/text-multilingual-embedding-002`       |
 | `inception/mercury-2`                          |
-| `inception/mercury-coder-small`                |
+| `inception/mercury-edit-2`                     |
 | `kwaipilot/kat-coder-pro-v1`                   |
 | `kwaipilot/kat-coder-pro-v2`                   |
 | `meituan/longcat-flash-chat`                   |

package/.docs/models/index.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # Model Providers
-Mastra provides a unified interface for working with LLMs across multiple providers, giving you access to 3610 models from 99 providers through a single API.
+Mastra provides a unified interface for working with LLMs across multiple providers, giving you access to 3596 models from 99 providers through a single API.
 ## Features

package/.docs/models/providers/anthropic.md CHANGED Viewed

@@ -114,13 +114,23 @@ const response = await agent.generate("Hello!", {
 **cacheControl** (`{ type: "ephemeral"; ttl?: "5m" | "1h" | undefined; } | undefined`)
+**metadata** (`{ userId?: string | undefined; } | undefined`)
+**mcpServers** (`{ type: "url"; name: string; url: string; authorizationToken?: string | null | undefined; toolConfiguration?: { enabled?: boolean | null | undefined; allowedTools?: string[] | null | undefined; } | null | undefined; }[] | undefined`)
 **container** (`{ id?: string | undefined; skills?: { type: "anthropic" | "custom"; skillId: string; version?: string | undefined; }[] | undefined; } | undefined`)
+**toolStreaming** (`boolean | undefined`)
 **effort** (`"low" | "medium" | "high" | "max" | undefined`)
-**speed** (`"fast" | undefined`)
+**speed** (`"fast" | "standard" | undefined`)
+**inferenceGeo** (`"us" | "global" | undefined`)
+**anthropicBeta** (`string[] | undefined`)
-**contextManagement** (`{ edits: ({ type: "clear_01"; trigger?: { type: "input_tokens"; value: number; } | undefined; keep?: "all" | { type: "thinking_turns"; value: number; } | undefined; } | { type: "compact_20260112"; trigger?: { ...; } | undefined; pauseAfterCompaction?: boolean | undefined; instructions?: string | undefined; })[]; } |...`)
+**contextManagement** (`{ edits: ({ type: "clear_tool_uses_20250919"; trigger?: { type: "input_tokens"; value: number; } | { type: "tool_uses"; value: number; } | undefined; keep?: { type: "tool_uses"; value: number; } | undefined; clearAtLeast?: { ...; } | undefined; clearToolInputs?: boolean | undefined; excludeTools?: string[] | undefin...`)
 ## Direct provider installation

package/.docs/models/providers/fireworks-ai.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # ![Fireworks AI logo](https://models.dev/logos/fireworks-ai.svg)Fireworks AI
-Access 16 Fireworks AI models through Mastra's model router. Authentication is handled automatically using the `FIREWORKS_API_KEY` environment variable.
+Access 17 Fireworks AI models through Mastra's model router. Authentication is handled automatically using the `FIREWORKS_API_KEY` environment variable.
 Learn more in the [Fireworks AI documentation](https://fireworks.ai/docs/).
@@ -48,6 +48,7 @@ for await (const chunk of stream) {
 | `fireworks-ai/accounts/fireworks/models/kimi-k2p5`        | 256K    |       |           |       |       |       | $0.60      | $3          |
 | `fireworks-ai/accounts/fireworks/models/minimax-m2p1`     | 200K    |       |           |       |       |       | $0.30      | $1          |
 | `fireworks-ai/accounts/fireworks/models/minimax-m2p5`     | 197K    |       |           |       |       |       | $0.30      | $1          |
+| `fireworks-ai/accounts/fireworks/models/minimax-m2p7`     | 197K    |       |           |       |       |       | $0.30      | $1          |
 | `fireworks-ai/accounts/fireworks/models/qwen3p6-plus`     | 128K    |       |           |       |       |       | $0.50      | $3          |
 | `fireworks-ai/accounts/fireworks/routers/kimi-k2p5-turbo` | 256K    |       |           |       |       |       | —          | —           |

package/.docs/models/providers/google.md CHANGED Viewed

@@ -137,10 +137,14 @@ const response = await agent.generate("Hello!", {
 **mediaResolution** (`"MEDIA_RESOLUTION_UNSPECIFIED" | "MEDIA_RESOLUTION_LOW" | "MEDIA_RESOLUTION_MEDIUM" | "MEDIA_RESOLUTION_HIGH" | undefined`)
-**imageConfig** (`{ aspectRatio?: "1:1" | "2:3" | "3:2" | "3:4" | "4:3" | "4:5" | "5:4" | "9:16" | "16:9" | "21:9" | undefined; imageSize?: "1K" | "2K" | "4K" | undefined; } | undefined`)
+**imageConfig** (`{ aspectRatio?: "1:1" | "2:3" | "3:2" | "3:4" | "4:3" | "4:5" | "5:4" | "9:16" | "16:9" | "21:9" | "1:8" | "8:1" | "1:4" | "4:1" | undefined; imageSize?: "1K" | "2K" | "4K" | "512" | undefined; } | undefined`)
 **retrievalConfig** (`{ latLng?: { latitude: number; longitude: number; } | undefined; } | undefined`)
+**streamFunctionCallArguments** (`boolean | undefined`)
+**serviceTier** (`"standard" | "flex" | "priority" | undefined`)
 ## Direct provider installation
 This provider can also be installed directly as a standalone package, which can be used instead of the Mastra model router string. View the [package documentation](https://www.npmjs.com/package/@ai-sdk/google) for more details.

package/.docs/models/providers/inception.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # ![Inception logo](https://models.dev/logos/inception.svg)Inception
-Access 4 Inception models through Mastra's model router. Authentication is handled automatically using the `INCEPTION_API_KEY` environment variable.
+Access 2 Inception models through Mastra's model router. Authentication is handled automatically using the `INCEPTION_API_KEY` environment variable.
 Learn more in the [Inception documentation](https://platform.inceptionlabs.ai/docs).
@@ -15,7 +15,7 @@ const agent = new Agent({
   id: "my-agent",
   name: "My Agent",
   instructions: "You are a helpful assistant",
-  model: "inception/mercury"
+  model: "inception/mercury-2"
 });
 // Generate a response
@@ -32,12 +32,10 @@ for await (const chunk of stream) {
 ## Models
-| Model                     | Context | Tools | Reasoning | Image | Audio | Video | Input $/1M | Output $/1M |
-| ------------------------- | ------- | ----- | --------- | ----- | ----- | ----- | ---------- | ----------- |
-| `inception/mercury`       | 128K    |       |           |       |       |       | $0.25      | $1          |
-| `inception/mercury-2`     | 128K    |       |           |       |       |       | $0.25      | $0.75       |
-| `inception/mercury-coder` | 128K    |       |           |       |       |       | $0.25      | $1          |
-| `inception/mercury-edit`  | 128K    |       |           |       |       |       | $0.25      | $0.75       |
+| Model                      | Context | Tools | Reasoning | Image | Audio | Video | Input $/1M | Output $/1M |
+| -------------------------- | ------- | ----- | --------- | ----- | ----- | ----- | ---------- | ----------- |
+| `inception/mercury-2`      | 128K    |       |           |       |       |       | $0.25      | $0.75       |
+| `inception/mercury-edit-2` | 128K    |       |           |       |       |       | $0.25      | $0.75       |
 ## Advanced configuration
@@ -49,7 +47,7 @@ const agent = new Agent({
   name: "custom-agent",
   model: {
     url: "https://api.inceptionlabs.ai/v1/",
-    id: "inception/mercury",
+    id: "inception/mercury-2",
     apiKey: process.env.INCEPTION_API_KEY,
     headers: {
       "X-Custom-Header": "value"
@@ -67,8 +65,8 @@ const agent = new Agent({
   model: ({ requestContext }) => {
     const useAdvanced = requestContext.task === "complex";
     return useAdvanced
-      ? "inception/mercury-edit"
-      : "inception/mercury";
+      ? "inception/mercury-edit-2"
+      : "inception/mercury-2";
   }
 });
 ```

package/.docs/models/providers/kilo.md CHANGED Viewed

@@ -127,9 +127,8 @@ for await (const chunk of stream) {
 | `kilo/google/lyria-3-pro-preview`                   | 1.0M    |       |           |       |       |       | —          | —           |
 | `kilo/gryphe/mythomax-l2-13b`                       | 4K      |       |           |       |       |       | $0.06      | $0.06       |
 | `kilo/ibm-granite/granite-4.0-h-micro`              | 131K    |       |           |       |       |       | $0.02      | $0.11       |
-| `kilo/inception/mercury`                            | 128K    |       |           |       |       |       | $0.25      | $0.75       |
 | `kilo/inception/mercury-2`                          | 128K    |       |           |       |       |       | $0.25      | $0.75       |
-| `kilo/inception/mercury-coder`                      | 128K    |       |           |       |       |       | $0.25      | $0.75       |
+| `kilo/inception/mercury-edit-2`                     | 128K    |       |           |       |       |       | $0.25      | $0.75       |
 | `kilo/inflection/inflection-3-pi`                   | 8K      |       |           |       |       |       | $3         | $10         |
 | `kilo/inflection/inflection-3-productivity`         | 8K      |       |           |       |       |       | $3         | $10         |
 | `kilo/kilo-auto/balanced`                           | 205K    |       |           |       |       |       | $0.60      | $3          |
@@ -268,6 +267,7 @@ for await (const chunk of stream) {
 | `kilo/openai/o4-mini-high`                          | 200K    |       |           |       |       |       | $1         | $4          |
 | `kilo/openrouter/auto`                              | 2.0M    |       |           |       |       |       | —          | —           |
 | `kilo/openrouter/bodybuilder`                       | 128K    |       |           |       |       |       | —          | —           |
+| `kilo/openrouter/elephant-alpha`                    | 262K    |       |           |       |       |       | —          | —           |
 | `kilo/openrouter/free`                              | 200K    |       |           |       |       |       | —          | —           |
 | `kilo/perplexity/sonar`                             | 127K    |       |           |       |       |       | $1         | $1          |
 | `kilo/perplexity/sonar-deep-research`               | 128K    |       |           |       |       |       | $2         | $8          |

package/.docs/models/providers/nano-gpt.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # ![NanoGPT logo](https://models.dev/logos/nano-gpt.svg)NanoGPT
-Access 519 NanoGPT models through Mastra's model router. Authentication is handled automatically using the `NANO_GPT_API_KEY` environment variable.
+Access 518 NanoGPT models through Mastra's model router. Authentication is handled automatically using the `NANO_GPT_API_KEY` environment variable.
 Learn more in the [NanoGPT documentation](https://docs.nano-gpt.com).
@@ -319,7 +319,6 @@ for await (const chunk of stream) {
 | `nano-gpt/meganova-ai/manta-mini-1.0`                               | 8K      |       |           |       |       |       | $0.02      | $0.16       |
 | `nano-gpt/meganova-ai/manta-pro-1.0`                                | 33K     |       |           |       |       |       | $0.06      | $0.50       |
 | `nano-gpt/meituan-longcat/LongCat-Flash-Chat-FP8`                   | 128K    |       |           |       |       |       | $0.15      | $0.70       |
-| `nano-gpt/mercury-coder-small`                                      | 33K     |       |           |       |       |       | $0.25      | $1          |
 | `nano-gpt/Meta-Llama-3-1-8B-Instruct-FP8`                           | 128K    |       |           |       |       |       | $0.02      | $0.03       |
 | `nano-gpt/meta-llama/llama-3.1-8b-instruct`                         | 131K    |       |           |       |       |       | $0.05      | $0.05       |
 | `nano-gpt/meta-llama/llama-3.2-3b-instruct`                         | 131K    |       |           |       |       |       | $0.03      | $0.05       |

package/.docs/models/providers/nvidia.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # ![Nvidia logo](https://models.dev/logos/nvidia.svg)Nvidia
-Access 75 Nvidia models through Mastra's model router. Authentication is handled automatically using the `NVIDIA_API_KEY` environment variable.
+Access 76 Nvidia models through Mastra's model router. Authentication is handled automatically using the `NVIDIA_API_KEY` environment variable.
 Learn more in the [Nvidia documentation](https://docs.api.nvidia.com/nim/).
@@ -71,6 +71,7 @@ for await (const chunk of stream) {
 | `nvidia/microsoft/phi-4-mini-instruct`                 | 131K    |       |           |       |       |       | —          | —           |
 | `nvidia/minimaxai/minimax-m2.1`                        | 205K    |       |           |       |       |       | —          | —           |
 | `nvidia/minimaxai/minimax-m2.5`                        | 205K    |       |           |       |       |       | —          | —           |
+| `nvidia/minimaxai/minimax-m2.7`                        | 205K    |       |           |       |       |       | $0.30      | $1          |
 | `nvidia/mistralai/codestral-22b-instruct-v0.1`         | 128K    |       |           |       |       |       | —          | —           |
 | `nvidia/mistralai/devstral-2-123b-instruct-2512`       | 262K    |       |           |       |       |       | —          | —           |
 | `nvidia/mistralai/mamba-codestral-7b-v0.1`             | 128K    |       |           |       |       |       | —          | —           |

package/.docs/models/providers/openai.md CHANGED Viewed

@@ -171,6 +171,10 @@ const response = await agent.generate("Hello!", {
 **user** (`string | null | undefined`)
+**systemMessageMode** (`"remove" | "system" | "developer" | undefined`)
+**forceReasoning** (`boolean | undefined`)
 ## Direct provider installation
 This provider can also be installed directly as a standalone package, which can be used instead of the Mastra model router string. View the [package documentation](https://www.npmjs.com/package/@ai-sdk/openai) for more details.

package/.docs/models/providers/opencode-go.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # ![OpenCode Go logo](https://models.dev/logos/opencode-go.svg)OpenCode Go
-Access 7 OpenCode Go models through Mastra's model router. Authentication is handled automatically using the `OPENCODE_API_KEY` environment variable.
+Access 9 OpenCode Go models through Mastra's model router. Authentication is handled automatically using the `OPENCODE_API_KEY` environment variable.
 Learn more in the [OpenCode Go documentation](https://opencode.ai/docs/zen).
@@ -41,6 +41,8 @@ for await (const chunk of stream) {
 | `opencode-go/mimo-v2-pro`  | 1.0M    |       |           |       |       |       | $1         | $3          |
 | `opencode-go/minimax-m2.5` | 205K    |       |           |       |       |       | $0.30      | $1          |
 | `opencode-go/minimax-m2.7` | 205K    |       |           |       |       |       | $0.30      | $1          |
+| `opencode-go/qwen3.5-plus` | 262K    |       |           |       |       |       | $0.20      | $1          |
+| `opencode-go/qwen3.6-plus` | 262K    |       |           |       |       |       | $0.50      | $3          |
 ## Advanced configuration
@@ -70,7 +72,7 @@ const agent = new Agent({
   model: ({ requestContext }) => {
     const useAdvanced = requestContext.task === "complex";
     return useAdvanced
-      ? "opencode-go/minimax-m2.7"
+      ? "opencode-go/qwen3.6-plus"
       : "opencode-go/glm-5";
   }
 });

package/.docs/models/providers/opencode.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # ![OpenCode Zen logo](https://models.dev/logos/opencode.svg)OpenCode Zen
-Access 32 OpenCode Zen models through Mastra's model router. Authentication is handled automatically using the `OPENCODE_API_KEY` environment variable.
+Access 34 OpenCode Zen models through Mastra's model router. Authentication is handled automatically using the `OPENCODE_API_KEY` environment variable.
 Learn more in the [OpenCode Zen documentation](https://opencode.ai/docs/zen).
@@ -66,6 +66,8 @@ for await (const chunk of stream) {
 | `opencode/minimax-m2.5`          | 205K    |       |           |       |       |       | $0.30      | $1          |
 | `opencode/minimax-m2.5-free`     | 205K    |       |           |       |       |       | —          | —           |
 | `opencode/nemotron-3-super-free` | 205K    |       |           |       |       |       | —          | —           |
+| `opencode/qwen3.5-plus`          | 262K    |       |           |       |       |       | $0.20      | $1          |
+| `opencode/qwen3.6-plus`          | 262K    |       |           |       |       |       | $0.50      | $3          |
 ## Advanced configuration
@@ -95,7 +97,7 @@ const agent = new Agent({
   model: ({ requestContext }) => {
     const useAdvanced = requestContext.task === "complex";
     return useAdvanced
-      ? "opencode/nemotron-3-super-free"
+      ? "opencode/qwen3.6-plus"
       : "opencode/big-pickle";
   }
 });

package/.docs/models/providers/poe.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # ![Poe logo](https://models.dev/logos/poe.svg)Poe
-Access 128 Poe models through Mastra's model router. Authentication is handled automatically using the `POE_API_KEY` environment variable.
+Access 117 Poe models through Mastra's model router. Authentication is handled automatically using the `POE_API_KEY` environment variable.
 Learn more in the [Poe documentation](https://creator.poe.com/docs/external-applications/openai-compatible-api).
@@ -41,17 +41,12 @@ for await (const chunk of stream) {
 | `poe/anthropic/claude-opus-4.1`        | 197K    |       |           |       |       |       | $13        | $64         |
 | `poe/anthropic/claude-opus-4.5`        | 197K    |       |           |       |       |       | $4         | $21         |
 | `poe/anthropic/claude-opus-4.6`        | 983K    |       |           |       |       |       | $4         | $21         |
-| `poe/anthropic/claude-sonnet-3.5`      | 189K    |       |           |       |       |       | $3         | $13         |
-| `poe/anthropic/claude-sonnet-3.5-june` | 189K    |       |           |       |       |       | $3         | $13         |
 | `poe/anthropic/claude-sonnet-3.7`      | 197K    |       |           |       |       |       | $3         | $13         |
 | `poe/anthropic/claude-sonnet-4`        | 983K    |       |           |       |       |       | $3         | $13         |
 | `poe/anthropic/claude-sonnet-4.5`      | 983K    |       |           |       |       |       | $3         | $13         |
 | `poe/anthropic/claude-sonnet-4.6`      | 983K    |       |           |       |       |       | $3         | $13         |
-| `poe/cerebras/gpt-oss-120b-cs`         | —       |       |           |       |       |       | —          | —           |
-| `poe/cerebras/llama-3.1-8b-cs`         | —       |       |           |       |       |       | —          | —           |
-| `poe/cerebras/llama-3.3-70b-cs`        | —       |       |           |       |       |       | —          | —           |
-| `poe/cerebras/qwen3-235b-2507-cs`      | —       |       |           |       |       |       | —          | —           |
-| `poe/cerebras/qwen3-32b-cs`            | —       |       |           |       |       |       | —          | —           |
+| `poe/cerebras/gpt-oss-120b-cs`         | 128K    |       |           |       |       |       | $0.35      | $0.75       |
+| `poe/cerebras/llama-3.1-8b-cs`         | 128K    |       |           |       |       |       | $0.10      | $0.10       |
 | `poe/elevenlabs/elevenlabs-music`      | 2K      |       |           |       |       |       | —          | —           |
 | `poe/elevenlabs/elevenlabs-v2.5-turbo` | 128K    |       |           |       |       |       | —          | —           |
 | `poe/elevenlabs/elevenlabs-v3`         | 128K    |       |           |       |       |       | —          | —           |
@@ -62,10 +57,8 @@ for await (const chunk of stream) {
 | `poe/google/gemini-2.5-flash-lite`     | 1.0M    |       |           |       |       |       | $0.07      | $0.28       |
 | `poe/google/gemini-2.5-pro`            | 1.1M    |       |           |       |       |       | $0.87      | $7          |
 | `poe/google/gemini-3-flash`            | 1.0M    |       |           |       |       |       | $0.40      | $2          |
-| `poe/google/gemini-3-pro`              | 1.0M    |       |           |       |       |       | $2         | $10         |
 | `poe/google/gemini-3.1-flash-lite`     | 1.0M    |       |           |       |       |       | $0.25      | $2          |
 | `poe/google/gemini-3.1-pro`            | 1.0M    |       |           |       |       |       | $2         | $12         |
-| `poe/google/gemini-deep-research`      | 1.0M    |       |           |       |       |       | $2         | $10         |
 | `poe/google/gemma-4-31b`               | 262K    |       |           |       |       |       | —          | —           |
 | `poe/google/imagen-3`                  | 480     |       |           |       |       |       | —          | —           |
 | `poe/google/imagen-3-fast`             | 480     |       |           |       |       |       | —          | —           |
@@ -88,20 +81,16 @@ for await (const chunk of stream) {
 | `poe/novita/deepseek-v3.2`             | 128K    |       |           |       |       |       | $0.27      | $0.40       |
 | `poe/novita/glm-4.6`                   | —       |       |           |       |       |       | —          | —           |
 | `poe/novita/glm-4.6v`                  | 131K    |       |           |       |       |       | —          | —           |
-| `poe/novita/glm-4.7`                   | 205K    |       |           |       |       |       | —          | —           |
 | `poe/novita/glm-4.7-flash`             | 200K    |       |           |       |       |       | —          | —           |
 | `poe/novita/glm-4.7-n`                 | 205K    |       |           |       |       |       | —          | —           |
-| `poe/novita/glm-5`                     | 205K    |       |           |       |       |       | —          | —           |
+| `poe/novita/glm-5`                     | 205K    |       |           |       |       |       | $1         | $3          |
 | `poe/novita/kimi-k2-thinking`          | 256K    |       |           |       |       |       | —          | —           |
-| `poe/novita/kimi-k2.5`                 | 256K    |       |           |       |       |       | —          | —           |
+| `poe/novita/kimi-k2.5`                 | 128K    |       |           |       |       |       | $0.60      | $3          |
 | `poe/novita/minimax-m2.1`              | 205K    |       |           |       |       |       | —          | —           |
-| `poe/openai/chatgpt-4o-latest`         | 128K    |       |           |       |       |       | $5         | $14         |
 | `poe/openai/dall-e-3`                  | 800     |       |           |       |       |       | —          | —           |
 | `poe/openai/gpt-3.5-turbo`             | 16K     |       |           |       |       |       | $0.45      | $1          |
 | `poe/openai/gpt-3.5-turbo-instruct`    | 4K      |       |           |       |       |       | $1         | $2          |
 | `poe/openai/gpt-3.5-turbo-raw`         | 5K      |       |           |       |       |       | $0.45      | $1          |
-| `poe/openai/gpt-4-classic`             | 8K      |       |           |       |       |       | $27        | $54         |
-| `poe/openai/gpt-4-classic-0314`        | 8K      |       |           |       |       |       | $27        | $54         |
 | `poe/openai/gpt-4-turbo`               | 128K    |       |           |       |       |       | $9         | $27         |
 | `poe/openai/gpt-4.1`                   | 1.0M    |       |           |       |       |       | $2         | $7          |
 | `poe/openai/gpt-4.1-mini`              | 1.0M    |       |           |       |       |       | $0.36      | $1          |

package/.docs/models/providers/xai.md CHANGED Viewed

@@ -109,6 +109,10 @@ const response = await agent.generate("Hello!", {
 **reasoningEffort** (`"low" | "high" | undefined`)
+**logprobs** (`boolean | undefined`)
+**topLogprobs** (`number | undefined`)
 **parallel\_function\_calling** (`boolean | undefined`)
 **searchParameters** (`{ mode: "off" | "auto" | "on"; returnCitations?: boolean | undefined; fromDate?: string | undefined; toDate?: string | undefined; maxSearchResults?: number | undefined; sources?: ({ ...; } | ... 2 more ... | { ...; })[] | undefined; } | undefined`)

package/.docs/reference/ai-sdk/handle-chat-stream.md CHANGED Viewed

@@ -8,6 +8,23 @@ Framework-agnostic handler for streaming agent chat in AI SDK-compatible format.
 Use [`chatRoute()`](https://mastra.ai/reference/ai-sdk/chat-route) if you want to create a chat route inside a Mastra server.
+## Structured output in UI streams
+When you pass `structuredOutput` to the underlying agent execution, the final structured output object is emitted in the AI SDK-compatible UI stream as a custom data part:
+```json
+{
+  "type": "data-structured-output",
+  "data": {
+    "object": {}
+  }
+}
+```
+The `object` field contains your full structured output value. Mastra emits this event for the final structured output object only. Partial structured output chunks are not exposed in the UI stream.
+Read this event with AI SDK UI's custom data handling, such as `onData`, or render it from message data parts.
 ## Usage example
 Next.js App Router example:

package/.docs/reference/ai-sdk/to-ai-sdk-stream.md CHANGED Viewed

@@ -6,6 +6,21 @@ This is useful when building custom streaming endpoints outside Mastra's provide
 `toAISdkStream()` keeps the existing AI SDK v5/default behavior. If your app is typed against AI SDK v6, pass `version: 'v6'` in the options object.
+## Structured output in UI streams
+When the source agent stream includes a final structured output object, `toAISdkStream()` emits it as a custom AI SDK UI data part:
+```json
+{
+  "type": "data-structured-output",
+  "data": {
+    "object": {}
+  }
+}
+```
+The `object` field contains your full structured output value. This maps Mastra's final structured output chunk into the AI SDK UI stream. Partial structured output chunks are not emitted.
 ## Usage example
 Next.js App Router example: