@mastra/mcp-docs-server 1.1.13 → 1.1.14-alpha.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (38)
  1. package/.docs/docs/deployment/studio.md +9 -24
  2. package/.docs/docs/getting-started/studio.md +28 -16
  3. package/.docs/docs/observability/tracing/exporters/braintrust.md +15 -0
  4. package/.docs/docs/server/auth.md +6 -7
  5. package/.docs/docs/server/custom-api-routes.md +56 -0
  6. package/.docs/docs/server/mastra-server.md +2 -2
  7. package/.docs/guides/deployment/cloudflare.md +1 -1
  8. package/.docs/models/gateways/openrouter.md +4 -1
  9. package/.docs/models/gateways/vercel.md +7 -1
  10. package/.docs/models/index.md +1 -1
  11. package/.docs/models/providers/anthropic.md +2 -2
  12. package/.docs/models/providers/baseten.md +12 -13
  13. package/.docs/models/providers/chutes.md +5 -5
  14. package/.docs/models/providers/deepinfra.md +30 -23
  15. package/.docs/models/providers/google.md +1 -1
  16. package/.docs/models/providers/kilo.md +342 -272
  17. package/.docs/models/providers/nano-gpt.md +36 -36
  18. package/.docs/models/providers/nebius.md +3 -2
  19. package/.docs/models/providers/perplexity-agent.md +19 -18
  20. package/.docs/models/providers/synthetic.md +1 -1
  21. package/.docs/models/providers/vultr.md +17 -12
  22. package/.docs/models/providers/zai-coding-plan.md +3 -2
  23. package/.docs/models/providers/zai.md +3 -2
  24. package/.docs/reference/agents/generate.md +2 -0
  25. package/.docs/reference/agents/network.md +2 -0
  26. package/.docs/reference/ai-sdk/chat-route.md +4 -0
  27. package/.docs/reference/configuration.md +4 -2
  28. package/.docs/reference/deployer/cloudflare.md +12 -1
  29. package/.docs/reference/processors/unicode-normalizer.md +1 -1
  30. package/.docs/reference/streaming/agents/stream.md +2 -0
  31. package/.docs/reference/workflows/run-methods/restart.md +2 -0
  32. package/.docs/reference/workflows/run-methods/resume.md +2 -0
  33. package/.docs/reference/workflows/run-methods/start.md +2 -0
  34. package/.docs/reference/workflows/run-methods/timeTravel.md +2 -0
  35. package/CHANGELOG.md +15 -0
  36. package/dist/prompts/migration.d.ts.map +1 -1
  37. package/dist/stdio.js.map +1 -1
  38. package/package.json +8 -8
@@ -1,4 +1,4 @@
- # Deploying studio
+ # Deploying Studio

  [Studio](https://mastra.ai/docs/getting-started/studio) provides an interactive UI for building and testing your agents. It's a React-based Single Page Application (SPA) that runs in the browser and connects to a running [Mastra server](https://mastra.ai/docs/deployment/mastra-server).

@@ -213,35 +213,20 @@ Follow the example below to create a SPA using Vite.
  MASTRA_STUDIO_BASE_PATH=
  MASTRA_TELEMETRY_DISABLED=true
  MASTRA_HIDE_CLOUD_CTA=false
+ MASTRA_TEMPLATES=false
  MASTRA_CLOUD_API_ENDPOINT=
  MASTRA_EXPERIMENTAL_FEATURES=false
  MASTRA_REQUEST_CONTEXT_PRESETS=
  ```

- 7. Run the build script to generate the static files in the `dist` folder:
+ ````text
+ </StepItem>

- **npm**:
+ <StepItem>
+ Run the build script to generate the static files in the `dist` folder:

- ```bash
+ ```bash npm2yarn
  npm run build
- ```
-
- **pnpm**:
-
- ```bash
- pnpm run build
- ```
-
- **Yarn**:
-
- ```bash
- yarn build
- ```
-
- **Bun**:
-
- ```bash
- bun run build
- ```
+ ````

- 8. Point your hosting provider to the `dist` folder and deploy!
+ 7. Point your hosting provider to the `dist` folder and deploy!
@@ -1,16 +1,14 @@
  # Studio

- Studio provides an interactive UI for building and testing your agents, along with a REST API that exposes your Mastra application as a local service. This lets you start building without worrying about integration right away.
+ Studio provides an interactive UI for building, testing, and managing your agents, workflows, and tools. Run it locally during development, or [deploy it](https://mastra.ai/docs/deployment/studio) to production so your team can manage agents, monitor performance, and gain insights through built-in observability.

- As your project evolves, Studio's development environment helps you iterate on your agent quickly. Meanwhile, Observability and Scorer features give you visibility into performance at every stage.
-
- To get started, run Studio locally using the instructions below, or [create a project in Mastra Cloud](https://mastra.ai/docs/mastra-cloud/setup) to collaborate with your team.
+ Add [authentication](https://mastra.ai/docs/server/auth) to protect your deployed Studio with login screens, role-based access control, and permission-based UI rendering so you can control what each team member can see and do. You can also [create a project in Mastra Cloud](https://mastra.ai/docs/mastra-cloud/setup) for a hosted option.

  [YouTube video player](https://www.youtube-nocookie.com/embed/ojGu6Bi4wYk)

  ## Start Studio

- If you created your application with `create mastra`, start the local development server using the `dev` script. You can also run it directly with `mastra dev`.
+ If you created your application with `create mastra`, start the development server using the `dev` script. You can also run it directly with `mastra dev`.

  **npm**:

@@ -36,14 +34,16 @@ yarn dev
  bun run dev
  ```

- Once the server's running, you can:
+ Once the server is running, you can:

- - Open the Studio UI at <http://localhost:4111/> to test your agent interactively.
+ - Open the Studio UI at <http://localhost:4111/> to interact with your agents, workflows, and tools.
  - Visit <http://localhost:4111/swagger-ui> to discover and interact with the underlying REST API.

+ To run Studio in production, see [Deploy Studio](https://mastra.ai/docs/deployment/studio).
+
  ## Studio UI

- The Studio UI provides an interactive development environment for you to test your agents, workflows, and tools, observe exactly what happens under the hood with each interaction, and tweak things as you go.
+ The Studio UI lets you interact with your agents, workflows, and tools, observe exactly what happens under the hood with each interaction, and tweak things as you go.

  ### Agents

@@ -61,39 +61,51 @@ When running a workflow, you can also view detailed traces showing tool calls, r

  Run tools in isolation to observe their behavior. Test them before assigning them to your agent, or isolate them to debug issues should something go wrong.

+ ### Processors
+
+ View the input and output processors attached to each agent. The agent detail panel lists every processor by name and type, so you can verify your guardrails, token limiters, and custom processors are wired up correctly before testing.
+
+ See [Processors](https://mastra.ai/docs/agents/processors) and [Guardrails](https://mastra.ai/docs/agents/guardrails) for configuration details.
+
  ### MCP

  List the MCP servers attached to your Mastra instance and explore their available tools.

- ![MCP Servers Studio](/assets/images/local-dev-mcp-server-playground-8551b0af59838b2ef0bf4756ce94dcf5.jpg)
-
  ### Observability

  When you run an agent or workflow, the Observability tab displays traces that highlight the key AI operations such as model calls, tool executions, and workflow steps. Follow these traces to see how data moves, where time is spent, and what's happening under the hood.

- ![](https://mastra.ai/_next/image?url=%2Ftracingafter.png\&w=1920\&q=75)
-
  Tracing filters out low-level framework details so your traces stay focused and readable.

  ### Scorers

  The Scorers tab displays the results of your agent's scorers as they run. When messages pass through your agent, the defined scorers evaluate each output asynchronously and render their results here. This allows you to understand how your scorers respond to different interactions, compare performance across test cases, and identify areas for improvement.

+ ### Datasets
+
+ Create and manage collections of test cases to evaluate your agents and workflows. Import items from CSV or JSON, define input and ground-truth schemas, and pin to specific versions so you can reproduce experiments exactly. Run experiments with [scorers](https://mastra.ai/docs/evals/overview) to compare quality across prompts, models, or code changes.
+
+ See [Datasets overview](https://mastra.ai/docs/observability/datasets/overview) for the full API and versioning details.
+
  ## REST API

- The local development server exposes a complete set of REST API routes, allowing you to programmatically interact with your agents, workflows, and tools during development. This is particularly helpful if you plan to deploy the Mastra server, since the local development server uses the exact same API routes as the [Mastra Server](https://mastra.ai/docs/server/mastra-server), allowing you to develop and test against it with full parity.
+ Studio is backed by a complete set of REST API routes that let you programmatically interact with your agents, workflows, and tools. These are the same routes exposed by the [Mastra Server](https://mastra.ai/docs/server/mastra-server), so everything you build against locally works identically in production.

  You can explore all available endpoints in the OpenAPI specification at <http://localhost:4111/api/openapi.json>, which details every endpoint and its request and response schemas.

  To explore the API interactively, visit the Swagger UI at <http://localhost:4111/swagger-ui>. Here, you can discover endpoints and test them directly from your browser.

- > **Info:** The OpenAPI and Swagger endpoints are disabled in production by default. To enable them, set [`server.build.openAPIDocs`](https://mastra.ai/reference/configuration) and [`server.build.swaggerUI`](https://mastra.ai/reference/configuration) to `true` respectively.
+ > **Note:** The OpenAPI and Swagger endpoints are disabled in production by default. To enable them, set [`server.build.openAPIDocs`](https://mastra.ai/reference/configuration) and [`server.build.swaggerUI`](https://mastra.ai/reference/configuration) to `true` respectively.

  ## Configuration

- By default, Studio runs at <http://localhost:4111>. You can change the [`host`](https://mastra.ai/reference/configuration), [`port`](https://mastra.ai/reference/configuration), and [`studioBase`](https://mastra.ai/reference/configuration) in the Mastra server configuration. This allows you to customize where and how Studio is hosted.
+ By default, Studio runs at <http://localhost:4111>. You can change the [`host`](https://mastra.ai/reference/configuration), [`port`](https://mastra.ai/reference/configuration), and [`studioBase`](https://mastra.ai/reference/configuration) in the Mastra server configuration.
+
+ For production deployments, see [Deploy Studio](https://mastra.ai/docs/deployment/studio) to learn about hosting Studio alongside your server, as a standalone SPA, or on a CDN.
+
+ Add [authentication](https://mastra.ai/docs/server/auth) to control who can access Studio in production. Studio displays the appropriate login UI, which can be an SSO button, an email/password form, or both. All API routes require authentication. This applies to any request made to your Mastra API, whether from Studio or a direct API call.

- Furthermore, Mastra supports local HTTPS development through the [`--https`](https://mastra.ai/reference/cli/mastra) flag, which automatically creates and manages certificates for your project. When you run `mastra dev --https`, a private key and certificate are generated for localhost (or your configured host). Visit the [HTTPS reference](https://mastra.ai/reference/configuration) to learn more.
+ Mastra also supports HTTPS development through the [`--https`](https://mastra.ai/reference/cli/mastra) flag, which automatically creates and manages certificates for your project. When you run `mastra dev --https`, a private key and certificate are generated for localhost (or your configured host). Visit the [HTTPS reference](https://mastra.ai/reference/configuration) to learn more.

  ## Next steps

@@ -105,6 +105,21 @@ new BraintrustExporter({
  })
  ```

+ ## Querying Braintrust with returned `spanId`
+
+ For Braintrust, use `spanId` as the root span identifier when searching for traces because Braintrust root-span queries are typically faster than trace-id queries.
+
+ ```typescript
+ const result = await agent.stream('Summarize this ticket')
+
+ console.log('Mastra trace ID:', result.traceId)
+ console.log('Braintrust root span ID:', result.spanId)
+
+ // Use result.spanId in your Braintrust lookup/query path
+ ```
+
+ The same applies to `agent.generate()` and workflow run results (`run.start()`, `run.stream()` final state, `run.resume()`).
+
  ## Related

  - [Tracing Overview](https://mastra.ai/docs/observability/tracing/overview)
@@ -1,18 +1,17 @@
  # Auth overview

- Mastra lets you choose how you handle authentication, so you can secure access to your application's endpoints using the identity system that fits your stack.
+ Mastra lets you choose how you handle authentication, so you can secure access to your API and [Studio](https://mastra.ai/docs/getting-started/studio) using the identity system that fits your stack.

  You can start with basic shared secret JWT authentication and switch to providers like Supabase, Firebase Auth, Auth0, Clerk, or WorkOS when you need more advanced identity features.

- ## Default behavior
+ ## What auth secures

- Authentication is optional in Mastra. When you configure authentication:
+ Configuring authentication locks down two things at once:

- - **All built-in API routes** (`/api/agents/*`, `/api/workflows/*`, etc.) require authentication by default
- - **Custom API routes** also require authentication by default
- - **Public access** can be enabled on custom routes using `requiresAuth: false`
+ - **Studio UI**: Studio displays a login screen (SSO, email/password, or both) and enforces role-based access control to determine what each team member can see and do.
+ - **API routes**: All built-in routes (`/api/agents/*`, `/api/workflows/*`, etc.) and custom routes require authentication, whether requests come from Studio or direct API calls.

- If no authentication is configured, all routes are publicly accessible.
+ Authentication is optional. If no auth is configured, all routes and Studio are publicly accessible. Public access can be enabled on individual custom routes using `requiresAuth: false`.

  See [Custom API Routes](https://mastra.ai/docs/server/custom-api-routes) for controlling authentication on custom endpoints.

@@ -260,6 +260,62 @@ registerApiRoute('/user-profile', {

  For more information about authentication providers, see the [Auth documentation](https://mastra.ai/docs/server/auth).

+ ## Continue generation after client disconnect
+
+ Built-in streaming helpers such as [`chatRoute()`](https://mastra.ai/reference/ai-sdk/chat-route) forward the incoming request's `AbortSignal` to `agent.stream()`. That is the right default when a browser disconnect should cancel the model call.
+
+ If you want the server to keep generating and persist the final response even after the client disconnects, build a custom route around the underlying `MastraModelOutput`. Start the agent stream without forwarding `c.req.raw.signal`, then call `consumeStream()` in the background so generation continues server-side.
+
+ ```typescript
+ import {
+   createUIMessageStream,
+   createUIMessageStreamResponse,
+   InferUIMessageChunk,
+   UIMessage,
+ } from 'ai'
+ import { toAISdkStream } from '@mastra/ai-sdk'
+ import { Mastra } from '@mastra/core'
+ import { registerApiRoute } from '@mastra/core/server'
+
+ export const mastra = new Mastra({
+   server: {
+     apiRoutes: [
+       registerApiRoute('/chat/persist/:agentId', {
+         method: 'POST',
+         handler: async c => {
+           const { messages, memory } = await c.req.json()
+           const mastra = c.get('mastra')
+           const agent = mastra.getAgent(c.req.param('agentId'))
+
+           const stream = await agent.stream(messages, {
+             memory,
+             // Do not pass c.req.raw.signal if this route should keep running
+             // after the client disconnects.
+           })
+
+           void stream.consumeStream().catch(error => {
+             mastra.getLogger()?.error('Background stream consumption failed', { error })
+           })
+
+           const uiStream = createUIMessageStream({
+             originalMessages: messages,
+             execute: async ({ writer }) => {
+               for await (const part of toAISdkStream(stream, { from: 'agent' })) {
+                 writer.write(part as InferUIMessageChunk<UIMessage>)
+               }
+             },
+           })
+
+           return createUIMessageStreamResponse({ stream: uiStream })
+         },
+       }),
+     ],
+   },
+ })
+ ```
+
+ > **Note:** Use this pattern only when you intentionally want work to continue after the HTTP client is gone. If you want disconnects to cancel generation, keep using `chatRoute()` or forward the request `AbortSignal` yourself.
+
  ## Related

  - [registerApiRoute() Reference](https://mastra.ai/reference/server/register-api-route) - Full API reference
@@ -25,8 +25,8 @@ import { Mastra } from '@mastra/core'

  export const mastra = new Mastra({
    server: {
-     port: 3000, // Defaults to 4111
-     host: '0.0.0.0', // Defaults to 'localhost'
+     port: 3000, // Defaults to PORT env var or 4111
+     host: '0.0.0.0', // Defaults to MASTRA_HOST env var or 'localhost'
    },
  })
  ```
@@ -86,7 +86,7 @@ After setting up your project, push it to your remote Git provider of choice (e.

  1. Connect your repository to Cloudflare. On the "Workers & Pages" dashboard, select **Create application** and choose your Git provider in the next step. Continue with the setup process and select the repository you want to deploy.

- > **Note:** Remember to set your environment variables needed to run your application (e.g. your [model provider](https://mastra.ai/models/providers) API key).
+ > **Note:** Remember to set your environment variables needed to run your application (e.g. your [model provider](https://mastra.ai/models/providers) API key). You can upload secrets from your `.env` file using `npx wrangler secret bulk .env`. See [Secrets](https://mastra.ai/reference/deployer/cloudflare) for details.

  2. Once you're ready, click the **Deploy** button and wait for the first deployment to complete.

@@ -1,6 +1,6 @@
  # ![OpenRouter logo](https://models.dev/logos/openrouter.svg)OpenRouter

- OpenRouter aggregates models from multiple providers with enhanced features like rate limiting and failover. Access 195 models through Mastra's model router.
+ OpenRouter aggregates models from multiple providers with enhanced features like rate limiting and failover. Access 198 models through Mastra's model router.

  Learn more in the [OpenRouter documentation](https://openrouter.ai/models).

@@ -79,6 +79,7 @@ ANTHROPIC_API_KEY=ant-...
  | `google/gemini-2.5-pro-preview-06-05` |
  | `google/gemini-3-flash-preview` |
  | `google/gemini-3-pro-preview` |
+ | `google/gemini-3.1-flash-lite-preview` |
  | `google/gemini-3.1-pro-preview` |
  | `google/gemini-3.1-pro-preview-customtools` |
  | `google/gemma-2-9b-it` |
@@ -218,6 +219,8 @@ ANTHROPIC_API_KEY=ant-...
  | `x-ai/grok-4` |
  | `x-ai/grok-4-fast` |
  | `x-ai/grok-4.1-fast` |
+ | `x-ai/grok-4.20-beta` |
+ | `x-ai/grok-4.20-multi-agent-beta` |
  | `x-ai/grok-code-fast-1` |
  | `xiaomi/mimo-v2-flash` |
  | `z-ai/glm-4.5` |
@@ -1,6 +1,6 @@
  # ![Vercel logo](https://models.dev/logos/vercel.svg)Vercel

- Vercel aggregates models from multiple providers with enhanced features like rate limiting and failover. Access 208 models through Mastra's model router.
+ Vercel aggregates models from multiple providers with enhanced features like rate limiting and failover. Access 214 models through Mastra's model router.

  Learn more in the [Vercel documentation](https://ai-sdk.dev/providers/ai-sdk-providers).

@@ -118,6 +118,7 @@ ANTHROPIC_API_KEY=ant-...
  | `kwaipilot/kat-coder-pro-v1` |
  | `meituan/longcat-flash-chat` |
  | `meituan/longcat-flash-thinking` |
+ | `meituan/longcat-flash-thinking-2601` |
  | `meta/llama-3.1-70b` |
  | `meta/llama-3.1-8b` |
  | `meta/llama-3.2-11b` |
@@ -131,6 +132,7 @@ ANTHROPIC_API_KEY=ant-...
  | `minimax/minimax-m2.1` |
  | `minimax/minimax-m2.1-lightning` |
  | `minimax/minimax-m2.5` |
+ | `minimax/minimax-m2.5-highspeed` |
  | `mistral/codestral` |
  | `mistral/codestral-embed` |
  | `mistral/devstral-2` |
@@ -229,6 +231,9 @@ ANTHROPIC_API_KEY=ant-...
  | `xai/grok-4-fast-reasoning` |
  | `xai/grok-4.1-fast-non-reasoning` |
  | `xai/grok-4.1-fast-reasoning` |
+ | `xai/grok-4.20-multi-agent-beta` |
+ | `xai/grok-4.20-non-reasoning-beta` |
+ | `xai/grok-4.20-reasoning-beta` |
  | `xai/grok-code-fast-1` |
  | `xai/grok-imagine-image` |
  | `xai/grok-imagine-image-pro` |
@@ -240,5 +245,6 @@ ANTHROPIC_API_KEY=ant-...
  | `zai/glm-4.6v` |
  | `zai/glm-4.6v-flash` |
  | `zai/glm-4.7` |
+ | `zai/glm-4.7-flash` |
  | `zai/glm-4.7-flashx` |
  | `zai/glm-5` |
@@ -1,6 +1,6 @@
  # Model Providers

- Mastra provides a unified interface for working with LLMs across multiple providers, giving you access to 3259 models from 92 providers through a single API.
+ Mastra provides a unified interface for working with LLMs across multiple providers, giving you access to 3353 models from 92 providers through a single API.

  ## Features

@@ -49,12 +49,12 @@ for await (const chunk of stream) {
  | `anthropic/claude-opus-4-20250514` | 200K | | | | | | $15 | $75 |
  | `anthropic/claude-opus-4-5` | 200K | | | | | | $5 | $25 |
  | `anthropic/claude-opus-4-5-20251101` | 200K | | | | | | $5 | $25 |
- | `anthropic/claude-opus-4-6` | 200K | | | | | | $5 | $25 |
+ | `anthropic/claude-opus-4-6` | 1.0M | | | | | | $5 | $25 |
  | `anthropic/claude-sonnet-4-0` | 200K | | | | | | $3 | $15 |
  | `anthropic/claude-sonnet-4-20250514` | 200K | | | | | | $3 | $15 |
  | `anthropic/claude-sonnet-4-5` | 200K | | | | | | $3 | $15 |
  | `anthropic/claude-sonnet-4-5-20250929` | 200K | | | | | | $3 | $15 |
- | `anthropic/claude-sonnet-4-6` | 200K | | | | | | $3 | $15 |
+ | `anthropic/claude-sonnet-4-6` | 1.0M | | | | | | $3 | $15 |

  ## Advanced configuration

@@ -1,6 +1,6 @@
  # ![Baseten logo](https://models.dev/logos/baseten.svg)Baseten

- Access 10 Baseten models through Mastra's model router. Authentication is handled automatically using the `BASETEN_API_KEY` environment variable.
+ Access 9 Baseten models through Mastra's model router. Authentication is handled automatically using the `BASETEN_API_KEY` environment variable.

  Learn more in the [Baseten documentation](https://docs.baseten.co/development/model-apis/overview).

@@ -32,18 +32,17 @@ for await (const chunk of stream) {

  ## Models

- | Model | Context | Tools | Reasoning | Image | Audio | Video | Input $/1M | Output $/1M |
- | --------------------------------------------- | ------- | ----- | --------- | ----- | ----- | ----- | ---------- | ----------- |
- | `baseten/deepseek-ai/DeepSeek-V3.2` | 164K | | | | | | $0.30 | $0.45 |
- | `baseten/MiniMaxAI/MiniMax-M2.5` | 204K | | | | | | $0.30 | $1 |
- | `baseten/moonshotai/Kimi-K2-Instruct-0905` | 262K | | | | | | $0.60 | $3 |
- | `baseten/moonshotai/Kimi-K2-Thinking` | 262K | | | | | | $0.60 | $3 |
- | `baseten/moonshotai/Kimi-K2.5` | 262K | | | | | | $0.60 | $3 |
- | `baseten/nvidia/Nemotron-3-Super` | 262K | | | | | | $0.30 | $0.75 |
- | `baseten/Qwen/Qwen3-Coder-480B-A35B-Instruct` | 262K | | | | | | $0.38 | $2 |
- | `baseten/zai-org/GLM-4.6` | 200K | | | | | | $0.60 | $2 |
- | `baseten/zai-org/GLM-4.7` | 205K | | | | | | $0.60 | $2 |
- | `baseten/zai-org/GLM-5` | 203K | | | | | | $0.95 | $3 |
+ | Model | Context | Tools | Reasoning | Image | Audio | Video | Input $/1M | Output $/1M |
+ | -------------------------------------- | ------- | ----- | --------- | ----- | ----- | ----- | ---------- | ----------- |
+ | `baseten/deepseek-ai/DeepSeek-V3-0324` | 164K | | | | | | $0.77 | $0.77 |
+ | `baseten/deepseek-ai/DeepSeek-V3.1` | 164K | | | | | | $0.50 | $2 |
+ | `baseten/MiniMaxAI/MiniMax-M2.5` | 204K | | | | | | $0.30 | $1 |
+ | `baseten/moonshotai/Kimi-K2.5` | 262K | | | | | | $0.60 | $3 |
+ | `baseten/nvidia/Nemotron-3-Super` | 262K | | | | | | $0.30 | $0.75 |
+ | `baseten/openai/gpt-oss-120b` | 128K | | | | | | $0.10 | $0.50 |
+ | `baseten/zai-org/GLM-4.6` | 200K | | | | | | $0.60 | $2 |
+ | `baseten/zai-org/GLM-4.7` | 205K | | | | | | $0.60 | $2 |
+ | `baseten/zai-org/GLM-5` | 203K | | | | | | $0.95 | $3 |

  ## Advanced configuration

@@ -44,9 +44,9 @@ for await (const chunk of stream) {
  | `chutes/deepseek-ai/DeepSeek-V3.1-TEE` | 164K | | | | | | $0.20 | $0.80 |
  | `chutes/deepseek-ai/DeepSeek-V3.1-Terminus-TEE` | 164K | | | | | | $0.23 | $0.90 |
  | `chutes/deepseek-ai/DeepSeek-V3.2-Speciale-TEE` | 164K | | | | | | $0.27 | $0.41 |
- | `chutes/deepseek-ai/DeepSeek-V3.2-TEE` | 164K | | | | | | $0.25 | $0.38 |
+ | `chutes/deepseek-ai/DeepSeek-V3.2-TEE` | 131K | | | | | | $0.28 | $0.42 |
  | `chutes/MiniMaxAI/MiniMax-M2.1-TEE` | 197K | | | | | | $0.27 | $1 |
- | `chutes/MiniMaxAI/MiniMax-M2.5-TEE` | 197K | | | | | | $0.15 | $0.60 |
+ | `chutes/MiniMaxAI/MiniMax-M2.5-TEE` | 197K | | | | | | $0.30 | $1 |
  | `chutes/miromind-ai/MiroThinker-v1.5-235B` | 262K | | | | | | $0.30 | $1 |
  | `chutes/mistralai/Devstral-2-123B-Instruct-2512-TEE` | 262K | | | | | | $0.05 | $0.22 |
  | `chutes/moonshotai/Kimi-K2-Instruct-0905` | 262K | | | | | | $0.39 | $2 |
@@ -76,7 +76,7 @@ for await (const chunk of stream) {
  | `chutes/Qwen/Qwen3-Coder-Next` | 262K | | | | | | $0.07 | $0.30 |
  | `chutes/Qwen/Qwen3-Next-80B-A3B-Instruct` | 262K | | | | | | $0.10 | $0.80 |
  | `chutes/Qwen/Qwen3-VL-235B-A22B-Instruct` | 262K | | | | | | $0.30 | $1 |
- | `chutes/Qwen/Qwen3.5-397B-A17B-TEE` | 262K | | | | | | $0.30 | $1 |
+ | `chutes/Qwen/Qwen3.5-397B-A17B-TEE` | 262K | | | | | | $0.39 | $2 |
  | `chutes/Qwen/Qwen3Guard-Gen-0.6B` | 33K | | | | | | $0.01 | $0.01 |
  | `chutes/rednote-hilab/dots.ocr` | 131K | | | | | | $0.01 | $0.01 |
  | `chutes/tngtech/DeepSeek-R1T-Chimera` | 164K | | | | | | $0.30 | $1 |
@@ -95,12 +95,12 @@ for await (const chunk of stream) {
  | `chutes/zai-org/GLM-4.5-FP8` | 131K | | | | | | $0.30 | $1 |
  | `chutes/zai-org/GLM-4.5-TEE` | 131K | | | | | | $0.35 | $2 |
  | `chutes/zai-org/GLM-4.6-FP8` | 203K | | | | | | $0.30 | $1 |
- | `chutes/zai-org/GLM-4.6-TEE` | 203K | | | | | | $0.35 | $2 |
+ | `chutes/zai-org/GLM-4.6-TEE` | 203K | | | | | | $0.40 | $2 |
  | `chutes/zai-org/GLM-4.6V` | 131K | | | | | | $0.30 | $0.90 |
  | `chutes/zai-org/GLM-4.7-Flash` | 203K | | | | | | $0.06 | $0.35 |
  | `chutes/zai-org/GLM-4.7-FP8` | 203K | | | | | | $0.30 | $1 |
  | `chutes/zai-org/GLM-4.7-TEE` | 203K | | | | | | $0.40 | $2 |
- | `chutes/zai-org/GLM-5-TEE` | 203K | | | | | | $0.75 | $3 |
+ | `chutes/zai-org/GLM-5-TEE` | 203K | | | | | | $0.95 | $3 |
  | `chutes/zai-org/GLM-5-Turbo` | 203K | | | | | | $0.49 | $2 |

  ## Advanced configuration
@@ -1,6 +1,6 @@
  # ![Deep Infra logo](https://models.dev/logos/deepinfra.svg)Deep Infra

- Access 20 Deep Infra models through Mastra's model router. Authentication is handled automatically using the `DEEPINFRA_API_KEY` environment variable.
+ Access 27 Deep Infra models through Mastra's model router. Authentication is handled automatically using the `DEEPINFRA_API_KEY` environment variable.

  Learn more in the [Deep Infra documentation](https://deepinfra.com/models).

@@ -30,28 +30,35 @@ for await (const chunk of stream) {

  ## Models

- | Model | Context | Tools | Reasoning | Image | Audio | Video | Input $/1M | Output $/1M |
- | ----------------------------------------------------- | ------- | ----- | --------- | ----- | ----- | ----- | ---------- | ----------- |
- | `deepinfra/anthropic/claude-3-7-sonnet-latest` | 200K | | | | | | $3 | $17 |
- | `deepinfra/anthropic/claude-4-opus` | 200K | | | | | | $17 | $83 |
- | `deepinfra/deepseek-ai/DeepSeek-R1-0528` | 164K | | | | | | $0.50 | $2 |
- | `deepinfra/deepseek-ai/DeepSeek-V3.2` | 164K | | | | | | $0.26 | $0.38 |
- | `deepinfra/MiniMaxAI/MiniMax-M2` | 262K | | | | | | $0.25 | $1 |
- | `deepinfra/MiniMaxAI/MiniMax-M2.1` | 197K | | | | | | $0.28 | $1 |
- | `deepinfra/MiniMaxAI/MiniMax-M2.5` | 205K | | | | | | $0.27 | $0.95 |
- | `deepinfra/moonshotai/Kimi-K2-Instruct` | 131K | | | | | | $0.50 | $2 |
- | `deepinfra/moonshotai/Kimi-K2-Instruct-0905` | 262K | | | | | | $0.40 | $2 |
- | `deepinfra/moonshotai/Kimi-K2-Thinking` | 131K | | | | | | $0.47 | $2 |
- | `deepinfra/moonshotai/Kimi-K2.5` | 262K | | | | | | $0.50 | $3 |
- | `deepinfra/openai/gpt-oss-120b` | 131K | | | | | | $0.05 | $0.24 |
- | `deepinfra/openai/gpt-oss-20b` | 131K | | | | | | $0.03 | $0.14 |
- | `deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct` | 262K | | | | | | $0.40 | $2 |
- | `deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo` | 262K | | | | | | $0.30 | $1 |
- | `deepinfra/zai-org/GLM-4.6` | 205K | | | | | | $0.43 | $2 |
- | `deepinfra/zai-org/GLM-4.6V` | 205K | | | | | | $0.30 | $0.90 |
- | `deepinfra/zai-org/GLM-4.7` | 203K | | | | | | $0.43 | $2 |
- | `deepinfra/zai-org/GLM-4.7-Flash` | 203K | | | | | | $0.06 | $0.40 |
- | `deepinfra/zai-org/GLM-5` | 203K | | | | | | $0.80 | $3 |
+ | Model | Context | Tools | Reasoning | Image | Audio | Video | Input $/1M | Output $/1M |
+ | ------------------------------------------------------------- | ------- | ----- | --------- | ----- | ----- | ----- | ---------- | ----------- |
+ | `deepinfra/anthropic/claude-3-7-sonnet-latest` | 200K | | | | | | $3 | $17 |
+ | `deepinfra/anthropic/claude-4-opus` | 200K | | | | | | $17 | $83 |
+ | `deepinfra/deepseek-ai/DeepSeek-R1-0528` | 164K | | | | | | $0.50 | $2 |
+ | `deepinfra/deepseek-ai/DeepSeek-V3.2` | 164K | | | | | | $0.26 | $0.38 |
+ | `deepinfra/meta-llama/Llama-3.1-70B-Instruct` | 131K | | | | | | $0.40 | $0.40 |
+ | `deepinfra/meta-llama/Llama-3.1-70B-Instruct-Turbo` | 131K | | | | | | $0.40 | $0.40 |
+ | `deepinfra/meta-llama/Llama-3.1-8B-Instruct` | 131K | | | | | | $0.02 | $0.05 |
+ | `deepinfra/meta-llama/Llama-3.1-8B-Instruct-Turbo` | 131K | | | | | | $0.02 | $0.03 |
+ | `deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo` | 131K | | | | | | $0.10 | $0.32 |
+ | `deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8` | 1.0M | | | | | | $0.15 | $0.60 |
+ | `deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct` | 10.0M | | | | | | $0.08 | $0.30 |
+ | `deepinfra/MiniMaxAI/MiniMax-M2` | 262K | | | | | | $0.25 | $1 |
+ | `deepinfra/MiniMaxAI/MiniMax-M2.1` | 197K | | | | | | $0.28 | $1 |
+ | `deepinfra/MiniMaxAI/MiniMax-M2.5` | 205K | | | | | | $0.27 | $0.95 |
+ | `deepinfra/moonshotai/Kimi-K2-Instruct` | 131K | | | | | | $0.50 | $2 |
+ | `deepinfra/moonshotai/Kimi-K2-Instruct-0905` | 262K | | | | | | $0.40 | $2 |
+ | `deepinfra/moonshotai/Kimi-K2-Thinking` | 131K | | | | | | $0.47 | $2 |
+ | `deepinfra/moonshotai/Kimi-K2.5` | 262K | | | | | | $0.50 | $3 |
+ | `deepinfra/openai/gpt-oss-120b` | 131K | | | | | | $0.05 | $0.24 |
+ | `deepinfra/openai/gpt-oss-20b` | 131K | | | | | | $0.03 | $0.14 |
+ | `deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct` | 262K | | | | | | $0.40 | $2 |
+ | `deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo` | 262K | | | | | | $0.30 | $1 |
+ | `deepinfra/zai-org/GLM-4.6` | 205K | | | | | | $0.43 | $2 |
+ | `deepinfra/zai-org/GLM-4.6V` | 205K | | | | | | $0.30 | $0.90 |
+ | `deepinfra/zai-org/GLM-4.7` | 203K | | | | | | $0.43 | $2 |
+ | `deepinfra/zai-org/GLM-4.7-Flash` | 203K | | | | | | $0.06 | $0.40 |
+ | `deepinfra/zai-org/GLM-5` | 203K | | | | | | $0.80 | $3 |

  ## Advanced configuration

@@ -54,7 +54,7 @@ for await (const chunk of stream) {
  | `google/gemini-3-flash-preview` | 1.0M | | | | | | $0.50 | $3 |
  | `google/gemini-3-pro-preview` | 1.0M | | | | | | $2 | $12 |
  | `google/gemini-3.1-flash-image-preview` | 131K | | | | | | $0.25 | $60 |
- | `google/gemini-3.1-flash-lite-preview` | 1.0M | | | | | | $0.50 | $3 |
+ | `google/gemini-3.1-flash-lite-preview` | 1.0M | | | | | | $0.25 | $2 |
  | `google/gemini-3.1-pro-preview` | 1.0M | | | | | | $2 | $12 |
  | `google/gemini-3.1-pro-preview-customtools` | 1.0M | | | | | | $2 | $12 |
  | `google/gemini-embedding-001` | 2K | | | | | | $0.15 | — |