npm - @mastra/mcp-docs-server - Versions diffs - 1.1.35-alpha.3 → 1.1.35-alpha.6 - Mend

@mastra/mcp-docs-server 1.1.35-alpha.3 → 1.1.35-alpha.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (18) hide show

package/.docs/docs/agents/processors.md +26 -2
package/.docs/docs/memory/observational-memory.md +2 -1
package/.docs/docs/memory/overview.md +2 -1
package/.docs/models/index.md +1 -1
package/.docs/models/providers/kiro.md +110 -0
package/.docs/models/providers/llmgateway.md +1 -1
package/.docs/models/providers/opencode-go.md +2 -4
package/.docs/models/providers/qiniu-ai.md +2 -2
package/.docs/models/providers/xiaomi.md +2 -2
package/.docs/models/providers/zenmux.md +1 -1
package/.docs/models/providers.md +1 -0
package/.docs/reference/harness/harness-class.md +2 -0
package/.docs/reference/index.md +1 -0
package/.docs/reference/processors/processor-interface.md +74 -12
package/.docs/reference/processors/provider-history-compat.md +132 -0
package/.docs/reference/tools/mcp-client.md +47 -0
package/CHANGELOG.md +14 -0
package/package.json +6 -6

package/.docs/docs/agents/processors.md CHANGED Viewed

@@ -211,6 +211,22 @@ The method receives the current `stepNumber`, `model`, `tools`, `toolChoice`, `m
 See the [`Processor` reference](https://mastra.ai/reference/processors/processor-interface) for all available arguments and return types.
+### Rewrite the LLM request before the provider call
+Use `processLLMRequest()` when you need to rewrite the final prompt that Mastra sends to the model. This hook runs after Mastra converts the `MessageList` into the provider-facing prompt format (`LanguageModelV2Prompt`) and immediately before the provider call.
+Use the message-based hooks for conversation changes:
+- `processInput()`: Change the conversation once before the agentic loop starts.
+- `processInputStep()`: Change messages or step configuration before each LLM call.
+- `processLLMRequest()`: Change only the outbound prompt for the current provider call.
+Changes returned from `processLLMRequest()` are transient. They don't persist back to `MessageList`, memory, UI history, or future provider calls. This makes the hook a good fit for provider compatibility rewrites, role/content normalization, or other model-specific prompt changes that shouldn't alter stored conversation history.
+The method receives `prompt`, `model`, `stepNumber`, `steps`, `state`, and the shared processor context. Calling `abort()` from `processLLMRequest()` emits the normal tripwire response and stops the call.
+See the [`Processor` reference](https://mastra.ai/reference/processors/processor-interface) for all available arguments and return types.
 ### Use the `prepareStep()` callback
 The `prepareStep()` callback on `generate()` or `stream()` is a shorthand for `processInputStep()`. Internally, Mastra wraps it in a processor that calls your function at each step. It accepts the same arguments and return type as `processInputStep()`, but doesn't require creating a class:
@@ -317,7 +333,7 @@ For more on retry behavior, see [Retry mechanism](#retry-mechanism) in Advanced
 ### Persist data across chunks and steps
-Output methods receive a `state` object that persists for the lifetime of one request. State is keyed by the processor's `id`, so each processor sees only its own data, and it is shared between `processOutputStream`, `processOutputStep`, and `processOutputResult`. A new state object is created for every new `agent.generate()` or `agent.stream()` call.
+Output methods receive a `state` object that persists for the lifetime of one request. State is keyed by the processor's `id`, so each processor sees only its own data, and it's shared between `processOutputStream`, `processOutputStep`, and `processOutputResult`. A new state object is created for every new `agent.generate()` or `agent.stream()` call.
 ```typescript
 import type { Processor } from '@mastra/core/processors'
@@ -383,6 +399,14 @@ Enables dynamic tool discovery for agents with large tool libraries. Instead of
 See the [`ToolSearchProcessor` reference](https://mastra.ai/reference/processors/tool-search-processor) for configuration options and usage examples.
+### `ProviderHistoryCompat`
+Handles provider-specific history incompatibilities when agents reuse messages across model providers. It can rewrite the outbound LLM request before the provider call, or recover from known provider API errors and retry.
+Add `ProviderHistoryCompat` explicitly when you need provider history compatibility rules, reactive API error recovery, custom compatibility rules, or predictable processor ordering.
+See the [`ProviderHistoryCompat` reference](https://mastra.ai/reference/processors/provider-history-compat) for setup, built-in rules, and custom rule options.
 ## Advanced patterns
 ### Ensure a final response with `maxSteps`
@@ -494,7 +518,7 @@ for await (const chunk of stream.fullStream) {
 Custom chunk types must use the `data-` prefix (e.g., `data-moderation-update`, `data-status`).
-By default, `processOutputStream()` skips `data-*` chunks so it does not accidentally operate on tool telemetry or other processors' output. To inspect, modify, or block these chunks in a processor, set `processDataParts = true` on that processor:
+By default, `processOutputStream()` skips `data-*` chunks so it doesn't accidentally operate on tool telemetry or other processors' output. To inspect, modify, or block these chunks in a processor, set `processDataParts = true` on that processor:
 ```typescript
 class ModerationCollector implements Processor {

package/.docs/docs/memory/observational-memory.md CHANGED Viewed

@@ -458,4 +458,5 @@ In practical terms, OM replaces both working memory and message history, and has
 - [Observational Memory Reference](https://mastra.ai/reference/memory/observational-memory)
 - [Memory Overview](https://mastra.ai/docs/memory/overview)
 - [Message History](https://mastra.ai/docs/memory/message-history)
-- [Memory Processors](https://mastra.ai/docs/memory/memory-processors)
+- [Memory Processors](https://mastra.ai/docs/memory/memory-processors)
+- [Mastra Code](https://code.mastra.ai/): A coding agent using Observational Memory

package/.docs/docs/memory/overview.md CHANGED Viewed

@@ -237,4 +237,5 @@ export const memoryAgent = new Agent({
 - [`Memory` reference](https://mastra.ai/reference/memory/memory-class)
 - [Tracing](https://mastra.ai/docs/observability/tracing/overview)
-- [Request Context](https://mastra.ai/docs/server/request-context)
+- [Request Context](https://mastra.ai/docs/server/request-context)
+- [Mastra Code](https://code.mastra.ai/): A coding agent using Mastra's memory system

package/.docs/models/index.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # Model Providers
-Mastra provides a unified interface for working with LLMs across multiple providers, giving you access to 3879 models from 107 providers through a single API.
+Mastra provides a unified interface for working with LLMs across multiple providers, giving you access to 3889 models from 108 providers through a single API.
 ## Features

package/.docs/models/providers/kiro.md ADDED Viewed

@@ -0,0 +1,110 @@
+# ![Kiro logo](https://models.dev/logos/kiro.svg)Kiro
+Access 12 Kiro models through Mastra's model router. Authentication is handled automatically using the `KIRO_API_KEY` environment variable.
+Learn more in the [Kiro documentation](https://kiro.dev).
+```bash
+KIRO_API_KEY=your-api-key
+```
+```typescript
+import { Agent } from "@mastra/core/agent";
+const agent = new Agent({
+  id: "my-agent",
+  name: "My Agent",
+  instructions: "You are a helpful assistant",
+  model: "kiro/auto"
+});
+// Generate a response
+const response = await agent.generate("Hello!");
+// Stream a response
+const stream = await agent.stream("Tell me a story");
+for await (const chunk of stream) {
+  console.log(chunk);
+}
+```
+> **Info:** Mastra uses the OpenAI-compatible `/chat/completions` endpoint. Some provider-specific features may not be available. Check the [Kiro documentation](https://kiro.dev) for details.
+## Models
+| Model                    | Context | Tools | Reasoning | Image | Audio | Video | Input $/1M | Output $/1M |
+| ------------------------ | ------- | ----- | --------- | ----- | ----- | ----- | ---------- | ----------- |
+| `kiro/auto`              | 1.0M    |       |           |       |       |       | —          | —           |
+| `kiro/claude-haiku-4.5`  | 200K    |       |           |       |       |       | —          | —           |
+| `kiro/claude-opus-4.5`   | 200K    |       |           |       |       |       | —          | —           |
+| `kiro/claude-opus-4.6`   | 1.0M    |       |           |       |       |       | —          | —           |
+| `kiro/claude-opus-4.7`   | 1.0M    |       |           |       |       |       | —          | —           |
+| `kiro/claude-sonnet-4`   | 200K    |       |           |       |       |       | —          | —           |
+| `kiro/claude-sonnet-4.5` | 200K    |       |           |       |       |       | —          | —           |
+| `kiro/claude-sonnet-4.6` | 1.0M    |       |           |       |       |       | —          | —           |
+| `kiro/deepseek-3.2`      | 164K    |       |           |       |       |       | —          | —           |
+| `kiro/minimax-m2.1`      | 196K    |       |           |       |       |       | —          | —           |
+| `kiro/minimax-m2.5`      | 196K    |       |           |       |       |       | —          | —           |
+| `kiro/qwen3-coder-next`  | 256K    |       |           |       |       |       | —          | —           |
+## Advanced configuration
+### Custom headers
+```typescript
+const agent = new Agent({
+  id: "custom-agent",
+  name: "custom-agent",
+  model: {
+    url: "https://q.us-east-1.amazonaws.com",
+    id: "kiro/auto",
+    apiKey: process.env.KIRO_API_KEY,
+    headers: {
+      "X-Custom-Header": "value"
+    }
+  }
+});
+```
+### Dynamic model selection
+```typescript
+const agent = new Agent({
+  id: "dynamic-agent",
+  name: "Dynamic Agent",
+  model: ({ requestContext }) => {
+    const useAdvanced = requestContext.task === "complex";
+    return useAdvanced
+      ? "kiro/qwen3-coder-next"
+      : "kiro/auto";
+  }
+});
+```
+## Direct provider installation
+This provider can also be installed directly as a standalone package, which can be used instead of the Mastra model router string. View the [package documentation](https://www.npmjs.com/package/kiro-acp-ai-provider) for more details.
+**npm**:
+```bash
+npm install kiro-acp-ai-provider
+```
+**pnpm**:
+```bash
+pnpm add kiro-acp-ai-provider
+```
+**Yarn**:
+```bash
+yarn add kiro-acp-ai-provider
+```
+**Bun**:
+```bash
+bun add kiro-acp-ai-provider
+```

package/.docs/models/providers/llmgateway.md CHANGED Viewed

@@ -153,7 +153,7 @@ for await (const chunk of stream) {
 | `llmgateway/llama-4-maverick-17b-instruct`         | 8K      |       |           |       |       |       | $0.24      | $0.97       |
 | `llmgateway/llama-4-scout`                         | 33K     |       |           |       |       |       | $0.18      | $0.59       |
 | `llmgateway/llama-4-scout-17b-instruct`            | 8K      |       |           |       |       |       | $0.17      | $0.66       |
-| `llmgateway/mimo-v2-flash`                         | 256K    |       |           |       |       |       | $0.10      | $0.30       |
+| `llmgateway/mimo-v2-flash`                         | 262K    |       |           |       |       |       | $0.10      | $0.30       |
 | `llmgateway/minimax-m2`                            | 197K    |       |           |       |       |       | $0.30      | $1          |
 | `llmgateway/minimax-m2.1`                          | 205K    |       |           |       |       |       | $0.30      | $1          |
 | `llmgateway/minimax-m2.1-lightning`                | 197K    |       |           |       |       |       | $0.12      | $0.48       |

package/.docs/models/providers/opencode-go.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # ![OpenCode Go logo](https://models.dev/logos/opencode-go.svg)OpenCode Go
-Access 14 OpenCode Go models through Mastra's model router. Authentication is handled automatically using the `OPENCODE_API_KEY` environment variable.
+Access 12 OpenCode Go models through Mastra's model router. Authentication is handled automatically using the `OPENCODE_API_KEY` environment variable.
 Learn more in the [OpenCode Go documentation](https://opencode.ai/docs/zen).
@@ -39,9 +39,7 @@ for await (const chunk of stream) {
 | `opencode-go/glm-5`             | 203K    |       |           |       |       |       | $1         | $3          |
 | `opencode-go/glm-5.1`           | 203K    |       |           |       |       |       | $1         | $4          |
 | `opencode-go/kimi-k2.5`         | 262K    |       |           |       |       |       | $0.60      | $3          |
-| `opencode-go/kimi-k2.6`         | 262K    |       |           |       |       |       | $0.32      | $1          |
-| `opencode-go/mimo-v2-omni`      | 262K    |       |           |       |       |       | $0.40      | $2          |
-| `opencode-go/mimo-v2-pro`       | 1.0M    |       |           |       |       |       | $1         | $3          |
+| `opencode-go/kimi-k2.6`         | 262K    |       |           |       |       |       | $0.95      | $4          |
 | `opencode-go/mimo-v2.5`         | 1.0M    |       |           |       |       |       | $0.40      | $2          |
 | `opencode-go/mimo-v2.5-pro`     | 1.0M    |       |           |       |       |       | $1         | $3          |
 | `opencode-go/minimax-m2.5`      | 205K    |       |           |       |       |       | $0.30      | $1          |

package/.docs/models/providers/qiniu-ai.md CHANGED Viewed

@@ -81,7 +81,7 @@ for await (const chunk of stream) {
 | `qiniu-ai/kling-v2-6`                               | 100.0M  |       |           |       |       |       | —          | —           |
 | `qiniu-ai/meituan/longcat-flash-chat`               | 131K    |       |           |       |       |       | —          | —           |
 | `qiniu-ai/meituan/longcat-flash-lite`               | 256K    |       |           |       |       |       | —          | —           |
-| `qiniu-ai/mimo-v2-flash`                            | 256K    |       |           |       |       |       | —          | —           |
+| `qiniu-ai/mimo-v2-flash`                            | 256K    |       |           |       |       |       | $0.10      | $0.30       |
 | `qiniu-ai/MiniMax-M1`                               | 1.0M    |       |           |       |       |       | —          | —           |
 | `qiniu-ai/minimax/minimax-m2`                       | 200K    |       |           |       |       |       | —          | —           |
 | `qiniu-ai/minimax/minimax-m2.1`                     | 205K    |       |           |       |       |       | —          | —           |
@@ -120,7 +120,7 @@ for await (const chunk of stream) {
 | `qiniu-ai/x-ai/grok-4.1-fast-non-reasoning`         | 2.0M    |       |           |       |       |       | —          | —           |
 | `qiniu-ai/x-ai/grok-4.1-fast-reasoning`             | 20.0M   |       |           |       |       |       | —          | —           |
 | `qiniu-ai/x-ai/grok-code-fast-1`                    | 256K    |       |           |       |       |       | —          | —           |
-| `qiniu-ai/xiaomi/mimo-v2-flash`                     | 256K    |       |           |       |       |       | —          | —           |
+| `qiniu-ai/xiaomi/mimo-v2-flash`                     | 256K    |       |           |       |       |       | $0.10      | $0.30       |
 | `qiniu-ai/z-ai/autoglm-phone-9b`                    | 13K     |       |           |       |       |       | —          | —           |
 | `qiniu-ai/z-ai/glm-4.6`                             | 200K    |       |           |       |       |       | —          | —           |
 | `qiniu-ai/z-ai/glm-4.7`                             | 200K    |       |           |       |       |       | —          | —           |

package/.docs/models/providers/xiaomi.md CHANGED Viewed

@@ -34,8 +34,8 @@ for await (const chunk of stream) {
 | Model                  | Context | Tools | Reasoning | Image | Audio | Video | Input $/1M | Output $/1M |
 | ---------------------- | ------- | ----- | --------- | ----- | ----- | ----- | ---------- | ----------- |
-| `xiaomi/mimo-v2-flash` | 256K    |       |           |       |       |       | $0.10      | $0.30       |
-| `xiaomi/mimo-v2-omni`  | 256K    |       |           |       |       |       | $0.40      | $2          |
+| `xiaomi/mimo-v2-flash` | 262K    |       |           |       |       |       | $0.10      | $0.30       |
+| `xiaomi/mimo-v2-omni`  | 262K    |       |           |       |       |       | $0.40      | $2          |
 | `xiaomi/mimo-v2-pro`   | 1.0M    |       |           |       |       |       | $1         | $3          |
 | `xiaomi/mimo-v2.5`     | 1.0M    |       |           |       |       |       | $0.40      | $2          |
 | `xiaomi/mimo-v2.5-pro` | 1.0M    |       |           |       |       |       | $1         | $3          |

package/.docs/models/providers/zenmux.md CHANGED Viewed

@@ -114,7 +114,7 @@ for await (const chunk of stream) {
 | `zenmux/x-ai/grok-code-fast-1`                | 256K    |       |           |       |       |       | $0.20      | $2          |
 | `zenmux/xiaomi/mimo-v2-flash`                 | 262K    |       |           |       |       |       | $0.10      | $0.30       |
 | `zenmux/xiaomi/mimo-v2-omni`                  | 265K    |       |           |       |       |       | $0.40      | $2          |
-| `zenmux/xiaomi/mimo-v2-pro`                   | 1.0M    |       |           |       |       |       | $2         | $5          |
+| `zenmux/xiaomi/mimo-v2-pro`                   | 1.0M    |       |           |       |       |       | $1         | $3          |
 | `zenmux/xiaomi/mimo-v2.5`                     | 1.0M    |       |           |       |       |       | $0.40      | $2          |
 | `zenmux/xiaomi/mimo-v2.5-pro`                 | 1.0M    |       |           |       |       |       | $1         | $3          |
 | `zenmux/z-ai/glm-4.5`                         | 128K    |       |           |       |       |       | $0.35      | $2          |

package/.docs/models/providers.md CHANGED Viewed

@@ -45,6 +45,7 @@ Direct access to individual AI model providers. Each provider offers unique mode
 - [Jiekou.AI](https://mastra.ai/models/providers/jiekou)
 - [Kilo Gateway](https://mastra.ai/models/providers/kilo)
 - [Kimi For Coding](https://mastra.ai/models/providers/kimi-for-coding)
+- [Kiro](https://mastra.ai/models/providers/kiro)
 - [KUAE Cloud Coding Plan](https://mastra.ai/models/providers/kuae-cloud-coding-plan)
 - [Llama](https://mastra.ai/models/providers/llama)
 - [LLM Gateway](https://mastra.ai/models/providers/llmgateway)

package/.docs/reference/harness/harness-class.md CHANGED Viewed

@@ -3,6 +3,8 @@
 **Added in:** `@mastra/core@1.5.0`
 > **Warning:** The `Harness` class is in alpha stage and subject to change. It won't follow semantic versioning guarantees until it graduates from experimental status. Use with caution and expect breaking changes in minor versions.
+>
+> [Mastra Code](https://code.mastra.ai/) is the flagship implementation of the `Harness` class, showcasing how it can be used to build a powerful terminal-based coding agent with multi-model support, persistent conversations, and built-in tools.
 The `Harness` class orchestrates multiple agent modes, shared state, memory, and storage. It provides a control layer that a TUI or other UI can drive to manage threads, switch models and modes, send messages, handle tool approvals, and track events.

package/.docs/reference/index.md CHANGED Viewed

@@ -168,6 +168,7 @@ The Reference section provides documentation of Mastra's API, including paramete
 - [PrefillErrorHandler](https://mastra.ai/reference/processors/prefill-error-handler)
 - [Processor Interface](https://mastra.ai/reference/processors/processor-interface)
 - [PromptInjectionDetector](https://mastra.ai/reference/processors/prompt-injection-detector)
+- [ProviderHistoryCompat](https://mastra.ai/reference/processors/provider-history-compat)
 - [RegexFilterProcessor](https://mastra.ai/reference/processors/regex-filter-processor)
 - [SemanticRecall](https://mastra.ai/reference/processors/semantic-recall-processor)
 - [SkillSearchProcessor](https://mastra.ai/reference/processors/skill-search-processor)

package/.docs/reference/processors/processor-interface.md CHANGED Viewed

@@ -4,7 +4,7 @@ The `Processor` interface defines the contract for all processors in Mastra. Pro
 ## When processor methods run
-The six processor methods run at different points in the agent execution lifecycle:
+The seven processor methods run at different points in the agent execution lifecycle:
 ```text
 ┌─────────────────────────────────────────────────────────────────┐
@@ -26,6 +26,11 @@ The six processor methods run at different points in the agent execution lifecyc
 │  │  └──────────┬──────────┘                                │    │
 │  │             │                                           │    │
 │  │             ▼                                           │    │
+│  │  ┌─────────────────────┐                                │    │
+│  │  │  processLLMRequest   │  ← Runs before provider call   │    │
+│  │  └──────────┬──────────┘                                │    │
+│  │             │                                           │    │
+│  │             ▼                                           │    │
 │  │       LLM Execution ──── API Error? ──┐                │    │
 │  │             │                          │                │    │
 │  │             │              ┌───────────────────┐        │    │
@@ -59,14 +64,15 @@ The six processor methods run at different points in the agent execution lifecyc
 └─────────────────────────────────────────────────────────────────┘
 ```
-| Method                | When it runs                                           | Use case                                                                      |
-| --------------------- | ------------------------------------------------------ | ----------------------------------------------------------------------------- |
-| `processInput`        | Once at the start, before the agentic loop             | Validate/transform initial user input, add context                            |
-| `processInputStep`    | At each step of the agentic loop, before each LLM call | Transform messages between steps, handle tool results                         |
-| `processAPIError`     | When an LLM API call fails                             | Inspect API rejections, optionally mutate state/messages, and request a retry |
-| `processOutputStream` | On each streaming chunk during LLM response            | Filter/modify streaming content, detect patterns in real-time                 |
-| `processOutputStep`   | After each LLM response, before tool execution         | Validate output quality, implement guardrails with retry                      |
-| `processOutputResult` | Once after generation completes                        | Post-process final response, log results                                      |
+| Method                | When it runs                                           | Use case                                                                                     |
+| --------------------- | ------------------------------------------------------ | -------------------------------------------------------------------------------------------- |
+| `processInput`        | Once at the start, before the agentic loop             | Validate/transform initial user input, add context                                           |
+| `processInputStep`    | At each step of the agentic loop, before each LLM call | Transform messages between steps, handle tool results                                        |
+| `processLLMRequest`   | After LLM request conversion, before the provider call | Rewrite the outbound `LanguageModelV2Prompt` for the current call without persisting changes |
+| `processAPIError`     | When an LLM API call fails                             | Inspect API rejections, optionally mutate state/messages, and request a retry                |
+| `processOutputStream` | On each streaming chunk during LLM response            | Filter/modify streaming content, detect patterns in real-time                                |
+| `processOutputStep`   | After each LLM response, before tool execution         | Validate output quality, implement guardrails with retry                                     |
+| `processOutputResult` | Once after generation completes                        | Post-process final response, log results                                                     |
 ## Interface definition
@@ -97,6 +103,10 @@ interface Processor<TId extends string = string, TTripwireMetadata = unknown> {
     | void
     | undefined
+  processLLMRequest?(
+    args: ProcessLLMRequestArgs<TTripwireMetadata>,
+  ): Promise<ProcessLLMRequestResult> | ProcessLLMRequestResult
   processAPIError?(
     args: ProcessAPIErrorArgs<TTripwireMetadata>,
   ): Promise<ProcessAPIErrorResult | void> | ProcessAPIErrorResult | void
@@ -243,9 +253,10 @@ processInputStep?<TTripwireMetadata = unknown>(
 1. `processInput` (once at start)
 2. `processInputStep` from inputProcessors (at each step, before LLM call)
 3. `prepareStep` callback (runs as part of the processInputStep pipeline, after inputProcessors)
-4. LLM execution
-5. Tool execution (if needed)
-6. Repeat from step 2 if tools were called
+4. `processLLMRequest` from inputProcessors (after prompt conversion, before the provider call)
+5. LLM execution
+6. Tool execution (if needed)
+7. Repeat from step 2 if tools were called
 #### `ProcessInputStepArgs`
@@ -339,6 +350,57 @@ System messages are **reset to their original values** at the start of each step
 ***
+### `processLLMRequest`
+Processes the final LLM request after Mastra converts the `MessageList` into `LanguageModelV2Prompt` and before the provider call. Use this method for transient, model-aware rewrites that should affect only the current outbound request.
+Returned prompt changes are forwarded to the model for the current call only. They are not persisted back to `MessageList`, memory, UI history, or later provider calls.
+```typescript
+processLLMRequest?(
+  args: ProcessLLMRequestArgs,
+): Promise<ProcessLLMRequestResult> | ProcessLLMRequestResult;
+```
+#### `ProcessLLMRequestArgs`
+**prompt** (`LanguageModelV2Prompt`): The LLM request prompt that will be sent to the provider for this call.
+**model** (`MastraLanguageModel`): The resolved model that will receive the prompt. Use this to scope provider-specific rewrites.
+**stepNumber** (`number`): Current step number (0-indexed). Step 0 is the initial LLM call.
+**steps** (`StepResult[]`): Results from previous steps, including text, toolCalls, and toolResults.
+**state** (`Record<string, unknown>`): Per-processor state that persists across all method calls within this request.
+**abort** (`(reason?: string, options?: { retry?: boolean; metadata?: unknown }) => never`): Function to abort processing. Throws a TripWire error that stops execution and emits a \`tripwire\` chunk.
+**retryCount** (`number`): Current retry attempt count from \`ProcessorContext\`. Starts at \`0\`; use to cap processor-triggered retries.
+**requestContext** (`RequestContext`): Request-scoped context with execution metadata.
+**tracingContext** (`TracingContext`): Tracing context for observability.
+**writer** (`ProcessorStreamWriter`): Stream writer for emitting custom data chunks during streaming. Use \`writer.custom()\` to send transient UI signals.
+**abortSignal** (`AbortSignal`): Signal for cancelling the operation.
+#### Return value
+`processLLMRequest` returns `ProcessLLMRequestResult`, which is `{ prompt?: LanguageModelV2Prompt } | undefined | void`.
+- Return `{ prompt }` to replace the outbound prompt for the current provider call.
+- Return `undefined` or `void` to forward the original prompt unchanged.
+#### Use cases
+- Removing or reshaping provider-specific prompt parts before a model call
+- Normalizing roles or content to match a provider's input requirements
+- Adapting tool result formats when switching providers mid-loop
+***
 ### `processAPIError`
 Handles LLM API rejection errors before they surface as final errors. This runs when the API call fails with a non-retryable error (such as a 400 or 422 status code). Unlike `processOutputStep` which runs after successful responses, this runs when the API rejects the request.

package/.docs/reference/processors/provider-history-compat.md ADDED Viewed

@@ -0,0 +1,132 @@
+# ProviderHistoryCompat
+The `ProviderHistoryCompat` processor handles provider-specific history incompatibilities. It can rewrite the outbound language model prompt before a provider call, or react to API errors and retry with repaired message history.
+Use it when an agent may switch between model providers, reuse message history across providers, or call a provider that rejects fields emitted by another provider.
+## Usage example
+Add `ProviderHistoryCompat` to `inputProcessors` when you want all built-in compatibility rules available for an agent:
+```typescript
+import { Agent } from '@mastra/core/agent'
+import { ProviderHistoryCompat } from '@mastra/core/processors'
+export const agent = new Agent({
+  name: 'my-agent',
+  instructions: 'You are a helpful assistant.',
+  model: 'anthropic/claude-sonnet-4-5',
+  inputProcessors: [new ProviderHistoryCompat()],
+})
+```
+Mastra agents don't add this processor automatically. Add it explicitly when you need provider history compatibility rules, reactive API error recovery, custom rules, or predictable processor ordering.
+## Constructor parameters
+**opts** (`{ additionalRules?: CompatRule[] }`): Configuration options for provider history compatibility rules.
+**opts.additionalRules** (`CompatRule[]`): Custom compatibility rules to run after the built-in rules. Rules can rewrite the outbound prompt or repair persisted messages after matching an API error.
+## Properties
+**id** (`'provider-history-compat'`): Processor identifier.
+**name** (`'Provider History Compat'`): Processor display name.
+**processLLMRequest** (`(args: ProcessLLMRequestArgs) => ProcessLLMRequestResult`): Runs preemptive compatibility rules against the converted LanguageModelV2Prompt immediately before the provider call. Returned prompt changes are transient and are not persisted to memory or message history.
+**processAPIError** (`(args: ProcessAPIErrorArgs) => Promise<ProcessAPIErrorResult | void>`): Runs reactive compatibility rules when a provider rejects the request. Matching rules can mutate the message list and return retry: true on the first retry attempt.
+## Built-in rules
+`ProviderHistoryCompat` includes these built-in compatibility rules:
+| Rule                                        | Provider  | Timing                      | Behavior                                                                                                                          |
+| ------------------------------------------- | --------- | --------------------------- | --------------------------------------------------------------------------------------------------------------------------------- |
+| `anthropic-tool-id-format`                  | Anthropic | Reactive API error recovery | Rewrites tool call IDs that contain characters outside `[a-zA-Z0-9_-]` and retries the request.                                   |
+| `cerebras-strip-reasoning-content`          | Cerebras  | Preemptive prompt rewrite   | Removes assistant `reasoning` parts from the outbound prompt so they're not serialized as unsupported `reasoning_content` fields. |
+| `anthropic-strip-foreign-reasoning-content` | Anthropic | Preemptive prompt rewrite   | Removes non-Anthropic assistant `reasoning` parts from the outbound prompt. Anthropic-native thinking history is preserved.       |
+Preemptive rules run through `processLLMRequest` after Mastra converts messages to the model prompt format and before the prompt is sent to the provider. These rewrites affect only the current provider call.
+Reactive rules run through `processAPIError` after a provider rejection. They can update the persisted `messageList` and request a retry.
+## `CompatRule`
+A `CompatRule` defines one provider history compatibility fix:
+```typescript
+import type { CompatRule } from '@mastra/core/processors'
+const removeUnsupportedPromptParts: CompatRule = {
+  name: 'remove-unsupported-prompt-parts',
+  applyToPrompt({ prompt, model }) {
+    // Return a modified LanguageModelV2Prompt, or undefined to leave it unchanged.
+    return undefined
+  },
+}
+```
+**name** (`string`): Human-readable rule identifier for logs and debugging.
+**errorPatterns** (`RegExp[]`): Patterns matched against provider API error messages and response bodies. Required for reactive rules that implement fix.
+**fix** (`(messages: MastraDBMessage[]) => boolean`): Reactive fix that mutates persisted database messages after a matching API error. Return true when the rule changed messages and the request should retry.
+**applyToPrompt** (`(args: { prompt: LanguageModelV2Prompt; model: unknown }) => LanguageModelV2Prompt | undefined`): Preemptive fix that rewrites the outbound prompt for the current provider call. Return undefined when no prompt change is needed.
+## Custom rules
+Pass custom rules through `additionalRules`. Custom rules run after the built-in rules:
+```typescript
+import { Agent } from '@mastra/core/agent'
+import { ProviderHistoryCompat, type CompatRule } from '@mastra/core/processors'
+const stripUnsupportedAssistantMetadata: CompatRule = {
+  name: 'strip-unsupported-assistant-metadata',
+  applyToPrompt({ prompt, model }) {
+    if (typeof model !== 'string' || !model.startsWith('example-provider/')) {
+      return undefined
+    }
+    let changed = false
+    const nextPrompt = prompt.map(message => {
+      if (message.role !== 'assistant' || typeof message.content === 'string') {
+        return message
+      }
+      const nextContent = message.content.map(part => {
+        if (!('providerOptions' in part)) return part
+        changed = true
+        const { providerOptions: _providerOptions, ...rest } = part
+        return rest
+      })
+      return { ...message, content: nextContent }
+    })
+    return changed ? nextPrompt : undefined
+  },
+}
+export const agent = new Agent({
+  name: 'custom-provider-agent',
+  instructions: 'You are a helpful assistant.',
+  model: 'example-provider/model',
+  inputProcessors: [
+    new ProviderHistoryCompat({
+      additionalRules: [stripUnsupportedAssistantMetadata],
+    }),
+  ],
+})
+```
+Use `applyToPrompt` for provider-specific rewrites that shouldn't be saved to memory. Use `fix` with `errorPatterns` when the provider rejects persisted message history and the repaired history should be reused on future turns.
+## Related
+- [Processor interface](https://mastra.ai/reference/processors/processor-interface)
+- [Processors](https://mastra.ai/docs/agents/processors)
+- [PrefillErrorHandler](https://mastra.ai/reference/processors/prefill-error-handler)

package/.docs/reference/tools/mcp-client.md CHANGED Viewed

@@ -987,6 +987,53 @@ await agent.generate('Hello!', {
 })
 ```
+## Handling auth failures inside custom fetch
+A custom `fetch` should not `throw` when authentication is unavailable. The Streamable HTTP transport in the MCP SDK opens a long-lived `GET /mcp` "standalone listener" stream in the background to receive server-pushed notifications. Errors on that stream are retried with exponential backoff, and a thrown `fetch` or a cleanly-closed stream can produce an indefinite reconnect loop at roughly one attempt per second.
+Return a synthetic `Response` instead. The [MCP Streamable HTTP specification](https://modelcontextprotocol.io/specification/2025-03-26/basic/transports) defines `405 Method Not Allowed` as the signal a server returns when it does not offer the GET SSE stream, and the SDK honors it as a terminal status that stops the listener cleanly. Use this to disable the listener when your server does not push notifications.
+The following pattern waits for an auth token on POST requests, attaches it to outgoing headers, and short-circuits the GET listener with a synthetic 405:
+```typescript
+async function waitForToken(timeoutMs = 5000): Promise<string | null> {
+  // Replace with your token lookup. Return null if no token is available.
+  return getAuthToken({ timeoutMs })
+}
+const mcpClient = new MCPClient({
+  servers: {
+    apiServer: {
+      url: new URL('https://api.example.com/mcp'),
+      fetch: async (url, init) => {
+        const method = (init?.method || 'GET').toUpperCase()
+        // The SDK opens a background GET stream for server-pushed notifications.
+        // If your server does not use it, short-circuit with 405 to stop reconnect attempts.
+        if (method === 'GET') {
+          return new Response(null, { status: 405, statusText: 'Method Not Allowed' })
+        }
+        // POST: wait for the token, then forward the request with an Authorization header.
+        const token = await waitForToken()
+        if (!token) {
+          // Forward the request without a token and let the server reject it.
+          // The SDK surfaces non-2xx POST responses as errors to the caller of
+          // tools/list, tools/call, etc., which is the desired behavior here.
+          return fetch(url, init)
+        }
+        const headers = new Headers(init?.headers)
+        headers.set('authorization', `Bearer ${token}`)
+        return fetch(url, { ...init, headers })
+      },
+    },
+  },
+})
+```
+Return `405` for the GET listener only when your server does not push notifications back to the client. If your server uses the standalone GET stream, attach the auth token on `GET` requests as well and let the request through.
 ## Using SSE request headers
 When using the legacy SSE MCP transport, you must configure both `requestInit` and `eventSourceInit` due to a bug in the MCP SDK. Alternatively, you can use a custom `fetch` function which will be automatically used for both POST requests and SSE connections:

package/CHANGELOG.md CHANGED Viewed

@@ -1,5 +1,19 @@
 # @mastra/mcp-docs-server
+## 1.1.35-alpha.6
+### Patch Changes
+- Updated dependencies [[`d1fdbd0`](https://github.com/mastra-ai/mastra/commit/d1fdbd012add5623cb7e6b7f882b605ab358bbb4), [`d91ebe2`](https://github.com/mastra-ai/mastra/commit/d91ebe28ee065d8f2ed6df741c3c07f58d359529)]:
+  - @mastra/core@1.33.0-alpha.2
+## 1.1.35-alpha.4
+### Patch Changes
+- Updated dependencies [[`dccd8f1`](https://github.com/mastra-ai/mastra/commit/dccd8f1f8b8f1ad203b77556207e5529567c616d)]:
+  - @mastra/core@1.33.0-alpha.1
 ## 1.1.35-alpha.1
 ### Patch Changes

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@mastra/mcp-docs-server",
-  "version": "1.1.35-alpha.3",
+  "version": "1.1.35-alpha.6",
   "description": "MCP server for accessing Mastra.ai documentation, changelogs, and news.",
   "type": "module",
   "main": "dist/index.js",
@@ -29,8 +29,8 @@
     "jsdom": "^26.1.0",
     "local-pkg": "^1.1.2",
     "zod": "^4.3.6",
-    "@mastra/core": "1.33.0-alpha.0",
-    "@mastra/mcp": "^1.7.0"
+    "@mastra/mcp": "^1.7.0",
+    "@mastra/core": "1.33.0-alpha.2"
   },
   "devDependencies": {
     "@hono/node-server": "^1.19.11",
@@ -46,9 +46,9 @@
     "tsx": "^4.21.0",
     "typescript": "^6.0.3",
     "vitest": "4.1.5",
-    "@internal/types-builder": "0.0.67",
-    "@mastra/core": "1.33.0-alpha.0",
-    "@internal/lint": "0.0.92"
+    "@internal/lint": "0.0.92",
+    "@mastra/core": "1.33.0-alpha.2",
+    "@internal/types-builder": "0.0.67"
   },
   "homepage": "https://mastra.ai",
   "repository": {