npm - @mastra/mcp-docs-server - Versions diffs - 0.13.10 → 0.13.11-alpha.0 - Mend

@mastra/mcp-docs-server 0.13.10 → 0.13.11-alpha.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (59) hide show

package/.docs/organized/changelogs/%40internal%2Fstorage-test-utils.md +9 -9
package/.docs/organized/changelogs/%40internal%2Ftypes-builder.md +2 -0
package/.docs/organized/changelogs/%40mastra%2Fclient-js.md +18 -18
package/.docs/organized/changelogs/%40mastra%2Fcore.md +23 -23
package/.docs/organized/changelogs/%40mastra%2Fdeployer-cloudflare.md +20 -20
package/.docs/organized/changelogs/%40mastra%2Fdeployer-netlify.md +20 -20
package/.docs/organized/changelogs/%40mastra%2Fdeployer-vercel.md +20 -20
package/.docs/organized/changelogs/%40mastra%2Fdeployer.md +20 -20
package/.docs/organized/changelogs/%40mastra%2Ffirecrawl.md +13 -13
package/.docs/organized/changelogs/%40mastra%2Flibsql.md +9 -9
package/.docs/organized/changelogs/%40mastra%2Fmcp-docs-server.md +18 -18
package/.docs/organized/changelogs/%40mastra%2Fmemory.md +12 -12
package/.docs/organized/changelogs/%40mastra%2Fpg.md +9 -9
package/.docs/organized/changelogs/%40mastra%2Fplayground-ui.md +21 -21
package/.docs/organized/changelogs/%40mastra%2Frag.md +12 -12
package/.docs/organized/changelogs/%40mastra%2Fschema-compat.md +7 -0
package/.docs/organized/changelogs/%40mastra%2Fserver.md +18 -18
package/.docs/organized/changelogs/create-mastra.md +9 -9
package/.docs/organized/changelogs/mastra.md +22 -22
package/.docs/organized/code-examples/agent-network.md +4 -3
package/.docs/organized/code-examples/agent.md +33 -2
package/.docs/raw/agents/overview.mdx +21 -1
package/.docs/raw/getting-started/mcp-docs-server.mdx +2 -2
package/.docs/raw/rag/chunking-and-embedding.mdx +11 -0
package/.docs/raw/reference/agents/agent.mdx +64 -38
package/.docs/raw/reference/agents/generate.mdx +206 -202
package/.docs/raw/reference/agents/getAgent.mdx +23 -38
package/.docs/raw/reference/agents/getDefaultGenerateOptions.mdx +62 -0
package/.docs/raw/reference/agents/getDefaultStreamOptions.mdx +62 -0
package/.docs/raw/reference/agents/getDefaultVNextStreamOptions.mdx +62 -0
package/.docs/raw/reference/agents/getDescription.mdx +30 -0
package/.docs/raw/reference/agents/getInstructions.mdx +36 -73
package/.docs/raw/reference/agents/getLLM.mdx +69 -0
package/.docs/raw/reference/agents/getMemory.mdx +42 -119
package/.docs/raw/reference/agents/getModel.mdx +36 -75
package/.docs/raw/reference/agents/getScorers.mdx +62 -0
package/.docs/raw/reference/agents/getTools.mdx +36 -128
package/.docs/raw/reference/agents/getVoice.mdx +36 -83
package/.docs/raw/reference/agents/getWorkflows.mdx +37 -74
package/.docs/raw/reference/agents/stream.mdx +263 -226
package/.docs/raw/reference/agents/streamVNext.mdx +208 -402
package/.docs/raw/reference/rag/chunk.mdx +51 -2
package/.docs/raw/reference/scorers/answer-relevancy.mdx +6 -6
package/.docs/raw/reference/scorers/bias.mdx +6 -6
package/.docs/raw/reference/scorers/completeness.mdx +2 -2
package/.docs/raw/reference/scorers/content-similarity.mdx +1 -1
package/.docs/raw/reference/scorers/create-scorer.mdx +445 -0
package/.docs/raw/reference/scorers/faithfulness.mdx +6 -6
package/.docs/raw/reference/scorers/hallucination.mdx +6 -6
package/.docs/raw/reference/scorers/keyword-coverage.mdx +2 -2
package/.docs/raw/reference/scorers/mastra-scorer.mdx +116 -158
package/.docs/raw/reference/scorers/toxicity.mdx +2 -2
package/.docs/raw/scorers/custom-scorers.mdx +166 -268
package/.docs/raw/scorers/overview.mdx +21 -13
package/.docs/raw/server-db/local-dev-playground.mdx +3 -3
package/package.json +3 -3
package/.docs/raw/reference/agents/createTool.mdx +0 -241
package/.docs/raw/reference/scorers/custom-code-scorer.mdx +0 -155
package/.docs/raw/reference/scorers/llm-scorer.mdx +0 -210

package/.docs/raw/scorers/custom-scorers.mdx CHANGED Viewed

@@ -1,319 +1,217 @@
 ## Creating scorers
-Mastra provides two approaches for creating custom scorers:
+Mastra provides a unified `createScorer` factory that allows you to build custom evaluation logic using either JavaScript functions or LLM-based prompt objects for each step. This flexibility lets you choose the best approach for each part of your evaluation pipeline.
-**Code scorers** use programmatic logic and algorithms. They're ideal for deterministic evaluations, performance-critical scenarios, and cases where you have clear algorithmic criteria.
+### The Four-Step Pipeline
-**LLM scorers** use language models as judges. They're perfect for subjective evaluations, complex criteria that are difficult to code algorithmically, and cases where human-like judgment is needed.
+All scorers in Mastra follow a consistent four-step evaluation pipeline:
-### Code-based scorers
+1. **preprocess** (optional): Prepare or transform input/output data
+2. **analyze** (optional): Perform evaluation analysis and gather insights
+3. **generateScore** (required): Convert analysis into a numerical score
+4. **generateReason** (optional): Generate human-readable explanations
-Code scorers use `createScorer` to build evaluation logic with programmatic algorithms. They're ideal for deterministic evaluations, performance-critical scenarios, and cases where you have clear algorithmic criteria or need integration with existing libraries.
+Each step can use either **functions** or **prompt objects** (LLM-based evaluation), giving you the flexibility to combine deterministic algorithms with AI judgment as needed.
-Code scorers follow Mastra's three-step evaluation pipeline:
-- an optional **extract** step for preprocessing complex data
-- a required **analyze** step for core evaluation and scoring
-- and an optional **reason** step for generating explanations.
+### Functions vs Prompt Objects
-For the complete API reference, see [`createScorer`](/reference/scorers/custom-code-scorer), and for a detailed explanation of the pipeline, see [evaluation process](/docs/scorers/overview#evaluation-pipeline).
+**Functions** use JavaScript for deterministic logic. They're ideal for:
+- Algorithmic evaluations with clear criteria
+- Performance-critical scenarios
+- Integration with existing libraries
+- Consistent, reproducible results
-#### Extract Step
+**Prompt Objects** use LLMs as judges for evaluation. They're perfect for:
+- Subjective evaluations requiring human-like judgment
+- Complex criteria difficult to code algorithmically
+- Natural language understanding tasks
+- Nuanced context evaluation
-This optional step preprocesses input/output data when you need to evaluate multiple distinct elements, filter content, or focus analysis on specific parts of complex data.
+You can mix and match approaches within a single scorer - for example, use a function for preprocessing data and an LLM for analyzing quality.
-- **Receives:**
-  - `input`: User messages (when used with agents) or workflow step input (when used with workflow steps)
-  - `output`: Agent's response (when used with agents) or workflow step output (when used with workflow steps)
-  - `runtimeContext`: Runtime context from the agent or workflow step being evaluated
-- **Must return:** `{ results: any }`
-- **Data flow:** The `results` value is passed to the analyze step as `extractStepResult`
+### Initializing a Scorer
-```typescript filename="src/mastra/scorers/keyword-coverage-scorer.ts" showLineNumbers copy
-import { createScorer } from "@mastra/core/scores";
-import keywordExtractor from "keyword-extractor";
+Every scorer starts with the `createScorer` factory function, which requires a name and description, and optionally accepts a judge configuration for LLM-based steps.
-export const keywordCoverageScorer = createScorer({
-  name: "Keyword Coverage",
-  description: "Evaluates how well the output covers keywords from the input",
+```typescript
+import { createScorer } from '@mastra/core/scores';
+import { openai } from '@ai-sdk/openai';
-  // Step 1: Extract keywords from input and output
-  extract: async ({ input, output }) => {
-    const inputText = input?.map(i => i.content).join(", ") || "";
-    const outputText = output.text;
-    const extractKeywords = (text: string) => {
-      return keywordExtractor.extract(text);
-    };
-    const inputKeywords = new Set(extractKeywords(inputText));
-    const outputKeywords = new Set(extractKeywords(outputText));
-    return {
-      results: {
-        inputKeywords,
-        outputKeywords,
-      },
-    };
-  },
-  // ... analyze and reason steps
-});
+const glutenCheckerScorer = createScorer({
+  name: 'Gluten Checker',
+  description: 'Check if recipes contain gluten ingredients',
+  judge: {                    // Optional: for prompt object steps
+    model: openai('gpt-4o'),
+    instructions: 'You are a Chef that identifies if recipes contain gluten.'
+  }
+})
+// Chain step methods here
+.preprocess(...)
+.analyze(...)
+.generateScore(...)
+.generateReason(...)
 ```
-#### Analyze Step
+The judge configuration is only needed if you plan to use prompt objects in any step. Individual steps can override this default configuration with their own judge settings.
-This required step performs the core evaluation and generates the numerical score for all scorers.
+### Step-by-Step Breakdown
-- **Receives:** Everything from extract step, plus:
-  - `extractStepResult`: Results from the extract step (if extract step was defined)
-- **Must return:** `{ score: number, results?: any }`
-- **Data flow:** The `score` and optional `results` are passed to the reason step
+#### preprocess Step (Optional)
-```typescript filename="src/mastra/scorers/keyword-coverage-scorer.ts" showLineNumbers copy
-export const keywordCoverageScorer = createScorer({
-  // ... name, description, extract step
+Prepares input/output data when you need to extract specific elements, filter content, or transform complex data structures.
-  // Step 2: Analyze keyword coverage and calculate score
-  analyze: async ({ input, output, extractStepResult }) => {
-    const { inputKeywords, outputKeywords } = extractStepResult.results;
-    if (inputKeywords.size === 0) {
-      return { score: 1, results: { coverage: 1, matched: 0, total: 0 } };
-    }
+**Functions:** `({ run, results }) => any`
-    const matchedKeywords = [...inputKeywords].filter(keyword =>
-      outputKeywords.has(keyword)
-    );
-    const coverage = matchedKeywords.length / inputKeywords.size;
-    return {
-      score: coverage,
-      results: {
-        coverage,
-        matched: matchedKeywords.length,
-        total: inputKeywords.size,
-        matchedKeywords,
-      },
-    };
-  },
-  // ... reason step
-});
+```typescript
+const glutenCheckerScorer = createScorer(...)
+.preprocess(({ run }) => {
+  // Extract and clean recipe text
+  const recipeText = run.output.text.toLowerCase();
+  const wordCount = recipeText.split(' ').length;
+  return {
+    recipeText,
+    wordCount,
+    hasCommonGlutenWords: /flour|wheat|bread|pasta/.test(recipeText)
+  };
+})
 ```
-#### Reason Step
-This optional step generates human-readable explanations for scores, useful for actionable feedback, debugging transparency, or compliance documentation.
-- **Receives:** Everything from analyze step, plus:
-  - `score`: The numerical score (0-1) calculated by the analyze step
-  - `analyzeStepResult`: Results from the analyze step (contains the score and any additional results)
-- **Must return:** `{ reason: string }`
-```typescript filename="src/mastra/scorers/keyword-coverage-scorer.ts" showLineNumbers copy
-export const keywordCoverageScorer = createScorer({
-  // ... name, description, extract and analyze steps
-  // Step 3: Generate explanation for the score
-  reason: async ({ score, analyzeStepResult, extractStepResult }) => {
-    const { matched, total, matchedKeywords } = analyzeStepResult.results;
-    const { inputKeywords } = extractStepResult.results;
+**Prompt Objects:** Use `description`, `outputSchema`, and `createPrompt` to structure LLM-based preprocessing.
+```typescript
+const glutenCheckerScorer = createScorer(...)
+.preprocess({
+  description: 'Extract ingredients from the recipe',
+  outputSchema: z.object({
+    ingredients: z.array(z.string()),
+    cookingMethods: z.array(z.string())
+  }),
+  createPrompt: ({ run }) => `
+    Extract all ingredients and cooking methods from this recipe:
+    ${run.output.text}
-    const percentage = Math.round(score * 100);
-    const missedKeywords = [...inputKeywords].filter(
-      keyword => !matchedKeywords.includes(keyword)
-    );
-    let reason = `The output achieved ${percentage}% keyword coverage (${matched}/${total} keywords).`;
-    if (matchedKeywords.length > 0) {
-      reason += ` Covered keywords: ${matchedKeywords.join(", ")}.`;
-    }
-    if (missedKeywords.length > 0) {
-      reason += ` Missing keywords: ${missedKeywords.join(", ")}.`;
-    }
-    return { reason };
-  },
-});
+    Return JSON with ingredients and cookingMethods arrays.
+  `
+})
 ```
-**Examples and Resources:**
-- [Custom Native JavaScript Scorer Example](/examples/scorers/custom-native-javascript-eval) - Example walkthrough.
-- [Built-in Code Scorers](https://github.com/mastra-ai/mastra/tree/main/packages/evals/src/scorers/code) - Real implementations for reference
-### LLM-based scorers
-LLM scorers use `createLLMScorer` to build evaluations that leverage language models as judges. They're perfect for subjective evaluations that require understanding context, complex criteria that are difficult to code algorithmically, natural language understanding tasks, and cases where human-like judgment is needed.
-LLM scorers follow the same evaluation pipeline as code scorers with an additional `calculateScore` function:
-- an optional **extract** step where the LLM processes input/output and returns structured data
-- a required **analyze** step where the LLM performs evaluation and returns structured analysis
-- a required **calculateScore** function that converts LLM analysis into numerical score
-- and an optional **reason** step where the LLM generates human-readable explanations
-The `calculateScore` function leverages the best of both approaches: LLMs excel at qualitative analysis and understanding, while deterministic functions ensure precise and consistent numerical scoring.
-For the complete API reference, see [`createLLMScorer`](/reference/scorers/llm-scorer), and for a detailed explanation of the pipeline, see [evaluation process](/docs/scorers/overview#evaluation-pipeline).
+**Data Flow:** Results are available to subsequent steps as `results.preprocessStepResult`
-#### Judge Configuration
+#### analyze Step (Optional)
-All LLM scorer steps share this required configuration that defines the model and system instructions.
+Performs core evaluation analysis, gathering insights that will inform the scoring decision.
-- **Configuration:** `judge` object containing:
-  - **model:** The LLM model instance for evaluation
-  - **instructions:** System prompt that guides the LLM's behavior
+**Functions:** `({ run, results }) => any`
-```typescript filename="src/mastra/scorers/tone-scorer.ts" showLineNumbers copy
-import { openai } from "@ai-sdk/openai";
-import { createLLMScorer } from "@mastra/core/scores";
-export const toneScorer = createLLMScorer({
-  name: 'Tone Scorer',
-  description: 'Evaluates the tone of the output',
+```typescript
+const glutenCheckerScorer = createScorer({...})
+.preprocess(...)
+.analyze(({ run, results }) => {
+  const { recipeText, hasCommonGlutenWords } = results.preprocessStepResult;
-  // Shared judge configuration
-  judge: {
-    model: openai('gpt-4o'),
-    instructions: 'You are an expert in analyzing text tone and communication style.',
-  },
+  // Simple gluten detection algorithm
+  const glutenKeywords = ['wheat', 'flour', 'barley', 'rye', 'bread'];
+  const foundGlutenWords = glutenKeywords.filter(word =>
+    recipeText.includes(word)
+  );
-  // ... other steps
-});
+  return {
+    isGlutenFree: foundGlutenWords.length === 0,
+    detectedGlutenSources: foundGlutenWords,
+    confidence: hasCommonGlutenWords ? 0.9 : 0.7
+  };
+})
 ```
-#### Extract Step
-This optional step uses an LLM to preprocess input/output data when you need to evaluate multiple distinct elements, filter content, or focus analysis on specific parts of complex data.
-- **Configuration:** `{ description, outputSchema, createPrompt }`
-- **Data flow:** The structured output (defined by outputSchema) is passed to the analyze step as `extractStepResult`
-```typescript filename="src/mastra/scorers/content-scorer.ts" showLineNumbers copy
-export const contentScorer = createLLMScorer({
-  // ... judge configuration
-  extract: {
-    description: 'Extract key themes and topics from the content',
-    outputSchema: z.object({
-      themes: z.array(z.string()),
-      topics: z.array(z.string()),
-      keyPhrases: z.array(z.string())
-    }),
-    createPrompt: ({ run }) => `
-      Analyze this content and extract:
-      1. Main themes (3-5 high-level concepts)
-      2. Specific topics mentioned
-      3. Key phrases that capture the essence
-      Content: ${run.output.text}
-      Return a JSON object with themes, topics, and keyPhrases arrays.
-    `,
-  },
-  // ... other steps
-});
+**Prompt Objects:** Use `description`, `outputSchema`, and `createPrompt` for LLM-based analysis.
+```typescript
+const glutenCheckerScorer = createScorer({...})
+.preprocess(...)
+.analyze({
+  description: 'Analyze recipe for gluten content',
+  outputSchema: z.object({
+    isGlutenFree: z.boolean(),
+    glutenSources: z.array(z.string()),
+    confidence: z.number().min(0).max(1)
+  }),
+  createPrompt: ({ run, results }) => `
+    Analyze this recipe for gluten content:
+    "${results.preprocessStepResult.recipeText}"
+    Look for wheat, barley, rye, and hidden sources like soy sauce.
+    Return JSON with isGlutenFree, glutenSources array, and confidence (0-1).
+  `
+})
 ```
-#### Analyze Step
+**Data Flow:** Results are available to subsequent steps as `results.analyzeStepResult`
-This required step uses an LLM to perform the core evaluation and return structured analysis that will be converted to a numerical score.
+#### generateScore Step (Required)
-- **Configuration:** `{ description, outputSchema, createPrompt }`
-- **Data flow:** The structured output is passed to the calculateScore function and then to the reason step
+Converts analysis results into a numerical score. This is the only required step in the pipeline.
-```typescript filename="src/mastra/scorers/quality-scorer.ts" showLineNumbers copy
-export const qualityScorer = createLLMScorer({
-  // ... judge configuration
-  analyze: {
-    description: 'Evaluate content quality across multiple dimensions',
-    outputSchema: z.object({
-      clarity: z.number().min(1).max(5),
-      accuracy: z.number().min(1).max(5),
-      completeness: z.number().min(1).max(5),
-      relevance: z.number().min(1).max(5)
-    }),
-    createPrompt: ({ run }) => `
-      Evaluate this content on a scale of 1-5 for:
-      - Clarity: How clear and understandable is it?
-      - Accuracy: How factually correct does it appear?
-      - Completeness: How thorough is the response?
-      - Relevance: How well does it address the input?
-      Input: ${run.input.map(i => i.content).join(', ')}
-      Output: ${run.output.text}
-      Return a JSON object with numeric scores for each dimension.
-    `,
-  },
+**Functions:** `({ run, results }) => number`
+```typescript
+const glutenCheckerScorer = createScorer({...})
+.preprocess(...)
+.analyze(...)
+.generateScore(({ results }) => {
+  const { isGlutenFree, confidence } = results.analyzeStepResult;
-  // ... other steps
-});
+  // Return 1 for gluten-free, 0 for contains gluten
+  // Weight by confidence level
+  return isGlutenFree ? confidence : 0;
+})
 ```
-#### Calculate Score Step
+**Prompt Objects:** See the [`createScorer`](/reference/scorers/create-scorer) API reference for details on using prompt objects with generateScore, including required `calculateScore` function.
-This required function converts the LLM's structured analysis into a numerical score, providing deterministic scoring logic since LLMs aren't reliable for consistent numerical outputs.
+**Data Flow:** The score is available to generateReason as the `score` parameter
-- **Configuration:** `calculateScore` function that receives `{ run }` and returns a number
-- **Data flow:** Converts the analyze step's structured output into a numerical score (0-1 range)
+#### generateReason Step (Optional)
-```typescript filename="src/mastra/scorers/quality-scorer.ts" showLineNumbers copy
-export const qualityScorer = createLLMScorer({
-  // ... previous steps
-  calculateScore: ({ run }) => {
-    const { clarity, accuracy, completeness, relevance } = run.analyzeStepResult;
-    // Calculate weighted average (scale of 1-5 to 0-1)
-    const weights = { clarity: 0.3, accuracy: 0.3, completeness: 0.2, relevance: 0.2 };
-    const weightedSum = (clarity * weights.clarity) +
-                       (accuracy * weights.accuracy) +
-                       (completeness * weights.completeness) +
-                       (relevance * weights.relevance);
-    // Convert from 1-5 scale to 0-1 scale
-    return (weightedSum - 1) / 4;
-  },
-  // ... other steps
-});
-```
+Generates human-readable explanations for the score, useful for debugging, transparency, or user feedback.
-#### Reason Step
+**Functions:** `({ run, results, score }) => string`
-This optional step uses an LLM to generate human-readable explanations for scores, useful for actionable feedback, debugging transparency, or compliance documentation.
-- **Configuration:** `{ description, createPrompt }`
-- **Data flow:** Receives all previous step results and score, returns a string explanation
-```typescript filename="src/mastra/scorers/quality-scorer.ts" showLineNumbers copy
-export const qualityScorer = createLLMScorer({
-  // ... previous steps
+```typescript
+const glutenCheckerScorer = createScorer({...})
+.preprocess(...)
+.analyze(...)
+.generateScore(...)
+.generateReason(({ results, score }) => {
+  const { isGlutenFree, glutenSources } = results.analyzeStepResult;
-  reason: {
-    createPrompt: ({ run }) => {
-      const { clarity, accuracy, completeness, relevance } = run.analyzeStepResult;
-      const percentage = Math.round(run.score * 100);
-      return `
-        The content received a ${percentage}% quality score based on:
-        - Clarity: ${clarity}/5
-        - Accuracy: ${accuracy}/5
-        - Completeness: ${completeness}/5
-        - Relevance: ${relevance}/5
-        Provide a brief explanation of what contributed to this score.
-      `;
-    },
-  },
-});
+  if (isGlutenFree) {
+    return `Score: ${score}. This recipe is gluten-free with no harmful ingredients detected.`;
+  } else {
+    return `Score: ${score}. Contains gluten from: ${glutenSources.join(', ')}`;
+  }
+})
+```
+**Prompt Objects:** Use `description` and `createPrompt` for LLM-generated explanations.
+```typescript
+const glutenCheckerScorer = createScorer({...})
+.preprocess(...)
+.analyze(...)
+.generateScore(...)
+.generateReason({
+  description: 'Explain the gluten assessment',
+  createPrompt: ({ results, score }) => `
+    Explain why this recipe received a score of ${score}.
+    Analysis: ${JSON.stringify(results.analyzeStepResult)}
+    Provide a clear explanation for someone with dietary restrictions.
+  `
+})
 ```
 **Examples and Resources:**
-- [Custom LLM Judge Scorer Example](/examples/scorers/custom-llm-judge-eval) - Example Walkthrough with gluten checker
-- [Built-in LLM Scorers](https://github.com/mastra-ai/mastra/tree/main/packages/evals/src/scorers/llm) - Real implementations for reference
+- [Custom Scorer Example](/examples/scorers/custom-scorer) - Complete walkthrough
+- [createScorer API Reference](/reference/scorers/create-scorer) - Complete technical documentation
+- [Built-in Scorers Source Code](https://github.com/mastra-ai/mastra/tree/main/packages/evals/src/scorers) - Real implementations for reference

package/.docs/raw/scorers/overview.mdx CHANGED Viewed

@@ -11,28 +11,36 @@ description: Overview of scorers in Mastra, detailing their capabilities for eva
 ## Evaluation pipeline
-Mastra scorers follow an optional three-step pipeline that allows for evaluation workflows:
+Mastra scorers follow a flexible four-step pipeline that allows for simple to complex evaluation workflows:
-1. **Extract** (Optional): Identify and isolate relevant content for focused evaluation
-2. **Analyze** (Required): Perform the core evaluation and generate a score
-3. **Reason** (Optional): Provide explanations or justifications for the score
+1. **preprocess** (Optional): Prepare or transform input/output data for evaluation
+2. **analyze** (Optional): Perform evaluation analysis and gather insights
+3. **generateScore** (Required): Convert analysis into a numerical score
+4. **generateReason** (Optional): Generate explanations or justifications for the score
 This modular structure enables both simple single-step evaluations and complex multi-stage analysis workflows, allowing you to build evaluations that match your specific needs.
 ### When to use each step
-**Extract step** - Use when your content is complex or needs preprocessing:
-- Separating facts from opinions in mixed responses
-- Focusing evaluation on specific sections of long outputs
+**preprocess step** - Use when your content is complex or needs preprocessing:
+- Extracting specific elements from complex data structures
+- Cleaning or normalizing text before analysis
 - Parsing multiple claims that need individual evaluation
-- Example: Bias detection that first identifies opinion statements
+- Filtering content to focus evaluation on relevant sections
-**Analyze step** - Always required for core evaluation:
-- Straightforward scenarios: Direct scoring of input/output pairs
-- Complex scenarios: Evaluate preprocessed content and generate detailed results
-- Applies your scoring criteria and calculates the numerical score
+**analyze step** - Use when you need structured evaluation analysis:
+- Gathering insights that inform the scoring decision
+- Breaking down complex evaluation criteria into components
+- Performing detailed analysis that generateScore will use
+- Collecting evidence or reasoning data for transparency
-**Reason step** - Use when explanations are important:
+**generateScore step** - Always required for converting analysis to scores:
+- Simple scenarios: Direct scoring of input/output pairs
+- Complex scenarios: Converting detailed analysis results into numerical scores
+- Applying business logic and weighting to analysis results
+- The only step that produces the final numerical score
+**generateReason step** - Use when explanations are important:
 - Users need to understand why a score was assigned
 - Debugging and transparency are critical
 - Compliance or auditing requires explanations

package/.docs/raw/server-db/local-dev-playground.mdx CHANGED Viewed

@@ -44,7 +44,7 @@ The Playground lets you interact with your agents, workflows, and tools. It prov
 Quickly test and debug your agents during development using the interactive chat interface in the Agent Playground.
 <VideoPlayer
-  src="https://res.cloudinary.com/dygi6femd/video/upload/v1751406022/local-dev-agents-playground_100_m3begx.mp4"
+  src="https://res.cloudinary.com/mastra-assets/video/upload/v1751406022/local-dev-agents-playground_100_m3begx.mp4"
 />
 Key features:
@@ -60,7 +60,7 @@ Key features:
 Validate workflows by supplying defined inputs and visualizing each step within the Workflow Playground.
 <VideoPlayer
-  src="https://res.cloudinary.com/dygi6femd/video/upload/v1751406027/local-dev-workflows-playground_100_rbc466.mp4"
+  src="https://res.cloudinary.com/mastra-assets/video/upload/v1751406027/local-dev-workflows-playground_100_rbc466.mp4"
 />
 Key features:
@@ -76,7 +76,7 @@ Key features:
 Quickly test and debug custom tools in isolation using the Tools Playground, without running a full agent or workflow.
 <VideoPlayer
-  src="https://res.cloudinary.com/dygi6femd/video/upload/v1751406316/local-dev-agents-tools_100_fe1jdt.mp4"
+  src="https://res.cloudinary.com/mastra-assets/video/upload/v1751406316/local-dev-agents-tools_100_fe1jdt.mp4"
 />
 Key features:

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@mastra/mcp-docs-server",
-  "version": "0.13.10",
+  "version": "0.13.11-alpha.0",
   "description": "MCP server for accessing Mastra.ai documentation, changelogs, and news.",
   "type": "module",
   "main": "dist/index.js",
@@ -32,7 +32,7 @@
     "uuid": "^11.1.0",
     "zod": "^3.25.67",
     "zod-to-json-schema": "^3.24.5",
-    "@mastra/core": "0.13.1",
+    "@mastra/core": "0.13.2-alpha.0",
     "@mastra/mcp": "^0.10.10"
   },
   "devDependencies": {
@@ -49,7 +49,7 @@
     "typescript": "^5.8.3",
     "vitest": "^3.2.4",
     "@internal/lint": "0.0.28",
-    "@mastra/core": "0.13.1"
+    "@mastra/core": "0.13.2-alpha.0"
   },
   "scripts": {
     "prepare-docs": "cross-env PREPARE=true node dist/prepare-docs/prepare.js",