@mastra/vectorize 1.0.0-beta.3 → 1.0.1-alpha.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CHANGELOG.md +43 -0
- package/dist/docs/SKILL.md +15 -21
- package/dist/docs/{SOURCE_MAP.json → assets/SOURCE_MAP.json} +1 -1
- package/dist/docs/{rag/02-retrieval.md → references/docs-rag-retrieval.md} +155 -189
- package/dist/docs/references/docs-rag-vector-databases.md +645 -0
- package/dist/docs/references/reference-vectors-vectorize.md +147 -0
- package/dist/index.cjs +9 -0
- package/dist/index.cjs.map +1 -1
- package/dist/index.js +9 -0
- package/dist/index.js.map +1 -1
- package/dist/vector/index.d.ts.map +1 -1
- package/package.json +10 -11
- package/dist/docs/README.md +0 -32
- package/dist/docs/rag/01-vector-databases.md +0 -638
- package/dist/docs/vectors/01-reference.md +0 -102
The hunks below cover the renamed doc `package/dist/docs/{rag/02-retrieval.md → references/docs-rag-retrieval.md}`:

````diff
@@ -1,5 +1,3 @@
-> Guide on retrieval processes in Mastra
-
 # Retrieval in RAG Systems
 
 After storing embeddings, you need to retrieve relevant chunks to answer user queries.
````
````diff
@@ -21,29 +19,29 @@ Mastra provides flexible retrieval options with support for semantic search, fil
 The simplest approach is direct semantic search. This method uses vector similarity to find chunks that are semantically similar to the query:
 
 ```ts
-import { embed } from
-import { PgVector } from
-import { ModelRouterEmbeddingModel } from
+import { embed } from 'ai'
+import { PgVector } from '@mastra/pg'
+import { ModelRouterEmbeddingModel } from '@mastra/core/llm'
 
 // Convert query to embedding
 const { embedding } = await embed({
-  value:
-  model: new ModelRouterEmbeddingModel(
-})
+  value: 'What are the main points in the article?',
+  model: new ModelRouterEmbeddingModel('openai/text-embedding-3-small'),
+})
 
 // Query vector store
 const pgVector = new PgVector({
   id: 'pg-vector',
   connectionString: process.env.POSTGRES_CONNECTION_STRING,
-})
+})
 const results = await pgVector.query({
-  indexName:
+  indexName: 'embeddings',
   queryVector: embedding,
   topK: 10,
-})
+})
 
 // Display results
-console.log(results)
+console.log(results)
 ```
 
 The `topK` parameter specifies the maximum number of most similar results to return from the vector search.
````
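The `topK` cutoff described in the hunk above works the same way in any vector store: score every candidate against the query vector, sort by similarity, truncate. Here is a dependency-free sketch of that mechanic (illustrative only; `Doc`, `cosine`, and `query` are invented names, not Mastra's API):

```typescript
// Minimal sketch of semantic search: cosine similarity + topK cutoff.
// Not Mastra's implementation - an illustration of what a vector store's
// query({ queryVector, topK }) computes conceptually.
type Doc = { text: string; vector: number[] };

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

function query(docs: Doc[], queryVector: number[], topK: number) {
  return docs
    .map((d) => ({ text: d.text, score: cosine(d.vector, queryVector) }))
    .sort((x, y) => y.score - x.score) // highest similarity first
    .slice(0, topK);                   // topK is an upper bound, not a guarantee
}

const docs: Doc[] = [
  { text: 'climate', vector: [1, 0.2] },
  { text: 'sports', vector: [0, 1] },
  { text: 'weather', vector: [0.9, 0.1] },
];
console.log(query(docs, [1, 0], 2)); // the two most similar docs, best first
```

If fewer documents exist than `topK`, everything is returned, which matches the "maximum number of results" wording in the doc.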
````diff
@@ -53,16 +51,16 @@ Results include both the text content and a similarity score:
 ```ts
 [
   {
-    text:
+    text: 'Climate change poses significant challenges...',
     score: 0.89,
-    metadata: { source:
+    metadata: { source: 'article1.txt' },
   },
   {
-    text:
+    text: 'Rising temperatures affect crop yields...',
     score: 0.82,
-    metadata: { source:
+    metadata: { source: 'article1.txt' },
   },
-]
+]
 ```
 
 ## Advanced Retrieval options
````
````diff
@@ -73,63 +71,63 @@ Filter results based on metadata fields to narrow down the search space. This ap
 
 This is useful when you have documents from different sources, time periods, or with specific attributes. Mastra provides a unified MongoDB-style query syntax that works across all supported vector stores.
 
-For detailed information about available operators and syntax, see the [Metadata Filters Reference](https://mastra.ai/reference/
+For detailed information about available operators and syntax, see the [Metadata Filters Reference](https://mastra.ai/reference/rag/metadata-filters).
 
 Basic filtering examples:
 
 ```ts
 // Simple equality filter
 const results = await pgVector.query({
-  indexName:
+  indexName: 'embeddings',
   queryVector: embedding,
   topK: 10,
   filter: {
-    source:
+    source: 'article1.txt',
   },
-})
+})
 
 // Numeric comparison
 const results = await pgVector.query({
-  indexName:
+  indexName: 'embeddings',
   queryVector: embedding,
   topK: 10,
   filter: {
     price: { $gt: 100 },
   },
-})
+})
 
 // Multiple conditions
 const results = await pgVector.query({
-  indexName:
+  indexName: 'embeddings',
   queryVector: embedding,
   topK: 10,
   filter: {
-    category:
+    category: 'electronics',
     price: { $lt: 1000 },
     inStock: true,
   },
-})
+})
 
 // Array operations
 const results = await pgVector.query({
-  indexName:
+  indexName: 'embeddings',
   queryVector: embedding,
   topK: 10,
   filter: {
-    tags: { $in: [
+    tags: { $in: ['sale', 'new'] },
  },
-})
+})
 
 // Logical operators
 const results = await pgVector.query({
-  indexName:
+  indexName: 'embeddings',
   queryVector: embedding,
   topK: 10,
   filter: {
-    $or: [{ category:
+    $or: [{ category: 'electronics' }, { category: 'accessories' }],
     $and: [{ price: { $gt: 50 } }, { price: { $lt: 200 } }],
   },
-})
+})
 ```
 
 Common use cases for metadata filtering:
````
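The MongoDB-style filter syntax in the hunk above (`$gt`, `$lt`, `$in`, `$or`, `$and`, plus plain equality) can be made concrete with a small in-memory matcher. This is a sketch of the operator semantics only; real stores compile such filters to native queries, and `matches` is an invented helper, not part of Mastra:

```typescript
// Sketch of how a MongoDB-style metadata filter can be evaluated in memory.
// Illustrative only - supports equality, $gt, $lt, $in, $or, $and.
type Meta = Record<string, any>;
type Filter = Record<string, any>;

function matches(meta: Meta, filter: Filter): boolean {
  return Object.entries(filter).every(([key, cond]) => {
    if (key === '$or') return (cond as Filter[]).some((f) => matches(meta, f));
    if (key === '$and') return (cond as Filter[]).every((f) => matches(meta, f));
    const value = meta[key];
    if (cond !== null && typeof cond === 'object' && !Array.isArray(cond)) {
      // Operator object like { $gt: 100 } or { $in: [...] }
      return Object.entries(cond).every(([op, operand]) => {
        switch (op) {
          case '$gt': return value > (operand as number);
          case '$lt': return value < (operand as number);
          case '$in': return (operand as any[]).includes(value);
          default: throw new Error(`unsupported operator ${op}`);
        }
      });
    }
    return value === cond; // simple equality
  });
}

const meta = { category: 'electronics', price: 120, tags: 'sale', inStock: true };
console.log(matches(meta, { category: 'electronics', price: { $lt: 1000 }, inStock: true })); // true
console.log(matches(meta, { $or: [{ category: 'accessories' }, { category: 'electronics' }] })); // true
console.log(matches(meta, { price: { $gt: 500 } })); // false
```

Multiple keys at the top level combine with implicit AND, which is why the "Multiple conditions" example in the diff needs no explicit `$and`.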
````diff
@@ -146,14 +144,14 @@ Common use cases for metadata filtering:
 Sometimes you want to give your agent the ability to query a vector database directly. The Vector Query Tool allows your agent to be in charge of retrieval decisions, combining semantic search with optional filtering and reranking based on the agent's understanding of the user's needs.
 
 ```ts
-import { createVectorQueryTool } from
-import { ModelRouterEmbeddingModel } from
+import { createVectorQueryTool } from '@mastra/rag'
+import { ModelRouterEmbeddingModel } from '@mastra/core/llm'
 
 const vectorQueryTool = createVectorQueryTool({
-  vectorStoreName:
-  indexName:
-  model: new ModelRouterEmbeddingModel(
-})
+  vectorStoreName: 'pgVector',
+  indexName: 'embeddings',
+  model: new ModelRouterEmbeddingModel('openai/text-embedding-3-small'),
+})
 ```
 
 When creating the tool, pay special attention to the tool's name and description - these help the agent understand when and how to use the retrieval capabilities. For example, you might name it "SearchKnowledgeBase" and describe it as "Search through our documentation to find relevant information about X topic."
````
````diff
@@ -168,32 +166,31 @@ This is particularly useful when:
 
 The Vector Query Tool supports database-specific configurations that enable you to leverage unique features and optimizations of different vector stores.
 
-> **Note:**
-
-
-Connection credentials (URLs, auth tokens) are configured when you instantiate the vector store class (e.g., `new LibSQLVector({ connectionUrl: '...' })`).
+> **Note:** These configurations are for **query-time options** like namespaces, performance tuning, and filtering—not for database connection setup.
+>
+> Connection credentials (URLs, auth tokens) are configured when you instantiate the vector store class (e.g., `new LibSQLVector({ url: '...' })`).
 
 ```ts
-import { createVectorQueryTool } from
-import { ModelRouterEmbeddingModel } from
+import { createVectorQueryTool } from '@mastra/rag'
+import { ModelRouterEmbeddingModel } from '@mastra/core/llm'
 
 // Pinecone with namespace
 const pineconeQueryTool = createVectorQueryTool({
-  vectorStoreName:
-  indexName:
-  model: new ModelRouterEmbeddingModel(
+  vectorStoreName: 'pinecone',
+  indexName: 'docs',
+  model: new ModelRouterEmbeddingModel('openai/text-embedding-3-small'),
   databaseConfig: {
     pinecone: {
-      namespace:
+      namespace: 'production', // Isolate data by environment
     },
   },
-})
+})
 
 // pgVector with performance tuning
 const pgVectorQueryTool = createVectorQueryTool({
-  vectorStoreName:
-  indexName:
-  model: new ModelRouterEmbeddingModel(
+  vectorStoreName: 'postgres',
+  indexName: 'embeddings',
+  model: new ModelRouterEmbeddingModel('openai/text-embedding-3-small'),
   databaseConfig: {
     pgvector: {
       minScore: 0.7, // Filter low-quality results
````
````diff
@@ -201,33 +198,33 @@ const pgVectorQueryTool = createVectorQueryTool({
       probes: 10, // IVFFlat probe parameter
     },
   },
-})
+})
 
 // Chroma with advanced filtering
 const chromaQueryTool = createVectorQueryTool({
-  vectorStoreName:
-  indexName:
-  model: new ModelRouterEmbeddingModel(
+  vectorStoreName: 'chroma',
+  indexName: 'documents',
+  model: new ModelRouterEmbeddingModel('openai/text-embedding-3-small'),
   databaseConfig: {
     chroma: {
-      where: { category:
-      whereDocument: { $contains:
+      where: { category: 'technical' },
+      whereDocument: { $contains: 'API' },
     },
   },
-})
+})
 
 // LanceDB with table specificity
 const lanceQueryTool = createVectorQueryTool({
-  vectorStoreName:
-  indexName:
-  model: new ModelRouterEmbeddingModel(
+  vectorStoreName: 'lance',
+  indexName: 'documents',
+  model: new ModelRouterEmbeddingModel('openai/text-embedding-3-small'),
   databaseConfig: {
     lance: {
-      tableName:
+      tableName: 'myVectors', // Specify which table to query
       includeAllColumns: true, // Include all metadata columns in results
     },
   },
-})
+})
 ```
 
 **Key Benefits:**
````
````diff
@@ -249,238 +246,211 @@ const lanceQueryTool = createVectorQueryTool({
 You can also override these configurations at runtime using the request context:
 
 ```ts
-import { RequestContext } from
+import { RequestContext } from '@mastra/core/request-context'
 
-const requestContext = new RequestContext()
-requestContext.set(
+const requestContext = new RequestContext()
+requestContext.set('databaseConfig', {
   pinecone: {
-    namespace:
+    namespace: 'runtime-namespace',
   },
-})
+})
 
-await pineconeQueryTool.execute({
-  context: { queryText: "search query" },
-  mastra,
-  requestContext,
-});
+await pineconeQueryTool.execute({ queryText: 'search query' }, { mastra, requestContext })
 ```
 
-For detailed configuration options and advanced usage, see the [Vector Query Tool Reference](https://mastra.ai/reference/
+For detailed configuration options and advanced usage, see the [Vector Query Tool Reference](https://mastra.ai/reference/tools/vector-query-tool).
 
 ### Vector Store Prompts
 
-Vector store prompts define query patterns and filtering capabilities for each vector database implementation.
-When implementing filtering, these prompts are required in the agent's instructions to specify valid operators and syntax for each vector store implementation.
+Vector store prompts define query patterns and filtering capabilities for each vector database implementation. When implementing filtering, these prompts are required in the agent's instructions to specify valid operators and syntax for each vector store implementation.
 
-
+**pgVector**:
 
 ```ts
-import { PGVECTOR_PROMPT } from
+import { PGVECTOR_PROMPT } from '@mastra/pg'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${PGVECTOR_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
-
-**pinecone:**
+**Pinecone**:
 
-```ts
-import { PINECONE_PROMPT } from
+```ts
+import { PINECONE_PROMPT } from '@mastra/pinecone'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${PINECONE_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
-
-**qdrant:**
+**Qdrant**:
 
-```ts
-import { QDRANT_PROMPT } from
+```ts
+import { QDRANT_PROMPT } from '@mastra/qdrant'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${QDRANT_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
+**Chroma**:
 
-
-
-```ts title="vector-store.ts"
-import { CHROMA_PROMPT } from "@mastra/chroma";
+```ts
+import { CHROMA_PROMPT } from '@mastra/chroma'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${CHROMA_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
-
-**astra:**
+**Astra**:
 
-```ts
-import { ASTRA_PROMPT } from
+```ts
+import { ASTRA_PROMPT } from '@mastra/astra'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${ASTRA_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
-
-**libsql:**
+**libSQL**:
 
-```ts
-import { LIBSQL_PROMPT } from
+```ts
+import { LIBSQL_PROMPT } from '@mastra/libsql'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${LIBSQL_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
+**Upstash**:
 
-
-
-```ts title="vector-store.ts"
-import { UPSTASH_PROMPT } from "@mastra/upstash";
+```ts
+import { UPSTASH_PROMPT } from '@mastra/upstash'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${UPSTASH_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
-
-**vectorize:**
+**Vectorize**:
 
-```ts
-import { VECTORIZE_PROMPT } from
+```ts
+import { VECTORIZE_PROMPT } from '@mastra/vectorize'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${VECTORIZE_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
-
-**mongodb:**
+**MongoDB**:
 
-```ts
-import { MONGODB_PROMPT } from
+```ts
+import { MONGODB_PROMPT } from '@mastra/mongodb'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${MONGODB_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
+**OpenSearch**:
 
-
-
-```ts title="vector-store.ts"
-import { OPENSEARCH_PROMPT } from "@mastra/opensearch";
+```ts
+import { OPENSEARCH_PROMPT } from '@mastra/opensearch'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${OPENSEARCH_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
-
-**s3vectors:**
+**S3Vectors**:
 
-```ts
-import { S3VECTORS_PROMPT } from
+```ts
+import { S3VECTORS_PROMPT } from '@mastra/s3vectors'
 
 export const ragAgent = new Agent({
-  id:
-  name:
-  model:
+  id: 'rag-agent',
+  name: 'RAG Agent',
+  model: 'openai/gpt-5.1',
   instructions: `
     Process queries using the provided context. Structure responses to be concise and relevant.
     ${S3VECTORS_PROMPT}
   `,
   tools: { vectorQueryTool },
-})
+})
 ```
 
-
-
 ### Re-ranking
 
 Initial vector similarity search can sometimes miss nuanced relevance. Re-ranking is a more computationally expensive process, but more accurate algorithm that improves results by:
````
````diff
@@ -492,20 +462,17 @@ Initial vector similarity search can sometimes miss nuanced relevance. Re-rankin
 Here's how to use re-ranking:
 
 ```ts
-import {
-  rerankWithScorer as rerank,
-  MastraAgentRelevanceScorer
-} from "@mastra/rag";
+import { rerankWithScorer as rerank, MastraAgentRelevanceScorer } from '@mastra/rag'
 
 // Get initial results from vector search
 const initialResults = await pgVector.query({
-  indexName:
+  indexName: 'embeddings',
   queryVector: queryEmbedding,
   topK: 10,
-})
+})
 
 // Create a relevance scorer
-const relevanceProvider = new MastraAgentRelevanceScorer('relevance-scorer',
+const relevanceProvider = new MastraAgentRelevanceScorer('relevance-scorer', 'openai/gpt-5.1')
 
 // Re-rank the results
 const rerankedResults = await rerank({
````
````diff
@@ -520,7 +487,7 @@ const rerankedResults = await rerank({
   },
   topK: 10,
 },
-)
+})
 ```
 
 The weights control how different factors influence the final ranking:
````
````diff
@@ -529,21 +496,20 @@ The weights control how different factors influence the final ranking:
 - `vector`: Higher values favor the original vector similarity scores
 - `position`: Higher values help maintain the original ordering of results
 
-> **Note:**
-For semantic scoring to work properly during re-ranking, each result must include the text content in its `metadata.text` field.
+> **Note:** For semantic scoring to work properly during re-ranking, each result must include the text content in its `metadata.text` field.
 
 You can also use other relevance score providers like Cohere or ZeroEntropy:
 
 ```ts
-const relevanceProvider = new CohereRelevanceScorer(
+const relevanceProvider = new CohereRelevanceScorer('rerank-v3.5')
 ```
 
 ```ts
-const relevanceProvider = new ZeroEntropyRelevanceScorer(
+const relevanceProvider = new ZeroEntropyRelevanceScorer('zerank-1')
 ```
 
 The re-ranked results combine vector similarity with semantic understanding to improve retrieval quality.
 
-For more details about re-ranking, see the [rerank()](https://mastra.ai/reference/
+For more details about re-ranking, see the [rerank()](https://mastra.ai/reference/rag/rerankWithScorer) method.
 
-For graph-based retrieval that follows connections between chunks, see the [GraphRAG](https://mastra.ai/docs/
+For graph-based retrieval that follows connections between chunks, see the [GraphRAG](https://mastra.ai/docs/rag/graph-rag) documentation.
````
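The `semantic`/`vector`/`position` weights documented in the re-ranking section blend three signals into one ranking score. The linear blend below is an assumed formula for illustration only; `Candidate` and `rerankScore` are invented names, and Mastra's actual implementation may combine the signals differently:

```typescript
// Sketch of weighted re-ranking: blend semantic, vector, and position
// signals into one score. The linear combination is an assumption for
// illustration, not necessarily Mastra's exact formula.
type Candidate = { text: string; vectorScore: number; semanticScore: number; position: number };

function rerankScore(
  c: Candidate,
  total: number,
  weights = { semantic: 0.4, vector: 0.4, position: 0.2 },
): number {
  // Earlier positions map closer to 1, so a high position weight
  // preserves the original ordering.
  const positionScore = 1 - c.position / Math.max(total - 1, 1);
  return (
    weights.semantic * c.semanticScore +
    weights.vector * c.vectorScore +
    weights.position * positionScore
  );
}

const candidates: Candidate[] = [
  { text: 'a', vectorScore: 0.9, semanticScore: 0.2, position: 0 },
  { text: 'b', vectorScore: 0.7, semanticScore: 0.95, position: 1 },
];
const ranked = [...candidates]
  .map((c) => ({ text: c.text, score: rerankScore(c, candidates.length) }))
  .sort((x, y) => y.score - x.score);
console.log(ranked.map((r) => r.text)); // candidate with the best blended score first
```

With these weights, a candidate with strong semantic relevance can overtake one with a higher raw vector score, which is the behavior the weight bullets describe.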