npm - @afterxleep/doc-bot - Versions diffs - 1.0.1 → 1.1.0 - Mend

@afterxleep/doc-bot 1.0.1 → 1.1.0


Files changed (7)

package/README.md +90 -27
package/bin/doc-bot.js +10 -10
package/package.json +1 -1
package/src/services/DocumentIndex.js +388 -0
package/src/services/InferenceEngine.js +43 -2
package/src/services/__tests__/DocumentIndex.test.js +807 -0
package/src/services/__tests__/InferenceEngine.integration.test.js +300 -0

package/README.md CHANGED

@@ -3,36 +3,39 @@
 [![npm version](https://img.shields.io/npm/v/@afterxleep/doc-bot)](https://www.npmjs.com/package/@afterxleep/doc-bot)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
-A generic MCP (Model Context Protocol) server that provides intelligent documentation access for any project. Works with AI agents like Claude Code, Cursor, and any other MCP-compatible tools.
+A generic MCP (Model Context Protocol) server that provides intelligent documentation access for any project. Works with any MCP-compatible AI tools and IDEs.
 ## What is doc-bot?
 doc-bot is an intelligent documentation server that:
 - 🔍 **Searches** your project documentation instantly
-- 🧠 **Infers** relevant docs based on your current work
+- 🧠 **Auto-indexes** content for smart inference (no manual keyword mapping!)
 - 📋 **Applies** global rules to every AI interaction
 - 🎯 **Suggests** contextual documentation based on file patterns
+- 🤖 **Detects** code patterns, frameworks, and keywords automatically
 - 🔄 **Updates** automatically when docs change
 ## Installation
-Add doc-bot to your Claude Code MCP configuration:
+1. **Create your documentation folder** in your project root (see organization section below)
-1. **Add to Claude Code configuration** (`~/Library/Application Support/Claude/claude_desktop_config.json`):
+2. **Add doc-bot to your MCP-compatible AI tool configuration**:
+   **For Claude Code** (`~/Library/Application Support/Claude/claude_desktop_config.json`):
    ```json
    {
      "mcpServers": {
        "docs": {
          "command": "npx",
-         "args": ["@afterxleep/doc-bot"]
+         "args": ["@afterxleep/doc-bot", "--docs", "./doc-bot"]
        }
      }
    }
    ```
-2. **Restart Claude Code**
+   **For Cursor or other MCP tools**: Add similar configuration pointing to your documentation folder
-3. **Create your documentation folder** (if it doesn't exist, doc-bot will show you how when you first use it)
+3. **Restart your AI tool**
 ## How to organize your documentation
@@ -53,9 +56,9 @@ your-project/
 └── package.json
 ```
-**Note:** You can use a different folder name by passing the `--docs` parameter:
-```bash
-npx @afterxleep/doc-bot --docs ./my-custom-docs
+**Note:** You can use any folder name - just specify it in your MCP configuration:
+```json
+"args": ["@afterxleep/doc-bot", "--docs", "./my-custom-docs"]
 ```
 ### Documentation types:
@@ -64,6 +67,48 @@ npx @afterxleep/doc-bot --docs ./my-custom-docs
 - **Guides** (`guides/`): Step-by-step instructions for specific tasks
 - **Reference** (`reference/`): Quick lookups and troubleshooting
+### Example documentation files:
+**Global Rule Example** (`doc-bot/core/coding-standards.md`):
+```markdown
+---
+title: "Coding Standards"
+description: "Core coding standards that apply to all code"
+keywords: ["code-quality", "standards", "best-practices"]
+tags: ["core", "quality"]
+---
+# Coding Standards
+- Use 2 spaces for indentation
+- Maximum line length: 100 characters
+- Always use const/let, never var
+- Prefer async/await over promises
+- Write descriptive variable names
+```
+**Contextual Rule Example** (`doc-bot/guides/testing.md`):
+```markdown
+---
+title: "Testing Guide"
+description: "How to write and run tests"
+keywords: ["testing", "jest", "tdd", "unit-tests"]
+tags: ["testing", "quality"]
+---
+# Testing Guide
+All test files should:
+- Use describe/it blocks for organization
+- Include both positive and negative test cases
+- Mock external dependencies
+- Aim for 80%+ code coverage
+Run tests with: `npm test`
+```
+**👀 See `examples/` folder for complete example files with proper frontmatter and content structure.**
 ## The manifest file
 The `doc-bot/manifest.json` file controls how your documentation works:
@@ -82,18 +127,6 @@ The `doc-bot/manifest.json` file controls how your documentation works:
     "*.spec.js": ["guides/testing.md"],
     "src/components/*": ["guides/react-components.md"],
     "src/api/*": ["guides/api-development.md"]
-  },
-  "inference": {
-    "keywords": {
-      "testing": ["guides/testing.md"],
-      "deployment": ["guides/deployment.md"],
-      "api": ["guides/api-development.md"]
-    },
-    "patterns": {
-      "describe(": ["guides/testing.md"],
-      "it(": ["guides/testing.md"],
-      "fetch(": ["guides/api-development.md"]
-    }
   }
 }
 ```
@@ -102,8 +135,38 @@ The `doc-bot/manifest.json` file controls how your documentation works:
 - **`globalRules`**: Documents that apply to every AI interaction
 - **`contextualRules`**: Documents triggered by specific file patterns (e.g., test files → testing guide)
-- **`inference.keywords`**: Documents suggested when certain words appear in queries
-- **`inference.patterns`**: Documents suggested when certain code patterns are detected
+### 🎯 Automatic Inference (New!)
+doc-bot automatically analyzes your documentation content to build smart indexes. No more manual keyword mappings! It automatically:
+- **Extracts keywords** from document metadata (frontmatter)
+- **Detects technical terms** in your documentation content
+- **Recognizes code patterns** in code blocks (React hooks, SQL commands, etc.)
+- **Identifies frameworks** mentioned in your docs
+- **Indexes file extensions** referenced in documentation
+Just write good documentation with descriptive frontmatter, and doc-bot handles the rest!
+### Writing documentation for best results
+To maximize the automatic inference capabilities, include frontmatter in your markdown files:
+```markdown
+---
+title: "React Component Guidelines"
+description: "Best practices for building React components"
+keywords: ["react", "components", "hooks", "jsx"]
+tags: ["frontend", "development"]
+category: "guides"
+---
+# React Component Guidelines
+Your documentation content here...
+```
+The automatic indexing will use this metadata along with analyzing your content to provide intelligent suggestions.
 ## Development setup
@@ -137,7 +200,7 @@ The `doc-bot/manifest.json` file controls how your documentation works:
 ### Testing your setup
-Ask Claude something like "What documentation is available?" to test that doc-bot is working.
+Ask your AI assistant something like "What documentation is available?" to test that doc-bot is working.
 ### CLI Options
@@ -145,9 +208,9 @@ Ask Claude something like "What documentation is available?" to test that doc-bo
 doc-bot [options]
 Options:
+  -d, --docs <path>        Path to docs folder (required)
+  -c, --config <path>      Path to manifest file (default: <docs-path>/manifest.json)
   -p, --port <port>        Port to run server on (default: 3000)
-  -d, --docs <path>        Path to docs folder (default: ./doc-bot)
-  -c, --config <path>      Path to manifest file (default: ./doc-bot/manifest.json)
   -v, --verbose           Enable verbose logging
   -w, --watch             Watch for file changes
   -h, --help              Show help
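
The README bullet "Indexes file extensions referenced in documentation" appears to correspond to `extractFileExtensions` in `DocumentIndex.js`, which uses the regex `/\*\.(\w+)\b/g`. The sketch below reuses that regex in a standalone helper to show what it captures. The helper name `listExtensions` and the sample strings are assumptions made for illustration and are not part of the package.

```javascript
// Same regex as extractFileExtensions in DocumentIndex.js.
const extensionRegex = /\*\.(\w+)\b/g;

// Hypothetical helper: collect every captured extension from a string.
function listExtensions(content) {
  return Array.from(content.matchAll(extensionRegex), (m) => m[1].toLowerCase());
}

const simple = listExtensions('Applies to *.js and *.TS files');
const compound = listExtensions('Test files match *.spec.js');
const noGlob = listExtensions('See index.js for details');

console.log(simple);   // [ 'js', 'ts' ]
console.log(compound); // [ 'spec' ]
console.log(noGlob);   // []
```

Under this regex only glob-style mentions (`*.ext`) are picked up, and a compound pattern such as `*.spec.js` yields `spec` rather than `js`. Documentation that should match `.js` files by extension would therefore probably need to mention `*.js` explicitly.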

package/bin/doc-bot.js CHANGED

@@ -8,10 +8,10 @@ const { DocsServer } = require('../src/index.js');
 program
   .name('doc-bot')
   .description('Generic MCP server for intelligent documentation access')
-  .version('1.0.1')
+  .version('1.0.2')
   .option('-p, --port <port>', 'Port to run server on', '3000')
-  .option('-d, --docs <path>', 'Path to docs folder', './doc-bot')
-  .option('-c, --config <path>', 'Path to manifest file', './doc-bot/manifest.json')
+  .requiredOption('-d, --docs <path>', 'Path to docs folder')
+  .option('-c, --config <path>', 'Path to manifest file')
   .option('-v, --verbose', 'Enable verbose logging')
   .option('-w, --watch', 'Watch for file changes')
   .parse();
@@ -20,19 +20,19 @@ const options = program.opts();
 async function main() {
   const docsPath = path.resolve(options.docs);
-  const configPath = path.resolve(options.config);
+  const configPath = options.config ? path.resolve(options.config) : path.resolve(options.docs, 'manifest.json');
-  // Check if doc-bot folder exists
+  // Check if documentation folder exists
   if (!await fs.pathExists(docsPath)) {
     console.error(`❌ Documentation folder not found: ${docsPath}`);
     console.log('');
-    console.log('📖 To get started, create a doc-bot folder in your project:');
+    console.log('📖 To get started, create your documentation folder:');
     console.log('');
-    console.log('  mkdir doc-bot');
-    console.log('  echo \'{"name": "My Project Documentation", "globalRules": []}\' > doc-bot/manifest.json');
-    console.log('  echo "# Getting Started" > doc-bot/README.md');
+    console.log(`  mkdir ${path.basename(docsPath)}`);
+    console.log(`  echo '{"name": "My Project Documentation", "globalRules": []}' > ${path.basename(docsPath)}/manifest.json`);
+    console.log(`  echo "# Getting Started" > ${path.basename(docsPath)}/README.md`);
     console.log('');
-    console.log('Then run: npx @afterxleep/doc-bot');
+    console.log('Then configure your MCP client to use this folder.');
     process.exit(1);
   }
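
The hunk above makes `--docs` required and derives the manifest location from it when `--config` is omitted. A minimal sketch of that fallback, isolated from the CLI, is shown below. The function name `resolveConfigPath` and the sample paths are hypothetical; only the ternary mirrors the changed line in `main()`.

```javascript
const path = require('path');

// Hypothetical helper mirroring the fallback in main(): an explicit --config
// wins; otherwise manifest.json is looked up inside the docs folder.
function resolveConfigPath(options) {
  return options.config
    ? path.resolve(options.config)
    : path.resolve(options.docs, 'manifest.json');
}

const byDefault = resolveConfigPath({ docs: './my-custom-docs' });
const explicit = resolveConfigPath({
  docs: './my-custom-docs',
  config: './other/manifest.json'
});

// The "folder not found" hint prints path.basename(docsPath), so a nested
// docs path is reduced to its last segment in the suggested commands.
const suggestedFolder = path.basename(path.resolve('./docs/ai'));

console.log(byDefault);       // <cwd>/my-custom-docs/manifest.json
console.log(explicit);        // <cwd>/other/manifest.json
console.log(suggestedFolder); // ai
```

One consequence worth keeping in mind: for a nested path such as `./docs/ai`, the printed `mkdir` hint would name only `ai`, so the suggested commands create the folder relative to the current directory rather than at the configured path.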

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "@afterxleep/doc-bot",
-  "version": "1.0.1",
+  "version": "1.1.0",
   "description": "Generic MCP server for intelligent documentation access in any project",
   "main": "src/index.js",
   "bin": {

package/src/services/DocumentIndex.js ADDED

@@ -0,0 +1,388 @@
+class DocumentIndex {
+  constructor() {
+    this.keywordIndex = new Map();
+    this.topicIndex = new Map();
+    this.patternIndex = new Map();
+    this.extensionIndex = new Map();
+  }
+  async buildIndexes(documents) {
+    for (const document of documents) {
+      await this.indexDocument(document);
+    }
+  }
+  async indexDocument(document) {
+    if (!document) {
+      return;
+    }
+    // Index keywords from metadata (if present)
+    if (document.metadata?.keywords) {
+      const keywords = Array.isArray(document.metadata.keywords)
+        ? document.metadata.keywords
+        : [document.metadata.keywords];
+      for (const keyword of keywords) {
+        this.addToIndex(this.keywordIndex, keyword.toLowerCase(), document, 10);
+      }
+    }
+    // Index topics from tags and category (if present)
+    if (document.metadata?.tags) {
+      const tags = Array.isArray(document.metadata.tags)
+        ? document.metadata.tags
+        : [document.metadata.tags];
+      for (const tag of tags) {
+        this.addToIndex(this.topicIndex, tag.toLowerCase(), document, 5);
+      }
+    }
+    if (document.metadata?.category) {
+      this.addToIndex(this.topicIndex, document.metadata.category.toLowerCase(), document, 5);
+    }
+    // Index content keywords
+    if (document.content) {
+      await this.indexContentKeywords(document);
+    }
+  }
+  async indexContentKeywords(document) {
+    const content = document.content;
+    // Extract keywords from code blocks
+    this.extractCodeBlockKeywords(content, document);
+    // Extract keywords from headings
+    this.extractHeadingKeywords(content, document);
+    // Extract file extensions
+    this.extractFileExtensions(content, document);
+    // Extract framework and library names
+    this.extractFrameworkNames(content, document);
+    // Extract code patterns
+    this.extractCodePatterns(content, document);
+  }
+  extractCodeBlockKeywords(content, document) {
+    // Match code blocks with language specifiers
+    const codeBlockRegex = /```(\w+)?\n([\s\S]*?)```/g;
+    let match;
+    while ((match = codeBlockRegex.exec(content)) !== null) {
+      const codeContent = match[2];
+      // Extract common library/framework names from code
+      const patterns = [
+        /require\(['"]([^'"]+)['"]\)/g,
+        /import\s+(\w+)/g,
+        /from\s+['"]([^'"]+)['"]/g,
+        /\b(express|mongoose|bodyParser|flask|sqlalchemy|react|vue|angular|django|fastapi|axios|lodash|moment|uuid)\b/gi
+      ];
+      for (const pattern of patterns) {
+        let patternMatch;
+        while ((patternMatch = pattern.exec(codeContent)) !== null) {
+          const keyword = patternMatch[1]?.toLowerCase() || patternMatch[0]?.toLowerCase();
+          if (keyword && !this.isCommonWord(keyword)) {
+            this.addToIndex(this.keywordIndex, keyword, document, 3); // Lower score for content keywords
+          }
+        }
+      }
+    }
+  }
+  extractHeadingKeywords(content, document) {
+    // Extract from markdown headings
+    const headingRegex = /^#{1,6}\s+(.+)$/gm;
+    let match;
+    while ((match = headingRegex.exec(content)) !== null) {
+      const heading = match[1];
+      const words = heading.split(/\s+/);
+      for (const word of words) {
+        const cleanWord = word.toLowerCase().replace(/[^\w\-\/]/g, '');
+        if (cleanWord && !this.isCommonWord(cleanWord)) {
+          this.addToIndex(this.keywordIndex, cleanWord, document, 2); // Lower score for content keywords
+        }
+      }
+    }
+  }
+  extractFileExtensions(content, document) {
+    // Extract file extensions mentioned in content
+    const extensionRegex = /\*\.(\w+)\b/g;
+    let match;
+    while ((match = extensionRegex.exec(content)) !== null) {
+      const extension = match[1].toLowerCase();
+      this.addToIndex(this.extensionIndex, extension, document);
+    }
+  }
+  extractFrameworkNames(content, document) {
+    // Common framework and technology names
+    const techPatterns = [
+      /\b(react|vue|angular|svelte|next\.js|nuxt\.js|gatsby)\b/gi,
+      /\b(node\.js|express|fastify|koa|nest\.js)\b/gi,
+      /\b(postgresql|mysql|mongodb|redis|elasticsearch)\b/gi,
+      /\b(docker|kubernetes|terraform|ansible)\b/gi,
+      /\b(aws|azure|gcp|heroku|vercel|netlify)\b/gi,
+      /\b(typescript|javascript|python|java|golang|rust)\b/gi
+    ];
+    for (const pattern of techPatterns) {
+      let match;
+      while ((match = pattern.exec(content)) !== null) {
+        const keyword = match[0].toLowerCase();
+        if (!this.isCommonWord(keyword)) {
+          this.addToIndex(this.keywordIndex, keyword, document, 2); // Lower score for content keywords
+        }
+      }
+    }
+  }
+  extractCodePatterns(content, document) {
+    // Match code blocks with language specifiers
+    const codeBlockRegex = /```(\w+)?\n([\s\S]*?)```/g;
+    let match;
+    while ((match = codeBlockRegex.exec(content)) !== null) {
+      const language = match[1]?.toLowerCase();
+      const codeContent = match[2];
+      // Define patterns for different languages
+      const patterns = this.getCodePatterns(language);
+      for (const pattern of patterns) {
+        let patternMatch;
+        while ((patternMatch = pattern.regex.exec(codeContent)) !== null) {
+          const patternKey = pattern.key || patternMatch[0];
+          this.addToIndex(this.patternIndex, patternKey, document, 6); // Medium-high score for patterns
+        }
+      }
+    }
+  }
+  getCodePatterns(language) {
+    const patterns = [];
+    // JavaScript/TypeScript patterns
+    if (!language || language === 'javascript' || language === 'js' || language === 'typescript' || language === 'ts') {
+      patterns.push(
+        { regex: /\buseState\b/g, key: 'useState' },
+        { regex: /\buseEffect\b/g, key: 'useEffect' },
+        { regex: /\buseCallback\b/g, key: 'useCallback' },
+        { regex: /\buseMemo\b/g, key: 'useMemo' },
+        { regex: /\buseContext\b/g, key: 'useContext' },
+        { regex: /\buseReducer\b/g, key: 'useReducer' },
+        { regex: /app\.get\(/g, key: 'app.get' },
+        { regex: /app\.post\(/g, key: 'app.post' },
+        { regex: /app\.put\(/g, key: 'app.put' },
+        { regex: /app\.delete\(/g, key: 'app.delete' },
+        { regex: /describe\(/g, key: 'describe(' },
+        { regex: /it\(/g, key: 'it(' },
+        { regex: /test\(/g, key: 'test(' },
+        { regex: /expect\(/g, key: 'expect(' },
+        { regex: /async\s+function/g, key: 'async function' },
+        { regex: /\.then\(/g, key: '.then(' },
+        { regex: /\.catch\(/g, key: '.catch(' },
+        { regex: /await\s+/g, key: 'await' }
+      );
+    }
+    // Python patterns
+    if (language === 'python' || language === 'py') {
+      patterns.push(
+        { regex: /\bdef\s+/g, key: 'def ' },
+        { regex: /\bclass\s+/g, key: 'class ' },
+        { regex: /\b__init__\b/g, key: '__init__' },
+        { regex: /\bif\s+__name__\s*==\s*['"]__main__['"]/g, key: 'if __name__' },
+        { regex: /\bimport\s+/g, key: 'import ' },
+        { regex: /\bfrom\s+\w+\s+import/g, key: 'from import' },
+        { regex: /\btry:/g, key: 'try:' },
+        { regex: /\bexcept\s+/g, key: 'except ' },
+        { regex: /\bwith\s+/g, key: 'with ' },
+        { regex: /@\w+/g, key: 'decorator' }
+      );
+    }
+    // SQL patterns
+    if (language === 'sql') {
+      patterns.push(
+        { regex: /\bSELECT\b/gi, key: 'SELECT' },
+        { regex: /\bINSERT\s+INTO\b/gi, key: 'INSERT INTO' },
+        { regex: /\bUPDATE\b/gi, key: 'UPDATE' },
+        { regex: /\bDELETE\s+FROM\b/gi, key: 'DELETE FROM' },
+        { regex: /\bCREATE\s+TABLE\b/gi, key: 'CREATE TABLE' },
+        { regex: /\bALTER\s+TABLE\b/gi, key: 'ALTER TABLE' },
+        { regex: /\bDROP\s+TABLE\b/gi, key: 'DROP TABLE' },
+        { regex: /\bJOIN\b/gi, key: 'JOIN' },
+        { regex: /\bLEFT\s+JOIN\b/gi, key: 'LEFT JOIN' },
+        { regex: /\bINNER\s+JOIN\b/gi, key: 'INNER JOIN' }
+      );
+    }
+    // Java patterns
+    if (language === 'java') {
+      patterns.push(
+        { regex: /\bpublic\s+class\b/g, key: 'public class' },
+        { regex: /\bprivate\s+\w+/g, key: 'private' },
+        { regex: /\bpublic\s+static\s+void\s+main/g, key: 'main method' },
+        { regex: /@Override/g, key: '@Override' },
+        { regex: /\bnew\s+\w+\(/g, key: 'new' }
+      );
+    }
+    // Docker patterns
+    if (language === 'dockerfile' || language === 'docker') {
+      patterns.push(
+        { regex: /\bFROM\b/gi, key: 'FROM' },
+        { regex: /\bRUN\b/gi, key: 'RUN' },
+        { regex: /\bCOPY\b/gi, key: 'COPY' },
+        { regex: /\bADD\b/gi, key: 'ADD' },
+        { regex: /\bEXPOSE\b/gi, key: 'EXPOSE' },
+        { regex: /\bCMD\b/gi, key: 'CMD' },
+        { regex: /\bENTRYPOINT\b/gi, key: 'ENTRYPOINT' }
+      );
+    }
+    return patterns;
+  }
+  isCommonWord(word) {
+    const commonWords = new Set([
+      'the', 'a', 'an', 'and', 'or', 'but', 'in', 'on', 'at', 'to', 'for', 'of', 'with', 'by',
+      'is', 'are', 'was', 'were', 'be', 'been', 'being', 'have', 'has', 'had', 'do', 'does', 'did',
+      'will', 'would', 'could', 'should', 'may', 'might', 'can', 'must', 'shall',
+      'this', 'that', 'these', 'those', 'i', 'you', 'he', 'she', 'it', 'we', 'they',
+      'me', 'him', 'her', 'us', 'them', 'my', 'your', 'his', 'her', 'its', 'our', 'their',
+      'about', 'above', 'across', 'after', 'against', 'along', 'among', 'around', 'before',
+      'behind', 'below', 'beneath', 'beside', 'between', 'beyond', 'during', 'except',
+      'from', 'inside', 'into', 'near', 'outside', 'over', 'since', 'through', 'under',
+      'until', 'up', 'upon', 'within', 'without', 'how', 'what', 'when', 'where', 'why',
+      'who', 'which', 'whose', 'whom', 'very', 'so', 'too', 'quite', 'rather', 'such',
+      'guide', 'documentation', 'helps', 'developers', 'system', 'useful', 'explains',
+      'use', 'using', 'used', 'get', 'getting', 'set', 'setting', 'make', 'making',
+      'create', 'creating', 'build', 'building', 'run', 'running', 'start', 'starting'
+    ]);
+    return commonWords.has(word.toLowerCase()) || word.length < 2;
+  }
+  addToIndex(index, key, document, score = 1) {
+    if (!index.has(key)) {
+      index.set(key, []);
+    }
+    index.get(key).push({ document, score });
+  }
+  findRelevantDocs(context) {
+    if (!context || Object.keys(context).length === 0) {
+      return [];
+    }
+    const candidates = new Map();
+    // Search by query keywords
+    if (context.query) {
+      this.searchKeywords(context.query, candidates);
+    }
+    // Search by code snippet patterns
+    if (context.codeSnippet) {
+      this.searchCodePatterns(context.codeSnippet, candidates);
+    }
+    // Search by file extension
+    if (context.filePath) {
+      this.searchFileExtension(context.filePath, candidates);
+    }
+    return this.scoreAndRank(candidates);
+  }
+  searchKeywords(query, candidates) {
+    const queryLower = query.toLowerCase();
+    const words = queryLower.split(/\s+/);
+    for (const word of words) {
+      // Search in keyword index
+      if (this.keywordIndex.has(word)) {
+        const entries = this.keywordIndex.get(word);
+        for (const entry of entries) {
+          this.addCandidate(candidates, entry.document, entry.score);
+        }
+      }
+      // Search in topic index
+      if (this.topicIndex.has(word)) {
+        const entries = this.topicIndex.get(word);
+        for (const entry of entries) {
+          this.addCandidate(candidates, entry.document, entry.score);
+        }
+      }
+    }
+  }
+  searchCodePatterns(codeSnippet, candidates) {
+    if (this.patternIndex.size > 0) {
+      // Search for patterns in the code snippet
+      for (const [pattern, entries] of this.patternIndex) {
+        // Check if the pattern exists in the code snippet
+        let found = false;
+        // For SQL patterns, do case-insensitive matching
+        if (pattern.toUpperCase() === pattern) {
+          found = codeSnippet.toUpperCase().includes(pattern);
+        } else {
+          found = codeSnippet.includes(pattern);
+        }
+        if (found) {
+          for (const entry of entries) {
+            this.addCandidate(candidates, entry.document, 8); // High score for pattern match
+          }
+        }
+      }
+    }
+  }
+  searchFileExtension(filePath, candidates) {
+    // For now, implement basic extension matching
+    // This will be enhanced in later iterations
+    if (this.extensionIndex.size > 0) {
+      const extension = filePath.split('.').pop()?.toLowerCase();
+      if (extension && this.extensionIndex.has(extension)) {
+        const entries = this.extensionIndex.get(extension);
+        for (const entry of entries) {
+          this.addCandidate(candidates, entry.document, 3); // Lower score for extension match
+        }
+      }
+    }
+  }
+  addCandidate(candidates, document, score) {
+    const key = document.fileName || document.filePath;
+    if (!candidates.has(key)) {
+      candidates.set(key, { document, score: 0 });
+    }
+    candidates.get(key).score += score;
+  }
+  scoreAndRank(candidates) {
+    const results = Array.from(candidates.values());
+    // Sort by score (descending)
+    results.sort((a, b) => b.score - a.score);
+    return results;
+  }
+}
+module.exports = { DocumentIndex };
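
To make the ranking behaviour of the new class easier to follow, here is a condensed, standalone sketch of how scores accumulate across `addToIndex`, `searchKeywords`, `addCandidate`, and `scoreAndRank`. It is not the packaged class. The `rank` function and the two sample documents are assumptions for illustration, while the weights (10 for frontmatter keywords, 5 for tags) are taken from the diff.

```javascript
// Condensed stand-ins for the keyword and topic indexes in DocumentIndex.
const keywordIndex = new Map();
const topicIndex = new Map();

// Same shape as addToIndex in the diff: each key maps to { document, score } entries.
function addToIndex(index, key, document, score = 1) {
  if (!index.has(key)) index.set(key, []);
  index.get(key).push({ document, score });
}

// Hypothetical documents, identified by fileName as in addCandidate.
const testing = { fileName: 'guides/testing.md' };
const standards = { fileName: 'core/coding-standards.md' };

addToIndex(keywordIndex, 'testing', testing, 10); // frontmatter keyword
addToIndex(keywordIndex, 'jest', testing, 10);    // frontmatter keyword
addToIndex(topicIndex, 'testing', testing, 5);    // tag
addToIndex(topicIndex, 'quality', testing, 5);    // tag
addToIndex(topicIndex, 'quality', standards, 5);  // tag

// Hypothetical wrapper combining searchKeywords, addCandidate and scoreAndRank:
// split the query on whitespace, sum every matching entry per document, sort descending.
function rank(query) {
  const candidates = new Map();
  for (const word of query.toLowerCase().split(/\s+/)) {
    for (const index of [keywordIndex, topicIndex]) {
      for (const { document, score } of index.get(word) ?? []) {
        const entry = candidates.get(document.fileName) ?? { document, score: 0 };
        entry.score += score;
        candidates.set(document.fileName, entry);
      }
    }
  }
  return [...candidates.values()].sort((a, b) => b.score - a.score);
}

const ranked = rank('jest testing quality');
console.log(ranked.map((r) => `${r.document.fileName}: ${r.score}`));
// [ 'guides/testing.md: 30', 'core/coding-standards.md: 5' ]
```

Because the query is split on whitespace only and looked up verbatim, a word with trailing punctuation (for example `testing?`) would not match the `testing` key in this sketch, and the same appears to hold for `searchKeywords` in the diff.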