npm - smart-coding-mcp - Versions diffs - 1.2.4 → 1.3.1 - Mend

smart-coding-mcp 1.2.4 → 1.3.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (21) hide show

package/README.md +28 -168
package/config.json +4 -3
package/example.png +0 -0
package/features/clear-cache.js +30 -7
package/features/index-codebase.js +507 -37
package/how-its-works.png +0 -0
package/index.js +2 -2
package/lib/cache.js +5 -0
package/lib/config.js +29 -4
package/lib/embedding-worker.js +67 -0
package/lib/tokenizer.js +142 -0
package/lib/utils.js +113 -25
package/package.json +9 -3
package/test/clear-cache.test.js +288 -0
package/test/embedding-model.test.js +230 -0
package/test/helpers.js +128 -0
package/test/hybrid-search.test.js +243 -0
package/test/index-codebase.test.js +246 -0
package/test/integration.test.js +223 -0
package/test/tokenizer.test.js +225 -0
package/vitest.config.js +29 -0

package/README.md CHANGED Viewed

@@ -1,6 +1,11 @@
 # Smart Coding MCP
-An extensible Model Context Protocol (MCP) server that provides intelligent semantic code search for AI assistants. Built with local AI models, inspired by Cursor's semantic search research.
+[![npm version](https://img.shields.io/npm/v/smart-coding-mcp.svg)](https://www.npmjs.com/package/smart-coding-mcp)
+[![npm downloads](https://img.shields.io/npm/dm/smart-coding-mcp.svg)](https://www.npmjs.com/package/smart-coding-mcp)
+[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://opensource.org/licenses/MIT)
+[![Node.js](https://img.shields.io/badge/Node.js-%3E%3D18-green.svg)](https://nodejs.org/)
+An extensible Model Context Protocol (MCP) server that provides intelligent semantic code search for AI assistants. Built with local AI models (RAG), inspired by Cursor's semantic search research.
 ## What This Does
@@ -8,6 +13,8 @@ AI coding assistants work better when they can find relevant code quickly. Tradi
 This MCP server solves that by indexing your codebase with AI embeddings. Your AI assistant can search by meaning instead of exact keywords, finding relevant code even when the terminology differs.
+![Example](example.png)
 ## Why Use This
 **Better Code Understanding**
@@ -36,6 +43,12 @@ Install globally via npm:
 npm install -g smart-coding-mcp
 ```
+To update to the latest version:
+```bash
+npm update -g smart-coding-mcp
+```
 ## Configuration
 Add to your MCP configuration file. The location depends on your IDE and OS:
@@ -80,33 +93,23 @@ Add the server configuration to the `mcpServers` object in your config file:
 }
 ```
-### Option 3: Auto-Detect Current Directory
-```json
-{
-  "mcpServers": {
-    "smart-coding-mcp": {
-      "command": "smart-coding-mcp"
-    }
-  }
-}
-```
 ## Environment Variables
 Override configuration settings via environment variables in your MCP config:
-| Variable                         | Type    | Default   | Description                           |
-| -------------------------------- | ------- | --------- | ------------------------------------- |
-| `SMART_CODING_VERBOSE`           | boolean | `false`   | Enable detailed logging               |
-| `SMART_CODING_BATCH_SIZE`        | number  | `100`     | Files to process in parallel          |
-| `SMART_CODING_MAX_FILE_SIZE`     | number  | `1048576` | Max file size in bytes (1MB)          |
-| `SMART_CODING_CHUNK_SIZE`        | number  | `15`      | Lines of code per chunk               |
-| `SMART_CODING_MAX_RESULTS`       | number  | `5`       | Max search results                    |
-| `SMART_CODING_SMART_INDEXING`    | boolean | `true`    | Enable smart project detection        |
-| `SMART_CODING_WATCH_FILES`       | boolean | `false`   | Enable file watching for auto-reindex |
-| `SMART_CODING_SEMANTIC_WEIGHT`   | number  | `0.7`     | Weight for semantic similarity (0-1)  |
-| `SMART_CODING_EXACT_MATCH_BOOST` | number  | `1.5`     | Boost for exact text matches          |
+| Variable                         | Type    | Default                   | Description                           |
+| -------------------------------- | ------- | ------------------------- | ------------------------------------- |
+| `SMART_CODING_VERBOSE`           | boolean | `false`                   | Enable detailed logging               |
+| `SMART_CODING_BATCH_SIZE`        | number  | `100`                     | Files to process in parallel          |
+| `SMART_CODING_MAX_FILE_SIZE`     | number  | `1048576`                 | Max file size in bytes (1MB)          |
+| `SMART_CODING_CHUNK_SIZE`        | number  | `25`                      | Lines of code per chunk               |
+| `SMART_CODING_MAX_RESULTS`       | number  | `5`                       | Max search results                    |
+| `SMART_CODING_SMART_INDEXING`    | boolean | `true`                    | Enable smart project detection        |
+| `SMART_CODING_WATCH_FILES`       | boolean | `false`                   | Enable file watching for auto-reindex |
+| `SMART_CODING_SEMANTIC_WEIGHT`   | number  | `0.7`                     | Weight for semantic similarity (0-1)  |
+| `SMART_CODING_EXACT_MATCH_BOOST` | number  | `1.5`                     | Boost for exact text matches          |
+| `SMART_CODING_EMBEDDING_MODEL`   | string  | `Xenova/all-MiniLM-L6-v2` | AI embedding model to use             |
+| `SMART_CODING_WORKER_THREADS`    | string  | `auto`                    | Worker threads (`auto` or 1-32)       |
 **Example with environment variables:**
@@ -160,60 +163,7 @@ The server indexes your code in four steps:
 When you search, your query is converted to the same vector format and compared against all code chunks using cosine similarity. The most relevant matches are returned.
-### Smart Project Detection
-The server detects your project type by looking for marker files and automatically applies appropriate ignore patterns:
-**JavaScript/Node** (package.json found)
-- Ignores: node_modules, dist, build, .next, coverage
-**Python** (requirements.txt or pyproject.toml)
-- Ignores: **pycache**, venv, .pytest_cache, .tox
-**Android** (build.gradle)
-- Ignores: .gradle, build artifacts, generated code
-**iOS** (Podfile)
-- Ignores: Pods, DerivedData, xcuserdata
-**And more**: Go, PHP, Rust, Ruby, .NET
-This typically reduces indexed file count by 100x. A project with 50,000 files (including node_modules) indexes just 500 actual source files.
-## Configuration
-The server works out of the box with sensible defaults. Create a `config.json` file in your workspace to customize:
-```json
-{
-  "searchDirectory": ".",
-  "fileExtensions": ["js", "ts", "py", "java", "go"],
-  "excludePatterns": ["**/my-custom-ignore/**"],
-  "smartIndexing": true,
-  "verbose": false,
-  "enableCache": true,
-  "cacheDirectory": "./.smart-coding-cache",
-  "watchFiles": true,
-  "chunkSize": 15,
-  "batchSize": 100,
-  "maxFileSize": 1048576,
-  "maxResults": 5
-}
-```
-**Key options:**
-- `smartIndexing`: Enable automatic project type detection and smart ignore patterns (default: true)
-- `verbose`: Show detailed indexing logs (default: false)
-- `watchFiles`: Automatically reindex when files change (default: true)
-- `enableCache`: Cache embeddings to disk (default: true)
-- `chunkSize`: Lines of code per chunk - smaller = more precise, larger = more context (default: 15)
-- `batchSize`: Number of files to process in parallel (default: 100)
-- `maxFileSize`: Skip files larger than this size in bytes (default: 1MB)
+![How It Works](how-its-works.png)
 ## Examples
@@ -243,85 +193,6 @@ Query: "error handling and exceptions"
 Finds all try/catch blocks and error handling patterns.
-## Performance
-Tested on a typical JavaScript project:
-| Metric         | Without Smart Indexing | With Smart Indexing |
-| -------------- | ---------------------- | ------------------- |
-| Files scanned  | 50,000+                | 500                 |
-| Indexing time  | 10+ min                | 2-3 min             |
-| Memory usage   | 2GB+                   | ~200MB              |
-| Search latency | N/A                    | <100ms              |
-## Supported File Types
-Languages: JavaScript, TypeScript, Python, Java, Kotlin, Scala, C, C++, C#, Go, Rust, Ruby, PHP, Swift, Shell
-Web: HTML, CSS, SCSS, Sass, XML, SVG
-Config/Data: JSON, YAML, TOML, SQL
-Total: 36 file extensions
-## Architecture
-```
-smart-coding-mcp/
-├── index.js                  # MCP server entry point
-├── lib/
-│   ├── config.js            # Configuration + smart detection
-│   ├── cache.js             # Embeddings persistence
-│   ├── utils.js             # Smart chunking
-│   ├── ignore-patterns.js   # Language-specific patterns
-│   └── project-detector.js  # Project type detection
-└── features/
-    ├── hybrid-search.js     # Semantic + exact match search
-    ├── index-codebase.js    # File indexing + watching
-    └── clear-cache.js       # Cache management
-```
-The modular design makes it easy to add new features. See ARCHITECTURE.md for implementation details.
-## Troubleshooting
-**"Server can't find config.json"**
-Make sure `cwd` is set in your MCP configuration to the full path of smart-coding-mcp.
-**"Indexing takes too long"**
-- Verify `smartIndexing` is enabled
-- Add more patterns to `excludePatterns`
-- Reduce `fileExtensions` to only what you need
-**"Search results aren't relevant"**
-- Try more specific queries
-- Increase `maxResults` to see more options
-- Run `index_codebase` to force a full reindex
-**"Cache corruption errors"**
-Use the `clear_cache` tool or run:
-```bash
-npm run clear-cache
-```
-## CLI Commands
-```bash
-# Start the server
-npm start
-# Development mode with auto-restart
-npm run dev
-# Clear embeddings cache
-npm run clear-cache
-```
 ## Privacy
 - AI model runs entirely on your machine
@@ -353,17 +224,6 @@ This project builds on research from Cursor showing that semantic search improve
 See: https://cursor.com/blog/semsearch
-## Contributing
-Contributions are welcome. See CONTRIBUTING.md for guidelines.
-Potential areas for improvement:
-- Additional language support
-- Code complexity analysis
-- Refactoring pattern detection
-- Documentation generation
 ## License
 MIT License

package/config.json CHANGED Viewed

@@ -50,8 +50,8 @@
     "**/.smart-coding-cache/**"
   ],
   "smartIndexing": true,
-  "chunkSize": 15,
-  "chunkOverlap": 3,
+  "chunkSize": 25,
+  "chunkOverlap": 5,
   "batchSize": 100,
   "maxFileSize": 1048576,
   "maxResults": 5,
@@ -61,5 +61,6 @@
   "verbose": false,
   "embeddingModel": "Xenova/all-MiniLM-L6-v2",
   "semanticWeight": 0.7,
-  "exactMatchBoost": 1.5
+  "exactMatchBoost": 1.5,
+  "workerThreads": "auto"
 }

package/example.png ADDED Viewed

Binary file

package/features/clear-cache.js CHANGED Viewed

@@ -1,16 +1,39 @@
 export class CacheClearer {
-  constructor(embedder, cache, config) {
+  constructor(embedder, cache, config, indexer) {
     this.cache = cache;
     this.config = config;
+    this.indexer = indexer;
+    this.isClearing = false;
   }
   async execute() {
-    await this.cache.clear();
-    return {
-      success: true,
-      message: `Cache cleared successfully. Next indexing will be a full rebuild.`,
-      cacheDirectory: this.config.cacheDirectory
-    };
+    // Check if indexing is in progress
+    if (this.indexer && this.indexer.isIndexing) {
+      throw new Error("Cannot clear cache while indexing is in progress. Please wait for indexing to complete.");
+    }
+    // Check if cache is currently being saved (race condition prevention)
+    if (this.cache.isSaving) {
+      throw new Error("Cannot clear cache while cache is being saved. Please try again in a moment.");
+    }
+    // Check if a clear operation is already in progress (prevent concurrent clears)
+    if (this.isClearing) {
+      throw new Error("Cache clear operation already in progress. Please wait for it to complete.");
+    }
+    this.isClearing = true;
+    try {
+      await this.cache.clear();
+      return {
+        success: true,
+        message: `Cache cleared successfully. Next indexing will be a full rebuild.`,
+        cacheDirectory: this.config.cacheDirectory
+      };
+    } finally {
+      this.isClearing = false;
+    }
   }
 }