semantic-code-mcp 2.0.1 → 2.1.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +518 -111
- package/features/index-codebase.js +18 -24
- package/lib/config.js +31 -28
- package/package.json +1 -1
package/README.md
CHANGED

@@ -4,11 +4,44 @@
  [](https://www.npmjs.com/package/semantic-code-mcp)
  [](https://opensource.org/licenses/MIT)
  [](https://nodejs.org/)
+ []()
+ []()
+ [](https://milvus.io)

- AI-powered semantic code search for coding agents. An MCP server
+ AI-powered semantic code search for coding agents. An MCP server with **non-blocking background indexing**, **multi-provider embeddings** (Gemini, Vertex AI, OpenAI, local), and **Milvus / Zilliz Cloud** vector storage — designed for **multi-agent concurrent access**.
+
+ Run Claude Code, Codex, Copilot, and Antigravity against the same code index simultaneously. Indexing runs in the background; search works immediately while indexing continues.

  > Ask *"where do we handle authentication?"* and find code that uses `login`, `session`, `verifyCredentials` — even when no file contains the word "authentication."

+ ## Quick Start
+
+ ```bash
+ npx -y semantic-code-mcp@latest --workspace /path/to/your/project
+ ```
+
+ MCP config:
+
+ ```json
+ {
+   "mcpServers": {
+     "semantic-code-mcp": {
+       "command": "npx",
+       "args": ["-y", "semantic-code-mcp@latest", "--workspace", "/path/to/project"]
+     }
+   }
+ }
+ ```
+
+ ```mermaid
+ graph LR
+   A["Claude Code"] --> M["Milvus Standalone<br/>(Docker)"]
+   B["Codex"] --> M
+   C["Copilot"] --> M
+   D["Antigravity"] --> M
+   M --> V["Shared Vector Index"]
+ ```
+
  ## Why

  Traditional `grep` and keyword search break down when you don't know the exact terms used in the codebase. Semantic search bridges that gap:
@@ -20,40 +53,135 @@ Traditional `grep` and keyword search break down when you don't know the exact t

  Based on [Cursor's research](https://cursor.com/blog/semsearch) showing semantic search improves AI agent performance by 12.5%.

- ##
+ ## Setup

-
-
+ <details>
+ <summary><strong>Claude Code / Claude Desktop</strong></summary>
+
+ ```json
+ {
+   "mcpServers": {
+     "semantic-code-mcp": {
+       "command": "npx",
+       "args": ["-y", "semantic-code-mcp@latest", "--workspace", "/path/to/project"]
+     }
+   }
+ }
  ```

-
+ Claude Code: `~/.claude/settings.local.json` → `mcpServers`
+ Claude Desktop: `~/Library/Application Support/Claude/claude_desktop_config.json`
+
+ </details>
+
+ <details>
+ <summary><strong>VS Code / Cursor / Windsurf (Copilot)</strong></summary>
+
+ Create `.vscode/mcp.json` in your project root:
+
+ ```json
+ {
+   "servers": {
+     "semantic-code-mcp": {
+       "command": "npx",
+       "args": ["-y", "semantic-code-mcp@latest", "--workspace", "${workspaceFolder}"]
+     }
+   }
+ }
+ ```
+
+ > VS Code and Cursor support `${workspaceFolder}`. Windsurf requires absolute paths.
+
+ </details>
+
+ <details>
+ <summary><strong>Codex (OpenAI)</strong></summary>
+
+ `~/.codex/config.toml`:
+
+ ```toml
+ [mcp_servers.semantic-code-mcp]
+ command = "npx"
+ args = ["-y", "semantic-code-mcp@latest", "--workspace", "/path/to/project"]
+ ```
+
+ </details>
+
+ <details>
+ <summary><strong>Antigravity (Google)</strong></summary>
+
+ `~/.gemini/antigravity/mcp_config.json`:

  ```json
  {
    "mcpServers": {
      "semantic-code-mcp": {
        "command": "npx",
-       "args": ["-y", "semantic-code-mcp@latest", "--workspace", "/path/to/
+       "args": ["-y", "semantic-code-mcp@latest", "--workspace", "/path/to/project"]
      }
    }
  }
  ```

-
+ </details>

-
+ <details>
+ <summary><strong>🐚 Shell Script (Monorepo / Large Codebases)</strong></summary>
+
+ For monorepos or workspaces with 1000+ files, a shell wrapper script gives you:
+ - **Real-time logs** — see indexing progress, error details, 429 retry status
+ - **No MCP timeout** — long-running index operations won't be killed
+ - **Environment isolation** — pin provider credentials per project
+
+ Create `start-semantic-code-mcp.sh`:
+
+ ```bash
+ #!/bin/bash
+ export SMART_CODING_WORKSPACE="/path/to/monorepo"
+ export SMART_CODING_EMBEDDING_PROVIDER="vertex"
+ export SMART_CODING_VECTOR_STORE_PROVIDER="milvus"
+ export SMART_CODING_MILVUS_ADDRESS="http://localhost:19530"
+ export GOOGLE_APPLICATION_CREDENTIALS="/path/to/service-account.json"
+ export SMART_CODING_VERTEX_PROJECT="your-gcp-project-id"
+
+ cd /path/to/semantic-code-mcp
+ exec node index.js
+ ```
+
+ ```bash
+ chmod +x start-semantic-code-mcp.sh
+ ```
+
+ Then reference in your MCP config:
+
+ ```json
+ {
+   "semantic-code-mcp": {
+     "command": "/absolute/path/to/start-semantic-code-mcp.sh",
+     "args": []
+   }
+ }
+ ```
+
+ > **When to use shell scripts over npx:**
+ > - Monorepo with multiple sub-projects sharing one index
+ > - 1000+ files requiring long initial indexing
+ > - Debugging 429 rate-limit or gRPC errors (need real-time stderr)
+ > - Pinning specific provider credentials per workspace
+
+ </details>

  ## Features

  ### Multi-Provider Embeddings

- | Provider
-
- | **Local** (default)
- | **Gemini**
- | **OpenAI**
- | **OpenAI-compatible** | Any compatible endpoint | Varies
- | **Vertex AI**
+ | Provider | Model | Privacy | Speed |
+ | --- | --- | --- | --- |
+ | **Local** (default) | nomic-embed-text-v1.5 | 100% local | ~50ms/chunk |
+ | **Gemini** | gemini-embedding-001 | API call | Fast, batched |
+ | **OpenAI** | text-embedding-3-small | API call | Fast |
+ | **OpenAI-compatible** | Any compatible endpoint | Varies | Varies |
+ | **Vertex AI** | Google Cloud models | GCP | Fast |

  ### Flexible Vector Storage

@@ -72,28 +200,180 @@ Three modes to match your codebase:

  CPU capped at 50% during indexing. Your machine stays responsive.

+ ### Multi-Agent Concurrent Access
+
+ Multiple AI agents (Claude Code, Codex, Copilot, Antigravity) can query the same vector index simultaneously via **Milvus Standalone** (Docker). No file locking, no index corruption.
+
+ <details>
+ <summary><strong>Docker Setup (Milvus Standalone)</strong></summary>
+
+ Milvus Standalone runs **3 containers** working together:
+
+ ```mermaid
+ graph LR
+   A["semantic-code-mcp"] -->|"gRPC :19530"| M["milvus standalone"]
+   M -->|"object storage"| S["minio :9000"]
+   M -->|"metadata"| E["etcd :2379"]
+ ```
+
+ | Container | Role | Image |
+ | --- | --- | --- |
+ | **standalone** | Vector engine (gRPC :19530) | `milvusdb/milvus` |
+ | **etcd** | Metadata store (cluster coordination) | `coreos/etcd` |
+ | **minio** | Object storage (index files, logs) | `minio/minio` |
+
+ #### Performance Guidelines
+
+ | Resource | Minimum | Recommended |
+ | --- | --- | --- |
+ | RAM | **4 GB** | 8 GB+ |
+ | Disk | 10 GB | 50 GB+ (scales with codebase) |
+ | CPU | 2 cores | 4+ cores |
+ | Docker | v20+ | Latest |
+
+ > ⚠️ **RAM is the critical bottleneck.** Milvus Standalone idles at ~2.5 GB RAM across the 3 containers. Machines with < 4 GB will experience swap thrashing and gRPC timeouts. Check with `docker stats`.
+
+ #### 1. Install with Docker Compose
+
+ ```yaml
+ # docker-compose.yml
+ version: '3.5'
+ services:
+   etcd:
+     image: coreos/etcd:v3.5.18
+     environment:
+       ETCD_AUTO_COMPACTION_MODE: revision
+       ETCD_AUTO_COMPACTION_RETENTION: "1000"
+       ETCD_QUOTA_BACKEND_BYTES: "4294967296"
+     command: etcd -advertise-client-urls=http://127.0.0.1:2379 -listen-client-urls http://0.0.0.0:2379 --data-dir /etcd
+     volumes:
+       - etcd-data:/etcd
+
+   minio:
+     image: minio/minio:RELEASE.2023-03-20T20-16-18Z
+     environment:
+       MINIO_ACCESS_KEY: minioadmin
+       MINIO_SECRET_KEY: minioadmin
+     command: minio server /minio_data --console-address ":9001"
+     ports:
+       - "9000:9000"
+       - "9001:9001"
+     volumes:
+       - minio-data:/minio_data
+
+   standalone:
+     image: milvusdb/milvus:v2.5.1
+     command: ["milvus", "run", "standalone"]
+     environment:
+       ETCD_ENDPOINTS: etcd:2379
+       MINIO_ADDRESS: minio:9000
+     ports:
+       - "19530:19530"
+       - "9091:9091"
+     volumes:
+       - milvus-data:/var/lib/milvus
+     depends_on:
+       - etcd
+       - minio
+
+ volumes:
+   etcd-data:
+   minio-data:
+   milvus-data:
+ ```
+
+ #### 2. Start & Verify
+
+ ```bash
+ # Start all 3 containers
+ docker compose up -d
+
+ # Verify all 3 containers are running
+ docker compose ps
+ # NAME         STATUS
+ # etcd         running
+ # minio        running
+ # standalone   running (healthy)
+
+ # Check RAM usage (expect ~2.5 GB total idle)
+ docker stats --no-stream
+ ```
+
+ #### 3. Configure MCP to use Milvus
+
+ ```json
+ {
+   "env": {
+     "SMART_CODING_VECTOR_STORE_PROVIDER": "milvus",
+     "SMART_CODING_MILVUS_ADDRESS": "http://localhost:19530"
+   }
+ }
+ ```
+
+ #### 4. Verify connection
+
+ ```bash
+ # Should return collection list (may be empty initially)
+ curl http://localhost:19530/v1/vector/collections
+ ```
+
+ #### 5. Lifecycle Management
+
+ ```bash
+ # Stop all containers (preserves data)
+ docker compose stop
+
+ # Restart after reboot
+ docker compose start
+
+ # Full reset (removes all indexed vectors)
+ docker compose down -v
+
+ # View logs for debugging
+ docker compose logs -f standalone
+ ```
+
+ #### 6. Monitoring
+
+ - **MinIO Console**: http://localhost:9001 (minioadmin / minioadmin)
+ - **Milvus Health**: http://localhost:9091/healthz
+ - **Container RAM**: `docker stats --no-stream`
+
+ #### Troubleshooting
+
+ | Symptom | Cause | Fix |
+ | --- | --- | --- |
+ | gRPC timeout / connection refused | Milvus not fully started | Wait 30–60s after `docker compose up -d`, check `docker compose logs standalone` |
+ | Swap thrashing, slow queries | < 4 GB RAM | Upgrade RAM or use SQLite for single-agent setups |
+ | `etcd: mvcc: database space exceeded` | etcd compaction backlog | `docker compose restart etcd` |
+ | Milvus OOM killed | RAM pressure from other apps | Close heavy apps or increase Docker memory limit |
+
+ > **SQLite vs Milvus:** SQLite is single-process — only one agent can write at a time. Milvus handles concurrent reads/writes from multiple agents without conflicts. Use Milvus when running 2+ agents on the same codebase.
+
+ </details>
+
  ## Tools

- | Tool
-
- | `a_semantic_search`
- | `b_index_codebase`
- | `c_clear_cache`
- | `d_check_last_version` | Look up latest package version from 20+ registries.
- | `e_set_workspace`
- | `f_get_status`
+ | Tool | Description |
+ | --- | --- |
+ | `a_semantic_search` | Find code by meaning. Hybrid semantic + exact match scoring. |
+ | `b_index_codebase` | Trigger manual reindex (normally automatic & incremental). |
+ | `c_clear_cache` | Reset embeddings cache entirely. |
+ | `d_check_last_version` | Look up latest package version from 20+ registries. |
+ | `e_set_workspace` | Switch project at runtime without restart. |
+ | `f_get_status` | Server health: version, index progress, config. |

  ## IDE Setup

- | IDE / App
-
- | **VS Code**
- | **Cursor**
- | **Windsurf**
- | **Claude Desktop** | [Setup](docs/ide-setup/claude-desktop.md) | ❌
- | **OpenCode**
- | **Raycast**
- | **Antigravity**
+ | IDE / App | Guide | `${workspaceFolder}` |
+ | --- | --- | --- |
+ | **VS Code** | [Setup](docs/ide-setup/vscode.md) | ✅ |
+ | **Cursor** | [Setup](docs/ide-setup/cursor.md) | ✅ |
+ | **Windsurf** | [Setup](docs/ide-setup/windsurf.md) | ❌ |
+ | **Claude Desktop** | [Setup](docs/ide-setup/claude-desktop.md) | ❌ |
+ | **OpenCode** | [Setup](docs/ide-setup/opencode.md) | ❌ |
+ | **Raycast** | [Setup](docs/ide-setup/raycast.md) | ❌ |
+ | **Antigravity** | [Setup](docs/ide-setup/antigravity.md) | ❌ |

  ### Multi-Project

@@ -118,67 +398,92 @@ All settings via environment variables. Prefix: `SMART_CODING_`.

  ### Core

- | Variable
-
- | `SMART_CODING_VERBOSE`
- | `SMART_CODING_MAX_RESULTS`
- | `SMART_CODING_BATCH_SIZE`
- | `SMART_CODING_MAX_FILE_SIZE`
- | `SMART_CODING_CHUNK_SIZE`
- | `SMART_CODING_CHUNKING_MODE`
- | `SMART_CODING_WATCH_FILES`
- | `SMART_CODING_AUTO_INDEX_DELAY` | `
- | `SMART_CODING_MAX_CPU_PERCENT`
+ | Variable | Default | Description |
+ | --- | --- | --- |
+ | `SMART_CODING_VERBOSE` | `false` | Detailed logging |
+ | `SMART_CODING_MAX_RESULTS` | `5` | Search results returned |
+ | `SMART_CODING_BATCH_SIZE` | `100` | Files per parallel batch |
+ | `SMART_CODING_MAX_FILE_SIZE` | `1048576` | Max file size (1MB) |
+ | `SMART_CODING_CHUNK_SIZE` | `25` | Lines per chunk |
+ | `SMART_CODING_CHUNKING_MODE` | `smart` | `smart` / `ast` / `line` |
+ | `SMART_CODING_WATCH_FILES` | `false` | Auto-reindex on changes |
+ | `SMART_CODING_AUTO_INDEX_DELAY` | `false` | Background index on startup. `false`=off (multi-agent safe), `true`=5s, or ms value. Single-agent only. |
+ | `SMART_CODING_MAX_CPU_PERCENT` | `50` | CPU cap during indexing |

  ### Embedding Provider

- | Variable
-
- | `SMART_CODING_EMBEDDING_PROVIDER`
- | `SMART_CODING_EMBEDDING_MODEL`
- | `SMART_CODING_EMBEDDING_DIMENSION` | `128`
- | `SMART_CODING_DEVICE`
+ | Variable | Default | Description |
+ | --- | --- | --- |
+ | `SMART_CODING_EMBEDDING_PROVIDER` | `local` | `local` / `gemini` / `openai` / `openai-compatible` / `vertex` |
+ | `SMART_CODING_EMBEDDING_MODEL` | `nomic-ai/nomic-embed-text-v1.5` | Model name |
+ | `SMART_CODING_EMBEDDING_DIMENSION` | `128` | MRL dimension (64–768) |
+ | `SMART_CODING_DEVICE` | `auto` | `cpu` / `webgpu` / `auto` |

  ### Gemini

- | Variable
-
- | `SMART_CODING_GEMINI_API_KEY`
- | `SMART_CODING_GEMINI_MODEL`
- | `SMART_CODING_GEMINI_DIMENSIONS`
- | `SMART_CODING_GEMINI_BATCH_SIZE`
- | `SMART_CODING_GEMINI_MAX_RETRIES` | `3`
+ | Variable | Default | Description |
+ | --- | --- | --- |
+ | `SMART_CODING_GEMINI_API_KEY` | — | API key |
+ | `SMART_CODING_GEMINI_MODEL` | `gemini-embedding-001` | Model |
+ | `SMART_CODING_GEMINI_DIMENSIONS` | `768` | Output dimensions |
+ | `SMART_CODING_GEMINI_BATCH_SIZE` | `24` | Micro-batch size |
+ | `SMART_CODING_GEMINI_MAX_RETRIES` | `3` | Retry count |

  ### OpenAI / Compatible

- | Variable
-
- | `SMART_CODING_EMBEDDING_API_KEY`
- | `SMART_CODING_EMBEDDING_BASE_URL` | —
+ | Variable | Default | Description |
+ | --- | --- | --- |
+ | `SMART_CODING_EMBEDDING_API_KEY` | — | API key |
+ | `SMART_CODING_EMBEDDING_BASE_URL` | — | Base URL (compatible only) |

  ### Vertex AI

- | Variable
-
- | `SMART_CODING_VERTEX_PROJECT`
- | `SMART_CODING_VERTEX_LOCATION` | `us-central1` | Region
+ | Variable | Default | Description |
+ | --- | --- | --- |
+ | `SMART_CODING_VERTEX_PROJECT` | — | GCP project ID |
+ | `SMART_CODING_VERTEX_LOCATION` | `us-central1` | Region |

  ### Vector Store

- | Variable
-
- | `SMART_CODING_VECTOR_STORE_PROVIDER` | `sqlite`
- | `SMART_CODING_MILVUS_ADDRESS`
- | `SMART_CODING_MILVUS_TOKEN`
- | `SMART_CODING_MILVUS_DATABASE`
- | `SMART_CODING_MILVUS_COLLECTION`
+ | Variable | Default | Description |
+ | --- | --- | --- |
+ | `SMART_CODING_VECTOR_STORE_PROVIDER` | `sqlite` | `sqlite` / `milvus` |
+ | `SMART_CODING_MILVUS_ADDRESS` | — | Milvus endpoint or Zilliz Cloud URI |
+ | `SMART_CODING_MILVUS_TOKEN` | — | Auth token (required for Zilliz Cloud) |
+ | `SMART_CODING_MILVUS_DATABASE` | `default` | Database name |
+ | `SMART_CODING_MILVUS_COLLECTION` | `smart_coding_embeddings` | Collection |
+
+ ### Zilliz Cloud (Managed Milvus)
+
+ For teams or serverless deployments, use [Zilliz Cloud](https://zilliz.com) instead of self-hosted Docker:
+
+ ```json
+ {
+   "env": {
+     "SMART_CODING_VECTOR_STORE_PROVIDER": "milvus",
+     "SMART_CODING_MILVUS_ADDRESS": "https://in03-xxxx.api.gcp-us-west1.zillizcloud.com",
+     "SMART_CODING_MILVUS_TOKEN": "your-zilliz-api-key"
+   }
+ }
+ ```
+
+ | Feature | Milvus Standalone (Docker) | Zilliz Cloud |
+ | --- | --- | --- |
+ | Setup | Self-hosted, 3 containers | Managed SaaS |
+ | RAM | ~2.5 GB idle | None (serverless) |
+ | Multi-agent | ✅ via shared Docker | ✅ via shared endpoint |
+ | Scaling | Manual | Auto-scaling |
+ | Free tier | — | 2 collections, 1M vectors |
+ | Best for | Local dev, single machine | Team use, CI/CD, production |
+
+ > Get your Zilliz Cloud URI and API key from the [Zilliz Console](https://cloud.zilliz.com) → Cluster → Connect.

  ### Search Tuning

- | Variable
-
- | `SMART_CODING_SEMANTIC_WEIGHT`
- | `SMART_CODING_EXACT_MATCH_BOOST` | `1.5`
+ | Variable | Default | Description |
+ | --- | --- | --- |
+ | `SMART_CODING_SEMANTIC_WEIGHT` | `0.7` | Semantic vs exact weight |
+ | `SMART_CODING_EXACT_MATCH_BOOST` | `1.5` | Exact match multiplier |

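To make these two knobs concrete, here is a minimal JavaScript sketch of how a semantic weight and an exact-match boost can combine into a single score. The function name, the exact-match heuristic, and the formula are illustrative assumptions and are not taken from the package's hybrid-search implementation.

```javascript
// Illustrative only: assumed combination of SEMANTIC_WEIGHT and EXACT_MATCH_BOOST.
function hybridScore(chunkText, query, cosineSimilarity, config) {
  const semanticWeight = config.semanticWeight; // e.g. 0.7 (default)
  const keywordWeight = 1 - semanticWeight;     // remainder goes to exact matching

  // Naive exact-match signal: fraction of query terms found verbatim in the chunk.
  const terms = query.toLowerCase().split(/\s+/).filter(Boolean);
  const text = chunkText.toLowerCase();
  const hits = terms.filter((t) => text.includes(t)).length;
  const exactScore = terms.length ? hits / terms.length : 0;

  let score = semanticWeight * cosineSimilarity + keywordWeight * exactScore;
  if (terms.length > 0 && hits === terms.length) {
    score *= config.exactMatchBoost;            // e.g. 1.5 (default) when every term matches
  }
  return score;
}
```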
  ### Example with Gemini + Milvus

@@ -201,48 +506,141 @@

  ## Architecture

- ```
-
-
-
-
-
-
-
-
-
-
-
- ├── features/
- │   ├── hybrid-search.js        # Semantic + exact match search
- │   ├── index-codebase.js       # File discovery & incremental indexing
- │   ├── clear-cache.js          # Cache reset
- │   ├── check-last-version.js   # Package version lookup
- │   ├── set-workspace.js        # Runtime workspace switching
- │   └── get-status.js           # Server status
- └── test/                       # Vitest test suite
+ ```mermaid
+ graph TD
+   A["MCP Server — index.js"] --> B["Features"]
+   B --> B1["hybrid-search"]
+   B --> B2["index-codebase"]
+   B --> B3["set-workspace / get-status / clear-cache"]
+
+   B2 --> C["Code Chunking — AST or Smart Regex"]
+   C --> D["Embedding — Local / Gemini / Vertex / OpenAI"]
+   D --> E["Vector Store — SQLite or Milvus"]
+
+   B1 --> D
+   B1 --> E
  ```

  ## How It Works

- ```
-
-
-
-
-
-
-
-
-
-
-
+ ```mermaid
+ flowchart LR
+   A["📁 Source Files"] -->|glob + .gitignore| B["✂️ Smart/AST<br/>Chunking"]
+   B -->|language-aware| C["🧠 AI Embedding<br/>(Local or API)"]
+   C -->|vectors| D["💾 SQLite / Milvus<br/>Storage"]
+   D -->|incremental hash| D
+
+   E["🔍 Search Query"] -->|embed| C
+   C -->|cosine similarity| F["📊 Hybrid Scoring<br/>semantic + exact match"]
+   F --> G["🎯 Top N Results<br/>with relevance scores"]
+
+   style A fill:#2d3748,color:#e2e8f0
+   style C fill:#553c9a,color:#e9d8fd
+   style D fill:#2a4365,color:#bee3f8
+   style G fill:#22543d,color:#c6f6d5
  ```

  **Progressive indexing** — search works immediately while indexing continues in the background. Only changed files are re-indexed on subsequent runs.

+ ## Incremental Indexing & Optimization
+
+ Semantic Code MCP uses a **hash-based incremental indexing** strategy to minimize redundant work:
+
+ ```mermaid
+ flowchart TD
+   A["File discovered"] --> B{"Hash changed?"}
+   B -->|No| C["Skip — use cached vectors"]
+   B -->|Yes| D["Re-chunk & re-embed"]
+   D --> E["Update vector store"]
+   F["Deleted file detected"] --> G["Prune stale vectors"]
+
+   style C fill:#22543d,color:#c6f6d5
+   style D fill:#744210,color:#fefcbf
+   style G fill:#742a2a,color:#fed7d7
+ ```
+
+ **How it works:**
+
+ 1. **File discovery** — glob patterns with `.gitignore`-aware filtering
+ 2. **Hash comparison** — each file's `mtime + size` is compared against the cached index
+ 3. **Delta processing** — only changed/new files are chunked and embedded
+ 4. **Stale pruning** — deleted files are removed from the vector store automatically
+ 5. **Progressive search** — queries work immediately, even mid-indexing
+
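The hash comparison in step 2 is cheap because it never reads file contents. Below is a minimal sketch of that check, assuming a simple `Map` cache keyed by file path; the names are illustrative, not the package's actual code.

```javascript
// Illustrative sketch of an mtime+size change check for incremental indexing.
import { stat } from "node:fs/promises";

async function needsReindex(filePath, hashCache) {
  const s = await stat(filePath);
  const hash = `${s.mtimeMs}:${s.size}`; // cheap fingerprint, no file read required
  if (hashCache.get(filePath) === hash) {
    return false;                        // unchanged: reuse cached vectors
  }
  hashCache.set(filePath, hash);         // new or changed: re-chunk and re-embed
  return true;
}
```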
+ **Performance characteristics:**
+
+ | Scenario | Behavior | Typical Time |
+ | --- | --- | --- |
+ | First run (500 files) | Full index | ~30–60s (API), ~2–5min (local) |
+ | Subsequent run (no changes) | Hash check only | < 1s |
+ | 10 files changed | Incremental delta | ~2–5s |
+ | Branch switch | Partial re-index | ~5–15s |
+ | `force=true` | Full rebuild | Same as first run |
+
+ > ⚠️ **Multi-agent warning:** Auto-index is **disabled by default** to prevent concurrent Milvus writes when multiple agents share the same server. Set `SMART_CODING_AUTO_INDEX_DELAY=true` (5s) only if a **single agent** connects to this MCP server. Use `b_index_codebase` for explicit on-demand indexing in multi-agent setups.
+
+ <details>
+ <summary><strong>🐚 Shell Reindex for Bulk Operations</strong></summary>
+
+ MCP tool calls have timeout limits and don't expose real-time logs. For bulk operations (initial setup, full rebuild, migration), use the CLI reindex script directly:
+
+ ```bash
+ cd /path/to/semantic-code-mcp
+ node reindex.js /path/to/workspace --force
+ ```
+
+ **When to use CLI over MCP tools:**
+
+ | Scenario | Use |
+ | --- | --- |
+ | Daily incremental updates | MCP `b_index_codebase(force=false)` |
+ | Initial workspace setup | CLI `node reindex.js /path --force` |
+ | Full rebuild after migration | CLI `node reindex.js /path --force` |
+ | 1000+ file bulk update | CLI (timeout-safe, real-time logs) |
+ | Debugging 429 / gRPC errors | CLI (stderr visible) |
+
+ > The CLI reindex script uses the same incremental engine under the hood. `--force` only forces re-embedding; it still uses the same hash-based delta for efficiency.
+
+ </details>
+
+ ## Non-Blocking Indexing Workflow
+
+ All indexing operations run in the **background** and return immediately. The agent can search while indexing continues.
+
+ ```mermaid
+ sequenceDiagram
+   participant Agent
+   participant MCP as semantic-code-mcp
+   participant BG as Background Thread
+   participant Store as Milvus / SQLite
+
+   Agent->>MCP: b_index_codebase(force=false)
+   MCP->>BG: startBackgroundIndexing()
+   MCP-->>Agent: {status: "started", message: "..."}
+   Note over Agent: ⚡ Returns instantly
+
+   loop Poll every 2-3s
+     Agent->>MCP: f_get_status()
+     MCP-->>Agent: {index.status: "indexing", progress: "150/500 files"}
+   end
+
+   BG->>Store: upsert vectors
+   BG-->>MCP: done
+
+   Agent->>MCP: f_get_status()
+   MCP-->>Agent: {index.status: "ready"}
+
+   Agent->>MCP: a_semantic_search(query)
+   MCP-->>Agent: [results]
+ ```
+
+ **Rules for agents:**
+ 1. **Always call `f_get_status` first** — check workspace and indexing status
+ 2. **Use `e_set_workspace` if workspace is wrong** — before any indexing
+ 3. **Poll `f_get_status` until `index.status: "ready"`** before relying on search results
+ 4. **Progressive search is supported** — `a_semantic_search` works during indexing with partial results
+ 5. **`SMART_CODING_AUTO_INDEX_DELAY=false`** by default — use `b_index_codebase` for explicit on-demand indexing in multi-agent setups
+
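Taken together, these rules amount to a small polling loop on the agent side. Here is a sketch in JavaScript, where `callTool(name, args)` stands in for whatever MCP client you use; the response shapes are assumed from the README rather than verified against the server.

```javascript
// Illustrative agent-side workflow: status check, background index, poll, then search.
async function indexThenSearch(callTool, query) {
  let status = await callTool("f_get_status", {});               // rule 1: check status first
  if (status.index?.status !== "ready") {
    await callTool("b_index_codebase", { force: false });        // returns immediately
    do {
      await new Promise((resolve) => setTimeout(resolve, 2500)); // poll every 2-3 s
      status = await callTool("f_get_status", {});
    } while (status.index?.status !== "ready");                  // rule 3: wait for "ready"
  }
  return callTool("a_semantic_search", { query });
}
```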
  ## Privacy

  - **Local mode**: everything runs on your machine. Code never leaves your system.

@@ -256,6 +654,15 @@ Copyright (c) 2025 Omar Haris (original), bitkyc08 (modifications, 2026)

  See [LICENSE](LICENSE) for full text.

-
+ ### About
+
+ This project is a fork of [smart-coding-mcp](https://github.com/omarHaris/smart-coding-mcp) by Omar Haris, heavily extended for production use.

-
+ **Key additions over upstream**:
+ - Multi-provider embeddings (Gemini, Vertex AI, OpenAI, OpenAI-compatible)
+ - Milvus vector store with ANN search for large codebases
+ - AST-based code chunking via Tree-sitter
+ - Resource throttling (CPU cap at 50%)
+ - Runtime workspace switching (`e_set_workspace`)
+ - Package version checker across 20+ registries (`d_check_last_version`)
+ - Comprehensive IDE setup guides (VS Code, Cursor, Windsurf, Claude Desktop, Antigravity)
package/features/index-codebase.js
CHANGED

@@ -935,7 +935,7 @@ export class CodebaseIndexer {
  export function getToolDefinition() {
    return {
      name: "b_index_codebase",
-     description: "
+     description: "Trigger codebase reindex. Returns IMMEDIATELY (non-blocking). Poll f_get_status until index.status='ready' before calling a_semantic_search. Do NOT search while indexing.",
      inputSchema: {
        type: "object",
        properties: {
@@ -959,41 +959,35 @@ export function getToolDefinition() {
  // Tool handler
  export async function handleToolCall(request, indexer) {
    const force = request.params.arguments?.force || false;
-   const result = await indexer.indexAll(force);

-   //
-   if (
+   // Guard: already indexing
+   if (indexer.isIndexing) {
+     const status = indexer.getIndexingStatus();
      return {
        content: [{
          type: "text",
-         text:
+         text: JSON.stringify({
+           accepted: false,
+           status: "rejected",
+           message: "Indexing already in progress. Use f_get_status to poll.",
+           progress: status
+         }, null, 2)
        }]
      };
    }

-   //
-
-   const stats = {
-     totalChunks: result?.totalChunks ?? cacheStats.totalChunks,
-     totalFiles: result?.totalFiles ?? cacheStats.totalFiles,
-     filesProcessed: result?.filesProcessed ?? 0,
-     chunksCreated: result?.chunksCreated ?? 0
-   };
-
-   let message = result?.message
-     ? `Codebase reindexed successfully.\n\n${result.message}`
-     : `Codebase reindexed successfully.`;
-
-   message += `\n\nStatistics:\n- Total files in index: ${stats.totalFiles}\n- Total code chunks: ${stats.totalChunks}`;
-
-   if (stats.filesProcessed > 0) {
-     message += `\n- Files processed this run: ${stats.filesProcessed}\n- Chunks created this run: ${stats.chunksCreated}`;
-   }
+   // Fire-and-forget — returns immediately
+   indexer.startBackgroundIndexing(force);

    return {
      content: [{
        type: "text",
-       text:
+       text: JSON.stringify({
+         accepted: true,
+         status: "started",
+         message: "Indexing started in background. Use f_get_status to poll progress.",
+         force
+       }, null, 2)
      }]
    };
  }
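The handler above only shows the call site of `indexer.startBackgroundIndexing(force)`. For orientation, here is a hypothetical sketch of the fire-and-forget pattern it implies, with the re-entrancy flag that the `isIndexing` guard reads; this is an illustration, not the package's actual CodebaseIndexer code.

```javascript
// Hypothetical illustration of a fire-and-forget background indexer with a guard flag.
class BackgroundIndexerSketch {
  constructor(indexAll) {
    this.indexAll = indexAll; // the existing (slow) incremental indexing routine
    this.isIndexing = false;
    this.lastError = null;
  }

  getIndexingStatus() {
    return { isIndexing: this.isIndexing, lastError: String(this.lastError ?? "") };
  }

  startBackgroundIndexing(force = false) {
    if (this.isIndexing) return; // caller already rejects, but stay safe against re-entry
    this.isIndexing = true;
    // Deliberately not awaited: the MCP handler returns while this promise runs.
    this.indexAll(force)
      .catch((err) => { this.lastError = err; })
      .finally(() => { this.isIndexing = false; });
  }
}
```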
package/lib/config.js
CHANGED
@@ -859,15 +859,15 @@ const DEFAULT_CONFIG = {
    semanticWeight: 0.7,
    exactMatchBoost: 1.5,
    smartIndexing: true,
-
+
    // Resource throttling (balanced performance/responsiveness)
    maxCpuPercent: 50, // Max CPU usage during indexing (default: 50%)
    batchDelay: 10, // Delay between batches in ms (default: 10ms)
    maxWorkers: 'auto', // Max worker threads ('auto' = 50% of cores, or specific number)
-
+
    // Startup behavior
-   autoIndexDelay:
-
+   autoIndexDelay: false, // Auto-index on startup: false = disabled (safe for multi-agent). Set to ms delay (e.g. 5000) for single-agent setups.
+
    // Progressive indexing
    incrementalSaveInterval: 5, // Save to cache every N batches
    allowPartialSearch: true // Allow searches while indexing is in progress
@@ -880,7 +880,7 @@ export async function loadConfig(workspaceDir = null) {
    // Determine the base directory for configuration
    let baseDir;
    let configPath;
-
+
    if (workspaceDir) {
      // Workspace mode: load config from workspace root
      baseDir = path.resolve(workspaceDir);
@@ -892,7 +892,7 @@ export async function loadConfig(workspaceDir = null) {
      baseDir = path.resolve(scriptDir, '..');
      configPath = path.join(baseDir, "config.json");
    }
-
+
    let userConfig = {};
    try {
      const configData = await fs.readFile(configPath, "utf-8");
@@ -904,9 +904,9 @@ export async function loadConfig(workspaceDir = null) {
        console.error(`[Config] No config.json found: ${configError.message}`);
      }
    }
-
+
    config = { ...DEFAULT_CONFIG, ...userConfig };
-
+
    // Set workspace-specific directories
    if (workspaceDir) {
      config.searchDirectory = baseDir;
@@ -915,7 +915,7 @@ export async function loadConfig(workspaceDir = null) {
      config.searchDirectory = path.resolve(baseDir, config.searchDirectory);
      config.cacheDirectory = path.resolve(baseDir, config.cacheDirectory);
    }
-
+
    // Smart project detection
    if (config.smartIndexing !== false) {
      const detector = new ProjectDetector(config.searchDirectory);
@@ -941,13 +941,13 @@ export async function loadConfig(workspaceDir = null) {
      }
      console.error(`[Config] Applied ${smartPatterns.length} smart ignore patterns`);
    }
-
+
    console.error("[Config] Loaded configuration from config.json");
  } catch (error) {
    console.error("[Config] Using default configuration (config.json not found or invalid)");
    console.error(`[Config] Error: ${error.message}`);
  }
-
+
  // Apply environment variable overrides (prefix: SMART_CODING_) with validation
  if (process.env.SMART_CODING_VERBOSE !== undefined) {
    const value = process.env.SMART_CODING_VERBOSE;
@@ -955,7 +955,7 @@ export async function loadConfig(workspaceDir = null) {
      config.verbose = value === 'true';
    }
  }
-
+
  if (process.env.SMART_CODING_BATCH_SIZE !== undefined) {
    const value = parseInt(process.env.SMART_CODING_BATCH_SIZE, 10);
    if (!isNaN(value) && value > 0 && value <= 1000) {
@@ -964,7 +964,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_BATCH_SIZE: ${process.env.SMART_CODING_BATCH_SIZE}, using default`);
    }
  }
-
+
  if (process.env.SMART_CODING_MAX_FILE_SIZE !== undefined) {
    const value = parseInt(process.env.SMART_CODING_MAX_FILE_SIZE, 10);
    if (!isNaN(value) && value > 0) {
@@ -973,7 +973,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_MAX_FILE_SIZE: ${process.env.SMART_CODING_MAX_FILE_SIZE}, using default`);
    }
  }
-
+
  if (process.env.SMART_CODING_CHUNK_SIZE !== undefined) {
    const value = parseInt(process.env.SMART_CODING_CHUNK_SIZE, 10);
    if (!isNaN(value) && value > 0 && value <= 100) {
@@ -982,7 +982,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_CHUNK_SIZE: ${process.env.SMART_CODING_CHUNK_SIZE}, using default`);
    }
  }
-
+
  if (process.env.SMART_CODING_MAX_RESULTS !== undefined) {
    const value = parseInt(process.env.SMART_CODING_MAX_RESULTS, 10);
    if (!isNaN(value) && value > 0 && value <= 100) {
@@ -991,14 +991,14 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_MAX_RESULTS: ${process.env.SMART_CODING_MAX_RESULTS}, using default`);
    }
  }
-
+
  if (process.env.SMART_CODING_SMART_INDEXING !== undefined) {
    const value = process.env.SMART_CODING_SMART_INDEXING;
    if (value === 'true' || value === 'false') {
      config.smartIndexing = value === 'true';
    }
  }
-
+
  if (process.env.SMART_CODING_WATCH_FILES !== undefined) {
    const value = process.env.SMART_CODING_WATCH_FILES;
    if (value === 'true' || value === 'false') {
@@ -1045,7 +1045,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Milvus collection: ${value}`);
    }
  }
-
+
  if (process.env.SMART_CODING_SEMANTIC_WEIGHT !== undefined) {
    const value = parseFloat(process.env.SMART_CODING_SEMANTIC_WEIGHT);
    if (!isNaN(value) && value >= 0 && value <= 1) {
@@ -1054,7 +1054,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_SEMANTIC_WEIGHT: ${process.env.SMART_CODING_SEMANTIC_WEIGHT}, using default (must be 0-1)`);
    }
  }
-
+
  if (process.env.SMART_CODING_EXACT_MATCH_BOOST !== undefined) {
    const value = parseFloat(process.env.SMART_CODING_EXACT_MATCH_BOOST);
    if (!isNaN(value) && value >= 0) {
@@ -1063,7 +1063,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_EXACT_MATCH_BOOST: ${process.env.SMART_CODING_EXACT_MATCH_BOOST}, using default`);
    }
  }
-
+
  if (process.env.SMART_CODING_EMBEDDING_MODEL !== undefined) {
    const value = process.env.SMART_CODING_EMBEDDING_MODEL.trim();
    if (value.length > 0) {
@@ -1180,7 +1180,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_GEMINI_MAX_RETRIES: ${process.env.SMART_CODING_GEMINI_MAX_RETRIES}, using default (must be 0-10)`);
    }
  }
-
+
  if (process.env.SMART_CODING_WORKER_THREADS !== undefined) {
    const value = process.env.SMART_CODING_WORKER_THREADS.trim().toLowerCase();
    if (value === 'auto') {
@@ -1194,7 +1194,7 @@ export async function loadConfig(workspaceDir = null) {
      }
    }
  }
-
+
  // MRL embedding dimension
  if (process.env.SMART_CODING_EMBEDDING_DIMENSION !== undefined) {
    const value = parseInt(process.env.SMART_CODING_EMBEDDING_DIMENSION, 10);
@@ -1206,7 +1206,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_EMBEDDING_DIMENSION: ${value}, using default (must be 64, 128, 256, 512, or 768)`);
    }
  }
-
+
  // Device selection
  if (process.env.SMART_CODING_DEVICE !== undefined) {
    const value = process.env.SMART_CODING_DEVICE.trim().toLowerCase();
@@ -1218,7 +1218,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_DEVICE: ${value}, using default (must be 'cpu', 'webgpu', or 'auto')`);
    }
  }
-
+
  // Chunking mode
  if (process.env.SMART_CODING_CHUNKING_MODE !== undefined) {
    const value = process.env.SMART_CODING_CHUNKING_MODE.trim().toLowerCase();
@@ -1230,7 +1230,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_CHUNKING_MODE: ${value}, using default (must be 'smart', 'ast', or 'line')`);
    }
  }
-
+
  // Resource throttling - Max CPU percent
  if (process.env.SMART_CODING_MAX_CPU_PERCENT !== undefined) {
    const value = parseInt(process.env.SMART_CODING_MAX_CPU_PERCENT, 10);
@@ -1241,7 +1241,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_MAX_CPU_PERCENT: ${value}, using default (must be 10-100)`);
    }
  }
-
+
  // Resource throttling - Batch delay
  if (process.env.SMART_CODING_BATCH_DELAY !== undefined) {
    const value = parseInt(process.env.SMART_CODING_BATCH_DELAY, 10);
@@ -1252,7 +1252,7 @@ export async function loadConfig(workspaceDir = null) {
      console.error(`[Config] Invalid SMART_CODING_BATCH_DELAY: ${value}, using default (must be 0-5000)`);
    }
  }
-
+
  // Resource throttling - Max workers
  if (process.env.SMART_CODING_MAX_WORKERS !== undefined) {
    const value = process.env.SMART_CODING_MAX_WORKERS.trim().toLowerCase();
@@ -1275,13 +1275,16 @@ export async function loadConfig(workspaceDir = null) {
    if (value === 'false' || value === '0') {
      config.autoIndexDelay = false;
      console.error(`[Config] Auto-indexing disabled`);
+   } else if (value === 'true') {
+     config.autoIndexDelay = 5000;
+     console.error(`[Config] Auto-indexing enabled (5000ms delay)`);
    } else {
      const numValue = parseInt(value, 10);
      if (!isNaN(numValue) && numValue >= 0 && numValue <= 60000) {
        config.autoIndexDelay = numValue;
        console.error(`[Config] Auto-index delay: ${numValue}ms`);
      } else {
-       console.error(`[Config] Invalid SMART_CODING_AUTO_INDEX_DELAY: ${value}, using default (must be 0-60000 or 'false')`);
+       console.error(`[Config] Invalid SMART_CODING_AUTO_INDEX_DELAY: ${value}, using default (must be 0-60000, 'true', or 'false')`);
      }
    }
  }
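For reference, the accepted `SMART_CODING_AUTO_INDEX_DELAY` values after this change resolve as shown below. The helper is a small illustrative mirror of the parsing above, not part of the package.

```javascript
// Resolves an env value the same way the config parser above does.
function resolveAutoIndexDelay(value, fallback = false) {
  const v = String(value).trim().toLowerCase();
  if (v === "false" || v === "0") return false; // disabled (default, multi-agent safe)
  if (v === "true") return 5000;                // enabled with the 5000 ms default delay
  const ms = parseInt(v, 10);
  return !isNaN(ms) && ms >= 0 && ms <= 60000 ? ms : fallback;
}

// resolveAutoIndexDelay("true")  -> 5000
// resolveAutoIndexDelay("15000") -> 15000
// resolveAutoIndexDelay("oops")  -> false (default retained; the server logs a warning)
```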
package/package.json
CHANGED