droid-mode 0.0.12 → 0.0.14

Files changed (2)

package/README.md +269 -158
package/package.json +1 -1

package/README.md CHANGED

@@ -1,251 +1,354 @@
-# Droid Mode
+<div align="center">
-Progressive Code-Mode MCP integration for Factory.ai Droid.
+<img src="https://res.cloudinary.com/ds7w5yhjh/image/upload/v1767480336/droid-mode-readme_ymx61e.webp" alt="Droid Mode" width="600" />
-Access MCP tools **without** loading them into your context window.
+**Progressive MCP for AI Agents. Zero Context Bloat.**
+[![npm version](https://img.shields.io/npm/v/droid-mode.svg)](https://www.npmjs.com/package/droid-mode)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+[![Node.js](https://img.shields.io/badge/Node.js-%3E%3D18-green.svg)](https://nodejs.org/)
+[![Status](https://img.shields.io/badge/Status-Experimental-orange.svg)](#experimental-status)
+Access MCP tools on-demand without loading schemas into your context window.
+[Quick Start](#quick-start) · [Architecture](#architecture) · [Benchmarks](#performance-benchmarks) · [Commands](#command-reference)
+</div>
+---
 ## The Problem
-When you configure MCP servers in Factory.ai Droid, all their tools get injected into the model's context window. This causes:
+When you configure MCP servers in Factory.ai Droid, **all tool schemas get injected into every prompt** ([source](https://www.anthropic.com/engineering/code-execution-with-mcp)). This creates a compounding cost:
-- **Token bloat** — A single MCP server can consume 2,400+ tokens in schema definitions alone
-- **Cognitive overload** — Too many tools confuse the model
-- **Server limits** — Each additional MCP server compounds the context cost
+| Configuration | Token Overhead | Impact |
+|---------------|----------------|--------|
+| 1 server (22 tools) | ~2,400 tokens | Acceptable |
+| 3 servers (~60 tools) | ~7,200 tokens | Noticeable |
+| 5 servers (~100 tools) | **~12,000 tokens** | Significant |
+Those tokens are consumed before your agent writes a single line. On an 8K context window, a single server claims ~30% of your budget; five servers would exceed it entirely. Even on 200K windows, ~12,000 tokens represents ~6% overhead, and it compounds across every message in a conversation.
 ## The Solution
-Droid Mode lets you:
+Droid Mode introduces **progressive disclosure** for MCP tools:
-1. Keep MCP servers `disabled: true` in `mcp.json` (they won't bloat context)
-2. Access them **on-demand** through progressive discovery
-3. Run procedural workflows that call MCP tools **outside** the LLM loop
+1. **Discover**: List available servers (`dm servers`) - *~10 tokens*
+2. **Index**: Browse tool names and descriptions (`dm index`) - *~50 tokens*
+3. **Hydrate**: Load full schemas only when needed (`dm hydrate`) - *zero tokens*
+4. **Execute**: Run tools via daemon or workflow (`dm call`, `dm run`) - *zero tokens*
-Because tools are hydrated only when needed, you can configure **any number of MCP servers** without impacting context. A project with 10 servers and 200+ tools costs zero tokens until you actually use one.
+> **Code-Mode Execution**: Steps 3-4 run as shell commands, not LLM tool calls. The LLM never sees the schemas or results unless you explicitly include them. This is why token cost is zero: operations happen outside the context window entirely.
-## Benchmarks
+> **Key Insight**: Servers with `disabled: true` in `mcp.json` are fully accessible to Droid Mode. The `disabled` flag tells Factory.ai Droid "don't inject these tools into context", but Droid Mode connects directly, bypassing context injection entirely.
-Independent benchmarks comparing Droid Mode against Native MCP (direct HTTP).
+### Token Efficiency
-### Summary
+| Scenario | Native MCP | Droid Mode | Savings |
+|----------|------------|------------|---------|
+| 3 tools used | ~330 tokens | **0 tokens** | 100% |
+| 10 tools | ~1,100 tokens | **0 tokens** | 100% |
+| 22 tools (full server) | ~2,400 tokens | **0 tokens** | 100% |
+| 5 servers (~100 tools) | ~12,000 tokens | **0 tokens** | 100% |
-| Configuration | Per-Tool Latency | vs Native MCP |
-|---------------|------------------|---------------|
-| **Droid Mode (Daemon)** | **616ms** | **11% faster** |
-| Native MCP (HTTP) | 695ms | baseline |
-| Droid Mode (No Daemon) | 2,365ms | 240% slower |
+Tools are loaded only when called. Your context window stays clean.
-### Single Tool Performance
+---
-| Call | No Daemon | Daemon | Native MCP |
-|------|-----------|--------|------------|
-| 1 | 2948ms | 845ms | 866ms |
-| 2 | 2442ms | 610ms | 789ms |
-| 3 | 2300ms | 613ms | 798ms |
-| 4 | 2415ms | 706ms | 742ms |
-| 5 | 2334ms | 617ms | 690ms |
-| **Average** | **2488ms** | **678ms** | **777ms** |
+## Architecture
-### Scale Performance (10 Tools)
+Droid Mode uses a **daemon architecture** to maintain persistent MCP connections, eliminating the overhead of spawning new processes for each tool call.
-| Metric | No Daemon | Daemon | Native MCP |
-|--------|-----------|--------|------------|
-| Total Time | 23.7s | **6.2s** | 6.9s |
-| Per-Tool Avg | 2365ms | **616ms** | 695ms |
+```mermaid
+flowchart LR
+    A[AI Agent] --> B[dm CLI]
+    B --> C[Unix Socket]
+    C --> D[Droid Mode Daemon]
+    D --> E[Connection Pool]
+    E --> F1[MCP Server 1]
+    E --> F2[MCP Server 2]
+    E --> F3[MCP Server N]
+```
-### Key Findings
+### How the Daemon Works
-1. **Daemon beats Native MCP** — 11-14% faster per-tool latency
-2. **Scales linearly** — No cumulative overhead at 10+ tools
-3. **Consistent** — ±262ms variance vs ±236ms for Native MCP
-4. **No-daemon is prohibitive** — 74% slower, only for one-off calls
+| Component | Purpose |
+|-----------|---------|
+| **Unix Socket** | IPC channel at `/tmp/dm-daemon.sock` for low-latency communication |
+| **Connection Pool** | Lazy-initialized clients with automatic lifecycle management |
+| **Auto-Warm** | Pre-connects frequently used servers on daemon start |
+| **Idle Pruning** | Closes unused connections after 10 minutes (configurable) |
-### Token Efficiency
+The daemon starts automatically on first `dm call`. Without it, each call spawns a fresh MCP process (~2.5s average). With the daemon, calls reuse pooled connections (~680ms average).
-Droid Mode eliminates schema overhead by keeping tool definitions out of the LLM context.
+---
-| Metric | Native MCP | Droid Mode |
-|--------|------------|------------|
-| Schema overhead (3 tools used) | 329 tokens | **0 tokens** |
-| Full server loaded (22 tools) | 2,400 tokens | **0 tokens** |
-| Typical setup (5 servers, ~100 tools) | ~11,000 tokens | **0 tokens** |
-| Total tokens (3-tool session) | 1,072 | **897** |
-| **Savings** | — | **16%** |
+## Performance Benchmarks
-Industry reports show MCP tools consuming 20-30% of context windows before any work begins. With Droid Mode, that overhead drops to zero — tools are loaded only when called.
+Independent benchmarks comparing Droid Mode against native MCP (direct stdio).
-Schema cost scales with tool count:
+| Configuration | Per-Tool Latency | 10-Tool Total | vs. Native |
+|---------------|------------------|---------------|------------|
+| **Droid Mode (Daemon)** | **678ms** | **6.8s** | **13% faster** |
+| Native MCP | 777ms | 7.8s | baseline |
+| Droid Mode (No Daemon) | 2,488ms | 24.9s | 220% slower |
-| Tools Enabled | Native MCP Cost | Droid Mode Cost |
-|---------------|-----------------|-----------------|
-| 3 tools | ~329 tokens | 0 |
-| 10 tools | ~1,100 tokens | 0 |
-| 22 tools (full server) | ~2,400 tokens | 0 |
+<details>
+<summary>Methodology</summary>
-This matters most when context is constrained or when running many sessions at scale.
+- **Hardware**: macOS Darwin 25.2.0
+- **MCP Server**: Context Repo MCP (`context-repo-mcp`)
+- **Protocol**: MCP 2025-06-18
+- **Runs**: 5 iterations averaged
+- **Date**: January 2026
-*Benchmarks: macOS Darwin 25.2.0, Context Repo MCP server, January 2026*
+Single-tool breakdown (5 runs):
-## Daemon Mode
+| Run | No Daemon | Daemon | Native MCP |
+|-----|-----------|--------|------------|
+| 1 | 2,948ms | 845ms | 866ms |
+| 2 | 2,442ms | 610ms | 789ms |
+| 3 | 2,300ms | 613ms | 798ms |
+| 4 | 2,415ms | 706ms | 742ms |
+| 5 | 2,334ms | 617ms | 690ms |
+| **Avg** | **2,488ms** | **678ms** | **777ms** |
-The daemon maintains persistent MCP connections via a Unix socket, eliminating stdio spawn overhead.
+</details>
-### How It Works
+**Key finding**: The daemon maintains persistent connections, beating native MCP by ~13% while eliminating all schema overhead from your context window.
-```
-┌──────────────────────────────────────────────────┐
-│            dm daemon (background)                │
-│  Connection Pool:                                │
-│  ├── context-repo: connected (15 calls)         │
-│  ├── convex: idle                               │
-│  └── firecrawl: connected (3 calls)             │
-└──────────────────────────────────────────────────┘
-                    │
-            dm call --server X
-                    ↓
-         ~620ms instead of ~2,900ms
-```
+---
-### Usage
+## Quick Start
 ```bash
-# Daemon auto-starts on first dm call
-dm call list_collections --server context-repo
-# Or start manually
-dm daemon start
-# Check connection status
-dm daemon status
+# 1. Initialize Droid Mode in your project
+npx droid-mode init
+```
-# Pre-warm specific servers
-dm daemon warm context-repo
+```
+✓ Initialized successfully   12 files created
-# Stop daemon
-dm daemon stop
+QUICK START
+1. Discover MCP servers        dm servers
+2. Index tools from server     dm index --server <name>
+3. Run a workflow              dm run --server <name> --tools a,b --workflow file.js
+```
-# Bypass daemon for direct call
-dm call tool --server X --no-daemon
+```bash
+# 2. Discover available MCP servers
+dm servers
 ```
-### Configuration
+```
+MCP Servers (from ~/.factory/mcp.json)
+┌─────────────────┬───────┬──────────────────┐
+│ Name            │ Type  │ Status           │
+├─────────────────┼───────┼──────────────────┤
+│ context-repo    │ stdio │ disabled (good!) │
+│ convex          │ stdio │ disabled (good!) │
+└─────────────────┴───────┴──────────────────┘
+```
 ```bash
-# Disable auto-warm for a server
-dm config context-repo autoWarm false
+# 3. Call a tool
+dm call list_collections --server context-repo
 ```
-The daemon is optional. Use `--no-daemon` to bypass it.
+```json
+{
+  "collections": [
+    { "id": "docs", "name": "Documentation", "count": 42 },
+    { "id": "code", "name": "Code Samples", "count": 18 }
+  ]
+}
+```
-## Installation
+The daemon starts automatically on first call. For manual control:
 ```bash
-npx droid-mode init
+dm daemon start    # Start daemon
+dm daemon status   # Check connections
+dm daemon stop     # Stop daemon
 ```
-This scaffolds the skill into `.factory/skills/droid-mode/`
+---
-## Quick Start
+## Progressive Disclosure Model
-```bash
-# 1. Discover available MCP servers
-dm servers
+| Level | Command | What You Get | Token Cost |
+|-------|---------|--------------|------------|
+| 1 | `dm servers` | List of configured MCP servers | ~10 |
+| 2 | `dm index --server X` | Tool names, descriptions, required params | ~50 |
+| 3 | `dm search "query" --server X` | Filtered tools matching keyword | ~20 |
+| 4 | `dm hydrate tool1 tool2 --server X` | Full JSON schemas + TypeScript types | on-demand |
+| 5 | `dm call tool --server X` | Execute tool, get result | 0 |
+| 6 | `dm run --workflow file.js --server X` | Procedural multi-tool workflow | 0 |
-# 2. List tools on a server
-dm index --server context-repo
+---
-# 3. Search for relevant tools
-dm search "collections" --server context-repo
+## Command Reference
-# 4. Call a tool directly
-dm call list_collections --server context-repo
+### Discovery
+| Command | Description |
+|---------|-------------|
+| `dm servers` | List all MCP servers from `mcp.json` |
+| `dm index --server <name>` | List tools with required parameters |
+| `dm search "<query>" --server <name>` | Search tools by keyword |
+| `dm hydrate <tools...> --server <name>` | Get full schemas + generate TypeScript types |
+### Execution
+| Command | Description |
+|---------|-------------|
+| `dm call <tool> --server <name>` | Call a single tool |
+| `dm run --workflow <file> --tools <a,b> --server <name>` | Execute procedural workflow |
+### Daemon
+| Command | Description |
+|---------|-------------|
+| `dm daemon start` | Start background daemon |
+| `dm daemon stop` | Stop daemon |
+| `dm daemon status` | Show connection pool status |
+| `dm daemon warm [server]` | Pre-warm server connection(s) |
+| `dm call ... --no-daemon` | Bypass daemon for single call |
+### Diagnostics
+| Command | Description |
+|---------|-------------|
+| `dm doctor --server <name>` | Diagnose connection issues |
+| `dm config <server> <key> <value>` | Configure server settings |
+---
+## Workflows
+Workflows let you run procedural logic across multiple MCP tools in a sandboxed environment.
-# 5. Run a workflow
+```javascript
+// my-workflow.js
+workflow = async () => {
+  const collections = await t.listCollections({})
+  log("Found", collections.length, "collections")
+  for (const col of collections.slice(0, 3)) {
+    const docs = await t.listDocuments({ collection: col.id })
+    log(`  ${col.name}: ${docs.length} documents`)
+  }
+  return { success: true, count: collections.length }
+}
+```
+```bash
 dm run --server context-repo \
-  --tools list_collections,get_document \
+  --tools list_collections,list_documents \
   --workflow my-workflow.js
 ```
-## Progressive Disclosure Model
+```
+Found 5 collections
+  Documentation: 42 documents
+  Code Samples: 18 documents
+  Architecture: 7 documents
+Workflow completed in 1.2s
+Result: { success: true, count: 5 }
+Trace: .factory/droid-mode/runs/context-repo/20260103T142531/run.json
+```
+### Sandbox Security
+Workflows execute in a restricted VM context:
+- **Blocked**: `require`, `import`, `fetch`, `process`, `eval`
+- **Allowed**: `t.*` (tool calls), `log()`, `sleep()`, `assert()`
+- **Traced**: Every tool call is logged with timing and result hash
-| Level | Command | Purpose |
-|-------|---------|---------|
-| 1 | `dm servers` | Discover available MCP servers |
-| 2 | `dm index --server X` | List tools on a server |
-| 3 | `dm search "..." --server X` | Find relevant tools |
-| 4 | `dm hydrate tool1 --server X` | Get full schemas |
-| 5 | `dm run --server X ...` | Execute workflow |
+---
-## Working with "Disabled" Servers
+## Configuration
-The key insight: `disabled: true` in `mcp.json` tells Droid "don't load these tools into context" - but Droid Mode can still access them directly!
+### Recommended `mcp.json` Setup
 ```json
-// ~/.factory/mcp.json - Recommended setup
 {
   "mcpServers": {
     "context-repo": {
       "type": "stdio",
       "command": "npx",
       "args": ["-y", "context-repo-mcp"],
-      "disabled": true  // ← Good! Keeps context clean
+      "disabled": true
+    },
+    "another-server": {
+      "type": "stdio",
+      "command": "node",
+      "args": ["./path/to/server.js"],
+      "disabled": true
     }
   }
 }
 ```
-## Workflow Example
+> Set `"disabled": true` for all MCP servers you want to access via Droid Mode. They remain fully functional, just not injected into your context window.
-Create a workflow file:
+### Configuration Locations
-```javascript
-// my-workflow.js
-workflow = async () => {
-  const collections = await t.listCollections({})
-  log("Found", collections.length, "collections")
-  const docs = await t.listDocuments({})
-  return { collections, docs }
-}
-```
+- **Project**: `.factory/mcp.json`
+- **User**: `~/.factory/mcp.json` (user config takes precedence)
-Run it:
+For more information on Factory.ai Droid and MCP configuration, see the [Factory.ai documentation](https://docs.factory.ai/).
-```bash
-dm run --server context-repo \
-  --tools list_collections,list_documents \
-  --workflow my-workflow.js
-```
+---
-## Commands Reference
+## Artifacts
-| Command | Description |
-|---------|-------------|
-| `dm servers` | List all available MCP servers |
-| `dm index --server <name>` | List tools with required parameters |
-| `dm search "<query>" --server <name>` | Search tools by keyword |
-| `dm hydrate <tools...> --server <name>` | Get full schemas |
-| `dm call <tool> --server <name>` | Call a tool directly |
-| `dm run --workflow <file> --tools <a,b> --server <name>` | Execute workflow |
-| `dm doctor --server <name>` | Diagnose connection |
-| `dm daemon start` | Start background daemon |
-| `dm daemon stop` | Stop daemon |
-| `dm daemon status` | Show connection pool status |
-| `dm daemon warm [server]` | Pre-warm server connections |
-| `dm config <server> autoWarm false` | Disable auto-warm for server |
+All outputs are written to `.factory/droid-mode/`:
+| Path | Contents |
+|------|----------|
+| `cache/<server>/tools.json` | Cached tool inventory |
+| `hydrated/<server>/<timestamp>/schemas.json` | Full JSON schemas |
+| `hydrated/<server>/<timestamp>/types.d.ts` | Generated TypeScript types |
+| `runs/<server>/<timestamp>/run.json` | Workflow execution trace |
+---
+## Requirements & Limitations
+### Requirements
+- **Node.js** >= 18
+- **Factory.ai Droid** CLI
+- MCP servers configured in `~/.factory/mcp.json` or `.factory/mcp.json`
+### Current Limitations
+- **Windows**: Daemon mode uses Unix sockets (`/tmp/dm-daemon.sock`). Windows support is not yet implemented.
+- **HTTP Transport**: Exists in code but documentation pending.
+- **Hooks**: PreToolUse hooks exist in `examples/hooks/` but are not yet documented.
+---
+## Experimental Status
+> **⚠️ This is experimental software (v0.0.x)**
+>
+> Droid Mode is under active development to improve MCP usability in Factory.ai Droid. The API may change between versions.
+>
+> **Feedback welcome!** Open an issue on GitHub or reach out on X.
+---
 ## Design Philosophy
-> **Treat MCP as infrastructure, Skills as capability boundaries, and code as a reasoning amplifier — not as an authority.**
+> Treat MCP as infrastructure, Skills as capability boundaries, and code as a reasoning amplifier, not as authority.
 Inspired by [Cloudflare's Code Mode](https://blog.cloudflare.com/code-mode/) concept, adapted for Factory.ai's Skill architecture.
-## Requirements
-- Node.js >= 18
-- Factory.ai Droid CLI
-- MCP servers configured in `~/.factory/mcp.json`
+---
 ## License
@@ -254,3 +357,11 @@ MIT
 ## Author
 [GitMaxd](https://github.com/Gitmaxd) · [@gitmaxd](https://x.com/gitmaxd)
+---
+<div align="center">
+**[GitHub](https://github.com/Gitmaxd/droid-mode)** · **[npm](https://www.npmjs.com/package/droid-mode)** · **[Issues](https://github.com/Gitmaxd/droid-mode/issues)**
+</div>
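
Review note: the new "Performance Benchmarks" summary table in the README diff above can be cross-checked against the per-run figures in its "Single-tool breakdown" table. The sketch below is not part of the package; the helper names `mean` and `pctVsBaseline` are illustrative, and it assumes the "10-Tool Total" column is the per-tool average multiplied by ten, which is what the published numbers appear consistent with.

```javascript
// Per-run latencies (ms), copied from the "Single-tool breakdown" table.
const runs = {
  noDaemon: [2948, 2442, 2300, 2415, 2334],
  daemon: [845, 610, 613, 706, 617],
  native: [866, 789, 798, 742, 690],
};

// Arithmetic mean, rounded to the nearest millisecond.
const mean = (xs) => Math.round(xs.reduce((a, b) => a + b, 0) / xs.length);

// Signed percentage difference relative to a baseline (negative means faster).
const pctVsBaseline = (value, baseline) =>
  Math.round(((value - baseline) / baseline) * 100);

const avg = {
  noDaemon: mean(runs.noDaemon),
  daemon: mean(runs.daemon),
  native: mean(runs.native),
};

// Assumption: 10-tool totals are extrapolated as 10 x per-tool average,
// expressed in seconds with one decimal place.
const tenToolTotalSec = (avgMs) => Math.round((avgMs * 10) / 100) / 10;

const summary = {
  daemonVsNativePct: pctVsBaseline(avg.daemon, avg.native),
  noDaemonVsNativePct: pctVsBaseline(avg.noDaemon, avg.native),
  daemonTotalSec: tenToolTotalSec(avg.daemon),
  nativeTotalSec: tenToolTotalSec(avg.native),
  noDaemonTotalSec: tenToolTotalSec(avg.noDaemon),
};

console.log(avg);
console.log(summary);
```

Under these assumptions the recomputed averages (2,488ms, 678ms, 777ms), the relative differences (about 13% faster and about 220% slower than native) and the 10-tool totals (6.8s, 7.8s, 24.9s) match the values in the new summary table.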

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "droid-mode",
-  "version": "0.0.12",
+  "version": "0.0.14",
   "description": "Progressive Code-Mode MCP integration for Factory.ai Droid - access MCP tools without context bloat",
   "type": "module",
   "main": "dist/cli.js",