npm - miii-cli - Versions diffs - 1.3.0 → 1.3.1 - Mend

miii-cli 1.3.0 → 1.3.1


Files changed (12)

package/README.md +159 -127
package/dist/config.js +1 -1
package/dist/llm/stream.js +181 -18
package/dist/mcp/client.js +3 -1
package/dist/memory/extractor.js +34 -3
package/dist/tasks/compactor.js +4 -1
package/dist/tui/components/ConfigPicker.js +12 -2
package/dist/tui/components/InputArea.js +5 -2
package/dist/tui/hooks/useRunLoop.js +156 -80
package/dist/tui/hooks/useSubmit.js +12 -0
package/dist/tui/printer.js +6 -4
package/package.json +1 -1

package/README.md CHANGED

@@ -1,19 +1,20 @@
-# Miii — Local-First AI Coding Agent
+# miii — Ollama Coding CLI. 176 KB. No API Key.
-> **The only coding CLI that runs fully local or cloud — any model, zero lock-in, zero monthly bill.**
+> **Claude Code UX. Ollama models. No invoice.**
 ![MIII Demo](mii-cli.gif)
 [![npm version](https://img.shields.io/npm/v/miii-cli)](https://www.npmjs.com/package/miii-cli)
-[![npm downloads](https://img.shields.io/npm/dm/miii-cli)](https://www.npmjs.com/package/miii-cli)
 [![license](https://img.shields.io/npm/l/miii-cli)](LICENSE)
 [![node](https://img.shields.io/node/v/miii-cli)](https://nodejs.org)
+**176 KB · no API key · works offline**
 ---
-**Miii is a fully autonomous coding agent that runs entirely on your machine.** It plans, edits files, runs your tests, searches the web, indexes your codebase semantically, and iterates until the job is done — all without a single byte of your code leaving your network.
+Buy hardware once. Pay for AI never.
-Zero subscription. Zero cloud dependency. Zero Python overhead. **Lightning fast startup.**
+Your code never leaves your machine. Nothing sent to Anthropic, OpenAI, or anyone. If you're already running Ollama, miii adds $0 to your stack.
 ```bash
 npm install -g miii-cli && miii
@@ -23,13 +24,59 @@ npm install -g miii-cli && miii
 ## Why Miii Exists
-Claude Code is impressive. It's also cloud-only, costs $20–200/month, and sends every line of your codebase to a server you don't control.
+**You're probably paying for something miii does for free.**
-OpenCode and Codex CLI have the same problem — they're all cloud-first, all locked to specific providers, and all charge you indefinitely for the privilege of reading your private code.
+Claude Code bills against your Anthropic API key. miii runs open models on Ollama — Llama, Mistral, Qwen, Phi. Fully local. $0. Claude Code has no built-in undo for file changes. A bad edit is a bad edit. Miii checkpoints every file before touching it.
-**Miii flips the model.** Run on Ollama: $0/month, fully offline, code never leaves your machine. Switch to Anthropic or OpenAI when you need cloud power. Change providers live inside the app — no config files, no restarts.
+The gap is what miii adds on top: file checkpoints before every edit, npm skills, live model switching, and full air-gap support.
-Your compute. Your data. Your rules.
+- **16 GB RAM, a GPU** — if you're already running Ollama, miii adds $0 to your stack
+- **Try Llama 3, Mistral, Qwen, Phi** side by side without switching tools
+- **Literally cannot use cloud AI** — miii with Ollama is purpose-built for zero-internet environments
+---
+## What Miii Actually Does
+Not a chatbot with a file-write button. Miii is a **full autonomous agent loop** — reasons, plans, acts, self-corrects until the task is done.
+1. Describe a goal in plain English
+2. Miii reads your codebase, maps the changes, shows the plan
+3. Asks permission before touching anything — every edit, command, delete
+4. Shows exact diff of what changes *before* you approve
+5. Runs tests. If they fail, reads the error, fixes autonomously
+6. Every file checkpointed — hit Esc and everything rolls back
+---
+## What a Real Session Looks Like
+```
+> refactor the auth module to use JWT instead of sessions
+  ● thinking…
+  ● read_file src/auth/session.ts  (42 lines)
+  ● read_file src/middleware/auth.ts  (28 lines)
+  ─ plan  (2 actions)
+    ◦ edit_file  src/auth/session.ts
+    ◦ edit_file  src/middleware/auth.ts
+  ⚠  edit_file  src/auth/session.ts
+  ┌─ diff preview ──────────────────────┐
+  │ - const session = req.session.user  │
+  │ + const token = verifyJWT(req)      │
+  └─────────────────────────────────────┘
+  y approve   s approve all   n deny
+  > s
+  ● edit_file src/auth/session.ts    done
+  ● edit_file src/middleware/auth.ts  done
+  ● run_tests  ✅ passed
+  ─ done in 14.2s  ·  branch: miii/task-2025-05-17-14-32
+```
+Parallel file reads. Diff preview before approval. Auto-branched off `main`. Tests ran. Session over.
 ---
@@ -39,169 +86,153 @@ Your compute. Your data. Your rules.
 |---|:---:|:---:|:---:|:---:|:---:|
 | Monthly cost | **$0** | $20–200 | API cost | API cost | $0 |
 | Bundle size | **176 KB** | ~50 MB | ~30 MB | ~20 MB | ~200 MB |
+| Startup time | **<100ms** | ~2s | ~1s | ~1s | ~4s |
 | Local / offline (Ollama) | **✅** | ❌ | partial | ❌ | ⚠️ |
 | Air-gapped | **✅** | ❌ | ❌ | ❌ | ❌ |
-| Switch provider live | **✅** | ❌ | ❌ | ❌ | ❌ |
+| Any model | **✅** | ❌ | partial | ❌ | ✅ |
 | File checkpoints (undo) | **✅** | ❌ | ❌ | ❌ | ❌ |
-| Permission gates | **✅** | ✅ | partial | ✅ | ❌ |
-| MCP client | **✅** | ✅ | ✅ | ❌ | ❌ |
+| Diff preview before approve | **✅** | ❌ | ❌ | ❌ | ❌ |
+| Git auto-branch on edit | **✅** | ❌ | ❌ | ❌ | ❌ |
+| Switch provider live | **✅** | ❌ | ❌ | ❌ | ❌ |
+| Native tool_calls (Anthropic + OpenAI) | **✅** | ✅ | ✅ | ✅ | ❌ |
+| Parallel read-only tools | **✅** | partial | ❌ | ❌ | ❌ |
+| Two-phase plan→execute | **✅** | ❌ | ❌ | ❌ | ❌ |
+| Live streaming toggle | **✅** | always on | always on | always on | ❌ |
 | Semantic codebase index | **✅** | ❌ | ❌ | ❌ | ❌ |
-| Skill/extension system | **✅** | plugins | ❌ | ❌ | ❌ |
-| Startup time | **<100ms** | ~2s | ~1s | ~1s | ~4s |
+| npm skills | **✅** | plugins | ❌ | ❌ | ❌ |
+| MCP client | **✅** | ✅ | ✅ | ❌ | ❌ |
 | License | **MIT** | Proprietary | MIT | MIT | Apache 2.0 |
 ---
-## How it Works
+## Eight Core Capabilities
-Miii isn't just autocomplete—it's a **full autonomous agent loop** that reasons through complex tasks:
+**Local / Offline** — Ollama runs on your machine. No internet required after model pull.
-1. You describe a goal
-2. Miii reads your codebase, plans the changes, edits the files
-3. It asks your permission before touching anything (edit, delete, run commands)
-4. It runs your test suite automatically after every change
-5. If tests fail, it reads the error, fixes the code, re-runs
-6. It repeats until the work is done — and checkpoints every file so you can abort safely
+**Air-Gapped Ready** — regulated industries, defense, offline infrastructure. miii with Ollama works where cloud literally cannot.
----
-## What a Session Looks Like
-```
-> refactor the auth module to use JWT instead of sessions
+**Any Model** — Llama 3, Mistral, Qwen, Phi, or switch to Anthropic/OpenAI live. One tool, every model.
-  ● Researching: refactor auth module to use JWT
-  ● Reading src/auth/session.ts
-    Read 42 lines
-  ● Reading src/middleware/auth.ts
-    Read 28 lines
+**File Checkpoints** — every file snapshotted before edit. Abort = full rollback. No bad edits stick.
-  ─ plan (2 actions)
-    ◦ edit_file src/auth/session.ts
-    ◦ edit_file src/middleware/auth.ts
+**Permission Gates + Diff Preview** — approve every write, delete, or command. See the exact diff before you say yes.
-  ⚠ edit_file  src/auth/session.ts   y approve  n deny
-  > y
+**MCP Client** — plug in any MCP-compatible tool server. Tools discovered automatically.
-  ● edit_file src/auth/session.ts
-    Wrote 12 lines
-  ● edit_file src/middleware/auth.ts
-    Wrote 8 lines
-  ● run_tests
-    ✅ Tests passed
+**npm Skills** — extend miii with plain Markdown files or npm packages. Ship reusable agent behaviors to your whole team.
-  ─ refactor done — 2 file(s) processed
-```
+**$0 / Month** — no subscription, no invoice, no API key required for local use.
 ---
-## 🚀 Core Capabilities
-**🔒 Privacy-First, Local by Default**
-Run on Ollama and your code never leaves your machine. No account. No API key. No monthly bill. Switch to Anthropic or OpenAI when you need it — one command, live, mid-session.
-**🔄 Live Provider Switching**
-Type `/config` to open an interactive picker. Arrow-navigate between Ollama, Anthropic, and OpenAI-compatible endpoints. Change model, API key, base URL, or Tavily key without restarting. Config saves automatically.
+## Features Worth Knowing
-**🛡 Permission Gates + File Checkpoints**
-Miii asks before every edit, delete, or shell command — just like Claude Code. Every file is checkpointed before it's touched. Hit Esc to abort and all changes roll back automatically.
+**Git Auto-Branch** — first approved edit auto-creates `miii/task-YYYY-MM-DD-HH-MM`. Your `main` is never touched until you decide.
-**🔍 Semantic Codebase Indexing**
-Build a vector index of your entire codebase using local embeddings. Ask "where is the auth logic?" and Miii finds it by meaning, not keyword. No data leaves your machine.
+**Parallel Read-Only Tools** — reading five files + git status + web search? All fire at once. Write ops stay sequential. Speed where safe, safety where it matters.
-**🧠 Deep Think Engine**
-Before answering complex questions, Miii runs a constrained research phase — reading files, checking git history, searching the web — then synthesizes a grounded answer.
-**🌐 Real-Time Web Access**
-Tavily-powered web search, built in. Ask about breaking changes in a library you just upgraded. Get an answer that's actually current.
-**🛠 Surgical File Editing**
-`patch_file` replaces exact strings in your files. No full rewrites. No formatting destruction. Exactly the change, nothing more.
-**🔁 Self-Healing Test Loop**
-Runs `npm test` after every file change. If something breaks, reads the failure trace and fixes it autonomously — up to 3 retries before surfacing the issue.
+**Two-Phase Plan → Execute**
+```
+/plan exec refactor the payment module
+```
+First turn: numbered plan, tools disabled — you read it, decide. Second turn: execution with plan as context. No surprises.
-**📂 Persistent Sessions**
-Pick up exactly where you left off. Named sessions mean your context, history, and goal survive terminal restarts.
+**Native Tool Calls** — Anthropic uses `tool_use` blocks, OpenAI uses `tool_calls` arrays, exactly as the API intended. Faster, more reliable, less hallucination. Ollama uses compact XML fallback.
-**📦 Skill System**
-Extend Miii with plain Markdown files or npm packages. Ship reusable agent behaviors as versioned packages your whole team can pull.
+**Live Streaming Toggle** — turn on in `/config` to watch tokens appear in real time. Turn off for clean batch output. Toggle mid-session, no restart.
-**🔌 MCP Client**
-Connect any MCP-compatible tool server. Miii discovers tools automatically and makes them available to the agent.
+**Semantic Codebase Search** — local vector index, no embeddings sent anywhere. `/index build` once. Ask "where is the payment logic?" by meaning, not grep.
 ---
-## ⚡ Quick Start
+## Quick Start
 ```bash
-# 1. Start Ollama and pull a model
+# Local — free, offline (recommended)
 ollama pull qwen2.5-coder:7b
+npm install -g miii-cli
+cd your-project && miii
-# 2. Install Miii
+# Anthropic Claude
 npm install -g miii-cli
+ANTHROPIC_API_KEY=sk-... miii
-# 3. Go to your project and start
-cd your-project
-miii
+# OpenAI or compatible endpoint
+npm install -g miii-cli
+miii   # set key + base URL in /config
 ```
-No API keys. No account. No sign-up form. First run walks you through setup interactively.
+Hardware requirements are real — this runs on your machine, not a server farm.
+| | Minimum | Recommended |
+|---|---|---|
+| RAM | 16 GB | 32 GB+ |
+| GPU | integrated | dedicated |
+| Storage | 10 GB | 20 GB+ |
 ---
-## ⌨️ Power Commands
+## Commands
 | Command | What it does |
 |---|---|
-| `/config` | Open interactive picker — change provider, model, API key, base URL, Tavily key live |
-| `/think <question>` | Deep research: reads files + web, then answers |
-| `/refactor <goal>` | Autonomous multi-file refactor with test validation |
-| `/index build` | Build semantic vector index of your codebase |
-| `/index search <query>` | Find code by meaning, not string match |
-| `/git review` | AI reviews your current diff for bugs and issues |
-| `/git commit <msg>` | Stage everything and commit in one shot |
-| `/plan <topic>` | Structured planning mode before you write a line |
-| `/model <name>` | Hot-swap your LLM mid-conversation |
-| `/session <name>` | Switch between named project sessions |
-| `/watch <path>` | Monitor files for changes and trigger agent reactions |
-| `@filename` | Inject any file directly into context |
----
-## Semantic Codebase Indexing
-For large codebases, Miii builds and queries a local vector index — no third-party APIs, no embeddings sent anywhere.
-```bash
-# Pull an embedding model (one time)
-ollama pull nomic-embed-text
-# Index your project
-/index build
-# The agent calls search_codebase automatically when it needs to find code by concept
-```
+| `/config` | Interactive picker — provider, model, API key, base URL, Tavily, streaming |
+| `/plan exec <task>` | Two-phase: plan turn (no tools) → execute with plan as context |
+| `/think <question>` | Deep research: reads files + web, synthesizes answer |
+| `/index build` | Build local semantic vector index |
+| `/index search <q>` | Find code by concept, not string match |
+| `/git review` | AI reviews current diff — bugs, risks, style |
+| `/git commit <msg>` | Stage everything, commit in one shot |
+| `/model <name>` | Hot-swap LLM mid-conversation |
+| `/session <name>` | Named sessions — resume exactly where you left off |
+| `@filename` | Inject any file into context |
+Commands open in a picker — select to insert into input, Enter to run.
 ---
 ## Configuration
-**Interactive (recommended):** type `/config` inside Miii to open the picker.
+**Interactive:** type `/config` inside miii.
-**File-based:** drop a `.miii.json` in your project root or `~/.config/miii/config.json` globally:
+**File-based:** `.miii.json` in project root or `~/.config/miii/config.json` globally:
 ```json
 {
-  "model": "qwen2.5-coder:7b",
   "provider": "ollama",
   "baseUrl": "http://localhost:11434",
   "gitContext": true,
+  "streaming": false,
   "embedModel": "nomic-embed-text"
 }
 ```
-Providers: `ollama` (local, free) · `anthropic` (Claude API) · `openai-compat` (OpenAI or any compatible endpoint)
+---
+## MCP — Connect Any Tool Server
+```json
+{
+  "mcpServers": {
+    "postgres": {
+      "command": "npx",
+      "args": ["-y", "@modelcontextprotocol/server-postgres", "postgresql://localhost/mydb"]
+    }
+  }
+}
+```
+Drop into global config. Tools discovered automatically.
+---
+## Semantic Index Setup
+```bash
+ollama pull nomic-embed-text   # one time
+/index build                   # inside your project
+# agent calls search_codebase automatically from here
+```
 ---
@@ -214,27 +245,28 @@ cd miii-cli && npm install && npm run build && npm link
 ---
-## Who Should Use Miii
+## Who This Is For
-- **Privacy-conscious developers** — won't send proprietary code to Anthropic or OpenAI
-- **Cost-sensitive teams** — API bills compound; Ollama is $0
-- **Air-gapped environments** — regulated industries, defense, offline infra
-- **Model experimenters** — want to try llama3, mistral, qwen, Claude side-by-side without switching tools
+**Privacy-conscious developers** — proprietary code stays on your machine, always.
----
+**Cost-sensitive teams** — API bills compound for every developer on the team, every month.
-## The Bottom Line
+**Air-gapped environments** — regulated industries, defense, offline infrastructure where cloud is not an option.
-The AI coding tools you're paying for right now will raise their prices, change their terms, and keep reading your code. **Miii won't.** It's MIT licensed, runs locally, and gets better every time Ollama ships a new model.
+**Model experimenters** — benchmark Llama 3 vs Qwen vs Claude vs GPT-4o in the same workflow.
-If this saves you time or money, **star the repo** — it's the only metric that tells other engineers this is worth their attention.
+**Anyone who's had an AI silently rewrite something they didn't want rewritten.**
-**[⭐ Star on GitHub](https://github.com/maruakshay/miii-cli)**
+---
-> Built by [@maruakshay](https://github.com/maruakshay) — open to PRs, issues, and model recommendations.
+The AI coding tools you're paying for will raise prices, change terms, and keep reading your code. Miii won't. MIT licensed, runs locally, gets better every time Ollama ships a new model.
----
+**If this is the tool you've been waiting for — [⭐ star it](https://github.com/maruakshay/miii-cli) and tell someone.**
-## License
+> Built by [@maruakshay](https://github.com/maruakshay) — PRs, issues, and model recommendations welcome.
+>
+> [miii.in](https://www.miii.in)
+---
 MIT — do whatever you want with it.

package/dist/config.js CHANGED

@@ -6,7 +6,7 @@ const defaults = {
     provider: 'ollama',
     baseUrl: 'http://localhost:11434',
 };
-const ALLOWED_KEYS = new Set(['model', 'provider', 'baseUrl', 'systemPrompt', 'apiKey', 'gitContext', 'tavilyApiKey', 'embedModel']);
+const ALLOWED_KEYS = new Set(['model', 'provider', 'baseUrl', 'systemPrompt', 'apiKey', 'gitContext', 'streaming', 'tavilyApiKey', 'embedModel']);
 const PROJECT_CONFIG = join(process.cwd(), '.miii.json');
 const GLOBAL_CONFIG = join(homedir(), '.config', 'miii', 'config.json');
 export function saveConfig(config) {
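
The hunk above only adds `'streaming'` to the allow-list; it does not show how `ALLOWED_KEYS` is consumed. As a minimal sketch, assuming the set is used to drop unknown keys from a parsed config object, the effect could look like this. `pickAllowed` and the sample object are hypothetical and do not appear in the package.

```javascript
// Allow-list copied from the 1.3.1 side of the hunk.
const ALLOWED_KEYS = new Set(['model', 'provider', 'baseUrl', 'systemPrompt', 'apiKey', 'gitContext', 'streaming', 'tavilyApiKey', 'embedModel']);

// Hypothetical helper (not in the diff): keep only keys present in the allow-list.
function pickAllowed(raw) {
    const out = {};
    for (const [key, value] of Object.entries(raw ?? {})) {
        if (ALLOWED_KEYS.has(key))
            out[key] = value;
    }
    return out;
}

// Sample input: 'streaming' survives in 1.3.1, the unknown key is dropped.
const parsed = { provider: 'ollama', streaming: true, unknownFlag: 1 };
const cleaned = pickAllowed(parsed);
console.log(cleaned);
```

Under that assumption, a `"streaming"` entry in `.miii.json` would have been filtered out in 1.3.0 and is retained in 1.3.1.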

package/dist/llm/stream.js CHANGED

@@ -1,9 +1,7 @@
-// Transient errors worth retrying: rate limits + server-side faults
 const RETRYABLE_STATUS = new Set([429, 500, 502, 503, 529]);
 const MAX_RETRIES = 4;
 const MAX_DELAY_MS = 30_000;
 function retryDelay(attempt) {
-    // Exponential backoff: 1s → 2s → 4s → 8s, capped at 30s, ±20% jitter
     const base = 1_000 * Math.pow(2, attempt);
     const capped = Math.min(base, MAX_DELAY_MS);
     return Math.round(capped * (0.8 + Math.random() * 0.4));
@@ -43,6 +41,37 @@ async function fetchWithRetry(url, init, signal, onRetry) {
     }
     throw new Error('fetchWithRetry: exhausted retries without returning');
 }
+// Convert Tool params string to JSON Schema for native tool_calls APIs
+function paramsToSchema(paramsStr) {
+    try {
+        const obj = JSON.parse(paramsStr);
+        const properties = {};
+        const required = [];
+        for (const [key, typeStr] of Object.entries(obj)) {
+            const isOptional = typeStr.toLowerCase().includes('optional');
+            const isArray = typeStr.toLowerCase().includes('[]') || typeStr.toLowerCase().startsWith('array');
+            const base = typeStr.split(' ')[0].toLowerCase().replace('[]', '');
+            if (isArray) {
+                properties[key] = { type: 'array', items: { type: 'string' } };
+            }
+            else if (base === 'boolean') {
+                properties[key] = { type: 'boolean' };
+            }
+            else if (base === 'number') {
+                properties[key] = { type: 'number' };
+            }
+            else {
+                properties[key] = { type: 'string' };
+            }
+            if (!isOptional)
+                required.push(key);
+        }
+        return { type: 'object', properties, required };
+    }
+    catch {
+        return { type: 'object', properties: {}, required: [] };
+    }
+}
 export async function warmup(provider, baseUrl, model) {
     if (provider !== 'ollama')
         return;
@@ -121,12 +150,21 @@ async function chatOllama(cfg) {
     }
 }
 async function chatOpenAI(cfg) {
-    const { model, messages, baseUrl, apiKey, signal, onDone, onError, onUsage, onChunk, onRetry } = cfg;
+    const { model, messages, baseUrl, apiKey, signal, onDone, onError, onUsage, onChunk, onRetry, tools, toolChoice } = cfg;
+    const body = { model, messages, stream: !!onChunk };
+    if (tools?.length) {
+        body.tools = tools.map(t => ({
+            type: 'function',
+            function: { name: t.name, description: t.description, parameters: paramsToSchema(t.params) },
+        }));
+        if (toolChoice === 'none')
+            body.tool_choice = 'none';
+    }
     try {
         const res = await fetchWithRetry(`${baseUrl}/v1/chat/completions`, {
             method: 'POST',
             headers: { 'Content-Type': 'application/json', Authorization: `Bearer ${apiKey ?? 'local'}` },
-            body: JSON.stringify({ model, messages, stream: !!onChunk }),
+            body: JSON.stringify(body),
         }, signal, onRetry);
         if (!res.ok) {
             onError(new Error(`LLM ${res.status}: ${await res.text()}`));
@@ -135,13 +173,26 @@ async function chatOpenAI(cfg) {
         if (!onChunk) {
             const obj = await res.json();
             onUsage?.(obj?.usage?.prompt_tokens ?? 0, obj?.usage?.completion_tokens ?? 0);
-            await onDone(obj?.choices?.[0]?.message?.content ?? '');
+            const message = obj?.choices?.[0]?.message;
+            let text = message?.content ?? '';
+            if (message?.tool_calls?.length) {
+                for (const tc of message.tool_calls) {
+                    let args = {};
+                    try {
+                        args = JSON.parse(tc.function?.arguments ?? '{}');
+                    }
+                    catch { }
+                    text += `\n<tool_call>\n{"name": ${JSON.stringify(tc.function?.name)}, "args": ${JSON.stringify(args)}}\n</tool_call>`;
+                }
+            }
+            await onDone(text);
             return;
         }
         const reader = res.body.getReader();
         const decoder = new TextDecoder();
         let full = '';
         let buf = '';
+        const tcAccum = {};
         while (true) {
             const { done, value } = await reader.read();
             if (done)
@@ -157,15 +208,41 @@ async function chatOpenAI(cfg) {
                     continue;
                 try {
                     const obj = JSON.parse(data);
-                    const chunk = obj?.choices?.[0]?.delta?.content ?? '';
+                    const delta = obj?.choices?.[0]?.delta;
+                    if (!delta)
+                        continue;
+                    const chunk = delta.content ?? '';
                     if (chunk) {
                         full += chunk;
                         onChunk(chunk);
                     }
+                    if (delta.tool_calls) {
+                        for (const tc of delta.tool_calls) {
+                            const idx = tc.index ?? 0;
+                            if (!tcAccum[idx])
+                                tcAccum[idx] = { id: '', name: '', args: '' };
+                            if (tc.id)
+                                tcAccum[idx].id = tc.id;
+                            if (tc.function?.name)
+                                tcAccum[idx].name += tc.function.name;
+                            if (tc.function?.arguments)
+                                tcAccum[idx].args += tc.function.arguments;
+                        }
+                    }
                 }
                 catch { }
             }
         }
+        // Serialize accumulated tool_calls to XML for run loop compatibility
+        for (const idx of Object.keys(tcAccum).map(Number).sort((a, b) => a - b)) {
+            const tc = tcAccum[idx];
+            let args = {};
+            try {
+                args = JSON.parse(tc.args);
+            }
+            catch { }
+            full += `\n<tool_call>\n{"name": ${JSON.stringify(tc.name)}, "args": ${JSON.stringify(args)}}\n</tool_call>`;
+        }
         await onDone(full);
     }
     catch (err) {
@@ -174,20 +251,30 @@ async function chatOpenAI(cfg) {
     }
 }
 async function chatAnthropic(cfg) {
-    const { model, messages, baseUrl, apiKey, signal, onDone, onError, onUsage, onRetry } = cfg;
+    const { model, messages, baseUrl, apiKey, signal, onDone, onError, onUsage, onChunk, onRetry, tools, toolChoice } = cfg;
     const url = baseUrl && baseUrl !== 'http://localhost:11434'
         ? `${baseUrl}/v1/messages`
         : 'https://api.anthropic.com/v1/messages';
     const systemParts = messages.filter(m => m.role === 'system').map(m => m.content);
     const filtered = messages.filter(m => m.role !== 'system');
+    const body = {
+        model,
+        max_tokens: 8192,
+        stream: !!onChunk,
+        messages: filtered,
+    };
+    if (systemParts.length)
+        body.system = systemParts.join('\n\n');
+    if (tools?.length) {
+        body.tools = tools.map(t => ({
+            name: t.name,
+            description: t.description,
+            input_schema: paramsToSchema(t.params),
+        }));
+        if (toolChoice === 'none')
+            body.tool_choice = { type: 'none' };
+    }
     try {
-        const body = {
-            model,
-            max_tokens: 8192,
-            messages: filtered,
-        };
-        if (systemParts.length)
-            body.system = systemParts.join('\n\n');
         const res = await fetchWithRetry(url, {
             method: 'POST',
             headers: {
@@ -201,10 +288,86 @@ async function chatAnthropic(cfg) {
             onError(new Error(`Anthropic ${res.status}: ${await res.text()}`));
             return;
         }
-        const obj = await res.json();
-        const text = (obj.content ?? []).filter(c => c.type === 'text').map(c => c.text).join('');
-        onUsage?.(obj.usage?.input_tokens ?? 0, obj.usage?.output_tokens ?? 0);
-        await onDone(text);
+        if (!onChunk) {
+            const obj = await res.json();
+            let fullText = '';
+            for (const block of obj?.content ?? []) {
+                if (block.type === 'text')
+                    fullText += block.text ?? '';
+                else if (block.type === 'tool_use') {
+                    const args = block.input ?? {};
+                    fullText += `\n<tool_call>\n{"name": ${JSON.stringify(block.name ?? '')}, "args": ${JSON.stringify(args)}}\n</tool_call>`;
+                }
+            }
+            onUsage?.(obj?.usage?.input_tokens ?? 0, obj?.usage?.output_tokens ?? 0);
+            await onDone(fullText);
+            return;
+        }
+        const reader = res.body.getReader();
+        const decoder = new TextDecoder();
+        let buf = '';
+        let fullText = '';
+        let promptTokens = 0;
+        let completionTokens = 0;
+        // Track native tool_use content blocks
+        const toolBlocks = [];
+        let activeToolIdx = -1;
+        while (true) {
+            const { done, value } = await reader.read();
+            if (done)
+                break;
+            buf += decoder.decode(value, { stream: true });
+            const lines = buf.split('\n');
+            buf = lines.pop() ?? '';
+            for (const line of lines) {
+                if (!line.startsWith('data: '))
+                    continue;
+                const data = line.slice(6).trim();
+                if (!data || data === '[DONE]')
+                    continue;
+                try {
+                    const evt = JSON.parse(data);
+                    if (evt.type === 'message_start') {
+                        promptTokens = (evt.message?.usage?.input_tokens) ?? 0;
+                    }
+                    else if (evt.type === 'content_block_start') {
+                        const block = evt.content_block;
+                        if (block.type === 'tool_use') {
+                            activeToolIdx = toolBlocks.length;
+                            toolBlocks.push({ id: block.id ?? '', name: block.name ?? '', inputJson: '' });
+                        }
+                    }
+                    else if (evt.type === 'content_block_delta') {
+                        const delta = evt.delta;
+                        if (delta.type === 'text_delta' && delta.text) {
+                            fullText += delta.text;
+                            onChunk?.(delta.text);
+                        }
+                        else if (delta.type === 'input_json_delta' && activeToolIdx >= 0) {
+                            toolBlocks[activeToolIdx].inputJson += delta.partial_json ?? '';
+                        }
+                    }
+                    else if (evt.type === 'content_block_stop') {
+                        activeToolIdx = -1;
+                    }
+                    else if (evt.type === 'message_delta') {
+                        completionTokens = (evt.usage?.output_tokens) ?? 0;
+                    }
+                }
+                catch { }
+            }
+        }
+        // Serialize native tool_use blocks to XML for run loop compatibility
+        for (const block of toolBlocks) {
+            let args = {};
+            try {
+                args = JSON.parse(block.inputJson);
+            }
+            catch { }
+            fullText += `\n<tool_call>\n{"name": ${JSON.stringify(block.name)}, "args": ${JSON.stringify(args)}}\n</tool_call>`;
+        }
+        onUsage?.(promptTokens, completionTokens);
+        await onDone(fullText);
     }
     catch (err) {
         if (err?.name !== 'AbortError')
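
Two pieces of the `stream.js` change are easier to follow with concrete input: the params-to-schema mapping and the accumulation of streamed OpenAI-style `tool_calls` deltas. The sketch below reproduces `paramsToSchema` with the same logic as the hunk and wraps the accumulation loop in a hypothetical `accumulateToolCalls` helper so it can run standalone. The format of `sampleParams` is an assumption inferred from the parsing code, and the sample deltas are made up for illustration.

```javascript
// Same mapping logic as the paramsToSchema added in this diff.
function paramsToSchema(paramsStr) {
    try {
        const obj = JSON.parse(paramsStr);
        const properties = {};
        const required = [];
        for (const [key, typeStr] of Object.entries(obj)) {
            const lower = typeStr.toLowerCase();
            const isOptional = lower.includes('optional');
            const isArray = lower.includes('[]') || lower.startsWith('array');
            const base = lower.split(' ')[0].replace('[]', '');
            if (isArray)
                properties[key] = { type: 'array', items: { type: 'string' } };
            else if (base === 'boolean' || base === 'number')
                properties[key] = { type: base };
            else
                properties[key] = { type: 'string' };
            if (!isOptional)
                required.push(key);
        }
        return { type: 'object', properties, required };
    }
    catch {
        // Unparseable params fall back to an empty object schema.
        return { type: 'object', properties: {}, required: [] };
    }
}

// Hypothetical wrapper around the accumulation loop from chatOpenAI:
// fragments are merged by index, then returned in index order.
function accumulateToolCalls(deltas) {
    const acc = {};
    for (const delta of deltas) {
        for (const tc of delta.tool_calls ?? []) {
            const idx = tc.index ?? 0;
            if (!acc[idx])
                acc[idx] = { id: '', name: '', args: '' };
            if (tc.id)
                acc[idx].id = tc.id;
            if (tc.function?.name)
                acc[idx].name += tc.function.name;
            if (tc.function?.arguments)
                acc[idx].args += tc.function.arguments;
        }
    }
    return Object.keys(acc).map(Number).sort((a, b) => a - b).map(i => acc[i]);
}

// Assumed params format: a JSON object mapping names to type descriptions.
const sampleParams = '{"path": "string", "recursive": "boolean (optional)", "globs": "string[]"}';
const schema = paramsToSchema(sampleParams);
console.log(JSON.stringify(schema));

// Made-up deltas: one tool call whose JSON arguments arrive in two fragments.
const calls = accumulateToolCalls([
    { tool_calls: [{ index: 0, id: 'call_1', function: { name: 'read_file', arguments: '{"pa' } }] },
    { tool_calls: [{ index: 0, function: { arguments: 'th":"a.ts"}' } }] },
]);
console.log(calls[0].name, JSON.parse(calls[0].args));
```

Note that the argument string is only valid JSON once every fragment has arrived, which is why the diff parses it after the read loop rather than per chunk.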

package/dist/mcp/client.js CHANGED

@@ -22,7 +22,9 @@ export class MCPClient {
             stdio: ['pipe', 'pipe', 'pipe'],
             env: { ...process.env, ...cfg.env },
         });
-        this.proc.stderr?.on('data', () => { });
+        this.proc.stderr?.on('data', (d) => {
+            d.toString().split('\n').filter(Boolean).forEach(line => process.stderr.write(`[MCP:${this.name}] ${line}\n`));
+        });
         const rl = createInterface({ input: this.proc.stdout });
         rl.on('line', (line) => {
             if (!line.trim())
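
The change above stops discarding the MCP server's stderr and instead forwards each non-empty line with a `[MCP:<name>]` prefix. A minimal standalone sketch of that transformation follows. `prefixStderr` is a hypothetical name, and the server name `postgres` is borrowed from the README example rather than from this file.

```javascript
// Hypothetical helper mirroring the new stderr handler:
// split a chunk into lines, drop empty ones, prefix the rest.
function prefixStderr(name, chunk) {
    return chunk
        .toString()
        .split('\n')
        .filter(Boolean)
        .map(line => `[MCP:${name}] ${line}\n`);
}

// Sample chunk with a blank line in the middle.
const prefixed = prefixStderr('postgres', Buffer.from('connecting\n\nready\n'));
for (const line of prefixed)
    process.stderr.write(line);
```

As in the diff, splitting happens per chunk, so a line that arrives split across two `data` events would be emitted as two prefixed lines.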

package/dist/memory/extractor.js CHANGED

@@ -26,12 +26,43 @@ export function extractFacts(messages, config, model) {
             ],
             onDone(text) {
                 try {
-                    const m = text.match(/\[[\s\S]*?\]/);
-                    if (!m) {
+                    const start = text.indexOf('[');
+                    if (start === -1) {
                         resolve([]);
                         return;
                     }
-                    const arr = JSON.parse(m[0]);
+                    let depth = 0, inStr = false, esc = false, end = -1;
+                    for (let i = start; i < text.length; i++) {
+                        const ch = text[i];
+                        if (esc) {
+                            esc = false;
+                            continue;
+                        }
+                        if (ch === '\\' && inStr) {
+                            esc = true;
+                            continue;
+                        }
+                        if (ch === '"') {
+                            inStr = !inStr;
+                            continue;
+                        }
+                        if (inStr)
+                            continue;
+                        if (ch === '[')
+                            depth++;
+                        else if (ch === ']') {
+                            depth--;
+                            if (depth === 0) {
+                                end = i;
+                                break;
+                            }
+                        }
+                    }
+                    if (end === -1) {
+                        resolve([]);
+                        return;
+                    }
+                    const arr = JSON.parse(text.slice(start, end + 1));
                     resolve(Array.isArray(arr) ? arr.filter((f) => typeof f === 'string') : []);
                 }
                 catch {
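
Note on the hunk above: the removed regex `/\[[\s\S]*?\]/` is lazy, so it stops at the first `]` it sees, including one inside a string or a nested array. The replacement scans for the matching bracket while tracking string and escape state. Below is a minimal standalone sketch of that scan; the function name `firstJsonArray` and the sample reply are hypothetical, not part of the package.

```javascript
// Hypothetical helper: returns the first balanced top-level JSON array in `text`, or null.
function firstJsonArray(text) {
    const start = text.indexOf('[');
    if (start === -1) return null;
    let depth = 0, inStr = false, esc = false;
    for (let i = start; i < text.length; i++) {
        const ch = text[i];
        if (esc) { esc = false; continue; }                  // character after a backslash
        if (ch === '\\' && inStr) { esc = true; continue; }  // start of an escape inside a string
        if (ch === '"') { inStr = !inStr; continue; }        // toggle string state
        if (inStr) continue;                                 // brackets inside strings do not count
        if (ch === '[') depth++;
        else if (ch === ']' && --depth === 0) return text.slice(start, i + 1);
    }
    return null; // unbalanced input
}

// Made-up model reply with a bracket inside a string value.
const reply = 'Facts: ["uses [brackets] in text", "second"] trailing';
const lazy = reply.match(/\[[\s\S]*?\]/)[0]; // old approach: cut at the first "]"
const balanced = firstJsonArray(reply);      // new approach: cut at the matching "]"
console.log(lazy);                 // ["uses [brackets]
console.log(JSON.parse(balanced)); // [ 'uses [brackets] in text', 'second' ]
```

With the old approach the truncated match would fail in `JSON.parse` and fall into the `catch`; the depth scan hands the parser the complete array.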

package/dist/tasks/compactor.js CHANGED

@@ -30,7 +30,7 @@ What still needs to be done, if anything.
 Any constraints, errors encountered, important facts the agent must remember to continue correctly.
 Be factual. No padding. Include file paths, error messages, and command outputs verbatim when relevant.`;
-export async function compactContext(messages, cfg, goal) {
+export async function compactContext(messages, cfg, goal, signal) {
     if (contextSize(messages) <= COMPACT_CHAR_THRESHOLD)
         return messages;
     const system = messages[0]?.role === 'system' ? messages[0] : null;
@@ -50,6 +50,7 @@ export async function compactContext(messages, cfg, goal) {
     let compactErr = '';
     await chat({
         ...cfg,
+        signal,
         messages: [
             { role: 'system', content: COMPACT_SYSTEM },
             { role: 'user', content: userPrompt },
@@ -59,6 +60,8 @@ export async function compactContext(messages, cfg, goal) {
     });
     if (compactErr)
         console.error(`[compactor] LLM error: ${compactErr}`);
+    if (signal?.aborted)
+        return messages;
     // Fallback to dumb compaction if LLM fails
     if (!summary)
         return dumbCompact(messages, goal);

package/dist/tui/components/ConfigPicker.js CHANGED

@@ -12,6 +12,7 @@ const MENU_ITEMS = [
     { key: 'key', label: 'API Key' },
     { key: 'url', label: 'Base URL' },
     { key: 'tavily', label: 'Tavily Key' },
+    { key: 'streaming', label: 'Streaming' },
 ];
 function truncate(s, n) {
     return s.length > n ? s.slice(0, n) + '…' : s;
@@ -76,7 +77,12 @@ export function ConfigPicker({ config, currentModel, tavilyKey, onUpdate, onTavi
                 return;
             }
             if (key.return) {
-                openScreen(MENU_ITEMS[menuIdx].key);
+                const item = MENU_ITEMS[menuIdx];
+                if (item.key === 'streaming') {
+                    onUpdate({ streaming: !config.streaming });
+                    return;
+                }
+                openScreen(item.key);
                 return;
             }
             return;
@@ -161,7 +167,11 @@ export function ConfigPicker({ config, currentModel, tavilyKey, onUpdate, onTavi
                     val = truncate(config.baseUrl, 36);
                 if (item.key === 'tavily')
                     val = tavilyDisplay;
-                return (_jsxs(Box, { children: [_jsxs(Text, { color: active ? 'cyan' : 'white', bold: active, children: [active ? '▶ ' : '  ', item.label.padEnd(12)] }), _jsx(Text, { color: active ? 'white' : 'gray', children: val })] }, item.key));
+                const isStreaming = item.key === 'streaming';
+                const streamingOn = config.streaming === true;
+                return (_jsxs(Box, { children: [_jsxs(Text, { color: active ? 'cyan' : 'white', bold: active, children: [active ? '▶ ' : '  ', item.label.padEnd(12)] }), isStreaming
+                            ? _jsx(Text, { color: streamingOn ? 'green' : 'gray', children: streamingOn ? 'on' : 'off' })
+                            : _jsx(Text, { color: active ? 'white' : 'gray', children: val })] }, item.key));
             }), screen === 'provider' && PROVIDERS.map((p, i) => {
                 const active = i === provIdx;
                 const current = p.key === config.provider;

package/dist/tui/components/InputArea.js CHANGED

@@ -21,6 +21,7 @@ const BUILTIN_COMMANDS = [
     { ns: 'builtin', name: 'list', description: 'list all loaded skills and their descriptions' },
     // ── AI modes ─────────────────────────────────────────────────────────────
     { ns: 'builtin', name: 'plan', description: 'enter planning mode — AI helps think through a goal step-by-step' },
+    { ns: 'builtin', name: 'plan exec', description: 'two-phase: AI outputs plan first (no tools), say "go" to execute — /plan exec <task>' },
     { ns: 'builtin', name: 'refactor', description: 'multi-file AI refactor — plans, reads, then edits — /refactor <goal>' },
     { ns: 'builtin', name: 'think', description: 'deep research before answering — reads files + optional web — /think <query>' },
     { ns: 'builtin', name: 'watch', description: 'watch for file changes, run tests, auto-fix failures — /watch stop to cancel' },
@@ -176,8 +177,10 @@ export function InputArea({ status, skills, cwd, planningMode, permissionRequest
             : skill.ns === 'git'
                 ? `/git ${skill.name}`
                 : `/${skill.ns}:${skill.name}`;
-        clearInput();
-        onSubmit(name);
+        setLines([name]);
+        setCursor({ row: 0, col: name.length });
+        setOverlay('none');
+        setOverlayIdx(0);
     }
     function selectFile(file) {
         const r = cursor.row;

package/dist/tui/hooks/useRunLoop.js CHANGED

@@ -1,5 +1,8 @@
 import { useState, useRef, useCallback, useEffect } from 'react';
 import { readFileSync, writeFileSync, unlinkSync, existsSync } from 'fs';
+import { exec } from 'child_process';
+import { promisify } from 'util';
+const runCmd = promisify(exec);
 import { chat } from '../../llm/stream.js';
 import { tools as staticTools } from '../../tools/index.js';
 import { StreamParser, extractBareToolCall } from '../../parser/stream-parser.js';
@@ -10,6 +13,7 @@ const FILE_EDIT_TOOLS = new Set(['edit_file', 'create_file', 'update_file', 'del
 const SHOW_RESULT_TOOLS = new Set(['run_tests', 'git_commit']);
 const PERMISSION_TOOLS = new Set(['edit_file', 'update_file', 'delete_file', 'create_file', 'move_file', 'run_command', 'git_commit']);
 const CHECKPOINT_TOOLS = new Set(['edit_file', 'update_file', 'create_file', 'delete_file']);
+const PARALLEL_SAFE = new Set(['read_file', 'list_files', 'git_status', 'git_log', 'git_diff', 'web_search', 'web_extract']);
 // Tool result messages that are ephemeral — never worth storing in memory or compact summaries
 const EPHEMERAL_PATTERN = /^Tool (read_file|list_files|run_tests) result:|^\[current state of|^\[Context compacted|^\[file updated:/;
 export function stripEphemeral(messages) {
@@ -23,6 +27,7 @@ export function useRunLoop(config, currentModelRef, pushHistory, extraTools = []
     const [permissionRequest, setPermissionRequest] = useState(null);
     const permissionResolveRef = useRef(null);
     const checkpointRef = useRef(new Map());
+    const autoBranchedRef = useRef(null);
     const sessionApprovedRef = useRef(new Set());
     const thinkingStartRef = useRef(0);
     const extraToolsRef = useRef(extraTools);
@@ -42,7 +47,7 @@ export function useRunLoop(config, currentModelRef, pushHistory, extraTools = []
         const t = setInterval(() => setTick(n => n + 1), 80);
         return () => clearInterval(t);
     }, [status]);
-    const runLoop = useCallback(async (contextMsgs, depth = 0, goal) => {
+    const runLoop = useCallback(async (contextMsgs, depth = 0, goal, options) => {
         if (depth >= MAX_TOOL_DEPTH) {
             abortRef.current = null;
             setStatus('idle');
@@ -52,7 +57,9 @@ export function useRunLoop(config, currentModelRef, pushHistory, extraTools = []
         if (depth === 0) {
             thinkingStartRef.current = Date.now();
             checkpointRef.current.clear();
+            autoBranchedRef.current = null;
         }
+        abortRef.current = new AbortController();
         let msgs = contextMsgs;
         if (shouldCompact(contextMsgs)) {
             printer.systemMsg('compacting context…');
@@ -62,17 +69,31 @@ export function useRunLoop(config, currentModelRef, pushHistory, extraTools = []
                 model: currentModelRef.current,
                 baseUrl: config.baseUrl,
                 apiKey: config.apiKey,
-            }, goal);
+            }, goal, abortRef.current.signal);
+            if (abortRef.current.signal.aborted) {
+                setStatus('idle');
+                return;
+            }
             printer.systemMsg(`compacted: ${contextMsgs.length} → ${msgs.length} messages`);
             replaceHistoryRef.current?.(msgs.filter(m => m.role !== 'system'));
         }
-        abortRef.current = new AbortController();
+        let didStream = false;
         await chat({
             provider: config.provider,
             model: currentModelRef.current,
             baseUrl: config.baseUrl,
+            apiKey: config.apiKey,
             messages: msgs,
+            tools: config.provider !== 'ollama' && !(options?.noTools && depth === 0) ? [...staticTools, ...extraToolsRef.current] : undefined,
+            toolChoice: (options?.noTools && depth === 0) ? 'none' : undefined,
             signal: abortRef.current.signal,
+            onChunk: config.streaming ? (chunk) => {
+                if (!didStream) {
+                    printer.streamStart();
+                    didStream = true;
+                }
+                printer.streamChunk(chunk);
+            } : undefined,
             onRetry(attempt, max, delayMs) {
                 printer.systemMsg(`retry ${attempt}/${max} — waiting ${Math.round(delayMs / 1000)}s`);
             },
@@ -91,8 +112,10 @@ export function useRunLoop(config, currentModelRef, pushHistory, extraTools = []
                     if (bare)
                         pendingTools.push({ name: bare.name, args: bare.args });
                 }
+                if (didStream)
+                    printer.streamEnd();
                 const displayText = textParts.join('').trim();
-                if (displayText)
+                if (displayText && !didStream)
                     printer.assistantMsg(displayText);
                 pushHistoryRef.current({ role: 'assistant', content: fullText });
                 if (pendingTools.length)
@@ -107,105 +130,154 @@ export function useRunLoop(config, currentModelRef, pushHistory, extraTools = []
                         await runLoop([...msgs, { role: 'assistant', content: fullText }, nudge], depth + 1, goal);
                         return;
                     }
+                    if (autoBranchedRef.current)
+                        printer.systemMsg(`branch: ${autoBranchedRef.current}  (git checkout main when done)`);
                     printer.systemMsg(`done in ${printer.formatElapsed(Date.now() - thinkingStartRef.current)}`);
                     setStatus('idle');
                     return;
                 }
                 setStatus('tool');
                 const next = [...msgs, { role: 'assistant', content: fullText }];
-                try {
-                    for (const tc of pendingTools) {
+                const allParallelSafe = pendingTools.every(tc => PARALLEL_SAFE.has(tc.name));
+                if (allParallelSafe && pendingTools.length > 1) {
+                    try {
+                        setCurrentTool(pendingTools[0].name);
                         const allTools = [...staticTools, ...extraToolsRef.current];
-                        const tool = allTools.find(t => t.name === tc.name);
-                        setCurrentTool(tc.name);
-                        if (PERMISSION_TOOLS.has(tc.name)) {
-                            const sessionKey = tc.name;
-                            let decision;
-                            if (sessionApprovedRef.current.has(sessionKey)) {
-                                decision = 'yes';
+                        const settled = await Promise.allSettled(pendingTools.map(async (tc) => {
+                            const tool = allTools.find(t => t.name === tc.name);
+                            printer.toolCallStart(tc.name, tc.args);
+                            if (!tool)
+                                throw new Error(`unknown tool: ${tc.name}`);
+                            const result = await tool.execute(tc.args);
+                            printer.toolResultSummary(tc.name, tc.args, result);
+                            if (SHOW_RESULT_TOOLS.has(tc.name))
+                                printer.toolMsg(tc.name, result);
+                            return { tc, result };
+                        }));
+                        for (const r of settled) {
+                            if (r.status === 'fulfilled') {
+                                next.push({ role: 'user', content: `Tool ${r.value.tc.name} result:\n${r.value.result}` });
                             }
                             else {
-                                decision = await new Promise(resolve => {
-                                    permissionResolveRef.current = resolve;
-                                    setPermissionRequest({ toolName: tc.name, args: tc.args });
-                                });
-                            }
-                            if (decision === 'session')
-                                sessionApprovedRef.current.add(sessionKey);
-                            if (decision === 'no') {
-                                printer.systemMsg(`denied: ${tc.name}`);
-                                const remaining = pendingTools.slice(pendingTools.indexOf(tc) + 1).map(t => t.name);
-                                const skippedNote = remaining.length ? ` The following tools were also skipped: ${remaining.join(', ')}.` : '';
-                                next.push({ role: 'user', content: `Tool ${tc.name} was denied by the user.${skippedNote} Do not retry these tools unless the user explicitly asks.` });
-                                break;
+                                const err = `Tool error: ${r.reason}`;
+                                printer.errorMsg(err);
+                                next.push({ role: 'user', content: err });
                             }
-                            // Checkpoint: store pre-execution file state
-                            if (CHECKPOINT_TOOLS.has(tc.name)) {
-                                const path = tc.args.path;
-                                if (path && !checkpointRef.current.has(path)) {
-                                    try {
-                                        checkpointRef.current.set(path, readFileSync(path, 'utf-8'));
+                        }
+                    }
+                    finally {
+                        setCurrentTool(undefined);
+                    }
+                }
+                else {
+                    try {
+                        for (const tc of pendingTools) {
+                            const allTools = [...staticTools, ...extraToolsRef.current];
+                            const tool = allTools.find(t => t.name === tc.name);
+                            setCurrentTool(tc.name);
+                            if (PERMISSION_TOOLS.has(tc.name)) {
+                                const sessionKey = tc.name;
+                                let decision;
+                                if (sessionApprovedRef.current.has(sessionKey)) {
+                                    decision = 'yes';
+                                }
+                                else {
+                                    decision = await new Promise(resolve => {
+                                        permissionResolveRef.current = resolve;
+                                        setPermissionRequest({ toolName: tc.name, args: tc.args });
+                                    });
+                                }
+                                if (decision === 'session')
+                                    sessionApprovedRef.current.add(sessionKey);
+                                if (decision === 'no') {
+                                    printer.systemMsg(`denied: ${tc.name}`);
+                                    const remaining = pendingTools.slice(pendingTools.indexOf(tc) + 1).map(t => t.name);
+                                    const skippedNote = remaining.length ? ` The following tools were also skipped: ${remaining.join(', ')}.` : '';
+                                    next.push({ role: 'user', content: `Tool ${tc.name} was denied by the user.${skippedNote} Do not retry these tools unless the user explicitly asks.` });
+                                    break;
+                                }
+                                // Checkpoint: store pre-execution file state + auto-branch on first edit
+                                if (CHECKPOINT_TOOLS.has(tc.name)) {
+                                    const path = tc.args.path;
+                                    if (path && !checkpointRef.current.has(path)) {
+                                        try {
+                                            checkpointRef.current.set(path, readFileSync(path, 'utf-8'));
+                                        }
+                                        catch {
+                                            checkpointRef.current.set(path, null);
+                                        }
                                     }
-                                    catch {
-                                        checkpointRef.current.set(path, null);
+                                    if (!autoBranchedRef.current) {
+                                        try {
+                                            const { stdout } = await runCmd('git rev-parse --abbrev-ref HEAD', { timeout: 3000 });
+                                            const branch = stdout.trim();
+                                            if (branch === 'main' || branch === 'master') {
+                                                const ts = new Date().toISOString().slice(0, 16).replace(/[T:]/g, '-');
+                                                const newBranch = `miii/task-${ts}`;
+                                                await runCmd(`git checkout -b ${newBranch}`, { timeout: 5000 });
+                                                autoBranchedRef.current = newBranch;
+                                                printer.systemMsg(`auto-branched: ${newBranch}`);
+                                            }
+                                        }
+                                        catch { }
                                     }
                                 }
                             }
-                        }
-                        if (tool) {
-                            try {
-                                // Guard: for update_file, verify old text still matches before executing.
-                                // If stale, inject fresh file content and skip — model will retry.
-                                if (tc.name === 'update_file') {
-                                    const filePath = tc.args.path;
-                                    const oldText = tc.args.old;
-                                    if (filePath && oldText && existsSync(filePath)) {
-                                        const norm = (s) => s.replace(/\r\n/g, '\n');
-                                        const current = readFileSync(filePath, 'utf-8');
-                                        const occurrences = norm(current).split(norm(oldText)).length - 1;
-                                        if (occurrences === 0) {
-                                            printer.errorMsg(`patch stale: old text not found in ${filePath} — injecting fresh content`);
-                                            next.push({ role: 'user', content: `Tool read_file result:\n${current}` });
-                                            next.push({ role: 'user', content: `update_file failed: the <old> text you used does not exist in ${filePath}. The CURRENT file content is shown above. Re-read it carefully, find the exact text you want to replace, and retry update_file using text that exactly matches what is in the file now.` });
-                                            continue;
+                            if (tool) {
+                                try {
+                                    // Guard: for update_file, verify old text still matches before executing.
+                                    // If stale, inject fresh file content and skip — model will retry.
+                                    if (tc.name === 'update_file') {
+                                        const filePath = tc.args.path;
+                                        const oldText = tc.args.old;
+                                        if (filePath && oldText && existsSync(filePath)) {
+                                            const norm = (s) => s.replace(/\r\n/g, '\n');
+                                            const current = readFileSync(filePath, 'utf-8');
+                                            const occurrences = norm(current).split(norm(oldText)).length - 1;
+                                            if (occurrences === 0) {
+                                                printer.errorMsg(`patch stale: old text not found in ${filePath} — injecting fresh content`);
+                                                next.push({ role: 'user', content: `Tool read_file result:\n${current}` });
+                                                next.push({ role: 'user', content: `update_file failed: the <old> text you used does not exist in ${filePath}. The CURRENT file content is shown above. Re-read it carefully, find the exact text you want to replace, and retry update_file using text that exactly matches what is in the file now.` });
+                                                continue;
+                                            }
+                                            if (occurrences > 1) {
+                                                printer.errorMsg(`patch ambiguous: old text matches ${occurrences} locations in ${filePath} — injecting fresh content`);
+                                                next.push({ role: 'user', content: `Tool read_file result:\n${current}` });
+                                                next.push({ role: 'user', content: `update_file failed: the <old> text matches ${occurrences} locations in ${filePath}. Add more surrounding lines to the <old> block to make it unique, then retry.` });
+                                                continue;
+                                            }
                                         }
-                                        if (occurrences > 1) {
-                                            printer.errorMsg(`patch ambiguous: old text matches ${occurrences} locations in ${filePath} — injecting fresh content`);
-                                            next.push({ role: 'user', content: `Tool read_file result:\n${current}` });
-                                            next.push({ role: 'user', content: `update_file failed: the <old> text matches ${occurrences} locations in ${filePath}. Add more surrounding lines to the <old> block to make it unique, then retry.` });
-                                            continue;
+                                    }
+                                    printer.toolCallStart(tc.name, tc.args);
+                                    const result = await tool.execute(tc.args);
+                                    printer.toolResultSummary(tc.name, tc.args, result);
+                                    if (SHOW_RESULT_TOOLS.has(tc.name))
+                                        printer.toolMsg(tc.name, result);
+                                    next.push({ role: 'user', content: `Tool ${tc.name} result:\n${result}` });
+                                    if (FILE_EDIT_TOOLS.has(tc.name)) {
+                                        const filePath = tc.args.path;
+                                        if (filePath && existsSync(filePath)) {
+                                            const lineCount = readFileSync(filePath, 'utf-8').split('\n').length;
+                                            next.push({ role: 'user', content: `[file updated: ${filePath} — ${lineCount} lines]` });
                                         }
                                     }
                                 }
-                                printer.toolCallStart(tc.name, tc.args);
-                                const result = await tool.execute(tc.args);
-                                printer.toolResultSummary(tc.name, tc.args, result);
-                                if (SHOW_RESULT_TOOLS.has(tc.name))
-                                    printer.toolMsg(tc.name, result);
-                                next.push({ role: 'user', content: `Tool ${tc.name} result:\n${result}` });
-                                if (FILE_EDIT_TOOLS.has(tc.name)) {
-                                    const filePath = tc.args.path;
-                                    if (filePath && existsSync(filePath)) {
-                                        const lineCount = readFileSync(filePath, 'utf-8').split('\n').length;
-                                        next.push({ role: 'user', content: `[file updated: ${filePath} — ${lineCount} lines]` });
-                                    }
+                                catch (e) {
+                                    const err = `Tool ${tc.name} error: ${e}`;
+                                    printer.errorMsg(err);
+                                    next.push({ role: 'user', content: err });
                                 }
                             }
-                            catch (e) {
-                                const err = `Tool ${tc.name} error: ${e}`;
-                                printer.errorMsg(err);
-                                next.push({ role: 'user', content: err });
+                            else {
+                                printer.errorMsg(`unknown tool: ${tc.name}`);
+                                next.push({ role: 'user', content: `unknown tool: ${tc.name}` });
                             }
                         }
-                        else {
-                            printer.errorMsg(`unknown tool: ${tc.name}`);
-                            next.push({ role: 'user', content: `unknown tool: ${tc.name}` });
-                        }
                     }
-                }
-                finally {
-                    setCurrentTool(undefined);
-                }
+                    finally {
+                        setCurrentTool(undefined);
+                    }
+                } // end sequential else
                 // For file-edit turns: slim context (system + goal + fresh file states + recent results)
                 // For non-edit turns: full next (model needs full conversational context)
                 const didEditFiles = pendingTools.some(tc => FILE_EDIT_TOOLS.has(tc.name));
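
Note on the hunk above: a batch of tool calls now runs concurrently only when it has more than one call and every call is in `PARALLEL_SAFE`; everything else stays on the sequential path with its permission prompts and checkpoints. None of the `PARALLEL_SAFE` names appear in `PERMISSION_TOOLS`, which appears to be why the parallel path can skip the prompt. The sketch below is a simplified illustration of that dispatch, not the package's code: `canRunInParallel`, `runParallel`, and the stub tools are hypothetical names, and printing is omitted.

```javascript
// Set contents copied from the diff; everything else here is an illustrative stand-in.
const PARALLEL_SAFE = new Set(['read_file', 'list_files', 'git_status', 'git_log', 'git_diff', 'web_search', 'web_extract']);

// A batch runs concurrently only if it has more than one call and all calls are read-only.
function canRunInParallel(pendingTools) {
    return pendingTools.length > 1 && pendingTools.every(tc => PARALLEL_SAFE.has(tc.name));
}

// Starts every call at once, then maps each outcome to a message in the original call order.
async function runParallel(pendingTools, tools) {
    const settled = await Promise.allSettled(pendingTools.map(async (tc) => {
        const tool = tools.find(t => t.name === tc.name);
        if (!tool) throw new Error(`unknown tool: ${tc.name}`);
        return { tc, result: await tool.execute(tc.args) };
    }));
    return settled.map(r => r.status === 'fulfilled'
        ? { role: 'user', content: `Tool ${r.value.tc.name} result:\n${r.value.result}` }
        : { role: 'user', content: `Tool error: ${r.reason}` });
}

// Hypothetical stub tools: one succeeds, one fails.
const stubTools = [
    { name: 'read_file', execute: async (args) => `contents of ${args.path}` },
    { name: 'git_status', execute: async () => { throw new Error('not a git repo'); } },
];
const batch = [
    { name: 'read_file', args: { path: 'a.js' } },
    { name: 'git_status', args: {} },
];
if (canRunInParallel(batch)) {
    runParallel(batch, stubTools).then(msgs => msgs.forEach(m => console.log(m.content)));
}
```

`Promise.allSettled` keeps one failing call from discarding the results of the others. As in the diff, the rejected branch reports `Tool error: ...` without the tool name, whereas the sequential path includes it.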
@@ -261,6 +333,10 @@ export function useRunLoop(config, currentModelRef, pushHistory, extraTools = []
             if (restored > 0)
                 printer.systemMsg(`restored ${restored} file(s) to pre-session state`);
         }
+        if (autoBranchedRef.current) {
+            printer.systemMsg(`task branch preserved: ${autoBranchedRef.current}`);
+            autoBranchedRef.current = null;
+        }
         setStatus('idle');
     }, []);
     return {
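
Note on the auto-branch logic in this file: on the first checkpointed edit, if the current branch is `main` or `master`, the run loop creates `miii/task-<timestamp>` and switches to it. The sketch below isolates the naming expression and the guard so the resulting name is easy to see; `taskBranchName` and `shouldAutoBranch` are hypothetical helper names, and the fixed date is only there to make the output reproducible.

```javascript
// Hypothetical helper mirroring the expression in the diff; `now` is injectable for testing.
function taskBranchName(now = new Date()) {
    // toISOString() is always UTC, e.g. "2024-01-02T03:04:05.000Z";
    // the first 16 characters keep the date and time down to the minute.
    const ts = now.toISOString().slice(0, 16).replace(/[T:]/g, '-');
    return `miii/task-${ts}`;
}

// Mirrors the guard in the diff: only branch away from main or master.
function shouldAutoBranch(currentBranch) {
    return currentBranch === 'main' || currentBranch === 'master';
}

const fixed = new Date(Date.UTC(2024, 0, 2, 3, 4, 5));
console.log(taskBranchName(fixed));          // miii/task-2024-01-02-03-04
console.log(shouldAutoBranch('feature/x'));  // false
```

Two things follow from the expression itself: the timestamp is in UTC rather than local time, and its resolution is one minute, so a second task started from `main` within the same minute would produce the same name. In the diff a failing `git checkout -b` lands in the empty `catch`, so that case would pass silently without a new branch.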

package/dist/tui/hooks/useSubmit.js CHANGED

@@ -326,6 +326,18 @@ Analyze what exists, then implement the design. Use the design system above if a
             }
             return;
         }
+        if (cmd.startsWith('/plan exec ')) {
+            const task = cmd.slice(11).trim();
+            if (!task) {
+                printer.systemMsg('usage: /plan exec <task>');
+                return;
+            }
+            const planPrompt = `PLANNING TURN — output a numbered plan of exactly what you will do to accomplish this task. List which files to read, which to edit, and what changes to make. Do NOT call any tools in this response. After I review the plan and respond "go", you will execute.\n\nTask: ${task}`;
+            printer.userMsg(`/plan exec ${task}`);
+            pushHistory({ role: 'user', content: planPrompt });
+            await runLoop(buildContext(), 0, task, { noTools: true });
+            return;
+        }
         if (cmd === '/plan' || cmd.startsWith('/plan ')) {
             const topic = cmd.slice(5).trim();
             setPlanningMode(true);
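
Note on the hunk above: the `/plan exec ` check has to come before the generic `/plan` check, because `cmd.startsWith('/plan ')` also matches `/plan exec ...`. The sketch below reproduces only the routing decision so the edge cases can be checked in isolation; `routePlanCommand` and its return shapes are hypothetical, and whether `cmd` is trimmed before it reaches this code is not visible in the diff.

```javascript
// Hypothetical router reproducing the order of the two checks in the diff.
function routePlanCommand(cmd) {
    if (cmd.startsWith('/plan exec ')) {
        // '/plan exec '.length is 11, the same offset as cmd.slice(11) in the diff.
        const task = cmd.slice('/plan exec '.length).trim();
        return task ? { kind: 'plan-exec', task } : { kind: 'usage' };
    }
    if (cmd === '/plan' || cmd.startsWith('/plan ')) {
        return { kind: 'plan', topic: cmd.slice(5).trim() };
    }
    return { kind: 'other' };
}

console.log(routePlanCommand('/plan exec add tests')); // { kind: 'plan-exec', task: 'add tests' }
console.log(routePlanCommand('/plan exec'));           // { kind: 'plan', topic: 'exec' }
```

The second call shows a consequence of the prefix including a trailing space: a bare `/plan exec` falls through to planning mode with the topic `exec` instead of printing the usage message.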

package/dist/tui/printer.js CHANGED

@@ -161,8 +161,11 @@ export function assistantMsg(text) {
     const tail = lines.slice(idx + 1).join('\n');
     write(`\n${blue('●')} ${head}${tail ? '\n' + tail : ''}\n`);
 }
-export const EDIT_TOOLS = new Set(['edit_file', 'update_file', 'create_file', 'write_file']);
-export const DELETE_TOOLS = new Set(['delete_file', 'remove_file']);
+export function streamStart() { write(`\n${blue('●')} `); }
+export function streamChunk(s) { write(s); }
+export function streamEnd() { write('\n'); }
+export const EDIT_TOOLS = new Set(['edit_file', 'update_file', 'create_file']);
+export const DELETE_TOOLS = new Set(['delete_file']);
 const PERM_DESC = {
     delete_file: 'delete this file',
     update_file: 'edit this file',
@@ -291,8 +294,7 @@ export function toolResultSummary(name, args, result) {
     const lines = result.trim().split('\n').filter(Boolean);
     let summary = '';
     switch (name) {
-        case 'edit_file':
-        case 'write_file': {
+        case 'edit_file': {
             const n = (a.content ?? '').split('\n').length;
             summary = `Wrote ${n} line${n === 1 ? '' : 's'}`;
             break;
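
Note on the streaming additions: `streamStart`, `streamChunk`, and `streamEnd` pair with the `didStream` flag in `useRunLoop.js`, so the bullet prefix is written once before the first chunk and the full text is not printed a second time when the response completes. The sketch below shows that handshake against an in-memory sink; `makeStreamPrinter` and the `write` callback are hypothetical, colour is omitted, and the non-streaming branch is a simplification of `assistantMsg`.

```javascript
// Hypothetical printer; `write` stands in for the package's terminal writer.
function makeStreamPrinter(write) {
    let didStream = false;
    return {
        // Called once per chunk when streaming is enabled.
        onChunk(chunk) {
            if (!didStream) {
                write('\n● ');   // prefix is emitted only before the first chunk
                didStream = true;
            }
            write(chunk);
        },
        // Called once with the complete text.
        onDone(fullText) {
            if (didStream) {
                write('\n');     // text is already on screen, just close the line
            } else if (fullText.trim()) {
                write(`\n● ${fullText.trim()}\n`); // non-streaming: print in one go
            }
        },
    };
}

const out = [];
const streamed = makeStreamPrinter(s => out.push(s));
['Hel', 'lo'].forEach(c => streamed.onChunk(c));
streamed.onDone('Hello');
console.log(JSON.stringify(out.join(''))); // "\n● Hello\n"
```

In this sketch both paths end up writing the same characters, which is the point of the `displayText && !didStream` guard in the diff: the streamed and buffered modes should not double-print.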

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "miii-cli",
-  "version": "1.3.0",
+  "version": "1.3.1",
   "type": "module",
   "description": "The high-performance local AI coding agent for your terminal. Automate complex workflows with local LLMs.",
   "license": "MIT",