PyPI - mlx-code - Versions diffs - 0.0.20__tar.gz → 0.0.22__tar.gz - Mend

mlx-code 0.0.20tar.gz → 0.0.22tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (29) hide show

{mlx_code-0.0.20 → mlx_code-0.0.22}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: mlx-code
-Version: 0.0.20
+Version: 0.0.22
 Summary: Coding Agent for Mac
 Home-page: https://josefalbers.github.io/mlx-code/
 Author: J Joe
@@ -17,6 +17,8 @@ Requires-Dist: httpx
 Requires-Dist: pydantic
 Requires-Dist: textual>=8.2.7
 Requires-Dist: rich>=15.0.0
+Requires-Dist: starlette
+Requires-Dist: uvicorn
 Provides-Extra: all
 Requires-Dist: python-lsp-server[all]; extra == "all"
 Requires-Dist: GitPython; extra == "all"
@@ -38,16 +40,16 @@ Dynamic: summary
 A Git-native coding agent that can run entirely on your Mac. No API keys, no cloud, and no data leaving your machine. Powered by Apple MLX, it turns commits, branches, and worktrees into the agent’s state, history, and execution model
-https://github.com/user-attachments/assets/0569d101-8d0a-4e67-9e82-fce84a5ef3f0
+[![v0.0.27](https://github.com/user-attachments/assets/8a1c131a-dda1-4b52-9fa6-9c0fbccb5ea6)](https://youtube.com/shorts/1LuifKFKixc)
 ---
 ## Architecture
 ```
-Conversation tree (nodes = git commits with embedded chat history)
+Worktrees:
-  main ──●──●──●──●──●──●──●──●──●──●
+  main ──●──●──●──●──●──●──●──●──●──●──●──●──●──●───────────► Node = git commit + chat hx
             │        │
             │        └── branch-1 ──●──●──●
             │                          │ ┌────────────┐
@@ -55,32 +57,30 @@ Conversation tree (nodes = git commits with embedded chat history)
             │                            └─────┬──────┘
             └── branch-0 ──●──●──●             │
                                                │
+Tabs:                                          ├────────────► Tab = git branch + Agent
                                                │
-REPL tabs (each tab = a git branch + agent)    │
-                                               │
-                                               │
-┌──────────────────────────────────────────────┼─────────┐
+┌──────────────────────────────────────────────│─────────┐
 │  TUI tabs                                    │         │
 │  ┌──────┐  ┌──────────┐  ┌──────────┐  ┌─────┴──────┐  │
 │  │ main │  │ branch-0 │  │ branch-1 │  │ branch-1-0 │  │
 │  └──────┘  └────┬─────┘  └──────────┘  └────────────┘  │
-└─────────────────┼──────────────────────────────────────┘
+└─────────────────│──────────────────────────────────────┘
                   │
-                  ├────────────────────────────────────► each tab is an independent Agent
+Agents:           ├─────────────────────────────────────────► Each tab runs its own Agent
                   │
-             ┌────┴────────────────────────────────┐
-             │  Agent                              │
-             │  ┌──────────────┐  ┌──────────────┐ │
-             │  │ API:         │  │ Tools:       │ │
-             │  │ MLX (local)  │  │ Read  Write  │ │
-             │  │ Claude       │  │ Edit  Bash   │ │
-             │  │ Gemini       │  │ Grep  Find   │ │
-             │  │ OpenAI       │  │ Ls  Skill    │ │
-             │  └──────────────┘  │ Agent ───────┼─┼───► spawns child Agent
-             │                    └──────────────┘ │     (each with own tools + worktree + etc)
-             │  Git worktree                       │
-             │  (isolation + session state)        │
-             └─────────────────────────────────────┘
+             ┌────┴─────────────────────────────────────┐
+             │  Agent                                   │
+             │  ┌────────────────┐  ┌────────────────┐  │
+             │  │ API:           │  │ Tools:         │  │
+             │  │ Local (mlx-lm) │  │ Read    Write  │  │
+             │  │ Gemini         │  │ Edit    Bash   │  │
+             │  │ Claude         │  │ Grep    Find   │  │
+             │  │ Codex          │  │ Ls      Skill  │  │
+             │  │ DeepSeek       │  │ Agent ─────────┼──┼───► Recursively spawns sub-Agents
+             │  └────────────────┘  └────────────────┘  │
+             │  Git worktree                            │
+             │  (isolation + session state)             │
+             └──────────────────────────────────────────┘
 ```
 Each layer is importable and composable on its own. A commit records state, a branch records an alternative path, and a tab is just a live view over an `Agent`.
@@ -95,28 +95,31 @@ result = await agent.run('refactor utils.py to use dataclasses')
 ---
+## Core ideas
+- **Git is the state machine.** Every file-changing agent step is committed with the conversation that produced it, so you can inspect, resume, and branch from any checkpoint.
+- **Branches are alternative futures.** A branch is not just a Git branch; it is a different reasoning path with its own worktree and session state.
+- **Agents are the primitive.** Tabs, branches, and delegated subtasks are all instances of the same `Agent` abstraction.
+- **Worktrees provide isolation.** The agent edits in a separate worktree, so your main checkout stays clean and recoverable.
+---
 ## Quick start
 ```bash
+# ephemeral run (no installation)
+uvx --from mlx-code mlc
+# or install into the current environment
 pip install mlx-code
-mlc                              # launch with local MLX model
+# launch
+mlc                              # with a local MLX model
 mlc-run --api gemini             # or use a remote provider
-mlc-run --api deepseek --model deepseek-v4-flash
 ```
 That's it. The first run starts a local inference server and drops you into the REPL.
-[![Link](https://raw.githubusercontent.com/JosefAlbers/mlx-code/main/assets/mlx-code-v0.0.20.gif)](https://youtu.be/0lkY7YQCyCo)
----
-## Core ideas
-- **Git is the state machine.** Every file-changing agent step is committed with the conversation that produced it, so you can inspect, resume, and branch from any checkpoint.
-- **Branches are alternative futures.** A branch is not just a Git branch; it is a different reasoning path with its own worktree and session state.
-- **Agents are the primitive.** Tabs, branches, and delegated subtasks are all instances of the same `Agent` abstraction.
-- **Worktrees provide isolation.** The agent edits in a separate worktree, so your main checkout stays clean and recoverable.
 ---
 ## Why mlx-code
@@ -125,12 +128,12 @@ That's it. The first run starts a local inference server and drops you into the
 **Git is the database.** When the agent makes file changes, they’re committed to a git worktree with the full conversation embedded in the commit message. Resume any past session by hash, branch from any checkpoint, and inspect the agent timeline with `git log`. No proprietary state files, just Git.
-**Your working directory is never at risk.** The agent operates inside a `git worktree`, not your checkout. It can make a mess, and you can inspect or discard it without ever touching `main`.
-**Built-in safety nets.** Subprocess environment variables go through an explicit allowlist, so secrets in your shell are never leaked to agent-spawned processes.
+**Built-in safety nets.** Your working directory is never at risk. The agent operates inside a `git worktree`, not your checkout. It can make a mess, and you can inspect or discard it without ever touching `main`. Subprocess environment variables go through an explicit allowlist, so secrets in your shell are never leaked to agent-spawned processes.
 **Batteries included.** Everything ships in one pip install: the MLX inference engine, the multi-protocol API server, the agent loop, the tools, and the TUI. No llama.cpp, no ollama, no vLLM bridge to find and configure. And the server natively speaks OpenAI, Anthropic, Gemini, and Codex wire formats simultaneously, so `claude`, `codex`, and `gemini` CLIs can all work against your local model without a translation layer.
+**Continuous batching.** The local inference server runs a continuous batching engine that processes multiple sequences concurrently. When you spawn parallel agents (eg, multiple tabs, `asyncio.gather` pipelines, or delegated sub-tasks) they all share the same GPU context and are stepped together each tick. A prefix cache persists KV snapshots to disk, so repeated system prompts and conversation prefixes are prefilled once and reused across sessions. No request queueing, no waiting for the previous agent to finish.
 ---
 ## Agent primitive
@@ -168,12 +171,12 @@ agent.messages = messages
 await agent.run("now add unit tests")
 ```
-Branch from any point in the conversation — each branch gets its own worktree:
+Branch from any point in the conversation. Each branch gets its own worktree:
 ```
 /branch                      # branch from current state
 /branch --rev 2              # branch from the 2nd user turn
-/branch --rev 3 --as-worktree try different approach
+/branch --rev 3 make it use httpx instead
 ```
 Since it's just git, you can inspect the timeline outside the REPL:
@@ -238,6 +241,43 @@ Reliability comes from specialization plus constraint. A read-only reviewer can'
 ---
+## Continuous batching
+The local server can run multiple inference sequences concurrently inside a single batch step. Instead of a global lock that serialises one request at a time, the batching engine maintains a live set of active sequences and yields tokens for all of them on every step.
+```bash
+mlc --engine batch            # continuous batching + built-in REPL
+```
+This unlocks true parallelism for multi-agent workloads:
+```python
+import asyncio
+from mlx_code.repl import Agent
+async def main():
+    agents = [Agent() for _ in range(4)]
+    await asyncio.gather(*[
+        a.run(f"Research topic: {t}")
+        for a, t in zip(agents, ["consensus", "cryptography", "networking", "storage"])
+    ])
+asyncio.run(main())
+```
+All four agents generate simultaneously inside the same batch. No sequential blocking.
+### Health endpoint
+```bash
+curl http://127.0.0.1:8000/health
+# {"status":"ok","model":"mlx-community/Qwen3.5-4B-OptiQ-4bit","active_sequences":2,"prefix_cache_files":5}
+```
+`active_sequences` shows how many agents are generating right now; `prefix_cache_files` shows how many prefix KV snapshots are stored on disk.
+---
 ## Command Line
 ### `mlc`: local server + harness
@@ -245,20 +285,20 @@ Reliability comes from specialization plus constraint. A read-only reviewer can'
 Starts the MLX inference server and launches the built-in TUI harness against it.
 ```bash
-# Default: local server + default TUI
+# Default: local server + default harness
 mlc
-# Use a simple terminal REPL instead of the TUI
-mlc --notui
+# Continuous batching mode (default is sequential caching mode)
+mlc --engine batch
+# Server only, no harness
+mlc --leash none
 # Use a different harness (routes traffic through the local server)
 mlc --leash claude
 mlc --leash gemini
 mlc --leash codex
-# Server only, no harness
-mlc --leash none
 # Specify a model
 mlc --model mlx-community/Qwen3.5-4B-OptiQ-4bit
@@ -309,10 +349,9 @@ mlc-run --api codex
 echo "explain lsp.py" | mlc-run -a deepseek | cat - PLAN.md | mlc-run --url http://localhost:9000
 # Simple terminal REPL (no TUI)
-mlc-run --notui
+mlc-run --bare
 ```
 ---
 ## Using as a Library
@@ -435,18 +474,19 @@ agent = Agent(extra_tool_classes=[LiveDBTool], tool_names=["QueryDB"])
 | Command | Description |
 |---|---|
-| `/help` | Show command reference |
+| `/branch [--rev N] [prompt]` | Open a new branch tab from the current (or earlier) checkpoint |
+| `/diff [--all]` | Show a side-by-side diff of changes in the worktree |
 | `/clear [--config F]` | Clear conversation; `--config` reloads agent from a JSON/YAML file |
+| `/tab [N]` | Jump to tab N |
 | `/history [--raw]` | Show conversation transcript; `--raw` shows the raw API message log |
-| `/diff [--all]` | Show a side-by-side diff of changes in the worktree |
-| `/errors` | Show timestamped error log for the current tab |
 | `/tools` | List active tools |
-| `/branch [--rev N] [prompt]` | Open a new branch tab from the current (or earlier) checkpoint |
 | `/abort` | Abort the running agent |
+| `/errors` | Show timestamped error log for the current tab |
 | `/export [path]` | Export session to JSON |
-| `/exit` or `/quit` | Close branch tab, or exit the app |
-| `!command` | Run a shell command; output captured in the TUI |
-| `!!command` | Run an interactive command (TUI suspends, terminal handed to process) |
+| `/exit [--all]` | Close branch tab, or exit the app |
+| `/help` | Show command reference |
+| `!command` | Run a shell command; output captured in the TUI (eg, `ls`, `cat hello.c`) |
+| `$command` | Run an interactive command (eg, `vim`, `yazi`, `less hello.c`) |
 ### Key bindings
@@ -454,9 +494,9 @@ agent = Agent(extra_tool_classes=[LiveDBTool], tool_names=["QueryDB"])
 |---|---|
 | `Enter` | Submit |
 | `Ctrl-J` | Insert newline |
-| `Alt-1` … `Alt-9` | Jump to tab N |
-| `Tab` / `Shift-Tab` | Cycle through tabs |
-| `Ctrl-C` | Abort running agent |
+| `Ctrl-1` … `Ctrl-9` | Jump to tab N |
+| `Ctrl-,` / `Ctrl-.` | Cycle through tabs |
+| `Ctrl-C` | Clear input, or abort running agent |
 | `Ctrl-D` | Close branch tab, or exit app |
 | `Ctrl-R` | Recall last prompt into editor |
@@ -474,16 +514,16 @@ agent = Agent(extra_tool_classes=[LiveDBTool], tool_names=["QueryDB"])
 | `Skill` | Retrieve named skill instructions from config |
 | `Agent` | Spawn an autonomous sub-agent for delegated work |
-All file tools enforce path sandboxing — the agent cannot read or write outside the worktree.
+All file tools enforce path sandboxing. The agent cannot read or write outside the worktree.
 ### Backends
 | Backend | Flag | Notes |
 |---------|------|-------|
-| MLX (local) | `--api noapi` | Default. Runs on-device, no API key needed |
+| MLX-LM (local) | `--api noapi` | Default. Runs on-device, no API key needed |
 | Claude | `--api claude` | Requires `ANTHROPIC_API_KEY` |
 | Gemini | `--api gemini` | Requires `GOOGLE_API_KEY` |
-| DeepSeek | `--api deepseek` | DeepSeek API or compatible endpoint |
+| DeepSeek | `--api deepseek` | Requires `DEEPSEEK_API_KEY` |
 | Codex | `--api codex` | OpenAI Codex CLI integration |
 | OpenAI | `--api openai` | Any OpenAI-compatible endpoint |
@@ -492,10 +532,13 @@ All file tools enforce path sandboxing — the agent cannot read or write outsid
 The local MLX server speaks OpenAI, Anthropic, and Gemini wire formats simultaneously, so you can use any compatible CLI as the frontend:
 ```bash
-mlc --leash claude       # claude CLI routes through local model
-mlc --leash codex        # codex CLI routes through local model
-mlc --leash gemini       # gemini CLI routes through local model
-mlc --leash none         # server only
+mlc                      # default
+mlc --web                # web UI (api.mlx-code.com)
+mlc --bare               # no TUI
+mlc --leash none         # no harness
+mlc --leash codex        # codex CLI
+mlc --leash gemini       # gemini CLI
+mlc --leash claude       # claude code
 ```
 ---

{mlx_code-0.0.20 → mlx_code-0.0.22}/README.md RENAMED Viewed

@@ -2,16 +2,16 @@
 A Git-native coding agent that can run entirely on your Mac. No API keys, no cloud, and no data leaving your machine. Powered by Apple MLX, it turns commits, branches, and worktrees into the agent’s state, history, and execution model
-https://github.com/user-attachments/assets/0569d101-8d0a-4e67-9e82-fce84a5ef3f0
+[![v0.0.27](https://github.com/user-attachments/assets/8a1c131a-dda1-4b52-9fa6-9c0fbccb5ea6)](https://youtube.com/shorts/1LuifKFKixc)
 ---
 ## Architecture
 ```
-Conversation tree (nodes = git commits with embedded chat history)
+Worktrees:
-  main ──●──●──●──●──●──●──●──●──●──●
+  main ──●──●──●──●──●──●──●──●──●──●──●──●──●──●───────────► Node = git commit + chat hx
             │        │
             │        └── branch-1 ──●──●──●
             │                          │ ┌────────────┐
@@ -19,32 +19,30 @@ Conversation tree (nodes = git commits with embedded chat history)
             │                            └─────┬──────┘
             └── branch-0 ──●──●──●             │
                                                │
+Tabs:                                          ├────────────► Tab = git branch + Agent
                                                │
-REPL tabs (each tab = a git branch + agent)    │
-                                               │
-                                               │
-┌──────────────────────────────────────────────┼─────────┐
+┌──────────────────────────────────────────────│─────────┐
 │  TUI tabs                                    │         │
 │  ┌──────┐  ┌──────────┐  ┌──────────┐  ┌─────┴──────┐  │
 │  │ main │  │ branch-0 │  │ branch-1 │  │ branch-1-0 │  │
 │  └──────┘  └────┬─────┘  └──────────┘  └────────────┘  │
-└─────────────────┼──────────────────────────────────────┘
+└─────────────────│──────────────────────────────────────┘
                   │
-                  ├────────────────────────────────────► each tab is an independent Agent
+Agents:           ├─────────────────────────────────────────► Each tab runs its own Agent
                   │
-             ┌────┴────────────────────────────────┐
-             │  Agent                              │
-             │  ┌──────────────┐  ┌──────────────┐ │
-             │  │ API:         │  │ Tools:       │ │
-             │  │ MLX (local)  │  │ Read  Write  │ │
-             │  │ Claude       │  │ Edit  Bash   │ │
-             │  │ Gemini       │  │ Grep  Find   │ │
-             │  │ OpenAI       │  │ Ls  Skill    │ │
-             │  └──────────────┘  │ Agent ───────┼─┼───► spawns child Agent
-             │                    └──────────────┘ │     (each with own tools + worktree + etc)
-             │  Git worktree                       │
-             │  (isolation + session state)        │
-             └─────────────────────────────────────┘
+             ┌────┴─────────────────────────────────────┐
+             │  Agent                                   │
+             │  ┌────────────────┐  ┌────────────────┐  │
+             │  │ API:           │  │ Tools:         │  │
+             │  │ Local (mlx-lm) │  │ Read    Write  │  │
+             │  │ Gemini         │  │ Edit    Bash   │  │
+             │  │ Claude         │  │ Grep    Find   │  │
+             │  │ Codex          │  │ Ls      Skill  │  │
+             │  │ DeepSeek       │  │ Agent ─────────┼──┼───► Recursively spawns sub-Agents
+             │  └────────────────┘  └────────────────┘  │
+             │  Git worktree                            │
+             │  (isolation + session state)             │
+             └──────────────────────────────────────────┘
 ```
 Each layer is importable and composable on its own. A commit records state, a branch records an alternative path, and a tab is just a live view over an `Agent`.
@@ -59,28 +57,31 @@ result = await agent.run('refactor utils.py to use dataclasses')
 ---
+## Core ideas
+- **Git is the state machine.** Every file-changing agent step is committed with the conversation that produced it, so you can inspect, resume, and branch from any checkpoint.
+- **Branches are alternative futures.** A branch is not just a Git branch; it is a different reasoning path with its own worktree and session state.
+- **Agents are the primitive.** Tabs, branches, and delegated subtasks are all instances of the same `Agent` abstraction.
+- **Worktrees provide isolation.** The agent edits in a separate worktree, so your main checkout stays clean and recoverable.
+---
 ## Quick start
 ```bash
+# ephemeral run (no installation)
+uvx --from mlx-code mlc
+# or install into the current environment
 pip install mlx-code
-mlc                              # launch with local MLX model
+# launch
+mlc                              # with a local MLX model
 mlc-run --api gemini             # or use a remote provider
-mlc-run --api deepseek --model deepseek-v4-flash
 ```
 That's it. The first run starts a local inference server and drops you into the REPL.
-[![Link](https://raw.githubusercontent.com/JosefAlbers/mlx-code/main/assets/mlx-code-v0.0.20.gif)](https://youtu.be/0lkY7YQCyCo)
----
-## Core ideas
-- **Git is the state machine.** Every file-changing agent step is committed with the conversation that produced it, so you can inspect, resume, and branch from any checkpoint.
-- **Branches are alternative futures.** A branch is not just a Git branch; it is a different reasoning path with its own worktree and session state.
-- **Agents are the primitive.** Tabs, branches, and delegated subtasks are all instances of the same `Agent` abstraction.
-- **Worktrees provide isolation.** The agent edits in a separate worktree, so your main checkout stays clean and recoverable.
 ---
 ## Why mlx-code
@@ -89,12 +90,12 @@ That's it. The first run starts a local inference server and drops you into the
 **Git is the database.** When the agent makes file changes, they’re committed to a git worktree with the full conversation embedded in the commit message. Resume any past session by hash, branch from any checkpoint, and inspect the agent timeline with `git log`. No proprietary state files, just Git.
-**Your working directory is never at risk.** The agent operates inside a `git worktree`, not your checkout. It can make a mess, and you can inspect or discard it without ever touching `main`.
-**Built-in safety nets.** Subprocess environment variables go through an explicit allowlist, so secrets in your shell are never leaked to agent-spawned processes.
+**Built-in safety nets.** Your working directory is never at risk. The agent operates inside a `git worktree`, not your checkout. It can make a mess, and you can inspect or discard it without ever touching `main`. Subprocess environment variables go through an explicit allowlist, so secrets in your shell are never leaked to agent-spawned processes.
 **Batteries included.** Everything ships in one pip install: the MLX inference engine, the multi-protocol API server, the agent loop, the tools, and the TUI. No llama.cpp, no ollama, no vLLM bridge to find and configure. And the server natively speaks OpenAI, Anthropic, Gemini, and Codex wire formats simultaneously, so `claude`, `codex`, and `gemini` CLIs can all work against your local model without a translation layer.
+**Continuous batching.** The local inference server runs a continuous batching engine that processes multiple sequences concurrently. When you spawn parallel agents (eg, multiple tabs, `asyncio.gather` pipelines, or delegated sub-tasks) they all share the same GPU context and are stepped together each tick. A prefix cache persists KV snapshots to disk, so repeated system prompts and conversation prefixes are prefilled once and reused across sessions. No request queueing, no waiting for the previous agent to finish.
 ---
 ## Agent primitive
@@ -132,12 +133,12 @@ agent.messages = messages
 await agent.run("now add unit tests")
 ```
-Branch from any point in the conversation — each branch gets its own worktree:
+Branch from any point in the conversation. Each branch gets its own worktree:
 ```
 /branch                      # branch from current state
 /branch --rev 2              # branch from the 2nd user turn
-/branch --rev 3 --as-worktree try different approach
+/branch --rev 3 make it use httpx instead
 ```
 Since it's just git, you can inspect the timeline outside the REPL:
@@ -202,6 +203,43 @@ Reliability comes from specialization plus constraint. A read-only reviewer can'
 ---
+## Continuous batching
+The local server can run multiple inference sequences concurrently inside a single batch step. Instead of a global lock that serialises one request at a time, the batching engine maintains a live set of active sequences and yields tokens for all of them on every step.
+```bash
+mlc --engine batch            # continuous batching + built-in REPL
+```
+This unlocks true parallelism for multi-agent workloads:
+```python
+import asyncio
+from mlx_code.repl import Agent
+async def main():
+    agents = [Agent() for _ in range(4)]
+    await asyncio.gather(*[
+        a.run(f"Research topic: {t}")
+        for a, t in zip(agents, ["consensus", "cryptography", "networking", "storage"])
+    ])
+asyncio.run(main())
+```
+All four agents generate simultaneously inside the same batch. No sequential blocking.
+### Health endpoint
+```bash
+curl http://127.0.0.1:8000/health
+# {"status":"ok","model":"mlx-community/Qwen3.5-4B-OptiQ-4bit","active_sequences":2,"prefix_cache_files":5}
+```
+`active_sequences` shows how many agents are generating right now; `prefix_cache_files` shows how many prefix KV snapshots are stored on disk.
+---
 ## Command Line
 ### `mlc`: local server + harness
@@ -209,20 +247,20 @@ Reliability comes from specialization plus constraint. A read-only reviewer can'
 Starts the MLX inference server and launches the built-in TUI harness against it.
 ```bash
-# Default: local server + default TUI
+# Default: local server + default harness
 mlc
-# Use a simple terminal REPL instead of the TUI
-mlc --notui
+# Continuous batching mode (default is sequential caching mode)
+mlc --engine batch
+# Server only, no harness
+mlc --leash none
 # Use a different harness (routes traffic through the local server)
 mlc --leash claude
 mlc --leash gemini
 mlc --leash codex
-# Server only, no harness
-mlc --leash none
 # Specify a model
 mlc --model mlx-community/Qwen3.5-4B-OptiQ-4bit
@@ -273,10 +311,9 @@ mlc-run --api codex
 echo "explain lsp.py" | mlc-run -a deepseek | cat - PLAN.md | mlc-run --url http://localhost:9000
 # Simple terminal REPL (no TUI)
-mlc-run --notui
+mlc-run --bare
 ```
 ---
 ## Using as a Library
@@ -399,18 +436,19 @@ agent = Agent(extra_tool_classes=[LiveDBTool], tool_names=["QueryDB"])
 | Command | Description |
 |---|---|
-| `/help` | Show command reference |
+| `/branch [--rev N] [prompt]` | Open a new branch tab from the current (or earlier) checkpoint |
+| `/diff [--all]` | Show a side-by-side diff of changes in the worktree |
 | `/clear [--config F]` | Clear conversation; `--config` reloads agent from a JSON/YAML file |
+| `/tab [N]` | Jump to tab N |
 | `/history [--raw]` | Show conversation transcript; `--raw` shows the raw API message log |
-| `/diff [--all]` | Show a side-by-side diff of changes in the worktree |
-| `/errors` | Show timestamped error log for the current tab |
 | `/tools` | List active tools |
-| `/branch [--rev N] [prompt]` | Open a new branch tab from the current (or earlier) checkpoint |
 | `/abort` | Abort the running agent |
+| `/errors` | Show timestamped error log for the current tab |
 | `/export [path]` | Export session to JSON |
-| `/exit` or `/quit` | Close branch tab, or exit the app |
-| `!command` | Run a shell command; output captured in the TUI |
-| `!!command` | Run an interactive command (TUI suspends, terminal handed to process) |
+| `/exit [--all]` | Close branch tab, or exit the app |
+| `/help` | Show command reference |
+| `!command` | Run a shell command; output captured in the TUI (eg, `ls`, `cat hello.c`) |
+| `$command` | Run an interactive command (eg, `vim`, `yazi`, `less hello.c`) |
 ### Key bindings
@@ -418,9 +456,9 @@ agent = Agent(extra_tool_classes=[LiveDBTool], tool_names=["QueryDB"])
 |---|---|
 | `Enter` | Submit |
 | `Ctrl-J` | Insert newline |
-| `Alt-1` … `Alt-9` | Jump to tab N |
-| `Tab` / `Shift-Tab` | Cycle through tabs |
-| `Ctrl-C` | Abort running agent |
+| `Ctrl-1` … `Ctrl-9` | Jump to tab N |
+| `Ctrl-,` / `Ctrl-.` | Cycle through tabs |
+| `Ctrl-C` | Clear input, or abort running agent |
 | `Ctrl-D` | Close branch tab, or exit app |
 | `Ctrl-R` | Recall last prompt into editor |
@@ -438,16 +476,16 @@ agent = Agent(extra_tool_classes=[LiveDBTool], tool_names=["QueryDB"])
 | `Skill` | Retrieve named skill instructions from config |
 | `Agent` | Spawn an autonomous sub-agent for delegated work |
-All file tools enforce path sandboxing — the agent cannot read or write outside the worktree.
+All file tools enforce path sandboxing. The agent cannot read or write outside the worktree.
 ### Backends
 | Backend | Flag | Notes |
 |---------|------|-------|
-| MLX (local) | `--api noapi` | Default. Runs on-device, no API key needed |
+| MLX-LM (local) | `--api noapi` | Default. Runs on-device, no API key needed |
 | Claude | `--api claude` | Requires `ANTHROPIC_API_KEY` |
 | Gemini | `--api gemini` | Requires `GOOGLE_API_KEY` |
-| DeepSeek | `--api deepseek` | DeepSeek API or compatible endpoint |
+| DeepSeek | `--api deepseek` | Requires `DEEPSEEK_API_KEY` |
 | Codex | `--api codex` | OpenAI Codex CLI integration |
 | OpenAI | `--api openai` | Any OpenAI-compatible endpoint |
@@ -456,10 +494,13 @@ All file tools enforce path sandboxing — the agent cannot read or write outsid
 The local MLX server speaks OpenAI, Anthropic, and Gemini wire formats simultaneously, so you can use any compatible CLI as the frontend:
 ```bash
-mlc --leash claude       # claude CLI routes through local model
-mlc --leash codex        # codex CLI routes through local model
-mlc --leash gemini       # gemini CLI routes through local model
-mlc --leash none         # server only
+mlc                      # default
+mlc --web                # web UI (api.mlx-code.com)
+mlc --bare               # no TUI
+mlc --leash none         # no harness
+mlc --leash codex        # codex CLI
+mlc --leash gemini       # gemini CLI
+mlc --leash claude       # claude code
 ```
 ---

mlx_code-0.0.20/mlx_code/ntui.py → mlx_code-0.0.22/mlx_code/bare.py RENAMED Viewed

@@ -110,6 +110,7 @@ class SimpleRepl:
                 if out_text:
                     self._write_delta(prefix + out_text, 'tool_result')
                 self._last_stream_type = t
+                print()
             elif t == 'commit':
                 self._pending_nls = 0
                 self._awaiting_content = False

mlx-code 0.0.20__tar.gz → 0.0.22__tar.gz

mlx-code 0.0.20tar.gz → 0.0.22tar.gz