npm - nex-code - Versions diffs - 0.4.15 → 0.4.17 - Mend

nex-code 0.4.15 → 0.4.17

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

package/README.md CHANGED Viewed

@@ -19,7 +19,7 @@
   <img src="https://img.shields.io/badge/Ollama_Cloud-supported-brightgreen.svg" alt="Ollama Cloud: supported">
   <img src="https://img.shields.io/badge/node-%3E%3D18-brightgreen.svg" alt="Node >= 18">
   <img src="https://img.shields.io/badge/dependencies-2-green.svg" alt="Dependencies: 2">
-  <img src="https://img.shields.io/badge/tests-3074-blue.svg" alt="Tests: 3074">
+  <img src="https://img.shields.io/badge/tests-3453-blue.svg" alt="Tests: 3453">
   <img src="https://img.shields.io/badge/VS_Code-extension-007ACC.svg" alt="VS Code extension">
 </p>
@@ -107,7 +107,7 @@ npm update -g nex-code
 **Extensible.** Plugin API (`registerTool` + lifecycle hooks), skill system (install from any git URL), MCP server support.
-**Tested.** 3074 tests, 85% coverage, CI on every push.
+**Tested.** 3453 tests, 79% coverage, CI on every push.
 ---
@@ -119,19 +119,18 @@ Rankings are based on nex-code's own `/benchmark` — 15 tool-calling tasks agai
 ### Flat-Rate / Pay-as-you-go
 <!-- nex-benchmark-start -->
-<!-- Updated: 2026-03-20 — run `/benchmark --discover` after new Ollama Cloud releases -->
+<!-- Updated: 2026-03-26 — run `/benchmark --discover` after new Ollama Cloud releases -->
-| Rank | Model                  | Score  | Avg Latency | Context | Best For                                              |
-| ---- | ---------------------- | ------ | ----------- | ------- | ----------------------------------------------------- |
-| 🥇   | `devstral-2:123b`      | **84** | 1.5s        | 131K    | Default — fastest + most reliable tool selection      |
-| 🥈   | `qwen3-coder:480b`     | 79     | 2.9s        | 131K    | Coding-heavy sessions, heavy sub-agents               |
-| 🥉   | `kimi-k2:1t`           | 79     | 2.7s        | 256K    | Large repos (>100K tokens)                            |
-| —    | `minimax-m2.7:cloud`   | 73     | 3.5s        | 200K    | Complex swarm / multi-agent sessions (Toolathon SOTA) |
-| —    | `devstral-small-2:24b` | 73     | 1.0s        | 131K    | Fast sub-agents, simple lookups                       |
+| Rank | Model | Score | Avg Latency | Context | Best For |
+|---|---|---|---|---|---|
+| 🥇 | `devstral-2:123b` | **82.5** | 1.7s | 131K | Default — fastest + most reliable tool selection |
+| 🥈 | `devstral-small-2:24b` | 75 | 3.1s | 131K | Fast sub-agents, simple lookups |
+| 🥉 | `qwen3-coder:480b` | 72.5 | 8.4s | 131K | Coding-heavy sessions, heavy sub-agents |
+| — | `kimi-k2:1t` | 67.5 | 6.6s | 256K | Large repos (>100K tokens) |
+| — | `minimax-m2.7:cloud` | 64.1 | 5.0s | 200K | Complex swarm / multi-agent sessions (Toolathon SOTA) |
 > Rankings are nex-code-specific: tool name accuracy, argument validity, schema compliance.
 > Toolathon (Minimax SOTA) measures different task types — run `/benchmark --discover` after model releases.
 <!-- nex-benchmark-end -->
 ### Recommended `.env` for Ollama Cloud (Flat-Rate)
@@ -332,20 +331,20 @@ nex-code --prompt-file /tmp/task.txt --yolo --json
 # → {"success":true,"response":"..."}
 ```
-| Flag                   | Description                                                                                                   |
-| ---------------------- | ------------------------------------------------------------------------------------------------------------- |
-| `--task <prompt>`      | Run a single prompt and exit                                                                                  |
-| `--prompt-file <path>` | Read prompt from a UTF-8 file and run headless                                                                |
-| `--delete-prompt-file` | Delete the prompt file after reading (use with `--prompt-file`)                                               |
-| `--auto`               | Skip confirmations (non-interactive, no REPL banner)                                                          |
-| `--yolo`               | Skip all confirmations including dangerous commands (also configurable via `.nex/config.json` `"yolo": true`) |
-| `--server`             | Start JSON-lines IPC server (used by the VS Code extension)                                                   |
-| `--json`               | Output `{"success":true,"response":"..."}` to stdout                                                          |
-| `--max-turns <n>`      | Override the agentic loop iteration limit                                                                     |
-| `--model <spec>`       | Use a specific model (e.g. `anthropic:claude-sonnet-4-6`)                                                     |
-| `--debug`              | Show internal diagnostic messages (compression, loop detection, guards)                                       |
-| `--auto-orchestrate`   | Automatically use the multi-agent orchestrator when ≥3 goals are detected (also: `NEX_AUTO_ORCHESTRATE=true`) |
-| `--orchestrator-model <m>` | Model for decomposition/synthesis step (default: `kimi-k2.5`)                                            |
+| Flag                       | Description                                                                                                   |
+| -------------------------- | ------------------------------------------------------------------------------------------------------------- |
+| `--task <prompt>`          | Run a single prompt and exit                                                                                  |
+| `--prompt-file <path>`     | Read prompt from a UTF-8 file and run headless                                                                |
+| `--delete-prompt-file`     | Delete the prompt file after reading (use with `--prompt-file`)                                               |
+| `--auto`                   | Skip confirmations (non-interactive, no REPL banner)                                                          |
+| `--yolo`                   | Skip all confirmations including dangerous commands (also configurable via `.nex/config.json` `"yolo": true`) |
+| `--server`                 | Start JSON-lines IPC server (used by the VS Code extension)                                                   |
+| `--json`                   | Output `{"success":true,"response":"..."}` to stdout                                                          |
+| `--max-turns <n>`          | Override the agentic loop iteration limit                                                                     |
+| `--model <spec>`           | Use a specific model (e.g. `anthropic:claude-sonnet-4-6`)                                                     |
+| `--debug`                  | Show internal diagnostic messages (compression, loop detection, guards)                                       |
+| `--no-auto-orchestrate`    | Disable auto-orchestration for multi-goal prompts (on by default; also: `NEX_AUTO_ORCHESTRATE=false`)         |
+| `--orchestrator-model <m>` | Model for decomposition/synthesis step (default: `kimi-k2.5`)                                                 |
 ---
@@ -965,19 +964,19 @@ Spawn parallel sub-agents for independent tasks:
 For complex tasks with multiple independent goals (e.g. "fix all TypeScript errors in auth/, add tests for utils/, and update the README"), the orchestrator decomposes the prompt into parallel sub-tasks, runs dedicated sub-agents on each, and synthesizes the results.
-**Trigger:**
+**Auto-orchestration is on by default** for prompts with ≥3 goals.
 ```bash
-# One-off: pass flag on the CLI
-nex-code --auto-orchestrate --task "fix all type errors in src/, add JSDoc to utils/, update CHANGELOG"
+# Just use it — multi-goal prompts auto-decompose into parallel agents
+nex-code --task "fix all type errors in src/, add JSDoc to utils/, update CHANGELOG"
-# Headless with custom orchestrator model
-nex-code --auto-orchestrate --orchestrator-model kimi-k2.5 --task "..."
+# Custom orchestrator model
+nex-code --orchestrator-model kimi-k2.5 --task "..."
-# Always-on via env var
-NEX_AUTO_ORCHESTRATE=true nex-code
+# Disable auto-orchestration
+NEX_AUTO_ORCHESTRATE=false nex-code
-# Set minimum goal count before orchestrator activates (default: 3)
+# Lower the goal threshold (default: 3)
 NEX_ORCHESTRATE_THRESHOLD=2 nex-code
 ```
@@ -1011,17 +1010,17 @@ Suggested commit: fix: resolve auth type errors and add utility docs
 **Env vars:**
-| Variable                  | Default       | Description                                              |
-| ------------------------- | ------------- | -------------------------------------------------------- |
-| `NEX_AUTO_ORCHESTRATE`    | `false`       | Set to `true` to always use the orchestrator             |
-| `NEX_ORCHESTRATE_THRESHOLD` | `3`         | Minimum number of detected goals before auto-triggering  |
+| Variable                    | Default | Description                                             |
+| --------------------------- | ------- | ------------------------------------------------------- |
+| `NEX_AUTO_ORCHESTRATE`      | `true`  | Set to `false` to disable auto-orchestration            |
+| `NEX_ORCHESTRATE_THRESHOLD` | `3`     | Minimum number of detected goals before auto-triggering |
 **Model roles in orchestration:**
-| Role           | Default model       | Purpose                                      |
-| -------------- | ------------------- | -------------------------------------------- |
-| Orchestrator   | `kimi-k2.5`         | Decomposes prompt, synthesizes results       |
-| Worker         | `devstral-2:123b`   | Executes each sub-task (one agent per task)  |
+| Role         | Default model     | Purpose                                     |
+| ------------ | ----------------- | ------------------------------------------- |
+| Orchestrator | `kimi-k2.5`       | Decomposes prompt, synthesizes results      |
+| Worker       | `devstral-2:123b` | Executes each sub-task (one agent per task) |
 Override via `--orchestrator-model` (orchestrator) or `DEFAULT_MODEL` / `NEX_STANDARD_MODEL` (workers).
@@ -1554,9 +1553,9 @@ npm test              # Run all tests with coverage
 npm run test:watch    # Watch mode
 ```
-57 test suites, 2059 tests, 84% statement / 77% branch coverage.
+83 test suites, 3453 tests, 79% statement / 71% branch coverage.
-CI runs on GitHub Actions (Node 18/20/22).
+CI runs on GitHub Actions (Node 20 LTS).
 ---