nex-code 0.3.57 → 0.3.59

Files changed (3)

package/README.md CHANGED

@@ -1,6 +1,6 @@
 ```
  ██▄▄██   nex-code  v0.3.54
- █▀██▀█   qwen3-coder:480b  ·  /help
+ █▀██▀█   devstral-2:123b  ·  /help
  ▀████▀
 ```
@@ -66,47 +66,88 @@ npm update -g nex-code
 ## Why nex-code?
-| | **nex-code** | Claude Code | Gemini CLI | Aider |
-|---|---|---|---|---|
-| **VS Code extension** | ✅ Built-in sidebar panel | ✅ | ❌ | ❌ |
-| **Free with Ollama** | ✅ Native, first-class | ⚠️ Workaround | ❌ | ✅ |
-| **Ollama Cloud support** | ✅ 47+ models, native | ⚠️ API-compat only | ❌ | ✅ |
-| **Multi-provider runtime swap** | ✅ 5 providers, no restart | ❌ Claude-only | ❌ Gemini-only | ✅ |
-| **Tool tiers (adapts to model)** | ✅ essential/standard/full | ❌ | ❌ | ❌ |
-| **5-layer open-model auto-fix** | ✅ | ❌ | ❌ | ⚠️ |
-| **Undo / Redo (persistent)** | ✅ Survives restart | ❌ | ❌ | ❌ |
-| **Cost tracking + budgets** | ✅ | ❌ | ❌ | ❌ |
-| **Pre-push secret detection** | ✅ | ❌ | ❌ | ❌ |
-| **Browser agent (headless)** | ✅ Playwright-based | ❌ | ⚠️ Experimental | ❌ |
-| **Grounded web search** | ✅ Perplexity/DDG | ❌ | ✅ Google grounded | ❌ |
-| **GitHub Actions tools** | ✅ native | ❌ | ❌ | ❌ |
-| **SSH server management** | ✅ native (AlmaLinux/macOS) | ❌ | ❌ | ❌ |
-| **Docker tools** | ✅ local + remote via SSH | ❌ | ❌ | ❌ |
-| **Deploy tool (rsync)** | ✅ named configs | ❌ | ❌ | ❌ |
-| **Open-source** | ✅ MIT | ❌ | ✅ Apache 2.0 | ✅ |
-| **Runtime dependencies** | **2** (axios, dotenv) | Many | Many | Heavy (Python) |
-| **Startup time** | **~100ms** | ~400ms | ~300ms | Slow |
-| **Plugin API** | ✅ registerTool + hooks | ❌ | ❌ | ❌ |
-| **Skill marketplace** | ✅ Install from git | ❌ | ❌ | ❌ |
-| **Audit logging** | ✅ JSONL + sanitization | ❌ | ❌ | ❌ |
-| **Test coverage** | 3074 tests, 85% | — | — | — |
+**Provider-agnostic by design.** Run fully free with a local Ollama server, use Ollama Cloud's 47+ models on a flat-rate plan, or connect OpenAI, Anthropic, or Gemini — switch at runtime with `/model`, no restart needed. The fallback chain automatically retries failed requests on the next configured provider.
+**Open-model first.** nex-code was built around open models, not locked to any single vendor. Tool tiers (`essential / standard / full`) adapt automatically to the model's capability level, so smaller models don't receive tool schemas they can't handle. A 5-layer auto-fix loop catches and retries malformed tool calls without user intervention.
+**Smart model routing.** The built-in `/benchmark` system tests all configured models against 33 real nex-code tool-calling tasks across 5 task categories. The results feed a routing table so nex-code can automatically switch to the best model for the detected task type:
+| Detected task | Routed model (example) |
+|---|---|
+| Frontend / CSS / React | `qwen3-coder:480b` |
+| Sysadmin / Docker / nginx | `devstral-2:123b` |
+| Data / SQL / migrations | `devstral-2:123b` |
+| Agentic swarms | `minimax-m2.7:cloud` |
+| General coding | `devstral-2:123b` (default) |
+**Built-in VS Code extension.** A sidebar chat panel with streaming output, collapsible tool cards, and native VS Code theme support — shipped in the same repo, no separate install.
+**Lightweight.** 2 runtime dependencies (`axios`, `dotenv`). Starts in ~100ms. No Python, no heavy runtime, no daemon process.
+**Infrastructure tools built in:**
+- SSH server management (AlmaLinux, macOS, any Linux)
+- Docker tools — local and remote via SSH
+- Kubernetes overview (`/k8s`)
+- GitHub Actions tools (trigger, monitor runs)
+- Named deploy configs (`rsync`-based, `/deploy`)
+- Browser agent via Playwright (optional, not bundled)
+- Grounded web search via Perplexity or DuckDuckGo
+**Developer safety:**
+- Pre-push secret detection — blocks commits that contain API keys or tokens
+- Full audit log (JSONL + sanitization)
+- Undo/Redo with persistence across restarts
+- Cost tracking and per-provider budget limits
+- Plan mode — analysis-only pass before any file changes
+**Extensible.** Plugin API (`registerTool` + lifecycle hooks), skill system (install from any git URL), MCP server support.
+**Tested.** 3074 tests, 85% coverage, CI on every push.
 ---
-## Ollama Cloud — The Free-by-Default Model Tier
+## Ollama Cloud — Recommended Model Setup
 nex-code was built with Ollama Cloud as its primary provider. No subscription, no billing surprises.
-Use powerful open models like **Qwen3 Coder**, **Kimi K2.5**, **Devstral**, and **DeepSeek R1** for free.
+Rankings are based on nex-code's own `/benchmark` — 15 tool-calling tasks against real nex-code schemas.
+### Flat-Rate / Pay-as-you-go
+<!-- nex-benchmark-start -->
+<!-- Updated: 2026-03-20 — run `/benchmark --discover` after new Ollama Cloud releases -->
-| Model | Context | Best For |
-|---|---|---|
-| `qwen3-coder:480b` | 131K | Code generation, tool calling |
-| `kimi-k2.5` | 256K | Large repos, reasoning |
-| `devstral-2:123b` | 131K | Reliable tool calling |
-| `devstral-small-2:24b` | 131K | Fast, efficient |
-| `qwen3.5:35b-a3b` | 256K | MoE, very fast |
+| Rank | Model | Score | Avg Latency | Context | Best For |
+|---|---|---|---|---|---|
+> Rankings are nex-code-specific: tool name accuracy, argument validity, schema compliance.
+> Toolathon (Minimax SOTA) measures different task types — run `/benchmark --discover` after model releases.
+<!-- nex-benchmark-end -->
+### Recommended `.env` for Ollama Cloud (Flat-Rate)
+```env
+DEFAULT_PROVIDER=ollama
+DEFAULT_MODEL=devstral-2:123b         # nex-code benchmark winner (84/100, 1.5s)
+# Sub-agent routing
+NEX_HEAVY_MODEL=qwen3-coder:480b      # complex multi-step coding
+NEX_STANDARD_MODEL=devstral-2:123b    # routine tasks
+NEX_FAST_MODEL=devstral-small-2:24b   # quick lookups, fast sub-agents
+```
+### Run the benchmark yourself
+```bash
+/benchmark             # full run: 15 tasks × 5 models
+/benchmark --quick     # fast run: 7 tasks × 3 models
+/benchmark --discover  # detect new Ollama Cloud models, benchmark + auto-update README
+/benchmark --models=minimax-m2.7:cloud,qwen3-coder:480b
+/benchmark --history   # show OpenClaw nightly trend
+```
-Switch anytime: `/model ollama:qwen3-coder:480b` or add your `OLLAMA_API_KEY` to `.env`.
+Switch anytime: `/model devstral-2:123b` or update `DEFAULT_MODEL` in `.env`.
+Auto-discovery runs weekly via the scheduled improvement task and updates this table automatically.
 ---
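For illustration only (not part of the diff): the routing table added in the hunk above can be read as a lookup from a detected task type to a model name, with the general-coding model as the default. The TypeScript below is a minimal sketch under that assumption. `TaskType`, `ROUTING_TABLE`, and `routeModel` are hypothetical names, not nex-code's actual API, and the model names are the example values from the README table.

```typescript
// Hypothetical sketch: these names are illustrative, not nex-code's real API.
type TaskType = "frontend" | "sysadmin" | "data" | "agentic" | "general";

// Example values copied from the README's "Detected task / Routed model" table.
const ROUTING_TABLE: Record<TaskType, string> = {
  frontend: "qwen3-coder:480b",
  sysadmin: "devstral-2:123b",
  data: "devstral-2:123b",
  agentic: "minimax-m2.7:cloud",
  general: "devstral-2:123b", // default
};

// Return the model for a detected task type; unknown types use the default.
function routeModel(
  task: string,
  table: Record<string, string> = ROUTING_TABLE,
): string {
  // Own-property check so inherited keys like "toString" never match.
  const known = Object.prototype.hasOwnProperty.call(table, task);
  return known ? table[task] : table["general"];
}
```

In this sketch a benchmark run would only need to rewrite the table's values. How nex-code actually stores and updates its routing data is not shown in this diff.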
@@ -155,7 +196,7 @@ PERPLEXITY_API_KEY=your-key   # Perplexity (optional — enables grounded web se
 # Optional tuning
 DEFAULT_PROVIDER=ollama        # Active provider on startup
-DEFAULT_MODEL=qwen3-coder:480b # Active model on startup
+DEFAULT_MODEL=devstral-2:123b  # Active model on startup (see /benchmark for ranking)
 FALLBACK_CHAIN=anthropic,openai # Providers tried on failure (comma-separated)
 NEX_STALE_WARN_MS=60000        # Warn if no tokens received for N ms (default: 60000)
 NEX_STALE_ABORT_MS=120000      # Abort and retry stream after N ms of silence (default: 120000)
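For illustration only (not part of the diff): `FALLBACK_CHAIN` is documented above as a comma-separated list of providers tried on failure. A minimal TypeScript sketch of that behaviour, assuming the active provider is tried first and each fallback is tried in order, might look like the following. `parseFallbackChain`, `requestWithFallback`, and the `send` callback are hypothetical names, not functions exported by nex-code.

```typescript
// Hypothetical sketch: helper names are illustrative, not nex-code's real API.

// Split a comma-separated env value into provider names, dropping blanks.
function parseFallbackChain(raw: string | undefined): string[] {
  return (raw ?? "")
    .split(",")
    .map((name) => name.trim())
    .filter((name) => name.length > 0);
}

// Try the primary provider, then each fallback in order.
// Resolves with the first success; rejects with the last error seen.
async function requestWithFallback<T>(
  primary: string,
  fallbacks: string[],
  send: (provider: string) => Promise<T>,
): Promise<T> {
  let lastError: unknown;
  for (const provider of [primary, ...fallbacks]) {
    try {
      return await send(provider);
    } catch (err) {
      lastError = err; // remember the failure and move to the next provider
    }
  }
  throw lastError;
}
```

A caller would pass something like `parseFallbackChain(process.env.FALLBACK_CHAIN)` as the `fallbacks` argument. Whether nex-code also retries the same provider before moving on is not stated in the diff.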
@@ -299,7 +340,7 @@ npm run package        # syncs version, builds, and creates .vsix
 |---------|---------|-------------|
 | `nexCode.executablePath` | `nex-code` | Path to the nex-code binary |
 | `nexCode.defaultProvider` | `ollama` | LLM provider |
-| `nexCode.defaultModel` | `qwen3-coder:480b` | Model name |
+| `nexCode.defaultModel` | `devstral-2:123b` | Model name |
 | `nexCode.anthropicApiKey` | — | Anthropic API key |
 | `nexCode.openaiApiKey` | — | OpenAI API key |
 | `nexCode.ollamaApiKey` | — | Ollama Cloud API key |
@@ -400,7 +441,7 @@ Type `/` to see inline suggestions as you type. Tab completion is supported for
 | `/review [--strict] [file]` | Deep code review: 3-phase protocol (broad scan → grep deep-dive → report), score table, diff fix snippets. `--strict` forces ≥3 critical findings. |
 | `/k8s [user@host]` | Kubernetes overview: namespaces + pod health (remote via SSH optional) |
 | `/setup` | Interactive setup wizard — configure provider, API keys, web search |
-| `/benchmark` | Show model benchmark results (7-day trend) |
+| `/benchmark [--quick\|--discover\|--history]` | Rank models on nex-code tool-calling tasks, auto-update routing |
 | `/install-skill <url>` | Install a skill from a git repo |
 | `/search-skill <query>` | Search GitHub for nex-code skills |
 | `/remove-skill <name>` | Remove an installed skill |