consult-llm-mcp 2.11.0 → 2.12.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CHANGELOG.md +17 -0
- package/README.md +74 -6
- package/package.json +5 -5
package/CHANGELOG.md
CHANGED
@@ -1,5 +1,22 @@
 # Changelog
 
+## v2.11.0 (2026-03-27)
+
+- Added multi-turn thread support for API backends. Threads are stored as JSON
+  files under `$XDG_STATE_HOME/consult-llm-mcp/threads/` and replayed as the
+  messages array on each call. Expired threads (>7 days) are cleaned up
+  automatically. All backends now support `thread_id`.
+- Fixed API cost tracking undercounting tokens for thinking models (e.g.
+  gemini-3.1-pro-preview). Thinking tokens excluded from `completion_tokens` are
+  now derived from `total_tokens`.
+- Monitor: show cost information in history table, detail view header, usage
+  separator lines, and thread detail header. Cost is only shown for API backend
+  consultations.
+- Monitor: show files as compact path list in detail view instead of inlined
+  file contents
+- Fixed `reasoning_effort` incorrectly showing for non-codex models on
+  cursor_cli backend
+
 ## v2.10.0 (2026-03-15)
 
 - Monitor: cycle between sibling consultations (started around the same time)
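The thread mechanics described in the changelog entry above reduce to a small persistence layer. A minimal sketch, assuming one `<thread_id>.json` file per thread and the conventional `~/.local/state` fallback for `$XDG_STATE_HOME` (neither detail is stated in the changelog):

```ts
import * as fs from "node:fs";
import * as os from "node:os";
import * as path from "node:path";

type Message = { role: "user" | "assistant"; content: string };

// Expired threads (>7 days) are cleaned up automatically.
const THREAD_TTL_MS = 7 * 24 * 60 * 60 * 1000;

// $XDG_STATE_HOME/consult-llm-mcp/threads/ per the changelog; the
// ~/.local/state fallback is the usual XDG default, assumed here.
const threadsDir = path.join(
  process.env.XDG_STATE_HOME ?? path.join(os.homedir(), ".local", "state"),
  "consult-llm-mcp",
  "threads",
);

// Load prior turns so they can be replayed as the messages array on each call.
function loadThread(threadId: string): Message[] {
  const file = path.join(threadsDir, `${threadId}.json`); // file naming assumed
  return fs.existsSync(file)
    ? (JSON.parse(fs.readFileSync(file, "utf8")) as Message[])
    : [];
}

// Persist the updated history once the backend call completes.
function saveThread(threadId: string, messages: Message[]): void {
  fs.mkdirSync(threadsDir, { recursive: true });
  fs.writeFileSync(
    path.join(threadsDir, `${threadId}.json`),
    JSON.stringify(messages, null, 2),
  );
}

// Delete thread files whose last modification is older than the TTL.
function cleanupExpiredThreads(): void {
  if (!fs.existsSync(threadsDir)) return;
  for (const name of fs.readdirSync(threadsDir)) {
    const file = path.join(threadsDir, name);
    if (Date.now() - fs.statSync(file).mtimeMs > THREAD_TTL_MS) {
      fs.unlinkSync(file);
    }
  }
}
```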
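The cost-tracking fix amounts to one subtraction: thinking tokens are whatever `total_tokens` reports beyond `prompt_tokens + completion_tokens`. A sketch assuming OpenAI-style usage fields and that thinking tokens bill at the output rate (the changelog states the derivation, not the rate):

```ts
// Usage as reported by OpenAI-style APIs; on thinking models,
// completion_tokens may exclude thinking tokens while total_tokens includes them.
interface Usage {
  prompt_tokens: number;
  completion_tokens: number;
  total_tokens: number;
}

// Derive thinking tokens from total_tokens, as the changelog describes.
function thinkingTokens(u: Usage): number {
  return Math.max(0, u.total_tokens - u.prompt_tokens - u.completion_tokens);
}

// Assumption: thinking tokens are billed at the output rate.
function estimateCostUsd(u: Usage, inPerMTok: number, outPerMTok: number): number {
  const outputTokens = u.completion_tokens + thinkingTokens(u);
  return (u.prompt_tokens * inPerMTok + outputTokens * outPerMTok) / 1e6;
}
```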
package/README.md
CHANGED
@@ -1,7 +1,7 @@
 # consult-llm-mcp
 
 An MCP server that lets Claude Code consult stronger AI models (GPT-5.4, Gemini
-3.1 Pro, DeepSeek Reasoner) when Sonnet has you running in circles and you need
+3.1 Pro, DeepSeek Reasoner, MiniMax M2.7) when Sonnet has you running in circles and you need
 to bring in the heavy artillery. Supports multi-turn conversations.
 
 ```
@@ -28,7 +28,8 @@ to bring in the heavy artillery. Supports multi-turn conversations.
 
 ## Features
 
-- Query powerful AI models (GPT-5.4, Gemini 3.1 Pro, DeepSeek Reasoner) with
+- Query powerful AI models (GPT-5.4, Gemini 3.1 Pro, DeepSeek Reasoner, MiniMax
+  M2.7) with
   relevant files as context
 - Include git changes for code review
 - Comprehensive logging with cost estimation (if using API)
@@ -37,6 +38,8 @@ to bring in the heavy artillery. Supports multi-turn conversations.
 - [Codex CLI backend](#codex-cli): Use the `codex` CLI for OpenAI models
 - [Cursor CLI backend](#cursor-cli): Use the `cursor-agent` CLI to route GPT and
   Gemini models through a single tool
+- [OpenCode CLI backend](#opencode-cli): Use `opencode` CLI with Copilot, OpenRouter,
+  or any of 75+ providers
 - [Multi-turn conversations](#multi-turn-conversations): Resume CLI sessions
   across requests with `thread_id`
 - [Web mode](#web-mode): Copy formatted prompts to clipboard for browser-based
@@ -85,6 +88,7 @@ to bring in the heavy artillery. Supports multi-turn conversations.
   -e OPENAI_API_KEY=your_openai_key \
   -e GEMINI_API_KEY=your_gemini_key \
   -e DEEPSEEK_API_KEY=your_deepseek_key \
+  -e MINIMAX_API_KEY=your_minimax_key \
   -- npx -y consult-llm-mcp
 ```
 
@@ -335,6 +339,7 @@ Each model is routed to a **backend** — either an API endpoint or a CLI tool.
 | **Gemini CLI** | Shells out to `gemini` CLI | Free quota (Gemini), existing subscriptions, or prefer CLI tools |
 | **Codex CLI** | Shells out to `codex` CLI | OpenAI models via Codex subscription |
 | **Cursor CLI** | Shells out to `cursor-agent` CLI | Route GPT and Gemini through one tool |
+| **OpenCode CLI** | Shells out to `opencode` CLI | Use Copilot subscription, OpenCode's 75+ providers |
 | **Web** | Copies prompt to clipboard | You prefer browser UIs or want to review prompts |
 
 ### API (default)
@@ -429,6 +434,58 @@ review), allow them in `~/.cursor/cli-config.json`:
 
 Glob patterns are supported. The `deny` list takes precedence over `allow`.
 
+#### OpenCode CLI
+
+Use [OpenCode](https://opencode.ai) as a backend to route models through any of
+its 75+ supported providers — including GitHub Copilot, OpenRouter, and local
+models via Ollama.
+
+**Requirements:**
+
+1. Install [OpenCode](https://opencode.ai/docs/installation/)
+2. Configure providers via `opencode providers`
+
+**Setup:**
+
+```bash
+# Route MiniMax models through OpenCode
+claude mcp add consult-llm \
+  -e CONSULT_LLM_MINIMAX_BACKEND=opencode \
+  -- npx -y consult-llm-mcp
+
+# Route OpenAI models through Copilot subscription
+claude mcp add consult-llm \
+  -e CONSULT_LLM_OPENAI_BACKEND=opencode \
+  -e CONSULT_LLM_OPENCODE_OPENAI_PROVIDER=copilot \
+  -- npx -y consult-llm-mcp
+
+# Route everything through OpenCode
+claude mcp add consult-llm \
+  -e CONSULT_LLM_OPENAI_BACKEND=opencode \
+  -e CONSULT_LLM_GEMINI_BACKEND=opencode \
+  -e CONSULT_LLM_DEEPSEEK_BACKEND=opencode \
+  -e CONSULT_LLM_MINIMAX_BACKEND=opencode \
+  -- npx -y consult-llm-mcp
+```
+
+The executor maps model IDs to OpenCode's `provider/model` format automatically.
+For example, `MiniMax-M2.7` becomes `opencode run --model minimax/MiniMax-M2.7`.
+
+**Provider prefix overrides:**
+
+By default, each provider family maps to its natural OpenCode provider ID
+(`openai`, `google`, `deepseek`, `minimax`). Override with per-family env vars
+when you want to route through a different OpenCode provider:
+
+- `CONSULT_LLM_OPENCODE_OPENAI_PROVIDER` — default: `openai`
+- `CONSULT_LLM_OPENCODE_GEMINI_PROVIDER` — default: `google`
+- `CONSULT_LLM_OPENCODE_DEEPSEEK_PROVIDER` — default: `deepseek`
+- `CONSULT_LLM_OPENCODE_MINIMAX_PROVIDER` — default: `minimax`
+- `CONSULT_LLM_OPENCODE_PROVIDER` — global fallback for all families
+
+For example, `CONSULT_LLM_OPENCODE_OPENAI_PROVIDER=copilot` turns
+`gpt-5.2` into `opencode run --model copilot/gpt-5.2`.
+
 #### Multi-turn conversations
 
 CLI backends support multi-turn conversations via the `thread_id` parameter. The
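The `provider/model` mapping in the OpenCode section above is compact enough to sketch. The default provider IDs come straight from the README; the helper name `opencodeArgs` and the positional prompt argument are assumptions of this sketch:

```ts
// Default OpenCode provider ID per model family, as listed in the README.
const DEFAULT_PROVIDERS = {
  openai: "openai",
  gemini: "google",
  deepseek: "deepseek",
  minimax: "minimax",
} as const;

type Family = keyof typeof DEFAULT_PROVIDERS;

// Build the documented `opencode run --model <provider>/<model>` invocation.
function opencodeArgs(family: Family, modelId: string, prompt: string): string[] {
  return ["run", "--model", `${DEFAULT_PROVIDERS[family]}/${modelId}`, prompt];
}

// opencodeArgs("minimax", "MiniMax-M2.7", "...")
//   -> ["run", "--model", "minimax/MiniMax-M2.7", "..."]
```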
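Continuing that sketch, the documented precedence for the provider prefix (per-family variable first, then the global `CONSULT_LLM_OPENCODE_PROVIDER` fallback, then the built-in default) reads as a chain of fallbacks:

```ts
// Per-family env var beats the global fallback, which beats the built-in default.
function resolveProvider(family: Family): string {
  return (
    process.env[`CONSULT_LLM_OPENCODE_${family.toUpperCase()}_PROVIDER`] ??
    process.env.CONSULT_LLM_OPENCODE_PROVIDER ??
    DEFAULT_PROVIDERS[family]
  );
}

// With CONSULT_LLM_OPENCODE_OPENAI_PROVIDER=copilot:
//   resolveProvider("openai") -> "copilot", so gpt-5.2 runs as copilot/gpt-5.2
```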
@@ -492,14 +549,20 @@ See the "Using web mode..." example above for a concrete transcript.
 - `GEMINI_API_KEY` - Your Google AI API key (required for Gemini models in API
   mode)
 - `DEEPSEEK_API_KEY` - Your DeepSeek API key (required for DeepSeek models)
+- `MINIMAX_API_KEY` - Your MiniMax API key (required for MiniMax models)
 - `CONSULT_LLM_DEFAULT_MODEL` - Override the default model (optional)
-  - Accepts selectors (`gemini`, `openai`, `deepseek`) or exact model
+  - Accepts selectors (`gemini`, `openai`, `deepseek`, `minimax`) or exact model
+    IDs
     (`gpt-5.4`, `gemini-3.1-pro-preview`, etc.)
   - Selectors are resolved to the best available model at startup
 - `CONSULT_LLM_GEMINI_BACKEND` - Backend for Gemini models (optional)
-  - Options: `api` (default), `gemini-cli`, `cursor-cli`
+  - Options: `api` (default), `gemini-cli`, `cursor-cli`, `opencode`
 - `CONSULT_LLM_OPENAI_BACKEND` - Backend for OpenAI models (optional)
-  - Options: `api` (default), `codex-cli`, `cursor-cli`
+  - Options: `api` (default), `codex-cli`, `cursor-cli`, `opencode`
+- `CONSULT_LLM_DEEPSEEK_BACKEND` - Backend for DeepSeek models (optional)
+  - Options: `api` (default), `opencode`
+- `CONSULT_LLM_MINIMAX_BACKEND` - Backend for MiniMax models (optional)
+  - Options: `api` (default), `opencode`
 - `CONSULT_LLM_ALLOWED_MODELS` - Restrict which concrete models can be used
   (optional)
   - Comma-separated list, e.g., `gpt-5.4,gemini-3.1-pro-preview`
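How a selector such as `minimax` becomes a concrete model at startup is only described as "best available". A hypothetical sketch, with the candidate ordering and the `isAvailable` check invented for illustration (only the model IDs come from this README):

```ts
// Candidate order per selector is a guess; higher-priority models come first.
const SELECTOR_CANDIDATES: Record<string, string[]> = {
  openai: ["gpt-5.4", "gpt-5.2"],
  gemini: ["gemini-3.1-pro-preview", "gemini-3-pro-preview"],
  deepseek: ["deepseek-reasoner"],
  minimax: ["MiniMax-M2.7"],
};

// isAvailable might check for an API key or a configured CLI backend.
function resolveDefaultModel(
  value: string,
  isAvailable: (modelId: string) => boolean,
): string {
  const candidates = SELECTOR_CANDIDATES[value];
  if (!candidates) return value; // exact model ID, pass through unchanged
  const match = candidates.find(isAvailable);
  if (!match) throw new Error(`No available model for selector "${value}"`);
  return match;
}
```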
@@ -511,10 +574,14 @@ See the "Using web mode..." example above for a concrete transcript.
   - Comma-separated list, e.g., `grok-3,kimi-k2.5`
   - Merged with built-in models and included in the tool schema
   - Useful for newly released models with a known provider prefix (`gpt-`,
-    `gemini-`, `deepseek-`)
+    `gemini-`, `deepseek-`, `MiniMax-`)
 - `CONSULT_LLM_CODEX_REASONING_EFFORT` - Configure reasoning effort for Codex
   CLI (optional, default: `high`)
   - See [Codex CLI](#codex-cli) for details and available options
+- `CONSULT_LLM_OPENCODE_PROVIDER` - Global OpenCode provider prefix (optional)
+  - Overrides the default provider ID for all families when using the `opencode`
+    backend
+  - See [OpenCode CLI](#opencode-cli) for details and per-family overrides
 - `CONSULT_LLM_SYSTEM_PROMPT_PATH` - Custom path to system prompt file
   (optional)
   - Overrides the default `~/.consult-llm-mcp/SYSTEM_PROMPT.md` location
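The known provider prefixes (`gpt-`, `gemini-`, `deepseek-`, `MiniMax-`) imply that custom models are routed by prefix match. A hypothetical sketch; the actual detection logic is not shown in this diff:

```ts
type ModelFamily = "openai" | "gemini" | "deepseek" | "minimax";

// Map a custom model ID onto a provider family by its documented prefix.
function familyForModel(modelId: string): ModelFamily | undefined {
  if (modelId.startsWith("gpt-")) return "openai";
  if (modelId.startsWith("gemini-")) return "gemini";
  if (modelId.startsWith("deepseek-")) return "deepseek";
  if (modelId.startsWith("MiniMax-")) return "minimax";
  return undefined; // unknown prefix: not routable without explicit config
}
```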
@@ -645,6 +712,7 @@ models complex questions.
 - **gemini-3-pro-preview**: Google's Gemini 3 Pro Preview
 - **gemini-3.1-pro-preview**: Google's Gemini 3.1 Pro Preview
 - **deepseek-reasoner**: DeepSeek's reasoning model
+- **MiniMax-M2.7**: MiniMax's M2.7 reasoning model (204K context)
 - **gpt-5.4**: OpenAI's GPT-5.4 model
 - **gpt-5.2**: OpenAI's GPT-5.2 model
 - **gpt-5.3-codex**: OpenAI's Codex model based on GPT-5.3
package/package.json
CHANGED
@@ -1,6 +1,6 @@
 {
   "name": "consult-llm-mcp",
-  "version": "2.11.0",
+  "version": "2.12.0",
   "description": "MCP server for consulting powerful AI models",
   "repository": {
     "type": "git",
@@ -31,9 +31,9 @@
     "ai"
   ],
   "optionalDependencies": {
-    "consult-llm-mcp-darwin-arm64": "2.11.0",
-    "consult-llm-mcp-darwin-x64": "2.11.0",
-    "consult-llm-mcp-linux-x64": "2.11.0",
-    "consult-llm-mcp-linux-arm64": "2.11.0"
+    "consult-llm-mcp-darwin-arm64": "2.12.0",
+    "consult-llm-mcp-darwin-x64": "2.12.0",
+    "consult-llm-mcp-linux-x64": "2.12.0",
+    "consult-llm-mcp-linux-arm64": "2.12.0"
   }
 }