npm - consult-llm-mcp - Versions diffs - 1.4.2 → 1.4.4 - Mend

consult-llm-mcp 1.4.2 → 1.4.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (8) hide show

package/README.md CHANGED Viewed

@@ -24,8 +24,8 @@ This SQL query is timing out on large datasets. Can you help optimize it? Ask Ge
 ## Features
-- Query powerful AI models (o3, Gemini 2.5 Pro, DeepSeek Reasoner, GPT-5.1
-  Codex) with relevant files as context
+- Query powerful AI models (o3, Gemini 2.5 Pro, Gemini 3 Pro Preview, DeepSeek
+  Reasoner, GPT-5.1 Codex) with relevant files as context
 - Direct queries with optional file context
 - Include git changes for code review and analysis
 - Comprehensive logging with cost estimation
@@ -239,7 +239,7 @@ This is useful when:
 </details>
-## Web Mode
+## Web mode
 When you want Claude Code to prepare the prompt but send it through an LLM web
 UI yourself (ChatGPT, Claude.ai, Gemini, etc.), ask it to "use consult LLM with
@@ -260,7 +260,7 @@ wherever you like.
 See the "Using web mode..." example above for a concrete transcript of this
 flow.
-## Gemini CLI Mode
+## Gemini CLI mode
 Use Gemini's local CLI when you want to take advantage of Google's free quota or
 keep prompts off the API by enabling CLI mode so consult-llm spawns the `gemini`
@@ -283,7 +283,7 @@ binary locally rather than sending the prompt through the API.
      use). It will call `consult_llm` with the Gemini model, assemble the
      prompt, and shell out to the CLI automatically.
-## Codex CLI Mode
+## Codex CLI mode
 Use OpenAI's Codex CLI when you want to use OpenAI models locally through the
 CLI instead of making API calls.
@@ -304,7 +304,7 @@ CLI instead of making API calls.
      call `consult_llm` with the specified model, assemble the prompt, and shell
      out to the Codex CLI automatically.
-### Configuring Reasoning Effort
+### Configuring reasoning effort
 When using Codex CLI mode, you can control the reasoning effort level using the
 `CODEX_REASONING_EFFORT` environment variable:
@@ -325,7 +325,7 @@ longer to complete. This is passed to the Codex CLI as
 ## Configuration
-### Environment Variables
+### Environment variables
 - `OPENAI_API_KEY` - Your OpenAI API key (required for OpenAI models in API
   mode)
@@ -333,8 +333,9 @@ longer to complete. This is passed to the Codex CLI as
   mode)
 - `DEEPSEEK_API_KEY` - Your DeepSeek API key (required for DeepSeek models)
 - `CONSULT_LLM_DEFAULT_MODEL` - Override the default model (optional)
-  - Options: `o3` (default), `gemini-2.5-pro`, `deepseek-reasoner`,
-    `gpt-5.1-codex-max`, `gpt-5.1-codex`, `gpt-5.1-codex-mini`, `gpt-5.1`
+  - Options: `o3` (default), `gemini-2.5-pro`, `gemini-3-pro-preview`,
+    `deepseek-reasoner`, `gpt-5.1-codex-max`, `gpt-5.1-codex`,
+    `gpt-5.1-codex-mini`, `gpt-5.1`
 - `GEMINI_MODE` - Choose between API or CLI mode for Gemini models (optional)
   - Options: `api` (default), `cli`
   - CLI mode uses the system-installed `gemini` CLI tool
@@ -344,7 +345,7 @@ longer to complete. This is passed to the Codex CLI as
 - `CODEX_REASONING_EFFORT` - Configure reasoning effort for Codex CLI (optional)
   - See [Codex CLI Mode](#codex-cli-mode) for details and available options
-### Custom System Prompt
+### Custom system prompt
 You can customize the system prompt used when consulting LLMs by creating a
 `SYSTEM_PROMPT.md` file in `~/.consult-llm-mcp/`:
@@ -359,7 +360,7 @@ request, so changes take effect immediately without restarting the server.
 To revert to the default prompt, simply delete the `SYSTEM_PROMPT.md` file.
-## MCP Tool: consult_llm
+## MCP tool: consult_llm
 The server provides a single tool called `consult_llm` for asking powerful AI
 models complex questions.
@@ -372,8 +373,8 @@ models complex questions.
   - All files are added as context with file paths and code blocks
 - **model** (optional): LLM model to use
-  - Options: `o3` (default), `gemini-2.5-pro`, `deepseek-reasoner`,
-    `gpt-5.1-codex`, `gpt-5.1-codex-mini`, `gpt-5.1`
+  - Options: `o3` (default), `gemini-2.5-pro`, `gemini-3-pro-preview`,
+    `deepseek-reasoner`, `gpt-5.1-codex`, `gpt-5.1-codex-mini`, `gpt-5.1`
 - **web_mode** (optional): Copy prompt to clipboard instead of querying LLM
   - Default: `false`
@@ -387,10 +388,12 @@ models complex questions.
     directory)
   - **base_ref** (optional): Git reference to compare against (defaults to HEAD)
-## Supported Models
+## Supported models
 - **o3**: OpenAI's reasoning model ($2/$8 per million tokens)
 - **gemini-2.5-pro**: Google's Gemini 2.5 Pro ($1.25/$10 per million tokens)
+- **gemini-3-pro-preview**: Google's Gemini 3 Pro Preview ($2/$12 per million
+  tokens for prompts ≤200k tokens, $4/$18 for prompts >200k tokens)
 - **deepseek-reasoner**: DeepSeek's reasoning model ($0.55/$2.19 per million
   tokens)
 - **gpt-5.1-codex**: OpenAI's Codex model optimized for coding
@@ -468,7 +471,7 @@ CRITICAL: When asking, don't present options, this will bias the answer.
 Claude Code seems to know pretty well when to use this MCP even without this
 instruction however.
-## Example Skill
+## Example skill
 Here's an example [Claude Code skill](https://code.claude.com/docs/en/skills)
 that uses the `consult_llm` MCP tool to create commands like "ask gemini" or
@@ -531,6 +534,10 @@ When consulting with external LLMs:
 Save this as `~/.claude/skills/consult-llm/SKILL.md` and you can then use it by
 typing "ask gemini about X" or "ask codex about X" in Claude Code.
+This one is not strictly necessary either, Claude (or other agent) can infer
+from the schema that "Ask gemini" should call this MCP, but it might be helpful
+in case you want to have more precise control over how the agent calls this MCP.
 ## Development
 To work on the MCP server locally and use your development version:

package/dist/config.d.ts CHANGED Viewed

@@ -6,6 +6,7 @@ declare const Config: z.ZodObject<{
     defaultModel: z.ZodOptional<z.ZodEnum<{
         o3: "o3";
         "gemini-2.5-pro": "gemini-2.5-pro";
+        "gemini-3-pro-preview": "gemini-3-pro-preview";
         "deepseek-reasoner": "deepseek-reasoner";
         "gpt-5.1-codex-max": "gpt-5.1-codex-max";
         "gpt-5.1-codex": "gpt-5.1-codex";
@@ -36,7 +37,7 @@ export declare const config: {
     openaiApiKey?: string | undefined;
     geminiApiKey?: string | undefined;
     deepseekApiKey?: string | undefined;
-    defaultModel?: "o3" | "gemini-2.5-pro" | "deepseek-reasoner" | "gpt-5.1-codex-max" | "gpt-5.1-codex" | "gpt-5.1-codex-mini" | "gpt-5.1" | undefined;
+    defaultModel?: "o3" | "gemini-2.5-pro" | "gemini-3-pro-preview" | "deepseek-reasoner" | "gpt-5.1-codex-max" | "gpt-5.1-codex" | "gpt-5.1-codex-mini" | "gpt-5.1" | undefined;
     codexReasoningEffort?: "none" | "minimal" | "low" | "medium" | "high" | "xhigh" | undefined;
 };
 export {};

package/dist/llm-cost.js CHANGED Viewed

@@ -7,6 +7,10 @@ const MODEL_PRICING = {
         inputCostPerMillion: 1.25,
         outputCostPerMillion: 10.0,
     },
+    'gemini-3-pro-preview': {
+        inputCostPerMillion: 2.0,
+        outputCostPerMillion: 12.0,
+    },
     'deepseek-reasoner': {
         inputCostPerMillion: 0.55,
         outputCostPerMillion: 2.19,

package/dist/llm.js CHANGED Viewed

@@ -124,7 +124,7 @@ const geminiCliConfig = {
 const codexCliConfig = {
     cliName: 'codex',
     buildArgs: (model, fullPrompt) => {
-        const args = ['exec', '-m', model];
+        const args = ['exec', '--skip-git-repo-check', '-m', model];
         if (config.codexReasoningEffort) {
             args.push('-c', `model_reasoning_effort="${config.codexReasoningEffort}"`);
         }

package/dist/llm.test.js CHANGED Viewed

@@ -105,11 +105,12 @@ describe('CLI executor', () => {
         expect(args?.[0]).toBe('codex');
         const cliArgs = args?.[1];
         expect(cliArgs[0]).toBe('exec');
-        expect(cliArgs[1]).toBe('-m');
-        expect(cliArgs[2]).toBe('gpt-5.1');
-        expect(cliArgs[3]).toContain('system');
-        expect(cliArgs[3]).toContain('user');
-        expect(cliArgs[3]).toContain('Files: @');
+        expect(cliArgs[1]).toBe('--skip-git-repo-check');
+        expect(cliArgs[2]).toBe('-m');
+        expect(cliArgs[3]).toBe('gpt-5.1');
+        expect(cliArgs[4]).toContain('system');
+        expect(cliArgs[4]).toContain('user');
+        expect(cliArgs[4]).toContain('Files: @');
         const result = await promise;
         expect(result.response).toBe('result');
         expect(result.usage).toBeNull();

package/dist/schema.d.ts CHANGED Viewed

@@ -2,6 +2,7 @@ import { z } from 'zod/v4';
 export declare const SupportedChatModel: z.ZodEnum<{
     o3: "o3";
     "gemini-2.5-pro": "gemini-2.5-pro";
+    "gemini-3-pro-preview": "gemini-3-pro-preview";
     "deepseek-reasoner": "deepseek-reasoner";
     "gpt-5.1-codex-max": "gpt-5.1-codex-max";
     "gpt-5.1-codex": "gpt-5.1-codex";
@@ -15,6 +16,7 @@ export declare const ConsultLlmArgs: z.ZodObject<{
     model: z.ZodDefault<z.ZodOptional<z.ZodEnum<{
         o3: "o3";
         "gemini-2.5-pro": "gemini-2.5-pro";
+        "gemini-3-pro-preview": "gemini-3-pro-preview";
         "deepseek-reasoner": "deepseek-reasoner";
         "gpt-5.1-codex-max": "gpt-5.1-codex-max";
         "gpt-5.1-codex": "gpt-5.1-codex";

package/dist/schema.js CHANGED Viewed

@@ -2,6 +2,7 @@ import { z } from 'zod/v4';
 export const SupportedChatModel = z.enum([
     'o3',
     'gemini-2.5-pro',
+    'gemini-3-pro-preview',
     'deepseek-reasoner',
     'gpt-5.1-codex-max',
     'gpt-5.1-codex',

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "consult-llm-mcp",
-  "version": "1.4.2",
+  "version": "1.4.4",
   "description": "MCP server for consulting powerful AI models",
   "type": "module",
   "main": "dist/main.js",