npm - copilot-custom-endpoint - Versions diffs - 1.1.1 → 1.2.1 - Mend

copilot-custom-endpoint 1.1.1 → 1.2.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2) hide show

package/README.md +164 -19
package/package.json +2 -2

package/README.md CHANGED Viewed

@@ -29,6 +29,7 @@ This repo is for those situations: validated, copy-paste-ready configs when Open
 | **Xiaomi MiMo**               | `mimo-v2.5`     | No                                 | ✅         | ✅        | ✅           | ✅⁴    |
 | **Xiaomi MiMo**               | `mimo-v2.5-pro` | No                                 | ✅         | ✅        | ✅           | ❌     |
 | **Xiaomi MiMo**               | `mimo-v2-flash` | No                                 | ✅         | ✅        | ✅           | ❌     |
+| **MiniMax**                   | `MiniMax-M3`    | No                                 | ✅         | ✅        | ✅           | ✅     |
 ¹ Proxy is optional: direct path works with static `enable_thinking: false`. Proxy adds dynamic thinking suppression (thinking ON in plain chat, OFF in tool loops).
 ² With proxy: reasoning visible in plain chat. Without proxy: always suppressed.
@@ -37,26 +38,37 @@ This repo is for those situations: validated, copy-paste-ready configs when Open
 Pick the model you want and follow the corresponding section below.
-### Config file location
+### Config setup: two-step workflow
-The Kimi and Qwen setups require editing the same VS Code config file:
+VS Code separates **model configuration** from **API key storage** for security. You set up each provider in two steps:
-| OS      | Path                                                              |
-| ------- | ----------------------------------------------------------------- |
-| Windows | `%APPDATA%\Code\User\chatLanguageModels.json`                     |
-| macOS   | `~/Library/Application Support/Code/User/chatLanguageModels.json` |
-| Linux   | `~/.config/Code/User/chatLanguageModels.json`                     |
+1. **Create/update `chatLanguageModels.json`** — this file defines the models, URLs, and settings. API keys are **not** stored here (leave `apiKey` out entirely, or use an empty string).
+   | OS      | Path                                                              |
+   | ------- | ----------------------------------------------------------------- |
+   | Windows | `%APPDATA%\Code\User\chatLanguageModels.json`                     |
+   | macOS   | `~/Library/Application Support/Code/User/chatLanguageModels.json` |
+   | Linux   | `~/.config/Code/User/chatLanguageModels.json`                     |
+2. **Set each API key through the Language Models UI:**
+   - Open the Command Palette (`Ctrl+Shift+P`).
+   - Run **Chat: Manage Language Models**.
+   - Find your provider group in the list.
+   - Right-click the group name → **Update API Key**.
+   - Paste your key. It is stored securely (not in the JSON file).
+> **Why this way?** The JSON config file is often tracked in dotfile repos or shared across machines. API keys don't belong there. The VS Code UI stores them in your OS keychain instead.
 ### Full example config
-Here's a complete, real-world example of `chatLanguageModels.json` combining all the providers documented in this repo.
+Here's a complete, real-world example of `chatLanguageModels.json` combining all the providers documented in this repo. Note the `apiKey` fields are left as empty strings — you'll set them via the Language Models UI instead. After you set a key via the UI, VS Code replaces the empty string with a `${input:chat.lm.secret.<id>}` secret reference.
 ```json
 [
   {
     "name": "Qwen",
     "vendor": "customendpoint",
-    "apiKey": "<your-dashscope-key>",
+    "apiKey": "",
     "apiType": "chat-completions",
     "models": [
       {
@@ -86,7 +98,7 @@ Here's a complete, real-world example of `chatLanguageModels.json` combining all
   {
     "name": "Kimi",
     "vendor": "customendpoint",
-    "apiKey": "<your-moonshot-key>",
+    "apiKey": "",
     "apiType": "chat-completions",
     "models": [
       {
@@ -107,7 +119,7 @@ Here's a complete, real-world example of `chatLanguageModels.json` combining all
   {
     "name": "MiMo",
     "vendor": "customendpoint",
-    "apiKey": "<your-mimo-api-key>",
+    "apiKey": "",
     "apiType": "chat-completions",
     "models": [
       {
@@ -156,6 +168,30 @@ Here's a complete, real-world example of `chatLanguageModels.json` combining all
         }
       }
     ]
+  },
+  {
+    "name": "MiniMax",
+    "vendor": "customendpoint",
+    "apiKey": "",
+    "apiType": "chat-completions",
+    "models": [
+      {
+        "id": "MiniMax-M3",
+        "name": "MiniMax M3",
+        "url": "https://api.minimax.io/v1/chat/completions",
+        "toolCalling": true,
+        "vision": true,
+        "streaming": true,
+        "maxInputTokens": 1048576,
+        "maxOutputTokens": 131072,
+        "requestBody": {
+          "thinking": { "type": "adaptive" },
+          "reasoning_split": true,
+          "temperature": 1,
+          "top_p": 0.95
+        }
+      }
+    ]
   }
 ]
 ```
@@ -208,6 +244,10 @@ You should see:
 ```
 [kimi-proxy] listening on http://127.0.0.1:3457/v1/chat/completions
+[kimi-proxy] forwarding to https://api.moonshot.ai/v1/chat/completions
+[kimi-proxy] forcing temperature=1, non-thinking temperature=0.6, and top_p=0.95
+[kimi-proxy] disable thinking with tools=true
+[kimi-proxy] writing redacted request summaries to debug_log/kimi-proxy.ndjson
 ```
 Check it's alive:
@@ -232,13 +272,13 @@ Expected response:
 #### 3. Register the model in VS Code
-Open (or create) your user config file (see [Config file location](#config-file-location) above) and paste this entry (replace `<your-moonshot-key>`):
+First, open (or create) your user config file (see [Config file location](#config-file-location) above) and paste this entry (leave `apiKey` as empty string — you'll set it via the UI):
 ```json
 {
   "name": "Kimi",
   "vendor": "customendpoint",
-  "apiKey": "<your-moonshot-key>",
+  "apiKey": "",
   "apiType": "chat-completions",
   "models": [
     {
@@ -260,6 +300,13 @@ Open (or create) your user config file (see [Config file location](#config-file-
 > **Note:** The `requestBody.temperature` here is a hint to VS Code, but the proxy will enforce the exact values Kimi requires regardless.
+Then set your Moonshot API key via the Language Models UI:
+- Open the Command Palette (`Ctrl+Shift+P`).
+- Run **Chat: Manage Language Models**.
+- Find the **Kimi** group, right-click it → **Update API Key**.
+- Paste your Moonshot API key.
 #### 4. Chat!
 - Open the Copilot chat panel (`Ctrl+Alt+I` / `Cmd+Ctrl+I`).
@@ -298,13 +345,13 @@ Create an API key [here](https://modelstudio.console.alibabacloud.com/ap-southea
 #### 2. Register the models in VS Code
-Open (or create) your user config file (see [Config file location](#config-file-location) above) and paste this entry (replace `<your-dashscope-key>`):
+First, open (or create) your user config file (see [Config file location](#config-file-location) above) and paste this entry (leave `apiKey` as empty string — you'll set it via the UI):
 ```json
 {
   "name": "Qwen",
   "vendor": "customendpoint",
-  "apiKey": "<your-dashscope-key>",
+  "apiKey": "",
   "apiType": "chat-completions",
   "models": [
     {
@@ -333,6 +380,13 @@ Open (or create) your user config file (see [Config file location](#config-file-
 }
 ```
+Then set your DashScope API key via the Language Models UI:
+- Open the Command Palette (`Ctrl+Shift+P`).
+- Run **Chat: Manage Language Models**.
+- Find the **Qwen** group, right-click it → **Update API Key**.
+- Paste your DashScope API key.
 > **Trade-off:** `enable_thinking: false` suppresses reasoning in all requests (both plain chat and tool loops). Tool loops stay stable, but you never see the model's thought process. The [optional proxy](#optional-local-proxy-for-dynamic-thinking) below avoids this trade-off.
 #### 3. Chat!
@@ -392,13 +446,13 @@ Expected response:
 }
 ```
-Then update your VS Code config to point URLs at the proxy and remove `requestBody` — the proxy handles thinking dynamically:
+Then update your VS Code config to point URLs at the proxy and remove `requestBody` — the proxy handles thinking dynamically (remember, `apiKey` stays empty — set it via the UI):
 ```json
 {
   "name": "Qwen",
   "vendor": "customendpoint",
-  "apiKey": "<your-dashscope-key>",
+  "apiKey": "",
   "apiType": "chat-completions",
   "models": [
     {
@@ -499,13 +553,13 @@ Sign up at [platform.xiaomimimo.com](https://platform.xiaomimimo.com) and create
 #### 2. Register the models in VS Code
-Open your user config file (see [Config file location](#config-file-location) above) and paste this entry (replace `<your-mimo-api-key>`):
+First, open your user config file (see [Config file location](#config-file-location) above) and paste this entry (leave `apiKey` as empty string — you'll set it via the UI):
 ```json
 {
   "name": "MiMo",
   "vendor": "customendpoint",
-  "apiKey": "<your-mimo-api-key>",
+  "apiKey": "",
   "apiType": "chat-completions",
   "models": [
     {
@@ -557,6 +611,13 @@ Open your user config file (see [Config file location](#config-file-location) ab
 }
 ```
+Then set your MiMo API key via the Language Models UI:
+- Open the Command Palette (`Ctrl+Shift+P`).
+- Run **Chat: Manage Language Models**.
+- Find the **MiMo** group, right-click it → **Update API Key**.
+- Paste your MiMo API key.
 > **Note:** `thinking: { "type": "disabled" }` is required for tool-calling stability. Without it, MiMo returns a 400 error when conversation history contains tool calls with missing `reasoning_content`.
 #### 3. Chat!
@@ -577,11 +638,91 @@ Open your user config file (see [Config file location](#config-file-location) ab
 ---
+<details>
+<summary>MiniMax M3 (MiniMax)</summary>
+### MiniMax M3 (MiniMax)
+MiniMax works **directly** with the OpenAI-compatible Chat Completions endpoint — no proxy needed. The recommended config enables MiniMax's native reasoning via `thinking: { "type": "adaptive" }` + `reasoning_split: true`.
+#### 1. Grab a MiniMax API key
+Create an API key at the [MiniMax Developer Platform](https://platform.minimax.io/user-center/basic-information/interface-key).
+> **Regional endpoints:** MiniMax offers endpoints for different regions. API keys are region-specific.
+>
+> - **International (default):** `https://api.minimax.io/v1/chat/completions`
+> - **China:** `https://api.minimaxi.com/v1/chat/completions`
+#### 2. Register the model in VS Code
+First, open (or create) your user config file (see [Config file location](#config-file-location) above) and paste this entry (leave `apiKey` as empty string — you'll set it via the UI):
+```json
+{
+  "name": "MiniMax",
+  "vendor": "customendpoint",
+  "apiKey": "",
+  "apiType": "chat-completions",
+  "models": [
+    {
+      "id": "MiniMax-M3",
+      "name": "MiniMax M3",
+      "url": "https://api.minimax.io/v1/chat/completions",
+      "toolCalling": true,
+      "vision": true,
+      "streaming": true,
+      "maxInputTokens": 1048576,
+      "maxOutputTokens": 131072,
+      "requestBody": {
+        "thinking": { "type": "adaptive" },
+        "reasoning_split": true,
+        "temperature": 1,
+        "top_p": 0.95
+      }
+    }
+  ]
+}
+```
+Then set your MiniMax API key via the Language Models UI:
+- Open the Command Palette (`Ctrl+Shift+P`).
+- Run **Chat: Manage Language Models**.
+- Find the **MiniMax** group, right-click it → **Update API Key**.
+- Paste your MiniMax API key.
+**Why this config?**
+- `thinking: { "type": "adaptive" }` — MiniMax's documented default. The model decides when to reason.
+- `reasoning_split: true` — the server returns reasoning in a structured `reasoning_details` field instead of mixing `<think>` tags into `content`. VS Code sees a clean OpenAI-format message.
+> **Note:** `thinking: { "type": "disabled" }` is **not** a hard override — Phase 1 testing confirmed MiniMax-M3 still reasons internally regardless of this setting, and emits `<think>` tags in `content` either way. Setting it to `disabled` only changes the response field layout, not actual model behavior. We recommend `adaptive` for clarity.
+#### 3. Chat!
+- Open the Copilot chat panel (`Ctrl+Alt+I` / `Cmd+Ctrl+I`).
+- Click the model picker and select **MiniMax M3**.
+- Ask something. Plain chat, streaming, tool use, and vision all work.
+#### Troubleshooting (MiniMax)
+| Symptom                              | Fix                                                                                                           |
+| ------------------------------------ | ------------------------------------------------------------------------------------------------------------- |
+| Model not appearing in picker        | Check your `chatLanguageModels.json` syntax. Reload the VS Code window.                                       |
+| 400 on tool calls                    | Confirm the model ID is `MiniMax-M3` (capital M's, lowercase i, hyphen). Check the API key region.            |
+| Responses show leaked `<think>` tags | Make sure `"reasoning_split": true` is set in `requestBody` so reasoning goes to `reasoning_details` instead. |
+</details>
+---
 For the full research notes, tested values, and known limitations, see:
 - [`docs/models/kimi-k2.6.md`](docs/models/kimi-k2.6.md)
 - [`docs/models/qwen.md`](docs/models/qwen.md)
 - [`docs/models/mimo.md`](docs/models/mimo.md)
+- [`docs/models/minimax.md`](docs/models/minimax.md)
 ## Pricing comparison
@@ -637,6 +778,7 @@ These are the models available through GitHub Copilot's model roster as of June
 | **MiMo V2.5 Pro**     | Xiaomi    | $1.00                         | $3.00                                   | 1M             |
 | **Qwen 3.6 Plus**     | DashScope | $0.50 (≤256K) / $2.00 (>256K) | $3.00 (≤256K) / $6.00 (>256K)           | 1M             |
 | **Qwen 3.7 Max**      | DashScope | $2.50 (≤1M)                   | $7.50 (≤1M)                             | 1M             |
+| **MiniMax M3**        | MiniMax   | $0.60 (≤512K) / $1.20 (>512K) | $2.40 (≤512K) / $4.80 (>512K)           | 1M             |
 > **Notes:**
 >
@@ -648,6 +790,7 @@ These are the models available through GitHub Copilot's model roster as of June
 > - **Qwen** models use **tiered pricing** — determined by total input tokens per request. Prices above are for non-thinking mode.
 > - **Kimi K2.6** pricing is from the **Moonshot platform** (direct). Via DashScope: $0.89 input / $3.71 output.
 > - **DashScope** offers a **free quota** of 1M input + 1M output tokens per model, valid for 90 days.
+> - **MiniMax M3** uses **tiered pricing** — input price doubles above 512K input tokens. A 7-day 50% off promotion is available for new accounts.
 > - **MiMo** offers a **Token Plan** subscription model with discounted rates and a free cache-writing promotion.
 > - For typical Copilot chat usage (short-to-medium prompts), you'll almost always fall in the lowest pricing tier.
@@ -662,6 +805,7 @@ These are the models available through GitHub Copilot's model roster as of June
 | Kimi K2.6 (thinking)     | ~$0.48                 | —                    |
 | Gemini 3 Flash           | ~$0.55                 | ~55                  |
 | Qwen 3.6 Plus            | ~$0.55                 | —                    |
+| MiniMax M3               | ~$0.54                 | —                    |
 | MiMo V2.5 Pro            | ~$0.80                 | —                    |
 | GPT-5.4 mini             | ~$0.83                 | ~83                  |
 | Claude Haiku 4.5         | ~$1.00                 | ~100                 |
@@ -687,6 +831,7 @@ These are the models available through GitHub Copilot's model roster as of June
 > - [DashScope pricing](https://www.alibabacloud.com/help/en/model-studio/billing-for-model-studio)
 > - [DeepSeek pricing](https://api-docs.deepseek.com/quick_start/pricing)
 > - [MiMo pricing](https://platform.xiaomimimo.com/docs/en-US/pricing)
+> - [MiniMax pricing](https://platform.minimax.io/docs/pricing/overview)
 ## Repo layout

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "copilot-custom-endpoint",
-  "version": "1.1.1",
+  "version": "1.2.1",
   "description": "Local proxies for VS Code Copilot custom endpoints — Kimi K2 & Qwen 3.x",
   "license": "MIT",
   "type": "module",
@@ -51,4 +51,4 @@
   "dependencies": {
     "dotenv": "^17.4.2"
   }
-}
+}