npm - copilot-custom-endpoint - Versions diffs - 1.4.0 → 1.4.2 - Mend

copilot-custom-endpoint 1.4.0 → 1.4.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/README.md CHANGED Viewed

@@ -88,21 +88,23 @@ npx copilot-custom-endpoint clean    # Remove debug_log/
 ## Pricing snapshot
-All prices are **USD per 1M tokens** (cache miss). 1 AI credit = $0.01. **MiniMax M3** figures reflect a permanent 50% off list price — see the model doc for the full rate card.
-| Model                        | Input | Output | Context |
-| ---------------------------- | ----- | ------ | ------- |
-| **MiMo V2 Flash** 🏆         | $0.10 | $0.30  | 256K    |
-| **DeepSeek V4 Flash** 🏆     | $0.14 | $0.28  | 1M      |
-| **Kimi K2.6** (non-thinking) | $0.16 | $0.95  | 262K    |
-| **Kimi K2.7 Code**           | $0.19 | $4.00  | 262K    |
-| **MiniMax M3**               | $0.30 | $1.20  | 1M      |
-| **MiMo V2.5**                | $0.40 | $2.00  | 1M      |
-| **Qwen 3.7 Plus**            | $0.40 | $1.60  | 1M      |
-| **MiMo V2.5 Pro**            | $1.00 | $3.00  | 1M      |
-| **GLM 5V Turbo**             | $1.20 | $4.00  | 200K    |
-| **GLM 5.1**                  | $1.40 | $4.40  | 200K    |
-| **Qwen 3.7 Max**             | $2.50 | $7.50  | 1M      |
+All prices are **USD per 1M tokens** (cache miss). 1 AI credit = $0.01. **MiniMax M3** figures reflect a permanent 50% off list price — see the model doc for the full rate card. Context window ¹ covers input + output combined.
+| Model                        | Input | Output | Context ¹ |
+| ---------------------------- | ----- | ------ | --------- |
+| **MiMo V2 Flash** 🏆         | $0.10 | $0.30  | 256K      |
+| **DeepSeek V4 Flash** 🏆     | $0.14 | $0.28  | 1M        |
+| **Kimi K2.6** (non-thinking) | $0.16 | $0.95  | 262K      |
+| **Kimi K2.6** (thinking)     | $0.16 | $4.00  | 262K      |
+| **Kimi K2.7 Code**           | $0.19 | $4.00  | 262K      |
+| **MiniMax M3**               | $0.30 | $1.20  | 1M        |
+| **MiMo V2.5**                | $0.40 | $2.00  | 1M        |
+| **Qwen 3.7 Plus**            | $0.40 | $1.60  | 1M        |
+| **MAI-Code-1-Flash**         | $0.75 | $4.50  | —         |
+| **MiMo V2.5 Pro**            | $1.00 | $3.00  | 1M        |
+| **GLM 5V Turbo**             | $1.20 | $4.00  | 200K      |
+| **GLM 5.1**                  | $1.40 | $4.40  | 200K      |
+| **Qwen 3.7 Max**             | $2.50 | $7.50  | 1M        |
 For the full pricing comparison (cached rates, full Copilot roster, footnotes, sources) see [docs/pricing.md](docs/pricing.md). For a copy-paste config containing **all providers at once**, see [docs/example-config.md](docs/example-config.md).

package/docs/models/kimi.md CHANGED Viewed

@@ -159,11 +159,11 @@ All can be set in a `.env` file at the repo root (both proxies `import 'dotenv/c
 ### Thinking mode
-| Model       | Turn type    | Behavior                                                    |
-| ----------- | ------------ | ----------------------------------------------------------- |
-| K2.5 / K2.6 | Plain chat   | Thinking enabled, `temperature: 1`, `top_p: 0.95`           |
+| Model       | Turn type    | Behavior                                                                   |
+| ----------- | ------------ | -------------------------------------------------------------------------- |
+| K2.5 / K2.6 | Plain chat   | Thinking enabled, `temperature: 1`, `top_p: 0.95`                          |
 | K2.5 / K2.6 | Tool-enabled | `thinking: { type: "disabled" }` forced, `temperature: 0.6`, `top_p: 0.95` |
-| K2.7 Code   | All turns    | Always-thinking, `temperature: 1`, `top_p: 0.95`            |
+| K2.7 Code   | All turns    | Always-thinking, `temperature: 1`, `top_p: 0.95`                           |
 ### Capabilities

package/docs/models/mimo.md CHANGED Viewed

@@ -4,15 +4,15 @@
 ## At a Glance
-| Field                  | Value                                            |
-| ---------------------- | ------------------------------------------------ |
-| Mode                   | **Direct** (no proxy)                            |
-| Vision                 | ✅ Yes (`mimo-v2.5` only)                        |
-| Tool calling           | ✅ Yes (with `thinking: disabled`)               |
-| Context                | 1M (V2.5 Pro / V2.5) / 256K (V2 Flash)           |
+| Field                  | Value                                               |
+| ---------------------- | --------------------------------------------------- |
+| Mode                   | **Direct** (no proxy)                               |
+| Vision                 | ✅ Yes (`mimo-v2.5` only)                           |
+| Tool calling           | ✅ Yes (with `thinking: disabled`)                  |
+| Context                | 1M (V2.5 Pro / V2.5) / 256K (V2 Flash)              |
 | Max output             | 131072 (V2.5 Pro) / 32768 (V2.5) / 65536 (V2 Flash) |
-| Required `requestBody` | `thinking: { type: "disabled" }`                 |
-| Endpoint               | `https://api.xiaomimimo.com/v1/chat/completions` |
+| Required `requestBody` | `thinking: { type: "disabled" }`                    |
+| Endpoint               | `https://api.xiaomimimo.com/v1/chat/completions`    |
 ### Models at a glance

package/docs/models/qwen.md CHANGED Viewed

@@ -4,8 +4,8 @@
 ## At a Glance
-| Field                           | Value                                                                     |
-| ------------------------------- | ------------------------------------------------------------------------- |
+| Field                           | Value                                                                            |
+| ------------------------------- | -------------------------------------------------------------------------------- |
 | Mode                            | **Proxy** (local on `:3458`) **or** **Direct** (static `enable_thinking: false`) |
 | Vision                          | ✅ Yes (`qwen3.7-plus`)                                                          |
 | Tool calling                    | ✅ Yes                                                                           |

package/docs/pricing.md CHANGED Viewed

@@ -22,23 +22,25 @@ All prices below are in **USD per 1M tokens** (non-cached). To convert to AI cre
 These are the models available through GitHub Copilot's model roster as of June 1, 2026.
-| Model                 | Provider  | Tier        | Input (per 1M) | Cached input | Output (per 1M) | Context window |
-| --------------------- | --------- | ----------- | -------------- | ------------ | --------------- | -------------- |
-| **Raptor mini**       | GitHub    | Versatile   | $0.25          | $0.025       | $2.00           | 264K           |
-| **Gemini 3 Flash**    | Google    | Lightweight | $0.50          | $0.05        | $3.00           | 173K           |
-| **GPT-5.4 mini**      | OpenAI    | Lightweight | $0.75          | $0.075       | $4.50           | 400K           |
-| **Claude Haiku 4.5**  | Anthropic | Versatile   | $1.00          | $0.10        | $5.00           | 160K           |
-| **Gemini 2.5 Pro**    | Google    | Powerful    | $1.25¹         | $0.125       | $10.00¹         | 173K           |
-| **Gemini 3.5 Flash**  | Google    | Lightweight | $1.50          | $0.15        | $9.00           | 1M             |
-| **GPT-5.3-Codex**     | OpenAI    | Powerful    | $1.75          | $0.175       | $14.00          | 400K           |
-| **Gemini 3.1 Pro**    | Google    | Powerful    | $2.00¹         | $0.20        | $12.00¹         | 1M             |
-| **GPT-5.4**           | OpenAI    | Versatile   | $2.50          | $0.25        | $15.00          | 1M             |
-| **Claude Sonnet 4.6** | Anthropic | Versatile   | $3.00          | $0.30        | $15.00          | 1M             |
-| **Claude Opus 4.8**   | Anthropic | Powerful    | $5.00          | $0.50        | $25.00          | 1M             |
-| **Claude Opus 4.7**   | Anthropic | Powerful    | $5.00          | $0.50        | $25.00          | 1M             |
-| **GPT-5.5**           | OpenAI    | Powerful    | $5.00          | $0.50        | $30.00          | 1M             |
+| Model                  | Provider  | Tier        | Input (per 1M) | Cached input | Output (per 1M) | Context window |
+| ---------------------- | --------- | ----------- | -------------- | ------------ | --------------- | -------------- |
+| **Raptor mini**        | GitHub    | Versatile   | $0.25          | $0.025       | $2.00           | 264K           |
+| **Gemini 3 Flash**     | Google    | Lightweight | $0.50          | $0.05        | $3.00           | 173K           |
+| **GPT-5.4 mini**       | OpenAI    | Lightweight | $0.75          | $0.075       | $4.50           | 400K           |
+| **MAI-Code-1-Flash** ² | Microsoft | Lightweight | $0.75          | $0.075       | $4.50           | —              |
+| **Claude Haiku 4.5**   | Anthropic | Versatile   | $1.00          | $0.10        | $5.00           | 160K           |
+| **Gemini 2.5 Pro**     | Google    | Powerful    | $1.25¹         | $0.125       | $10.00¹         | 173K           |
+| **Gemini 3.5 Flash**   | Google    | Lightweight | $1.50          | $0.15        | $9.00           | 1M             |
+| **GPT-5.3-Codex**      | OpenAI    | Powerful    | $1.75          | $0.175       | $14.00          | 400K           |
+| **Gemini 3.1 Pro**     | Google    | Powerful    | $2.00¹         | $0.20        | $12.00¹         | 1M             |
+| **GPT-5.4**            | OpenAI    | Versatile   | $2.50          | $0.25        | $15.00          | 1M             |
+| **Claude Sonnet 4.6**  | Anthropic | Versatile   | $3.00          | $0.30        | $15.00          | 1M             |
+| **Claude Opus 4.8**    | Anthropic | Powerful    | $5.00          | $0.50        | $25.00          | 1M             |
+| **Claude Opus 4.7**    | Anthropic | Powerful    | $5.00          | $0.50        | $25.00          | 1M             |
+| **GPT-5.5**            | OpenAI    | Powerful    | $5.00          | $0.50        | $30.00          | 1M             |
 ¹ Gemini 3.1 Pro and 2.5 Pro pricing applies to prompts ≤200K tokens.
+² MAI-Code-1-Flash is a continuously improving model — performance and behavior may evolve over time as new checkpoints are released.
 ## Custom-endpoint alternatives
@@ -59,6 +61,7 @@ These are the models available through GitHub Copilot's model roster as of June
 > **Notes:**
 >
+> - **MAI-Code-1-Flash** is a continuously improving model — performance and behavior may evolve over time as new checkpoints are released.
 > - **DeepSeek V4** input pricing shown is the **cache miss** price. Cache hits are significantly cheaper ($0.0028/M for Flash, $0.003625/M for Pro).
 > - **MiMo** input pricing shown is the **cache miss** price. Cache hits are 5× cheaper for V2.5 Pro ($0.20/M) and V2.5 ($0.08/M), and 10× cheaper for V2 Flash ($0.01/M).
 > - **Gemini 3 Flash** is priced at $0.50/MTok input (text/image/video) and $1.00/MTok input for audio.
@@ -90,6 +93,7 @@ For a typical coding session (~10K input + ~2K output tokens per turn, 50 turns)
 | Kimi K2.7 Code           | ~$0.50                 |
 | Gemini 3 Flash           | ~$0.55                 |
 | MiMo V2.5 Pro            | ~$0.80                 |
+| MAI-Code-1-Flash         | ~$0.83                 |
 | GPT-5.4 mini             | ~$0.83                 |
 | Claude Haiku 4.5         | ~$1.00                 |
 | Qwen 3.7 Max             | ~$1.33                 |
@@ -104,9 +108,10 @@ For a typical coding session (~10K input + ~2K output tokens per turn, 50 turns)
 > **How long does 7,000 credits last?** A Pro+ subscriber running 50-turn sessions could afford roughly **13 GPT-5.5 sessions**, **23 Opus sessions**, or **212 Raptor mini sessions** per month — or mix and match. (Multiply session cost by 100 to convert to AI credits.)
-> Prices last verified: June 9, 2026. Always check the official pages for the latest rates:
+> Prices last verified: June 14, 2026. Always check the official pages for the latest rates:
 >
 > - [GitHub Copilot models & pricing](https://docs.github.com/en/copilot/reference/copilot-billing/models-and-pricing)
+> - [Microsoft MAI-Code-1-Flash model card](https://docs.github.com/en/copilot/reference/ai-models/model-comparison#task-general-purpose-coding-and-writing)
 > - [OpenAI pricing](https://openai.com/api/pricing/)
 > - [Anthropic (Claude) pricing](https://platform.claude.com/docs/en/about-claude/pricing)
 > - [Google Gemini pricing](https://ai.google.dev/pricing)

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "copilot-custom-endpoint",
-  "version": "1.4.0",
+  "version": "1.4.2",
   "description": "Local proxies for VS Code Copilot custom endpoints — Kimi K2 & Qwen 3.x",
   "license": "MIT",
   "type": "module",
@@ -55,4 +55,4 @@
   "dependencies": {
     "dotenv": "^17.4.2"
   }
-}
+}