copilot-custom-endpoint 1.4.0 → 1.4.1

package/README.md CHANGED
@@ -88,21 +88,22 @@ npx copilot-custom-endpoint clean # Remove debug_log/
 
  ## Pricing snapshot
 
- All prices are **USD per 1M tokens** (cache miss). 1 AI credit = $0.01. **MiniMax M3** figures reflect a permanent 50% off list price — see the model doc for the full rate card.
-
- | Model | Input | Output | Context |
- | ---------------------------- | ----- | ------ | ------- |
- | **MiMo V2 Flash** 🏆 | $0.10 | $0.30 | 256K |
- | **DeepSeek V4 Flash** 🏆 | $0.14 | $0.28 | 1M |
- | **Kimi K2.6** (non-thinking) | $0.16 | $0.95 | 262K |
- | **Kimi K2.7 Code** | $0.19 | $4.00 | 262K |
- | **MiniMax M3** | $0.30 | $1.20 | 1M |
- | **MiMo V2.5** | $0.40 | $2.00 | 1M |
- | **Qwen 3.7 Plus** | $0.40 | $1.60 | 1M |
- | **MiMo V2.5 Pro** | $1.00 | $3.00 | 1M |
- | **GLM 5V Turbo** | $1.20 | $4.00 | 200K |
- | **GLM 5.1** | $1.40 | $4.40 | 200K |
- | **Qwen 3.7 Max** | $2.50 | $7.50 | 1M |
+ All prices are **USD per 1M tokens** (cache miss). 1 AI credit = $0.01. **MiniMax M3** figures reflect a permanent 50% off list price — see the model doc for the full rate card. Context window ¹ covers input + output combined.
+
+ | Model | Input | Output | Context ¹ |
+ | ---------------------------- | ----- | ------ | --------- |
+ | **MiMo V2 Flash** 🏆 | $0.10 | $0.30 | 256K |
+ | **DeepSeek V4 Flash** 🏆 | $0.14 | $0.28 | 1M |
+ | **Kimi K2.6** (non-thinking) | $0.16 | $0.95 | 262K |
+ | **Kimi K2.6** (thinking) | $0.16 | $4.00 | 262K |
+ | **Kimi K2.7 Code** | $0.19 | $4.00 | 262K |
+ | **MiniMax M3** | $0.30 | $1.20 | 1M |
+ | **MiMo V2.5** | $0.40 | $2.00 | 1M |
+ | **Qwen 3.7 Plus** | $0.40 | $1.60 | 1M |
+ | **MiMo V2.5 Pro** | $1.00 | $3.00 | 1M |
+ | **GLM 5V Turbo** | $1.20 | $4.00 | 200K |
+ | **GLM 5.1** | $1.40 | $4.40 | 200K |
+ | **Qwen 3.7 Max** | $2.50 | $7.50 | 1M |
 
  For the full pricing comparison (cached rates, full Copilot roster, footnotes, sources) see [docs/pricing.md](docs/pricing.md). For a copy-paste config containing **all providers at once**, see [docs/example-config.md](docs/example-config.md).
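
Reviewer note on the pricing hunk: the new **Kimi K2.6** (thinking) row keeps the $0.16 input price but lists output at $4.00 instead of $0.95, so any per-request difference comes entirely from output tokens. The sketch below works through that arithmetic. The two prices and the credit rate are copied from the table; `PRICES`, `estimateCostUsd`, and `usdToCredits` are hypothetical names used for illustration and are not part of the package.

```typescript
// Prices copied from the table above: USD per 1M tokens (cache miss).
interface Price {
  input: number;
  output: number;
}

// Hypothetical lookup table; only the two Kimi K2.6 rows are included.
const PRICES: Record<string, Price> = {
  "Kimi K2.6 (non-thinking)": { input: 0.16, output: 0.95 },
  "Kimi K2.6 (thinking)": { input: 0.16, output: 4.0 },
};

// Hypothetical helper: estimated cost in USD for a single request.
function estimateCostUsd(model: string, inputTokens: number, outputTokens: number): number {
  const price = PRICES[model];
  if (price === undefined) {
    throw new Error(`unknown model: ${model}`);
  }
  return (inputTokens * price.input + outputTokens * price.output) / 1e6;
}

// The README states 1 AI credit = $0.01.
function usdToCredits(usd: number): number {
  return usd / 0.01;
}

const plainUsd = estimateCostUsd("Kimi K2.6 (non-thinking)", 100000, 10000);
const thinkingUsd = estimateCostUsd("Kimi K2.6 (thinking)", 100000, 10000);
console.log(plainUsd.toFixed(4), thinkingUsd.toFixed(4));
console.log(usdToCredits(plainUsd).toFixed(2), usdToCredits(thinkingUsd).toFixed(2));
```

For 100,000 input tokens and 10,000 output tokens this works out to roughly $0.0255 without thinking and $0.056 with it, or about 2.55 versus 5.6 credits.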
 
@@ -159,11 +159,11 @@ All can be set in a `.env` file at the repo root (both proxies `import 'dotenv/c
 
  ### Thinking mode
 
- | Model | Turn type | Behavior |
- | ----------- | ------------ | ----------------------------------------------------------- |
- | K2.5 / K2.6 | Plain chat | Thinking enabled, `temperature: 1`, `top_p: 0.95` |
+ | Model | Turn type | Behavior |
+ | ----------- | ------------ | -------------------------------------------------------------------------- |
+ | K2.5 / K2.6 | Plain chat | Thinking enabled, `temperature: 1`, `top_p: 0.95` |
  | K2.5 / K2.6 | Tool-enabled | `thinking: { type: "disabled" }` forced, `temperature: 0.6`, `top_p: 0.95` |
- | K2.7 Code | All turns | Always-thinking, `temperature: 1`, `top_p: 0.95` |
+ | K2.7 Code | All turns | Always-thinking, `temperature: 1`, `top_p: 0.95` |
 
  ### Capabilities
 
@@ -4,15 +4,15 @@
 
  ## At a Glance
 
- | Field | Value |
- | ---------------------- | ------------------------------------------------ |
- | Mode | **Direct** (no proxy) |
- | Vision | ✅ Yes (`mimo-v2.5` only) |
- | Tool calling | ✅ Yes (with `thinking: disabled`) |
- | Context | 1M (V2.5 Pro / V2.5) / 256K (V2 Flash) |
+ | Field | Value |
+ | ---------------------- | --------------------------------------------------- |
+ | Mode | **Direct** (no proxy) |
+ | Vision | ✅ Yes (`mimo-v2.5` only) |
+ | Tool calling | ✅ Yes (with `thinking: disabled`) |
+ | Context | 1M (V2.5 Pro / V2.5) / 256K (V2 Flash) |
  | Max output | 131072 (V2.5 Pro) / 32768 (V2.5) / 65536 (V2 Flash) |
- | Required `requestBody` | `thinking: { type: "disabled" }` |
- | Endpoint | `https://api.xiaomimimo.com/v1/chat/completions` |
+ | Required `requestBody` | `thinking: { type: "disabled" }` |
+ | Endpoint | `https://api.xiaomimimo.com/v1/chat/completions` |
 
  ### Models at a glance
 
@@ -4,8 +4,8 @@
 
  ## At a Glance
 
- | Field | Value |
- | ------------------------------- | ------------------------------------------------------------------------- |
+ | Field | Value |
+ | ------------------------------- | -------------------------------------------------------------------------------- |
  | Mode | **Proxy** (local on `:3458`) **or** **Direct** (static `enable_thinking: false`) |
  | Vision | ✅ Yes (`qwen3.7-plus`) |
  | Tool calling | ✅ Yes |
package/package.json CHANGED
@@ -1,6 +1,6 @@
  {
  "name": "copilot-custom-endpoint",
- "version": "1.4.0",
+ "version": "1.4.1",
  "description": "Local proxies for VS Code Copilot custom endpoints — Kimi K2 & Qwen 3.x",
  "license": "MIT",
  "type": "module",
@@ -55,4 +55,4 @@
  "dependencies": {
  "dotenv": "^17.4.2"
  }
- }
+ }
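
Reviewer note on the README's Thinking mode table, whose rows this release only re-pads: the three rows reduce to a small decision rule, sketched below in TypeScript. The sampling values are copied from the table. `Family`, `Sampling`, and `samplingFor` are hypothetical names and not the proxies' actual code. The diff does not show which request field turns thinking on, so the sketch omits `thinking` in those cases; treat that as an assumption.

```typescript
// Hypothetical labels for the model rows in the Thinking mode table.
type Family = "k2.5" | "k2.6" | "k2.7-code";

interface Sampling {
  temperature: number;
  top_p: number;
  // Present only when the table says thinking is forced off.
  thinking?: { type: "disabled" };
}

// Hypothetical helper mirroring the table rows.
function samplingFor(family: Family, hasTools: boolean): Sampling {
  // K2.5 / K2.6, tool-enabled turn: thinking forced off, lower temperature.
  if (family !== "k2.7-code" && hasTools) {
    return { thinking: { type: "disabled" }, temperature: 0.6, top_p: 0.95 };
  }
  // K2.5 / K2.6 plain chat, and K2.7 Code on all turns: thinking stays on.
  return { temperature: 1, top_p: 0.95 };
}

console.log(samplingFor("k2.6", true));
console.log(samplingFor("k2.7-code", true));
```

Only the tool-enabled K2.5 / K2.6 case differs from the others, which is why a single branch is enough here.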