copilot-custom-endpoint 1.2.4 → 1.3.0

This diff shows the changes between two published versions of the package as they appear in the public registry. It is provided for informational purposes only.
Files changed (2)
  1. package/README.md +22 -2
  2. package/package.json +1 -1
package/README.md CHANGED
@@ -1,6 +1,6 @@
  # GitHub Copilot Custom Endpoints
 
- > **TL;DR** — GitHub Copilot switched to usage-based billing on **June 1, 2026**. Every chat and agent session now burns AI credits — fast. This repo shows you how to plug **cheaper non-GitHub models** (DeepSeek, Kimi, Qwen, MiMo, MiniMax) into VS Code's Copilot chat — often **5–55× cheaper** than the built-ins — while keeping agent mode, tools, streaming, and vision.
+ > **TL;DR** — GitHub Copilot switched to usage-based billing on **June 1, 2026**. Every chat and agent session now burns AI credits — fast. This repo shows you how to plug **cheaper non-GitHub models** (DeepSeek, Kimi, Qwen, MiMo, MiniMax, GLM) into VS Code's Copilot chat — often **5–55× cheaper** than the built-ins — while keeping agent mode, tools, streaming, and vision.
 
  ## What is this?
 
@@ -24,10 +24,13 @@ That's it. No code, no servers to manage (unless the model specifically needs th
  | **MiMo V2 Flash** | Xiaomi | No | ❌ | [Setup](docs/models/mimo.md) |
  | **MiMo V2.5** | Xiaomi | No | ✅ | [Setup](docs/models/mimo.md) |
  | **MiMo V2.5 Pro** | Xiaomi | No | ❌ | [Setup](docs/models/mimo.md) |
- | **Kimi K2.6** | Moonshot | **Yes** | ✅ | [Setup](docs/models/kimi-k2.6.md) |
+ | **Kimi K2.6** | Moonshot | **Yes** | ✅ | [Setup](docs/models/kimi.md) |
  | **Qwen 3.6 Plus** | DashScope | Optional | ✅ | [Setup](docs/models/qwen.md) |
  | **Qwen 3.7 Max** | DashScope | Optional | ❌ | [Setup](docs/models/qwen.md) |
  | **MiniMax M3** | MiniMax | No | ✅ | [Setup](docs/models/minimax.md) |
+ | **GLM 5.1** | Z.ai | No | ❌ | [Setup](docs/models/glm.md) |
+ | **GLM 4.7 Flash (free)** | Z.ai | No | ❌ | [Setup](docs/models/glm.md) |
+ | **GLM 5V Turbo** | Z.ai | No | ✅ | [Setup](docs/models/glm.md) |
  | **DeepSeek V4 Pro / Flash** | DeepSeek | No (uses an extension) | ✅ via proxy | [Marketplace](https://marketplace.visualstudio.com/items?itemName=Vizards.deepseek-v4-for-copilot) |
 
  ## Setup
@@ -97,10 +100,27 @@ All prices are **USD per 1M tokens** (cache miss). 1 AI credit = $0.01.
  | **Qwen 3.6 Plus** | $0.50 | $3.00 | 1M |
  | **MiniMax M3** | $0.60 | $2.40 | 1M |
  | **MiMo V2.5 Pro** | $1.00 | $3.00 | 1M |
+ | **GLM 5V Turbo** | $1.20 | $4.00 | 200K |
+ | **GLM 5.1** | $1.40 | $4.40 | 200K |
  | **Qwen 3.7 Max** | $2.50 | $7.50 | 1M |
 
  For the full pricing comparison (cached rates, full Copilot roster, footnotes, sources) see [docs/pricing.md](docs/pricing.md). For a copy-paste config containing **all providers at once**, see [docs/example-config.md](docs/example-config.md).
 
+ ## Companion tools
+
+ These work alongside the providers above and fill gaps that VS Code's built-in tool surface doesn't cover natively.
+
+ ### 🎬 [Video Context MCP](https://www.videocontextmcp.com/) — _video understanding for AI coding assistants_
+
+ VS Code's built-in `view_image` tool only accepts **static images** (PNG, JPG, GIF, WebP). That's a hard wall if you want to ask an AI assistant about a screen recording, a screencast, a product demo, or any other video. Several vision-capable models in this repo actually accept video natively — but VS Code's tool pipeline never gets the chance to forward it.
+
+ **Video Context MCP** is a small MCP server that bridges that gap. It works with **GitHub Copilot, Cursor, and Claude Code** out of the box, and:
+
+ - **Extracts frames** from local files or remote URLs (no `ffmpeg` gymnastics required).
+ - **Routes them through a multi-provider fallback chain** — `Gemini → GLM-4.6V → Qwen3.6 → Kimi K2.6 → MiMo-V2.5` — so a single `GLM 5V Turbo` rate-limit hiccup doesn't kill your session.
+ - **Answers natural-language questions** about the video grounded in actual frames: "what does the speaker click in the last 30 seconds?", "summarize the demo", "find the frame where the error appears".
+ - **Extras:** timestamp search, audio transcription with speaker diarization, and video metadata (resolution, duration, codec).
+
  ## Need help?
 
  - **Per-model issues:** check the troubleshooting section at the bottom of each model's doc.
package/package.json CHANGED
@@ -1,6 +1,6 @@
  {
  "name": "copilot-custom-endpoint",
- "version": "1.2.4",
+ "version": "1.3.0",
  "description": "Local proxies for VS Code Copilot custom endpoints — Kimi K2 & Qwen 3.x",
  "license": "MIT",
  "type": "module",