kimiflare 0.74.0 → 0.74.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -8,12 +8,12 @@
8
8
  <a href="https://github.com/sinameraji/kimiflare/blob/main/LICENSE"><img src="https://img.shields.io/github/license/sinameraji/kimiflare?style=flat-square&color=2ea44f" alt="license"></a>
9
9
  <img src="https://img.shields.io/badge/node-%3E%3D20-339933?style=flat-square&logo=nodedotjs&logoColor=white" alt="Node.js >= 20">
10
10
  <img src="https://img.shields.io/badge/typescript-5.7-3178c6?style=flat-square&logo=typescript&logoColor=white" alt="TypeScript">
11
- <a href="https://developers.cloudflare.com/ai-gateway/"><img src="https://img.shields.io/badge/powered%20by-Multiple%20Models-f59e0b?style=flat-square" alt="Powered by Multiple Models"></a>
11
+ <a href="https://developers.cloudflare.com/workers-ai/models/kimi-k2"><img src="https://img.shields.io/badge/powered%20by-Kimi%20K2.6-f59e0b?style=flat-square" alt="Powered by Kimi K2.6"></a>
12
12
  </p>
13
13
 
14
14
  <p align="center">
15
- <strong>A terminal coding agent powered by multiple models (Kimi-K2.6, Claude, GPT-5, Gemini, Llama, and more), routed through your own <a href="https://developers.cloudflare.com/ai-gateway/">Cloudflare AI Gateway</a>.</strong><br>
16
- Switch models anytime with <code>/model</code>. First-class observability, caching, and authoritative cost — all on your Cloudflare account.
15
+ <strong>A terminal coding agent powered by <a href="https://developers.cloudflare.com/workers-ai/models/kimi-k2">Kimi K2.6</a> on <a href="https://developers.cloudflare.com/workers-ai/">Cloudflare Workers AI</a> with optional routing through your own <a href="https://developers.cloudflare.com/ai-gateway/">AI Gateway</a> for first-class observability, caching, and authoritative cost.</strong><br>
16
+ All on your Cloudflare account.
17
17
  </p>
18
18
 
19
19
  <p align="center">
@@ -22,9 +22,9 @@
22
22
 
23
23
  ## How it works
24
24
 
25
- You bring your own Cloudflare **Account ID** + **API Token**. KimiFlare provisions (or reuses) an **AI Gateway** in your account and routes every model request through it. Nothing leaves your Cloudflare tenancy.
25
+ You bring your own Cloudflare **Account ID** + **API Token**. KimiFlare calls **Workers AI** directly by default — fastest path, fewest moving parts. You can optionally turn on routing through an **AI Gateway** in your account (provisioned or reused on first run) for observability, caching, and cost reporting. Either way, nothing leaves your Cloudflare tenancy.
26
26
 
27
- You get this for free:
27
+ With AI Gateway enabled you get this for free:
28
28
 
29
29
  - **Per-request logs** with full payload, latency, and status — visible in the Cloudflare dashboard
30
30
  - **Response caching** with configurable TTL (`/gateway cache-ttl <seconds>`)
@@ -91,32 +91,13 @@ Edit your token at: https://dash.cloudflare.com/profile/api-tokens
91
91
 
92
92
  Once configured, `/cost` shows the Gateway-confirmed totals, cache hit ratio, per-feature breakdown, and direct dashboard links to each request log. `/gateway status` shows the current TTL, skip-cache flag, metadata tags, and live cache-hit ratio.
93
93
 
94
- ### Model selection
94
+ ### Model
95
95
 
96
- KimiFlare supports **11 models** across multiple providers, all routed through Cloudflare AI Gateway:
96
+ KimiFlare runs on **Kimi K2.6** via Cloudflare Workers AI no API key needed beyond your Cloudflare token:
97
97
 
98
- **Cloudflare Workers AI** (default, no API key needed):
99
98
  - `@cf/moonshotai/kimi-k2.6` — 262k context, reasoning, tools
100
- - `@cf/meta/llama-3.3-70b-instruct-fp8-fast` — 24k context, tools
101
- - `@cf/meta/llama-4-scout-17b-16e-instruct` — 131k context, tools
102
99
 
103
- **Anthropic** (requires API key):
104
- - `anthropic/claude-opus-4-7` — 1M context, reasoning, tools
105
- - `anthropic/claude-sonnet-4-6` — 1M context, reasoning, tools
106
- - `anthropic/claude-haiku-4-5` — 200k context, tools
107
-
108
- **OpenAI** (requires API key):
109
- - `openai/gpt-5` — 400k context, reasoning, tools
110
- - `openai/gpt-5-mini` — 400k context, reasoning, tools
111
-
112
- **Google** (requires API key):
113
- - `google-ai-studio/gemini-2.5-pro` — 1M context, reasoning, tools
114
- - `google-ai-studio/gemini-2.5-flash` — 1M context, tools
115
-
116
- **Other OpenAI-compatible** (requires API key):
117
- - `groq/llama-3.3-70b-versatile` — 128k context, tools
118
-
119
- Switch models anytime in the TUI with `/model`, or set at startup with `--model <id>` or the config file.
100
+ `@cf/moonshotai/kimi-k2.5` is also available for older sessions.
120
101
 
121
102
  ### One-shot mode
122
103