npm - free-coding-models - Versions diffs - 0.1.67 → 0.1.68 - Mend

free-coding-models 0.1.67 → 0.1.68

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/README.md +81 -15
package/bin/free-coding-models.js +780 -53
package/lib/config.js +164 -3
package/lib/utils.js +172 -5
package/package.json +1 -1
package/sources.js +17 -0

package/README.md CHANGED Viewed

@@ -24,7 +24,7 @@
 <p align="center">
   <strong>Find the fastest coding LLM models in seconds</strong><br>
-  <sub>Ping free coding models from 17 providers in real-time — pick the best one for OpenCode, OpenClaw, or any AI coding assistant</sub>
+  <sub>Ping free coding models from 18 providers in real-time — pick the best one for OpenCode, OpenClaw, or any AI coding assistant</sub>
 </p>
 <p align="center">
@@ -49,7 +49,7 @@
 ## ✨ Features
 - **🎯 Coding-focused** — Only LLM models optimized for code generation, not chat or vision
-- **🌐 Multi-provider** — 134 models from NVIDIA NIM, Groq, Cerebras, SambaNova, OpenRouter, Hugging Face Inference, Replicate, DeepInfra, Fireworks AI, Codestral, Hyperbolic, Scaleway, Google AI, SiliconFlow, Together AI, Cloudflare Workers AI, and Perplexity API
+- **🌐 Multi-provider** — Models from NVIDIA NIM, Groq, Cerebras, SambaNova, OpenRouter, Hugging Face Inference, Replicate, DeepInfra, Fireworks AI, Codestral, Hyperbolic, Scaleway, Google AI, SiliconFlow, Together AI, Cloudflare Workers AI, Perplexity API, and ZAI
 - **⚙️ Settings screen** — Press `P` to manage provider API keys, enable/disable providers, test keys live, and manually check/install updates
 - **🚀 Parallel pings** — All models tested simultaneously via native `fetch`
 - **📊 Real-time animation** — Watch latency appear live in alternate screen buffer
@@ -95,6 +95,7 @@ Before using `free-coding-models`, make sure you have:
    - **Together AI** — [api.together.ai/settings/api-keys](https://api.together.ai/settings/api-keys) → API Keys (credits/promotions vary)
    - **Cloudflare Workers AI** — [dash.cloudflare.com](https://dash.cloudflare.com) → Create API token + set `CLOUDFLARE_ACCOUNT_ID` (Free: 10k neurons/day)
    - **Perplexity API** — [perplexity.ai/settings/api](https://www.perplexity.ai/settings/api) → API Key (tiered limits by spend)
+   - **ZAI** — [z.ai](https://z.ai) → Get API key (Coding Plan subscription)
 3. **OpenCode** *(optional)* — [Install OpenCode](https://github.com/opencode-ai/opencode) to use the OpenCode integration
 4. **OpenClaw** *(optional)* — [Install OpenClaw](https://openclaw.ai) to use the OpenClaw integration
@@ -179,13 +180,13 @@ When you run `free-coding-models` without `--opencode` or `--openclaw`, you get
 Use `↑↓` arrows to select, `Enter` to confirm. Then the TUI launches with your chosen mode shown in the header badge.
 **How it works:**
-1. **Ping phase** — All enabled models are pinged in parallel (up to 134 across 17 providers)
-2. **Continuous monitoring** — Models are re-pinged every 3 seconds forever
+1. **Ping phase** — All enabled models are pinged in parallel (up to 139 across 18 providers)
+2. **Continuous monitoring** — Models are re-pinged every 60 seconds forever
 3. **Real-time updates** — Watch "Latest", "Avg", and "Up%" columns update live
 4. **Select anytime** — Use ↑↓ arrows to navigate, press Enter on a model to act
 5. **Smart detection** — Automatically detects if NVIDIA NIM is configured in OpenCode or OpenClaw
-Setup wizard (first run — walks through all 17 providers):
+Setup wizard (first run — walks through all 18 providers):
 ```
   🔑 First-time setup — API keys
@@ -268,6 +269,7 @@ SILICONFLOW_API_KEY=sk_xxx free-coding-models
 TOGETHER_API_KEY=together_xxx free-coding-models
 CLOUDFLARE_API_TOKEN=cf_xxx CLOUDFLARE_ACCOUNT_ID=your_account_id free-coding-models
 PERPLEXITY_API_KEY=pplx_xxx free-coding-models
+ZAI_API_KEY=zai-xxx free-coding-models
 FREE_CODING_MODELS_TELEMETRY=0 free-coding-models
 ```
@@ -350,13 +352,24 @@ When enabled, telemetry events include: event name, app version, selected mode,
 1. Sign up at [perplexity.ai/settings/api](https://www.perplexity.ai/settings/api)
 2. Create API key (`PERPLEXITY_API_KEY`)
-> 💡 **Free tiers** — each provider exposes a dev/free tier with its own quotas.
+**ZAI** (5 models, GLM family):
+1. Sign up at [z.ai](https://z.ai)
+2. Subscribe to Coding Plan
+3. Get API key from dashboard
+> 💡 **Free tiers** — each provider exposes a dev/free tier with its own quotas. ZAI requires a Coding Plan subscription.
 ---
 ## 🤖 Coding Models
-**134 coding models** across 17 providers and 8 tiers, ranked by [SWE-bench Verified](https://www.swebench.com) — the industry-standard benchmark measuring real GitHub issue resolution. Scores are self-reported by providers unless noted.
+**139 coding models** across 18 providers and 8 tiers, ranked by [SWE-bench Verified](https://www.swebench.com) — the industry-standard benchmark measuring real GitHub issue resolution. Scores are self-reported by providers unless noted.
+### ZAI Coding Plan (5 models)
+| Tier | SWE-bench | Model |
+|------|-----------|-------|
+| **S+** ≥70% | GLM-5 (77.8%), GLM-4.5 (75.0%), GLM-4.7 (73.8%), GLM-4.5-Air (72.0%), GLM-4.6 (70.0%) |
 ### NVIDIA NIM (44 models)
@@ -528,6 +541,18 @@ You can force a specific port:
 OPENCODE_PORT=4098 free-coding-models --opencode
 ```
+### ZAI provider proxy
+OpenCode doesn't natively support ZAI's API path format (`/api/coding/paas/v4/*`). When you select a ZAI model, `free-coding-models` automatically starts a local reverse proxy that translates OpenCode's standard `/v1/*` requests to ZAI's API. This is fully transparent -- just select a ZAI model and press Enter.
+**How it works:**
+1. A localhost HTTP proxy starts on a random available port
+2. OpenCode is configured with a `zai` provider pointing at `http://localhost:<port>/v1`
+3. The proxy rewrites `/v1/models` to `/api/coding/paas/v4/models` and `/v1/chat/completions` to `/api/coding/paas/v4/chat/completions`
+4. When OpenCode exits, the proxy shuts down automatically
+No manual configuration needed -- the proxy lifecycle is managed entirely by `free-coding-models`.
 ### Manual OpenCode Setup (Optional)
 Create or edit `~/.config/opencode/opencode.json`:
@@ -682,7 +707,7 @@ This script:
 │  1. Enter alternate screen buffer (like vim/htop/less)           │
 │  2. Ping ALL models in parallel                                  │
 │  3. Display real-time table with Latest/Avg/Stability/Up%        │
-│  4. Re-ping ALL models every 3 seconds (forever)                │
+│  4. Re-ping ALL models every 60 seconds (forever)               │
 │  5. Update rolling averages + stability scores per model        │
 │  6. User can navigate with ↑↓ and select with Enter            │
 │  7. On Enter (OpenCode): set model, launch OpenCode             │
@@ -717,6 +742,7 @@ This script:
 | `CLOUDFLARE_API_TOKEN` / `CLOUDFLARE_API_KEY` | Cloudflare Workers AI token/key |
 | `CLOUDFLARE_ACCOUNT_ID` | Cloudflare account ID (required for Workers AI endpoint URL) |
 | `PERPLEXITY_API_KEY` / `PPLX_API_KEY` | Perplexity API key |
+| `ZAI_API_KEY` | ZAI key |
 | `FREE_CODING_MODELS_TELEMETRY` | `0` disables analytics, `1` enables analytics |
 | `FREE_CODING_MODELS_POSTHOG_KEY` | PostHog project API key used for anonymous event capture |
 | `FREE_CODING_MODELS_POSTHOG_HOST` | Optional PostHog ingest host (`https://eu.i.posthog.com` default) |
@@ -736,7 +762,8 @@ This script:
     "siliconflow": "sk_xxx",
     "together": "together_xxx",
     "cloudflare": "cf_xxx",
-    "perplexity": "pplx_xxx"
+    "perplexity": "pplx_xxx",
+    "zai":      "zai-xxx"
   },
   "providers": {
     "nvidia":   { "enabled": true },
@@ -749,7 +776,8 @@ This script:
     "siliconflow": { "enabled": true },
     "together": { "enabled": true },
     "cloudflare": { "enabled": true },
-    "perplexity": { "enabled": true }
+    "perplexity": { "enabled": true },
+    "zai":      { "enabled": true }
   },
   "favorites": [
     "nvidia/deepseek-ai/deepseek-v3.2"
@@ -764,7 +792,7 @@ This script:
 **Configuration:**
 - **Ping timeout**: 15 seconds per attempt (slow models get more time)
-- **Ping interval**: 3 seconds between complete re-pings of all models (adjustable with W/X keys)
+- **Ping interval**: 60 seconds between complete re-pings of all models (adjustable with W/X keys)
 - **Monitor mode**: Interface stays open forever, press Ctrl+C to exit
 **Flags:**
@@ -782,6 +810,8 @@ This script:
 | `--tier A` | Show only A+, A, A- tier models |
 | `--tier B` | Show only B+, B tier models |
 | `--tier C` | Show only C tier models |
+| `--profile <name>` | Load a saved config profile on startup |
+| `--recommend` | Auto-open Smart Recommend overlay on start |
 **Keyboard shortcuts (main TUI):**
 - **↑↓** — Navigate models
@@ -790,7 +820,12 @@ This script:
 - **F** — Toggle favorite on selected model (⭐ in Model column, pinned at top)
 - **T** — Cycle tier filter (All → S+ → S → A+ → A → A- → B+ → B → C → All)
 - **Z** — Cycle mode (OpenCode CLI → OpenCode Desktop → OpenClaw)
-- **P** — Open Settings (manage API keys, provider toggles, analytics toggle, manual update)
+- **P** — Open Settings (manage API keys, provider toggles, analytics toggle, manual update, profiles)
+- **Shift+P** — Cycle through saved profiles (switches live TUI settings)
+- **Shift+S** — Save current TUI settings as a named profile (inline prompt)
+- **Q** — Open Smart Recommend overlay (find the best model for your task)
+- **E** — Elevate tier filter (show higher tiers)
+- **D** — Descend tier filter (show lower tiers)
 - **W** — Decrease ping interval (faster pings)
 - **X** — Increase ping interval (slower pings)
 - **K** / **Esc** — Show/hide help overlay
@@ -799,15 +834,46 @@ This script:
 Pressing **K** now shows a full in-app reference: main hotkeys, settings hotkeys, and CLI flags with usage examples.
 **Keyboard shortcuts (Settings screen — `P` key):**
-- **↑↓** — Navigate providers, analytics row, and maintenance row
-- **Enter** — Edit API key inline, toggle analytics on analytics row, or check/install update on maintenance row
-- **Space** — Toggle provider enabled/disabled, or toggle analytics on analytics row
+- **↑↓** — Navigate providers, analytics row, maintenance row, and profile rows
+- **Enter** — Edit API key inline, toggle analytics, check/install update, or load a profile
+- **Space** — Toggle provider enabled/disabled, or toggle analytics
 - **T** — Test current provider's API key (fires a live ping)
 - **U** — Check for updates manually from settings
+- **Backspace** — Delete the selected profile (only on profile rows)
 - **Esc** — Close settings and return to main TUI
 ---
+### 📋 Config Profiles
+Profiles let you save and restore different TUI configurations — useful if you switch between work/personal setups, different tier preferences, or want to keep separate favorites lists.
+**What's stored in a profile:**
+- Favorites (starred models)
+- Sort column and direction
+- Tier filter
+- Ping interval
+- API keys
+**Saving a profile:**
+1. Configure the TUI the way you want (favorites, sort, tier, etc.)
+2. Press **Shift+S** — an inline prompt appears at the bottom
+3. Type a name (e.g. `work`, `fast-only`, `presentation`) and press **Enter**
+4. The profile is saved and becomes the active profile (shown as a purple badge in the header)
+**Switching profiles:**
+- **Shift+P** in the main table — cycles through saved profiles (or back to raw config)
+- **`--profile <name>`** — load a specific profile on startup
+**Managing profiles:**
+- Open Settings (**P** key) — scroll down to the **Profiles** section
+- **Enter** on a profile row to load it
+- **Backspace** on a profile row to delete it
+Profiles are stored inside `~/.free-coding-models.json` under the `profiles` key.
+---
 ## 🔧 Development
 ```bash