oioxo-mcp 0.5.4 → 0.5.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (3) hide show
  1. package/README.md +16 -16
  2. package/bundle/cli.js +107 -102
  3. package/package.json +61 -61
package/README.md CHANGED
@@ -1,12 +1,12 @@
1
- # OIOXO — Context Engine + On-Device Coding Agent
1
+ # OIOXO — cut your AI bill ~90%, and code privately from the terminal
2
2
 
3
- **Your AI agents read 10× more code than they need. You pay for every token of it.**
3
+ **Your AI tools read far more of your code than they need and you pay for every word.**
4
4
 
5
- OIOXO does two things from one CLI:
6
- 1. **Context engine** sits between your codebase and your AI agents (GitHub Copilot, Claude Code, Cursor, Windsurf, Gemini CLI, Codex) and hands them the *minimal relevant slice* of your project instead of letting them read whole files.
7
- 2. **Coding agent** `oioxo code "<task>"` plans, edits, and verifies in your repo, using your own key **or a local Ollama model** — nothing leaves your machine. See [Code in your terminal](#code-in-your-terminal--oioxo-code).
5
+ OIOXO fixes that from one command-line tool. It does two things:
6
+ 1. **Makes Copilot, Cursor & Claude ~90% cheaper.** It sits between your project and your AI tools and hands them only the small, relevant slice instead of letting them read whole files. Same answers, a fraction of the cost.
7
+ 2. **Codes for you, privately.** `oioxo code "<task>"` plans, edits, and checks the work right in your repo using your own key **or a local model** — so nothing leaves your machine. See [Code in your terminal](#code-in-your-terminal--oioxo-code).
8
8
 
9
- **90–92% fewer context tokens per question, measured on a real production codebase.** A question that costs an agent ~50,000 tokens of file reading comes back as a ~5,000-token capsule — same answer, a fraction of the cost.
9
+ **~90% fewer words per question** (measured on a real codebase). A question that would cost ~50,000 words comes back as ~5,000 — same answer, a fraction of the cost.
10
10
 
11
11
  ## Quick start
12
12
 
@@ -40,20 +40,20 @@ OIOXO_PROVIDER=ollama OIOXO_MODEL=qwen2.5-coder oioxo code "add input validation
40
40
 
41
41
  Flags: `--verify "<cmd>"` (override the test command) · `--yes` (apply without the prompt) · `--max-iters N` · `--local` (built-in on-device coder, no key — one-time `npm i -g node-llama-cpp`).
42
42
 
43
- ## Compute Mesh lend a hand, or borrow one
43
+ ## Let your devices build together
44
44
 
45
- On the same Wi-Fi, your devices work as one engine and the CLI is a lend-only peer. Add a
46
- device with `oioxo invite` (or `oioxo join`) and this machine lends its coder to another
47
- device's build, so your phone or a thin laptop builds on *this* machine's GPU. The handshake
48
- is a short code a QR or a string with no signaling server and nothing relayed through
49
- OIOXO. Lend compute and you earn free coding time for it.
45
+ Put two of your own devices on the **same Wi-Fi** and they work as one — this machine can lend
46
+ its power to another device's build (so your phone or a thin laptop builds on *this* machine's
47
+ GPU), or borrow power the other way. Start it with `oioxo invite` (or `oioxo join`); pairing is
48
+ just a short code (a QR or a string), with nothing going through our servers. Lend power and you
49
+ earn **free coding time** for it. *(Feature name: Compute Mesh.)*
50
50
 
51
51
  ## Why developers use it
52
52
 
53
- - 💸 **Subscriptions go further** — Copilot premium requests, Claude limits, and API keys all last longer.
54
- - ⚡ **Better answers** — focused context beats 75k tokens of noise. Your agent gets the exact code in play, plus the parts of the project it actually depends on.
55
- - 🔒 **100% on-device** — your code is indexed and queried locally, never uploaded. Only the saved-token *count* is metered.
56
- - 🪄 **One command** — `init` detects Claude Code, VS Code/Copilot, Cursor, Windsurf, Gemini CLI and Codex, and merges their configs without touching your other MCP servers.
53
+ - 💸 **Your AI costs drop ~90%** — Copilot premium requests, Claude limits, and API keys all last several times longer.
54
+ - ⚡ **Better answers, too** — focused context beats 50k words of noise. Your AI gets the exact code in play, plus the parts it actually depends on.
55
+ - 🔒 **100% on your device** — your code is read and searched locally, never uploaded. Only the *amount* you saved is counted.
56
+ - 🪄 **One command** — `init` finds Claude Code, VS Code/Copilot, Cursor, Windsurf, Gemini CLI and Codex, and sets them up without touching your other tools.
57
57
 
58
58
  ## How it works (for you)
59
59