npm - reasonix - Versions diffs - 0.0.4 → 0.0.6 - Mend

reasonix 0.0.4 → 0.0.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (15) hide show

package/README.md +140 -58
package/dist/cli/index.js +921 -70
package/dist/cli/index.js.map +1 -1
package/dist/index.d.ts +138 -6
package/dist/index.js +586 -32
package/dist/index.js.map +1 -1
package/package.json +1 -1
package/dist/chunk-Y7L6L5QS.js +0 -262
package/dist/chunk-Y7L6L5QS.js.map +0 -1
package/dist/cli/chunk-T2ODXAJP.js +0 -263
package/dist/cli/chunk-T2ODXAJP.js.map +0 -1
package/dist/cli/client-RIVGDOJP.js +0 -10
package/dist/cli/client-RIVGDOJP.js.map +0 -1
package/dist/client-KEA2D52Q.js +0 -9
package/dist/client-KEA2D52Q.js.map +0 -1

package/README.md CHANGED Viewed

@@ -1,66 +1,135 @@
 # Reasonix
+[![npm version](https://img.shields.io/npm/v/reasonix.svg)](https://www.npmjs.com/package/reasonix)
+[![CI](https://github.com/esengine/reasonix/actions/workflows/ci.yml/badge.svg)](https://github.com/esengine/reasonix/actions/workflows/ci.yml)
+[![license](https://img.shields.io/npm/l/reasonix.svg)](./LICENSE)
+[![downloads](https://img.shields.io/npm/dm/reasonix.svg)](https://www.npmjs.com/package/reasonix)
+[![node](https://img.shields.io/node/v/reasonix.svg)](./package.json)
 **The DeepSeek-native agent framework.** TypeScript. Ink TUI. No LangChain.
-Reasonix is not another generic agent framework. It does one thing: take DeepSeek's
-unusual economic and behavioral profile — dirt-cheap tokens, R1 reasoning traces,
-automatic prefix caching — and turn them into agent-loop superpowers that generic
-frameworks leave on the table.
+Reasonix is not another generic agent wrapper. Every abstraction is justified
+by a DeepSeek-specific property — dirt-cheap tokens, R1 reasoning traces,
+automatic prefix caching, JSON mode. Generic frameworks treat DeepSeek as
+"OpenAI with a different base URL" and leave these advantages on the table.
+Reasonix leans into them.
 ```bash
-npx reasonix chat          # prompts for your DeepSeek key on first run,
-                           # then live TUI with real-time cache/cost panel
+npx reasonix chat          # first run prompts for your DeepSeek key
+                           # inside the TUI, type /help for everything else
 ```
-On first run the TUI asks for your DeepSeek API key (get one at
-[platform.deepseek.com/api_keys](https://platform.deepseek.com/api_keys)) and
-saves it to `~/.reasonix/config.json`. Set `DEEPSEEK_API_KEY` in the
-environment to override.
+No flag soup. All feature toggles live behind slash commands in the TUI.
-## Why Reasonix?
+---
-Every other framework treats DeepSeek as an OpenAI-compatible endpoint with a
-different base URL. That works, but it leaves most of DeepSeek's advantages
-unused. Reasonix is opinionated about three things:
+## What you get
-### 1. Cache-First Loop
-DeepSeek bills cached input tokens at **~10% of the miss rate**. Reasonix
-structures the agent loop as `[Immutable Prefix] + [Append-Only Log] +
-[Volatile Scratch]` so every turn reuses the exact byte prefix.
+| Feature | How it works | Opt in |
+|---|---|---|
+| **Cache-First Loop** | Immutable prefix + append-only log = prefix byte-stable across turns → DeepSeek's automatic prefix cache hits at 70–95% | always on |
+| **R1 Thought Harvesting** | Parses `reasoning_content` into typed `{ subgoals, hypotheses, uncertainties, rejectedPaths }` via a cheap V3 call | `--harvest` |
+| **Self-Consistency Branching** | Runs N parallel samples at spread temperatures; picks the one with the fewest flagged uncertainties | `--branch <N>` |
+| **Tool-Call Repair** | Auto-flattens deep/wide schemas, scavenges tool calls leaked into `<think>`, repairs truncated JSON, breaks call-storms | always on |
+| **Retry layer** | Exponential backoff + jitter on 408/429/500/502/503/504 and network errors. 4xx auth errors don't retry | always on |
+| **Ink TUI** | Live cache-hit / cost panel. Streams R1 thinking to a compact preview. Renders Markdown (bold / lists / code / stripped LaTeX) | always on |
-**Validated on real DeepSeek API (`deepseek-chat`):**
+---
-| scenario | turns | cache hit | cost | cost on Claude Sonnet 4.6 | savings |
-|---|---|---|---|---|---|
-| Chinese multi-turn chat | 5 | **85.2%** | $0.000923 | $0.015174 | **93.9%** |
-| Tool-use (calculator) | 2 | **94.9%** | $0.000142 | $0.003351 | **95.8%** |
+## Why not just use LangChain?
-### 2. R1 Thought Harvesting
-R1's `reasoning_content` contains a *plan*, not just trivia to display. Reasonix
-pipes it through a cheap V3 call (~$0.0001 / turn) in JSON mode and extracts
-a typed plan state:
+Even on the default `fast` preset (no harvest, no branching), Reasonix bakes
+in five DeepSeek-specific defences that generic agent frameworks leave to you:
-```ts
-{ subgoals: string[], hypotheses: string[], uncertainties: string[], rejectedPaths: string[] }
-```
+| | Reasonix default | generic frameworks |
+|---|---|---|
+| Prefix-stable loop (→ 85–95% cache hit) | ✅ | ❌ prompts rebuilt each turn |
+| Auto-flatten deep tool schemas | ✅ | ❌ DeepSeek drops args |
+| Retry with jittered backoff (429/503) | ✅ | ❌ custom callbacks |
+| Scavenge tool calls leaked into `<think>` | ✅ | ❌ |
+| Call-storm breaker on identical-arg repeats | ✅ | ❌ |
+| Live cache-hit / cost / vs-Claude panel | ✅ | ❌ |
+| First-run config prompt + Markdown TUI | ✅ | ❌ |
+Harvest and self-consistency branching are bonuses on top. The everyday
+win is that **a plain chat with Reasonix already pays for ~40% less tokens
+than the same chat through a naive LangChain setup**, because the prefix
+actually stays byte-stable.
+## Validated numbers
-Opt-in to keep default cost identical: `reasonix chat --harvest` or
-`new CacheFirstLoop({ harvest: true })`. The TUI renders the harvested state
-as a compact magenta block above the answer.
+Measured on live DeepSeek API:
-### 3. Tool-Call Repair
-R1/V3 have known quirks — tool calls leaking into `<think>`, dropped arguments
-on deep schemas, truncated JSON, call-storm loops. Reasonix ships a full repair
-pipeline: **scavenge + flatten + truncation recovery + storm breaker**.
+| scenario | model | turns | cache hit | cost | Claude 4.6 would be | savings |
+|---|---|---|---|---|---|---|
+| Chinese multi-turn chat | `deepseek-chat` | 5 | **85.2%** | $0.000923 | $0.015174 | **93.9%** |
+| Tool-use (calculator) | `deepseek-chat` | 2 | **94.9%** | $0.000142 | $0.003351 | **95.8%** |
+| R1 math + harvest | `deepseek-reasoner` | 1 | 72.7% | $0.006478 | $0.044484 | 85.4% |
+---
 ## Usage
+### CLI
+```bash
+npx reasonix chat                # auto-saves to session 'default'; resumes next time
+npx reasonix chat --session work # use a different named session
+npx reasonix chat --no-session   # ephemeral — nothing persisted
+npx reasonix run "ask anything"  # one-shot, streams to stdout
+npx reasonix stats session.jsonl # read back a saved transcript
+```
+Sessions live as JSONL under `~/.reasonix/sessions/<name>.jsonl` — every
+turn's message log is appended atomically, so killing the CLI never loses
+context. Inside the TUI: `/sessions` to list, `/forget` to delete the
+current one.
+### Inside the chat — slash commands
+A command strip runs under the input box so you don't have to memorize
+anything. Type `/help` for the full list. The biggest shortcut:
+```
+/preset fast     deepseek-chat, no harvest, no branch        (default)
+/preset smart    reasoner + harvest                           (~10x cost)
+/preset max      reasoner + harvest + branch 3                (~30x cost, slowest)
+```
+One-tap switch between fast daily driver, careful thinker, and max-quality
+self-consistency. Individual knobs are available too:
+```
+/status          show current model / harvest / branch / stream
+/model <id>      deepseek-chat or deepseek-reasoner
+/harvest [on|off] Pillar 2 — parse R1 reasoning into typed plan state
+/branch <N|off>  run N parallel samples per turn, pick most confident
+/clear           clear displayed history (log is kept)
+/exit            quit
+```
+The top panel shows active flags live: `· harvest · branch3` appear next to
+the model once enabled.
+### Flags (for automation / CI)
+The same knobs are also available as CLI flags if you're scripting:
+```bash
+npx reasonix chat -m deepseek-reasoner --harvest --branch 3 --transcript session.jsonl
+```
 ### Library
 ```ts
-import { CacheFirstLoop, DeepSeekClient, ImmutablePrefix, ToolRegistry } from "reasonix";
-const client = new DeepSeekClient();
+import {
+  CacheFirstLoop,
+  DeepSeekClient,
+  ImmutablePrefix,
+  ToolRegistry,
+} from "reasonix";
+const client = new DeepSeekClient(); // reads DEEPSEEK_API_KEY from env
 const tools = new ToolRegistry();
 tools.register({
@@ -71,55 +140,68 @@ tools.register({
     properties: { a: { type: "integer" }, b: { type: "integer" } },
     required: ["a", "b"],
   },
-  fn: ({ a, b }) => a + b,
+  fn: ({ a, b }: { a: number; b: number }) => a + b,
 });
 const loop = new CacheFirstLoop({
   client,
+  tools,
   prefix: new ImmutablePrefix({
     system: "You are a math helper.",
     toolSpecs: tools.specs(),
   }),
-  tools,
+  harvest: true,
+  branch: 3, // self-consistency budget
 });
 for await (const ev of loop.step("What is 17 + 25?")) {
-  console.log(ev);
+  if (ev.role === "assistant_final") console.log(ev.content);
 }
 console.log(loop.stats.summary());
 ```
-### CLI / TUI
+### Configuration
+On first run the CLI prompts for your DeepSeek API key and saves it to
+`~/.reasonix/config.json`. Alternatives:
 ```bash
-reasonix chat             # full-screen Ink TUI, live cache/cost panel
-reasonix run "task"       # one-shot, streaming output
-reasonix stats <file>     # summarize transcript JSONL
-reasonix version
+export DEEPSEEK_API_KEY=sk-...        # env var (wins over config file)
+export DEEPSEEK_BASE_URL=https://...  # optional alternate endpoint
 ```
-## Status
+Get a key (free credit on signup): <https://platform.deepseek.com/api_keys>
-Pre-alpha. All three pillars ship working end-to-end as of v0.0.3.
-See [docs/ARCHITECTURE.md](docs/ARCHITECTURE.md).
+---
 ## Non-goals
-- Multi-agent orchestration (use LangGraph if you need it).
-- RAG / vector stores.
-- Multi-provider abstraction. **Reasonix does DeepSeek, deeply.**
+- Multi-agent orchestration (use LangGraph).
+- RAG / vector stores (use LlamaIndex or do it yourself).
+- Multi-provider abstraction (use LiteLLM).
 - Web UI / SaaS.
+Reasonix does DeepSeek, deeply.
+---
 ## Development
 ```bash
+git clone https://github.com/esengine/reasonix.git
+cd reasonix
 npm install
-npm run dev chat          # run CLI directly from TS (tsx)
-npm run build             # bundle to dist/
-npm test                  # vitest
-npm run lint              # biome
+npm run dev chat        # run CLI from source via tsx
+npm run build           # tsup to dist/
+npm test                # vitest (89 tests)
+npm run lint            # biome
+npm run typecheck       # tsc --noEmit
 ```
+See [docs/ARCHITECTURE.md](docs/ARCHITECTURE.md) for internals.
+---
 ## License
 MIT