npm - tokengolf - Versions diffs - 0.4.3 → 0.5.1 - Mend

tokengolf 0.4.3 → 0.5.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (21)

package/CHANGELOG.md +10 -0
package/CLAUDE.md +12 -4
package/README.md +225 -109
package/dist/cli.js +121 -46
package/docs/assets/banner.svg +45 -0
package/docs/index.html +135 -113
package/hooks/session-end.js +12 -27
package/hooks/statusline.sh +36 -20
package/install.sh +69 -0
package/package.json +3 -2
package/scripts/update-homebrew.sh +39 -0
package/src/components/ActiveRun.js +7 -3
package/src/components/ScoreCard.js +7 -9
package/src/components/StartRun.js +20 -4
package/src/components/StatsView.js +31 -11
package/src/lib/demo.js +37 -19
package/src/lib/ui.js +16 -0
package/assets/demo-hud.png +0 -0
package/assets/scorecard.png +0 -0
package/docs/assets/demo-hud.png +0 -0
package/docs/assets/scorecard.png +0 -0

package/CHANGELOG.md CHANGED

@@ -6,6 +6,16 @@ TokenGolf patch notes — what changed, what it measures, and why the mechanic e
 ## [Unreleased]
+### Changed
+- **Design D HUD** — StatusLine HUD redesigned with `██` accent bar, inline `▓░` progress bars for budget and context, no separator lines. 1 line when context <50%, 2 lines when context visible. Accent bar turns red when budget >75%. Matches Design D across all UI surfaces.
+- **Design D block accent UI** — All bordered boxes replaced with left-only `██` block accent bars. Eliminates persistent right-border misalignment caused by emoji/unicode width differences across terminals. Color-coded: yellow for won, red for died, gray for neutral.
+- ScoreCard, StatsView, ActiveRun, StartRun components all use custom Ink `borderStyle` with `left: '██'`, no right/top/bottom borders
+- session-end.js ANSI scorecard uses `██` prefix per line with `─` horizontal separators
+- Achievement badges in StatsView rendered inline (no individual bordered boxes)
+- Death tip in ScoreCard rendered as indented text (no bordered box)
+- Landing page terminal demos updated to match `██` style
+- README screenshots replaced with inline code block demos — no more stale PNGs
 ### Fixed
 - StatusLine HUD effort label now reads exclusively from live `settings.json` — `/model` changes (High, Low, Max) reflect immediately without requiring a new session
 - Medium effort no longer shown in HUD or scorecard — it's the default, so it's omitted (same as not annotating Sonnet as "normal difficulty")
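
The effort-label rule in the Fixed entries above (append the effort only when it is explicitly set, and omit medium because it is the default) can be sketched as follows. This is a minimal illustration, not the package's code: `CLASS_EMOJI` and `modelLabel` are hypothetical names, and the effort value is passed in directly because the diff does not show how it is read from `settings.json`.

```javascript
// Hypothetical lookup table; the emoji follow the model labels quoted in the diff.
const CLASS_EMOJI = { haiku: '🏹', sonnet: '⚔️', opus: '🧙' };

// Build a HUD label such as "Sonnet·High" with its class emoji.
// Assumption: `family` is a lowercase model family and `effort` is
// undefined, 'low', 'medium', 'high', or 'max'.
function modelLabel(family, effort) {
  const name = family.charAt(0).toUpperCase() + family.slice(1);
  const base = `${CLASS_EMOJI[family] ?? ''} ${name}`.trim();

  // Medium is the default, so it is not annotated; neither is an unset effort.
  if (!effort || effort === 'medium') return base;

  const effortName = effort.charAt(0).toUpperCase() + effort.slice(1);
  return `${base}·${effortName}`;
}

console.log(modelLabel('sonnet', 'high'));
console.log(modelLabel('sonnet', 'medium'));
```

Keeping the rule in one pure function would let the HUD and the scorecard share the same omission of medium effort.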

package/CLAUDE.md CHANGED

@@ -347,10 +347,13 @@ Nine hooks in `hooks/` directory, installed via `tokengolf install`. Most comple
 ### `StatusLine` (`statusline.sh`)
 - Bash script; uses `TG_SESSION_JSON=... python3 - "$STATE_FILE" <<'PYEOF'` pattern to avoid heredoc/stdin conflict
 - Receives live session JSON (cost, context %, model) via stdin
-- Shows: quest/mode | tier emoji + cost [/budget pct%] | [efficiency rating] | [🪶/🎒/📦 ctx%] | model label | [floor]
+- **Design D accent bar**: `██` prefix on each line, color-coded (yellow normal, red when budget >75%)
+- Line 1: `██ ⛳ quest  $cost/budget ▓▓▓░░░ pct%  RATING  model  F1/5`
+- Line 2 (optional, when context ≥50%): `██ 🧠 ▓▓▓▓░░░ ctx% 🪶/🎒/📦`
+- Budget progress bar: `▓` filled, `░` empty, 11 chars wide. Red when >75%, yellow otherwise
+- Context progress bar: `▓░` 10 chars wide. Green (50–74%), yellow (75–89%), red (90%+); hidden below 50%
 - Model label: `⚔️ Sonnet`, `⚔️ Sonnet·High`, `🏹 Haiku`, `🧙 Opus·Max`, etc. Effort appended only when explicitly set in settings.json (medium omitted — it's the default)
-- Context load: 🪶 green (50–74%), 🎒 yellow (75–89%), 📦 red (90%+); hidden below 50%
-- Separator lines (`───────────────`) above and below HUD row
+- 1 line when context <50% (flow mode is often just 1 line), 2 lines when context visible
 - statusLine config must be an object: `{type:"command", command:"...statusline.sh", padding:1}`
 ### Hook installation
@@ -388,9 +391,11 @@ Thinking tokens are estimated from character count ÷ 4 (approximate — display
 7. **Death marks fire before the early return** — `calculateAchievements` has an `if (!won) return []` early exit, but death marks (blowout, so_close, tool_happy, silent_death, fumble, expensive_taste, hubris) fire before it. `indecisive` (model switches) and `expensive_taste` also fire on won runs — they're behavior patterns, not death verdicts.
+8. **Design D: ██ block accent, no right borders** — All UI cards use a left-only `██` block accent bar instead of full box borders. This eliminates persistent right-border misalignment caused by emoji/unicode width calculation differences across terminals. Color-coded: yellow `██` for won, red `██` for died, gray `██` for neutral (stats, wizard). Ink components use a custom `borderStyle` object with `left: '██'` and `borderRight/Top/Bottom={false}`, `paddingLeft={3}`. session-end.js ANSI scorecard uses `██` prefix + `─` horizontal separators. No screenshots — README uses inline code block demos.
 ---
-## Current Status: v0.3
+## Current Status: v0.4
 ### Done
 - [x] Full project scaffold with esbuild pipeline
@@ -418,6 +423,9 @@ Thinking tokens are estimated from character count ÷ 4 (approximate — display
 - [x] 28 new achievements: prompting skill, tool mastery, cost/prompt, time, subagents, turn discipline, death marks
 - [x] 3 new hooks: PostToolUseFailure, SubagentStart, Stop
 - [x] Vitest test suite — 120 tests covering all achievements + pure score functions
+- [x] Design D block accent UI — ██ left bar, no right borders, color-coded state
+- [x] Landing page terminal demos updated to ██ style
+- [x] README inline code block demos (replaced PNG screenshots)
 ### Next up (v0.4)
 - [ ] `tokengolf floor` command to advance floor manually
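
The StatusLine notes in this file describe two progress bars: an 11-character budget bar (red above 75%, yellow otherwise) and a 10-character context bar (green at 50–74%, yellow at 75–89%, red at 90%+, hidden below 50%). The sketch below shows one way to compute them. The function names are hypothetical, and rounding to the nearest cell is an assumption; it is consistent with the README examples in this diff (36% shows 4 of 11 cells, 52% shows 5 of 10) but is not confirmed by `statusline.sh` itself.

```javascript
const FILLED = '▓';
const EMPTY = '░';

// Render a fixed-width bar for a percentage. Values outside 0-100 are
// clamped so an over-budget run still yields exactly `width` characters.
function renderBar(pct, width) {
  const clamped = Math.max(0, Math.min(100, pct));
  const filled = Math.round((clamped / 100) * width); // assumed rounding rule
  return FILLED.repeat(filled) + EMPTY.repeat(width - filled);
}

// Budget bar: 11 cells, red above 75% of budget, yellow otherwise.
function budgetBar(pct) {
  return { bar: renderBar(pct, 11), color: pct > 75 ? 'red' : 'yellow' };
}

// Context bar: 10 cells, hidden (null) below 50% context.
function contextBar(pct) {
  if (pct < 50) return null;
  const color = pct >= 90 ? 'red' : pct >= 75 ? 'yellow' : 'green';
  return { bar: renderBar(pct, 10), color };
}

console.log(budgetBar(36).bar);
console.log(contextBar(52));
```

Returning `null` for the hidden context bar makes the one-line versus two-line HUD decision a simple null check in the caller.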

package/README.md CHANGED

@@ -1,49 +1,73 @@
-# ⛳ TokenGolf
-> Flow mode tracks you. Roguelike mode trains you.
-Turn Claude Code token efficiency into a game. Declare a quest, commit to a budget, pick a character class. Work normally. At the end, get a score based on how efficiently you used your budget.
-**Better prompting → fewer tokens → higher score.**
+<p align="center">
+  <img src="docs/assets/banner.svg" alt="TokenGolf" width="800" />
+</p>
-**[tokengolf.dev](https://josheche.github.io/tokengolf/)** · [npm](https://www.npmjs.com/package/tokengolf) · [GitHub](https://github.com/josheche/tokengolf)
+<p align="center">
+  <strong>Flow mode tracks you. Roguelike mode trains you.</strong>
+</p>
----
+<p align="center">
+  <a href="https://www.npmjs.com/package/tokengolf"><img src="https://img.shields.io/npm/v/tokengolf?style=flat&color=d4a840&label=npm" alt="npm version" /></a>
+  <a href="https://www.npmjs.com/package/tokengolf"><img src="https://img.shields.io/npm/dm/tokengolf?style=flat&color=6bb54a&label=downloads" alt="npm downloads" /></a>
+  <a href="https://github.com/josheche/tokengolf/blob/main/LICENSE"><img src="https://img.shields.io/npm/l/tokengolf?style=flat&color=7cb8d4" alt="license" /></a>
+  <a href="https://nodejs.org"><img src="https://img.shields.io/node/v/tokengolf?style=flat&color=9a9180" alt="node version" /></a>
+</p>
-<!-- SCREENSHOT: tokengolf start wizard — quest/class/effort/budget selection -->
+<p align="center">
+  <a href="https://josheche.github.io/tokengolf/">tokengolf.dev</a> · <a href="https://www.npmjs.com/package/tokengolf">npm</a> · <a href="https://github.com/josheche/tokengolf">GitHub</a>
+</p>
 ---
-## Why "TokenGolf"?
+Turn Claude Code token efficiency into a game. Declare a quest, commit to a budget, pick a character class. Work normally. At the end — a scorecard, achievements, and a score that measures how well you prompt.
-[Code golf](https://en.wikipedia.org/wiki/Code_golf) is the engineering practice of solving a problem in as few characters (or lines, or bytes) as possible. The constraint isn't the point — the *discipline the constraint creates* is the point. Writing the shortest possible solution forces you to understand the problem deeply and use your tools precisely.
+**Better prompting → fewer tokens → higher score.** 60+ achievements. 9 hooks. 4 character classes. Zero config beyond `npm install`.
-Token golf is the same idea applied to AI sessions. Your budget is par. Every unnecessary prompt, every redundant context dump, every "can you also..." tacked onto a request is a stroke over par. The game doesn't literally resemble golf — it borrows the concept: **optimize under constraint, measure your score, improve your game.**
+```
+██  🏆  SESSION COMPLETE
+██  add pagination to /users endpoint
+██  ──────────────────────────────────────────────────
+██  $0.2300  /$1.50  15%  ⚔️ Sonnet  🥇 Gold
+██  🌟 LEGENDARY
+██  ──────────────────────────────────────────────────
+██  🎯 Sniper  🥈 Silver  🔥 No Rest  ✅ Clean Run  🧰 Toolbox  🤫 Silent Run
+```
 ---
-## Two Modes
-### ⛳ Flow Mode
-Just work. TokenGolf auto-creates a tracking session when you open Claude Code. `/exit` the session and the scorecard appears automatically. No pre-configuration required.
+## Quick Start
-### ☠️ Roguelike Mode
-Pre-commit before you start. Declare a quest, pick a class and effort level, set a budget. Go over budget = permadeath — the run is logged as a death. The deliberate pressure trains better prompting habits, which makes your Flow sessions cheaper over time.
+**npm** (recommended)
+```bash
+npm install -g tokengolf
+```
----
+**Homebrew**
+```bash
+brew tap josheche/tokengolf
+brew install tokengolf
+```
-## Install
+**curl**
+```bash
+curl -fsSL https://raw.githubusercontent.com/josheche/tokengolf/main/install.sh | bash
+```
+Then set up the hooks (npm and brew only — curl does this automatically):
 ```bash
-npm install -g tokengolf
-tokengolf install
+tokengolf install          # patches ~/.claude/settings.json
 ```
-`tokengolf install` patches `~/.claude/settings.json` with the hooks that power live tracking, the HUD, and the auto scorecard.
+That's it. Open Claude Code, work normally, `/exit` — scorecard appears automatically (Flow mode). Or declare a run first:
----
+```bash
+tokengolf start            # pick quest, class, effort, budget
+# ... work in Claude Code ...
+tokengolf win              # complete run, auto-detect cost
+```
-## Commands
+<details>
+<summary>All commands</summary>
 ```bash
 tokengolf start       # declare quest + class + effort + budget, begin a run
@@ -53,103 +77,92 @@ tokengolf bust        # manual permadeath override
 tokengolf scorecard   # last run scorecard
 tokengolf stats       # career dashboard
 tokengolf install     # patch ~/.claude/settings.json with hooks
-tokengolf demo        # show all HUD states (for screenshots)
+tokengolf demo        # show all UI states (for screenshots)
 ```
+</details>
 ---
-## Character Classes & Effort
+## Two Modes
-| Class | Model | Effort | Feel |
-|-------|-------|--------|------|
-| 🏹 Rogue | Haiku | — *(skips effort step)* | Glass cannon. Prompt precisely or die. |
-| ⚔️ Fighter | Sonnet | Low / **Medium** / High | Balanced. The default run. |
-| 🧙 Warlock | Opus | Low / **Medium** / High / Max | Powerful but costly. |
-| ⚜️ Paladin | Opus (plan mode) | Low / **Medium** / High / Max | Strategic planner. Thinks before acting. |
-| ⚡ Warlock·Fast | Opus + fast mode | any | 2× cost. Maximum danger mode. |
+### ⛳ Flow Mode
+Just work. TokenGolf auto-creates a tracking session when you open Claude Code. `/exit` and the scorecard appears. No pre-configuration.
-`max` effort is Opus-only — the API returns an error if used on other models. Fast mode is toggled mid-session with `/fast` in Claude Code and is auto-detected by TokenGolf.
+### ☠️ Roguelike Mode
+Pre-commit before you start. Declare a quest, pick a class and effort level, set a budget. Go over budget = permadeath — the run is logged as a death. The pressure trains better prompting habits, which makes your Flow sessions cheaper over time.
 ---
-## Budget Presets (Model-Calibrated)
+## Character Classes
-The wizard shows different amounts depending on your class — same relative difficulty, different absolute cost. Anchored to the ~$0.75/task Sonnet average from Anthropic's $6/day Claude Code benchmark.
-| Tier | Haiku 🏹 | Sonnet ⚔️ | Opus 🧙 | Feel |
-|------|---------|---------|--------|------|
-| 💎 Diamond | $0.15 | $0.50 | $2.50 | Surgical micro-task |
-| 🥇 Gold | $0.40 | $1.50 | $7.50 | Focused small task |
-| 🥈 Silver | $1.00 | $4.00 | $20.00 | Medium task |
-| 🥉 Bronze | $2.50 | $10.00 | $50.00 | Heavy / complex |
-| ✏️ Custom | any | any | any | Set your own bust threshold |
+| Class | Model | Difficulty | Feel |
+|-------|-------|------------|------|
+| 🏹 **Rogue** | Haiku | Nightmare | Glass cannon. Prompt precisely or die. |
+| ⚔️ **Fighter** | Sonnet | Standard | Balanced. The default run. |
+| 🧙 **Warlock** | Opus | Casual | Powerful but expensive. |
+| ⚜️ **Paladin** | Opus (plan mode) | Tactical | Strategic planner. Thinks before acting. |
+| ⚡ **Warlock·Fast** | Opus + fast mode | Danger | 2× cost. Maximum risk, maximum speed. |
-These are **bust thresholds** — your commitment. Efficiency tiers derive as percentages of whatever you commit to.
+Effort levels: Low / **Medium** / High / Max (Opus-only). Fast mode toggled with `/fast` in Claude Code, auto-detected by TokenGolf.
 ---
 ## Scoring
-**Efficiency rating** (roguelike mode — % of your budget used):
+**Efficiency** (roguelike — % of budget used):
 | 🌟 LEGENDARY | ⚡ EFFICIENT | ✓ SOLID | 😅 CLOSE CALL | 💀 BUSTED |
 |---|---|---|---|---|
 | < 25% | < 50% | < 75% | < 100% | > 100% |
-**Spend tier** (absolute cost, shown on every scorecard):
+**Spend tier** (absolute, shown on every scorecard):
-| 💎 | 🥇 | 🥈 | 🥉 | 💸 |
+| 💎 Diamond | 🥇 Gold | 🥈 Silver | 🥉 Bronze | 💸 Reckless |
 |---|---|---|---|---|
 | < $0.10 | < $0.30 | < $1.00 | < $3.00 | > $3.00 |
----
-## Ultrathink
-Write `ultrathink` anywhere in your prompt to trigger extended thinking mode. It's not a slash command — just say it in natural language:
-> *"ultrathink: is this the right architecture before I write anything?"*
-> *"can you ultrathink through the tradeoffs here?"*
-Extended thinking tokens are billed at full output rate. A single ultrathink on Sonnet can cost $0.50–2.00 depending on problem depth. TokenGolf detects thinking blocks from your session transcripts and tracks invocations and estimated thinking tokens — both show on your scorecard.
-**The skill is knowing when to ultrathink.** One expensive deep-think that prevents five wrong turns is efficient. Ultrathinking every prompt when you're at 80% budget is hubris.
+Budget presets are model-calibrated — Haiku Diamond is $0.15, Opus Diamond is $2.50. Same relative difficulty, different absolute cost.
 ---
-## The Meta Loop
-The dungeon crawl framing maps directly to real session behaviors:
-- **Overencumbered** = context bloat slowing you down
-- **Made Camp** = hit usage limits, came back next session
-- **Ghost Run** = surgical context management before the boss
-- **Hubris** = reached for ultrathink on a tight budget and paid for it
-- **Silent Run** = solved it with pure prompting discipline, no extended thinking needed
-- **Lone Wolf** = didn't spawn a single subagent; held the whole problem in one context
-- **Agentic** = gave Claude the wheel and it ran with it — 3+ turns per prompt
+## Achievements
-Roguelike mode surfaces these patterns explicitly. Flow mode lets them compound over time. The meta loop: **roguelike practice makes Flow sessions better. Better Flow = lower daily spend = better scores without even trying.**
+60+ achievements tracking how you prompt, what tools you use, and how efficiently you spend. Here are some highlights:
----
+| | Achievement | How |
+|---|---|---|
+| 🥊 | **One Shot** | Completed in a single prompt |
+| 🎯 | **Sniper** | Under 25% of budget used |
+| 💎 | **Diamond** | Haiku under $0.10 total |
+| 🔪 | **Surgeon** | 1–3 Edit calls, under budget |
+| 🤫 | **Silent Run** | No extended thinking, SOLID or better |
+| 👑 | **Archmagus** | Opus at max effort, completed |
+| 🥷 | **Ghost Run** | Manual compact at ≤30% context |
+| 🐺 | **Lone Wolf** | No subagents spawned |
+| 🤦 | **Hubris** | Used ultrathink, busted anyway *(death mark)* |
+| 💥 | **Blowout** | Spent 2× your budget *(death mark)* |
-## Achievements
+<details>
+<summary><strong>Full achievement list (60+)</strong></summary>
 **Class**
 - 💎 Diamond — Haiku under $0.10 total spend
 - 🥇 Gold — Completed with Haiku
 - 🥈 Silver — Completed with Sonnet
 - 🥉 Bronze — Completed with Opus
+- ⚜️ Paladin — Completed as Paladin (Opus plan mode)
+- ♟️ Grand Strategist — LEGENDARY efficiency as Paladin
 **Efficiency**
 - 🎯 Sniper — Under 25% of budget used
 - ⚡ Efficient — Under 50% of budget used
 - 🪙 Penny Pincher — Total spend under $0.10
-- 🪙 Cheap Shots — Under $0.01 per prompt (≥3 prompts)
+- 💲 Cheap Shots — Under $0.01 per prompt (≥3 prompts)
 - 🍷 Expensive Taste — Over $0.50 per prompt (≥3 prompts)
 **Prompting skill**
-- 🎯 One Shot — Completed in a single prompt
+- 🥊 One Shot — Completed in a single prompt
 - 💬 Conversationalist — 20+ prompts in one run
 - 🤐 Terse — ≤3 prompts, ≥10 tool calls
 - 🪑 Backseat Driver — 15+ prompts but <1 tool call per prompt
@@ -164,12 +177,12 @@ Roguelike mode surfaces these patterns explicitly. Flow mode lets them compound
 - 🧰 Toolbox — 5+ distinct tools used
 **Effort**
-- 🎯 Speedrunner — Low effort, completed under budget
-- 💪 Tryhard — High/max effort, LEGENDARY efficiency
+- 🏎️ Speedrunner — Low effort, completed under budget
+- 🏋️ Tryhard — High/max effort, LEGENDARY efficiency
 - 👑 Archmagus — Opus at max effort, completed
 **Fast mode**
-- ⚡ Lightning Run — Opus fast mode, completed under budget
+- ⛈️ Lightning Run — Opus fast mode, completed under budget
 - 🎰 Daredevil — Opus fast mode, LEGENDARY efficiency
 **Time**
@@ -179,19 +192,29 @@ Roguelike mode surfaces these patterns explicitly. Flow mode lets them compound
 **Ultrathink**
 - 🔮 Spell Cast — Used extended thinking during the run
-- 🧠 Calculated Risk — Ultrathink + LEGENDARY efficiency
+- 🧮 Calculated Risk — Ultrathink + LEGENDARY efficiency
 - 🌀 Deep Thinker — 3+ ultrathink invocations, completed under budget
-- 🤫 Silent Run — No extended thinking, SOLID or better *(think without thinking)*
-- 🤦 Hubris — Used ultrathink, busted anyway *(death achievement)*
+- 🤫 Silent Run — No extended thinking, SOLID or better
+- 🤦 Hubris — Used ultrathink, busted anyway *(death mark)*
 **Multi-model**
 - 🏹 Frugal — Haiku handled ≥50% of session cost
 - 🎲 Rogue Run — Haiku handled ≥75% of session cost
+- 🔷 Purist — Single model family throughout
+- 🦎 Chameleon — Multiple model families used, under budget
+- 🔀 Tactical Switch — Exactly 1 model switch, under budget
+- 🔒 Committed — No switches, one model family
+- ⚠️ Class Defection — Declared one class but cost skewed to another
+**Paladin planning ratio**
+- 🏛️ Architect — Opus handled >60% of cost (heavy planner)
+- 💨 Blitz — Opus handled <25% of cost (light plan, fast execution)
+- ⚖️ Equilibrium — Opus/Sonnet balanced at 40–60%
 **Rest & recovery**
-- ⚡ No Rest for the Wicked — Completed in one session
+- 🔥 No Rest for the Wicked — Completed in one session
 - 🏕️ Made Camp — Completed across multiple sessions
-- 💪 Came Back — Fainted (hit usage limits) and finished anyway
+- 🧟 Came Back — Fainted (hit usage limits) and finished anyway
 **Context management (gear)**
 - 📦 Overencumbered — Context auto-compacted during run
@@ -199,16 +222,16 @@ Roguelike mode surfaces these patterns explicitly. Flow mode lets them compound
 - 🪶 Ultralight — Manual compact at ≤40% context
 - 🥷 Ghost Run — Manual compact at ≤30% context
-**Tool reliability** *(requires PostToolUseFailure hook)*
+**Tool reliability**
 - ✅ Clean Run — Zero failed tool calls (≥5 total)
 - 🐂 Stubborn — 10+ failed tool calls, still won
-**Subagents** *(requires SubagentStart hook)*
+**Subagents**
 - 🐺 Lone Wolf — Completed with no subagents spawned
 - 📡 Summoner — 5+ subagents spawned
 - 🪖 Army of One — 10+ subagents, under 50% budget used
-**Turn discipline** *(requires Stop hook)*
+**Turn discipline**
 - 🤖 Agentic — 3+ Claude turns per user prompt
 - 🐕 Obedient — Exactly one turn per prompt (≥3 prompts)
@@ -218,60 +241,153 @@ Roguelike mode surfaces these patterns explicitly. Flow mode lets them compound
 - 🔨 Tool Happy — Died with 30+ tool calls
 - 🪦 Silent Death — Died with ≤2 prompts
 - 🤡 Fumble — Died with 5+ failed tool calls
+- 🎲 Indecisive — 3+ model switches
+- 🍷 Expensive Taste — Over $0.50 per prompt
+</details>
 ---
 ## Live HUD
-After `tokengolf install`, a status line appears in every Claude Code session:
+After `tokengolf install`, a status line appears in every Claude Code session showing quest, cost, efficiency, context load, and model class.
-- **tier emoji** (💎🥇🥈🥉💸) updates live as cost accumulates
-- **🪶 green** at 50–74% context — traveling light
-- **🎒 yellow** at 75–89% context — getting heavy
-- **📦 red** at 90%+ context — overencumbered, consider compacting
-- **💤** instead of ⛳ if the previous session fainted (hit usage limits)
-- Roguelike runs show floor progress; Flow runs omit budget/efficiency
+```
+██ ⛳ add pagination to /users  $0.54/1.50 ▓▓▓▓░░░░░░░ 36%  EFFICIENT  ⚔️ Sonnet  F2/5
-Run `tokengolf demo` to see all HUD states rendered in your terminal:
+██ ⛳ refactor auth middleware  $0.82/4.00 ▓▓░░░░░░░░░ 21%  LEGENDARY  🧙 Opus  F3/5
+██ 🧠 ▓▓▓▓▓░░░░░ 52% 🪶
+```
-![TokenGolf HUD demo — all game states](assets/demo-hud.png)
+Budget bar turns red above 75%. Context bar (line 2) appears at 50%+: **🪶** green · **🎒** yellow · **📦** red. Accent `██` turns red in danger. **💤** replaces ⛳ when fainted.
 ---
 ## Auto Scorecard
-When you `/exit` a Claude Code session, the scorecard appears automatically — cost, model breakdown, achievements, ultrathink stats, tool usage.
+When you `/exit`, the scorecard appears automatically — cost, model breakdown, achievements, tool usage.
-<p align="center">
-  <img src="assets/scorecard.png" alt="TokenGolf scorecard — session complete" width="620" />
-</p>
+```
+██  🏆  SESSION COMPLETE
+██  add pagination to /users endpoint
+██  ──────────────────────────────────────────────────
+██  SPENT      BUDGET    USED    MODEL        TIER
+██  $0.2300    $1.50     15%     ⚔️ Sonnet     🥇 Gold
+██
+██  🌟 LEGENDARY
+██  ──────────────────────────────────────────────────
+██  Achievements unlocked:
+██   🎯 Sniper — Under 25% budget
+██   🥈 Silver — Completed with Sonnet
+██   🔥 No Rest for the Wicked — One session
+██   ✅ Clean Run — Zero failed tool calls
+██   🧰 Toolbox — 5+ distinct tool types
+██   🤫 Silent Run — No extended thinking
+██  ──────────────────────────────────────────────────
+██  Model usage:  🏹 17% Haiku
+██  Sonnet 83% $0.19   Haiku 17% $0.04
+██  ──────────────────────────────────────────────────
+██  Tool calls:
+██  Read ×8  Edit ×4  Bash ×3  Grep ×2  Glob ×1
+```
-*Real scorecard from building TokenGolf itself. $40 flow session, 27 ultrathink invocations, Class Defection for declaring Sonnet but spending 89% on Opus. The game plays you back.*
+```
+██  💀  BUDGET BUSTED
+██  migrate postgres schema to v3
+██  ──────────────────────────────────────────────────
+██  $3.8700  /$1.50  258%  ⚔️ Sonnet  💸 Reckless
+██  ──────────────────────────────────────────────────
+██  🤦 Hubris   💥 Blowout   🤡 Fumble   🔨 Tool Happy
+██  ──────────────────────────────────────────────────
+██  Cause of death: Budget exceeded by $2.37
+██  Tip: Use Read with line ranges instead of full file reads.
+```
 ---
-## Hooks
+<details>
+<summary><strong>Ultrathink</strong></summary>
+Write `ultrathink` anywhere in your prompt to trigger extended thinking mode. It's not a slash command — just say it in natural language:
+> *"ultrathink: is this the right architecture before I write anything?"*
+> *"can you ultrathink through the tradeoffs here?"*
+Extended thinking tokens are billed at full output rate. A single ultrathink on Sonnet can cost $0.50–2.00 depending on problem depth. TokenGolf detects thinking blocks from your session transcripts and tracks invocations and estimated thinking tokens — both show on your scorecard.
+**The skill is knowing when to ultrathink.** One expensive deep-think that prevents five wrong turns is efficient. Ultrathinking every prompt when you're at 80% budget is hubris.
+</details>
+<details>
+<summary><strong>The Meta Loop</strong></summary>
+The dungeon crawl framing maps directly to real session behaviors:
+- **Overencumbered** = context bloat slowing you down
+- **Made Camp** = hit usage limits, came back next session
+- **Ghost Run** = surgical context management before the boss
+- **Hubris** = reached for ultrathink on a tight budget and paid for it
+- **Silent Run** = solved it with pure prompting discipline, no extended thinking needed
+- **Lone Wolf** = didn't spawn a single subagent; held the whole problem in one context
+- **Agentic** = gave Claude the wheel and it ran with it — 3+ turns per prompt
+Roguelike mode surfaces these patterns explicitly. Flow mode lets them compound over time. The meta loop: **roguelike practice makes Flow sessions better. Better Flow = lower daily spend = better scores without even trying.**
+</details>
+<details>
+<summary><strong>Budget presets (model-calibrated)</strong></summary>
+The wizard shows different amounts depending on your class — same relative difficulty, different absolute cost. Anchored to the ~$0.75/task Sonnet average from Anthropic's $6/day Claude Code benchmark.
+| Tier | Haiku 🏹 | Sonnet ⚔️ | Opus 🧙 | Feel |
+|------|---------|---------|--------|------|
+| 💎 Diamond | $0.15 | $0.50 | $2.50 | Surgical micro-task |
+| 🥇 Gold | $0.40 | $1.50 | $7.50 | Focused small task |
+| 🥈 Silver | $1.00 | $4.00 | $20.00 | Medium task |
+| 🥉 Bronze | $2.50 | $10.00 | $50.00 | Heavy / complex |
+| ✏️ Custom | any | any | any | Set your own bust threshold |
+These are **bust thresholds** — your commitment. Efficiency tiers derive as percentages of whatever you commit to.
+</details>
+<details>
+<summary><strong>Why "TokenGolf"?</strong></summary>
-Nine hooks installed via `tokengolf install`:
+[Code golf](https://en.wikipedia.org/wiki/Code_golf) is the engineering practice of solving a problem in as few characters (or lines, or bytes) as possible. The constraint isn't the point — the *discipline the constraint creates* is the point. Writing the shortest possible solution forces you to understand the problem deeply and use your tools precisely.
+Token golf is the same idea applied to AI sessions. Your budget is par. Every unnecessary prompt, every redundant context dump, every "can you also..." tacked onto a request is a stroke over par. The game doesn't literally resemble golf — it borrows the concept: **optimize under constraint, measure your score, improve your game.**
+</details>
+<details>
+<summary><strong>Hooks (9 total)</strong></summary>
+All installed via `tokengolf install`:
 | Hook | When | What it does |
 |------|------|-------------|
-| `SessionStart` | Session opens | Injects quest/budget/floor into Claude's context. Auto-creates Flow run if none active. Increments session count for multi-session runs. |
+| `SessionStart` | Session opens | Injects quest/budget/floor into Claude's context. Auto-creates Flow run if none active. |
 | `PostToolUse` | After every tool | Tracks tool usage by type. Fires budget warning at 80%. |
 | `PostToolUseFailure` | After a tool error | Increments `failedToolCalls` — powers Clean Run, Stubborn, Fumble. |
 | `UserPromptSubmit` | Each prompt | Counts prompts. Injects halfway nudge at 50% budget. |
 | `PreCompact` | Before compaction | Records manual vs auto compact + context % — powers gear achievements. |
-| `SessionEnd` | Session closes | Scans transcripts for cost + ultrathink blocks, saves run, displays ANSI scorecard. Detects Fainted if session ended unexpectedly (usage limit hit). |
+| `SessionEnd` | Session closes | Scans transcripts for cost + ultrathink, saves run, displays scorecard. |
 | `SubagentStart` | Subagent spawned | Increments `subagentSpawns` — powers Lone Wolf, Summoner, Army of One. |
 | `Stop` | Claude finishes a turn | Increments `turnCount` — powers Agentic, Obedient. |
 | `StatusLine` | Continuously | Live HUD with cost, tier, efficiency, context %, model class. |
----
+</details>
-## State
+<details>
+<summary><strong>State</strong></summary>
 All data lives in `~/.tokengolf/`:
 - `current-run.json` — active run
 - `runs.json` — completed run history
 No database, no native deps, no compilation.
+</details>
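
The two Scoring tables in the README diff above reduce to threshold lookups: the efficiency rating depends on the share of the budget used, and the spend tier depends on absolute cost. The sketch below is a hedged illustration with hypothetical function names. The tables leave the exact boundary values unspecified (for example, spending exactly 100% of the budget), so treating 100% as CLOSE CALL is an assumption based on the README's "go over budget = permadeath" wording.

```javascript
// Efficiency rating from percentage of budget used (roguelike mode).
function efficiencyRating(spent, budget) {
  const pct = (spent / budget) * 100;
  if (pct < 25) return 'LEGENDARY';
  if (pct < 50) return 'EFFICIENT';
  if (pct < 75) return 'SOLID';
  if (pct <= 100) return 'CLOSE CALL'; // assumption: exactly 100% is not a bust
  return 'BUSTED';
}

// Spend tier from absolute cost in dollars, shown on every scorecard.
function spendTier(spent) {
  if (spent < 0.1) return 'Diamond';
  if (spent < 0.3) return 'Gold';
  if (spent < 1.0) return 'Silver';
  if (spent < 3.0) return 'Bronze';
  return 'Reckless'; // assumption: exactly $3.00 falls here
}

// Figures taken from the two sample scorecards in the README diff.
console.log(efficiencyRating(0.23, 1.5), spendTier(0.23));
console.log(efficiencyRating(3.87, 1.5), spendTier(3.87));
```

Both sample scorecards are consistent with these thresholds: $0.23 of a $1.50 budget is about 15% and lands in the Gold tier, while $3.87 is 258% of budget and lands in Reckless.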