npm - jeo-code - Versions diffs - 0.5.9 → 0.5.12 - Mend

jeo-code 0.5.9 → 0.5.12

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (13) hide show

package/CHANGELOG.md +280 -0
package/README.ja.md +3 -3
package/README.ko.md +3 -3
package/README.md +3 -3
package/README.zh.md +3 -3
package/package.json +2 -1
package/src/agent/engine.ts +19 -3
package/src/agent/json.ts +105 -25
package/src/commands/launch.ts +35 -2
package/src/commands/whats-new.ts +62 -0
package/src/tui/app.ts +28 -9
package/src/tui/components/transcript.ts +44 -14
package/src/util/whats-new.ts +272 -0

package/CHANGELOG.md ADDED Viewed

@@ -0,0 +1,280 @@
+# Changelog
+All notable changes to **jeo-code** are documented here.
+The format follows [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
+and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+The README mirrors the latest 5 entries — regenerate with `bun run changelog:sync`.
+## [0.5.12] - 2026-06-15
+_Yellow status animation while a process runs, and elapsed `(Nms)` on every completed tool card._
+### Added
+- The live status animation (spinner + activity gradient) turns amber/yellow while a tool/process is executing — "the agent is running/verifying a process" now reads at a glance, distinct from the cool thinking gradient (gjc `theme.fg("warning")` parity).
+- Every completed tool card and light-tool ledger line now shows how long the tool ran, dim after the ✓/✗ glyph — `✓ Bash · (8ms)`, `✓ Read x.ts · (3ms)` (gjc duration-detail parity; sub-second as `Nms`, else `N.Ns`/`Nm Ns`).
+## [0.5.11] - 2026-06-15
+_Backspace on an empty prompt line no longer quits jeo._
+### Fixed
+- Pressing Backspace with an empty input line could terminate the process on some Bun builds: an empty-line Backspace is a no-op edit, but those builds turn it into a spurious readline `close`, which the REPL treats as a hard exit. The multi-line input filter now swallows a standalone Backspace (DEL `0x7f` / BS `0x08`) when the line buffer is already empty, so the byte never reaches readline and the close can't fire. Backspace with text in the buffer still deletes normally, and Ctrl-C / paste / Enter are untouched.
+## [0.5.10] - 2026-06-15
+_`/resume` transcript no longer dumps raw JSON for batched tool calls._
+### Fixed
+- Resuming a session that contains a BATCHED tool step (`{reasoning, tools:[…]}`) printed the raw JSON object into the transcript instead of a readable history. `formatTranscript` only recognized the single-call `{tool,arguments}` shape; the batch shape parsed to no `tool` field, fell through to the prose branch, and dumped the JSON. It now renders one compact `✔/✗ <tool> — <result>` ledger line per batched call (verdicts parsed in call order from the combined `Tool [x] result` message), matching how single calls already render. `/resume` and `/history` both go through this path, so both are fixed.
+## [0.5.9] - 2026-06-15
+_Bounded per-frame wrap for the live thinking/tool-output blocks — re-render cost no longer grows with stream length._
+### Changed
+- The live "thinking" and tool-output tail blocks only ever DISPLAY their last 6/8 wrapped rows, but they accumulate the whole step's text and were re-wrapping the FULL string on every 120ms repaint — so per-frame CPU and GC churn grew linearly with how much had streamed (a long reasoning trace or a chatty tool can reach hundreds of KB). The wrap input is now bounded to a fixed 16 KiB trailing window (`tailForWrap`), capping per-frame work at O(window) regardless of total size. The visible tail is byte-identical for normal multi-line content; an unbroken multi-hundred-KB blob still shows its genuine end. No memory leak existed (every live buffer — StreamRegion, ToolList, activityLog, forgeSummaries — is already ring-capped and the per-turn TUI instance is GC'd); this removes the one remaining O(stream-length) hot path in the repaint loop.
+## [0.5.8] - 2026-06-15
+_Native Opik observability for the turn loop (opt-in `JEO_OPIK`, pure-TS no-op when unset) + autopilot convergence tracking._
+### Added
+- Native Opik observability for the agent turn loop (opt-in via `JEO_OPIK`): each turn becomes one Opik trace, each step/tool a span, with token usage and the `completed` / `verified` / `efficiency` eval scores attached. Implemented in pure TypeScript over `fetch` — no Python, no `opik` npm package — per jeo's zero-native-dependency constraint. Hard invariants: an unset `JEO_OPIK` is a complete no-op (zero Opik HTTP calls); no tracer error ever escapes an events callback; the API key travels only in the `Authorization` header; and engine output is identical regardless of tracing outcome.
+### Changed
+- Autopilot convergence tracking: a "keep" (a provable min/max improvement or a gate pass) now counts as forward progress, and anything else extends the no-progress streak toward the convergence cap.
+## [0.5.7] - 2026-06-15
+_`/model` picker is default-only, `/clear` resets to the initial screen, ESC clears the input box, and a launch process-listener leak is fixed._
+### Changed
+- The `/model` model-action picker now offers ONLY the DEFAULT section: "Set model only" plus the default thinking levels. The per-role rows ("Set as EXECUTOR/PLANNER/ARCHITECT/CRITIC") and the "Apply OpenAI Codex role preset" entry are gone — role model and thinking are configured exclusively in `/agents` (and `/agents edit`). The default heading now points there (`roles → /agents`). This completes the 0.5.6 split: `/model` owns the default, `/agents` owns the roles. Dead helpers (`orderedModelRoles`, `applyOpenAiCodexRolePreset`, `CORE_MODEL_ACTION_ROLE_ORDER`) were removed with the menus.
+### Added
+- `/clear` now returns to the initial screen (fresh session view) and ESC clears the input box, matching the expected reset gestures.
+### Fixed
+- Launch no longer leaks process listeners: the prompt-scoped stdin `data`/`keypress` and stdout `resize` handlers are now tracked and drained on every exit path, so repeated `launch()` (e.g. the test harness) no longer accumulates listeners past Node's 10-listener default (`MaxListenersExceededWarning` + a real leak).
+## [0.5.6] - 2026-06-15
+_`/model` sets only the default thinking; per-role reasoning moved to `/agents`._
+### Changed
+- Thinking ownership split: `/model` now configures reasoning for the DEFAULT agent only, and per-role (subagent) thinking is owned by `/agents`. In the model-action picker the role rows offer just "Set model only" (the per-level thinking rows are gone), `/model subagent <role> thinking <level>` redirects to `/agents` instead of saving, and picking a role's model no longer prompts for a reasoning level. Set role reasoning via `/agents <role> thinking <level|inherit>` or the `/agents edit` picker. `/model thinking <level>` still sets the default, so nothing the default knob did is lost.
+## [0.5.5] - 2026-06-15
+_Full multi-line visibility — the input box scrolls to the caret and the submitted card shows every line._
+### Fixed
+- Full multi-line visibility for the multi-line input added in 0.5.4: the input box now scrolls so the caret row always stays in view as you move through a long draft (`…` markers flag rows hidden above/below, so no line is unreachable), and a submitted multi-line query renders ALL of its wrapped lines in the user card — the card lives in scrollback rather than the bounded live frame, so nothing is truncated.
+## [0.5.4] - 2026-06-15
+_Reliable multi-line input is ON by default — a paste fills the box and submits as one message._
+### Changed
+- Multi-line input is now enabled by default on any interactive TTY (no flag needed): a bracketed paste arrives as ONE buffer, fills the input box, and submits as a single message instead of being split into one message per line. The prompt's stdin is routed through a filter that rewrites line breaks to a private-use sentinel before `node:readline` sees them, avoiding the per-line submit/race. The lone-`\n` Shift+Enter rule stays opt-in via `JEO_MULTILINE=1` (it needs ghostty's `keybind = shift+enter=text:\n` and could misfire on terminals that send LF for Enter), and `JEO_NO_MULTILINE=1` fully disables the filter (reads stdin directly, exactly as before).
+## [0.5.3] - 2026-06-15
+_`$` chains multiple skills in one line (all run, in order), plus multi-line prompt input — paste-merge and gated Shift+Enter._
+### Added
+- Multi-skill `$` chaining: a leading run of `$skill` tokens now invokes every resolved skill in order, sharing the trailing text as one intent — `$ralplan $team build the auth flow` runs ralplan then team, both with intent "build the auth flow". A lone `$skill` is just a chain of one, so existing single invocations are unchanged. Prefixes still resolve (`$te $ultra …`), `$UPPERCASE` env-var tokens end the chain and pass through to the model, and unknown tokens are collected so several typos report together (`No skills: $nope, $bad.`) instead of one at a time. Applies to both the interactive prompt and one-shot `-p`/`/skill:` runs.
+- Multi-line paste merges into ONE message: a pasted block with embedded newlines is submitted as a single prompt instead of being split into one message per line.
+- Shift+Enter multi-line input (opt-in via `JEO_MULTILINE=1`): Shift+Enter inserts a real line break instead of submitting. Terminal-specific Shift+Enter encodings (ghostty legacy `\x1b[27;2;13~`, kitty `\x1b[13;2u`, and a lone `\n`) are rewritten to a private-use sentinel before `node:readline` sees them, so it works through tmux (extended-keys off) without mangling; default OFF reads stdin exactly as before.
+### Changed
+- The one-shot skill path was refactored to a shared `runOneSkillShot` runner so the single-invocation and chain paths execute identically (bundle workflow → engine; regular skill → agent turn).
+## [0.5.2] - 2026-06-14
+_`$skill` prompt invocation with prefix/fuzzy suggestions, and a per-session input-box hue (amber in cmd-mode)._
+### Added
+- `$skill` prompt invocation: typing `$<name>` runs a bundled/loaded skill directly. A unique prefix resolves precisely (`$te` → `$team`); an ambiguous prefix lists candidates (`Ambiguous skill '$te'. Did you mean: $team, …`); and an unknown `$word` lists the available skills (prefix-first, then fuzzy-subsequence) instead of silently sending the typo to the model. `$UPPERCASE` env-var-style tokens (e.g. `$HOME`) pass through untouched.
+- Per-session input-box border hue: each newly opened jeo session gets a distinct border color so several concurrent sessions are tellable apart at a glance, and cmd-mode (`!`) overrides it with a caution amber so entering the shell escape is unmistakable.
+## [0.5.1] - 2026-06-14
+_cmd-mode `!<command>` shell escape — run a shell command without engaging the agent._
+### Added
+- gjc-parity cmd-mode shell escape: typing `!<command>` at the prompt runs the command directly in the shell and prints its output WITHOUT engaging the agent or touching conversation history (a REPL-style shell escape). Because the user is explicitly driving their own shell, the deep-interview mutation guard — which gates the AGENT's tools — does not apply; bare `!` prints a short usage hint.
+## [0.5.0] - 2026-06-14
+_Performance: workspace-scan, workflow-state, and DNA-Claw HUD caches; plus a credential-safety fix that never wipes OAuth over an invalid config._
+### Fixed
+- A schema-invalid `config.json` no longer wipes stored OAuth / provider credentials: the invalid config falls back to defaults while the credential block is preserved, so a bad config edit can't silently sign you out of every provider.
+### Changed
+- Workspace-tree scan is now LRU-cached per resolved cwd (cap 32), so a long session that touches many trees (subagents, worktrees, cross-tree `/view`) neither re-scans the same directory nor grows the cache without bound.
+- Per-skill workflow-state JSON reads are mtime + size-validated cached. The mutation guard reads the workflow lock before every mutating tool, so re-reading and re-parsing each call was wasteful; the cache stays cross-process-safe — a write from another process bumps mtime/size and invalidates it, so an active interview lock can never be missed.
+- DNA Claw HUD frames are memoized (LRU-capped) keyed on every input that affects output (grand/unicode/cols/color/level/phase/frame). The live HUD cycles a fixed ~60-frame set at ~120ms, so the 2nd+ cycle is now O(1) lookups instead of recomputing per-line ANSI gradients — lower steady-state HUD CPU.
+## [0.4.9] - 2026-06-14
+_Live-frame width-clamp (content-sized height) replaces the constant-height approach, typed text shows during a running turn, and a docs/AGENTS refresh._
+### Fixed
+- Typed text now appears in the input box DURING a running turn — keystrokes entered mid-turn render live in the prompt box (and as a pending steering card) instead of staying invisible until the turn ends.
+- Live-frame anchor drift: 0.4.8's constant-height padding could grow the reserve and drift the cursor anchor by one row, reintroducing a duplicate model bar mid-turn. The live frame is content-sized again, and every rendered line is width-clamped to the terminal width so a long line (e.g. the model bar with a deep cwd) can't soft-wrap into a second physical row and desync the differential renderer's 1-line = 1-row accounting — keeping completed cards visible in scrollback above the live frame.
+- Renderer `reset()` → `insertAbove()` ordering now erase-line-clears the old frame rows the inserted block did not cover (`occupied = max(prev, coverRows)`), closing the remaining duplicate-model-bar / orphaned-border case.
+### Changed
+- Regenerated every directory `AGENTS.md` guide and pruned stale working docs (the rolling improvements log, promo assets, and one-off review/analysis notes) so the tracked docs reflect the current tree.
+## [0.4.8] - 2026-06-14
+_Live-frame stability: constant-height live turn, renderer self-heal off-by-one fix, and frame-safe child-stdout sanitizing — no more duplicate model bar or torn escapes._
+### Fixed
+- The live turn now renders at a CONSTANT height: the in-flight tool-output / thinking block reserves a fixed row count (bottom-anchored, blank-padded at the top) and the whole frame is padded to exactly the terminal's rows. Streaming stdout growth no longer thrashes the frame height every 100ms — the height change that desynced the differential renderer and duplicated the model bar is gone.
+- Renderer self-heal reset now remembers how many rows are physically on screen (`coverRows`), so a repaint of a SHORTER frame erase-line-clears the rows it no longer covers — fixing the persistent off-by-one that left a duplicate model bar / orphaned borders after a reset.
+- Raw child stdout is sanitized before entering the live frame (`sanitizeForFrame`): carriage returns, erase-line/cursor-move escapes, OSC sequences, and incomplete trailing escapes are stripped (SGR color kept) so a streaming `bun test`'s `\r\x1b[2K` progress lines can no longer tear the renderer's own `\x1b[2K` (printing a literal "2K") or hijack the cursor.
+## [0.4.7] - 2026-06-14
+_Detached subagents + `subagent` control tool, live shaded in-flight output, registry-driven providers, fuller `read` budget, styled italics in the final report, and `gjc` retired._
+### Added
+- Detached subagents: `task {detached:true}` launches a background subagent and returns immediately; a new `subagent` control tool lists, inspects, awaits (optionally bounded), and cancels them (gjc parity, in-process turn-scoped registry — `cancelAll()` on teardown prevents background-promise leaks).
+- Live shaded in-flight output: the running tool's stdout (bash) and native thinking deltas stream as a DIMMED bounded block above the status line, then flush UN-dimmed into scrollback once the model commits — gjc's "shaded until complete" effect.
+- Update-check disk cache (`~/.jeo`): the update banner is instant from cache with a background refresh, and clears itself after an interim upgrade.
+### Changed
+- Provider registry bootstrap: `register-providers.ts` registers every built-in adapter; `model-manager` resolves adapters through the registry alone and no longer imports or names concrete providers — new built-ins register in one place.
+- `read` default (no `lineRange`) now fills the model-visible output budget with WHOLE lines instead of a fixed 500-line cap, so a single read returns more of a file and forces far less needless pagination.
+- Tool-output handling (model-visible budget, both-ends truncation, recoverable artifact spilling) extracted from `engine.ts` into `tool-output.ts`; `engine.ts` re-exports for compatibility.
+- Final-report markdown: single `*italic*` / `_italic_` is now styled (list-bullet- and snake_case-safe), and a heading that follows content gets one blank line of breathing room above it.
+- Mid-turn steering query now renders as a `user` card in scrollback; per-theme `userCard` palette and todo-card rendering refinements.
+### Removed
+- The `gjc` command and the bundled `gjc` skill — the skills catalog now ships five workflows (deep-interview, deep-dive, ralplan, team, ultragoal).
+## [0.4.6] - 2026-06-14
+_Width-correct forge cards for CJK/emoji, red borders on failed tool cards, aligned `ooo ralph` monitor HUD, and a per-theme user-card palette._
+### Fixed
+- Forge tool cards no longer tear their right border when the body contains CJK/Hangul/emoji: content wraps by DISPLAY width (wide glyphs count 2 columns) instead of code-point count, so a Korean line that previously rendered ~2× wide now stays inside the card at every width.
+- The `ooo ralph` monitoring HUD box borders stay flush: every row is padded by display width (ANSI-aware) and the box auto-sizes to its widest row, replacing `String.padEnd` on colored strings that counted SGR escape bytes and spilled the right edge.
+### Changed
+- Failed forge tool cards now render with a red border (gjc-style state-encoded border) so failures pop out of scrollback at a glance; successful/neutral cards keep the theme accent identity.
+- Every built-in theme now ships a `userCard` palette (accent/border/shadow/fill) for the mid-turn steering user card.
+- `describeModel`/`resolveModelId` accept an already-read config to skip a redundant global-config read on the turn hot path.
+## [0.4.5] - 2026-06-14
+_First-class filesystem make/remove tools._
+### Added
+- `mkdir {dirPath}` tool: create a directory (parents included, idempotent) as a first-class tool instead of shelling out to `bash` — honors the deep-interview mutation lock and prefix-restricted roles.
+- `delete {path, recursive?}` tool: remove a file (or a directory with `recursive:true`); refuses to wipe the working directory, treats a missing path as a soft error, and clears the file-freshness snapshot so a later write to the same path is not rejected as stale.
+### Changed
+- Read-only subagent lanes (planner/architect/critic) now also drop `mkdir`/`delete`, keeping review roles physically unable to mutate the repo.
+## [0.4.4] - 2026-06-13
+_Live subagent status mirroring, always-useful Ctrl+O activity tail, read lineRange crash guard._
+### Added
+- Per-turn activity-history ring (bounded at 200 plain-text entries): Ctrl+O now always answers "what has been happening" — the detail panel appends a timestamped `+N.Ns` recent-activity tail even before the first reply or tool detail exists.
+### Changed
+- The live status row now mirrors a delegated subagent's LATEST nested event (`EXECUTOR ✓ read src/…`) instead of a static `Task: executor …` title — a long `task` no longer reads as an opaque "calling model" stall.
+### Fixed
+- `read` no longer crashes with `spec.split is not a function` when the model passes a numeric/JSON `lineRange` (field bug, reproduced live twice): numbers are coerced, junk degrades to a polite selector error.
+## [0.4.3] - 2026-06-13
+_Readability pass for autopilot, subagent activity, and worked-history review._
+### Added
+- `jeo autopilot status` now renders a yellow ratchet status field with task, eval, score direction, keep/revert counts, patience, and the recommended next action.
+- `/history` transcript output now adds turn headers and folds the first tool-result line into each tool activity row, so scrollback reads as user → activity → jeo instead of raw protocol traffic.
+### Changed
+- Subagent activity lines now render as an `AGENT` tree (`▸ ROLE`, `├─ ROLE`, `└─ ROLE`) in both TUI scrollback and non-TTY progress output for faster scanning.
+- README command tables now call out `/history` and `autopilot` as first-class readable operation surfaces.
+- Removed the standalone `jeo models` command/menu path; model discovery and assignment now stay inside `/model`, `/provider`, and setup/doctor flows.
+## [0.4.2] - 2026-06-13
+_Thinking-loop termination guarantees (cycle guard + turn wall-clock budget), unboxed live status without step counters, self-contained `.jeo` namespace, live next-prompt input card, role-targeted model/thinking picker._
+### Added
+- Agent-loop cycle guard: an A↔B tool-call ping-pong (re-reading one file ↔ re-running one command forever) now gets ONE corrective bounce, then a hard stop — the "stuck in thinking" spin the exact-repeat guard could never see.
+- Turn wall-clock budget (`JEO_TURN_MAX_MS`, default 30 minutes, `0` disables): step budgets bound the COUNT of model calls, this bounds their total TIME — a turn that crosses it consolidates a wrap-up instead of spinning for hours.
+- Live next-prompt input box in the TUI — text typed during a running turn stays in the same query surface instead of a separate queued row.
+- jeo discovers skills from its own `~/.jeo/agent/skills` (+ project `.jeo/agent/skills`) and resolves hooks/rules under `.jeo` instead of referencing `.gjc`.
+- Config-driven custom subagent roles: a non-bundled id declaring `title`/`description`/`prompt` becomes a first-class role at runtime.
+- Ctrl+O mid-turn detail view: flush the full last reply + tool output into scrollback.
+- `/fast [on|off|status]` slash command: enables minimal/low reasoning fast mode only when the active model advertises support.
+- Task/team subagents now receive the same project context block as the parent agent, sourced from `JEO.md`, `AGENTS.md`, `.jeo/context.md`, `.agents/*`, and `.jeo/*` guidance — legacy `.gjc` context is not loaded.
+### Changed
+- Live status is UNBOXED: a flat `⠙ thinking · <live activity> ⟦esc⟧` row plus one dim metrics row replaces the bordered status box — the message is never trapped inside a border.
+- Removed meaningless `step N/M` counters everywhere (status row, footer, plain-stream `[step N/M]` headers, nested subagent lines) along with step-driven `eta`/`evo %`: the dynamic step budget keeps extending the denominator, so the counters carried no information. The evolution stage track stays.
+- Tool-call signature bookkeeping (repeat/cycle guards, step-budget novelty set) now stores fixed-size FNV digests instead of full JSON argument strings — a long turn's guard memory stays flat even when `write` calls embed whole file bodies.
+- Unified model targeting: `/model` can now set default thinking, pick a model, apply it to the default agent or any subagent role, and set that target's thinking level in one flow.
+- `/model` picker now shows DEFAULT/role badges with each target's thinking level, and the post-pick action menu uses the unified Set-as-role format plus an OpenAI Codex role preset.
+- `/model` action selection now uses a Ralph-style nested sub-list: each DEFAULT/role header expands into selectable thinking rows, so target and thinking are chosen in one TUI screen.
+- During a live reasoning turn, typed next-user text now renders as a styled pending `user` card with dark background while the normal input box remains editable.
+- Update availability now renders as a yellow full-width field instead of a boxed card, matching the status-field TUI treatment.
+- Removed the legacy `/models` slash-menu path; `/model` and `/provider` own interactive model selection.
+- Canonicalized runtime naming on `.jeo` and `JEO_` only.
+### Fixed
+- jeo can no longer sit in "thinking" forever: every turn now terminates via the cycle guard, the wall-clock budget, or the existing step/repeat/failure guards — pathological spins consolidate a wrap-up instead of running unbounded.
+- Ctrl-C now force-quits jeo immediately instead of being softened into an abort prompt.
+- Done-time todo reconciliation gate — stale Todos can no longer survive a finished turn.
+- MCP stdio framing for ralph tools.
+## [0.4.1] - 2026-06-12
+_TUI card parity polish + done-time todo reconciliation._
+### Added
+- gjc card parity: `⟦Ctrl+O for more⟧` clip hint, code highlighting in card bodies, and full tool output via Ctrl+O.
+### Fixed
+- Clip hint also covers summarize-stage markers.
+- Done-time todo reconciliation gate so a finished turn's checklist reflects what actually completed.
+## [0.4.0] - 2026-06-12
+_Verified TUI, resilient engine, batch input, multilingual docs._
+### Added
+- Bracketed-paste batch input — a multi-line paste runs one command per line, in order (prompt_toolkit paste contract).
+- jeo-ref transcript parity: Todo Write tree cards, line-numbered write previews, agent-name reasoning blocks, tree-style skill detail.
+- Seed writer/parser round-trip integrity with a freeze-time assert.
+- Slow-drip stream deadline + acceptance-criteria quality floor.
+### Fixed
+- Anthropic refusal recovery: context reset per the provider contract, neutral continuation note, OAuth/API-key guidance.
+- Model discovery repaired against live endpoints (codex `client_version`, gemini pagination).
+## [0.3.0] - 2026-06-02
+_OAuth credentials + local Ollama provider._
+### Added
+- OAuth login (`jeo auth`) and a local Ollama provider.
+- `jeo doctor`, multi-tool-call grouping, and CI/install hardening.
+## [0.2.1] - 2026-06-02
+_Setup and model configuration._
+### Added
+- `jeo setup` and `jeo models`; default Gemini 2.5-flash; verified real LLM turn.
+## [0.2.0] - 2026-06-02
+_Real LLM coding agent._
+### Added
+- Real LLM coding agent with provider + model configuration.
+## [0.1.0] - 2026-06-01
+_Initial release._
+### Added
+- Initial jeo-code agent and CLI.
+[Unreleased]: https://github.com/akillness/jeo-code/compare/v0.4.5...HEAD
+[0.4.5]: https://github.com/akillness/jeo-code/releases/tag/v0.4.5
+[0.4.4]: https://github.com/akillness/jeo-code/releases/tag/v0.4.4
+[0.4.3]: https://github.com/akillness/jeo-code/releases/tag/v0.4.3
+[0.4.2]: https://github.com/akillness/jeo-code/releases/tag/v0.4.2
+[0.4.1]: https://github.com/akillness/jeo-code/releases/tag/v0.4.1
+[0.4.0]: https://github.com/akillness/jeo-code/releases/tag/v0.4.0
+[0.3.0]: https://github.com/akillness/jeo-code/releases/tag/v0.3.0
+[0.2.1]: https://github.com/akillness/jeo-code/releases/tag/v0.2.1
+[0.2.0]: https://github.com/akillness/jeo-code/releases/tag/v0.2.0
+[0.1.0]: https://github.com/akillness/jeo-code/releases/tag/v0.1.0

package/README.ja.md CHANGED Viewed

@@ -150,11 +150,11 @@ CI は `.github/workflows/npm-publish.yml` で公開します — GitHub リリ
 ## 変更履歴 (Changelog)
 <!-- CHANGELOG:START (auto-generated from CHANGELOG.md — run `bun run changelog:sync`) -->
+- **[0.5.12]** (2026-06-15) — Yellow status animation while a process runs, and elapsed `(Nms)` on every completed tool card.
+- **[0.5.11]** (2026-06-15) — Backspace on an empty prompt line no longer quits jeo.
+- **[0.5.10]** (2026-06-15) — `/resume` transcript no longer dumps raw JSON for batched tool calls.
 - **[0.5.9]** (2026-06-15) — Bounded per-frame wrap for the live thinking/tool-output blocks — re-render cost no longer grows with stream length.
 - **[0.5.8]** (2026-06-15) — Native Opik observability for the turn loop (opt-in `JEO_OPIK`, pure-TS no-op when unset) + autopilot convergence tracking.
-- **[0.5.7]** (2026-06-15) — `/model` picker is default-only, `/clear` resets to the initial screen, ESC clears the input box, and a launch process-listener leak is fixed.
-- **[0.5.6]** (2026-06-15) — `/model` sets only the default thinking; per-role reasoning moved to `/agents`.
-- **[0.5.5]** (2026-06-15) — Full multi-line visibility — the input box scrolls to the caret and the submitted card shows every line.
 See [CHANGELOG.md](CHANGELOG.md) for the full history.
 <!-- CHANGELOG:END -->

package/README.ko.md CHANGED Viewed

@@ -150,11 +150,11 @@ CI는 `.github/workflows/npm-publish.yml`로 배포합니다 — GitHub 릴리
 ## 변경 이력 (Changelog)
 <!-- CHANGELOG:START (auto-generated from CHANGELOG.md — run `bun run changelog:sync`) -->
+- **[0.5.12]** (2026-06-15) — Yellow status animation while a process runs, and elapsed `(Nms)` on every completed tool card.
+- **[0.5.11]** (2026-06-15) — Backspace on an empty prompt line no longer quits jeo.
+- **[0.5.10]** (2026-06-15) — `/resume` transcript no longer dumps raw JSON for batched tool calls.
 - **[0.5.9]** (2026-06-15) — Bounded per-frame wrap for the live thinking/tool-output blocks — re-render cost no longer grows with stream length.
 - **[0.5.8]** (2026-06-15) — Native Opik observability for the turn loop (opt-in `JEO_OPIK`, pure-TS no-op when unset) + autopilot convergence tracking.
-- **[0.5.7]** (2026-06-15) — `/model` picker is default-only, `/clear` resets to the initial screen, ESC clears the input box, and a launch process-listener leak is fixed.
-- **[0.5.6]** (2026-06-15) — `/model` sets only the default thinking; per-role reasoning moved to `/agents`.
-- **[0.5.5]** (2026-06-15) — Full multi-line visibility — the input box scrolls to the caret and the submitted card shows every line.
 See [CHANGELOG.md](CHANGELOG.md) for the full history.
 <!-- CHANGELOG:END -->

package/README.md CHANGED Viewed

@@ -150,11 +150,11 @@ Required npm token permissions (repository secret `NPM_TOKEN`):
 ## Changelog
 <!-- CHANGELOG:START (auto-generated from CHANGELOG.md — run `bun run changelog:sync`) -->
+- **[0.5.12]** (2026-06-15) — Yellow status animation while a process runs, and elapsed `(Nms)` on every completed tool card.
+- **[0.5.11]** (2026-06-15) — Backspace on an empty prompt line no longer quits jeo.
+- **[0.5.10]** (2026-06-15) — `/resume` transcript no longer dumps raw JSON for batched tool calls.
 - **[0.5.9]** (2026-06-15) — Bounded per-frame wrap for the live thinking/tool-output blocks — re-render cost no longer grows with stream length.
 - **[0.5.8]** (2026-06-15) — Native Opik observability for the turn loop (opt-in `JEO_OPIK`, pure-TS no-op when unset) + autopilot convergence tracking.
-- **[0.5.7]** (2026-06-15) — `/model` picker is default-only, `/clear` resets to the initial screen, ESC clears the input box, and a launch process-listener leak is fixed.
-- **[0.5.6]** (2026-06-15) — `/model` sets only the default thinking; per-role reasoning moved to `/agents`.
-- **[0.5.5]** (2026-06-15) — Full multi-line visibility — the input box scrolls to the caret and the submitted card shows every line.
 See [CHANGELOG.md](CHANGELOG.md) for the full history.
 <!-- CHANGELOG:END -->

package/README.zh.md CHANGED Viewed

@@ -150,11 +150,11 @@ CI 通过 `.github/workflows/npm-publish.yml` 发布 — GitHub 发布 release
 ## 更新日志 (Changelog)
 <!-- CHANGELOG:START (auto-generated from CHANGELOG.md — run `bun run changelog:sync`) -->
+- **[0.5.12]** (2026-06-15) — Yellow status animation while a process runs, and elapsed `(Nms)` on every completed tool card.
+- **[0.5.11]** (2026-06-15) — Backspace on an empty prompt line no longer quits jeo.
+- **[0.5.10]** (2026-06-15) — `/resume` transcript no longer dumps raw JSON for batched tool calls.
 - **[0.5.9]** (2026-06-15) — Bounded per-frame wrap for the live thinking/tool-output blocks — re-render cost no longer grows with stream length.
 - **[0.5.8]** (2026-06-15) — Native Opik observability for the turn loop (opt-in `JEO_OPIK`, pure-TS no-op when unset) + autopilot convergence tracking.
-- **[0.5.7]** (2026-06-15) — `/model` picker is default-only, `/clear` resets to the initial screen, ESC clears the input box, and a launch process-listener leak is fixed.
-- **[0.5.6]** (2026-06-15) — `/model` sets only the default thinking; per-role reasoning moved to `/agents`.
-- **[0.5.5]** (2026-06-15) — Full multi-line visibility — the input box scrolls to the caret and the submitted card shows every line.
 See [CHANGELOG.md](CHANGELOG.md) for the full history.
 <!-- CHANGELOG:END -->

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "jeo-code",
-  "version": "0.5.9",
+  "version": "0.5.12",
   "description": "Clean, highly optimized AI coding agent using spec-first loop",
   "type": "module",
   "main": "src/cli.ts",
@@ -12,6 +12,7 @@
     "scripts/install.sh",
     "scripts/uninstall.sh",
     "tsconfig.json",
+    "CHANGELOG.md",
     "README.md",
     "README.ko.md",
     "README.ja.md",

package/src/agent/engine.ts CHANGED Viewed

@@ -79,6 +79,8 @@ export const TOOL_PROTOCOL = [
   "Alternatively, you may batch up to 6 independent calls in a single turn using the following format:",
   '{ "reasoning": "<one short sentence>", "tools": [{ "tool": "<name>", "arguments": { ... } }, ...] }',
   "Batch only independent calls; NEVER batch 'done', and NEVER put a mutating tool (write/edit/bash) after another mutating tool in one batch whose inputs depend on the earlier one.",
+  "Tool calibration: scale calls to difficulty — one for a known fact, a few for a normal task, more only when evidence is genuinely missing. Locate before you open: search/find first, then read the hit, instead of guessing paths.",
+  "web_search reflex: if the request hinges on a name, version, library, or event you do not actually recognize, search before answering instead of guessing; never claim a result's absence proves nonexistence.",
 ].join("\n");
 /** Restricted protocol for read-only subagent roles (planner/architect/critic):
@@ -109,25 +111,39 @@ export const WORKING_DISCIPLINE = [
   "- Correctness first, maintainability second, brevity third. Prefer boring, explicit code.",
   "- Never present partial work as complete; never suppress tests or warnings to make code pass.",
   "- Never fabricate tool results or test outcomes; verification claims must match what was actually run.",
+  "- Don't assume disk/state matches expectations or that a referenced file exists — read to verify first.",
+  "- Don't fabricate API/library surfaces from memory; check the source or --help for unfamiliar APIs.",
   "- Never ship stubs, placeholders, or TODO-only code as a delivered feature.",
   "- Never substitute the requested problem with an easier adjacent one.",
   "- Update directly affected callsites, tests, and docs — or state why they are unchanged.",
   "- Reuse existing patterns; parallel conventions are prohibited. Fix problems at their source.",
   "- You are not alone in the repository: treat unexpected changes as user work; never revert or delete them.",
-  "- Re-read before acting if a tool fails or a file may have changed.",
+  "- Trust tool output as truth, but re-read/re-run if a tool fails, a file changed, or output looks stale or self-contradictory.",
   "- Prefer dedicated tools over shell pipelines: read (not cat), search (not grep), edit (not sed).",
 ].join("\n");
+/** Reply discipline (FABLE-5 tone + gjc communication/soul): shapes the agent's
+ *  user-facing prose. Injected into the interactive + executor system prompts only;
+ *  read-only subagents carry their own output contracts. */
+export const OUTPUT_DISCIPLINE = [
+  "Reply discipline:",
+  "- Lead with the answer or result; no preamble, no progress narration, no restating the task.",
+  "- Default to tight prose; use headers/bullets/tables ONLY when the content is genuinely multi-part or the user asked — never bullet a one-idea answer.",
+  "- Report only what is done or in progress; never announce future work instead of doing it.",
+  "- Match reply length to the task: a one-line change gets a one-line report.",
+].join("\n");
 export function executorSystemPrompt(
   role = "Executor Agent, a senior software developer",
   protocol: string = TOOL_PROTOCOL,
-  verificationDirective = "Always verify (run tests / execute the program) before calling done.",
+  verificationDirective = "Before calling done, self-check: did I run the test or command that exercises this change, are directly-affected callsites/tests/docs updated, and does my claim match real output? If any answer is no, keep working — do not call done.",
 ): string {
   return (
     `You are the ${role}.\n` +
     `Accomplish the user's request by calling tools and verifying your work.\n\n` +
     `${protocol}\n\n` +
     `${WORKING_DISCIPLINE}\n\n` +
+    `${OUTPUT_DISCIPLINE}\n\n` +
     verificationDirective
   );
 }
@@ -503,7 +519,7 @@ export async function runAgentLoop(history: Message[], opts: AgentLoopOptions):
     let invocation: any;
     try {
-      invocation = extractJsonObject<any>(responseText);
+      invocation = extractJsonObject<any>(responseText, { preferKeys: ["tool", "tools"] });
     } catch (err) {
       ev.onAssistant?.(responseText, null);
       // Prose salvage: a reply with no JSON object at all is a chat-style final

package/src/agent/json.ts CHANGED Viewed

@@ -4,45 +4,109 @@
  * Models (especially non-jsonMode backends like Anthropic/Ollama) routinely
  * wrap JSON in prose, ```json fences, or trailing commentary. This recovers the
  * first balanced top-level `{...}` object, respecting strings and escapes.
+ *
+ * gjc-robustness hardening (the JSON-mode path is the hot path for the default
+ * antigravity provider, which is text-only and cannot use native tool-calling):
+ *   - tolerate trailing commas before `}`/`]` (a frequent small-model slip);
+ *   - `preferKeys` lets the tool-call caller prefer the balanced object that
+ *     actually carries a `tool`/`tools` field over an earlier stray JSON object.
  */
-export function extractJsonObject<T = unknown>(text: string): T {
+export function extractJsonObject<T = unknown>(
+  text: string,
+  opts?: { preferKeys?: string[] },
+): T {
   const raw = text.trim();
+  const preferKeys = opts?.preferKeys;
-  // Fast path: already pure JSON.
-  try {
-    return JSON.parse(raw) as T;
-  } catch {
-    /* fall through to recovery */
-  }
+  // Fast path: already pure JSON (optionally with a trailing comma to repair).
+  const fast = tryParse<T>(raw);
+  if (fast !== undefined) return fast;
-  // Strip common code fences and retry.
+  // Strip common code fences and retry (pure, then trailing-comma-repaired).
   const defenced = raw.replace(/```(?:json|JSON)?/g, "").trim();
-  try {
-    return JSON.parse(defenced) as T;
-  } catch {
-    /* fall through to brace scan */
-  }
+  const fromDefencedWhole = tryParse<T>(defenced);
+  if (fromDefencedWhole !== undefined) return fromDefencedWhole;
-  const parsedFromDefenced = findAndParseBalancedObject<T>(defenced);
-  if (parsedFromDefenced !== null) {
+  const parsedFromDefenced = findAndParseBalancedObject<T>(defenced, preferKeys);
+  if (parsedFromDefenced !== undefined) {
     return parsedFromDefenced;
   }
-  const parsedFromRaw = findAndParseBalancedObject<T>(raw);
-  if (parsedFromRaw !== null) {
+  const parsedFromRaw = findAndParseBalancedObject<T>(raw, preferKeys);
+  if (parsedFromRaw !== undefined) {
     return parsedFromRaw;
   }
   throw new Error(`No parseable JSON object found in model output: ${truncate(raw, 200)}`);
 }
 /** Like {@link extractJsonObject} but returns null instead of throwing. */
-export function tryExtractJsonObject<T = unknown>(text: string): T | null {
+export function tryExtractJsonObject<T = unknown>(
+  text: string,
+  opts?: { preferKeys?: string[] },
+): T | null {
   try {
-    return extractJsonObject<T>(text);
+    return extractJsonObject<T>(text, opts);
   } catch {
     return null;
   }
 }
+/**
+ * Parse `s` as JSON, tolerating a single class of common model slip: a trailing
+ * comma right before a closing `}` or `]`. Returns `undefined` (not null — a bare
+ * `null`/`false` is a legal JSON value) when nothing parses.
+ */
+function tryParse<T>(s: string): T | undefined {
+  try {
+    return JSON.parse(s) as T;
+  } catch {
+    /* fall through to a trailing-comma repair */
+  }
+  const repaired = stripTrailingCommas(s);
+  if (repaired !== s) {
+    try {
+      return JSON.parse(repaired) as T;
+    } catch {
+      /* unrecoverable here */
+    }
+  }
+  return undefined;
+}
+/**
+ * Remove a comma that directly precedes a closing `}` or `]` (ignoring
+ * whitespace), but never a comma inside a string literal. String/escape state is
+ * tracked so a comma inside `"a,}"` is preserved.
+ */
+function stripTrailingCommas(s: string): string {
+  let out = "";
+  let inString = false;
+  let escaped = false;
+  for (let i = 0; i < s.length; i++) {
+    const ch = s[i]!;
+    if (inString) {
+      out += ch;
+      if (escaped) escaped = false;
+      else if (ch === "\\") escaped = true;
+      else if (ch === '"') inString = false;
+      continue;
+    }
+    if (ch === '"') {
+      inString = true;
+      out += ch;
+      continue;
+    }
+    if (ch === ",") {
+      let j = i + 1;
+      while (j < s.length && (s[j] === " " || s[j] === "\t" || s[j] === "\n" || s[j] === "\r")) j++;
+      if (j < s.length && (s[j] === "}" || s[j] === "]")) {
+        continue; // drop the trailing comma
+      }
+    }
+    out += ch;
+  }
+  return out;
+}
 /** Scan for a brace-balanced object starting at startIndex, ignoring braces inside strings. */
 function extractBalancedObject(text: string, startIndex: number): string | null {
   let depth = 0;
@@ -66,20 +130,36 @@ function extractBalancedObject(text: string, startIndex: number): string | null
   return null;
 }
-function findAndParseBalancedObject<T>(text: string): T | null {
+/**
+ * Find the first balanced `{...}` that parses as JSON. When `preferKeys` is given,
+ * keep scanning past an earlier parseable object that lacks every preferred key and
+ * return the first object that DOES carry one (e.g. the real `{ "tool": ... }` call
+ * after a stray JSON-looking object in reasoning prose); fall back to the first
+ * parseable object when none carries a preferred key.
+ */
+function findAndParseBalancedObject<T>(text: string, preferKeys?: string[]): T | undefined {
+  let firstParsed: T | undefined;
   for (let i = 0; i < text.length; i++) {
     if (text[i] === "{") {
       const candidate = extractBalancedObject(text, i);
       if (candidate) {
-        try {
-          return JSON.parse(candidate) as T;
-        } catch {
-          // ignore, try next
+        const parsed = tryParse<T>(candidate);
+        if (parsed !== undefined) {
+          if (firstParsed === undefined) firstParsed = parsed;
+          if (!preferKeys || preferKeys.length === 0) return parsed;
+          if (
+            parsed !== null &&
+            typeof parsed === "object" &&
+            preferKeys.some(k => k in (parsed as Record<string, unknown>))
+          ) {
+            return parsed;
+          }
+          // else: keep scanning for an object that carries a preferred key
         }
       }
     }
   }
-  return null;
+  return firstParsed;
 }
 function truncate(s: string, n: number): string {

package/src/commands/launch.ts CHANGED Viewed

@@ -1,7 +1,7 @@
 import { createInterface } from "node:readline/promises";
 import { emitKeypressEvents } from "node:readline";
 import { PassThrough } from "node:stream";
-import { runAgentLoop, executorSystemPrompt, DEFAULT_TOOLS, TOOL_PROTOCOL, WORKING_DISCIPLINE, type AgentLoopEvents } from "../agent/engine";
+import { runAgentLoop, executorSystemPrompt, DEFAULT_TOOLS, TOOL_PROTOCOL, WORKING_DISCIPLINE, OUTPUT_DISCIPLINE, type AgentLoopEvents } from "../agent/engine";
 import { createOpikTracer, wrapEvents } from "../agent/opik-tracer";
 import { initialDynamicStepLimit } from "../agent/step-budget";
 import { memoryPromptSection, spawnDetachedDistill } from "../agent/memory";
@@ -27,6 +27,7 @@ import { renderWelcome, playWelcomeSweep } from "../tui/components/welcome";
 import { checkForUpdate, readUpdateCache, writeUpdateCache } from "../util/update-check";
 import { jeoEnv } from "../util/env";
 import { renderUpdateBox } from "../tui/components/update-box";
+import { consumeLaunchWhatsNew } from "../util/whats-new";
 import { supportsUnicode } from "../tui/components/capability";
 import pkg from "../../package.json";
 import chalk from "chalk";
@@ -498,6 +499,16 @@ interface AbortHarnessOptions {
 export const PASTE_START = "\u001b[200~";
 export const PASTE_END = "\u001b[201~";
+/** True when a stdin chunk is ONLY backspace bytes (DEL 0x7f or BS 0x08) — i.e. a
+ *  standalone Backspace keystroke with nothing else. A backspace on an EMPTY input
+ *  line is a no-op edit, but some Bun readline builds turn it into a spurious `close`
+ *  event, which the REPL would treat as a hard exit ("Backspace quits jeo"). The
+ *  input filter swallows these when the line buffer is already empty so the byte never
+ *  reaches readline and the close can't fire. */
+export function isStandaloneBackspace(chunk: string): boolean {
+  return chunk.length > 0 && /^[\x7f\b]+$/.test(chunk);
+}
 export interface PromptInputQueue {
   pendingLines: string[];
   partial: string;
@@ -963,6 +974,7 @@ export function buildToolProtocol(allowedTools: Set<string>): string {
   lines.push("Reply with STRICT JSON only — no code fences. You MAY include an optional leading");
   lines.push('"reasoning" string (one short sentence on your plan, shown live to the user) before "tool":');
   lines.push('{ "reasoning": "<one short sentence>", "tool": "<name>", "arguments": { ... } }');
+  lines.push("Tool calibration: scale calls to difficulty — one for a known fact, a few for a normal task, more only when evidence is genuinely missing. Locate before you open: search/find first, then read the hit, instead of guessing paths.");
   return lines.join("\n");
 }
@@ -1197,7 +1209,8 @@ export async function runLaunchCommand(args: string[]): Promise<void> {
   const baseSystemPrompt =
     preamble + "\n\n" + protocol + "\n\n" +
     WORKING_DISCIPLINE + "\n\n" +
-    "Always verify (run tests / execute the program) before calling done." +
+    OUTPUT_DISCIPLINE + "\n\n" +
+    "Before calling done, self-check: did I run the test or command that exercises this change, are directly-affected callsites/tests/docs updated, and does my claim match real output? If any answer is no, keep working — do not call done." +
     "\nWhen you have finished the user's request, or need to reply to or ask the user something, call done with {\"reason\": <your natural-language reply to the user>}. The reason text is shown to the user as your message." +
     (allowedTools.has("task") ? "\n\nDelegation: " + taskToolProtocolLine(cfg) +
     " Call task with {\"role\": <one of the advertised roles>, \"task\": <assignment>, \"context\": <optional>} to hand a focused slice to a subagent." : "") +
@@ -1770,6 +1783,17 @@ export async function runLaunchCommand(args: string[]): Promise<void> {
   };
   showUpdateBanner(await readUpdateCache(pkg.version));
   showUpdateBanner(await Promise.race([updatePromise, new Promise<null>(r => setTimeout(() => r(null), 1200))]));
+  // First launch after a version bump: surface the bundled release notes ONCE
+  // (offline, from the new package's CHANGELOG.md) and record the seen version
+  // so it never repeats. Screen-safe: prints BEFORE the prompt is armed.
+  try {
+    const whatsNew = await consumeLaunchWhatsNew({
+      cols: Math.min(100, Math.max(40, (process.stdout.columns ?? 80) - 2)),
+      unicode: supportsUnicode(),
+      color: welcomeTheme.color,
+    });
+    if (whatsNew && whatsNew.length) console.log(whatsNew.join("\n"));
+  } catch { /* release notes are a courtesy; never block launch */ }
   if (!LaunchTui.usable(flags.noTui)) console.log("(plain output)");
   const useTui = LaunchTui.usable(flags.noTui);
@@ -1954,6 +1978,9 @@ export async function runLaunchCommand(args: string[]): Promise<void> {
     for (const off of promptListenerCleanups.splice(0)) { try { off(); } catch { /* best effort */ } }
   };
   let keyFilter: PassThrough | undefined;
+  // Holder for the active readline so the input filter can see the current line
+  // buffer (used by the empty-line backspace guard below). Set after rl is created.
+  let activeRl: { line?: string } | undefined;
   if (multilineInput) {
     const kf = new PassThrough();
     (kf as unknown as { isTTY: boolean }).isTTY = true;
@@ -1975,6 +2002,11 @@ export async function runLaunchCommand(args: string[]): Promise<void> {
     let kfInPaste = false;
     const kfDataHandler = (chunk: Buffer) => {
       const data = chunk.toString("utf8");
+      // Empty-line Backspace guard: a standalone Backspace with nothing to delete is a
+      // no-op, but some Bun readline builds emit a spurious `close` for it — which the
+      // REPL treats as a hard exit. Drop it before it reaches readline. (Inside a paste,
+      // or with text in the buffer, backspace is forwarded normally so editing works.)
+      if (!kfInPaste && isStandaloneBackspace(data) && !(activeRl?.line ?? "")) return;
       let out = "";
       let i = 0;
       while (i < data.length) {
@@ -2015,6 +2047,7 @@ export async function runLaunchCommand(args: string[]): Promise<void> {
     output: gatedStdout(process.stdout, () => previewArmed || promptActive || pickerActive || interactiveTurnActive),
     completer: (line: string) => readlineCompleter(line, completionContext()),
   });
+  activeRl = rl; // wire the input filter's empty-line backspace guard to the live buffer
   const promptStdin = process.stdin as typeof process.stdin & { isRaw?: boolean; setRawMode?(raw: boolean): void };
   const promptWasRaw = !!promptStdin.isRaw;
   let promptRawChanged = false;

package/src/commands/whats-new.ts ADDED Viewed

@@ -0,0 +1,62 @@
+import pkg from "../../package.json";
+import {
+  loadBundledChangelog,
+  parseChangelogSections,
+  releaseSections,
+  renderWhatsNew,
+} from "../util/whats-new";
+import { supportsUnicode } from "../tui/components/capability";
+/** `jeo whats-new` — show the release notes bundled with the running version. */
+export async function runWhatsNewCommand(args: string[] = []): Promise<void> {
+  const isHelp = args.includes("--help") || args.includes("-h");
+  const hasAll = args.includes("--all");
+  const hasJson = args.includes("--json");
+  const KNOWN = new Set(["--all", "--json", "-h", "--help"]);
+  for (const arg of args) {
+    if (!KNOWN.has(arg)) {
+      console.log(`Unknown flag: ${arg}`);
+      printUsage();
+      process.exitCode = 1;
+      return;
+    }
+  }
+  if (isHelp) {
+    printUsage();
+    return;
+  }
+  const md = await loadBundledChangelog();
+  const all = md ? releaseSections(parseChangelogSections(md)) : [];
+  const sections = hasAll ? all : all.slice(0, 1);
+  if (hasJson) {
+    console.log(JSON.stringify({ version: pkg.version, entries: sections }, null, 2));
+    return;
+  }
+  if (sections.length === 0) {
+    console.log(`No release notes found for jeo-code ${pkg.version}.`);
+    return;
+  }
+  const cols = process.stdout.columns ?? 80;
+  console.log(renderWhatsNew(sections, {
+    cols: Math.min(100, Math.max(40, cols - 2)),
+    unicode: supportsUnicode(),
+    color: process.stdout.isTTY === true,
+  }).join("\n"));
+}
+function printUsage(): void {
+  console.log("Usage: jeo whats-new [options]");
+  console.log("");
+  console.log("Show the release notes bundled with the installed jeo-code version.");
+  console.log("");
+  console.log("Options:");
+  console.log("  --all        Show notes for every released version, not just the latest");
+  console.log("  --json       Output the notes as JSON");
+  console.log("  -h, --help   Show this help message");
+}

package/src/tui/app.ts CHANGED Viewed

@@ -32,7 +32,7 @@ import { renderMarkdownTables } from "./components/markdown-table";
 import { stripMarkdown, renderMarkdownAnsi } from "./components/markdown-text";
 import { visibleWidth, wrapTextWithAnsi, truncateToWidth, sanitizeForFrame } from "./components/width";
 import { categoryBadge } from "./components/category-index";
-import { formatStepTimeline, stepsFromTools, formatStepHeader, formatStepTimelineCompact, type StepState } from "./components/step-timeline";
+import { formatStepTimeline, stepsFromTools, formatStepHeader, formatStepTimelineCompact, formatDuration as formatToolMs, type StepState } from "./components/step-timeline";
 import { formatHintBar } from "./components/hints";
 import { formatDuration, formatUsage } from "./components/duration";
 import { renderHud, derivePhase, type JeoPhase } from "./components/hud";
@@ -112,6 +112,12 @@ export const FRAME_WRAP_TAIL_CHARS = 16 * 1024;
 export function tailForWrap(text: string, maxChars = FRAME_WRAP_TAIL_CHARS): string {
   return text.length > maxChars ? text.slice(text.length - maxChars) : text;
 }
+/** Status animation palette while a tool/process runs (background verification): an
+ *  amber→yellow gradient, distinct from the cool thinking gradient, so "the agent is
+ *  running a process / verifying" reads at a glance (gjc parity: `theme.fg("warning")`
+ *  on the in-flight tool line). */
+export const STATUS_VERIFY_PALETTE = ["#ffd24a", "#ffb300"] as const;
 const DEFAULT_MAX_STEPS = 100;
 // Tools light enough that they never get a forge card (gjc parity): completion is a
 // single ✓/✗ ledger line; only failures surface a result card with the error body.
@@ -169,6 +175,8 @@ export class LaunchTui {
   private thinking = false;
   private hudPhase: JeoPhase = "thinking";
   private runningTool = false;
+  // When the current tool started (Date.now()); drives the result card's elapsed `(Nms)`.
+  private toolStartedAt = 0;
   // Latest transient provider notice (rate-limit auto-retry countdown); pinned into the
   // [STEP] status row while waiting so backoff is visible at a glance. Cleared on the
   // next step / model reply.
@@ -412,6 +420,7 @@ export class LaunchTui {
         this.streamingActivity = "";
         if (invocation && invocation.tool !== "done") {
           this.runningTool = true;
+          this.toolStartedAt = Date.now();
           this.hudPhase = "executing";
           const toolName = invocation.tool || "(no tool)";
           this.pendingIndex = this.tools.start(toolName);
@@ -464,6 +473,12 @@ export class LaunchTui {
         // tool checklist — no category/status badge clutter.
         const mark = this.unicode ? (success ? "✓" : "✗") : success ? "v" : "x";
         const paintedMark = this.theme.color ? (success ? chalk.green(mark) : chalk.red(mark)) : mark;
+        // gjc-parity timing detail: the completed card shows how long the tool ran,
+        // dim after the ✓/✗ glyph (e.g. `✓ Bash · (438ms)`).
+        const toolMs = this.toolStartedAt ? Date.now() - this.toolStartedAt : 0;
+        this.toolStartedAt = 0;
+        const durDim = this.theme.color ? chalk.dim : (s: string) => s;
+        const durSuffix = toolMs > 0 ? durDim(` ${this.unicode ? "·" : "-"} (${formatToolMs(toolMs)})`) : "";
         const result = summarizeForgeResult(tool, success, output);
         const card = this.pendingForge;
         this.pendingForge = null;
@@ -471,7 +486,7 @@ export class LaunchTui {
           // gjc-style single Bash card: command echo + `Output` divider + body + exit
           // note, under one ✓/✗-marked header — mutated in place so the live frame and
           // the non-TTY summary both show the merged card.
-          card.title = `${paintedMark} Bash`;
+          card.title = `${paintedMark} Bash${durSuffix}`;
           card.lines.push(...result.lines);
           this.flushForgeCard(card, success);
         } else if (card && t === "web_search" && success && webSearchCardLines(output, { unicode: this.unicode })) {
@@ -480,11 +495,11 @@ export class LaunchTui {
           // the structured tool output (provider chain — Anthropic native or the
           // keyless DuckDuckGo fallback).
           const ws = webSearchCardLines(output, { unicode: this.unicode })!;
-          card.title = `${paintedMark} Web Search: ${ws.titleMeta}`;
+          card.title = `${paintedMark} Web Search: ${ws.titleMeta}${durSuffix}`;
           card.lines = ws.lines;
           this.flushForgeCard(card, success);
         } else if (card) {
-          card.title = `${paintedMark} ${card.title}`;
+          card.title = `${paintedMark} ${card.title}${durSuffix}`;
           if (!success) this.rememberForge(result);
           this.flushForgeCard(card, success);
           if (!success) this.flushForgeCard(result, false);
@@ -492,7 +507,7 @@ export class LaunchTui {
           // Light tool: one ✓/✗ line, plus a dim result tree for list-shaped output
           // (find/search/ls) and an error card when the tool failed.
           const { suffix, children } = this.ledgerTree(tool, success, output);
-          this.appendLedger(`${paintedMark} ${target}${suffix}\n${children.map(c => `${c}\n`).join("")}`, "tool");
+          this.appendLedger(`${paintedMark} ${target}${suffix}${durSuffix}\n${children.map(c => `${c}\n`).join("")}`, "tool");
           if (!success) {
             this.rememberForge(result);
             this.flushForgeCard(result, false);
@@ -1164,12 +1179,16 @@ export class LaunchTui {
     // the ⟦esc⟧ cancel hint visible without trapping the message inside a border.
     if (isThinking) {
       const grad = themeGradient(this.theme, idx);
+      // While a tool/process runs (background verification), the status animation turns
+      // amber/yellow — distinct from the cool thinking gradient (gjc warning-color parity).
+      const verifying = this.runningTool;
+      const verifySpin = verifying && this.theme.color ? chalk.yellow(this.spinner.current()) : this.spinner.current();
       const costUsd = costForUsage(this.footer.model, this.turnUsage) ?? undefined;
       const stats = this.tools.stats();
       tail.push(...renderStatusBox({
         cols: Math.max(24, Math.min(120, cols)),
         phaseLabel: this.workflowStatus ? `${this.workflowStatus.skill}:${this.workflowStatus.phase}` : this.hudPhase,
-        spinner: this.spinner.current(),
+        spinner: verifySpin,
         activity: this.retryNotice ?? (this.streamingActivity || this.currentActivity()),
         escHint: true,
         elapsedMs,
@@ -1184,7 +1203,7 @@ export class LaunchTui {
         color: this.theme.color,
         colorLevel,
         phase,
-        palette: [grad.from, grad.to],
+        palette: verifying ? [...STATUS_VERIFY_PALETTE] : [grad.from, grad.to],
         isThinking: true,
         usage: this.turnUsage,
         costUsd,
@@ -1350,7 +1369,7 @@ export class LaunchTui {
         for (const line of renderStatusBox({
           cols: innerWidth,
           phaseLabel: this.workflowStatus ? `${this.workflowStatus.skill}:${this.workflowStatus.phase}` : this.hudPhase,
-          spinner: this.spinner.current(),
+          spinner: this.runningTool && this.theme.color ? chalk.yellow(this.spinner.current()) : this.spinner.current(),
           activity: this.retryNotice ?? (this.streamingActivity || statusMsg),
           escHint: true,
           elapsedMs,
@@ -1365,7 +1384,7 @@ export class LaunchTui {
           color: this.theme.color,
           colorLevel,
           phase,
-          palette,
+          palette: this.runningTool ? [...STATUS_VERIFY_PALETTE] : palette,
           isThinking: true,
           usage: this.turnUsage,
           costUsd,

package/src/tui/components/transcript.ts CHANGED Viewed

@@ -46,6 +46,20 @@ function firstToolResultLine(text: string | undefined): string {
     .slice(0, 96) ?? "";
 }
+const TOOL_RESULT_GLOBAL = /^Tool \[([^\]]+)\] result \((ok|fail)\):/gm;
+/** Split a tool-result user message — which for a BATCH holds several
+ *  `Tool [x] result (ok|fail):` blocks joined by blank lines — into per-call
+ *  verdicts in the order the engine emitted them (= the batch's call order). */
+function parseToolVerdicts(text: string | undefined): { tool: string; status: string; firstLine: string }[] {
+  if (!text) return [];
+  const matches = [...text.matchAll(TOOL_RESULT_GLOBAL)];
+  return matches.map((mt, k) => {
+    const start = mt.index ?? 0;
+    const end = k + 1 < matches.length ? (matches[k + 1]!.index ?? text.length) : text.length;
+    return { tool: mt[1]!, status: mt[2]!, firstLine: firstToolResultLine(text.slice(start, end)) };
+  });
+}
 /** Format engine history as a scrollback-friendly transcript. */
 export function formatTranscript(messages: readonly Message[], opts: TranscriptOptions = {}): string[] {
   const color = opts.color !== false;
@@ -92,25 +106,41 @@ export function formatTranscript(messages: readonly Message[], opts: TranscriptO
       lines.push(...clipBody(m.content, bodyCap));
       continue;
     }
-    // assistant: a JSON tool call (one compact ledger line) or a prose reply.
-    let invocation: { tool?: unknown; arguments?: unknown } | null = null;
+    // assistant: one or more JSON tool calls (compact ledger lines) or a prose reply.
+    // Handles BOTH the single `{tool,arguments}` form AND the batched `{tools:[...]}`
+    // form — the batch case previously parsed to no `tool` field, fell through, and
+    // dumped the raw JSON object into the transcript (the "/resume shows JSON" bug).
+    let parsed: { tool?: unknown; tools?: unknown; arguments?: unknown } | null = null;
     try {
-      const parsed = JSON.parse(m.content) as { tool?: unknown; arguments?: unknown };
-      if (parsed && typeof parsed === "object" && typeof parsed.tool === "string") invocation = parsed;
+      const p: unknown = JSON.parse(m.content);
+      if (p && typeof p === "object") parsed = p as { tool?: unknown; tools?: unknown; arguments?: unknown };
     } catch { /* prose reply */ }
-    if (invocation && typeof invocation.tool === "string" && invocation.tool !== "done") {
-      // The matching `Tool [x] result (ok|fail)` user message tells success/failure.
+    const calls: { tool: string; arguments?: unknown }[] =
+      parsed && typeof parsed.tool === "string"
+        ? [{ tool: parsed.tool, arguments: parsed.arguments }]
+        : parsed && Array.isArray(parsed.tools)
+          ? (parsed.tools as { tool?: unknown; arguments?: unknown }[])
+              .filter(c => c && typeof c.tool === "string")
+              .map(c => ({ tool: c.tool as string, arguments: c.arguments }))
+          : [];
+    const toolCalls = calls.filter(c => c.tool !== "done");
+    if (toolCalls.length > 0) {
+      // The matching `Tool [x] result (ok|fail)` user message follows; for a batch it
+      // is ONE message with several blocks. Parse verdicts in call order.
       const next = messages[i + 1];
-      const verdict = next?.role === "user" ? next.content.match(TOOL_RESULT_RE) : null;
-      const mark = verdict?.[2] === "fail" ? red(bad) : green(ok);
-      const title = summarizeForgeInvocation(invocation.tool, invocation.arguments).title;
-      const resultLine = firstToolResultLine(next?.content);
-      const suffix = resultLine ? dim(` — ${resultLine}`) : "";
-      lines.push(`  ${mark} ${title}${suffix}`);
+      const verdicts = next?.role === "user" ? parseToolVerdicts(next.content) : [];
+      toolCalls.forEach((c, ci) => {
+        const v = verdicts[ci] ?? verdicts.find(x => x.tool === c.tool);
+        const mark = v?.status === "fail" ? red(bad) : green(ok);
+        const title = summarizeForgeInvocation(c.tool, c.arguments).title;
+        const suffix = v?.firstLine ? dim(` — ${v.firstLine}`) : "";
+        lines.push(`  ${mark} ${title}${suffix}`);
+      });
       continue;
     }
-    const reason = invocation
-      ? String((invocation.arguments as { reason?: unknown } | undefined)?.reason ?? "")
+    // A lone `done` (show its reason) or a plain prose reply.
+    const reason = parsed
+      ? String((parsed.arguments as { reason?: unknown } | undefined)?.reason ?? "")
       : m.content;
     if (!reason.trim()) continue;
     lines.push(`${magentaBold(`jeo ${jeoMark}`)}`);

package/src/util/whats-new.ts ADDED Viewed

@@ -0,0 +1,272 @@
+import pkg from "../../package.json";
+import { compareVersions } from "../commands/update";
+import * as os from "node:os";
+import * as path from "node:path";
+import { readFile, writeFile, mkdir } from "node:fs/promises";
+import { jeoEnv } from "./env";
+import chalk from "chalk";
+import { boxBlock, BOX_UNICODE, BOX_ASCII } from "../tui/components/layout";
+// ---- "What's New" release notes ---------------------------------------------
+// Mirrors gjc's post-upgrade release-notes surface. The bundled CHANGELOG.md
+// (shipped via package.json `files`) is always the RUNNING version's changelog,
+// so after a `bun install -g jeo-code` upgrade the next launch reads the NEW
+// notes offline. `jeo whats-new` shows them on demand; `jeo update --install`
+// shows them right after a successful self-update.
+export interface ChangelogGroup {
+  /** "Added" | "Changed" | "Fixed" | "" (ungrouped). */
+  label: string;
+  items: string[];
+}
+export interface ChangelogSection {
+  version: string; // "0.5.9" | "Unreleased"
+  date?: string;
+  summary: string; // the `_italic_` one-liner under the header
+  groups: ChangelogGroup[];
+}
+const HEADER = /^##\s+\[([^\]]+)\](?:\s*-\s*(\S+))?\s*$/;
+const SUBHEADER = /^###\s+(.+?)\s*$/;
+const BULLET = /^[-*]\s+(.+)$/;
+const ITALIC = /^_(.+)_$/;
+/** Parse `## [version] - date` sections with their summary line and grouped bullets. */
+export function parseChangelogSections(markdown: string): ChangelogSection[] {
+  const lines = markdown.split(/\r?\n/);
+  const sections: ChangelogSection[] = [];
+  let current: ChangelogSection | null = null;
+  let group: ChangelogGroup | null = null;
+  for (const raw of lines) {
+    const line = raw.trimEnd();
+    const head = line.match(HEADER);
+    if (head) {
+      current = { version: head[1]!, date: head[2], summary: "", groups: [] };
+      group = null;
+      sections.push(current);
+      continue;
+    }
+    if (!current) continue;
+    const trimmed = line.trim();
+    if (trimmed === "") continue;
+    const sub = trimmed.match(SUBHEADER);
+    if (sub) {
+      group = { label: sub[1]!, items: [] };
+      current.groups.push(group);
+      continue;
+    }
+    const bullet = trimmed.match(BULLET);
+    if (bullet) {
+      if (!group) {
+        group = { label: "", items: [] };
+        current.groups.push(group);
+      }
+      group.items.push(bullet[1]!.trim());
+      continue;
+    }
+    const italic = trimmed.match(ITALIC);
+    if (italic && !current.summary && current.groups.length === 0) {
+      current.summary = italic[1]!.trim();
+    }
+  }
+  return sections;
+}
+/** Real releases only (drops an `Unreleased` heading), newest-first as written. */
+export function releaseSections(sections: ChangelogSection[]): ChangelogSection[] {
+  return sections.filter(s => s.version.toLowerCase() !== "unreleased");
+}
+/**
+ * Sections strictly newer than `fromVersion` and at most `toVersion`.
+ * `fromVersion` null → every release up to and including `toVersion`.
+ */
+export function selectNewSections(
+  sections: ChangelogSection[],
+  fromVersion: string | null,
+  toVersion: string,
+): ChangelogSection[] {
+  return releaseSections(sections).filter(s => {
+    const newerThanFrom = fromVersion ? compareVersions(s.version, fromVersion) > 0 : true;
+    const atMostTo = compareVersions(s.version, toVersion) <= 0;
+    return newerThanFrom && atMostTo;
+  });
+}
+export interface WhatsNewRenderOpts {
+  cols?: number;
+  unicode?: boolean;
+  color?: boolean;
+  /** Cap rendered body lines so a multi-release jump stays bounded. Default 22. */
+  maxBodyLines?: number;
+}
+/** Word-wrap plain text to `width` columns, hard-breaking any single over-long token. */
+function wrapWords(text: string, width: number): string[] {
+  const w = Math.max(1, width);
+  const out: string[] = [];
+  let cur = "";
+  const flush = () => { if (cur) { out.push(cur); cur = ""; } };
+  for (let word of text.split(/\s+/).filter(Boolean)) {
+    while (word.length > w) {
+      flush();
+      out.push(word.slice(0, w));
+      word = word.slice(w);
+    }
+    if (!cur) cur = word;
+    else if (cur.length + 1 + word.length <= w) cur += " " + word;
+    else { out.push(cur); cur = word; }
+  }
+  flush();
+  return out.length ? out : [""];
+}
+/** Render a boxed "What's New" panel for the given sections (newest-first). */
+export function renderWhatsNew(sections: ChangelogSection[], opts?: WhatsNewRenderOpts): string[] {
+  if (sections.length === 0) return [];
+  const cols = opts?.cols ?? 80;
+  const useColor = opts?.color !== false;
+  const useUnicode = opts?.unicode !== false;
+  const maxBody = opts?.maxBodyLines ?? 22;
+  const width = Math.max(32, Math.min(120, cols));
+  const inner = Math.max(8, width - 2);
+  const accent = useColor ? chalk.hex("#f2b84b") : (s: string) => s;
+  const bold = useColor ? (s: string) => chalk.bold(accent(s)) : (s: string) => s;
+  const boldPlain = useColor ? chalk.bold : (s: string) => s;
+  const dim = useColor ? chalk.dim : (s: string) => s;
+  const bulletChar = useUnicode ? "•" : "-";
+  const body: string[] = [];
+  let clipped = false;
+  // Wrap `text` to the inner width, paint each line, and push — bullets indent
+  // their continuation lines under the marker. Returns false once the cap is hit.
+  const emit = (text: string, paint: (s: string) => string, kind: "plain" | "bullet" = "plain"): boolean => {
+    const avail = kind === "bullet" ? inner - 2 : inner;
+    const wrapped = wrapWords(text, avail);
+    for (let i = 0; i < wrapped.length; i++) {
+      if (body.length >= maxBody) { clipped = true; return false; }
+      const prefix = kind === "bullet" ? (i === 0 ? `${bulletChar} ` : "  ") : "";
+      body.push(paint(prefix + wrapped[i]!));
+    }
+    return true;
+  };
+  const headerLabel = sections.length === 1
+    ? `What's New in jeo ${sections[0]!.version}`
+    : `What's New (${sections.length} releases)`;
+  emit(headerLabel, bold);
+  outer:
+  for (const s of sections) {
+    if (sections.length > 1) {
+      if (body.length >= maxBody) { clipped = true; break; }
+      body.push("DIVIDER");
+      const when = s.date ? ` — ${s.date}` : "";
+      if (!emit(`${s.version}${when}`, accent)) break;
+    }
+    if (s.summary && !emit(s.summary, dim)) break;
+    for (const g of s.groups) {
+      if (g.label && !emit(g.label, boldPlain)) break outer;
+      for (const item of g.items) {
+        if (!emit(item, (l: string) => l, "bullet")) break outer;
+      }
+    }
+  }
+  if (clipped) body.push(dim("… see CHANGELOG.md for the full notes"));
+  return boxBlock(body, width, {
+    glyphs: useUnicode ? BOX_UNICODE : BOX_ASCII,
+    paint: accent,
+    align: "left",
+  });
+}
+// ---- Bundled changelog + last-seen-version state ----------------------------
+/** Read the CHANGELOG.md that ships next to this package (running version's notes). */
+export async function loadBundledChangelog(): Promise<string | null> {
+  // This module lives at src/util/whats-new.ts; CHANGELOG.md sits at the package
+  // root, i.e. two levels up. Resolve against the module dir so it works from a
+  // global install path just as from the repo.
+  const candidate = path.join(import.meta.dir, "..", "..", "CHANGELOG.md");
+  try {
+    return await readFile(candidate, "utf-8");
+  } catch {
+    return null;
+  }
+}
+interface WhatsNewState {
+  lastSeenVersion: string;
+  updatedAt: number;
+}
+function stateDir(): string {
+  return jeoEnv("CONFIG_DIR") || path.join(os.homedir(), ".jeo");
+}
+function statePath(): string {
+  return path.join(stateDir(), "whats-new.json");
+}
+export async function readLastSeenVersion(): Promise<string | null> {
+  try {
+    const raw = await readFile(statePath(), "utf-8");
+    const data = JSON.parse(raw) as Partial<WhatsNewState>;
+    if (!data || typeof data.lastSeenVersion !== "string" || !data.lastSeenVersion) return null;
+    return data.lastSeenVersion;
+  } catch {
+    return null;
+  }
+}
+/** Persist the last version the user has seen notes for (best-effort; never throws). */
+export async function writeLastSeenVersion(version: string): Promise<void> {
+  if (typeof version !== "string" || !version) return;
+  try {
+    await mkdir(stateDir(), { recursive: true, mode: 0o700 });
+    const payload: WhatsNewState = { lastSeenVersion: version, updatedAt: Date.now() };
+    await writeFile(statePath(), JSON.stringify(payload, null, 2), { encoding: "utf-8", mode: 0o600 });
+  } catch {
+    // State is an optimization; a write failure must never break launch/update.
+  }
+}
+/**
+ * Launch-time hook: returns rendered notes the FIRST time jeo runs after an
+ * upgrade, then records the current version so it never repeats. A fresh install
+ * (no prior state) records silently and shows nothing — only genuine upgrades
+ * surface notes, matching gjc / npm update-notice behaviour.
+ */
+export async function consumeLaunchWhatsNew(opts?: WhatsNewRenderOpts): Promise<string[] | null> {
+  const current = pkg.version;
+  const lastSeen = await readLastSeenVersion();
+  if (!lastSeen) {
+    await writeLastSeenVersion(current);
+    return null;
+  }
+  if (compareVersions(current, lastSeen) <= 0) {
+    // Not an upgrade (equal, or a local downgrade). Keep state monotonic-ish.
+    if (compareVersions(current, lastSeen) < 0) await writeLastSeenVersion(current);
+    return null;
+  }
+  // Genuine upgrade: mark seen up front so a render/parse failure never repeats.
+  const md = await loadBundledChangelog();
+  await writeLastSeenVersion(current);
+  if (!md) return null;
+  const sections = selectNewSections(parseChangelogSections(md), lastSeen, current);
+  if (sections.length === 0) return null;
+  return renderWhatsNew(sections, opts);
+}