npm - @levistudio/redline - Versions diffs - 0.2.0 → 0.4.0 - Mend

@levistudio/redline 0.2.0 → 0.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (27) hide show

package/AGENTS.md +227 -0
package/CHANGELOG.md +22 -1
package/CLAUDE.md +9 -0
package/README.md +39 -21
package/SECURITY.md +3 -3
package/bin/redline.cjs +4 -1
package/package.json +4 -1
package/scripts/install-skill.sh +104 -39
package/skills/redline-review/SKILL.md +9 -9
package/src/agent.ts +19 -21
package/src/agentProvider.ts +267 -0
package/src/cli.ts +109 -36
package/src/client/cards.ts +10 -1
package/src/client/lib.ts +143 -78
package/src/client/render.ts +17 -5
package/src/client/selection.ts +1 -1
package/src/client/sse.ts +34 -2
package/src/client/state.ts +22 -0
package/src/client/styles.css +27 -0
package/src/parseReply.ts +9 -1
package/src/pickModel.ts +6 -4
package/src/promptEnvelope.ts +1 -1
package/src/resolve.ts +134 -97
package/src/reviewSummary.ts +93 -0
package/src/server-page.ts +5 -3
package/src/server.ts +50 -16
package/src/sidecar.ts +5 -0

package/AGENTS.md ADDED Viewed

@@ -0,0 +1,227 @@
+# Redline — agent onboarding
+This file orients an AI agent contributing to the Redline codebase. It captures the architecture, the load-bearing implementation decisions, and the non-obvious gotchas that have bitten previous changes. For product framing (what Redline is, who it's for, how to use it), see [README.md](README.md).
+## What you're working in
+Redline is a single-player, local Markdown review tool. The user points it at a `.md` file; it opens a browser-based reader; the user leaves inline comments; an agent process replies in real time; the user resolves and accepts; the document is revised. The two pieces are:
+1. **The Review Reader** — local Bun + Hono server, renders Markdown, serves a small client-side JS app, handles select-to-comment, persists comments to a sidecar.
+2. **The conversational agent** — a child process spawned alongside the server that listens to comment events and replies via the selected local agent provider (`claude` or `codex`).
+The product is real-time conversation, not turn-based submit/respond. Reviews can span multiple rounds (comment → reply → resolve → revise → next round) until the user signs off.
+## Architecture
+### Sidecar — `.review/<filename>.json`
+The sidecar is the source of truth for a review session. Schema lives in [src/sidecar.ts](src/sidecar.ts):
+```ts
+Sidecar { file, context?, rounds: Round[] }
+Round   { round, started_at, submitted_at, agent_replied_at, resolved_at, comments: Comment[] }
+Comment { id, quote, context_before, context_after, thread: ThreadEntry[], resolved }
+ThreadEntry { role: human|agent, name?, message, at, requires_revision?, revision_reason? }
+```
+- **Per-file mutex.** `withSidecar(filePath, fn)` in [src/sidecar.ts](src/sidecar.ts) holds a per-file lock across `load → mutate → save` so concurrent POSTs can't interleave and silently drop a writer's mutation. Use it for every endpoint that touches the sidecar.
+- **History.** Every `resolve` snapshots the pre-revision file to `.review/history/<file>.<iso>.md`. Past rounds are viewable read-only at `/round/:n`.
+### CLI shape
+```
+redline <file>                 # opens the review reader; URL is printed and written to .review/<file>.startup.json
+redline <file> --context "..." # opens with a reviewer-supplied focus statement (see "Context-aware prompts" below)
+redline <file> --agent codex   # use Codex instead of the auto-detected/default provider
+redline <file> --no-agent      # manual annotation mode — no agent spawn, no provider CLI required on PATH
+redline resolve <file>         # one-shot: read the sidecar, run the revision pass, write back
+```
+The bare-arg path also spawns a dedicated agent subprocess alongside the server — see "The dedicated agent process" below.
+### Context-aware prompts
+When the user passes `--context "..."` (or the sidecar already has `context` set from a prior run), the string is rendered above the doc as a banner *and* prepended to both prompt builders:
+- [src/agent.ts](src/agent.ts) calls `loadSidecar` per reply and prepends a `Reviewer's stated focus` block to the user message before the document/comment sections.
+- [src/resolve.ts](src/resolve.ts) prepends the same block to the revision user message.
+The shared helper is [src/contextBlock.ts](src/contextBlock.ts) — a one-liner that returns either the formatted block or empty string. It wraps the context through the same UUID-anchored envelope as the document and comment text, so adversarial context content can't escape its envelope to pose as system instructions.
+### Frontend
+Client-side JS lives in [src/client/main.ts](src/client/main.ts). It is bundled at server startup via `Bun.build` and served from memory at `/client.js`. Pure helpers (`escapeHtml`, `latestVerdict`, `nearestCell`, `clampRangeToCell`, `captureSelection`, `highlightText`, `computeNavState`, `preserveScroll`) live in [src/client/lib.ts](src/client/lib.ts) with happy-dom test coverage in [src/client/lib.test.ts](src/client/lib.test.ts). New pure logic should land in `lib.ts` with tests.
+Server-side state is bootstrapped into the page as `window.__REDLINE__` ahead of the bundle.
+### Inline diff toggle
+After a revision pass, the new round opens with the document rendered **as the diff** (block-level insert/delete bands, word-level `<ins>`/`<del>` marks for modified paragraphs). A header toggle (`#btn-toggle-diff`, "Show changes" / "Hide changes") flips between diff view and clean view. The toggle is only rendered on rounds 2+ in the live view — round 1 has nothing to compare against, and the read-only `/round/:n` views can't use `/api/diff` honestly because that endpoint always compares the current file to the most recent snapshot.
+- Pure swap helpers (`applyDiffSwap`, `revertDiffSwap`, `updateToggleButton`, `diffStateKey`) live in [src/client/diffToggle.ts](src/client/diffToggle.ts) with tests in [src/client/diffToggle.test.ts](src/client/diffToggle.test.ts). DOM + network glue is in [src/client/diff.ts](src/client/diff.ts).
+- Diff HTML is fetched lazily from `/api/diff` and cached in-module; the original clean innerHTML is captured on the first swap so toggling back is a pure restore (no second fetch, no server round trip).
+- Per-file state is persisted in `sessionStorage` under `rl-diff-on-<title>` so soft refreshes don't reset the view. Auto-enabled on round open when `just-revised` is set.
+- After every swap, both `applyHighlights()` and `renderComments()` re-run because the prose DOM was rebuilt and comment marks/cards need to re-anchor.
+- Commenting stays enabled in diff mode. Quotes captured against diff text may include `<ins>`/`<del>` boundaries; flipping back to clean view can fail to highlight those comments. Accepted tradeoff.
+## Design decisions worth knowing
+- **Free-text comments only.** Typed actions (`[expand]`, `[challenge]`, `[cut]`) are a future direction; don't add structure prematurely.
+- **Sidecar lives next to the file, in `.review/`.** Gitignore-able as a pattern, obvious where to look. Don't move it.
+- **No anchoring-under-edits problem.** The agent doesn't edit the file while the user is reviewing — write happens between rounds. Quote + 32 chars before/after is enough to relocate a comment when reopening a review.
+- **Comments are instructions, not just notes.** The sidecar is structured so an agent can read and act on it in one pass.
+- **Shell out to local agent CLIs, don't use SDKs.** Both `src/agent.ts` (replies) and `src/resolve.ts` (revisions) invoke the selected provider through [src/agentProvider.ts](src/agentProvider.ts), so auth comes from the user's existing Claude Code or Codex session. Do not switch Redline to SDK/API-key auth without a deliberate product decision.
+## The dedicated agent process
+`redline <file>` spawns [src/agent.ts](src/agent.ts) as a child process alongside the server. The agent opens its own SSE connection to `/api/events` and reacts to comment events directly, with no harness-level task-notification overhead.
+- Reply latency is bounded by the selected provider's inference time instead of by task scheduling.
+- The agent's read loop **does not await** event handlers — it fires them and continues reading. This lets multiple comments process in parallel and prevents a slow response from blocking the SSE stream.
+- An `inProgress` Set deduplicates so the same comment isn't replied to twice if `comment-reply` fires multiply.
+- `agent-replied` is fired only when `inProgress` drains to zero, so the UI sees one "agent done" event per batch rather than per reply.
+- The CLI installs `exit`/`SIGINT`/`SIGTERM` handlers to kill the agent when the server dies, and auto-restarts the agent on unexpected exit (capped to 5 restarts per 60s window — see [src/cli.ts](src/cli.ts)).
+### `--no-agent` (manual mode)
+Skips provider preflight and the agent spawn. The server still serves the page, accepts comments, and resolves them; there's just no agent to reply or revise. The page bootstraps `noAgent: true` into `window.__REDLINE__` and shows a "Manual mode" pill in the header. The client uses the flag to:
+- Force the round-level button into **finish** mode (`/api/finish`) regardless of comment count, since `/api/accept` would dead-end on the watchdog with no agent to land `/api/reload`.
+- Suppress the "revise the document anyway" secondary link for the same reason.
+Per-comment verdict logic degrades naturally: with no agent there are no `requires_revision` fields, and the existing fallback ("field absent → treated as accept") makes "Accept as-is" the natural finish path.
+## Model picking
+Both `agent.ts` (replies) and `resolve.ts` (revisions) pick a model tier based on the human's message text:
+- Default to `fast` for short confirmations and simple substitutions.
+- Promote to `smart` when the message is long (>120 chars for replies, >150 for revisions), contains a question mark, or matches an "involved" keyword (`suggest`, `alternative`, `rewrite`, `restructure`, `tone`, etc.).
+- The provider maps tiers to concrete model ids. `redline resolve --model <id>` overrides the picker for a one-off run.
+The chosen model is logged so you can see which path each event took. See [src/pickModel.ts](src/pickModel.ts).
+## Workflow model
+The conversation is real-time. There is no "submit for review" step. The flow:
+1. Human leaves a comment → server broadcasts `comment-added` SSE event → agent replies immediately.
+2. Human can reply in the thread → broadcasts `comment-reply` → agent replies again.
+3. Human resolves comments one by one. When all are resolved, the round-level button enables — labeled by the **majority verdict** the agent attached to its replies (see "Verdict-aware resolve" below). Default is "Revise document"; if every comment was answered without implying an edit, the default flips to "Accept as-is" (calls `/api/finish`, no revision pass). The non-default action is always one click away as a small secondary link below.
+4. Human clicks Revise → broadcasts `accepted` → agent runs the resolve flow → `/api/reload` triggers a full browser reload to the next round.
+5. When a round opens with no comments to act on, the same button reads "Done" and calls `/api/finish`. That broadcasts `finished`, the browser shows a "review complete" splash, and the server prints a summary and exits — handing control back to whichever terminal launched it.
+The Revise button is intentionally **not** styled green. It triggers a non-trivial revision pass, not a final confirmation; neutral styling reflects that.
+## SSE event vocabulary
+| Event | Fired when | Client behavior |
+|---|---|---|
+| `comment-added` | Human posts a new comment | Soft refresh + `applyHighlights()` (new highlight needed) |
+| `comment-reply` | Human or agent posts to existing thread | Soft refresh; remove that comment from `thinkingCommentIds` |
+| `comment-resolved` | Human resolves a comment | Soft refresh + `applyHighlights()` (color change) |
+| `comment-thinking` | Agent POSTs `/api/comment/:id/thinking` | Add to `thinkingCommentIds`, show dots in that thread (multiple threads can be active simultaneously) |
+| `agent-replied` | Agent POSTs `/api/agent-replied` after all in-flight replies finish | Clear `thinkingCommentIds`, soft refresh |
+| `accepted` | Human clicks "Revise document" | Agent's cue to run the resolve flow |
+| `finished` | Human clicks "Done" on a round with no comments, or "Looks good — close session" in the diff overlay | Replace body with a "Review complete — close this tab" splash |
+| `reload` | The resolve flow finishes writing the revised document | Full `window.location.reload()` |
+| `agent-unavailable` | CLI hits the agent restart cap (5 in 60s) | Show persistent "Agent offline" pill in header; sticky until page reload |
+Soft refresh = `GET /api/comments` then re-render the sidebar. Full reload = `window.location.reload()`. **Use soft refresh for everything except `reload`.** Full reloads scroll to top and feel jarring.
+The server sends `: ping\n\n` comment frames every 15 seconds to keep the SSE connection alive through long revisions. Without this the browser would silently disconnect during a long revision pass and miss the `reload` event when it finally fired.
+## Frontend gotchas (paid in blood)
+- **Never call `window.location.reload()` for routine UI updates.** It loses scroll position and feels like a page navigation. Soft-refresh via `softRefresh()` for comment events.
+- **DOM rebuilds inside scrollable content can scroll the page to 0.** Two distinct causes:
+  1. `prose.normalize()` + re-wrapping text in `<mark>` shifts layout briefly.
+  2. Destroying a focused `<textarea>` (e.g. when `renderComments()` rebuilds the sidebar after a reply submit) makes the browser scroll the focus target into view, which can land at the document top.
+- The fix is the `preserveScroll(fn)` wrapper in [src/client/lib.ts](src/client/lib.ts). It saves `scrollY`, calls `.blur()` on the active element first, runs the mutation, then sets `scrollTop` synchronously **and** across two `requestAnimationFrame` callbacks. The triple restore is not paranoia — focus-related scroll lands a frame later.
+- Wrap any function that mutates significant DOM with `preserveScroll`. Right now that's `applyHighlights` and `renderComments`.
+- `applyHighlights` is expensive (full prose rewrite). Only call it when highlights actually change: new comment, resolved/unresolved. **Do not call it on reply.** This is why `softRefresh` takes `{ rehighlight: true }`.
+## Server lifecycle
+There is no auto-restart. The server binds to an OS-assigned port (`port: 0`, hostname `127.0.0.1`) so you can't kill it by port number — kill the CLI process. After editing `src/server.ts` or `src/agent.ts`:
+```bash
+pkill -f "src/cli.ts" 2>/dev/null
+pkill -f "src/agent.ts" 2>/dev/null
+bun run src/cli.ts <file.md> &
+```
+(The CLI also kills the agent on its own SIGINT/SIGTERM, but if you only edited `agent.ts` you can restart just the agent: `pkill -f "src/agent.ts" && REDLINE_PORT=<port> REDLINE_TOKEN=<token> bun src/agent.ts <absolute-path-to-file.md> &` — port and token come from the printed startup URL and `.review/<file>.startup.json`.)
+The browser SSE EventSource auto-reconnects (3s backoff), so the open tab survives a restart — but you still need to hard-reload (Cmd+Shift+R) to pick up new client-side JS.
+On startup the server checks the sidecar and auto-creates an open round if all rounds are resolved. This prevents a stuck "Revising the document" spinner state when reopening a previously-finished review.
+## Agent etiquette inside the conversation flow
+When you're acting as the agent on the other end of the SSE stream:
+1. POST `/api/comment/:id/thinking` *before* you start composing. This is the user's only signal that you saw their message.
+2. POST `/api/comment/:id/reply` with `{ role: "agent", name: "<provider display name>", message, requires_revision, revision_reason }`. The verdict fields are required of every agent reply — see "Verdict-aware resolve" below.
+3. POST `/api/agent-replied` to clear the thinking indicator and trigger a soft refresh.
+Skipping step 1 leaves the user staring at silence wondering if anything is happening.
+## Verdict-aware resolve
+Every agent reply ships with a verdict on whether the comment, once resolved, implies an edit to the document. The fields live on the `ThreadEntry` for that agent reply:
+- `requires_revision: true` — the comment implies a doc edit (typo fix, rewording, restructure, content change). The per-comment Resolve button reads "Resolve → queue edit" and tints warm. The resolved card carries an "✎ Edit queued" badge.
+- `requires_revision: false` — the conversation answered it (clarifying question, approval, agent explanation that doesn't imply an edit). The Resolve button stays plain, the card carries a "✓ Answered" badge.
+- field absent — the comment was resolved before the agent ever replied. Treated as `accept` (the human resolved unilaterally, so they're saying "doesn't matter").
+The round-level button picks its default by the latest verdict on each comment:
+- All verdicts are `accept` → primary = **Accept as-is** (`/api/finish`, no revision pass).
+- Any verdict is `revise` → primary = **Revise document** (`/api/accept`).
+- The non-default action is always available as a secondary text link under the banner. Choosing "accept anyway" when comments imply edits triggers a `confirm()` warning.
+The agent provider ([src/agentProvider.ts](src/agentProvider.ts), called by [src/agent.ts](src/agent.ts)) gets the verdict by asking the selected local agent to reply in a delimiter envelope rather than JSON:
+```
+REQUIRES_REVISION: <true|false>
+ESCALATE: <true|false>
+REASON: <one short sentence>
+---MESSAGE---
+<free-form reply prose>
+---END---
+```
+JSON was tried first and abandoned: agent replies frequently contain quotes, code fences, and Markdown that the model fails to escape inside a JSON string, which then makes `JSON.parse` blow up on the whole envelope. The delimiter form sidesteps escaping entirely. [src/parseReply.ts](src/parseReply.ts) prefers the delimiter form, accepts the legacy JSON shape as a fallback, and on any parse failure defaults to `{ requires_revision: true }` (safer to run an unnecessary revision than silently skip an implied edit).
+The verdict is **agent-owned**. The human cannot flip it directly. Disagreement flows through a follow-up reply, which gives the agent a chance to re-classify rather than be silently overridden.
+## Escalation to the launching agent
+The inline review agent and the agent that *launched* `redline` share no live channel — the sidecar is the only persisted artifact, and the launching agent only regains control when the session exits. So two mechanisms carry feedback back to it:
+- **`ESCALATE` verdict.** A second, orthogonal flag in the reply envelope. The agent sets `ESCALATE: true` when a comment can't be acted on from inside the review — it needs the launching agent's project context, tools, or authority (an external style guide, a spec to check against, a wider-project decision). It's stored as `escalate?: boolean` on the agent's `ThreadEntry` and rendered as an "↑ Escalated" badge on the comment and an "↑ Routed to the launching agent" note in the thread. Escalation is independent of `requires_revision` — the agent judges each on its own.
+- **Closeout transcript.** On `finished`, the CLI loads the sidecar and prints every comment thread verbatim via [src/reviewSummary.ts](src/reviewSummary.ts) (`formatReviewSummary`), with a dedicated callout listing escalated comments (`collectEscalations`). The escalation count is also written to `.review/<file>.result` and appended to the `REDLINE_RESULT:` line. This is the launching agent's read point — it sees the transcript when control hands back.
+## Security & resilience as built
+These are post-publish hardenings (M5/M8). They're load-bearing — don't regress them when refactoring:
+- **Loopback bind.** Server is `Bun.serve({ port: 0, hostname: "127.0.0.1" })` in [src/cli.ts](src/cli.ts). No external interface.
+- **Markdown sanitization.** `marked` v9 stopped sanitizing in-band; we run [`sanitize-html`](src/render.ts) over its output. Tests in [src/render.test.ts](src/render.test.ts) cover the dangerous-payload cases — keep them passing.
+- **CSRF token.** The CLI mints `REDLINE_TOKEN` (a UUID) and threads it to the server (mints from), the agent subprocess (env), and the bundled client (via `window.__REDLINE__`). Mutating endpoints require `X-Redline-Token`. Don't add a mutating endpoint without checking the token.
+- **Realpath on static asset reads.** [src/server.ts](src/server.ts) calls `realpath` before serving anything off disk so a symlink in `.review/history/` can't traverse out.
+- **Cross-process sidecar lock.** [src/sidecar.ts](src/sidecar.ts) layers an in-memory mutex over `proper-lockfile` on `<sidecar>.lock`. Both layers are needed: in-process is the fast path, lockfile catches concurrent processes (e.g. `redline resolve` running while the server is up).
+- **Prompt-input envelopes.** Both `agent.ts` and `resolve.ts` wrap user-controlled text (doc body, comment text) in `---DOCUMENT---` / `---COMMENTS---` style markers before sending to the selected agent provider, so a comment that contains "Ignore previous instructions" is recognizable as data, not prompt.
+- **Revision watchdog.** [src/server.ts](src/server.ts) starts a 3-min timer on `/api/accept`. If `/api/reload` doesn't land in time, the round un-resolves and the server broadcasts `revision-stalled`. The revision token in `currentRevision` makes this race-safe — see the comment block at line ~129.
+- **Revision integrity check + retry.** `validateRevision` in [src/resolve.ts](src/resolve.ts) is a pure check on the model's output: it strips fences/wrapper-tags/meta-sections, then rejects output that has no headings (when the input had them) or that drops a heading whose section no settled comment touched — the signal that the model mangled the doc instead of editing it. A failed validation retries the revision once with the same model before surfacing `revision-error`; a non-zero CLI exit is not retried (environment failure, not a model stumble).
+- **Zombie SSE recovery.** [src/client/sse.ts](src/client/sse.ts) listens for `visibilitychange`/`focus` and runs a 30s heartbeat watchdog while the `.revising` banner is up, so a tab backgrounded across the whole revision recovers when refocused.
+- **Capped agent restart.** [src/cli.ts](src/cli.ts) auto-restarts the agent on unexpected exit but caps at 5 in 60s. If you see "gave up" in the log, something structural is wrong — fix root cause, don't raise the cap.
+## Tests
+`bun test` runs the suite. Server, sidecar, parsing, model-picking, rendering, diff, SSE, integration, and happy-dom client tests all live in `tests/` and `src/`.
+## Closeout
+When the user asks to "close out" a fix/patch/feature after its PR is open, additional dev-process steps may apply. They're not documented in this public file — check local/private project notes if present, or fall back to project memory.

package/CHANGELOG.md CHANGED Viewed

@@ -4,6 +4,25 @@ All notable changes to Redline are documented here. The format follows [Keep a C
 ## [Unreleased]
+## [0.4.0] - 2026-05-15
+### Added
+- Claude and Codex agent providers are now supported through a provider-neutral runtime. Use `--agent claude|codex` or `REDLINE_AGENT` to choose explicitly; otherwise Redline auto-detects an available local provider.
+- `redline install-skill --agent claude|codex|both` installs the bundled `redline-review` skill into Claude Code and/or Codex so agents can reach for Redline automatically when a Markdown doc needs human sign-off.
+### Changed
+- `AGENTS.md` is now the canonical contributor onboarding doc. `CLAUDE.md` remains as a Claude Code compatibility pointer to the same provider-neutral guidance.
+## [0.3.0] - 2026-05-15
+### Added
+- **Escalation handoff** ([#86](https://github.com/alevi/redline/pull/86)). When a comment needs something the inline review agent can't provide — an external style guide, a spec it can't see, a decision that needs the wider project — the agent flags its reply with an escalate verdict. The comment shows an "Escalated" badge, the round banner notes the count, and the session's closeout summary and `.review/<file>.result` carry an `escalations` array so the agent that launched the review picks the feedback up.
+### Fixed
+- **Revision integrity check and retry** ([#86](https://github.com/alevi/redline/pull/86)). The revision pass now validates its output on every run and rejects a revision that silently drops document sections no comment touched, instead of writing a mangled document to disk. A failed validation retries once before surfacing an error.
+- **Cross-block and image-spanning selections** ([#85](https://github.com/alevi/redline/pull/85)) now anchor correctly. Selecting across a block boundary, or across an image, no longer fails with an anchoring error.
+- **Sessions are no longer abandoned on a transient SSE drop** ([#84](https://github.com/alevi/redline/pull/84)). A backgrounded tab, laptop sleep, or a brief network blip no longer causes the server to exit; only an explicit tab close ends the session. A server that does go away surfaces a clear "session ended" banner instead of a raw fetch error.
 ## [0.2.0] - 2026-05-11
 ### Added
@@ -41,6 +60,8 @@ Initial public release on npm as `@levistudio/redline`.
 - Auto-installs missing dependencies on first CLI run.
 - Initial test suite: server, sidecar, parsing, model-picking, rendering, diff, SSE, integration, happy-dom client.
-[Unreleased]: https://github.com/alevi/redline/compare/v0.2.0...HEAD
+[Unreleased]: https://github.com/alevi/redline/compare/v0.4.0...HEAD
+[0.4.0]: https://github.com/alevi/redline/compare/v0.3.0...v0.4.0
+[0.3.0]: https://github.com/alevi/redline/releases/tag/v0.3.0
 [0.2.0]: https://github.com/alevi/redline/releases/tag/v0.2.0
 [0.1.0]: https://github.com/alevi/redline/releases/tag/v0.1.0

package/CLAUDE.md ADDED Viewed

@@ -0,0 +1,9 @@
+# Redline — Claude Code compatibility
+The canonical agent onboarding doc is [AGENTS.md](AGENTS.md). Read that file first; it is intentionally provider-neutral and applies to Claude Code, Codex, and any other agent working in this repository.
+Claude-specific notes:
+- Redline supports Claude Code through the `claude` provider in [src/agentProvider.ts](src/agentProvider.ts).
+- Keep using the local `claude -p` CLI path for Claude support. Redline inherits the user's Claude Code auth session and does not require `ANTHROPIC_API_KEY`.
+- Do not replace the local CLI provider with the Anthropic SDK unless the product direction explicitly changes. The local-agent auth model is load-bearing.

package/README.md CHANGED Viewed

@@ -2,9 +2,9 @@
 **Inline comments on Markdown files, designed for human-in-the-loop AI doc review.**
-Open a Markdown file, highlight text, leave inline comments, discuss changes with Claude, and apply accepted revisions back to the document.
+Open a Markdown file, highlight text, leave inline comments, discuss changes with a local agent, and apply accepted revisions back to the document.
-![Redline demo — highlight text, leave a comment, Claude replies, resolve to apply the edit.](docs/assets/demo.gif)
+![Redline demo — highlight text, leave a comment, an agent replies, resolve to apply the edit.](docs/assets/demo.gif)
 ## Try it in 30 seconds
@@ -14,15 +14,32 @@ bunx @levistudio/redline ./spec.md
 (or `npx @levistudio/redline ./spec.md` — Bun is required either way.)
-The terminal prints a `localhost` URL. Cmd-click it. Select any text in the document, type a comment, hit reply. Claude responds in the thread within ~2s. Resolve each comment, then click **Revise document** to have Claude apply the agreed edits back to the file on disk.
+The terminal prints a `localhost` URL. Cmd-click it. Select any text in the document, type a comment, hit reply. Your selected local agent responds in the thread within a few seconds. Resolve each comment, then click **Revise document** to have the agent apply the agreed edits back to the file on disk.
-Requires [Bun](https://bun.sh) ≥ 1.0 and an authenticated [Claude Code](https://claude.com/claude-code) session. The agent inherits your existing Claude Code OAuth — no `ANTHROPIC_API_KEY` needed.
+Requires [Bun](https://bun.sh) >= 1.0 and an authenticated local agent CLI. Redline currently supports [Claude Code](https://claude.com/claude-code) and Codex. The agent inherits your existing CLI auth session — no provider API key needed.
+## Install
+For one-off reviews, `bunx` / `npx` is enough:
+```sh
+bunx @levistudio/redline ./spec.md
+```
+To make Redline available as a normal command and install its agent skill:
+```sh
+bun add -g @levistudio/redline
+redline install-skill --agent codex   # or: claude, both
+```
+That installs the bundled `redline-review` skill into your agent environment (`~/.codex/skills/` and/or `~/.claude/skills/`). After that, ask your agent to redline a Markdown file when it needs your sign-off; it should launch Redline, surface the local URL, wait for you to finish, then continue from the approved file.
 ## Why Redline exists
-A lot of recent docs start as a Claude draft. Reviewing those drafts in a chat window is awkward. Comments scroll out from under the text. "Fix paragraph 3" loses its anchor as soon as the doc is rewritten. Google-Docs-style tools don't speak the file on disk.
+A lot of recent docs start as an AI-agent draft. Reviewing those drafts in a chat window is awkward. Comments scroll out from under the text. "Fix paragraph 3" loses its anchor as soon as the doc is rewritten. Google-Docs-style tools don't speak the file on disk.
-Redline puts an inline-comment layer on top of a local Markdown file, with a Claude agent participating in the review thread. Comments are anchored to the text they're about. The conversation is real-time. Accepted changes are applied back to the file you started with, no copy-paste.
+Redline puts an inline-comment layer on top of a local Markdown file, with a local agent participating in the review thread. Comments are anchored to the text they're about. The conversation is real-time. Accepted changes are applied back to the file you started with, no copy-paste.
 ## Who it's for
@@ -38,25 +55,25 @@ People shipping docs with the help of AI agents:
 Two long-lived processes:
 - **Server** ([src/server.ts](src/server.ts)) — renders Markdown, serves the review UI, accepts comment / reply / resolve POSTs, broadcasts SSE events.
-- **Agent** ([src/agent.ts](src/agent.ts)) — child process listening to the SSE stream. Calls `claude -p` to compose replies and post them back. When you accept a round, the agent runs the document revision pass and writes the result to disk.
+- **Agent** ([src/agent.ts](src/agent.ts)) — child process listening to the SSE stream. Calls the selected local provider (`claude` or `codex`) to compose replies and post them back. When you accept a round, the agent runs the document revision pass and writes the result to disk.
 Review state lives in a sidecar JSON file at `.review/<filename>.json` next to the doc. History snapshots of every revision land in `.review/history/<filename>.<iso>.md` *before* the revision is written, so you can roll back from disk if a revision goes sideways. Both should be gitignored unless you want them in the repo.
 After each revision, the new round opens with the document rendered inline as a diff against the previous round — insert and delete bands at the block level, word-level marks within modified paragraphs — so you can read the agent's changes in place. Use the **Show changes** / **Hide changes** toggle in the header to flip back to the clean view at any time. From there you either open another round of comments or click **Looks good** to finish.
-[CLAUDE.md](CLAUDE.md) has the full architecture tour: sidecar schema, SSE event vocabulary, model picking, frontend gotchas.
+[AGENTS.md](AGENTS.md) has the full architecture tour: sidecar schema, SSE event vocabulary, model picking, frontend gotchas. `CLAUDE.md` remains as a Claude Code compatibility pointer.
 ## Use cases
-- **Reviewing AI-generated PRDs.** Hand Claude Code a one-line brief, let it draft the PRD, then redline it line by line before passing it on.
+- **Reviewing AI-generated PRDs.** Hand your coding agent a one-line brief, let it draft the PRD, then redline it line by line before passing it on.
 - **Reviewing architecture specs.** Mark assumptions you want challenged, ask the agent to expand sections, ship the revised spec.
-- **Reviewing README drafts.** Claude wrote your README — read through, leave inline comments where the framing is off, accept the revision in one click.
-- **Approving Claude Code output before merge.** Use the bundled [redline-review skill](skills/redline-review/SKILL.md) so your outer agent automatically hands you the doc to sign off on.
+- **Reviewing README drafts.** Your agent wrote your README — read through, leave inline comments where the framing is off, accept the revision in one click.
+- **Approving agent output before merge.** Use the bundled [redline-review skill](skills/redline-review/SKILL.md) so your outer agent automatically hands you the doc to sign off on.
 - **Human approval loops for agent-written docs.** Anywhere an agent needs your sign-off on prose before continuing, run it through Redline.
 ## Reviewing for a specific lens
-Pass `--context` at startup to tell Claude what you're focused on for the review. The context is shown in a banner above the doc and threaded into both the reply and revision prompts, so the agent can weight its responses accordingly.
+Pass `--context` at startup to tell the agent what you're focused on for the review. The context is shown in a banner above the doc and threaded into both the reply and revision prompts, so the agent can weight its responses accordingly.
 ```sh
 bunx @levistudio/redline ./spec.md --context "Reviewing for technical accuracy, not prose style."
@@ -67,16 +84,16 @@ The context is persisted in the sidecar on first run, so it carries across round
 ## Manual annotation (no agent)
-If you just want inline comments on a Markdown file without a Claude conversation, pass `--no-agent`:
+If you just want inline comments on a Markdown file without an agent conversation, pass `--no-agent`:
 ```sh
 bunx @levistudio/redline ./spec.md --no-agent
 ```
-The server still runs, you can still leave comments and resolve them, but no agent process is spawned and no `claude` binary is required on your PATH. The header shows a "Manual mode" pill so the absence of replies is intentional, not broken. Useful for:
+The server still runs, you can still leave comments and resolve them, but no agent process is spawned and no provider CLI is required on your PATH. The header shows a "Manual mode" pill so the absence of replies is intentional, not broken. Useful for:
-- Annotating a doc you don't want sent to Claude
-- Trying Redline before configuring Claude Code
+- Annotating a doc you don't want sent to an agent
+- Trying Redline before configuring Claude Code or Codex
 - Quick inline-comment passes where you don't want a revision step
 ## One-shot revision
@@ -86,24 +103,25 @@ If you already have a sidecar with resolved comments and just want to apply the
 ```sh
 bunx @levistudio/redline resolve ./spec.md
 bunx @levistudio/redline resolve ./spec.md --model claude-sonnet-4-6
+bunx @levistudio/redline resolve ./spec.md --agent codex
 ```
 ## Outer-agent handoff (optional)
-Want your AI coding agent (Claude Code etc.) to invoke Redline automatically whenever it produces a Markdown doc you need to sign off on? Install the bundled skill:
+Want your coding agent to invoke Redline automatically whenever it produces a Markdown doc you need to sign off on? Install the bundled skill:
 ```sh
-git clone https://github.com/alevi/redline.git && cd redline
-./scripts/install-skill.sh
+bun add -g @levistudio/redline
+redline install-skill --agent codex   # or: claude, both
 ```
-This copies `skills/redline-review/` into `~/.claude/skills/` with absolute paths baked in. After installation, your outer agent will reach for `redline-review` whenever it has Markdown that needs human review before it can continue.
+This copies `skills/redline-review/` into your agent skills directory with a durable launcher baked in. After installation, your agent can reach for `redline-review` whenever it has Markdown that needs human review before it can continue.
 ## Local-first / security
 - Single-player. Server binds to `127.0.0.1`. No auth, no audit log, no cloud.
 - Rendered Markdown is sanitized at the render boundary; raw HTML, inline event handlers, and `javascript:` URLs are stripped before the document reaches the browser.
-- The agent inherits your existing Claude Code OAuth session; no `ANTHROPIC_API_KEY` required.
+- The agent inherits your existing Claude Code or Codex auth session; no provider API key required.
 - **Prompt injection in document content is not defended against.** Run Redline on docs you trust.
 Full threat model: [SECURITY.md](SECURITY.md).

package/SECURITY.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # Security
-Redline is a **single-player, localhost-only** review tool that runs on the operator's machine. Reviewed Markdown is rendered in a browser and fed to a local Claude subprocess. This document describes what is defended against, what is out of scope, and how to report issues.
+Redline is a **single-player, localhost-only** review tool that runs on the operator's machine. Reviewed Markdown is rendered in a browser and fed to a selected local agent subprocess. This document describes what is defended against, what is out of scope, and how to report issues.
 ## Threat model
@@ -11,9 +11,9 @@ Redline is a **single-player, localhost-only** review tool that runs on the oper
 **Out of scope.** Redline is a developer tool, not a sandbox. It is appropriate to run on documents you authored, generated, or trust. The following are **not** defended against:
-- **Adversarial Markdown trying to manipulate the agent via prompt injection.** The agent reads the document text and comment thread as input to Claude; carefully crafted content can attempt to redirect the model. Treat agent output the same way you'd treat any other AI output you didn't fully verify.
+- **Adversarial Markdown trying to manipulate the agent via prompt injection.** The agent reads the document text and comment thread as input to the selected local provider; carefully crafted content can attempt to redirect the model. Treat agent output the same way you'd treat any other AI output you didn't fully verify.
 - **Multiple `redline` processes operating on the same file.** The sidecar lock is in-process; concurrent runs on the same `.md` can corrupt review state.
-- **Malicious CLI flags or environment.** `redline` runs with the operator's full file-system permissions and shells out to `claude -p` with the operator's auth.
+- **Malicious CLI flags or environment.** `redline` runs with the operator's full file-system permissions and shells out to the selected local provider CLI with the operator's auth.
 - **Sharing reviews across operators.** Redline is single-player — it has no auth, no access control, and no audit log.
 ## What "single-player, localhost" means in practice

package/bin/redline.cjs CHANGED Viewed

@@ -47,7 +47,10 @@ if (!bun) {
   process.exit(1);
 }
-const child = spawn(bun, ["run", CLI, ...process.argv.slice(2)], { stdio: "inherit" });
+const child = spawn(bun, ["run", CLI, ...process.argv.slice(2)], {
+  stdio: "inherit",
+  env: { ...process.env, REDLINE_BIN_ABS: __filename },
+});
 child.on("exit", (code, signal) => {
   if (signal) {
     try { process.kill(process.pid, signal); } catch { process.exit(1); }

package/package.json CHANGED Viewed

@@ -1,12 +1,13 @@
 {
   "name": "@levistudio/redline",
-  "version": "0.2.0",
+  "version": "0.4.0",
   "description": "Inline comments on Markdown files, for human-in-the-loop AI doc review.",
   "keywords": [
     "markdown",
     "code-review",
     "ai",
     "claude",
+    "codex",
     "bun",
     "documentation"
   ],
@@ -33,6 +34,8 @@
     "SECURITY.md",
     "ROADMAP.md",
     "CHANGELOG.md",
+    "AGENTS.md",
+    "CLAUDE.md",
     "LICENSE"
   ],
   "engines": {