bb-cc-lite 0.1.11 → 0.1.13 (npm)

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (56)

package/README.md +171 -71
package/assets/statusline-coach-guard-demo.gif +0 -0
package/dist/audit.d.ts +47 -0
package/dist/audit.js +427 -0
package/dist/audit.js.map +1 -0
package/dist/baseline-builder.d.ts +5 -0
package/dist/baseline-builder.js +68 -31
package/dist/baseline-builder.js.map +1 -1
package/dist/baseline.d.ts +7 -2
package/dist/baseline.js +53 -3
package/dist/baseline.js.map +1 -1
package/dist/cache-efficiency.d.ts +3 -0
package/dist/cache-efficiency.js +38 -0
package/dist/cache-efficiency.js.map +1 -0
package/dist/cli.js +42 -0
package/dist/cli.js.map +1 -1
package/dist/doctor.js +10 -0
package/dist/doctor.js.map +1 -1
package/dist/event-store-persistence.js +21 -0
package/dist/event-store-persistence.js.map +1 -1
package/dist/event-store-queries.d.ts +3 -1
package/dist/event-store-queries.js +48 -1
package/dist/event-store-queries.js.map +1 -1
package/dist/feedback-outcomes.js +6 -0
package/dist/feedback-outcomes.js.map +1 -1
package/dist/feedback-policy.d.ts +3 -1
package/dist/feedback-policy.js +99 -0
package/dist/feedback-policy.js.map +1 -1
package/dist/file-identity.d.ts +8 -0
package/dist/file-identity.js +43 -0
package/dist/file-identity.js.map +1 -0
package/dist/historical-replay.d.ts +12 -0
package/dist/historical-replay.js +139 -43
package/dist/historical-replay.js.map +1 -1
package/dist/hook-control.js +7 -2
package/dist/hook-control.js.map +1 -1
package/dist/hook-payload.js +13 -4
package/dist/hook-payload.js.map +1 -1
package/dist/hook-summary.d.ts +2 -0
package/dist/hook-summary.js +22 -1
package/dist/hook-summary.js.map +1 -1
package/dist/recovery-stats.d.ts +27 -1
package/dist/recovery-stats.js +128 -5
package/dist/recovery-stats.js.map +1 -1
package/dist/settings.js +2 -1
package/dist/settings.js.map +1 -1
package/dist/signals.js +192 -19
package/dist/signals.js.map +1 -1
package/dist/status-input.js +15 -15
package/dist/transcript.d.ts +1 -0
package/dist/transcript.js +112 -1
package/dist/transcript.js.map +1 -1
package/dist/types.d.ts +57 -0
package/dist/why.js +2 -0
package/dist/why.js.map +1 -1
package/package.json +1 -1

package/README.md CHANGED

@@ -2,48 +2,100 @@
 [![CI](https://github.com/softcane/bb-cc-lite/actions/workflows/ci.yml/badge.svg)](https://github.com/softcane/bb-cc-lite/actions/workflows/ci.yml)
-Claude Code can look busy while it is doing the wrong thing: retrying the same broken test, editing without checking, filling context, or spending money on a stuck loop.
+Behavioral health monitoring for Claude Code sessions.
-`bb-cc-lite` is a small local Claude Code session supervisor. It adds a status line that answers one question:
+Claude Code can look busy while doing the wrong thing. It can retry the same failed test, edit without verification, burn tokens, fill the context window, thrash across files, and keep going long after a human should stop it.
+The small local status line answers one question:
 > Should I let this Claude Code session keep going?
-By default, it also gives Claude a short nudge when the pattern is clear. If the same test keeps failing without a fix, Claude can be told to inspect the first failure before retrying again. If Claude follows that feedback and recovers, `why` can show the loop. If you turn on guard mode, bb can deny the obvious repeated retry before it runs.
+Token spend tells you what it cost. `bb-cc-lite` tries to show whether the session behavior still looks healthy: continue, verify, or stop before more turns are burned.
+It is local. It is not a cloud dashboard, telemetry service, proxy, gateway, or message router.
+```text
+bb: Healthy | ctx 42% | $0.18 | continue normally
+bb: Healthy | validation resolved | continue normally
+bb: Careful | edits have not been checked yet | run the smallest relevant check
+bb: Careful | same test failed twice without a fix | inspect first failure
+bb: Careful | 9 non-read tool calls, no check or recovery seen | pause and ask what changed
+bb: Stop | why: same failure retried 3x without a fix | do: stop and inspect first failure
+```
 ![bb-cc-lite statusline examples](./assets/statusline-demo.gif)
-## Requirements
+_The demo shows the difference between healthy progress, unchecked edits, repeated validation failures, and Stop-level retry loops._
-- Node.js 20 or newer
-- Claude Code with status line support
+## Why This Exists
-## Install
+Activity is not progress.
-```bash
-npx --yes bb-cc-lite install --scope local
-```
+The hard part is not seeing that Claude Code is active. The hard part is knowing whether its activity is still useful. Busy output, tool calls, and token spend can hide negative progress.
-Restart Claude Code in the project. The status line appears at the bottom.
+Humans usually notice too late: after the same check has failed three times, after a broad edit streak was never verified, or after the context window is already under pressure.
-Default install uses coach mode. It keeps the status line for you and sends Claude short safe feedback when a risky loop is visible. Coach feedback can say things like: a validation check has failed repeatedly, inspect the failure pattern, make one targeted fix, then run one focused check.
+`bb-cc-lite` watches local derived signals from the Claude Code status input, transcript tail, and optional hooks. It classifies the current session as `Healthy`, `Careful`, or `Stop`.
-Coach feedback does not include prompts, command text, tool output, file contents, raw paths, raw session ids, or raw MCP names.
+## What It Catches
-Install preserves an existing Claude Code `statusLine` unless you pass `--replace`.
+- Retry loops where the same validation check or tool fails repeatedly without a fix.
+- Repeated test, lint, typecheck, or build failures.
+- Long edit streaks without a test, lint, typecheck, or build check.
+- Busy sessions with many tool calls but no observed check or recovery.
+- Repeated full-file reads of the same unchanged file.
+- Large tool results that suddenly add thousands of input tokens.
+- Cache reuse dropping after it was working well.
+- Context pressure before the session gets too full to reason clearly.
+- Compaction boundaries where Claude should restate the goal before continuing.
+- Cost and time budget warnings that make a stuck session easier to spot.
+- Local baseline patterns, when there is enough aggregate local history.
-To uninstall:
+## Healthy / Careful / Stop
-```bash
-npx --yes bb-cc-lite uninstall --scope local
+`Healthy` means the session still looks safe to continue.
+`Careful` means slow down. Ask for verification, inspect the pattern, or make the next step smaller.
+`Stop` does not mean the project failed. It means the session pattern is no longer worth blindly continuing. Take over, redirect Claude, or inspect the first failure before spending more turns.
+## How It Helps Claude
+`bb-cc-lite` can run in three install modes.
+`observe-only` keeps the status line and local derived event data, but does not send feedback to Claude.
+`coach` is the default. It keeps the status line and can send short, safe Claude-facing notes when a risky pattern is visible, such as repeated validation failure, unchecked edits, high budget with no progress signal, or unresolved validation risk at finish.
+`guard` includes coach feedback. It may deny a high-confidence repeated Bash validation retry, such as rerunning the same test/lint/typecheck/build category after repeated failures without an edit or passing check. It does not broadly block normal reads, edits, or arbitrary commands.
+`bb-cc-lite` also records safe feedback outcomes. For example, if bb asks Claude to validate after an edit and Claude runs a passing test, `bb-cc-lite why` can show that the feedback was resolved.
+When available, `why` includes the recent bb loop:
+```text
+Recent bb loop:
+1. Coach feedback: edits needed validation.
+2. Claude ran tests.
+3. Tests passed.
+4. Outcome: resolved.
 ```
-Prefer a global install?
+## Install
+Requirements:
+- Node.js 20 or newer
+- Claude Code with status line support
 ```bash
-npm install -g bb-cc-lite
-bb-cc-lite install --scope local
+npx --yes bb-cc-lite install --scope local
 ```
+Restart Claude Code in the project. The status line appears at the bottom.
+Default install uses coach mode and builds a small local baseline when possible. It preserves an existing Claude Code `statusLine` unless you pass `--replace`.
 To replace an existing status line:
 ```bash
@@ -56,81 +108,69 @@ To observe only, without sending feedback to Claude:
 npx --yes bb-cc-lite install --scope local --observe-only
 ```
-To enable stricter guard behavior:
+To enable stricter repeated-validation retry denial:
 ```bash
 npx --yes bb-cc-lite install --scope local --guard
 ```
-Guard mode includes coach feedback. It may deny a high-confidence repeated validation retry with a safe reason. It does not broadly block normal reads or edits.
 To disable baseline learning and lesson memory:
 ```bash
 npx --yes bb-cc-lite install --scope local --no-learn
 ```
-## What It Catches
-- Retry loops where the same command or test fails repeatedly without a fix.
-- Long stretches of editing without a test, lint, typecheck, or build check.
-- Busy sessions with many tool calls but no observed check or recovery.
-- Context pressure before the session gets too full to reason clearly.
-- Cost and time budget signals that make a stuck session easier to spot.
-- Project baseline patterns, when there is enough local aggregate history.
+To uninstall:
-## How It Helps Claude
+```bash
+npx --yes bb-cc-lite uninstall --scope local
+```
-In observe-only mode, bb records the pattern and updates the status line, but Claude does not receive feedback.
+Prefer a global install?
-In coach mode, bb can send Claude a short note during the session. Claude can then inspect the failure, make a targeted fix, run a focused check, or summarize why retrying is not useful.
+```bash
+npm install -g bb-cc-lite
+bb-cc-lite install --scope local
+```
-In guard mode, bb can deny a high-confidence repeated validation retry. The retry does not run, and Claude sees a safe reason.
+Supported install scopes are `local`, `project`, and `user`. Use `local` for the current repo unless you have a specific reason to edit project or user Claude settings.
-bb also records safe feedback outcomes. For example, if bb asks Claude to validate after an edit and Claude runs a passing test, `bb-cc-lite why` can show that the feedback was resolved.
+## Try Before Installing
-## What It Shows
+Run a retrospective audit against recent local Claude Code history:
-```text
-bb: Healthy | ctx 42% | $0.18 | cache warm | continue normally
-bb: Healthy | validation resolved | continue normally
-bb: Healthy | read-only exploration | continue normally
-bb: Careful | edits have not been checked yet | ask Claude to run the smallest relevant check
-bb: Careful | 9 non-read tool calls, no check or recovery seen | pause and ask Claude what changed
-bb: Careful | session ran 1h plus 9 non-read tool calls, no check or recovery seen | pause and ask Claude what changed before continuing
-bb: Careful | same test failed twice without a fix | inspect first failure
-bb: Careful | tests failed twice; usually passes after one targeted fix | inspect first failure
-bb: Careful | estimated cost $2.25 | ask Claude to summarize progress before continuing
-bb: Stop | why: same failure retried 3x without a fix | do: stop and inspect first failure
-bb: Stop | why: same test retried after feedback | do: inspect first failure
-bb: Stop | why: test loop rarely recovered after 3 failures | do: stop retrying and inspect first failure
-bb: Stop | why: high cost plus repeated failures | do: stop and inspect first failure
+```bash
+npx --yes bb-cc-lite audit --project .
+npx --yes bb-cc-lite audit --all-projects --recent 200
 ```
-`Healthy` means keep going. `Careful` means slow down and verify. `Stop` means take over before Claude burns more turns.
-When available, `why` includes the recent bb loop:
-```text
-Recent bb loop:
-1. Coach feedback: edits needed validation.
-2. Claude ran tests.
-3. Tests passed.
-4. Outcome: resolved.
-```
+`audit` scans recent local Claude Code JSONL history and reports where `bb-cc-lite` would have warned. It highlights repeated retries and risky session patterns; it only shows duplicate retry cost/time when the transcript contains usable measured metadata. It does not install a status line or hooks. Use `--all-projects` only when you want to inspect newest transcripts across local Claude projects.
 ## Useful Commands
 ```bash
+bb-cc-lite audit --project .
+bb-cc-lite audit --all-projects --recent 200
 bb-cc-lite why
 bb-cc-lite doctor
+bb-cc-lite doctor --baseline
 bb-cc-lite unlearn
 bb-cc-lite uninstall --scope local
 ```
-`why` explains the latest statusline decision and recent feedback outcomes when available. Interactive `why` output is lightly colored; set `NO_COLOR=1` or `BB_CC_LITE_COLOR=0` for plain text. `doctor --baseline` shows safe aggregate baseline facts. `unlearn` clears learned personal baselines, project baselines, and lesson memory. `uninstall` restores the previous Claude Code status line when a backup exists.
+`audit` scans recent history without installing.
+`why` explains the latest stored status line decision and recent feedback outcomes when available. It reads the local derived event store; it does not reopen transcripts to expose raw content. Interactive `why` output is lightly colored; set `NO_COLOR=1` or `BB_CC_LITE_COLOR=0` for plain text.
-Budget guard thresholds can be changed with environment variables:
+`doctor` checks Node, Claude settings, optional hooks, transcript access, pricing cache, and related diagnostics.
+`doctor --baseline` shows safe aggregate baseline facts.
+`unlearn` clears learned personal baselines, project baselines, and lesson memory.
+`uninstall` removes bb-owned status line and hooks. When a valid backup exists, it restores the previous Claude Code status line.
+Budget thresholds can be changed with environment variables:
 ```bash
 BB_CC_LITE_BUDGET_COST_USD=1.25
@@ -138,9 +178,28 @@ BB_CC_LITE_BUDGET_COST_DELTA_USD=0.25
 BB_CC_LITE_BUDGET_DURATION_MINUTES=30
 ```
-## Project-Specific Checks
+## Validation Signals
+`bb-cc-lite` observes checks Claude Code runs. It does not run tests, lint, typecheck, or build commands by itself.
+It recognizes common Bash validation commands and groups them into safe categories:
-bb already recognizes common checks such as `npm test`, `pytest`, `cargo test`, `go test`, `npm run lint`, and `npm run build`.
+- tests
+- lint
+- typecheck
+- build
+Those categories are used for retry-loop detection, recovery baselines, coach feedback, and guard-mode retry denial.
+## Session Signals
+Some warnings are not about tests. They are about the session shape.
+`bb-cc-lite` can notice when Claude rereads the same unchanged file, when one tool result makes the next turn much larger, when prompt-cache reuse drops, or when a compaction boundary needs a quick goal check.
+These signals are intentionally simple. They do not inspect raw prompts, tool output, or file contents. They use derived metadata from the status input and transcript tail.
+## Project-Specific Checks
 If your project uses custom validation commands, you can add an optional `.bb-cc-lite.json`:
@@ -156,14 +215,55 @@ If your project uses custom validation commands, you can add an optional `.bb-cc
 This file is not generated automatically. It is only needed when you want to teach bb what your project's test, lint, typecheck, or build commands look like. bb uses it for classification, but does not copy those raw commands into its event history.
+## When This Is Useful
+- Claude keeps retrying the same failing test.
+- Claude edits many files without running checks.
+- Claude rereads the same file as if it forgot the current context.
+- One tool response suddenly bloats the next turn.
+- Cache reuse drops and the session starts getting more expensive per turn.
+- Context is getting high and reasoning quality may drop.
+- Cost or time is climbing but validation is not improving.
+- A session has many tool calls but no clear recovery signal.
+- You want a quick signal before deciding to continue or intervene.
+- You want local feedback without adding a dashboard or service.
 ## Privacy
-Everything stays local. `bb-cc-lite` does not upload transcripts, prompts, tool output, shell output, file contents, API keys, raw commands, raw paths, raw Claude session ids, or raw MCP server or tool names.
+`bb-cc-lite` is local-first. There is no cloud backend, SaaS dashboard, transcript upload, proxy, gateway, or message router.
+The health event store and baselines use derived metadata only: state, reason code, counts, rates, percentiles, confidence labels, feedback outcomes such as `resolved` or `ignored`, safe categories such as `tests`, token/cost/context fields, timestamps, weak pattern labels, hashed file identities, hashed session keys, and hashed project keys.
+The health data is designed not to store prompts, assistant text, tool output, shell output, command arguments, file contents, transcript paths, workspace paths, API keys, raw Claude session ids, or raw MCP names.
+For repeated-read warnings, bb may show a short basename-style hint such as `auth.ts`. It does not store or print the full local path.
+Local files live under `~/.claude/bb-cc-lite` by default, unless `BB_CC_LITE_HOME` or other override environment variables are set. This includes:
+- `events.json` for derived local decisions and hook events.
+- `baseline.json` for the personal aggregate baseline.
+- `project-baselines/<hashed-project>.json` for aggregate project baselines.
+- `project-lessons/<hashed-project>.json` for decaying project lesson memory.
+- `litellm-pricing.json` for cached public pricing data when refreshed.
+- `backups/` for Claude settings backups used by uninstall.
-It stores derived data only: counts, rates, percentiles, confidence labels, reason codes, feedback outcomes such as `resolved` or `ignored`, safe categories such as `tests`, cost/time/context numbers, weak pattern labels, hashed session keys, and hashed project keys.
+Install backups are local settings snapshots so uninstall can restore prior Claude settings. They may contain whatever status line, hook commands, or paths existed in those Claude settings. They are not uploaded by `bb-cc-lite`.
-Project baselines are stored under the bb app home, not inside the repo. They use only a hashed project key and safe summary data from that project. Sparse or corrupt project data falls back to the personal baseline or fixed rules.
+LiteLLM is used only as public pricing data for cost estimates. `bb-cc-lite` does not run a LiteLLM proxy or route messages.
-Lesson memory is also stored under the bb app home by hashed project key. A lesson card contains only safe fields such as a reason code, safe category, confidence, counts, timestamps, and templated wording. Lesson cards decay and can be removed with `bb-cc-lite unlearn`.
+## More Examples
-LiteLLM is used only as public pricing data for cost estimates. `bb-cc-lite` does not run a proxy, gateway, dashboard, or message router.
+```text
+bb: Healthy | read-only exploration | continue normally
+bb: Careful | estimated cost $2.25 | ask Claude to summarize progress before continuing
+bb: Careful | session ran 1h plus 9 non-read tool calls, no check or recovery seen | pause and ask what changed before continuing
+bb: Careful | ctx 83% | ask Claude for a 6-bullet handoff before more work
+bb: Careful | compaction event seen | ask Claude to restate current goal and next 3 steps
+bb: Careful | same file reread twice (auth.ts) | ask Claude to use existing context before rereading
+bb: Careful | single tool result added ~12,400 tokens | compact or narrow the next step
+bb: Careful | cache reuse dropped from 68% to 29% | keep the next prompt narrow
+bb: Careful | tests failed twice; usually passes after one targeted fix | inspect first failure
+bb: Stop | why: same file reread 3x (auth.ts) | do: stop and ask why the same file is needed again
+bb: Stop | why: same test retried after feedback | do: inspect first failure
+bb: Stop | why: test loop rarely recovered after 3 failures | do: stop retrying and inspect first failure
+```
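
Reviewer note: the status line examples in this README diff all seem to share one layout, `bb: <State> | <detail>... | <action>`, with Stop lines prefixing the detail with `why:` and the action with `do:`. The sketch below shows how a script could split such a line, assuming that layout holds. The names `parseStatusLine`, `ParsedStatusLine`, and `BbState` are hypothetical and not part of the package, and the layout is inferred from the examples rather than from a documented, stable format.

```typescript
// Hypothetical helper: the "bb: <State> | ... | <action>" layout is inferred
// from the README examples above, not from a documented interface.
type BbState = "Healthy" | "Careful" | "Stop";

interface ParsedStatusLine {
  state: BbState;
  details: string[]; // middle segments, e.g. "ctx 42%", "$0.18"
  action: string; // last segment, with any "do: " prefix removed
}

function parseStatusLine(line: string): ParsedStatusLine | null {
  const segments = line.split(" | ").map((s) => s.trim());
  // A usable line needs at least a state segment and an action segment.
  if (segments.length < 2) return null;

  const match = /^bb: (Healthy|Careful|Stop)$/.exec(segments[0]);
  if (!match) return null;

  // Everything between the state and the action is treated as detail text.
  const details = segments.slice(1, -1).map((s) => s.replace(/^why: /, ""));
  const action = segments[segments.length - 1].replace(/^do: /, "");
  return { state: match[1] as BbState, details, action };
}

const parsed = parseStatusLine(
  "bb: Stop | why: same failure retried 3x without a fix | do: stop and inspect first failure"
);
console.log(parsed);
```

Lines that do not start with a recognized `bb: <State>` segment return `null`, so a caller can ignore output from other status line tools.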

package/assets/statusline-coach-guard-demo.gif ADDED

Binary file

package/dist/audit.d.ts ADDED

@@ -0,0 +1,47 @@
+import type { DecisionConfidence, DecisionState } from "./types.js";
+export interface AuditOptions {
+    projectDir?: string;
+    homeDir?: string;
+    transcriptPath?: string;
+    allProjects?: boolean;
+    recent?: number;
+    maxBytesPerTranscript?: number;
+}
+export interface AuditReport {
+    scope: "project" | "all-projects" | "transcript";
+    recentLimit: number;
+    sessionsScanned: number;
+    transcriptsFound: number;
+    unreadableTranscripts: number;
+    sessionsWithFindings: number;
+    findings: AuditFinding[];
+    repeatedRetriesSpotted: number;
+    estimatedSavings: AuditSavingsEstimate;
+    reportConfidence: DecisionConfidence;
+    reportConfidenceReason: string;
+}
+export interface AuditFinding {
+    session: number;
+    state: DecisionState;
+    confidence: DecisionConfidence;
+    reasonCode: string;
+    evidence: string;
+    action: string;
+    repeatedRetries?: number;
+    estimatedDurationMs?: number;
+    estimatedCostUsd?: number;
+    savingsEstimateSource?: "measured" | "fallback";
+}
+export interface AuditSavingsEstimate {
+    durationMinutes: number;
+    costUsd: number;
+    repeatedToolRunsAvoided: number;
+    confidence: DecisionConfidence;
+    basis: string;
+    measured: boolean;
+}
+export interface FormatAuditReportOptions {
+    color?: boolean;
+}
+export declare function runAudit(options?: AuditOptions): Promise<AuditReport>;
+export declare function formatAuditReport(report: AuditReport, options?: FormatAuditReportOptions): string;
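
Reviewer note: to make the shape of `AuditFinding` concrete, here is a minimal sketch of how a consumer might tally findings by state and add up only the cost estimates marked `measured`. It mirrors a subset of the declared fields locally instead of importing from the package, because the package's export paths are not shown in this diff. `summarizeFindings`, `FindingLike`, `FindingSummary`, and the sample reason codes are hypothetical, and the three `DecisionState` values are an assumption based on the README's Healthy / Careful / Stop wording (the real type lives in `types.d.ts`, which is not shown here).

```typescript
// Assumption: DecisionState is the three-state union described in the README.
type DecisionState = "Healthy" | "Careful" | "Stop";

// Local mirror of a subset of AuditFinding fields from the declaration above.
interface FindingLike {
  state: DecisionState;
  reasonCode: string;
  estimatedCostUsd?: number;
  savingsEstimateSource?: "measured" | "fallback";
}

interface FindingSummary {
  byState: Record<DecisionState, number>;
  measuredCostUsd: number;
}

// Hypothetical helper: counts findings per state and sums measured costs only.
function summarizeFindings(findings: FindingLike[]): FindingSummary {
  const byState: Record<DecisionState, number> = { Healthy: 0, Careful: 0, Stop: 0 };
  let measuredCostUsd = 0;
  for (const finding of findings) {
    byState[finding.state] += 1;
    // Fallback estimates are skipped so the total reflects measured metadata only.
    if (finding.savingsEstimateSource === "measured" && finding.estimatedCostUsd !== undefined) {
      measuredCostUsd += finding.estimatedCostUsd;
    }
  }
  return { byState, measuredCostUsd };
}

// Sample data with made-up reason codes, for illustration only.
const sample: FindingLike[] = [
  { state: "Stop", reasonCode: "example_retry_loop", estimatedCostUsd: 0.5, savingsEstimateSource: "measured" },
  { state: "Careful", reasonCode: "example_unchecked_edits" },
  { state: "Careful", reasonCode: "example_retry_loop", estimatedCostUsd: 0.25, savingsEstimateSource: "fallback" },
];
console.log(summarizeFindings(sample));
```

Keeping `fallback` estimates out of the total matches the README's statement that `audit` only shows duplicate retry cost and time when the transcript contains usable measured metadata.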