npm - useathena - Versions diffs - 0.1.0 → 0.2.0 - Mend

useathena 0.1.0 → 0.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (23) hide show

package/README.md +46 -17
package/dist/api/server.js +30 -4
package/dist/cli/commands.js +81 -5
package/dist/cli/export.js +35 -0
package/dist/cli/service.js +99 -0
package/dist/cli/setup.js +30 -2
package/dist/cli.js +57 -24
package/dist/core/fixtures.js +26 -0
package/dist/core/refs.js +2 -0
package/dist/engine/engine.js +15 -6
package/dist/engine/facts.js +80 -0
package/dist/engine/parse.js +31 -6
package/dist/engine/prompts.js +10 -1
package/dist/eval/run-eval.js +3 -3
package/dist/mcp/server.js +38 -10
package/dist/sensors/claude-code-hook.js +31 -2
package/dist/serve/auto-learn.js +56 -0
package/dist/serve/auto-outcome.js +63 -0
package/dist/serve/brief.js +64 -11
package/dist/store/open.js +4 -0
package/dist/store/store.js +109 -0
package/docs/schema.md +65 -6
package/package.json +1 -1

package/README.md CHANGED Viewed

@@ -44,11 +44,15 @@ The MVP is not a broad knowledge base. It is a working closed loop for tacit jud
    - Every learned rule cites the evidence that produced it.
    - Agents can propose evidence and outcomes, but cannot directly mutate durable rules.
-3. Infer tacit rules from evidence.
+3. Infer tacit rules — and explicit facts — from evidence.
    - The hypothesis engine clusters examples by domain.
    - It infers cues, expectancies, goals, rules, and boundary conditions.
    - It replay-validates against held-out examples before a rule can be served.
-   - Inference is model-backed, not a fake deterministic string matcher.
+   - It also extracts entity facts ("Acme's renewal is in September") — deduped,
+     cited, and staleness-tracked; restatements confirm instead of duplicating.
+   - Inference is model-backed, not a fake deterministic string matcher — and it
+     triggers itself: once enough new evidence accumulates, a background
+     `athena learn` runs automatically (disable with `ATHENA_AUTO_LEARN=off`).
 4. Validate before serving; review is optional.
    - Replay-validated rules are served to agents immediately, flagged with caveats.
@@ -59,10 +63,18 @@ The MVP is not a broad knowledge base. It is a working closed loop for tacit jud
 5. Brief agents before they act.
    - MCP exposes exactly four tools: `athena_brief`, `athena_open`, `athena_record`, and `athena_search`.
-   - A brief returns applicable rules, confidence, boundaries, citations, do-not-assume items, open questions, and a readiness verdict.
-6. Learn from outcomes.
-   - Agents record whether their briefed output was accepted, corrected, abandoned, or unknown.
+   - A brief returns applicable rules, confidence, boundaries, citations, entity facts, do-not-assume items, open questions, and a readiness verdict.
+   - It also carries a knowledge map — what else athena knows and the query that
+     retrieves each slice — so agents pull what they need instead of being flooded.
+6. Learn from outcomes — mostly without anyone reporting.
+   - Agents register the artifact they produced (`athena_record type=output`); when
+     a sensor later captures what the human actually did with it, the outcome
+     records itself: approval → uncorrected, edit → corrected with the diff as a
+     counterexample. Matching is conservative; a false match would corrupt trust.
+   - Short approval sign-offs in Claude Code ("lgtm", "ship it") count as observed
+     approvals; unresolved drafts expire to `unknown` after the match window.
+   - Explicit reporting still works everywhere sensors can't see.
    - Rules gain trust when upheld and lose trust when overridden.
    - Repeated counterexamples make rules stale.
@@ -75,18 +87,26 @@ Implemented:
 - Core schema and typed domain model.
 - SQLite store with invariants and lexical search.
 - Capture ingestion with structured diffs.
-- LLM-backed hypothesis engine.
+- LLM-backed hypothesis engine, plus entity-fact extraction from the same
+  evidence (deduped, cited, staleness-tracked; `athena facts`).
+- Auto-learn: sensors trigger a background `athena learn` once enough new
+  evidence accumulates — the loop no longer stalls on a forgotten command.
 - Model-agnostic providers: `cli:claude`, `cli:codex`, Anthropic API,
   OpenAI-compatible APIs, and local models via Ollama / LM Studio / vLLM.
 - Golden-scenario eval harness.
-- Brief compilation, outcome recording, and autonomous rule promotion
-  (validated rules graduate to active through upheld outcomes).
+- Brief compilation with a knowledge map, outcome recording, and autonomous rule
+  promotion (validated rules graduate to active through upheld outcomes).
+- Automatic outcome detection: agents register drafts (`athena_record
+  type=output` / `athena record output`), sensors match what the human actually
+  sent, and the outcome records itself.
 - MCP server with the four-tool agent surface (`athena mcp`).
 - CLI with a first-run `athena setup` wizard, plus init, status, capture, learn,
-  review, brief, rules, open, record, serve, and hook install.
-- Claude Code sensor hook (installed by `athena setup` or `athena hook install`).
-- Localhost API for browser sensors.
+  review, brief, rules, facts, open, record, export, serve, and hook install.
+- Claude Code sensor hook (corrections, approvals, and `remember:` notes).
+- Localhost API for browser sensors — installable as a login service
+  (`athena serve install`, done by setup) so capture survives reboots.
 - Chrome extension v0.1 for LinkedIn and Gmail draft-edit capture.
+- `athena export`: one JSON bundle of everything learned (`--redact` for sharing).
 - One-command onboarding: `npx useathena onboard --yes` (npm package `useathena`,
   binary `athena`; GitHub installs work too).
@@ -141,9 +161,9 @@ Onboarding does everything: creates the local store, detects which model provide
 you have (claude CLI, codex CLI, Anthropic or OpenAI API keys, a running Ollama),
 verifies the one you pick with a real inference call, installs itself globally so
 hooks survive the npx cache, wires up Claude Code (sensor hook + MCP registration),
-and prints the browser-extension setup. `--yes` takes the first detected provider
-and says yes to all of it; opt out per-piece with `--no-hook`, `--skip-test`, or
-`--model <spec>`.
+installs the browser-sensor API as a login service, and prints the extension setup
+with its token. `--yes` takes the first detected provider and says yes to all of it;
+opt out per-piece with `--no-hook`, `--no-service`, `--skip-test`, or `--model <spec>`.
 Model providers are spec strings, set once by `setup` (or per-run via `ATHENA_MODEL`):
@@ -194,10 +214,12 @@ npm run athena -- capture correction \
   --after "Saw your post on GTM hiring. Would be glad to connect."
 ```
-Learn candidate rules from captured evidence:
+Learn candidate rules (and entity facts) from captured evidence — this also runs
+automatically in the background once enough new evidence accumulates:
 ```bash
 npm run athena -- learn --domain linkedin.outreach
+npm run athena -- facts
 ```
 Review inferred rules:
@@ -212,12 +234,19 @@ Ask for the kind of brief an agent should get before acting:
 npm run athena -- brief "draft a LinkedIn connection note to a GTM leader" --domain linkedin.outreach
 ```
-Start the localhost API for the Chrome extension:
+Start the localhost API for the Chrome extension (or install it as a login
+service with `serve install`):
 ```bash
 npm run athena -- serve
 ```
+Export everything athena has learned as one JSON bundle (`--redact` strips raw text):
+```bash
+npm run athena -- export
+```
 ## Design principles
 - Instances are immutable evidence; hypotheses are revisable views over evidence.

package/dist/api/server.js CHANGED Viewed

@@ -1,6 +1,10 @@
 import { createServer } from "node:http";
-import { timingSafeEqual } from "node:crypto";
-import { ingestSensorEvent } from "../capture/ingest.js";
+import { randomBytes, timingSafeEqual } from "node:crypto";
+import { mkdirSync, readFileSync, writeFileSync } from "node:fs";
+import { dirname, join } from "node:path";
+import { dbPath } from "../config.js";
+import { ingestAndMatch } from "../serve/auto-outcome.js";
+import { maybeAutoLearn } from "../serve/auto-learn.js";
 /**
  * The localhost door for browser sensors. Deliberately tiny:
  *   GET  /health  → liveness, no auth
@@ -20,6 +24,22 @@ const INSTANCE_KINDS = new Set([
     "approval",
     "manual_note",
 ]);
+/** The serve token guards the local API against other local processes. */
+export function loadOrCreateServeToken() {
+    const tokenPath = join(dirname(dbPath()), "serve.token");
+    try {
+        const existing = readFileSync(tokenPath, "utf8").trim();
+        if (existing.length >= 16)
+            return existing;
+    }
+    catch {
+        // first run — create below
+    }
+    const token = randomBytes(24).toString("base64url");
+    mkdirSync(dirname(tokenPath), { recursive: true });
+    writeFileSync(tokenPath, `${token}\n`, { mode: 0o600 });
+    return token;
+}
 export function startApiServer(store, options) {
     const server = createServer((request, response) => {
         void handle(store, options, request, response);
@@ -47,8 +67,14 @@ async function handle(store, options, request, response) {
         try {
             const body = await readBody(request);
             const event = parseEvent(body, options.sensorId ?? "sen_extension");
-            const instance = ingestSensorEvent(store, event);
-            json(response, 201, { captured: instance.id, kind: instance.kind, domain: instance.situation.domain });
+            const { instance, autoOutcome } = ingestAndMatch(store, event);
+            maybeAutoLearn(store);
+            json(response, 201, {
+                captured: instance.id,
+                kind: instance.kind,
+                domain: instance.situation.domain,
+                ...(autoOutcome ? { autoOutcome: { id: autoOutcome.id, result: autoOutcome.result } } : {}),
+            });
         }
         catch (error) {
             json(response, 400, { error: error instanceof Error ? error.message : "invalid request" });

package/dist/cli/commands.js CHANGED Viewed

@@ -1,9 +1,13 @@
 import { readFileSync } from "node:fs";
+import { createHash } from "node:crypto";
+import { newId } from "../core/ids.js";
 import { refTo } from "../core/refs.js";
 import { openRef } from "../store/open.js";
-import { ingestSensorEvent } from "../capture/ingest.js";
+import { ingestAndMatch } from "../serve/auto-outcome.js";
+import { AUTO_LEARN_THRESHOLD, LAST_LEARN_AT } from "../serve/auto-learn.js";
 import { compileBrief } from "../serve/brief.js";
 import { LlmHypothesisEngine } from "../engine/engine.js";
+import { materializeFacts } from "../engine/facts.js";
 import { bold, confidenceBar, cyan, dim, green, hitRate, readinessBadge, red, statusBadge, wrap, yellow } from "./format.js";
 /**
  * Command logic, pure-ish: store in, rendered string out. The interactive
@@ -33,6 +37,16 @@ export function cmdStatus(store) {
                 : ""),
         `  ${bold(String(counts.briefs))} briefs served, ${bold(String(counts.outcomes))} outcomes recorded`,
     ];
+    if (counts.facts > 0) {
+        lines.splice(3, 0, `  ${bold(String(counts.facts))} fact${counts.facts === 1 ? "" : "s"} about ${bold(String(store.listObjects().length))} entities`);
+    }
+    if (counts.unmatchedDrafts > 0) {
+        lines.push(`  ${bold(String(counts.unmatchedDrafts))} agent draft${counts.unmatchedDrafts === 1 ? "" : "s"} awaiting an observed outcome`);
+    }
+    const pending = store.countInstancesSince(store.getMeta(LAST_LEARN_AT));
+    if (pending > 0) {
+        lines.push(`  ${bold(String(pending))} instance${pending === 1 ? "" : "s"} awaiting inference ${dim(`(auto-learn runs at ${AUTO_LEARN_THRESHOLD}; or now: athena learn)`)}`);
+    }
     if (judged.length > 0) {
         const rate = Math.round((corrected / judged.length) * 100);
         const paint = rate <= 30 ? green : rate <= 60 ? yellow : red;
@@ -118,8 +132,11 @@ export function cmdCapture(store, flags) {
         ...(flags.before !== undefined ? { before: { mediaType: "text/plain", content: maybeFile(flags.before) } } : {}),
         ...(flags.after !== undefined ? { after: { mediaType: "text/plain", content: maybeFile(flags.after) } } : {}),
     };
-    const instance = ingestSensorEvent(store, event);
-    return `${green("captured")} ${instance.id} ${dim(`(${instance.kind}, ${instance.situation.domain})`)}`;
+    const { instance, autoOutcome } = ingestAndMatch(store, event);
+    const matched = autoOutcome
+        ? `\n${green("✓")} matched a registered agent draft — outcome ${autoOutcome.result} recorded (${autoOutcome.id})`
+        : "";
+    return `${green("captured")} ${instance.id} ${dim(`(${instance.kind}, ${instance.situation.domain})`)}${matched}`;
 }
 function maybeFile(value) {
     return value.startsWith("@") ? readFileSync(value.slice(1), "utf8") : value;
@@ -134,20 +151,79 @@ export async function cmdLearn(store, model, options = {}) {
     }
     const engine = new LlmHypothesisEngine(model);
     const inferred = await engine.infer(instances);
+    store.setMeta(LAST_LEARN_AT, new Date().toISOString());
     const existingRules = new Set(store.listHypotheses({ limit: 1000 }).map((h) => h.rule));
     let saved = 0;
     const lines = [];
-    for (const hypothesis of inferred) {
+    for (const hypothesis of inferred.hypotheses) {
         if (existingRules.has(hypothesis.rule))
             continue;
         store.saveHypothesis(hypothesis);
         saved += 1;
         lines.push("", renderRule(hypothesis));
     }
-    const header = `${bold(String(saved))} new rule${saved === 1 ? "" : "s"} from ${instances.length} instances ${dim(`(engine: ${model.id})`)}`;
+    const { created, confirmed } = materializeFacts(store, inferred.facts);
+    for (const fact of created) {
+        const entity = store.getObject(fact.objectId);
+        lines.push("", `${cyan("fact")} ${dim(`(${entity?.name ?? fact.objectId})`)} ${wrap(fact.statement, "  ").trim()}`);
+    }
+    const factNote = created.length + confirmed.length > 0
+        ? `, ${bold(String(created.length))} new fact${created.length === 1 ? "" : "s"}${confirmed.length > 0 ? ` (${confirmed.length} confirmed)` : ""}`
+        : "";
+    const header = `${bold(String(saved))} new rule${saved === 1 ? "" : "s"}${factNote} from ${instances.length} instances ${dim(`(engine: ${model.id})`)}`;
     const footer = saved > 0 ? `\n\n${yellow("review them: athena review")}` : "";
     return header + lines.join("\n") + footer;
 }
+export function cmdFacts(store, options = {}) {
+    let facts;
+    if (options.entity) {
+        const objects = store.resolveObject(options.entity);
+        if (objects.length === 0)
+            return dim(`no entity matches "${options.entity}"`);
+        facts = objects.flatMap((object) => store.listFacts({ objectId: object.id }));
+    }
+    else {
+        facts = store.listFacts(options.domain !== undefined ? { domain: options.domain } : {});
+    }
+    if (facts.length === 0) {
+        return dim("no facts yet — facts are extracted from evidence when you run: athena learn");
+    }
+    return facts
+        .map((fact) => {
+        const entity = store.getObject(fact.objectId);
+        return [
+            `${cyan(entity?.name ?? "?")}  ${confidenceBar(fact.confidence)}  ${dim(fact.domain)}  ${dim(fact.id)}`,
+            wrap(fact.statement, "  "),
+            dim(`  ${fact.supportingInstanceIds.length} supporting, last confirmed ${fact.lastConfirmedAt.slice(0, 10)}`),
+        ].join("\n");
+    })
+        .join("\n\n");
+}
+export function cmdRecordDraft(store, briefId, content) {
+    const brief = store.getBrief(briefId);
+    if (!brief)
+        throw new Error(`unknown brief ${briefId}`);
+    const draft = {
+        id: newId("drf"),
+        briefId: brief.id,
+        content,
+        contentHash: createHash("sha256").update(content).digest("hex").slice(0, 16),
+        mediaType: "text/plain",
+        domain: domainOfBrief(store, brief.id),
+        recordedAt: new Date().toISOString(),
+    };
+    store.saveDraft(draft);
+    return `${green("registered")} ${draft.id} ${dim("— the outcome records itself when a sensor sees the final version")}`;
+}
+function domainOfBrief(store, briefId) {
+    const brief = store.getBrief(briefId);
+    if (brief?.domain)
+        return brief.domain;
+    const first = brief?.rules[0];
+    if (first)
+        return store.getHypothesis(first.hypothesisId)?.domain ?? "general";
+    return "general";
+}
 // --- review queue ---
 export function reviewQueue(store) {
     return store

package/dist/cli/export.js ADDED Viewed

@@ -0,0 +1,35 @@
+/**
+ * One self-describing JSON bundle of everything athena has learned and how it
+ * got there — the raw material for improving the engine: wrong rules become
+ * eval scenarios, missed captures become sensor fixes, outcome history grades
+ * the briefs. `redact` strips raw text (artifact contents, probe answers) and
+ * keeps structure + metrics, for sharing outside the inner circle.
+ */
+export const EXPORT_FORMAT_VERSION = 1;
+export function buildExport(store, options = {}) {
+    const instances = store.listInstances({ limit: 10_000 });
+    return {
+        athenaExport: EXPORT_FORMAT_VERSION,
+        exportedAt: new Date().toISOString(),
+        redacted: options.redact === true,
+        counts: store.counts(),
+        hypotheses: store.listHypotheses({ limit: 10_000 }),
+        facts: store.listFacts({ limit: 10_000 }),
+        objects: store.listObjects(),
+        instances: options.redact ? instances.map(redactInstance) : instances,
+        briefs: store.listBriefs(10_000),
+        outcomes: store.listOutcomes(),
+        drafts: store.listDrafts().map((draft) => (options.redact ? { ...draft, content: "(redacted)" } : draft)),
+    };
+}
+function redactInstance(instance) {
+    return {
+        ...instance,
+        ...(instance.before ? { before: { ...instance.before, content: "(redacted)" } } : {}),
+        ...(instance.after ? { after: { ...instance.after, content: "(redacted)" } } : {}),
+        ...(instance.diff
+            ? { diff: { ...instance.diff, hunks: [] } }
+            : {}),
+        probeAnswers: instance.probeAnswers.map((probe) => ({ ...probe, answer: "(redacted)" })),
+    };
+}

package/dist/cli/service.js ADDED Viewed

@@ -0,0 +1,99 @@
+import { spawnSync } from "node:child_process";
+import { existsSync, mkdirSync, rmSync, writeFileSync } from "node:fs";
+import { homedir } from "node:os";
+import { dirname, join } from "node:path";
+import { dbPath } from "../config.js";
+/**
+ * Keep `athena serve` running without a dedicated terminal: a per-user login
+ * service (launchd on macOS, systemd --user on Linux) runs the same command
+ * the user would run by hand and restarts it if it dies. Logs land next to
+ * the store so `athena serve` stays debuggable.
+ */
+export const SERVICE_LABEL = "com.useathena.serve";
+export const SYSTEMD_UNIT = "athena-serve.service";
+export function launchdPlist(nodePath, athenaBin, logPath) {
+    return `<?xml version="1.0" encoding="UTF-8"?>
+<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
+<plist version="1.0">
+<dict>
+  <key>Label</key><string>${SERVICE_LABEL}</string>
+  <key>ProgramArguments</key>
+  <array>
+    <string>${nodePath}</string>
+    <string>${athenaBin}</string>
+    <string>serve</string>
+  </array>
+  <key>RunAtLoad</key><true/>
+  <key>KeepAlive</key><true/>
+  <key>StandardOutPath</key><string>${logPath}</string>
+  <key>StandardErrorPath</key><string>${logPath}</string>
+</dict>
+</plist>
+`;
+}
+export function systemdUnit(nodePath, athenaBin) {
+    return `[Unit]
+Description=athena local API for browser sensors
+[Service]
+ExecStart=${nodePath} ${athenaBin} serve
+Restart=always
+[Install]
+WantedBy=default.target
+`;
+}
+export function installService(athenaBin, platform = process.platform) {
+    const logPath = join(dirname(dbPath()), "serve.log");
+    if (platform === "darwin") {
+        const plistPath = join(homedir(), "Library", "LaunchAgents", `${SERVICE_LABEL}.plist`);
+        mkdirSync(dirname(plistPath), { recursive: true });
+        writeFileSync(plistPath, launchdPlist(process.execPath, athenaBin, logPath));
+        spawnSync("launchctl", ["unload", plistPath], { stdio: "ignore" }); // replace any previous version
+        const load = spawnSync("launchctl", ["load", plistPath], { encoding: "utf8" });
+        if (load.status !== 0) {
+            return {
+                installed: false,
+                path: plistPath,
+                message: `wrote ${plistPath} but launchctl load failed: ${(load.stderr ?? "").trim().slice(0, 200)}`,
+            };
+        }
+        return { installed: true, path: plistPath, message: `serve runs at login (launchd ${SERVICE_LABEL}, logs: ${logPath})` };
+    }
+    if (platform === "linux") {
+        const unitPath = join(homedir(), ".config", "systemd", "user", SYSTEMD_UNIT);
+        mkdirSync(dirname(unitPath), { recursive: true });
+        writeFileSync(unitPath, systemdUnit(process.execPath, athenaBin));
+        spawnSync("systemctl", ["--user", "daemon-reload"], { stdio: "ignore" });
+        const enable = spawnSync("systemctl", ["--user", "enable", "--now", SYSTEMD_UNIT], { encoding: "utf8" });
+        if (enable.status !== 0) {
+            return {
+                installed: false,
+                path: unitPath,
+                message: `wrote ${unitPath} but systemctl enable failed: ${(enable.stderr ?? "").trim().slice(0, 200)}`,
+            };
+        }
+        return { installed: true, path: unitPath, message: `serve runs at login (systemd --user ${SYSTEMD_UNIT})` };
+    }
+    return { installed: false, message: `no service template for ${platform} — run "athena serve" manually or add it to your startup apps` };
+}
+export function uninstallService(platform = process.platform) {
+    if (platform === "darwin") {
+        const plistPath = join(homedir(), "Library", "LaunchAgents", `${SERVICE_LABEL}.plist`);
+        if (!existsSync(plistPath))
+            return { installed: false, message: "no launchd agent installed" };
+        spawnSync("launchctl", ["unload", plistPath], { stdio: "ignore" });
+        rmSync(plistPath);
+        return { installed: false, path: plistPath, message: `removed ${plistPath}` };
+    }
+    if (platform === "linux") {
+        const unitPath = join(homedir(), ".config", "systemd", "user", SYSTEMD_UNIT);
+        if (!existsSync(unitPath))
+            return { installed: false, message: "no systemd unit installed" };
+        spawnSync("systemctl", ["--user", "disable", "--now", SYSTEMD_UNIT], { stdio: "ignore" });
+        rmSync(unitPath);
+        spawnSync("systemctl", ["--user", "daemon-reload"], { stdio: "ignore" });
+        return { installed: false, path: unitPath, message: `removed ${unitPath}` };
+    }
+    return { installed: false, message: `nothing to uninstall on ${platform}` };
+}

package/dist/cli/setup.js CHANGED Viewed

@@ -4,14 +4,16 @@ import { homedir } from "node:os";
 import { dirname, join, sep } from "node:path";
 import { createInterface } from "node:readline/promises";
 import { configPath, dbPath, loadConfig, saveConfig } from "../config.js";
+import { loadOrCreateServeToken } from "../api/server.js";
 import { DEFAULT_ANTHROPIC_MODEL, modelClientFromSpec, SUPPORTED_SPECS } from "../model/registry.js";
+import { installService } from "./service.js";
 import { bold, cyan, dim, green, red, yellow } from "./format.js";
 /**
  * First-run wizard (aliases: setup, onboard): pick a model provider, prove it
  * answers, wire up the sensors. `npx useathena onboard --yes` is the one-command
  * path: it accepts every default, including a global self-install when running
  * from the npx cache (hooks and MCP registrations must outlive cache pruning).
- * Granular escape hatches: --model <spec>, --hook/--no-hook, --skip-test.
+ * Granular escape hatches: --model <spec>, --hook/--no-hook, --no-service, --skip-test.
  */
 const OLLAMA_TAGS_URL = "http://127.0.0.1:11434/api/tags";
 export async function runSetup(args, packageRoot) {
@@ -35,9 +37,15 @@ export async function runSetup(args, packageRoot) {
             console.log(`  claude mcp add --scope user athena -- ${home.bin} mcp`);
             console.log(dim(`  (other MCP clients: stdio command "${home.bin} mcp")`));
         }
+        const serviceRunning = await offerServeService(args, rl, home.bin);
         console.log(`\n${bold("browser sensor")} ${dim("(LinkedIn and Gmail draft edits)")}:`);
         console.log(`  1. chrome://extensions → Developer mode → Load unpacked → ${join(home.root, "apps", "chrome-extension")}`);
-        console.log(`  2. run ${bold("athena serve")} and paste the printed token into the extension options page`);
+        if (serviceRunning) {
+            console.log(`  2. extension options page → token ${bold(loadOrCreateServeToken())}`);
+        }
+        else {
+            console.log(`  2. run ${bold("athena serve")} and paste the printed token into the extension options page`);
+        }
         console.log(`\n${bold("the loop:")} capture runs in the background — then`);
         console.log(`  ${cyan("athena learn")}            infer tacit rules from captured evidence`);
         console.log(`  ${cyan("athena rules")}            see what athena believes (and how confidently)`);
@@ -212,6 +220,26 @@ async function ensureDurableInstall(rl, packageRoot) {
     }
     return local;
 }
+/**
+ * The browser sensor only captures while `athena serve` is up — installing it
+ * as a login service removes the "keep a terminal open" failure mode.
+ */
+async function offerServeService(args, rl, athenaBin) {
+    if (args.includes("--no-service"))
+        return false;
+    let wanted = args.includes("--yes");
+    if (!wanted && rl) {
+        const answer = (await rl.question(`\nRun the browser-sensor API at login? ${dim("(background service for athena serve)")} [Y/n] `))
+            .trim()
+            .toLowerCase();
+        wanted = answer === "" || answer === "y" || answer === "yes";
+    }
+    if (!wanted)
+        return false;
+    const result = installService(athenaBin);
+    console.log(result.installed ? `${green("✓")} ${result.message}` : yellow(`  ${result.message}`));
+    return result.installed;
+}
 /** Wire up Claude Code end to end: sensor hook + MCP registration. */
 async function offerClaudeCode(args, rl, athenaBin) {
     if (args.includes("--no-hook"))

package/dist/cli.js CHANGED Viewed

@@ -1,15 +1,17 @@
 import { createInterface } from "node:readline/promises";
 import { appendFileSync, mkdirSync, readFileSync, writeFileSync } from "node:fs";
-import { randomBytes } from "node:crypto";
 import { homedir } from "node:os";
 import { dirname, join } from "node:path";
 import { fileURLToPath } from "node:url";
 import { AthenaStore } from "./store/store.js";
 import { dbPath } from "./config.js";
-import { startApiServer } from "./api/server.js";
+import { loadOrCreateServeToken, startApiServer } from "./api/server.js";
+import { installService, uninstallService } from "./cli/service.js";
 import { handleUserPrompt } from "./sensors/claude-code-hook.js";
 import { modelClientFromSpec, resolveModelSpec } from "./model/registry.js";
-import { applyReview, cmdBrief, cmdCapture, cmdLearn, cmdOpen, cmdRules, cmdStatus, renderReviewItem, reviewQueue, } from "./cli/commands.js";
+import { applyReview, cmdBrief, cmdCapture, cmdFacts, cmdLearn, cmdOpen, cmdRecordDraft, cmdRules, cmdStatus, renderReviewItem, reviewQueue, } from "./cli/commands.js";
+import { buildExport } from "./cli/export.js";
+import { maybeAutoLearn } from "./serve/auto-learn.js";
 import { recordOutcome } from "./serve/outcome.js";
 import { loadConfig } from "./config.js";
 import { installClaudeCodeHook, runSetup } from "./cli/setup.js";
@@ -26,17 +28,21 @@ usage: athena <command> [args]
   ${bold("init")}                      create the local store and print MCP setup
   ${bold("status")}                    what athena has captured, learned, and served
   ${bold("rules")} [--domain d] [--all]  the learned tacit rules (--all includes stale/retired)
+  ${bold("facts")} [--entity e] [--domain d]  the extracted entity facts (athena's explicit knowledge)
   ${bold("review")}                    review queue: approve or reject inferred rules (optional —
                             rules also graduate on their own via upheld outcomes)
-  ${bold("learn")} [--domain d]        run the hypothesis engine over captured instances
+  ${bold("learn")} [--domain d]        infer tacit rules AND extract entity facts from evidence
   ${bold("capture")} <kind> --summary "..." [--domain d] [--before t|@f] [--after t|@f]
                             capture a judgment moment (kinds: correction, override,
                             decision, escalation, failed_attempt, approval, manual_note)
   ${bold("brief")} "task" [--domain d]  compile the brief an agent would get for a task
   ${bold("record")} outcome <briefId> <uncorrected|corrected|abandoned>
+  ${bold("record")} output <briefId> <text|@file>   register an agent draft for automatic outcomes
+  ${bold("export")} [--out f] [--redact]  one JSON bundle of everything learned (for analysis)
   ${bold("open")} <athena://ref | id>  inspect any entity
   ${bold("hook")} install [--print]    install the Claude Code sensor hook into ~/.claude/settings.json
   ${bold("serve")} [--port n]          local API for browser sensors (default 127.0.0.1:4517)
+  ${bold("serve")} install|uninstall   run the API at login (launchd/systemd user service)
   ${bold("mcp")}                       run the MCP stdio server (claude mcp add athena -- athena mcp)
 store: ${dbPath()}  ${dim("(override with ATHENA_DB)")}
@@ -108,6 +114,7 @@ async function main() {
                     ...(app !== undefined ? { app } : {}),
                 };
                 console.log(cmdCapture(store, flags));
+                maybeAutoLearn(store);
                 break;
             }
             case "learn": {
@@ -123,14 +130,42 @@ async function main() {
                 break;
             }
             case "record": {
-                const [sub, briefId, result] = args;
-                if (sub !== "outcome" || !briefId || !result) {
+                const [sub, briefId, value] = args;
+                if (sub === "output") {
+                    if (!briefId || !value)
+                        throw new Error("usage: athena record output <briefId> <text|@file>");
+                    const content = value.startsWith("@") ? readFileSync(value.slice(1), "utf8") : value;
+                    console.log(cmdRecordDraft(store, briefId, content));
+                    break;
+                }
+                if (sub !== "outcome" || !briefId || !value) {
                     throw new Error("usage: athena record outcome <briefId> <uncorrected|corrected|abandoned>");
                 }
-                const outcome = recordOutcome(store, { briefId: briefId, result: result });
+                const outcome = recordOutcome(store, { briefId: briefId, result: value });
                 console.log(`${green("✓")} recorded ${outcome.id} (${outcome.result})`);
                 break;
             }
+            case "facts": {
+                const entity = flag(args, "entity");
+                const domain = flag(args, "domain");
+                console.log(cmdFacts(store, {
+                    ...(entity !== undefined ? { entity } : {}),
+                    ...(domain !== undefined ? { domain } : {}),
+                }));
+                break;
+            }
+            case "export": {
+                const out = flag(args, "out") ?? `athena-export-${new Date().toISOString().slice(0, 10)}.json`;
+                const bundle = JSON.stringify(buildExport(store, { redact: args.includes("--redact") }), null, 2);
+                if (out === "-") {
+                    console.log(bundle);
+                }
+                else {
+                    writeFileSync(out, `${bundle}\n`);
+                    console.log(`${green("✓")} exported to ${bold(out)}${args.includes("--redact") ? dim(" (raw text redacted)") : ""}`);
+                }
+                break;
+            }
             case "open": {
                 const ref = args[0];
                 if (!ref)
@@ -139,6 +174,18 @@ async function main() {
                 break;
             }
             case "serve": {
+                if (args[0] === "install") {
+                    const result = installService(athenaBin);
+                    console.log(result.installed ? `${green("✓")} ${result.message}` : yellow(result.message));
+                    if (result.installed) {
+                        console.log(`  token: ${bold(loadOrCreateServeToken())} ${dim("(paste into the extension options page)")}`);
+                    }
+                    break;
+                }
+                if (args[0] === "uninstall") {
+                    console.log(uninstallService().message);
+                    break;
+                }
                 const port = Number(flag(args, "port") ?? 4517);
                 const token = loadOrCreateServeToken();
                 const server = await startApiServer(store, { token, port });
@@ -181,22 +228,6 @@ async function main() {
         store.close();
     }
 }
-/** The serve token guards the local API against other local processes. */
-function loadOrCreateServeToken() {
-    const tokenPath = join(dirname(dbPath()), "serve.token");
-    try {
-        const existing = readFileSync(tokenPath, "utf8").trim();
-        if (existing.length >= 16)
-            return existing;
-    }
-    catch {
-        // first run — create below
-    }
-    const token = randomBytes(24).toString("base64url");
-    mkdirSync(dirname(tokenPath), { recursive: true });
-    writeFileSync(tokenPath, `${token}\n`, { mode: 0o600 });
-    return token;
-}
 /**
  * Hook mode runs on every prompt the user types into Claude Code:
  * stdout stays silent (it would pollute agent context), errors go to
@@ -238,7 +269,9 @@ async function runHook(args) {
             return;
         const store = new AthenaStore(dbPath());
         try {
-            handleUserPrompt(store, input);
+            const result = handleUserPrompt(store, input);
+            if (result.captured)
+                maybeAutoLearn(store);
         }
         finally {
             store.close();