npm - @lumoai/cli - Versions diffs - 1.41.0 → 1.42.0 - Mend

@lumoai/cli 1.41.0 → 1.42.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (15) hide show

package/assets/skill/SKILL.md +47 -41
package/assets/skill/references/criteria.md +97 -252
package/assets/skill/references/doc-editing.md +146 -0
package/assets/skill/references/docs.md +49 -278
package/assets/skill/references/memory.md +29 -0
package/assets/skill/references/onboarding.md +4 -4
package/assets/skill/references/sessions.md +84 -97
package/assets/skill/references/task-context.md +65 -149
package/assets/skill/references/task-deps.md +113 -0
package/assets/skill/references/tasks.md +2 -133
package/assets/skill/references/verify.md +121 -201
package/assets/skill/references/worktree.md +21 -36
package/dist/cli/src/commands/memory-fold.js +76 -0
package/dist/cli/src/index.js +6 -0
package/package.json +1 -1

package/assets/skill/references/verify.md CHANGED Viewed

@@ -1,11 +1,11 @@
 # lumo verify — machine verification loop
 `lumo verify` is the machine half of the acceptance system (Acceptance v1,
-LUM-343). It executes every **MACHINE** criterion's checkpointer in the local
-repo, reports one structured PASS/FAIL verdict per criterion to the server,
-and prints what to do next. The judge lives server-side: round numbering, the
-3-round cap, escalation, and the IN_REVIEW transition all happen there
-(execution on the client, adjudication on the server).
+LUM-343): it executes every **MACHINE** criterion's checkpointer in the local
+repo, POSTs one structured PASS/FAIL verdict per criterion, and prints what to
+do next. Execution is on the client; adjudication is server-side — round
+numbering, the **3-round cap**, escalation, and the **IN_REVIEW** transition all
+happen on the server.
 ## The claim-done rule
@@ -13,10 +13,10 @@ and prints what to do next. The judge lives server-side: round numbering, the
 touching its status — run `lumo verify`.** The loop replaces "I read the code
 and it looks done" with executed evidence.
-```
-lumo verify              # session-bound task
-lumo verify LUM-42       # explicit task (overrides the session binding)
-lumo verify --timeout 900  # per-checkpointer timeout in seconds (default 600)
+```bash
+lumo verify                 # session-bound task
+lumo verify LUM-42          # explicit task (overrides the session binding)
+lumo verify --timeout 900   # per-checkpointer timeout in seconds (default 600)
 ```
 ## What one round does
@@ -28,84 +28,60 @@ lumo verify --timeout 900  # per-checkpointer timeout in seconds (default 600)
    criterion at round = previous max + 1 and mirrors each verdict as a
    TaskActivity event.
 4. Prints the round outcome:
-   - **All PASS** → the task transitions to **IN_REVIEW** (existing state
-     machine + TASK_IN_REVIEW notification). **Stop here.** Human
-     adjudication and any HUMAN criteria take over; never set DONE yourself.
-   - **Any FAIL** → task status is untouched; the unmet criteria are printed
-     as next actions (statement, checkpointer, failure tail). Fix and re-run.
-   - **Round 3 still failing** → the loop escalates: a human is notified
-     (AGENT_VERIFY, requires action) and further `lumo verify` rounds are
-     rejected with 409. **Stop retrying**; fix only what the human directs.
+| Round outcome             | Effect                                                                                                                      | What to do                                                                                    |
+| ------------------------- | --------------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------- |
+| **All PASS**              | Task transitions to **IN_REVIEW** (existing state machine + TASK_IN_REVIEW notification)                                    | **Stop here.** Human adjudication + any HUMAN criteria take over; **never set DONE yourself** |
+| **Any FAIL**              | Task status untouched; unmet criteria printed as next actions (statement, checkpointer, failure tail)                       | Fix and re-run                                                                                |
+| **Round 3 still failing** | Loop escalates: a human is notified (AGENT_VERIFY, requires action); further `lumo verify` rounds are rejected with **409** | **Stop retrying**; fix only what the human directs                                            |
 Exit code 0 = all passed (or nothing to run); 1 = failures, escalation, or
 errors.
 ## Verdict semantics (what the CLI sends)
-- checkpointer exits 0 → `PASS` with evidence `cmd:<command>#exit=0`
-- non-zero exit → `FAIL`, reason = output tail, enum `CRITERION_UNMET`
-- spawn failure / timeout → `FAIL`, enum `CHECK_EXECUTION_ERROR`
+| Checkpointer result     | Verdict | Detail                                       |
+| ----------------------- | ------- | -------------------------------------------- |
+| exits 0                 | `PASS`  | evidence `cmd:<command>#exit=0`              |
+| non-zero exit           | `FAIL`  | reason = output tail, enum `CRITERION_UNMET` |
+| spawn failure / timeout | `FAIL`  | enum `CHECK_EXECUTION_ERROR`                 |
-evidencePointer is **not free text** — the server only accepts
-`commit:<hash>`, `file:<path>:<line>`, or `cmd:<command>#exit=<code>`.
-Verdicts are PASS|FAIL only; the agent path cannot write HUMAN verdicts or
-`PASS_WITH_FOLLOWUP` (red line — those enter via human-initiated UI paths
-only).
+- evidencePointer is **not free text** — the server accepts only `commit:<hash>`, `file:<path>:<line>`, or `cmd:<command>#exit=<code>`.
+- Verdicts are PASS|FAIL only; **the agent path cannot write HUMAN verdicts or `PASS_WITH_FOLLOWUP`** (red line — those enter via human-initiated UI paths only).
 ## Edge cases
-- **No contract yet** → error pointing at `lumo task criteria set`; draft the
-  contract first (criteria.md golden rule).
-- **HUMAN-only contract (zero MACHINE criteria)** → nothing to run; the CLI
-  says so and suggests handing off for human review
-  (`lumo task update <id> --status in_review`). No server write happens.
-- **A round must cover every MACHINE criterion** — the CLI always runs all of
-  them; the server rejects partial rounds.
-- Criteria added during review (`REVIEW_ADDED`) appear in the contract and
-  are picked up automatically by the next round.
-- **Session bound to a different task (LUM-459)** → the server returns 409,
-  which the command surfaces as an error. No advisory is printed; the verify
-  round is rejected outright.
-- **Provably-unbound session** → the server includes `bindingAdvisory: 'unbound'`
-  in the round response, and the command prints:
-  `⚠ Working unbound — this verify ran from a Claude Code session not attached to the task.`
-  The run is recorded as a `SESSION_BINDING_MISSING` boundary crossing visible in
-  `lumo task status` open crossings. Run `lumo session attach <LUM-N>` before the
-  next verify to bind the session.
-- **Unconfirmed session binding** → `bindingAdvisory: 'unconfirmed'` causes a
-  softer advisory: `⚠ Could not confirm this session is attached to the task.`
-  Same remediation: `lumo session attach <LUM-N>`.
+| Case                                            | Behavior                                                                                                                                                                                                                                                                                                             |
+| ----------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| **No contract yet**                             | Error pointing at `lumo task criteria set`; draft the contract first (criteria.md golden rule).                                                                                                                                                                                                                      |
+| **HUMAN-only contract** (zero MACHINE criteria) | Nothing to run; CLI says so and suggests `lumo task update <id> --status in_review` for human review. No server write happens.                                                                                                                                                                                       |
+| **Partial round**                               | A round must cover every MACHINE criterion; the CLI always runs all of them and the server rejects partial rounds.                                                                                                                                                                                                   |
+| **`REVIEW_ADDED` criteria**                     | Criteria added during review appear in the contract and are picked up automatically by the next round.                                                                                                                                                                                                               |
+| **Session bound to a different task** (LUM-459) | Server returns 409, surfaced as an error. No advisory printed; the verify round is rejected outright.                                                                                                                                                                                                                |
+| **Provably-unbound session**                    | Response carries `bindingAdvisory: 'unbound'`; prints `⚠ Working unbound — this verify ran from a Claude Code session not attached to the task.` Recorded as a `SESSION_BINDING_MISSING` boundary crossing (visible in `lumo task status` open crossings). Run `lumo session attach <LUM-N>` before the next verify. |
+| **Unconfirmed session binding**                 | `bindingAdvisory: 'unconfirmed'` → softer advisory `⚠ Could not confirm this session is attached to the task.` Same remediation: `lumo session attach <LUM-N>`.                                                                                                                                                      |
 ## Round discipline
 Rounds are a hard budget of 3, not a retry loop. Between rounds, actually fix
-the failures — re-running without changes burns a round and (at round 3)
-pages a human. A FAIL round never changes task status; only an all-pass round
-moves it (to IN_REVIEW, never further).
+the failures — re-running without changes burns a round and (at round 3) pages a
+human. **A FAIL round never changes task status; only an all-pass round moves it
+(to IN_REVIEW, never further).**
 ## Review-time drift habits (gap findings)
-A problem discovered during acceptance/review that the contract does NOT
-cover is a **gap finding** — record it in the contract, never just fix it
-silently:
-1. **Append it on the spot.** Transcribe the human's finding as a criterion
-   via `lumo task criteria set <task> --file <desired-final-list> --human` —
-   the review-added semantics: the gap surfaced at review time, at the
-   current round.
-2. **Tag why the contract drifted** with `--cause
-<NEW_INFO|SCOPE_CHANGE|DRAFT_BLIND_SPOT|GRANULARITY|OTHER>`. Gap findings
-   are usually `DRAFT_BLIND_SPOT` (the draft missed it) or `NEW_INFO`
-   (information that didn't exist at drafting time).
-3. **Then bounce.** The appended criterion shows up in `lumo task status`
-   nextActions and the next verify round picks it up automatically — no
-   side-channel to-do list.
-How to read drift: information-lag and requirement-movement drift
-(`NEW_INFO`, `SCOPE_CHANGE`) is healthy — don't optimize it away.
-`DRAFT_BLIND_SPOT` clusters feed back into the drafting guide. **Zero drift
-across many tasks is a red flag, not a trophy** — it usually means contracts
-are too thin or state only sure-win clauses that can never be found wanting.
+A problem discovered during acceptance/review that the contract does NOT cover is
+a **gap finding** — record it in the contract, never just fix it silently:
+1. `lumo task criteria set <task> --file <desired-final-list> --human` — append it on the spot, transcribing the human's finding as a criterion. Review-added semantics: the gap surfaced at review time, at the current round.
+2. `--cause <NEW_INFO|SCOPE_CHANGE|DRAFT_BLIND_SPOT|GRANULARITY|OTHER>` — tag why the contract drifted. Gap findings are usually `DRAFT_BLIND_SPOT` (the draft missed it) or `NEW_INFO` (info that didn't exist at drafting time).
+3. `lumo task status` — then bounce: the appended criterion shows up in nextActions and the next verify round picks it up automatically. No side-channel to-do list.
+How to read drift: information-lag and requirement-movement drift (`NEW_INFO`,
+`SCOPE_CHANGE`) is healthy — don't optimize it away. `DRAFT_BLIND_SPOT` clusters
+feed back into the drafting guide. **Zero drift across many tasks is a red flag,
+not a trophy** — it usually means contracts are too thin or state only sure-win
+clauses that can never be found wanting.
 ## lumo task status — the read half (self-check entry point)
@@ -114,7 +90,7 @@ are too thin or state only sure-win clauses that can never be found wanting.
 nothing and burns no round. Defaults to the session-bound task; an explicit
 identifier overrides.
-```
+```bash
 lumo task status            # session-bound task
 lumo task status LUM-42     # explicit task
 lumo task status --json     # versioned machine-readable payload
@@ -122,108 +98,58 @@ lumo task status --json     # versioned machine-readable payload
 ### When to run it
-**Status-first recovery:** run it FIRST — before re-reading code or
-planning — whenever you:
+**Status-first recovery:** run it FIRST — before re-reading code or planning —
+whenever you:
 - resume a task in a new session (yours or another agent's earlier work);
 - come back after a verification round was rejected (`lumo verify` failed);
-- were told the task bounced in review (REVIEW_ADDED criteria may have been
-  appended at the round they surfaced — they show up here automatically).
+- were told the task bounced in review (REVIEW_ADDED criteria may have been appended at the round they surfaced — they show up here automatically).
 It answers "where does the loop stand": what already passed (don't redo it),
 what's unmet and why (the exact failure tails), and how many rounds are left.
 ### What it prints
-- Header: task identifier/title/status + `verification round N/3` (round 0 =
-  never verified) + an escalation warning when the machine loop is exhausted.
-- **Machine verification rollup** (LUM-470) — directly under the `Criteria`
-  header, one line `Machine verification: N machine-verified / M human override
-(of T MACHINE criteria)` over the active MACHINE criteria, aligned with the web
-  read model (LUM-456). Printed whenever the contract has ≥1 MACHINE criterion,
-  so the terminal rollup never reads as all-human when a checkpointer actually
-  verified the work.
-- **Criteria** — every criterion as `<glyph> <id>  [TYPE]  SOURCE@rN
-statement` (✓ latest verdict passed / ✗ failed / ○ no verdict yet) with its
-  checkpointer and latest verdict line (evidence pointer on pass, failure
-  tail on fail). `REVIEW_ADDED@rN` provenance is visible per row.
-  - A passing **MACHINE** criterion's verdict line carries a machine-state tag
-    derived from the read model's `machinePassed` flag, NOT the latest verdict
-    (LUM-470): `· machine-verified` when a checkpointer actually passed it (even
-    after a human later signs the task off), or `· human override (no machine
-pass)` when it passes only on a human sign-off with no machine run underneath.
-    This keeps the terminal honest with web — a machine-verified criterion that
-    a human co-signed no longer reads as a plain human pass.
-  - A pass can carry a **`⚠ pre-edit version`** note (LUM-457): the criterion
-    was changed after that verdict (reworded, or its checkpointer was swapped so
-    the recorded evidence ran a different command). The pass still counts as met
-    (a stale pass does not block DONE — render-only signal), but it vouches for
-    an older version — **re-run `lumo verify` to re-confirm against the current
-    criterion.** This is the habit whenever you edit a MACHINE criterion's
-    checkpointer mid-task: change the check, then re-verify so the green is honest.
+- **Header** — task identifier/title/status + `verification round N/3` (round 0 = never verified) + an escalation warning when the machine loop is exhausted.
+- **Machine verification rollup** (LUM-470) — directly under the `Criteria` header, one line `Machine verification: N machine-verified / M human override (of T MACHINE criteria)` over the active MACHINE criteria, aligned with the web read model (LUM-456). Printed whenever the contract has ≥1 MACHINE criterion, so the terminal rollup never reads as all-human when a checkpointer actually verified the work.
+- **Criteria** — every criterion as `<glyph> <id>  [TYPE]  SOURCE@rN statement` (✓ latest verdict passed / ✗ failed / ○ no verdict yet) with its checkpointer and latest verdict line (evidence pointer on pass, failure tail on fail). `REVIEW_ADDED@rN` provenance is visible per row.
+  - A passing **MACHINE** criterion's verdict line carries a machine-state tag derived from the read model's `machinePassed` flag, NOT the latest verdict (LUM-470): `· machine-verified` when a checkpointer actually passed it (even after a human later signs the task off), or `· human override (no machine pass)` when it passes only on a human sign-off with no machine run underneath. This keeps the terminal honest with web — a machine-verified criterion that a human co-signed no longer reads as a plain human pass.
+  - A pass can carry a **`⚠ pre-edit version`** note (LUM-457): the criterion was changed after that verdict (reworded, or its checkpointer was swapped so the recorded evidence ran a different command). The pass still counts as met (a stale pass does not block DONE — render-only signal), but it vouches for an older version — **re-run `lumo verify` to re-confirm against the current criterion.** This is the habit whenever you edit a MACHINE criterion's checkpointer mid-task: change the check, then re-verify so the green is honest.
 - **History** — one line per recorded round: `rN · timestamp · X PASS / Y FAIL`.
-- **Last round failures** — the most recent round's FAIL verdicts with their
-  rejection reasons (why the last round bounced).
-- **Next actions** — the unmet criteria (latest verdict is not a pass:
-  failed or never verified, HUMAN ones included). This list IS the plan —
-  it is recomputed from the event log on every read, never maintained
-  separately. Empty + rounds recorded = awaiting human adjudication.
-- **Open boundary crossings** (LUM-448) — a trailing safety block when the
-  task has ≥1 OPEN (undispositioned) forbidden-action crossing: a count, then
-  one line per crossing `• [SEVERITY] CATEGORY — <clipped detail>`
-  (highest-severity first), each followed by a read-only **attribution** line
-  `↳ by model=<m> · agent=<type>[/branch] · session=<8-char prefix>` (LUM-469 —
-  who/what crossed; any dimension that couldn't be resolved server-side prints
-  `unknown`, never a fabricated value), then a pointer to the web acceptance
-  panel. Silent when there are none, so it never overshadows the criteria.
-  **Read-only awareness** — this surfaces crossings detected elsewhere
-  (LUM-426/435/442); there is no CLI path to disposition or clear one.
-  Disposition stays web + human-only (LUM-426/435/422): an agent/CLI bearer
-  cannot clear its own crossing from the terminal. **The check fails closed
-  (LUM-480):** if the crossings read itself errors (network / server / parse),
-  the block prints `⚠ Boundary-crossing check failed (network/server error) —
-could not confirm whether any are undispositioned` instead of staying silent.
-  Silence means a successful read with zero open crossings, never a failed
-  check — a hiccup can no longer masquerade as "all clear".
+- **Last round failures** — the most recent round's FAIL verdicts with their rejection reasons (why the last round bounced).
+- **Next actions** — the unmet criteria (latest verdict is not a pass: failed or never verified, HUMAN ones included). This list IS the plan — recomputed from the event log on every read, never maintained separately. Empty + rounds recorded = awaiting human adjudication.
+- **Open boundary crossings** (LUM-448) — a trailing safety block when the task has ≥1 OPEN (undispositioned) forbidden-action crossing: a count, then one line per crossing `• [SEVERITY] CATEGORY — <clipped detail>` (highest-severity first), each followed by a read-only **attribution** line `↳ by model=<m> · agent=<type>[/branch] · session=<8-char prefix>` (LUM-469 — who/what crossed; any dimension that couldn't be resolved server-side prints `unknown`, never a fabricated value), then a pointer to the web acceptance panel. Silent when there are none, so it never overshadows the criteria.
+  - **Read-only awareness** — this surfaces crossings detected elsewhere (LUM-426/435/442); there is no CLI path to disposition or clear one. Disposition stays web + human-only (LUM-426/435/422): an agent/CLI bearer cannot clear its own crossing from the terminal.
+  - **The check fails closed (LUM-480):** if the crossings read itself errors (network / server / parse), the block prints `⚠ Boundary-crossing check failed (network/server error) — could not confirm whether any are undispositioned` instead of staying silent. Silence means a successful read with zero open crossings, never a failed check — a hiccup can no longer masquerade as "all clear".
 ### Responding to an open crossing — `lumo crossing explain` (LUM-542)
-When `lumo task status` surfaces an OPEN crossing you believe is a false
-positive — or you simply want to leave a rationale for the human reviewer —
-append a self-explanation ("申辩") to it:
+When `lumo task status` surfaces an OPEN crossing you believe is a false positive
+— or you simply want to leave a rationale for the human reviewer — append a
+self-explanation ("申辩") to it:
-```
+```bash
 lumo crossing explain <id> --note "this was a generated fixture, not a hand-edited migration"
 ```
 This is the **inverse** of dispositioning, but it is the agent/CLI path
-(bearer-only; a clerk/human caller is refused) and it can **only append** an
-append-only note — it **never clears the crossing or unblocks Done**
-(disposition stays web + human-only, LUM-448). The note is shown to the human
-reviewer at disposition time, kept for later review, and explicitly labeled
-_agent self-report · unverified_. `<id>` must be a crossing on the
-**session-bound task** (resolved from `$CLAUDE_CODE_SESSION_ID`; cross-task
-targets and unbound/mismatched sessions are rejected). Earlier explanations are
-immutable — a correction is a new note.
+(bearer-only; a clerk/human caller is refused). Behavior:
+- it can **only append** an append-only note — it **never clears the crossing or unblocks Done** (disposition stays web + human-only, LUM-448);
+- the note is shown to the human reviewer at disposition time, kept for later review, and explicitly labeled _agent self-report · unverified_;
+- `<id>` must be a crossing on the **session-bound task** (resolved from `$CLAUDE_CODE_SESSION_ID`; cross-task targets and unbound/mismatched sessions are rejected);
+- earlier explanations are immutable — a correction is a new note.
 ### --json contract
-`--json` emits the full read model with a top-level `version` field
-(currently `1`). The schema is versioned: breaking shape changes bump the
-major; additive fields don't. Pin on `version` when scripting against it.
-Each criterion carries `machinePassed` (boolean — a checkpointer currently
-vouches for it; LUM-456/470), and the payload carries a top-level
-`machineVerification` aggregate `{ total, machineVerified, humanOverridden }`
-over the active MACHINE criteria — read these, not `latestVerdict` alone, to
-tell a machine-verified criterion from a human override.
-The open boundary crossings ride along as an additive top-level
-`openCrossings` (each entry `{ id, category, severity, detail, attribution }`,
-where `attribution` is `{ workspaceMemberId, sessionId, agent, worktreeBranch,
-model }` with every field nullable — null = unknown, never fabricated; LUM-469;
-the array length is the count) — same read-only awareness, no write path.
-**`openCrossings` is `null` when the crossings check failed (LUM-480)** —
-distinct from `[]`, which is a successful read with zero open crossings. Script
-consumers must treat `null` as "unknown / could not confirm", not "safe".
+`--json` emits the full read model with a top-level `version` field (currently
+`1`). The schema is versioned: breaking shape changes bump the major; additive
+fields don't. Pin on `version` when scripting against it.
+- each criterion carries `machinePassed` (boolean — a checkpointer currently vouches for it; LUM-456/470);
+- the payload carries a top-level `machineVerification` aggregate `{ total, machineVerified, humanOverridden }` over the active MACHINE criteria — read these, not `latestVerdict` alone, to tell a machine-verified criterion from a human override;
+- open boundary crossings ride along as an additive top-level `openCrossings`, each entry `{ id, category, severity, detail, attribution }` where `attribution` is `{ workspaceMemberId, sessionId, agent, worktreeBranch, model }` with every field nullable — null = unknown, never fabricated (LUM-469); the array length is the count. Same read-only awareness, no write path;
+- **`openCrossings` is `null` when the crossings check failed (LUM-480)** — distinct from `[]` (a successful read with zero open crossings). Script consumers must treat `null` as "unknown / could not confirm", **not** "safe".
 `status` reads; `verify` judges. Running status never starts a round, never
 escalates, and never changes task state — loop rules (cap 3, IN_REVIEW on
@@ -232,10 +158,10 @@ all-pass, human-only DONE) live entirely in `lumo verify` and the server.
 ## lumo verdict — the three verdict channels (LUM-422)
 `lumo verify` is the MACHINE channel. `lumo verdict` covers the other two — the
-HUMAN pass and the AGENT send-back — under one red line: **no passing data row
-is ever agent-produced.**
+HUMAN pass and the AGENT send-back — under one red line: **no passing data row is
+ever agent-produced.**
-```
+```bash
 lumo verdict --pass
 lumo verdict LUM-42 --pass
 lumo verdict --fail --reason CRITERION_UNMET --note "the retry path is still missing"
@@ -256,62 +182,56 @@ verifierType=AGENT (a channel distinct from MACHINE and HUMAN, so "machine
 all-pass but human FAIL" stays an uncontaminated signal). The verdict is
 hard-coded FAIL — there is no agent path to a passing verdict. It:
-- requires `--reason` (case-insensitive): `CRITERION_UNMET | EVIDENCE_INSUFFICIENT
-| CHECK_EXECUTION_ERROR | SCOPE_MISMATCH | OTHER` — the agent pays the
-  structured tax a human send-back is spared;
-- takes an optional `--note`, posted as a task comment (@mentions and images for
-  free) and summarized onto the verdict row;
-- takes repeatable `--criterion <id>` to narrow the send-back; omitted, it fans
-  out to the whole contract;
-- records at round = the current max (not a new round) and bounces the task back
-  to IN_PROGRESS, with the unmet criteria surfacing through `lumo task status`.
+- `--reason <enum>` required (case-insensitive): `CRITERION_UNMET | EVIDENCE_INSUFFICIENT | CHECK_EXECUTION_ERROR | SCOPE_MISMATCH | OTHER` — the agent pays the structured tax a human send-back is spared;
+- `--note <text>` optional, posted as a task comment (@mentions and images for free) and summarized onto the verdict row;
+- `--criterion <id>` repeatable, narrows the send-back; omitted, it fans out to the whole contract;
+- `round` = the current max (not a new round); bounces the task back to IN_PROGRESS, with the unmet criteria surfacing through `lumo task status`.
 ### The DONE gate
 Once any criterion's latest verdict is FAIL — machine, AGENT, or human — moving
-the task to DONE on the agent/CLI path is refused with **409** and the
-unresolved items listed. Clear the send-back (fix + re-verify, or a human PASS)
-before `lumo task update <id> --status done`. A task with no criteria, or whose
-criteria were never adjudicated, transitions freely — the gate only blocks an
-actual send-back, never an un-adjudicated criterion. When the machine loop has
-left a task IN_REVIEW with no send-back standing, the agent may move it to DONE
-directly; a human-PASS row is a provable manual override, not a required ticket.
+the task to DONE on the agent/CLI path is refused with **409** and the unresolved
+items listed. Clear the send-back (fix + re-verify, or a human PASS) before
+`lumo task update <id> --status done`.
+- A task with no criteria, or whose criteria were never adjudicated, transitions freely — **the gate only blocks an actual send-back, never an un-adjudicated criterion.**
+- When the machine loop has left a task IN_REVIEW with no send-back standing, the agent may move it to DONE directly; a human-PASS row is a provable manual override, not a required ticket.
 ## When a defect appears — fix in place, don't spin off a new task
-On a send-back **or** a self-review finding: if the issue falls under any
-existing acceptance criterion of **this** task, fix it in place and re-run
-`lumo verify`. **Do not** `lumo task create` for it. New tasks are only for work
-genuinely _outside_ this task's acceptance contract. Creating a task (and PR)
-for in-scope rework launders a first-attempt failure — it bypasses the DONE
-gate's send-back protection and corrupts the flywheel signal.
-This is now enforced: when you're mid-task, `lumo task create` refuses the bare
-form and makes you declare intent — `--rework-of <id>` (it redirects you back to
-fix the existing task and creates nothing) or `--new-scope` (genuinely new,
-separate work). If the send-back reveals the **contract itself** was wrong,
-amend it on this task (`lumo task criteria set`) rather than opening a new
-task — see criteria.md.
-**Hard rule:** while THIS task has an unresolved send-back (any criterion's
-latest verdict is FAIL — the same condition that blocks DONE), `lumo task create`
-is refused with 409 **even with `--new-scope`**. A standing send-back means the
-task can't be completed yet; resolve it (fix + `lumo verify`, or amend the
-contract) before opening any new work. `--rework-of` still redirects you to it.
+On a send-back **or** a self-review finding: if the issue falls under any existing
+acceptance criterion of **this** task, fix it in place and re-run `lumo verify`.
+**Do not** `lumo task create` for it. New tasks are only for work genuinely
+_outside_ this task's acceptance contract. Creating a task (and PR) for in-scope
+rework launders a first-attempt failure — it bypasses the DONE gate's send-back
+protection and corrupts the flywheel signal.
+This is enforced: when you're mid-task, `lumo task create` refuses the bare form
+and makes you declare intent:
+- `lumo task create --rework-of <id>` — redirects you back to fix the existing task and creates nothing;
+- `lumo task create --new-scope` — genuinely new, separate work.
+If the send-back reveals the **contract itself** was wrong, amend it on this task
+(`lumo task criteria set`) rather than opening a new task — see criteria.md.
+**Hard rule:** while THIS task has an unresolved send-back (any criterion's latest
+verdict is FAIL — the same condition that blocks DONE), `lumo task create` is
+refused with **409 even with `--new-scope`**. A standing send-back means the task
+can't be completed yet; resolve it (fix + `lumo verify`, or amend the contract)
+before opening any new work. `--rework-of` still redirects you to it.
 ## Human-reported defects, once a task is submitted
 When someone reports a defect in conversation, your action depends on whether the
 task has **ever entered IN_REVIEW**:
-- **Not yet** (still your first working pass) → just fix it and continue. No
-  verdict needed — nothing was claimed complete, so there's nothing to contradict.
-- **Already submitted** (entered IN*REVIEW / DONE / merged) → **do not silently
-  fix and re-pass.** Either record your own send-back (`lumo verdict --fail`,
-  noting it was human-reported — this is \_your* honest concurrence, not a forged
-  human verdict), or ask the reporter to record a human FAIL via the web UI /
-  Slack (the only channel that can attribute it to a human). If the defect is a
-  **new requirement** not covered by any criterion, first transcribe it with
-  `lumo task criteria set --human`, then proceed. You can never write a human
-  _verdict_ — the terminal can't prove a human is behind the command
-  (attribution integrity, not anti-forgery).
+- **Not yet** (still your first working pass) → just fix it and continue. No verdict needed — nothing was claimed complete, so there's nothing to contradict.
+- **Already submitted** (entered IN_REVIEW / DONE / merged) → **do not silently fix and re-pass.** Either:
+  - record your own send-back `lumo verdict --fail` (noting it was human-reported — this is _your_ honest concurrence, not a forged human verdict), or
+  - ask the reporter to record a human FAIL via the web UI / Slack (the only channel that can attribute it to a human).
+If the defect is a **new requirement** not covered by any criterion, first
+transcribe it with `lumo task criteria set --human`, then proceed. You can never
+write a human _verdict_ — the terminal can't prove a human is behind the command
+(attribution integrity, not anti-forgery).

package/assets/skill/references/worktree.md CHANGED Viewed

@@ -24,42 +24,20 @@ lumo worktree add LUM-267 --no-fetch           # branch off existing origin/main
 lumo worktree add LUM-267 --verify             # run npx jest baseline after setup
 ```
-Flags: `--base <ref>` (branch off a ref other than origin/main; skips the
-fetch), `--no-fetch` (skip `git fetch origin main`), `--verify` (run `npx jest`
-in the new worktree). Errors if the target dir already exists; reuses the branch
-if it already exists (adds the worktree without `-b`).
-**The prisma gotcha it warns about:** the generated client lives in the shared
-(symlinked) `node_modules`, so a `prisma generate` in one worktree clobbers the
-client every parallel worktree depends on. Verify with jest (SWC mocks Prisma);
-do `generate + tsc` atomically once at the end.
-**The jest gotcha it warns about:** always run jest from the worktree root (`cd`
-in first) — `cli/` has no jest config, and running from the main checkout hits
-the `cli/package.json` haste collision and silently runs the wrong tests.
-**The husky hooks it copies in:** husky owns git hooks via
-`core.hooksPath = .husky/_`, a relative path git resolves against each
-worktree's own root. That `_` shim dir is **untracked** (husky regenerates it on
-the main checkout's `npm install`/`prepare`), so a fresh worktree would lack it
-and git would **silently skip every hook** — pre-commit (lint-staged) and
-commit-msg (LUM-405 drift-check) included. Since this repo has no GitHub CI,
-husky is the only deterministic quality gate, so `add` copies `.husky/_` into the
-new worktree (copy, not symlink — the shim is tiny and the hooks it invokes
-resolve relative to the worktree). If the main checkout itself has no `.husky/_`
-(husky never installed there), `add` prints a warning telling you to run
-`npm install` in the main checkout to regenerate it, rather than skipping
-silently.
-**Never run `npm install` / `npm ci` inside a worktree.** The worktree shares
-the main checkout's `node_modules` via a symlink; npm does not respect the link
-— it deletes it and reifies a full standalone `node_modules` (thousands of
-packages, ~1 min, and the shared prisma-client is gone). Run installs only in
-the main checkout, then re-create the symlink if npm replaced it. (Older npm
-could instead plant a self-referential `node_modules/node_modules` in the shared
-tree, which hard-panics Turbopack's `next build`; the `prebuild`/`predev`/
-`preanalyze` guard — `scripts/fix-nodemodules-selflink.ts` — removes that link
-automatically, but the rule above avoids the whole mess.)
+| Flag           | Notes                                                           |
+| -------------- | --------------------------------------------------------------- |
+| `--base <ref>` | Branch off a ref other than origin/main (skips the fetch).      |
+| `--no-fetch`   | Skip `git fetch origin main` (branch off existing origin/main). |
+| `--verify`     | Run `npx jest` baseline in the new worktree after setup.        |
+Errors if the target dir already exists; reuses the branch if it already exists (adds the worktree without `-b`).
+### Gotchas (two it warns about, two it can't fix)
+- **`prisma generate` clobbers all worktrees.** The generated client lives in the shared (symlinked) `node_modules`, so a `generate` in one worktree overwrites the client every parallel worktree depends on. Verify with jest (SWC mocks Prisma); do `generate + tsc` atomically once at the end.
+- **Run jest from the worktree root** (`cd` in first). `cli/` has no jest config; running from the main checkout hits the `cli/package.json` haste collision and silently runs the wrong tests.
+- **Husky hooks are copied in for you.** Husky owns hooks via `core.hooksPath = .husky/_`, resolved relative to each worktree's root. That `_` shim is **untracked** (regenerated on the main checkout's `npm install`/`prepare`), so a fresh worktree would lack it and git would **silently skip every hook** — pre-commit (lint-staged) and commit-msg (LUM-405 drift-check). With no GitHub CI, husky is the only deterministic quality gate, so `add` copies `.husky/_` in (copy, not symlink). If the main checkout has no `.husky/_`, `add` warns you to `npm install` there rather than skipping silently.
+- **Never `npm install` / `npm ci` inside a worktree.** npm doesn't respect the `node_modules` symlink — it deletes it and reifies a full standalone tree (~1 min, shared prisma-client gone). Install only in the main checkout, then re-create the symlink if npm replaced it. (Older npm could plant a self-referential `node_modules/node_modules` that hard-panics Turbopack's `next build`; the `prebuild`/`predev`/`preanalyze` guard `scripts/fix-nodemodules-selflink.ts` removes it, but this rule avoids the mess.)
 ## `lumo worktree rm <LUM-N>`
@@ -83,3 +61,10 @@ check when jest/tsc misbehaves). Read-only; runnable from anywhere in the repo.
 ```bash
 lumo worktree list
 ```
+## When to suggest worktrees
+- The user wants to work on a task **in isolation** from the current workspace — a parallel branch they can build/test without disturbing uncommitted work in the main checkout.
+- Starting a second task while one is in flight, or running independent tasks concurrently (one worktree per task — never two tasks in one worktree).
+- Before executing an implementation plan that needs a clean, dedicated branch off `origin/main`.
+- A `MISSING` node_modules link in `lumo worktree list` is the first thing to check when jest/tsc misbehaves in a worktree.

package/dist/cli/src/commands/memory-fold.js ADDED Viewed

@@ -0,0 +1,76 @@
+"use strict";
+Object.defineProperty(exports, "__esModule", { value: true });
+exports.memoryFold = memoryFold;
+const config_1 = require("../lib/config");
+const api_1 = require("../lib/api");
+const resolve_1 = require("../lib/resolve");
+const resolve_bound_task_1 = require("../lib/resolve-bound-task");
+const resolve_project_1 = require("../lib/resolve-project");
+const sanitize_1 = require("../lib/sanitize");
+function flatten(content) {
+    if (content === null || typeof content !== 'object')
+        return '';
+    return Object.values(content)
+        .map(v => (Array.isArray(v) ? v.join(' ') : typeof v === 'string' ? v : ''))
+        .filter(Boolean)
+        .join(' — ');
+}
+async function memoryFold(refArg, options) {
+    // Folding runs automatically (daily cron). The CLI is preview-only.
+    if (!options.dryRun) {
+        console.error('Topic folding runs automatically (daily). This command only previews it.\n' +
+            'Run `lumo memory fold --dry-run` to see what the next fold pass would do.');
+        return 1;
+    }
+    const creds = (0, config_1.readCredentials)();
+    if (!creds) {
+        console.error('Error: not logged in. Run `lumo auth login` first.');
+        return 1;
+    }
+    const apiUrl = (0, api_1.resolveAuthedApiUrl)(creds.apiUrl);
+    const base = (0, api_1.trimTrailingSlash)(apiUrl);
+    let projectId;
+    if (refArg) {
+        projectId = await (0, resolve_1.resolveProjectId)(base, creds.token, refArg);
+    }
+    else {
+        const bound = await (0, resolve_bound_task_1.resolveBoundTaskIdentifier)(apiUrl, creds.token);
+        if (!bound) {
+            console.error('Error: no <project-ref> given and no task bound to this session.\n' +
+                'Pass a project, or run `lumo session attach <LUM-N>`.');
+            return 1;
+        }
+        const r = await (0, resolve_project_1.resolveBoundProjectId)(apiUrl, creds.token, bound);
+        if (!r.ok) {
+            console.error(`Error: ${r.error}`);
+            return 1;
+        }
+        projectId = r.id;
+    }
+    let res;
+    try {
+        res = await fetch(`${base}/api/projects/${encodeURIComponent(projectId)}/memories/fold-candidates`, { headers: { Authorization: `Bearer ${creds.token}` } });
+    }
+    catch (err) {
+        console.error(`Error: could not reach Lumo API at ${apiUrl} (${err instanceof Error ? err.message : String(err)})`);
+        return 1;
+    }
+    if (!res.ok) {
+        console.error(`Error: fold preview failed (HTTP ${res.status})`);
+        return 1;
+    }
+    const { proposals } = (await res.json());
+    if (proposals.length === 0) {
+        process.stdout.write('No fold proposals — nothing coarse to collapse right now.\n');
+        return;
+    }
+    process.stdout.write(`${proposals.length} fold proposal(s) — DRY RUN, nothing written:\n\n`);
+    for (const p of proposals) {
+        process.stdout.write(`[${(0, sanitize_1.sanitizeField)(p.category)}] coarse card folds ${p.sourceIds.length} cards` +
+            (p.excludedIds.length
+                ? ` (${p.excludedIds.length} left ACTIVE by the coverage gate)`
+                : '') +
+            `:\n  ${(0, sanitize_1.sanitizeField)(flatten(p.coarseContent))}\n` +
+            `  folds: ${p.sourceIds.map(s => (0, sanitize_1.sanitizeField)(s)).join(', ')}\n\n`);
+    }
+}

package/dist/cli/src/index.js CHANGED Viewed

@@ -70,6 +70,7 @@ const memory_rm_1 = require("./commands/memory-rm");
 const memory_show_1 = require("./commands/memory-show");
 const memory_sync_1 = require("./commands/memory-sync");
 const memory_push_1 = require("./commands/memory-push");
+const memory_fold_1 = require("./commands/memory-fold");
 const task_artifact_add_1 = require("./commands/task-artifact-add");
 const task_criteria_set_1 = require("./commands/task-criteria-set");
 const task_criteria_list_1 = require("./commands/task-criteria-list");
@@ -533,6 +534,11 @@ memoryCmd
     .option('--dir <path>', 'Memory dir (default: ~/.claude/projects/<cwd>/memory)')
     .option('--dry-run', 'List what would be pushed without sending')
     .action(wrap((opts) => (0, memory_push_1.memoryPush)(opts)));
+memoryCmd
+    .command('fold [project-ref]')
+    .description('Preview the autonomous topic-fold pass (folding itself runs daily, automatically). Requires --dry-run.')
+    .option('--dry-run', 'preview proposed subsystem cards + which fine cards they fold; writes nothing')
+    .action(wrap((ref, opts) => (0, memory_fold_1.memoryFold)(ref, opts)));
 const milestoneCmd = program
     .command('milestone')
     .description('Inspect milestones from the terminal');