npm - @bookedsolid/rea - Versions diffs - 0.28.0 → 0.28.2 - Mend

@bookedsolid/rea 0.28.0 → 0.28.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7) hide show

package/agents/rea-orchestrator.md +18 -0
package/dist/cli/review.d.ts +55 -1
package/dist/cli/review.js +167 -5
package/hooks/blocked-paths-enforcer.sh +39 -0
package/hooks/settings-protection.sh +39 -0
package/package.json +1 -1
package/scripts/dist-regression-gate.sh +46 -2

package/agents/rea-orchestrator.md CHANGED Viewed

@@ -114,6 +114,24 @@ Consumer projects may extend the roster via `.rea/agents/` and profile YAMLs, bu
 4. Delegate with full context — include file paths, constraints from policy.yaml, acceptance criteria, and the commit-discipline note above
 5. Verify outputs before reporting completion — do not trust agent summaries at face value. Read the files, check git status, confirm the build.
+## Self-review when the orchestrator implements directly (0.29.0+)
+There are sessions where the orchestrator must implement work itself instead of dispatching:
+- Subagent dispatch is unavailable (no Task tool in the current harness, exempt-subagent scenario).
+- The task is narrowly scoped to a single small surface where the dispatch overhead exceeds the implementation cost.
+- A codex round between specialist hand-offs is being used as the de facto specialist tier (the "Option C" iteration pattern from the 0.29.0 marathon).
+In every such case, you MUST still apply the specialist discipline that delegation would have enforced. This is not optional — the structural risk of "one Opus turn implements five surfaces" is exactly the failure mode that principal-engineer review caught in the 0.28.0 cycle (manifest glob-injection P1 + cache-staleness P2, both pre-commit). Reach the same closure shape by:
+1. **Name the specialists you are channeling.** Before each surface, state which specialist's discipline applies (e.g. "shell-scripting-specialist + adversarial-test-specialist for the bash gate corpus; typescript-specialist for the CLI; platform-architect for the workflow"). State it out loud so the user can spot a mis-cast role.
+2. **Codex round between surfaces, not just at the end.** A single end-of-build codex round across 5 surfaces buries P1s in noise. One round per surface keeps the signal sharp. The 0.27.0 direct-Bash codex CLI is cheap enough at one Opus turn per round to make this routine.
+3. **Explicit threat-model framing for security-tier changes.** When patching a hook, name the bypass class, the conservative-vs-narrow reading, and the sibling shapes the class implies. Refuse to commit until the corpus enumerates every shape the class includes.
+4. **Single-commit-per-PR discipline still applies.** Squash local work before push. The pre-push gate's stateless codex review runs once against the squashed diff; granular commits multiply the review burden without surfacing new findings.
+5. **Defer ruthlessly.** Trimmed-scope greenlights from the user are a maximum, not a minimum. The marathon's 0.28.0 lesson was "principal-engineer trimmed the 11-item plate to 6 with crisp deferral reasons." Apply the same lens during direct-implementation: if surface 6 needs structural rework, defer it to the next minor with the reason in the changeset rather than ship a half-baked closure.
+A self-review checkpoint after each surface (read the diff back, run the targeted tests, fire codex against the working tree) IS the specialist tier when no subagent is in the path. Skip the checkpoint and the structural lesson resets.
 ## The Plan / Build / Review Loop (default workflow)
 REA's default engineering workflow is three-legged, with Review performed by a different model than Build:

package/dist/cli/review.d.ts CHANGED Viewed

@@ -31,6 +31,8 @@
  * the push-gate's review.
  */
 import type { Command } from 'commander';
+import { type LocalReviewVerdict } from '../audit/local-review-event.js';
+import { type Finding } from '../hooks/push-gate/findings.js';
 export interface RunReviewOptions {
     /** Optional explicit base ref. Defaults to upstream-ladder resolution. */
     base?: string;
@@ -42,13 +44,65 @@ export interface RunReviewOptions {
     strictFailOn?: 'concerns' | 'blocking';
     /** Emit a single JSON line on stdout instead of pretty output. */
     json?: boolean;
+    /**
+     * 0.28.1 defect-V: when true, after the human-readable summary line
+     * (or alongside the JSON payload), emit the finding bodies grouped by
+     * severity. Default off — preserves backward-compatible single-line
+     * stdout for existing CI consumers.
+     */
+    withFindings?: boolean;
+}
+/**
+ * Exported so tests can construct fake outcomes for the seam in
+ * `runReview`. Production callers don't reference this directly.
+ */
+export interface ReviewOutcome {
+    verdict: LocalReviewVerdict;
+    findingCount: number;
+    baseRef: string;
+    headSha: string;
+    /**
+     * 0.26.0 helix-026 finding-1: tree SHA of HEAD at review time. The
+     * deterministic content fingerprint `rea preflight` matches coverage
+     * on. Empty string when not resolvable (no HEAD, no git repo) — the
+     * audit writer omits `content_token` from metadata in that case.
+     */
+    contentToken: string;
+    durationSeconds: number;
+    model: string;
+    reasoningEffort: string;
+    /**
+     * 0.28.1 defect-V: structured findings produced by the review. Pre-fix
+     * the CLI threw these away after counting; agents could not remediate
+     * blocking verdicts because the bodies were unreadable through any
+     * documented surface.
+     */
+    findings: Finding[];
+    /**
+     * 0.28.1 defect-V: full agent-prose review text. Persisted to
+     * `.rea/last-review.json` (post-redaction) so consumers have a
+     * machine-readable transcript for parser-miss debugging.
+     */
+    reviewText: string;
+    /** Count of raw JSONL events from codex — recorded in last-review.json. */
+    eventCount: number;
+}
+/**
+ * 0.28.1 defect-V — narrow test seam. Production callers never set this;
+ * tests inject a fake to drive `runReview` deterministically without
+ * spawning codex. The seam matches `executeCodexReview`'s signature so
+ * the production path and the test path go through the same downstream
+ * wiring (audit append, last-review.json, exit code, output).
+ */
+export interface RunReviewDeps {
+    executeCodexReview?: (baseDir: string, options: RunReviewOptions) => Promise<ReviewOutcome>;
 }
 /**
  * Public runner — exposed so tests can drive the function in-process and
  * the commander binding can stay thin. Throws via `process.exit` (CLI
  * convention across `src/cli/`).
  */
-export declare function runReview(options: RunReviewOptions): Promise<void>;
+export declare function runReview(options: RunReviewOptions, deps?: RunReviewDeps): Promise<void>;
 /**
  * Attach `rea review` to a commander Program.
  */

package/dist/cli/review.js CHANGED Viewed

@@ -39,9 +39,34 @@ import { loadPolicyAsync } from '../policy/loader.js';
 import { CodexNotInstalledError, CodexProtocolError, CodexSubprocessError, CodexTimeoutError, IRON_GATE_DEFAULT_MODEL, IRON_GATE_DEFAULT_REASONING, createRealGitExecutor, runCodexReview, } from '../hooks/push-gate/codex-runner.js';
 import { resolvePushGatePolicy } from '../hooks/push-gate/policy.js';
 import { resolveBaseRef } from '../hooks/push-gate/base.js';
-import { summarizeReview } from '../hooks/push-gate/findings.js';
+import { summarizeReview, } from '../hooks/push-gate/findings.js';
+import { writeLastReview } from '../hooks/push-gate/report.js';
 import { computeTreeToken, EMPTY_TREE_SHA } from '../audit/content-token.js';
+import { compileDefaultSecretPatterns, redactSecrets, } from '../gateway/middleware/redact.js';
 import { err, log } from './utils.js';
+/** Relative path to the last-review snapshot, surfaced in JSON output. */
+const LAST_REVIEW_RELATIVE = '.rea/last-review.json';
+/**
+ * 0.28.1 defect-V round-1 P2-1: shared redactor for the
+ * `writeLastReview` failure path. The canonical writer redacts findings
+ * before serialization; if it threw we still need to redact the
+ * in-memory findings before they reach `--with-findings` stdout or
+ * `--json --with-findings`. Without this, a writer failure (read-only
+ * .rea/, ENOSPC, race) would let unredacted Codex prose — which can
+ * quote secrets from the diff — escape via the new surfaces, defeating
+ * the redaction guarantee the writer provides.
+ */
+function redactFindingsInMemory(findings) {
+    const patterns = compileDefaultSecretPatterns({ source: 'default' });
+    const redactStr = (s) => redactSecrets(s, patterns).output;
+    return findings.map((f) => ({
+        severity: f.severity,
+        title: redactStr(f.title),
+        body: redactStr(f.body),
+        ...(f.file !== undefined ? { file: f.file } : {}),
+        ...(f.line !== undefined ? { line: f.line } : {}),
+    }));
+}
 const PROVIDER_CODEX = 'codex';
 /**
  * Probe `codex --version` synchronously. Same shape as the push-gate's
@@ -84,7 +109,7 @@ async function resolveLocalReviewMode(baseDir) {
  * the commander binding can stay thin. Throws via `process.exit` (CLI
  * convention across `src/cli/`).
  */
-export async function runReview(options) {
+export async function runReview(options, deps = {}) {
     const baseDir = process.cwd();
     const strictFailOn = options.strictFailOn ?? 'blocking';
     const { mode, policy } = await resolveLocalReviewMode(baseDir);
@@ -131,7 +156,8 @@ export async function runReview(options) {
     // Codex available — run the review.
     let outcome;
     try {
-        outcome = await executeCodexReview(baseDir, options);
+        const exec = deps.executeCodexReview ?? executeCodexReview;
+        outcome = await exec(baseDir, options);
     }
     catch (e) {
         const msg = e instanceof Error ? e.message : String(e);
@@ -168,6 +194,49 @@ export async function runReview(options) {
     if (probe.version !== undefined)
         metadata.provider_version = probe.version;
     await safeAudit(baseDir, LOCAL_REVIEW_TOOL_NAME, outcome.verdict === 'blocking' ? InvocationStatus.Denied : InvocationStatus.Allowed, metadata, policy);
+    // 0.28.1 defect-V: persist `.rea/last-review.json` on EVERY successful
+    // codex run (pass / concerns / blocking) BEFORE the exit so agents can
+    // read structured findings to remediate. Pre-fix only the push-gate
+    // wrote this file; `rea review` discarded the bodies after counting,
+    // so consumers saw stale snapshots from days-old push-gate runs (Ava
+    // reported a 2026-05-08 file surviving across new 2026-05-09 runs).
+    //
+    // Reuses the push-gate's writer — the canonical atomic-write path with
+    // redaction. We do NOT inline a second implementation: any divergence
+    // between the two writers would silently desynchronize the schema for
+    // `rea preflight` and any tooling that reads last-review.json.
+    //
+    // Skipped/error paths (codex unavailable, codex error) do NOT call this
+    // — there are no findings to serialize.
+    let lastReviewWritten;
+    try {
+        // `LocalReviewVerdict` permits `'error'` for the audit-record schema
+        // (transport / subprocess failures) but the codex success path can
+        // only produce pass | concerns | blocking — we caught throw above.
+        // Narrow here so the report writer's stricter `Verdict` type accepts
+        // it without losing the audit shape elsewhere in this file.
+        const verdict = outcome.verdict;
+        lastReviewWritten = writeLastReview({
+            baseDir,
+            summary: {
+                verdict,
+                findings: outcome.findings,
+                reviewText: outcome.reviewText,
+            },
+            baseRef: outcome.baseRef,
+            headSha: outcome.headSha,
+            eventCount: outcome.eventCount,
+            durationSeconds: outcome.durationSeconds,
+        });
+    }
+    catch (e) {
+        // last-review.json is a remediation surface, not a gate. A write
+        // failure (read-only fs, ENOSPC, race with another run) must not
+        // change the verdict-driven exit code. Surface the error to stderr
+        // so operators can correlate, then continue.
+        const msg = e instanceof Error ? e.message : String(e);
+        process.stderr.write(`rea: last-review.json write failed: ${msg}\n`);
+    }
     // Decide exit code based on strictFailOn.
     let exitCode;
     if (outcome.verdict === 'blocking') {
@@ -179,8 +248,17 @@ export async function runReview(options) {
     else {
         exitCode = 0;
     }
+    // 0.28.1 defect-V: redacted findings come from the writer when it
+    // succeeded (so `--with-findings` shows the same bodies that landed on
+    // disk). When the write FAILED we re-redact the in-memory findings
+    // inline (round-1 P2-1) — without this fallback, secrets that codex
+    // copied from the diff into a finding body would escape via stdout/
+    // JSON in the exact failure mode where the on-disk surface is gone.
+    const findingsForOutput = lastReviewWritten !== undefined
+        ? lastReviewWritten.findings
+        : redactFindingsInMemory(outcome.findings);
     if (options.json === true) {
-        process.stdout.write(JSON.stringify({
+        const payload = {
             status: outcome.verdict,
             finding_count: outcome.findingCount,
             head_sha: outcome.headSha,
@@ -190,14 +268,89 @@ export async function runReview(options) {
             reasoning_effort: outcome.reasoningEffort,
             duration_seconds: outcome.durationSeconds,
             exit_code: exitCode,
-        }) + '\n');
+            // 0.28.1 defect-V round-1 P2-2: only advertise `last_review_path`
+            // when the writer actually produced a current snapshot. If the
+            // write threw, the file on disk is either missing or a stale
+            // snapshot from an older run — pointing JSON consumers at it
+            // would let agents remediate against the wrong findings while
+            // the current run still exits successfully. Emit `null` and an
+            // explicit `last_review_error` so consumers can branch
+            // deterministically.
+            last_review_path: lastReviewWritten !== undefined ? LAST_REVIEW_RELATIVE : null,
+        };
+        if (lastReviewWritten === undefined) {
+            payload.last_review_error = 'write_failed';
+        }
+        if (options.withFindings === true) {
+            // Mirror last-review.json's Finding shape so JSON consumers see one
+            // schema. Findings are pre-redacted (writer-redacted on success,
+            // re-redacted inline on writer failure — see findingsForOutput).
+            payload.findings = findingsForOutput;
+        }
+        process.stdout.write(JSON.stringify(payload) + '\n');
     }
     else {
         log(`local review: ${outcome.verdict} (${outcome.findingCount} finding(s)) — head=${outcome.headSha.slice(0, 12)} base=${outcome.baseRef}`);
         log(`audit entry written: tool_name=${LOCAL_REVIEW_TOOL_NAME}`);
+        if (options.withFindings === true) {
+            printFindingsBySeverity(findingsForOutput, lastReviewWritten !== undefined);
+        }
     }
     process.exit(exitCode);
 }
+/**
+ * 0.28.1 defect-V — group findings by severity (P1 → P2 → P3) and print
+ * to stdout via `log()`. Each finding renders as
+ *
+ *   - [P1] <title> — <file>:<line>
+ *
+ * mirroring the codex-banner shape produced by the push-gate, so muscle
+ * memory transfers between the two surfaces. The full body is intentionally
+ * NOT printed here — the body can be very long, and the canonical place to
+ * read full bodies is `.rea/last-review.json`. We print enough to identify
+ * each finding and drive the agent to the file.
+ *
+ * Round-2 P2 fix: only point at last-review.json when the writer
+ * actually produced a current snapshot. Mirrors the JSON-path guard on
+ * `last_review_path`. If the write failed, the on-disk file is missing
+ * or stale; pointing a human there would let them remediate against the
+ * wrong findings. Falls back to a self-contained banner that names the
+ * failure mode.
+ */
+function printFindingsBySeverity(findings, lastReviewWritten) {
+    if (findings.length === 0)
+        return;
+    const order = ['P1', 'P2', 'P3'];
+    log('');
+    if (lastReviewWritten) {
+        log(`findings (see ${LAST_REVIEW_RELATIVE} for full bodies):`);
+    }
+    else {
+        log('findings (last-review.json write FAILED — bodies shown inline below; stale file may exist on disk and should be ignored):');
+    }
+    for (const sev of order) {
+        const group = findings.filter((f) => f.severity === sev);
+        if (group.length === 0)
+            continue;
+        for (const f of group) {
+            const loc = f.file !== undefined ? ` — ${f.file}${f.line !== undefined ? `:${f.line}` : ''}` : '';
+            log(`  - [${sev}] ${f.title}${loc}`);
+            // Round-3 P2 fix: when the writer failed, the on-disk surface is
+            // gone — agents and humans have no other place to read the body.
+            // Render the body inline (already redacted upstream) so the
+            // banner's "bodies shown inline below" promise is truthful and
+            // remediation can still happen. On the success path, bodies stay
+            // in last-review.json so the stdout surface stays scannable.
+            if (!lastReviewWritten && f.body.length > 0) {
+                for (const bodyLine of f.body.split(/\r?\n/)) {
+                    if (bodyLine.length === 0)
+                        continue;
+                    log(`      ${bodyLine}`);
+                }
+            }
+        }
+    }
+}
 /**
  * Execute the codex review subprocess and translate the output to a
  * verdict. Reuses the push-gate's resolved policy so `codex_model` /
@@ -260,6 +413,13 @@ async function executeCodexReview(baseDir, options) {
         durationSeconds: codexResult.durationSeconds,
         model: resolved.codex_model ?? IRON_GATE_DEFAULT_MODEL,
         reasoningEffort: resolved.codex_reasoning_effort ?? IRON_GATE_DEFAULT_REASONING,
+        // 0.28.1 defect-V: thread the structured findings + reviewText + event
+        // count through to the caller so `runReview` can persist last-review.json
+        // and (optionally) print bodies. Pre-fix these were dropped on the floor
+        // after `summary.findings.length` was computed.
+        findings: summary.findings,
+        reviewText: codexResult.reviewText,
+        eventCount: codexResult.eventCount,
     };
 }
 function classifyCodexError(e) {
@@ -313,11 +473,13 @@ export function registerReviewCommand(program) {
         return raw;
     })
         .option('--json', 'emit a single-line JSON result instead of human-readable output')
+        .option('--with-findings', 'after the summary, print findings grouped by severity (P1/P2/P3); when combined with --json, the JSON payload gains a `findings` array')
         .action(async (opts) => {
         await runReview({
             ...(opts.base !== undefined ? { base: opts.base } : {}),
             ...(opts.strictFailOn !== undefined ? { strictFailOn: opts.strictFailOn } : {}),
             ...(opts.json === true ? { json: true } : {}),
+            ...(opts.withFindings === true ? { withFindings: true } : {}),
         });
     });
 }

package/hooks/blocked-paths-enforcer.sh CHANGED Viewed

@@ -145,6 +145,45 @@ if [[ "$raw_has_traversal" -eq 1 ]] || [[ "$norm_has_traversal" -eq 1 ]]; then
   exit 2
 fi
+# ── 5a-bis. Reject interior single-dot segments (0.29.0 helix-/./-class) ─────
+# Parallel to the `..` guard above. `normalize_path` does NOT collapse
+# interior `./` segments — that would corrupt `..` traversals — which leaves
+# a bypass class. A blocked entry of `.env` does not match `foo/./.env`
+# (the literal-comparison loop is byte-for-byte), so an attacker who can
+# influence the file_path string can dodge the policy entry.
+#
+# The conservative closure (per Jake 2026-05-12): treat any interior `/./`
+# segment exactly like `..`. The NORMALIZED form is the safe surface for
+# the check — `normalize_path` already stripped leading `./` segments, so
+# any `/./` that survives is interior by construction. A raw-form check
+# would false-positive on benign `./foo` paths (codex round 1 P2: a path
+# like `%2E%2Fsrc/foo.ts` decodes to `./src/foo.ts` which is the same
+# leading-`./` allowed shape the comment at the top of `normalize_path`
+# documents — guarding against it on the raw form would block legit
+# writes under `src/` and friends).
+#
+# URL-encoded companion: `.%2F` / `%2E/` / `%2E%2F` decode to `./` via
+# `normalize_path` (which knows `%2E` → `.` and `%2F` → `/`). After
+# URL-decode + leading-`./` strip, any encoded INTERIOR form hits the
+# normalized `*/./* ` check. No raw-form encoded guard is needed — the
+# normalize_path path already covers every encoded shape the helper
+# decodes, and shapes it doesn't decode wouldn't resolve to an interior
+# `./` segment on disk either.
+norm_has_dot_segment=0
+case "/$NORMALIZED/" in
+  */./*) norm_has_dot_segment=1 ;;
+esac
+if [[ "$norm_has_dot_segment" -eq 1 ]]; then
+  {
+    printf 'BLOCKED PATH: interior dot-segment rejected\n'
+    printf '\n'
+    printf '  File: %s\n' "$FILE_PATH"
+    printf "  Rule: path contains an interior '/./' segment; rewrite to a\n"
+    printf '        canonical project-relative path without dot segments.\n'
+  } >&2
+  exit 2
+fi
 for writable in "${AGENT_WRITABLE[@]}"; do
   if [[ "$NORMALIZED" == "$writable" ]] || [[ "$NORMALIZED" == "$writable"* && "$writable" == */ ]]; then
     exit 0

package/hooks/settings-protection.sh CHANGED Viewed

@@ -128,6 +128,45 @@ if [[ "$raw_has_traversal" -eq 1 ]] || [[ "$norm_has_traversal" -eq 1 ]]; then
   exit 2
 fi
+# ── 5a-bis. Reject interior single-dot segments (0.29.0 helix-/./-class) ─────
+# Companion to the `..` guard above. The `normalize_path` helper deliberately
+# does NOT collapse interior `./` segments because doing so would corrupt
+# `..` traversals — but that leaves a parallel bypass class. A path like
+# `.husky/./pre-push` resolves on disk to `.husky/pre-push`, yet the literal/
+# prefix matchers in §6 compare against the un-collapsed `.husky/./pre-push`
+# string and miss the match.
+#
+# Conservative reading (per Jake 2026-05-12): treat any interior `./`
+# segment exactly like a `..` segment — refuse outright, force the caller
+# to send a canonical path. The corpus design pairs shell-scripting-specialist
+# with adversarial-test-specialist; the canonical attack shapes are:
+#
+#   .husky/./pre-push                  — single segment
+#   .husky/././pre-push                — repeated segments
+#   .husky/.//pre-push                 — `./` immediately followed by another `/`
+#   .claude/hooks/./_lib/halt-check.sh — inside a protected directory
+#   %2E%2F                             — percent-encoded `./`, caught after URL-decode
+#   .\.\pre-push                       — backslash variant, normalize_path → `./`
+#
+# Only the NORMALIZED form is checked (not the raw form) because raw `./foo`
+# at start-of-string is a legitimate relative path; `normalize_path` already
+# strips leading `./` segments, so anything that survives into the normalized
+# form's `/./` shape is INTERIOR by construction.
+norm_has_dot_segment=0
+case "/$NORMALIZED/" in
+  */./*) norm_has_dot_segment=1 ;;
+esac
+if [[ "$norm_has_dot_segment" -eq 1 ]]; then
+  {
+    printf 'SETTINGS PROTECTION: interior dot-segment rejected\n'
+    printf '\n'
+    printf '  File: %s\n' "$SAFE_FILE_PATH"
+    printf "  Rule: path contains an interior '/./' segment; rewrite to a\n"
+    printf '        canonical project-relative path without dot segments.\n'
+  } >&2
+  exit 2
+fi
 # Compute lower-cased path early so the §5b allow-list (and §6/§6b matchers
 # below) all reference a single normalized variable.
 LOWER_NORM=$(printf '%s' "$NORMALIZED" | tr '[:upper:]' '[:lower:]')

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@bookedsolid/rea",
-  "version": "0.28.0",
+  "version": "0.28.2",
   "description": "Agentic governance layer for Claude Code — policy enforcement, hook-based safety gates, audit logging, and Codex-integrated adversarial review for AI-assisted projects",
   "license": "MIT",
   "author": "Booked Solid Technology <oss@bookedsolid.tech> (https://bookedsolid.tech)",

package/scripts/dist-regression-gate.sh CHANGED Viewed

@@ -167,8 +167,52 @@ trap 'rm -rf -- "$WORK"' EXIT HUP INT TERM
 # a new tarball was published. The release.yml rebuild+verify step
 # remains the catching net at publish time, so skipping here does not
 # re-open the BUG-013 attack surface for the merge-to-main path.
-if ! ( cd "$WORK" && npm pack "${PKG_NAME}@${PREV_VERSION}" --silent >/dev/null 2>&1 ); then
-  log "skip — npm pack ${PKG_NAME}@${PREV_VERSION} failed (network issue or registry outage)"
+#
+# 0.29.0: bounded retry loop for npm CDN propagation lag. The memory
+# entries for 0.9.0, 0.12.0, 0.13.0, 0.28.0, and 0.28.1 all note
+# "release verify flaked on npm CDN lag" — `npm view` returns the
+# version metadata but `npm pack` against the same version times out
+# or 404s because the tarball blob has not propagated to all CDN edges
+# yet. The CI-side workflow already has a 12×10s retry (release.yml
+# phase 2); this script runs locally / in PR CI where the failure
+# window is shorter but still occurs.
+#
+# Shape: initial attempt + three retries with sleeps 2s / 8s / 30s.
+# Total worst-case wait = 2 + 8 + 30 = 40s, all on the failure path.
+# That covers the empirically observed CDN propagation window (cf.
+# release.yml phase 2 retry loops) while bounding the local-/ PR-side
+# blocking time to under a minute on a genuine outage.
+NPM_PACK_OK=0
+NPM_PACK_DELAYS=(2 8 30)
+NPM_PACK_ATTEMPTS=$((${#NPM_PACK_DELAYS[@]} + 1))
+# Codex round 1 P2-2: use bash arithmetic for-loop instead of `$(seq 1 N)`.
+# `seq` is not in the preflight tool list (line 104: npm jq git shasum tar)
+# and `set -e` at the top of the script would exit 127 inside the loop body
+# on minimal images that lack it (Alpine, some BusyBox shells). Bash's
+# arithmetic for-loop is a builtin and works on every supported version.
+for ((attempt = 1; attempt <= NPM_PACK_ATTEMPTS; attempt++)); do
+  if ( cd "$WORK" && npm pack "${PKG_NAME}@${PREV_VERSION}" --silent >/dev/null 2>&1 ); then
+    if [ "$attempt" -gt 1 ]; then
+      log "npm pack succeeded after ${attempt} attempt(s) (CDN propagation lag)"
+    fi
+    NPM_PACK_OK=1
+    break
+  fi
+  # Clean up any partial artifact npm pack may have left in $WORK
+  # before retrying so the next attempt has a clean slate.
+  find "$WORK" -maxdepth 1 -type f -name '*.tgz' -delete 2>/dev/null || true
+  if [ "$attempt" -lt "$NPM_PACK_ATTEMPTS" ]; then
+    # Bash array is 0-indexed; $attempt is 1-indexed; index into
+    # NPM_PACK_DELAYS at $attempt-1 to read the delay AFTER this
+    # failed attempt (before the next try).
+    idx=$((attempt - 1))
+    delay="${NPM_PACK_DELAYS[$idx]}"
+    log "npm pack attempt ${attempt}/${NPM_PACK_ATTEMPTS} failed; sleeping ${delay}s for CDN propagation"
+    sleep "$delay"
+  fi
+done
+if [ "$NPM_PACK_OK" -ne 1 ]; then
+  log "skip — npm pack ${PKG_NAME}@${PREV_VERSION} failed after ${NPM_PACK_ATTEMPTS} attempts (network issue, registry outage, or persistent CDN lag)"
   exit 0
 fi