npm - delimit-cli - Versions diffs - 4.1.49 → 4.1.51 - Mend

delimit-cli 4.1.49 → 4.1.51

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (4) hide show

package/CHANGELOG.md +42 -0
package/bin/delimit-setup.js +25 -11
package/gateway/ai/loop_engine.py +36 -2
package/package.json +1 -1

package/CHANGELOG.md CHANGED Viewed

@@ -1,5 +1,47 @@
 # Changelog
+## [4.1.51] - 2026-04-09
+### Fixed (gateway loop engine — LED-814)
+- **`ai/loop_engine.run_governed_iteration` mishandled swarm dispatch statuses.** Only `status=='completed'` was treated as success. The swarm dispatcher returns `'dispatched'` for async handoff, so every build-loop tick fell into the failure branch and logged "Dispatch failed" even though the underlying work shipped. Session `build-loop-2026-04-09` accumulated 6 spurious failures (LED-787 / 788 / 755 / 762 / 799 / 807) for tasks that all actually shipped. Now:
+  - `'completed'` → close ledger + notify deploy loop (unchanged)
+  - `'dispatched'` → mark ledger `in_progress` with the swarm `task_id`, NOT a failure
+  - `'blocked'` → record a founder-approval gate without tripping the circuit breaker
+  - anything else → genuine failure, error message includes the unexpected status string for debuggability
+- Verified live against the running MCP session before this release: `iterations 6→7`, `errors 0`, `LED-814` recorded as `dispatched` with `swarm_task_id task-449ecdf9`.
+- Picked up via the standard `npm run sync-gateway` step in `prepublishOnly` (gateway commit `ce802cd` is now on `delimit-ai/delimit-gateway` main).
+### Added
+- **`tests/test_loop_engine_dispatch_status.py`** in the gateway — covers all four dispatch status branches (`completed` / `dispatched` / `blocked` / unknown), 154 lines, ships with the bundled gateway.
+### Scope
+- Single-purpose patch: gateway loop engine only. This is the deferred half of the multi-model deliberation that produced 4.1.50 — the deliberation explicitly required splitting the gateway fix from the CLAUDE.md regex fix so each ship has a clean rollback story.
+### Tests
+- npm CLI: 134/134 still passing (no CLI changes — bundled gateway only).
+- Gateway: new `test_loop_engine_dispatch_status.py` suite passing.
+## [4.1.50] - 2026-04-09
+### Fixed (CRITICAL — CLAUDE.md in-prose marker clobber)
+- **`upsertDelimitSection` regex was unanchored** — `bin/delimit-setup.js` used `/<!-- delimit:start[^>]*-->/` (no line anchors) to detect the managed-section markers. If a user *quoted* the markers inside backticks in a documentation bullet (e.g. `Use managed-section markers (\`<!-- delimit:start -->\` / \`<!-- delimit:end -->\`)`), the regex matched the prose mention. On the next `delimit setup` run the upsert sliced everything between the prose start and prose end markers and replaced it with a fresh stock template — the exact "never clobber user-customized files" failure mode 4.1.48 / 4.1.49 were written to prevent. Reproduced on `/root/CLAUDE.md` 2026-04-09.
+- Both markers are now anchored to start-of-line with the multiline flag, allow optional leading horizontal whitespace (`/^[ \t]*<!-- delimit:start[^>]*-->[ \t]*$/m`), and the file is BOM-stripped before matching. Result: documentation prose that quotes the markers inside backticks, blockquotes (`> `), or bullets (`- `, `* `) is never matched, while genuine markers — flush-left, indented, or BOM-prefixed — still are. Same fix applied to the test mirror in `tests/setup-onboarding.test.js`.
+### Added
+- **Regression tests** in `tests/setup-onboarding.test.js` covering five failure modes the v4.1.49 regex would have matched incorrectly:
+  - Markers quoted in a bullet via inline backticks (the exact /root/CLAUDE.md 2026-04-09 incident)
+  - Markers with a CRLF (`\r\n`) line ending
+  - File starting with a UTF-8 BOM
+  - Tab- and space-indented real markers (must still be recognized)
+  - Bullet- and blockquote-prefixed markers (must NOT be recognized)
+  Each test asserts both the first-run behavior (`appended` vs `updated`) and that user content survives a subsequent version-bump upgrade verbatim.
+### Scope
+- Single-purpose patch: CLAUDE.md preservation only. Unrelated gateway fixes (e.g. `loop_engine` LED-814) deferred to 4.1.51 per multi-model deliberation, since 4.1.48 and 4.1.49 both shipped with the regression bug undetected and 4.1.50 must stay laser-focused.
+### Tests
+- 134/134 npm CLI tests passing (was 129). New regression suite (`does not match markers quoted in prose`, CRLF, BOM, indented, bullet/blockquote-prefixed) covers every edge case the multi-model deliberation surfaced.
 ## [4.1.49] - 2026-04-09
 ### Fixed (full preservation audit follow-up to 4.1.48)

package/bin/delimit-setup.js CHANGED Viewed

@@ -1362,24 +1362,38 @@ function upsertDelimitSection(filePath) {
         return { action: 'created' };
     }
-    const existing = fs.readFileSync(filePath, 'utf-8');
-    // Check if managed markers already exist
-    const startMarkerRe = /<!-- delimit:start[^>]*-->/;
-    const endMarker = '<!-- delimit:end -->';
-    const hasStart = startMarkerRe.test(existing);
-    const hasEnd = existing.includes(endMarker);
+    const rawExisting = fs.readFileSync(filePath, 'utf-8');
+    // Strip a UTF-8 BOM if present so the start-of-line anchor still matches
+    // the very first line of the file. We write back the stripped form to keep
+    // serialization deterministic.
+    const existing = rawExisting.replace(/^\uFEFF/, '');
+    // Check if managed markers already exist.
+    // Markers MUST be on their own line — anchored with the multiline flag — so
+    // that documentation prose that quotes the markers (e.g. inside backticks,
+    // bullets, or blockquotes) does NOT get mistaken for a real managed section.
+    // The v4.1.49 unanchored regex caused exactly this clobber on /root/CLAUDE.md.
+    // We allow optional leading horizontal whitespace ([ \t]*) so genuinely
+    // indented markers still match, but NOT a leading "- ", "> ", "`", "*", etc.
+    const startMarkerRe = /^[ \t]*<!-- delimit:start[^>]*-->[ \t]*$/m;
+    const endMarkerRe = /^[ \t]*<!-- delimit:end -->[ \t]*$/m;
+    const startMatch = existing.match(startMarkerRe);
+    const endMatch = existing.match(endMarkerRe);
+    const hasStart = !!startMatch;
+    const hasEnd = !!endMatch;
     if (hasStart && hasEnd) {
-        // Extract current version from the marker
-        const versionMatch = existing.match(/<!-- delimit:start v([^ ]+) -->/);
+        // Extract current version from the marker (also anchored, allows indent)
+        const versionMatch = existing.match(/^[ \t]*<!-- delimit:start v([^ ]+) -->[ \t]*$/m);
         const currentVersion = versionMatch ? versionMatch[1] : '';
         if (currentVersion === version) {
             return { action: 'unchanged' };
         }
         // Replace only the managed region — preserve content above/below
-        const before = existing.substring(0, existing.search(startMarkerRe));
-        const after = existing.substring(existing.indexOf(endMarker) + endMarker.length);
+        const startIdx = startMatch.index;
+        const endIdx = endMatch.index + endMatch[0].length;
+        const before = existing.substring(0, startIdx);
+        const after = existing.substring(endIdx);
         fs.writeFileSync(filePath, before + newSection + after);
         return { action: 'updated' };
     }

package/gateway/ai/loop_engine.py CHANGED Viewed

@@ -941,7 +941,12 @@ def run_governed_iteration(session_id: str, hardening: Optional[Any] = None) ->
         session["cost_incurred"] += cost
         from ai.ledger_manager import update_item
-        if dispatch_result.get("status") == "completed":
+        dispatch_status = dispatch_result.get("status")
+        # "completed" = synchronous success (loop engine closes the ledger).
+        # "dispatched" = swarm handed the task to an agent; the ledger stays
+        # in_progress until the agent reports back via delimit_agent_complete.
+        # Both are success outcomes from the loop's perspective.
+        if dispatch_status == "completed":
             update_item(
                 item_id=task["id"],
                 status="done",
@@ -964,6 +969,35 @@ def run_governed_iteration(session_id: str, hardening: Optional[Any] = None) ->
                 )
             except Exception as e:
                 logger.warning("Failed to notify deploy loop for %s: %s", task.get("id"), e)
+        elif dispatch_status == "dispatched":
+            # Async handoff: mark ledger in_progress, leave closure to the agent.
+            dispatched_task_id = dispatch_result.get("task_id", "")
+            try:
+                update_item(
+                    item_id=task["id"],
+                    status="in_progress",
+                    note=(
+                        f"Dispatched to swarm agent via governed build loop "
+                        f"(swarm task_id={dispatched_task_id}). Awaiting agent completion."
+                    ),
+                    project_path=str(ROOT_LEDGER_PATH),
+                )
+            except Exception as e:
+                logger.warning("Failed to mark %s in_progress after dispatch: %s", task.get("id"), e)
+            session["tasks_completed"].append({
+                "id": task["id"],
+                "status": "dispatched",
+                "swarm_task_id": dispatched_task_id,
+                "duration": duration,
+                "cost": cost,
+            })
+        elif dispatch_status == "blocked":
+            # Founder-approval gate — not a failure, don't trip the breaker.
+            session["tasks_completed"].append({
+                "id": task["id"],
+                "status": "blocked",
+                "reason": dispatch_result.get("reason", "Requires founder approval"),
+            })
         else:
             session["errors"] += 1
             if session["errors"] >= session["error_threshold"]:
@@ -971,7 +1005,7 @@ def run_governed_iteration(session_id: str, hardening: Optional[Any] = None) ->
             session["tasks_completed"].append({
                 "id": task["id"],
                 "status": "failed",
-                "error": dispatch_result.get("error", "Dispatch failed")
+                "error": dispatch_result.get("error", f"Dispatch failed (status={dispatch_status!r})"),
             })
         _save_session(session)

package/package.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "name": "delimit-cli",
   "mcpName": "io.github.delimit-ai/delimit-mcp-server",
-  "version": "4.1.49",
+  "version": "4.1.51",
   "description": "Unify Claude Code, Codex, Cursor, and Gemini CLI with persistent context, governance, and multi-model debate.",
   "main": "index.js",
   "files": [