npm - @esoteric-logic/praxis-harness - Versions diffs - 2.9.1 → 2.10.0 - Mend

@esoteric-logic/praxis-harness 2.9.1 → 2.10.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/base/CLAUDE.md +8 -0
package/base/configs/registry.json +2 -1
package/base/hooks/on-stop-failure.sh +121 -0
package/base/hooks/settings-hooks.json +11 -0
package/base/rules/desktop-protocol.md +64 -0
package/package.json +1 -1

package/base/CLAUDE.md CHANGED Viewed

@@ -56,6 +56,13 @@ Vault path and backend are machine-specific. Read from `~/.claude/praxis.config.
 If config file is missing: tell the user to run `praxis/install.sh`.
 All `{vault_path}` references in rules and skills resolve from this config.
+## Model & Context Policy
+- Default model: `claude-opus-4-6` (set `ANTHROPIC_MODEL` in shell profile)
+- Sub-agents spawned by Praxis skills: use `haiku` for polling, search, and lint tasks
+- Compact trigger: when context approaches ceiling, finish the current milestone first
+- Never compact mid-plan — complete the milestone, write phase summary to vault, then compact
+- After compaction: re-bootstrap from § After Compaction below, re-run quality checks fresh
 ## Durable Memory
 Context is volatile. Files are permanent. Act accordingly.
@@ -164,6 +171,7 @@ Kit manifests live in `~/.claude/kits/<name>/KIT.md`.
 | `~/.claude/rules/powershell.md` | `**/*.ps1`, `**/*.psm1` |
 | `~/.claude/rules/dependency-freshness.md` | `package.json`, `go.mod`, `requirements.txt`, `Cargo.toml`, `pyproject.toml` |
 | `~/.claude/rules/live-docs-required.md` | Dependency manifests, files importing external packages |
+| `~/.claude/rules/desktop-protocol.md` | Claude Desktop ↔ Claude Code handoff sessions |
 ### Auto-invocable skills (replace former universal rules)
 | Skill | Triggers when |

package/base/configs/registry.json CHANGED Viewed

@@ -101,7 +101,8 @@
       {"path": "base/hooks/file-guard.sh", "event": "PreToolUse", "matcher": "Write|Edit|MultiEdit"},
       {"path": "base/hooks/identity-check.sh", "event": "PreToolUse", "matcher": "Bash"},
       {"path": "base/hooks/credential-guard.sh", "event": "PreToolUse", "matcher": "Bash"},
-      {"path": "base/hooks/session-data-collect.sh", "event": "Stop", "matcher": ""}
+      {"path": "base/hooks/session-data-collect.sh", "event": "Stop", "matcher": ""},
+      {"path": "base/hooks/on-stop-failure.sh", "event": "StopFailure", "matcher": ""}
     ],
     "optional": [
       {"path": "base/hooks/recursion-guard.sh", "event": "PreToolUse", "matcher": "", "feature": "recursion-detection"},

package/base/hooks/on-stop-failure.sh ADDED Viewed

@@ -0,0 +1,121 @@
+#!/usr/bin/env bash
+# on-stop-failure.sh — StopFailure hook (Auto mode failure classifier)
+# Reads Claude's stop context, classifies the failure, and either:
+#   - Auto-repairs (transient: test fail, lint fail, tool error) → exit 0 (continue)
+#   - Escalates to human (boundary/spec/security violation) → exit 1 (halt)
+# Tracks repair attempts per failure fingerprint to prevent infinite loops.
+# Reuses PPID-scoped state pattern from recursion-guard.sh.
+set -euo pipefail
+if ! command -v jq &>/dev/null; then
+  echo "on-stop-failure: jq required but not found. Cannot classify failure." >&2
+  exit 1
+fi
+# ── Parse input (single jq call) ──
+INPUT=$(cat)
+PARSED=$(echo "$INPUT" | jq -r '[
+  (.exit_code // 1 | tostring),
+  (.stop_reason // ""),
+  (.last_tool_use.name // ""),
+  (.last_tool_use.error // "")
+] | join("\t")' 2>/dev/null || echo "1\t\t\t")
+IFS=$'\t' read -r EXIT_CODE STOP_REASON LAST_TOOL TOOL_ERROR <<< "$PARSED"
+MAX_REPAIR_ATTEMPTS=3
+MAX_FINGERPRINT_LEN=200
+# ── Repair attempt tracker (PPID-scoped, same pattern as recursion-guard.sh) ──
+REPAIR_STATE="/tmp/praxis-repair-${PPID}.json"
+if [[ ! -f "$REPAIR_STATE" ]]; then
+  echo '{"repair_attempts":0,"last_fingerprint":""}' > "$REPAIR_STATE"
+fi
+# Compare raw fingerprint strings — no hash needed for equality checks
+FINGERPRINT="${EXIT_CODE}:${LAST_TOOL}:${STOP_REASON:0:$MAX_FINGERPRINT_LEN}"
+# Single jq call to read both fields
+STATE_READ=$(jq -r '[(.last_fingerprint // ""), (.repair_attempts // 0 | tostring)] | join("\t")' "$REPAIR_STATE" 2>/dev/null || echo $'\t0')
+IFS=$'\t' read -r LAST_FINGERPRINT CURRENT_ATTEMPTS <<< "$STATE_READ"
+# Same fingerprint = looping on the same error
+if [[ "$FINGERPRINT" == "$LAST_FINGERPRINT" ]]; then
+  CURRENT_ATTEMPTS=$((CURRENT_ATTEMPTS + 1))
+else
+  CURRENT_ATTEMPTS=1
+fi
+# Atomic state file update (tmp + mv)
+TMP_STATE="${REPAIR_STATE}.tmp"
+jq -n --argjson attempts "$CURRENT_ATTEMPTS" --arg fp "$FINGERPRINT" \
+  '{"repair_attempts": $attempts, "last_fingerprint": $fp}' \
+  > "$TMP_STATE" && mv "$TMP_STATE" "$REPAIR_STATE"
+# ── Hard escalation: too many repair attempts on same failure ──
+if [[ $CURRENT_ATTEMPTS -gt $MAX_REPAIR_ATTEMPTS ]]; then
+  echo "AUTO-REPAIR EXHAUSTED: $CURRENT_ATTEMPTS attempts on same failure." >&2
+  echo "Failure: exit=$EXIT_CODE tool=$LAST_TOOL" >&2
+  echo "Halting — human review required." >&2
+  CONFIG="$HOME/.claude/praxis.config.json"
+  if [[ -f "$CONFIG" ]]; then
+    VAULT_PATH=$(jq -r '.vault_path // ""' "$CONFIG" 2>/dev/null)
+    if [[ -n "$VAULT_PATH" && -d "$VAULT_PATH" ]]; then
+      TIMESTAMP=$(date -u +"%Y-%m-%dT%H:%M:%SZ")
+      ESCALATION_FILE="$VAULT_PATH/notes/escalations.md"
+      if [[ ! -f "$ESCALATION_FILE" ]]; then
+        printf -- "---\ntags: [escalation, auto-repair]\ndate: %s\nsource: agent\n---\n\n# Auto-Repair Escalations\n" \
+          "$(date +%Y-%m-%d)" > "$ESCALATION_FILE"
+      fi
+      printf "\n## %s\n- **Failure**: exit=%s tool=%s\n- **Attempts**: %s\n- **Reason**: %s\n- **Status**: HALTED — needs human review\n" \
+        "$TIMESTAMP" "$EXIT_CODE" "$LAST_TOOL" "$CURRENT_ATTEMPTS" "${STOP_REASON:0:$MAX_FINGERPRINT_LEN}" \
+        >> "$ESCALATION_FILE"
+    fi
+  fi
+  exit 1
+fi
+# ── Hard escalation patterns (single regex, never auto-repair these) ──
+HARD_REGEX="BLOCKED:|secret detected|Protected path|out of scope|permission denied|identity mismatch"
+COMBINED_CONTEXT="$STOP_REASON $TOOL_ERROR"
+if echo "$COMBINED_CONTEXT" | grep -qiE "$HARD_REGEX"; then
+  MATCHED=$(echo "$COMBINED_CONTEXT" | grep -oiE "$HARD_REGEX" | head -1)
+  echo "ESCALATING: Hard violation — '$MATCHED' matched." >&2
+  echo "Auto-repair suppressed. Human review required." >&2
+  exit 1
+fi
+# Exit code 2 = guard hard-block (recursion-guard.sh, secret-scan.sh, file-guard.sh convention)
+if [[ "$EXIT_CODE" == "2" ]]; then
+  echo "ESCALATING: Exit code 2 = guard hard-block. No auto-repair." >&2
+  exit 1
+fi
+# ── Transient failure → emit repair prompt (Auto mode continues) ──
+# Claude Code reads stdout from StopFailure hooks as an injected prompt.
+cat <<REPAIR
+Auto-repair attempt $CURRENT_ATTEMPTS of $MAX_REPAIR_ATTEMPTS.
+Failure context:
+- Exit code: $EXIT_CODE
+- Last tool: $LAST_TOOL
+- Stop reason: ${STOP_REASON:0:500}
+- Tool error: ${TOOL_ERROR:0:$MAX_FINGERPRINT_LEN}
+Instructions:
+1. Read the error above carefully. Identify the root cause from actual output.
+2. Apply the minimum fix required. Do not expand scope.
+3. Re-run validation (tests + lint) after fixing.
+4. If this is attempt $CURRENT_ATTEMPTS of $MAX_REPAIR_ATTEMPTS and the fix is not obvious:
+   report What / So What / Now What and halt.
+Do NOT re-attempt the same approach that just failed.
+REPAIR
+exit 0

package/base/hooks/settings-hooks.json CHANGED Viewed

@@ -68,6 +68,17 @@
         ]
       }
     ],
+    "StopFailure": [
+      {
+        "matcher": "",
+        "hooks": [
+          {
+            "type": "command",
+            "command": "bash ~/.claude/hooks/on-stop-failure.sh"
+          }
+        ]
+      }
+    ],
     "PreCompact": [
       {
         "matcher": "",

package/base/rules/desktop-protocol.md ADDED Viewed

@@ -0,0 +1,64 @@
+# Desktop Protocol — Rules
+# Scope: Sessions involving Claude Desktop ↔ Claude Code handoff
+# Defines role boundaries and structured handoff format
+## Role Boundaries
+| Surface | Role | Responsible for |
+|---------|------|-----------------|
+| Claude Desktop | Architect / Reviewer | ADR review, security audit, diff validation, prompt engineering |
+| Claude Code | Executor | File writes, git ops, test runs, tool use, vault updates |
+Desktop generates structured intent. Code executes it.
+Never use Desktop for file writes. Never use Code for open-ended architecture debate.
+## Handoff Format — Desktop → Code
+When Desktop completes a review or decision, it emits a structured handoff block.
+Code reads this block as its initial task context.
+```
+HANDOFF:
+  TASK: [one-line description]
+  SPEC: [link to vault plan or spec file]
+  CONSTRAINTS: [hard limits — what must NOT change]
+  ACCEPTANCE: [how to know it's done]
+  CONTEXT: [optional — key decisions or rationale from review]
+```
+Code MUST NOT start implementation without at least TASK and ACCEPTANCE filled.
+If SPEC is missing, Code asks before proceeding.
+## Handoff Format — Code → Desktop
+When Code completes implementation and wants architectural review:
+```
+REVIEW-REQUEST:
+  TASK: [what was implemented]
+  DIFF: [git diff summary or branch name]
+  SPEC: [link to plan that drove this work]
+  QUESTIONS: [specific things to review — not "does this look good"]
+```
+## Vault Write-Back
+After Desktop review, append the decision to `{vault_path}/notes/review-decisions.md`:
+```
+## YYYY-MM-DD | [task-slug]
+- **Decision**: APPROVED | CHANGES_REQUESTED | DEFERRED
+- **Rationale**: [1-2 sentences]
+- **Action items**: [if CHANGES_REQUESTED — specific items for Code to address]
+```
+## Conventions — WARN on violation
+- Desktop review decisions that affect architecture go to `{vault_path}/specs/` as ADRs
+- Desktop review decisions that are tactical (naming, style, small refactors) stay in `review-decisions.md`
+- Code must not self-approve architectural changes — route through Desktop review
+- If Desktop is unavailable, Code may proceed but must flag the decision in `status.md` for later review
+## Removal Condition
+Remove when a unified Claude surface handles both architecture review and code execution
+natively, eliminating the need for explicit handoff protocols.

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@esoteric-logic/praxis-harness",
-  "version": "2.9.1",
+  "version": "2.10.0",
   "description": "Layered Claude Code harness — workflow discipline, AI-Kits, persistent vault integration",
   "bin": {
     "praxis-harness": "./bin/praxis.js"