npm - ultimate-pi - Versions diffs - 0.17.0 → 0.18.1 - Mend

ultimate-pi 0.17.0 → 0.18.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (137) hide show

package/.agents/skills/harness-context/SKILL.md +13 -6
package/.agents/skills/harness-debate-plan/SKILL.md +37 -20
package/.agents/skills/harness-decisions/SKILL.md +1 -1
package/.agents/skills/harness-eval/SKILL.md +6 -21
package/.agents/skills/harness-governor/SKILL.md +4 -3
package/.agents/skills/harness-orchestration/SKILL.md +41 -53
package/.agents/skills/harness-plan/SKILL.md +23 -12
package/.agents/skills/harness-review/SKILL.md +52 -0
package/.agents/skills/harness-sentrux-setup/SKILL.md +16 -3
package/.agents/skills/harness-steer/SKILL.md +14 -0
package/.agents/skills/sentrux/SKILL.md +9 -9
package/.pi/agents/harness/planning/decompose.md +7 -4
package/.pi/agents/harness/planning/hypothesis-validator.md +2 -0
package/.pi/agents/harness/planning/hypothesis.md +3 -1
package/.pi/agents/harness/planning/plan-adversary.md +2 -0
package/.pi/agents/harness/planning/plan-evaluator.md +2 -0
package/.pi/agents/harness/planning/plan-synthesizer.md +25 -0
package/.pi/agents/harness/planning/planning-context.md +48 -0
package/.pi/agents/harness/planning/review-integrator.md +2 -0
package/.pi/agents/harness/planning/sprint-contract-auditor.md +2 -0
package/.pi/agents/harness/{adversary.md → reviewing/adversary.md} +3 -10
package/.pi/agents/harness/{evaluator.md → reviewing/evaluator.md} +3 -12
package/.pi/agents/harness/running/executor.md +45 -0
package/.pi/agents/harness/sentrux-steward.md +51 -0
package/.pi/extensions/00-harness-project-control.ts +133 -0
package/.pi/extensions/00-posthog-network-bootstrap.ts +11 -0
package/.pi/extensions/budget-guard.ts +2 -0
package/.pi/extensions/debate-orchestrator.ts +2 -0
package/.pi/extensions/harness-ask-user.ts +2 -2
package/.pi/extensions/harness-debate-tools.ts +2 -2
package/.pi/extensions/harness-live-widget.ts +60 -3
package/.pi/extensions/harness-plan-approval.ts +64 -58
package/.pi/extensions/harness-run-context.ts +715 -90
package/.pi/extensions/harness-subagent-submit.ts +46 -12
package/.pi/extensions/harness-subagents.ts +2 -2
package/.pi/extensions/harness-telemetry.ts +2 -0
package/.pi/extensions/harness-web-tools.ts +2 -2
package/.pi/extensions/lib/extension-load-guard.ts +10 -0
package/.pi/extensions/lib/harness-artifact-gate.ts +172 -0
package/.pi/extensions/lib/harness-posthog.ts +9 -5
package/.pi/extensions/lib/harness-spawn-topology.ts +165 -0
package/.pi/extensions/lib/harness-subagent-auth.ts +1 -2
package/.pi/extensions/lib/harness-subagent-policy.ts +28 -24
package/.pi/extensions/lib/harness-subagent-precheck.ts +36 -10
package/.pi/extensions/lib/harness-subagent-submit-pipeline.ts +66 -2
package/.pi/extensions/lib/harness-subagent-submit-registry.ts +22 -22
package/.pi/extensions/lib/harness-subagents-bridge.ts +7 -29
package/.pi/extensions/lib/harness-subprocess-bootstrap.ts +73 -0
package/.pi/extensions/lib/plan-approval/create-plan.ts +2 -3
package/.pi/extensions/lib/plan-approval/resolve-disk.ts +102 -0
package/.pi/extensions/lib/plan-approval/schema.ts +22 -8
package/.pi/extensions/lib/plan-approval/types.ts +1 -1
package/.pi/extensions/lib/plan-approval/validate.ts +2 -2
package/.pi/extensions/lib/plan-approval-readiness.ts +192 -0
package/.pi/extensions/lib/plan-debate-eligibility.ts +12 -5
package/.pi/extensions/lib/plan-debate-gate.ts +22 -1
package/.pi/extensions/lib/plan-debate-lanes.ts +32 -2
package/.pi/extensions/lib/plan-review-gate.ts +8 -0
package/.pi/extensions/lib/posthog-client.ts +76 -0
package/.pi/extensions/lib/spawn-policy.ts +3 -3
package/.pi/extensions/observation-bus.ts +2 -0
package/.pi/extensions/policy-gate.ts +26 -19
package/.pi/extensions/review-integrity.ts +91 -10
package/.pi/extensions/sentrux-rules-sync.ts +2 -0
package/.pi/extensions/test-diff-integrity.ts +1 -0
package/.pi/extensions/trace-recorder.ts +2 -0
package/.pi/harness/agents.manifest.json +37 -37
package/.pi/harness/corpus/cron.example +8 -0
package/.pi/harness/corpus/graphify-kb-updater.config.json +214 -0
package/.pi/harness/corpus/systemd/graphify-kb-updater.env.template +4 -0
package/.pi/harness/corpus/systemd/graphify-kb-updater.service +17 -0
package/.pi/harness/corpus/systemd/graphify-kb-updater.timer +11 -0
package/.pi/harness/docs/adrs/0001-harness-constitution.md +2 -1
package/.pi/harness/docs/adrs/0006-sentrux-dual-layer.md +8 -6
package/.pi/harness/docs/adrs/0009-sentrux-rules-lifecycle.md +6 -1
package/.pi/harness/docs/adrs/0031-harness-run-context.md +1 -1
package/.pi/harness/docs/adrs/0032-harness-command-orchestration.md +7 -0
package/.pi/harness/docs/adrs/0034-darwin-plan-research-pipeline.md +3 -3
package/.pi/harness/docs/adrs/0036-implementation-research-and-selective-debate.md +8 -5
package/.pi/harness/docs/adrs/0039-harness-post-run-review-gate.md +47 -0
package/.pi/harness/docs/adrs/0040-practice-grounded-orchestration.md +40 -0
package/.pi/harness/docs/adrs/0041-intelligent-planning-reconnaissance.md +39 -0
package/.pi/harness/docs/adrs/0042-agent-native-orchestration.md +35 -0
package/.pi/harness/docs/adrs/0043-path-first-harness-tools.md +38 -0
package/.pi/harness/docs/adrs/0044-harness-steer-loop.md +37 -0
package/.pi/harness/docs/adrs/0045-phase-scoped-agent-directories.md +33 -0
package/.pi/harness/docs/adrs/README.md +11 -0
package/.pi/harness/docs/graphify-kb-updater-runbook.md +163 -0
package/.pi/harness/docs/practice-map.md +110 -0
package/.pi/harness/env.harness.template +5 -3
package/.pi/harness/evals/smoke/sentrux-stub.json +1 -1
package/.pi/harness/evals/smoke/smoke-harness-plan.mjs +5 -2
package/.pi/harness/specs/README.md +1 -1
package/.pi/harness/specs/harness-run-context.schema.json +11 -0
package/.pi/harness/specs/harness-spawn-context.schema.json +15 -1
package/.pi/harness/specs/plan-execution-plan.schema.json +39 -1
package/.pi/harness/specs/plan-packet.schema.json +4 -0
package/.pi/harness/specs/plan-phase-status.schema.json +17 -0
package/.pi/harness/specs/plan-phase-waiver.schema.json +25 -0
package/.pi/harness/specs/plan-planning-context.schema.json +50 -0
package/.pi/harness/specs/repair-brief.schema.json +45 -0
package/.pi/harness/specs/review-outcome.schema.json +46 -0
package/.pi/harness/specs/sentrux-manifest-proposal.schema.json +80 -0
package/.pi/harness/specs/sentrux-signal.schema.json +43 -0
package/.pi/harness/specs/steer-state.schema.json +20 -0
package/.pi/lib/harness-context-mode-policy.ts +256 -0
package/.pi/lib/harness-project-config.ts +91 -0
package/.pi/lib/harness-repair-brief.ts +145 -0
package/.pi/lib/harness-run-context.ts +591 -32
package/.pi/lib/harness-ui-state.ts +114 -21
package/.pi/prompts/harness-auto.md +10 -10
package/.pi/prompts/harness-critic.md +3 -30
package/.pi/prompts/harness-eval.md +4 -37
package/.pi/prompts/harness-plan.md +116 -54
package/.pi/prompts/harness-review.md +150 -15
package/.pi/prompts/harness-run.md +62 -10
package/.pi/prompts/harness-sentrux-steward.md +55 -0
package/.pi/prompts/harness-setup.md +5 -4
package/.pi/prompts/harness-steer.md +30 -0
package/.pi/scripts/README.md +1 -0
package/.pi/scripts/graphify-kb-updater.mjs +398 -0
package/.pi/scripts/harness-agents-manifest.mjs +1 -1
package/.pi/scripts/harness-project-toggle.mjs +129 -0
package/.pi/scripts/harness-sentrux-cli.mjs +142 -0
package/.pi/scripts/harness-verify.mjs +22 -6
package/.pi/scripts/harness-web-policy-guard.mjs +68 -0
package/.pi/scripts/validate-plan-dag.mjs +3 -3
package/AGENTS.md +1 -0
package/CHANGELOG.md +23 -0
package/README.md +94 -58
package/package.json +5 -4
package/.pi/agents/harness/executor.md +0 -47
package/.pi/agents/harness/planning/scout-graphify.md +0 -37
package/.pi/agents/harness/planning/scout-semantic.md +0 -39
package/.pi/agents/harness/planning/scout-structure.md +0 -35
package/.pi/prompts/git-sync.md +0 -124
/package/.pi/agents/harness/{tie-breaker.md → reviewing/tie-breaker.md} +0 -0

package/.pi/extensions/review-integrity.ts CHANGED Viewed

@@ -8,6 +8,7 @@
 import { appendFile, mkdir } from "node:fs/promises";
 import { join } from "node:path";
 import type { ExtensionAPI } from "@earendil-works/pi-coding-agent";
+import { isHarnessProjectEnabled } from "../lib/harness-project-config.js";
 type HarnessPhase = "plan" | "execute" | "evaluate" | "adversary" | "merge";
@@ -15,12 +16,13 @@ const INCIDENTS_DIR = join(process.cwd(), ".pi", "harness", "incidents");
 const INCIDENT_FILE = join(INCIDENTS_DIR, "review-integrity.jsonl");
 const REVIEW_SUBAGENT_TYPES = new Set([
-	"harness/evaluator",
-	"harness/adversary",
-	"harness/tie-breaker",
+	"harness/reviewing/evaluator",
+	"harness/reviewing/adversary",
+	"harness/reviewing/tie-breaker",
 ]);
-const EXECUTOR_SUBAGENT_TYPE = "harness/executor";
+const EXECUTOR_SUBAGENT_TYPE = "harness/running/executor";
+const PLANNING_SUBAGENT_PREFIX = "harness/planning/";
 interface IsolationState {
 	executorSessionId: string | null;
@@ -138,6 +140,70 @@ function agentsFromSubagentInput(
 	return names;
 }
+function latestCustomData(
+	entries: SessionEntryLike[],
+	customType: string,
+): Record<string, unknown> | null {
+	for (let i = entries.length - 1; i >= 0; i--) {
+		const entry = entries[i];
+		if (entry.type !== "custom" || entry.customType !== customType) continue;
+		return entry.data && typeof entry.data === "object" ? entry.data : null;
+	}
+	return null;
+}
+function collectStrings(value: unknown, depth = 0): string[] {
+	if (depth > 5 || value == null) return [];
+	if (typeof value === "string") return [value];
+	if (Array.isArray(value)) {
+		return value.flatMap((item) => collectStrings(item, depth + 1));
+	}
+	if (typeof value === "object") {
+		return Object.values(value).flatMap((item) =>
+			collectStrings(item, depth + 1),
+		);
+	}
+	return [];
+}
+export function hasPlanReviseRecommendation(entries: unknown[]): boolean {
+	const typedEntries = entries as SessionEntryLike[];
+	const runContext = latestCustomData(typedEntries, "harness-run-context");
+	const text = collectStrings({
+		next_recommended_command: runContext?.next_recommended_command,
+		last_completed_step: runContext?.last_completed_step,
+		last_outcome: runContext?.last_outcome,
+		phase: runContext?.phase,
+	})
+		.join("\n")
+		.toLowerCase();
+	return text.includes("/harness-plan") && text.includes("revise");
+}
+export function isPlanRevisePlanningSubagent(input: {
+	agents: string[];
+	entries: unknown[];
+	toolInput?: Record<string, unknown>;
+}): boolean {
+	if (input.agents.length === 0) return false;
+	if (
+		!input.agents.every((agent) => agent.startsWith(PLANNING_SUBAGENT_PREFIX))
+	) {
+		return false;
+	}
+	if (hasPlanReviseRecommendation(input.entries)) return true;
+	const toolText = collectStrings(input.toolInput).join("\n").toLowerCase();
+	return (
+		toolText.includes("harness-plan") &&
+		(toolText.includes("mode: revise") ||
+			toolText.includes("mode=revise") ||
+			toolText.includes("--mode revise") ||
+			toolText.includes("--mode=revise"))
+	);
+}
 async function appendIncident(payload: Record<string, unknown>): Promise<void> {
 	await mkdir(INCIDENTS_DIR, { recursive: true });
 	await appendFile(
@@ -148,6 +214,7 @@ async function appendIncident(payload: Record<string, unknown>): Promise<void> {
 }
 export default function reviewIntegrity(pi: ExtensionAPI) {
+	if (!isHarnessProjectEnabled()) return;
 	let state: IsolationState = {
 		executorSessionId: null,
 		violationActive: false,
@@ -175,7 +242,10 @@ export default function reviewIntegrity(pi: ExtensionAPI) {
 		const phase = getPhase(ctx);
 		const currentSessionId = ctx.sessionManager.getSessionId();
 		const inReview = phase === "evaluate" || phase === "adversary";
-		if (!inReview) {
+		if (
+			!inReview ||
+			hasPlanReviseRecommendation(ctx.sessionManager.getEntries())
+		) {
 			state.violationActive = false;
 			state.updatedAt = nowIso();
 			persist();
@@ -201,7 +271,7 @@ export default function reviewIntegrity(pi: ExtensionAPI) {
 				customType: "harness-review-integrity-hint",
 				display: true,
 				content: [
-					"Review phase in executor session: spawn harness/evaluator or harness/adversary via subagent (isolated subprocess).",
+					"Review phase in executor session: spawn harness/reviewing/evaluator or harness/reviewing/adversary via subagent (isolated subprocess).",
 					"Do not run review checks directly in this session.",
 				].join("\n"),
 			},
@@ -210,9 +280,8 @@ export default function reviewIntegrity(pi: ExtensionAPI) {
 	pi.on("tool_call", async (event, ctx) => {
 		if (event.toolName === "subagent") {
-			const agents = agentsFromSubagentInput(
-				event.input as Record<string, unknown> | undefined,
-			);
+			const toolInput = event.input as Record<string, unknown> | undefined;
+			const agents = agentsFromSubagentInput(toolInput);
 			if (agents.includes(EXECUTOR_SUBAGENT_TYPE)) {
 				state.executorSessionId = ctx.sessionManager.getSessionId();
 				state.violationActive = false;
@@ -226,6 +295,18 @@ export default function reviewIntegrity(pi: ExtensionAPI) {
 				persist();
 				return undefined;
 			}
+			if (
+				isPlanRevisePlanningSubagent({
+					agents,
+					entries: ctx.sessionManager.getEntries(),
+					toolInput,
+				})
+			) {
+				state.violationActive = false;
+				state.updatedAt = nowIso();
+				persist();
+				return undefined;
+			}
 		}
 		if (!state.violationActive) return undefined;
@@ -237,7 +318,7 @@ export default function reviewIntegrity(pi: ExtensionAPI) {
 			reason:
 				"direct tool use in review phase while sharing executor session context",
 			mitigation:
-				"spawn harness/evaluator or harness/adversary via subagent instead",
+				"spawn harness/reviewing/evaluator or harness/reviewing/adversary via subagent instead",
 		});
 		return {

package/.pi/extensions/sentrux-rules-sync.ts CHANGED Viewed

@@ -4,6 +4,7 @@
 import { spawn } from "node:child_process";
 import type { ExtensionAPI } from "@earendil-works/pi-coding-agent";
+import { isHarnessProjectEnabled } from "../lib/harness-project-config.js";
 import { resolveHarnessScript } from "./lib/harness-paths.js";
 function resolveSyncScript(): string {
@@ -36,6 +37,7 @@ function runSync(args: string[]): Promise<{ code: number; output: string }> {
 }
 export default function sentruxRulesSync(pi: ExtensionAPI) {
+	if (!isHarnessProjectEnabled()) return;
 	pi.on("session_start", async () => {
 		const { code, output } = await runSync(["--check"]);
 		if (code !== 0) {

package/.pi/extensions/test-diff-integrity.ts CHANGED Viewed

@@ -13,6 +13,7 @@
 import { appendFile, mkdir } from "node:fs/promises";
 import { join } from "node:path";
 import type { ExtensionAPI } from "@earendil-works/pi-coding-agent";
+import { isHarnessProjectEnabled } from "../lib/harness-project-config.js";
 const INCIDENTS_DIR = join(process.cwd(), ".pi", "harness", "incidents");
 const INCIDENT_FILE = join(INCIDENTS_DIR, "test-diff-integrity.jsonl");

package/.pi/extensions/trace-recorder.ts CHANGED Viewed

@@ -10,6 +10,7 @@
 import { appendFile, mkdir, readFile, writeFile } from "node:fs/promises";
 import { join } from "node:path";
 import type { ExtensionAPI } from "@earendil-works/pi-coding-agent";
+import { isHarnessProjectEnabled } from "../lib/harness-project-config.js";
 import {
 	getLatestRunContext,
 	getRunIdFromSession,
@@ -182,6 +183,7 @@ function resolveRunIdForAgentStart(
 }
 export default function traceRecorder(pi: ExtensionAPI) {
+	if (!isHarnessProjectEnabled()) return;
 	let activeRun: ActiveRun | null = null;
 	let lastUserPrompt = "";

package/.pi/harness/agents.manifest.json CHANGED Viewed

@@ -1,8 +1,8 @@
 {
 	"schema_version": "1.0.0",
 	"package": "ultimate-pi",
-	"package_version": "0.15.0",
-	"generated_at": "2026-05-19T12:56:13.369Z",
+	"package_version": "0.18.0",
+	"generated_at": "2026-05-23T19:00:12.987Z",
 	"agents": {
 		"pi-pi/agent-expert": {
 			"path": ".pi/agents/pi-pi/agent-expert.md",
@@ -44,18 +44,6 @@
 			"path": ".pi/agents/pi-pi/tui-expert.md",
 			"sha256": "a619b2ee3d3d94fe599abb61db0904f90d30335ec426851c3f1efdf2e5ce5390"
 		},
-		"harness/adversary": {
-			"path": ".pi/agents/harness/adversary.md",
-			"sha256": "560c7571ab91478bde1271e9ae6c3a112c3e1d28e1a261c5450fd1d00f9f89af"
-		},
-		"harness/evaluator": {
-			"path": ".pi/agents/harness/evaluator.md",
-			"sha256": "a4667d3efb305ba2fe79118e3d7d2b0de5e0369637af040d1238161d75cd28ac"
-		},
-		"harness/executor": {
-			"path": ".pi/agents/harness/executor.md",
-			"sha256": "6baffcc3d89954494ce3ae439175686a39928b6a543a0a451da27475094b1712"
-		},
 		"harness/incident-recorder": {
 			"path": ".pi/agents/harness/incident-recorder.md",
 			"sha256": "d42fa45de1a2fe3842d075c6f319315266588942e314f1b650caabac39bdc29a"
@@ -68,17 +56,33 @@
 			"path": ".pi/agents/harness/sentrux-bootstrap.md",
 			"sha256": "3a0b43b94386a7c541b8a806a37524a5e53f1c8049270db7a420680df5799eeb"
 		},
-		"harness/tie-breaker": {
-			"path": ".pi/agents/harness/tie-breaker.md",
-			"sha256": "1c54c1c3274291dea1ea8826563a7ad4fe1d9c4302984e907bfcd22cfc4f5eba"
+		"harness/sentrux-steward": {
+			"path": ".pi/agents/harness/sentrux-steward.md",
+			"sha256": "0e63175d817adc0d65876f5c24fb54e4882081caf939ff9c658afee51fc6889c"
 		},
 		"harness/trace-librarian": {
 			"path": ".pi/agents/harness/trace-librarian.md",
 			"sha256": "336b3f3f6141cef8750ab18d29bbe454caf26973830a86afe099d9e4ad8b0abe"
 		},
+		"harness/running/executor": {
+			"path": ".pi/agents/harness/running/executor.md",
+			"sha256": "a48c37b2922b98fe20156367ae8c8fe761ae139153d402035a5aa35c9a14f106"
+		},
+		"harness/reviewing/adversary": {
+			"path": ".pi/agents/harness/reviewing/adversary.md",
+			"sha256": "697ee7c784e8eb30ce96f4f16e9bb5f9cdcaae76a4a7083ace2fe4272e6d732f"
+		},
+		"harness/reviewing/evaluator": {
+			"path": ".pi/agents/harness/reviewing/evaluator.md",
+			"sha256": "587ae14d6e91fd8af2b2842f568b9a1fa0b1d84fa6e18b4bc21c0ba2a9e62218"
+		},
+		"harness/reviewing/tie-breaker": {
+			"path": ".pi/agents/harness/reviewing/tie-breaker.md",
+			"sha256": "1c54c1c3274291dea1ea8826563a7ad4fe1d9c4302984e907bfcd22cfc4f5eba"
+		},
 		"harness/planning/decompose": {
 			"path": ".pi/agents/harness/planning/decompose.md",
-			"sha256": "0919dafa1d1cd008d513c28524c1e7218867586a138982dccf01db5270c42c73"
+			"sha256": "734eaa1bc87c337f6582c8f1c97baabf51e807731ab3c075c8960a9d207145e2"
 		},
 		"harness/planning/execution-plan-author": {
 			"path": ".pi/agents/harness/planning/execution-plan-author.md",
@@ -86,43 +90,39 @@
 		},
 		"harness/planning/hypothesis-validator": {
 			"path": ".pi/agents/harness/planning/hypothesis-validator.md",
-			"sha256": "36f0baa7796229f21bd02faf5e70402c7bf054289eab557a25bfbe3cb7781de7"
+			"sha256": "20411e5d734b14b05ae11153133089e044f46784e5b4741712f608665bbf4376"
 		},
 		"harness/planning/hypothesis": {
 			"path": ".pi/agents/harness/planning/hypothesis.md",
-			"sha256": "e83d5c4faaee8d32af4a5f22c9917b70a173f3e22d7c0f182b361706f2309171"
+			"sha256": "bbb91ac0de39c9de4bf388f0cf926151b6b6a7771d2a0d01d1009a1860daef77"
 		},
 		"harness/planning/implementation-researcher": {
 			"path": ".pi/agents/harness/planning/implementation-researcher.md",
-			"sha256": "653f320b5d51bb331774246687f24a75347b406bba4e6dfd2968d6e5d4cc8bb3"
+			"sha256": "d1bbaaf1e67ad98350319f973062f01a25ca70874c99cb335c99bec866da1f6d"
 		},
 		"harness/planning/plan-adversary": {
 			"path": ".pi/agents/harness/planning/plan-adversary.md",
-			"sha256": "3241d7ec939dc29e0af64690b99e9f74b209f40b0daa4a2a1f9ff86f99f94a8d"
+			"sha256": "d9a953c0f8f900dc9a95816ada401955dafade7bf5907406cbe3bf3ba760c469"
 		},
 		"harness/planning/plan-evaluator": {
 			"path": ".pi/agents/harness/planning/plan-evaluator.md",
-			"sha256": "71660ab58bfcfdfae56c873140d4ea5946ae30cd5719c96afeabfd02b1d1f81d"
+			"sha256": "825f296c487d6aeacad5d320e155a3f23d0db6dea822fccc99a1305941a43da2"
 		},
-		"harness/planning/review-integrator": {
-			"path": ".pi/agents/harness/planning/review-integrator.md",
-			"sha256": "cf3f0dbe81274ec9ef0ff2e0c170e8dc929b20be65492d0ee9a80d985acf6d71"
+		"harness/planning/plan-synthesizer": {
+			"path": ".pi/agents/harness/planning/plan-synthesizer.md",
+			"sha256": "5bc3ec109179790c196df1328d362c1485cd5ff9295c31c3de93c050330295da"
 		},
-		"harness/planning/scout-graphify": {
-			"path": ".pi/agents/harness/planning/scout-graphify.md",
-			"sha256": "6e2bda8ad38311810c9916d9dab311873bc776e4b8832bb0e574136e45e1255e"
+		"harness/planning/planning-context": {
+			"path": ".pi/agents/harness/planning/planning-context.md",
+			"sha256": "96a51d1f2daafc9eaa8869a06ede9d04fc9e19076d58a81041e346e4c81c8b08"
 		},
-		"harness/planning/scout-semantic": {
-			"path": ".pi/agents/harness/planning/scout-semantic.md",
-			"sha256": "416e518d8204a55b26dc53da1f750865c6f09ee2c7f343b41e7c08da3230c089"
-		},
-		"harness/planning/scout-structure": {
-			"path": ".pi/agents/harness/planning/scout-structure.md",
-			"sha256": "76c42a15cc74cf1de2cf861cb0146c865c205f69cce7b9605d41893b19600029"
+		"harness/planning/review-integrator": {
+			"path": ".pi/agents/harness/planning/review-integrator.md",
+			"sha256": "bba385463ca8833654cd0dc80f666344332293fe86d7420d2c36755a3f9e743a"
 		},
 		"harness/planning/sprint-contract-auditor": {
 			"path": ".pi/agents/harness/planning/sprint-contract-auditor.md",
-			"sha256": "12cb5e6b53dcc19ace62e8e4c152d96440717df53a182e76216dd2327410df4d"
+			"sha256": "2321298529f70d03798d23346231c4c43ad4b7490a43f291430ca65b3ef93757"
 		},
 		"harness/planning/stack-researcher": {
 			"path": ".pi/agents/harness/planning/stack-researcher.md",

package/.pi/harness/corpus/cron.example ADDED Viewed

@@ -0,0 +1,8 @@
+# Cron alternative (systemd timer is the tested path). Bounded, locked, explicit env, no overlap.
+# Edit UP_ROOT before installing with `crontab -e`.
+SHELL=/bin/sh
+PATH=/usr/local/bin:/usr/bin:/bin
+UP_ROOT=/home/USER/ai-projects/ultimate-pi
+HARNESS_GRAPHIFY_KB_LOG=/home/USER/.local/state/ultimate-pi/graphify-kb-updater.log
+30 8 * * * cd "$UP_ROOT" && /usr/bin/flock -n /tmp/graphify-kb-updater.lock /usr/bin/timeout 45m /usr/bin/env node .pi/scripts/graphify-kb-updater.mjs --apply --refresh-graph --pilot-report --max-promotions 25 >> "$HARNESS_GRAPHIFY_KB_LOG" 2>&1

package/.pi/harness/corpus/graphify-kb-updater.config.json ADDED Viewed

@@ -0,0 +1,214 @@
+{
+	"schema_version": "1.1.0",
+	"policy": "hybrid-allowlist-auto-promotion-with-conservative-staging",
+	"auto_promote_allowlist": true,
+	"source_taxonomy": {
+		"article": {
+			"category": "public_article_or_engineering_blog",
+			"risk_class": "low_to_medium",
+			"default_policy": "allowlist_auto_promote_when_approved"
+		},
+		"paper": {
+			"category": "research_paper_or_abstract_feed",
+			"risk_class": "medium",
+			"default_policy": "stage_until_rights_review"
+		},
+		"repo": {
+			"category": "public_repository_metadata",
+			"risk_class": "low_to_medium",
+			"default_policy": "allowlist_auto_promote_when_approved"
+		},
+		"release": {
+			"category": "public_repository_release_metadata",
+			"risk_class": "low_to_medium",
+			"default_policy": "allowlist_auto_promote_when_approved"
+		},
+		"book": {
+			"category": "book_or_longform_local_file",
+			"risk_class": "high",
+			"default_policy": "manual_approval_required"
+		},
+		"transcript": {
+			"category": "youtube_or_audio_transcript",
+			"risk_class": "high",
+			"default_policy": "manual_approval_required"
+		},
+		"youtube": {
+			"category": "youtube_candidate_or_video_reference",
+			"risk_class": "high",
+			"default_policy": "stage_metadata_only_until_approved"
+		}
+	},
+	"competitor_taxonomy": {
+		"ai_coding_agents": {
+			"description": "Coding-agent products, CLIs, IDE agents, and model-native coding surfaces.",
+			"keywords": [
+				"claude code",
+				"cursor",
+				"codex",
+				"aider",
+				"copilot",
+				"windsurf",
+				"zed",
+				"replit",
+				"devin"
+			]
+		},
+		"agentic_harnesses": {
+			"description": "Harnesses, orchestration frameworks, eval loops, task runners, and review gates.",
+			"keywords": [
+				"harness",
+				"orchestration",
+				"agent bus",
+				"eval",
+				"review gate",
+				"multi-agent",
+				"workflow"
+			]
+		},
+		"context_engineering": {
+			"description": "Context retrieval, compaction, memory, skills, MCP, and codebase indexing.",
+			"keywords": [
+				"context engineering",
+				"mcp",
+				"memory",
+				"retrieval",
+				"compaction",
+				"skills",
+				"knowledge graph"
+			]
+		}
+	},
+	"allowlist": [
+		{
+			"domain": "openai.com",
+			"approved": true,
+			"approved_by": "repo-policy",
+			"approved_at": "2026-05-23",
+			"allowed_source_classes": ["article"]
+		},
+		{
+			"domain": "anthropic.com",
+			"approved": true,
+			"approved_by": "repo-policy",
+			"approved_at": "2026-05-23",
+			"allowed_source_classes": ["article"]
+		},
+		{
+			"domain": "github.blog",
+			"approved": true,
+			"approved_by": "repo-policy",
+			"approved_at": "2026-05-23",
+			"allowed_source_classes": ["article"]
+		},
+		{
+			"domain": "martinfowler.com",
+			"approved": true,
+			"approved_by": "repo-policy",
+			"approved_at": "2026-05-23",
+			"allowed_source_classes": ["article"]
+		},
+		{
+			"domain": "addyosmani.com",
+			"approved": true,
+			"approved_by": "repo-policy",
+			"approved_at": "2026-05-23",
+			"allowed_source_classes": ["article"]
+		},
+		{
+			"domain": "arxiv.org",
+			"approved": false,
+			"approved_by": "manual-review-required",
+			"approved_at": "manual-review-required",
+			"allowed_source_classes": ["paper"]
+		},
+		{
+			"domain": "github.com",
+			"approved": true,
+			"approved_by": "repo-policy",
+			"approved_at": "2026-05-23",
+			"allowed_source_classes": ["repo", "release"]
+		}
+	],
+	"article_queries": [
+		"agentic engineering harness engineering AI coding agents",
+		"AI coding harness evaluation orchestration context engineering"
+	],
+	"repo_sources": [
+		{
+			"title": "Graphify project repository metadata watch",
+			"url": "https://github.com/AI-App/Graphify",
+			"approved": false,
+			"rights_access": {
+				"license": "repository metadata only; source license requires review",
+				"access": "public repository metadata",
+				"approved_by": "manual-review-required",
+				"approved_at": "manual-review-required"
+			},
+			"provenance": {
+				"origin": "curated_repo_watchlist",
+				"locator": "https://github.com/AI-App/Graphify",
+				"notes": "Metadata candidate only until manually approved."
+			},
+			"competitor_labels": ["context_engineering"]
+		}
+	],
+	"release_feeds": [
+		{
+			"title": "OpenAI agents SDK release metadata watch",
+			"url": "https://github.com/openai/openai-agents-python/releases",
+			"approved": false,
+			"rights_access": {
+				"license": "release metadata only; linked artifacts require review",
+				"access": "public release metadata",
+				"approved_by": "manual-review-required",
+				"approved_at": "manual-review-required"
+			},
+			"provenance": {
+				"origin": "curated_release_watchlist",
+				"locator": "https://github.com/openai/openai-agents-python/releases",
+				"notes": "Release metadata candidate only until manually approved."
+			},
+			"competitor_labels": ["agentic_harnesses"]
+		}
+	],
+	"paper_feeds": [
+		{
+			"title": "arXiv software engineering agents search feed",
+			"url": "https://arxiv.org/search/cs?query=agentic+software+engineering&searchtype=all",
+			"rights_access": {
+				"license": "source-specific",
+				"access": "public abstract/feed only; paper text requires review",
+				"approved_by": "manual-review-required",
+				"approved_at": "manual-review-required"
+			},
+			"provenance": {
+				"origin": "curated_search_feed",
+				"locator": "https://arxiv.org/search/cs?query=agentic+software+engineering&searchtype=all",
+				"notes": "Feed metadata only; paper body requires approval."
+			}
+		}
+	],
+	"local_books": [
+		{
+			"path": "data/books",
+			"max_files": 75
+		}
+	],
+	"local_transcripts": [
+		{
+			"path": "data/youtube-transcripts",
+			"max_files": 100
+		}
+	],
+	"youtube_candidates": [
+		{
+			"title": "Review queue placeholder for agentic engineering YouTube talks",
+			"url": "https://www.youtube.com/results?search_query=agentic+engineering+harness+engineering",
+			"rights_access": null,
+			"approved": false,
+			"competitor_labels": ["agentic_harnesses"]
+		}
+	],
+	"review_queue": []
+}

package/.pi/harness/corpus/systemd/graphify-kb-updater.env.template ADDED Viewed

@@ -0,0 +1,4 @@
+# Copy to ~/.config/ultimate-pi/graphify-kb-updater.env and edit paths.
+UP_ROOT=/home/USER/ai-projects/ultimate-pi
+NODE_ENV=production
+GRAPHIFY_KB_ARGS=--apply --refresh-graph --pilot-report --max-promotions 25

package/.pi/harness/corpus/systemd/graphify-kb-updater.service ADDED Viewed

@@ -0,0 +1,17 @@
+[Unit]
+Description=Ultimate Pi Graphify knowledge-base updater
+Documentation=file:%h/ai-projects/ultimate-pi/.pi/harness/docs/graphify-kb-updater-runbook.md
+After=network-online.target
+Wants=network-online.target
+[Service]
+Type=oneshot
+EnvironmentFile=%h/.config/ultimate-pi/graphify-kb-updater.env
+WorkingDirectory=${UP_ROOT}
+ExecStart=/usr/bin/flock -n %t/graphify-kb-updater.lock /usr/bin/timeout 45m /usr/bin/env node .pi/scripts/graphify-kb-updater.mjs ${GRAPHIFY_KB_ARGS}
+StandardOutput=append:%h/.local/state/ultimate-pi/graphify-kb-updater.log
+StandardError=append:%h/.local/state/ultimate-pi/graphify-kb-updater.err
+TimeoutStartSec=50m
+Nice=10
+IOSchedulingClass=best-effort
+IOSchedulingPriority=7

package/.pi/harness/corpus/systemd/graphify-kb-updater.timer ADDED Viewed

@@ -0,0 +1,11 @@
+[Unit]
+Description=Run Ultimate Pi Graphify knowledge-base updater daily on a bounded schedule
+[Timer]
+OnCalendar=*-*-* 08:30:00
+RandomizedDelaySec=30m
+Persistent=true
+Unit=graphify-kb-updater.service
+[Install]
+WantedBy=timers.target

package/.pi/harness/docs/adrs/0001-harness-constitution.md CHANGED Viewed

@@ -13,7 +13,8 @@ ultimate-pi needs a stable governance model for agentic runs: plan-before-mutate
 2. Phases are `plan → execute → evaluate → adversary → merge` with policy-gate as the source of truth.
 3. Local JSONL under `.pi/harness/runs/` is the **source of truth** for run history; PostHog is for team dashboards.
 4. Context for harness paths uses **context-mode only** — never lean-ctx in harness skills or extensions.
-5. `@posthog/pi` remains the LLM analytics layer; harness domain events use `harness-telemetry.ts`.
+5. Context-mode execute tools (`ctx_execute`, `ctx_batch_execute`, `ctx_execute_file`) are subject to the same phase matrix as `bash`/`write` via policy-gate.
+6. `@posthog/pi` remains the LLM analytics layer; harness domain events use `harness-telemetry.ts`.
 ## Consequences

package/.pi/harness/docs/adrs/0006-sentrux-dual-layer.md CHANGED Viewed

@@ -5,15 +5,16 @@
 ## Context
-Evaluator trust requires both programmatic gates (policy, budget, integrity) and external observation signals (Sentrux MCP).
+Evaluator trust requires both programmatic gates (policy, budget, integrity) and **measured structural actuals** from the Sentrux CLI (Pi sessions use CLI only — no Sentrux MCP in harness).
 ## Decision
 1. **Rules file:** `.sentrux/rules.toml` synced from manifest — see [ADR 0009](0009-sentrux-rules-lifecycle.md).
-2. **CLI gate:** `node "$UP_PKG/.pi/scripts/harness-verify.mjs"` fails if `HARNESS_SENTRUX_REQUIRED=true` and no `harness-sentrux-signal` stub/file exists for the run (placeholder until MCP wired). Resolve `$UP_PKG` via [.pi/scripts/README.md](../../../scripts/README.md).
-3. **MCP layer (Q2+):** Evaluator sessions must record at least one Sentrux observation before `harness_eval_verdict` promotion when Sentrux is enabled.
-4. Observations flow through `observation-bus.ts` as `HarnessObservation` envelopes.
-5. PostHog event: `harness_sentrux_signal` with `signal_type` and `score` only — no secrets.
+2. **Run observation:** `/harness-run` writes `artifacts/sentrux-signal.yaml` and appends session custom entry `harness-sentrux-signal` after root-resolved Sentrux `check` + `gate` via `harness-sentrux-cli.mjs` (baseline from `gate --save` before execute). Raw `sentrux check .` / `gate .` must not be used from `.pi/harness/runs/*` because Sentrux resolves `.sentrux/rules.toml` against the path argument.
+3. **Verify gate:** `harness-verify.mjs` with `HARNESS_SENTRUX_REQUIRED=true` prefers `$HARNESS_RUN_DIR/artifacts/sentrux-signal.yaml`; falls back to `.pi/harness/evals/smoke/sentrux-stub.json` only when no run signal exists (CI smoke / pre-run verify).
+4. **Evaluator:** `harness/evaluator` in `benchmark` mode reads `sentrux-signal.yaml` and `benchmark-log.yaml` — metrics are inputs, not executor optimization targets.
+5. Observations flow through `observation-bus.ts` as `HarnessObservation` envelopes when wired.
+6. PostHog event: `harness_sentrux_signal` with `signal_type` and `score` only — no secrets.
 ## Consequences
@@ -23,9 +24,10 @@ Evaluator trust requires both programmatic gates (policy, budget, integrity) and
 ### Negative
-- Full MCP integration remains follow-up when Sentrux server is available.
+- Teams must run `/harness-run` (or write `sentrux-signal.yaml`) before promotion verify when stub fallback is insufficient.
 ## References
 - `.pi/harness/specs/observation.schema.json`
 - `.pi/scripts/harness-verify.mjs`
+- `.pi/scripts/harness-sentrux-cli.mjs`

package/.pi/harness/docs/adrs/0009-sentrux-rules-lifecycle.md CHANGED Viewed

@@ -20,7 +20,10 @@ Sentrux enforces architecture via [`.sentrux/rules.toml`](https://sentrux.dev/do
    - On `agent_end` when harness phase is `plan` or `merge`
    - `node "$UP_PKG/.pi/scripts/harness-verify.mjs"` fails if manifest hash ≠ last sync (`--check`)
 7. **Custom rules:** TOML outside the managed block is preserved on sync.
-8. **Skill:** `harness-sentrux-setup` documents bootstrap vs `--force`.
+8. **Skill:** `harness-sentrux-setup` documents bootstrap vs steward vs sync vs observation.
+9. **Intent evolution:** `harness/sentrux-steward` proposes JSON Merge Patches via `submit_sentrux_manifest_proposal` → `artifacts/sentrux-manifest-proposal.yaml`, with graphify-first evidence (`graphify-out/GRAPH_REPORT.md`, `graphify query` / `path` / `explain`). Chair applies manifest edits; never silent auto-merge.
+10. **Material changes:** `add_layer`, `add_boundary`, `split_layer` require `adr_required` + `ask_user` when `human_required`. `tune_constraint` may proceed with sentrux/graphify evidence only when chair agrees.
+11. **Observation vs intent:** `/harness-run` + `/harness-review` run CLI fitness functions; observation failures → replan/fix. Manifest changes → steward + ADR, not directory-tree guessing.
 ## Consequences
@@ -36,6 +39,8 @@ Sentrux enforces architecture via [`.sentrux/rules.toml`](https://sentrux.dev/do
 ## References
 - ADR 0006 (Sentrux dual layer)
+- `.pi/agents/harness/sentrux-steward.md`, `.pi/prompts/harness-sentrux-steward.md`
+- `.pi/harness/specs/sentrux-manifest-proposal.schema.json`, `sentrux-signal.schema.json`
 - `.pi/scripts/harness-sentrux-bootstrap.mjs`
 - `.pi/scripts/sentrux-rules-sync.mjs`
 - `.agents/skills/harness-sentrux-setup/SKILL.md`