ultimate-pi 0.1.0 → 0.1.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (509) hide show
  1. package/.agents/skills/ck-search/SKILL.md +99 -0
  2. package/.agents/skills/defuddle/SKILL.md +90 -0
  3. package/.agents/skills/find-skills/SKILL.md +142 -0
  4. package/.agents/skills/firecrawl/SKILL.md +150 -0
  5. package/.agents/skills/firecrawl/rules/install.md +82 -0
  6. package/.agents/skills/firecrawl/rules/security.md +26 -0
  7. package/.agents/skills/firecrawl-agent/SKILL.md +57 -0
  8. package/.agents/skills/firecrawl-build-interact/SKILL.md +67 -0
  9. package/.agents/skills/firecrawl-build-onboarding/SKILL.md +102 -0
  10. package/.agents/skills/firecrawl-build-onboarding/references/auth-flow.md +39 -0
  11. package/.agents/skills/firecrawl-build-onboarding/references/project-setup.md +20 -0
  12. package/.agents/skills/firecrawl-build-onboarding/references/sdk-installation.md +17 -0
  13. package/.agents/skills/firecrawl-build-scrape/SKILL.md +68 -0
  14. package/.agents/skills/firecrawl-build-search/SKILL.md +68 -0
  15. package/.agents/skills/firecrawl-crawl/SKILL.md +58 -0
  16. package/.agents/skills/firecrawl-download/SKILL.md +69 -0
  17. package/.agents/skills/firecrawl-interact/SKILL.md +83 -0
  18. package/.agents/skills/firecrawl-map/SKILL.md +50 -0
  19. package/.agents/skills/firecrawl-parse/SKILL.md +61 -0
  20. package/.agents/skills/firecrawl-scrape/SKILL.md +68 -0
  21. package/.agents/skills/firecrawl-search/SKILL.md +59 -0
  22. package/.agents/skills/obsidian-bases/SKILL.md +299 -0
  23. package/.agents/skills/obsidian-markdown/SKILL.md +237 -0
  24. package/.agents/skills/posthog-analyst/SKILL.md +306 -0
  25. package/.agents/skills/posthog-analyst/evals/evals.json +23 -0
  26. package/.agents/skills/wiki/SKILL.md +215 -0
  27. package/.agents/skills/wiki/references/css-snippets.md +122 -0
  28. package/.agents/skills/wiki/references/frontmatter.md +107 -0
  29. package/.agents/skills/wiki/references/git-setup.md +58 -0
  30. package/.agents/skills/wiki/references/mcp-setup.md +149 -0
  31. package/.agents/skills/wiki/references/modes.md +259 -0
  32. package/.agents/skills/wiki/references/plugins.md +96 -0
  33. package/.agents/skills/wiki/references/rest-api.md +124 -0
  34. package/.agents/skills/wiki-autoresearch/SKILL.md +211 -0
  35. package/.agents/skills/wiki-autoresearch/references/program.md +75 -0
  36. package/.agents/skills/wiki-fold/SKILL.md +204 -0
  37. package/.agents/skills/wiki-fold/references/fold-template.md +133 -0
  38. package/.agents/skills/wiki-ingest/SKILL.md +288 -0
  39. package/.agents/skills/wiki-lint/SKILL.md +183 -0
  40. package/.agents/skills/wiki-query/SKILL.md +176 -0
  41. package/.agents/skills/wiki-save/SKILL.md +128 -0
  42. package/.ckignore +41 -0
  43. package/.env.example +9 -0
  44. package/.github/banner-v2.png +0 -0
  45. package/.github/workflows/lint.yml +33 -0
  46. package/.github/workflows/publish-github-packages.yml +35 -0
  47. package/.github/workflows/publish-npm.yml +32 -0
  48. package/.pi/SYSTEM.md +107 -40
  49. package/.pi/agents/pi-pi/agent-expert.md +205 -0
  50. package/.pi/agents/pi-pi/cli-expert.md +47 -0
  51. package/.pi/agents/pi-pi/config-expert.md +67 -0
  52. package/.pi/agents/pi-pi/ext-expert.md +53 -0
  53. package/.pi/agents/pi-pi/keybinding-expert.md +123 -0
  54. package/.pi/agents/pi-pi/pi-orchestrator.md +103 -0
  55. package/.pi/agents/pi-pi/prompt-expert.md +83 -0
  56. package/.pi/agents/pi-pi/skill-expert.md +52 -0
  57. package/.pi/agents/pi-pi/theme-expert.md +46 -0
  58. package/.pi/agents/pi-pi/tui-expert.md +100 -0
  59. package/.pi/agents/rethink.md +140 -0
  60. package/.pi/agents/wiki-ingest.md +67 -0
  61. package/.pi/agents/wiki-lint.md +75 -0
  62. package/.pi/auto-commit.json +20 -0
  63. package/.pi/extensions/banner.png +0 -0
  64. package/.pi/extensions/ck-enforce.ts +216 -0
  65. package/.pi/extensions/custom-footer.ts +308 -0
  66. package/.pi/extensions/custom-header.ts +116 -0
  67. package/.pi/extensions/dotenv-loader.ts +170 -0
  68. package/.pi/internal/cursor-sdk-transcript-parser.ts +59 -0
  69. package/.pi/model-router.json +95 -0
  70. package/.pi/npm/.gitignore +2 -0
  71. package/.pi/prompts/git-sync.md +124 -0
  72. package/.pi/prompts/harness-setup.md +509 -0
  73. package/.pi/prompts/save.md +16 -0
  74. package/.pi/prompts/wiki-autoresearch.md +19 -0
  75. package/.pi/prompts/wiki.md +23 -0
  76. package/.pi/providers/cursor-sdk-provider.test.mjs +476 -0
  77. package/.pi/providers/cursor-sdk-provider.ts +1085 -0
  78. package/.pi/settings.json +14 -4
  79. package/.pi/skills/agent-router/SKILL.md +174 -0
  80. package/.pi/sounds/alert/1-kaching-track.mp3 +0 -0
  81. package/.pi/sounds/error/1-ksi-wth-track.mp3 +0 -0
  82. package/.pi/sounds/error/2-smash-track.mp3 +0 -0
  83. package/.pi/sounds/error/3-buzzer-track.mp3 +0 -0
  84. package/.pi/sounds/notification/1-soft-notification-track.mp3 +0 -0
  85. package/.pi/sounds/project-sounds.json +25 -0
  86. package/.pi/sounds/reminder/1-soft-notification-track.mp3 +0 -0
  87. package/.pi/sounds/success/1-tada-track.mp3 +0 -0
  88. package/.pi/sounds/success/2-jobs-done-track.mp3 +0 -0
  89. package/.pi/sounds/success/3-yay-track.mp3 +0 -0
  90. package/CONTRIBUTING.md +116 -0
  91. package/README.md +33 -40
  92. package/biome.json +34 -0
  93. package/firecrawl/.env.template +58 -0
  94. package/firecrawl/README.md +49 -0
  95. package/firecrawl/docker-compose.yaml +201 -0
  96. package/firecrawl/searxng/searxng.env +3 -0
  97. package/firecrawl/searxng/settings.yml +85 -0
  98. package/lefthook.yml +8 -0
  99. package/package.json +55 -16
  100. package/vault/AGENTS.md +37 -0
  101. package/vault/wiki/_templates/comparison.md +39 -0
  102. package/vault/wiki/_templates/concept.md +40 -0
  103. package/vault/wiki/_templates/decision.md +21 -0
  104. package/vault/wiki/_templates/entity.md +32 -0
  105. package/vault/wiki/_templates/flow.md +14 -0
  106. package/vault/wiki/_templates/module.md +18 -0
  107. package/vault/wiki/_templates/question.md +31 -0
  108. package/vault/wiki/_templates/source.md +39 -0
  109. package/vault/wiki/concepts/AST-Aware Code Chunking.md +44 -0
  110. package/vault/wiki/concepts/Build-Time Prompt Compilation.md +107 -0
  111. package/vault/wiki/concepts/Context Engine (AI Coding).md +47 -0
  112. package/vault/wiki/concepts/Context-Aware System Reminders.md +61 -0
  113. package/vault/wiki/concepts/Contextualized Text Embedding.md +42 -0
  114. package/vault/wiki/concepts/Contractor vs Employee AI Model.md +55 -0
  115. package/vault/wiki/concepts/Dual-Model Agent Architecture.md +65 -0
  116. package/vault/wiki/concepts/Late Chunking vs Early Chunking.md +43 -0
  117. package/vault/wiki/concepts/Majority Vote Ensembling.md +68 -0
  118. package/vault/wiki/concepts/Meta-Harness.md +16 -0
  119. package/vault/wiki/concepts/Multi-Agent AI Coding Architecture.md +75 -0
  120. package/vault/wiki/concepts/Prompt Enhancement.md +90 -0
  121. package/vault/wiki/concepts/Prompt Renderer.md +89 -0
  122. package/vault/wiki/concepts/Semantic Codebase Indexing.md +67 -0
  123. package/vault/wiki/concepts/additive-config-hierarchy.md +16 -0
  124. package/vault/wiki/concepts/agent-artifacts-verifiable-deliverables.md +71 -0
  125. package/vault/wiki/concepts/agent-browser-browser-automation.md +99 -0
  126. package/vault/wiki/concepts/agent-codebase-interface.md +43 -0
  127. package/vault/wiki/concepts/agent-harness-architecture.md +67 -0
  128. package/vault/wiki/concepts/agent-loop-detection-patterns.md +133 -0
  129. package/vault/wiki/concepts/agent-search-enforcement.md +126 -0
  130. package/vault/wiki/concepts/agent-skills-ecosystem.md +74 -0
  131. package/vault/wiki/concepts/agent-skills-pattern.md +68 -0
  132. package/vault/wiki/concepts/agentic-harness-context-enforcement.md +91 -0
  133. package/vault/wiki/concepts/agentic-harness.md +34 -0
  134. package/vault/wiki/concepts/agentic-orchestration-pipeline.md +56 -0
  135. package/vault/wiki/concepts/agentic-search-no-embeddings.md +18 -0
  136. package/vault/wiki/concepts/anthropic-context-engineering.md +13 -0
  137. package/vault/wiki/concepts/antigravity-agent-first-architecture.md +61 -0
  138. package/vault/wiki/concepts/ast-compression.md +19 -0
  139. package/vault/wiki/concepts/ast-truncation.md +66 -0
  140. package/vault/wiki/concepts/barrel-files.md +37 -0
  141. package/vault/wiki/concepts/browser-harness-agent.md +41 -0
  142. package/vault/wiki/concepts/browser-subagent-visual-verification.md +82 -0
  143. package/vault/wiki/concepts/codebase-intelligence-ecosystem-comparison.md +192 -0
  144. package/vault/wiki/concepts/codebase-intelligence-harness-integration.md +161 -0
  145. package/vault/wiki/concepts/codebase-to-context-ingestion.md +46 -0
  146. package/vault/wiki/concepts/codex-harness-innovations.md +147 -0
  147. package/vault/wiki/concepts/consensus-debate-flow.md +17 -0
  148. package/vault/wiki/concepts/consensus-debate.md +206 -0
  149. package/vault/wiki/concepts/content-addressed-spec-identity.md +166 -0
  150. package/vault/wiki/concepts/context-anxiety.md +57 -0
  151. package/vault/wiki/concepts/context-compression-techniques.md +19 -0
  152. package/vault/wiki/concepts/context-continuity.md +22 -0
  153. package/vault/wiki/concepts/context-drift-in-agents.md +106 -0
  154. package/vault/wiki/concepts/context-engineering.md +62 -0
  155. package/vault/wiki/concepts/context-folding.md +67 -0
  156. package/vault/wiki/concepts/context-mode.md +38 -0
  157. package/vault/wiki/concepts/cursor-harness-innovations.md +107 -0
  158. package/vault/wiki/concepts/deterministic-session-compaction.md +79 -0
  159. package/vault/wiki/concepts/drift-detection-unified.md +296 -0
  160. package/vault/wiki/concepts/execution-feedback-loop.md +46 -0
  161. package/vault/wiki/concepts/feedforward-feedback-harness.md +60 -0
  162. package/vault/wiki/concepts/five-root-cause-metrics-sentrux.md +40 -0
  163. package/vault/wiki/concepts/fork-safe-spec-storage.md +89 -0
  164. package/vault/wiki/concepts/fts5-sandbox.md +19 -0
  165. package/vault/wiki/concepts/fuzzy-edit-matching.md +71 -0
  166. package/vault/wiki/concepts/gemini-cli-architecture.md +104 -0
  167. package/vault/wiki/concepts/generator-evaluator-architecture.md +64 -0
  168. package/vault/wiki/concepts/guardian-agent-pattern.md +67 -0
  169. package/vault/wiki/concepts/harness-configuration-layers.md +89 -0
  170. package/vault/wiki/concepts/harness-control-frameworks.md +155 -0
  171. package/vault/wiki/concepts/harness-engineering-first-principles.md +90 -0
  172. package/vault/wiki/concepts/harness-h-formalism.md +53 -0
  173. package/vault/wiki/concepts/hybrid-code-search.md +61 -0
  174. package/vault/wiki/concepts/inline-post-edit-validation.md +112 -0
  175. package/vault/wiki/concepts/legendary-engineering-patterns-harness.md +110 -0
  176. package/vault/wiki/concepts/lifecycle-hooks.md +94 -0
  177. package/vault/wiki/concepts/mcp-tool-routing.md +102 -0
  178. package/vault/wiki/concepts/memory-system-of-record-vs-ephemeral-cache.md +47 -0
  179. package/vault/wiki/concepts/meta-agent-context-pruning.md +151 -0
  180. package/vault/wiki/concepts/model-adaptive-harness.md +122 -0
  181. package/vault/wiki/concepts/model-routing-agents.md +101 -0
  182. package/vault/wiki/concepts/monorepo-architecture.md +45 -0
  183. package/vault/wiki/concepts/multi-agent-specialization.md +61 -0
  184. package/vault/wiki/concepts/permission-subsystem.md +16 -0
  185. package/vault/wiki/concepts/pi-messenger-analysis.md +243 -0
  186. package/vault/wiki/concepts/pi-vscode-extension-landscape.md +37 -0
  187. package/vault/wiki/concepts/policy-engine-pattern.md +78 -0
  188. package/vault/wiki/concepts/progressive-disclosure-agents.md +53 -0
  189. package/vault/wiki/concepts/progressive-skill-disclosure.md +17 -0
  190. package/vault/wiki/concepts/provider-native-prompting.md +203 -0
  191. package/vault/wiki/concepts/quality-signal-sentrux.md +37 -0
  192. package/vault/wiki/concepts/repo-map-ranking.md +42 -0
  193. package/vault/wiki/concepts/result-monad-error-handling.md +47 -0
  194. package/vault/wiki/concepts/safety-defense-in-depth.md +83 -0
  195. package/vault/wiki/concepts/sandbox-os-enforcement.md +18 -0
  196. package/vault/wiki/concepts/selective-debate-routing.md +70 -0
  197. package/vault/wiki/concepts/self-evolving-harness.md +60 -0
  198. package/vault/wiki/concepts/sentrux-mcp-integration.md +36 -0
  199. package/vault/wiki/concepts/sentrux-rules-engine.md +49 -0
  200. package/vault/wiki/concepts/shell-pattern-compression.md +24 -0
  201. package/vault/wiki/concepts/skill-first-architecture.md +166 -0
  202. package/vault/wiki/concepts/structured-compaction.md +78 -0
  203. package/vault/wiki/concepts/subagent-orchestration.md +17 -0
  204. package/vault/wiki/concepts/subagent-worktree-isolation.md +68 -0
  205. package/vault/wiki/concepts/superpowers-methodology.md +78 -0
  206. package/vault/wiki/concepts/think-in-code.md +73 -0
  207. package/vault/wiki/concepts/ts-execution-layer.md +100 -0
  208. package/vault/wiki/concepts/typescript-strict-mode.md +37 -0
  209. package/vault/wiki/concepts/vcc-conversation-compaction-for-pi.md +51 -0
  210. package/vault/wiki/concepts/verification-drift-detection.md +19 -0
  211. package/vault/wiki/consensus/consensus-records.md +58 -0
  212. package/vault/wiki/decisions/2026-04-30-pi-lean-ctx-native.md +122 -0
  213. package/vault/wiki/decisions/adr-008.md +40 -0
  214. package/vault/wiki/decisions/adr-009.md +46 -0
  215. package/vault/wiki/decisions/adr-010.md +55 -0
  216. package/vault/wiki/decisions/adr-011.md +165 -0
  217. package/vault/wiki/decisions/adr-012.md +102 -0
  218. package/vault/wiki/decisions/adr-013.md +59 -0
  219. package/vault/wiki/decisions/adr-014.md +73 -0
  220. package/vault/wiki/decisions/adr-015.md +81 -0
  221. package/vault/wiki/decisions/adr-016.md +91 -0
  222. package/vault/wiki/decisions/adr-017.md +79 -0
  223. package/vault/wiki/decisions/adr-018.md +100 -0
  224. package/vault/wiki/decisions/adr-019.md +75 -0
  225. package/vault/wiki/decisions/adr-020.md +106 -0
  226. package/vault/wiki/decisions/adr-021.md +86 -0
  227. package/vault/wiki/decisions/adr-022.md +113 -0
  228. package/vault/wiki/decisions/adr-023.md +113 -0
  229. package/vault/wiki/decisions/adr-024.md +73 -0
  230. package/vault/wiki/decisions/adr-025.md +130 -0
  231. package/vault/wiki/decisions/adr-026.md +56 -0
  232. package/vault/wiki/decisions/colocate-wiki.md +34 -0
  233. package/vault/wiki/entities/Anders Hejlsberg.md +29 -0
  234. package/vault/wiki/entities/Anthropic.md +17 -0
  235. package/vault/wiki/entities/Augment Code.md +49 -0
  236. package/vault/wiki/entities/Bjarne Stroustrup.md +26 -0
  237. package/vault/wiki/entities/Bolt.new (StackBlitz).md +39 -0
  238. package/vault/wiki/entities/Boris Cherny.md +11 -0
  239. package/vault/wiki/entities/Claude Code.md +19 -0
  240. package/vault/wiki/entities/Dennis Ritchie.md +26 -0
  241. package/vault/wiki/entities/Emergent Labs.md +32 -0
  242. package/vault/wiki/entities/Google Cloud.md +16 -0
  243. package/vault/wiki/entities/Guido van Rossum.md +28 -0
  244. package/vault/wiki/entities/Ken Thompson.md +28 -0
  245. package/vault/wiki/entities/Lee et al.md +16 -0
  246. package/vault/wiki/entities/Linus Torvalds.md +28 -0
  247. package/vault/wiki/entities/Lovable (company).md +40 -0
  248. package/vault/wiki/entities/Martin Fowler.md +16 -0
  249. package/vault/wiki/entities/Meng et al.md +16 -0
  250. package/vault/wiki/entities/OpenAI.md +16 -0
  251. package/vault/wiki/entities/Rocket.new.md +38 -0
  252. package/vault/wiki/entities/VILA-Lab.md +15 -0
  253. package/vault/wiki/entities/autodev-codebase.md +18 -0
  254. package/vault/wiki/entities/ck-tool.md +59 -0
  255. package/vault/wiki/entities/codesearch.md +18 -0
  256. package/vault/wiki/entities/disler-indydevdan.md +33 -0
  257. package/vault/wiki/entities/gsd-get-shit-done.md +56 -0
  258. package/vault/wiki/entities/javascript-runtimes.md +48 -0
  259. package/vault/wiki/entities/jesse-vincent.md +38 -0
  260. package/vault/wiki/entities/lean-ctx.md +32 -0
  261. package/vault/wiki/entities/opendev.md +41 -0
  262. package/vault/wiki/entities/ops-codegraph-tool.md +18 -0
  263. package/vault/wiki/entities/pi-coding-agent.md +53 -0
  264. package/vault/wiki/entities/sentrux.md +54 -0
  265. package/vault/wiki/entities/vgrep-tool.md +57 -0
  266. package/vault/wiki/entities/vitest.md +41 -0
  267. package/vault/wiki/flows/harness-wiki-pipeline.md +204 -0
  268. package/vault/wiki/hot.md +932 -0
  269. package/vault/wiki/index.md +437 -0
  270. package/vault/wiki/log.md +418 -0
  271. package/vault/wiki/meta/dashboard.md +30 -0
  272. package/vault/wiki/meta/lint-report-2026-04-30.md +86 -0
  273. package/vault/wiki/meta/lint-report-2026-05-02.md +251 -0
  274. package/vault/wiki/meta/overview.canvas +43 -0
  275. package/vault/wiki/modules/adversarial-verification.md +57 -0
  276. package/vault/wiki/modules/automated-observability.md +54 -0
  277. package/vault/wiki/modules/bench.md +20 -0
  278. package/vault/wiki/modules/extensions.md +23 -0
  279. package/vault/wiki/modules/grounding-checkpoints.md +62 -0
  280. package/vault/wiki/modules/harness-implementation-plan.md +345 -0
  281. package/vault/wiki/modules/harness-wiki-skill-mapping.md +135 -0
  282. package/vault/wiki/modules/harness.md +86 -0
  283. package/vault/wiki/modules/persistent-memory.md +85 -0
  284. package/vault/wiki/modules/schema-orchestration.md +68 -0
  285. package/vault/wiki/modules/skills.md +27 -0
  286. package/vault/wiki/modules/spec-hardening.md +58 -0
  287. package/vault/wiki/modules/structured-planning.md +53 -0
  288. package/vault/wiki/modules/think-in-code-enforcement.md +153 -0
  289. package/vault/wiki/modules/wiki-query-interface.md +64 -0
  290. package/vault/wiki/overview.md +51 -0
  291. package/vault/wiki/questions/Research-pi-vs-claude-code-agentic-orchestration-pipeline.md +87 -0
  292. package/vault/wiki/questions/Research-sentrux-dev.md +123 -0
  293. package/vault/wiki/questions/Research-superpowers-skill-for-agentic-coding-agents.md +164 -0
  294. package/vault/wiki/questions/Research: Augment Code Context Engine.md +244 -0
  295. package/vault/wiki/questions/Research: Automating Software Engineering - Lovable, Bolt, Emergent, Rocket.md +112 -0
  296. package/vault/wiki/questions/Research: Claude Code State-of-the-Art Harness Improvements.md +209 -0
  297. package/vault/wiki/questions/Research: Codex State-of-the-Art Harness Improvements.md +99 -0
  298. package/vault/wiki/questions/Research: Engineering Workflows of Legendary Programmers and AI Harness Mapping.md +107 -0
  299. package/vault/wiki/questions/Research: Fallow Codebase Intelligence Harness Integration.md +72 -0
  300. package/vault/wiki/questions/Research: Gemini CLI SOTA Harness Integration.md +166 -0
  301. package/vault/wiki/questions/Research: GitHub Issues as Harness Spec Storage.md +188 -0
  302. package/vault/wiki/questions/Research: Google Antigravity Harness Integration.md +120 -0
  303. package/vault/wiki/questions/Research: Meta-Agent Context Drift Detection.md +236 -0
  304. package/vault/wiki/questions/Research: Model-Adaptive Agent Harness Design.md +95 -0
  305. package/vault/wiki/questions/Research: Model-Specific Prompting Guides.md +165 -0
  306. package/vault/wiki/questions/Research: Prompt Renderer for Multi-Model Agent Harness.md +216 -0
  307. package/vault/wiki/questions/Research: Skill-First Harness Architecture.md +91 -0
  308. package/vault/wiki/questions/Research: TypeScript Best Practices and Codebase Structure.md +88 -0
  309. package/vault/wiki/questions/Research: TypeScript Execution Layer for Agent Tool Calling.md +81 -0
  310. package/vault/wiki/questions/Research: claude-mem over Obsidian for Harness Layer.md +71 -0
  311. package/vault/wiki/questions/Research: claude-mem over obsidian wiki as the knowledge base for our agentic harness pipeline. think from first principles. does this replace or complement our current setup? no hard feelings about previous decisions. gimme accurate points.md +80 -0
  312. package/vault/wiki/questions/Research: context-mode vs lean-ctx.md +72 -0
  313. package/vault/wiki/questions/Research: cursor.sh Harness Innovations.md +92 -0
  314. package/vault/wiki/questions/Research: executor.sh Harness Integration.md +170 -0
  315. package/vault/wiki/questions/Research: how GSD fits into our coding harness setup.md +97 -0
  316. package/vault/wiki/questions/Research: how claude-mem fits into our workflow. and whether it should replace obsidian in the codebase. no hard feelings about previous actions, rethink from first principles always.md +80 -0
  317. package/vault/wiki/questions/Research: pi-vcc.md +113 -0
  318. package/vault/wiki/questions/Research: semantic code search tools.md +69 -0
  319. package/vault/wiki/questions/Research: vcc extension for pi coding agent.md +73 -0
  320. package/vault/wiki/questions/how-to-enable-semantic-code-search-now.md +111 -0
  321. package/vault/wiki/questions/mvp-implementation-blueprint.md +552 -0
  322. package/vault/wiki/questions/research-agent-first-codebase-exploration.md +199 -0
  323. package/vault/wiki/questions/research-agentic-coding-harness-latest-papers.md +142 -0
  324. package/vault/wiki/questions/research-gitingest-gitreverse-integration.md +100 -0
  325. package/vault/wiki/questions/research-wozcode-token-reduction.md +67 -0
  326. package/vault/wiki/questions/resolved-context-pruning-inplace-vs-restart.md +95 -0
  327. package/vault/wiki/questions/resolved-context-window-economics.md +167 -0
  328. package/vault/wiki/questions/resolved-imad-debate-gating-transfer.md +126 -0
  329. package/vault/wiki/questions/resolved-mcp-tool-preference.md +112 -0
  330. package/vault/wiki/questions/resolved-small-model-meta-agents.md +107 -0
  331. package/vault/wiki/questions/resolved-treesitter-dynamic-languages.md +95 -0
  332. package/vault/wiki/sources/Auggie Context MCP Server.md +63 -0
  333. package/vault/wiki/sources/Augment Code Codacy AI Giants.md +61 -0
  334. package/vault/wiki/sources/Augment Code MCP SiliconAngle.md +49 -0
  335. package/vault/wiki/sources/Augment Code WorkOS ERC 2025.md +55 -0
  336. package/vault/wiki/sources/Augment Context Engine Official.md +71 -0
  337. package/vault/wiki/sources/Augment SWE-bench Agent GitHub.md +74 -0
  338. package/vault/wiki/sources/Augment SWE-bench Pro Blog.md +58 -0
  339. package/vault/wiki/sources/Source: AgentBus Jinja2 Prompt Pipelines.md +75 -0
  340. package/vault/wiki/sources/Source: Arxiv /342/200/224 Don't Break the Cache.md" +85 -0
  341. package/vault/wiki/sources/Source: Augment - Harness Engineering for AI Coding Agents.md +58 -0
  342. package/vault/wiki/sources/Source: Blake Crosley Agent Architecture Guide.md +100 -0
  343. package/vault/wiki/sources/Source: Bolt.new Architecture & Case Study.md +75 -0
  344. package/vault/wiki/sources/Source: Build-Time Prompt Compilation Architecture.md +107 -0
  345. package/vault/wiki/sources/Source: Claude API Agent Skills Overview.md +70 -0
  346. package/vault/wiki/sources/Source: Gemini CLI Changelogs.md +88 -0
  347. package/vault/wiki/sources/Source: Google Blog - Gemini CLI Announcement.md +57 -0
  348. package/vault/wiki/sources/Source: Google Gemini CLI Architecture Docs.md +53 -0
  349. package/vault/wiki/sources/Source: LangChain - Anatomy of Agent Harness.md +65 -0
  350. package/vault/wiki/sources/Source: Lovable Architecture & Clone Analysis.md +83 -0
  351. package/vault/wiki/sources/Source: Martin Fowler - Harness Engineering.md +70 -0
  352. package/vault/wiki/sources/Source: OpenAI Harness Engineering Five Principles.md +58 -0
  353. package/vault/wiki/sources/Source: OpenAI Harness Engineering /342/200/224 0 Lines of Human Code.md" +101 -0
  354. package/vault/wiki/sources/Source: OpenDev /342/200/224 Building AI Coding Agents for the Terminal.md" +100 -0
  355. package/vault/wiki/sources/Source: Render AI Coding Agents Benchmark 2025.md +53 -0
  356. package/vault/wiki/sources/Source: Rocket.new /342/200/224 Vibe Solutioning Platform.md" +70 -0
  357. package/vault/wiki/sources/Source: SwirlAI Agent Skills Progressive Disclosure.md +71 -0
  358. package/vault/wiki/sources/Source: TianPan Prompt Caching Architecture.md +89 -0
  359. package/vault/wiki/sources/Source: Vercel Labs agent-browser.md +155 -0
  360. package/vault/wiki/sources/Source: browser-harness CDP Harness.md +126 -0
  361. package/vault/wiki/sources/agent-drift-academic-paper.md +79 -0
  362. package/vault/wiki/sources/aider-repomap-tree-sitter.md +42 -0
  363. package/vault/wiki/sources/anthropic-compaction-api.md +58 -0
  364. package/vault/wiki/sources/anthropic-effective-harnesses.md +42 -0
  365. package/vault/wiki/sources/anthropic-prompt-best-practices.md +100 -0
  366. package/vault/wiki/sources/anthropic2026-harness-design.md +63 -0
  367. package/vault/wiki/sources/barrel-files-tkdodo.md +38 -0
  368. package/vault/wiki/sources/birth-of-unix-kernighan-interview.md +57 -0
  369. package/vault/wiki/sources/bockeler2026-harness-engineering.md +69 -0
  370. package/vault/wiki/sources/cast-code-chunking-paper.md +50 -0
  371. package/vault/wiki/sources/ck-semantic-search.md +78 -0
  372. package/vault/wiki/sources/claude-code-architecture-karaxai-2026.md +71 -0
  373. package/vault/wiki/sources/claude-code-architecture-qubytes-2026.md +50 -0
  374. package/vault/wiki/sources/claude-code-architecture-vila-lab-2026.md +64 -0
  375. package/vault/wiki/sources/claude-code-security-architecture-penligent-2026.md +70 -0
  376. package/vault/wiki/sources/claude-context-editing-docs.md +13 -0
  377. package/vault/wiki/sources/cloudflare-codemode.md +63 -0
  378. package/vault/wiki/sources/code-chunk-library-supermemory.md +63 -0
  379. package/vault/wiki/sources/codeact-apple-2024.md +62 -0
  380. package/vault/wiki/sources/codex-dsc-rfc-8573.md +41 -0
  381. package/vault/wiki/sources/codex-open-source-agent-2026.md +110 -0
  382. package/vault/wiki/sources/coir-code-retrieval-benchmark.md +51 -0
  383. package/vault/wiki/sources/colinmcnamara-context-optimization-codemode.md +48 -0
  384. package/vault/wiki/sources/context-folding-paper.md +61 -0
  385. package/vault/wiki/sources/context-mode-website.md +63 -0
  386. package/vault/wiki/sources/cursor-agent-best-practices-2026.md +62 -0
  387. package/vault/wiki/sources/cursor-fork-29b-2025.md +50 -0
  388. package/vault/wiki/sources/cursor-harness-april-2026.md +76 -0
  389. package/vault/wiki/sources/cursor-instant-apply-2024.md +45 -0
  390. package/vault/wiki/sources/cursor-shadow-workspace-2024.md +52 -0
  391. package/vault/wiki/sources/cursor-shipped-coding-agent-2026.md +53 -0
  392. package/vault/wiki/sources/cursor-vs-antigravity-2026.md +51 -0
  393. package/vault/wiki/sources/disler-pi-vs-claude-code.md +69 -0
  394. package/vault/wiki/sources/distill-deterministic-context-compression.md +53 -0
  395. package/vault/wiki/sources/embedding-models-benchmark-supermemory-2025.md +48 -0
  396. package/vault/wiki/sources/executor-rhyssullivan.md +122 -0
  397. package/vault/wiki/sources/fallow-rs-codebase-intelligence.md +125 -0
  398. package/vault/wiki/sources/fan2025-imad.md +60 -0
  399. package/vault/wiki/sources/forgecode-gpt5-agent-improvements.md +63 -0
  400. package/vault/wiki/sources/gemini-3-prompting-guide.md +78 -0
  401. package/vault/wiki/sources/gh-cli-sub-issue-rfc.md +50 -0
  402. package/vault/wiki/sources/gh-sub-issue-extension.md +72 -0
  403. package/vault/wiki/sources/github-fork-issues-discussion.md +44 -0
  404. package/vault/wiki/sources/github-issue-dependencies-docs.md +49 -0
  405. package/vault/wiki/sources/github-sub-issues-docs.md +51 -0
  406. package/vault/wiki/sources/gitingest.md +91 -0
  407. package/vault/wiki/sources/gitreverse.md +63 -0
  408. package/vault/wiki/sources/google-antigravity-official-blog.md +47 -0
  409. package/vault/wiki/sources/google-antigravity-wikipedia.md +53 -0
  410. package/vault/wiki/sources/gsd-codecentric-deep-dive.md +57 -0
  411. package/vault/wiki/sources/gsd-github-repo.md +51 -0
  412. package/vault/wiki/sources/gsd-hn-discussion.md +59 -0
  413. package/vault/wiki/sources/guido-python-design-philosophy.md +56 -0
  414. package/vault/wiki/sources/hejlsberg-7-learnings.md +48 -0
  415. package/vault/wiki/sources/ironclaw-drift-monitor.md +80 -0
  416. package/vault/wiki/sources/langsight-loop-detection.md +80 -0
  417. package/vault/wiki/sources/leanctx-website.md +69 -0
  418. package/vault/wiki/sources/lee2026-meta-harness.md +59 -0
  419. package/vault/wiki/sources/linux-kernel-coding-workflow.md +50 -0
  420. package/vault/wiki/sources/lou2026-autoharness.md +53 -0
  421. package/vault/wiki/sources/martin-fowler-harness-engineering.md +73 -0
  422. package/vault/wiki/sources/mcp-architecture-docs.md +13 -0
  423. package/vault/wiki/sources/meng2026-agent-harness-survey.md +79 -0
  424. package/vault/wiki/sources/mindstudio-four-agent-types.md +68 -0
  425. package/vault/wiki/sources/ms-chat-history-management.md +13 -0
  426. package/vault/wiki/sources/openai-prompt-guidance.md +104 -0
  427. package/vault/wiki/sources/openclaw-session-pruning.md +13 -0
  428. package/vault/wiki/sources/opencode-dcp.md +13 -0
  429. package/vault/wiki/sources/opendev-arxiv-2603.05344v1.md +79 -0
  430. package/vault/wiki/sources/openhands-platform.md +39 -0
  431. package/vault/wiki/sources/oss-guide-codebase-exploration.md +53 -0
  432. package/vault/wiki/sources/pi-compaction-extensions-ecosystem.md +102 -0
  433. package/vault/wiki/sources/pi-context-prune-github-repo.md +38 -0
  434. package/vault/wiki/sources/pi-mono-compaction-docs.md +38 -0
  435. package/vault/wiki/sources/pi-omni-compact-github-repo.md +50 -0
  436. package/vault/wiki/sources/pi-rtk-optimizer-github-repo.md +45 -0
  437. package/vault/wiki/sources/pi-vcc-github-repo.md +69 -0
  438. package/vault/wiki/sources/pi-vscode-marketplace.md +41 -0
  439. package/vault/wiki/sources/pi-vscode-model-provider-marketplace.md +39 -0
  440. package/vault/wiki/sources/py-tree-sitter.md +13 -0
  441. package/vault/wiki/sources/sentrux-dev-landing.md +40 -0
  442. package/vault/wiki/sources/sentrux-docs-pro-architecture.md +75 -0
  443. package/vault/wiki/sources/sentrux-docs-quality-signal.md +46 -0
  444. package/vault/wiki/sources/sentrux-docs-root-cause-metrics.md +57 -0
  445. package/vault/wiki/sources/sentrux-docs-rules-engine.md +58 -0
  446. package/vault/wiki/sources/sentrux-github-repo.md +56 -0
  447. package/vault/wiki/sources/superpowers-github-repo.md +56 -0
  448. package/vault/wiki/sources/superpowers-release-blog.md +54 -0
  449. package/vault/wiki/sources/superpowers-termdock-analysis.md +45 -0
  450. package/vault/wiki/sources/swe-agent-aci.md +42 -0
  451. package/vault/wiki/sources/swe-bench.md +45 -0
  452. package/vault/wiki/sources/swe-pruner-context-pruning.md +13 -0
  453. package/vault/wiki/sources/think-in-code-blog.md +48 -0
  454. package/vault/wiki/sources/tree-sitter-docs.md +13 -0
  455. package/vault/wiki/sources/ts-best-practices-2025-devto.md +42 -0
  456. package/vault/wiki/sources/ts-folder-structure-mingyang.md +58 -0
  457. package/vault/wiki/sources/ts-monorepo-koerselman.md +44 -0
  458. package/vault/wiki/sources/ts-result-error-handling-kkalamarski.md +52 -0
  459. package/vault/wiki/sources/ts-runtimes-comparison-betterstack.md +42 -0
  460. package/vault/wiki/sources/ts-strict-mode-rishikc.md +43 -0
  461. package/vault/wiki/sources/unix-philosophy.md +48 -0
  462. package/vault/wiki/sources/vectara-chunking-vs-embedding-naacl2025.md +39 -0
  463. package/vault/wiki/sources/vectara-guardian-agents.md +79 -0
  464. package/vault/wiki/sources/vgrep-semantic-search.md +76 -0
  465. package/vault/wiki/sources/vitest-official.md +41 -0
  466. package/vault/wiki/sources/vscode-pi-community-extension.md +40 -0
  467. package/vault/wiki/sources/wozcode.md +79 -0
  468. package/.agents/skills/compress/SKILL.md +0 -111
  469. package/.agents/skills/compress/scripts/__init__.py +0 -9
  470. package/.agents/skills/compress/scripts/__main__.py +0 -3
  471. package/.agents/skills/compress/scripts/benchmark.py +0 -78
  472. package/.agents/skills/compress/scripts/cli.py +0 -73
  473. package/.agents/skills/compress/scripts/compress.py +0 -227
  474. package/.agents/skills/compress/scripts/detect.py +0 -121
  475. package/.agents/skills/compress/scripts/validate.py +0 -189
  476. package/.agents/skills/emil-design-eng/SKILL.md +0 -679
  477. package/.agents/skills/lean-ctx/SKILL.md +0 -149
  478. package/.agents/skills/lean-ctx/scripts/install.sh +0 -95
  479. package/.agents/skills/scrapling-official/LICENSE.txt +0 -28
  480. package/.agents/skills/scrapling-official/SKILL.md +0 -390
  481. package/.agents/skills/scrapling-official/examples/01_fetcher_session.py +0 -26
  482. package/.agents/skills/scrapling-official/examples/02_dynamic_session.py +0 -26
  483. package/.agents/skills/scrapling-official/examples/03_stealthy_session.py +0 -26
  484. package/.agents/skills/scrapling-official/examples/04_spider.py +0 -58
  485. package/.agents/skills/scrapling-official/examples/README.md +0 -45
  486. package/.agents/skills/scrapling-official/references/fetching/choosing.md +0 -78
  487. package/.agents/skills/scrapling-official/references/fetching/dynamic.md +0 -352
  488. package/.agents/skills/scrapling-official/references/fetching/static.md +0 -432
  489. package/.agents/skills/scrapling-official/references/fetching/stealthy.md +0 -255
  490. package/.agents/skills/scrapling-official/references/mcp-server.md +0 -214
  491. package/.agents/skills/scrapling-official/references/migrating_from_beautifulsoup.md +0 -86
  492. package/.agents/skills/scrapling-official/references/parsing/adaptive.md +0 -212
  493. package/.agents/skills/scrapling-official/references/parsing/main_classes.md +0 -586
  494. package/.agents/skills/scrapling-official/references/parsing/selection.md +0 -494
  495. package/.agents/skills/scrapling-official/references/spiders/advanced.md +0 -344
  496. package/.agents/skills/scrapling-official/references/spiders/architecture.md +0 -94
  497. package/.agents/skills/scrapling-official/references/spiders/getting-started.md +0 -164
  498. package/.agents/skills/scrapling-official/references/spiders/proxy-blocking.md +0 -235
  499. package/.agents/skills/scrapling-official/references/spiders/requests-responses.md +0 -196
  500. package/.agents/skills/scrapling-official/references/spiders/sessions.md +0 -205
  501. package/.github/banner.png +0 -0
  502. package/PLAN.md +0 -11
  503. package/extensions/lean-ctx-enforce.ts +0 -166
  504. package/skills-lock.json +0 -35
  505. package/wiki/README.md +0 -10
  506. package/wiki/decisions/0001-establish-project-wiki-and-decision-record-format.md +0 -25
  507. package/wiki/decisions/0002-add-project-banner-to-readme.md +0 -26
  508. package/wiki/decisions/0003-remove-redundant-readme-title-heading.md +0 -26
  509. package/wiki/decisions/0004-publish-package-to-npm-as-ultimate-pi.md +0 -26
@@ -0,0 +1,68 @@
1
+ ---
2
+ type: module
3
+ title: Schema-Based Orchestration
4
+ status: developing
5
+ created: 2026-04-28
6
+ updated: 2026-04-28
7
+ tags: [harness, orchestration, archon, dag, layer-7]
8
+ layer: "7"
9
+ sources:
10
+ - "[[harness-implementation-plan]]"
11
+ related:
12
+ - "[[agentic-harness]]"
13
+ - "[[persistent-memory]]"
14
+ - "[[structured-planning]]"
15
+ ---
16
+
17
+ # Schema-Based Orchestration via Archon
18
+
19
+ Layer 7 of the [[agentic-harness]]. Uses Archon's YAML workflow engine for DAG execution, loop nodes, human approval gates, worktree isolation, and run persistence. No custom orchestration code.
20
+
21
+ ## Architecture
22
+
23
+ | Need | Archon provides | Alternative build cost |
24
+ |------|-----------------|----------------------|
25
+ | DAG execution | YAML workflow nodes | Custom task graph executor |
26
+ | Loop nodes | `loop: { until: CONDITION }` | Custom rework loop logic |
27
+ | Human approval gates | `loop: { until: APPROVED, interactive: true }` | Custom approval UI |
28
+ | Worktree isolation | Auto git worktree per run | Custom branch management |
29
+ | Run persistence | SQLite/PostgreSQL | Custom state storage |
30
+ | Parallel nodes | Concurrent independent nodes | Custom parallel dispatch |
31
+
32
+ pi.dev extensions implement **intelligence**. Archon implements **orchestration**.
33
+
34
+ ## Primary Workflow: harness-pipeline.yaml
35
+
36
+ 1. **harden-spec** → Spec hardening
37
+ 2. **resolve-ambiguities** → Loop until no blocking ambiguities
38
+ 3. **create-plan** → Plan from hardened spec
39
+ 4. **review-plan** → Adversarial review
40
+ 5. **approve-plan** → Loop until approved (interactive)
41
+ 6. **execute-plan** → Loop until all tasks complete (max 100 iterations)
42
+ 7. **capture-memory** → Store results via wiki-ingest skill
43
+
44
+ ## Terminal States
45
+
46
+ | State | Meaning |
47
+ |-------|---------|
48
+ | `completed` | All control objectives passed |
49
+ | `blocked` | Mandatory gate/dependency unresolved |
50
+ | `replan_required` | Drift, failed critics, or spec change |
51
+ | `cancelled` | Precondition not met |
52
+ | `failed` | Retries/limits exhausted |
53
+
54
+ ## Extension Interface
55
+
56
+ | Type | Name |
57
+ |------|------|
58
+ | Tool | `orchestrate-plan` |
59
+ | Tool | `register-agent-capability` |
60
+ | Command | `/harness-orchestration-status` |
61
+
62
+ ## Files
63
+
64
+ - `lib/harness-orchestrator.ts` — Orchestrator class, schema validation, wave tracking
65
+ - `extensions/harness-orchestrator.ts` — pi.dev extension registration
66
+ - `.archon/workflows/harness-pipeline.yaml`
67
+ - `.archon/workflows/harness-fix-issue.yaml`
68
+ - `.archon/workflows/harness-quick-review.yaml`
@@ -0,0 +1,27 @@
1
+ ---
2
+ type: module
3
+ path: ".pi/skills/"
4
+ status: active
5
+ language: markdown
6
+ purpose: "Core capability plugins for the ultimate-pi agent."
7
+ maintainer: "aryaniyaps"
8
+ last_updated: "2026-04-28"
9
+ linked_issues: []
10
+ depends_on: ["lean-ctx"]
11
+ used_by: ["pi"]
12
+ tags: [module, skills]
13
+ created: "2026-04-28"
14
+ updated: "2026-04-28"
15
+ title: "Agent Skills"
16
+ ---
17
+ # Agent Skills
18
+
19
+ ## Description
20
+ The `skills/` directory contains individual skill definitions that extend the capabilities of the agent. These are structured as Obsidian-flavored markdown documents that provide instructions and context routing.
21
+
22
+ ## Key Skills
23
+ - `wiki`, `wiki-ingest`, `wiki-query`, `wiki-lint`
24
+ - `lean-ctx` (core operations)
25
+ - `caveman`
26
+ - `compress`
27
+ - `firecrawl`
@@ -0,0 +1,58 @@
1
+ ---
2
+ type: module
3
+ title: Spec Hardening
4
+ status: developing
5
+ created: 2026-04-28
6
+ updated: 2026-04-28
7
+ tags: [harness, spec, layer-1, quality]
8
+ layer: "1"
9
+ sources:
10
+ - "[[harness-implementation-plan]]"
11
+ related:
12
+ - "[[agentic-harness]]"
13
+ - "[[structured-planning]]"
14
+ - "[[adversarial-verification]]"
15
+ ---
16
+
17
+ # Spec Hardening
18
+
19
+ Layer 1 of the [[agentic-harness]]. Blocks execution until every underspecified component is resolved. Ambiguity is a bug — if you can't write a test for it, it's not specified.
20
+
21
+ ## Flow
22
+
23
+ 1. User request → `SpecHardener.harden()` → **HardenedSpec**
24
+ 2. Count blocking ambiguities → if > 0, loop back to user (max 3 retries)
25
+ 3. Store in `.pi/harness/specs/<id>.json`
26
+ 4. Emit `spec_hardened` → Layer 2
27
+
28
+ ## HardenedSpec Data Contract
29
+
30
+ | Field | Purpose |
31
+ |-------|---------|
32
+ | `intent_summary` | What the user actually wants |
33
+ | `success_criteria` | Each must be testable |
34
+ | `anti_criteria` | What the solution MUST NOT do |
35
+ | `ambiguity_flags` | Blocking or warning severity |
36
+ | `definition_of_done` | Single boolean expression |
37
+ | `scope_boundary` | Explicit in/out of scope |
38
+ | `constraints` | Technical or domain constraints |
39
+
40
+ ## Extension Interface
41
+
42
+ | Type | Name |
43
+ |------|------|
44
+ | Tool | `harden-spec` |
45
+ | Tool | `resolve-ambiguity` |
46
+ | Tool | `approve-spec` (human override) |
47
+ | Command | `/harness-spec-status` |
48
+
49
+ ## Config
50
+
51
+ ```json
52
+ { "spec_hardening": { "max_ambiguity_retries": 3, "auto_resolve_warning": true } }
53
+ ```
54
+
55
+ ## Files
56
+
57
+ - `lib/harness-spec.ts` — SpecHardener class, AI prompt construction
58
+ - `extensions/harness-spec.ts` — Extension: intercepts requests, runs hardening gate
@@ -0,0 +1,53 @@
1
+ ---
2
+ type: module
3
+ title: Structured Planning
4
+ status: developing
5
+ created: 2026-04-28
6
+ updated: 2026-04-28
7
+ tags: [harness, planning, dag, layer-2, quality]
8
+ layer: "2"
9
+ sources:
10
+ - "[[harness-implementation-plan]]"
11
+ related:
12
+ - "[[agentic-harness]]"
13
+ - "[[spec-hardening]]"
14
+ - "[[grounding-checkpoints]]"
15
+ - "[[schema-orchestration]]"
16
+ ---
17
+
18
+ # Structured Planning
19
+
20
+ Layer 2 of the [[agentic-harness]]. Produces a machine-readable task DAG reviewed before code begins. No code without a plan.
21
+
22
+ ## Flow
23
+
24
+ 1. `spec_hardened` event → `Planner.createPlan(spec)` → **ExecutionPlan**
25
+ 2. DAG validation: cycle detection, orphan detection, spec coverage
26
+ 3. If invalid → regenerate (max 3 revisions)
27
+ 4. Plan review gate: adversarial critic review OR human approval
28
+ 5. Store in `.pi/harness/plans/<id>.json`
29
+ 6. Emit `plan_approved` → Layer 7 (Archon)
30
+
31
+ ## ExecutionPlan Data Contract
32
+
33
+ Each **PlanNode**: `task_id`, `title`, `description`, `inputs`/`outputs`, `dependencies`, `risk_surface`, `verification`, `status`.
34
+
35
+ ## Validation Checks
36
+
37
+ - **Cycle detection** — no circular dependencies
38
+ - **Orphan detection** — no disconnected nodes
39
+ - **Spec coverage** — every success criterion maps to at least one task
40
+
41
+ ## Extension Interface
42
+
43
+ | Type | Name |
44
+ |------|------|
45
+ | Tool | `create-plan` |
46
+ | Tool | `review-plan` |
47
+ | Tool | `approve-plan` |
48
+ | Command | `/harness-plan-status` |
49
+
50
+ ## Files
51
+
52
+ - `lib/harness-planner.ts` — Planner class, DAG generation, validation
53
+ - `extensions/harness-planner.ts` — Extension for spec_hardened events
@@ -0,0 +1,153 @@
1
+ ---
2
+ type: module
3
+ title: "Think-in-Code Enforcement (L3)"
4
+ status: developing
5
+ created: 2026-04-30
6
+ updated: 2026-04-30
7
+ tags: [harness, think-in-code, context-optimization, layer-3, enforcement]
8
+ layer: "3"
9
+ sources:
10
+ - "[[think-in-code-blog]]"
11
+ - "[[context-mode-website]]"
12
+ - "[[Research: context-mode vs lean-ctx]]"
13
+ related:
14
+ - "[[think-in-code]]"
15
+ - "[[agentic-harness-context-enforcement]]"
16
+ - "[[grounding-checkpoints]]"
17
+ - "[[harness-implementation-plan]]"
18
+ - "[[lean-ctx]]"
19
+ ---
20
+
21
+ # Think-in-Code Enforcement (L3 Tool Layer)
22
+
23
+ A mandatory paradigm enforced at the L3 tool layer of the harness. Agents MUST write code to process data instead of reading raw data into the context window for mental processing. This is not a suggestion — it is enforced through system prompt injection, tool interception, and post-tool compression.
24
+
25
+ ## First Principles
26
+
27
+ 1. **Reading raw data into context is wasteful**: An agent reading 47 files to count errors consumes 700KB of context. A script doing the same analysis outputs 3.6KB. Reduction: 200×.
28
+ 2. **Agents are bad at mental computation**: Counting, filtering, comparing, parsing — these are CPU tasks. Agents should delegate to CPU.
29
+ 3. **Context is the scarcest resource**: Every token of raw data is a token not used for reasoning. The context budget must be protected.
30
+ 4. **The agent won't do this voluntarily**: Under pressure (context filling, task complexity), agents revert to read-everything patterns. Enforcement is mandatory.
31
+
32
+ ## Enforcement Architecture
33
+
34
+ Three-layer enforcement, from cheapest/least-reliable to most expensive/most-reliable:
35
+
36
+ ### Layer 1: System Prompt Injection (zero cost)
37
+
38
+ AGENTS.md rule:
39
+ ```markdown
40
+ ## Think in Code (MANDATORY)
41
+ When you need to analyze, count, filter, compare, or process data,
42
+ write code (JavaScript/Python) that does the work. Output only the
43
+ answer. Do NOT read raw data into context for mental processing.
44
+ Use built-ins only. No package installs. Always try/catch.
45
+ Use ctx_execute() for sandboxed execution.
46
+ ```
47
+
48
+ Cost: 0 tokens beyond the rule text. Reliability: depends on agent compliance.
49
+
50
+ ### Layer 2: PreToolUse Interception (medium cost)
51
+
52
+ Intercept `Read()`, `Bash()`, `WebFetch()` calls at L3 executor hooks. Detect data-analysis patterns:
53
+ - Sequential reads of 3+ files without edits between them
54
+ - grep/find on large result sets (>100 lines)
55
+ - WebFetch of large API responses (>5KB)
56
+
57
+ Route to `ctx_execute()` sandbox via pi-lean-ctx's execution capabilities instead.
58
+
59
+ Cost: ~0-50 tokens per intercepted call (check logic). Reliability: high — prevents wasteful calls before they happen.
60
+
61
+ ### Layer 3: PostToolUse Compression (medium cost)
62
+
63
+ When large output enters context despite interception, lean-ctx's 90+ shell pattern matchers auto-compress:
64
+ - Strip filler/boilerplate
65
+ - Keep only signal (errors, results, key data)
66
+ - Store raw output in searchable index (FTS5 equivalent)
67
+
68
+ Cost: 0 tokens (lean-ctx shell hook pattern matching). Reliability: medium — compresses what got through, doesn't prevent.
69
+
70
+ ---
71
+
72
+ ## Execution Sandbox: ctx_execute()
73
+
74
+ pi-lean-ctx provides `ctx_execute()` — a sandboxed code execution tool:
75
+
76
+ - **What runs**: JavaScript/TypeScript (Node.js built-ins only, no npm)
77
+ - **What returns**: Only `console.log()` output enters the conversation
78
+ - **Sandbox**: Isolated subprocess, no filesystem access outside working directory
79
+ - **Timeout**: Configurable (default: 30s)
80
+
81
+ ### Example: Before vs After
82
+
83
+ **Before** (without Think in Code):
84
+ ```
85
+ Agent: Read(file1) → Read(file2) → ... → Read(file47)
86
+ → mentally count errors → report
87
+ Context: 700KB consumed. 47 tool calls. 20+ turns.
88
+ ```
89
+
90
+ **After** (with Think in Code enforced):
91
+ ```
92
+ Agent: ctx_execute(`
93
+ const fs = require('fs');
94
+ const files = fs.readdirSync('./logs');
95
+ let errors = 0;
96
+ for (const f of files) {
97
+ const content = fs.readFileSync(`./logs/${f}`, 'utf8');
98
+ errors += (content.match(/ERROR/g) || []).length;
99
+ }
100
+ console.log(JSON.stringify({total_errors: errors, files_scanned: files.length}));
101
+ `)
102
+ → Output: {"total_errors": 127, "files_scanned": 47}
103
+ Context: 3.6KB consumed. 1 tool call. 1 turn.
104
+ ```
105
+
106
+ ---
107
+
108
+ ## What Gets Routed to Think-in-Code
109
+
110
+ | Pattern | Detection | Redirect |
111
+ |---------|-----------|----------|
112
+ | Sequential file reads (3+) without edits | L2 interception | `ctx_execute()` batch script |
113
+ | grep/find with >100 results | L2 interception | `ctx_execute()` with filtered output |
114
+ | WebFetch with >5KB response | L2 interception | `ctx_execute()` with `JSON.parse()` |
115
+ | "Count how many...", "Find all..." | L1 system prompt | Agent self-routes |
116
+ | "Compare X and Y..." | L1 system prompt | Agent self-routes |
117
+ | "Summarize the errors..." | L3 compression | lean-ctx auto-compresses |
118
+
119
+ ---
120
+
121
+ ## Efficiency Gains
122
+
123
+ | Scenario | Before | After | Reduction |
124
+ |----------|--------|-------|-----------|
125
+ | Multi-file data analysis | 47 Read() calls = 700KB | 1 ctx_execute() = 3.6KB | 200× |
126
+ | Error log scanning | 20 tool calls = 600KB | 1 execute = 20KB | 30× |
127
+ | API response parsing | 5 WebFetch + Read = 500KB | 1 execute = 1KB | 500× |
128
+ | Config comparison across files | 10 Read() = 200KB | 1 execute = 5KB | 40× |
129
+
130
+ ---
131
+
132
+ ## Integration with L3 Grounding Checkpoints
133
+
134
+ Think-in-Code enforcement runs as a pre-execution hook within L3:
135
+
136
+ ```
137
+ L3 Grounding Checkpoint:
138
+ 1. Pre-execution: spec grounding check
139
+ 2. Pre-execution: Think-in-Code enforcement check (is the agent about to do data analysis via raw reads?)
140
+ 3. Execute subtask
141
+ 4. Post-execution: spec grounding check
142
+ 5. Post-execution: context usage audit (did we exceed budget?)
143
+ ```
144
+
145
+ If an agent tries to bypass Think-in-Code (reads 47 files sequentially), L3 drift monitor (L2.5) detects "excessive searching" and triggers a soft nudge.
146
+
147
+ ---
148
+
149
+ ## Files
150
+
151
+ - `lib/harness-think-in-code.ts` — Enforcement logic, pattern detection, `ctx_execute()` routing
152
+ - Update `lib/harness-executor.ts` — Add Think-in-Code hook to pre-execution phase
153
+ - Update AGENTS.md — Add mandatory Think in Code rule
@@ -0,0 +1,64 @@
1
+ ---
2
+ type: module
3
+ title: Wiki Query Interface
4
+ status: developing
5
+ created: 2026-04-28
6
+ updated: 2026-04-28
7
+ tags: [harness, wiki, search, claude-obsidian, layer-8, query]
8
+ layer: "8"
9
+ sources:
10
+ - "[[harness-implementation-plan]]"
11
+ related:
12
+ - "[[agentic-harness]]"
13
+ - "[[persistent-memory]]"
14
+ ---
15
+
16
+ # Wiki Query Interface (claude-obsidian Skills)
17
+
18
+ Layer 8 of the [[agentic-harness]]. The query interface to the wiki. Uses claude-obsidian skills in GitHub Mode B — LLM-native search via hot.md → index.md → pages. See [[adr-009]].
19
+
20
+ ## Architecture
21
+
22
+ ```
23
+ Agent / Human
24
+ ├── wiki-query (read) ──→ wiki/hot.md → index.md → pages
25
+ ├── wiki-ingest (write) ──→ wiki/ (create/update pages)
26
+ └── wiki-lint (health) ──→ orphan/contradiction checks
27
+ ```
28
+
29
+ ## Query Operations
30
+
31
+ ### Three Depth Modes
32
+
33
+ | Mode | Code | Reads | Tokens |
34
+ |------|------|-------|--------|
35
+ | **Quick** | `query quick:` | hot.md + index.md | ~1,500 |
36
+ | **Standard** | default | hot.md → index → 3-5 pages | ~3,000 |
37
+ | **Deep** | `query deep:` | Full wiki + optional web | ~8,000+ |
38
+
39
+ ## Ingest Operations
40
+
41
+ | Harness Event | Wiki Write | Frontmatter |
42
+ |--------------|-----------|-------------|
43
+ | `spec_hardened` | `decisions/ADR-<N>.md` | `type: decision` |
44
+ | `plan_approved` | `flows/PLAN-<id>.md` | `type: flow` |
45
+ | `subtask_completed` | Append to `log.md` | Operation log entry |
46
+ | `subtask_verified` | `modules/<name>.md` | `type: module` |
47
+ | `subtask_failed` | `modules/<name>.md` | `> [!contradiction]` |
48
+
49
+ ## Lint Operations (after every 10-15 writes)
50
+
51
+ 1. Orphan pages
52
+ 2. Dead links
53
+ 3. Stale claims
54
+ 4. Missing pages
55
+ 5. Frontmatter gaps
56
+ 6. Empty sections
57
+ 7. Stale index entries
58
+
59
+ Output: `wiki/meta/lint-report-YYYY-MM-DD.md`
60
+
61
+ ## Dependencies
62
+
63
+ - 24 obsidian-wiki skills (`npx skills add Ar9av/obsidian-wiki --yes`)
64
+ - 5 obsidian-skills (`npx skills add kepano/obsidian-skills --yes`)
@@ -0,0 +1,51 @@
1
+ ---
2
+ type: overview
3
+ title: "Ultimate-PI Harness Architecture Overview"
4
+ created: 2026-04-30
5
+ updated: 2026-04-30
6
+ status: active
7
+ tags: [meta, overview, harness, architecture]
8
+ ---
9
+
10
+ # Ultimate-PI Harness Architecture Overview
11
+
12
+ ## What This Is
13
+
14
+ The **ultimate-pi agentic harness** is a mandatory 8-layer pipeline with drift monitoring, cross-cutting tool enhancements, and persistent wiki-based memory. Every AI coding task flows through all layers. Verification is mandatory — agent confidence is not evidence.
15
+
16
+ ## Architecture At a Glance
17
+
18
+ ```
19
+ L1: Spec Hardening → L2: Structured Planning → L2.5: Runtime Drift Monitor
20
+ ↓ ↓ ↓ (3 paradigms: tool-call, spec, implementation)
21
+ L3: Grounding Checkpoints → L4: Adversarial Verification → Phase 16: Lint+Format Gate
22
+ ↓ (with Think-in-Code, AST truncation, fuzzy edits,
23
+ inline syntax validation, ck semantic search, Gitingest)
24
+ L5: Automated Observability → L6: Persistent Memory (Wiki) → L7: Archon Orchestration → L8: Wiki Query
25
+ ```
26
+
27
+ ## Key Numbers
28
+
29
+ - **~15,000-16,000 tokens/subtask** pipeline overhead (with all enhancements)
30
+ - **27 build phases** (P0-P27) + 3 future phases (F1-F3)
31
+ - **4 new tools**: ck (semantic search), Gitingest (bulk ingestion), pi-messenger (debate transport), pi-lean-ctx (compression+governance)
32
+ - **3 control frameworks**: H=(E,T,C,S,L,V), Feedforward-Feedback, Generator-Evaluator
33
+ - **3 drift detection paradigms**: Tool-call (L2.5), Spec (L3), Implementation (L4)
34
+ - **Model-adaptive**: 4 profiles (opus/gpt/gemini/strict) × 4 configuration layers
35
+
36
+ ## Authoritative Pages
37
+
38
+ | Page | Role |
39
+ |------|------|
40
+ | [[harness-implementation-plan]] | Master plan: phases, token budget, architecture |
41
+ | [[harness]] | Pipeline overview with layer descriptions |
42
+ | [[harness-control-frameworks]] | Unified formal models |
43
+ | [[drift-detection-unified]] | Three complementary drift paradigms |
44
+ | [[index]] | Master catalog of all wiki pages |
45
+
46
+ ## Key Decisions
47
+
48
+ - [[adr-008]] — Spec-Only Black-Box QA
49
+ - [[adr-009]] — claude-obsidian Mode B persistent memory
50
+ - [[adr-010]] — Wiki tight-coupling contract (read-first, write-after)
51
+ - [[adr-011]] — Consensus debate with selective routing (iMAD)
@@ -0,0 +1,87 @@
1
+ ---
2
+ type: synthesis
3
+ title: "Research: pi-vs-claude-code Agentic Orchestration Pipeline"
4
+ created: 2026-05-03
5
+ updated: 2026-05-03
6
+ tags:
7
+ - research
8
+ - agentic-orchestration
9
+ - pi-agent
10
+ - harness
11
+ status: developing
12
+ related:
13
+ - "[[concepts/agentic-orchestration-pipeline]]"
14
+ - "[[concepts/agent-harness-architecture]]"
15
+ - "[[concepts/multi-agent-specialization]]"
16
+ - "[[concepts/context-engineering]]"
17
+ - "[[concepts/safety-defense-in-depth]]"
18
+ - "[[entities/pi-coding-agent]]"
19
+ - "[[entities/disler-indydevdan]]"
20
+ - "[[entities/opendev]]"
21
+ sources:
22
+ - "[[sources/disler-pi-vs-claude-code]]"
23
+ - "[[sources/opendev-arxiv-2603.05344v1]]"
24
+ - "[[sources/martin-fowler-harness-engineering]]"
25
+ - "[[sources/mindstudio-four-agent-types]]"
26
+ - "[[sources/anthropic-effective-harnesses]]"
27
+ ---
28
+
29
+ # Research: pi-vs-claude-code Agentic Orchestration Pipeline
30
+
31
+ ## Overview
32
+
33
+ The `disler/pi-vs-claude-code` repository demonstrates that Pi Coding Agent's extension system can implement production-grade multi-agent orchestration entirely in user-space TypeScript. Three orchestration patterns emerge — subagent delegation, team dispatch, and sequential chaining — each with distinct use cases and implementation strategies. These patterns can be ported to our harness as `.pi/skills/` extensions backed by YAML configuration files. The broader research reveals that harness engineering (context management, safety, feedback loops) is as critical as orchestration itself, and mature systems like OpenDev provide reference architectures for both.
34
+
35
+ ## Key Findings
36
+
37
+ 1. **Pi extensions can implement full orchestration without core changes** (Source: [[sources/disler-pi-vs-claude-code]]). The three orchestration extensions (subagent-widget, agent-team, agent-chain) are clean TypeScript files that hook Pi's event system. Our harness can adopt identical patterns.
38
+
39
+ 2. **Three orchestration patterns cover the design space** (Source: [[sources/disler-pi-vs-claude-code]], [[sources/mindstudio-four-agent-types]]):
40
+ - **Subagent delegation** (fan-out): Spawn isolated agents for parallel subtasks. Best for exploration, analysis, background work.
41
+ - **Team dispatch** (specialist routing): Dispatcher selects specialist from roster. Best for domain-specific work.
42
+ - **Sequential chaining** (pipeline): Agents execute in order with `$INPUT` passing. Best for multi-phase workflows.
43
+
44
+ 3. **Schema-level isolation is more robust than runtime checks** (Source: [[sources/opendev-arxiv-2603.05344v1]]). Removing tools from a subagent's schema makes dangerous operations structurally impossible. The model cannot argue for capabilities it doesn't know exist. This should be our default safety strategy.
45
+
46
+ 4. **Context engineering is a first-class concern, not an afterthought** (Source: [[sources/opendev-arxiv-2603.05344v1]], [[sources/martin-fowler-harness-engineering]]). Staged compaction (5 graduated thresholds), event-driven reminders (24 templates, user-role injection), and dual-memory architecture (episodic + working) are proven techniques. Our harness lacks all three.
47
+
48
+ 5. **Harness = Guides + Sensors + Steering Loop** (Source: [[sources/martin-fowler-harness-engineering]]). Feedforward guides steer before action; feedback sensors observe after. The human iterates on both. Our `.pi/skills/` are feedforward; `wiki-lint` and `posthog-analyst` are feedback. We need more computational sensors.
49
+
50
+ 6. **Multi-model pipelines beat single-model agents** (Source: [[sources/mindstudio-four-agent-types]], industry pattern). Different pipeline stages benefit from different models (Opus for planning, Sonnet for building, Haiku for reviewing). Our harness should support per-stage model selection.
51
+
52
+ 7. **Safety requires defense-in-depth, not single-point checks** (Source: [[sources/opendev-arxiv-2603.05344v1]], [[sources/disler-pi-vs-claude-code]]). Five independent layers: prompt guardrails → schema gating → runtime approval → tool validation → lifecycle hooks. Our harness has none of these in structured form.
53
+
54
+ ## Key Entities
55
+
56
+ - **[[entities/pi-coding-agent]]**: The foundation — open-source terminal coding agent with TypeScript extension API. Our harness platform.
57
+ - **[[entities/disler-indydevdan]]**: Created the reference implementation of Pi orchestration extensions. Primary source of patterns.
58
+ - **[[entities/opendev]]**: Most comprehensive reference architecture for terminal coding agents. Source of context engineering and safety patterns.
59
+
60
+ ## Key Concepts
61
+
62
+ - **[[concepts/agentic-orchestration-pipeline]]**: Three orchestration patterns (subagent, team, chain) with design principles for implementation.
63
+ - **[[concepts/agent-harness-architecture]]**: Scaffolding + Harness model. Feedforward guides + Feedback sensors in a steering loop.
64
+ - **[[concepts/multi-agent-specialization]]**: Specialization by role, model, and tool set. Team composition via YAML config.
65
+ - **[[concepts/context-engineering]]**: Staged compaction, dual-memory, event-driven reminders, lazy tool discovery, prompt caching.
66
+ - **[[concepts/safety-defense-in-depth]]**: Five-layer architecture with schema gating as the primary strategy.
67
+
68
+ ## Contradictions
69
+
70
+ - **Single agent vs Multi-agent overhead**: [[sources/mindstudio-four-agent-types]] warns that orchestration adds overhead and should only be used when a single agent demonstrably fails. [[sources/disler-pi-vs-claude-code]] shows orchestration as a default pattern. Resolution: Start with a single agent; add orchestration when context limits or specialization needs are clear. This aligns with our current approach where the `Agent` tool is used selectively.
71
+
72
+ ## Open Questions
73
+
74
+ - How to implement event-driven system reminders in Pi's extension API? Pi's event system supports `tool_call` and `turn_end` events — these could drive reminder injection.
75
+ - What's the right compaction strategy for our context window? Pi doesn't expose token counts to extensions — we may need to approximate or request API changes.
76
+ - How to persist approval rules across sessions? Pi's extension lifecycle includes `session_start` and `session_shutdown` — rules could be loaded/saved in these hooks.
77
+ - Can we implement 9-pass fuzzy editing in Pi's `edit` tool handler? Pi's extension API exposes `tool_call` events — we could intercept edit failures and retry with fuzzy matching.
78
+ - What's the performance impact of context isolation per subagent? Spawning new Pi processes per subagent may be expensive. Thread-based subagents (like OpenDev) would be lighter.
79
+ - How to implement the steering loop? Need a mechanism for humans to review harness performance and update guides/sensors. Our `wiki` + `posthog-analyst` pipeline is a start.
80
+
81
+ ## Sources
82
+
83
+ - [[sources/disler-pi-vs-claude-code]]: disler, Feb 2026 — Reference implementation of Pi orchestration extensions
84
+ - [[sources/opendev-arxiv-2603.05344v1]]: Nghi D. Q. Bui, Mar 2026 — Comprehensive terminal agent architecture paper
85
+ - [[sources/martin-fowler-harness-engineering]]: Birgitta Böckeler, Apr 2026 — Harness engineering mental model and framework
86
+ - [[sources/mindstudio-four-agent-types]]: MindStudio, Apr 2026 — Taxonomy of agent types and architecture decisions
87
+ - [[sources/anthropic-effective-harnesses]]: Justin Young (Anthropic), 2025 — Authoritative harness definition