ultimate-pi 0.1.2 → 0.1.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (516)
  1. package/.agents/skills/ck-search/SKILL.md +99 -0
  2. package/.agents/skills/defuddle/SKILL.md +90 -0
  3. package/.agents/skills/find-skills/SKILL.md +142 -0
  4. package/.agents/skills/firecrawl/SKILL.md +150 -0
  5. package/.agents/skills/firecrawl/rules/install.md +82 -0
  6. package/.agents/skills/firecrawl/rules/security.md +26 -0
  7. package/.agents/skills/firecrawl-agent/SKILL.md +57 -0
  8. package/.agents/skills/firecrawl-build-interact/SKILL.md +67 -0
  9. package/.agents/skills/firecrawl-build-onboarding/SKILL.md +102 -0
  10. package/.agents/skills/firecrawl-build-onboarding/references/auth-flow.md +39 -0
  11. package/.agents/skills/firecrawl-build-onboarding/references/project-setup.md +20 -0
  12. package/.agents/skills/firecrawl-build-onboarding/references/sdk-installation.md +17 -0
  13. package/.agents/skills/firecrawl-build-scrape/SKILL.md +68 -0
  14. package/.agents/skills/firecrawl-build-search/SKILL.md +68 -0
  15. package/.agents/skills/firecrawl-crawl/SKILL.md +58 -0
  16. package/.agents/skills/firecrawl-download/SKILL.md +69 -0
  17. package/.agents/skills/firecrawl-interact/SKILL.md +83 -0
  18. package/.agents/skills/firecrawl-map/SKILL.md +50 -0
  19. package/.agents/skills/firecrawl-parse/SKILL.md +61 -0
  20. package/.agents/skills/firecrawl-scrape/SKILL.md +68 -0
  21. package/.agents/skills/firecrawl-search/SKILL.md +59 -0
  22. package/.agents/skills/obsidian-bases/SKILL.md +299 -0
  23. package/.agents/skills/obsidian-markdown/SKILL.md +237 -0
  24. package/.agents/skills/posthog-analyst/SKILL.md +306 -0
  25. package/.agents/skills/posthog-analyst/evals/evals.json +23 -0
  26. package/.agents/skills/wiki/SKILL.md +215 -0
  27. package/.agents/skills/wiki/references/css-snippets.md +122 -0
  28. package/.agents/skills/wiki/references/frontmatter.md +107 -0
  29. package/.agents/skills/wiki/references/git-setup.md +58 -0
  30. package/.agents/skills/wiki/references/mcp-setup.md +149 -0
  31. package/.agents/skills/wiki/references/modes.md +259 -0
  32. package/.agents/skills/wiki/references/plugins.md +96 -0
  33. package/.agents/skills/wiki/references/rest-api.md +124 -0
  34. package/.agents/skills/wiki-autoresearch/SKILL.md +211 -0
  35. package/.agents/skills/wiki-autoresearch/references/program.md +75 -0
  36. package/.agents/skills/wiki-fold/SKILL.md +204 -0
  37. package/.agents/skills/wiki-fold/references/fold-template.md +133 -0
  38. package/.agents/skills/wiki-ingest/SKILL.md +288 -0
  39. package/.agents/skills/wiki-lint/SKILL.md +183 -0
  40. package/.agents/skills/wiki-query/SKILL.md +176 -0
  41. package/.agents/skills/wiki-save/SKILL.md +128 -0
  42. package/.ckignore +41 -0
  43. package/.env.example +9 -0
  44. package/.github/workflows/lint.yml +33 -0
  45. package/.github/workflows/publish-github-packages.yml +35 -0
  46. package/.github/workflows/publish-npm.yml +1 -1
  47. package/.pi/SYSTEM.md +107 -40
  48. package/.pi/agents/pi-pi/agent-expert.md +205 -0
  49. package/.pi/agents/pi-pi/cli-expert.md +47 -0
  50. package/.pi/agents/pi-pi/config-expert.md +67 -0
  51. package/.pi/agents/pi-pi/ext-expert.md +53 -0
  52. package/.pi/agents/pi-pi/keybinding-expert.md +123 -0
  53. package/.pi/agents/pi-pi/pi-orchestrator.md +103 -0
  54. package/.pi/agents/pi-pi/prompt-expert.md +83 -0
  55. package/.pi/agents/pi-pi/skill-expert.md +52 -0
  56. package/.pi/agents/pi-pi/theme-expert.md +46 -0
  57. package/.pi/agents/pi-pi/tui-expert.md +100 -0
  58. package/.pi/agents/rethink.md +140 -0
  59. package/.pi/agents/wiki-ingest.md +67 -0
  60. package/.pi/agents/wiki-lint.md +75 -0
  61. package/.pi/auto-commit.json +20 -0
  62. package/.pi/extensions/banner.png +0 -0
  63. package/.pi/extensions/ck-enforce.ts +216 -0
  64. package/.pi/extensions/custom-footer.ts +308 -0
  65. package/.pi/extensions/custom-header.ts +116 -0
  66. package/.pi/extensions/dotenv-loader.ts +170 -0
  67. package/.pi/internal/cursor-sdk-transcript-parser.ts +59 -0
  68. package/.pi/model-router.json +95 -0
  69. package/.pi/npm/.gitignore +2 -0
  70. package/.pi/prompts/git-sync.md +124 -0
  71. package/.pi/prompts/harness-setup.md +509 -0
  72. package/.pi/prompts/save.md +16 -0
  73. package/.pi/prompts/wiki-autoresearch.md +19 -0
  74. package/.pi/prompts/wiki.md +23 -0
  75. package/.pi/providers/cursor-sdk-provider.test.mjs +476 -0
  76. package/.pi/providers/cursor-sdk-provider.ts +1085 -0
  77. package/.pi/settings.json +14 -4
  78. package/.pi/skills/agent-router/SKILL.md +174 -0
  79. package/.pi/sounds/alert/1-kaching-track.mp3 +0 -0
  80. package/.pi/sounds/error/1-ksi-wth-track.mp3 +0 -0
  81. package/.pi/sounds/error/2-smash-track.mp3 +0 -0
  82. package/.pi/sounds/error/3-buzzer-track.mp3 +0 -0
  83. package/.pi/sounds/notification/1-soft-notification-track.mp3 +0 -0
  84. package/.pi/sounds/project-sounds.json +25 -0
  85. package/.pi/sounds/reminder/1-soft-notification-track.mp3 +0 -0
  86. package/.pi/sounds/success/1-tada-track.mp3 +0 -0
  87. package/.pi/sounds/success/2-jobs-done-track.mp3 +0 -0
  88. package/.pi/sounds/success/3-yay-track.mp3 +0 -0
  89. package/CONTRIBUTING.md +116 -0
  90. package/README.md +32 -39
  91. package/biome.json +34 -0
  92. package/firecrawl/.env.template +58 -0
  93. package/firecrawl/README.md +49 -0
  94. package/firecrawl/docker-compose.yaml +201 -0
  95. package/firecrawl/searxng/searxng.env +3 -0
  96. package/firecrawl/searxng/settings.yml +85 -0
  97. package/lefthook.yml +8 -0
  98. package/package.json +55 -24
  99. package/vault/AGENTS.md +37 -0
  100. package/vault/wiki/_templates/comparison.md +39 -0
  101. package/vault/wiki/_templates/concept.md +40 -0
  102. package/vault/wiki/_templates/decision.md +21 -0
  103. package/vault/wiki/_templates/entity.md +32 -0
  104. package/vault/wiki/_templates/flow.md +14 -0
  105. package/vault/wiki/_templates/module.md +18 -0
  106. package/vault/wiki/_templates/question.md +31 -0
  107. package/vault/wiki/_templates/source.md +39 -0
  108. package/vault/wiki/concepts/AST-Aware Code Chunking.md +44 -0
  109. package/vault/wiki/concepts/Build-Time Prompt Compilation.md +107 -0
  110. package/vault/wiki/concepts/Context Engine (AI Coding).md +47 -0
  111. package/vault/wiki/concepts/Context-Aware System Reminders.md +61 -0
  112. package/vault/wiki/concepts/Contextualized Text Embedding.md +42 -0
  113. package/vault/wiki/concepts/Contractor vs Employee AI Model.md +55 -0
  114. package/vault/wiki/concepts/Dual-Model Agent Architecture.md +65 -0
  115. package/vault/wiki/concepts/Late Chunking vs Early Chunking.md +43 -0
  116. package/vault/wiki/concepts/Majority Vote Ensembling.md +68 -0
  117. package/vault/wiki/concepts/Meta-Harness.md +16 -0
  118. package/vault/wiki/concepts/Multi-Agent AI Coding Architecture.md +75 -0
  119. package/vault/wiki/concepts/Prompt Enhancement.md +90 -0
  120. package/vault/wiki/concepts/Prompt Renderer.md +89 -0
  121. package/vault/wiki/concepts/Semantic Codebase Indexing.md +67 -0
  122. package/vault/wiki/concepts/additive-config-hierarchy.md +16 -0
  123. package/vault/wiki/concepts/agent-artifacts-verifiable-deliverables.md +71 -0
  124. package/vault/wiki/concepts/agent-browser-browser-automation.md +99 -0
  125. package/vault/wiki/concepts/agent-codebase-interface.md +43 -0
  126. package/vault/wiki/concepts/agent-harness-architecture.md +67 -0
  127. package/vault/wiki/concepts/agent-loop-detection-patterns.md +133 -0
  128. package/vault/wiki/concepts/agent-search-enforcement.md +126 -0
  129. package/vault/wiki/concepts/agent-skills-ecosystem.md +74 -0
  130. package/vault/wiki/concepts/agent-skills-pattern.md +68 -0
  131. package/vault/wiki/concepts/agentic-harness-context-enforcement.md +91 -0
  132. package/vault/wiki/concepts/agentic-harness.md +34 -0
  133. package/vault/wiki/concepts/agentic-orchestration-pipeline.md +56 -0
  134. package/vault/wiki/concepts/agentic-search-no-embeddings.md +18 -0
  135. package/vault/wiki/concepts/anthropic-context-engineering.md +13 -0
  136. package/vault/wiki/concepts/antigravity-agent-first-architecture.md +61 -0
  137. package/vault/wiki/concepts/ast-compression.md +19 -0
  138. package/vault/wiki/concepts/ast-truncation.md +66 -0
  139. package/vault/wiki/concepts/barrel-files.md +37 -0
  140. package/vault/wiki/concepts/browser-harness-agent.md +41 -0
  141. package/vault/wiki/concepts/browser-subagent-visual-verification.md +82 -0
  142. package/vault/wiki/concepts/codebase-intelligence-ecosystem-comparison.md +192 -0
  143. package/vault/wiki/concepts/codebase-intelligence-harness-integration.md +161 -0
  144. package/vault/wiki/concepts/codebase-to-context-ingestion.md +46 -0
  145. package/vault/wiki/concepts/codex-harness-innovations.md +147 -0
  146. package/vault/wiki/concepts/consensus-debate-flow.md +17 -0
  147. package/vault/wiki/concepts/consensus-debate.md +206 -0
  148. package/vault/wiki/concepts/content-addressed-spec-identity.md +166 -0
  149. package/vault/wiki/concepts/context-anxiety.md +57 -0
  150. package/vault/wiki/concepts/context-compression-techniques.md +19 -0
  151. package/vault/wiki/concepts/context-continuity.md +22 -0
  152. package/vault/wiki/concepts/context-drift-in-agents.md +106 -0
  153. package/vault/wiki/concepts/context-engineering.md +62 -0
  154. package/vault/wiki/concepts/context-folding.md +67 -0
  155. package/vault/wiki/concepts/context-mode.md +38 -0
  156. package/vault/wiki/concepts/cursor-harness-innovations.md +107 -0
  157. package/vault/wiki/concepts/deterministic-session-compaction.md +79 -0
  158. package/vault/wiki/concepts/drift-detection-unified.md +296 -0
  159. package/vault/wiki/concepts/execution-feedback-loop.md +46 -0
  160. package/vault/wiki/concepts/feedforward-feedback-harness.md +60 -0
  161. package/vault/wiki/concepts/five-root-cause-metrics-sentrux.md +40 -0
  162. package/vault/wiki/concepts/fork-safe-spec-storage.md +89 -0
  163. package/vault/wiki/concepts/fts5-sandbox.md +19 -0
  164. package/vault/wiki/concepts/fuzzy-edit-matching.md +71 -0
  165. package/vault/wiki/concepts/gemini-cli-architecture.md +104 -0
  166. package/vault/wiki/concepts/generator-evaluator-architecture.md +64 -0
  167. package/vault/wiki/concepts/guardian-agent-pattern.md +67 -0
  168. package/vault/wiki/concepts/harness-configuration-layers.md +89 -0
  169. package/vault/wiki/concepts/harness-control-frameworks.md +155 -0
  170. package/vault/wiki/concepts/harness-engineering-first-principles.md +90 -0
  171. package/vault/wiki/concepts/harness-h-formalism.md +53 -0
  172. package/vault/wiki/concepts/hybrid-code-search.md +61 -0
  173. package/vault/wiki/concepts/inline-post-edit-validation.md +112 -0
  174. package/vault/wiki/concepts/legendary-engineering-patterns-harness.md +110 -0
  175. package/vault/wiki/concepts/lifecycle-hooks.md +94 -0
  176. package/vault/wiki/concepts/mcp-tool-routing.md +102 -0
  177. package/vault/wiki/concepts/memory-system-of-record-vs-ephemeral-cache.md +47 -0
  178. package/vault/wiki/concepts/meta-agent-context-pruning.md +151 -0
  179. package/vault/wiki/concepts/model-adaptive-harness.md +122 -0
  180. package/vault/wiki/concepts/model-routing-agents.md +101 -0
  181. package/vault/wiki/concepts/monorepo-architecture.md +45 -0
  182. package/vault/wiki/concepts/multi-agent-specialization.md +61 -0
  183. package/vault/wiki/concepts/permission-subsystem.md +16 -0
  184. package/vault/wiki/concepts/pi-messenger-analysis.md +243 -0
  185. package/vault/wiki/concepts/pi-vscode-extension-landscape.md +37 -0
  186. package/vault/wiki/concepts/policy-engine-pattern.md +78 -0
  187. package/vault/wiki/concepts/progressive-disclosure-agents.md +53 -0
  188. package/vault/wiki/concepts/progressive-skill-disclosure.md +17 -0
  189. package/vault/wiki/concepts/provider-native-prompting.md +203 -0
  190. package/vault/wiki/concepts/quality-signal-sentrux.md +37 -0
  191. package/vault/wiki/concepts/repo-map-ranking.md +42 -0
  192. package/vault/wiki/concepts/result-monad-error-handling.md +47 -0
  193. package/vault/wiki/concepts/safety-defense-in-depth.md +83 -0
  194. package/vault/wiki/concepts/sandbox-os-enforcement.md +18 -0
  195. package/vault/wiki/concepts/selective-debate-routing.md +70 -0
  196. package/vault/wiki/concepts/self-evolving-harness.md +60 -0
  197. package/vault/wiki/concepts/sentrux-mcp-integration.md +36 -0
  198. package/vault/wiki/concepts/sentrux-rules-engine.md +49 -0
  199. package/vault/wiki/concepts/shell-pattern-compression.md +24 -0
  200. package/vault/wiki/concepts/skill-first-architecture.md +166 -0
  201. package/vault/wiki/concepts/structured-compaction.md +78 -0
  202. package/vault/wiki/concepts/subagent-orchestration.md +17 -0
  203. package/vault/wiki/concepts/subagent-worktree-isolation.md +68 -0
  204. package/vault/wiki/concepts/superpowers-methodology.md +78 -0
  205. package/vault/wiki/concepts/think-in-code.md +73 -0
  206. package/vault/wiki/concepts/ts-execution-layer.md +100 -0
  207. package/vault/wiki/concepts/typescript-strict-mode.md +37 -0
  208. package/vault/wiki/concepts/vcc-conversation-compaction-for-pi.md +51 -0
  209. package/vault/wiki/concepts/verification-drift-detection.md +19 -0
  210. package/vault/wiki/consensus/consensus-records.md +58 -0
  211. package/vault/wiki/decisions/2026-04-30-pi-lean-ctx-native.md +122 -0
  212. package/vault/wiki/decisions/adr-008.md +40 -0
  213. package/vault/wiki/decisions/adr-009.md +46 -0
  214. package/vault/wiki/decisions/adr-010.md +55 -0
  215. package/vault/wiki/decisions/adr-011.md +165 -0
  216. package/vault/wiki/decisions/adr-012.md +102 -0
  217. package/vault/wiki/decisions/adr-013.md +59 -0
  218. package/vault/wiki/decisions/adr-014.md +73 -0
  219. package/vault/wiki/decisions/adr-015.md +81 -0
  220. package/vault/wiki/decisions/adr-016.md +91 -0
  221. package/vault/wiki/decisions/adr-017.md +79 -0
  222. package/vault/wiki/decisions/adr-018.md +100 -0
  223. package/vault/wiki/decisions/adr-019.md +75 -0
  224. package/vault/wiki/decisions/adr-020.md +106 -0
  225. package/vault/wiki/decisions/adr-021.md +86 -0
  226. package/vault/wiki/decisions/adr-022.md +113 -0
  227. package/vault/wiki/decisions/adr-023.md +113 -0
  228. package/vault/wiki/decisions/adr-024.md +73 -0
  229. package/vault/wiki/decisions/adr-025.md +130 -0
  230. package/vault/wiki/decisions/adr-026.md +56 -0
  231. package/vault/wiki/decisions/colocate-wiki.md +34 -0
  232. package/vault/wiki/entities/Anders Hejlsberg.md +29 -0
  233. package/vault/wiki/entities/Anthropic.md +17 -0
  234. package/vault/wiki/entities/Augment Code.md +49 -0
  235. package/vault/wiki/entities/Bjarne Stroustrup.md +26 -0
  236. package/vault/wiki/entities/Bolt.new (StackBlitz).md +39 -0
  237. package/vault/wiki/entities/Boris Cherny.md +11 -0
  238. package/vault/wiki/entities/Claude Code.md +19 -0
  239. package/vault/wiki/entities/Dennis Ritchie.md +26 -0
  240. package/vault/wiki/entities/Emergent Labs.md +32 -0
  241. package/vault/wiki/entities/Google Cloud.md +16 -0
  242. package/vault/wiki/entities/Guido van Rossum.md +28 -0
  243. package/vault/wiki/entities/Ken Thompson.md +28 -0
  244. package/vault/wiki/entities/Lee et al.md +16 -0
  245. package/vault/wiki/entities/Linus Torvalds.md +28 -0
  246. package/vault/wiki/entities/Lovable (company).md +40 -0
  247. package/vault/wiki/entities/Martin Fowler.md +16 -0
  248. package/vault/wiki/entities/Meng et al.md +16 -0
  249. package/vault/wiki/entities/OpenAI.md +16 -0
  250. package/vault/wiki/entities/Rocket.new.md +38 -0
  251. package/vault/wiki/entities/VILA-Lab.md +15 -0
  252. package/vault/wiki/entities/autodev-codebase.md +18 -0
  253. package/vault/wiki/entities/ck-tool.md +59 -0
  254. package/vault/wiki/entities/codesearch.md +18 -0
  255. package/vault/wiki/entities/disler-indydevdan.md +33 -0
  256. package/vault/wiki/entities/gsd-get-shit-done.md +56 -0
  257. package/vault/wiki/entities/javascript-runtimes.md +48 -0
  258. package/vault/wiki/entities/jesse-vincent.md +38 -0
  259. package/vault/wiki/entities/lean-ctx.md +32 -0
  260. package/vault/wiki/entities/opendev.md +41 -0
  261. package/vault/wiki/entities/ops-codegraph-tool.md +18 -0
  262. package/vault/wiki/entities/pi-coding-agent.md +53 -0
  263. package/vault/wiki/entities/sentrux.md +54 -0
  264. package/vault/wiki/entities/vgrep-tool.md +57 -0
  265. package/vault/wiki/entities/vitest.md +41 -0
  266. package/vault/wiki/flows/harness-wiki-pipeline.md +204 -0
  267. package/vault/wiki/hot.md +932 -0
  268. package/vault/wiki/index.md +437 -0
  269. package/vault/wiki/log.md +418 -0
  270. package/vault/wiki/meta/dashboard.md +30 -0
  271. package/vault/wiki/meta/lint-report-2026-04-30.md +86 -0
  272. package/vault/wiki/meta/lint-report-2026-05-02.md +251 -0
  273. package/vault/wiki/meta/overview.canvas +43 -0
  274. package/vault/wiki/modules/adversarial-verification.md +57 -0
  275. package/vault/wiki/modules/automated-observability.md +54 -0
  276. package/vault/wiki/modules/bench.md +20 -0
  277. package/vault/wiki/modules/extensions.md +23 -0
  278. package/vault/wiki/modules/grounding-checkpoints.md +62 -0
  279. package/vault/wiki/modules/harness-implementation-plan.md +345 -0
  280. package/vault/wiki/modules/harness-wiki-skill-mapping.md +135 -0
  281. package/vault/wiki/modules/harness.md +86 -0
  282. package/vault/wiki/modules/persistent-memory.md +85 -0
  283. package/vault/wiki/modules/schema-orchestration.md +68 -0
  284. package/vault/wiki/modules/skills.md +27 -0
  285. package/vault/wiki/modules/spec-hardening.md +58 -0
  286. package/vault/wiki/modules/structured-planning.md +53 -0
  287. package/vault/wiki/modules/think-in-code-enforcement.md +153 -0
  288. package/vault/wiki/modules/wiki-query-interface.md +64 -0
  289. package/vault/wiki/overview.md +51 -0
  290. package/vault/wiki/questions/Research-pi-vs-claude-code-agentic-orchestration-pipeline.md +87 -0
  291. package/vault/wiki/questions/Research-sentrux-dev.md +123 -0
  292. package/vault/wiki/questions/Research-superpowers-skill-for-agentic-coding-agents.md +164 -0
  293. package/vault/wiki/questions/Research: Augment Code Context Engine.md +244 -0
  294. package/vault/wiki/questions/Research: Automating Software Engineering - Lovable, Bolt, Emergent, Rocket.md +112 -0
  295. package/vault/wiki/questions/Research: Claude Code State-of-the-Art Harness Improvements.md +209 -0
  296. package/vault/wiki/questions/Research: Codex State-of-the-Art Harness Improvements.md +99 -0
  297. package/vault/wiki/questions/Research: Engineering Workflows of Legendary Programmers and AI Harness Mapping.md +107 -0
  298. package/vault/wiki/questions/Research: Fallow Codebase Intelligence Harness Integration.md +72 -0
  299. package/vault/wiki/questions/Research: Gemini CLI SOTA Harness Integration.md +166 -0
  300. package/vault/wiki/questions/Research: GitHub Issues as Harness Spec Storage.md +188 -0
  301. package/vault/wiki/questions/Research: Google Antigravity Harness Integration.md +120 -0
  302. package/vault/wiki/questions/Research: Meta-Agent Context Drift Detection.md +236 -0
  303. package/vault/wiki/questions/Research: Model-Adaptive Agent Harness Design.md +95 -0
  304. package/vault/wiki/questions/Research: Model-Specific Prompting Guides.md +165 -0
  305. package/vault/wiki/questions/Research: Prompt Renderer for Multi-Model Agent Harness.md +216 -0
  306. package/vault/wiki/questions/Research: Skill-First Harness Architecture.md +91 -0
  307. package/vault/wiki/questions/Research: TypeScript Best Practices and Codebase Structure.md +88 -0
  308. package/vault/wiki/questions/Research: TypeScript Execution Layer for Agent Tool Calling.md +81 -0
  309. package/vault/wiki/questions/Research: claude-mem over Obsidian for Harness Layer.md +71 -0
  310. package/vault/wiki/questions/Research: claude-mem over obsidian wiki as the knowledge base for our agentic harness pipeline. think from first principles. does this replace or complement our current setup? no hard feelings about previous decisions. gimme accurate points.md +80 -0
  311. package/vault/wiki/questions/Research: context-mode vs lean-ctx.md +72 -0
  312. package/vault/wiki/questions/Research: cursor.sh Harness Innovations.md +92 -0
  313. package/vault/wiki/questions/Research: executor.sh Harness Integration.md +170 -0
  314. package/vault/wiki/questions/Research: how GSD fits into our coding harness setup.md +97 -0
  315. package/vault/wiki/questions/Research: how claude-mem fits into our workflow. and whether it should replace obsidian in the codebase. no hard feelings about previous actions, rethink from first principles always.md +80 -0
  316. package/vault/wiki/questions/Research: pi-vcc.md +113 -0
  317. package/vault/wiki/questions/Research: semantic code search tools.md +69 -0
  318. package/vault/wiki/questions/Research: vcc extension for pi coding agent.md +73 -0
  319. package/vault/wiki/questions/how-to-enable-semantic-code-search-now.md +111 -0
  320. package/vault/wiki/questions/mvp-implementation-blueprint.md +552 -0
  321. package/vault/wiki/questions/research-agent-first-codebase-exploration.md +199 -0
  322. package/vault/wiki/questions/research-agentic-coding-harness-latest-papers.md +142 -0
  323. package/vault/wiki/questions/research-gitingest-gitreverse-integration.md +100 -0
  324. package/vault/wiki/questions/research-wozcode-token-reduction.md +67 -0
  325. package/vault/wiki/questions/resolved-context-pruning-inplace-vs-restart.md +95 -0
  326. package/vault/wiki/questions/resolved-context-window-economics.md +167 -0
  327. package/vault/wiki/questions/resolved-imad-debate-gating-transfer.md +126 -0
  328. package/vault/wiki/questions/resolved-mcp-tool-preference.md +112 -0
  329. package/vault/wiki/questions/resolved-small-model-meta-agents.md +107 -0
  330. package/vault/wiki/questions/resolved-treesitter-dynamic-languages.md +95 -0
  331. package/vault/wiki/sources/Auggie Context MCP Server.md +63 -0
  332. package/vault/wiki/sources/Augment Code Codacy AI Giants.md +61 -0
  333. package/vault/wiki/sources/Augment Code MCP SiliconAngle.md +49 -0
  334. package/vault/wiki/sources/Augment Code WorkOS ERC 2025.md +55 -0
  335. package/vault/wiki/sources/Augment Context Engine Official.md +71 -0
  336. package/vault/wiki/sources/Augment SWE-bench Agent GitHub.md +74 -0
  337. package/vault/wiki/sources/Augment SWE-bench Pro Blog.md +58 -0
  338. package/vault/wiki/sources/Source: AgentBus Jinja2 Prompt Pipelines.md +75 -0
  339. package/vault/wiki/sources/Source: Arxiv — Don't Break the Cache.md +85 -0
  340. package/vault/wiki/sources/Source: Augment - Harness Engineering for AI Coding Agents.md +58 -0
  341. package/vault/wiki/sources/Source: Blake Crosley Agent Architecture Guide.md +100 -0
  342. package/vault/wiki/sources/Source: Bolt.new Architecture & Case Study.md +75 -0
  343. package/vault/wiki/sources/Source: Build-Time Prompt Compilation Architecture.md +107 -0
  344. package/vault/wiki/sources/Source: Claude API Agent Skills Overview.md +70 -0
  345. package/vault/wiki/sources/Source: Gemini CLI Changelogs.md +88 -0
  346. package/vault/wiki/sources/Source: Google Blog - Gemini CLI Announcement.md +57 -0
  347. package/vault/wiki/sources/Source: Google Gemini CLI Architecture Docs.md +53 -0
  348. package/vault/wiki/sources/Source: LangChain - Anatomy of Agent Harness.md +65 -0
  349. package/vault/wiki/sources/Source: Lovable Architecture & Clone Analysis.md +83 -0
  350. package/vault/wiki/sources/Source: Martin Fowler - Harness Engineering.md +70 -0
  351. package/vault/wiki/sources/Source: OpenAI Harness Engineering Five Principles.md +58 -0
  352. package/vault/wiki/sources/Source: OpenAI Harness Engineering — 0 Lines of Human Code.md +101 -0
  353. package/vault/wiki/sources/Source: OpenDev — Building AI Coding Agents for the Terminal.md +100 -0
  354. package/vault/wiki/sources/Source: Render AI Coding Agents Benchmark 2025.md +53 -0
  355. package/vault/wiki/sources/Source: Rocket.new — Vibe Solutioning Platform.md +70 -0
  356. package/vault/wiki/sources/Source: SwirlAI Agent Skills Progressive Disclosure.md +71 -0
  357. package/vault/wiki/sources/Source: TianPan Prompt Caching Architecture.md +89 -0
  358. package/vault/wiki/sources/Source: Vercel Labs agent-browser.md +155 -0
  359. package/vault/wiki/sources/Source: browser-harness CDP Harness.md +126 -0
  360. package/vault/wiki/sources/agent-drift-academic-paper.md +79 -0
  361. package/vault/wiki/sources/aider-repomap-tree-sitter.md +42 -0
  362. package/vault/wiki/sources/anthropic-compaction-api.md +58 -0
  363. package/vault/wiki/sources/anthropic-effective-harnesses.md +42 -0
  364. package/vault/wiki/sources/anthropic-prompt-best-practices.md +100 -0
  365. package/vault/wiki/sources/anthropic2026-harness-design.md +63 -0
  366. package/vault/wiki/sources/barrel-files-tkdodo.md +38 -0
  367. package/vault/wiki/sources/birth-of-unix-kernighan-interview.md +57 -0
  368. package/vault/wiki/sources/bockeler2026-harness-engineering.md +69 -0
  369. package/vault/wiki/sources/cast-code-chunking-paper.md +50 -0
  370. package/vault/wiki/sources/ck-semantic-search.md +78 -0
  371. package/vault/wiki/sources/claude-code-architecture-karaxai-2026.md +71 -0
  372. package/vault/wiki/sources/claude-code-architecture-qubytes-2026.md +50 -0
  373. package/vault/wiki/sources/claude-code-architecture-vila-lab-2026.md +64 -0
  374. package/vault/wiki/sources/claude-code-security-architecture-penligent-2026.md +70 -0
  375. package/vault/wiki/sources/claude-context-editing-docs.md +13 -0
  376. package/vault/wiki/sources/cloudflare-codemode.md +63 -0
  377. package/vault/wiki/sources/code-chunk-library-supermemory.md +63 -0
  378. package/vault/wiki/sources/codeact-apple-2024.md +62 -0
  379. package/vault/wiki/sources/codex-dsc-rfc-8573.md +41 -0
  380. package/vault/wiki/sources/codex-open-source-agent-2026.md +110 -0
  381. package/vault/wiki/sources/coir-code-retrieval-benchmark.md +51 -0
  382. package/vault/wiki/sources/colinmcnamara-context-optimization-codemode.md +48 -0
  383. package/vault/wiki/sources/context-folding-paper.md +61 -0
  384. package/vault/wiki/sources/context-mode-website.md +63 -0
  385. package/vault/wiki/sources/cursor-agent-best-practices-2026.md +62 -0
  386. package/vault/wiki/sources/cursor-fork-29b-2025.md +50 -0
  387. package/vault/wiki/sources/cursor-harness-april-2026.md +76 -0
  388. package/vault/wiki/sources/cursor-instant-apply-2024.md +45 -0
  389. package/vault/wiki/sources/cursor-shadow-workspace-2024.md +52 -0
  390. package/vault/wiki/sources/cursor-shipped-coding-agent-2026.md +53 -0
  391. package/vault/wiki/sources/cursor-vs-antigravity-2026.md +51 -0
  392. package/vault/wiki/sources/disler-pi-vs-claude-code.md +69 -0
  393. package/vault/wiki/sources/distill-deterministic-context-compression.md +53 -0
  394. package/vault/wiki/sources/embedding-models-benchmark-supermemory-2025.md +48 -0
  395. package/vault/wiki/sources/executor-rhyssullivan.md +122 -0
  396. package/vault/wiki/sources/fallow-rs-codebase-intelligence.md +125 -0
  397. package/vault/wiki/sources/fan2025-imad.md +60 -0
  398. package/vault/wiki/sources/forgecode-gpt5-agent-improvements.md +63 -0
  399. package/vault/wiki/sources/gemini-3-prompting-guide.md +78 -0
  400. package/vault/wiki/sources/gh-cli-sub-issue-rfc.md +50 -0
  401. package/vault/wiki/sources/gh-sub-issue-extension.md +72 -0
  402. package/vault/wiki/sources/github-fork-issues-discussion.md +44 -0
  403. package/vault/wiki/sources/github-issue-dependencies-docs.md +49 -0
  404. package/vault/wiki/sources/github-sub-issues-docs.md +51 -0
  405. package/vault/wiki/sources/gitingest.md +91 -0
  406. package/vault/wiki/sources/gitreverse.md +63 -0
  407. package/vault/wiki/sources/google-antigravity-official-blog.md +47 -0
  408. package/vault/wiki/sources/google-antigravity-wikipedia.md +53 -0
  409. package/vault/wiki/sources/gsd-codecentric-deep-dive.md +57 -0
  410. package/vault/wiki/sources/gsd-github-repo.md +51 -0
  411. package/vault/wiki/sources/gsd-hn-discussion.md +59 -0
  412. package/vault/wiki/sources/guido-python-design-philosophy.md +56 -0
  413. package/vault/wiki/sources/hejlsberg-7-learnings.md +48 -0
  414. package/vault/wiki/sources/ironclaw-drift-monitor.md +80 -0
  415. package/vault/wiki/sources/langsight-loop-detection.md +80 -0
  416. package/vault/wiki/sources/leanctx-website.md +69 -0
  417. package/vault/wiki/sources/lee2026-meta-harness.md +59 -0
  418. package/vault/wiki/sources/linux-kernel-coding-workflow.md +50 -0
  419. package/vault/wiki/sources/lou2026-autoharness.md +53 -0
  420. package/vault/wiki/sources/martin-fowler-harness-engineering.md +73 -0
  421. package/vault/wiki/sources/mcp-architecture-docs.md +13 -0
  422. package/vault/wiki/sources/meng2026-agent-harness-survey.md +79 -0
  423. package/vault/wiki/sources/mindstudio-four-agent-types.md +68 -0
  424. package/vault/wiki/sources/ms-chat-history-management.md +13 -0
  425. package/vault/wiki/sources/openai-prompt-guidance.md +104 -0
  426. package/vault/wiki/sources/openclaw-session-pruning.md +13 -0
  427. package/vault/wiki/sources/opencode-dcp.md +13 -0
  428. package/vault/wiki/sources/opendev-arxiv-2603.05344v1.md +79 -0
  429. package/vault/wiki/sources/openhands-platform.md +39 -0
  430. package/vault/wiki/sources/oss-guide-codebase-exploration.md +53 -0
  431. package/vault/wiki/sources/pi-compaction-extensions-ecosystem.md +102 -0
  432. package/vault/wiki/sources/pi-context-prune-github-repo.md +38 -0
  433. package/vault/wiki/sources/pi-mono-compaction-docs.md +38 -0
  434. package/vault/wiki/sources/pi-omni-compact-github-repo.md +50 -0
  435. package/vault/wiki/sources/pi-rtk-optimizer-github-repo.md +45 -0
  436. package/vault/wiki/sources/pi-vcc-github-repo.md +69 -0
  437. package/vault/wiki/sources/pi-vscode-marketplace.md +41 -0
  438. package/vault/wiki/sources/pi-vscode-model-provider-marketplace.md +39 -0
  439. package/vault/wiki/sources/py-tree-sitter.md +13 -0
  440. package/vault/wiki/sources/sentrux-dev-landing.md +40 -0
  441. package/vault/wiki/sources/sentrux-docs-pro-architecture.md +75 -0
  442. package/vault/wiki/sources/sentrux-docs-quality-signal.md +46 -0
  443. package/vault/wiki/sources/sentrux-docs-root-cause-metrics.md +57 -0
  444. package/vault/wiki/sources/sentrux-docs-rules-engine.md +58 -0
  445. package/vault/wiki/sources/sentrux-github-repo.md +56 -0
  446. package/vault/wiki/sources/superpowers-github-repo.md +56 -0
  447. package/vault/wiki/sources/superpowers-release-blog.md +54 -0
  448. package/vault/wiki/sources/superpowers-termdock-analysis.md +45 -0
  449. package/vault/wiki/sources/swe-agent-aci.md +42 -0
  450. package/vault/wiki/sources/swe-bench.md +45 -0
  451. package/vault/wiki/sources/swe-pruner-context-pruning.md +13 -0
  452. package/vault/wiki/sources/think-in-code-blog.md +48 -0
  453. package/vault/wiki/sources/tree-sitter-docs.md +13 -0
  454. package/vault/wiki/sources/ts-best-practices-2025-devto.md +42 -0
  455. package/vault/wiki/sources/ts-folder-structure-mingyang.md +58 -0
  456. package/vault/wiki/sources/ts-monorepo-koerselman.md +44 -0
  457. package/vault/wiki/sources/ts-result-error-handling-kkalamarski.md +52 -0
  458. package/vault/wiki/sources/ts-runtimes-comparison-betterstack.md +42 -0
  459. package/vault/wiki/sources/ts-strict-mode-rishikc.md +43 -0
  460. package/vault/wiki/sources/unix-philosophy.md +48 -0
  461. package/vault/wiki/sources/vectara-chunking-vs-embedding-naacl2025.md +39 -0
  462. package/vault/wiki/sources/vectara-guardian-agents.md +79 -0
  463. package/vault/wiki/sources/vgrep-semantic-search.md +76 -0
  464. package/vault/wiki/sources/vitest-official.md +41 -0
  465. package/vault/wiki/sources/vscode-pi-community-extension.md +40 -0
  466. package/vault/wiki/sources/wozcode.md +79 -0
  467. package/.agents/skills/compress/SKILL.md +0 -111
  468. package/.agents/skills/compress/scripts/__init__.py +0 -9
  469. package/.agents/skills/compress/scripts/__main__.py +0 -3
  470. package/.agents/skills/compress/scripts/benchmark.py +0 -78
  471. package/.agents/skills/compress/scripts/cli.py +0 -73
  472. package/.agents/skills/compress/scripts/compress.py +0 -227
  473. package/.agents/skills/compress/scripts/detect.py +0 -121
  474. package/.agents/skills/compress/scripts/validate.py +0 -189
  475. package/.agents/skills/emil-design-eng/SKILL.md +0 -679
  476. package/.agents/skills/lean-ctx/SKILL.md +0 -149
  477. package/.agents/skills/lean-ctx/scripts/install.sh +0 -95
  478. package/.agents/skills/scrapling-official/LICENSE.txt +0 -28
  479. package/.agents/skills/scrapling-official/SKILL.md +0 -390
  480. package/.agents/skills/scrapling-official/examples/01_fetcher_session.py +0 -26
  481. package/.agents/skills/scrapling-official/examples/02_dynamic_session.py +0 -26
  482. package/.agents/skills/scrapling-official/examples/03_stealthy_session.py +0 -26
  483. package/.agents/skills/scrapling-official/examples/04_spider.py +0 -58
  484. package/.agents/skills/scrapling-official/examples/README.md +0 -45
  485. package/.agents/skills/scrapling-official/references/fetching/choosing.md +0 -78
  486. package/.agents/skills/scrapling-official/references/fetching/dynamic.md +0 -352
  487. package/.agents/skills/scrapling-official/references/fetching/static.md +0 -432
  488. package/.agents/skills/scrapling-official/references/fetching/stealthy.md +0 -255
  489. package/.agents/skills/scrapling-official/references/mcp-server.md +0 -214
  490. package/.agents/skills/scrapling-official/references/migrating_from_beautifulsoup.md +0 -86
  491. package/.agents/skills/scrapling-official/references/parsing/adaptive.md +0 -212
  492. package/.agents/skills/scrapling-official/references/parsing/main_classes.md +0 -586
  493. package/.agents/skills/scrapling-official/references/parsing/selection.md +0 -494
  494. package/.agents/skills/scrapling-official/references/spiders/advanced.md +0 -344
  495. package/.agents/skills/scrapling-official/references/spiders/architecture.md +0 -94
  496. package/.agents/skills/scrapling-official/references/spiders/getting-started.md +0 -164
  497. package/.agents/skills/scrapling-official/references/spiders/proxy-blocking.md +0 -235
  498. package/.agents/skills/scrapling-official/references/spiders/requests-responses.md +0 -196
  499. package/.agents/skills/scrapling-official/references/spiders/sessions.md +0 -205
  500. package/PLAN.md +0 -11
  501. package/extensions/lean-ctx-enforce.ts +0 -166
  502. package/skills-lock.json +0 -35
  503. package/wiki/README.md +0 -19
  504. package/wiki/decisions/0001-establish-project-wiki-and-decision-record-format.md +0 -25
  505. package/wiki/decisions/0002-add-project-banner-to-readme.md +0 -26
  506. package/wiki/decisions/0003-remove-redundant-readme-title-heading.md +0 -26
  507. package/wiki/decisions/0004-publish-package-to-npm-as-ultimate-pi.md +0 -26
  508. package/wiki/decisions/0005-automate-npm-publish-with-github-actions.md +0 -27
  509. package/wiki/decisions/0006-switch-to-npm-trusted-publishing.md +0 -26
  510. package/wiki/decisions/0007-use-absolute-banner-url-for-npm-readme-rendering.md +0 -26
  511. package/wiki/decisions/0008-rename-banner-asset-for-cache-busting.md +0 -26
  512. package/wiki/decisions/0009-force-oidc-path-by-clearing-node-auth-token-in-publish-step.md +0 -25
  513. package/wiki/decisions/0010-simplify-setup-node-for-npm-trusted-publishing.md +0 -26
  514. package/wiki/decisions/0011-add-noop-workflow-change-to-force-fresh-publish-run.md +0 -25
  515. package/wiki/decisions/0012-align-workflow-runtime-with-npm-trusted-publishing-requirements.md +0 -26
  516. package/wiki/decisions/0013-add-package-repository-url-for-provenance-validation.md +0 -25
@@ -0,0 +1,89 @@
1
+ ---
2
+ type: source
3
+ status: ingested
4
+ source_type: engineering-blog
5
+ title: "Prompt Caching: The Optimization That Cuts LLM Costs by 90%"
6
+ author: "Tian Pan"
7
+ date_published: 2026-04-07
8
+ url: "https://tianpan.co/blog/2025-10-13-prompt-caching-cut-llm-costs"
9
+ confidence: high
10
+ tags:
11
+ - prompt-caching
12
+ - cost-optimization
13
+ - multi-model
14
+ - cache-architecture
15
+ related:
16
+ - "[[Research: Prompt Renderer for Multi-Model Agent Harness]]"
17
+ - "[[Source: Arxiv — Don't Break the Cache]]"
18
+ key_claims:
19
+ - "Most teams overpay 60-90% by reprocessing the same tokens on every request"
20
+ - "Multi-tier caching: Semantic cache (100% savings) → Prefix cache (50-90% savings) → Full inference (0% savings)"
21
+ - "Golden rule: static content first, dynamic content last — injecting timestamps/user IDs breaks the cache"
22
+ - "Parallel execution trap: firing parallel requests before cache warms → 4% hit rate. Fix: dedicated warmup call"
23
+ - "Anthropic: cache write 25% premium, cache read 90% discount — break-even at 1.4 cache hits"
24
+ - "OpenAI: auto-caching, 50% discount, no write premium"
25
+ - "Monitor: cache hit rate = cache_read_input_tokens / total_input_tokens, target 70%+"
26
+ created: 2026-05-02
27
+ updated: 2026-05-02
28
+
29
+ ---
+
+ # Multi-Tier Prompt Caching Architecture
30
+
31
+ ## Three-Tier Stack
32
+
33
+ ```
34
+ Request
35
+ → Semantic cache (exact/near-duplicate queries) → 100% savings
36
+ → Prefix cache (shared static context) → 50-90% savings
37
+ → Full inference → 0% savings
38
+ ```
39
+
40
+ A well-tuned system routes 70-80% of tokens through the caching layers.
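The tier order can be illustrated with a toy router. This is a minimal sketch, not the source's implementation: `TieredRouter` and its fields are hypothetical names, and the "semantic" tier is approximated by an exact match on a whitespace- and case-normalized query rather than by embedding similarity.

```python
from dataclasses import dataclass, field


@dataclass
class TieredRouter:
    """Toy three-tier router (hypothetical; real tiers live in separate services)."""

    response_cache: dict = field(default_factory=dict)  # tier 1: whole responses
    warm_prefixes: set = field(default_factory=set)     # tier 2: prefixes already processed

    def route(self, static_prefix: str, query: str) -> str:
        # Normalizing the query stands in for near-duplicate detection.
        key = (static_prefix, " ".join(query.lower().split()))
        if key in self.response_cache:
            return "semantic-cache"
        self.response_cache[key] = "<response>"
        if static_prefix in self.warm_prefixes:
            return "prefix-cache"
        self.warm_prefixes.add(static_prefix)
        return "full-inference"
```

Only the first request for a given static prefix falls through to full inference; later requests reuse the prefix, and repeated queries skip inference entirely.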
41
+
42
+ ## Prompt Structure IS Cache Architecture
43
+
44
+ The golden rule: **static content first, dynamic content last.**
45
+
46
+ ```
47
+ [System prompt — stable across all requests] ← CACHED
48
+ [Retrieved documents — stable for a given session] ← CACHED
49
+ [Conversation history — grows per turn] ← PARTIAL
50
+ [Current user message — always new] ← NEVER
51
+ ```
52
+
53
+ Cache-breaking anti-patterns:
54
+ - Timestamps in system prompts
55
+ - User IDs in static sections
56
+ - Request IDs injected early
57
+ - Document content that varies slightly across requests
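A short sketch of the ordering rule, assuming the prompt is assembled from four segment groups. `build_prompt` and `stable_prefix_hash` are illustrative names, not part of any provider SDK; the hash is only a way to check that the cacheable prefix stays byte-identical between requests.

```python
import hashlib


def build_prompt(
    system: str, documents: list[str], history: list[str], user_message: str
) -> list[str]:
    """Order segments from most stable to most volatile."""
    return [system, *documents, *history, user_message]


def stable_prefix_hash(system: str, documents: list[str]) -> str:
    """Hash the part that should be identical across requests in a session."""
    joined = "\n".join([system, *documents])
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()
```

If a timestamp or request ID is interpolated into `system`, the hash changes on every call, which is exactly the cache-breaking behavior listed above.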
58
+
59
+ ## Provider Differences
60
+
61
+ | Provider | Cache Control | Write Cost | Read Cost | TTL |
62
+ |----------|-------------|-----------|-----------|-----|
63
+ | Anthropic | Explicit `cache_control` markers | +25% premium | 90% discount | 5min (extends to 1h) |
64
+ | OpenAI | Automatic | None | 50% discount | 5min |
65
+ | Google | Explicit context cache | Storage cost | Guaranteed discount | Configurable |
66
+ | vLLM (self-host) | Automatic prefix caching (APC) | None | 14-24x throughput | Hash-table KV blocks |
67
+
68
+ ## The Parallel Execution Trap
69
+
70
+ **Problem**: Firing 10 parallel requests before the cache is written → 10 cache writes, 0 reads → 5-10x the expected cost.
71
+
72
+ **Fix**: Dedicated warmup call with `max_tokens=1` before parallel processing.
73
+
74
+ Cost comparison for a 30K-token document with 3 parallel questions: $0.34 without warming vs $0.14 with warming, a 59% reduction.
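The trap and the fix can be reproduced with a simulated provider. This is a sketch under stated assumptions: `FakeProvider` is a hypothetical stand-in, not a real SDK, and it models one behavior only, namely that a cache entry becomes visible after the request that wrote it has finished.

```python
import asyncio


class FakeProvider:
    """Hypothetical stand-in for an LLM API with prefix caching."""

    def __init__(self) -> None:
        self.cached: set[str] = set()
        self.writes = 0
        self.reads = 0

    async def complete(self, prefix: str, question: str, max_tokens: int = 256) -> str:
        hit = prefix in self.cached      # cache state is checked when the request starts
        await asyncio.sleep(0.01)        # simulated processing time
        if hit:
            self.reads += 1
        else:
            self.writes += 1
            self.cached.add(prefix)      # entry exists only after the request completes
        return f"answer to {question!r}"


async def ask_all(provider: FakeProvider, prefix: str, questions: list[str], warm: bool):
    if warm:
        # Dedicated warmup call: pay for one cache write, generate almost nothing.
        await provider.complete(prefix, "", max_tokens=1)
    await asyncio.gather(*(provider.complete(prefix, q) for q in questions))
    return provider.writes, provider.reads


def demo(warm: bool):
    return asyncio.run(ask_all(FakeProvider(), "30K-token document", ["q1", "q2", "q3"], warm))
```

In this simulation `demo(False)` yields three writes and no reads, while `demo(True)` yields one write followed by three reads.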
75
+
76
+ ## When Caching Hurts
77
+
78
+ - One-shot workflows: everything is unique, you're paying write premiums for zero reads
79
+ - Dynamic system prompts: personalization undermines prefix caching
80
+ - Short prompts: below 1,024-token threshold, caching doesn't engage
81
+ - Cold starts: freshly deployed services, cache TTL expiry at low-traffic hours
82
+
83
+ ## Relevance to ultimate-pi Prompt Renderer
84
+
85
+ The caching layer in the prompt renderer should:
86
+ 1. **Hash-based cache keys**: hash the base spec + variables → deterministic cache lookup
87
+ 2. **Pre-compiled prompts shipped in npm**: eliminates cache warmup entirely — prompts are pre-rendered at build time
88
+ 3. **Output caching for rendered prompts**: if same spec+model+vars produces the same output, return cached result
89
+ 4. **Monitoring**: track renderer cache hit rate (prompts served from pre-compiled vs runtime-rendered)
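A minimal sketch of items 1, 3 and 4, assuming a renderer callable that takes `(spec, model, variables)`. `cache_key` and `RenderCache` are hypothetical names; the actual renderer interface in ultimate-pi is not shown in this note.

```python
import hashlib
import json


def cache_key(spec: str, model: str, variables: dict) -> str:
    """Deterministic key: canonical JSON (sorted keys) hashed with SHA-256."""
    canonical = json.dumps(
        {"spec": spec, "model": model, "vars": variables},
        sort_keys=True,
        separators=(",", ":"),
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


class RenderCache:
    """Output cache for rendered prompts with a hit-rate counter."""

    def __init__(self, render):
        self._render = render
        self._store = {}
        self.hits = 0
        self.misses = 0

    def get(self, spec: str, model: str, variables: dict) -> str:
        key = cache_key(spec, model, variables)
        if key in self._store:
            self.hits += 1
        else:
            self.misses += 1
            self._store[key] = self._render(spec, model, variables)
        return self._store[key]

    @property
    def hit_rate(self) -> float:
        total = self.hits + self.misses
        return self.hits / total if total else 0.0
```

Sorting keys before hashing means that variable order does not change the key, so equivalent inputs always map to the same cached output.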
@@ -0,0 +1,155 @@
1
+ ---
2
+ type: source
3
+ status: ingested
4
+ source_type: official-repo
5
+ title: "agent-browser — Browser Automation CLI for AI Agents by Vercel Labs"
6
+ author: "Vercel Labs"
7
+ date_published: 2026-04-16
8
+ url: "https://github.com/vercel-labs/agent-browser"
9
+ confidence: high
10
+ tags:
11
+ - browser-automation
12
+ - ai-agents
13
+ - vercel-labs
14
+ - rust
15
+ - cdp
16
+ - headless-browser
17
+ related:
18
+ - "[[agent-browser-browser-automation]]"
19
+ - "[[browser-subagent-visual-verification]]"
20
+ - "[[harness-implementation-plan]]"
21
+ key_claims:
22
+ - "31.4K GitHub stars, 1.9K forks, 568 commits, Apache 2.0 — Rust-native browser automation CLI for AI agents"
23
+ - "Native Rust CLI + daemon — single binary, no Node.js required after install"
24
+ - "npm install -g agent-browser for global install. Also Homebrew, Cargo."
25
+ - "Snapshot + refs workflow optimized for LLMs: snapshot -i → click @e2 → fill @e3"
26
+ - "React introspection: react tree, react inspect, react renders, react suspense"
27
+ - "Web Vitals: LCP/CLS/TTFB/FCP/INP with React hydration phases"
28
+ - "Annotated screenshots: --annotate overlays numbered labels matching @eN refs"
29
+ - "Diff: structural snapshot diff + visual pixel diff between pages/states"
30
+ - "Multi-provider: Chrome local, Browserless, Browserbase, Browser Use, Kernel, AgentCore, iOS"
31
+ - "Security: domain allowlist, action policy, content boundaries, action confirmation"
32
+ - "Skills system: agent-browser skills get core — 420-line usage guide, npx skills add"
33
+ - "Dashboard: local web dashboard with live viewport, activity feed, AI chat"
34
+ - "batch mode: multi-command single invocation; session persistence; auth vault"
35
+ - "112 contributors, 81 releases, Rust 85% + TypeScript 12.4%"
36
+ created: 2026-05-02
37
+ updated: 2026-05-02
38
+ ---
39
+
40
+ # agent-browser — Browser Automation CLI for AI Agents
41
+
42
+ **Repository**: https://github.com/vercel-labs/agent-browser
43
+ **Stars**: 31.4K | **Forks**: 1.9K | **Commits**: 568 | **License**: Apache 2.0
44
+ **Language**: Rust 85% + TypeScript 12.4% | **Status**: Active (v0.26.0, Apr 16, 2026)
45
+
46
+ ## What It Is
47
+
48
+ agent-browser is a **Rust-native browser automation CLI purpose-built for AI agents**. Unlike traditional browser automation tools (Puppeteer, Playwright, Selenium) designed for human scripting, agent-browser provides an agent-first interface: snapshot-based element refs (`@e1`, `@e2`), JSON output mode, annotated screenshots, and structured diff commands.
49
+
50
+ **Core philosophy**: Give AI agents a CLI that speaks their language — refs, snapshots, JSON — not a scripting API.
51
+
52
+ ## Architecture
53
+
54
+ ```
55
+ AI Agent → agent-browser CLI → Rust Daemon → Chrome DevTools Protocol → Chrome
56
+ ↑ ↑
57
+ skills/ agent-browser.json (config)
58
+ (SKILL.md discovery) .agent-browser/ (sessions, auth)
59
+ ```
60
+
61
+ - **Rust CLI**: Parses commands, communicates with daemon via IPC
62
+ - **Rust Daemon**: Pure Rust daemon using direct CDP. No Node.js required.
63
+ - **Client-daemon model**: Daemon auto-starts, persists between commands for speed
64
+ - **Multi-provider backend**: Local Chrome, Browserless, Browserbase, Browser Use, Kernel, AgentCore, iOS
65
+
66
+ ## Key Properties
67
+
68
+ | Property | Description |
69
+ |----------|-------------|
70
+ | **Agent-native** | Snapshot with refs (`@e1`), JSON output, annotated screenshots with matching labels |
71
+ | **Rust-native** | Single binary, sub-second startup. 85% Rust, 12.4% TypeScript |
72
+ | **Full CLI** | 80+ commands: navigate, interact, snapshot, screenshot, diff, react, network, auth |
73
+ | **Skills system** | `agent-browser skills get core` — 420-line usage guide. `npx skills add vercel-labs/agent-browser` |
74
+ | **Security-first** | Domain allowlist, action policy, content boundaries, auth vault with AES-256-GCM encryption |
75
+ | **Multi-provider** | Local Chrome + 6 cloud providers (Browserless, Browserbase, Browser Use, Kernel, AgentCore, iOS) |
76
+ | **Dashboard** | Local web dashboard (port 4848) with live viewport, activity feed, AI chat |
77
+ | **Batch mode** | Multiple commands in single CLI invocation, JSON stdin mode |
78
+ | **Diff** | Structural snapshot diff + visual pixel diff between before/after states |
79
+ | **React introspection** | React component tree, fiber inspection, suspense boundaries, render profiling |
80
+ | **Web Vitals** | LCP/CLS/TTFB/FCP/INP with React hydration phase breakdown |
81
+
82
+ ## Why agent-browser Replaces browser-harness for P30
83
+
84
+ | Aspect | browser-harness | agent-browser |
85
+ |--------|----------------|---------------|
86
+ | **Stars** | 9.4K | 31.4K (about 3.3× as many) |
87
+ | **Language** | Python (~592 lines core) | Rust (native binary, sub-second) |
88
+ | **AI agent workflow** | Raw CDP — agent writes helpers mid-execution | Snapshot + refs — purpose-built for LLMs |
89
+ | **Skill system** | None | Built-in: `skills get core`, `npx skills add` |
90
+ | **Diff/verify** | None | Structural + visual diff between states |
91
+ | **Annotated screenshots** | None | `--annotate` with numbered labels → `@eN` refs |
92
+ | **React/Vitals** | None | `react tree`, `react renders`, `vitals` |
93
+ | **Security** | None | Domain allowlist, action policy, boundaries, auth vault |
94
+ | **Cloud providers** | None | 6 providers (Browserless, Browserbase, Browser Use, Kernel, AgentCore, iOS) |
95
+ | **Dashboard** | None | Live viewport + activity feed + AI chat |
96
+ | **Install** | `uv add browser-harness` (Python) | `npm install -g agent-browser` (single binary) |
97
+ | **Maturity** | 253 commits, 1 main contributor | 568 commits, 112 contributors, 81 releases |
98
+ | **License** | MIT | Apache 2.0 |
99
+
100
+ ## Integration with ultimate-pi Harness (P30)
101
+
102
+ ```
103
+ P25 Subagent Router → P30 Browser Subagent
104
+
105
+ agent-browser CLI (Rust binary, sub-second startup)
106
+
107
+ Chrome DevTools Protocol (Rust daemon)
108
+
109
+ Chrome (headless or headed)
110
+
111
+ Visual verification: agent-browser snapshot -i → click @e2 → screenshot --annotate
112
+
113
+ Diff: agent-browser diff screenshot --baseline before.png
114
+ ```
115
+
116
+ ### Harness Config
117
+
118
+ ```jsonc
119
+ // .pi/harness/browser.json
120
+ {
121
+ "engine": "agent-browser",
122
+ "mode": "headless",
123
+ "screenshot_dir": ".pi/harness/screenshots/",
124
+ "viewport": {"width": 1280, "height": 720},
125
+ "timeout_ms": 25000
126
+ }
127
+ ```
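A loader for this config might look like the sketch below. It is an assumption about how the harness could consume the file, not code from the package: `BrowserConfig` and `parse_browser_config` are hypothetical, and the accepted modes are taken from the "headless or headed" wording in the diagram above. Whole-line `//` comments are stripped first because plain JSON does not allow them.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class BrowserConfig:
    engine: str
    mode: str
    screenshot_dir: str
    viewport: tuple[int, int]
    timeout_ms: int


def parse_browser_config(text: str) -> BrowserConfig:
    # Plain JSON has no comments, so drop whole-line // comments first.
    body = "\n".join(
        line for line in text.splitlines() if not line.lstrip().startswith("//")
    )
    raw = json.loads(body)
    if raw["mode"] not in ("headless", "headed"):
        raise ValueError(f"unsupported mode: {raw['mode']!r}")
    if raw["timeout_ms"] <= 0:
        raise ValueError("timeout_ms must be positive")
    return BrowserConfig(
        engine=raw["engine"],
        mode=raw["mode"],
        screenshot_dir=raw["screenshot_dir"],
        viewport=(raw["viewport"]["width"], raw["viewport"]["height"]),
        timeout_ms=raw["timeout_ms"],
    )
```

Failing fast on an unknown mode or a non-positive timeout keeps misconfiguration out of the browser subagent itself.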
128
+
129
+ ### Key Commands for Harness P30
130
+
131
+ ```bash
132
+ # Navigate and snapshot
133
+ agent-browser open <url> && agent-browser snapshot -i --json
134
+
135
+ # Interact via refs
136
+ agent-browser click @e2
137
+ agent-browser fill @e3 "text"
138
+
139
+ # Visual verification
140
+ agent-browser screenshot --annotate before.png
141
+ # ... code change ...
142
+ agent-browser reload
143
+ agent-browser screenshot --annotate after.png
144
+ agent-browser diff screenshot --baseline before.png -o diff.png
145
+
146
+ # Structural diff
147
+ agent-browser diff snapshot --baseline before-snapshot.txt
148
+ ```
149
+
150
+ ## What We Deliberately Do NOT Adopt
151
+
152
+ - **Dashboard UI**: CLI harness only. Dashboard is nice-to-have for debugging but not integrated.
153
+ - **AI Chat feature**: Uses Vercel AI Gateway. Our agent IS the chat. Not needed.
154
+ - **Cloud providers**: Local Chrome only for harness. Cloud providers add latency and cost. Available as opt-in.
155
+ - **iOS Simulator**: Out of scope for web-focused harness. Available as opt-in.
@@ -0,0 +1,126 @@
1
+ ---
2
+ type: source
3
+ status: ingested
4
+ source_type: official-repo
5
+ title: "browser-harness — Self-Healing CDP Harness by browser-use"
6
+ author: "browser-use"
7
+ date_published: 2026-04-17
8
+ url: "https://github.com/browser-use/browser-harness"
9
+ confidence: high
10
+ tags:
11
+ - browser-automation
12
+ - cdp
13
+ - headless-browser
14
+ - browser-harness
15
+ - self-healing
16
+ related:
17
+ - "[[Research: Google Antigravity Harness Integration]]"
18
+ - "[[browser-subagent-visual-verification]]"
19
+ - "[[browser-harness-agent]]"
20
+ key_claims:
21
+ - "9.4K GitHub stars, 855 forks, 253 commits, MIT license — thin CDP harness for LLM browser control"
22
+ - "~592 lines of Python core — connects LLM directly to Chrome via one WebSocket, nothing between"
23
+ - "Self-healing: agent writes missing helper functions mid-task during execution"
24
+ - "TypeScript version available: browser-harness-js (428 stars, Bun-native, 652 typed CDP wrappers)"
25
+ - "No pre-baked helpers — raw CDP protocol. Agent calls session.Domain.method() directly"
26
+ - "One WebSocket to Chrome, zero abstraction layers. The protocol IS the API"
27
+ - "agent-workspace: agent-editable helper code + domain-skills/ for reusable per-site playbooks"
28
+ created: 2026-05-02
29
+ updated: 2026-05-02
30
+
31
+ ---
32
+
33
+ # browser-harness — Self-Healing CDP Harness
34
+
35
+ **Repository**: https://github.com/browser-use/browser-harness
36
+ **Stars**: 9.4K | **Forks**: 855 | **Commits**: 253 | **License**: MIT
37
+ **Language**: Python 100% | **Status**: Active (commits today — May 2, 2026)
38
+
39
+ ## What It Is
40
+
41
+ browser-harness is a **minimal, self-healing CDP harness** that connects LLMs directly to Chrome via the Chrome DevTools Protocol. Unlike Puppeteer/Playwright (which wrap CDP with high-level helper APIs), browser-harness gives the LLM **direct CDP access** — the agent writes what's missing during execution.
42
+
43
+ **Core philosophy**: "One WebSocket to Chrome, nothing between. The agent writes what's missing during execution. The harness improves itself every run."
44
+
45
+ ## Architecture
46
+
47
+ ```
48
+ LLM Agent → browser-harness → Chrome DevTools Protocol (CDP) → Chrome
49
+
50
+ agent-workspace/agent_helpers.py (agent edits this!)
51
+ agent-workspace/domain-skills/ (reusable per-site playbooks)
52
+ ```
53
+
54
+ ## Key Properties
55
+
56
+ | Property | Description |
57
+ |----------|-------------|
58
+ | **Minimal** | ~592 lines of Python core. No Puppeteer, no Playwright, no Selenium. |
59
+ | **Self-healing** | Agent encounters missing helper → agent writes it mid-task → harness works next time. |
60
+ | **CDP-native** | Direct `session.Page.navigate()`, `session.Input.dispatchMouseEvent()` — no wrappers. |
61
+ | **Thin** | One WebSocket to Chrome. Nothing between the LLM and the browser. |
62
+ | **Agent-editable** | `agent-workspace/agent_helpers.py` is designed for the agent to edit during execution. |
63
+ | **Domain skills** | `agent-workspace/domain-skills/` — reusable playbooks per site (GitHub, LinkedIn, Amazon…). |
64
+
65
+ ## TypeScript Version: browser-harness-js
66
+
67
+ **Repository**: https://github.com/browser-use/browser-harness-js
68
+ **Stars**: 428 | **License**: MIT | **Language**: TypeScript 99.4%
69
+
70
+ - 56 CDP domains, 652 typed wrappers — auto-generated from protocol JSON
71
+ - `npx skills add https://github.com/browser-use/browser-harness-js`
72
+ - Bun-native REPL server. CLI forwards snippets to running session.
73
+ - **No helpers at all** — "The protocol IS the API. If Chrome can do it, you can call it."
74
+ - Pure CDP recipes in `interaction-skills/`
75
+
76
+ ## Why browser-harness Replaces Puppeteer for P30
77
+
78
+ | Aspect | Puppeteer | browser-harness |
79
+ |--------|-----------|-----------------|
80
+ | **Abstraction level** | High-level helpers (page.click, page.type) | Raw CDP (session.Input.dispatchMouseEvent) |
81
+ | **LLM-native** | Designed for human scripting | Designed for LLMs to write CDP calls directly |
82
+ | **Self-healing** | No — fix scripts manually | Yes — agent writes missing helpers mid-execution |
83
+ | **Weight** | Heavy npm package + Chromium download | ~592 lines of Python or ~650 typed CDP wrappers in TS |
84
+ | **Freedom** | Limited to pre-built helper API | Complete CDP freedom — all 56+ domains accessible |
85
+ | **Version drift** | Puppeteer must update for new Chrome features | Auto-generated from CDP protocol JSON — always current |
86
+ | **Deployment** | `npm install puppeteer` | `uv init && uv add browser-harness` (Python) or `npx skills add` (JS) |
87
+
88
+ ## Integration with ultimate-pi Harness (P30)
89
+
90
+ ```
91
+ P25 Subagent Router → P30 Browser Subagent
92
+
93
+ browser-harness (thin CDP harness)
94
+
95
+ Chrome DevTools Protocol (one WebSocket)
96
+
97
+ Chrome (headless or headed)
98
+
99
+ Visual verification: screenshots via CDP Page.captureScreenshot
100
+
101
+ Self-healing: agent writes missing interaction helpers in agent_helpers.py
102
+ ```
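At the wire level, "raw CDP" means JSON messages with an `id`, a `method`, and `params` sent over the WebSocket. The sketch below builds such messages and decodes a `Page.captureScreenshot` result, whose `data` field is base64-encoded. It deliberately avoids browser-harness's own API (not reproduced here); `cdp_message` and `decode_screenshot` are hypothetical helpers, and no connection is opened.

```python
import base64
import itertools
import json

_ids = itertools.count(1)  # CDP matches responses to requests by id


def cdp_message(method: str, **params) -> str:
    """Serialize one CDP command, e.g. Page.navigate or Page.captureScreenshot."""
    return json.dumps({"id": next(_ids), "method": method, "params": params})


def decode_screenshot(response_text: str) -> bytes:
    """Extract image bytes from a Page.captureScreenshot response."""
    response = json.loads(response_text)
    return base64.b64decode(response["result"]["data"])
```

A helper that an agent writes mid-task would typically be a thin wrapper around a sequence of such messages.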
103
+
104
+ ### TypeScript Stack Preference
105
+
106
+ For our TypeScript harness, **browser-harness-js** is the natural fit:
107
+ - TypeScript-native (99.4% TS)
108
+ - 652 typed CDP methods auto-generated from protocol JSON
109
+ - Installed via `npx skills add` — no Python dependency
110
+ - Bun REPL server for persistent sessions across agent turns
111
+
112
+ For maximum capability (domain skills, mature agent-workspace), **browser-harness** (Python) provides more features. Hybrid approach: use browser-harness-js for the core CDP bridge, borrow the domain-skills pattern from browser-harness.
113
+
114
+ ### Config
115
+
116
+ ```jsonc
117
+ // .pi/harness/browser.json
118
+ {
119
+ "engine": "browser-harness",
120
+ "variant": "browser-harness-js",
121
+ "mode": "headless",
122
+ "cdp_url": "http://localhost:9222",
123
+ "screenshot_dir": ".pi/harness/screenshots/",
124
+ "agent_workspace": ".pi/harness/browser-workspace/"
125
+ }
126
+ ```
@@ -0,0 +1,79 @@
1
+ ---
2
+ type: source
3
+ status: ingested
4
+ source_type: academic-paper
5
+ title: "Agent Drift: Quantifying Behavioral Degradation in Multi-Agent LLM Systems Over Extended Interactions"
6
+ author: Abhishek Rath
7
+ date_published: 2026-01-07
8
+ url: https://arxiv.org/abs/2601.04170
9
+ confidence: high
10
+ key_claims:
11
+ - "Agent drift: progressive degradation in behavior, decision quality, and inter-agent coherence"
12
+ - "42% task success rate reduction, 3.2x human intervention increase in drifted systems"
13
+ - "ASI (Agent Stability Index): composite metric across 12 behavioral dimensions"
14
+ - "Three drift types: semantic, coordination, behavioral"
15
+ - "Combined mitigation strategies achieve 81.5% drift reduction"
16
+ tags:
17
+ - source
18
+ - academic
19
+ - agent-drift
20
+ - multi-agent
21
+ - reliability
22
+ related:
23
+ - "[[Research: Meta-Agent Context Drift Detection]]"
24
+ - "[[context-drift-in-agents]]"
25
+ - "[[agent-loop-detection-patterns]]"
26
+ created: 2026-05-02
27
+ updated: 2026-05-02
28
+
29
+ ---
+
+ # Agent Drift: Academic Paper
30
+
31
+ ## Summary
32
+
33
+ Foundational academic paper establishing agent drift as a measurable, quantifiable phenomenon in multi-agent LLM systems. Introduces the Agent Stability Index (ASI), a composite metric across 12 dimensions in 4 categories. Demonstrates through simulation that unchecked drift causes a 42% reduction in task success rate and a 3.2x increase in human intervention.
34
+
35
+ ## What It Contributes
36
+
37
+ Provides the academic foundation for agent drift as a real problem (not just anecdotal). The ASI framework gives a rigorous measurement methodology. The mitigation strategies (EMC, DAR, ABA) validate that drift can be controlled. Establishes that context window pollution is a primary mechanism — directly supporting the case for context pruning.
38
+
39
+ ## Three Drift Types
40
+
41
+ 1. **Semantic drift**: Agent outputs progressively deviate from original task intent while remaining syntactically valid
42
+ 2. **Coordination drift**: Multi-agent consensus mechanisms degrade, leading to conflicts, redundant work
43
+ 3. **Behavioral drift**: Agents develop novel strategies not present in initial interactions
44
+
45
+ ## Agent Stability Index (ASI)
46
+
47
+ Composite metric across 12 dimensions in 4 categories:
48
+
49
+ 1. **Response Consistency** (weight: 0.30): Output semantic similarity, decision pathway stability, confidence calibration
50
+ 2. **Tool Usage Patterns** (weight: 0.25): Tool selection stability, tool sequencing consistency, parameterization drift
51
+ 3. **Inter-Agent Coordination** (weight: 0.25): Consensus agreement rate, handoff efficiency, role adherence
52
+ 4. **Behavioral Boundaries** (weight: 0.20): Output length stability, error pattern emergence, human intervention rate
53
+
54
+ ASI computed over rolling 50-interaction windows. Drift detected when ASI <0.75 for 3 consecutive windows.
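The weighting and the detection rule can be sketched as follows. The category weights, the 0.75 threshold and the 3-window rule come from the summary above; how the three dimension scores inside each category are combined is not stated here, so a plain mean is assumed, and all names are illustrative.

```python
from statistics import mean

# Category weights as listed above.
WEIGHTS = {
    "response_consistency": 0.30,
    "tool_usage": 0.25,
    "coordination": 0.25,
    "boundaries": 0.20,
}


def asi(category_scores: dict[str, list[float]]) -> float:
    """Weighted ASI for one window; each category holds three scores in [0, 1].

    Assumption: dimensions within a category are averaged with equal weight.
    """
    return sum(WEIGHTS[name] * mean(category_scores[name]) for name in WEIGHTS)


def drift_detected(
    window_scores: list[float], threshold: float = 0.75, consecutive: int = 3
) -> bool:
    """True once ASI stays below the threshold for `consecutive` windows in a row."""
    run = 0
    for score in window_scores:
        run = run + 1 if score < threshold else 0
        if run >= consecutive:
            return True
    return False
```

A single window above the threshold resets the count, so isolated dips do not trigger detection.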
55
+
56
+ ## Key Findings
57
+
58
+ - Drift emerges after a median of 73 interactions (far earlier than expected)
59
+ - Drift accelerates: 0.08 ASI decline per 50 interactions (early) → 0.19 per 50 (late)
60
+ - Financial analysis agents drift fastest (53.2% by 500 interactions) due to task ambiguity
61
+ - Two-level hierarchies (router + specialists) are most drift-resistant
62
+ - External memory systems (vector DBs, structured logs) provide "behavioral anchors"
63
+
64
+ ## Three Mitigation Strategies
65
+
66
+ 1. **Episodic Memory Consolidation (EMC)**: Periodic compression of agent interaction histories → 51.9% drift reduction
67
+ 2. **Drift-Aware Routing (DAR)**: Router uses agent stability scores in delegation, resets drifting agents → 63.0% reduction
68
+ 3. **Adaptive Behavioral Anchoring (ABA)**: Few-shot prompt augmentation with baseline exemplars → 70.4% reduction
69
+ 4. **Combined (all three)**: 81.5% drift reduction, 23% computational overhead
70
+
71
+ ## Three Causal Mechanisms
72
+
73
+ 1. **Context window pollution**: Interaction histories fill with irrelevant information, diluting signal-to-noise
74
+ 2. **Distributional shift**: Agents encounter input distributions increasingly divergent from training data
75
+ 3. **Reinforcement through autoregression**: Small errors compound through feedback loops in shared memory
76
+
77
+ ## Relevance to Meta-Agent Concept
78
+
79
+ This paper validates that context window pollution is a primary causal mechanism of agent drift. Context pruning directly addresses this mechanism. The ASI framework provides metrics for evaluating whether pruning is effective. The finding that drift emerges after ~73 interactions sets a natural checkpoint frequency for meta-agent monitoring.
@@ -0,0 +1,42 @@
1
+ ---
2
+ type: source
3
+ source_type: blog
4
+ title: "Building a better repository map with tree-sitter"
5
+ author: "Aider (Paul Gauthier)"
6
+ date_published: 2023-10-22
7
+ url: "https://aider.chat/2023/10/22/repomap.html"
8
+ confidence: high
9
+ key_claims:
10
+ - "Repo maps provide GPT with a concise view of the entire codebase: files + key symbols with signatures"
11
+ - "tree-sitter parses source into AST to extract definitions and cross-references"
12
+ - "Graph ranking algorithm selects most important portions that fit within token budget (default 1k tokens)"
13
+ - "GPT can use the map to autonomously decide which files to inspect further"
14
+ - "Sending whole files wastes context window; repo map is a compressed representation"
15
+ - "Most important identifiers are those most referenced by other portions of code"
16
+ status: ingested
17
+ tags:
18
+ - agent-context
19
+ - tree-sitter
20
+ - repo-map
21
+ - context-window
22
+ created: 2023-10-22
23
+ updated: 2026-04-30
24
+
25
+ ---
+
+ # Building a better repository map with tree-sitter
26
+
27
+ Aider's approach to solving the "code context" problem for LLMs. When an LLM needs to make changes in a large codebase, it must understand how the target code relates to the rest of the codebase. Aider sends a concise repository map built via tree-sitter AST parsing.
28
+
29
+ ## Core Technique
30
+
31
+ 1. **tree-sitter parsing**: Extract all symbol definitions (classes, functions, methods, variables, types) from every source file
32
+ 2. **Reference tracking**: Identify where each symbol is used across the codebase
33
+ 3. **Graph ranking**: Build a dependency graph (files = nodes, dependencies = edges). Rank nodes by importance — most-referenced symbols are most important.
34
+ 4. **Token budget**: Select the top-ranked nodes that fit within a configurable token budget (default 1k tokens)
35
+ 5. **Dynamic adjustment**: Map expands when no files are in chat (need full context) and contracts when working on specific files
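The pipeline above can be approximated in a few lines. This is a simplified sketch, not Aider's implementation: Python's standard `ast` module stands in for tree-sitter (so it handles Python sources only), a plain reference count replaces the graph ranking step, and tokens are estimated by whitespace splitting.

```python
import ast
from collections import Counter


def repo_map(files: dict[str, str], token_budget: int = 1000) -> list[str]:
    """Return the most-referenced definitions that fit in the token budget."""
    definitions: dict[str, str] = {}  # symbol name -> "path: signature"
    references: Counter = Counter()   # symbol name -> number of uses
    for path, source in files.items():
        for node in ast.walk(ast.parse(source)):
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
                args = ", ".join(a.arg for a in node.args.args)
                definitions[node.name] = f"{path}: def {node.name}({args})"
            elif isinstance(node, ast.ClassDef):
                definitions[node.name] = f"{path}: class {node.name}"
            elif isinstance(node, ast.Name) and isinstance(node.ctx, ast.Load):
                references[node.id] += 1
            elif isinstance(node, ast.Attribute):
                references[node.attr] += 1
    # Most-referenced first; ties broken alphabetically for determinism.
    ranked = sorted(definitions, key=lambda name: (-references[name], name))
    lines, used = [], 0
    for name in ranked:
        cost = len(definitions[name].split())  # crude token estimate
        if used + cost > token_budget:
            break
        lines.append(definitions[name])
        used += cost
    return lines
```

Only signatures are emitted, never function bodies, which is what keeps the map compact enough to send on every request.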
36
+
37
+ ## Why This Works for Agents
38
+
39
+ - GPT sees call signatures and class structures across the entire repo
40
+ - Can autonomously decide which files to request for deeper inspection
41
+ - Compressed representation — doesn't waste context window on implementation details
42
+ - Tree-sitter is language-aware, producing structured, accurate symbol extraction
@@ -0,0 +1,58 @@
1
+ ---
2
+ type: source
3
+ source_type: official-docs
4
+ title: "Anthropic Context Compaction API (Beta)"
5
+ author: Anthropic
6
+ date_published: 2026-01-12
7
+ date_accessed: 2026-05-05
8
+ url: "https://docs.anthropic.com/en/docs/build-with-claude/compaction"
9
+ confidence: high
10
+ tags:
11
+ - anthropic
12
+ - claude
13
+ - compaction
14
+ - api
15
+ - context-management
16
+ key_claims:
17
+ - "Server-side automatic summarization when input tokens exceed threshold"
18
+ - "Beta, launched January 2026, header: compact-2026-01-12"
19
+ - "Supported models: Claude Mythos Preview, Opus 4.7, Opus 4.6, Sonnet 4.6"
20
+ - "Creates compaction block, drops all prior messages on next request"
21
+ - "ZDR (Zero Data Retention) eligible"
22
+ - "Context Folding available as first-class API primitive in context-management"
23
+ ---
24
+
25
+ # Anthropic Context Compaction API
26
+
27
+ ## Summary
28
+
29
+ Anthropic released a server-side context compaction API in beta (January 2026). When enabled, the API automatically detects when input tokens exceed a configurable threshold, generates a summary, creates a `compaction` block, and drops all prior messages on the next request.
30
+
31
+ ## How It Works
32
+
33
+ 1. Add `compact_20260112` to `context_management.edits` in Messages API request
34
+ 2. Include beta header `compact-2026-01-12`
35
+ 3. API detects when tokens exceed trigger threshold
36
+ 4. Generates summary → creates compaction block → continues response
37
+ 5. Subsequent requests automatically drop all pre-compaction messages
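Steps 1 and 2 amount to one extra request field and one header. The sketch below only builds the request and sends nothing. Assumptions: the edit entry is written as `{"type": "compact_20260112"}`, a shape inferred from the identifier above rather than confirmed by this note; the header name `anthropic-beta` is the usual carrier for Anthropic beta flags; the model ID is left to the caller.

```python
def compaction_request(model: str, messages: list[dict], max_tokens: int = 1024):
    """Build headers and body for a Messages API call with compaction enabled."""
    headers = {
        "anthropic-beta": "compact-2026-01-12",  # beta header from step 2
        "content-type": "application/json",
    }
    body = {
        "model": model,
        "max_tokens": max_tokens,
        "messages": messages,
        # Step 1: assumed shape of the compaction edit entry.
        "context_management": {"edits": [{"type": "compact_20260112"}]},
    }
    return headers, body
```

Any trigger-threshold option would go alongside the edit entry; its parameter name is not given in this note, so it is omitted here.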
38
+
39
+ ## Supported Models
40
+
41
+ - Claude Mythos Preview
42
+ - Claude Opus 4.7
43
+ - Claude Opus 4.6
44
+ - Claude Sonnet 4.6
45
+
46
+ ## Ideal Use Cases
47
+
48
+ - Long-running chat conversations
49
+ - Tool-heavy agentic workflows
50
+ - Multi-turn conversations exceeding context limits
51
+
52
+ ## Relevance to pi-vcc
53
+
54
+ This is Anthropic's official take on compaction — LLM-based, server-side, automatic. It validates that compaction is a first-class concern. However, it has all the failure modes pi-vcc avoids: non-deterministic, no recall, LLM cost. Pi could theoretically use this API as a backend for its compaction, but pi-vcc's deterministic approach remains architecturally distinct.
55
+
56
+ ## Context Folding
57
+
58
+ Context Folding (arXiv 2510.11967) is now available as a first-class API primitive in Anthropic's beta context-management. Agents can branch/return sub-trajectories, with intermediate steps "folded" away.
@@ -0,0 +1,42 @@
1
+ ---
2
+ type: source
3
+ source_type: engineering-blog
4
+ author: "Justin Young (Anthropic)"
5
+ date_published: 2025
6
+ url: https://www.anthropic.com/engineering/effective-harnesses-for-long-running-agents
7
+ confidence: high
8
+ tags:
9
+ - anthropic
10
+ - agent-harness
11
+ - long-running-agents
12
+ - context-windows
13
+ key_claims:
14
+ - "A harness is the runtime framework that coordinates tool dispatch, context lifecycle, progress tracking, and clean handoffs between context windows"
15
+ - "Long-running agents need structured handoffs between context windows"
16
+ - "The harness must manage context as a finite resource across extended timeframes"
17
+ ---
18
+
19
+ # Effective Harnesses for Long-Running Agents
20
+
21
+ Anthropic Engineering Blog — 2025. By Justin Young.
22
+
23
+ ## Core Definition
24
+
25
+ A harness is the runtime orchestration layer that wraps the core reasoning loop and coordinates:
26
+ - Tool dispatch
27
+ - Context lifecycle management
28
+ - Safety enforcement
29
+ - Session persistence
30
+ - Progress tracking
31
+ - Clean handoffs between context windows
32
+
33
+ ## Key Principles
34
+
35
+ 1. **Context windows are finite resources** — the harness must manage them explicitly across long timeframes
36
+ 2. **Structured handoffs** — when context fills, the harness must summarize and transfer state to a fresh window
37
+ 3. **Progress tracking** — agents must maintain awareness of what's been done across context boundaries
38
+ 4. **Safety invariants** — the harness enforces constraints that persist across context resets
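The four principles can be sketched as a small data structure. This is an illustration under stated assumptions, not Anthropic's or ultimate-pi's implementation: `ContextWindow` and `Handoff` are hypothetical, tokens are estimated by whitespace splitting, and `summarize` is any caller-supplied function.

```python
from dataclasses import dataclass, field


@dataclass
class Handoff:
    summary: str           # condensed state of the old window
    completed: list[str]   # progress that must survive the reset
    invariants: list[str]  # safety constraints that persist across resets


@dataclass
class ContextWindow:
    limit: int
    invariants: list[str]
    messages: list[str] = field(default_factory=list)
    completed: list[str] = field(default_factory=list)

    def used(self) -> int:
        return sum(len(m.split()) for m in self.messages)  # crude token estimate

    def add(self, message: str, summarize) -> "ContextWindow":
        """Append a message; hand off to a fresh window once the budget is exceeded."""
        self.messages.append(message)
        if self.used() <= self.limit:
            return self
        handoff = Handoff(summarize(self.messages), list(self.completed), list(self.invariants))
        return ContextWindow(self.limit, handoff.invariants, [handoff.summary], handoff.completed)
```

The fresh window starts with only the summary, while completed steps and invariants are carried over unchanged.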
39
+
40
+ ## Relevance
41
+
42
+ This is the authoritative definition of "harness" as used in the agent engineering community. It maps directly to disler's Pi extensions (subagent-widget, agent-team, agent-chain) and OpenDev's four-layer architecture. Our harness implementation should treat context as a managed resource with explicit handoff protocols.