@kontourai/flow-agents 0.1.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (418) hide show
  1. package/.githooks/pre-push +11 -0
  2. package/.github/workflows/ci.yml +210 -0
  3. package/.github/workflows/docs-pages.yml +52 -0
  4. package/.github/workflows/publish-npm.yml +104 -0
  5. package/AGENTS.md +26 -0
  6. package/CHANGELOG.md +66 -0
  7. package/CODE_OF_CONDUCT.md +25 -0
  8. package/CONTEXT.md +300 -0
  9. package/CONTRIBUTING.md +44 -0
  10. package/LICENSE +201 -0
  11. package/README.md +129 -0
  12. package/SECURITY.md +33 -0
  13. package/agent-cards/dev.json +19 -0
  14. package/agents/dev.json +127 -0
  15. package/agents/tool-code-reviewer.json +61 -0
  16. package/agents/tool-dependencies-updater.json +118 -0
  17. package/agents/tool-explore-config.json +92 -0
  18. package/agents/tool-explore-deps.json +92 -0
  19. package/agents/tool-explore-entry.json +92 -0
  20. package/agents/tool-explore-patterns.json +92 -0
  21. package/agents/tool-explore-structure.json +92 -0
  22. package/agents/tool-explore-tests.json +92 -0
  23. package/agents/tool-planner.json +57 -0
  24. package/agents/tool-playwright.json +145 -0
  25. package/agents/tool-security-reviewer.json +56 -0
  26. package/agents/tool-verifier.json +61 -0
  27. package/agents/tool-worker.json +58 -0
  28. package/build/src/cli/console-learning-projection.js +123 -0
  29. package/build/src/cli/docs-preview.js +39 -0
  30. package/build/src/cli/effective-backlog-settings.js +102 -0
  31. package/build/src/cli/export-bookmarks.js +38 -0
  32. package/build/src/cli/fixture-retirement-audit.js +140 -0
  33. package/build/src/cli/flow-kit.js +138 -0
  34. package/build/src/cli/import-bookmarks.js +50 -0
  35. package/build/src/cli/init.js +239 -0
  36. package/build/src/cli/instinct-cli.js +93 -0
  37. package/build/src/cli/promote-workflow-artifact.js +63 -0
  38. package/build/src/cli/publish-change-helper.js +154 -0
  39. package/build/src/cli/pull-work-provider.js +469 -0
  40. package/build/src/cli/runtime-adapter.js +23 -0
  41. package/build/src/cli/telemetry-doctor.js +221 -0
  42. package/build/src/cli/usage-feedback.js +443 -0
  43. package/build/src/cli/validate-hook-influence.js +152 -0
  44. package/build/src/cli/validate-source-tree.js +31 -0
  45. package/build/src/cli/validate-workflow-artifacts.js +486 -0
  46. package/build/src/cli/veritas-governance.js +262 -0
  47. package/build/src/cli/workflow-artifact-cleanup-audit.js +272 -0
  48. package/build/src/cli/workflow-sidecar.js +816 -0
  49. package/build/src/cli.js +89 -0
  50. package/build/src/flow-kit/validate.js +75 -0
  51. package/build/src/lib/args.js +45 -0
  52. package/build/src/lib/fs.js +62 -0
  53. package/build/src/lib/workflow-learning-projection.js +334 -0
  54. package/build/src/runtime-adapters.js +146 -0
  55. package/build/src/tools/build-universal-bundles.js +397 -0
  56. package/build/src/tools/common.js +56 -0
  57. package/build/src/tools/filter-installed-packs.js +132 -0
  58. package/build/src/tools/generate-context-map.js +198 -0
  59. package/build/src/tools/validate-package.js +64 -0
  60. package/build/src/tools/validate-source-tree.js +622 -0
  61. package/console.telemetry.json +176 -0
  62. package/context/base-rules.md +17 -0
  63. package/context/code-review-standards.md +62 -0
  64. package/context/coding-standards.md +42 -0
  65. package/context/common/orchestrators.md +12 -0
  66. package/context/common/subagents.md +28 -0
  67. package/context/contracts/artifact-contract.md +182 -0
  68. package/context/contracts/builder-kit-workflow-state-contract.md +319 -0
  69. package/context/contracts/delivery-contract.md +69 -0
  70. package/context/contracts/execution-contract.md +53 -0
  71. package/context/contracts/governance-adapter-contract.md +67 -0
  72. package/context/contracts/planning-contract.md +85 -0
  73. package/context/contracts/review-contract.md +104 -0
  74. package/context/contracts/sandbox-policy.md +52 -0
  75. package/context/contracts/verification-contract.md +134 -0
  76. package/context/contracts/work-item-contract.md +215 -0
  77. package/context/deferred/demo-mode.md +33 -0
  78. package/context/deferred/languages/go.md +31 -0
  79. package/context/deferred/languages/python.md +31 -0
  80. package/context/deferred/languages/typescript.md +34 -0
  81. package/context/deferred/parallelization.md +35 -0
  82. package/context/deferred/worktree-isolation.md +24 -0
  83. package/context/development-workflow.md +50 -0
  84. package/context/scripts/context-budget/budget-scan.sh +166 -0
  85. package/context/scripts/detect-tools.sh +3 -0
  86. package/context/scripts/discover-agents.sh +28 -0
  87. package/context/scripts/git-status.sh +49 -0
  88. package/context/scripts/hooks/config-protection.js +79 -0
  89. package/context/scripts/hooks/desktop-notify.sh +39 -0
  90. package/context/scripts/hooks/governance-audit.sh +135 -0
  91. package/context/scripts/hooks/lib/audit-transport.sh +40 -0
  92. package/context/scripts/hooks/lib/hook-flags.js +49 -0
  93. package/context/scripts/hooks/lib/patterns.sh +57 -0
  94. package/context/scripts/hooks/lib/resolve-formatter.js +80 -0
  95. package/context/scripts/hooks/post-edit-accumulator.js +66 -0
  96. package/context/scripts/hooks/pre-commit-quality.js +194 -0
  97. package/context/scripts/hooks/quality-gate.js +93 -0
  98. package/context/scripts/hooks/report-only-guard.js +21 -0
  99. package/context/scripts/hooks/run-hook.js +136 -0
  100. package/context/scripts/hooks/stop-format-typecheck.js +141 -0
  101. package/context/scripts/hooks/stop-goal-fit.js +337 -0
  102. package/context/scripts/hooks/workflow-steering.js +250 -0
  103. package/context/scripts/telemetry/console-presets.sh +14 -0
  104. package/context/scripts/telemetry/install-console-config.sh +214 -0
  105. package/context/scripts/telemetry/lib/config.sh +85 -0
  106. package/context/scripts/telemetry/lib/enrich.sh +115 -0
  107. package/context/scripts/telemetry/lib/redact.sh +22 -0
  108. package/context/scripts/telemetry/lib/session.sh +63 -0
  109. package/context/scripts/telemetry/lib/transport.sh +183 -0
  110. package/context/scripts/telemetry/lib/usage.sh +29 -0
  111. package/context/scripts/telemetry/sync-agents.sh +173 -0
  112. package/context/scripts/telemetry/telemetry.conf +23 -0
  113. package/context/scripts/telemetry/telemetry.sh +387 -0
  114. package/context/scripts/validate-package.sh +89 -0
  115. package/context/settings/backlog-provider-settings.json +54 -0
  116. package/context/templates/core/identity.md +26 -0
  117. package/context/templates/core/user.md +15 -0
  118. package/docs/_config.yml +15 -0
  119. package/docs/_layouts/default.html +87 -0
  120. package/docs/adr/0001-flow-agents-consumes-flow.md +77 -0
  121. package/docs/adr/0002-flow-kits-as-extension-unit.md +13 -0
  122. package/docs/adr/0003-flow-agents-coordinates-kits-and-adapters.md +13 -0
  123. package/docs/adr/0004-gates-expect-surface-claims.md +15 -0
  124. package/docs/adr/0005-kubernetes-inspired-resource-contracts.md +48 -0
  125. package/docs/adr/0006-typescript-first-source-policy.md +98 -0
  126. package/docs/agent-system-guidebook.md +391 -0
  127. package/docs/agent-usage-feedback-loop.md +351 -0
  128. package/docs/assets/favicon.svg +13 -0
  129. package/docs/assets/og-image.png +0 -0
  130. package/docs/assets/site.css +774 -0
  131. package/docs/assets/site.js +139 -0
  132. package/docs/configurable-workflow-routing.md +174 -0
  133. package/docs/context-map.md +145 -0
  134. package/docs/developer-architecture.md +145 -0
  135. package/docs/developer-hook-setup.md +61 -0
  136. package/docs/fixture-ownership.md +44 -0
  137. package/docs/flow-kit-repository-contract.md +180 -0
  138. package/docs/index.md +129 -0
  139. package/docs/kontour-resource-contract.md +358 -0
  140. package/docs/migrations.md +64 -0
  141. package/docs/north-star.md +322 -0
  142. package/docs/operating-layers.md +110 -0
  143. package/docs/repository-structure.md +132 -0
  144. package/docs/sandbox-policy.md +56 -0
  145. package/docs/skills-map.md +203 -0
  146. package/docs/standards-register.md +96 -0
  147. package/docs/veritas-integration.md +165 -0
  148. package/docs/work-item-adapters.md +72 -0
  149. package/docs/workflow-artifact-lifecycle.md +141 -0
  150. package/docs/workflow-eval-strategy.md +295 -0
  151. package/docs/workflow-shared-contracts.md +51 -0
  152. package/docs/workflow-usage-guide.md +443 -0
  153. package/evals/ARCHITECTURE.md +143 -0
  154. package/evals/CONVENTIONS.md +58 -0
  155. package/evals/README.md +128 -0
  156. package/evals/acceptance/run.sh +29 -0
  157. package/evals/acceptance/test_claude_harness.sh +242 -0
  158. package/evals/acceptance/test_codex_harness.sh +108 -0
  159. package/evals/acceptance/test_kiro_harness.sh +128 -0
  160. package/evals/cases/dev/404.html +97 -0
  161. package/evals/cases/dev/code-review.yaml +44 -0
  162. package/evals/cases/dev/dashboard.html +300 -0
  163. package/evals/cases/dev/deliver.yaml +66 -0
  164. package/evals/cases/dev/dependency-update.yaml +16 -0
  165. package/evals/cases/dev/explore.yaml +20 -0
  166. package/evals/cases/dev/index.html +370 -0
  167. package/evals/cases/dev/package-lock.json +28 -0
  168. package/evals/cases/dev/package.json +16 -0
  169. package/evals/cases/dev/plan-work.yaml +20 -0
  170. package/evals/cases/dev/promptfooconfig.yaml +666 -0
  171. package/evals/cases/dev/search-first.yaml +20 -0
  172. package/evals/cases/dev/tdd-workflow.yaml +48 -0
  173. package/evals/cases/dev/verify-work.yaml +44 -0
  174. package/evals/cases/dev/workflow.yaml +34 -0
  175. package/evals/ci/run-baseline.sh +283 -0
  176. package/evals/fixtures/backlog-provider-settings/global-default.json +44 -0
  177. package/evals/fixtures/backlog-provider-settings/project-override.json +53 -0
  178. package/evals/fixtures/builder-kit-workflow-state/baseline-freshness-resolution-hint.json +139 -0
  179. package/evals/fixtures/builder-kit-workflow-state/direct-primitive-stop.json +59 -0
  180. package/evals/fixtures/builder-kit-workflow-state/empty-board-route-shape.json +55 -0
  181. package/evals/fixtures/builder-kit-workflow-state/happy-path.json +71 -0
  182. package/evals/fixtures/builder-kit-workflow-state/mid-work-resume.json +80 -0
  183. package/evals/fixtures/builder-kit-workflow-state/missing-prestep-recovery.json +65 -0
  184. package/evals/fixtures/builder-kit-workflow-state/product-build-chaining.json +60 -0
  185. package/evals/fixtures/builder-kit-workflow-state/stale-continuation-requires-new-probe.json +57 -0
  186. package/evals/fixtures/console-learning-projection/artifacts/console-learning-correction/learning.json +50 -0
  187. package/evals/fixtures/console-learning-projection/artifacts/console-learning-open-route/learning.json +41 -0
  188. package/evals/fixtures/flow-kit-repository/invalid-absolute-path/kit.json +8 -0
  189. package/evals/fixtures/flow-kit-repository/invalid-asset-section/flows/review.flow.json +6 -0
  190. package/evals/fixtures/flow-kit-repository/invalid-asset-section/kit.json +11 -0
  191. package/evals/fixtures/flow-kit-repository/invalid-duplicate-flow/flows/review.flow.json +6 -0
  192. package/evals/fixtures/flow-kit-repository/invalid-duplicate-flow/kit.json +9 -0
  193. package/evals/fixtures/flow-kit-repository/invalid-id/flows/review.flow.json +6 -0
  194. package/evals/fixtures/flow-kit-repository/invalid-id/kit.json +8 -0
  195. package/evals/fixtures/flow-kit-repository/invalid-malformed-json/kit.json +8 -0
  196. package/evals/fixtures/flow-kit-repository/invalid-missing-flow/kit.json +8 -0
  197. package/evals/fixtures/flow-kit-repository/invalid-missing-id/flows/review.flow.json +6 -0
  198. package/evals/fixtures/flow-kit-repository/invalid-missing-id/kit.json +7 -0
  199. package/evals/fixtures/flow-kit-repository/invalid-missing-schema-version/flows/review.flow.json +6 -0
  200. package/evals/fixtures/flow-kit-repository/invalid-missing-schema-version/kit.json +7 -0
  201. package/evals/fixtures/flow-kit-repository/invalid-name/flows/review.flow.json +6 -0
  202. package/evals/fixtures/flow-kit-repository/invalid-name/kit.json +8 -0
  203. package/evals/fixtures/flow-kit-repository/invalid-schema-version/flows/review.flow.json +6 -0
  204. package/evals/fixtures/flow-kit-repository/invalid-schema-version/kit.json +8 -0
  205. package/evals/fixtures/flow-kit-repository/invalid-traversal/kit.json +8 -0
  206. package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/adapters/example.json +3 -0
  207. package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/assets/example.txt +1 -0
  208. package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/docs/README.md +3 -0
  209. package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/flows/runtime.flow.json +26 -0
  210. package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/kit-evals/example.json +3 -0
  211. package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/kit-skills/mixed/SKILL.md +3 -0
  212. package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/kit.json +44 -0
  213. package/evals/fixtures/flow-kit-repository/valid-local-kit/docs/README.md +3 -0
  214. package/evals/fixtures/flow-kit-repository/valid-local-kit/flows/review.flow.json +26 -0
  215. package/evals/fixtures/flow-kit-repository/valid-local-kit/kit.json +20 -0
  216. package/evals/fixtures/hook-influence/cases.json +336 -0
  217. package/evals/fixtures/pull-work-provider/github-issues.json +170 -0
  218. package/evals/fixtures/pull-work-wip-shepherding/global-wip-informs.json +43 -0
  219. package/evals/fixtures/pull-work-wip-shepherding/personal-wip-blocks.json +42 -0
  220. package/evals/fixtures/surface-trust/accepted-claim-trust-report.json +31 -0
  221. package/evals/fixtures/surface-trust/artifact-absent.json +19 -0
  222. package/evals/fixtures/surface-trust/integrity-mismatch-trust-report.json +32 -0
  223. package/evals/fixtures/surface-trust/missing-authority-trust-report.json +27 -0
  224. package/evals/fixtures/surface-trust/provider-absent.json +19 -0
  225. package/evals/fixtures/surface-trust/rejected-claim-trust-report.json +30 -0
  226. package/evals/fixtures/surface-trust/stale-claim-trust-snapshot.json +31 -0
  227. package/evals/fixtures/usage-feedback/sample-full.jsonl +11 -0
  228. package/evals/fixtures/usage-feedback/sample-outcomes.jsonl +1 -0
  229. package/evals/fixtures/veritas-governance-adapter/fake-veritas-pass.sh +18 -0
  230. package/evals/fixtures/veritas-governance-adapter/fake-veritas-secret-fail.sh +10 -0
  231. package/evals/fixtures/veritas-governance-adapter/fake-veritas-unconfigured.sh +4 -0
  232. package/evals/integration/test_bundle_install.sh +541 -0
  233. package/evals/integration/test_console_learning_projection.sh +192 -0
  234. package/evals/integration/test_context_map.sh +65 -0
  235. package/evals/integration/test_effective_backlog_settings.sh +58 -0
  236. package/evals/integration/test_fixture_retirement_audit.sh +58 -0
  237. package/evals/integration/test_flow_agents_statusline.sh +93 -0
  238. package/evals/integration/test_flow_kit_repository.sh +90 -0
  239. package/evals/integration/test_goal_fit_hook.sh +482 -0
  240. package/evals/integration/test_hook_category_behaviors.sh +190 -0
  241. package/evals/integration/test_hook_influence_cases.sh +69 -0
  242. package/evals/integration/test_local_flow_kit_install.sh +145 -0
  243. package/evals/integration/test_publish_change_helper.sh +176 -0
  244. package/evals/integration/test_pull_work_provider.sh +140 -0
  245. package/evals/integration/test_runtime_adapter_activation.sh +106 -0
  246. package/evals/integration/test_telemetry.sh +485 -0
  247. package/evals/integration/test_telemetry_doctor.sh +193 -0
  248. package/evals/integration/test_usage_feedback_dashboard.sh +169 -0
  249. package/evals/integration/test_usage_feedback_global.sh +117 -0
  250. package/evals/integration/test_usage_feedback_import.sh +227 -0
  251. package/evals/integration/test_usage_feedback_outcomes.sh +165 -0
  252. package/evals/integration/test_usage_feedback_report.sh +263 -0
  253. package/evals/integration/test_veritas_governance_adapter.sh +235 -0
  254. package/evals/integration/test_workflow_artifact_cleanup_audit.sh +287 -0
  255. package/evals/integration/test_workflow_artifacts.sh +1247 -0
  256. package/evals/integration/test_workflow_sidecar_writer.sh +2112 -0
  257. package/evals/integration/test_workflow_steering_hook.sh +337 -0
  258. package/evals/lib/assertions/delegated-to.js +40 -0
  259. package/evals/lib/assertions/max-tool-calls.js +15 -0
  260. package/evals/lib/assertions/no-write-tools.js +27 -0
  261. package/evals/lib/assertions/pass-at-k.js +39 -0
  262. package/evals/lib/assertions/telemetry-utils.js +105 -0
  263. package/evals/lib/assertions/tool-called.js +39 -0
  264. package/evals/lib/assertions/verify-after-fix.js +61 -0
  265. package/evals/lib/claude-judge.sh +40 -0
  266. package/evals/lib/claude-provider.sh +74 -0
  267. package/evals/lib/codex-judge.sh +39 -0
  268. package/evals/lib/codex-provider.sh +81 -0
  269. package/evals/lib/eval-dev.sh +5 -0
  270. package/evals/lib/eval-judge.sh +22 -0
  271. package/evals/lib/eval-provider.sh +26 -0
  272. package/evals/lib/eval-report.sh +73 -0
  273. package/evals/lib/kiro-dev.sh +4 -0
  274. package/evals/lib/kiro-judge.sh +17 -0
  275. package/evals/lib/kiro-provider.sh +62 -0
  276. package/evals/lib/node.sh +111 -0
  277. package/evals/promptfooconfig.yaml +70 -0
  278. package/evals/run.sh +309 -0
  279. package/evals/static/test_evidence_refs.sh +141 -0
  280. package/evals/static/test_package.sh +407 -0
  281. package/evals/static/test_repo_hooks.sh +68 -0
  282. package/evals/static/test_universal_bundles.sh +274 -0
  283. package/evals/static/test_workflow_skills.sh +1207 -0
  284. package/install.sh +64 -0
  285. package/integrations/veritas/flow-agents.adapter.json +138 -0
  286. package/integrations/veritas/flow-agents.authority-settings.json +26 -0
  287. package/integrations/veritas/flow-agents.repo-standards.json +82 -0
  288. package/kits/builder/flows/build.flow.json +218 -0
  289. package/kits/builder/flows/shape.flow.json +127 -0
  290. package/kits/builder/kit.json +19 -0
  291. package/kits/catalog.json +11 -0
  292. package/package.json +130 -0
  293. package/packaging/README.md +60 -0
  294. package/packaging/manifest.json +173 -0
  295. package/packaging/packs.json +69 -0
  296. package/powers/dependency-checker/POWER.md +20 -0
  297. package/powers/dependency-checker/mcp.json +20 -0
  298. package/powers/playwright/POWER.md +25 -0
  299. package/powers/playwright/mcp.json +12 -0
  300. package/prompts/code-audit.md +123 -0
  301. package/prompts/kcommit.md +88 -0
  302. package/schemas/backlog-provider-settings.schema.json +138 -0
  303. package/schemas/workflow-acceptance.schema.json +216 -0
  304. package/schemas/workflow-critique.schema.json +113 -0
  305. package/schemas/workflow-evidence.schema.json +357 -0
  306. package/schemas/workflow-handoff.schema.json +52 -0
  307. package/schemas/workflow-learning.schema.json +223 -0
  308. package/schemas/workflow-release.schema.json +172 -0
  309. package/schemas/workflow-state.schema.json +80 -0
  310. package/scripts/README.md +111 -0
  311. package/scripts/build-universal-bundles.js +3 -0
  312. package/scripts/check-content-boundary.cjs +99 -0
  313. package/scripts/context-budget/budget-scan.sh +166 -0
  314. package/scripts/detect-tools.sh +3 -0
  315. package/scripts/discover-agents.sh +28 -0
  316. package/scripts/effective-backlog-settings.js +2 -0
  317. package/scripts/filter-installed-packs.js +2 -0
  318. package/scripts/flow-kit.js +2 -0
  319. package/scripts/generate-context-map.js +2 -0
  320. package/scripts/git-status.sh +49 -0
  321. package/scripts/hooks/claude-hook-adapter.js +174 -0
  322. package/scripts/hooks/claude-telemetry-hook.js +115 -0
  323. package/scripts/hooks/codex-hook-adapter.js +176 -0
  324. package/scripts/hooks/codex-telemetry-hook.js +95 -0
  325. package/scripts/hooks/config-protection.js +79 -0
  326. package/scripts/hooks/desktop-notify.sh +39 -0
  327. package/scripts/hooks/governance-audit.sh +135 -0
  328. package/scripts/hooks/lib/audit-transport.sh +40 -0
  329. package/scripts/hooks/lib/hook-flags.js +49 -0
  330. package/scripts/hooks/lib/patterns.sh +57 -0
  331. package/scripts/hooks/lib/resolve-formatter.js +80 -0
  332. package/scripts/hooks/post-edit-accumulator.js +66 -0
  333. package/scripts/hooks/pre-commit-quality.js +194 -0
  334. package/scripts/hooks/quality-gate.js +93 -0
  335. package/scripts/hooks/report-only-guard.js +21 -0
  336. package/scripts/hooks/run-hook.js +136 -0
  337. package/scripts/hooks/stop-format-typecheck.js +141 -0
  338. package/scripts/hooks/stop-goal-fit.js +337 -0
  339. package/scripts/hooks/workflow-steering.js +250 -0
  340. package/scripts/install-codex-home.sh +106 -0
  341. package/scripts/package.json +3 -0
  342. package/scripts/promote-workflow-artifact.js +2 -0
  343. package/scripts/publish-change-helper.js +2 -0
  344. package/scripts/pull-work-provider.js +2 -0
  345. package/scripts/setup-repo-hooks.sh +8 -0
  346. package/scripts/statusline/flow-agents-statusline.js +157 -0
  347. package/scripts/telemetry/console-presets.sh +14 -0
  348. package/scripts/telemetry/install-console-config.sh +214 -0
  349. package/scripts/telemetry/lib/config.sh +85 -0
  350. package/scripts/telemetry/lib/enrich.sh +115 -0
  351. package/scripts/telemetry/lib/redact.sh +22 -0
  352. package/scripts/telemetry/lib/session.sh +63 -0
  353. package/scripts/telemetry/lib/transport.sh +183 -0
  354. package/scripts/telemetry/lib/usage.sh +29 -0
  355. package/scripts/telemetry/sync-agents.sh +173 -0
  356. package/scripts/telemetry/telemetry.conf +23 -0
  357. package/scripts/telemetry/telemetry.sh +387 -0
  358. package/scripts/usage-feedback.js +2 -0
  359. package/scripts/validate-hook-influence-cases.js +2 -0
  360. package/scripts/validate-package.sh +89 -0
  361. package/scripts/validate-source-tree.js +9 -0
  362. package/skills/agentic-engineering/SKILL.md +62 -0
  363. package/skills/browser-test/SKILL.md +51 -0
  364. package/skills/builder-shape/SKILL.md +76 -0
  365. package/skills/context-budget/SKILL.md +40 -0
  366. package/skills/deliver/SKILL.md +241 -0
  367. package/skills/dependency-update/SKILL.md +68 -0
  368. package/skills/design-probe/SKILL.md +107 -0
  369. package/skills/eval-rebuild/SKILL.md +39 -0
  370. package/skills/evidence-gate/SKILL.md +186 -0
  371. package/skills/execute-plan/SKILL.md +110 -0
  372. package/skills/explore/SKILL.md +137 -0
  373. package/skills/feedback-loop/SKILL.md +87 -0
  374. package/skills/fix-bug/SKILL.md +133 -0
  375. package/skills/frontend-design/SKILL.md +80 -0
  376. package/skills/github-cli/SKILL.md +63 -0
  377. package/skills/idea-to-backlog/SKILL.md +267 -0
  378. package/skills/knowledge-capture/SKILL.md +55 -0
  379. package/skills/learning-review/SKILL.md +115 -0
  380. package/skills/pickup-probe/SKILL.md +114 -0
  381. package/skills/plan-work/SKILL.md +176 -0
  382. package/skills/pull-work/SKILL.md +309 -0
  383. package/skills/release-readiness/SKILL.md +121 -0
  384. package/skills/review-work/SKILL.md +161 -0
  385. package/skills/search-first/SKILL.md +66 -0
  386. package/skills/tdd-workflow/SKILL.md +140 -0
  387. package/skills/verify-work/SKILL.md +109 -0
  388. package/src/cli/console-learning-projection.ts +140 -0
  389. package/src/cli/effective-backlog-settings.ts +99 -0
  390. package/src/cli/fixture-retirement-audit.ts +154 -0
  391. package/src/cli/flow-kit.ts +139 -0
  392. package/src/cli/init.ts +248 -0
  393. package/src/cli/promote-workflow-artifact.ts +64 -0
  394. package/src/cli/publish-change-helper.ts +143 -0
  395. package/src/cli/pull-work-provider.ts +481 -0
  396. package/src/cli/runtime-adapter.ts +24 -0
  397. package/src/cli/telemetry-doctor.ts +243 -0
  398. package/src/cli/usage-feedback.ts +418 -0
  399. package/src/cli/validate-hook-influence.ts +119 -0
  400. package/src/cli/validate-source-tree.ts +30 -0
  401. package/src/cli/validate-workflow-artifacts.ts +411 -0
  402. package/src/cli/veritas-governance.ts +322 -0
  403. package/src/cli/workflow-artifact-cleanup-audit.ts +281 -0
  404. package/src/cli/workflow-sidecar.ts +676 -0
  405. package/src/cli.ts +95 -0
  406. package/src/flow-kit/validate.ts +74 -0
  407. package/src/lib/args.ts +43 -0
  408. package/src/lib/fs.ts +62 -0
  409. package/src/lib/workflow-learning-projection.ts +491 -0
  410. package/src/runtime-adapters.ts +154 -0
  411. package/src/tools/build-universal-bundles.ts +366 -0
  412. package/src/tools/common.ts +61 -0
  413. package/src/tools/filter-installed-packs.ts +129 -0
  414. package/src/tools/generate-context-map.ts +199 -0
  415. package/src/tools/validate-package.ts +57 -0
  416. package/src/tools/validate-source-tree.ts +488 -0
  417. package/tsconfig.json +19 -0
  418. package/veritas.claims.json +6 -0
@@ -0,0 +1,11 @@
1
+ {
2
+ "schema_version": "1.0",
3
+ "kits": [
4
+ {
5
+ "id": "builder",
6
+ "name": "Builder Kit",
7
+ "path": "kits/builder",
8
+ "description": "Flow-backed shaping, planning, build, verification, merge readiness, pull request readiness, and learning workflows."
9
+ }
10
+ ]
11
+ }
package/package.json ADDED
@@ -0,0 +1,130 @@
1
+ {
2
+ "name": "@kontourai/flow-agents",
3
+ "version": "0.1.1",
4
+ "description": "Flow Agents — a Kontour product that applies Flow and Veritas discipline inside the agent tools you already use: Claude Code, Codex, Kiro, and GitHub Actions.",
5
+ "keywords": [
6
+ "agents",
7
+ "ai-agents",
8
+ "workflow",
9
+ "skills",
10
+ "claude-code",
11
+ "codex",
12
+ "kiro",
13
+ "evidence",
14
+ "process-transparency"
15
+ ],
16
+ "license": "Apache-2.0",
17
+ "publishConfig": {
18
+ "access": "public"
19
+ },
20
+ "type": "module",
21
+ "repository": {
22
+ "type": "git",
23
+ "url": "git+https://github.com/kontourai/flow-agents.git"
24
+ },
25
+ "engines": {
26
+ "node": ">=22"
27
+ },
28
+ "bin": {
29
+ "flow-agents": "build/src/cli.js",
30
+ "flow-agents-build-bundles": "build/src/cli.js",
31
+ "flow-agents-console-learning-projection": "build/src/cli.js",
32
+ "flow-agents-context-map": "build/src/cli.js",
33
+ "flow-agents-effective-backlog-settings": "build/src/cli.js",
34
+ "flow-agents-filter-installed-packs": "build/src/cli.js",
35
+ "flow-agents-flow-kit": "build/src/cli.js",
36
+ "flow-agents-fixture-retirement-audit": "build/src/cli.js",
37
+ "flow-agents-promote-workflow-artifact": "build/src/cli.js",
38
+ "flow-agents-publish-change": "build/src/cli.js",
39
+ "flow-agents-pull-work-provider": "build/src/cli.js",
40
+ "flow-agents-runtime-adapter": "build/src/cli.js",
41
+ "flow-agents-telemetry-doctor": "build/src/cli.js",
42
+ "flow-agents-usage-feedback": "build/src/cli.js",
43
+ "flow-agents-veritas-governance": "build/src/cli.js",
44
+ "flow-agents-validate-artifacts": "build/src/cli/validate-workflow-artifacts.js",
45
+ "flow-agents-validate-hook-influence": "build/src/cli.js",
46
+ "flow-agents-validate-source": "build/src/cli.js",
47
+ "flow-agents-workflow-artifact-cleanup-audit": "build/src/cli.js",
48
+ "flow-agents-workflow-sidecar": "build/src/cli/workflow-sidecar.js"
49
+ },
50
+ "files": [
51
+ ".github/",
52
+ ".githooks/",
53
+ "AGENTS.md",
54
+ "CHANGELOG.md",
55
+ "CODE_OF_CONDUCT.md",
56
+ "CONTEXT.md",
57
+ "CONTRIBUTING.md",
58
+ "SECURITY.md",
59
+ "agent-cards/",
60
+ "agents/",
61
+ "build/",
62
+ "console.telemetry.json",
63
+ "context/",
64
+ "docs/",
65
+ "evals/",
66
+ "install.sh",
67
+ "integrations/",
68
+ "kits/",
69
+ "packaging/",
70
+ "powers/",
71
+ "prompts/",
72
+ "schemas/",
73
+ "scripts/",
74
+ "skills/",
75
+ "src/",
76
+ "tsconfig.json",
77
+ "veritas.claims.json",
78
+ "!evals/cases/dev/node_modules/",
79
+ "!evals/results/",
80
+ "!**/.DS_Store",
81
+ "!**/.flow-agents/",
82
+ "!**/.surface/",
83
+ "!**/.telemetry/",
84
+ "!**/.veritas/",
85
+ "!**/node_modules/"
86
+ ],
87
+ "scripts": {
88
+ "build": "tsc -p tsconfig.json",
89
+ "typecheck": "tsc -p tsconfig.json --noEmit",
90
+ "validate:source": "npm run build --silent && node build/src/cli.js validate-source",
91
+ "validate:artifacts": "npm run build --silent && node build/src/cli/validate-workflow-artifacts.js",
92
+ "context-map": "npm run build --silent && node build/src/cli.js context-map",
93
+ "context-map:check": "npm run build --silent && node build/src/cli.js context-map --check",
94
+ "build:bundles": "npm run build --silent && node build/src/cli.js build-bundles",
95
+ "filter:packs": "npm run build --silent && node build/src/cli.js filter-installed-packs",
96
+ "validate:package": "npm run build --silent && node build/src/cli.js validate-package",
97
+ "workflow:sidecar": "npm run build --silent && node build/src/cli/workflow-sidecar.js",
98
+ "workflow:validate-artifacts": "npm run build --silent && node build/src/cli/validate-workflow-artifacts.js",
99
+ "flow-kit": "npm run build --silent && node build/src/cli.js flow-kit",
100
+ "effective-backlog-settings": "npm run build --silent && node build/src/cli.js effective-backlog-settings",
101
+ "pull-work-provider": "npm run build --silent && node build/src/cli.js pull-work-provider",
102
+ "publish-change": "npm run build --silent && node build/src/cli.js publish-change",
103
+ "promote-workflow-artifact": "npm run build --silent && node build/src/cli.js promote-workflow-artifact",
104
+ "usage-feedback": "npm run build --silent && node build/src/cli.js usage-feedback",
105
+ "runtime-adapter": "npm run build --silent && node build/src/cli.js runtime-adapter",
106
+ "telemetry-doctor": "npm run build --silent && node build/src/cli.js telemetry-doctor",
107
+ "veritas-governance": "npm run build --silent && node build/src/cli.js veritas-governance",
108
+ "validate:hook-influence": "npm run build --silent && node build/src/cli.js validate-hook-influence",
109
+ "workflow-artifact-cleanup-audit": "npm run build --silent && node build/src/cli.js workflow-artifact-cleanup-audit",
110
+ "fixture:retirement-audit": "npm run build --silent && node build/src/cli.js fixture-retirement-audit",
111
+ "setup:repo-hooks": "bash scripts/setup-repo-hooks.sh",
112
+ "validate:repo-hooks": "bash evals/static/test_repo_hooks.sh",
113
+ "eval": "bash evals/run.sh",
114
+ "eval:static": "bash evals/run.sh static",
115
+ "eval:integration": "bash evals/run.sh integration",
116
+ "eval:acceptance": "bash evals/run.sh acceptance",
117
+ "eval:llm": "bash evals/run.sh llm",
118
+ "eval:llm:codex": "bash evals/run.sh llm dev --runtime codex",
119
+ "eval:llm:claude": "bash evals/run.sh llm dev --runtime claude --judge-runtime codex",
120
+ "promptfoo": "PROMPTFOO_CONFIG_DIR=.promptfoo PROMPTFOO_DISABLE_WAL_MODE=true PROMPTFOO_DISABLE_TELEMETRY=true promptfoo",
121
+ "promptfoo:view": "PROMPTFOO_CONFIG_DIR=.promptfoo PROMPTFOO_DISABLE_WAL_MODE=true PROMPTFOO_DISABLE_TELEMETRY=true promptfoo view",
122
+ "check:content-boundary": "node scripts/check-content-boundary.cjs",
123
+ "prepack": "npm run build --silent && npm run validate:source --"
124
+ },
125
+ "devDependencies": {
126
+ "@types/node": "^22.19.19",
127
+ "promptfoo": "^0.121.15",
128
+ "typescript": "^6.0.3"
129
+ }
130
+ }
@@ -0,0 +1,60 @@
1
+ # Universal Packaging
2
+
3
+ This directory defines the cross-harness packaging layer for Flow Agents.
4
+
5
+ ## Canonical Source
6
+
7
+ The repo root stays canonical:
8
+
9
+ - `agents/` contains source Kiro-style agent specs: the `dev` workflow surface plus specialist `tool-*` agents.
10
+ - `agent-cards/` contains discovery metadata for routable orchestrators.
11
+ - `skills/`, `context/`, `powers/`, `prompts/`, `scripts/`, and `evals/` remain shared content.
12
+ - `src/` contains TypeScript CLI, runtime adapter, packaging, validation, context-map, and repository tooling source that compiles to `build/src/`.
13
+ - `packaging/manifest.json` describes target-specific copy rules, profile definitions, substitutions, and model/provider mappings.
14
+ - Generated bundles live under `dist/`, are intentionally untracked, and can be recreated at any time.
15
+
16
+ For the full source/generated/runtime inventory, see [Repository Structure](../docs/repository-structure.md).
17
+
18
+ ## Targets
19
+
20
+ - `dist/kiro/` keeps native Kiro JSON agents and rewrites path-bound config through the install token.
21
+ - `dist/claude-code/` exports `.claude/agents/*.md` and `.claude/skills/*/SKILL.md`.
22
+ - `dist/codex/` exports `.codex/agents/*.toml`, `.codex/skills/*/SKILL.md`, and profile config for `kdev` and its Bedrock variant.
23
+
24
+ All targets also receive shared canonical directories where supported: `context/`, `powers/`, `prompts/`, `scripts/`, and `evals/`.
25
+
26
+ `docs/` and `evals/` are intentionally included in generated bundles today. `docs/` gives installed agents durable local reference material, and `evals/` provides install-time and runtime smoke tests for the exported bundle. If bundle size becomes a product constraint, prune these through `packaging/manifest.json` and update install tests rather than deleting generated output by hand.
27
+
28
+ ## Generated And Runtime Boundaries
29
+
30
+ `dist/` is a generated export surface, not the source of truth. Installed runtime directories such as `.codex/` and `.claude/` are also not source. They are created from the generated target bundle and installer scripts. If generated or installed hook config is wrong, fix the canonical source, rebuild `dist/`, and reinstall the runtime config.
31
+
32
+ Runtime workflow state under `.flow-agents/<slug>/` is local working memory. Packaging should copy canonical workflow contracts and skills, but it should not publish local task artifacts as product source. Durable outcomes must be promoted into docs, source, schemas, or provider records before merge.
33
+
34
+ ## Validation And Build
35
+
36
+ Run the source validator before rebuilding:
37
+
38
+ ```bash
39
+ npm run validate:source --
40
+ ```
41
+
42
+ Rebuild every target bundle:
43
+
44
+ ```bash
45
+ npm run build:bundles --
46
+ ```
47
+
48
+ Run static package checks after rebuilding:
49
+
50
+ ```bash
51
+ bash evals/run.sh static
52
+ ```
53
+
54
+ For telemetry and shell integration coverage:
55
+
56
+ ```bash
57
+ bash evals/run.sh integration
58
+ ```
59
+
60
+ The builder is stdlib-only so the package stays dependency-free.
@@ -0,0 +1,173 @@
1
+ {
2
+ "canonical_copy_dirs": [
3
+ "agent-cards",
4
+ "context",
5
+ "docs",
6
+ "evals",
7
+ "kits",
8
+ "packaging",
9
+ "powers",
10
+ "prompts",
11
+ "scripts",
12
+ "schemas",
13
+ "skills"
14
+ ],
15
+ "root_copy_files": [
16
+ "console.telemetry.json"
17
+ ],
18
+ "optional_copy_dirs": [],
19
+ "source_root_aliases": [
20
+ "~/.flow-agents",
21
+ "$HOME/.flow-agents"
22
+ ],
23
+ "kiro": {
24
+ "path_token": "__KIRO_PACKAGE_ROOT__",
25
+ "install_default": "~/.flow-agents"
26
+ },
27
+ "claude_code": {
28
+ "task_dir": ".flow-agents",
29
+ "permissions": {
30
+ "defaultMode": "auto"
31
+ },
32
+ "skipDangerousModePermissionPrompt": true
33
+ },
34
+ "codex": {
35
+ "task_dir": ".flow-agents",
36
+ "settings": {
37
+ "approvals_reviewer": "auto_review"
38
+ },
39
+ "tui": {
40
+ "status_line": [
41
+ "model-with-reasoning",
42
+ "project-name",
43
+ "git-branch",
44
+ "task-progress",
45
+ "context-remaining",
46
+ "run-state"
47
+ ]
48
+ },
49
+ "features": {
50
+ "multi_agent": true,
51
+ "js_repl": true,
52
+ "hooks": true,
53
+ "apps": false,
54
+ "plugins": false
55
+ },
56
+ "excluded_agents": [
57
+ "dev"
58
+ ],
59
+ "profiles": {
60
+ "kdev": {
61
+ "model": "gpt-5.5",
62
+ "model_reasoning_effort": "medium",
63
+ "approval_policy": "on-request",
64
+ "sandbox_mode": "workspace-write",
65
+ "web_search": "cached",
66
+ "developer_instructions": "You are operating in Flow Agents dev mode. You write and modify code, validate it works, and deliver clean results. You own the code; specialist tool-* subagents provide bounded context, review, verification, or parallel implementation support. Use deliver for feature work, fix-bug for bug fixes, plan-work for planning, execute-plan for approved plans, review-work for critique, verify-work for evidence collection, search-first for unfamiliar external APIs, and context-budget when context grows. When a workflow skill or shared contract names a delegate, that delegate is a required gate, not an optional optimization: plan-work must delegate to tool-planner, execute-plan to tool-worker, review-work to tool-code-reviewer and conditionally tool-security-reviewer, verify-work to tool-verifier, and browser/UI verification to tool-playwright when those agents are available. Attempt required delegation even in read-only or partially blocked environments; if blocked, report the blocked gate as NOT_VERIFIED or incomplete with evidence instead of replacing it with a local summary. In progress and final reports, name exact delegate ids such as tool-planner, tool-worker, tool-code-reviewer, tool-security-reviewer, tool-verifier, and tool-playwright so text-only evals remain auditable when telemetry is unavailable. In Codex, required specialist delegation must pass the role through the spawn tool, for example agent_type=tool-worker; do not spawn unnamed/default subagents for Flow Agents worker, planner, reviewer, verifier, or Playwright gates."
67
+ },
68
+ "kdev-br": {
69
+ "model_provider": "amazon-bedrock",
70
+ "model": "amazon.nova-pro-v1:0",
71
+ "model_reasoning_effort": "medium",
72
+ "approval_policy": "on-request",
73
+ "sandbox_mode": "workspace-write",
74
+ "web_search": "cached",
75
+ "developer_instructions": "You are operating in Flow Agents dev mode on Amazon Bedrock. You write and modify code, validate it works, and deliver clean results. You own the code; specialist tool-* subagents provide bounded context, review, verification, or parallel implementation support. Use deliver for feature work, fix-bug for bug fixes, plan-work for planning, execute-plan for approved plans, review-work for critique, verify-work for evidence collection, search-first for unfamiliar external APIs, and context-budget when context grows. When a workflow skill or shared contract names a delegate, that delegate is a required gate, not an optional optimization: plan-work must delegate to tool-planner, execute-plan to tool-worker, review-work to tool-code-reviewer and conditionally tool-security-reviewer, verify-work to tool-verifier, and browser/UI verification to tool-playwright when those agents are available. Attempt required delegation even in read-only or partially blocked environments; if blocked, report the blocked gate as NOT_VERIFIED or incomplete with evidence instead of replacing it with a local summary. In progress and final reports, name exact delegate ids such as tool-planner, tool-worker, tool-code-reviewer, tool-security-reviewer, tool-verifier, and tool-playwright so text-only evals remain auditable when telemetry is unavailable. In Codex, required specialist delegation must pass the role through the spawn tool, for example agent_type=tool-worker; do not spawn unnamed/default subagents for Flow Agents worker, planner, reviewer, verifier, or Playwright gates."
76
+ }
77
+ },
78
+ "model_providers": {
79
+ "amazon-bedrock": {
80
+ "aws": {
81
+ "region": "us-east-1"
82
+ }
83
+ }
84
+ }
85
+ },
86
+ "tool_name_map": {
87
+ "read": "Read",
88
+ "imageRead": "Read",
89
+ "code": "Edit",
90
+ "write": "Write",
91
+ "shell": "Bash",
92
+ "ls": "Bash",
93
+ "grep": "Grep",
94
+ "glob": "Glob",
95
+ "subagent": "Task",
96
+ "web_fetch": "WebFetch",
97
+ "web_search": "WebSearch"
98
+ },
99
+ "claude_model_map": {
100
+ "claude-opus": "opus",
101
+ "claude-sonnet": "sonnet",
102
+ "default": "sonnet"
103
+ },
104
+ "codex_model_map": {
105
+ "claude-opus": "gpt-5.5",
106
+ "claude-sonnet": "gpt-5.5",
107
+ "kimi-k2.5": "gpt-5.3-codex-spark",
108
+ "agi-nova-beta-1m": "gpt-5.4-mini",
109
+ "default": "gpt-5.4-mini"
110
+ },
111
+ "codex_reasoning_map": {
112
+ "claude-opus": "high",
113
+ "claude-sonnet": "low",
114
+ "kimi-k2.5": "low",
115
+ "agi-nova-beta-1m": "medium",
116
+ "default": "medium"
117
+ },
118
+ "target_substitutions": {
119
+ "common": [],
120
+ "claude_code": [
121
+ {
122
+ "from": "Claude Code",
123
+ "to": "Claude Code"
124
+ },
125
+ {
126
+ "from": "todo tool",
127
+ "to": "todo tool"
128
+ },
129
+ {
130
+ "from": "delegate to a specialist agent",
131
+ "to": "delegate to a specialist agent"
132
+ },
133
+ {
134
+ "from": "run shell commands",
135
+ "to": "run shell commands"
136
+ },
137
+ {
138
+ "from": "read files",
139
+ "to": "read files"
140
+ },
141
+ {
142
+ "from": "write files",
143
+ "to": "write files"
144
+ }
145
+ ],
146
+ "codex": [
147
+ {
148
+ "from": "Claude Code",
149
+ "to": "Codex"
150
+ },
151
+ {
152
+ "from": "todo tool",
153
+ "to": "todo tool"
154
+ },
155
+ {
156
+ "from": "delegate to a specialist agent",
157
+ "to": "delegate to a native subagent"
158
+ },
159
+ {
160
+ "from": "run shell commands",
161
+ "to": "run shell commands"
162
+ },
163
+ {
164
+ "from": "read files",
165
+ "to": "read files"
166
+ },
167
+ {
168
+ "from": "write files",
169
+ "to": "write files"
170
+ }
171
+ ]
172
+ }
173
+ }
@@ -0,0 +1,69 @@
1
+ {
2
+ "schema_version": "1.0",
3
+ "packs": [
4
+ {
5
+ "name": "core",
6
+ "default": true,
7
+ "description": "Small default surface for reliable coding and workflow execution.",
8
+ "skills": [
9
+ "search-first",
10
+ "plan-work",
11
+ "execute-plan",
12
+ "review-work",
13
+ "verify-work",
14
+ "evidence-gate",
15
+ "feedback-loop",
16
+ "knowledge-capture",
17
+ "browser-test"
18
+ ],
19
+ "agents": [
20
+ "tool-planner",
21
+ "tool-worker",
22
+ "tool-verifier",
23
+ "tool-code-reviewer",
24
+ "tool-playwright"
25
+ ],
26
+ "powers": [
27
+ "playwright"
28
+ ]
29
+ },
30
+ {
31
+ "name": "development",
32
+ "default": false,
33
+ "description": "Development workflow depth for backlog, release, dependency, GitHub, TDD, and frontend work.",
34
+ "skills": [
35
+ "builder-shape",
36
+ "idea-to-backlog",
37
+ "pull-work",
38
+ "design-probe",
39
+ "pickup-probe",
40
+ "deliver",
41
+ "fix-bug",
42
+ "tdd-workflow",
43
+ "release-readiness",
44
+ "learning-review",
45
+ "dependency-update",
46
+ "eval-rebuild",
47
+ "explore",
48
+ "github-cli",
49
+ "frontend-design",
50
+ "agentic-engineering",
51
+ "context-budget"
52
+ ],
53
+ "agents": [
54
+ "dev",
55
+ "tool-dependencies-updater",
56
+ "tool-security-reviewer",
57
+ "tool-explore-config",
58
+ "tool-explore-deps",
59
+ "tool-explore-entry",
60
+ "tool-explore-patterns",
61
+ "tool-explore-structure",
62
+ "tool-explore-tests"
63
+ ],
64
+ "powers": [
65
+ "dependency-checker"
66
+ ]
67
+ }
68
+ ]
69
+ }
@@ -0,0 +1,20 @@
1
+ ---
2
+ name: "dependency-checker"
3
+ displayName: "Dependency Version Checker"
4
+ description: "Check latest versions, identify outdated packages, and find security advisories across npm, PyPI, Cargo, Maven, Go, NuGet, Ruby, PHP, Swift, Dart, Docker, Helm, Terraform, and GitHub Actions"
5
+ keywords: ["dependencies", "outdated", "update", "upgrade", "version", "security", "advisory", "cve", "vulnerability", "npm", "pypi", "cargo", "maven", "package"]
6
+ ---
7
+
8
+ # Dependency Version Checker
9
+
10
+ Check package versions and security advisories across all major ecosystems.
11
+
12
+ ## Available Tools
13
+ - `package-version-check` — Batch version lookups across ecosystems
14
+ - `package-registry` — GitHub Security Advisory search
15
+
16
+ ## Workflow
17
+ 1. Scan project for dependency manifests (package.json, requirements.txt, Cargo.toml, etc.)
18
+ 2. Use `package-version-check` tools to batch-check versions by ecosystem
19
+ 3. Use `package-registry` to search GitHub Security Advisories for outdated packages
20
+ 4. Report grouped by risk: CRITICAL (CVEs), MAJOR (breaking), MINOR (safe updates)
@@ -0,0 +1,20 @@
1
+ {
2
+ "mcpServers": {
3
+ "package-registry": {
4
+ "args": [
5
+ "package-registry-mcp"
6
+ ],
7
+ "command": "npx"
8
+ },
9
+ "package-version-check": {
10
+ "args": [
11
+ "package-version-check-mcp",
12
+ "--mode=stdio"
13
+ ],
14
+ "command": "uvx",
15
+ "env": {
16
+ "GITHUB_PAT": "${GITHUB_TOKEN}"
17
+ }
18
+ }
19
+ }
20
+ }
@@ -0,0 +1,25 @@
1
+ ---
2
+ name: "playwright"
3
+ displayName: "Playwright Browser"
4
+ description: "Browser automation and testing via Playwright - load pages, test navigation, check accessibility via structured snapshots, evaluate scripts, fill forms, take screenshots"
5
+ keywords: ["playwright", "browser", "accessibility", "dom", "screenshot", "debug", "frontend", "testing", "automation", "web"]
6
+ ---
7
+
8
+ # Playwright Browser
9
+
10
+ Provides browser automation and testing through Microsoft's Playwright MCP server. Uses structured accessibility snapshots instead of pixel-based approaches for efficient, LLM-friendly browser interaction.
11
+
12
+ ## Available Tools
13
+ - All tools from the `playwright` MCP server (navigate, click, type, snapshot, screenshot, etc.)
14
+
15
+ ## When to Use
16
+ - Loading and navigating real web pages
17
+ - Testing frontend accessibility via structured accessibility snapshots (ARIA roles, tab order)
18
+ - Evaluating JavaScript in a live browser context
19
+ - Taking screenshots for visual verification
20
+ - Filling forms, clicking buttons, testing user flows
21
+ - Generating and validating Playwright test code
22
+
23
+ ## NOT For
24
+ - General web search or fetching page content for research
25
+ - Scraping data from websites
@@ -0,0 +1,12 @@
1
+ {
2
+ "mcpServers": {
3
+ "playwright": {
4
+ "args": [
5
+ "@playwright/mcp@latest",
6
+ "--headless",
7
+ "--isolated"
8
+ ],
9
+ "command": "npx"
10
+ }
11
+ }
12
+ }
@@ -0,0 +1,123 @@
1
+ ---
2
+ name: code-audit
3
+ description: "Iterative codebase audit. First run: broad sweep. Subsequent runs: deep dive into specific dimensions or files."
4
+ ---
5
+
6
+ # Codebase Audit — {depth:broad} | Focus: {focus:all}
7
+
8
+ Audit this repository at the requested depth and focus. Every finding must reference specific files and line ranges.
9
+
10
+ ## Depth Modes
11
+
12
+ **broad** (default) — Scan the full codebase across all dimensions. Produce a summary-level report with the top findings per dimension. Don't read every file — sample key files, entry points, and the largest/most-changed modules. Goal: identify WHERE the problems are so subsequent deep dives are targeted.
13
+
14
+ **deep** — Exhaustive analysis of the specified focus area. Read every relevant file. Trace call chains, map dependencies, and propose specific refactors with before/after sketches. If focus is "all", pick the highest-severity dimension from a prior broad scan and go deep on that.
15
+
16
+ ## Focus Areas
17
+
18
+ When focus is not "all", restrict analysis to the specified dimension:
19
+
20
+ - **dry** — Duplicated logic, repeated patterns, copy-pasted code, magic values that should be constants
21
+ - **abstractions** — SRP violations, missing interfaces, tight coupling, business logic in wrong layers, god objects
22
+ - **testability** — Hardcoded dependencies, side effects, missing DI, logic buried in framework code, test organization
23
+ - **errors** — Swallowed exceptions, missing error handling on I/O/network, inconsistent error formats, missing retries/timeouts
24
+ - **naming** — Misleading names, overgrown files, dead code, TODO/FIXME/HACK markers, structural inconsistencies
25
+ - **security** — Hardcoded secrets, missing input validation, overly permissive access, sensitive data in logs
26
+ - **dependencies** — Outdated/unused deps, missing lock files, build script complexity
27
+
28
+ ## Scope Narrowing
29
+
30
+ {scope?}
31
+
32
+ If a scope is provided above (file paths, directories, or module names), restrict the audit to those areas only. Otherwise audit the full repository.
33
+
34
+ ## Phase 1: Orient
35
+
36
+ Before analyzing, understand the codebase:
37
+
38
+ 1. Map directory structure, tech stack, frameworks, languages
39
+ 2. Identify entry points (main files, API routes, CLI commands, exports)
40
+ 3. Read key config files (package.json, pyproject.toml, Cargo.toml, Dockerfile, etc.)
41
+ 4. Understand testing setup (frameworks, coverage, test locations)
42
+ 5. Check for linting/formatting config
43
+
44
+ **If depth is deep**: Also read the specific files in the focus area thoroughly. Trace imports and call chains. Understand how the focused code interacts with the rest of the system.
45
+
46
+ Summarize your understanding before proceeding.
47
+
48
+ ## Phase 2: Analyze
49
+
50
+ ### Broad Mode
51
+
52
+ For each applicable dimension, scan for the most significant issues. Limit to 3-5 findings per dimension. For each:
53
+
54
+ ```
55
+ ### [Finding Title] — Severity: HIGH | MEDIUM | LOW
56
+
57
+ **Files:** `path/to/file.ext` (lines X-Y)
58
+ **Problem:** What's wrong and why it matters.
59
+ **Recommendation:** One-line concrete fix direction.
60
+ ```
61
+
62
+ ### Deep Mode
63
+
64
+ For the focused dimension, provide exhaustive analysis:
65
+
66
+ ```
67
+ ### [Finding Title] — Severity: HIGH | MEDIUM | LOW
68
+
69
+ **Files:** `path/to/file.ext` (lines X-Y), `path/to/other.ext` (lines A-B)
70
+
71
+ **Problem:** Detailed explanation — what's wrong, why it happened, what breaks or degrades because of it.
72
+
73
+ **Current pattern:**
74
+ [Show the actual problematic code or describe the pattern concretely]
75
+
76
+ **Proposed refactor:**
77
+ [Show the target structure — new files, interfaces, function signatures. Not full implementations, but enough to be unambiguous about the design.]
78
+
79
+ **Migration path:** Steps to get from current to proposed without breaking things.
80
+
81
+ **Effort:** S / M / L
82
+ **Risk:** What could go wrong during the refactor.
83
+ ```
84
+
85
+ ## Phase 3: Report
86
+
87
+ ### Broad Mode Report
88
+
89
+ 1. **Heatmap** — Which dimensions have the most/worst findings? Rank them.
90
+ 2. **Top 5 highest-impact changes** — Best ROI refactors across all dimensions.
91
+ 3. **Recommended deep dive order** — Which focus area to audit next and why.
92
+ 4. **What's done well** — Patterns worth preserving.
93
+
94
+ ### Deep Mode Report
95
+
96
+ 1. **All findings** for the focused dimension, ordered by severity then effort.
97
+ 2. **Dependency graph** — How do the findings relate? Which refactors unlock others?
98
+ 3. **Suggested implementation order** — Sequence the fixes to minimize risk and maximize incremental value.
99
+ 4. **What's done well** in this dimension — patterns to extend, not just problems to fix.
100
+
101
+ ## Severity Guide
102
+
103
+ - **HIGH** — Actively causes bugs, security issues, or makes the codebase significantly harder to maintain
104
+ - **MEDIUM** — Creates friction, slows development, or will become a problem as the codebase grows
105
+ - **LOW** — Cleanup opportunity, style improvement, or minor inconsistency
106
+
107
+ ## Rules
108
+
109
+ - Be specific. "This function is too long" is useless. "This function handles parsing, validation, and persistence — split into three" is useful.
110
+ - Don't nitpick formatting if a formatter/linter is configured — focus on logic and design.
111
+ - Don't suggest adding tests — focus on making code testable. The user will decide when to write tests.
112
+ - Respect the existing tech stack. Don't suggest rewrites in different languages/frameworks.
113
+ - In broad mode, prioritize breadth over depth. In deep mode, prioritize thoroughness.
114
+ - If you've seen a prior broad audit in this conversation, reference those findings and go deeper — don't repeat the surface-level observations.
115
+ - If the codebase is small, say so and keep the audit proportional.
116
+
117
+ ## Iterative Usage
118
+
119
+ Typical progression:
120
+ 1. `@code-audit` — broad sweep, identify hotspots
121
+ 2. `@code-audit deep abstractions` — deep dive into worst dimension
122
+ 3. `@code-audit deep testability src/agents/` — targeted deep dive on specific code
123
+ 4. Repeat until satisfied