npm - @kontourai/flow-agents - Versions diffs - 0.1.1 - Mend

@kontourai/flow-agents 0.1.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (418) hide show

package/.githooks/pre-push +11 -0
package/.github/workflows/ci.yml +210 -0
package/.github/workflows/docs-pages.yml +52 -0
package/.github/workflows/publish-npm.yml +104 -0
package/AGENTS.md +26 -0
package/CHANGELOG.md +66 -0
package/CODE_OF_CONDUCT.md +25 -0
package/CONTEXT.md +300 -0
package/CONTRIBUTING.md +44 -0
package/LICENSE +201 -0
package/README.md +129 -0
package/SECURITY.md +33 -0
package/agent-cards/dev.json +19 -0
package/agents/dev.json +127 -0
package/agents/tool-code-reviewer.json +61 -0
package/agents/tool-dependencies-updater.json +118 -0
package/agents/tool-explore-config.json +92 -0
package/agents/tool-explore-deps.json +92 -0
package/agents/tool-explore-entry.json +92 -0
package/agents/tool-explore-patterns.json +92 -0
package/agents/tool-explore-structure.json +92 -0
package/agents/tool-explore-tests.json +92 -0
package/agents/tool-planner.json +57 -0
package/agents/tool-playwright.json +145 -0
package/agents/tool-security-reviewer.json +56 -0
package/agents/tool-verifier.json +61 -0
package/agents/tool-worker.json +58 -0
package/build/src/cli/console-learning-projection.js +123 -0
package/build/src/cli/docs-preview.js +39 -0
package/build/src/cli/effective-backlog-settings.js +102 -0
package/build/src/cli/export-bookmarks.js +38 -0
package/build/src/cli/fixture-retirement-audit.js +140 -0
package/build/src/cli/flow-kit.js +138 -0
package/build/src/cli/import-bookmarks.js +50 -0
package/build/src/cli/init.js +239 -0
package/build/src/cli/instinct-cli.js +93 -0
package/build/src/cli/promote-workflow-artifact.js +63 -0
package/build/src/cli/publish-change-helper.js +154 -0
package/build/src/cli/pull-work-provider.js +469 -0
package/build/src/cli/runtime-adapter.js +23 -0
package/build/src/cli/telemetry-doctor.js +221 -0
package/build/src/cli/usage-feedback.js +443 -0
package/build/src/cli/validate-hook-influence.js +152 -0
package/build/src/cli/validate-source-tree.js +31 -0
package/build/src/cli/validate-workflow-artifacts.js +486 -0
package/build/src/cli/veritas-governance.js +262 -0
package/build/src/cli/workflow-artifact-cleanup-audit.js +272 -0
package/build/src/cli/workflow-sidecar.js +816 -0
package/build/src/cli.js +89 -0
package/build/src/flow-kit/validate.js +75 -0
package/build/src/lib/args.js +45 -0
package/build/src/lib/fs.js +62 -0
package/build/src/lib/workflow-learning-projection.js +334 -0
package/build/src/runtime-adapters.js +146 -0
package/build/src/tools/build-universal-bundles.js +397 -0
package/build/src/tools/common.js +56 -0
package/build/src/tools/filter-installed-packs.js +132 -0
package/build/src/tools/generate-context-map.js +198 -0
package/build/src/tools/validate-package.js +64 -0
package/build/src/tools/validate-source-tree.js +622 -0
package/console.telemetry.json +176 -0
package/context/base-rules.md +17 -0
package/context/code-review-standards.md +62 -0
package/context/coding-standards.md +42 -0
package/context/common/orchestrators.md +12 -0
package/context/common/subagents.md +28 -0
package/context/contracts/artifact-contract.md +182 -0
package/context/contracts/builder-kit-workflow-state-contract.md +319 -0
package/context/contracts/delivery-contract.md +69 -0
package/context/contracts/execution-contract.md +53 -0
package/context/contracts/governance-adapter-contract.md +67 -0
package/context/contracts/planning-contract.md +85 -0
package/context/contracts/review-contract.md +104 -0
package/context/contracts/sandbox-policy.md +52 -0
package/context/contracts/verification-contract.md +134 -0
package/context/contracts/work-item-contract.md +215 -0
package/context/deferred/demo-mode.md +33 -0
package/context/deferred/languages/go.md +31 -0
package/context/deferred/languages/python.md +31 -0
package/context/deferred/languages/typescript.md +34 -0
package/context/deferred/parallelization.md +35 -0
package/context/deferred/worktree-isolation.md +24 -0
package/context/development-workflow.md +50 -0
package/context/scripts/context-budget/budget-scan.sh +166 -0
package/context/scripts/detect-tools.sh +3 -0
package/context/scripts/discover-agents.sh +28 -0
package/context/scripts/git-status.sh +49 -0
package/context/scripts/hooks/config-protection.js +79 -0
package/context/scripts/hooks/desktop-notify.sh +39 -0
package/context/scripts/hooks/governance-audit.sh +135 -0
package/context/scripts/hooks/lib/audit-transport.sh +40 -0
package/context/scripts/hooks/lib/hook-flags.js +49 -0
package/context/scripts/hooks/lib/patterns.sh +57 -0
package/context/scripts/hooks/lib/resolve-formatter.js +80 -0
package/context/scripts/hooks/post-edit-accumulator.js +66 -0
package/context/scripts/hooks/pre-commit-quality.js +194 -0
package/context/scripts/hooks/quality-gate.js +93 -0
package/context/scripts/hooks/report-only-guard.js +21 -0
package/context/scripts/hooks/run-hook.js +136 -0
package/context/scripts/hooks/stop-format-typecheck.js +141 -0
package/context/scripts/hooks/stop-goal-fit.js +337 -0
package/context/scripts/hooks/workflow-steering.js +250 -0
package/context/scripts/telemetry/console-presets.sh +14 -0
package/context/scripts/telemetry/install-console-config.sh +214 -0
package/context/scripts/telemetry/lib/config.sh +85 -0
package/context/scripts/telemetry/lib/enrich.sh +115 -0
package/context/scripts/telemetry/lib/redact.sh +22 -0
package/context/scripts/telemetry/lib/session.sh +63 -0
package/context/scripts/telemetry/lib/transport.sh +183 -0
package/context/scripts/telemetry/lib/usage.sh +29 -0
package/context/scripts/telemetry/sync-agents.sh +173 -0
package/context/scripts/telemetry/telemetry.conf +23 -0
package/context/scripts/telemetry/telemetry.sh +387 -0
package/context/scripts/validate-package.sh +89 -0
package/context/settings/backlog-provider-settings.json +54 -0
package/context/templates/core/identity.md +26 -0
package/context/templates/core/user.md +15 -0
package/docs/_config.yml +15 -0
package/docs/_layouts/default.html +87 -0
package/docs/adr/0001-flow-agents-consumes-flow.md +77 -0
package/docs/adr/0002-flow-kits-as-extension-unit.md +13 -0
package/docs/adr/0003-flow-agents-coordinates-kits-and-adapters.md +13 -0
package/docs/adr/0004-gates-expect-surface-claims.md +15 -0
package/docs/adr/0005-kubernetes-inspired-resource-contracts.md +48 -0
package/docs/adr/0006-typescript-first-source-policy.md +98 -0
package/docs/agent-system-guidebook.md +391 -0
package/docs/agent-usage-feedback-loop.md +351 -0
package/docs/assets/favicon.svg +13 -0
package/docs/assets/og-image.png +0 -0
package/docs/assets/site.css +774 -0
package/docs/assets/site.js +139 -0
package/docs/configurable-workflow-routing.md +174 -0
package/docs/context-map.md +145 -0
package/docs/developer-architecture.md +145 -0
package/docs/developer-hook-setup.md +61 -0
package/docs/fixture-ownership.md +44 -0
package/docs/flow-kit-repository-contract.md +180 -0
package/docs/index.md +129 -0
package/docs/kontour-resource-contract.md +358 -0
package/docs/migrations.md +64 -0
package/docs/north-star.md +322 -0
package/docs/operating-layers.md +110 -0
package/docs/repository-structure.md +132 -0
package/docs/sandbox-policy.md +56 -0
package/docs/skills-map.md +203 -0
package/docs/standards-register.md +96 -0
package/docs/veritas-integration.md +165 -0
package/docs/work-item-adapters.md +72 -0
package/docs/workflow-artifact-lifecycle.md +141 -0
package/docs/workflow-eval-strategy.md +295 -0
package/docs/workflow-shared-contracts.md +51 -0
package/docs/workflow-usage-guide.md +443 -0
package/evals/ARCHITECTURE.md +143 -0
package/evals/CONVENTIONS.md +58 -0
package/evals/README.md +128 -0
package/evals/acceptance/run.sh +29 -0
package/evals/acceptance/test_claude_harness.sh +242 -0
package/evals/acceptance/test_codex_harness.sh +108 -0
package/evals/acceptance/test_kiro_harness.sh +128 -0
package/evals/cases/dev/404.html +97 -0
package/evals/cases/dev/code-review.yaml +44 -0
package/evals/cases/dev/dashboard.html +300 -0
package/evals/cases/dev/deliver.yaml +66 -0
package/evals/cases/dev/dependency-update.yaml +16 -0
package/evals/cases/dev/explore.yaml +20 -0
package/evals/cases/dev/index.html +370 -0
package/evals/cases/dev/package-lock.json +28 -0
package/evals/cases/dev/package.json +16 -0
package/evals/cases/dev/plan-work.yaml +20 -0
package/evals/cases/dev/promptfooconfig.yaml +666 -0
package/evals/cases/dev/search-first.yaml +20 -0
package/evals/cases/dev/tdd-workflow.yaml +48 -0
package/evals/cases/dev/verify-work.yaml +44 -0
package/evals/cases/dev/workflow.yaml +34 -0
package/evals/ci/run-baseline.sh +283 -0
package/evals/fixtures/backlog-provider-settings/global-default.json +44 -0
package/evals/fixtures/backlog-provider-settings/project-override.json +53 -0
package/evals/fixtures/builder-kit-workflow-state/baseline-freshness-resolution-hint.json +139 -0
package/evals/fixtures/builder-kit-workflow-state/direct-primitive-stop.json +59 -0
package/evals/fixtures/builder-kit-workflow-state/empty-board-route-shape.json +55 -0
package/evals/fixtures/builder-kit-workflow-state/happy-path.json +71 -0
package/evals/fixtures/builder-kit-workflow-state/mid-work-resume.json +80 -0
package/evals/fixtures/builder-kit-workflow-state/missing-prestep-recovery.json +65 -0
package/evals/fixtures/builder-kit-workflow-state/product-build-chaining.json +60 -0
package/evals/fixtures/builder-kit-workflow-state/stale-continuation-requires-new-probe.json +57 -0
package/evals/fixtures/console-learning-projection/artifacts/console-learning-correction/learning.json +50 -0
package/evals/fixtures/console-learning-projection/artifacts/console-learning-open-route/learning.json +41 -0
package/evals/fixtures/flow-kit-repository/invalid-absolute-path/kit.json +8 -0
package/evals/fixtures/flow-kit-repository/invalid-asset-section/flows/review.flow.json +6 -0
package/evals/fixtures/flow-kit-repository/invalid-asset-section/kit.json +11 -0
package/evals/fixtures/flow-kit-repository/invalid-duplicate-flow/flows/review.flow.json +6 -0
package/evals/fixtures/flow-kit-repository/invalid-duplicate-flow/kit.json +9 -0
package/evals/fixtures/flow-kit-repository/invalid-id/flows/review.flow.json +6 -0
package/evals/fixtures/flow-kit-repository/invalid-id/kit.json +8 -0
package/evals/fixtures/flow-kit-repository/invalid-malformed-json/kit.json +8 -0
package/evals/fixtures/flow-kit-repository/invalid-missing-flow/kit.json +8 -0
package/evals/fixtures/flow-kit-repository/invalid-missing-id/flows/review.flow.json +6 -0
package/evals/fixtures/flow-kit-repository/invalid-missing-id/kit.json +7 -0
package/evals/fixtures/flow-kit-repository/invalid-missing-schema-version/flows/review.flow.json +6 -0
package/evals/fixtures/flow-kit-repository/invalid-missing-schema-version/kit.json +7 -0
package/evals/fixtures/flow-kit-repository/invalid-name/flows/review.flow.json +6 -0
package/evals/fixtures/flow-kit-repository/invalid-name/kit.json +8 -0
package/evals/fixtures/flow-kit-repository/invalid-schema-version/flows/review.flow.json +6 -0
package/evals/fixtures/flow-kit-repository/invalid-schema-version/kit.json +8 -0
package/evals/fixtures/flow-kit-repository/invalid-traversal/kit.json +8 -0
package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/adapters/example.json +3 -0
package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/assets/example.txt +1 -0
package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/docs/README.md +3 -0
package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/flows/runtime.flow.json +26 -0
package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/kit-evals/example.json +3 -0
package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/kit-skills/mixed/SKILL.md +3 -0
package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/kit.json +44 -0
package/evals/fixtures/flow-kit-repository/valid-local-kit/docs/README.md +3 -0
package/evals/fixtures/flow-kit-repository/valid-local-kit/flows/review.flow.json +26 -0
package/evals/fixtures/flow-kit-repository/valid-local-kit/kit.json +20 -0
package/evals/fixtures/hook-influence/cases.json +336 -0
package/evals/fixtures/pull-work-provider/github-issues.json +170 -0
package/evals/fixtures/pull-work-wip-shepherding/global-wip-informs.json +43 -0
package/evals/fixtures/pull-work-wip-shepherding/personal-wip-blocks.json +42 -0
package/evals/fixtures/surface-trust/accepted-claim-trust-report.json +31 -0
package/evals/fixtures/surface-trust/artifact-absent.json +19 -0
package/evals/fixtures/surface-trust/integrity-mismatch-trust-report.json +32 -0
package/evals/fixtures/surface-trust/missing-authority-trust-report.json +27 -0
package/evals/fixtures/surface-trust/provider-absent.json +19 -0
package/evals/fixtures/surface-trust/rejected-claim-trust-report.json +30 -0
package/evals/fixtures/surface-trust/stale-claim-trust-snapshot.json +31 -0
package/evals/fixtures/usage-feedback/sample-full.jsonl +11 -0
package/evals/fixtures/usage-feedback/sample-outcomes.jsonl +1 -0
package/evals/fixtures/veritas-governance-adapter/fake-veritas-pass.sh +18 -0
package/evals/fixtures/veritas-governance-adapter/fake-veritas-secret-fail.sh +10 -0
package/evals/fixtures/veritas-governance-adapter/fake-veritas-unconfigured.sh +4 -0
package/evals/integration/test_bundle_install.sh +541 -0
package/evals/integration/test_console_learning_projection.sh +192 -0
package/evals/integration/test_context_map.sh +65 -0
package/evals/integration/test_effective_backlog_settings.sh +58 -0
package/evals/integration/test_fixture_retirement_audit.sh +58 -0
package/evals/integration/test_flow_agents_statusline.sh +93 -0
package/evals/integration/test_flow_kit_repository.sh +90 -0
package/evals/integration/test_goal_fit_hook.sh +482 -0
package/evals/integration/test_hook_category_behaviors.sh +190 -0
package/evals/integration/test_hook_influence_cases.sh +69 -0
package/evals/integration/test_local_flow_kit_install.sh +145 -0
package/evals/integration/test_publish_change_helper.sh +176 -0
package/evals/integration/test_pull_work_provider.sh +140 -0
package/evals/integration/test_runtime_adapter_activation.sh +106 -0
package/evals/integration/test_telemetry.sh +485 -0
package/evals/integration/test_telemetry_doctor.sh +193 -0
package/evals/integration/test_usage_feedback_dashboard.sh +169 -0
package/evals/integration/test_usage_feedback_global.sh +117 -0
package/evals/integration/test_usage_feedback_import.sh +227 -0
package/evals/integration/test_usage_feedback_outcomes.sh +165 -0
package/evals/integration/test_usage_feedback_report.sh +263 -0
package/evals/integration/test_veritas_governance_adapter.sh +235 -0
package/evals/integration/test_workflow_artifact_cleanup_audit.sh +287 -0
package/evals/integration/test_workflow_artifacts.sh +1247 -0
package/evals/integration/test_workflow_sidecar_writer.sh +2112 -0
package/evals/integration/test_workflow_steering_hook.sh +337 -0
package/evals/lib/assertions/delegated-to.js +40 -0
package/evals/lib/assertions/max-tool-calls.js +15 -0
package/evals/lib/assertions/no-write-tools.js +27 -0
package/evals/lib/assertions/pass-at-k.js +39 -0
package/evals/lib/assertions/telemetry-utils.js +105 -0
package/evals/lib/assertions/tool-called.js +39 -0
package/evals/lib/assertions/verify-after-fix.js +61 -0
package/evals/lib/claude-judge.sh +40 -0
package/evals/lib/claude-provider.sh +74 -0
package/evals/lib/codex-judge.sh +39 -0
package/evals/lib/codex-provider.sh +81 -0
package/evals/lib/eval-dev.sh +5 -0
package/evals/lib/eval-judge.sh +22 -0
package/evals/lib/eval-provider.sh +26 -0
package/evals/lib/eval-report.sh +73 -0
package/evals/lib/kiro-dev.sh +4 -0
package/evals/lib/kiro-judge.sh +17 -0
package/evals/lib/kiro-provider.sh +62 -0
package/evals/lib/node.sh +111 -0
package/evals/promptfooconfig.yaml +70 -0
package/evals/run.sh +309 -0
package/evals/static/test_evidence_refs.sh +141 -0
package/evals/static/test_package.sh +407 -0
package/evals/static/test_repo_hooks.sh +68 -0
package/evals/static/test_universal_bundles.sh +274 -0
package/evals/static/test_workflow_skills.sh +1207 -0
package/install.sh +64 -0
package/integrations/veritas/flow-agents.adapter.json +138 -0
package/integrations/veritas/flow-agents.authority-settings.json +26 -0
package/integrations/veritas/flow-agents.repo-standards.json +82 -0
package/kits/builder/flows/build.flow.json +218 -0
package/kits/builder/flows/shape.flow.json +127 -0
package/kits/builder/kit.json +19 -0
package/kits/catalog.json +11 -0
package/package.json +130 -0
package/packaging/README.md +60 -0
package/packaging/manifest.json +173 -0
package/packaging/packs.json +69 -0
package/powers/dependency-checker/POWER.md +20 -0
package/powers/dependency-checker/mcp.json +20 -0
package/powers/playwright/POWER.md +25 -0
package/powers/playwright/mcp.json +12 -0
package/prompts/code-audit.md +123 -0
package/prompts/kcommit.md +88 -0
package/schemas/backlog-provider-settings.schema.json +138 -0
package/schemas/workflow-acceptance.schema.json +216 -0
package/schemas/workflow-critique.schema.json +113 -0
package/schemas/workflow-evidence.schema.json +357 -0
package/schemas/workflow-handoff.schema.json +52 -0
package/schemas/workflow-learning.schema.json +223 -0
package/schemas/workflow-release.schema.json +172 -0
package/schemas/workflow-state.schema.json +80 -0
package/scripts/README.md +111 -0
package/scripts/build-universal-bundles.js +3 -0
package/scripts/check-content-boundary.cjs +99 -0
package/scripts/context-budget/budget-scan.sh +166 -0
package/scripts/detect-tools.sh +3 -0
package/scripts/discover-agents.sh +28 -0
package/scripts/effective-backlog-settings.js +2 -0
package/scripts/filter-installed-packs.js +2 -0
package/scripts/flow-kit.js +2 -0
package/scripts/generate-context-map.js +2 -0
package/scripts/git-status.sh +49 -0
package/scripts/hooks/claude-hook-adapter.js +174 -0
package/scripts/hooks/claude-telemetry-hook.js +115 -0
package/scripts/hooks/codex-hook-adapter.js +176 -0
package/scripts/hooks/codex-telemetry-hook.js +95 -0
package/scripts/hooks/config-protection.js +79 -0
package/scripts/hooks/desktop-notify.sh +39 -0
package/scripts/hooks/governance-audit.sh +135 -0
package/scripts/hooks/lib/audit-transport.sh +40 -0
package/scripts/hooks/lib/hook-flags.js +49 -0
package/scripts/hooks/lib/patterns.sh +57 -0
package/scripts/hooks/lib/resolve-formatter.js +80 -0
package/scripts/hooks/post-edit-accumulator.js +66 -0
package/scripts/hooks/pre-commit-quality.js +194 -0
package/scripts/hooks/quality-gate.js +93 -0
package/scripts/hooks/report-only-guard.js +21 -0
package/scripts/hooks/run-hook.js +136 -0
package/scripts/hooks/stop-format-typecheck.js +141 -0
package/scripts/hooks/stop-goal-fit.js +337 -0
package/scripts/hooks/workflow-steering.js +250 -0
package/scripts/install-codex-home.sh +106 -0
package/scripts/package.json +3 -0
package/scripts/promote-workflow-artifact.js +2 -0
package/scripts/publish-change-helper.js +2 -0
package/scripts/pull-work-provider.js +2 -0
package/scripts/setup-repo-hooks.sh +8 -0
package/scripts/statusline/flow-agents-statusline.js +157 -0
package/scripts/telemetry/console-presets.sh +14 -0
package/scripts/telemetry/install-console-config.sh +214 -0
package/scripts/telemetry/lib/config.sh +85 -0
package/scripts/telemetry/lib/enrich.sh +115 -0
package/scripts/telemetry/lib/redact.sh +22 -0
package/scripts/telemetry/lib/session.sh +63 -0
package/scripts/telemetry/lib/transport.sh +183 -0
package/scripts/telemetry/lib/usage.sh +29 -0
package/scripts/telemetry/sync-agents.sh +173 -0
package/scripts/telemetry/telemetry.conf +23 -0
package/scripts/telemetry/telemetry.sh +387 -0
package/scripts/usage-feedback.js +2 -0
package/scripts/validate-hook-influence-cases.js +2 -0
package/scripts/validate-package.sh +89 -0
package/scripts/validate-source-tree.js +9 -0
package/skills/agentic-engineering/SKILL.md +62 -0
package/skills/browser-test/SKILL.md +51 -0
package/skills/builder-shape/SKILL.md +76 -0
package/skills/context-budget/SKILL.md +40 -0
package/skills/deliver/SKILL.md +241 -0
package/skills/dependency-update/SKILL.md +68 -0
package/skills/design-probe/SKILL.md +107 -0
package/skills/eval-rebuild/SKILL.md +39 -0
package/skills/evidence-gate/SKILL.md +186 -0
package/skills/execute-plan/SKILL.md +110 -0
package/skills/explore/SKILL.md +137 -0
package/skills/feedback-loop/SKILL.md +87 -0
package/skills/fix-bug/SKILL.md +133 -0
package/skills/frontend-design/SKILL.md +80 -0
package/skills/github-cli/SKILL.md +63 -0
package/skills/idea-to-backlog/SKILL.md +267 -0
package/skills/knowledge-capture/SKILL.md +55 -0
package/skills/learning-review/SKILL.md +115 -0
package/skills/pickup-probe/SKILL.md +114 -0
package/skills/plan-work/SKILL.md +176 -0
package/skills/pull-work/SKILL.md +309 -0
package/skills/release-readiness/SKILL.md +121 -0
package/skills/review-work/SKILL.md +161 -0
package/skills/search-first/SKILL.md +66 -0
package/skills/tdd-workflow/SKILL.md +140 -0
package/skills/verify-work/SKILL.md +109 -0
package/src/cli/console-learning-projection.ts +140 -0
package/src/cli/effective-backlog-settings.ts +99 -0
package/src/cli/fixture-retirement-audit.ts +154 -0
package/src/cli/flow-kit.ts +139 -0
package/src/cli/init.ts +248 -0
package/src/cli/promote-workflow-artifact.ts +64 -0
package/src/cli/publish-change-helper.ts +143 -0
package/src/cli/pull-work-provider.ts +481 -0
package/src/cli/runtime-adapter.ts +24 -0
package/src/cli/telemetry-doctor.ts +243 -0
package/src/cli/usage-feedback.ts +418 -0
package/src/cli/validate-hook-influence.ts +119 -0
package/src/cli/validate-source-tree.ts +30 -0
package/src/cli/validate-workflow-artifacts.ts +411 -0
package/src/cli/veritas-governance.ts +322 -0
package/src/cli/workflow-artifact-cleanup-audit.ts +281 -0
package/src/cli/workflow-sidecar.ts +676 -0
package/src/cli.ts +95 -0
package/src/flow-kit/validate.ts +74 -0
package/src/lib/args.ts +43 -0
package/src/lib/fs.ts +62 -0
package/src/lib/workflow-learning-projection.ts +491 -0
package/src/runtime-adapters.ts +154 -0
package/src/tools/build-universal-bundles.ts +366 -0
package/src/tools/common.ts +61 -0
package/src/tools/filter-installed-packs.ts +129 -0
package/src/tools/generate-context-map.ts +199 -0
package/src/tools/validate-package.ts +57 -0
package/src/tools/validate-source-tree.ts +488 -0
package/tsconfig.json +19 -0
package/veritas.claims.json +6 -0

package/scripts/telemetry/telemetry.sh ADDED Viewed

@@ -0,0 +1,387 @@
+#!/usr/bin/env bash
+# telemetry.sh — Kiro adapter for generic agent telemetry schema v0.3.0
+# Usage: echo '<hook_event_json>' | bash telemetry.sh <event_type> <agent_name>
+set -o pipefail
+TELEMETRY_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+source "${TELEMETRY_DIR}/lib/config.sh"
+source "${TELEMETRY_DIR}/lib/session.sh"
+source "${TELEMETRY_DIR}/lib/enrich.sh"
+source "${TELEMETRY_DIR}/lib/transport.sh"
+source "${TELEMETRY_DIR}/lib/usage.sh"
+normalize_tool_name() {
+  case "$1" in
+    Bash|bash|shell|execute_bash) echo "execute_bash" ;;
+    apply_patch|Edit|Write|fs_write|write|code) echo "fs_write" ;;
+    spawn_agent|use_subagent|InvokeSubagents|Task|Agent|"delegate to a specialist agent") echo "use_subagent" ;;
+    Read|read|fs_read) echo "fs_read" ;;
+    *) echo "$1" ;;
+  esac
+}
+telemetry_session_id() {
+  local event_type="$1" agent_name="$2"
+  local session_id=""
+  case "$event_type" in
+    agentSpawn)
+      session_id=$(session_start "$agent_name")
+      session_cleanup
+      ;;
+    stop)
+      session_id=$(session_get)
+      session_end
+      ;;
+    *)
+      session_id=$(session_get)
+      # Touch session file so mtime reflects last activity
+      local _sf="${TELEMETRY_SESSION_DIR}/telemetry-${PPID}"
+      [[ -f "$_sf" ]] && touch "$_sf" 2>/dev/null
+      ;;
+  esac
+  echo "${session_id:-no-session}"
+}
+schema_event_type() {
+  local event_type="$1"
+  case "$event_type" in
+    agentSpawn|SessionStart) echo "session.start" ;;
+    stop|Stop|SessionEnd) echo "session.end" ;;
+    userPromptSubmit|UserPromptSubmit) echo "turn.user" ;;
+    preToolUse|PreToolUse) echo "tool.invoke" ;;
+    permissionRequest|PermissionRequest) echo "tool.permission_request" ;;
+    postToolUse|PostToolUse|PostToolUseFailure) echo "tool.result" ;;
+    *) echo "unknown" ;;
+  esac
+}
+runtime_version() {
+  local runtime_name runtime_binary runtime_version
+  runtime_name="$1"
+  case "$runtime_name" in
+    codex) runtime_binary="codex" ;;
+    claude|claude-code) runtime_binary="claude"; runtime_name="claude-code" ;;
+    kiro|kiro-cli) runtime_binary="kiro-cli"; runtime_name="kiro-cli" ;;
+    *) runtime_binary="$runtime_name" ;;
+  esac
+  runtime_version=$(
+    "$runtime_binary" --version 2>/dev/null &
+    _pid=$!; ( sleep 2; kill $_pid 2>/dev/null ) &
+    _guard=$!; wait $_pid 2>/dev/null; kill $_guard 2>/dev/null
+    wait $_pid 2>/dev/null
+  ) 2>/dev/null
+  runtime_version=$(echo "$runtime_version" | head -n1)
+  echo "${runtime_version:-unknown}"
+}
+build_base_event() {
+  local session_id="$1" schema_event_type="$2" agent_name="$3"
+  local runtime_name="${FLOW_AGENTS_TELEMETRY_RUNTIME:-kiro-cli}"
+  case "$runtime_name" in
+    claude|claude-code) runtime_name="claude-code" ;;
+    kiro|kiro-cli) runtime_name="kiro-cli" ;;
+  esac
+  jq -nc \
+    --arg sv "0.3.0" \
+    --arg ts "$(date +%s)000" \
+    --arg sid "$session_id" \
+    --arg eid "$(uuidgen 2>/dev/null || echo "e-$(date +%s)-$$")" \
+    --arg et "$schema_event_type" \
+    --arg an "$agent_name" \
+    --arg rv "$(runtime_version "$runtime_name")" \
+    --arg rn "$runtime_name" \
+    '{
+      schema_version: $sv,
+      timestamp: $ts,
+      session_id: $sid,
+      event_id: $eid,
+      event_type: $et,
+      agent: {
+        name: $an,
+        runtime: $rn,
+        version: $rv
+      }
+    }'
+}
+add_hook_context() {
+  local event="$1" event_type="$2" stdin_json="$3"
+  local cwd tty_name pid runtime_session_id runtime_turn_id transcript_path hook_event_name model_name source stop_hook_active last_assistant_message raw_hook_input
+  cwd=$(echo "$stdin_json" | jq -r '.cwd // ""')
+  runtime_session_id=$(echo "$stdin_json" | jq -r '.session_id // ""')
+  runtime_turn_id=$(echo "$stdin_json" | jq -r '.turn_id // ""')
+  transcript_path=$(echo "$stdin_json" | jq -r '.transcript_path // ""')
+  hook_event_name=$(echo "$stdin_json" | jq -r '.hook_event_name // ""')
+  model_name=$(echo "$stdin_json" | jq -r '.model // ""')
+  source=$(echo "$stdin_json" | jq -r '.source // ""')
+  stop_hook_active=$(echo "$stdin_json" | jq -r '.stop_hook_active // empty')
+  last_assistant_message=$(echo "$stdin_json" | jq -r '.last_assistant_message // ""')
+  if [[ "$FLOW_AGENTS_TELEMETRY_CAPTURE_RAW_HOOK_INPUT" == "true" ]]; then
+    raw_hook_input="$stdin_json"
+  else
+    raw_hook_input="null"
+  fi
+  tty_name=$(session_get_tty)
+  pid=$(cat "${TELEMETRY_SESSION_DIR}/${session_id}.session" 2>/dev/null | jq -r '.pid // empty')
+  echo "$event" | jq -c \
+    --arg event_name "${hook_event_name:-$event_type}" \
+    --arg runtime_session_id "$runtime_session_id" \
+    --arg turn_id "$runtime_turn_id" \
+    --arg transcript_path "$transcript_path" \
+    --arg model "$model_name" \
+    --arg source "$source" \
+    --arg stop_hook_active "$stop_hook_active" \
+    --arg last_assistant_message "$last_assistant_message" \
+    --argjson raw "$raw_hook_input" \
+    '. + {
+      hook: {
+        event_name: $event_name,
+        runtime_session_id: $runtime_session_id,
+        turn_id: $turn_id,
+        transcript_path: $transcript_path,
+        model: $model,
+        source: $source,
+        stop_hook_active: (if $stop_hook_active == "" then null else ($stop_hook_active == "true") end),
+        last_assistant_message: $last_assistant_message,
+        raw_input: $raw
+      }
+    }'
+}
+add_runtime_context() {
+  local event="$1" event_type="$2" stdin_json="$3"
+  local cwd tty_name pid
+  cwd=$(echo "$stdin_json" | jq -r '.cwd // ""')
+  tty_name=$(session_get_tty)
+  pid=$(cat "${TELEMETRY_SESSION_DIR}/${session_id}.session" 2>/dev/null | jq -r '.pid // empty')
+  if [[ "$event_type" == "agentSpawn" ]]; then
+    local sys_json ws_json auth_json
+    sys_json=$(enrich_system)
+    ws_json=$(enrich_workspace)
+    auth_json=$(enrich_auth)
+    local os shell
+    os=$(echo "$sys_json" | jq -r '.os // "unknown"')
+    shell=$(echo "$sys_json" | jq -r '.shell // "unknown"')
+    echo "$event" | jq -c \
+      --arg cwd "$cwd" \
+      --arg tty "$tty_name" \
+      --arg os "$os" \
+      --arg shell "$shell" \
+      --argjson pid "${pid:-0}" \
+      --argjson sys "$sys_json" \
+      --argjson ws "$ws_json" \
+      --argjson auth "$auth_json" \
+      '. + {
+        context: {cwd: $cwd, tty: $tty, os: $os, shell: $shell, pid: $pid},
+        enrichment: {system: $sys, workspace: $ws, auth: $auth}
+      }'
+  else
+    echo "$event" | jq -c \
+      --arg cwd "$cwd" \
+      --arg tty "$tty_name" \
+      --argjson pid "${pid:-0}" \
+      '. + {context: {cwd: $cwd, tty: $tty, pid: $pid}}'
+  fi
+}
+add_user_prompt_data() {
+  local event="$1" stdin_json="$2"
+  local prompt_text prompt_length
+  prompt_text=$(echo "$stdin_json" | jq -r '.prompt // ""')
+  prompt_length=${#prompt_text}
+  echo "$event" | jq -c \
+    --arg pt "$prompt_text" \
+    --argjson pl "$prompt_length" \
+    '. + {turn: {prompt_text: $pt, prompt_length: $pl}}'
+}
+add_tool_event_data() {
+  local event="$1" event_type="$2" stdin_json="$3"
+  local tool_name tool_normalized_name tool_input tool_output permission_description
+  tool_name=$(echo "$stdin_json" | jq -r '.tool_name // ""')
+  tool_normalized_name=$(normalize_tool_name "$tool_name")
+  tool_input=$(echo "$stdin_json" | jq -c '.tool_input // null')
+  tool_output=$(echo "$stdin_json" | jq -c '.tool_response // null')
+  permission_description=$(echo "$stdin_json" | jq -r '.tool_input.description // ""')
+  if [[ "$event_type" == "preToolUse" ]]; then
+    event=$(echo "$event" | jq -c \
+      --arg tn "$tool_name" \
+      --arg nn "$tool_normalized_name" \
+      --argjson ti "$tool_input" \
+      '. + {tool: {name: $tn, normalized_name: $nn, input: $ti}}')
+  elif [[ "$event_type" == "permissionRequest" || "$event_type" == "PermissionRequest" ]]; then
+    event=$(echo "$event" | jq -c \
+      --arg tn "$tool_name" \
+      --arg nn "$tool_normalized_name" \
+      --argjson ti "$tool_input" \
+      --arg desc "$permission_description" \
+      '. + {tool: {name: $tn, normalized_name: $nn, input: $ti}, permission: {description: $desc}}')
+  else
+    event=$(echo "$event" | jq -c \
+      --arg tn "$tool_name" \
+      --arg nn "$tool_normalized_name" \
+      --argjson to "$tool_output" \
+      '. + {tool: {name: $tn, normalized_name: $nn, output: $to}}')
+  fi
+  echo "$event"
+}
+emit_delegation_event() {
+  local event="$1" event_type="$2" stdin_json="$3"
+  local tool_name tool_input
+  tool_name=$(echo "$stdin_json" | jq -r '.tool_name // ""')
+  tool_input=$(echo "$stdin_json" | jq -c '.tool_input // null')
+  if [[ "$tool_name" == "InvokeSubagents" && "$event_type" == "preToolUse" ]]; then
+    local targets
+    targets=$(echo "$tool_input" | jq -c '.targets // []')
+    if [[ "$targets" != "[]" ]]; then
+      local delegate_event
+      delegate_event=$(echo "$event" | jq -c \
+        --argjson targets "$targets" \
+        '.event_type = "agent.delegate" | . + {delegation: {targets: $targets}} | del(.tool)')
+      transport_emit "$delegate_event"
+    fi
+  elif [[ "$tool_name" == "spawn_agent" && "$event_type" == "preToolUse" ]]; then
+    local target
+    target=$(echo "$tool_input" | jq -r '.agent_type // "default"')
+    if [[ -n "$target" && "$target" != "null" ]]; then
+      local delegate_event
+      delegate_event=$(echo "$event" | jq -c \
+        --arg target "$target" \
+        '.event_type = "agent.delegate" | . + {delegation: {targets: [$target]}} | del(.tool)')
+      transport_emit "$delegate_event"
+    fi
+  elif [[ "$tool_name" == "use_subagent" || "$tool_name" == "subagent" || "$tool_name" == "delegate to a specialist agent" ]] && [[ "$event_type" == "preToolUse" ]]; then
+    local targets
+    targets=$(echo "$tool_input" | jq -c '
+      if (.targets? | type) == "array" then .targets
+      elif (.subagents? | type) == "array" then .subagents | map(.agent_name // .agent // .subagent_type // .name // "subagent")
+      elif (.content.subagents? | type) == "array" then .content.subagents | map(.agent_name // .agent // .subagent_type // .name // "subagent")
+      elif (.agent_name? // .agent? // .subagent_type? // empty) != "" then [(.agent_name // .agent // .subagent_type)]
+      else ["subagent"]
+      end
+    ')
+    if [[ "$targets" != "[]" ]]; then
+      local delegate_event
+      delegate_event=$(echo "$event" | jq -c \
+        --argjson targets "$targets" \
+        '.event_type = "agent.delegate" | . + {delegation: {targets: $targets}} | del(.tool)')
+      transport_emit "$delegate_event"
+    fi
+  elif [[ "$tool_name" == "Task" || "$tool_name" == "Agent" ]] && [[ "$event_type" == "preToolUse" ]]; then
+    local target
+    target=$(echo "$tool_input" | jq -r '.subagent_type // .agent_type // .agent // "general-purpose"')
+    if [[ -n "$target" && "$target" != "null" ]]; then
+      local delegate_event
+      delegate_event=$(echo "$event" | jq -c \
+        --arg target "$target" \
+        '.event_type = "agent.delegate" | . + {delegation: {targets: [$target]}} | del(.tool)')
+      transport_emit "$delegate_event"
+    fi
+  fi
+}
+add_tool_data_and_emit_delegation() {
+  local event="$1" event_type="$2" stdin_json="$3"
+  event=$(add_tool_event_data "$event" "$event_type" "$stdin_json")
+  emit_delegation_event "$event" "$event_type" "$stdin_json"
+  echo "$event"
+}
+add_stop_data_and_emit_usage() {
+  local event="$1" agent_name="$2"
+  local duration_s
+  duration_s=$(cat "${TELEMETRY_SESSION_DIR}/${session_id}.session" 2>/dev/null | jq -r '.duration_s // 0')
+  event=$(echo "$event" | jq -c \
+    --argjson ds "$duration_s" \
+    '. + {session: {duration_s: $ds}}')
+  if [[ "$TELEMETRY_USAGE_TRACKING" == "true" ]]; then
+    local model tool_count delegation_count
+    model=$(usage_get_model "$agent_name")
+    local full_log="${TELEMETRY_CHANNEL_FULL_LOG_FILE}"
+    tool_count=$(usage_count_tool_calls "$session_id" "$full_log")
+    delegation_count=$(usage_count_delegations "$session_id" "$full_log")
+    local usage_event
+    usage_event=$(echo "$event" | jq -c \
+      --arg m "$model" \
+      --argjson tc "$tool_count" \
+      --argjson dc "$delegation_count" \
+      '.event_type = "session.usage" | .event_id = (.event_id + "-usage") | . + {
+        usage: {model: $m, duration_s: .session.duration_s, tool_invocations: $tc, delegations: $dc, input_tokens: null, output_tokens: null, estimated_cost_usd: null}
+      }')
+    transport_emit "$usage_event"
+  fi
+  echo "$event"
+}
+add_event_specific_data() {
+  local event="$1" event_type="$2" agent_name="$3" stdin_json="$4"
+  case "$event_type" in
+    userPromptSubmit|UserPromptSubmit)
+      add_user_prompt_data "$event" "$stdin_json"
+      ;;
+    preToolUse|PreToolUse|permissionRequest|PermissionRequest|postToolUse|PostToolUse|PostToolUseFailure)
+      add_tool_data_and_emit_delegation "$event" "$event_type" "$stdin_json"
+      ;;
+    stop|Stop|SessionEnd)
+      add_stop_data_and_emit_usage "$event" "$agent_name"
+      ;;
+    *)
+      echo "$event"
+      ;;
+  esac
+}
+main() {
+  [[ "$TELEMETRY_ENABLED" != "true" ]] && return 0
+  local event_type="${1:-unknown}" agent_name="${2:-unknown}"
+  local stdin_json="${3:-}"
+  [[ -z "$stdin_json" ]] && stdin_json='{}'
+  session_id=$(telemetry_session_id "$event_type" "$agent_name")
+  local event
+  event=$(build_base_event "$session_id" "$(schema_event_type "$event_type")" "$agent_name")
+  event=$(add_hook_context "$event" "$event_type" "$stdin_json")
+  event=$(add_runtime_context "$event" "$event_type" "$stdin_json")
+  event=$(add_event_specific_data "$event" "$event_type" "$agent_name" "$stdin_json")
+  transport_emit "$event"
+  [[ "$event_type" == "stop" ]] && transport_maybe_rotate
+}
+# Capture stdin before backgrounding (background subshell gets /dev/null)
+_stdin=$(cat)
+if [[ "${FLOW_AGENTS_TELEMETRY_FOREGROUND:-false}" == "true" ]]; then
+  main "$@" "$_stdin"
+else
+  (main "$@" "$_stdin") </dev/null &>/dev/null &
+  disown 2>/dev/null
+fi
+if [[ "${FLOW_AGENTS_TELEMETRY_RUNTIME:-kiro-cli}" == "codex" ]]; then
+  _hook_event_name=$(printf '%s' "$_stdin" | jq -r '.hook_event_name // ""' 2>/dev/null)
+  case "$_hook_event_name" in
+    SessionStart)
+      printf '{"continue":true,"hookSpecificOutput":{"hookEventName":"SessionStart","additionalContext":"Flow Agents telemetry hooks are active for this session."}}\n'
+      ;;
+    UserPromptSubmit)
+      printf '{"continue":true,"hookSpecificOutput":{"hookEventName":"UserPromptSubmit","additionalContext":"Flow Agents telemetry captured this prompt."}}\n'
+      ;;
+    Stop)
+      printf '{"continue":true}\n'
+      ;;
+  esac
+fi
+exit 0

package/scripts/usage-feedback.js ADDED Viewed

	@@ -0,0 +1,2 @@
1	+ #!/usr/bin/env node
2	+ import("../build/src/cli/usage-feedback.js").then(({ main }) => process.exit(main()));

package/scripts/validate-hook-influence-cases.js ADDED Viewed

	@@ -0,0 +1,2 @@
1	+ #!/usr/bin/env node
2	+ import("../build/src/cli/validate-hook-influence.js").then(({ main }) => process.exit(main(process.argv.slice(2))));

package/scripts/validate-package.sh ADDED Viewed

@@ -0,0 +1,89 @@
+#!/usr/bin/env bash
+# validate-package.sh — Validate an installed Flow Agents bundle
+# Usage: bash validate-package.sh <package-prefix> [--local]
+set -uo pipefail
+PREFIX="${1:?Usage: validate-package.sh <package-prefix> [--local]}"
+[[ "${2:-}" == "--local" ]] && PREFIX="local-${PREFIX}"
+AGENTS_DIR="$HOME/.kiro/agents"
+errors=0
+echo "Package: ${PREFIX}"
+echo ""
+# Find agents
+count=$(ls "$AGENTS_DIR/${PREFIX}-"*.json 2>/dev/null | wc -l | tr -d ' ')
+echo "Agents: ${count} found"
+[[ "$count" -eq 0 ]] && echo "✗ No agents found" && exit 1
+echo ""
+# 1. Well-formedness
+spec_ok=0; spec_fail=0
+for f in "$AGENTS_DIR/${PREFIX}-"*.json; do
+  name=$(jq -r '.name // empty' "$f")
+  has_all=$(jq -r 'if .name and .prompt and .model and .description then "yes" else "no" end' "$f")
+  if [[ "$has_all" != "yes" ]]; then
+    echo "  ✗ $(basename $f): missing required field(s)"
+    spec_fail=$((spec_fail + 1))
+  elif ! echo "$name" | grep -qE '^[a-z][a-z0-9-]*$'; then
+    echo "  ✗ $name: invalid name format"
+    spec_fail=$((spec_fail + 1))
+  else
+    spec_ok=$((spec_ok + 1))
+  fi
+done
+echo "$([ $spec_fail -eq 0 ] && echo ✓ || echo ✗) Agent specs: ${spec_ok}/${count} well-formed"
+errors=$((errors + spec_fail))
+# 2. Hook scripts
+hook_total=0; hook_fail=0
+for f in "$AGENTS_DIR/${PREFIX}-"*.json; do
+  name=$(jq -r '.name' "$f")
+  for cmd in $(jq -r '.hooks // {} | .[] | .[] | .command // empty' "$f" 2>/dev/null); do
+    : # jq gives full command, need line-by-line
+  done
+  jq -r '.hooks // {} | to_entries[] | .key as $t | .value[] | "\($t)|\(.command // empty)"' "$f" 2>/dev/null | while IFS='|' read -r htype cmd; do
+    [[ -z "$cmd" ]] && continue
+    script=$(echo "$cmd" | sed 's/^bash //' | awk '{print $1}')
+    script="${script/#\~/$HOME}"
+    if [[ ! -f "$script" ]]; then
+      echo "  ✗ $name → $htype: $(basename $script) (not found)"
+    fi
+  done
+done
+hook_total=$(for f in "$AGENTS_DIR/${PREFIX}-"*.json; do jq '[.hooks // {} | .[] | .[]] | length' "$f" 2>/dev/null; done | awk '{s+=$1}END{print s}')
+hook_fail=$(for f in "$AGENTS_DIR/${PREFIX}-"*.json; do
+  jq -r '.hooks // {} | .[] | .[] | .command // empty' "$f" 2>/dev/null | while read cmd; do
+    [[ -z "$cmd" ]] && continue
+    script=$(echo "$cmd" | sed 's/^bash //' | awk '{print $1}')
+    script="${script/#\~/$HOME}"
+    [[ ! -f "$script" ]] && echo "x"
+  done
+done | wc -l | tr -d ' ')
+hook_ok=$((hook_total - hook_fail))
+echo "$([ $hook_fail -eq 0 ] && echo ✓ || echo ✗) Hook scripts: ${hook_ok}/${hook_total} resolved"
+errors=$((errors + hook_fail))
+# 3. Absolute resource paths
+res_fail=0
+for f in "$AGENTS_DIR/${PREFIX}-"*.json; do
+  name=$(jq -r '.name' "$f")
+  jq -r '.resources // [] | .[] | select(startswith("file://"))' "$f" 2>/dev/null | while read res; do
+    path="${res#file://}"
+    path="${path/#\~/$HOME}"
+    [[ "$path" == *"*"* || "$path" != /* ]] && continue
+    if [[ ! -f "$path" && ! -d "$path" ]]; then
+      echo "  ✗ $name: missing $path"
+    fi
+  done
+done
+echo "✓ Resources: checked"
+# 4. Summary
+echo ""
+if [[ $errors -eq 0 ]]; then
+  echo "Result: PASS"
+else
+  echo "Result: FAIL ($errors error(s))"
+fi

package/scripts/validate-source-tree.js ADDED Viewed

@@ -0,0 +1,9 @@
+#!/usr/bin/env node
+import("node:child_process").then(({ spawnSync }) => {
+  const result = spawnSync("npm", ["run", "validate:source", "--silent", "--", ...process.argv.slice(2)], {
+    cwd: new URL("..", import.meta.url),
+    encoding: "utf8",
+    stdio: "inherit",
+  });
+  process.exit(result.status ?? 1);
+});

package/skills/agentic-engineering/SKILL.md ADDED Viewed

@@ -0,0 +1,62 @@
+---
+name: agentic-engineering
+description: "Eval-first execution, task decomposition, and cost-aware model routing for AI-driven development workflows."
+---
+# Agentic Engineering
+Principles for AI-driven development: eval-first loops, disciplined decomposition, and cost-aware model selection.
+## Eval-First Loop
+Every implementation follows this cycle:
+1. **Define eval** — write the acceptance criteria as a runnable check (test, script, assertion)
+2. **Run baseline** — capture current behavior against the eval
+3. **Implement** — make the change
+4. **Re-run eval** — verify improvement
+5. **Check regressions** — run the full suite, not just the new eval
+Never ship without steps 4-5. If you can't define an eval, the requirement isn't clear enough.
+## 15-Minute Unit Rule
+Decompose every task into units where each:
+- Is **independently verifiable** — has its own eval or test
+- Has a **single dominant risk** — one thing that could go wrong
+- Has a **clear done condition** — unambiguous pass/fail
+- Takes **~15 minutes** of focused agent work
+If a unit can't be verified independently, it's too coupled. If it has multiple risks, split it.
+## Model Routing
+Match model tier to task complexity:
+| Tier | Model class | Use for |
+|------|-------------|---------|
+| Fast | Haiku | Boilerplate, narrow edits, formatting, simple transforms |
+| Standard | Sonnet | Implementation, refactors, test writing, code review |
+| Reasoning | Opus | Architecture decisions, root-cause analysis, complex debugging |
+### Cost Discipline
+- Start at the lowest tier that could work
+- Escalate only when the lower tier fails with a **clear reasoning gap** (not just a wrong answer — a structural inability to solve the problem)
+- Document the escalation reason: "Sonnet couldn't hold the full dependency graph → escalated to Opus"
+- Never use Opus for tasks Sonnet handles correctly
+## Session Strategy
+- **Continue** session for coupled units within the same phase
+- **Fresh** session after phase transitions (plan → implement, implement → verify)
+- **Compact** after milestones — summarize context, drop intermediate artifacts
+## Review Focus for AI-Generated Code
+AI code passes syntax checks easily but fails on subtler dimensions. Prioritize reviewing:
+- **Invariants** — are assumptions about state actually enforced?
+- **Edge cases** — empty inputs, boundary values, concurrent access
+- **Error boundaries** — does the error surface or get swallowed?
+- **Security assumptions** — auth checks, input sanitization, secret handling
+- **Hidden coupling** — does this change break something non-obvious elsewhere?

package/skills/browser-test/SKILL.md ADDED Viewed

@@ -0,0 +1,51 @@
+---
+name: "browser-test"
+description: "Headless browser automation via Playwright — screenshots, accessibility checks, form filling, UI testing, DOM inspection."
+---
+# Browser Testing
+Delegate browser automation and testing tasks to `tool-playwright` for real browser interaction — page loading, accessibility snapshots, form filling, screenshots, and user flow testing.
+## Trigger Patterns
+This skill activates when the user:
+- Wants to load a URL and inspect the page
+- Wants to test a user flow (click, type, navigate)
+- Wants to check accessibility (ARIA roles, tab order, snapshots)
+- Wants a screenshot for visual verification
+- Wants to fill forms or interact with UI elements
+- Mentions Playwright, browser testing, or DOM inspection
+- Needs to debug frontend behavior in a live browser
+## Workflow
+### Step 1: CLARIFY TARGET
+Identify what the user wants tested — a URL, a local dev server, a specific flow. If a local server is needed and not running, tell the user to start it first and provide the URL.
+### Step 2: DELEGATE
+Hand off to `tool-playwright` with a clear prompt describing:
+- The URL to load
+- What to inspect or test (accessibility, visual, flow)
+- Any specific interactions (click X, fill Y, navigate to Z)
+### Step 3: REPORT
+Relay `tool-playwright`'s findings back to the user. Highlight:
+- Accessibility issues found via snapshots
+- Visual anomalies from screenshots
+- Flow failures or unexpected behavior
+- Suggested fixes if applicable
+## NOT For
+- General web search or fetching page content for research — use web search tools instead
+- Scraping data from websites
+- API testing — use curl or httpie directly
+## Key Principles
+- ALWAYS delegate to `tool-playwright` — do not attempt browser interaction directly
+- Prefer accessibility snapshots (`browser_snapshot`) over screenshots for understanding page structure
+- If the user provides a localhost URL, confirm the dev server is running before delegating
+- Close the browser when done