@kontourai/flow-agents 0.1.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.githooks/pre-push +11 -0
- package/.github/workflows/ci.yml +210 -0
- package/.github/workflows/docs-pages.yml +52 -0
- package/.github/workflows/publish-npm.yml +104 -0
- package/AGENTS.md +26 -0
- package/CHANGELOG.md +66 -0
- package/CODE_OF_CONDUCT.md +25 -0
- package/CONTEXT.md +300 -0
- package/CONTRIBUTING.md +44 -0
- package/LICENSE +201 -0
- package/README.md +129 -0
- package/SECURITY.md +33 -0
- package/agent-cards/dev.json +19 -0
- package/agents/dev.json +127 -0
- package/agents/tool-code-reviewer.json +61 -0
- package/agents/tool-dependencies-updater.json +118 -0
- package/agents/tool-explore-config.json +92 -0
- package/agents/tool-explore-deps.json +92 -0
- package/agents/tool-explore-entry.json +92 -0
- package/agents/tool-explore-patterns.json +92 -0
- package/agents/tool-explore-structure.json +92 -0
- package/agents/tool-explore-tests.json +92 -0
- package/agents/tool-planner.json +57 -0
- package/agents/tool-playwright.json +145 -0
- package/agents/tool-security-reviewer.json +56 -0
- package/agents/tool-verifier.json +61 -0
- package/agents/tool-worker.json +58 -0
- package/build/src/cli/console-learning-projection.js +123 -0
- package/build/src/cli/docs-preview.js +39 -0
- package/build/src/cli/effective-backlog-settings.js +102 -0
- package/build/src/cli/export-bookmarks.js +38 -0
- package/build/src/cli/fixture-retirement-audit.js +140 -0
- package/build/src/cli/flow-kit.js +138 -0
- package/build/src/cli/import-bookmarks.js +50 -0
- package/build/src/cli/init.js +239 -0
- package/build/src/cli/instinct-cli.js +93 -0
- package/build/src/cli/promote-workflow-artifact.js +63 -0
- package/build/src/cli/publish-change-helper.js +154 -0
- package/build/src/cli/pull-work-provider.js +469 -0
- package/build/src/cli/runtime-adapter.js +23 -0
- package/build/src/cli/telemetry-doctor.js +221 -0
- package/build/src/cli/usage-feedback.js +443 -0
- package/build/src/cli/validate-hook-influence.js +152 -0
- package/build/src/cli/validate-source-tree.js +31 -0
- package/build/src/cli/validate-workflow-artifacts.js +486 -0
- package/build/src/cli/veritas-governance.js +262 -0
- package/build/src/cli/workflow-artifact-cleanup-audit.js +272 -0
- package/build/src/cli/workflow-sidecar.js +816 -0
- package/build/src/cli.js +89 -0
- package/build/src/flow-kit/validate.js +75 -0
- package/build/src/lib/args.js +45 -0
- package/build/src/lib/fs.js +62 -0
- package/build/src/lib/workflow-learning-projection.js +334 -0
- package/build/src/runtime-adapters.js +146 -0
- package/build/src/tools/build-universal-bundles.js +397 -0
- package/build/src/tools/common.js +56 -0
- package/build/src/tools/filter-installed-packs.js +132 -0
- package/build/src/tools/generate-context-map.js +198 -0
- package/build/src/tools/validate-package.js +64 -0
- package/build/src/tools/validate-source-tree.js +622 -0
- package/console.telemetry.json +176 -0
- package/context/base-rules.md +17 -0
- package/context/code-review-standards.md +62 -0
- package/context/coding-standards.md +42 -0
- package/context/common/orchestrators.md +12 -0
- package/context/common/subagents.md +28 -0
- package/context/contracts/artifact-contract.md +182 -0
- package/context/contracts/builder-kit-workflow-state-contract.md +319 -0
- package/context/contracts/delivery-contract.md +69 -0
- package/context/contracts/execution-contract.md +53 -0
- package/context/contracts/governance-adapter-contract.md +67 -0
- package/context/contracts/planning-contract.md +85 -0
- package/context/contracts/review-contract.md +104 -0
- package/context/contracts/sandbox-policy.md +52 -0
- package/context/contracts/verification-contract.md +134 -0
- package/context/contracts/work-item-contract.md +215 -0
- package/context/deferred/demo-mode.md +33 -0
- package/context/deferred/languages/go.md +31 -0
- package/context/deferred/languages/python.md +31 -0
- package/context/deferred/languages/typescript.md +34 -0
- package/context/deferred/parallelization.md +35 -0
- package/context/deferred/worktree-isolation.md +24 -0
- package/context/development-workflow.md +50 -0
- package/context/scripts/context-budget/budget-scan.sh +166 -0
- package/context/scripts/detect-tools.sh +3 -0
- package/context/scripts/discover-agents.sh +28 -0
- package/context/scripts/git-status.sh +49 -0
- package/context/scripts/hooks/config-protection.js +79 -0
- package/context/scripts/hooks/desktop-notify.sh +39 -0
- package/context/scripts/hooks/governance-audit.sh +135 -0
- package/context/scripts/hooks/lib/audit-transport.sh +40 -0
- package/context/scripts/hooks/lib/hook-flags.js +49 -0
- package/context/scripts/hooks/lib/patterns.sh +57 -0
- package/context/scripts/hooks/lib/resolve-formatter.js +80 -0
- package/context/scripts/hooks/post-edit-accumulator.js +66 -0
- package/context/scripts/hooks/pre-commit-quality.js +194 -0
- package/context/scripts/hooks/quality-gate.js +93 -0
- package/context/scripts/hooks/report-only-guard.js +21 -0
- package/context/scripts/hooks/run-hook.js +136 -0
- package/context/scripts/hooks/stop-format-typecheck.js +141 -0
- package/context/scripts/hooks/stop-goal-fit.js +337 -0
- package/context/scripts/hooks/workflow-steering.js +250 -0
- package/context/scripts/telemetry/console-presets.sh +14 -0
- package/context/scripts/telemetry/install-console-config.sh +214 -0
- package/context/scripts/telemetry/lib/config.sh +85 -0
- package/context/scripts/telemetry/lib/enrich.sh +115 -0
- package/context/scripts/telemetry/lib/redact.sh +22 -0
- package/context/scripts/telemetry/lib/session.sh +63 -0
- package/context/scripts/telemetry/lib/transport.sh +183 -0
- package/context/scripts/telemetry/lib/usage.sh +29 -0
- package/context/scripts/telemetry/sync-agents.sh +173 -0
- package/context/scripts/telemetry/telemetry.conf +23 -0
- package/context/scripts/telemetry/telemetry.sh +387 -0
- package/context/scripts/validate-package.sh +89 -0
- package/context/settings/backlog-provider-settings.json +54 -0
- package/context/templates/core/identity.md +26 -0
- package/context/templates/core/user.md +15 -0
- package/docs/_config.yml +15 -0
- package/docs/_layouts/default.html +87 -0
- package/docs/adr/0001-flow-agents-consumes-flow.md +77 -0
- package/docs/adr/0002-flow-kits-as-extension-unit.md +13 -0
- package/docs/adr/0003-flow-agents-coordinates-kits-and-adapters.md +13 -0
- package/docs/adr/0004-gates-expect-surface-claims.md +15 -0
- package/docs/adr/0005-kubernetes-inspired-resource-contracts.md +48 -0
- package/docs/adr/0006-typescript-first-source-policy.md +98 -0
- package/docs/agent-system-guidebook.md +391 -0
- package/docs/agent-usage-feedback-loop.md +351 -0
- package/docs/assets/favicon.svg +13 -0
- package/docs/assets/og-image.png +0 -0
- package/docs/assets/site.css +774 -0
- package/docs/assets/site.js +139 -0
- package/docs/configurable-workflow-routing.md +174 -0
- package/docs/context-map.md +145 -0
- package/docs/developer-architecture.md +145 -0
- package/docs/developer-hook-setup.md +61 -0
- package/docs/fixture-ownership.md +44 -0
- package/docs/flow-kit-repository-contract.md +180 -0
- package/docs/index.md +129 -0
- package/docs/kontour-resource-contract.md +358 -0
- package/docs/migrations.md +64 -0
- package/docs/north-star.md +322 -0
- package/docs/operating-layers.md +110 -0
- package/docs/repository-structure.md +132 -0
- package/docs/sandbox-policy.md +56 -0
- package/docs/skills-map.md +203 -0
- package/docs/standards-register.md +96 -0
- package/docs/veritas-integration.md +165 -0
- package/docs/work-item-adapters.md +72 -0
- package/docs/workflow-artifact-lifecycle.md +141 -0
- package/docs/workflow-eval-strategy.md +295 -0
- package/docs/workflow-shared-contracts.md +51 -0
- package/docs/workflow-usage-guide.md +443 -0
- package/evals/ARCHITECTURE.md +143 -0
- package/evals/CONVENTIONS.md +58 -0
- package/evals/README.md +128 -0
- package/evals/acceptance/run.sh +29 -0
- package/evals/acceptance/test_claude_harness.sh +242 -0
- package/evals/acceptance/test_codex_harness.sh +108 -0
- package/evals/acceptance/test_kiro_harness.sh +128 -0
- package/evals/cases/dev/404.html +97 -0
- package/evals/cases/dev/code-review.yaml +44 -0
- package/evals/cases/dev/dashboard.html +300 -0
- package/evals/cases/dev/deliver.yaml +66 -0
- package/evals/cases/dev/dependency-update.yaml +16 -0
- package/evals/cases/dev/explore.yaml +20 -0
- package/evals/cases/dev/index.html +370 -0
- package/evals/cases/dev/package-lock.json +28 -0
- package/evals/cases/dev/package.json +16 -0
- package/evals/cases/dev/plan-work.yaml +20 -0
- package/evals/cases/dev/promptfooconfig.yaml +666 -0
- package/evals/cases/dev/search-first.yaml +20 -0
- package/evals/cases/dev/tdd-workflow.yaml +48 -0
- package/evals/cases/dev/verify-work.yaml +44 -0
- package/evals/cases/dev/workflow.yaml +34 -0
- package/evals/ci/run-baseline.sh +283 -0
- package/evals/fixtures/backlog-provider-settings/global-default.json +44 -0
- package/evals/fixtures/backlog-provider-settings/project-override.json +53 -0
- package/evals/fixtures/builder-kit-workflow-state/baseline-freshness-resolution-hint.json +139 -0
- package/evals/fixtures/builder-kit-workflow-state/direct-primitive-stop.json +59 -0
- package/evals/fixtures/builder-kit-workflow-state/empty-board-route-shape.json +55 -0
- package/evals/fixtures/builder-kit-workflow-state/happy-path.json +71 -0
- package/evals/fixtures/builder-kit-workflow-state/mid-work-resume.json +80 -0
- package/evals/fixtures/builder-kit-workflow-state/missing-prestep-recovery.json +65 -0
- package/evals/fixtures/builder-kit-workflow-state/product-build-chaining.json +60 -0
- package/evals/fixtures/builder-kit-workflow-state/stale-continuation-requires-new-probe.json +57 -0
- package/evals/fixtures/console-learning-projection/artifacts/console-learning-correction/learning.json +50 -0
- package/evals/fixtures/console-learning-projection/artifacts/console-learning-open-route/learning.json +41 -0
- package/evals/fixtures/flow-kit-repository/invalid-absolute-path/kit.json +8 -0
- package/evals/fixtures/flow-kit-repository/invalid-asset-section/flows/review.flow.json +6 -0
- package/evals/fixtures/flow-kit-repository/invalid-asset-section/kit.json +11 -0
- package/evals/fixtures/flow-kit-repository/invalid-duplicate-flow/flows/review.flow.json +6 -0
- package/evals/fixtures/flow-kit-repository/invalid-duplicate-flow/kit.json +9 -0
- package/evals/fixtures/flow-kit-repository/invalid-id/flows/review.flow.json +6 -0
- package/evals/fixtures/flow-kit-repository/invalid-id/kit.json +8 -0
- package/evals/fixtures/flow-kit-repository/invalid-malformed-json/kit.json +8 -0
- package/evals/fixtures/flow-kit-repository/invalid-missing-flow/kit.json +8 -0
- package/evals/fixtures/flow-kit-repository/invalid-missing-id/flows/review.flow.json +6 -0
- package/evals/fixtures/flow-kit-repository/invalid-missing-id/kit.json +7 -0
- package/evals/fixtures/flow-kit-repository/invalid-missing-schema-version/flows/review.flow.json +6 -0
- package/evals/fixtures/flow-kit-repository/invalid-missing-schema-version/kit.json +7 -0
- package/evals/fixtures/flow-kit-repository/invalid-name/flows/review.flow.json +6 -0
- package/evals/fixtures/flow-kit-repository/invalid-name/kit.json +8 -0
- package/evals/fixtures/flow-kit-repository/invalid-schema-version/flows/review.flow.json +6 -0
- package/evals/fixtures/flow-kit-repository/invalid-schema-version/kit.json +8 -0
- package/evals/fixtures/flow-kit-repository/invalid-traversal/kit.json +8 -0
- package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/adapters/example.json +3 -0
- package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/assets/example.txt +1 -0
- package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/docs/README.md +3 -0
- package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/flows/runtime.flow.json +26 -0
- package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/kit-evals/example.json +3 -0
- package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/kit-skills/mixed/SKILL.md +3 -0
- package/evals/fixtures/flow-kit-repository/mixed-runtime-kit/kit.json +44 -0
- package/evals/fixtures/flow-kit-repository/valid-local-kit/docs/README.md +3 -0
- package/evals/fixtures/flow-kit-repository/valid-local-kit/flows/review.flow.json +26 -0
- package/evals/fixtures/flow-kit-repository/valid-local-kit/kit.json +20 -0
- package/evals/fixtures/hook-influence/cases.json +336 -0
- package/evals/fixtures/pull-work-provider/github-issues.json +170 -0
- package/evals/fixtures/pull-work-wip-shepherding/global-wip-informs.json +43 -0
- package/evals/fixtures/pull-work-wip-shepherding/personal-wip-blocks.json +42 -0
- package/evals/fixtures/surface-trust/accepted-claim-trust-report.json +31 -0
- package/evals/fixtures/surface-trust/artifact-absent.json +19 -0
- package/evals/fixtures/surface-trust/integrity-mismatch-trust-report.json +32 -0
- package/evals/fixtures/surface-trust/missing-authority-trust-report.json +27 -0
- package/evals/fixtures/surface-trust/provider-absent.json +19 -0
- package/evals/fixtures/surface-trust/rejected-claim-trust-report.json +30 -0
- package/evals/fixtures/surface-trust/stale-claim-trust-snapshot.json +31 -0
- package/evals/fixtures/usage-feedback/sample-full.jsonl +11 -0
- package/evals/fixtures/usage-feedback/sample-outcomes.jsonl +1 -0
- package/evals/fixtures/veritas-governance-adapter/fake-veritas-pass.sh +18 -0
- package/evals/fixtures/veritas-governance-adapter/fake-veritas-secret-fail.sh +10 -0
- package/evals/fixtures/veritas-governance-adapter/fake-veritas-unconfigured.sh +4 -0
- package/evals/integration/test_bundle_install.sh +541 -0
- package/evals/integration/test_console_learning_projection.sh +192 -0
- package/evals/integration/test_context_map.sh +65 -0
- package/evals/integration/test_effective_backlog_settings.sh +58 -0
- package/evals/integration/test_fixture_retirement_audit.sh +58 -0
- package/evals/integration/test_flow_agents_statusline.sh +93 -0
- package/evals/integration/test_flow_kit_repository.sh +90 -0
- package/evals/integration/test_goal_fit_hook.sh +482 -0
- package/evals/integration/test_hook_category_behaviors.sh +190 -0
- package/evals/integration/test_hook_influence_cases.sh +69 -0
- package/evals/integration/test_local_flow_kit_install.sh +145 -0
- package/evals/integration/test_publish_change_helper.sh +176 -0
- package/evals/integration/test_pull_work_provider.sh +140 -0
- package/evals/integration/test_runtime_adapter_activation.sh +106 -0
- package/evals/integration/test_telemetry.sh +485 -0
- package/evals/integration/test_telemetry_doctor.sh +193 -0
- package/evals/integration/test_usage_feedback_dashboard.sh +169 -0
- package/evals/integration/test_usage_feedback_global.sh +117 -0
- package/evals/integration/test_usage_feedback_import.sh +227 -0
- package/evals/integration/test_usage_feedback_outcomes.sh +165 -0
- package/evals/integration/test_usage_feedback_report.sh +263 -0
- package/evals/integration/test_veritas_governance_adapter.sh +235 -0
- package/evals/integration/test_workflow_artifact_cleanup_audit.sh +287 -0
- package/evals/integration/test_workflow_artifacts.sh +1247 -0
- package/evals/integration/test_workflow_sidecar_writer.sh +2112 -0
- package/evals/integration/test_workflow_steering_hook.sh +337 -0
- package/evals/lib/assertions/delegated-to.js +40 -0
- package/evals/lib/assertions/max-tool-calls.js +15 -0
- package/evals/lib/assertions/no-write-tools.js +27 -0
- package/evals/lib/assertions/pass-at-k.js +39 -0
- package/evals/lib/assertions/telemetry-utils.js +105 -0
- package/evals/lib/assertions/tool-called.js +39 -0
- package/evals/lib/assertions/verify-after-fix.js +61 -0
- package/evals/lib/claude-judge.sh +40 -0
- package/evals/lib/claude-provider.sh +74 -0
- package/evals/lib/codex-judge.sh +39 -0
- package/evals/lib/codex-provider.sh +81 -0
- package/evals/lib/eval-dev.sh +5 -0
- package/evals/lib/eval-judge.sh +22 -0
- package/evals/lib/eval-provider.sh +26 -0
- package/evals/lib/eval-report.sh +73 -0
- package/evals/lib/kiro-dev.sh +4 -0
- package/evals/lib/kiro-judge.sh +17 -0
- package/evals/lib/kiro-provider.sh +62 -0
- package/evals/lib/node.sh +111 -0
- package/evals/promptfooconfig.yaml +70 -0
- package/evals/run.sh +309 -0
- package/evals/static/test_evidence_refs.sh +141 -0
- package/evals/static/test_package.sh +407 -0
- package/evals/static/test_repo_hooks.sh +68 -0
- package/evals/static/test_universal_bundles.sh +274 -0
- package/evals/static/test_workflow_skills.sh +1207 -0
- package/install.sh +64 -0
- package/integrations/veritas/flow-agents.adapter.json +138 -0
- package/integrations/veritas/flow-agents.authority-settings.json +26 -0
- package/integrations/veritas/flow-agents.repo-standards.json +82 -0
- package/kits/builder/flows/build.flow.json +218 -0
- package/kits/builder/flows/shape.flow.json +127 -0
- package/kits/builder/kit.json +19 -0
- package/kits/catalog.json +11 -0
- package/package.json +130 -0
- package/packaging/README.md +60 -0
- package/packaging/manifest.json +173 -0
- package/packaging/packs.json +69 -0
- package/powers/dependency-checker/POWER.md +20 -0
- package/powers/dependency-checker/mcp.json +20 -0
- package/powers/playwright/POWER.md +25 -0
- package/powers/playwright/mcp.json +12 -0
- package/prompts/code-audit.md +123 -0
- package/prompts/kcommit.md +88 -0
- package/schemas/backlog-provider-settings.schema.json +138 -0
- package/schemas/workflow-acceptance.schema.json +216 -0
- package/schemas/workflow-critique.schema.json +113 -0
- package/schemas/workflow-evidence.schema.json +357 -0
- package/schemas/workflow-handoff.schema.json +52 -0
- package/schemas/workflow-learning.schema.json +223 -0
- package/schemas/workflow-release.schema.json +172 -0
- package/schemas/workflow-state.schema.json +80 -0
- package/scripts/README.md +111 -0
- package/scripts/build-universal-bundles.js +3 -0
- package/scripts/check-content-boundary.cjs +99 -0
- package/scripts/context-budget/budget-scan.sh +166 -0
- package/scripts/detect-tools.sh +3 -0
- package/scripts/discover-agents.sh +28 -0
- package/scripts/effective-backlog-settings.js +2 -0
- package/scripts/filter-installed-packs.js +2 -0
- package/scripts/flow-kit.js +2 -0
- package/scripts/generate-context-map.js +2 -0
- package/scripts/git-status.sh +49 -0
- package/scripts/hooks/claude-hook-adapter.js +174 -0
- package/scripts/hooks/claude-telemetry-hook.js +115 -0
- package/scripts/hooks/codex-hook-adapter.js +176 -0
- package/scripts/hooks/codex-telemetry-hook.js +95 -0
- package/scripts/hooks/config-protection.js +79 -0
- package/scripts/hooks/desktop-notify.sh +39 -0
- package/scripts/hooks/governance-audit.sh +135 -0
- package/scripts/hooks/lib/audit-transport.sh +40 -0
- package/scripts/hooks/lib/hook-flags.js +49 -0
- package/scripts/hooks/lib/patterns.sh +57 -0
- package/scripts/hooks/lib/resolve-formatter.js +80 -0
- package/scripts/hooks/post-edit-accumulator.js +66 -0
- package/scripts/hooks/pre-commit-quality.js +194 -0
- package/scripts/hooks/quality-gate.js +93 -0
- package/scripts/hooks/report-only-guard.js +21 -0
- package/scripts/hooks/run-hook.js +136 -0
- package/scripts/hooks/stop-format-typecheck.js +141 -0
- package/scripts/hooks/stop-goal-fit.js +337 -0
- package/scripts/hooks/workflow-steering.js +250 -0
- package/scripts/install-codex-home.sh +106 -0
- package/scripts/package.json +3 -0
- package/scripts/promote-workflow-artifact.js +2 -0
- package/scripts/publish-change-helper.js +2 -0
- package/scripts/pull-work-provider.js +2 -0
- package/scripts/setup-repo-hooks.sh +8 -0
- package/scripts/statusline/flow-agents-statusline.js +157 -0
- package/scripts/telemetry/console-presets.sh +14 -0
- package/scripts/telemetry/install-console-config.sh +214 -0
- package/scripts/telemetry/lib/config.sh +85 -0
- package/scripts/telemetry/lib/enrich.sh +115 -0
- package/scripts/telemetry/lib/redact.sh +22 -0
- package/scripts/telemetry/lib/session.sh +63 -0
- package/scripts/telemetry/lib/transport.sh +183 -0
- package/scripts/telemetry/lib/usage.sh +29 -0
- package/scripts/telemetry/sync-agents.sh +173 -0
- package/scripts/telemetry/telemetry.conf +23 -0
- package/scripts/telemetry/telemetry.sh +387 -0
- package/scripts/usage-feedback.js +2 -0
- package/scripts/validate-hook-influence-cases.js +2 -0
- package/scripts/validate-package.sh +89 -0
- package/scripts/validate-source-tree.js +9 -0
- package/skills/agentic-engineering/SKILL.md +62 -0
- package/skills/browser-test/SKILL.md +51 -0
- package/skills/builder-shape/SKILL.md +76 -0
- package/skills/context-budget/SKILL.md +40 -0
- package/skills/deliver/SKILL.md +241 -0
- package/skills/dependency-update/SKILL.md +68 -0
- package/skills/design-probe/SKILL.md +107 -0
- package/skills/eval-rebuild/SKILL.md +39 -0
- package/skills/evidence-gate/SKILL.md +186 -0
- package/skills/execute-plan/SKILL.md +110 -0
- package/skills/explore/SKILL.md +137 -0
- package/skills/feedback-loop/SKILL.md +87 -0
- package/skills/fix-bug/SKILL.md +133 -0
- package/skills/frontend-design/SKILL.md +80 -0
- package/skills/github-cli/SKILL.md +63 -0
- package/skills/idea-to-backlog/SKILL.md +267 -0
- package/skills/knowledge-capture/SKILL.md +55 -0
- package/skills/learning-review/SKILL.md +115 -0
- package/skills/pickup-probe/SKILL.md +114 -0
- package/skills/plan-work/SKILL.md +176 -0
- package/skills/pull-work/SKILL.md +309 -0
- package/skills/release-readiness/SKILL.md +121 -0
- package/skills/review-work/SKILL.md +161 -0
- package/skills/search-first/SKILL.md +66 -0
- package/skills/tdd-workflow/SKILL.md +140 -0
- package/skills/verify-work/SKILL.md +109 -0
- package/src/cli/console-learning-projection.ts +140 -0
- package/src/cli/effective-backlog-settings.ts +99 -0
- package/src/cli/fixture-retirement-audit.ts +154 -0
- package/src/cli/flow-kit.ts +139 -0
- package/src/cli/init.ts +248 -0
- package/src/cli/promote-workflow-artifact.ts +64 -0
- package/src/cli/publish-change-helper.ts +143 -0
- package/src/cli/pull-work-provider.ts +481 -0
- package/src/cli/runtime-adapter.ts +24 -0
- package/src/cli/telemetry-doctor.ts +243 -0
- package/src/cli/usage-feedback.ts +418 -0
- package/src/cli/validate-hook-influence.ts +119 -0
- package/src/cli/validate-source-tree.ts +30 -0
- package/src/cli/validate-workflow-artifacts.ts +411 -0
- package/src/cli/veritas-governance.ts +322 -0
- package/src/cli/workflow-artifact-cleanup-audit.ts +281 -0
- package/src/cli/workflow-sidecar.ts +676 -0
- package/src/cli.ts +95 -0
- package/src/flow-kit/validate.ts +74 -0
- package/src/lib/args.ts +43 -0
- package/src/lib/fs.ts +62 -0
- package/src/lib/workflow-learning-projection.ts +491 -0
- package/src/runtime-adapters.ts +154 -0
- package/src/tools/build-universal-bundles.ts +366 -0
- package/src/tools/common.ts +61 -0
- package/src/tools/filter-installed-packs.ts +129 -0
- package/src/tools/generate-context-map.ts +199 -0
- package/src/tools/validate-package.ts +57 -0
- package/src/tools/validate-source-tree.ts +488 -0
- package/tsconfig.json +19 -0
- package/veritas.claims.json +6 -0
|
@@ -0,0 +1,11 @@
|
|
|
1
|
+
{
|
|
2
|
+
"schema_version": "1.0",
|
|
3
|
+
"kits": [
|
|
4
|
+
{
|
|
5
|
+
"id": "builder",
|
|
6
|
+
"name": "Builder Kit",
|
|
7
|
+
"path": "kits/builder",
|
|
8
|
+
"description": "Flow-backed shaping, planning, build, verification, merge readiness, pull request readiness, and learning workflows."
|
|
9
|
+
}
|
|
10
|
+
]
|
|
11
|
+
}
|
package/package.json
ADDED
|
@@ -0,0 +1,130 @@
|
|
|
1
|
+
{
|
|
2
|
+
"name": "@kontourai/flow-agents",
|
|
3
|
+
"version": "0.1.1",
|
|
4
|
+
"description": "Flow Agents — a Kontour product that applies Flow and Veritas discipline inside the agent tools you already use: Claude Code, Codex, Kiro, and GitHub Actions.",
|
|
5
|
+
"keywords": [
|
|
6
|
+
"agents",
|
|
7
|
+
"ai-agents",
|
|
8
|
+
"workflow",
|
|
9
|
+
"skills",
|
|
10
|
+
"claude-code",
|
|
11
|
+
"codex",
|
|
12
|
+
"kiro",
|
|
13
|
+
"evidence",
|
|
14
|
+
"process-transparency"
|
|
15
|
+
],
|
|
16
|
+
"license": "Apache-2.0",
|
|
17
|
+
"publishConfig": {
|
|
18
|
+
"access": "public"
|
|
19
|
+
},
|
|
20
|
+
"type": "module",
|
|
21
|
+
"repository": {
|
|
22
|
+
"type": "git",
|
|
23
|
+
"url": "git+https://github.com/kontourai/flow-agents.git"
|
|
24
|
+
},
|
|
25
|
+
"engines": {
|
|
26
|
+
"node": ">=22"
|
|
27
|
+
},
|
|
28
|
+
"bin": {
|
|
29
|
+
"flow-agents": "build/src/cli.js",
|
|
30
|
+
"flow-agents-build-bundles": "build/src/cli.js",
|
|
31
|
+
"flow-agents-console-learning-projection": "build/src/cli.js",
|
|
32
|
+
"flow-agents-context-map": "build/src/cli.js",
|
|
33
|
+
"flow-agents-effective-backlog-settings": "build/src/cli.js",
|
|
34
|
+
"flow-agents-filter-installed-packs": "build/src/cli.js",
|
|
35
|
+
"flow-agents-flow-kit": "build/src/cli.js",
|
|
36
|
+
"flow-agents-fixture-retirement-audit": "build/src/cli.js",
|
|
37
|
+
"flow-agents-promote-workflow-artifact": "build/src/cli.js",
|
|
38
|
+
"flow-agents-publish-change": "build/src/cli.js",
|
|
39
|
+
"flow-agents-pull-work-provider": "build/src/cli.js",
|
|
40
|
+
"flow-agents-runtime-adapter": "build/src/cli.js",
|
|
41
|
+
"flow-agents-telemetry-doctor": "build/src/cli.js",
|
|
42
|
+
"flow-agents-usage-feedback": "build/src/cli.js",
|
|
43
|
+
"flow-agents-veritas-governance": "build/src/cli.js",
|
|
44
|
+
"flow-agents-validate-artifacts": "build/src/cli/validate-workflow-artifacts.js",
|
|
45
|
+
"flow-agents-validate-hook-influence": "build/src/cli.js",
|
|
46
|
+
"flow-agents-validate-source": "build/src/cli.js",
|
|
47
|
+
"flow-agents-workflow-artifact-cleanup-audit": "build/src/cli.js",
|
|
48
|
+
"flow-agents-workflow-sidecar": "build/src/cli/workflow-sidecar.js"
|
|
49
|
+
},
|
|
50
|
+
"files": [
|
|
51
|
+
".github/",
|
|
52
|
+
".githooks/",
|
|
53
|
+
"AGENTS.md",
|
|
54
|
+
"CHANGELOG.md",
|
|
55
|
+
"CODE_OF_CONDUCT.md",
|
|
56
|
+
"CONTEXT.md",
|
|
57
|
+
"CONTRIBUTING.md",
|
|
58
|
+
"SECURITY.md",
|
|
59
|
+
"agent-cards/",
|
|
60
|
+
"agents/",
|
|
61
|
+
"build/",
|
|
62
|
+
"console.telemetry.json",
|
|
63
|
+
"context/",
|
|
64
|
+
"docs/",
|
|
65
|
+
"evals/",
|
|
66
|
+
"install.sh",
|
|
67
|
+
"integrations/",
|
|
68
|
+
"kits/",
|
|
69
|
+
"packaging/",
|
|
70
|
+
"powers/",
|
|
71
|
+
"prompts/",
|
|
72
|
+
"schemas/",
|
|
73
|
+
"scripts/",
|
|
74
|
+
"skills/",
|
|
75
|
+
"src/",
|
|
76
|
+
"tsconfig.json",
|
|
77
|
+
"veritas.claims.json",
|
|
78
|
+
"!evals/cases/dev/node_modules/",
|
|
79
|
+
"!evals/results/",
|
|
80
|
+
"!**/.DS_Store",
|
|
81
|
+
"!**/.flow-agents/",
|
|
82
|
+
"!**/.surface/",
|
|
83
|
+
"!**/.telemetry/",
|
|
84
|
+
"!**/.veritas/",
|
|
85
|
+
"!**/node_modules/"
|
|
86
|
+
],
|
|
87
|
+
"scripts": {
|
|
88
|
+
"build": "tsc -p tsconfig.json",
|
|
89
|
+
"typecheck": "tsc -p tsconfig.json --noEmit",
|
|
90
|
+
"validate:source": "npm run build --silent && node build/src/cli.js validate-source",
|
|
91
|
+
"validate:artifacts": "npm run build --silent && node build/src/cli/validate-workflow-artifacts.js",
|
|
92
|
+
"context-map": "npm run build --silent && node build/src/cli.js context-map",
|
|
93
|
+
"context-map:check": "npm run build --silent && node build/src/cli.js context-map --check",
|
|
94
|
+
"build:bundles": "npm run build --silent && node build/src/cli.js build-bundles",
|
|
95
|
+
"filter:packs": "npm run build --silent && node build/src/cli.js filter-installed-packs",
|
|
96
|
+
"validate:package": "npm run build --silent && node build/src/cli.js validate-package",
|
|
97
|
+
"workflow:sidecar": "npm run build --silent && node build/src/cli/workflow-sidecar.js",
|
|
98
|
+
"workflow:validate-artifacts": "npm run build --silent && node build/src/cli/validate-workflow-artifacts.js",
|
|
99
|
+
"flow-kit": "npm run build --silent && node build/src/cli.js flow-kit",
|
|
100
|
+
"effective-backlog-settings": "npm run build --silent && node build/src/cli.js effective-backlog-settings",
|
|
101
|
+
"pull-work-provider": "npm run build --silent && node build/src/cli.js pull-work-provider",
|
|
102
|
+
"publish-change": "npm run build --silent && node build/src/cli.js publish-change",
|
|
103
|
+
"promote-workflow-artifact": "npm run build --silent && node build/src/cli.js promote-workflow-artifact",
|
|
104
|
+
"usage-feedback": "npm run build --silent && node build/src/cli.js usage-feedback",
|
|
105
|
+
"runtime-adapter": "npm run build --silent && node build/src/cli.js runtime-adapter",
|
|
106
|
+
"telemetry-doctor": "npm run build --silent && node build/src/cli.js telemetry-doctor",
|
|
107
|
+
"veritas-governance": "npm run build --silent && node build/src/cli.js veritas-governance",
|
|
108
|
+
"validate:hook-influence": "npm run build --silent && node build/src/cli.js validate-hook-influence",
|
|
109
|
+
"workflow-artifact-cleanup-audit": "npm run build --silent && node build/src/cli.js workflow-artifact-cleanup-audit",
|
|
110
|
+
"fixture:retirement-audit": "npm run build --silent && node build/src/cli.js fixture-retirement-audit",
|
|
111
|
+
"setup:repo-hooks": "bash scripts/setup-repo-hooks.sh",
|
|
112
|
+
"validate:repo-hooks": "bash evals/static/test_repo_hooks.sh",
|
|
113
|
+
"eval": "bash evals/run.sh",
|
|
114
|
+
"eval:static": "bash evals/run.sh static",
|
|
115
|
+
"eval:integration": "bash evals/run.sh integration",
|
|
116
|
+
"eval:acceptance": "bash evals/run.sh acceptance",
|
|
117
|
+
"eval:llm": "bash evals/run.sh llm",
|
|
118
|
+
"eval:llm:codex": "bash evals/run.sh llm dev --runtime codex",
|
|
119
|
+
"eval:llm:claude": "bash evals/run.sh llm dev --runtime claude --judge-runtime codex",
|
|
120
|
+
"promptfoo": "PROMPTFOO_CONFIG_DIR=.promptfoo PROMPTFOO_DISABLE_WAL_MODE=true PROMPTFOO_DISABLE_TELEMETRY=true promptfoo",
|
|
121
|
+
"promptfoo:view": "PROMPTFOO_CONFIG_DIR=.promptfoo PROMPTFOO_DISABLE_WAL_MODE=true PROMPTFOO_DISABLE_TELEMETRY=true promptfoo view",
|
|
122
|
+
"check:content-boundary": "node scripts/check-content-boundary.cjs",
|
|
123
|
+
"prepack": "npm run build --silent && npm run validate:source --"
|
|
124
|
+
},
|
|
125
|
+
"devDependencies": {
|
|
126
|
+
"@types/node": "^22.19.19",
|
|
127
|
+
"promptfoo": "^0.121.15",
|
|
128
|
+
"typescript": "^6.0.3"
|
|
129
|
+
}
|
|
130
|
+
}
|
|
@@ -0,0 +1,60 @@
|
|
|
1
|
+
# Universal Packaging
|
|
2
|
+
|
|
3
|
+
This directory defines the cross-harness packaging layer for Flow Agents.
|
|
4
|
+
|
|
5
|
+
## Canonical Source
|
|
6
|
+
|
|
7
|
+
The repo root stays canonical:
|
|
8
|
+
|
|
9
|
+
- `agents/` contains source Kiro-style agent specs: the `dev` workflow surface plus specialist `tool-*` agents.
|
|
10
|
+
- `agent-cards/` contains discovery metadata for routable orchestrators.
|
|
11
|
+
- `skills/`, `context/`, `powers/`, `prompts/`, `scripts/`, and `evals/` remain shared content.
|
|
12
|
+
- `src/` contains TypeScript CLI, runtime adapter, packaging, validation, context-map, and repository tooling source that compiles to `build/src/`.
|
|
13
|
+
- `packaging/manifest.json` describes target-specific copy rules, profile definitions, substitutions, and model/provider mappings.
|
|
14
|
+
- Generated bundles live under `dist/`, are intentionally untracked, and can be recreated at any time.
|
|
15
|
+
|
|
16
|
+
For the full source/generated/runtime inventory, see [Repository Structure](../docs/repository-structure.md).
|
|
17
|
+
|
|
18
|
+
## Targets
|
|
19
|
+
|
|
20
|
+
- `dist/kiro/` keeps native Kiro JSON agents and rewrites path-bound config through the install token.
|
|
21
|
+
- `dist/claude-code/` exports `.claude/agents/*.md` and `.claude/skills/*/SKILL.md`.
|
|
22
|
+
- `dist/codex/` exports `.codex/agents/*.toml`, `.codex/skills/*/SKILL.md`, and profile config for `kdev` and its Bedrock variant.
|
|
23
|
+
|
|
24
|
+
All targets also receive shared canonical directories where supported: `context/`, `powers/`, `prompts/`, `scripts/`, and `evals/`.
|
|
25
|
+
|
|
26
|
+
`docs/` and `evals/` are intentionally included in generated bundles today. `docs/` gives installed agents durable local reference material, and `evals/` provides install-time and runtime smoke tests for the exported bundle. If bundle size becomes a product constraint, prune these through `packaging/manifest.json` and update install tests rather than deleting generated output by hand.
|
|
27
|
+
|
|
28
|
+
## Generated And Runtime Boundaries
|
|
29
|
+
|
|
30
|
+
`dist/` is a generated export surface, not the source of truth. Installed runtime directories such as `.codex/` and `.claude/` are also not source. They are created from the generated target bundle and installer scripts. If generated or installed hook config is wrong, fix the canonical source, rebuild `dist/`, and reinstall the runtime config.
|
|
31
|
+
|
|
32
|
+
Runtime workflow state under `.flow-agents/<slug>/` is local working memory. Packaging should copy canonical workflow contracts and skills, but it should not publish local task artifacts as product source. Durable outcomes must be promoted into docs, source, schemas, or provider records before merge.
|
|
33
|
+
|
|
34
|
+
## Validation And Build
|
|
35
|
+
|
|
36
|
+
Run the source validator before rebuilding:
|
|
37
|
+
|
|
38
|
+
```bash
|
|
39
|
+
npm run validate:source --
|
|
40
|
+
```
|
|
41
|
+
|
|
42
|
+
Rebuild every target bundle:
|
|
43
|
+
|
|
44
|
+
```bash
|
|
45
|
+
npm run build:bundles --
|
|
46
|
+
```
|
|
47
|
+
|
|
48
|
+
Run static package checks after rebuilding:
|
|
49
|
+
|
|
50
|
+
```bash
|
|
51
|
+
bash evals/run.sh static
|
|
52
|
+
```
|
|
53
|
+
|
|
54
|
+
For telemetry and shell integration coverage:
|
|
55
|
+
|
|
56
|
+
```bash
|
|
57
|
+
bash evals/run.sh integration
|
|
58
|
+
```
|
|
59
|
+
|
|
60
|
+
The builder is stdlib-only so the package stays dependency-free.
|
|
@@ -0,0 +1,173 @@
|
|
|
1
|
+
{
|
|
2
|
+
"canonical_copy_dirs": [
|
|
3
|
+
"agent-cards",
|
|
4
|
+
"context",
|
|
5
|
+
"docs",
|
|
6
|
+
"evals",
|
|
7
|
+
"kits",
|
|
8
|
+
"packaging",
|
|
9
|
+
"powers",
|
|
10
|
+
"prompts",
|
|
11
|
+
"scripts",
|
|
12
|
+
"schemas",
|
|
13
|
+
"skills"
|
|
14
|
+
],
|
|
15
|
+
"root_copy_files": [
|
|
16
|
+
"console.telemetry.json"
|
|
17
|
+
],
|
|
18
|
+
"optional_copy_dirs": [],
|
|
19
|
+
"source_root_aliases": [
|
|
20
|
+
"~/.flow-agents",
|
|
21
|
+
"$HOME/.flow-agents"
|
|
22
|
+
],
|
|
23
|
+
"kiro": {
|
|
24
|
+
"path_token": "__KIRO_PACKAGE_ROOT__",
|
|
25
|
+
"install_default": "~/.flow-agents"
|
|
26
|
+
},
|
|
27
|
+
"claude_code": {
|
|
28
|
+
"task_dir": ".flow-agents",
|
|
29
|
+
"permissions": {
|
|
30
|
+
"defaultMode": "auto"
|
|
31
|
+
},
|
|
32
|
+
"skipDangerousModePermissionPrompt": true
|
|
33
|
+
},
|
|
34
|
+
"codex": {
|
|
35
|
+
"task_dir": ".flow-agents",
|
|
36
|
+
"settings": {
|
|
37
|
+
"approvals_reviewer": "auto_review"
|
|
38
|
+
},
|
|
39
|
+
"tui": {
|
|
40
|
+
"status_line": [
|
|
41
|
+
"model-with-reasoning",
|
|
42
|
+
"project-name",
|
|
43
|
+
"git-branch",
|
|
44
|
+
"task-progress",
|
|
45
|
+
"context-remaining",
|
|
46
|
+
"run-state"
|
|
47
|
+
]
|
|
48
|
+
},
|
|
49
|
+
"features": {
|
|
50
|
+
"multi_agent": true,
|
|
51
|
+
"js_repl": true,
|
|
52
|
+
"hooks": true,
|
|
53
|
+
"apps": false,
|
|
54
|
+
"plugins": false
|
|
55
|
+
},
|
|
56
|
+
"excluded_agents": [
|
|
57
|
+
"dev"
|
|
58
|
+
],
|
|
59
|
+
"profiles": {
|
|
60
|
+
"kdev": {
|
|
61
|
+
"model": "gpt-5.5",
|
|
62
|
+
"model_reasoning_effort": "medium",
|
|
63
|
+
"approval_policy": "on-request",
|
|
64
|
+
"sandbox_mode": "workspace-write",
|
|
65
|
+
"web_search": "cached",
|
|
66
|
+
"developer_instructions": "You are operating in Flow Agents dev mode. You write and modify code, validate it works, and deliver clean results. You own the code; specialist tool-* subagents provide bounded context, review, verification, or parallel implementation support. Use deliver for feature work, fix-bug for bug fixes, plan-work for planning, execute-plan for approved plans, review-work for critique, verify-work for evidence collection, search-first for unfamiliar external APIs, and context-budget when context grows. When a workflow skill or shared contract names a delegate, that delegate is a required gate, not an optional optimization: plan-work must delegate to tool-planner, execute-plan to tool-worker, review-work to tool-code-reviewer and conditionally tool-security-reviewer, verify-work to tool-verifier, and browser/UI verification to tool-playwright when those agents are available. Attempt required delegation even in read-only or partially blocked environments; if blocked, report the blocked gate as NOT_VERIFIED or incomplete with evidence instead of replacing it with a local summary. In progress and final reports, name exact delegate ids such as tool-planner, tool-worker, tool-code-reviewer, tool-security-reviewer, tool-verifier, and tool-playwright so text-only evals remain auditable when telemetry is unavailable. In Codex, required specialist delegation must pass the role through the spawn tool, for example agent_type=tool-worker; do not spawn unnamed/default subagents for Flow Agents worker, planner, reviewer, verifier, or Playwright gates."
|
|
67
|
+
},
|
|
68
|
+
"kdev-br": {
|
|
69
|
+
"model_provider": "amazon-bedrock",
|
|
70
|
+
"model": "amazon.nova-pro-v1:0",
|
|
71
|
+
"model_reasoning_effort": "medium",
|
|
72
|
+
"approval_policy": "on-request",
|
|
73
|
+
"sandbox_mode": "workspace-write",
|
|
74
|
+
"web_search": "cached",
|
|
75
|
+
"developer_instructions": "You are operating in Flow Agents dev mode on Amazon Bedrock. You write and modify code, validate it works, and deliver clean results. You own the code; specialist tool-* subagents provide bounded context, review, verification, or parallel implementation support. Use deliver for feature work, fix-bug for bug fixes, plan-work for planning, execute-plan for approved plans, review-work for critique, verify-work for evidence collection, search-first for unfamiliar external APIs, and context-budget when context grows. When a workflow skill or shared contract names a delegate, that delegate is a required gate, not an optional optimization: plan-work must delegate to tool-planner, execute-plan to tool-worker, review-work to tool-code-reviewer and conditionally tool-security-reviewer, verify-work to tool-verifier, and browser/UI verification to tool-playwright when those agents are available. Attempt required delegation even in read-only or partially blocked environments; if blocked, report the blocked gate as NOT_VERIFIED or incomplete with evidence instead of replacing it with a local summary. In progress and final reports, name exact delegate ids such as tool-planner, tool-worker, tool-code-reviewer, tool-security-reviewer, tool-verifier, and tool-playwright so text-only evals remain auditable when telemetry is unavailable. In Codex, required specialist delegation must pass the role through the spawn tool, for example agent_type=tool-worker; do not spawn unnamed/default subagents for Flow Agents worker, planner, reviewer, verifier, or Playwright gates."
|
|
76
|
+
}
|
|
77
|
+
},
|
|
78
|
+
"model_providers": {
|
|
79
|
+
"amazon-bedrock": {
|
|
80
|
+
"aws": {
|
|
81
|
+
"region": "us-east-1"
|
|
82
|
+
}
|
|
83
|
+
}
|
|
84
|
+
}
|
|
85
|
+
},
|
|
86
|
+
"tool_name_map": {
|
|
87
|
+
"read": "Read",
|
|
88
|
+
"imageRead": "Read",
|
|
89
|
+
"code": "Edit",
|
|
90
|
+
"write": "Write",
|
|
91
|
+
"shell": "Bash",
|
|
92
|
+
"ls": "Bash",
|
|
93
|
+
"grep": "Grep",
|
|
94
|
+
"glob": "Glob",
|
|
95
|
+
"subagent": "Task",
|
|
96
|
+
"web_fetch": "WebFetch",
|
|
97
|
+
"web_search": "WebSearch"
|
|
98
|
+
},
|
|
99
|
+
"claude_model_map": {
|
|
100
|
+
"claude-opus": "opus",
|
|
101
|
+
"claude-sonnet": "sonnet",
|
|
102
|
+
"default": "sonnet"
|
|
103
|
+
},
|
|
104
|
+
"codex_model_map": {
|
|
105
|
+
"claude-opus": "gpt-5.5",
|
|
106
|
+
"claude-sonnet": "gpt-5.5",
|
|
107
|
+
"kimi-k2.5": "gpt-5.3-codex-spark",
|
|
108
|
+
"agi-nova-beta-1m": "gpt-5.4-mini",
|
|
109
|
+
"default": "gpt-5.4-mini"
|
|
110
|
+
},
|
|
111
|
+
"codex_reasoning_map": {
|
|
112
|
+
"claude-opus": "high",
|
|
113
|
+
"claude-sonnet": "low",
|
|
114
|
+
"kimi-k2.5": "low",
|
|
115
|
+
"agi-nova-beta-1m": "medium",
|
|
116
|
+
"default": "medium"
|
|
117
|
+
},
|
|
118
|
+
"target_substitutions": {
|
|
119
|
+
"common": [],
|
|
120
|
+
"claude_code": [
|
|
121
|
+
{
|
|
122
|
+
"from": "Claude Code",
|
|
123
|
+
"to": "Claude Code"
|
|
124
|
+
},
|
|
125
|
+
{
|
|
126
|
+
"from": "todo tool",
|
|
127
|
+
"to": "todo tool"
|
|
128
|
+
},
|
|
129
|
+
{
|
|
130
|
+
"from": "delegate to a specialist agent",
|
|
131
|
+
"to": "delegate to a specialist agent"
|
|
132
|
+
},
|
|
133
|
+
{
|
|
134
|
+
"from": "run shell commands",
|
|
135
|
+
"to": "run shell commands"
|
|
136
|
+
},
|
|
137
|
+
{
|
|
138
|
+
"from": "read files",
|
|
139
|
+
"to": "read files"
|
|
140
|
+
},
|
|
141
|
+
{
|
|
142
|
+
"from": "write files",
|
|
143
|
+
"to": "write files"
|
|
144
|
+
}
|
|
145
|
+
],
|
|
146
|
+
"codex": [
|
|
147
|
+
{
|
|
148
|
+
"from": "Claude Code",
|
|
149
|
+
"to": "Codex"
|
|
150
|
+
},
|
|
151
|
+
{
|
|
152
|
+
"from": "todo tool",
|
|
153
|
+
"to": "todo tool"
|
|
154
|
+
},
|
|
155
|
+
{
|
|
156
|
+
"from": "delegate to a specialist agent",
|
|
157
|
+
"to": "delegate to a native subagent"
|
|
158
|
+
},
|
|
159
|
+
{
|
|
160
|
+
"from": "run shell commands",
|
|
161
|
+
"to": "run shell commands"
|
|
162
|
+
},
|
|
163
|
+
{
|
|
164
|
+
"from": "read files",
|
|
165
|
+
"to": "read files"
|
|
166
|
+
},
|
|
167
|
+
{
|
|
168
|
+
"from": "write files",
|
|
169
|
+
"to": "write files"
|
|
170
|
+
}
|
|
171
|
+
]
|
|
172
|
+
}
|
|
173
|
+
}
|
|
@@ -0,0 +1,69 @@
|
|
|
1
|
+
{
|
|
2
|
+
"schema_version": "1.0",
|
|
3
|
+
"packs": [
|
|
4
|
+
{
|
|
5
|
+
"name": "core",
|
|
6
|
+
"default": true,
|
|
7
|
+
"description": "Small default surface for reliable coding and workflow execution.",
|
|
8
|
+
"skills": [
|
|
9
|
+
"search-first",
|
|
10
|
+
"plan-work",
|
|
11
|
+
"execute-plan",
|
|
12
|
+
"review-work",
|
|
13
|
+
"verify-work",
|
|
14
|
+
"evidence-gate",
|
|
15
|
+
"feedback-loop",
|
|
16
|
+
"knowledge-capture",
|
|
17
|
+
"browser-test"
|
|
18
|
+
],
|
|
19
|
+
"agents": [
|
|
20
|
+
"tool-planner",
|
|
21
|
+
"tool-worker",
|
|
22
|
+
"tool-verifier",
|
|
23
|
+
"tool-code-reviewer",
|
|
24
|
+
"tool-playwright"
|
|
25
|
+
],
|
|
26
|
+
"powers": [
|
|
27
|
+
"playwright"
|
|
28
|
+
]
|
|
29
|
+
},
|
|
30
|
+
{
|
|
31
|
+
"name": "development",
|
|
32
|
+
"default": false,
|
|
33
|
+
"description": "Development workflow depth for backlog, release, dependency, GitHub, TDD, and frontend work.",
|
|
34
|
+
"skills": [
|
|
35
|
+
"builder-shape",
|
|
36
|
+
"idea-to-backlog",
|
|
37
|
+
"pull-work",
|
|
38
|
+
"design-probe",
|
|
39
|
+
"pickup-probe",
|
|
40
|
+
"deliver",
|
|
41
|
+
"fix-bug",
|
|
42
|
+
"tdd-workflow",
|
|
43
|
+
"release-readiness",
|
|
44
|
+
"learning-review",
|
|
45
|
+
"dependency-update",
|
|
46
|
+
"eval-rebuild",
|
|
47
|
+
"explore",
|
|
48
|
+
"github-cli",
|
|
49
|
+
"frontend-design",
|
|
50
|
+
"agentic-engineering",
|
|
51
|
+
"context-budget"
|
|
52
|
+
],
|
|
53
|
+
"agents": [
|
|
54
|
+
"dev",
|
|
55
|
+
"tool-dependencies-updater",
|
|
56
|
+
"tool-security-reviewer",
|
|
57
|
+
"tool-explore-config",
|
|
58
|
+
"tool-explore-deps",
|
|
59
|
+
"tool-explore-entry",
|
|
60
|
+
"tool-explore-patterns",
|
|
61
|
+
"tool-explore-structure",
|
|
62
|
+
"tool-explore-tests"
|
|
63
|
+
],
|
|
64
|
+
"powers": [
|
|
65
|
+
"dependency-checker"
|
|
66
|
+
]
|
|
67
|
+
}
|
|
68
|
+
]
|
|
69
|
+
}
|
|
@@ -0,0 +1,20 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: "dependency-checker"
|
|
3
|
+
displayName: "Dependency Version Checker"
|
|
4
|
+
description: "Check latest versions, identify outdated packages, and find security advisories across npm, PyPI, Cargo, Maven, Go, NuGet, Ruby, PHP, Swift, Dart, Docker, Helm, Terraform, and GitHub Actions"
|
|
5
|
+
keywords: ["dependencies", "outdated", "update", "upgrade", "version", "security", "advisory", "cve", "vulnerability", "npm", "pypi", "cargo", "maven", "package"]
|
|
6
|
+
---
|
|
7
|
+
|
|
8
|
+
# Dependency Version Checker
|
|
9
|
+
|
|
10
|
+
Check package versions and security advisories across all major ecosystems.
|
|
11
|
+
|
|
12
|
+
## Available Tools
|
|
13
|
+
- `package-version-check` — Batch version lookups across ecosystems
|
|
14
|
+
- `package-registry` — GitHub Security Advisory search
|
|
15
|
+
|
|
16
|
+
## Workflow
|
|
17
|
+
1. Scan project for dependency manifests (package.json, requirements.txt, Cargo.toml, etc.)
|
|
18
|
+
2. Use `package-version-check` tools to batch-check versions by ecosystem
|
|
19
|
+
3. Use `package-registry` to search GitHub Security Advisories for outdated packages
|
|
20
|
+
4. Report grouped by risk: CRITICAL (CVEs), MAJOR (breaking), MINOR (safe updates)
|
|
@@ -0,0 +1,20 @@
|
|
|
1
|
+
{
|
|
2
|
+
"mcpServers": {
|
|
3
|
+
"package-registry": {
|
|
4
|
+
"args": [
|
|
5
|
+
"package-registry-mcp"
|
|
6
|
+
],
|
|
7
|
+
"command": "npx"
|
|
8
|
+
},
|
|
9
|
+
"package-version-check": {
|
|
10
|
+
"args": [
|
|
11
|
+
"package-version-check-mcp",
|
|
12
|
+
"--mode=stdio"
|
|
13
|
+
],
|
|
14
|
+
"command": "uvx",
|
|
15
|
+
"env": {
|
|
16
|
+
"GITHUB_PAT": "${GITHUB_TOKEN}"
|
|
17
|
+
}
|
|
18
|
+
}
|
|
19
|
+
}
|
|
20
|
+
}
|
|
@@ -0,0 +1,25 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: "playwright"
|
|
3
|
+
displayName: "Playwright Browser"
|
|
4
|
+
description: "Browser automation and testing via Playwright - load pages, test navigation, check accessibility via structured snapshots, evaluate scripts, fill forms, take screenshots"
|
|
5
|
+
keywords: ["playwright", "browser", "accessibility", "dom", "screenshot", "debug", "frontend", "testing", "automation", "web"]
|
|
6
|
+
---
|
|
7
|
+
|
|
8
|
+
# Playwright Browser
|
|
9
|
+
|
|
10
|
+
Provides browser automation and testing through Microsoft's Playwright MCP server. Uses structured accessibility snapshots instead of pixel-based approaches for efficient, LLM-friendly browser interaction.
|
|
11
|
+
|
|
12
|
+
## Available Tools
|
|
13
|
+
- All tools from the `playwright` MCP server (navigate, click, type, snapshot, screenshot, etc.)
|
|
14
|
+
|
|
15
|
+
## When to Use
|
|
16
|
+
- Loading and navigating real web pages
|
|
17
|
+
- Testing frontend accessibility via structured accessibility snapshots (ARIA roles, tab order)
|
|
18
|
+
- Evaluating JavaScript in a live browser context
|
|
19
|
+
- Taking screenshots for visual verification
|
|
20
|
+
- Filling forms, clicking buttons, testing user flows
|
|
21
|
+
- Generating and validating Playwright test code
|
|
22
|
+
|
|
23
|
+
## NOT For
|
|
24
|
+
- General web search or fetching page content for research
|
|
25
|
+
- Scraping data from websites
|
|
@@ -0,0 +1,123 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: code-audit
|
|
3
|
+
description: "Iterative codebase audit. First run: broad sweep. Subsequent runs: deep dive into specific dimensions or files."
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# Codebase Audit — {depth:broad} | Focus: {focus:all}
|
|
7
|
+
|
|
8
|
+
Audit this repository at the requested depth and focus. Every finding must reference specific files and line ranges.
|
|
9
|
+
|
|
10
|
+
## Depth Modes
|
|
11
|
+
|
|
12
|
+
**broad** (default) — Scan the full codebase across all dimensions. Produce a summary-level report with the top findings per dimension. Don't read every file — sample key files, entry points, and the largest/most-changed modules. Goal: identify WHERE the problems are so subsequent deep dives are targeted.
|
|
13
|
+
|
|
14
|
+
**deep** — Exhaustive analysis of the specified focus area. Read every relevant file. Trace call chains, map dependencies, and propose specific refactors with before/after sketches. If focus is "all", pick the highest-severity dimension from a prior broad scan and go deep on that.
|
|
15
|
+
|
|
16
|
+
## Focus Areas
|
|
17
|
+
|
|
18
|
+
When focus is not "all", restrict analysis to the specified dimension:
|
|
19
|
+
|
|
20
|
+
- **dry** — Duplicated logic, repeated patterns, copy-pasted code, magic values that should be constants
|
|
21
|
+
- **abstractions** — SRP violations, missing interfaces, tight coupling, business logic in wrong layers, god objects
|
|
22
|
+
- **testability** — Hardcoded dependencies, side effects, missing DI, logic buried in framework code, test organization
|
|
23
|
+
- **errors** — Swallowed exceptions, missing error handling on I/O/network, inconsistent error formats, missing retries/timeouts
|
|
24
|
+
- **naming** — Misleading names, overgrown files, dead code, TODO/FIXME/HACK markers, structural inconsistencies
|
|
25
|
+
- **security** — Hardcoded secrets, missing input validation, overly permissive access, sensitive data in logs
|
|
26
|
+
- **dependencies** — Outdated/unused deps, missing lock files, build script complexity
|
|
27
|
+
|
|
28
|
+
## Scope Narrowing
|
|
29
|
+
|
|
30
|
+
{scope?}
|
|
31
|
+
|
|
32
|
+
If a scope is provided above (file paths, directories, or module names), restrict the audit to those areas only. Otherwise audit the full repository.
|
|
33
|
+
|
|
34
|
+
## Phase 1: Orient
|
|
35
|
+
|
|
36
|
+
Before analyzing, understand the codebase:
|
|
37
|
+
|
|
38
|
+
1. Map directory structure, tech stack, frameworks, languages
|
|
39
|
+
2. Identify entry points (main files, API routes, CLI commands, exports)
|
|
40
|
+
3. Read key config files (package.json, pyproject.toml, Cargo.toml, Dockerfile, etc.)
|
|
41
|
+
4. Understand testing setup (frameworks, coverage, test locations)
|
|
42
|
+
5. Check for linting/formatting config
|
|
43
|
+
|
|
44
|
+
**If depth is deep**: Also read the specific files in the focus area thoroughly. Trace imports and call chains. Understand how the focused code interacts with the rest of the system.
|
|
45
|
+
|
|
46
|
+
Summarize your understanding before proceeding.
|
|
47
|
+
|
|
48
|
+
## Phase 2: Analyze
|
|
49
|
+
|
|
50
|
+
### Broad Mode
|
|
51
|
+
|
|
52
|
+
For each applicable dimension, scan for the most significant issues. Limit to 3-5 findings per dimension. For each:
|
|
53
|
+
|
|
54
|
+
```
|
|
55
|
+
### [Finding Title] — Severity: HIGH | MEDIUM | LOW
|
|
56
|
+
|
|
57
|
+
**Files:** `path/to/file.ext` (lines X-Y)
|
|
58
|
+
**Problem:** What's wrong and why it matters.
|
|
59
|
+
**Recommendation:** One-line concrete fix direction.
|
|
60
|
+
```
|
|
61
|
+
|
|
62
|
+
### Deep Mode
|
|
63
|
+
|
|
64
|
+
For the focused dimension, provide exhaustive analysis:
|
|
65
|
+
|
|
66
|
+
```
|
|
67
|
+
### [Finding Title] — Severity: HIGH | MEDIUM | LOW
|
|
68
|
+
|
|
69
|
+
**Files:** `path/to/file.ext` (lines X-Y), `path/to/other.ext` (lines A-B)
|
|
70
|
+
|
|
71
|
+
**Problem:** Detailed explanation — what's wrong, why it happened, what breaks or degrades because of it.
|
|
72
|
+
|
|
73
|
+
**Current pattern:**
|
|
74
|
+
[Show the actual problematic code or describe the pattern concretely]
|
|
75
|
+
|
|
76
|
+
**Proposed refactor:**
|
|
77
|
+
[Show the target structure — new files, interfaces, function signatures. Not full implementations, but enough to be unambiguous about the design.]
|
|
78
|
+
|
|
79
|
+
**Migration path:** Steps to get from current to proposed without breaking things.
|
|
80
|
+
|
|
81
|
+
**Effort:** S / M / L
|
|
82
|
+
**Risk:** What could go wrong during the refactor.
|
|
83
|
+
```
|
|
84
|
+
|
|
85
|
+
## Phase 3: Report
|
|
86
|
+
|
|
87
|
+
### Broad Mode Report
|
|
88
|
+
|
|
89
|
+
1. **Heatmap** — Which dimensions have the most/worst findings? Rank them.
|
|
90
|
+
2. **Top 5 highest-impact changes** — Best ROI refactors across all dimensions.
|
|
91
|
+
3. **Recommended deep dive order** — Which focus area to audit next and why.
|
|
92
|
+
4. **What's done well** — Patterns worth preserving.
|
|
93
|
+
|
|
94
|
+
### Deep Mode Report
|
|
95
|
+
|
|
96
|
+
1. **All findings** for the focused dimension, ordered by severity then effort.
|
|
97
|
+
2. **Dependency graph** — How do the findings relate? Which refactors unlock others?
|
|
98
|
+
3. **Suggested implementation order** — Sequence the fixes to minimize risk and maximize incremental value.
|
|
99
|
+
4. **What's done well** in this dimension — patterns to extend, not just problems to fix.
|
|
100
|
+
|
|
101
|
+
## Severity Guide
|
|
102
|
+
|
|
103
|
+
- **HIGH** — Actively causes bugs, security issues, or makes the codebase significantly harder to maintain
|
|
104
|
+
- **MEDIUM** — Creates friction, slows development, or will become a problem as the codebase grows
|
|
105
|
+
- **LOW** — Cleanup opportunity, style improvement, or minor inconsistency
|
|
106
|
+
|
|
107
|
+
## Rules
|
|
108
|
+
|
|
109
|
+
- Be specific. "This function is too long" is useless. "This function handles parsing, validation, and persistence — split into three" is useful.
|
|
110
|
+
- Don't nitpick formatting if a formatter/linter is configured — focus on logic and design.
|
|
111
|
+
- Don't suggest adding tests — focus on making code testable. The user will decide when to write tests.
|
|
112
|
+
- Respect the existing tech stack. Don't suggest rewrites in different languages/frameworks.
|
|
113
|
+
- In broad mode, prioritize breadth over depth. In deep mode, prioritize thoroughness.
|
|
114
|
+
- If you've seen a prior broad audit in this conversation, reference those findings and go deeper — don't repeat the surface-level observations.
|
|
115
|
+
- If the codebase is small, say so and keep the audit proportional.
|
|
116
|
+
|
|
117
|
+
## Iterative Usage
|
|
118
|
+
|
|
119
|
+
Typical progression:
|
|
120
|
+
1. `@code-audit` — broad sweep, identify hotspots
|
|
121
|
+
2. `@code-audit deep abstractions` — deep dive into worst dimension
|
|
122
|
+
3. `@code-audit deep testability src/agents/` — targeted deep dive on specific code
|
|
123
|
+
4. Repeat until satisfied
|