agentic-orchestrator 0.1.2 → 0.1.4
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude/settings.local.json +15 -0
- package/CLAUDE.md +126 -0
- package/README.md +166 -25
- package/agentic/orchestrator/adapters.yaml +3 -0
- package/agentic/orchestrator/gates.yaml +47 -0
- package/agentic/orchestrator/policy.yaml +89 -0
- package/agentic/orchestrator/schemas/adapters.schema.json +12 -0
- package/agentic/orchestrator/schemas/gates.schema.json +6 -1
- package/agentic/orchestrator/schemas/index.schema.json +14 -0
- package/agentic/orchestrator/schemas/multi-project.schema.json +41 -0
- package/agentic/orchestrator/schemas/policy.schema.json +449 -52
- package/agentic/orchestrator/schemas/state.schema.json +16 -0
- package/agentic/orchestrator/tools/catalog.json +68 -0
- package/agentic/orchestrator/tools/schemas/input/cost.get.input.schema.json +10 -0
- package/agentic/orchestrator/tools/schemas/input/cost.record.input.schema.json +13 -0
- package/agentic/orchestrator/tools/schemas/input/feature.send_message.input.schema.json +11 -0
- package/agentic/orchestrator/tools/schemas/input/performance.get_analytics.input.schema.json +10 -0
- package/agentic/orchestrator/tools/schemas/input/performance.record_outcome.input.schema.json +18 -0
- package/agentic/orchestrator/tools/schemas/output/cost.get.output.schema.json +13 -0
- package/agentic/orchestrator/tools/schemas/output/cost.record.output.schema.json +13 -0
- package/agentic/orchestrator/tools/schemas/output/feature.ready_to_merge.output.schema.json +7 -0
- package/agentic/orchestrator/tools/schemas/output/feature.send_message.output.schema.json +23 -0
- package/agentic/orchestrator/tools/schemas/output/performance.get_analytics.output.schema.json +46 -0
- package/agentic/orchestrator/tools/schemas/output/performance.record_outcome.output.schema.json +10 -0
- package/agentic/orchestrator/tools.md +5 -0
- package/apps/control-plane/scripts/validate-architecture-rules.mjs +28 -2
- package/apps/control-plane/scripts/validate-docker-mcp-contract.mjs +12 -0
- package/apps/control-plane/scripts/validate-mcp-contracts.ts +92 -0
- package/apps/control-plane/src/application/adapters/adapter-registry.ts +169 -0
- package/apps/control-plane/src/application/multi-project-loader.ts +119 -0
- package/apps/control-plane/src/application/services/activity-monitor-service.ts +199 -0
- package/apps/control-plane/src/application/services/cost-tracking-service.ts +82 -0
- package/apps/control-plane/src/application/services/dependency-scheduler-service.ts +86 -0
- package/apps/control-plane/src/application/services/feature-deletion-service.ts +8 -7
- package/apps/control-plane/src/application/services/gate-interpolation-service.ts +15 -0
- package/apps/control-plane/src/application/services/gate-service.ts +38 -2
- package/apps/control-plane/src/application/services/instance-isolation-service.ts +18 -0
- package/apps/control-plane/src/application/services/issue-tracker-service.ts +469 -0
- package/apps/control-plane/src/application/services/merge-service.ts +67 -3
- package/apps/control-plane/src/application/services/notifier-service.ts +295 -0
- package/apps/control-plane/src/application/services/performance-analytics-service.ts +122 -0
- package/apps/control-plane/src/application/services/plan-service.ts +51 -0
- package/apps/control-plane/src/application/services/pr-monitor-service.ts +262 -0
- package/apps/control-plane/src/application/services/reactions-service.ts +175 -0
- package/apps/control-plane/src/application/services/reporting-service.ts +17 -2
- package/apps/control-plane/src/application/services/run-lease-service.ts +16 -38
- package/apps/control-plane/src/application/tools/tool-metadata.ts +4 -1
- package/apps/control-plane/src/cli/aop.ts +1 -1
- package/apps/control-plane/src/cli/attach-command-handler.ts +120 -0
- package/apps/control-plane/src/cli/cleanup-command-handler.ts +190 -0
- package/apps/control-plane/src/cli/cli-argument-parser.ts +69 -3
- package/apps/control-plane/src/cli/dashboard-command-handler.ts +57 -0
- package/apps/control-plane/src/cli/help-command-handler.ts +163 -0
- package/apps/control-plane/src/cli/init-command-handler.ts +609 -0
- package/apps/control-plane/src/cli/resume-command-handler.ts +1 -0
- package/apps/control-plane/src/cli/retry-command-handler.ts +138 -0
- package/apps/control-plane/src/cli/run-command-handler.ts +115 -3
- package/apps/control-plane/src/cli/send-command-handler.ts +65 -0
- package/apps/control-plane/src/cli/status-command-handler.ts +102 -2
- package/apps/control-plane/src/cli/types.ts +26 -1
- package/apps/control-plane/src/core/constants.ts +8 -2
- package/apps/control-plane/src/core/error-codes.ts +3 -1
- package/apps/control-plane/src/core/gates.ts +170 -50
- package/apps/control-plane/src/core/kernel.ts +280 -5
- package/apps/control-plane/src/core/path-layout.ts +12 -0
- package/apps/control-plane/src/core/tool-caller.ts +36 -0
- package/apps/control-plane/src/core/workspace-hooks.ts +87 -0
- package/apps/control-plane/src/interfaces/cli/bootstrap.ts +258 -9
- package/apps/control-plane/src/providers/providers.ts +235 -14
- package/apps/control-plane/src/supervisor/build-wave-executor.ts +129 -8
- package/apps/control-plane/src/supervisor/qa-wave-executor.ts +123 -5
- package/apps/control-plane/src/supervisor/run-coordinator.ts +143 -6
- package/apps/control-plane/src/supervisor/runtime.ts +135 -6
- package/apps/control-plane/src/supervisor/types.ts +12 -21
- package/apps/control-plane/src/supervisor/worker-decision-loop.ts +8 -0
- package/apps/control-plane/test/activity-monitor.spec.ts +294 -0
- package/apps/control-plane/test/adapter-registry.spec.ts +132 -0
- package/apps/control-plane/test/batch-operations.spec.ts +112 -0
- package/apps/control-plane/test/bootstrap-attach.spec.ts +102 -0
- package/apps/control-plane/test/bootstrap-edge-cases.spec.ts +252 -0
- package/apps/control-plane/test/bootstrap.spec.ts +560 -0
- package/apps/control-plane/test/cleanup-command.spec.ts +301 -0
- package/apps/control-plane/test/cli-helpers.spec.ts +404 -1
- package/apps/control-plane/test/cli.unit.spec.ts +182 -1
- package/apps/control-plane/test/collision-queue.spec.ts +104 -1
- package/apps/control-plane/test/core-utils.spec.ts +175 -2
- package/apps/control-plane/test/cost-tracking.spec.ts +143 -0
- package/apps/control-plane/test/dashboard-api.integration.spec.ts +247 -0
- package/apps/control-plane/test/dashboard-client.spec.ts +116 -0
- package/apps/control-plane/test/dashboard-command.spec.ts +103 -0
- package/apps/control-plane/test/dependency-scheduler.spec.ts +189 -0
- package/apps/control-plane/test/epoch-tracking.spec.ts +4 -4
- package/apps/control-plane/test/feature-deletion-service.spec.ts +422 -0
- package/apps/control-plane/test/feature-lifecycle.spec.ts +202 -0
- package/apps/control-plane/test/git-spawn-error.spec.ts +24 -0
- package/apps/control-plane/test/incremental-gates.spec.ts +137 -0
- package/apps/control-plane/test/init-wizard.spec.ts +506 -0
- package/apps/control-plane/test/instance-isolation.spec.ts +83 -0
- package/apps/control-plane/test/issue-tracker.spec.ts +890 -0
- package/apps/control-plane/test/kernel.coverage.spec.ts +3 -5
- package/apps/control-plane/test/kernel.coverage2.spec.ts +871 -0
- package/apps/control-plane/test/kernel.spec.ts +13 -11
- package/apps/control-plane/test/lock-service.spec.ts +508 -0
- package/apps/control-plane/test/mcp-helpers.spec.ts +176 -0
- package/apps/control-plane/test/mcp.spec.ts +50 -15
- package/apps/control-plane/test/merge-service.spec.ts +67 -4
- package/apps/control-plane/test/multi-project.spec.ts +372 -0
- package/apps/control-plane/test/notifier-service.spec.ts +388 -0
- package/apps/control-plane/test/parallel-gates.spec.ts +312 -0
- package/apps/control-plane/test/patch-service.spec.ts +253 -0
- package/apps/control-plane/test/performance-analytics.spec.ts +338 -0
- package/apps/control-plane/test/planning-wave-executor.spec.ts +168 -0
- package/apps/control-plane/test/pr-monitor.spec.ts +385 -0
- package/apps/control-plane/test/providers.spec.ts +344 -1
- package/apps/control-plane/test/reactions.spec.ts +392 -0
- package/apps/control-plane/test/resume-command.spec.ts +390 -0
- package/apps/control-plane/test/run-coordinator.spec.ts +481 -2
- package/apps/control-plane/test/schema-date-time.spec.ts +46 -0
- package/apps/control-plane/test/service-retry-paths.spec.ts +30 -0
- package/apps/control-plane/test/services.spec.ts +95 -2
- package/apps/control-plane/test/session-management.spec.ts +450 -0
- package/apps/control-plane/test/spec-ingestion.spec.ts +190 -0
- package/apps/control-plane/test/supervisor-collaborators.spec.ts +699 -2
- package/apps/control-plane/test/supervisor.spec.ts +36 -30
- package/apps/control-plane/test/supervisor.unit.spec.ts +405 -0
- package/apps/control-plane/test/worker-decision-loop.spec.ts +57 -0
- package/apps/control-plane/test/workspace-hooks.spec.ts +177 -0
- package/apps/control-plane/vitest.config.ts +21 -5
- package/dist/apps/control-plane/application/adapters/adapter-registry.d.ts +44 -0
- package/dist/apps/control-plane/application/adapters/adapter-registry.js +76 -0
- package/dist/apps/control-plane/application/adapters/adapter-registry.js.map +1 -0
- package/dist/apps/control-plane/application/multi-project-loader.d.ts +31 -0
- package/dist/apps/control-plane/application/multi-project-loader.js +82 -0
- package/dist/apps/control-plane/application/multi-project-loader.js.map +1 -0
- package/dist/apps/control-plane/application/services/activity-monitor-service.d.ts +43 -0
- package/dist/apps/control-plane/application/services/activity-monitor-service.js +132 -0
- package/dist/apps/control-plane/application/services/activity-monitor-service.js.map +1 -0
- package/dist/apps/control-plane/application/services/cost-tracking-service.d.ts +28 -0
- package/dist/apps/control-plane/application/services/cost-tracking-service.js +48 -0
- package/dist/apps/control-plane/application/services/cost-tracking-service.js.map +1 -0
- package/dist/apps/control-plane/application/services/dependency-scheduler-service.d.ts +26 -0
- package/dist/apps/control-plane/application/services/dependency-scheduler-service.js +75 -0
- package/dist/apps/control-plane/application/services/dependency-scheduler-service.js.map +1 -0
- package/dist/apps/control-plane/application/services/feature-deletion-service.d.ts +2 -0
- package/dist/apps/control-plane/application/services/feature-deletion-service.js +6 -7
- package/dist/apps/control-plane/application/services/feature-deletion-service.js.map +1 -1
- package/dist/apps/control-plane/application/services/gate-interpolation-service.d.ts +7 -0
- package/dist/apps/control-plane/application/services/gate-interpolation-service.js +7 -0
- package/dist/apps/control-plane/application/services/gate-interpolation-service.js.map +1 -0
- package/dist/apps/control-plane/application/services/gate-service.js +32 -2
- package/dist/apps/control-plane/application/services/gate-service.js.map +1 -1
- package/dist/apps/control-plane/application/services/instance-isolation-service.d.ts +11 -0
- package/dist/apps/control-plane/application/services/instance-isolation-service.js +17 -0
- package/dist/apps/control-plane/application/services/instance-isolation-service.js.map +1 -0
- package/dist/apps/control-plane/application/services/issue-tracker-service.d.ts +65 -0
- package/dist/apps/control-plane/application/services/issue-tracker-service.js +358 -0
- package/dist/apps/control-plane/application/services/issue-tracker-service.js.map +1 -0
- package/dist/apps/control-plane/application/services/merge-service.d.ts +4 -0
- package/dist/apps/control-plane/application/services/merge-service.js +44 -2
- package/dist/apps/control-plane/application/services/merge-service.js.map +1 -1
- package/dist/apps/control-plane/application/services/notifier-service.d.ts +74 -0
- package/dist/apps/control-plane/application/services/notifier-service.js +212 -0
- package/dist/apps/control-plane/application/services/notifier-service.js.map +1 -0
- package/dist/apps/control-plane/application/services/performance-analytics-service.d.ts +39 -0
- package/dist/apps/control-plane/application/services/performance-analytics-service.js +75 -0
- package/dist/apps/control-plane/application/services/performance-analytics-service.js.map +1 -0
- package/dist/apps/control-plane/application/services/plan-service.d.ts +1 -0
- package/dist/apps/control-plane/application/services/plan-service.js +53 -0
- package/dist/apps/control-plane/application/services/plan-service.js.map +1 -1
- package/dist/apps/control-plane/application/services/pr-monitor-service.d.ts +44 -0
- package/dist/apps/control-plane/application/services/pr-monitor-service.js +192 -0
- package/dist/apps/control-plane/application/services/pr-monitor-service.js.map +1 -0
- package/dist/apps/control-plane/application/services/reactions-service.d.ts +67 -0
- package/dist/apps/control-plane/application/services/reactions-service.js +114 -0
- package/dist/apps/control-plane/application/services/reactions-service.js.map +1 -0
- package/dist/apps/control-plane/application/services/reporting-service.d.ts +1 -0
- package/dist/apps/control-plane/application/services/reporting-service.js +13 -2
- package/dist/apps/control-plane/application/services/reporting-service.js.map +1 -1
- package/dist/apps/control-plane/application/services/run-lease-service.d.ts +2 -0
- package/dist/apps/control-plane/application/services/run-lease-service.js +14 -38
- package/dist/apps/control-plane/application/services/run-lease-service.js.map +1 -1
- package/dist/apps/control-plane/application/tools/tool-metadata.js +3 -1
- package/dist/apps/control-plane/application/tools/tool-metadata.js.map +1 -1
- package/dist/apps/control-plane/cli/aop.d.ts +1 -1
- package/dist/apps/control-plane/cli/aop.js +1 -1
- package/dist/apps/control-plane/cli/attach-command-handler.d.ts +12 -0
- package/dist/apps/control-plane/cli/attach-command-handler.js +98 -0
- package/dist/apps/control-plane/cli/attach-command-handler.js.map +1 -0
- package/dist/apps/control-plane/cli/cleanup-command-handler.d.ts +12 -0
- package/dist/apps/control-plane/cli/cleanup-command-handler.js +162 -0
- package/dist/apps/control-plane/cli/cleanup-command-handler.js.map +1 -0
- package/dist/apps/control-plane/cli/cli-argument-parser.js +73 -3
- package/dist/apps/control-plane/cli/cli-argument-parser.js.map +1 -1
- package/dist/apps/control-plane/cli/dashboard-command-handler.d.ts +7 -0
- package/dist/apps/control-plane/cli/dashboard-command-handler.js +45 -0
- package/dist/apps/control-plane/cli/dashboard-command-handler.js.map +1 -0
- package/dist/apps/control-plane/cli/help-command-handler.d.ts +8 -0
- package/dist/apps/control-plane/cli/help-command-handler.js +146 -0
- package/dist/apps/control-plane/cli/help-command-handler.js.map +1 -0
- package/dist/apps/control-plane/cli/init-command-handler.d.ts +26 -0
- package/dist/apps/control-plane/cli/init-command-handler.js +517 -0
- package/dist/apps/control-plane/cli/init-command-handler.js.map +1 -0
- package/dist/apps/control-plane/cli/resume-command-handler.js +1 -1
- package/dist/apps/control-plane/cli/resume-command-handler.js.map +1 -1
- package/dist/apps/control-plane/cli/retry-command-handler.d.ts +8 -0
- package/dist/apps/control-plane/cli/retry-command-handler.js +111 -0
- package/dist/apps/control-plane/cli/retry-command-handler.js.map +1 -0
- package/dist/apps/control-plane/cli/run-command-handler.d.ts +5 -0
- package/dist/apps/control-plane/cli/run-command-handler.js +82 -3
- package/dist/apps/control-plane/cli/run-command-handler.js.map +1 -1
- package/dist/apps/control-plane/cli/send-command-handler.d.ts +8 -0
- package/dist/apps/control-plane/cli/send-command-handler.js +55 -0
- package/dist/apps/control-plane/cli/send-command-handler.js.map +1 -0
- package/dist/apps/control-plane/cli/status-command-handler.d.ts +12 -1
- package/dist/apps/control-plane/cli/status-command-handler.js +55 -2
- package/dist/apps/control-plane/cli/status-command-handler.js.map +1 -1
- package/dist/apps/control-plane/cli/types.d.ts +26 -1
- package/dist/apps/control-plane/cli/types.js +15 -1
- package/dist/apps/control-plane/cli/types.js.map +1 -1
- package/dist/apps/control-plane/core/constants.d.ts +6 -0
- package/dist/apps/control-plane/core/constants.js +8 -2
- package/dist/apps/control-plane/core/constants.js.map +1 -1
- package/dist/apps/control-plane/core/error-codes.d.ts +2 -0
- package/dist/apps/control-plane/core/error-codes.js +3 -1
- package/dist/apps/control-plane/core/error-codes.js.map +1 -1
- package/dist/apps/control-plane/core/gates.d.ts +4 -0
- package/dist/apps/control-plane/core/gates.js +140 -43
- package/dist/apps/control-plane/core/gates.js.map +1 -1
- package/dist/apps/control-plane/core/kernel.d.ts +50 -1
- package/dist/apps/control-plane/core/kernel.js +220 -7
- package/dist/apps/control-plane/core/kernel.js.map +1 -1
- package/dist/apps/control-plane/core/path-layout.d.ts +3 -0
- package/dist/apps/control-plane/core/path-layout.js +9 -0
- package/dist/apps/control-plane/core/path-layout.js.map +1 -1
- package/dist/apps/control-plane/core/tool-caller.d.ts +32 -0
- package/dist/apps/control-plane/core/tool-caller.js +2 -0
- package/dist/apps/control-plane/core/tool-caller.js.map +1 -0
- package/dist/apps/control-plane/core/workspace-hooks.d.ts +20 -0
- package/dist/apps/control-plane/core/workspace-hooks.js +69 -0
- package/dist/apps/control-plane/core/workspace-hooks.js.map +1 -0
- package/dist/apps/control-plane/interfaces/cli/bootstrap.js +245 -9
- package/dist/apps/control-plane/interfaces/cli/bootstrap.js.map +1 -1
- package/dist/apps/control-plane/providers/providers.d.ts +42 -3
- package/dist/apps/control-plane/providers/providers.js +216 -5
- package/dist/apps/control-plane/providers/providers.js.map +1 -1
- package/dist/apps/control-plane/supervisor/build-wave-executor.d.ts +3 -0
- package/dist/apps/control-plane/supervisor/build-wave-executor.js +115 -6
- package/dist/apps/control-plane/supervisor/build-wave-executor.js.map +1 -1
- package/dist/apps/control-plane/supervisor/qa-wave-executor.d.ts +3 -0
- package/dist/apps/control-plane/supervisor/qa-wave-executor.js +109 -5
- package/dist/apps/control-plane/supervisor/qa-wave-executor.js.map +1 -1
- package/dist/apps/control-plane/supervisor/run-coordinator.d.ts +15 -0
- package/dist/apps/control-plane/supervisor/run-coordinator.js +132 -6
- package/dist/apps/control-plane/supervisor/run-coordinator.js.map +1 -1
- package/dist/apps/control-plane/supervisor/runtime.d.ts +3 -0
- package/dist/apps/control-plane/supervisor/runtime.js +110 -6
- package/dist/apps/control-plane/supervisor/runtime.js.map +1 -1
- package/dist/apps/control-plane/supervisor/types.d.ts +9 -16
- package/dist/apps/control-plane/supervisor/types.js.map +1 -1
- package/dist/apps/control-plane/supervisor/worker-decision-loop.d.ts +3 -0
- package/dist/apps/control-plane/supervisor/worker-decision-loop.js +5 -0
- package/dist/apps/control-plane/supervisor/worker-decision-loop.js.map +1 -1
- package/eslint.config.mjs +2 -1
- package/package.json +12 -2
- package/packages/web-dashboard/next-env.d.ts +5 -0
- package/packages/web-dashboard/next.config.js +7 -0
- package/packages/web-dashboard/package.json +26 -0
- package/packages/web-dashboard/src/app/api/actions/route.ts +64 -0
- package/packages/web-dashboard/src/app/api/events/route.ts +51 -0
- package/packages/web-dashboard/src/app/api/features/[id]/checkout/route.ts +256 -0
- package/packages/web-dashboard/src/app/api/features/[id]/diff/route.ts +10 -0
- package/packages/web-dashboard/src/app/api/features/[id]/evidence/[artifact]/route.ts +25 -0
- package/packages/web-dashboard/src/app/api/features/[id]/review/route.ts +63 -0
- package/packages/web-dashboard/src/app/api/features/[id]/route.ts +16 -0
- package/packages/web-dashboard/src/app/api/projects/route.ts +31 -0
- package/packages/web-dashboard/src/app/api/status/route.ts +15 -0
- package/packages/web-dashboard/src/app/globals.css +2 -0
- package/packages/web-dashboard/src/app/layout.tsx +15 -0
- package/packages/web-dashboard/src/app/page.tsx +393 -0
- package/packages/web-dashboard/src/lib/aop-client.ts +244 -0
- package/packages/web-dashboard/src/lib/multi-project-config.ts +116 -0
- package/packages/web-dashboard/src/lib/orchestrator-tools.ts +284 -0
- package/packages/web-dashboard/src/lib/types.ts +58 -0
- package/packages/web-dashboard/tsconfig.json +40 -0
- package/packages/web-dashboard/vitest.config.ts +6 -0
- package/spec-files/completed/agentic_orchestrator_feature_gaps_closure_spec.md +1764 -0
- package/spec-files/outstanding/agentic_orchestrator_enterprise_governance_dashboard_spec.md +348 -0
- package/spec-files/outstanding/agentic_orchestrator_knowledge_canary_spec.md +344 -0
- package/spec-files/outstanding/agentic_orchestrator_observability_integrity_diagnostics_spec.md +374 -0
- package/spec-files/outstanding/agentic_orchestrator_performance_improvements_spec.md +1059 -0
- package/spec-files/outstanding/agentic_orchestrator_planning_review_quality_spec.md +466 -0
- package/spec-files/outstanding/agentic_orchestrator_quality_adoption_execution_spec.md +198 -0
- package/spec-files/outstanding/agentic_orchestrator_validator_hardening_spec.md +365 -0
- package/spec-files/progress.md +481 -52
- /package/spec-files/{agentic_orchestrator_cli_delete_command_spec.md → completed/agentic_orchestrator_cli_delete_command_spec.md} +0 -0
- /package/spec-files/{agentic_orchestrator_dot_aop_generated_artifacts_spec.md → completed/agentic_orchestrator_dot_aop_generated_artifacts_spec.md} +0 -0
- /package/spec-files/{agentic_orchestrator_mcp_formalization_spec.md → completed/agentic_orchestrator_mcp_formalization_spec.md} +0 -0
- /package/spec-files/{agentic_orchestrator_oop_refactor_spec.md → completed/agentic_orchestrator_oop_refactor_spec.md} +0 -0
- /package/spec-files/{agentic_orchestrator_single_global_orchestrator_spec.md → completed/agentic_orchestrator_single_global_orchestrator_spec.md} +0 -0
- /package/spec-files/{agentic_orchestrator_spec.md → completed/agentic_orchestrator_spec.md} +0 -0
|
@@ -0,0 +1,466 @@
|
|
|
1
|
+
# Feature Spec: Planning and Review Quality Hardening (AOP)
|
|
2
|
+
|
|
3
|
+
> **Purpose of this document**: Define implementation-ready delivery for Q1, Q2, Q7, Q8, and Q9: semantic collision detection, feasibility scoring, prompt contract tests, flaky gate intelligence, and generated review briefs.
|
|
4
|
+
|
|
5
|
+
**Version:** 1.0
|
|
6
|
+
**Date:** 2026-03-03
|
|
7
|
+
**Status:** Draft
|
|
8
|
+
**Roadmap Mapping:** M34-M36
|
|
9
|
+
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
## 0. Standards and Scope
|
|
13
|
+
|
|
14
|
+
### 0.1 Standards
|
|
15
|
+
|
|
16
|
+
All implementation MUST preserve:
|
|
17
|
+
- deterministic plan admission behavior
|
|
18
|
+
- policy/schema governance
|
|
19
|
+
- transport parity for new tools
|
|
20
|
+
- existing collision + lock safety guarantees
|
|
21
|
+
|
|
22
|
+
### 0.2 Feature Scope
|
|
23
|
+
|
|
24
|
+
This spec implements:
|
|
25
|
+
- **Q1** Semantic Collision Engine
|
|
26
|
+
- **Q2** Plan Feasibility Scoring
|
|
27
|
+
- **Q7** Prompt Contract Test Harness
|
|
28
|
+
- **Q8** Flaky Gate Intelligence + Quarantine Workflow
|
|
29
|
+
- **Q9** Auto-Generated Review Briefs
|
|
30
|
+
|
|
31
|
+
### 0.3 Out of Scope
|
|
32
|
+
|
|
33
|
+
- ML-based probabilistic planning heuristics
|
|
34
|
+
- autonomous prompt rewriting
|
|
35
|
+
- automatic flaky-test suppression without accountability metadata
|
|
36
|
+
|
|
37
|
+
---
|
|
38
|
+
|
|
39
|
+
## 1. Objectives
|
|
40
|
+
|
|
41
|
+
### 1.1 Must-Have Outcomes
|
|
42
|
+
|
|
43
|
+
- detect semantic collisions before expensive build/QA loops
|
|
44
|
+
- prevent low-quality plans from advancing when policy requires score threshold
|
|
45
|
+
- protect planner/builder/qa behavior from prompt regressions
|
|
46
|
+
- reduce gate noise caused by flaky tests
|
|
47
|
+
- provide deterministic, high-signal review briefs for human approval
|
|
48
|
+
|
|
49
|
+
### 1.2 Non-Goals
|
|
50
|
+
|
|
51
|
+
- no change to explicit user merge control semantics
|
|
52
|
+
- no replacement of current gate profiles
|
|
53
|
+
- no dynamic prompt evaluation service
|
|
54
|
+
|
|
55
|
+
---
|
|
56
|
+
|
|
57
|
+
## 2. Architecture Decisions
|
|
58
|
+
|
|
59
|
+
### 2.1 Collision Checks as Layered Validation
|
|
60
|
+
|
|
61
|
+
Plan admission remains layered:
|
|
62
|
+
1. schema validation
|
|
63
|
+
2. policy validation
|
|
64
|
+
3. path/contract collision checks
|
|
65
|
+
4. **semantic collision checks (new)**
|
|
66
|
+
5. feasibility scoring threshold check (optional)
|
|
67
|
+
|
|
68
|
+
### 2.2 Prompt Contracts as CI Artifact
|
|
69
|
+
|
|
70
|
+
Prompt contracts are treated as repository source contracts with test fixtures and invariants. They are verified by deterministic scripts in CI, not by runtime agent heuristics.
|
|
71
|
+
|
|
72
|
+
### 2.3 Flaky Intelligence is Advisory + Governed
|
|
73
|
+
|
|
74
|
+
Flaky classification influences reporting and quarantine workflow but MUST NOT silently bypass required merge gates without explicit policy.
|
|
75
|
+
|
|
76
|
+
### 2.4 Review Briefs are Deterministic Summaries
|
|
77
|
+
|
|
78
|
+
Review briefs MUST be generated from canonical artifacts only (plan, state, diff, gate evidence, collisions, locks), not free-form model synthesis.
|
|
79
|
+
|
|
80
|
+
---
|
|
81
|
+
|
|
82
|
+
## 3. Contracts and Schema Deltas
|
|
83
|
+
|
|
84
|
+
### 3.1 Policy Extensions
|
|
85
|
+
|
|
86
|
+
```yaml
|
|
87
|
+
innovation:
|
|
88
|
+
semantic_collisions: false
|
|
89
|
+
plan_feasibility_scoring: false
|
|
90
|
+
prompt_contract_tests: false
|
|
91
|
+
flaky_gate_intelligence: false
|
|
92
|
+
review_briefs: false
|
|
93
|
+
|
|
94
|
+
planning:
|
|
95
|
+
feasibility:
|
|
96
|
+
enabled: false
|
|
97
|
+
minimum_score: 70
|
|
98
|
+
weights:
|
|
99
|
+
scope_realism: 0.25
|
|
100
|
+
test_sufficiency: 0.25
|
|
101
|
+
dependency_completeness: 0.2
|
|
102
|
+
contract_risk: 0.15
|
|
103
|
+
lock_contention_risk: 0.15
|
|
104
|
+
|
|
105
|
+
testing:
|
|
106
|
+
flaky:
|
|
107
|
+
enabled: false
|
|
108
|
+
quarantine:
|
|
109
|
+
max_days: 14
|
|
110
|
+
require_owner: true
|
|
111
|
+
require_reason: true
|
|
112
|
+
```
|
|
113
|
+
|
|
114
|
+
### 3.2 New/Extended Artifacts
|
|
115
|
+
|
|
116
|
+
```text
|
|
117
|
+
.aop/
|
|
118
|
+
analytics/flaky-tests.json
|
|
119
|
+
features/<feature_id>/review_brief.json
|
|
120
|
+
|
|
121
|
+
agentic/orchestrator/
|
|
122
|
+
prompt-contract/
|
|
123
|
+
fixtures/
|
|
124
|
+
invariants/
|
|
125
|
+
schemas/review_brief.schema.json
|
|
126
|
+
```
|
|
127
|
+
|
|
128
|
+
### 3.3 Tool Surface Additions
|
|
129
|
+
|
|
130
|
+
- `plan.score` (read-only)
|
|
131
|
+
- `gate.flaky_report_get` (read-only)
|
|
132
|
+
- `report.feature_review_brief` (read-only)
|
|
133
|
+
|
|
134
|
+
`collisions.scan` output schema must be extended for `semantic_conflicts[]`.
|
|
135
|
+
|
|
136
|
+
---
|
|
137
|
+
|
|
138
|
+
## 4. Detailed Feature Specs
|
|
139
|
+
|
|
140
|
+
## 4.1 Q1: Semantic Collision Engine
|
|
141
|
+
|
|
142
|
+
### 4.1.1 Problem
|
|
143
|
+
Two features can avoid path collisions yet conflict semantically (same API operation evolution, incompatible event payloads, conflicting migration intent).
|
|
144
|
+
|
|
145
|
+
### 4.1.2 Design
|
|
146
|
+
Add semantic analyzers executed during `plan.submit` and `collisions.scan` when enabled.
|
|
147
|
+
|
|
148
|
+
### 4.1.3 Analyzer Modules
|
|
149
|
+
|
|
150
|
+
- `OpenApiSemanticAnalyzer`
|
|
151
|
+
- `EventSchemaSemanticAnalyzer`
|
|
152
|
+
- `DbIntentSemanticAnalyzer`
|
|
153
|
+
|
|
154
|
+
### 4.1.4 Output Contract
|
|
155
|
+
|
|
156
|
+
```json
|
|
157
|
+
{
|
|
158
|
+
"semantic_conflicts": [
|
|
159
|
+
{
|
|
160
|
+
"type": "openapi_operation_conflict",
|
|
161
|
+
"feature_a": "feature_1",
|
|
162
|
+
"feature_b": "feature_2",
|
|
163
|
+
"fingerprint": "sha256:...",
|
|
164
|
+
"severity": "high",
|
|
165
|
+
"details": {
|
|
166
|
+
"path": "/users/{id}",
|
|
167
|
+
"method": "PATCH"
|
|
168
|
+
}
|
|
169
|
+
}
|
|
170
|
+
]
|
|
171
|
+
}
|
|
172
|
+
```
|
|
173
|
+
|
|
174
|
+
### 4.1.5 File Targets
|
|
175
|
+
|
|
176
|
+
- `apps/control-plane/src/application/services/semantic-collision-service.ts` (new)
|
|
177
|
+
- `apps/control-plane/src/application/services/plan-service.ts`
|
|
178
|
+
- `apps/control-plane/src/application/services/reporting-service.ts`
|
|
179
|
+
- `agentic/orchestrator/tools/schemas/output/collisions.scan.output.schema.json`
|
|
180
|
+
|
|
181
|
+
### 4.1.6 Acceptance Criteria
|
|
182
|
+
|
|
183
|
+
- semantic conflicts produce stable fingerprints and deterministic ordering
|
|
184
|
+
- policy `collision_policy` handling applies to semantic conflicts identically to existing collisions
|
|
185
|
+
|
|
186
|
+
---
|
|
187
|
+
|
|
188
|
+
## 4.2 Q2: Plan Feasibility Scoring
|
|
189
|
+
|
|
190
|
+
### 4.2.1 Problem
|
|
191
|
+
Schema-valid plans can still be operationally poor and drive retries.
|
|
192
|
+
|
|
193
|
+
### 4.2.2 Design
|
|
194
|
+
Add deterministic scoring service and optional threshold gate in plan admission.
|
|
195
|
+
|
|
196
|
+
### 4.2.3 Scoring Components
|
|
197
|
+
|
|
198
|
+
- scope realism: file breadth vs stated summary/criteria
|
|
199
|
+
- test sufficiency: acceptance criteria mapped to gate/test obligations
|
|
200
|
+
- dependency completeness: missing/unsatisfied dependencies
|
|
201
|
+
- contract risk: openapi/events/db impact complexity
|
|
202
|
+
- lock contention risk: likelihood of blocking based on held/pending locks
|
|
203
|
+
|
|
204
|
+
### 4.2.4 Tool Contract (`plan.score`)
|
|
205
|
+
|
|
206
|
+
```json
|
|
207
|
+
{
|
|
208
|
+
"feature_id": "my_feature",
|
|
209
|
+
"score": 78,
|
|
210
|
+
"breakdown": {
|
|
211
|
+
"scope_realism": 0.8,
|
|
212
|
+
"test_sufficiency": 0.7,
|
|
213
|
+
"dependency_completeness": 0.9,
|
|
214
|
+
"contract_risk": 0.6,
|
|
215
|
+
"lock_contention_risk": 0.7
|
|
216
|
+
},
|
|
217
|
+
"reasons": [
|
|
218
|
+
"Acceptance criteria #3 has no mapped verification target"
|
|
219
|
+
]
|
|
220
|
+
}
|
|
221
|
+
```
|
|
222
|
+
|
|
223
|
+
### 4.2.5 File Targets
|
|
224
|
+
|
|
225
|
+
- `apps/control-plane/src/application/services/plan-feasibility-service.ts` (new)
|
|
226
|
+
- `apps/control-plane/src/application/services/plan-service.ts`
|
|
227
|
+
- `agentic/orchestrator/tools/schemas/output/plan.score.output.schema.json` (new)
|
|
228
|
+
|
|
229
|
+
### 4.2.6 Acceptance Criteria
|
|
230
|
+
|
|
231
|
+
- score for same inputs is stable across repeated calls
|
|
232
|
+
- below-threshold plans blocked only when feasibility gate is enabled
|
|
233
|
+
|
|
234
|
+
---
|
|
235
|
+
|
|
236
|
+
## 4.3 Q7: Prompt Contract Test Harness
|
|
237
|
+
|
|
238
|
+
### 4.3.1 Problem
|
|
239
|
+
Prompt edits currently risk behavior drift without dedicated regression coverage.
|
|
240
|
+
|
|
241
|
+
### 4.3.2 Design
|
|
242
|
+
Fixture + invariant harness validates role outputs against deterministic expectations.
|
|
243
|
+
|
|
244
|
+
### 4.3.3 Invariant Set (Initial)
|
|
245
|
+
|
|
246
|
+
- planner output must include actionable plan artifacts
|
|
247
|
+
- builder output must not propose forbidden tool usage
|
|
248
|
+
- QA output must contain explicit evidence/test update intent
|
|
249
|
+
|
|
250
|
+
### 4.3.4 Tooling
|
|
251
|
+
|
|
252
|
+
- script: `apps/control-plane/scripts/test-prompt-contracts.mjs`
|
|
253
|
+
- repo assets:
|
|
254
|
+
- `agentic/orchestrator/prompt-contract/fixtures/*`
|
|
255
|
+
- `agentic/orchestrator/prompt-contract/invariants/*`
|
|
256
|
+
- package script:
|
|
257
|
+
- `npm run test:prompt-contracts`
|
|
258
|
+
|
|
259
|
+
### 4.3.5 CI Contract
|
|
260
|
+
|
|
261
|
+
If prompt files under `agentic/orchestrator/prompts/` change, prompt-contract suite MUST run and block merge on failure.
|
|
262
|
+
|
|
263
|
+
### 4.3.6 Acceptance Criteria
|
|
264
|
+
|
|
265
|
+
- invariant failures produce explicit diff-like diagnostics
|
|
266
|
+
- prompt harness artifacts persist under `.aop/runtime/prompt-contract/`
|
|
267
|
+
|
|
268
|
+
---
|
|
269
|
+
|
|
270
|
+
## 4.4 Q8: Flaky Gate Intelligence + Quarantine Workflow
|
|
271
|
+
|
|
272
|
+
### 4.4.1 Problem
|
|
273
|
+
Flaky failures create noisy retries and reduce confidence in gate results.
|
|
274
|
+
|
|
275
|
+
### 4.4.2 Design
|
|
276
|
+
Track per-test stability history and expose deterministic suspect scoring and quarantine governance.
|
|
277
|
+
|
|
278
|
+
### 4.4.3 Flake Model (Deterministic)
|
|
279
|
+
|
|
280
|
+
Per test key `<framework>:<suite>:<test_name>`:
|
|
281
|
+
- rolling window results
|
|
282
|
+
- failure-rate and alternation index
|
|
283
|
+
- quarantine metadata (`owner`, `reason`, `expires_at`)
|
|
284
|
+
|
|
285
|
+
### 4.4.4 Quarantine Rules
|
|
286
|
+
|
|
287
|
+
- quarantine entry requires owner + reason
|
|
288
|
+
- quarantine expires automatically
|
|
289
|
+
- expired quarantine generates reminder finding/event
|
|
290
|
+
- quarantine cannot silently satisfy required merge gates unless policy explicitly permits
|
|
291
|
+
|
|
292
|
+
### 4.4.5 File Targets
|
|
293
|
+
|
|
294
|
+
- `apps/control-plane/src/application/services/flaky-gate-service.ts` (new)
|
|
295
|
+
- `apps/control-plane/src/application/services/gate-service.ts`
|
|
296
|
+
- `agentic/orchestrator/tools/schemas/output/gate.flaky_report_get.output.schema.json` (new)
|
|
297
|
+
|
|
298
|
+
### 4.4.6 Acceptance Criteria
|
|
299
|
+
|
|
300
|
+
- flaky suspect list deterministic for same history input
|
|
301
|
+
- quarantine lifecycle enforced by policy constraints
|
|
302
|
+
|
|
303
|
+
---
|
|
304
|
+
|
|
305
|
+
## 4.5 Q9: Auto-Generated Review Briefs
|
|
306
|
+
|
|
307
|
+
### 4.5.1 Problem
|
|
308
|
+
Human reviewers must manually synthesize plan/diff/evidence context.
|
|
309
|
+
|
|
310
|
+
### 4.5.2 Design
|
|
311
|
+
Generate concise structured review brief artifacts for each reviewable feature.
|
|
312
|
+
|
|
313
|
+
### 4.5.3 Brief Sections
|
|
314
|
+
|
|
315
|
+
- intent and scope summary
|
|
316
|
+
- high-risk paths/contracts
|
|
317
|
+
- gate outcome matrix
|
|
318
|
+
- unresolved risks/questions
|
|
319
|
+
- evidence references
|
|
320
|
+
|
|
321
|
+
### 4.5.4 Artifact Contract
|
|
322
|
+
|
|
323
|
+
```json
|
|
324
|
+
{
|
|
325
|
+
"feature_id": "my_feature",
|
|
326
|
+
"intent_summary": "Add idempotent billing retry guard",
|
|
327
|
+
"high_risk_paths": ["libs/billing/retry.ts"],
|
|
328
|
+
"contract_changes": {
|
|
329
|
+
"openapi": false,
|
|
330
|
+
"events": true,
|
|
331
|
+
"db": false
|
|
332
|
+
},
|
|
333
|
+
"gate_summary": {
|
|
334
|
+
"fast": "pass",
|
|
335
|
+
"full": "pass",
|
|
336
|
+
"merge": "na"
|
|
337
|
+
},
|
|
338
|
+
"open_questions": [
|
|
339
|
+
"Should retry cap be environment-specific?"
|
|
340
|
+
],
|
|
341
|
+
"evidence_refs": [
|
|
342
|
+
".aop/features/my_feature/evidence/gates/full-2026-03-03.json"
|
|
343
|
+
]
|
|
344
|
+
}
|
|
345
|
+
```
|
|
346
|
+
|
|
347
|
+
### 4.5.5 File Targets
|
|
348
|
+
|
|
349
|
+
- `apps/control-plane/src/application/services/review-brief-service.ts` (new)
|
|
350
|
+
- `apps/control-plane/src/application/services/reporting-service.ts`
|
|
351
|
+
- `packages/web-dashboard/src/components/ReviewPanel.tsx`
|
|
352
|
+
- `agentic/orchestrator/schemas/review_brief.schema.json` (new)
|
|
353
|
+
|
|
354
|
+
### 4.5.6 Acceptance Criteria
|
|
355
|
+
|
|
356
|
+
- review brief generated for `ready_to_merge` features
|
|
357
|
+
- brief content deterministic and sorted consistently
|
|
358
|
+
|
|
359
|
+
---
|
|
360
|
+
|
|
361
|
+
## 5. Milestones
|
|
362
|
+
|
|
363
|
+
### PRQ1: Semantic + Feasibility Admission
|
|
364
|
+
|
|
365
|
+
Tasks:
|
|
366
|
+
1. Implement semantic analyzers + schema updates.
|
|
367
|
+
2. Implement feasibility scoring + threshold enforcement.
|
|
368
|
+
|
|
369
|
+
Exit gate:
|
|
370
|
+
- conflicting and low-feasibility plans blocked deterministically when enabled.
|
|
371
|
+
|
|
372
|
+
### PRQ2: Prompt Contract Reliability
|
|
373
|
+
|
|
374
|
+
Tasks:
|
|
375
|
+
1. Add fixtures/invariants and harness script.
|
|
376
|
+
2. Wire CI enforcement for prompt changes.
|
|
377
|
+
|
|
378
|
+
Exit gate:
|
|
379
|
+
- prompt regressions are caught in CI with clear diagnostics.
|
|
380
|
+
|
|
381
|
+
### PRQ3: Flaky Gate Intelligence
|
|
382
|
+
|
|
383
|
+
Tasks:
|
|
384
|
+
1. Add flake analytics persistence and report tool.
|
|
385
|
+
2. Add quarantine governance and expiry checks.
|
|
386
|
+
|
|
387
|
+
Exit gate:
|
|
388
|
+
- flaky signal exposed without weakening mandatory gates by default.
|
|
389
|
+
|
|
390
|
+
### PRQ4: Review Brief Automation
|
|
391
|
+
|
|
392
|
+
Tasks:
|
|
393
|
+
1. Implement review-brief service and artifact generation.
|
|
394
|
+
2. Add MCP report tool and dashboard integration.
|
|
395
|
+
|
|
396
|
+
Exit gate:
|
|
397
|
+
- reviewers receive deterministic brief before merge approval.
|
|
398
|
+
|
|
399
|
+
---
|
|
400
|
+
|
|
401
|
+
## 6. Test Strategy (Normative)
|
|
402
|
+
|
|
403
|
+
### 6.1 Unit Tests
|
|
404
|
+
|
|
405
|
+
- each semantic analyzer rule
|
|
406
|
+
- score determinism and weight normalization
|
|
407
|
+
- prompt invariant validators
|
|
408
|
+
- flaky score and quarantine expiry logic
|
|
409
|
+
- brief assembly ordering and content extraction
|
|
410
|
+
|
|
411
|
+
### 6.2 Integration Tests
|
|
412
|
+
|
|
413
|
+
- `plan.submit` blocked by semantic conflict
|
|
414
|
+
- `plan.submit` blocked by low feasibility when enabled
|
|
415
|
+
- prompt-contract harness pass/fail suites
|
|
416
|
+
- flaky report generation from realistic gate histories
|
|
417
|
+
- review brief generation from live feature artifacts
|
|
418
|
+
|
|
419
|
+
### 6.3 Transport Parity Tests
|
|
420
|
+
|
|
421
|
+
- `plan.score`
|
|
422
|
+
- `gate.flaky_report_get`
|
|
423
|
+
- `report.feature_review_brief`
|
|
424
|
+
|
|
425
|
+
### 6.4 UI Tests
|
|
426
|
+
|
|
427
|
+
- review panel displays brief sections and evidence links
|
|
428
|
+
- no brief available state handled clearly
|
|
429
|
+
|
|
430
|
+
---
|
|
431
|
+
|
|
432
|
+
## 7. Acceptance Criteria
|
|
433
|
+
|
|
434
|
+
1. Q1/Q2 quality gates work deterministically and are policy-controlled.
|
|
435
|
+
2. Prompt contract harness is operational and CI-gated.
|
|
436
|
+
3. Flaky intelligence improves signal while preserving gate integrity.
|
|
437
|
+
4. Review briefs are generated and consumable in CLI/dashboard flows.
|
|
438
|
+
5. All validation gates pass (`lint`, `typecheck`, `test`, contract/architecture validators).
|
|
439
|
+
|
|
440
|
+
---
|
|
441
|
+
|
|
442
|
+
## 8. Risks and Mitigations
|
|
443
|
+
|
|
444
|
+
- Risk: semantic analyzers produce false positives.
|
|
445
|
+
- Mitigation: confidence/severity labels + precise conflict details and opt-in rollout.
|
|
446
|
+
|
|
447
|
+
- Risk: scoring model over-rejects valid plans.
|
|
448
|
+
- Mitigation: threshold configurable and disabled by default.
|
|
449
|
+
|
|
450
|
+
- Risk: prompt fixtures become stale with runtime evolution.
|
|
451
|
+
- Mitigation: fixture versioning and required update notes on role contract changes.
|
|
452
|
+
|
|
453
|
+
- Risk: quarantine becomes an abuse path.
|
|
454
|
+
- Mitigation: owner/reason/expiry mandatory and surfaced in status/reporting.
|
|
455
|
+
|
|
456
|
+
---
|
|
457
|
+
|
|
458
|
+
## 9. Definition of Done
|
|
459
|
+
|
|
460
|
+
A completed PRQ milestone MUST include:
|
|
461
|
+
- changed file inventory
|
|
462
|
+
- schema/catalog updates
|
|
463
|
+
- new tests and coverage notes
|
|
464
|
+
- verification command summaries
|
|
465
|
+
- `spec-files/progress.md` task closure and next steps
|
|
466
|
+
|
|
@@ -0,0 +1,198 @@
|
|
|
1
|
+
# Feature Spec: Quality + Adoption Execution Master Index (AOP)
|
|
2
|
+
|
|
3
|
+
> **Purpose of this document**: Serve as the orchestration index for split implementation specs covering Q1-Q12. Detailed implementation requirements live in the feature-set specs listed below.
|
|
4
|
+
|
|
5
|
+
**Version:** 1.1
|
|
6
|
+
**Date:** 2026-03-03
|
|
7
|
+
**Status:** Draft (Split-Spec Edition)
|
|
8
|
+
**Roadmap Window:** M33-M38
|
|
9
|
+
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
## 1. Why This Split Exists
|
|
13
|
+
|
|
14
|
+
The original monolithic execution document covered too many orthogonal concerns in one spec. It is now split into focused specs by feature set so future agents can execute with clearer boundaries, lower context cost, and cleaner milestone ownership.
|
|
15
|
+
|
|
16
|
+
---
|
|
17
|
+
|
|
18
|
+
## 2. Canonical Split Specs
|
|
19
|
+
|
|
20
|
+
### 2.1 Observability/Integrity/Diagnostics (M33)
|
|
21
|
+
|
|
22
|
+
- `spec-files/agentic_orchestrator_observability_integrity_diagnostics_spec.md`
|
|
23
|
+
|
|
24
|
+
Implements:
|
|
25
|
+
- Q3 Deterministic Replay + Time-Travel Debugger
|
|
26
|
+
- Q4 Tamper-Evident Evidence Chain
|
|
27
|
+
- Q11 `aop doctor` Readiness + Drift Scanner
|
|
28
|
+
|
|
29
|
+
### 2.2 Planning/Review Quality Hardening (M34-M36)
|
|
30
|
+
|
|
31
|
+
- `spec-files/agentic_orchestrator_planning_review_quality_spec.md`
|
|
32
|
+
|
|
33
|
+
Implements:
|
|
34
|
+
- Q1 Semantic Collision Engine
|
|
35
|
+
- Q2 Plan Feasibility Scoring
|
|
36
|
+
- Q7 Prompt Contract Test Harness
|
|
37
|
+
- Q8 Flaky Gate Intelligence + Quarantine Workflow
|
|
38
|
+
- Q9 Auto-Generated Review Briefs
|
|
39
|
+
|
|
40
|
+
### 2.3 Enterprise Governance + Secure Dashboard (M37)
|
|
41
|
+
|
|
42
|
+
- `spec-files/agentic_orchestrator_enterprise_governance_dashboard_spec.md`
|
|
43
|
+
|
|
44
|
+
Implements:
|
|
45
|
+
- Q5 Compliance Policy Packs + Control Export
|
|
46
|
+
- Q6 Dashboard AuthN/AuthZ + SSO + Audit Federation
|
|
47
|
+
|
|
48
|
+
### 2.4 Knowledge + Progressive Merge Safety (M38)
|
|
49
|
+
|
|
50
|
+
- `spec-files/agentic_orchestrator_knowledge_canary_spec.md`
|
|
51
|
+
|
|
52
|
+
Implements:
|
|
53
|
+
- Q10 Cross-Feature Knowledge Graph
|
|
54
|
+
- Q12 Progressive Merge Guardrails (Canary Verification)
|
|
55
|
+
|
|
56
|
+
---
|
|
57
|
+
|
|
58
|
+
## 3. Feature Outcome Matrix
|
|
59
|
+
|
|
60
|
+
| ID | Feature | Primary Outcome | Detailed Spec |
|
|
61
|
+
|---|---|---|---|
|
|
62
|
+
| Q1 | Semantic Collision Engine | Fewer hidden integration conflicts before build/QA | `agentic_orchestrator_planning_review_quality_spec.md` |
|
|
63
|
+
| Q2 | Plan Feasibility Scoring | Early detection of weak plans | `agentic_orchestrator_planning_review_quality_spec.md` |
|
|
64
|
+
| Q3 | Replay + Time-Travel | Faster incident RCA | `agentic_orchestrator_observability_integrity_diagnostics_spec.md` |
|
|
65
|
+
| Q4 | Tamper-Evident Evidence | Stronger audit trust | `agentic_orchestrator_observability_integrity_diagnostics_spec.md` |
|
|
66
|
+
| Q5 | Compliance Packs + Export | Faster enterprise onboarding | `agentic_orchestrator_enterprise_governance_dashboard_spec.md` |
|
|
67
|
+
| Q6 | Dashboard Auth + SSO + Audit | Production-safe dashboard operation | `agentic_orchestrator_enterprise_governance_dashboard_spec.md` |
|
|
68
|
+
| Q7 | Prompt Contract Tests | Prompt regression prevention | `agentic_orchestrator_planning_review_quality_spec.md` |
|
|
69
|
+
| Q8 | Flaky Intelligence | Higher gate signal quality | `agentic_orchestrator_planning_review_quality_spec.md` |
|
|
70
|
+
| Q9 | Review Briefs | Faster/high-quality human review | `agentic_orchestrator_planning_review_quality_spec.md` |
|
|
71
|
+
| Q10 | Knowledge Graph | Better cross-feature reuse | `agentic_orchestrator_knowledge_canary_spec.md` |
|
|
72
|
+
| Q11 | `aop doctor` | Faster readiness/drift triage | `agentic_orchestrator_observability_integrity_diagnostics_spec.md` |
|
|
73
|
+
| Q12 | Canary Merge Verification | Safer merge promotion | `agentic_orchestrator_knowledge_canary_spec.md` |
|
|
74
|
+
|
|
75
|
+
---
|
|
76
|
+
|
|
77
|
+
## 4. Cross-Spec Coordination Rules
|
|
78
|
+
|
|
79
|
+
### 4.1 Shared Constraints (Normative)
|
|
80
|
+
|
|
81
|
+
All split specs MUST preserve:
|
|
82
|
+
- deterministic MCP contract behavior
|
|
83
|
+
- explicit user approval for merge promotion
|
|
84
|
+
- policy/schema-driven validation
|
|
85
|
+
- `.aop/**` runtime artifact ownership
|
|
86
|
+
- architecture boundary validation and transport parity tests
|
|
87
|
+
|
|
88
|
+
### 4.2 Shared Policy Namespace
|
|
89
|
+
|
|
90
|
+
All new toggles MUST be grouped under:
|
|
91
|
+
|
|
92
|
+
```yaml
|
|
93
|
+
innovation:
|
|
94
|
+
semantic_collisions: false
|
|
95
|
+
plan_feasibility_scoring: false
|
|
96
|
+
replay_debugger: true
|
|
97
|
+
tamper_evident_evidence: false
|
|
98
|
+
compliance_packs: false
|
|
99
|
+
dashboard_auth: false
|
|
100
|
+
prompt_contract_tests: false
|
|
101
|
+
flaky_gate_intelligence: false
|
|
102
|
+
review_briefs: false
|
|
103
|
+
knowledge_graph: false
|
|
104
|
+
canary_merge_verification: false
|
|
105
|
+
```
|
|
106
|
+
|
|
107
|
+
### 4.3 Tool Contract Integration
|
|
108
|
+
|
|
109
|
+
Any new tool introduced in a child spec MUST include:
|
|
110
|
+
- catalog entry
|
|
111
|
+
- input/output schemas
|
|
112
|
+
- RBAC entry
|
|
113
|
+
- tool docs regeneration
|
|
114
|
+
- parity tests (`inprocess` vs `mcp`)
|
|
115
|
+
|
|
116
|
+
---
|
|
117
|
+
|
|
118
|
+
## 5. Milestone Ordering and Dependencies
|
|
119
|
+
|
|
120
|
+
### M33 (must complete first)
|
|
121
|
+
|
|
122
|
+
From `observability_integrity_diagnostics_spec`:
|
|
123
|
+
- replay timeline
|
|
124
|
+
- evidence chain verification
|
|
125
|
+
- doctor scanner
|
|
126
|
+
|
|
127
|
+
Dependency rationale:
|
|
128
|
+
- provides baseline observability needed to debug and harden later milestones
|
|
129
|
+
|
|
130
|
+
### M34-M36
|
|
131
|
+
|
|
132
|
+
From `planning_review_quality_spec`:
|
|
133
|
+
- semantic collisions + feasibility scoring
|
|
134
|
+
- prompt contracts
|
|
135
|
+
- flaky intelligence
|
|
136
|
+
- review briefs
|
|
137
|
+
|
|
138
|
+
Dependency rationale:
|
|
139
|
+
- quality-admission and review optimizations depend on M33 observability artifacts
|
|
140
|
+
|
|
141
|
+
### M37
|
|
142
|
+
|
|
143
|
+
From `enterprise_governance_dashboard_spec`:
|
|
144
|
+
- compliance export
|
|
145
|
+
- dashboard auth/authz/audit federation
|
|
146
|
+
|
|
147
|
+
Dependency rationale:
|
|
148
|
+
- enterprise controls consume stable evidence/reporting from prior milestones
|
|
149
|
+
|
|
150
|
+
### M38
|
|
151
|
+
|
|
152
|
+
From `knowledge_canary_spec`:
|
|
153
|
+
- knowledge graph retrieval
|
|
154
|
+
- canary merge verification
|
|
155
|
+
|
|
156
|
+
Dependency rationale:
|
|
157
|
+
- canary and graph quality improve with historical artifacts generated by M33-M37
|
|
158
|
+
|
|
159
|
+
---
|
|
160
|
+
|
|
161
|
+
## 6. Execution Workflow for Future Agents
|
|
162
|
+
|
|
163
|
+
For each milestone:
|
|
164
|
+
|
|
165
|
+
1. Read this master index and only the child spec for the milestone in scope.
|
|
166
|
+
2. Implement file-level targets from that child spec.
|
|
167
|
+
3. Add/adjust contract artifacts (catalog/schemas/policy/docs).
|
|
168
|
+
4. Add tests per child-spec test plan.
|
|
169
|
+
5. Run required gates:
|
|
170
|
+
- `npm run lint`
|
|
171
|
+
- `npm run typecheck`
|
|
172
|
+
- `npm test`
|
|
173
|
+
- `npm run validate:mcp-contracts`
|
|
174
|
+
- `npm run validate:architecture`
|
|
175
|
+
6. Update `spec-files/progress.md` with:
|
|
176
|
+
- completed tasks
|
|
177
|
+
- residual next tasks
|
|
178
|
+
- blockers/risk notes
|
|
179
|
+
|
|
180
|
+
---
|
|
181
|
+
|
|
182
|
+
## 7. Success Metrics (Program-Level)
|
|
183
|
+
|
|
184
|
+
- 30% reduction in failed merges due to late conflict discovery.
|
|
185
|
+
- 40% reduction in incident triage time.
|
|
186
|
+
- 50% reduction in flaky failure noise within 60 days.
|
|
187
|
+
- enterprise security/review onboarding reduced to <2 weeks for pilot rollout.
|
|
188
|
+
- 20% faster reviewer approval turnaround.
|
|
189
|
+
|
|
190
|
+
---
|
|
191
|
+
|
|
192
|
+
## 8. Definition of Done (Master)
|
|
193
|
+
|
|
194
|
+
The split initiative is complete only when:
|
|
195
|
+
1. Q1-Q12 are delivered via their child specs.
|
|
196
|
+
2. all feature flags and schema/tool contracts are integrated and validated.
|
|
197
|
+
3. no regressions occur in existing deterministic behavior.
|
|
198
|
+
4. `spec-files/progress.md` marks all child-spec milestones complete.
|