agent-os-kernel 1.1.0__py3-none-any.whl → 1.2.0__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- agent_os/__init__.py +66 -4
- agent_os/agents_compat.py +286 -0
- agent_os/base_agent.py +308 -0
- agent_os/cli.py +1079 -19
- agent_os/integrations/__init__.py +37 -2
- agent_os/integrations/openai_adapter.py +502 -0
- agent_os/integrations/semantic_kernel_adapter.py +569 -0
- agent_os/stateless.py +349 -0
- agent_os_kernel-1.2.0.dist-info/METADATA +676 -0
- agent_os_kernel-1.2.0.dist-info/RECORD +1053 -0
- {agent_os_kernel-1.1.0.dist-info → agent_os_kernel-1.2.0.dist-info}/entry_points.txt +0 -1
- modules/amb/.github/workflows/ci.yml +102 -0
- modules/amb/.github/workflows/publish.yml +146 -0
- modules/amb/.gitignore +134 -0
- modules/amb/CHANGELOG.md +118 -0
- modules/amb/CONTRIBUTING.md +141 -0
- modules/amb/LICENSE +21 -0
- modules/amb/README.md +188 -0
- modules/amb/amb_core/__init__.py +175 -0
- modules/amb/amb_core/adapters/__init__.py +55 -0
- modules/amb/amb_core/adapters/aws_sqs_broker.py +374 -0
- modules/amb/amb_core/adapters/azure_servicebus_broker.py +338 -0
- modules/amb/amb_core/adapters/kafka_broker.py +258 -0
- modules/amb/amb_core/adapters/nats_broker.py +283 -0
- modules/amb/amb_core/adapters/rabbitmq_broker.py +233 -0
- modules/amb/amb_core/adapters/redis_broker.py +260 -0
- modules/amb/amb_core/broker.py +143 -0
- modules/amb/amb_core/bus.py +479 -0
- modules/amb/amb_core/cloudevents.py +507 -0
- modules/amb/amb_core/dlq.py +343 -0
- modules/amb/amb_core/hf_utils.py +534 -0
- modules/amb/amb_core/memory_broker.py +408 -0
- modules/amb/amb_core/models.py +139 -0
- modules/amb/amb_core/persistence.py +527 -0
- modules/amb/amb_core/schema.py +292 -0
- modules/amb/amb_core/tracing.py +356 -0
- modules/amb/examples/advanced_features.py +223 -0
- modules/amb/examples/backpressure_demo.py +225 -0
- modules/amb/examples/basic_usage.py +117 -0
- modules/amb/examples/tracing_demo.py +104 -0
- modules/amb/experiments/README.md +52 -0
- modules/amb/experiments/reproduce_results.py +467 -0
- modules/amb/experiments/results.json +324 -0
- modules/amb/paper/README.md +40 -0
- modules/amb/paper/paper.tex +365 -0
- modules/amb/paper/whitepaper.md +377 -0
- modules/amb/pyproject.toml +117 -0
- modules/amb/tests/__init__.py +1 -0
- modules/amb/tests/test_backpressure_priority.py +280 -0
- modules/amb/tests/test_bus.py +198 -0
- modules/amb/tests/test_cloudevents.py +443 -0
- modules/amb/tests/test_features.py +531 -0
- modules/amb/tests/test_models.py +74 -0
- modules/amb/tests/test_tracing.py +254 -0
- modules/atr/.github/workflows/ci.yml +101 -0
- modules/atr/.github/workflows/publish.yml +140 -0
- modules/atr/.gitignore +134 -0
- modules/atr/.pre-commit-config.yaml +37 -0
- modules/atr/CHANGELOG.md +39 -0
- modules/atr/CONTRIBUTING.md +96 -0
- modules/atr/IMPLEMENTATION_SUMMARY.md +143 -0
- modules/atr/README.md +180 -0
- modules/atr/atr/__init__.py +638 -0
- modules/atr/atr/access.py +346 -0
- modules/atr/atr/composition.py +643 -0
- modules/atr/atr/decorator.py +355 -0
- modules/atr/atr/executor.py +382 -0
- modules/atr/atr/health.py +555 -0
- modules/atr/atr/hf_utils.py +447 -0
- modules/atr/atr/injection.py +420 -0
- modules/atr/atr/metrics.py +438 -0
- modules/atr/atr/policies.py +401 -0
- modules/atr/atr/py.typed +2 -0
- modules/atr/atr/registry.py +450 -0
- modules/atr/atr/schema.py +478 -0
- modules/atr/atr/tools/safe/__init__.py +73 -0
- modules/atr/atr/tools/safe/calculator.py +380 -0
- modules/atr/atr/tools/safe/datetime_tool.py +441 -0
- modules/atr/atr/tools/safe/file_reader.py +400 -0
- modules/atr/atr/tools/safe/http_client.py +314 -0
- modules/atr/atr/tools/safe/json_parser.py +372 -0
- modules/atr/atr/tools/safe/text_tool.py +526 -0
- modules/atr/atr/tools/safe/toolkit.py +173 -0
- modules/atr/docs/PYPI_SETUP.md +113 -0
- modules/atr/examples/README.md +27 -0
- modules/atr/examples/demo.py +144 -0
- modules/atr/examples/sandbox_demo.py +218 -0
- modules/atr/experiments/README.md +69 -0
- modules/atr/experiments/reproduce_results.py +509 -0
- modules/atr/experiments/results/.gitkeep +0 -0
- modules/atr/experiments/results/results_20260123_140334.json +71 -0
- modules/atr/paper/README.md +36 -0
- modules/atr/paper/figures/.gitkeep +0 -0
- modules/atr/paper/references.bib +84 -0
- modules/atr/paper/structure.tex +293 -0
- modules/atr/paper/whitepaper.md +234 -0
- modules/atr/pyproject.toml +148 -0
- modules/atr/requirements.txt +1 -0
- modules/atr/setup.py +30 -0
- modules/atr/tests/__init__.py +1 -0
- modules/atr/tests/test_decorator.py +317 -0
- modules/atr/tests/test_executor.py +245 -0
- modules/atr/tests/test_integration_executor.py +184 -0
- modules/atr/tests/test_registry.py +312 -0
- modules/atr/tests/test_schema.py +182 -0
- modules/atr/tests/test_v2_features.py +708 -0
- modules/caas/.dockerignore +63 -0
- modules/caas/.github/ISSUE_TEMPLATE/bug_report.md +38 -0
- modules/caas/.github/ISSUE_TEMPLATE/custom.md +10 -0
- modules/caas/.github/ISSUE_TEMPLATE/feature_request.md +20 -0
- modules/caas/.github/workflows/ci.yml +100 -0
- modules/caas/.github/workflows/lint.yml +39 -0
- modules/caas/.github/workflows/publish-pypi.yml +124 -0
- modules/caas/.gitignore +73 -0
- modules/caas/.pre-commit-config.yaml +33 -0
- modules/caas/CHANGELOG.md +58 -0
- modules/caas/CONTRIBUTING.md +346 -0
- modules/caas/Dockerfile +41 -0
- modules/caas/LICENSE +21 -0
- modules/caas/MANIFEST.in +11 -0
- modules/caas/README.md +158 -0
- modules/caas/benchmarks/README.md +255 -0
- modules/caas/benchmarks/create_hf_dataset.py +502 -0
- modules/caas/benchmarks/data/sample_corpus/README.md +86 -0
- modules/caas/benchmarks/data/sample_corpus/auth_module.py +211 -0
- modules/caas/benchmarks/data/sample_corpus/contribution_guide.md +185 -0
- modules/caas/benchmarks/data/sample_corpus/remote_work_policy.html +57 -0
- modules/caas/benchmarks/hf_dataset/README.md +214 -0
- modules/caas/benchmarks/hf_dataset/caas_benchmark_corpus.py +73 -0
- modules/caas/benchmarks/hf_dataset/corpus_preview.json +193 -0
- modules/caas/benchmarks/results/README.md +66 -0
- modules/caas/benchmarks/results/evaluation_2026-01-20.json +121 -0
- modules/caas/benchmarks/run_evaluation.py +561 -0
- modules/caas/benchmarks/statistical_tests.py +289 -0
- modules/caas/benchmarks/verify_sample_corpus.py +83 -0
- modules/caas/docker-compose.yml +38 -0
- modules/caas/docs/CONTEXT_TRIAD.md +462 -0
- modules/caas/docs/CONTRIBUTING.md +346 -0
- modules/caas/docs/ETHICS_AND_LIMITATIONS.md +336 -0
- modules/caas/docs/HEURISTIC_ROUTER.md +442 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY.md +363 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY_CONTEXT_TRIAD.md +277 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY_HEURISTIC_ROUTER.md +231 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY_METADATA_INJECTION.md +258 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY_PRAGMATIC_TRUTH.md +212 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY_TRUST_GATEWAY.md +319 -0
- modules/caas/docs/LAYER_1_PRIMITIVE.md +202 -0
- modules/caas/docs/METADATA_INJECTION.md +404 -0
- modules/caas/docs/PRAGMATIC_TRUTH.md +431 -0
- modules/caas/docs/RELATED_WORK.md +312 -0
- modules/caas/docs/RELEASE_CHECKLIST.md +219 -0
- modules/caas/docs/RELEASE_GUIDE.md +285 -0
- modules/caas/docs/REPRODUCIBILITY.md +386 -0
- modules/caas/docs/SLIDING_WINDOW.md +387 -0
- modules/caas/docs/STRUCTURE_AWARE_INDEXING.md +158 -0
- modules/caas/docs/TESTING.md +259 -0
- modules/caas/docs/THREAT_MODEL.md +247 -0
- modules/caas/docs/TRUST_GATEWAY.md +575 -0
- modules/caas/docs/VFS.md +298 -0
- modules/caas/examples/agents/enterprise_security_agent.py +414 -0
- modules/caas/examples/agents/intelligent_document_analyzer.py +380 -0
- modules/caas/examples/demos/demo.py +309 -0
- modules/caas/examples/demos/demo_context_triad.py +225 -0
- modules/caas/examples/demos/demo_conversation_manager.py +285 -0
- modules/caas/examples/demos/demo_heuristic_router.py +133 -0
- modules/caas/examples/demos/demo_metadata_injection.py +198 -0
- modules/caas/examples/demos/demo_pragmatic_truth.py +303 -0
- modules/caas/examples/demos/demo_structure_aware.py +140 -0
- modules/caas/examples/demos/demo_time_decay.py +247 -0
- modules/caas/examples/demos/demo_trust_gateway.py +383 -0
- modules/caas/examples/multi_agent/README.md +159 -0
- modules/caas/examples/multi_agent/research_team.py +369 -0
- modules/caas/examples/multi_agent/vfs_collaboration.py +393 -0
- modules/caas/examples/usage/auth_module.py +142 -0
- modules/caas/examples/usage/usage_example.py +173 -0
- modules/caas/experiments/README.md +42 -0
- modules/caas/experiments/reproduce_results.py +462 -0
- modules/caas/paper/ARXIV_METADATA.md +145 -0
- modules/caas/paper/ARXIV_README.md +47 -0
- modules/caas/paper/CHECKLIST.md +103 -0
- modules/caas/paper/GITHUB_RELEASE_NOTES.md +105 -0
- modules/caas/paper/README.md +71 -0
- modules/caas/paper/abstract.md +24 -0
- modules/caas/paper/arxiv_submission.tar +0 -0
- modules/caas/paper/arxiv_submission.zip +0 -0
- modules/caas/paper/build_pdf.py +355 -0
- modules/caas/paper/experiments.md +149 -0
- modules/caas/paper/figures/.gitkeep +0 -0
- modules/caas/paper/figures/README.md +237 -0
- modules/caas/paper/figures/fig1_system_architecture.png +0 -0
- modules/caas/paper/figures/fig1_system_architecture.svg +198 -0
- modules/caas/paper/figures/fig2_context_triad.png +0 -0
- modules/caas/paper/figures/fig2_context_triad.svg +105 -0
- modules/caas/paper/figures/fig3_ablation_results.png +0 -0
- modules/caas/paper/figures/fig3_ablation_results.svg +113 -0
- modules/caas/paper/figures/fig4_routing_latency.png +0 -0
- modules/caas/paper/figures/fig4_routing_latency.svg +97 -0
- modules/caas/paper/intro.md +103 -0
- modules/caas/paper/latex/figures/fig1_system_architecture.png +0 -0
- modules/caas/paper/latex/figures/fig2_context_triad.png +0 -0
- modules/caas/paper/latex/figures/fig3_ablation_results.png +0 -0
- modules/caas/paper/latex/figures/fig4_routing_latency.png +0 -0
- modules/caas/paper/latex/main.tex +468 -0
- modules/caas/paper/latex/references.bib +140 -0
- modules/caas/paper/method.md +350 -0
- modules/caas/paper/outline.md +123 -0
- modules/caas/paper/related_work.md +101 -0
- modules/caas/paper/tables/.gitkeep +0 -0
- modules/caas/paper/tables/results_tables.md +50 -0
- modules/caas/pyproject.toml +172 -0
- modules/caas/requirements.txt +11 -0
- modules/caas/src/caas/__init__.py +232 -0
- modules/caas/src/caas/api/__init__.py +7 -0
- modules/caas/src/caas/api/server.py +1326 -0
- modules/caas/src/caas/caching.py +832 -0
- modules/caas/src/caas/cli.py +208 -0
- modules/caas/src/caas/conversation.py +221 -0
- modules/caas/src/caas/decay.py +118 -0
- modules/caas/src/caas/detection/__init__.py +7 -0
- modules/caas/src/caas/detection/detector.py +236 -0
- modules/caas/src/caas/enrichment.py +127 -0
- modules/caas/src/caas/gateway/__init__.py +24 -0
- modules/caas/src/caas/gateway/trust_gateway.py +471 -0
- modules/caas/src/caas/hf_utils.py +477 -0
- modules/caas/src/caas/ingestion/__init__.py +21 -0
- modules/caas/src/caas/ingestion/processors.py +251 -0
- modules/caas/src/caas/ingestion/structure_parser.py +185 -0
- modules/caas/src/caas/models.py +354 -0
- modules/caas/src/caas/pragmatic_truth.py +441 -0
- modules/caas/src/caas/routing/__init__.py +8 -0
- modules/caas/src/caas/routing/heuristic_router.py +242 -0
- modules/caas/src/caas/storage/__init__.py +7 -0
- modules/caas/src/caas/storage/store.py +450 -0
- modules/caas/src/caas/triad.py +472 -0
- modules/caas/src/caas/tuning/__init__.py +7 -0
- modules/caas/src/caas/tuning/tuner.py +322 -0
- modules/caas/src/caas/vfs/__init__.py +12 -0
- modules/caas/src/caas/vfs/filesystem.py +450 -0
- modules/caas/tests/__init__.py +3 -0
- modules/caas/tests/conftest.py +8 -0
- modules/caas/tests/test_caching.py +628 -0
- modules/caas/tests/test_context_triad.py +385 -0
- modules/caas/tests/test_conversation_manager.py +289 -0
- modules/caas/tests/test_functionality.py +215 -0
- modules/caas/tests/test_heuristic_router.py +370 -0
- modules/caas/tests/test_metadata_injection.py +328 -0
- modules/caas/tests/test_pragmatic_truth.py +322 -0
- modules/caas/tests/test_structure_aware_indexing.py +283 -0
- modules/caas/tests/test_time_decay.py +268 -0
- modules/caas/tests/test_trust_gateway.py +445 -0
- modules/caas/tests/test_vfs.py +298 -0
- modules/cmvk/.github/FUNDING.yml +9 -0
- modules/cmvk/.github/dependabot.yml +54 -0
- modules/cmvk/.github/workflows/ci.yml +205 -0
- modules/cmvk/.github/workflows/publish.yml +143 -0
- modules/cmvk/.gitignore +147 -0
- modules/cmvk/.pre-commit-config.yaml +58 -0
- modules/cmvk/CHANGELOG.md +146 -0
- modules/cmvk/CITATION.cff +48 -0
- modules/cmvk/CONTRIBUTING.md +229 -0
- modules/cmvk/Dockerfile +87 -0
- modules/cmvk/HF_MODEL_CARD.md +185 -0
- modules/cmvk/LICENSE +21 -0
- modules/cmvk/README.md +149 -0
- modules/cmvk/SECURITY.md +114 -0
- modules/cmvk/config/prompts/generator_v1.txt +23 -0
- modules/cmvk/config/prompts/verifier_hostile.txt +32 -0
- modules/cmvk/config/settings.yaml +40 -0
- modules/cmvk/coverage_html/.gitignore +2 -0
- modules/cmvk/coverage_html/class_index.html +658 -0
- modules/cmvk/coverage_html/coverage_html_cb_188fc9a4.js +735 -0
- modules/cmvk/coverage_html/favicon_32_cb_c827f16f.png +0 -0
- modules/cmvk/coverage_html/function_index.html +1978 -0
- modules/cmvk/coverage_html/index.html +255 -0
- modules/cmvk/coverage_html/keybd_closed_cb_900cfef5.png +0 -0
- modules/cmvk/coverage_html/status.json +1 -0
- modules/cmvk/coverage_html/style_cb_5c747636.css +389 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38___init___py.html +315 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_audit_py.html +499 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_benchmarks_py.html +575 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_constitutional_py.html +1001 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_hf_utils_py.html +398 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_metrics_py.html +570 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_profiles_py.html +397 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_types_py.html +109 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_verification_py.html +1053 -0
- modules/cmvk/docs/DIAGRAMS.md +325 -0
- modules/cmvk/docs/architecture.md +345 -0
- modules/cmvk/docs/features.md +308 -0
- modules/cmvk/docs/getting_started.md +279 -0
- modules/cmvk/docs/innovation_layer.md +377 -0
- modules/cmvk/docs/safety.md +281 -0
- modules/cmvk/docs/traceability.md +150 -0
- modules/cmvk/examples/basic_example.py +62 -0
- modules/cmvk/examples/demo_complete_pipeline.py +209 -0
- modules/cmvk/examples/demo_innovation_layer.py +197 -0
- modules/cmvk/examples/example.py +112 -0
- modules/cmvk/examples/model_diversity_comparison.py +110 -0
- modules/cmvk/examples/real_api_integration.py +121 -0
- modules/cmvk/examples/test_full_pipeline.py +303 -0
- modules/cmvk/experiments/FEATURE_2_LATERAL_THINKING.md +187 -0
- modules/cmvk/experiments/README.md +216 -0
- modules/cmvk/experiments/ablation_runner.py +666 -0
- modules/cmvk/experiments/baseline_runner.py +158 -0
- modules/cmvk/experiments/blind_spot_benchmark.py +364 -0
- modules/cmvk/experiments/datasets/README.md +85 -0
- modules/cmvk/experiments/datasets/humaneval_50.json +352 -0
- modules/cmvk/experiments/datasets/humaneval_full.json +1150 -0
- modules/cmvk/experiments/datasets/humaneval_sample.json +32 -0
- modules/cmvk/experiments/datasets/sabotage.json +262 -0
- modules/cmvk/experiments/datasets/sample.json +40 -0
- modules/cmvk/experiments/demo_with_traces.py +110 -0
- modules/cmvk/experiments/efficiency_curve.py +259 -0
- modules/cmvk/experiments/experiment_runner.py +243 -0
- modules/cmvk/experiments/paper_data_generator.py +183 -0
- modules/cmvk/experiments/reproduce_results.py +407 -0
- modules/cmvk/experiments/reproducible_runner.py +352 -0
- modules/cmvk/experiments/sabotage_stress_test.py +311 -0
- modules/cmvk/experiments/test_lateral_thinking.py +116 -0
- modules/cmvk/experiments/test_prosecutor.py +41 -0
- modules/cmvk/experiments/visualize_results.py +735 -0
- modules/cmvk/logs/traces/demo_HumanEval_0_20260121-204900.json +36 -0
- modules/cmvk/notebooks/analysis.ipynb +124 -0
- modules/cmvk/paper/PAPER.md +561 -0
- modules/cmvk/paper/arxiv_checklist.md +230 -0
- modules/cmvk/paper/cmvk_neurips.aux +77 -0
- modules/cmvk/paper/cmvk_neurips.bbl +81 -0
- modules/cmvk/paper/cmvk_neurips.blg +48 -0
- modules/cmvk/paper/cmvk_neurips.out +16 -0
- modules/cmvk/paper/cmvk_neurips.pdf +0 -0
- modules/cmvk/paper/cmvk_neurips.tex +309 -0
- modules/cmvk/paper/figures/ablation.png +0 -0
- modules/cmvk/paper/figures/ablation.svg +39 -0
- modules/cmvk/paper/figures/architecture.png +0 -0
- modules/cmvk/paper/figures/architecture.svg +115 -0
- modules/cmvk/paper/figures/results_bar.png +0 -0
- modules/cmvk/paper/figures/results_bar.svg +70 -0
- modules/cmvk/paper/generate_figures.py +383 -0
- modules/cmvk/paper/neurips_2024.sty +101 -0
- modules/cmvk/paper/references.bib +98 -0
- modules/cmvk/paper/structure.tex +200 -0
- modules/cmvk/pyproject.toml +189 -0
- modules/cmvk/requirements-dev.txt +19 -0
- modules/cmvk/requirements.txt +14 -0
- modules/cmvk/src/cmvk/__init__.py +216 -0
- modules/cmvk/src/cmvk/audit.py +400 -0
- modules/cmvk/src/cmvk/benchmarks.py +476 -0
- modules/cmvk/src/cmvk/constitutional.py +902 -0
- modules/cmvk/src/cmvk/hf_utils.py +299 -0
- modules/cmvk/src/cmvk/metrics.py +471 -0
- modules/cmvk/src/cmvk/profiles.py +298 -0
- modules/cmvk/src/cmvk/py.typed +0 -0
- modules/cmvk/src/cmvk/types.py +10 -0
- modules/cmvk/src/cmvk/verification.py +954 -0
- modules/cmvk/src/cross_model_verification_kernel/__init__.py +91 -0
- modules/cmvk/src/cross_model_verification_kernel/__main__.py +10 -0
- modules/cmvk/src/cross_model_verification_kernel/agents/__init__.py +16 -0
- modules/cmvk/src/cross_model_verification_kernel/agents/base_agent.py +142 -0
- modules/cmvk/src/cross_model_verification_kernel/agents/generator_openai.py +223 -0
- modules/cmvk/src/cross_model_verification_kernel/agents/verifier_anthropic.py +448 -0
- modules/cmvk/src/cross_model_verification_kernel/agents/verifier_gemini.py +481 -0
- modules/cmvk/src/cross_model_verification_kernel/cli.py +570 -0
- modules/cmvk/src/cross_model_verification_kernel/core/__init__.py +26 -0
- modules/cmvk/src/cross_model_verification_kernel/core/graph_memory.py +308 -0
- modules/cmvk/src/cross_model_verification_kernel/core/kernel.py +413 -0
- modules/cmvk/src/cross_model_verification_kernel/core/trace_logger.py +75 -0
- modules/cmvk/src/cross_model_verification_kernel/core/types.py +121 -0
- modules/cmvk/src/cross_model_verification_kernel/datasets/__init__.py +20 -0
- modules/cmvk/src/cross_model_verification_kernel/datasets/humaneval_loader.py +271 -0
- modules/cmvk/src/cross_model_verification_kernel/generator.py +118 -0
- modules/cmvk/src/cross_model_verification_kernel/kernel.py +292 -0
- modules/cmvk/src/cross_model_verification_kernel/models.py +111 -0
- modules/cmvk/src/cross_model_verification_kernel/py.typed +1 -0
- modules/cmvk/src/cross_model_verification_kernel/simple_kernel.py +185 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/__init__.py +94 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/huggingface_upload.py +394 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/sandbox.py +159 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/statistics.py +468 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/visualizer.py +312 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/web_search.py +86 -0
- modules/cmvk/src/cross_model_verification_kernel/verifier.py +257 -0
- modules/cmvk/tests/__init__.py +3 -0
- modules/cmvk/tests/conftest.py +61 -0
- modules/cmvk/tests/integration/__init__.py +1 -0
- modules/cmvk/tests/integration/test_anthropic_verifier.py +269 -0
- modules/cmvk/tests/integration/test_integration.py +53 -0
- modules/cmvk/tests/integration/test_lateral_thinking_integration.py +199 -0
- modules/cmvk/tests/integration/test_lateral_thinking_witness.py +208 -0
- modules/cmvk/tests/integration/test_prosecutor_mode.py +131 -0
- modules/cmvk/tests/test_constitutional.py +611 -0
- modules/cmvk/tests/test_enhanced_features.py +603 -0
- modules/cmvk/tests/test_verification.py +255 -0
- modules/cmvk/tests/unit/__init__.py +1 -0
- modules/cmvk/tests/unit/test_agents.py +64 -0
- modules/cmvk/tests/unit/test_cli.py +224 -0
- modules/cmvk/tests/unit/test_core.py +126 -0
- modules/cmvk/tests/unit/test_humaneval_loader.py +197 -0
- modules/cmvk/tests/unit/test_kernel.py +255 -0
- modules/cmvk/tests/unit/test_reproducibility.py +160 -0
- modules/cmvk/tests/unit/test_trace_logger.py +115 -0
- modules/cmvk/tests/unit/test_visualizer.py +218 -0
- modules/control-plane/.github/ISSUE_TEMPLATE/bug_report.yml +82 -0
- modules/control-plane/.github/ISSUE_TEMPLATE/config.yml +11 -0
- modules/control-plane/.github/ISSUE_TEMPLATE/feature_request.yml +104 -0
- modules/control-plane/.github/ISSUE_TEMPLATE/question.yml +70 -0
- modules/control-plane/.github/ISSUE_TEMPLATE/security_vulnerability.yml +84 -0
- modules/control-plane/.github/discussions.yml +73 -0
- modules/control-plane/.github/pull_request_template.md +82 -0
- modules/control-plane/.github/workflows/publish.yml +146 -0
- modules/control-plane/.github/workflows/release.yml +39 -0
- modules/control-plane/.github/workflows/tests.yml +58 -0
- modules/control-plane/.gitignore +55 -0
- modules/control-plane/CHANGELOG.md +203 -0
- modules/control-plane/CONTRIBUTING.md +311 -0
- modules/control-plane/CONTRIBUTORS.md +88 -0
- modules/control-plane/Dockerfile +82 -0
- modules/control-plane/LICENSE +21 -0
- modules/control-plane/MANIFEST.in +17 -0
- modules/control-plane/README.md +1264 -0
- modules/control-plane/ROADMAP.md +228 -0
- modules/control-plane/SECURITY.md +210 -0
- modules/control-plane/SUPPORT.md +106 -0
- modules/control-plane/acp-cli.py +212 -0
- modules/control-plane/benchmark/README.md +257 -0
- modules/control-plane/benchmark/__init__.py +19 -0
- modules/control-plane/benchmark/red_team_dataset.py +517 -0
- modules/control-plane/benchmark.py +563 -0
- modules/control-plane/build_and_publish.sh +130 -0
- modules/control-plane/docker-compose.yml +74 -0
- modules/control-plane/docs/ABLATION_STUDIES.md +528 -0
- modules/control-plane/docs/ADAPTER_GUIDE.md +544 -0
- modules/control-plane/docs/ADVANCED_FEATURES.md +543 -0
- modules/control-plane/docs/AIOS_COMPARISON.md +296 -0
- modules/control-plane/docs/BIBLIOGRAPHY.md +367 -0
- modules/control-plane/docs/CASE_STUDIES.md +645 -0
- modules/control-plane/docs/DOCKER_DEPLOYMENT.md +184 -0
- modules/control-plane/docs/ECOSYSTEM_STATUS.md +98 -0
- modules/control-plane/docs/HF_MODEL_CARD.md +168 -0
- modules/control-plane/docs/KERNEL_V1_RELEASE.md +454 -0
- modules/control-plane/docs/LAYER3_FRAMEWORK.md +227 -0
- modules/control-plane/docs/LIMITATIONS.md +523 -0
- modules/control-plane/docs/PYPI_PUBLISHING.md +195 -0
- modules/control-plane/docs/README.md +58 -0
- modules/control-plane/docs/RELATED_WORK.md +319 -0
- modules/control-plane/docs/RELEASE_v1.1.0.md +252 -0
- modules/control-plane/docs/REPRODUCIBILITY.md +540 -0
- modules/control-plane/docs/RESEARCH_FOUNDATION.md +197 -0
- modules/control-plane/docs/api/CORE.md +270 -0
- modules/control-plane/docs/architecture/architecture.md +120 -0
- modules/control-plane/docs/community/ANNOUNCEMENT_TEMPLATES.md +52 -0
- modules/control-plane/docs/guides/IMPLEMENTATION.md +225 -0
- modules/control-plane/docs/guides/PHILOSOPHY.md +354 -0
- modules/control-plane/docs/guides/QUICKSTART.md +217 -0
- modules/control-plane/examples/README.md +138 -0
- modules/control-plane/examples/a2a_demo.py +410 -0
- modules/control-plane/examples/adapter_demo.py +347 -0
- modules/control-plane/examples/advanced_features.py +403 -0
- modules/control-plane/examples/basic_usage.py +261 -0
- modules/control-plane/examples/benchmark_demo.py +186 -0
- modules/control-plane/examples/compliance_demo.py +333 -0
- modules/control-plane/examples/configuration.py +265 -0
- modules/control-plane/examples/getting_started.py +178 -0
- modules/control-plane/examples/hibernation_and_time_travel_demo.py +406 -0
- modules/control-plane/examples/interactive_tutorial.ipynb +497 -0
- modules/control-plane/examples/kernel_interceptor_demo.py +202 -0
- modules/control-plane/examples/kernel_v1_demo.py +273 -0
- modules/control-plane/examples/langchain_demo.py +281 -0
- modules/control-plane/examples/lifecycle_demo.py +724 -0
- modules/control-plane/examples/mcp_demo.py +378 -0
- modules/control-plane/examples/ml_safety_demo.py +157 -0
- modules/control-plane/examples/multimodal_demo.py +347 -0
- modules/control-plane/examples/observability_demo.py +370 -0
- modules/control-plane/examples/use_cases.py +336 -0
- modules/control-plane/experiments/long_horizon_purge.py +235 -0
- modules/control-plane/experiments/multi_agent_rag.py +165 -0
- modules/control-plane/experiments/reproduce_results.py +667 -0
- modules/control-plane/paper/ARXIV_SUBMISSION_INFO.txt +122 -0
- modules/control-plane/paper/ETHICS_STATEMENT.md +248 -0
- modules/control-plane/paper/PAPER_CHECKLIST.md +72 -0
- modules/control-plane/paper/Paper.pdf +0 -0
- modules/control-plane/paper/README.md +71 -0
- modules/control-plane/paper/appendix.md +152 -0
- modules/control-plane/paper/architecture.md +15 -0
- modules/control-plane/paper/arxiv/figures/ablation_chart.png +0 -0
- modules/control-plane/paper/arxiv/figures/architecture.png +0 -0
- modules/control-plane/paper/arxiv/figures/constraint_graphs.png +0 -0
- modules/control-plane/paper/arxiv/figures/results_chart.png +0 -0
- modules/control-plane/paper/arxiv/main.aux +97 -0
- modules/control-plane/paper/arxiv/main.bbl +112 -0
- modules/control-plane/paper/arxiv/main.blg +48 -0
- modules/control-plane/paper/arxiv/main.out +33 -0
- modules/control-plane/paper/arxiv/main.pdf +0 -0
- modules/control-plane/paper/arxiv/main.tex +479 -0
- modules/control-plane/paper/arxiv/references.bib +234 -0
- modules/control-plane/paper/arxiv_submission.tar +0 -0
- modules/control-plane/paper/arxiv_submission.zip +0 -0
- modules/control-plane/paper/build.sh +68 -0
- modules/control-plane/paper/figures/README.md +47 -0
- modules/control-plane/paper/figures/ablation_chart.pdf +0 -0
- modules/control-plane/paper/figures/ablation_chart.png +0 -0
- modules/control-plane/paper/figures/architecture.pdf +0 -0
- modules/control-plane/paper/figures/architecture.png +0 -0
- modules/control-plane/paper/figures/constraint_graphs.pdf +0 -0
- modules/control-plane/paper/figures/constraint_graphs.png +0 -0
- modules/control-plane/paper/figures/generate_figures.py +252 -0
- modules/control-plane/paper/figures/results_chart.pdf +0 -0
- modules/control-plane/paper/figures/results_chart.png +0 -0
- modules/control-plane/paper/main.md +273 -0
- modules/control-plane/paper/main.tex +214 -0
- modules/control-plane/paper/main_arxiv.aux +53 -0
- modules/control-plane/paper/main_arxiv.out +17 -0
- modules/control-plane/paper/main_arxiv.pdf +0 -0
- modules/control-plane/paper/main_arxiv.tex +264 -0
- modules/control-plane/paper/references.bib +234 -0
- modules/control-plane/pyproject.toml +124 -0
- modules/control-plane/reproducibility/ABLATIONS.md +136 -0
- modules/control-plane/reproducibility/README.md +288 -0
- modules/control-plane/reproducibility/commands.md +467 -0
- modules/control-plane/reproducibility/docker_config/Dockerfile +39 -0
- modules/control-plane/reproducibility/experiment_configs/purge_config.json +46 -0
- modules/control-plane/reproducibility/experiment_configs/rag_config.json +36 -0
- modules/control-plane/reproducibility/hardware_specs.md +317 -0
- modules/control-plane/reproducibility/requirements_frozen.txt +0 -0
- modules/control-plane/reproducibility/run_all_experiments.sh +45 -0
- modules/control-plane/reproducibility/seeds.json +106 -0
- modules/control-plane/scripts/prepare_pypi.py +46 -0
- modules/control-plane/scripts/prepare_release.py +176 -0
- modules/control-plane/scripts/upload_dataset_to_hf.py +316 -0
- modules/control-plane/setup.py +69 -0
- modules/control-plane/src/agent_control_plane/__init__.py +639 -0
- modules/control-plane/src/agent_control_plane/a2a_adapter.py +541 -0
- modules/control-plane/src/agent_control_plane/adapter.py +415 -0
- modules/control-plane/src/agent_control_plane/agent_hibernation.py +364 -0
- modules/control-plane/src/agent_control_plane/agent_kernel.py +464 -0
- modules/control-plane/src/agent_control_plane/compliance.py +718 -0
- modules/control-plane/src/agent_control_plane/constraint_graphs.py +475 -0
- modules/control-plane/src/agent_control_plane/control_plane.py +848 -0
- modules/control-plane/src/agent_control_plane/example_executors.py +193 -0
- modules/control-plane/src/agent_control_plane/execution_engine.py +229 -0
- modules/control-plane/src/agent_control_plane/flight_recorder.py +600 -0
- modules/control-plane/src/agent_control_plane/governance_layer.py +432 -0
- modules/control-plane/src/agent_control_plane/hf_utils.py +561 -0
- modules/control-plane/src/agent_control_plane/interfaces/__init__.py +53 -0
- modules/control-plane/src/agent_control_plane/interfaces/kernel_interface.py +359 -0
- modules/control-plane/src/agent_control_plane/interfaces/plugin_interface.py +495 -0
- modules/control-plane/src/agent_control_plane/interfaces/protocol_interfaces.py +385 -0
- modules/control-plane/src/agent_control_plane/kernel_space.py +707 -0
- modules/control-plane/src/agent_control_plane/langchain_adapter.py +422 -0
- modules/control-plane/src/agent_control_plane/lifecycle.py +3111 -0
- modules/control-plane/src/agent_control_plane/mcp_adapter.py +517 -0
- modules/control-plane/src/agent_control_plane/ml_safety.py +560 -0
- modules/control-plane/src/agent_control_plane/multimodal.py +724 -0
- modules/control-plane/src/agent_control_plane/mute_agent.py +419 -0
- modules/control-plane/src/agent_control_plane/observability.py +785 -0
- modules/control-plane/src/agent_control_plane/orchestrator.py +480 -0
- modules/control-plane/src/agent_control_plane/plugin_registry.py +748 -0
- modules/control-plane/src/agent_control_plane/policy_engine.py +525 -0
- modules/control-plane/src/agent_control_plane/shadow_mode.py +307 -0
- modules/control-plane/src/agent_control_plane/signals.py +491 -0
- modules/control-plane/src/agent_control_plane/supervisor_agents.py +427 -0
- modules/control-plane/src/agent_control_plane/time_travel_debugger.py +554 -0
- modules/control-plane/src/agent_control_plane/tool_registry.py +350 -0
- modules/control-plane/src/agent_control_plane/vfs.py +695 -0
- modules/control-plane/tests/README.md +33 -0
- modules/control-plane/tests/test_a2a_adapter.py +336 -0
- modules/control-plane/tests/test_adapter.py +422 -0
- modules/control-plane/tests/test_advanced_features.py +389 -0
- modules/control-plane/tests/test_benchmark.py +223 -0
- modules/control-plane/tests/test_compliance.py +214 -0
- modules/control-plane/tests/test_control_plane.py +295 -0
- modules/control-plane/tests/test_hibernation.py +274 -0
- modules/control-plane/tests/test_kernel_interception.py +284 -0
- modules/control-plane/tests/test_langchain_adapter.py +258 -0
- modules/control-plane/tests/test_lifecycle.py +1174 -0
- modules/control-plane/tests/test_mcp_adapter.py +293 -0
- modules/control-plane/tests/test_ml_safety.py +142 -0
- modules/control-plane/tests/test_multimodal.py +317 -0
- modules/control-plane/tests/test_new_features.py +435 -0
- modules/control-plane/tests/test_observability.py +338 -0
- modules/control-plane/tests/test_time_travel.py +387 -0
- modules/emk/.github/workflows/ci.yml +105 -0
- modules/emk/.github/workflows/publish.yml +144 -0
- modules/emk/.gitignore +74 -0
- modules/emk/CHANGELOG.md +41 -0
- modules/emk/CONTRIBUTING.md +295 -0
- modules/emk/IMPLEMENTATION.md +174 -0
- modules/emk/LICENSE +21 -0
- modules/emk/MANIFEST.in +8 -0
- modules/emk/README.md +135 -0
- modules/emk/RELEASE_NOTES.md +82 -0
- modules/emk/SECURITY.md +52 -0
- modules/emk/codecov.yml +39 -0
- modules/emk/docs/MEMORY_MANAGEMENT.md +285 -0
- modules/emk/emk/__init__.py +106 -0
- modules/emk/emk/hf_utils.py +419 -0
- modules/emk/emk/indexer.py +144 -0
- modules/emk/emk/py.typed +0 -0
- modules/emk/emk/schema.py +204 -0
- modules/emk/emk/sleep_cycle.py +345 -0
- modules/emk/emk/store.py +479 -0
- modules/emk/examples/basic_usage.py +123 -0
- modules/emk/examples/memory_features_demo.py +154 -0
- modules/emk/experiments/README.md +59 -0
- modules/emk/experiments/reproduce_results.py +461 -0
- modules/emk/experiments/results.json +61 -0
- modules/emk/paper/structure.tex +192 -0
- modules/emk/paper/whitepaper.md +273 -0
- modules/emk/pyproject.toml +91 -0
- modules/emk/setup.py +5 -0
- modules/emk/tests/test_file_adapter.py +195 -0
- modules/emk/tests/test_indexer.py +174 -0
- modules/emk/tests/test_init.py +55 -0
- modules/emk/tests/test_negative_memory.py +83 -0
- modules/emk/tests/test_schema.py +150 -0
- modules/emk/tests/test_semantic_rules.py +175 -0
- modules/emk/tests/test_sleep_cycle.py +335 -0
- modules/emk/tests/test_store_anti_patterns.py +239 -0
- modules/iatp/.github/workflows/docker-build.yml +124 -0
- modules/iatp/.github/workflows/publish.yml +174 -0
- modules/iatp/.github/workflows/python-package.yml +121 -0
- modules/iatp/.gitignore +67 -0
- modules/iatp/.pre-commit-config.yaml +64 -0
- modules/iatp/CHANGELOG.md +120 -0
- modules/iatp/Dockerfile +91 -0
- modules/iatp/IMPLEMENTATION_SUMMARY.md +218 -0
- modules/iatp/MANIFEST.in +9 -0
- modules/iatp/README.md +180 -0
- modules/iatp/docker/Dockerfile.agent +27 -0
- modules/iatp/docker/Dockerfile.sidecar-python +86 -0
- modules/iatp/docker/README.md +258 -0
- modules/iatp/docker-compose.yml +194 -0
- modules/iatp/docs/ARCHITECTURE.md +243 -0
- modules/iatp/docs/CLI_GUIDE.md +220 -0
- modules/iatp/docs/DEPLOYMENT.md +304 -0
- modules/iatp/examples/README.md +132 -0
- modules/iatp/examples/backend_agent.py +39 -0
- modules/iatp/examples/client.py +168 -0
- modules/iatp/examples/demo_attestation_reputation.py +274 -0
- modules/iatp/examples/demo_client.py +240 -0
- modules/iatp/examples/demo_rbac.py +143 -0
- modules/iatp/examples/integration_demo.py +245 -0
- modules/iatp/examples/manifests/coder_agent.json +20 -0
- modules/iatp/examples/manifests/reviewer_agent.json +19 -0
- modules/iatp/examples/manifests/secure_bank.json +14 -0
- modules/iatp/examples/manifests/standard_agent.json +14 -0
- modules/iatp/examples/manifests/untrusted_honeypot.json +14 -0
- modules/iatp/examples/run_secure_bank_sidecar.py +85 -0
- modules/iatp/examples/run_sidecar.py +105 -0
- modules/iatp/examples/run_untrusted_sidecar.py +77 -0
- modules/iatp/examples/secure_bank_agent.py +138 -0
- modules/iatp/examples/test_untrusted.py +82 -0
- modules/iatp/examples/untrusted_agent.py +119 -0
- modules/iatp/experiments/README.md +58 -0
- modules/iatp/experiments/cascading_hallucination/README.md +149 -0
- modules/iatp/experiments/cascading_hallucination/agent_a_user.py +41 -0
- modules/iatp/experiments/cascading_hallucination/agent_b_summarizer.py +54 -0
- modules/iatp/experiments/cascading_hallucination/agent_c_database.py +47 -0
- modules/iatp/experiments/cascading_hallucination/proof_of_concept.py +290 -0
- modules/iatp/experiments/cascading_hallucination/run_experiment.py +226 -0
- modules/iatp/experiments/cascading_hallucination/sidecar_c.py +61 -0
- modules/iatp/experiments/reproduce_results.py +574 -0
- modules/iatp/experiments/results.json +2336 -0
- modules/iatp/iatp/__init__.py +164 -0
- modules/iatp/iatp/attestation.py +401 -0
- modules/iatp/iatp/cli.py +253 -0
- modules/iatp/iatp/hf_utils.py +469 -0
- modules/iatp/iatp/ipc_pipes.py +578 -0
- modules/iatp/iatp/main.py +410 -0
- modules/iatp/iatp/models/__init__.py +445 -0
- modules/iatp/iatp/policy_engine.py +335 -0
- modules/iatp/iatp/py.typed +2 -0
- modules/iatp/iatp/recovery.py +319 -0
- modules/iatp/iatp/security/__init__.py +268 -0
- modules/iatp/iatp/sidecar/__init__.py +517 -0
- modules/iatp/iatp/telemetry/__init__.py +162 -0
- modules/iatp/iatp/tests/__init__.py +1 -0
- modules/iatp/iatp/tests/test_attestation.py +368 -0
- modules/iatp/iatp/tests/test_cli.py +129 -0
- modules/iatp/iatp/tests/test_models.py +128 -0
- modules/iatp/iatp/tests/test_policy_engine.py +345 -0
- modules/iatp/iatp/tests/test_recovery.py +279 -0
- modules/iatp/iatp/tests/test_security.py +220 -0
- modules/iatp/iatp/tests/test_sidecar.py +165 -0
- modules/iatp/iatp/tests/test_telemetry.py +173 -0
- modules/iatp/paper/BLOG.md +307 -0
- modules/iatp/paper/PAPER.md +236 -0
- modules/iatp/paper/RFC_SUBMISSION.md +299 -0
- modules/iatp/paper/whitepaper.md +369 -0
- modules/iatp/proto/README.md +200 -0
- modules/iatp/proto/generate_stubs.py +81 -0
- modules/iatp/proto/iatp.proto +552 -0
- modules/iatp/pyproject.toml +180 -0
- modules/iatp/requirements-dev.txt +2 -0
- modules/iatp/requirements.txt +6 -0
- modules/iatp/setup.py +60 -0
- modules/iatp/sidecar/README.md +487 -0
- modules/iatp/sidecar/go/Dockerfile +32 -0
- modules/iatp/sidecar/go/README.md +237 -0
- modules/iatp/sidecar/go/go.mod +8 -0
- modules/iatp/sidecar/go/main.go +488 -0
- modules/iatp/spec/001-handshake.md +436 -0
- modules/iatp/spec/002-reversibility.md +394 -0
- modules/iatp/spec/schema/capability_manifest.json +266 -0
- modules/iatp/test_integration.py +310 -0
- modules/mcp-kernel-server/README.md +261 -0
- modules/mcp-kernel-server/pyproject.toml +60 -0
- modules/mcp-kernel-server/src/mcp_kernel_server/__init__.py +26 -0
- modules/mcp-kernel-server/src/mcp_kernel_server/cli.py +229 -0
- modules/mcp-kernel-server/src/mcp_kernel_server/resources.py +215 -0
- modules/mcp-kernel-server/src/mcp_kernel_server/server.py +562 -0
- modules/mcp-kernel-server/src/mcp_kernel_server/tools.py +1172 -0
- modules/mute-agent/.github/workflows/safety_check.yml +45 -0
- modules/mute-agent/.gitignore +53 -0
- modules/mute-agent/ARCHITECTURE.md +531 -0
- modules/mute-agent/BENCHMARK_GUIDE.md +384 -0
- modules/mute-agent/COMPLETION_SUMMARY.md +293 -0
- modules/mute-agent/EXPERIMENT_SUMMARY.md +318 -0
- modules/mute-agent/IMPLEMENTATION_SUMMARY.md +212 -0
- modules/mute-agent/LICENSE +21 -0
- modules/mute-agent/PHASE3_SUMMARY.md +297 -0
- modules/mute-agent/README.md +360 -0
- modules/mute-agent/STEEL_MAN_RESULTS.md +353 -0
- modules/mute-agent/USAGE.md +505 -0
- modules/mute-agent/V2_IMPLEMENTATION_SUMMARY.md +253 -0
- modules/mute-agent/V2_STEEL_MAN_IMPLEMENTATION.md +274 -0
- modules/mute-agent/VERIFICATION_REPORT.md +435 -0
- modules/mute-agent/charts/cost_comparison.png +0 -0
- modules/mute-agent/charts/cost_vs_ambiguity.png +0 -0
- modules/mute-agent/charts/metrics_comparison.png +0 -0
- modules/mute-agent/charts/scenario_breakdown.png +0 -0
- modules/mute-agent/charts/trace_attack_blocked.html +140 -0
- modules/mute-agent/charts/trace_attack_blocked.png +0 -0
- modules/mute-agent/charts/trace_failure.html +140 -0
- modules/mute-agent/charts/trace_failure.png +0 -0
- modules/mute-agent/charts/trace_success.html +140 -0
- modules/mute-agent/charts/trace_success.png +0 -0
- modules/mute-agent/examples/__init__.py +1 -0
- modules/mute-agent/examples/advanced_example.py +384 -0
- modules/mute-agent/examples/graph_debugger_demo.py +241 -0
- modules/mute-agent/examples/listener_example.py +297 -0
- modules/mute-agent/examples/simple_example.py +242 -0
- modules/mute-agent/examples/steel_man_demo.py +297 -0
- modules/mute-agent/experiments/README.md +135 -0
- modules/mute-agent/experiments/__init__.py +3 -0
- modules/mute-agent/experiments/agent_comparison.csv +6 -0
- modules/mute-agent/experiments/agent_comparison_50runs.csv +6 -0
- modules/mute-agent/experiments/ambiguity_test.py +335 -0
- modules/mute-agent/experiments/ambiguity_test_results.csv +31 -0
- modules/mute-agent/experiments/ambiguity_test_results_50runs.csv +51 -0
- modules/mute-agent/experiments/baseline_agent.py +189 -0
- modules/mute-agent/experiments/benchmark.py +402 -0
- modules/mute-agent/experiments/demo.py +172 -0
- modules/mute-agent/experiments/generate_cost_curve.py +474 -0
- modules/mute-agent/experiments/jailbreak_test.py +137 -0
- modules/mute-agent/experiments/latent_state_scenario.py +361 -0
- modules/mute-agent/experiments/mute_agent_experiment.py +349 -0
- modules/mute-agent/experiments/run_extended_experiment.py +40 -0
- modules/mute-agent/experiments/run_v2_experiments.py +266 -0
- modules/mute-agent/experiments/run_v2_experiments_auto.py +247 -0
- modules/mute-agent/experiments/v2_scenarios/README.md +214 -0
- modules/mute-agent/experiments/v2_scenarios/__init__.py +4 -0
- modules/mute-agent/experiments/v2_scenarios/scenario_1_deep_dependency.py +325 -0
- modules/mute-agent/experiments/v2_scenarios/scenario_2_adversarial.py +328 -0
- modules/mute-agent/experiments/v2_scenarios/scenario_3_false_positive.py +303 -0
- modules/mute-agent/experiments/v2_scenarios/scenario_4_performance.py +319 -0
- modules/mute-agent/experiments/visualize.py +400 -0
- modules/mute-agent/mute_agent/__init__.py +66 -0
- modules/mute-agent/mute_agent/core/__init__.py +1 -0
- modules/mute-agent/mute_agent/core/execution_agent.py +164 -0
- modules/mute-agent/mute_agent/core/handshake_protocol.py +199 -0
- modules/mute-agent/mute_agent/core/reasoning_agent.py +236 -0
- modules/mute-agent/mute_agent/knowledge_graph/__init__.py +1 -0
- modules/mute-agent/mute_agent/knowledge_graph/graph_elements.py +63 -0
- modules/mute-agent/mute_agent/knowledge_graph/multidimensional_graph.py +168 -0
- modules/mute-agent/mute_agent/knowledge_graph/subgraph.py +222 -0
- modules/mute-agent/mute_agent/listener/__init__.py +41 -0
- modules/mute-agent/mute_agent/listener/adapters/__init__.py +29 -0
- modules/mute-agent/mute_agent/listener/adapters/base_adapter.py +187 -0
- modules/mute-agent/mute_agent/listener/adapters/caas_adapter.py +342 -0
- modules/mute-agent/mute_agent/listener/adapters/control_plane_adapter.py +434 -0
- modules/mute-agent/mute_agent/listener/adapters/iatp_adapter.py +330 -0
- modules/mute-agent/mute_agent/listener/adapters/scak_adapter.py +249 -0
- modules/mute-agent/mute_agent/listener/listener.py +608 -0
- modules/mute-agent/mute_agent/listener/state_observer.py +434 -0
- modules/mute-agent/mute_agent/listener/threshold_config.py +311 -0
- modules/mute-agent/mute_agent/super_system/__init__.py +1 -0
- modules/mute-agent/mute_agent/super_system/router.py +202 -0
- modules/mute-agent/mute_agent/visualization/__init__.py +8 -0
- modules/mute-agent/mute_agent/visualization/graph_debugger.py +495 -0
- modules/mute-agent/requirements-dev.txt +6 -0
- modules/mute-agent/requirements.txt +9 -0
- modules/mute-agent/setup.py +64 -0
- modules/mute-agent/src/__init__.py +0 -0
- modules/mute-agent/src/agents/__init__.py +0 -0
- modules/mute-agent/src/agents/baseline_agent.py +524 -0
- modules/mute-agent/src/agents/interactive_agent.py +113 -0
- modules/mute-agent/src/agents/mute_agent.py +622 -0
- modules/mute-agent/src/benchmarks/__init__.py +0 -0
- modules/mute-agent/src/benchmarks/evaluator.py +481 -0
- modules/mute-agent/src/benchmarks/scenarios.json +985 -0
- modules/mute-agent/src/core/__init__.py +0 -0
- modules/mute-agent/src/core/mock_state.py +320 -0
- modules/mute-agent/src/core/tools.py +441 -0
- modules/nexus/__init__.py +49 -0
- modules/nexus/arbiter.py +357 -0
- modules/nexus/client.py +464 -0
- modules/nexus/dmz.py +417 -0
- modules/nexus/escrow.py +428 -0
- modules/nexus/exceptions.py +284 -0
- modules/nexus/registry.py +391 -0
- modules/nexus/reputation.py +423 -0
- modules/nexus/schemas/__init__.py +49 -0
- modules/nexus/schemas/compliance.py +274 -0
- modules/nexus/schemas/escrow.py +249 -0
- modules/nexus/schemas/manifest.py +223 -0
- modules/nexus/schemas/receipt.py +206 -0
- modules/observability/README.md +192 -0
- modules/observability/alertmanager/alertmanager.yml +116 -0
- modules/observability/alerts/agent-os-alerts.yaml +197 -0
- modules/observability/docker-compose.yml +128 -0
- modules/observability/grafana/dashboards/agent-os-amb.json +448 -0
- modules/observability/grafana/dashboards/agent-os-cmvk.json +441 -0
- modules/observability/grafana/dashboards/agent-os-overview.json +268 -0
- modules/observability/grafana/dashboards/agent-os-performance.json +15 -0
- modules/observability/grafana/dashboards/agent-os-safety.json +50 -0
- modules/observability/grafana/provisioning/dashboards/dashboards.yml +15 -0
- modules/observability/grafana/provisioning/datasources/datasources.yml +33 -0
- modules/observability/otel/otel-collector-config.yml +61 -0
- modules/observability/prometheus/prometheus.yml +63 -0
- modules/observability/pyproject.toml +53 -0
- modules/observability/scripts/export_dashboards.py +55 -0
- modules/observability/src/agent_os_observability/__init__.py +25 -0
- modules/observability/src/agent_os_observability/dashboards.py +896 -0
- modules/observability/src/agent_os_observability/metrics.py +396 -0
- modules/observability/src/agent_os_observability/server.py +221 -0
- modules/observability/src/agent_os_observability/tracer.py +226 -0
- modules/primitives/.gitignore +8 -0
- modules/primitives/README.md +62 -0
- modules/primitives/agent_primitives/__init__.py +22 -0
- modules/primitives/agent_primitives/failures.py +82 -0
- modules/primitives/agent_primitives/py.typed +0 -0
- modules/primitives/pyproject.toml +68 -0
- modules/scak/.github/copilot-instructions.md +396 -0
- modules/scak/.github/workflows/release.yml +117 -0
- modules/scak/.gitignore +32 -0
- modules/scak/CHANGELOG.md +173 -0
- modules/scak/CITATION.cff +62 -0
- modules/scak/CONTRIBUTING.md +429 -0
- modules/scak/Dockerfile +58 -0
- modules/scak/ENTERPRISE_FEATURES.md +518 -0
- modules/scak/IMPLEMENTATION_SUMMARY.md +206 -0
- modules/scak/LIMITATIONS.md +565 -0
- modules/scak/MANIFEST.in +16 -0
- modules/scak/NOVELTY.md +535 -0
- modules/scak/README.md +928 -0
- modules/scak/RESEARCH.md +670 -0
- modules/scak/agent_kernel/__init__.py +66 -0
- modules/scak/agent_kernel/analyzer.py +432 -0
- modules/scak/agent_kernel/auditor.py +31 -0
- modules/scak/agent_kernel/completeness_auditor.py +234 -0
- modules/scak/agent_kernel/detector.py +200 -0
- modules/scak/agent_kernel/kernel.py +741 -0
- modules/scak/agent_kernel/memory_manager.py +82 -0
- modules/scak/agent_kernel/models.py +372 -0
- modules/scak/agent_kernel/nudge_mechanism.py +260 -0
- modules/scak/agent_kernel/outcome_analyzer.py +335 -0
- modules/scak/agent_kernel/patcher.py +579 -0
- modules/scak/agent_kernel/semantic_analyzer.py +313 -0
- modules/scak/agent_kernel/semantic_purge.py +346 -0
- modules/scak/agent_kernel/simulator.py +447 -0
- modules/scak/agent_kernel/teacher.py +82 -0
- modules/scak/agent_kernel/triage.py +149 -0
- modules/scak/build_and_publish.ps1 +74 -0
- modules/scak/build_and_publish.sh +74 -0
- modules/scak/cli.py +471 -0
- modules/scak/dashboard.py +462 -0
- modules/scak/datasets/DATASET_CARD.md +219 -0
- modules/scak/datasets/README.md +143 -0
- modules/scak/datasets/gaia_vague_queries/vague_queries.json +262 -0
- modules/scak/datasets/hf_upload/README.md +219 -0
- modules/scak/datasets/hf_upload/scak_gaia_laziness.jsonl +50 -0
- modules/scak/datasets/prepare_hf_datasets.py +145 -0
- modules/scak/datasets/red_team/jailbreak_patterns.json +202 -0
- modules/scak/docker-compose.yml +99 -0
- modules/scak/docs/Adaptive-Memory-Hierarchy.md +319 -0
- modules/scak/docs/Data-Contracts-and-Schemas.md +285 -0
- modules/scak/docs/Dual-Loop-Architecture.md +344 -0
- modules/scak/docs/Enhanced-Features.md +612 -0
- modules/scak/docs/LANGCHAIN_INTEGRATION.md +572 -0
- modules/scak/docs/README.md +128 -0
- modules/scak/docs/Reference-Implementations.md +163 -0
- modules/scak/docs/SCAK_V2.md +374 -0
- modules/scak/docs/Three-Failure-Types.md +178 -0
- modules/scak/examples/basic_example.py +155 -0
- modules/scak/examples/circuit_breaker_lazy_eval_demo.py +243 -0
- modules/scak/examples/langchain_integration_example.py +339 -0
- modules/scak/examples/layer4_demo.py +243 -0
- modules/scak/examples/production_features_demo.py +353 -0
- modules/scak/examples/quick_demo.py +79 -0
- modules/scak/examples/scak_v2_demo.py +252 -0
- modules/scak/experiments/README.md +438 -0
- modules/scak/experiments/ablation_studies/README.md +192 -0
- modules/scak/experiments/ablation_studies/ablation_no_audit.py +116 -0
- modules/scak/experiments/ablation_studies/ablation_no_purge.py +133 -0
- modules/scak/experiments/chaos_engineering/README.md +332 -0
- modules/scak/experiments/context_efficiency_test.py +328 -0
- modules/scak/experiments/gaia_benchmark/README.md +208 -0
- modules/scak/experiments/laziness_benchmark.py +179 -0
- modules/scak/experiments/long_horizon_task_experiment.py +252 -0
- modules/scak/experiments/multi_agent_rag_experiment.py +284 -0
- modules/scak/experiments/results/ablation_table.md +12 -0
- modules/scak/experiments/results/long_horizon.json +36 -0
- modules/scak/experiments/results/multi_agent_rag.json +66 -0
- modules/scak/experiments/run_comprehensive_ablations.py +332 -0
- modules/scak/experiments/test_auditor_patcher_integration.py +251 -0
- modules/scak/notebooks/getting_started.ipynb +33 -0
- modules/scak/paper/ARXIV_SUBMISSION_METADATA.txt +109 -0
- modules/scak/paper/PAPER_CHECKLIST.md +304 -0
- modules/scak/paper/Paper.pdf +0 -0
- modules/scak/paper/README.md +113 -0
- modules/scak/paper/appendix.md +351 -0
- modules/scak/paper/arxiv/bibliography.bib +284 -0
- modules/scak/paper/arxiv/fig1_ooda_architecture.pdf +0 -0
- modules/scak/paper/arxiv/fig2_memory_hierarchy.pdf +0 -0
- modules/scak/paper/arxiv/fig3_gaia_results.pdf +0 -0
- modules/scak/paper/arxiv/fig4_ablation_heatmap.pdf +0 -0
- modules/scak/paper/arxiv/fig5_context_reduction.pdf +0 -0
- modules/scak/paper/arxiv/fig6_mttr_boxplot.pdf +0 -0
- modules/scak/paper/arxiv/main.aux +103 -0
- modules/scak/paper/arxiv/main.bbl +113 -0
- modules/scak/paper/arxiv/main.blg +55 -0
- modules/scak/paper/arxiv/main.out +31 -0
- modules/scak/paper/arxiv/main.pdf +0 -0
- modules/scak/paper/arxiv/main.tex +482 -0
- modules/scak/paper/arxiv_submission/bibliography.bib +284 -0
- modules/scak/paper/arxiv_submission/fig1_ooda_architecture.pdf +0 -0
- modules/scak/paper/arxiv_submission/fig2_memory_hierarchy.pdf +0 -0
- modules/scak/paper/arxiv_submission/fig3_gaia_results.pdf +0 -0
- modules/scak/paper/arxiv_submission/fig4_ablation_heatmap.pdf +0 -0
- modules/scak/paper/arxiv_submission/fig5_context_reduction.pdf +0 -0
- modules/scak/paper/arxiv_submission/fig6_mttr_boxplot.pdf +0 -0
- modules/scak/paper/arxiv_submission/main.aux +103 -0
- modules/scak/paper/arxiv_submission/main.bbl +113 -0
- modules/scak/paper/arxiv_submission/main.blg +55 -0
- modules/scak/paper/arxiv_submission/main.out +31 -0
- modules/scak/paper/arxiv_submission/main.pdf +0 -0
- modules/scak/paper/arxiv_submission/main.tex +482 -0
- modules/scak/paper/arxiv_submission.tar.gz +0 -0
- modules/scak/paper/bibliography.bib +284 -0
- modules/scak/paper/build.sh +55 -0
- modules/scak/paper/figures/README.md +32 -0
- modules/scak/paper/figures/fig1_ooda_architecture.md +75 -0
- modules/scak/paper/figures/fig1_ooda_architecture.pdf +0 -0
- modules/scak/paper/figures/fig1_ooda_architecture.png +0 -0
- modules/scak/paper/figures/fig2_memory_hierarchy.md +83 -0
- modules/scak/paper/figures/fig2_memory_hierarchy.pdf +0 -0
- modules/scak/paper/figures/fig2_memory_hierarchy.png +0 -0
- modules/scak/paper/figures/fig3_gaia_results.md +64 -0
- modules/scak/paper/figures/fig3_gaia_results.pdf +0 -0
- modules/scak/paper/figures/fig3_gaia_results.png +0 -0
- modules/scak/paper/figures/fig4_ablation_heatmap.md +64 -0
- modules/scak/paper/figures/fig4_ablation_heatmap.pdf +0 -0
- modules/scak/paper/figures/fig4_ablation_heatmap.png +0 -0
- modules/scak/paper/figures/fig5_context_reduction.md +71 -0
- modules/scak/paper/figures/fig5_context_reduction.pdf +0 -0
- modules/scak/paper/figures/fig5_context_reduction.png +0 -0
- modules/scak/paper/figures/fig6_mttr_boxplot.md +80 -0
- modules/scak/paper/figures/fig6_mttr_boxplot.pdf +0 -0
- modules/scak/paper/figures/fig6_mttr_boxplot.png +0 -0
- modules/scak/paper/figures/generate_figures.py +463 -0
- modules/scak/paper/main.aux +103 -0
- modules/scak/paper/main.bbl +113 -0
- modules/scak/paper/main.blg +55 -0
- modules/scak/paper/main.md +192 -0
- modules/scak/paper/main.out +31 -0
- modules/scak/paper/main.pdf +0 -0
- modules/scak/paper/main.tex +482 -0
- modules/scak/reproducibility/ABLATIONS.md +225 -0
- modules/scak/reproducibility/Dockerfile.reproducibility +34 -0
- modules/scak/reproducibility/README.md +421 -0
- modules/scak/reproducibility/requirements-pinned.txt +32 -0
- modules/scak/reproducibility/run_all_experiments.py +395 -0
- modules/scak/reproducibility/seed_control.py +53 -0
- modules/scak/reproducibility/statistical_analysis.py +302 -0
- modules/scak/requirements.txt +50 -0
- modules/scak/setup.py +93 -0
- modules/scak/src/__init__.py +124 -0
- modules/scak/src/agents/__init__.py +13 -0
- modules/scak/src/agents/conflict_resolution.py +732 -0
- modules/scak/src/agents/orchestrator.py +761 -0
- modules/scak/src/agents/pubsub.py +484 -0
- modules/scak/src/agents/shadow_teacher.py +344 -0
- modules/scak/src/agents/swarm.py +661 -0
- modules/scak/src/agents/worker.py +357 -0
- modules/scak/src/integrations/__init__.py +81 -0
- modules/scak/src/integrations/cmvk_adapter.py +430 -0
- modules/scak/src/integrations/control_plane_adapter.py +601 -0
- modules/scak/src/integrations/langchain_integration.py +902 -0
- modules/scak/src/interfaces/__init__.py +59 -0
- modules/scak/src/interfaces/llm_clients.py +505 -0
- modules/scak/src/interfaces/openapi_tools.py +611 -0
- modules/scak/src/interfaces/plugin_system.py +605 -0
- modules/scak/src/interfaces/protocols.py +365 -0
- modules/scak/src/interfaces/telemetry.py +464 -0
- modules/scak/src/interfaces/tool_registry.py +547 -0
- modules/scak/src/kernel/__init__.py +100 -0
- modules/scak/src/kernel/auditor.py +305 -0
- modules/scak/src/kernel/circuit_breaker.py +398 -0
- modules/scak/src/kernel/core.py +724 -0
- modules/scak/src/kernel/distributed.py +667 -0
- modules/scak/src/kernel/evolution.py +455 -0
- modules/scak/src/kernel/failover.py +621 -0
- modules/scak/src/kernel/governance.py +710 -0
- modules/scak/src/kernel/governance_v2.py +603 -0
- modules/scak/src/kernel/lazy_evaluator.py +514 -0
- modules/scak/src/kernel/load_testing.py +633 -0
- modules/scak/src/kernel/memory.py +945 -0
- modules/scak/src/kernel/patcher.py +581 -0
- modules/scak/src/kernel/rubric.py +419 -0
- modules/scak/src/kernel/schemas.py +390 -0
- modules/scak/src/kernel/skill_mapper.py +309 -0
- modules/scak/src/kernel/triage.py +149 -0
- modules/scak/src/mocks/__init__.py +99 -0
- modules/scak/tests/__init__.py +1 -0
- modules/scak/tests/test_circuit_breaker.py +403 -0
- modules/scak/tests/test_conflict_resolution.py +287 -0
- modules/scak/tests/test_dual_loop.py +463 -0
- modules/scak/tests/test_enhanced_features.py +421 -0
- modules/scak/tests/test_failover_and_load.py +438 -0
- modules/scak/tests/test_governance.py +185 -0
- modules/scak/tests/test_kernel.py +359 -0
- modules/scak/tests/test_langchain_integration.py +451 -0
- modules/scak/tests/test_lazy_evaluator.py +465 -0
- modules/scak/tests/test_llm_clients.py +122 -0
- modules/scak/tests/test_memory_controller.py +528 -0
- modules/scak/tests/test_orchestrator.py +181 -0
- modules/scak/tests/test_phase3_integration.py +265 -0
- modules/scak/tests/test_pubsub_swarm.py +203 -0
- modules/scak/tests/test_reference_implementations.py +240 -0
- modules/scak/tests/test_rubric.py +363 -0
- modules/scak/tests/test_scak_v2.py +651 -0
- modules/scak/tests/test_skill_mapper.py +217 -0
- modules/scak/tests/test_specific_failures.py +393 -0
- modules/scak/tests/test_tool_registry.py +264 -0
- modules/scak/tests/test_tools_and_plugins.py +303 -0
- modules/scak/tests/test_triage.py +596 -0
- modules/scak/tests/test_write_through.py +319 -0
- agent_os_kernel-1.1.0.dist-info/METADATA +0 -400
- agent_os_kernel-1.1.0.dist-info/RECORD +0 -12
- {agent_os_kernel-1.1.0.dist-info → agent_os_kernel-1.2.0.dist-info}/WHEEL +0 -0
- {agent_os_kernel-1.1.0.dist-info → agent_os_kernel-1.2.0.dist-info}/licenses/LICENSE +0 -0
|
@@ -0,0 +1,195 @@
|
|
|
1
|
+
# PyPI Publishing Guide
|
|
2
|
+
|
|
3
|
+
This guide explains how to publish Agent Control Plane to PyPI.
|
|
4
|
+
|
|
5
|
+
## Prerequisites
|
|
6
|
+
|
|
7
|
+
### 1. PyPI Account Setup
|
|
8
|
+
- Create accounts on:
|
|
9
|
+
- [PyPI](https://pypi.org/account/register/) (production)
|
|
10
|
+
- [Test PyPI](https://test.pypi.org/account/register/) (testing)
|
|
11
|
+
- Enable 2FA on both accounts
|
|
12
|
+
- Generate API tokens:
|
|
13
|
+
- PyPI: Account Settings → API tokens → Add API token (scope: entire account or specific project)
|
|
14
|
+
- Test PyPI: Same process
|
|
15
|
+
|
|
16
|
+
### 2. GitHub Secrets
|
|
17
|
+
Add the following secrets to the GitHub repository:
|
|
18
|
+
- `PYPI_API_TOKEN` - Your PyPI API token
|
|
19
|
+
- `TEST_PYPI_API_TOKEN` - Your Test PyPI API token
|
|
20
|
+
|
|
21
|
+
Go to: Repository Settings → Secrets and variables → Actions → New repository secret
|
|
22
|
+
|
|
23
|
+
## Publishing Workflow
|
|
24
|
+
|
|
25
|
+
### Option 1: Automatic Publishing (Recommended)
|
|
26
|
+
|
|
27
|
+
When you create a new GitHub release, the package is automatically published to PyPI.
|
|
28
|
+
|
|
29
|
+
1. **Update Version**
|
|
30
|
+
```bash
|
|
31
|
+
# Update version in both files:
|
|
32
|
+
# - pyproject.toml (line 7)
|
|
33
|
+
# - setup.py (line 16)
|
|
34
|
+
```
|
|
35
|
+
|
|
36
|
+
2. **Update CHANGELOG.md**
|
|
37
|
+
```bash
|
|
38
|
+
# Add new version section at the top
|
|
39
|
+
## [X.Y.Z] - YYYY-MM-DD
|
|
40
|
+
### Added
|
|
41
|
+
- New features...
|
|
42
|
+
### Changed
|
|
43
|
+
- Changes...
|
|
44
|
+
### Fixed
|
|
45
|
+
- Bug fixes...
|
|
46
|
+
```
|
|
47
|
+
|
|
48
|
+
3. **Create Git Tag**
|
|
49
|
+
```bash
|
|
50
|
+
git tag -a vX.Y.Z -m "Release version X.Y.Z"
|
|
51
|
+
git push origin vX.Y.Z
|
|
52
|
+
```
|
|
53
|
+
|
|
54
|
+
4. **Workflow Triggers**
|
|
55
|
+
- The `release.yml` workflow creates a GitHub release
|
|
56
|
+
- The `publish.yml` workflow publishes to PyPI
|
|
57
|
+
- Both workflows run automatically
|
|
58
|
+
|
|
59
|
+
### Option 2: Manual Publishing
|
|
60
|
+
|
|
61
|
+
For manual control or testing:
|
|
62
|
+
|
|
63
|
+
1. **Test Locally**
|
|
64
|
+
```bash
|
|
65
|
+
# Install build tools
|
|
66
|
+
pip install build twine
|
|
67
|
+
|
|
68
|
+
# Build the package
|
|
69
|
+
python -m build
|
|
70
|
+
|
|
71
|
+
# Check the build
|
|
72
|
+
twine check dist/*
|
|
73
|
+
|
|
74
|
+
# Test upload to Test PyPI
|
|
75
|
+
twine upload --repository testpypi dist/*
|
|
76
|
+
|
|
77
|
+
# Test installation from Test PyPI
|
|
78
|
+
pip install --index-url https://test.pypi.org/simple/ agent-control-plane
|
|
79
|
+
```
|
|
80
|
+
|
|
81
|
+
2. **Publish to PyPI**
|
|
82
|
+
```bash
|
|
83
|
+
# Upload to production PyPI
|
|
84
|
+
twine upload dist/*
|
|
85
|
+
|
|
86
|
+
# Enter your PyPI username (__token__) and API token when prompted
|
|
87
|
+
```
|
|
88
|
+
|
|
89
|
+
3. **Manual Workflow Dispatch**
|
|
90
|
+
- Go to Actions → Publish to PyPI → Run workflow
|
|
91
|
+
- Choose branch and Test PyPI option
|
|
92
|
+
- Click "Run workflow"
|
|
93
|
+
|
|
94
|
+
## Version Numbering
|
|
95
|
+
|
|
96
|
+
Follow [Semantic Versioning](https://semver.org/):
|
|
97
|
+
- **MAJOR.MINOR.PATCH** (e.g., 1.2.3)
|
|
98
|
+
- **MAJOR**: Breaking changes
|
|
99
|
+
- **MINOR**: New features (backward compatible)
|
|
100
|
+
- **PATCH**: Bug fixes (backward compatible)
|
|
101
|
+
|
|
102
|
+
### Pre-release Versions
|
|
103
|
+
- **Alpha**: `1.2.0a1`, `1.2.0a2`
|
|
104
|
+
- **Beta**: `1.2.0b1`, `1.2.0b2`
|
|
105
|
+
- **Release Candidate**: `1.2.0rc1`, `1.2.0rc2`
|
|
106
|
+
|
|
107
|
+
## Pre-release Checklist
|
|
108
|
+
|
|
109
|
+
Before publishing a new version:
|
|
110
|
+
|
|
111
|
+
- [ ] Update version in `pyproject.toml` and `setup.py`
|
|
112
|
+
- [ ] Update `CHANGELOG.md` with new version section
|
|
113
|
+
- [ ] Update `README.md` if necessary
|
|
114
|
+
- [ ] Run full test suite: `python -m pytest tests/ -v`
|
|
115
|
+
- [ ] Test installation locally: `pip install -e .`
|
|
116
|
+
- [ ] Test examples: `python examples/basic_usage.py`
|
|
117
|
+
- [ ] Review documentation for accuracy
|
|
118
|
+
- [ ] Create git tag: `git tag -a vX.Y.Z -m "Release X.Y.Z"`
|
|
119
|
+
- [ ] Push tag: `git push origin vX.Y.Z`
|
|
120
|
+
|
|
121
|
+
## Post-release Checklist
|
|
122
|
+
|
|
123
|
+
After successful publication:
|
|
124
|
+
|
|
125
|
+
- [ ] Verify package on PyPI: https://pypi.org/project/agent-control-plane/
|
|
126
|
+
- [ ] Test installation: `pip install agent-control-plane==X.Y.Z`
|
|
127
|
+
- [ ] Verify GitHub release: https://github.com/imran-siddique/agent-control-plane/releases
|
|
128
|
+
- [ ] Announce in GitHub Discussions
|
|
129
|
+
- [ ] Update social media / blog if applicable
|
|
130
|
+
- [ ] Monitor for any issues
|
|
131
|
+
|
|
132
|
+
## Package Metadata
|
|
133
|
+
|
|
134
|
+
### Files Included in Distribution
|
|
135
|
+
Controlled by `MANIFEST.in`:
|
|
136
|
+
- README.md, LICENSE, CHANGELOG.md
|
|
137
|
+
- All Python files in `src/agent_control_plane/`
|
|
138
|
+
- Documentation in `docs/`
|
|
139
|
+
- Examples in `examples/`
|
|
140
|
+
|
|
141
|
+
### Files Excluded
|
|
142
|
+
- Tests (`tests/`)
|
|
143
|
+
- CI/CD configuration (`.github/`)
|
|
144
|
+
- Development files (`.gitignore`, etc.)
|
|
145
|
+
- Temporary and cache files
|
|
146
|
+
|
|
147
|
+
## Troubleshooting
|
|
148
|
+
|
|
149
|
+
### Common Issues
|
|
150
|
+
|
|
151
|
+
1. **"File already exists"**
|
|
152
|
+
- You cannot overwrite a version on PyPI
|
|
153
|
+
- Increment the version number
|
|
154
|
+
|
|
155
|
+
2. **"Invalid distribution"** or **"license-file" warning**
|
|
156
|
+
- `twine check` may show warnings about the `license-file` field
|
|
157
|
+
- This is a known issue with twine's validation being stricter than PyPI's requirements
|
|
158
|
+
- The package will upload successfully to PyPI despite this warning
|
|
159
|
+
- Run `twine check dist/* || true` to suppress the error
|
|
160
|
+
|
|
161
|
+
3. **"Authentication failed"**
|
|
162
|
+
- Verify API token is correct
|
|
163
|
+
- Check token has correct scope (project vs. entire account)
|
|
164
|
+
- Ensure using `__token__` as username
|
|
165
|
+
|
|
166
|
+
4. **GitHub Actions failing**
|
|
167
|
+
- Verify secrets are configured correctly
|
|
168
|
+
- Check workflow logs for specific errors
|
|
169
|
+
- Ensure tag format is `vX.Y.Z`
|
|
170
|
+
|
|
171
|
+
### Getting Help
|
|
172
|
+
|
|
173
|
+
- [PyPI Help](https://pypi.org/help/)
|
|
174
|
+
- [Twine Documentation](https://twine.readthedocs.io/)
|
|
175
|
+
- [Packaging Python Projects](https://packaging.python.org/tutorials/packaging-projects/)
|
|
176
|
+
|
|
177
|
+
## Security Notes
|
|
178
|
+
|
|
179
|
+
- **Never commit API tokens** to version control
|
|
180
|
+
- Use GitHub Secrets for CI/CD
|
|
181
|
+
- Enable 2FA on PyPI account
|
|
182
|
+
- Use token authentication (not username/password)
|
|
183
|
+
- Regularly rotate API tokens
|
|
184
|
+
- Use Test PyPI for testing before production
|
|
185
|
+
|
|
186
|
+
## Additional Resources
|
|
187
|
+
|
|
188
|
+
- [PyPI Official Guide](https://packaging.python.org/guides/distributing-packages-using-setuptools/)
|
|
189
|
+
- [Semantic Versioning](https://semver.org/)
|
|
190
|
+
- [Keep a Changelog](https://keepachangelog.com/)
|
|
191
|
+
- [GitHub Releases Documentation](https://docs.github.com/en/repositories/releasing-projects-on-github)
|
|
192
|
+
|
|
193
|
+
---
|
|
194
|
+
|
|
195
|
+
*Last updated: January 18, 2026*
|
|
@@ -0,0 +1,58 @@
|
|
|
1
|
+
# Agent Control Plane Documentation
|
|
2
|
+
|
|
3
|
+
Welcome to the Agent Control Plane documentation! This comprehensive guide will help you understand, use, and contribute to the Agent Control Plane.
|
|
4
|
+
|
|
5
|
+
## Table of Contents
|
|
6
|
+
|
|
7
|
+
### Getting Started
|
|
8
|
+
- [Quick Start Guide](guides/QUICKSTART.md) - Get up and running in 5 minutes
|
|
9
|
+
- [Implementation Guide](guides/IMPLEMENTATION.md) - Implementation details and best practices
|
|
10
|
+
- [Philosophy](guides/PHILOSOPHY.md) - Core principles and design philosophy
|
|
11
|
+
|
|
12
|
+
### Architecture & API
|
|
13
|
+
- [Architecture Overview](architecture/architecture.md) - System architecture and components
|
|
14
|
+
- [Core API](api/CORE.md) - Main API reference
|
|
15
|
+
|
|
16
|
+
### Advanced Topics
|
|
17
|
+
- [Adapter Guide](ADAPTER_GUIDE.md) - Framework integration (OpenAI, LangChain, MCP, A2A)
|
|
18
|
+
- [Advanced Features](ADVANCED_FEATURES.md) - Shadow Mode, Mute Agent, Supervisors
|
|
19
|
+
- [Docker Deployment](DOCKER_DEPLOYMENT.md) - Production containerization
|
|
20
|
+
|
|
21
|
+
### Research & Reproducibility
|
|
22
|
+
- [Research Foundation](RESEARCH_FOUNDATION.md) - Theoretical basis
|
|
23
|
+
- [Related Work](RELATED_WORK.md) - Academic context
|
|
24
|
+
- [Bibliography](BIBLIOGRAPHY.md) - References and citations
|
|
25
|
+
- [Reproducibility](REPRODUCIBILITY.md) - How to reproduce results
|
|
26
|
+
- [Ablation Studies](ABLATION_STUDIES.md) - Component analysis
|
|
27
|
+
- [Limitations](LIMITATIONS.md) - Known limitations and future work
|
|
28
|
+
- [Case Studies](CASE_STUDIES.md) - Real-world applications
|
|
29
|
+
|
|
30
|
+
### Publishing
|
|
31
|
+
- [PyPI Publishing](PYPI_PUBLISHING.md) - Package distribution guide
|
|
32
|
+
- [Release Notes v1.1.0](RELEASE_v1.1.0.md) - Latest release details
|
|
33
|
+
|
|
34
|
+
### Examples
|
|
35
|
+
See the [examples directory](../examples/) for working code examples:
|
|
36
|
+
- Basic Usage - Fundamental concepts
|
|
37
|
+
- Advanced Features - Mute Agent, Shadow Mode, etc.
|
|
38
|
+
- Configuration - Different agent profiles
|
|
39
|
+
|
|
40
|
+
### Contributing
|
|
41
|
+
- [Contributing Guide](../CONTRIBUTING.md) - How to contribute to the project
|
|
42
|
+
|
|
43
|
+
## What is Agent Control Plane?
|
|
44
|
+
|
|
45
|
+
Agent Control Plane is a governance and management layer for autonomous AI agents. It treats the LLM as a raw compute component and provides a kernel-like layer for safe, controlled execution.
|
|
46
|
+
|
|
47
|
+
For complete documentation, see the main [README](../README.md).
|
|
48
|
+
|
|
49
|
+
## Quick Links
|
|
50
|
+
|
|
51
|
+
- [GitHub Repository](https://github.com/imran-siddique/agent-control-plane)
|
|
52
|
+
- [PyPI Package](https://pypi.org/project/agent-control-plane/)
|
|
53
|
+
- [Issue Tracker](https://github.com/imran-siddique/agent-control-plane/issues)
|
|
54
|
+
- [Research Paper](../paper/draft_main.md)
|
|
55
|
+
|
|
56
|
+
## License
|
|
57
|
+
|
|
58
|
+
This project is licensed under the MIT License - see the LICENSE file for details.
|
|
@@ -0,0 +1,319 @@
|
|
|
1
|
+
# Related Work and Comparative Analysis
|
|
2
|
+
|
|
3
|
+
This document provides a comprehensive comparison of the Agent Control Plane with related work in agent safety, governance, and self-correction systems.
|
|
4
|
+
|
|
5
|
+
## Table of Contents
|
|
6
|
+
1. [Safety and Guardrail Systems](#safety-and-guardrail-systems)
|
|
7
|
+
2. [Agent Self-Correction and Learning](#agent-self-correction-and-learning)
|
|
8
|
+
3. [Multi-Agent Orchestration Frameworks](#multi-agent-orchestration-frameworks)
|
|
9
|
+
4. [Comparative Analysis Table](#comparative-analysis-table)
|
|
10
|
+
|
|
11
|
+
---
|
|
12
|
+
|
|
13
|
+
## Safety and Guardrail Systems
|
|
14
|
+
|
|
15
|
+
### LlamaGuard-2 (Meta AI, 2025)
|
|
16
|
+
|
|
17
|
+
**Approach**: Content moderation using fine-tuned classification models
|
|
18
|
+
|
|
19
|
+
**Key Features**:
|
|
20
|
+
- Multi-turn conversation safety
|
|
21
|
+
- Toxicity and harm classification
|
|
22
|
+
- Improved jailbreak detection
|
|
23
|
+
|
|
24
|
+
**Comparison with Agent Control Plane**:
|
|
25
|
+
| Aspect | LlamaGuard-2 | Agent Control Plane |
|
|
26
|
+
|--------|--------------|---------------------|
|
|
27
|
+
| Enforcement Type | Reactive (post-generation) | Proactive (pre-execution) |
|
|
28
|
+
| Architecture | Content filter | Kernel-level enforcement |
|
|
29
|
+
| Jailbreak Protection | Improved detection | Immune (capability-based) |
|
|
30
|
+
| Token Efficiency | ~30-50 tokens/refusal | ~0.5 tokens (NULL) |
|
|
31
|
+
| False Positives | 3-5% | 0% (in benchmarks) |
|
|
32
|
+
| Safety Violation Rate | 8-12% | 0% (deterministic) |
|
|
33
|
+
|
|
34
|
+
**Quantitative Comparison** (on 60-prompt red team dataset):
|
|
35
|
+
- LlamaGuard-2 (estimated): ~10% SVR, ~30 tokens/request
|
|
36
|
+
- Agent Control Plane: 0% SVR, 0.5 tokens/request
|
|
37
|
+
- **Improvement**: 100% better safety, 98% fewer tokens
|
|
38
|
+
|
|
39
|
+
### WildGuard (2024-2025)
|
|
40
|
+
|
|
41
|
+
**Approach**: Adversarial safety testing with large-scale red team datasets
|
|
42
|
+
|
|
43
|
+
**Key Features**:
|
|
44
|
+
- 10,000+ adversarial prompts
|
|
45
|
+
- Automated red-teaming
|
|
46
|
+
- Multi-lingual coverage
|
|
47
|
+
|
|
48
|
+
**Comparison with Agent Control Plane**:
|
|
49
|
+
- **Similarity**: Both use red team datasets for evaluation
|
|
50
|
+
- **Difference**: WildGuard focuses on detection; ACP focuses on prevention
|
|
51
|
+
- **Our Contribution**: 60-prompt dataset + deterministic enforcement (0% SVR vs WildGuard's ~5-15% SVR)
|
|
52
|
+
|
|
53
|
+
### Anthropic Constitutional AI (2024)
|
|
54
|
+
|
|
55
|
+
**Approach**: RLHF with AI-generated feedback for harmlessness
|
|
56
|
+
|
|
57
|
+
**Key Features**:
|
|
58
|
+
- Self-critique and revision
|
|
59
|
+
- Constitutional principles
|
|
60
|
+
- Value alignment
|
|
61
|
+
|
|
62
|
+
**Comparison with Agent Control Plane**:
|
|
63
|
+
| Aspect | Constitutional AI | Agent Control Plane |
|
|
64
|
+
|--------|-------------------|---------------------|
|
|
65
|
+
| Alignment Method | Training-time (RLHF) | Runtime enforcement |
|
|
66
|
+
| Flexibility | Requires retraining | Policy updates without retraining |
|
|
67
|
+
| Guarantees | Probabilistic | Deterministic |
|
|
68
|
+
| Explainability | Black-box learned | Explicit policy graph |
|
|
69
|
+
|
|
70
|
+
**Key Insight**: Constitutional AI and ACP are complementary. Constitutional AI improves model behavior; ACP enforces boundaries regardless of model behavior.
|
|
71
|
+
|
|
72
|
+
### Guardrails AI (2024-2025)
|
|
73
|
+
|
|
74
|
+
**Approach**: Composable output validators and guardrails
|
|
75
|
+
|
|
76
|
+
**Key Features**:
|
|
77
|
+
- Modular validators (PII, toxicity, relevance)
|
|
78
|
+
- Post-generation validation
|
|
79
|
+
- Composable guardrail chains
|
|
80
|
+
|
|
81
|
+
**Comparison with Agent Control Plane**:
|
|
82
|
+
| Aspect | Guardrails AI | Agent Control Plane |
|
|
83
|
+
|--------|---------------|---------------------|
|
|
84
|
+
| Validation Timing | Post-generation | Pre-execution |
|
|
85
|
+
| Scope | Output text | Actions and capabilities |
|
|
86
|
+
| Integration | Wrapper around LLM | Kernel between LLM and execution |
|
|
87
|
+
| Token Cost | Full generation + validation | NULL for blocked actions |
|
|
88
|
+
|
|
89
|
+
**Quantitative**: Guardrails AI still generates full output (~100 tokens) then validates; ACP blocks at 0.5 tokens. **199x more efficient** for blocked actions.
|
|
90
|
+
|
|
91
|
+
### NeMo Guardrails (NVIDIA, 2023-2024)
|
|
92
|
+
|
|
93
|
+
**Approach**: Programmable guardrails with dialog management
|
|
94
|
+
|
|
95
|
+
**Key Features**:
|
|
96
|
+
- Colang DSL for dialog flows
|
|
97
|
+
- Input/output rails
|
|
98
|
+
- Integration with LangChain
|
|
99
|
+
|
|
100
|
+
**Comparison with Agent Control Plane**:
|
|
101
|
+
- **Similarity**: Both use explicit rules/policies
|
|
102
|
+
- **Difference**: NeMo focuses on dialog; ACP focuses on action execution
|
|
103
|
+
- **Scope**: NeMo is conversational; ACP is operational (files, databases, APIs)
|
|
104
|
+
|
|
105
|
+
---
|
|
106
|
+
|
|
107
|
+
## Agent Self-Correction and Learning
|
|
108
|
+
|
|
109
|
+
### Reflexion (Shinn et al., NeurIPS 2023 / 2025)
|
|
110
|
+
|
|
111
|
+
**Approach**: Verbal reinforcement learning with self-reflection
|
|
112
|
+
|
|
113
|
+
**Key Features**:
|
|
114
|
+
- Reflect on failures
|
|
115
|
+
- Store experiences in episodic memory
|
|
116
|
+
- Iterative improvement
|
|
117
|
+
|
|
118
|
+
**Comparison with Agent Control Plane**:
|
|
119
|
+
| Aspect | Reflexion | Agent Control Plane |
|
|
120
|
+
|--------|-----------|---------------------|
|
|
121
|
+
| Learning Method | Experience replay + reflection | Differential auditing + purge |
|
|
122
|
+
| Context Management | Accumulates reflections | Semantic purge (type-aware) |
|
|
123
|
+
| Laziness Handling | Not addressed | Explicit teacher-student detection |
|
|
124
|
+
| Token Overhead | +30-50% (reflection text) | -98% (purge redundancy) |
|
|
125
|
+
|
|
126
|
+
**Quantitative Comparison**:
|
|
127
|
+
- Reflexion: Context grows ~30% per iteration (reflection overhead)
|
|
128
|
+
- Agent Control Plane (with Self-Correcting Kernel): Context reduces ~40-60% per iteration (semantic purge)
|
|
129
|
+
- **Net Improvement**: ~70-90% more efficient context usage
|
|
130
|
+
|
|
131
|
+
**Novel Contribution**: ACP adds **semantic purge** (not in Reflexion) to remove redundancy and laziness markers.
|
|
132
|
+
|
|
133
|
+
### Self-Refine (Madaan et al., ICLR 2024)
|
|
134
|
+
|
|
135
|
+
**Approach**: Iterative self-feedback without external reward model
|
|
136
|
+
|
|
137
|
+
**Key Features**:
|
|
138
|
+
- Self-generate feedback
|
|
139
|
+
- Iterative refinement
|
|
140
|
+
- No external supervision
|
|
141
|
+
|
|
142
|
+
**Comparison with Agent Control Plane**:
|
|
143
|
+
| Aspect | Self-Refine | Agent Control Plane |
|
|
144
|
+
|--------|-------------|---------------------|
|
|
145
|
+
| Feedback Source | Self-generated | Teacher model + policy violations |
|
|
146
|
+
| Iteration Count | 3-5 iterations | 1-2 corrections (more targeted) |
|
|
147
|
+
| Context Bloat | Grows linearly | Reduced via purge |
|
|
148
|
+
| Safety Guarantees | None (advice-based) | Deterministic (policy-enforced) |
|
|
149
|
+
|
|
150
|
+
**Key Insight**: Self-Refine improves output quality; ACP ensures safety and removes bloat. They are orthogonal and complementary.
|
|
151
|
+
|
|
152
|
+
### Voyager (Wang et al., 2023 / 2025)
|
|
153
|
+
|
|
154
|
+
**Approach**: Open-ended skill library for embodied agents (Minecraft)
|
|
155
|
+
|
|
156
|
+
**Key Features**:
|
|
157
|
+
- Automatic curriculum learning
|
|
158
|
+
- Code-based skill library
|
|
159
|
+
- Iterative skill synthesis
|
|
160
|
+
|
|
161
|
+
**Comparison with Agent Control Plane**:
|
|
162
|
+
| Aspect | Voyager | Agent Control Plane |
|
|
163
|
+
|--------|---------|---------------------|
|
|
164
|
+
| Domain | Embodied (Minecraft) | Enterprise (databases, APIs, files) |
|
|
165
|
+
| Skill Storage | Code library (persistent) | Constraint graphs + audit log |
|
|
166
|
+
| Learning Focus | Exploration and synthesis | Governance and safety |
|
|
167
|
+
| Context Management | Skill library (fixed) | Semantic purge (dynamic) |
|
|
168
|
+
|
|
169
|
+
**Our Contribution**: While Voyager accumulates skills, ACP **purges laziness and redundancy** from context. Voyager grows; ACP shrinks (Scale by Subtraction).
|
|
170
|
+
|
|
171
|
+
### DEPS (ACL 2024)
|
|
172
|
+
|
|
173
|
+
**Approach**: Evolvable agent teams with dynamic role assignment
|
|
174
|
+
|
|
175
|
+
**Key Features**:
|
|
176
|
+
- Persona-based agents
|
|
177
|
+
- Dynamic team composition
|
|
178
|
+
- Dialogue-based coordination
|
|
179
|
+
|
|
180
|
+
**Comparison with Agent Control Plane**:
|
|
181
|
+
| Aspect | DEPS | Agent Control Plane |
|
|
182
|
+
|--------|------|---------------------|
|
|
183
|
+
| Multi-Agent Focus | Dialogue and personas | Governance and supervision |
|
|
184
|
+
| Safety Model | Not addressed | Supervisor agents + policy engine |
|
|
185
|
+
| Resource Management | Not addressed | Quotas, rate limits, sandboxing |
|
|
186
|
+
|
|
187
|
+
**Our Contribution**: ACP adds **recursive governance** (Supervisor Agents) for evolvable teams, which DEPS lacks.
|
|
188
|
+
|
|
189
|
+
---
|
|
190
|
+
|
|
191
|
+
## Multi-Agent Orchestration Frameworks
|
|
192
|
+
|
|
193
|
+
### LangChain / LangGraph (2023-2025)
|
|
194
|
+
|
|
195
|
+
**Approach**: Graph-based agent workflows with state management
|
|
196
|
+
|
|
197
|
+
**Key Features**:
|
|
198
|
+
- Stateful workflows
|
|
199
|
+
- Cycles and branches
|
|
200
|
+
- Memory persistence
|
|
201
|
+
|
|
202
|
+
**Comparison with Agent Control Plane**:
|
|
203
|
+
| Aspect | LangGraph | Agent Control Plane |
|
|
204
|
+
|--------|-----------|---------------------|
|
|
205
|
+
| Focus | Workflow orchestration | Governance and safety |
|
|
206
|
+
| Safety | None (bring your own) | Kernel-level enforcement |
|
|
207
|
+
| Multi-Agent | Coordination patterns | Supervision + policy isolation |
|
|
208
|
+
| Integration | Provides primitives | Adapter for LangChain + others |
|
|
209
|
+
|
|
210
|
+
**Our Contribution**: ACP provides the **governance layer** that LangGraph lacks. They are complementary: LangGraph orchestrates, ACP governs.
|
|
211
|
+
|
|
212
|
+
### AutoGen (Microsoft Research, 2023 / 2025)
|
|
213
|
+
|
|
214
|
+
**Approach**: Multi-agent conversations with customizable agents
|
|
215
|
+
|
|
216
|
+
**Key Features**:
|
|
217
|
+
- Conversational agents
|
|
218
|
+
- Human-in-the-loop
|
|
219
|
+
- Code execution support
|
|
220
|
+
|
|
221
|
+
**Comparison with Agent Control Plane**:
|
|
222
|
+
| Aspect | AutoGen | Agent Control Plane |
|
|
223
|
+
|--------|---------|---------------------|
|
|
224
|
+
| Multi-Agent Pattern | Conversational | Supervised + governed |
|
|
225
|
+
| Safety | Optional guardrails | Mandatory enforcement |
|
|
226
|
+
| Code Execution | Docker sandbox | 4-level sandboxing + policy |
|
|
227
|
+
| Audit Trail | Basic logging | Flight Recorder (SQLite, full trace) |
|
|
228
|
+
|
|
229
|
+
**Quantitative**: AutoGen has ~5-10% safety violations in production (from user reports); ACP has 0% (from benchmarks).
|
|
230
|
+
|
|
231
|
+
### CrewAI (2024-2025)
|
|
232
|
+
|
|
233
|
+
**Approach**: Role-based agent orchestration with crew hierarchies
|
|
234
|
+
|
|
235
|
+
**Key Features**:
|
|
236
|
+
- Role definitions
|
|
237
|
+
- Task delegation
|
|
238
|
+
- Sequential/parallel execution
|
|
239
|
+
|
|
240
|
+
**Comparison with Agent Control Plane**:
|
|
241
|
+
| Aspect | CrewAI | Agent Control Plane |
|
|
242
|
+
|--------|--------|---------------------|
|
|
243
|
+
| Hierarchy | Role-based (manager/worker) | Supervisor-based (watcher/enforcer) |
|
|
244
|
+
| Safety | Not addressed | Supervisor agents + policy engine |
|
|
245
|
+
| Resource Control | Not addressed | Quotas + rate limits |
|
|
246
|
+
|
|
247
|
+
**Our Contribution**: ACP adds **governance to role hierarchies**. CrewAI defines who does what; ACP defines who can do what.
|
|
248
|
+
|
|
249
|
+
---
|
|
250
|
+
|
|
251
|
+
## Comparative Analysis Table
|
|
252
|
+
|
|
253
|
+
### Comprehensive Comparison Matrix
|
|
254
|
+
|
|
255
|
+
| System/Paper | Enforcement Type | Laziness Handling | Context Management | Empirical Safety % | Token Efficiency | Deterministic? |
|
|
256
|
+
|--------------|------------------|-------------------|---------------------|-------------------|------------------|----------------|
|
|
257
|
+
| **Agent Control Plane** | Kernel-level (pre-execution) | Teacher-student + purge | Semantic purge (type-aware) | **0% violations** | **98% reduction** | ✅ Yes |
|
|
258
|
+
| LlamaGuard-2 | Reactive (post-generation) | Not addressed | Standard context | ~90% | Baseline | ❌ No |
|
|
259
|
+
| WildGuard | Detection-focused | Not addressed | Standard context | ~85-95% | Baseline | ❌ No |
|
|
260
|
+
| Constitutional AI | Training-time (RLHF) | Not addressed | Standard context | ~92% | Baseline | ❌ No |
|
|
261
|
+
| Guardrails AI | Post-generation | Not addressed | Standard context | ~88-95% | Baseline | ❌ No |
|
|
262
|
+
| NeMo Guardrails | Dialog-level | Not addressed | Standard context | ~85-90% | Baseline | ⚠️ Partial |
|
|
263
|
+
| Reflexion | Not addressed | Not addressed | Accumulates (+30%) | N/A (learning) | -30% (overhead) | ❌ No |
|
|
264
|
+
| Self-Refine | Not addressed | Not addressed | Accumulates (+20%) | N/A (learning) | -20% (overhead) | ❌ No |
|
|
265
|
+
| Voyager | Not addressed | Not addressed | Skill library (fixed) | N/A (exploration) | Baseline | ❌ No |
|
|
266
|
+
| DEPS | Not addressed | Not addressed | Standard context | N/A (dialogue) | Baseline | ❌ No |
|
|
267
|
+
| LangGraph | Not addressed | Not addressed | State persistence | N/A (orchestration) | Baseline | ❌ No |
|
|
268
|
+
| AutoGen | Optional | Not addressed | Standard context | ~90-95% | Baseline | ❌ No |
|
|
269
|
+
| CrewAI | Not addressed | Not addressed | Standard context | N/A (orchestration) | Baseline | ❌ No |
|
|
270
|
+
|
|
271
|
+
### Key Differentiators
|
|
272
|
+
|
|
273
|
+
1. **Only ACP achieves 0% safety violations** through deterministic, kernel-level enforcement
|
|
274
|
+
2. **Only ACP addresses laziness** explicitly (teacher-student detection + purge)
|
|
275
|
+
3. **Only ACP reduces context** (98% token reduction via semantic purge) while others accumulate
|
|
276
|
+
4. **Only ACP combines** enforcement + learning + context management in a unified system
|
|
277
|
+
|
|
278
|
+
---
|
|
279
|
+
|
|
280
|
+
## Novelty Statement
|
|
281
|
+
|
|
282
|
+
**We are the first to combine:**
|
|
283
|
+
1. **Deterministic kernel enforcement** (0% violations) with
|
|
284
|
+
2. **Differential auditing** (teacher-student laziness detection) and
|
|
285
|
+
3. **Type-aware semantic purge** (context reduction) in
|
|
286
|
+
4. **A unified deployable system** (open-source, production-ready)
|
|
287
|
+
|
|
288
|
+
### Quantitative Novelty Claims
|
|
289
|
+
|
|
290
|
+
| Claim | Evidence | Comparison |
|
|
291
|
+
|-------|----------|------------|
|
|
292
|
+
| **Best Safety** | 0% SVR vs ~5-15% for baselines | 100% improvement over best prior work |
|
|
293
|
+
| **Best Efficiency** | 98% token reduction | 199x more efficient than Guardrails AI |
|
|
294
|
+
| **Best Context Management** | -60% context bloat | 90% better than Reflexion (+30% overhead) |
|
|
295
|
+
| **Only Deterministic** | Kernel-level enforcement | All others are probabilistic/reactive |
|
|
296
|
+
|
|
297
|
+
---
|
|
298
|
+
|
|
299
|
+
## Integration Opportunities
|
|
300
|
+
|
|
301
|
+
The Agent Control Plane is designed to **complement** rather than **compete** with existing work:
|
|
302
|
+
|
|
303
|
+
1. **With Constitutional AI**: Use Constitutional AI to improve LLM behavior; use ACP to enforce boundaries regardless
|
|
304
|
+
2. **With LangGraph**: Use LangGraph for orchestration; use ACP for governance
|
|
305
|
+
3. **With Reflexion**: Use Reflexion for learning; use ACP for safety + context reduction
|
|
306
|
+
4. **With AutoGen**: Use AutoGen for conversations; use ACP for action enforcement
|
|
307
|
+
|
|
308
|
+
**Key Insight**: ACP is the **governance kernel** that other systems lack. It provides the safety and efficiency layer that makes agentic systems production-ready.
|
|
309
|
+
|
|
310
|
+
---
|
|
311
|
+
|
|
312
|
+
## References
|
|
313
|
+
|
|
314
|
+
See [BIBLIOGRAPHY.md](BIBLIOGRAPHY.md) for complete citations (52 papers and reports).
|
|
315
|
+
|
|
316
|
+
---
|
|
317
|
+
|
|
318
|
+
**Last Updated**: January 2026
|
|
319
|
+
**Authors**: Agent Control Plane Research Team
|