agent-os-kernel 1.1.0__py3-none-any.whl → 1.2.0__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- agent_os/__init__.py +66 -4
- agent_os/agents_compat.py +286 -0
- agent_os/base_agent.py +308 -0
- agent_os/cli.py +1079 -19
- agent_os/integrations/__init__.py +37 -2
- agent_os/integrations/openai_adapter.py +502 -0
- agent_os/integrations/semantic_kernel_adapter.py +569 -0
- agent_os/stateless.py +349 -0
- agent_os_kernel-1.2.0.dist-info/METADATA +676 -0
- agent_os_kernel-1.2.0.dist-info/RECORD +1053 -0
- {agent_os_kernel-1.1.0.dist-info → agent_os_kernel-1.2.0.dist-info}/entry_points.txt +0 -1
- modules/amb/.github/workflows/ci.yml +102 -0
- modules/amb/.github/workflows/publish.yml +146 -0
- modules/amb/.gitignore +134 -0
- modules/amb/CHANGELOG.md +118 -0
- modules/amb/CONTRIBUTING.md +141 -0
- modules/amb/LICENSE +21 -0
- modules/amb/README.md +188 -0
- modules/amb/amb_core/__init__.py +175 -0
- modules/amb/amb_core/adapters/__init__.py +55 -0
- modules/amb/amb_core/adapters/aws_sqs_broker.py +374 -0
- modules/amb/amb_core/adapters/azure_servicebus_broker.py +338 -0
- modules/amb/amb_core/adapters/kafka_broker.py +258 -0
- modules/amb/amb_core/adapters/nats_broker.py +283 -0
- modules/amb/amb_core/adapters/rabbitmq_broker.py +233 -0
- modules/amb/amb_core/adapters/redis_broker.py +260 -0
- modules/amb/amb_core/broker.py +143 -0
- modules/amb/amb_core/bus.py +479 -0
- modules/amb/amb_core/cloudevents.py +507 -0
- modules/amb/amb_core/dlq.py +343 -0
- modules/amb/amb_core/hf_utils.py +534 -0
- modules/amb/amb_core/memory_broker.py +408 -0
- modules/amb/amb_core/models.py +139 -0
- modules/amb/amb_core/persistence.py +527 -0
- modules/amb/amb_core/schema.py +292 -0
- modules/amb/amb_core/tracing.py +356 -0
- modules/amb/examples/advanced_features.py +223 -0
- modules/amb/examples/backpressure_demo.py +225 -0
- modules/amb/examples/basic_usage.py +117 -0
- modules/amb/examples/tracing_demo.py +104 -0
- modules/amb/experiments/README.md +52 -0
- modules/amb/experiments/reproduce_results.py +467 -0
- modules/amb/experiments/results.json +324 -0
- modules/amb/paper/README.md +40 -0
- modules/amb/paper/paper.tex +365 -0
- modules/amb/paper/whitepaper.md +377 -0
- modules/amb/pyproject.toml +117 -0
- modules/amb/tests/__init__.py +1 -0
- modules/amb/tests/test_backpressure_priority.py +280 -0
- modules/amb/tests/test_bus.py +198 -0
- modules/amb/tests/test_cloudevents.py +443 -0
- modules/amb/tests/test_features.py +531 -0
- modules/amb/tests/test_models.py +74 -0
- modules/amb/tests/test_tracing.py +254 -0
- modules/atr/.github/workflows/ci.yml +101 -0
- modules/atr/.github/workflows/publish.yml +140 -0
- modules/atr/.gitignore +134 -0
- modules/atr/.pre-commit-config.yaml +37 -0
- modules/atr/CHANGELOG.md +39 -0
- modules/atr/CONTRIBUTING.md +96 -0
- modules/atr/IMPLEMENTATION_SUMMARY.md +143 -0
- modules/atr/README.md +180 -0
- modules/atr/atr/__init__.py +638 -0
- modules/atr/atr/access.py +346 -0
- modules/atr/atr/composition.py +643 -0
- modules/atr/atr/decorator.py +355 -0
- modules/atr/atr/executor.py +382 -0
- modules/atr/atr/health.py +555 -0
- modules/atr/atr/hf_utils.py +447 -0
- modules/atr/atr/injection.py +420 -0
- modules/atr/atr/metrics.py +438 -0
- modules/atr/atr/policies.py +401 -0
- modules/atr/atr/py.typed +2 -0
- modules/atr/atr/registry.py +450 -0
- modules/atr/atr/schema.py +478 -0
- modules/atr/atr/tools/safe/__init__.py +73 -0
- modules/atr/atr/tools/safe/calculator.py +380 -0
- modules/atr/atr/tools/safe/datetime_tool.py +441 -0
- modules/atr/atr/tools/safe/file_reader.py +400 -0
- modules/atr/atr/tools/safe/http_client.py +314 -0
- modules/atr/atr/tools/safe/json_parser.py +372 -0
- modules/atr/atr/tools/safe/text_tool.py +526 -0
- modules/atr/atr/tools/safe/toolkit.py +173 -0
- modules/atr/docs/PYPI_SETUP.md +113 -0
- modules/atr/examples/README.md +27 -0
- modules/atr/examples/demo.py +144 -0
- modules/atr/examples/sandbox_demo.py +218 -0
- modules/atr/experiments/README.md +69 -0
- modules/atr/experiments/reproduce_results.py +509 -0
- modules/atr/experiments/results/.gitkeep +0 -0
- modules/atr/experiments/results/results_20260123_140334.json +71 -0
- modules/atr/paper/README.md +36 -0
- modules/atr/paper/figures/.gitkeep +0 -0
- modules/atr/paper/references.bib +84 -0
- modules/atr/paper/structure.tex +293 -0
- modules/atr/paper/whitepaper.md +234 -0
- modules/atr/pyproject.toml +148 -0
- modules/atr/requirements.txt +1 -0
- modules/atr/setup.py +30 -0
- modules/atr/tests/__init__.py +1 -0
- modules/atr/tests/test_decorator.py +317 -0
- modules/atr/tests/test_executor.py +245 -0
- modules/atr/tests/test_integration_executor.py +184 -0
- modules/atr/tests/test_registry.py +312 -0
- modules/atr/tests/test_schema.py +182 -0
- modules/atr/tests/test_v2_features.py +708 -0
- modules/caas/.dockerignore +63 -0
- modules/caas/.github/ISSUE_TEMPLATE/bug_report.md +38 -0
- modules/caas/.github/ISSUE_TEMPLATE/custom.md +10 -0
- modules/caas/.github/ISSUE_TEMPLATE/feature_request.md +20 -0
- modules/caas/.github/workflows/ci.yml +100 -0
- modules/caas/.github/workflows/lint.yml +39 -0
- modules/caas/.github/workflows/publish-pypi.yml +124 -0
- modules/caas/.gitignore +73 -0
- modules/caas/.pre-commit-config.yaml +33 -0
- modules/caas/CHANGELOG.md +58 -0
- modules/caas/CONTRIBUTING.md +346 -0
- modules/caas/Dockerfile +41 -0
- modules/caas/LICENSE +21 -0
- modules/caas/MANIFEST.in +11 -0
- modules/caas/README.md +158 -0
- modules/caas/benchmarks/README.md +255 -0
- modules/caas/benchmarks/create_hf_dataset.py +502 -0
- modules/caas/benchmarks/data/sample_corpus/README.md +86 -0
- modules/caas/benchmarks/data/sample_corpus/auth_module.py +211 -0
- modules/caas/benchmarks/data/sample_corpus/contribution_guide.md +185 -0
- modules/caas/benchmarks/data/sample_corpus/remote_work_policy.html +57 -0
- modules/caas/benchmarks/hf_dataset/README.md +214 -0
- modules/caas/benchmarks/hf_dataset/caas_benchmark_corpus.py +73 -0
- modules/caas/benchmarks/hf_dataset/corpus_preview.json +193 -0
- modules/caas/benchmarks/results/README.md +66 -0
- modules/caas/benchmarks/results/evaluation_2026-01-20.json +121 -0
- modules/caas/benchmarks/run_evaluation.py +561 -0
- modules/caas/benchmarks/statistical_tests.py +289 -0
- modules/caas/benchmarks/verify_sample_corpus.py +83 -0
- modules/caas/docker-compose.yml +38 -0
- modules/caas/docs/CONTEXT_TRIAD.md +462 -0
- modules/caas/docs/CONTRIBUTING.md +346 -0
- modules/caas/docs/ETHICS_AND_LIMITATIONS.md +336 -0
- modules/caas/docs/HEURISTIC_ROUTER.md +442 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY.md +363 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY_CONTEXT_TRIAD.md +277 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY_HEURISTIC_ROUTER.md +231 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY_METADATA_INJECTION.md +258 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY_PRAGMATIC_TRUTH.md +212 -0
- modules/caas/docs/IMPLEMENTATION_SUMMARY_TRUST_GATEWAY.md +319 -0
- modules/caas/docs/LAYER_1_PRIMITIVE.md +202 -0
- modules/caas/docs/METADATA_INJECTION.md +404 -0
- modules/caas/docs/PRAGMATIC_TRUTH.md +431 -0
- modules/caas/docs/RELATED_WORK.md +312 -0
- modules/caas/docs/RELEASE_CHECKLIST.md +219 -0
- modules/caas/docs/RELEASE_GUIDE.md +285 -0
- modules/caas/docs/REPRODUCIBILITY.md +386 -0
- modules/caas/docs/SLIDING_WINDOW.md +387 -0
- modules/caas/docs/STRUCTURE_AWARE_INDEXING.md +158 -0
- modules/caas/docs/TESTING.md +259 -0
- modules/caas/docs/THREAT_MODEL.md +247 -0
- modules/caas/docs/TRUST_GATEWAY.md +575 -0
- modules/caas/docs/VFS.md +298 -0
- modules/caas/examples/agents/enterprise_security_agent.py +414 -0
- modules/caas/examples/agents/intelligent_document_analyzer.py +380 -0
- modules/caas/examples/demos/demo.py +309 -0
- modules/caas/examples/demos/demo_context_triad.py +225 -0
- modules/caas/examples/demos/demo_conversation_manager.py +285 -0
- modules/caas/examples/demos/demo_heuristic_router.py +133 -0
- modules/caas/examples/demos/demo_metadata_injection.py +198 -0
- modules/caas/examples/demos/demo_pragmatic_truth.py +303 -0
- modules/caas/examples/demos/demo_structure_aware.py +140 -0
- modules/caas/examples/demos/demo_time_decay.py +247 -0
- modules/caas/examples/demos/demo_trust_gateway.py +383 -0
- modules/caas/examples/multi_agent/README.md +159 -0
- modules/caas/examples/multi_agent/research_team.py +369 -0
- modules/caas/examples/multi_agent/vfs_collaboration.py +393 -0
- modules/caas/examples/usage/auth_module.py +142 -0
- modules/caas/examples/usage/usage_example.py +173 -0
- modules/caas/experiments/README.md +42 -0
- modules/caas/experiments/reproduce_results.py +462 -0
- modules/caas/paper/ARXIV_METADATA.md +145 -0
- modules/caas/paper/ARXIV_README.md +47 -0
- modules/caas/paper/CHECKLIST.md +103 -0
- modules/caas/paper/GITHUB_RELEASE_NOTES.md +105 -0
- modules/caas/paper/README.md +71 -0
- modules/caas/paper/abstract.md +24 -0
- modules/caas/paper/arxiv_submission.tar +0 -0
- modules/caas/paper/arxiv_submission.zip +0 -0
- modules/caas/paper/build_pdf.py +355 -0
- modules/caas/paper/experiments.md +149 -0
- modules/caas/paper/figures/.gitkeep +0 -0
- modules/caas/paper/figures/README.md +237 -0
- modules/caas/paper/figures/fig1_system_architecture.png +0 -0
- modules/caas/paper/figures/fig1_system_architecture.svg +198 -0
- modules/caas/paper/figures/fig2_context_triad.png +0 -0
- modules/caas/paper/figures/fig2_context_triad.svg +105 -0
- modules/caas/paper/figures/fig3_ablation_results.png +0 -0
- modules/caas/paper/figures/fig3_ablation_results.svg +113 -0
- modules/caas/paper/figures/fig4_routing_latency.png +0 -0
- modules/caas/paper/figures/fig4_routing_latency.svg +97 -0
- modules/caas/paper/intro.md +103 -0
- modules/caas/paper/latex/figures/fig1_system_architecture.png +0 -0
- modules/caas/paper/latex/figures/fig2_context_triad.png +0 -0
- modules/caas/paper/latex/figures/fig3_ablation_results.png +0 -0
- modules/caas/paper/latex/figures/fig4_routing_latency.png +0 -0
- modules/caas/paper/latex/main.tex +468 -0
- modules/caas/paper/latex/references.bib +140 -0
- modules/caas/paper/method.md +350 -0
- modules/caas/paper/outline.md +123 -0
- modules/caas/paper/related_work.md +101 -0
- modules/caas/paper/tables/.gitkeep +0 -0
- modules/caas/paper/tables/results_tables.md +50 -0
- modules/caas/pyproject.toml +172 -0
- modules/caas/requirements.txt +11 -0
- modules/caas/src/caas/__init__.py +232 -0
- modules/caas/src/caas/api/__init__.py +7 -0
- modules/caas/src/caas/api/server.py +1326 -0
- modules/caas/src/caas/caching.py +832 -0
- modules/caas/src/caas/cli.py +208 -0
- modules/caas/src/caas/conversation.py +221 -0
- modules/caas/src/caas/decay.py +118 -0
- modules/caas/src/caas/detection/__init__.py +7 -0
- modules/caas/src/caas/detection/detector.py +236 -0
- modules/caas/src/caas/enrichment.py +127 -0
- modules/caas/src/caas/gateway/__init__.py +24 -0
- modules/caas/src/caas/gateway/trust_gateway.py +471 -0
- modules/caas/src/caas/hf_utils.py +477 -0
- modules/caas/src/caas/ingestion/__init__.py +21 -0
- modules/caas/src/caas/ingestion/processors.py +251 -0
- modules/caas/src/caas/ingestion/structure_parser.py +185 -0
- modules/caas/src/caas/models.py +354 -0
- modules/caas/src/caas/pragmatic_truth.py +441 -0
- modules/caas/src/caas/routing/__init__.py +8 -0
- modules/caas/src/caas/routing/heuristic_router.py +242 -0
- modules/caas/src/caas/storage/__init__.py +7 -0
- modules/caas/src/caas/storage/store.py +450 -0
- modules/caas/src/caas/triad.py +472 -0
- modules/caas/src/caas/tuning/__init__.py +7 -0
- modules/caas/src/caas/tuning/tuner.py +322 -0
- modules/caas/src/caas/vfs/__init__.py +12 -0
- modules/caas/src/caas/vfs/filesystem.py +450 -0
- modules/caas/tests/__init__.py +3 -0
- modules/caas/tests/conftest.py +8 -0
- modules/caas/tests/test_caching.py +628 -0
- modules/caas/tests/test_context_triad.py +385 -0
- modules/caas/tests/test_conversation_manager.py +289 -0
- modules/caas/tests/test_functionality.py +215 -0
- modules/caas/tests/test_heuristic_router.py +370 -0
- modules/caas/tests/test_metadata_injection.py +328 -0
- modules/caas/tests/test_pragmatic_truth.py +322 -0
- modules/caas/tests/test_structure_aware_indexing.py +283 -0
- modules/caas/tests/test_time_decay.py +268 -0
- modules/caas/tests/test_trust_gateway.py +445 -0
- modules/caas/tests/test_vfs.py +298 -0
- modules/cmvk/.github/FUNDING.yml +9 -0
- modules/cmvk/.github/dependabot.yml +54 -0
- modules/cmvk/.github/workflows/ci.yml +205 -0
- modules/cmvk/.github/workflows/publish.yml +143 -0
- modules/cmvk/.gitignore +147 -0
- modules/cmvk/.pre-commit-config.yaml +58 -0
- modules/cmvk/CHANGELOG.md +146 -0
- modules/cmvk/CITATION.cff +48 -0
- modules/cmvk/CONTRIBUTING.md +229 -0
- modules/cmvk/Dockerfile +87 -0
- modules/cmvk/HF_MODEL_CARD.md +185 -0
- modules/cmvk/LICENSE +21 -0
- modules/cmvk/README.md +149 -0
- modules/cmvk/SECURITY.md +114 -0
- modules/cmvk/config/prompts/generator_v1.txt +23 -0
- modules/cmvk/config/prompts/verifier_hostile.txt +32 -0
- modules/cmvk/config/settings.yaml +40 -0
- modules/cmvk/coverage_html/.gitignore +2 -0
- modules/cmvk/coverage_html/class_index.html +658 -0
- modules/cmvk/coverage_html/coverage_html_cb_188fc9a4.js +735 -0
- modules/cmvk/coverage_html/favicon_32_cb_c827f16f.png +0 -0
- modules/cmvk/coverage_html/function_index.html +1978 -0
- modules/cmvk/coverage_html/index.html +255 -0
- modules/cmvk/coverage_html/keybd_closed_cb_900cfef5.png +0 -0
- modules/cmvk/coverage_html/status.json +1 -0
- modules/cmvk/coverage_html/style_cb_5c747636.css +389 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38___init___py.html +315 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_audit_py.html +499 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_benchmarks_py.html +575 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_constitutional_py.html +1001 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_hf_utils_py.html +398 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_metrics_py.html +570 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_profiles_py.html +397 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_types_py.html +109 -0
- modules/cmvk/coverage_html/z_2c49bd2ed3e01e38_verification_py.html +1053 -0
- modules/cmvk/docs/DIAGRAMS.md +325 -0
- modules/cmvk/docs/architecture.md +345 -0
- modules/cmvk/docs/features.md +308 -0
- modules/cmvk/docs/getting_started.md +279 -0
- modules/cmvk/docs/innovation_layer.md +377 -0
- modules/cmvk/docs/safety.md +281 -0
- modules/cmvk/docs/traceability.md +150 -0
- modules/cmvk/examples/basic_example.py +62 -0
- modules/cmvk/examples/demo_complete_pipeline.py +209 -0
- modules/cmvk/examples/demo_innovation_layer.py +197 -0
- modules/cmvk/examples/example.py +112 -0
- modules/cmvk/examples/model_diversity_comparison.py +110 -0
- modules/cmvk/examples/real_api_integration.py +121 -0
- modules/cmvk/examples/test_full_pipeline.py +303 -0
- modules/cmvk/experiments/FEATURE_2_LATERAL_THINKING.md +187 -0
- modules/cmvk/experiments/README.md +216 -0
- modules/cmvk/experiments/ablation_runner.py +666 -0
- modules/cmvk/experiments/baseline_runner.py +158 -0
- modules/cmvk/experiments/blind_spot_benchmark.py +364 -0
- modules/cmvk/experiments/datasets/README.md +85 -0
- modules/cmvk/experiments/datasets/humaneval_50.json +352 -0
- modules/cmvk/experiments/datasets/humaneval_full.json +1150 -0
- modules/cmvk/experiments/datasets/humaneval_sample.json +32 -0
- modules/cmvk/experiments/datasets/sabotage.json +262 -0
- modules/cmvk/experiments/datasets/sample.json +40 -0
- modules/cmvk/experiments/demo_with_traces.py +110 -0
- modules/cmvk/experiments/efficiency_curve.py +259 -0
- modules/cmvk/experiments/experiment_runner.py +243 -0
- modules/cmvk/experiments/paper_data_generator.py +183 -0
- modules/cmvk/experiments/reproduce_results.py +407 -0
- modules/cmvk/experiments/reproducible_runner.py +352 -0
- modules/cmvk/experiments/sabotage_stress_test.py +311 -0
- modules/cmvk/experiments/test_lateral_thinking.py +116 -0
- modules/cmvk/experiments/test_prosecutor.py +41 -0
- modules/cmvk/experiments/visualize_results.py +735 -0
- modules/cmvk/logs/traces/demo_HumanEval_0_20260121-204900.json +36 -0
- modules/cmvk/notebooks/analysis.ipynb +124 -0
- modules/cmvk/paper/PAPER.md +561 -0
- modules/cmvk/paper/arxiv_checklist.md +230 -0
- modules/cmvk/paper/cmvk_neurips.aux +77 -0
- modules/cmvk/paper/cmvk_neurips.bbl +81 -0
- modules/cmvk/paper/cmvk_neurips.blg +48 -0
- modules/cmvk/paper/cmvk_neurips.out +16 -0
- modules/cmvk/paper/cmvk_neurips.pdf +0 -0
- modules/cmvk/paper/cmvk_neurips.tex +309 -0
- modules/cmvk/paper/figures/ablation.png +0 -0
- modules/cmvk/paper/figures/ablation.svg +39 -0
- modules/cmvk/paper/figures/architecture.png +0 -0
- modules/cmvk/paper/figures/architecture.svg +115 -0
- modules/cmvk/paper/figures/results_bar.png +0 -0
- modules/cmvk/paper/figures/results_bar.svg +70 -0
- modules/cmvk/paper/generate_figures.py +383 -0
- modules/cmvk/paper/neurips_2024.sty +101 -0
- modules/cmvk/paper/references.bib +98 -0
- modules/cmvk/paper/structure.tex +200 -0
- modules/cmvk/pyproject.toml +189 -0
- modules/cmvk/requirements-dev.txt +19 -0
- modules/cmvk/requirements.txt +14 -0
- modules/cmvk/src/cmvk/__init__.py +216 -0
- modules/cmvk/src/cmvk/audit.py +400 -0
- modules/cmvk/src/cmvk/benchmarks.py +476 -0
- modules/cmvk/src/cmvk/constitutional.py +902 -0
- modules/cmvk/src/cmvk/hf_utils.py +299 -0
- modules/cmvk/src/cmvk/metrics.py +471 -0
- modules/cmvk/src/cmvk/profiles.py +298 -0
- modules/cmvk/src/cmvk/py.typed +0 -0
- modules/cmvk/src/cmvk/types.py +10 -0
- modules/cmvk/src/cmvk/verification.py +954 -0
- modules/cmvk/src/cross_model_verification_kernel/__init__.py +91 -0
- modules/cmvk/src/cross_model_verification_kernel/__main__.py +10 -0
- modules/cmvk/src/cross_model_verification_kernel/agents/__init__.py +16 -0
- modules/cmvk/src/cross_model_verification_kernel/agents/base_agent.py +142 -0
- modules/cmvk/src/cross_model_verification_kernel/agents/generator_openai.py +223 -0
- modules/cmvk/src/cross_model_verification_kernel/agents/verifier_anthropic.py +448 -0
- modules/cmvk/src/cross_model_verification_kernel/agents/verifier_gemini.py +481 -0
- modules/cmvk/src/cross_model_verification_kernel/cli.py +570 -0
- modules/cmvk/src/cross_model_verification_kernel/core/__init__.py +26 -0
- modules/cmvk/src/cross_model_verification_kernel/core/graph_memory.py +308 -0
- modules/cmvk/src/cross_model_verification_kernel/core/kernel.py +413 -0
- modules/cmvk/src/cross_model_verification_kernel/core/trace_logger.py +75 -0
- modules/cmvk/src/cross_model_verification_kernel/core/types.py +121 -0
- modules/cmvk/src/cross_model_verification_kernel/datasets/__init__.py +20 -0
- modules/cmvk/src/cross_model_verification_kernel/datasets/humaneval_loader.py +271 -0
- modules/cmvk/src/cross_model_verification_kernel/generator.py +118 -0
- modules/cmvk/src/cross_model_verification_kernel/kernel.py +292 -0
- modules/cmvk/src/cross_model_verification_kernel/models.py +111 -0
- modules/cmvk/src/cross_model_verification_kernel/py.typed +1 -0
- modules/cmvk/src/cross_model_verification_kernel/simple_kernel.py +185 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/__init__.py +94 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/huggingface_upload.py +394 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/sandbox.py +159 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/statistics.py +468 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/visualizer.py +312 -0
- modules/cmvk/src/cross_model_verification_kernel/tools/web_search.py +86 -0
- modules/cmvk/src/cross_model_verification_kernel/verifier.py +257 -0
- modules/cmvk/tests/__init__.py +3 -0
- modules/cmvk/tests/conftest.py +61 -0
- modules/cmvk/tests/integration/__init__.py +1 -0
- modules/cmvk/tests/integration/test_anthropic_verifier.py +269 -0
- modules/cmvk/tests/integration/test_integration.py +53 -0
- modules/cmvk/tests/integration/test_lateral_thinking_integration.py +199 -0
- modules/cmvk/tests/integration/test_lateral_thinking_witness.py +208 -0
- modules/cmvk/tests/integration/test_prosecutor_mode.py +131 -0
- modules/cmvk/tests/test_constitutional.py +611 -0
- modules/cmvk/tests/test_enhanced_features.py +603 -0
- modules/cmvk/tests/test_verification.py +255 -0
- modules/cmvk/tests/unit/__init__.py +1 -0
- modules/cmvk/tests/unit/test_agents.py +64 -0
- modules/cmvk/tests/unit/test_cli.py +224 -0
- modules/cmvk/tests/unit/test_core.py +126 -0
- modules/cmvk/tests/unit/test_humaneval_loader.py +197 -0
- modules/cmvk/tests/unit/test_kernel.py +255 -0
- modules/cmvk/tests/unit/test_reproducibility.py +160 -0
- modules/cmvk/tests/unit/test_trace_logger.py +115 -0
- modules/cmvk/tests/unit/test_visualizer.py +218 -0
- modules/control-plane/.github/ISSUE_TEMPLATE/bug_report.yml +82 -0
- modules/control-plane/.github/ISSUE_TEMPLATE/config.yml +11 -0
- modules/control-plane/.github/ISSUE_TEMPLATE/feature_request.yml +104 -0
- modules/control-plane/.github/ISSUE_TEMPLATE/question.yml +70 -0
- modules/control-plane/.github/ISSUE_TEMPLATE/security_vulnerability.yml +84 -0
- modules/control-plane/.github/discussions.yml +73 -0
- modules/control-plane/.github/pull_request_template.md +82 -0
- modules/control-plane/.github/workflows/publish.yml +146 -0
- modules/control-plane/.github/workflows/release.yml +39 -0
- modules/control-plane/.github/workflows/tests.yml +58 -0
- modules/control-plane/.gitignore +55 -0
- modules/control-plane/CHANGELOG.md +203 -0
- modules/control-plane/CONTRIBUTING.md +311 -0
- modules/control-plane/CONTRIBUTORS.md +88 -0
- modules/control-plane/Dockerfile +82 -0
- modules/control-plane/LICENSE +21 -0
- modules/control-plane/MANIFEST.in +17 -0
- modules/control-plane/README.md +1264 -0
- modules/control-plane/ROADMAP.md +228 -0
- modules/control-plane/SECURITY.md +210 -0
- modules/control-plane/SUPPORT.md +106 -0
- modules/control-plane/acp-cli.py +212 -0
- modules/control-plane/benchmark/README.md +257 -0
- modules/control-plane/benchmark/__init__.py +19 -0
- modules/control-plane/benchmark/red_team_dataset.py +517 -0
- modules/control-plane/benchmark.py +563 -0
- modules/control-plane/build_and_publish.sh +130 -0
- modules/control-plane/docker-compose.yml +74 -0
- modules/control-plane/docs/ABLATION_STUDIES.md +528 -0
- modules/control-plane/docs/ADAPTER_GUIDE.md +544 -0
- modules/control-plane/docs/ADVANCED_FEATURES.md +543 -0
- modules/control-plane/docs/AIOS_COMPARISON.md +296 -0
- modules/control-plane/docs/BIBLIOGRAPHY.md +367 -0
- modules/control-plane/docs/CASE_STUDIES.md +645 -0
- modules/control-plane/docs/DOCKER_DEPLOYMENT.md +184 -0
- modules/control-plane/docs/ECOSYSTEM_STATUS.md +98 -0
- modules/control-plane/docs/HF_MODEL_CARD.md +168 -0
- modules/control-plane/docs/KERNEL_V1_RELEASE.md +454 -0
- modules/control-plane/docs/LAYER3_FRAMEWORK.md +227 -0
- modules/control-plane/docs/LIMITATIONS.md +523 -0
- modules/control-plane/docs/PYPI_PUBLISHING.md +195 -0
- modules/control-plane/docs/README.md +58 -0
- modules/control-plane/docs/RELATED_WORK.md +319 -0
- modules/control-plane/docs/RELEASE_v1.1.0.md +252 -0
- modules/control-plane/docs/REPRODUCIBILITY.md +540 -0
- modules/control-plane/docs/RESEARCH_FOUNDATION.md +197 -0
- modules/control-plane/docs/api/CORE.md +270 -0
- modules/control-plane/docs/architecture/architecture.md +120 -0
- modules/control-plane/docs/community/ANNOUNCEMENT_TEMPLATES.md +52 -0
- modules/control-plane/docs/guides/IMPLEMENTATION.md +225 -0
- modules/control-plane/docs/guides/PHILOSOPHY.md +354 -0
- modules/control-plane/docs/guides/QUICKSTART.md +217 -0
- modules/control-plane/examples/README.md +138 -0
- modules/control-plane/examples/a2a_demo.py +410 -0
- modules/control-plane/examples/adapter_demo.py +347 -0
- modules/control-plane/examples/advanced_features.py +403 -0
- modules/control-plane/examples/basic_usage.py +261 -0
- modules/control-plane/examples/benchmark_demo.py +186 -0
- modules/control-plane/examples/compliance_demo.py +333 -0
- modules/control-plane/examples/configuration.py +265 -0
- modules/control-plane/examples/getting_started.py +178 -0
- modules/control-plane/examples/hibernation_and_time_travel_demo.py +406 -0
- modules/control-plane/examples/interactive_tutorial.ipynb +497 -0
- modules/control-plane/examples/kernel_interceptor_demo.py +202 -0
- modules/control-plane/examples/kernel_v1_demo.py +273 -0
- modules/control-plane/examples/langchain_demo.py +281 -0
- modules/control-plane/examples/lifecycle_demo.py +724 -0
- modules/control-plane/examples/mcp_demo.py +378 -0
- modules/control-plane/examples/ml_safety_demo.py +157 -0
- modules/control-plane/examples/multimodal_demo.py +347 -0
- modules/control-plane/examples/observability_demo.py +370 -0
- modules/control-plane/examples/use_cases.py +336 -0
- modules/control-plane/experiments/long_horizon_purge.py +235 -0
- modules/control-plane/experiments/multi_agent_rag.py +165 -0
- modules/control-plane/experiments/reproduce_results.py +667 -0
- modules/control-plane/paper/ARXIV_SUBMISSION_INFO.txt +122 -0
- modules/control-plane/paper/ETHICS_STATEMENT.md +248 -0
- modules/control-plane/paper/PAPER_CHECKLIST.md +72 -0
- modules/control-plane/paper/Paper.pdf +0 -0
- modules/control-plane/paper/README.md +71 -0
- modules/control-plane/paper/appendix.md +152 -0
- modules/control-plane/paper/architecture.md +15 -0
- modules/control-plane/paper/arxiv/figures/ablation_chart.png +0 -0
- modules/control-plane/paper/arxiv/figures/architecture.png +0 -0
- modules/control-plane/paper/arxiv/figures/constraint_graphs.png +0 -0
- modules/control-plane/paper/arxiv/figures/results_chart.png +0 -0
- modules/control-plane/paper/arxiv/main.aux +97 -0
- modules/control-plane/paper/arxiv/main.bbl +112 -0
- modules/control-plane/paper/arxiv/main.blg +48 -0
- modules/control-plane/paper/arxiv/main.out +33 -0
- modules/control-plane/paper/arxiv/main.pdf +0 -0
- modules/control-plane/paper/arxiv/main.tex +479 -0
- modules/control-plane/paper/arxiv/references.bib +234 -0
- modules/control-plane/paper/arxiv_submission.tar +0 -0
- modules/control-plane/paper/arxiv_submission.zip +0 -0
- modules/control-plane/paper/build.sh +68 -0
- modules/control-plane/paper/figures/README.md +47 -0
- modules/control-plane/paper/figures/ablation_chart.pdf +0 -0
- modules/control-plane/paper/figures/ablation_chart.png +0 -0
- modules/control-plane/paper/figures/architecture.pdf +0 -0
- modules/control-plane/paper/figures/architecture.png +0 -0
- modules/control-plane/paper/figures/constraint_graphs.pdf +0 -0
- modules/control-plane/paper/figures/constraint_graphs.png +0 -0
- modules/control-plane/paper/figures/generate_figures.py +252 -0
- modules/control-plane/paper/figures/results_chart.pdf +0 -0
- modules/control-plane/paper/figures/results_chart.png +0 -0
- modules/control-plane/paper/main.md +273 -0
- modules/control-plane/paper/main.tex +214 -0
- modules/control-plane/paper/main_arxiv.aux +53 -0
- modules/control-plane/paper/main_arxiv.out +17 -0
- modules/control-plane/paper/main_arxiv.pdf +0 -0
- modules/control-plane/paper/main_arxiv.tex +264 -0
- modules/control-plane/paper/references.bib +234 -0
- modules/control-plane/pyproject.toml +124 -0
- modules/control-plane/reproducibility/ABLATIONS.md +136 -0
- modules/control-plane/reproducibility/README.md +288 -0
- modules/control-plane/reproducibility/commands.md +467 -0
- modules/control-plane/reproducibility/docker_config/Dockerfile +39 -0
- modules/control-plane/reproducibility/experiment_configs/purge_config.json +46 -0
- modules/control-plane/reproducibility/experiment_configs/rag_config.json +36 -0
- modules/control-plane/reproducibility/hardware_specs.md +317 -0
- modules/control-plane/reproducibility/requirements_frozen.txt +0 -0
- modules/control-plane/reproducibility/run_all_experiments.sh +45 -0
- modules/control-plane/reproducibility/seeds.json +106 -0
- modules/control-plane/scripts/prepare_pypi.py +46 -0
- modules/control-plane/scripts/prepare_release.py +176 -0
- modules/control-plane/scripts/upload_dataset_to_hf.py +316 -0
- modules/control-plane/setup.py +69 -0
- modules/control-plane/src/agent_control_plane/__init__.py +639 -0
- modules/control-plane/src/agent_control_plane/a2a_adapter.py +541 -0
- modules/control-plane/src/agent_control_plane/adapter.py +415 -0
- modules/control-plane/src/agent_control_plane/agent_hibernation.py +364 -0
- modules/control-plane/src/agent_control_plane/agent_kernel.py +464 -0
- modules/control-plane/src/agent_control_plane/compliance.py +718 -0
- modules/control-plane/src/agent_control_plane/constraint_graphs.py +475 -0
- modules/control-plane/src/agent_control_plane/control_plane.py +848 -0
- modules/control-plane/src/agent_control_plane/example_executors.py +193 -0
- modules/control-plane/src/agent_control_plane/execution_engine.py +229 -0
- modules/control-plane/src/agent_control_plane/flight_recorder.py +600 -0
- modules/control-plane/src/agent_control_plane/governance_layer.py +432 -0
- modules/control-plane/src/agent_control_plane/hf_utils.py +561 -0
- modules/control-plane/src/agent_control_plane/interfaces/__init__.py +53 -0
- modules/control-plane/src/agent_control_plane/interfaces/kernel_interface.py +359 -0
- modules/control-plane/src/agent_control_plane/interfaces/plugin_interface.py +495 -0
- modules/control-plane/src/agent_control_plane/interfaces/protocol_interfaces.py +385 -0
- modules/control-plane/src/agent_control_plane/kernel_space.py +707 -0
- modules/control-plane/src/agent_control_plane/langchain_adapter.py +422 -0
- modules/control-plane/src/agent_control_plane/lifecycle.py +3111 -0
- modules/control-plane/src/agent_control_plane/mcp_adapter.py +517 -0
- modules/control-plane/src/agent_control_plane/ml_safety.py +560 -0
- modules/control-plane/src/agent_control_plane/multimodal.py +724 -0
- modules/control-plane/src/agent_control_plane/mute_agent.py +419 -0
- modules/control-plane/src/agent_control_plane/observability.py +785 -0
- modules/control-plane/src/agent_control_plane/orchestrator.py +480 -0
- modules/control-plane/src/agent_control_plane/plugin_registry.py +748 -0
- modules/control-plane/src/agent_control_plane/policy_engine.py +525 -0
- modules/control-plane/src/agent_control_plane/shadow_mode.py +307 -0
- modules/control-plane/src/agent_control_plane/signals.py +491 -0
- modules/control-plane/src/agent_control_plane/supervisor_agents.py +427 -0
- modules/control-plane/src/agent_control_plane/time_travel_debugger.py +554 -0
- modules/control-plane/src/agent_control_plane/tool_registry.py +350 -0
- modules/control-plane/src/agent_control_plane/vfs.py +695 -0
- modules/control-plane/tests/README.md +33 -0
- modules/control-plane/tests/test_a2a_adapter.py +336 -0
- modules/control-plane/tests/test_adapter.py +422 -0
- modules/control-plane/tests/test_advanced_features.py +389 -0
- modules/control-plane/tests/test_benchmark.py +223 -0
- modules/control-plane/tests/test_compliance.py +214 -0
- modules/control-plane/tests/test_control_plane.py +295 -0
- modules/control-plane/tests/test_hibernation.py +274 -0
- modules/control-plane/tests/test_kernel_interception.py +284 -0
- modules/control-plane/tests/test_langchain_adapter.py +258 -0
- modules/control-plane/tests/test_lifecycle.py +1174 -0
- modules/control-plane/tests/test_mcp_adapter.py +293 -0
- modules/control-plane/tests/test_ml_safety.py +142 -0
- modules/control-plane/tests/test_multimodal.py +317 -0
- modules/control-plane/tests/test_new_features.py +435 -0
- modules/control-plane/tests/test_observability.py +338 -0
- modules/control-plane/tests/test_time_travel.py +387 -0
- modules/emk/.github/workflows/ci.yml +105 -0
- modules/emk/.github/workflows/publish.yml +144 -0
- modules/emk/.gitignore +74 -0
- modules/emk/CHANGELOG.md +41 -0
- modules/emk/CONTRIBUTING.md +295 -0
- modules/emk/IMPLEMENTATION.md +174 -0
- modules/emk/LICENSE +21 -0
- modules/emk/MANIFEST.in +8 -0
- modules/emk/README.md +135 -0
- modules/emk/RELEASE_NOTES.md +82 -0
- modules/emk/SECURITY.md +52 -0
- modules/emk/codecov.yml +39 -0
- modules/emk/docs/MEMORY_MANAGEMENT.md +285 -0
- modules/emk/emk/__init__.py +106 -0
- modules/emk/emk/hf_utils.py +419 -0
- modules/emk/emk/indexer.py +144 -0
- modules/emk/emk/py.typed +0 -0
- modules/emk/emk/schema.py +204 -0
- modules/emk/emk/sleep_cycle.py +345 -0
- modules/emk/emk/store.py +479 -0
- modules/emk/examples/basic_usage.py +123 -0
- modules/emk/examples/memory_features_demo.py +154 -0
- modules/emk/experiments/README.md +59 -0
- modules/emk/experiments/reproduce_results.py +461 -0
- modules/emk/experiments/results.json +61 -0
- modules/emk/paper/structure.tex +192 -0
- modules/emk/paper/whitepaper.md +273 -0
- modules/emk/pyproject.toml +91 -0
- modules/emk/setup.py +5 -0
- modules/emk/tests/test_file_adapter.py +195 -0
- modules/emk/tests/test_indexer.py +174 -0
- modules/emk/tests/test_init.py +55 -0
- modules/emk/tests/test_negative_memory.py +83 -0
- modules/emk/tests/test_schema.py +150 -0
- modules/emk/tests/test_semantic_rules.py +175 -0
- modules/emk/tests/test_sleep_cycle.py +335 -0
- modules/emk/tests/test_store_anti_patterns.py +239 -0
- modules/iatp/.github/workflows/docker-build.yml +124 -0
- modules/iatp/.github/workflows/publish.yml +174 -0
- modules/iatp/.github/workflows/python-package.yml +121 -0
- modules/iatp/.gitignore +67 -0
- modules/iatp/.pre-commit-config.yaml +64 -0
- modules/iatp/CHANGELOG.md +120 -0
- modules/iatp/Dockerfile +91 -0
- modules/iatp/IMPLEMENTATION_SUMMARY.md +218 -0
- modules/iatp/MANIFEST.in +9 -0
- modules/iatp/README.md +180 -0
- modules/iatp/docker/Dockerfile.agent +27 -0
- modules/iatp/docker/Dockerfile.sidecar-python +86 -0
- modules/iatp/docker/README.md +258 -0
- modules/iatp/docker-compose.yml +194 -0
- modules/iatp/docs/ARCHITECTURE.md +243 -0
- modules/iatp/docs/CLI_GUIDE.md +220 -0
- modules/iatp/docs/DEPLOYMENT.md +304 -0
- modules/iatp/examples/README.md +132 -0
- modules/iatp/examples/backend_agent.py +39 -0
- modules/iatp/examples/client.py +168 -0
- modules/iatp/examples/demo_attestation_reputation.py +274 -0
- modules/iatp/examples/demo_client.py +240 -0
- modules/iatp/examples/demo_rbac.py +143 -0
- modules/iatp/examples/integration_demo.py +245 -0
- modules/iatp/examples/manifests/coder_agent.json +20 -0
- modules/iatp/examples/manifests/reviewer_agent.json +19 -0
- modules/iatp/examples/manifests/secure_bank.json +14 -0
- modules/iatp/examples/manifests/standard_agent.json +14 -0
- modules/iatp/examples/manifests/untrusted_honeypot.json +14 -0
- modules/iatp/examples/run_secure_bank_sidecar.py +85 -0
- modules/iatp/examples/run_sidecar.py +105 -0
- modules/iatp/examples/run_untrusted_sidecar.py +77 -0
- modules/iatp/examples/secure_bank_agent.py +138 -0
- modules/iatp/examples/test_untrusted.py +82 -0
- modules/iatp/examples/untrusted_agent.py +119 -0
- modules/iatp/experiments/README.md +58 -0
- modules/iatp/experiments/cascading_hallucination/README.md +149 -0
- modules/iatp/experiments/cascading_hallucination/agent_a_user.py +41 -0
- modules/iatp/experiments/cascading_hallucination/agent_b_summarizer.py +54 -0
- modules/iatp/experiments/cascading_hallucination/agent_c_database.py +47 -0
- modules/iatp/experiments/cascading_hallucination/proof_of_concept.py +290 -0
- modules/iatp/experiments/cascading_hallucination/run_experiment.py +226 -0
- modules/iatp/experiments/cascading_hallucination/sidecar_c.py +61 -0
- modules/iatp/experiments/reproduce_results.py +574 -0
- modules/iatp/experiments/results.json +2336 -0
- modules/iatp/iatp/__init__.py +164 -0
- modules/iatp/iatp/attestation.py +401 -0
- modules/iatp/iatp/cli.py +253 -0
- modules/iatp/iatp/hf_utils.py +469 -0
- modules/iatp/iatp/ipc_pipes.py +578 -0
- modules/iatp/iatp/main.py +410 -0
- modules/iatp/iatp/models/__init__.py +445 -0
- modules/iatp/iatp/policy_engine.py +335 -0
- modules/iatp/iatp/py.typed +2 -0
- modules/iatp/iatp/recovery.py +319 -0
- modules/iatp/iatp/security/__init__.py +268 -0
- modules/iatp/iatp/sidecar/__init__.py +517 -0
- modules/iatp/iatp/telemetry/__init__.py +162 -0
- modules/iatp/iatp/tests/__init__.py +1 -0
- modules/iatp/iatp/tests/test_attestation.py +368 -0
- modules/iatp/iatp/tests/test_cli.py +129 -0
- modules/iatp/iatp/tests/test_models.py +128 -0
- modules/iatp/iatp/tests/test_policy_engine.py +345 -0
- modules/iatp/iatp/tests/test_recovery.py +279 -0
- modules/iatp/iatp/tests/test_security.py +220 -0
- modules/iatp/iatp/tests/test_sidecar.py +165 -0
- modules/iatp/iatp/tests/test_telemetry.py +173 -0
- modules/iatp/paper/BLOG.md +307 -0
- modules/iatp/paper/PAPER.md +236 -0
- modules/iatp/paper/RFC_SUBMISSION.md +299 -0
- modules/iatp/paper/whitepaper.md +369 -0
- modules/iatp/proto/README.md +200 -0
- modules/iatp/proto/generate_stubs.py +81 -0
- modules/iatp/proto/iatp.proto +552 -0
- modules/iatp/pyproject.toml +180 -0
- modules/iatp/requirements-dev.txt +2 -0
- modules/iatp/requirements.txt +6 -0
- modules/iatp/setup.py +60 -0
- modules/iatp/sidecar/README.md +487 -0
- modules/iatp/sidecar/go/Dockerfile +32 -0
- modules/iatp/sidecar/go/README.md +237 -0
- modules/iatp/sidecar/go/go.mod +8 -0
- modules/iatp/sidecar/go/main.go +488 -0
- modules/iatp/spec/001-handshake.md +436 -0
- modules/iatp/spec/002-reversibility.md +394 -0
- modules/iatp/spec/schema/capability_manifest.json +266 -0
- modules/iatp/test_integration.py +310 -0
- modules/mcp-kernel-server/README.md +261 -0
- modules/mcp-kernel-server/pyproject.toml +60 -0
- modules/mcp-kernel-server/src/mcp_kernel_server/__init__.py +26 -0
- modules/mcp-kernel-server/src/mcp_kernel_server/cli.py +229 -0
- modules/mcp-kernel-server/src/mcp_kernel_server/resources.py +215 -0
- modules/mcp-kernel-server/src/mcp_kernel_server/server.py +562 -0
- modules/mcp-kernel-server/src/mcp_kernel_server/tools.py +1172 -0
- modules/mute-agent/.github/workflows/safety_check.yml +45 -0
- modules/mute-agent/.gitignore +53 -0
- modules/mute-agent/ARCHITECTURE.md +531 -0
- modules/mute-agent/BENCHMARK_GUIDE.md +384 -0
- modules/mute-agent/COMPLETION_SUMMARY.md +293 -0
- modules/mute-agent/EXPERIMENT_SUMMARY.md +318 -0
- modules/mute-agent/IMPLEMENTATION_SUMMARY.md +212 -0
- modules/mute-agent/LICENSE +21 -0
- modules/mute-agent/PHASE3_SUMMARY.md +297 -0
- modules/mute-agent/README.md +360 -0
- modules/mute-agent/STEEL_MAN_RESULTS.md +353 -0
- modules/mute-agent/USAGE.md +505 -0
- modules/mute-agent/V2_IMPLEMENTATION_SUMMARY.md +253 -0
- modules/mute-agent/V2_STEEL_MAN_IMPLEMENTATION.md +274 -0
- modules/mute-agent/VERIFICATION_REPORT.md +435 -0
- modules/mute-agent/charts/cost_comparison.png +0 -0
- modules/mute-agent/charts/cost_vs_ambiguity.png +0 -0
- modules/mute-agent/charts/metrics_comparison.png +0 -0
- modules/mute-agent/charts/scenario_breakdown.png +0 -0
- modules/mute-agent/charts/trace_attack_blocked.html +140 -0
- modules/mute-agent/charts/trace_attack_blocked.png +0 -0
- modules/mute-agent/charts/trace_failure.html +140 -0
- modules/mute-agent/charts/trace_failure.png +0 -0
- modules/mute-agent/charts/trace_success.html +140 -0
- modules/mute-agent/charts/trace_success.png +0 -0
- modules/mute-agent/examples/__init__.py +1 -0
- modules/mute-agent/examples/advanced_example.py +384 -0
- modules/mute-agent/examples/graph_debugger_demo.py +241 -0
- modules/mute-agent/examples/listener_example.py +297 -0
- modules/mute-agent/examples/simple_example.py +242 -0
- modules/mute-agent/examples/steel_man_demo.py +297 -0
- modules/mute-agent/experiments/README.md +135 -0
- modules/mute-agent/experiments/__init__.py +3 -0
- modules/mute-agent/experiments/agent_comparison.csv +6 -0
- modules/mute-agent/experiments/agent_comparison_50runs.csv +6 -0
- modules/mute-agent/experiments/ambiguity_test.py +335 -0
- modules/mute-agent/experiments/ambiguity_test_results.csv +31 -0
- modules/mute-agent/experiments/ambiguity_test_results_50runs.csv +51 -0
- modules/mute-agent/experiments/baseline_agent.py +189 -0
- modules/mute-agent/experiments/benchmark.py +402 -0
- modules/mute-agent/experiments/demo.py +172 -0
- modules/mute-agent/experiments/generate_cost_curve.py +474 -0
- modules/mute-agent/experiments/jailbreak_test.py +137 -0
- modules/mute-agent/experiments/latent_state_scenario.py +361 -0
- modules/mute-agent/experiments/mute_agent_experiment.py +349 -0
- modules/mute-agent/experiments/run_extended_experiment.py +40 -0
- modules/mute-agent/experiments/run_v2_experiments.py +266 -0
- modules/mute-agent/experiments/run_v2_experiments_auto.py +247 -0
- modules/mute-agent/experiments/v2_scenarios/README.md +214 -0
- modules/mute-agent/experiments/v2_scenarios/__init__.py +4 -0
- modules/mute-agent/experiments/v2_scenarios/scenario_1_deep_dependency.py +325 -0
- modules/mute-agent/experiments/v2_scenarios/scenario_2_adversarial.py +328 -0
- modules/mute-agent/experiments/v2_scenarios/scenario_3_false_positive.py +303 -0
- modules/mute-agent/experiments/v2_scenarios/scenario_4_performance.py +319 -0
- modules/mute-agent/experiments/visualize.py +400 -0
- modules/mute-agent/mute_agent/__init__.py +66 -0
- modules/mute-agent/mute_agent/core/__init__.py +1 -0
- modules/mute-agent/mute_agent/core/execution_agent.py +164 -0
- modules/mute-agent/mute_agent/core/handshake_protocol.py +199 -0
- modules/mute-agent/mute_agent/core/reasoning_agent.py +236 -0
- modules/mute-agent/mute_agent/knowledge_graph/__init__.py +1 -0
- modules/mute-agent/mute_agent/knowledge_graph/graph_elements.py +63 -0
- modules/mute-agent/mute_agent/knowledge_graph/multidimensional_graph.py +168 -0
- modules/mute-agent/mute_agent/knowledge_graph/subgraph.py +222 -0
- modules/mute-agent/mute_agent/listener/__init__.py +41 -0
- modules/mute-agent/mute_agent/listener/adapters/__init__.py +29 -0
- modules/mute-agent/mute_agent/listener/adapters/base_adapter.py +187 -0
- modules/mute-agent/mute_agent/listener/adapters/caas_adapter.py +342 -0
- modules/mute-agent/mute_agent/listener/adapters/control_plane_adapter.py +434 -0
- modules/mute-agent/mute_agent/listener/adapters/iatp_adapter.py +330 -0
- modules/mute-agent/mute_agent/listener/adapters/scak_adapter.py +249 -0
- modules/mute-agent/mute_agent/listener/listener.py +608 -0
- modules/mute-agent/mute_agent/listener/state_observer.py +434 -0
- modules/mute-agent/mute_agent/listener/threshold_config.py +311 -0
- modules/mute-agent/mute_agent/super_system/__init__.py +1 -0
- modules/mute-agent/mute_agent/super_system/router.py +202 -0
- modules/mute-agent/mute_agent/visualization/__init__.py +8 -0
- modules/mute-agent/mute_agent/visualization/graph_debugger.py +495 -0
- modules/mute-agent/requirements-dev.txt +6 -0
- modules/mute-agent/requirements.txt +9 -0
- modules/mute-agent/setup.py +64 -0
- modules/mute-agent/src/__init__.py +0 -0
- modules/mute-agent/src/agents/__init__.py +0 -0
- modules/mute-agent/src/agents/baseline_agent.py +524 -0
- modules/mute-agent/src/agents/interactive_agent.py +113 -0
- modules/mute-agent/src/agents/mute_agent.py +622 -0
- modules/mute-agent/src/benchmarks/__init__.py +0 -0
- modules/mute-agent/src/benchmarks/evaluator.py +481 -0
- modules/mute-agent/src/benchmarks/scenarios.json +985 -0
- modules/mute-agent/src/core/__init__.py +0 -0
- modules/mute-agent/src/core/mock_state.py +320 -0
- modules/mute-agent/src/core/tools.py +441 -0
- modules/nexus/__init__.py +49 -0
- modules/nexus/arbiter.py +357 -0
- modules/nexus/client.py +464 -0
- modules/nexus/dmz.py +417 -0
- modules/nexus/escrow.py +428 -0
- modules/nexus/exceptions.py +284 -0
- modules/nexus/registry.py +391 -0
- modules/nexus/reputation.py +423 -0
- modules/nexus/schemas/__init__.py +49 -0
- modules/nexus/schemas/compliance.py +274 -0
- modules/nexus/schemas/escrow.py +249 -0
- modules/nexus/schemas/manifest.py +223 -0
- modules/nexus/schemas/receipt.py +206 -0
- modules/observability/README.md +192 -0
- modules/observability/alertmanager/alertmanager.yml +116 -0
- modules/observability/alerts/agent-os-alerts.yaml +197 -0
- modules/observability/docker-compose.yml +128 -0
- modules/observability/grafana/dashboards/agent-os-amb.json +448 -0
- modules/observability/grafana/dashboards/agent-os-cmvk.json +441 -0
- modules/observability/grafana/dashboards/agent-os-overview.json +268 -0
- modules/observability/grafana/dashboards/agent-os-performance.json +15 -0
- modules/observability/grafana/dashboards/agent-os-safety.json +50 -0
- modules/observability/grafana/provisioning/dashboards/dashboards.yml +15 -0
- modules/observability/grafana/provisioning/datasources/datasources.yml +33 -0
- modules/observability/otel/otel-collector-config.yml +61 -0
- modules/observability/prometheus/prometheus.yml +63 -0
- modules/observability/pyproject.toml +53 -0
- modules/observability/scripts/export_dashboards.py +55 -0
- modules/observability/src/agent_os_observability/__init__.py +25 -0
- modules/observability/src/agent_os_observability/dashboards.py +896 -0
- modules/observability/src/agent_os_observability/metrics.py +396 -0
- modules/observability/src/agent_os_observability/server.py +221 -0
- modules/observability/src/agent_os_observability/tracer.py +226 -0
- modules/primitives/.gitignore +8 -0
- modules/primitives/README.md +62 -0
- modules/primitives/agent_primitives/__init__.py +22 -0
- modules/primitives/agent_primitives/failures.py +82 -0
- modules/primitives/agent_primitives/py.typed +0 -0
- modules/primitives/pyproject.toml +68 -0
- modules/scak/.github/copilot-instructions.md +396 -0
- modules/scak/.github/workflows/release.yml +117 -0
- modules/scak/.gitignore +32 -0
- modules/scak/CHANGELOG.md +173 -0
- modules/scak/CITATION.cff +62 -0
- modules/scak/CONTRIBUTING.md +429 -0
- modules/scak/Dockerfile +58 -0
- modules/scak/ENTERPRISE_FEATURES.md +518 -0
- modules/scak/IMPLEMENTATION_SUMMARY.md +206 -0
- modules/scak/LIMITATIONS.md +565 -0
- modules/scak/MANIFEST.in +16 -0
- modules/scak/NOVELTY.md +535 -0
- modules/scak/README.md +928 -0
- modules/scak/RESEARCH.md +670 -0
- modules/scak/agent_kernel/__init__.py +66 -0
- modules/scak/agent_kernel/analyzer.py +432 -0
- modules/scak/agent_kernel/auditor.py +31 -0
- modules/scak/agent_kernel/completeness_auditor.py +234 -0
- modules/scak/agent_kernel/detector.py +200 -0
- modules/scak/agent_kernel/kernel.py +741 -0
- modules/scak/agent_kernel/memory_manager.py +82 -0
- modules/scak/agent_kernel/models.py +372 -0
- modules/scak/agent_kernel/nudge_mechanism.py +260 -0
- modules/scak/agent_kernel/outcome_analyzer.py +335 -0
- modules/scak/agent_kernel/patcher.py +579 -0
- modules/scak/agent_kernel/semantic_analyzer.py +313 -0
- modules/scak/agent_kernel/semantic_purge.py +346 -0
- modules/scak/agent_kernel/simulator.py +447 -0
- modules/scak/agent_kernel/teacher.py +82 -0
- modules/scak/agent_kernel/triage.py +149 -0
- modules/scak/build_and_publish.ps1 +74 -0
- modules/scak/build_and_publish.sh +74 -0
- modules/scak/cli.py +471 -0
- modules/scak/dashboard.py +462 -0
- modules/scak/datasets/DATASET_CARD.md +219 -0
- modules/scak/datasets/README.md +143 -0
- modules/scak/datasets/gaia_vague_queries/vague_queries.json +262 -0
- modules/scak/datasets/hf_upload/README.md +219 -0
- modules/scak/datasets/hf_upload/scak_gaia_laziness.jsonl +50 -0
- modules/scak/datasets/prepare_hf_datasets.py +145 -0
- modules/scak/datasets/red_team/jailbreak_patterns.json +202 -0
- modules/scak/docker-compose.yml +99 -0
- modules/scak/docs/Adaptive-Memory-Hierarchy.md +319 -0
- modules/scak/docs/Data-Contracts-and-Schemas.md +285 -0
- modules/scak/docs/Dual-Loop-Architecture.md +344 -0
- modules/scak/docs/Enhanced-Features.md +612 -0
- modules/scak/docs/LANGCHAIN_INTEGRATION.md +572 -0
- modules/scak/docs/README.md +128 -0
- modules/scak/docs/Reference-Implementations.md +163 -0
- modules/scak/docs/SCAK_V2.md +374 -0
- modules/scak/docs/Three-Failure-Types.md +178 -0
- modules/scak/examples/basic_example.py +155 -0
- modules/scak/examples/circuit_breaker_lazy_eval_demo.py +243 -0
- modules/scak/examples/langchain_integration_example.py +339 -0
- modules/scak/examples/layer4_demo.py +243 -0
- modules/scak/examples/production_features_demo.py +353 -0
- modules/scak/examples/quick_demo.py +79 -0
- modules/scak/examples/scak_v2_demo.py +252 -0
- modules/scak/experiments/README.md +438 -0
- modules/scak/experiments/ablation_studies/README.md +192 -0
- modules/scak/experiments/ablation_studies/ablation_no_audit.py +116 -0
- modules/scak/experiments/ablation_studies/ablation_no_purge.py +133 -0
- modules/scak/experiments/chaos_engineering/README.md +332 -0
- modules/scak/experiments/context_efficiency_test.py +328 -0
- modules/scak/experiments/gaia_benchmark/README.md +208 -0
- modules/scak/experiments/laziness_benchmark.py +179 -0
- modules/scak/experiments/long_horizon_task_experiment.py +252 -0
- modules/scak/experiments/multi_agent_rag_experiment.py +284 -0
- modules/scak/experiments/results/ablation_table.md +12 -0
- modules/scak/experiments/results/long_horizon.json +36 -0
- modules/scak/experiments/results/multi_agent_rag.json +66 -0
- modules/scak/experiments/run_comprehensive_ablations.py +332 -0
- modules/scak/experiments/test_auditor_patcher_integration.py +251 -0
- modules/scak/notebooks/getting_started.ipynb +33 -0
- modules/scak/paper/ARXIV_SUBMISSION_METADATA.txt +109 -0
- modules/scak/paper/PAPER_CHECKLIST.md +304 -0
- modules/scak/paper/Paper.pdf +0 -0
- modules/scak/paper/README.md +113 -0
- modules/scak/paper/appendix.md +351 -0
- modules/scak/paper/arxiv/bibliography.bib +284 -0
- modules/scak/paper/arxiv/fig1_ooda_architecture.pdf +0 -0
- modules/scak/paper/arxiv/fig2_memory_hierarchy.pdf +0 -0
- modules/scak/paper/arxiv/fig3_gaia_results.pdf +0 -0
- modules/scak/paper/arxiv/fig4_ablation_heatmap.pdf +0 -0
- modules/scak/paper/arxiv/fig5_context_reduction.pdf +0 -0
- modules/scak/paper/arxiv/fig6_mttr_boxplot.pdf +0 -0
- modules/scak/paper/arxiv/main.aux +103 -0
- modules/scak/paper/arxiv/main.bbl +113 -0
- modules/scak/paper/arxiv/main.blg +55 -0
- modules/scak/paper/arxiv/main.out +31 -0
- modules/scak/paper/arxiv/main.pdf +0 -0
- modules/scak/paper/arxiv/main.tex +482 -0
- modules/scak/paper/arxiv_submission/bibliography.bib +284 -0
- modules/scak/paper/arxiv_submission/fig1_ooda_architecture.pdf +0 -0
- modules/scak/paper/arxiv_submission/fig2_memory_hierarchy.pdf +0 -0
- modules/scak/paper/arxiv_submission/fig3_gaia_results.pdf +0 -0
- modules/scak/paper/arxiv_submission/fig4_ablation_heatmap.pdf +0 -0
- modules/scak/paper/arxiv_submission/fig5_context_reduction.pdf +0 -0
- modules/scak/paper/arxiv_submission/fig6_mttr_boxplot.pdf +0 -0
- modules/scak/paper/arxiv_submission/main.aux +103 -0
- modules/scak/paper/arxiv_submission/main.bbl +113 -0
- modules/scak/paper/arxiv_submission/main.blg +55 -0
- modules/scak/paper/arxiv_submission/main.out +31 -0
- modules/scak/paper/arxiv_submission/main.pdf +0 -0
- modules/scak/paper/arxiv_submission/main.tex +482 -0
- modules/scak/paper/arxiv_submission.tar.gz +0 -0
- modules/scak/paper/bibliography.bib +284 -0
- modules/scak/paper/build.sh +55 -0
- modules/scak/paper/figures/README.md +32 -0
- modules/scak/paper/figures/fig1_ooda_architecture.md +75 -0
- modules/scak/paper/figures/fig1_ooda_architecture.pdf +0 -0
- modules/scak/paper/figures/fig1_ooda_architecture.png +0 -0
- modules/scak/paper/figures/fig2_memory_hierarchy.md +83 -0
- modules/scak/paper/figures/fig2_memory_hierarchy.pdf +0 -0
- modules/scak/paper/figures/fig2_memory_hierarchy.png +0 -0
- modules/scak/paper/figures/fig3_gaia_results.md +64 -0
- modules/scak/paper/figures/fig3_gaia_results.pdf +0 -0
- modules/scak/paper/figures/fig3_gaia_results.png +0 -0
- modules/scak/paper/figures/fig4_ablation_heatmap.md +64 -0
- modules/scak/paper/figures/fig4_ablation_heatmap.pdf +0 -0
- modules/scak/paper/figures/fig4_ablation_heatmap.png +0 -0
- modules/scak/paper/figures/fig5_context_reduction.md +71 -0
- modules/scak/paper/figures/fig5_context_reduction.pdf +0 -0
- modules/scak/paper/figures/fig5_context_reduction.png +0 -0
- modules/scak/paper/figures/fig6_mttr_boxplot.md +80 -0
- modules/scak/paper/figures/fig6_mttr_boxplot.pdf +0 -0
- modules/scak/paper/figures/fig6_mttr_boxplot.png +0 -0
- modules/scak/paper/figures/generate_figures.py +463 -0
- modules/scak/paper/main.aux +103 -0
- modules/scak/paper/main.bbl +113 -0
- modules/scak/paper/main.blg +55 -0
- modules/scak/paper/main.md +192 -0
- modules/scak/paper/main.out +31 -0
- modules/scak/paper/main.pdf +0 -0
- modules/scak/paper/main.tex +482 -0
- modules/scak/reproducibility/ABLATIONS.md +225 -0
- modules/scak/reproducibility/Dockerfile.reproducibility +34 -0
- modules/scak/reproducibility/README.md +421 -0
- modules/scak/reproducibility/requirements-pinned.txt +32 -0
- modules/scak/reproducibility/run_all_experiments.py +395 -0
- modules/scak/reproducibility/seed_control.py +53 -0
- modules/scak/reproducibility/statistical_analysis.py +302 -0
- modules/scak/requirements.txt +50 -0
- modules/scak/setup.py +93 -0
- modules/scak/src/__init__.py +124 -0
- modules/scak/src/agents/__init__.py +13 -0
- modules/scak/src/agents/conflict_resolution.py +732 -0
- modules/scak/src/agents/orchestrator.py +761 -0
- modules/scak/src/agents/pubsub.py +484 -0
- modules/scak/src/agents/shadow_teacher.py +344 -0
- modules/scak/src/agents/swarm.py +661 -0
- modules/scak/src/agents/worker.py +357 -0
- modules/scak/src/integrations/__init__.py +81 -0
- modules/scak/src/integrations/cmvk_adapter.py +430 -0
- modules/scak/src/integrations/control_plane_adapter.py +601 -0
- modules/scak/src/integrations/langchain_integration.py +902 -0
- modules/scak/src/interfaces/__init__.py +59 -0
- modules/scak/src/interfaces/llm_clients.py +505 -0
- modules/scak/src/interfaces/openapi_tools.py +611 -0
- modules/scak/src/interfaces/plugin_system.py +605 -0
- modules/scak/src/interfaces/protocols.py +365 -0
- modules/scak/src/interfaces/telemetry.py +464 -0
- modules/scak/src/interfaces/tool_registry.py +547 -0
- modules/scak/src/kernel/__init__.py +100 -0
- modules/scak/src/kernel/auditor.py +305 -0
- modules/scak/src/kernel/circuit_breaker.py +398 -0
- modules/scak/src/kernel/core.py +724 -0
- modules/scak/src/kernel/distributed.py +667 -0
- modules/scak/src/kernel/evolution.py +455 -0
- modules/scak/src/kernel/failover.py +621 -0
- modules/scak/src/kernel/governance.py +710 -0
- modules/scak/src/kernel/governance_v2.py +603 -0
- modules/scak/src/kernel/lazy_evaluator.py +514 -0
- modules/scak/src/kernel/load_testing.py +633 -0
- modules/scak/src/kernel/memory.py +945 -0
- modules/scak/src/kernel/patcher.py +581 -0
- modules/scak/src/kernel/rubric.py +419 -0
- modules/scak/src/kernel/schemas.py +390 -0
- modules/scak/src/kernel/skill_mapper.py +309 -0
- modules/scak/src/kernel/triage.py +149 -0
- modules/scak/src/mocks/__init__.py +99 -0
- modules/scak/tests/__init__.py +1 -0
- modules/scak/tests/test_circuit_breaker.py +403 -0
- modules/scak/tests/test_conflict_resolution.py +287 -0
- modules/scak/tests/test_dual_loop.py +463 -0
- modules/scak/tests/test_enhanced_features.py +421 -0
- modules/scak/tests/test_failover_and_load.py +438 -0
- modules/scak/tests/test_governance.py +185 -0
- modules/scak/tests/test_kernel.py +359 -0
- modules/scak/tests/test_langchain_integration.py +451 -0
- modules/scak/tests/test_lazy_evaluator.py +465 -0
- modules/scak/tests/test_llm_clients.py +122 -0
- modules/scak/tests/test_memory_controller.py +528 -0
- modules/scak/tests/test_orchestrator.py +181 -0
- modules/scak/tests/test_phase3_integration.py +265 -0
- modules/scak/tests/test_pubsub_swarm.py +203 -0
- modules/scak/tests/test_reference_implementations.py +240 -0
- modules/scak/tests/test_rubric.py +363 -0
- modules/scak/tests/test_scak_v2.py +651 -0
- modules/scak/tests/test_skill_mapper.py +217 -0
- modules/scak/tests/test_specific_failures.py +393 -0
- modules/scak/tests/test_tool_registry.py +264 -0
- modules/scak/tests/test_tools_and_plugins.py +303 -0
- modules/scak/tests/test_triage.py +596 -0
- modules/scak/tests/test_write_through.py +319 -0
- agent_os_kernel-1.1.0.dist-info/METADATA +0 -400
- agent_os_kernel-1.1.0.dist-info/RECORD +0 -12
- {agent_os_kernel-1.1.0.dist-info → agent_os_kernel-1.2.0.dist-info}/WHEEL +0 -0
- {agent_os_kernel-1.1.0.dist-info → agent_os_kernel-1.2.0.dist-info}/licenses/LICENSE +0 -0
|
@@ -0,0 +1,346 @@
|
|
|
1
|
+
# Contributing to Context-as-a-Service
|
|
2
|
+
|
|
3
|
+
Thank you for your interest in contributing! This document provides guidelines for contributing to the project.
|
|
4
|
+
|
|
5
|
+
## Code of Conduct
|
|
6
|
+
|
|
7
|
+
Be respectful, inclusive, and collaborative. We're here to build something great together.
|
|
8
|
+
|
|
9
|
+
## Getting Started
|
|
10
|
+
|
|
11
|
+
### 1. Fork and Clone
|
|
12
|
+
|
|
13
|
+
```bash
|
|
14
|
+
git clone https://github.com/<your-username>/context-as-a-service.git
|
|
15
|
+
cd context-as-a-service
|
|
16
|
+
```
|
|
17
|
+
|
|
18
|
+
### 2. Install Dependencies
|
|
19
|
+
|
|
20
|
+
```bash
|
|
21
|
+
pip install -r requirements.txt
|
|
22
|
+
```
|
|
23
|
+
|
|
24
|
+
### 3. Run Tests
|
|
25
|
+
|
|
26
|
+
```bash
|
|
27
|
+
python run_tests.py
|
|
28
|
+
```
|
|
29
|
+
|
|
30
|
+
## Project Structure
|
|
31
|
+
|
|
32
|
+
```
|
|
33
|
+
context-as-a-service/
|
|
34
|
+
├── caas/ # Main package
|
|
35
|
+
│ ├── __init__.py
|
|
36
|
+
│ ├── models.py # Data models
|
|
37
|
+
│ ├── cli.py # CLI tool
|
|
38
|
+
│ ├── api/ # REST API
|
|
39
|
+
│ ├── ingestion/ # Document processors
|
|
40
|
+
│ ├── detection/ # Type detection
|
|
41
|
+
│ ├── tuning/ # Weight tuning
|
|
42
|
+
│ ├── storage/ # Document storage
|
|
43
|
+
│ ├── enrichment.py # Metadata enrichment
|
|
44
|
+
│ ├── decay.py # Time-based decay
|
|
45
|
+
│ ├── triad.py # Context Triad (Hot/Warm/Cold)
|
|
46
|
+
│ ├── pragmatic_truth.py # Source tracking
|
|
47
|
+
│ ├── routing/ # Heuristic routing
|
|
48
|
+
│ ├── conversation.py # Conversation management
|
|
49
|
+
│ └── gateway/ # Trust Gateway
|
|
50
|
+
├── tests/ # Test suite
|
|
51
|
+
├── examples/ # Example usage
|
|
52
|
+
│ ├── agents/ # Sample agent implementations
|
|
53
|
+
│ └── *.py # Demo scripts
|
|
54
|
+
├── docs/ # Documentation (markdown files)
|
|
55
|
+
├── run_tests.py # Test runner
|
|
56
|
+
├── TESTING.md # Testing guide
|
|
57
|
+
├── CONTRIBUTING.md # This file
|
|
58
|
+
└── README.md # Project overview
|
|
59
|
+
```
|
|
60
|
+
|
|
61
|
+
## Development Workflow
|
|
62
|
+
|
|
63
|
+
### 1. Create a Branch
|
|
64
|
+
|
|
65
|
+
```bash
|
|
66
|
+
git checkout -b feature/your-feature-name
|
|
67
|
+
```
|
|
68
|
+
|
|
69
|
+
### 2. Make Changes
|
|
70
|
+
|
|
71
|
+
- Write clear, documented code
|
|
72
|
+
- Follow existing code style
|
|
73
|
+
- Add docstrings to functions and classes
|
|
74
|
+
- Use type hints where appropriate
|
|
75
|
+
|
|
76
|
+
### 3. Add Tests
|
|
77
|
+
|
|
78
|
+
For any new functionality, add tests in `tests/`:
|
|
79
|
+
|
|
80
|
+
```python
|
|
81
|
+
"""
|
|
82
|
+
Test description.
|
|
83
|
+
"""
|
|
84
|
+
import sys
|
|
85
|
+
from pathlib import Path
|
|
86
|
+
sys.path.insert(0, str(Path(__file__).parent.parent))
|
|
87
|
+
|
|
88
|
+
from caas.module import NewFeature
|
|
89
|
+
|
|
90
|
+
|
|
91
|
+
def test_new_feature():
|
|
92
|
+
"""Test the new feature."""
|
|
93
|
+
print("\n=== Testing New Feature ===")
|
|
94
|
+
|
|
95
|
+
feature = NewFeature()
|
|
96
|
+
result = feature.do_something()
|
|
97
|
+
|
|
98
|
+
assert result is not None
|
|
99
|
+
print("✓ Feature working correctly")
|
|
100
|
+
```
|
|
101
|
+
|
|
102
|
+
### 4. Run Tests
|
|
103
|
+
|
|
104
|
+
```bash
|
|
105
|
+
python run_tests.py
|
|
106
|
+
```
|
|
107
|
+
|
|
108
|
+
### 5. Commit Changes
|
|
109
|
+
|
|
110
|
+
Use clear, descriptive commit messages:
|
|
111
|
+
|
|
112
|
+
```bash
|
|
113
|
+
git add .
|
|
114
|
+
git commit -m "Add feature: brief description
|
|
115
|
+
|
|
116
|
+
- Detailed point 1
|
|
117
|
+
- Detailed point 2
|
|
118
|
+
- Closes #123"
|
|
119
|
+
```
|
|
120
|
+
|
|
121
|
+
### 6. Push and Create PR
|
|
122
|
+
|
|
123
|
+
```bash
|
|
124
|
+
git push origin feature/your-feature-name
|
|
125
|
+
```
|
|
126
|
+
|
|
127
|
+
Then create a Pull Request on GitHub.
|
|
128
|
+
|
|
129
|
+
## Coding Standards
|
|
130
|
+
|
|
131
|
+
### Python Style
|
|
132
|
+
|
|
133
|
+
- Follow PEP 8 style guide
|
|
134
|
+
- Use 4 spaces for indentation
|
|
135
|
+
- Maximum line length: 100 characters
|
|
136
|
+
- Use descriptive variable names
|
|
137
|
+
|
|
138
|
+
### Docstrings
|
|
139
|
+
|
|
140
|
+
Use Google-style docstrings:
|
|
141
|
+
|
|
142
|
+
```python
|
|
143
|
+
def extract_context(document_id: str, query: str, max_tokens: int = 2000) -> Tuple[str, Dict]:
|
|
144
|
+
"""
|
|
145
|
+
Extract context from a document.
|
|
146
|
+
|
|
147
|
+
Args:
|
|
148
|
+
document_id: ID of document to extract from
|
|
149
|
+
query: Search query
|
|
150
|
+
max_tokens: Maximum tokens to extract
|
|
151
|
+
|
|
152
|
+
Returns:
|
|
153
|
+
Tuple of (context_string, metadata_dict)
|
|
154
|
+
|
|
155
|
+
Raises:
|
|
156
|
+
ValueError: If document not found
|
|
157
|
+
|
|
158
|
+
Example:
|
|
159
|
+
>>> context, metadata = extractor.extract_context("doc-123", "authentication")
|
|
160
|
+
>>> print(len(context))
|
|
161
|
+
1847
|
|
162
|
+
"""
|
|
163
|
+
# Implementation
|
|
164
|
+
```
|
|
165
|
+
|
|
166
|
+
### Type Hints
|
|
167
|
+
|
|
168
|
+
Use type hints for function signatures:
|
|
169
|
+
|
|
170
|
+
```python
|
|
171
|
+
from typing import List, Dict, Optional
|
|
172
|
+
|
|
173
|
+
def process_documents(
|
|
174
|
+
docs: List[Document],
|
|
175
|
+
max_count: Optional[int] = None
|
|
176
|
+
) -> Dict[str, Any]:
|
|
177
|
+
...
|
|
178
|
+
```
|
|
179
|
+
|
|
180
|
+
## Testing Guidelines
|
|
181
|
+
|
|
182
|
+
### Test Coverage
|
|
183
|
+
|
|
184
|
+
- Write tests for all new features
|
|
185
|
+
- Test edge cases and error conditions
|
|
186
|
+
- Aim for clear, readable test code
|
|
187
|
+
|
|
188
|
+
### Test Structure
|
|
189
|
+
|
|
190
|
+
```python
|
|
191
|
+
def test_feature_name():
|
|
192
|
+
"""Test description."""
|
|
193
|
+
print("\n=== Testing Feature ===")
|
|
194
|
+
|
|
195
|
+
# Setup
|
|
196
|
+
component = Component()
|
|
197
|
+
|
|
198
|
+
# Execute
|
|
199
|
+
result = component.method()
|
|
200
|
+
|
|
201
|
+
# Assert
|
|
202
|
+
assert result == expected_value
|
|
203
|
+
print("✓ Test passed")
|
|
204
|
+
```
|
|
205
|
+
|
|
206
|
+
### Running Tests
|
|
207
|
+
|
|
208
|
+
```bash
|
|
209
|
+
# All tests
|
|
210
|
+
python run_tests.py
|
|
211
|
+
|
|
212
|
+
# Specific test
|
|
213
|
+
python -m tests.test_module_name
|
|
214
|
+
```
|
|
215
|
+
|
|
216
|
+
## Areas for Contribution
|
|
217
|
+
|
|
218
|
+
### High Priority
|
|
219
|
+
|
|
220
|
+
1. **Additional Document Processors**
|
|
221
|
+
- Support for more file formats (DOCX, Markdown, etc.)
|
|
222
|
+
- Better code language support
|
|
223
|
+
- Improved structure detection
|
|
224
|
+
|
|
225
|
+
2. **Enhanced Detection**
|
|
226
|
+
- Better document type classification
|
|
227
|
+
- More sophisticated pattern matching
|
|
228
|
+
- Machine learning-based detection
|
|
229
|
+
|
|
230
|
+
3. **Performance Optimization**
|
|
231
|
+
- Faster document processing
|
|
232
|
+
- Efficient storage mechanisms
|
|
233
|
+
- Caching strategies
|
|
234
|
+
|
|
235
|
+
### Medium Priority
|
|
236
|
+
|
|
237
|
+
4. **API Enhancements**
|
|
238
|
+
- Additional endpoints
|
|
239
|
+
- WebSocket support for real-time updates
|
|
240
|
+
- GraphQL API option
|
|
241
|
+
|
|
242
|
+
5. **CLI Improvements**
|
|
243
|
+
- Interactive mode
|
|
244
|
+
- Better output formatting
|
|
245
|
+
- Progress indicators
|
|
246
|
+
|
|
247
|
+
6. **Documentation**
|
|
248
|
+
- More examples
|
|
249
|
+
- Tutorials
|
|
250
|
+
- Video guides
|
|
251
|
+
|
|
252
|
+
### Lower Priority
|
|
253
|
+
|
|
254
|
+
7. **UI/Dashboard**
|
|
255
|
+
- Web-based interface
|
|
256
|
+
- Visualization of document structures
|
|
257
|
+
- Analytics dashboard
|
|
258
|
+
|
|
259
|
+
8. **Integrations**
|
|
260
|
+
- Slack bot
|
|
261
|
+
- VS Code extension
|
|
262
|
+
- Zapier integration
|
|
263
|
+
|
|
264
|
+
## Module-Specific Guidelines
|
|
265
|
+
|
|
266
|
+
### Ingestion Module (`caas/ingestion/`)
|
|
267
|
+
|
|
268
|
+
When adding new processors:
|
|
269
|
+
- Inherit from base `Processor` class
|
|
270
|
+
- Implement `process()` method
|
|
271
|
+
- Track document hierarchy
|
|
272
|
+
- Extract meaningful sections
|
|
273
|
+
|
|
274
|
+
### Detection Module (`caas/detection/`)
|
|
275
|
+
|
|
276
|
+
When adding detection patterns:
|
|
277
|
+
- Add patterns to appropriate category
|
|
278
|
+
- Test with diverse documents
|
|
279
|
+
- Consider false positives/negatives
|
|
280
|
+
|
|
281
|
+
### Tuning Module (`caas/tuning/`)
|
|
282
|
+
|
|
283
|
+
When adding tuning rules:
|
|
284
|
+
- Add to `TYPE_SPECIFIC_WEIGHTS`
|
|
285
|
+
- Document reasoning
|
|
286
|
+
- Test with real documents
|
|
287
|
+
|
|
288
|
+
## Documentation
|
|
289
|
+
|
|
290
|
+
### Inline Documentation
|
|
291
|
+
|
|
292
|
+
- Add docstrings to all public functions/classes
|
|
293
|
+
- Include examples in docstrings
|
|
294
|
+
- Explain complex algorithms
|
|
295
|
+
|
|
296
|
+
### README Updates
|
|
297
|
+
|
|
298
|
+
Update README.md when:
|
|
299
|
+
- Adding new features
|
|
300
|
+
- Changing API
|
|
301
|
+
- Updating installation process
|
|
302
|
+
|
|
303
|
+
### Architecture Documentation
|
|
304
|
+
|
|
305
|
+
Document architectural decisions in:
|
|
306
|
+
- Code comments for complex logic
|
|
307
|
+
- Separate docs for major changes
|
|
308
|
+
- Examples for new patterns
|
|
309
|
+
|
|
310
|
+
## Pull Request Process
|
|
311
|
+
|
|
312
|
+
1. **Before Submitting**
|
|
313
|
+
- Run all tests
|
|
314
|
+
- Update documentation
|
|
315
|
+
- Add examples if needed
|
|
316
|
+
- Rebase on latest main
|
|
317
|
+
|
|
318
|
+
2. **PR Description**
|
|
319
|
+
- Describe what and why
|
|
320
|
+
- Link related issues
|
|
321
|
+
- Show before/after examples
|
|
322
|
+
- List breaking changes
|
|
323
|
+
|
|
324
|
+
3. **Review Process**
|
|
325
|
+
- Address review comments
|
|
326
|
+
- Keep discussions focused
|
|
327
|
+
- Be open to suggestions
|
|
328
|
+
|
|
329
|
+
4. **After Merge**
|
|
330
|
+
- Delete your branch
|
|
331
|
+
- Update your fork
|
|
332
|
+
- Celebrate! 🎉
|
|
333
|
+
|
|
334
|
+
## Questions or Issues?
|
|
335
|
+
|
|
336
|
+
- **Bug reports**: Open an issue with reproduction steps
|
|
337
|
+
- **Feature requests**: Open an issue with use case description
|
|
338
|
+
- **Questions**: Start a discussion or open an issue
|
|
339
|
+
|
|
340
|
+
## License
|
|
341
|
+
|
|
342
|
+
By contributing, you agree that your contributions will be licensed under the MIT License.
|
|
343
|
+
|
|
344
|
+
---
|
|
345
|
+
|
|
346
|
+
Thank you for contributing to Context-as-a-Service! 🚀
|
|
@@ -0,0 +1,336 @@
|
|
|
1
|
+
# Ethics and Limitations
|
|
2
|
+
|
|
3
|
+
## Overview
|
|
4
|
+
|
|
5
|
+
This document addresses the ethical considerations, known limitations, potential biases, and responsible use guidelines for Context-as-a-Service (CaaS). We believe in transparent communication about system capabilities and constraints.
|
|
6
|
+
|
|
7
|
+
## Ethical Considerations
|
|
8
|
+
|
|
9
|
+
### 1. Data Privacy and Consent
|
|
10
|
+
|
|
11
|
+
#### User Consent
|
|
12
|
+
- **Principle**: Users must explicitly consent to document ingestion and processing
|
|
13
|
+
- **Implementation**: CaaS processes only explicitly provided documents
|
|
14
|
+
- **Consideration**: Organizations must ensure they have rights to process all ingested content
|
|
15
|
+
- **Risk**: Inadvertent processing of personal or confidential data without proper consent
|
|
16
|
+
|
|
17
|
+
#### Data Minimization
|
|
18
|
+
- **Principle**: Collect and retain only necessary data
|
|
19
|
+
- **Implementation**: Configurable retention policies and data purging capabilities
|
|
20
|
+
- **Consideration**: Balance between context quality and privacy
|
|
21
|
+
- **Risk**: Over-retention of data beyond its useful lifetime
|
|
22
|
+
|
|
23
|
+
### 2. Transparency and Explainability
|
|
24
|
+
|
|
25
|
+
#### Source Attribution
|
|
26
|
+
- **Strength**: CaaS provides transparent source citations for all context
|
|
27
|
+
- **Benefit**: Users can verify information origin and trustworthiness
|
|
28
|
+
- **Limitation**: Source tracking may reveal organizational patterns
|
|
29
|
+
- **Mitigation**: Configurable source anonymization options
|
|
30
|
+
|
|
31
|
+
#### Decision Transparency
|
|
32
|
+
- **Strength**: Heuristic routing provides deterministic, auditable decisions
|
|
33
|
+
- **Benefit**: No "black box" AI routing decisions
|
|
34
|
+
- **Limitation**: Simple heuristics may miss nuanced query intent
|
|
35
|
+
- **Trade-off**: Transparency over potentially higher accuracy
|
|
36
|
+
|
|
37
|
+
### 3. Bias and Fairness
|
|
38
|
+
|
|
39
|
+
#### Temporal Bias
|
|
40
|
+
- **Issue**: Time decay inherently biases toward recent information
|
|
41
|
+
- **Impact**: Older but still-valid information may be deprioritized
|
|
42
|
+
- **Use Case**: Appropriate for fast-moving domains (software, news)
|
|
43
|
+
- **Inappropriate**: Historical research, legal precedents, foundational knowledge
|
|
44
|
+
- **Mitigation**: Configurable decay rates, ability to disable time-based ranking
|
|
45
|
+
|
|
46
|
+
#### Source Bias
|
|
47
|
+
- **Issue**: "Pragmatic Truth" may elevate unofficial sources (Slack, forums) over official documentation
|
|
48
|
+
- **Impact**: Unofficial but practical knowledge gets visibility
|
|
49
|
+
- **Risk**: Unofficial sources may contain incorrect information
|
|
50
|
+
- **Mitigation**: Conflict detection highlights discrepancies between sources
|
|
51
|
+
|
|
52
|
+
#### Structural Bias
|
|
53
|
+
- **Issue**: Auto-tuning weights based on detected patterns
|
|
54
|
+
- **Impact**: Content types appearing frequently get higher weights
|
|
55
|
+
- **Risk**: Minority document types may be underrepresented
|
|
56
|
+
- **Mitigation**: Manual weight overrides, minimum weight thresholds
|
|
57
|
+
|
|
58
|
+
#### Language and Cultural Bias
|
|
59
|
+
- **Issue**: Structure detection and metadata enrichment tuned for English content
|
|
60
|
+
- **Impact**: Non-English documents may have suboptimal structure detection
|
|
61
|
+
- **Risk**: Reduced quality for international users
|
|
62
|
+
- **Limitation**: Current version is English-centric
|
|
63
|
+
- **Future Work**: Multi-language support, cultural context awareness
|
|
64
|
+
|
|
65
|
+
### 4. Dual Use and Misuse Potential
|
|
66
|
+
|
|
67
|
+
#### Surveillance and Monitoring
|
|
68
|
+
- **Risk**: CaaS could be used to monitor employee communications
|
|
69
|
+
- **Example**: Ingesting internal Slack/email to answer "What is the team saying about X?"
|
|
70
|
+
- **Ethics**: Employee surveillance without consent is unethical
|
|
71
|
+
- **Guidance**: Organizations must have clear policies and employee notification
|
|
72
|
+
|
|
73
|
+
#### Competitive Intelligence
|
|
74
|
+
- **Risk**: Aggressive scraping of competitor websites/documentation
|
|
75
|
+
- **Ethics**: Respecting robots.txt, terms of service, and legal boundaries
|
|
76
|
+
- **Guidance**: Only ingest publicly available, legally accessible content
|
|
77
|
+
|
|
78
|
+
#### Disinformation and Manipulation
|
|
79
|
+
- **Risk**: Selectively ingesting biased sources to manipulate context
|
|
80
|
+
- **Example**: Only ingesting one side of a debate to bias AI responses
|
|
81
|
+
- **Mitigation**: Encourage diverse source ingestion
|
|
82
|
+
- **Responsibility**: Ultimately on the deploying organization
|
|
83
|
+
|
|
84
|
+
### 5. Environmental Impact
|
|
85
|
+
|
|
86
|
+
#### Carbon Footprint
|
|
87
|
+
- **Processing Cost**: Document ingestion and structure analysis require computation
|
|
88
|
+
- **Embedding Cost**: If vector embeddings are used (optional future feature)
|
|
89
|
+
- **Inference Cost**: Context extraction and serving
|
|
90
|
+
- **Mitigation**:
|
|
91
|
+
- Efficient algorithms (no unnecessary re-processing)
|
|
92
|
+
- Local deployment reduces network costs
|
|
93
|
+
- Heuristic routing (no LLM calls for routing)
|
|
94
|
+
- Sliding window (no summarization LLM calls)
|
|
95
|
+
|
|
96
|
+
#### Resource Efficiency
|
|
97
|
+
- **Strength**: CaaS optimizes for efficiency over maximum accuracy
|
|
98
|
+
- **Examples**:
|
|
99
|
+
- Chopping (FIFO) instead of expensive summarization
|
|
100
|
+
- Heuristic routing instead of LLM-based routing
|
|
101
|
+
- Local processing instead of API calls
|
|
102
|
+
- **Philosophy**: "Good enough" solutions that minimize resource waste
|
|
103
|
+
|
|
104
|
+
## Known Limitations
|
|
105
|
+
|
|
106
|
+
### 1. Context Quality Limitations
|
|
107
|
+
|
|
108
|
+
#### Flat Embedding Limitations (if using vector search)
|
|
109
|
+
- **Issue**: Even with structure-aware indexing, semantic search has inherent limitations
|
|
110
|
+
- **Example**: Cannot distinguish between "This is good" (positive) and "This is not good" (negative)
|
|
111
|
+
- **Impact**: Some nuanced queries may retrieve suboptimal context
|
|
112
|
+
- **Mitigation**: Hybrid search combining keywords and semantics (future work)
|
|
113
|
+
|
|
114
|
+
#### Metadata Incompleteness
|
|
115
|
+
- **Issue**: Metadata enrichment depends on structure detection accuracy
|
|
116
|
+
- **Example**: Unstructured documents may have minimal metadata
|
|
117
|
+
- **Impact**: Less effective chunk disambiguation
|
|
118
|
+
- **Mitigation**: Manual metadata addition capabilities
|
|
119
|
+
|
|
120
|
+
#### Cold Start Problem
|
|
121
|
+
- **Issue**: Auto-tuning requires sufficient corpus to learn patterns
|
|
122
|
+
- **Example**: First few documents have generic weights
|
|
123
|
+
- **Impact**: Suboptimal context quality initially
|
|
124
|
+
- **Mitigation**: Sensible defaults, manual tuning option
|
|
125
|
+
|
|
126
|
+
### 2. Temporal Limitations
|
|
127
|
+
|
|
128
|
+
#### Truth Stability Assumption
|
|
129
|
+
- **Issue**: Time decay assumes older content is less relevant
|
|
130
|
+
- **Problem**: Some domains have stable truths (mathematics, history)
|
|
131
|
+
- **Example**: A 10-year-old explanation of quicksort is still valid
|
|
132
|
+
- **Impact**: Inappropriate for domains with stable knowledge
|
|
133
|
+
- **Mitigation**: Configurable decay rates, domain-specific policies
|
|
134
|
+
|
|
135
|
+
#### Timestamp Reliability
|
|
136
|
+
- **Issue**: Relies on file modification times or explicit timestamps
|
|
137
|
+
- **Problem**: Copied/migrated files may have incorrect timestamps
|
|
138
|
+
- **Impact**: Incorrect recency judgments
|
|
139
|
+
- **Mitigation**: Manual timestamp overrides, ingestion date tracking
|
|
140
|
+
|
|
141
|
+
### 3. Scale Limitations
|
|
142
|
+
|
|
143
|
+
#### Single-Node Architecture (Current)
|
|
144
|
+
- **Issue**: Current implementation assumes single-server deployment
|
|
145
|
+
- **Limitation**: Limited to documents that fit in available storage
|
|
146
|
+
- **Impact**: May not scale to massive corporate corpora
|
|
147
|
+
- **Future Work**: Distributed storage and processing
|
|
148
|
+
|
|
149
|
+
#### Query Performance
|
|
150
|
+
- **Issue**: Linear search over all chunks as corpus grows
|
|
151
|
+
- **Impact**: Slower response times with large corpora
|
|
152
|
+
- **Mitigation**: Indexing, caching strategies (future work)
|
|
153
|
+
|
|
154
|
+
### 4. Language and Format Limitations
|
|
155
|
+
|
|
156
|
+
#### Supported Formats
|
|
157
|
+
- **Current**: PDF, HTML, Python/JavaScript source code
|
|
158
|
+
- **Limitation**: No support for DOCX, PowerPoint, images, videos
|
|
159
|
+
- **Impact**: Cannot ingest all document types
|
|
160
|
+
- **Workaround**: Convert to supported formats
|
|
161
|
+
- **Future Work**: Additional format processors
|
|
162
|
+
|
|
163
|
+
#### Language Support
|
|
164
|
+
- **Current**: Optimized for English text
|
|
165
|
+
- **Limitation**: Non-English text may have suboptimal processing
|
|
166
|
+
- **Impact**: Reduced effectiveness for international deployments
|
|
167
|
+
- **Future Work**: Multi-language support, language detection
|
|
168
|
+
|
|
169
|
+
### 5. Integration Limitations
|
|
170
|
+
|
|
171
|
+
#### No Native Vector Database
|
|
172
|
+
- **Current**: Simple in-memory or file-based storage
|
|
173
|
+
- **Limitation**: No optimized vector similarity search
|
|
174
|
+
- **Impact**: May be slower than specialized solutions at scale
|
|
175
|
+
- **Future Work**: Optional integrations with Qdrant, Pinecone, Weaviate
|
|
176
|
+
|
|
177
|
+
#### No Built-in LLM Integration
|
|
178
|
+
- **Current**: CaaS is context-serving only, not a complete RAG system
|
|
179
|
+
- **Benefit**: Modular, bring-your-own-LLM
|
|
180
|
+
- **Limitation**: Requires separate LLM infrastructure
|
|
181
|
+
- **Philosophy**: Separation of concerns (context ≠ generation)
|
|
182
|
+
|
|
183
|
+
## Failure Modes and Edge Cases
|
|
184
|
+
|
|
185
|
+
### 1. Heuristic Router Failures
|
|
186
|
+
|
|
187
|
+
#### Ambiguous Queries
|
|
188
|
+
- **Scenario**: Query matches multiple heuristic patterns
|
|
189
|
+
- **Failure Mode**: Falls back to default strategy
|
|
190
|
+
- **Impact**: May not route optimally
|
|
191
|
+
- **Frequency**: Low-medium
|
|
192
|
+
- **Mitigation**: More specific patterns, user feedback loop
|
|
193
|
+
|
|
194
|
+
#### Unseen Query Types
|
|
195
|
+
- **Scenario**: Query type not covered by any heuristic
|
|
196
|
+
- **Failure Mode**: Generic fallback routing
|
|
197
|
+
- **Impact**: Suboptimal results
|
|
198
|
+
- **Frequency**: Medium
|
|
199
|
+
- **Mitigation**: Extensible pattern system, analytics to identify gaps
|
|
200
|
+
|
|
201
|
+
### 2. Pragmatic Truth Conflicts
|
|
202
|
+
|
|
203
|
+
#### Irreconcilable Conflicts
|
|
204
|
+
- **Scenario**: Official docs say X, team says Y, both plausible
|
|
205
|
+
- **Failure Mode**: System highlights conflict but cannot resolve
|
|
206
|
+
- **Impact**: User must manually adjudicate
|
|
207
|
+
- **Frequency**: Low
|
|
208
|
+
- **Philosophy**: Transparent uncertainty is better than false confidence
|
|
209
|
+
|
|
210
|
+
#### Stale Unofficial Information
|
|
211
|
+
- **Scenario**: Slack message from 6 months ago contradicts current docs
|
|
212
|
+
- **Failure Mode**: Time decay may not fully resolve which is current
|
|
213
|
+
- **Impact**: Potentially outdated information surfaced
|
|
214
|
+
- **Frequency**: Low
|
|
215
|
+
- **Mitigation**: Source-specific decay rates
|
|
216
|
+
|
|
217
|
+
### 3. Time Decay Side Effects
|
|
218
|
+
|
|
219
|
+
#### Recent Errors Amplified
|
|
220
|
+
- **Scenario**: Recently ingested document contains errors
|
|
221
|
+
- **Failure Mode**: Error gets high weight due to recency
|
|
222
|
+
- **Impact**: Bad information prioritized
|
|
223
|
+
- **Frequency**: Low
|
|
224
|
+
- **Mitigation**: Document review processes, explicit corrections
|
|
225
|
+
|
|
226
|
+
#### Historical Knowledge Lost
|
|
227
|
+
- **Scenario**: Foundational documents decay over time
|
|
228
|
+
- **Failure Mode**: Core knowledge deprioritized
|
|
229
|
+
- **Impact**: Important background information missing
|
|
230
|
+
- **Frequency**: Medium (in domains with stable knowledge)
|
|
231
|
+
- **Mitigation**: Pin important documents, disable decay for foundations
|
|
232
|
+
|
|
233
|
+
## Hallucination and Accuracy
|
|
234
|
+
|
|
235
|
+
### Important Distinction
|
|
236
|
+
- **CaaS is NOT a generative AI system**
|
|
237
|
+
- **CaaS does NOT generate text or make claims**
|
|
238
|
+
- **CaaS retrieves and ranks existing content**
|
|
239
|
+
|
|
240
|
+
### What CaaS Does
|
|
241
|
+
- Extracts actual text from real documents
|
|
242
|
+
- Ranks and prioritizes based on structure, time, and source
|
|
243
|
+
- Provides transparent citations for all content
|
|
244
|
+
|
|
245
|
+
### What CaaS Cannot Do
|
|
246
|
+
- Synthesize new information not in the corpus
|
|
247
|
+
- Answer questions about events outside the ingested documents
|
|
248
|
+
- Generate creative content
|
|
249
|
+
|
|
250
|
+
### Accuracy Depends On
|
|
251
|
+
1. **Source Quality**: Garbage in, garbage out
|
|
252
|
+
2. **Structure Detection**: Better detection = better ranking
|
|
253
|
+
3. **Weight Tuning**: Appropriate weights for your use case
|
|
254
|
+
4. **Query Matching**: Heuristics must match your query patterns
|
|
255
|
+
|
|
256
|
+
## Responsible Use Guidelines
|
|
257
|
+
|
|
258
|
+
### For Organizations Deploying CaaS
|
|
259
|
+
|
|
260
|
+
1. **Obtain Proper Consents**
|
|
261
|
+
- Ensure rights to process all ingested content
|
|
262
|
+
- Notify employees if processing internal communications
|
|
263
|
+
- Comply with data protection regulations (GDPR, CCPA, etc.)
|
|
264
|
+
|
|
265
|
+
2. **Implement Access Controls**
|
|
266
|
+
- Not all documents should be accessible to all users
|
|
267
|
+
- Implement role-based access controls
|
|
268
|
+
- Audit access to sensitive contexts
|
|
269
|
+
|
|
270
|
+
3. **Monitor for Bias**
|
|
271
|
+
- Regularly review source distribution
|
|
272
|
+
- Check for underrepresented content types
|
|
273
|
+
- Validate time decay appropriateness for your domain
|
|
274
|
+
|
|
275
|
+
4. **Establish Governance**
|
|
276
|
+
- Clear policies on what can be ingested
|
|
277
|
+
- Review processes for document quality
|
|
278
|
+
- Incident response for inaccurate information
|
|
279
|
+
|
|
280
|
+
5. **Provide User Training**
|
|
281
|
+
- Explain system capabilities and limitations
|
|
282
|
+
- Teach users to verify critical information
|
|
283
|
+
- Encourage feedback on result quality
|
|
284
|
+
|
|
285
|
+
### For Developers Extending CaaS
|
|
286
|
+
|
|
287
|
+
1. **Preserve Privacy Protections**
|
|
288
|
+
- Maintain on-premises deployment capability
|
|
289
|
+
- Avoid mandatory external API calls
|
|
290
|
+
- Respect data minimization principles
|
|
291
|
+
|
|
292
|
+
2. **Maintain Transparency**
|
|
293
|
+
- Keep heuristic routing deterministic
|
|
294
|
+
- Provide clear source attribution
|
|
295
|
+
- Document all algorithmic decisions
|
|
296
|
+
|
|
297
|
+
3. **Test for Bias**
|
|
298
|
+
- Evaluate on diverse document sets
|
|
299
|
+
- Check for language/format discrimination
|
|
300
|
+
- Validate across different domains
|
|
301
|
+
|
|
302
|
+
4. **Document Limitations**
|
|
303
|
+
- Be clear about what your extension can/cannot do
|
|
304
|
+
- Provide guidance on appropriate use cases
|
|
305
|
+
- Warn about potential failure modes
|
|
306
|
+
|
|
307
|
+
## Future Ethical Considerations
|
|
308
|
+
|
|
309
|
+
As CaaS evolves, we will continue to address:
|
|
310
|
+
|
|
311
|
+
1. **AI-Generated Content Detection**: How to handle documents that are themselves AI-generated
|
|
312
|
+
2. **Federated Learning**: Privacy-preserving corpus analysis across organizations
|
|
313
|
+
3. **Differential Privacy**: Formal privacy guarantees for sensitive documents
|
|
314
|
+
4. **Fairness Metrics**: Quantitative evaluation of bias in context serving
|
|
315
|
+
5. **Explainable AI**: Even better explanations of ranking decisions
|
|
316
|
+
6. **Red Teaming**: Adversarial testing for misuse scenarios
|
|
317
|
+
|
|
318
|
+
## Reporting Issues
|
|
319
|
+
|
|
320
|
+
If you identify ethical concerns, biases, or limitations not covered here:
|
|
321
|
+
|
|
322
|
+
1. Open a GitHub issue with label `ethics` or `bias`
|
|
323
|
+
2. Provide specific examples and reproduction steps
|
|
324
|
+
3. Suggest potential mitigations if you have ideas
|
|
325
|
+
4. We commit to addressing reports within 7 days
|
|
326
|
+
|
|
327
|
+
## Conclusion
|
|
328
|
+
|
|
329
|
+
Context-as-a-Service is a tool, and like all tools, it can be used responsibly or irresponsibly. We've designed CaaS with transparency, privacy, and efficiency as core principles. However, the ultimate responsibility for ethical deployment lies with the organizations and individuals using the system.
|
|
330
|
+
|
|
331
|
+
**Use CaaS to empower, not surveil. To inform, not manipulate. To augment human intelligence, not replace human judgment.**
|
|
332
|
+
|
|
333
|
+
---
|
|
334
|
+
|
|
335
|
+
*Last Updated: January 2026*
|
|
336
|
+
*Version: 0.1.0*
|