agentops-cockpit 0.5.0__tar.gz → 0.9.5__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (150) hide show
  1. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/Dockerfile +4 -1
  2. agentops_cockpit-0.9.5/Makefile +134 -0
  3. agentops_cockpit-0.9.5/PKG-INFO +246 -0
  4. agentops_cockpit-0.9.5/README.md +222 -0
  5. {agentops_cockpit-0.5.0/public → agentops_cockpit-0.9.5/docs}/A2A_GUIDE.md +1 -1
  6. {agentops_cockpit-0.5.0/public → agentops_cockpit-0.9.5/docs}/BE_INTEGRATION_GUIDE.md +1 -1
  7. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5/docs}/DEPLOYMENT.md +24 -6
  8. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/package.json +1 -1
  9. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5/public}/A2A_GUIDE.md +1 -1
  10. agentops_cockpit-0.9.5/public/AGENT_OPS_STORY.md +48 -0
  11. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5/public}/BE_INTEGRATION_GUIDE.md +1 -1
  12. agentops_cockpit-0.9.5/public/CHANGELOG.md +23 -0
  13. agentops_cockpit-0.9.5/public/CLI_COMMANDS.md +84 -0
  14. agentops_cockpit-0.9.5/public/COCKPIT_GUIDE.md +32 -0
  15. agentops_cockpit-0.9.5/public/CONTRIBUTING.md +25 -0
  16. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/public/DEPLOYMENT.md +23 -5
  17. agentops_cockpit-0.9.5/public/DEVELOPMENT.md +35 -0
  18. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/public/GEMINI.md +4 -4
  19. agentops_cockpit-0.9.5/public/GETTING_STARTED.md +67 -0
  20. agentops_cockpit-0.9.5/public/GOOGLE_ARCHITECTURE.md +58 -0
  21. agentops_cockpit-0.9.5/public/GOVERNANCE_GUIDE.md +49 -0
  22. agentops_cockpit-0.9.5/public/GUIDE_ARCH_REVIEW.md +37 -0
  23. agentops_cockpit-0.9.5/public/GUIDE_AUDIT.md +42 -0
  24. agentops_cockpit-0.9.5/public/GUIDE_DEPLOYMENT.md +50 -0
  25. agentops_cockpit-0.9.5/public/GUIDE_LOAD_TEST.md +41 -0
  26. agentops_cockpit-0.9.5/public/GUIDE_MCP.md +58 -0
  27. agentops_cockpit-0.9.5/public/GUIDE_OPTIMIZER.md +36 -0
  28. agentops_cockpit-0.9.5/public/GUIDE_POLICY.md +50 -0
  29. agentops_cockpit-0.9.5/public/GUIDE_RED_TEAM.md +36 -0
  30. agentops_cockpit-0.9.5/public/GUIDE_RELIABILITY.md +41 -0
  31. agentops_cockpit-0.9.5/public/LIMITATIONS.md +48 -0
  32. agentops_cockpit-0.9.5/public/OPTIMIZATION_GUIDE.md +40 -0
  33. agentops_cockpit-0.9.5/public/PRD.md +101 -0
  34. agentops_cockpit-0.9.5/public/PRODUCTION_CHECKLIST.md +45 -0
  35. agentops_cockpit-0.5.0/PKG-INFO → agentops_cockpit-0.9.5/public/README.md +36 -29
  36. agentops_cockpit-0.9.5/public/ROADMAP.md +56 -0
  37. agentops_cockpit-0.9.5/public/SECURITY_GUIDE.md +34 -0
  38. agentops_cockpit-0.9.5/public/assets/ecosystem.png +0 -0
  39. agentops_cockpit-0.9.5/public/assets/trinity.png +0 -0
  40. agentops_cockpit-0.9.5/public/assets/workflow.png +0 -0
  41. agentops_cockpit-0.9.5/public/kokpi_branded.jpg +0 -0
  42. agentops_cockpit-0.9.5/public/kokpi_kun.png +0 -0
  43. agentops_cockpit-0.9.5/public/kokpi_simplistic.png +0 -0
  44. agentops_cockpit-0.9.5/public/sample-report.html +931 -0
  45. agentops_cockpit-0.9.5/public/sample-report.md +262 -0
  46. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/pyproject.toml +8 -2
  47. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/App.tsx +7 -3
  48. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/agent.py +57 -7
  49. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/cli/main.py +104 -11
  50. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/eval/load_test.py +15 -10
  51. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/eval/quality_climber.py +23 -5
  52. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/eval/red_team.py +5 -4
  53. agentops_cockpit-0.9.5/src/agent_ops_cockpit/mcp_server.py +132 -0
  54. agentops_cockpit-0.9.5/src/agent_ops_cockpit/ops/arch_review.py +166 -0
  55. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/ops/cost_optimizer.py +0 -1
  56. agentops_cockpit-0.9.5/src/agent_ops_cockpit/ops/evidence_bridge.py +132 -0
  57. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/ops/frameworks.py +79 -10
  58. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/ops/mcp_hub.py +1 -2
  59. agentops_cockpit-0.9.5/src/agent_ops_cockpit/ops/orchestrator.py +417 -0
  60. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/ops/pii_scrubber.py +1 -1
  61. agentops_cockpit-0.9.5/src/agent_ops_cockpit/ops/policies.json +26 -0
  62. agentops_cockpit-0.9.5/src/agent_ops_cockpit/ops/policy_engine.py +85 -0
  63. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/ops/reliability.py +28 -7
  64. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/ops/secret_scanner.py +10 -3
  65. agentops_cockpit-0.9.5/src/agent_ops_cockpit/ops/ui_auditor.py +115 -0
  66. agentops_cockpit-0.9.5/src/agent_ops_cockpit/ops/watcher.py +138 -0
  67. agentops_cockpit-0.9.5/src/agent_ops_cockpit/ops/watchlist.json +88 -0
  68. agentops_cockpit-0.9.5/src/agent_ops_cockpit/optimizer.py +485 -0
  69. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/shadow/router.py +7 -8
  70. agentops_cockpit-0.9.5/src/agent_ops_cockpit/system_prompt.md +13 -0
  71. agentops_cockpit-0.9.5/src/agent_ops_cockpit/tests/golden_set.json +52 -0
  72. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/tests/test_agent.py +16 -9
  73. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/tests/test_arch_review.py +1 -3
  74. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/tests/test_frameworks.py +1 -3
  75. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/tests/test_optimizer.py +1 -2
  76. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/tests/test_quality_climber.py +1 -3
  77. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/tests/test_red_team.py +6 -4
  78. {agentops_cockpit-0.5.0/src/backend → agentops_cockpit-0.9.5/src/agent_ops_cockpit}/tests/test_secret_scanner.py +1 -3
  79. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/components/Home.tsx +396 -38
  80. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/components/OpsDashboard.tsx +68 -12
  81. agentops_cockpit-0.9.5/src/components/ReportSamples.tsx +354 -0
  82. agentops_cockpit-0.9.5/src/docs/DocHome.tsx +261 -0
  83. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/docs/DocLayout.tsx +178 -30
  84. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/index.css +108 -0
  85. agentops_cockpit-0.5.0/Makefile +0 -100
  86. agentops_cockpit-0.5.0/README.md +0 -153
  87. agentops_cockpit-0.5.0/public/AGENT_OPS_STORY.md +0 -38
  88. agentops_cockpit-0.5.0/public/CLI_COMMANDS.md +0 -85
  89. agentops_cockpit-0.5.0/public/DEVELOPMENT.md +0 -55
  90. agentops_cockpit-0.5.0/public/GETTING_STARTED.md +0 -39
  91. agentops_cockpit-0.5.0/public/README.md +0 -151
  92. agentops_cockpit-0.5.0/src/agent_ops_cockpit/mcp_server.py +0 -98
  93. agentops_cockpit-0.5.0/src/agent_ops_cockpit/ops/arch_review.py +0 -105
  94. agentops_cockpit-0.5.0/src/agent_ops_cockpit/ops/cost_optimizer.py +0 -40
  95. agentops_cockpit-0.5.0/src/agent_ops_cockpit/ops/orchestrator.py +0 -103
  96. agentops_cockpit-0.5.0/src/agent_ops_cockpit/ops/pii_scrubber.py +0 -47
  97. agentops_cockpit-0.5.0/src/agent_ops_cockpit/ops/reliability.py +0 -50
  98. agentops_cockpit-0.5.0/src/agent_ops_cockpit/ops/secret_scanner.py +0 -75
  99. agentops_cockpit-0.5.0/src/agent_ops_cockpit/ops/ui_auditor.py +0 -74
  100. agentops_cockpit-0.5.0/src/agent_ops_cockpit/optimizer.py +0 -177
  101. agentops_cockpit-0.5.0/src/agent_ops_cockpit/shadow/router.py +0 -75
  102. agentops_cockpit-0.5.0/src/backend/__init__.py +0 -0
  103. agentops_cockpit-0.5.0/src/backend/cache/semantic_cache.py +0 -59
  104. agentops_cockpit-0.5.0/src/backend/cost_control.py +0 -53
  105. agentops_cockpit-0.5.0/src/backend/eval/__init__.py +0 -1
  106. agentops_cockpit-0.5.0/src/backend/eval/load_test.py +0 -91
  107. agentops_cockpit-0.5.0/src/backend/eval/quality_climber.py +0 -129
  108. agentops_cockpit-0.5.0/src/backend/eval/red_team.py +0 -77
  109. agentops_cockpit-0.5.0/src/backend/ops/__init__.py +0 -1
  110. agentops_cockpit-0.5.0/src/backend/ops/arch_review.py +0 -105
  111. agentops_cockpit-0.5.0/src/backend/ops/evidence.py +0 -25
  112. agentops_cockpit-0.5.0/src/backend/ops/frameworks.py +0 -407
  113. agentops_cockpit-0.5.0/src/backend/ops/mcp_hub.py +0 -80
  114. agentops_cockpit-0.5.0/src/backend/ops/memory_optimizer.py +0 -44
  115. agentops_cockpit-0.5.0/src/backend/ops/orchestrator.py +0 -113
  116. agentops_cockpit-0.5.0/src/backend/ops/swarm.py +0 -71
  117. agentops_cockpit-0.5.0/src/backend/ops/ui_auditor.py +0 -74
  118. agentops_cockpit-0.5.0/src/backend/optimizer.py +0 -177
  119. agentops_cockpit-0.5.0/src/components/Playground.tsx +0 -267
  120. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/.gitignore +0 -0
  121. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/LICENSE +0 -0
  122. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/firebase.json +0 -0
  123. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/index.html +0 -0
  124. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/public/cicd-workflow.png +0 -0
  125. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/public/diagrams/agentic-stack.png +0 -0
  126. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/public/diagrams/value-proposition.png +0 -0
  127. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/public/hero.png +0 -0
  128. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/public/og-image.png +0 -0
  129. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/a2ui/A2UIRenderer.tsx +0 -0
  130. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/a2ui/components/index.tsx +0 -0
  131. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/a2ui/components/lit-component-example.ts +0 -0
  132. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/a2ui/types.ts +0 -0
  133. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/__init__.py +0 -0
  134. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/cache/__init__.py +0 -0
  135. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/cache/semantic_cache.py +0 -0
  136. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/cli/__init__.py +0 -0
  137. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/cost_control.py +0 -0
  138. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/eval/__init__.py +0 -0
  139. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/ops/__init__.py +0 -0
  140. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/ops/evidence.py +0 -0
  141. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/ops/memory_optimizer.py +0 -0
  142. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/ops/swarm.py +0 -0
  143. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/agent_ops_cockpit/shadow/__init__.py +0 -0
  144. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/components/FlightRecorder.tsx +0 -0
  145. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/components/ThemeToggle.tsx +0 -0
  146. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/docs/DocPage.tsx +0 -0
  147. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/knowledge/example_policy.txt +0 -0
  148. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/src/main.tsx +0 -0
  149. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/tsconfig.json +0 -0
  150. {agentops_cockpit-0.5.0 → agentops_cockpit-0.9.5}/vite.config.ts +0 -0
@@ -11,4 +11,7 @@ COPY src/ ./src/
11
11
  # Set PYTHONPATH so it can find the modules
12
12
  ENV PYTHONPATH=/app/src
13
13
 
14
- CMD ["python", "src/backend/agent.py"]
14
+ # Run mandatory AgentOps Audit during build
15
+ RUN python src/agent_ops_cockpit/ops/orchestrator.py --mode quick
16
+
17
+ CMD ["python", "src/agent_ops_cockpit/agent.py"]
@@ -0,0 +1,134 @@
1
+ # --- A2UI Starter Makefile ---
2
+
3
+ # Project Variables
4
+ PYTHON = $(shell if [ -f "./.venv/bin/python3.14" ]; then echo "./.venv/bin/python3.14"; elif [ -d ".venv" ]; then echo "./.venv/bin/python"; else echo "python3"; fi)
5
+ PROJECT_ID ?= $(shell gcloud config get-value project)
6
+ REGION ?= us-central1
7
+ SERVICE_NAME = agent-ops-backend
8
+ IMAGE_TAG = us-central1-docker.pkg.dev/$(PROJECT_ID)/agent-repo/$(SERVICE_NAME):latest
9
+
10
+ .PHONY: help dev build deploy-cloud-run deploy-firebase deploy-gke audit deploy-prod scan-secrets ui-audit audit-all watch
11
+
12
+ help:
13
+ @echo "Available commands:"
14
+ @echo " make dev - Start local development server"
15
+ @echo " make audit - [MASTER] Quick Safe-Build (uvx agentops-cockpit report --mode quick)"
16
+ @echo " make audit-deep - [MASTER] Deep System Audit (uvx agentops-cockpit report --mode deep)"
17
+ @echo " make optimizer-audit - [CODE] Quick code audit (uvx agentops-cockpit audit --quick)"
18
+ @echo " make optimizer-audit-deep - [CODE] Deep code audit (uvx agentops-cockpit audit)"
19
+ @echo " make reliability - Run unit tests and regression suite"
20
+ @echo " make diagnose - [DevEx] System health check and env diagnosis"
21
+ @echo " make email-report - [GOV] Email the latest Persona-Approved report"
22
+ @echo " make red-team - Run adversarial security audit"
23
+ @echo " make load-test - Run base load test"
24
+ @echo " make deploy-prod - Deploy to production (All Audits -> Cloud Run + Firebase)"
25
+ @echo " make deploy-cloud-run - Deploy to Google Cloud Run"
26
+ @echo " make deploy-firebase - Deploy to Firebase Hosting"
27
+ @echo " make watch - Track ecosystem updates (ADK, A2A, LangChain, etc.)"
28
+
29
+
30
+
31
+
32
+ dev:
33
+ npm run dev
34
+
35
+ build:
36
+ npm run build
37
+
38
+ # 🏁 Master Audit: Safe-Build (Essential for dev velocity)
39
+ audit:
40
+ @$(PYTHON) src/agent_ops_cockpit/ops/orchestrator.py --mode quick
41
+
42
+ # 🚀 Deep Master Audit: Full benchmarks and stress tests
43
+ audit-deep:
44
+ @$(PYTHON) src/agent_ops_cockpit/ops/orchestrator.py --mode deep
45
+
46
+ # 🌐 Global Audit: Point the Cockpit at an external repository
47
+ # Usage: make audit-all TARGET=/path/to/your/agent
48
+ TARGET ?= .
49
+ audit-all:
50
+ @$(PYTHON) src/agent_ops_cockpit/ops/orchestrator.py --mode quick --path $(TARGET)
51
+
52
+ # 🛡️ Reliability: Unit tests and regression suite
53
+ reliability:
54
+ @$(PYTHON) src/agent_ops_cockpit/ops/reliability.py
55
+
56
+ # 🩺 Diagnose: DevEx system check
57
+ diagnose:
58
+ @PYTHONPATH=src $(PYTHON) -m agent_ops_cockpit.cli.main diagnose
59
+
60
+ # 🔍 The Optimizer: Audit specific agent file for code-level waste
61
+ optimizer-audit:
62
+ @$(PYTHON) src/agent_ops_cockpit/optimizer.py src/agent_ops_cockpit/agent.py --quick
63
+
64
+ # 🔍 Deep Optimizer: Fetch live SDK evidence
65
+ optimizer-audit-deep:
66
+ @$(PYTHON) src/agent_ops_cockpit/optimizer.py src/agent_ops_cockpit/agent.py
67
+
68
+ # 🏛️ Architecture: Design review against Google Well-Architected Framework
69
+ arch-review:
70
+ @$(PYTHON) src/agent_ops_cockpit/ops/arch_review.py
71
+
72
+ # 🧗 Quality: Iterative Hill Climbing optimization
73
+ quality-baseline:
74
+ @$(PYTHON) src/agent_ops_cockpit/eval/quality_climber.py climb
75
+
76
+ # 🧪 Secrets: Scan for hardcoded credentials
77
+ scan-secrets:
78
+ @$(PYTHON) src/agent_ops_cockpit/ops/secret_scanner.py .
79
+
80
+ # 🎨 UI/UX: Face Auditor for frontend quality
81
+ ui-audit:
82
+ @$(PYTHON) src/agent_ops_cockpit/ops/ui_auditor.py src
83
+
84
+ # 🔥 Red Team: Unleash self-hacking security audit
85
+
86
+ red-team:
87
+ @$(PYTHON) src/agent_ops_cockpit/eval/red_team.py src/agent_ops_cockpit/agent.py
88
+
89
+ # ⚡ Load Test: Stress test your agent endpoint (Usage: make load_test REQUESTS=100 CONCURRENCY=10)
90
+ REQUESTS ?= 50
91
+
92
+ CONCURRENCY ?= 5
93
+ URL ?= http://localhost:8000/agent/query?q=healthcheck
94
+
95
+ load_test:
96
+ @$(PYTHON) src/agent_ops_cockpit/eval/load_test.py run --url $(URL) --requests $(REQUESTS) --concurrency $(CONCURRENCY)
97
+
98
+ # 🚀 Production: The Vercel-style 1-click deploy (using Quick Audit for speed)
99
+ deploy-prod: audit build
100
+
101
+ @echo "📦 Containerizing and deploying to Cloud Run..."
102
+ gcloud run deploy $(SERVICE_NAME) --source . --region $(REGION) --allow-unauthenticated --port 80
103
+ @echo "🔥 Deploying frontend to Firebase..."
104
+ firebase deploy --only hosting
105
+
106
+ # 🚀 Cloud Run: The fastest way to production
107
+ deploy-cloud-run:
108
+ gcloud run deploy $(SERVICE_NAME) --source . --region $(REGION) --allow-unauthenticated --port 80
109
+
110
+ # 🔥 Firebase: Optimized for frontend hosting
111
+ deploy-firebase: build
112
+ firebase deploy --only hosting
113
+
114
+ # ☸️ GKE: Enterprise container orchestration
115
+ deploy-gke:
116
+ docker build -t $(IMAGE_TAG) .
117
+ docker push $(IMAGE_TAG)
118
+ @echo "Updating deployment.yaml..."
119
+ sed -i '' 's|image: .*|image: $(IMAGE_TAG)|' deployment.yaml || true
120
+ kubectl apply -f deployment.yaml || echo "No deployment.yaml found. Please create one based on DEPLOYMENT.md"
121
+
122
+ # 📡 Watch: Ecosystem sync check
123
+ watch:
124
+ @$(PYTHON) src/agent_ops_cockpit/ops/watcher.py
125
+
126
+ # 🔌 MCP: Start the Model Context Protocol server
127
+ mcp-serve:
128
+ @$(PYTHON) src/agent_ops_cockpit/mcp_server.py
129
+
130
+ # 📧 Reporting: Email the latest audit results
131
+ email-report:
132
+ @read -p "Enter recipient email: " email; \
133
+ $(PYTHON) -m agent_ops_cockpit.cli.main email-report $$email
134
+
@@ -0,0 +1,246 @@
1
+ Metadata-Version: 2.4
2
+ Name: agentops-cockpit
3
+ Version: 0.9.5
4
+ Summary: Production-grade Agent Operations (AgentOps) Platform
5
+ Project-URL: Homepage, https://github.com/enriquekalven/agent-ops-cockpit
6
+ Project-URL: Bug Tracker, https://github.com/enriquekalven/agent-ops-cockpit/issues
7
+ Author-email: Enrique <enrique@example.com>
8
+ License-File: LICENSE
9
+ Classifier: License :: OSI Approved :: MIT License
10
+ Classifier: Operating System :: OS Independent
11
+ Classifier: Programming Language :: Python :: 3
12
+ Requires-Python: >=3.10
13
+ Requires-Dist: aiohttp>=3.9.0
14
+ Requires-Dist: fastapi>=0.100.0
15
+ Requires-Dist: gitpython>=3.1.0
16
+ Requires-Dist: mcp>=0.1.0
17
+ Requires-Dist: packaging>=23.0
18
+ Requires-Dist: pydantic>=2.0.0
19
+ Requires-Dist: rich>=13.0.0
20
+ Requires-Dist: tenacity>=8.0.0
21
+ Requires-Dist: typer>=0.9.0
22
+ Requires-Dist: uvicorn>=0.20.0
23
+ Description-Content-Type: text/markdown
24
+
25
+ # 🕹️ AgentOps Cockpit
26
+
27
+ <div align="center">
28
+ <img src="public/assets/trinity.png" alt="AgentOps Cockpit Trinity" width="100%" />
29
+ </div>
30
+
31
+ <div align="center">
32
+ <br />
33
+ <a href="https://agent-cockpit.web.app" target="_blank"><strong>🌐 Official Website & Live Demo</strong></a>
34
+ <br /><br />
35
+ <a href="https://deploy.cloud.google.com?repo=https://github.com/enriquekalven/agent-cockpit">
36
+ <img src="https://deploy.cloud.google.com/button.svg" alt="Deploy to Google Cloud" />
37
+ </a>
38
+ <br />
39
+ <br />
40
+ <img src="https://img.shields.io/github/stars/enriquekalven/agent-cockpit?style=for-the-badge&color=ffd700" alt="GitHub Stars" />
41
+ <img src="https://img.shields.io/github/license/enriquekalven/agent-cockpit?style=for-the-badge&color=007bff" alt="License" />
42
+ <img src="https://img.shields.io/badge/Google-Well--Architected-4285F4?style=for-the-badge&logo=google-cloud" alt="Google Well-Architected" />
43
+ <img src="https://img.shields.io/badge/A2A_Standard-Enabled-10b981?style=for-the-badge" alt="A2A Standard" />
44
+ </div>
45
+
46
+ <br />
47
+
48
+ <div align="center">
49
+ <h3>"Infrastructure gives you the pipes. We give you the Intelligence."</h3>
50
+ <p>The developer distribution for building, optimizing, and securing AI agents on Google Cloud.</p>
51
+ </div>
52
+
53
+ ---
54
+
55
+ ## 📽️ The Mission
56
+ Most AI agent templates stop at a single Python file and an API key. **The AgentOps Cockpit** is for developers moving into production. It provides framework-agnostic governance, safety, and cost guardrails for the entire agentic ecosystem.
57
+
58
+ - **Governance-as-Code**: Audit your agent against [Google Well-Architected](/docs/google-architecture) best practices with the **Evidence Bridge**—real-time citations for architectural integrity.
59
+ - **SME Persona Audits**: Parallelized review of your codebase by automated "Principal SMEs" across FinOps, SecOps, and Architecture.
60
+ - **Agentic Trinity**: Dedicated layers for the Engine (Logic), Face (UX), and Cockpit (Ops).
61
+ - **A2A Connectivity**: Implements the [Agent-to-Agent Transmission Standard](/A2A_GUIDE.md) for secure swarm orchestration.
62
+ - **MCP Native**: Registration as a [Model Context Protocol](https://modelcontextprotocol.io) server for 1P/2P/3P tool consumption.
63
+
64
+ ---
65
+
66
+ ## 🏗️ The Agentic Trinity
67
+ We divide the complexity of production agents into three focused pillars:
68
+
69
+ ```mermaid
70
+ graph LR
71
+ subgraph Trinity [The Agentic Trinity]
72
+ E(The Engine: Reasoning)
73
+ F(The Face: Interface)
74
+ C(The Cockpit: Operations)
75
+ end
76
+ E <--> C
77
+ F <--> C
78
+ E <--> F
79
+ style Trinity fill:#f9f9f9,stroke:#333,stroke-width:2px
80
+ ```
81
+
82
+ - **⚙️ The Engine**: The reasoning core. Built with **ADK**, FastAPI, and Vertex AI.
83
+ - **🎭 The Face**: The user experience. Adaptive UI surfaces and **GenUI** standards via the A2UI spec.
84
+ - **🕹️ The Cockpit**: The operational brain. Cost control, semantic caching, shadow routing, and adversarial audits.
85
+
86
+ <div align="center">
87
+ <img src="public/assets/ecosystem.png" alt="Ecosystem Integrations" width="100%" />
88
+ </div>
89
+
90
+ ---
91
+
92
+ ## 🌐 Framework Agnostic Governance
93
+ The Cockpit isn't just for ADK. It provides **Best Practices as Code** across all major agentic frameworks:
94
+
95
+ <div align="center">
96
+ <img src="https://img.shields.io/badge/OpenAI_Agentkit-412991?style=for-the-badge&logo=openai" alt="OpenAI Agentkit" />
97
+ <img src="https://img.shields.io/badge/Anthropic_Claude-D97757?style=for-the-badge&logo=anthropic" alt="Anthropic" />
98
+ <img src="https://img.shields.io/badge/Microsoft_AutoGen-0078d4?style=for-the-badge&logo=microsoft" alt="Microsoft" />
99
+ <img src="https://img.shields.io/badge/AWS_Bedrock-FF9900?style=for-the-badge&logo=amazon-aws" alt="AWS" />
100
+ <img src="https://img.shields.io/badge/CopilotKit.ai-6366f1?style=for-the-badge" alt="CopilotKit" />
101
+ <img src="https://img.shields.io/badge/LangChain-1C3C3C?style=for-the-badge" alt="LangChain" />
102
+ <img src="https://img.shields.io/badge/ADK-4285F4?style=for-the-badge&logo=google-cloud" alt="ADK" />
103
+ <img src="public/assets/workflow.png" alt="Operational Workflow" width="100%" />
104
+ </div>
105
+
106
+ ## 🛠️ Operational Flow
107
+
108
+ ```mermaid
109
+ sequenceDiagram
110
+ participant U as User
111
+ participant C as Cockpit
112
+ participant E as Engine
113
+ participant F as Face
114
+
115
+ U->>C: Prompt / Input
116
+ C->>C: Policy Audit (RFC-307)
117
+ C->>E: Execute Logic / Tools
118
+ E->>C: Action Proposals
119
+ C->>E: Approve (HITL)
120
+ E->>F: GenUI Metadata
121
+ F->>U: Reactive Surface (A2UI)
122
+ ```
123
+
124
+ <br />
125
+
126
+ <div align="center">
127
+ <img src="https://img.shields.io/badge/Python-3776AB?style=flat-square&logo=python&logoColor=white" alt="Python" />
128
+ <img src="https://img.shields.io/badge/Go-00ADD8?style=flat-square&logo=go&logoColor=white" alt="Go" />
129
+ <img src="https://img.shields.io/badge/NodeJS-339933?style=flat-square&logo=node.js&logoColor=white" alt="NodeJS" />
130
+ <img src="https://img.shields.io/badge/TypeScript-3178C6?style=flat-square&logo=typescript&logoColor=white" alt="TypeScript" />
131
+ <img src="https://img.shields.io/badge/Streamlit-FF4B4B?style=flat-square&logo=streamlit&logoColor=white" alt="Streamlit" />
132
+ <img src="https://img.shields.io/badge/Angular-DD0031?style=flat-square&logo=angular&logoColor=white" alt="Angular" />
133
+ <img src="https://img.shields.io/badge/Lit-324FFF?style=flat-square&logo=lit&logoColor=white" alt="Lit" />
134
+ </div>
135
+
136
+ Whether you are building a swarm in **CrewAI**, a Go-based high-perf engine, or a **Streamlit** dashboard, the Cockpit ensures your agent maps to the **Google Well-Architected Framework**.
137
+
138
+
139
+ ---
140
+
141
+ ## 🚀 Key Innovation: The "Intelligence" Layer
142
+
143
+ ### 🛡️ Red Team Auditor (Self-Hacking)
144
+ Don't wait for your users to find prompt injections. Use the built-in Adversarial Evaluator to launch self-attacks against your agent, testing for PII leaks, instruction overrides, and safety filter bypasses.
145
+
146
+ ### 🧠 Hive Mind (Semantic Caching)
147
+ **Reduce LLM costs by up to 40%.** The Hive Mind checks for semantically similar queries in 10ms, serving cached answers for common questions without calling the LLM.
148
+
149
+ ### 🏛️ Arch Review & Framework Detection
150
+ Every agent in the cockpit is graded against a framework-aware checklist. The Cockpit intelligently detects your stack—**Google ADK**, **OpenAI Agentkit**, **Anthropic Claude**, **Microsoft AutoGen/Semantic Kernel**, **AWS Bedrock Agents**, or **CopilotKit**—and runs a tailored audit against corresponding production standards. Use `make arch-review` to verify your **Governance-as-Code**.
151
+
152
+ ### 🕹️ MCP Connectivity Hub (Model Context Protocol)
153
+ Stop building one-off tool integrations. The Cockpit provides a unified hub for **MCP Servers**. Connect to Google Search, Slack, or your internal databases via the standardized Model Context Protocol for secure, audited tool execution. Start the server with `make mcp-serve`.
154
+
155
+ ### 🗄️ Situational Database Audits
156
+ The Cockpit now performs platform-specific performance and security audits for:
157
+ - **AlloyDB**: Optimizes for the **Columnar Engine** (100x query speedup).
158
+ - **Pinecone**: Suggests **gRPC** and **Namespace Isolation** for high-perf RAG.
159
+ - **BigQuery**: Suggests **BQ Vector Search** for serverless, cost-effective grounding.
160
+ - **Cloud SQL**: Enforces **IAM-based authentication** via the official Python Connector.
161
+
162
+ ### 🧗 Quality Hill Climbing (ADK Evaluation)
163
+ Following **Google ADK Evaluation** best practices, the Cockpit provides an iterative optimization loop. `make quality-baseline` runs your agent against a "Golden Dataset" using **LLM-as-a-Judge** scoring (Response Match & Tool Trajectory), climbing the quality curve until production-grade fidelity is reached.
164
+
165
+ ### 🛑 Mandatory Governance Enforcement (NEW)
166
+ The Cockpit now acts as a mandatory gate for production.
167
+ - **Blocking CI/CD**: GitHub Actions now fail if **High Impact** cost issues or **Red Team** security vulnerabilities are detected.
168
+ - **Build-Time Audit**: The `Dockerfile` includes a mandatory `RUN` audit step. If your agent is not "Well-Architected," the container image will fail to build.
169
+
170
+ ---
171
+
172
+ ## ⌨️ Quick Start
173
+
174
+ The Cockpit is available as a first-class CLI on PyPI.
175
+
176
+ ```bash
177
+ # 1. Install the Cockpit globally
178
+ pip install agentops-cockpit
179
+
180
+ # 2. Run Global Audit (Produces unified report)
181
+ agent-ops report --mode quick # ⚡ Quick Safe-Build
182
+ agent-ops report --mode deep # 🚀 Full System Audit
183
+
184
+ # 3. Guardrail Policy Audit (RFC-307)
185
+ agent-ops policy-audit --text "How to make a bomb?"
186
+
187
+ # 4. Global Scaffolding
188
+ agent-ops-cockpit create <name> --ui a2ui
189
+ ```
190
+
191
+ ### 🔍 Agent Optimizer v2 (Situational Intelligence)
192
+ The Cockpit doesn't just look for generic waste. It now performs **Triple-State Analysis**:
193
+ - **Legacy Workarounds**: Suggests situational fixes for older SDK versions (e.g., manual prompt pruning).
194
+ - **Modernization Paths**: Highlights native performance gains (e.g., 90% cost reduction via Context Caching) available in latest SDKs.
195
+ - **Conflict Guard**: Real-time cross-package validation to prevent architectural deadlocks (e.g., CrewAI vs LangGraph state loops).
196
+
197
+ ### ⚡ Quick-Safe Build (12x Faster Loops)
198
+ Development velocity shouldn't sacrifice safety. The new `--quick` mode in the auditor reduces check latency from **1.8s to 0.15s**, providing sub-second feedback while maintaining the integrity of the Conflict Guard and Architecture Review.
199
+
200
+ ---
201
+
202
+ ### 🧑‍💼 Principal SME Persona Approvals
203
+ The Cockpit now features a **Multi-Persona Governance Board**. Every audit result is framed through the lens of a Principal Engineer in that domain (Security, Legal, FinOps, UX), ensuring your agent is compliant with organizational standards.
204
+
205
+ ### 📄 Export & Reporting
206
+ * **HTML/PDF Export**: Every audit automatically generates `cockpit_report.html`, a premium, printable report ready for PDF export.
207
+ * **Email Reports**: Send audit results directly to stakeholders via the CLI.
208
+
209
+ ---
210
+
211
+ ## 📊 Local Development
212
+ The Cockpit provides a unified "Mission Control" to evaluate your agents instantly.
213
+
214
+ ```bash
215
+ make audit # 🕹️ Run Master Audit (Persona Approved)
216
+ make audit-deep # 🚀 Run Deep Audit (Full SME Verdicts)
217
+ make email-report # 📧 Email the latest result to a stakeholder
218
+ make diagnose # 🩺 Run environment health check
219
+ make optimizer-audit # 🔍 Run Optimizer on specific agent files
220
+ make reliability # 🛡️ Run unit tests and regression suite
221
+ make dev # Start the local Engine + Face stack
222
+ make arch-review # 🏛️ Run the Google Well-Architected design review
223
+ make quality-baseline # 🧗 Run iterative 'Hill Climbing' quality audit
224
+ make red-team # Execute a white-hat security audit
225
+ make deploy-prod # 🚀 1-click deploy to Google Cloud
226
+ ```
227
+
228
+ ---
229
+
230
+ ## 🧭 Roadmap
231
+ - [x] **One-Click GitHub Action**: Automated governance audits on every PR.
232
+ - [x] **Mandatory Build Gates**: Blocking CI/CD and Container audits for production safety.
233
+ - [x] **Multi-Agent Orchestrator**: Standardized A2A Swarm/Coordinator patterns.
234
+ - [ ] **Visual Mission Control**: Real-time cockpit observability dashboard.
235
+
236
+ [View full roadmap →](/ROADMAP.md)
237
+
238
+ ---
239
+
240
+ ## 🤝 Community
241
+ - **Star this repo** to help us build the future of AgentOps.
242
+ - **Join the Discussion** for patterns on Google Cloud.
243
+ - **Contribute**: Read our [Contributing Guide](/CONTRIBUTING.md).
244
+
245
+ ---
246
+ *Reference: [Google Cloud Architecture Center - Agentic AI Overview](https://docs.cloud.google.com/architecture/agentic-ai-overview)*
@@ -0,0 +1,222 @@
1
+ # 🕹️ AgentOps Cockpit
2
+
3
+ <div align="center">
4
+ <img src="public/assets/trinity.png" alt="AgentOps Cockpit Trinity" width="100%" />
5
+ </div>
6
+
7
+ <div align="center">
8
+ <br />
9
+ <a href="https://agent-cockpit.web.app" target="_blank"><strong>🌐 Official Website & Live Demo</strong></a>
10
+ <br /><br />
11
+ <a href="https://deploy.cloud.google.com?repo=https://github.com/enriquekalven/agent-cockpit">
12
+ <img src="https://deploy.cloud.google.com/button.svg" alt="Deploy to Google Cloud" />
13
+ </a>
14
+ <br />
15
+ <br />
16
+ <img src="https://img.shields.io/github/stars/enriquekalven/agent-cockpit?style=for-the-badge&color=ffd700" alt="GitHub Stars" />
17
+ <img src="https://img.shields.io/github/license/enriquekalven/agent-cockpit?style=for-the-badge&color=007bff" alt="License" />
18
+ <img src="https://img.shields.io/badge/Google-Well--Architected-4285F4?style=for-the-badge&logo=google-cloud" alt="Google Well-Architected" />
19
+ <img src="https://img.shields.io/badge/A2A_Standard-Enabled-10b981?style=for-the-badge" alt="A2A Standard" />
20
+ </div>
21
+
22
+ <br />
23
+
24
+ <div align="center">
25
+ <h3>"Infrastructure gives you the pipes. We give you the Intelligence."</h3>
26
+ <p>The developer distribution for building, optimizing, and securing AI agents on Google Cloud.</p>
27
+ </div>
28
+
29
+ ---
30
+
31
+ ## 📽️ The Mission
32
+ Most AI agent templates stop at a single Python file and an API key. **The AgentOps Cockpit** is for developers moving into production. It provides framework-agnostic governance, safety, and cost guardrails for the entire agentic ecosystem.
33
+
34
+ - **Governance-as-Code**: Audit your agent against [Google Well-Architected](/docs/google-architecture) best practices with the **Evidence Bridge**—real-time citations for architectural integrity.
35
+ - **SME Persona Audits**: Parallelized review of your codebase by automated "Principal SMEs" across FinOps, SecOps, and Architecture.
36
+ - **Agentic Trinity**: Dedicated layers for the Engine (Logic), Face (UX), and Cockpit (Ops).
37
+ - **A2A Connectivity**: Implements the [Agent-to-Agent Transmission Standard](/A2A_GUIDE.md) for secure swarm orchestration.
38
+ - **MCP Native**: Registration as a [Model Context Protocol](https://modelcontextprotocol.io) server for 1P/2P/3P tool consumption.
39
+
40
+ ---
41
+
42
+ ## 🏗️ The Agentic Trinity
43
+ We divide the complexity of production agents into three focused pillars:
44
+
45
+ ```mermaid
46
+ graph LR
47
+ subgraph Trinity [The Agentic Trinity]
48
+ E(The Engine: Reasoning)
49
+ F(The Face: Interface)
50
+ C(The Cockpit: Operations)
51
+ end
52
+ E <--> C
53
+ F <--> C
54
+ E <--> F
55
+ style Trinity fill:#f9f9f9,stroke:#333,stroke-width:2px
56
+ ```
57
+
58
+ - **⚙️ The Engine**: The reasoning core. Built with **ADK**, FastAPI, and Vertex AI.
59
+ - **🎭 The Face**: The user experience. Adaptive UI surfaces and **GenUI** standards via the A2UI spec.
60
+ - **🕹️ The Cockpit**: The operational brain. Cost control, semantic caching, shadow routing, and adversarial audits.
61
+
62
+ <div align="center">
63
+ <img src="public/assets/ecosystem.png" alt="Ecosystem Integrations" width="100%" />
64
+ </div>
65
+
66
+ ---
67
+
68
+ ## 🌐 Framework Agnostic Governance
69
+ The Cockpit isn't just for ADK. It provides **Best Practices as Code** across all major agentic frameworks:
70
+
71
+ <div align="center">
72
+ <img src="https://img.shields.io/badge/OpenAI_Agentkit-412991?style=for-the-badge&logo=openai" alt="OpenAI Agentkit" />
73
+ <img src="https://img.shields.io/badge/Anthropic_Claude-D97757?style=for-the-badge&logo=anthropic" alt="Anthropic" />
74
+ <img src="https://img.shields.io/badge/Microsoft_AutoGen-0078d4?style=for-the-badge&logo=microsoft" alt="Microsoft" />
75
+ <img src="https://img.shields.io/badge/AWS_Bedrock-FF9900?style=for-the-badge&logo=amazon-aws" alt="AWS" />
76
+ <img src="https://img.shields.io/badge/CopilotKit.ai-6366f1?style=for-the-badge" alt="CopilotKit" />
77
+ <img src="https://img.shields.io/badge/LangChain-1C3C3C?style=for-the-badge" alt="LangChain" />
78
+ <img src="https://img.shields.io/badge/ADK-4285F4?style=for-the-badge&logo=google-cloud" alt="ADK" />
79
+ <img src="public/assets/workflow.png" alt="Operational Workflow" width="100%" />
80
+ </div>
81
+
82
+ ## 🛠️ Operational Flow
83
+
84
+ ```mermaid
85
+ sequenceDiagram
86
+ participant U as User
87
+ participant C as Cockpit
88
+ participant E as Engine
89
+ participant F as Face
90
+
91
+ U->>C: Prompt / Input
92
+ C->>C: Policy Audit (RFC-307)
93
+ C->>E: Execute Logic / Tools
94
+ E->>C: Action Proposals
95
+ C->>E: Approve (HITL)
96
+ E->>F: GenUI Metadata
97
+ F->>U: Reactive Surface (A2UI)
98
+ ```
99
+
100
+ <br />
101
+
102
+ <div align="center">
103
+ <img src="https://img.shields.io/badge/Python-3776AB?style=flat-square&logo=python&logoColor=white" alt="Python" />
104
+ <img src="https://img.shields.io/badge/Go-00ADD8?style=flat-square&logo=go&logoColor=white" alt="Go" />
105
+ <img src="https://img.shields.io/badge/NodeJS-339933?style=flat-square&logo=node.js&logoColor=white" alt="NodeJS" />
106
+ <img src="https://img.shields.io/badge/TypeScript-3178C6?style=flat-square&logo=typescript&logoColor=white" alt="TypeScript" />
107
+ <img src="https://img.shields.io/badge/Streamlit-FF4B4B?style=flat-square&logo=streamlit&logoColor=white" alt="Streamlit" />
108
+ <img src="https://img.shields.io/badge/Angular-DD0031?style=flat-square&logo=angular&logoColor=white" alt="Angular" />
109
+ <img src="https://img.shields.io/badge/Lit-324FFF?style=flat-square&logo=lit&logoColor=white" alt="Lit" />
110
+ </div>
111
+
112
+ Whether you are building a swarm in **CrewAI**, a Go-based high-perf engine, or a **Streamlit** dashboard, the Cockpit ensures your agent maps to the **Google Well-Architected Framework**.
113
+
114
+
115
+ ---
116
+
117
+ ## 🚀 Key Innovation: The "Intelligence" Layer
118
+
119
+ ### 🛡️ Red Team Auditor (Self-Hacking)
120
+ Don't wait for your users to find prompt injections. Use the built-in Adversarial Evaluator to launch self-attacks against your agent, testing for PII leaks, instruction overrides, and safety filter bypasses.
121
+
122
+ ### 🧠 Hive Mind (Semantic Caching)
123
+ **Reduce LLM costs by up to 40%.** The Hive Mind checks for semantically similar queries in 10ms, serving cached answers for common questions without calling the LLM.
124
+
125
+ ### 🏛️ Arch Review & Framework Detection
126
+ Every agent in the cockpit is graded against a framework-aware checklist. The Cockpit intelligently detects your stack—**Google ADK**, **OpenAI Agentkit**, **Anthropic Claude**, **Microsoft AutoGen/Semantic Kernel**, **AWS Bedrock Agents**, or **CopilotKit**—and runs a tailored audit against corresponding production standards. Use `make arch-review` to verify your **Governance-as-Code**.
127
+
128
+ ### 🕹️ MCP Connectivity Hub (Model Context Protocol)
129
+ Stop building one-off tool integrations. The Cockpit provides a unified hub for **MCP Servers**. Connect to Google Search, Slack, or your internal databases via the standardized Model Context Protocol for secure, audited tool execution. Start the server with `make mcp-serve`.
130
+
131
+ ### 🗄️ Situational Database Audits
132
+ The Cockpit now performs platform-specific performance and security audits for:
133
+ - **AlloyDB**: Optimizes for the **Columnar Engine** (100x query speedup).
134
+ - **Pinecone**: Suggests **gRPC** and **Namespace Isolation** for high-perf RAG.
135
+ - **BigQuery**: Suggests **BQ Vector Search** for serverless, cost-effective grounding.
136
+ - **Cloud SQL**: Enforces **IAM-based authentication** via the official Python Connector.
137
+
138
+ ### 🧗 Quality Hill Climbing (ADK Evaluation)
139
+ Following **Google ADK Evaluation** best practices, the Cockpit provides an iterative optimization loop. `make quality-baseline` runs your agent against a "Golden Dataset" using **LLM-as-a-Judge** scoring (Response Match & Tool Trajectory), climbing the quality curve until production-grade fidelity is reached.
140
+
141
+ ### 🛑 Mandatory Governance Enforcement (NEW)
142
+ The Cockpit now acts as a mandatory gate for production.
143
+ - **Blocking CI/CD**: GitHub Actions now fail if **High Impact** cost issues or **Red Team** security vulnerabilities are detected.
144
+ - **Build-Time Audit**: The `Dockerfile` includes a mandatory `RUN` audit step. If your agent is not "Well-Architected," the container image will fail to build.
145
+
146
+ ---
147
+
148
+ ## ⌨️ Quick Start
149
+
150
+ The Cockpit is available as a first-class CLI on PyPI.
151
+
152
+ ```bash
153
+ # 1. Install the Cockpit globally
154
+ pip install agentops-cockpit
155
+
156
+ # 2. Run Global Audit (Produces unified report)
157
+ agent-ops report --mode quick # ⚡ Quick Safe-Build
158
+ agent-ops report --mode deep # 🚀 Full System Audit
159
+
160
+ # 3. Guardrail Policy Audit (RFC-307)
161
+ agent-ops policy-audit --text "How to make a bomb?"
162
+
163
+ # 4. Global Scaffolding
164
+ agent-ops-cockpit create <name> --ui a2ui
165
+ ```
166
+
167
+ ### 🔍 Agent Optimizer v2 (Situational Intelligence)
168
+ The Cockpit doesn't just look for generic waste. It now performs **Triple-State Analysis**:
169
+ - **Legacy Workarounds**: Suggests situational fixes for older SDK versions (e.g., manual prompt pruning).
170
+ - **Modernization Paths**: Highlights native performance gains (e.g., 90% cost reduction via Context Caching) available in latest SDKs.
171
+ - **Conflict Guard**: Real-time cross-package validation to prevent architectural deadlocks (e.g., CrewAI vs LangGraph state loops).
172
+
173
+ ### ⚡ Quick-Safe Build (12x Faster Loops)
174
+ Development velocity shouldn't sacrifice safety. The new `--quick` mode in the auditor reduces check latency from **1.8s to 0.15s**, providing sub-second feedback while maintaining the integrity of the Conflict Guard and Architecture Review.
175
+
176
+ ---
177
+
178
+ ### 🧑‍💼 Principal SME Persona Approvals
179
+ The Cockpit now features a **Multi-Persona Governance Board**. Every audit result is framed through the lens of a Principal Engineer in that domain (Security, Legal, FinOps, UX), ensuring your agent is compliant with organizational standards.
180
+
181
+ ### 📄 Export & Reporting
182
+ * **HTML/PDF Export**: Every audit automatically generates `cockpit_report.html`, a premium, printable report ready for PDF export.
183
+ * **Email Reports**: Send audit results directly to stakeholders via the CLI.
184
+
185
+ ---
186
+
187
+ ## 📊 Local Development
188
+ The Cockpit provides a unified "Mission Control" to evaluate your agents instantly.
189
+
190
+ ```bash
191
+ make audit # 🕹️ Run Master Audit (Persona Approved)
192
+ make audit-deep # 🚀 Run Deep Audit (Full SME Verdicts)
193
+ make email-report # 📧 Email the latest result to a stakeholder
194
+ make diagnose # 🩺 Run environment health check
195
+ make optimizer-audit # 🔍 Run Optimizer on specific agent files
196
+ make reliability # 🛡️ Run unit tests and regression suite
197
+ make dev # Start the local Engine + Face stack
198
+ make arch-review # 🏛️ Run the Google Well-Architected design review
199
+ make quality-baseline # 🧗 Run iterative 'Hill Climbing' quality audit
200
+ make red-team # Execute a white-hat security audit
201
+ make deploy-prod # 🚀 1-click deploy to Google Cloud
202
+ ```
203
+
204
+ ---
205
+
206
+ ## 🧭 Roadmap
207
+ - [x] **One-Click GitHub Action**: Automated governance audits on every PR.
208
+ - [x] **Mandatory Build Gates**: Blocking CI/CD and Container audits for production safety.
209
+ - [x] **Multi-Agent Orchestrator**: Standardized A2A Swarm/Coordinator patterns.
210
+ - [ ] **Visual Mission Control**: Real-time cockpit observability dashboard.
211
+
212
+ [View full roadmap →](/ROADMAP.md)
213
+
214
+ ---
215
+
216
+ ## 🤝 Community
217
+ - **Star this repo** to help us build the future of AgentOps.
218
+ - **Join the Discussion** for patterns on Google Cloud.
219
+ - **Contribute**: Read our [Contributing Guide](/CONTRIBUTING.md).
220
+
221
+ ---
222
+ *Reference: [Google Cloud Architecture Center - Agentic AI Overview](https://docs.cloud.google.com/architecture/agentic-ai-overview)*
@@ -7,7 +7,7 @@ Building a single agent is easy. Building a **Swarm** of agents that communicate
7
7
  | Layer | Responsibility | Protocol / Spec |
8
8
  | :--- | :--- | :--- |
9
9
  | **Surface** | Human-Agent Interaction | [A2UI Spec](/docs/a2ui) |
10
- | **Memory** | Cross-Agent Knowledge | [Vector Workspace (Hive Mind)](/src/backend/cache) |
10
+ | **Memory** | Cross-Agent Knowledge | [Vector Workspace (Hive Mind)](/src/agent_ops_cockpit/cache) |
11
11
  | **Logic** | Tool & Reasoning Handshake | [A2P Handshake](#a2p-handshake) |
12
12
  | **Security** | Identity & Permissions | [GCP Workload Identity](https://cloud.google.com/kubernetes-engine/docs/how-to/workload-identity) |
13
13
 
@@ -14,7 +14,7 @@ The Engine comes pre-installed with the **Cockpit Middleware Stack**:
14
14
  We recommend building your tools as **MCP (Model Context Protocol)** or **ADK Extensions**. This ensures that the agent can discover and invoke them with high reliability.
15
15
 
16
16
  ```python
17
- # Example Tool in src/backend/tools/search.py
17
+ # Example Tool in src/agent_ops_cockpit/tools/search.py
18
18
  from adk import Tool
19
19
 
20
20
  @Tool