blockmine 1.21.0 → 1.23.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (492) hide show
  1. package/.claude/agents/README.md +469 -0
  2. package/.claude/agents/auth-route-debugger.md +118 -0
  3. package/.claude/agents/auth-route-tester.md +93 -0
  4. package/.claude/agents/auto-error-resolver.md +97 -0
  5. package/.claude/agents/build-optimizer.md +236 -0
  6. package/.claude/agents/code-architecture-reviewer.md +83 -0
  7. package/.claude/agents/code-refactor-master.md +94 -0
  8. package/.claude/agents/cost-optimizer.md +134 -0
  9. package/.claude/agents/deployment-orchestrator.md +113 -0
  10. package/.claude/agents/documentation-architect.md +82 -0
  11. package/.claude/agents/frontend-error-fixer.md +77 -0
  12. package/.claude/agents/iac-code-generator.md +71 -0
  13. package/.claude/agents/incident-responder.md +346 -0
  14. package/.claude/agents/infrastructure-architect.md +31 -0
  15. package/.claude/agents/kubernetes-specialist.md +56 -0
  16. package/.claude/agents/migration-planner.md +181 -0
  17. package/.claude/agents/network-architect.md +196 -0
  18. package/.claude/agents/plan-reviewer.md +52 -0
  19. package/.claude/agents/refactor-planner.md +63 -0
  20. package/.claude/agents/security-scanner.md +102 -0
  21. package/.claude/agents/web-research-specialist.md +78 -0
  22. package/.claude/commands/cost-analysis.md +315 -0
  23. package/.claude/commands/dev-docs-update.md +55 -0
  24. package/.claude/commands/dev-docs.md +51 -0
  25. package/.claude/commands/incident-debug.md +247 -0
  26. package/.claude/commands/infra-plan.md +81 -0
  27. package/.claude/commands/migration-plan.md +478 -0
  28. package/.claude/commands/route-research-for-testing.md +37 -0
  29. package/.claude/commands/security-review.md +66 -0
  30. package/.claude/hooks/CONFIG.md +448 -0
  31. package/.claude/hooks/README.md +163 -0
  32. package/.claude/hooks/SKILL_ACTIVATION_COMPLETE.md +226 -0
  33. package/.claude/hooks/WINDOWS_HOOKS_README.md +151 -0
  34. package/.claude/hooks/add-skill-activation-banners.ts +132 -0
  35. package/.claude/hooks/comprehensive-skill-test.ts +1315 -0
  36. package/.claude/hooks/error-handling-reminder.sh +12 -0
  37. package/.claude/hooks/error-handling-reminder.ts +222 -0
  38. package/.claude/hooks/k8s-manifest-validator.sh +56 -0
  39. package/.claude/hooks/package-lock.json +556 -0
  40. package/.claude/hooks/package.json +16 -0
  41. package/.claude/hooks/post-tool-use-tracker.ps1 +174 -0
  42. package/.claude/hooks/post-tool-use-tracker.sh +183 -0
  43. package/.claude/hooks/security-policy-check.sh +247 -0
  44. package/.claude/hooks/skill-activation-prompt.ps1 +10 -0
  45. package/.claude/hooks/skill-activation-prompt.sh +10 -0
  46. package/.claude/hooks/skill-activation-prompt.ts +141 -0
  47. package/.claude/hooks/stop-build-check-enhanced.sh +130 -0
  48. package/.claude/hooks/terraform-validator.sh +53 -0
  49. package/.claude/hooks/test-input.json +7 -0
  50. package/.claude/hooks/test-skill-activation.ts +427 -0
  51. package/.claude/hooks/trigger-build-resolver.sh +79 -0
  52. package/.claude/hooks/tsc-check.sh +173 -0
  53. package/.claude/hooks/tsconfig.json +19 -0
  54. package/.claude/settings.json +59 -0
  55. package/.claude/settings.local.json +36 -14
  56. package/.claude/skills/README.md +507 -0
  57. package/.claude/skills/api-engineering/SKILL.md +63 -0
  58. package/.claude/skills/api-engineering/resources/api-versioning.md +88 -0
  59. package/.claude/skills/api-engineering/resources/graphql-patterns.md +106 -0
  60. package/.claude/skills/api-engineering/resources/rate-limiting.md +118 -0
  61. package/.claude/skills/api-engineering/resources/rest-api-design.md +105 -0
  62. package/.claude/skills/backend-dev-guidelines/SKILL.md +306 -0
  63. package/.claude/skills/backend-dev-guidelines/resources/architecture-overview.md +451 -0
  64. package/.claude/skills/backend-dev-guidelines/resources/async-and-errors.md +307 -0
  65. package/.claude/skills/backend-dev-guidelines/resources/complete-examples.md +638 -0
  66. package/.claude/skills/backend-dev-guidelines/resources/configuration.md +275 -0
  67. package/.claude/skills/backend-dev-guidelines/resources/database-patterns.md +224 -0
  68. package/.claude/skills/backend-dev-guidelines/resources/middleware-guide.md +213 -0
  69. package/.claude/skills/backend-dev-guidelines/resources/routing-and-controllers.md +756 -0
  70. package/.claude/skills/backend-dev-guidelines/resources/sentry-and-monitoring.md +336 -0
  71. package/.claude/skills/backend-dev-guidelines/resources/services-and-repositories.md +789 -0
  72. package/.claude/skills/backend-dev-guidelines/resources/testing-guide.md +235 -0
  73. package/.claude/skills/backend-dev-guidelines/resources/validation-patterns.md +754 -0
  74. package/.claude/skills/budget-and-cost-management/SKILL.md +850 -0
  75. package/.claude/skills/build-engineering/SKILL.md +431 -0
  76. package/.claude/skills/build-engineering/resources/artifact-repositories.md +72 -0
  77. package/.claude/skills/build-engineering/resources/build-caching.md +96 -0
  78. package/.claude/skills/build-engineering/resources/build-pipelines.md +105 -0
  79. package/.claude/skills/build-engineering/resources/build-security.md +95 -0
  80. package/.claude/skills/build-engineering/resources/build-systems.md +389 -0
  81. package/.claude/skills/build-engineering/resources/compilation-optimization.md +201 -0
  82. package/.claude/skills/build-engineering/resources/dependency-management.md +73 -0
  83. package/.claude/skills/build-engineering/resources/monorepo-builds.md +110 -0
  84. package/.claude/skills/build-engineering/resources/performance-optimization.md +113 -0
  85. package/.claude/skills/build-engineering/resources/reproducible-builds.md +82 -0
  86. package/.claude/skills/cloud-engineering/SKILL.md +675 -0
  87. package/.claude/skills/cloud-engineering/resources/aws-patterns.md +742 -0
  88. package/.claude/skills/cloud-engineering/resources/azure-patterns.md +714 -0
  89. package/.claude/skills/cloud-engineering/resources/cleared-cloud-environments.md +987 -0
  90. package/.claude/skills/cloud-engineering/resources/cloud-cost-optimization.md +757 -0
  91. package/.claude/skills/cloud-engineering/resources/cloud-networking.md +1058 -0
  92. package/.claude/skills/cloud-engineering/resources/cloud-security-tools.md +1530 -0
  93. package/.claude/skills/cloud-engineering/resources/cloud-security.md +990 -0
  94. package/.claude/skills/cloud-engineering/resources/gcp-patterns.md +758 -0
  95. package/.claude/skills/cloud-engineering/resources/migration-strategies.md +820 -0
  96. package/.claude/skills/cloud-engineering/resources/multi-cloud-strategies.md +670 -0
  97. package/.claude/skills/cloud-engineering/resources/oci-patterns.md +1198 -0
  98. package/.claude/skills/cloud-engineering/resources/serverless-patterns.md +795 -0
  99. package/.claude/skills/cloud-engineering/resources/well-architected-frameworks.md +966 -0
  100. package/.claude/skills/cybersecurity/SKILL.md +409 -0
  101. package/.claude/skills/cybersecurity/resources/security-architecture.md +266 -0
  102. package/.claude/skills/database-engineering/SKILL.md +61 -0
  103. package/.claude/skills/database-engineering/resources/backup-and-recovery.md +72 -0
  104. package/.claude/skills/database-engineering/resources/database-replication.md +63 -0
  105. package/.claude/skills/database-engineering/resources/postgresql-fundamentals.md +70 -0
  106. package/.claude/skills/database-engineering/resources/query-optimization.md +68 -0
  107. package/.claude/skills/devsecops/SKILL.md +374 -0
  108. package/.claude/skills/devsecops/resources/ci-cd-security.md +204 -0
  109. package/.claude/skills/devsecops/resources/compliance-automation.md +530 -0
  110. package/.claude/skills/devsecops/resources/compliance-frameworks.md +2322 -0
  111. package/.claude/skills/devsecops/resources/container-security.md +915 -0
  112. package/.claude/skills/devsecops/resources/cspm-integration.md +1440 -0
  113. package/.claude/skills/devsecops/resources/policy-enforcement.md +619 -0
  114. package/.claude/skills/devsecops/resources/secrets-management.md +755 -0
  115. package/.claude/skills/devsecops/resources/security-monitoring.md +146 -0
  116. package/.claude/skills/devsecops/resources/security-scanning.md +887 -0
  117. package/.claude/skills/devsecops/resources/security-testing.md +203 -0
  118. package/.claude/skills/devsecops/resources/supply-chain-security.md +518 -0
  119. package/.claude/skills/devsecops/resources/vulnerability-management.md +481 -0
  120. package/.claude/skills/devsecops/resources/zero-trust-architecture.md +177 -0
  121. package/.claude/skills/documentation-as-code/SKILL.md +323 -0
  122. package/.claude/skills/documentation-as-code/resources/api-documentation.md +90 -0
  123. package/.claude/skills/documentation-as-code/resources/changelog-management.md +79 -0
  124. package/.claude/skills/documentation-as-code/resources/diagram-generation.md +44 -0
  125. package/.claude/skills/documentation-as-code/resources/docs-as-code-workflow.md +99 -0
  126. package/.claude/skills/documentation-as-code/resources/documentation-automation.md +68 -0
  127. package/.claude/skills/documentation-as-code/resources/documentation-sites.md +79 -0
  128. package/.claude/skills/documentation-as-code/resources/markdown-best-practices.md +162 -0
  129. package/.claude/skills/documentation-as-code/resources/openapi-specification.md +77 -0
  130. package/.claude/skills/documentation-as-code/resources/readme-engineering.md +60 -0
  131. package/.claude/skills/documentation-as-code/resources/technical-writing-guide.md +202 -0
  132. package/.claude/skills/engineering-management/SKILL.md +356 -0
  133. package/.claude/skills/engineering-management/resources/career-ladders.md +609 -0
  134. package/.claude/skills/engineering-management/resources/hiring-and-assessment.md +555 -0
  135. package/.claude/skills/engineering-management/resources/one-on-one-guides.md +609 -0
  136. package/.claude/skills/engineering-management/resources/resource-planning.md +557 -0
  137. package/.claude/skills/engineering-management/resources/team-organization-patterns.md +491 -0
  138. package/.claude/skills/engineering-management/resources/technical-interviews.md +474 -0
  139. package/.claude/skills/engineering-operations-management/SKILL.md +817 -0
  140. package/.claude/skills/error-tracking/SKILL.md +379 -0
  141. package/.claude/skills/frontend-dev-guidelines/SKILL.md +403 -0
  142. package/.claude/skills/frontend-dev-guidelines/resources/common-patterns.md +331 -0
  143. package/.claude/skills/frontend-dev-guidelines/resources/complete-examples.md +872 -0
  144. package/.claude/skills/frontend-dev-guidelines/resources/component-patterns.md +502 -0
  145. package/.claude/skills/frontend-dev-guidelines/resources/data-fetching.md +767 -0
  146. package/.claude/skills/frontend-dev-guidelines/resources/file-organization.md +502 -0
  147. package/.claude/skills/frontend-dev-guidelines/resources/loading-and-error-states.md +501 -0
  148. package/.claude/skills/frontend-dev-guidelines/resources/performance.md +406 -0
  149. package/.claude/skills/frontend-dev-guidelines/resources/routing-guide.md +364 -0
  150. package/.claude/skills/frontend-dev-guidelines/resources/styling-guide.md +428 -0
  151. package/.claude/skills/frontend-dev-guidelines/resources/typescript-standards.md +418 -0
  152. package/.claude/skills/general-it-engineering/SKILL.md +393 -0
  153. package/.claude/skills/general-it-engineering/resources/asset-management.md +712 -0
  154. package/.claude/skills/general-it-engineering/resources/automation-orchestration.md +817 -0
  155. package/.claude/skills/general-it-engineering/resources/business-continuity.md +786 -0
  156. package/.claude/skills/general-it-engineering/resources/change-management.md +715 -0
  157. package/.claude/skills/general-it-engineering/resources/enterprise-monitoring.md +729 -0
  158. package/.claude/skills/general-it-engineering/resources/help-desk-operations.md +738 -0
  159. package/.claude/skills/general-it-engineering/resources/incident-service-management.md +834 -0
  160. package/.claude/skills/general-it-engineering/resources/it-governance.md +753 -0
  161. package/.claude/skills/general-it-engineering/resources/itil-framework.md +503 -0
  162. package/.claude/skills/general-it-engineering/resources/service-management.md +669 -0
  163. package/.claude/skills/infrastructure-architecture/SKILL.md +328 -0
  164. package/.claude/skills/infrastructure-architecture/resources/architecture-decision-records.md +505 -0
  165. package/.claude/skills/infrastructure-architecture/resources/architecture-patterns.md +528 -0
  166. package/.claude/skills/infrastructure-architecture/resources/capacity-planning.md +453 -0
  167. package/.claude/skills/infrastructure-architecture/resources/cleared-environment-architecture.md +773 -0
  168. package/.claude/skills/infrastructure-architecture/resources/cost-architecture.md +499 -0
  169. package/.claude/skills/infrastructure-architecture/resources/data-architecture.md +501 -0
  170. package/.claude/skills/infrastructure-architecture/resources/disaster-recovery.md +535 -0
  171. package/.claude/skills/infrastructure-architecture/resources/migration-architecture.md +512 -0
  172. package/.claude/skills/infrastructure-architecture/resources/multi-region-design.md +608 -0
  173. package/.claude/skills/infrastructure-architecture/resources/reference-architectures.md +562 -0
  174. package/.claude/skills/infrastructure-architecture/resources/security-architecture.md +538 -0
  175. package/.claude/skills/infrastructure-architecture/resources/system-design-principles.md +489 -0
  176. package/.claude/skills/infrastructure-architecture/resources/workload-classification.md +1000 -0
  177. package/.claude/skills/infrastructure-strategy/SKILL.md +924 -0
  178. package/.claude/skills/network-engineering/SKILL.md +385 -0
  179. package/.claude/skills/network-engineering/resources/dns-management.md +738 -0
  180. package/.claude/skills/network-engineering/resources/load-balancing.md +820 -0
  181. package/.claude/skills/network-engineering/resources/network-architecture.md +546 -0
  182. package/.claude/skills/network-engineering/resources/network-security.md +921 -0
  183. package/.claude/skills/network-engineering/resources/network-troubleshooting.md +749 -0
  184. package/.claude/skills/network-engineering/resources/routing-switching.md +373 -0
  185. package/.claude/skills/network-engineering/resources/sdn-networking.md +695 -0
  186. package/.claude/skills/network-engineering/resources/service-mesh-networking.md +777 -0
  187. package/.claude/skills/network-engineering/resources/tcp-ip-protocols.md +444 -0
  188. package/.claude/skills/network-engineering/resources/vpn-connectivity.md +672 -0
  189. package/.claude/skills/observability-engineering/SKILL.md +101 -0
  190. package/.claude/skills/observability-engineering/resources/apm-tools.md +97 -0
  191. package/.claude/skills/observability-engineering/resources/correlation-strategies.md +87 -0
  192. package/.claude/skills/observability-engineering/resources/distributed-tracing.md +98 -0
  193. package/.claude/skills/observability-engineering/resources/logs-aggregation.md +118 -0
  194. package/.claude/skills/observability-engineering/resources/observability-cost-optimization.md +141 -0
  195. package/.claude/skills/observability-engineering/resources/opentelemetry.md +110 -0
  196. package/.claude/skills/platform-engineering/SKILL.md +555 -0
  197. package/.claude/skills/platform-engineering/resources/architecture-overview.md +600 -0
  198. package/.claude/skills/platform-engineering/resources/container-orchestration.md +916 -0
  199. package/.claude/skills/platform-engineering/resources/cost-optimization.md +634 -0
  200. package/.claude/skills/platform-engineering/resources/developer-platforms.md +670 -0
  201. package/.claude/skills/platform-engineering/resources/gitops-automation.md +650 -0
  202. package/.claude/skills/platform-engineering/resources/infrastructure-as-code.md +778 -0
  203. package/.claude/skills/platform-engineering/resources/infrastructure-standards.md +708 -0
  204. package/.claude/skills/platform-engineering/resources/multi-tenancy.md +602 -0
  205. package/.claude/skills/platform-engineering/resources/platform-security.md +711 -0
  206. package/.claude/skills/platform-engineering/resources/resource-management.md +592 -0
  207. package/.claude/skills/platform-engineering/resources/service-mesh.md +628 -0
  208. package/.claude/skills/release-engineering/SKILL.md +393 -0
  209. package/.claude/skills/release-engineering/resources/artifact-management.md +108 -0
  210. package/.claude/skills/release-engineering/resources/build-optimization.md +84 -0
  211. package/.claude/skills/release-engineering/resources/ci-cd-pipelines.md +411 -0
  212. package/.claude/skills/release-engineering/resources/deployment-strategies.md +197 -0
  213. package/.claude/skills/release-engineering/resources/pipeline-security.md +62 -0
  214. package/.claude/skills/release-engineering/resources/progressive-delivery.md +83 -0
  215. package/.claude/skills/release-engineering/resources/release-automation.md +68 -0
  216. package/.claude/skills/release-engineering/resources/release-orchestration.md +77 -0
  217. package/.claude/skills/release-engineering/resources/rollback-strategies.md +66 -0
  218. package/.claude/skills/release-engineering/resources/versioning-strategies.md +59 -0
  219. package/.claude/skills/route-tester/SKILL.md +392 -0
  220. package/.claude/skills/skill-developer/ADVANCED.md +197 -0
  221. package/.claude/skills/skill-developer/HOOK_MECHANISMS.md +306 -0
  222. package/.claude/skills/skill-developer/PATTERNS_LIBRARY.md +152 -0
  223. package/.claude/skills/skill-developer/SKILL.md +430 -0
  224. package/.claude/skills/skill-developer/SKILL_RULES_REFERENCE.md +315 -0
  225. package/.claude/skills/skill-developer/TRIGGER_TYPES.md +305 -0
  226. package/.claude/skills/skill-developer/TROUBLESHOOTING.md +514 -0
  227. package/.claude/skills/skill-rules.json +2940 -0
  228. package/.claude/skills/sre/SKILL.md +464 -0
  229. package/.claude/skills/sre/resources/alerting-best-practices.md +282 -0
  230. package/.claude/skills/sre/resources/capacity-planning.md +226 -0
  231. package/.claude/skills/sre/resources/chaos-engineering.md +193 -0
  232. package/.claude/skills/sre/resources/disaster-recovery.md +232 -0
  233. package/.claude/skills/sre/resources/incident-management.md +436 -0
  234. package/.claude/skills/sre/resources/observability-stack.md +240 -0
  235. package/.claude/skills/sre/resources/on-call-runbooks.md +167 -0
  236. package/.claude/skills/sre/resources/performance-optimization.md +108 -0
  237. package/.claude/skills/sre/resources/reliability-patterns.md +183 -0
  238. package/.claude/skills/sre/resources/slo-sli-sla.md +464 -0
  239. package/.claude/skills/sre/resources/toil-reduction.md +145 -0
  240. package/.claude/skills/systems-engineering/SKILL.md +648 -0
  241. package/.claude/skills/systems-engineering/resources/automation-patterns.md +771 -0
  242. package/.claude/skills/systems-engineering/resources/configuration-management.md +998 -0
  243. package/.claude/skills/systems-engineering/resources/linux-administration.md +672 -0
  244. package/.claude/skills/systems-engineering/resources/networking-fundamentals.md +982 -0
  245. package/.claude/skills/systems-engineering/resources/performance-tuning.md +871 -0
  246. package/.claude/skills/systems-engineering/resources/powershell-scripting.md +482 -0
  247. package/.claude/skills/systems-engineering/resources/security-hardening.md +739 -0
  248. package/.claude/skills/systems-engineering/resources/shell-scripting.md +915 -0
  249. package/.claude/skills/systems-engineering/resources/storage-management.md +628 -0
  250. package/.claude/skills/systems-engineering/resources/system-monitoring.md +787 -0
  251. package/.claude/skills/systems-engineering/resources/troubleshooting-guide.md +753 -0
  252. package/.claude/skills/systems-engineering/resources/windows-administration.md +738 -0
  253. package/.claude/skills/technical-leadership/SKILL.md +728 -0
  254. package/CHANGELOG.md +102 -42
  255. package/CLAUDE.md +284 -0
  256. package/README.md +315 -71
  257. package/backend/docs/SECRETS_DOCUMENTATION.md +327 -0
  258. package/backend/jest.config.js +59 -0
  259. package/backend/package-lock.json +6801 -0
  260. package/backend/package.json +24 -4
  261. package/backend/prisma/migrations/20251026104609_add_websocket_api/migration.sql +33 -0
  262. package/backend/prisma/migrations/20251116111851_add_execution_trace/migration.sql +22 -0
  263. package/backend/prisma/migrations/20251120154914_add_panel_api_keys/migration.sql +21 -0
  264. package/backend/prisma/migrations/20251121110241_add_proxy_table/migration.sql +45 -0
  265. package/backend/prisma/migrations/migration_lock.toml +2 -2
  266. package/backend/prisma/schema.prisma +103 -1
  267. package/backend/src/__tests__/core/DependencyService.test.js +336 -0
  268. package/backend/src/__tests__/core/UserService.test.js +875 -0
  269. package/backend/src/__tests__/repositories/BaseRepository.test.js +146 -0
  270. package/backend/src/__tests__/repositories/BotRepository.test.js +118 -0
  271. package/backend/src/__tests__/repositories/CommandRepository.test.js +132 -0
  272. package/backend/src/__tests__/repositories/EventGraphRepository.test.js +93 -0
  273. package/backend/src/__tests__/repositories/GroupRepository.test.js +155 -0
  274. package/backend/src/__tests__/repositories/PermissionRepository.test.js +130 -0
  275. package/backend/src/__tests__/repositories/PluginRepository.test.js +107 -0
  276. package/backend/src/__tests__/repositories/ServerRepository.test.js +80 -0
  277. package/backend/src/__tests__/repositories/UserRepository.test.js +128 -0
  278. package/backend/src/__tests__/secretsFilter.test.js +425 -0
  279. package/backend/src/__tests__/services/BotLifecycleService.test.js +416 -0
  280. package/backend/src/__tests__/services/BotProcessManager.test.js +285 -0
  281. package/backend/src/__tests__/services/CacheManager.test.js +125 -0
  282. package/backend/src/__tests__/services/CommandExecutionService.test.js +460 -0
  283. package/backend/src/__tests__/services/ResourceMonitorService.test.js +207 -0
  284. package/backend/src/__tests__/services/TelemetryService.test.js +291 -0
  285. package/backend/src/__tests__/setup.js +25 -0
  286. package/backend/src/ai/plugin-assistant-system-prompt.md +788 -0
  287. package/backend/src/api/middleware/auth.js +27 -0
  288. package/backend/src/api/middleware/botAccess.js +7 -3
  289. package/backend/src/api/middleware/panelApiAuth.js +135 -0
  290. package/backend/src/api/routes/aiAssistant.js +995 -0
  291. package/backend/src/api/routes/apiKeys.js +181 -0
  292. package/backend/src/api/routes/auth.js +669 -633
  293. package/backend/src/api/routes/botCommands.js +107 -0
  294. package/backend/src/api/routes/botGroups.js +165 -0
  295. package/backend/src/api/routes/botHistory.js +108 -0
  296. package/backend/src/api/routes/botPermissions.js +99 -0
  297. package/backend/src/api/routes/botStatus.js +36 -0
  298. package/backend/src/api/routes/botUsers.js +162 -0
  299. package/backend/src/api/routes/bots.js +2451 -2360
  300. package/backend/src/api/routes/eventGraphs.js +4 -1
  301. package/backend/src/api/routes/logs.js +13 -3
  302. package/backend/src/api/routes/panel.js +66 -66
  303. package/backend/src/api/routes/panelApiKeys.js +179 -0
  304. package/backend/src/api/routes/pluginIde.js +1715 -135
  305. package/backend/src/api/routes/plugins.js +376 -218
  306. package/backend/src/api/routes/proxies.js +130 -0
  307. package/backend/src/api/routes/search.js +4 -0
  308. package/backend/src/api/routes/servers.js +20 -3
  309. package/backend/src/api/routes/settings.js +5 -0
  310. package/backend/src/api/routes/system.js +174 -0
  311. package/backend/src/api/routes/traces.js +131 -0
  312. package/backend/src/config/debug.config.js +36 -0
  313. package/backend/src/container.js +82 -0
  314. package/backend/src/core/BotHistoryStore.js +180 -0
  315. package/backend/src/core/BotManager.js +149 -868
  316. package/backend/src/core/BotManager.old.js +1093 -0
  317. package/backend/src/core/BotProcess.js +850 -191
  318. package/backend/src/core/EventGraphManager.js +194 -198
  319. package/backend/src/core/GraphExecutionEngine.js +709 -57
  320. package/backend/src/core/MessageQueue.js +39 -12
  321. package/backend/src/core/NodeRegistry.js +37 -1134
  322. package/backend/src/core/PluginLoader.js +99 -5
  323. package/backend/src/core/PluginManager.js +126 -15
  324. package/backend/src/core/PrismaService.js +32 -0
  325. package/backend/src/core/TaskScheduler.js +1 -1
  326. package/backend/src/core/UserService.js +3 -3
  327. package/backend/src/core/__tests__/PrismaService.test.js +24 -0
  328. package/backend/src/core/commands/README.md +305 -0
  329. package/backend/src/core/commands/dev.js +13 -7
  330. package/backend/src/core/commands/ping.js +10 -4
  331. package/backend/src/core/commands/whois.js +63 -0
  332. package/backend/src/core/config/validation.js +27 -0
  333. package/backend/src/core/constants/graphTypes.js +21 -0
  334. package/backend/src/core/node-registries/actions.js +202 -0
  335. package/backend/src/core/node-registries/arrays.js +155 -0
  336. package/backend/src/core/node-registries/bot.js +23 -0
  337. package/backend/src/core/node-registries/data.js +290 -0
  338. package/backend/src/core/node-registries/debug.js +26 -0
  339. package/backend/src/core/node-registries/events.js +201 -0
  340. package/backend/src/core/node-registries/flow.js +139 -0
  341. package/backend/src/core/node-registries/logic.js +62 -0
  342. package/backend/src/core/node-registries/math.js +42 -0
  343. package/backend/src/core/node-registries/objects.js +98 -0
  344. package/backend/src/core/node-registries/strings.js +187 -0
  345. package/backend/src/core/node-registries/time.js +113 -0
  346. package/backend/src/core/node-registries/type.js +25 -0
  347. package/backend/src/core/node-registries/users.js +79 -0
  348. package/backend/src/core/nodes/{action_bot_look_at.js → actions/bot_look_at.js} +36 -36
  349. package/backend/src/core/nodes/{action_bot_set_variable.js → actions/bot_set_variable.js} +32 -32
  350. package/backend/src/core/nodes/actions/create_command.js +189 -0
  351. package/backend/src/core/nodes/actions/delete_command.js +92 -0
  352. package/backend/src/core/nodes/{action_send_log.js → actions/send_log.js} +28 -23
  353. package/backend/src/core/nodes/{action_send_message.js → actions/send_message.js} +32 -32
  354. package/backend/src/core/nodes/actions/send_websocket_response.js +33 -0
  355. package/backend/src/core/nodes/actions/update_command.js +133 -0
  356. package/backend/src/core/nodes/arrays/get_next.js +35 -0
  357. package/backend/src/core/nodes/arrays/join.js +28 -0
  358. package/backend/src/core/nodes/{data_cast.js → data/cast.js} +10 -1
  359. package/backend/src/core/nodes/data/datetime_literal.js +27 -0
  360. package/backend/src/core/nodes/data/entity_info.js +69 -0
  361. package/backend/src/core/nodes/data/get_nearby_entities.js +32 -0
  362. package/backend/src/core/nodes/data/get_nearby_players.js +64 -0
  363. package/backend/src/core/nodes/{data_get_user_field.js → data/get_user_field.js} +1 -1
  364. package/backend/src/core/nodes/data/type_check.js +53 -0
  365. package/backend/src/core/nodes/{debug_log.js → debug/log.js} +16 -16
  366. package/backend/src/core/nodes/{flow_branch.js → flow/branch.js} +15 -15
  367. package/backend/src/core/nodes/{flow_break.js → flow/break.js} +14 -14
  368. package/backend/src/core/nodes/flow/delay.js +43 -0
  369. package/backend/src/core/nodes/{flow_for_each.js → flow/for_each.js} +39 -39
  370. package/backend/src/core/nodes/{flow_sequence.js → flow/sequence.js} +16 -16
  371. package/backend/src/core/nodes/{flow_switch.js → flow/switch.js} +47 -47
  372. package/backend/src/core/nodes/{flow_while.js → flow/while.js} +1 -1
  373. package/backend/src/core/nodes/logic/__tests__/compare.test.js +83 -0
  374. package/backend/src/core/nodes/logic/not.js +22 -0
  375. package/backend/src/core/nodes/math/__tests__/operation.test.js +65 -0
  376. package/backend/src/core/nodes/strings/__tests__/concat.test.js +89 -0
  377. package/backend/src/core/nodes/{string_starts_with.js → strings/starts_with.js} +1 -1
  378. package/backend/src/core/nodes/strings/to_lower.js +22 -0
  379. package/backend/src/core/nodes/strings/to_upper.js +22 -0
  380. package/backend/src/core/nodes/time/__tests__/now.test.js +24 -0
  381. package/backend/src/core/nodes/time/add.js +33 -0
  382. package/backend/src/core/nodes/time/compare.js +35 -0
  383. package/backend/src/core/nodes/time/diff.js +29 -0
  384. package/backend/src/core/nodes/time/format.js +32 -0
  385. package/backend/src/core/nodes/time/now.js +18 -0
  386. package/backend/src/core/nodes/type/to_string.js +32 -0
  387. package/backend/src/core/nodes/{user_check_blacklist.js → users/check_blacklist.js} +37 -37
  388. package/backend/src/core/nodes/{user_get_groups.js → users/get_groups.js} +36 -36
  389. package/backend/src/core/nodes/{user_get_permissions.js → users/get_permissions.js} +36 -36
  390. package/backend/src/core/nodes/{user_set_blacklist.js → users/set_blacklist.js} +37 -37
  391. package/backend/src/core/services/BotLifecycleService.js +835 -0
  392. package/backend/src/core/services/BotProcessManager.js +163 -0
  393. package/backend/src/core/services/CacheManager.js +111 -0
  394. package/backend/src/core/services/CommandExecutionService.js +430 -0
  395. package/backend/src/core/services/DebugSessionManager.js +347 -0
  396. package/backend/src/core/services/GraphCollaborationManager.js +501 -0
  397. package/backend/src/core/services/MinecraftBotManager.js +259 -0
  398. package/backend/src/core/services/MinecraftViewerService.js +216 -0
  399. package/backend/src/core/services/ResourceMonitorService.js +90 -0
  400. package/backend/src/core/services/TelemetryService.js +124 -0
  401. package/backend/src/core/services/TraceCollectorService.js +545 -0
  402. package/backend/src/core/services/ValidationService.js +132 -0
  403. package/backend/src/core/services/__tests__/ValidationService.test.js +148 -0
  404. package/backend/src/core/services.js +20 -5
  405. package/backend/src/core/system/CommandContext.js +84 -0
  406. package/backend/src/core/system/RuntimeCommandRegistry.js +116 -0
  407. package/backend/src/core/system/Transport.js +74 -0
  408. package/backend/src/core/utils/__tests__/jsonParser.test.js +44 -0
  409. package/backend/src/core/utils/jsonParser.js +18 -0
  410. package/backend/src/core/utils/secretsFilter.js +262 -0
  411. package/backend/src/core/utils/variableParser.js +89 -0
  412. package/backend/src/core/validation/__tests__/nodeSchemas.test.js +175 -0
  413. package/backend/src/core/validation/nodeSchemas.js +112 -0
  414. package/backend/src/lib/prisma.js +2 -4
  415. package/backend/src/real-time/botApi/handlers/commandHandlers.js +28 -0
  416. package/backend/src/real-time/botApi/handlers/graphHandlers.js +99 -0
  417. package/backend/src/real-time/botApi/handlers/graphWebSocketHandlers.js +147 -0
  418. package/backend/src/real-time/botApi/handlers/index.js +43 -0
  419. package/backend/src/real-time/botApi/handlers/messageHandlers.js +66 -0
  420. package/backend/src/real-time/botApi/handlers/statusHandlers.js +17 -0
  421. package/backend/src/real-time/botApi/handlers/userHandlers.js +141 -0
  422. package/backend/src/real-time/botApi/index.js +40 -0
  423. package/backend/src/real-time/botApi/middleware.js +79 -0
  424. package/backend/src/real-time/botApi/utils.js +65 -0
  425. package/backend/src/real-time/panelNamespace.js +387 -0
  426. package/backend/src/real-time/presence.js +7 -2
  427. package/backend/src/real-time/socketHandler.js +400 -5
  428. package/backend/src/repositories/BaseRepository.js +43 -0
  429. package/backend/src/repositories/BotRepository.js +42 -0
  430. package/backend/src/repositories/CommandRepository.js +53 -0
  431. package/backend/src/repositories/EventGraphRepository.js +40 -0
  432. package/backend/src/repositories/GroupRepository.js +69 -0
  433. package/backend/src/repositories/PermissionRepository.js +48 -0
  434. package/backend/src/repositories/PluginRepository.js +42 -0
  435. package/backend/src/repositories/ServerRepository.js +27 -0
  436. package/backend/src/repositories/UserRepository.js +48 -0
  437. package/backend/src/server.js +21 -0
  438. package/backend/src/test-refactor.js +85 -0
  439. package/frontend/dist/assets/index-B1serztM.js +11210 -0
  440. package/frontend/dist/assets/index-t6K1u4OV.css +32 -0
  441. package/frontend/dist/index.html +2 -2
  442. package/frontend/package-lock.json +9437 -0
  443. package/frontend/package.json +8 -5
  444. package/package.json +3 -2
  445. package/screen/console.png +0 -0
  446. package/screen/dashboard.png +0 -0
  447. package/screen/graph_collabe.png +0 -0
  448. package/screen/graph_live_debug.png +0 -0
  449. package/screen/management_command.png +0 -0
  450. package/screen/node_debug_trace.png +0 -0
  451. package/screen/plugin_/320/276/320/261/320/267/320/276/321/200.png +0 -0
  452. package/screen/websocket.png +0 -0
  453. package/screen//320/275/320/260/321/201/321/202/321/200/320/276/320/271/320/272/320/270_/320/276/321/202/320/264/320/265/320/273/321/214/320/275/321/213/321/205_/320/272/320/276/320/274/320/260/320/275/320/264_/320/272/320/260/320/266/320/264/321/203_/320/272/320/276/320/274/320/260/320/275/320/273/320/264/321/203_/320/274/320/276/320/266/320/275/320/276_/320/275/320/260/321/201/321/202/321/200/320/260/320/270/320/262/320/260/321/202/321/214.png +0 -0
  454. package/screen//320/277/320/273/320/260/320/275/320/270/321/200/320/276/320/262/321/211/320/270/320/272_/320/274/320/276/320/266/320/275/320/276_/320/267/320/260/320/264/320/260/320/262/320/260/321/202/321/214_/320/264/320/265/320/271/321/201/321/202/320/262/320/270/321/217_/320/277/320/276_/320/262/321/200/320/265/320/274/320/265/320/275/320/270.png +0 -0
  455. package/frontend/dist/assets/index-B9GedHEa.js +0 -8352
  456. package/frontend/dist/assets/index-zLiy9MDx.css +0 -1
  457. package/nul +0 -0
  458. /package/backend/src/core/nodes/{action_http_request.js → actions/http_request.js} +0 -0
  459. /package/backend/src/core/nodes/{array_add_element.js → arrays/add_element.js} +0 -0
  460. /package/backend/src/core/nodes/{array_contains.js → arrays/contains.js} +0 -0
  461. /package/backend/src/core/nodes/{array_find_index.js → arrays/find_index.js} +0 -0
  462. /package/backend/src/core/nodes/{array_get_by_index.js → arrays/get_by_index.js} +0 -0
  463. /package/backend/src/core/nodes/{array_get_random_element.js → arrays/get_random_element.js} +0 -0
  464. /package/backend/src/core/nodes/{array_remove_by_index.js → arrays/remove_by_index.js} +0 -0
  465. /package/backend/src/core/nodes/{bot_get_position.js → bot/get_position.js} +0 -0
  466. /package/backend/src/core/nodes/{data_array_literal.js → data/array_literal.js} +0 -0
  467. /package/backend/src/core/nodes/{data_boolean_literal.js → data/boolean_literal.js} +0 -0
  468. /package/backend/src/core/nodes/{data_get_argument.js → data/get_argument.js} +0 -0
  469. /package/backend/src/core/nodes/{data_get_bot_look.js → data/get_bot_look.js} +0 -0
  470. /package/backend/src/core/nodes/{data_get_entity_field.js → data/get_entity_field.js} +0 -0
  471. /package/backend/src/core/nodes/{data_get_server_players.js → data/get_server_players.js} +0 -0
  472. /package/backend/src/core/nodes/{data_get_variable.js → data/get_variable.js} +0 -0
  473. /package/backend/src/core/nodes/{data_length.js → data/length.js} +0 -0
  474. /package/backend/src/core/nodes/{data_make_object.js → data/make_object.js} +0 -0
  475. /package/backend/src/core/nodes/{data_number_literal.js → data/number_literal.js} +0 -0
  476. /package/backend/src/core/nodes/{data_string_literal.js → data/string_literal.js} +0 -0
  477. /package/backend/src/core/nodes/{logic_compare.js → logic/compare.js} +0 -0
  478. /package/backend/src/core/nodes/{logic_operation.js → logic/operation.js} +0 -0
  479. /package/backend/src/core/nodes/{math_operation.js → math/operation.js} +0 -0
  480. /package/backend/src/core/nodes/{math_random_number.js → math/random_number.js} +0 -0
  481. /package/backend/src/core/nodes/{object_create.js → objects/create.js} +0 -0
  482. /package/backend/src/core/nodes/{object_delete.js → objects/delete.js} +0 -0
  483. /package/backend/src/core/nodes/{object_get.js → objects/get.js} +0 -0
  484. /package/backend/src/core/nodes/{object_has_key.js → objects/has_key.js} +0 -0
  485. /package/backend/src/core/nodes/{object_set.js → objects/set.js} +0 -0
  486. /package/backend/src/core/nodes/{string_concat.js → strings/concat.js} +0 -0
  487. /package/backend/src/core/nodes/{string_contains.js → strings/contains.js} +0 -0
  488. /package/backend/src/core/nodes/{string_ends_with.js → strings/ends_with.js} +0 -0
  489. /package/backend/src/core/nodes/{string_equals.js → strings/equals.js} +0 -0
  490. /package/backend/src/core/nodes/{string_length.js → strings/length.js} +0 -0
  491. /package/backend/src/core/nodes/{string_matches.js → strings/matches.js} +0 -0
  492. /package/backend/src/core/nodes/{string_split.js → strings/split.js} +0 -0
@@ -0,0 +1,729 @@
1
+ # Enterprise Monitoring
2
+
3
+ Enterprise monitoring tools, dashboards, capacity management, performance metrics, and proactive monitoring strategies.
4
+
5
+ ## Table of Contents
6
+
7
+ - [Monitoring Overview](#monitoring-overview)
8
+ - [Monitoring Tools](#monitoring-tools)
9
+ - [Monitoring Metrics](#monitoring-metrics)
10
+ - [Dashboards](#dashboards)
11
+ - [Alerting](#alerting)
12
+ - [Capacity Management](#capacity-management)
13
+ - [Best Practices](#best-practices)
14
+
15
+ ## Monitoring Overview
16
+
17
+ ### Purpose
18
+
19
+ Enterprise monitoring provides:
20
+ - Real-time visibility into IT infrastructure
21
+ - Proactive issue detection
22
+ - Performance optimization
23
+ - Capacity planning
24
+ - Service level compliance
25
+ - Root cause analysis
26
+
27
+ ### Monitoring Layers
28
+
29
+ ```
30
+ ┌─────────────────────────────────────────┐
31
+ │ Business Monitoring │
32
+ │ - Transaction success rate │
33
+ │ - Revenue per minute │
34
+ │ - Customer experience │
35
+ └──────────────┬──────────────────────────┘
36
+
37
+ ┌─────────────────────────────────────────┐
38
+ │ Application Monitoring (APM) │
39
+ │ - Response times │
40
+ │ - Error rates │
41
+ │ - Database query performance │
42
+ └──────────────┬──────────────────────────┘
43
+
44
+ ┌─────────────────────────────────────────┐
45
+ │ Infrastructure Monitoring │
46
+ │ - Server CPU/memory │
47
+ │ - Network bandwidth │
48
+ │ - Storage capacity │
49
+ └──────────────┬──────────────────────────┘
50
+
51
+ ┌─────────────────────────────────────────┐
52
+ │ Network Monitoring │
53
+ │ - Link availability │
54
+ │ - Latency │
55
+ │ - Packet loss │
56
+ └─────────────────────────────────────────┘
57
+ ```
58
+
59
+ ## Monitoring Tools
60
+
61
+ ### Enterprise Monitoring Stack
62
+
63
+ **Infrastructure Monitoring:**
64
+ ```yaml
65
+ Tools:
66
+ - Nagios/Icinga: Traditional monitoring
67
+ - Zabbix: Enterprise monitoring
68
+ - PRTG: Network monitoring
69
+ - SolarWinds: Comprehensive suite
70
+
71
+ Capabilities:
72
+ - Server monitoring (CPU, memory, disk)
73
+ - Network device monitoring
74
+ - Service checks (HTTP, SMTP, etc.)
75
+ - SNMP monitoring
76
+ - Alerting
77
+ ```
78
+
79
+ **Application Performance Monitoring (APM):**
80
+ ```yaml
81
+ Tools:
82
+ - New Relic: Full-stack observability
83
+ - Dynatrace: AI-powered APM
84
+ - AppDynamics: Application intelligence
85
+ - Datadog: Cloud-scale monitoring
86
+
87
+ Capabilities:
88
+ - Application performance
89
+ - Transaction tracing
90
+ - Code-level diagnostics
91
+ - User experience monitoring
92
+ - Error tracking
93
+ ```
94
+
95
+ **Log Management:**
96
+ ```yaml
97
+ Tools:
98
+ - Splunk: Enterprise log analysis
99
+ - ELK Stack: Open-source (Elasticsearch, Logstash, Kibana)
100
+ - Graylog: Log management
101
+ - Sumo Logic: Cloud-native logs
102
+
103
+ Capabilities:
104
+ - Centralized logging
105
+ - Log aggregation
106
+ - Search and analysis
107
+ - Correlation
108
+ - Compliance
109
+ ```
110
+
111
+ **Cloud Monitoring:**
112
+ ```yaml
113
+ AWS:
114
+ - CloudWatch: Metrics and logs
115
+ - X-Ray: Distributed tracing
116
+ - CloudTrail: Audit logs
117
+
118
+ Azure:
119
+ - Azure Monitor: Unified monitoring
120
+ - Application Insights: APM
121
+ - Log Analytics: Log management
122
+
123
+ GCP:
124
+ - Cloud Monitoring: Metrics
125
+ - Cloud Logging: Logs
126
+ - Cloud Trace: Distributed tracing
127
+ ```
128
+
129
+ ## Monitoring Metrics
130
+
131
+ ### Infrastructure Metrics
132
+
133
+ **Server Metrics:**
134
+ ```yaml
135
+ CPU:
136
+ - CPU utilization (%)
137
+ - Load average (1m, 5m, 15m)
138
+ - Context switches
139
+ - CPU steal time (virtual)
140
+
141
+ Thresholds:
142
+ Warning: >70%
143
+ Critical: >90%
144
+
145
+ Memory:
146
+ - Memory utilization (%)
147
+ - Swap usage
148
+ - Memory available
149
+ - Page faults
150
+
151
+ Thresholds:
152
+ Warning: >80%
153
+ Critical: >95%
154
+
155
+ Disk:
156
+ - Disk utilization (%)
157
+ - Disk I/O (read/write IOPS)
158
+ - Disk latency
159
+ - Disk queue depth
160
+
161
+ Thresholds:
162
+ Utilization Warning: >80%
163
+ Utilization Critical: >90%
164
+ Latency Warning: >20ms
165
+ Latency Critical: >50ms
166
+
167
+ Network:
168
+ - Bandwidth utilization (%)
169
+ - Packets in/out
170
+ - Errors
171
+ - Dropped packets
172
+
173
+ Thresholds:
174
+ Bandwidth Warning: >70%
175
+ Bandwidth Critical: >90%
176
+ ```
177
+
178
+ ### Application Metrics
179
+
180
+ ```yaml
181
+ Availability:
182
+ - Uptime (%)
183
+ - Error rate (%)
184
+ - Success rate (%)
185
+
186
+ Targets:
187
+ Uptime: 99.9% (SLA)
188
+ Error Rate: <1%
189
+
190
+ Performance:
191
+ - Response time (p50, p95, p99)
192
+ - Transactions per second (TPS)
193
+ - Throughput
194
+ - Apdex score
195
+
196
+ Targets:
197
+ Response Time p95: <500ms
198
+ Response Time p99: <1000ms
199
+ TPS: >1000
200
+
201
+ Resource Usage:
202
+ - Connection pool usage
203
+ - Thread pool usage
204
+ - Cache hit rate
205
+ - Queue depth
206
+
207
+ Targets:
208
+ Connection Pool: <80%
209
+ Cache Hit Rate: >90%
210
+
211
+ Database:
212
+ - Query response time
213
+ - Slow queries
214
+ - Connection count
215
+ - Deadlocks
216
+
217
+ Targets:
218
+ Query Time p95: <100ms
219
+ Slow Queries: <10/hour
220
+ ```
221
+
222
+ ### Business Metrics
223
+
224
+ ```yaml
225
+ E-Commerce Example:
226
+
227
+ Revenue Metrics:
228
+ - Orders per minute
229
+ - Revenue per minute
230
+ - Cart abandonment rate
231
+ - Conversion rate
232
+
233
+ User Experience:
234
+ - Page load time
235
+ - Time to first byte
236
+ - Search results time
237
+ - Checkout time
238
+
239
+ Operational:
240
+ - Inventory accuracy
241
+ - Order fulfillment time
242
+ - Customer support tickets
243
+ - Failed payments
244
+ ```
245
+
246
+ ## Dashboards
247
+
248
+ ### Executive Dashboard
249
+
250
+ ```yaml
251
+ Executive IT Dashboard:
252
+
253
+ Service Health:
254
+ ┌──────────────────────────────────────┐
255
+ │ Service Status │
256
+ ├──────────────────────────────────────┤
257
+ │ Email: ✅ Operational │
258
+ │ Customer Portal: ✅ Operational │
259
+ │ VPN: ✅ Operational │
260
+ │ File Shares: ⚠️ Degraded │
261
+ │ ERP System: ✅ Operational │
262
+ └──────────────────────────────────────┘
263
+
264
+ SLA Compliance (This Month):
265
+ ┌──────────────────────────────────────┐
266
+ │ Overall SLA: 99.7% ✅ (Target: 99.5%)│
267
+ │ │
268
+ │ Email: 99.95% ✅ │
269
+ │ Portal: 99.80% ✅ │
270
+ │ VPN: 99.50% ✅ │
271
+ │ File Shares: 99.40% ⚠️ │
272
+ └──────────────────────────────────────┘
273
+
274
+ Incidents:
275
+ ┌──────────────────────────────────────┐
276
+ │ Open: 15 (▼ 25% vs last month) │
277
+ │ P1: 0 │
278
+ │ P2: 2 │
279
+ │ P3: 8 │
280
+ │ P4: 5 │
281
+ │ │
282
+ │ MTTR: 2.5 hours ✅ (Target: 4 hours) │
283
+ └──────────────────────────────────────┘
284
+
285
+ Costs:
286
+ ┌──────────────────────────────────────┐
287
+ │ Cloud Spend: $145,000 │
288
+ │ Trend: ▼ 5% vs budget ✅ │
289
+ │ Top Costs: │
290
+ │ 1. Compute: $65,000 (45%) │
291
+ │ 2. Storage: $35,000 (24%) │
292
+ │ 3. Network: $25,000 (17%) │
293
+ └──────────────────────────────────────┘
294
+ ```
295
+
296
+ ### Operations Dashboard
297
+
298
+ ```yaml
299
+ NOC (Network Operations Center) Dashboard:
300
+
301
+ Infrastructure Overview:
302
+ ┌──────────────────────────────────────┐
303
+ │ Servers: 245 ✅ / 3 ⚠️ / 0 ❌ │
304
+ │ Network: 45 ✅ / 1 ⚠️ / 0 ❌ │
305
+ │ Storage: 15 ✅ / 0 ⚠️ / 0 ❌ │
306
+ │ Applications: 32 ✅ / 1 ⚠️ / 0 ❌ │
307
+ └──────────────────────────────────────┘
308
+
309
+ Active Alerts:
310
+ ┌──────────────────────────────────────┐
311
+ │ Critical: 0 │
312
+ │ Warning: 5 │
313
+ │ │
314
+ │ 1. File Server Disk 85% (Warning) │
315
+ │ 2. Web01 CPU 75% (Warning) │
316
+ │ 3. Network Link Latency 25ms (Warn) │
317
+ │ 4. Database Slow Queries (Warning) │
318
+ │ 5. Backup Job Delayed (Warning) │
319
+ └──────────────────────────────────────┘
320
+
321
+ Performance:
322
+ ┌──────────────────────────────────────┐
323
+ │ Application Response Time (p95) │
324
+ │ ████████████░░░░░░░░ 485ms ✅ │
325
+ │ Target: <500ms │
326
+ │ │
327
+ │ Network Latency (avg) │
328
+ │ ████░░░░░░░░░░░░░░░░ 18ms ✅ │
329
+ │ Target: <50ms │
330
+ │ │
331
+ │ Database Query Time (p95) │
332
+ │ ██████░░░░░░░░░░░░░░ 85ms ✅ │
333
+ │ Target: <100ms │
334
+ └──────────────────────────────────────┘
335
+ ```
336
+
337
+ ### Application Dashboard
338
+
339
+ ```yaml
340
+ Application Performance Dashboard:
341
+
342
+ Customer Portal:
343
+
344
+ Response Time Trend (24 hours):
345
+ ┌──────────────────────────────────────┐
346
+ │ p50 ▁▂▃▂▁▂▃▂▁▂▃▂▁▂▃▂▁▂▃▂▁▂▃▂ 250ms│
347
+ │ p95 ▃▅▆▅▃▅▆▅▃▅▆▅▃▅▆▅▃▅▆▅▃▅▆▅ 480ms│
348
+ │ p99 ▆▇█▇▆▇█▇▆▇█▇▆▇█▇▆▇█▇▆▇█▇ 920ms│
349
+ │ │
350
+ │ 00:00 06:00 12:00 18:00 │
351
+ └──────────────────────────────────────┘
352
+
353
+ Error Rate (24 hours):
354
+ ┌──────────────────────────────────────┐
355
+ │ 2% █ │
356
+ │ 1% █ ▆ ▃ │
357
+ │ 0% ▅▃▂▁▂▃▂▁▂▃▂▁▂▃▂▁▂▃▂▁▂▃▂▁▂ │
358
+ │ │
359
+ │ Current: 0.3% ✅ (Target: <1%) │
360
+ └──────────────────────────────────────┘
361
+
362
+ Top Endpoints:
363
+ ┌──────────────────────────────────────┐
364
+ │ Endpoint | Requests | p95 │
365
+ ├──────────────────────────────────────┤
366
+ │ /api/orders | 15,000 | 320ms │
367
+ │ /api/products | 12,500 | 280ms │
368
+ │ /api/customers | 8,000 | 450ms │
369
+ │ /api/search | 6,000 | 650ms │
370
+ │ /api/checkout | 3,500 | 890ms │
371
+ └──────────────────────────────────────┘
372
+
373
+ Database Queries:
374
+ ┌──────────────────────────────────────┐
375
+ │ Slow Queries (>1s): 12 ⚠️ │
376
+ │ │
377
+ │ Top Slow Queries: │
378
+ │ 1. SELECT * FROM orders... (2.5s) │
379
+ │ 2. JOIN customers... (1.8s) │
380
+ │ 3. UPDATE inventory... (1.2s) │
381
+ └──────────────────────────────────────┘
382
+ ```
383
+
384
+ ## Alerting
385
+
386
+ ### Alert Levels
387
+
388
+ ```yaml
389
+ Alert Severity Levels:
390
+
391
+ Critical:
392
+ Description: Service down, immediate action required
393
+ Examples:
394
+ - Production database down
395
+ - Website unreachable
396
+ - Data loss detected
397
+ Response: Page on-call, all hands on deck
398
+ SLA: Response within 15 minutes
399
+
400
+ Warning:
401
+ Description: Threshold exceeded, may impact service
402
+ Examples:
403
+ - Disk 85% full
404
+ - CPU 80% for 10 minutes
405
+ - Backup job delayed
406
+ Response: Investigate within 1 hour
407
+ SLA: Acknowledge within 30 minutes
408
+
409
+ Info:
410
+ Description: Informational, no action required
411
+ Examples:
412
+ - Backup completed successfully
413
+ - Deployment finished
414
+ - Certificate renewed
415
+ Response: Review during business hours
416
+ SLA: No SLA
417
+ ```
418
+
419
+ ### Alert Rules
420
+
421
+ ```yaml
422
+ Server CPU Alert:
423
+
424
+ Metric: cpu.utilization
425
+ Condition: Average > 80% for 10 minutes
426
+ Severity: Warning
427
+
428
+ Actions:
429
+ - Send email to ops-team@company.com
430
+ - Create Slack notification in #ops-alerts
431
+ - Create ServiceNow ticket
432
+
433
+ Escalation:
434
+ If CPU > 90% for 10 minutes:
435
+ - Upgrade to Critical
436
+ - Page on-call engineer
437
+ - Notify manager
438
+
439
+ Auto-remediation:
440
+ If CPU > 95% for 5 minutes:
441
+ - Scale up (add server instance)
442
+ - Restart stuck processes (if configured)
443
+ ```
444
+
445
+ ### Alert Best Practices
446
+
447
+ ```yaml
448
+ Alert Design:
449
+
450
+ 1. Actionable:
451
+ ❌ Bad: "Server CPU high"
452
+ ✅ Good: "Web01 CPU >90% for 15min. Check runaway processes or scale up."
453
+
454
+ 2. Contextual:
455
+ Include:
456
+ - Current value
457
+ - Threshold
458
+ - Duration
459
+ - Impact
460
+ - Runbook link
461
+
462
+ 3. Threshold Tuning:
463
+ - Start conservative (avoid alert fatigue)
464
+ - Adjust based on normal patterns
465
+ - Different thresholds for different times
466
+ - Use anomaly detection
467
+
468
+ 4. Alert Routing:
469
+ - Route to responsible team
470
+ - Escalate if not acknowledged
471
+ - Different channels per severity
472
+ - On-call rotation
473
+
474
+ 5. Alert Deduplication:
475
+ - Group related alerts
476
+ - Suppress dependent alerts
477
+ - Cooldown periods
478
+ - Flapping detection
479
+ ```
480
+
481
+ ## Capacity Management
482
+
483
+ ### Capacity Planning Process
484
+
485
+ ```yaml
486
+ Capacity Planning Cycle:
487
+
488
+ 1. Monitor Current Usage (Ongoing):
489
+ - Track resource utilization
490
+ - Identify trends
491
+ - Collect metrics
492
+
493
+ 2. Forecast Future Demand (Quarterly):
494
+ - Business growth projections
495
+ - Seasonal variations
496
+ - New initiatives
497
+ - Historical trends
498
+
499
+ 3. Analyze Capacity (Quarterly):
500
+ - Current vs forecasted demand
501
+ - Time to resource exhaustion
502
+ - Bottlenecks
503
+ - Optimization opportunities
504
+
505
+ 4. Plan Capacity Changes (Quarterly):
506
+ - Procurement requirements
507
+ - Budget approval
508
+ - Implementation timeline
509
+ - Risk mitigation
510
+
511
+ 5. Implement Changes (As needed):
512
+ - Procure resources
513
+ - Deploy infrastructure
514
+ - Validate capacity
515
+ - Document changes
516
+
517
+ 6. Review and Optimize (Monthly):
518
+ - Actual vs plan
519
+ - Cost efficiency
520
+ - Performance impact
521
+ - Lessons learned
522
+ ```
523
+
524
+ ### Capacity Metrics
525
+
526
+ ```yaml
527
+ Server Capacity:
528
+
529
+ Current State:
530
+ Total Servers: 250
531
+ Average CPU: 45%
532
+ Average Memory: 60%
533
+ Average Disk: 55%
534
+
535
+ Trend (6 months):
536
+ CPU: ▲ 5% increase
537
+ Memory: ▲ 8% increase
538
+ Disk: ▲ 12% increase
539
+
540
+ Forecast (Next 6 months):
541
+ Expected CPU: 55% (10% headroom)
542
+ Expected Memory: 75% (adequate)
543
+ Expected Disk: 75% (adequate)
544
+
545
+ Action Required:
546
+ - None for CPU/Memory
547
+ - Monitor disk growth
548
+ - Plan storage expansion in Q2
549
+
550
+ Storage Capacity:
551
+
552
+ Current: 500 TB used / 750 TB total (67%)
553
+ Growth Rate: 15 TB/month
554
+ Forecast: 590 TB in 6 months (79%)
555
+ Threshold: 80% (warning)
556
+
557
+ Action:
558
+ - Procure additional 250 TB
559
+ - Timeline: Q2 2025
560
+ - Budget: $50,000
561
+
562
+ Network Capacity:
563
+
564
+ Current: 1 Gbps links
565
+ Peak Usage: 650 Mbps (65%)
566
+ Growth: 5% per quarter
567
+ Forecast: 850 Mbps in 12 months (85%)
568
+
569
+ Action:
570
+ - Upgrade to 10 Gbps in Q3
571
+ - Cost: $25,000
572
+ - Provides 10x headroom
573
+ ```
574
+
575
+ ### Capacity Reporting
576
+
577
+ ```yaml
578
+ Monthly Capacity Report:
579
+
580
+ Executive Summary:
581
+ - All systems within capacity targets
582
+ - Storage requiring expansion in 6 months
583
+ - Network upgrade planned Q3
584
+ - No immediate concerns
585
+
586
+ Current Utilization:
587
+ Compute: 45% (Low ✅)
588
+ Memory: 60% (Moderate ✅)
589
+ Storage: 67% (Moderate ✅)
590
+ Network: 65% (Moderate ✅)
591
+
592
+ Trends:
593
+ - Steady 5% quarterly compute growth
594
+ - Storage growth accelerating (cleanup needed)
595
+ - Network stable
596
+
597
+ Forecasts:
598
+ Next 6 Months:
599
+ - Compute: Adequate capacity
600
+ - Storage: Approaching limit (action required)
601
+ - Network: Adequate capacity
602
+
603
+ Next 12 Months:
604
+ - Compute: Adequate capacity
605
+ - Storage: Expansion required
606
+ - Network: Upgrade recommended
607
+
608
+ Actions:
609
+ - Storage procurement initiated
610
+ - Network upgrade planning started
611
+ - Cost: $75,000 (approved)
612
+ ```
613
+
614
+ ## Best Practices
615
+
616
+ ### 1. Monitoring Coverage
617
+
618
+ ```yaml
619
+ Ensure Comprehensive Coverage:
620
+
621
+ Infrastructure:
622
+ - All production servers
623
+ - Network devices
624
+ - Storage systems
625
+ - Virtualization platforms
626
+
627
+ Applications:
628
+ - All critical applications
629
+ - Key transactions
630
+ - Dependencies
631
+ - APIs
632
+
633
+ Business:
634
+ - Revenue metrics
635
+ - User experience
636
+ - SLA compliance
637
+ - Customer satisfaction
638
+ ```
639
+
640
+ ### 2. Baseline Establishment
641
+
642
+ ```yaml
643
+ Establish Performance Baselines:
644
+
645
+ Process:
646
+ 1. Collect metrics for 30 days
647
+ 2. Analyze patterns (daily, weekly)
648
+ 3. Calculate normal ranges
649
+ 4. Set thresholds above baseline
650
+ 5. Review quarterly
651
+
652
+ Example:
653
+ Metric: Application response time
654
+ Baseline (p95): 450ms
655
+ Warning: 600ms (133% of baseline)
656
+ Critical: 900ms (200% of baseline)
657
+ ```
658
+
659
+ ### 3. Alert Fatigue Prevention
660
+
661
+ ```yaml
662
+ Avoid Alert Fatigue:
663
+
664
+ Strategies:
665
+ - Tune thresholds (reduce false positives)
666
+ - Use intelligent alerting (anomaly detection)
667
+ - Implement alert aggregation
668
+ - Regular alert review and cleanup
669
+ - Auto-remediation where possible
670
+
671
+ Metrics:
672
+ - Alert volume: <100/day
673
+ - Alert-to-incident ratio: >50%
674
+ - False positive rate: <10%
675
+ - Time to acknowledge: <5 minutes
676
+ ```
677
+
678
+ ### 4. Correlation and Root Cause
679
+
680
+ ```yaml
681
+ Use Correlation for RCA:
682
+
683
+ Approach:
684
+ - Correlate metrics across layers
685
+ - Identify cascading failures
686
+ - Trace requests end-to-end
687
+ - Link logs to metrics
688
+ - Use dependency mapping
689
+
690
+ Example:
691
+ Symptom: Application slow
692
+ Correlation:
693
+ - Application response time ↑
694
+ - Database query time ↑
695
+ - Database disk I/O ↑
696
+ - Storage latency ↑
697
+ Root Cause: Storage array degraded disk
698
+ ```
699
+
700
+ ### 5. Continuous Improvement
701
+
702
+ ```yaml
703
+ Monitoring Improvement Process:
704
+
705
+ Monthly:
706
+ - Review alert effectiveness
707
+ - Tune thresholds
708
+ - Add missing metrics
709
+ - Update dashboards
710
+
711
+ Quarterly:
712
+ - Capacity planning review
713
+ - Tool evaluation
714
+ - Process optimization
715
+ - Team training
716
+
717
+ Annually:
718
+ - Technology refresh
719
+ - Tool consolidation
720
+ - Architecture review
721
+ - Strategy planning
722
+ ```
723
+
724
+ ---
725
+
726
+ **Related Resources:**
727
+ - [incident-service-management.md](incident-service-management.md) - Incident response
728
+ - [business-continuity.md](business-continuity.md) - DR monitoring
729
+ - [automation-orchestration.md](automation-orchestration.md) - Automated remediation