blockmine 1.24.0 → 1.27.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (476) hide show
  1. package/CHANGELOG.md +76 -1
  2. package/README.en.md +427 -0
  3. package/README.md +40 -0
  4. package/backend/package.json +2 -2
  5. package/backend/prisma/migrations/20260328173000_add_plugin_source_ref/migration.sql +2 -0
  6. package/backend/prisma/migrations/migration_lock.toml +2 -2
  7. package/backend/prisma/schema.prisma +2 -0
  8. package/backend/src/ai/plugin-assistant-system-prompt.md +664 -5
  9. package/backend/src/api/routes/apiKeys.js +8 -0
  10. package/backend/src/api/routes/bots.js +271 -9
  11. package/backend/src/api/routes/eventGraphs.js +151 -1
  12. package/backend/src/api/routes/health.js +38 -0
  13. package/backend/src/api/routes/nodeRegistry.js +63 -0
  14. package/backend/src/api/routes/plugins.js +254 -29
  15. package/backend/src/api/routes/servers.js +14 -2
  16. package/backend/src/container.js +11 -8
  17. package/backend/src/core/BotCommandLoader.js +161 -0
  18. package/backend/src/core/BotConnection.js +125 -0
  19. package/backend/src/core/BotEventHandlers.js +234 -0
  20. package/backend/src/core/BotIPCHandler.js +445 -0
  21. package/backend/src/core/BotManager.js +15 -7
  22. package/backend/src/core/BotProcess.js +169 -140
  23. package/backend/src/core/EventGraphManager.js +7 -3
  24. package/backend/src/core/GraphDebugHandler.js +229 -0
  25. package/backend/src/core/GraphDebugIPC.js +117 -0
  26. package/backend/src/core/GraphExecutionEngine.js +545 -978
  27. package/backend/src/core/GraphTraversal.js +80 -0
  28. package/backend/src/core/GraphValidation.js +73 -0
  29. package/backend/src/core/NodeDefinition.js +138 -0
  30. package/backend/src/core/NodeRegistry.js +153 -141
  31. package/backend/src/core/PluginLoader.js +83 -3
  32. package/backend/src/core/PluginManager.js +346 -35
  33. package/backend/src/core/RewindSignal.js +9 -0
  34. package/backend/src/core/config/ConfigValidator.js +72 -0
  35. package/backend/src/core/config/FeatureFlags.js +52 -0
  36. package/backend/src/core/config/__tests__/ConfigValidator.test.js +232 -0
  37. package/backend/src/core/domain/entities/Bot.js +39 -0
  38. package/backend/src/core/domain/entities/Command.js +41 -0
  39. package/backend/src/core/domain/entities/EventGraph.js +39 -0
  40. package/backend/src/core/domain/entities/Plugin.js +45 -0
  41. package/backend/src/core/domain/entities/User.js +40 -0
  42. package/backend/src/core/domain/services/DependencyResolver.js +168 -0
  43. package/backend/src/core/domain/services/GraphValidator.js +117 -0
  44. package/backend/src/core/domain/services/PermissionChecker.js +34 -0
  45. package/backend/src/core/domain/services/__tests__/DependencyResolver.test.js +126 -0
  46. package/backend/src/core/domain/valueObjects/BotConfig.js +27 -0
  47. package/backend/src/core/domain/valueObjects/DependencyGraph.js +86 -0
  48. package/backend/src/core/domain/valueObjects/PluginManifest.js +36 -0
  49. package/backend/src/core/errors/BaseError.js +29 -0
  50. package/backend/src/core/errors/ErrorHandler.js +81 -0
  51. package/backend/src/core/errors/__tests__/ErrorHandler.test.js +188 -0
  52. package/backend/src/core/errors/index.js +68 -0
  53. package/backend/src/core/infrastructure/BatchingUtility.js +66 -0
  54. package/backend/src/core/infrastructure/CircuitBreaker.js +103 -0
  55. package/backend/src/core/infrastructure/ConnectionPool.js +81 -0
  56. package/backend/src/core/infrastructure/RateLimiter.js +64 -0
  57. package/backend/src/core/infrastructure/__tests__/BatchingUtility.test.js +86 -0
  58. package/backend/src/core/infrastructure/__tests__/CircuitBreaker.test.js +156 -0
  59. package/backend/src/core/infrastructure/__tests__/ConnectionPool.test.js +146 -0
  60. package/backend/src/core/infrastructure/__tests__/RateLimiter.test.js +171 -0
  61. package/backend/src/core/ipc/botApiFactory.js +72 -0
  62. package/backend/src/core/ipc/ipcMessageTypes.js +115 -0
  63. package/backend/src/core/logging/AuditLogger.js +61 -0
  64. package/backend/src/core/logging/StructuredLogger.js +80 -0
  65. package/backend/src/core/logging/__tests__/StructuredLogger.test.js +213 -0
  66. package/backend/src/core/logging/index.js +7 -0
  67. package/backend/src/core/metrics/MetricsCollector.js +104 -0
  68. package/backend/src/core/metrics/__tests__/MetricsCollector.test.js +131 -0
  69. package/backend/src/core/node-registries/actionsNodes.js +191 -0
  70. package/backend/src/core/node-registries/arraysNodes.js +152 -0
  71. package/backend/src/core/node-registries/botNodes.js +48 -0
  72. package/backend/src/core/node-registries/containerNodes.js +141 -0
  73. package/backend/src/core/node-registries/dataNodes.js +284 -0
  74. package/backend/src/core/node-registries/debugNodes.js +23 -0
  75. package/backend/src/core/node-registries/eventsNodes.js +223 -0
  76. package/backend/src/core/node-registries/flowNodes.js +151 -0
  77. package/backend/src/core/node-registries/furnaceNodes.js +123 -0
  78. package/backend/src/core/node-registries/index.js +108 -0
  79. package/backend/src/core/node-registries/inventory.js +102 -106
  80. package/backend/src/core/node-registries/logicNodes.js +54 -0
  81. package/backend/src/core/node-registries/mathNodes.js +38 -0
  82. package/backend/src/core/node-registries/navigationNodes.js +109 -0
  83. package/backend/src/core/node-registries/objectsNodes.js +90 -0
  84. package/backend/src/core/node-registries/stringsNodes.js +165 -0
  85. package/backend/src/core/node-registries/timeNodes.js +105 -0
  86. package/backend/src/core/node-registries/typeNodes.js +22 -0
  87. package/backend/src/core/node-registries/usersNodes.js +126 -0
  88. package/backend/src/core/nodes/arrays/shuffle.js +14 -0
  89. package/backend/src/core/nodes/bot/get_name.js +8 -0
  90. package/backend/src/core/nodes/bot/stop_bot.js +5 -0
  91. package/backend/src/core/nodes/container/open.js +101 -111
  92. package/backend/src/core/nodes/data/store_read.js +26 -0
  93. package/backend/src/core/nodes/data/store_write.js +23 -0
  94. package/backend/src/core/nodes/event/call_event.js +31 -0
  95. package/backend/src/core/nodes/event/custom_event.js +8 -0
  96. package/backend/src/core/nodes/flow/timer.js +35 -0
  97. package/backend/src/core/nodes/inventory/drop.js +73 -65
  98. package/backend/src/core/nodes/inventory/equip.js +54 -45
  99. package/backend/src/core/nodes/inventory/select_slot.js +48 -46
  100. package/backend/src/core/nodes/navigation/follow.js +54 -51
  101. package/backend/src/core/nodes/navigation/go_to.js +41 -53
  102. package/backend/src/core/nodes/navigation/go_to_entity.js +65 -69
  103. package/backend/src/core/nodes/navigation/go_to_player.js +65 -70
  104. package/backend/src/core/nodes/navigation/stop.js +17 -26
  105. package/backend/src/core/nodes/users/add_to_group.js +24 -0
  106. package/backend/src/core/nodes/users/check_permission.js +26 -0
  107. package/backend/src/core/nodes/users/remove_from_group.js +24 -0
  108. package/backend/src/core/services/BotIPCMessageRouter.js +337 -0
  109. package/backend/src/core/services/BotLifecycleService.js +43 -450
  110. package/backend/src/core/services/CacheManager.js +83 -23
  111. package/backend/src/core/services/CrashRestartManager.js +42 -0
  112. package/backend/src/core/services/DebugSessionManager.js +114 -12
  113. package/backend/src/core/services/EventGraphService.js +69 -0
  114. package/backend/src/core/services/MinecraftBotManager.js +9 -1
  115. package/backend/src/core/services/PluginManagementService.js +84 -0
  116. package/backend/src/core/services/TestModeContext.js +65 -0
  117. package/backend/src/core/services/__tests__/CacheManager.test.js +168 -0
  118. package/backend/src/core/services.js +1 -11
  119. package/backend/src/core/validation/InputValidator.js +167 -0
  120. package/backend/src/core/validation/__tests__/InputValidator.test.js +296 -0
  121. package/backend/src/real-time/botApi/index.js +1 -1
  122. package/backend/src/real-time/socketHandler.js +26 -0
  123. package/backend/src/server.js +21 -6
  124. package/frontend/dist/assets/browser-ponyfill-D8y0Ty7C.js +2 -0
  125. package/frontend/dist/assets/index-CFJLS0dk.css +32 -0
  126. package/frontend/dist/assets/index-D91UGNMG.js +11260 -0
  127. package/frontend/dist/flags/en.svg +32 -0
  128. package/frontend/dist/flags/ru.svg +5 -0
  129. package/frontend/dist/index.html +2 -2
  130. package/frontend/dist/locales/en/admin.json +100 -0
  131. package/frontend/dist/locales/en/api-keys.json +58 -0
  132. package/frontend/dist/locales/en/bots.json +113 -0
  133. package/frontend/dist/locales/en/common.json +53 -0
  134. package/frontend/dist/locales/en/configuration.json +22 -0
  135. package/frontend/dist/locales/en/console.json +10 -0
  136. package/frontend/dist/locales/en/dashboard.json +85 -0
  137. package/frontend/dist/locales/en/dialogs.json +70 -0
  138. package/frontend/dist/locales/en/event-graphs.json +50 -0
  139. package/frontend/dist/locales/en/graph-store.json +70 -0
  140. package/frontend/dist/locales/en/login.json +36 -0
  141. package/frontend/dist/locales/en/management.json +192 -0
  142. package/frontend/dist/locales/en/minecraft-viewer.json +27 -0
  143. package/frontend/dist/locales/en/nodes.json +1132 -0
  144. package/frontend/dist/locales/en/permissions.json +50 -0
  145. package/frontend/dist/locales/en/plugin-detail.json +69 -0
  146. package/frontend/dist/locales/en/plugins.json +329 -0
  147. package/frontend/dist/locales/en/proxies.json +81 -0
  148. package/frontend/dist/locales/en/servers.json +39 -0
  149. package/frontend/dist/locales/en/setup.json +19 -0
  150. package/frontend/dist/locales/en/sidebar.json +195 -0
  151. package/frontend/dist/locales/en/tasks.json +62 -0
  152. package/frontend/dist/locales/en/visual-editor.json +418 -0
  153. package/frontend/dist/locales/en/websocket.json +86 -0
  154. package/frontend/dist/locales/ru/admin.json +100 -0
  155. package/frontend/dist/locales/ru/api-keys.json +58 -0
  156. package/frontend/dist/locales/ru/bots.json +113 -0
  157. package/frontend/dist/locales/ru/common.json +49 -0
  158. package/frontend/dist/locales/ru/configuration.json +22 -0
  159. package/frontend/dist/locales/ru/console.json +10 -0
  160. package/frontend/dist/locales/ru/dashboard.json +85 -0
  161. package/frontend/dist/locales/ru/dialogs.json +70 -0
  162. package/frontend/dist/locales/ru/event-graphs.json +50 -0
  163. package/frontend/dist/locales/ru/graph-store.json +70 -0
  164. package/frontend/dist/locales/ru/login.json +36 -0
  165. package/frontend/dist/locales/ru/management.json +192 -0
  166. package/frontend/dist/locales/ru/minecraft-viewer.json +30 -0
  167. package/frontend/dist/locales/ru/nodes.json +1131 -0
  168. package/frontend/dist/locales/ru/permissions.json +50 -0
  169. package/frontend/dist/locales/ru/plugin-detail.json +49 -0
  170. package/frontend/dist/locales/ru/plugins.json +209 -0
  171. package/frontend/dist/locales/ru/proxies.json +81 -0
  172. package/frontend/dist/locales/ru/servers.json +39 -0
  173. package/frontend/dist/locales/ru/setup.json +19 -0
  174. package/frontend/dist/locales/ru/sidebar.json +195 -0
  175. package/frontend/dist/locales/ru/tasks.json +62 -0
  176. package/frontend/dist/locales/ru/visual-editor.json +420 -0
  177. package/frontend/dist/locales/ru/websocket.json +86 -0
  178. package/frontend/dist/monacoeditorwork/css.worker.bundle.js +7 -7
  179. package/frontend/dist/monacoeditorwork/html.worker.bundle.js +7 -7
  180. package/frontend/dist/monacoeditorwork/json.worker.bundle.js +7 -7
  181. package/frontend/dist/monacoeditorwork/ts.worker.bundle.js +3 -3
  182. package/frontend/package.json +6 -0
  183. package/nul +12 -0
  184. package/package.json +3 -3
  185. package/screen/3dviewer.png +0 -0
  186. package/screen/console.png +0 -0
  187. package/screen/dashboard.png +0 -0
  188. package/screen/graph_collabe.png +0 -0
  189. package/screen/graph_live_debug.png +0 -0
  190. package/screen/language_selector.png +0 -0
  191. package/screen/management_command.png +0 -0
  192. package/screen/node_debug_trace.png +0 -0
  193. package/screen/plugin_/320/276/320/261/320/267/320/276/321/200.png +0 -0
  194. package/screen/websocket.png +0 -0
  195. package/screen//320/275/320/260/321/201/321/202/321/200/320/276/320/271/320/272/320/270_/320/276/321/202/320/264/320/265/320/273/321/214/320/275/321/213/321/205_/320/272/320/276/320/274/320/260/320/275/320/264_/320/272/320/260/320/266/320/264/321/203_/320/272/320/276/320/274/320/260/320/275/320/273/320/264/321/203_/320/274/320/276/320/266/320/275/320/276_/320/275/320/260/321/201/321/202/321/200/320/260/320/270/320/262/320/260/321/202/321/214.png +0 -0
  196. package/screen//320/277/320/273/320/260/320/275/320/270/321/200/320/276/320/262/321/211/320/270/320/272_/320/274/320/276/320/266/320/275/320/276_/320/267/320/260/320/264/320/260/320/262/320/260/321/202/321/214_/320/264/320/265/320/271/321/201/321/202/320/262/320/270/321/217_/320/277/320/276_/320/262/321/200/320/265/320/274/320/265/320/275/320/270.png +0 -0
  197. package/.claude/agents/README.md +0 -469
  198. package/.claude/agents/auth-route-debugger.md +0 -118
  199. package/.claude/agents/auth-route-tester.md +0 -93
  200. package/.claude/agents/auto-error-resolver.md +0 -97
  201. package/.claude/agents/build-optimizer.md +0 -236
  202. package/.claude/agents/code-architect.md +0 -34
  203. package/.claude/agents/code-architecture-reviewer.md +0 -83
  204. package/.claude/agents/code-explorer.md +0 -51
  205. package/.claude/agents/code-refactor-master.md +0 -94
  206. package/.claude/agents/code-reviewer.md +0 -46
  207. package/.claude/agents/cost-optimizer.md +0 -134
  208. package/.claude/agents/deployment-orchestrator.md +0 -113
  209. package/.claude/agents/documentation-architect.md +0 -82
  210. package/.claude/agents/frontend-error-fixer.md +0 -77
  211. package/.claude/agents/iac-code-generator.md +0 -71
  212. package/.claude/agents/incident-responder.md +0 -346
  213. package/.claude/agents/infrastructure-architect.md +0 -31
  214. package/.claude/agents/kubernetes-specialist.md +0 -56
  215. package/.claude/agents/migration-planner.md +0 -181
  216. package/.claude/agents/network-architect.md +0 -196
  217. package/.claude/agents/plan-reviewer.md +0 -52
  218. package/.claude/agents/refactor-planner.md +0 -63
  219. package/.claude/agents/security-scanner.md +0 -102
  220. package/.claude/agents/web-research-specialist.md +0 -78
  221. package/.claude/commands/cost-analysis.md +0 -315
  222. package/.claude/commands/dev-docs-update.md +0 -55
  223. package/.claude/commands/dev-docs.md +0 -51
  224. package/.claude/commands/feature-dev.md +0 -125
  225. package/.claude/commands/incident-debug.md +0 -247
  226. package/.claude/commands/infra-plan.md +0 -81
  227. package/.claude/commands/migration-plan.md +0 -478
  228. package/.claude/commands/route-research-for-testing.md +0 -37
  229. package/.claude/commands/security-review.md +0 -66
  230. package/.claude/hooks/CONFIG.md +0 -448
  231. package/.claude/hooks/README.md +0 -163
  232. package/.claude/hooks/SKILL_ACTIVATION_COMPLETE.md +0 -226
  233. package/.claude/hooks/WINDOWS_HOOKS_README.md +0 -151
  234. package/.claude/hooks/add-skill-activation-banners.ts +0 -132
  235. package/.claude/hooks/comprehensive-skill-test.ts +0 -1315
  236. package/.claude/hooks/error-handling-reminder.sh +0 -12
  237. package/.claude/hooks/error-handling-reminder.ts +0 -222
  238. package/.claude/hooks/k8s-manifest-validator.sh +0 -56
  239. package/.claude/hooks/package-lock.json +0 -556
  240. package/.claude/hooks/package.json +0 -16
  241. package/.claude/hooks/post-tool-use-tracker.ps1 +0 -174
  242. package/.claude/hooks/post-tool-use-tracker.sh +0 -183
  243. package/.claude/hooks/security-policy-check.sh +0 -247
  244. package/.claude/hooks/skill-activation-prompt.ps1 +0 -10
  245. package/.claude/hooks/skill-activation-prompt.sh +0 -10
  246. package/.claude/hooks/skill-activation-prompt.ts +0 -141
  247. package/.claude/hooks/stop-build-check-enhanced.sh +0 -130
  248. package/.claude/hooks/terraform-validator.sh +0 -53
  249. package/.claude/hooks/test-input.json +0 -7
  250. package/.claude/hooks/test-skill-activation.ts +0 -427
  251. package/.claude/hooks/trigger-build-resolver.sh +0 -79
  252. package/.claude/hooks/tsc-check.sh +0 -173
  253. package/.claude/hooks/tsconfig.json +0 -19
  254. package/.claude/settings.json +0 -59
  255. package/.claude/settings.local.json +0 -67
  256. package/.claude/skills/README.md +0 -507
  257. package/.claude/skills/api-engineering/SKILL.md +0 -63
  258. package/.claude/skills/api-engineering/resources/api-versioning.md +0 -88
  259. package/.claude/skills/api-engineering/resources/graphql-patterns.md +0 -106
  260. package/.claude/skills/api-engineering/resources/rate-limiting.md +0 -118
  261. package/.claude/skills/api-engineering/resources/rest-api-design.md +0 -105
  262. package/.claude/skills/backend-dev-guidelines/SKILL.md +0 -306
  263. package/.claude/skills/backend-dev-guidelines/resources/architecture-overview.md +0 -451
  264. package/.claude/skills/backend-dev-guidelines/resources/async-and-errors.md +0 -307
  265. package/.claude/skills/backend-dev-guidelines/resources/complete-examples.md +0 -638
  266. package/.claude/skills/backend-dev-guidelines/resources/configuration.md +0 -275
  267. package/.claude/skills/backend-dev-guidelines/resources/database-patterns.md +0 -224
  268. package/.claude/skills/backend-dev-guidelines/resources/middleware-guide.md +0 -213
  269. package/.claude/skills/backend-dev-guidelines/resources/routing-and-controllers.md +0 -756
  270. package/.claude/skills/backend-dev-guidelines/resources/sentry-and-monitoring.md +0 -336
  271. package/.claude/skills/backend-dev-guidelines/resources/services-and-repositories.md +0 -789
  272. package/.claude/skills/backend-dev-guidelines/resources/testing-guide.md +0 -235
  273. package/.claude/skills/backend-dev-guidelines/resources/validation-patterns.md +0 -754
  274. package/.claude/skills/budget-and-cost-management/SKILL.md +0 -850
  275. package/.claude/skills/build-engineering/SKILL.md +0 -431
  276. package/.claude/skills/build-engineering/resources/artifact-repositories.md +0 -72
  277. package/.claude/skills/build-engineering/resources/build-caching.md +0 -96
  278. package/.claude/skills/build-engineering/resources/build-pipelines.md +0 -105
  279. package/.claude/skills/build-engineering/resources/build-security.md +0 -95
  280. package/.claude/skills/build-engineering/resources/build-systems.md +0 -389
  281. package/.claude/skills/build-engineering/resources/compilation-optimization.md +0 -201
  282. package/.claude/skills/build-engineering/resources/dependency-management.md +0 -73
  283. package/.claude/skills/build-engineering/resources/monorepo-builds.md +0 -110
  284. package/.claude/skills/build-engineering/resources/performance-optimization.md +0 -113
  285. package/.claude/skills/build-engineering/resources/reproducible-builds.md +0 -82
  286. package/.claude/skills/cloud-engineering/SKILL.md +0 -675
  287. package/.claude/skills/cloud-engineering/resources/aws-patterns.md +0 -742
  288. package/.claude/skills/cloud-engineering/resources/azure-patterns.md +0 -714
  289. package/.claude/skills/cloud-engineering/resources/cleared-cloud-environments.md +0 -987
  290. package/.claude/skills/cloud-engineering/resources/cloud-cost-optimization.md +0 -757
  291. package/.claude/skills/cloud-engineering/resources/cloud-networking.md +0 -1058
  292. package/.claude/skills/cloud-engineering/resources/cloud-security-tools.md +0 -1530
  293. package/.claude/skills/cloud-engineering/resources/cloud-security.md +0 -990
  294. package/.claude/skills/cloud-engineering/resources/gcp-patterns.md +0 -758
  295. package/.claude/skills/cloud-engineering/resources/migration-strategies.md +0 -820
  296. package/.claude/skills/cloud-engineering/resources/multi-cloud-strategies.md +0 -670
  297. package/.claude/skills/cloud-engineering/resources/oci-patterns.md +0 -1198
  298. package/.claude/skills/cloud-engineering/resources/serverless-patterns.md +0 -795
  299. package/.claude/skills/cloud-engineering/resources/well-architected-frameworks.md +0 -966
  300. package/.claude/skills/cybersecurity/SKILL.md +0 -409
  301. package/.claude/skills/cybersecurity/resources/security-architecture.md +0 -266
  302. package/.claude/skills/database-engineering/SKILL.md +0 -61
  303. package/.claude/skills/database-engineering/resources/backup-and-recovery.md +0 -72
  304. package/.claude/skills/database-engineering/resources/database-replication.md +0 -63
  305. package/.claude/skills/database-engineering/resources/postgresql-fundamentals.md +0 -70
  306. package/.claude/skills/database-engineering/resources/query-optimization.md +0 -68
  307. package/.claude/skills/devsecops/SKILL.md +0 -374
  308. package/.claude/skills/devsecops/resources/ci-cd-security.md +0 -204
  309. package/.claude/skills/devsecops/resources/compliance-automation.md +0 -530
  310. package/.claude/skills/devsecops/resources/compliance-frameworks.md +0 -2322
  311. package/.claude/skills/devsecops/resources/container-security.md +0 -915
  312. package/.claude/skills/devsecops/resources/cspm-integration.md +0 -1440
  313. package/.claude/skills/devsecops/resources/policy-enforcement.md +0 -619
  314. package/.claude/skills/devsecops/resources/secrets-management.md +0 -755
  315. package/.claude/skills/devsecops/resources/security-monitoring.md +0 -146
  316. package/.claude/skills/devsecops/resources/security-scanning.md +0 -887
  317. package/.claude/skills/devsecops/resources/security-testing.md +0 -203
  318. package/.claude/skills/devsecops/resources/supply-chain-security.md +0 -518
  319. package/.claude/skills/devsecops/resources/vulnerability-management.md +0 -481
  320. package/.claude/skills/devsecops/resources/zero-trust-architecture.md +0 -177
  321. package/.claude/skills/documentation-as-code/SKILL.md +0 -323
  322. package/.claude/skills/documentation-as-code/resources/api-documentation.md +0 -90
  323. package/.claude/skills/documentation-as-code/resources/changelog-management.md +0 -79
  324. package/.claude/skills/documentation-as-code/resources/diagram-generation.md +0 -44
  325. package/.claude/skills/documentation-as-code/resources/docs-as-code-workflow.md +0 -99
  326. package/.claude/skills/documentation-as-code/resources/documentation-automation.md +0 -68
  327. package/.claude/skills/documentation-as-code/resources/documentation-sites.md +0 -79
  328. package/.claude/skills/documentation-as-code/resources/markdown-best-practices.md +0 -162
  329. package/.claude/skills/documentation-as-code/resources/openapi-specification.md +0 -77
  330. package/.claude/skills/documentation-as-code/resources/readme-engineering.md +0 -60
  331. package/.claude/skills/documentation-as-code/resources/technical-writing-guide.md +0 -202
  332. package/.claude/skills/engineering-management/SKILL.md +0 -356
  333. package/.claude/skills/engineering-management/resources/career-ladders.md +0 -609
  334. package/.claude/skills/engineering-management/resources/hiring-and-assessment.md +0 -555
  335. package/.claude/skills/engineering-management/resources/one-on-one-guides.md +0 -609
  336. package/.claude/skills/engineering-management/resources/resource-planning.md +0 -557
  337. package/.claude/skills/engineering-management/resources/team-organization-patterns.md +0 -491
  338. package/.claude/skills/engineering-management/resources/technical-interviews.md +0 -474
  339. package/.claude/skills/engineering-operations-management/SKILL.md +0 -817
  340. package/.claude/skills/error-tracking/SKILL.md +0 -379
  341. package/.claude/skills/frontend-design/SKILL.md +0 -42
  342. package/.claude/skills/frontend-dev-guidelines/SKILL.md +0 -403
  343. package/.claude/skills/frontend-dev-guidelines/resources/common-patterns.md +0 -331
  344. package/.claude/skills/frontend-dev-guidelines/resources/complete-examples.md +0 -872
  345. package/.claude/skills/frontend-dev-guidelines/resources/component-patterns.md +0 -502
  346. package/.claude/skills/frontend-dev-guidelines/resources/data-fetching.md +0 -767
  347. package/.claude/skills/frontend-dev-guidelines/resources/file-organization.md +0 -502
  348. package/.claude/skills/frontend-dev-guidelines/resources/loading-and-error-states.md +0 -501
  349. package/.claude/skills/frontend-dev-guidelines/resources/performance.md +0 -406
  350. package/.claude/skills/frontend-dev-guidelines/resources/routing-guide.md +0 -364
  351. package/.claude/skills/frontend-dev-guidelines/resources/styling-guide.md +0 -428
  352. package/.claude/skills/frontend-dev-guidelines/resources/typescript-standards.md +0 -418
  353. package/.claude/skills/general-it-engineering/SKILL.md +0 -393
  354. package/.claude/skills/general-it-engineering/resources/asset-management.md +0 -712
  355. package/.claude/skills/general-it-engineering/resources/automation-orchestration.md +0 -817
  356. package/.claude/skills/general-it-engineering/resources/business-continuity.md +0 -786
  357. package/.claude/skills/general-it-engineering/resources/change-management.md +0 -715
  358. package/.claude/skills/general-it-engineering/resources/enterprise-monitoring.md +0 -729
  359. package/.claude/skills/general-it-engineering/resources/help-desk-operations.md +0 -738
  360. package/.claude/skills/general-it-engineering/resources/incident-service-management.md +0 -834
  361. package/.claude/skills/general-it-engineering/resources/it-governance.md +0 -753
  362. package/.claude/skills/general-it-engineering/resources/itil-framework.md +0 -503
  363. package/.claude/skills/general-it-engineering/resources/service-management.md +0 -669
  364. package/.claude/skills/infrastructure-architecture/SKILL.md +0 -328
  365. package/.claude/skills/infrastructure-architecture/resources/architecture-decision-records.md +0 -505
  366. package/.claude/skills/infrastructure-architecture/resources/architecture-patterns.md +0 -528
  367. package/.claude/skills/infrastructure-architecture/resources/capacity-planning.md +0 -453
  368. package/.claude/skills/infrastructure-architecture/resources/cleared-environment-architecture.md +0 -773
  369. package/.claude/skills/infrastructure-architecture/resources/cost-architecture.md +0 -499
  370. package/.claude/skills/infrastructure-architecture/resources/data-architecture.md +0 -501
  371. package/.claude/skills/infrastructure-architecture/resources/disaster-recovery.md +0 -535
  372. package/.claude/skills/infrastructure-architecture/resources/migration-architecture.md +0 -512
  373. package/.claude/skills/infrastructure-architecture/resources/multi-region-design.md +0 -608
  374. package/.claude/skills/infrastructure-architecture/resources/reference-architectures.md +0 -562
  375. package/.claude/skills/infrastructure-architecture/resources/security-architecture.md +0 -538
  376. package/.claude/skills/infrastructure-architecture/resources/system-design-principles.md +0 -489
  377. package/.claude/skills/infrastructure-architecture/resources/workload-classification.md +0 -1000
  378. package/.claude/skills/infrastructure-strategy/SKILL.md +0 -924
  379. package/.claude/skills/network-engineering/SKILL.md +0 -385
  380. package/.claude/skills/network-engineering/resources/dns-management.md +0 -738
  381. package/.claude/skills/network-engineering/resources/load-balancing.md +0 -820
  382. package/.claude/skills/network-engineering/resources/network-architecture.md +0 -546
  383. package/.claude/skills/network-engineering/resources/network-security.md +0 -921
  384. package/.claude/skills/network-engineering/resources/network-troubleshooting.md +0 -749
  385. package/.claude/skills/network-engineering/resources/routing-switching.md +0 -373
  386. package/.claude/skills/network-engineering/resources/sdn-networking.md +0 -695
  387. package/.claude/skills/network-engineering/resources/service-mesh-networking.md +0 -777
  388. package/.claude/skills/network-engineering/resources/tcp-ip-protocols.md +0 -444
  389. package/.claude/skills/network-engineering/resources/vpn-connectivity.md +0 -672
  390. package/.claude/skills/node-development/SKILL.md +0 -317
  391. package/.claude/skills/observability-engineering/SKILL.md +0 -101
  392. package/.claude/skills/observability-engineering/resources/apm-tools.md +0 -97
  393. package/.claude/skills/observability-engineering/resources/correlation-strategies.md +0 -87
  394. package/.claude/skills/observability-engineering/resources/distributed-tracing.md +0 -98
  395. package/.claude/skills/observability-engineering/resources/logs-aggregation.md +0 -118
  396. package/.claude/skills/observability-engineering/resources/observability-cost-optimization.md +0 -141
  397. package/.claude/skills/observability-engineering/resources/opentelemetry.md +0 -110
  398. package/.claude/skills/platform-engineering/SKILL.md +0 -555
  399. package/.claude/skills/platform-engineering/resources/architecture-overview.md +0 -600
  400. package/.claude/skills/platform-engineering/resources/container-orchestration.md +0 -916
  401. package/.claude/skills/platform-engineering/resources/cost-optimization.md +0 -634
  402. package/.claude/skills/platform-engineering/resources/developer-platforms.md +0 -670
  403. package/.claude/skills/platform-engineering/resources/gitops-automation.md +0 -650
  404. package/.claude/skills/platform-engineering/resources/infrastructure-as-code.md +0 -778
  405. package/.claude/skills/platform-engineering/resources/infrastructure-standards.md +0 -708
  406. package/.claude/skills/platform-engineering/resources/multi-tenancy.md +0 -602
  407. package/.claude/skills/platform-engineering/resources/platform-security.md +0 -711
  408. package/.claude/skills/platform-engineering/resources/resource-management.md +0 -592
  409. package/.claude/skills/platform-engineering/resources/service-mesh.md +0 -628
  410. package/.claude/skills/release-engineering/SKILL.md +0 -393
  411. package/.claude/skills/release-engineering/resources/artifact-management.md +0 -108
  412. package/.claude/skills/release-engineering/resources/build-optimization.md +0 -84
  413. package/.claude/skills/release-engineering/resources/ci-cd-pipelines.md +0 -411
  414. package/.claude/skills/release-engineering/resources/deployment-strategies.md +0 -197
  415. package/.claude/skills/release-engineering/resources/pipeline-security.md +0 -62
  416. package/.claude/skills/release-engineering/resources/progressive-delivery.md +0 -83
  417. package/.claude/skills/release-engineering/resources/release-automation.md +0 -68
  418. package/.claude/skills/release-engineering/resources/release-orchestration.md +0 -77
  419. package/.claude/skills/release-engineering/resources/rollback-strategies.md +0 -66
  420. package/.claude/skills/release-engineering/resources/versioning-strategies.md +0 -59
  421. package/.claude/skills/route-tester/SKILL.md +0 -392
  422. package/.claude/skills/skill-developer/ADVANCED.md +0 -197
  423. package/.claude/skills/skill-developer/HOOK_MECHANISMS.md +0 -306
  424. package/.claude/skills/skill-developer/PATTERNS_LIBRARY.md +0 -152
  425. package/.claude/skills/skill-developer/SKILL.md +0 -430
  426. package/.claude/skills/skill-developer/SKILL_RULES_REFERENCE.md +0 -315
  427. package/.claude/skills/skill-developer/TRIGGER_TYPES.md +0 -305
  428. package/.claude/skills/skill-developer/TROUBLESHOOTING.md +0 -514
  429. package/.claude/skills/skill-rules.json +0 -2989
  430. package/.claude/skills/sre/SKILL.md +0 -464
  431. package/.claude/skills/sre/resources/alerting-best-practices.md +0 -282
  432. package/.claude/skills/sre/resources/capacity-planning.md +0 -226
  433. package/.claude/skills/sre/resources/chaos-engineering.md +0 -193
  434. package/.claude/skills/sre/resources/disaster-recovery.md +0 -232
  435. package/.claude/skills/sre/resources/incident-management.md +0 -436
  436. package/.claude/skills/sre/resources/observability-stack.md +0 -240
  437. package/.claude/skills/sre/resources/on-call-runbooks.md +0 -167
  438. package/.claude/skills/sre/resources/performance-optimization.md +0 -108
  439. package/.claude/skills/sre/resources/reliability-patterns.md +0 -183
  440. package/.claude/skills/sre/resources/slo-sli-sla.md +0 -464
  441. package/.claude/skills/sre/resources/toil-reduction.md +0 -145
  442. package/.claude/skills/systems-engineering/SKILL.md +0 -648
  443. package/.claude/skills/systems-engineering/resources/automation-patterns.md +0 -771
  444. package/.claude/skills/systems-engineering/resources/configuration-management.md +0 -998
  445. package/.claude/skills/systems-engineering/resources/linux-administration.md +0 -672
  446. package/.claude/skills/systems-engineering/resources/networking-fundamentals.md +0 -982
  447. package/.claude/skills/systems-engineering/resources/performance-tuning.md +0 -871
  448. package/.claude/skills/systems-engineering/resources/powershell-scripting.md +0 -482
  449. package/.claude/skills/systems-engineering/resources/security-hardening.md +0 -739
  450. package/.claude/skills/systems-engineering/resources/shell-scripting.md +0 -915
  451. package/.claude/skills/systems-engineering/resources/storage-management.md +0 -628
  452. package/.claude/skills/systems-engineering/resources/system-monitoring.md +0 -787
  453. package/.claude/skills/systems-engineering/resources/troubleshooting-guide.md +0 -753
  454. package/.claude/skills/systems-engineering/resources/windows-administration.md +0 -738
  455. package/.claude/skills/technical-leadership/SKILL.md +0 -728
  456. package/backend/docs/SECRETS_DOCUMENTATION.md +0 -327
  457. package/backend/package-lock.json +0 -6801
  458. package/backend/src/core/node-registries/actions.js +0 -202
  459. package/backend/src/core/node-registries/arrays.js +0 -155
  460. package/backend/src/core/node-registries/bot.js +0 -23
  461. package/backend/src/core/node-registries/container.js +0 -162
  462. package/backend/src/core/node-registries/data.js +0 -290
  463. package/backend/src/core/node-registries/debug.js +0 -26
  464. package/backend/src/core/node-registries/events.js +0 -201
  465. package/backend/src/core/node-registries/flow.js +0 -139
  466. package/backend/src/core/node-registries/furnace.js +0 -143
  467. package/backend/src/core/node-registries/logic.js +0 -62
  468. package/backend/src/core/node-registries/math.js +0 -42
  469. package/backend/src/core/node-registries/navigation.js +0 -111
  470. package/backend/src/core/node-registries/objects.js +0 -98
  471. package/backend/src/core/node-registries/strings.js +0 -187
  472. package/backend/src/core/node-registries/time.js +0 -113
  473. package/backend/src/core/node-registries/type.js +0 -25
  474. package/backend/src/core/node-registries/users.js +0 -79
  475. package/frontend/dist/assets/index-BC-NbKXi.css +0 -32
  476. package/frontend/dist/assets/index-DqJXZMHY.js +0 -11266
@@ -1,282 +0,0 @@
1
- # Alerting Best Practices
2
-
3
- Alert design principles, notification routing (PagerDuty, OpsGenie), alert fatigue prevention, and effective on-call alerting strategies.
4
-
5
- ## Table of Contents
6
-
7
- - [Alert Design Principles](#alert-design-principles)
8
- - [Alert Rules](#alert-rules)
9
- - [Notification Routing](#notification-routing)
10
- - [Alert Fatigue Prevention](#alert-fatigue-prevention)
11
- - [Best Practices](#best-practices)
12
-
13
- ## Alert Design Principles
14
-
15
- **Good Alerts:**
16
- ```
17
- ✅ Actionable - Can be fixed immediately
18
- ✅ Specific - Clear what's wrong
19
- ✅ User-impacting - Affects customers
20
- ✅ Urgent - Requires immediate attention
21
- ✅ Novel - Not duplicate of existing alert
22
- ```
23
-
24
- **Bad Alerts:**
25
- ```
26
- ❌ Noisy - Frequent false positives
27
- ❌ Vague - Unclear what to do
28
- ❌ Premature - Fires before issue impacts users
29
- ❌ Duplicate - Same as other alerts
30
- ❌ Low-priority - Can wait until business hours
31
- ```
32
-
33
- ## Alert Rules
34
-
35
- **Prometheus Alerting:**
36
- ```yaml
37
- groups:
38
- - name: slo_alerts
39
- rules:
40
- # Good: User-impacting, actionable
41
- - alert: HighErrorRate
42
- expr: |
43
- (
44
- sum(rate(http_requests_total{status=~"5.."}[5m]))
45
- /
46
- sum(rate(http_requests_total[5m]))
47
- ) > 0.05
48
- for: 5m
49
- labels:
50
- severity: critical
51
- team: platform
52
- annotations:
53
- summary: "Error rate above 5% for 5 minutes"
54
- description: "{{ $value | humanizePercentage }} of requests failing"
55
- runbook: "https://runbooks.example.com/high-error-rate"
56
- dashboard: "https://grafana.example.com/d/service-health"
57
-
58
- # Good: SLO-based, clear threshold
59
- - alert: LatencyP95High
60
- expr: |
61
- histogram_quantile(0.95,
62
- rate(http_request_duration_seconds_bucket[5m])
63
- ) > 0.5
64
- for: 10m
65
- labels:
66
- severity: warning
67
- team: platform
68
- annotations:
69
- summary: "P95 latency above 500ms"
70
- impact: "Users experiencing slow response times"
71
- ```
72
-
73
- **Multi-Window Alerts:**
74
- ```yaml
75
- # Fast burn + slow burn
76
- - alert: ErrorBudgetBurn
77
- expr: |
78
- (
79
- sum(rate(http_requests_total{status=~"5.."}[1h]))
80
- /
81
- sum(rate(http_requests_total[1h]))
82
- > (14.4 * (1 - 0.999))
83
- )
84
- and
85
- (
86
- sum(rate(http_requests_total{status=~"5.."}[5m]))
87
- /
88
- sum(rate(http_requests_total[5m]))
89
- > (14.4 * (1 - 0.999))
90
- )
91
- labels:
92
- severity: critical
93
- annotations:
94
- summary: "Error budget burning at 14.4x rate"
95
- ```
96
-
97
- ## Notification Routing
98
-
99
- **AlertManager Config:**
100
- ```yaml
101
- route:
102
- receiver: default
103
- group_by: ['alertname', 'cluster']
104
- group_wait: 30s
105
- group_interval: 5m
106
- repeat_interval: 12h
107
-
108
- routes:
109
- # Critical: Page immediately
110
- - match:
111
- severity: critical
112
- receiver: pagerduty
113
- group_wait: 10s
114
- repeat_interval: 5m
115
-
116
- # Warning: Slack notification
117
- - match:
118
- severity: warning
119
- receiver: slack
120
- repeat_interval: 4h
121
-
122
- # Info: Email only
123
- - match:
124
- severity: info
125
- receiver: email
126
- repeat_interval: 24h
127
-
128
- receivers:
129
- - name: pagerduty
130
- pagerduty_configs:
131
- - service_key: $PAGERDUTY_SERVICE_KEY
132
- description: "{{ .GroupLabels.alertname }}"
133
-
134
- - name: slack
135
- slack_configs:
136
- - api_url: $SLACK_WEBHOOK_URL
137
- channel: '#alerts'
138
- title: "{{ .GroupLabels.alertname }}"
139
- text: "{{ range .Alerts }}{{ .Annotations.description }}{{ end }}"
140
-
141
- - name: email
142
- email_configs:
143
- - to: 'team@example.com'
144
- from: 'alertmanager@example.com'
145
- ```
146
-
147
- **PagerDuty Integration:**
148
- ```yaml
149
- pagerduty_configs:
150
- - routing_key: $PAGERDUTY_ROUTING_KEY
151
- severity: "{{ .Labels.severity }}"
152
- client: "Alertmanager"
153
- client_url: "{{ .ExternalURL }}"
154
- description: "{{ .GroupLabels.alertname }}"
155
- details:
156
- firing: "{{ .Alerts.Firing | len }}"
157
- resolved: "{{ .Alerts.Resolved | len }}"
158
- summary: "{{ range .Alerts }}{{ .Annotations.summary }}{{ end }}"
159
- ```
160
-
161
- ## Alert Fatigue Prevention
162
-
163
- **Strategies:**
164
-
165
- 1. **High Signal-to-Noise Ratio**
166
- ```
167
- Target: < 5% false positive rate
168
- If alert fires but no action taken → remove or adjust
169
- ```
170
-
171
- 2. **Appropriate Thresholds**
172
- ```yaml
173
- # Too sensitive
174
- expr: cpu_usage > 0.5 # Fires constantly
175
-
176
- # Better
177
- expr: cpu_usage > 0.9 for 10m # Sustained high usage
178
- ```
179
-
180
- 3. **Group Similar Alerts**
181
- ```yaml
182
- route:
183
- group_by: ['alertname', 'cluster', 'service']
184
- group_wait: 30s # Wait to group
185
- group_interval: 5m # Send grouped updates
186
- ```
187
-
188
- 4. **Escalation Policies**
189
- ```yaml
190
- # PagerDuty escalation
191
- escalation_policy:
192
- - level: 1
193
- targets: [on_call_primary]
194
- escalation_delay: 5m
195
-
196
- - level: 2
197
- targets: [on_call_secondary, team_lead]
198
- escalation_delay: 10m
199
-
200
- - level: 3
201
- targets: [engineering_manager]
202
- escalation_delay: 15m
203
- ```
204
-
205
- 5. **Alert Inhibition**
206
- ```yaml
207
- inhibit_rules:
208
- # If service is down, don't alert on high latency
209
- - source_match:
210
- severity: critical
211
- alertname: ServiceDown
212
- target_match:
213
- severity: warning
214
- alertname: HighLatency
215
- equal: ['service']
216
- ```
217
-
218
- ## Best Practices
219
-
220
- ### 1. Include Runbook Links
221
-
222
- ```yaml
223
- annotations:
224
- runbook: "https://runbooks.example.com/{{ $labels.alertname }}"
225
- ```
226
-
227
- ### 2. Add Context
228
-
229
- ```yaml
230
- annotations:
231
- description: |
232
- Service {{ $labels.service }} error rate is {{ $value | humanizePercentage }}
233
- Dashboard: https://grafana.example.com/d/{{ $labels.service }}
234
- Logs: https://logs.example.com/?service={{ $labels.service }}
235
- ```
236
-
237
- ### 3. Test Alerts
238
-
239
- ```bash
240
- # Send test alert
241
- amtool alert add alertname=TestAlert severity=warning
242
-
243
- # Check routing
244
- amtool config routes test --config.file=alertmanager.yml \
245
- severity=critical team=platform
246
- ```
247
-
248
- ### 4. Review Alerts Regularly
249
-
250
- ```yaml
251
- # Quarterly alert audit
252
- review_process:
253
- - Check false positive rate
254
- - Verify runbooks are current
255
- - Update thresholds based on trends
256
- - Remove unused alerts
257
- ```
258
-
259
- ### 5. Time-Based Routing
260
-
261
- ```yaml
262
- # Different routing for business hours vs off-hours
263
- routes:
264
- - match:
265
- severity: warning
266
- receiver: slack
267
- active_time_intervals:
268
- - business_hours
269
-
270
- - match:
271
- severity: warning
272
- receiver: email
273
- active_time_intervals:
274
- - off_hours
275
- ```
276
-
277
- ---
278
-
279
- **Related Resources:**
280
- - [incident-management.md](incident-management.md)
281
- - [on-call-runbooks.md](on-call-runbooks.md)
282
- - [observability-stack.md](observability-stack.md)
@@ -1,226 +0,0 @@
1
- # Capacity Planning
2
-
3
- Resource forecasting, growth modeling, scalability analysis, load testing, and proactive capacity management.
4
-
5
- ## Table of Contents
6
-
7
- - [Capacity Planning Process](#capacity-planning-process)
8
- - [Resource Forecasting](#resource-forecasting)
9
- - [Load Testing](#load-testing)
10
- - [Scalability Analysis](#scalability-analysis)
11
-
12
- ## Capacity Planning Process
13
-
14
- ```yaml
15
- quarterly_process:
16
- 1_collect_data:
17
- - Current resource usage trends
18
- - Traffic growth patterns
19
- - Business projections
20
- - Seasonal variations
21
-
22
- 2_forecast:
23
- - Project 6-12 months ahead
24
- - Account for growth initiatives
25
- - Include safety margin (20-30%)
26
-
27
- 3_plan_upgrades:
28
- - Identify bottlenecks
29
- - Plan infrastructure changes
30
- - Budget for new resources
31
-
32
- 4_implement:
33
- - Gradual rollout
34
- - Monitor impact
35
- - Adjust as needed
36
- ```
37
-
38
- ## Resource Forecasting
39
-
40
- **Linear Growth Model:**
41
- ```python
42
- import pandas as pd
43
- import numpy as np
44
- from sklearn.linear_model import LinearRegression
45
-
46
- def forecast_capacity(historical_data, months_ahead=6):
47
- """
48
- Forecast resource requirements
49
-
50
- Args:
51
- historical_data: DataFrame with 'date' and 'usage' columns
52
- months_ahead: Number of months to forecast
53
-
54
- Returns:
55
- Forecasted usage values
56
- """
57
- # Prepare data
58
- X = np.array(range(len(historical_data))).reshape(-1, 1)
59
- y = historical_data['usage'].values
60
-
61
- # Train model
62
- model = LinearRegression()
63
- model.fit(X, y)
64
-
65
- # Forecast
66
- future_X = np.array(range(len(historical_data),
67
- len(historical_data) + months_ahead)).reshape(-1, 1)
68
- forecast = model.predict(future_X)
69
-
70
- # Add 30% safety margin
71
- return forecast * 1.3
72
-
73
- # Usage
74
- import pandas as pd
75
- data = pd.DataFrame({
76
- 'date': pd.date_range('2023-01-01', periods=12, freq='M'),
77
- 'usage': [100, 110, 115, 125, 130, 140, 145, 155, 160, 170, 175, 185]
78
- })
79
-
80
- forecast = forecast_capacity(data, months_ahead=6)
81
- print(f"Forecasted usage in 6 months: {forecast[-1]:.0f}")
82
- ```
83
-
84
- **Capacity Metrics:**
85
- ```yaml
86
- cpu:
87
- current_avg: 45%
88
- current_p95: 75%
89
- target_max: 80%
90
- growth_rate: 5% monthly
91
- action_needed: Scale in 4 months
92
-
93
- memory:
94
- current_avg: 60%
95
- current_p95: 85%
96
- target_max: 85%
97
- growth_rate: 3% monthly
98
- action_needed: Scale in 6 months
99
-
100
- storage:
101
- current_usage: 500GB
102
- total_capacity: 1TB
103
- growth_rate: 50GB monthly
104
- action_needed: Scale in 10 months
105
- ```
106
-
107
- ## Load Testing
108
-
109
- **k6 Load Test:**
110
- ```javascript
111
- // load-test.js
112
- import http from 'k6/http';
113
- import { check, sleep } from 'k6';
114
-
115
- export const options = {
116
- stages: [
117
- { duration: '5m', target: 100 }, // Ramp up to 100 users
118
- { duration: '10m', target: 100 }, // Stay at 100 users
119
- { duration: '5m', target: 500 }, // Ramp to 500 users
120
- { duration: '10m', target: 500 }, // Stay at 500
121
- { duration: '5m', target: 1000 }, // Spike to 1000
122
- { duration: '5m', target: 0 }, // Ramp down
123
- ],
124
- thresholds: {
125
- http_req_duration: ['p(95)<500'], // 95% of requests < 500ms
126
- http_req_failed: ['rate<0.01'], // Error rate < 1%
127
- },
128
- };
129
-
130
- export default function () {
131
- const res = http.get('https://api.example.com/');
132
- check(res, {
133
- 'status is 200': (r) => r.status === 200,
134
- 'response time < 500ms': (r) => r.timings.duration < 500,
135
- });
136
- sleep(1);
137
- }
138
- ```
139
-
140
- **Run Load Test:**
141
- ```bash
142
- # Local test
143
- k6 run load-test.js
144
-
145
- # Cloud test (distributed)
146
- k6 cloud load-test.js
147
-
148
- # With custom VUs
149
- k6 run --vus 1000 --duration 30m load-test.js
150
- ```
151
-
152
- ## Scalability Analysis
153
-
154
- **Horizontal vs Vertical Scaling:**
155
- ```yaml
156
- horizontal_scaling:
157
- when: Stateless applications, need high availability
158
- pros:
159
- - No downtime
160
- - Better fault tolerance
161
- - Linear cost scaling
162
- cons:
163
- - More complex
164
- - Coordination overhead
165
-
166
- vertical_scaling:
167
- when: Stateful applications, simpler architecture
168
- pros:
169
- - Simpler architecture
170
- - Less coordination
171
- cons:
172
- - Downtime required
173
- - Upper limits
174
- - Single point of failure
175
- ```
176
-
177
- **Auto-scaling Configuration:**
178
- ```yaml
179
- apiVersion: autoscaling/v2
180
- kind: HorizontalPodAutoscaler
181
- metadata:
182
- name: api-hpa
183
- spec:
184
- scaleTargetRef:
185
- apiVersion: apps/v1
186
- kind: Deployment
187
- name: api
188
- minReplicas: 3
189
- maxReplicas: 100
190
- metrics:
191
- - type: Resource
192
- resource:
193
- name: cpu
194
- target:
195
- type: Utilization
196
- averageUtilization: 70
197
- - type: Resource
198
- resource:
199
- name: memory
200
- target:
201
- type: Utilization
202
- averageUtilization: 80
203
- behavior:
204
- scaleDown:
205
- stabilizationWindowSeconds: 300
206
- policies:
207
- - type: Percent
208
- value: 50
209
- periodSeconds: 60
210
- scaleUp:
211
- stabilizationWindowSeconds: 0
212
- policies:
213
- - type: Percent
214
- value: 100
215
- periodSeconds: 30
216
- - type: Pods
217
- value: 5
218
- periodSeconds: 30
219
- selectPolicy: Max
220
- ```
221
-
222
- ---
223
-
224
- **Related Resources:**
225
- - [performance-optimization.md](performance-optimization.md)
226
- - [resource-management.md](../platform-engineering/resources/resource-management.md)
@@ -1,193 +0,0 @@
1
- # Chaos Engineering
2
-
3
- Chaos Monkey, fault injection, failure mode testing, Chaos Toolkit, Litmus Chaos, and resilience testing practices.
4
-
5
- ## Table of Contents
6
-
7
- - [Principles](#principles)
8
- - [Tools](#tools)
9
- - [Experiments](#experiments)
10
- - [Best Practices](#best-practices)
11
-
12
- ## Principles
13
-
14
- **Chaos Engineering Principles:**
15
- 1. Build a hypothesis around steady state
16
- 2. Vary real-world events
17
- 3. Run experiments in production
18
- 4. Automate experiments
19
- 5. Minimize blast radius
20
-
21
- ## Tools
22
-
23
- **Chaos Mesh (Kubernetes):**
24
- ```yaml
25
- apiVersion: chaos-mesh.org/v1alpha1
26
- kind: PodChaos
27
- metadata:
28
- name: pod-failure-example
29
- spec:
30
- action: pod-failure
31
- mode: one
32
- selector:
33
- namespaces:
34
- - production
35
- labelSelectors:
36
- app: api-service
37
- duration: "30s"
38
- scheduler:
39
- cron: "@every 2h"
40
- ```
41
-
42
- **Network Chaos:**
43
- ```yaml
44
- apiVersion: chaos-mesh.org/v1alpha1
45
- kind: NetworkChaos
46
- metadata:
47
- name: network-delay
48
- spec:
49
- action: delay
50
- mode: all
51
- selector:
52
- namespaces:
53
- - production
54
- labelSelectors:
55
- app: api-service
56
- delay:
57
- latency: "100ms"
58
- correlation: "25"
59
- jitter: "10ms"
60
- duration: "5m"
61
- ```
62
-
63
- **Litmus Chaos:**
64
- ```yaml
65
- apiVersion: litmuschaos.io/v1alpha1
66
- kind: ChaosEngine
67
- metadata:
68
- name: nginx-chaos
69
- spec:
70
- appinfo:
71
- appns: 'default'
72
- applabel: 'app=nginx'
73
- appkind: 'deployment'
74
- chaosServiceAccount: litmus-admin
75
- experiments:
76
- - name: pod-delete
77
- spec:
78
- components:
79
- env:
80
- - name: TOTAL_CHAOS_DURATION
81
- value: '30'
82
- - name: CHAOS_INTERVAL
83
- value: '10'
84
- - name: FORCE
85
- value: 'false'
86
- ```
87
-
88
- ## Experiments
89
-
90
- **Pod Deletion Test:**
91
- ```bash
92
- # Verify system handles pod failures
93
- kubectl delete pod -l app=api-service --grace-period=0
94
-
95
- # Expected outcome:
96
- # - New pod starts automatically
97
- # - No service interruption
98
- # - Requests handled by other pods
99
- ```
100
-
101
- **Database Failure Simulation:**
102
- ```yaml
103
- # Simulate database connection issues
104
- apiVersion: chaos-mesh.org/v1alpha1
105
- kind: NetworkChaos
106
- metadata:
107
- name: db-partition
108
- spec:
109
- action: partition
110
- mode: all
111
- selector:
112
- namespaces:
113
- - production
114
- labelSelectors:
115
- app: api-service
116
- direction: to
117
- target:
118
- selector:
119
- namespaces:
120
- - production
121
- labelSelectors:
122
- app: postgres
123
- duration: "2m"
124
- ```
125
-
126
- **CPU Stress Test:**
127
- ```yaml
128
- apiVersion: chaos-mesh.org/v1alpha1
129
- kind: StressChaos
130
- metadata:
131
- name: cpu-stress
132
- spec:
133
- mode: one
134
- selector:
135
- namespaces:
136
- - production
137
- labelSelectors:
138
- app: api-service
139
- stressors:
140
- cpu:
141
- workers: 4
142
- load: 80
143
- duration: "5m"
144
- ```
145
-
146
- ## Best Practices
147
-
148
- ### 1. Start Small
149
-
150
- ```
151
- Begin in dev/staging
152
- Small blast radius
153
- Short duration
154
- Gradually increase scope
155
- ```
156
-
157
- ### 2. Define Success Criteria
158
-
159
- ```yaml
160
- experiment:
161
- hypothesis: "API continues serving traffic during pod failure"
162
- success_criteria:
163
- - Error rate < 0.1%
164
- - P95 latency < 500ms
165
- - No customer impact
166
- failure_action: Rollback immediately
167
- ```
168
-
169
- ### 3. Automate Chaos
170
-
171
- ```yaml
172
- # Regular chaos experiments
173
- schedule:
174
- daily: Pod deletion
175
- weekly: Network latency
176
- monthly: Region failure simulation
177
- ```
178
-
179
- ### 4. Monitor During Experiments
180
-
181
- ```yaml
182
- observability:
183
- - Real-time dashboards
184
- - Alert on anomalies
185
- - Correlate with experiment timeline
186
- - Document unexpected behavior
187
- ```
188
-
189
- ---
190
-
191
- **Related Resources:**
192
- - [reliability-patterns.md](reliability-patterns.md)
193
- - [incident-management.md](incident-management.md)