claude-flow-novice 2.0.0 → 2.0.1
This diff shows the contents of publicly available package versions released to one of the supported registries. It is provided for informational purposes only and reflects the changes between package versions as they appear in their respective public registries.
- package/.claude/agents/CLAUDE.md +669 -51
- package/.claude/agents/agent-principles/CODER_AGENT_GUIDELINES.md +1245 -0
- package/.claude/agents/agent-principles/agent-type-guidelines.md +137 -0
- package/.claude/agents/agent-principles/format-selection.md +20 -0
- package/.claude/agents/agent-principles/prompt-engineering.md +165 -35
- package/.claude/agents/agent-principles/quality-metrics.md +83 -2
- package/.claude/agents/analysis/code-analyzer.md +722 -0
- package/.claude/agents/analysis/code-review/analyze-code-quality.md +33 -3
- package/.claude/agents/analysis/perf-analyzer.md +812 -0
- package/.claude/agents/architecture/system-architect.md +25 -11
- package/.claude/agents/cfn-loop/product-owner.md +458 -2
- package/.claude/agents/code-booster.md +13 -2
- package/.claude/agents/consensus/byzantine-coordinator.md +259 -6
- package/.claude/agents/consensus/consensus-builder.md +135 -2
- package/.claude/agents/consensus/crdt-synchronizer.md +307 -771
- package/.claude/agents/consensus/gossip-coordinator.md +227 -2
- package/.claude/agents/consensus/performance-benchmarker.md +385 -704
- package/.claude/agents/consensus/quorum-manager.md +241 -749
- package/.claude/agents/consensus/raft-manager.md +195 -2
- package/.claude/agents/consensus/security-manager.md +461 -518
- package/.claude/agents/core-agents/analyst.md +560 -0
- package/.claude/agents/core-agents/architect.md +578 -0
- package/.claude/agents/core-agents/base-template-generator.md +137 -0
- package/.claude/agents/core-agents/coder.md +409 -0
- package/.claude/agents/core-agents/coordinator.md +1429 -0
- package/.claude/agents/core-agents/planner.md +343 -0
- package/.claude/agents/core-agents/researcher.md +414 -0
- package/.claude/agents/core-agents/reviewer.md +652 -0
- package/.claude/agents/core-agents/task-coordinator.md +400 -0
- package/.claude/agents/core-agents/tester.md +912 -0
- package/.claude/agents/development/backend/dev-backend-api.md +418 -23
- package/.claude/agents/devops/devops-engineer.md +240 -433
- package/.claude/agents/documentation/api-docs/docs-api-openapi.md +350 -11
- package/.claude/agents/examples/blocking-coordinator-example.md +388 -0
- package/.claude/agents/frontend/interaction-tester.md +334 -17
- package/.claude/agents/frontend/react-frontend-engineer.md +255 -2
- package/.claude/agents/frontend/state-architect.md +235 -9
- package/.claude/agents/frontend/ui-designer.md +261 -132
- package/.claude/agents/goal/goal-planner.md +803 -52
- package/.claude/agents/planning-team/api-designer-persona.md +736 -0
- package/.claude/agents/planning-team/security-architect-persona.md +643 -0
- package/.claude/agents/planning-team/system-architect-persona.md +585 -0
- package/.claude/agents/product-owner-team/accessibility-advocate-persona.md +796 -0
- package/.claude/agents/product-owner-team/cto-agent.md +473 -0
- package/.claude/agents/product-owner-team/power-user-persona.md +590 -0
- package/.claude/agents/product-owner-team/product-owner-agent.md +806 -0
- package/.claude/agents/security/security-specialist.md +515 -13
- package/.claude/agents/sparc/architecture.md +237 -1
- package/.claude/agents/sparc/pseudocode.md +237 -1
- package/.claude/agents/sparc/refinement.md +244 -1
- package/.claude/agents/sparc/specification.md +282 -21
- package/.claude/agents/specialized/code-booster.md +826 -0
- package/.claude/agents/specialized/mobile/mobile-dev.md +560 -0
- package/.claude/agents/specialized/mobile/spec-mobile-react-native.md +33 -1
- package/.claude/agents/swarm/adaptive-coordinator-enhanced.md +485 -746
- package/.claude/agents/swarm/adaptive-coordinator.md +269 -37
- package/.claude/agents/swarm/blocking-coordinator-example.md +456 -0
- package/.claude/agents/swarm/hierarchical-coordinator.md +324 -60
- package/.claude/agents/swarm/mesh-coordinator.md +774 -324
- package/.claude/agents/swarm/test-coordinator.md +123 -74
- package/.claude/agents/testing/e2e/playwright-agent.md +32 -0
- package/.claude/agents/testing/interaction-tester.md +525 -0
- package/.claude/agents/testing/playwright-tester.md +405 -0
- package/.claude/agents/testing/production-validator.md +644 -0
- package/.claude/agents/testing/tdd-london-swarm.md +659 -0
- package/.claude/agents/testing/unit/tdd-london-swarm.md +27 -0
- package/.claude/agents/testing/validation/production-validator.md +390 -1
- package/.claude/agents-ignore/mesh-coordinator-backup.md +435 -0
- package/.claude/commands/cfn-loop-document.md +441 -0
- package/.claude/commands/github-commit.md +289 -0
- package/.claude-flow-novice/.claude/agents/CLAUDE.md +669 -51
- package/.claude-flow-novice/.claude/agents/agent-principles/agent-type-guidelines.md +137 -0
- package/.claude-flow-novice/.claude/agents/agent-principles/format-selection.md +20 -0
- package/.claude-flow-novice/.claude/agents/agent-principles/prompt-engineering.md +165 -35
- package/.claude-flow-novice/.claude/agents/agent-principles/quality-metrics.md +83 -2
- package/.claude-flow-novice/.claude/agents/analysis/code-analyzer.md +722 -192
- package/.claude-flow-novice/.claude/agents/analysis/code-review/analyze-code-quality.md +33 -3
- package/.claude-flow-novice/.claude/agents/analysis/perf-analyzer.md +812 -0
- package/.claude-flow-novice/.claude/agents/architecture/system-architect.md +25 -11
- package/.claude-flow-novice/.claude/agents/cfn-loop/product-owner.md +458 -2
- package/.claude-flow-novice/.claude/agents/code-booster.md +13 -2
- package/.claude-flow-novice/.claude/agents/consensus/byzantine-coordinator.md +259 -6
- package/.claude-flow-novice/.claude/agents/consensus/consensus-builder.md +135 -2
- package/.claude-flow-novice/.claude/agents/consensus/crdt-synchronizer.md +307 -771
- package/.claude-flow-novice/.claude/agents/consensus/gossip-coordinator.md +227 -2
- package/.claude-flow-novice/.claude/agents/consensus/performance-benchmarker.md +385 -704
- package/.claude-flow-novice/.claude/agents/consensus/quorum-manager.md +241 -749
- package/.claude-flow-novice/.claude/agents/consensus/raft-manager.md +195 -2
- package/.claude-flow-novice/.claude/agents/consensus/security-manager.md +461 -518
- package/.claude-flow-novice/.claude/agents/core-agents/analyst.md +560 -0
- package/.claude-flow-novice/.claude/agents/core-agents/architect.md +578 -0
- package/.claude-flow-novice/.claude/agents/core-agents/base-template-generator.md +137 -0
- package/.claude-flow-novice/.claude/agents/core-agents/coder.md +409 -0
- package/.claude-flow-novice/.claude/agents/core-agents/coordinator.md +1429 -0
- package/.claude-flow-novice/.claude/agents/core-agents/planner.md +343 -0
- package/.claude-flow-novice/.claude/agents/core-agents/researcher.md +414 -0
- package/.claude-flow-novice/.claude/agents/core-agents/reviewer.md +652 -0
- package/.claude-flow-novice/.claude/agents/core-agents/task-coordinator.md +400 -0
- package/.claude-flow-novice/.claude/agents/core-agents/tester.md +912 -0
- package/.claude-flow-novice/.claude/agents/development/backend/dev-backend-api.md +418 -23
- package/.claude-flow-novice/.claude/agents/devops/devops-engineer.md +240 -433
- package/.claude-flow-novice/.claude/agents/documentation/api-docs/docs-api-openapi.md +350 -11
- package/.claude-flow-novice/.claude/agents/examples/blocking-coordinator-example.md +388 -0
- package/.claude-flow-novice/.claude/agents/frontend/interaction-tester.md +334 -17
- package/.claude-flow-novice/.claude/agents/frontend/react-frontend-engineer.md +255 -2
- package/.claude-flow-novice/.claude/agents/frontend/state-architect.md +235 -9
- package/.claude-flow-novice/.claude/agents/frontend/ui-designer.md +261 -132
- package/.claude-flow-novice/.claude/agents/goal/goal-planner.md +803 -52
- package/.claude-flow-novice/.claude/agents/planning-team/api-designer-persona.md +736 -0
- package/.claude-flow-novice/.claude/agents/planning-team/security-architect-persona.md +643 -0
- package/.claude-flow-novice/.claude/agents/planning-team/system-architect-persona.md +585 -0
- package/.claude-flow-novice/.claude/agents/predesign-negotiation/accessibility-advocate-persona.md +796 -0
- package/.claude-flow-novice/.claude/agents/predesign-negotiation/cto-agent.md +473 -0
- package/.claude-flow-novice/.claude/agents/predesign-negotiation/power-user-persona.md +590 -0
- package/.claude-flow-novice/.claude/agents/predesign-negotiation/product-owner-agent.md +806 -0
- package/.claude-flow-novice/.claude/agents/product-owner-team/accessibility-advocate-persona.md +796 -0
- package/.claude-flow-novice/.claude/agents/product-owner-team/cto-agent.md +473 -0
- package/.claude-flow-novice/.claude/agents/product-owner-team/power-user-persona.md +590 -0
- package/.claude-flow-novice/.claude/agents/product-owner-team/product-owner-agent.md +806 -0
- package/.claude-flow-novice/.claude/agents/security/security-specialist.md +515 -13
- package/.claude-flow-novice/.claude/agents/sparc/architecture.md +237 -1
- package/.claude-flow-novice/.claude/agents/sparc/pseudocode.md +237 -1
- package/.claude-flow-novice/.claude/agents/sparc/refinement.md +244 -1
- package/.claude-flow-novice/.claude/agents/sparc/specification.md +282 -21
- package/.claude-flow-novice/.claude/agents/specialized/code-booster.md +826 -0
- package/.claude-flow-novice/.claude/agents/specialized/mobile/mobile-dev.md +560 -0
- package/.claude-flow-novice/.claude/agents/specialized/mobile/spec-mobile-react-native.md +33 -1
- package/.claude-flow-novice/.claude/agents/swarm/adaptive-coordinator-enhanced.md +485 -746
- package/.claude-flow-novice/.claude/agents/swarm/adaptive-coordinator.md +269 -37
- package/.claude-flow-novice/.claude/agents/swarm/blocking-coordinator-example.md +456 -0
- package/.claude-flow-novice/.claude/agents/swarm/hierarchical-coordinator.md +324 -60
- package/.claude-flow-novice/.claude/agents/swarm/mesh-coordinator.md +774 -324
- package/.claude-flow-novice/.claude/agents/swarm/test-coordinator.md +123 -74
- package/.claude-flow-novice/.claude/agents/testing/e2e/playwright-agent.md +32 -0
- package/.claude-flow-novice/.claude/agents/testing/interaction-tester.md +525 -0
- package/.claude-flow-novice/.claude/agents/testing/playwright-tester.md +405 -0
- package/.claude-flow-novice/.claude/agents/testing/production-validator.md +644 -0
- package/.claude-flow-novice/.claude/agents/testing/tdd-london-swarm.md +659 -0
- package/.claude-flow-novice/.claude/agents/testing/unit/tdd-london-swarm.md +27 -0
- package/.claude-flow-novice/.claude/agents/testing/validation/production-validator.md +390 -1
- package/.claude-flow-novice/config/typescript/tsconfig.tsbuildinfo +1 -1
- package/.claude-flow-novice/dist/__tests__/redis/RedisHealthMonitor.test.d.ts +14 -0
- package/.claude-flow-novice/dist/agents/heartbeat-manager.d.ts +73 -0
- package/.claude-flow-novice/dist/agents/lifecycle-cleanup-enhanced.d.ts +190 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/agent-lifecycle-sqlite.test.d.ts +17 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/blocking-coordination-audit.test.d.ts +16 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/blocking-coordination-signals.test.d.ts +14 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/byzantine-consensus-adapter.test.d.ts +14 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/byzantine-performance.test.d.ts +17 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/cfn-loop-byzantine-integration.test.d.ts +15 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/cfn-loop-e2e.test.d.ts +15 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/cfn-loop-memory-manager.test.d.ts +9 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/cleanup-integration.test.d.ts +21 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/cleanup-performance-validation.test.d.ts +13 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/coordinator-timeout-handler.test.d.ts +14 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/dead-coordinator-detection.test.d.ts +15 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/doc-code-examples-validator.d.ts +35 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/doc-executable-examples.test.d.ts +10 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/extended-timeout-testing.test.d.ts +24 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/heartbeat-warning-system.test.d.ts +21 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/redis-health-monitor.test.d.ts +22 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/signal-ack-protocol.test.d.ts +21 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/sqlite-memory-manager.test.d.ts +19 -0
- package/.claude-flow-novice/dist/cfn-loop/__tests__/test-utilities.d.ts +133 -0
- package/.claude-flow-novice/dist/cfn-loop/agent-lifecycle-sqlite.d.ts +143 -0
- package/.claude-flow-novice/dist/cfn-loop/blocking-coordination-signals.d.ts +178 -0
- package/.claude-flow-novice/dist/cfn-loop/blocking-coordination.d.ts +268 -0
- package/.claude-flow-novice/dist/cfn-loop/byzantine-consensus-adapter.d.ts +193 -0
- package/.claude-flow-novice/dist/cfn-loop/cfn-loop-memory-manager.d.ts +221 -0
- package/.claude-flow-novice/dist/cfn-loop/cfn-loop-orchestrator.d.ts +193 -1
- package/.claude-flow-novice/dist/cfn-loop/checkpoint-serializer.d.ts +113 -0
- package/.claude-flow-novice/dist/cfn-loop/circuit-breaker.d.ts +8 -2
- package/.claude-flow-novice/dist/cfn-loop/conflict-resolver.d.ts +221 -0
- package/.claude-flow-novice/dist/cfn-loop/consensus/enterprise-planning-consensus.d.ts +61 -0
- package/.claude-flow-novice/dist/cfn-loop/consensus/mvp-consensus.d.ts +33 -0
- package/.claude-flow-novice/dist/cfn-loop/coordination-validator.d.ts +121 -0
- package/.claude-flow-novice/dist/cfn-loop/coordinator-timeout-handler.d.ts +195 -0
- package/.claude-flow-novice/dist/cfn-loop/crash-detector.d.ts +138 -0
- package/.claude-flow-novice/dist/cfn-loop/epic-report-generator.d.ts +136 -0
- package/.claude-flow-novice/dist/cfn-loop/git-checkpoint-integration.example.d.ts +13 -0
- package/.claude-flow-novice/dist/cfn-loop/git-checkpoint-manager.d.ts +165 -0
- package/.claude-flow-novice/dist/cfn-loop/heartbeat-integration-example.d.ts +16 -0
- package/.claude-flow-novice/dist/cfn-loop/heartbeat-warning-system.d.ts +202 -0
- package/.claude-flow-novice/dist/cfn-loop/meta-coordinator.d.ts +208 -0
- package/.claude-flow-novice/dist/cfn-loop/modes/__tests__/mode-selection.test.d.ts +9 -0
- package/.claude-flow-novice/dist/cfn-loop/modes/enterprise-mode.d.ts +37 -0
- package/.claude-flow-novice/dist/cfn-loop/modes/index.d.ts +111 -0
- package/.claude-flow-novice/dist/cfn-loop/modes/mvp-mode.d.ts +31 -0
- package/.claude-flow-novice/dist/cfn-loop/modes/standard-mode.d.ts +31 -0
- package/.claude-flow-novice/dist/cfn-loop/modes/types.d.ts +135 -0
- package/.claude-flow-novice/dist/cfn-loop/product-owner/enterprise-owner-team.d.ts +50 -0
- package/.claude-flow-novice/dist/cfn-loop/product-owner/mvp-owner.d.ts +31 -0
- package/.claude-flow-novice/dist/cfn-loop/recovery-engine.d.ts +183 -0
- package/.claude-flow-novice/dist/cfn-loop/redis-health-integration-example.d.ts +13 -0
- package/.claude-flow-novice/dist/cfn-loop/redis-health-monitor.d.ts +164 -0
- package/.claude-flow-novice/dist/cfn-loop/redis-pubsub-helpers.d.ts +230 -0
- package/.claude-flow-novice/dist/cfn-loop/sprint-coordinator-enhanced.d.ts +199 -0
- package/.claude-flow-novice/dist/cfn-loop/state-checkpoint-manager.d.ts +198 -0
- package/.claude-flow-novice/dist/cfn-loop/test-aggregator.d.ts +205 -0
- package/.claude-flow-novice/dist/cfn-loop/test-lock-coordinator.d.ts +176 -0
- package/.claude-flow-novice/dist/cfn-loop/test-product-owner-decision.d.ts +19 -0
- package/.claude-flow-novice/dist/cfn-loop/types.d.ts +174 -0
- package/.claude-flow-novice/dist/cfn-loop/validator-methods-replacement.d.ts +68 -0
- package/.claude-flow-novice/dist/cli/cleanup-orphans.d.ts +54 -0
- package/.claude-flow-novice/dist/cli/commands/agent-lifecycle.d.ts +226 -0
- package/.claude-flow-novice/dist/cli/commands/cfn-loop-parallel.d.ts +21 -0
- package/.claude-flow-novice/dist/cli/commands/recovery-resume.d.ts +33 -0
- package/.claude-flow-novice/dist/cli/commands/recovery-status.d.ts +57 -0
- package/.claude-flow-novice/dist/cli/commands/recovery.d.ts +88 -0
- package/.claude-flow-novice/dist/cli/commands/validate-coordination.d.ts +14 -0
- package/.claude-flow-novice/dist/cli/node-compat.d.ts +1 -1
- package/.claude-flow-novice/dist/cli/simple-commands/hive-mind/queen.d.ts +3 -3
- package/.claude-flow-novice/dist/cli/utils/interactive-detector.d.ts +1 -1
- package/.claude-flow-novice/dist/cli/utils/redis-client.d.ts +1 -5
- package/.claude-flow-novice/dist/consensus/byzantine-coordinator.d.ts +314 -0
- package/.claude-flow-novice/dist/constants/agent-types.d.ts +2 -2
- package/.claude-flow-novice/dist/coordination/hive-orchestrator.d.ts +1 -1
- package/.claude-flow-novice/dist/coordination/validation-schemas.d.ts +12 -12
- package/.claude-flow-novice/dist/hooks/index.d.ts +1 -1
- package/.claude-flow-novice/dist/hooks/useSwarmRealtimeData.d.ts +11 -11
- package/.claude-flow-novice/dist/memory/advanced-memory-manager.d.ts +1 -0
- package/.claude-flow-novice/dist/memory/backends/sqlite.d.ts +1 -0
- package/.claude-flow-novice/dist/memory/distributed-memory.d.ts +1 -0
- package/.claude-flow-novice/dist/memory/secret-detector.d.ts +131 -0
- package/.claude-flow-novice/dist/memory/sqlite-enhanced-backend.d.ts +1 -0
- package/.claude-flow-novice/dist/monitoring/memory-leak-dashboard-widget.d.ts +194 -0
- package/.claude-flow-novice/dist/providers/api-key-rotation-example.d.ts +54 -0
- package/.claude-flow-novice/dist/providers/api-key-rotator.d.ts +166 -0
- package/.claude-flow-novice/dist/providers/rate-limit-detector.d.ts +60 -0
- package/.claude-flow-novice/dist/redis/RedisHealthMonitor.d.ts +162 -0
- package/.claude-flow-novice/dist/redis/health-integration-example.d.ts +86 -0
- package/.claude-flow-novice/dist/services/swarm-memory-manager.d.ts +1 -0
- package/.claude-flow-novice/dist/src/agents/heartbeat-manager.js +144 -0
- package/.claude-flow-novice/dist/src/agents/lifecycle-cleanup-enhanced.js +514 -0
- package/.claude-flow-novice/dist/src/automation/test-pipeline/PipelineValidator.js +1 -1
- package/.claude-flow-novice/dist/src/automation/test-pipeline/SwarmTestCoordinator.js +1 -1
- package/.claude-flow-novice/dist/src/cfn-loop/agent-lifecycle-sqlite.js +385 -0
- package/.claude-flow-novice/dist/src/cfn-loop/blocking-coordination-signals.js +470 -0
- package/.claude-flow-novice/dist/src/cfn-loop/blocking-coordination.js +768 -0
- package/.claude-flow-novice/dist/src/cfn-loop/byzantine-consensus-adapter.js +548 -0
- package/.claude-flow-novice/dist/src/cfn-loop/cfn-loop-memory-manager.js +589 -0
- package/.claude-flow-novice/dist/src/cfn-loop/cfn-loop-orchestrator.js +1059 -21
- package/.claude-flow-novice/dist/src/cfn-loop/checkpoint-serializer.js +308 -0
- package/.claude-flow-novice/dist/src/cfn-loop/circuit-breaker.js +34 -9
- package/.claude-flow-novice/dist/src/cfn-loop/conflict-resolver.js +525 -0
- package/.claude-flow-novice/dist/src/cfn-loop/consensus/enterprise-planning-consensus.js +403 -0
- package/.claude-flow-novice/dist/src/cfn-loop/consensus/mvp-consensus.js +235 -0
- package/.claude-flow-novice/dist/src/cfn-loop/coordination-validator.js +304 -0
- package/.claude-flow-novice/dist/src/cfn-loop/coordinator-timeout-handler.js +600 -0
- package/.claude-flow-novice/dist/src/cfn-loop/crash-detector.js +362 -0
- package/.claude-flow-novice/dist/src/cfn-loop/epic-report-generator.js +283 -0
- package/.claude-flow-novice/dist/src/cfn-loop/git-checkpoint-integration.example.js +161 -0
- package/.claude-flow-novice/dist/src/cfn-loop/git-checkpoint-manager.js +486 -0
- package/.claude-flow-novice/dist/src/cfn-loop/heartbeat-integration-example.js +187 -0
- package/.claude-flow-novice/dist/src/cfn-loop/heartbeat-warning-system.js +492 -0
- package/.claude-flow-novice/dist/src/cfn-loop/meta-coordinator.js +538 -0
- package/.claude-flow-novice/dist/src/cfn-loop/modes/enterprise-mode.js +132 -0
- package/.claude-flow-novice/dist/src/cfn-loop/modes/index.js +191 -0
- package/.claude-flow-novice/dist/src/cfn-loop/modes/mvp-mode.js +79 -0
- package/.claude-flow-novice/dist/src/cfn-loop/modes/standard-mode.js +81 -0
- package/.claude-flow-novice/dist/src/cfn-loop/modes/types.js +41 -0
- package/.claude-flow-novice/dist/src/cfn-loop/product-owner/enterprise-owner-team.js +380 -0
- package/.claude-flow-novice/dist/src/cfn-loop/product-owner/mvp-owner.js +170 -0
- package/.claude-flow-novice/dist/src/cfn-loop/recovery-engine.js +546 -0
- package/.claude-flow-novice/dist/src/cfn-loop/redis-health-integration-example.js +215 -0
- package/.claude-flow-novice/dist/src/cfn-loop/redis-health-monitor.js +414 -0
- package/.claude-flow-novice/dist/src/cfn-loop/redis-pubsub-helpers.js +463 -0
- package/.claude-flow-novice/dist/src/cfn-loop/sprint-coordinator-enhanced.js +466 -0
- package/.claude-flow-novice/dist/src/cfn-loop/state-checkpoint-manager.js +402 -0
- package/.claude-flow-novice/dist/src/cfn-loop/test-aggregator.js +476 -0
- package/.claude-flow-novice/dist/src/cfn-loop/test-lock-coordinator.js +446 -0
- package/.claude-flow-novice/dist/src/cfn-loop/test-product-owner-decision.js +69 -0
- package/.claude-flow-novice/dist/src/cfn-loop/types.js +30 -0
- package/.claude-flow-novice/dist/src/cfn-loop/validator-methods-replacement.js +362 -0
- package/.claude-flow-novice/dist/src/cli/cleanup-orphans.js +246 -0
- package/.claude-flow-novice/dist/src/cli/commands/agent-lifecycle.js +1058 -0
- package/.claude-flow-novice/dist/src/cli/commands/cfn-loop-parallel.js +436 -0
- package/.claude-flow-novice/dist/src/cli/commands/index.js +86 -0
- package/.claude-flow-novice/dist/src/cli/commands/parse-epic.js +64 -2
- package/.claude-flow-novice/dist/src/cli/commands/recovery-resume.js +369 -0
- package/.claude-flow-novice/dist/src/cli/commands/recovery-status.js +265 -0
- package/.claude-flow-novice/dist/src/cli/commands/recovery.js +546 -0
- package/.claude-flow-novice/dist/src/cli/commands/validate-coordination.js +211 -0
- package/.claude-flow-novice/dist/src/cli/simple-commands/init/templates/CLAUDE-backup-pre-enterprise-loop.md +735 -0
- package/.claude-flow-novice/dist/src/cli/simple-commands/init/templates/CLAUDE.md +176 -326
- package/.claude-flow-novice/dist/src/coordination/shared/transparency/transparency-system.js +1 -1
- package/.claude-flow-novice/dist/src/memory/advanced-memory-manager.js +17 -2
- package/.claude-flow-novice/dist/src/memory/backends/sqlite.js +23 -1
- package/.claude-flow-novice/dist/src/memory/distributed-memory.js +18 -3
- package/.claude-flow-novice/dist/src/memory/secret-detector.js +253 -0
- package/.claude-flow-novice/dist/src/memory/sqlite-enhanced-backend.js +20 -1
- package/.claude-flow-novice/dist/src/monitoring/memory-leak-dashboard-widget.js +421 -0
- package/.claude-flow-novice/dist/src/observability/prometheus-metrics.d.js +8 -0
- package/.claude-flow-novice/dist/src/providers/api-key-rotation-example.js +165 -0
- package/.claude-flow-novice/dist/src/providers/api-key-rotator.js +412 -0
- package/.claude-flow-novice/dist/src/providers/rate-limit-detector.js +193 -0
- package/.claude-flow-novice/dist/src/redis/RedisHealthMonitor.js +429 -0
- package/.claude-flow-novice/dist/src/redis/health-integration-example.js +353 -0
- package/.claude-flow-novice/dist/src/services/swarm-memory-manager.js +72 -42
- package/.claude-flow-novice/dist/src/sqlite/ACLEnforcer.cjs +928 -0
- package/.claude-flow-novice/dist/src/sqlite/AgentRegistry.cjs +702 -0
- package/.claude-flow-novice/dist/src/sqlite/AgentRegistry.js +702 -0
- package/.claude-flow-novice/dist/src/sqlite/EncryptionKeyManager.cjs +754 -0
- package/.claude-flow-novice/dist/src/sqlite/EncryptionKeyManager.js +754 -0
- package/.claude-flow-novice/dist/src/sqlite/MemoryStoreAdapter.cjs +571 -0
- package/.claude-flow-novice/dist/src/sqlite/MemoryStoreAdapter.js +571 -0
- package/.claude-flow-novice/dist/src/sqlite/MultiLayerCache.cjs +640 -0
- package/.claude-flow-novice/dist/src/sqlite/MultiLayerCache.js +640 -0
- package/.claude-flow-novice/dist/src/sqlite/RedisCoordinator.cjs +636 -0
- package/.claude-flow-novice/dist/src/sqlite/RedisCoordinator.js +636 -0
- package/.claude-flow-novice/dist/src/sqlite/SwarmMemoryManager.cjs +750 -0
- package/.claude-flow-novice/dist/src/sqlite/SwarmMemoryManager.js +750 -0
- package/.claude-flow-novice/dist/src/sqlite/index.cjs +620 -0
- package/.claude-flow-novice/dist/src/sqlite/index.js +620 -0
- package/.claude-flow-novice/dist/src/sqlite/performance-benchmarks.cjs +839 -0
- package/.claude-flow-novice/dist/src/sqlite/performance-benchmarks.js +839 -0
- package/.claude-flow-novice/dist/src/testing/performance/PerformanceTestRunner.js +1 -1
- package/.claude-flow-novice/dist/src/wasm-regex-engine/pkg/wasm_regex_engine.d.js +11 -0
- package/.claude-flow-novice/dist/src/wasm-regex-engine/pkg/wasm_regex_engine_bg.wasm.d.js +28 -0
- package/.claude-flow-novice/dist/web/api/routes/parallel-status.d.ts +105 -0
- package/.claude-flow-novice/dist/web/dashboard/hooks/useWebSocket.d.ts +4 -4
- package/.claude-flow-novice/tsconfig.tsbuildinfo +1 -1
- package/AUTO_SETUP.md +271 -0
- package/CLAUDE.md +176 -326
- package/README.md +127 -30
- package/config/.env.example +17 -0
- package/config/cfn-loop/enterprise-criteria.json +207 -0
- package/config/cfn-loop/instructions/enterprise-instructions.md +506 -0
- package/config/cfn-loop/instructions/mvp-instructions.md +420 -0
- package/config/cfn-loop/instructions/standard-instructions.md +497 -0
- package/config/cfn-loop/mvp-criteria.json +133 -0
- package/config/docker/DEPLOYMENT_VALIDATION_RESULTS.md +1 -1
- package/config/docker/QUICK_START.txt +7 -5
- package/config/docker/STABILITY_TEST_README.md +10 -10
- package/config/hooks/AGENT_TEMPLATE_VALIDATOR_COMPLETION.md +440 -0
- package/config/hooks/BLOCKING_COORDINATION_VALIDATOR_IMPLEMENTATION_REPORT.md +559 -0
- package/config/hooks/BLOCKING_COORDINATION_VALIDATOR_README.md +467 -0
- package/config/hooks/CFN_LOOP_MEMORY_VALIDATOR_IMPLEMENTATION.md +343 -0
- package/config/hooks/COVERAGE_VALIDATOR_QUICK_START.md +218 -0
- package/config/hooks/POST_TEST_COVERAGE_README.md +657 -0
- package/config/hooks/README-AGENT-TEMPLATE-VALIDATOR.md +464 -0
- package/config/hooks/README-CFN-LOOP-MEMORY-VALIDATOR.md +442 -0
- package/config/hooks/TEST_COVERAGE_VALIDATOR_COMPLETION.md +497 -0
- package/config/hooks/WASM_REGEX_ENGINE.md +210 -0
- package/config/hooks/coverage.config.json +40 -0
- package/config/hooks/hook-manager.cjs +47 -0
- package/config/hooks/markdown-validator.js +202 -0
- package/config/hooks/post-edit-agent-template.js +607 -0
- package/config/hooks/post-edit-blocking-coordination.js +748 -0
- package/config/hooks/post-edit-cfn-loop-memory.cjs +503 -0
- package/config/hooks/post-edit-pipeline.js +290 -145
- package/config/hooks/post-test-coverage.js +981 -0
- package/config/hooks/pre-commit-db-scan +119 -0
- package/config/hooks/pre-edit-security.js +33 -6
- package/config/hooks/pre-tool-validation.js +60 -1
- package/config/hooks/safety-validator.js +236 -21
- package/config/hooks/safety-validator.js.backup +1323 -0
- package/config/hooks/validators/CWEValidator.js +152 -0
- package/config/hooks/validators/ComplianceValidator.js +187 -0
- package/config/hooks/validators/DependencyScanner.js +162 -0
- package/config/hooks/validators/InputSanitizer.js +134 -0
- package/config/hooks/validators/OWASPValidator.js +197 -0
- package/config/hooks/validators/SecurityPatternScanner.js +318 -0
- package/config/jest/jest.config.js +12 -1
- package/docs/PRE_COMMIT_HOOK.md +294 -0
- package/docs/README.md +130 -153
- package/docs/TEST_INFRASTRUCTURE.md +381 -0
- package/docs/agent-lifecycle-hooks.md +860 -0
- package/docs/api/FUNCTION_CATALOG.md +584 -0
- package/docs/api/ROUTING_QUICK_REFERENCE.md +117 -0
- package/docs/api/VALIDATION_QUICK_REFERENCE.md +172 -0
- package/docs/api/blocking-coordination-api.md +1451 -0
- package/docs/architecture/MULTI_SWARM_COORDINATION_README.md +620 -0
- package/docs/architecture/README_REALTIME_COMMUNICATION.md +463 -0
- package/docs/architecture/REALTIME_COMMUNICATION_ANALYSIS.md +321 -0
- package/docs/architecture/WASM_ARCHITECTURE_SUMMARY.md +429 -0
- package/docs/architecture/WASM_INTEGRATION_ARCHITECTURE.md +1330 -0
- package/docs/archive/2025-10-10-architecture/deprecated-implementations/BLOCKING_COORDINATION_VALIDATION_FINAL.md +334 -0
- package/docs/archive/2025-10-10-architecture/deprecated-implementations/blocking-coordination-pattern.md +484 -0
- package/docs/archive/2025-10-10-architecture/deprecated-implementations/production-blocking-coordination-plan.md +764 -0
- package/docs/archive/2025-10-10-architecture/deprecated-implementations/revised-production-blocking-plan.md +614 -0
- package/docs/archive/2025-10-10-architecture/implementation-guides/WASM_IMPLEMENTATION_GUIDE.md +1011 -0
- package/docs/archive/2025-10-10-architecture/implementation-guides/WASM_ROLLOUT_PLAN.md +701 -0
- package/docs/archive/2025-10-10-architecture/implementation-guides/agent-lifecycle-implementation-plan.md +1428 -0
- package/docs/archive/2025-10-10-architecture/other-designs/CORRECTED-task-tool-constraints.md +366 -0
- package/docs/archive/2025-10-10-architecture/other-designs/claude-code-task-tool-constraints.md +401 -0
- package/docs/archive/2025-10-10-architecture/other-designs/cleanup-architecture-explanation.md +423 -0
- package/docs/archive/2025-10-10-guides/setup-guides/CONTRIBUTING.md +136 -0
- package/docs/archive/2025-10-10-guides/setup-guides/DEVELOPMENT_SETUP.md +486 -0
- package/docs/archive/2025-10-10-guides/setup-guides/EXAMPLES.md +793 -0
- package/docs/archive/2025-10-10-guides/setup-guides/INSTALLATION.md +608 -0
- package/docs/archive/2025-10-10-guides/setup-guides/QUICK_START_INSTALLATION.md +521 -0
- package/docs/archive/2025-10-10-guides/setup-guides/README.md +162 -0
- package/docs/archive/2025-10-10-guides/setup-guides/TROUBLESHOOTING.md +1388 -0
- package/docs/archive/2025-10-10-operations/ARCHIVE_MIGRATION_PLAN.md +214 -0
- package/docs/archive/2025-10-10-performance/wasm-deliverables/WASM_DELIVERABLES.md +421 -0
- package/docs/archive/ARCHIVAL_EXECUTION_REPORT_2025-10-10.md +219 -0
- package/docs/archive/HTTP_POLLING_FALLBACK.md +283 -0
- package/docs/archive/reference-historical/BACKUP_MANIFEST.md +32 -0
- package/docs/archive/reference-historical/README-PHASE4.md +355 -0
- package/docs/archive/reference-historical/READMEv2.md +524 -0
- package/docs/deployment/blocking-coordination-secrets.md +1445 -0
- package/docs/implementation/SQLITE_INTEGRATION_IMPLEMENTATION.md +663 -0
- package/docs/integration/cfn-loop-examples.md +1107 -0
- package/docs/observability/prometheus-setup.md +455 -0
- package/docs/operations/OPERATIONS_FOLDER_REVIEW_REPORT.json +135 -0
- package/docs/operations/failure-recovery-playbook.md +877 -0
- package/docs/operations/monitoring-runbook.md +880 -0
- package/docs/patterns/blocking-coordination-pattern.md +642 -0
- package/docs/reference/CHANGELOG-POST-EDIT-PIPELINE.md +370 -0
- package/docs/reference/MANUAL_NPM_PUBLICATION_GUIDE.md +248 -0
- package/docs/security/SEC-002-race-condition-fix.md +300 -0
- package/docs/security/SEC-003-JSON-VALIDATION.md +215 -0
- package/docs/testing/chaos-engineering.md +524 -0
- package/docs/training/best-practices.md +1241 -0
- package/docs/training/faq.md +1483 -0
- package/docs/training/interactive-tutorial.md +966 -0
- package/docs/training/troubleshooting-guide.md +1279 -0
- package/docs/training/video-walkthrough-script.md +675 -0
- package/examples/demonstrations/phase5-demonstration.cjs +227 -0
- package/examples/rest-api-simple/sparc-implementation-roadmap.md +1 -1
- package/examples/rest-api-simple/sparc-implementation-roadmap.md.backup-1760135091708 +190 -0
- package/examples/templates/basic-swarm/CLAUDE.md +464 -0
- package/examples/templates/custom-agent/CLAUDE.md +299 -0
- package/examples/templates/custom-agent/package.json +26 -0
- package/examples/templates/event-bus/package.json +28 -0
- package/examples/templates/fleet-manager/CLAUDE.md +134 -0
- package/examples/templates/fleet-manager/package.json +28 -0
- package/package.json +60 -18
- package/readme/additional-commands.md +365 -2
- package/readme/cfn-loop-modes.md +527 -0
- package/readme/logs-cli-redis.md +82 -14
- package/readme/logs-documentation-index.md +8 -0
- package/readme/logs-features.md +188 -24
- package/readme/logs-slash-commands.md +35 -11
- package/scripts/CLEANUP_OPTIMIZATION_REPORT.json +312 -0
- package/scripts/CLEANUP_PERFORMANCE_OPTIMIZATION.md +387 -0
- package/scripts/CLEANUP_QUICK_START.md +268 -0
- package/scripts/CLEANUP_TEST_RESULTS.md +205 -0
- package/scripts/auto-setup.js +332 -0
- package/scripts/cleanup-blocking-coordination.sh +420 -0
- package/scripts/collect-build-metrics.js +65 -0
- package/scripts/demo/README.md +79 -0
- package/scripts/demo/autoscaling-demo-simplified.js +963 -0
- package/scripts/demo/comprehensive-dashboard-test.js +693 -0
- package/scripts/demo/confidence-log.js +87 -0
- package/scripts/demo/confidence-report.js +82 -0
- package/scripts/demo/demo-multi-swarm-coordination.js +325 -0
- package/scripts/demo/demo-production-deployment.js +399 -0
- package/scripts/demo/demo-visualization-system.js +149 -0
- package/scripts/demo/performance-analysis.cjs +71 -0
- package/scripts/demo/performance-analysis.js +71 -0
- package/scripts/demo/test-autoscaling-demo.js +314 -0
- package/scripts/dev/demo-phase3-compliance.js +2 -2
- package/scripts/ecosystem.config.cjs +90 -0
- package/scripts/hook-wrapper.sh +54 -0
- package/scripts/install-pre-commit-hook.sh +127 -0
- package/scripts/legacy/performance-test-runner.js +7 -7
- package/scripts/migration/QUICK-START.md +189 -0
- package/scripts/migration/QUICK-START.md.backup-1760135091363 +189 -0
- package/scripts/migration/README.md +30 -0
- package/scripts/migration/TASK-1.3.2-COMPLETION-REPORT.md +500 -0
- package/scripts/migration/TASK-1.3.2-COMPLETION-REPORT.md.backup-1760135091348 +500 -0
- package/scripts/migration/UPDATE-PATHS-README.md +464 -0
- package/scripts/migration/UPDATE-PATHS-README.md.backup-1760135091337 +464 -0
- package/scripts/migration/example-patterns.json +19 -0
- package/scripts/migration/reorganize-workspace.js +504 -0
- package/scripts/migration/test-update-paths.js +359 -0
- package/scripts/migration/update-paths.js +664 -0
- package/scripts/migration/validate-migration.js +647 -0
- package/scripts/monitoring/README.md +6 -6
- package/scripts/monitoring/analyze-resources.sh +1 -1
- package/scripts/monitoring/dynamic-monitor.sh +4 -4
- package/scripts/monitoring/test-monitor-quick.sh +1 -1
- package/scripts/performance-test-runner.js +7 -7
- package/scripts/redis-lua/cleanup-blocking-coordination.lua +198 -0
- package/scripts/sync-agents.js +290 -0
- package/scripts/test/NEW_STABILITY_TEST_GUIDE.md +13 -8
- package/scripts/test/quick-multilingual-demo.js +2 -2
- package/scripts/test-cleanup-performance.sh +416 -0
- package/scripts/test-runner.cjs +154 -0
- package/scripts/validate-agent-hooks.js +506 -0
- package/scripts/validation/README.md +33 -0
- package/scripts/validation/acl-security-validation.cjs +214 -0
- package/scripts/validation/acl-security-validation.js +402 -0
- package/scripts/validation/byzantine-verification.js +407 -0
- package/scripts/validation/final-phase-2-consensus.cjs +219 -0
- package/scripts/validation/final-security-validation.js +791 -0
- package/scripts/validation/final-wasm-validation.cjs +840 -0
- package/scripts/validation/integration-test-analysis.js +105 -0
- package/scripts/validation/phase-0-comprehensive-validation.js +474 -0
- package/scripts/validation/phase-0-consensus-report.js +139 -0
- package/scripts/validation/phase-0-final-report.js +112 -0
- package/scripts/validation/phase-0-redis-consensus-report.js +129 -0
- package/scripts/validation/phase-0-validation-improved.js +490 -0
- package/scripts/validation/phase-0-validation-test.js +65 -0
- package/scripts/validation/phase-1-consensus-report.cjs +342 -0
- package/scripts/validation/phase-1-consensus-validation.cjs +551 -0
- package/scripts/validation/phase-1-consensus-validation.js +551 -0
- package/scripts/validation/phase-2-consensus-report.cjs +186 -0
- package/scripts/validation/phase-2-validation.cjs +171 -0
- package/scripts/validation/phase-2-validation.js +171 -0
- package/scripts/validation/phase-4-consensus-report.js +181 -0
- package/scripts/validation/phase-4-final-validation.js +351 -0
- package/scripts/validation/phase-5-consensus-report.cjs +113 -0
- package/scripts/validation/phase-5-consensus-report.js +113 -0
- package/scripts/validation/security-analysis.js +49 -0
- package/scripts/validation/security-validation.js +492 -0
- package/scripts/validation/simple-security-validation.js +464 -0
- package/scripts/verify-installation.js +44 -14
- package/src/cli/simple-commands/init/templates/CLAUDE-backup-pre-enterprise-loop.md +735 -0
- package/src/cli/simple-commands/init/templates/CLAUDE.md +176 -326
- package/src/observability/blocking-coordination-metrics.js +161 -0
- package/src/observability/prometheus-metrics.d.ts +21 -0
- package/src/observability/prometheus-metrics.js +280 -0
- package/wiki/tutorials/beginner/04-quality-testing.md +3 -3
- package/.claude/agents/analyst.md +0 -300
- package/.claude/agents/architect.md +0 -558
- package/.claude/agents/base-template-generator.md +0 -65
- package/.claude/agents/coder.md +0 -181
- package/.claude/agents/planner.md +0 -135
- package/.claude/agents/researcher.md +0 -185
- package/.claude/agents/reviewer.md +0 -293
- package/.claude/agents/task-coordinator.md +0 -126
- package/.claude/agents/tester.md +0 -664
- package/MCP_DEPRECATION_COMPLETE.md +0 -375
- package/V2.0.0_READY_FOR_PUBLICATION.md +0 -417
- package/V2_RELEASE_SUMMARY.md +0 -568
- package/docs/DEPLOYMENT.md +0 -523
- package/docs/TROUBLESHOOTING.md +0 -1388
- package/docs/agent-token-analysis-results.json +0 -1329
- package/docs/architecture/agent-lifecycle-implementation-plan.md +0 -1428
- package/templates/custom-agent/package.json +0 -26
- package/templates/event-bus/package.json +0 -28
- package/templates/fleet-manager/package.json +0 -28
- /package/.claude/{agents → agents-ignore}/benchmarking-tests/test-agent-code-heavy.md +0 -0
- /package/.claude/{agents → agents-ignore}/benchmarking-tests/test-agent-metadata.md +0 -0
- /package/.claude/{agents → agents-ignore}/benchmarking-tests/test-agent-minimal.md +0 -0
- /package/.claude/{agents/coordinator.md → agents-ignore/coordinator-backup.md} +0 -0
- /package/.claude/{agents → agents-ignore}/data/ml/data-ml-model.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/code-review-swarm.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/github-modes.md +0 -0
- /package/.claude/{agents/templates → agents-ignore/github}/github-pr-manager.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/github-specialist.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/issue-tracker.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/multi-repo-swarm.md +0 -0
- /package/.claude/{agents/devops/ci-cd → agents-ignore/github}/ops-cicd-github.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/pr-manager.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/project-board-sync.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/release-manager.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/release-swarm.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/repo-architect.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/swarm-issue.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/swarm-pr.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/sync-coordinator.md +0 -0
- /package/.claude/{agents → agents-ignore}/github/workflow-automation.md +0 -0
- /package/.claude/{agents → agents-ignore}/neural/neural-pattern-agent.md +0 -0
- /package/.claude/{agents → agents-ignore}/neural/safla-neural.md +0 -0
- /package/.claude/{agents → agents-ignore}/optimization/benchmark-suite.md +0 -0
- /package/.claude/{agents → agents-ignore}/optimization/load-balancer.md +0 -0
- /package/.claude/{agents → agents-ignore}/optimization/perf-analyzer.md +0 -0
- /package/.claude/{agents → agents-ignore}/optimization/performance-monitor.md +0 -0
- /package/.claude/{agents → agents-ignore}/optimization/resource-allocator.md +0 -0
- /package/.claude/{agents → agents-ignore}/optimization/topology-optimizer.md +0 -0
- /package/.claude/{agents → agents-ignore}/sublinear/consciousness-evolution-agent.md +0 -0
- /package/.claude/{agents → agents-ignore}/sublinear/matrix-solver-agent.md +0 -0
- /package/.claude/{agents → agents-ignore}/sublinear/nanosecond-scheduler-agent.md +0 -0
- /package/.claude/{agents → agents-ignore}/sublinear/pagerank-agent.md +0 -0
- /package/.claude/{agents → agents-ignore}/sublinear/phi-calculator-agent.md +0 -0
- /package/.claude/{agents → agents-ignore}/sublinear/psycho-symbolic-agent.md +0 -0
- /package/.claude/{agents → agents-ignore}/sublinear/sublinear.md +0 -0
- /package/.claude/{agents → agents-ignore}/sublinear/temporal-advantage-agent.md +0 -0
- /package/.claude/{agents/architecture → agents-ignore}/system-design/arch-system-design.md +0 -0
- /package/.claude/{agents → agents-ignore}/templates/automation-smart-agent.md +0 -0
- /package/.claude/{agents → agents-ignore}/templates/coordinator-swarm-init.md +0 -0
- /package/.claude/{agents → agents-ignore}/templates/implementer-sparc-coder.md +0 -0
- /package/.claude/{agents → agents-ignore}/templates/memory-coordinator.md +0 -0
- /package/.claude/{agents → agents-ignore}/templates/migration-plan.md +0 -0
- /package/.claude/{agents → agents-ignore}/templates/orchestrator-task.md +0 -0
- /package/.claude/{agents → agents-ignore}/templates/performance-analyzer.md +0 -0
- /package/.claude/{agents → agents-ignore}/templates/sparc-coordinator.md +0 -0
- /package/{.claude/agents/specialized → .claude-flow-novice/.claude/agents/agent-principles}/CODER_AGENT_GUIDELINES.md +0 -0
- /package/docs/{API.md → api/API.md} +0 -0
- /package/docs/{CONFIGURATION.md → api/CONFIGURATION.md} +0 -0
- /package/docs/{PROVIDER_ROUTING_CONFIGURATION.md → api/PROVIDER_ROUTING_CONFIGURATION.md} +0 -0
- /package/docs/{PROVIDER_ROUTING_VERIFICATION.md → api/PROVIDER_ROUTING_VERIFICATION.md} +0 -0
- /package/docs/{ROUTING_FLOW_DIAGRAM.md → api/ROUTING_FLOW_DIAGRAM.md} +0 -0
- /package/{AGENT_PERFORMANCE_GUIDELINES.md → docs/architecture/AGENT_PERFORMANCE_GUIDELINES.md} +0 -0
- /package/docs/{EVENTEMITTER_CLEANUP_PATTERN.md → architecture/EVENTEMITTER_CLEANUP_PATTERN.md} +0 -0
- /package/docs/{REDIS_COORDINATION_SYSTEM.md → architecture/REDIS_COORDINATION_SYSTEM.md} +0 -0
- /package/docs/{SYSTEM_ARCHITECTURE.md → architecture/SYSTEM_ARCHITECTURE.md} +0 -0
- /package/docs/{consensus → architecture/consensus}/QUORUM_VERIFICATION_GUIDE.md +0 -0
- /package/docs/{consensus → architecture/consensus}/README.md +0 -0
- /package/docs/{consensus → architecture/consensus}/consensus-verification-1758747665635.json +0 -0
- /package/docs/{agents → archive/2025-10-10-architecture/agent-subdirectory}/MIGRATION_SUMMARY.md +0 -0
- /package/docs/{agents → archive/2025-10-10-architecture/agent-subdirectory}/README.md +0 -0
- /package/docs/{agent-booster-architecture.md → archive/2025-10-10-architecture/agent-subdirectory/agent-booster-architecture.md} +0 -0
- /package/docs/{agent-prompt-guidelines.md → archive/2025-10-10-architecture/agent-subdirectory/agent-prompt-guidelines.md} +0 -0
- /package/docs/{agent-token-usage-analysis-report.md → archive/2025-10-10-architecture/agent-subdirectory/agent-token-usage-analysis-report.md} +0 -0
- /package/docs/{agents → archive/2025-10-10-architecture/agent-subdirectory}/consensus-README.md +0 -0
- /package/docs/{agents → archive/2025-10-10-architecture/agent-subdirectory}/dependency-tracking-examples.md +0 -0
- /package/docs/{agents → archive/2025-10-10-architecture/agent-subdirectory}/optimization-README.md +0 -0
- /package/docs/{agents → archive/2025-10-10-architecture/agent-subdirectory}/swarm-README.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/CONSENSUS-COMPARISON.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/ROUND-5-EXECUTIVE-SUMMARY.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/consolidated-consensus-report.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-consensus-round-2.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-3-consensus-summary.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-3-validator-1.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-3-validator-2.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-3-validator-3.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-3-validator-4.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-4-consensus-summary.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-4-validator-1.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-4-validator-2.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-4-validator-3.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-4-validator-4.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-round-5-final-consensus.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/fullstack-swarm-consensus-report.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/post-edit-consensus-round-2.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/raft-implementation-summary.md +0 -0
- /package/docs/{consensus → archive/2025-10-10-architecture/consensus-rounds}/verification-summary.md +0 -0
- /package/docs/{comprehensive-mcp-solution-architecture.md → archive/2025-10-10-architecture/deprecated-implementations/comprehensive-mcp-solution-architecture.md} +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture}/experimental/ExperimentalFeaturesArchitecture.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/frontend-specific}/frontend-agent-ecosystem-integration.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/frontend-specific}/frontend-agent-technical-decisions.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/frontend-specific}/frontend-backend-coordination-interfaces.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/frontend-specific}/react-frontend-agent-specification.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/github-specific}/github-agent-consolidation-architecture.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/github-specific}/github-architecture-diagrams.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/implementation-guides}/agent-lifecycle-implementation-guide.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/implementation-guides}/implementation-guide.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/implementation-guides}/implementation-specifications.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/implementation-guides}/integration-guide.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/implementation-guides}/performance-optimization-guide.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/old-summaries}/architecture-summary-report.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/old-summaries}/fullstack-swarm-implementation-summary.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/old-summaries}/ultra-fast-communication-summary.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/other-designs}/agent-discovery-registration-system.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/other-designs}/agent-lifecycle-management-architecture.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/other-designs}/architectural-decisions.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/other-designs}/architecture-decision-records.md +0 -0
- /package/docs/{claude-soul-implementation.md → archive/2025-10-10-architecture/other-designs/claude-soul-implementation.md} +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/other-designs}/file-based-cross-team-communication.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/other-designs}/full-stack-swarm-team-specification.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/other-designs}/fullstack-communication-integration.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/other-designs}/stage3-unified-system-architecture.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/other-designs}/ultra-fast-communication-bus-design.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/other-designs}/zero-latency-communication-architecture.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/specific-feature-specs}/dynamic-agent-spawning-architecture.md +0 -0
- /package/docs/{fleet-manager-design.md → archive/2025-10-10-architecture/specific-feature-specs/fleet-manager-design.md} +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/specific-feature-specs}/fleet-manager-npm-architecture.md +0 -0
- /package/docs/{help-coordinator-implementation.md → archive/2025-10-10-architecture/specific-feature-specs/help-coordinator-implementation.md} +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/specific-feature-specs}/high-performance-memory-store.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/specific-feature-specs}/intelligent-configuration-system.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/specific-feature-specs}/message-serialization-compression-strategy.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/specific-feature-specs}/priority-queue-dead-letter-design.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/specific-feature-specs}/swarm-message-router-extension-design.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/specific-feature-specs}/swarm-router-implementation-spec.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/specific-feature-specs}/user-preference-storage-design.md +0 -0
- /package/docs/{architecture → archive/2025-10-10-architecture/specific-feature-specs}/websocket-connection-scaling-design.md +0 -0
- /package/docs/{swarm-coordination-test-results.md → archive/2025-10-10-architecture/test-results/swarm-coordination-test-results.md} +0 -0
- /package/docs/{development → archive/2025-10-10-development}/COMPREHENSIVE_WORKFLOW_SYSTEM.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/DEVELOPMENT_WORKFLOW.md +0 -0
- /package/docs/{EXAMPLES.md → archive/2025-10-10-development/EXAMPLES.md} +0 -0
- /package/docs/{development → archive/2025-10-10-development}/SPARC.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/agent-scope-creep-prevention-guide.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/cargo-build-validator-summary.md +0 -0
- /package/docs/{development → archive/2025-10-10-development/cli-consolidation}/command-consolidation-technical-spec.md +0 -0
- /package/docs/{development → archive/2025-10-10-development/cli-consolidation}/consolidated-cli-implementation.md +0 -0
- /package/docs/{development → archive/2025-10-10-development/cli-consolidation}/consolidated-command-design.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/experimental-features-improvement-plan.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/feature-simplification-strategy.md +0 -0
- /package/docs/{fixes → archive/2025-10-10-development/fixes}/fullstack-swarm-fixes-round-1.md +0 -0
- /package/docs/{fixes → archive/2025-10-10-development/fixes}/fullstack-swarm-fixes-round-3.md +0 -0
- /package/docs/{fixes → archive/2025-10-10-development/fixes}/fullstack-swarm-fixes-round-4.md +0 -0
- /package/docs/{fixes → archive/2025-10-10-development/fixes}/fullstack-swarm-fixes-round-5.md +0 -0
- /package/docs/{fixes → archive/2025-10-10-development/fixes}/round-5-quick-reference.md +0 -0
- /package/docs/{fixes → archive/2025-10-10-development/fixes}/round-5-summary.md +0 -0
- /package/docs/{fixes → archive/2025-10-10-development/fixes}/round-5-visual-summary.md +0 -0
- /package/docs/{implementation → archive/2025-10-10-development/implementation}/configuration-system-specs.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/npm-packaging-solution.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/pair-optimization.md +0 -0
- /package/docs/{phase11-cli-integration-complete.md → archive/2025-10-10-development/phase-summaries/phase11-cli-integration-complete.md} +0 -0
- /package/docs/{phase4-deployment-summary.md → archive/2025-10-10-development/phase-summaries/phase4-deployment-summary.md} +0 -0
- /package/docs/{development → archive/2025-10-10-development}/rust-framework-detection.md +0 -0
- /package/docs/{SDK-INTEGRATION-TEST-SUMMARY.md → archive/2025-10-10-development/sdk-integration/SDK-INTEGRATION-TEST-SUMMARY.md} +0 -0
- /package/docs/{SDK-TESTING.md → archive/2025-10-10-development/sdk-integration/SDK-TESTING.md} +0 -0
- /package/docs/{claude-agent-sdk-integration-strategy.md → archive/2025-10-10-development/sdk-integration/claude-agent-sdk-integration-strategy.md} +0 -0
- /package/docs/{sdk-integration-phase1.md → archive/2025-10-10-development/sdk-integration/sdk-integration-phase1.md} +0 -0
- /package/docs/{sdk-migration-guide.md → archive/2025-10-10-development/sdk-integration/sdk-migration-guide.md} +0 -0
- /package/docs/{sdk-phase1-summary.md → archive/2025-10-10-development/sdk-integration/sdk-phase1-summary.md} +0 -0
- /package/docs/{swarm-fullstack → archive/2025-10-10-development/swarm-fullstack}/IMPLEMENTATION-SUMMARY.md +0 -0
- /package/docs/{swarm-fullstack → archive/2025-10-10-development/swarm-fullstack}/frontend-testing-system.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/technical-implementation-guide.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/token-tracking-guide.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/token-tracking-status.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/troubleshooting.md +0 -0
- /package/docs/{development → archive/2025-10-10-development}/typescript-distribution-solution.md +0 -0
- /package/docs/{personalization → archive/2025-10-10-guides/personalization}/cli-integration-guide.md +0 -0
- /package/docs/{phase4-ux → archive/2025-10-10-guides/phase4-ux}/error-handling-ux-guide.md +0 -0
- /package/docs/{phase4-ux → archive/2025-10-10-guides/phase4-ux}/rollout-monitoring-dashboard.md +0 -0
- /package/docs/{phase4-ux → archive/2025-10-10-guides/phase4-ux}/user-experience-validation-framework.md +0 -0
- /package/docs/{phase4-ux → archive/2025-10-10-guides/phase4-ux}/user-onboarding-experience.md +0 -0
- /package/docs/{NOVICE_USER_GUIDE.md → archive/2025-10-10-guides/setup-guides/NOVICE_USER_GUIDE.md} +0 -0
- /package/docs/{QUICK_START.md → archive/2025-10-10-guides/setup-guides/QUICK_START.md} +0 -0
- /package/docs/{SETUP_WIZARD.md → archive/2025-10-10-guides/setup-guides/SETUP_WIZARD.md} +0 -0
- /package/docs/{ZAIR_SETUP_CHECKLIST.md → archive/2025-10-10-guides/setup-guides/ZAIR_SETUP_CHECKLIST.md} +0 -0
- /package/docs/{user → archive/2025-10-10-guides/user-guides}/PREFERENCE_SYSTEM_GUIDE.md +0 -0
- /package/docs/{user → archive/2025-10-10-guides/user-guides}/USER_GUIDE.md +0 -0
- /package/docs/{user → archive/2025-10-10-guides/user-guides}/enterprise-stakeholder-guide.md +0 -0
- /package/docs/{user → archive/2025-10-10-guides/user-guides}/novice-user-guide.md +0 -0
- /package/docs/{user → archive/2025-10-10-guides/user-guides}/tutorial.md +0 -0
- /package/docs/{user → archive/2025-10-10-guides/user-guides}/ux-assessment-pain-points.md +0 -0
- /package/docs/{ux-design/mockups → archive/2025-10-10-guides/ux-design}/configuration-ui-mockups.md +0 -0
- /package/docs/{ux-design/wizards → archive/2025-10-10-guides/ux-design}/configuration-wizard-flows.md +0 -0
- /package/docs/{ux-design/ui-patterns → archive/2025-10-10-guides/ux-design}/progressive-disclosure-patterns.md +0 -0
- /package/docs/{ux-design → archive/2025-10-10-guides/ux-design}/usability-testing-plan.md +0 -0
- /package/docs/{ux-design/user-journeys → archive/2025-10-10-guides/ux-design}/user-personas-analysis.md +0 -0
- /package/docs/{ux-design/accessibility → archive/2025-10-10-guides/ux-design}/wcag-compliance-guidelines.md +0 -0
- /package/docs/{HOOK-COMPARISON.md → archive/2025-10-10-integration/HOOK-COMPARISON.md} +0 -0
- /package/docs/{POST-EDIT-PIPELINE-AGENT-INFO.md → archive/2025-10-10-integration/POST-EDIT-PIPELINE-AGENT-INFO.md} +0 -0
- /package/docs/{POST-EDIT-PIPELINE-MERGED.md → archive/2025-10-10-integration/POST-EDIT-PIPELINE-MERGED.md} +0 -0
- /package/docs/{POST-EDIT-PIPELINE-UNIFIED.md → archive/2025-10-10-integration/POST-EDIT-PIPELINE-UNIFIED.md} +0 -0
- /package/docs/{automation → archive/2025-10-10-integration/automation}/swarm-test-pipeline-strategy.md +0 -0
- /package/docs/{integration → archive/2025-10-10-integration/mcp-compatibility}/issue-772-implementation-plan.md +0 -0
- /package/docs/{mcp-backwards-compatibility.md → archive/2025-10-10-integration/mcp-compatibility/mcp-backwards-compatibility.md} +0 -0
- /package/docs/{mcp-novice-simplification.md → archive/2025-10-10-integration/mcp-compatibility/mcp-novice-simplification.md} +0 -0
- /package/docs/{slash-commands → archive/2025-10-10-integration/slash-commands}/cfn-claude-sync-usage.md +0 -0
- /package/docs/{slash-commands → archive/2025-10-10-integration/slash-commands}/cfn-loop-quick-reference.md +0 -0
- /package/docs/{slash-commands → archive/2025-10-10-integration/slash-commands}/cfn-loop-usage.md +0 -0
- /package/docs/{final-slash-commands-setup.md → archive/2025-10-10-integration/slash-commands/final-slash-commands-setup.md} +0 -0
- /package/docs/{commands → archive/2025-10-10-integration/slash-commands}/fullstack.md +0 -0
- /package/docs/{slash-commands-complete-status.md → archive/2025-10-10-integration/slash-commands/slash-commands-complete-status.md} +0 -0
- /package/docs/{slash-commands-status-report.md → archive/2025-10-10-integration/slash-commands/slash-commands-status-report.md} +0 -0
- /package/docs/{workflows → archive/2025-10-10-integration/workflows}/IMPLEMENTATION_SUMMARY.md +0 -0
- /package/docs/{workflows → archive/2025-10-10-integration/workflows}/README.md +0 -0
- /package/docs/{workflows → archive/2025-10-10-integration/workflows}/iterative-build-test-workflow.md +0 -0
- /package/docs/{DOCUMENTATION_AUTO_UPDATER_CHANGELOG.md → archive/2025-10-10-migration/deprecation-notices/DOCUMENTATION_AUTO_UPDATER_CHANGELOG.md} +0 -0
- /package/{MCP_DEPRECATION_NOTICE.md → docs/archive/2025-10-10-migration/deprecation-notices/MCP_DEPRECATION_NOTICE.md} +0 -0
- /package/docs/{migration → archive/2025-10-10-migration/deprecation-notices}/README.md +0 -0
- /package/docs/{deprecation-report.md → archive/2025-10-10-migration/deprecation-notices/deprecation-report.md} +0 -0
- /package/docs/{migration → archive/2025-10-10-migration/v2-migration}/COMPREHENSIVE_MIGRATION_GUIDE.md +0 -0
- /package/docs/{V1_TO_V2_MIGRATION.md → archive/2025-10-10-migration/v2-migration/V1_TO_V2_MIGRATION.md} +0 -0
- /package/{V2_MIGRATION_GUIDE.md → docs/archive/2025-10-10-migration/v2-migration/V2_MIGRATION_GUIDE.md} +0 -0
- /package/docs/{migration → archive/2025-10-10-migration/v2-migration}/migration-assessment-toolkit.md +0 -0
- /package/docs/{npm-package-updates.md → archive/2025-10-10-migration/v2-migration/npm-package-updates.md} +0 -0
- /package/docs/{migration → archive/2025-10-10-migration/v2-migration}/proven-migration-case-studies.md +0 -0
- /package/docs/{APM_INTEGRATION_GUIDE.md → archive/2025-10-10-operations/APM_INTEGRATION_GUIDE.md} +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/DEPLOYMENT.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/ENABLE_AUTHENTICATION.md +0 -0
- /package/docs/{HOW_METRICS_WORK.md → archive/2025-10-10-operations/HOW_METRICS_WORK.md} +0 -0
- /package/docs/{METRICS_PLACEMENT_STRATEGY.md → archive/2025-10-10-operations/METRICS_PLACEMENT_STRATEGY.md} +0 -0
- /package/docs/{PRODUCTION_OPERATIONS.md → archive/2025-10-10-operations/PRODUCTION_OPERATIONS.md} +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/RESOURCE_MANAGEMENT_IMPLEMENTATION_PLAN.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/RESOURCE_MANAGEMENT_TECHNICAL_SPECS.md +0 -0
- /package/docs/{SESSION_CLEANUP_SYSTEM.md → archive/2025-10-10-operations/SESSION_CLEANUP_SYSTEM.md} +0 -0
- /package/docs/{V2_TRANSPARENCY_SYSTEM.md → archive/2025-10-10-operations/V2_TRANSPARENCY_SYSTEM.md} +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/analytics-system.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/benchmarks}/benchmark-claude-flow-conflict-analysis.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/benchmarks}/benchmark-cleanup-analysis.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/benchmarks}/build-artifacts-analysis.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/byzantine-consensus}/FINAL_BYZANTINE_CONSENSUS_VERIFICATION_REPORT.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/byzantine-consensus}/byzantine-consensus-verification-report-phase2.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/byzantine-consensus}/byzantine-consensus-verification-report-phase4.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/chrome-mcp-research-report.md +0 -0
- /package/docs/{ci-cd → archive/2025-10-10-operations/ci-cd}/README.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/cli-command-consolidation-analysis.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/DEPLOYMENT_GUIDE.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/DEPLOYMENT_STRATEGIES.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/DISASTER_RECOVERY.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/DOCKER_SECURITY.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/HELM_CHARTS.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/INFRASTRUCTURE_AS_CODE.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/MONITORING_OBSERVABILITY.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/PERFORMANCE_OPTIMIZATION.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/README.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/pm2-setup.md +0 -0
- /package/docs/{deployment → archive/2025-10-10-operations/deployment}/production-deployment-guide.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/deployment-checklist.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/deployment-report.md +0 -0
- /package/docs/{metrics-counter-usage.md → archive/2025-10-10-operations/metrics-counter-usage.md} +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/migration-strategy.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/performance-analysis}/agent-analysis-report.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/performance-analysis}/agent-persistence-performance-analysis.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/performance-analysis}/performance-analysis-report.md +0 -0
- /package/docs/{runbooks → archive/2025-10-10-operations/runbooks}/DATABASE_PERFORMANCE_RUNBOOK.md +0 -0
- /package/docs/{runbooks → archive/2025-10-10-operations/runbooks}/EMERGENCY_RESPONSE_PROCEDURES.md +0 -0
- /package/docs/{runbooks → archive/2025-10-10-operations/runbooks}/SERVICE_OUTAGE_RUNBOOK.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/shadcn-mcp-swarm-research-report.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/training-pipeline-demo.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations}/training-pipeline-real-only.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/COMPREHENSIVE_QA_VALIDATION_REPORT.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/PRODUCTION_VALIDATION_REPORT.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/WIKI_VALIDATION_REPORT.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/checkpoint-1-3-validation-report.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/checkpoint-1-4-validation-summary.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/cli-validation-report.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/command-consolidation-usability-validation.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/configuration-system-validation-report.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/experimental-features-validation-report.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/final-validation-summary.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/unified-config-validation-report.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/validation-executive-summary.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/validator-scope-overreach-analysis.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/verification-integration.md +0 -0
- /package/docs/{operations → archive/2025-10-10-operations/validation-reports}/verification-validation.md +0 -0
- /package/docs/{performance → archive/2025-10-10-performance}/COMPREHENSIVE_SQLITE_ANALYSIS.md +0 -0
- /package/docs/{LRU_GARBAGE_COLLECTION.md → archive/2025-10-10-performance/LRU_GARBAGE_COLLECTION.md} +0 -0
- /package/docs/{OPTIMIZATION_SAFETY_REPORT.md → archive/2025-10-10-performance/OPTIMIZATION_SAFETY_REPORT.md} +0 -0
- /package/docs/{performance → archive/2025-10-10-performance}/Phase3-Remediation-Report.md +0 -0
- /package/docs/{benchmark-realistic-code-generation.md → archive/2025-10-10-performance/benchmarks/benchmark-realistic-code-generation.md} +0 -0
- /package/docs/{benchmark-rust-known-issues.md → archive/2025-10-10-performance/benchmarks/benchmark-rust-known-issues.md} +0 -0
- /package/docs/{benchmark-rust-support-summary.md → archive/2025-10-10-performance/benchmarks/benchmark-rust-support-summary.md} +0 -0
- /package/docs/{optimization → archive/2025-10-10-performance/optimization}/README.md +0 -0
- /package/docs/{optimization → archive/2025-10-10-performance/optimization}/communication-improvements.md +0 -0
- /package/docs/{performance → archive/2025-10-10-performance/optimization}/sqlite-performance-analysis.md +0 -0
- /package/docs/{security → archive/2025-10-10-security}/DEPLOYMENT_CHECKLIST.md +0 -0
- /package/docs/{security → archive/2025-10-10-security}/GIT_SECRETS_SETUP.md +0 -0
- /package/docs/{operations/SECURITY_AUDIT_REPORT.md → archive/2025-10-10-security/PACKAGE_SECURITY_AUDIT.md} +0 -0
- /package/docs/{security → archive/2025-10-10-security}/SECRET-DETECTION.md +0 -0
- /package/docs/{SECURITY_AUDIT_REPORT.md → archive/2025-10-10-security/SECRET_DETECTION_AUDIT.md} +0 -0
- /package/docs/{security → archive/2025-10-10-security/authentication}/JWT_AUTHENTICATION.md +0 -0
- /package/docs/{security → archive/2025-10-10-security/authentication}/MIGRATION_BASE64_TO_JWT.md +0 -0
- /package/docs/{security → archive/2025-10-10-security/authentication}/REDIS_AUTHENTICATION.md +0 -0
- /package/docs/{SECURITY_AUTH.md → archive/2025-10-10-security/authentication/SECURITY_AUTH.md} +0 -0
- /package/docs/{certification → archive/2025-10-10-security/certification}/FINAL-PRODUCTION-CERTIFICATION.md +0 -0
- /package/docs/{certification → archive/2025-10-10-security/certification}/README.md +0 -0
- /package/docs/{certification → archive/2025-10-10-security/certification}/fullstack-swarm-production-cert.md +0 -0
- /package/docs/{certification → archive/2025-10-10-security/certification}/post-edit-pipeline-production-cert.md +0 -0
- /package/docs/{security → archive/2025-10-10-security}/phase5-security-implementation-summary.md +0 -0
- /package/docs/{security → archive/2025-10-10-security}/sec-024-lamport-clock-implementation.md +0 -0
- /package/docs/{security → archive/2025-10-10-security/vulnerabilities}/CRYPTO_CIPHER_FIX_REPORT.md +0 -0
- /package/docs/{security → archive/2025-10-10-security/vulnerabilities}/CRYPTO_VULNERABILITY_SUMMARY.md +0 -0
- /package/docs/{security → archive/2025-10-10-security/vulnerabilities}/cve-2025-005-006-implementation.md +0 -0
- /package/docs/{security → archive/2025-10-10-security/vulnerabilities}/rbac-test-bypass-fix.md +0 -0
- /package/docs/{testing → archive/2025-10-10-testing}/README.md +0 -0
- /package/docs/{testing → archive/2025-10-10-testing}/consensus-decision-matrix.md +0 -0
- /package/docs/{testing → archive/2025-10-10-testing}/playwright-mcp-integration-guide.md +0 -0
- /package/docs/{CROSS_PLATFORM_TEST_RESULTS.md → archive/2025-10-10-testing/test-results/CROSS_PLATFORM_TEST_RESULTS.md} +0 -0
- /package/docs/{V2_MULTI_LEVEL_TEST_RESULTS.md → archive/2025-10-10-testing/test-results/V2_MULTI_LEVEL_TEST_RESULTS.md} +0 -0
- /package/docs/{backend-testing-system.md → archive/2025-10-10-testing/test-results/backend-testing-system.md} +0 -0
- /package/docs/{benchmark-test-report.md → archive/2025-10-10-testing/test-results/benchmark-test-report.md} +0 -0
- /package/docs/{testing → archive/2025-10-10-testing/test-results}/comprehensive-test-results.md +0 -0
- /package/docs/{validation → archive/2025-10-10-testing/validation}/PRODUCTION-CERTIFICATION-SUMMARY.md +0 -0
- /package/docs/{validation → archive/2025-10-10-testing/validation}/byzantine-consensus-coordination-report.md +0 -0
- /package/docs/{validation → archive/2025-10-10-testing/validation}/byzantine-consensus-summary.md +0 -0
- /package/docs/{validation → archive/2025-10-10-testing/validation}/completion-validation-verification-report.md +0 -0
- /package/docs/{validation → archive/2025-10-10-testing/validation}/fullstack-integration-report.md +0 -0
- /package/docs/{validation → archive/2025-10-10-testing/validation}/phase2-byzantine-consensus-verification-report.md +0 -0
- /package/docs/{validation → archive/2025-10-10-testing/validation}/phase2-completion-consensus-report.md +0 -0
- /package/docs/{validation → archive/2025-10-10-testing/validation}/stage5-consensus-report.md +0 -0
- /package/docs/{validation → archive/2025-10-10-testing/validation}/stage6-final-certification.md +0 -0
- /package/docs/{validation → archive/2025-10-10-testing/validation}/stage7-production-certification.md +0 -0
- /package/docs/{ERROR_HANDLING_IMPLEMENTATION_SUMMARY.md → archive/ERROR_HANDLING_IMPLEMENTATION_SUMMARY.md} +0 -0
- /package/docs/{ERROR_MESSAGES_GUIDE.md → archive/ERROR_MESSAGES_GUIDE.md} +0 -0
- /package/docs/{HTTP_POLLING_FALLBACK.md → archive/HTTP_POLLING_FALLBACK.md.backup-1760135090706} +0 -0
- /package/docs/{phase2-implementation-summary.md → archive/cfn-loop/completed-phases/phase2-implementation-summary.md} +0 -0
- /package/docs/{CFN_LOOP.md → archive/cfn-loop/deprecated-3-loop/CFN_LOOP.md} +0 -0
- /package/docs/{validation-loop-pattern.md → archive/cfn-loop/early-patterns/validation-loop-pattern.md} +0 -0
- /package/docs/{MCP_ENDPOINTS_REFERENCE.md → archive/deprecated-mcp/MCP_ENDPOINTS_REFERENCE.md} +0 -0
- /package/docs/{api → archive/deprecated-mcp}/MCP_TOOLS.md +0 -0
- /package/docs/{api → archive/deprecated-mcp}/mcp-swarm-integration-api.md +0 -0
- /package/docs/{API_AUTH.md → archive/phase3-auth-unimplemented/API_AUTH.md} +0 -0
- /package/docs/{AUTHENTICATION.md → archive/phase3-auth-unimplemented/AUTHENTICATION.md} +0 -0
- /package/docs/{AUTH_DOCUMENTATION_SUMMARY.md → archive/phase3-auth-unimplemented/AUTH_DOCUMENTATION_SUMMARY.md} +0 -0
- /package/docs/{AUTH_MIGRATION.md → archive/phase3-auth-unimplemented/AUTH_MIGRATION.md} +0 -0
- /package/docs/{phase5-booster-integration-summary.md → archive/phase5-booster-integration-summary.md} +0 -0
- /package/{CHANGELOG_V2.md → docs/archive/reference-historical/CHANGELOG_V2.md} +0 -0
- /package/docs/{INDEX.md → archive/reference-historical/INDEX.md} +0 -0
- /package/docs/{CFN_LOOP_PHASE_ORCHESTRATION.md → cfn-loop/CFN_LOOP_PHASE_ORCHESTRATION.md} +0 -0
- /package/docs/{CFN_LOOP_SCOPE_CONTROL.md → cfn-loop/CFN_LOOP_SCOPE_CONTROL.md} +0 -0
- /package/docs/{CFN_LOOP_SELF_LOOPING_ADDITIONS.md → cfn-loop/CFN_LOOP_SELF_LOOPING_ADDITIONS.md} +0 -0
- /package/docs/{SPRINT_ORCHESTRATION.md → cfn-loop/SPRINT_ORCHESTRATION.md} +0 -0
- /package/docs/{epic-iteration-limits-implementation.md → cfn-loop/epic-iteration-limits-implementation.md} +0 -0
- /package/docs/{phase-5-sprint-5.2-multi-level-control.md → cfn-loop/phase-5-sprint-5.2-multi-level-control.md} +0 -0
- /package/docs/{phase-orchestrator-sprint-enhancement-summary.md → cfn-loop/phase-orchestrator-sprint-enhancement-summary.md} +0 -0
- /package/docs/{phases → cfn-loop/phases}/PHASE_06_ARCHITECTURE_SUMMARY.md +0 -0
- /package/docs/{phases → cfn-loop/phases}/PHASE_06_COMPONENT_INTERFACES.md +0 -0
- /package/docs/{phases → cfn-loop/phases}/PHASE_06_INTEGRATION_STRATEGY.md +0 -0
- /package/docs/{phases → cfn-loop/phases}/PHASE_06_MESH_COORDINATION_ARCHITECTURE.md +0 -0
- /package/docs/{phases → cfn-loop/phases}/PHASE_06_README.md +0 -0
- /package/docs/{phases → cfn-loop/phases}/PHASE_07_HELP_SYSTEM_ARCHITECTURE.md +0 -0
- /package/docs/{phases → cfn-loop/phases}/PHASE_0_SDK_FOUNDATION.md +0 -0
- /package/docs/{phases → cfn-loop/phases}/phase-05-architecture.md +0 -0
- /package/docs/{self-validating-loops-implementation.md → cfn-loop/self-validating-loops-implementation.md} +0 -0
- /package/{CHANGELOG.md → docs/reference/CHANGELOG.md} +0 -0
- /package/{NPM_PACKAGE_CONTENTS.md → docs/reference/NPM_PACKAGE_CONTENTS.md} +0 -0
- /package/{README-NPM.md → docs/reference/README-NPM.md} +0 -0
- /package/docs/{SITE_MAP.md → reference/SITE_MAP.md} +0 -0
- /package/docs/{research → reference/research}/AGENT_ACCESSIBILITY_GUIDE.md +0 -0
- /package/docs/{research → reference/research}/AGENT_PERMISSION_SYSTEM_ANALYSIS.md +0 -0
- /package/docs/{research → reference/research}/CLAUDE_AGENT_SDK_COMPREHENSIVE_ANALYSIS.md +0 -0
- /package/docs/{research → reference/research}/CLAUDE_AGENT_SDK_EXECUTIVE_SUMMARY.md +0 -0
- /package/docs/{research → reference/research}/CLEANUP_CRITERIA_QUICK_REFERENCE.md +0 -0
- /package/docs/{research → reference/research}/claude-session-cpu-behavior-analysis.md +0 -0
- /package/docs/{research → reference/research}/completion-validation-research.md +0 -0
- /package/docs/{templates → reference/templates}/PHASE_DOCUMENT_TEMPLATE.md +0 -0
- /package/docs/{templates → reference/templates}/PHASE_TEMPLATE_USAGE_GUIDE.md +0 -0
- /package/docs/{TEMPLATE_CUSTOMIZATION_GUIDE.md → reference/templates/TEMPLATE_CUSTOMIZATION_GUIDE.md} +0 -0
- /package/docs/{TEMPLATE_EXAMPLES_AND_BEST_PRACTICES.md → reference/templates/TEMPLATE_EXAMPLES_AND_BEST_PRACTICES.md} +0 -0
- /package/docs/{TEMPLATE_SYSTEM_DOCUMENTATION.md → reference/templates/TEMPLATE_SYSTEM_DOCUMENTATION.md} +0 -0
- /package/docs/{wiki → reference/wiki}/background-commands.md +0 -0
- /package/docs/{wiki → reference/wiki}/efficiency-patterns-and-anti-patterns.md +0 -0
- /package/docs/{wiki → reference/wiki}/monitoring-and-metrics-guide.md +0 -0
- /package/docs/{wiki → reference/wiki}/performance-benchmarking-tools.md +0 -0
- /package/docs/{wiki → reference/wiki}/performance-optimization-strategies.md +0 -0
- /package/docs/{wiki → reference/wiki}/performance-testing-framework.md +0 -0
- /package/docs/{wiki → reference/wiki}/resource-optimization-techniques.md +0 -0
- /package/docs/{wiki → reference/wiki}/scalability-guidelines.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/README.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/authentication-authorization-strategies.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/compliance-automation-workflows.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/compliance-frameworks-integration.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/enterprise-security-patterns.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/incident-response-guide.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/secrets-management-guide.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/secure-coding-patterns.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/security-best-practices.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/security-first-development-workflows.md +0 -0
- /package/docs/{wiki → reference/wiki}/security/security-testing-framework.md +0 -0
- /package/docs/{wiki → reference/wiki}/session-persistence.md +0 -0
- /package/docs/{wiki → reference/wiki}/stream-chain-command.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/README.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/cli-troubleshooting.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/configuration-troubleshooting.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/debug-mode.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/error-analysis.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/linux-troubleshooting.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/log-analysis.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/macos-troubleshooting.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/mcp-troubleshooting.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/performance-troubleshooting.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/quick-reference.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting/windows-troubleshooting.md +0 -0
- /package/docs/{wiki → reference/wiki}/troubleshooting-slow-workflows.md +0 -0
- /package/docs/{CROSS_PLATFORM_VALIDATION_CONFIDENCE.json → testing/CROSS_PLATFORM_VALIDATION_CONFIDENCE.json} +0 -0
- /package/docs/{validation → testing/validation}/byzantine-validation-report.json +0 -0
- /package/{templates → examples/templates}/README.md +0 -0
- /package/{templates → examples/templates}/basic-swarm/.claude/settings.json +0 -0
- /package/{templates/basic-swarm/CLAUDE.md → examples/templates/basic-swarm/CLAUDE.md.backup-1760135091193} +0 -0
- /package/{templates → examples/templates}/basic-swarm/coordination.md +0 -0
- /package/{templates → examples/templates}/basic-swarm/memory-bank.md +0 -0
- /package/{templates → examples/templates}/basic-swarm/package.json +0 -0
- /package/{templates → examples/templates}/custom-agent/.claude/settings.json +0 -0
- /package/{templates/custom-agent/CLAUDE.md → examples/templates/custom-agent/CLAUDE.md.backup-1760135091180} +0 -0
- /package/{templates → examples/templates}/event-bus/.claude/settings.json +0 -0
- /package/{templates → examples/templates}/event-bus/CLAUDE.md +0 -0
- /package/{templates → examples/templates}/fleet-manager/.claude/settings.json +0 -0
- /package/{templates/fleet-manager/CLAUDE.md → examples/templates/fleet-manager/CLAUDE.md.backup-1760135091167} +0 -0
- /package/{docs → scripts}/agent-token-analysis.js +0 -0
@@ -0,0 +1,877 @@

# Failure Recovery Playbook

## Overview

This playbook provides step-by-step incident response procedures for blocking coordination failures in CFN Loop production environments. Each section includes symptoms, detection methods, root cause analysis, and recovery procedures.

## Incident Response Workflow

```
Detection → Triage     → Diagnosis  → Recovery → Validation → Post-Mortem
  ↓           ↓            ↓            ↓          ↓            ↓
Alerts      Severity     Root Cause   Execute    Verify       Document
            Assessment   Analysis     Playbook   Success      Learnings
```

---

## 1. Redis Connection Loss

### Symptoms

- Circuit breaker transitions to OPEN state
- Coordinators stuck in blocking state unable to send/receive signals
- Heartbeat failures spike across all coordinators
- Prometheus alert: `heartbeat_failures_total` rate >0.5/s

### Detection

**Automated Alerts**:
```
Alert: RedisConnectionLoss
Severity: CRITICAL
Trigger: heartbeat_failures_total{reason="connection"} > 0.5/s for 2min
```

**Manual Verification**:
```bash
# Check Redis connectivity from coordinator hosts
redis-cli -h redis.prod.example.com ping
# Expected: PONG

# Check coordinator logs for connection errors
kubectl logs -n cfn-loop -l app=coordinator | grep "Redis connection failed"

# Monitor circuit breaker state
curl http://coordinator-1.cfn-loop:8080/metrics | grep circuit_breaker_state
# circuit_breaker_state{state="open"} 1
```

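When checking many coordinators, it can help to reduce the metrics output to a single word. The following is a minimal sketch, assuming the gauge is exported as one sample per state with the active state set to `1` (as in the example output above). The function name `breaker_state` is hypothetical and not part of the package.

```shell
# Print the circuit breaker state whose gauge value is 1.
# Reads Prometheus text-format metrics on stdin.
# Assumption: samples look like  circuit_breaker_state{state="open"} 1
breaker_state() {
  awk '
    index($0, "circuit_breaker_state{") == 1 && $NF == 1 {
      if (match($0, /state="[^"]*"/)) {
        # Skip the 7 characters of state=" and drop the closing quote.
        print substr($0, RSTART + 7, RLENGTH - 8)
        found = 1
        exit
      }
    }
    END { if (!found) print "unknown" }
  '
}
```

Possible usage: `curl -s http://coordinator-1.cfn-loop:8080/metrics | breaker_state`.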
### Root Cause Analysis

Common causes:

1. **Redis Server Crash**: Redis process terminated or OOM killed
2. **Network Partition**: Network connectivity lost between coordinators and Redis
3. **Resource Exhaustion**: Redis max connections exceeded
4. **DNS Resolution Failure**: Redis hostname unresolvable
5. **Firewall Rules**: Security group or firewall blocking Redis port (6379)

**Diagnosis Steps**:

```bash
# 1. Check Redis server health
kubectl get pods -n redis
# Look for CrashLoopBackOff or Pending status

# 2. Verify Redis logs
kubectl logs -n redis redis-0 --tail=100
# Look for OOM, connection errors, slowlog warnings

# 3. Check network connectivity
kubectl exec -n cfn-loop coordinator-1 -- ping redis.prod.example.com

# 4. Verify Redis connections
redis-cli -h redis.prod.example.com info clients
# Check connected_clients vs maxclients

# 5. Check DNS resolution
kubectl exec -n cfn-loop coordinator-1 -- nslookup redis.prod.example.com
```

### Recovery Procedure

**Step 1: Immediate Mitigation (ETA: 2 minutes)**

```bash
# Option A: Restart Redis (if crashed)
kubectl rollout restart statefulset/redis -n redis

# Option B: Scale Redis replicas (if resource exhaustion)
kubectl scale statefulset/redis -n redis --replicas=3

# Option C: Clear Redis connections (if max connections exceeded)
redis-cli -h redis.prod.example.com CLIENT KILL TYPE normal SKIPME yes
```

**Step 2: Verify Redis Health (ETA: 1 minute)**

```bash
# Check Redis is accepting connections
redis-cli -h redis.prod.example.com ping
# Expected: PONG

# Verify replication status
redis-cli -h redis.prod.example.com info replication
# role:master
# connected_slaves:2

# Check memory usage
redis-cli -h redis.prod.example.com info memory
# used_memory_human should be <80% of maxmemory
```

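The 80% memory check in Step 2 is easier to apply as a number than by reading `used_memory_human`. The sketch below computes the percentage from the byte-valued `used_memory` and `maxmemory` fields of `INFO memory`. The helper name `memory_usage_percent` is hypothetical; the field names are standard Redis `INFO` fields.

```shell
# Print used_memory as a percentage of maxmemory, from `INFO memory` text on stdin.
# Prints "unlimited" when maxmemory is 0 (no limit configured).
memory_usage_percent() {
  # redis-cli output uses CRLF line endings, so strip carriage returns first.
  tr -d '\r' | awk -F: '
    $1 == "used_memory" { used = $2 }
    $1 == "maxmemory"   { max = $2 }
    END {
      if (max + 0 == 0) { print "unlimited"; exit }
      printf "%.1f\n", used * 100 / max
    }
  '
}
```

Possible usage: `redis-cli -h redis.prod.example.com info memory | memory_usage_percent`, then compare the result against 80.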
**Step 3: Restart Coordinators (ETA: 3 minutes)**

```bash
# Gracefully restart coordinators to reconnect to Redis
kubectl rollout restart deployment/coordinator -n cfn-loop

# Monitor rollout status
kubectl rollout status deployment/coordinator -n cfn-loop

# Wait for all coordinators to become ready
kubectl wait --for=condition=ready pod -l app=coordinator -n cfn-loop --timeout=180s
```

**Step 4: Verify Circuit Breaker Recovery (ETA: 2 minutes)**

```bash
# Check circuit breaker state transitions to CLOSED
for i in {1..10}; do
  curl -s http://coordinator-$i.cfn-loop:8080/metrics | grep circuit_breaker_state
  sleep 1
done
# Expected: circuit_breaker_state{state="closed"} 1

# Verify heartbeats resuming
redis-cli -h redis.prod.example.com --scan --pattern "blocking:heartbeat:*" | wc -l
# Should match number of active coordinators
```

**Step 5: Resume Blocking Operations (ETA: 1 minute)**

```bash
# Trigger signal retry for stuck coordinators
kubectl exec -n cfn-loop coordinator-1 -- \
  curl -X POST http://localhost:8080/api/signals/retry-failed

# Monitor ACK delivery latency
curl -s http://coordinator-1.cfn-loop:8080/metrics | \
  grep signal_delivery_latency_seconds_bucket
```

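Several of the checks in the steps above (the Redis `ping`, the circuit breaker state) may not pass on the first try while services are still starting. A small polling helper can make that explicit. This is an illustrative sketch only; `wait_until` is a hypothetical name and the attempt counts are placeholders.

```shell
# wait_until ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
# Runs COMMAND until it exits 0; returns 1 once ATTEMPTS are used up.
wait_until() {
  attempts=$1
  delay=$2
  shift 2
  n=1
  while [ "$n" -le "$attempts" ]; do
    if "$@"; then
      return 0
    fi
    n=$((n + 1))
    # Sleep only if another attempt will follow.
    [ "$n" -le "$attempts" ] && sleep "$delay"
  done
  return 1
}
```

For example, `wait_until 30 2 redis-cli -h redis.prod.example.com ping` would retry the connectivity check for up to about a minute, assuming `redis-cli` exits non-zero when it cannot connect.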
### Validation

Confirm recovery with these checks:

- [ ] Redis responds to PING commands
- [ ] Circuit breaker state is CLOSED across all coordinators
- [ ] Heartbeat failures rate returns to baseline (<0.01/s)
- [ ] All coordinators report READY status
- [ ] Signal delivery latency P95 <5s
- [ ] No stale heartbeat warnings in last 5 minutes

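The heartbeat failure rate in the checklist is a per-second rate derived from a counter. If Prometheus is not at hand, the rate can be approximated from two scrapes of `heartbeat_failures_total` taken a known number of seconds apart. The sketch below shows the arithmetic; `counter_rate` is a hypothetical helper name.

```shell
# counter_rate PREV_VALUE CURR_VALUE INTERVAL_SECONDS
# Prints the per-second increase of a monotonically increasing counter.
counter_rate() {
  awk -v prev="$1" -v curr="$2" -v dt="$3" 'BEGIN {
    if (dt <= 0) { print "invalid interval"; exit 1 }
    delta = curr - prev
    if (delta < 0) delta = curr   # counter reset (e.g. process restart)
    printf "%.3f\n", delta / dt
  }'
}
```

With samples of 100 and 130 taken 60 seconds apart, the rate is 30 / 60 = 0.5 failures per second, well above the 0.01/s baseline.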
### Rollback Procedure

If recovery fails, roll back to the previous stable state:

```bash
# Revert to previous Redis version
kubectl rollout undo statefulset/redis -n redis

# Restore Redis data from backup (last good backup)
# Note: `redis-cli --rdb <file>` saves a snapshot of the running server to <file>;
# it does not load one, and pointing it at /data/dump.rdb would overwrite that file.
# To restore, place the backup RDB in the Redis data directory as dump.rdb while
# the server is stopped, then start Redis so it loads the file at boot
# (with AOF enabled, Redis loads the AOF instead).

# Scale coordinators to 0 to prevent further failures
kubectl scale deployment/coordinator -n cfn-loop --replicas=0
```

---

## 2. Dead Coordinator Detection

### Symptoms

- Coordinator heartbeat not updated >120s (2 minutes)
- No ACK received from coordinator for multiple signals
- Timeout handler logs show coordinator flagged as dead
- Prometheus metric: `timeout_events_total{reason="heartbeat"}` increments

### Detection

**Automated Alerts**:
```
Alert: DeadCoordinatorDetected
Severity: WARNING
Trigger: timeout_events_total{reason="heartbeat"} > 0 for 1min
```

**Manual Verification**:
```bash
# Check coordinator heartbeat age
redis-cli -h redis.prod.example.com get blocking:heartbeat:coordinator-1 | jq .timestamp
# Calculate age: Date.now() - timestamp

# Verify coordinator process status
kubectl get pods -n cfn-loop -l coordinator-id=coordinator-1

# Check coordinator logs for errors
kubectl logs -n cfn-loop coordinator-1 --tail=100
```

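The age calculation above can be sketched as a small shell function. It assumes the heartbeat `timestamp` is in milliseconds since the epoch (as `Date.now()` returns) and uses the 120-second threshold from the symptoms list as its default. The name `heartbeat_status` is hypothetical.

```shell
# heartbeat_status TIMESTAMP_MS NOW_SECONDS [THRESHOLD_SECONDS]
# Prints "alive <age>" or "dead <age>"; prints "missing" when no timestamp exists.
heartbeat_status() {
  ts_ms=$1
  now_s=$2
  threshold=${3:-120}
  # An absent key yields an empty string; jq prints "null" for a missing field.
  if [ -z "$ts_ms" ] || [ "$ts_ms" = "null" ]; then
    echo "missing"
    return 0
  fi
  age=$(( now_s - ts_ms / 1000 ))
  if [ "$age" -gt "$threshold" ]; then
    echo "dead $age"
  else
    echo "alive $age"
  fi
}
```

A possible call is `heartbeat_status "$TIMESTAMP" "$(date +%s)"`, where `TIMESTAMP` holds the value extracted with `jq -r .timestamp`.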
### Root Cause Analysis

Common causes:

1. **Process Crash**: Coordinator process terminated unexpectedly
2. **Hung Process**: Coordinator process alive but unresponsive (deadlock, infinite loop)
3. **Resource Starvation**: CPU/memory exhaustion preventing heartbeat updates
4. **Network Partition**: Coordinator isolated from Redis
5. **Kubernetes Eviction**: Node pressure caused pod eviction

**Diagnosis Steps**:

```bash
# 1. Check pod status
kubectl describe pod coordinator-1 -n cfn-loop
# Look for OOMKilled, Evicted, CrashLoopBackOff

# 2. Verify process is running
kubectl exec -n cfn-loop coordinator-1 -- ps aux | grep node
# Check if coordinator process exists

# 3. Check resource usage
kubectl top pod coordinator-1 -n cfn-loop
# Compare against pod limits

# 4. Verify network connectivity to Redis
kubectl exec -n cfn-loop coordinator-1 -- ping redis.prod.example.com

# 5. Check coordinator metrics
curl http://coordinator-1.cfn-loop:8080/metrics
# If no response, coordinator is hung or crashed
```

### Recovery Procedure

**Step 1: Confirm Coordinator Death (ETA: 30 seconds)**

```bash
# Check heartbeat age (the stored timestamp is in milliseconds)
HEARTBEAT=$(redis-cli -h redis.prod.example.com get blocking:heartbeat:coordinator-1)
TIMESTAMP=$(echo "$HEARTBEAT" | jq -r .timestamp)
AGE=$(($(date +%s) - ($TIMESTAMP / 1000)))

if [ "$AGE" -gt 120 ]; then
  echo "Coordinator confirmed dead: heartbeat age ${AGE}s"
fi
```

**Step 2: Trigger Automatic Cleanup (ETA: 1 minute)**

The timeout handler automatically cleans up dead coordinator state:

```bash
# Monitor cleanup progress
kubectl logs -n cfn-loop timeout-handler -f | grep "Dead coordinator escalation"

# Verify cleanup completed
redis-cli -h redis.prod.example.com --scan --pattern "blocking:heartbeat:coordinator-1"
# Should return no keys

redis-cli -h redis.prod.example.com --scan --pattern "blocking:ack:coordinator-1:*"
# Should return no keys
```

**Step 3: Spawn Replacement Coordinator (ETA: 2 minutes)**

```bash
# Automatic spawn is triggered by timeout handler
# Verify spawn request created
redis-cli -h redis.prod.example.com --scan --pattern "coordinator:spawn:*" | head -1
# coordinator:spawn:coordinator-1234567890-abc123

# Monitor new coordinator startup
kubectl get pods -n cfn-loop -l coordinator-id=coordinator-1234567890-abc123 -w

# Wait for new coordinator to become ready
kubectl wait --for=condition=ready pod -l coordinator-id=coordinator-1234567890-abc123 -n cfn-loop --timeout=120s
```

**Step 4: Verify Work Transfer (ETA: 30 seconds)**

```bash
# Check work items transferred to new coordinator
# (head -1 keeps a single spawn key, since GET accepts exactly one key)
NEW_COORD_ID=$(redis-cli -h redis.prod.example.com --scan --pattern "coordinator:spawn:*" | \
  head -1 | xargs redis-cli -h redis.prod.example.com get | jq -r .newCoordinatorId)

redis-cli -h redis.prod.example.com --scan --pattern "coordinator:work:*:${NEW_COORD_ID}:*"
# Should show transferred work items

# Verify transfer metadata
redis-cli -h redis.prod.example.com get "coordinator:work:swarm-123:${NEW_COORD_ID}:task-1" | \
  jq '.transferredFrom, .transferredAt'
```

**Step 5: Resume Blocked Operations (ETA: 1 minute)**

```bash
# Notify parent coordinator of replacement
SWARM_ID=$(redis-cli -h redis.prod.example.com get "coordinator:escalation:coordinator-1" | jq -r .swarmId)
redis-cli -h redis.prod.example.com publish "swarm:${SWARM_ID}:coordinator:replacement" \
  "{\"oldCoordinatorId\":\"coordinator-1\",\"newCoordinatorId\":\"${NEW_COORD_ID}\"}"

# Monitor parent coordinator acknowledgment
kubectl logs -n cfn-loop parent-coordinator -f | grep "Replacement coordinator acknowledged"
```

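The hand-escaped JSON in the `publish` command above is easy to get wrong. One option is to build the message with `printf` and reject IDs that would need JSON escaping. This is a sketch under the assumption that coordinator IDs contain only letters, digits, underscores, and hyphens (as in `coordinator-1234567890-abc123`); `replacement_payload` is a hypothetical helper.

```shell
# replacement_payload OLD_ID NEW_ID
# Prints the JSON message body for the replacement notification.
# Assumption: valid IDs match [A-Za-z0-9_-]+, so no JSON escaping is needed.
replacement_payload() {
  for id in "$1" "$2"; do
    case $id in
      ''|*[!A-Za-z0-9_-]*)
        echo "invalid coordinator id: $id" >&2
        return 1
        ;;
    esac
  done
  printf '{"oldCoordinatorId":"%s","newCoordinatorId":"%s"}\n' "$1" "$2"
}
```

The output could then be passed as the message argument, for example `"$(replacement_payload coordinator-1 "$NEW_COORD_ID")"`.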
### Validation

Confirm recovery with these checks:

- [ ] Dead coordinator heartbeat key removed from Redis
- [ ] Dead coordinator ACK keys removed from Redis
- [ ] New coordinator spawned and READY
- [ ] Work items transferred to new coordinator
- [ ] Parent coordinator acknowledged replacement
- [ ] New coordinator sending heartbeats
- [ ] Escalation record created in Redis

### Manual Intervention

If automatic recovery fails:

```bash
# 1. Manually kill dead coordinator pod
kubectl delete pod coordinator-1 -n cfn-loop --force --grace-period=0

# 2. Manually spawn replacement
kubectl apply -f - <<EOF
apiVersion: v1
kind: Pod
metadata:
  name: coordinator-replacement
  namespace: cfn-loop
  labels:
    app: coordinator
spec:
  containers:
  - name: coordinator
    image: coordinator:v1.2.3
    env:
    - name: COORDINATOR_ID
      value: "coordinator-replacement"
    - name: SWARM_ID
      value: "swarm-123"
EOF

# 3. Manually transfer work
# See docs/operations/manual-work-transfer.md
```

---

|
|
365
|
+
## 3. Signal Delivery Failure
|
|
366
|
+
|
|
367
|
+
### Symptoms
|
|
368
|
+
|
|
369
|
+
- ACK not received within 5s timeout
|
|
370
|
+
- Signal retry attempts exhausted (3 attempts)
|
|
371
|
+
- Prometheus metric: `signal_delivery_latency_seconds` P99 >5s
|
|
372
|
+
- Logs show "Signal retry exhausted all attempts"

### Detection

**Automated Alerts**:
```
Alert: HighSignalLatency
Severity: WARNING
Trigger: histogram_quantile(0.99, signal_delivery_latency_seconds_bucket) > 5 for 5min
```

**Manual Verification**:
```bash
# Check signal delivery latency P99
curl -s http://coordinator-1.cfn-loop:8080/metrics | \
  grep signal_delivery_latency_seconds_bucket | \
  ./calculate-quantile.sh 0.99

# Verify signal exists in Redis
redis-cli -h redis.prod.example.com get blocking:signal:coordinator-2

# Check for failed signal records
redis-cli -h redis.prod.example.com --scan --pattern "blocking:retry:failed:*"
```
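
The `./calculate-quantile.sh` helper used in the pipeline above is not shown in this runbook. As a rough sketch of what such a helper could do, the function below estimates a quantile from cumulative bucket lines using the same linear-interpolation idea as PromQL's `histogram_quantile`. The function name, and the assumption of a single series whose buckets arrive in ascending `le` order with `+Inf` last, are illustrative and not taken from the actual script.

```bash
# Hypothetical stand-in for calculate-quantile.sh (the real script is not shown).
# Reads cumulative Prometheus bucket lines on stdin and prints an estimate of
# quantile $1. Assumes one series, buckets in ascending `le` order, +Inf last.
calculate_quantile() {
  awk -v q="$1" '
    match($0, /le="[^"]*"/) {
      n++
      bound[n] = substr($0, RSTART + 4, RLENGTH - 5)  # text between the quotes
      count[n] = $NF + 0                               # cumulative count
    }
    END {
      if (n == 0 || count[n] == 0) { print "NaN"; exit }
      rank = q * count[n]
      lower = 0; below = 0
      for (i = 1; i <= n; i++) {
        if (count[i] >= rank) {
          if (bound[i] == "+Inf") {
            # Quantile falls in the overflow bucket: report the highest finite bound
            print lower
          } else if (count[i] == below) {
            print bound[i]
          } else {
            # Interpolate linearly inside the bucket
            printf "%g\n", lower + (bound[i] - lower) * (rank - below) / (count[i] - below)
          }
          exit
        }
        lower = bound[i]; below = count[i]
      }
    }'
}
```

Under these assumptions it can be dropped into the same pipeline, for example `... | grep signal_delivery_latency_seconds_bucket | calculate_quantile 0.99`. If the metrics endpoint exposes several label combinations, the lines would need to be filtered to one series first.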

### Root Cause Analysis

Common causes:

1. **Receiver Coordinator Down**: Target coordinator crashed before ACK sent
2. **Redis Pub/Sub Latency**: High Redis load causing pub/sub delays
3. **Network Congestion**: Packet loss between coordinators and Redis
4. **ACK Signature Verification Failure**: HMAC secret mismatch
5. **Signal Overwrite**: Duplicate signal overwrote previous signal before ACK

**Diagnosis Steps**:

```bash
# 1. Check receiver coordinator health
kubectl get pods -n cfn-loop -l coordinator-id=coordinator-2

# 2. Verify Redis latency
redis-cli -h redis.prod.example.com --latency-history

# 3. Check network latency
kubectl exec -n cfn-loop coordinator-1 -- \
  ping -c 10 redis.prod.example.com | tail -1

# 4. Verify HMAC secret consistency
kubectl get secret blocking-coordination-secret -n cfn-loop -o jsonpath='{.data.secret}' | \
  base64 -d | sha256sum
# Compare across all coordinators

# 5. Check for signal overwrites
redis-cli -h redis.prod.example.com get blocking:signal:coordinator-2 | jq .messageId
# Verify messageId matches expected signal
```

### Recovery Procedure

**Step 1: Verify Signal Delivery (ETA: 30 seconds)**

```bash
# Check if signal exists in Redis
SIGNAL=$(redis-cli -h redis.prod.example.com get blocking:signal:coordinator-2)

if [ -z "$SIGNAL" ]; then
  echo "Signal missing - likely overwritten or expired"
else
  # Quote the variable so the JSON reaches jq unmodified
  echo "Signal exists: $(echo "$SIGNAL" | jq .messageId)"
fi
```

**Step 2: Retry Signal Delivery (ETA: 1 minute)**

```bash
# Trigger manual signal retry
kubectl exec -n cfn-loop coordinator-1 -- \
  curl -X POST http://localhost:8080/api/signals/retry \
  -H "Content-Type: application/json" \
  -d '{
    "receiverId": "coordinator-2",
    "signalType": "completion",
    "iteration": 3,
    "maxRetries": 3
  }'

# Monitor retry progress
kubectl logs -n cfn-loop coordinator-1 -f | grep "Signal retry"
```
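
The retry endpoint above performs the retries inside the coordinator. When retrying a command from an operator shell instead, a bounded retry loop along the following lines may help. This is a minimal sketch: the `retry` function and the `RETRY_DELAY_SECONDS` variable are hypothetical names, and the 5-second default simply mirrors the ACK timeout listed under Symptoms.

```bash
# Minimal retry sketch. RETRY_DELAY_SECONDS is a hypothetical knob (default 5,
# mirroring the 5s ACK timeout); the command to retry is supplied by the caller.
retry() {
  max_attempts="$1"; shift
  attempt=1
  while [ "$attempt" -le "$max_attempts" ]; do
    if "$@"; then
      echo "Succeeded on attempt ${attempt}"
      return 0
    fi
    echo "Attempt ${attempt}/${max_attempts} failed" >&2
    # Wait between attempts, but not after the final one
    [ "$attempt" -lt "$max_attempts" ] && sleep "${RETRY_DELAY_SECONDS:-5}"
    attempt=$((attempt + 1))
  done
  echo "Signal retry exhausted all attempts" >&2
  return 1
}
```

For example, `retry 3 redis-cli -h redis.prod.example.com ping` would try the command up to three times and return non-zero only if every attempt fails.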

**Step 3: Verify ACK Reception (ETA: 30 seconds)**

```bash
# Check if ACK received after retry
SIGNAL_ID="coordinator-1:coordinator-2:completion:3:1234567890"
ACK=$(redis-cli -h redis.prod.example.com get "blocking:ack:coordinator-2:${SIGNAL_ID}")

if [ -n "$ACK" ]; then
  # Quote the variable so the JSON reaches jq unmodified
  echo "ACK received: $(echo "$ACK" | jq .timestamp)"
else
  echo "ACK still missing - escalate to dead coordinator handling"
fi
```
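
The ACK key above embeds both the receiver and the full signal ID. As an illustration, the key can be derived from the signal ID alone, assuming the ID layout is `<sender>:<receiver>:<signalType>:<iteration>:<timestamp>`. That layout is inferred from the single example ID in this runbook and is not confirmed elsewhere, and `ack_key_for_signal` is a hypothetical helper name.

```bash
# Assumed layout, inferred from the example ID only:
#   <sender>:<receiver>:<signalType>:<iteration>:<timestamp>
ack_key_for_signal() {
  signal_id="$1"
  # Split on ":"; the receiver is the second field under the assumed layout
  receiver=$(printf '%s' "$signal_id" | cut -d: -f2)
  printf 'blocking:ack:%s:%s\n' "$receiver" "$signal_id"
}
```

With the example ID, `ack_key_for_signal "$SIGNAL_ID"` yields the same key that is passed to `redis-cli get` in Step 3.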

**Step 4: Verify HMAC Signature (ETA: 15 seconds)**

```bash
# If the ACK is received but verification fails, check the HMAC secret
kubectl get secret blocking-coordination-secret -n cfn-loop -o jsonpath='{.data.secret}' | \
  base64 -d | wc -c
# Should be 32 bytes (256 bits)

# Verify all coordinators use same secret
for i in {1..10}; do
  kubectl exec -n cfn-loop coordinator-$i -- \
    printenv BLOCKING_COORDINATION_SECRET | sha256sum
done
# All should output the same hash. Note: printenv appends a newline, so these
# hashes match each other but not the `base64 -d | sha256sum` value from diagnosis.
```
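
One pitfall when comparing the hashes above: `printenv` writes the value followed by a newline, while `base64 -d` on the Kubernetes secret emits the raw bytes, so the two digests differ even when the secret is identical. The sketch below shows the difference with two hypothetical helper functions; it assumes `sha256sum` from GNU coreutils is available.

```bash
# Hash a secret value without a trailing newline, so the result is comparable
# with `base64 -d | sha256sum` on the Kubernetes secret.
hash_exact() {
  printf '%s' "$1" | sha256sum | cut -d' ' -f1
}

# Hash the way `printenv VAR | sha256sum` does: the value plus one newline.
hash_with_newline() {
  printf '%s\n' "$1" | sha256sum | cut -d' ' -f1
}
```

To compare a coordinator's environment value against the stored secret directly, the trailing newline could be stripped first, for example with `printenv BLOCKING_COORDINATION_SECRET | tr -d '\n' | sha256sum`.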

**Step 5: Fallback to Direct Communication (ETA: 1 minute)**

If signal delivery continues to fail, bypass Redis:

```bash
# Use HTTP direct communication as fallback
kubectl exec -n cfn-loop coordinator-1 -- \
  curl -X POST http://coordinator-2.cfn-loop:8080/api/signals/receive \
  -H "Content-Type: application/json" \
  -d '{
    "signalType": "completion",
    "senderId": "coordinator-1",
    "iteration": 3,
    "payload": {}
  }'
```

### Validation

Confirm recovery with these checks:

- [ ] Signal exists in Redis at expected key
- [ ] ACK received within 5s timeout
- [ ] ACK signature verification passes
- [ ] Signal delivery latency P99 < 5s
- [ ] No failed signal records in Redis
- [ ] Retry attempts succeeded within 3 attempts

### Escalation

If signal delivery fails after all retries:

```bash
# 1. Log failed signal for manual escalation
redis-cli -h redis.prod.example.com setex \
  "blocking:retry:failed:${SIGNAL_ID}" 3600 \
  "{\"signalId\":\"${SIGNAL_ID}\",\"receiver\":\"coordinator-2\",\"attempts\":3,\"timestamp\":$(date +%s)000}"

# 2. Emit escalation event
# The '"${SIGNAL_ID}"' form closes the single-quoted JSON, expands the variable
# inside double quotes, then reopens the single quotes.
kubectl exec -n cfn-loop coordinator-1 -- \
  curl -X POST http://event-bus.cfn-loop:8080/publish \
  -H "Content-Type: application/json" \
  -d '{
    "type": "signal.delivery.failed",
    "data": {
      "signalId": "'"${SIGNAL_ID}"'",
      "receiver": "coordinator-2",
      "attempts": 3
    },
    "priority": 9
  }'

# 3. Page on-call engineer
curl -X POST https://pagerduty.com/api/incidents \
  -H "Authorization: Token ${PD_TOKEN}" \
  -H "Content-Type: application/json" \
  -d '{
    "incident": {
      "title": "Signal Delivery Failure - coordinator-2",
      "urgency": "high",
      "service": "cfn-loop-coordinators"
    }
  }'
```
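
The escalation record in step 1 is assembled with hand-escaped quotes, which is easy to get wrong. As one possible alternative, `printf` can build the same JSON shape with the variables passed as arguments. This is a sketch only: `failed_signal_json` is a hypothetical helper, and it assumes the IDs contain no quotes or backslashes (otherwise a JSON-aware tool such as `jq` would be the safer choice).

```bash
# Build the failed-signal record with printf so the shell expands the variables
# outside any JSON quoting. Assumes the IDs contain no quotes or backslashes.
# Arguments: signal ID, receiver ID, attempt count, timestamp in milliseconds.
failed_signal_json() {
  printf '{"signalId":"%s","receiver":"%s","attempts":%d,"timestamp":%s}\n' \
    "$1" "$2" "$3" "$4"
}
```

The output could then be passed as the value argument of the `setex` call, for example `"$(failed_signal_json "$SIGNAL_ID" coordinator-2 3 "$(date +%s)000")"`.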

---

## 4. Timeout Events

### Symptoms

- Coordinator blocking duration exceeds configured timeout (default: 30 minutes)
- Prometheus metric: `blocking_duration_seconds` P99 > 1800s
- Logs show "ACK wait timeout" with missing coordinators
- `on_blocking_timeout` lifecycle hook executed

### Detection

**Automated Alerts**:
```
Alert: StuckCoordinator
Severity: CRITICAL
Trigger: histogram_quantile(0.99, blocking_duration_seconds_bucket) > 1800 for 10min
```

**Manual Verification**:
```bash
# Check blocking duration P99
curl -s http://coordinator-1.cfn-loop:8080/metrics | \
  grep blocking_duration_seconds_bucket | \
  ./calculate-quantile.sh 0.99

# Identify stuck coordinators (no activity for more than 1800s)
redis-cli -h redis.prod.example.com --scan --pattern "coordinator:activity:*" | while read -r key; do
  ACTIVITY=$(redis-cli -h redis.prod.example.com get "$key")
  LAST_ACTIVITY=$(echo "$ACTIVITY" | jq -r .lastActivity)
  # lastActivity is in milliseconds; convert to seconds before subtracting
  AGE=$(($(date +%s) - ($LAST_ACTIVITY / 1000)))
  if [ "$AGE" -gt 1800 ]; then
    echo "Stuck: $key (age: ${AGE}s)"
  fi
done
```
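
The age calculation in the loop above fails with an arithmetic error if `lastActivity` is missing, because `jq -r` then prints `null`. A small guard such as the following could be used instead. It is a sketch under the assumption, taken from the loop, that timestamps are epoch milliseconds; `age_seconds` is a hypothetical helper name.

```bash
# Age in whole seconds of a millisecond epoch timestamp.
# $1 = timestamp in ms, $2 = current time in seconds (defaults to now).
# Returns non-zero for a missing or non-numeric timestamp, such as jq's "null".
age_seconds() {
  case "$1" in
    ''|*[!0-9]*) return 1 ;;
  esac
  now="${2:-$(date +%s)}"
  echo $(( now - $1 / 1000 ))
}
```

Inside the loop this might read `AGE=$(age_seconds "$LAST_ACTIVITY") || continue`, which skips keys without a usable timestamp rather than aborting the scan.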

### Root Cause Analysis

Common causes:

1. **External Dependency Timeout**: Blocking task waiting for external API that never responds
2. **Deadlock**: Circular dependency between coordinators waiting for each other
3. **Resource Contention**: Blocking task waiting for shared resource (database lock, file lock)
4. **Infinite Loop**: Bug in blocking task causing infinite loop without timeout
5. **Missed Signal**: Signal sent but never received due to Redis failure

**Diagnosis Steps**:

```bash
# 1. Check what coordinator is blocked on
kubectl logs -n cfn-loop coordinator-1 | grep "Waiting for ACKs"
# Look for list of coordinators being waited on

# 2. Verify those coordinators are alive
for coord in coordinator-2 coordinator-3; do
  kubectl get pod -n cfn-loop -l coordinator-id=$coord
done

# 3. Check for deadlock (circular wait)
# Coordinator-1 waits for Coordinator-2
# Coordinator-2 waits for Coordinator-1
redis-cli -h redis.prod.example.com get "coordinator:activity:coordinator-1" | jq .waitingFor
redis-cli -h redis.prod.example.com get "coordinator:activity:coordinator-2" | jq .waitingFor

# 4. Check external dependencies
kubectl logs -n cfn-loop coordinator-1 | grep "Calling external API"
# Verify API endpoint is responsive

# 5. Check resource locks
kubectl exec -n cfn-loop coordinator-1 -- \
  curl http://localhost:8080/api/locks/status
```
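
The circular-wait check in step 3 compares two coordinators by eye. With more coordinators involved, the same idea can be sketched as a walk over the wait-for relation. The function below is illustrative only: it assumes each coordinator's `waitingFor` holds at most one coordinator ID, and that the pairs have already been collected into `waiter waited-on` lines; `find_circular_waits` is a hypothetical name.

```bash
# Reads "waiter waited-on" pairs on stdin and prints every coordinator that is
# part of a circular wait. Assumes each coordinator waits on at most one other.
find_circular_waits() {
  awk '
    { waits[$1] = $2 }
    END {
      for (start in waits) {
        node = waits[start]
        # Follow the chain; it can be at most as long as the number of input lines
        for (steps = 0; steps < NR && node != start && (node in waits); steps++)
          node = waits[node]
        if (node == start) print start
      }
    }' | sort
}
```

For instance, feeding it `coordinator-1 coordinator-2` and `coordinator-2 coordinator-1` prints both IDs, while a coordinator that merely waits on a member of the cycle is not reported.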

### Recovery Procedure

**Step 1: Identify Blocking Root Cause (ETA: 2 minutes)**

```bash
# Get blocking state
ACTIVITY=$(redis-cli -h redis.prod.example.com get "coordinator:activity:coordinator-1")
echo "$ACTIVITY" | jq .

# Check what signal is being waited for
SIGNAL_ID=$(echo "$ACTIVITY" | jq -r .waitingForSignal)
redis-cli -h redis.prod.example.com get "blocking:signal:${SIGNAL_ID}"
```

**Step 2: Attempt Graceful Unblock (ETA: 1 minute)**

```bash
# Option A: Send missing signal manually
kubectl exec -n cfn-loop coordinator-2 -- \
  curl -X POST http://localhost:8080/api/signals/send \
  -H "Content-Type: application/json" \
  -d '{
    "receiverId": "coordinator-1",
    "signalType": "completion",
    "iteration": 3
  }'

# Option B: Trigger timeout handler to force cleanup
kubectl exec -n cfn-loop timeout-handler -- \
  curl -X POST http://localhost:8080/api/timeouts/force-check \
  -H "Content-Type: application/json" \
  -d '{"coordinatorId": "coordinator-1"}'
```

**Step 3: Force Unblock (ETA: 30 seconds)**

If graceful unblock fails, force termination:

```bash
# Kill stuck coordinator pod
kubectl delete pod coordinator-1 -n cfn-loop --force --grace-period=0

# Verify pod restarted
kubectl get pod -n cfn-loop -l coordinator-id=coordinator-1 -w
```

**Step 4: Investigate External Dependencies (ETA: 3 minutes)**

```bash
# Check external API health (takes the last field of the most recent log line)
EXTERNAL_API=$(kubectl logs -n cfn-loop coordinator-1 | \
  grep "Calling external API" | tail -1 | awk '{print $NF}')

curl -I "$EXTERNAL_API"
# Expected: HTTP 200 OK

# Check database lock status
kubectl exec -n cfn-loop postgres-0 -- \
  psql -U admin -d cfn_loop -c \
  "SELECT * FROM pg_locks WHERE granted = false;"

# Check file locks
kubectl exec -n cfn-loop coordinator-1 -- \
  lsof /data/locks/
```

**Step 5: Adjust Timeout Configuration (ETA: 2 minutes)**

If timeouts are too aggressive, increase the threshold:

```bash
# Update coordinator deployment with higher timeout (3600000 ms = 60 minutes)
kubectl patch deployment coordinator -n cfn-loop -p \
  '{"spec":{"template":{"spec":{"containers":[{"name":"coordinator","env":[{"name":"BLOCKING_TIMEOUT_MS","value":"3600000"}]}]}}}}'

# Verify rollout
kubectl rollout status deployment/coordinator -n cfn-loop
```

### Validation

Confirm recovery with these checks:

- [ ] Stuck coordinator unblocked or terminated
- [ ] New coordinator pod running if terminated
- [ ] Blocking duration P99 returns to normal (<300s)
- [ ] No timeout events in last 10 minutes
- [ ] External dependencies responding within SLA
- [ ] No database locks held >30s

### Prevention

Implement these measures to prevent future timeout events:

```bash
# 1. Add timeout monitoring dashboard
kubectl apply -f monitoring/dashboards/blocking-timeout-dashboard.yaml

# 2. Configure timeout alerts with graduated severity
kubectl apply -f monitoring/alerts/timeout-alerts.yaml

# 3. Enable timeout hook for automatic recovery
kubectl set env deployment/coordinator -n cfn-loop \
  ENABLE_TIMEOUT_HOOKS=true
```

---

## 5. Cleanup Script Failures

### Symptoms

- Stale coordinator state not removed from Redis
- Heartbeat keys remain after coordinator death
- ACK keys accumulate without TTL expiration
- Manual inspection shows orphaned keys

### Detection

**Manual Verification**:
```bash
# Check for stale heartbeat keys (>10 minutes old)
redis-cli -h redis.prod.example.com --scan --pattern "blocking:heartbeat:*" | while read -r key; do
  HEARTBEAT=$(redis-cli -h redis.prod.example.com get "$key")
  TIMESTAMP=$(echo "$HEARTBEAT" | jq -r .timestamp)
  # timestamp is in milliseconds; convert to seconds before subtracting
  AGE=$(($(date +%s) - ($TIMESTAMP / 1000)))
  if [ "$AGE" -gt 600 ]; then
    echo "Stale: $key (age: ${AGE}s)"
  fi
done

# Check cleanup script logs
kubectl logs -n cfn-loop cleanup-cron --tail=100 | grep "ERROR"
```

### Recovery Procedure

**Step 1: Run Cleanup Script with Dry-Run (ETA: 1 minute)**

```bash
# Verify cleanup logic without deleting keys
kubectl exec -n cfn-loop cleanup-cron -- \
  /scripts/cleanup-blocking-state.sh --dry-run

# Review keys that would be deleted
kubectl logs -n cfn-loop cleanup-cron | grep "Would delete"
```

**Step 2: Execute Cleanup (ETA: 2 minutes)**

```bash
# Run cleanup script
kubectl exec -n cfn-loop cleanup-cron -- \
  /scripts/cleanup-blocking-state.sh --execute

# Monitor progress
kubectl logs -n cfn-loop cleanup-cron -f
```

**Step 3: Manual Cleanup (if script fails) (ETA: 5 minutes)**

```bash
# Delete stale heartbeat keys only (>10 minutes old, the Detection threshold),
# so heartbeats of live coordinators are left in place
redis-cli -h redis.prod.example.com --scan --pattern "blocking:heartbeat:*" | \
  while read -r key; do
    TIMESTAMP=$(redis-cli -h redis.prod.example.com get "$key" | jq -r .timestamp)
    AGE=$(($(date +%s) - ($TIMESTAMP / 1000)))
    if [ "$AGE" -gt 600 ]; then
      redis-cli -h redis.prod.example.com del "$key"
    fi
  done

# Delete ACK keys that never expire (a TTL of -1 means no expiry is set)
redis-cli -h redis.prod.example.com --scan --pattern "blocking:ack:*" | \
  while read -r key; do
    TTL=$(redis-cli -h redis.prod.example.com ttl "$key")
    if [ "$TTL" -eq -1 ]; then
      redis-cli -h redis.prod.example.com del "$key"
    fi
  done

# Delete orphaned idempotency keys
# Caution: this removes every key matching the pattern, not only orphaned ones
redis-cli -h redis.prod.example.com --scan --pattern "blocking:idempotency:*" | \
  xargs redis-cli -h redis.prod.example.com del
```
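
The ACK cleanup above hinges on the return value of the Redis `TTL` command, which uses two negative sentinels: `-2` when the key does not exist and `-1` when the key exists without an expiry. A small helper like the one below (the name `classify_ttl` is hypothetical) makes that distinction explicit when reviewing keys before deleting them.

```bash
# Interpret the integer returned by the Redis TTL command.
classify_ttl() {
  case "$1" in
    -2) echo "missing" ;;      # key does not exist (already expired or deleted)
    -1) echo "persistent" ;;   # key exists but has no expiry set
    *)  echo "expiring" ;;     # key will expire in $1 seconds
  esac
}
```

Only keys classified as `persistent` match the "ACK keys accumulate without TTL expiration" symptom; keys reported as `missing` were removed between the scan and the `ttl` call and need no action.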

### Validation

Confirm cleanup with these checks:

- [ ] No stale heartbeat keys (all <5 minutes old)
- [ ] All ACK keys have TTL set
- [ ] No orphaned idempotency keys
- [ ] Cleanup script logs show success
- [ ] Redis memory usage decreased

---

## Post-Incident Review Template

After resolving any incident, complete this post-mortem:

```markdown
# Incident Post-Mortem: [Incident Name]

**Date**: YYYY-MM-DD
**Duration**: X hours Y minutes
**Severity**: P0 / P1 / P2
**Responders**: [Names]

## Timeline

- HH:MM - Detection: [How was it detected?]
- HH:MM - Triage: [Initial assessment]
- HH:MM - Mitigation: [What was done?]
- HH:MM - Resolution: [When was service restored?]

## Root Cause

[Technical explanation of what caused the incident]

## Impact

- **Coordinators Affected**: X
- **Signals Lost**: Y
- **Data Loss**: None / Minimal / Moderate / Severe
- **Customer Impact**: None / Low / Medium / High

## What Went Well

- [Positive aspects of the response]

## What Went Wrong

- [Areas for improvement]

## Action Items

- [ ] [Action item 1] - Owner: [Name] - Due: [Date]
- [ ] [Action item 2] - Owner: [Name] - Due: [Date]

## Preventive Measures

- [How to prevent this in the future]
```

---

**Next Steps**:
- Review [Monitoring Runbook](./monitoring-runbook.md) for alert response procedures
- See [Blocking Coordination Pattern Guide](../patterns/blocking-coordination-pattern.md) for architectural details
- Check [Integration Examples](../integration/cfn-loop-examples.md) for implementation code