agentic-flow 1.0.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude/agents/MIGRATION_SUMMARY.md +222 -0
- package/.claude/agents/README.md +89 -0
- package/.claude/agents/analysis/code-analyzer.md +209 -0
- package/.claude/agents/analysis/code-review/analyze-code-quality.md +180 -0
- package/.claude/agents/architecture/system-design/arch-system-design.md +156 -0
- package/.claude/agents/base-template-generator.md +42 -0
- package/.claude/agents/consensus/README.md +253 -0
- package/.claude/agents/consensus/byzantine-coordinator.md +63 -0
- package/.claude/agents/consensus/crdt-synchronizer.md +997 -0
- package/.claude/agents/consensus/gossip-coordinator.md +63 -0
- package/.claude/agents/consensus/performance-benchmarker.md +851 -0
- package/.claude/agents/consensus/quorum-manager.md +823 -0
- package/.claude/agents/consensus/raft-manager.md +63 -0
- package/.claude/agents/consensus/security-manager.md +622 -0
- package/.claude/agents/core/coder.md +211 -0
- package/.claude/agents/core/planner.md +116 -0
- package/.claude/agents/core/researcher.md +136 -0
- package/.claude/agents/core/reviewer.md +272 -0
- package/.claude/agents/core/tester.md +266 -0
- package/.claude/agents/data/ml/data-ml-model.md +193 -0
- package/.claude/agents/development/backend/dev-backend-api.md +142 -0
- package/.claude/agents/devops/ci-cd/ops-cicd-github.md +164 -0
- package/.claude/agents/documentation/api-docs/docs-api-openapi.md +174 -0
- package/.claude/agents/flow-nexus/app-store.md +88 -0
- package/.claude/agents/flow-nexus/authentication.md +69 -0
- package/.claude/agents/flow-nexus/challenges.md +81 -0
- package/.claude/agents/flow-nexus/neural-network.md +88 -0
- package/.claude/agents/flow-nexus/payments.md +83 -0
- package/.claude/agents/flow-nexus/sandbox.md +76 -0
- package/.claude/agents/flow-nexus/swarm.md +76 -0
- package/.claude/agents/flow-nexus/user-tools.md +96 -0
- package/.claude/agents/flow-nexus/workflow.md +84 -0
- package/.claude/agents/github/code-review-swarm.md +538 -0
- package/.claude/agents/github/github-modes.md +173 -0
- package/.claude/agents/github/issue-tracker.md +319 -0
- package/.claude/agents/github/multi-repo-swarm.md +553 -0
- package/.claude/agents/github/pr-manager.md +191 -0
- package/.claude/agents/github/project-board-sync.md +509 -0
- package/.claude/agents/github/release-manager.md +367 -0
- package/.claude/agents/github/release-swarm.md +583 -0
- package/.claude/agents/github/repo-architect.md +398 -0
- package/.claude/agents/github/swarm-issue.md +573 -0
- package/.claude/agents/github/swarm-pr.md +428 -0
- package/.claude/agents/github/sync-coordinator.md +452 -0
- package/.claude/agents/github/workflow-automation.md +635 -0
- package/.claude/agents/goal/agent.md +816 -0
- package/.claude/agents/goal/goal-planner.md +73 -0
- package/.claude/agents/optimization/README.md +250 -0
- package/.claude/agents/optimization/benchmark-suite.md +665 -0
- package/.claude/agents/optimization/load-balancer.md +431 -0
- package/.claude/agents/optimization/performance-monitor.md +672 -0
- package/.claude/agents/optimization/resource-allocator.md +674 -0
- package/.claude/agents/optimization/topology-optimizer.md +808 -0
- package/.claude/agents/payments/agentic-payments.md +126 -0
- package/.claude/agents/sparc/architecture.md +472 -0
- package/.claude/agents/sparc/pseudocode.md +318 -0
- package/.claude/agents/sparc/refinement.md +525 -0
- package/.claude/agents/sparc/specification.md +276 -0
- package/.claude/agents/specialized/mobile/spec-mobile-react-native.md +226 -0
- package/.claude/agents/sublinear/consensus-coordinator.md +338 -0
- package/.claude/agents/sublinear/matrix-optimizer.md +185 -0
- package/.claude/agents/sublinear/pagerank-analyzer.md +299 -0
- package/.claude/agents/sublinear/performance-optimizer.md +368 -0
- package/.claude/agents/sublinear/trading-predictor.md +246 -0
- package/.claude/agents/swarm/README.md +190 -0
- package/.claude/agents/swarm/adaptive-coordinator.md +396 -0
- package/.claude/agents/swarm/hierarchical-coordinator.md +256 -0
- package/.claude/agents/swarm/mesh-coordinator.md +392 -0
- package/.claude/agents/templates/automation-smart-agent.md +205 -0
- package/.claude/agents/templates/coordinator-swarm-init.md +90 -0
- package/.claude/agents/templates/github-pr-manager.md +177 -0
- package/.claude/agents/templates/implementer-sparc-coder.md +259 -0
- package/.claude/agents/templates/memory-coordinator.md +187 -0
- package/.claude/agents/templates/migration-plan.md +746 -0
- package/.claude/agents/templates/orchestrator-task.md +139 -0
- package/.claude/agents/templates/performance-analyzer.md +199 -0
- package/.claude/agents/templates/sparc-coordinator.md +183 -0
- package/.claude/agents/test-neural.md +14 -0
- package/.claude/agents/testing/unit/tdd-london-swarm.md +244 -0
- package/.claude/agents/testing/validation/production-validator.md +395 -0
- package/.claude/commands/agents/README.md +10 -0
- package/.claude/commands/agents/agent-capabilities.md +21 -0
- package/.claude/commands/agents/agent-coordination.md +28 -0
- package/.claude/commands/agents/agent-spawning.md +28 -0
- package/.claude/commands/agents/agent-types.md +26 -0
- package/.claude/commands/analysis/COMMAND_COMPLIANCE_REPORT.md +54 -0
- package/.claude/commands/analysis/README.md +9 -0
- package/.claude/commands/analysis/bottleneck-detect.md +162 -0
- package/.claude/commands/analysis/performance-bottlenecks.md +59 -0
- package/.claude/commands/analysis/performance-report.md +25 -0
- package/.claude/commands/analysis/token-efficiency.md +45 -0
- package/.claude/commands/analysis/token-usage.md +25 -0
- package/.claude/commands/automation/README.md +9 -0
- package/.claude/commands/automation/auto-agent.md +122 -0
- package/.claude/commands/automation/self-healing.md +106 -0
- package/.claude/commands/automation/session-memory.md +90 -0
- package/.claude/commands/automation/smart-agents.md +73 -0
- package/.claude/commands/automation/smart-spawn.md +25 -0
- package/.claude/commands/automation/workflow-select.md +25 -0
- package/.claude/commands/claude-flow-help.md +103 -0
- package/.claude/commands/claude-flow-memory.md +107 -0
- package/.claude/commands/claude-flow-swarm.md +205 -0
- package/.claude/commands/coordination/README.md +9 -0
- package/.claude/commands/coordination/agent-spawn.md +25 -0
- package/.claude/commands/coordination/init.md +44 -0
- package/.claude/commands/coordination/orchestrate.md +43 -0
- package/.claude/commands/coordination/spawn.md +45 -0
- package/.claude/commands/coordination/swarm-init.md +85 -0
- package/.claude/commands/coordination/task-orchestrate.md +25 -0
- package/.claude/commands/flow-nexus/app-store.md +124 -0
- package/.claude/commands/flow-nexus/challenges.md +120 -0
- package/.claude/commands/flow-nexus/login-registration.md +65 -0
- package/.claude/commands/flow-nexus/neural-network.md +134 -0
- package/.claude/commands/flow-nexus/payments.md +116 -0
- package/.claude/commands/flow-nexus/sandbox.md +83 -0
- package/.claude/commands/flow-nexus/swarm.md +87 -0
- package/.claude/commands/flow-nexus/user-tools.md +152 -0
- package/.claude/commands/flow-nexus/workflow.md +115 -0
- package/.claude/commands/github/README.md +11 -0
- package/.claude/commands/github/code-review-swarm.md +514 -0
- package/.claude/commands/github/code-review.md +25 -0
- package/.claude/commands/github/github-modes.md +147 -0
- package/.claude/commands/github/github-swarm.md +121 -0
- package/.claude/commands/github/issue-tracker.md +292 -0
- package/.claude/commands/github/issue-triage.md +25 -0
- package/.claude/commands/github/multi-repo-swarm.md +519 -0
- package/.claude/commands/github/pr-enhance.md +26 -0
- package/.claude/commands/github/pr-manager.md +170 -0
- package/.claude/commands/github/project-board-sync.md +471 -0
- package/.claude/commands/github/release-manager.md +338 -0
- package/.claude/commands/github/release-swarm.md +544 -0
- package/.claude/commands/github/repo-analyze.md +25 -0
- package/.claude/commands/github/repo-architect.md +367 -0
- package/.claude/commands/github/swarm-issue.md +482 -0
- package/.claude/commands/github/swarm-pr.md +285 -0
- package/.claude/commands/github/sync-coordinator.md +301 -0
- package/.claude/commands/github/workflow-automation.md +442 -0
- package/.claude/commands/hive-mind/README.md +17 -0
- package/.claude/commands/hive-mind/hive-mind-consensus.md +8 -0
- package/.claude/commands/hive-mind/hive-mind-init.md +18 -0
- package/.claude/commands/hive-mind/hive-mind-memory.md +8 -0
- package/.claude/commands/hive-mind/hive-mind-metrics.md +8 -0
- package/.claude/commands/hive-mind/hive-mind-resume.md +8 -0
- package/.claude/commands/hive-mind/hive-mind-sessions.md +8 -0
- package/.claude/commands/hive-mind/hive-mind-spawn.md +21 -0
- package/.claude/commands/hive-mind/hive-mind-status.md +8 -0
- package/.claude/commands/hive-mind/hive-mind-stop.md +8 -0
- package/.claude/commands/hive-mind/hive-mind-wizard.md +8 -0
- package/.claude/commands/hive-mind/hive-mind.md +27 -0
- package/.claude/commands/hooks/README.md +11 -0
- package/.claude/commands/hooks/overview.md +58 -0
- package/.claude/commands/hooks/post-edit.md +117 -0
- package/.claude/commands/hooks/post-task.md +112 -0
- package/.claude/commands/hooks/pre-edit.md +113 -0
- package/.claude/commands/hooks/pre-task.md +111 -0
- package/.claude/commands/hooks/session-end.md +118 -0
- package/.claude/commands/hooks/setup.md +103 -0
- package/.claude/commands/memory/README.md +9 -0
- package/.claude/commands/memory/memory-persist.md +25 -0
- package/.claude/commands/memory/memory-search.md +25 -0
- package/.claude/commands/memory/memory-usage.md +25 -0
- package/.claude/commands/memory/neural.md +47 -0
- package/.claude/commands/memory/usage.md +46 -0
- package/.claude/commands/monitoring/README.md +9 -0
- package/.claude/commands/monitoring/agent-metrics.md +25 -0
- package/.claude/commands/monitoring/agents.md +44 -0
- package/.claude/commands/monitoring/real-time-view.md +25 -0
- package/.claude/commands/monitoring/status.md +46 -0
- package/.claude/commands/monitoring/swarm-monitor.md +25 -0
- package/.claude/commands/optimization/README.md +9 -0
- package/.claude/commands/optimization/auto-topology.md +62 -0
- package/.claude/commands/optimization/cache-manage.md +25 -0
- package/.claude/commands/optimization/parallel-execute.md +25 -0
- package/.claude/commands/optimization/parallel-execution.md +50 -0
- package/.claude/commands/optimization/topology-optimize.md +25 -0
- package/.claude/commands/pair/README.md +261 -0
- package/.claude/commands/pair/commands.md +546 -0
- package/.claude/commands/pair/config.md +510 -0
- package/.claude/commands/pair/examples.md +512 -0
- package/.claude/commands/pair/modes.md +348 -0
- package/.claude/commands/pair/session.md +407 -0
- package/.claude/commands/pair/start.md +209 -0
- package/.claude/commands/sparc/analyzer.md +52 -0
- package/.claude/commands/sparc/architect.md +53 -0
- package/.claude/commands/sparc/ask.md +97 -0
- package/.claude/commands/sparc/batch-executor.md +54 -0
- package/.claude/commands/sparc/code.md +89 -0
- package/.claude/commands/sparc/coder.md +54 -0
- package/.claude/commands/sparc/debug.md +83 -0
- package/.claude/commands/sparc/debugger.md +54 -0
- package/.claude/commands/sparc/designer.md +53 -0
- package/.claude/commands/sparc/devops.md +109 -0
- package/.claude/commands/sparc/docs-writer.md +80 -0
- package/.claude/commands/sparc/documenter.md +54 -0
- package/.claude/commands/sparc/innovator.md +54 -0
- package/.claude/commands/sparc/integration.md +83 -0
- package/.claude/commands/sparc/mcp.md +117 -0
- package/.claude/commands/sparc/memory-manager.md +54 -0
- package/.claude/commands/sparc/optimizer.md +54 -0
- package/.claude/commands/sparc/orchestrator.md +132 -0
- package/.claude/commands/sparc/post-deployment-monitoring-mode.md +83 -0
- package/.claude/commands/sparc/refinement-optimization-mode.md +83 -0
- package/.claude/commands/sparc/researcher.md +54 -0
- package/.claude/commands/sparc/reviewer.md +54 -0
- package/.claude/commands/sparc/security-review.md +80 -0
- package/.claude/commands/sparc/sparc-modes.md +174 -0
- package/.claude/commands/sparc/sparc.md +111 -0
- package/.claude/commands/sparc/spec-pseudocode.md +80 -0
- package/.claude/commands/sparc/supabase-admin.md +348 -0
- package/.claude/commands/sparc/swarm-coordinator.md +54 -0
- package/.claude/commands/sparc/tdd.md +54 -0
- package/.claude/commands/sparc/tester.md +54 -0
- package/.claude/commands/sparc/tutorial.md +79 -0
- package/.claude/commands/sparc/workflow-manager.md +54 -0
- package/.claude/commands/sparc.md +166 -0
- package/.claude/commands/stream-chain/pipeline.md +121 -0
- package/.claude/commands/stream-chain/run.md +70 -0
- package/.claude/commands/swarm/README.md +15 -0
- package/.claude/commands/swarm/analysis.md +95 -0
- package/.claude/commands/swarm/development.md +96 -0
- package/.claude/commands/swarm/examples.md +168 -0
- package/.claude/commands/swarm/maintenance.md +102 -0
- package/.claude/commands/swarm/optimization.md +117 -0
- package/.claude/commands/swarm/research.md +136 -0
- package/.claude/commands/swarm/swarm-analysis.md +8 -0
- package/.claude/commands/swarm/swarm-background.md +8 -0
- package/.claude/commands/swarm/swarm-init.md +19 -0
- package/.claude/commands/swarm/swarm-modes.md +8 -0
- package/.claude/commands/swarm/swarm-monitor.md +8 -0
- package/.claude/commands/swarm/swarm-spawn.md +19 -0
- package/.claude/commands/swarm/swarm-status.md +8 -0
- package/.claude/commands/swarm/swarm-strategies.md +8 -0
- package/.claude/commands/swarm/swarm.md +27 -0
- package/.claude/commands/swarm/testing.md +131 -0
- package/.claude/commands/training/README.md +9 -0
- package/.claude/commands/training/model-update.md +25 -0
- package/.claude/commands/training/neural-patterns.md +74 -0
- package/.claude/commands/training/neural-train.md +25 -0
- package/.claude/commands/training/pattern-learn.md +25 -0
- package/.claude/commands/training/specialization.md +63 -0
- package/.claude/commands/truth/start.md +143 -0
- package/.claude/commands/verify/check.md +50 -0
- package/.claude/commands/verify/start.md +128 -0
- package/.claude/commands/workflows/README.md +9 -0
- package/.claude/commands/workflows/development.md +78 -0
- package/.claude/commands/workflows/research.md +63 -0
- package/.claude/commands/workflows/workflow-create.md +25 -0
- package/.claude/commands/workflows/workflow-execute.md +25 -0
- package/.claude/commands/workflows/workflow-export.md +25 -0
- package/.claude/helpers/checkpoint-manager.sh +251 -0
- package/.claude/helpers/github-safe.js +106 -0
- package/.claude/helpers/github-setup.sh +28 -0
- package/.claude/helpers/quick-start.sh +19 -0
- package/.claude/helpers/setup-mcp.sh +18 -0
- package/.claude/helpers/standard-checkpoint-hooks.sh +179 -0
- package/.claude/mcp.json +13 -0
- package/.claude/settings-backup.json +130 -0
- package/.claude/settings-optimized.json +116 -0
- package/.claude/settings-simple.json +78 -0
- package/.claude/settings.json +114 -0
- package/.claude/settings.local.json +14 -0
- package/README.md +1280 -0
- package/dist/agents/claudeAgent.js +73 -0
- package/dist/agents/claudeFlowAgent.js +115 -0
- package/dist/agents/codeReviewAgent.js +34 -0
- package/dist/agents/dataAgent.js +34 -0
- package/dist/agents/directApiAgent.js +260 -0
- package/dist/agents/webResearchAgent.js +35 -0
- package/dist/cli/mcp.js +135 -0
- package/dist/cli-proxy.js +246 -0
- package/dist/cli.js +158 -0
- package/dist/config/claudeFlow.js +67 -0
- package/dist/config/tools.js +33 -0
- package/dist/coordination/parallelSwarm.js +226 -0
- package/dist/examples/multi-agent-orchestration.js +45 -0
- package/dist/examples/parallel-swarm-deployment.js +171 -0
- package/dist/examples/use-goal-planner.js +52 -0
- package/dist/health.js +46 -0
- package/dist/index-with-proxy.js +101 -0
- package/dist/index.js +167 -0
- package/dist/mcp/claudeFlowSdkServer.js +202 -0
- package/dist/mcp/fastmcp/servers/claude-flow-sdk.js +198 -0
- package/dist/mcp/fastmcp/servers/http-streaming-updated.js +421 -0
- package/dist/mcp/fastmcp/servers/poc-stdio.js +82 -0
- package/dist/mcp/fastmcp/servers/stdio-full.js +421 -0
- package/dist/mcp/fastmcp/tools/agent/add-agent.js +107 -0
- package/dist/mcp/fastmcp/tools/agent/add-command.js +117 -0
- package/dist/mcp/fastmcp/tools/agent/execute.js +56 -0
- package/dist/mcp/fastmcp/tools/agent/list.js +82 -0
- package/dist/mcp/fastmcp/tools/agent/parallel.js +63 -0
- package/dist/mcp/fastmcp/tools/memory/retrieve.js +38 -0
- package/dist/mcp/fastmcp/tools/memory/search.js +41 -0
- package/dist/mcp/fastmcp/tools/memory/store.js +56 -0
- package/dist/mcp/fastmcp/tools/swarm/init.js +41 -0
- package/dist/mcp/fastmcp/tools/swarm/orchestrate.js +47 -0
- package/dist/mcp/fastmcp/tools/swarm/spawn.js +40 -0
- package/dist/mcp/fastmcp/types/index.js +2 -0
- package/dist/proxy/anthropic-to-openrouter.js +246 -0
- package/dist/router/providers/anthropic.js +89 -0
- package/dist/router/providers/onnx-local-optimized.js +167 -0
- package/dist/router/providers/onnx-local.js +294 -0
- package/dist/router/providers/onnx-phi4.js +190 -0
- package/dist/router/providers/onnx.js +242 -0
- package/dist/router/providers/openrouter.js +242 -0
- package/dist/router/router.js +283 -0
- package/dist/router/test-integration.js +140 -0
- package/dist/router/test-onnx-benchmark.js +145 -0
- package/dist/router/test-onnx-integration.js +128 -0
- package/dist/router/test-onnx-local.js +37 -0
- package/dist/router/test-onnx.js +148 -0
- package/dist/router/test-openrouter.js +121 -0
- package/dist/router/test-phi4.js +137 -0
- package/dist/router/types.js +2 -0
- package/dist/utils/agentLoader.js +106 -0
- package/dist/utils/cli.js +128 -0
- package/dist/utils/logger.js +41 -0
- package/dist/utils/mcpCommands.js +214 -0
- package/dist/utils/model-downloader.js +182 -0
- package/dist/utils/retry.js +54 -0
- package/docs/.claude-flow/metrics/agent-metrics.json +1 -0
- package/docs/.claude-flow/metrics/performance.json +9 -0
- package/docs/.claude-flow/metrics/task-metrics.json +10 -0
- package/docs/CHANGELOG.md +155 -0
- package/docs/CLAUDE.md +352 -0
- package/docs/COMPLETE_VALIDATION_SUMMARY.md +405 -0
- package/docs/INDEX.md +183 -0
- package/docs/LICENSE +21 -0
- package/docs/ONNX_CLI_USAGE.md +344 -0
- package/docs/ONNX_ENV_VARS.md +564 -0
- package/docs/ONNX_INTEGRATION.md +422 -0
- package/docs/ONNX_OPTIMIZATION_GUIDE.md +665 -0
- package/docs/ONNX_OPTIMIZATION_SUMMARY.md +374 -0
- package/docs/ONNX_VS_CLAUDE_QUALITY.md +442 -0
- package/docs/OPENROUTER_DEPLOYMENT.md +495 -0
- package/docs/architecture/EXECUTIVE_SUMMARY.md +310 -0
- package/docs/architecture/IMPROVEMENT_PLAN.md +11 -0
- package/docs/architecture/INTEGRATION-STATUS.md +290 -0
- package/docs/architecture/MULTI_MODEL_ROUTER_PLAN.md +620 -0
- package/docs/architecture/QUICK_WINS.md +333 -0
- package/docs/architecture/README.md +15 -0
- package/docs/architecture/RESEARCH_SUMMARY.md +652 -0
- package/docs/archived/FASTMCP_COMPLETE.md +428 -0
- package/docs/archived/FASTMCP_INTEGRATION_STATUS.md +288 -0
- package/docs/archived/FLOW-NEXUS-COMPLETE.md +269 -0
- package/docs/archived/INTEGRATION_CONFIRMED.md +351 -0
- package/docs/archived/ONNX_FINAL_REPORT.md +312 -0
- package/docs/archived/ONNX_IMPLEMENTATION_COMPLETE.md +215 -0
- package/docs/archived/ONNX_IMPLEMENTATION_SUMMARY.md +197 -0
- package/docs/archived/ONNX_SUCCESS_REPORT.md +271 -0
- package/docs/archived/OPENROUTER_PROXY_COMPLETE.md +494 -0
- package/docs/archived/PACKAGE-COMPLETE.md +138 -0
- package/docs/archived/README.md +27 -0
- package/docs/archived/RESEARCH_COMPLETE.txt +335 -0
- package/docs/archived/SDK-SETUP-COMPLETE.md +252 -0
- package/docs/guides/ALTERNATIVE_LLM_MODELS.md +524 -0
- package/docs/guides/DOCKER_AGENT_USAGE.md +352 -0
- package/docs/guides/IMPLEMENTATION_EXAMPLES.md +960 -0
- package/docs/guides/NPM-PUBLISH.md +218 -0
- package/docs/guides/README.md +17 -0
- package/docs/guides/agent-sdk.md +234 -0
- package/docs/integrations/CLAUDE_AGENTS_INTEGRATION.md +356 -0
- package/docs/integrations/CLAUDE_FLOW_INTEGRATION.md +535 -0
- package/docs/integrations/FASTMCP_CLI_INTEGRATION.md +503 -0
- package/docs/integrations/FLOW-NEXUS-INTEGRATION.md +319 -0
- package/docs/integrations/README.md +18 -0
- package/docs/integrations/fastmcp-implementation-plan.md +2516 -0
- package/docs/integrations/fastmcp-poc-integration.md +198 -0
- package/docs/router/ONNX_PHI4_RESEARCH.md +220 -0
- package/docs/router/ONNX_RUNTIME_INTEGRATION_PLAN.md +866 -0
- package/docs/router/PHI4_HYPEROPTIMIZATION_PLAN.md +2488 -0
- package/docs/router/README.md +552 -0
- package/docs/router/ROUTER_CONFIG_REFERENCE.md +577 -0
- package/docs/router/ROUTER_USER_GUIDE.md +865 -0
- package/docs/validation/DOCKER_MCP_VALIDATION.md +358 -0
- package/docs/validation/DOCKER_OPENROUTER_VALIDATION.md +443 -0
- package/docs/validation/FINAL_SYSTEM_VALIDATION.md +458 -0
- package/docs/validation/FINAL_VALIDATION_SUMMARY.md +409 -0
- package/docs/validation/MCP_CLI_TOOLS_VALIDATION.md +266 -0
- package/docs/validation/MODEL_VALIDATION_REPORT.md +386 -0
- package/docs/validation/OPENROUTER_VALIDATION_COMPLETE.md +382 -0
- package/docs/validation/README.md +20 -0
- package/docs/validation/ROUTER_VALIDATION.md +311 -0
- package/package.json +140 -0
|
@@ -0,0 +1,333 @@
|
|
|
1
|
+
# Quick Wins - Immediate Improvements (Week 1)
|
|
2
|
+
|
|
3
|
+
These are the highest-impact changes that can be implemented immediately to 10x the capabilities of our Claude Agent SDK implementation.
|
|
4
|
+
|
|
5
|
+
## Priority 1: Tool Integration (2 hours)
|
|
6
|
+
|
|
7
|
+
**Impact**: Agents go from "just talking" to "actually doing"
|
|
8
|
+
|
|
9
|
+
### Before
|
|
10
|
+
```typescript
|
|
11
|
+
export async function webResearchAgent(input: string) {
|
|
12
|
+
const result = query({
|
|
13
|
+
prompt: input,
|
|
14
|
+
options: {
|
|
15
|
+
systemPrompt: `You perform fast web-style reconnaissance...`
|
|
16
|
+
// NO TOOLS = Agent can only generate text
|
|
17
|
+
}
|
|
18
|
+
});
|
|
19
|
+
}
|
|
20
|
+
```
|
|
21
|
+
|
|
22
|
+
### After
|
|
23
|
+
```typescript
|
|
24
|
+
export async function webResearchAgent(input: string) {
|
|
25
|
+
const result = query({
|
|
26
|
+
prompt: input,
|
|
27
|
+
options: {
|
|
28
|
+
systemPrompt: `You perform fast web-style reconnaissance and return a concise bullet list of findings.`,
|
|
29
|
+
allowedTools: [
|
|
30
|
+
'WebSearch', // Can actually search the web
|
|
31
|
+
'WebFetch', // Can fetch and analyze web pages
|
|
32
|
+
'FileRead', // Can read existing research
|
|
33
|
+
'FileWrite' // Can save findings
|
|
34
|
+
],
|
|
35
|
+
maxTurns: 20 // Allow iterative research
|
|
36
|
+
}
|
|
37
|
+
});
|
|
38
|
+
}
|
|
39
|
+
```
|
|
40
|
+
|
|
41
|
+
**Result**: Agent can now perform real web research instead of hallucinating
|
|
42
|
+
|
|
43
|
+
---
|
|
44
|
+
|
|
45
|
+
## Priority 2: Streaming Responses (1 hour)
|
|
46
|
+
|
|
47
|
+
**Impact**: 5-10x better perceived performance
|
|
48
|
+
|
|
49
|
+
### Before
|
|
50
|
+
```typescript
|
|
51
|
+
// Buffers entire response before showing anything
|
|
52
|
+
let output = '';
|
|
53
|
+
for await (const msg of result) {
|
|
54
|
+
if (msg.type === 'assistant') {
|
|
55
|
+
output += msg.message.content?.map((c: any) => c.type === 'text' ? c.text : '').join('');
|
|
56
|
+
}
|
|
57
|
+
}
|
|
58
|
+
return { output }; // Wait until completely done
|
|
59
|
+
```
|
|
60
|
+
|
|
61
|
+
### After
|
|
62
|
+
```typescript
|
|
63
|
+
// Show progress in real-time
|
|
64
|
+
for await (const msg of result) {
|
|
65
|
+
if (msg.type === 'stream_event') {
|
|
66
|
+
// Real-time streaming
|
|
67
|
+
process.stdout.write(extractText(msg.event));
|
|
68
|
+
} else if (msg.type === 'assistant') {
|
|
69
|
+
// Complete response
|
|
70
|
+
console.log('\n[COMPLETE]');
|
|
71
|
+
}
|
|
72
|
+
}
|
|
73
|
+
```
|
|
74
|
+
|
|
75
|
+
**Result**: Users see progress immediately instead of waiting 30+ seconds
|
|
76
|
+
|
|
77
|
+
---
|
|
78
|
+
|
|
79
|
+
## Priority 3: Error Handling (2 hours)
|
|
80
|
+
|
|
81
|
+
**Impact**: 99% → 99.9% reliability
|
|
82
|
+
|
|
83
|
+
### Before
|
|
84
|
+
```typescript
|
|
85
|
+
// Silent failures
|
|
86
|
+
const [researchOut, reviewOut, dataOut] = await Promise.all([
|
|
87
|
+
webResearchAgent(`Give me context and risks about: ${topic}`),
|
|
88
|
+
// If this fails, entire orchestration fails
|
|
89
|
+
codeReviewAgent(`Review this diff...`),
|
|
90
|
+
dataAgent(`Analyze ${datasetHint}...`)
|
|
91
|
+
]);
|
|
92
|
+
```
|
|
93
|
+
|
|
94
|
+
### After
|
|
95
|
+
```typescript
|
|
96
|
+
// Resilient execution with fallbacks
|
|
97
|
+
const [researchOut, reviewOut, dataOut] = await Promise.allSettled([
|
|
98
|
+
withRetry(() => webResearchAgent(`...`), 3),
|
|
99
|
+
withRetry(() => codeReviewAgent(`...`), 3),
|
|
100
|
+
withRetry(() => dataAgent(`...`), 3)
|
|
101
|
+
]);
|
|
102
|
+
|
|
103
|
+
const results = {
|
|
104
|
+
research: researchOut.status === 'fulfilled' ? researchOut.value : null,
|
|
105
|
+
review: reviewOut.status === 'fulfilled' ? reviewOut.value : null,
|
|
106
|
+
data: dataOut.status === 'fulfilled' ? dataOut.value : null
|
|
107
|
+
};
|
|
108
|
+
|
|
109
|
+
// Continue with partial results
|
|
110
|
+
if (!results.research && !results.review && !results.data) {
|
|
111
|
+
throw new Error('All agents failed');
|
|
112
|
+
}
|
|
113
|
+
|
|
114
|
+
// Generate summary from available results
|
|
115
|
+
```
|
|
116
|
+
|
|
117
|
+
**Result**: System continues working even if 1-2 agents fail
|
|
118
|
+
|
|
119
|
+
---
|
|
120
|
+
|
|
121
|
+
## Priority 4: Basic Logging (1 hour)
|
|
122
|
+
|
|
123
|
+
**Impact**: 10x faster debugging
|
|
124
|
+
|
|
125
|
+
### Before
|
|
126
|
+
```typescript
|
|
127
|
+
// No visibility into what's happening
|
|
128
|
+
await Promise.all([
|
|
129
|
+
webResearchAgent(`...`),
|
|
130
|
+
codeReviewAgent(`...`),
|
|
131
|
+
dataAgent(`...`)
|
|
132
|
+
]);
|
|
133
|
+
```
|
|
134
|
+
|
|
135
|
+
### After
|
|
136
|
+
```typescript
|
|
137
|
+
import winston from 'winston';
|
|
138
|
+
|
|
139
|
+
const logger = winston.createLogger({
|
|
140
|
+
level: 'info',
|
|
141
|
+
format: winston.format.json(),
|
|
142
|
+
transports: [
|
|
143
|
+
new winston.transports.Console(),
|
|
144
|
+
new winston.transports.File({ filename: 'agents.log' })
|
|
145
|
+
]
|
|
146
|
+
});
|
|
147
|
+
|
|
148
|
+
logger.info('Starting orchestration', { topic, timestamp: Date.now() });
|
|
149
|
+
|
|
150
|
+
const results = await Promise.allSettled([
|
|
151
|
+
loggedExecution('research', () => webResearchAgent(`...`)),
|
|
152
|
+
loggedExecution('review', () => codeReviewAgent(`...`)),
|
|
153
|
+
loggedExecution('data', () => dataAgent(`...`))
|
|
154
|
+
]);
|
|
155
|
+
|
|
156
|
+
logger.info('Orchestration complete', {
|
|
157
|
+
duration: Date.now() - startTime,
|
|
158
|
+
success: results.filter(r => r.status === 'fulfilled').length,
|
|
159
|
+
failed: results.filter(r => r.status === 'rejected').length
|
|
160
|
+
});
|
|
161
|
+
```
|
|
162
|
+
|
|
163
|
+
**Result**: Can debug issues in minutes instead of hours
|
|
164
|
+
|
|
165
|
+
---
|
|
166
|
+
|
|
167
|
+
## Priority 5: Health Check (30 minutes)
|
|
168
|
+
|
|
169
|
+
**Impact**: Know when system is broken
|
|
170
|
+
|
|
171
|
+
### Add to index.ts
|
|
172
|
+
```typescript
|
|
173
|
+
import express from 'express';
|
|
174
|
+
|
|
175
|
+
const app = express();
|
|
176
|
+
|
|
177
|
+
app.get('/health', async (req, res) => {
|
|
178
|
+
const health = {
|
|
179
|
+
status: 'healthy',
|
|
180
|
+
timestamp: new Date().toISOString(),
|
|
181
|
+
anthropic: await checkAnthropicAPI()
|
|
182
|
+
};
|
|
183
|
+
|
|
184
|
+
res.json(health);
|
|
185
|
+
});
|
|
186
|
+
|
|
187
|
+
app.listen(3000, () => {
|
|
188
|
+
logger.info('Health check server started on port 3000');
|
|
189
|
+
});
|
|
190
|
+
|
|
191
|
+
async function checkAnthropicAPI() {
|
|
192
|
+
try {
|
|
193
|
+
const result = query({
|
|
194
|
+
prompt: 'ping',
|
|
195
|
+
options: { maxTurns: 1 }
|
|
196
|
+
});
|
|
197
|
+
|
|
198
|
+
for await (const msg of result) {
|
|
199
|
+
if (msg.type === 'result') {
|
|
200
|
+
return { status: 'ok', error: msg.is_error };
|
|
201
|
+
}
|
|
202
|
+
}
|
|
203
|
+
|
|
204
|
+
return { status: 'ok' };
|
|
205
|
+
} catch (error) {
|
|
206
|
+
return { status: 'error', message: error.message };
|
|
207
|
+
}
|
|
208
|
+
}
|
|
209
|
+
```
|
|
210
|
+
|
|
211
|
+
**Result**: Can monitor system health from orchestration tools
|
|
212
|
+
|
|
213
|
+
---
|
|
214
|
+
|
|
215
|
+
## Implementation Script (6.5 hours total)
|
|
216
|
+
|
|
217
|
+
### Day 1 Morning: Tool Integration (2h)
|
|
218
|
+
```bash
|
|
219
|
+
# 1. Update agent files
|
|
220
|
+
# Add allowedTools to each agent's options
|
|
221
|
+
|
|
222
|
+
# 2. Test
|
|
223
|
+
npm run dev
|
|
224
|
+
# Verify agents can now use tools
|
|
225
|
+
```
|
|
226
|
+
|
|
227
|
+
### Day 1 Afternoon: Streaming + Logging (2h)
|
|
228
|
+
```bash
|
|
229
|
+
# 1. Install dependencies
|
|
230
|
+
npm install winston
|
|
231
|
+
|
|
232
|
+
# 2. Update agent execution to stream
|
|
233
|
+
# 3. Add logging throughout
|
|
234
|
+
|
|
235
|
+
# 4. Test
|
|
236
|
+
npm run dev
|
|
237
|
+
# Verify real-time output and logs
|
|
238
|
+
```
|
|
239
|
+
|
|
240
|
+
### Day 2 Morning: Error Handling (2h)
|
|
241
|
+
```bash
|
|
242
|
+
# 1. Create withRetry utility
|
|
243
|
+
# 2. Replace Promise.all with Promise.allSettled
|
|
244
|
+
# 3. Add error handling in main()
|
|
245
|
+
|
|
246
|
+
# 4. Test failure scenarios
|
|
247
|
+
# - Kill network mid-execution
|
|
248
|
+
# - Invalid API key
|
|
249
|
+
# - Tool errors
|
|
250
|
+
```
|
|
251
|
+
|
|
252
|
+
### Day 2 Afternoon: Health Check (30m)
|
|
253
|
+
```bash
|
|
254
|
+
# 1. Install express
|
|
255
|
+
npm install express @types/express
|
|
256
|
+
|
|
257
|
+
# 2. Add health endpoint
|
|
258
|
+
# 3. Test
|
|
259
|
+
curl http://localhost:3000/health
|
|
260
|
+
```
|
|
261
|
+
|
|
262
|
+
---
|
|
263
|
+
|
|
264
|
+
## Testing the Improvements
|
|
265
|
+
|
|
266
|
+
### Test 1: Real Web Research
|
|
267
|
+
```bash
|
|
268
|
+
TOPIC="Claude Agent SDK best practices 2025" npm run dev
|
|
269
|
+
```
|
|
270
|
+
|
|
271
|
+
**Expected**: Agent uses WebSearch to find actual documentation
|
|
272
|
+
|
|
273
|
+
### Test 2: Resilience
|
|
274
|
+
```bash
|
|
275
|
+
# Set invalid API key for one agent
|
|
276
|
+
ANTHROPIC_API_KEY=invalid npm run dev
|
|
277
|
+
```
|
|
278
|
+
|
|
279
|
+
**Expected**: Other agents continue, partial results returned
|
|
280
|
+
|
|
281
|
+
### Test 3: Streaming
|
|
282
|
+
```bash
|
|
283
|
+
npm run dev | grep -v "^$"
|
|
284
|
+
```
|
|
285
|
+
|
|
286
|
+
**Expected**: See output in real-time, not all at once
|
|
287
|
+
|
|
288
|
+
### Test 4: Monitoring
|
|
289
|
+
```bash
|
|
290
|
+
# Terminal 1
|
|
291
|
+
npm run dev
|
|
292
|
+
|
|
293
|
+
# Terminal 2
|
|
294
|
+
curl http://localhost:3000/health
|
|
295
|
+
tail -f agents.log
|
|
296
|
+
```
|
|
297
|
+
|
|
298
|
+
**Expected**: Health check passes, logs show detailed execution
|
|
299
|
+
|
|
300
|
+
---
|
|
301
|
+
|
|
302
|
+
## Metrics
|
|
303
|
+
|
|
304
|
+
### Before Quick Wins
|
|
305
|
+
- Reliability: 60%
|
|
306
|
+
- Avg Response Time: 45s (perceived)
|
|
307
|
+
- Tools Available: 0
|
|
308
|
+
- Debuggability: Low
|
|
309
|
+
- Monitoring: None
|
|
310
|
+
|
|
311
|
+
### After Quick Wins
|
|
312
|
+
- Reliability: 95%
|
|
313
|
+
- Avg Response Time: 5s (perceived, streaming)
|
|
314
|
+
- Tools Available: 15+
|
|
315
|
+
- Debuggability: High
|
|
316
|
+
- Monitoring: Basic health checks
|
|
317
|
+
|
|
318
|
+
### ROI
|
|
319
|
+
- **Time Investment**: 6.5 hours
|
|
320
|
+
- **Impact**: 10x improvement in capabilities
|
|
321
|
+
- **Payback**: Immediate
|
|
322
|
+
|
|
323
|
+
---
|
|
324
|
+
|
|
325
|
+
## Next Steps After Quick Wins
|
|
326
|
+
|
|
327
|
+
Once these are deployed:
|
|
328
|
+
|
|
329
|
+
1. **Week 2**: Add Prometheus metrics and hooks
|
|
330
|
+
2. **Week 3**: Implement hierarchical orchestration
|
|
331
|
+
3. **Week 4**: Add MCP custom tools and permissions
|
|
332
|
+
|
|
333
|
+
But these 5 changes alone will transform the system from a demo to production-ready.
|
|
@@ -0,0 +1,15 @@
|
|
|
1
|
+
# Architecture Documentation
|
|
2
|
+
|
|
3
|
+
System architecture, planning, and design documents.
|
|
4
|
+
|
|
5
|
+
## Overview
|
|
6
|
+
|
|
7
|
+
- [Executive Summary](EXECUTIVE_SUMMARY.md) - High-level system overview and capabilities
|
|
8
|
+
- [Integration Status](INTEGRATION-STATUS.md) - Current integration status and roadmap
|
|
9
|
+
- [Research Summary](RESEARCH_SUMMARY.md) - Technical research and findings
|
|
10
|
+
|
|
11
|
+
## Planning
|
|
12
|
+
|
|
13
|
+
- [Improvement Plan](IMPROVEMENT_PLAN.md) - System improvement roadmap
|
|
14
|
+
- [Quick Wins](QUICK_WINS.md) - High-impact, low-effort improvements
|
|
15
|
+
- [Multi-Model Router Plan](MULTI_MODEL_ROUTER_PLAN.md) - Router architecture and design
|