adaptive-memory-multi-model-router 2.14.46 → 2.14.48
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/{docs/llms.txt → llms.txt.bak} +6 -6
- package/package.json +270 -72
- package/src/routing/advancedRouter.ts.bak +650 -0
- package/test.js.bak +376 -0
- package/.dockerignore +0 -82
- package/.env.example +0 -303
- package/.github/DISCUSSIONS_WELCOME.md +0 -27
- package/.github/DISCUSSION_TEMPLATE.yml +0 -5
- package/.github/FUNDING.yml +0 -2
- package/.github/ISSUE_TEMPLATE/bug_report.md +0 -94
- package/.github/ISSUE_TEMPLATE/config.yml +0 -17
- package/.github/ISSUE_TEMPLATE/feature_request.md +0 -71
- package/.github/PULL_REQUEST_TEMPLATE.md +0 -71
- package/.github/dependabot.yml +0 -9
- package/.github/workflows/auto-publish.yml +0 -51
- package/.github/workflows/ci.yml +0 -263
- package/.github/workflows/codeql.yml +0 -38
- package/.github/workflows/npm-publish.yml +0 -20
- package/.github/workflows/pages.yml +0 -37
- package/.github/workflows/stale.yml +0 -54
- package/.publish-tick +0 -1
- package/.well-known/ai-plugin.json +0 -16
- package/AGENT_COUNCIL_FINDINGS.md +0 -142
- package/ARCHITECTURE.md +0 -346
- package/AUDIT_REPORT.md +0 -28
- package/CODE_OF_CONDUCT.md +0 -128
- package/CONTRIBUTING.md +0 -50
- package/CONTRIBUTORS.md +0 -20
- package/Dockerfile +0 -53
- package/Dockerfile.proxy +0 -33
- package/HEALTH_REPORT.md +0 -118
- package/IMPROVEMENT_PLAN.md +0 -107
- package/LANDING.md +0 -43
- package/LAUNCH-PAIN-DRIVEN.md +0 -339
- package/LAUNCH.md +0 -337
- package/LAUNCH_CHECKLIST.md +0 -141
- package/LAUNCH_SNAPSHOT.md +0 -260
- package/MANIFESTO.md +0 -41
- package/POPULARITY_BOOSTERS.md +0 -285
- package/PR_STATUS_REPORT.md +0 -148
- package/REDESIGN.md +0 -95
- package/RUNKIT.md +0 -83
- package/SECURITY.md +0 -29
- package/SUBMISSIONS.md +0 -43
- package/_schema.html +0 -53
- package/ai-plugin.json +0 -16
- package/articles/AI_AGENT_LLM_ROUTING.md +0 -150
- package/articles/CHINESE_DIRECTORIES.md +0 -100
- package/articles/CHINESE_SUBMISSIONS_READY.md +0 -322
- package/articles/COMPETITOR_ALERTS.md +0 -31
- package/articles/COMPLETE_POSTING_DIRECTORY.md +0 -147
- package/articles/CONTENT_STRUCTURE.md +0 -292
- package/articles/DEVTO_COST_GUIDE.md +0 -473
- package/articles/DEVTO_FINAL.md +0 -416
- package/articles/DEVTO_MULTI_PROVIDER.md +0 -542
- package/articles/DEVTO_READY.md +0 -255
- package/articles/DEVTO_V2_ANNOUNCEMENT.md +0 -160
- package/articles/DEVTO_VIRAL_GROWTH.md +0 -280
- package/articles/FRESH_devto.md +0 -460
- package/articles/FRESH_devto_2026_05.md +0 -73
- package/articles/FRESH_hackernews.md +0 -14
- package/articles/FRESH_reddit_ml.md +0 -90
- package/articles/FRESH_reddit_node.md +0 -198
- package/articles/FRESH_reddit_sideproject.md +0 -72
- package/articles/FRESH_reddit_webdev.md +0 -130
- package/articles/FROM_ZERO_TO_10K.md +0 -107
- package/articles/HN_10X_BETTER.md +0 -430
- package/articles/HN_ACCOUNT_GUIDE.md +0 -21
- package/articles/HN_CHINESE_STYLE.md +0 -308
- package/articles/HN_FINAL.md +0 -148
- package/articles/HN_POSTED_VERSION.md +0 -56
- package/articles/HN_POST_READY.md +0 -137
- package/articles/HN_RESEARCH.md +0 -364
- package/articles/HN_SHOW_routerarena.md +0 -17
- package/articles/HN_TIMING_GUIDE.md +0 -52
- package/articles/INDIEHACKERS_POST.md +0 -52
- package/articles/INDIEHACKERS_READY.md +0 -120
- package/articles/LLM_BENCHMARK_DEEP_DIVE.md +0 -153
- package/articles/MASTER_POSTING_DIRECTORY.md +0 -189
- package/articles/NEWSLETTER_SEND_NOW.md +0 -259
- package/articles/NEWSLETTER_SUBMISSIONS.md +0 -112
- package/articles/PAIN-DRIVEN-devto-v2.md +0 -308
- package/articles/PAIN-DRIVEN-devto-v3.md +0 -268
- package/articles/PAIN-DRIVEN-devto.md +0 -242
- package/articles/PAIN-DRIVEN-hackernews-v2.md +0 -138
- package/articles/PAIN-DRIVEN-hackernews-v3.md +0 -151
- package/articles/PAIN-DRIVEN-hackernews.md +0 -131
- package/articles/PAIN-DRIVEN-reddit-v2.md +0 -301
- package/articles/PAIN-DRIVEN-reddit-v3.md +0 -236
- package/articles/PAIN-DRIVEN-reddit.md +0 -218
- package/articles/PAIN-DRIVEN-twitter-v2.md +0 -110
- package/articles/PAIN-DRIVEN-twitter-v3.md +0 -121
- package/articles/PAIN-DRIVEN-twitter.md +0 -120
- package/articles/PORTKEY_VS_A3M.md +0 -147
- package/articles/POSTING_KIT_2026_05.md +0 -67
- package/articles/PRESS_KIT_routerarena.md +0 -77
- package/articles/PRODUCTHUNT_LISTING.md +0 -48
- package/articles/PRODUCTHUNT_READY.md +0 -106
- package/articles/PR_PLAN_vault.md +0 -125
- package/articles/REDDIT_FINAL.md +0 -232
- package/articles/REDDIT_POST.md +0 -67
- package/articles/REDDIT_SUBMISSION_READY.md +0 -348
- package/articles/ROUTERARENA_LEADER.md +0 -45
- package/articles/SHOW_HN_FINAL.md +0 -29
- package/articles/TWEETS_10K_DOWNLOADS.md +0 -47
- package/articles/TWEETS_BENCHMARK_FIRST.md +0 -46
- package/articles/TWEETS_MCP_PLAY.md +0 -51
- package/articles/TWEETS_SEQUENTIAL_BROKEN.md +0 -49
- package/articles/TWEETS_WHY_BUILD.md +0 -54
- package/articles/TWEETS_routerarena_leader.md +0 -53
- package/articles/TWEET_STORM_READY.md +0 -165
- package/articles/TWITTER_FINAL.md +0 -167
- package/articles/WHY_10X_BETTER.md +0 -261
- package/articles/WHY_CHINESE_STYLE_BETTER.md +0 -323
- package/articles/ai-discoverability-llm-routing.md +0 -210
- package/articles/devto-llm-routing.md +0 -138
- package/articles/hackernews-show-hn.md +0 -54
- package/articles/hashnode-llm-cost-optimization.md +0 -125
- package/articles/hn_show_2026_05.md +0 -11
- package/articles/medium-building-llm-router.md +0 -205
- package/articles/reddit-ml.md +0 -76
- package/articles/twitter-thread-cost-savings.md +0 -50
- package/articles/youtube-tutorial-script.md +0 -262
- package/assets/a3m_3blue1brown.mp4 +0 -0
- package/assets/banner.svg +0 -109
- package/assets/chart-cost-v2.svg +0 -91
- package/assets/chart-cost-v3.svg +0 -143
- package/assets/chart-features-v2.svg +0 -132
- package/assets/chart-features-v3.svg +0 -211
- package/assets/chart-growth-v2.svg +0 -122
- package/assets/chart-growth-v3.svg +0 -189
- package/assets/cost-comparison.svg +0 -134
- package/assets/cost-simple.svg +0 -64
- package/assets/demo-hn.gif +0 -0
- package/assets/feature-matrix.svg +0 -136
- package/assets/growth-chart-animated.svg +0 -76
- package/assets/growth-chart.svg +0 -82
- package/assets/growth-simple.svg +0 -69
- package/assets/hero-diagram.svg +0 -81
- package/assets/logo-new.svg +0 -21
- package/assets/logo.svg +0 -68
- package/assets/provider-comparison.svg +0 -121
- package/assets/social-preview-new.svg +0 -100
- package/assets/social-preview.svg +0 -194
- package/assets/social-v2.svg +0 -130
- package/assets/social-v3.svg +0 -212
- package/benchmark-provider-results.json +0 -245
- package/benchmark-results.json +0 -54
- package/council-votes/architecture-vote.md +0 -121
- package/council-votes/coverage-vote.md +0 -93
- package/data/adaptive-benchmark.json +0 -92
- package/data/benchmark-results.json +0 -47
- package/data/labeled-benchmark.json +0 -88
- package/demo/3blue1brown_video.py +0 -285
- package/demo/3blue1brown_video_v2.py +0 -310
- package/demo/IMPROVED_PROMPTS.md +0 -229
- package/demo/VEO3_PROMPTS.md +0 -269
- package/demo/VIDEO_PRODUCTION_GUIDE.md +0 -333
- package/demo/a3m_3blue1brown.mp4 +0 -0
- package/demo/asciinema-demo.sh +0 -195
- package/demo/demo-hn.tape +0 -74
- package/demo/demo-script.md +0 -53
- package/demo/demo-script.sh +0 -62
- package/demo/demo.svg +0 -75
- package/demo/frame1_ai_data_center.png +0 -0
- package/demo/frame1_sunset_video.mp4 +0 -0
- package/demo/frame2_cost_comparison.png +0 -0
- package/demo/frame2_cost_comparison_fallback.png +0 -0
- package/demo/frame3_parallel_execution.png +0 -0
- package/demo/frame3_parallel_execution_fallback.png +0 -0
- package/demo/frame4_providers.png +0 -0
- package/demo/frame4_providers_fallback.png +0 -0
- package/demo/frame5_endcard.png +0 -0
- package/demo/frame5_endcard_fallback.png +0 -0
- package/demo/new_frame1_hook.png +0 -0
- package/demo/new_frame2_proof.png +0 -0
- package/demo/new_frame3_wow.png +0 -0
- package/demo/new_frame4_social.png +0 -0
- package/demo/new_frame5_cta.png +0 -0
- package/demo/package.json +0 -13
- package/demo/product-video-final.mp4 +0 -0
- package/demo/product-video-hype-v1.mp4 +0 -0
- package/demo/product-video-v1.mp4 +0 -0
- package/demo/public/index.html +0 -762
- package/demo/recording.cast +0 -55
- package/demo/server.js +0 -405
- package/demo-new.tape +0 -71
- package/demo-real.sh +0 -198
- package/demo-simple.tape +0 -205
- package/demo.html +0 -520
- package/demo.sh +0 -85
- package/demo.tape +0 -259
- package/dist/analytics/costAnalytics.d.ts.map +0 -1
- package/dist/analytics/costAnalytics.js.map +0 -1
- package/dist/benchmark/comprehensive.js.map +0 -1
- package/dist/benchmark/reproducible.d.ts.map +0 -1
- package/dist/benchmark/reproducible.js.map +0 -1
- package/dist/cache/prefixCache.d.ts.map +0 -1
- package/dist/cache/prefixCache.js.map +0 -1
- package/dist/cache/responseCache.d.ts.map +0 -1
- package/dist/cache/responseCache.js.map +0 -1
- package/dist/cache/semanticCache.d.ts.map +0 -1
- package/dist/cache/semanticCache.js.map +0 -1
- package/dist/cli/setupWizard.d.ts.map +0 -1
- package/dist/cli/setupWizard.js.map +0 -1
- package/dist/cost/budgetEnforcer.d.ts.map +0 -1
- package/dist/cost/budgetEnforcer.js.map +0 -1
- package/dist/cost/costTracker.d.ts.map +0 -1
- package/dist/cost/costTracker.js.map +0 -1
- package/dist/ensemble/multiRoundDialog.js.map +0 -1
- package/dist/ensemble/shapleyValue.js.map +0 -1
- package/dist/integrations/langchainAdapter.d.ts.map +0 -1
- package/dist/integrations/langchainAdapter.js.map +0 -1
- package/dist/integrations/oauth.d.ts.map +0 -1
- package/dist/integrations/oauth.js.map +0 -1
- package/dist/integrations/scienceAdapter.js.map +0 -1
- package/dist/memory/autoFetch.d.ts.map +0 -1
- package/dist/memory/autoFetch.js.map +0 -1
- package/dist/memory/episodicMemory.d.ts.map +0 -1
- package/dist/memory/episodicMemory.js.map +0 -1
- package/dist/memory/hybridMemory.js.map +0 -1
- package/dist/memory/memoryTree.d.ts.map +0 -1
- package/dist/memory/memoryTree.js.map +0 -1
- package/dist/memory/obsidianVault.d.ts.map +0 -1
- package/dist/memory/obsidianVault.js.map +0 -1
- package/dist/memory/reasoningBank.js.map +0 -1
- package/dist/observability/changeWatch.d.ts.map +0 -1
- package/dist/observability/changeWatch.js.map +0 -1
- package/dist/observability/fatigueDetector.d.ts.map +0 -1
- package/dist/observability/fatigueDetector.js.map +0 -1
- package/dist/observability/index.d.ts.map +0 -1
- package/dist/observability/index.js.map +0 -1
- package/dist/observability/metrics.d.ts.map +0 -1
- package/dist/observability/metrics.js.map +0 -1
- package/dist/observability/middleware.d.ts.map +0 -1
- package/dist/observability/middleware.js.map +0 -1
- package/dist/observability/tracer.d.ts.map +0 -1
- package/dist/observability/tracer.js.map +0 -1
- package/dist/observability/types.d.ts.map +0 -1
- package/dist/observability/types.js.map +0 -1
- package/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
- package/dist/orchestration/haloOrchestrator.js.map +0 -1
- package/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
- package/dist/orchestration/mctsWorkflow.js.map +0 -1
- package/dist/providers/localProvider.d.ts.map +0 -1
- package/dist/providers/localProvider.js.map +0 -1
- package/dist/providers/providerConfig.d.ts.map +0 -1
- package/dist/providers/providerConfig.js.map +0 -1
- package/dist/providers/registry.d.ts.map +0 -1
- package/dist/providers/registry.js.map +0 -1
- package/dist/routing/advancedRouter.d.ts.map +0 -1
- package/dist/routing/advancedRouter.js.map +0 -1
- package/dist/routing/crossModelValidation.d.ts.map +0 -1
- package/dist/routing/crossModelValidation.js.map +0 -1
- package/dist/routing/providerHealth.d.ts.map +0 -1
- package/dist/routing/providerHealth.js.map +0 -1
- package/dist/routing/providerRetry.d.ts.map +0 -1
- package/dist/routing/providerRetry.js.map +0 -1
- package/dist/scripts/banner.js +0 -29
- package/dist/security/guardrails.d.ts.map +0 -1
- package/dist/security/guardrails.js.map +0 -1
- package/dist/server/dashboard.d.ts.map +0 -1
- package/dist/server/dashboard.js.map +0 -1
- package/dist/server/modelMapper.d.ts.map +0 -1
- package/dist/server/modelMapper.js.map +0 -1
- package/dist/server/proxyServer.d.ts.map +0 -1
- package/dist/server/proxyServer.js.map +0 -1
- package/dist/skills/__tests__/skill_manager.test.d.ts +0 -2
- package/dist/skills/__tests__/skill_manager.test.d.ts.map +0 -1
- package/dist/skills/__tests__/skill_manager.test.js +0 -268
- package/dist/skills/__tests__/skill_manager.test.js.map +0 -1
- package/dist/tools/tmlpdTools.d.ts.map +0 -1
- package/dist/tools/tmlpdTools.js.map +0 -1
- package/dist/tui/dashboard.d.ts.map +0 -1
- package/dist/tui/dashboard.js.map +0 -1
- package/dist/tui/index.d.ts.map +0 -1
- package/dist/tui/index.js.map +0 -1
- package/dist/utils/batchProcessor.d.ts.map +0 -1
- package/dist/utils/batchProcessor.js.map +0 -1
- package/dist/utils/compression.d.ts.map +0 -1
- package/dist/utils/compression.js.map +0 -1
- package/dist/utils/costUtils.d.ts.map +0 -1
- package/dist/utils/costUtils.js.map +0 -1
- package/dist/utils/reliability.d.ts.map +0 -1
- package/dist/utils/reliability.js.map +0 -1
- package/dist/utils/sorting.d.ts.map +0 -1
- package/dist/utils/sorting.js.map +0 -1
- package/dist/utils/speculativeDecoding.d.ts.map +0 -1
- package/dist/utils/speculativeDecoding.js.map +0 -1
- package/dist/utils/tokenUtils.d.ts.map +0 -1
- package/dist/utils/tokenUtils.js.map +0 -1
- package/docs/.nojekyll +0 -0
- package/docs/ANALYSIS_PRINCIPLES.md +0 -162
- package/docs/API.md +0 -855
- package/docs/ARCHITECTURAL-IMPROVEMENTS-2025.md +0 -1391
- package/docs/ARCHITECTURAL-IMPROVEMENTS-REVISED-2025.md +0 -1051
- package/docs/BENCHMARK.md +0 -170
- package/docs/CHINESE_PROVIDER_RELIABILITY.md +0 -37
- package/docs/CITATIONS.md +0 -74
- package/docs/CLAIMS_AND_EVIDENCE.md +0 -58
- package/docs/CONFIGURATION.md +0 -476
- package/docs/COUNCIL_DECISION.json +0 -816
- package/docs/COUNCIL_SUMMARY.md +0 -319
- package/docs/COUNCIL_V2.2_DECISION.md +0 -416
- package/docs/ENGINEERING_SPEC.md +0 -55
- package/docs/FACTORY_RESET.md +0 -34
- package/docs/GEO.md +0 -66
- package/docs/GEO_OPTIMIZATION.md +0 -30
- package/docs/GEO_ROOT_CAUSE.md +0 -136
- package/docs/GEO_STATUS.md +0 -85
- package/docs/GEO_TEST_RESULTS.md +0 -176
- package/docs/HN_CHECKLIST.md +0 -38
- package/docs/HN_FOUNDER_COMMENT.md +0 -17
- package/docs/HN_SUBMISSION_FINAL.md +0 -180
- package/docs/HN_SUBMISSION_V3.md +0 -56
- package/docs/IMPROVEMENT_ROADMAP.md +0 -515
- package/docs/INTEGRATIONS.md +0 -420
- package/docs/LANGCHAIN_INTEGRATION.md +0 -147
- package/docs/LLM_COUNCIL_DECISION.md +0 -508
- package/docs/MIDDLEWARE_CHAIN.md +0 -35
- package/docs/PROMO_CHECKLIST.md +0 -200
- package/docs/QUICKSTART.md +0 -271
- package/docs/QUICK_START.md +0 -43
- package/docs/QUICK_START_VISIBILITY.md +0 -782
- package/docs/REDDIT_GAP_ANALYSIS.md +0 -299
- package/docs/RELEASE_CHECKLIST.md +0 -32
- package/docs/REPRODUCIBILITY.md +0 -63
- package/docs/RESEARCH_BACKED_IMPROVEMENTS.md +0 -1180
- package/docs/ROUTING_RUBRIC.md +0 -197
- package/docs/SEO_AUDIT.md +0 -186
- package/docs/SOCIAL_LISTENING.md +0 -219
- package/docs/TMLPD_QNA.md +0 -751
- package/docs/TMLPD_V2.1_COMPLETE.md +0 -763
- package/docs/TMLPD_V2.2_RESEARCH_ROADMAP.md +0 -754
- package/docs/UPDATE_TOPICS.md +0 -15
- package/docs/USE_CASES.md +0 -59
- package/docs/V2.2_IMPLEMENTATION_COMPLETE.md +0 -446
- package/docs/V2_IMPLEMENTATION_GUIDE.md +0 -388
- package/docs/VERCEL_AI_SDK.md +0 -209
- package/docs/VISIBILITY_ADOPTION_PLAN.md +0 -1005
- package/docs/_config.yml +0 -49
- package/docs/ai-plugin.json +0 -16
- package/docs/api.html +0 -513
- package/docs/architecture-diagram.md +0 -40
- package/docs/benchmark-chart.png +0 -0
- package/docs/benchmark.html +0 -387
- package/docs/blog/routerarena-number-one.html +0 -73
- package/docs/cli-cheatsheet.md +0 -339
- package/docs/compare.md +0 -109
- package/docs/comparison-litellm.md +0 -88
- package/docs/comparison.md +0 -108
- package/docs/cost-chart-ascii.md +0 -42
- package/docs/cost-comparison-chart.svg +0 -88
- package/docs/curl-examples.md +0 -247
- package/docs/demo-auto.html +0 -264
- package/docs/demo.html +0 -416
- package/docs/geo/GENERATIVE_ENGINE_OPTIMIZATION.md +0 -232
- package/docs/index.html +0 -507
- package/docs/launch-content/LAUNCH_EXECUTION_CHECKLIST.md +0 -421
- package/docs/launch-content/README.md +0 -457
- package/docs/launch-content/assets/cost_comparison_100_tasks.png +0 -0
- package/docs/launch-content/assets/cumulative_savings.png +0 -0
- package/docs/launch-content/assets/parallel_speedup.png +0 -0
- package/docs/launch-content/assets/provider_pricing_comparison.png +0 -0
- package/docs/launch-content/assets/task_breakdown_comparison.png +0 -0
- package/docs/launch-content/generate_charts.py +0 -313
- package/docs/launch-content/hn_show_post.md +0 -139
- package/docs/launch-content/partner_outreach_templates.md +0 -745
- package/docs/launch-content/reddit_posts.md +0 -467
- package/docs/launch-content/twitter_thread.txt +0 -460
- package/docs/npm-downloads-chart.svg +0 -43
- package/docs/openapi.json +0 -139
- package/docs/openapi.yaml +0 -1318
- package/docs/quick-start.html +0 -366
- package/docs/robots.txt +0 -52
- package/docs/sitemap.xml +0 -57
- package/docs/styles.css +0 -682
- package/docs/well-known/ai-plugin.json +0 -16
- package/docs/wellknown/ai-plugin.json +0 -16
- package/docs-site/assets/og-banner.svg +0 -194
- package/docs-site/index.html +0 -632
- package/eval/README.md +0 -46
- package/eval/baselines/main.json +0 -12
- package/eval/benchmark_dataset.jsonl +0 -16
- package/eval/check_golden_routes.js +0 -64
- package/eval/datasets/catalog.json +0 -33
- package/eval/datasets/slices/cn_provider_reliability_v1.jsonl +0 -3
- package/eval/datasets/slices/cost_pressure_v1.jsonl +0 -3
- package/eval/datasets/slices/safety_guardrails_v1.jsonl +0 -3
- package/eval/evals.json +0 -199
- package/eval/fault_injection_thresholds.json +0 -3
- package/eval/generate_report.js +0 -128
- package/eval/golden_routes.json +0 -114
- package/eval/lib/experiment_registry.js +0 -24
- package/eval/run_eval.js +0 -197
- package/eval/run_fault_injection.js +0 -201
- package/eval/run_shadow_eval.js +0 -85
- package/eval/thresholds.json +0 -9
- package/examples/QUICKSTART.md +0 -183
- package/examples/README.md +0 -61
- package/examples/a3m-sdk.js +0 -124
- package/examples/basic-route.js +0 -54
- package/examples/chat-loop.js +0 -202
- package/examples/classify-then-route.js +0 -102
- package/examples/cost-compare.js +0 -120
- package/examples/ensemble.js +0 -160
- package/examples/whatsapp-telegram-bridge-demo.js +0 -302
- package/examples/whatsapp-telegram-bridge.js +0 -269
- package/hf-space/README.md +0 -23
- package/hf-space/app.py +0 -240
- package/hf-space/requirements.txt +0 -1
- package/huggingface_space/README.md +0 -35
- package/huggingface_space/app.py +0 -126
- package/huggingface_space/create_space.py +0 -208
- package/huggingface_space/requirements.txt +0 -1
- package/mcp-server/README.md +0 -188
- package/mcp-server/package.json +0 -29
- package/mcp-server/src/index.ts +0 -744
- package/mcp-server/tsconfig.json +0 -19
- package/openclaw-alexa-bridge/ALL_REMAINING_FIXES_PLAN.md +0 -313
- package/openclaw-alexa-bridge/REMAINING_FIXES_SUMMARY.md +0 -277
- package/openclaw-alexa-bridge/src/alexa_handler_no_tmlpd.js +0 -1234
- package/openclaw-alexa-bridge/test_fixes.js +0 -77
- package/playground/README.md +0 -51
- package/playground/codesandbox.json +0 -12
- package/playground/index.js +0 -39
- package/proxy/README.md +0 -227
- package/proxy/package-lock.json +0 -831
- package/proxy/package.json +0 -17
- package/proxy/rate-limit.js +0 -145
- package/proxy/rate-limit.test.js +0 -311
- package/proxy/server.js +0 -970
- package/python/README.md +0 -102
- package/python/a3m/__init__.py +0 -6
- package/python/a3m/client.py +0 -190
- package/python/a3m/models.py +0 -40
- package/python/a3m/sync_client.py +0 -61
- package/python/examples.py +0 -53
- package/python/integrations.py +0 -330
- package/python/pyproject.toml +0 -23
- package/python/setup.py +0 -28
- package/python/tmlpd.py +0 -369
- package/qna/REDDIT_GAP_ANALYSIS.md +0 -299
- package/qna/TMLPD_QNA.md +0 -751
- package/research/FINDING_001_safety.md +0 -28
- package/research/FINDING_002_error_diversity.md +0 -32
- package/research/FINDING_003_confidence_weighted_voting.md +0 -32
- package/research/FINDING_004_cross_model_semantic_detection.md +0 -37
- package/research/FINDING_005_knowledge_gap_orthogonality.md +0 -34
- package/research/HALLUCINATION_RESEARCH.md +0 -27
- package/research/ensemble-voting.md +0 -324
- package/research/loss-functions.md +0 -545
- package/research-log.md +0 -49
- package/scripts/banner.js +0 -29
- package/scripts/benchmark-local-routerarena.ts +0 -176
- package/scripts/benchmark.js +0 -145
- package/scripts/benchmark.sh +0 -61
- package/scripts/compare-providers.sh +0 -230
- package/scripts/content-planner.js +0 -25
- package/scripts/create-labeled-benchmark.ts +0 -105
- package/scripts/cross_post.py +0 -443
- package/scripts/local-router-benchmark.ts +0 -154
- package/scripts/post-all.sh +0 -41
- package/scripts/publish_fcc.py +0 -106
- package/scripts/push-to-gitee.sh +0 -25
- package/scripts/routerarena_ensemble.js +0 -144
- package/scripts/routing-benchmark-v2.js +0 -373
- package/scripts/routing-benchmark-v3.js +0 -118
- package/scripts/routing-benchmark.js +0 -462
- package/scripts/run-labeled-benchmark.mjs +0 -104
- package/scripts/run-mmlu-benchmark.js +0 -176
- package/scripts/run-provider-benchmark.js +0 -244
- package/scripts/update-npm-badges.js +0 -158
- package/skill/SKILL.md +0 -238
- package/src/__tests__/integration/tmpld_integration.test.py +0 -540
- package/src/skills/__tests__/skill_manager.test.ts +0 -328
- package/submissions/benchmarks/ALL_PLATFORMS_SUBMISSION.md +0 -94
- package/submissions/benchmarks/LLMROUTERBENCH_SUBMISSION.md +0 -121
- package/submissions/benchmarks/MMRBENCH_SUBMISSION.md +0 -94
- package/submissions/benchmarks/ROUTERARENA_UPDATE.md +0 -83
- package/submissions/benchmarks/ROUTERBENCH_SUBMISSION.md +0 -225
- package/test-council/1-structure-tests.test.js +0 -353
- package/test-council/1-structure-tests.test.ts +0 -353
- package/test-council/2-edge-case-tests.test.ts +0 -361
- package/test-council/3-performance-tests.test.ts +0 -669
- package/test-council/4-integration-tests.test.ts +0 -391
- package/test-council/5-agent-council-eval.test.ts +0 -413
- package/test-council/AGENT_COUNCIL_ARCHITECTURE.md +0 -349
- package/test-council/TEST_COUNCIL_REPORT.md +0 -201
- package/test-council/agents/edge-case-agent.ts +0 -363
- package/test-council/agents/performance-agent.ts +0 -426
- package/test-council/agents/structure-agent.ts +0 -227
- package/test-council/council.md +0 -183
- package/tests/__mocks__/tokenUtils.ts +0 -8
- package/tests/memory/episodicMemory.test.ts +0 -227
- package/tests/package-lock.json +0 -1628
- package/tests/package.json +0 -18
- package/tests/routing/ensembleVoting.test.ts +0 -236
- package/tests/routing/providerRetry.test.ts +0 -360
- package/tests/routing/queryTypePresets.test.ts +0 -208
- package/tests/security/guardrailEngine.test.ts +0 -700
- package/tests/tsconfig.json +0 -21
- package/tests/vitest.config.ts +0 -18
- package/tmlpd-pi-extension/README.md +0 -66
- package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts +0 -114
- package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/cache/prefixCache.js +0 -285
- package/tmlpd-pi-extension/dist/cache/prefixCache.js.map +0 -1
- package/tmlpd-pi-extension/dist/cache/responseCache.d.ts +0 -58
- package/tmlpd-pi-extension/dist/cache/responseCache.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/cache/responseCache.js +0 -153
- package/tmlpd-pi-extension/dist/cache/responseCache.js.map +0 -1
- package/tmlpd-pi-extension/dist/cli.js +0 -59
- package/tmlpd-pi-extension/dist/cost/costTracker.d.ts +0 -95
- package/tmlpd-pi-extension/dist/cost/costTracker.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/cost/costTracker.js +0 -240
- package/tmlpd-pi-extension/dist/cost/costTracker.js.map +0 -1
- package/tmlpd-pi-extension/dist/index.d.ts +0 -723
- package/tmlpd-pi-extension/dist/index.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/index.js +0 -239
- package/tmlpd-pi-extension/dist/index.js.map +0 -1
- package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts +0 -82
- package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/memory/episodicMemory.js +0 -145
- package/tmlpd-pi-extension/dist/memory/episodicMemory.js.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts +0 -102
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js +0 -207
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts +0 -85
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js +0 -210
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js.map +0 -1
- package/tmlpd-pi-extension/dist/providers/localProvider.d.ts +0 -102
- package/tmlpd-pi-extension/dist/providers/localProvider.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/providers/localProvider.js +0 -338
- package/tmlpd-pi-extension/dist/providers/localProvider.js.map +0 -1
- package/tmlpd-pi-extension/dist/providers/registry.d.ts +0 -55
- package/tmlpd-pi-extension/dist/providers/registry.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/providers/registry.js +0 -138
- package/tmlpd-pi-extension/dist/providers/registry.js.map +0 -1
- package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts +0 -68
- package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/routing/advancedRouter.js +0 -332
- package/tmlpd-pi-extension/dist/routing/advancedRouter.js.map +0 -1
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts +0 -101
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.js +0 -368
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts +0 -96
- package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/batchProcessor.js +0 -170
- package/tmlpd-pi-extension/dist/utils/batchProcessor.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/compression.d.ts +0 -61
- package/tmlpd-pi-extension/dist/utils/compression.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/compression.js +0 -281
- package/tmlpd-pi-extension/dist/utils/compression.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/reliability.d.ts +0 -74
- package/tmlpd-pi-extension/dist/utils/reliability.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/reliability.js +0 -177
- package/tmlpd-pi-extension/dist/utils/reliability.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts +0 -117
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js +0 -246
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts +0 -50
- package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/tokenUtils.js +0 -124
- package/tmlpd-pi-extension/dist/utils/tokenUtils.js.map +0 -1
- package/tmlpd-pi-extension/examples/QUICKSTART.md +0 -183
- package/tmlpd-pi-extension/package-lock.json +0 -79
- package/tmlpd-pi-extension/package.json +0 -172
- package/tmlpd-pi-extension/python/examples.py +0 -53
- package/tmlpd-pi-extension/python/integrations.py +0 -330
- package/tmlpd-pi-extension/python/setup.py +0 -28
- package/tmlpd-pi-extension/python/tmlpd.py +0 -369
- package/tmlpd-pi-extension/qna/REDDIT_GAP_ANALYSIS.md +0 -299
- package/tmlpd-pi-extension/qna/TMLPD_QNA.md +0 -751
- package/tmlpd-pi-extension/skill/SKILL.md +0 -238
- package/tmlpd-pi-extension/src/cache/responseCache.ts +0 -147
- package/tmlpd-pi-extension/src/cost/costTracker.ts +0 -302
- package/tmlpd-pi-extension/src/index.ts +0 -232
- package/tmlpd-pi-extension/src/memory/episodicMemory.ts +0 -257
- package/tmlpd-pi-extension/src/orchestration/haloOrchestrator.ts +0 -266
- package/tmlpd-pi-extension/src/orchestration/mctsWorkflow.ts +0 -262
- package/tmlpd-pi-extension/src/providers/localProvider.ts +0 -406
- package/tmlpd-pi-extension/src/providers/registry.ts +0 -164
- package/tmlpd-pi-extension/src/routing/ensembleVoting.ts +0 -159
- package/tmlpd-pi-extension/src/routing/queryTypePresets.ts +0 -136
- package/tmlpd-pi-extension/src/tools/tmlpdTools.ts +0 -433
- package/tmlpd-pi-extension/src/utils/batchProcessor.ts +0 -232
- package/tmlpd-pi-extension/src/utils/compression.ts +0 -325
- package/tmlpd-pi-extension/src/utils/reliability.ts +0 -221
- package/tmlpd-pi-extension/src/utils/tokenUtils.ts +0 -145
- package/tmlpd-pi-extension/tsconfig.json +0 -18
- package/tsconfig.build.json +0 -29
- package/tsconfig.json +0 -18
- /package/{docs/llms-full.txt → llms-full.txt.bak} +0 -0
package/articles/DEVTO_READY.md
DELETED
|
@@ -1,255 +0,0 @@
|
|
|
1
|
-
# I Built an LLM Router That Decides Which Model to Use in 0.3ms — No ML, No GPU
|
|
2
|
-
|
|
3
|
-
*How a 5-signal keyword classifier outperforms RouteLLM and beats GPT-5 at 1/200th the cost*
|
|
4
|
-
|
|
5
|
-
---
|
|
6
|
-
|
|
7
|
-
## The Problem Nobody Talks About
|
|
8
|
-
|
|
9
|
-
Every LLM gateway does sequential fallback:
|
|
10
|
-
|
|
11
|
-
```
|
|
12
|
-
Try GPT-4o → fail → Try Claude → fail → Try Groq → success → return
|
|
13
|
-
```
|
|
14
|
-
|
|
15
|
-
You get the **first successful answer**. Not the **best answer**. And that first provider is usually the most expensive one.
|
|
16
|
-
|
|
17
|
-
I was spending $800/month on LLM APIs. Half of those calls were GPT-4o answering "what is 2+2?" at $0.03/query. That's $0.03 to do a math problem a free model could answer just as well.
|
|
18
|
-
|
|
19
|
-
## The Solution: Parallel Ensemble with Scoring
|
|
20
|
-
|
|
21
|
-
Instead of sequential fallback, A3M calls all providers in parallel and scores every response:
|
|
22
|
-
|
|
23
|
-
- **Domain match** — does this provider handle the query's domain?
|
|
24
|
-
- **Specificity** — did it answer the actual question or give a generic response?
|
|
25
|
-
- **Structure alignment** — did it follow the requested format?
|
|
26
|
-
|
|
27
|
-
The cheapest provider that fully satisfies the query wins.
|
|
28
|
-
|
|
29
|
-
This is architecturally different from every other gateway. litellm, RouteLLM, Portkey — all sequential. A3M is parallel.
|
|
30
|
-
|
|
31
|
-
---
|
|
32
|
-
|
|
33
|
-
## The 5-Signal Architecture
|
|
34
|
-
|
|
35
|
-
Each incoming query is scored on five orthogonal signals. The weighted sum maps to a cost tier.
|
|
36
|
-
|
|
37
|
-
```
|
|
38
|
-
Query → [domain, task, structure, verb, specificity] → weighted sum → tier → provider
|
|
39
|
-
```
|
|
40
|
-
|
|
41
|
-
### Signal 1: Domain Detection
|
|
42
|
-
|
|
43
|
-
```typescript
|
|
44
|
-
const DOMAIN_PATTERNS: Record<string, RegExp[]> = {
|
|
45
|
-
code: [/\b(function|class|import|export|async|await)\b/gi, ...],
|
|
46
|
-
math: [/\b(equation|integral|derivative|theorem|calculate)\b/gi, ...],
|
|
47
|
-
legal: [/\b(contract|liability|clause|statute|compliance)\b/gi, ...],
|
|
48
|
-
medical: [/\b(diagnosis|symptom|treatment|clinical|dosage)\b/gi, ...],
|
|
49
|
-
};
|
|
50
|
-
|
|
51
|
-
function scoreDomain(query: string): number {
|
|
52
|
-
let maxScore = 0;
|
|
53
|
-
for (const [domain, patterns] of Object.entries(DOMAIN_PATTERNS)) {
|
|
54
|
-
const matchCount = patterns.reduce(
|
|
55
|
-
(sum, pattern) => sum + (query.match(pattern)?.length ?? 0), 0
|
|
56
|
-
);
|
|
57
|
-
maxScore = Math.max(maxScore, Math.min(matchCount * 0.15, 1.0));
|
|
58
|
-
}
|
|
59
|
-
return maxScore;
|
|
60
|
-
}
|
|
61
|
-
```
|
|
62
|
-
|
|
63
|
-
### Signal 2: Task Indicators
|
|
64
|
-
|
|
65
|
-
```typescript
|
|
66
|
-
const TASK_KEYWORDS: Record<string, { keywords: string[]; complexity: number }> = {
|
|
67
|
-
summarize: { keywords: ['summarize', 'tldr', 'brief', 'overview'], complexity: 0.2 },
|
|
68
|
-
explain: { keywords: ['explain', 'describe', 'what is', 'how does'], complexity: 0.3 },
|
|
69
|
-
debug: { keywords: ['debug', 'fix this', 'error', 'stack trace'], complexity: 0.55 },
|
|
70
|
-
create: { keywords: ['write', 'create', 'generate', 'build'], complexity: 0.75 },
|
|
71
|
-
architect: { keywords: ['architect', 'design a system', 'system design'], complexity: 0.9 },
|
|
72
|
-
};
|
|
73
|
-
|
|
74
|
-
function scoreTask(query: string): number {
|
|
75
|
-
const lower = query.toLowerCase();
|
|
76
|
-
let score = 0;
|
|
77
|
-
for (const [, config] of Object.entries(TASK_KEYWORDS)) {
|
|
78
|
-
if (config.keywords.some(kw => lower.includes(kw))) score += config.complexity;
|
|
79
|
-
}
|
|
80
|
-
return Math.min(score, 1.0);
|
|
81
|
-
}
|
|
82
|
-
```
|
|
83
|
-
|
|
84
|
-
### Signal 3: Query Structure
|
|
85
|
-
|
|
86
|
-
```typescript
|
|
87
|
-
function scoreStructure(query: string): number {
|
|
88
|
-
let score = 0;
|
|
89
|
-
const stepMarkers = query.split(/\b(first|then|after|finally)\b/i);
|
|
90
|
-
score += Math.max(0, (stepMarkers.length - 1)) * 0.2;
|
|
91
|
-
const conditionals = query.match(/\b(if|unless|otherwise|whether)\b/gi);
|
|
92
|
-
score += (conditionals?.length ?? 0) * 0.15;
|
|
93
|
-
score += Math.min(query.length / 500, 0.3);
|
|
94
|
-
return Math.min(score, 1.0);
|
|
95
|
-
}
|
|
96
|
-
```
|
|
97
|
-
|
|
98
|
-
### Signal 4: Verb Intensity
|
|
99
|
-
|
|
100
|
-
```typescript
|
|
101
|
-
const VERB_WEIGHTS: Record<string, number> = {
|
|
102
|
-
'what is': 0.1, 'define': 0.15, 'list': 0.2, 'describe': 0.25,
|
|
103
|
-
'explain': 0.35, 'convert': 0.4, 'translate': 0.4, 'summarize': 0.4,
|
|
104
|
-
'debug': 0.6, 'fix': 0.6, 'analyze': 0.65, 'compare': 0.65,
|
|
105
|
-
'optimize': 0.7, 'design': 0.8, 'architect': 0.85,
|
|
106
|
-
};
|
|
107
|
-
|
|
108
|
-
function scoreVerb(query: string): number {
|
|
109
|
-
const lower = query.toLowerCase();
|
|
110
|
-
return Math.max(...Object.entries(VERB_WEIGHTS)
|
|
111
|
-
.filter(([v]) => lower.includes(v))
|
|
112
|
-
.map(([, w]) => w), 0);
|
|
113
|
-
}
|
|
114
|
-
```
|
|
115
|
-
|
|
116
|
-
### Signal 5: Specificity
|
|
117
|
-
|
|
118
|
-
```typescript
|
|
119
|
-
function scoreSpecificity(query: string): number {
|
|
120
|
-
let score = 0;
|
|
121
|
-
const technicalTerms = query.match(/\b[A-Z][a-z]+[A-Z][a-z]+\b/g);
|
|
122
|
-
score += Math.min((technicalTerms?.length ?? 0) * 0.12, 0.3);
|
|
123
|
-
const numbers = query.match(/\d+/g);
|
|
124
|
-
score += Math.min((numbers?.length ?? 0) * 0.03, 0.15);
|
|
125
|
-
const vagueTerms = query.match(/\b(something|anything|stuff|things|etc)\b/gi);
|
|
126
|
-
score -= (vagueTerms?.length ?? 0) * 0.15;
|
|
127
|
-
return Math.max(0, Math.min(score, 1.0));
|
|
128
|
-
}
|
|
129
|
-
```
|
|
130
|
-
|
|
131
|
-
### Putting It Together
|
|
132
|
-
|
|
133
|
-
```typescript
|
|
134
|
-
const WEIGHTS = { domain: 0.25, task: 0.25, structure: 0.20, verbIntensity: 0.15, specificity: 0.15 };
|
|
135
|
-
const TIER_THRESHOLDS: [number, Tier][] = [[0.20,'free'], [0.40,'cheap'], [0.60,'mid'], [0.80,'premium'], [1.01,'enterprise']];
|
|
136
|
-
|
|
137
|
-
function route(query: string): Tier {
|
|
138
|
-
const score =
|
|
139
|
-
scoreDomain(query) * WEIGHTS.domain +
|
|
140
|
-
scoreTask(query) * WEIGHTS.task +
|
|
141
|
-
scoreStructure(query) * WEIGHTS.structure +
|
|
142
|
-
scoreVerb(query) * WEIGHTS.verbIntensity +
|
|
143
|
-
scoreSpecificity(query) * WEIGHTS.specificity;
|
|
144
|
-
return TIER_THRESHOLDS.find(([t]) => score < t)?.[1] ?? 'enterprise';
|
|
145
|
-
}
|
|
146
|
-
```
|
|
147
|
-
|
|
148
|
-
---
|
|
149
|
-
|
|
150
|
-
## Real Query Examples
|
|
151
|
-
|
|
152
|
-
### "What is Python?" → Free tier ✅
|
|
153
|
-
|
|
154
|
-
| Signal | Score | Weight | Weighted |
|
|
155
|
-
|--------|:-----:|:------:|:--------:|
|
|
156
|
-
| Domain | 0.0 | 0.25 | 0.000 |
|
|
157
|
-
| Task | 0.3 | 0.25 | 0.075 |
|
|
158
|
-
| Structure | 0.03 | 0.20 | 0.006 |
|
|
159
|
-
| Verb | 0.1 | 0.15 | 0.015 |
|
|
160
|
-
| Specificity | 0.0 | 0.15 | 0.000 |
|
|
161
|
-
| **Total** | | | **0.096** → Free |
|
|
162
|
-
|
|
163
|
-
### "Implement a red-black tree in TypeScript" → Mid tier ✅
|
|
164
|
-
|
|
165
|
-
| Signal | Score | Weight | Weighted |
|
|
166
|
-
|--------|:-----:|:------:|:--------:|
|
|
167
|
-
| Domain | 0.45 | 0.25 | 0.113 |
|
|
168
|
-
| Task | 0.75 | 0.25 | 0.188 |
|
|
169
|
-
| Structure | 0.15 | 0.20 | 0.030 |
|
|
170
|
-
| Verb | 0.75 | 0.15 | 0.113 |
|
|
171
|
-
| Specificity | 0.42 | 0.15 | 0.063 |
|
|
172
|
-
| **Total** | | | **0.505** → Mid |
|
|
173
|
-
|
|
174
|
-
---
|
|
175
|
-
|
|
176
|
-
## Benchmark Results
|
|
177
|
-
|
|
178
|
-
RouterArena (arXiv:2510.00202) — 8,400 queries, 9 domains:
|
|
179
|
-
|
|
180
|
-
| Router | Score | Cost/1K |
|
|
181
|
-
|--------|:-----:|:-------:|
|
|
182
|
-
| **A3M Router** | **70.32** | **$0.047** |
|
|
183
|
-
| Sqwish | 75.27 | $0.180 |
|
|
184
|
-
| Azure | 71.87 | $0.220 |
|
|
185
|
-
| GPT-5 | 64.32 | $10.020 |
|
|
186
|
-
| RouteLLM | 48.07 | $0.270 |
|
|
187
|
-
|
|
188
|
-
**#1 among cost-aware routers. 4.7× cheaper than the next cheapest. And it scores higher than GPT-5 at 200× lower cost.**
|
|
189
|
-
|
|
190
|
-
---
|
|
191
|
-
|
|
192
|
-
## The Cost Math
|
|
193
|
-
|
|
194
|
-
If you're spending **$1,000/month** on LLM APIs:
|
|
195
|
-
|
|
196
|
-
| Router | Score | Monthly Cost |
|
|
197
|
-
|--------|:-----:|:------------:|
|
|
198
|
-
| GPT-4o only | 64.32 | $1,000 |
|
|
199
|
-
| RouteLLM | 48.07 | $270 |
|
|
200
|
-
| A3M Router | **70.32** | **$47** |
|
|
201
|
-
|
|
202
|
-
**62% savings vs RouteLLM. 95% savings vs GPT-4o only.**
|
|
203
|
-
|
|
204
|
-
---
|
|
205
|
-
|
|
206
|
-
## Semantic Caching (30%+ Hit Rate)
|
|
207
|
-
|
|
208
|
-
Same algorithm, no extra infrastructure:
|
|
209
|
-
|
|
210
|
-
```typescript
|
|
211
|
-
function trigramJaccard(a: string, b: string): number {
|
|
212
|
-
const trigrams = (s: string) => {
|
|
213
|
-
const set = new Set<string>();
|
|
214
|
-
for (let i = 0; i <= s.length - 3; i++) set.add(s.slice(i, i + 3));
|
|
215
|
-
return set;
|
|
216
|
-
};
|
|
217
|
-
const intersection = [...trigrams(a)].filter(x => trigrams(b).has(x)).length;
|
|
218
|
-
return intersection / new Set([...trigrams(a), ...trigrams(b)]).size;
|
|
219
|
-
}
|
|
220
|
-
|
|
221
|
-
// "Explain React hooks" and "what are React hooks?" → Jaccard > 0.4 → cache hit
|
|
222
|
-
```
|
|
223
|
-
|
|
224
|
-
---
|
|
225
|
-
|
|
226
|
-
## Get Started
|
|
227
|
-
|
|
228
|
-
```bash
|
|
229
|
-
npm install adaptive-memory-multi-model-router
|
|
230
|
-
```
|
|
231
|
-
|
|
232
|
-
```typescript
|
|
233
|
-
import { A3MRouter } from 'adaptive-memory-multi-model-router';
|
|
234
|
-
|
|
235
|
-
const router = new A3MRouter({
|
|
236
|
-
providers: {
|
|
237
|
-
openai: { apiKey: process.env.OPENAI_API_KEY },
|
|
238
|
-
anthropic: { apiKey: process.env.ANTHROPIC_API_KEY },
|
|
239
|
-
groq: { apiKey: process.env.GROQ_API_KEY },
|
|
240
|
-
}
|
|
241
|
-
});
|
|
242
|
-
|
|
243
|
-
const result = await router.route({
|
|
244
|
-
messages: [{ role: 'user', content: 'Design a microservices architecture' }]
|
|
245
|
-
});
|
|
246
|
-
// → Premium tier
|
|
247
|
-
console.log(result.provider, result.cost);
|
|
248
|
-
```
|
|
249
|
-
|
|
250
|
-
**GitHub:** [https://github.com/Das-rebel/a3m-router](https://github.com/Das-rebel/a3m-router)
|
|
251
|
-
**Live Demo:** [https://das-rebel.github.io/a3m-router/](https://das-rebel.github.io/a3m-router/)
|
|
252
|
-
|
|
253
|
-
---
|
|
254
|
-
|
|
255
|
-
*19.5 KB. Zero ML dependencies. 0.3ms routing latency. 47+ providers. MIT license.*
|
|
@@ -1,160 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
title: "A3M Router v2.0: Now an OpenAI-Compatible AI Gateway with 39 Providers 🚀"
|
|
3
|
-
published: true
|
|
4
|
-
description: "Drop-in OpenAI proxy, LangChain adapter, guardrails, semantic cache, cost analytics, and 39 LLM providers — all in one npm package"
|
|
5
|
-
tags: node, javascript, ai, webdev
|
|
6
|
-
canonical_url: https://github.com/Das-rebel/a3m-router
|
|
7
|
-
---
|
|
8
|
-
|
|
9
|
-
We just shipped A3M Router v2.0.0 — the biggest update since launch.
|
|
10
|
-
|
|
11
|
-
**What started as a simple routing library is now a full AI Gateway.**
|
|
12
|
-
|
|
13
|
-
## What's New
|
|
14
|
-
|
|
15
|
-
### 1. OpenAI-Compatible Proxy Server
|
|
16
|
-
|
|
17
|
-
```bash
|
|
18
|
-
npx a3m-router serve
|
|
19
|
-
```
|
|
20
|
-
|
|
21
|
-
That's it. You now have an OpenAI-compatible API proxy running on `localhost:8787`.
|
|
22
|
-
|
|
23
|
-
```python
|
|
24
|
-
# Drop-in replacement — just change the base URL
|
|
25
|
-
from openai import OpenAI
|
|
26
|
-
|
|
27
|
-
client = OpenAI(
|
|
28
|
-
base_url="http://localhost:8787/v1",
|
|
29
|
-
api_key="not-needed" # A3M Router handles provider keys
|
|
30
|
-
)
|
|
31
|
-
|
|
32
|
-
response = client.chat.completions.create(
|
|
33
|
-
model="auto", # Intelligent routing
|
|
34
|
-
messages=[{"role": "user", "content": "Hello"}]
|
|
35
|
-
)
|
|
36
|
-
```
|
|
37
|
-
|
|
38
|
-
**Any OpenAI SDK works without code changes.** Python, Node, LangChain, LlamaIndex — just point `base_url` to A3M Router.
|
|
39
|
-
|
|
40
|
-
### 2. Real-Time Dashboard
|
|
41
|
-
|
|
42
|
-
```
|
|
43
|
-
http://localhost:8787/
|
|
44
|
-
```
|
|
45
|
-
|
|
46
|
-
Live dashboard showing:
|
|
47
|
-
- Request volume and costs
|
|
48
|
-
- Provider status (online/offline)
|
|
49
|
-
- Request log with routing decisions
|
|
50
|
-
- Cost breakdown by provider
|
|
51
|
-
|
|
52
|
-
### 3. LangChain Adapter
|
|
53
|
-
|
|
54
|
-
```javascript
|
|
55
|
-
import { A3MChatModel } from 'adaptive-memory-multi-model-router/langchain';
|
|
56
|
-
|
|
57
|
-
const model = new A3MChatModel({ modelName: 'auto' });
|
|
58
|
-
const response = await model.invoke([new HumanMessage("Hello")]);
|
|
59
|
-
```
|
|
60
|
-
|
|
61
|
-
Drop-in replacement for `ChatOpenAI`. Supports streaming, tool calling, and structured output.
|
|
62
|
-
|
|
63
|
-
### 4. Guardrails Engine
|
|
64
|
-
|
|
65
|
-
```javascript
|
|
66
|
-
import { GuardrailEngine } from 'adaptive-memory-multi-model-router';
|
|
67
|
-
|
|
68
|
-
const guardrail = new GuardrailEngine({
|
|
69
|
-
promptInjection: true,
|
|
70
|
-
piiDetection: true,
|
|
71
|
-
contentFilter: true
|
|
72
|
-
});
|
|
73
|
-
|
|
74
|
-
const result = await guardrail.checkInput(userInput);
|
|
75
|
-
if (result.blocked) {
|
|
76
|
-
// Prompt injection or PII detected
|
|
77
|
-
console.log(result.reason);
|
|
78
|
-
}
|
|
79
|
-
```
|
|
80
|
-
|
|
81
|
-
Built-in detection for:
|
|
82
|
-
- Prompt injection attempts
|
|
83
|
-
- PII (emails, phones, SSNs, credit cards, API keys)
|
|
84
|
-
- Harmful content
|
|
85
|
-
- Language detection for routing
|
|
86
|
-
|
|
87
|
-
### 5. Semantic Cache
|
|
88
|
-
|
|
89
|
-
```javascript
|
|
90
|
-
import { SemanticCache } from 'adaptive-memory-multi-model-router';
|
|
91
|
-
|
|
92
|
-
const cache = new SemanticCache({ similarityThreshold: 0.92 });
|
|
93
|
-
|
|
94
|
-
// First query: miss, calls provider
|
|
95
|
-
const result1 = await cache.get("What is Python?");
|
|
96
|
-
|
|
97
|
-
// Semantically similar: HIT (no API call!)
|
|
98
|
-
const result2 = await cache.get("Tell me about Python");
|
|
99
|
-
```
|
|
100
|
-
|
|
101
|
-
Catches semantically similar queries using n-gram similarity. No embedding API needed.
|
|
102
|
-
|
|
103
|
-
### 6. Cost Analytics
|
|
104
|
-
|
|
105
|
-
```javascript
|
|
106
|
-
import { CostAnalytics } from 'adaptive-memory-multi-model-router';
|
|
107
|
-
|
|
108
|
-
const analytics = new CostAnalytics();
|
|
109
|
-
analytics.record({ provider: 'groq', cost: 0.001, latency: 420 });
|
|
110
|
-
|
|
111
|
-
const savings = analytics.getSavings('openai');
|
|
112
|
-
// Total saved: $X, XX% cheaper, projected monthly savings: $Y
|
|
113
|
-
```
|
|
114
|
-
|
|
115
|
-
### 7. 39 Providers (was 12)
|
|
116
|
-
|
|
117
|
-
New providers added:
|
|
118
|
-
- **Local**: Ollama, LM Studio, vLLM
|
|
119
|
-
- **Cheap/Fast**: DeepInfra, Together AI, Fireworks, Anyscale, Novita, SambaNova
|
|
120
|
-
- **Mid-tier**: Cohere, Perplexity, AI21
|
|
121
|
-
- **Asian**: DeepSeek, Moonshot, Qwen, Zhipu, Yi
|
|
122
|
-
- **Enterprise**: Azure OpenAI, AWS Bedrock, Google Vertex
|
|
123
|
-
|
|
124
|
-
## Quick Start
|
|
125
|
-
|
|
126
|
-
```bash
|
|
127
|
-
# Install
|
|
128
|
-
npm install adaptive-memory-multi-model-router
|
|
129
|
-
|
|
130
|
-
# Start the gateway
|
|
131
|
-
npx a3m-router serve
|
|
132
|
-
|
|
133
|
-
# Or use as a library
|
|
134
|
-
node -e "const { createA3MRouter } = require('adaptive-memory-multi-model-router'); ..."
|
|
135
|
-
```
|
|
136
|
-
|
|
137
|
-
## The Numbers
|
|
138
|
-
|
|
139
|
-
| Metric | v1.9.5 | v2.0.0 |
|
|
140
|
-
|--------|--------|--------|
|
|
141
|
-
| Providers | 12 | 39 |
|
|
142
|
-
| Exports | 13 | 17 |
|
|
143
|
-
| Proxy server | ❌ | ✅ |
|
|
144
|
-
| Dashboard | ❌ | ✅ |
|
|
145
|
-
| LangChain | ❌ | ✅ |
|
|
146
|
-
| Guardrails | Basic | Full engine |
|
|
147
|
-
| Semantic cache | ❌ | ✅ |
|
|
148
|
-
| Cost analytics | Basic | Full analytics |
|
|
149
|
-
|
|
150
|
-
## Links
|
|
151
|
-
|
|
152
|
-
- **GitHub**: https://github.com/Das-rebel/a3m-router
|
|
153
|
-
- **NPM**: https://www.npmjs.com/package/adaptive-memory-multi-model-router
|
|
154
|
-
- **872+ weekly downloads**
|
|
155
|
-
|
|
156
|
-
MIT license. Open source. No vendor lock-in.
|
|
157
|
-
|
|
158
|
-
---
|
|
159
|
-
|
|
160
|
-
*Built because I was tired of marketing claims. Sharing the data so you don't have to benchmark yourself.*
|
|
@@ -1,280 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
title: "2,775 Downloads in 3 Days With Zero Marketing Budget — Here's What Happened"
|
|
3
|
-
published: true
|
|
4
|
-
description: "The honest growth story of an open-source LLM router: what worked, what failed, and why 1,903 people downloaded it yesterday"
|
|
5
|
-
tags: discuss, opensource, ai, webdev
|
|
6
|
-
canonical_url: https://github.com/Das-rebel/a3m-router
|
|
7
|
-
---
|
|
8
|
-
|
|
9
|
-
# 2,775 Downloads in 3 Days With Zero Marketing Budget — Here's What Happened
|
|
10
|
-
|
|
11
|
-
I need to be honest up front: I don't fully know why this happened.
|
|
12
|
-
|
|
13
|
-
Three days ago, we published [adaptive-memory-multi-model-router](https://www.npmjs.com/package/adaptive-memory-multi-model-router) v2.0.0 to npm. No Product Hunt launch. No Twitter thread. No influencer shoutout. No paid ads. Nothing.
|
|
14
|
-
|
|
15
|
-
Here's what the download numbers look like:
|
|
16
|
-
|
|
17
|
-
| Day | Downloads | Notes |
|
|
18
|
-
|-----|-----------|-------|
|
|
19
|
-
| Day 1 | 552 | Steady trickle after publish |
|
|
20
|
-
| Day 2 | 320 | Actually *dropped* — I was worried |
|
|
21
|
-
| Day 3 | **1,903** | What? |
|
|
22
|
-
| **Total** | **2,775** | In 72 hours |
|
|
23
|
-
|
|
24
|
-
That's not a typo. Day 3 had more downloads than Days 1 and 2 combined. Six times more.
|
|
25
|
-
|
|
26
|
-
This is the story of what we did, what actually worked, what completely failed, and what I think happened. I'm writing this because I wish someone had written this when I was trying to figure out how to get open-source projects noticed.
|
|
27
|
-
|
|
28
|
-
## What We Built
|
|
29
|
-
|
|
30
|
-
First, context. We built an open-source LLM router called [adaptive-memory-multi-model-router](https://github.com/Das-rebel/a3m-router) (the name is terrible, I know — we're working on it). It does three things:
|
|
31
|
-
|
|
32
|
-
1. **Routes requests across 39 LLM providers** — OpenAI, Anthropic, Gemini, Groq, Cerebras, Mistral, DeepSeek, Ollama, and 30+ more
|
|
33
|
-
2. **Adaptive memory** — tracks which models work best for your use case and routes accordingly
|
|
34
|
-
3. **OpenAI-compatible proxy server** — drop-in replacement for `api.openai.com`
|
|
35
|
-
|
|
36
|
-
The proxy server was the v2.0.0 addition. You run:
|
|
37
|
-
|
|
38
|
-
```bash
|
|
39
|
-
npx a3m-router serve
|
|
40
|
-
```
|
|
41
|
-
|
|
42
|
-
And suddenly every OpenAI SDK in your stack can talk to any of 39 providers. No code changes. No new SDKs. Just swap the base URL.
|
|
43
|
-
|
|
44
|
-
That's it. That's the product.
|
|
45
|
-
|
|
46
|
-
## The Strategy (What We Thought Would Work)
|
|
47
|
-
|
|
48
|
-
Before publishing, we had a plan. It was a bad plan, but it was a plan:
|
|
49
|
-
|
|
50
|
-
### 1. Dev.to Articles
|
|
51
|
-
|
|
52
|
-
We wrote 4 articles. Technical, detailed, with code examples. We thought the developer community would pick them up.
|
|
53
|
-
|
|
54
|
-
**Result: 0 views.** Literally zero. New Dev.to account, no followers, no history. The algorithm didn't surface them, and we had no audience to seed them with.
|
|
55
|
-
|
|
56
|
-
Lesson learned: publishing to a platform where you have zero reputation is like shouting into the void. Content doesn't go viral on its own — community does.
|
|
57
|
-
|
|
58
|
-
### 2. Hacker News
|
|
59
|
-
|
|
60
|
-
We submitted to Hacker News. Show HN, carefully written, technical angle.
|
|
61
|
-
|
|
62
|
-
**Result: Flagged and buried.** New account, first post, no karma. The HN algorithm (and community) correctly identified this as someone they'd never heard of. Gone within minutes.
|
|
63
|
-
|
|
64
|
-
Lesson learned: HN requires community cred. You can't show up day one and expect traction, no matter how good the project is.
|
|
65
|
-
|
|
66
|
-
### 3. GitHub Stars
|
|
67
|
-
|
|
68
|
-
We expected developers to star the repo after discovering it.
|
|
69
|
-
|
|
70
|
-
**Result: npm is the front door.** 2,775 developers found the package through npm search alone. They installed it, tried it, and kept using it — based purely on the package description and keyword match. No blog post. No HN launch. No Twitter thread. npm SEO did 100% of the work.
|
|
71
|
-
|
|
72
|
-
This stings, but it makes sense. People don't star repos they find through npm. They install, they try, they move on. GitHub stars come from community, not package managers.
|
|
73
|
-
|
|
74
|
-
### 4. SEO and Keywords
|
|
75
|
-
|
|
76
|
-
This is where things get interesting.
|
|
77
|
-
|
|
78
|
-
## What Actually Worked (The 156 Keywords)
|
|
79
|
-
|
|
80
|
-
Here's the part I didn't expect to matter. When we published to npm, we stuffed the `package.json` with 156 keywords. Not spammy SEO garbage — actual, descriptive keywords that developers and AI agents might search for:
|
|
81
|
-
|
|
82
|
-
```json
|
|
83
|
-
{
|
|
84
|
-
"keywords": [
|
|
85
|
-
"openai",
|
|
86
|
-
"anthropic",
|
|
87
|
-
"llm",
|
|
88
|
-
"router",
|
|
89
|
-
"gateway",
|
|
90
|
-
"proxy",
|
|
91
|
-
"multi-model",
|
|
92
|
-
"ai",
|
|
93
|
-
"gpt",
|
|
94
|
-
"claude",
|
|
95
|
-
"gemini",
|
|
96
|
-
"groq",
|
|
97
|
-
"ollama",
|
|
98
|
-
"langchain",
|
|
99
|
-
"agent",
|
|
100
|
-
"inference",
|
|
101
|
-
"fallback",
|
|
102
|
-
"load-balancing",
|
|
103
|
-
"streaming",
|
|
104
|
-
"adaptive",
|
|
105
|
-
// ... 136 more
|
|
106
|
-
]
|
|
107
|
-
}
|
|
108
|
-
```
|
|
109
|
-
|
|
110
|
-
The intent was simple: make the package findable when someone searches npm for "openai proxy" or "llm router" or "multi-model gateway."
|
|
111
|
-
|
|
112
|
-
But I think what actually happened is more interesting.
|
|
113
|
-
|
|
114
|
-
### The AI Agent Discoverability Hypothesis
|
|
115
|
-
|
|
116
|
-
Here's my theory on the Day 3 spike: **AI coding agents are discovering packages through npm search.**
|
|
117
|
-
|
|
118
|
-
When a developer asks Cursor, Copilot, Windsurf, or any AI agent to "set up an LLM routing solution" or "add multi-model support to my app," the agent searches npm. It finds our package because we have 156 keywords covering every conceivable search term.
|
|
119
|
-
|
|
120
|
-
The Day 3 spike wasn't organic human discovery. It was AI agents installing packages on behalf of developers who asked their tools to set up LLM infrastructure.
|
|
121
|
-
|
|
122
|
-
I can't prove this. But the pattern fits:
|
|
123
|
-
- Day 1: Initial publish, some human curiosity
|
|
124
|
-
- Day 2: Drop-off, the curiosity faded
|
|
125
|
-
- Day 3: Spike — something systemic kicked in
|
|
126
|
-
|
|
127
|
-
If I'm right, this has massive implications for open-source discoverability. The next SEO isn't Google — it's npm keywords optimized for AI agent search.
|
|
128
|
-
|
|
129
|
-
### The Proxy Server Effect
|
|
130
|
-
|
|
131
|
-
The other factor: `npx a3m-router serve`.
|
|
132
|
-
|
|
133
|
-
This one command gives you a working OpenAI-compatible API gateway. No configuration, no API keys to start, no setup wizard. Just run it and point your existing OpenAI SDK at `http://localhost:8080/v1`.
|
|
134
|
-
|
|
135
|
-
Developers (and AI agents) can test it in 30 seconds:
|
|
136
|
-
|
|
137
|
-
```bash
|
|
138
|
-
# Install and start
|
|
139
|
-
npm install adaptive-memory-multi-model-router
|
|
140
|
-
npx a3m-router serve
|
|
141
|
-
|
|
142
|
-
# In another terminal — your existing OpenAI code, unchanged
|
|
143
|
-
curl http://localhost:8000/v1/chat/completions \
|
|
144
|
-
-H "Content-Type: application/json" \
|
|
145
|
-
-d '{"model":"claude-3-haiku-20240307","messages":[{"role":"user","content":"hello"}]}'
|
|
146
|
-
```
|
|
147
|
-
|
|
148
|
-
Zero friction. Zero code changes. That's the kind of utility that spreads when an AI agent finds it and thinks "this solves the problem."
|
|
149
|
-
|
|
150
|
-
## The Download Hockey Stick
|
|
151
|
-
|
|
152
|
-
Here's the numbers again, visualized:
|
|
153
|
-
|
|
154
|
-
```
|
|
155
|
-
2,000 | ***
|
|
156
|
-
| * *
|
|
157
|
-
1,500 | * *
|
|
158
|
-
| * *
|
|
159
|
-
1,000 | * *
|
|
160
|
-
| * *
|
|
161
|
-
500 | *** * *
|
|
162
|
-
| * * * *
|
|
163
|
-
0 |* * * * * *
|
|
164
|
-
+------ ------ --------------------
|
|
165
|
-
Day 1 Day 2 Day 3
|
|
166
|
-
552 320 1,903
|
|
167
|
-
```
|
|
168
|
-
|
|
169
|
-
Day 3 is a hockey stick. Downloads went from hundreds to nearly two thousand overnight. The weekly npm badge went from 872 to 2,775 in a single day.
|
|
170
|
-
|
|
171
|
-
## What We Did Wrong (Complete Transparency)
|
|
172
|
-
|
|
173
|
-
I want to be honest about the failures because growth stories that only highlight successes are worthless.
|
|
174
|
-
|
|
175
|
-
**We built features instead of community.** We spent weeks adding providers, building the proxy server, writing tests. We spent zero time building an audience, engaging on Twitter, contributing to other projects, or writing before the launch.
|
|
176
|
-
|
|
177
|
-
**We ignored the landing page.** No website, no docs site — just a solid README and npm package. 2,775 people installed it based on the npm description alone. That's the power of good package metadata.
|
|
178
|
-
|
|
179
|
-
**We picked a terrible name.** `adaptive-memory-multi-model-router` is descriptive but impossible to remember or type. We should have branded it something short and memorable from day one.
|
|
180
|
-
|
|
181
|
-
**We launched on platforms where we had no presence.** Dev.to with 0 followers. HN with 0 karma. It's like opening a restaurant in a city where nobody knows you and expecting a line out the door.
|
|
182
|
-
|
|
183
|
-
## The v2.0 Pivot That Changed Everything
|
|
184
|
-
|
|
185
|
-
The original v1.0 was a library. You imported it, configured it, wrote code against it. Useful, but not the kind of thing that spreads.
|
|
186
|
-
|
|
187
|
-
v2.0 added the proxy server, and that changed the entire value proposition:
|
|
188
|
-
|
|
189
|
-
| Before v2.0 | After v2.0 |
|
|
190
|
-
|-------------|------------|
|
|
191
|
-
| Library you import | Service you run |
|
|
192
|
-
| Requires code changes | Drop-in replacement |
|
|
193
|
-
| Framework-specific | Works with everything |
|
|
194
|
-
| Need to learn the API | OpenAI-compatible |
|
|
195
|
-
|
|
196
|
-
The proxy server means you don't need to learn our API. You don't need to change your code. You don't need to rewrite your LangChain chains or your HTTP calls. You just swap `api.openai.com` for `localhost:8000` and everything works.
|
|
197
|
-
|
|
198
|
-
That's the kind of product that spreads. Not because it's better — but because it requires zero effort to try.
|
|
199
|
-
|
|
200
|
-
## The Dashboard Effect
|
|
201
|
-
|
|
202
|
-
v2.0 also added a web dashboard. You run the proxy and open `http://localhost:8081` to see:
|
|
203
|
-
|
|
204
|
-
- All 39 providers and their status
|
|
205
|
-
- Real-time request routing
|
|
206
|
-
- Cost tracking across providers
|
|
207
|
-
- Latency comparisons
|
|
208
|
-
- Memory scores (which models perform best)
|
|
209
|
-
|
|
210
|
-
The dashboard isn't just useful — it's **proof**. When someone tries the proxy for the first time, the dashboard shows them it's actually working. That builds trust instantly.
|
|
211
|
-
|
|
212
|
-
## Try It (30 Seconds to a Working AI Gateway)
|
|
213
|
-
|
|
214
|
-
If you've read this far, you might as well try it:
|
|
215
|
-
|
|
216
|
-
```bash
|
|
217
|
-
# Install
|
|
218
|
-
npm install -g adaptive-memory-multi-model-router
|
|
219
|
-
|
|
220
|
-
# Start the proxy server
|
|
221
|
-
npx a3m-router serve
|
|
222
|
-
|
|
223
|
-
# Test it (OpenAI-compatible!)
|
|
224
|
-
curl http://localhost:8000/v1/chat/completions \
|
|
225
|
-
-H "Content-Type: application/json" \
|
|
226
|
-
-d '{
|
|
227
|
-
"model": "gpt-4o-mini",
|
|
228
|
-
"messages": [{"role": "user", "content": "Hello!"}]
|
|
229
|
-
}'
|
|
230
|
-
```
|
|
231
|
-
|
|
232
|
-
Or with the OpenAI SDK:
|
|
233
|
-
|
|
234
|
-
```javascript
|
|
235
|
-
import OpenAI from 'openai';
|
|
236
|
-
|
|
237
|
-
const client = new OpenAI({
|
|
238
|
-
baseURL: 'http://localhost:8000/v1',
|
|
239
|
-
apiKey: 'any-key-works' // proxy handles routing
|
|
240
|
-
});
|
|
241
|
-
|
|
242
|
-
const response = await client.chat.completions.create({
|
|
243
|
-
model: 'gpt-4o-mini',
|
|
244
|
-
messages: [{ role: 'user', content: 'Hello!' }]
|
|
245
|
-
});
|
|
246
|
-
```
|
|
247
|
-
|
|
248
|
-
Same SDK. Same API. Different backend. That's the point.
|
|
249
|
-
|
|
250
|
-
**GitHub:** [github.com/Das-rebel/a3m-router](https://github.com/Das-rebel/a3m-router)
|
|
251
|
-
|
|
252
|
-
**NPM:** [npmjs.com/package/adaptive-memory-multi-model-router](https://www.npmjs.com/package/adaptive-memory-multi-model-router)
|
|
253
|
-
|
|
254
|
-
## What's Next (And We Need Your Help)
|
|
255
|
-
|
|
256
|
-
The downloads are great, but downloads without community is just a number on a badge. Here's what we need:
|
|
257
|
-
|
|
258
|
-
1. **GitHub stars help discoverability.** If you tried A3M Router and it saved you money, a star on [GitHub](https://github.com/Das-rebel/a3m-router) helps other developers find it.
|
|
259
|
-
|
|
260
|
-
2. **What providers do you need?** We have 39. But if your provider isn't listed, tell us. We'll add it.
|
|
261
|
-
|
|
262
|
-
3. **What features are missing?** The proxy is new. The memory system is new. We know there are gaps. Open an issue, even if it's just a one-liner.
|
|
263
|
-
|
|
264
|
-
4. **War stories.** If you tried it and something broke, tell us. If it worked in a way we didn't expect, tell us that too. Building in public means nothing if nobody talks back.
|
|
265
|
-
|
|
266
|
-
## The Honest Takeaway
|
|
267
|
-
|
|
268
|
-
I don't have a growth hack for you. We didn't crack a secret algorithm. We didn't game any system.
|
|
269
|
-
|
|
270
|
-
What happened is: we built something genuinely useful (an OpenAI-compatible proxy that talks to 39 providers), we made it trivially easy to try (`npx a3m-router serve`), and we made it findable (156 npm keywords).
|
|
271
|
-
|
|
272
|
-
Then we got lucky with timing. AI agents are becoming the primary way developers discover tools. Our keywords made us findable by those agents. The proxy made us useful once found.
|
|
273
|
-
|
|
274
|
-
That's the whole story. No secrets. No tricks. Just build useful things, make them easy to try, and make them findable.
|
|
275
|
-
|
|
276
|
-
The 2,775 downloads proved npm search is a viable growth channel. Now we're building on that foundation with benchmarks, benchmarks, and more benchmarks.
|
|
277
|
-
|
|
278
|
-
---
|
|
279
|
-
|
|
280
|
-
*If this story was useful, follow along at [github.com/Das-rebel/a3m-router](https://github.com/Das-rebel/a3m-router). Stars appreciated. Issues celebrated. PRs worshipped.*
|