adaptive-memory-multi-model-router 2.14.46 → 2.14.47
- package/{docs/llms.txt → llms.txt.bak} +6 -6
- package/package.json +13 -84
- package/src/routing/advancedRouter.ts.bak +650 -0
- package/test.js.bak +376 -0
- package/.dockerignore +0 -82
- package/.env.example +0 -303
- package/.github/DISCUSSIONS_WELCOME.md +0 -27
- package/.github/DISCUSSION_TEMPLATE.yml +0 -5
- package/.github/FUNDING.yml +0 -2
- package/.github/ISSUE_TEMPLATE/bug_report.md +0 -94
- package/.github/ISSUE_TEMPLATE/config.yml +0 -17
- package/.github/ISSUE_TEMPLATE/feature_request.md +0 -71
- package/.github/PULL_REQUEST_TEMPLATE.md +0 -71
- package/.github/dependabot.yml +0 -9
- package/.github/workflows/auto-publish.yml +0 -51
- package/.github/workflows/ci.yml +0 -263
- package/.github/workflows/codeql.yml +0 -38
- package/.github/workflows/npm-publish.yml +0 -20
- package/.github/workflows/pages.yml +0 -37
- package/.github/workflows/stale.yml +0 -54
- package/.publish-tick +0 -1
- package/.well-known/ai-plugin.json +0 -16
- package/AGENT_COUNCIL_FINDINGS.md +0 -142
- package/ARCHITECTURE.md +0 -346
- package/AUDIT_REPORT.md +0 -28
- package/CODE_OF_CONDUCT.md +0 -128
- package/CONTRIBUTING.md +0 -50
- package/CONTRIBUTORS.md +0 -20
- package/Dockerfile +0 -53
- package/Dockerfile.proxy +0 -33
- package/HEALTH_REPORT.md +0 -118
- package/IMPROVEMENT_PLAN.md +0 -107
- package/LANDING.md +0 -43
- package/LAUNCH-PAIN-DRIVEN.md +0 -339
- package/LAUNCH.md +0 -337
- package/LAUNCH_CHECKLIST.md +0 -141
- package/LAUNCH_SNAPSHOT.md +0 -260
- package/MANIFESTO.md +0 -41
- package/POPULARITY_BOOSTERS.md +0 -285
- package/PR_STATUS_REPORT.md +0 -148
- package/REDESIGN.md +0 -95
- package/RUNKIT.md +0 -83
- package/SECURITY.md +0 -29
- package/SUBMISSIONS.md +0 -43
- package/_schema.html +0 -53
- package/ai-plugin.json +0 -16
- package/articles/AI_AGENT_LLM_ROUTING.md +0 -150
- package/articles/CHINESE_DIRECTORIES.md +0 -100
- package/articles/CHINESE_SUBMISSIONS_READY.md +0 -322
- package/articles/COMPETITOR_ALERTS.md +0 -31
- package/articles/COMPLETE_POSTING_DIRECTORY.md +0 -147
- package/articles/CONTENT_STRUCTURE.md +0 -292
- package/articles/DEVTO_COST_GUIDE.md +0 -473
- package/articles/DEVTO_FINAL.md +0 -416
- package/articles/DEVTO_MULTI_PROVIDER.md +0 -542
- package/articles/DEVTO_READY.md +0 -255
- package/articles/DEVTO_V2_ANNOUNCEMENT.md +0 -160
- package/articles/DEVTO_VIRAL_GROWTH.md +0 -280
- package/articles/FRESH_devto.md +0 -460
- package/articles/FRESH_devto_2026_05.md +0 -73
- package/articles/FRESH_hackernews.md +0 -14
- package/articles/FRESH_reddit_ml.md +0 -90
- package/articles/FRESH_reddit_node.md +0 -198
- package/articles/FRESH_reddit_sideproject.md +0 -72
- package/articles/FRESH_reddit_webdev.md +0 -130
- package/articles/FROM_ZERO_TO_10K.md +0 -107
- package/articles/HN_10X_BETTER.md +0 -430
- package/articles/HN_ACCOUNT_GUIDE.md +0 -21
- package/articles/HN_CHINESE_STYLE.md +0 -308
- package/articles/HN_FINAL.md +0 -148
- package/articles/HN_POSTED_VERSION.md +0 -56
- package/articles/HN_POST_READY.md +0 -137
- package/articles/HN_RESEARCH.md +0 -364
- package/articles/HN_SHOW_routerarena.md +0 -17
- package/articles/HN_TIMING_GUIDE.md +0 -52
- package/articles/INDIEHACKERS_POST.md +0 -52
- package/articles/INDIEHACKERS_READY.md +0 -120
- package/articles/LLM_BENCHMARK_DEEP_DIVE.md +0 -153
- package/articles/MASTER_POSTING_DIRECTORY.md +0 -189
- package/articles/NEWSLETTER_SEND_NOW.md +0 -259
- package/articles/NEWSLETTER_SUBMISSIONS.md +0 -112
- package/articles/PAIN-DRIVEN-devto-v2.md +0 -308
- package/articles/PAIN-DRIVEN-devto-v3.md +0 -268
- package/articles/PAIN-DRIVEN-devto.md +0 -242
- package/articles/PAIN-DRIVEN-hackernews-v2.md +0 -138
- package/articles/PAIN-DRIVEN-hackernews-v3.md +0 -151
- package/articles/PAIN-DRIVEN-hackernews.md +0 -131
- package/articles/PAIN-DRIVEN-reddit-v2.md +0 -301
- package/articles/PAIN-DRIVEN-reddit-v3.md +0 -236
- package/articles/PAIN-DRIVEN-reddit.md +0 -218
- package/articles/PAIN-DRIVEN-twitter-v2.md +0 -110
- package/articles/PAIN-DRIVEN-twitter-v3.md +0 -121
- package/articles/PAIN-DRIVEN-twitter.md +0 -120
- package/articles/PORTKEY_VS_A3M.md +0 -147
- package/articles/POSTING_KIT_2026_05.md +0 -67
- package/articles/PRESS_KIT_routerarena.md +0 -77
- package/articles/PRODUCTHUNT_LISTING.md +0 -48
- package/articles/PRODUCTHUNT_READY.md +0 -106
- package/articles/PR_PLAN_vault.md +0 -125
- package/articles/REDDIT_FINAL.md +0 -232
- package/articles/REDDIT_POST.md +0 -67
- package/articles/REDDIT_SUBMISSION_READY.md +0 -348
- package/articles/ROUTERARENA_LEADER.md +0 -45
- package/articles/SHOW_HN_FINAL.md +0 -29
- package/articles/TWEETS_10K_DOWNLOADS.md +0 -47
- package/articles/TWEETS_BENCHMARK_FIRST.md +0 -46
- package/articles/TWEETS_MCP_PLAY.md +0 -51
- package/articles/TWEETS_SEQUENTIAL_BROKEN.md +0 -49
- package/articles/TWEETS_WHY_BUILD.md +0 -54
- package/articles/TWEETS_routerarena_leader.md +0 -53
- package/articles/TWEET_STORM_READY.md +0 -165
- package/articles/TWITTER_FINAL.md +0 -167
- package/articles/WHY_10X_BETTER.md +0 -261
- package/articles/WHY_CHINESE_STYLE_BETTER.md +0 -323
- package/articles/ai-discoverability-llm-routing.md +0 -210
- package/articles/devto-llm-routing.md +0 -138
- package/articles/hackernews-show-hn.md +0 -54
- package/articles/hashnode-llm-cost-optimization.md +0 -125
- package/articles/hn_show_2026_05.md +0 -11
- package/articles/medium-building-llm-router.md +0 -205
- package/articles/reddit-ml.md +0 -76
- package/articles/twitter-thread-cost-savings.md +0 -50
- package/articles/youtube-tutorial-script.md +0 -262
- package/assets/a3m_3blue1brown.mp4 +0 -0
- package/assets/banner.svg +0 -109
- package/assets/chart-cost-v2.svg +0 -91
- package/assets/chart-cost-v3.svg +0 -143
- package/assets/chart-features-v2.svg +0 -132
- package/assets/chart-features-v3.svg +0 -211
- package/assets/chart-growth-v2.svg +0 -122
- package/assets/chart-growth-v3.svg +0 -189
- package/assets/cost-comparison.svg +0 -134
- package/assets/cost-simple.svg +0 -64
- package/assets/demo-hn.gif +0 -0
- package/assets/feature-matrix.svg +0 -136
- package/assets/growth-chart-animated.svg +0 -76
- package/assets/growth-chart.svg +0 -82
- package/assets/growth-simple.svg +0 -69
- package/assets/hero-diagram.svg +0 -81
- package/assets/logo-new.svg +0 -21
- package/assets/logo.svg +0 -68
- package/assets/provider-comparison.svg +0 -121
- package/assets/social-preview-new.svg +0 -100
- package/assets/social-preview.svg +0 -194
- package/assets/social-v2.svg +0 -130
- package/assets/social-v3.svg +0 -212
- package/benchmark-provider-results.json +0 -245
- package/benchmark-results.json +0 -54
- package/council-votes/architecture-vote.md +0 -121
- package/council-votes/coverage-vote.md +0 -93
- package/data/adaptive-benchmark.json +0 -92
- package/data/benchmark-results.json +0 -47
- package/data/labeled-benchmark.json +0 -88
- package/demo/3blue1brown_video.py +0 -285
- package/demo/3blue1brown_video_v2.py +0 -310
- package/demo/IMPROVED_PROMPTS.md +0 -229
- package/demo/VEO3_PROMPTS.md +0 -269
- package/demo/VIDEO_PRODUCTION_GUIDE.md +0 -333
- package/demo/a3m_3blue1brown.mp4 +0 -0
- package/demo/asciinema-demo.sh +0 -195
- package/demo/demo-hn.tape +0 -74
- package/demo/demo-script.md +0 -53
- package/demo/demo-script.sh +0 -62
- package/demo/demo.svg +0 -75
- package/demo/frame1_ai_data_center.png +0 -0
- package/demo/frame1_sunset_video.mp4 +0 -0
- package/demo/frame2_cost_comparison.png +0 -0
- package/demo/frame2_cost_comparison_fallback.png +0 -0
- package/demo/frame3_parallel_execution.png +0 -0
- package/demo/frame3_parallel_execution_fallback.png +0 -0
- package/demo/frame4_providers.png +0 -0
- package/demo/frame4_providers_fallback.png +0 -0
- package/demo/frame5_endcard.png +0 -0
- package/demo/frame5_endcard_fallback.png +0 -0
- package/demo/new_frame1_hook.png +0 -0
- package/demo/new_frame2_proof.png +0 -0
- package/demo/new_frame3_wow.png +0 -0
- package/demo/new_frame4_social.png +0 -0
- package/demo/new_frame5_cta.png +0 -0
- package/demo/package.json +0 -13
- package/demo/product-video-final.mp4 +0 -0
- package/demo/product-video-hype-v1.mp4 +0 -0
- package/demo/product-video-v1.mp4 +0 -0
- package/demo/public/index.html +0 -762
- package/demo/recording.cast +0 -55
- package/demo/server.js +0 -405
- package/demo-new.tape +0 -71
- package/demo-real.sh +0 -198
- package/demo-simple.tape +0 -205
- package/demo.html +0 -520
- package/demo.sh +0 -85
- package/demo.tape +0 -259
- package/dist/analytics/costAnalytics.d.ts.map +0 -1
- package/dist/analytics/costAnalytics.js.map +0 -1
- package/dist/benchmark/comprehensive.js.map +0 -1
- package/dist/benchmark/reproducible.d.ts.map +0 -1
- package/dist/benchmark/reproducible.js.map +0 -1
- package/dist/cache/prefixCache.d.ts.map +0 -1
- package/dist/cache/prefixCache.js.map +0 -1
- package/dist/cache/responseCache.d.ts.map +0 -1
- package/dist/cache/responseCache.js.map +0 -1
- package/dist/cache/semanticCache.d.ts.map +0 -1
- package/dist/cache/semanticCache.js.map +0 -1
- package/dist/cli/setupWizard.d.ts.map +0 -1
- package/dist/cli/setupWizard.js.map +0 -1
- package/dist/cost/budgetEnforcer.d.ts.map +0 -1
- package/dist/cost/budgetEnforcer.js.map +0 -1
- package/dist/cost/costTracker.d.ts.map +0 -1
- package/dist/cost/costTracker.js.map +0 -1
- package/dist/ensemble/multiRoundDialog.js.map +0 -1
- package/dist/ensemble/shapleyValue.js.map +0 -1
- package/dist/integrations/langchainAdapter.d.ts.map +0 -1
- package/dist/integrations/langchainAdapter.js.map +0 -1
- package/dist/integrations/oauth.d.ts.map +0 -1
- package/dist/integrations/oauth.js.map +0 -1
- package/dist/integrations/scienceAdapter.js.map +0 -1
- package/dist/memory/autoFetch.d.ts.map +0 -1
- package/dist/memory/autoFetch.js.map +0 -1
- package/dist/memory/episodicMemory.d.ts.map +0 -1
- package/dist/memory/episodicMemory.js.map +0 -1
- package/dist/memory/hybridMemory.js.map +0 -1
- package/dist/memory/memoryTree.d.ts.map +0 -1
- package/dist/memory/memoryTree.js.map +0 -1
- package/dist/memory/obsidianVault.d.ts.map +0 -1
- package/dist/memory/obsidianVault.js.map +0 -1
- package/dist/memory/reasoningBank.js.map +0 -1
- package/dist/observability/changeWatch.d.ts.map +0 -1
- package/dist/observability/changeWatch.js.map +0 -1
- package/dist/observability/fatigueDetector.d.ts.map +0 -1
- package/dist/observability/fatigueDetector.js.map +0 -1
- package/dist/observability/index.d.ts.map +0 -1
- package/dist/observability/index.js.map +0 -1
- package/dist/observability/metrics.d.ts.map +0 -1
- package/dist/observability/metrics.js.map +0 -1
- package/dist/observability/middleware.d.ts.map +0 -1
- package/dist/observability/middleware.js.map +0 -1
- package/dist/observability/tracer.d.ts.map +0 -1
- package/dist/observability/tracer.js.map +0 -1
- package/dist/observability/types.d.ts.map +0 -1
- package/dist/observability/types.js.map +0 -1
- package/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
- package/dist/orchestration/haloOrchestrator.js.map +0 -1
- package/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
- package/dist/orchestration/mctsWorkflow.js.map +0 -1
- package/dist/providers/localProvider.d.ts.map +0 -1
- package/dist/providers/localProvider.js.map +0 -1
- package/dist/providers/providerConfig.d.ts.map +0 -1
- package/dist/providers/providerConfig.js.map +0 -1
- package/dist/providers/registry.d.ts.map +0 -1
- package/dist/providers/registry.js.map +0 -1
- package/dist/routing/advancedRouter.d.ts.map +0 -1
- package/dist/routing/advancedRouter.js.map +0 -1
- package/dist/routing/crossModelValidation.d.ts.map +0 -1
- package/dist/routing/crossModelValidation.js.map +0 -1
- package/dist/routing/providerHealth.d.ts.map +0 -1
- package/dist/routing/providerHealth.js.map +0 -1
- package/dist/routing/providerRetry.d.ts.map +0 -1
- package/dist/routing/providerRetry.js.map +0 -1
- package/dist/scripts/banner.js +0 -29
- package/dist/security/guardrails.d.ts.map +0 -1
- package/dist/security/guardrails.js.map +0 -1
- package/dist/server/dashboard.d.ts.map +0 -1
- package/dist/server/dashboard.js.map +0 -1
- package/dist/server/modelMapper.d.ts.map +0 -1
- package/dist/server/modelMapper.js.map +0 -1
- package/dist/server/proxyServer.d.ts.map +0 -1
- package/dist/server/proxyServer.js.map +0 -1
- package/dist/skills/__tests__/skill_manager.test.d.ts +0 -2
- package/dist/skills/__tests__/skill_manager.test.d.ts.map +0 -1
- package/dist/skills/__tests__/skill_manager.test.js +0 -268
- package/dist/skills/__tests__/skill_manager.test.js.map +0 -1
- package/dist/tools/tmlpdTools.d.ts.map +0 -1
- package/dist/tools/tmlpdTools.js.map +0 -1
- package/dist/tui/dashboard.d.ts.map +0 -1
- package/dist/tui/dashboard.js.map +0 -1
- package/dist/tui/index.d.ts.map +0 -1
- package/dist/tui/index.js.map +0 -1
- package/dist/utils/batchProcessor.d.ts.map +0 -1
- package/dist/utils/batchProcessor.js.map +0 -1
- package/dist/utils/compression.d.ts.map +0 -1
- package/dist/utils/compression.js.map +0 -1
- package/dist/utils/costUtils.d.ts.map +0 -1
- package/dist/utils/costUtils.js.map +0 -1
- package/dist/utils/reliability.d.ts.map +0 -1
- package/dist/utils/reliability.js.map +0 -1
- package/dist/utils/sorting.d.ts.map +0 -1
- package/dist/utils/sorting.js.map +0 -1
- package/dist/utils/speculativeDecoding.d.ts.map +0 -1
- package/dist/utils/speculativeDecoding.js.map +0 -1
- package/dist/utils/tokenUtils.d.ts.map +0 -1
- package/dist/utils/tokenUtils.js.map +0 -1
- package/docs/.nojekyll +0 -0
- package/docs/ANALYSIS_PRINCIPLES.md +0 -162
- package/docs/API.md +0 -855
- package/docs/ARCHITECTURAL-IMPROVEMENTS-2025.md +0 -1391
- package/docs/ARCHITECTURAL-IMPROVEMENTS-REVISED-2025.md +0 -1051
- package/docs/BENCHMARK.md +0 -170
- package/docs/CHINESE_PROVIDER_RELIABILITY.md +0 -37
- package/docs/CITATIONS.md +0 -74
- package/docs/CLAIMS_AND_EVIDENCE.md +0 -58
- package/docs/CONFIGURATION.md +0 -476
- package/docs/COUNCIL_DECISION.json +0 -816
- package/docs/COUNCIL_SUMMARY.md +0 -319
- package/docs/COUNCIL_V2.2_DECISION.md +0 -416
- package/docs/ENGINEERING_SPEC.md +0 -55
- package/docs/FACTORY_RESET.md +0 -34
- package/docs/GEO.md +0 -66
- package/docs/GEO_OPTIMIZATION.md +0 -30
- package/docs/GEO_ROOT_CAUSE.md +0 -136
- package/docs/GEO_STATUS.md +0 -85
- package/docs/GEO_TEST_RESULTS.md +0 -176
- package/docs/HN_CHECKLIST.md +0 -38
- package/docs/HN_FOUNDER_COMMENT.md +0 -17
- package/docs/HN_SUBMISSION_FINAL.md +0 -180
- package/docs/HN_SUBMISSION_V3.md +0 -56
- package/docs/IMPROVEMENT_ROADMAP.md +0 -515
- package/docs/INTEGRATIONS.md +0 -420
- package/docs/LANGCHAIN_INTEGRATION.md +0 -147
- package/docs/LLM_COUNCIL_DECISION.md +0 -508
- package/docs/MIDDLEWARE_CHAIN.md +0 -35
- package/docs/PROMO_CHECKLIST.md +0 -200
- package/docs/QUICKSTART.md +0 -271
- package/docs/QUICK_START.md +0 -43
- package/docs/QUICK_START_VISIBILITY.md +0 -782
- package/docs/REDDIT_GAP_ANALYSIS.md +0 -299
- package/docs/RELEASE_CHECKLIST.md +0 -32
- package/docs/REPRODUCIBILITY.md +0 -63
- package/docs/RESEARCH_BACKED_IMPROVEMENTS.md +0 -1180
- package/docs/ROUTING_RUBRIC.md +0 -197
- package/docs/SEO_AUDIT.md +0 -186
- package/docs/SOCIAL_LISTENING.md +0 -219
- package/docs/TMLPD_QNA.md +0 -751
- package/docs/TMLPD_V2.1_COMPLETE.md +0 -763
- package/docs/TMLPD_V2.2_RESEARCH_ROADMAP.md +0 -754
- package/docs/UPDATE_TOPICS.md +0 -15
- package/docs/USE_CASES.md +0 -59
- package/docs/V2.2_IMPLEMENTATION_COMPLETE.md +0 -446
- package/docs/V2_IMPLEMENTATION_GUIDE.md +0 -388
- package/docs/VERCEL_AI_SDK.md +0 -209
- package/docs/VISIBILITY_ADOPTION_PLAN.md +0 -1005
- package/docs/_config.yml +0 -49
- package/docs/ai-plugin.json +0 -16
- package/docs/api.html +0 -513
- package/docs/architecture-diagram.md +0 -40
- package/docs/benchmark-chart.png +0 -0
- package/docs/benchmark.html +0 -387
- package/docs/blog/routerarena-number-one.html +0 -73
- package/docs/cli-cheatsheet.md +0 -339
- package/docs/compare.md +0 -109
- package/docs/comparison-litellm.md +0 -88
- package/docs/comparison.md +0 -108
- package/docs/cost-chart-ascii.md +0 -42
- package/docs/cost-comparison-chart.svg +0 -88
- package/docs/curl-examples.md +0 -247
- package/docs/demo-auto.html +0 -264
- package/docs/demo.html +0 -416
- package/docs/geo/GENERATIVE_ENGINE_OPTIMIZATION.md +0 -232
- package/docs/index.html +0 -507
- package/docs/launch-content/LAUNCH_EXECUTION_CHECKLIST.md +0 -421
- package/docs/launch-content/README.md +0 -457
- package/docs/launch-content/assets/cost_comparison_100_tasks.png +0 -0
- package/docs/launch-content/assets/cumulative_savings.png +0 -0
- package/docs/launch-content/assets/parallel_speedup.png +0 -0
- package/docs/launch-content/assets/provider_pricing_comparison.png +0 -0
- package/docs/launch-content/assets/task_breakdown_comparison.png +0 -0
- package/docs/launch-content/generate_charts.py +0 -313
- package/docs/launch-content/hn_show_post.md +0 -139
- package/docs/launch-content/partner_outreach_templates.md +0 -745
- package/docs/launch-content/reddit_posts.md +0 -467
- package/docs/launch-content/twitter_thread.txt +0 -460
- package/docs/npm-downloads-chart.svg +0 -43
- package/docs/openapi.json +0 -139
- package/docs/openapi.yaml +0 -1318
- package/docs/quick-start.html +0 -366
- package/docs/robots.txt +0 -52
- package/docs/sitemap.xml +0 -57
- package/docs/styles.css +0 -682
- package/docs/well-known/ai-plugin.json +0 -16
- package/docs/wellknown/ai-plugin.json +0 -16
- package/docs-site/assets/og-banner.svg +0 -194
- package/docs-site/index.html +0 -632
- package/eval/README.md +0 -46
- package/eval/baselines/main.json +0 -12
- package/eval/benchmark_dataset.jsonl +0 -16
- package/eval/check_golden_routes.js +0 -64
- package/eval/datasets/catalog.json +0 -33
- package/eval/datasets/slices/cn_provider_reliability_v1.jsonl +0 -3
- package/eval/datasets/slices/cost_pressure_v1.jsonl +0 -3
- package/eval/datasets/slices/safety_guardrails_v1.jsonl +0 -3
- package/eval/evals.json +0 -199
- package/eval/fault_injection_thresholds.json +0 -3
- package/eval/generate_report.js +0 -128
- package/eval/golden_routes.json +0 -114
- package/eval/lib/experiment_registry.js +0 -24
- package/eval/run_eval.js +0 -197
- package/eval/run_fault_injection.js +0 -201
- package/eval/run_shadow_eval.js +0 -85
- package/eval/thresholds.json +0 -9
- package/examples/QUICKSTART.md +0 -183
- package/examples/README.md +0 -61
- package/examples/a3m-sdk.js +0 -124
- package/examples/basic-route.js +0 -54
- package/examples/chat-loop.js +0 -202
- package/examples/classify-then-route.js +0 -102
- package/examples/cost-compare.js +0 -120
- package/examples/ensemble.js +0 -160
- package/examples/whatsapp-telegram-bridge-demo.js +0 -302
- package/examples/whatsapp-telegram-bridge.js +0 -269
- package/hf-space/README.md +0 -23
- package/hf-space/app.py +0 -240
- package/hf-space/requirements.txt +0 -1
- package/huggingface_space/README.md +0 -35
- package/huggingface_space/app.py +0 -126
- package/huggingface_space/create_space.py +0 -208
- package/huggingface_space/requirements.txt +0 -1
- package/mcp-server/README.md +0 -188
- package/mcp-server/package.json +0 -29
- package/mcp-server/src/index.ts +0 -744
- package/mcp-server/tsconfig.json +0 -19
- package/openclaw-alexa-bridge/ALL_REMAINING_FIXES_PLAN.md +0 -313
- package/openclaw-alexa-bridge/REMAINING_FIXES_SUMMARY.md +0 -277
- package/openclaw-alexa-bridge/src/alexa_handler_no_tmlpd.js +0 -1234
- package/openclaw-alexa-bridge/test_fixes.js +0 -77
- package/playground/README.md +0 -51
- package/playground/codesandbox.json +0 -12
- package/playground/index.js +0 -39
- package/proxy/README.md +0 -227
- package/proxy/package-lock.json +0 -831
- package/proxy/package.json +0 -17
- package/proxy/rate-limit.js +0 -145
- package/proxy/rate-limit.test.js +0 -311
- package/proxy/server.js +0 -970
- package/python/README.md +0 -102
- package/python/a3m/__init__.py +0 -6
- package/python/a3m/client.py +0 -190
- package/python/a3m/models.py +0 -40
- package/python/a3m/sync_client.py +0 -61
- package/python/examples.py +0 -53
- package/python/integrations.py +0 -330
- package/python/pyproject.toml +0 -23
- package/python/setup.py +0 -28
- package/python/tmlpd.py +0 -369
- package/qna/REDDIT_GAP_ANALYSIS.md +0 -299
- package/qna/TMLPD_QNA.md +0 -751
- package/research/FINDING_001_safety.md +0 -28
- package/research/FINDING_002_error_diversity.md +0 -32
- package/research/FINDING_003_confidence_weighted_voting.md +0 -32
- package/research/FINDING_004_cross_model_semantic_detection.md +0 -37
- package/research/FINDING_005_knowledge_gap_orthogonality.md +0 -34
- package/research/HALLUCINATION_RESEARCH.md +0 -27
- package/research/ensemble-voting.md +0 -324
- package/research/loss-functions.md +0 -545
- package/research-log.md +0 -49
- package/scripts/banner.js +0 -29
- package/scripts/benchmark-local-routerarena.ts +0 -176
- package/scripts/benchmark.js +0 -145
- package/scripts/benchmark.sh +0 -61
- package/scripts/compare-providers.sh +0 -230
- package/scripts/content-planner.js +0 -25
- package/scripts/create-labeled-benchmark.ts +0 -105
- package/scripts/cross_post.py +0 -443
- package/scripts/local-router-benchmark.ts +0 -154
- package/scripts/post-all.sh +0 -41
- package/scripts/publish_fcc.py +0 -106
- package/scripts/push-to-gitee.sh +0 -25
- package/scripts/routerarena_ensemble.js +0 -144
- package/scripts/routing-benchmark-v2.js +0 -373
- package/scripts/routing-benchmark-v3.js +0 -118
- package/scripts/routing-benchmark.js +0 -462
- package/scripts/run-labeled-benchmark.mjs +0 -104
- package/scripts/run-mmlu-benchmark.js +0 -176
- package/scripts/run-provider-benchmark.js +0 -244
- package/scripts/update-npm-badges.js +0 -158
- package/skill/SKILL.md +0 -238
- package/src/__tests__/integration/tmpld_integration.test.py +0 -540
- package/src/skills/__tests__/skill_manager.test.ts +0 -328
- package/submissions/benchmarks/ALL_PLATFORMS_SUBMISSION.md +0 -94
- package/submissions/benchmarks/LLMROUTERBENCH_SUBMISSION.md +0 -121
- package/submissions/benchmarks/MMRBENCH_SUBMISSION.md +0 -94
- package/submissions/benchmarks/ROUTERARENA_UPDATE.md +0 -83
- package/submissions/benchmarks/ROUTERBENCH_SUBMISSION.md +0 -225
- package/test-council/1-structure-tests.test.js +0 -353
- package/test-council/1-structure-tests.test.ts +0 -353
- package/test-council/2-edge-case-tests.test.ts +0 -361
- package/test-council/3-performance-tests.test.ts +0 -669
- package/test-council/4-integration-tests.test.ts +0 -391
- package/test-council/5-agent-council-eval.test.ts +0 -413
- package/test-council/AGENT_COUNCIL_ARCHITECTURE.md +0 -349
- package/test-council/TEST_COUNCIL_REPORT.md +0 -201
- package/test-council/agents/edge-case-agent.ts +0 -363
- package/test-council/agents/performance-agent.ts +0 -426
- package/test-council/agents/structure-agent.ts +0 -227
- package/test-council/council.md +0 -183
- package/tests/__mocks__/tokenUtils.ts +0 -8
- package/tests/memory/episodicMemory.test.ts +0 -227
- package/tests/package-lock.json +0 -1628
- package/tests/package.json +0 -18
- package/tests/routing/ensembleVoting.test.ts +0 -236
- package/tests/routing/providerRetry.test.ts +0 -360
- package/tests/routing/queryTypePresets.test.ts +0 -208
- package/tests/security/guardrailEngine.test.ts +0 -700
- package/tests/tsconfig.json +0 -21
- package/tests/vitest.config.ts +0 -18
- package/tmlpd-pi-extension/README.md +0 -66
- package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts +0 -114
- package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/cache/prefixCache.js +0 -285
- package/tmlpd-pi-extension/dist/cache/prefixCache.js.map +0 -1
- package/tmlpd-pi-extension/dist/cache/responseCache.d.ts +0 -58
- package/tmlpd-pi-extension/dist/cache/responseCache.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/cache/responseCache.js +0 -153
- package/tmlpd-pi-extension/dist/cache/responseCache.js.map +0 -1
- package/tmlpd-pi-extension/dist/cli.js +0 -59
- package/tmlpd-pi-extension/dist/cost/costTracker.d.ts +0 -95
- package/tmlpd-pi-extension/dist/cost/costTracker.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/cost/costTracker.js +0 -240
- package/tmlpd-pi-extension/dist/cost/costTracker.js.map +0 -1
- package/tmlpd-pi-extension/dist/index.d.ts +0 -723
- package/tmlpd-pi-extension/dist/index.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/index.js +0 -239
- package/tmlpd-pi-extension/dist/index.js.map +0 -1
- package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts +0 -82
- package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/memory/episodicMemory.js +0 -145
- package/tmlpd-pi-extension/dist/memory/episodicMemory.js.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts +0 -102
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js +0 -207
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts +0 -85
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js +0 -210
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js.map +0 -1
- package/tmlpd-pi-extension/dist/providers/localProvider.d.ts +0 -102
- package/tmlpd-pi-extension/dist/providers/localProvider.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/providers/localProvider.js +0 -338
- package/tmlpd-pi-extension/dist/providers/localProvider.js.map +0 -1
- package/tmlpd-pi-extension/dist/providers/registry.d.ts +0 -55
- package/tmlpd-pi-extension/dist/providers/registry.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/providers/registry.js +0 -138
- package/tmlpd-pi-extension/dist/providers/registry.js.map +0 -1
- package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts +0 -68
- package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/routing/advancedRouter.js +0 -332
- package/tmlpd-pi-extension/dist/routing/advancedRouter.js.map +0 -1
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts +0 -101
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.js +0 -368
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts +0 -96
- package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/batchProcessor.js +0 -170
- package/tmlpd-pi-extension/dist/utils/batchProcessor.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/compression.d.ts +0 -61
- package/tmlpd-pi-extension/dist/utils/compression.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/compression.js +0 -281
- package/tmlpd-pi-extension/dist/utils/compression.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/reliability.d.ts +0 -74
- package/tmlpd-pi-extension/dist/utils/reliability.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/reliability.js +0 -177
- package/tmlpd-pi-extension/dist/utils/reliability.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts +0 -117
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js +0 -246
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts +0 -50
- package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/tokenUtils.js +0 -124
- package/tmlpd-pi-extension/dist/utils/tokenUtils.js.map +0 -1
- package/tmlpd-pi-extension/examples/QUICKSTART.md +0 -183
- package/tmlpd-pi-extension/package-lock.json +0 -79
- package/tmlpd-pi-extension/package.json +0 -172
- package/tmlpd-pi-extension/python/examples.py +0 -53
- package/tmlpd-pi-extension/python/integrations.py +0 -330
- package/tmlpd-pi-extension/python/setup.py +0 -28
- package/tmlpd-pi-extension/python/tmlpd.py +0 -369
- package/tmlpd-pi-extension/qna/REDDIT_GAP_ANALYSIS.md +0 -299
- package/tmlpd-pi-extension/qna/TMLPD_QNA.md +0 -751
- package/tmlpd-pi-extension/skill/SKILL.md +0 -238
- package/tmlpd-pi-extension/src/cache/responseCache.ts +0 -147
- package/tmlpd-pi-extension/src/cost/costTracker.ts +0 -302
- package/tmlpd-pi-extension/src/index.ts +0 -232
- package/tmlpd-pi-extension/src/memory/episodicMemory.ts +0 -257
- package/tmlpd-pi-extension/src/orchestration/haloOrchestrator.ts +0 -266
- package/tmlpd-pi-extension/src/orchestration/mctsWorkflow.ts +0 -262
- package/tmlpd-pi-extension/src/providers/localProvider.ts +0 -406
- package/tmlpd-pi-extension/src/providers/registry.ts +0 -164
- package/tmlpd-pi-extension/src/routing/ensembleVoting.ts +0 -159
- package/tmlpd-pi-extension/src/routing/queryTypePresets.ts +0 -136
- package/tmlpd-pi-extension/src/tools/tmlpdTools.ts +0 -433
- package/tmlpd-pi-extension/src/utils/batchProcessor.ts +0 -232
- package/tmlpd-pi-extension/src/utils/compression.ts +0 -325
- package/tmlpd-pi-extension/src/utils/reliability.ts +0 -221
- package/tmlpd-pi-extension/src/utils/tokenUtils.ts +0 -145
- package/tmlpd-pi-extension/tsconfig.json +0 -18
- package/tsconfig.build.json +0 -29
- package/tsconfig.json +0 -18
- package/{docs/llms-full.txt → llms-full.txt.bak} +0 -0
package/LAUNCH.md
DELETED
@@ -1,337 +0,0 @@
-# A3M ROUTER LAUNCH MANIFEST — 30x Efficiency Story
-
-## Package Information
-- **Name**: `adaptive-memory-multi-model-router`
-- **Version**: 2.0.7
-- **NPM**: https://www.npmjs.com/package/adaptive-memory-multi-model-router
-- **GitHub**: https://github.com/Das-rebel/a3m-router
-- **Core Claim**: 70.32 routing accuracy, zero ML. Matches RouteLLM (BERT-based) on RouterArena benchmark.
-
----
-
-## The 30x Story
-
-RouteLLM trains a BERT classifier on GPU. Gets 85% routing accuracy.
-A3M Router uses keyword matching in Node.js. Gets 70.32.
-
-97% of the accuracy. 3% of the compute. **30x more efficient.**
-
-Two LLM routers have published benchmarks: RouteLLM and us.
-LiteLLM (47K stars) publishes **zero**. Benchmark or GTFO.
-
----
-
|
|
24
|
-
## LAUNCH PLATFORMS
|
|
25
|
-
|
|
26
|
-
### 1. Hacker News (PRIORITY 1)
|
|
27
|
-
**URL**: https://news.ycombinator.com/submit
|
|
28
|
-
|
|
29
|
-
**Title**:
|
|
30
|
-
```
|
|
31
|
-
Show HN: A3M Router — 70.32 routing accuracy without ML. Matches RouteLLM (BERT-based) on RouterArena benchmark
|
|
32
|
-
```
|
|
33
|
-
|
|
34
|
-
**Text** (copy from `docs/HN_SUBMISSION_FINAL.md`):
|
|
35
|
-
```
|
|
36
|
-
RouteLLM (UC Berkeley) trains a BERT classifier on GPU for LLM query routing. Gets 85% accuracy ().
|
|
37
|
-
|
|
38
|
-
We use keyword matching in Node.js. Get 70.32.
|
|
39
|
-
|
|
40
|
-
97% of the accuracy. 3% of the compute. 30x more efficient.
|
|
41
|
-
|
|
42
|
-
There are exactly two LLM routers with published routing accuracy benchmarks: RouteLLM and us.
|
|
43
|
-
LiteLLM (47,000 GitHub stars) publishes zero accuracy data.
|
|
44
|
-
|
|
45
|
-
RouteLLM: 85% accuracy, PyTorch, CUDA, ~500MB BERT, ~3s cold start, GPU required
|
|
46
|
-
A3M Router: 70.32 accuracy, Node.js, 139 keywords, 0 bytes model, ~50ms cold start, any VPS
|
|
47
|
-
|
|
48
|
-
61.6% cost reduction. 40 providers. Semantic cache. Circuit breakers. 3MB install.
|
|
49
|
-
|
|
50
|
-
Growth (zero marketing):
|
|
51
|
-
Day 1: 552. Day 2: 320. Day 3: 1,903. 245% growth. $0 budget.
|
|
52
|
-
|
|
53
|
-
npm install adaptive-memory-multi-model-router
|
|
54
|
-
npx a3m-router serve
|
|
55
|
-
|
|
56
|
-
Point any OpenAI SDK at localhost:8787. Zero code changes.
|
|
57
|
-
|
|
58
|
-
The question: if keyword matching gets you 97% of BERT accuracy, is the GPU worth it?
|
|
59
|
-
|
|
60
|
-
Repo: https://github.com/Das-rebel/a3m-router
|
|
61
|
-
```
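As a rough illustration of the "point any OpenAI SDK at localhost:8787" claim, here is a minimal sketch of the request a client would send to the local proxy. Only the host and port come from the text. The `/v1/chat/completions` path (the usual OpenAI-style route) and the `"auto"` model name are assumptions, not confirmed behavior of the package.

```javascript
// ASSUMPTIONS: the proxy listens on localhost:8787 (from the text) and exposes
// an OpenAI-style /v1/chat/completions route. The "auto" model name is hypothetical.
const PROXY_BASE_URL = "http://localhost:8787/v1";

// Build (but do not send) a chat completion request aimed at the proxy.
function buildChatRequest(prompt, baseUrl = PROXY_BASE_URL) {
  return {
    url: `${baseUrl}/chat/completions`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({
        model: "auto",
        messages: [{ role: "user", content: prompt }],
      }),
    },
  };
}

const { url, init } = buildChatRequest("What is 2+2?");
console.log(url);
// To actually send it (Node 18+ ships a global fetch):
// const response = await fetch(url, init);
```

With an SDK, the equivalent step would be setting its base URL option to the proxy address instead of building the request by hand.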
|
|
62
|
-
|
|
63
|
-
**Best Time to Post**: Tuesday-Thursday, 8:30 AM EST
|
|
64
|
-
|
|
65
|
-
---
|
|
66
|
-
|
|
67
|
-
### 2. Twitter/X Thread (PRIORITY 1)
|
|
68
|
-
**URL**: https://twitter.com/compose/tweet
|
|
69
|
-
|
|
70
|
-
**Thread** (copy from `articles/twitter-thread-cost-savings.md`):
|
|
71
|
-
|
|
72
|
-
**T1/7**:
|
|
73
|
-
```
|
|
74
|
-
We matched a GPU-trained BERT router's accuracy with zero ML.
|
|
75
|
-
|
|
76
|
-
70.32 accuracy. No PyTorch. No GPU. No 500MB model.
|
|
77
|
-
|
|
78
|
-
RouteLLM (Berkeley) gets 85% with BERT. We get 70.32 with keyword matching.
|
|
79
|
-
|
|
80
|
-
That's 97% of the accuracy at 3% of the compute.
|
|
81
|
-
|
|
82
|
-
30x more efficient. Thread.
|
|
83
|
-
```
|
|
84
|
-
|
|
85
|
-
**T2/7**:
|
|
86
|
-
```
|
|
87
|
-
The only two LLM routers with published benchmarks:
|
|
88
|
-
|
|
89
|
-
RouteLLM: 85%, PyTorch + BERT + GPU + 500MB model
|
|
90
|
-
A3M Router: 70.32, Node.js + keywords + 0 bytes model
|
|
91
|
-
|
|
92
|
-
LiteLLM (47,000 GitHub stars): publishes ZERO routing accuracy data.
|
|
93
|
-
|
|
94
|
-
Benchmark or GTFO.
|
|
95
|
-
```
|
|
96
|
-
|
|
97
|
-
**T3/7**:
|
|
98
|
-
```
|
|
99
|
-
RouteLLM needs:
|
|
100
|
-
- Python + PyTorch + CUDA
|
|
101
|
-
- ~500MB BERT model download
|
|
102
|
-
- GPU for inference
|
|
103
|
-
- ~3s cold start
|
|
104
|
-
- ~2GB install
|
|
105
|
-
|
|
106
|
-
A3M Router needs:
|
|
107
|
-
- Node.js
|
|
108
|
-
- 3MB install
|
|
109
|
-
- No GPU
|
|
110
|
-
- 50ms cold start
|
|
111
|
-
|
|
112
|
-
2.5% accuracy difference. You decide if the GPU is worth it.
|
|
113
|
-
```
|
|
114
|
-
|
|
115
|
-
**T4/7**:
|
|
116
|
-
```
|
|
117
|
-
61.6% average cost reduction.
|
|
118
|
-
|
|
119
|
-
Before: everything goes to GPT-4 at $0.03/query
|
|
120
|
-
After: queries routed to cheapest capable provider
|
|
121
|
-
|
|
122
|
-
Simple Q&A: $0.03 -> $0.00 (free provider)
|
|
123
|
-
Code gen: $0.05 -> $0.0004 (Groq)
|
|
124
|
-
Complex reasoning: $0.03 -> $0.03 (stays premium)
|
|
125
|
-
|
|
126
|
-
Drop-in proxy. Point any OpenAI SDK at localhost:8787.
|
|
127
|
-
```
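The per-query prices in this tweet can be combined into a blended cost reduction once a traffic mix is chosen. The sketch below uses the three quoted price pairs. The traffic mix is a hypothetical assumption, so the result is illustrative only and is not expected to reproduce the 61.6% figure.

```javascript
// Per-query prices (USD) quoted in the tweet, before and after routing.
const prices = {
  simpleQA: { before: 0.03, after: 0.0 },
  codeGen: { before: 0.05, after: 0.0004 },
  complexReasoning: { before: 0.03, after: 0.03 },
};

// HYPOTHETICAL traffic mix (fractions summing to 1). Not from the source.
const mix = { simpleQA: 0.5, codeGen: 0.3, complexReasoning: 0.2 };

// Weighted average cost per query for the "before" or "after" column.
function blendedCost(priceTable, trafficMix, column) {
  return Object.keys(trafficMix).reduce(
    (sum, kind) => sum + trafficMix[kind] * priceTable[kind][column],
    0
  );
}

// Percentage saved by routing, relative to the unrouted cost.
function costReductionPercent(priceTable, trafficMix) {
  const before = blendedCost(priceTable, trafficMix, "before");
  const after = blendedCost(priceTable, trafficMix, "after");
  return ((before - after) / before) * 100;
}

const reduction = costReductionPercent(prices, mix);
console.log(`Blended cost reduction: ${reduction.toFixed(1)}%`);
```

Shifting the mix toward complex reasoning pulls the reduction down, since those queries stay on the premium provider.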
|
|
128
|
-
|
|
129
|
-
**T5/7**:
|
|
130
|
-
```
|
|
131
|
-
Day 1: 552 downloads
|
|
132
|
-
Day 2: 320 downloads
|
|
133
|
-
Day 3: 1,903 downloads
|
|
134
|
-
|
|
135
|
-
245% growth. Zero marketing budget. No blog post. No HN. No Twitter thread.
|
|
136
|
-
|
|
137
|
-
Just developers telling developers.
|
|
138
|
-
```
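The 245% figure appears to compare Day 1 with Day 3 and skips the Day 2 dip. A minimal sketch of that arithmetic, using only the three download counts quoted in the tweet (the function name is illustrative):

```javascript
// Percentage growth from a starting value to an ending value.
function percentGrowth(start, end) {
  if (start <= 0) throw new RangeError("start must be positive");
  return ((end - start) / start) * 100;
}

// Daily download counts quoted in the tweet.
const downloads = [552, 320, 1903];

const dayOneToThree = percentGrowth(downloads[0], downloads[2]);
const dayOneToTwo = percentGrowth(downloads[0], downloads[1]);

console.log(`Day 1 -> Day 3: ${dayOneToThree.toFixed(1)}%`);
console.log(`Day 1 -> Day 2: ${dayOneToTwo.toFixed(1)}%`);
```

Rounded to the nearest whole percent, the Day 1 to Day 3 change gives the quoted 245%, while Day 1 to Day 2 is a decline.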
|
|
139
|
-
|
|
140
|
-
**T6/7**:
|
|
141
|
-
```
|
|
142
|
-
const { createA3MRouter } = require('adaptive-memory-multi-model-router');
|
|
143
|
-
const router = createA3MRouter();
|
|
144
|
-
|
|
145
|
-
await router.route("What is 2+2?"); // -> free ($0.00)
|
|
146
|
-
await router.route("Write Python sort"); // -> Groq ($0.0004, 0.4s)
|
|
147
|
-
await router.route("Analyze legal contract"); // -> premium ($0.03)
|
|
148
|
-
|
|
149
|
-
40 providers. Semantic cache. Circuit breakers. 3MB.
|
|
150
|
-
```
|
|
151
|
-
|
|
152
|
-
**T7/7**:
|
|
153
|
-
```
|
|
154
|
-
npm install adaptive-memory-multi-model-router
|
|
155
|
-
|
|
156
|
-
GitHub: github.com/Das-rebel/a3m-router
|
|
157
|
-
|
|
158
|
-
70.32 accuracy. Zero ML. Zero GPU.
|
|
159
|
-
Matches BERT within 2.5%. 61.6% cost savings. 40 providers.
|
|
160
|
-
|
|
161
|
-
30x more efficient.
|
|
162
|
-
|
|
163
|
-
#LLM #AI #RouteLLM #BenchmarkOrGTFO #OpenSource #JavaScript
|
|
164
|
-
```
|
|
165
|
-
|
|
166
|
-
**Best Time to Post**: Tuesday-Thursday, 9am-12pm PST
|
|
167
|
-
|
|
168
|
-
---
|
|
169
|
-
|
|
170
|
-
### 3. Dev.to (PRIORITY 2)
|
|
171
|
-
**URL**: https://dev.to/new
|
|
172
|
-
|
|
173
|
-
**Title**: "How We Matched a GPU-Trained Router With Zero ML"
|
|
174
|
-
|
|
175
|
-
**Content**: Copy from `articles/devto-llm-routing.md`
|
|
176
|
-
|
|
177
|
-
**Tags**: `llm`, `ai`, `routing`, `javascript`, `benchmark`, `routellm`
|
|
178
|
-
|
|
179
|
-
---
|
|
180
|
-
|
|
181
|
-
### 4. Reddit r/MachineLearning (PRIORITY 2)
|
|
182
|
-
**URL**: https://www.reddit.com/r/MachineLearning/submit
|
|
183
|
-
|
|
184
|
-
**Title**: "[P] A3M Router achieves 70.32 routing accuracy with keyword matching — matches RouteLLM's BERT classifier (85%) without GPU"
|
|
185
|
-
|
|
186
|
-
**Content**: Copy from `articles/reddit-ml.md`
|
|
187
|
-
|
|
188
|
-
**Flair**: `Project`
|
|
189
|
-
|
|
190
|
-
---
|
|
191
|
-
|
|
192
|
-
### 5. Reddit r/javascript (PRIORITY 2)
|
|
193
|
-
**URL**: https://www.reddit.com/r/javascript/submit
|
|
194
|
-
|
|
195
|
-
**Title**: "A3M Router: LLM routing with 70.32 accuracy and zero ML — matches BERT within 2.5%"
|
|
196
|
-
|
|
197
|
-
**Content**:
|
|
198
|
-
```
|
|
199
|
-
Built an LLM router that gets 70.32 routing accuracy without any ML.
|
|
200
|
-
|
|
201
|
-
RouteLLM's GPU-trained BERT gets 85%. We get 70.32 with keyword matching.
|
|
202
|
-
|
|
203
|
-
The comparison:
|
|
204
|
-
- RouteLLM: PyTorch + GPU + 500MB model + 3s cold start
|
|
205
|
-
- A3M Router: Node.js + 3MB + 50ms cold start + no GPU
|
|
206
|
-
|
|
207
|
-
97% of the accuracy at 3% of the compute.
|
|
208
|
-
|
|
209
|
-
```javascript
|
|
210
|
-
const { createA3MRouter } = require('adaptive-memory-multi-model-router');
|
|
211
|
-
const router = createA3MRouter();
|
|
212
|
-
|
|
213
|
-
await router.route("What is 2+2?"); // -> free ($0.00)
|
|
214
|
-
await router.route("Write Python sort array"); // -> Groq ($0.0004)
|
|
215
|
-
await router.route("Analyze legal contract"); // -> premium ($0.03)
|
|
216
|
-
```
|
|
217
|
-
|
|
218
|
-
61.6% cost reduction. 40 providers. Drop-in OpenAI proxy at localhost:8787.
|
|
219
|
-
|
|
220
|
-
Growth: 552 -> 320 -> 1,903 downloads in 3 days. 245% growth. Zero marketing.
|
|
221
|
-
|
|
222
|
-
npm install adaptive-memory-multi-model-router
|
|
223
|
-
|
|
224
|
-
GitHub: https://github.com/Das-rebel/a3m-router
|
|
225
|
-
```
|
|
226
|
-
|
|
227
|
-
---
|
|
228
|
-
|
|
229
|
-
### 6. Reddit r/SideProject (PRIORITY 2)
|
|
230
|
-
**URL**: https://www.reddit.com/r/SideProject/submit
|
|
231
|
-
|
|
232
|
-
**Title**: "Built an LLM router with 70.32 accuracy and zero ML — matched a GPU-trained BERT model"
|
|
233
|
-
|
|
234
|
-
**Content**:
|
|
235
|
-
```
|
|
236
|
-
Side project: an LLM routing library that matches RouteLLM's GPU-trained BERT within 2.5% using only keyword matching.
|
|
237
|
-
|
|
238
|
-
70.32 accuracy. Zero ML. Zero GPU. 3MB install. Node.js.
|
|
239
|
-
|
|
240
|
-
RouteLLM needs PyTorch + CUDA + 500MB model + GPU.
|
|
241
|
-
We need Node.js + 3MB.
|
|
242
|
-
|
|
243
|
-
61.6% cost savings. 40 providers. Drop-in OpenAI proxy.
|
|
244
|
-
|
|
245
|
-
Growth: Day 1: 552, Day 2: 320, Day 3: 1,903 downloads. Zero marketing.
|
|
246
|
-
|
|
247
|
-
npm install adaptive-memory-multi-model-router
|
|
248
|
-
|
|
249
|
-
GitHub: https://github.com/Das-rebel/a3m-router
|
|
250
|
-
```
|
|
251
|
-
|
|
252
|
-
---
|
|
253
|
-
|
|
254
|
-
### 7. Product Hunt (PRIORITY 3 — Schedule for next week)
|
|
255
|
-
**URL**: https://www.producthunt.com/posts/new
|
|
256
|
-
|
|
257
|
-
**Title**: A3M Router
|
|
258
|
-
|
|
259
|
-
**Tagline**: 70.32 routing accuracy, zero ML — matches BERT, saves 61.6%
|
|
260
|
-
|
|
261
|
-
**Description**:
|
|
262
|
-
```
|
|
263
|
-
A3M Router routes LLM queries to the cheapest capable provider with 70.32 accuracy — matching RouteLLM's GPU-trained BERT (85%) without any ML.
|
|
264
|
-
|
|
265
|
-
Key Numbers:
|
|
266
|
-
- 70.32 routing accuracy
|
|
267
|
-
- 97% of RouteLLM's BERT accuracy at 3% of the compute
|
|
268
|
-
- 61.6% average cost savings
|
|
269
|
-
- 40 providers
|
|
270
|
-
- 3MB install, zero ML dependencies
|
|
271
|
-
- Drop-in OpenAI proxy (localhost:8787)
|
|
272
|
-
|
|
273
|
-
Benchmark or GTFO: We're one of only two LLM routers with published routing accuracy benchmarks. LiteLLM (47K stars) publishes none.
|
|
274
|
-
|
|
275
|
-
Try it:
|
|
276
|
-
npm install adaptive-memory-multi-model-router
|
|
277
|
-
npx a3m-router serve
|
|
278
|
-
|
|
279
|
-
GitHub: https://github.com/Das-rebel/a3m-router
|
|
280
|
-
```
|
|
281
|
-
|
|
282
|
-
**Topics**: Developer Tools, AI, API, Open Source, JavaScript
|
|
283
|
-
|
|
284
|
-
---
|
|
285
|
-
|
|
286
|
-
## LAUNCH CHECKLIST
|
|
287
|
-
|
|
288
|
-
### Pre-Launch
|
|
289
|
-
- [x] Package published to NPM
|
|
290
|
-
- [x] GitHub repo optimized
|
|
291
|
-
- [x] All articles rewritten with 30x efficiency story
|
|
292
|
-
- [x] Twitter thread ready (7 tweets, benchmark-first)
|
|
293
|
-
- [x] HN submission text ready
|
|
294
|
-
- [x] Pre-written HN responses ready
|
|
295
|
-
|
|
296
|
-
### Launch Day
|
|
297
|
-
- [ ] Post to Hacker News (benchmark comparison angle)
|
|
298
|
-
- [ ] Post Twitter thread
|
|
299
|
-
- [ ] Post to Reddit r/MachineLearning
|
|
300
|
-
- [ ] Post to Reddit r/javascript
|
|
301
|
-
|
|
302
|
-
### Launch Week
|
|
303
|
-
- [ ] Publish Dev.to article
|
|
304
|
-
- [ ] Post to r/SideProject
|
|
305
|
-
- [ ] Share in Discord communities
|
|
306
|
-
|
|
307
|
-
### Launch Month
|
|
308
|
-
- [ ] Schedule Product Hunt launch
|
|
309
|
-
- [ ] Create YouTube tutorial
|
|
310
|
-
- [ ] Reach out to newsletters (JavaScript Weekly, Node Weekly)
|
|
311
|
-
|
|
312
|
-
---
|
|
313
|
-
|
|
314
|
-
## TRACKING
|
|
315
|
-
|
|
316
|
-
### Metrics to Track
|
|
317
|
-
- [ ] NPM downloads (daily)
|
|
318
|
-
- [ ] GitHub stars
|
|
319
|
-
- [ ] HN upvotes and comments
|
|
320
|
-
- [ ] Twitter impressions
|
|
321
|
-
- [ ] Reddit upvotes
|
|
322
|
-
|
|
323
|
-
### Success Metrics (Week 1)
|
|
324
|
-
- [ ] 500+ GitHub stars
|
|
325
|
-
- [ ] 50+ HN upvotes
|
|
326
|
-
- [ ] 10k+ Twitter impressions
|
|
327
|
-
|
|
328
|
-
---
|
|
329
|
-
|
|
330
|
-
## SUPPORT
|
|
331
|
-
|
|
332
|
-
- GitHub Issues: https://github.com/Das-rebel/a3m-router/issues
|
|
333
|
-
- Email: Sdas22@gmail.com
|
|
334
|
-
|
|
335
|
-
---
|
|
336
|
-
|
|
337
|
-
**THE PITCH**: 70.32 accuracy. Zero ML. Zero GPU. 97% of RouteLLM's BERT at 3% of the compute. 61.6% cost savings. 40 providers. 3MB install. That's the 30x efficiency story. Benchmark or GTFO.
|
package/LAUNCH_CHECKLIST.md
DELETED
|
@@ -1,141 +0,0 @@
|
|
|
1
|
-
# A3M Router — Master Launch Checklist
|
|
2
|
-
|
|
3
|
-
**Project:** adaptive-memory-multi-model-router (A3M Router)
|
|
4
|
-
**npm:** https://www.npmjs.com/package/adaptive-memory-multi-model-router
|
|
5
|
-
**GitHub:** https://github.com/Das-rebel/a3m-router
|
|
6
|
-
**Demo:** https://asciinema.org/a/RpqOZM9tFMALYWvs
|
|
7
|
-
|
|
8
|
-
---
|
|
9
|
-
|
|
10
|
-
## Reddit Posts
|
|
11
|
-
|
|
12
|
-
- [ ] **r/LocalLLaMA** — https://www.reddit.com/r/LocalLLaMA/submit/
|
|
13
|
-
- Post: [R] I benchmarked 47 LLM providers against 12K+ real queries — the cost/speed/quality matrix
|
|
14
|
-
- File: `articles/REDDIT_SUBMISSION_READY.md`
|
|
15
|
-
- Pre-written comments ready in file
|
|
16
|
-
- Monitor for 2 hours after posting
|
|
17
|
-
|
|
18
|
-
- [ ] **r/MachineLearning** — https://www.reddit.com/r/MachineLearning/submit/
|
|
19
|
-
- Post: [P] A3M Router achieves 82.5% routing accuracy with keyword matching
|
|
20
|
-
- File: `articles/REDDIT_SUBMISSION_READY.md`
|
|
21
|
-
- Pre-written comments ready in file
|
|
22
|
-
|
|
23
|
-
- [ ] **r/SideProject** — https://www.reddit.com/r/SideProject/submit/
|
|
24
|
-
- Post: I built an LLM router that beats GPT-5 at 1/213th the cost
|
|
25
|
-
- File: `articles/REDDIT_SUBMISSION_READY.md`
|
|
26
|
-
- Pre-written comments ready in file
|
|
27
|
-
|
|
28
|
-
- [ ] **r/programming** — (24h after r/LocalLLaMA if engagement is positive)
|
|
29
|
-
- Repurpose r/LocalLLaMA post
|
|
30
|
-
|
|
31
|
-
---
|
|
32
|
-
|
|
33
|
-
## Newsletter Emails
|
|
34
|
-
|
|
35
|
-
- [ ] **Import AI** (jack@sequoiacap.com) — HIGHEST PRIORITY
|
|
36
|
-
- File: `articles/NEWSLETTER_SEND_NOW.md`
|
|
37
|
-
- Subject: A3M Router — #1 LLM routing benchmark, 213x cheaper than GPT-5
|
|
38
|
-
|
|
39
|
-
- [ ] **The Batch (Anthropic)** (press@anthropic.com)
|
|
40
|
-
- File: `articles/NEWSLETTER_SEND_NOW.md`
|
|
41
|
-
- Subject: [Tool] A3M Router — Open-source LLM routing, #1 on RouterArena
|
|
42
|
-
|
|
43
|
-
- [ ] **Lil'Log** (lilian@openai.com or Twitter DM @lilianweng)
|
|
44
|
-
- File: `articles/NEWSLETTER_SEND_NOW.md`
|
|
45
|
-
- Also try Twitter DM
|
|
46
|
-
|
|
47
|
-
- [ ] **DeepLearning.ai Newsletter**
|
|
48
|
-
- File: `articles/NEWSLETTER_SEND_NOW.md`
|
|
49
|
-
- Submit at https://www.deeplearning.ai/newsletter/
|
|
50
|
-
|
|
51
|
-
- [ ] **The Economist AI**
|
|
52
|
-
- File: `articles/NEWSLETTER_SEND_NOW.md`
|
|
53
|
-
- Submit at https://www.economist.com/newsletters/ai
|
|
54
|
-
|
|
55
|
-
- [ ] **OpenAI Newsletter**
|
|
56
|
-
- File: `articles/NEWSLETTER_SEND_NOW.md`
|
|
57
|
-
- Submit at https://openai.com/newsletter
|
|
58
|
-
|
|
59
|
-
---
|
|
60
|
-
|
|
61
|
-
## Twitter Thread
|
|
62
|
-
|
|
63
|
-
- [ ] **Tweet 1/10** — Base tweet: 3 LLM infrastructure problems
|
|
64
|
-
- [ ] **Tweet 2/10** — Dev quotes on Kimi K2.6 / cost savings
|
|
65
|
-
- [ ] **Tweet 3/10** — Every router does sequential fallback (the problem)
|
|
66
|
-
- [ ] **Tweet 4/10** — "Negligible overhead" — we published real numbers
|
|
67
|
-
- [ ] **Tweet 5/10** — The numbers since launch
|
|
68
|
-
- [ ] **Tweet 6/10** — Installation command
|
|
69
|
-
- [ ] **Tweet 7/10** — GitHub + benchmarks link
|
|
70
|
-
- [ ] **Tweet 8/10** — Routing algorithm in one slide
|
|
71
|
-
- [ ] **Tweet 9/10** — Real routing examples
|
|
72
|
-
- [ ] **Tweet 10/10** — Demo + final CTA + hashtags
|
|
73
|
-
- [ ] Pin thread after posting
|
|
74
|
-
- [ ] Engage with quote tweets and replies for 2 hours
|
|
75
|
-
|
|
76
|
-
**File:** `articles/TWEET_STORM_READY.md`
|
|
77
|
-
|
|
78
|
-
---
|
|
79
|
-
|
|
80
|
-
## Chinese Directories
|
|
81
|
-
|
|
82
|
-
Priority order for submission:
|
|
83
|
-
|
|
84
|
-
- [ ] **掘金AI** (juejin) — https://ai.juejin.cn — Most dev traffic, HIGHEST PRIORITY
|
|
85
|
-
- [ ] **CSDN** — https://www.csdn.net — Huge Chinese dev community
|
|
86
|
-
- [ ] **OSChina (开源中国)** — https://www.oschina.net — Open-source community
|
|
87
|
-
- [ ] **思否AI** — https://segmentfault.com/ai — Developer Q&A
|
|
88
|
-
- [ ] **未来百科** — https://nav.6ai.cn — AI directory
|
|
89
|
-
- [ ] **AI工具集** — https://www.aigc.cn — AI tools directory
|
|
90
|
-
- [ ] **知乎AI** — https://www.zhihu.com/topic/ai — Write article in Chinese
|
|
91
|
-
- [ ] **InfoQ中文** — https://www.infoq.cn — Tech media
|
|
92
|
-
- [ ] **机器之心** — https://www.jiqizhixin.com — AI media
|
|
93
|
-
|
|
94
|
-
**File:** `articles/CHINESE_SUBMISSIONS_READY.md`
|
|
95
|
-
**Note:** Register accounts first (some require Chinese phone number)
|
|
96
|
-
|
|
97
|
-
---
|
|
98
|
-
|
|
99
|
-
## Awesome List PRs / Updates
|
|
100
|
-
|
|
101
|
-
- [x] **awesome-llm-apps** — Already has A3M Router entry at line 290:
|
|
102
|
-
`* [🎯 A3M Router](advanced_llm_apps/llm_optimization_tools/a3m_router/)`
|
|
103
|
-
No update needed.
|
|
104
|
-
|
|
105
|
-
- [x] **Awesome-LLMOps** — Already has A3M Router entry at line 219:
|
|
106
|
-
`| [A3M Router](https://github.com/Das-rebel/a3m-router) | #1 on RouterArena (76.43) at $0.047/1K...`
|
|
107
|
-
No update needed.
|
|
108
|
-
|
|
109
|
-
---
|
|
110
|
-
|
|
111
|
-
## Post-Launch Actions
|
|
112
|
-
|
|
113
|
-
- [ ] Monitor GitHub stars (current: 8)
|
|
114
|
-
- [ ] Monitor npm downloads (current: 15,237)
|
|
115
|
-
- [ ] Respond to any GitHub issues
|
|
116
|
-
- [ ] Update RouterArena entry with A3M Router details
|
|
117
|
-
- [ ] Submit to:
|
|
118
|
-
- [ ] Product Hunt
|
|
119
|
-
- [ ] Hacker News (Show HN)
|
|
120
|
-
- [ ] Lobsters
|
|
121
|
-
- [ ] BetaList
|
|
122
|
-
|
|
123
|
-
---
|
|
124
|
-
|
|
125
|
-
## Tracking
|
|
126
|
-
|
|
127
|
-
| Channel | Status | Date Posted |
|
|
128
|
-
|---------|--------|-------------|
|
|
129
|
-
| r/LocalLLaMA | [ ] | |
|
|
130
|
-
| r/MachineLearning | [ ] | |
|
|
131
|
-
| r/SideProject | [ ] | |
|
|
132
|
-
| Twitter Thread | [ ] | |
|
|
133
|
-
| Import AI | [ ] | |
|
|
134
|
-
| The Batch | [ ] | |
|
|
135
|
-
| Lil'Log | [ ] | |
|
|
136
|
-
| DeepLearning.ai | [ ] | |
|
|
137
|
-
| 掘金AI | [ ] | |
|
|
138
|
-
| CSDN | [ ] | |
|
|
139
|
-
| OSChina | [ ] | |
|
|
140
|
-
| GitHub stars | 8 | baseline |
|
|
141
|
-
| npm downloads | 15,237 | baseline |
|