adaptive-memory-multi-model-router 2.14.46 → 2.14.48
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/{docs/llms.txt → llms.txt.bak} +6 -6
- package/package.json +270 -72
- package/src/routing/advancedRouter.ts.bak +650 -0
- package/test.js.bak +376 -0
- package/.dockerignore +0 -82
- package/.env.example +0 -303
- package/.github/DISCUSSIONS_WELCOME.md +0 -27
- package/.github/DISCUSSION_TEMPLATE.yml +0 -5
- package/.github/FUNDING.yml +0 -2
- package/.github/ISSUE_TEMPLATE/bug_report.md +0 -94
- package/.github/ISSUE_TEMPLATE/config.yml +0 -17
- package/.github/ISSUE_TEMPLATE/feature_request.md +0 -71
- package/.github/PULL_REQUEST_TEMPLATE.md +0 -71
- package/.github/dependabot.yml +0 -9
- package/.github/workflows/auto-publish.yml +0 -51
- package/.github/workflows/ci.yml +0 -263
- package/.github/workflows/codeql.yml +0 -38
- package/.github/workflows/npm-publish.yml +0 -20
- package/.github/workflows/pages.yml +0 -37
- package/.github/workflows/stale.yml +0 -54
- package/.publish-tick +0 -1
- package/.well-known/ai-plugin.json +0 -16
- package/AGENT_COUNCIL_FINDINGS.md +0 -142
- package/ARCHITECTURE.md +0 -346
- package/AUDIT_REPORT.md +0 -28
- package/CODE_OF_CONDUCT.md +0 -128
- package/CONTRIBUTING.md +0 -50
- package/CONTRIBUTORS.md +0 -20
- package/Dockerfile +0 -53
- package/Dockerfile.proxy +0 -33
- package/HEALTH_REPORT.md +0 -118
- package/IMPROVEMENT_PLAN.md +0 -107
- package/LANDING.md +0 -43
- package/LAUNCH-PAIN-DRIVEN.md +0 -339
- package/LAUNCH.md +0 -337
- package/LAUNCH_CHECKLIST.md +0 -141
- package/LAUNCH_SNAPSHOT.md +0 -260
- package/MANIFESTO.md +0 -41
- package/POPULARITY_BOOSTERS.md +0 -285
- package/PR_STATUS_REPORT.md +0 -148
- package/REDESIGN.md +0 -95
- package/RUNKIT.md +0 -83
- package/SECURITY.md +0 -29
- package/SUBMISSIONS.md +0 -43
- package/_schema.html +0 -53
- package/ai-plugin.json +0 -16
- package/articles/AI_AGENT_LLM_ROUTING.md +0 -150
- package/articles/CHINESE_DIRECTORIES.md +0 -100
- package/articles/CHINESE_SUBMISSIONS_READY.md +0 -322
- package/articles/COMPETITOR_ALERTS.md +0 -31
- package/articles/COMPLETE_POSTING_DIRECTORY.md +0 -147
- package/articles/CONTENT_STRUCTURE.md +0 -292
- package/articles/DEVTO_COST_GUIDE.md +0 -473
- package/articles/DEVTO_FINAL.md +0 -416
- package/articles/DEVTO_MULTI_PROVIDER.md +0 -542
- package/articles/DEVTO_READY.md +0 -255
- package/articles/DEVTO_V2_ANNOUNCEMENT.md +0 -160
- package/articles/DEVTO_VIRAL_GROWTH.md +0 -280
- package/articles/FRESH_devto.md +0 -460
- package/articles/FRESH_devto_2026_05.md +0 -73
- package/articles/FRESH_hackernews.md +0 -14
- package/articles/FRESH_reddit_ml.md +0 -90
- package/articles/FRESH_reddit_node.md +0 -198
- package/articles/FRESH_reddit_sideproject.md +0 -72
- package/articles/FRESH_reddit_webdev.md +0 -130
- package/articles/FROM_ZERO_TO_10K.md +0 -107
- package/articles/HN_10X_BETTER.md +0 -430
- package/articles/HN_ACCOUNT_GUIDE.md +0 -21
- package/articles/HN_CHINESE_STYLE.md +0 -308
- package/articles/HN_FINAL.md +0 -148
- package/articles/HN_POSTED_VERSION.md +0 -56
- package/articles/HN_POST_READY.md +0 -137
- package/articles/HN_RESEARCH.md +0 -364
- package/articles/HN_SHOW_routerarena.md +0 -17
- package/articles/HN_TIMING_GUIDE.md +0 -52
- package/articles/INDIEHACKERS_POST.md +0 -52
- package/articles/INDIEHACKERS_READY.md +0 -120
- package/articles/LLM_BENCHMARK_DEEP_DIVE.md +0 -153
- package/articles/MASTER_POSTING_DIRECTORY.md +0 -189
- package/articles/NEWSLETTER_SEND_NOW.md +0 -259
- package/articles/NEWSLETTER_SUBMISSIONS.md +0 -112
- package/articles/PAIN-DRIVEN-devto-v2.md +0 -308
- package/articles/PAIN-DRIVEN-devto-v3.md +0 -268
- package/articles/PAIN-DRIVEN-devto.md +0 -242
- package/articles/PAIN-DRIVEN-hackernews-v2.md +0 -138
- package/articles/PAIN-DRIVEN-hackernews-v3.md +0 -151
- package/articles/PAIN-DRIVEN-hackernews.md +0 -131
- package/articles/PAIN-DRIVEN-reddit-v2.md +0 -301
- package/articles/PAIN-DRIVEN-reddit-v3.md +0 -236
- package/articles/PAIN-DRIVEN-reddit.md +0 -218
- package/articles/PAIN-DRIVEN-twitter-v2.md +0 -110
- package/articles/PAIN-DRIVEN-twitter-v3.md +0 -121
- package/articles/PAIN-DRIVEN-twitter.md +0 -120
- package/articles/PORTKEY_VS_A3M.md +0 -147
- package/articles/POSTING_KIT_2026_05.md +0 -67
- package/articles/PRESS_KIT_routerarena.md +0 -77
- package/articles/PRODUCTHUNT_LISTING.md +0 -48
- package/articles/PRODUCTHUNT_READY.md +0 -106
- package/articles/PR_PLAN_vault.md +0 -125
- package/articles/REDDIT_FINAL.md +0 -232
- package/articles/REDDIT_POST.md +0 -67
- package/articles/REDDIT_SUBMISSION_READY.md +0 -348
- package/articles/ROUTERARENA_LEADER.md +0 -45
- package/articles/SHOW_HN_FINAL.md +0 -29
- package/articles/TWEETS_10K_DOWNLOADS.md +0 -47
- package/articles/TWEETS_BENCHMARK_FIRST.md +0 -46
- package/articles/TWEETS_MCP_PLAY.md +0 -51
- package/articles/TWEETS_SEQUENTIAL_BROKEN.md +0 -49
- package/articles/TWEETS_WHY_BUILD.md +0 -54
- package/articles/TWEETS_routerarena_leader.md +0 -53
- package/articles/TWEET_STORM_READY.md +0 -165
- package/articles/TWITTER_FINAL.md +0 -167
- package/articles/WHY_10X_BETTER.md +0 -261
- package/articles/WHY_CHINESE_STYLE_BETTER.md +0 -323
- package/articles/ai-discoverability-llm-routing.md +0 -210
- package/articles/devto-llm-routing.md +0 -138
- package/articles/hackernews-show-hn.md +0 -54
- package/articles/hashnode-llm-cost-optimization.md +0 -125
- package/articles/hn_show_2026_05.md +0 -11
- package/articles/medium-building-llm-router.md +0 -205
- package/articles/reddit-ml.md +0 -76
- package/articles/twitter-thread-cost-savings.md +0 -50
- package/articles/youtube-tutorial-script.md +0 -262
- package/assets/a3m_3blue1brown.mp4 +0 -0
- package/assets/banner.svg +0 -109
- package/assets/chart-cost-v2.svg +0 -91
- package/assets/chart-cost-v3.svg +0 -143
- package/assets/chart-features-v2.svg +0 -132
- package/assets/chart-features-v3.svg +0 -211
- package/assets/chart-growth-v2.svg +0 -122
- package/assets/chart-growth-v3.svg +0 -189
- package/assets/cost-comparison.svg +0 -134
- package/assets/cost-simple.svg +0 -64
- package/assets/demo-hn.gif +0 -0
- package/assets/feature-matrix.svg +0 -136
- package/assets/growth-chart-animated.svg +0 -76
- package/assets/growth-chart.svg +0 -82
- package/assets/growth-simple.svg +0 -69
- package/assets/hero-diagram.svg +0 -81
- package/assets/logo-new.svg +0 -21
- package/assets/logo.svg +0 -68
- package/assets/provider-comparison.svg +0 -121
- package/assets/social-preview-new.svg +0 -100
- package/assets/social-preview.svg +0 -194
- package/assets/social-v2.svg +0 -130
- package/assets/social-v3.svg +0 -212
- package/benchmark-provider-results.json +0 -245
- package/benchmark-results.json +0 -54
- package/council-votes/architecture-vote.md +0 -121
- package/council-votes/coverage-vote.md +0 -93
- package/data/adaptive-benchmark.json +0 -92
- package/data/benchmark-results.json +0 -47
- package/data/labeled-benchmark.json +0 -88
- package/demo/3blue1brown_video.py +0 -285
- package/demo/3blue1brown_video_v2.py +0 -310
- package/demo/IMPROVED_PROMPTS.md +0 -229
- package/demo/VEO3_PROMPTS.md +0 -269
- package/demo/VIDEO_PRODUCTION_GUIDE.md +0 -333
- package/demo/a3m_3blue1brown.mp4 +0 -0
- package/demo/asciinema-demo.sh +0 -195
- package/demo/demo-hn.tape +0 -74
- package/demo/demo-script.md +0 -53
- package/demo/demo-script.sh +0 -62
- package/demo/demo.svg +0 -75
- package/demo/frame1_ai_data_center.png +0 -0
- package/demo/frame1_sunset_video.mp4 +0 -0
- package/demo/frame2_cost_comparison.png +0 -0
- package/demo/frame2_cost_comparison_fallback.png +0 -0
- package/demo/frame3_parallel_execution.png +0 -0
- package/demo/frame3_parallel_execution_fallback.png +0 -0
- package/demo/frame4_providers.png +0 -0
- package/demo/frame4_providers_fallback.png +0 -0
- package/demo/frame5_endcard.png +0 -0
- package/demo/frame5_endcard_fallback.png +0 -0
- package/demo/new_frame1_hook.png +0 -0
- package/demo/new_frame2_proof.png +0 -0
- package/demo/new_frame3_wow.png +0 -0
- package/demo/new_frame4_social.png +0 -0
- package/demo/new_frame5_cta.png +0 -0
- package/demo/package.json +0 -13
- package/demo/product-video-final.mp4 +0 -0
- package/demo/product-video-hype-v1.mp4 +0 -0
- package/demo/product-video-v1.mp4 +0 -0
- package/demo/public/index.html +0 -762
- package/demo/recording.cast +0 -55
- package/demo/server.js +0 -405
- package/demo-new.tape +0 -71
- package/demo-real.sh +0 -198
- package/demo-simple.tape +0 -205
- package/demo.html +0 -520
- package/demo.sh +0 -85
- package/demo.tape +0 -259
- package/dist/analytics/costAnalytics.d.ts.map +0 -1
- package/dist/analytics/costAnalytics.js.map +0 -1
- package/dist/benchmark/comprehensive.js.map +0 -1
- package/dist/benchmark/reproducible.d.ts.map +0 -1
- package/dist/benchmark/reproducible.js.map +0 -1
- package/dist/cache/prefixCache.d.ts.map +0 -1
- package/dist/cache/prefixCache.js.map +0 -1
- package/dist/cache/responseCache.d.ts.map +0 -1
- package/dist/cache/responseCache.js.map +0 -1
- package/dist/cache/semanticCache.d.ts.map +0 -1
- package/dist/cache/semanticCache.js.map +0 -1
- package/dist/cli/setupWizard.d.ts.map +0 -1
- package/dist/cli/setupWizard.js.map +0 -1
- package/dist/cost/budgetEnforcer.d.ts.map +0 -1
- package/dist/cost/budgetEnforcer.js.map +0 -1
- package/dist/cost/costTracker.d.ts.map +0 -1
- package/dist/cost/costTracker.js.map +0 -1
- package/dist/ensemble/multiRoundDialog.js.map +0 -1
- package/dist/ensemble/shapleyValue.js.map +0 -1
- package/dist/integrations/langchainAdapter.d.ts.map +0 -1
- package/dist/integrations/langchainAdapter.js.map +0 -1
- package/dist/integrations/oauth.d.ts.map +0 -1
- package/dist/integrations/oauth.js.map +0 -1
- package/dist/integrations/scienceAdapter.js.map +0 -1
- package/dist/memory/autoFetch.d.ts.map +0 -1
- package/dist/memory/autoFetch.js.map +0 -1
- package/dist/memory/episodicMemory.d.ts.map +0 -1
- package/dist/memory/episodicMemory.js.map +0 -1
- package/dist/memory/hybridMemory.js.map +0 -1
- package/dist/memory/memoryTree.d.ts.map +0 -1
- package/dist/memory/memoryTree.js.map +0 -1
- package/dist/memory/obsidianVault.d.ts.map +0 -1
- package/dist/memory/obsidianVault.js.map +0 -1
- package/dist/memory/reasoningBank.js.map +0 -1
- package/dist/observability/changeWatch.d.ts.map +0 -1
- package/dist/observability/changeWatch.js.map +0 -1
- package/dist/observability/fatigueDetector.d.ts.map +0 -1
- package/dist/observability/fatigueDetector.js.map +0 -1
- package/dist/observability/index.d.ts.map +0 -1
- package/dist/observability/index.js.map +0 -1
- package/dist/observability/metrics.d.ts.map +0 -1
- package/dist/observability/metrics.js.map +0 -1
- package/dist/observability/middleware.d.ts.map +0 -1
- package/dist/observability/middleware.js.map +0 -1
- package/dist/observability/tracer.d.ts.map +0 -1
- package/dist/observability/tracer.js.map +0 -1
- package/dist/observability/types.d.ts.map +0 -1
- package/dist/observability/types.js.map +0 -1
- package/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
- package/dist/orchestration/haloOrchestrator.js.map +0 -1
- package/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
- package/dist/orchestration/mctsWorkflow.js.map +0 -1
- package/dist/providers/localProvider.d.ts.map +0 -1
- package/dist/providers/localProvider.js.map +0 -1
- package/dist/providers/providerConfig.d.ts.map +0 -1
- package/dist/providers/providerConfig.js.map +0 -1
- package/dist/providers/registry.d.ts.map +0 -1
- package/dist/providers/registry.js.map +0 -1
- package/dist/routing/advancedRouter.d.ts.map +0 -1
- package/dist/routing/advancedRouter.js.map +0 -1
- package/dist/routing/crossModelValidation.d.ts.map +0 -1
- package/dist/routing/crossModelValidation.js.map +0 -1
- package/dist/routing/providerHealth.d.ts.map +0 -1
- package/dist/routing/providerHealth.js.map +0 -1
- package/dist/routing/providerRetry.d.ts.map +0 -1
- package/dist/routing/providerRetry.js.map +0 -1
- package/dist/scripts/banner.js +0 -29
- package/dist/security/guardrails.d.ts.map +0 -1
- package/dist/security/guardrails.js.map +0 -1
- package/dist/server/dashboard.d.ts.map +0 -1
- package/dist/server/dashboard.js.map +0 -1
- package/dist/server/modelMapper.d.ts.map +0 -1
- package/dist/server/modelMapper.js.map +0 -1
- package/dist/server/proxyServer.d.ts.map +0 -1
- package/dist/server/proxyServer.js.map +0 -1
- package/dist/skills/__tests__/skill_manager.test.d.ts +0 -2
- package/dist/skills/__tests__/skill_manager.test.d.ts.map +0 -1
- package/dist/skills/__tests__/skill_manager.test.js +0 -268
- package/dist/skills/__tests__/skill_manager.test.js.map +0 -1
- package/dist/tools/tmlpdTools.d.ts.map +0 -1
- package/dist/tools/tmlpdTools.js.map +0 -1
- package/dist/tui/dashboard.d.ts.map +0 -1
- package/dist/tui/dashboard.js.map +0 -1
- package/dist/tui/index.d.ts.map +0 -1
- package/dist/tui/index.js.map +0 -1
- package/dist/utils/batchProcessor.d.ts.map +0 -1
- package/dist/utils/batchProcessor.js.map +0 -1
- package/dist/utils/compression.d.ts.map +0 -1
- package/dist/utils/compression.js.map +0 -1
- package/dist/utils/costUtils.d.ts.map +0 -1
- package/dist/utils/costUtils.js.map +0 -1
- package/dist/utils/reliability.d.ts.map +0 -1
- package/dist/utils/reliability.js.map +0 -1
- package/dist/utils/sorting.d.ts.map +0 -1
- package/dist/utils/sorting.js.map +0 -1
- package/dist/utils/speculativeDecoding.d.ts.map +0 -1
- package/dist/utils/speculativeDecoding.js.map +0 -1
- package/dist/utils/tokenUtils.d.ts.map +0 -1
- package/dist/utils/tokenUtils.js.map +0 -1
- package/docs/.nojekyll +0 -0
- package/docs/ANALYSIS_PRINCIPLES.md +0 -162
- package/docs/API.md +0 -855
- package/docs/ARCHITECTURAL-IMPROVEMENTS-2025.md +0 -1391
- package/docs/ARCHITECTURAL-IMPROVEMENTS-REVISED-2025.md +0 -1051
- package/docs/BENCHMARK.md +0 -170
- package/docs/CHINESE_PROVIDER_RELIABILITY.md +0 -37
- package/docs/CITATIONS.md +0 -74
- package/docs/CLAIMS_AND_EVIDENCE.md +0 -58
- package/docs/CONFIGURATION.md +0 -476
- package/docs/COUNCIL_DECISION.json +0 -816
- package/docs/COUNCIL_SUMMARY.md +0 -319
- package/docs/COUNCIL_V2.2_DECISION.md +0 -416
- package/docs/ENGINEERING_SPEC.md +0 -55
- package/docs/FACTORY_RESET.md +0 -34
- package/docs/GEO.md +0 -66
- package/docs/GEO_OPTIMIZATION.md +0 -30
- package/docs/GEO_ROOT_CAUSE.md +0 -136
- package/docs/GEO_STATUS.md +0 -85
- package/docs/GEO_TEST_RESULTS.md +0 -176
- package/docs/HN_CHECKLIST.md +0 -38
- package/docs/HN_FOUNDER_COMMENT.md +0 -17
- package/docs/HN_SUBMISSION_FINAL.md +0 -180
- package/docs/HN_SUBMISSION_V3.md +0 -56
- package/docs/IMPROVEMENT_ROADMAP.md +0 -515
- package/docs/INTEGRATIONS.md +0 -420
- package/docs/LANGCHAIN_INTEGRATION.md +0 -147
- package/docs/LLM_COUNCIL_DECISION.md +0 -508
- package/docs/MIDDLEWARE_CHAIN.md +0 -35
- package/docs/PROMO_CHECKLIST.md +0 -200
- package/docs/QUICKSTART.md +0 -271
- package/docs/QUICK_START.md +0 -43
- package/docs/QUICK_START_VISIBILITY.md +0 -782
- package/docs/REDDIT_GAP_ANALYSIS.md +0 -299
- package/docs/RELEASE_CHECKLIST.md +0 -32
- package/docs/REPRODUCIBILITY.md +0 -63
- package/docs/RESEARCH_BACKED_IMPROVEMENTS.md +0 -1180
- package/docs/ROUTING_RUBRIC.md +0 -197
- package/docs/SEO_AUDIT.md +0 -186
- package/docs/SOCIAL_LISTENING.md +0 -219
- package/docs/TMLPD_QNA.md +0 -751
- package/docs/TMLPD_V2.1_COMPLETE.md +0 -763
- package/docs/TMLPD_V2.2_RESEARCH_ROADMAP.md +0 -754
- package/docs/UPDATE_TOPICS.md +0 -15
- package/docs/USE_CASES.md +0 -59
- package/docs/V2.2_IMPLEMENTATION_COMPLETE.md +0 -446
- package/docs/V2_IMPLEMENTATION_GUIDE.md +0 -388
- package/docs/VERCEL_AI_SDK.md +0 -209
- package/docs/VISIBILITY_ADOPTION_PLAN.md +0 -1005
- package/docs/_config.yml +0 -49
- package/docs/ai-plugin.json +0 -16
- package/docs/api.html +0 -513
- package/docs/architecture-diagram.md +0 -40
- package/docs/benchmark-chart.png +0 -0
- package/docs/benchmark.html +0 -387
- package/docs/blog/routerarena-number-one.html +0 -73
- package/docs/cli-cheatsheet.md +0 -339
- package/docs/compare.md +0 -109
- package/docs/comparison-litellm.md +0 -88
- package/docs/comparison.md +0 -108
- package/docs/cost-chart-ascii.md +0 -42
- package/docs/cost-comparison-chart.svg +0 -88
- package/docs/curl-examples.md +0 -247
- package/docs/demo-auto.html +0 -264
- package/docs/demo.html +0 -416
- package/docs/geo/GENERATIVE_ENGINE_OPTIMIZATION.md +0 -232
- package/docs/index.html +0 -507
- package/docs/launch-content/LAUNCH_EXECUTION_CHECKLIST.md +0 -421
- package/docs/launch-content/README.md +0 -457
- package/docs/launch-content/assets/cost_comparison_100_tasks.png +0 -0
- package/docs/launch-content/assets/cumulative_savings.png +0 -0
- package/docs/launch-content/assets/parallel_speedup.png +0 -0
- package/docs/launch-content/assets/provider_pricing_comparison.png +0 -0
- package/docs/launch-content/assets/task_breakdown_comparison.png +0 -0
- package/docs/launch-content/generate_charts.py +0 -313
- package/docs/launch-content/hn_show_post.md +0 -139
- package/docs/launch-content/partner_outreach_templates.md +0 -745
- package/docs/launch-content/reddit_posts.md +0 -467
- package/docs/launch-content/twitter_thread.txt +0 -460
- package/docs/npm-downloads-chart.svg +0 -43
- package/docs/openapi.json +0 -139
- package/docs/openapi.yaml +0 -1318
- package/docs/quick-start.html +0 -366
- package/docs/robots.txt +0 -52
- package/docs/sitemap.xml +0 -57
- package/docs/styles.css +0 -682
- package/docs/well-known/ai-plugin.json +0 -16
- package/docs/wellknown/ai-plugin.json +0 -16
- package/docs-site/assets/og-banner.svg +0 -194
- package/docs-site/index.html +0 -632
- package/eval/README.md +0 -46
- package/eval/baselines/main.json +0 -12
- package/eval/benchmark_dataset.jsonl +0 -16
- package/eval/check_golden_routes.js +0 -64
- package/eval/datasets/catalog.json +0 -33
- package/eval/datasets/slices/cn_provider_reliability_v1.jsonl +0 -3
- package/eval/datasets/slices/cost_pressure_v1.jsonl +0 -3
- package/eval/datasets/slices/safety_guardrails_v1.jsonl +0 -3
- package/eval/evals.json +0 -199
- package/eval/fault_injection_thresholds.json +0 -3
- package/eval/generate_report.js +0 -128
- package/eval/golden_routes.json +0 -114
- package/eval/lib/experiment_registry.js +0 -24
- package/eval/run_eval.js +0 -197
- package/eval/run_fault_injection.js +0 -201
- package/eval/run_shadow_eval.js +0 -85
- package/eval/thresholds.json +0 -9
- package/examples/QUICKSTART.md +0 -183
- package/examples/README.md +0 -61
- package/examples/a3m-sdk.js +0 -124
- package/examples/basic-route.js +0 -54
- package/examples/chat-loop.js +0 -202
- package/examples/classify-then-route.js +0 -102
- package/examples/cost-compare.js +0 -120
- package/examples/ensemble.js +0 -160
- package/examples/whatsapp-telegram-bridge-demo.js +0 -302
- package/examples/whatsapp-telegram-bridge.js +0 -269
- package/hf-space/README.md +0 -23
- package/hf-space/app.py +0 -240
- package/hf-space/requirements.txt +0 -1
- package/huggingface_space/README.md +0 -35
- package/huggingface_space/app.py +0 -126
- package/huggingface_space/create_space.py +0 -208
- package/huggingface_space/requirements.txt +0 -1
- package/mcp-server/README.md +0 -188
- package/mcp-server/package.json +0 -29
- package/mcp-server/src/index.ts +0 -744
- package/mcp-server/tsconfig.json +0 -19
- package/openclaw-alexa-bridge/ALL_REMAINING_FIXES_PLAN.md +0 -313
- package/openclaw-alexa-bridge/REMAINING_FIXES_SUMMARY.md +0 -277
- package/openclaw-alexa-bridge/src/alexa_handler_no_tmlpd.js +0 -1234
- package/openclaw-alexa-bridge/test_fixes.js +0 -77
- package/playground/README.md +0 -51
- package/playground/codesandbox.json +0 -12
- package/playground/index.js +0 -39
- package/proxy/README.md +0 -227
- package/proxy/package-lock.json +0 -831
- package/proxy/package.json +0 -17
- package/proxy/rate-limit.js +0 -145
- package/proxy/rate-limit.test.js +0 -311
- package/proxy/server.js +0 -970
- package/python/README.md +0 -102
- package/python/a3m/__init__.py +0 -6
- package/python/a3m/client.py +0 -190
- package/python/a3m/models.py +0 -40
- package/python/a3m/sync_client.py +0 -61
- package/python/examples.py +0 -53
- package/python/integrations.py +0 -330
- package/python/pyproject.toml +0 -23
- package/python/setup.py +0 -28
- package/python/tmlpd.py +0 -369
- package/qna/REDDIT_GAP_ANALYSIS.md +0 -299
- package/qna/TMLPD_QNA.md +0 -751
- package/research/FINDING_001_safety.md +0 -28
- package/research/FINDING_002_error_diversity.md +0 -32
- package/research/FINDING_003_confidence_weighted_voting.md +0 -32
- package/research/FINDING_004_cross_model_semantic_detection.md +0 -37
- package/research/FINDING_005_knowledge_gap_orthogonality.md +0 -34
- package/research/HALLUCINATION_RESEARCH.md +0 -27
- package/research/ensemble-voting.md +0 -324
- package/research/loss-functions.md +0 -545
- package/research-log.md +0 -49
- package/scripts/banner.js +0 -29
- package/scripts/benchmark-local-routerarena.ts +0 -176
- package/scripts/benchmark.js +0 -145
- package/scripts/benchmark.sh +0 -61
- package/scripts/compare-providers.sh +0 -230
- package/scripts/content-planner.js +0 -25
- package/scripts/create-labeled-benchmark.ts +0 -105
- package/scripts/cross_post.py +0 -443
- package/scripts/local-router-benchmark.ts +0 -154
- package/scripts/post-all.sh +0 -41
- package/scripts/publish_fcc.py +0 -106
- package/scripts/push-to-gitee.sh +0 -25
- package/scripts/routerarena_ensemble.js +0 -144
- package/scripts/routing-benchmark-v2.js +0 -373
- package/scripts/routing-benchmark-v3.js +0 -118
- package/scripts/routing-benchmark.js +0 -462
- package/scripts/run-labeled-benchmark.mjs +0 -104
- package/scripts/run-mmlu-benchmark.js +0 -176
- package/scripts/run-provider-benchmark.js +0 -244
- package/scripts/update-npm-badges.js +0 -158
- package/skill/SKILL.md +0 -238
- package/src/__tests__/integration/tmpld_integration.test.py +0 -540
- package/src/skills/__tests__/skill_manager.test.ts +0 -328
- package/submissions/benchmarks/ALL_PLATFORMS_SUBMISSION.md +0 -94
- package/submissions/benchmarks/LLMROUTERBENCH_SUBMISSION.md +0 -121
- package/submissions/benchmarks/MMRBENCH_SUBMISSION.md +0 -94
- package/submissions/benchmarks/ROUTERARENA_UPDATE.md +0 -83
- package/submissions/benchmarks/ROUTERBENCH_SUBMISSION.md +0 -225
- package/test-council/1-structure-tests.test.js +0 -353
- package/test-council/1-structure-tests.test.ts +0 -353
- package/test-council/2-edge-case-tests.test.ts +0 -361
- package/test-council/3-performance-tests.test.ts +0 -669
- package/test-council/4-integration-tests.test.ts +0 -391
- package/test-council/5-agent-council-eval.test.ts +0 -413
- package/test-council/AGENT_COUNCIL_ARCHITECTURE.md +0 -349
- package/test-council/TEST_COUNCIL_REPORT.md +0 -201
- package/test-council/agents/edge-case-agent.ts +0 -363
- package/test-council/agents/performance-agent.ts +0 -426
- package/test-council/agents/structure-agent.ts +0 -227
- package/test-council/council.md +0 -183
- package/tests/__mocks__/tokenUtils.ts +0 -8
- package/tests/memory/episodicMemory.test.ts +0 -227
- package/tests/package-lock.json +0 -1628
- package/tests/package.json +0 -18
- package/tests/routing/ensembleVoting.test.ts +0 -236
- package/tests/routing/providerRetry.test.ts +0 -360
- package/tests/routing/queryTypePresets.test.ts +0 -208
- package/tests/security/guardrailEngine.test.ts +0 -700
- package/tests/tsconfig.json +0 -21
- package/tests/vitest.config.ts +0 -18
- package/tmlpd-pi-extension/README.md +0 -66
- package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts +0 -114
- package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/cache/prefixCache.js +0 -285
- package/tmlpd-pi-extension/dist/cache/prefixCache.js.map +0 -1
- package/tmlpd-pi-extension/dist/cache/responseCache.d.ts +0 -58
- package/tmlpd-pi-extension/dist/cache/responseCache.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/cache/responseCache.js +0 -153
- package/tmlpd-pi-extension/dist/cache/responseCache.js.map +0 -1
- package/tmlpd-pi-extension/dist/cli.js +0 -59
- package/tmlpd-pi-extension/dist/cost/costTracker.d.ts +0 -95
- package/tmlpd-pi-extension/dist/cost/costTracker.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/cost/costTracker.js +0 -240
- package/tmlpd-pi-extension/dist/cost/costTracker.js.map +0 -1
- package/tmlpd-pi-extension/dist/index.d.ts +0 -723
- package/tmlpd-pi-extension/dist/index.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/index.js +0 -239
- package/tmlpd-pi-extension/dist/index.js.map +0 -1
- package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts +0 -82
- package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/memory/episodicMemory.js +0 -145
- package/tmlpd-pi-extension/dist/memory/episodicMemory.js.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts +0 -102
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js +0 -207
- package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts +0 -85
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js +0 -210
- package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js.map +0 -1
- package/tmlpd-pi-extension/dist/providers/localProvider.d.ts +0 -102
- package/tmlpd-pi-extension/dist/providers/localProvider.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/providers/localProvider.js +0 -338
- package/tmlpd-pi-extension/dist/providers/localProvider.js.map +0 -1
- package/tmlpd-pi-extension/dist/providers/registry.d.ts +0 -55
- package/tmlpd-pi-extension/dist/providers/registry.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/providers/registry.js +0 -138
- package/tmlpd-pi-extension/dist/providers/registry.js.map +0 -1
- package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts +0 -68
- package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/routing/advancedRouter.js +0 -332
- package/tmlpd-pi-extension/dist/routing/advancedRouter.js.map +0 -1
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts +0 -101
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.js +0 -368
- package/tmlpd-pi-extension/dist/tools/tmlpdTools.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts +0 -96
- package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/batchProcessor.js +0 -170
- package/tmlpd-pi-extension/dist/utils/batchProcessor.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/compression.d.ts +0 -61
- package/tmlpd-pi-extension/dist/utils/compression.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/compression.js +0 -281
- package/tmlpd-pi-extension/dist/utils/compression.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/reliability.d.ts +0 -74
- package/tmlpd-pi-extension/dist/utils/reliability.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/reliability.js +0 -177
- package/tmlpd-pi-extension/dist/utils/reliability.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts +0 -117
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js +0 -246
- package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js.map +0 -1
- package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts +0 -50
- package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts.map +0 -1
- package/tmlpd-pi-extension/dist/utils/tokenUtils.js +0 -124
- package/tmlpd-pi-extension/dist/utils/tokenUtils.js.map +0 -1
- package/tmlpd-pi-extension/examples/QUICKSTART.md +0 -183
- package/tmlpd-pi-extension/package-lock.json +0 -79
- package/tmlpd-pi-extension/package.json +0 -172
- package/tmlpd-pi-extension/python/examples.py +0 -53
- package/tmlpd-pi-extension/python/integrations.py +0 -330
- package/tmlpd-pi-extension/python/setup.py +0 -28
- package/tmlpd-pi-extension/python/tmlpd.py +0 -369
- package/tmlpd-pi-extension/qna/REDDIT_GAP_ANALYSIS.md +0 -299
- package/tmlpd-pi-extension/qna/TMLPD_QNA.md +0 -751
- package/tmlpd-pi-extension/skill/SKILL.md +0 -238
- package/tmlpd-pi-extension/src/cache/responseCache.ts +0 -147
- package/tmlpd-pi-extension/src/cost/costTracker.ts +0 -302
- package/tmlpd-pi-extension/src/index.ts +0 -232
- package/tmlpd-pi-extension/src/memory/episodicMemory.ts +0 -257
- package/tmlpd-pi-extension/src/orchestration/haloOrchestrator.ts +0 -266
- package/tmlpd-pi-extension/src/orchestration/mctsWorkflow.ts +0 -262
- package/tmlpd-pi-extension/src/providers/localProvider.ts +0 -406
- package/tmlpd-pi-extension/src/providers/registry.ts +0 -164
- package/tmlpd-pi-extension/src/routing/ensembleVoting.ts +0 -159
- package/tmlpd-pi-extension/src/routing/queryTypePresets.ts +0 -136
- package/tmlpd-pi-extension/src/tools/tmlpdTools.ts +0 -433
- package/tmlpd-pi-extension/src/utils/batchProcessor.ts +0 -232
- package/tmlpd-pi-extension/src/utils/compression.ts +0 -325
- package/tmlpd-pi-extension/src/utils/reliability.ts +0 -221
- package/tmlpd-pi-extension/src/utils/tokenUtils.ts +0 -145
- package/tmlpd-pi-extension/tsconfig.json +0 -18
- package/tsconfig.build.json +0 -29
- package/tsconfig.json +0 -18
- /package/{docs/llms-full.txt → llms-full.txt.bak} +0 -0
package/docs/ROUTING_RUBRIC.md
DELETED
|
@@ -1,197 +0,0 @@
|
|
|
1
|
-
# A3M Router — Routing Quality Rubric
|
|
2
|
-
|
|
3
|
-
Five dimensions, each measured against real evidence from production routing data. The composite score drives the pulse metric and surfaces where routing quality degrades.
|
|
4
|
-
|
|
5
|
-
## Formula
|
|
6
|
-
|
|
7
|
-
```
|
|
8
|
-
composite_score = 0.30 × RoutingAccuracy
|
|
9
|
-
+ 0.25 × CostEfficiency
|
|
10
|
-
+ 0.20 × Latency
|
|
11
|
-
+ 0.15 × ErrorHandling
|
|
12
|
-
+ 0.10 × CacheHitRate
|
|
13
|
-
```
|
|
14
|
-
|
|
15
|
-
**Weight justification:**
|
|
16
|
-
- **30% Accuracy** — Getting the right provider for the right query is the primary function. Everything else is secondary.
|
|
17
|
-
- **25% Cost Efficiency** — The core value proposition. If accuracy is perfect but costs are high, we failed at the value prop.
|
|
18
|
-
- **20% Latency** — Developer experience. A router that's slow gets bypassed regardless of accuracy.
|
|
19
|
-
- **15% Error Handling** — Reliability under provider failures. Matters most in production.
|
|
20
|
-
- **10% Cache Hit Rate** — Bonus optimization. Only matters at scale.
|
|
21
|
-
|
|
22
|
-
---
|
|
23
|
-
|
|
24
|
-
## 1. Routing Accuracy (30%)
|
|
25
|
-
|
|
26
|
-
*"Did the router send the query to the right tier?"*
|
|
27
|
-
|
|
28
|
-
### Scoring
|
|
29
|
-
|
|
30
|
-
| Score | Criterion |
|
|
31
|
-
|-------|-----------|
|
|
32
|
-
| 90-100 | >95% within ±1 tier. RouterArena score above 70. Fewer than 1 in 20 queries misrouted by more than one tier. |
|
|
33
|
-
| 75-89 | 85-95% within ±1 tier. RouterArena score 60-70. Occasional over-tiering on simple queries. |
|
|
34
|
-
| 60-74 | 70-85% within ±1 tier. RouterArena score 50-60. Noticeable over-tiering on medium queries. |
|
|
35
|
-
| 45-59 | 50-70% within ±1 tier. Frequent misrouting on complex/expert queries. |
|
|
36
|
-
| <45 | <50% within ±1 tier. Router is essentially random. Major overhaul needed. |
|
|
37
|
-
|
|
38
|
-
### Evidence to capture
|
|
39
|
-
|
|
40
|
-
- **RouteLLM comparison** — where RouteLLM routes vs A3M (reference benchmark)
|
|
41
|
-
- **Tier confusion matrix** — which query types cause the most over/under-tiering
|
|
42
|
-
- **RouterArena score** — the single-number benchmark (current: 70.32)
|
|
43
|
-
- **Golden route deviation** — percentage of queries where A3M disagrees with golden route
|
|
44
|
-
|
|
45
|
-
### Common failure patterns
|
|
46
|
-
|
|
47
|
-
| Pattern | Fix |
|
|
48
|
-
|---------|-----|
|
|
49
|
-
| All queries go to free tier (0% to mid/premium) | Add confidence floor. If no provider has confidence > 0.5, fallback to premium |
|
|
50
|
-
| Code queries misrouted to creative models | Strengthen code-detection signals (``` blocks, function syntax) |
|
|
51
|
-
| Legal/medical routed to cheap models | Add domain detection for 5 safety-critical domains |
|
|
52
|
-
| Ambiguous queries bounce between tiers | Implement query-type confidence threshold |
|
|
53
|
-
|
|
54
|
-
### Dollar Impact
|
|
55
|
-
|
|
56
|
-
```
|
|
57
|
-
Wasted = (MismatchCount × AvgCostDelta)
|
|
58
|
-
AvgCostDelta = |ActualCost - OptimalCost|
|
|
59
|
-
```
|
|
60
|
-
|
|
61
|
-
---
|
|
62
|
-
|
|
63
|
-
## 2. Cost Efficiency (25%)
|
|
64
|
-
|
|
65
|
-
*"Did the router save money compared to all-premium routing?"*
|
|
66
|
-
|
|
67
|
-
### Scoring
|
|
68
|
-
|
|
69
|
-
| Score | Savings vs All-Premium | CPP (Cost Per Query) |
|
|
70
|
-
|-------|----------------------|---------------------|
|
|
71
|
-
| 90-100 | >70% savings | <$0.001/query |
|
|
72
|
-
| 75-89 | 50-70% savings | $0.001-$0.003/query |
|
|
73
|
-
| 60-74 | 30-50% savings | $0.003-$0.006/query |
|
|
74
|
-
| 45-59 | 15-30% savings | $0.006-$0.01/query |
|
|
75
|
-
| <45 | <15% savings | >$0.01/query |
|
|
76
|
-
|
|
77
|
-
### Evidence to capture
|
|
78
|
-
|
|
79
|
-
- **Cost per query** over the measurement window
|
|
80
|
-
- **Savings vs all-premium** — total cost if every query went to GPT-4o
|
|
81
|
-
- **Free tier utilization** — % of queries handled by free/cheap providers
|
|
82
|
-
- **Budget cap hits** — how often budget enforcement is triggered
|
|
83
|
-
- **Provider cost breakdown** — cost per provider
|
|
84
|
-
|
|
85
|
-
### Common failure patterns
|
|
86
|
-
|
|
87
|
-
| Pattern | Fix |
|
|
88
|
-
|---------|-----|
|
|
89
|
-
| Everything routes to free (0% accuracy) | Add quality floor to cost optimization |
|
|
90
|
-
| Budget cap tripped too often | Increase budget cap or reduce free-tier usage |
|
|
91
|
-
| Premium providers selected for trivial queries | Lower confidence threshold for mid-tier |
|
|
92
|
-
|
|
93
|
-
### Dollar Impact
|
|
94
|
-
|
|
95
|
-
```
|
|
96
|
-
Savings = (TotalQueryCount × AvgPremiumCost) - ActualTotalCost
|
|
97
|
-
MonthlySavings = Savings × (30 / MeasurementDays)
|
|
98
|
-
```
|
|
99
|
-
|
|
100
|
-
---
|
|
101
|
-
|
|
102
|
-
## 3. Latency (20%)
|
|
103
|
-
|
|
104
|
-
*"How fast is the router decision?"*
|
|
105
|
-
|
|
106
|
-
### Scoring (P95 Latency)
|
|
107
|
-
|
|
108
|
-
| Score | P95 Latency | Overhead vs Direct |
|
|
109
|
-
|-------|------------|-------------------|
|
|
110
|
-
| 90-100 | <200ms | <50ms overhead |
|
|
111
|
-
| 75-89 | 200-500ms | 50-100ms overhead |
|
|
112
|
-
| 60-74 | 500-1000ms | 100-200ms overhead |
|
|
113
|
-
| 45-59 | 1-3s | 200-500ms overhead |
|
|
114
|
-
| <45 | >3s | >500ms overhead |
|
|
115
|
-
|
|
116
|
-
### Evidence to capture
|
|
117
|
-
|
|
118
|
-
- **P50, P95, P99 latency** — distribution
|
|
119
|
-
- **Routing decision overhead** — time spent in routing logic vs provider response
|
|
120
|
-
- **Slowest providers** — top 5 by latency
|
|
121
|
-
- **Cache response time** — cached vs uncached query time
|
|
122
|
-
|
|
123
|
-
---
|
|
124
|
-
|
|
125
|
-
## 4. Error Handling (15%)
|
|
126
|
-
|
|
127
|
-
*"How well does the router handle failures?"*
|
|
128
|
-
|
|
129
|
-
### Scoring
|
|
130
|
-
|
|
131
|
-
| Score | Criterion |
|
|
132
|
-
|-------|-----------|
|
|
133
|
-
| 90-100 | 0 unhandled failures. All provider failures caught by circuit breaker. Graceful fallback 100% of the time. |
|
|
134
|
-
| 75-89 | <1% unhandled failures. Circuit breaker catches most issues. Fallback succeeds >95%. |
|
|
135
|
-
| 60-74 | 1-3% unhandled failures. Occasional circuit breaker misses. Fallback succeeds >80%. |
|
|
136
|
-
| 45-59 | 3-10% unhandled failures. Circuit breaker coverage gaps. Fallback degrades. |
|
|
137
|
-
| <45 | >10% unhandled failures. Critical reliability issues. |
|
|
138
|
-
|
|
139
|
-
### Evidence to capture
|
|
140
|
-
|
|
141
|
-
- **Circuit breaker trips** — how many times each provider was disabled
|
|
142
|
-
- **Fallback success rate** — % of attempts where fallback succeeded
|
|
143
|
-
- **Unhandled failures** — queries that returned no response
|
|
144
|
-
- **Provider health score** — current health of each provider
|
|
145
|
-
|
|
146
|
-
### Common failure patterns
|
|
147
|
-
|
|
148
|
-
| Pattern | Fix |
|
|
149
|
-
|---------|-----|
|
|
150
|
-
| Circuit breaker never fires (wasteful retries) | Lower threshold for circuit breaker trip |
|
|
151
|
-
| Circuit breaker fires too often | Increase threshold, add validation before trip |
|
|
152
|
-
| All providers fail simultaneously | Add cold-start provider as emergency fallback |
|
|
153
|
-
|
|
154
|
-
---
|
|
155
|
-
|
|
156
|
-
## 5. Cache Hit Rate (10%)
|
|
157
|
-
|
|
158
|
-
*"How often does semantic cache avoid a duplicate provider call?"*
|
|
159
|
-
|
|
160
|
-
### Scoring
|
|
161
|
-
|
|
162
|
-
| Score | Cache Hit Rate |
|
|
163
|
-
|-------|---------------|
|
|
164
|
-
| 90-100 | >40% |
|
|
165
|
-
| 75-89 | 30-40% |
|
|
166
|
-
| 60-74 | 20-30% |
|
|
167
|
-
| 45-59 | 10-20% |
|
|
168
|
-
| <45 | <10% |
|
|
169
|
-
|
|
170
|
-
### Evidence to capture
|
|
171
|
-
|
|
172
|
-
- **Global cache hit rate** — across all queries
|
|
173
|
-
- **Per-query-type cache rate** — which query types benefit most
|
|
174
|
-
- **Cache latency savings** — total time saved by cache hits
|
|
175
|
-
- **Cache cost savings** — how much money cache saved
|
|
176
|
-
|
|
177
|
-
---
|
|
178
|
-
|
|
179
|
-
## Composite Score Bands
|
|
180
|
-
|
|
181
|
-
| Band | Score | Meaning |
|
|
182
|
-
|------|-------|---------|
|
|
183
|
-
| 🟢 Excellent | 85-100 | Production-ready. Fine-tune edge cases. |
|
|
184
|
-
| 🟡 Good | 70-84 | Working well. Some optimization opportunities. |
|
|
185
|
-
| 🟠 Fair | 55-69 | Functional but needs attention. |
|
|
186
|
-
| 🔴 Poor | 40-54 | Quality issues. Investigate root cause. |
|
|
187
|
-
| ⚫ Critical | <40 | Router needs significant work. |
|
|
188
|
-
|
|
189
|
-
## Usage
|
|
190
|
-
|
|
191
|
-
Calculate after every 100 queries or at least once per week:
|
|
192
|
-
|
|
193
|
-
```bash
|
|
194
|
-
a3m-router metrics # Quick pulse
|
|
195
|
-
a3m-router metrics --full # Full rubric with all dimensions
|
|
196
|
-
a3m-router metrics --export # Raw JSON for analysis
|
|
197
|
-
```
|
package/docs/SEO_AUDIT.md
DELETED
|
@@ -1,186 +0,0 @@
|
|
|
1
|
-
# SEO Audit: A3M Router (adaptive-memory-multi-model-router)
|
|
2
|
-
|
|
3
|
-
**Date:** 2026-05-18 (Updated)
|
|
4
|
-
**Package:** adaptive-memory-multi-model-router
|
|
5
|
-
**NPM URL:** https://www.npmjs.com/package/adaptive-memory-multi-model-router
|
|
6
|
-
**GitHub URL:** https://github.com/Das-rebel/a3m-router
|
|
7
|
-
|
|
8
|
-
---
|
|
9
|
-
|
|
10
|
-
## 1. Keyword Research
|
|
11
|
-
|
|
12
|
-
### Primary Keywords (benchmark-driven, high intent)
|
|
13
|
-
|
|
14
|
-
| Keyword | Est. Monthly Volume | Competition | Intent | Priority |
|
|
15
|
-
|---------|---------------------|-------------|--------|----------|
|
|
16
|
-
| `llm router benchmark` | 1,200-2,000 | Low | Commercial | P0 |
|
|
17
|
-
| `llm routing accuracy` | 800-1,500 | Low | Informational | P0 |
|
|
18
|
-
| `routellm alternative` | 1,500-3,000 | Low-Medium | Commercial | P0 |
|
|
19
|
-
| `litellm alternative` | 1,500-3,000 | Low-Medium | Commercial | P0 |
|
|
20
|
-
| `llm cost optimization` | 800-1,500 | Low | Commercial | P0 |
|
|
21
|
-
| `openai proxy free` | 2,000-4,000 | Medium | Transactional | P0 |
|
|
22
|
-
| `llm gateway open source` | 1,000-2,000 | Low-Medium | Commercial | P0 |
|
|
23
|
-
|
|
24
|
-
### Long-Tail Keywords (FAQ/content targets)
|
|
25
|
-
|
|
26
|
-
| Keyword | Est. Monthly Volume | Competition | Intent | Priority |
|
|
27
|
-
|---------|---------------------|-------------|--------|----------|
|
|
28
|
-
| `how to reduce openai api costs` | 1,500-3,000 | Low | Informational | P0 |
|
|
29
|
-
| `llm routing without gpu` | 300-600 | Very Low | Informational | P0 |
|
|
30
|
-
| `lightweight llm router` | 500-1,000 | Low | Commercial | P0 |
|
|
31
|
-
| `keyword-based llm routing` | 100-300 | Very Low | Informational | P1 |
|
|
32
|
-
| `drop-in openai proxy` | 300-600 | Low | Commercial | P0 |
|
|
33
|
-
| `free llm proxy` | 800-1,500 | Low | Transactional | P0 |
|
|
34
|
-
| `cheapest openai api alternative` | 500-1,000 | Low | Commercial | P0 |
|
|
35
|
-
|
|
36
|
-
### Competitive/Comparison Keywords (HIGH VALUE)
|
|
37
|
-
|
|
38
|
-
| Keyword | Est. Monthly Volume | Competition | Priority |
|
|
39
|
-
|---------|---------------------|-------------|----------|
|
|
40
|
-
| `routellm alternative` | 1,500-3,000 | Low-Medium | P0 |
|
|
41
|
-
| `litellm alternative` | 1,500-3,000 | Low-Medium | P0 |
|
|
42
|
-
| `a3m router vs litellm` | 50-100 | Very Low | P1 |
|
|
43
|
-
| `a3m router vs routellm` | 50-100 | Very Low | P1 |
|
|
44
|
-
| `openrouter alternative` | 200-400 | Low | P1 |
|
|
45
|
-
| `portkey alternative` | 100-200 | Very Low | P2 |
|
|
46
|
-
|
|
47
|
-
### Secondary Keywords
|
|
48
|
-
|
|
49
|
-
| Keyword | Est. Monthly Volume | Competition | Priority |
|
|
50
|
-
|---------|---------------------|-------------|----------|
|
|
51
|
-
| `ai gateway` | 5,000-8,000 | High | P1 |
|
|
52
|
-
| `model routing` | 500-1,000 | Low | P1 |
|
|
53
|
-
| `llm proxy` | 1,000-2,000 | Low-Medium | P1 |
|
|
54
|
-
| `openai compatible proxy` | 500-1,000 | Low | P1 |
|
|
55
|
-
| `llm load balancer` | 300-800 | Low | P1 |
|
|
56
|
-
| `llm provider comparison` | 1,000-2,000 | Medium | P1 |
|
|
57
|
-
|
|
58
|
-
---
|
|
59
|
-
|
|
60
|
-
## 2. Key Messages (use everywhere)
|
|
61
|
-
|
|
62
|
-
1. **"82.5% routing accuracy without ML"** — Lead metric, differentiator
|
|
63
|
-
2. **"Matches RouteLLM BERT within 2.5%"** — Competitive positioning
|
|
64
|
-
3. **"30x more efficient than GPU-based routing"** — Efficiency story
|
|
65
|
-
4. **"Only router besides RouteLLM with published benchmarks"** — Trust signal
|
|
66
|
-
5. **"245% growth, 2,775 downloads in 3 days"** — Social proof
|
|
67
|
-
|
|
68
|
-
---
|
|
69
|
-
|
|
70
|
-
## 3. Competitive Positioning
|
|
71
|
-
|
|
72
|
-
### RouteLLM Alternative (HIGH VALUE)
|
|
73
|
-
|
|
74
|
-
"RouteLLM alternative" is our highest-value keyword because:
|
|
75
|
-
- RouteLLM users are actively looking for alternatives (GPU cost, complexity)
|
|
76
|
-
- We have a direct benchmark comparison (within 2.5%)
|
|
77
|
-
- We offer features RouteLLM lacks (proxy, cache, guardrails)
|
|
78
|
-
|
|
79
|
-
**Positioning:** "A3M Router matches RouteLLM BERT within 2.5% — without GPU. Plus proxy, cache, guardrails."
|
|
80
|
-
|
|
81
|
-
### LiteLLM Alternative (HIGH VALUE)
|
|
82
|
-
|
|
83
|
-
"LiteLLM alternative" captures users who want:
|
|
84
|
-
- Published routing benchmarks
|
|
85
|
-
- Zero-config setup
|
|
86
|
-
- Built-in semantic caching
|
|
87
|
-
|
|
88
|
-
**Positioning:** "A3M Router is the only LiteLLM alternative with published routing benchmarks (82.5% accuracy)."
|
|
89
|
-
|
|
90
|
-
### Competitive Table
|
|
91
|
-
|
|
92
|
-
| Competitor | NPM Weekly Downloads | Our Edge |
|
|
93
|
-
|------------|---------------------|----------|
|
|
94
|
-
| litellm | ~80,000 | Published benchmarks, zero-config, semantic cache |
|
|
95
|
-
| openrouter-sdk | ~5,000 | Self-hosted, no middleman fees, published accuracy |
|
|
96
|
-
| portkey-ai | ~3,000 | Open-source, free, no signup, benchmarks |
|
|
97
|
-
| routellm | ~1,000 | No GPU needed, proxy included, 39 providers |
|
|
98
|
-
|
|
99
|
-
---
|
|
100
|
-
|
|
101
|
-
## 4. On-Page SEO Checklist
|
|
102
|
-
|
|
103
|
-
### docs-site/index.html
|
|
104
|
-
|
|
105
|
-
| Element | Status | Target |
|
|
106
|
-
|---------|--------|--------|
|
|
107
|
-
| Title tag | UPDATED | "A3M Router — 82.5% Routing Accuracy Without ML \| Matches RouteLLM" |
|
|
108
|
-
| Meta description | UPDATED | 30x efficiency story with accuracy metric |
|
|
109
|
-
| Keywords meta | UPDATED | All 12 primary/long-tail keywords |
|
|
110
|
-
| H1 tag | UPDATED | "LLM Routing That Matches GPU Models — Without GPU" |
|
|
111
|
-
| Stats section | UPDATED | Leads with 82.5% accuracy, 2.5% gap, 30x efficiency |
|
|
112
|
-
| FAQ schema | UPDATED | 8 questions targeting AI search queries |
|
|
113
|
-
| OG tags | UPDATED | Benchmark-first messaging |
|
|
114
|
-
| Twitter cards | UPDATED | Benchmark-first messaging |
|
|
115
|
-
|
|
116
|
-
### Content Structure (H-tag hierarchy)
|
|
117
|
-
|
|
118
|
-
```
|
|
119
|
-
H1: LLM Routing That Matches GPU Models — Without GPU
|
|
120
|
-
H2: Intelligent LLM Routing (feature)
|
|
121
|
-
H2: Cost Optimization (feature)
|
|
122
|
-
H2: Smart Fallback & Retry (feature)
|
|
123
|
-
H2: Real-time Analytics (feature)
|
|
124
|
-
H2: Security Guardrails (feature)
|
|
125
|
-
H2: Semantic Cache (feature)
|
|
126
|
-
H2: LLM Provider Pricing Tiers (section)
|
|
127
|
-
H3: Free/Budget/Mid/Premium Tier
|
|
128
|
-
H2: Quick Start: LLM Routing in 30 Seconds
|
|
129
|
-
H2: Frequently Asked Questions
|
|
130
|
-
H3: What is LLM routing accuracy?
|
|
131
|
-
H3: How does keyword-based routing compare to ML routing?
|
|
132
|
-
H3: What is the best lightweight LLM router?
|
|
133
|
-
H3: How to reduce OpenAI API costs?
|
|
134
|
-
H3: How does A3M Router compare to RouteLLM?
|
|
135
|
-
H3: How does A3M Router compare to LiteLLM?
|
|
136
|
-
```
|
|
137
|
-
|
|
138
|
-
---
|
|
139
|
-
|
|
140
|
-
## 5. Technical SEO
|
|
141
|
-
|
|
142
|
-
### robots.txt (UPDATED)
|
|
143
|
-
- Allows full crawling
|
|
144
|
-
- Explicitly allows docs/, assets/, llms.txt, README.md
|
|
145
|
-
- Sitemap reference included
|
|
146
|
-
- Blocks /node_modules/, /dist/, /test/, /src/, /.git/
|
|
147
|
-
|
|
148
|
-
### sitemap.xml (UPDATED)
|
|
149
|
-
- 11 URLs including all key pages
|
|
150
|
-
- New: GEO.md, SEO_AUDIT.md, CONFIGURATION.md, INTEGRATIONS.md, benchmark-results.json, llms.txt
|
|
151
|
-
- Priority weighting: homepage (1.0) > GitHub (0.9) > NPM (0.9) > docs (0.7-0.8)
|
|
152
|
-
|
|
153
|
-
### llms.txt (UPDATED)
|
|
154
|
-
- Leads with benchmark story (82.5% accuracy)
|
|
155
|
-
- Includes comparison table vs RouteLLM/LiteLLM
|
|
156
|
-
- Structured data section for AI extraction
|
|
157
|
-
- All 5 key messages included
|
|
158
|
-
|
|
159
|
-
---
|
|
160
|
-
|
|
161
|
-
## 6. GEO (Generative Engine Optimization)
|
|
162
|
-
|
|
163
|
-
See `docs/GEO.md` for full GEO strategy. Key elements:
|
|
164
|
-
|
|
165
|
-
1. **FAQ format** answering AI-searchable questions
|
|
166
|
-
2. **Comparison tables** with verifiable data AI engines cite
|
|
167
|
-
3. **Structured key-value block** for direct AI extraction
|
|
168
|
-
4. **Target AI queries** mapped to A3M Router answers
|
|
169
|
-
|
|
170
|
-
---
|
|
171
|
-
|
|
172
|
-
## 7. Action Items
|
|
173
|
-
|
|
174
|
-
- [x] Update docs-site/index.html title, meta, H1, stats, FAQ
|
|
175
|
-
- [x] Update FAQ schema with benchmark-focused questions
|
|
176
|
-
- [x] Update OG/Twitter cards with benchmark messaging
|
|
177
|
-
- [x] Update llms.txt with benchmark story
|
|
178
|
-
- [x] Create docs/GEO.md with AI search optimization
|
|
179
|
-
- [x] Update docs/SEO_AUDIT.md with new keywords
|
|
180
|
-
- [x] Update public/sitemap.xml with all key pages
|
|
181
|
-
- [x] Update public/robots.txt with better crawling rules
|
|
182
|
-
- [x] Update package.json keywords (optimized)
|
|
183
|
-
- [ ] Create OG banner image with benchmark metrics
|
|
184
|
-
- [ ] Write comparison articles (A3M vs RouteLLM, vs LiteLLM)
|
|
185
|
-
- [ ] Submit sitemap to Google Search Console
|
|
186
|
-
- [ ] Set up Google Search Console for das-rebel.github.io
|
package/docs/SOCIAL_LISTENING.md
DELETED
|
@@ -1,219 +0,0 @@
|
|
|
1
|
-
# A3M Router — Social Listening & Reply Playbook
|
|
2
|
-
|
|
3
|
-
> "Set up Google Alerts for competitors → find discussions about routing/cost → craft reply that converts"
|
|
4
|
-
> — Vault insight, score 29.3
|
|
5
|
-
|
|
6
|
-
## 1. Monitoring Setup
|
|
7
|
-
|
|
8
|
-
### Google Alerts (free)
|
|
9
|
-
Set up alerts for these keywords. Frequency: "As it happens."
|
|
10
|
-
|
|
11
|
-
| Alert | Keyword | Why |
|
|
12
|
-
|-------|---------|-----|
|
|
13
|
-
| **A** | `"LLM routing" OR "model routing"` | Direct mention of the space |
|
|
14
|
-
| **B** | `"AI gateway" OR "LLM gateway"` | Competitor category |
|
|
15
|
-
| **C** | `"LiteLLM" OR "portkey" OR "route LLM"` | Competitor names |
|
|
16
|
-
| **D** | `"switch between LLMs" OR "multi-model"` | Pain point search |
|
|
17
|
-
| **E** | `"LLM too expensive" OR "API costs"` | Pain point — cost |
|
|
18
|
-
| **F** | `"open source LLM router"` | Direct search intent |
|
|
19
|
-
|
|
20
|
-
### F5bot (free tier)
|
|
21
|
-
Monitor Hacker News for:
|
|
22
|
-
- `llm router` `model routing` `ai gateway` `openrouter` `litellm` `route llm`
|
|
23
|
-
|
|
24
|
-
Setup: https://f5bot.com — enter keywords, get email alerts.
|
|
25
|
-
|
|
26
|
-
### ReplyGuy (paid, ~$15/mo)
|
|
27
|
-
Automated reply system for Reddit, HN, X, YouTube:
|
|
28
|
-
- Monitors keywords across platforms
|
|
29
|
-
- Suggests replies that are contextual, not spammy
|
|
30
|
-
- Setup: https://replyguy.com
|
|
31
|
-
|
|
32
|
-
### Morebirds (HN specific)
|
|
33
|
-
Monitors Hacker News for keywords and competitors:
|
|
34
|
-
- https://morebirds.com
|
|
35
|
-
|
|
36
|
-
---
|
|
37
|
-
|
|
38
|
-
## 2. Keywords to Monitor (Priority Order)
|
|
39
|
-
|
|
40
|
-
### Tier 1: High Intent (reply immediately)
|
|
41
|
-
```
|
|
42
|
-
"llm routing" "model routing" "ai gateway" "llm gateway" "routing llm"
|
|
43
|
-
"open source router" "llm router alternative" "best model router"
|
|
44
|
-
"switch between models" "model fallback" "intelligent routing"
|
|
45
|
-
```
|
|
46
|
-
|
|
47
|
-
### Tier 2: Pain Point (reply with empathy + solution)
|
|
48
|
-
```
|
|
49
|
-
"llm costs too high" "gpt too expensive" "api cost too much"
|
|
50
|
-
"openai bill too high" "reduce llm costs" "need cheaper llm"
|
|
51
|
-
"paying too much for ai" "api costs killing me"
|
|
52
|
-
```
|
|
53
|
-
|
|
54
|
-
### Tier 3: Competitor (reply as comparison, not attack)
|
|
55
|
-
```
|
|
56
|
-
"LitellLm" "Portkey" "RouteLLM" "OpenRouter" "Helicone"
|
|
57
|
-
"LangSmith" "AI Gateway (Cloudflare)" "GenAI Gateway"
|
|
58
|
-
```
|
|
59
|
-
When someone has issues with these, offer A3M as a working alternative.
|
|
60
|
-
|
|
61
|
-
### Tier 4: Educational (reply with knowledge, mention tool naturally)
|
|
62
|
-
```
|
|
63
|
-
"how to choose llm" "which model to use" "compare models"
|
|
64
|
-
"llm benchmark" "model evaluation" "provider comparison"
|
|
65
|
-
```
|
|
66
|
-
|
|
67
|
-
---
|
|
68
|
-
|
|
69
|
-
## 3. Reply Templates
|
|
70
|
-
|
|
71
|
-
### Template 1: Cost Pain
|
|
72
|
-
**Trigger:** Someone says "my OpenAI bill is too high" or "LLM API costs are crazy"
|
|
73
|
-
|
|
74
|
-
**Platform:** HN, Reddit, X
|
|
75
|
-
|
|
76
|
-
**Reply:**
|
|
77
|
-
```
|
|
78
|
-
We were in the same boat — $800/month on GPT-4. Built A3M Router to route smart queries to cheaper models and keep hard ones on premium.
|
|
79
|
-
|
|
80
|
-
Same answers (RouterArena #1 at 70.32). Cost dropped to ~$5.
|
|
81
|
-
|
|
82
|
-
Open source, MIT. Run it yourself:
|
|
83
|
-
npx a3m-router route "your query"
|
|
84
|
-
|
|
85
|
-
Or verify: npx a3m-router benchmark --reproducible
|
|
86
|
-
```
|
|
87
|
-
|
|
88
|
-
### Template 2: "Which tool should I use?"
|
|
89
|
-
**Trigger:** Someone comparing LiteLLM, Portkey, RouteLLM
|
|
90
|
-
|
|
91
|
-
**Platform:** Reddit, HN
|
|
92
|
-
|
|
93
|
-
**Reply:**
|
|
94
|
-
```
|
|
95
|
-
We evaluated all of them before building A3M. What we wanted:
|
|
96
|
-
- Open source (MIT)
|
|
97
|
-
- Actually routes to cheapest capable model, not just round-robin
|
|
98
|
-
- Works with existing OpenAI SDK
|
|
99
|
-
- Has a reproducible benchmark
|
|
100
|
-
|
|
101
|
-
A3M Router hits all of those. #1 on RouterArena (70.32). Costs $0.047/1K vs GPT-5 at $10/1K.
|
|
102
|
-
|
|
103
|
-
npx a3m-router route "test it out"
|
|
104
|
-
```
|
|
105
|
-
|
|
106
|
-
### Template 3: "Building an AI gateway"
|
|
107
|
-
**Trigger:** Someone asks how to build multi-model routing, or shows their architecture
|
|
108
|
-
|
|
109
|
-
**Platform:** HN, Reddit, DevTo
|
|
110
|
-
|
|
111
|
-
**Reply:**
|
|
112
|
-
```
|
|
113
|
-
We built something similar. Spent months on it. Eventually open-sourced it as A3M Router.
|
|
114
|
-
|
|
115
|
-
Biggest lessons:
|
|
116
|
-
1. Cost-based routing saves 200x vs always-pick-premium
|
|
117
|
-
2. Quality scores persist across sessions (memory)
|
|
118
|
-
3. Cache + parallel execution cut latency 3x
|
|
119
|
-
|
|
120
|
-
The whole thing is MIT on GitHub. Beats GPT-5 on RouterArena.
|
|
121
|
-
|
|
122
|
-
npx a3m-router benchmark --reproducible
|
|
123
|
-
```
|
|
124
|
-
|
|
125
|
-
### Template 4: "My provider is rate-limiting / failing"
|
|
126
|
-
**Trigger:** Someone complaining about OpenAI/Groq/Claude rate limits or failures
|
|
127
|
-
|
|
128
|
-
**Platform:** HN, Reddit, X
|
|
129
|
-
|
|
130
|
-
**Reply:**
|
|
131
|
-
```
|
|
132
|
-
A3M Router handles this automatically — fallback to next available provider when one fails or throttles.
|
|
133
|
-
|
|
134
|
-
47+ providers. Automatic failover. Same response format.
|
|
135
|
-
|
|
136
|
-
Open source: npx a3m-router route "try it"
|
|
137
|
-
```
|
|
138
|
-
|
|
139
|
-
### Template 5: "Looking for alternatives"
|
|
140
|
-
**Trigger:** Someone asking for alternatives to a specific tool or service
|
|
141
|
-
|
|
142
|
-
**Platform:** HN, Reddit, X
|
|
143
|
-
|
|
144
|
-
**Reply:**
|
|
145
|
-
```
|
|
146
|
-
If you're evaluating options, A3M Router is worth a look:
|
|
147
|
-
- MIT licensed (not source-available)
|
|
148
|
-
- RouterArena #1 (70.32)
|
|
149
|
-
- Same API as OpenAI SDK
|
|
150
|
-
- $0.047/1K vs $10/1K for GPT-5
|
|
151
|
-
|
|
152
|
-
npx a3m-router route "test" or npx a3m-router benchmark --reproducible
|
|
153
|
-
```
|
|
154
|
-
|
|
155
|
-
### Template 6: "Model comparison question"
|
|
156
|
-
**Trigger:** Someone asking which model is best for task X
|
|
157
|
-
|
|
158
|
-
**Platform:** HN, Reddit
|
|
159
|
-
|
|
160
|
-
**Reply:**
|
|
161
|
-
```
|
|
162
|
-
A3M Router actually solves this — it routes each query to the best model based on: complexity, cost budget, latency needs, and past quality scores.
|
|
163
|
-
|
|
164
|
-
You define 47+ providers and it picks automatically. Results tracked in memory so it gets smarter over time.
|
|
165
|
-
|
|
166
|
-
npx a3m-router recommend "coding" # See what it would pick
|
|
167
|
-
npx a3m-router route "test it" # Route a real query
|
|
168
|
-
```
|
|
169
|
-
|
|
170
|
-
### Template 7: Show HN / Launches (competitor)
|
|
171
|
-
**Trigger:** A competitor launches on HN or Product Hunt
|
|
172
|
-
|
|
173
|
-
**Platform:** HN comments
|
|
174
|
-
|
|
175
|
-
**Reply:**
|
|
176
|
-
```
|
|
177
|
-
Cool project! Curious how it compares on RouterArena. We got 70.32 — would love to see benchmarks head-to-head.
|
|
178
|
-
|
|
179
|
-
For anyone evaluating, A3M Router is open source (MIT) with a reproducible benchmark:
|
|
180
|
-
npx a3m-router benchmark --reproducible
|
|
181
|
-
```
|
|
182
|
-
|
|
183
|
-
---
|
|
184
|
-
|
|
185
|
-
## 4. Cadence
|
|
186
|
-
|
|
187
|
-
| Frequency | Action | Time |
|
|
188
|
-
|-----------|--------|------|
|
|
189
|
-
| **Daily (5 min)** | Check Google Alerts + F5bot notifications | Morning |
|
|
190
|
-
| **Daily (10 min)** | Scan HN for relevant threads | 8-10am ET |
|
|
191
|
-
| **Every 2 days** | Check Reddit for keyword matches | Random |
|
|
192
|
-
| **Weekly** | Write 1 educational post on DevTo/blog | Weekend |
|
|
193
|
-
| **Bi-weekly** | Review tracking table, adjust templates | Sunday |
|
|
194
|
-
|
|
195
|
-
### Golden Rules
|
|
196
|
-
1. **Never pitch in top-level posts** — only reply when relevant
|
|
197
|
-
2. **First sentence = empathy/understanding**, not self-promo
|
|
198
|
-
3. **Always include an action they can take** (a command to run)
|
|
199
|
-
4. **Never copy-paste** — adapt template to the specific conversation
|
|
200
|
-
5. **No URLs in first reply** unless asked (appears spammy)
|
|
201
|
-
|
|
202
|
-
---
|
|
203
|
-
|
|
204
|
-
## 5. Tracking Table
|
|
205
|
-
|
|
206
|
-
| Date | Platform | URL | Template | Reply | Clicks/Installs |
|
|
207
|
-
|------|----------|-----|----------|-------|-----------------|
|
|
208
|
-
| | | | | | |
|
|
209
|
-
| | | | | | |
|
|
210
|
-
|
|
211
|
-
Keep a running log. Review weekly to see which templates convert best.
|
|
212
|
-
|
|
213
|
-
---
|
|
214
|
-
|
|
215
|
-
## 6. Success Metric
|
|
216
|
-
|
|
217
|
-
Goal: **10 replies per week → 5 conversations → 1 GitHub star or npm install**
|
|
218
|
-
|
|
219
|
-
At this rate: 50 stars/month, 250 npm installs/month from social listening alone.
|