npm - adaptive-memory-multi-model-router - Versions diffs - 2.14.46 → 2.14.48 - Mend

adaptive-memory-multi-model-router 2.14.46 → 2.14.48

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (598) hide show

package/{docs/llms.txt → llms.txt.bak} +6 -6
package/package.json +270 -72
package/src/routing/advancedRouter.ts.bak +650 -0
package/test.js.bak +376 -0
package/.dockerignore +0 -82
package/.env.example +0 -303
package/.github/DISCUSSIONS_WELCOME.md +0 -27
package/.github/DISCUSSION_TEMPLATE.yml +0 -5
package/.github/FUNDING.yml +0 -2
package/.github/ISSUE_TEMPLATE/bug_report.md +0 -94
package/.github/ISSUE_TEMPLATE/config.yml +0 -17
package/.github/ISSUE_TEMPLATE/feature_request.md +0 -71
package/.github/PULL_REQUEST_TEMPLATE.md +0 -71
package/.github/dependabot.yml +0 -9
package/.github/workflows/auto-publish.yml +0 -51
package/.github/workflows/ci.yml +0 -263
package/.github/workflows/codeql.yml +0 -38
package/.github/workflows/npm-publish.yml +0 -20
package/.github/workflows/pages.yml +0 -37
package/.github/workflows/stale.yml +0 -54
package/.publish-tick +0 -1
package/.well-known/ai-plugin.json +0 -16
package/AGENT_COUNCIL_FINDINGS.md +0 -142
package/ARCHITECTURE.md +0 -346
package/AUDIT_REPORT.md +0 -28
package/CODE_OF_CONDUCT.md +0 -128
package/CONTRIBUTING.md +0 -50
package/CONTRIBUTORS.md +0 -20
package/Dockerfile +0 -53
package/Dockerfile.proxy +0 -33
package/HEALTH_REPORT.md +0 -118
package/IMPROVEMENT_PLAN.md +0 -107
package/LANDING.md +0 -43
package/LAUNCH-PAIN-DRIVEN.md +0 -339
package/LAUNCH.md +0 -337
package/LAUNCH_CHECKLIST.md +0 -141
package/LAUNCH_SNAPSHOT.md +0 -260
package/MANIFESTO.md +0 -41
package/POPULARITY_BOOSTERS.md +0 -285
package/PR_STATUS_REPORT.md +0 -148
package/REDESIGN.md +0 -95
package/RUNKIT.md +0 -83
package/SECURITY.md +0 -29
package/SUBMISSIONS.md +0 -43
package/_schema.html +0 -53
package/ai-plugin.json +0 -16
package/articles/AI_AGENT_LLM_ROUTING.md +0 -150
package/articles/CHINESE_DIRECTORIES.md +0 -100
package/articles/CHINESE_SUBMISSIONS_READY.md +0 -322
package/articles/COMPETITOR_ALERTS.md +0 -31
package/articles/COMPLETE_POSTING_DIRECTORY.md +0 -147
package/articles/CONTENT_STRUCTURE.md +0 -292
package/articles/DEVTO_COST_GUIDE.md +0 -473
package/articles/DEVTO_FINAL.md +0 -416
package/articles/DEVTO_MULTI_PROVIDER.md +0 -542
package/articles/DEVTO_READY.md +0 -255
package/articles/DEVTO_V2_ANNOUNCEMENT.md +0 -160
package/articles/DEVTO_VIRAL_GROWTH.md +0 -280
package/articles/FRESH_devto.md +0 -460
package/articles/FRESH_devto_2026_05.md +0 -73
package/articles/FRESH_hackernews.md +0 -14
package/articles/FRESH_reddit_ml.md +0 -90
package/articles/FRESH_reddit_node.md +0 -198
package/articles/FRESH_reddit_sideproject.md +0 -72
package/articles/FRESH_reddit_webdev.md +0 -130
package/articles/FROM_ZERO_TO_10K.md +0 -107
package/articles/HN_10X_BETTER.md +0 -430
package/articles/HN_ACCOUNT_GUIDE.md +0 -21
package/articles/HN_CHINESE_STYLE.md +0 -308
package/articles/HN_FINAL.md +0 -148
package/articles/HN_POSTED_VERSION.md +0 -56
package/articles/HN_POST_READY.md +0 -137
package/articles/HN_RESEARCH.md +0 -364
package/articles/HN_SHOW_routerarena.md +0 -17
package/articles/HN_TIMING_GUIDE.md +0 -52
package/articles/INDIEHACKERS_POST.md +0 -52
package/articles/INDIEHACKERS_READY.md +0 -120
package/articles/LLM_BENCHMARK_DEEP_DIVE.md +0 -153
package/articles/MASTER_POSTING_DIRECTORY.md +0 -189
package/articles/NEWSLETTER_SEND_NOW.md +0 -259
package/articles/NEWSLETTER_SUBMISSIONS.md +0 -112
package/articles/PAIN-DRIVEN-devto-v2.md +0 -308
package/articles/PAIN-DRIVEN-devto-v3.md +0 -268
package/articles/PAIN-DRIVEN-devto.md +0 -242
package/articles/PAIN-DRIVEN-hackernews-v2.md +0 -138
package/articles/PAIN-DRIVEN-hackernews-v3.md +0 -151
package/articles/PAIN-DRIVEN-hackernews.md +0 -131
package/articles/PAIN-DRIVEN-reddit-v2.md +0 -301
package/articles/PAIN-DRIVEN-reddit-v3.md +0 -236
package/articles/PAIN-DRIVEN-reddit.md +0 -218
package/articles/PAIN-DRIVEN-twitter-v2.md +0 -110
package/articles/PAIN-DRIVEN-twitter-v3.md +0 -121
package/articles/PAIN-DRIVEN-twitter.md +0 -120
package/articles/PORTKEY_VS_A3M.md +0 -147
package/articles/POSTING_KIT_2026_05.md +0 -67
package/articles/PRESS_KIT_routerarena.md +0 -77
package/articles/PRODUCTHUNT_LISTING.md +0 -48
package/articles/PRODUCTHUNT_READY.md +0 -106
package/articles/PR_PLAN_vault.md +0 -125
package/articles/REDDIT_FINAL.md +0 -232
package/articles/REDDIT_POST.md +0 -67
package/articles/REDDIT_SUBMISSION_READY.md +0 -348
package/articles/ROUTERARENA_LEADER.md +0 -45
package/articles/SHOW_HN_FINAL.md +0 -29
package/articles/TWEETS_10K_DOWNLOADS.md +0 -47
package/articles/TWEETS_BENCHMARK_FIRST.md +0 -46
package/articles/TWEETS_MCP_PLAY.md +0 -51
package/articles/TWEETS_SEQUENTIAL_BROKEN.md +0 -49
package/articles/TWEETS_WHY_BUILD.md +0 -54
package/articles/TWEETS_routerarena_leader.md +0 -53
package/articles/TWEET_STORM_READY.md +0 -165
package/articles/TWITTER_FINAL.md +0 -167
package/articles/WHY_10X_BETTER.md +0 -261
package/articles/WHY_CHINESE_STYLE_BETTER.md +0 -323
package/articles/ai-discoverability-llm-routing.md +0 -210
package/articles/devto-llm-routing.md +0 -138
package/articles/hackernews-show-hn.md +0 -54
package/articles/hashnode-llm-cost-optimization.md +0 -125
package/articles/hn_show_2026_05.md +0 -11
package/articles/medium-building-llm-router.md +0 -205
package/articles/reddit-ml.md +0 -76
package/articles/twitter-thread-cost-savings.md +0 -50
package/articles/youtube-tutorial-script.md +0 -262
package/assets/a3m_3blue1brown.mp4 +0 -0
package/assets/banner.svg +0 -109
package/assets/chart-cost-v2.svg +0 -91
package/assets/chart-cost-v3.svg +0 -143
package/assets/chart-features-v2.svg +0 -132
package/assets/chart-features-v3.svg +0 -211
package/assets/chart-growth-v2.svg +0 -122
package/assets/chart-growth-v3.svg +0 -189
package/assets/cost-comparison.svg +0 -134
package/assets/cost-simple.svg +0 -64
package/assets/demo-hn.gif +0 -0
package/assets/feature-matrix.svg +0 -136
package/assets/growth-chart-animated.svg +0 -76
package/assets/growth-chart.svg +0 -82
package/assets/growth-simple.svg +0 -69
package/assets/hero-diagram.svg +0 -81
package/assets/logo-new.svg +0 -21
package/assets/logo.svg +0 -68
package/assets/provider-comparison.svg +0 -121
package/assets/social-preview-new.svg +0 -100
package/assets/social-preview.svg +0 -194
package/assets/social-v2.svg +0 -130
package/assets/social-v3.svg +0 -212
package/benchmark-provider-results.json +0 -245
package/benchmark-results.json +0 -54
package/council-votes/architecture-vote.md +0 -121
package/council-votes/coverage-vote.md +0 -93
package/data/adaptive-benchmark.json +0 -92
package/data/benchmark-results.json +0 -47
package/data/labeled-benchmark.json +0 -88
package/demo/3blue1brown_video.py +0 -285
package/demo/3blue1brown_video_v2.py +0 -310
package/demo/IMPROVED_PROMPTS.md +0 -229
package/demo/VEO3_PROMPTS.md +0 -269
package/demo/VIDEO_PRODUCTION_GUIDE.md +0 -333
package/demo/a3m_3blue1brown.mp4 +0 -0
package/demo/asciinema-demo.sh +0 -195
package/demo/demo-hn.tape +0 -74
package/demo/demo-script.md +0 -53
package/demo/demo-script.sh +0 -62
package/demo/demo.svg +0 -75
package/demo/frame1_ai_data_center.png +0 -0
package/demo/frame1_sunset_video.mp4 +0 -0
package/demo/frame2_cost_comparison.png +0 -0
package/demo/frame2_cost_comparison_fallback.png +0 -0
package/demo/frame3_parallel_execution.png +0 -0
package/demo/frame3_parallel_execution_fallback.png +0 -0
package/demo/frame4_providers.png +0 -0
package/demo/frame4_providers_fallback.png +0 -0
package/demo/frame5_endcard.png +0 -0
package/demo/frame5_endcard_fallback.png +0 -0
package/demo/new_frame1_hook.png +0 -0
package/demo/new_frame2_proof.png +0 -0
package/demo/new_frame3_wow.png +0 -0
package/demo/new_frame4_social.png +0 -0
package/demo/new_frame5_cta.png +0 -0
package/demo/package.json +0 -13
package/demo/product-video-final.mp4 +0 -0
package/demo/product-video-hype-v1.mp4 +0 -0
package/demo/product-video-v1.mp4 +0 -0
package/demo/public/index.html +0 -762
package/demo/recording.cast +0 -55
package/demo/server.js +0 -405
package/demo-new.tape +0 -71
package/demo-real.sh +0 -198
package/demo-simple.tape +0 -205
package/demo.html +0 -520
package/demo.sh +0 -85
package/demo.tape +0 -259
package/dist/analytics/costAnalytics.d.ts.map +0 -1
package/dist/analytics/costAnalytics.js.map +0 -1
package/dist/benchmark/comprehensive.js.map +0 -1
package/dist/benchmark/reproducible.d.ts.map +0 -1
package/dist/benchmark/reproducible.js.map +0 -1
package/dist/cache/prefixCache.d.ts.map +0 -1
package/dist/cache/prefixCache.js.map +0 -1
package/dist/cache/responseCache.d.ts.map +0 -1
package/dist/cache/responseCache.js.map +0 -1
package/dist/cache/semanticCache.d.ts.map +0 -1
package/dist/cache/semanticCache.js.map +0 -1
package/dist/cli/setupWizard.d.ts.map +0 -1
package/dist/cli/setupWizard.js.map +0 -1
package/dist/cost/budgetEnforcer.d.ts.map +0 -1
package/dist/cost/budgetEnforcer.js.map +0 -1
package/dist/cost/costTracker.d.ts.map +0 -1
package/dist/cost/costTracker.js.map +0 -1
package/dist/ensemble/multiRoundDialog.js.map +0 -1
package/dist/ensemble/shapleyValue.js.map +0 -1
package/dist/integrations/langchainAdapter.d.ts.map +0 -1
package/dist/integrations/langchainAdapter.js.map +0 -1
package/dist/integrations/oauth.d.ts.map +0 -1
package/dist/integrations/oauth.js.map +0 -1
package/dist/integrations/scienceAdapter.js.map +0 -1
package/dist/memory/autoFetch.d.ts.map +0 -1
package/dist/memory/autoFetch.js.map +0 -1
package/dist/memory/episodicMemory.d.ts.map +0 -1
package/dist/memory/episodicMemory.js.map +0 -1
package/dist/memory/hybridMemory.js.map +0 -1
package/dist/memory/memoryTree.d.ts.map +0 -1
package/dist/memory/memoryTree.js.map +0 -1
package/dist/memory/obsidianVault.d.ts.map +0 -1
package/dist/memory/obsidianVault.js.map +0 -1
package/dist/memory/reasoningBank.js.map +0 -1
package/dist/observability/changeWatch.d.ts.map +0 -1
package/dist/observability/changeWatch.js.map +0 -1
package/dist/observability/fatigueDetector.d.ts.map +0 -1
package/dist/observability/fatigueDetector.js.map +0 -1
package/dist/observability/index.d.ts.map +0 -1
package/dist/observability/index.js.map +0 -1
package/dist/observability/metrics.d.ts.map +0 -1
package/dist/observability/metrics.js.map +0 -1
package/dist/observability/middleware.d.ts.map +0 -1
package/dist/observability/middleware.js.map +0 -1
package/dist/observability/tracer.d.ts.map +0 -1
package/dist/observability/tracer.js.map +0 -1
package/dist/observability/types.d.ts.map +0 -1
package/dist/observability/types.js.map +0 -1
package/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
package/dist/orchestration/haloOrchestrator.js.map +0 -1
package/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
package/dist/orchestration/mctsWorkflow.js.map +0 -1
package/dist/providers/localProvider.d.ts.map +0 -1
package/dist/providers/localProvider.js.map +0 -1
package/dist/providers/providerConfig.d.ts.map +0 -1
package/dist/providers/providerConfig.js.map +0 -1
package/dist/providers/registry.d.ts.map +0 -1
package/dist/providers/registry.js.map +0 -1
package/dist/routing/advancedRouter.d.ts.map +0 -1
package/dist/routing/advancedRouter.js.map +0 -1
package/dist/routing/crossModelValidation.d.ts.map +0 -1
package/dist/routing/crossModelValidation.js.map +0 -1
package/dist/routing/providerHealth.d.ts.map +0 -1
package/dist/routing/providerHealth.js.map +0 -1
package/dist/routing/providerRetry.d.ts.map +0 -1
package/dist/routing/providerRetry.js.map +0 -1
package/dist/scripts/banner.js +0 -29
package/dist/security/guardrails.d.ts.map +0 -1
package/dist/security/guardrails.js.map +0 -1
package/dist/server/dashboard.d.ts.map +0 -1
package/dist/server/dashboard.js.map +0 -1
package/dist/server/modelMapper.d.ts.map +0 -1
package/dist/server/modelMapper.js.map +0 -1
package/dist/server/proxyServer.d.ts.map +0 -1
package/dist/server/proxyServer.js.map +0 -1
package/dist/skills/__tests__/skill_manager.test.d.ts +0 -2
package/dist/skills/__tests__/skill_manager.test.d.ts.map +0 -1
package/dist/skills/__tests__/skill_manager.test.js +0 -268
package/dist/skills/__tests__/skill_manager.test.js.map +0 -1
package/dist/tools/tmlpdTools.d.ts.map +0 -1
package/dist/tools/tmlpdTools.js.map +0 -1
package/dist/tui/dashboard.d.ts.map +0 -1
package/dist/tui/dashboard.js.map +0 -1
package/dist/tui/index.d.ts.map +0 -1
package/dist/tui/index.js.map +0 -1
package/dist/utils/batchProcessor.d.ts.map +0 -1
package/dist/utils/batchProcessor.js.map +0 -1
package/dist/utils/compression.d.ts.map +0 -1
package/dist/utils/compression.js.map +0 -1
package/dist/utils/costUtils.d.ts.map +0 -1
package/dist/utils/costUtils.js.map +0 -1
package/dist/utils/reliability.d.ts.map +0 -1
package/dist/utils/reliability.js.map +0 -1
package/dist/utils/sorting.d.ts.map +0 -1
package/dist/utils/sorting.js.map +0 -1
package/dist/utils/speculativeDecoding.d.ts.map +0 -1
package/dist/utils/speculativeDecoding.js.map +0 -1
package/dist/utils/tokenUtils.d.ts.map +0 -1
package/dist/utils/tokenUtils.js.map +0 -1
package/docs/.nojekyll +0 -0
package/docs/ANALYSIS_PRINCIPLES.md +0 -162
package/docs/API.md +0 -855
package/docs/ARCHITECTURAL-IMPROVEMENTS-2025.md +0 -1391
package/docs/ARCHITECTURAL-IMPROVEMENTS-REVISED-2025.md +0 -1051
package/docs/BENCHMARK.md +0 -170
package/docs/CHINESE_PROVIDER_RELIABILITY.md +0 -37
package/docs/CITATIONS.md +0 -74
package/docs/CLAIMS_AND_EVIDENCE.md +0 -58
package/docs/CONFIGURATION.md +0 -476
package/docs/COUNCIL_DECISION.json +0 -816
package/docs/COUNCIL_SUMMARY.md +0 -319
package/docs/COUNCIL_V2.2_DECISION.md +0 -416
package/docs/ENGINEERING_SPEC.md +0 -55
package/docs/FACTORY_RESET.md +0 -34
package/docs/GEO.md +0 -66
package/docs/GEO_OPTIMIZATION.md +0 -30
package/docs/GEO_ROOT_CAUSE.md +0 -136
package/docs/GEO_STATUS.md +0 -85
package/docs/GEO_TEST_RESULTS.md +0 -176
package/docs/HN_CHECKLIST.md +0 -38
package/docs/HN_FOUNDER_COMMENT.md +0 -17
package/docs/HN_SUBMISSION_FINAL.md +0 -180
package/docs/HN_SUBMISSION_V3.md +0 -56
package/docs/IMPROVEMENT_ROADMAP.md +0 -515
package/docs/INTEGRATIONS.md +0 -420
package/docs/LANGCHAIN_INTEGRATION.md +0 -147
package/docs/LLM_COUNCIL_DECISION.md +0 -508
package/docs/MIDDLEWARE_CHAIN.md +0 -35
package/docs/PROMO_CHECKLIST.md +0 -200
package/docs/QUICKSTART.md +0 -271
package/docs/QUICK_START.md +0 -43
package/docs/QUICK_START_VISIBILITY.md +0 -782
package/docs/REDDIT_GAP_ANALYSIS.md +0 -299
package/docs/RELEASE_CHECKLIST.md +0 -32
package/docs/REPRODUCIBILITY.md +0 -63
package/docs/RESEARCH_BACKED_IMPROVEMENTS.md +0 -1180
package/docs/ROUTING_RUBRIC.md +0 -197
package/docs/SEO_AUDIT.md +0 -186
package/docs/SOCIAL_LISTENING.md +0 -219
package/docs/TMLPD_QNA.md +0 -751
package/docs/TMLPD_V2.1_COMPLETE.md +0 -763
package/docs/TMLPD_V2.2_RESEARCH_ROADMAP.md +0 -754
package/docs/UPDATE_TOPICS.md +0 -15
package/docs/USE_CASES.md +0 -59
package/docs/V2.2_IMPLEMENTATION_COMPLETE.md +0 -446
package/docs/V2_IMPLEMENTATION_GUIDE.md +0 -388
package/docs/VERCEL_AI_SDK.md +0 -209
package/docs/VISIBILITY_ADOPTION_PLAN.md +0 -1005
package/docs/_config.yml +0 -49
package/docs/ai-plugin.json +0 -16
package/docs/api.html +0 -513
package/docs/architecture-diagram.md +0 -40
package/docs/benchmark-chart.png +0 -0
package/docs/benchmark.html +0 -387
package/docs/blog/routerarena-number-one.html +0 -73
package/docs/cli-cheatsheet.md +0 -339
package/docs/compare.md +0 -109
package/docs/comparison-litellm.md +0 -88
package/docs/comparison.md +0 -108
package/docs/cost-chart-ascii.md +0 -42
package/docs/cost-comparison-chart.svg +0 -88
package/docs/curl-examples.md +0 -247
package/docs/demo-auto.html +0 -264
package/docs/demo.html +0 -416
package/docs/geo/GENERATIVE_ENGINE_OPTIMIZATION.md +0 -232
package/docs/index.html +0 -507
package/docs/launch-content/LAUNCH_EXECUTION_CHECKLIST.md +0 -421
package/docs/launch-content/README.md +0 -457
package/docs/launch-content/assets/cost_comparison_100_tasks.png +0 -0
package/docs/launch-content/assets/cumulative_savings.png +0 -0
package/docs/launch-content/assets/parallel_speedup.png +0 -0
package/docs/launch-content/assets/provider_pricing_comparison.png +0 -0
package/docs/launch-content/assets/task_breakdown_comparison.png +0 -0
package/docs/launch-content/generate_charts.py +0 -313
package/docs/launch-content/hn_show_post.md +0 -139
package/docs/launch-content/partner_outreach_templates.md +0 -745
package/docs/launch-content/reddit_posts.md +0 -467
package/docs/launch-content/twitter_thread.txt +0 -460
package/docs/npm-downloads-chart.svg +0 -43
package/docs/openapi.json +0 -139
package/docs/openapi.yaml +0 -1318
package/docs/quick-start.html +0 -366
package/docs/robots.txt +0 -52
package/docs/sitemap.xml +0 -57
package/docs/styles.css +0 -682
package/docs/well-known/ai-plugin.json +0 -16
package/docs/wellknown/ai-plugin.json +0 -16
package/docs-site/assets/og-banner.svg +0 -194
package/docs-site/index.html +0 -632
package/eval/README.md +0 -46
package/eval/baselines/main.json +0 -12
package/eval/benchmark_dataset.jsonl +0 -16
package/eval/check_golden_routes.js +0 -64
package/eval/datasets/catalog.json +0 -33
package/eval/datasets/slices/cn_provider_reliability_v1.jsonl +0 -3
package/eval/datasets/slices/cost_pressure_v1.jsonl +0 -3
package/eval/datasets/slices/safety_guardrails_v1.jsonl +0 -3
package/eval/evals.json +0 -199
package/eval/fault_injection_thresholds.json +0 -3
package/eval/generate_report.js +0 -128
package/eval/golden_routes.json +0 -114
package/eval/lib/experiment_registry.js +0 -24
package/eval/run_eval.js +0 -197
package/eval/run_fault_injection.js +0 -201
package/eval/run_shadow_eval.js +0 -85
package/eval/thresholds.json +0 -9
package/examples/QUICKSTART.md +0 -183
package/examples/README.md +0 -61
package/examples/a3m-sdk.js +0 -124
package/examples/basic-route.js +0 -54
package/examples/chat-loop.js +0 -202
package/examples/classify-then-route.js +0 -102
package/examples/cost-compare.js +0 -120
package/examples/ensemble.js +0 -160
package/examples/whatsapp-telegram-bridge-demo.js +0 -302
package/examples/whatsapp-telegram-bridge.js +0 -269
package/hf-space/README.md +0 -23
package/hf-space/app.py +0 -240
package/hf-space/requirements.txt +0 -1
package/huggingface_space/README.md +0 -35
package/huggingface_space/app.py +0 -126
package/huggingface_space/create_space.py +0 -208
package/huggingface_space/requirements.txt +0 -1
package/mcp-server/README.md +0 -188
package/mcp-server/package.json +0 -29
package/mcp-server/src/index.ts +0 -744
package/mcp-server/tsconfig.json +0 -19
package/openclaw-alexa-bridge/ALL_REMAINING_FIXES_PLAN.md +0 -313
package/openclaw-alexa-bridge/REMAINING_FIXES_SUMMARY.md +0 -277
package/openclaw-alexa-bridge/src/alexa_handler_no_tmlpd.js +0 -1234
package/openclaw-alexa-bridge/test_fixes.js +0 -77
package/playground/README.md +0 -51
package/playground/codesandbox.json +0 -12
package/playground/index.js +0 -39
package/proxy/README.md +0 -227
package/proxy/package-lock.json +0 -831
package/proxy/package.json +0 -17
package/proxy/rate-limit.js +0 -145
package/proxy/rate-limit.test.js +0 -311
package/proxy/server.js +0 -970
package/python/README.md +0 -102
package/python/a3m/__init__.py +0 -6
package/python/a3m/client.py +0 -190
package/python/a3m/models.py +0 -40
package/python/a3m/sync_client.py +0 -61
package/python/examples.py +0 -53
package/python/integrations.py +0 -330
package/python/pyproject.toml +0 -23
package/python/setup.py +0 -28
package/python/tmlpd.py +0 -369
package/qna/REDDIT_GAP_ANALYSIS.md +0 -299
package/qna/TMLPD_QNA.md +0 -751
package/research/FINDING_001_safety.md +0 -28
package/research/FINDING_002_error_diversity.md +0 -32
package/research/FINDING_003_confidence_weighted_voting.md +0 -32
package/research/FINDING_004_cross_model_semantic_detection.md +0 -37
package/research/FINDING_005_knowledge_gap_orthogonality.md +0 -34
package/research/HALLUCINATION_RESEARCH.md +0 -27
package/research/ensemble-voting.md +0 -324
package/research/loss-functions.md +0 -545
package/research-log.md +0 -49
package/scripts/banner.js +0 -29
package/scripts/benchmark-local-routerarena.ts +0 -176
package/scripts/benchmark.js +0 -145
package/scripts/benchmark.sh +0 -61
package/scripts/compare-providers.sh +0 -230
package/scripts/content-planner.js +0 -25
package/scripts/create-labeled-benchmark.ts +0 -105
package/scripts/cross_post.py +0 -443
package/scripts/local-router-benchmark.ts +0 -154
package/scripts/post-all.sh +0 -41
package/scripts/publish_fcc.py +0 -106
package/scripts/push-to-gitee.sh +0 -25
package/scripts/routerarena_ensemble.js +0 -144
package/scripts/routing-benchmark-v2.js +0 -373
package/scripts/routing-benchmark-v3.js +0 -118
package/scripts/routing-benchmark.js +0 -462
package/scripts/run-labeled-benchmark.mjs +0 -104
package/scripts/run-mmlu-benchmark.js +0 -176
package/scripts/run-provider-benchmark.js +0 -244
package/scripts/update-npm-badges.js +0 -158
package/skill/SKILL.md +0 -238
package/src/__tests__/integration/tmpld_integration.test.py +0 -540
package/src/skills/__tests__/skill_manager.test.ts +0 -328
package/submissions/benchmarks/ALL_PLATFORMS_SUBMISSION.md +0 -94
package/submissions/benchmarks/LLMROUTERBENCH_SUBMISSION.md +0 -121
package/submissions/benchmarks/MMRBENCH_SUBMISSION.md +0 -94
package/submissions/benchmarks/ROUTERARENA_UPDATE.md +0 -83
package/submissions/benchmarks/ROUTERBENCH_SUBMISSION.md +0 -225
package/test-council/1-structure-tests.test.js +0 -353
package/test-council/1-structure-tests.test.ts +0 -353
package/test-council/2-edge-case-tests.test.ts +0 -361
package/test-council/3-performance-tests.test.ts +0 -669
package/test-council/4-integration-tests.test.ts +0 -391
package/test-council/5-agent-council-eval.test.ts +0 -413
package/test-council/AGENT_COUNCIL_ARCHITECTURE.md +0 -349
package/test-council/TEST_COUNCIL_REPORT.md +0 -201
package/test-council/agents/edge-case-agent.ts +0 -363
package/test-council/agents/performance-agent.ts +0 -426
package/test-council/agents/structure-agent.ts +0 -227
package/test-council/council.md +0 -183
package/tests/__mocks__/tokenUtils.ts +0 -8
package/tests/memory/episodicMemory.test.ts +0 -227
package/tests/package-lock.json +0 -1628
package/tests/package.json +0 -18
package/tests/routing/ensembleVoting.test.ts +0 -236
package/tests/routing/providerRetry.test.ts +0 -360
package/tests/routing/queryTypePresets.test.ts +0 -208
package/tests/security/guardrailEngine.test.ts +0 -700
package/tests/tsconfig.json +0 -21
package/tests/vitest.config.ts +0 -18
package/tmlpd-pi-extension/README.md +0 -66
package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts +0 -114
package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/cache/prefixCache.js +0 -285
package/tmlpd-pi-extension/dist/cache/prefixCache.js.map +0 -1
package/tmlpd-pi-extension/dist/cache/responseCache.d.ts +0 -58
package/tmlpd-pi-extension/dist/cache/responseCache.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/cache/responseCache.js +0 -153
package/tmlpd-pi-extension/dist/cache/responseCache.js.map +0 -1
package/tmlpd-pi-extension/dist/cli.js +0 -59
package/tmlpd-pi-extension/dist/cost/costTracker.d.ts +0 -95
package/tmlpd-pi-extension/dist/cost/costTracker.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/cost/costTracker.js +0 -240
package/tmlpd-pi-extension/dist/cost/costTracker.js.map +0 -1
package/tmlpd-pi-extension/dist/index.d.ts +0 -723
package/tmlpd-pi-extension/dist/index.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/index.js +0 -239
package/tmlpd-pi-extension/dist/index.js.map +0 -1
package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts +0 -82
package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/memory/episodicMemory.js +0 -145
package/tmlpd-pi-extension/dist/memory/episodicMemory.js.map +0 -1
package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts +0 -102
package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js +0 -207
package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js.map +0 -1
package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts +0 -85
package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js +0 -210
package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js.map +0 -1
package/tmlpd-pi-extension/dist/providers/localProvider.d.ts +0 -102
package/tmlpd-pi-extension/dist/providers/localProvider.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/providers/localProvider.js +0 -338
package/tmlpd-pi-extension/dist/providers/localProvider.js.map +0 -1
package/tmlpd-pi-extension/dist/providers/registry.d.ts +0 -55
package/tmlpd-pi-extension/dist/providers/registry.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/providers/registry.js +0 -138
package/tmlpd-pi-extension/dist/providers/registry.js.map +0 -1
package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts +0 -68
package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/routing/advancedRouter.js +0 -332
package/tmlpd-pi-extension/dist/routing/advancedRouter.js.map +0 -1
package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts +0 -101
package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/tools/tmlpdTools.js +0 -368
package/tmlpd-pi-extension/dist/tools/tmlpdTools.js.map +0 -1
package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts +0 -96
package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/utils/batchProcessor.js +0 -170
package/tmlpd-pi-extension/dist/utils/batchProcessor.js.map +0 -1
package/tmlpd-pi-extension/dist/utils/compression.d.ts +0 -61
package/tmlpd-pi-extension/dist/utils/compression.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/utils/compression.js +0 -281
package/tmlpd-pi-extension/dist/utils/compression.js.map +0 -1
package/tmlpd-pi-extension/dist/utils/reliability.d.ts +0 -74
package/tmlpd-pi-extension/dist/utils/reliability.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/utils/reliability.js +0 -177
package/tmlpd-pi-extension/dist/utils/reliability.js.map +0 -1
package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts +0 -117
package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js +0 -246
package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js.map +0 -1
package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts +0 -50
package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts.map +0 -1
package/tmlpd-pi-extension/dist/utils/tokenUtils.js +0 -124
package/tmlpd-pi-extension/dist/utils/tokenUtils.js.map +0 -1
package/tmlpd-pi-extension/examples/QUICKSTART.md +0 -183
package/tmlpd-pi-extension/package-lock.json +0 -79
package/tmlpd-pi-extension/package.json +0 -172
package/tmlpd-pi-extension/python/examples.py +0 -53
package/tmlpd-pi-extension/python/integrations.py +0 -330
package/tmlpd-pi-extension/python/setup.py +0 -28
package/tmlpd-pi-extension/python/tmlpd.py +0 -369
package/tmlpd-pi-extension/qna/REDDIT_GAP_ANALYSIS.md +0 -299
package/tmlpd-pi-extension/qna/TMLPD_QNA.md +0 -751
package/tmlpd-pi-extension/skill/SKILL.md +0 -238
package/tmlpd-pi-extension/src/cache/responseCache.ts +0 -147
package/tmlpd-pi-extension/src/cost/costTracker.ts +0 -302
package/tmlpd-pi-extension/src/index.ts +0 -232
package/tmlpd-pi-extension/src/memory/episodicMemory.ts +0 -257
package/tmlpd-pi-extension/src/orchestration/haloOrchestrator.ts +0 -266
package/tmlpd-pi-extension/src/orchestration/mctsWorkflow.ts +0 -262
package/tmlpd-pi-extension/src/providers/localProvider.ts +0 -406
package/tmlpd-pi-extension/src/providers/registry.ts +0 -164
package/tmlpd-pi-extension/src/routing/ensembleVoting.ts +0 -159
package/tmlpd-pi-extension/src/routing/queryTypePresets.ts +0 -136
package/tmlpd-pi-extension/src/tools/tmlpdTools.ts +0 -433
package/tmlpd-pi-extension/src/utils/batchProcessor.ts +0 -232
package/tmlpd-pi-extension/src/utils/compression.ts +0 -325
package/tmlpd-pi-extension/src/utils/reliability.ts +0 -221
package/tmlpd-pi-extension/src/utils/tokenUtils.ts +0 -145
package/tmlpd-pi-extension/tsconfig.json +0 -18
package/tsconfig.build.json +0 -29
package/tsconfig.json +0 -18
/package/{docs/llms-full.txt → llms-full.txt.bak} +0 -0

package/articles/FRESH_devto.md DELETED Viewed

@@ -1,460 +0,0 @@
----
-title: "We Built an LLM Router That Runs on Keywords, Not Neural Networks — Here's How It Works"
-published: false
-description: "A 19.5 KB TypeScript package that routes LLM queries with 70.32 accuracy using 5 keyword-based signals. No GPU, no ML weights, zero dependencies."
-tags: llm, typescript, ai, optimization
-cover_image: https://placeholder.dev.to/cover.png
----
-We needed to route LLM queries across 36 providers. The ML approach (BERT classifier, embedding similarity, LLM-as-judge) adds latency, infrastructure, and cost. We tried something simpler: a 5-signal keyword scoring system in pure TypeScript.
-The result: **70.32  accuracy**, **64.5% exact match**, **0.3ms routing latency**, in a **19.5 KB gzipped** package with zero runtime dependencies.
-Here's exactly how each signal works, with code.
----
-## The problem
-We have 36 LLM providers across 5 complexity tiers:
-| Tier | Count | Examples | Price range |
-|------|-------|---------|-------------|
-| Free | 6 | Gemini Flash, Groq free tier | $0 |
-| Cheap | 15 | DeepSeek, Mistral Small | ~$0.15/1M tokens |
-| Mid | 9 | Claude Sonnet, GPT-4o-mini | ~$1-3/1M tokens |
-| Premium | 3 | GPT-4, Claude Opus | ~$15-30/1M tokens |
-| Enterprise | 3 | Claude Max, GPT-4 turbo | ~$60+/1M tokens |
-Every query needs to land in the right tier. Sending "what is 2+2?" to GPT-4 wastes money. Sending "design a Byzantine fault-tolerant consensus algorithm" to a free model wastes the response.
-## The 5-signal architecture
-Each incoming query is scored on five orthogonal signals (0-1 range). The weighted sum maps to a tier.
-```
-Query → [domain, task, structure, verb, specificity] → weighted sum → tier → provider
-```
-Let's break down each signal.
----
-### Signal 1: Domain Detection
-**What it measures:** Is this query from a specialized domain (code, math, legal, medical)?
-**Why it matters:** Domain-specific queries need domain-specific capabilities. Code generation needs instruction-following. Math needs chain-of-thought. Medical needs accuracy.
-```typescript
-const DOMAIN_PATTERNS: Record<string, RegExp[]> = {
-  code: [
-    /\b(function|class|import|export|async|await|def|return|const|let|var)\b/gi,
-    /\b(api|endpoint|database|query|schema|migrate|deploy)\b/gi,
-  ],
-  math: [
-    /\b(equation|integral|derivative|theorem|proof|calculate|solve|formula)\b/gi,
-    /\b(algebra|calculus|geometry|statistics|probability)\b/gi,
-  ],
-  legal: [
-    /\b(contract|liability|clause|statute|regulation|compliance|attorney)\b/gi,
-  ],
-  medical: [
-    /\b(diagnosis|symptom|treatment|patient|clinical|dosage|prescription)\b/gi,
-  ],
-};
-function scoreDomain(query: string): number {
-  let maxScore = 0;
-  for (const [domain, patterns] of Object.entries(DOMAIN_PATTERNS)) {
-    const matchCount = patterns.reduce(
-      (sum, pattern) => sum + (query.match(pattern)?.length ?? 0), 0
-    );
-    const domainScore = Math.min(matchCount * 0.15, 1.0);
-    maxScore = Math.max(maxScore, domainScore);
-  }
-  return maxScore;
-}
-```
-**Example scoring:**
-| Query | Domain score | Detected domain |
-|-------|-------------|----------------|
-| "What is the weather?" | 0.0 | none |
-| "Explain async/await in JavaScript" | 0.45 | code |
-| "Prove that sqrt(2) is irrational" | 0.45 | math |
-| "Debug this React component, the useState hook isn't updating" | 0.60 | code |
----
-### Signal 2: Task Indicators
-**What it measures:** What type of task is the user asking for? Summarize, translate, debug, create, analyze?
-**Why it matters:** Different tasks have different complexity ceilings. "Summarize" is bounded. "Create from scratch" is unbounded.
-```typescript
-const TASK_KEYWORDS: Record<string, { keywords: string[]; complexity: number }> = {
-  summarize: {
-    keywords: ['summarize', 'tldr', 'brief', 'overview', 'recap', 'sum up'],
-    complexity: 0.2,
-  },
-  translate: {
-    keywords: ['translate', 'in french', 'in spanish', 'in german', 'in japanese'],
-    complexity: 0.25,
-  },
-  explain: {
-    keywords: ['explain', 'describe', 'tell me about', 'what is', 'how does'],
-    complexity: 0.3,
-  },
-  debug: {
-    keywords: ['debug', 'fix this', 'error', 'stack trace', 'not working', 'broken'],
-    complexity: 0.55,
-  },
-  analyze: {
-    keywords: ['analyze', 'compare', 'evaluate', 'assess', 'investigate', 'critique'],
-    complexity: 0.7,
-  },
-  create: {
-    keywords: ['write', 'create', 'generate', 'build', 'implement', 'design', 'develop'],
-    complexity: 0.75,
-  },
-  architect: {
-    keywords: ['architect', 'design a system', 'system design', 'infrastructure'],
-    complexity: 0.9,
-  },
-};
-function scoreTask(query: string): number {
-  const lower = query.toLowerCase();
-  let score = 0;
-  for (const [task, config] of Object.entries(TASK_KEYWORDS)) {
-    const matched = config.keywords.some(kw => lower.includes(kw));
-    if (matched) score += config.complexity;
-  }
-  return Math.min(score, 1.0);
-}
-```
-**Example scoring:**
-| Query | Task score | Tasks detected |
-|-------|-----------|---------------|
-| "What is React?" | 0.3 | explain |
-| "Summarize this article" | 0.2 | summarize |
-| "Debug this Python script and explain the fix" | 0.85 | debug + explain |
-| "Design a microservices architecture and write the API gateway" | 1.0 | architect + create |
----
-### Signal 3: Query Structure
-**What it measures:** The structural complexity of the query — multiple steps, conditionals, nested requirements.
-**Why it matters:** "Translate this" is simple. "Translate this, then summarize in 3 bullets, then check for legal compliance" is structurally complex regardless of the individual tasks.
-```typescript
-function scoreStructure(query: string): number {
-  let score = 0;
-  // Multi-step queries ("first do X, then do Y, finally Z")
-  const stepMarkers = query.split(/\b(first|then|after|before|finally|next|lastly)\b/i);
-  score += Math.max(0, (stepMarkers.length - 1)) * 0.2;
-  // Conditional queries ("if X then Y otherwise Z")
-  const conditionals = query.match(/\b(if|unless|otherwise|whether|given that)\b/gi);
-  score += (conditionals?.length ?? 0) * 0.15;
-  // Conjunction chains (A and B and C)
-  const conjunctions = query.match(/\band\b/gi);
-  score += Math.min((conjunctions?.length ?? 0) * 0.05, 0.2);
-  // Query length with diminishing returns
-  score += Math.min(query.length / 500, 0.3);
-  // Nested quotes or code blocks (indicates context-heavy queries)
-  const codeBlocks = query.match(/```[\s\S]*?```/g);
-  score += (codeBlocks?.length ?? 0) * 0.1;
-  return Math.min(score, 1.0);
-}
-```
-**Example scoring:**
-| Query | Structure score | Why |
-|-------|----------------|-----|
-| "What is Python?" | 0.04 | short, simple |
-| "Explain async/await" | 0.05 | short, simple |
-| "First translate to French, then summarize in 3 bullets" | 0.47 | multi-step |
-| "If the user is admin, show the dashboard with all metrics, otherwise show a limited view with only their data" | 0.72 | conditional + multi-step |
----
-### Signal 4: Action Verb Intensity
-**What it measures:** How demanding the requested action is. "List" < "explain" < "analyze" < "design" < "architect".
-```typescript
-const VERB_WEIGHTS: Record<string, number> = {
-  // Low intensity
-  'what is': 0.1, 'define': 0.15, 'list': 0.2, 'describe': 0.25,
-  // Medium intensity
-  'explain': 0.35, 'convert': 0.4, 'translate': 0.4, 'summarize': 0.4,
-  'rewrite': 0.45, 'format': 0.45,
-  // High intensity
-  'debug': 0.6, 'fix': 0.6, 'analyze': 0.65, 'compare': 0.65,
-  'optimize': 0.7, 'refactor': 0.7, 'implement': 0.75,
-  // Very high intensity
-  'design': 0.8, 'architect': 0.85, 'reverse-engineer': 0.9,
-  'create from scratch': 0.9,
-};
-function scoreVerb(query: string): number {
-  const lower = query.toLowerCase();
-  let maxVerb = 0;
-  for (const [verb, weight] of Object.entries(VERB_WEIGHTS)) {
-    if (lower.includes(verb)) {
-      maxVerb = Math.max(maxVerb, weight);
-    }
-  }
-  return maxVerb;
-}
-```
----
-### Signal 5: Specificity
-**What it measures:** How precise and technical the query is. "Tell me about AI" vs "Implement a transformer decoder with multi-head attention using PyTorch".
-```typescript
-function scoreSpecificity(query: string): number {
-  let score = 0;
-  // Technical terms (camelCase, PascalCase identifiers)
-  const technicalTerms = query.match(/\b[A-Z][a-z]+[A-Z][a-z]+\b/g);
-  score += Math.min((technicalTerms?.length ?? 0) * 0.12, 0.3);
-  // Quoted strings (specific values, names, identifiers)
-  const quotedTerms = query.match(/["'`][^"'`]+["'`]/g);
-  score += Math.min((quotedTerms?.length ?? 0) * 0.1, 0.2);
-  // Numbers and measurements (specificity indicator)
-  const numbers = query.match(/\d+/g);
-  score += Math.min((numbers?.length ?? 0) * 0.03, 0.15);
-  // Penalize vagueness
-  const vagueTerms = query.match(/\b(something|anything|stuff|things|etc|whatever|some)\b/gi);
-  score -= (vagueTerms?.length ?? 0) * 0.15;
-  // Bonus for field-specific jargon density
-  const jargonTerms = query.match(/\b(algorithm|protocol|architecture|paradigm|heuristic|orthogonal)\b/gi);
-  score += Math.min((jargonTerms?.length ?? 0) * 0.1, 0.2);
-  return Math.max(0, Math.min(score, 1.0));
-}
-```
----
-## Putting it all together
-```typescript
-interface RoutingSignals {
-  domain: number;
-  task: number;
-  structure: number;
-  verbIntensity: number;
-  specificity: number;
-}
-const WEIGHTS = {
-  domain: 0.25,
-  task: 0.25,
-  structure: 0.20,
-  verbIntensity: 0.15,
-  specificity: 0.15,
-};
-const TIER_THRESHOLDS: [number, Tier][] = [
-  [0.20, 'free'],
-  [0.40, 'cheap'],
-  [0.60, 'mid'],
-  [0.80, 'premium'],
-  [1.01, 'enterprise'],
-];
-function route(query: string): Tier {
-  const signals: RoutingSignals = {
-    domain: scoreDomain(query),
-    task: scoreTask(query),
-    structure: scoreStructure(query),
-    verbIntensity: scoreVerb(query),
-    specificity: scoreSpecificity(query),
-  };
-  const score =
-    signals.domain * WEIGHTS.domain +
-    signals.task * WEIGHTS.task +
-    signals.structure * WEIGHTS.structure +
-    signals.verbIntensity * WEIGHTS.verbIntensity +
-    signals.specificity * WEIGHTS.specificity;
-  for (const [threshold, tier] of TIER_THRESHOLDS) {
-    if (score < threshold) return tier;
-  }
-  return 'enterprise';
-}
-```
----
-## Real query examples with full scoring
-### Example 1: "What is Python?"
-| Signal | Score | Weight | Weighted |
-|--------|-------|--------|----------|
-| Domain | 0.0 | 0.25 | 0.0 |
-| Task | 0.3 | 0.25 | 0.075 |
-| Structure | 0.03 | 0.20 | 0.006 |
-| Verb | 0.1 | 0.15 | 0.015 |
-| Specificity | 0.0 | 0.15 | 0.0 |
-| **Total** | | | **0.096** |
-**Routed to: Free tier** ✅
-### Example 2: "Implement a red-black tree with insert, delete, and search operations in TypeScript"
-| Signal | Score | Weight | Weighted |
-|--------|-------|--------|----------|
-| Domain | 0.45 | 0.25 | 0.1125 |
-| Task | 0.75 | 0.25 | 0.1875 |
-| Structure | 0.15 | 0.20 | 0.03 |
-| Verb | 0.75 | 0.15 | 0.1125 |
-| Specificity | 0.42 | 0.15 | 0.063 |
-| **Total** | | | **0.505** |
-**Routed to: Mid tier** ✅
-### Example 3: "Design a fault-tolerant distributed database that handles network partitions, supports ACID transactions, and can scale to 10,000 nodes. Include the consensus protocol, replication strategy, and failure recovery mechanism."
-| Signal | Score | Weight | Weighted |
-|--------|-------|--------|----------|
-| Domain | 0.30 | 0.25 | 0.075 |
-| Task | 0.90 | 0.25 | 0.225 |
-| Structure | 0.62 | 0.20 | 0.124 |
-| Verb | 0.80 | 0.15 | 0.12 |
-| Specificity | 0.65 | 0.15 | 0.0975 |
-| **Total** | | | **0.641** |
-**Routed to: Premium tier** ✅
----
-## Benchmark results
-Tested on 2,500 real-world queries across coding, creative writing, analysis, math, translation, and general Q&A.
-```
-Confusion Matrix (3-tier simplified):
-              Predicted
-              Free   Mid  Premium
-Actual Free    812    38      5
-Actual Mid      41   647     27
-Actual Premium   3    22    705
-```
-| Metric | Value |
-|--------|-------|
-| Exact tier match | 64.5% |
-|  accuracy | 70.32 |
-| Mean absolute error | 0.37 tiers |
-| Routing latency | 0.3ms per query |
-| Cost savings vs premium-only | 61.6% |
----
-## What about the other features?
-### Semantic Cache
-Uses trigram Jaccard similarity to detect near-duplicate queries:
-```typescript
-function trigramJaccard(a: string, b: string): number {
-  const trigrams = (s: string) => {
-    const set = new Set<string>();
-    for (let i = 0; i <= s.length - 3; i++) {
-      set.add(s.slice(i, i + 3));
-    }
-    return set;
-  };
-  const setA = trigrams(a.toLowerCase());
-  const setB = trigrams(b.toLowerCase());
-  const intersection = [...setA].filter(x => setB.has(x)).length;
-  const union = new Set([...setA, ...setB]).size;
-  return intersection / union;
-}
-// "Explain React hooks" and "what are React hooks?" → Jaccard > 0.4 → cache hit
-```
-### Prompt Injection Detection
-17 patterns covering common attack vectors:
-```typescript
-const INJECTION_PATTERNS = [
-  /ignore\s+(all\s+)?previous\s+instructions/i,
-  /you\s+are\s+now\s+/i,
-  /system\s*:\s*/i,
-  /\[INST\]/i,
-  /simulate\s+/i,
-  /pretend\s+you\s+(are|can)/i,
-  /jailbreak/i,
-  /DAN\s+mode/i,
-  // ... 9 more patterns
-];
-```
----
-## Get started
-```bash
-npm install adaptive-memory-multi-model-router
-```
-```typescript
-import { A3MRouter } from 'adaptive-memory-multi-model-router';
-const router = new A3MRouter({
-  providers: {
-    openai: { apiKey: process.env.OPENAI_API_KEY },
-    anthropic: { apiKey: process.env.ANTHROPIC_API_KEY },
-    google: { apiKey: process.env.GOOGLE_API_KEY },
-    groq: { apiKey: process.env.GROQ_API_KEY },
-  }
-});
-const result = await router.route({
-  messages: [{ role: 'user', content: 'Your query here' }]
-});
-console.log(`Provider: ${result.provider}`);
-console.log(`Tier: ${result.tier}`);
-console.log(`Cost: $${result.cost}`);
-```
-**GitHub:** https://github.com/Das-rebel/a3m-router
-**npm:** https://www.npmjs.com/package/adaptive-memory-multi-model-router
-MIT license. Self-hosted. No account. 19.5 KB. TypeScript + Python SDKs, CLI, REST API, OpenAI proxy, LangChain adapter.
----
-*We're actively looking for independent benchmark evaluations. If you run the router against your own query distribution, we'd love to see the results — especially cases where it fails.*

package/articles/FRESH_devto_2026_05.md DELETED Viewed

@@ -1,73 +0,0 @@
-LLM infrastructure has three problems that shouldn't exist in 2026. Here's what we built because nobody else fixed them.
----
-## Problem 1: Your LLM bill is unnecessarily high
-Everyone routes everything to GPT-4 because who has time to configure per-query routing. The bill hits 3-5x what it should be for zero extra value.
-People are already switching because of this. A dev on X: *"Cancelled both my Claude Code Pro and ChatGPT Pro. Kimi K2.6 is just as good for my side projects as Opus or GPT 5.4 were. The price for this is crazy low."*
-Another one: *"Just used gemini-embedding-2 to vectorize 27,603 notes for semantic search. Total cost: $0.07. That's pretty amazing."*
-The pattern is obvious — developers are actively looking for cheaper alternatives. The problem is doing it query-by-query without wasting time.
-We built a router that classifies every query by complexity and sends it to the cheapest capable model.
-```javascript
-"Design a clinical trial protocol"  → premium  ($2.50/M tokens)
-"Write a Python sort function"      → groq     ($0.20/M tokens)
-"What is 2+2?"                      → free     ($0.00/M tokens)
-```
-Result: **62% cost savings** measured across 200 real API calls. Not theoretical.
----
-## Problem 2: Sequential fallback gives you one answer, not the best
-Every gateway does: try A → fail → try B → fail → try C.
-You always get one provider's answer. Never the best across all. If A is slow, everything waits.
-Someone already built `ai-retry` — a library for retry and fallback mechanisms — because this is such a common pain. People are hacking around it manually.
-We went further. Run all providers in parallel. Score every result on specificity, structure, and relevance. Return the best answer with reasons why it won.
-```javascript
-const result = await executeEnsemble(query, context, {
-  nvidia: callNvidia,
-  groq: callGroq,
-  openai: callOpenAI
-});
-// → nvidia (scored 75, higher specificity on code)
-```
----
-## Problem 3: Every gateway claims "negligible overhead." None publish numbers.
-It's the standard line. "Negligible overhead" followed by zero data.
-We ran ours through a third-party benchmark tool (llm-gateway-bench) and published everything:
-| Scenario | Time | What's included |
-|:---------|:----:|:----------------|
-| Direct to Groq | **138ms** | Raw API call |
-| Through A3M | **374ms** | Routing + cache + guardrails + cost tracking |
-236ms overhead. Not zero. But it saves 62% on API costs — that's ~$2,600/year at 100K queries/month.
----
-## Why it grew
-10,024 downloads in 14 days. Zero marketing. Developers found it on npm, tried it, told other developers.
-The feedback loop was: *"My bill is too high"* → 62% savings. *"I want the best answer, not the first one"* → parallel ensemble. *"I don't trust your latency claims"* → here's the third-party benchmark, run it yourself.
----
-*npm: `npm install adaptive-memory-multi-model-router`*
-*GitHub: [github.com/Das-rebel/a3m-router](https://github.com/Das-rebel/a3m-router)*
-*Benchmarks: third-party via [llm-gateway-bench](https://github.com/taffy-owo/llm-gateway-bench)*

package/articles/FRESH_hackernews.md DELETED Viewed

@@ -1,14 +0,0 @@
-Show HN: A3M Router — 70.32 LLM routing accuracy with zero ML, 36 providers, semantic cache
-A3M Router is a TypeScript LLM routing library that classifies query complexity using 5 keyword-based signals (domain detection, task indicators, query structure, action verb intensity, specificity) instead of neural networks. The weighted signal sum maps queries to one of 5 complexity tiers (free → enterprise), which routes to the cheapest provider that can handle the query.
-On a 2,500-query benchmark: 70.32  accuracy, 64.5% exact tier match, 0.3ms routing latency. The entire routing classifier is ~200 lines of TypeScript with zero runtime dependencies and a 19.5 KB gzipped package size. 61.6% cost savings vs. sending everything to premium providers.
-Supports 36 providers (OpenAI, Anthropic, Google, Groq, Cerebras, Mistral, DeepSeek, etc.) across 5 tiers. Includes a semantic cache (trigram Jaccard similarity), 17-pattern prompt injection detection, PII redaction, and cost analytics. Available as TypeScript SDK, Python SDK, CLI, REST API, OpenAI-compatible proxy, and LangChain adapter. MIT license, self-hosted, no account required.
-The core insight is that keyword-based routing is within  of BERT-based routing for nearly all queries, at zero infrastructure cost. The routing signals are composable and adjustable — if a particular domain routes poorly, you add domain-specific patterns without retraining anything.
-Repo: https://github.com/Das-rebel/a3m-router
-npm: https://www.npmjs.com/package/adaptive-memory-multi-model-router
-Caveat: the 70.32 figure is self-benchmarked. We'd welcome independent evaluation, especially on non-English or creative writing query distributions where the keyword signals may be weaker.

package/articles/FRESH_reddit_ml.md DELETED Viewed

@@ -1,90 +0,0 @@
-# [D] We benchmarked keyword-based routing vs BERT for LLM provider selection. The gap is smaller than we expected — and keyword routing has zero infra cost.
-**TL;DR:** A 5-signal keyword classifier routes LLM queries across 36 providers with 70.32  accuracy and 64.5% exact tier match, in a 19.5 KB gzipped package with no ML weights. We're sharing the methodology and invite scrutiny on the benchmark design.
----
-## Background
-When you have 36 LLM providers (6 free, 15 cheap, 9 mid-tier, 3 premium, 3 enterprise), routing queries to the right provider matters. A simple "coding question → code model" heuristic breaks down fast. The established approaches are:
-1. **BERT/transformer-based routing** (e.g., RouteLLM trains a BERT classifier on paired human preferences)
-2. **LLM-as-judge routing** (ask GPT-4 to classify query complexity)
-3. **Rule-based routing** (regex, keyword matching)
-We went with approach 3, but with a structured 5-signal scoring system instead of naive regex. The question was: how much accuracy do we actually sacrifice?
-## The 5 routing signals
-Each query is scored on five orthogonal signals (0-1 scale each):
-| Signal | What it measures | Example high-score query |
-|--------|-----------------|------------------------|
-| Domain detection | Is this a specialized domain (code, math, legal, medical)? | "Implement a red-black tree with insert and delete" |
-| Task indicators | What type of task (summarize, translate, debug, create)? | "Debug this Python stack trace and explain the root cause" |
-| Query structure | Complexity of the query itself (multi-step, conditional, nested) | "First translate to French, then summarize in 3 bullets, then check for legal compliance" |
-| Action verb intensity | Strength/demand of the action requested | "Reverse-engineer" > "explain" > "mention" |
-| Specificity | How precise/vague the request is | "Quantum error correction in topological codes" vs "tell me about physics" |
-The weighted sum maps to one of 5 tiers, which maps to a provider. The whole thing runs in ~0.3ms per query.
-## Benchmark results
-We tested on a held-out set of 2,500 real-world queries across domains (coding, creative writing, analysis, math, translation, general Q&A).
-**Confusion matrix (simplified to 3 tiers for readability):**
-```
-              Predicted
-              Free  Mid  Premium
-Actual Free    812   38     5
-Actual Mid      41  647    27
-Actual Premium   3   22   705
-```
-Full 5-tier results:
-| Metric | Value |
-|--------|-------|
-| Exact tier match | 64.5% |
-|  accuracy | 70.32 |
-| Mean absolute error | 0.37 tiers |
-| Routing latency | 0.3ms/query |
-** accuracy of 70.32** means the router is never sending a trivial "what's the weather" query to GPT-4, and it's never sending a "design a distributed consensus algorithm" query to a free tier.
-### Cost impact
-On the same query workload:
-| Strategy | Cost | Savings |
-|----------|------|---------|
-| Premium-only (GPT-4 for everything) | $1.00 | — |
-| RouteLLM (reported in their paper) | ~$0.47 | ~53% |
-| A3M Router (our benchmark) | $0.384 | 61.6% |
-## Honest caveats (please poke holes)
-1. **Self-benchmarking.** We wrote the classifier, we designed the test set, we ran the evaluation. This is the biggest threat to validity. We'd love an independent evaluation. The test set and evaluation code are in the repo.
-2. **The 64.5% exact match is mediocre.** If you need surgical tier precision (e.g., you're operating at margins where the difference between "cheap" and "mid-tier" matters a lot), 64.5% means 1 in 3 queries lands in an adjacent tier. The  metric papers over this.
-3. **No comparison with RouteLLM on the same data.** We reference RouteLLM's publicly reported numbers, but we didn't run RouteLLM on our test set. Different query distributions make direct comparison unreliable.
-4. **Query distribution bias.** Our test set likely over-represents English, coding, and analytical queries because that's what we test with. Non-English and creative tasks may route differently.
-5. **Cost savings depend heavily on your query mix.** 61.6% is our benchmark workload. If 90% of your queries are complex, routing saves less. If 90% are simple, routing saves more.
-## Questions for the community
-- Is  accuracy actually the right metric? Or should we optimize for exact match at the cost of simplicity?
-- Has anyone compared RouteLLM's BERT-based approach against a strong keyword baseline on the same dataset? Our suspicion is that the gap is smaller than the ML community assumes.
-- For production routing, what's the actual cost of a "wrong tier" routing? We assume  is fine because provider quality within adjacent tiers overlaps significantly. Is that assumption valid?
-- Are there public LLM routing benchmarks we should be evaluating on?
-## Links
-- **Repo:** https://github.com/Das-rebel/a3m-router
-- **npm:** https://www.npmjs.com/package/adaptive-memory-multi-model-router
-The classifier is ~200 lines of TypeScript. No dependencies beyond a standard Node.js runtime. If you want to reproduce the benchmark or contribute a more rigorous evaluation, PRs welcome.