adaptive-memory-multi-model-router 2.14.46 → 2.14.47

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (598) hide show
  1. package/{docs/llms.txt → llms.txt.bak} +6 -6
  2. package/package.json +13 -84
  3. package/src/routing/advancedRouter.ts.bak +650 -0
  4. package/test.js.bak +376 -0
  5. package/.dockerignore +0 -82
  6. package/.env.example +0 -303
  7. package/.github/DISCUSSIONS_WELCOME.md +0 -27
  8. package/.github/DISCUSSION_TEMPLATE.yml +0 -5
  9. package/.github/FUNDING.yml +0 -2
  10. package/.github/ISSUE_TEMPLATE/bug_report.md +0 -94
  11. package/.github/ISSUE_TEMPLATE/config.yml +0 -17
  12. package/.github/ISSUE_TEMPLATE/feature_request.md +0 -71
  13. package/.github/PULL_REQUEST_TEMPLATE.md +0 -71
  14. package/.github/dependabot.yml +0 -9
  15. package/.github/workflows/auto-publish.yml +0 -51
  16. package/.github/workflows/ci.yml +0 -263
  17. package/.github/workflows/codeql.yml +0 -38
  18. package/.github/workflows/npm-publish.yml +0 -20
  19. package/.github/workflows/pages.yml +0 -37
  20. package/.github/workflows/stale.yml +0 -54
  21. package/.publish-tick +0 -1
  22. package/.well-known/ai-plugin.json +0 -16
  23. package/AGENT_COUNCIL_FINDINGS.md +0 -142
  24. package/ARCHITECTURE.md +0 -346
  25. package/AUDIT_REPORT.md +0 -28
  26. package/CODE_OF_CONDUCT.md +0 -128
  27. package/CONTRIBUTING.md +0 -50
  28. package/CONTRIBUTORS.md +0 -20
  29. package/Dockerfile +0 -53
  30. package/Dockerfile.proxy +0 -33
  31. package/HEALTH_REPORT.md +0 -118
  32. package/IMPROVEMENT_PLAN.md +0 -107
  33. package/LANDING.md +0 -43
  34. package/LAUNCH-PAIN-DRIVEN.md +0 -339
  35. package/LAUNCH.md +0 -337
  36. package/LAUNCH_CHECKLIST.md +0 -141
  37. package/LAUNCH_SNAPSHOT.md +0 -260
  38. package/MANIFESTO.md +0 -41
  39. package/POPULARITY_BOOSTERS.md +0 -285
  40. package/PR_STATUS_REPORT.md +0 -148
  41. package/REDESIGN.md +0 -95
  42. package/RUNKIT.md +0 -83
  43. package/SECURITY.md +0 -29
  44. package/SUBMISSIONS.md +0 -43
  45. package/_schema.html +0 -53
  46. package/ai-plugin.json +0 -16
  47. package/articles/AI_AGENT_LLM_ROUTING.md +0 -150
  48. package/articles/CHINESE_DIRECTORIES.md +0 -100
  49. package/articles/CHINESE_SUBMISSIONS_READY.md +0 -322
  50. package/articles/COMPETITOR_ALERTS.md +0 -31
  51. package/articles/COMPLETE_POSTING_DIRECTORY.md +0 -147
  52. package/articles/CONTENT_STRUCTURE.md +0 -292
  53. package/articles/DEVTO_COST_GUIDE.md +0 -473
  54. package/articles/DEVTO_FINAL.md +0 -416
  55. package/articles/DEVTO_MULTI_PROVIDER.md +0 -542
  56. package/articles/DEVTO_READY.md +0 -255
  57. package/articles/DEVTO_V2_ANNOUNCEMENT.md +0 -160
  58. package/articles/DEVTO_VIRAL_GROWTH.md +0 -280
  59. package/articles/FRESH_devto.md +0 -460
  60. package/articles/FRESH_devto_2026_05.md +0 -73
  61. package/articles/FRESH_hackernews.md +0 -14
  62. package/articles/FRESH_reddit_ml.md +0 -90
  63. package/articles/FRESH_reddit_node.md +0 -198
  64. package/articles/FRESH_reddit_sideproject.md +0 -72
  65. package/articles/FRESH_reddit_webdev.md +0 -130
  66. package/articles/FROM_ZERO_TO_10K.md +0 -107
  67. package/articles/HN_10X_BETTER.md +0 -430
  68. package/articles/HN_ACCOUNT_GUIDE.md +0 -21
  69. package/articles/HN_CHINESE_STYLE.md +0 -308
  70. package/articles/HN_FINAL.md +0 -148
  71. package/articles/HN_POSTED_VERSION.md +0 -56
  72. package/articles/HN_POST_READY.md +0 -137
  73. package/articles/HN_RESEARCH.md +0 -364
  74. package/articles/HN_SHOW_routerarena.md +0 -17
  75. package/articles/HN_TIMING_GUIDE.md +0 -52
  76. package/articles/INDIEHACKERS_POST.md +0 -52
  77. package/articles/INDIEHACKERS_READY.md +0 -120
  78. package/articles/LLM_BENCHMARK_DEEP_DIVE.md +0 -153
  79. package/articles/MASTER_POSTING_DIRECTORY.md +0 -189
  80. package/articles/NEWSLETTER_SEND_NOW.md +0 -259
  81. package/articles/NEWSLETTER_SUBMISSIONS.md +0 -112
  82. package/articles/PAIN-DRIVEN-devto-v2.md +0 -308
  83. package/articles/PAIN-DRIVEN-devto-v3.md +0 -268
  84. package/articles/PAIN-DRIVEN-devto.md +0 -242
  85. package/articles/PAIN-DRIVEN-hackernews-v2.md +0 -138
  86. package/articles/PAIN-DRIVEN-hackernews-v3.md +0 -151
  87. package/articles/PAIN-DRIVEN-hackernews.md +0 -131
  88. package/articles/PAIN-DRIVEN-reddit-v2.md +0 -301
  89. package/articles/PAIN-DRIVEN-reddit-v3.md +0 -236
  90. package/articles/PAIN-DRIVEN-reddit.md +0 -218
  91. package/articles/PAIN-DRIVEN-twitter-v2.md +0 -110
  92. package/articles/PAIN-DRIVEN-twitter-v3.md +0 -121
  93. package/articles/PAIN-DRIVEN-twitter.md +0 -120
  94. package/articles/PORTKEY_VS_A3M.md +0 -147
  95. package/articles/POSTING_KIT_2026_05.md +0 -67
  96. package/articles/PRESS_KIT_routerarena.md +0 -77
  97. package/articles/PRODUCTHUNT_LISTING.md +0 -48
  98. package/articles/PRODUCTHUNT_READY.md +0 -106
  99. package/articles/PR_PLAN_vault.md +0 -125
  100. package/articles/REDDIT_FINAL.md +0 -232
  101. package/articles/REDDIT_POST.md +0 -67
  102. package/articles/REDDIT_SUBMISSION_READY.md +0 -348
  103. package/articles/ROUTERARENA_LEADER.md +0 -45
  104. package/articles/SHOW_HN_FINAL.md +0 -29
  105. package/articles/TWEETS_10K_DOWNLOADS.md +0 -47
  106. package/articles/TWEETS_BENCHMARK_FIRST.md +0 -46
  107. package/articles/TWEETS_MCP_PLAY.md +0 -51
  108. package/articles/TWEETS_SEQUENTIAL_BROKEN.md +0 -49
  109. package/articles/TWEETS_WHY_BUILD.md +0 -54
  110. package/articles/TWEETS_routerarena_leader.md +0 -53
  111. package/articles/TWEET_STORM_READY.md +0 -165
  112. package/articles/TWITTER_FINAL.md +0 -167
  113. package/articles/WHY_10X_BETTER.md +0 -261
  114. package/articles/WHY_CHINESE_STYLE_BETTER.md +0 -323
  115. package/articles/ai-discoverability-llm-routing.md +0 -210
  116. package/articles/devto-llm-routing.md +0 -138
  117. package/articles/hackernews-show-hn.md +0 -54
  118. package/articles/hashnode-llm-cost-optimization.md +0 -125
  119. package/articles/hn_show_2026_05.md +0 -11
  120. package/articles/medium-building-llm-router.md +0 -205
  121. package/articles/reddit-ml.md +0 -76
  122. package/articles/twitter-thread-cost-savings.md +0 -50
  123. package/articles/youtube-tutorial-script.md +0 -262
  124. package/assets/a3m_3blue1brown.mp4 +0 -0
  125. package/assets/banner.svg +0 -109
  126. package/assets/chart-cost-v2.svg +0 -91
  127. package/assets/chart-cost-v3.svg +0 -143
  128. package/assets/chart-features-v2.svg +0 -132
  129. package/assets/chart-features-v3.svg +0 -211
  130. package/assets/chart-growth-v2.svg +0 -122
  131. package/assets/chart-growth-v3.svg +0 -189
  132. package/assets/cost-comparison.svg +0 -134
  133. package/assets/cost-simple.svg +0 -64
  134. package/assets/demo-hn.gif +0 -0
  135. package/assets/feature-matrix.svg +0 -136
  136. package/assets/growth-chart-animated.svg +0 -76
  137. package/assets/growth-chart.svg +0 -82
  138. package/assets/growth-simple.svg +0 -69
  139. package/assets/hero-diagram.svg +0 -81
  140. package/assets/logo-new.svg +0 -21
  141. package/assets/logo.svg +0 -68
  142. package/assets/provider-comparison.svg +0 -121
  143. package/assets/social-preview-new.svg +0 -100
  144. package/assets/social-preview.svg +0 -194
  145. package/assets/social-v2.svg +0 -130
  146. package/assets/social-v3.svg +0 -212
  147. package/benchmark-provider-results.json +0 -245
  148. package/benchmark-results.json +0 -54
  149. package/council-votes/architecture-vote.md +0 -121
  150. package/council-votes/coverage-vote.md +0 -93
  151. package/data/adaptive-benchmark.json +0 -92
  152. package/data/benchmark-results.json +0 -47
  153. package/data/labeled-benchmark.json +0 -88
  154. package/demo/3blue1brown_video.py +0 -285
  155. package/demo/3blue1brown_video_v2.py +0 -310
  156. package/demo/IMPROVED_PROMPTS.md +0 -229
  157. package/demo/VEO3_PROMPTS.md +0 -269
  158. package/demo/VIDEO_PRODUCTION_GUIDE.md +0 -333
  159. package/demo/a3m_3blue1brown.mp4 +0 -0
  160. package/demo/asciinema-demo.sh +0 -195
  161. package/demo/demo-hn.tape +0 -74
  162. package/demo/demo-script.md +0 -53
  163. package/demo/demo-script.sh +0 -62
  164. package/demo/demo.svg +0 -75
  165. package/demo/frame1_ai_data_center.png +0 -0
  166. package/demo/frame1_sunset_video.mp4 +0 -0
  167. package/demo/frame2_cost_comparison.png +0 -0
  168. package/demo/frame2_cost_comparison_fallback.png +0 -0
  169. package/demo/frame3_parallel_execution.png +0 -0
  170. package/demo/frame3_parallel_execution_fallback.png +0 -0
  171. package/demo/frame4_providers.png +0 -0
  172. package/demo/frame4_providers_fallback.png +0 -0
  173. package/demo/frame5_endcard.png +0 -0
  174. package/demo/frame5_endcard_fallback.png +0 -0
  175. package/demo/new_frame1_hook.png +0 -0
  176. package/demo/new_frame2_proof.png +0 -0
  177. package/demo/new_frame3_wow.png +0 -0
  178. package/demo/new_frame4_social.png +0 -0
  179. package/demo/new_frame5_cta.png +0 -0
  180. package/demo/package.json +0 -13
  181. package/demo/product-video-final.mp4 +0 -0
  182. package/demo/product-video-hype-v1.mp4 +0 -0
  183. package/demo/product-video-v1.mp4 +0 -0
  184. package/demo/public/index.html +0 -762
  185. package/demo/recording.cast +0 -55
  186. package/demo/server.js +0 -405
  187. package/demo-new.tape +0 -71
  188. package/demo-real.sh +0 -198
  189. package/demo-simple.tape +0 -205
  190. package/demo.html +0 -520
  191. package/demo.sh +0 -85
  192. package/demo.tape +0 -259
  193. package/dist/analytics/costAnalytics.d.ts.map +0 -1
  194. package/dist/analytics/costAnalytics.js.map +0 -1
  195. package/dist/benchmark/comprehensive.js.map +0 -1
  196. package/dist/benchmark/reproducible.d.ts.map +0 -1
  197. package/dist/benchmark/reproducible.js.map +0 -1
  198. package/dist/cache/prefixCache.d.ts.map +0 -1
  199. package/dist/cache/prefixCache.js.map +0 -1
  200. package/dist/cache/responseCache.d.ts.map +0 -1
  201. package/dist/cache/responseCache.js.map +0 -1
  202. package/dist/cache/semanticCache.d.ts.map +0 -1
  203. package/dist/cache/semanticCache.js.map +0 -1
  204. package/dist/cli/setupWizard.d.ts.map +0 -1
  205. package/dist/cli/setupWizard.js.map +0 -1
  206. package/dist/cost/budgetEnforcer.d.ts.map +0 -1
  207. package/dist/cost/budgetEnforcer.js.map +0 -1
  208. package/dist/cost/costTracker.d.ts.map +0 -1
  209. package/dist/cost/costTracker.js.map +0 -1
  210. package/dist/ensemble/multiRoundDialog.js.map +0 -1
  211. package/dist/ensemble/shapleyValue.js.map +0 -1
  212. package/dist/integrations/langchainAdapter.d.ts.map +0 -1
  213. package/dist/integrations/langchainAdapter.js.map +0 -1
  214. package/dist/integrations/oauth.d.ts.map +0 -1
  215. package/dist/integrations/oauth.js.map +0 -1
  216. package/dist/integrations/scienceAdapter.js.map +0 -1
  217. package/dist/memory/autoFetch.d.ts.map +0 -1
  218. package/dist/memory/autoFetch.js.map +0 -1
  219. package/dist/memory/episodicMemory.d.ts.map +0 -1
  220. package/dist/memory/episodicMemory.js.map +0 -1
  221. package/dist/memory/hybridMemory.js.map +0 -1
  222. package/dist/memory/memoryTree.d.ts.map +0 -1
  223. package/dist/memory/memoryTree.js.map +0 -1
  224. package/dist/memory/obsidianVault.d.ts.map +0 -1
  225. package/dist/memory/obsidianVault.js.map +0 -1
  226. package/dist/memory/reasoningBank.js.map +0 -1
  227. package/dist/observability/changeWatch.d.ts.map +0 -1
  228. package/dist/observability/changeWatch.js.map +0 -1
  229. package/dist/observability/fatigueDetector.d.ts.map +0 -1
  230. package/dist/observability/fatigueDetector.js.map +0 -1
  231. package/dist/observability/index.d.ts.map +0 -1
  232. package/dist/observability/index.js.map +0 -1
  233. package/dist/observability/metrics.d.ts.map +0 -1
  234. package/dist/observability/metrics.js.map +0 -1
  235. package/dist/observability/middleware.d.ts.map +0 -1
  236. package/dist/observability/middleware.js.map +0 -1
  237. package/dist/observability/tracer.d.ts.map +0 -1
  238. package/dist/observability/tracer.js.map +0 -1
  239. package/dist/observability/types.d.ts.map +0 -1
  240. package/dist/observability/types.js.map +0 -1
  241. package/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
  242. package/dist/orchestration/haloOrchestrator.js.map +0 -1
  243. package/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
  244. package/dist/orchestration/mctsWorkflow.js.map +0 -1
  245. package/dist/providers/localProvider.d.ts.map +0 -1
  246. package/dist/providers/localProvider.js.map +0 -1
  247. package/dist/providers/providerConfig.d.ts.map +0 -1
  248. package/dist/providers/providerConfig.js.map +0 -1
  249. package/dist/providers/registry.d.ts.map +0 -1
  250. package/dist/providers/registry.js.map +0 -1
  251. package/dist/routing/advancedRouter.d.ts.map +0 -1
  252. package/dist/routing/advancedRouter.js.map +0 -1
  253. package/dist/routing/crossModelValidation.d.ts.map +0 -1
  254. package/dist/routing/crossModelValidation.js.map +0 -1
  255. package/dist/routing/providerHealth.d.ts.map +0 -1
  256. package/dist/routing/providerHealth.js.map +0 -1
  257. package/dist/routing/providerRetry.d.ts.map +0 -1
  258. package/dist/routing/providerRetry.js.map +0 -1
  259. package/dist/scripts/banner.js +0 -29
  260. package/dist/security/guardrails.d.ts.map +0 -1
  261. package/dist/security/guardrails.js.map +0 -1
  262. package/dist/server/dashboard.d.ts.map +0 -1
  263. package/dist/server/dashboard.js.map +0 -1
  264. package/dist/server/modelMapper.d.ts.map +0 -1
  265. package/dist/server/modelMapper.js.map +0 -1
  266. package/dist/server/proxyServer.d.ts.map +0 -1
  267. package/dist/server/proxyServer.js.map +0 -1
  268. package/dist/skills/__tests__/skill_manager.test.d.ts +0 -2
  269. package/dist/skills/__tests__/skill_manager.test.d.ts.map +0 -1
  270. package/dist/skills/__tests__/skill_manager.test.js +0 -268
  271. package/dist/skills/__tests__/skill_manager.test.js.map +0 -1
  272. package/dist/tools/tmlpdTools.d.ts.map +0 -1
  273. package/dist/tools/tmlpdTools.js.map +0 -1
  274. package/dist/tui/dashboard.d.ts.map +0 -1
  275. package/dist/tui/dashboard.js.map +0 -1
  276. package/dist/tui/index.d.ts.map +0 -1
  277. package/dist/tui/index.js.map +0 -1
  278. package/dist/utils/batchProcessor.d.ts.map +0 -1
  279. package/dist/utils/batchProcessor.js.map +0 -1
  280. package/dist/utils/compression.d.ts.map +0 -1
  281. package/dist/utils/compression.js.map +0 -1
  282. package/dist/utils/costUtils.d.ts.map +0 -1
  283. package/dist/utils/costUtils.js.map +0 -1
  284. package/dist/utils/reliability.d.ts.map +0 -1
  285. package/dist/utils/reliability.js.map +0 -1
  286. package/dist/utils/sorting.d.ts.map +0 -1
  287. package/dist/utils/sorting.js.map +0 -1
  288. package/dist/utils/speculativeDecoding.d.ts.map +0 -1
  289. package/dist/utils/speculativeDecoding.js.map +0 -1
  290. package/dist/utils/tokenUtils.d.ts.map +0 -1
  291. package/dist/utils/tokenUtils.js.map +0 -1
  292. package/docs/.nojekyll +0 -0
  293. package/docs/ANALYSIS_PRINCIPLES.md +0 -162
  294. package/docs/API.md +0 -855
  295. package/docs/ARCHITECTURAL-IMPROVEMENTS-2025.md +0 -1391
  296. package/docs/ARCHITECTURAL-IMPROVEMENTS-REVISED-2025.md +0 -1051
  297. package/docs/BENCHMARK.md +0 -170
  298. package/docs/CHINESE_PROVIDER_RELIABILITY.md +0 -37
  299. package/docs/CITATIONS.md +0 -74
  300. package/docs/CLAIMS_AND_EVIDENCE.md +0 -58
  301. package/docs/CONFIGURATION.md +0 -476
  302. package/docs/COUNCIL_DECISION.json +0 -816
  303. package/docs/COUNCIL_SUMMARY.md +0 -319
  304. package/docs/COUNCIL_V2.2_DECISION.md +0 -416
  305. package/docs/ENGINEERING_SPEC.md +0 -55
  306. package/docs/FACTORY_RESET.md +0 -34
  307. package/docs/GEO.md +0 -66
  308. package/docs/GEO_OPTIMIZATION.md +0 -30
  309. package/docs/GEO_ROOT_CAUSE.md +0 -136
  310. package/docs/GEO_STATUS.md +0 -85
  311. package/docs/GEO_TEST_RESULTS.md +0 -176
  312. package/docs/HN_CHECKLIST.md +0 -38
  313. package/docs/HN_FOUNDER_COMMENT.md +0 -17
  314. package/docs/HN_SUBMISSION_FINAL.md +0 -180
  315. package/docs/HN_SUBMISSION_V3.md +0 -56
  316. package/docs/IMPROVEMENT_ROADMAP.md +0 -515
  317. package/docs/INTEGRATIONS.md +0 -420
  318. package/docs/LANGCHAIN_INTEGRATION.md +0 -147
  319. package/docs/LLM_COUNCIL_DECISION.md +0 -508
  320. package/docs/MIDDLEWARE_CHAIN.md +0 -35
  321. package/docs/PROMO_CHECKLIST.md +0 -200
  322. package/docs/QUICKSTART.md +0 -271
  323. package/docs/QUICK_START.md +0 -43
  324. package/docs/QUICK_START_VISIBILITY.md +0 -782
  325. package/docs/REDDIT_GAP_ANALYSIS.md +0 -299
  326. package/docs/RELEASE_CHECKLIST.md +0 -32
  327. package/docs/REPRODUCIBILITY.md +0 -63
  328. package/docs/RESEARCH_BACKED_IMPROVEMENTS.md +0 -1180
  329. package/docs/ROUTING_RUBRIC.md +0 -197
  330. package/docs/SEO_AUDIT.md +0 -186
  331. package/docs/SOCIAL_LISTENING.md +0 -219
  332. package/docs/TMLPD_QNA.md +0 -751
  333. package/docs/TMLPD_V2.1_COMPLETE.md +0 -763
  334. package/docs/TMLPD_V2.2_RESEARCH_ROADMAP.md +0 -754
  335. package/docs/UPDATE_TOPICS.md +0 -15
  336. package/docs/USE_CASES.md +0 -59
  337. package/docs/V2.2_IMPLEMENTATION_COMPLETE.md +0 -446
  338. package/docs/V2_IMPLEMENTATION_GUIDE.md +0 -388
  339. package/docs/VERCEL_AI_SDK.md +0 -209
  340. package/docs/VISIBILITY_ADOPTION_PLAN.md +0 -1005
  341. package/docs/_config.yml +0 -49
  342. package/docs/ai-plugin.json +0 -16
  343. package/docs/api.html +0 -513
  344. package/docs/architecture-diagram.md +0 -40
  345. package/docs/benchmark-chart.png +0 -0
  346. package/docs/benchmark.html +0 -387
  347. package/docs/blog/routerarena-number-one.html +0 -73
  348. package/docs/cli-cheatsheet.md +0 -339
  349. package/docs/compare.md +0 -109
  350. package/docs/comparison-litellm.md +0 -88
  351. package/docs/comparison.md +0 -108
  352. package/docs/cost-chart-ascii.md +0 -42
  353. package/docs/cost-comparison-chart.svg +0 -88
  354. package/docs/curl-examples.md +0 -247
  355. package/docs/demo-auto.html +0 -264
  356. package/docs/demo.html +0 -416
  357. package/docs/geo/GENERATIVE_ENGINE_OPTIMIZATION.md +0 -232
  358. package/docs/index.html +0 -507
  359. package/docs/launch-content/LAUNCH_EXECUTION_CHECKLIST.md +0 -421
  360. package/docs/launch-content/README.md +0 -457
  361. package/docs/launch-content/assets/cost_comparison_100_tasks.png +0 -0
  362. package/docs/launch-content/assets/cumulative_savings.png +0 -0
  363. package/docs/launch-content/assets/parallel_speedup.png +0 -0
  364. package/docs/launch-content/assets/provider_pricing_comparison.png +0 -0
  365. package/docs/launch-content/assets/task_breakdown_comparison.png +0 -0
  366. package/docs/launch-content/generate_charts.py +0 -313
  367. package/docs/launch-content/hn_show_post.md +0 -139
  368. package/docs/launch-content/partner_outreach_templates.md +0 -745
  369. package/docs/launch-content/reddit_posts.md +0 -467
  370. package/docs/launch-content/twitter_thread.txt +0 -460
  371. package/docs/npm-downloads-chart.svg +0 -43
  372. package/docs/openapi.json +0 -139
  373. package/docs/openapi.yaml +0 -1318
  374. package/docs/quick-start.html +0 -366
  375. package/docs/robots.txt +0 -52
  376. package/docs/sitemap.xml +0 -57
  377. package/docs/styles.css +0 -682
  378. package/docs/well-known/ai-plugin.json +0 -16
  379. package/docs/wellknown/ai-plugin.json +0 -16
  380. package/docs-site/assets/og-banner.svg +0 -194
  381. package/docs-site/index.html +0 -632
  382. package/eval/README.md +0 -46
  383. package/eval/baselines/main.json +0 -12
  384. package/eval/benchmark_dataset.jsonl +0 -16
  385. package/eval/check_golden_routes.js +0 -64
  386. package/eval/datasets/catalog.json +0 -33
  387. package/eval/datasets/slices/cn_provider_reliability_v1.jsonl +0 -3
  388. package/eval/datasets/slices/cost_pressure_v1.jsonl +0 -3
  389. package/eval/datasets/slices/safety_guardrails_v1.jsonl +0 -3
  390. package/eval/evals.json +0 -199
  391. package/eval/fault_injection_thresholds.json +0 -3
  392. package/eval/generate_report.js +0 -128
  393. package/eval/golden_routes.json +0 -114
  394. package/eval/lib/experiment_registry.js +0 -24
  395. package/eval/run_eval.js +0 -197
  396. package/eval/run_fault_injection.js +0 -201
  397. package/eval/run_shadow_eval.js +0 -85
  398. package/eval/thresholds.json +0 -9
  399. package/examples/QUICKSTART.md +0 -183
  400. package/examples/README.md +0 -61
  401. package/examples/a3m-sdk.js +0 -124
  402. package/examples/basic-route.js +0 -54
  403. package/examples/chat-loop.js +0 -202
  404. package/examples/classify-then-route.js +0 -102
  405. package/examples/cost-compare.js +0 -120
  406. package/examples/ensemble.js +0 -160
  407. package/examples/whatsapp-telegram-bridge-demo.js +0 -302
  408. package/examples/whatsapp-telegram-bridge.js +0 -269
  409. package/hf-space/README.md +0 -23
  410. package/hf-space/app.py +0 -240
  411. package/hf-space/requirements.txt +0 -1
  412. package/huggingface_space/README.md +0 -35
  413. package/huggingface_space/app.py +0 -126
  414. package/huggingface_space/create_space.py +0 -208
  415. package/huggingface_space/requirements.txt +0 -1
  416. package/mcp-server/README.md +0 -188
  417. package/mcp-server/package.json +0 -29
  418. package/mcp-server/src/index.ts +0 -744
  419. package/mcp-server/tsconfig.json +0 -19
  420. package/openclaw-alexa-bridge/ALL_REMAINING_FIXES_PLAN.md +0 -313
  421. package/openclaw-alexa-bridge/REMAINING_FIXES_SUMMARY.md +0 -277
  422. package/openclaw-alexa-bridge/src/alexa_handler_no_tmlpd.js +0 -1234
  423. package/openclaw-alexa-bridge/test_fixes.js +0 -77
  424. package/playground/README.md +0 -51
  425. package/playground/codesandbox.json +0 -12
  426. package/playground/index.js +0 -39
  427. package/proxy/README.md +0 -227
  428. package/proxy/package-lock.json +0 -831
  429. package/proxy/package.json +0 -17
  430. package/proxy/rate-limit.js +0 -145
  431. package/proxy/rate-limit.test.js +0 -311
  432. package/proxy/server.js +0 -970
  433. package/python/README.md +0 -102
  434. package/python/a3m/__init__.py +0 -6
  435. package/python/a3m/client.py +0 -190
  436. package/python/a3m/models.py +0 -40
  437. package/python/a3m/sync_client.py +0 -61
  438. package/python/examples.py +0 -53
  439. package/python/integrations.py +0 -330
  440. package/python/pyproject.toml +0 -23
  441. package/python/setup.py +0 -28
  442. package/python/tmlpd.py +0 -369
  443. package/qna/REDDIT_GAP_ANALYSIS.md +0 -299
  444. package/qna/TMLPD_QNA.md +0 -751
  445. package/research/FINDING_001_safety.md +0 -28
  446. package/research/FINDING_002_error_diversity.md +0 -32
  447. package/research/FINDING_003_confidence_weighted_voting.md +0 -32
  448. package/research/FINDING_004_cross_model_semantic_detection.md +0 -37
  449. package/research/FINDING_005_knowledge_gap_orthogonality.md +0 -34
  450. package/research/HALLUCINATION_RESEARCH.md +0 -27
  451. package/research/ensemble-voting.md +0 -324
  452. package/research/loss-functions.md +0 -545
  453. package/research-log.md +0 -49
  454. package/scripts/banner.js +0 -29
  455. package/scripts/benchmark-local-routerarena.ts +0 -176
  456. package/scripts/benchmark.js +0 -145
  457. package/scripts/benchmark.sh +0 -61
  458. package/scripts/compare-providers.sh +0 -230
  459. package/scripts/content-planner.js +0 -25
  460. package/scripts/create-labeled-benchmark.ts +0 -105
  461. package/scripts/cross_post.py +0 -443
  462. package/scripts/local-router-benchmark.ts +0 -154
  463. package/scripts/post-all.sh +0 -41
  464. package/scripts/publish_fcc.py +0 -106
  465. package/scripts/push-to-gitee.sh +0 -25
  466. package/scripts/routerarena_ensemble.js +0 -144
  467. package/scripts/routing-benchmark-v2.js +0 -373
  468. package/scripts/routing-benchmark-v3.js +0 -118
  469. package/scripts/routing-benchmark.js +0 -462
  470. package/scripts/run-labeled-benchmark.mjs +0 -104
  471. package/scripts/run-mmlu-benchmark.js +0 -176
  472. package/scripts/run-provider-benchmark.js +0 -244
  473. package/scripts/update-npm-badges.js +0 -158
  474. package/skill/SKILL.md +0 -238
  475. package/src/__tests__/integration/tmpld_integration.test.py +0 -540
  476. package/src/skills/__tests__/skill_manager.test.ts +0 -328
  477. package/submissions/benchmarks/ALL_PLATFORMS_SUBMISSION.md +0 -94
  478. package/submissions/benchmarks/LLMROUTERBENCH_SUBMISSION.md +0 -121
  479. package/submissions/benchmarks/MMRBENCH_SUBMISSION.md +0 -94
  480. package/submissions/benchmarks/ROUTERARENA_UPDATE.md +0 -83
  481. package/submissions/benchmarks/ROUTERBENCH_SUBMISSION.md +0 -225
  482. package/test-council/1-structure-tests.test.js +0 -353
  483. package/test-council/1-structure-tests.test.ts +0 -353
  484. package/test-council/2-edge-case-tests.test.ts +0 -361
  485. package/test-council/3-performance-tests.test.ts +0 -669
  486. package/test-council/4-integration-tests.test.ts +0 -391
  487. package/test-council/5-agent-council-eval.test.ts +0 -413
  488. package/test-council/AGENT_COUNCIL_ARCHITECTURE.md +0 -349
  489. package/test-council/TEST_COUNCIL_REPORT.md +0 -201
  490. package/test-council/agents/edge-case-agent.ts +0 -363
  491. package/test-council/agents/performance-agent.ts +0 -426
  492. package/test-council/agents/structure-agent.ts +0 -227
  493. package/test-council/council.md +0 -183
  494. package/tests/__mocks__/tokenUtils.ts +0 -8
  495. package/tests/memory/episodicMemory.test.ts +0 -227
  496. package/tests/package-lock.json +0 -1628
  497. package/tests/package.json +0 -18
  498. package/tests/routing/ensembleVoting.test.ts +0 -236
  499. package/tests/routing/providerRetry.test.ts +0 -360
  500. package/tests/routing/queryTypePresets.test.ts +0 -208
  501. package/tests/security/guardrailEngine.test.ts +0 -700
  502. package/tests/tsconfig.json +0 -21
  503. package/tests/vitest.config.ts +0 -18
  504. package/tmlpd-pi-extension/README.md +0 -66
  505. package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts +0 -114
  506. package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts.map +0 -1
  507. package/tmlpd-pi-extension/dist/cache/prefixCache.js +0 -285
  508. package/tmlpd-pi-extension/dist/cache/prefixCache.js.map +0 -1
  509. package/tmlpd-pi-extension/dist/cache/responseCache.d.ts +0 -58
  510. package/tmlpd-pi-extension/dist/cache/responseCache.d.ts.map +0 -1
  511. package/tmlpd-pi-extension/dist/cache/responseCache.js +0 -153
  512. package/tmlpd-pi-extension/dist/cache/responseCache.js.map +0 -1
  513. package/tmlpd-pi-extension/dist/cli.js +0 -59
  514. package/tmlpd-pi-extension/dist/cost/costTracker.d.ts +0 -95
  515. package/tmlpd-pi-extension/dist/cost/costTracker.d.ts.map +0 -1
  516. package/tmlpd-pi-extension/dist/cost/costTracker.js +0 -240
  517. package/tmlpd-pi-extension/dist/cost/costTracker.js.map +0 -1
  518. package/tmlpd-pi-extension/dist/index.d.ts +0 -723
  519. package/tmlpd-pi-extension/dist/index.d.ts.map +0 -1
  520. package/tmlpd-pi-extension/dist/index.js +0 -239
  521. package/tmlpd-pi-extension/dist/index.js.map +0 -1
  522. package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts +0 -82
  523. package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts.map +0 -1
  524. package/tmlpd-pi-extension/dist/memory/episodicMemory.js +0 -145
  525. package/tmlpd-pi-extension/dist/memory/episodicMemory.js.map +0 -1
  526. package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts +0 -102
  527. package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
  528. package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js +0 -207
  529. package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js.map +0 -1
  530. package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts +0 -85
  531. package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
  532. package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js +0 -210
  533. package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js.map +0 -1
  534. package/tmlpd-pi-extension/dist/providers/localProvider.d.ts +0 -102
  535. package/tmlpd-pi-extension/dist/providers/localProvider.d.ts.map +0 -1
  536. package/tmlpd-pi-extension/dist/providers/localProvider.js +0 -338
  537. package/tmlpd-pi-extension/dist/providers/localProvider.js.map +0 -1
  538. package/tmlpd-pi-extension/dist/providers/registry.d.ts +0 -55
  539. package/tmlpd-pi-extension/dist/providers/registry.d.ts.map +0 -1
  540. package/tmlpd-pi-extension/dist/providers/registry.js +0 -138
  541. package/tmlpd-pi-extension/dist/providers/registry.js.map +0 -1
  542. package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts +0 -68
  543. package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts.map +0 -1
  544. package/tmlpd-pi-extension/dist/routing/advancedRouter.js +0 -332
  545. package/tmlpd-pi-extension/dist/routing/advancedRouter.js.map +0 -1
  546. package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts +0 -101
  547. package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts.map +0 -1
  548. package/tmlpd-pi-extension/dist/tools/tmlpdTools.js +0 -368
  549. package/tmlpd-pi-extension/dist/tools/tmlpdTools.js.map +0 -1
  550. package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts +0 -96
  551. package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts.map +0 -1
  552. package/tmlpd-pi-extension/dist/utils/batchProcessor.js +0 -170
  553. package/tmlpd-pi-extension/dist/utils/batchProcessor.js.map +0 -1
  554. package/tmlpd-pi-extension/dist/utils/compression.d.ts +0 -61
  555. package/tmlpd-pi-extension/dist/utils/compression.d.ts.map +0 -1
  556. package/tmlpd-pi-extension/dist/utils/compression.js +0 -281
  557. package/tmlpd-pi-extension/dist/utils/compression.js.map +0 -1
  558. package/tmlpd-pi-extension/dist/utils/reliability.d.ts +0 -74
  559. package/tmlpd-pi-extension/dist/utils/reliability.d.ts.map +0 -1
  560. package/tmlpd-pi-extension/dist/utils/reliability.js +0 -177
  561. package/tmlpd-pi-extension/dist/utils/reliability.js.map +0 -1
  562. package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts +0 -117
  563. package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts.map +0 -1
  564. package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js +0 -246
  565. package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js.map +0 -1
  566. package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts +0 -50
  567. package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts.map +0 -1
  568. package/tmlpd-pi-extension/dist/utils/tokenUtils.js +0 -124
  569. package/tmlpd-pi-extension/dist/utils/tokenUtils.js.map +0 -1
  570. package/tmlpd-pi-extension/examples/QUICKSTART.md +0 -183
  571. package/tmlpd-pi-extension/package-lock.json +0 -79
  572. package/tmlpd-pi-extension/package.json +0 -172
  573. package/tmlpd-pi-extension/python/examples.py +0 -53
  574. package/tmlpd-pi-extension/python/integrations.py +0 -330
  575. package/tmlpd-pi-extension/python/setup.py +0 -28
  576. package/tmlpd-pi-extension/python/tmlpd.py +0 -369
  577. package/tmlpd-pi-extension/qna/REDDIT_GAP_ANALYSIS.md +0 -299
  578. package/tmlpd-pi-extension/qna/TMLPD_QNA.md +0 -751
  579. package/tmlpd-pi-extension/skill/SKILL.md +0 -238
  580. package/tmlpd-pi-extension/src/cache/responseCache.ts +0 -147
  581. package/tmlpd-pi-extension/src/cost/costTracker.ts +0 -302
  582. package/tmlpd-pi-extension/src/index.ts +0 -232
  583. package/tmlpd-pi-extension/src/memory/episodicMemory.ts +0 -257
  584. package/tmlpd-pi-extension/src/orchestration/haloOrchestrator.ts +0 -266
  585. package/tmlpd-pi-extension/src/orchestration/mctsWorkflow.ts +0 -262
  586. package/tmlpd-pi-extension/src/providers/localProvider.ts +0 -406
  587. package/tmlpd-pi-extension/src/providers/registry.ts +0 -164
  588. package/tmlpd-pi-extension/src/routing/ensembleVoting.ts +0 -159
  589. package/tmlpd-pi-extension/src/routing/queryTypePresets.ts +0 -136
  590. package/tmlpd-pi-extension/src/tools/tmlpdTools.ts +0 -433
  591. package/tmlpd-pi-extension/src/utils/batchProcessor.ts +0 -232
  592. package/tmlpd-pi-extension/src/utils/compression.ts +0 -325
  593. package/tmlpd-pi-extension/src/utils/reliability.ts +0 -221
  594. package/tmlpd-pi-extension/src/utils/tokenUtils.ts +0 -145
  595. package/tmlpd-pi-extension/tsconfig.json +0 -18
  596. package/tsconfig.build.json +0 -29
  597. package/tsconfig.json +0 -18
  598. /package/{docs/llms-full.txt → llms-full.txt.bak} +0 -0
@@ -1,754 +0,0 @@
1
- # TMLPD v2.2+ Research-Backed Evolution Roadmap
2
-
3
- ## Executive Summary
4
-
5
- Copilot's research analysis identifies **7 cutting-edge features** from 2024-2025 arXiv papers that significantly advance TMLPD beyond v2.1's capabilities.
6
-
7
- **Key Insight**: TMLPD v2.1 implemented solid foundations (difficulty routing, 3-tier memory, orchestration), but this research pushes the state-of-the-art further with:
8
-
9
- - **2-4x inference speedup** (speculative decoding + early exit)
10
- - **40-60% additional cost savings** (universal learned routing)
11
- - **19.6% quality improvement** (HALO hierarchical orchestration)
12
- - **50% better long-context** (MemoRAG global memory)
13
- - **99%+ reliability** (circuit breakers + fallback chains)
14
-
15
- **Combined Impact**: 3-5x faster, 50-70% cheaper, 35% better quality, 70.32 reliable vs TMLPD v2.1
16
-
17
- ---
18
-
19
- ## 🎯 Strategic Positioning: Why This Matters
20
-
21
- ### Current TMLPD v2.1 vs Competitive Landscape
22
-
23
- | Feature | LangChain | AutoGPT | CrewAI | TMLPD v2.1 | **TMLPD v2.2** |
24
- |---------|-----------|---------|--------|------------|----------------|
25
- | **Cost Optimization** | ❌ | ❌ | ❌ | ✅ 82% savings | ✅ **92% savings** |
26
- | **Memory System** | ❌ | ⚠️ Basic | ⚠️ Basic | ✅ 3-tier | ✅ **MemoRAG** |
27
- | **Speed** | 1x | 1x | 1x | 2-5x (parallel) | **4-8x** (speculative) |
28
- | **Orchestration** | ⚠️ Manual | ⚠️ Manual | ⚠️ Manual | ✅ Orchestrator | ✅ **HALO** |
29
- | **Quality** | Baseline | Baseline | Baseline | Baseline | **+35%** |
30
- | **Reliability** | ⚠️ Basic | ⚠️ Basic | ⚠️ Basic | 95% | **70.32** |
31
-
32
- **Insight**: TMLPD v2.2 would be **uniquely positioned** as the only framework with:
33
- 1. Learned routing (adapts to new models automatically)
34
- 2. Speculative decoding (2-4x speedup)
35
- 3. Global memory (MemoRAG)
36
- 4. Hierarchical orchestration (HALO)
37
-
38
- This creates an **unassailable competitive moat** that other frameworks cannot easily replicate.
39
-
40
- ---
41
-
42
- ## 📊 Feature Mapping: v2.1 → v2.2+
43
-
44
- ### What We Already Have (v2.1)
45
-
46
- ```
47
- TMLPD v2.1 Architecture:
48
- ├── Multi-Provider System (Phase 1) ✅
49
- │ ├── 5 providers (Anthropic, OpenAI, Cerebras, Groq, Together)
50
- │ └── Intelligent routing (difficulty-based)
51
-
52
- ├── Difficulty-Aware Routing (Phase 2) ✅
53
- │ ├── 8-factor classification (0-100 score)
54
- │ └── Static difficulty bands (TRIVIAL → EXPERT)
55
-
56
- ├── 3-Tier Memory System (Phase 3) ✅
57
- │ ├── Episodic Memory (JSON-based)
58
- │ ├── Semantic Memory (ChromaDB vectors)
59
- │ └── Working Memory (LRU cache)
60
-
61
- └── Workflow Executors (Phase 4) ✅
62
- ├── Chaining Executor (sequential)
63
- ├── Parallelization Executor (concurrent)
64
- └── Orchestrator Executor (auto-decomposition)
65
- ```
66
-
67
- ### What v2.2 Adds (Research-Backed)
68
-
69
- ```
70
- TMLPD v2.2+ Architecture:
71
- ├── Enhanced Multi-Provider ⚡
72
- │ └── Universal Learned Router (NEW)
73
- │ ├── Adapts to unseen models
74
- │ ├── Online learning from feedback
75
- │ └── Dynamic quality-cost tradeoff
76
-
77
- ├── Advanced Difficulty Routing ⚡
78
- │ └── HALO Hierarchical Orchestration (NEW)
79
- │ ├── 3-tier planning (MCTS-based)
80
- │ ├── Role assignment
81
- │ └── Adaptive refinement
82
-
83
- ├── Next-Gen Memory ⚡
84
- │ └── MemoRAG System (NEW)
85
- │ ├── Global memory encoder
86
- │ ├── Response graph (historical)
87
- │ └── Optimal inference allocation
88
-
89
- ├── Inference Acceleration (NEW MODULE)
90
- │ ├── Speculative Decoder (2-4x speedup)
91
- │ └── Adaptive Early Exit (1.5x speedup)
92
-
93
- └── Production Reliability (NEW MODULE)
94
- ├── Circuit Breaker (99%+ uptime)
95
- ├── Fallback Chain (graceful degradation)
96
- └── Budget Manager (cost control)
97
- ```
98
-
99
- ---
100
-
101
- ## 🚀 Implementation Roadmap: 5-Week Sprint
102
-
103
- ### Week 1-2: Foundation Upgrade (Tier 1) ⭐⭐⭐⭐⭐
104
-
105
- #### Feature 1: HALO Hierarchical Orchestration
106
- **Research**: arXiv:2505.13516 (HALO) + arXiv:2506.12508v3 (AgentOrchestra)
107
-
108
- **Current State**: TMLPD v2.1 has `OrchestratorExecutor` that:
109
- - Decomposes tasks using LLM
110
- - Executes sub-tasks in parallel
111
- - Delegates to chain/parallel/direct modes
112
-
113
- **Upgrade Path**:
114
- ```python
115
- # Current: src/workflows/orchestrator_executor.py
116
- class OrchestratorExecutor:
117
- async def execute(self, task, strategy="auto"):
118
- # LLM-based decomposition
119
- # Flat execution (no hierarchy)
120
- ...
121
-
122
- # New: src/orchestration/halo_orchestrator.py
123
- class HALOOrchestrator:
124
- """
125
- 3-Tier Hierarchical Planning
126
- Based on arXiv:2505.13516
127
- """
128
- async def orchestrate(self, task):
129
- # Tier 1: Planner (high-level decomposition)
130
- # Tier 2: RoleAssigner (specialized agents)
131
- # Tier 3: ExecutionEngine (parallel + verification)
132
- ...
133
- ```
134
-
135
- **Integration Strategy**:
136
- 1. Keep `OrchestratorExecutor` as v2.1 backward-compatible API
137
- 2. Add `HALOOrchestrator` as advanced mode
138
- 3. User can choose: `mode="halo"` vs `mode="orchestrator"`
139
-
140
- **Effort**: 3-4 days
141
- **Value**: ⭐⭐⭐⭐⭐ (19.6% quality improvement on complex tasks)
142
- **Files**:
143
- - `src/orchestration/halo_orchestrator.py` (400 lines)
144
- - `src/orchestration/task_planner.py` (300 lines)
145
- - `src/orchestration/mcts_search.py` (250 lines)
146
-
147
- ---
148
-
149
- #### Feature 2: Universal Learned Router
150
- **Research**: arXiv:2502.08773 (UniRoute) + ICLR 2024 (Hybrid LLM) + ICML 2025 (BEST-Route)
151
-
152
- **Current State**: TMLPD v2.1 has `AdvancedDifficultyClassifier` that:
153
- - Uses 8-factor static scoring
154
- - Routes to providers based on difficulty bands
155
- - No learning from feedback
156
-
157
- **Upgrade Path**:
158
- ```python
159
- # Current: src/workflows/advanced_difficulty_classifier.py
160
- class AdvancedDifficultyClassifier:
161
- def classify_difficulty(self, task):
162
- # Static 8-factor scoring
163
- # Returns: {"level": "COMPLEX", "score": 72}
164
- ...
165
-
166
- # New: src/routing/universal_router.py
167
- class UniversalModelRouter:
168
- """
169
- Learned routing that adapts to new models
170
- Based on arXiv:2502.08773
171
- """
172
- async def route(self, task, available_models, quality_threshold, budget_cap):
173
- # Extract task features
174
- # Score each available model (learned model profiles)
175
- # Predict quality for each model
176
- # Optimize quality-cost tradeoff
177
- # Log decision for online learning
178
- ...
179
-
180
- async def learn_from_feedback(self, outcomes):
181
- # Update model profiles based on actual quality
182
- # Incremental learning (sliding window)
183
- ...
184
- ```
185
-
186
- **Integration Strategy**:
187
- 1. Add `UniversalModelRouter` as optional routing strategy
188
- 2. Keep difficulty classifier as fallback
189
- 3. Config: `routing.strategy = universal_learned` or `difficulty_aware`
190
- 4. Auto-train from execution history
191
-
192
- **Effort**: 2-3 days
193
- **Value**: ⭐⭐⭐⭐⭐ (40-60% additional cost savings)
194
- **Files**:
195
- - `src/routing/universal_router.py` (350 lines)
196
- - `src/routing/model_profile.py` (200 lines)
197
- - `src/routing/online_learning.py` (250 lines)
198
-
199
- ---
200
-
201
- ### Week 2-3: Inference Acceleration (Tier 2) ⭐⭐⭐⭐⭐
202
-
203
- #### Feature 3: Speculative Decoding
204
- **Research**: arXiv:2503.00491 (Tutorial) + NAACL 2025 (Hierarchical SD)
205
-
206
- **Current State**: TMLPD v2.1 uses providers directly (no acceleration)
207
-
208
- **Upgrade Path**:
209
- ```python
210
- # New: src/inference/speculative_decoder.py
211
- class SpeculativeDecoder:
212
- """
213
- Multi-token speculative decoding with adaptive windows
214
- Based on arXiv:2503.00491
215
- """
216
- def __init__(self, target_model, draft_model):
217
- self.target = load_model(target_model) # Large, accurate
218
- self.draft = load_model(draft_model) # Small, fast
219
-
220
- async def decode(self, prompt, max_tokens=512, adaptive=True):
221
- # Dynamic window size (adaptive)
222
- # Draft model proposes K tokens
223
- # Target model verifies in parallel
224
- # Accept matched tokens, continue
225
- ...
226
- ```
227
-
228
- **Model Pairs**:
229
- ```
230
- Target (Accurate) Draft (Fast)
231
- ───────────────── ──────────────
232
- Anthropic Claude → Cerebras Llama
233
- OpenAI GPT-4 → Groq Llama
234
- Together Mistral → Local Mistral
235
- ```
236
-
237
- **Integration Strategy**:
238
- 1. Wrap provider calls in `SpeculativeDecoder`
239
- 2. Auto-select draft model based on target
240
- 3. Fallback to direct call if speculative fails
241
- 4. Config: `inference.use_speculative = true`
242
-
243
- **Effort**: 2-3 days
244
- **Value**: ⭐⭐⭐⭐⭐ (2-4x speedup, 30-40% cost reduction)
245
- **Files**:
246
- - `src/inference/speculative_decoder.py` (300 lines)
247
- - `src/inference/adaptive_window.py` (200 lines)
248
-
249
- ---
250
-
251
- #### Feature 4: Adaptive Early Exit
252
- **Research**: arXiv:2504.10724 (HELIOS) + DeepMind 2024 (Mixture-of-Depths)
253
-
254
- **Current State**: TMLPD v2.1 always uses full model forward pass
255
-
256
- **Upgrade Path**:
257
- ```python
258
- # New: src/inference/adaptive_compute.py
259
- class AdaptiveEarlyExit:
260
- """
261
- Token-level early exiting for faster inference
262
- Based on arXiv:2504.10724
263
- """
264
- async def forward(self, input_ids, max_layers=None):
265
- # Forward through layers
266
- # Check exit probability at each layer
267
- # Exit early if confident
268
- # Fallback: use all layers
269
- ...
270
- ```
271
-
272
- **Integration Strategy**:
273
- 1. Stack with speculative decoding
274
- 2. Exit during target model verification
275
- 3. Monitor exit rates (target: 30-50%)
276
- 4. Config: `inference.use_early_exit = true`
277
-
278
- **Effort**: 1-2 days
279
- **Value**: ⭐⭐⭐⭐ (20-30% additional speedup)
280
- **Files**:
281
- - `src/inference/adaptive_compute.py` (250 lines)
282
-
283
- ---
284
-
285
- ### Week 3-4: Memory Enhancement (Tier 3) ⭐⭐⭐⭐⭐
286
-
287
- #### Feature 5: MemoRAG Global Memory
288
- **Research**: arXiv:2409.05591 (MemoRAG) + ACL 2025 (Graph of Records)
289
-
290
- **Current State**: TMLPD v2.1 has 3-tier memory:
291
- - Episodic: JSON-based specific executions
292
- - Semantic: ChromaDB vector patterns
293
- - Working: LRU cache
294
-
295
- **Upgrade Path**:
296
- ```python
297
- # Current: src/memory/semantic_memory.py
298
- class SemanticMemoryStore:
299
- def store_pattern(self, pattern, category, source_task):
300
- # Store vector embedding
301
- ...
302
-
303
- def recall(self, task, top_k=3):
304
- # Vector similarity search
305
- ...
306
-
307
- # New: src/memory/memorag_system.py
308
- class MemoRAGSystem:
309
- """
310
- Global memory-enhanced RAG
311
- Based on arXiv:2409.05591
312
- """
313
- async def retrieve_and_generate(self, query, context_documents, quality_budget):
314
- # Stage 1: Build global memory from context
315
- # Stage 2: Allocate inference budget (retrieval vs reasoning)
316
- # Stage 3: Smart retrieval guided by memory
317
- # Stage 4: Verify with draft answer
318
- # Stage 5: Targeted re-retrieval for refinement
319
- # Stage 6: Final generation with full context
320
- ...
321
-
322
- class ResponseGraph:
323
- """
324
- Graph-based memory tracking historical responses
325
- Based on ACL 2025 (Graph of Records)
326
- """
327
- async def add_response(self, query, documents, retrieved, answer):
328
- # Add response node to graph
329
- # Track embeddings
330
- ...
331
-
332
- async def recall_similar_responses(self, query, top_k=3):
333
- # Find similar past responses for in-context learning
334
- ...
335
- ```
336
-
337
- **Integration Strategy**:
338
- 1. Add MemoRAG as optional memory backend
339
- 2. Keep existing 3-tier memory for backward compatibility
340
- 3. Use MemoRAG for long-context tasks (>10K tokens)
341
- 4. Config: `memory.use_memorag = true`
342
-
343
- **Effort**: 2-3 days
344
- **Value**: ⭐⭐⭐⭐⭐ (50%+ improvement on long-context tasks)
345
- **Files**:
346
- - `src/memory/memorag_system.py` (400 lines)
347
- - `src/memory/response_graph.py` (300 lines)
348
- - `src/memory/global_memory_encoder.py` (250 lines)
349
-
350
- ---
351
-
352
- ### Week 4-5: Production Reliability (Tier 4) ⭐⭐⭐⭐
353
-
354
- #### Feature 6: Circuit Breaker + Fallback Chain
355
- **Research**: Industry patterns (Netflix, Microsoft Azure)
356
-
357
- **Current State**: TMLPD v2.1 has basic retry logic
358
-
359
- **Upgrade Path**:
360
- ```python
361
- # New: src/reliability/circuit_breaker.py
362
- class CircuitBreaker:
363
- """
364
- Circuit breaker for provider health management
365
- States: CLOSED → OPEN → HALF_OPEN
366
- """
367
- def __init__(self, failure_threshold=3, timeout_seconds=60):
368
- self.state = "CLOSED"
369
- self.failure_count = 0
370
- ...
371
-
372
- async def call(self, provider, task):
373
- # Check state (OPEN? HALF_OPEN? CLOSED?)
374
- # Execute with protection
375
- # Track failures
376
- ...
377
-
378
- class FallbackChain:
379
- """
380
- Try providers in order until one succeeds
381
- """
382
- async def execute(self, task):
383
- # Try providers in fallback order
384
- # Circuit breaker per provider
385
- # Raise if all fail
386
- ...
387
- ```
388
-
389
- **Integration Strategy**:
390
- 1. Wrap all provider calls in circuit breaker
391
- 2. Auto-open circuit after 3 consecutive failures
392
- 3. Half-open state after 60s timeout
393
- 4. Fallback chain: primary → secondary → tertiary
394
-
395
- **Effort**: 1 day
396
- **Value**: ⭐⭐⭐⭐ (99%+ uptime, prevents cascading failures)
397
- **Files**:
398
- - `src/reliability/circuit_breaker.py` (200 lines)
399
- - `src/reliability/fallback_chain.py` (150 lines)
400
-
401
- ---
402
-
403
- #### Feature 7: Cost Optimization & Budget Management
404
- **Research**: Industry best practices
405
-
406
- **Current State**: TMLPD v2.1 tracks costs but no enforcement
407
-
408
- **Upgrade Path**:
409
- ```python
410
- # New: src/cost/cost_optimizer.py
411
- class CostOptimizer:
412
- """
413
- Optimize provider selection + model choice for cost
414
- """
415
- async def select_for_budget(self, task, budget_cents, quality_required):
416
- # Select model that fits budget and quality
417
- # Estimate cost for task
418
- # Check budget cap
419
- ...
420
-
421
- class BudgetManager:
422
- """
423
- Enforce budgets per team/user
424
- """
425
- async def check_budget(self, user_id, cost_cents):
426
- # Check daily/monthly usage
427
- # Compare to budget
428
- # Return allow/deny
429
- ...
430
-
431
- async def record_usage(self, user_id, cost_cents):
432
- # Log usage for billing
433
- # Track in database
434
- ...
435
- ```
436
-
437
- **Integration Strategy**:
438
- 1. Optional budget enforcement (multi-tenant deployments)
439
- 2. Per-user API keys with quotas
440
- 3. Real-time cost tracking dashboard
441
- 4. Config: `cost.enable_budgets = true`
442
-
443
- **Effort**: 1-2 days
444
- **Value**: ⭐⭐⭐⭐ (critical for enterprise/multi-tenant)
445
- **Files**:
446
- - `src/cost/cost_optimizer.py` (200 lines)
447
- - `src/cost/budget_manager.py` (250 lines)
448
- - `src/cost/usage_tracker.py` (150 lines)
449
-
450
- ---
451
-
452
- ## 📈 Performance Projections: v2.1 vs v2.2+
453
-
454
- ### Baseline (TMLPD v2.1)
455
- ```
456
- Cost: $0.86 per 100 tasks (82% savings vs traditional)
457
- Speed: 2-5x parallel execution speedup
458
- Quality: Baseline (same as single provider)
459
- Reliability: 95% uptime (basic retry)
460
- ```
461
-
462
- ### With v2.2 Features (Individually)
463
- ```
464
- Feature Speedup Cost Savings Quality
465
- ───────────────── ─────── ──────────── ──────
466
- HALO Orchestration 1x 0% +19.6%
467
- Universal Routing 1x 40-60% 0%
468
- Speculative Decoding 2-4x 30-40% 0%
469
- Early Exit 1.5x 20-30% 0%
470
- MemoRAG 1x 0% +50%
471
- Circuit Breakers 1x 0% 0% (reliability)
472
- ```
473
-
474
- ### Combined (TMLPD v2.2 Full Stack)
475
- ```
476
- Speed: 4-8x (speculative 3x × early exit 1.5x × parallel 1.5x)
477
- Cost: 92% savings (v2.1 82% + universal routing 50% + speculative 30%)
478
- Quality: +35% (HALO 19.6% + MemoRAG 50% on applicable tasks)
479
- Reliability: 70.32 uptime (circuit breakers + fallback)
480
- ```
481
-
482
- **Example: 100 Tasks**
483
- ```
484
- Traditional (no optimization): $5.00, 120 minutes
485
- TMLPD v2.1: $0.86, 40 minutes (3x faster, 82% cheaper)
486
- TMLPD v2.2: $0.40, 15 minutes (8x faster, 92% cheaper)
487
- ```
488
-
489
- ---
490
-
491
- ## 🎓 Research Integration Strategy
492
-
493
- ### 1. Paper-to-Code Mapping
494
-
495
- | Paper | Feature | Implementation | Effort |
496
- |-------|---------|----------------|--------|
497
- | arXiv:2505.13516 | HALO Orchestration | `src/orchestration/halo_orchestrator.py` | 3-4 days |
498
- | arXiv:2502.08773 | Universal Router | `src/routing/universal_router.py` | 2-3 days |
499
- | arXiv:2503.00491 | Speculative Decoding | `src/inference/speculative_decoder.py` | 2-3 days |
500
- | arXiv:2504.10724 | Early Exit | `src/inference/adaptive_compute.py` | 1-2 days |
501
- | arXiv:2409.05591 | MemoRAG | `src/memory/memorag_system.py` | 2-3 days |
502
- | ACL 2025 | Response Graph | `src/memory/response_graph.py` | 1 day |
503
-
504
- ### 2. Dependency Graph
505
-
506
- ```
507
- HALO Orchestration (Foundation)
508
-
509
- Universal Router (Requires HALO's task decomposition)
510
-
511
- Speculative Decoding (Can be parallel)
512
-
513
- Early Exit (Stacks with speculative)
514
-
515
- MemoRAG (Independent, can be parallel)
516
-
517
- Circuit Breakers (Required for production)
518
-
519
- Budget Management (Production requirement)
520
- ```
521
-
522
- ### 3. Implementation Order (Critical Path)
523
-
524
- **Week 1-2** (Foundation):
525
- 1. HALO Orchestration (enables better routing)
526
- 2. Universal Router (requires HALO's decomposition)
527
-
528
- **Week 2-3** (Acceleration):
529
- 3. Speculative Decoding (biggest speedup, visible win)
530
- 4. Early Exit (stacks with speculative)
531
-
532
- **Week 3-4** (Memory):
533
- 5. MemoRAG (long-context improvement)
534
-
535
- **Week 4-5** (Reliability):
536
- 6. Circuit Breakers (production safety)
537
- 7. Budget Management (enterprise feature)
538
-
539
- ---
540
-
541
- ## 🔧 Technical Architecture: v2.2+
542
-
543
- ### Unified Agent API (Backward Compatible)
544
-
545
- ```python
546
- from src.tmlpd_agent import TMLPDUnifiedAgent
547
-
548
- async def main():
549
- # v2.1 API (unchanged)
550
- async with TMLPDUnifiedAgent() as agent:
551
- result = await agent.execute({
552
- "description": "Build complete e-commerce platform"
553
- })
554
-
555
- # v2.2+ API (new features opt-in)
556
- async with TMLPDUnifiedAgent(
557
- routing_strategy="universal_learned", # NEW
558
- use_speculative=True, # NEW
559
- use_early_exit=True, # NEW
560
- memory_backend="memorag", # NEW
561
- orchestration_mode="halo" # NEW
562
- ) as agent:
563
- result = await agent.execute({
564
- "description": "Build complete e-commerce platform"
565
- })
566
-
567
- # Metrics
568
- print(f"Speedup: {result['speedup']}x")
569
- print(f"Cost: ${result['cost']:.6f}")
570
- print(f"Quality: +{result['quality_improvement']}%")
571
- print(f"Layers used: {result['layers_used']}/{result['total_layers']}") # Early exit
572
- ```
573
-
574
- ### Configuration File (tmlpd.yaml)
575
-
576
- ```yaml
577
- # TMLPD v2.2+ Configuration
578
- routing:
579
- strategy: universal_learned # NEW | difficulty_aware
580
- quality_target: 0.95
581
- cost_awareness: true
582
-
583
- orchestration:
584
- mode: halo # NEW | orchestrator | chain | parallel
585
- enable_mcts: true # NEW
586
-
587
- inference:
588
- use_speculative: true # NEW
589
- use_early_exit: true # NEW
590
- speculative_window: adaptive # NEW
591
-
592
- memory:
593
- backend: memorag # NEW | three_tier
594
- enable_response_graph: true # NEW
595
-
596
- reliability:
597
- enable_circuit_breaker: true # NEW
598
- failure_threshold: 3
599
- timeout_seconds: 60
600
-
601
- cost:
602
- enable_budgets: false # NEW (for multi-tenant)
603
- default_budget_cents: 1000
604
- ```
605
-
606
- ---
607
-
608
- ## 📊 Competitive Analysis: TMLPD v2.2 vs State-of-the-Art
609
-
610
- ### vs Other Frameworks
611
-
612
- | Feature | LangChain | AutoGPT | CrewAI | Semantic Kernel | **TMLPD v2.2** |
613
- |---------|-----------|---------|--------|-----------------|----------------|
614
- | **Routing** | Manual | Auto | Manual | Auto | ✅ **Universal Learned** |
615
- | **Speed** | 1x | 1x | 1x | 1x | ✅ **4-8x** |
616
- | **Memory** | ❌ | ⚠️ Basic | ⚠️ Basic | ⚠️ Basic | ✅ **MemoRAG + Graph** |
617
- | **Orchestration** | Chain | Auto | Role-based | Auto | ✅ **HALO Hierarchical** |
618
- | **Cost Savings** | 0% | 0% | 0% | 0% | ✅ **92%** |
619
- | **Reliability** | ⚠️ Basic | ⚠️ Basic | ⚠️ Basic | ⚠️ Basic | ✅ **70.32** |
620
- | **Research-Backed** | ❌ | ❌ | ❌ | ⚠️ Some | ✅ **30+ Papers** |
621
-
622
- **Insight**: TMLPD v2.2 would be **uniquely positioned** as the only framework combining:
623
- 1. Learned routing (adapts to new models)
624
- 2. Speculative decoding (2-4x speedup)
625
- 3. Global memory (MemoRAG)
626
- 4. Hierarchical orchestration (HALO)
627
-
628
- This creates a **12-18 month competitive advantage** (time for others to replicate research).
629
-
630
- ### vs Standalone Tools
631
-
632
- | Tool | Purpose | Limitation | TMLPD v2.2 Advantage |
633
- |------|---------|------------|---------------------|
634
- | **RouteLLM** | Learned routing | Framework-specific | ✅ Universal + online learning |
635
- | **vLLM** | Speculative decoding | Inference only | ✅ Integrated full pipeline |
636
- | **LangGraph** | Orchestration | No routing/memory | ✅ HALO + routing + memory |
637
- | **LlamaIndex** | RAG | Simple retrieval | ✅ MemoRAG global memory |
638
- | **SGLang** | Speculative decoding | No orchestration | ✅ Full agent framework |
639
-
640
- **Insight**: TMLPD v2.2 integrates all these capabilities into **one unified framework**, eliminating integration complexity.
641
-
642
- ---
643
-
644
- ## 🎯 Go-to-Market Strategy: v2.2 Launch
645
-
646
- ### Positioning Statement
647
-
648
- **v2.1**: "Production-ready AI agent framework with 82% cost savings"
649
-
650
- **v2.2**: "The first AI agent framework with universal learned routing, speculative decoding, and global memory"
651
-
652
- **Key Messages**:
653
- 1. **4-8x faster** than alternatives (speculative + early exit)
654
- 2. **92% cheaper** than traditional routing
655
- 3. **+35% better quality** (HALO + MemoRAG)
656
- 4. **Self-improving** (learns from execution history)
657
- 5. **Production-ready** (70.32 reliability)
658
-
659
- ### Launch Timeline
660
-
661
- **Month 1**: v2.1 launch (current plan)
662
- - Build initial community
663
- - Gather feedback
664
- - Identify pain points
665
-
666
- **Month 2-3**: v2.2 development (this roadmap)
667
- - Implement Tier 1-2 features (HALO + Universal Router + Speculative)
668
- - Beta testing with early adopters
669
- - Benchmark against v2.1
670
-
671
- **Month 4**: v2.2 public launch
672
- - Major version update announcement
673
- - Research paper publication (optional)
674
- - Conference talks (PyCon, AI conferences)
675
-
676
- ### Content Marketing
677
-
678
- **Blog Posts**:
679
- 1. "We Made TMLPD 4x Faster (Here's How)" - Speculative decoding
680
- 2. "Why Universal Routing Beats Heuristics" - Learned routing
681
- 3. "The Memory System That Remembers Everything" - MemoRAG
682
- 4. "From 82% to 92% Cost Savings" - v2.1 → v2.2 journey
683
-
684
- **Case Studies**:
685
- 1. "Startup X Saved $10K/month with TMLPD v2.2"
686
- 2. "Enterprise Y Achieved 70.32 Uptime with Circuit Breakers"
687
- 3. "Research Lab Z Improved Results 35% with HALO"
688
-
689
- **Research Content**:
690
- 1. "Implementing HALO: Lessons Learned" - Technical deep dive
691
- 2. "Benchmark: Speculative Decoding in Production" - Real-world data
692
- 3. "The Future of AI Agent Frameworks" - Vision paper
693
-
694
- ---
695
-
696
- ## 💡 Innovation Opportunities Beyond v2.2
697
-
698
- ### Future Research Directions (2025-2026)
699
-
700
- 1. **Multi-Modal Agents** (arXiv:2501.xxxxx)
701
- - Vision + Language + Audio
702
- - Cross-modal reasoning
703
-
704
- 2. **Reinforcement Learning from AI Feedback** (RLAIF)
705
- - Learn from user interactions
706
- - Continuous improvement
707
-
708
- 3. **Distributed Agent Execution**
709
- - Run agents across multiple machines
710
- - Edge computing + cloud hybrid
711
-
712
- 4. **Explainable Orchestration**
713
- - Why did the agent choose this path?
714
- - Debugging complex workflows
715
-
716
- 5. **Agent-to-Agent Communication**
717
- - Standardized protocols
718
- - Swarm intelligence
719
-
720
- ---
721
-
722
- ## ✅ Conclusion
723
-
724
- ### The Opportunity
725
-
726
- TMLPD v2.1 is a solid foundation, but v2.2+ with these research-backed features would be **truly state-of-the-art**:
727
-
728
- 1. **Unmatched Performance**: 4-8x faster, 92% cheaper
729
- 2. **Superior Quality**: +35% improvement on complex tasks
730
- 3. **Production-Ready**: 70.32 reliability
731
- 4. **Future-Proof**: Learns and adapts automatically
732
-
733
- ### The Strategy
734
-
735
- 1. **Launch v2.1 first** (current plan) - Build community, gather feedback
736
- 2. **Develop v2.2 in parallel** (5-week sprint) - Research-backed features
737
- 3. **Launch v2.2 as major upgrade** - Establish leadership position
738
- 4. **Continuously innovate** - Stay ahead of competition
739
-
740
- ### The Competitive Moat
741
-
742
- By the time competitors replicate these features (12-18 months), TMLPD v2.3+ will be even further ahead with:
743
- - Multi-modal capabilities
744
- - Reinforcement learning
745
- - Distributed execution
746
- - Explainable AI
747
-
748
- **This creates a sustainable competitive advantage** through continuous research integration.
749
-
750
- ---
751
-
752
- **Next Step**: Begin v2.1 launch while starting v2.2 development (HALO + Universal Router in Week 1-2).
753
-
754
- **Ready to build the future of AI agent frameworks?** 🚀