adaptive-memory-multi-model-router 2.14.45 → 2.14.47

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (605)
  1. package/dist/index.d.ts +4 -0
  2. package/dist/index.js +8 -2
  3. package/dist/memory/hybridMemory.d.ts +71 -0
  4. package/dist/memory/hybridMemory.js +124 -0
  5. package/dist/memory/reasoningBank.d.ts +88 -0
  6. package/dist/memory/reasoningBank.js +303 -0
  7. package/{docs/llms.txt → llms.txt.bak} +6 -6
  8. package/package.json +13 -84
  9. package/src/index.ts +8 -0
  10. package/src/memory/hybridMemory.ts +155 -0
  11. package/src/memory/reasoningBank.ts +335 -0
  12. package/src/routing/advancedRouter.ts.bak +650 -0
  13. package/test.js.bak +376 -0
  14. package/.dockerignore +0 -82
  15. package/.env.example +0 -303
  16. package/.github/DISCUSSIONS_WELCOME.md +0 -27
  17. package/.github/DISCUSSION_TEMPLATE.yml +0 -5
  18. package/.github/FUNDING.yml +0 -2
  19. package/.github/ISSUE_TEMPLATE/bug_report.md +0 -94
  20. package/.github/ISSUE_TEMPLATE/config.yml +0 -17
  21. package/.github/ISSUE_TEMPLATE/feature_request.md +0 -71
  22. package/.github/PULL_REQUEST_TEMPLATE.md +0 -71
  23. package/.github/dependabot.yml +0 -9
  24. package/.github/workflows/auto-publish.yml +0 -51
  25. package/.github/workflows/ci.yml +0 -263
  26. package/.github/workflows/codeql.yml +0 -38
  27. package/.github/workflows/npm-publish.yml +0 -20
  28. package/.github/workflows/pages.yml +0 -37
  29. package/.github/workflows/stale.yml +0 -54
  30. package/.publish-tick +0 -1
  31. package/.well-known/ai-plugin.json +0 -16
  32. package/AGENT_COUNCIL_FINDINGS.md +0 -142
  33. package/ARCHITECTURE.md +0 -346
  34. package/AUDIT_REPORT.md +0 -28
  35. package/CODE_OF_CONDUCT.md +0 -128
  36. package/CONTRIBUTING.md +0 -50
  37. package/CONTRIBUTORS.md +0 -20
  38. package/Dockerfile +0 -53
  39. package/Dockerfile.proxy +0 -33
  40. package/HEALTH_REPORT.md +0 -118
  41. package/IMPROVEMENT_PLAN.md +0 -107
  42. package/LANDING.md +0 -43
  43. package/LAUNCH-PAIN-DRIVEN.md +0 -339
  44. package/LAUNCH.md +0 -337
  45. package/LAUNCH_CHECKLIST.md +0 -141
  46. package/LAUNCH_SNAPSHOT.md +0 -260
  47. package/MANIFESTO.md +0 -41
  48. package/POPULARITY_BOOSTERS.md +0 -285
  49. package/PR_STATUS_REPORT.md +0 -148
  50. package/REDESIGN.md +0 -95
  51. package/RUNKIT.md +0 -83
  52. package/SECURITY.md +0 -29
  53. package/SUBMISSIONS.md +0 -43
  54. package/_schema.html +0 -53
  55. package/ai-plugin.json +0 -16
  56. package/articles/AI_AGENT_LLM_ROUTING.md +0 -150
  57. package/articles/CHINESE_DIRECTORIES.md +0 -100
  58. package/articles/CHINESE_SUBMISSIONS_READY.md +0 -322
  59. package/articles/COMPETITOR_ALERTS.md +0 -31
  60. package/articles/COMPLETE_POSTING_DIRECTORY.md +0 -147
  61. package/articles/CONTENT_STRUCTURE.md +0 -292
  62. package/articles/DEVTO_COST_GUIDE.md +0 -473
  63. package/articles/DEVTO_FINAL.md +0 -416
  64. package/articles/DEVTO_MULTI_PROVIDER.md +0 -542
  65. package/articles/DEVTO_READY.md +0 -255
  66. package/articles/DEVTO_V2_ANNOUNCEMENT.md +0 -160
  67. package/articles/DEVTO_VIRAL_GROWTH.md +0 -280
  68. package/articles/FRESH_devto.md +0 -460
  69. package/articles/FRESH_devto_2026_05.md +0 -73
  70. package/articles/FRESH_hackernews.md +0 -14
  71. package/articles/FRESH_reddit_ml.md +0 -90
  72. package/articles/FRESH_reddit_node.md +0 -198
  73. package/articles/FRESH_reddit_sideproject.md +0 -72
  74. package/articles/FRESH_reddit_webdev.md +0 -130
  75. package/articles/FROM_ZERO_TO_10K.md +0 -107
  76. package/articles/HN_10X_BETTER.md +0 -430
  77. package/articles/HN_ACCOUNT_GUIDE.md +0 -21
  78. package/articles/HN_CHINESE_STYLE.md +0 -308
  79. package/articles/HN_FINAL.md +0 -148
  80. package/articles/HN_POSTED_VERSION.md +0 -56
  81. package/articles/HN_POST_READY.md +0 -137
  82. package/articles/HN_RESEARCH.md +0 -364
  83. package/articles/HN_SHOW_routerarena.md +0 -17
  84. package/articles/HN_TIMING_GUIDE.md +0 -52
  85. package/articles/INDIEHACKERS_POST.md +0 -52
  86. package/articles/INDIEHACKERS_READY.md +0 -120
  87. package/articles/LLM_BENCHMARK_DEEP_DIVE.md +0 -153
  88. package/articles/MASTER_POSTING_DIRECTORY.md +0 -189
  89. package/articles/NEWSLETTER_SEND_NOW.md +0 -259
  90. package/articles/NEWSLETTER_SUBMISSIONS.md +0 -112
  91. package/articles/PAIN-DRIVEN-devto-v2.md +0 -308
  92. package/articles/PAIN-DRIVEN-devto-v3.md +0 -268
  93. package/articles/PAIN-DRIVEN-devto.md +0 -242
  94. package/articles/PAIN-DRIVEN-hackernews-v2.md +0 -138
  95. package/articles/PAIN-DRIVEN-hackernews-v3.md +0 -151
  96. package/articles/PAIN-DRIVEN-hackernews.md +0 -131
  97. package/articles/PAIN-DRIVEN-reddit-v2.md +0 -301
  98. package/articles/PAIN-DRIVEN-reddit-v3.md +0 -236
  99. package/articles/PAIN-DRIVEN-reddit.md +0 -218
  100. package/articles/PAIN-DRIVEN-twitter-v2.md +0 -110
  101. package/articles/PAIN-DRIVEN-twitter-v3.md +0 -121
  102. package/articles/PAIN-DRIVEN-twitter.md +0 -120
  103. package/articles/PORTKEY_VS_A3M.md +0 -147
  104. package/articles/POSTING_KIT_2026_05.md +0 -67
  105. package/articles/PRESS_KIT_routerarena.md +0 -77
  106. package/articles/PRODUCTHUNT_LISTING.md +0 -48
  107. package/articles/PRODUCTHUNT_READY.md +0 -106
  108. package/articles/PR_PLAN_vault.md +0 -125
  109. package/articles/REDDIT_FINAL.md +0 -232
  110. package/articles/REDDIT_POST.md +0 -67
  111. package/articles/REDDIT_SUBMISSION_READY.md +0 -348
  112. package/articles/ROUTERARENA_LEADER.md +0 -45
  113. package/articles/SHOW_HN_FINAL.md +0 -29
  114. package/articles/TWEETS_10K_DOWNLOADS.md +0 -47
  115. package/articles/TWEETS_BENCHMARK_FIRST.md +0 -46
  116. package/articles/TWEETS_MCP_PLAY.md +0 -51
  117. package/articles/TWEETS_SEQUENTIAL_BROKEN.md +0 -49
  118. package/articles/TWEETS_WHY_BUILD.md +0 -54
  119. package/articles/TWEETS_routerarena_leader.md +0 -53
  120. package/articles/TWEET_STORM_READY.md +0 -165
  121. package/articles/TWITTER_FINAL.md +0 -167
  122. package/articles/WHY_10X_BETTER.md +0 -261
  123. package/articles/WHY_CHINESE_STYLE_BETTER.md +0 -323
  124. package/articles/ai-discoverability-llm-routing.md +0 -210
  125. package/articles/devto-llm-routing.md +0 -138
  126. package/articles/hackernews-show-hn.md +0 -54
  127. package/articles/hashnode-llm-cost-optimization.md +0 -125
  128. package/articles/hn_show_2026_05.md +0 -11
  129. package/articles/medium-building-llm-router.md +0 -205
  130. package/articles/reddit-ml.md +0 -76
  131. package/articles/twitter-thread-cost-savings.md +0 -50
  132. package/articles/youtube-tutorial-script.md +0 -262
  133. package/assets/a3m_3blue1brown.mp4 +0 -0
  134. package/assets/banner.svg +0 -109
  135. package/assets/chart-cost-v2.svg +0 -91
  136. package/assets/chart-cost-v3.svg +0 -143
  137. package/assets/chart-features-v2.svg +0 -132
  138. package/assets/chart-features-v3.svg +0 -211
  139. package/assets/chart-growth-v2.svg +0 -122
  140. package/assets/chart-growth-v3.svg +0 -189
  141. package/assets/cost-comparison.svg +0 -134
  142. package/assets/cost-simple.svg +0 -64
  143. package/assets/demo-hn.gif +0 -0
  144. package/assets/feature-matrix.svg +0 -136
  145. package/assets/growth-chart-animated.svg +0 -76
  146. package/assets/growth-chart.svg +0 -82
  147. package/assets/growth-simple.svg +0 -69
  148. package/assets/hero-diagram.svg +0 -81
  149. package/assets/logo-new.svg +0 -21
  150. package/assets/logo.svg +0 -68
  151. package/assets/provider-comparison.svg +0 -121
  152. package/assets/social-preview-new.svg +0 -100
  153. package/assets/social-preview.svg +0 -194
  154. package/assets/social-v2.svg +0 -130
  155. package/assets/social-v3.svg +0 -212
  156. package/benchmark-provider-results.json +0 -245
  157. package/benchmark-results.json +0 -54
  158. package/council-votes/architecture-vote.md +0 -121
  159. package/council-votes/coverage-vote.md +0 -93
  160. package/data/adaptive-benchmark.json +0 -92
  161. package/data/benchmark-results.json +0 -47
  162. package/data/labeled-benchmark.json +0 -88
  163. package/demo/3blue1brown_video.py +0 -285
  164. package/demo/3blue1brown_video_v2.py +0 -310
  165. package/demo/IMPROVED_PROMPTS.md +0 -229
  166. package/demo/VEO3_PROMPTS.md +0 -269
  167. package/demo/VIDEO_PRODUCTION_GUIDE.md +0 -333
  168. package/demo/a3m_3blue1brown.mp4 +0 -0
  169. package/demo/asciinema-demo.sh +0 -195
  170. package/demo/demo-hn.tape +0 -74
  171. package/demo/demo-script.md +0 -53
  172. package/demo/demo-script.sh +0 -62
  173. package/demo/demo.svg +0 -75
  174. package/demo/frame1_ai_data_center.png +0 -0
  175. package/demo/frame1_sunset_video.mp4 +0 -0
  176. package/demo/frame2_cost_comparison.png +0 -0
  177. package/demo/frame2_cost_comparison_fallback.png +0 -0
  178. package/demo/frame3_parallel_execution.png +0 -0
  179. package/demo/frame3_parallel_execution_fallback.png +0 -0
  180. package/demo/frame4_providers.png +0 -0
  181. package/demo/frame4_providers_fallback.png +0 -0
  182. package/demo/frame5_endcard.png +0 -0
  183. package/demo/frame5_endcard_fallback.png +0 -0
  184. package/demo/new_frame1_hook.png +0 -0
  185. package/demo/new_frame2_proof.png +0 -0
  186. package/demo/new_frame3_wow.png +0 -0
  187. package/demo/new_frame4_social.png +0 -0
  188. package/demo/new_frame5_cta.png +0 -0
  189. package/demo/package.json +0 -13
  190. package/demo/product-video-final.mp4 +0 -0
  191. package/demo/product-video-hype-v1.mp4 +0 -0
  192. package/demo/product-video-v1.mp4 +0 -0
  193. package/demo/public/index.html +0 -762
  194. package/demo/recording.cast +0 -55
  195. package/demo/server.js +0 -405
  196. package/demo-new.tape +0 -71
  197. package/demo-real.sh +0 -198
  198. package/demo-simple.tape +0 -205
  199. package/demo.html +0 -520
  200. package/demo.sh +0 -85
  201. package/demo.tape +0 -259
  202. package/dist/analytics/costAnalytics.d.ts.map +0 -1
  203. package/dist/analytics/costAnalytics.js.map +0 -1
  204. package/dist/benchmark/comprehensive.js.map +0 -1
  205. package/dist/benchmark/reproducible.d.ts.map +0 -1
  206. package/dist/benchmark/reproducible.js.map +0 -1
  207. package/dist/cache/prefixCache.d.ts.map +0 -1
  208. package/dist/cache/prefixCache.js.map +0 -1
  209. package/dist/cache/responseCache.d.ts.map +0 -1
  210. package/dist/cache/responseCache.js.map +0 -1
  211. package/dist/cache/semanticCache.d.ts.map +0 -1
  212. package/dist/cache/semanticCache.js.map +0 -1
  213. package/dist/cli/setupWizard.d.ts.map +0 -1
  214. package/dist/cli/setupWizard.js.map +0 -1
  215. package/dist/cost/budgetEnforcer.d.ts.map +0 -1
  216. package/dist/cost/budgetEnforcer.js.map +0 -1
  217. package/dist/cost/costTracker.d.ts.map +0 -1
  218. package/dist/cost/costTracker.js.map +0 -1
  219. package/dist/ensemble/multiRoundDialog.js.map +0 -1
  220. package/dist/ensemble/shapleyValue.js.map +0 -1
  221. package/dist/integrations/langchainAdapter.d.ts.map +0 -1
  222. package/dist/integrations/langchainAdapter.js.map +0 -1
  223. package/dist/integrations/oauth.d.ts.map +0 -1
  224. package/dist/integrations/oauth.js.map +0 -1
  225. package/dist/integrations/scienceAdapter.js.map +0 -1
  226. package/dist/memory/autoFetch.d.ts.map +0 -1
  227. package/dist/memory/autoFetch.js.map +0 -1
  228. package/dist/memory/episodicMemory.d.ts.map +0 -1
  229. package/dist/memory/episodicMemory.js.map +0 -1
  230. package/dist/memory/memoryTree.d.ts.map +0 -1
  231. package/dist/memory/memoryTree.js.map +0 -1
  232. package/dist/memory/obsidianVault.d.ts.map +0 -1
  233. package/dist/memory/obsidianVault.js.map +0 -1
  234. package/dist/observability/changeWatch.d.ts.map +0 -1
  235. package/dist/observability/changeWatch.js.map +0 -1
  236. package/dist/observability/fatigueDetector.d.ts.map +0 -1
  237. package/dist/observability/fatigueDetector.js.map +0 -1
  238. package/dist/observability/index.d.ts.map +0 -1
  239. package/dist/observability/index.js.map +0 -1
  240. package/dist/observability/metrics.d.ts.map +0 -1
  241. package/dist/observability/metrics.js.map +0 -1
  242. package/dist/observability/middleware.d.ts.map +0 -1
  243. package/dist/observability/middleware.js.map +0 -1
  244. package/dist/observability/tracer.d.ts.map +0 -1
  245. package/dist/observability/tracer.js.map +0 -1
  246. package/dist/observability/types.d.ts.map +0 -1
  247. package/dist/observability/types.js.map +0 -1
  248. package/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
  249. package/dist/orchestration/haloOrchestrator.js.map +0 -1
  250. package/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
  251. package/dist/orchestration/mctsWorkflow.js.map +0 -1
  252. package/dist/providers/localProvider.d.ts.map +0 -1
  253. package/dist/providers/localProvider.js.map +0 -1
  254. package/dist/providers/providerConfig.d.ts.map +0 -1
  255. package/dist/providers/providerConfig.js.map +0 -1
  256. package/dist/providers/registry.d.ts.map +0 -1
  257. package/dist/providers/registry.js.map +0 -1
  258. package/dist/routing/advancedRouter.d.ts.map +0 -1
  259. package/dist/routing/advancedRouter.js.map +0 -1
  260. package/dist/routing/crossModelValidation.d.ts.map +0 -1
  261. package/dist/routing/crossModelValidation.js.map +0 -1
  262. package/dist/routing/providerHealth.d.ts.map +0 -1
  263. package/dist/routing/providerHealth.js.map +0 -1
  264. package/dist/routing/providerRetry.d.ts.map +0 -1
  265. package/dist/routing/providerRetry.js.map +0 -1
  266. package/dist/scripts/banner.js +0 -29
  267. package/dist/security/guardrails.d.ts.map +0 -1
  268. package/dist/security/guardrails.js.map +0 -1
  269. package/dist/server/dashboard.d.ts.map +0 -1
  270. package/dist/server/dashboard.js.map +0 -1
  271. package/dist/server/modelMapper.d.ts.map +0 -1
  272. package/dist/server/modelMapper.js.map +0 -1
  273. package/dist/server/proxyServer.d.ts.map +0 -1
  274. package/dist/server/proxyServer.js.map +0 -1
  275. package/dist/skills/__tests__/skill_manager.test.d.ts +0 -2
  276. package/dist/skills/__tests__/skill_manager.test.d.ts.map +0 -1
  277. package/dist/skills/__tests__/skill_manager.test.js +0 -268
  278. package/dist/skills/__tests__/skill_manager.test.js.map +0 -1
  279. package/dist/tools/tmlpdTools.d.ts.map +0 -1
  280. package/dist/tools/tmlpdTools.js.map +0 -1
  281. package/dist/tui/dashboard.d.ts.map +0 -1
  282. package/dist/tui/dashboard.js.map +0 -1
  283. package/dist/tui/index.d.ts.map +0 -1
  284. package/dist/tui/index.js.map +0 -1
  285. package/dist/utils/batchProcessor.d.ts.map +0 -1
  286. package/dist/utils/batchProcessor.js.map +0 -1
  287. package/dist/utils/compression.d.ts.map +0 -1
  288. package/dist/utils/compression.js.map +0 -1
  289. package/dist/utils/costUtils.d.ts.map +0 -1
  290. package/dist/utils/costUtils.js.map +0 -1
  291. package/dist/utils/reliability.d.ts.map +0 -1
  292. package/dist/utils/reliability.js.map +0 -1
  293. package/dist/utils/sorting.d.ts.map +0 -1
  294. package/dist/utils/sorting.js.map +0 -1
  295. package/dist/utils/speculativeDecoding.d.ts.map +0 -1
  296. package/dist/utils/speculativeDecoding.js.map +0 -1
  297. package/dist/utils/tokenUtils.d.ts.map +0 -1
  298. package/dist/utils/tokenUtils.js.map +0 -1
  299. package/docs/.nojekyll +0 -0
  300. package/docs/ANALYSIS_PRINCIPLES.md +0 -162
  301. package/docs/API.md +0 -855
  302. package/docs/ARCHITECTURAL-IMPROVEMENTS-2025.md +0 -1391
  303. package/docs/ARCHITECTURAL-IMPROVEMENTS-REVISED-2025.md +0 -1051
  304. package/docs/BENCHMARK.md +0 -170
  305. package/docs/CHINESE_PROVIDER_RELIABILITY.md +0 -37
  306. package/docs/CITATIONS.md +0 -74
  307. package/docs/CLAIMS_AND_EVIDENCE.md +0 -58
  308. package/docs/CONFIGURATION.md +0 -476
  309. package/docs/COUNCIL_DECISION.json +0 -816
  310. package/docs/COUNCIL_SUMMARY.md +0 -319
  311. package/docs/COUNCIL_V2.2_DECISION.md +0 -416
  312. package/docs/ENGINEERING_SPEC.md +0 -55
  313. package/docs/FACTORY_RESET.md +0 -34
  314. package/docs/GEO.md +0 -66
  315. package/docs/GEO_OPTIMIZATION.md +0 -30
  316. package/docs/GEO_ROOT_CAUSE.md +0 -136
  317. package/docs/GEO_STATUS.md +0 -85
  318. package/docs/GEO_TEST_RESULTS.md +0 -176
  319. package/docs/HN_CHECKLIST.md +0 -38
  320. package/docs/HN_FOUNDER_COMMENT.md +0 -17
  321. package/docs/HN_SUBMISSION_FINAL.md +0 -180
  322. package/docs/HN_SUBMISSION_V3.md +0 -56
  323. package/docs/IMPROVEMENT_ROADMAP.md +0 -515
  324. package/docs/INTEGRATIONS.md +0 -420
  325. package/docs/LANGCHAIN_INTEGRATION.md +0 -147
  326. package/docs/LLM_COUNCIL_DECISION.md +0 -508
  327. package/docs/MIDDLEWARE_CHAIN.md +0 -35
  328. package/docs/PROMO_CHECKLIST.md +0 -200
  329. package/docs/QUICKSTART.md +0 -271
  330. package/docs/QUICK_START.md +0 -43
  331. package/docs/QUICK_START_VISIBILITY.md +0 -782
  332. package/docs/REDDIT_GAP_ANALYSIS.md +0 -299
  333. package/docs/RELEASE_CHECKLIST.md +0 -32
  334. package/docs/REPRODUCIBILITY.md +0 -63
  335. package/docs/RESEARCH_BACKED_IMPROVEMENTS.md +0 -1180
  336. package/docs/ROUTING_RUBRIC.md +0 -197
  337. package/docs/SEO_AUDIT.md +0 -186
  338. package/docs/SOCIAL_LISTENING.md +0 -219
  339. package/docs/TMLPD_QNA.md +0 -751
  340. package/docs/TMLPD_V2.1_COMPLETE.md +0 -763
  341. package/docs/TMLPD_V2.2_RESEARCH_ROADMAP.md +0 -754
  342. package/docs/UPDATE_TOPICS.md +0 -15
  343. package/docs/USE_CASES.md +0 -59
  344. package/docs/V2.2_IMPLEMENTATION_COMPLETE.md +0 -446
  345. package/docs/V2_IMPLEMENTATION_GUIDE.md +0 -388
  346. package/docs/VERCEL_AI_SDK.md +0 -209
  347. package/docs/VISIBILITY_ADOPTION_PLAN.md +0 -1005
  348. package/docs/_config.yml +0 -49
  349. package/docs/ai-plugin.json +0 -16
  350. package/docs/api.html +0 -513
  351. package/docs/architecture-diagram.md +0 -40
  352. package/docs/benchmark-chart.png +0 -0
  353. package/docs/benchmark.html +0 -387
  354. package/docs/blog/routerarena-number-one.html +0 -73
  355. package/docs/cli-cheatsheet.md +0 -339
  356. package/docs/compare.md +0 -109
  357. package/docs/comparison-litellm.md +0 -88
  358. package/docs/comparison.md +0 -108
  359. package/docs/cost-chart-ascii.md +0 -42
  360. package/docs/cost-comparison-chart.svg +0 -88
  361. package/docs/curl-examples.md +0 -247
  362. package/docs/demo-auto.html +0 -264
  363. package/docs/demo.html +0 -416
  364. package/docs/geo/GENERATIVE_ENGINE_OPTIMIZATION.md +0 -232
  365. package/docs/index.html +0 -507
  366. package/docs/launch-content/LAUNCH_EXECUTION_CHECKLIST.md +0 -421
  367. package/docs/launch-content/README.md +0 -457
  368. package/docs/launch-content/assets/cost_comparison_100_tasks.png +0 -0
  369. package/docs/launch-content/assets/cumulative_savings.png +0 -0
  370. package/docs/launch-content/assets/parallel_speedup.png +0 -0
  371. package/docs/launch-content/assets/provider_pricing_comparison.png +0 -0
  372. package/docs/launch-content/assets/task_breakdown_comparison.png +0 -0
  373. package/docs/launch-content/generate_charts.py +0 -313
  374. package/docs/launch-content/hn_show_post.md +0 -139
  375. package/docs/launch-content/partner_outreach_templates.md +0 -745
  376. package/docs/launch-content/reddit_posts.md +0 -467
  377. package/docs/launch-content/twitter_thread.txt +0 -460
  378. package/docs/npm-downloads-chart.svg +0 -43
  379. package/docs/openapi.json +0 -139
  380. package/docs/openapi.yaml +0 -1318
  381. package/docs/quick-start.html +0 -366
  382. package/docs/robots.txt +0 -52
  383. package/docs/sitemap.xml +0 -57
  384. package/docs/styles.css +0 -682
  385. package/docs/well-known/ai-plugin.json +0 -16
  386. package/docs/wellknown/ai-plugin.json +0 -16
  387. package/docs-site/assets/og-banner.svg +0 -194
  388. package/docs-site/index.html +0 -632
  389. package/eval/README.md +0 -46
  390. package/eval/baselines/main.json +0 -12
  391. package/eval/benchmark_dataset.jsonl +0 -16
  392. package/eval/check_golden_routes.js +0 -64
  393. package/eval/datasets/catalog.json +0 -33
  394. package/eval/datasets/slices/cn_provider_reliability_v1.jsonl +0 -3
  395. package/eval/datasets/slices/cost_pressure_v1.jsonl +0 -3
  396. package/eval/datasets/slices/safety_guardrails_v1.jsonl +0 -3
  397. package/eval/evals.json +0 -199
  398. package/eval/fault_injection_thresholds.json +0 -3
  399. package/eval/generate_report.js +0 -128
  400. package/eval/golden_routes.json +0 -114
  401. package/eval/lib/experiment_registry.js +0 -24
  402. package/eval/run_eval.js +0 -197
  403. package/eval/run_fault_injection.js +0 -201
  404. package/eval/run_shadow_eval.js +0 -85
  405. package/eval/thresholds.json +0 -9
  406. package/examples/QUICKSTART.md +0 -183
  407. package/examples/README.md +0 -61
  408. package/examples/a3m-sdk.js +0 -124
  409. package/examples/basic-route.js +0 -54
  410. package/examples/chat-loop.js +0 -202
  411. package/examples/classify-then-route.js +0 -102
  412. package/examples/cost-compare.js +0 -120
  413. package/examples/ensemble.js +0 -160
  414. package/examples/whatsapp-telegram-bridge-demo.js +0 -302
  415. package/examples/whatsapp-telegram-bridge.js +0 -269
  416. package/hf-space/README.md +0 -23
  417. package/hf-space/app.py +0 -240
  418. package/hf-space/requirements.txt +0 -1
  419. package/huggingface_space/README.md +0 -35
  420. package/huggingface_space/app.py +0 -126
  421. package/huggingface_space/create_space.py +0 -208
  422. package/huggingface_space/requirements.txt +0 -1
  423. package/mcp-server/README.md +0 -188
  424. package/mcp-server/package.json +0 -29
  425. package/mcp-server/src/index.ts +0 -744
  426. package/mcp-server/tsconfig.json +0 -19
  427. package/openclaw-alexa-bridge/ALL_REMAINING_FIXES_PLAN.md +0 -313
  428. package/openclaw-alexa-bridge/REMAINING_FIXES_SUMMARY.md +0 -277
  429. package/openclaw-alexa-bridge/src/alexa_handler_no_tmlpd.js +0 -1234
  430. package/openclaw-alexa-bridge/test_fixes.js +0 -77
  431. package/playground/README.md +0 -51
  432. package/playground/codesandbox.json +0 -12
  433. package/playground/index.js +0 -39
  434. package/proxy/README.md +0 -227
  435. package/proxy/package-lock.json +0 -831
  436. package/proxy/package.json +0 -17
  437. package/proxy/rate-limit.js +0 -145
  438. package/proxy/rate-limit.test.js +0 -311
  439. package/proxy/server.js +0 -970
  440. package/python/README.md +0 -102
  441. package/python/a3m/__init__.py +0 -6
  442. package/python/a3m/client.py +0 -190
  443. package/python/a3m/models.py +0 -40
  444. package/python/a3m/sync_client.py +0 -61
  445. package/python/examples.py +0 -53
  446. package/python/integrations.py +0 -330
  447. package/python/pyproject.toml +0 -23
  448. package/python/setup.py +0 -28
  449. package/python/tmlpd.py +0 -369
  450. package/qna/REDDIT_GAP_ANALYSIS.md +0 -299
  451. package/qna/TMLPD_QNA.md +0 -751
  452. package/research/FINDING_001_safety.md +0 -28
  453. package/research/FINDING_002_error_diversity.md +0 -32
  454. package/research/FINDING_003_confidence_weighted_voting.md +0 -32
  455. package/research/FINDING_004_cross_model_semantic_detection.md +0 -37
  456. package/research/FINDING_005_knowledge_gap_orthogonality.md +0 -34
  457. package/research/HALLUCINATION_RESEARCH.md +0 -27
  458. package/research/ensemble-voting.md +0 -324
  459. package/research/loss-functions.md +0 -545
  460. package/research-log.md +0 -49
  461. package/scripts/banner.js +0 -29
  462. package/scripts/benchmark-local-routerarena.ts +0 -176
  463. package/scripts/benchmark.js +0 -145
  464. package/scripts/benchmark.sh +0 -61
  465. package/scripts/compare-providers.sh +0 -230
  466. package/scripts/content-planner.js +0 -25
  467. package/scripts/create-labeled-benchmark.ts +0 -105
  468. package/scripts/cross_post.py +0 -443
  469. package/scripts/local-router-benchmark.ts +0 -154
  470. package/scripts/post-all.sh +0 -41
  471. package/scripts/publish_fcc.py +0 -106
  472. package/scripts/push-to-gitee.sh +0 -25
  473. package/scripts/routerarena_ensemble.js +0 -144
  474. package/scripts/routing-benchmark-v2.js +0 -373
  475. package/scripts/routing-benchmark-v3.js +0 -118
  476. package/scripts/routing-benchmark.js +0 -462
  477. package/scripts/run-labeled-benchmark.mjs +0 -104
  478. package/scripts/run-mmlu-benchmark.js +0 -176
  479. package/scripts/run-provider-benchmark.js +0 -244
  480. package/scripts/update-npm-badges.js +0 -158
  481. package/skill/SKILL.md +0 -238
  482. package/src/__tests__/integration/tmpld_integration.test.py +0 -540
  483. package/src/skills/__tests__/skill_manager.test.ts +0 -328
  484. package/submissions/benchmarks/ALL_PLATFORMS_SUBMISSION.md +0 -94
  485. package/submissions/benchmarks/LLMROUTERBENCH_SUBMISSION.md +0 -121
  486. package/submissions/benchmarks/MMRBENCH_SUBMISSION.md +0 -94
  487. package/submissions/benchmarks/ROUTERARENA_UPDATE.md +0 -83
  488. package/submissions/benchmarks/ROUTERBENCH_SUBMISSION.md +0 -225
  489. package/test-council/1-structure-tests.test.js +0 -353
  490. package/test-council/1-structure-tests.test.ts +0 -353
  491. package/test-council/2-edge-case-tests.test.ts +0 -361
  492. package/test-council/3-performance-tests.test.ts +0 -669
  493. package/test-council/4-integration-tests.test.ts +0 -391
  494. package/test-council/5-agent-council-eval.test.ts +0 -413
  495. package/test-council/AGENT_COUNCIL_ARCHITECTURE.md +0 -349
  496. package/test-council/TEST_COUNCIL_REPORT.md +0 -201
  497. package/test-council/agents/edge-case-agent.ts +0 -363
  498. package/test-council/agents/performance-agent.ts +0 -426
  499. package/test-council/agents/structure-agent.ts +0 -227
  500. package/test-council/council.md +0 -183
  501. package/tests/__mocks__/tokenUtils.ts +0 -8
  502. package/tests/memory/episodicMemory.test.ts +0 -227
  503. package/tests/package-lock.json +0 -1628
  504. package/tests/package.json +0 -18
  505. package/tests/routing/ensembleVoting.test.ts +0 -236
  506. package/tests/routing/providerRetry.test.ts +0 -360
  507. package/tests/routing/queryTypePresets.test.ts +0 -208
  508. package/tests/security/guardrailEngine.test.ts +0 -700
  509. package/tests/tsconfig.json +0 -21
  510. package/tests/vitest.config.ts +0 -18
  511. package/tmlpd-pi-extension/README.md +0 -66
  512. package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts +0 -114
  513. package/tmlpd-pi-extension/dist/cache/prefixCache.d.ts.map +0 -1
  514. package/tmlpd-pi-extension/dist/cache/prefixCache.js +0 -285
  515. package/tmlpd-pi-extension/dist/cache/prefixCache.js.map +0 -1
  516. package/tmlpd-pi-extension/dist/cache/responseCache.d.ts +0 -58
  517. package/tmlpd-pi-extension/dist/cache/responseCache.d.ts.map +0 -1
  518. package/tmlpd-pi-extension/dist/cache/responseCache.js +0 -153
  519. package/tmlpd-pi-extension/dist/cache/responseCache.js.map +0 -1
  520. package/tmlpd-pi-extension/dist/cli.js +0 -59
  521. package/tmlpd-pi-extension/dist/cost/costTracker.d.ts +0 -95
  522. package/tmlpd-pi-extension/dist/cost/costTracker.d.ts.map +0 -1
  523. package/tmlpd-pi-extension/dist/cost/costTracker.js +0 -240
  524. package/tmlpd-pi-extension/dist/cost/costTracker.js.map +0 -1
  525. package/tmlpd-pi-extension/dist/index.d.ts +0 -723
  526. package/tmlpd-pi-extension/dist/index.d.ts.map +0 -1
  527. package/tmlpd-pi-extension/dist/index.js +0 -239
  528. package/tmlpd-pi-extension/dist/index.js.map +0 -1
  529. package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts +0 -82
  530. package/tmlpd-pi-extension/dist/memory/episodicMemory.d.ts.map +0 -1
  531. package/tmlpd-pi-extension/dist/memory/episodicMemory.js +0 -145
  532. package/tmlpd-pi-extension/dist/memory/episodicMemory.js.map +0 -1
  533. package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts +0 -102
  534. package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.d.ts.map +0 -1
  535. package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js +0 -207
  536. package/tmlpd-pi-extension/dist/orchestration/haloOrchestrator.js.map +0 -1
  537. package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts +0 -85
  538. package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.d.ts.map +0 -1
  539. package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js +0 -210
  540. package/tmlpd-pi-extension/dist/orchestration/mctsWorkflow.js.map +0 -1
  541. package/tmlpd-pi-extension/dist/providers/localProvider.d.ts +0 -102
  542. package/tmlpd-pi-extension/dist/providers/localProvider.d.ts.map +0 -1
  543. package/tmlpd-pi-extension/dist/providers/localProvider.js +0 -338
  544. package/tmlpd-pi-extension/dist/providers/localProvider.js.map +0 -1
  545. package/tmlpd-pi-extension/dist/providers/registry.d.ts +0 -55
  546. package/tmlpd-pi-extension/dist/providers/registry.d.ts.map +0 -1
  547. package/tmlpd-pi-extension/dist/providers/registry.js +0 -138
  548. package/tmlpd-pi-extension/dist/providers/registry.js.map +0 -1
  549. package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts +0 -68
  550. package/tmlpd-pi-extension/dist/routing/advancedRouter.d.ts.map +0 -1
  551. package/tmlpd-pi-extension/dist/routing/advancedRouter.js +0 -332
  552. package/tmlpd-pi-extension/dist/routing/advancedRouter.js.map +0 -1
  553. package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts +0 -101
  554. package/tmlpd-pi-extension/dist/tools/tmlpdTools.d.ts.map +0 -1
  555. package/tmlpd-pi-extension/dist/tools/tmlpdTools.js +0 -368
  556. package/tmlpd-pi-extension/dist/tools/tmlpdTools.js.map +0 -1
  557. package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts +0 -96
  558. package/tmlpd-pi-extension/dist/utils/batchProcessor.d.ts.map +0 -1
  559. package/tmlpd-pi-extension/dist/utils/batchProcessor.js +0 -170
  560. package/tmlpd-pi-extension/dist/utils/batchProcessor.js.map +0 -1
  561. package/tmlpd-pi-extension/dist/utils/compression.d.ts +0 -61
  562. package/tmlpd-pi-extension/dist/utils/compression.d.ts.map +0 -1
  563. package/tmlpd-pi-extension/dist/utils/compression.js +0 -281
  564. package/tmlpd-pi-extension/dist/utils/compression.js.map +0 -1
  565. package/tmlpd-pi-extension/dist/utils/reliability.d.ts +0 -74
  566. package/tmlpd-pi-extension/dist/utils/reliability.d.ts.map +0 -1
  567. package/tmlpd-pi-extension/dist/utils/reliability.js +0 -177
  568. package/tmlpd-pi-extension/dist/utils/reliability.js.map +0 -1
  569. package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts +0 -117
  570. package/tmlpd-pi-extension/dist/utils/speculativeDecoding.d.ts.map +0 -1
  571. package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js +0 -246
  572. package/tmlpd-pi-extension/dist/utils/speculativeDecoding.js.map +0 -1
  573. package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts +0 -50
  574. package/tmlpd-pi-extension/dist/utils/tokenUtils.d.ts.map +0 -1
  575. package/tmlpd-pi-extension/dist/utils/tokenUtils.js +0 -124
  576. package/tmlpd-pi-extension/dist/utils/tokenUtils.js.map +0 -1
  577. package/tmlpd-pi-extension/examples/QUICKSTART.md +0 -183
  578. package/tmlpd-pi-extension/package-lock.json +0 -79
  579. package/tmlpd-pi-extension/package.json +0 -172
  580. package/tmlpd-pi-extension/python/examples.py +0 -53
  581. package/tmlpd-pi-extension/python/integrations.py +0 -330
  582. package/tmlpd-pi-extension/python/setup.py +0 -28
  583. package/tmlpd-pi-extension/python/tmlpd.py +0 -369
  584. package/tmlpd-pi-extension/qna/REDDIT_GAP_ANALYSIS.md +0 -299
  585. package/tmlpd-pi-extension/qna/TMLPD_QNA.md +0 -751
  586. package/tmlpd-pi-extension/skill/SKILL.md +0 -238
  587. package/tmlpd-pi-extension/src/cache/responseCache.ts +0 -147
  588. package/tmlpd-pi-extension/src/cost/costTracker.ts +0 -302
  589. package/tmlpd-pi-extension/src/index.ts +0 -232
  590. package/tmlpd-pi-extension/src/memory/episodicMemory.ts +0 -257
  591. package/tmlpd-pi-extension/src/orchestration/haloOrchestrator.ts +0 -266
  592. package/tmlpd-pi-extension/src/orchestration/mctsWorkflow.ts +0 -262
  593. package/tmlpd-pi-extension/src/providers/localProvider.ts +0 -406
  594. package/tmlpd-pi-extension/src/providers/registry.ts +0 -164
  595. package/tmlpd-pi-extension/src/routing/ensembleVoting.ts +0 -159
  596. package/tmlpd-pi-extension/src/routing/queryTypePresets.ts +0 -136
  597. package/tmlpd-pi-extension/src/tools/tmlpdTools.ts +0 -433
  598. package/tmlpd-pi-extension/src/utils/batchProcessor.ts +0 -232
  599. package/tmlpd-pi-extension/src/utils/compression.ts +0 -325
  600. package/tmlpd-pi-extension/src/utils/reliability.ts +0 -221
  601. package/tmlpd-pi-extension/src/utils/tokenUtils.ts +0 -145
  602. package/tmlpd-pi-extension/tsconfig.json +0 -18
  603. package/tsconfig.build.json +0 -29
  604. package/tsconfig.json +0 -18
  605. package/{docs/llms-full.txt → llms-full.txt.bak} +0 -0
@@ -1,754 +0,0 @@
1
- # TMLPD v2.2+ Research-Backed Evolution Roadmap
2
-
3
- ## Executive Summary
4
-
5
- Copilot's research analysis identifies **7 cutting-edge features** from 2024-2025 arXiv papers that significantly advance TMLPD beyond v2.1's capabilities.
6
-
7
- **Key Insight**: TMLPD v2.1 implemented solid foundations (difficulty routing, 3-tier memory, orchestration), but this research pushes the state-of-the-art further with:
8
-
9
- - **2-4x inference speedup** (speculative decoding + early exit)
10
- - **40-60% additional cost savings** (universal learned routing)
11
- - **19.6% quality improvement** (HALO hierarchical orchestration)
12
- - **50% better long-context** (MemoRAG global memory)
13
- - **99%+ reliability** (circuit breakers + fallback chains)
14
-
15
- **Combined Impact** (vs TMLPD v2.1): 3-5x faster, 50-70% cheaper, 35% better quality, 99%+ reliability
16
-
17
- ---
18
-
19
- ## 🎯 Strategic Positioning: Why This Matters
20
-
21
- ### Current TMLPD v2.1 vs Competitive Landscape
22
-
23
- | Feature | LangChain | AutoGPT | CrewAI | TMLPD v2.1 | **TMLPD v2.2** |
24
- |---------|-----------|---------|--------|------------|----------------|
25
- | **Cost Optimization** | ❌ | ❌ | ❌ | ✅ 82% savings | ✅ **92% savings** |
26
- | **Memory System** | ❌ | ⚠️ Basic | ⚠️ Basic | ✅ 3-tier | ✅ **MemoRAG** |
27
- | **Speed** | 1x | 1x | 1x | 2-5x (parallel) | **4-8x** (speculative) |
28
- | **Orchestration** | ⚠️ Manual | ⚠️ Manual | ⚠️ Manual | ✅ Orchestrator | ✅ **HALO** |
29
- | **Quality** | Baseline | Baseline | Baseline | Baseline | **+35%** |
30
- | **Reliability** | ⚠️ Basic | ⚠️ Basic | ⚠️ Basic | 95% | **99%+** |
31
-
32
- **Insight**: TMLPD v2.2 would be **uniquely positioned** as the only framework with:
33
- 1. Learned routing (adapts to new models automatically)
34
- 2. Speculative decoding (2-4x speedup)
35
- 3. Global memory (MemoRAG)
36
- 4. Hierarchical orchestration (HALO)
37
-
38
- This creates an **unassailable competitive moat** that other frameworks cannot easily replicate.
39
-
40
- ---
41
-
42
- ## 📊 Feature Mapping: v2.1 → v2.2+
43
-
44
- ### What We Already Have (v2.1)
45
-
46
- ```
47
- TMLPD v2.1 Architecture:
48
- ├── Multi-Provider System (Phase 1) ✅
49
- │ ├── 5 providers (Anthropic, OpenAI, Cerebras, Groq, Together)
50
- │ └── Intelligent routing (difficulty-based)
51
-
52
- ├── Difficulty-Aware Routing (Phase 2) ✅
53
- │ ├── 8-factor classification (0-100 score)
54
- │ └── Static difficulty bands (TRIVIAL → EXPERT)
55
-
56
- ├── 3-Tier Memory System (Phase 3) ✅
57
- │ ├── Episodic Memory (JSON-based)
58
- │ ├── Semantic Memory (ChromaDB vectors)
59
- │ └── Working Memory (LRU cache)
60
-
61
- └── Workflow Executors (Phase 4) ✅
62
- ├── Chaining Executor (sequential)
63
- ├── Parallelization Executor (concurrent)
64
- └── Orchestrator Executor (auto-decomposition)
65
- ```
66
-
67
- ### What v2.2 Adds (Research-Backed)
68
-
69
- ```
70
- TMLPD v2.2+ Architecture:
71
- ├── Enhanced Multi-Provider ⚡
72
- │ └── Universal Learned Router (NEW)
73
- │ ├── Adapts to unseen models
74
- │ ├── Online learning from feedback
75
- │ └── Dynamic quality-cost tradeoff
76
-
77
- ├── Advanced Difficulty Routing ⚡
78
- │ └── HALO Hierarchical Orchestration (NEW)
79
- │ ├── 3-tier planning (MCTS-based)
80
- │ ├── Role assignment
81
- │ └── Adaptive refinement
82
-
83
- ├── Next-Gen Memory ⚡
84
- │ └── MemoRAG System (NEW)
85
- │ ├── Global memory encoder
86
- │ ├── Response graph (historical)
87
- │ └── Optimal inference allocation
88
-
89
- ├── Inference Acceleration (NEW MODULE)
90
- │ ├── Speculative Decoder (2-4x speedup)
91
- │ └── Adaptive Early Exit (1.5x speedup)
92
-
93
- └── Production Reliability (NEW MODULE)
94
- ├── Circuit Breaker (99%+ uptime)
95
- ├── Fallback Chain (graceful degradation)
96
- └── Budget Manager (cost control)
97
- ```
98
-
99
- ---
100
-
101
- ## 🚀 Implementation Roadmap: 5-Week Sprint
102
-
103
- ### Week 1-2: Foundation Upgrade (Tier 1) ⭐⭐⭐⭐⭐
104
-
105
- #### Feature 1: HALO Hierarchical Orchestration
106
- **Research**: arXiv:2505.13516 (HALO) + arXiv:2506.12508v3 (AgentOrchestra)
107
-
108
- **Current State**: TMLPD v2.1 has `OrchestratorExecutor` that:
109
- - Decomposes tasks using LLM
110
- - Executes sub-tasks in parallel
111
- - Delegates to chain/parallel/direct modes
112
-
113
- **Upgrade Path**:
114
- ```python
115
- # Current: src/workflows/orchestrator_executor.py
116
- class OrchestratorExecutor:
117
- async def execute(self, task, strategy="auto"):
118
- # LLM-based decomposition
119
- # Flat execution (no hierarchy)
120
- ...
121
-
122
- # New: src/orchestration/halo_orchestrator.py
123
- class HALOOrchestrator:
124
- """
125
- 3-Tier Hierarchical Planning
126
- Based on arXiv:2505.13516
127
- """
128
- async def orchestrate(self, task):
129
- # Tier 1: Planner (high-level decomposition)
130
- # Tier 2: RoleAssigner (specialized agents)
131
- # Tier 3: ExecutionEngine (parallel + verification)
132
- ...
133
- ```
134
-
135
- **Integration Strategy**:
136
- 1. Keep `OrchestratorExecutor` as v2.1 backward-compatible API
137
- 2. Add `HALOOrchestrator` as advanced mode
138
- 3. User can choose: `mode="halo"` vs `mode="orchestrator"`
139
-
140
- **Effort**: 3-4 days
141
- **Value**: ⭐⭐⭐⭐⭐ (19.6% quality improvement on complex tasks)
142
- **Files**:
143
- - `src/orchestration/halo_orchestrator.py` (400 lines)
144
- - `src/orchestration/task_planner.py` (300 lines)
145
- - `src/orchestration/mcts_search.py` (250 lines)
146
-
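The three tiers described above (planner, role assigner, execution engine) can be sketched as follows. This is a minimal illustration, not the roadmap's implementation: `plan`, `assign_roles`, `run_agent`, and `orchestrate` are hypothetical names, the planner splits on the word "and" instead of using an LLM or MCTS, and the agent call is a stub.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class SubTask:
    description: str
    role: str = "generalist"


def plan(task: str) -> list[SubTask]:
    """Tier 1 (hypothetical): split on ' and ' as a stand-in for LLM/MCTS planning."""
    return [SubTask(part.strip()) for part in task.split(" and ") if part.strip()]


def assign_roles(subtasks: list[SubTask]) -> list[SubTask]:
    """Tier 2 (hypothetical): keyword-based role assignment."""
    for sub in subtasks:
        text = sub.description.lower()
        if "test" in text:
            sub.role = "tester"
        elif "design" in text:
            sub.role = "architect"
        else:
            sub.role = "coder"
    return subtasks


async def run_agent(sub: SubTask) -> str:
    """Stub for a provider call; a real agent would call a model here."""
    await asyncio.sleep(0)
    return f"[{sub.role}] done: {sub.description}"


async def orchestrate(task: str) -> list[str]:
    """Tier 3: run sub-tasks concurrently, then keep only non-empty results."""
    subtasks = assign_roles(plan(task))
    results = await asyncio.gather(*(run_agent(s) for s in subtasks))
    return [r for r in results if r]  # trivial verification step


halo_results = asyncio.run(
    orchestrate("design the schema and write the API and test the checkout")
)
for line in halo_results:
    print(line)
```

`asyncio.gather` preserves input order, so results line up with the planned sub-tasks even though they run concurrently.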
147
- ---
148
-
149
- #### Feature 2: Universal Learned Router
150
- **Research**: arXiv:2502.08773 (UniRoute) + ICLR 2024 (Hybrid LLM) + ICML 2025 (BEST-Route)
151
-
152
- **Current State**: TMLPD v2.1 has `AdvancedDifficultyClassifier` that:
153
- - Uses 8-factor static scoring
154
- - Routes to providers based on difficulty bands
155
- - No learning from feedback
156
-
157
- **Upgrade Path**:
158
- ```python
159
- # Current: src/workflows/advanced_difficulty_classifier.py
160
- class AdvancedDifficultyClassifier:
161
- def classify_difficulty(self, task):
162
- # Static 8-factor scoring
163
- # Returns: {"level": "COMPLEX", "score": 72}
164
- ...
165
-
166
- # New: src/routing/universal_router.py
167
- class UniversalModelRouter:
168
- """
169
- Learned routing that adapts to new models
170
- Based on arXiv:2502.08773
171
- """
172
- async def route(self, task, available_models, quality_threshold, budget_cap):
173
- # Extract task features
174
- # Score each available model (learned model profiles)
175
- # Predict quality for each model
176
- # Optimize quality-cost tradeoff
177
- # Log decision for online learning
178
- ...
179
-
180
- async def learn_from_feedback(self, outcomes):
181
- # Update model profiles based on actual quality
182
- # Incremental learning (sliding window)
183
- ...
184
- ```
185
-
186
- **Integration Strategy**:
187
- 1. Add `UniversalModelRouter` as optional routing strategy
188
- 2. Keep difficulty classifier as fallback
189
- 3. Config: `routing.strategy = universal_learned` or `difficulty_aware`
190
- 4. Auto-train from execution history
191
-
192
- **Effort**: 2-3 days
193
- **Value**: ⭐⭐⭐⭐⭐ (40-60% additional cost savings)
194
- **Files**:
195
- - `src/routing/universal_router.py` (350 lines)
196
- - `src/routing/model_profile.py` (200 lines)
197
- - `src/routing/online_learning.py` (250 lines)
198
-
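A minimal sketch of the routing idea above, assuming a much simpler model than UniRoute: each model profile keeps a sliding window of observed quality scores, and the router picks the cheapest model whose predicted quality meets the threshold within the budget cap. `ModelProfile`, `SimpleRouter`, the prior values, and the costs are all hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class ModelProfile:
    name: str
    cost_per_task: float          # assumed unit: dollars per task
    prior_quality: float = 0.5    # used until feedback arrives
    # Sliding window of observed quality scores in [0, 1].
    scores: deque = field(default_factory=lambda: deque(maxlen=50))

    def predicted_quality(self) -> float:
        if not self.scores:
            return self.prior_quality
        return sum(self.scores) / len(self.scores)


class SimpleRouter:
    def __init__(self, profiles):
        self.profiles = {p.name: p for p in profiles}

    def route(self, available_models, quality_threshold, budget_cap):
        """Cheapest model that meets the threshold; else best affordable; else None."""
        affordable = [
            self.profiles[m]
            for m in available_models
            if self.profiles[m].cost_per_task <= budget_cap
        ]
        good = [p for p in affordable if p.predicted_quality() >= quality_threshold]
        if good:
            return min(good, key=lambda p: p.cost_per_task).name
        if affordable:
            return max(affordable, key=lambda p: p.predicted_quality()).name
        return None

    def learn_from_feedback(self, outcomes):
        """outcomes: iterable of (model_name, observed_quality) pairs."""
        for name, quality in outcomes:
            self.profiles[name].scores.append(quality)


router = SimpleRouter([
    ModelProfile("small", cost_per_task=0.001, prior_quality=0.6),
    ModelProfile("large", cost_per_task=0.02, prior_quality=0.9),
])
first_choice = router.route(["small", "large"], quality_threshold=0.8, budget_cap=0.05)
router.learn_from_feedback([("small", 0.9), ("small", 0.85)])
second_choice = router.route(["small", "large"], quality_threshold=0.8, budget_cap=0.05)
print(first_choice, second_choice)
```

Once feedback shows the cheaper model clearing the threshold, the router switches to it, which is where the cost savings in this kind of design would come from.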
199
- ---
200
-
201
- ### Week 2-3: Inference Acceleration (Tier 2) ⭐⭐⭐⭐⭐
202
-
203
- #### Feature 3: Speculative Decoding
204
- **Research**: arXiv:2503.00491 (Tutorial) + NAACL 2025 (Hierarchical SD)
205
-
206
- **Current State**: TMLPD v2.1 uses providers directly (no acceleration)
207
-
208
- **Upgrade Path**:
209
- ```python
210
- # New: src/inference/speculative_decoder.py
211
- class SpeculativeDecoder:
212
- """
213
- Multi-token speculative decoding with adaptive windows
214
- Based on arXiv:2503.00491
215
- """
216
- def __init__(self, target_model, draft_model):
217
- self.target = load_model(target_model) # Large, accurate
218
- self.draft = load_model(draft_model) # Small, fast
219
-
220
- async def decode(self, prompt, max_tokens=512, adaptive=True):
221
- # Dynamic window size (adaptive)
222
- # Draft model proposes K tokens
223
- # Target model verifies in parallel
224
- # Accept matched tokens, continue
225
- ...
226
- ```
227
-
228
- **Model Pairs**:
229
- ```
230
- Target (Accurate) Draft (Fast)
231
- ───────────────── ──────────────
232
- Anthropic Claude → Cerebras Llama
233
- OpenAI GPT-4 → Groq Llama
234
- Together Mistral → Local Mistral
235
- ```
236
-
237
- **Integration Strategy**:
238
- 1. Wrap provider calls in `SpeculativeDecoder`
239
- 2. Auto-select draft model based on target
240
- 3. Fallback to direct call if speculative fails
241
- 4. Config: `inference.use_speculative = true`
242
-
243
- **Effort**: 2-3 days
244
- **Value**: ⭐⭐⭐⭐⭐ (2-4x speedup, 30-40% cost reduction)
245
- **Files**:
246
- - `src/inference/speculative_decoder.py` (300 lines)
247
- - `src/inference/adaptive_window.py` (200 lines)
248
-
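The draft-then-verify loop above can be illustrated with a toy greedy variant. This sketch assumes deterministic stand-in "models" over integer tokens (`target_next` and `draft_next` are hypothetical) and verifies positions one at a time, whereas a real system scores the whole draft window in a single batched pass of the target model.

```python
from typing import Callable, List

NextToken = Callable[[List[int]], int]


def target_next(context: List[int]) -> int:
    """Toy 'large' model: next token is (sum of context) mod 7."""
    return sum(context) % 7


def draft_next(context: List[int]) -> int:
    """Toy 'small' model: agrees with the target unless the sum is divisible by 5."""
    s = sum(context)
    return (s + 1) % 7 if s % 5 == 0 else s % 7


def greedy_decode(prompt: List[int], target: NextToken, max_tokens: int) -> List[int]:
    """Baseline: one target call per generated token."""
    tokens = list(prompt)
    for _ in range(max_tokens):
        tokens.append(target(tokens))
    return tokens[len(prompt):]


def speculative_decode(prompt: List[int], target: NextToken, draft: NextToken,
                       max_tokens: int, window: int = 4) -> List[int]:
    tokens = list(prompt)
    while len(tokens) - len(prompt) < max_tokens:
        # 1. The draft model proposes `window` tokens autoregressively.
        proposal: List[int] = []
        for _ in range(window):
            proposal.append(draft(tokens + proposal))
        # 2. The target checks each proposed position in order.
        for i, tok in enumerate(proposal):
            expected = target(tokens + proposal[:i])
            if tok != expected:
                # Keep the matching prefix, then take the target's own token.
                tokens.extend(proposal[:i])
                tokens.append(expected)
                break
        else:
            tokens.extend(proposal)  # every drafted token was accepted
    return tokens[len(prompt):len(prompt) + max_tokens]


spec_out = speculative_decode([1, 2, 3], target_next, draft_next, max_tokens=12)
base_out = greedy_decode([1, 2, 3], target_next, max_tokens=12)
print(spec_out == base_out)
```

Because every kept token is one the target would have produced itself, the output matches target-only greedy decoding; any speedup comes from how many drafted tokens are accepted per verification pass.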
249
- ---
250
-
251
- #### Feature 4: Adaptive Early Exit
252
- **Research**: arXiv:2504.10724 (HELIOS) + DeepMind 2024 (Mixture-of-Depths)
253
-
254
- **Current State**: TMLPD v2.1 always uses a full model forward pass
255
-
256
- **Upgrade Path**:
257
- ```python
258
- # New: src/inference/adaptive_compute.py
259
- class AdaptiveEarlyExit:
260
- """
261
- Token-level early exiting for faster inference
262
- Based on arXiv:2504.10724
263
- """
264
- async def forward(self, input_ids, max_layers=None):
265
- # Forward through layers
266
- # Check exit probability at each layer
267
- # Exit early if confident
268
- # Fallback: use all layers
269
- ...
270
- ```
271
-
272
- **Integration Strategy**:
273
- 1. Stack with speculative decoding
274
- 2. Exit during target model verification
275
- 3. Monitor exit rates (target: 30-50%)
276
- 4. Config: `inference.use_early_exit = true`
277
-
278
- **Effort**: 1-2 days
279
- **Value**: ⭐⭐⭐⭐ (20-30% additional speedup)
280
- **Files**:
281
- - `src/inference/adaptive_compute.py` (250 lines)
282
-
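A minimal sketch of confidence-based early exit, assuming direct access to per-layer outputs (something hosted provider APIs generally do not expose). The "layers" here are toy functions, and `forward_with_early_exit`, `sharpen`, and the 0.9 threshold are hypothetical choices for illustration.

```python
import math


def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def forward_with_early_exit(logits, layers, threshold=0.9):
    """Apply layers in order; stop once the top class probability reaches `threshold`."""
    layers_used = 0
    for layer in layers:
        logits = layer(logits)
        layers_used += 1
        if max(softmax(logits)) >= threshold:
            break  # confident enough, skip the remaining layers
    probs = softmax(logits)
    return probs.index(max(probs)), layers_used


def sharpen(logits):
    """Hypothetical layer: doubles the logits, which increases confidence."""
    return [2.0 * x for x in logits]


toy_layers = [sharpen] * 6
easy_pred, easy_layers = forward_with_early_exit([2.0, 0.0, 0.0], toy_layers)
hard_pred, hard_layers = forward_with_early_exit([0.1, 0.0, 0.0], toy_layers)
print(easy_layers, hard_layers)
```

The "easy" input exits after fewer layers than the "hard" one, which is the behavior the exit-rate monitoring in the integration strategy would measure.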
283
- ---
284
-
285
- ### Week 3-4: Memory Enhancement (Tier 3) ⭐⭐⭐⭐⭐
286
-
287
- #### Feature 5: MemoRAG Global Memory
288
- **Research**: arXiv:2409.05591 (MemoRAG) + ACL 2025 (Graph of Records)
289
-
290
- **Current State**: TMLPD v2.1 has 3-tier memory:
291
- - Episodic: JSON-based specific executions
292
- - Semantic: ChromaDB vector patterns
293
- - Working: LRU cache
294
-
295
- **Upgrade Path**:
296
- ```python
297
- # Current: src/memory/semantic_memory.py
298
- class SemanticMemoryStore:
299
- def store_pattern(self, pattern, category, source_task):
300
- # Store vector embedding
301
- ...
302
-
303
- def recall(self, task, top_k=3):
304
- # Vector similarity search
305
- ...
306
-
307
- # New: src/memory/memorag_system.py
308
- class MemoRAGSystem:
309
- """
310
- Global memory-enhanced RAG
311
- Based on arXiv:2409.05591
312
- """
313
- async def retrieve_and_generate(self, query, context_documents, quality_budget):
314
- # Stage 1: Build global memory from context
315
- # Stage 2: Allocate inference budget (retrieval vs reasoning)
316
- # Stage 3: Smart retrieval guided by memory
317
- # Stage 4: Verify with draft answer
318
- # Stage 5: Targeted re-retrieval for refinement
319
- # Stage 6: Final generation with full context
320
- ...
321
-
322
- class ResponseGraph:
323
- """
324
- Graph-based memory tracking historical responses
325
- Based on ACL 2025 (Graph of Records)
326
- """
327
- async def add_response(self, query, documents, retrieved, answer):
328
- # Add response node to graph
329
- # Track embeddings
330
- ...
331
-
332
- async def recall_similar_responses(self, query, top_k=3):
333
- # Find similar past responses for in-context learning
334
- ...
335
- ```
336
-
337
- **Integration Strategy**:
338
- 1. Add MemoRAG as optional memory backend
339
- 2. Keep existing 3-tier memory for backward compatibility
340
- 3. Use MemoRAG for long-context tasks (>10K tokens)
341
- 4. Config: `memory.use_memorag = true`
342
-
343
- **Effort**: 2-3 days
344
- **Value**: ⭐⭐⭐⭐⭐ (50%+ improvement on long-context tasks)
345
- **Files**:
346
- - `src/memory/memorag_system.py` (400 lines)
347
- - `src/memory/response_graph.py` (300 lines)
348
- - `src/memory/global_memory_encoder.py` (250 lines)
349
-
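The `recall_similar_responses` idea above can be sketched with a flat store and bag-of-words cosine similarity. This is an assumption-laden simplification: there are no graph edges, `embed` is a word-count stand-in for a learned encoder, and `ResponseStore` is a hypothetical name.

```python
import math
from collections import Counter


def embed(text):
    """Stand-in embedding: bag-of-words token counts (not a learned encoder)."""
    return Counter(text.lower().split())


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class ResponseStore:
    def __init__(self):
        self.nodes = []  # list of (query, answer, embedding)

    def add_response(self, query, answer):
        self.nodes.append((query, answer, embed(query)))

    def recall_similar_responses(self, query, top_k=3):
        """Return the top_k stored (query, answer) pairs most similar to `query`."""
        q = embed(query)
        ranked = sorted(self.nodes, key=lambda n: cosine(q, n[2]), reverse=True)
        return [(n[0], n[1]) for n in ranked[:top_k]]


store = ResponseStore()
store.add_response("how to reset a password", "Use the account settings page.")
store.add_response("configure circuit breaker timeout", "Set timeout_seconds in the config.")
store.add_response("reset the router cache", "Restart the routing service.")
top_match = store.recall_similar_responses("reset my password", top_k=1)
print(top_match)
```

Recalled pairs could then be placed in the prompt as in-context examples, which is the use the roadmap describes for similar past responses.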
350
- ---
351
-
352
- ### Week 4-5: Production Reliability (Tier 4) ⭐⭐⭐⭐
353
-
354
- #### Feature 6: Circuit Breaker + Fallback Chain
355
- **Research**: Industry patterns (Netflix, Microsoft Azure)
356
-
357
- **Current State**: TMLPD v2.1 has basic retry logic
358
-
359
- **Upgrade Path**:
360
- ```python
361
- # New: src/reliability/circuit_breaker.py
362
- class CircuitBreaker:
363
- """
364
- Circuit breaker for provider health management
365
- States: CLOSED → OPEN → HALF_OPEN
366
- """
367
- def __init__(self, failure_threshold=3, timeout_seconds=60):
368
- self.state = "CLOSED"
369
- self.failure_count = 0
370
- ...
371
-
372
- async def call(self, provider, task):
373
- # Check state (OPEN? HALF_OPEN? CLOSED?)
374
- # Execute with protection
375
- # Track failures
376
- ...
377
-
378
- class FallbackChain:
379
- """
380
- Try providers in order until one succeeds
381
- """
382
- async def execute(self, task):
383
- # Try providers in fallback order
384
- # Circuit breaker per provider
385
- # Raise if all fail
386
- ...
387
- ```
388
-
389
- **Integration Strategy**:
390
- 1. Wrap all provider calls in circuit breaker
391
- 2. Auto-open circuit after 3 consecutive failures
392
- 3. Half-open state after 60s timeout
393
- 4. Fallback chain: primary → secondary → tertiary
394
-
395
- **Effort**: 1 day
396
- **Value**: ⭐⭐⭐⭐ (99%+ uptime, prevents cascading failures)
397
- **Files**:
398
- - `src/reliability/circuit_breaker.py` (200 lines)
399
- - `src/reliability/fallback_chain.py` (150 lines)
400
-
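The state machine and fallback order above can be sketched as follows. This is a synchronous simplification of the async interface in the roadmap; `CircuitOpenError`, `run_with_fallback`, and the injectable `clock` parameter are hypothetical additions, while the threshold of 3 failures and the 60-second timeout come from the integration strategy.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when a call is rejected because the circuit is open."""


class CircuitBreaker:
    def __init__(self, failure_threshold=3, timeout_seconds=60.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.timeout_seconds = timeout_seconds
        self.clock = clock  # injectable so the timeout can be tested without sleeping
        self.state = "CLOSED"
        self.failure_count = 0
        self.opened_at = 0.0

    def call(self, fn, *args):
        if self.state == "OPEN":
            if self.clock() - self.opened_at < self.timeout_seconds:
                raise CircuitOpenError("circuit is open")
            self.state = "HALF_OPEN"  # timeout elapsed: allow one trial call
        try:
            result = fn(*args)
        except Exception:
            self.failure_count += 1
            if self.state == "HALF_OPEN" or self.failure_count >= self.failure_threshold:
                self.state = "OPEN"
                self.opened_at = self.clock()
            raise
        # Success closes the circuit and resets the failure counter.
        self.state = "CLOSED"
        self.failure_count = 0
        return result


def run_with_fallback(providers, breakers, task):
    """Try (name, fn) providers in order; open circuits fail fast and are skipped."""
    last_error = None
    for name, fn in providers:
        try:
            return name, breakers[name].call(fn, task)
        except Exception as exc:
            last_error = exc
    raise RuntimeError("all providers failed") from last_error
```

Keeping one breaker per provider means a failing primary is skipped quickly once its circuit opens, instead of adding its latency to every request.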
401
- ---
402
-
403
- #### Feature 7: Cost Optimization & Budget Management
404
- **Research**: Industry best practices
405
-
406
- **Current State**: TMLPD v2.1 tracks costs but does not enforce budgets
407
-
408
- **Upgrade Path**:
409
- ```python
410
- # New: src/cost/cost_optimizer.py
411
- class CostOptimizer:
412
- """
413
- Optimize provider selection + model choice for cost
414
- """
415
- async def select_for_budget(self, task, budget_cents, quality_required):
416
- # Select model that fits budget and quality
417
- # Estimate cost for task
418
- # Check budget cap
419
- ...
420
-
421
- class BudgetManager:
422
- """
423
- Enforce budgets per team/user
424
- """
425
- async def check_budget(self, user_id, cost_cents):
426
- # Check daily/monthly usage
427
- # Compare to budget
428
- # Return allow/deny
429
- ...
430
-
431
- async def record_usage(self, user_id, cost_cents):
432
- # Log usage for billing
433
- # Track in database
434
- ...
435
- ```
436
-
437
- **Integration Strategy**:
438
- 1. Optional budget enforcement (multi-tenant deployments)
439
- 2. Per-user API keys with quotas
440
- 3. Real-time cost tracking dashboard
441
- 4. Config: `cost.enable_budgets = true`
442
-
443
- **Effort**: 1-2 days
444
- **Value**: ⭐⭐⭐⭐ (critical for enterprise/multi-tenant)
445
- **Files**:
446
- - `src/cost/cost_optimizer.py` (200 lines)
447
- - `src/cost/budget_manager.py` (250 lines)
448
- - `src/cost/usage_tracker.py` (150 lines)
449
-
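A minimal in-memory sketch of the allow/deny check above, assuming a per-user daily budget in integer cents. The storage layout and the optional `day` parameter are hypothetical; a real deployment would persist usage in a database, as the roadmap notes.

```python
from datetime import date


class BudgetManager:
    def __init__(self, default_budget_cents=1000):
        self.default_budget_cents = default_budget_cents
        self.budgets = {}  # user_id -> daily limit in cents (optional overrides)
        self.usage = {}    # (user_id, day) -> cents spent that day

    def check_budget(self, user_id, cost_cents, day=None):
        """Return True if spending `cost_cents` keeps the user within today's limit."""
        day = day or date.today()
        limit = self.budgets.get(user_id, self.default_budget_cents)
        spent = self.usage.get((user_id, day), 0)
        return spent + cost_cents <= limit

    def record_usage(self, user_id, cost_cents, day=None):
        """Add `cost_cents` to the user's usage for the given day."""
        day = day or date.today()
        key = (user_id, day)
        self.usage[key] = self.usage.get(key, 0) + cost_cents


manager = BudgetManager(default_budget_cents=1000)
day_one = date(2025, 1, 1)
if manager.check_budget("alice", 600, day_one):
    manager.record_usage("alice", 600, day_one)
print(manager.check_budget("alice", 500, day_one))
```

Integer cents avoid floating-point drift when many small charges are summed.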
450
- ---
451
-
452
- ## 📈 Performance Projections: v2.1 vs v2.2+
453
-
454
- ### Baseline (TMLPD v2.1)
455
- ```
456
- Cost: $0.86 per 100 tasks (82% savings vs traditional)
457
- Speed: 2-5x parallel execution speedup
458
- Quality: Baseline (same as single provider)
459
- Reliability: 95% uptime (basic retry)
460
- ```
461
-
462
- ### With v2.2 Features (Individually)
463
- ```
464
- Feature Speedup Cost Savings Quality
465
- ───────────────── ─────── ──────────── ──────
466
- HALO Orchestration 1x 0% +19.6%
467
- Universal Routing 1x 40-60% 0%
468
- Speculative Decoding 2-4x 30-40% 0%
469
- Early Exit 1.5x 20-30% 0%
470
- MemoRAG 1x 0% +50%
471
- Circuit Breakers 1x 0% 0% (reliability)
472
- ```
473
-
474
- ### Combined (TMLPD v2.2 Full Stack)
475
- ```
476
- Speed: 4-8x (speculative 3x × early exit 1.5x × parallel 1.5x)
477
- Cost: 92% savings (v2.1 82% + universal routing 50% + speculative 30%)
478
- Quality: +35% (HALO 19.6% + MemoRAG 50% on applicable tasks)
479
- Reliability: 99%+ uptime (circuit breakers + fallback)
480
- ```
481
-
482
- **Example: 100 Tasks**
483
- ```
484
- Traditional (no optimization): $5.00, 120 minutes
485
- TMLPD v2.1: $0.86, 40 minutes (3x faster, 82% cheaper)
486
- TMLPD v2.2: $0.40, 15 minutes (8x faster, 92% cheaper)
487
- ```
488
-
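The combined figures above can be checked with a few lines of arithmetic. This sketch assumes the individual speedups multiply and the cost savings compound on the remaining spend; the roadmap does not state how it combines them, so the compounded value is only a rough cross-check.

```python
# Speed: speculative 3x, early exit 1.5x, parallel 1.5x (assumed multiplicative).
combined_speedup = 3.0 * 1.5 * 1.5

# Cost: each saving applies to what is left after the previous one (assumption).
remaining_fraction = (1 - 0.82) * (1 - 0.50) * (1 - 0.30)
compounded_savings = 1 - remaining_fraction

# The 100-task example: dollars and minutes for each setup.
traditional_cost, v21_cost, v22_cost = 5.00, 0.86, 0.40
traditional_min, v21_min, v22_min = 120, 40, 15

savings_v21 = 1 - v21_cost / traditional_cost
savings_v22 = 1 - v22_cost / traditional_cost
speedup_v21 = traditional_min / v21_min
speedup_v22 = traditional_min / v22_min

print(f"combined speedup: {combined_speedup:.2f}x")
print(f"compounded savings: {compounded_savings:.1%}")
print(f"v2.1: {savings_v21:.1%} cheaper, {speedup_v21:.0f}x faster")
print(f"v2.2: {savings_v22:.1%} cheaper, {speedup_v22:.0f}x faster")
```

Under these assumptions the multiplied speedup (6.75x) falls inside the quoted 4-8x range, and the compounded savings come out slightly above the quoted 92%, so the 100-task example is the more conservative of the two cost figures.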
489
- ---
490
-
491
- ## 🎓 Research Integration Strategy
492
-
493
- ### 1. Paper-to-Code Mapping
494
-
495
- | Paper | Feature | Implementation | Effort |
496
- |-------|---------|----------------|--------|
497
- | arXiv:2505.13516 | HALO Orchestration | `src/orchestration/halo_orchestrator.py` | 3-4 days |
498
- | arXiv:2502.08773 | Universal Router | `src/routing/universal_router.py` | 2-3 days |
499
- | arXiv:2503.00491 | Speculative Decoding | `src/inference/speculative_decoder.py` | 2-3 days |
500
- | arXiv:2504.10724 | Early Exit | `src/inference/adaptive_compute.py` | 1-2 days |
501
- | arXiv:2409.05591 | MemoRAG | `src/memory/memorag_system.py` | 2-3 days |
502
- | ACL 2025 | Response Graph | `src/memory/response_graph.py` | 1 day |
503
-
504
- ### 2. Dependency Graph
505
-
506
- ```
507
- HALO Orchestration (Foundation)
508
-
509
- Universal Router (Requires HALO's task decomposition)
510
-
511
- Speculative Decoding (Can be parallel)
512
-
513
- Early Exit (Stacks with speculative)
514
-
515
- MemoRAG (Independent, can be parallel)
516
-
517
- Circuit Breakers (Required for production)
518
-
519
- Budget Management (Production requirement)
520
- ```
521
-
522
- ### 3. Implementation Order (Critical Path)
523
-
524
- **Week 1-2** (Foundation):
525
- 1. HALO Orchestration (enables better routing)
526
- 2. Universal Router (requires HALO's decomposition)
527
-
528
- **Week 2-3** (Acceleration):
529
- 3. Speculative Decoding (biggest speedup, visible win)
530
- 4. Early Exit (stacks with speculative)
531
-
532
- **Week 3-4** (Memory):
533
- 5. MemoRAG (long-context improvement)
534
-
535
- **Week 4-5** (Reliability):
536
- 6. Circuit Breakers (production safety)
537
- 7. Budget Management (enterprise feature)
538
-
539
- ---
540
-
541
- ## 🔧 Technical Architecture: v2.2+
542
-
543
- ### Unified Agent API (Backward Compatible)
544
-
545
- ```python
546
- from src.tmlpd_agent import TMLPDUnifiedAgent
547
-
548
- async def main():
549
- # v2.1 API (unchanged)
550
- async with TMLPDUnifiedAgent() as agent:
551
- result = await agent.execute({
552
- "description": "Build complete e-commerce platform"
553
- })
554
-
555
- # v2.2+ API (new features opt-in)
556
- async with TMLPDUnifiedAgent(
557
- routing_strategy="universal_learned", # NEW
558
- use_speculative=True, # NEW
559
- use_early_exit=True, # NEW
560
- memory_backend="memorag", # NEW
561
- orchestration_mode="halo" # NEW
562
- ) as agent:
563
- result = await agent.execute({
564
- "description": "Build complete e-commerce platform"
565
- })
566
-
567
- # Metrics
568
- print(f"Speedup: {result['speedup']}x")
569
- print(f"Cost: ${result['cost']:.6f}")
570
- print(f"Quality: +{result['quality_improvement']}%")
571
- print(f"Layers used: {result['layers_used']}/{result['total_layers']}") # Early exit
572
- ```
573
-
574
- ### Configuration File (tmlpd.yaml)
575
-
576
- ```yaml
577
- # TMLPD v2.2+ Configuration
578
- routing:
579
- strategy: universal_learned # NEW | difficulty_aware
580
- quality_target: 0.95
581
- cost_awareness: true
582
-
583
- orchestration:
584
- mode: halo # NEW | orchestrator | chain | parallel
585
- enable_mcts: true # NEW
586
-
587
- inference:
588
- use_speculative: true # NEW
589
- use_early_exit: true # NEW
590
- speculative_window: adaptive # NEW
591
-
592
- memory:
593
- backend: memorag # NEW | three_tier
594
- enable_response_graph: true # NEW
595
-
596
- reliability:
597
- enable_circuit_breaker: true # NEW
598
- failure_threshold: 3
599
- timeout_seconds: 60
600
-
601
- cost:
602
- enable_budgets: false # NEW (for multi-tenant)
603
- default_budget_cents: 1000
604
- ```
605
-
606
- ---
607
-
608
- ## 📊 Competitive Analysis: TMLPD v2.2 vs State-of-the-Art
609
-
610
- ### vs Other Frameworks
611
-
612
- | Feature | LangChain | AutoGPT | CrewAI | Semantic Kernel | **TMLPD v2.2** |
613
- |---------|-----------|---------|--------|-----------------|----------------|
614
- | **Routing** | Manual | Auto | Manual | Auto | ✅ **Universal Learned** |
615
- | **Speed** | 1x | 1x | 1x | 1x | ✅ **4-8x** |
616
- | **Memory** | ❌ | ⚠️ Basic | ⚠️ Basic | ⚠️ Basic | ✅ **MemoRAG + Graph** |
617
- | **Orchestration** | Chain | Auto | Role-based | Auto | ✅ **HALO Hierarchical** |
618
- | **Cost Savings** | 0% | 0% | 0% | 0% | ✅ **92%** |
619
- | **Reliability** | ⚠️ Basic | ⚠️ Basic | ⚠️ Basic | ⚠️ Basic | ✅ **99%+** |
620
- | **Research-Backed** | ❌ | ❌ | ❌ | ⚠️ Some | ✅ **30+ Papers** |
621
-
622
- **Insight**: TMLPD v2.2 would be **uniquely positioned** as the only framework combining:
623
- 1. Learned routing (adapts to new models)
624
- 2. Speculative decoding (2-4x speedup)
625
- 3. Global memory (MemoRAG)
626
- 4. Hierarchical orchestration (HALO)
627
-
628
- This creates a **12-18 month competitive advantage** (time for others to replicate research).
629
-
630
- ### vs Standalone Tools
631
-
632
- | Tool | Purpose | Limitation | TMLPD v2.2 Advantage |
633
- |------|---------|------------|---------------------|
634
- | **RouteLLM** | Learned routing | Framework-specific | ✅ Universal + online learning |
635
- | **vLLM** | Speculative decoding | Inference only | ✅ Integrated full pipeline |
636
- | **LangGraph** | Orchestration | No routing/memory | ✅ HALO + routing + memory |
637
- | **LlamaIndex** | RAG | Simple retrieval | ✅ MemoRAG global memory |
638
- | **SGLang** | Speculative decoding | No orchestration | ✅ Full agent framework |
639
-
640
- **Insight**: TMLPD v2.2 integrates all these capabilities into **one unified framework**, eliminating integration complexity.
641
-
642
- ---
643
-
644
- ## 🎯 Go-to-Market Strategy: v2.2 Launch
645
-
646
- ### Positioning Statement
647
-
648
- **v2.1**: "Production-ready AI agent framework with 82% cost savings"
649
-
650
- **v2.2**: "The first AI agent framework with universal learned routing, speculative decoding, and global memory"
651
-
652
- **Key Messages**:
653
- 1. **4-8x faster** than alternatives (speculative + early exit)
654
- 2. **92% cheaper** than traditional routing
655
- 3. **+35% better quality** (HALO + MemoRAG)
656
- 4. **Self-improving** (learns from execution history)
657
- 5. **Production-ready** (99%+ reliability)
658
-
659
- ### Launch Timeline
660
-
661
- **Month 1**: v2.1 launch (current plan)
662
- - Build initial community
663
- - Gather feedback
664
- - Identify pain points
665
-
666
- **Month 2-3**: v2.2 development (this roadmap)
667
- - Implement Tier 1-2 features (HALO + Universal Router + Speculative)
668
- - Beta testing with early adopters
669
- - Benchmark against v2.1
670
-
671
- **Month 4**: v2.2 public launch
672
- - Major version update announcement
673
- - Research paper publication (optional)
674
- - Conference talks (PyCon, AI conferences)
675
-
676
- ### Content Marketing
677
-
678
- **Blog Posts**:
679
- 1. "We Made TMLPD 4x Faster (Here's How)" - Speculative decoding
680
- 2. "Why Universal Routing Beats Heuristics" - Learned routing
681
- 3. "The Memory System That Remembers Everything" - MemoRAG
682
- 4. "From 82% to 92% Cost Savings" - v2.1 → v2.2 journey
683
-
684
- **Case Studies**:
685
- 1. "Startup X Saved $10K/month with TMLPD v2.2"
686
- 2. "Enterprise Y Achieved 99%+ Uptime with Circuit Breakers"
687
- 3. "Research Lab Z Improved Results 35% with HALO"
688
-
689
- **Research Content**:
690
- 1. "Implementing HALO: Lessons Learned" - Technical deep dive
691
- 2. "Benchmark: Speculative Decoding in Production" - Real-world data
692
- 3. "The Future of AI Agent Frameworks" - Vision paper
693
-
694
- ---
695
-
696
- ## 💡 Innovation Opportunities Beyond v2.2
697
-
698
- ### Future Research Directions (2025-2026)
699
-
700
- 1. **Multi-Modal Agents** (arXiv:2501.xxxxx)
701
- - Vision + Language + Audio
702
- - Cross-modal reasoning
703
-
704
- 2. **Reinforcement Learning from AI Feedback** (RLAIF)
705
- - Learn from user interactions
706
- - Continuous improvement
707
-
708
- 3. **Distributed Agent Execution**
709
- - Run agents across multiple machines
710
- - Edge computing + cloud hybrid
711
-
712
- 4. **Explainable Orchestration**
713
- - Why did the agent choose this path?
714
- - Debugging complex workflows
715
-
716
- 5. **Agent-to-Agent Communication**
717
- - Standardized protocols
718
- - Swarm intelligence
719
-
720
- ---
721
-
722
- ## ✅ Conclusion
723
-
724
- ### The Opportunity
725
-
726
- TMLPD v2.1 is a solid foundation, but v2.2+ with these research-backed features would be **truly state-of-the-art**:
727
-
728
- 1. **Unmatched Performance**: 4-8x faster, 92% cheaper
729
- 2. **Superior Quality**: +35% improvement on complex tasks
730
- 3. **Production-Ready**: 99%+ reliability
731
- 4. **Future-Proof**: Learns and adapts automatically
732
-
733
- ### The Strategy
734
-
735
- 1. **Launch v2.1 first** (current plan) - Build community, gather feedback
736
- 2. **Develop v2.2 in parallel** (5-week sprint) - Research-backed features
737
- 3. **Launch v2.2 as major upgrade** - Establish leadership position
738
- 4. **Continuously innovate** - Stay ahead of competition
739
-
740
- ### The Competitive Moat
741
-
742
- By the time competitors replicate these features (12-18 months), TMLPD v2.3+ will be even further ahead with:
743
- - Multi-modal capabilities
744
- - Reinforcement learning
745
- - Distributed execution
746
- - Explainable AI
747
-
748
- **This creates a sustainable competitive advantage** through continuous research integration.
749
-
750
- ---
751
-
752
- **Next Step**: Begin v2.1 launch while starting v2.2 development (HALO + Universal Router in Week 1-2).
753
-
754
- **Ready to build the future of AI agent frameworks?** 🚀