@earendil-works/pi-ai 0.79.10 → 0.80.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (531) hide show
  1. package/README.md +646 -521
  2. package/dist/api/anthropic-messages.d.ts +71 -0
  3. package/dist/api/anthropic-messages.d.ts.map +1 -0
  4. package/dist/api/anthropic-messages.js +963 -0
  5. package/dist/api/anthropic-messages.js.map +1 -0
  6. package/dist/api/anthropic-messages.lazy.d.ts +3 -0
  7. package/dist/api/anthropic-messages.lazy.d.ts.map +1 -0
  8. package/dist/api/anthropic-messages.lazy.js +3 -0
  9. package/dist/api/anthropic-messages.lazy.js.map +1 -0
  10. package/dist/api/azure-openai-responses.d.ts +15 -0
  11. package/dist/api/azure-openai-responses.d.ts.map +1 -0
  12. package/dist/api/azure-openai-responses.js +225 -0
  13. package/dist/api/azure-openai-responses.js.map +1 -0
  14. package/dist/api/azure-openai-responses.lazy.d.ts +3 -0
  15. package/dist/api/azure-openai-responses.lazy.d.ts.map +1 -0
  16. package/dist/api/azure-openai-responses.lazy.js +3 -0
  17. package/dist/api/azure-openai-responses.lazy.js.map +1 -0
  18. package/dist/api/bedrock-converse-stream.d.ts +38 -0
  19. package/dist/api/bedrock-converse-stream.d.ts.map +1 -0
  20. package/dist/api/bedrock-converse-stream.js +863 -0
  21. package/dist/api/bedrock-converse-stream.js.map +1 -0
  22. package/dist/api/bedrock-converse-stream.lazy.d.ts +9 -0
  23. package/dist/api/bedrock-converse-stream.lazy.d.ts.map +1 -0
  24. package/dist/api/bedrock-converse-stream.lazy.js +30 -0
  25. package/dist/api/bedrock-converse-stream.lazy.js.map +1 -0
  26. package/dist/{providers → api}/cloudflare.d.ts +0 -4
  27. package/dist/api/cloudflare.d.ts.map +1 -0
  28. package/dist/{providers → api}/cloudflare.js +0 -18
  29. package/dist/api/cloudflare.js.map +1 -0
  30. package/dist/api/github-copilot-headers.d.ts.map +1 -0
  31. package/dist/api/github-copilot-headers.js.map +1 -0
  32. package/dist/api/google-generative-ai.d.ts +13 -0
  33. package/dist/api/google-generative-ai.d.ts.map +1 -0
  34. package/dist/api/google-generative-ai.js +405 -0
  35. package/dist/api/google-generative-ai.js.map +1 -0
  36. package/dist/api/google-generative-ai.lazy.d.ts +3 -0
  37. package/dist/api/google-generative-ai.lazy.d.ts.map +1 -0
  38. package/dist/api/google-generative-ai.lazy.js +3 -0
  39. package/dist/api/google-generative-ai.lazy.js.map +1 -0
  40. package/dist/api/google-shared.d.ts.map +1 -0
  41. package/dist/api/google-shared.js.map +1 -0
  42. package/dist/api/google-vertex.d.ts +15 -0
  43. package/dist/api/google-vertex.d.ts.map +1 -0
  44. package/dist/api/google-vertex.js +454 -0
  45. package/dist/api/google-vertex.js.map +1 -0
  46. package/dist/api/google-vertex.lazy.d.ts +3 -0
  47. package/dist/api/google-vertex.lazy.d.ts.map +1 -0
  48. package/dist/api/google-vertex.lazy.js +3 -0
  49. package/dist/api/google-vertex.lazy.js.map +1 -0
  50. package/dist/api/lazy.d.ts +15 -0
  51. package/dist/api/lazy.d.ts.map +1 -0
  52. package/dist/api/lazy.js +59 -0
  53. package/dist/api/lazy.js.map +1 -0
  54. package/dist/api/mistral-conversations.d.ts +25 -0
  55. package/dist/api/mistral-conversations.d.ts.map +1 -0
  56. package/dist/api/mistral-conversations.js +555 -0
  57. package/dist/api/mistral-conversations.js.map +1 -0
  58. package/dist/api/mistral-conversations.lazy.d.ts +3 -0
  59. package/dist/api/mistral-conversations.lazy.d.ts.map +1 -0
  60. package/dist/api/mistral-conversations.lazy.js +3 -0
  61. package/dist/api/mistral-conversations.lazy.js.map +1 -0
  62. package/dist/{providers → api}/openai-codex-responses.d.ts +2 -3
  63. package/dist/api/openai-codex-responses.d.ts.map +1 -0
  64. package/dist/{providers → api}/openai-codex-responses.js +66 -45
  65. package/dist/api/openai-codex-responses.js.map +1 -0
  66. package/dist/api/openai-codex-responses.lazy.d.ts +3 -0
  67. package/dist/api/openai-codex-responses.lazy.d.ts.map +1 -0
  68. package/dist/api/openai-codex-responses.lazy.js +3 -0
  69. package/dist/api/openai-codex-responses.lazy.js.map +1 -0
  70. package/dist/{providers → api}/openai-completions.d.ts +2 -3
  71. package/dist/api/openai-completions.d.ts.map +1 -0
  72. package/dist/{providers → api}/openai-completions.js +32 -37
  73. package/dist/api/openai-completions.js.map +1 -0
  74. package/dist/api/openai-completions.lazy.d.ts +3 -0
  75. package/dist/api/openai-completions.lazy.d.ts.map +1 -0
  76. package/dist/api/openai-completions.lazy.js +3 -0
  77. package/dist/api/openai-completions.lazy.js.map +1 -0
  78. package/dist/api/openai-prompt-cache.d.ts.map +1 -0
  79. package/dist/api/openai-prompt-cache.js.map +1 -0
  80. package/dist/api/openai-responses-shared.d.ts.map +1 -0
  81. package/dist/{providers → api}/openai-responses-shared.js +37 -29
  82. package/dist/api/openai-responses-shared.js.map +1 -0
  83. package/dist/{providers → api}/openai-responses.d.ts +2 -3
  84. package/dist/api/openai-responses.d.ts.map +1 -0
  85. package/dist/{providers → api}/openai-responses.js +27 -32
  86. package/dist/api/openai-responses.js.map +1 -0
  87. package/dist/api/openai-responses.lazy.d.ts +3 -0
  88. package/dist/api/openai-responses.lazy.d.ts.map +1 -0
  89. package/dist/api/openai-responses.lazy.js +3 -0
  90. package/dist/api/openai-responses.lazy.js.map +1 -0
  91. package/dist/api/openrouter-images.d.ts +3 -0
  92. package/dist/api/openrouter-images.d.ts.map +1 -0
  93. package/dist/{providers/images/openrouter.js → api/openrouter-images.js} +5 -15
  94. package/dist/api/openrouter-images.js.map +1 -0
  95. package/dist/api/openrouter-images.lazy.d.ts +3 -0
  96. package/dist/api/openrouter-images.lazy.d.ts.map +1 -0
  97. package/dist/api/openrouter-images.lazy.js +4 -0
  98. package/dist/api/openrouter-images.lazy.js.map +1 -0
  99. package/dist/api/simple-options.d.ts.map +1 -0
  100. package/dist/api/simple-options.js.map +1 -0
  101. package/dist/api/transform-messages.d.ts.map +1 -0
  102. package/dist/api/transform-messages.js.map +1 -0
  103. package/dist/auth/context.d.ts +7 -0
  104. package/dist/auth/context.d.ts.map +1 -0
  105. package/dist/auth/context.js +42 -0
  106. package/dist/auth/context.js.map +1 -0
  107. package/dist/auth/credential-store.d.ts +16 -0
  108. package/dist/auth/credential-store.d.ts.map +1 -0
  109. package/dist/auth/credential-store.js +37 -0
  110. package/dist/auth/credential-store.js.map +1 -0
  111. package/dist/auth/helpers.d.ts +20 -0
  112. package/dist/auth/helpers.d.ts.map +1 -0
  113. package/dist/auth/helpers.js +46 -0
  114. package/dist/auth/helpers.js.map +1 -0
  115. package/dist/auth/resolve.d.ts +26 -0
  116. package/dist/auth/resolve.d.ts.map +1 -0
  117. package/dist/auth/resolve.js +101 -0
  118. package/dist/auth/resolve.js.map +1 -0
  119. package/dist/auth/types.d.ts +180 -0
  120. package/dist/auth/types.d.ts.map +1 -0
  121. package/dist/auth/types.js +2 -0
  122. package/dist/auth/types.js.map +1 -0
  123. package/dist/bedrock-provider.d.ts +2 -4
  124. package/dist/bedrock-provider.d.ts.map +1 -1
  125. package/dist/bedrock-provider.js +3 -4
  126. package/dist/bedrock-provider.js.map +1 -1
  127. package/dist/compat.d.ts +65 -0
  128. package/dist/compat.d.ts.map +1 -0
  129. package/dist/compat.js +182 -0
  130. package/dist/compat.js.map +1 -0
  131. package/dist/images-api-registry.d.ts +0 -1
  132. package/dist/images-api-registry.d.ts.map +1 -1
  133. package/dist/images-api-registry.js +0 -3
  134. package/dist/images-api-registry.js.map +1 -1
  135. package/dist/images-models.d.ts +93 -0
  136. package/dist/images-models.d.ts.map +1 -0
  137. package/dist/images-models.js +141 -0
  138. package/dist/images-models.js.map +1 -0
  139. package/dist/images.d.ts +1 -0
  140. package/dist/images.d.ts.map +1 -1
  141. package/dist/images.js +1 -0
  142. package/dist/images.js.map +1 -1
  143. package/dist/index.d.ts +29 -2
  144. package/dist/index.d.ts.map +1 -1
  145. package/dist/index.js +17 -4
  146. package/dist/index.js.map +1 -1
  147. package/dist/legacy-api-aliases.d.ts +42 -0
  148. package/dist/legacy-api-aliases.d.ts.map +1 -0
  149. package/dist/legacy-api-aliases.js +49 -0
  150. package/dist/legacy-api-aliases.js.map +1 -0
  151. package/dist/models.d.ts +133 -9
  152. package/dist/models.d.ts.map +1 -1
  153. package/dist/models.generated.d.ts +1766 -156
  154. package/dist/models.generated.d.ts.map +1 -1
  155. package/dist/models.generated.js +70 -17172
  156. package/dist/models.generated.js.map +1 -1
  157. package/dist/models.js +181 -17
  158. package/dist/models.js.map +1 -1
  159. package/dist/providers/all.d.ts +21 -0
  160. package/dist/providers/all.d.ts.map +1 -0
  161. package/dist/providers/all.js +114 -0
  162. package/dist/providers/all.js.map +1 -0
  163. package/dist/providers/amazon-bedrock.d.ts +2 -38
  164. package/dist/providers/amazon-bedrock.d.ts.map +1 -1
  165. package/dist/providers/amazon-bedrock.js +35 -865
  166. package/dist/providers/amazon-bedrock.js.map +1 -1
  167. package/dist/providers/amazon-bedrock.models.d.ts +1718 -0
  168. package/dist/providers/amazon-bedrock.models.d.ts.map +1 -0
  169. package/dist/providers/amazon-bedrock.models.js +1675 -0
  170. package/dist/providers/amazon-bedrock.models.js.map +1 -0
  171. package/dist/providers/ant-ling.d.ts +3 -0
  172. package/dist/providers/ant-ling.d.ts.map +1 -0
  173. package/dist/providers/ant-ling.js +15 -0
  174. package/dist/providers/ant-ling.js.map +1 -0
  175. package/dist/providers/ant-ling.models.d.ts +86 -0
  176. package/dist/providers/ant-ling.models.d.ts.map +1 -0
  177. package/dist/providers/ant-ling.models.js +60 -0
  178. package/dist/providers/ant-ling.models.js.map +1 -0
  179. package/dist/providers/anthropic.d.ts +2 -71
  180. package/dist/providers/anthropic.d.ts.map +1 -1
  181. package/dist/providers/anthropic.js +17 -973
  182. package/dist/providers/anthropic.js.map +1 -1
  183. package/dist/providers/anthropic.models.d.ts +458 -0
  184. package/dist/providers/anthropic.models.d.ts.map +1 -0
  185. package/dist/providers/anthropic.models.js +439 -0
  186. package/dist/providers/anthropic.models.js.map +1 -0
  187. package/dist/providers/azure-openai-responses.d.ts +2 -15
  188. package/dist/providers/azure-openai-responses.d.ts.map +1 -1
  189. package/dist/providers/azure-openai-responses.js +11 -230
  190. package/dist/providers/azure-openai-responses.js.map +1 -1
  191. package/dist/providers/azure-openai-responses.models.d.ts +804 -0
  192. package/dist/providers/azure-openai-responses.models.d.ts.map +1 -0
  193. package/dist/providers/azure-openai-responses.models.js +743 -0
  194. package/dist/providers/azure-openai-responses.models.js.map +1 -0
  195. package/dist/providers/cerebras.d.ts +3 -0
  196. package/dist/providers/cerebras.d.ts.map +1 -0
  197. package/dist/providers/cerebras.js +15 -0
  198. package/dist/providers/cerebras.js.map +1 -0
  199. package/dist/providers/cerebras.models.d.ts +45 -0
  200. package/dist/providers/cerebras.models.d.ts.map +1 -0
  201. package/dist/providers/cerebras.models.js +41 -0
  202. package/dist/providers/cerebras.models.js.map +1 -0
  203. package/dist/providers/cloudflare-ai-gateway.d.ts +3 -0
  204. package/dist/providers/cloudflare-ai-gateway.d.ts.map +1 -0
  205. package/dist/providers/cloudflare-ai-gateway.js +20 -0
  206. package/dist/providers/cloudflare-ai-gateway.js.map +1 -0
  207. package/dist/providers/cloudflare-ai-gateway.models.d.ts +765 -0
  208. package/dist/providers/cloudflare-ai-gateway.models.d.ts.map +1 -0
  209. package/dist/providers/cloudflare-ai-gateway.models.js +666 -0
  210. package/dist/providers/cloudflare-ai-gateway.models.js.map +1 -0
  211. package/dist/providers/cloudflare-auth.d.ts +4 -0
  212. package/dist/providers/cloudflare-auth.d.ts.map +1 -0
  213. package/dist/providers/cloudflare-auth.js +85 -0
  214. package/dist/providers/cloudflare-auth.js.map +1 -0
  215. package/dist/providers/cloudflare-workers-ai.d.ts +3 -0
  216. package/dist/providers/cloudflare-workers-ai.d.ts.map +1 -0
  217. package/dist/providers/cloudflare-workers-ai.js +14 -0
  218. package/dist/providers/cloudflare-workers-ai.js.map +1 -0
  219. package/dist/providers/cloudflare-workers-ai.models.d.ts +302 -0
  220. package/dist/providers/cloudflare-workers-ai.models.d.ts.map +1 -0
  221. package/dist/providers/cloudflare-workers-ai.models.js +239 -0
  222. package/dist/providers/cloudflare-workers-ai.models.js.map +1 -0
  223. package/dist/providers/deepseek.d.ts +3 -0
  224. package/dist/providers/deepseek.d.ts.map +1 -0
  225. package/dist/providers/deepseek.js +15 -0
  226. package/dist/providers/deepseek.js.map +1 -0
  227. package/dist/providers/deepseek.models.d.ts +63 -0
  228. package/dist/providers/deepseek.models.d.ts.map +1 -0
  229. package/dist/providers/deepseek.models.js +43 -0
  230. package/dist/providers/deepseek.models.js.map +1 -0
  231. package/dist/providers/faux.d.ts +43 -2
  232. package/dist/providers/faux.d.ts.map +1 -1
  233. package/dist/providers/faux.js +34 -7
  234. package/dist/providers/faux.js.map +1 -1
  235. package/dist/providers/fireworks.d.ts +3 -0
  236. package/dist/providers/fireworks.d.ts.map +1 -0
  237. package/dist/providers/fireworks.js +19 -0
  238. package/dist/providers/fireworks.js.map +1 -0
  239. package/dist/providers/fireworks.models.d.ts +353 -0
  240. package/dist/providers/fireworks.models.d.ts.map +1 -0
  241. package/dist/providers/fireworks.models.js +276 -0
  242. package/dist/providers/fireworks.models.js.map +1 -0
  243. package/dist/providers/github-copilot.d.ts +3 -0
  244. package/dist/providers/github-copilot.d.ts.map +1 -0
  245. package/dist/providers/github-copilot.js +25 -0
  246. package/dist/providers/github-copilot.js.map +1 -0
  247. package/dist/providers/github-copilot.models.d.ts +616 -0
  248. package/dist/providers/github-copilot.models.d.ts.map +1 -0
  249. package/dist/providers/github-copilot.models.js +426 -0
  250. package/dist/providers/github-copilot.models.js.map +1 -0
  251. package/dist/providers/google-vertex.d.ts +2 -15
  252. package/dist/providers/google-vertex.d.ts.map +1 -1
  253. package/dist/providers/google-vertex.js +30 -455
  254. package/dist/providers/google-vertex.js.map +1 -1
  255. package/dist/providers/google-vertex.models.d.ts +202 -0
  256. package/dist/providers/google-vertex.models.d.ts.map +1 -0
  257. package/dist/providers/google-vertex.models.js +182 -0
  258. package/dist/providers/google-vertex.models.js.map +1 -0
  259. package/dist/providers/google.d.ts +2 -13
  260. package/dist/providers/google.d.ts.map +1 -1
  261. package/dist/providers/google.js +12 -408
  262. package/dist/providers/google.js.map +1 -1
  263. package/dist/providers/google.models.d.ts +328 -0
  264. package/dist/providers/google.models.d.ts.map +1 -0
  265. package/dist/providers/google.models.js +288 -0
  266. package/dist/providers/google.models.js.map +1 -0
  267. package/dist/providers/groq.d.ts +3 -0
  268. package/dist/providers/groq.d.ts.map +1 -0
  269. package/dist/providers/groq.js +15 -0
  270. package/dist/providers/groq.js.map +1 -0
  271. package/dist/providers/groq.models.d.ts +128 -0
  272. package/dist/providers/groq.models.d.ts.map +1 -0
  273. package/dist/providers/groq.models.js +125 -0
  274. package/dist/providers/groq.models.js.map +1 -0
  275. package/dist/providers/huggingface.d.ts +3 -0
  276. package/dist/providers/huggingface.d.ts.map +1 -0
  277. package/dist/providers/huggingface.js +15 -0
  278. package/dist/providers/huggingface.js.map +1 -0
  279. package/dist/providers/huggingface.models.d.ts +883 -0
  280. package/dist/providers/huggingface.models.d.ts.map +1 -0
  281. package/dist/providers/huggingface.models.js +797 -0
  282. package/dist/providers/huggingface.models.js.map +1 -0
  283. package/dist/providers/images/{openrouter.d.ts → register-builtins.d.ts} +2 -2
  284. package/dist/providers/images/register-builtins.d.ts.map +1 -0
  285. package/dist/providers/images/register-builtins.js +34 -0
  286. package/dist/providers/images/register-builtins.js.map +1 -0
  287. package/dist/providers/kimi-coding.d.ts +3 -0
  288. package/dist/providers/kimi-coding.d.ts.map +1 -0
  289. package/dist/providers/kimi-coding.js +15 -0
  290. package/dist/providers/kimi-coding.js.map +1 -0
  291. package/dist/providers/kimi-coding.models.d.ts +63 -0
  292. package/dist/providers/kimi-coding.models.d.ts.map +1 -0
  293. package/dist/providers/kimi-coding.models.js +59 -0
  294. package/dist/providers/kimi-coding.models.js.map +1 -0
  295. package/dist/providers/minimax-cn.d.ts +3 -0
  296. package/dist/providers/minimax-cn.d.ts.map +1 -0
  297. package/dist/providers/minimax-cn.js +15 -0
  298. package/dist/providers/minimax-cn.js.map +1 -0
  299. package/dist/providers/minimax-cn.models.d.ts +54 -0
  300. package/dist/providers/minimax-cn.models.d.ts.map +1 -0
  301. package/dist/providers/minimax-cn.models.js +56 -0
  302. package/dist/providers/minimax-cn.models.js.map +1 -0
  303. package/dist/providers/minimax.d.ts +3 -0
  304. package/dist/providers/minimax.d.ts.map +1 -0
  305. package/dist/providers/minimax.js +15 -0
  306. package/dist/providers/minimax.js.map +1 -0
  307. package/dist/providers/minimax.models.d.ts +54 -0
  308. package/dist/providers/minimax.models.d.ts.map +1 -0
  309. package/dist/providers/minimax.models.js +56 -0
  310. package/dist/providers/minimax.models.js.map +1 -0
  311. package/dist/providers/mistral.d.ts +2 -25
  312. package/dist/providers/mistral.d.ts.map +1 -1
  313. package/dist/providers/mistral.js +12 -560
  314. package/dist/providers/mistral.js.map +1 -1
  315. package/dist/providers/mistral.models.d.ts +513 -0
  316. package/dist/providers/mistral.models.d.ts.map +1 -0
  317. package/dist/providers/mistral.models.js +515 -0
  318. package/dist/providers/mistral.models.js.map +1 -0
  319. package/dist/providers/moonshotai-cn.d.ts +3 -0
  320. package/dist/providers/moonshotai-cn.d.ts.map +1 -0
  321. package/dist/providers/moonshotai-cn.js +15 -0
  322. package/dist/providers/moonshotai-cn.js.map +1 -0
  323. package/dist/providers/moonshotai-cn.models.d.ts +234 -0
  324. package/dist/providers/moonshotai-cn.models.d.ts.map +1 -0
  325. package/dist/providers/moonshotai-cn.models.js +169 -0
  326. package/dist/providers/moonshotai-cn.models.js.map +1 -0
  327. package/dist/providers/moonshotai.d.ts +3 -0
  328. package/dist/providers/moonshotai.d.ts.map +1 -0
  329. package/dist/providers/moonshotai.js +15 -0
  330. package/dist/providers/moonshotai.js.map +1 -0
  331. package/dist/providers/moonshotai.models.d.ts +234 -0
  332. package/dist/providers/moonshotai.models.d.ts.map +1 -0
  333. package/dist/providers/moonshotai.models.js +169 -0
  334. package/dist/providers/moonshotai.models.js.map +1 -0
  335. package/dist/providers/nvidia.d.ts +3 -0
  336. package/dist/providers/nvidia.d.ts.map +1 -0
  337. package/dist/providers/nvidia.js +15 -0
  338. package/dist/providers/nvidia.js.map +1 -0
  339. package/dist/providers/nvidia.models.d.ts +535 -0
  340. package/dist/providers/nvidia.models.d.ts.map +1 -0
  341. package/dist/providers/nvidia.models.js +366 -0
  342. package/dist/providers/nvidia.models.js.map +1 -0
  343. package/dist/providers/openai-codex.d.ts +3 -0
  344. package/dist/providers/openai-codex.d.ts.map +1 -0
  345. package/dist/providers/openai-codex.js +18 -0
  346. package/dist/providers/openai-codex.js.map +1 -0
  347. package/dist/providers/openai-codex.models.d.ts +87 -0
  348. package/dist/providers/openai-codex.models.d.ts.map +1 -0
  349. package/dist/providers/openai-codex.models.js +77 -0
  350. package/dist/providers/openai-codex.models.js.map +1 -0
  351. package/dist/providers/openai.d.ts +3 -0
  352. package/dist/providers/openai.d.ts.map +1 -0
  353. package/dist/providers/openai.js +15 -0
  354. package/dist/providers/openai.js.map +1 -0
  355. package/dist/providers/openai.models.d.ts +805 -0
  356. package/dist/providers/openai.models.d.ts.map +1 -0
  357. package/dist/providers/openai.models.js +743 -0
  358. package/dist/providers/openai.models.js.map +1 -0
  359. package/dist/providers/opencode-go.d.ts +3 -0
  360. package/dist/providers/opencode-go.d.ts.map +1 -0
  361. package/dist/providers/opencode-go.js +18 -0
  362. package/dist/providers/opencode-go.js.map +1 -0
  363. package/dist/providers/opencode-go.models.d.ts +309 -0
  364. package/dist/providers/opencode-go.models.d.ts.map +1 -0
  365. package/dist/providers/opencode-go.models.js +240 -0
  366. package/dist/providers/opencode-go.models.js.map +1 -0
  367. package/dist/providers/opencode.d.ts +3 -0
  368. package/dist/providers/opencode.d.ts.map +1 -0
  369. package/dist/providers/opencode.js +22 -0
  370. package/dist/providers/opencode.js.map +1 -0
  371. package/dist/providers/opencode.models.d.ts +976 -0
  372. package/dist/providers/opencode.models.d.ts.map +1 -0
  373. package/dist/providers/opencode.models.js +815 -0
  374. package/dist/providers/opencode.models.js.map +1 -0
  375. package/dist/providers/openrouter-images.d.ts +3 -0
  376. package/dist/providers/openrouter-images.d.ts.map +1 -0
  377. package/dist/providers/openrouter-images.js +14 -0
  378. package/dist/providers/openrouter-images.js.map +1 -0
  379. package/dist/providers/openrouter.d.ts +3 -0
  380. package/dist/providers/openrouter.d.ts.map +1 -0
  381. package/dist/providers/openrouter.js +15 -0
  382. package/dist/providers/openrouter.js.map +1 -0
  383. package/dist/providers/openrouter.models.d.ts +5405 -0
  384. package/dist/providers/openrouter.models.d.ts.map +1 -0
  385. package/dist/providers/openrouter.models.js +4635 -0
  386. package/dist/providers/openrouter.models.js.map +1 -0
  387. package/dist/providers/together.d.ts +3 -0
  388. package/dist/providers/together.d.ts.map +1 -0
  389. package/dist/providers/together.js +15 -0
  390. package/dist/providers/together.js.map +1 -0
  391. package/dist/providers/together.models.d.ts +567 -0
  392. package/dist/providers/together.models.d.ts.map +1 -0
  393. package/dist/providers/together.models.js +361 -0
  394. package/dist/providers/together.models.js.map +1 -0
  395. package/dist/providers/vercel-ai-gateway.d.ts +3 -0
  396. package/dist/providers/vercel-ai-gateway.d.ts.map +1 -0
  397. package/dist/providers/vercel-ai-gateway.js +15 -0
  398. package/dist/providers/vercel-ai-gateway.js.map +1 -0
  399. package/dist/providers/vercel-ai-gateway.models.d.ts +2938 -0
  400. package/dist/providers/vercel-ai-gateway.models.d.ts.map +1 -0
  401. package/dist/providers/vercel-ai-gateway.models.js +2897 -0
  402. package/dist/providers/vercel-ai-gateway.models.js.map +1 -0
  403. package/dist/providers/xai.d.ts +3 -0
  404. package/dist/providers/xai.d.ts.map +1 -0
  405. package/dist/providers/xai.js +15 -0
  406. package/dist/providers/xai.js.map +1 -0
  407. package/dist/providers/xai.models.d.ts +157 -0
  408. package/dist/providers/xai.models.d.ts.map +1 -0
  409. package/dist/providers/xai.models.js +131 -0
  410. package/dist/providers/xai.models.js.map +1 -0
  411. package/dist/providers/xiaomi-token-plan-ams.d.ts +3 -0
  412. package/dist/providers/xiaomi-token-plan-ams.d.ts.map +1 -0
  413. package/dist/providers/xiaomi-token-plan-ams.js +15 -0
  414. package/dist/providers/xiaomi-token-plan-ams.js.map +1 -0
  415. package/dist/providers/xiaomi-token-plan-ams.models.d.ts +108 -0
  416. package/dist/providers/xiaomi-token-plan-ams.models.d.ts.map +1 -0
  417. package/dist/providers/xiaomi-token-plan-ams.models.js +95 -0
  418. package/dist/providers/xiaomi-token-plan-ams.models.js.map +1 -0
  419. package/dist/providers/xiaomi-token-plan-cn.d.ts +3 -0
  420. package/dist/providers/xiaomi-token-plan-cn.d.ts.map +1 -0
  421. package/dist/providers/xiaomi-token-plan-cn.js +15 -0
  422. package/dist/providers/xiaomi-token-plan-cn.js.map +1 -0
  423. package/dist/providers/xiaomi-token-plan-cn.models.d.ts +108 -0
  424. package/dist/providers/xiaomi-token-plan-cn.models.d.ts.map +1 -0
  425. package/dist/providers/xiaomi-token-plan-cn.models.js +95 -0
  426. package/dist/providers/xiaomi-token-plan-cn.models.js.map +1 -0
  427. package/dist/providers/xiaomi-token-plan-sgp.d.ts +3 -0
  428. package/dist/providers/xiaomi-token-plan-sgp.d.ts.map +1 -0
  429. package/dist/providers/xiaomi-token-plan-sgp.js +15 -0
  430. package/dist/providers/xiaomi-token-plan-sgp.js.map +1 -0
  431. package/dist/providers/xiaomi-token-plan-sgp.models.d.ts +108 -0
  432. package/dist/providers/xiaomi-token-plan-sgp.models.d.ts.map +1 -0
  433. package/dist/providers/xiaomi-token-plan-sgp.models.js +95 -0
  434. package/dist/providers/xiaomi-token-plan-sgp.models.js.map +1 -0
  435. package/dist/providers/xiaomi.d.ts +3 -0
  436. package/dist/providers/xiaomi.d.ts.map +1 -0
  437. package/dist/providers/xiaomi.js +15 -0
  438. package/dist/providers/xiaomi.js.map +1 -0
  439. package/dist/providers/xiaomi.models.d.ts +129 -0
  440. package/dist/providers/xiaomi.models.d.ts.map +1 -0
  441. package/dist/providers/xiaomi.models.js +113 -0
  442. package/dist/providers/xiaomi.models.js.map +1 -0
  443. package/dist/providers/zai-coding-cn.d.ts +3 -0
  444. package/dist/providers/zai-coding-cn.d.ts.map +1 -0
  445. package/dist/providers/zai-coding-cn.js +15 -0
  446. package/dist/providers/zai-coding-cn.js.map +1 -0
  447. package/dist/providers/zai-coding-cn.models.d.ts +153 -0
  448. package/dist/providers/zai-coding-cn.models.d.ts.map +1 -0
  449. package/dist/providers/zai-coding-cn.models.js +114 -0
  450. package/dist/providers/zai-coding-cn.models.js.map +1 -0
  451. package/dist/providers/zai.d.ts +3 -0
  452. package/dist/providers/zai.d.ts.map +1 -0
  453. package/dist/providers/zai.js +15 -0
  454. package/dist/providers/zai.js.map +1 -0
  455. package/dist/providers/zai.models.d.ts +153 -0
  456. package/dist/providers/zai.models.d.ts.map +1 -0
  457. package/dist/providers/zai.models.js +114 -0
  458. package/dist/providers/zai.models.js.map +1 -0
  459. package/dist/types.d.ts +67 -8
  460. package/dist/types.d.ts.map +1 -1
  461. package/dist/types.js.map +1 -1
  462. package/dist/utils/headers.d.ts +2 -0
  463. package/dist/utils/headers.d.ts.map +1 -1
  464. package/dist/utils/headers.js +10 -0
  465. package/dist/utils/headers.js.map +1 -1
  466. package/dist/utils/oauth/anthropic.d.ts +2 -0
  467. package/dist/utils/oauth/anthropic.d.ts.map +1 -1
  468. package/dist/utils/oauth/anthropic.js +31 -0
  469. package/dist/utils/oauth/anthropic.js.map +1 -1
  470. package/dist/utils/oauth/github-copilot.d.ts +2 -0
  471. package/dist/utils/oauth/github-copilot.d.ts.map +1 -1
  472. package/dist/utils/oauth/github-copilot.js +33 -2
  473. package/dist/utils/oauth/github-copilot.js.map +1 -1
  474. package/dist/utils/oauth/load.d.ts +5 -0
  475. package/dist/utils/oauth/load.d.ts.map +1 -0
  476. package/dist/utils/oauth/load.js +22 -0
  477. package/dist/utils/oauth/load.js.map +1 -0
  478. package/dist/utils/oauth/openai-codex.d.ts +2 -0
  479. package/dist/utils/oauth/openai-codex.d.ts.map +1 -1
  480. package/dist/utils/oauth/openai-codex.js +49 -0
  481. package/dist/utils/oauth/openai-codex.js.map +1 -1
  482. package/package.json +15 -42
  483. package/dist/api-registry.d.ts +0 -20
  484. package/dist/api-registry.d.ts.map +0 -1
  485. package/dist/api-registry.js +0 -44
  486. package/dist/api-registry.js.map +0 -1
  487. package/dist/base.d.ts +0 -30
  488. package/dist/base.d.ts.map +0 -1
  489. package/dist/base.js +0 -18
  490. package/dist/base.js.map +0 -1
  491. package/dist/providers/cloudflare.d.ts.map +0 -1
  492. package/dist/providers/cloudflare.js.map +0 -1
  493. package/dist/providers/github-copilot-headers.d.ts.map +0 -1
  494. package/dist/providers/github-copilot-headers.js.map +0 -1
  495. package/dist/providers/google-shared.d.ts.map +0 -1
  496. package/dist/providers/google-shared.js.map +0 -1
  497. package/dist/providers/images/openrouter.d.ts.map +0 -1
  498. package/dist/providers/images/openrouter.js.map +0 -1
  499. package/dist/providers/openai-codex-responses.d.ts.map +0 -1
  500. package/dist/providers/openai-codex-responses.js.map +0 -1
  501. package/dist/providers/openai-completions.d.ts.map +0 -1
  502. package/dist/providers/openai-completions.js.map +0 -1
  503. package/dist/providers/openai-prompt-cache.d.ts.map +0 -1
  504. package/dist/providers/openai-prompt-cache.js.map +0 -1
  505. package/dist/providers/openai-responses-shared.d.ts.map +0 -1
  506. package/dist/providers/openai-responses-shared.js.map +0 -1
  507. package/dist/providers/openai-responses.d.ts.map +0 -1
  508. package/dist/providers/openai-responses.js.map +0 -1
  509. package/dist/providers/register-builtins.d.ts +0 -37
  510. package/dist/providers/register-builtins.d.ts.map +0 -1
  511. package/dist/providers/register-builtins.js +0 -191
  512. package/dist/providers/register-builtins.js.map +0 -1
  513. package/dist/providers/simple-options.d.ts.map +0 -1
  514. package/dist/providers/simple-options.js.map +0 -1
  515. package/dist/providers/transform-messages.d.ts.map +0 -1
  516. package/dist/providers/transform-messages.js.map +0 -1
  517. package/dist/stream.d.ts +0 -6
  518. package/dist/stream.d.ts.map +0 -1
  519. package/dist/stream.js +0 -37
  520. package/dist/stream.js.map +0 -1
  521. /package/dist/{providers → api}/github-copilot-headers.d.ts +0 -0
  522. /package/dist/{providers → api}/github-copilot-headers.js +0 -0
  523. /package/dist/{providers → api}/google-shared.d.ts +0 -0
  524. /package/dist/{providers → api}/google-shared.js +0 -0
  525. /package/dist/{providers → api}/openai-prompt-cache.d.ts +0 -0
  526. /package/dist/{providers → api}/openai-prompt-cache.js +0 -0
  527. /package/dist/{providers → api}/openai-responses-shared.d.ts +0 -0
  528. /package/dist/{providers → api}/simple-options.d.ts +0 -0
  529. /package/dist/{providers → api}/simple-options.js +0 -0
  530. /package/dist/{providers → api}/transform-messages.d.ts +0 -0
  531. /package/dist/{providers → api}/transform-messages.js +0 -0
package/README.md CHANGED
@@ -1,6 +1,6 @@
1
1
  # @earendil-works/pi-ai
2
2
 
3
- Unified LLM API with automatic model discovery, provider configuration, token and cost tracking, and simple context persistence and hand-off to other models mid-session.
3
+ Unified LLM API with provider collections, automatic auth resolution, token and cost tracking, and simple context persistence and hand-off to other models mid-session.
4
4
 
5
5
  **Note**: This library only includes models that support tool calling (function calling), as this is essential for agentic workflows.
6
6
 
@@ -8,8 +8,17 @@ Unified LLM API with automatic model discovery, provider configuration, token an
8
8
 
9
9
  - [Supported Providers](#supported-providers)
10
10
  - [Installation](#installation)
11
- - [Base Entry Point](#base-entry-point)
12
11
  - [Quick Start](#quick-start)
12
+ - [Providers and Models](#providers-and-models)
13
+ - [Provider Factories](#provider-factories)
14
+ - [All Built-in Providers](#all-built-in-providers)
15
+ - [Querying Models](#querying-models)
16
+ - [Static Catalog Reads](#static-catalog-reads)
17
+ - [Dynamic Providers](#dynamic-providers)
18
+ - [Auth](#auth)
19
+ - [How Auth Resolves](#how-auth-resolves)
20
+ - [Credential Store](#credential-store)
21
+ - [Environment Variables](#environment-variables)
13
22
  - [Tools](#tools)
14
23
  - [Defining Tools](#defining-tools)
15
24
  - [Handling Tool Calls](#handling-tool-calls)
@@ -18,8 +27,6 @@ Unified LLM API with automatic model discovery, provider configuration, token an
18
27
  - [Complete Event Reference](#complete-event-reference)
19
28
  - [Image Input](#image-input)
20
29
  - [Image Generation](#image-generation)
21
- - [Basic Image Generation](#basic-image-generation)
22
- - [Notes and Limitations](#notes-and-limitations)
23
30
  - [Thinking/Reasoning](#thinkingreasoning)
24
31
  - [Unified Interface](#unified-interface-streamsimplecompletesimple)
25
32
  - [Provider-Specific Options](#provider-specific-options-streamcomplete)
@@ -28,26 +35,22 @@ Unified LLM API with automatic model discovery, provider configuration, token an
28
35
  - [Error Handling](#error-handling)
29
36
  - [Aborting Requests](#aborting-requests)
30
37
  - [Continuing After Abort](#continuing-after-abort)
31
- - [APIs, Models, and Providers](#apis-models-and-providers)
32
- - [Providers and Models](#providers-and-models)
33
- - [Querying Providers and Models](#querying-providers-and-models)
34
- - [Custom Models](#custom-models)
38
+ - [Debugging Provider Payloads](#debugging-provider-payloads)
39
+ - [Custom Providers](#custom-providers)
40
+ - [createProvider()](#createprovider)
41
+ - [Calling API Implementations Directly](#calling-api-implementations-directly)
35
42
  - [OpenAI Compatibility Settings](#openai-compatibility-settings)
36
- - [Type Safety](#type-safety)
43
+ - [Faux Provider for Tests](#faux-provider-for-tests)
37
44
  - [Cross-Provider Handoffs](#cross-provider-handoffs)
38
45
  - [Context Serialization](#context-serialization)
39
46
  - [Browser Usage](#browser-usage)
40
- - [Browser Compatibility Notes](#browser-compatibility-notes)
41
- - [Environment Variables](#environment-variables-nodejs-only)
42
- - [Provider-Scoped Environment Overrides](#provider-scoped-environment-overrides)
43
- - [Checking Environment Variables](#checking-environment-variables)
47
+ - [Bundling and Tree Shaking](#bundling-and-tree-shaking)
44
48
  - [OAuth Providers](#oauth-providers)
45
49
  - [Vertex AI](#vertex-ai)
46
50
  - [CLI Login](#cli-login)
47
51
  - [Programmatic OAuth](#programmatic-oauth)
48
- - [Login Flow Example](#login-flow-example)
49
- - [Using OAuth Tokens](#using-oauth-tokens)
50
- - [Provider Notes](#provider-notes)
52
+ - [Migrating from the Old Global API](#migrating-from-the-old-global-api)
53
+ - [Development](#development)
51
54
  - [License](#license)
52
55
 
53
56
  ## Supported Providers
@@ -69,16 +72,18 @@ Unified LLM API with automatic model discovery, provider configuration, token an
69
72
  - **xAI**
70
73
  - **OpenRouter**
71
74
  - **Vercel AI Gateway**
72
- - **ZAI** (with separate Coding Plan China provider)
73
- - **MiniMax**
75
+ - **ZAI Coding Plan (Global)** (with separate China provider)
76
+ - **MiniMax** (with separate China provider)
74
77
  - **Together AI**
78
+ - **Hugging Face**
79
+ - **Moonshot AI** (with separate China provider)
75
80
  - **GitHub Copilot** (requires OAuth, see below)
76
81
  - **Amazon Bedrock**
77
82
  - **OpenCode Zen**
78
83
  - **OpenCode Go**
79
- - **Fireworks** (uses Anthropic-compatible API)
80
- - **Kimi For Coding** (Moonshot AI, uses Anthropic-compatible API)
81
- - **Xiaomi MiMo** (uses Anthropic-compatible API; defaults to API billing endpoint, with separate Token Plan providers for `cn`/`ams`/`sgp` regions)
84
+ - **Fireworks** (uses OpenAI- and Anthropic-compatible APIs)
85
+ - **Kimi For Coding** (Moonshot AI subscription endpoint, uses Anthropic-compatible API)
86
+ - **Xiaomi MiMo** (defaults to API billing endpoint, with separate Token Plan providers for `cn`/`ams`/`sgp` regions)
82
87
  - **Any OpenAI-compatible API**: Ollama, vLLM, LM Studio, etc.
83
88
 
84
89
  ## Installation
@@ -89,37 +94,19 @@ npm install @earendil-works/pi-ai
89
94
 
90
95
  TypeBox exports are re-exported from `@earendil-works/pi-ai`: `Type`, `Static`, and `TSchema`.
91
96
 
92
- ## Base Entry Point
93
-
94
- Use `@earendil-works/pi-ai/base` when a bundler should include only explicitly selected transport implementations. The base entry point exposes model discovery, registries, generic dispatch, environment-key helpers, and provider-neutral utilities without registering built-in transports during module initialization. Generic dispatch still resolves configured environment keys when called.
97
+ ## Quick Start
95
98
 
96
- Import each selected transport directly and call its `register()` function:
99
+ You build a `Models` collection of providers and stream through it. The quickest start registers every built-in provider; apps that care about bundle size register individual providers instead (see [Provider Factories](#provider-factories) and [Bundling and Tree Shaking](#bundling-and-tree-shaking)).
97
100
 
98
101
  ```typescript
99
- import { getModel, streamSimple } from '@earendil-works/pi-ai/base';
100
- import { register as registerAnthropic } from '@earendil-works/pi-ai/anthropic';
101
-
102
- registerAnthropic();
103
-
104
- const model = getModel('anthropic', 'claude-sonnet-4-6');
105
- const response = await streamSimple(model, {
106
- messages: [{ role: 'user', content: 'Hello!' }]
107
- }, {
108
- apiKey: process.env.ANTHROPIC_API_KEY
109
- }).result();
110
- ```
111
-
112
- Direct transport imports bundle their implementation and SDK code. Register only the transports used by the application. For example, OpenRouter, Groq, xAI, and DeepSeek chat models share the `@earendil-works/pi-ai/openai-completions` transport.
102
+ import { Type, type Context, type Tool } from '@earendil-works/pi-ai';
103
+ import { builtinModels } from '@earendil-works/pi-ai/providers/all';
113
104
 
114
- Use `@earendil-works/pi-ai` for the batteries-included behavior. The root entry point remains backward-compatible: it registers lazy wrappers for all built-in transports and resolves environment API keys automatically during dispatch.
105
+ // A Models collection with every built-in provider registered
106
+ const models = builtinModels();
115
107
 
116
- ## Quick Start
117
-
118
- ```typescript
119
- import { Type, getModel, stream, complete, Context, Tool, StringEnum } from '@earendil-works/pi-ai';
120
-
121
- // Fully typed with auto-complete support for both providers and models
122
- const model = getModel('openai', 'gpt-4o-mini');
108
+ // Sync lookup against the collection
109
+ const model = models.getModel('openai', 'gpt-4o-mini')!;
123
110
 
124
111
  // Define tools with TypeBox schemas for type safety and validation
125
112
  const tools: Tool[] = [{
@@ -133,12 +120,13 @@ const tools: Tool[] = [{
133
120
  // Build a conversation context (easily serializable and transferable between models)
134
121
  const context: Context = {
135
122
  systemPrompt: 'You are a helpful assistant.',
136
- messages: [{ role: 'user', content: 'What time is it?' }],
123
+ messages: [{ role: 'user', content: 'What time is it?', timestamp: Date.now() }],
137
124
  tools
138
125
  };
139
126
 
140
- // Option 1: Streaming with all event types
141
- const s = stream(model, context);
127
+ // Option 1: Streaming with all event types.
128
+ // Auth resolves through the provider (OPENAI_API_KEY from the environment here).
129
+ const s = models.stream(model, context);
142
130
 
143
131
  for await (const event of s) {
144
132
  switch (event.type) {
@@ -181,7 +169,7 @@ for await (const event of s) {
181
169
  console.log(`\nFinished: ${event.reason}`);
182
170
  break;
183
171
  case 'error':
184
- console.error(`Error: ${event.error}`);
172
+ console.error(`Error: ${event.error.errorMessage}`);
185
173
  break;
186
174
  }
187
175
  }
@@ -193,7 +181,6 @@ context.messages.push(finalMessage);
193
181
  // Handle tool calls if any
194
182
  const toolCalls = finalMessage.content.filter(b => b.type === 'toolCall');
195
183
  for (const call of toolCalls) {
196
- // Execute the tool
197
184
  const result = call.name === 'get_time'
198
185
  ? new Date().toLocaleString('en-US', {
199
186
  timeZone: call.arguments.timezone || 'UTC',
@@ -215,7 +202,7 @@ for (const call of toolCalls) {
215
202
 
216
203
  // Continue if there were tool calls
217
204
  if (toolCalls.length > 0) {
218
- const continuation = await complete(model, context);
205
+ const continuation = await models.complete(model, context);
219
206
  context.messages.push(continuation);
220
207
  console.log('After tool execution:', continuation.content);
221
208
  }
@@ -224,7 +211,7 @@ console.log(`Total tokens: ${finalMessage.usage.input} in, ${finalMessage.usage.
224
211
  console.log(`Cost: $${finalMessage.usage.cost.total.toFixed(4)}`);
225
212
 
226
213
  // Option 2: Get complete response without streaming
227
- const response = await complete(model, context);
214
+ const response = await models.complete(model, context);
228
215
 
229
216
  for (const block of response.content) {
230
217
  if (block.type === 'text') {
@@ -235,6 +222,198 @@ for (const block of response.content) {
235
222
  }
236
223
  ```
237
224
 
225
+ Snippets in the rest of this README assume a `models` collection set up like this (with the relevant providers registered).
226
+
227
+ ## Providers and Models
228
+
229
+ A **provider** is the runtime unit: it owns its model catalog, its auth (API key resolution, OAuth flows), and its stream behavior. A `Models` collection holds providers and routes every request to the provider that owns the model.
230
+
231
+ Providers internally share **API implementations** (the wire protocols): Anthropic models use `anthropic-messages`, OpenAI uses `openai-responses`, while xAI, Groq, Cerebras, OpenRouter, and most others share `openai-completions`. Mixed-API providers (GitHub Copilot, OpenCode Zen) dispatch per model.
232
+
233
+ ### Provider Factories
234
+
235
+ For apps that only need specific providers, there is one factory per built-in provider, each a subpath import that pulls only that provider's catalog:
236
+
237
+ ```typescript
238
+ import { anthropicProvider } from '@earendil-works/pi-ai/providers/anthropic';
239
+ import { openaiProvider } from '@earendil-works/pi-ai/providers/openai';
240
+ import { openrouterProvider } from '@earendil-works/pi-ai/providers/openrouter';
241
+ import { amazonBedrockProvider } from '@earendil-works/pi-ai/providers/amazon-bedrock';
242
+ // ...one module per provider in the Supported Providers list
243
+
244
+ const models = createModels();
245
+ models.setProvider(anthropicProvider());
246
+ models.setProvider(openrouterProvider());
247
+ ```
248
+
249
+ Provider factories import their model catalog and a lazy API wrapper. They do not import other providers. With bundler code splitting, SDK implementations (`@anthropic-ai/sdk`, `openai`, `@google/genai`, etc.) stay in lazy chunks loaded on the first request to a model of that API.
250
+
251
+ ### All Built-in Providers
252
+
253
+ For apps that want everything (as in Quick Start):
254
+
255
+ ```typescript
256
+ import { builtinModels } from '@earendil-works/pi-ai/providers/all';
257
+
258
+ const models = builtinModels(); // a Models collection with every built-in provider registered
259
+ ```
260
+
261
+ This imports all catalogs and every built-in provider factory. It is the heavy, explicit entrypoint. `builtinModels()` accepts the same options as `createModels()` (`credentials`, `authContext`); `builtinProviders()` returns the provider array if you want to register them on your own collection.
262
+
263
+ ### Querying Models
264
+
265
+ Reads are synchronous and return the last-known lists:
266
+
267
+ ```typescript
268
+ const providers = models.getProviders(); // registered Provider objects
269
+ const provider = models.getProvider('anthropic'); // one provider
270
+
271
+ const all = models.getModels(); // every model across providers
272
+ const anthropicModels = models.getModels('anthropic');
273
+ const model = models.getModel('anthropic', 'claude-sonnet-4-5');
274
+
275
+ for (const m of anthropicModels) {
276
+ console.log(`${m.id}: ${m.name}`);
277
+ console.log(` API: ${m.api}`);
278
+ console.log(` Context: ${m.contextWindow} tokens`);
279
+ console.log(` Vision: ${m.input.includes('image')}`);
280
+ console.log(` Reasoning: ${m.reasoning}`);
281
+ }
282
+ ```
283
+
284
+ Dynamically listed models are typed `Model<Api>`. Narrow with the `hasApi()` guard when you need API-specific option typing:
285
+
286
+ ```typescript
287
+ import { hasApi } from '@earendil-works/pi-ai';
288
+
289
+ const m = models.getModel('anthropic', 'claude-sonnet-4-5');
290
+ if (m && hasApi(m, 'anthropic-messages')) {
291
+ // m: Model<'anthropic-messages'> — stream options fully typed
292
+ models.stream(m, context, { thinkingEnabled: true, thinkingBudgetTokens: 2048 });
293
+ }
294
+ ```
295
+
296
+ ### Static Catalog Reads
297
+
298
+ For tooling that wants the generated built-in catalog with full literal typing (provider and model IDs auto-complete), independent of any collection:
299
+
300
+ ```typescript
301
+ import { getBuiltinModel, getBuiltinModels, getBuiltinProviders } from '@earendil-works/pi-ai/providers/all';
302
+
303
+ const model = getBuiltinModel('openai', 'gpt-4o-mini'); // typed Model<'openai-responses'>
304
+ const providers = getBuiltinProviders();
305
+ const anthropic = getBuiltinModels('anthropic');
306
+ ```
307
+
308
+ ### Dynamic Providers
309
+
310
+ Providers may have dynamic model lists (a llama.cpp server, a live OpenRouter listing). Reads stay sync; fetching is an explicit async verb:
311
+
312
+ ```typescript
313
+ // getModels() returns the last-known list (empty before the first refresh)
314
+ await models.refresh('llamacpp'); // fetch one provider's list; rejects on failure
315
+ await models.refresh(); // refresh all providers concurrently, best-effort
316
+ const fresh = models.getModel('llamacpp', 'qwen3-30b');
317
+ ```
318
+
319
+ Static built-in providers are no-ops for `refresh()`. See [createProvider()](#createprovider) for building a dynamic provider.
320
+
321
+ ## Auth
322
+
323
+ Every provider owns its auth: how API keys resolve (stored credentials, environment variables, ambient sources like AWS profiles or gcloud ADC) and, where supported, OAuth login/refresh flows.
324
+
325
+ ### How Auth Resolves
326
+
327
+ When you call `models.stream()`, the collection resolves auth through the owning provider and merges it into the request. Explicit per-request values always win:
328
+
329
+ ```typescript
330
+ // Resolved through the provider (env var, stored credential, OAuth token):
331
+ await models.complete(model, context);
332
+
333
+ // Explicit key wins over anything the provider would resolve:
334
+ await models.complete(model, context, { apiKey: 'sk-explicit' });
335
+ ```
336
+
337
+ You can inspect resolution without making a request — useful for status UIs:
338
+
339
+ ```typescript
340
+ const auth = await models.getAuth(model);
341
+ if (auth) {
342
+ console.log(`configured via ${auth.source}`); // e.g. "ANTHROPIC_API_KEY", "OAuth", "stored credential"
343
+ } else {
344
+ console.log('not configured');
345
+ }
346
+ ```
347
+
348
+ `getAuth()` resolves `undefined` for unconfigured providers and rejects with `ModelsError` when something is actually broken (`"oauth"`: token refresh failed, credential preserved for re-login; `"auth"`: key resolution or credential store failure). Request paths surface the same failures as stream errors.
349
+
350
+ ### Credential Store
351
+
352
+ Stored credentials (API keys entered interactively, OAuth tokens) live in a `CredentialStore` — one type-tagged credential per provider. pi-ai ships an in-memory default; apps inject persistent storage:
353
+
354
+ ```typescript
355
+ import { createModels, type CredentialStore } from '@earendil-works/pi-ai';
356
+
357
+ const models = createModels({ credentials: myFileBackedStore });
358
+ // builtinModels() takes the same options:
359
+ // const models = builtinModels({ credentials: myFileBackedStore });
360
+ ```
361
+
362
+ The contract is small: `read(providerId)`, `modify(providerId, fn)` (the only write path — a serialized read-modify-write), and `delete(providerId)`. OAuth token refresh runs inside `modify`, so concurrent requests and processes cannot double-refresh a rotated token. A stored credential *owns* its provider: environment variables are only consulted when nothing is stored, and a failed refresh never silently falls back to an env key.
363
+
364
+ API-key credentials use the same discriminator as pi's `auth.json` and can carry provider-scoped env/config values:
365
+
366
+ ```typescript
367
+ const credential = {
368
+ type: 'api_key',
369
+ key: '...',
370
+ env: {
371
+ CLOUDFLARE_ACCOUNT_ID: 'account-id',
372
+ CLOUDFLARE_GATEWAY_ID: 'gateway-id'
373
+ }
374
+ } as const;
375
+ ```
376
+
377
+ ### Environment Variables
378
+
379
+ Built-in providers resolve these env vars (Node.js; in browsers pass `apiKey` explicitly):
380
+
381
+ | Provider | Environment Variable(s) |
382
+ |----------|------------------------|
383
+ | OpenAI | `OPENAI_API_KEY` |
384
+ | Ant Ling | `ANT_LING_API_KEY` |
385
+ | Azure OpenAI | `AZURE_OPENAI_API_KEY` + `AZURE_OPENAI_BASE_URL` (e.g. `https://{resource}.openai.azure.com`) or `AZURE_OPENAI_RESOURCE_NAME`. Supports `*.openai.azure.com` and `*.cognitiveservices.azure.com`; root endpoints auto-normalize to `/openai/v1`. Optional: `AZURE_OPENAI_API_VERSION` (default `v1`), `AZURE_OPENAI_DEPLOYMENT_NAME_MAP`. |
386
+ | Anthropic | `ANTHROPIC_API_KEY` or `ANTHROPIC_OAUTH_TOKEN` |
387
+ | DeepSeek | `DEEPSEEK_API_KEY` |
388
+ | NVIDIA NIM | `NVIDIA_API_KEY` |
389
+ | Google | `GEMINI_API_KEY` |
390
+ | Vertex AI | `GOOGLE_CLOUD_API_KEY` or `GOOGLE_CLOUD_PROJECT` (or `GCLOUD_PROJECT`) + `GOOGLE_CLOUD_LOCATION` + ADC |
391
+ | Mistral | `MISTRAL_API_KEY` |
392
+ | Groq | `GROQ_API_KEY` |
393
+ | Cerebras | `CEREBRAS_API_KEY` |
394
+ | Cloudflare AI Gateway | `CLOUDFLARE_API_KEY` + `CLOUDFLARE_ACCOUNT_ID` + `CLOUDFLARE_GATEWAY_ID` |
395
+ | Cloudflare Workers AI | `CLOUDFLARE_API_KEY` + `CLOUDFLARE_ACCOUNT_ID` |
396
+ | xAI | `XAI_API_KEY` |
397
+ | Fireworks | `FIREWORKS_API_KEY` |
398
+ | Together AI | `TOGETHER_API_KEY` |
399
+ | OpenRouter | `OPENROUTER_API_KEY` |
400
+ | Vercel AI Gateway | `AI_GATEWAY_API_KEY` |
401
+ | ZAI Coding Plan (Global) | `ZAI_API_KEY` |
402
+ | ZAI Coding Plan (China) | `ZAI_CODING_CN_API_KEY` |
403
+ | MiniMax (Global) | `MINIMAX_API_KEY` |
404
+ | MiniMax (China) | `MINIMAX_CN_API_KEY` |
405
+ | Moonshot AI / Moonshot AI (China) | `MOONSHOT_API_KEY` |
406
+ | Hugging Face | `HF_TOKEN` |
407
+ | OpenCode Zen / OpenCode Go | `OPENCODE_API_KEY` |
408
+ | Kimi For Coding | `KIMI_API_KEY` |
409
+ | Xiaomi MiMo (API billing) | `XIAOMI_API_KEY` |
410
+ | Xiaomi MiMo Token Plan (China) | `XIAOMI_TOKEN_PLAN_CN_API_KEY` |
411
+ | Xiaomi MiMo Token Plan (Amsterdam) | `XIAOMI_TOKEN_PLAN_AMS_API_KEY` |
412
+ | Xiaomi MiMo Token Plan (Singapore) | `XIAOMI_TOKEN_PLAN_SGP_API_KEY` |
413
+ | GitHub Copilot | `COPILOT_GITHUB_TOKEN` |
414
+
415
+ Amazon Bedrock resolves ambient AWS credentials (`AWS_PROFILE`, access key pairs, `AWS_BEARER_TOKEN_BEDROCK`, ECS task roles, web identity tokens). Vertex AI resolves either an explicit key or gcloud Application Default Credentials plus project/location.
416
+
238
417
  ## Tools
239
418
 
240
419
  Tools enable LLMs to interact with external systems. This library uses TypeBox schemas for type-safe tool definitions with automatic validation using TypeBox's built-in validator and value conversion utilities. TypeBox schemas can be serialized and deserialized as plain JSON, making them ideal for distributed systems.
@@ -242,7 +421,7 @@ Tools enable LLMs to interact with external systems. This library uses TypeBox s
242
421
  ### Defining Tools
243
422
 
244
423
  ```typescript
245
- import { Type, Tool, StringEnum } from '@earendil-works/pi-ai';
424
+ import { Type, type Tool, StringEnum } from '@earendil-works/pi-ai';
246
425
 
247
426
  // Define tool parameters with TypeBox
248
427
  const weatherTool: Tool = {
@@ -277,11 +456,11 @@ Tool results use content blocks and can include both text and images:
277
456
  import { readFileSync } from 'fs';
278
457
 
279
458
  const context: Context = {
280
- messages: [{ role: 'user', content: 'What is the weather in London?' }],
459
+ messages: [{ role: 'user', content: 'What is the weather in London?', timestamp: Date.now() }],
281
460
  tools: [weatherTool]
282
461
  };
283
462
 
284
- const response = await complete(model, context);
463
+ const response = await models.complete(model, context);
285
464
 
286
465
  // Check for tool calls in the response
287
466
  for (const block of response.content) {
@@ -322,7 +501,7 @@ context.messages.push({
322
501
  During streaming, tool call arguments are progressively parsed as they arrive. This enables real-time UI updates before the complete arguments are available:
323
502
 
324
503
  ```typescript
325
- const s = stream(model, context);
504
+ const s = models.stream(model, context);
326
505
 
327
506
  for await (const event of s) {
328
507
  if (event.type === 'toolcall_delta') {
@@ -363,15 +542,13 @@ for await (const event of s) {
363
542
 
364
543
  ### Validating Tool Arguments
365
544
 
366
- When using `agentLoop`, tool arguments are automatically validated against your TypeBox schemas before execution. If validation fails, the error is returned to the model as a tool result, allowing it to retry.
367
-
368
- When implementing your own tool execution loop with `stream()` or `complete()`, use `validateToolCall` to validate arguments before passing them to your tools:
545
+ When implementing your own tool execution loop, use `validateToolCall` to validate arguments before passing them to your tools:
369
546
 
370
547
  ```typescript
371
- import { stream, validateToolCall, Tool } from '@earendil-works/pi-ai';
548
+ import { validateToolCall, type Tool } from '@earendil-works/pi-ai';
372
549
 
373
550
  const tools: Tool[] = [weatherTool, calculatorTool];
374
- const s = stream(model, { messages, tools });
551
+ const s = models.stream(model, { messages, tools });
375
552
 
376
553
  for await (const event of s) {
377
554
  if (event.type === 'toolcall_end') {
@@ -424,9 +601,8 @@ Models with vision capabilities can process images. You can check if a model sup
424
601
 
425
602
  ```typescript
426
603
  import { readFileSync } from 'fs';
427
- import { getModel, complete } from '@earendil-works/pi-ai';
428
604
 
429
- const model = getModel('openai', 'gpt-4o-mini');
605
+ const model = models.getModel('openai', 'gpt-4o-mini')!;
430
606
 
431
607
  // Check if model supports images
432
608
  if (model.input.includes('image')) {
@@ -436,13 +612,14 @@ if (model.input.includes('image')) {
436
612
  const imageBuffer = readFileSync('image.png');
437
613
  const base64Image = imageBuffer.toString('base64');
438
614
 
439
- const response = await complete(model, {
615
+ const response = await models.complete(model, {
440
616
  messages: [{
441
617
  role: 'user',
442
618
  content: [
443
619
  { type: 'text', text: 'What is in this image?' },
444
620
  { type: 'image', data: base64Image, mimeType: 'image/png' }
445
- ]
621
+ ],
622
+ timestamp: Date.now()
446
623
  }]
447
624
  });
448
625
 
@@ -456,21 +633,21 @@ for (const block of response.content) {
456
633
 
457
634
  ## Image Generation
458
635
 
459
- Image generation uses a separate API surface from text/chat generation. Use `getImageModel()` / `getImageModels()` / `getImageProviders()` to discover image-generation models, and `generateImages()` to get the final result.
460
-
461
- Do not use `stream()` or `complete()` for image generation. Image generation is a one-shot API: `generateImages()` waits for the provider response and returns the final `AssistantImages` result.
636
+ Image generation uses a separate API surface from text/chat generation, mirroring the chat-side design: an `ImagesModels` collection holds `ImagesProvider`s, reads are sync, and auth resolves through the owning provider. Image generation is a one-shot API: `generateImages()` waits for the provider response and returns the final `AssistantImages` result — do not use the chat/stream APIs for it.
462
637
 
463
638
  ### Basic Image Generation
464
639
 
465
640
  ```typescript
466
- import { getImageModel, generateImages } from '@earendil-works/pi-ai';
641
+ import { builtinImagesModels } from '@earendil-works/pi-ai/providers/all';
467
642
 
468
- const model = getImageModel('openrouter', 'google/gemini-2.5-flash-image');
643
+ // Every built-in image-generation provider; accepts the same options as createModels()
644
+ const imagesModels = builtinImagesModels();
469
645
 
470
- const result = await generateImages(model, {
646
+ const model = imagesModels.getModel('openrouter', 'google/gemini-2.5-flash-image')!;
647
+
648
+ // Auth resolves through the provider (OPENROUTER_API_KEY here); explicit apiKey wins
649
+ const result = await imagesModels.generateImages(model, {
471
650
  input: [{ type: 'text', text: 'Generate a red circle on a plain white background.' }]
472
- }, {
473
- apiKey: process.env.OPENROUTER_API_KEY
474
651
  });
475
652
 
476
653
  for (const block of result.output) {
@@ -483,19 +660,32 @@ for (const block of result.output) {
483
660
  }
484
661
  ```
485
662
 
663
+ Like the chat side, you can build the collection from parts: `createImagesModels({ credentials?, authContext? })`, the `openrouterImagesProvider()` factory from `@earendil-works/pi-ai/providers/openrouter-images`, and `createImagesProvider({ id, auth, models, refreshModels?, api })` for custom image providers (with `imagesModels.refresh(provider?)` for dynamic lists). Failures never reject — they return an `AssistantImages` with `stopReason: "error"`. The collection's `getAuth(model)` works exactly like the chat-side one.
664
+
665
+ The old global API (`getImageModel()` / `getImageModels()` / `getImageProviders()` / `generateImages()`) remains available on the [compat entrypoint](#migrating-from-the-old-global-api):
666
+
667
+ ```typescript
668
+ import { getImageModel, generateImages } from '@earendil-works/pi-ai/compat';
669
+
670
+ const model = getImageModel('openrouter', 'google/gemini-2.5-flash-image');
671
+ const result = await generateImages(model, {
672
+ input: [{ type: 'text', text: 'Generate a red circle on a plain white background.' }]
673
+ }, {
674
+ apiKey: process.env.OPENROUTER_API_KEY
675
+ });
676
+ ```
677
+
486
678
  Some models also support image input:
487
679
 
488
680
  ```typescript
489
681
  import { readFileSync } from 'fs';
490
682
 
491
683
  const imageBuffer = readFileSync('input.png');
492
- const result = await generateImages(model, {
684
+ const result = await imagesModels.generateImages(model, {
493
685
  input: [
494
686
  { type: 'text', text: 'Create a variation of this image with a blue background.' },
495
687
  { type: 'image', data: imageBuffer.toString('base64'), mimeType: 'image/png' }
496
688
  ]
497
- }, {
498
- apiKey: process.env.OPENROUTER_API_KEY
499
689
  });
500
690
  ```
501
691
 
@@ -508,14 +698,14 @@ console.log(model.output); // ['image'] or ['image', 'text']
508
698
 
509
699
  ### Notes and Limitations
510
700
 
511
- - Use `getImageModel(...)`, not `getModel(...)`.
512
- - Use `generateImages()`, not `stream()` / `complete()`.
701
+ - Image models live in `ImagesModels` collections, chat models in `Models` collections; the two are separate surfaces.
702
+ - Use `generateImages()`, not the chat/stream APIs.
513
703
  - Image-generation models do not participate in tool calling.
514
704
  - Outputs are returned in `AssistantImages.output` and can include both base64-encoded `ImageContent` blocks and `TextContent` blocks.
515
705
  - Some models return only images, others return images plus text. Check `model.output`.
516
706
  - Some models accept image input, others are text-to-image only. Check `model.input`.
517
707
  - Like the streaming APIs, image generation supports options such as `apiKey`, `signal`, `headers`, `onPayload`, and `onResponse`, and results may include `stopReason`, `responseId`, and `usage`.
518
- - If you want a model to analyze images in a conversation or call tools, use the regular `stream()` / `complete()` APIs with a model that supports image input.
708
+ - If you want a model to analyze images in a conversation or call tools, use the regular chat APIs with a model that supports image input.
519
709
  - At the moment, image generation is available through only one provider, OpenRouter.
520
710
 
521
711
  ## Thinking/Reasoning
@@ -525,16 +715,11 @@ Many models support thinking/reasoning capabilities where they can show their in
525
715
  ### Unified Interface (streamSimple/completeSimple)
526
716
 
527
717
  ```typescript
528
- import { getModel, streamSimple, completeSimple } from '@earendil-works/pi-ai';
529
-
530
718
  // Many models across providers support thinking/reasoning
531
- const model = getModel('anthropic', 'claude-sonnet-4-20250514');
532
- // or getModel('openai', 'gpt-5-mini');
533
- // or getModel('google', 'gemini-2.5-flash');
534
- // or getModel('xai', 'grok-code-fast-1');
535
- // or getModel('groq', 'openai/gpt-oss-20b');
536
- // or getModel('cerebras', 'gpt-oss-120b');
537
- // or getModel('openrouter', 'z-ai/glm-4.5v');
719
+ const model = models.getModel('anthropic', 'claude-sonnet-4-5')!;
720
+ // or models.getModel('openai', 'gpt-5-mini');
721
+ // or models.getModel('google', 'gemini-2.5-flash');
722
+ // or models.getModel('xai', 'grok-code-fast-1');
538
723
 
539
724
  // Check if model supports reasoning
540
725
  if (model.reasoning) {
@@ -542,8 +727,8 @@ if (model.reasoning) {
542
727
  }
543
728
 
544
729
  // Use the simplified reasoning option
545
- const response = await completeSimple(model, {
546
- messages: [{ role: 'user', content: 'Solve: 2x + 5 = 13' }]
730
+ const response = await models.completeSimple(model, {
731
+ messages: [{ role: 'user', content: 'Solve: 2x + 5 = 13', timestamp: Date.now() }]
547
732
  }, {
548
733
  reasoning: 'medium' // 'minimal' | 'low' | 'medium' | 'high' | 'xhigh'
549
734
  });
@@ -560,33 +745,39 @@ for (const block of response.content) {
560
745
 
561
746
  ### Provider-Specific Options (stream/complete)
562
747
 
563
- For fine-grained control, use the provider-specific options:
748
+ `models.stream()`/`complete()` accept the owning API's full option set. Use `hasApi()` to narrow a dynamically looked-up model to its API for full option typing:
564
749
 
565
750
  ```typescript
566
- import { getModel, complete } from '@earendil-works/pi-ai';
751
+ import { hasApi } from '@earendil-works/pi-ai';
567
752
 
568
753
  // OpenAI Reasoning (o1, o3, gpt-5)
569
- const openaiModel = getModel('openai', 'gpt-5-mini');
570
- await complete(openaiModel, context, {
571
- reasoningEffort: 'medium',
572
- reasoningSummary: 'detailed' // OpenAI Responses API only
573
- });
754
+ const openaiModel = models.getModel('openai', 'gpt-5-mini')!;
755
+ if (hasApi(openaiModel, 'openai-responses')) {
756
+ await models.complete(openaiModel, context, {
757
+ reasoningEffort: 'medium',
758
+ reasoningSummary: 'detailed' // OpenAI Responses API only
759
+ });
760
+ }
574
761
 
575
- // Anthropic Thinking (Claude Sonnet 4)
576
- const anthropicModel = getModel('anthropic', 'claude-sonnet-4-20250514');
577
- await complete(anthropicModel, context, {
578
- thinkingEnabled: true,
579
- thinkingBudgetTokens: 8192 // Optional token limit
580
- });
762
+ // Anthropic Thinking
763
+ const anthropicModel = models.getModel('anthropic', 'claude-sonnet-4-5')!;
764
+ if (hasApi(anthropicModel, 'anthropic-messages')) {
765
+ await models.complete(anthropicModel, context, {
766
+ thinkingEnabled: true,
767
+ thinkingBudgetTokens: 8192 // Optional token limit
768
+ });
769
+ }
581
770
 
582
771
  // Google Gemini Thinking
583
- const googleModel = getModel('google', 'gemini-2.5-flash');
584
- await complete(googleModel, context, {
585
- thinking: {
586
- enabled: true,
587
- budgetTokens: 8192 // -1 for dynamic, 0 to disable
588
- }
589
- });
772
+ const googleModel = models.getModel('google', 'gemini-2.5-flash')!;
773
+ if (hasApi(googleModel, 'google-generative-ai')) {
774
+ await models.complete(googleModel, context, {
775
+ thinking: {
776
+ enabled: true,
777
+ budgetTokens: 8192 // -1 for dynamic, 0 to disable
778
+ }
779
+ });
780
+ }
590
781
  ```
591
782
 
592
783
  ### Streaming Thinking Content
@@ -594,7 +785,7 @@ await complete(googleModel, context, {
594
785
  When streaming, thinking content is delivered through specific events:
595
786
 
596
787
  ```typescript
597
- const s = streamSimple(model, context, { reasoning: 'high' });
788
+ const s = models.streamSimple(model, context, { reasoning: 'high' });
598
789
 
599
790
  for await (const event of s) {
600
791
  switch (event.type) {
@@ -625,11 +816,11 @@ Every `AssistantMessage` includes a `stopReason` field that indicates how the ge
625
816
 
626
817
  ## Error Handling
627
818
 
628
- When a request ends with an error (including aborts and tool call validation errors), the streaming API emits an error event:
819
+ Request failures never throw out of the stream functions: when a request ends with an error (including aborts and tool call validation errors), the streaming API emits an error event and the final message carries the details:
629
820
 
630
821
  ```typescript
631
822
  // In streaming
632
- for await (const event of stream) {
823
+ for await (const event of s) {
633
824
  if (event.type === 'error') {
634
825
  // event.reason is either "error" or "aborted"
635
826
  // event.error is the AssistantMessage with partial content
@@ -639,7 +830,7 @@ for await (const event of stream) {
639
830
  }
640
831
 
641
832
  // The final message will have the error details
642
- const message = await stream.result();
833
+ const message = await s.result();
643
834
  if (message.stopReason === 'error' || message.stopReason === 'aborted') {
644
835
  console.error('Request failed:', message.errorMessage);
645
836
  // message.content contains any partial content received before the error
@@ -647,21 +838,20 @@ if (message.stopReason === 'error' || message.stopReason === 'aborted') {
647
838
  }
648
839
  ```
649
840
 
841
+ Auth failures (no key configured, OAuth refresh failed, unknown provider) surface the same way: as a stream error with `stopReason: "error"`.
842
+
650
843
  ### Aborting Requests
651
844
 
652
845
  The abort signal allows you to cancel in-progress requests. Aborted requests have `stopReason === 'aborted'`:
653
846
 
654
847
  ```typescript
655
- import { getModel, stream } from '@earendil-works/pi-ai';
656
-
657
- const model = getModel('openai', 'gpt-4o-mini');
658
848
  const controller = new AbortController();
659
849
 
660
850
  // Abort after 2 seconds
661
851
  setTimeout(() => controller.abort(), 2000);
662
852
 
663
- const s = stream(model, {
664
- messages: [{ role: 'user', content: 'Write a long story' }]
853
+ const s = models.stream(model, {
854
+ messages: [{ role: 'user', content: 'Write a long story', timestamp: Date.now() }]
665
855
  }, {
666
856
  signal: controller.signal
667
857
  });
@@ -691,7 +881,7 @@ Aborted messages can be added to the conversation context and continued in subse
691
881
  ```typescript
692
882
  const context = {
693
883
  messages: [
694
- { role: 'user', content: 'Explain quantum computing in detail' }
884
+ { role: 'user', content: 'Explain quantum computing in detail', timestamp: Date.now() }
695
885
  ]
696
886
  };
697
887
 
@@ -699,14 +889,14 @@ const context = {
699
889
  const controller1 = new AbortController();
700
890
  setTimeout(() => controller1.abort(), 2000);
701
891
 
702
- const partial = await complete(model, context, { signal: controller1.signal });
892
+ const partial = await models.complete(model, context, { signal: controller1.signal });
703
893
 
704
894
  // Add the partial response to context
705
895
  context.messages.push(partial);
706
- context.messages.push({ role: 'user', content: 'Please continue' });
896
+ context.messages.push({ role: 'user', content: 'Please continue', timestamp: Date.now() });
707
897
 
708
898
  // Continue the conversation
709
- const continuation = await complete(model, context);
899
+ const continuation = await models.complete(model, context);
710
900
  ```
711
901
 
712
902
  ### Debugging Provider Payloads
@@ -714,7 +904,7 @@ const continuation = await complete(model, context);
714
904
  Use the `onPayload` callback to inspect the request payload sent to the provider. This is useful for debugging request formatting issues or provider validation errors.
715
905
 
716
906
  ```typescript
717
- const response = await complete(model, context, {
907
+ const response = await models.complete(model, context, {
718
908
  onPayload: (payload) => {
719
909
  console.log('Provider payload:', JSON.stringify(payload, null, 2));
720
910
  }
@@ -723,206 +913,93 @@ const response = await complete(model, context, {
723
913
 
724
914
  The callback is supported by `stream`, `complete`, `streamSimple`, and `completeSimple`.
725
915
 
726
- ## APIs, Models, and Providers
727
-
728
- The library uses a registry of API implementations. Built-in APIs include:
729
-
730
- - **`anthropic-messages`**: Anthropic Messages API (`streamAnthropic`, `AnthropicOptions`)
731
- - **`google-generative-ai`**: Google Generative AI API (`streamGoogle`, `GoogleOptions`)
732
- - **`google-vertex`**: Google Vertex AI API (`streamGoogleVertex`, `GoogleVertexOptions`)
733
- - **`mistral-conversations`**: Mistral Conversations API (`streamMistral`, `MistralOptions`)
734
- - **`openai-completions`**: OpenAI Chat Completions API (`streamOpenAICompletions`, `OpenAICompletionsOptions`)
735
- - **`openai-responses`**: OpenAI Responses API (`streamOpenAIResponses`, `OpenAIResponsesOptions`)
736
- - **`openai-codex-responses`**: OpenAI Codex Responses API (`streamOpenAICodexResponses`, `OpenAICodexResponsesOptions`)
737
- - **`azure-openai-responses`**: Azure OpenAI Responses API (`streamAzureOpenAIResponses`, `AzureOpenAIResponsesOptions`)
738
- - **`bedrock-converse-stream`**: Amazon Bedrock Converse API (`streamBedrock`, `BedrockOptions`)
916
+ ## Custom Providers
739
917
 
740
- ### Faux provider for tests
918
+ ### createProvider()
741
919
 
742
- `registerFauxProvider()` registers a temporary in-memory provider for tests and demos. It is opt-in and not part of the built-in provider set.
920
+ `createProvider()` builds a provider from parts: identity, auth, a model list, and an API implementation. Use it for local inference servers, proxies, or any OpenAI/Anthropic-compatible endpoint:
743
921
 
744
922
  ```typescript
745
- import {
746
- complete,
747
- fauxAssistantMessage,
748
- fauxText,
749
- fauxThinking,
750
- fauxToolCall,
751
- registerFauxProvider,
752
- stream,
753
- } from '@earendil-works/pi-ai';
923
+ import { createModels, createProvider, envApiKeyAuth, type Model } from '@earendil-works/pi-ai';
924
+ import { openAICompletionsApi } from '@earendil-works/pi-ai/api/openai-completions.lazy';
754
925
 
755
- const registration = registerFauxProvider({
756
- tokensPerSecond: 50 // optional
757
- });
758
-
759
- const model = registration.getModel();
760
- const context = {
761
- messages: [{ role: 'user', content: 'Summarize package.json and then call echo', timestamp: Date.now() }]
926
+ const ollamaModel: Model<'openai-completions'> = {
927
+ id: 'llama-3.1-8b',
928
+ name: 'Llama 3.1 8B (Ollama)',
929
+ api: 'openai-completions',
930
+ provider: 'ollama',
931
+ baseUrl: 'http://localhost:11434/v1',
932
+ reasoning: false,
933
+ input: ['text'],
934
+ cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
935
+ contextWindow: 128000,
936
+ maxTokens: 32000
762
937
  };
763
938
 
764
- registration.setResponses([
765
- fauxAssistantMessage([
766
- fauxThinking('Need to inspect package metadata first.'),
767
- fauxToolCall('echo', { text: 'package.json' })
768
- ], { stopReason: 'toolUse' })
769
- ]);
770
-
771
- const first = await complete(model, context, {
772
- sessionId: 'session-1',
773
- cacheRetention: 'short'
939
+ const ollama = createProvider({
940
+ id: 'ollama',
941
+ name: 'Ollama',
942
+ baseUrl: 'http://localhost:11434/v1',
943
+ // Every provider declares auth; keyless local servers resolve as configured with no key.
944
+ auth: { apiKey: { name: 'Ollama', resolve: async () => ({ auth: {} }) } },
945
+ models: [ollamaModel],
946
+ api: openAICompletionsApi(),
774
947
  });
775
- context.messages.push(first);
776
948
 
777
- context.messages.push({
778
- role: 'toolResult',
779
- toolCallId: first.content.find((block) => block.type === 'toolCall')!.id,
780
- toolName: 'echo',
781
- content: [{ type: 'text', text: 'package.json contents here' }],
782
- isError: false,
783
- timestamp: Date.now()
784
- });
949
+ const models = createModels();
950
+ models.setProvider(ollama);
785
951
 
786
- registration.setResponses([
787
- fauxAssistantMessage([
788
- fauxThinking('Now I can summarize the tool output.'),
789
- fauxText('Here is the summary.')
790
- ])
791
- ]);
952
+ await models.complete(models.getModel('ollama', 'llama-3.1-8b')!, context);
953
+ ```
792
954
 
793
- const s = stream(model, context);
794
- for await (const event of s) {
795
- console.log(event.type);
796
- }
955
+ For providers with real keys, `envApiKeyAuth(displayName, envVars)` gives the standard behavior (stored credential wins, then the first set env var):
797
956
 
798
- // Optional: register multiple faux models for model-switching tests
799
- const multiModel = registerFauxProvider({
800
- models: [
801
- { id: 'faux-fast', reasoning: false },
802
- { id: 'faux-thinker', reasoning: true }
803
- ]
957
+ ```typescript
958
+ const proxy = createProvider({
959
+ id: 'my-proxy',
960
+ auth: { apiKey: envApiKeyAuth('My proxy API key', ['MY_PROXY_API_KEY']) },
961
+ models: [/* ... */],
962
+ api: openAICompletionsApi(),
804
963
  });
805
- const thinker = multiModel.getModel('faux-thinker');
806
-
807
- console.log(thinker?.reasoning);
808
- console.log(registration.getPendingResponseCount());
809
- console.log(registration.state.callCount);
810
- registration.unregister();
811
- multiModel.unregister();
812
964
  ```
813
965
 
814
- Notes:
815
- - Responses are consumed from a queue in request start order.
816
- - If the queue is empty, the faux provider returns an assistant error message with `errorMessage: "No more faux responses queued"`.
817
- - Use `registration.setResponses([...])` to replace the remaining queue and `registration.appendResponses([...])` to add more responses.
818
- - `registration.models` exposes all registered faux models. `registration.getModel()` returns the first one, and `registration.getModel(id)` returns a specific one.
819
- - Use `fauxAssistantMessage(...)` for scripted assistant replies. Use `fauxText(...)`, `fauxThinking(...)`, and `fauxToolCall(...)` to build content blocks without filling in low-level fields manually.
820
- - `registration.unregister()` removes the temporary provider from the global API registry.
821
- - Usage is estimated at roughly 1 token per 4 characters. When `sessionId` is present and `cacheRetention` is not `"none"`, prompt cache reads and writes are simulated automatically.
822
- - Tool call arguments stream incrementally via `toolcall_delta` chunks.
823
- - By default, each streamed chunk is emitted on its own microtask. Set `tokensPerSecond` to pace chunk delivery in real time.
824
- - The intended use is one deterministic scripted flow per registration. If you need independent concurrent flows, register separate faux providers.
825
-
826
- ### Providers and Models
827
-
828
- A **provider** offers models through a specific API. For example:
829
- - **Anthropic** models use the `anthropic-messages` API
830
- - **Google** models use the `google-generative-ai` API
831
- - **OpenAI** models use the `openai-responses` API
832
- - **Mistral** models use the `mistral-conversations` API
833
- - **xAI, Cerebras, Groq, NVIDIA NIM, Together AI, etc.** models use the `openai-completions` API (OpenAI-compatible)
834
-
835
- ### Querying Providers and Models
966
+ Mixed-API providers pass a map keyed by `model.api`; each model dispatches to its API's implementation:
836
967
 
837
968
  ```typescript
838
- import { getProviders, getModels, getModel } from '@earendil-works/pi-ai';
839
-
840
- // Get all available providers
841
- const providers = getProviders();
842
- console.log(providers); // ['openai', 'anthropic', 'google', 'xai', 'groq', ...]
843
-
844
- // Get all models from a provider (fully typed)
845
- const anthropicModels = getModels('anthropic');
846
- for (const model of anthropicModels) {
847
- console.log(`${model.id}: ${model.name}`);
848
- console.log(` API: ${model.api}`); // 'anthropic-messages'
849
- console.log(` Context: ${model.contextWindow} tokens`);
850
- console.log(` Vision: ${model.input.includes('image')}`);
851
- console.log(` Reasoning: ${model.reasoning}`);
852
- }
853
-
854
- // Get a specific model (both provider and model ID are auto-completed in IDEs)
855
- const model = getModel('openai', 'gpt-4o-mini');
856
- console.log(`Using ${model.name} via ${model.api} API`);
969
+ import { anthropicMessagesApi } from '@earendil-works/pi-ai/api/anthropic-messages.lazy';
970
+ import { openAIResponsesApi } from '@earendil-works/pi-ai/api/openai-responses.lazy';
971
+
972
+ const gateway = createProvider({
973
+ id: 'my-gateway',
974
+ auth: { apiKey: envApiKeyAuth('Gateway key', ['GATEWAY_API_KEY']) },
975
+ models: [/* models with api: 'anthropic-messages' or 'openai-responses' */],
976
+ api: {
977
+ 'anthropic-messages': anthropicMessagesApi(),
978
+ 'openai-responses': openAIResponsesApi(),
979
+ },
980
+ });
857
981
  ```
858
982
 
859
- ### Custom Models
860
-
861
- You can create custom models for local inference servers or custom endpoints:
983
+ Dynamic model lists use `refreshModels`; the provider lists empty until the first `models.refresh()`:
862
984
 
863
985
  ```typescript
864
- import { Model, stream } from '@earendil-works/pi-ai';
865
-
866
- // Example: Ollama using OpenAI-compatible API
867
- const ollamaModel: Model<'openai-completions'> = {
868
- id: 'llama-3.1-8b',
869
- name: 'Llama 3.1 8B (Ollama)',
870
- api: 'openai-completions',
871
- provider: 'ollama',
872
- baseUrl: 'http://localhost:11434/v1',
873
- reasoning: false,
874
- input: ['text'],
875
- cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
876
- contextWindow: 128000,
877
- maxTokens: 32000
878
- };
879
-
880
- // Example: LiteLLM proxy with explicit compat settings
881
- const litellmModel: Model<'openai-completions'> = {
882
- id: 'gpt-4o',
883
- name: 'GPT-4o (via LiteLLM)',
884
- api: 'openai-completions',
885
- provider: 'litellm',
886
- baseUrl: 'http://localhost:4000/v1',
887
- reasoning: false,
888
- input: ['text', 'image'],
889
- cost: { input: 2.5, output: 10, cacheRead: 0, cacheWrite: 0 },
890
- contextWindow: 128000,
891
- maxTokens: 16384,
892
- compat: {
893
- supportsStore: false, // LiteLLM doesn't support the store field
894
- }
895
- };
896
-
897
- // Example: Custom endpoint with headers (bypassing Cloudflare bot detection)
898
- const proxyModel: Model<'anthropic-messages'> = {
899
- id: 'claude-sonnet-4',
900
- name: 'Claude Sonnet 4 (Proxied)',
901
- api: 'anthropic-messages',
902
- provider: 'custom-proxy',
903
- baseUrl: 'https://proxy.example.com/v1',
904
- reasoning: true,
905
- input: ['text', 'image'],
906
- cost: { input: 3, output: 15, cacheRead: 0.3, cacheWrite: 3.75 },
907
- contextWindow: 200000,
908
- maxTokens: 8192,
909
- headers: {
910
- 'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36',
911
- 'X-Custom-Auth': 'bearer-token-here'
912
- }
913
- };
914
-
915
- // Use the custom model
916
- const response = await stream(ollamaModel, context, {
917
- apiKey: 'dummy' // Ollama doesn't need a real key
986
+ const llamacpp = createProvider({
987
+ id: 'llamacpp',
988
+ auth: { apiKey: { name: 'llama.cpp', resolve: async () => ({ auth: {} }) } },
989
+ models: [],
990
+ refreshModels: async () => fetchModelsFromServer('http://localhost:8080'),
991
+ api: openAICompletionsApi(),
918
992
  });
993
+
994
+ models.setProvider(llamacpp);
995
+ await models.refresh('llamacpp');
919
996
  ```
920
997
 
921
- Some OpenAI-compatible servers do not understand the `developer` role used for reasoning-capable models. For those providers, set `compat.supportsDeveloperRole` to `false` so the system prompt is sent as a `system` message instead. If the server also does not support `reasoning_effort`, set `compat.supportsReasoningEffort` to `false` too.
998
+ Custom models can carry `headers` (e.g. proxies behind bot detection) and `compat` flags see [OpenAI Compatibility Settings](#openai-compatibility-settings).
922
999
 
923
- Use model-level `thinkingLevelMap` to describe model-specific thinking controls. Keys are pi thinking levels (`off`, `minimal`, `low`, `medium`, `high`, `xhigh`). Missing keys use provider defaults, string values are sent to the provider, and `null` marks a level unsupported.
1000
+ Some OpenAI-compatible servers do not understand the `developer` role used for reasoning-capable models. For those providers, set `compat.supportsDeveloperRole` to `false` so the system prompt is sent as a `system` message instead. If the server also does not support `reasoning_effort`, set `compat.supportsReasoningEffort` to `false` too. This commonly applies to Ollama, vLLM, SGLang, and similar OpenAI-compatible servers.
924
1001
 
925
- This commonly applies to Ollama, vLLM, SGLang, and similar OpenAI-compatible servers. You can set `compat` at the provider level or per model.
1002
+ Use model-level `thinkingLevelMap` to describe model-specific thinking controls. Keys are pi thinking levels (`off`, `minimal`, `low`, `medium`, `high`, `xhigh`). Missing keys use provider defaults, string values are sent to the provider, and `null` marks a level unsupported.
926
1003
 
927
1004
  ```typescript
928
1005
  const ollamaReasoningModel: Model<'openai-completions'> = {
@@ -950,6 +1027,36 @@ const ollamaReasoningModel: Model<'openai-completions'> = {
950
1027
  };
951
1028
  ```
952
1029
 
1030
+ ### Calling API Implementations Directly
1031
+
1032
+ The API implementations are importable on their own. Each module exports exactly `stream` and `streamSimple` with that API's full option typing. Direct calls bypass provider auth — pass `apiKey` explicitly:
1033
+
1034
+ ```typescript
1035
+ import { stream } from '@earendil-works/pi-ai/api/anthropic-messages';
1036
+
1037
+ const s = stream(claudeModel, context, {
1038
+ apiKey: process.env.ANTHROPIC_API_KEY,
1039
+ thinkingEnabled: true,
1040
+ thinkingBudgetTokens: 2048,
1041
+ });
1042
+ ```
1043
+
1044
+ Built-in API implementations live under `./api/<api-id>`:
1045
+
1046
+ | API id | Options type |
1047
+ |--------|--------------|
1048
+ | `anthropic-messages` | `AnthropicOptions` |
1049
+ | `openai-completions` | `OpenAICompletionsOptions` |
1050
+ | `openai-responses` | `OpenAIResponsesOptions` |
1051
+ | `openai-codex-responses` | `OpenAICodexResponsesOptions` |
1052
+ | `azure-openai-responses` | `AzureOpenAIResponsesOptions` |
1053
+ | `google-generative-ai` | `GoogleOptions` |
1054
+ | `google-vertex` | `GoogleVertexOptions` |
1055
+ | `mistral-conversations` | `MistralOptions` |
1056
+ | `bedrock-converse-stream` | `BedrockOptions` |
1057
+
1058
+ Importing an implementation module loads its SDK. The `./api/<id>.lazy` wrappers (used by the provider factories) defer that load to the first request when the runtime or bundler supports dynamic import chunking. Legacy raw API subpaths from older releases (`./anthropic`, `./google`, `./mistral`, `./openai-completions`, ...) were removed; use `@earendil-works/pi-ai/api/<api-id>`.
1059
+
953
1060
  ### OpenAI Compatibility Settings
954
1061
 
955
1062
  The `openai-completions` API is implemented by many providers with minor differences. By default, the library auto-detects compatibility settings based on `baseUrl` for a small set of known OpenAI-compatible providers (Cerebras, xAI, Chutes, DeepSeek, NVIDIA NIM, Together AI, zAi, OpenCode, Cloudflare Workers AI, etc.). For custom proxies or unknown endpoints, you can override these settings via the `compat` field. For `openai-responses` models, the compat field supports Responses-specific flags.
@@ -987,30 +1094,97 @@ If `compat` is not set, the library falls back to URL-based detection. If `compa
987
1094
  - **Custom inference servers**: May use non-standard field names
988
1095
  - **Self-hosted endpoints**: May have different feature support
989
1096
 
990
- ### Type Safety
1097
+ ## Faux Provider for Tests
991
1098
 
992
- Models are typed by their API, which keeps the model metadata accurate. Provider-specific option types are enforced when you call the provider functions directly. The generic `stream` and `complete` functions accept `StreamOptions` with additional provider fields.
1099
+ `fauxProvider()` builds an in-memory provider with scripted responses for tests and demos:
993
1100
 
994
1101
  ```typescript
995
- import { streamAnthropic, type AnthropicOptions } from '@earendil-works/pi-ai';
1102
+ import {
1103
+ createModels,
1104
+ fauxAssistantMessage,
1105
+ fauxProvider,
1106
+ fauxText,
1107
+ fauxThinking,
1108
+ fauxToolCall,
1109
+ } from '@earendil-works/pi-ai';
996
1110
 
997
- // TypeScript knows this is an Anthropic model
998
- const claude = getModel('anthropic', 'claude-sonnet-4-20250514');
1111
+ const faux = fauxProvider({
1112
+ tokensPerSecond: 50 // optional
1113
+ });
999
1114
 
1000
- const options: AnthropicOptions = {
1001
- thinkingEnabled: true,
1002
- thinkingBudgetTokens: 2048
1115
+ const models = createModels();
1116
+ models.setProvider(faux.provider);
1117
+
1118
+ const model = faux.getModel();
1119
+ const context = {
1120
+ messages: [{ role: 'user', content: 'Summarize package.json and then call echo', timestamp: Date.now() }]
1003
1121
  };
1004
1122
 
1005
- await streamAnthropic(claude, context, options);
1123
+ faux.setResponses([
1124
+ fauxAssistantMessage([
1125
+ fauxThinking('Need to inspect package metadata first.'),
1126
+ fauxToolCall('echo', { text: 'package.json' })
1127
+ ], { stopReason: 'toolUse' })
1128
+ ]);
1129
+
1130
+ const first = await models.complete(model, context, {
1131
+ sessionId: 'session-1',
1132
+ cacheRetention: 'short'
1133
+ });
1134
+ context.messages.push(first);
1135
+
1136
+ context.messages.push({
1137
+ role: 'toolResult',
1138
+ toolCallId: first.content.find((block) => block.type === 'toolCall')!.id,
1139
+ toolName: 'echo',
1140
+ content: [{ type: 'text', text: 'package.json contents here' }],
1141
+ isError: false,
1142
+ timestamp: Date.now()
1143
+ });
1144
+
1145
+ faux.setResponses([
1146
+ fauxAssistantMessage([
1147
+ fauxThinking('Now I can summarize the tool output.'),
1148
+ fauxText('Here is the summary.')
1149
+ ])
1150
+ ]);
1151
+
1152
+ const s = models.stream(model, context);
1153
+ for await (const event of s) {
1154
+ console.log(event.type);
1155
+ }
1156
+
1157
+ // Optional: multiple faux models for model-switching tests
1158
+ const multiModel = fauxProvider({
1159
+ provider: 'faux-multi',
1160
+ models: [
1161
+ { id: 'faux-fast', reasoning: false },
1162
+ { id: 'faux-thinker', reasoning: true }
1163
+ ]
1164
+ });
1165
+ models.setProvider(multiModel.provider);
1166
+ const thinker = multiModel.getModel('faux-thinker');
1167
+
1168
+ console.log(thinker?.reasoning);
1169
+ console.log(faux.getPendingResponseCount());
1170
+ console.log(faux.state.callCount);
1006
1171
  ```
1007
1172
 
1173
+ Notes:
1174
+ - Responses are consumed from a queue in request start order.
1175
+ - If the queue is empty, the faux provider returns an assistant error message with `errorMessage: "No more faux responses queued"`.
1176
+ - Use `faux.setResponses([...])` to replace the remaining queue and `faux.appendResponses([...])` to add more responses.
1177
+ - `faux.models` exposes all faux models. `faux.getModel()` returns the first one, and `faux.getModel(id)` returns a specific one.
1178
+ - Use `fauxAssistantMessage(...)` for scripted assistant replies. Use `fauxText(...)`, `fauxThinking(...)`, and `fauxToolCall(...)` to build content blocks without filling in low-level fields manually.
1179
+ - Usage is estimated at roughly 1 token per 4 characters. When `sessionId` is present and `cacheRetention` is not `"none"`, prompt cache reads and writes are simulated automatically.
1180
+ - Tool call arguments stream incrementally via `toolcall_delta` chunks.
1181
+ - By default, each streamed chunk is emitted on its own microtask. Set `tokensPerSecond` to pace chunk delivery in real time.
1182
+ - The intended use is one deterministic scripted flow per handle. If you need independent concurrent flows, create separate faux providers with distinct `provider` ids.
1183
+
1008
1184
  ## Cross-Provider Handoffs
1009
1185
 
1010
1186
  The library supports seamless handoffs between different LLM providers within the same conversation. This allows you to switch models mid-conversation while preserving context, including thinking blocks, tool calls, and tool results.
1011
1187
 
1012
- ### How It Works
1013
-
1014
1188
  When messages from one provider are sent to a different provider, the library automatically transforms them for compatibility:
1015
1189
 
1016
1190
  - **User and tool result messages** are passed through unchanged
@@ -1018,98 +1192,86 @@ When messages from one provider are sent to a different provider, the library au
1018
1192
  - **Assistant messages from different providers** have their thinking blocks converted to text with `<thinking>` tags
1019
1193
  - **Tool calls and regular text** are preserved unchanged
1020
1194
 
1021
- ### Example: Multi-Provider Conversation
1022
-
1023
1195
  ```typescript
1024
- import { getModel, complete, Context } from '@earendil-works/pi-ai';
1196
+ import { createModels, type Context } from '@earendil-works/pi-ai';
1197
+ import { anthropicProvider } from '@earendil-works/pi-ai/providers/anthropic';
1198
+ import { openaiProvider } from '@earendil-works/pi-ai/providers/openai';
1199
+ import { googleProvider } from '@earendil-works/pi-ai/providers/google';
1025
1200
 
1026
- // Start with Claude
1027
- const claude = getModel('anthropic', 'claude-sonnet-4-20250514');
1028
- const context: Context = {
1029
- messages: []
1030
- };
1201
+ const models = createModels();
1202
+ models.setProvider(anthropicProvider());
1203
+ models.setProvider(openaiProvider());
1204
+ models.setProvider(googleProvider());
1031
1205
 
1032
- context.messages.push({ role: 'user', content: 'What is 25 * 18?' });
1033
- const claudeResponse = await complete(claude, context, {
1034
- thinkingEnabled: true
1035
- });
1036
- context.messages.push(claudeResponse);
1206
+ const context: Context = { messages: [] };
1207
+
1208
+ // Start with Claude
1209
+ const claude = models.getModel('anthropic', 'claude-sonnet-4-5')!;
1210
+ context.messages.push({ role: 'user', content: 'What is 25 * 18?', timestamp: Date.now() });
1211
+ context.messages.push(await models.completeSimple(claude, context, { reasoning: 'medium' }));
1037
1212
 
1038
1213
  // Switch to GPT-5 - it will see Claude's thinking as <thinking> tagged text
1039
- const gpt5 = getModel('openai', 'gpt-5-mini');
1040
- context.messages.push({ role: 'user', content: 'Is that calculation correct?' });
1041
- const gptResponse = await complete(gpt5, context);
1042
- context.messages.push(gptResponse);
1214
+ const gpt5 = models.getModel('openai', 'gpt-5-mini')!;
1215
+ context.messages.push({ role: 'user', content: 'Is that calculation correct?', timestamp: Date.now() });
1216
+ context.messages.push(await models.complete(gpt5, context));
1043
1217
 
1044
1218
  // Switch to Gemini
1045
- const gemini = getModel('google', 'gemini-2.5-flash');
1046
- context.messages.push({ role: 'user', content: 'What was the original question?' });
1047
- const geminiResponse = await complete(gemini, context);
1219
+ const gemini = models.getModel('google', 'gemini-2.5-flash')!;
1220
+ context.messages.push({ role: 'user', content: 'What was the original question?', timestamp: Date.now() });
1221
+ const geminiResponse = await models.complete(gemini, context);
1048
1222
  ```
1049
1223
 
1050
- ### Provider Compatibility
1051
-
1052
- All providers can handle messages from other providers, including:
1053
- - Text content
1054
- - Tool calls and tool results (including images in tool results)
1055
- - Thinking/reasoning blocks (transformed to tagged text for cross-provider compatibility)
1056
- - Aborted messages with partial content
1057
-
1058
- This enables flexible workflows where you can:
1059
- - Start with a fast model for initial responses
1060
- - Switch to a more capable model for complex reasoning
1061
- - Use specialized models for specific tasks
1062
- - Maintain conversation continuity across provider outages
1224
+ All providers can handle messages from other providers — text, tool calls and results (including images), thinking blocks (transformed to tagged text), and aborted messages with partial content. This enables flexible workflows: start with a fast model, switch to a more capable one for complex reasoning, or maintain continuity across provider outages.
1063
1225
 
1064
1226
  ## Context Serialization
1065
1227
 
1066
1228
  The `Context` object can be easily serialized and deserialized using standard JSON methods, making it simple to persist conversations, implement chat history, or transfer contexts between services:
1067
1229
 
1068
1230
  ```typescript
1069
- import { Context, getModel, complete } from '@earendil-works/pi-ai';
1070
-
1071
- // Create and use a context
1072
1231
  const context: Context = {
1073
1232
  systemPrompt: 'You are a helpful assistant.',
1074
1233
  messages: [
1075
- { role: 'user', content: 'What is TypeScript?' }
1234
+ { role: 'user', content: 'What is TypeScript?', timestamp: Date.now() }
1076
1235
  ]
1077
1236
  };
1078
1237
 
1079
- const model = getModel('openai', 'gpt-4o-mini');
1080
- const response = await complete(model, context);
1238
+ const model = models.getModel('openai', 'gpt-4o-mini')!;
1239
+ const response = await models.complete(model, context);
1081
1240
  context.messages.push(response);
1082
1241
 
1083
1242
  // Serialize the entire context
1084
1243
  const serialized = JSON.stringify(context);
1085
- console.log('Serialized context size:', serialized.length, 'bytes');
1086
1244
 
1087
1245
  // Save to database, localStorage, file, etc.
1088
1246
  localStorage.setItem('conversation', serialized);
1089
1247
 
1090
1248
  // Later: deserialize and continue the conversation
1091
1249
  const restored: Context = JSON.parse(localStorage.getItem('conversation')!);
1092
- restored.messages.push({ role: 'user', content: 'Tell me more about its type system' });
1250
+ restored.messages.push({ role: 'user', content: 'Tell me more about its type system', timestamp: Date.now() });
1093
1251
 
1094
1252
  // Continue with any model
1095
- const newModel = getModel('anthropic', 'claude-3-5-haiku-20241022');
1096
- const continuation = await complete(newModel, restored);
1253
+ const newModel = models.getModel('anthropic', 'claude-3-5-haiku-20241022')!;
1254
+ const continuation = await models.complete(newModel, restored);
1097
1255
  ```
1098
1256
 
1257
+ Models are plain serializable data too — no functions or implementations attached — so persisting "which model was this conversation using" is a `JSON.stringify` away.
1258
+
1099
1259
  > **Note**: If the context contains images (encoded as base64 as shown in the Image Input section), those will also be serialized.
1100
1260
 
1101
1261
  ## Browser Usage
1102
1262
 
1103
- The library supports browser environments. You must pass the API key explicitly since environment variables are not available in browsers:
1263
+ The library supports browser environments. The core entrypoint and provider factories are side-effect free and bundle cleanly. Environment variables are not available in browsers, so pass API keys explicitly — or inject a `CredentialStore` (e.g. localStorage-backed) and let provider auth resolve from stored credentials:
1104
1264
 
1105
1265
  ```typescript
1106
- import { getModel, complete } from '@earendil-works/pi-ai';
1266
+ import { createModels } from '@earendil-works/pi-ai';
1267
+ import { anthropicProvider } from '@earendil-works/pi-ai/providers/anthropic';
1107
1268
 
1108
- // API key must be passed explicitly in browser
1109
- const model = getModel('anthropic', 'claude-3-5-haiku-20241022');
1269
+ const models = createModels();
1270
+ models.setProvider(anthropicProvider());
1110
1271
 
1111
- const response = await complete(model, {
1112
- messages: [{ role: 'user', content: 'Hello!' }]
1272
+ const model = models.getModel('anthropic', 'claude-3-5-haiku-20241022')!;
1273
+ const response = await models.complete(model, {
1274
+ messages: [{ role: 'user', content: 'Hello!', timestamp: Date.now() }]
1113
1275
  }, {
1114
1276
  apiKey: 'your-api-key'
1115
1277
  });
@@ -1117,70 +1279,75 @@ const response = await complete(model, {
1117
1279
 
1118
1280
  > **Security Warning**: Exposing API keys in frontend code is dangerous. Anyone can extract and abuse your keys. Only use this approach for internal tools or demos. For production applications, use a backend proxy that keeps your API keys secure.
1119
1281
 
1120
- ### Browser Compatibility Notes
1282
+ Browser compatibility notes:
1121
1283
 
1122
- - Amazon Bedrock (`bedrock-converse-stream`) is not supported in browser environments.
1123
- - OAuth login flows are not supported in browser environments. Use the `@earendil-works/pi-ai/oauth` entry point in Node.js.
1124
- - In browser builds, Bedrock can still appear in model lists. Calls to Bedrock models fail at runtime.
1284
+ - Amazon Bedrock (`bedrock-converse-stream`) is not supported in browser environments. It can still appear in model lists; calls fail at runtime.
1285
+ - OAuth login flows are Node-only. They are lazy-loaded behind bundler-opaque imports, so registering an OAuth-capable provider does not pull Node-only code into a browser bundle — only actually logging in would.
1125
1286
  - Use a server-side proxy or backend service if you need Bedrock or OAuth-based auth from a web app.
1126
- - Use `@earendil-works/pi-ai/base` plus explicit direct transport registration when a browser bundle should exclude unused provider SDK implementations.
1127
1287
 
1128
- ### Environment Variables (Node.js only)
1288
+ ## Bundling and Tree Shaking
1129
1289
 
1130
- In Node.js environments, you can set environment variables to avoid passing API keys:
1290
+ For small bundles, import only the providers you need:
1131
1291
 
1132
- | Provider | Environment Variable(s) |
1133
- |----------|------------------------|
1134
- | OpenAI | `OPENAI_API_KEY` |
1135
- | Ant Ling | `ANT_LING_API_KEY` |
1136
- | Azure OpenAI | `AZURE_OPENAI_API_KEY` + `AZURE_OPENAI_BASE_URL` (e.g. `https://{resource}.openai.azure.com`) or `AZURE_OPENAI_RESOURCE_NAME`. Supports `*.openai.azure.com` and `*.cognitiveservices.azure.com`; root endpoints auto-normalize to `/openai/v1`. Optional: `AZURE_OPENAI_API_VERSION` (default `v1`), `AZURE_OPENAI_DEPLOYMENT_NAME_MAP`. |
1137
- | Anthropic | `ANTHROPIC_API_KEY` or `ANTHROPIC_OAUTH_TOKEN` |
1138
- | DeepSeek | `DEEPSEEK_API_KEY` |
1139
- | NVIDIA NIM | `NVIDIA_API_KEY` |
1140
- | Google | `GEMINI_API_KEY` |
1141
- | Vertex AI | `GOOGLE_CLOUD_API_KEY` or `GOOGLE_CLOUD_PROJECT` (or `GCLOUD_PROJECT`) + `GOOGLE_CLOUD_LOCATION` + ADC |
1142
- | Mistral | `MISTRAL_API_KEY` |
1143
- | Groq | `GROQ_API_KEY` |
1144
- | Cerebras | `CEREBRAS_API_KEY` |
1145
- | Cloudflare AI Gateway | `CLOUDFLARE_API_KEY` + `CLOUDFLARE_ACCOUNT_ID` + `CLOUDFLARE_GATEWAY_ID` |
1146
- | Cloudflare Workers AI | `CLOUDFLARE_API_KEY` + `CLOUDFLARE_ACCOUNT_ID` |
1147
- | xAI | `XAI_API_KEY` |
1148
- | Fireworks | `FIREWORKS_API_KEY` |
1149
- | Together AI | `TOGETHER_API_KEY` |
1150
- | OpenRouter | `OPENROUTER_API_KEY` |
1151
- | Vercel AI Gateway | `AI_GATEWAY_API_KEY` |
1152
- | zAI | `ZAI_API_KEY` |
1153
- | ZAI Coding Plan (China) | `ZAI_CODING_CN_API_KEY` |
1154
- | MiniMax | `MINIMAX_API_KEY` |
1155
- | OpenCode Zen / OpenCode Go | `OPENCODE_API_KEY` |
1156
- | Kimi For Coding | `KIMI_API_KEY` |
1157
- | Xiaomi MiMo (API billing) | `XIAOMI_API_KEY` |
1158
- | Xiaomi MiMo Token Plan (China) | `XIAOMI_TOKEN_PLAN_CN_API_KEY` |
1159
- | Xiaomi MiMo Token Plan (Amsterdam) | `XIAOMI_TOKEN_PLAN_AMS_API_KEY` |
1160
- | Xiaomi MiMo Token Plan (Singapore) | `XIAOMI_TOKEN_PLAN_SGP_API_KEY` |
1161
- | GitHub Copilot | `COPILOT_GITHUB_TOKEN` |
1292
+ ```typescript
1293
+ import { createModels } from '@earendil-works/pi-ai';
1294
+ import { openaiProvider } from '@earendil-works/pi-ai/providers/openai';
1295
+
1296
+ const models = createModels();
1297
+ models.setProvider(openaiProvider());
1298
+ ```
1299
+
1300
+ Rules:
1301
+
1302
+ - `@earendil-works/pi-ai` is the core entrypoint and does not import built-in catalogs, provider factories, or SDK implementations.
1303
+ - `@earendil-works/pi-ai/providers/<provider>` imports that provider's catalog and lazy API wrapper only.
1304
+ - `@earendil-works/pi-ai/providers/all` imports every built-in provider factory and all catalogs. Use it only when you want the full built-in set.
1305
+ - With code splitting, provider SDKs stay in lazy chunks and load on first request.
1306
+ - Without code splitting, bundlers fold reachable lazy API implementations into the single bundle. A single-provider bundle then includes that provider's SDK; `providers/all` includes all statically visible SDKs. Bedrock is the exception: its AWS SDK implementation is loaded through a bundler-opaque Node-only import.
1307
+ - Importing `@earendil-works/pi-ai/api/<api-id>` directly loads that API implementation and its SDK immediately.
1308
+
1309
+ Avoid `@earendil-works/pi-ai/compat` in new bundled apps; it preserves the old global API and imports the full built-in catalog surface.
1310
+
1311
+ For single-file Node ESM bundles, some SDK dependencies may still use dynamic CommonJS `require()` internally. If you see errors such as `Dynamic require of "child_process" is not supported`, add a Node `require` shim to the bundle. With esbuild:
1312
+
1313
+ ```bash
1314
+ esbuild app.js --bundle --platform=node --format=esm \
1315
+ --banner:js='import { createRequire } from "module";const require = createRequire(import.meta.url);' \
1316
+ --outfile=app.bundle.js
1317
+ ```
1162
1318
 
1163
- When set, the library automatically uses these keys:
1319
+ This is only for Node bundles; it is not a browser or Cloudflare Workers workaround.
1320
+
1321
+ Bedrock is Node-only. Add it like any other provider:
1164
1322
 
1165
1323
  ```typescript
1166
- // Uses OPENAI_API_KEY from environment
1167
- const model = getModel('openai', 'gpt-4o-mini');
1168
- const response = await complete(model, context);
1324
+ import { createModels } from '@earendil-works/pi-ai';
1325
+ import { amazonBedrockProvider } from '@earendil-works/pi-ai/providers/amazon-bedrock';
1169
1326
 
1170
- // Or override with explicit key
1171
- const response = await complete(model, context, {
1172
- apiKey: 'sk-different-key'
1173
- });
1327
+ const models = createModels();
1328
+ models.setProvider(amazonBedrockProvider());
1329
+ ```
1330
+
1331
+ In normal Node package usage and code-split bundles, Bedrock loads its AWS SDK implementation lazily. For a standalone single-file bundle that must include Bedrock support, register the implementation module explicitly:
1332
+
1333
+ ```typescript
1334
+ import { setBedrockProviderModule } from '@earendil-works/pi-ai/api/bedrock-converse-stream.lazy';
1335
+ import { bedrockProviderModule } from '@earendil-works/pi-ai/bedrock-provider';
1336
+
1337
+ setBedrockProviderModule(bedrockProviderModule);
1174
1338
  ```
1175
1339
 
1340
+ That explicit override bundles the AWS SDK. Without it, Bedrock's opaque runtime import expects the package's Bedrock implementation file to be available at runtime.
1341
+
1176
1342
  ### Provider-Scoped Environment Overrides
1177
1343
 
1178
- Pass `env` in stream options to scope provider configuration to a request. Values in `env` are used before process environment variables for API key discovery and provider configuration such as Cloudflare account IDs, Azure OpenAI settings, Vertex project/location, Bedrock settings, `PI_CACHE_RETENTION`, and `HTTP_PROXY`/`HTTPS_PROXY`.
1344
+ Pass `env` in stream options to scope provider configuration to a request. Values in `env` are used before process environment variables for provider auth and configuration such as Cloudflare account IDs, Azure OpenAI settings, Vertex project/location, Bedrock settings, `PI_CACHE_RETENTION`, and `HTTP_PROXY`/`HTTPS_PROXY`.
1179
1345
 
1180
1346
  ```typescript
1181
- const model = getModel('cloudflare-ai-gateway', 'workers-ai/@cf/moonshotai/kimi-k2.6');
1347
+ const models = builtinModels();
1348
+ const model = models.getModel('cloudflare-ai-gateway', 'workers-ai/@cf/moonshotai/kimi-k2.6')!;
1182
1349
 
1183
- const response = await complete(model, context, {
1350
+ const response = await models.complete(model, context, {
1184
1351
  env: {
1185
1352
  CLOUDFLARE_API_KEY: '...',
1186
1353
  CLOUDFLARE_ACCOUNT_ID: 'account-id',
@@ -1191,24 +1358,47 @@ const response = await complete(model, context, {
1191
1358
 
1192
1359
  Use this when one process needs different provider settings per request, or when ambient environment variables should not leak into a provider call.
1193
1360
 
1194
- ### Checking Environment Variables
1195
-
1196
- ```typescript
1197
- import { getEnvApiKey } from '@earendil-works/pi-ai';
1198
-
1199
- // Check if an API key is set in environment variables
1200
- const key = getEnvApiKey('openai'); // checks OPENAI_API_KEY
1201
- ```
1202
-
1203
1361
  ## OAuth Providers
1204
1362
 
1205
- Several providers require OAuth authentication instead of static API keys:
1363
+ Several providers support OAuth authentication instead of static API keys:
1206
1364
 
1207
1365
  - **Anthropic** (Claude Pro/Max subscription)
1208
1366
  - **OpenAI Codex** (ChatGPT Plus/Pro subscription, access to GPT-5.x Codex models)
1209
1367
  - **GitHub Copilot** (Copilot subscription)
1210
1368
 
1211
- For paid Cloud Code Assist subscriptions, set `GOOGLE_CLOUD_PROJECT` or `GOOGLE_CLOUD_PROJECT_ID` to your project ID.
1369
+ Each of these providers carries an `OAuthAuth` on `provider.auth.oauth` with three operations: `login(callbacks)` runs the interactive flow and returns a credential, `refresh(credential)` exchanges the refresh token, and `toAuth(credential)` derives request auth (GitHub Copilot's per-account base URL comes from here). Refresh is automatic: `models.getAuth()` and the request paths refresh expired tokens under a credential-store lock, so concurrent requests and processes cannot double-refresh.
1370
+
1371
+ ```typescript
1372
+ import { createModels } from '@earendil-works/pi-ai';
1373
+ import { anthropicProvider } from '@earendil-works/pi-ai/providers/anthropic';
1374
+
1375
+ const models = createModels({ credentials: myStore }); // persistent CredentialStore
1376
+ models.setProvider(anthropicProvider());
1377
+
1378
+ // Login: drive the flow with prompt()/notify() callbacks, persist the credential
1379
+ const provider = models.getProvider('anthropic')!;
1380
+ const credential = await provider.auth.oauth!.login({
1381
+ prompt: async (p) => {
1382
+ // p.type: 'text' | 'secret' | 'select' | 'manual_code'
1383
+ // manual_code prompts race a local callback server; p.signal aborts them when the server wins
1384
+ return await askUser(p.message);
1385
+ },
1386
+ notify: (event) => {
1387
+ // event.type: 'auth_url' | 'device_code' | 'progress'
1388
+ if (event.type === 'auth_url') console.log(`Open: ${event.url}`);
1389
+ if (event.type === 'device_code') console.log(`Code: ${event.userCode} at ${event.verificationUri}`);
1390
+ if (event.type === 'progress') console.log(event.message);
1391
+ },
1392
+ });
1393
+ await myStore.modify('anthropic', async () => credential);
1394
+
1395
+ // From here on, requests resolve and refresh the token automatically
1396
+ const model = models.getModel('anthropic', 'claude-sonnet-4-5')!;
1397
+ await models.complete(model, context);
1398
+
1399
+ // Logout
1400
+ await myStore.delete('anthropic');
1401
+ ```
1212
1402
 
1213
1403
  ### Vertex AI
1214
1404
 
@@ -1220,8 +1410,6 @@ Vertex AI models support either a Google Cloud API key or Application Default Cr
1220
1410
 
1221
1411
  When using ADC, also set `GOOGLE_CLOUD_PROJECT` (or `GCLOUD_PROJECT`) and `GOOGLE_CLOUD_LOCATION`. You can also pass `project`/`location` in the call options. When using `GOOGLE_CLOUD_API_KEY`, `project` and `location` are not required.
1222
1412
 
1223
- Example:
1224
-
1225
1413
  ```bash
1226
1414
  # Local (uses your user credentials)
1227
1415
  gcloud auth application-default login
@@ -1232,23 +1420,6 @@ export GOOGLE_CLOUD_LOCATION="us-central1"
1232
1420
  export GOOGLE_APPLICATION_CREDENTIALS="/path/to/service-account.json"
1233
1421
  ```
1234
1422
 
1235
- ```typescript
1236
- import { getModel, complete } from '@earendil-works/pi-ai';
1237
-
1238
- (async () => {
1239
- const model = getModel('google-vertex', 'gemini-2.5-flash');
1240
- const response = await complete(model, {
1241
- messages: [{ role: 'user', content: 'Hello from Vertex AI' }]
1242
- }, {
1243
- apiKey: process.env.GOOGLE_CLOUD_API_KEY,
1244
- });
1245
-
1246
- for (const block of response.content) {
1247
- if (block.type === 'text') console.log(block.text);
1248
- }
1249
- })().catch(console.error);
1250
- ```
1251
-
1252
1423
  Official docs: [Application Default Credentials](https://cloud.google.com/docs/authentication/application-default-credentials)
1253
1424
 
1254
1425
  ### CLI Login
@@ -1265,124 +1436,77 @@ Credentials are saved to `auth.json` in the current directory.
1265
1436
 
1266
1437
  ### Programmatic OAuth
1267
1438
 
1268
- The library provides login and token refresh functions via the `@earendil-works/pi-ai/oauth` entry point. Credential storage is the caller's responsibility.
1269
-
1270
- ```typescript
1271
- import {
1272
- // Login functions (return credentials, do not store)
1273
- loginAnthropic,
1274
- loginOpenAICodex,
1275
- loginGitHubCopilot,
1276
- loginGeminiCli,
1277
-
1278
- // Token management
1279
- refreshOAuthToken, // (provider, credentials) => new credentials
1280
- getOAuthApiKey, // (provider, credentialsMap) => { newCredentials, apiKey } | null
1281
-
1282
- // Types
1283
- type OAuthProvider,
1284
- type OAuthCredentials,
1285
- } from '@earendil-works/pi-ai/oauth';
1286
- ```
1439
+ The legacy flow functions remain available via the `@earendil-works/pi-ai/oauth` entry point (`loginAnthropic`, `loginOpenAICodex`, `loginGitHubCopilot`, `refreshOAuthToken`, `getOAuthApiKey`); credential storage is the caller's responsibility there. New code should prefer the provider-owned `OAuthAuth` shown above — it composes with the credential store and gets locked auto-refresh for free.
1287
1440
 
1288
- ### Login Flow Example
1441
+ Provider notes:
1289
1442
 
1290
- ```typescript
1291
- import { loginGitHubCopilot } from '@earendil-works/pi-ai/oauth';
1292
- import { writeFileSync } from 'fs';
1443
+ **OpenAI Codex**: Requires a ChatGPT Plus or Pro subscription. Provides access to GPT-5.x Codex models with extended context windows and reasoning capabilities. The library automatically handles session-based prompt caching when `sessionId` is provided in stream options. You can set `transport` in stream options to `"sse"`, `"websocket"`, or `"auto"` for Codex Responses transport selection. When using WebSocket with a `sessionId`, connections are reused per session and expire after 5 minutes of inactivity.
1293
1444
 
1294
- const credentials = await loginGitHubCopilot({
1295
- onAuth: (url, instructions) => {
1296
- console.log(`Open: ${url}`);
1297
- if (instructions) console.log(instructions);
1298
- },
1299
- onPrompt: async (prompt) => {
1300
- return await getUserInput(prompt.message);
1301
- },
1302
- onProgress: (message) => console.log(message)
1303
- });
1445
+ **Azure OpenAI (Responses)**: Uses the Responses API only. Set `AZURE_OPENAI_API_KEY` and either `AZURE_OPENAI_BASE_URL` or `AZURE_OPENAI_RESOURCE_NAME`. `AZURE_OPENAI_BASE_URL` supports both `https://<resource>.openai.azure.com` and `https://<resource>.cognitiveservices.azure.com`; root endpoints are normalized to `.../openai/v1` automatically. Use `AZURE_OPENAI_API_VERSION` (defaults to `v1`) to override the API version if needed. Deployment names are treated as model IDs by default, override with `azureDeploymentName` or `AZURE_OPENAI_DEPLOYMENT_NAME_MAP` using comma-separated `model-id=deployment` pairs (for example `gpt-4o-mini=my-deployment,gpt-4o=prod`). Legacy deployment-based URLs are intentionally unsupported.
1304
1446
 
1305
- // Store credentials yourself
1306
- const auth = { 'github-copilot': { type: 'oauth', ...credentials } };
1307
- writeFileSync('auth.json', JSON.stringify(auth, null, 2));
1308
- ```
1447
+ **GitHub Copilot**: If you get "The requested model is not supported" error, enable the model manually in VS Code: open Copilot Chat, click the model selector, select the model (warning icon), and click "Enable".
1309
1448
 
1310
- ### Using OAuth Tokens
1449
+ ## Migrating from the Old Global API
1311
1450
 
1312
- Use `getOAuthApiKey()` to get an API key, automatically refreshing if expired:
1451
+ Older versions exposed a global API: `stream()`/`complete()` dispatching on `model.api` via a global registry, sync `getModel()`/`getModels()`/`getProviders()` catalog reads, `registerApiProvider()`, `getEnvApiKey()`, and per-API lazy stream functions. That surface lives unchanged on the **compat entrypoint**:
1313
1452
 
1314
1453
  ```typescript
1454
+ // Before
1315
1455
  import { getModel, complete } from '@earendil-works/pi-ai';
1316
- import { getOAuthApiKey } from '@earendil-works/pi-ai/oauth';
1317
- import { readFileSync, writeFileSync } from 'fs';
1318
-
1319
- // Load your stored credentials
1320
- const auth = JSON.parse(readFileSync('auth.json', 'utf-8'));
1321
-
1322
- // Get API key (refreshes if expired)
1323
- const result = await getOAuthApiKey('github-copilot', auth);
1324
- if (!result) throw new Error('Not logged in');
1325
1456
 
1326
- // Save refreshed credentials
1327
- auth['github-copilot'] = { type: 'oauth', ...result.newCredentials };
1328
- writeFileSync('auth.json', JSON.stringify(auth, null, 2));
1329
-
1330
- // Use the API key
1331
- const model = getModel('github-copilot', 'gpt-4o');
1332
- const response = await complete(model, {
1333
- messages: [{ role: 'user', content: 'Hello!' }]
1334
- }, { apiKey: result.apiKey });
1457
+ // After (verbatim behavior, one import-path change)
1458
+ import { getModel, complete } from '@earendil-works/pi-ai/compat';
1335
1459
  ```
1336
1460
 
1337
- ### Provider Notes
1461
+ Compat is a strict superset of the root entrypoint, so a file can switch its import path wholesale. It will be removed in a future release; migrate to `createModels()` + provider factories:
1338
1462
 
1339
- **OpenAI Codex**: Requires a ChatGPT Plus or Pro subscription. Provides access to GPT-5.x Codex models with extended context windows and reasoning capabilities. The library automatically handles session-based prompt caching when `sessionId` is provided in stream options. You can set `transport` in stream options to `"sse"`, `"websocket"`, or `"auto"` for Codex Responses transport selection. When using WebSocket with a `sessionId`, connections are reused per session and expire after 5 minutes of inactivity.
1340
-
1341
- **Azure OpenAI (Responses)**: Uses the Responses API only. Set `AZURE_OPENAI_API_KEY` and either `AZURE_OPENAI_BASE_URL` or `AZURE_OPENAI_RESOURCE_NAME`. `AZURE_OPENAI_BASE_URL` supports both `https://<resource>.openai.azure.com` and `https://<resource>.cognitiveservices.azure.com`; root endpoints are normalized to `.../openai/v1` automatically. Use `AZURE_OPENAI_API_VERSION` (defaults to `v1`) to override the API version if needed. Deployment names are treated as model IDs by default, override with `azureDeploymentName` or `AZURE_OPENAI_DEPLOYMENT_NAME_MAP` using comma-separated `model-id=deployment` pairs (for example `gpt-4o-mini=my-deployment,gpt-4o=prod`). Legacy deployment-based URLs are intentionally unsupported.
1342
-
1343
- **GitHub Copilot**: If you get "The requested model is not supported" error, enable the model manually in VS Code: open Copilot Chat, click the model selector, select the model (warning icon), and click "Enable".
1463
+ | Old | New |
1464
+ |-----|-----|
1465
+ | `getModel('openai', 'gpt-4o-mini')` | `models.getModel('openai', 'gpt-4o-mini')` or `getBuiltinModel()` from `providers/all` |
1466
+ | `getModels('anthropic')` / `getProviders()` | `models.getModels('anthropic')` / `models.getProviders()` or `getBuiltin*` |
1467
+ | `stream(model, ctx, opts)` (env-key injection) | `models.stream(model, ctx, opts)` (provider auth resolution) |
1468
+ | `registerApiProvider({ api, stream, streamSimple })` | `createProvider({ id, auth, models, api })` + `models.setProvider()` |
1469
+ | `getEnvApiKey('openai')` | `await models.getAuth(model)` |
1470
+ | `streamAnthropic(model, ctx, opts)` | `stream` from `@earendil-works/pi-ai/api/anthropic-messages`, or a provider in a collection |
1471
+ | `registerFauxProvider()` | `fauxProvider()` + `models.setProvider()` |
1344
1472
 
1345
1473
  ## Development
1346
1474
 
1347
1475
  ### Adding a New Provider
1348
1476
 
1349
- Adding a new LLM provider requires changes across multiple files. This checklist covers all necessary steps:
1477
+ Adding a new LLM provider requires changes across multiple files. The layered layout: API implementations live in `src/api/`, provider factories in `src/providers/`, generated catalogs in `src/providers/<id>.models.ts`. This checklist covers all necessary steps:
1350
1478
 
1351
1479
  #### 1. Core Types (`src/types.ts`)
1352
1480
 
1353
- - Add the API identifier to `KnownApi` (for example `"bedrock-converse-stream"`)
1354
- - Create an options interface extending `StreamOptions` (for example `BedrockOptions`)
1481
+ - Add the API identifier to `KnownApi` (for example `"bedrock-converse-stream"`), if it is a new API
1355
1482
  - Add the provider name to `KnownProvider` (for example `"amazon-bedrock"`)
1483
+ - Add the options type to `ApiOptionsMap`
1356
1484
 
1357
- #### 2. Provider Implementation (`src/providers/`)
1485
+ #### 2. API Implementation (`src/api/<api-id>.ts`, only for a new API)
1358
1486
 
1359
- Create a new provider file (for example `amazon-bedrock.ts`) that exports:
1487
+ Create a new API implementation file (for example `bedrock-converse-stream.ts`) that exports exactly `stream` and `streamSimple`, plus:
1360
1488
 
1361
- - `stream<Provider>()` function returning `AssistantMessageEventStream`
1362
- - `streamSimple<Provider>()` for `SimpleStreamOptions` mapping
1363
- - `register()` for explicit direct transport registration from `@earendil-works/pi-ai/base`
1364
- - Provider-specific options interface
1489
+ - An options interface extending `StreamOptions` (for example `BedrockOptions`)
1365
1490
  - Message conversion functions to transform `Context` to provider format
1366
1491
  - Tool conversion if the provider supports tools
1367
1492
  - Response parsing to emit standardized events (`text`, `tool_call`, `thinking`, `usage`, `stop`)
1368
1493
 
1369
- #### 3. API Registry Integration (`src/providers/register-builtins.ts`)
1494
+ Add a lazy wrapper `src/api/<api-id>.lazy.ts` (`<name>Api()` via `lazyApi()`) so providers can reference the implementation without importing its SDK. Add any root-level `export type` re-exports in `src/index.ts` that should remain available from `@earendil-works/pi-ai`.
1370
1495
 
1371
- - Register the API with `registerApiProvider()`
1372
- - Add a package subpath export in `package.json` for the provider module (`./dist/providers/<provider>.js`)
1373
- - Add lazy loader wrappers in `src/providers/register-builtins.ts`, do not statically import provider implementation modules there
1374
- - Add any root-level `export type` re-exports in `src/index.ts` and `src/base.ts` that should remain available from `@earendil-works/pi-ai` and `@earendil-works/pi-ai/base`
1375
- - Keep `src/base.ts` free of built-in registration imports
1376
- - Add credential detection in `env-api-keys.ts` for the new provider
1377
- - Ensure `streamSimple` handles auth lookup via `getEnvApiKey()` or provider-specific auth
1378
-
1379
- #### 4. Model Generation (`scripts/generate-models.ts`, `scripts/generate-image-models.ts`)
1496
+ #### 3. Model Generation (`scripts/generate-models.ts`, `scripts/generate-image-models.ts`)
1380
1497
 
1381
1498
  - Add logic to fetch and parse models from the provider's source (e.g., models.dev API)
1382
- - Map chat/tool-capable provider model data to the standardized `Model` interface via `scripts/generate-models.ts`
1499
+ - Map chat/tool-capable provider model data to the standardized `Model` interface via `scripts/generate-models.ts`; regeneration emits `src/providers/<id>.models.ts` and the aggregator
1383
1500
  - Map image-generation provider model data to the standardized `ImagesModel` interface via `scripts/generate-image-models.ts`
1384
1501
  - Handle provider-specific quirks (pricing format, capability flags, model ID transformations)
1385
1502
 
1503
+ #### 4. Provider Factory (`src/providers/<id>.ts`)
1504
+
1505
+ - `createProvider()` wiring catalog + auth + the lazy API wrapper
1506
+ - Auth: `envApiKeyAuth` for standard key providers, a custom `ApiKeyAuth` for ambient auth (AWS profiles, ADC), `lazyOAuth` where an OAuth flow exists
1507
+ - Register the factory in `src/providers/all.ts`
1508
+ - If it is a new API: register it in the builtin list in `src/compat.ts` and add the package subpath export in `package.json`
1509
+
1386
1510
  #### 5. Tests (`test/`)
1387
1511
 
1388
1512
  Create or update test files to cover the new provider:
@@ -1398,6 +1522,7 @@ Create or update test files to cover the new provider:
1398
1522
  - `image-tool-result.test.ts` - Images in tool results
1399
1523
  - `total-tokens.test.ts` - Token counting accuracy
1400
1524
  - `cross-provider-handoff.test.ts` - Cross-provider context replay
1525
+ - `providers.test.ts` - Provider listing and auth resolution
1401
1526
 
1402
1527
  For `cross-provider-handoff.test.ts`, add at least one provider/model pair. If the provider exposes multiple model families (for example GPT and Claude), add at least one pair per family.
1403
1528