mdcontext 0.0.1 → 0.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (337) hide show
  1. package/.changeset/README.md +28 -0
  2. package/.changeset/config.json +11 -0
  3. package/.claude/settings.local.json +25 -0
  4. package/.github/workflows/ci.yml +83 -0
  5. package/.github/workflows/claude-code-review.yml +44 -0
  6. package/.github/workflows/claude.yml +85 -0
  7. package/.github/workflows/release.yml +113 -0
  8. package/.tldrignore +112 -0
  9. package/BACKLOG.md +338 -0
  10. package/CONTRIBUTING.md +186 -0
  11. package/NOTES/NOTES +44 -0
  12. package/README.md +434 -11
  13. package/biome.json +36 -0
  14. package/cspell.config.yaml +14 -0
  15. package/dist/chunk-23UPXDNL.js +3044 -0
  16. package/dist/chunk-2W7MO2DL.js +1366 -0
  17. package/dist/chunk-3NUAZGMA.js +1689 -0
  18. package/dist/chunk-7TOWB2XB.js +366 -0
  19. package/dist/chunk-7XOTOADQ.js +3065 -0
  20. package/dist/chunk-AH2PDM2K.js +3042 -0
  21. package/dist/chunk-BNXWSZ63.js +3742 -0
  22. package/dist/chunk-BTL5DJVU.js +3222 -0
  23. package/dist/chunk-HDHYG7E4.js +104 -0
  24. package/dist/chunk-HLR4KZBP.js +3234 -0
  25. package/dist/chunk-IP3FRFEB.js +1045 -0
  26. package/dist/chunk-KHU56VDO.js +3042 -0
  27. package/dist/chunk-KRYIFLQR.js +88 -0
  28. package/dist/chunk-LBSDNLEM.js +287 -0
  29. package/dist/chunk-MNTQ7HCP.js +2643 -0
  30. package/dist/chunk-MUJELQQ6.js +1387 -0
  31. package/dist/chunk-MXJGMSLV.js +2199 -0
  32. package/dist/chunk-N6QJGC3Z.js +2636 -0
  33. package/dist/chunk-OBELGBPM.js +1713 -0
  34. package/dist/chunk-OT7R5XTA.js +3192 -0
  35. package/dist/chunk-P7X4RA2T.js +106 -0
  36. package/dist/chunk-PIDUQNC2.js +3185 -0
  37. package/dist/chunk-POGCDIH4.js +3187 -0
  38. package/dist/chunk-PSIEOQGZ.js +3043 -0
  39. package/dist/chunk-PVRT3IHA.js +3238 -0
  40. package/dist/chunk-QNN4TT23.js +1430 -0
  41. package/dist/chunk-RE3R45RJ.js +3042 -0
  42. package/dist/chunk-S7E6TFX6.js +803 -0
  43. package/dist/chunk-SG6GLU4U.js +1378 -0
  44. package/dist/chunk-SJCDV2ST.js +274 -0
  45. package/dist/chunk-SYE5XLF3.js +104 -0
  46. package/dist/chunk-T5VLYBZD.js +103 -0
  47. package/dist/chunk-TOQB7VWU.js +3238 -0
  48. package/dist/chunk-VFNMZ4ZQ.js +3228 -0
  49. package/dist/chunk-VVTGZNBT.js +1629 -0
  50. package/dist/chunk-W7Q4RFEV.js +104 -0
  51. package/dist/chunk-XTYYVRLO.js +3190 -0
  52. package/dist/chunk-Y6MDYVJD.js +3063 -0
  53. package/dist/cli/main.d.ts +1 -0
  54. package/dist/cli/main.js +5458 -0
  55. package/dist/index.d.ts +653 -0
  56. package/dist/index.js +79 -0
  57. package/dist/mcp/server.d.ts +1 -0
  58. package/dist/mcp/server.js +472 -0
  59. package/dist/schema-BAWSG7KY.js +22 -0
  60. package/dist/schema-E3QUPL26.js +20 -0
  61. package/dist/schema-EHL7WUT6.js +20 -0
  62. package/docs/019-USAGE.md +625 -0
  63. package/docs/020-current-implementation.md +364 -0
  64. package/docs/021-DOGFOODING-FINDINGS.md +175 -0
  65. package/docs/BACKLOG.md +80 -0
  66. package/docs/CONFIG.md +1123 -0
  67. package/docs/DESIGN.md +439 -0
  68. package/docs/ERRORS.md +383 -0
  69. package/docs/PROJECT.md +88 -0
  70. package/docs/ROADMAP.md +407 -0
  71. package/docs/summarization.md +320 -0
  72. package/docs/test-links.md +9 -0
  73. package/justfile +40 -0
  74. package/package.json +74 -9
  75. package/pnpm-workspace.yaml +5 -0
  76. package/research/INDEX.md +315 -0
  77. package/research/code-review/README.md +90 -0
  78. package/research/code-review/cli-error-handling-review.md +979 -0
  79. package/research/code-review/code-review-validation-report.md +464 -0
  80. package/research/code-review/main-ts-review.md +1128 -0
  81. package/research/config-analysis/01-current-implementation.md +470 -0
  82. package/research/config-analysis/02-strategy-recommendation.md +428 -0
  83. package/research/config-analysis/03-task-candidates.md +715 -0
  84. package/research/config-analysis/033-research-configuration-management.md +828 -0
  85. package/research/config-analysis/034-research-effect-cli-config.md +1504 -0
  86. package/research/config-analysis/04-consolidated-task-candidates.md +277 -0
  87. package/research/config-docs/SUMMARY.md +357 -0
  88. package/research/config-docs/TEST-RESULTS.md +776 -0
  89. package/research/config-docs/TODO.md +542 -0
  90. package/research/config-docs/analysis.md +744 -0
  91. package/research/config-docs/fix-validation.md +502 -0
  92. package/research/config-docs/help-audit.md +264 -0
  93. package/research/config-docs/help-system-analysis.md +890 -0
  94. package/research/dogfood/consolidated-tool-evaluation.md +373 -0
  95. package/research/dogfood/strategy-a/a-synthesis.md +184 -0
  96. package/research/dogfood/strategy-a/a1-docs.md +226 -0
  97. package/research/dogfood/strategy-a/a2-amorphic.md +156 -0
  98. package/research/dogfood/strategy-a/a3-llm.md +164 -0
  99. package/research/dogfood/strategy-b/b-synthesis.md +228 -0
  100. package/research/dogfood/strategy-b/b1-architecture.md +207 -0
  101. package/research/dogfood/strategy-b/b2-gaps.md +258 -0
  102. package/research/dogfood/strategy-b/b3-workflows.md +250 -0
  103. package/research/dogfood/strategy-c/c-synthesis.md +451 -0
  104. package/research/dogfood/strategy-c/c1-explorer.md +192 -0
  105. package/research/dogfood/strategy-c/c2-diver-memory.md +145 -0
  106. package/research/dogfood/strategy-c/c3-diver-control.md +148 -0
  107. package/research/dogfood/strategy-c/c4-diver-failure.md +151 -0
  108. package/research/dogfood/strategy-c/c5-diver-execution.md +221 -0
  109. package/research/dogfood/strategy-c/c6-diver-org.md +221 -0
  110. package/research/effect-cli-error-handling.md +845 -0
  111. package/research/effect-errors-as-values.md +943 -0
  112. package/research/errors-task-analysis/00-consolidated-tasks.md +207 -0
  113. package/research/errors-task-analysis/cli-commands-analysis.md +909 -0
  114. package/research/errors-task-analysis/embeddings-analysis.md +709 -0
  115. package/research/errors-task-analysis/index-search-analysis.md +812 -0
  116. package/research/frontmatter/COMMENTS-ARE-SKIPPED.md +149 -0
  117. package/research/frontmatter/LLM-CODE-NAVIGATION.md +276 -0
  118. package/research/issue-review.md +603 -0
  119. package/research/llm-summarization/agent-cli-tools-2026.md +1082 -0
  120. package/research/llm-summarization/alternative-providers-2026.md +1428 -0
  121. package/research/llm-summarization/anthropic-2026.md +367 -0
  122. package/research/llm-summarization/claude-cli-integration.md +1706 -0
  123. package/research/llm-summarization/cli-integration-patterns.md +3155 -0
  124. package/research/llm-summarization/openai-2026.md +473 -0
  125. package/research/llm-summarization/openai-compatible-providers-2026.md +1022 -0
  126. package/research/llm-summarization/opencode-cli-integration.md +1552 -0
  127. package/research/llm-summarization/prompt-engineering-2026.md +1426 -0
  128. package/research/llm-summarization/prototype-results.md +56 -0
  129. package/research/llm-summarization/provider-switching-patterns-2026.md +2153 -0
  130. package/research/llm-summarization/typescript-llm-libraries-2026.md +2436 -0
  131. package/research/mdcontext-error-analysis.md +521 -0
  132. package/research/mdcontext-pudding/00-EXECUTIVE-SUMMARY.md +282 -0
  133. package/research/mdcontext-pudding/01-index-embed.md +956 -0
  134. package/research/mdcontext-pudding/02-search-COMMANDS.md +142 -0
  135. package/research/mdcontext-pudding/02-search-SUMMARY.md +146 -0
  136. package/research/mdcontext-pudding/02-search.md +970 -0
  137. package/research/mdcontext-pudding/03-context.md +779 -0
  138. package/research/mdcontext-pudding/04-navigation-and-analytics.md +803 -0
  139. package/research/mdcontext-pudding/04-tree.md +704 -0
  140. package/research/mdcontext-pudding/05-config.md +1038 -0
  141. package/research/mdcontext-pudding/06-links-summary.txt +87 -0
  142. package/research/mdcontext-pudding/06-links.md +679 -0
  143. package/research/mdcontext-pudding/07-stats.md +693 -0
  144. package/research/mdcontext-pudding/BUG-FIX-PLAN.md +388 -0
  145. package/research/mdcontext-pudding/P0-BUG-VALIDATION.md +167 -0
  146. package/research/mdcontext-pudding/README.md +168 -0
  147. package/research/mdcontext-pudding/TESTING-SUMMARY.md +128 -0
  148. package/research/npm_publish/011-npm-workflow-research-agent2.md +792 -0
  149. package/research/npm_publish/012-npm-workflow-research-agent1.md +530 -0
  150. package/research/npm_publish/013-npm-workflow-research-agent3.md +722 -0
  151. package/research/npm_publish/014-npm-workflow-synthesis.md +556 -0
  152. package/research/npm_publish/031-npm-workflow-task-analysis.md +134 -0
  153. package/research/research-quality-review.md +834 -0
  154. package/research/semantic-search/002-research-embedding-models.md +490 -0
  155. package/research/semantic-search/003-research-rag-alternatives.md +523 -0
  156. package/research/semantic-search/004-research-vector-search.md +841 -0
  157. package/research/semantic-search/032-research-semantic-search.md +427 -0
  158. package/research/semantic-search/embedding-text-analysis.md +156 -0
  159. package/research/semantic-search/multi-word-failure-reproduction.md +171 -0
  160. package/research/semantic-search/query-processing-analysis.md +207 -0
  161. package/research/semantic-search/root-cause-and-solution.md +114 -0
  162. package/research/semantic-search/threshold-validation-report.md +69 -0
  163. package/research/semantic-search/vector-search-analysis.md +63 -0
  164. package/research/task-management-2026/00-synthesis-recommendations.md +295 -0
  165. package/research/task-management-2026/01-ai-workflow-tools.md +416 -0
  166. package/research/task-management-2026/02-agent-framework-patterns.md +476 -0
  167. package/research/task-management-2026/03-lightweight-file-based.md +567 -0
  168. package/research/task-management-2026/04-established-tools-ai-features.md +541 -0
  169. package/research/task-management-2026/linear/01-core-features-workflow.md +771 -0
  170. package/research/task-management-2026/linear/02-api-integrations.md +930 -0
  171. package/research/task-management-2026/linear/03-ai-features.md +368 -0
  172. package/research/task-management-2026/linear/04-pricing-setup.md +205 -0
  173. package/research/task-management-2026/linear/05-usage-patterns-best-practices.md +605 -0
  174. package/research/test-path-issues.md +276 -0
  175. package/review/ALP-76/1-error-type-design.md +962 -0
  176. package/review/ALP-76/2-error-handling-patterns.md +906 -0
  177. package/review/ALP-76/3-error-presentation.md +624 -0
  178. package/review/ALP-76/4-test-coverage.md +625 -0
  179. package/review/ALP-76/5-migration-completeness.md +440 -0
  180. package/review/ALP-76/6-effect-best-practices.md +755 -0
  181. package/scripts/apply-branch-protection.sh +47 -0
  182. package/scripts/branch-protection-templates.json +79 -0
  183. package/scripts/prototype-summarization.ts +346 -0
  184. package/scripts/rebuild-hnswlib.js +58 -0
  185. package/scripts/setup-branch-protection.sh +64 -0
  186. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/active-provider.json +7 -0
  187. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.json +541 -0
  188. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.meta.json +5 -0
  189. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/config.json +8 -0
  190. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
  191. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
  192. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/documents.json +60 -0
  193. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/links.json +13 -0
  194. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/sections.json +1197 -0
  195. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/configuration-management.md +99 -0
  196. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/distributed-systems.md +92 -0
  197. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/error-handling.md +78 -0
  198. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/failure-automation.md +55 -0
  199. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/job-context.md +69 -0
  200. package/src/__tests__/fixtures/semantic-search/multi-word-corpus/process-orchestration.md +99 -0
  201. package/src/cli/argv-preprocessor.test.ts +210 -0
  202. package/src/cli/argv-preprocessor.ts +202 -0
  203. package/src/cli/cli.test.ts +627 -0
  204. package/src/cli/commands/backlinks.ts +54 -0
  205. package/src/cli/commands/config-cmd.ts +642 -0
  206. package/src/cli/commands/context.ts +285 -0
  207. package/src/cli/commands/duplicates.ts +122 -0
  208. package/src/cli/commands/embeddings.ts +529 -0
  209. package/src/cli/commands/index-cmd.ts +480 -0
  210. package/src/cli/commands/index.ts +16 -0
  211. package/src/cli/commands/links.ts +52 -0
  212. package/src/cli/commands/search.ts +1281 -0
  213. package/src/cli/commands/stats.ts +149 -0
  214. package/src/cli/commands/tree.ts +128 -0
  215. package/src/cli/config-layer.ts +176 -0
  216. package/src/cli/error-handler.test.ts +235 -0
  217. package/src/cli/error-handler.ts +655 -0
  218. package/src/cli/flag-schemas.ts +341 -0
  219. package/src/cli/help.ts +588 -0
  220. package/src/cli/index.ts +9 -0
  221. package/src/cli/main.ts +435 -0
  222. package/src/cli/options.ts +41 -0
  223. package/src/cli/shared-error-handling.ts +199 -0
  224. package/src/cli/typo-suggester.test.ts +105 -0
  225. package/src/cli/typo-suggester.ts +130 -0
  226. package/src/cli/utils.ts +259 -0
  227. package/src/config/file-provider.test.ts +320 -0
  228. package/src/config/file-provider.ts +273 -0
  229. package/src/config/index.ts +72 -0
  230. package/src/config/integration.test.ts +667 -0
  231. package/src/config/precedence.test.ts +277 -0
  232. package/src/config/precedence.ts +451 -0
  233. package/src/config/schema.test.ts +414 -0
  234. package/src/config/schema.ts +603 -0
  235. package/src/config/service.test.ts +320 -0
  236. package/src/config/service.ts +243 -0
  237. package/src/config/testing.test.ts +264 -0
  238. package/src/config/testing.ts +110 -0
  239. package/src/core/index.ts +1 -0
  240. package/src/core/types.ts +113 -0
  241. package/src/duplicates/detector.test.ts +183 -0
  242. package/src/duplicates/detector.ts +414 -0
  243. package/src/duplicates/index.ts +18 -0
  244. package/src/embeddings/embedding-namespace.test.ts +300 -0
  245. package/src/embeddings/embedding-namespace.ts +947 -0
  246. package/src/embeddings/heading-boost.test.ts +222 -0
  247. package/src/embeddings/hnsw-build-options.test.ts +198 -0
  248. package/src/embeddings/hyde.test.ts +272 -0
  249. package/src/embeddings/hyde.ts +264 -0
  250. package/src/embeddings/index.ts +10 -0
  251. package/src/embeddings/openai-provider.ts +414 -0
  252. package/src/embeddings/pricing.json +22 -0
  253. package/src/embeddings/provider-constants.ts +204 -0
  254. package/src/embeddings/provider-errors.test.ts +967 -0
  255. package/src/embeddings/provider-errors.ts +565 -0
  256. package/src/embeddings/provider-factory.test.ts +240 -0
  257. package/src/embeddings/provider-factory.ts +225 -0
  258. package/src/embeddings/provider-integration.test.ts +788 -0
  259. package/src/embeddings/query-preprocessing.test.ts +187 -0
  260. package/src/embeddings/semantic-search-threshold.test.ts +508 -0
  261. package/src/embeddings/semantic-search.ts +1270 -0
  262. package/src/embeddings/types.ts +359 -0
  263. package/src/embeddings/vector-store.ts +708 -0
  264. package/src/embeddings/voyage-provider.ts +313 -0
  265. package/src/errors/errors.test.ts +845 -0
  266. package/src/errors/index.ts +533 -0
  267. package/src/index/ignore-patterns.test.ts +354 -0
  268. package/src/index/ignore-patterns.ts +305 -0
  269. package/src/index/index.ts +4 -0
  270. package/src/index/indexer.ts +684 -0
  271. package/src/index/storage.ts +260 -0
  272. package/src/index/types.ts +147 -0
  273. package/src/index/watcher.ts +189 -0
  274. package/src/index.ts +30 -0
  275. package/src/integration/search-keyword.test.ts +678 -0
  276. package/src/mcp/server.ts +612 -0
  277. package/src/parser/index.ts +1 -0
  278. package/src/parser/parser.test.ts +291 -0
  279. package/src/parser/parser.ts +394 -0
  280. package/src/parser/section-filter.test.ts +277 -0
  281. package/src/parser/section-filter.ts +392 -0
  282. package/src/search/__tests__/hybrid-search.test.ts +650 -0
  283. package/src/search/bm25-store.ts +366 -0
  284. package/src/search/cross-encoder.test.ts +253 -0
  285. package/src/search/cross-encoder.ts +406 -0
  286. package/src/search/fuzzy-search.test.ts +419 -0
  287. package/src/search/fuzzy-search.ts +273 -0
  288. package/src/search/hybrid-search.ts +448 -0
  289. package/src/search/path-matcher.test.ts +276 -0
  290. package/src/search/path-matcher.ts +33 -0
  291. package/src/search/query-parser.test.ts +260 -0
  292. package/src/search/query-parser.ts +319 -0
  293. package/src/search/searcher.test.ts +280 -0
  294. package/src/search/searcher.ts +724 -0
  295. package/src/search/wink-bm25.d.ts +30 -0
  296. package/src/summarization/cli-providers/claude.ts +202 -0
  297. package/src/summarization/cli-providers/detection.test.ts +273 -0
  298. package/src/summarization/cli-providers/detection.ts +118 -0
  299. package/src/summarization/cli-providers/index.ts +8 -0
  300. package/src/summarization/cost.test.ts +139 -0
  301. package/src/summarization/cost.ts +102 -0
  302. package/src/summarization/error-handler.test.ts +127 -0
  303. package/src/summarization/error-handler.ts +111 -0
  304. package/src/summarization/index.ts +102 -0
  305. package/src/summarization/pipeline.test.ts +498 -0
  306. package/src/summarization/pipeline.ts +231 -0
  307. package/src/summarization/prompts.test.ts +269 -0
  308. package/src/summarization/prompts.ts +133 -0
  309. package/src/summarization/provider-factory.test.ts +396 -0
  310. package/src/summarization/provider-factory.ts +178 -0
  311. package/src/summarization/types.ts +184 -0
  312. package/src/summarize/budget-bugs.test.ts +620 -0
  313. package/src/summarize/formatters.ts +419 -0
  314. package/src/summarize/index.ts +20 -0
  315. package/src/summarize/summarizer.test.ts +275 -0
  316. package/src/summarize/summarizer.ts +597 -0
  317. package/src/summarize/verify-bugs.test.ts +238 -0
  318. package/src/types/huggingface-transformers.d.ts +66 -0
  319. package/src/utils/index.ts +1 -0
  320. package/src/utils/tokens.test.ts +142 -0
  321. package/src/utils/tokens.ts +186 -0
  322. package/tests/fixtures/cli/.mdcontext/active-provider.json +7 -0
  323. package/tests/fixtures/cli/.mdcontext/config.json +8 -0
  324. package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
  325. package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
  326. package/tests/fixtures/cli/.mdcontext/indexes/documents.json +33 -0
  327. package/tests/fixtures/cli/.mdcontext/indexes/links.json +12 -0
  328. package/tests/fixtures/cli/.mdcontext/indexes/sections.json +247 -0
  329. package/tests/fixtures/cli/README.md +9 -0
  330. package/tests/fixtures/cli/api-reference.md +11 -0
  331. package/tests/fixtures/cli/getting-started.md +11 -0
  332. package/tests/integration/embed-index.test.ts +712 -0
  333. package/tests/integration/search-context.test.ts +469 -0
  334. package/tests/integration/search-semantic.test.ts +522 -0
  335. package/tsconfig.json +26 -0
  336. package/vitest.config.ts +16 -0
  337. package/vitest.setup.ts +12 -0
@@ -0,0 +1,367 @@
1
+ # Anthropic Claude Models & Pricing - 2026 Research
2
+
3
+ **Last Updated:** January 26, 2026
4
+ **Research Focus:** Latest Claude models, pricing, context windows, and code understanding capabilities
5
+
6
+ ---
7
+
8
+ ## Executive Summary
9
+
10
+ As of 2026, Anthropic's **Claude 4.5 series** represents the state-of-the-art in AI language models, offering a **67% cost reduction** over previous generations while delivering superior performance. The flagship **Claude Sonnet 4.5** is marketed as "the best coding model in the world" and is the recommended starting point for most use cases.
11
+
12
+ **Key Highlights:**
13
+ - Claude 4.5 series launched late 2025, offering three tiers: Haiku, Sonnet, and Opus
14
+ - Up to 1M token context window (beta) for Sonnet 4 and 4.5
15
+ - Significant cost optimization features: Prompt Caching (90% savings), Batch API (50% discount)
16
+ - Advanced code understanding and summarization capabilities
17
+
18
+ ---
19
+
20
+ ## Model Lineup & Pricing
21
+
22
+ ### Claude 4.5 Series (Current Generation - 2026)
23
+
24
+ The Claude 4.5 series offers the best balance of performance and cost in Anthropic's lineup:
25
+
26
+ | Model | Input Price | Output Price | Use Case | Speed |
27
+ |-------|-------------|--------------|----------|-------|
28
+ | **Haiku 4.5** | $1/M tokens | $5/M tokens | High-volume, speed-critical tasks | Fastest |
29
+ | **Sonnet 4.5** | $3/M tokens | $15/M tokens | Balanced performance & cost | Balanced |
30
+ | **Opus 4.5** | $5/M tokens | $25/M tokens | Most capable flagship model | Most Capable |
31
+
32
+ **Recommended Starting Point:** Claude Sonnet 4.5 - offers the strongest coding and agent performance in the Claude lineup.
33
+
34
+ ### Legacy Models (Claude 4/4.1 Series)
35
+
36
+ Legacy models remain available but at significantly higher cost:
37
+
38
+ | Model | Input Price | Output Price | Notes |
39
+ |-------|-------------|--------------|-------|
40
+ | **Opus 4/4.1** | $15/M tokens | $75/M tokens | Most expensive; 67% more costly than Opus 4.5 |
41
+
42
+ **Migration Recommendation:** Migrate to Claude 4.5 series for better performance at lower cost.
43
+
44
+ ---
45
+
46
+ ## Context Window Sizes
47
+
48
+ Claude models in 2026 offer industry-leading context windows:
49
+
50
+ ### Standard Context Window
51
+ - **200,000 tokens** (default for all models)
52
+ - Approximately 150,000 words or 500+ pages of material
53
+ - Suitable for most use cases
54
+
55
+ ### Extended Context Window (Beta)
56
+ - **1,000,000 tokens** available for Sonnet 4 and 4.5
57
+ - Requires `context-1m-2025-08-07` beta header
58
+ - Access limited to organizations in **Usage Tier 4** or with custom rate limits
59
+ - Long context pricing applies to requests exceeding 200K tokens
60
+
61
+ ### Enterprise Plan Access
62
+ - 500K context window for Claude Sonnet 4.5 on Enterprise plans
63
+ - Available for chatting with Claude in the web interface
64
+
65
+ **Note:** The 1M token context window is currently in beta and requires advancing to Usage Tier 4.
66
+
67
+ ---
68
+
69
+ ## Cost Optimization Features
70
+
71
+ ### 1. Prompt Caching (90% Savings)
72
+
73
+ Prompt caching achieves **90% savings** on repeated content after just 2 requests. This is particularly valuable for:
74
+ - Repeated codebase context
75
+ - Standard system prompts
76
+ - Recurring documentation or knowledge bases
77
+
78
+ **Implementation:** Automatically applied to repeated context in API requests.
79
+
80
+ ### 2. Batch API (50% Discount)
81
+
82
+ The Batch API allows asynchronous processing of large volumes of requests with a **50% discount** on both input and output tokens.
83
+
84
+ **Best For:**
85
+ - Large-scale code analysis
86
+ - Batch summarization tasks
87
+ - Non-time-sensitive processing
88
+
89
+ ### 3. Extended Thinking
90
+
91
+ Extended thinking tokens are billed as output tokens, not as a separate pricing tier. When you enable extended thinking with a token budget (minimum 1,024 tokens), any tokens the model uses for internal reasoning are charged at the standard output rate for that model.
92
+
93
+ **Use Case:** Complex reasoning tasks requiring step-by-step thinking.
94
+
95
+ ---
96
+
97
+ ## Code Understanding Capabilities
98
+
99
+ ### Claude Sonnet 4.5 - Best for Code Analysis
100
+
101
+ Claude Sonnet 4.5 is Anthropic's **strongest coding model** with exceptional capabilities:
102
+
103
+ **Code Understanding:**
104
+ - Substantially improved problem-solving and codebase navigation
105
+ - Navigation errors reduced from 20% to near zero
106
+ - State-of-the-art for complex codebase understanding
107
+ - Improved precision for complex changes across multiple files
108
+
109
+ **Autonomous Development:**
110
+ - Excels at autonomous multi-feature app development
111
+ - Three-phase workflow: gather context → take action → verify results
112
+ - Tools for searching files, editing code, and running tests
113
+
114
+ **Context & Navigation:**
115
+ - Can understand and navigate large codebases effectively
116
+ - 1M token context window enables processing entire repositories
117
+ - Superior performance with longer code blocks
118
+
119
+ ### Claude Haiku 4.5 - Speed & Efficiency
120
+
121
+ Haiku 4.5 achieves **90% of Sonnet 4.5's performance** in agentic coding evaluation at a fraction of the cost:
122
+
123
+ **Best For:**
124
+ - Code review
125
+ - Documentation generation
126
+ - Linting and test generation
127
+ - Sub-agent tasks
128
+ - Simple frontend/backend tasks
129
+
130
+ **Limitations:**
131
+ - Tends to hallucinate when generating code exceeding 150 lines
132
+ - Better for well-defined tasks with clear solution spaces
133
+
134
+ ### Summarization Capabilities
135
+
136
+ Claude excels at summarization tasks:
137
+
138
+ - **Natural Language Processing:** Condensing large amounts of text while retaining key information
139
+ - **Legal Documents:** Particularly emphasized for legal document summarization
140
+ - **Code Summarization:** Opus 4.5 can read all relevant context and determine what details are relevant for specific contexts
141
+ - **Lossy Compression:** Explore agent returns summaries with some information loss, but maintains contextual relevance
142
+
143
+ **Note:** WebFetch and Cowork summarization includes prompt injection protection layers.
144
+
145
+ ---
146
+
147
+ ## API Features & Rate Limits
148
+
149
+ ### Usage Tiers
150
+
151
+ Anthropic uses a tier-based system where organizations automatically advance as they reach spending thresholds:
152
+
153
+ | Tier | Deposit Requirement | Monthly Spend Limit | Requests/Min | Special Access |
154
+ |------|---------------------|---------------------|--------------|----------------|
155
+ | **Tier 1** | $5 | $100 | 50 RPM | - |
156
+ | **Tier 2** | - | - | Higher | - |
157
+ | **Tier 3** | - | - | Higher | - |
158
+ | **Tier 4** | $400+ | Higher | Highest | 1M token context window |
159
+
160
+ **Tier Advancement:** Organizations advance immediately upon reaching the cumulative credit purchase threshold (excluding tax).
161
+
162
+ ### Rate Limit Structure
163
+
164
+ Rate limits are measured across three dimensions:
165
+ 1. **RPM** - Requests per minute
166
+ 2. **ITPM** - Input tokens per minute
167
+ 3. **OTPM** - Output tokens per minute
168
+
169
+ **Algorithm:** Token bucket algorithm - capacity continuously replenishes up to maximum limit rather than resetting at fixed intervals.
170
+
171
+ **Organization-Level:** Limits are set at the organization level and can be viewed in the Claude Console Limits page.
172
+
173
+ ### Spend Limits
174
+
175
+ - Each tier has a maximum monthly API spend limit
176
+ - Once reached, API access is paused until the next calendar month or tier advancement
177
+ - Custom limits available through sales contact
178
+
179
+ ### Priority Tier Access
180
+
181
+ For higher custom limits or Priority Tier access:
182
+ - Contact sales through Claude Console
183
+ - Enhanced service levels available
184
+ - Custom rate limit arrangements
185
+
186
+ ---
187
+
188
+ ## Model Selection Recommendations
189
+
190
+ ### For Code Summarization & Analysis
191
+
192
+ **Primary Choice: Claude Sonnet 4.5**
193
+ - Best overall coding model
194
+ - Superior accuracy and understanding
195
+ - 1M token context window (beta)
196
+ - Handles complex, multi-file analysis
197
+ - **Cost:** $3 input / $15 output per million tokens
198
+
199
+ **Secondary Choice: Claude Haiku 4.5**
200
+ - 90% of Sonnet performance at lower cost
201
+ - Excellent for simpler tasks
202
+ - Fast iteration and testing
203
+ - **Cost:** $1 input / $5 output per million tokens
204
+
205
+ **Hybrid Strategy:**
206
+ - Use Sonnet 4.5 for in-depth analysis and complex reasoning
207
+ - Use Haiku 4.5 for quick reviews, documentation, and sub-tasks
208
+ - Optimize costs while maintaining quality
209
+
210
+ ### For Different Use Cases
211
+
212
+ | Use Case | Recommended Model | Rationale |
213
+ |----------|-------------------|-----------|
214
+ | **Large codebase analysis** | Sonnet 4.5 | 1M context window, superior navigation |
215
+ | **Quick code reviews** | Haiku 4.5 | Speed and cost efficiency |
216
+ | **Complex multi-file changes** | Sonnet 4.5 | Improved precision for complexity |
217
+ | **Documentation generation** | Haiku 4.5 | Well-defined task, cost-effective |
218
+ | **Autonomous development** | Sonnet 4.5 | Best problem-solving capabilities |
219
+ | **Batch summarization** | Haiku 4.5 + Batch API | 50% discount, efficient processing |
220
+ | **Legal/technical summarization** | Opus 4.5 | Highest capability, contextual understanding |
221
+
222
+ ---
223
+
224
+ ## Cost Comparison Examples
225
+
226
+ ### Code Summarization Task Example
227
+
228
+ **Scenario:** Summarize a 50K token codebase, output 5K token summary
229
+
230
+ | Model | Input Cost | Output Cost | Total Cost |
231
+ |-------|------------|-------------|------------|
232
+ | Haiku 4.5 | $0.05 | $0.025 | **$0.075** |
233
+ | Sonnet 4.5 | $0.15 | $0.075 | **$0.225** |
234
+ | Opus 4.5 | $0.25 | $0.125 | **$0.375** |
235
+ | Opus 4.1 (legacy) | $0.75 | $0.375 | **$1.125** |
236
+
237
+ **With Prompt Caching (after 2nd request):**
238
+ - Haiku 4.5: ~$0.012 per request
239
+ - Sonnet 4.5: ~$0.038 per request
240
+
241
+ **With Batch API (50% discount):**
242
+ - Haiku 4.5: $0.0375 per request
243
+ - Sonnet 4.5: $0.1125 per request
244
+
245
+ ### Large-Scale Analysis Example
246
+
247
+ **Scenario:** Process 100 repositories, 100K tokens each, 10K token summaries
248
+
249
+ **Without Optimization:**
250
+ - Haiku 4.5: $75
251
+ - Sonnet 4.5: $225
252
+ - Opus 4.5: $375
253
+
254
+ **With Batch API + Prompt Caching:**
255
+ - Haiku 4.5: ~$12-15
256
+ - Sonnet 4.5: ~$38-45
257
+
258
+ **Cost Savings:** Up to 80-90% with optimization features
259
+
260
+ ---
261
+
262
+ ## Implementation Recommendations
263
+
264
+ ### Getting Started
265
+
266
+ 1. **Start with Sonnet 4.5** for initial development and testing
267
+ 2. **Implement Prompt Caching** for repeated context (codebase analysis)
268
+ 3. **Use Batch API** for non-time-sensitive large-scale processing
269
+ 4. **Monitor usage** through Claude Console Limits page
270
+
271
+ ### Scaling Strategy
272
+
273
+ 1. **Tier Advancement:**
274
+ - Start with Tier 1 ($5 deposit)
275
+ - Advance to Tier 4 ($400+ cumulative) for 1M context window access
276
+
277
+ 2. **Hybrid Model Approach:**
278
+ - Sonnet 4.5 for complex analysis and initial summarization
279
+ - Haiku 4.5 for follow-up tasks, reviews, and sub-agent work
280
+
281
+ 3. **Cost Optimization:**
282
+ - Enable prompt caching for repeated codebase context
283
+ - Use Batch API for bulk processing (50% discount)
284
+ - Monitor token usage and optimize prompt design
285
+
286
+ ### Best Practices
287
+
288
+ 1. **Context Window Usage:**
289
+ - Use 200K standard window for most tasks
290
+ - Request 1M window beta access for very large codebases
291
+ - Be mindful of long context pricing (>200K tokens)
292
+
293
+ 2. **Error Reduction:**
294
+ - Sonnet 4.5's near-zero navigation errors reduce debugging costs
295
+ - Better first-time accuracy means fewer API calls
296
+
297
+ 3. **Quality vs. Cost:**
298
+ - Use Sonnet 4.5 for quality-critical work
299
+ - Use Haiku 4.5 for high-volume, well-defined tasks
300
+ - Leverage Opus 4.5 only when highest capability is essential
301
+
302
+ ---
303
+
304
+ ## Future Considerations
305
+
306
+ ### Trends to Watch
307
+
308
+ 1. **Model Evolution:** Anthropic is continuously improving models - expect further cost reductions and capability improvements
309
+ 2. **Context Window Expansion:** 1M token window currently in beta; likely to become standard
310
+ 3. **Pricing Changes:** Historical trend shows cost reductions with new generations
311
+ 4. **API Features:** New optimization features (beyond caching and batch) may emerge
312
+
313
+ ### Migration Path
314
+
315
+ - **From Claude 3.5:** Immediate migration to Claude 4.5 recommended
316
+ - **From Claude 4/4.1:** Migrate to Claude 4.5 for 67% cost savings
317
+ - **From Other Providers:** Sonnet 4.5's coding capabilities and context window offer competitive advantages
318
+
319
+ ---
320
+
321
+ ## Sources & References
322
+
323
+ ### Official Documentation
324
+ - [Pricing - Claude API Docs](https://platform.claude.com/docs/en/about-claude/pricing)
325
+ - [Context Windows - Claude API Docs](https://platform.claude.com/docs/en/build-with-claude/context-windows)
326
+ - [Models Overview - Claude API Docs](https://platform.claude.com/docs/en/about-claude/models/overview)
327
+ - [Rate Limits - Claude API Docs](https://platform.claude.com/docs/en/api/rate-limits)
328
+ - [Summarization with Claude](https://platform.claude.com/cookbook/capabilities-summarization-guide)
329
+
330
+ ### Anthropic Announcements
331
+ - [Introducing Claude 4](https://www.anthropic.com/news/claude-4)
332
+ - [Introducing Claude Haiku 4.5](https://www.anthropic.com/news/claude-haiku-4-5)
333
+
334
+ ### Third-Party Analysis & Guides
335
+ - [Anthropic Claude API Pricing 2026: Complete Cost Breakdown | MetaCTO](https://www.metacto.com/blogs/anthropic-api-pricing-a-full-breakdown-of-costs-and-integration)
336
+ - [The Guide to Claude Opus 4 & 4.5 API Pricing in 2026 - CometAPI](https://www.cometapi.com/the-guide-to-claude-opus-4--4-5-api-pricing-in-2026/)
337
+ - [Claude API Pricing Guide 2026 | AI Free API](https://www.aifreeapi.com/en/posts/claude-api-pricing-per-million-tokens)
338
+ - [Claude API Quota Tiers and Limits Explained 2026 | AI Free API](https://www.aifreeapi.com/en/posts/claude-api-quota-tiers-limits)
339
+ - [Anthropic API Pricing: The 2026 Guide | nOps](https://www.nops.io/blog/anthropic-api-pricing/)
340
+ - [Claude Pricing in 2026 for Individuals, Organizations, and Developers | Finout](https://www.finout.io/blog/claude-pricing-in-2026-for-individuals-organizations-and-developers)
341
+
342
+ ### Model Comparisons
343
+ - [Which Claude Model Is Best for Coding: Opus vs Sonnet vs Haiku | Data Annotation](https://www.dataannotation.tech/developers/which-claude-model-is-best-for-coding)
344
+ - [Sonnet 4.5 vs Haiku 4.5 vs Opus 4.1 — Which Claude Model Actually Works Best | Medium](https://medium.com/@ayaanhaider.dev/sonnet-4-5-vs-haiku-4-5-vs-opus-4-1-which-claude-model-actually-works-best-in-real-projects-7183c0dc2249)
345
+ - [Claude Haiku 4.5 vs Sonnet 4.5: Detailed Comparison 2025 | Creole Studios](https://www.creolestudios.com/claude-haiku-4-5-vs-sonnet-4-5-comparison/)
346
+
347
+ ### Technical Insights
348
+ - [How Claude Code works - Claude Code Docs](https://code.claude.com/docs/en/how-claude-code-works)
349
+ - [First impressions of Claude Cowork | Simon Willison](https://simonwillison.net/2026/Jan/12/claude-cowork/)
350
+ - [A practical guide to the Claude code context window size | eesel](https://www.eesel.ai/blog/claude-code-context-window-size)
351
+
352
+ ---
353
+
354
+ ## Conclusion
355
+
356
+ For code summarization and analysis in 2026, **Claude Sonnet 4.5** offers the optimal balance of:
357
+ - **Performance:** Best coding model with superior codebase understanding
358
+ - **Cost:** $3/$15 per million tokens (67% cheaper than previous generation)
359
+ - **Context:** Up to 1M token window for large codebases (beta)
360
+ - **Features:** Prompt caching and Batch API for cost optimization
361
+
362
+ **Recommended Architecture:**
363
+ - Primary: Claude Sonnet 4.5 with prompt caching for comprehensive analysis
364
+ - Secondary: Claude Haiku 4.5 with Batch API for high-volume, simpler tasks
365
+ - Advance to Tier 4 for 1M context window access on large codebases
366
+
367
+ This hybrid approach can achieve **80-90% cost savings** while maintaining high-quality results for code understanding and summarization tasks.