mdcontext 0.0.1 → 0.2.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.changeset/README.md +28 -0
- package/.changeset/config.json +11 -0
- package/.claude/settings.local.json +25 -0
- package/.github/workflows/ci.yml +83 -0
- package/.github/workflows/claude-code-review.yml +44 -0
- package/.github/workflows/claude.yml +85 -0
- package/.github/workflows/release.yml +113 -0
- package/.tldrignore +112 -0
- package/BACKLOG.md +338 -0
- package/CONTRIBUTING.md +186 -0
- package/NOTES/NOTES +44 -0
- package/README.md +434 -11
- package/biome.json +36 -0
- package/cspell.config.yaml +14 -0
- package/dist/chunk-23UPXDNL.js +3044 -0
- package/dist/chunk-2W7MO2DL.js +1366 -0
- package/dist/chunk-3NUAZGMA.js +1689 -0
- package/dist/chunk-7TOWB2XB.js +366 -0
- package/dist/chunk-7XOTOADQ.js +3065 -0
- package/dist/chunk-AH2PDM2K.js +3042 -0
- package/dist/chunk-BNXWSZ63.js +3742 -0
- package/dist/chunk-BTL5DJVU.js +3222 -0
- package/dist/chunk-HDHYG7E4.js +104 -0
- package/dist/chunk-HLR4KZBP.js +3234 -0
- package/dist/chunk-IP3FRFEB.js +1045 -0
- package/dist/chunk-KHU56VDO.js +3042 -0
- package/dist/chunk-KRYIFLQR.js +88 -0
- package/dist/chunk-LBSDNLEM.js +287 -0
- package/dist/chunk-MNTQ7HCP.js +2643 -0
- package/dist/chunk-MUJELQQ6.js +1387 -0
- package/dist/chunk-MXJGMSLV.js +2199 -0
- package/dist/chunk-N6QJGC3Z.js +2636 -0
- package/dist/chunk-OBELGBPM.js +1713 -0
- package/dist/chunk-OT7R5XTA.js +3192 -0
- package/dist/chunk-P7X4RA2T.js +106 -0
- package/dist/chunk-PIDUQNC2.js +3185 -0
- package/dist/chunk-POGCDIH4.js +3187 -0
- package/dist/chunk-PSIEOQGZ.js +3043 -0
- package/dist/chunk-PVRT3IHA.js +3238 -0
- package/dist/chunk-QNN4TT23.js +1430 -0
- package/dist/chunk-RE3R45RJ.js +3042 -0
- package/dist/chunk-S7E6TFX6.js +803 -0
- package/dist/chunk-SG6GLU4U.js +1378 -0
- package/dist/chunk-SJCDV2ST.js +274 -0
- package/dist/chunk-SYE5XLF3.js +104 -0
- package/dist/chunk-T5VLYBZD.js +103 -0
- package/dist/chunk-TOQB7VWU.js +3238 -0
- package/dist/chunk-VFNMZ4ZQ.js +3228 -0
- package/dist/chunk-VVTGZNBT.js +1629 -0
- package/dist/chunk-W7Q4RFEV.js +104 -0
- package/dist/chunk-XTYYVRLO.js +3190 -0
- package/dist/chunk-Y6MDYVJD.js +3063 -0
- package/dist/cli/main.d.ts +1 -0
- package/dist/cli/main.js +5458 -0
- package/dist/index.d.ts +653 -0
- package/dist/index.js +79 -0
- package/dist/mcp/server.d.ts +1 -0
- package/dist/mcp/server.js +472 -0
- package/dist/schema-BAWSG7KY.js +22 -0
- package/dist/schema-E3QUPL26.js +20 -0
- package/dist/schema-EHL7WUT6.js +20 -0
- package/docs/019-USAGE.md +625 -0
- package/docs/020-current-implementation.md +364 -0
- package/docs/021-DOGFOODING-FINDINGS.md +175 -0
- package/docs/BACKLOG.md +80 -0
- package/docs/CONFIG.md +1123 -0
- package/docs/DESIGN.md +439 -0
- package/docs/ERRORS.md +383 -0
- package/docs/PROJECT.md +88 -0
- package/docs/ROADMAP.md +407 -0
- package/docs/summarization.md +320 -0
- package/docs/test-links.md +9 -0
- package/justfile +40 -0
- package/package.json +74 -9
- package/pnpm-workspace.yaml +5 -0
- package/research/INDEX.md +315 -0
- package/research/code-review/README.md +90 -0
- package/research/code-review/cli-error-handling-review.md +979 -0
- package/research/code-review/code-review-validation-report.md +464 -0
- package/research/code-review/main-ts-review.md +1128 -0
- package/research/config-analysis/01-current-implementation.md +470 -0
- package/research/config-analysis/02-strategy-recommendation.md +428 -0
- package/research/config-analysis/03-task-candidates.md +715 -0
- package/research/config-analysis/033-research-configuration-management.md +828 -0
- package/research/config-analysis/034-research-effect-cli-config.md +1504 -0
- package/research/config-analysis/04-consolidated-task-candidates.md +277 -0
- package/research/config-docs/SUMMARY.md +357 -0
- package/research/config-docs/TEST-RESULTS.md +776 -0
- package/research/config-docs/TODO.md +542 -0
- package/research/config-docs/analysis.md +744 -0
- package/research/config-docs/fix-validation.md +502 -0
- package/research/config-docs/help-audit.md +264 -0
- package/research/config-docs/help-system-analysis.md +890 -0
- package/research/dogfood/consolidated-tool-evaluation.md +373 -0
- package/research/dogfood/strategy-a/a-synthesis.md +184 -0
- package/research/dogfood/strategy-a/a1-docs.md +226 -0
- package/research/dogfood/strategy-a/a2-amorphic.md +156 -0
- package/research/dogfood/strategy-a/a3-llm.md +164 -0
- package/research/dogfood/strategy-b/b-synthesis.md +228 -0
- package/research/dogfood/strategy-b/b1-architecture.md +207 -0
- package/research/dogfood/strategy-b/b2-gaps.md +258 -0
- package/research/dogfood/strategy-b/b3-workflows.md +250 -0
- package/research/dogfood/strategy-c/c-synthesis.md +451 -0
- package/research/dogfood/strategy-c/c1-explorer.md +192 -0
- package/research/dogfood/strategy-c/c2-diver-memory.md +145 -0
- package/research/dogfood/strategy-c/c3-diver-control.md +148 -0
- package/research/dogfood/strategy-c/c4-diver-failure.md +151 -0
- package/research/dogfood/strategy-c/c5-diver-execution.md +221 -0
- package/research/dogfood/strategy-c/c6-diver-org.md +221 -0
- package/research/effect-cli-error-handling.md +845 -0
- package/research/effect-errors-as-values.md +943 -0
- package/research/errors-task-analysis/00-consolidated-tasks.md +207 -0
- package/research/errors-task-analysis/cli-commands-analysis.md +909 -0
- package/research/errors-task-analysis/embeddings-analysis.md +709 -0
- package/research/errors-task-analysis/index-search-analysis.md +812 -0
- package/research/frontmatter/COMMENTS-ARE-SKIPPED.md +149 -0
- package/research/frontmatter/LLM-CODE-NAVIGATION.md +276 -0
- package/research/issue-review.md +603 -0
- package/research/llm-summarization/agent-cli-tools-2026.md +1082 -0
- package/research/llm-summarization/alternative-providers-2026.md +1428 -0
- package/research/llm-summarization/anthropic-2026.md +367 -0
- package/research/llm-summarization/claude-cli-integration.md +1706 -0
- package/research/llm-summarization/cli-integration-patterns.md +3155 -0
- package/research/llm-summarization/openai-2026.md +473 -0
- package/research/llm-summarization/openai-compatible-providers-2026.md +1022 -0
- package/research/llm-summarization/opencode-cli-integration.md +1552 -0
- package/research/llm-summarization/prompt-engineering-2026.md +1426 -0
- package/research/llm-summarization/prototype-results.md +56 -0
- package/research/llm-summarization/provider-switching-patterns-2026.md +2153 -0
- package/research/llm-summarization/typescript-llm-libraries-2026.md +2436 -0
- package/research/mdcontext-error-analysis.md +521 -0
- package/research/mdcontext-pudding/00-EXECUTIVE-SUMMARY.md +282 -0
- package/research/mdcontext-pudding/01-index-embed.md +956 -0
- package/research/mdcontext-pudding/02-search-COMMANDS.md +142 -0
- package/research/mdcontext-pudding/02-search-SUMMARY.md +146 -0
- package/research/mdcontext-pudding/02-search.md +970 -0
- package/research/mdcontext-pudding/03-context.md +779 -0
- package/research/mdcontext-pudding/04-navigation-and-analytics.md +803 -0
- package/research/mdcontext-pudding/04-tree.md +704 -0
- package/research/mdcontext-pudding/05-config.md +1038 -0
- package/research/mdcontext-pudding/06-links-summary.txt +87 -0
- package/research/mdcontext-pudding/06-links.md +679 -0
- package/research/mdcontext-pudding/07-stats.md +693 -0
- package/research/mdcontext-pudding/BUG-FIX-PLAN.md +388 -0
- package/research/mdcontext-pudding/P0-BUG-VALIDATION.md +167 -0
- package/research/mdcontext-pudding/README.md +168 -0
- package/research/mdcontext-pudding/TESTING-SUMMARY.md +128 -0
- package/research/npm_publish/011-npm-workflow-research-agent2.md +792 -0
- package/research/npm_publish/012-npm-workflow-research-agent1.md +530 -0
- package/research/npm_publish/013-npm-workflow-research-agent3.md +722 -0
- package/research/npm_publish/014-npm-workflow-synthesis.md +556 -0
- package/research/npm_publish/031-npm-workflow-task-analysis.md +134 -0
- package/research/research-quality-review.md +834 -0
- package/research/semantic-search/002-research-embedding-models.md +490 -0
- package/research/semantic-search/003-research-rag-alternatives.md +523 -0
- package/research/semantic-search/004-research-vector-search.md +841 -0
- package/research/semantic-search/032-research-semantic-search.md +427 -0
- package/research/semantic-search/embedding-text-analysis.md +156 -0
- package/research/semantic-search/multi-word-failure-reproduction.md +171 -0
- package/research/semantic-search/query-processing-analysis.md +207 -0
- package/research/semantic-search/root-cause-and-solution.md +114 -0
- package/research/semantic-search/threshold-validation-report.md +69 -0
- package/research/semantic-search/vector-search-analysis.md +63 -0
- package/research/task-management-2026/00-synthesis-recommendations.md +295 -0
- package/research/task-management-2026/01-ai-workflow-tools.md +416 -0
- package/research/task-management-2026/02-agent-framework-patterns.md +476 -0
- package/research/task-management-2026/03-lightweight-file-based.md +567 -0
- package/research/task-management-2026/04-established-tools-ai-features.md +541 -0
- package/research/task-management-2026/linear/01-core-features-workflow.md +771 -0
- package/research/task-management-2026/linear/02-api-integrations.md +930 -0
- package/research/task-management-2026/linear/03-ai-features.md +368 -0
- package/research/task-management-2026/linear/04-pricing-setup.md +205 -0
- package/research/task-management-2026/linear/05-usage-patterns-best-practices.md +605 -0
- package/research/test-path-issues.md +276 -0
- package/review/ALP-76/1-error-type-design.md +962 -0
- package/review/ALP-76/2-error-handling-patterns.md +906 -0
- package/review/ALP-76/3-error-presentation.md +624 -0
- package/review/ALP-76/4-test-coverage.md +625 -0
- package/review/ALP-76/5-migration-completeness.md +440 -0
- package/review/ALP-76/6-effect-best-practices.md +755 -0
- package/scripts/apply-branch-protection.sh +47 -0
- package/scripts/branch-protection-templates.json +79 -0
- package/scripts/prototype-summarization.ts +346 -0
- package/scripts/rebuild-hnswlib.js +58 -0
- package/scripts/setup-branch-protection.sh +64 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/active-provider.json +7 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.json +541 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.meta.json +5 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/config.json +8 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/documents.json +60 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/links.json +13 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/sections.json +1197 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/configuration-management.md +99 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/distributed-systems.md +92 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/error-handling.md +78 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/failure-automation.md +55 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/job-context.md +69 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/process-orchestration.md +99 -0
- package/src/cli/argv-preprocessor.test.ts +210 -0
- package/src/cli/argv-preprocessor.ts +202 -0
- package/src/cli/cli.test.ts +627 -0
- package/src/cli/commands/backlinks.ts +54 -0
- package/src/cli/commands/config-cmd.ts +642 -0
- package/src/cli/commands/context.ts +285 -0
- package/src/cli/commands/duplicates.ts +122 -0
- package/src/cli/commands/embeddings.ts +529 -0
- package/src/cli/commands/index-cmd.ts +480 -0
- package/src/cli/commands/index.ts +16 -0
- package/src/cli/commands/links.ts +52 -0
- package/src/cli/commands/search.ts +1281 -0
- package/src/cli/commands/stats.ts +149 -0
- package/src/cli/commands/tree.ts +128 -0
- package/src/cli/config-layer.ts +176 -0
- package/src/cli/error-handler.test.ts +235 -0
- package/src/cli/error-handler.ts +655 -0
- package/src/cli/flag-schemas.ts +341 -0
- package/src/cli/help.ts +588 -0
- package/src/cli/index.ts +9 -0
- package/src/cli/main.ts +435 -0
- package/src/cli/options.ts +41 -0
- package/src/cli/shared-error-handling.ts +199 -0
- package/src/cli/typo-suggester.test.ts +105 -0
- package/src/cli/typo-suggester.ts +130 -0
- package/src/cli/utils.ts +259 -0
- package/src/config/file-provider.test.ts +320 -0
- package/src/config/file-provider.ts +273 -0
- package/src/config/index.ts +72 -0
- package/src/config/integration.test.ts +667 -0
- package/src/config/precedence.test.ts +277 -0
- package/src/config/precedence.ts +451 -0
- package/src/config/schema.test.ts +414 -0
- package/src/config/schema.ts +603 -0
- package/src/config/service.test.ts +320 -0
- package/src/config/service.ts +243 -0
- package/src/config/testing.test.ts +264 -0
- package/src/config/testing.ts +110 -0
- package/src/core/index.ts +1 -0
- package/src/core/types.ts +113 -0
- package/src/duplicates/detector.test.ts +183 -0
- package/src/duplicates/detector.ts +414 -0
- package/src/duplicates/index.ts +18 -0
- package/src/embeddings/embedding-namespace.test.ts +300 -0
- package/src/embeddings/embedding-namespace.ts +947 -0
- package/src/embeddings/heading-boost.test.ts +222 -0
- package/src/embeddings/hnsw-build-options.test.ts +198 -0
- package/src/embeddings/hyde.test.ts +272 -0
- package/src/embeddings/hyde.ts +264 -0
- package/src/embeddings/index.ts +10 -0
- package/src/embeddings/openai-provider.ts +414 -0
- package/src/embeddings/pricing.json +22 -0
- package/src/embeddings/provider-constants.ts +204 -0
- package/src/embeddings/provider-errors.test.ts +967 -0
- package/src/embeddings/provider-errors.ts +565 -0
- package/src/embeddings/provider-factory.test.ts +240 -0
- package/src/embeddings/provider-factory.ts +225 -0
- package/src/embeddings/provider-integration.test.ts +788 -0
- package/src/embeddings/query-preprocessing.test.ts +187 -0
- package/src/embeddings/semantic-search-threshold.test.ts +508 -0
- package/src/embeddings/semantic-search.ts +1270 -0
- package/src/embeddings/types.ts +359 -0
- package/src/embeddings/vector-store.ts +708 -0
- package/src/embeddings/voyage-provider.ts +313 -0
- package/src/errors/errors.test.ts +845 -0
- package/src/errors/index.ts +533 -0
- package/src/index/ignore-patterns.test.ts +354 -0
- package/src/index/ignore-patterns.ts +305 -0
- package/src/index/index.ts +4 -0
- package/src/index/indexer.ts +684 -0
- package/src/index/storage.ts +260 -0
- package/src/index/types.ts +147 -0
- package/src/index/watcher.ts +189 -0
- package/src/index.ts +30 -0
- package/src/integration/search-keyword.test.ts +678 -0
- package/src/mcp/server.ts +612 -0
- package/src/parser/index.ts +1 -0
- package/src/parser/parser.test.ts +291 -0
- package/src/parser/parser.ts +394 -0
- package/src/parser/section-filter.test.ts +277 -0
- package/src/parser/section-filter.ts +392 -0
- package/src/search/__tests__/hybrid-search.test.ts +650 -0
- package/src/search/bm25-store.ts +366 -0
- package/src/search/cross-encoder.test.ts +253 -0
- package/src/search/cross-encoder.ts +406 -0
- package/src/search/fuzzy-search.test.ts +419 -0
- package/src/search/fuzzy-search.ts +273 -0
- package/src/search/hybrid-search.ts +448 -0
- package/src/search/path-matcher.test.ts +276 -0
- package/src/search/path-matcher.ts +33 -0
- package/src/search/query-parser.test.ts +260 -0
- package/src/search/query-parser.ts +319 -0
- package/src/search/searcher.test.ts +280 -0
- package/src/search/searcher.ts +724 -0
- package/src/search/wink-bm25.d.ts +30 -0
- package/src/summarization/cli-providers/claude.ts +202 -0
- package/src/summarization/cli-providers/detection.test.ts +273 -0
- package/src/summarization/cli-providers/detection.ts +118 -0
- package/src/summarization/cli-providers/index.ts +8 -0
- package/src/summarization/cost.test.ts +139 -0
- package/src/summarization/cost.ts +102 -0
- package/src/summarization/error-handler.test.ts +127 -0
- package/src/summarization/error-handler.ts +111 -0
- package/src/summarization/index.ts +102 -0
- package/src/summarization/pipeline.test.ts +498 -0
- package/src/summarization/pipeline.ts +231 -0
- package/src/summarization/prompts.test.ts +269 -0
- package/src/summarization/prompts.ts +133 -0
- package/src/summarization/provider-factory.test.ts +396 -0
- package/src/summarization/provider-factory.ts +178 -0
- package/src/summarization/types.ts +184 -0
- package/src/summarize/budget-bugs.test.ts +620 -0
- package/src/summarize/formatters.ts +419 -0
- package/src/summarize/index.ts +20 -0
- package/src/summarize/summarizer.test.ts +275 -0
- package/src/summarize/summarizer.ts +597 -0
- package/src/summarize/verify-bugs.test.ts +238 -0
- package/src/types/huggingface-transformers.d.ts +66 -0
- package/src/utils/index.ts +1 -0
- package/src/utils/tokens.test.ts +142 -0
- package/src/utils/tokens.ts +186 -0
- package/tests/fixtures/cli/.mdcontext/active-provider.json +7 -0
- package/tests/fixtures/cli/.mdcontext/config.json +8 -0
- package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
- package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
- package/tests/fixtures/cli/.mdcontext/indexes/documents.json +33 -0
- package/tests/fixtures/cli/.mdcontext/indexes/links.json +12 -0
- package/tests/fixtures/cli/.mdcontext/indexes/sections.json +247 -0
- package/tests/fixtures/cli/README.md +9 -0
- package/tests/fixtures/cli/api-reference.md +11 -0
- package/tests/fixtures/cli/getting-started.md +11 -0
- package/tests/integration/embed-index.test.ts +712 -0
- package/tests/integration/search-context.test.ts +469 -0
- package/tests/integration/search-semantic.test.ts +522 -0
- package/tsconfig.json +26 -0
- package/vitest.config.ts +16 -0
- package/vitest.setup.ts +12 -0
|
@@ -0,0 +1,367 @@
|
|
|
1
|
+
# Anthropic Claude Models & Pricing - 2026 Research
|
|
2
|
+
|
|
3
|
+
**Last Updated:** January 26, 2026
|
|
4
|
+
**Research Focus:** Latest Claude models, pricing, context windows, and code understanding capabilities
|
|
5
|
+
|
|
6
|
+
---
|
|
7
|
+
|
|
8
|
+
## Executive Summary
|
|
9
|
+
|
|
10
|
+
As of 2026, Anthropic's **Claude 4.5 series** represents the state-of-the-art in AI language models, offering a **67% cost reduction** over previous generations while delivering superior performance. The flagship **Claude Sonnet 4.5** is marketed as "the best coding model in the world" and is the recommended starting point for most use cases.
|
|
11
|
+
|
|
12
|
+
**Key Highlights:**
|
|
13
|
+
- Claude 4.5 series launched late 2025, offering three tiers: Haiku, Sonnet, and Opus
|
|
14
|
+
- Up to 1M token context window (beta) for Sonnet 4 and 4.5
|
|
15
|
+
- Significant cost optimization features: Prompt Caching (90% savings), Batch API (50% discount)
|
|
16
|
+
- Advanced code understanding and summarization capabilities
|
|
17
|
+
|
|
18
|
+
---
|
|
19
|
+
|
|
20
|
+
## Model Lineup & Pricing
|
|
21
|
+
|
|
22
|
+
### Claude 4.5 Series (Current Generation - 2026)
|
|
23
|
+
|
|
24
|
+
The Claude 4.5 series offers the best balance of performance and cost in Anthropic's lineup:
|
|
25
|
+
|
|
26
|
+
| Model | Input Price | Output Price | Use Case | Speed |
|
|
27
|
+
|-------|-------------|--------------|----------|-------|
|
|
28
|
+
| **Haiku 4.5** | $1/M tokens | $5/M tokens | High-volume, speed-critical tasks | Fastest |
|
|
29
|
+
| **Sonnet 4.5** | $3/M tokens | $15/M tokens | Balanced performance & cost | Balanced |
|
|
30
|
+
| **Opus 4.5** | $5/M tokens | $25/M tokens | Most capable flagship model | Most Capable |
|
|
31
|
+
|
|
32
|
+
**Recommended Starting Point:** Claude Sonnet 4.5 - offers the strongest coding and agent performance in the Claude lineup.
|
|
33
|
+
|
|
34
|
+
### Legacy Models (Claude 4/4.1 Series)
|
|
35
|
+
|
|
36
|
+
Legacy models remain available but at significantly higher cost:
|
|
37
|
+
|
|
38
|
+
| Model | Input Price | Output Price | Notes |
|
|
39
|
+
|-------|-------------|--------------|-------|
|
|
40
|
+
| **Opus 4/4.1** | $15/M tokens | $75/M tokens | Most expensive; 67% more costly than Opus 4.5 |
|
|
41
|
+
|
|
42
|
+
**Migration Recommendation:** Migrate to Claude 4.5 series for better performance at lower cost.
|
|
43
|
+
|
|
44
|
+
---
|
|
45
|
+
|
|
46
|
+
## Context Window Sizes
|
|
47
|
+
|
|
48
|
+
Claude models in 2026 offer industry-leading context windows:
|
|
49
|
+
|
|
50
|
+
### Standard Context Window
|
|
51
|
+
- **200,000 tokens** (default for all models)
|
|
52
|
+
- Approximately 150,000 words or 500+ pages of material
|
|
53
|
+
- Suitable for most use cases
|
|
54
|
+
|
|
55
|
+
### Extended Context Window (Beta)
|
|
56
|
+
- **1,000,000 tokens** available for Sonnet 4 and 4.5
|
|
57
|
+
- Requires `context-1m-2025-08-07` beta header
|
|
58
|
+
- Access limited to organizations in **Usage Tier 4** or with custom rate limits
|
|
59
|
+
- Long context pricing applies to requests exceeding 200K tokens
|
|
60
|
+
|
|
61
|
+
### Enterprise Plan Access
|
|
62
|
+
- 500K context window for Claude Sonnet 4.5 on Enterprise plans
|
|
63
|
+
- Available for chatting with Claude in the web interface
|
|
64
|
+
|
|
65
|
+
**Note:** The 1M token context window is currently in beta and requires advancing to Usage Tier 4.
|
|
66
|
+
|
|
67
|
+
---
|
|
68
|
+
|
|
69
|
+
## Cost Optimization Features
|
|
70
|
+
|
|
71
|
+
### 1. Prompt Caching (90% Savings)
|
|
72
|
+
|
|
73
|
+
Prompt caching achieves **90% savings** on repeated content after just 2 requests. This is particularly valuable for:
|
|
74
|
+
- Repeated codebase context
|
|
75
|
+
- Standard system prompts
|
|
76
|
+
- Recurring documentation or knowledge bases
|
|
77
|
+
|
|
78
|
+
**Implementation:** Automatically applied to repeated context in API requests.
|
|
79
|
+
|
|
80
|
+
### 2. Batch API (50% Discount)
|
|
81
|
+
|
|
82
|
+
The Batch API allows asynchronous processing of large volumes of requests with a **50% discount** on both input and output tokens.
|
|
83
|
+
|
|
84
|
+
**Best For:**
|
|
85
|
+
- Large-scale code analysis
|
|
86
|
+
- Batch summarization tasks
|
|
87
|
+
- Non-time-sensitive processing
|
|
88
|
+
|
|
89
|
+
### 3. Extended Thinking
|
|
90
|
+
|
|
91
|
+
Extended thinking tokens are billed as output tokens, not as a separate pricing tier. When you enable extended thinking with a token budget (minimum 1,024 tokens), any tokens the model uses for internal reasoning are charged at the standard output rate for that model.
|
|
92
|
+
|
|
93
|
+
**Use Case:** Complex reasoning tasks requiring step-by-step thinking.
|
|
94
|
+
|
|
95
|
+
---
|
|
96
|
+
|
|
97
|
+
## Code Understanding Capabilities
|
|
98
|
+
|
|
99
|
+
### Claude Sonnet 4.5 - Best for Code Analysis
|
|
100
|
+
|
|
101
|
+
Claude Sonnet 4.5 is Anthropic's **strongest coding model** with exceptional capabilities:
|
|
102
|
+
|
|
103
|
+
**Code Understanding:**
|
|
104
|
+
- Substantially improved problem-solving and codebase navigation
|
|
105
|
+
- Navigation errors reduced from 20% to near zero
|
|
106
|
+
- State-of-the-art for complex codebase understanding
|
|
107
|
+
- Improved precision for complex changes across multiple files
|
|
108
|
+
|
|
109
|
+
**Autonomous Development:**
|
|
110
|
+
- Excels at autonomous multi-feature app development
|
|
111
|
+
- Three-phase workflow: gather context → take action → verify results
|
|
112
|
+
- Tools for searching files, editing code, and running tests
|
|
113
|
+
|
|
114
|
+
**Context & Navigation:**
|
|
115
|
+
- Can understand and navigate large codebases effectively
|
|
116
|
+
- 1M token context window enables processing entire repositories
|
|
117
|
+
- Superior performance with longer code blocks
|
|
118
|
+
|
|
119
|
+
### Claude Haiku 4.5 - Speed & Efficiency
|
|
120
|
+
|
|
121
|
+
Haiku 4.5 achieves **90% of Sonnet 4.5's performance** in agentic coding evaluation at a fraction of the cost:
|
|
122
|
+
|
|
123
|
+
**Best For:**
|
|
124
|
+
- Code review
|
|
125
|
+
- Documentation generation
|
|
126
|
+
- Linting and test generation
|
|
127
|
+
- Sub-agent tasks
|
|
128
|
+
- Simple frontend/backend tasks
|
|
129
|
+
|
|
130
|
+
**Limitations:**
|
|
131
|
+
- Tends to hallucinate when generating code exceeding 150 lines
|
|
132
|
+
- Better for well-defined tasks with clear solution spaces
|
|
133
|
+
|
|
134
|
+
### Summarization Capabilities
|
|
135
|
+
|
|
136
|
+
Claude excels at summarization tasks:
|
|
137
|
+
|
|
138
|
+
- **Natural Language Processing:** Condensing large amounts of text while retaining key information
|
|
139
|
+
- **Legal Documents:** Particularly emphasized for legal document summarization
|
|
140
|
+
- **Code Summarization:** Opus 4.5 can read all relevant context and determine what details are relevant for specific contexts
|
|
141
|
+
- **Lossy Compression:** Explore agent returns summaries with some information loss, but maintains contextual relevance
|
|
142
|
+
|
|
143
|
+
**Note:** WebFetch and Cowork summarization includes prompt injection protection layers.
|
|
144
|
+
|
|
145
|
+
---
|
|
146
|
+
|
|
147
|
+
## API Features & Rate Limits
|
|
148
|
+
|
|
149
|
+
### Usage Tiers
|
|
150
|
+
|
|
151
|
+
Anthropic uses a tier-based system where organizations automatically advance as they reach spending thresholds:
|
|
152
|
+
|
|
153
|
+
| Tier | Deposit Requirement | Monthly Spend Limit | Requests/Min | Special Access |
|
|
154
|
+
|------|---------------------|---------------------|--------------|----------------|
|
|
155
|
+
| **Tier 1** | $5 | $100 | 50 RPM | - |
|
|
156
|
+
| **Tier 2** | - | - | Higher | - |
|
|
157
|
+
| **Tier 3** | - | - | Higher | - |
|
|
158
|
+
| **Tier 4** | $400+ | Higher | Highest | 1M token context window |
|
|
159
|
+
|
|
160
|
+
**Tier Advancement:** Organizations advance immediately upon reaching the cumulative credit purchase threshold (excluding tax).
|
|
161
|
+
|
|
162
|
+
### Rate Limit Structure
|
|
163
|
+
|
|
164
|
+
Rate limits are measured across three dimensions:
|
|
165
|
+
1. **RPM** - Requests per minute
|
|
166
|
+
2. **ITPM** - Input tokens per minute
|
|
167
|
+
3. **OTPM** - Output tokens per minute
|
|
168
|
+
|
|
169
|
+
**Algorithm:** Token bucket algorithm - capacity continuously replenishes up to maximum limit rather than resetting at fixed intervals.
|
|
170
|
+
|
|
171
|
+
**Organization-Level:** Limits are set at the organization level and can be viewed in the Claude Console Limits page.
|
|
172
|
+
|
|
173
|
+
### Spend Limits
|
|
174
|
+
|
|
175
|
+
- Each tier has a maximum monthly API spend limit
|
|
176
|
+
- Once reached, API access is paused until the next calendar month or tier advancement
|
|
177
|
+
- Custom limits available through sales contact
|
|
178
|
+
|
|
179
|
+
### Priority Tier Access
|
|
180
|
+
|
|
181
|
+
For higher custom limits or Priority Tier access:
|
|
182
|
+
- Contact sales through Claude Console
|
|
183
|
+
- Enhanced service levels available
|
|
184
|
+
- Custom rate limit arrangements
|
|
185
|
+
|
|
186
|
+
---
|
|
187
|
+
|
|
188
|
+
## Model Selection Recommendations
|
|
189
|
+
|
|
190
|
+
### For Code Summarization & Analysis
|
|
191
|
+
|
|
192
|
+
**Primary Choice: Claude Sonnet 4.5**
|
|
193
|
+
- Best overall coding model
|
|
194
|
+
- Superior accuracy and understanding
|
|
195
|
+
- 1M token context window (beta)
|
|
196
|
+
- Handles complex, multi-file analysis
|
|
197
|
+
- **Cost:** $3 input / $15 output per million tokens
|
|
198
|
+
|
|
199
|
+
**Secondary Choice: Claude Haiku 4.5**
|
|
200
|
+
- 90% of Sonnet performance at lower cost
|
|
201
|
+
- Excellent for simpler tasks
|
|
202
|
+
- Fast iteration and testing
|
|
203
|
+
- **Cost:** $1 input / $5 output per million tokens
|
|
204
|
+
|
|
205
|
+
**Hybrid Strategy:**
|
|
206
|
+
- Use Sonnet 4.5 for in-depth analysis and complex reasoning
|
|
207
|
+
- Use Haiku 4.5 for quick reviews, documentation, and sub-tasks
|
|
208
|
+
- Optimize costs while maintaining quality
|
|
209
|
+
|
|
210
|
+
### For Different Use Cases
|
|
211
|
+
|
|
212
|
+
| Use Case | Recommended Model | Rationale |
|
|
213
|
+
|----------|-------------------|-----------|
|
|
214
|
+
| **Large codebase analysis** | Sonnet 4.5 | 1M context window, superior navigation |
|
|
215
|
+
| **Quick code reviews** | Haiku 4.5 | Speed and cost efficiency |
|
|
216
|
+
| **Complex multi-file changes** | Sonnet 4.5 | Improved precision for complexity |
|
|
217
|
+
| **Documentation generation** | Haiku 4.5 | Well-defined task, cost-effective |
|
|
218
|
+
| **Autonomous development** | Sonnet 4.5 | Best problem-solving capabilities |
|
|
219
|
+
| **Batch summarization** | Haiku 4.5 + Batch API | 50% discount, efficient processing |
|
|
220
|
+
| **Legal/technical summarization** | Opus 4.5 | Highest capability, contextual understanding |
|
|
221
|
+
|
|
222
|
+
---
|
|
223
|
+
|
|
224
|
+
## Cost Comparison Examples
|
|
225
|
+
|
|
226
|
+
### Code Summarization Task Example
|
|
227
|
+
|
|
228
|
+
**Scenario:** Summarize a 50K token codebase, output 5K token summary
|
|
229
|
+
|
|
230
|
+
| Model | Input Cost | Output Cost | Total Cost |
|
|
231
|
+
|-------|------------|-------------|------------|
|
|
232
|
+
| Haiku 4.5 | $0.05 | $0.025 | **$0.075** |
|
|
233
|
+
| Sonnet 4.5 | $0.15 | $0.075 | **$0.225** |
|
|
234
|
+
| Opus 4.5 | $0.25 | $0.125 | **$0.375** |
|
|
235
|
+
| Opus 4.1 (legacy) | $0.75 | $0.375 | **$1.125** |
|
|
236
|
+
|
|
237
|
+
**With Prompt Caching (after 2nd request):**
|
|
238
|
+
- Haiku 4.5: ~$0.012 per request
|
|
239
|
+
- Sonnet 4.5: ~$0.038 per request
|
|
240
|
+
|
|
241
|
+
**With Batch API (50% discount):**
|
|
242
|
+
- Haiku 4.5: $0.0375 per request
|
|
243
|
+
- Sonnet 4.5: $0.1125 per request
|
|
244
|
+
|
|
245
|
+
### Large-Scale Analysis Example
|
|
246
|
+
|
|
247
|
+
**Scenario:** Process 100 repositories, 100K tokens each, 10K token summaries
|
|
248
|
+
|
|
249
|
+
**Without Optimization:**
|
|
250
|
+
- Haiku 4.5: $75
|
|
251
|
+
- Sonnet 4.5: $225
|
|
252
|
+
- Opus 4.5: $375
|
|
253
|
+
|
|
254
|
+
**With Batch API + Prompt Caching:**
|
|
255
|
+
- Haiku 4.5: ~$12-15
|
|
256
|
+
- Sonnet 4.5: ~$38-45
|
|
257
|
+
|
|
258
|
+
**Cost Savings:** Up to 80-90% with optimization features
|
|
259
|
+
|
|
260
|
+
---
|
|
261
|
+
|
|
262
|
+
## Implementation Recommendations
|
|
263
|
+
|
|
264
|
+
### Getting Started
|
|
265
|
+
|
|
266
|
+
1. **Start with Sonnet 4.5** for initial development and testing
|
|
267
|
+
2. **Implement Prompt Caching** for repeated context (codebase analysis)
|
|
268
|
+
3. **Use Batch API** for non-time-sensitive large-scale processing
|
|
269
|
+
4. **Monitor usage** through Claude Console Limits page
|
|
270
|
+
|
|
271
|
+
### Scaling Strategy
|
|
272
|
+
|
|
273
|
+
1. **Tier Advancement:**
|
|
274
|
+
- Start with Tier 1 ($5 deposit)
|
|
275
|
+
- Advance to Tier 4 ($400+ cumulative) for 1M context window access
|
|
276
|
+
|
|
277
|
+
2. **Hybrid Model Approach:**
|
|
278
|
+
- Sonnet 4.5 for complex analysis and initial summarization
|
|
279
|
+
- Haiku 4.5 for follow-up tasks, reviews, and sub-agent work
|
|
280
|
+
|
|
281
|
+
3. **Cost Optimization:**
|
|
282
|
+
- Enable prompt caching for repeated codebase context
|
|
283
|
+
- Use Batch API for bulk processing (50% discount)
|
|
284
|
+
- Monitor token usage and optimize prompt design
|
|
285
|
+
|
|
286
|
+
### Best Practices
|
|
287
|
+
|
|
288
|
+
1. **Context Window Usage:**
|
|
289
|
+
- Use 200K standard window for most tasks
|
|
290
|
+
- Request 1M window beta access for very large codebases
|
|
291
|
+
- Be mindful of long context pricing (>200K tokens)
|
|
292
|
+
|
|
293
|
+
2. **Error Reduction:**
|
|
294
|
+
- Sonnet 4.5's near-zero navigation errors reduce debugging costs
|
|
295
|
+
- Better first-time accuracy means fewer API calls
|
|
296
|
+
|
|
297
|
+
3. **Quality vs. Cost:**
|
|
298
|
+
- Use Sonnet 4.5 for quality-critical work
|
|
299
|
+
- Use Haiku 4.5 for high-volume, well-defined tasks
|
|
300
|
+
- Leverage Opus 4.5 only when highest capability is essential
|
|
301
|
+
|
|
302
|
+
---
|
|
303
|
+
|
|
304
|
+
## Future Considerations
|
|
305
|
+
|
|
306
|
+
### Trends to Watch
|
|
307
|
+
|
|
308
|
+
1. **Model Evolution:** Anthropic is continuously improving models - expect further cost reductions and capability improvements
|
|
309
|
+
2. **Context Window Expansion:** 1M token window currently in beta; likely to become standard
|
|
310
|
+
3. **Pricing Changes:** Historical trend shows cost reductions with new generations
|
|
311
|
+
4. **API Features:** New optimization features (beyond caching and batch) may emerge
|
|
312
|
+
|
|
313
|
+
### Migration Path
|
|
314
|
+
|
|
315
|
+
- **From Claude 3.5:** Immediate migration to Claude 4.5 recommended
|
|
316
|
+
- **From Claude 4/4.1:** Migrate to Claude 4.5 for 67% cost savings
|
|
317
|
+
- **From Other Providers:** Sonnet 4.5's coding capabilities and context window offer competitive advantages
|
|
318
|
+
|
|
319
|
+
---
|
|
320
|
+
|
|
321
|
+
## Sources & References
|
|
322
|
+
|
|
323
|
+
### Official Documentation
|
|
324
|
+
- [Pricing - Claude API Docs](https://platform.claude.com/docs/en/about-claude/pricing)
|
|
325
|
+
- [Context Windows - Claude API Docs](https://platform.claude.com/docs/en/build-with-claude/context-windows)
|
|
326
|
+
- [Models Overview - Claude API Docs](https://platform.claude.com/docs/en/about-claude/models/overview)
|
|
327
|
+
- [Rate Limits - Claude API Docs](https://platform.claude.com/docs/en/api/rate-limits)
|
|
328
|
+
- [Summarization with Claude](https://platform.claude.com/cookbook/capabilities-summarization-guide)
|
|
329
|
+
|
|
330
|
+
### Anthropic Announcements
|
|
331
|
+
- [Introducing Claude 4](https://www.anthropic.com/news/claude-4)
|
|
332
|
+
- [Introducing Claude Haiku 4.5](https://www.anthropic.com/news/claude-haiku-4-5)
|
|
333
|
+
|
|
334
|
+
### Third-Party Analysis & Guides
|
|
335
|
+
- [Anthropic Claude API Pricing 2026: Complete Cost Breakdown | MetaCTO](https://www.metacto.com/blogs/anthropic-api-pricing-a-full-breakdown-of-costs-and-integration)
|
|
336
|
+
- [The Guide to Claude Opus 4 & 4.5 API Pricing in 2026 - CometAPI](https://www.cometapi.com/the-guide-to-claude-opus-4--4-5-api-pricing-in-2026/)
|
|
337
|
+
- [Claude API Pricing Guide 2026 | AI Free API](https://www.aifreeapi.com/en/posts/claude-api-pricing-per-million-tokens)
|
|
338
|
+
- [Claude API Quota Tiers and Limits Explained 2026 | AI Free API](https://www.aifreeapi.com/en/posts/claude-api-quota-tiers-limits)
|
|
339
|
+
- [Anthropic API Pricing: The 2026 Guide | nOps](https://www.nops.io/blog/anthropic-api-pricing/)
|
|
340
|
+
- [Claude Pricing in 2026 for Individuals, Organizations, and Developers | Finout](https://www.finout.io/blog/claude-pricing-in-2026-for-individuals-organizations-and-developers)
|
|
341
|
+
|
|
342
|
+
### Model Comparisons
|
|
343
|
+
- [Which Claude Model Is Best for Coding: Opus vs Sonnet vs Haiku | Data Annotation](https://www.dataannotation.tech/developers/which-claude-model-is-best-for-coding)
|
|
344
|
+
- [Sonnet 4.5 vs Haiku 4.5 vs Opus 4.1 — Which Claude Model Actually Works Best | Medium](https://medium.com/@ayaanhaider.dev/sonnet-4-5-vs-haiku-4-5-vs-opus-4-1-which-claude-model-actually-works-best-in-real-projects-7183c0dc2249)
|
|
345
|
+
- [Claude Haiku 4.5 vs Sonnet 4.5: Detailed Comparison 2025 | Creole Studios](https://www.creolestudios.com/claude-haiku-4-5-vs-sonnet-4-5-comparison/)
|
|
346
|
+
|
|
347
|
+
### Technical Insights
|
|
348
|
+
- [How Claude Code works - Claude Code Docs](https://code.claude.com/docs/en/how-claude-code-works)
|
|
349
|
+
- [First impressions of Claude Cowork | Simon Willison](https://simonwillison.net/2026/Jan/12/claude-cowork/)
|
|
350
|
+
- [A practical guide to the Claude code context window size | eesel](https://www.eesel.ai/blog/claude-code-context-window-size)
|
|
351
|
+
|
|
352
|
+
---
|
|
353
|
+
|
|
354
|
+
## Conclusion
|
|
355
|
+
|
|
356
|
+
For code summarization and analysis in 2026, **Claude Sonnet 4.5** offers the optimal balance of:
|
|
357
|
+
- **Performance:** Best coding model with superior codebase understanding
|
|
358
|
+
- **Cost:** $3/$15 per million tokens (67% cheaper than previous generation)
|
|
359
|
+
- **Context:** Up to 1M token window for large codebases (beta)
|
|
360
|
+
- **Features:** Prompt caching and Batch API for cost optimization
|
|
361
|
+
|
|
362
|
+
**Recommended Architecture:**
|
|
363
|
+
- Primary: Claude Sonnet 4.5 with prompt caching for comprehensive analysis
|
|
364
|
+
- Secondary: Claude Haiku 4.5 with Batch API for high-volume, simpler tasks
|
|
365
|
+
- Advance to Tier 4 for 1M context window access on large codebases
|
|
366
|
+
|
|
367
|
+
This hybrid approach can achieve **80-90% cost savings** while maintaining high-quality results for code understanding and summarization tasks.
|