npm - mdcontext - Versions diffs - 0.1.0 → 0.2.0 - Mend

mdcontext 0.1.0 → 0.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (251) hide show

package/.changeset/config.json +9 -9
package/.claude/settings.local.json +25 -0
package/.github/workflows/claude-code-review.yml +44 -0
package/.github/workflows/claude.yml +85 -0
package/CONTRIBUTING.md +186 -0
package/NOTES/NOTES +44 -0
package/README.md +206 -3
package/biome.json +1 -1
package/dist/chunk-23UPXDNL.js +3044 -0
package/dist/chunk-2W7MO2DL.js +1366 -0
package/dist/chunk-3NUAZGMA.js +1689 -0
package/dist/chunk-7TOWB2XB.js +366 -0
package/dist/chunk-7XOTOADQ.js +3065 -0
package/dist/chunk-AH2PDM2K.js +3042 -0
package/dist/chunk-BNXWSZ63.js +3742 -0
package/dist/chunk-BTL5DJVU.js +3222 -0
package/dist/chunk-HDHYG7E4.js +104 -0
package/dist/chunk-HLR4KZBP.js +3234 -0
package/dist/chunk-IP3FRFEB.js +1045 -0
package/dist/chunk-KHU56VDO.js +3042 -0
package/dist/chunk-KRYIFLQR.js +85 -89
package/dist/chunk-LBSDNLEM.js +287 -0
package/dist/chunk-MNTQ7HCP.js +2643 -0
package/dist/chunk-MUJELQQ6.js +1387 -0
package/dist/chunk-MXJGMSLV.js +2199 -0
package/dist/chunk-N6QJGC3Z.js +2636 -0
package/dist/chunk-OBELGBPM.js +1713 -0
package/dist/chunk-OT7R5XTA.js +3192 -0
package/dist/chunk-P7X4RA2T.js +106 -0
package/dist/chunk-PIDUQNC2.js +3185 -0
package/dist/chunk-POGCDIH4.js +3187 -0
package/dist/chunk-PSIEOQGZ.js +3043 -0
package/dist/chunk-PVRT3IHA.js +3238 -0
package/dist/chunk-QNN4TT23.js +1430 -0
package/dist/chunk-RE3R45RJ.js +3042 -0
package/dist/chunk-S7E6TFX6.js +718 -657
package/dist/chunk-SG6GLU4U.js +1378 -0
package/dist/chunk-SJCDV2ST.js +274 -0
package/dist/chunk-SYE5XLF3.js +104 -0
package/dist/chunk-T5VLYBZD.js +103 -0
package/dist/chunk-TOQB7VWU.js +3238 -0
package/dist/chunk-VFNMZ4ZQ.js +3228 -0
package/dist/chunk-VVTGZNBT.js +1533 -1423
package/dist/chunk-W7Q4RFEV.js +104 -0
package/dist/chunk-XTYYVRLO.js +3190 -0
package/dist/chunk-Y6MDYVJD.js +3063 -0
package/dist/cli/main.js +4072 -629
package/dist/index.d.ts +420 -33
package/dist/index.js +8 -15
package/dist/mcp/server.js +103 -7
package/dist/schema-BAWSG7KY.js +22 -0
package/dist/schema-E3QUPL26.js +20 -0
package/dist/schema-EHL7WUT6.js +20 -0
package/docs/019-USAGE.md +44 -5
package/docs/020-current-implementation.md +8 -8
package/docs/021-DOGFOODING-FINDINGS.md +1 -1
package/docs/CONFIG.md +1123 -0
package/docs/ERRORS.md +383 -0
package/docs/summarization.md +320 -0
package/justfile +40 -0
package/package.json +39 -33
package/research/INDEX.md +315 -0
package/research/code-review/README.md +90 -0
package/research/code-review/cli-error-handling-review.md +979 -0
package/research/code-review/code-review-validation-report.md +464 -0
package/research/code-review/main-ts-review.md +1128 -0
package/research/config-docs/SUMMARY.md +357 -0
package/research/config-docs/TEST-RESULTS.md +776 -0
package/research/config-docs/TODO.md +542 -0
package/research/config-docs/analysis.md +744 -0
package/research/config-docs/fix-validation.md +502 -0
package/research/config-docs/help-audit.md +264 -0
package/research/config-docs/help-system-analysis.md +890 -0
package/research/frontmatter/COMMENTS-ARE-SKIPPED.md +149 -0
package/research/frontmatter/LLM-CODE-NAVIGATION.md +276 -0
package/research/issue-review.md +603 -0
package/research/llm-summarization/agent-cli-tools-2026.md +1082 -0
package/research/llm-summarization/alternative-providers-2026.md +1428 -0
package/research/llm-summarization/anthropic-2026.md +367 -0
package/research/llm-summarization/claude-cli-integration.md +1706 -0
package/research/llm-summarization/cli-integration-patterns.md +3155 -0
package/research/llm-summarization/openai-2026.md +473 -0
package/research/llm-summarization/openai-compatible-providers-2026.md +1022 -0
package/research/llm-summarization/opencode-cli-integration.md +1552 -0
package/research/llm-summarization/prompt-engineering-2026.md +1426 -0
package/research/llm-summarization/prototype-results.md +56 -0
package/research/llm-summarization/provider-switching-patterns-2026.md +2153 -0
package/research/llm-summarization/typescript-llm-libraries-2026.md +2436 -0
package/research/mdcontext-pudding/00-EXECUTIVE-SUMMARY.md +282 -0
package/research/mdcontext-pudding/01-index-embed.md +956 -0
package/research/mdcontext-pudding/02-search-COMMANDS.md +142 -0
package/research/mdcontext-pudding/02-search-SUMMARY.md +146 -0
package/research/mdcontext-pudding/02-search.md +970 -0
package/research/mdcontext-pudding/03-context.md +779 -0
package/research/mdcontext-pudding/04-navigation-and-analytics.md +803 -0
package/research/mdcontext-pudding/04-tree.md +704 -0
package/research/mdcontext-pudding/05-config.md +1038 -0
package/research/mdcontext-pudding/06-links-summary.txt +87 -0
package/research/mdcontext-pudding/06-links.md +679 -0
package/research/mdcontext-pudding/07-stats.md +693 -0
package/research/mdcontext-pudding/BUG-FIX-PLAN.md +388 -0
package/research/mdcontext-pudding/P0-BUG-VALIDATION.md +167 -0
package/research/mdcontext-pudding/README.md +168 -0
package/research/mdcontext-pudding/TESTING-SUMMARY.md +128 -0
package/research/research-quality-review.md +834 -0
package/research/semantic-search/embedding-text-analysis.md +156 -0
package/research/semantic-search/multi-word-failure-reproduction.md +171 -0
package/research/semantic-search/query-processing-analysis.md +207 -0
package/research/semantic-search/root-cause-and-solution.md +114 -0
package/research/semantic-search/threshold-validation-report.md +69 -0
package/research/semantic-search/vector-search-analysis.md +63 -0
package/research/test-path-issues.md +276 -0
package/review/ALP-76/1-error-type-design.md +962 -0
package/review/ALP-76/2-error-handling-patterns.md +906 -0
package/review/ALP-76/3-error-presentation.md +624 -0
package/review/ALP-76/4-test-coverage.md +625 -0
package/review/ALP-76/5-migration-completeness.md +440 -0
package/review/ALP-76/6-effect-best-practices.md +755 -0
package/scripts/apply-branch-protection.sh +47 -0
package/scripts/branch-protection-templates.json +79 -0
package/scripts/prototype-summarization.ts +346 -0
package/scripts/rebuild-hnswlib.js +32 -37
package/scripts/setup-branch-protection.sh +64 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/active-provider.json +7 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.json +541 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.meta.json +5 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/config.json +8 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/documents.json +60 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/links.json +13 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/sections.json +1197 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/configuration-management.md +99 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/distributed-systems.md +92 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/error-handling.md +78 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/failure-automation.md +55 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/job-context.md +69 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/process-orchestration.md +99 -0
package/src/cli/argv-preprocessor.test.ts +2 -2
package/src/cli/cli.test.ts +230 -33
package/src/cli/commands/config-cmd.ts +642 -0
package/src/cli/commands/context.ts +97 -9
package/src/cli/commands/duplicates.ts +122 -0
package/src/cli/commands/embeddings.ts +529 -0
package/src/cli/commands/index-cmd.ts +210 -30
package/src/cli/commands/index.ts +3 -0
package/src/cli/commands/search.ts +894 -64
package/src/cli/commands/stats.ts +3 -0
package/src/cli/commands/tree.ts +26 -5
package/src/cli/config-layer.ts +176 -0
package/src/cli/error-handler.test.ts +235 -0
package/src/cli/error-handler.ts +655 -0
package/src/cli/flag-schemas.ts +66 -0
package/src/cli/help.ts +209 -7
package/src/cli/main.ts +348 -58
package/src/cli/options.ts +10 -0
package/src/cli/shared-error-handling.ts +199 -0
package/src/cli/utils.ts +150 -17
package/src/config/file-provider.test.ts +320 -0
package/src/config/file-provider.ts +273 -0
package/src/config/index.ts +72 -0
package/src/config/integration.test.ts +667 -0
package/src/config/precedence.test.ts +277 -0
package/src/config/precedence.ts +451 -0
package/src/config/schema.test.ts +414 -0
package/src/config/schema.ts +603 -0
package/src/config/service.test.ts +320 -0
package/src/config/service.ts +243 -0
package/src/config/testing.test.ts +264 -0
package/src/config/testing.ts +110 -0
package/src/core/types.ts +6 -33
package/src/duplicates/detector.test.ts +183 -0
package/src/duplicates/detector.ts +414 -0
package/src/duplicates/index.ts +18 -0
package/src/embeddings/embedding-namespace.test.ts +300 -0
package/src/embeddings/embedding-namespace.ts +947 -0
package/src/embeddings/heading-boost.test.ts +222 -0
package/src/embeddings/hnsw-build-options.test.ts +198 -0
package/src/embeddings/hyde.test.ts +272 -0
package/src/embeddings/hyde.ts +264 -0
package/src/embeddings/index.ts +2 -0
package/src/embeddings/openai-provider.ts +332 -83
package/src/embeddings/pricing.json +22 -0
package/src/embeddings/provider-constants.ts +204 -0
package/src/embeddings/provider-errors.test.ts +967 -0
package/src/embeddings/provider-errors.ts +565 -0
package/src/embeddings/provider-factory.test.ts +240 -0
package/src/embeddings/provider-factory.ts +225 -0
package/src/embeddings/provider-integration.test.ts +788 -0
package/src/embeddings/query-preprocessing.test.ts +187 -0
package/src/embeddings/semantic-search-threshold.test.ts +508 -0
package/src/embeddings/semantic-search.ts +780 -93
package/src/embeddings/types.ts +293 -16
package/src/embeddings/vector-store.ts +486 -77
package/src/embeddings/voyage-provider.ts +313 -0
package/src/errors/errors.test.ts +845 -0
package/src/errors/index.ts +533 -0
package/src/index/ignore-patterns.test.ts +354 -0
package/src/index/ignore-patterns.ts +305 -0
package/src/index/indexer.ts +286 -48
package/src/index/storage.ts +94 -30
package/src/index/types.ts +40 -2
package/src/index/watcher.ts +67 -9
package/src/index.ts +22 -0
package/src/integration/search-keyword.test.ts +678 -0
package/src/mcp/server.ts +135 -6
package/src/parser/parser.ts +18 -19
package/src/parser/section-filter.test.ts +277 -0
package/src/parser/section-filter.ts +125 -3
package/src/search/__tests__/hybrid-search.test.ts +650 -0
package/src/search/bm25-store.ts +366 -0
package/src/search/cross-encoder.test.ts +253 -0
package/src/search/cross-encoder.ts +406 -0
package/src/search/fuzzy-search.test.ts +419 -0
package/src/search/fuzzy-search.ts +273 -0
package/src/search/hybrid-search.ts +448 -0
package/src/search/path-matcher.test.ts +276 -0
package/src/search/path-matcher.ts +33 -0
package/src/search/searcher.test.ts +99 -1
package/src/search/searcher.ts +189 -67
package/src/search/wink-bm25.d.ts +30 -0
package/src/summarization/cli-providers/claude.ts +202 -0
package/src/summarization/cli-providers/detection.test.ts +273 -0
package/src/summarization/cli-providers/detection.ts +118 -0
package/src/summarization/cli-providers/index.ts +8 -0
package/src/summarization/cost.test.ts +139 -0
package/src/summarization/cost.ts +102 -0
package/src/summarization/error-handler.test.ts +127 -0
package/src/summarization/error-handler.ts +111 -0
package/src/summarization/index.ts +102 -0
package/src/summarization/pipeline.test.ts +498 -0
package/src/summarization/pipeline.ts +231 -0
package/src/summarization/prompts.test.ts +269 -0
package/src/summarization/prompts.ts +133 -0
package/src/summarization/provider-factory.test.ts +396 -0
package/src/summarization/provider-factory.ts +178 -0
package/src/summarization/types.ts +184 -0
package/src/summarize/summarizer.ts +104 -35
package/src/types/huggingface-transformers.d.ts +66 -0
package/tests/fixtures/cli/.mdcontext/active-provider.json +7 -0
package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
package/tests/fixtures/cli/.mdcontext/indexes/documents.json +4 -4
package/tests/fixtures/cli/.mdcontext/indexes/sections.json +14 -0
package/tests/integration/embed-index.test.ts +712 -0
package/tests/integration/search-context.test.ts +469 -0
package/tests/integration/search-semantic.test.ts +522 -0
package/vitest.config.ts +1 -6
package/AGENTS.md +0 -46
package/tests/fixtures/cli/.mdcontext/vectors.bin +0 -0
package/tests/fixtures/cli/.mdcontext/vectors.meta.json +0 -1264

package/docs/ERRORS.md ADDED Viewed

@@ -0,0 +1,383 @@
+# Error Handling Patterns
+This document describes the error handling conventions used in mdcontext, following Effect's "errors as values" philosophy.
+## Error Type Taxonomy
+All domain errors are defined in `src/errors/index.ts` using Effect's `Data.TaggedError`:
+```typescript
+export class FileReadError extends Data.TaggedError('FileReadError')<{
+  readonly path: string
+  readonly message: string
+  readonly cause?: unknown
+}> {
+  get code() { return ErrorCode.FILE_READ }
+}
+```
+### Error Categories
+- **File System**: `FileReadError`, `FileWriteError`, `DirectoryCreateError`, `DirectoryWalkError`
+- **Parsing**: `ParseError` (for markdown parsing failures)
+- **API**: `ApiKeyMissingError`, `ApiKeyInvalidError`
+- **Embeddings**: `EmbeddingError` (rate limits, quota, network failures)
+- **Index**: `IndexNotFoundError`, `IndexCorruptedError`, `IndexBuildError`
+- **Search**: `DocumentNotFoundError`, `EmbeddingsNotFoundError`
+- **Config**: `ConfigError`
+- **Vector Store**: `VectorStoreError`
+- **Watch**: `WatchError`
+- **CLI**: `CliValidationError`
+## Error Codes
+Each error type has a unique error code for programmatic handling. Error codes are stable identifiers that don't change when messages are updated.
+### Code Format
+Codes follow the pattern `E{category}{number}`:
+| Category | Code Range | Description |
+|----------|------------|-------------|
+| File System | E1xx | File and directory operations |
+| Parse | E2xx | Markdown parsing errors |
+| API | E3xx | API authentication and embedding errors |
+| Index | E4xx | Index operations |
+| Search | E5xx | Search operations |
+| Vector Store | E6xx | Vector store operations |
+| Config | E7xx | Configuration errors |
+| Watch | E8xx | File watcher errors |
+| CLI | E9xx | CLI validation errors |
+### Error Code Reference
+| Code | Error Type | Description |
+|------|------------|-------------|
+| E100 | FileReadError | Cannot read file |
+| E101 | FileWriteError | Cannot write file |
+| E102 | DirectoryCreateError | Cannot create directory |
+| E103 | DirectoryWalkError | Cannot traverse directory |
+| E200 | ParseError | Markdown parsing failed |
+| E300 | ApiKeyMissingError | API key not set in environment |
+| E301 | ApiKeyInvalidError | API key rejected by provider |
+| E310 | EmbeddingError (RateLimit) | Rate limit exceeded |
+| E311 | EmbeddingError (QuotaExceeded) | API quota exceeded |
+| E312 | EmbeddingError (Network) | Network error during embedding |
+| E313 | EmbeddingError (ModelError) | Model error |
+| E319 | EmbeddingError (Unknown) | Unknown embedding error |
+| E400 | IndexNotFoundError | Index does not exist |
+| E401 | IndexCorruptedError | Index is corrupted |
+| E402 | IndexBuildError | Failed to build index |
+| E500 | DocumentNotFoundError | Document not in index |
+| E501 | EmbeddingsNotFoundError | Embeddings not found |
+| E600 | VectorStoreError | Vector store operation failed |
+| E700 | ConfigError | Configuration error |
+| E800 | WatchError | File watcher error |
+| E900 | CliValidationError | Invalid CLI arguments |
+### Exit Codes
+CLI exit codes map to error categories:
+| Exit Code | Category | Description |
+|-----------|----------|-------------|
+| 0 | Success | Operation completed successfully |
+| 1 | User Error | Invalid arguments, missing config, etc. |
+| 2 | System Error | File system, network, etc. |
+| 3 | API Error | Authentication, rate limits, etc. |
+### Usage in Scripts
+Error codes enable reliable scripting and CI/CD integration:
+```bash
+# Check for specific error codes in output
+mdcontext search "query" 2>&1 | grep -q "\[E400\]" && echo "Index not found"
+# Use exit codes for control flow
+mdcontext index || {
+  case $? in
+    1) echo "User error - check arguments" ;;
+    2) echo "System error - check permissions" ;;
+    3) echo "API error - check credentials" ;;
+  esac
+}
+```
+### Programmatic Access
+```typescript
+import { FileReadError, ErrorCode } from './errors/index.js'
+const error = new FileReadError({ path: '/file.md', message: 'ENOENT' })
+console.log(error.code) // 'E100'
+console.log(error._tag) // 'FileReadError'
+```
+## Transformation Patterns
+### 1. `mapError` - Transform Error Types
+Use `mapError` to convert low-level errors to domain errors. This preserves error specificity while adapting the error type.
+```typescript
+// GOOD - Maps to domain error with context
+parse(content, options).pipe(
+  Effect.mapError((e) =>
+    new ParseError({
+      message: e.message,
+      path: filePath,
+      cause: e,
+    })
+  )
+)
+// BAD - Loses type information
+Effect.mapError((e) => new Error(`${e._tag}: ${e.message}`))
+```
+**When to use:**
+- Converting library errors to domain errors
+- Adding context (path, operation) to errors
+- Translating between error domains
+### 2. `catchTag` / `catchTags` - Handle Specific Errors
+Use `catchTag` when you need to handle a specific known error type. This enables exhaustive error handling and type-safe recovery.
+```typescript
+// Handle specific error with recovery
+estimateEmbeddingCost(dir).pipe(
+  Effect.catchTag('IndexNotFoundError', () =>
+    Effect.succeed(null)  // Index doesn't exist, return null estimate
+  )
+)
+// Handle multiple specific errors
+buildEmbeddings(dir).pipe(
+  Effect.catchTags({
+    ApiKeyMissingError: (e) => {
+      console.error(e.message)
+      return Effect.succeed(null)
+    },
+    ApiKeyInvalidError: (e) => {
+      console.error(e.message)
+      return Effect.succeed(null)
+    },
+  })
+)
+```
+**When to use:**
+- Recovering from expected error conditions
+- Providing fallback values for specific failures
+- Implementing retry logic for transient errors
+- Filtering/handling known error types mid-pipeline
+### 3. `catchAll` - Boundary Error Handling
+Use `catchAll` **only at system boundaries** where all errors must be converted to a final format (user message, JSON response, etc.).
+```typescript
+// GOOD - At CLI boundary (main.ts)
+program.pipe(
+  Effect.catchAll((error) => {
+    console.error(formatError(error))
+    return Effect.succeed(ExitCode.failure)
+  })
+)
+// GOOD - At MCP boundary (server.ts)
+// MCP protocol requires JSON responses for all operations
+handler.pipe(
+  Effect.catchAll((e) =>
+    Effect.succeed({
+      isError: true,
+      content: [{ type: 'text', text: `Error: ${e.message}` }]
+    })
+  )
+)
+// BAD - In middle of pipeline (loses type information)
+readFile(path).pipe(
+  Effect.catchAll(() => Effect.succeed(null))  // Silent failure!
+)
+```
+**When to use:**
+- CLI entry points (converting to exit codes)
+- MCP/API handlers (converting to protocol responses)
+- Top-level program error handling
+**When NOT to use:**
+- Middle of pipelines (use `catchTag` instead)
+- When error type information is needed downstream
+- For silent failures without logging
+### 4. `Effect.tryPromise` / `Effect.try` - Lift External Operations
+Use these to wrap promise-based or synchronous operations, converting thrown errors to Effect failures.
+```typescript
+// For promises
+Effect.tryPromise({
+  try: () => fs.readFile(path, 'utf-8'),
+  catch: (e) => new FileReadError({
+    path,
+    message: e instanceof Error ? e.message : String(e),
+    cause: e,
+  })
+})
+// For synchronous code that may throw
+Effect.try({
+  try: () => JSON.parse(content),
+  catch: (e) => new IndexCorruptedError({
+    path,
+    reason: 'InvalidJson',
+    details: e instanceof Error ? e.message : undefined,
+  })
+})
+```
+## Best Practices
+### Do's
+- **Always use domain errors** - Never map to generic `Error`
+- **Preserve cause chains** - Include `cause` field for debugging
+- **Add context** - Include path, operation, and relevant metadata
+- **Document error types** - Use JSDoc to specify thrown errors
+- **Log at boundaries** - When swallowing errors, log for debugging
+### Don'ts
+- **Don't swallow errors silently** - Always log or handle explicitly
+- **Don't use `catchAll` mid-pipeline** - Use `catchTag` instead
+- **Don't mix paradigms** - Avoid try/catch inside Effect.gen
+- **Don't map to generic Error** - Always use typed domain errors
+### Batch Processing Pattern
+When processing multiple items where individual failures shouldn't stop the batch:
+```typescript
+const processFile = Effect.gen(function* () {
+  // ... processing logic
+}).pipe(
+  // Note: catchAll intentional for batch processing
+  // Individual file failures collected in errors array
+  // rather than stopping the entire operation
+  Effect.catchAll((error) => {
+    errors.push({ path, message: error.message })
+    return Effect.void
+  })
+)
+```
+Always add a comment explaining why `catchAll` is appropriate.
+### Graceful Degradation Pattern
+When a feature is optional and failure shouldn't block the main operation:
+```typescript
+// Optional embedding cost estimate for user prompt
+const estimate = yield* estimateEmbeddingCost(dir).pipe(
+  Effect.catchTag('IndexNotFoundError', () => Effect.succeed(null)),
+  // Note: catchAll for graceful degradation
+  // This is optional information - failure shouldn't block indexing
+  Effect.catchAll((e) => {
+    Effect.runSync(Effect.logWarning(`Could not estimate: ${e.message}`))
+    return Effect.succeed(null)
+  })
+)
+```
+## Summarization Error Handling
+AI summarization uses a separate error system with graceful degradation - errors never prevent search results from being displayed.
+### Summarization Error Codes
+| Code | Error Type | Description |
+|------|------------|-------------|
+| PROVIDER_NOT_FOUND | Provider name unknown | Check provider spelling |
+| PROVIDER_NOT_AVAILABLE | CLI tool not installed | Install the CLI tool |
+| CLI_EXECUTION_FAILED | CLI process error | Check CLI authentication |
+| API_REQUEST_FAILED | API call failed | Check API key and network |
+| RATE_LIMITED | Too many requests | Wait and retry |
+| INVALID_RESPONSE | Bad provider response | Report as bug |
+| TIMEOUT | Request timed out | Reduce result set |
+| NO_API_KEY | Missing API key | Set environment variable |
+### Troubleshooting Summarization
+**"CLI tool 'claude' not found"**
+```bash
+# Install Claude Code
+# Visit: https://claude.ai/download
+```
+**"CLI tool 'opencode' not found"**
+```bash
+# Install OpenCode
+npm install -g @opencode/cli
+# Or: https://github.com/opencode-ai/opencode
+```
+**"Authentication failed for anthropic"**
+```bash
+export ANTHROPIC_API_KEY=sk-ant-...
+```
+**"Rate limit exceeded"**
+- Wait 60 seconds and retry
+- Consider switching to CLI provider (free with subscription)
+**"Summarization failed: timeout"**
+- Reduce results: `mdcontext search "query" --limit 5 --summarize`
+- The default timeout is 60 seconds
+**"No summarization providers available"**
+Either:
+1. Install a CLI tool: `claude`, `opencode`, or `gh copilot`
+2. Configure an API provider with a valid API key
+### Graceful Degradation
+Summarization errors never crash the CLI. When summarization fails:
+1. Error message is displayed
+2. Search results are shown normally
+3. Exit code remains 0 (success)
+```typescript
+// Implementation pattern in search.ts
+const runSummarization = (options: SummarizationOptions): Effect.Effect<void, never> =>
+  runSummarizationUnsafe(options).pipe(
+    Effect.catchAll((error) =>
+      Effect.sync(() => {
+        displaySummarizationError(error)
+        // Search results still displayed
+      }),
+    ),
+  )
+```
+## Error Formatting
+Error formatting (user-friendly messages) should only happen at the CLI boundary in `src/cli/error-handler.ts`. Internal errors carry structured data; presentation is separate from logic.
+```typescript
+// src/cli/error-handler.ts
+const formatError = (error: MdContextError): string => {
+  switch (error._tag) {
+    case 'FileReadError':
+      return `Cannot read file: ${error.path}\n${error.message}`
+    case 'ApiKeyMissingError':
+      return `API key not configured.\n\nSet ${error.envVar} environment variable.`
+    // ... other error types
+  }
+}
+```

package/docs/summarization.md ADDED Viewed

@@ -0,0 +1,320 @@
+# AI Summarization Architecture
+This document covers the architecture and implementation details of mdcontext's AI-powered search result summarization feature.
+## Overview
+mdcontext can generate AI-powered summaries of search results using either:
+1. **CLI tools** (Claude Code, Copilot CLI, OpenCode) - Free with your subscription
+2. **API providers** (DeepSeek, Anthropic, OpenAI, Gemini) - Pay per query
+The design prioritizes CLI providers as the primary option since they leverage existing subscriptions that developers already have.
+## Architecture
+```
+┌─────────────────────────────────────────────────────────────────┐
+│                        CLI (search.ts)                          │
+│  --summarize flag triggers summarization pipeline               │
+└─────────────────────────┬───────────────────────────────────────┘
+                          │
+                          ▼
+┌─────────────────────────────────────────────────────────────────┐
+│                   Provider Factory                               │
+│  getBestAvailableSummarizer() / createSummarizer()              │
+│  - Detects installed CLI tools                                   │
+│  - Creates appropriate provider instance                         │
+└─────────────────────────┬───────────────────────────────────────┘
+                          │
+          ┌───────────────┴───────────────┐
+          ▼                               ▼
+┌─────────────────────┐       ┌─────────────────────┐
+│   CLI Providers     │       │   API Providers     │
+│   (Free)            │       │   (Pay-per-use)     │
+│                     │       │                     │
+│ - ClaudeCLI         │       │ - DeepSeek          │
+│ - OpenCode          │       │ - Anthropic         │
+│ - Copilot           │       │ - OpenAI            │
+│ - Aider             │       │ - Gemini            │
+│ - Cline             │       │ - Qwen              │
+└─────────────────────┘       └─────────────────────┘
+          │                               │
+          └───────────────┬───────────────┘
+                          ▼
+┌─────────────────────────────────────────────────────────────────┐
+│                    Summarizer Interface                          │
+│  summarize(input, prompt) → SummaryResult                       │
+│  summarizeStream(input, prompt, options) → void                 │
+│  estimateCost(inputTokens) → number                             │
+│  isAvailable() → boolean                                         │
+└─────────────────────────────────────────────────────────────────┘
+```
+## Components
+### Provider Detection (`cli-providers/detection.ts`)
+Automatically discovers installed CLI tools:
+```typescript
+import { detectInstalledCLIs } from './summarization/index.js'
+const installed = await detectInstalledCLIs()
+// [{ name: 'claude', command: 'claude', displayName: 'Claude Code', ... }]
+```
+Detection uses `which` (Unix) or `where` (Windows) via `spawn()` - never shell interpolation.
+### Provider Factory (`provider-factory.ts`)
+Creates summarizer instances based on configuration:
+```typescript
+import { createSummarizer, getBestAvailableSummarizer } from './summarization/index.js'
+// Auto-detect best available provider
+const result = await getBestAvailableSummarizer()
+if (result) {
+  const { summarizer, config } = result
+  // Use summarizer...
+}
+// Or create from explicit config
+const summarizer = await createSummarizer({
+  mode: 'cli',
+  provider: 'claude',
+})
+```
+### Cost Estimation (`cost.ts`)
+Estimates costs before execution:
+```typescript
+import { estimateSummaryCost, formatCostDisplay } from './summarization/index.js'
+const estimate = estimateSummaryCost(inputText, 'api', 'deepseek')
+// {
+//   inputTokens: 2500,
+//   outputTokens: 500,
+//   estimatedCost: 0.0007,
+//   provider: 'deepseek',
+//   isPaid: true,
+//   formattedCost: '$0.0007'
+// }
+console.log(formatCostDisplay(estimate))
+// "Estimated cost: $0.0007"
+```
+CLI providers always return `isPaid: false` with `formattedCost: 'FREE (subscription)'`.
+### Prompt Templates (`prompts.ts`)
+Pre-built prompts for different summarization styles:
+| Template | Description |
+|----------|-------------|
+| `default` | Balanced summary with key findings |
+| `concise` | 2-3 sentence quick summary |
+| `detailed` | Comprehensive analysis |
+| `actionable` | Focus on next steps |
+| `technical` | Code patterns and API details |
+```typescript
+import { buildPrompt } from './summarization/index.js'
+const prompt = buildPrompt({
+  query: 'authentication',
+  resultCount: 10,
+  searchMode: 'hybrid',
+}, 'actionable')
+```
+### Error Handling (`error-handler.ts`)
+Graceful degradation on failures:
+```typescript
+import { displaySummarizationError, isRecoverableError } from './summarization/index.js'
+try {
+  await summarizer.summarize(input, prompt)
+} catch (error) {
+  if (isRecoverableError(error)) {
+    // Retry logic
+  } else {
+    displaySummarizationError(error)
+    // Shows user-friendly message, search results still displayed
+  }
+}
+```
+## Security Considerations
+### Shell Injection Prevention
+All CLI invocations use `spawn()` with argument arrays - **NEVER** `exec()` with string interpolation:
+```typescript
+// CORRECT - Safe from shell injection
+spawn('claude', ['-p', userInput, '--output-format', 'text'])
+// WRONG - Vulnerable to shell injection
+exec(`claude -p "${userInput}"`)  // NEVER DO THIS
+```
+This is enforced throughout the codebase. User input is passed as array elements, never interpolated into shell commands.
+### API Key Handling
+- API keys are sourced from environment variables only
+- Never stored in config files
+- Environment variable names follow provider conventions:
+  - `DEEPSEEK_API_KEY`
+  - `ANTHROPIC_API_KEY`
+  - `OPENAI_API_KEY`
+  - `GOOGLE_API_KEY` (for Gemini)
+  - `QWEN_API_KEY`
+### Timeout Protection
+CLI processes have a default 60-second timeout to prevent hung processes.
+## Adding New Providers
+### CLI Provider
+1. Add to `KNOWN_CLIS` in `cli-providers/detection.ts`:
+```typescript
+{
+  name: 'newcli',
+  command: 'newcli',
+  displayName: 'New CLI Tool',
+  args: ['--prompt'],
+  useStdin: false,
+}
+```
+2. Create implementation in `cli-providers/newcli.ts`:
+```typescript
+import { spawn } from 'node:child_process'
+import type { Summarizer, SummaryResult } from '../types.js'
+export class NewCLISummarizer implements Summarizer {
+  async summarize(input: string, prompt: string): Promise<SummaryResult> {
+    // SECURITY: Always use spawn() with argument arrays
+    const proc = spawn('newcli', ['--prompt', prompt, input])
+    // ... implementation
+  }
+  async isAvailable(): Promise<boolean> {
+    // Check if CLI is installed
+  }
+}
+```
+3. Add to factory in `provider-factory.ts`
+### API Provider
+1. Add pricing to `cost.ts`:
+```typescript
+export const API_PRICING = {
+  // ... existing providers
+  newapi: { input: 0.50, output: 1.00, displayName: 'New API' },
+}
+```
+2. Create implementation using Vercel AI SDK (when implemented):
+```typescript
+import { createOpenAI } from '@ai-sdk/openai'
+export class NewAPISummarizer implements Summarizer {
+  // Use Vercel AI SDK for OpenAI-compatible APIs
+}
+```
+## Performance
+| Provider Type | Latency | Cost |
+|--------------|---------|------|
+| CLI (Claude) | 2-5s | Free |
+| CLI (OpenCode) | 2-5s | Free |
+| API (DeepSeek) | 1-3s | ~$0.0007/query |
+| API (OpenAI) | 1-2s | ~$0.005/query |
+### Token Limits
+- Input is automatically truncated at 100K characters (~25K tokens)
+- Result content is truncated to 500 chars per result
+- Output tokens capped at 500 for cost estimates
+## Configuration Reference
+### Config File
+```javascript
+// mdcontext.config.js
+/** @type {import('mdcontext').PartialMdContextConfig} */
+export default {
+  aiSummarization: {
+    mode: 'cli',           // 'cli' or 'api'
+    provider: 'claude',    // Provider name
+    model: 'deepseek-chat', // Model for API providers
+    stream: false,         // Enable streaming
+  },
+}
+```
+### Environment Variables
+| Variable | Description |
+|----------|-------------|
+| `MDCONTEXT_AISUMMARIZATION_MODE` | 'cli' or 'api' |
+| `MDCONTEXT_AISUMMARIZATION_PROVIDER` | Provider name |
+| `MDCONTEXT_AISUMMARIZATION_MODEL` | Model name (API only) |
+| `MDCONTEXT_AISUMMARIZATION_STREAM` | 'true' or 'false' |
+## Troubleshooting
+### "CLI tool 'claude' not found"
+**Solution:** Install Claude Code from https://claude.ai/download
+### "CLI tool 'opencode' not found"
+**Solution:** Install OpenCode from https://github.com/opencode-ai/opencode
+### "Authentication failed for anthropic"
+**Solution:** Set API key: `export ANTHROPIC_API_KEY=sk-...`
+### "Rate limit exceeded"
+**Solution:** Wait and retry. Consider switching to CLI provider (free).
+### "Summarization failed: timeout"
+**Solution:** Reduce result set with `--limit` or increase timeout in config.
+### "No summarization providers available"
+**Solution:** Either:
+1. Install a CLI tool (Claude Code, OpenCode)
+2. Configure an API provider with valid API key
+### OpenCode JSON format errors
+**Solution:** OpenCode JSON format is undocumented. Try updating OpenCode or switch to Claude CLI.
+## Related Documentation
+- [README.md](../README.md#ai-summarization) - Quick start guide
+- [CONFIG.md](./CONFIG.md) - Full configuration reference
+- [ERRORS.md](./ERRORS.md) - Error handling patterns