npm - mdcontext - Versions diffs - 0.1.0 → 0.2.0 - Mend

mdcontext 0.1.0 → 0.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (251) hide show

package/.changeset/config.json +9 -9
package/.claude/settings.local.json +25 -0
package/.github/workflows/claude-code-review.yml +44 -0
package/.github/workflows/claude.yml +85 -0
package/CONTRIBUTING.md +186 -0
package/NOTES/NOTES +44 -0
package/README.md +206 -3
package/biome.json +1 -1
package/dist/chunk-23UPXDNL.js +3044 -0
package/dist/chunk-2W7MO2DL.js +1366 -0
package/dist/chunk-3NUAZGMA.js +1689 -0
package/dist/chunk-7TOWB2XB.js +366 -0
package/dist/chunk-7XOTOADQ.js +3065 -0
package/dist/chunk-AH2PDM2K.js +3042 -0
package/dist/chunk-BNXWSZ63.js +3742 -0
package/dist/chunk-BTL5DJVU.js +3222 -0
package/dist/chunk-HDHYG7E4.js +104 -0
package/dist/chunk-HLR4KZBP.js +3234 -0
package/dist/chunk-IP3FRFEB.js +1045 -0
package/dist/chunk-KHU56VDO.js +3042 -0
package/dist/chunk-KRYIFLQR.js +85 -89
package/dist/chunk-LBSDNLEM.js +287 -0
package/dist/chunk-MNTQ7HCP.js +2643 -0
package/dist/chunk-MUJELQQ6.js +1387 -0
package/dist/chunk-MXJGMSLV.js +2199 -0
package/dist/chunk-N6QJGC3Z.js +2636 -0
package/dist/chunk-OBELGBPM.js +1713 -0
package/dist/chunk-OT7R5XTA.js +3192 -0
package/dist/chunk-P7X4RA2T.js +106 -0
package/dist/chunk-PIDUQNC2.js +3185 -0
package/dist/chunk-POGCDIH4.js +3187 -0
package/dist/chunk-PSIEOQGZ.js +3043 -0
package/dist/chunk-PVRT3IHA.js +3238 -0
package/dist/chunk-QNN4TT23.js +1430 -0
package/dist/chunk-RE3R45RJ.js +3042 -0
package/dist/chunk-S7E6TFX6.js +718 -657
package/dist/chunk-SG6GLU4U.js +1378 -0
package/dist/chunk-SJCDV2ST.js +274 -0
package/dist/chunk-SYE5XLF3.js +104 -0
package/dist/chunk-T5VLYBZD.js +103 -0
package/dist/chunk-TOQB7VWU.js +3238 -0
package/dist/chunk-VFNMZ4ZQ.js +3228 -0
package/dist/chunk-VVTGZNBT.js +1533 -1423
package/dist/chunk-W7Q4RFEV.js +104 -0
package/dist/chunk-XTYYVRLO.js +3190 -0
package/dist/chunk-Y6MDYVJD.js +3063 -0
package/dist/cli/main.js +4072 -629
package/dist/index.d.ts +420 -33
package/dist/index.js +8 -15
package/dist/mcp/server.js +103 -7
package/dist/schema-BAWSG7KY.js +22 -0
package/dist/schema-E3QUPL26.js +20 -0
package/dist/schema-EHL7WUT6.js +20 -0
package/docs/019-USAGE.md +44 -5
package/docs/020-current-implementation.md +8 -8
package/docs/021-DOGFOODING-FINDINGS.md +1 -1
package/docs/CONFIG.md +1123 -0
package/docs/ERRORS.md +383 -0
package/docs/summarization.md +320 -0
package/justfile +40 -0
package/package.json +39 -33
package/research/INDEX.md +315 -0
package/research/code-review/README.md +90 -0
package/research/code-review/cli-error-handling-review.md +979 -0
package/research/code-review/code-review-validation-report.md +464 -0
package/research/code-review/main-ts-review.md +1128 -0
package/research/config-docs/SUMMARY.md +357 -0
package/research/config-docs/TEST-RESULTS.md +776 -0
package/research/config-docs/TODO.md +542 -0
package/research/config-docs/analysis.md +744 -0
package/research/config-docs/fix-validation.md +502 -0
package/research/config-docs/help-audit.md +264 -0
package/research/config-docs/help-system-analysis.md +890 -0
package/research/frontmatter/COMMENTS-ARE-SKIPPED.md +149 -0
package/research/frontmatter/LLM-CODE-NAVIGATION.md +276 -0
package/research/issue-review.md +603 -0
package/research/llm-summarization/agent-cli-tools-2026.md +1082 -0
package/research/llm-summarization/alternative-providers-2026.md +1428 -0
package/research/llm-summarization/anthropic-2026.md +367 -0
package/research/llm-summarization/claude-cli-integration.md +1706 -0
package/research/llm-summarization/cli-integration-patterns.md +3155 -0
package/research/llm-summarization/openai-2026.md +473 -0
package/research/llm-summarization/openai-compatible-providers-2026.md +1022 -0
package/research/llm-summarization/opencode-cli-integration.md +1552 -0
package/research/llm-summarization/prompt-engineering-2026.md +1426 -0
package/research/llm-summarization/prototype-results.md +56 -0
package/research/llm-summarization/provider-switching-patterns-2026.md +2153 -0
package/research/llm-summarization/typescript-llm-libraries-2026.md +2436 -0
package/research/mdcontext-pudding/00-EXECUTIVE-SUMMARY.md +282 -0
package/research/mdcontext-pudding/01-index-embed.md +956 -0
package/research/mdcontext-pudding/02-search-COMMANDS.md +142 -0
package/research/mdcontext-pudding/02-search-SUMMARY.md +146 -0
package/research/mdcontext-pudding/02-search.md +970 -0
package/research/mdcontext-pudding/03-context.md +779 -0
package/research/mdcontext-pudding/04-navigation-and-analytics.md +803 -0
package/research/mdcontext-pudding/04-tree.md +704 -0
package/research/mdcontext-pudding/05-config.md +1038 -0
package/research/mdcontext-pudding/06-links-summary.txt +87 -0
package/research/mdcontext-pudding/06-links.md +679 -0
package/research/mdcontext-pudding/07-stats.md +693 -0
package/research/mdcontext-pudding/BUG-FIX-PLAN.md +388 -0
package/research/mdcontext-pudding/P0-BUG-VALIDATION.md +167 -0
package/research/mdcontext-pudding/README.md +168 -0
package/research/mdcontext-pudding/TESTING-SUMMARY.md +128 -0
package/research/research-quality-review.md +834 -0
package/research/semantic-search/embedding-text-analysis.md +156 -0
package/research/semantic-search/multi-word-failure-reproduction.md +171 -0
package/research/semantic-search/query-processing-analysis.md +207 -0
package/research/semantic-search/root-cause-and-solution.md +114 -0
package/research/semantic-search/threshold-validation-report.md +69 -0
package/research/semantic-search/vector-search-analysis.md +63 -0
package/research/test-path-issues.md +276 -0
package/review/ALP-76/1-error-type-design.md +962 -0
package/review/ALP-76/2-error-handling-patterns.md +906 -0
package/review/ALP-76/3-error-presentation.md +624 -0
package/review/ALP-76/4-test-coverage.md +625 -0
package/review/ALP-76/5-migration-completeness.md +440 -0
package/review/ALP-76/6-effect-best-practices.md +755 -0
package/scripts/apply-branch-protection.sh +47 -0
package/scripts/branch-protection-templates.json +79 -0
package/scripts/prototype-summarization.ts +346 -0
package/scripts/rebuild-hnswlib.js +32 -37
package/scripts/setup-branch-protection.sh +64 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/active-provider.json +7 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.json +541 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.meta.json +5 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/config.json +8 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/documents.json +60 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/links.json +13 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/sections.json +1197 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/configuration-management.md +99 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/distributed-systems.md +92 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/error-handling.md +78 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/failure-automation.md +55 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/job-context.md +69 -0
package/src/__tests__/fixtures/semantic-search/multi-word-corpus/process-orchestration.md +99 -0
package/src/cli/argv-preprocessor.test.ts +2 -2
package/src/cli/cli.test.ts +230 -33
package/src/cli/commands/config-cmd.ts +642 -0
package/src/cli/commands/context.ts +97 -9
package/src/cli/commands/duplicates.ts +122 -0
package/src/cli/commands/embeddings.ts +529 -0
package/src/cli/commands/index-cmd.ts +210 -30
package/src/cli/commands/index.ts +3 -0
package/src/cli/commands/search.ts +894 -64
package/src/cli/commands/stats.ts +3 -0
package/src/cli/commands/tree.ts +26 -5
package/src/cli/config-layer.ts +176 -0
package/src/cli/error-handler.test.ts +235 -0
package/src/cli/error-handler.ts +655 -0
package/src/cli/flag-schemas.ts +66 -0
package/src/cli/help.ts +209 -7
package/src/cli/main.ts +348 -58
package/src/cli/options.ts +10 -0
package/src/cli/shared-error-handling.ts +199 -0
package/src/cli/utils.ts +150 -17
package/src/config/file-provider.test.ts +320 -0
package/src/config/file-provider.ts +273 -0
package/src/config/index.ts +72 -0
package/src/config/integration.test.ts +667 -0
package/src/config/precedence.test.ts +277 -0
package/src/config/precedence.ts +451 -0
package/src/config/schema.test.ts +414 -0
package/src/config/schema.ts +603 -0
package/src/config/service.test.ts +320 -0
package/src/config/service.ts +243 -0
package/src/config/testing.test.ts +264 -0
package/src/config/testing.ts +110 -0
package/src/core/types.ts +6 -33
package/src/duplicates/detector.test.ts +183 -0
package/src/duplicates/detector.ts +414 -0
package/src/duplicates/index.ts +18 -0
package/src/embeddings/embedding-namespace.test.ts +300 -0
package/src/embeddings/embedding-namespace.ts +947 -0
package/src/embeddings/heading-boost.test.ts +222 -0
package/src/embeddings/hnsw-build-options.test.ts +198 -0
package/src/embeddings/hyde.test.ts +272 -0
package/src/embeddings/hyde.ts +264 -0
package/src/embeddings/index.ts +2 -0
package/src/embeddings/openai-provider.ts +332 -83
package/src/embeddings/pricing.json +22 -0
package/src/embeddings/provider-constants.ts +204 -0
package/src/embeddings/provider-errors.test.ts +967 -0
package/src/embeddings/provider-errors.ts +565 -0
package/src/embeddings/provider-factory.test.ts +240 -0
package/src/embeddings/provider-factory.ts +225 -0
package/src/embeddings/provider-integration.test.ts +788 -0
package/src/embeddings/query-preprocessing.test.ts +187 -0
package/src/embeddings/semantic-search-threshold.test.ts +508 -0
package/src/embeddings/semantic-search.ts +780 -93
package/src/embeddings/types.ts +293 -16
package/src/embeddings/vector-store.ts +486 -77
package/src/embeddings/voyage-provider.ts +313 -0
package/src/errors/errors.test.ts +845 -0
package/src/errors/index.ts +533 -0
package/src/index/ignore-patterns.test.ts +354 -0
package/src/index/ignore-patterns.ts +305 -0
package/src/index/indexer.ts +286 -48
package/src/index/storage.ts +94 -30
package/src/index/types.ts +40 -2
package/src/index/watcher.ts +67 -9
package/src/index.ts +22 -0
package/src/integration/search-keyword.test.ts +678 -0
package/src/mcp/server.ts +135 -6
package/src/parser/parser.ts +18 -19
package/src/parser/section-filter.test.ts +277 -0
package/src/parser/section-filter.ts +125 -3
package/src/search/__tests__/hybrid-search.test.ts +650 -0
package/src/search/bm25-store.ts +366 -0
package/src/search/cross-encoder.test.ts +253 -0
package/src/search/cross-encoder.ts +406 -0
package/src/search/fuzzy-search.test.ts +419 -0
package/src/search/fuzzy-search.ts +273 -0
package/src/search/hybrid-search.ts +448 -0
package/src/search/path-matcher.test.ts +276 -0
package/src/search/path-matcher.ts +33 -0
package/src/search/searcher.test.ts +99 -1
package/src/search/searcher.ts +189 -67
package/src/search/wink-bm25.d.ts +30 -0
package/src/summarization/cli-providers/claude.ts +202 -0
package/src/summarization/cli-providers/detection.test.ts +273 -0
package/src/summarization/cli-providers/detection.ts +118 -0
package/src/summarization/cli-providers/index.ts +8 -0
package/src/summarization/cost.test.ts +139 -0
package/src/summarization/cost.ts +102 -0
package/src/summarization/error-handler.test.ts +127 -0
package/src/summarization/error-handler.ts +111 -0
package/src/summarization/index.ts +102 -0
package/src/summarization/pipeline.test.ts +498 -0
package/src/summarization/pipeline.ts +231 -0
package/src/summarization/prompts.test.ts +269 -0
package/src/summarization/prompts.ts +133 -0
package/src/summarization/provider-factory.test.ts +396 -0
package/src/summarization/provider-factory.ts +178 -0
package/src/summarization/types.ts +184 -0
package/src/summarize/summarizer.ts +104 -35
package/src/types/huggingface-transformers.d.ts +66 -0
package/tests/fixtures/cli/.mdcontext/active-provider.json +7 -0
package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
package/tests/fixtures/cli/.mdcontext/indexes/documents.json +4 -4
package/tests/fixtures/cli/.mdcontext/indexes/sections.json +14 -0
package/tests/integration/embed-index.test.ts +712 -0
package/tests/integration/search-context.test.ts +469 -0
package/tests/integration/search-semantic.test.ts +522 -0
package/vitest.config.ts +1 -6
package/AGENTS.md +0 -46
package/tests/fixtures/cli/.mdcontext/vectors.bin +0 -0
package/tests/fixtures/cli/.mdcontext/vectors.meta.json +0 -1264

package/research/mdcontext-pudding/04-tree.md ADDED Viewed

@@ -0,0 +1,704 @@
+# mdcontext tree Command - Research and Testing
+## Overview
+The `mdcontext tree` command serves dual purposes:
+1. **Directory mode**: Lists all markdown files in a directory hierarchy
+2. **File mode**: Shows document structure with heading hierarchy and token counts
+This is a clever design that provides both high-level navigation and detailed document insight through a single command.
+## Test Environment
+- Repository: `/Users/alphab/Dev/LLM/DEV/agentic-flow`
+- Files indexed: 1,561 markdown documents
+- Sections: 52,714
+- Test date: 2026-01-26
+## Command Syntax
+```bash
+mdcontext tree [path] [options]
+Options:
+  --json    Output as JSON
+  --pretty  Pretty-print JSON output
+```
+## Usage Patterns
+### 1. File Tree (Directory Mode)
+When passed a directory path, shows all markdown files in the tree:
+```bash
+# List all markdown files in current directory
+cd /Users/alphab/Dev/LLM/DEV/agentic-flow
+mdcontext tree
+# Output format:
+# Markdown files in /Users/alphab/Dev/LLM/DEV/agentic-flow:
+#
+#   CLAUDE.md
+#   README.md
+#   agentic-flow/CHANGELOG.md
+#   agentic-flow/README.md
+#   ...
+```
+**Performance**: 685ms for 1,561 files (very fast)
+### 2. Document Outline (File Mode)
+When passed a file path, shows heading structure with token counts:
+```bash
+# Show document outline
+mdcontext tree README.md
+# Output:
+# 🚀 Agentic-Flow v2.0.0-alpha
+# Total tokens: 18095
+#
+# 🚀 Agentic-Flow v2.0.0-alpha [269 tokens]
+#   ## 🎉 What's New in v2.0.0-alpha [16 tokens]
+#     ### SONA: Self-Optimizing Neural Architecture  🧠 [249 tokens]
+#     ### Complete AgentDB@alpha Integration  🧠 [246 tokens]
+#   ## 📖 Table of Contents [237 tokens]
+#   ...
+```
+**Performance**: 620ms for a 3,008-line document (42,521 tokens) - excellent
+## Output Formats
+### 1. Human-Readable (Default)
+Directory listing:
+```
+Markdown files in /path/to/directory:
+  file1.md
+  file2.md
+  subdir/file3.md
+Total: X files
+```
+Document outline:
+```
+# Document Title
+Total tokens: XXXX
+# Heading 1 [XXX tokens]
+  ## Heading 2 [XXX tokens]
+    ### Heading 3 [XXX tokens]
+```
+### 2. JSON Format
+Directory listing (--json):
+```json
+[
+  {
+    "path": "/absolute/path/to/file.md",
+    "relativePath": "relative/path/to/file.md"
+  },
+  ...
+]
+```
+Document outline (--json --pretty):
+```json
+{
+  "title": "Document Title",
+  "path": "/absolute/path/to/file.md",
+  "totalTokens": 18095,
+  "sections": [
+    {
+      "heading": "Main Heading",
+      "level": 1,
+      "tokens": 269,
+      "children": [
+        {
+          "heading": "Sub Heading",
+          "level": 2,
+          "tokens": 16,
+          "children": []
+        }
+      ]
+    }
+  ]
+}
+```
+## Test Results
+### Test 1: Large Repository Tree View
+```bash
+cd /Users/alphab/Dev/LLM/DEV/agentic-flow
+time mdcontext tree
+```
+**Results**:
+- Files listed: 1,561 markdown files
+- Time: 685ms (0.75s user + 0.21s system)
+- CPU: 141%
+- Status: Excellent performance
+### Test 2: Subdirectory Filtering
+```bash
+mdcontext tree docs/guides/
+```
+**Results**:
+- Files found: 28 files (including subdirectories)
+- Time: 576ms
+- Output: Clean, relative paths from specified directory
+- Includes subdirectories: Yes (e.g., `getting-started/quick-start.md`)
+**Comparison with filesystem**:
+```bash
+ls -1 docs/guides/*.md | wc -l  # 26 files (only top level)
+mdcontext tree docs/guides/      # 28 files (includes subdirs)
+```
+The tree command correctly includes nested directories, while shell globbing doesn't by default.
+### Test 3: Document Outline - Small File
+```bash
+mdcontext tree docs/guides/README.md
+```
+**Output**:
+```
+# User Guides
+Total tokens: 221
+# User Guides [28 tokens]
+  ## Getting Started [95 tokens]
+  ## Model Configuration [47 tokens]
+  ## Examples [45 tokens]
+```
+**Quality**: Perfect extraction, clear hierarchy
+### Test 4: Document Outline - Large Complex File
+```bash
+mdcontext tree docs/research/JJ_INTEGRATION_ANALYSIS.md
+```
+**Results**:
+- File size: 3,008 lines
+- Total tokens: 42,521
+- Sections extracted: ~50+ headings
+- Nesting levels: Up to 4 levels (####)
+- Time: 620ms
+- Quality: Excellent - accurately captured nested structure
+**Sample output**:
+```
+# Jujutsu (jj) VCS Integration with Agentic-Flow and AgentDB
+Total tokens: 42521
+# Jujutsu (jj) VCS Integration with Agentic-Flow and AgentDB [22 tokens]
+  ## Ultra-Deep Research and Analysis [71 tokens]
+  ## Executive Summary [139 tokens]
+    ### Key Findings [246 tokens]
+  ## 1. Jujutsu VCS Core Capabilities [13 tokens]
+    ### 1.1 Operation Log Architecture [489 tokens]
+    ### 1.2 Working Copy Management [265 tokens]
+    ...
+  ## 6. Technical Architecture [11 tokens]
+    ### 6.1 System Overview [2654 tokens]
+    ### 6.2 Data Flow [447 tokens]
+    ### 6.3 Integration Layers [11 tokens]
+      #### Layer 1: CLI Wrapper (Immediate Implementation) [981 tokens]
+      #### Layer 2: Node.js Native Module (Future) [569 tokens]
+```
+### Test 5: Document Outline - Medium Complexity
+```bash
+mdcontext tree docs/guides/MULTI-MODEL-ROUTER.md
+```
+**Results**:
+- Total tokens: 9,784
+- Heading levels: 1-4 (with proper nesting)
+- Token distribution visible per section
+- Formatting preserved (emojis in headings)
+### Test 6: JSON Output
+```bash
+mdcontext tree docs/guides/ --json | head -100
+```
+**Results**:
+- Format: Array of objects with `path` and `relativePath`
+- Size: Large (64KB for 522 files in docs/)
+- Structure: Clean, parseable, useful for programmatic access
+- Performance: Same speed as regular output
+### Test 7: Pretty JSON Output
+```bash
+mdcontext tree README.md --json --pretty | head -100
+```
+**Results**:
+- Format: Hierarchical tree structure
+- Fields: `title`, `path`, `totalTokens`, `sections[]`
+- Section structure: `heading`, `level`, `tokens`, `children[]`
+- Nesting: Properly nested with children arrays
+- Quality: Machine-readable and human-inspectable
+### Test 8: Pattern Matching (Attempted)
+```bash
+mdcontext tree 'docs/guides/*.md'
+```
+**Result**: ❌ Error - glob patterns not supported
+```
+FileReadError: Cannot access path: ENOENT: no such file or directory,
+stat '/Users/alphab/Dev/LLM/DEV/agentic-flow/docs/guides/*.md'
+```
+**Note**: The command expects a literal directory or file path, not shell glob patterns. Use directory mode instead to list all files in a directory.
+## Use Cases
+### 1. Repository Navigation
+**Scenario**: Exploring a new codebase
+```bash
+mdcontext tree              # Get overview of all markdown files
+mdcontext tree docs/        # Focus on documentation
+mdcontext tree README.md    # Understand main document structure
+```
+**Value**: Quick orientation without opening files
+### 2. Documentation Planning
+**Scenario**: Understanding document scope and organization
+```bash
+mdcontext tree docs/guides/MULTI-MODEL-ROUTER.md
+```
+**Output shows**:
+- Total token count (9,784) - useful for context windows
+- Section hierarchy - see organization at a glance
+- Token distribution - identify heavy sections
+**Value**: Token budget planning, structure review
+### 3. IDE Integration
+**Scenario**: Building a documentation browser
+```bash
+mdcontext tree docs/ --json --pretty
+```
+**JSON output** enables:
+- File tree views
+- Table of contents generation
+- Document navigation panels
+- Search result previews
+**Example integration**:
+```javascript
+const files = JSON.parse(await exec('mdcontext tree docs/ --json'));
+const toc = JSON.parse(await exec('mdcontext tree README.md --json'));
+// Build interactive documentation browser
+files.forEach(file => {
+  const outline = JSON.parse(await exec(`mdcontext tree ${file.path} --json`));
+  renderDocumentWithTOC(outline);
+});
+```
+### 4. Content Analysis
+**Scenario**: Finding oversized sections for splitting
+```bash
+mdcontext tree large-doc.md | grep -E '\[([0-9]{4,}) tokens\]'
+```
+**Use**: Identify sections > 1000 tokens that might need splitting
+### 5. Documentation Quality Checks
+**Scenario**: Verify all documents have proper structure
+```bash
+for file in $(mdcontext tree docs/ --json | jq -r '.[].path'); do
+  mdcontext tree "$file" --json | jq '.sections | length'
+done
+```
+**Use**: Ensure minimum heading structure exists
+### 6. LLM Context Preparation
+**Scenario**: Check if document fits in context window
+```bash
+mdcontext tree README.md | head -3
+# Output: Total tokens: 18095
+```
+**Decision**: 18k tokens fits in most context windows, proceed
+### 7. Documentation Refactoring
+**Scenario**: Understanding token distribution before restructuring
+```bash
+mdcontext tree docs/guides/MCP-AUTHENTICATION.md
+```
+**Shows**:
+- Which sections are heaviest
+- Nesting depth issues
+- Potential splitting points
+## Performance Analysis
+### Speed Tests
+| Operation | Files/Size | Time | Notes |
+|-----------|-----------|------|-------|
+| Full repo tree | 1,561 files | 685ms | Excellent |
+| Subdirectory tree | 28 files | 576ms | Fast (includes index lookup) |
+| Small file outline | 221 tokens | ~600ms | Fast |
+| Large file outline | 42,521 tokens | 620ms | Excellent (3,008 lines) |
+| JSON output | Same | ~same | No performance penalty |
+**Key Insights**:
+- Performance scales well with repository size
+- Large document parsing is very efficient
+- No significant overhead for JSON formatting
+- Index lookup is fast (indexed files only)
+### Scalability
+The command operates on **indexed files only**, which means:
+- Performance depends on index quality
+- Unindexed files won't appear
+- Changes require re-indexing
+- Speed benefits from index caching
+**Test**: After indexing (579ms for 1,561 docs), tree command is instant (685ms)
+## Heading Extraction Quality
+### Test: Complex Nesting
+Tested on file with 4 levels of headings:
+```markdown
+# Level 1
+## Level 2
+### Level 3
+#### Level 4
+```
+**Result**: ✅ All levels correctly extracted and nested
+### Test: Emoji Handling
+Tested on headings with emojis:
+```markdown
+# 🚀 Agentic-Flow v2.0.0-alpha
+## 🎉 What's New
+```
+**Result**: ✅ Emojis preserved in output
+### Test: Numbered Headings
+Tested on structured documentation:
+```markdown
+## 1. Jujutsu VCS Core Capabilities
+### 1.1 Operation Log Architecture
+### 1.2 Working Copy Management
+```
+**Result**: ✅ Numbers preserved, correct nesting
+### Test: Token Counting
+Examined token counts across sections:
+**Observations**:
+- Small headings: 10-50 tokens (mostly structure)
+- Content sections: 100-500 tokens (normal paragraphs)
+- Large sections: 500-2000+ tokens (complex content)
+- Very heavy sections: 2654 tokens (System Overview in test file)
+**Accuracy**: Token counts appear accurate (using tiktoken internally)
+## Edge Cases and Limitations
+### 1. Glob Pattern Support
+**Issue**: Shell glob patterns are NOT supported
+```bash
+mdcontext tree 'docs/*.md'  # ❌ Fails
+```
+**Workaround**: Use directory mode
+```bash
+mdcontext tree docs/  # ✅ Works - lists all files recursively
+```
+### 2. Depth Limits
+**Observation**: No apparent depth limit in help output
+**Test needed**: Does it limit directory recursion depth?
+**Current behavior**: Appears to recurse fully through directories
+### 3. Hidden Files
+From indexing output: "20 hidden" files were skipped
+**Question**: Can tree command show hidden files?
+**Current**: Follows index, which skips hidden files by default
+### 4. Large Repositories
+**Tested**: 1,561 files performed well (685ms)
+**Question**: What happens with 10k+ files?
+**Expectation**: Should still be fast (index-based lookup)
+### 5. Malformed Markdown
+**Not tested**: How does it handle documents with:
+- Missing closing headings
+- Invalid nesting (### before ##)
+- Duplicate heading text
+**Expected**: Likely handles gracefully (remark-based parser)
+## Comparison with Alternatives
+### vs `find` + `ls`
+```bash
+find . -name "*.md"           # Finds all .md files
+mdcontext tree                # Finds indexed markdown files
+```
+**Advantage mdcontext**:
+- Filters to indexed files only
+- Respects .gitignore patterns
+- Cleaner output formatting
+- Includes relative paths
+### vs `tree` command
+```bash
+tree -P "*.md"                # File system tree
+mdcontext tree                # Document tree
+```
+**Advantage mdcontext**:
+- Document-aware (not just files)
+- Token counting built-in
+- JSON output for integration
+- Outline view for files
+### vs manual TOC generation
+```bash
+grep "^#" README.md            # Extract headings
+mdcontext tree README.md       # Full outline with tokens
+```
+**Advantage mdcontext**:
+- Token counts per section
+- Proper nesting structure
+- Machine-readable JSON
+- Consistent formatting
+## Integration Ideas
+### 1. VSCode Extension
+**Feature**: Documentation sidebar
+```typescript
+// Fetch file tree
+const tree = await exec('mdcontext tree --json');
+// Render in sidebar
+tree.forEach(file => {
+  const outline = await exec(`mdcontext tree ${file.path} --json`);
+  renderOutline(outline);
+});
+```
+**UI**:
+- File tree with markdown files
+- Click to see outline
+- Token counts visible
+- Search within structure
+### 2. Documentation Site Generator
+**Feature**: Auto-generate navigation
+```javascript
+const files = JSON.parse(await exec('mdcontext tree docs/ --json'));
+// Build nav structure
+const nav = buildNav(files);
+// Generate TOC for each page
+files.forEach(async file => {
+  const outline = JSON.parse(
+    await exec(`mdcontext tree ${file.path} --json`)
+  );
+  generatePage(file, outline);
+});
+```
+### 3. LLM Context Builder
+**Feature**: Smart document selection
+```bash
+# Check document size before adding to context
+SIZE=$(mdcontext tree README.md | head -2 | tail -1 | grep -oE '[0-9]+')
+if [ $SIZE -lt 5000 ]; then
+  mdcontext context README.md >> context.txt
+else
+  echo "Document too large, skipping"
+fi
+```
+### 4. Documentation Linter
+**Feature**: Enforce structure rules
+```javascript
+const outline = JSON.parse(
+  await exec('mdcontext tree docs/guide.md --json')
+);
+// Check rules
+if (outline.sections.length === 0) {
+  error('Document has no headings');
+}
+outline.sections.forEach(section => {
+  if (section.tokens > 1000) {
+    warn(`Section "${section.heading}" is too long (${section.tokens} tokens)`);
+  }
+});
+```
+### 5. API Documentation Generator
+**Feature**: Parse API docs structure
+```bash
+# Find all API reference files
+mdcontext tree docs/api/ --json | jq -r '.[].path' | while read file; do
+  # Extract structure
+  mdcontext tree "$file" --json | \
+    jq '.sections[] | select(.level == 2) | .heading'
+done
+```
+### 6. Git Pre-commit Hook
+**Feature**: Validate documentation changes
+```bash
+# In .git/hooks/pre-commit
+for file in $(git diff --cached --name-only | grep '\.md$'); do
+  outline=$(mdcontext tree "$file" --json)
+  tokens=$(echo "$outline" | jq '.totalTokens')
+  if [ $tokens -gt 10000 ]; then
+    echo "Warning: $file is very large ($tokens tokens)"
+  fi
+done
+```
+## Issues Found
+### Issue 1: Glob Pattern Not Supported
+**Expected**: `mdcontext tree 'docs/*.md'` would filter files
+**Actual**: Error - literal path interpretation only
+**Severity**: Minor - directory mode works well enough
+**Workaround**: Use directory path instead
+### Issue 2: No Depth Limit Option
+**Observation**: No `--max-depth` flag
+**Use case**: List only top-level subdirectories
+**Current**: Always recurses fully
+**Impact**: Minor - usually want full tree anyway
+### Issue 3: No File Count in Directory Mode Output
+**Observation**: Says "Total: X files" at end, but not in JSON
+**JSON output**: Just array of files
+**Suggestion**: Add metadata to JSON output:
+```json
+{
+  "root": "/path/to/dir",
+  "count": 1561,
+  "files": [...]
+}
+```
+## Recommendations
+### For Users
+1. **Start with tree view** - Get repository overview first
+2. **Use outline for planning** - Check token counts before editing
+3. **JSON for scripting** - Integrate with build tools
+4. **Directory mode for discovery** - Find relevant documentation quickly
+### For Integration
+1. **Cache outline data** - Parse once, use many times
+2. **Build navigation from JSON** - Clean structured data
+3. **Show token budgets** - Help users stay within context limits
+4. **Enable filtering** - Let users explore structure interactively
+### For mdcontext Development
+1. **Add glob pattern support** - More flexible file filtering
+2. **Include metadata in JSON** - File counts, timestamps
+3. **Add depth limiting** - Optional for large trees
+4. **Consider section IDs** - Enable linking to sections
+5. **Add filtering options** - By token count, heading level, etc.
+## Conclusion
+The `mdcontext tree` command is a powerful dual-purpose tool that excels at both repository navigation and document structure analysis.
+**Strengths**:
+- Fast performance (sub-second for large repos)
+- Clean, intuitive output
+- Excellent heading extraction
+- Useful token counting
+- Strong JSON output for integration
+- Smart dual-mode design (directory vs file)
+**Use Cases**:
+- Repository exploration and navigation
+- Documentation planning and token budgeting
+- IDE and tooling integration
+- Content analysis and quality checks
+- LLM context window management
+**Performance**:
+- Grade: A
+- Scales well with repository size
+- Handles large documents efficiently
+- No performance penalty for JSON output
+**Overall Assessment**: This is a well-designed, practical command that provides real value for both human users and programmatic integration. The dual-mode behavior (directory listing vs document outline) is clever and intuitive. The addition of token counts makes it particularly useful for LLM-related workflows.
+**Primary Value**: Bridges the gap between file system navigation and document content understanding, all while being cognizant of token budgets.