mdcontext 0.0.1 → 0.2.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.changeset/README.md +28 -0
- package/.changeset/config.json +11 -0
- package/.claude/settings.local.json +25 -0
- package/.github/workflows/ci.yml +83 -0
- package/.github/workflows/claude-code-review.yml +44 -0
- package/.github/workflows/claude.yml +85 -0
- package/.github/workflows/release.yml +113 -0
- package/.tldrignore +112 -0
- package/BACKLOG.md +338 -0
- package/CONTRIBUTING.md +186 -0
- package/NOTES/NOTES +44 -0
- package/README.md +434 -11
- package/biome.json +36 -0
- package/cspell.config.yaml +14 -0
- package/dist/chunk-23UPXDNL.js +3044 -0
- package/dist/chunk-2W7MO2DL.js +1366 -0
- package/dist/chunk-3NUAZGMA.js +1689 -0
- package/dist/chunk-7TOWB2XB.js +366 -0
- package/dist/chunk-7XOTOADQ.js +3065 -0
- package/dist/chunk-AH2PDM2K.js +3042 -0
- package/dist/chunk-BNXWSZ63.js +3742 -0
- package/dist/chunk-BTL5DJVU.js +3222 -0
- package/dist/chunk-HDHYG7E4.js +104 -0
- package/dist/chunk-HLR4KZBP.js +3234 -0
- package/dist/chunk-IP3FRFEB.js +1045 -0
- package/dist/chunk-KHU56VDO.js +3042 -0
- package/dist/chunk-KRYIFLQR.js +88 -0
- package/dist/chunk-LBSDNLEM.js +287 -0
- package/dist/chunk-MNTQ7HCP.js +2643 -0
- package/dist/chunk-MUJELQQ6.js +1387 -0
- package/dist/chunk-MXJGMSLV.js +2199 -0
- package/dist/chunk-N6QJGC3Z.js +2636 -0
- package/dist/chunk-OBELGBPM.js +1713 -0
- package/dist/chunk-OT7R5XTA.js +3192 -0
- package/dist/chunk-P7X4RA2T.js +106 -0
- package/dist/chunk-PIDUQNC2.js +3185 -0
- package/dist/chunk-POGCDIH4.js +3187 -0
- package/dist/chunk-PSIEOQGZ.js +3043 -0
- package/dist/chunk-PVRT3IHA.js +3238 -0
- package/dist/chunk-QNN4TT23.js +1430 -0
- package/dist/chunk-RE3R45RJ.js +3042 -0
- package/dist/chunk-S7E6TFX6.js +803 -0
- package/dist/chunk-SG6GLU4U.js +1378 -0
- package/dist/chunk-SJCDV2ST.js +274 -0
- package/dist/chunk-SYE5XLF3.js +104 -0
- package/dist/chunk-T5VLYBZD.js +103 -0
- package/dist/chunk-TOQB7VWU.js +3238 -0
- package/dist/chunk-VFNMZ4ZQ.js +3228 -0
- package/dist/chunk-VVTGZNBT.js +1629 -0
- package/dist/chunk-W7Q4RFEV.js +104 -0
- package/dist/chunk-XTYYVRLO.js +3190 -0
- package/dist/chunk-Y6MDYVJD.js +3063 -0
- package/dist/cli/main.d.ts +1 -0
- package/dist/cli/main.js +5458 -0
- package/dist/index.d.ts +653 -0
- package/dist/index.js +79 -0
- package/dist/mcp/server.d.ts +1 -0
- package/dist/mcp/server.js +472 -0
- package/dist/schema-BAWSG7KY.js +22 -0
- package/dist/schema-E3QUPL26.js +20 -0
- package/dist/schema-EHL7WUT6.js +20 -0
- package/docs/019-USAGE.md +625 -0
- package/docs/020-current-implementation.md +364 -0
- package/docs/021-DOGFOODING-FINDINGS.md +175 -0
- package/docs/BACKLOG.md +80 -0
- package/docs/CONFIG.md +1123 -0
- package/docs/DESIGN.md +439 -0
- package/docs/ERRORS.md +383 -0
- package/docs/PROJECT.md +88 -0
- package/docs/ROADMAP.md +407 -0
- package/docs/summarization.md +320 -0
- package/docs/test-links.md +9 -0
- package/justfile +40 -0
- package/package.json +74 -9
- package/pnpm-workspace.yaml +5 -0
- package/research/INDEX.md +315 -0
- package/research/code-review/README.md +90 -0
- package/research/code-review/cli-error-handling-review.md +979 -0
- package/research/code-review/code-review-validation-report.md +464 -0
- package/research/code-review/main-ts-review.md +1128 -0
- package/research/config-analysis/01-current-implementation.md +470 -0
- package/research/config-analysis/02-strategy-recommendation.md +428 -0
- package/research/config-analysis/03-task-candidates.md +715 -0
- package/research/config-analysis/033-research-configuration-management.md +828 -0
- package/research/config-analysis/034-research-effect-cli-config.md +1504 -0
- package/research/config-analysis/04-consolidated-task-candidates.md +277 -0
- package/research/config-docs/SUMMARY.md +357 -0
- package/research/config-docs/TEST-RESULTS.md +776 -0
- package/research/config-docs/TODO.md +542 -0
- package/research/config-docs/analysis.md +744 -0
- package/research/config-docs/fix-validation.md +502 -0
- package/research/config-docs/help-audit.md +264 -0
- package/research/config-docs/help-system-analysis.md +890 -0
- package/research/dogfood/consolidated-tool-evaluation.md +373 -0
- package/research/dogfood/strategy-a/a-synthesis.md +184 -0
- package/research/dogfood/strategy-a/a1-docs.md +226 -0
- package/research/dogfood/strategy-a/a2-amorphic.md +156 -0
- package/research/dogfood/strategy-a/a3-llm.md +164 -0
- package/research/dogfood/strategy-b/b-synthesis.md +228 -0
- package/research/dogfood/strategy-b/b1-architecture.md +207 -0
- package/research/dogfood/strategy-b/b2-gaps.md +258 -0
- package/research/dogfood/strategy-b/b3-workflows.md +250 -0
- package/research/dogfood/strategy-c/c-synthesis.md +451 -0
- package/research/dogfood/strategy-c/c1-explorer.md +192 -0
- package/research/dogfood/strategy-c/c2-diver-memory.md +145 -0
- package/research/dogfood/strategy-c/c3-diver-control.md +148 -0
- package/research/dogfood/strategy-c/c4-diver-failure.md +151 -0
- package/research/dogfood/strategy-c/c5-diver-execution.md +221 -0
- package/research/dogfood/strategy-c/c6-diver-org.md +221 -0
- package/research/effect-cli-error-handling.md +845 -0
- package/research/effect-errors-as-values.md +943 -0
- package/research/errors-task-analysis/00-consolidated-tasks.md +207 -0
- package/research/errors-task-analysis/cli-commands-analysis.md +909 -0
- package/research/errors-task-analysis/embeddings-analysis.md +709 -0
- package/research/errors-task-analysis/index-search-analysis.md +812 -0
- package/research/frontmatter/COMMENTS-ARE-SKIPPED.md +149 -0
- package/research/frontmatter/LLM-CODE-NAVIGATION.md +276 -0
- package/research/issue-review.md +603 -0
- package/research/llm-summarization/agent-cli-tools-2026.md +1082 -0
- package/research/llm-summarization/alternative-providers-2026.md +1428 -0
- package/research/llm-summarization/anthropic-2026.md +367 -0
- package/research/llm-summarization/claude-cli-integration.md +1706 -0
- package/research/llm-summarization/cli-integration-patterns.md +3155 -0
- package/research/llm-summarization/openai-2026.md +473 -0
- package/research/llm-summarization/openai-compatible-providers-2026.md +1022 -0
- package/research/llm-summarization/opencode-cli-integration.md +1552 -0
- package/research/llm-summarization/prompt-engineering-2026.md +1426 -0
- package/research/llm-summarization/prototype-results.md +56 -0
- package/research/llm-summarization/provider-switching-patterns-2026.md +2153 -0
- package/research/llm-summarization/typescript-llm-libraries-2026.md +2436 -0
- package/research/mdcontext-error-analysis.md +521 -0
- package/research/mdcontext-pudding/00-EXECUTIVE-SUMMARY.md +282 -0
- package/research/mdcontext-pudding/01-index-embed.md +956 -0
- package/research/mdcontext-pudding/02-search-COMMANDS.md +142 -0
- package/research/mdcontext-pudding/02-search-SUMMARY.md +146 -0
- package/research/mdcontext-pudding/02-search.md +970 -0
- package/research/mdcontext-pudding/03-context.md +779 -0
- package/research/mdcontext-pudding/04-navigation-and-analytics.md +803 -0
- package/research/mdcontext-pudding/04-tree.md +704 -0
- package/research/mdcontext-pudding/05-config.md +1038 -0
- package/research/mdcontext-pudding/06-links-summary.txt +87 -0
- package/research/mdcontext-pudding/06-links.md +679 -0
- package/research/mdcontext-pudding/07-stats.md +693 -0
- package/research/mdcontext-pudding/BUG-FIX-PLAN.md +388 -0
- package/research/mdcontext-pudding/P0-BUG-VALIDATION.md +167 -0
- package/research/mdcontext-pudding/README.md +168 -0
- package/research/mdcontext-pudding/TESTING-SUMMARY.md +128 -0
- package/research/npm_publish/011-npm-workflow-research-agent2.md +792 -0
- package/research/npm_publish/012-npm-workflow-research-agent1.md +530 -0
- package/research/npm_publish/013-npm-workflow-research-agent3.md +722 -0
- package/research/npm_publish/014-npm-workflow-synthesis.md +556 -0
- package/research/npm_publish/031-npm-workflow-task-analysis.md +134 -0
- package/research/research-quality-review.md +834 -0
- package/research/semantic-search/002-research-embedding-models.md +490 -0
- package/research/semantic-search/003-research-rag-alternatives.md +523 -0
- package/research/semantic-search/004-research-vector-search.md +841 -0
- package/research/semantic-search/032-research-semantic-search.md +427 -0
- package/research/semantic-search/embedding-text-analysis.md +156 -0
- package/research/semantic-search/multi-word-failure-reproduction.md +171 -0
- package/research/semantic-search/query-processing-analysis.md +207 -0
- package/research/semantic-search/root-cause-and-solution.md +114 -0
- package/research/semantic-search/threshold-validation-report.md +69 -0
- package/research/semantic-search/vector-search-analysis.md +63 -0
- package/research/task-management-2026/00-synthesis-recommendations.md +295 -0
- package/research/task-management-2026/01-ai-workflow-tools.md +416 -0
- package/research/task-management-2026/02-agent-framework-patterns.md +476 -0
- package/research/task-management-2026/03-lightweight-file-based.md +567 -0
- package/research/task-management-2026/04-established-tools-ai-features.md +541 -0
- package/research/task-management-2026/linear/01-core-features-workflow.md +771 -0
- package/research/task-management-2026/linear/02-api-integrations.md +930 -0
- package/research/task-management-2026/linear/03-ai-features.md +368 -0
- package/research/task-management-2026/linear/04-pricing-setup.md +205 -0
- package/research/task-management-2026/linear/05-usage-patterns-best-practices.md +605 -0
- package/research/test-path-issues.md +276 -0
- package/review/ALP-76/1-error-type-design.md +962 -0
- package/review/ALP-76/2-error-handling-patterns.md +906 -0
- package/review/ALP-76/3-error-presentation.md +624 -0
- package/review/ALP-76/4-test-coverage.md +625 -0
- package/review/ALP-76/5-migration-completeness.md +440 -0
- package/review/ALP-76/6-effect-best-practices.md +755 -0
- package/scripts/apply-branch-protection.sh +47 -0
- package/scripts/branch-protection-templates.json +79 -0
- package/scripts/prototype-summarization.ts +346 -0
- package/scripts/rebuild-hnswlib.js +58 -0
- package/scripts/setup-branch-protection.sh +64 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/active-provider.json +7 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.json +541 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.meta.json +5 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/config.json +8 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/documents.json +60 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/links.json +13 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/sections.json +1197 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/configuration-management.md +99 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/distributed-systems.md +92 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/error-handling.md +78 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/failure-automation.md +55 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/job-context.md +69 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/process-orchestration.md +99 -0
- package/src/cli/argv-preprocessor.test.ts +210 -0
- package/src/cli/argv-preprocessor.ts +202 -0
- package/src/cli/cli.test.ts +627 -0
- package/src/cli/commands/backlinks.ts +54 -0
- package/src/cli/commands/config-cmd.ts +642 -0
- package/src/cli/commands/context.ts +285 -0
- package/src/cli/commands/duplicates.ts +122 -0
- package/src/cli/commands/embeddings.ts +529 -0
- package/src/cli/commands/index-cmd.ts +480 -0
- package/src/cli/commands/index.ts +16 -0
- package/src/cli/commands/links.ts +52 -0
- package/src/cli/commands/search.ts +1281 -0
- package/src/cli/commands/stats.ts +149 -0
- package/src/cli/commands/tree.ts +128 -0
- package/src/cli/config-layer.ts +176 -0
- package/src/cli/error-handler.test.ts +235 -0
- package/src/cli/error-handler.ts +655 -0
- package/src/cli/flag-schemas.ts +341 -0
- package/src/cli/help.ts +588 -0
- package/src/cli/index.ts +9 -0
- package/src/cli/main.ts +435 -0
- package/src/cli/options.ts +41 -0
- package/src/cli/shared-error-handling.ts +199 -0
- package/src/cli/typo-suggester.test.ts +105 -0
- package/src/cli/typo-suggester.ts +130 -0
- package/src/cli/utils.ts +259 -0
- package/src/config/file-provider.test.ts +320 -0
- package/src/config/file-provider.ts +273 -0
- package/src/config/index.ts +72 -0
- package/src/config/integration.test.ts +667 -0
- package/src/config/precedence.test.ts +277 -0
- package/src/config/precedence.ts +451 -0
- package/src/config/schema.test.ts +414 -0
- package/src/config/schema.ts +603 -0
- package/src/config/service.test.ts +320 -0
- package/src/config/service.ts +243 -0
- package/src/config/testing.test.ts +264 -0
- package/src/config/testing.ts +110 -0
- package/src/core/index.ts +1 -0
- package/src/core/types.ts +113 -0
- package/src/duplicates/detector.test.ts +183 -0
- package/src/duplicates/detector.ts +414 -0
- package/src/duplicates/index.ts +18 -0
- package/src/embeddings/embedding-namespace.test.ts +300 -0
- package/src/embeddings/embedding-namespace.ts +947 -0
- package/src/embeddings/heading-boost.test.ts +222 -0
- package/src/embeddings/hnsw-build-options.test.ts +198 -0
- package/src/embeddings/hyde.test.ts +272 -0
- package/src/embeddings/hyde.ts +264 -0
- package/src/embeddings/index.ts +10 -0
- package/src/embeddings/openai-provider.ts +414 -0
- package/src/embeddings/pricing.json +22 -0
- package/src/embeddings/provider-constants.ts +204 -0
- package/src/embeddings/provider-errors.test.ts +967 -0
- package/src/embeddings/provider-errors.ts +565 -0
- package/src/embeddings/provider-factory.test.ts +240 -0
- package/src/embeddings/provider-factory.ts +225 -0
- package/src/embeddings/provider-integration.test.ts +788 -0
- package/src/embeddings/query-preprocessing.test.ts +187 -0
- package/src/embeddings/semantic-search-threshold.test.ts +508 -0
- package/src/embeddings/semantic-search.ts +1270 -0
- package/src/embeddings/types.ts +359 -0
- package/src/embeddings/vector-store.ts +708 -0
- package/src/embeddings/voyage-provider.ts +313 -0
- package/src/errors/errors.test.ts +845 -0
- package/src/errors/index.ts +533 -0
- package/src/index/ignore-patterns.test.ts +354 -0
- package/src/index/ignore-patterns.ts +305 -0
- package/src/index/index.ts +4 -0
- package/src/index/indexer.ts +684 -0
- package/src/index/storage.ts +260 -0
- package/src/index/types.ts +147 -0
- package/src/index/watcher.ts +189 -0
- package/src/index.ts +30 -0
- package/src/integration/search-keyword.test.ts +678 -0
- package/src/mcp/server.ts +612 -0
- package/src/parser/index.ts +1 -0
- package/src/parser/parser.test.ts +291 -0
- package/src/parser/parser.ts +394 -0
- package/src/parser/section-filter.test.ts +277 -0
- package/src/parser/section-filter.ts +392 -0
- package/src/search/__tests__/hybrid-search.test.ts +650 -0
- package/src/search/bm25-store.ts +366 -0
- package/src/search/cross-encoder.test.ts +253 -0
- package/src/search/cross-encoder.ts +406 -0
- package/src/search/fuzzy-search.test.ts +419 -0
- package/src/search/fuzzy-search.ts +273 -0
- package/src/search/hybrid-search.ts +448 -0
- package/src/search/path-matcher.test.ts +276 -0
- package/src/search/path-matcher.ts +33 -0
- package/src/search/query-parser.test.ts +260 -0
- package/src/search/query-parser.ts +319 -0
- package/src/search/searcher.test.ts +280 -0
- package/src/search/searcher.ts +724 -0
- package/src/search/wink-bm25.d.ts +30 -0
- package/src/summarization/cli-providers/claude.ts +202 -0
- package/src/summarization/cli-providers/detection.test.ts +273 -0
- package/src/summarization/cli-providers/detection.ts +118 -0
- package/src/summarization/cli-providers/index.ts +8 -0
- package/src/summarization/cost.test.ts +139 -0
- package/src/summarization/cost.ts +102 -0
- package/src/summarization/error-handler.test.ts +127 -0
- package/src/summarization/error-handler.ts +111 -0
- package/src/summarization/index.ts +102 -0
- package/src/summarization/pipeline.test.ts +498 -0
- package/src/summarization/pipeline.ts +231 -0
- package/src/summarization/prompts.test.ts +269 -0
- package/src/summarization/prompts.ts +133 -0
- package/src/summarization/provider-factory.test.ts +396 -0
- package/src/summarization/provider-factory.ts +178 -0
- package/src/summarization/types.ts +184 -0
- package/src/summarize/budget-bugs.test.ts +620 -0
- package/src/summarize/formatters.ts +419 -0
- package/src/summarize/index.ts +20 -0
- package/src/summarize/summarizer.test.ts +275 -0
- package/src/summarize/summarizer.ts +597 -0
- package/src/summarize/verify-bugs.test.ts +238 -0
- package/src/types/huggingface-transformers.d.ts +66 -0
- package/src/utils/index.ts +1 -0
- package/src/utils/tokens.test.ts +142 -0
- package/src/utils/tokens.ts +186 -0
- package/tests/fixtures/cli/.mdcontext/active-provider.json +7 -0
- package/tests/fixtures/cli/.mdcontext/config.json +8 -0
- package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
- package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
- package/tests/fixtures/cli/.mdcontext/indexes/documents.json +33 -0
- package/tests/fixtures/cli/.mdcontext/indexes/links.json +12 -0
- package/tests/fixtures/cli/.mdcontext/indexes/sections.json +247 -0
- package/tests/fixtures/cli/README.md +9 -0
- package/tests/fixtures/cli/api-reference.md +11 -0
- package/tests/fixtures/cli/getting-started.md +11 -0
- package/tests/integration/embed-index.test.ts +712 -0
- package/tests/integration/search-context.test.ts +469 -0
- package/tests/integration/search-semantic.test.ts +522 -0
- package/tsconfig.json +26 -0
- package/vitest.config.ts +16 -0
- package/vitest.setup.ts +12 -0
|
@@ -0,0 +1,373 @@
|
|
|
1
|
+
# Consolidated Tool Evaluation Report: mdcontext
|
|
2
|
+
|
|
3
|
+
## 1. Executive Summary
|
|
4
|
+
|
|
5
|
+
**Overall Verdict**: The mdcontext tool is **highly effective for structured documentation research** with an average rating of **4.06/5** across all three strategies (15 total agents). The tool successfully enabled exploration of a ~207K token documentation corpus while reading only 25-30% of raw content through targeted extraction.
|
|
6
|
+
|
|
7
|
+
**Key Strengths**: The `tree`, `context --section`, and keyword `search` commands form a powerful workflow for systematic documentation analysis. Token-aware budgeting and section-level extraction are major differentiators.
|
|
8
|
+
|
|
9
|
+
**Key Weaknesses**: Multi-word search failures, 10-result cap without pagination, and unreliable semantic search for conceptual queries are the primary blockers to broader adoption.
|
|
10
|
+
|
|
11
|
+
**Bottom Line**: Recommended for structured markdown documentation research. Critical improvements needed in search capabilities to unlock full potential.
|
|
12
|
+
|
|
13
|
+
---
|
|
14
|
+
|
|
15
|
+
## 2. Aggregate Scores
|
|
16
|
+
|
|
17
|
+
### By Strategy
|
|
18
|
+
|
|
19
|
+
| Strategy | Methodology | Agents | Avg Rating | Confidence | Total Commands |
|
|
20
|
+
| -------- | ------------------ | ------ | ------------- | ----------- | -------------- |
|
|
21
|
+
| A | Divide by Folder | 3 | 4/5 (implied) | Medium-High | ~100 (est.) |
|
|
22
|
+
| B | Divide by Question | 3 | 4/5 | High (3/3) | 114 |
|
|
23
|
+
| C | Explore-Then-Dive | 6 | 4.17/5 | High (6/6) | 175 |
|
|
24
|
+
|
|
25
|
+
### Individual Agent Scores (Where Available)
|
|
26
|
+
|
|
27
|
+
| Agent | Strategy | Rating | Confidence |
|
|
28
|
+
| ----- | -------- | ------ | ---------- |
|
|
29
|
+
| B1 | B | 4/5 | Medium |
|
|
30
|
+
| B2 | B | 4/5 | High |
|
|
31
|
+
| B3 | B | 4/5 | High |
|
|
32
|
+
| C1 | C | 4/5 | High |
|
|
33
|
+
| C2 | C | 4/5 | High |
|
|
34
|
+
| C3 | C | 4/5 | High |
|
|
35
|
+
| C4 | C | 4/5 | High |
|
|
36
|
+
| C5 | C | 5/5 | High |
|
|
37
|
+
| C6 | C | 4/5 | High |
|
|
38
|
+
|
|
39
|
+
**Overall Average**: **4.06/5** (weighted by available ratings)
|
|
40
|
+
**Confidence Distribution**: 11/12 High, 1/12 Medium
|
|
41
|
+
|
|
42
|
+
---
|
|
43
|
+
|
|
44
|
+
## 3. What Worked Well (Consensus)
|
|
45
|
+
|
|
46
|
+
Features praised across multiple strategies, with frequency counts:
|
|
47
|
+
|
|
48
|
+
| Feature | Strategy A | Strategy B | Strategy C | Total Mentions | Notes |
|
|
49
|
+
| ---------------------------------------------------------- | ---------- | ---------- | ---------- | -------------- | ------------------------------------------------------------- |
|
|
50
|
+
| `mdcontext tree` - Document structure with token counts | 3/3 | 3/3 | 6/6 | **12/12** | "Perfect for planning", "Invaluable for prioritization" |
|
|
51
|
+
| `mdcontext context --section` - Precise section extraction | 3/3 | 3/3 | 5/6 | **11/12** | "Game-changer", "Surgical extraction", 44-61% token reduction |
|
|
52
|
+
| `mdcontext search` - Fast keyword discovery | 3/3 | 3/3 | 6/6 | **12/12** | "Found relevant content quickly", "Good context lines" |
|
|
53
|
+
| Token budgeting (`-t` flag) | 3/3 | 3/3 | 2/6 | **8/12** | "Respects limits while showing included/excluded" |
|
|
54
|
+
| `mdcontext stats` - Quick index overview | 2/3 | 2/3 | 3/6 | **7/12** | "Instant scope understanding" |
|
|
55
|
+
| Boolean search operators (AND/OR/quoted phrases) | 2/3 | 0/3 | 2/6 | **4/12** | "Worked as expected" |
|
|
56
|
+
| Fast indexing speed | 1/3 | 1/3 | 0/6 | **2/12** | 535ms for 23 docs, ~$0.003 cost |
|
|
57
|
+
| `mdcontext context --sections` - Section listing | 0/3 | 0/3 | 4/6 | **4/12** | "Essential for finding exact section names" |
|
|
58
|
+
|
|
59
|
+
### Consensus Highlights
|
|
60
|
+
|
|
61
|
+
1. **Universal Praise (All 3 Strategies)**:
|
|
62
|
+
- `tree` command for understanding document structure
|
|
63
|
+
- `context --section` for targeted extraction
|
|
64
|
+
- Basic keyword `search` functionality
|
|
65
|
+
- Token budget control
|
|
66
|
+
|
|
67
|
+
2. **The Optimal Workflow Pattern** (emerged from Strategy C):
|
|
68
|
+
1. `mdcontext tree <file>` - See structure and token counts
|
|
69
|
+
2. `mdcontext context --sections <file>` - Get exact section names
|
|
70
|
+
3. `mdcontext context <file> --section "X"` - Extract needed sections
|
|
71
|
+
4. `mdcontext search "term"` - Find cross-references
|
|
72
|
+
5. Repeat as needed
|
|
73
|
+
|
|
74
|
+
---
|
|
75
|
+
|
|
76
|
+
## 4. What Was Frustrating (Consensus)
|
|
77
|
+
|
|
78
|
+
Pain points identified across multiple strategies, with frequency counts:
|
|
79
|
+
|
|
80
|
+
| Issue | Strategy A | Strategy B | Strategy C | Total Mentions | Severity |
|
|
81
|
+
| ----------------------------------------------- | ---------- | ---------- | ---------- | -------------- | -------- |
|
|
82
|
+
| Multi-word search returns 0 results | 0/3 | 3/3 | 5/6 | **8/12** | High |
|
|
83
|
+
| Semantic search unreliable/returns 0 results | 3/3 | 3/3 | 4/6 | **10/12** | High |
|
|
84
|
+
| Search results capped at 10, no pagination | 2/3 | 0/3 | 4/6 | **6/12** | High |
|
|
85
|
+
| Directory-scoped search broken | 3/3 | 0/3 | 0/6 | **3/12** | Medium |
|
|
86
|
+
| Section name requires exact match | 0/3 | 0/3 | 3/6 | **3/12** | Medium |
|
|
87
|
+
| Token truncation unclear/unpredictable | 1/3 | 3/3 | 2/6 | **6/12** | Medium |
|
|
88
|
+
| Cannot search within specific file | 0/3 | 0/3 | 2/6 | **2/12** | Low |
|
|
89
|
+
| Cannot request multiple sections in one command | 0/3 | 0/3 | 2/6 | **2/12** | Low |
|
|
90
|
+
| Context command syntax confusion | 0/3 | 0/3 | 3/6 | **3/12** | Low |
|
|
91
|
+
| No way to chain or aggregate searches | 0/3 | 3/3 | 0/6 | **3/12** | Medium |
|
|
92
|
+
| False positives in keyword search | 0/3 | 2/3 | 0/6 | **2/12** | Low |
|
|
93
|
+
|
|
94
|
+
### Critical Issues (Blocking Workflows)
|
|
95
|
+
|
|
96
|
+
1. **Semantic Search Failure** (10/12 mentions)
|
|
97
|
+
- "All three agents found semantic search unreliable for multi-word conceptual queries"
|
|
98
|
+
- "All fell back to keyword search frequently"
|
|
99
|
+
- "Strongest cross-agent signal about the mdcontext tool"
|
|
100
|
+
- Root cause: Multi-word queries like "failure automation", "job context" return 0 results
|
|
101
|
+
|
|
102
|
+
2. **Search Result Cap** (6/12 mentions)
|
|
103
|
+
- "Hard to know if important results are being missed"
|
|
104
|
+
- "No pagination"
|
|
105
|
+
- "Sometimes wanted more matches"
|
|
106
|
+
|
|
107
|
+
3. **Directory-Scoped Search Broken** (3/12 mentions, but all in Strategy A)
|
|
108
|
+
- `mdcontext search "term" docs/` fails with "No index found" even when index exists
|
|
109
|
+
- Critical for multi-folder repositories
|
|
110
|
+
|
|
111
|
+
---
|
|
112
|
+
|
|
113
|
+
## 5. What Was Missing (Consensus)
|
|
114
|
+
|
|
115
|
+
Feature requests and gaps identified, with frequency counts:
|
|
116
|
+
|
|
117
|
+
| Missing Feature | Strategy A | Strategy B | Strategy C | Total Mentions |
|
|
118
|
+
| ----------------------------------------------------- | ---------- | ---------- | ---------- | -------------- |
|
|
119
|
+
| Local embedding option (no OpenAI API required) | 2/3 | 0/3 | 0/6 | **2/12** |
|
|
120
|
+
| Configurable/unlimited search results | 2/3 | 0/3 | 4/6 | **6/12** |
|
|
121
|
+
| Fuzzy/stemmed search ("fail" finds "failure") | 1/3 | 3/3 | 3/6 | **7/12** |
|
|
122
|
+
| Cross-file/multi-file operations | 2/3 | 2/3 | 1/6 | **5/12** |
|
|
123
|
+
| Search within results / progressive refinement | 0/3 | 3/3 | 0/6 | **3/12** |
|
|
124
|
+
| Hybrid semantic+keyword search mode | 0/3 | 3/3 | 0/6 | **3/12** |
|
|
125
|
+
| Export/save functionality | 0/3 | 3/3 | 0/6 | **3/12** |
|
|
126
|
+
| Cross-reference navigation | 1/3 | 1/3 | 1/6 | **3/12** |
|
|
127
|
+
| Relevance ranking for search results | 1/3 | 0/3 | 0/6 | **1/12** |
|
|
128
|
+
| Section exclusion in context | 1/3 | 0/3 | 0/6 | **1/12** |
|
|
129
|
+
| "What's undefined" query (terms used but not defined) | 0/3 | 1/3 | 0/6 | **1/12** |
|
|
130
|
+
| Duplicate content detection | 0/3 | 0/3 | 1/6 | **1/12** |
|
|
131
|
+
| AI-generated summaries of search results | 1/3 | 0/3 | 0/6 | **1/12** |
|
|
132
|
+
| Diff between documents | 1/3 | 0/3 | 0/6 | **1/12** |
|
|
133
|
+
| Semantic search threshold adjustment | 0/3 | 2/3 | 0/6 | **2/12** |
|
|
134
|
+
| Context around keyword matches without re-running | 0/3 | 1/3 | 0/6 | **1/12** |
|
|
135
|
+
| Batch context extraction for multiple sections/files | 0/3 | 1/3 | 2/6 | **3/12** |
|
|
136
|
+
| "Related sections" feature | 0/3 | 1/3 | 0/6 | **1/12** |
|
|
137
|
+
|
|
138
|
+
---
|
|
139
|
+
|
|
140
|
+
## 6. Feature-Specific Feedback
|
|
141
|
+
|
|
142
|
+
### 6.1 `mdcontext tree`
|
|
143
|
+
|
|
144
|
+
**Rating**: Excellent (12/12 positive mentions)
|
|
145
|
+
|
|
146
|
+
**What Works**:
|
|
147
|
+
|
|
148
|
+
- Document outlines with token counts per section
|
|
149
|
+
- "Perfect for planning", "Invaluable for prioritization"
|
|
150
|
+
- Fast execution
|
|
151
|
+
- Helps identify which sections are worth extracting
|
|
152
|
+
|
|
153
|
+
**Issues**:
|
|
154
|
+
|
|
155
|
+
- Section numbering inconsistency: tree shows "## 1. Section" but context uses "1.1" notation (Strategy A)
|
|
156
|
+
- No option to see nested depth limits
|
|
157
|
+
|
|
158
|
+
**Recommendations**:
|
|
159
|
+
|
|
160
|
+
- Maintain as-is; this is the strongest feature
|
|
161
|
+
- Consider adding depth limit option for very deep documents
|
|
162
|
+
|
|
163
|
+
---
|
|
164
|
+
|
|
165
|
+
### 6.2 `mdcontext search`
|
|
166
|
+
|
|
167
|
+
**Rating**: Mixed (keyword good, semantic problematic)
|
|
168
|
+
|
|
169
|
+
#### Keyword Search
|
|
170
|
+
|
|
171
|
+
**What Works**:
|
|
172
|
+
|
|
173
|
+
- Reliable and essential fallback when semantic search fails
|
|
174
|
+
- Boolean operators (AND/OR/quoted phrases) work well
|
|
175
|
+
- Good context lines around matches
|
|
176
|
+
|
|
177
|
+
**Issues**:
|
|
178
|
+
|
|
179
|
+
- Multi-word searches fail: "failure automation", "job context", "pause resume terminate" return 0 results (8/12 mentions)
|
|
180
|
+
- 10 result cap with no pagination (6/12 mentions)
|
|
181
|
+
- Cannot search within specific file or directory (5/12 mentions)
|
|
182
|
+
- No stemmed/fuzzy matching: "suggest" doesn't find "suggestion" (7/12 mentions)
|
|
183
|
+
- False positives reported (2/12 mentions)
|
|
184
|
+
|
|
185
|
+
#### Semantic Search
|
|
186
|
+
|
|
187
|
+
**What Works**:
|
|
188
|
+
|
|
189
|
+
- Fast embedding indexing (~$0.003 cost)
|
|
190
|
+
- Works better for concrete concepts (workflows, collaboration) than abstract critiques (gaps, criticisms) (Strategy B observation)
|
|
191
|
+
|
|
192
|
+
**Issues**:
|
|
193
|
+
|
|
194
|
+
- Returns 0 results for multi-word conceptual queries (10/12 mentions)
|
|
195
|
+
- Requires external API key (OpenAI) - barrier to adoption (2/12 mentions)
|
|
196
|
+
- No threshold adjustment available
|
|
197
|
+
- All agents fell back to keyword search frequently
|
|
198
|
+
|
|
199
|
+
**Critical Observation** (Strategy B):
|
|
200
|
+
|
|
201
|
+
> "B3 (workflows) used semantic search exclusively and found it more effective for their domain. B1 and B2 heavily relied on keyword search after semantic search failed. This suggests semantic search may work better for concrete concepts (workflows, collaboration) than abstract critiques (gaps, criticisms)."
|
|
202
|
+
|
|
203
|
+
---
|
|
204
|
+
|
|
205
|
+
### 6.3 `mdcontext context`
|
|
206
|
+
|
|
207
|
+
**Rating**: Very Good (11/12 positive mentions for `--section`)
|
|
208
|
+
|
|
209
|
+
**What Works**:
|
|
210
|
+
|
|
211
|
+
- Precise section extraction with `--section` flag
|
|
212
|
+
- 44-61% token reduction while preserving key content
|
|
213
|
+
- Token budgeting with `-t` flag
|
|
214
|
+
- `--sections` flag for listing available sections
|
|
215
|
+
|
|
216
|
+
**Issues**:
|
|
217
|
+
|
|
218
|
+
- Section name requires exact match (3/12 mentions)
|
|
219
|
+
- Token truncation unpredictable/unclear (6/12 mentions)
|
|
220
|
+
- "100% reduction on small files" (Strategy A)
|
|
221
|
+
- "36% shown with no explicit warning" (Strategy C)
|
|
222
|
+
- Context duplication: `--section "Time Travel"` returned same section twice (parent and subsection) (Strategy A)
|
|
223
|
+
- Cannot request multiple sections in one command (2/12 mentions)
|
|
224
|
+
- Initial syntax confusion (positional arguments vs flags) (3/12 mentions)
|
|
225
|
+
|
|
226
|
+
**Recommendations**:
|
|
227
|
+
|
|
228
|
+
- Add fuzzy/partial section name matching
|
|
229
|
+
- Add explicit "section won't fit" warning
|
|
230
|
+
- Support multiple `--section` flags in one command
|
|
231
|
+
- Fix duplication bug
|
|
232
|
+
|
|
233
|
+
---
|
|
234
|
+
|
|
235
|
+
### 6.4 `mdcontext stats`
|
|
236
|
+
|
|
237
|
+
**Rating**: Good (7/12 positive mentions)
|
|
238
|
+
|
|
239
|
+
**What Works**:
|
|
240
|
+
|
|
241
|
+
- Quick overview of index size and distribution
|
|
242
|
+
- "Instant scope understanding"
|
|
243
|
+
- Useful for understanding corpus size before diving in
|
|
244
|
+
|
|
245
|
+
**Issues**:
|
|
246
|
+
|
|
247
|
+
- No specific issues reported
|
|
248
|
+
|
|
249
|
+
---
|
|
250
|
+
|
|
251
|
+
### 6.5 `mdcontext index`
|
|
252
|
+
|
|
253
|
+
**Rating**: Good (2/12 explicit mentions, but used by all)
|
|
254
|
+
|
|
255
|
+
**What Works**:
|
|
256
|
+
|
|
257
|
+
- Fast: 535ms for 23 docs
|
|
258
|
+
- Low cost: ~$0.003 for embeddings
|
|
259
|
+
- Required for search to work
|
|
260
|
+
|
|
261
|
+
**Issues**:
|
|
262
|
+
|
|
263
|
+
- Directory-scoped search fails even with existing index (Strategy A - all 3 agents)
|
|
264
|
+
- Semantic search unreliability may be an indexing issue
|
|
265
|
+
|
|
266
|
+
---
|
|
267
|
+
|
|
268
|
+
## 7. Priority Improvements
|
|
269
|
+
|
|
270
|
+
### P0 (Critical): Mentioned by All 3 Strategies
|
|
271
|
+
|
|
272
|
+
| Issue | Impact | Recommendation |
|
|
273
|
+
| ------------------------------------------------------------ | ----------------------------------------- | ---------------------------------------------------------------------------------------- |
|
|
274
|
+
| **Semantic search returns 0 results for multi-word queries** | Agents abandoned semantic search entirely | Debug and fix multi-word query handling; consider hybrid mode that falls back to keyword |
|
|
275
|
+
| **10 result limit with no pagination** | Users miss important results | Add configurable limit (`--limit 50`) and/or pagination |
|
|
276
|
+
| **Multi-word keyword search fails** | Basic search workflows broken | Support phrase matching by default; document quoting requirements |
|
|
277
|
+
|
|
278
|
+
### P1 (High): Mentioned by 2 Strategies
|
|
279
|
+
|
|
280
|
+
| Issue | Strategies | Recommendation |
|
|
281
|
+
| ------------------------------------- | -------------- | ---------------------------------------------------------------------- |
|
|
282
|
+
| **Token truncation unclear** | A, B | Add explicit warning when content is truncated; show what was excluded |
|
|
283
|
+
| **Directory-scoped search broken** | A (all agents) | Fix path filtering: `mdcontext search "term" docs/` should work |
|
|
284
|
+
| **No fuzzy/stemmed search** | B, C | Add stemming support: "suggest" should match "suggestion" |
|
|
285
|
+
| **No cross-file operations** | A, B | Add batch context extraction: `mdcontext context docs/*.md -t 10000` |
|
|
286
|
+
| **Section name requires exact match** | B, C | Add fuzzy section name matching |
|
|
287
|
+
|
|
288
|
+
### P2 (Medium): Mentioned by 1 Strategy but Significant Impact
|
|
289
|
+
|
|
290
|
+
| Issue | Strategy | Recommendation |
|
|
291
|
+
| ------------------------------------ | -------- | ---------------------------------------------------------------------------------------- |
|
|
292
|
+
| **No local embedding option** | A | Support local embedding models (e.g., sentence-transformers) to remove OpenAI dependency |
|
|
293
|
+
| **No search within results** | B | Add progressive refinement: search within previous results |
|
|
294
|
+
| **Cannot request multiple sections** | C | Support multiple `--section` flags: `--section "A" --section "B"` |
|
|
295
|
+
| **Context command syntax confusion** | C | Improve help text and error messages for positional vs flag arguments |
|
|
296
|
+
| **No hybrid semantic+keyword mode** | B | Auto-fall back to keyword when semantic returns 0 results |
|
|
297
|
+
| **No relevance ranking** | A | Sort results by relevance, not document order |
|
|
298
|
+
| **No export/save functionality** | B | Add `--output` flag to save results to file |
|
|
299
|
+
| **Context duplication bug** | A | Fix `--section` returning parent and child when names overlap |
|
|
300
|
+
|
|
301
|
+
---
|
|
302
|
+
|
|
303
|
+
## 8. Methodology Comparison
|
|
304
|
+
|
|
305
|
+
### Which Strategy Found the Tool Most Effective?
|
|
306
|
+
|
|
307
|
+
**Strategy C (Explore-Then-Dive)**: 4.17/5 average, 100% high confidence
|
|
308
|
+
|
|
309
|
+
Strategy C found the tool most effective because:
|
|
310
|
+
|
|
311
|
+
1. **Systematic workflow**: The two-phase approach (map then dive) matched the tool's strengths
|
|
312
|
+
2. **Single-file focus**: Divers could use `tree` -> `context --sections` -> `context --section` workflow effectively
|
|
313
|
+
3. **Clear boundaries**: Each diver had a focused theme, reducing need for cross-file operations
|
|
314
|
+
4. **High command efficiency**: 96% of 175 commands were useful
|
|
315
|
+
|
|
316
|
+
### Which Strategy Found the Tool Least Effective?
|
|
317
|
+
|
|
318
|
+
**Strategy B (Divide by Question)**: 4/5 average, but most frustration expressed
|
|
319
|
+
|
|
320
|
+
Strategy B found the tool least effective because:
|
|
321
|
+
|
|
322
|
+
1. **Question-based research requires cross-cutting search**: Agents needed to find concepts across all files
|
|
323
|
+
2. **Heavy reliance on search**: 69 of 114 commands were searches (vs. context extraction)
|
|
324
|
+
3. **Semantic search failures most pronounced**: All 3 agents explicitly noted semantic search unreliability
|
|
325
|
+
4. **Abstract queries**: Questions like "gaps" and "criticisms" don't map well to keyword search
|
|
326
|
+
|
|
327
|
+
### Key Insight
|
|
328
|
+
|
|
329
|
+
The tool works best for **systematic, file-by-file exploration** (Strategy C) and struggles with **cross-cutting conceptual queries** (Strategy B). This suggests prioritizing:
|
|
330
|
+
|
|
331
|
+
1. Better semantic search for conceptual exploration
|
|
332
|
+
2. Cross-file operations for question-based research
|
|
333
|
+
3. Maintaining the excellent tree/context workflow for deep dives
|
|
334
|
+
|
|
335
|
+
---
|
|
336
|
+
|
|
337
|
+
## 9. Actionable Summary
|
|
338
|
+
|
|
339
|
+
### Immediate Fixes (This Sprint)
|
|
340
|
+
|
|
341
|
+
1. Fix multi-word semantic search returning 0 results
|
|
342
|
+
2. Add `--limit` flag to search command
|
|
343
|
+
3. Fix directory-scoped search path filtering
|
|
344
|
+
|
|
345
|
+
### Short-Term Improvements (Next 2-4 Weeks)
|
|
346
|
+
|
|
347
|
+
1. Add fuzzy/stemmed keyword search
|
|
348
|
+
2. Add explicit truncation warnings
|
|
349
|
+
3. Support multiple `--section` flags
|
|
350
|
+
4. Add hybrid semantic+keyword search mode
|
|
351
|
+
|
|
352
|
+
### Medium-Term Enhancements (Next Quarter)
|
|
353
|
+
|
|
354
|
+
1. Add local embedding support (remove OpenAI dependency)
|
|
355
|
+
2. Add cross-file batch operations
|
|
356
|
+
3. Add search-within-results / progressive refinement
|
|
357
|
+
4. Add relevance ranking
|
|
358
|
+
5. Add export functionality
|
|
359
|
+
|
|
360
|
+
### Maintain (Do Not Regress)
|
|
361
|
+
|
|
362
|
+
1. `tree` command with token counts
|
|
363
|
+
2. `context --section` precise extraction
|
|
364
|
+
3. Token budgeting (`-t` flag)
|
|
365
|
+
4. Fast indexing
|
|
366
|
+
5. Boolean search operators
|
|
367
|
+
|
|
368
|
+
---
|
|
369
|
+
|
|
370
|
+
_Report generated from Strategy A, B, and C synthesis reports_
|
|
371
|
+
_Total agents contributing: 12 (3 + 3 + 6)_
|
|
372
|
+
_Total commands analyzed: ~389 (100 + 114 + 175)_
|
|
373
|
+
_Documentation corpus: ~207K tokens across 23 files_
|
|
@@ -0,0 +1,184 @@
|
|
|
1
|
+
# A-Synth: Strategy A Synthesis
|
|
2
|
+
|
|
3
|
+
## Executive Summary
|
|
4
|
+
|
|
5
|
+
Three parallel agents analyzed the HumanWork specification across three documentation folders, discovering a comprehensive multi-agent orchestration system with 20 formal primitives, three-layer memory architecture, and preemptive human control. The analysis surfaced consistent tool feedback: mdcontext excels at targeted extraction via `tree`, `context`, and `search` commands but lacks directory-scoped search and semantic search by default. Agents collectively processed approximately 207K tokens of documentation while reading only ~25-30% of raw content.
|
|
6
|
+
|
|
7
|
+
## Cross-Agent Patterns
|
|
8
|
+
|
|
9
|
+
### Themes Appearing Across Multiple Folders
|
|
10
|
+
|
|
11
|
+
1. **Human Authority as Non-Negotiable** (all 3 agents)
|
|
12
|
+
- A1: "Control Plane has preemptive authority over all execution"
|
|
13
|
+
- A2: "Authority Gradient" - dynamic human involvement spectrum
|
|
14
|
+
- A3: "The system helps humans think better - it never decides for them"
|
|
15
|
+
|
|
16
|
+
2. **Three-Layer Memory Architecture** (all 3 agents)
|
|
17
|
+
- Event Memory (immutable facts/ground truth)
|
|
18
|
+
- Status Memory (derived operational views)
|
|
19
|
+
- Semantic Memory (AI-assisted understanding)
|
|
20
|
+
|
|
21
|
+
3. **Choreography vs Automation** (A2, A3)
|
|
22
|
+
- Pure automation fails for knowledge work
|
|
23
|
+
- "Choreography" as dynamic coordination metaphor
|
|
24
|
+
- Trust through transparency, not assumed reliability
|
|
25
|
+
|
|
26
|
+
4. **Immutability as Foundation** (A1, A3)
|
|
27
|
+
- Append-only event sourcing
|
|
28
|
+
- Time travel as query, not feature
|
|
29
|
+
- No hidden mutable state
|
|
30
|
+
|
|
31
|
+
5. **Trust Through Transparency** (all 3 agents)
|
|
32
|
+
- Observable behavior over assumed reliability
|
|
33
|
+
- Explicit audit trails
|
|
34
|
+
- Radical transparency enabling intervention
|
|
35
|
+
|
|
36
|
+
6. **Workflows as Guidance Not Law** (A1, A2)
|
|
37
|
+
- Reusable coordination patterns
|
|
38
|
+
- No prescriptive execution logic
|
|
39
|
+
- Human judgment for edge cases
|
|
40
|
+
|
|
41
|
+
## Consolidated Findings
|
|
42
|
+
|
|
43
|
+
### docs/ Folder Summary (from A1)
|
|
44
|
+
|
|
45
|
+
**10 markdown files, ~56K tokens**
|
|
46
|
+
|
|
47
|
+
Key Points:
|
|
48
|
+
|
|
49
|
+
- **20 formal primitives** with explicit guarantees defined in 02-PRIMITIVES.md
|
|
50
|
+
- **8 architectural invariants** as hard system rules (no hidden state, no irreversible execution, no agent-owned memory, etc.)
|
|
51
|
+
- **7 execution guarantees** (single-scope coherence, human control, deterministic replay, cost attribution, no hidden state, agent replaceability, workflow interoperability)
|
|
52
|
+
- Explicit separation of concerns: Execution / Authority / History / Understanding
|
|
53
|
+
- Target use cases: architecture reviews, security assessments, R&D exploration, planning sessions
|
|
54
|
+
|
|
55
|
+
Notable Discovery: Control Plane actions include pause, resume, cancel, reassign, modify_metadata, inject_step, fork, terminate - comprehensive human override capabilities.
|
|
56
|
+
|
|
57
|
+
### docs.amorphic/ Folder Summary (from A2)
|
|
58
|
+
|
|
59
|
+
**9 markdown files, ~62K tokens**
|
|
60
|
+
|
|
61
|
+
Key Points:
|
|
62
|
+
|
|
63
|
+
- **The Handoff Problem** identified as core issue - friction between human creative processes and systematic execution
|
|
64
|
+
- **Six Failure Modes of Pure Automation**:
|
|
65
|
+
1. Brittleness of Complete Systems (combinatorial rule explosion)
|
|
66
|
+
2. Coordination Trap (multiplies overhead)
|
|
67
|
+
3. Innovation Strangulation (automation-incompatible = avoided)
|
|
68
|
+
4. Human Bottleneck Paradox (routing around humans creates new bottlenecks)
|
|
69
|
+
5. Context Collapse (no mechanism for negotiation or doubt)
|
|
70
|
+
6. Judgment Gap (absence of judgment in edge cases)
|
|
71
|
+
- **Authority Gradient** concept: Instructional -> Consultative -> Supervisory -> Exploratory modes
|
|
72
|
+
- **Choreographic Maturity Model** with 4 levels from Tool Usage to Organizational Intelligence
|
|
73
|
+
- Memory as "connective tissue" and "geometric imprints" that participate in reasoning
|
|
74
|
+
|
|
75
|
+
Notable Discovery: "The result is brittle automation that works beautifully until it doesn't - systems that handle 80% of cases flawlessly but create chaos in the remaining 20%."
|
|
76
|
+
|
|
77
|
+
### docs.llm/ Folder Summary (from A3)
|
|
78
|
+
|
|
79
|
+
**3 markdown files, ~89K tokens**
|
|
80
|
+
|
|
81
|
+
Key Points:
|
|
82
|
+
|
|
83
|
+
- LLM-generated product vision documents exploring "HumanWork/Amorphic" platform
|
|
84
|
+
- **"Operating System for Work"** positioning - bridges human creativity with AI execution
|
|
85
|
+
- **Anti-Automation Stance** articulated with philosophical depth
|
|
86
|
+
- **Explicit Anti-Patterns** documented as guardrails (never hide events, never let workflows execute directly)
|
|
87
|
+
- **Research Frontiers** identified: organizational capture, maintaining human agency, value alignment
|
|
88
|
+
- **"Memory Is All You Need"** parallel drawn to Transformer attention breakthrough
|
|
89
|
+
|
|
90
|
+
Notable Discovery: "Enterprise adoption of autonomous agents has stalled due to Opacity (we don't know how the agent works) and Risk (we can't trust it to run unsupervised)."
|
|
91
|
+
|
|
92
|
+
## Proposed Spec Changes (Prioritized)
|
|
93
|
+
|
|
94
|
+
### High Priority
|
|
95
|
+
|
|
96
|
+
- [ ] **Add directory-scoped search capability** (all 3 agents) - `mdcontext search "term" docs/` currently fails with "No index found" even when index exists
|
|
97
|
+
- [ ] **Enable semantic search without requiring OPENAI_API_KEY by default** (A1, A3) - Consider local embedding options or better fallback messaging
|
|
98
|
+
- [ ] **Increase search result limit beyond 10** (A2, A3) - Add pagination or configurable limit
|
|
99
|
+
- [ ] **Document the Authority Gradient model from docs.amorphic** (A2) - This concept enriches the spec's human control philosophy
|
|
100
|
+
- [ ] **Add explicit failure mode documentation** (A2) - The six failure modes are valuable for explaining why the architecture matters
|
|
101
|
+
|
|
102
|
+
### Medium Priority
|
|
103
|
+
|
|
104
|
+
- [ ] **Add cross-reference navigation** (A1) - "Find all sections that reference this concept"
|
|
105
|
+
- [ ] **Multi-file context extraction** (A2) - `mdcontext context docs/*.md -t 10000`
|
|
106
|
+
- [ ] **Relevance ranking for search results** (A3) - Results come in document order, not relevance order
|
|
107
|
+
- [ ] **Section exclusion in context** (A3) - "Give me everything EXCEPT this section"
|
|
108
|
+
- [ ] **Stemmed/fuzzy search** (A2) - Searching "suggest" should find "suggestion"
|
|
109
|
+
|
|
110
|
+
### Low Priority
|
|
111
|
+
|
|
112
|
+
- [ ] **Section numbering consistency** (A1) - tree shows "## 1. Section" but context uses "1.1" notation
|
|
113
|
+
- [ ] **Fix context duplication** (A1) - `--section "Time Travel"` returned same section twice (parent and subsection)
|
|
114
|
+
- [ ] **Improve blurb/small file context handling** (A2) - `_0.BLURB.md` got 100% reduction to nothing
|
|
115
|
+
- [ ] **Export to structured format** (A3) - Programmatic extraction of findings
|
|
116
|
+
- [ ] **Highlight/annotation capability** (A3) - Mark sections for later reference
|
|
117
|
+
|
|
118
|
+
## Tool Evaluation Synthesis
|
|
119
|
+
|
|
120
|
+
### Common Praise
|
|
121
|
+
|
|
122
|
+
- **`tree` command**: All agents found it excellent for understanding document structure; token counts per section particularly helpful
|
|
123
|
+
- **`context --section` flag**: Precise extraction of specific sections; 44-61% token reduction while preserving key content
|
|
124
|
+
- **Boolean search operators**: AND/OR/quoted phrases worked well for targeted exploration
|
|
125
|
+
- **Stats command**: Quick overview of index size and distribution
|
|
126
|
+
- **Token budgeting**: `-t` flag and token counts throughout helped with context management
|
|
127
|
+
- **Index speed**: 535ms for 23 docs is fast (A1)
|
|
128
|
+
|
|
129
|
+
### Common Frustrations
|
|
130
|
+
|
|
131
|
+
- **Directory-scoped search broken**: All 3 agents reported `mdcontext search "..." docs/` fails with "No index found" even with existing index
|
|
132
|
+
- **10 result limit with no pagination**: Hard to know if important results are being missed
|
|
133
|
+
- **Semantic search requires external API key**: Keyword search adequate but limited without embeddings
|
|
134
|
+
- **Context truncation unpredictable**: A2 saw 100% reduction on small files; A3 felt 16% was limiting
|
|
135
|
+
|
|
136
|
+
### Suggested Improvements
|
|
137
|
+
|
|
138
|
+
1. **Directory/path filtering for search** - Critical for multi-folder repos
|
|
139
|
+
2. **Configurable result limits** - Let users specify max results
|
|
140
|
+
3. **Local embedding option** - Don't require OpenAI API for semantic search
|
|
141
|
+
4. **Cross-file operations** - Search + get context across multiple files at once
|
|
142
|
+
5. **Negative filtering** - Exclude certain directories or patterns
|
|
143
|
+
6. **Diff between documents** - Compare two docs or see overlap (A1)
|
|
144
|
+
7. **Summary generation** - AI-generated summaries of search results (A3)
|
|
145
|
+
|
|
146
|
+
## Methodology Assessment
|
|
147
|
+
|
|
148
|
+
### How well did Strategy A (divide by folder) work?
|
|
149
|
+
|
|
150
|
+
**Strengths:**
|
|
151
|
+
|
|
152
|
+
- **Natural domain boundaries**: Each folder had distinct content character (spec vs feedback vs LLM exploration)
|
|
153
|
+
- **Parallel efficiency**: Three agents could work simultaneously without coordination overhead
|
|
154
|
+
- **Complete coverage**: No content was missed; each folder fully analyzed
|
|
155
|
+
- **Consistent evaluation**: Same tool evaluated from three perspectives
|
|
156
|
+
- **Clear ownership**: Each agent knew exactly what to analyze
|
|
157
|
+
|
|
158
|
+
**Weaknesses:**
|
|
159
|
+
|
|
160
|
+
- **Cross-folder themes required synthesis**: Patterns like "Authority Gradient" appeared in multiple folders but agents couldn't discover connections
|
|
161
|
+
- **Uneven workload**: A3 had 89K tokens while A1 had 56K tokens - not perfectly balanced
|
|
162
|
+
- **Redundant tool exploration**: Each agent independently learned the tool, running similar `--help` commands
|
|
163
|
+
- **No real-time collaboration**: If A1 discovered something relevant to A2's folder, no mechanism to share
|
|
164
|
+
|
|
165
|
+
**Would recommend for:**
|
|
166
|
+
|
|
167
|
+
- **Large documentation corpora** with clear organizational boundaries
|
|
168
|
+
- **Initial exploration** when you don't know what you're looking for
|
|
169
|
+
- **Tool evaluation** where multiple perspectives strengthen findings
|
|
170
|
+
- **Time-sensitive analysis** where parallelism matters
|
|
171
|
+
- **Codebases with modular architecture** (by module/feature/team)
|
|
172
|
+
|
|
173
|
+
**Would NOT recommend for:**
|
|
174
|
+
|
|
175
|
+
- Highly interconnected content where cross-references are important
|
|
176
|
+
- Small corpora where single-agent analysis is sufficient
|
|
177
|
+
- Tasks requiring iterative refinement based on discoveries
|
|
178
|
+
|
|
179
|
+
---
|
|
180
|
+
|
|
181
|
+
_Synthesis completed by A-Synth_
|
|
182
|
+
_Input: 3 Phase 1 reports (A1, A2, A3)_
|
|
183
|
+
_Total tokens processed by agents: ~207K tokens of documentation_
|
|
184
|
+
_Total tokens actually read: ~25-30% via mdcontext context extraction_
|