mdcontext 0.0.1 → 0.2.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.changeset/README.md +28 -0
- package/.changeset/config.json +11 -0
- package/.claude/settings.local.json +25 -0
- package/.github/workflows/ci.yml +83 -0
- package/.github/workflows/claude-code-review.yml +44 -0
- package/.github/workflows/claude.yml +85 -0
- package/.github/workflows/release.yml +113 -0
- package/.tldrignore +112 -0
- package/BACKLOG.md +338 -0
- package/CONTRIBUTING.md +186 -0
- package/NOTES/NOTES +44 -0
- package/README.md +434 -11
- package/biome.json +36 -0
- package/cspell.config.yaml +14 -0
- package/dist/chunk-23UPXDNL.js +3044 -0
- package/dist/chunk-2W7MO2DL.js +1366 -0
- package/dist/chunk-3NUAZGMA.js +1689 -0
- package/dist/chunk-7TOWB2XB.js +366 -0
- package/dist/chunk-7XOTOADQ.js +3065 -0
- package/dist/chunk-AH2PDM2K.js +3042 -0
- package/dist/chunk-BNXWSZ63.js +3742 -0
- package/dist/chunk-BTL5DJVU.js +3222 -0
- package/dist/chunk-HDHYG7E4.js +104 -0
- package/dist/chunk-HLR4KZBP.js +3234 -0
- package/dist/chunk-IP3FRFEB.js +1045 -0
- package/dist/chunk-KHU56VDO.js +3042 -0
- package/dist/chunk-KRYIFLQR.js +88 -0
- package/dist/chunk-LBSDNLEM.js +287 -0
- package/dist/chunk-MNTQ7HCP.js +2643 -0
- package/dist/chunk-MUJELQQ6.js +1387 -0
- package/dist/chunk-MXJGMSLV.js +2199 -0
- package/dist/chunk-N6QJGC3Z.js +2636 -0
- package/dist/chunk-OBELGBPM.js +1713 -0
- package/dist/chunk-OT7R5XTA.js +3192 -0
- package/dist/chunk-P7X4RA2T.js +106 -0
- package/dist/chunk-PIDUQNC2.js +3185 -0
- package/dist/chunk-POGCDIH4.js +3187 -0
- package/dist/chunk-PSIEOQGZ.js +3043 -0
- package/dist/chunk-PVRT3IHA.js +3238 -0
- package/dist/chunk-QNN4TT23.js +1430 -0
- package/dist/chunk-RE3R45RJ.js +3042 -0
- package/dist/chunk-S7E6TFX6.js +803 -0
- package/dist/chunk-SG6GLU4U.js +1378 -0
- package/dist/chunk-SJCDV2ST.js +274 -0
- package/dist/chunk-SYE5XLF3.js +104 -0
- package/dist/chunk-T5VLYBZD.js +103 -0
- package/dist/chunk-TOQB7VWU.js +3238 -0
- package/dist/chunk-VFNMZ4ZQ.js +3228 -0
- package/dist/chunk-VVTGZNBT.js +1629 -0
- package/dist/chunk-W7Q4RFEV.js +104 -0
- package/dist/chunk-XTYYVRLO.js +3190 -0
- package/dist/chunk-Y6MDYVJD.js +3063 -0
- package/dist/cli/main.d.ts +1 -0
- package/dist/cli/main.js +5458 -0
- package/dist/index.d.ts +653 -0
- package/dist/index.js +79 -0
- package/dist/mcp/server.d.ts +1 -0
- package/dist/mcp/server.js +472 -0
- package/dist/schema-BAWSG7KY.js +22 -0
- package/dist/schema-E3QUPL26.js +20 -0
- package/dist/schema-EHL7WUT6.js +20 -0
- package/docs/019-USAGE.md +625 -0
- package/docs/020-current-implementation.md +364 -0
- package/docs/021-DOGFOODING-FINDINGS.md +175 -0
- package/docs/BACKLOG.md +80 -0
- package/docs/CONFIG.md +1123 -0
- package/docs/DESIGN.md +439 -0
- package/docs/ERRORS.md +383 -0
- package/docs/PROJECT.md +88 -0
- package/docs/ROADMAP.md +407 -0
- package/docs/summarization.md +320 -0
- package/docs/test-links.md +9 -0
- package/justfile +40 -0
- package/package.json +74 -9
- package/pnpm-workspace.yaml +5 -0
- package/research/INDEX.md +315 -0
- package/research/code-review/README.md +90 -0
- package/research/code-review/cli-error-handling-review.md +979 -0
- package/research/code-review/code-review-validation-report.md +464 -0
- package/research/code-review/main-ts-review.md +1128 -0
- package/research/config-analysis/01-current-implementation.md +470 -0
- package/research/config-analysis/02-strategy-recommendation.md +428 -0
- package/research/config-analysis/03-task-candidates.md +715 -0
- package/research/config-analysis/033-research-configuration-management.md +828 -0
- package/research/config-analysis/034-research-effect-cli-config.md +1504 -0
- package/research/config-analysis/04-consolidated-task-candidates.md +277 -0
- package/research/config-docs/SUMMARY.md +357 -0
- package/research/config-docs/TEST-RESULTS.md +776 -0
- package/research/config-docs/TODO.md +542 -0
- package/research/config-docs/analysis.md +744 -0
- package/research/config-docs/fix-validation.md +502 -0
- package/research/config-docs/help-audit.md +264 -0
- package/research/config-docs/help-system-analysis.md +890 -0
- package/research/dogfood/consolidated-tool-evaluation.md +373 -0
- package/research/dogfood/strategy-a/a-synthesis.md +184 -0
- package/research/dogfood/strategy-a/a1-docs.md +226 -0
- package/research/dogfood/strategy-a/a2-amorphic.md +156 -0
- package/research/dogfood/strategy-a/a3-llm.md +164 -0
- package/research/dogfood/strategy-b/b-synthesis.md +228 -0
- package/research/dogfood/strategy-b/b1-architecture.md +207 -0
- package/research/dogfood/strategy-b/b2-gaps.md +258 -0
- package/research/dogfood/strategy-b/b3-workflows.md +250 -0
- package/research/dogfood/strategy-c/c-synthesis.md +451 -0
- package/research/dogfood/strategy-c/c1-explorer.md +192 -0
- package/research/dogfood/strategy-c/c2-diver-memory.md +145 -0
- package/research/dogfood/strategy-c/c3-diver-control.md +148 -0
- package/research/dogfood/strategy-c/c4-diver-failure.md +151 -0
- package/research/dogfood/strategy-c/c5-diver-execution.md +221 -0
- package/research/dogfood/strategy-c/c6-diver-org.md +221 -0
- package/research/effect-cli-error-handling.md +845 -0
- package/research/effect-errors-as-values.md +943 -0
- package/research/errors-task-analysis/00-consolidated-tasks.md +207 -0
- package/research/errors-task-analysis/cli-commands-analysis.md +909 -0
- package/research/errors-task-analysis/embeddings-analysis.md +709 -0
- package/research/errors-task-analysis/index-search-analysis.md +812 -0
- package/research/frontmatter/COMMENTS-ARE-SKIPPED.md +149 -0
- package/research/frontmatter/LLM-CODE-NAVIGATION.md +276 -0
- package/research/issue-review.md +603 -0
- package/research/llm-summarization/agent-cli-tools-2026.md +1082 -0
- package/research/llm-summarization/alternative-providers-2026.md +1428 -0
- package/research/llm-summarization/anthropic-2026.md +367 -0
- package/research/llm-summarization/claude-cli-integration.md +1706 -0
- package/research/llm-summarization/cli-integration-patterns.md +3155 -0
- package/research/llm-summarization/openai-2026.md +473 -0
- package/research/llm-summarization/openai-compatible-providers-2026.md +1022 -0
- package/research/llm-summarization/opencode-cli-integration.md +1552 -0
- package/research/llm-summarization/prompt-engineering-2026.md +1426 -0
- package/research/llm-summarization/prototype-results.md +56 -0
- package/research/llm-summarization/provider-switching-patterns-2026.md +2153 -0
- package/research/llm-summarization/typescript-llm-libraries-2026.md +2436 -0
- package/research/mdcontext-error-analysis.md +521 -0
- package/research/mdcontext-pudding/00-EXECUTIVE-SUMMARY.md +282 -0
- package/research/mdcontext-pudding/01-index-embed.md +956 -0
- package/research/mdcontext-pudding/02-search-COMMANDS.md +142 -0
- package/research/mdcontext-pudding/02-search-SUMMARY.md +146 -0
- package/research/mdcontext-pudding/02-search.md +970 -0
- package/research/mdcontext-pudding/03-context.md +779 -0
- package/research/mdcontext-pudding/04-navigation-and-analytics.md +803 -0
- package/research/mdcontext-pudding/04-tree.md +704 -0
- package/research/mdcontext-pudding/05-config.md +1038 -0
- package/research/mdcontext-pudding/06-links-summary.txt +87 -0
- package/research/mdcontext-pudding/06-links.md +679 -0
- package/research/mdcontext-pudding/07-stats.md +693 -0
- package/research/mdcontext-pudding/BUG-FIX-PLAN.md +388 -0
- package/research/mdcontext-pudding/P0-BUG-VALIDATION.md +167 -0
- package/research/mdcontext-pudding/README.md +168 -0
- package/research/mdcontext-pudding/TESTING-SUMMARY.md +128 -0
- package/research/npm_publish/011-npm-workflow-research-agent2.md +792 -0
- package/research/npm_publish/012-npm-workflow-research-agent1.md +530 -0
- package/research/npm_publish/013-npm-workflow-research-agent3.md +722 -0
- package/research/npm_publish/014-npm-workflow-synthesis.md +556 -0
- package/research/npm_publish/031-npm-workflow-task-analysis.md +134 -0
- package/research/research-quality-review.md +834 -0
- package/research/semantic-search/002-research-embedding-models.md +490 -0
- package/research/semantic-search/003-research-rag-alternatives.md +523 -0
- package/research/semantic-search/004-research-vector-search.md +841 -0
- package/research/semantic-search/032-research-semantic-search.md +427 -0
- package/research/semantic-search/embedding-text-analysis.md +156 -0
- package/research/semantic-search/multi-word-failure-reproduction.md +171 -0
- package/research/semantic-search/query-processing-analysis.md +207 -0
- package/research/semantic-search/root-cause-and-solution.md +114 -0
- package/research/semantic-search/threshold-validation-report.md +69 -0
- package/research/semantic-search/vector-search-analysis.md +63 -0
- package/research/task-management-2026/00-synthesis-recommendations.md +295 -0
- package/research/task-management-2026/01-ai-workflow-tools.md +416 -0
- package/research/task-management-2026/02-agent-framework-patterns.md +476 -0
- package/research/task-management-2026/03-lightweight-file-based.md +567 -0
- package/research/task-management-2026/04-established-tools-ai-features.md +541 -0
- package/research/task-management-2026/linear/01-core-features-workflow.md +771 -0
- package/research/task-management-2026/linear/02-api-integrations.md +930 -0
- package/research/task-management-2026/linear/03-ai-features.md +368 -0
- package/research/task-management-2026/linear/04-pricing-setup.md +205 -0
- package/research/task-management-2026/linear/05-usage-patterns-best-practices.md +605 -0
- package/research/test-path-issues.md +276 -0
- package/review/ALP-76/1-error-type-design.md +962 -0
- package/review/ALP-76/2-error-handling-patterns.md +906 -0
- package/review/ALP-76/3-error-presentation.md +624 -0
- package/review/ALP-76/4-test-coverage.md +625 -0
- package/review/ALP-76/5-migration-completeness.md +440 -0
- package/review/ALP-76/6-effect-best-practices.md +755 -0
- package/scripts/apply-branch-protection.sh +47 -0
- package/scripts/branch-protection-templates.json +79 -0
- package/scripts/prototype-summarization.ts +346 -0
- package/scripts/rebuild-hnswlib.js +58 -0
- package/scripts/setup-branch-protection.sh +64 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/active-provider.json +7 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.json +541 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/bm25.meta.json +5 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/config.json +8 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/documents.json +60 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/links.json +13 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/.mdcontext/indexes/sections.json +1197 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/configuration-management.md +99 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/distributed-systems.md +92 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/error-handling.md +78 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/failure-automation.md +55 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/job-context.md +69 -0
- package/src/__tests__/fixtures/semantic-search/multi-word-corpus/process-orchestration.md +99 -0
- package/src/cli/argv-preprocessor.test.ts +210 -0
- package/src/cli/argv-preprocessor.ts +202 -0
- package/src/cli/cli.test.ts +627 -0
- package/src/cli/commands/backlinks.ts +54 -0
- package/src/cli/commands/config-cmd.ts +642 -0
- package/src/cli/commands/context.ts +285 -0
- package/src/cli/commands/duplicates.ts +122 -0
- package/src/cli/commands/embeddings.ts +529 -0
- package/src/cli/commands/index-cmd.ts +480 -0
- package/src/cli/commands/index.ts +16 -0
- package/src/cli/commands/links.ts +52 -0
- package/src/cli/commands/search.ts +1281 -0
- package/src/cli/commands/stats.ts +149 -0
- package/src/cli/commands/tree.ts +128 -0
- package/src/cli/config-layer.ts +176 -0
- package/src/cli/error-handler.test.ts +235 -0
- package/src/cli/error-handler.ts +655 -0
- package/src/cli/flag-schemas.ts +341 -0
- package/src/cli/help.ts +588 -0
- package/src/cli/index.ts +9 -0
- package/src/cli/main.ts +435 -0
- package/src/cli/options.ts +41 -0
- package/src/cli/shared-error-handling.ts +199 -0
- package/src/cli/typo-suggester.test.ts +105 -0
- package/src/cli/typo-suggester.ts +130 -0
- package/src/cli/utils.ts +259 -0
- package/src/config/file-provider.test.ts +320 -0
- package/src/config/file-provider.ts +273 -0
- package/src/config/index.ts +72 -0
- package/src/config/integration.test.ts +667 -0
- package/src/config/precedence.test.ts +277 -0
- package/src/config/precedence.ts +451 -0
- package/src/config/schema.test.ts +414 -0
- package/src/config/schema.ts +603 -0
- package/src/config/service.test.ts +320 -0
- package/src/config/service.ts +243 -0
- package/src/config/testing.test.ts +264 -0
- package/src/config/testing.ts +110 -0
- package/src/core/index.ts +1 -0
- package/src/core/types.ts +113 -0
- package/src/duplicates/detector.test.ts +183 -0
- package/src/duplicates/detector.ts +414 -0
- package/src/duplicates/index.ts +18 -0
- package/src/embeddings/embedding-namespace.test.ts +300 -0
- package/src/embeddings/embedding-namespace.ts +947 -0
- package/src/embeddings/heading-boost.test.ts +222 -0
- package/src/embeddings/hnsw-build-options.test.ts +198 -0
- package/src/embeddings/hyde.test.ts +272 -0
- package/src/embeddings/hyde.ts +264 -0
- package/src/embeddings/index.ts +10 -0
- package/src/embeddings/openai-provider.ts +414 -0
- package/src/embeddings/pricing.json +22 -0
- package/src/embeddings/provider-constants.ts +204 -0
- package/src/embeddings/provider-errors.test.ts +967 -0
- package/src/embeddings/provider-errors.ts +565 -0
- package/src/embeddings/provider-factory.test.ts +240 -0
- package/src/embeddings/provider-factory.ts +225 -0
- package/src/embeddings/provider-integration.test.ts +788 -0
- package/src/embeddings/query-preprocessing.test.ts +187 -0
- package/src/embeddings/semantic-search-threshold.test.ts +508 -0
- package/src/embeddings/semantic-search.ts +1270 -0
- package/src/embeddings/types.ts +359 -0
- package/src/embeddings/vector-store.ts +708 -0
- package/src/embeddings/voyage-provider.ts +313 -0
- package/src/errors/errors.test.ts +845 -0
- package/src/errors/index.ts +533 -0
- package/src/index/ignore-patterns.test.ts +354 -0
- package/src/index/ignore-patterns.ts +305 -0
- package/src/index/index.ts +4 -0
- package/src/index/indexer.ts +684 -0
- package/src/index/storage.ts +260 -0
- package/src/index/types.ts +147 -0
- package/src/index/watcher.ts +189 -0
- package/src/index.ts +30 -0
- package/src/integration/search-keyword.test.ts +678 -0
- package/src/mcp/server.ts +612 -0
- package/src/parser/index.ts +1 -0
- package/src/parser/parser.test.ts +291 -0
- package/src/parser/parser.ts +394 -0
- package/src/parser/section-filter.test.ts +277 -0
- package/src/parser/section-filter.ts +392 -0
- package/src/search/__tests__/hybrid-search.test.ts +650 -0
- package/src/search/bm25-store.ts +366 -0
- package/src/search/cross-encoder.test.ts +253 -0
- package/src/search/cross-encoder.ts +406 -0
- package/src/search/fuzzy-search.test.ts +419 -0
- package/src/search/fuzzy-search.ts +273 -0
- package/src/search/hybrid-search.ts +448 -0
- package/src/search/path-matcher.test.ts +276 -0
- package/src/search/path-matcher.ts +33 -0
- package/src/search/query-parser.test.ts +260 -0
- package/src/search/query-parser.ts +319 -0
- package/src/search/searcher.test.ts +280 -0
- package/src/search/searcher.ts +724 -0
- package/src/search/wink-bm25.d.ts +30 -0
- package/src/summarization/cli-providers/claude.ts +202 -0
- package/src/summarization/cli-providers/detection.test.ts +273 -0
- package/src/summarization/cli-providers/detection.ts +118 -0
- package/src/summarization/cli-providers/index.ts +8 -0
- package/src/summarization/cost.test.ts +139 -0
- package/src/summarization/cost.ts +102 -0
- package/src/summarization/error-handler.test.ts +127 -0
- package/src/summarization/error-handler.ts +111 -0
- package/src/summarization/index.ts +102 -0
- package/src/summarization/pipeline.test.ts +498 -0
- package/src/summarization/pipeline.ts +231 -0
- package/src/summarization/prompts.test.ts +269 -0
- package/src/summarization/prompts.ts +133 -0
- package/src/summarization/provider-factory.test.ts +396 -0
- package/src/summarization/provider-factory.ts +178 -0
- package/src/summarization/types.ts +184 -0
- package/src/summarize/budget-bugs.test.ts +620 -0
- package/src/summarize/formatters.ts +419 -0
- package/src/summarize/index.ts +20 -0
- package/src/summarize/summarizer.test.ts +275 -0
- package/src/summarize/summarizer.ts +597 -0
- package/src/summarize/verify-bugs.test.ts +238 -0
- package/src/types/huggingface-transformers.d.ts +66 -0
- package/src/utils/index.ts +1 -0
- package/src/utils/tokens.test.ts +142 -0
- package/src/utils/tokens.ts +186 -0
- package/tests/fixtures/cli/.mdcontext/active-provider.json +7 -0
- package/tests/fixtures/cli/.mdcontext/config.json +8 -0
- package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.bin +0 -0
- package/tests/fixtures/cli/.mdcontext/embeddings/openai_text-embedding-3-small_512/vectors.meta.bin +0 -0
- package/tests/fixtures/cli/.mdcontext/indexes/documents.json +33 -0
- package/tests/fixtures/cli/.mdcontext/indexes/links.json +12 -0
- package/tests/fixtures/cli/.mdcontext/indexes/sections.json +247 -0
- package/tests/fixtures/cli/README.md +9 -0
- package/tests/fixtures/cli/api-reference.md +11 -0
- package/tests/fixtures/cli/getting-started.md +11 -0
- package/tests/integration/embed-index.test.ts +712 -0
- package/tests/integration/search-context.test.ts +469 -0
- package/tests/integration/search-semantic.test.ts +522 -0
- package/tsconfig.json +26 -0
- package/vitest.config.ts +16 -0
- package/vitest.setup.ts +12 -0
|
@@ -0,0 +1,258 @@
|
|
|
1
|
+
# Report: B2 - Gap Finder
|
|
2
|
+
|
|
3
|
+
## Mission
|
|
4
|
+
|
|
5
|
+
Find gaps and missing elements in the specification
|
|
6
|
+
|
|
7
|
+
## Research Question
|
|
8
|
+
|
|
9
|
+
What's missing from the spec? What gaps were identified?
|
|
10
|
+
|
|
11
|
+
## Command Log
|
|
12
|
+
|
|
13
|
+
| # | Command | Purpose | Result | Useful? |
|
|
14
|
+
| --- | ----------------------------------------------------------------------------------------- | ----------------------- | --------------------------------------------------- | --------- |
|
|
15
|
+
| 1 | `mdcontext --help` | Learn tool | Showed all commands and examples | Yes |
|
|
16
|
+
| 2 | `mdcontext index` | Index markdown files | 23 docs, 922 sections indexed | Yes |
|
|
17
|
+
| 3 | `mdcontext tree` | List all files | Showed 23 files in docs/, docs.amorphic/, docs.llm/ | Yes |
|
|
18
|
+
| 4 | `mdcontext search "gaps missing omissions"` | Find gap mentions | 0 results (semantic) | No |
|
|
19
|
+
| 5 | `mdcontext search "gaps missing omissions" --mode keyword` | Find gap mentions | 0 results | No |
|
|
20
|
+
| 6 | `mdcontext search "missing" --mode keyword` | Find missing items | 10 results - found "missing primitive" narrative | Yes |
|
|
21
|
+
| 7 | `mdcontext search "gap" --mode keyword` | Find gap mentions | 8 results - found "Judgment Gap", "AI fatigue gap" | Yes |
|
|
22
|
+
| 8 | `mdcontext search "TODO" --mode keyword` | Find TODOs | 1 result (Letta integration) | Minimal |
|
|
23
|
+
| 9 | `mdcontext search "not yet" --mode keyword` | Find incomplete | 10 results (mostly SPEC.md headers) | No |
|
|
24
|
+
| 10 | `mdcontext search "incomplete" --mode keyword` | Find incomplete | 2 results | Minimal |
|
|
25
|
+
| 11 | `mdcontext search "needs" --mode keyword` | Find needs | 10 results - coordination, clarification needs | Yes |
|
|
26
|
+
| 12 | `mdcontext search "question" --mode keyword` | Find questions | 10 results | Minimal |
|
|
27
|
+
| 13 | `mdcontext search "problem" --mode keyword` | Find problems | 10 results - handoff problem, problem space | Yes |
|
|
28
|
+
| 14 | `mdcontext search "issue" --mode keyword` | Find issues | 10 results - mostly examples | Minimal |
|
|
29
|
+
| 15 | `mdcontext search "limitation" --mode keyword` | Find limitations | 3 results - judgment limitation | Yes |
|
|
30
|
+
| 16 | `mdcontext search "concern" --mode keyword` | Find concerns | 10 results - architecture concerns | Minimal |
|
|
31
|
+
| 17 | `mdcontext search "feedback" --mode keyword` | Find feedback | 10 results - key evolution doc | Yes |
|
|
32
|
+
| 18 | `mdcontext context docs.llm/feedback.md -t 3000` | Get feedback context | Truncated summary | Partial |
|
|
33
|
+
| 19 | `mdcontext search "recommend" --mode keyword` | Find recommendations | 10 results - recommendation primitive | Yes |
|
|
34
|
+
| 20 | `mdcontext search "suggested" --mode keyword` | Find suggestions | 6 results | Minimal |
|
|
35
|
+
| 21 | `mdcontext search "should" --mode keyword` | Find should-statements | 10 results - evolution suggestions | Yes |
|
|
36
|
+
| 22 | `mdcontext context docs/HumanWork-Evolution.md --full` | Get evolution doc | Full synthesis of all gaps | Critical |
|
|
37
|
+
| 23 | `mdcontext search "security" --mode keyword` | Find security gaps | 10 results | Minimal |
|
|
38
|
+
| 24 | `mdcontext search "failure" --mode keyword` | Find failure handling | 10 results - failure recovery | Minimal |
|
|
39
|
+
| 25 | `mdcontext search "privacy" --mode keyword` | Find privacy gaps | 2 results - minimal coverage | Yes (gap) |
|
|
40
|
+
| 26 | `mdcontext context docs/03-ORG_WORKSPACE_MODEL.md --section "11. Privacy and Visibility"` | Get privacy details | Very brief - "policy overlay" | Yes (gap) |
|
|
41
|
+
| 27 | `mdcontext search "cost" --mode keyword` | Find cost model | 10 results - basic cost tracking | Yes |
|
|
42
|
+
| 28 | `mdcontext search "testing" --mode keyword` | Find testing mentions | 10 results - mostly examples | Minimal |
|
|
43
|
+
| 29 | `mdcontext search "API" --mode keyword` | Find API spec | 10 results - no formal API spec | Yes (gap) |
|
|
44
|
+
| 30 | `mdcontext context docs/05-MEMORY_MODEL.md --section "15. Implementation Guidance"` | Get impl guidance | Technology suggestions only | Yes |
|
|
45
|
+
| 31 | `mdcontext search "evolution" --mode keyword` | Find evolution plans | 10 results - evolution document | Yes |
|
|
46
|
+
| 32 | `mdcontext context docs/LETTA_INTEGRATION_PLAN.md -t 3000` | Get Letta plan | Integration phases | Yes |
|
|
47
|
+
| 33 | `mdcontext search "rename" --mode keyword` | Find rename suggestions | 1 result - terminology changes | Yes |
|
|
48
|
+
| 34 | `mdcontext search "replaces" --mode keyword` | Find replacements | 2 results - Actor/Deliverable | Yes |
|
|
49
|
+
| 35 | `mdcontext stats` | Index statistics | 23 docs, 922 sections, 178K tokens | Yes |
|
|
50
|
+
|
|
51
|
+
## Findings
|
|
52
|
+
|
|
53
|
+
### Key Discoveries
|
|
54
|
+
|
|
55
|
+
The most critical finding is the **HumanWork-Evolution.md** document which synthesizes all feedback into a comprehensive gap analysis. This document reveals 7 major categories of gaps:
|
|
56
|
+
|
|
57
|
+
#### 1. Terminology/Primitive Gaps
|
|
58
|
+
|
|
59
|
+
The spec uses outdated or suboptimal terminology:
|
|
60
|
+
|
|
61
|
+
- "Agent" should become "Actor" (unified human/machine)
|
|
62
|
+
- "Artifact" should become "Deliverable" (business language)
|
|
63
|
+
- "Event Memory" should become "The Ledger" (emphasizes IP capture)
|
|
64
|
+
|
|
65
|
+
#### 2. Missing Primitives
|
|
66
|
+
|
|
67
|
+
Several critical primitives are not defined in the spec:
|
|
68
|
+
|
|
69
|
+
- **Correction Event** - captures human intelligence when modifying outputs
|
|
70
|
+
- **Authority Gradient** - replaces binary control with spectrum
|
|
71
|
+
- **Pattern Crystallization** - organizational learning mechanism
|
|
72
|
+
|
|
73
|
+
#### 3. Philosophical Framing Gap
|
|
74
|
+
|
|
75
|
+
The spec frames human control as the end state, but feedback suggests:
|
|
76
|
+
|
|
77
|
+
- Human control is a **transition phase**, not end state
|
|
78
|
+
- System should extract and crystallize human intelligence
|
|
79
|
+
- Progressive transfer of decision-making to Actors
|
|
80
|
+
|
|
81
|
+
#### 4. Memory Model Gaps
|
|
82
|
+
|
|
83
|
+
- No geometric/semantic embeddings in Semantic Memory
|
|
84
|
+
- No Pattern Crystallization operation
|
|
85
|
+
- Missing cognitive telemetry at checkpoints
|
|
86
|
+
|
|
87
|
+
#### 5. Cost Model Gaps
|
|
88
|
+
|
|
89
|
+
- Human hours and AI tokens not unified
|
|
90
|
+
- Actor cost attribution not formalized
|
|
91
|
+
- No cost model for Actor.type (Human vs Machine)
|
|
92
|
+
|
|
93
|
+
#### 6. Privacy/Security Gap
|
|
94
|
+
|
|
95
|
+
The privacy section is minimal - described as a "policy overlay, not a core feature"
|
|
96
|
+
|
|
97
|
+
#### 7. API/Implementation Gap
|
|
98
|
+
|
|
99
|
+
- No formal API specification
|
|
100
|
+
- Implementation guidance is technology suggestions only
|
|
101
|
+
- No concrete schemas or endpoints defined
|
|
102
|
+
|
|
103
|
+
### Relevant Quotes/Sections Found
|
|
104
|
+
|
|
105
|
+
> "HumanWork is the missing runtime for Labor - alongside Compute, Storage, and Network."
|
|
106
|
+
> Source: docs/HumanWork-Evolution.md
|
|
107
|
+
|
|
108
|
+
> "The ultimate goal is human replacement through intelligence crystallization"
|
|
109
|
+
> Source: docs/HumanWork-Evolution.md (citing amorphic feedback)
|
|
110
|
+
|
|
111
|
+
> "Artifacts suggests archaeological remnants. Deliverables emphasizes work products - measurable, valuable outputs"
|
|
112
|
+
> Source: docs/HumanWork-Evolution.md
|
|
113
|
+
|
|
114
|
+
> "The System treats Human Labor and AI Agent Labor as a unified data stream"
|
|
115
|
+
> Source: docs/HumanWork-Evolution.md (Amorphic Principle)
|
|
116
|
+
|
|
117
|
+
> "Checkpoints aren't just governance - they're cognitive capture points"
|
|
118
|
+
> Source: docs/HumanWork-Evolution.md
|
|
119
|
+
|
|
120
|
+
> "Pure automation optimizes for the known, but work increasingly happens in the spaces between the known - in edge cases, exceptions, and emergent scenarios"
|
|
121
|
+
> Source: docs.amorphic/02-THE_FAILURE_OF_PURE_AUTOMATION.md, "The Judgment Gap"
|
|
122
|
+
|
|
123
|
+
> "The timing is critical. Organizations worldwide are struggling with 'AI fatigue' - the gap between AI's promise and its practical integration"
|
|
124
|
+
> Source: docs.amorphic/01-EXECUTIVE_SUMMARY.md, "Market Disruption Potential"
|
|
125
|
+
|
|
126
|
+
> "This is a **policy overlay**, not a core feature."
|
|
127
|
+
> Source: docs/03-ORG_WORKSPACE_MODEL.md, "11. Privacy and Visibility"
|
|
128
|
+
|
|
129
|
+
### Answer to Research Question
|
|
130
|
+
|
|
131
|
+
**What's missing from the spec? What gaps were identified?**
|
|
132
|
+
|
|
133
|
+
The spec has significant gaps in three areas:
|
|
134
|
+
|
|
135
|
+
**1. Primitive & Terminology Gaps:**
|
|
136
|
+
|
|
137
|
+
- Need new primitives: Actor, Deliverable, The Ledger, Correction Event, Authority Gradient, Pattern Crystallization
|
|
138
|
+
- Current terminology (Agent, Artifact, Event Memory) is inadequate
|
|
139
|
+
|
|
140
|
+
**2. Architectural Gaps:**
|
|
141
|
+
|
|
142
|
+
- Unified cost model for human/machine labor missing
|
|
143
|
+
- Geometric/semantic embeddings not in Memory Model
|
|
144
|
+
- No formal Pattern Crystallization mechanism
|
|
145
|
+
- Cognitive telemetry at checkpoints undefined
|
|
146
|
+
- Authority Gradient (spectrum of autonomy) not implemented
|
|
147
|
+
|
|
148
|
+
**3. Implementation/Documentation Gaps:**
|
|
149
|
+
|
|
150
|
+
- No formal API specification
|
|
151
|
+
- Privacy model is underdeveloped ("policy overlay")
|
|
152
|
+
- Error handling patterns not specified
|
|
153
|
+
- No concrete schemas or data models
|
|
154
|
+
- Missing hw CLI terminal experience positioning
|
|
155
|
+
|
|
156
|
+
**4. Philosophical Gap:**
|
|
157
|
+
The spec positions "human control" as the goal, but feedback suggests reframing as "intelligence extraction" - where human corrections become portable organizational intelligence that progressively makes Actors more capable.
|
|
158
|
+
|
|
159
|
+
## Proposed Spec Changes
|
|
160
|
+
|
|
161
|
+
Based on the feedback synthesis in HumanWork-Evolution.md:
|
|
162
|
+
|
|
163
|
+
### Immediate (Terminology)
|
|
164
|
+
|
|
165
|
+
- [ ] Add hw terminal metaphor to README
|
|
166
|
+
- [ ] Rename Artifact -> Deliverable throughout
|
|
167
|
+
- [ ] Rename Event Memory -> The Ledger
|
|
168
|
+
- [ ] Add Actor primitive (replaces Agent with type: Human | Machine)
|
|
169
|
+
- [ ] Add Correction Event primitive
|
|
170
|
+
|
|
171
|
+
### Near-Term (Architecture)
|
|
172
|
+
|
|
173
|
+
- [ ] Add Authority Gradient to Execution Model (instructional/consultative/supervisory/exploratory)
|
|
174
|
+
- [ ] Add Pattern Crystallization to Memory Model
|
|
175
|
+
- [ ] Enhance Semantic Memory with geometric embeddings
|
|
176
|
+
- [ ] Add cognitive telemetry to Checkpoints (deliberation_duration, confidence_signal, modification_depth)
|
|
177
|
+
- [ ] Unify cost model for Human + Machine Actors
|
|
178
|
+
|
|
179
|
+
### Strategic (Philosophy)
|
|
180
|
+
|
|
181
|
+
- [ ] Reframe "human control" as transition phase, not end state
|
|
182
|
+
- [ ] Position system as intelligence extraction, not just coordination
|
|
183
|
+
- [ ] Emphasize The Ledger as primary asset (IP capture)
|
|
184
|
+
- [ ] Adopt choreography language over orchestration
|
|
185
|
+
- [ ] Develop privacy model beyond "policy overlay"
|
|
186
|
+
- [ ] Create formal API specification
|
|
187
|
+
|
|
188
|
+
## Tool Evaluation
|
|
189
|
+
|
|
190
|
+
### What Worked Well
|
|
191
|
+
|
|
192
|
+
- **Keyword search** was essential - semantic search returned 0 results for "gaps missing omissions"
|
|
193
|
+
- **`mdcontext context --full`** for single large documents (HumanWork-Evolution.md) was extremely valuable
|
|
194
|
+
- **`mdcontext tree`** gave quick overview of document structure
|
|
195
|
+
- **Section-specific context** (`--section`) provided targeted extraction
|
|
196
|
+
- **Stats command** helped understand corpus size (178K tokens across 23 docs)
|
|
197
|
+
- Searching for key terms like "missing", "gap", "limitation", "recommendation" found relevant content
|
|
198
|
+
|
|
199
|
+
### What Was Frustrating
|
|
200
|
+
|
|
201
|
+
- **Semantic search yielded no results** for my primary query - had to fall back to keyword mode
|
|
202
|
+
- **Token truncation** meant I couldn't see full context of feedback.md (only 16% shown at 3000 tokens)
|
|
203
|
+
- **False positives in keyword search** - "not yet" returned SPEC.md headers, not actual incomplete items
|
|
204
|
+
- **No way to chain searches** - had to run many separate commands
|
|
205
|
+
- **Context command limits** - wanted to see multiple related sections but had to make separate calls
|
|
206
|
+
- Some search results returned 10 matches but didn't show the most relevant ones first
|
|
207
|
+
|
|
208
|
+
### What Was Missing
|
|
209
|
+
|
|
210
|
+
- **Cross-document synthesis** - tool doesn't connect related content across files
|
|
211
|
+
- **Diff/comparison view** - would help see gaps between spec versions
|
|
212
|
+
- **"What's undefined" query** - no way to find terms used but not defined
|
|
213
|
+
- **Relationship/dependency view** - what primitives depend on what
|
|
214
|
+
- **Semantic search tuning** - couldn't adjust semantic similarity threshold
|
|
215
|
+
- **Export/aggregation** - no way to collect all relevant sections into one view
|
|
216
|
+
|
|
217
|
+
### Confidence Level
|
|
218
|
+
|
|
219
|
+
[X] High / Medium / Low
|
|
220
|
+
|
|
221
|
+
The HumanWork-Evolution.md document is a comprehensive synthesis that explicitly answers my research question. The gaps are clearly articulated with specific recommendations. My high confidence comes from:
|
|
222
|
+
|
|
223
|
+
1. Finding the authoritative evolution document
|
|
224
|
+
2. Corroborating gaps through multiple keyword searches
|
|
225
|
+
3. Direct quotes supporting each gap identified
|
|
226
|
+
|
|
227
|
+
### Would Use Again? (1-5)
|
|
228
|
+
|
|
229
|
+
**4** - Very useful for targeted research. The keyword search mode was reliable once I learned semantic search wouldn't work for my query type. The context extraction with section targeting saved significant time. Lost one point because semantic search didn't work as expected and I had to learn workarounds.
|
|
230
|
+
|
|
231
|
+
## Time & Efficiency
|
|
232
|
+
|
|
233
|
+
- Commands run: **35**
|
|
234
|
+
- Compared to reading all files: **Much less** - The tool helped me navigate 178K tokens across 23 documents efficiently. Finding HumanWork-Evolution.md alone (7500 tokens of synthesized gaps) would have taken much longer by reading files directly. The keyword searches let me quickly identify which documents contained gap-related content without reading everything.
|
|
235
|
+
|
|
236
|
+
## Appendix: Critical Gaps from HumanWork-Evolution.md
|
|
237
|
+
|
|
238
|
+
The most authoritative source of gaps is the evolution document, which proposes three phases:
|
|
239
|
+
|
|
240
|
+
**Phase 1: Terminology Alignment**
|
|
241
|
+
|
|
242
|
+
- Agent -> Actor (with type: Human | Machine)
|
|
243
|
+
- Artifact -> Deliverable
|
|
244
|
+
- Event Memory -> The Ledger
|
|
245
|
+
- Add Correction Event, Authority Gradient, Pattern Crystallization
|
|
246
|
+
|
|
247
|
+
**Phase 2: Architectural Enhancements**
|
|
248
|
+
|
|
249
|
+
- Semantic Memory uses geometric embeddings
|
|
250
|
+
- Add Pattern Crystallization as memory operation
|
|
251
|
+
- Execution Contexts bind to Actors (not just agents)
|
|
252
|
+
- Checkpoints capture cognitive telemetry
|
|
253
|
+
|
|
254
|
+
**Phase 3: Philosophical Reframing**
|
|
255
|
+
|
|
256
|
+
- "Human control" becomes transition phase
|
|
257
|
+
- Position as intelligence extraction
|
|
258
|
+
- Choreography language over orchestration
|
|
@@ -0,0 +1,250 @@
|
|
|
1
|
+
# Report: B3 - Workflow Analyst
|
|
2
|
+
|
|
3
|
+
## Mission
|
|
4
|
+
|
|
5
|
+
Find workflow and process improvement suggestions
|
|
6
|
+
|
|
7
|
+
## Research Question
|
|
8
|
+
|
|
9
|
+
What workflow and process improvements are suggested?
|
|
10
|
+
|
|
11
|
+
## Command Log
|
|
12
|
+
|
|
13
|
+
| # | Command | Purpose | Result | Useful? |
|
|
14
|
+
| --- | -------------------------------------------------------------------------------------------------------------- | ------------------------------ | -------------------------------------------- | -------- |
|
|
15
|
+
| 1 | mdcontext --help | Learn tool capabilities | Showed commands, workflows, options | Yes |
|
|
16
|
+
| 2 | mdcontext tree | List all markdown files | Found 23 files across 3 folders | Yes |
|
|
17
|
+
| 3 | mdcontext index --force | Re-index all files | Indexed 922 sections, 904 vectors | Yes |
|
|
18
|
+
| 4 | mdcontext search "workflow improvement" | Find workflow improvements | 2 results (versioning, team review) | Moderate |
|
|
19
|
+
| 5 | mdcontext search "process automation" | Find automation content | 10 results (failure of automation, paradox) | Yes |
|
|
20
|
+
| 6 | mdcontext search "human agent collaboration" | Find collaboration patterns | 10 results (71.6% top match) | Yes |
|
|
21
|
+
| 7 | mdcontext search "execution model" | Find execution concepts | 10 results | Yes |
|
|
22
|
+
| 8 | mdcontext search "best practices" | Find best practices | 4 results | Moderate |
|
|
23
|
+
| 9 | mdcontext search "transformation organizational" | Find org transformation | 10 results (62.7% top) | Yes |
|
|
24
|
+
| 10 | mdcontext search "intervention points human decision" | Find human intervention points | 10 results (68.7% top) | Yes |
|
|
25
|
+
| 11 | mdcontext context docs/06-WORKFLOWS.md -t 3000 | Get workflows overview | Core workflow concepts extracted | Yes |
|
|
26
|
+
| 12 | mdcontext context docs/06-WORKFLOWS.md --section "Workflow Design Principles" | Get design principles | Do/Don't guidelines | Yes |
|
|
27
|
+
| 13 | mdcontext context docs.amorphic/04-THE_HUMAN-AGENT_COLLABORATION_MODEL.md --section "Intervention Points" | Get intervention details | Redirect, override, inject, escalate | Yes |
|
|
28
|
+
| 14 | mdcontext context docs.amorphic/06-ORGANIZATIONAL_TRANSFORMATION.md -t 3000 | Get org transformation | Roles, culture, governance, implementation | Yes |
|
|
29
|
+
| 15 | mdcontext context docs.amorphic/02-THE_FAILURE_OF_PURE_AUTOMATION.md -t 2500 | Get automation failures | Brittleness, context collapse, judgment gap | Yes |
|
|
30
|
+
| 16 | mdcontext context docs/04-EXECUTION_MODEL.md --section "Execution Philosophy" | Get execution philosophy | Scoped, disposable attempts | Yes |
|
|
31
|
+
| 17 | mdcontext context docs/04-EXECUTION_MODEL.md --section "Key Execution Patterns" | Get execution patterns | Ad-hoc, guided, long-running, parallel | Yes |
|
|
32
|
+
| 18 | mdcontext search "continuous improvement feedback loop" | Find feedback loops | 3 results (learning loop) | Moderate |
|
|
33
|
+
| 19 | mdcontext context docs/06-WORKFLOWS.md --section "Learning Loop" | Get learning loop details | Metrics feed refinement | Moderate |
|
|
34
|
+
| 20 | mdcontext context docs.llm/feedback.md -t 2500 | Get feedback concepts | Hybrid intelligence, resonance | Yes |
|
|
35
|
+
| 21 | mdcontext search "checkpoint governance approval" | Find checkpoint content | 5 results (executive decision) | Moderate |
|
|
36
|
+
| 22 | mdcontext context docs/07-EXAMPLE_WORKFLOWS.md -t 3000 | Get example workflows | Individual idea exploration example | Yes |
|
|
37
|
+
| 23 | mdcontext search "versioning iteration reuse" | Find versioning content | 6 results | Moderate |
|
|
38
|
+
| 24 | mdcontext context docs/06-WORKFLOWS.md --section "Workflow Versioning" | Get versioning details | Immutable versioning model | Yes |
|
|
39
|
+
| 25 | mdcontext search "human override control interrupt" | Find human control | 3 results (Human Interrupts) | Yes |
|
|
40
|
+
| 26 | mdcontext context docs/01-ARCHITECTURE.md --section "Human Interrupts" | Get interrupt flow | Control plane directive flow | Moderate |
|
|
41
|
+
| 27 | mdcontext search "memory workspace context" | Find memory concepts | 10 results (Workspace Memory top) | Yes |
|
|
42
|
+
| 28 | mdcontext context docs/05-MEMORY_MODEL.md -t 2500 | Get memory model | Three-layer architecture | Yes |
|
|
43
|
+
| 29 | mdcontext search "parallel exploration concurrent" | Find parallel patterns | 10 results (67% match) | Yes |
|
|
44
|
+
| 30 | mdcontext context docs/04-EXECUTION_MODEL.md --section "Parallel Exploration" | Get parallel pattern | Fan-out/fan-in pattern | Yes |
|
|
45
|
+
| 31 | mdcontext search "audit trail transparency accountability" | Find audit content | 6 results (67.1% match) | Yes |
|
|
46
|
+
| 32 | mdcontext context docs.amorphic/04-THE_HUMAN-AGENT_COLLABORATION_MODEL.md --section "Audit Trails That Matter" | Get audit trails | Immutable records that tell stories | Yes |
|
|
47
|
+
| 33 | mdcontext search "decision making recommendation suggestions" | Find recommendations | 1 result (Recommendation primitive) | Moderate |
|
|
48
|
+
| 34 | mdcontext context docs/02-PRIMITIVES.md --section "Recommendation" | Get recommendation definition | Ranked suggestions with confidence | Yes |
|
|
49
|
+
| 35 | mdcontext context docs/03-ORG_WORKSPACE_MODEL.md -t 2500 | Get org/workspace model | Two-tier identity model | Yes |
|
|
50
|
+
| 36 | mdcontext search "time travel rollback branching" | Find branching concepts | 8 results (62.6% match) | Yes |
|
|
51
|
+
| 37 | mdcontext context docs/05-MEMORY_MODEL.md --section "Time Travel and Branching" | Get time travel details | Rewind, compare, branch capabilities | Yes |
|
|
52
|
+
| 38 | mdcontext search "compliance governance regulation" | Find compliance content | 8 results | Yes |
|
|
53
|
+
| 39 | mdcontext context docs/07-EXAMPLE_WORKFLOWS.md --section "Example 5: Enterprise-Compliance Review" | Get compliance example | Full enterprise workflow example | Yes |
|
|
54
|
+
| 40 | mdcontext context docs/02-PRIMITIVES.md --section "Signal" | Get signal primitive | Activity, outcome, attention, health signals | Yes |
|
|
55
|
+
| 41 | mdcontext context docs.amorphic/04-THE_HUMAN-AGENT_COLLABORATION_MODEL.md -t 3000 | Get full collaboration model | Authority gradient, spectrum of modes | Yes |
|
|
56
|
+
|
|
57
|
+
## Findings
|
|
58
|
+
|
|
59
|
+
### Key Discoveries
|
|
60
|
+
|
|
61
|
+
#### 1. Workflow Philosophy: Guidance Without Control
|
|
62
|
+
|
|
63
|
+
The documentation emphasizes workflows as **reusable coordination patterns that shape how work unfolds, without prescribing execution logic**. This is a fundamental process improvement over rigid automation.
|
|
64
|
+
|
|
65
|
+
#### 2. Six Core Workflow Concepts
|
|
66
|
+
|
|
67
|
+
Every workflow consists of exactly six concepts:
|
|
68
|
+
|
|
69
|
+
1. **Entry Signals** - Conditions that suggest relevance (not trigger automatically)
|
|
70
|
+
2. **Roles** - Perspectives needed (enables agent replacement, human participation)
|
|
71
|
+
3. **Phases** - Conceptual stages (ordered but not strictly enforced, skippable)
|
|
72
|
+
4. **Activities** - What kind of work happens (not how to do it)
|
|
73
|
+
5. **Checkpoints (Guardrails)** - Deliberate pauses for human re-engagement
|
|
74
|
+
6. **Exit Conditions** - When workflow naturally concludes
|
|
75
|
+
|
|
76
|
+
#### 3. Human-Agent Collaboration Model
|
|
77
|
+
|
|
78
|
+
The "Authority Gradient" enables dynamic adjustment of human involvement:
|
|
79
|
+
|
|
80
|
+
- **Instructional Mode**: Step-by-step human instructions
|
|
81
|
+
- **Consultative Mode**: Human defines goal, agent proposes approaches
|
|
82
|
+
- **Supervisory Mode**: Agents execute with autonomy, humans monitor
|
|
83
|
+
- **Exploratory Mode**: Humans and agents alternate generating/testing ideas
|
|
84
|
+
|
|
85
|
+
#### 4. Intervention Points (Critical Process Improvement)
|
|
86
|
+
|
|
87
|
+
Humans can:
|
|
88
|
+
|
|
89
|
+
- **Redirect** an agent mid-task
|
|
90
|
+
- **Override** a decision before execution
|
|
91
|
+
- **Inject** new requirements during planning
|
|
92
|
+
- **Escalate** when complexity exceeds thresholds
|
|
93
|
+
|
|
94
|
+
#### 5. The Failure of Pure Automation (Anti-Patterns)
|
|
95
|
+
|
|
96
|
+
The docs identify key problems with pure automation:
|
|
97
|
+
|
|
98
|
+
- **Brittleness**: Works until it doesn't (80% flawless, 20% chaos)
|
|
99
|
+
- **Context Collapse**: Treats context as configuration, not conversation
|
|
100
|
+
- **Coordination Trap**: Multiplies coordination by forcing machine-readable formats
|
|
101
|
+
- **Judgment Gap**: Absence of contextual fluency
|
|
102
|
+
- **Innovation Strangulation**: Cannot accommodate work that hasn't been done before
|
|
103
|
+
|
|
104
|
+
#### 6. Three-Layer Memory Architecture
|
|
105
|
+
|
|
106
|
+
- **Event Memory**: Immutable, append-only source of truth
|
|
107
|
+
- **Status Memory**: Derived, mutable projection of current situation
|
|
108
|
+
- **Semantic Memory**: Knowledge graph for understanding and recommendations
|
|
109
|
+
|
|
110
|
+
#### 7. Time Travel and Branching
|
|
111
|
+
|
|
112
|
+
Powerful process improvement enabling:
|
|
113
|
+
|
|
114
|
+
- Rewind to any point in time
|
|
115
|
+
- Compare states (diff artifacts, trace decisions)
|
|
116
|
+
- Branch for safe experimentation
|
|
117
|
+
- Parallel hypotheses without loss
|
|
118
|
+
|
|
119
|
+
#### 8. Parallel Exploration Pattern
|
|
120
|
+
|
|
121
|
+
Multiple execution contexts can run concurrently:
|
|
122
|
+
|
|
123
|
+
- Distinct scopes (no mutual exclusion)
|
|
124
|
+
- Produce parallel alternatives
|
|
125
|
+
- Resolution is explicit, separate act
|
|
126
|
+
- Human reviews and merges
|
|
127
|
+
|
|
128
|
+
#### 9. Workflow Versioning
|
|
129
|
+
|
|
130
|
+
- Immutable versioning model
|
|
131
|
+
- Specific version pinning or "latest" reference
|
|
132
|
+
- Active workflows never upgrade mid-execution
|
|
133
|
+
- Human approval for version upgrades
|
|
134
|
+
|
|
135
|
+
#### 10. Organizational Transformation Framework
|
|
136
|
+
|
|
137
|
+
- **Choreographic Maturity Model** (4 levels):
|
|
138
|
+
1. Tool Usage
|
|
139
|
+
2. Workflow Integration
|
|
140
|
+
3. Adaptive Choreography
|
|
141
|
+
4. Organizational Intelligence
|
|
142
|
+
- **Cultural Shifts**: Experimental mindsets, transparent feedback, distributed decision-making
|
|
143
|
+
|
|
144
|
+
### Relevant Quotes/Sections Found
|
|
145
|
+
|
|
146
|
+
> "Workflows are reusable coordination patterns that shape how work unfolds, without prescribing execution logic."
|
|
147
|
+
> Source: docs/06-WORKFLOWS.md, Workflow Philosophy
|
|
148
|
+
|
|
149
|
+
> "Checkpoints are where: Human control is enforced, Cost is consciously acknowledged, Autonomy is bounded, Trust is built. Most agent systems lack this concept entirely."
|
|
150
|
+
> Source: docs/06-WORKFLOWS.md, Checkpoints (Guardrails)
|
|
151
|
+
|
|
152
|
+
> "A human can: Redirect an agent mid-task, Override a decision before execution, Inject new requirements during planning, Escalate when complexity exceeds thresholds. The system doesn't break-it adapts."
|
|
153
|
+
> Source: docs.amorphic/04-THE_HUMAN-AGENT_COLLABORATION_MODEL.md, Intervention Points
|
|
154
|
+
|
|
155
|
+
> "The goal isn't to minimize human involvement, but to optimize the unique contributions each participant brings. The most sophisticated choreographed intelligence systems often appear less automated than simpler ones."
|
|
156
|
+
> Source: docs.amorphic/06-ORGANIZATIONAL_TRANSFORMATION.md, The Paradox of Automation
|
|
157
|
+
|
|
158
|
+
> "Every event creates an immutable record, but unlike traditional logs, these records tell a story. They capture not just what happened, but why it happened, who was involved, and what alternatives were considered."
|
|
159
|
+
> Source: docs.amorphic/04-THE_HUMAN-AGENT_COLLABORATION_MODEL.md, Audit Trails That Matter
|
|
160
|
+
|
|
161
|
+
> "Recommendations are computed, explainable, optional, support override, and carry no authority."
|
|
162
|
+
> Source: docs/02-PRIMITIVES.md, Recommendation
|
|
163
|
+
|
|
164
|
+
> "HumanWork treats execution as a series of scoped, disposable attempts guided by durable intent."
|
|
165
|
+
> Source: docs/04-EXECUTION_MODEL.md, Execution Philosophy
|
|
166
|
+
|
|
167
|
+
### Answer to Research Question
|
|
168
|
+
|
|
169
|
+
**What workflow and process improvements are suggested?**
|
|
170
|
+
|
|
171
|
+
The documentation suggests a comprehensive framework of workflow and process improvements centered on **human-first design with intelligent agent collaboration**:
|
|
172
|
+
|
|
173
|
+
1. **From Rigid to Adaptive Workflows**: Replace prescriptive automation with coordination patterns that guide without controlling. Workflows suggest structure, surface activities, and enforce checkpoints while allowing human override at any point.
|
|
174
|
+
|
|
175
|
+
2. **Checkpoint-Driven Governance**: Insert frequent, deliberate pauses (checkpoints) where humans re-engage, confirm direction, and make explicit decisions about cost and autonomy. This builds trust and prevents runaway automation.
|
|
176
|
+
|
|
177
|
+
3. **Dynamic Authority Gradient**: Enable seamless transitions between different collaboration modes (instructional, consultative, supervisory, exploratory) based on context, confidence, and stakes.
|
|
178
|
+
|
|
179
|
+
4. **Granular Intervention Points**: Allow humans to redirect, override, inject, or escalate at every event boundary without breaking the system or restarting workflows.
|
|
180
|
+
|
|
181
|
+
5. **Event-Sourced Memory**: Use immutable event logs as source of truth, enabling time travel, branching, safe experimentation, and complete audit trails.
|
|
182
|
+
|
|
183
|
+
6. **Parallel Exploration**: Support concurrent execution of multiple approaches with distinct scopes, enabling humans to compare alternatives before committing.
|
|
184
|
+
|
|
185
|
+
7. **Semantic Discovery**: Layer a knowledge graph on top of facts to enable natural language queries, pattern recognition, and recommendations without giving it control authority.
|
|
186
|
+
|
|
187
|
+
8. **Versioned, Reusable Workflows**: Capture proven patterns as immutable, versioned workflows that can be shared across teams and evolved without disrupting active work.
|
|
188
|
+
|
|
189
|
+
9. **Transparency Architecture**: Make every decision, action, and state change observable in real-time. Transform audit trails from archaeology into documentation.
|
|
190
|
+
|
|
191
|
+
10. **Organizational Choreography**: Develop choreographic capabilities as competitive advantage - context-dependent, relationship-based, continuously evolving collaboration patterns.
|
|
192
|
+
|
|
193
|
+
## Proposed Spec Changes
|
|
194
|
+
|
|
195
|
+
Based on findings, consider these additions to any spec:
|
|
196
|
+
|
|
197
|
+
- [ ] Add explicit checkpoint requirements for all high-stakes workflows
|
|
198
|
+
- [ ] Define minimum intervention points per workflow phase
|
|
199
|
+
- [ ] Require confidence levels on all recommendations (strongly/moderately/weakly supported)
|
|
200
|
+
- [ ] Mandate immutable event logging as source of truth
|
|
201
|
+
- [ ] Support branching and time travel for exploratory work
|
|
202
|
+
- [ ] Enable parallel execution contexts for comparative analysis
|
|
203
|
+
- [ ] Implement workflow versioning with upgrade approval gates
|
|
204
|
+
- [ ] Create signals taxonomy (activity, outcome, attention, health)
|
|
205
|
+
- [ ] Design interfaces supporting seamless collaboration mode transitions
|
|
206
|
+
- [ ] Establish choreographic maturity assessment framework
|
|
207
|
+
|
|
208
|
+
## Tool Evaluation
|
|
209
|
+
|
|
210
|
+
### What Worked Well
|
|
211
|
+
|
|
212
|
+
- Semantic search was effective at finding conceptually related content across multiple documents
|
|
213
|
+
- Match percentages gave useful confidence signals (71.6% for human-agent collaboration was spot-on)
|
|
214
|
+
- Section-level context extraction worked excellently (--section flag)
|
|
215
|
+
- Token budget control prevented context overflow
|
|
216
|
+
- Index with embeddings was fast and cost-effective (~$0.003)
|
|
217
|
+
- Tree command gave quick overview of all files
|
|
218
|
+
- Search results showed which docs were most relevant
|
|
219
|
+
|
|
220
|
+
### What Was Frustrating
|
|
221
|
+
|
|
222
|
+
- Some searches returned 0 results when content definitely existed (e.g., "efficiency productivity scaling team")
|
|
223
|
+
- Token truncation warnings didn't always indicate which sections were excluded clearly
|
|
224
|
+
- No way to search across multiple specific sections at once
|
|
225
|
+
- Had to run many searches to explore different concept angles
|
|
226
|
+
- Some duplicate content between docs/ and docs.llm/ folders inflated results
|
|
227
|
+
|
|
228
|
+
### What Was Missing
|
|
229
|
+
|
|
230
|
+
- No way to get "all sections mentioning X" in one command
|
|
231
|
+
- No comparative search (find docs that mention A AND B vs A OR B semantically)
|
|
232
|
+
- No export/save functionality for search results
|
|
233
|
+
- Would benefit from a "related sections" feature after finding one
|
|
234
|
+
- No batch context extraction for multiple files in one command
|
|
235
|
+
- Could use a "summarize all findings" aggregation feature
|
|
236
|
+
|
|
237
|
+
### Confidence Level
|
|
238
|
+
|
|
239
|
+
[X] High / [ ] Medium / [ ] Low
|
|
240
|
+
|
|
241
|
+
The tool successfully surfaced the core workflow and process improvement concepts from the documentation. Semantic search found relevant sections across all three document folders, and the context extraction provided sufficient detail to answer the research question comprehensively.
|
|
242
|
+
|
|
243
|
+
### Would Use Again? (1-5)
|
|
244
|
+
|
|
245
|
+
**4** - Very useful for document exploration and targeted extraction. The semantic search with embeddings is significantly better than keyword grep. Main limitation is the manual iteration required to explore multiple concept angles. Would be 5/5 with batch operations and better result aggregation.
|
|
246
|
+
|
|
247
|
+
## Time & Efficiency
|
|
248
|
+
|
|
249
|
+
- Commands run: 41
|
|
250
|
+
- Compared to reading all files: **Much less** - Would have taken 2-3x longer to manually read and synthesize 23 documents (8,000+ lines estimated). The tool allowed targeted extraction of exactly the sections needed.
|