moflo 4.8.27 → 4.8.30

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (260) hide show
  1. package/.claude/agents/browser/browser-agent.yaml +182 -182
  2. package/.claude/agents/core/coder.md +265 -265
  3. package/.claude/agents/core/planner.md +167 -167
  4. package/.claude/agents/core/researcher.md +189 -189
  5. package/.claude/agents/core/reviewer.md +325 -325
  6. package/.claude/agents/core/tester.md +318 -318
  7. package/.claude/agents/database-specialist.yaml +21 -21
  8. package/.claude/agents/dual-mode/codex-coordinator.md +224 -224
  9. package/.claude/agents/dual-mode/codex-worker.md +211 -211
  10. package/.claude/agents/dual-mode/dual-orchestrator.md +291 -291
  11. package/.claude/agents/flow-nexus/app-store.md +88 -0
  12. package/.claude/agents/flow-nexus/authentication.md +69 -0
  13. package/.claude/agents/flow-nexus/challenges.md +81 -0
  14. package/.claude/agents/flow-nexus/neural-network.md +88 -0
  15. package/.claude/agents/flow-nexus/payments.md +83 -0
  16. package/.claude/agents/flow-nexus/sandbox.md +76 -0
  17. package/.claude/agents/flow-nexus/swarm.md +76 -0
  18. package/.claude/agents/flow-nexus/user-tools.md +96 -0
  19. package/.claude/agents/flow-nexus/workflow.md +84 -0
  20. package/.claude/agents/github/code-review-swarm.md +537 -537
  21. package/.claude/agents/github/github-modes.md +172 -172
  22. package/.claude/agents/github/issue-tracker.md +318 -318
  23. package/.claude/agents/github/multi-repo-swarm.md +552 -552
  24. package/.claude/agents/github/pr-manager.md +190 -190
  25. package/.claude/agents/github/project-board-sync.md +508 -508
  26. package/.claude/agents/github/release-manager.md +366 -366
  27. package/.claude/agents/github/release-swarm.md +582 -582
  28. package/.claude/agents/github/repo-architect.md +397 -397
  29. package/.claude/agents/github/swarm-issue.md +572 -572
  30. package/.claude/agents/github/swarm-pr.md +427 -427
  31. package/.claude/agents/github/sync-coordinator.md +451 -451
  32. package/.claude/agents/github/workflow-automation.md +634 -634
  33. package/.claude/agents/goal/code-goal-planner.md +445 -445
  34. package/.claude/agents/hive-mind/collective-intelligence-coordinator.md +129 -129
  35. package/.claude/agents/hive-mind/queen-coordinator.md +202 -202
  36. package/.claude/agents/hive-mind/scout-explorer.md +241 -241
  37. package/.claude/agents/hive-mind/swarm-memory-manager.md +192 -192
  38. package/.claude/agents/hive-mind/worker-specialist.md +216 -216
  39. package/.claude/agents/index.yaml +17 -17
  40. package/.claude/agents/neural/safla-neural.md +73 -73
  41. package/.claude/agents/payments/agentic-payments.md +126 -0
  42. package/.claude/agents/project-coordinator.yaml +15 -15
  43. package/.claude/agents/python-specialist.yaml +21 -21
  44. package/.claude/agents/reasoning/goal-planner.md +72 -72
  45. package/.claude/agents/security-auditor.yaml +20 -20
  46. package/.claude/agents/sona/sona-learning-optimizer.md +74 -0
  47. package/.claude/agents/sublinear/consensus-coordinator.md +338 -0
  48. package/.claude/agents/sublinear/matrix-optimizer.md +185 -0
  49. package/.claude/agents/sublinear/pagerank-analyzer.md +299 -0
  50. package/.claude/agents/sublinear/performance-optimizer.md +368 -0
  51. package/.claude/agents/sublinear/trading-predictor.md +246 -0
  52. package/.claude/agents/swarm/adaptive-coordinator.md +395 -395
  53. package/.claude/agents/swarm/hierarchical-coordinator.md +326 -326
  54. package/.claude/agents/swarm/mesh-coordinator.md +391 -391
  55. package/.claude/agents/templates/migration-plan.md +745 -745
  56. package/.claude/agents/typescript-specialist.yaml +21 -21
  57. package/.claude/agents/v3/adr-architect.md +184 -0
  58. package/.claude/agents/v3/aidefence-guardian.md +282 -0
  59. package/.claude/agents/v3/claims-authorizer.md +208 -0
  60. package/.claude/agents/v3/collective-intelligence-coordinator.md +993 -0
  61. package/.claude/agents/v3/ddd-domain-expert.md +220 -0
  62. package/.claude/agents/v3/injection-analyst.md +236 -0
  63. package/.claude/agents/v3/memory-specialist.md +995 -0
  64. package/.claude/agents/v3/performance-engineer.md +1233 -0
  65. package/.claude/agents/v3/pii-detector.md +151 -0
  66. package/.claude/agents/v3/reasoningbank-learner.md +213 -0
  67. package/.claude/agents/v3/security-architect-aidefence.md +410 -0
  68. package/.claude/agents/v3/security-architect.md +867 -0
  69. package/.claude/agents/v3/security-auditor.md +771 -0
  70. package/.claude/agents/v3/sparc-orchestrator.md +182 -0
  71. package/.claude/agents/v3/swarm-memory-manager.md +157 -0
  72. package/.claude/agents/v3/v3-integration-architect.md +205 -0
  73. package/.claude/checkpoints/1767754460.json +8 -8
  74. package/.claude/commands/agents/agent-spawning.md +28 -28
  75. package/.claude/commands/analysis/COMMAND_COMPLIANCE_REPORT.md +54 -0
  76. package/.claude/commands/analysis/README.md +9 -0
  77. package/.claude/commands/analysis/bottleneck-detect.md +162 -0
  78. package/.claude/commands/analysis/performance-bottlenecks.md +59 -0
  79. package/.claude/commands/analysis/performance-report.md +25 -0
  80. package/.claude/commands/analysis/token-efficiency.md +45 -0
  81. package/.claude/commands/analysis/token-usage.md +25 -0
  82. package/.claude/commands/automation/README.md +9 -0
  83. package/.claude/commands/automation/auto-agent.md +122 -0
  84. package/.claude/commands/automation/self-healing.md +106 -0
  85. package/.claude/commands/automation/session-memory.md +90 -0
  86. package/.claude/commands/automation/smart-agents.md +73 -0
  87. package/.claude/commands/automation/smart-spawn.md +25 -0
  88. package/.claude/commands/automation/workflow-select.md +25 -0
  89. package/.claude/commands/github/github-modes.md +146 -146
  90. package/.claude/commands/github/github-swarm.md +121 -121
  91. package/.claude/commands/github/issue-tracker.md +291 -291
  92. package/.claude/commands/github/pr-manager.md +169 -169
  93. package/.claude/commands/github/release-manager.md +337 -337
  94. package/.claude/commands/github/repo-architect.md +366 -366
  95. package/.claude/commands/github/sync-coordinator.md +300 -300
  96. package/.claude/commands/memory/neural.md +47 -47
  97. package/.claude/commands/monitoring/README.md +9 -0
  98. package/.claude/commands/monitoring/agent-metrics.md +25 -0
  99. package/.claude/commands/monitoring/agents.md +44 -0
  100. package/.claude/commands/monitoring/real-time-view.md +25 -0
  101. package/.claude/commands/monitoring/status.md +46 -0
  102. package/.claude/commands/monitoring/swarm-monitor.md +25 -0
  103. package/.claude/commands/optimization/README.md +9 -0
  104. package/.claude/commands/optimization/auto-topology.md +62 -0
  105. package/.claude/commands/optimization/cache-manage.md +25 -0
  106. package/.claude/commands/optimization/parallel-execute.md +25 -0
  107. package/.claude/commands/optimization/parallel-execution.md +50 -0
  108. package/.claude/commands/optimization/topology-optimize.md +25 -0
  109. package/.claude/commands/sparc/analyzer.md +51 -51
  110. package/.claude/commands/sparc/architect.md +53 -53
  111. package/.claude/commands/sparc/ask.md +97 -97
  112. package/.claude/commands/sparc/batch-executor.md +54 -54
  113. package/.claude/commands/sparc/code.md +89 -89
  114. package/.claude/commands/sparc/coder.md +54 -54
  115. package/.claude/commands/sparc/debug.md +83 -83
  116. package/.claude/commands/sparc/debugger.md +54 -54
  117. package/.claude/commands/sparc/designer.md +53 -53
  118. package/.claude/commands/sparc/devops.md +109 -109
  119. package/.claude/commands/sparc/docs-writer.md +80 -80
  120. package/.claude/commands/sparc/documenter.md +54 -54
  121. package/.claude/commands/sparc/innovator.md +54 -54
  122. package/.claude/commands/sparc/integration.md +83 -83
  123. package/.claude/commands/sparc/mcp.md +117 -117
  124. package/.claude/commands/sparc/memory-manager.md +54 -54
  125. package/.claude/commands/sparc/optimizer.md +54 -54
  126. package/.claude/commands/sparc/orchestrator.md +131 -131
  127. package/.claude/commands/sparc/post-deployment-monitoring-mode.md +83 -83
  128. package/.claude/commands/sparc/refinement-optimization-mode.md +83 -83
  129. package/.claude/commands/sparc/researcher.md +54 -54
  130. package/.claude/commands/sparc/reviewer.md +54 -54
  131. package/.claude/commands/sparc/security-review.md +80 -80
  132. package/.claude/commands/sparc/sparc-modes.md +174 -174
  133. package/.claude/commands/sparc/sparc.md +111 -111
  134. package/.claude/commands/sparc/spec-pseudocode.md +80 -80
  135. package/.claude/commands/sparc/supabase-admin.md +348 -348
  136. package/.claude/commands/sparc/swarm-coordinator.md +54 -54
  137. package/.claude/commands/sparc/tdd.md +54 -54
  138. package/.claude/commands/sparc/tester.md +54 -54
  139. package/.claude/commands/sparc/tutorial.md +79 -79
  140. package/.claude/commands/sparc/workflow-manager.md +54 -54
  141. package/.claude/commands/sparc.md +166 -166
  142. package/.claude/commands/swarm/analysis.md +95 -95
  143. package/.claude/commands/swarm/development.md +96 -96
  144. package/.claude/commands/swarm/examples.md +168 -168
  145. package/.claude/commands/swarm/maintenance.md +102 -102
  146. package/.claude/commands/swarm/optimization.md +117 -117
  147. package/.claude/commands/swarm/research.md +136 -136
  148. package/.claude/commands/swarm/testing.md +131 -131
  149. package/.claude/commands/workflows/development.md +77 -77
  150. package/.claude/commands/workflows/research.md +62 -62
  151. package/.claude/guidance/moflo-bootstrap.md +126 -126
  152. package/.claude/guidance/shipped/agent-bootstrap.md +148 -143
  153. package/.claude/guidance/shipped/guidance-memory-strategy.md +262 -262
  154. package/.claude/guidance/shipped/memory-strategy.md +204 -204
  155. package/.claude/guidance/shipped/moflo.md +668 -675
  156. package/.claude/guidance/shipped/task-icons.md +42 -0
  157. package/.claude/guidance/shipped/task-swarm-integration.md +441 -441
  158. package/.claude/helpers/gate-hook.mjs +50 -0
  159. package/.claude/helpers/gate.cjs +138 -0
  160. package/.claude/helpers/hook-handler.cjs +76 -0
  161. package/.claude/helpers/intelligence.cjs +207 -207
  162. package/.claude/helpers/prompt-hook.mjs +72 -0
  163. package/.claude/helpers/statusline.cjs +851 -851
  164. package/.claude/scripts/build-embeddings.mjs +549 -0
  165. package/.claude/scripts/generate-code-map.mjs +776 -0
  166. package/.claude/scripts/hooks.mjs +656 -0
  167. package/.claude/scripts/index-guidance.mjs +893 -0
  168. package/.claude/scripts/index-tests.mjs +710 -0
  169. package/.claude/scripts/semantic-search.mjs +473 -0
  170. package/.claude/scripts/session-start-launcher.mjs +238 -0
  171. package/.claude/settings.local.json +18 -0
  172. package/.claude/skills/fl/SKILL.md +583 -583
  173. package/.claude/skills/flo/SKILL.md +583 -583
  174. package/.claude/skills/github-code-review/SKILL.md +1140 -1140
  175. package/.claude/skills/github-multi-repo/SKILL.md +874 -874
  176. package/.claude/skills/github-project-management/SKILL.md +1277 -1277
  177. package/.claude/skills/github-release-management/SKILL.md +1081 -1081
  178. package/.claude/skills/github-workflow-automation/SKILL.md +1065 -1065
  179. package/.claude/skills/hive-mind-advanced/SKILL.md +712 -712
  180. package/.claude/skills/hooks-automation/SKILL.md +1201 -1201
  181. package/.claude/skills/pair-programming/SKILL.md +1202 -0
  182. package/.claude/skills/performance-analysis/SKILL.md +563 -563
  183. package/.claude/skills/sparc-methodology/SKILL.md +1115 -1115
  184. package/.claude/skills/stream-chain/SKILL.md +563 -0
  185. package/.claude/skills/swarm-advanced/SKILL.md +973 -973
  186. package/.claude/skills/v3-cli-modernization/SKILL.md +872 -0
  187. package/.claude/skills/v3-core-implementation/SKILL.md +797 -0
  188. package/.claude/skills/v3-ddd-architecture/SKILL.md +442 -0
  189. package/.claude/skills/v3-integration-deep/SKILL.md +241 -0
  190. package/.claude/skills/v3-mcp-optimization/SKILL.md +777 -0
  191. package/.claude/skills/v3-memory-unification/SKILL.md +174 -0
  192. package/.claude/skills/v3-performance-optimization/SKILL.md +390 -0
  193. package/.claude/skills/v3-security-overhaul/SKILL.md +82 -0
  194. package/.claude/skills/v3-swarm-coordination/SKILL.md +340 -0
  195. package/.claude/workflow-state.json +5 -5
  196. package/LICENSE +21 -21
  197. package/README.md +698 -685
  198. package/bin/cli.js +0 -0
  199. package/bin/gate-hook.mjs +50 -50
  200. package/bin/gate.cjs +138 -138
  201. package/bin/generate-code-map.mjs +956 -938
  202. package/bin/hook-handler.cjs +83 -83
  203. package/bin/hooks.mjs +696 -696
  204. package/bin/index-guidance.mjs +906 -893
  205. package/bin/index-tests.mjs +729 -710
  206. package/bin/lib/process-manager.mjs +256 -256
  207. package/bin/lib/registry-cleanup.cjs +41 -41
  208. package/bin/prompt-hook.mjs +72 -72
  209. package/bin/semantic-search.mjs +472 -472
  210. package/bin/session-start-launcher.mjs +238 -238
  211. package/bin/setup-project.mjs +253 -251
  212. package/package.json +123 -123
  213. package/src/@claude-flow/cli/README.md +452 -452
  214. package/src/@claude-flow/cli/bin/cli.js +180 -180
  215. package/src/@claude-flow/cli/bin/preinstall.cjs +2 -2
  216. package/src/@claude-flow/cli/dist/src/commands/completions.js +409 -409
  217. package/src/@claude-flow/cli/dist/src/commands/doctor.js +156 -3
  218. package/src/@claude-flow/cli/dist/src/commands/embeddings.js +25 -25
  219. package/src/@claude-flow/cli/dist/src/commands/github.js +61 -61
  220. package/src/@claude-flow/cli/dist/src/commands/hive-mind.js +90 -90
  221. package/src/@claude-flow/cli/dist/src/commands/hooks.js +9 -9
  222. package/src/@claude-flow/cli/dist/src/commands/init.js +3 -6
  223. package/src/@claude-flow/cli/dist/src/commands/ruvector/import.js +14 -14
  224. package/src/@claude-flow/cli/dist/src/commands/ruvector/setup.js +624 -624
  225. package/src/@claude-flow/cli/dist/src/config/moflo-config.d.ts +3 -0
  226. package/src/@claude-flow/cli/dist/src/config/moflo-config.js +101 -91
  227. package/src/@claude-flow/cli/dist/src/index.d.ts +5 -0
  228. package/src/@claude-flow/cli/dist/src/index.js +44 -0
  229. package/src/@claude-flow/cli/dist/src/init/claudemd-generator.d.ts +29 -29
  230. package/src/@claude-flow/cli/dist/src/init/claudemd-generator.js +89 -87
  231. package/src/@claude-flow/cli/dist/src/init/executor.js +453 -453
  232. package/src/@claude-flow/cli/dist/src/init/helpers-generator.js +482 -482
  233. package/src/@claude-flow/cli/dist/src/init/moflo-init.d.ts +30 -30
  234. package/src/@claude-flow/cli/dist/src/init/moflo-init.js +904 -848
  235. package/src/@claude-flow/cli/dist/src/init/statusline-generator.js +876 -876
  236. package/src/@claude-flow/cli/dist/src/mcp-tools/hooks-tools.js +3 -3
  237. package/src/@claude-flow/cli/dist/src/memory/memory-initializer.js +371 -371
  238. package/src/@claude-flow/cli/dist/src/runtime/headless.js +28 -28
  239. package/src/@claude-flow/cli/dist/src/services/container-worker-pool.d.ts +197 -0
  240. package/src/@claude-flow/cli/dist/src/services/container-worker-pool.js +584 -0
  241. package/src/@claude-flow/cli/dist/src/services/daemon-lock.d.ts +14 -0
  242. package/src/@claude-flow/cli/dist/src/services/daemon-lock.js +1 -1
  243. package/src/@claude-flow/cli/dist/src/services/headless-worker-executor.js +84 -84
  244. package/src/@claude-flow/cli/package.json +1 -1
  245. package/src/@claude-flow/guidance/README.md +1195 -1195
  246. package/src/@claude-flow/guidance/package.json +198 -198
  247. package/src/@claude-flow/memory/README.md +587 -587
  248. package/src/@claude-flow/memory/dist/agentdb-backend.js +26 -26
  249. package/src/@claude-flow/memory/dist/auto-memory-bridge.test.js +27 -27
  250. package/src/@claude-flow/memory/dist/hybrid-backend.d.ts +245 -0
  251. package/src/@claude-flow/memory/dist/hybrid-backend.js +569 -0
  252. package/src/@claude-flow/memory/dist/hybrid-backend.test.d.ts +8 -0
  253. package/src/@claude-flow/memory/dist/hybrid-backend.test.js +320 -0
  254. package/src/@claude-flow/memory/dist/sqlite-backend.d.ts +121 -0
  255. package/src/@claude-flow/memory/dist/sqlite-backend.js +572 -0
  256. package/src/@claude-flow/memory/dist/sqljs-backend.js +26 -26
  257. package/src/@claude-flow/memory/package.json +44 -44
  258. package/src/@claude-flow/shared/README.md +323 -323
  259. package/src/@claude-flow/shared/dist/events/event-store.js +31 -31
  260. package/src/README.md +493 -493
@@ -1,262 +1,262 @@
1
- # Guidance & Memory Tuning Strategy
2
-
3
- **Purpose:** How to build and tune a RAG-based guidance system using moflo's semantic search, embedding pipeline, and indexing. Reference when creating guidance documents, troubleshooting search quality, or extending the system.
4
-
5
- ---
6
-
7
- ## Problem Statement
8
-
9
- Claude Code agents need project-specific knowledge — coding rules, architecture patterns, entity templates, testing conventions — delivered at the right moment. Without a retrieval system, agents either miss critical rules or require massive CLAUDE.md files that waste context window tokens.
10
-
11
- **Goals:**
12
- - Agents find relevant guidance automatically via semantic search
13
- - Subagents spawned by the coordinator inherit memory access
14
- - Search quality is high enough that agents don't need to read whole files
15
- - The system survives `npm install` (indexing runs on session start)
16
-
17
- ---
18
-
19
- ## Architecture
20
-
21
- Three layers: embedding generation, vector storage, and search.
22
-
23
- ```
24
- Source Files (.claude/guidance/*.md, docs/*.md)
25
- |
26
- v
27
- index-guidance.mjs --- Chunk on ## headers, build RAG links
28
- | (prev/next, siblings, parent/child, context overlap)
29
- v
30
- .swarm/memory.db ----- SQLite (entries + metadata + embedding vectors)
31
- |
32
- v
33
- build-embeddings.mjs - Generate 384-dim vectors per entry
34
- | (Xenova/all-MiniLM-L6-v2 neural, or domain-aware hash fallback)
35
- v
36
- RuVector (@ruvector/core) -- HNSW index infrastructure
37
- v
38
- Search layer ---------- Three access paths:
39
- 1. MCP tools (mcp__moflo__memory_search) -- preferred
40
- 2. CLI (npx flo memory search) -- fallback
41
- 3. Script (semantic-search.mjs) -- detailed output
42
- ```
43
-
44
- **Key files:**
45
-
46
- | File | Role |
47
- |------|------|
48
- | `.claude/guidance/*.md` | Guidance documents (source of truth) |
49
- | `bin/index-guidance.mjs` | Chunks documents, stores in SQLite with RAG metadata |
50
- | `bin/build-embeddings.mjs` | Generates vector embeddings (neural or hash) |
51
- | `.swarm/memory.db` | SQLite database with entries, metadata, embeddings |
52
- | `@ruvector/core` | HNSW vector index, WASM fallback, SIMD operations |
53
-
54
- ---
55
-
56
- ## Guidance Document Optimization Rules
57
-
58
- These rules determine how well your guidance documents retrieve via semantic search:
59
-
60
- ### 1. Every file needs a Purpose line
61
-
62
- Add `**Purpose:**` as the first meaningful line after the title. Claude checks this first for relevance scoring. Without it, the chunk has no summary signal.
63
-
64
- ### 2. H2 headings are the primary retrieval signal
65
-
66
- The indexer splits on `##`. Each heading becomes the chunk title, prepended to searchable content. Domain-specific keywords in headings dramatically improve recall.
67
-
68
- **Bad:** `## Overview`, `## Rules`, `## Pattern`
69
- **Good:** `## Soft Delete Rules`, `## JWT Authentication Pattern`, `## Database Entity Migration`
70
-
71
- ### 3. Ideal chunk size: 1000-4000 characters
72
-
73
- Below 50 chars the chunk is dropped. Above 6000 the indexer force-splits on paragraphs, which breaks mid-thought. The sweet spot produces focused embeddings.
74
-
75
- ### 4. Self-contained chunks
76
-
77
- Each H2 section must answer a question without needing the rest of the document. Include: the rule, a code example, and a cross-reference.
78
-
79
- ### 5. Tables over prose
80
-
81
- Claude parses structured data more accurately than paragraphs. DO/DON'T tables, field reference tables, and command tables all retrieve better.
82
-
83
- ### 6. Cross-references create a navigation graph
84
-
85
- The RAG indexer stores `prevChunk`/`nextChunk`/`siblings` metadata. Cross-references between documents let Claude follow chains: `core.md -> coding-rules.md -> database.md`.
86
-
87
- ### 7. No decorative formatting
88
-
89
- ASCII boxes, excessive emoji, rhetorical questions, and motivational text all waste tokens without improving retrieval or comprehension.
90
-
91
- ---
92
-
93
- ## Embedding Pipeline
94
-
95
- ### Embedding Models
96
-
97
- | Model | Quality | Speed | When Used |
98
- |-------|---------|-------|-----------|
99
- | `Xenova/all-MiniLM-L6-v2` | High (true semantic) | ~3s for 1000 entries | Primary — `build-embeddings.mjs` uses this |
100
- | `domain-aware-hash-v1` | Good (domain clustering) | <1s for 1000 entries | Fallback when Transformers.js unavailable |
101
-
102
- **Neural embeddings (Xenova/all-MiniLM-L6-v2):**
103
- - Uses `@xenova/transformers` with ONNX WASM runtime
104
- - 384-dimensional vectors, L2-normalized
105
- - True semantic understanding — "soft delete" matches "mark as deleted" without keyword overlap
106
- - Loaded lazily on first use, cached for subsequent queries
107
- - Ships with moflo; no additional install needed
108
-
109
- **Domain-aware hash embeddings (fallback):**
110
- - Custom SimHash-style algorithm with 12 domain clusters
111
- - Domain clusters group related terms: `database` (orm, postgresql, entity, schema...), `frontend` (react, component, css...), `testing` (vitest, mock, expect...), etc.
112
- - Multi-position hashing with bigram/trigram features
113
- - Good at keyword-level matching but misses semantic paraphrases
114
- - No external dependencies — always available
115
-
116
- ### The Embedding Alignment Problem
117
-
118
- **Critical rule:** Query embeddings MUST match stored embeddings. Computing cosine similarity between vectors from different models produces meaningless scores.
119
-
120
- Both the search scripts and the MCP memory tools auto-detect the stored embedding model:
121
-
122
- ```javascript
123
- // Check what model stored entries predominantly use
124
- const modelCheck = db.prepare(
125
- `SELECT embedding_model, COUNT(*) as cnt FROM memory_entries
126
- WHERE status = 'active' AND embedding IS NOT NULL
127
- GROUP BY embedding_model ORDER BY cnt DESC LIMIT 1`
128
- ).get();
129
-
130
- // If stored embeddings are neural, use neural for query too
131
- ```
132
-
133
- Search also **filters out entries with mismatched `embedding_model`** — if the query uses neural embeddings, hash-embedded entries are skipped (and vice versa).
134
-
135
- ### Domain Cluster Tuning
136
-
137
- The hash fallback's domain clusters can be extended with project-specific terms. Add terms to the relevant cluster in the hash embedding function to improve keyword-level matching for your domain:
138
-
139
- | Cluster | Example Terms |
140
- |---------|--------------|
141
- | `database` | your ORM, database engine, schema terms |
142
- | `frontend` | UI framework, component library terms |
143
- | `backend` | DI container, API framework terms |
144
- | `testing` | test framework, assertion library terms |
145
- | `security` | auth system, permission model terms |
146
-
147
- ---
148
-
149
- ## RAG Indexing Pipeline
150
-
151
- ### How `index-guidance.mjs` Works
152
-
153
- 1. **Scan** configured directories for `.md` files
154
- 2. **Hash check** — Skip files whose content hash hasn't changed (unless `--force`)
155
- 3. **Store full document** as `doc-{prefix}-{name}` (for complete retrieval)
156
- 4. **Chunk on `##` headers** — Each H2 section becomes a separate entry
157
- 5. **H3 subsections** become child chunks with parent H2 as context prefix
158
- 6. **Force-split** sections over 4000 chars on paragraph boundaries
159
- 7. **Build RAG metadata** for every chunk:
160
-
161
- | Metadata Field | Purpose |
162
- |---------------|---------|
163
- | `parentDoc` | Link back to full document |
164
- | `prevChunk` / `nextChunk` | Sequential navigation |
165
- | `siblings` | All chunk keys from same document |
166
- | `hierarchicalParent` / `hierarchicalChildren` | H2->H3 relationships |
167
- | `contextBefore` / `contextAfter` | 20% overlapping text from adjacent chunks |
168
-
169
- 8. **Prepend context** — Each chunk's searchable content includes overlap from neighbors
170
- 9. **Stale cleanup** — After indexing, remove entries for files that no longer exist on disk
171
- 10. **Background embedding** — Spawn `build-embeddings.mjs` in background to generate vectors
172
-
173
- ### Configuring Indexed Directories
174
-
175
- In `moflo.yaml`:
176
-
177
- ```yaml
178
- guidance:
179
- directories:
180
- - .claude/guidance
181
- - docs/guides
182
- ```
183
-
184
- Default directories (when no config): `.claude/guidance`, `docs/guides`
185
-
186
- Moflo also automatically indexes its own bundled guidance from `node_modules/moflo/.claude/guidance/` when installed as a library in a consumer project.
187
-
188
- ---
189
-
190
- ## Lessons Learned
191
-
192
- ### Document Optimization
193
-
194
- 1. **`**Purpose:**` lines are critical** — They're the single highest-impact addition for retrieval quality.
195
- 2. **Headings are embeddings** — In a chunk-per-section system, the heading IS the embedding's primary signal. Generic headings are nearly useless.
196
- 3. **Tables retrieve better than prose** — Claude parses structured data with higher accuracy.
197
- 4. **Cross-references are the RAG graph** — Isolated documents can't be navigated.
198
- 5. **Chunk size matters** — A 10,000-char section produces a diluted embedding. Splitting into focused sections triples the chance of matching specific queries.
199
-
200
- ### Embedding Pipeline
201
-
202
- 6. **Query embeddings MUST match stored embeddings** — This is the single most critical rule. Auto-detect and match.
203
- 7. **Domain clusters need project-specific terms** — Generic NLP clusters miss project-specific terminology. Adding terms to domain clusters dramatically improves keyword-level matching.
204
- 8. **Filter mismatched entries during search** — Mixed databases need explicit filtering by `embedding_model`.
205
-
206
- ---
207
-
208
- ## Replication Guide
209
-
210
- To set up this system in a new project using moflo:
211
-
212
- ### 1. Install Moflo
213
-
214
- ```bash
215
- npm install moflo
216
- npx flo init
217
- ```
218
-
219
- ### 2. Create Guidance Documents
220
-
221
- Create `.claude/guidance/` directory with markdown files following the optimization rules above:
222
- - Every file has `**Purpose:**` line
223
- - H2 sections with domain keywords in headings
224
- - Tables for structured rules
225
- - Cross-references between related docs
226
- - 1000-4000 char sections
227
-
228
- ### 3. Configure Indexing
229
-
230
- In `moflo.yaml`:
231
-
232
- ```yaml
233
- guidance:
234
- directories:
235
- - .claude/guidance
236
- - docs/guides
237
-
238
- auto_index:
239
- guidance: true
240
- code_map: true
241
- ```
242
-
243
- ### 4. Index and Verify
244
-
245
- ```bash
246
- # Index documents
247
- npx flo-index --force
248
-
249
- # Test search quality
250
- npx flo memory search --query "your domain query" --namespace guidance
251
-
252
- # Verify from Claude Code via MCP
253
- # mcp__moflo__memory_search query="your domain query" namespace="guidance"
254
- ```
255
-
256
- ---
257
-
258
- ## See Also
259
-
260
- - `.claude/guidance/memory-strategy.md` - Memory architecture and search commands
261
- - `.claude/guidance/agent-bootstrap.md` - Subagent bootstrap guide
262
- - `.claude/guidance/moflo.md` - Full CLI/MCP reference
1
+ # Guidance & Memory Tuning Strategy
2
+
3
+ **Purpose:** How to build and tune a RAG-based guidance system using moflo's semantic search, embedding pipeline, and indexing. Reference when creating guidance documents, troubleshooting search quality, or extending the system.
4
+
5
+ ---
6
+
7
+ ## Problem Statement
8
+
9
+ Claude Code agents need project-specific knowledge — coding rules, architecture patterns, entity templates, testing conventions — delivered at the right moment. Without a retrieval system, agents either miss critical rules or require massive CLAUDE.md files that waste context window tokens.
10
+
11
+ **Goals:**
12
+ - Agents find relevant guidance automatically via semantic search
13
+ - Subagents spawned by the coordinator inherit memory access
14
+ - Search quality is high enough that agents don't need to read whole files
15
+ - The system survives `npm install` (indexing runs on session start)
16
+
17
+ ---
18
+
19
+ ## Architecture
20
+
21
+ Three layers: embedding generation, vector storage, and search.
22
+
23
+ ```
24
+ Source Files (.claude/guidance/*.md, docs/*.md)
25
+ |
26
+ v
27
+ index-guidance.mjs --- Chunk on ## headers, build RAG links
28
+ | (prev/next, siblings, parent/child, context overlap)
29
+ v
30
+ .swarm/memory.db ----- SQLite (entries + metadata + embedding vectors)
31
+ |
32
+ v
33
+ build-embeddings.mjs - Generate 384-dim vectors per entry
34
+ | (Xenova/all-MiniLM-L6-v2 neural, or domain-aware hash fallback)
35
+ v
36
+ RuVector (@ruvector/core) -- HNSW index infrastructure
37
+ v
38
+ Search layer ---------- Three access paths:
39
+ 1. MCP tools (mcp__moflo__memory_search) -- preferred
40
+ 2. CLI (npx flo memory search) -- fallback
41
+ 3. Script (semantic-search.mjs) -- detailed output
42
+ ```
43
+
44
+ **Key files:**
45
+
46
+ | File | Role |
47
+ |------|------|
48
+ | `.claude/guidance/*.md` | Guidance documents (source of truth) |
49
+ | `bin/index-guidance.mjs` | Chunks documents, stores in SQLite with RAG metadata |
50
+ | `bin/build-embeddings.mjs` | Generates vector embeddings (neural or hash) |
51
+ | `.swarm/memory.db` | SQLite database with entries, metadata, embeddings |
52
+ | `@ruvector/core` | HNSW vector index, WASM fallback, SIMD operations |
53
+
54
+ ---
55
+
56
+ ## Guidance Document Optimization Rules
57
+
58
+ These rules determine how well your guidance documents retrieve via semantic search:
59
+
60
+ ### 1. Every file needs a Purpose line
61
+
62
+ Add `**Purpose:**` as the first meaningful line after the title. Claude checks this first for relevance scoring. Without it, the chunk has no summary signal.
63
+
64
+ ### 2. H2 headings are the primary retrieval signal
65
+
66
+ The indexer splits on `##`. Each heading becomes the chunk title, prepended to searchable content. Domain-specific keywords in headings dramatically improve recall.
67
+
68
+ **Bad:** `## Overview`, `## Rules`, `## Pattern`
69
+ **Good:** `## Soft Delete Rules`, `## JWT Authentication Pattern`, `## Database Entity Migration`
70
+
71
+ ### 3. Ideal chunk size: 1000-4000 characters
72
+
73
+ Below 50 chars the chunk is dropped. Above 6000 the indexer force-splits on paragraphs, which breaks mid-thought. The sweet spot produces focused embeddings.
74
+
75
+ ### 4. Self-contained chunks
76
+
77
+ Each H2 section must answer a question without needing the rest of the document. Include: the rule, a code example, and a cross-reference.
78
+
79
+ ### 5. Tables over prose
80
+
81
+ Claude parses structured data more accurately than paragraphs. DO/DON'T tables, field reference tables, and command tables all retrieve better.
82
+
83
+ ### 6. Cross-references create a navigation graph
84
+
85
+ The RAG indexer stores `prevChunk`/`nextChunk`/`siblings` metadata. Cross-references between documents let Claude follow chains: `core.md -> coding-rules.md -> database.md`.
86
+
87
+ ### 7. No decorative formatting
88
+
89
+ ASCII boxes, excessive emoji, rhetorical questions, and motivational text all waste tokens without improving retrieval or comprehension.
90
+
91
+ ---
92
+
93
+ ## Embedding Pipeline
94
+
95
+ ### Embedding Models
96
+
97
+ | Model | Quality | Speed | When Used |
98
+ |-------|---------|-------|-----------|
99
+ | `Xenova/all-MiniLM-L6-v2` | High (true semantic) | ~3s for 1000 entries | Primary — `build-embeddings.mjs` uses this |
100
+ | `domain-aware-hash-v1` | Good (domain clustering) | <1s for 1000 entries | Fallback when Transformers.js unavailable |
101
+
102
+ **Neural embeddings (Xenova/all-MiniLM-L6-v2):**
103
+ - Uses `@xenova/transformers` with ONNX WASM runtime
104
+ - 384-dimensional vectors, L2-normalized
105
+ - True semantic understanding — "soft delete" matches "mark as deleted" without keyword overlap
106
+ - Loaded lazily on first use, cached for subsequent queries
107
+ - Ships with moflo; no additional install needed
108
+
109
+ **Domain-aware hash embeddings (fallback):**
110
+ - Custom SimHash-style algorithm with 12 domain clusters
111
+ - Domain clusters group related terms: `database` (orm, postgresql, entity, schema...), `frontend` (react, component, css...), `testing` (vitest, mock, expect...), etc.
112
+ - Multi-position hashing with bigram/trigram features
113
+ - Good at keyword-level matching but misses semantic paraphrases
114
+ - No external dependencies — always available
115
+
116
+ ### The Embedding Alignment Problem
117
+
118
+ **Critical rule:** Query embeddings MUST match stored embeddings. Computing cosine similarity between vectors from different models produces meaningless scores.
119
+
120
+ Both the search scripts and the MCP memory tools auto-detect the stored embedding model:
121
+
122
+ ```javascript
123
+ // Check what model stored entries predominantly use
124
+ const modelCheck = db.prepare(
125
+ `SELECT embedding_model, COUNT(*) as cnt FROM memory_entries
126
+ WHERE status = 'active' AND embedding IS NOT NULL
127
+ GROUP BY embedding_model ORDER BY cnt DESC LIMIT 1`
128
+ ).get();
129
+
130
+ // If stored embeddings are neural, use neural for query too
131
+ ```
132
+
133
+ Search also **filters out entries with mismatched `embedding_model`** — if the query uses neural embeddings, hash-embedded entries are skipped (and vice versa).
134
+
135
+ ### Domain Cluster Tuning
136
+
137
+ The hash fallback's domain clusters can be extended with project-specific terms. Add terms to the relevant cluster in the hash embedding function to improve keyword-level matching for your domain:
138
+
139
+ | Cluster | Example Terms |
140
+ |---------|--------------|
141
+ | `database` | your ORM, database engine, schema terms |
142
+ | `frontend` | UI framework, component library terms |
143
+ | `backend` | DI container, API framework terms |
144
+ | `testing` | test framework, assertion library terms |
145
+ | `security` | auth system, permission model terms |
146
+
147
+ ---
148
+
149
+ ## RAG Indexing Pipeline
150
+
151
+ ### How `index-guidance.mjs` Works
152
+
153
+ 1. **Scan** configured directories for `.md` files
154
+ 2. **Hash check** — Skip files whose content hash hasn't changed (unless `--force`)
155
+ 3. **Store full document** as `doc-{prefix}-{name}` (for complete retrieval)
156
+ 4. **Chunk on `##` headers** — Each H2 section becomes a separate entry
157
+ 5. **H3 subsections** become child chunks with parent H2 as context prefix
158
+ 6. **Force-split** sections over 4000 chars on paragraph boundaries
159
+ 7. **Build RAG metadata** for every chunk:
160
+
161
+ | Metadata Field | Purpose |
162
+ |---------------|---------|
163
+ | `parentDoc` | Link back to full document |
164
+ | `prevChunk` / `nextChunk` | Sequential navigation |
165
+ | `siblings` | All chunk keys from same document |
166
+ | `hierarchicalParent` / `hierarchicalChildren` | H2->H3 relationships |
167
+ | `contextBefore` / `contextAfter` | 20% overlapping text from adjacent chunks |
168
+
169
+ 8. **Prepend context** — Each chunk's searchable content includes overlap from neighbors
170
+ 9. **Stale cleanup** — After indexing, remove entries for files that no longer exist on disk
171
+ 10. **Background embedding** — Spawn `build-embeddings.mjs` in background to generate vectors
172
+
173
+ ### Configuring Indexed Directories
174
+
175
+ In `moflo.yaml`:
176
+
177
+ ```yaml
178
+ guidance:
179
+ directories:
180
+ - .claude/guidance
181
+ - docs/guides
182
+ ```
183
+
184
+ Default directories (when no config): `.claude/guidance`, `docs/guides`
185
+
186
+ Moflo also automatically indexes its own bundled guidance from `node_modules/moflo/.claude/guidance/` when installed as a library in a consumer project.
187
+
188
+ ---
189
+
190
+ ## Lessons Learned
191
+
192
+ ### Document Optimization
193
+
194
+ 1. **`**Purpose:**` lines are critical** — They're the single highest-impact addition for retrieval quality.
195
+ 2. **Headings are embeddings** — In a chunk-per-section system, the heading IS the embedding's primary signal. Generic headings are nearly useless.
196
+ 3. **Tables retrieve better than prose** — Claude parses structured data with higher accuracy.
197
+ 4. **Cross-references are the RAG graph** — Isolated documents can't be navigated.
198
+ 5. **Chunk size matters** — A 10,000-char section produces a diluted embedding. Splitting into focused sections triples the chance of matching specific queries.
199
+
200
+ ### Embedding Pipeline
201
+
202
+ 6. **Query embeddings MUST match stored embeddings** — This is the single most critical rule. Auto-detect and match.
203
+ 7. **Domain clusters need project-specific terms** — Generic NLP clusters miss project-specific terminology. Adding terms to domain clusters dramatically improves keyword-level matching.
204
+ 8. **Filter mismatched entries during search** — Mixed databases need explicit filtering by `embedding_model`.
205
+
206
+ ---
207
+
208
+ ## Replication Guide
209
+
210
+ To set up this system in a new project using moflo:
211
+
212
+ ### 1. Install Moflo
213
+
214
+ ```bash
215
+ npm install moflo
216
+ npx flo init
217
+ ```
218
+
219
+ ### 2. Create Guidance Documents
220
+
221
+ Create `.claude/guidance/` directory with markdown files following the optimization rules above:
222
+ - Every file has `**Purpose:**` line
223
+ - H2 sections with domain keywords in headings
224
+ - Tables for structured rules
225
+ - Cross-references between related docs
226
+ - 1000-4000 char sections
227
+
228
+ ### 3. Configure Indexing
229
+
230
+ In `moflo.yaml`:
231
+
232
+ ```yaml
233
+ guidance:
234
+ directories:
235
+ - .claude/guidance
236
+ - docs/guides
237
+
238
+ auto_index:
239
+ guidance: true
240
+ code_map: true
241
+ ```
242
+
243
+ ### 4. Index and Verify
244
+
245
+ ```bash
246
+ # Index documents
247
+ npx flo-index --force
248
+
249
+ # Test search quality
250
+ npx flo memory search --query "your domain query" --namespace guidance
251
+
252
+ # Verify from Claude Code via MCP
253
+ # mcp__moflo__memory_search query="your domain query" namespace="guidance"
254
+ ```
255
+
256
+ ---
257
+
258
+ ## See Also
259
+
260
+ - `.claude/guidance/memory-strategy.md` - Memory architecture and search commands
261
+ - `.claude/guidance/agent-bootstrap.md` - Subagent bootstrap guide
262
+ - `.claude/guidance/moflo.md` - Full CLI/MCP reference