moflo 4.8.21 → 4.8.23

Files changed (178)
  1. package/.claude/agents/browser/browser-agent.yaml +182 -182
  2. package/.claude/agents/core/coder.md +265 -265
  3. package/.claude/agents/core/planner.md +167 -167
  4. package/.claude/agents/core/researcher.md +189 -189
  5. package/.claude/agents/core/reviewer.md +325 -325
  6. package/.claude/agents/core/tester.md +318 -318
  7. package/.claude/agents/database-specialist.yaml +21 -21
  8. package/.claude/agents/dual-mode/codex-coordinator.md +224 -224
  9. package/.claude/agents/dual-mode/codex-worker.md +211 -211
  10. package/.claude/agents/dual-mode/dual-orchestrator.md +291 -291
  11. package/.claude/agents/github/code-review-swarm.md +537 -537
  12. package/.claude/agents/github/github-modes.md +172 -172
  13. package/.claude/agents/github/issue-tracker.md +318 -318
  14. package/.claude/agents/github/multi-repo-swarm.md +552 -552
  15. package/.claude/agents/github/pr-manager.md +190 -190
  16. package/.claude/agents/github/project-board-sync.md +508 -508
  17. package/.claude/agents/github/release-manager.md +366 -366
  18. package/.claude/agents/github/release-swarm.md +582 -582
  19. package/.claude/agents/github/repo-architect.md +397 -397
  20. package/.claude/agents/github/swarm-issue.md +572 -572
  21. package/.claude/agents/github/swarm-pr.md +427 -427
  22. package/.claude/agents/github/sync-coordinator.md +451 -451
  23. package/.claude/agents/github/workflow-automation.md +634 -634
  24. package/.claude/agents/goal/code-goal-planner.md +445 -445
  25. package/.claude/agents/hive-mind/collective-intelligence-coordinator.md +129 -129
  26. package/.claude/agents/hive-mind/queen-coordinator.md +202 -202
  27. package/.claude/agents/hive-mind/scout-explorer.md +241 -241
  28. package/.claude/agents/hive-mind/swarm-memory-manager.md +192 -192
  29. package/.claude/agents/hive-mind/worker-specialist.md +216 -216
  30. package/.claude/agents/index.yaml +17 -17
  31. package/.claude/agents/neural/safla-neural.md +73 -73
  32. package/.claude/agents/project-coordinator.yaml +15 -15
  33. package/.claude/agents/python-specialist.yaml +21 -21
  34. package/.claude/agents/reasoning/goal-planner.md +72 -72
  35. package/.claude/agents/security-auditor.yaml +20 -20
  36. package/.claude/agents/swarm/adaptive-coordinator.md +395 -395
  37. package/.claude/agents/swarm/hierarchical-coordinator.md +326 -326
  38. package/.claude/agents/swarm/mesh-coordinator.md +391 -391
  39. package/.claude/agents/templates/migration-plan.md +745 -745
  40. package/.claude/agents/typescript-specialist.yaml +21 -21
  41. package/.claude/checkpoints/1767754460.json +8 -8
  42. package/.claude/commands/agents/agent-spawning.md +28 -28
  43. package/.claude/commands/github/github-modes.md +146 -146
  44. package/.claude/commands/github/github-swarm.md +121 -121
  45. package/.claude/commands/github/issue-tracker.md +291 -291
  46. package/.claude/commands/github/pr-manager.md +169 -169
  47. package/.claude/commands/github/release-manager.md +337 -337
  48. package/.claude/commands/github/repo-architect.md +366 -366
  49. package/.claude/commands/github/sync-coordinator.md +300 -300
  50. package/.claude/commands/memory/neural.md +47 -47
  51. package/.claude/commands/sparc/analyzer.md +51 -51
  52. package/.claude/commands/sparc/architect.md +53 -53
  53. package/.claude/commands/sparc/ask.md +97 -97
  54. package/.claude/commands/sparc/batch-executor.md +54 -54
  55. package/.claude/commands/sparc/code.md +89 -89
  56. package/.claude/commands/sparc/coder.md +54 -54
  57. package/.claude/commands/sparc/debug.md +83 -83
  58. package/.claude/commands/sparc/debugger.md +54 -54
  59. package/.claude/commands/sparc/designer.md +53 -53
  60. package/.claude/commands/sparc/devops.md +109 -109
  61. package/.claude/commands/sparc/docs-writer.md +80 -80
  62. package/.claude/commands/sparc/documenter.md +54 -54
  63. package/.claude/commands/sparc/innovator.md +54 -54
  64. package/.claude/commands/sparc/integration.md +83 -83
  65. package/.claude/commands/sparc/mcp.md +117 -117
  66. package/.claude/commands/sparc/memory-manager.md +54 -54
  67. package/.claude/commands/sparc/optimizer.md +54 -54
  68. package/.claude/commands/sparc/orchestrator.md +131 -131
  69. package/.claude/commands/sparc/post-deployment-monitoring-mode.md +83 -83
  70. package/.claude/commands/sparc/refinement-optimization-mode.md +83 -83
  71. package/.claude/commands/sparc/researcher.md +54 -54
  72. package/.claude/commands/sparc/reviewer.md +54 -54
  73. package/.claude/commands/sparc/security-review.md +80 -80
  74. package/.claude/commands/sparc/sparc-modes.md +174 -174
  75. package/.claude/commands/sparc/sparc.md +111 -111
  76. package/.claude/commands/sparc/spec-pseudocode.md +80 -80
  77. package/.claude/commands/sparc/supabase-admin.md +348 -348
  78. package/.claude/commands/sparc/swarm-coordinator.md +54 -54
  79. package/.claude/commands/sparc/tdd.md +54 -54
  80. package/.claude/commands/sparc/tester.md +54 -54
  81. package/.claude/commands/sparc/tutorial.md +79 -79
  82. package/.claude/commands/sparc/workflow-manager.md +54 -54
  83. package/.claude/commands/sparc.md +166 -166
  84. package/.claude/commands/swarm/analysis.md +95 -95
  85. package/.claude/commands/swarm/development.md +96 -96
  86. package/.claude/commands/swarm/examples.md +168 -168
  87. package/.claude/commands/swarm/maintenance.md +102 -102
  88. package/.claude/commands/swarm/optimization.md +117 -117
  89. package/.claude/commands/swarm/research.md +136 -136
  90. package/.claude/commands/swarm/testing.md +131 -131
  91. package/.claude/commands/workflows/development.md +77 -77
  92. package/.claude/commands/workflows/research.md +62 -62
  93. package/.claude/guidance/moflo-bootstrap.md +126 -126
  94. package/.claude/guidance/shipped/agent-bootstrap.md +126 -126
  95. package/.claude/guidance/shipped/guidance-memory-strategy.md +262 -262
  96. package/.claude/guidance/shipped/memory-strategy.md +204 -204
  97. package/.claude/guidance/shipped/moflo.md +668 -653
  98. package/.claude/guidance/shipped/task-swarm-integration.md +441 -441
  99. package/.claude/helpers/intelligence.cjs +207 -207
  100. package/.claude/helpers/statusline.cjs +851 -851
  101. package/.claude/settings.local.json +18 -0
  102. package/.claude/skills/fl/SKILL.md +583 -583
  103. package/.claude/skills/flo/SKILL.md +583 -583
  104. package/.claude/skills/github-code-review/SKILL.md +1140 -1140
  105. package/.claude/skills/github-multi-repo/SKILL.md +874 -874
  106. package/.claude/skills/github-project-management/SKILL.md +1277 -1277
  107. package/.claude/skills/github-release-management/SKILL.md +1081 -1081
  108. package/.claude/skills/github-workflow-automation/SKILL.md +1065 -1065
  109. package/.claude/skills/hive-mind-advanced/SKILL.md +712 -712
  110. package/.claude/skills/hooks-automation/SKILL.md +1201 -1201
  111. package/.claude/skills/performance-analysis/SKILL.md +563 -563
  112. package/.claude/skills/sparc-methodology/SKILL.md +1115 -1115
  113. package/.claude/skills/swarm-advanced/SKILL.md +973 -973
  114. package/.claude/workflow-state.json +4 -4
  115. package/LICENSE +21 -21
  116. package/README.md +698 -685
  117. package/bin/cli.js +0 -0
  118. package/bin/gate-hook.mjs +50 -50
  119. package/bin/gate.cjs +138 -138
  120. package/bin/generate-code-map.mjs +775 -775
  121. package/bin/hook-handler.cjs +83 -83
  122. package/bin/hooks.mjs +656 -656
  123. package/bin/index-guidance.mjs +892 -892
  124. package/bin/index-tests.mjs +709 -709
  125. package/bin/lib/process-manager.mjs +243 -243
  126. package/bin/lib/registry-cleanup.cjs +41 -41
  127. package/bin/prompt-hook.mjs +72 -72
  128. package/bin/semantic-search.mjs +472 -472
  129. package/bin/session-start-launcher.mjs +238 -238
  130. package/bin/setup-project.mjs +250 -250
  131. package/package.json +123 -123
  132. package/src/@claude-flow/cli/README.md +452 -452
  133. package/src/@claude-flow/cli/bin/cli.js +180 -180
  134. package/src/@claude-flow/cli/bin/preinstall.cjs +2 -2
  135. package/src/@claude-flow/cli/dist/src/commands/completions.js +409 -409
  136. package/src/@claude-flow/cli/dist/src/commands/doctor.js +18 -2
  137. package/src/@claude-flow/cli/dist/src/commands/embeddings.js +25 -25
  138. package/src/@claude-flow/cli/dist/src/commands/github.js +61 -61
  139. package/src/@claude-flow/cli/dist/src/commands/hive-mind.js +90 -90
  140. package/src/@claude-flow/cli/dist/src/commands/hooks.js +9 -9
  141. package/src/@claude-flow/cli/dist/src/commands/init.js +3 -8
  142. package/src/@claude-flow/cli/dist/src/commands/ruvector/import.js +14 -14
  143. package/src/@claude-flow/cli/dist/src/commands/ruvector/setup.js +624 -624
  144. package/src/@claude-flow/cli/dist/src/config/moflo-config.d.ts +3 -0
  145. package/src/@claude-flow/cli/dist/src/config/moflo-config.js +101 -91
  146. package/src/@claude-flow/cli/dist/src/index.d.ts +5 -0
  147. package/src/@claude-flow/cli/dist/src/index.js +44 -0
  148. package/src/@claude-flow/cli/dist/src/init/claudemd-generator.d.ts +29 -29
  149. package/src/@claude-flow/cli/dist/src/init/claudemd-generator.js +43 -43
  150. package/src/@claude-flow/cli/dist/src/init/executor.js +453 -453
  151. package/src/@claude-flow/cli/dist/src/init/helpers-generator.js +482 -482
  152. package/src/@claude-flow/cli/dist/src/init/moflo-init.d.ts +30 -30
  153. package/src/@claude-flow/cli/dist/src/init/moflo-init.js +140 -140
  154. package/src/@claude-flow/cli/dist/src/init/statusline-generator.js +876 -876
  155. package/src/@claude-flow/cli/dist/src/memory/memory-initializer.js +371 -371
  156. package/src/@claude-flow/cli/dist/src/runtime/headless.js +28 -28
  157. package/src/@claude-flow/cli/dist/src/services/container-worker-pool.d.ts +197 -0
  158. package/src/@claude-flow/cli/dist/src/services/container-worker-pool.js +584 -0
  159. package/src/@claude-flow/cli/dist/src/services/daemon-lock.d.ts +14 -0
  160. package/src/@claude-flow/cli/dist/src/services/daemon-lock.js +1 -1
  161. package/src/@claude-flow/cli/dist/src/services/headless-worker-executor.js +84 -84
  162. package/src/@claude-flow/cli/package.json +1 -1
  163. package/src/@claude-flow/guidance/README.md +1195 -1195
  164. package/src/@claude-flow/guidance/package.json +198 -198
  165. package/src/@claude-flow/memory/README.md +587 -587
  166. package/src/@claude-flow/memory/dist/agentdb-backend.js +26 -26
  167. package/src/@claude-flow/memory/dist/auto-memory-bridge.test.js +27 -27
  168. package/src/@claude-flow/memory/dist/hybrid-backend.d.ts +245 -0
  169. package/src/@claude-flow/memory/dist/hybrid-backend.js +569 -0
  170. package/src/@claude-flow/memory/dist/hybrid-backend.test.d.ts +8 -0
  171. package/src/@claude-flow/memory/dist/hybrid-backend.test.js +320 -0
  172. package/src/@claude-flow/memory/dist/sqlite-backend.d.ts +121 -0
  173. package/src/@claude-flow/memory/dist/sqlite-backend.js +572 -0
  174. package/src/@claude-flow/memory/dist/sqljs-backend.js +26 -26
  175. package/src/@claude-flow/memory/package.json +44 -44
  176. package/src/@claude-flow/shared/README.md +323 -323
  177. package/src/@claude-flow/shared/dist/events/event-store.js +31 -31
  178. package/src/README.md +493 -493
@@ -1,262 +1,262 @@
1
- # Guidance & Memory Tuning Strategy
2
-
3
- **Purpose:** How to build and tune a RAG-based guidance system using moflo's semantic search, embedding pipeline, and indexing. Reference when creating guidance documents, troubleshooting search quality, or extending the system.
4
-
5
- ---
6
-
7
- ## Problem Statement
8
-
9
- Claude Code agents need project-specific knowledge — coding rules, architecture patterns, entity templates, testing conventions — delivered at the right moment. Without a retrieval system, agents either miss critical rules or require massive CLAUDE.md files that waste context window tokens.
10
-
11
- **Goals:**
12
- - Agents find relevant guidance automatically via semantic search
13
- - Subagents spawned by the coordinator inherit memory access
14
- - Search quality is high enough that agents don't need to read whole files
15
- - The system survives `npm install` (indexing runs on session start)
16
-
17
- ---
18
-
19
- ## Architecture
20
-
21
- Three layers: embedding generation, vector storage, and search.
22
-
23
- ```
24
- Source Files (.claude/guidance/*.md, docs/*.md)
25
- |
26
- v
27
- index-guidance.mjs --- Chunk on ## headers, build RAG links
28
- | (prev/next, siblings, parent/child, context overlap)
29
- v
30
- .swarm/memory.db ----- SQLite (entries + metadata + embedding vectors)
31
- |
32
- v
33
- build-embeddings.mjs - Generate 384-dim vectors per entry
34
- | (Xenova/all-MiniLM-L6-v2 neural, or domain-aware hash fallback)
35
- v
36
- RuVector (@ruvector/core) -- HNSW index infrastructure
37
- v
38
- Search layer ---------- Three access paths:
39
- 1. MCP tools (mcp__moflo__memory_search) -- preferred
40
- 2. CLI (npx flo memory search) -- fallback
41
- 3. Script (semantic-search.mjs) -- detailed output
42
- ```
43
-
44
- **Key files:**
45
-
46
- | File | Role |
47
- |------|------|
48
- | `.claude/guidance/*.md` | Guidance documents (source of truth) |
49
- | `bin/index-guidance.mjs` | Chunks documents, stores in SQLite with RAG metadata |
50
- | `bin/build-embeddings.mjs` | Generates vector embeddings (neural or hash) |
51
- | `.swarm/memory.db` | SQLite database with entries, metadata, embeddings |
52
- | `@ruvector/core` | HNSW vector index, WASM fallback, SIMD operations |
53
-
54
- ---
55
-
56
- ## Guidance Document Optimization Rules
57
-
58
- These rules determine how well your guidance documents retrieve via semantic search:
59
-
60
- ### 1. Every file needs a Purpose line
61
-
62
- Add `**Purpose:**` as the first meaningful line after the title. Claude checks this first for relevance scoring. Without it, the chunk has no summary signal.
63
-
64
- ### 2. H2 headings are the primary retrieval signal
65
-
66
- The indexer splits on `##`. Each heading becomes the chunk title, prepended to searchable content. Domain-specific keywords in headings dramatically improve recall.
67
-
68
- **Bad:** `## Overview`, `## Rules`, `## Pattern`
69
- **Good:** `## Soft Delete Rules`, `## JWT Authentication Pattern`, `## Database Entity Migration`
70
-
71
- ### 3. Ideal chunk size: 1000-4000 characters
72
-
73
- Below 50 chars the chunk is dropped. Above 6000 the indexer force-splits on paragraphs, which breaks mid-thought. The sweet spot produces focused embeddings.
74
-
75
- ### 4. Self-contained chunks
76
-
77
- Each H2 section must answer a question without needing the rest of the document. Include: the rule, a code example, and a cross-reference.
78
-
79
- ### 5. Tables over prose
80
-
81
- Claude parses structured data more accurately than paragraphs. DO/DON'T tables, field reference tables, and command tables all retrieve better.
82
-
83
- ### 6. Cross-references create a navigation graph
84
-
85
- The RAG indexer stores `prevChunk`/`nextChunk`/`siblings` metadata. Cross-references between documents let Claude follow chains: `core.md -> coding-rules.md -> database.md`.
86
-
87
- ### 7. No decorative formatting
88
-
89
- ASCII boxes, excessive emoji, rhetorical questions, and motivational text all waste tokens without improving retrieval or comprehension.
90
-
91
- ---
92
-
93
- ## Embedding Pipeline
94
-
95
- ### Embedding Models
96
-
97
- | Model | Quality | Speed | When Used |
98
- |-------|---------|-------|-----------|
99
- | `Xenova/all-MiniLM-L6-v2` | High (true semantic) | ~3s for 1000 entries | Primary — `build-embeddings.mjs` uses this |
100
- | `domain-aware-hash-v1` | Good (domain clustering) | <1s for 1000 entries | Fallback when Transformers.js unavailable |
101
-
102
- **Neural embeddings (Xenova/all-MiniLM-L6-v2):**
103
- - Uses `@xenova/transformers` with ONNX WASM runtime
104
- - 384-dimensional vectors, L2-normalized
105
- - True semantic understanding — "soft delete" matches "mark as deleted" without keyword overlap
106
- - Loaded lazily on first use, cached for subsequent queries
107
- - Ships with moflo; no additional install needed
108
-
109
- **Domain-aware hash embeddings (fallback):**
110
- - Custom SimHash-style algorithm with 12 domain clusters
111
- - Domain clusters group related terms: `database` (orm, postgresql, entity, schema...), `frontend` (react, component, css...), `testing` (vitest, mock, expect...), etc.
112
- - Multi-position hashing with bigram/trigram features
113
- - Good at keyword-level matching but misses semantic paraphrases
114
- - No external dependencies — always available
115
-
116
- ### The Embedding Alignment Problem
117
-
118
- **Critical rule:** Query embeddings MUST match stored embeddings. Computing cosine similarity between vectors from different models produces meaningless scores.
119
-
120
- Both the search scripts and the MCP memory tools auto-detect the stored embedding model:
121
-
122
- ```javascript
123
- // Check what model stored entries predominantly use
124
- const modelCheck = db.prepare(
125
- `SELECT embedding_model, COUNT(*) as cnt FROM memory_entries
126
- WHERE status = 'active' AND embedding IS NOT NULL
127
- GROUP BY embedding_model ORDER BY cnt DESC LIMIT 1`
128
- ).get();
129
-
130
- // If stored embeddings are neural, use neural for query too
131
- ```
132
-
133
- Search also **filters out entries with mismatched `embedding_model`** — if the query uses neural embeddings, hash-embedded entries are skipped (and vice versa).
134
-
135
- ### Domain Cluster Tuning
136
-
137
- The hash fallback's domain clusters can be extended with project-specific terms. Add terms to the relevant cluster in the hash embedding function to improve keyword-level matching for your domain:
138
-
139
- | Cluster | Example Terms |
140
- |---------|--------------|
141
- | `database` | your ORM, database engine, schema terms |
142
- | `frontend` | UI framework, component library terms |
143
- | `backend` | DI container, API framework terms |
144
- | `testing` | test framework, assertion library terms |
145
- | `security` | auth system, permission model terms |
146
-
147
- ---
148
-
149
- ## RAG Indexing Pipeline
150
-
151
- ### How `index-guidance.mjs` Works
152
-
153
- 1. **Scan** configured directories for `.md` files
154
- 2. **Hash check** — Skip files whose content hash hasn't changed (unless `--force`)
155
- 3. **Store full document** as `doc-{prefix}-{name}` (for complete retrieval)
156
- 4. **Chunk on `##` headers** — Each H2 section becomes a separate entry
157
- 5. **H3 subsections** become child chunks with parent H2 as context prefix
158
- 6. **Force-split** sections over 4000 chars on paragraph boundaries
159
- 7. **Build RAG metadata** for every chunk:
160
-
161
- | Metadata Field | Purpose |
162
- |---------------|---------|
163
- | `parentDoc` | Link back to full document |
164
- | `prevChunk` / `nextChunk` | Sequential navigation |
165
- | `siblings` | All chunk keys from same document |
166
- | `hierarchicalParent` / `hierarchicalChildren` | H2->H3 relationships |
167
- | `contextBefore` / `contextAfter` | 20% overlapping text from adjacent chunks |
168
-
169
- 8. **Prepend context** — Each chunk's searchable content includes overlap from neighbors
170
- 9. **Stale cleanup** — After indexing, remove entries for files that no longer exist on disk
171
- 10. **Background embedding** — Spawn `build-embeddings.mjs` in background to generate vectors
172
-
173
- ### Configuring Indexed Directories
174
-
175
- In `moflo.yaml`:
176
-
177
- ```yaml
178
- guidance:
179
- directories:
180
- - .claude/guidance
181
- - docs/guides
182
- ```
183
-
184
- Default directories (when no config): `.claude/guidance`, `docs/guides`
185
-
186
- Moflo also automatically indexes its own bundled guidance from `node_modules/moflo/.claude/guidance/` when installed as a library in a consumer project.
187
-
188
- ---
189
-
190
- ## Lessons Learned
191
-
192
- ### Document Optimization
193
-
194
- 1. **`**Purpose:**` lines are critical** — They're the single highest-impact addition for retrieval quality.
195
- 2. **Headings are embeddings** — In a chunk-per-section system, the heading IS the embedding's primary signal. Generic headings are nearly useless.
196
- 3. **Tables retrieve better than prose** — Claude parses structured data with higher accuracy.
197
- 4. **Cross-references are the RAG graph** — Isolated documents can't be navigated.
198
- 5. **Chunk size matters** — A 10,000-char section produces a diluted embedding. Splitting into focused sections triples the chance of matching specific queries.
199
-
200
- ### Embedding Pipeline
201
-
202
- 6. **Query embeddings MUST match stored embeddings** — This is the single most critical rule. Auto-detect and match.
203
- 7. **Domain clusters need project-specific terms** — Generic NLP clusters miss project-specific terminology. Adding terms to domain clusters dramatically improves keyword-level matching.
204
- 8. **Filter mismatched entries during search** — Mixed databases need explicit filtering by `embedding_model`.
205
-
206
- ---
207
-
208
- ## Replication Guide
209
-
210
- To set up this system in a new project using moflo:
211
-
212
- ### 1. Install Moflo
213
-
214
- ```bash
215
- npm install moflo
216
- npx flo init
217
- ```
218
-
219
- ### 2. Create Guidance Documents
220
-
221
- Create `.claude/guidance/` directory with markdown files following the optimization rules above:
222
- - Every file has `**Purpose:**` line
223
- - H2 sections with domain keywords in headings
224
- - Tables for structured rules
225
- - Cross-references between related docs
226
- - 1000-4000 char sections
227
-
228
- ### 3. Configure Indexing
229
-
230
- In `moflo.yaml`:
231
-
232
- ```yaml
233
- guidance:
234
- directories:
235
- - .claude/guidance
236
- - docs/guides
237
-
238
- auto_index:
239
- guidance: true
240
- code_map: true
241
- ```
242
-
243
- ### 4. Index and Verify
244
-
245
- ```bash
246
- # Index documents
247
- npx flo-index --force
248
-
249
- # Test search quality
250
- npx flo memory search --query "your domain query" --namespace guidance
251
-
252
- # Verify from Claude Code via MCP
253
- # mcp__moflo__memory_search query="your domain query" namespace="guidance"
254
- ```
255
-
256
- ---
257
-
258
- ## See Also
259
-
260
- - `.claude/guidance/memory-strategy.md` - Memory architecture and search commands
261
- - `.claude/guidance/agent-bootstrap.md` - Subagent bootstrap guide
262
- - `.claude/guidance/moflo.md` - Full CLI/MCP reference
1
+ # Guidance & Memory Tuning Strategy
2
+
3
+ **Purpose:** How to build and tune a RAG-based guidance system using moflo's semantic search, embedding pipeline, and indexing. Reference when creating guidance documents, troubleshooting search quality, or extending the system.
4
+
5
+ ---
6
+
7
+ ## Problem Statement
8
+
9
+ Claude Code agents need project-specific knowledge — coding rules, architecture patterns, entity templates, testing conventions — delivered at the right moment. Without a retrieval system, agents either miss critical rules or require massive CLAUDE.md files that waste context window tokens.
10
+
11
+ **Goals:**
12
+ - Agents find relevant guidance automatically via semantic search
13
+ - Subagents spawned by the coordinator inherit memory access
14
+ - Search quality is high enough that agents don't need to read whole files
15
+ - The system survives `npm install` (indexing runs on session start)
16
+
17
+ ---
18
+
19
+ ## Architecture
20
+
21
+ Three layers: embedding generation, vector storage, and search.
22
+
23
+ ```
24
+ Source Files (.claude/guidance/*.md, docs/*.md)
25
+ |
26
+ v
27
+ index-guidance.mjs --- Chunk on ## headers, build RAG links
28
+ | (prev/next, siblings, parent/child, context overlap)
29
+ v
30
+ .swarm/memory.db ----- SQLite (entries + metadata + embedding vectors)
31
+ |
32
+ v
33
+ build-embeddings.mjs - Generate 384-dim vectors per entry
34
+ | (Xenova/all-MiniLM-L6-v2 neural, or domain-aware hash fallback)
35
+ v
36
+ RuVector (@ruvector/core) -- HNSW index infrastructure
37
+ v
38
+ Search layer ---------- Three access paths:
39
+ 1. MCP tools (mcp__moflo__memory_search) -- preferred
40
+ 2. CLI (npx flo memory search) -- fallback
41
+ 3. Script (semantic-search.mjs) -- detailed output
42
+ ```
43
+
44
+ **Key files:**
45
+
46
+ | File | Role |
47
+ |------|------|
48
+ | `.claude/guidance/*.md` | Guidance documents (source of truth) |
49
+ | `bin/index-guidance.mjs` | Chunks documents, stores in SQLite with RAG metadata |
50
+ | `bin/build-embeddings.mjs` | Generates vector embeddings (neural or hash) |
51
+ | `.swarm/memory.db` | SQLite database with entries, metadata, embeddings |
52
+ | `@ruvector/core` | HNSW vector index, WASM fallback, SIMD operations |
53
+
54
+ ---
55
+
56
+ ## Guidance Document Optimization Rules
57
+
58
+ These rules determine how well your guidance documents retrieve via semantic search:
59
+
60
+ ### 1. Every file needs a Purpose line
61
+
62
+ Add `**Purpose:**` as the first meaningful line after the title. Claude checks this first for relevance scoring. Without it, the chunk has no summary signal.
63
+
64
+ ### 2. H2 headings are the primary retrieval signal
65
+
66
+ The indexer splits on `##`. Each heading becomes the chunk title, prepended to searchable content. Domain-specific keywords in headings dramatically improve recall.
67
+
68
+ **Bad:** `## Overview`, `## Rules`, `## Pattern`
69
+ **Good:** `## Soft Delete Rules`, `## JWT Authentication Pattern`, `## Database Entity Migration`
70
+
71
+ ### 3. Ideal chunk size: 1000-4000 characters
72
+
73
+ Below 50 chars the chunk is dropped. Above 6000 the indexer force-splits on paragraphs, which breaks mid-thought. The sweet spot produces focused embeddings.
74
+
75
+ ### 4. Self-contained chunks
76
+
77
+ Each H2 section must answer a question without needing the rest of the document. Include: the rule, a code example, and a cross-reference.
78
+
79
+ ### 5. Tables over prose
80
+
81
+ Claude parses structured data more accurately than paragraphs. DO/DON'T tables, field reference tables, and command tables all retrieve better.
82
+
83
+ ### 6. Cross-references create a navigation graph
84
+
85
+ The RAG indexer stores `prevChunk`/`nextChunk`/`siblings` metadata. Cross-references between documents let Claude follow chains: `core.md -> coding-rules.md -> database.md`.
86
+
87
+ ### 7. No decorative formatting
88
+
89
+ ASCII boxes, excessive emoji, rhetorical questions, and motivational text all waste tokens without improving retrieval or comprehension.
90
+
91
+ ---
92
+
93
+ ## Embedding Pipeline
94
+
95
+ ### Embedding Models
96
+
97
+ | Model | Quality | Speed | When Used |
98
+ |-------|---------|-------|-----------|
99
+ | `Xenova/all-MiniLM-L6-v2` | High (true semantic) | ~3s for 1000 entries | Primary — `build-embeddings.mjs` uses this |
100
+ | `domain-aware-hash-v1` | Good (domain clustering) | <1s for 1000 entries | Fallback when Transformers.js unavailable |
101
+
102
+ **Neural embeddings (Xenova/all-MiniLM-L6-v2):**
103
+ - Uses `@xenova/transformers` with ONNX WASM runtime
104
+ - 384-dimensional vectors, L2-normalized
105
+ - True semantic understanding — "soft delete" matches "mark as deleted" without keyword overlap
106
+ - Loaded lazily on first use, cached for subsequent queries
107
+ - Ships with moflo; no additional install needed
108
+
109
+ **Domain-aware hash embeddings (fallback):**
110
+ - Custom SimHash-style algorithm with 12 domain clusters
111
+ - Domain clusters group related terms: `database` (orm, postgresql, entity, schema...), `frontend` (react, component, css...), `testing` (vitest, mock, expect...), etc.
112
+ - Multi-position hashing with bigram/trigram features
113
+ - Good at keyword-level matching but misses semantic paraphrases
114
+ - No external dependencies — always available
115
+
116
+ ### The Embedding Alignment Problem
117
+
118
+ **Critical rule:** Query embeddings MUST match stored embeddings. Computing cosine similarity between vectors from different models produces meaningless scores.
119
+
120
+ Both the search scripts and the MCP memory tools auto-detect the stored embedding model:
121
+
122
+ ```javascript
123
+ // Check what model stored entries predominantly use
124
+ const modelCheck = db.prepare(
125
+ `SELECT embedding_model, COUNT(*) as cnt FROM memory_entries
126
+ WHERE status = 'active' AND embedding IS NOT NULL
127
+ GROUP BY embedding_model ORDER BY cnt DESC LIMIT 1`
128
+ ).get();
129
+
130
+ // If stored embeddings are neural, use neural for query too
131
+ ```
132
+
133
+ Search also **filters out entries with mismatched `embedding_model`** — if the query uses neural embeddings, hash-embedded entries are skipped (and vice versa).
134
+
135
+ ### Domain Cluster Tuning
136
+
137
+ The hash fallback's domain clusters can be extended with project-specific terms. Add terms to the relevant cluster in the hash embedding function to improve keyword-level matching for your domain:
138
+
139
+ | Cluster | Example Terms |
140
+ |---------|--------------|
141
+ | `database` | your ORM, database engine, schema terms |
142
+ | `frontend` | UI framework, component library terms |
143
+ | `backend` | DI container, API framework terms |
144
+ | `testing` | test framework, assertion library terms |
145
+ | `security` | auth system, permission model terms |
146
+
147
+ ---
148
+
149
+ ## RAG Indexing Pipeline
150
+
151
+ ### How `index-guidance.mjs` Works
152
+
153
+ 1. **Scan** configured directories for `.md` files
154
+ 2. **Hash check** — Skip files whose content hash hasn't changed (unless `--force`)
155
+ 3. **Store full document** as `doc-{prefix}-{name}` (for complete retrieval)
156
+ 4. **Chunk on `##` headers** — Each H2 section becomes a separate entry
157
+ 5. **H3 subsections** become child chunks with parent H2 as context prefix
158
+ 6. **Force-split** sections over 4000 chars on paragraph boundaries
159
+ 7. **Build RAG metadata** for every chunk:
160
+
161
+ | Metadata Field | Purpose |
162
+ |---------------|---------|
163
+ | `parentDoc` | Link back to full document |
164
+ | `prevChunk` / `nextChunk` | Sequential navigation |
165
+ | `siblings` | All chunk keys from same document |
166
+ | `hierarchicalParent` / `hierarchicalChildren` | H2->H3 relationships |
167
+ | `contextBefore` / `contextAfter` | 20% overlapping text from adjacent chunks |
168
+
169
+ 8. **Prepend context** — Each chunk's searchable content includes overlap from neighbors
170
+ 9. **Stale cleanup** — After indexing, remove entries for files that no longer exist on disk
171
+ 10. **Background embedding** — Spawn `build-embeddings.mjs` in background to generate vectors
172
+
173
+ ### Configuring Indexed Directories
174
+
175
+ In `moflo.yaml`:
176
+
177
+ ```yaml
178
+ guidance:
179
+ directories:
180
+ - .claude/guidance
181
+ - docs/guides
182
+ ```
183
+
184
+ Default directories (when no config): `.claude/guidance`, `docs/guides`
185
+
186
+ Moflo also automatically indexes its own bundled guidance from `node_modules/moflo/.claude/guidance/` when installed as a library in a consumer project.
187
+
188
+ ---
189
+
190
+ ## Lessons Learned
191
+
192
+ ### Document Optimization
193
+
194
+ 1. **`**Purpose:**` lines are critical** — They're the single highest-impact addition for retrieval quality.
195
+ 2. **Headings are embeddings** — In a chunk-per-section system, the heading IS the embedding's primary signal. Generic headings are nearly useless.
196
+ 3. **Tables retrieve better than prose** — Claude parses structured data with higher accuracy.
197
+ 4. **Cross-references are the RAG graph** — Isolated documents can't be navigated.
198
+ 5. **Chunk size matters** — A 10,000-char section produces a diluted embedding. Splitting into focused sections triples the chance of matching specific queries.
199
+
200
+ ### Embedding Pipeline
201
+
202
+ 6. **Query embeddings MUST match stored embeddings** — This is the single most critical rule. Auto-detect and match.
203
+ 7. **Domain clusters need project-specific terms** — Generic NLP clusters miss project-specific terminology. Adding terms to domain clusters dramatically improves keyword-level matching.
204
+ 8. **Filter mismatched entries during search** — Mixed databases need explicit filtering by `embedding_model`.
205
+
206
+ ---
207
+
208
+ ## Replication Guide
209
+
210
+ To set up this system in a new project using moflo:
211
+
212
+ ### 1. Install Moflo
213
+
214
+ ```bash
215
+ npm install moflo
216
+ npx flo init
217
+ ```
218
+
219
+ ### 2. Create Guidance Documents
220
+
221
+ Create `.claude/guidance/` directory with markdown files following the optimization rules above:
222
+ - Every file has `**Purpose:**` line
223
+ - H2 sections with domain keywords in headings
224
+ - Tables for structured rules
225
+ - Cross-references between related docs
226
+ - 1000-4000 char sections
227
+
228
+ ### 3. Configure Indexing
229
+
230
+ In `moflo.yaml`:
231
+
232
+ ```yaml
233
+ guidance:
234
+ directories:
235
+ - .claude/guidance
236
+ - docs/guides
237
+
238
+ auto_index:
239
+ guidance: true
240
+ code_map: true
241
+ ```
242
+
243
+ ### 4. Index and Verify
244
+
245
+ ```bash
246
+ # Index documents
247
+ npx flo-index --force
248
+
249
+ # Test search quality
250
+ npx flo memory search --query "your domain query" --namespace guidance
251
+
252
+ # Verify from Claude Code via MCP
253
+ # mcp__moflo__memory_search query="your domain query" namespace="guidance"
254
+ ```
255
+
256
+ ---
257
+
258
+ ## See Also
259
+
260
+ - `.claude/guidance/memory-strategy.md` - Memory architecture and search commands
261
+ - `.claude/guidance/agent-bootstrap.md` - Subagent bootstrap guide
262
+ - `.claude/guidance/moflo.md` - Full CLI/MCP reference
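
The embedding alignment rule in the guidance file above (embed the query with the same model as the stored entries, and skip entries whose `embedding_model` differs) can be sketched as follows. This is a minimal illustration under stated assumptions, not moflo's implementation: the in-memory `entries` array stands in for rows of `memory_entries`, and `pickDominantModel`, `cosine`, and `search` are hypothetical helper names. The three-element vectors are toy values; the real pipeline is described as producing 384-dimensional vectors.

```javascript
// Hypothetical in-memory stand-in for rows of the memory_entries table.
const entries = [
  { key: 'a', embedding_model: 'Xenova/all-MiniLM-L6-v2', embedding: [1, 0, 0] },
  { key: 'b', embedding_model: 'Xenova/all-MiniLM-L6-v2', embedding: [0.6, 0.8, 0] },
  { key: 'c', embedding_model: 'domain-aware-hash-v1', embedding: [1, 0, 0] },
];

// Mirrors the GROUP BY / ORDER BY cnt DESC LIMIT 1 query in the guidance file:
// return the model used by the largest number of embedded entries.
function pickDominantModel(rows) {
  const counts = new Map();
  for (const row of rows) {
    if (!row.embedding) continue; // same effect as "embedding IS NOT NULL"
    counts.set(row.embedding_model, (counts.get(row.embedding_model) || 0) + 1);
  }
  let best = null;
  let bestCount = 0;
  for (const [model, count] of counts) {
    if (count > bestCount) {
      best = model;
      bestCount = count;
    }
  }
  return best;
}

// Cosine similarity between two vectors of equal length.
function cosine(a, b) {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / Math.sqrt(normA * normB);
}

// Score only entries embedded with the same model as the query vector;
// entries from any other model are skipped rather than compared.
function search(rows, queryVector, queryModel, limit = 5) {
  return rows
    .filter((row) => row.embedding && row.embedding_model === queryModel)
    .map((row) => ({ key: row.key, score: cosine(queryVector, row.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, limit);
}

const model = pickDominantModel(entries);
// Assumption: the query vector was produced by the detected model.
const results = search(entries, [1, 0, 0], model);
console.log(model, results);
```

In this toy data the neural model is dominant, so entry `c` is skipped even though its vector is identical to the query: a similarity score across models would not be meaningful. Entry `a` ranks above `b`.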