moflo 4.8.21 → 4.8.23
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude/agents/browser/browser-agent.yaml +182 -182
- package/.claude/agents/core/coder.md +265 -265
- package/.claude/agents/core/planner.md +167 -167
- package/.claude/agents/core/researcher.md +189 -189
- package/.claude/agents/core/reviewer.md +325 -325
- package/.claude/agents/core/tester.md +318 -318
- package/.claude/agents/database-specialist.yaml +21 -21
- package/.claude/agents/dual-mode/codex-coordinator.md +224 -224
- package/.claude/agents/dual-mode/codex-worker.md +211 -211
- package/.claude/agents/dual-mode/dual-orchestrator.md +291 -291
- package/.claude/agents/github/code-review-swarm.md +537 -537
- package/.claude/agents/github/github-modes.md +172 -172
- package/.claude/agents/github/issue-tracker.md +318 -318
- package/.claude/agents/github/multi-repo-swarm.md +552 -552
- package/.claude/agents/github/pr-manager.md +190 -190
- package/.claude/agents/github/project-board-sync.md +508 -508
- package/.claude/agents/github/release-manager.md +366 -366
- package/.claude/agents/github/release-swarm.md +582 -582
- package/.claude/agents/github/repo-architect.md +397 -397
- package/.claude/agents/github/swarm-issue.md +572 -572
- package/.claude/agents/github/swarm-pr.md +427 -427
- package/.claude/agents/github/sync-coordinator.md +451 -451
- package/.claude/agents/github/workflow-automation.md +634 -634
- package/.claude/agents/goal/code-goal-planner.md +445 -445
- package/.claude/agents/hive-mind/collective-intelligence-coordinator.md +129 -129
- package/.claude/agents/hive-mind/queen-coordinator.md +202 -202
- package/.claude/agents/hive-mind/scout-explorer.md +241 -241
- package/.claude/agents/hive-mind/swarm-memory-manager.md +192 -192
- package/.claude/agents/hive-mind/worker-specialist.md +216 -216
- package/.claude/agents/index.yaml +17 -17
- package/.claude/agents/neural/safla-neural.md +73 -73
- package/.claude/agents/project-coordinator.yaml +15 -15
- package/.claude/agents/python-specialist.yaml +21 -21
- package/.claude/agents/reasoning/goal-planner.md +72 -72
- package/.claude/agents/security-auditor.yaml +20 -20
- package/.claude/agents/swarm/adaptive-coordinator.md +395 -395
- package/.claude/agents/swarm/hierarchical-coordinator.md +326 -326
- package/.claude/agents/swarm/mesh-coordinator.md +391 -391
- package/.claude/agents/templates/migration-plan.md +745 -745
- package/.claude/agents/typescript-specialist.yaml +21 -21
- package/.claude/checkpoints/1767754460.json +8 -8
- package/.claude/commands/agents/agent-spawning.md +28 -28
- package/.claude/commands/github/github-modes.md +146 -146
- package/.claude/commands/github/github-swarm.md +121 -121
- package/.claude/commands/github/issue-tracker.md +291 -291
- package/.claude/commands/github/pr-manager.md +169 -169
- package/.claude/commands/github/release-manager.md +337 -337
- package/.claude/commands/github/repo-architect.md +366 -366
- package/.claude/commands/github/sync-coordinator.md +300 -300
- package/.claude/commands/memory/neural.md +47 -47
- package/.claude/commands/sparc/analyzer.md +51 -51
- package/.claude/commands/sparc/architect.md +53 -53
- package/.claude/commands/sparc/ask.md +97 -97
- package/.claude/commands/sparc/batch-executor.md +54 -54
- package/.claude/commands/sparc/code.md +89 -89
- package/.claude/commands/sparc/coder.md +54 -54
- package/.claude/commands/sparc/debug.md +83 -83
- package/.claude/commands/sparc/debugger.md +54 -54
- package/.claude/commands/sparc/designer.md +53 -53
- package/.claude/commands/sparc/devops.md +109 -109
- package/.claude/commands/sparc/docs-writer.md +80 -80
- package/.claude/commands/sparc/documenter.md +54 -54
- package/.claude/commands/sparc/innovator.md +54 -54
- package/.claude/commands/sparc/integration.md +83 -83
- package/.claude/commands/sparc/mcp.md +117 -117
- package/.claude/commands/sparc/memory-manager.md +54 -54
- package/.claude/commands/sparc/optimizer.md +54 -54
- package/.claude/commands/sparc/orchestrator.md +131 -131
- package/.claude/commands/sparc/post-deployment-monitoring-mode.md +83 -83
- package/.claude/commands/sparc/refinement-optimization-mode.md +83 -83
- package/.claude/commands/sparc/researcher.md +54 -54
- package/.claude/commands/sparc/reviewer.md +54 -54
- package/.claude/commands/sparc/security-review.md +80 -80
- package/.claude/commands/sparc/sparc-modes.md +174 -174
- package/.claude/commands/sparc/sparc.md +111 -111
- package/.claude/commands/sparc/spec-pseudocode.md +80 -80
- package/.claude/commands/sparc/supabase-admin.md +348 -348
- package/.claude/commands/sparc/swarm-coordinator.md +54 -54
- package/.claude/commands/sparc/tdd.md +54 -54
- package/.claude/commands/sparc/tester.md +54 -54
- package/.claude/commands/sparc/tutorial.md +79 -79
- package/.claude/commands/sparc/workflow-manager.md +54 -54
- package/.claude/commands/sparc.md +166 -166
- package/.claude/commands/swarm/analysis.md +95 -95
- package/.claude/commands/swarm/development.md +96 -96
- package/.claude/commands/swarm/examples.md +168 -168
- package/.claude/commands/swarm/maintenance.md +102 -102
- package/.claude/commands/swarm/optimization.md +117 -117
- package/.claude/commands/swarm/research.md +136 -136
- package/.claude/commands/swarm/testing.md +131 -131
- package/.claude/commands/workflows/development.md +77 -77
- package/.claude/commands/workflows/research.md +62 -62
- package/.claude/guidance/moflo-bootstrap.md +126 -126
- package/.claude/guidance/shipped/agent-bootstrap.md +126 -126
- package/.claude/guidance/shipped/guidance-memory-strategy.md +262 -262
- package/.claude/guidance/shipped/memory-strategy.md +204 -204
- package/.claude/guidance/shipped/moflo.md +668 -653
- package/.claude/guidance/shipped/task-swarm-integration.md +441 -441
- package/.claude/helpers/intelligence.cjs +207 -207
- package/.claude/helpers/statusline.cjs +851 -851
- package/.claude/settings.local.json +18 -0
- package/.claude/skills/fl/SKILL.md +583 -583
- package/.claude/skills/flo/SKILL.md +583 -583
- package/.claude/skills/github-code-review/SKILL.md +1140 -1140
- package/.claude/skills/github-multi-repo/SKILL.md +874 -874
- package/.claude/skills/github-project-management/SKILL.md +1277 -1277
- package/.claude/skills/github-release-management/SKILL.md +1081 -1081
- package/.claude/skills/github-workflow-automation/SKILL.md +1065 -1065
- package/.claude/skills/hive-mind-advanced/SKILL.md +712 -712
- package/.claude/skills/hooks-automation/SKILL.md +1201 -1201
- package/.claude/skills/performance-analysis/SKILL.md +563 -563
- package/.claude/skills/sparc-methodology/SKILL.md +1115 -1115
- package/.claude/skills/swarm-advanced/SKILL.md +973 -973
- package/.claude/workflow-state.json +4 -4
- package/LICENSE +21 -21
- package/README.md +698 -685
- package/bin/cli.js +0 -0
- package/bin/gate-hook.mjs +50 -50
- package/bin/gate.cjs +138 -138
- package/bin/generate-code-map.mjs +775 -775
- package/bin/hook-handler.cjs +83 -83
- package/bin/hooks.mjs +656 -656
- package/bin/index-guidance.mjs +892 -892
- package/bin/index-tests.mjs +709 -709
- package/bin/lib/process-manager.mjs +243 -243
- package/bin/lib/registry-cleanup.cjs +41 -41
- package/bin/prompt-hook.mjs +72 -72
- package/bin/semantic-search.mjs +472 -472
- package/bin/session-start-launcher.mjs +238 -238
- package/bin/setup-project.mjs +250 -250
- package/package.json +123 -123
- package/src/@claude-flow/cli/README.md +452 -452
- package/src/@claude-flow/cli/bin/cli.js +180 -180
- package/src/@claude-flow/cli/bin/preinstall.cjs +2 -2
- package/src/@claude-flow/cli/dist/src/commands/completions.js +409 -409
- package/src/@claude-flow/cli/dist/src/commands/doctor.js +18 -2
- package/src/@claude-flow/cli/dist/src/commands/embeddings.js +25 -25
- package/src/@claude-flow/cli/dist/src/commands/github.js +61 -61
- package/src/@claude-flow/cli/dist/src/commands/hive-mind.js +90 -90
- package/src/@claude-flow/cli/dist/src/commands/hooks.js +9 -9
- package/src/@claude-flow/cli/dist/src/commands/init.js +3 -8
- package/src/@claude-flow/cli/dist/src/commands/ruvector/import.js +14 -14
- package/src/@claude-flow/cli/dist/src/commands/ruvector/setup.js +624 -624
- package/src/@claude-flow/cli/dist/src/config/moflo-config.d.ts +3 -0
- package/src/@claude-flow/cli/dist/src/config/moflo-config.js +101 -91
- package/src/@claude-flow/cli/dist/src/index.d.ts +5 -0
- package/src/@claude-flow/cli/dist/src/index.js +44 -0
- package/src/@claude-flow/cli/dist/src/init/claudemd-generator.d.ts +29 -29
- package/src/@claude-flow/cli/dist/src/init/claudemd-generator.js +43 -43
- package/src/@claude-flow/cli/dist/src/init/executor.js +453 -453
- package/src/@claude-flow/cli/dist/src/init/helpers-generator.js +482 -482
- package/src/@claude-flow/cli/dist/src/init/moflo-init.d.ts +30 -30
- package/src/@claude-flow/cli/dist/src/init/moflo-init.js +140 -140
- package/src/@claude-flow/cli/dist/src/init/statusline-generator.js +876 -876
- package/src/@claude-flow/cli/dist/src/memory/memory-initializer.js +371 -371
- package/src/@claude-flow/cli/dist/src/runtime/headless.js +28 -28
- package/src/@claude-flow/cli/dist/src/services/container-worker-pool.d.ts +197 -0
- package/src/@claude-flow/cli/dist/src/services/container-worker-pool.js +584 -0
- package/src/@claude-flow/cli/dist/src/services/daemon-lock.d.ts +14 -0
- package/src/@claude-flow/cli/dist/src/services/daemon-lock.js +1 -1
- package/src/@claude-flow/cli/dist/src/services/headless-worker-executor.js +84 -84
- package/src/@claude-flow/cli/package.json +1 -1
- package/src/@claude-flow/guidance/README.md +1195 -1195
- package/src/@claude-flow/guidance/package.json +198 -198
- package/src/@claude-flow/memory/README.md +587 -587
- package/src/@claude-flow/memory/dist/agentdb-backend.js +26 -26
- package/src/@claude-flow/memory/dist/auto-memory-bridge.test.js +27 -27
- package/src/@claude-flow/memory/dist/hybrid-backend.d.ts +245 -0
- package/src/@claude-flow/memory/dist/hybrid-backend.js +569 -0
- package/src/@claude-flow/memory/dist/hybrid-backend.test.d.ts +8 -0
- package/src/@claude-flow/memory/dist/hybrid-backend.test.js +320 -0
- package/src/@claude-flow/memory/dist/sqlite-backend.d.ts +121 -0
- package/src/@claude-flow/memory/dist/sqlite-backend.js +572 -0
- package/src/@claude-flow/memory/dist/sqljs-backend.js +26 -26
- package/src/@claude-flow/memory/package.json +44 -44
- package/src/@claude-flow/shared/README.md +323 -323
- package/src/@claude-flow/shared/dist/events/event-store.js +31 -31
- package/src/README.md +493 -493
|
@@ -1,262 +1,262 @@
|
|
|
1
|
-
# Guidance & Memory Tuning Strategy
|
|
2
|
-
|
|
3
|
-
**Purpose:** How to build and tune a RAG-based guidance system using moflo's semantic search, embedding pipeline, and indexing. Reference when creating guidance documents, troubleshooting search quality, or extending the system.
|
|
4
|
-
|
|
5
|
-
---
|
|
6
|
-
|
|
7
|
-
## Problem Statement
|
|
8
|
-
|
|
9
|
-
Claude Code agents need project-specific knowledge — coding rules, architecture patterns, entity templates, testing conventions — delivered at the right moment. Without a retrieval system, agents either miss critical rules or require massive CLAUDE.md files that waste context window tokens.
|
|
10
|
-
|
|
11
|
-
**Goals:**
|
|
12
|
-
- Agents find relevant guidance automatically via semantic search
|
|
13
|
-
- Subagents spawned by the coordinator inherit memory access
|
|
14
|
-
- Search quality is high enough that agents don't need to read whole files
|
|
15
|
-
- The system survives `npm install` (indexing runs on session start)
|
|
16
|
-
|
|
17
|
-
---
|
|
18
|
-
|
|
19
|
-
## Architecture
|
|
20
|
-
|
|
21
|
-
Three layers: embedding generation, vector storage, and search.
|
|
22
|
-
|
|
23
|
-
```
|
|
24
|
-
Source Files (.claude/guidance/*.md, docs/*.md)
|
|
25
|
-
|
|
|
26
|
-
v
|
|
27
|
-
index-guidance.mjs --- Chunk on ## headers, build RAG links
|
|
28
|
-
| (prev/next, siblings, parent/child, context overlap)
|
|
29
|
-
v
|
|
30
|
-
.swarm/memory.db ----- SQLite (entries + metadata + embedding vectors)
|
|
31
|
-
|
|
|
32
|
-
v
|
|
33
|
-
build-embeddings.mjs - Generate 384-dim vectors per entry
|
|
34
|
-
| (Xenova/all-MiniLM-L6-v2 neural, or domain-aware hash fallback)
|
|
35
|
-
v
|
|
36
|
-
RuVector (@ruvector/core) -- HNSW index infrastructure
|
|
37
|
-
v
|
|
38
|
-
Search layer ---------- Three access paths:
|
|
39
|
-
1. MCP tools (mcp__moflo__memory_search) -- preferred
|
|
40
|
-
2. CLI (npx flo memory search) -- fallback
|
|
41
|
-
3. Script (semantic-search.mjs) -- detailed output
|
|
42
|
-
```
|
|
43
|
-
|
|
44
|
-
**Key files:**
|
|
45
|
-
|
|
46
|
-
| File | Role |
|
|
47
|
-
|------|------|
|
|
48
|
-
| `.claude/guidance/*.md` | Guidance documents (source of truth) |
|
|
49
|
-
| `bin/index-guidance.mjs` | Chunks documents, stores in SQLite with RAG metadata |
|
|
50
|
-
| `bin/build-embeddings.mjs` | Generates vector embeddings (neural or hash) |
|
|
51
|
-
| `.swarm/memory.db` | SQLite database with entries, metadata, embeddings |
|
|
52
|
-
| `@ruvector/core` | HNSW vector index, WASM fallback, SIMD operations |
|
|
53
|
-
|
|
54
|
-
---
|
|
55
|
-
|
|
56
|
-
## Guidance Document Optimization Rules
|
|
57
|
-
|
|
58
|
-
These rules determine how well your guidance documents retrieve via semantic search:
|
|
59
|
-
|
|
60
|
-
### 1. Every file needs a Purpose line
|
|
61
|
-
|
|
62
|
-
Add `**Purpose:**` as the first meaningful line after the title. Claude checks this first for relevance scoring. Without it, the chunk has no summary signal.
|
|
63
|
-
|
|
64
|
-
### 2. H2 headings are the primary retrieval signal
|
|
65
|
-
|
|
66
|
-
The indexer splits on `##`. Each heading becomes the chunk title, prepended to searchable content. Domain-specific keywords in headings dramatically improve recall.
|
|
67
|
-
|
|
68
|
-
**Bad:** `## Overview`, `## Rules`, `## Pattern`
|
|
69
|
-
**Good:** `## Soft Delete Rules`, `## JWT Authentication Pattern`, `## Database Entity Migration`
|
|
70
|
-
|
|
71
|
-
### 3. Ideal chunk size: 1000-4000 characters
|
|
72
|
-
|
|
73
|
-
Below 50 chars the chunk is dropped. Above 6000 the indexer force-splits on paragraphs, which breaks mid-thought. The sweet spot produces focused embeddings.
|
|
74
|
-
|
|
75
|
-
### 4. Self-contained chunks
|
|
76
|
-
|
|
77
|
-
Each H2 section must answer a question without needing the rest of the document. Include: the rule, a code example, and a cross-reference.
|
|
78
|
-
|
|
79
|
-
### 5. Tables over prose
|
|
80
|
-
|
|
81
|
-
Claude parses structured data more accurately than paragraphs. DO/DON'T tables, field reference tables, and command tables all retrieve better.
|
|
82
|
-
|
|
83
|
-
### 6. Cross-references create a navigation graph
|
|
84
|
-
|
|
85
|
-
The RAG indexer stores `prevChunk`/`nextChunk`/`siblings` metadata. Cross-references between documents let Claude follow chains: `core.md -> coding-rules.md -> database.md`.
|
|
86
|
-
|
|
87
|
-
### 7. No decorative formatting
|
|
88
|
-
|
|
89
|
-
ASCII boxes, excessive emoji, rhetorical questions, and motivational text all waste tokens without improving retrieval or comprehension.
|
|
90
|
-
|
|
91
|
-
---
|
|
92
|
-
|
|
93
|
-
## Embedding Pipeline
|
|
94
|
-
|
|
95
|
-
### Embedding Models
|
|
96
|
-
|
|
97
|
-
| Model | Quality | Speed | When Used |
|
|
98
|
-
|-------|---------|-------|-----------|
|
|
99
|
-
| `Xenova/all-MiniLM-L6-v2` | High (true semantic) | ~3s for 1000 entries | Primary — `build-embeddings.mjs` uses this |
|
|
100
|
-
| `domain-aware-hash-v1` | Good (domain clustering) | <1s for 1000 entries | Fallback when Transformers.js unavailable |
|
|
101
|
-
|
|
102
|
-
**Neural embeddings (Xenova/all-MiniLM-L6-v2):**
|
|
103
|
-
- Uses `@xenova/transformers` with ONNX WASM runtime
|
|
104
|
-
- 384-dimensional vectors, L2-normalized
|
|
105
|
-
- True semantic understanding — "soft delete" matches "mark as deleted" without keyword overlap
|
|
106
|
-
- Loaded lazily on first use, cached for subsequent queries
|
|
107
|
-
- Ships with moflo; no additional install needed
|
|
108
|
-
|
|
109
|
-
**Domain-aware hash embeddings (fallback):**
|
|
110
|
-
- Custom SimHash-style algorithm with 12 domain clusters
|
|
111
|
-
- Domain clusters group related terms: `database` (orm, postgresql, entity, schema...), `frontend` (react, component, css...), `testing` (vitest, mock, expect...), etc.
|
|
112
|
-
- Multi-position hashing with bigram/trigram features
|
|
113
|
-
- Good at keyword-level matching but misses semantic paraphrases
|
|
114
|
-
- No external dependencies — always available
|
|
115
|
-
|
|
116
|
-
### The Embedding Alignment Problem
|
|
117
|
-
|
|
118
|
-
**Critical rule:** Query embeddings MUST match stored embeddings. Computing cosine similarity between vectors from different models produces meaningless scores.
|
|
119
|
-
|
|
120
|
-
Both the search scripts and the MCP memory tools auto-detect the stored embedding model:
|
|
121
|
-
|
|
122
|
-
```javascript
|
|
123
|
-
// Check what model stored entries predominantly use
|
|
124
|
-
const modelCheck = db.prepare(
|
|
125
|
-
`SELECT embedding_model, COUNT(*) as cnt FROM memory_entries
|
|
126
|
-
WHERE status = 'active' AND embedding IS NOT NULL
|
|
127
|
-
GROUP BY embedding_model ORDER BY cnt DESC LIMIT 1`
|
|
128
|
-
).get();
|
|
129
|
-
|
|
130
|
-
// If stored embeddings are neural, use neural for query too
|
|
131
|
-
```
|
|
132
|
-
|
|
133
|
-
Search also **filters out entries with mismatched `embedding_model`** — if the query uses neural embeddings, hash-embedded entries are skipped (and vice versa).
|
|
134
|
-
|
|
135
|
-
### Domain Cluster Tuning
|
|
136
|
-
|
|
137
|
-
The hash fallback's domain clusters can be extended with project-specific terms. Add terms to the relevant cluster in the hash embedding function to improve keyword-level matching for your domain:
|
|
138
|
-
|
|
139
|
-
| Cluster | Example Terms |
|
|
140
|
-
|---------|--------------|
|
|
141
|
-
| `database` | your ORM, database engine, schema terms |
|
|
142
|
-
| `frontend` | UI framework, component library terms |
|
|
143
|
-
| `backend` | DI container, API framework terms |
|
|
144
|
-
| `testing` | test framework, assertion library terms |
|
|
145
|
-
| `security` | auth system, permission model terms |
|
|
146
|
-
|
|
147
|
-
---
|
|
148
|
-
|
|
149
|
-
## RAG Indexing Pipeline
|
|
150
|
-
|
|
151
|
-
### How `index-guidance.mjs` Works
|
|
152
|
-
|
|
153
|
-
1. **Scan** configured directories for `.md` files
|
|
154
|
-
2. **Hash check** — Skip files whose content hash hasn't changed (unless `--force`)
|
|
155
|
-
3. **Store full document** as `doc-{prefix}-{name}` (for complete retrieval)
|
|
156
|
-
4. **Chunk on `##` headers** — Each H2 section becomes a separate entry
|
|
157
|
-
5. **H3 subsections** become child chunks with parent H2 as context prefix
|
|
158
|
-
6. **Force-split** sections over 4000 chars on paragraph boundaries
|
|
159
|
-
7. **Build RAG metadata** for every chunk:
|
|
160
|
-
|
|
161
|
-
| Metadata Field | Purpose |
|
|
162
|
-
|---------------|---------|
|
|
163
|
-
| `parentDoc` | Link back to full document |
|
|
164
|
-
| `prevChunk` / `nextChunk` | Sequential navigation |
|
|
165
|
-
| `siblings` | All chunk keys from same document |
|
|
166
|
-
| `hierarchicalParent` / `hierarchicalChildren` | H2->H3 relationships |
|
|
167
|
-
| `contextBefore` / `contextAfter` | 20% overlapping text from adjacent chunks |
|
|
168
|
-
|
|
169
|
-
8. **Prepend context** — Each chunk's searchable content includes overlap from neighbors
|
|
170
|
-
9. **Stale cleanup** — After indexing, remove entries for files that no longer exist on disk
|
|
171
|
-
10. **Background embedding** — Spawn `build-embeddings.mjs` in background to generate vectors
|
|
172
|
-
|
|
173
|
-
### Configuring Indexed Directories
|
|
174
|
-
|
|
175
|
-
In `moflo.yaml`:
|
|
176
|
-
|
|
177
|
-
```yaml
|
|
178
|
-
guidance:
|
|
179
|
-
directories:
|
|
180
|
-
- .claude/guidance
|
|
181
|
-
- docs/guides
|
|
182
|
-
```
|
|
183
|
-
|
|
184
|
-
Default directories (when no config): `.claude/guidance`, `docs/guides`
|
|
185
|
-
|
|
186
|
-
Moflo also automatically indexes its own bundled guidance from `node_modules/moflo/.claude/guidance/` when installed as a library in a consumer project.
|
|
187
|
-
|
|
188
|
-
---
|
|
189
|
-
|
|
190
|
-
## Lessons Learned
|
|
191
|
-
|
|
192
|
-
### Document Optimization
|
|
193
|
-
|
|
194
|
-
1. **`**Purpose:**` lines are critical** — They're the single highest-impact addition for retrieval quality.
|
|
195
|
-
2. **Headings are embeddings** — In a chunk-per-section system, the heading IS the embedding's primary signal. Generic headings are nearly useless.
|
|
196
|
-
3. **Tables retrieve better than prose** — Claude parses structured data with higher accuracy.
|
|
197
|
-
4. **Cross-references are the RAG graph** — Isolated documents can't be navigated.
|
|
198
|
-
5. **Chunk size matters** — A 10,000-char section produces a diluted embedding. Splitting into focused sections triples the chance of matching specific queries.
|
|
199
|
-
|
|
200
|
-
### Embedding Pipeline
|
|
201
|
-
|
|
202
|
-
6. **Query embeddings MUST match stored embeddings** — This is the single most critical rule. Auto-detect and match.
|
|
203
|
-
7. **Domain clusters need project-specific terms** — Generic NLP clusters miss project-specific terminology. Adding terms to domain clusters dramatically improves keyword-level matching.
|
|
204
|
-
8. **Filter mismatched entries during search** — Mixed databases need explicit filtering by `embedding_model`.
|
|
205
|
-
|
|
206
|
-
---
|
|
207
|
-
|
|
208
|
-
## Replication Guide
|
|
209
|
-
|
|
210
|
-
To set up this system in a new project using moflo:
|
|
211
|
-
|
|
212
|
-
### 1. Install Moflo
|
|
213
|
-
|
|
214
|
-
```bash
|
|
215
|
-
npm install moflo
|
|
216
|
-
npx flo init
|
|
217
|
-
```
|
|
218
|
-
|
|
219
|
-
### 2. Create Guidance Documents
|
|
220
|
-
|
|
221
|
-
Create `.claude/guidance/` directory with markdown files following the optimization rules above:
|
|
222
|
-
- Every file has `**Purpose:**` line
|
|
223
|
-
- H2 sections with domain keywords in headings
|
|
224
|
-
- Tables for structured rules
|
|
225
|
-
- Cross-references between related docs
|
|
226
|
-
- 1000-4000 char sections
|
|
227
|
-
|
|
228
|
-
### 3. Configure Indexing
|
|
229
|
-
|
|
230
|
-
In `moflo.yaml`:
|
|
231
|
-
|
|
232
|
-
```yaml
|
|
233
|
-
guidance:
|
|
234
|
-
directories:
|
|
235
|
-
- .claude/guidance
|
|
236
|
-
- docs/guides
|
|
237
|
-
|
|
238
|
-
auto_index:
|
|
239
|
-
guidance: true
|
|
240
|
-
code_map: true
|
|
241
|
-
```
|
|
242
|
-
|
|
243
|
-
### 4. Index and Verify
|
|
244
|
-
|
|
245
|
-
```bash
|
|
246
|
-
# Index documents
|
|
247
|
-
npx flo-index --force
|
|
248
|
-
|
|
249
|
-
# Test search quality
|
|
250
|
-
npx flo memory search --query "your domain query" --namespace guidance
|
|
251
|
-
|
|
252
|
-
# Verify from Claude Code via MCP
|
|
253
|
-
# mcp__moflo__memory_search query="your domain query" namespace="guidance"
|
|
254
|
-
```
|
|
255
|
-
|
|
256
|
-
---
|
|
257
|
-
|
|
258
|
-
## See Also
|
|
259
|
-
|
|
260
|
-
- `.claude/guidance/memory-strategy.md` - Memory architecture and search commands
|
|
261
|
-
- `.claude/guidance/agent-bootstrap.md` - Subagent bootstrap guide
|
|
262
|
-
- `.claude/guidance/moflo.md` - Full CLI/MCP reference
|
|
1
|
+
# Guidance & Memory Tuning Strategy
|
|
2
|
+
|
|
3
|
+
**Purpose:** How to build and tune a RAG-based guidance system using moflo's semantic search, embedding pipeline, and indexing. Reference when creating guidance documents, troubleshooting search quality, or extending the system.
|
|
4
|
+
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
## Problem Statement
|
|
8
|
+
|
|
9
|
+
Claude Code agents need project-specific knowledge — coding rules, architecture patterns, entity templates, testing conventions — delivered at the right moment. Without a retrieval system, agents either miss critical rules or require massive CLAUDE.md files that waste context window tokens.
|
|
10
|
+
|
|
11
|
+
**Goals:**
|
|
12
|
+
- Agents find relevant guidance automatically via semantic search
|
|
13
|
+
- Subagents spawned by the coordinator inherit memory access
|
|
14
|
+
- Search quality is high enough that agents don't need to read whole files
|
|
15
|
+
- The system survives `npm install` (indexing runs on session start)
|
|
16
|
+
|
|
17
|
+
---
|
|
18
|
+
|
|
19
|
+
## Architecture
|
|
20
|
+
|
|
21
|
+
Three layers: embedding generation, vector storage, and search.
|
|
22
|
+
|
|
23
|
+
```
|
|
24
|
+
Source Files (.claude/guidance/*.md, docs/*.md)
|
|
25
|
+
|
|
|
26
|
+
v
|
|
27
|
+
index-guidance.mjs --- Chunk on ## headers, build RAG links
|
|
28
|
+
| (prev/next, siblings, parent/child, context overlap)
|
|
29
|
+
v
|
|
30
|
+
.swarm/memory.db ----- SQLite (entries + metadata + embedding vectors)
|
|
31
|
+
|
|
|
32
|
+
v
|
|
33
|
+
build-embeddings.mjs - Generate 384-dim vectors per entry
|
|
34
|
+
| (Xenova/all-MiniLM-L6-v2 neural, or domain-aware hash fallback)
|
|
35
|
+
v
|
|
36
|
+
RuVector (@ruvector/core) -- HNSW index infrastructure
|
|
37
|
+
v
|
|
38
|
+
Search layer ---------- Three access paths:
|
|
39
|
+
1. MCP tools (mcp__moflo__memory_search) -- preferred
|
|
40
|
+
2. CLI (npx flo memory search) -- fallback
|
|
41
|
+
3. Script (semantic-search.mjs) -- detailed output
|
|
42
|
+
```
|
|
43
|
+
|
|
44
|
+
**Key files:**
|
|
45
|
+
|
|
46
|
+
| File | Role |
|
|
47
|
+
|------|------|
|
|
48
|
+
| `.claude/guidance/*.md` | Guidance documents (source of truth) |
|
|
49
|
+
| `bin/index-guidance.mjs` | Chunks documents, stores in SQLite with RAG metadata |
|
|
50
|
+
| `bin/build-embeddings.mjs` | Generates vector embeddings (neural or hash) |
|
|
51
|
+
| `.swarm/memory.db` | SQLite database with entries, metadata, embeddings |
|
|
52
|
+
| `@ruvector/core` | HNSW vector index, WASM fallback, SIMD operations |
|
|
53
|
+
|
|
54
|
+
---
|
|
55
|
+
|
|
56
|
+
## Guidance Document Optimization Rules
|
|
57
|
+
|
|
58
|
+
These rules determine how well your guidance documents retrieve via semantic search:
|
|
59
|
+
|
|
60
|
+
### 1. Every file needs a Purpose line
|
|
61
|
+
|
|
62
|
+
Add `**Purpose:**` as the first meaningful line after the title. Claude checks this first for relevance scoring. Without it, the chunk has no summary signal.
|
|
63
|
+
|
|
64
|
+
### 2. H2 headings are the primary retrieval signal
|
|
65
|
+
|
|
66
|
+
The indexer splits on `##`. Each heading becomes the chunk title, prepended to searchable content. Domain-specific keywords in headings dramatically improve recall.
|
|
67
|
+
|
|
68
|
+
**Bad:** `## Overview`, `## Rules`, `## Pattern`
|
|
69
|
+
**Good:** `## Soft Delete Rules`, `## JWT Authentication Pattern`, `## Database Entity Migration`
|
|
70
|
+
|
|
71
|
+
### 3. Ideal chunk size: 1000-4000 characters
|
|
72
|
+
|
|
73
|
+
Below 50 chars the chunk is dropped. Above 6000 the indexer force-splits on paragraphs, which breaks mid-thought. The sweet spot produces focused embeddings.
|
|
74
|
+
|
|
75
|
+
### 4. Self-contained chunks
|
|
76
|
+
|
|
77
|
+
Each H2 section must answer a question without needing the rest of the document. Include: the rule, a code example, and a cross-reference.
|
|
78
|
+
|
|
79
|
+
### 5. Tables over prose
|
|
80
|
+
|
|
81
|
+
Claude parses structured data more accurately than paragraphs. DO/DON'T tables, field reference tables, and command tables all retrieve better.
|
|
82
|
+
|
|
83
|
+
### 6. Cross-references create a navigation graph
|
|
84
|
+
|
|
85
|
+
The RAG indexer stores `prevChunk`/`nextChunk`/`siblings` metadata. Cross-references between documents let Claude follow chains: `core.md -> coding-rules.md -> database.md`.
|
|
86
|
+
|
|
87
|
+
### 7. No decorative formatting
|
|
88
|
+
|
|
89
|
+
ASCII boxes, excessive emoji, rhetorical questions, and motivational text all waste tokens without improving retrieval or comprehension.
|
|
90
|
+
|
|
91
|
+
---
|
|
92
|
+
|
|
93
|
+
## Embedding Pipeline
|
|
94
|
+
|
|
95
|
+
### Embedding Models
|
|
96
|
+
|
|
97
|
+
| Model | Quality | Speed | When Used |
|
|
98
|
+
|-------|---------|-------|-----------|
|
|
99
|
+
| `Xenova/all-MiniLM-L6-v2` | High (true semantic) | ~3s for 1000 entries | Primary — `build-embeddings.mjs` uses this |
|
|
100
|
+
| `domain-aware-hash-v1` | Good (domain clustering) | <1s for 1000 entries | Fallback when Transformers.js unavailable |
|
|
101
|
+
|
|
102
|
+
**Neural embeddings (Xenova/all-MiniLM-L6-v2):**
|
|
103
|
+
- Uses `@xenova/transformers` with ONNX WASM runtime
|
|
104
|
+
- 384-dimensional vectors, L2-normalized
|
|
105
|
+
- True semantic understanding — "soft delete" matches "mark as deleted" without keyword overlap
|
|
106
|
+
- Loaded lazily on first use, cached for subsequent queries
|
|
107
|
+
- Ships with moflo; no additional install needed
|
|
108
|
+
|
|
109
|
+
**Domain-aware hash embeddings (fallback):**
|
|
110
|
+
- Custom SimHash-style algorithm with 12 domain clusters
|
|
111
|
+
- Domain clusters group related terms: `database` (orm, postgresql, entity, schema...), `frontend` (react, component, css...), `testing` (vitest, mock, expect...), etc.
|
|
112
|
+
- Multi-position hashing with bigram/trigram features
|
|
113
|
+
- Good at keyword-level matching but misses semantic paraphrases
|
|
114
|
+
- No external dependencies — always available
|
|
115
|
+
|
|
116
|
+
### The Embedding Alignment Problem
|
|
117
|
+
|
|
118
|
+
**Critical rule:** Query embeddings MUST match stored embeddings. Computing cosine similarity between vectors from different models produces meaningless scores.
|
|
119
|
+
|
|
120
|
+
Both the search scripts and the MCP memory tools auto-detect the stored embedding model:
|
|
121
|
+
|
|
122
|
+
```javascript
|
|
123
|
+
// Check what model stored entries predominantly use
|
|
124
|
+
const modelCheck = db.prepare(
|
|
125
|
+
`SELECT embedding_model, COUNT(*) as cnt FROM memory_entries
|
|
126
|
+
WHERE status = 'active' AND embedding IS NOT NULL
|
|
127
|
+
GROUP BY embedding_model ORDER BY cnt DESC LIMIT 1`
|
|
128
|
+
).get();
|
|
129
|
+
|
|
130
|
+
// If stored embeddings are neural, use neural for query too
|
|
131
|
+
```
|
|
132
|
+
|
|
133
|
+
Search also **filters out entries with mismatched `embedding_model`** — if the query uses neural embeddings, hash-embedded entries are skipped (and vice versa).
|
|
134
|
+
|
|
135
|
+
### Domain Cluster Tuning
|
|
136
|
+
|
|
137
|
+
The hash fallback's domain clusters can be extended with project-specific terms. Add terms to the relevant cluster in the hash embedding function to improve keyword-level matching for your domain:
|
|
138
|
+
|
|
139
|
+
| Cluster | Example Terms |
|
|
140
|
+
|---------|--------------|
|
|
141
|
+
| `database` | your ORM, database engine, schema terms |
|
|
142
|
+
| `frontend` | UI framework, component library terms |
|
|
143
|
+
| `backend` | DI container, API framework terms |
|
|
144
|
+
| `testing` | test framework, assertion library terms |
|
|
145
|
+
| `security` | auth system, permission model terms |
|
|
146
|
+
|
|
147
|
+
---
|
|
148
|
+
|
|
149
|
+
## RAG Indexing Pipeline
|
|
150
|
+
|
|
151
|
+
### How `index-guidance.mjs` Works
|
|
152
|
+
|
|
153
|
+
1. **Scan** configured directories for `.md` files
|
|
154
|
+
2. **Hash check** — Skip files whose content hash hasn't changed (unless `--force`)
|
|
155
|
+
3. **Store full document** as `doc-{prefix}-{name}` (for complete retrieval)
|
|
156
|
+
4. **Chunk on `##` headers** — Each H2 section becomes a separate entry
|
|
157
|
+
5. **H3 subsections** become child chunks with parent H2 as context prefix
|
|
158
|
+
6. **Force-split** sections over 4000 chars on paragraph boundaries
|
|
159
|
+
7. **Build RAG metadata** for every chunk:
|
|
160
|
+
|
|
161
|
+
| Metadata Field | Purpose |
|
|
162
|
+
|---------------|---------|
|
|
163
|
+
| `parentDoc` | Link back to full document |
|
|
164
|
+
| `prevChunk` / `nextChunk` | Sequential navigation |
|
|
165
|
+
| `siblings` | All chunk keys from same document |
|
|
166
|
+
| `hierarchicalParent` / `hierarchicalChildren` | H2->H3 relationships |
|
|
167
|
+
| `contextBefore` / `contextAfter` | 20% overlapping text from adjacent chunks |
|
|
168
|
+
|
|
169
|
+
8. **Prepend context** — Each chunk's searchable content includes overlap from neighbors
|
|
170
|
+
9. **Stale cleanup** — After indexing, remove entries for files that no longer exist on disk
|
|
171
|
+
10. **Background embedding** — Spawn `build-embeddings.mjs` in background to generate vectors
|
|
172
|
+
|
|
173
|
+
### Configuring Indexed Directories
|
|
174
|
+
|
|
175
|
+
In `moflo.yaml`:
|
|
176
|
+
|
|
177
|
+
```yaml
|
|
178
|
+
guidance:
|
|
179
|
+
directories:
|
|
180
|
+
- .claude/guidance
|
|
181
|
+
- docs/guides
|
|
182
|
+
```
|
|
183
|
+
|
|
184
|
+
Default directories (when no config): `.claude/guidance`, `docs/guides`
|
|
185
|
+
|
|
186
|
+
Moflo also automatically indexes its own bundled guidance from `node_modules/moflo/.claude/guidance/` when installed as a library in a consumer project.
|
|
187
|
+
|
|
188
|
+
---
|
|
189
|
+
|
|
190
|
+
## Lessons Learned
|
|
191
|
+
|
|
192
|
+
### Document Optimization
|
|
193
|
+
|
|
194
|
+
1. **`**Purpose:**` lines are critical** — They're the single highest-impact addition for retrieval quality.
|
|
195
|
+
2. **Headings are embeddings** — In a chunk-per-section system, the heading IS the embedding's primary signal. Generic headings are nearly useless.
|
|
196
|
+
3. **Tables retrieve better than prose** — Claude parses structured data with higher accuracy.
|
|
197
|
+
4. **Cross-references are the RAG graph** — Isolated documents can't be navigated.
|
|
198
|
+
5. **Chunk size matters** — A 10,000-char section produces a diluted embedding. Splitting into focused sections triples the chance of matching specific queries.
|
|
199
|
+
|
|
200
|
+
### Embedding Pipeline
|
|
201
|
+
|
|
202
|
+
6. **Query embeddings MUST match stored embeddings** — This is the single most critical rule. Auto-detect and match.
|
|
203
|
+
7. **Domain clusters need project-specific terms** — Generic NLP clusters miss project-specific terminology. Adding terms to domain clusters dramatically improves keyword-level matching.
|
|
204
|
+
8. **Filter mismatched entries during search** — Mixed databases need explicit filtering by `embedding_model`.
|
|
205
|
+
|
|
206
|
+
---
|
|
207
|
+
|
|
208
|
+
## Replication Guide
|
|
209
|
+
|
|
210
|
+
To set up this system in a new project using moflo:
|
|
211
|
+
|
|
212
|
+
### 1. Install Moflo
|
|
213
|
+
|
|
214
|
+
```bash
|
|
215
|
+
npm install moflo
|
|
216
|
+
npx flo init
|
|
217
|
+
```
|
|
218
|
+
|
|
219
|
+
### 2. Create Guidance Documents
|
|
220
|
+
|
|
221
|
+
Create `.claude/guidance/` directory with markdown files following the optimization rules above:
|
|
222
|
+
- Every file has `**Purpose:**` line
|
|
223
|
+
- H2 sections with domain keywords in headings
|
|
224
|
+
- Tables for structured rules
|
|
225
|
+
- Cross-references between related docs
|
|
226
|
+
- 1000-4000 char sections
|
|
227
|
+
|
|
228
|
+
### 3. Configure Indexing
|
|
229
|
+
|
|
230
|
+
In `moflo.yaml`:
|
|
231
|
+
|
|
232
|
+
```yaml
|
|
233
|
+
guidance:
|
|
234
|
+
directories:
|
|
235
|
+
- .claude/guidance
|
|
236
|
+
- docs/guides
|
|
237
|
+
|
|
238
|
+
auto_index:
|
|
239
|
+
guidance: true
|
|
240
|
+
code_map: true
|
|
241
|
+
```
|
|
242
|
+
|
|
243
|
+
### 4. Index and Verify
|
|
244
|
+
|
|
245
|
+
```bash
|
|
246
|
+
# Index documents
|
|
247
|
+
npx flo-index --force
|
|
248
|
+
|
|
249
|
+
# Test search quality
|
|
250
|
+
npx flo memory search --query "your domain query" --namespace guidance
|
|
251
|
+
|
|
252
|
+
# Verify from Claude Code via MCP
|
|
253
|
+
# mcp__moflo__memory_search query="your domain query" namespace="guidance"
|
|
254
|
+
```
|
|
255
|
+
|
|
256
|
+
---
|
|
257
|
+
|
|
258
|
+
## See Also
|
|
259
|
+
|
|
260
|
+
- `.claude/guidance/memory-strategy.md` - Memory architecture and search commands
|
|
261
|
+
- `.claude/guidance/agent-bootstrap.md` - Subagent bootstrap guide
|
|
262
|
+
- `.claude/guidance/moflo.md` - Full CLI/MCP reference
|