pi-hermes-memory 0.3.3 → 0.4.0

package/README.md CHANGED
@@ -19,6 +19,9 @@ Your Pi agent normally forgets everything when you close a session. This extensi
19
19
  | **Memory Aging** | Entries carry timestamps — consolidation knows which facts are stale and which are fresh |
20
20
  | **Project Memory** | Per-project memory (`~/.pi/agent/<project>/MEMORY.md`) alongside your global memory |
21
21
  | **Secret Detection** | API keys, tokens, SSH keys, and credential assignments are blocked from being persisted to memory |
22
+ | **Session History Search** | Search across all past conversations via SQLite FTS5 — "what did we discuss about auth?" |
23
+ | **Extended Memory Store** | Unlimited searchable memories beyond the core 5,000-char limit |
24
+ | **Learn Memory Tool** | `/learn-memory-tool` — a skill that teaches users how to use the memory system |
22
25
 
23
26
  ## How It Works
24
27
 
@@ -157,9 +160,34 @@ Run `tsc --noEmit` and confirm zero errors.
157
160
 
158
161
  | Store | File | What goes here | Limit |
159
162
  |---|---|---|---|
160
- | **memory** | `MEMORY.md` | Agent's notes — env facts, project conventions, tool quirks, lessons learned | 2,200 chars |
161
- | **user** | `USER.md` | User profile — name, preferences, communication style, habits | 1,375 chars |
163
+ | **memory** | `MEMORY.md` | Agent's notes — env facts, project conventions, tool quirks, lessons learned | 5,000 chars |
164
+ | **user** | `USER.md` | User profile — name, preferences, communication style, habits | 5,000 chars |
162
165
  | **skills** | `skills/*.md` | Procedures — *how* to debug, deploy, test, or fix something | Unlimited |
166
+ | **extended** | `sessions.db` | Searchable memories beyond the core limit | Unlimited |
167
+ | **sessions** | `sessions.db` | Past conversation history (searchable via FTS5) | Unlimited |
168
+
169
+ ### Session History Search
170
+
171
+ The extension indexes your Pi session history into a SQLite database with FTS5 full-text search. The agent can search across all past conversations using the `session_search` tool:
172
+
173
+ | Tool | What it does |
174
+ |---|---|
175
+ | `session_search` | Search past conversations — "what did we discuss about auth?" |
176
+ | `memory_search` | Search extended memory store — unlimited capacity, keyword-based |
177
+
178
+ Session history is indexed automatically on session shutdown. To bulk-import existing sessions:
179
+
180
+ ```
181
+ /memory-index-sessions
182
+ ```
183
+
184
+ ### Extended Memory Store
185
+
186
+ When the core memory (5,000 chars) isn't enough, the agent can store additional memories in the SQLite-backed extended store. These are searchable via `memory_search` but not automatically injected into the system prompt.
187
+
188
+ This is the **hybrid memory architecture**:
189
+ - **Core memory** (MEMORY.md/USER.md): Always injected, 5,000 chars each, human-readable
190
+ - **Extended memory** (SQLite): Unlimited, searchable on demand, agent-driven
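
As a rough illustration of the core/extended split described above, the sketch below checks whether a new entry still fits in core memory and otherwise routes it to the extended store. `routeEntry`, `coreLength`, and the length accounting are assumptions made for illustration; the README only says that the agent can store additional memories when core memory is not enough, so the extension's real routing may differ.

```typescript
// Hypothetical sketch: function names and the length accounting are
// assumptions, not the extension's actual implementation.
const ENTRY_DELIMITER = "\n§\n"; // same value as ENTRY_DELIMITER in src/constants.ts
const CORE_LIMIT = 5000;         // default memoryCharLimit in 0.4.0

type Destination = "core" | "extended";

// Length of the core file if `entries` were joined with the delimiter.
function coreLength(entries: string[]): number {
  return entries.join(ENTRY_DELIMITER).length;
}

// Keep an entry in core memory while it fits; otherwise send it to the
// SQLite-backed extended store.
function routeEntry(
  existing: string[],
  entry: string,
  limit: number = CORE_LIMIT,
): Destination {
  return coreLength([...existing, entry]) <= limit ? "core" : "extended";
}
```

Under these assumptions, an entry that would push the joined file past the limit lands in the extended store, where it is reachable through `memory_search` rather than being injected into the system prompt.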
163
191
 
164
192
  ### Correction Detection
165
193
 
@@ -211,6 +239,8 @@ This means skills build up naturally over time without you having to ask.
211
239
  | `/memory-consolidate` | Manually trigger memory consolidation to free space |
212
240
  | `/memory-interview` | Answer a few questions to pre-fill your user profile |
213
241
  | `/memory-switch-project` | List all project memories and their entry counts |
242
+ | `/memory-index-sessions` | Import past Pi sessions into the search database |
243
+ | `/learn-memory-tool` | Skill that teaches users how to use the memory system |
214
244
 
215
245
  ### `/memory-insights` Output
216
246
 
@@ -254,9 +284,9 @@ Create `~/.pi/agent/hermes-memory-config.json`:
254
284
 
255
285
  ```json
256
286
  {
257
- "memoryCharLimit": 2200,
258
- "userCharLimit": 1375,
259
- "projectCharLimit": 2200,
287
+ "memoryCharLimit": 5000,
288
+ "userCharLimit": 5000,
289
+ "projectCharLimit": 5000,
260
290
  "memoryDir": "~/.pi/agent/memory",
261
291
  "nudgeInterval": 10,
262
292
  "nudgeToolCalls": 15,
@@ -271,9 +301,9 @@ Create `~/.pi/agent/hermes-memory-config.json`:
271
301
 
272
302
  | Setting | Default | Description |
273
303
  |---|---|---|
274
- | `memoryCharLimit` | `2200` | Max characters in MEMORY.md |
275
- | `userCharLimit` | `1375` | Max characters in USER.md |
276
- | `projectCharLimit` | `2200` | Max characters in project-scoped MEMORY.md |
304
+ | `memoryCharLimit` | `5000` | Max characters in MEMORY.md |
305
+ | `userCharLimit` | `5000` | Max characters in USER.md |
306
+ | `projectCharLimit` | `5000` | Max characters in project-scoped MEMORY.md |
277
307
  | `memoryDir` | `~/.pi/agent/memory` | Custom directory for memory files |
278
308
  | `nudgeInterval` | `10` | Turns between auto-reviews |
279
309
  | `nudgeToolCalls` | `15` | Tool calls between auto-reviews (OR with turns) |
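
To make the effect of the new defaults concrete, here is a minimal sketch of merging a user's `hermes-memory-config.json` over the defaults listed in the table. `resolveConfig` and `HermesMemoryConfig` are illustrative names, and only the keys visible in this hunk are modeled; the package's own loader in `src/config.ts` is not shown in this diff and may behave differently.

```typescript
// Hypothetical sketch: `resolveConfig` and `HermesMemoryConfig` are
// illustrative names; only the keys and defaults come from the table above.
interface HermesMemoryConfig {
  memoryCharLimit: number;
  userCharLimit: number;
  projectCharLimit: number;
  memoryDir: string;
  nudgeInterval: number;
  nudgeToolCalls: number;
}

const DEFAULTS: HermesMemoryConfig = {
  memoryCharLimit: 5000,
  userCharLimit: 5000,
  projectCharLimit: 5000,
  memoryDir: "~/.pi/agent/memory",
  nudgeInterval: 10,
  nudgeToolCalls: 15,
};

// Parse the JSON text of the config file and fill in defaults for any key
// the user left out. Invalid JSON falls back to the defaults entirely.
function resolveConfig(jsonText: string): HermesMemoryConfig {
  let user: Partial<HermesMemoryConfig> = {};
  try {
    const parsed: unknown = JSON.parse(jsonText);
    if (parsed !== null && typeof parsed === "object" && !Array.isArray(parsed)) {
      user = parsed as Partial<HermesMemoryConfig>;
    }
  } catch {
    // Keep the defaults when the file is not valid JSON.
  }
  return { ...DEFAULTS, ...user };
}
```

With this merge strategy, a config that pins `"memoryCharLimit": 2200` keeps that value, while any key left unset picks up the 5,000-character default.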
@@ -290,6 +320,7 @@ Create `~/.pi/agent/hermes-memory-config.json`:
290
320
  ~/.pi/agent/memory/
291
321
  ├── MEMORY.md ← Agent's personal notes (env facts, patterns, lessons)
292
322
  ├── USER.md ← User profile (name, preferences, habits)
323
+ ├── sessions.db ← SQLite database (session history + extended memory)
293
324
  └── skills/
294
325
  ├── debug-typescript-errors.md
295
326
  └── deploy-checklist.md
@@ -297,11 +328,13 @@ Create `~/.pi/agent/hermes-memory-config.json`:
297
328
 
298
329
  These are plain markdown files. You can read and edit them directly if you want to curate what the agent remembers. Memory entries are separated by `§` (section sign). Skills use standard SKILL.md format with frontmatter.
299
330
 
331
+ The `sessions.db` SQLite database stores session history and extended memory entries. It's searchable via FTS5 full-text search.
332
+
300
333
  ## Known Limitations
301
334
 
302
335
  - **`§` delimiter**: Memory entries are separated by `§` (section sign). If an entry naturally contains `§`, it will be split incorrectly on reload. This is rare in English text but possible. [Hermes uses the same delimiter.]
303
336
  - **Background review cost**: Each review cycle costs one full LLM API call via a child `pi -p` process. Correction detection and skill auto-extraction add occasional extra calls.
304
- - **No search/indexing**: At the 2,200-char limit, the LLM can scan the entire block. Full-text search across sessions is planned for v0.3.
337
+ - **Session search requires indexing**: Past sessions must be indexed before they're searchable. Run `/memory-index-sessions` to bulk-import, or let the extension auto-index on session shutdown.
305
338
  - **System prompts are invisible**: Pi's TUI does not display the system prompt. Memory injection works but you won't see it in the interface — verify by asking the agent a question that relies on stored memory.
306
339
  - **Skills are agent-generated**: Skills are created by the agent based on its experience. They may not always be perfectly structured. You can edit or delete them in `~/.pi/agent/memory/skills/`.
307
340
 
@@ -0,0 +1,160 @@
1
+ # v0.4 Plan: SQLite FTS5 Session Search + Hybrid Memory
2
+
3
+ ## Problem
4
+
5
+ The current memory architecture has two scaling bottlenecks:
6
+
7
+ 1. **Memory capacity**: MEMORY.md is capped at 2,200 chars. Power users accumulate knowledge faster than consolidation can manage. Important facts get pruned.
8
+ 2. **No session history search**: Past conversations are stored as JSONL files in `~/.pi/agent/sessions/<project>/`, but there's no way to search them. When the agent needs context from a previous session, it cannot retrieve it.
9
+
10
+ ## Solution: Hybrid Memory Architecture
11
+
12
+ ### Core memory (always injected, unchanged)
13
+ - `MEMORY.md` — 5,000 chars (up from 2,200)
14
+ - `USER.md` — 5,000 chars (up from 1,375)
15
+ - Still injected into every session via `<memory-context>` tags
16
+ - Still human-readable, still editable
17
+
18
+ ### Extended memory (SQLite, searchable on demand)
19
+ - `~/.pi/agent/memory/sessions.db`
20
+ - `memories` table — unlimited entries, searchable via FTS5
21
+ - Agent uses `memory_search` tool to query when it needs context
22
+ - Not automatically injected — agent must explicitly search
23
+
24
+ ### Session history (SQLite, searchable on demand)
25
+ - Same `sessions.db` file
26
+ - `sessions` + `messages` tables — all past conversations indexed
27
+ - `message_fts` FTS5 index — full-text search across all sessions
28
+ - Agent uses `session_search` tool to find relevant past context
29
+
30
+ ## Architecture
31
+
32
+ ```
33
+ Session starts
34
+
35
+ ┌─────────────────────────────────────────────────┐
36
+ │ System Prompt (always injected) │
37
+ │ ┌─────────────────────────────────────────────┐ │
38
+ │ │ <memory-context> │ │
39
+ │ │ MEMORY (your personal notes) [5,000 chars] │ │
40
+ │ │ ═══ END MEMORY ═══ │ │
41
+ │ │ </memory-context> │ │
42
+ │ │ <memory-context> │ │
43
+ │ │ USER PROFILE [5,000 chars] │ │
44
+ │ │ ═══ END MEMORY ═══ │ │
45
+ │ │ </memory-context> │ │
46
+ │ │ <memory-context> │ │
47
+ │ │ PROJECT MEMORY [5,000 chars] │ │
48
+ │ │ ═══ END MEMORY ═══ │ │
49
+ │ │ </memory-context> │ │
50
+ │ └─────────────────────────────────────────────┘ │
51
+ └─────────────────────────────────────────────────┘
52
+
53
+ Agent has access to tools:
54
+ memory_search("prisma migration")
55
+ → Searches memories table (global + project)
56
+ → Returns top-10 relevant entries
57
+
58
+ session_search("how we fixed the test hang")
59
+ → Searches session history via FTS5
60
+ → Returns relevant conversation snippets
61
+ ```
62
+
63
+ ## Data Model
64
+
65
+ ```sql
66
+ -- Session metadata
67
+ CREATE TABLE sessions (
68
+ id TEXT PRIMARY KEY, -- UUID from JSONL
69
+ project TEXT NOT NULL, -- decoded cwd path
70
+ started_at TEXT NOT NULL, -- ISO timestamp
71
+ ended_at TEXT, -- ISO timestamp (null if still running)
72
+ message_count INTEGER DEFAULT 0
73
+ );
74
+
75
+ -- All messages from all sessions
76
+ CREATE TABLE messages (
77
+ id TEXT PRIMARY KEY, -- message ID from JSONL
78
+ session_id TEXT NOT NULL REFERENCES sessions(id),
79
+ role TEXT NOT NULL, -- 'user', 'assistant', 'system'
80
+ content TEXT NOT NULL, -- extracted text content
81
+ timestamp TEXT NOT NULL, -- ISO timestamp
82
+ tool_calls TEXT -- JSON array of tool call names (for assistant messages)
83
+ );
84
+
85
+ -- FTS5 index for full-text search across messages
86
+ CREATE VIRTUAL TABLE message_fts USING fts5(
87
+ content,
88
+ content='messages',
89
+ content_rowid='rowid'
90
+ );
91
+
92
+ -- Extended memory entries (beyond MEMORY.md limit)
93
+ CREATE TABLE memories (
94
+ id INTEGER PRIMARY KEY AUTOINCREMENT,
95
+ project TEXT, -- NULL for global, project name for project-specific
96
+ target TEXT NOT NULL, -- 'memory' or 'user'
97
+ content TEXT NOT NULL,
98
+ created DATE NOT NULL,
99
+ last_referenced DATE NOT NULL
100
+ );
101
+
102
+ -- FTS5 index for memory search
103
+ CREATE VIRTUAL TABLE memory_fts USING fts5(
104
+ content,
105
+ content='memories',
106
+ content_rowid='id'
107
+ );
108
+ ```
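
Both FTS5 tables in the schema above are external-content tables (`content='messages'`, `content='memories'`), so SQLite does not update them when the base tables change; the index has to be kept in sync explicitly. The sketch below shows one way to do that with triggers, following the pattern in the SQLite FTS5 documentation. `syncTriggersSql` and the trigger names are hypothetical and not taken from the package.

```typescript
// Sketch only: `syncTriggersSql` is a hypothetical helper. It assumes the
// indexed column is named `content`, as in the schema above.
function syncTriggersSql(table: string, fts: string, rowidColumn: string): string {
  // Adds the new row's text to the index.
  const insert = `INSERT INTO ${fts}(rowid, content) VALUES (new.${rowidColumn}, new.content);`;
  // FTS5 'delete' command: removes the old row's text from the index.
  const remove = `INSERT INTO ${fts}(${fts}, rowid, content) VALUES ('delete', old.${rowidColumn}, old.content);`;
  return [
    `CREATE TRIGGER IF NOT EXISTS ${table}_ai AFTER INSERT ON ${table} BEGIN ${insert} END;`,
    `CREATE TRIGGER IF NOT EXISTS ${table}_ad AFTER DELETE ON ${table} BEGIN ${remove} END;`,
    `CREATE TRIGGER IF NOT EXISTS ${table}_au AFTER UPDATE ON ${table} BEGIN ${remove} ${insert} END;`,
  ].join("\n");
}

// `messages` has a TEXT primary key, so its FTS table keys on the implicit rowid;
// `memories` uses its INTEGER primary key `id`.
const messageTriggers = syncTriggersSql("messages", "message_fts", "rowid");
const memoryTriggers = syncTriggersSql("memories", "memory_fts", "id");
```

The resulting strings could be executed once during schema setup; `IF NOT EXISTS` keeps that step idempotent on an existing database.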
109
+
110
+ ## Key Design Decisions
111
+
112
+ ### 1. Session indexing is lazy (on session end)
113
+ - Don't parse JSONL files on every startup
114
+ - Index a session when `session_shutdown` fires
115
+ - Bulk import existing sessions via `/memory-index-sessions` command
116
+
117
+ ### 2. FTS5 for both memories and sessions
118
+ - Keyword search is sufficient for v0.4
119
+ - Embeddings deferred to v0.5+ if search quality isn't enough
120
+ - `better-sqlite3` includes FTS5 by default
121
+
122
+ ### 3. Single DB file
123
+ - `~/.pi/agent/memory/sessions.db` stores everything
124
+ - Memories + sessions + FTS indices in one file
125
+ - Simple backup (copy one file), simple cleanup
126
+
127
+ ### 4. Agent-driven search
128
+ - `memory_search` and `session_search` are LLM tools
129
+ - Agent decides when to search (not automatic)
130
+ - Avoids injecting irrelevant context into every session
131
+
132
+ ### 5. Char limit increase to 5,000
133
+ - MEMORY.md: 2,200 → 5,000 chars
134
+ - USER.md: 1,375 → 5,000 chars
135
+ - Project MEMORY.md: 2,200 → 5,000 chars
136
+ - More room for core memories before consolidation kicks in
137
+
138
+ ## Dependencies
139
+
140
+ | Package | Purpose | Size |
141
+ |---|---|---|
142
+ | `better-sqlite3` | SQLite with FTS5 | ~1MB native addon |
143
+
144
+ ## Risks
145
+
146
+ | Risk | Mitigation |
147
+ |---|---|
148
+ | `better-sqlite3` is a native C++ addon | Standard for dev tools; CI has build tools |
149
+ | FTS5 search quality | Start with keyword search, add embeddings later if needed |
150
+ | Session JSONL format changes | Parse defensively, skip unknown message types |
151
+ | Large session history (1000+ sessions) | FTS5 handles this well; add pagination to results |
152
+ | DB corruption | Atomic writes, WAL mode, backup before migrations |
153
+
154
+ ## Success Criteria
155
+
156
+ 1. `session_search("prisma migration")` returns relevant conversation snippets from past sessions
157
+ 2. `memory_search("auth setup")` returns relevant entries from extended memory store
158
+ 3. MEMORY.md limit raised to 5,000 chars without breaking existing functionality
159
+ 4. Existing session files indexed without data loss
160
+ 5. All tests pass, zero regressions
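
Success criteria 1 and 2 depend on passing free-form text to an FTS5 `MATCH` clause. FTS5 has its own query syntax (operators such as `AND`, `OR`, `NOT`, `NEAR`, and special characters), so raw user text can raise syntax errors. The sketch below is one possible way to neutralize that by quoting every token; `toFtsQuery` and `SEARCH_SQL` are hypothetical, and the SQL assumes the `messages` / `message_fts` schema from the plan rather than the package's actual query.

```typescript
// Hypothetical helper: turns free-form text into an FTS5 MATCH expression.
// Each whitespace-separated token becomes a double-quoted FTS5 string, with
// embedded double quotes doubled. Adjacent strings are combined with an
// implicit AND.
function toFtsQuery(input: string): string {
  return input
    .split(/\s+/)
    .filter((token) => token.length > 0)
    .map((token) => `"${token.replace(/"/g, '""')}"`)
    .join(" ");
}

// Assumed query shape against the planned schema; not taken from the package.
// snippet() arguments: table, column index, open mark, close mark,
// ellipsis text, maximum number of tokens.
const SEARCH_SQL = `
  SELECT m.session_id, m.role, m.timestamp,
         snippet(message_fts, 0, '[', ']', '...', 12) AS snippet
  FROM message_fts
  JOIN messages AS m ON m.rowid = message_fts.rowid
  WHERE message_fts MATCH ?
  ORDER BY rank
  LIMIT ?`;
```

A caller would bind `toFtsQuery(userText)` and a limit to the two placeholders. Because `toFtsQuery` returns an empty string for blank input, the caller should skip the search in that case instead of running the query.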
@@ -0,0 +1,146 @@
1
+ # v0.4 Tasks: SQLite FTS5 Session Search + Hybrid Memory
2
+
3
+ ## Status Legend
4
+ - `[ ]` Not started
5
+ - `[~]` In progress
6
+ - `[x]` Done
7
+
8
+ ---
9
+
10
+ ## Epic 1: SQLite Foundation
11
+
12
+ ### Task 1.1: Install better-sqlite3 and create DB module
13
+ - [ ] Install `better-sqlite3` + `@types/better-sqlite3`
14
+ - [ ] Create `src/store/db.ts` — DatabaseManager class
15
+ - Lazy initialization (create/open DB on first use)
16
+ - WAL mode for concurrent reads
17
+ - Auto-create tables if they don't exist
18
+ - `close()` method for cleanup
19
+ - [ ] Create `tests/store/db.test.ts` — tests for DB initialization, table creation, close/reopen
20
+
21
+ ### Task 1.2: Create schema and migrations
22
+ - [ ] Define schema in `src/store/schema.ts` — all CREATE TABLE statements
23
+ - `sessions` table
24
+ - `messages` table
25
+ - `message_fts` FTS5 virtual table
26
+ - `memories` table
27
+ - `memory_fts` FTS5 virtual table
28
+ - [ ] Add triggers to keep FTS index in sync (INSERT/UPDATE/DELETE)
29
+ - [ ] Test: schema creates cleanly on fresh DB, idempotent on existing DB
30
+
31
+ ---
32
+
33
+ ## Epic 2: Session History Indexing
34
+
35
+ ### Task 2.1: JSONL parser
36
+ - [ ] Create `src/store/session-parser.ts`
37
+ - `parseSessionFile(path)` — read JSONL, extract session metadata + messages
38
+ - Handle all message types: user, assistant, system, tool_result
39
+ - Extract text content from `content` array (handle text, thinking, tool_use types)
40
+ - Skip unknown types gracefully
41
+ - Return structured `SessionData` with `messages: ParsedMessage[]`
42
+ - [ ] Create `tests/store/session-parser.test.ts` — test with real JSONL fixtures
43
+
44
+ ### Task 2.2: Session indexer
45
+ - [ ] Create `src/store/session-indexer.ts`
46
+ - `indexSession(db, sessionData)` — INSERT into sessions + messages tables
47
+ - `indexAllSessions(db, projectPath?)` — bulk index all sessions for a project (or all projects)
48
+ - Skip already-indexed sessions (by session ID)
49
+ - `getSessionStats(db)` — count of sessions, messages, indexed projects
50
+ - [ ] Create `tests/store/session-indexer.test.ts` — test indexing, deduplication, stats
51
+
52
+ ### Task 2.3: /memory-index-sessions command
53
+ - [ ] Create `src/handlers/index-sessions.ts`
54
+ - `/memory-index-sessions` — bulk import existing JSONL sessions
55
+ - Show progress: "Indexing 36 sessions..."
56
+ - Show result: "Indexed 36 sessions, 1,247 messages"
57
+ - Handle errors gracefully (corrupt JSONL, missing files)
58
+ - [ ] Wire into `src/index.ts`
59
+ - [ ] Create `tests/handlers/index-sessions.test.ts`
60
+
61
+ ---
62
+
63
+ ## Epic 3: Session Search
64
+
65
+ ### Task 3.1: Session search store
66
+ - [ ] Add to `src/store/session-indexer.ts` (or separate `session-search.ts`)
67
+ - `searchSessions(db, query, options?)` — FTS5 search across messages
68
+ - Options: `limit`, `project`, `role` filter, `since` date filter
69
+ - Returns: `SearchResult[]` with `{sessionId, role, content, timestamp, snippet, project}`
70
+ - `snippet` — highlighted match context from FTS5 `snippet()` function
71
+ - [ ] Create `tests/store/session-search.test.ts` — test search, filters, relevance
72
+
73
+ ### Task 3.2: session_search tool
74
+ - [ ] Create `src/tools/session-search-tool.ts`
75
+ - LLM tool definition: `session_search(query, project?, limit?)`
76
+ - Returns formatted results for the agent
77
+ - Includes session date, project, and content snippet
78
+ - [ ] Register in `src/index.ts`
79
+ - [ ] Create `tests/tools/session-search-tool.test.ts`
80
+
81
+ ---
82
+
83
+ ## Epic 4: Extended Memory Store
84
+
85
+ ### Task 4.1: SQLite memory store
86
+ - [ ] Create `src/store/sqlite-memory-store.ts`
87
+ - `addMemory(db, content, project?, target?)` — INSERT into memories + memory_fts
88
+ - `searchMemories(db, query, options?)` — FTS5 search across memories
89
+ - `getMemories(db, project?, target?)` — list all memories (optionally filtered)
90
+ - `removeMemory(db, id)` — DELETE by ID
91
+ - `getMemoryStats(db)` — count by project/target
92
+ - [ ] Create `tests/store/sqlite-memory-store.test.ts`
93
+
94
+ ### Task 4.2: memory_search tool
95
+ - [ ] Create `src/tools/memory-search-tool.ts`
96
+ - LLM tool definition: `memory_search(query, project?, limit?)`
97
+ - Searches both global and project-specific memories
98
+ - Returns formatted results for the agent
99
+ - [ ] Register in `src/index.ts`
100
+ - [ ] Create `tests/tools/memory-search-tool.test.ts`
101
+
102
+ ---
103
+
104
+ ## Epic 5: Char Limit Increase
105
+
106
+ ### Task 5.1: Update defaults
107
+ - [ ] Update `src/config.ts` — change defaults:
108
+ - `memoryCharLimit`: 2200 → 5000
109
+ - `userCharLimit`: 1375 → 5000
110
+ - `projectCharLimit`: 2200 → 5000
111
+ - [ ] Update `src/constants.ts` — change constants if any
112
+ - [ ] Update README configuration table
113
+
114
+ ### Task 5.2: Update tests
115
+ - [ ] Update all tests that depend on char limits
116
+ - [ ] Verify consolidation still works at new limits
117
+ - [ ] Verify interview still works at new limits
118
+
119
+ ---
120
+
121
+ ## Epic 6: Integration & Polish
122
+
123
+ ### Task 6.1: Wire everything into index.ts
124
+ - [ ] Initialize DatabaseManager on extension load
125
+ - [ ] Register `session_search` and `memory_search` tools
126
+ - [ ] Register `/memory-index-sessions` command
127
+ - [ ] Auto-index session on `session_shutdown` event
128
+ - [ ] Close DB on extension unload
129
+
130
+ ### Task 6.2: Add session indexing to background review
131
+ - [ ] In `session-flush.ts` — also index the session to SQLite before flushing memories
132
+ - [ ] Ensure session is indexed even if shutdown event is missed
133
+
134
+ ### Task 6.3: Update README
135
+ - [ ] Add "Hybrid Memory Architecture" section
136
+ - [ ] Document `session_search` and `memory_search` tools
137
+ - [ ] Document `/memory-index-sessions` command
138
+ - [ ] Update char limit documentation
139
+ - [ ] Update configuration table
140
+
141
+ ### Task 6.4: Version bump & release
142
+ - [ ] Bump version to `0.4.0`
143
+ - [ ] Update CHANGELOG.md
144
+ - [ ] Run full test suite
145
+ - [ ] Publish to npm
146
+ - [ ] Create GitHub release
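
Task 2.1 asks for a parser that handles known message types and skips unknown ones gracefully. The sketch below illustrates that defensive style on in-memory JSONL text. The real Pi session format is not shown in this diff, so the record shape (`role`, `content`, and the part fields `text`, `thinking`, `name`) is an assumption, and `parseSessionText` and `extractText` are hypothetical names.

```typescript
// Hypothetical record shapes: every field name here is an assumption.
interface ParsedMessage {
  role: string;
  content: string;
}

const KNOWN_ROLES = new Set(["user", "assistant", "system", "tool_result"]);

// Pull plain text out of a `content` value that is either a string or an
// array of typed parts; unrecognized parts contribute nothing.
function extractText(content: unknown): string {
  if (typeof content === "string") return content;
  if (!Array.isArray(content)) return "";
  const pieces: string[] = [];
  for (const part of content) {
    if (part === null || typeof part !== "object") continue;
    const p = part as Record<string, unknown>;
    const type = p["type"];
    const text = p["text"];
    const thinking = p["thinking"];
    const name = p["name"];
    if (type === "text" && typeof text === "string") pieces.push(text);
    else if (type === "thinking" && typeof thinking === "string") pieces.push(thinking);
    else if (type === "tool_use" && typeof name === "string") pieces.push(`[tool: ${name}]`);
  }
  return pieces.join("\n");
}

// Parse JSONL text line by line. Blank lines are ignored; malformed JSON and
// records with an unrecognized role are counted in `skipped`.
function parseSessionText(jsonl: string): { messages: ParsedMessage[]; skipped: number } {
  const messages: ParsedMessage[] = [];
  let skipped = 0;
  for (const line of jsonl.split("\n")) {
    if (line.trim() === "") continue;
    try {
      const record: unknown = JSON.parse(line);
      if (record === null || typeof record !== "object") {
        skipped++;
        continue;
      }
      const r = record as Record<string, unknown>;
      const role = r["role"];
      if (typeof role !== "string" || !KNOWN_ROLES.has(role)) {
        skipped++;
        continue;
      }
      messages.push({ role, content: extractText(r["content"]) });
    } catch {
      skipped++; // malformed JSON on this line
    }
  }
  return { messages, skipped };
}
```

Counting skipped lines rather than throwing lets a bulk import report partial failures while still indexing everything that parsed.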
package/docs/ROADMAP.md CHANGED
@@ -242,50 +242,68 @@ Project-scoped memory (`~/.pi/agent/<project>/MEMORY.md`) was added in the featu
242
242
 
243
243
  ---
244
244
 
245
- ## v0.4.0 — Structured Storage + Session Search
245
+ ## v0.4.0 — SQLite FTS5 Session Search + Hybrid Memory
246
246
 
247
- **Goal**: SQLite backend with FTS5 full-text search over all past conversations. MemoryBackend interface for pluggable storage. Keep the same tool interface.
247
+ **Goal**: SQLite backend with FTS5 full-text search over all past conversations. Extended memory store with unlimited capacity. Increased core memory limits.
248
248
 
249
- ### Core Abstraction
249
+ **Why now**: Power users hit the 2,200-char limit and lose important knowledge. Past sessions are rich with context but unreachable. Hybrid memory solves both — core memories always injected, deep knowledge searchable on demand.
250
250
 
251
- ```typescript
252
- interface MemoryBackend {
253
- // Write
254
- add(target: "memory" | "user", entry: MemoryEntry): Promise<MemoryResult>;
255
- replace(target: "memory" | "user", query: string, entry: MemoryEntry): Promise<MemoryResult>;
256
- remove(target: "memory" | "user", query: string): Promise<MemoryResult>;
251
+ **Full plan**: `docs/0.4/PLAN.md` · **Tasks**: `docs/0.4/TASKS.md`
257
252
 
258
- // Read
259
- getAll(target: "memory" | "user"): Promise<MemoryEntry[]>;
260
- search(query: string, limit?: number): Promise<MemoryEntry[]>;
253
+ ### Architecture
261
254
 
262
- // Lifecycle
263
- formatForSystemPrompt(cwd?: string, prompt?: string): Promise<string>;
264
- close(): Promise<void>;
265
- }
266
255
  ```
256
+ Session starts
257
+
258
+ ┌─────────────────────────────────────────────────┐
259
+ │ System Prompt (always injected) │
260
+ │ • MEMORY.md — 5,000 chars (up from 2,200) │
261
+ │ • USER.md — 5,000 chars (up from 1,375) │
262
+ │ • Project MEMORY.md — 5,000 chars │
263
+ │ • Skills index │
264
+ └─────────────────────────────────────────────────┘
265
+
266
+ Agent has access to tools:
267
+ memory_search("prisma migration")
268
+ → Searches SQLite memories table (global + project)
269
+ → Returns top-10 relevant entries
270
+
271
+ session_search("how we fixed the test hang")
272
+ → Searches session history via FTS5
273
+ → Returns relevant conversation snippets
274
+ ```
275
+
276
+ ### Data Model
267
277
 
268
- Current `MemoryStore` becomes `MarkdownBackend` — the default, zero-dependency implementation. New `SQLiteBackend` adds structure + FTS5 search.
278
+ - `~/.pi/agent/memory/sessions.db`: single SQLite file
279
+ - `sessions` + `messages` tables — all past conversations indexed
280
+ - `message_fts` FTS5 virtual table — full-text search across messages
281
+ - `memories` table — extended memory entries (unlimited, searchable)
282
+ - `memory_fts` FTS5 virtual table — full-text search across memories
269
283
 
270
284
  ### Deliverables
271
285
 
272
- - [ ] `MemoryBackend` interface in `src/types.ts`
273
- - [ ] `MarkdownBackend` — wraps current `MemoryStore` (backwards compatible)
274
- - [ ] `SQLiteBackend` — FTS5 search, key-value entries, confidence scores, dedup by key
275
- - [ ] Session indexer — index past and current session conversations for full-text search
276
- - [ ] `session_search` tool agent can query past conversations on demand
277
- - [ ] Summarization via `pi.exec()` — summarize relevant session fragments to keep token cost manageable
278
- - [ ] Config: `"backend": "markdown" | "sqlite"` (defaults to `markdown` for zero-dep install)
286
+ - [ ] `better-sqlite3` dependency: SQLite with FTS5
287
+ - [ ] `src/store/db.ts` — DatabaseManager (lazy init, WAL mode, auto-create tables)
288
+ - [ ] `src/store/session-parser.ts` — JSONL parser for Pi session files
289
+ - [ ] `src/store/session-indexer.ts` — index sessions + messages into SQLite
290
+ - [ ] `src/store/session-search.ts` — FTS5 search across session history
291
+ - [ ] `src/store/sqlite-memory-store.ts` — extended memory store (unlimited, searchable)
292
+ - [ ] `session_search` tool: agent queries past conversations
293
+ - [ ] `memory_search` tool — agent queries extended memories
294
+ - [ ] `/memory-index-sessions` command — bulk import existing sessions
295
+ - [ ] Auto-index session on shutdown
296
+ - [ ] Char limits: MEMORY.md 2,200 → 5,000, USER.md 1,375 → 5,000, project 2,200 → 5,000
279
297
  - [ ] Config: `sessionSearchEnabled: boolean` (default: true)
280
298
  - [ ] Config: `sessionRetentionDays: number` (default: 90)
281
- - [ ] Migration tool: markdown → sqlite one-time import
282
299
 
283
300
  ### What Does NOT Change
284
301
 
285
- - Content scanner (guards all backends)
286
- - Tool interface (`memory` tool name and actions)
302
+ - Content scanner (guards all writes)
303
+ - Memory tool interface (add/replace/remove actions)
287
304
  - System prompt injection (frozen snapshot pattern)
288
- - Config file location and format (just adds new fields)
305
+ - Skills system (unchanged)
306
+ - Background review, correction detection, auto-consolidation (unchanged)
289
307
 
290
308
  ---
291
309
 
@@ -387,7 +405,7 @@ gantt
387
405
  Project memory polish :v03d, after v03c, 2d
388
406
 
389
407
  section v0.4.0
390
- MemoryBackend interface + SQLite + session search:v04a, after v03d, 10d
408
+ SQLite FTS5 + session search + hybrid memory :v04a, after v03d, 10d
391
409
 
392
410
  section v0.5.0
393
411
  ExternalSync + Mem0 / Honcho :v05a, after v04a, 10d
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "pi-hermes-memory",
3
- "version": "0.3.3",
3
+ "version": "0.4.0",
4
4
  "description": "Your Pi agent remembers everything across sessions — your preferences, your stack, your corrections, and even how it solved problems. Zero-config install, works immediately. Persistent memory + procedural skills + auto-correction detection + security-first content scanning.",
5
5
  "type": "module",
6
6
  "main": "src/index.ts",
@@ -43,8 +43,12 @@
43
43
  "devDependencies": {
44
44
  "@mariozechner/pi-ai": "^0.70.0",
45
45
  "@mariozechner/pi-coding-agent": "^0.70.0",
46
+ "@types/better-sqlite3": "^7.6.13",
46
47
  "tsx": "^4.21.0",
47
48
  "typebox": "^1.1.33",
48
49
  "typescript": "^6.0.3"
50
+ },
51
+ "dependencies": {
52
+ "better-sqlite3": "^12.9.0"
49
53
  }
50
54
  }
package/src/constants.ts CHANGED
@@ -8,11 +8,11 @@
8
8
  export const ENTRY_DELIMITER = "\n§\n";
9
9
 
10
10
  // ─── Character limits (not tokens — model-independent) ───
11
- export const DEFAULT_MEMORY_CHAR_LIMIT = 2200;
12
- export const DEFAULT_USER_CHAR_LIMIT = 1375;
11
+ export const DEFAULT_MEMORY_CHAR_LIMIT = 5000;
12
+ export const DEFAULT_USER_CHAR_LIMIT = 5000;
13
13
 
14
14
  // ─── Learning loop defaults ───
15
- export const DEFAULT_PROJECT_CHAR_LIMIT = 2200;
15
+ export const DEFAULT_PROJECT_CHAR_LIMIT = 5000;
16
16
 
17
17
  export const DEFAULT_NUDGE_INTERVAL = 10;
18
18
  export const DEFAULT_FLUSH_MIN_TURNS = 6;
@@ -0,0 +1,61 @@
1
+ import path from 'node:path';
2
+ import os from 'node:os';
3
+ import { DatabaseManager } from '../store/db.js';
4
+ import { indexAllSessions, getSessionStats } from '../store/session-indexer.js';
5
+
6
+ const SESSIONS_DIR = path.join(os.homedir(), '.pi', 'agent', 'sessions');
7
+
8
+ export function registerIndexSessionsCommand(ctx: {
9
+ registerCommand: (name: string, handler: (args: string, ctx: unknown) => Promise<void>) => void;
10
+ sendUserMessage: (msg: string) => void;
11
+ }) {
12
+ ctx.registerCommand('memory-index-sessions', async (_args: string, cmdCtx: unknown) => {
13
+ const sendUserMessage = (cmdCtx as { sendUserMessage?: (msg: string) => void }).sendUserMessage
14
+ ?? ctx.sendUserMessage;
15
+
16
+ sendUserMessage('🔍 Indexing session history...');
17
+
18
+ try {
19
+ const memoryDir = path.join(os.homedir(), '.pi', 'agent', 'memory');
20
+ const dbManager = new DatabaseManager(memoryDir);
21
+
22
+ try {
23
+ const result = indexAllSessions(dbManager, SESSIONS_DIR);
24
+
25
+ const stats = getSessionStats(dbManager);
26
+
27
+ let output = `\n✅ Session indexing complete!\n\n`;
28
+ output += `📊 Results:\n`;
29
+ output += `• Sessions processed: ${result.sessionsProcessed}\n`;
30
+ output += `• Sessions indexed: ${result.sessionsIndexed}\n`;
31
+ output += `• Sessions skipped (already indexed): ${result.sessionsSkipped}\n`;
32
+ output += `• Messages indexed: ${result.messagesIndexed}\n`;
33
+
34
+ if (stats.projects.length > 0) {
35
+ output += `\n📁 Projects:\n`;
36
+ for (const p of stats.projects) {
37
+ output += `• ${p.project}: ${p.sessions} sessions, ${p.messages} messages\n`;
38
+ }
39
+ }
40
+
41
+ if (result.errors.length > 0) {
42
+ output += `\n⚠️ Errors (${result.errors.length}):\n`;
43
+ for (const err of result.errors.slice(0, 5)) {
44
+ output += `• ${err}\n`;
45
+ }
46
+ if (result.errors.length > 5) {
47
+ output += `• ... and ${result.errors.length - 5} more\n`;
48
+ }
49
+ }
50
+
51
+ output += `\n💡 Use the \`session_search\` tool to search across indexed sessions.`;
52
+
53
+ sendUserMessage(output);
54
+ } finally {
55
+ dbManager.close();
56
+ }
57
+ } catch (err) {
58
+ sendUserMessage(`❌ Session indexing failed: ${err instanceof Error ? err.message : String(err)}`);
59
+ }
60
+ });
61
+ }
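
The handler above reads five fields from the value returned by `indexAllSessions`, but `src/store/session-indexer.ts` itself is not part of this excerpt. The sketch below infers a result shape from those reads and shows the skip-if-already-indexed counting in memory. `IndexResult`, `SessionInput`, and `indexSessions` are hypothetical names, and the `Set` stands in for a lookup against the `sessions` table.

```typescript
// Shape inferred from the fields the handler reads; the real type may differ.
interface IndexResult {
  sessionsProcessed: number;
  sessionsIndexed: number;
  sessionsSkipped: number;
  messagesIndexed: number;
  errors: string[];
}

// Hypothetical stand-in for a parsed session.
interface SessionInput {
  id: string;
  messageCount: number;
}

// Count sessions and messages, skipping any session whose ID is already in
// `indexed`. Newly seen IDs are added to the set, so a second run over the
// same input skips everything.
function indexSessions(sessions: SessionInput[], indexed: Set<string>): IndexResult {
  const result: IndexResult = {
    sessionsProcessed: 0,
    sessionsIndexed: 0,
    sessionsSkipped: 0,
    messagesIndexed: 0,
    errors: [],
  };
  for (const session of sessions) {
    result.sessionsProcessed++;
    if (indexed.has(session.id)) {
      result.sessionsSkipped++;
      continue;
    }
    indexed.add(session.id);
    result.sessionsIndexed++;
    result.messagesIndexed += session.messageCount;
  }
  return result;
}
```

Running the function twice over the same sessions mirrors what a repeated `/memory-index-sessions` call would be expected to report under this model: the first run indexes, the second run only skips.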