recallmem 0.1.5 → 0.2.1
- package/README.md +24 -7
- package/package.json +1 -1
package/README.md
CHANGED
@@ -22,7 +22,7 @@
 </p>
 
 <p align="center">
-<img src="./public/screenshots/demo.png" alt="RecallMEM
+<img src="./public/screenshots/demo.png?v=2" alt="RecallMEM showing multiple brains, model picker, and chat UI" width="900">
 </p>
 
 <p align="center">
@@ -43,18 +43,35 @@ You can run it three ways:
 - **Local LLMs (recommended for privacy).** Run Gemma 4 via Ollama. Nothing leaves your machine, ever. Slower setup (~7-20 GB model download) and slower responses, but truly air-gappable.
 - **Both.** Use cloud for daily chat, switch to local for the sensitive stuff. The model dropdown lets you pick per-conversation.
 
+## How it compares
+
+| Feature | **RecallMEM** | **ChatGPT / Claude.ai** | **Mem0** |
+|---|---|---|---|
+| **Deterministic memory** (no LLM on read, TS-gated writes, auto-retires stale facts) | ✅ Full | ❌ | ⚠️ Partial |
+| **Multiple brains** (isolated memory namespaces per agent/project/user) | ✅ | ❌ | ❌ |
+| **Runs locally** (own LLM, local models, no signup) | ✅ | ❌ | ❌ |
+| **LLM agnostic** (Ollama, Anthropic, OpenAI, xAI, any OpenAI-compatible) | ✅ | ❌ | ⚠️ Partial |
+| **Temporal + editable** (knows when facts were true, edit/delete, vector search) | ✅ | ⚠️ Partial | ⚠️ Partial |
+| **Voice + vision** (STT/TTS, PDF image understanding) | ✅ | ⚠️ Partial | ❌ |
+| **Open & free** (Apache 2.0, usage tracking, no account) | ✅ | ❌ | ⚠️ Partial |
+
 ## Features
 
 - **Three-layer memory** across every chat: synthesized profile, extracted facts table, and vector search over all past conversations
+- **Smart fact selection** using vector search on facts themselves, not just recent ones. Pinned identity facts + semantically relevant facts + recent facts.
 - **Temporal awareness** so the model knows what's current vs. historical. Auto-retires stale facts when the truth changes.
 - **Live fact extraction** after every assistant reply, not just when the chat ends
+- **Multiple brains** for isolated memory namespaces (work, personal, demo, etc). Each brain has its own chats, facts, and profile. Stored in Postgres, not localStorage.
 - **Memory inspector** where you can view, edit, or delete every fact
-- **Vector search** across past conversations with dated recall
+- **Vector search** across past conversations and facts with dated recall
+- **Voice input (STT)** via Deepgram Nova-3 or local Whisper. Idle mic timeout after 60s of silence.
+- **Text-to-speech (TTS)** via xAI Grok, OpenAI HD, Deepgram Aura-2, or free browser voice. Chunked playback for instant start on long responses.
 - **Custom rules** for how you want the AI to talk to you
-- **File uploads** (images, PDFs, code).
+- **File uploads** (images, PDFs, code). PDFs are rendered page-by-page as images so the LLM sees charts and diagrams, not just extracted text.
 - **Web search** when using Anthropic or Ollama (via Brave Search)
+- **Usage tracking** with estimated costs for chat, TTS, and STT across all providers
 - **Wipe memory unrecoverably** with `DELETE` + `VACUUM FULL` + `CHECKPOINT`
-- **Bring any LLM.** Ollama, Anthropic, OpenAI, or any OpenAI-compatible API.
+- **Bring any LLM.** Ollama, Anthropic, OpenAI, xAI (Grok), or any OpenAI-compatible API.
 
 ## Quick start (Mac)
 
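The "Smart fact selection" bullet added in the hunk above describes combining pinned identity facts, semantically relevant facts, and recent facts. A minimal TypeScript sketch of that kind of merge follows. The `Fact` shape, the `selectFacts` name, and the precomputed `similarity` score are all assumptions made for illustration; the diff does not show how recallmem implements this.

```typescript
// Hypothetical fact shape; the real recallmem schema is not shown in this diff.
interface Fact {
  id: string;
  text: string;
  pinned: boolean;     // identity facts that should always be included
  createdAt: number;   // epoch milliseconds
  similarity: number;  // assumed: precomputed vector similarity to the current query
}

// Merge pinned facts, the top-k most similar facts, and the top-n most recent
// facts into one list, keeping the first occurrence of each fact id.
function selectFacts(facts: Fact[], topRelevant: number, topRecent: number): Fact[] {
  const pinned = facts.filter((f) => f.pinned);
  const relevant = [...facts]
    .sort((a, b) => b.similarity - a.similarity)
    .slice(0, topRelevant);
  const recent = [...facts]
    .sort((a, b) => b.createdAt - a.createdAt)
    .slice(0, topRecent);

  const seen = new Set<string>();
  const selected: Fact[] = [];
  for (const fact of [...pinned, ...relevant, ...recent]) {
    if (!seen.has(fact.id)) {
      seen.add(fact.id);
      selected.push(fact);
    }
  }
  return selected;
}
```

Ordering the merge as pinned, then relevant, then recent means an old but on-topic fact still makes the cut, which is the behavior the bullet contrasts with recency-only selection.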
@@ -164,9 +181,9 @@ Wire in your own auth with two calls at startup and every lib function respects
 | [Troubleshooting](./docs/TROUBLESHOOTING.md) | Every gotcha I've hit and how to fix it |
 | [Manual install](./docs/MANUAL_INSTALL.md) | Step-by-step if you don't want to use the CLI |
 
-## Limitations (v0.
+## Limitations (v0.2)
 
-
+No multi-user. No mobile app. Reasoning models (o1/o3, extended thinking) may have edge cases. Fact supersession is LLM-judged and intentionally conservative. See the [full limitations list](./docs/LIMITATIONS.md).
 
 ## Contributing
 
@@ -178,7 +195,7 @@ Apache 2.0. See [LICENSE](./LICENSE) and [NOTICE](./NOTICE). Use it, modify it,
 
 ## Status
 
-v0.
+v0.2.0. It works. I use it every day.
 
 I built RecallMEM because I wanted an AI that actually knows me. Not because I'm paranoid about privacy (though that's a nice bonus). The chat models you use today forget you the second you close the tab and that drives me crazy. So I fixed it.
 