npm - recallmem - Versions diffs - 0.1.3 → 0.1.5 - Mend

recallmem 0.1.3 → 0.1.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

package/README.md CHANGED Viewed

@@ -6,11 +6,15 @@
 </p>
 <p align="center">
-  <strong>Persistent personal AI that actually remembers you.</strong>
+  <strong>Your Persistent Private AI that actually remembers you.</strong>
 </p>
 <p align="center">
-  LLMs like ChatGPT, Claude.ai, and Gemini tend to forget you the moment you end your session. RecallMEM doesn't. It builds a profile of who you are, extracts facts after every conversation, and runs vector search across your entire history to find relevant context. By the time you've used it for a week, it knows you better than any AI ever will.
+  <code>npx recallmem</code>
+</p>
+<p align="center">
+  Chatbots like ChatGPT, Claude, and Gemini tend to forget you the moment you end your session. RecallMEM doesn't. It builds a profile of who you are, extracts facts after every conversation, and runs vector search across your entire history to find relevant context. By the time you've used it for a week, it knows you better than any AI ever will.
 </p>
 <p align="center">
@@ -29,14 +33,14 @@
 ## What is this
-A personal AI chatbot with REAL memory. Plug in any LLM you want and RecallMEM gives it persistent memory of who you are, what you've talked about, and what's currently true vs historical.
+A personal AI chatbot with REAL memory. Plug in any LLM you want and RecallMEM gives it persistent memory of who you are, what you've talked about, and what's currently true vs historical. All your memory is stored in a local Postgres database on your machine, with pgvector powering the semantic search across your past conversations.
-The best part is that the LLM will never touch your memory in the database. Every retrieval is deterministic SQL + cosine similarity, assembled by TypeScript before the LLM ever sees it. The LLM only proposes new facts; a TypeScript validator decides what gets stored. Facts have timestamps and get auto-retired when you contradict them ("works at Acme" → "left Acme"). [Deep dive on the architecture →](./docs/ARCHITECTURE.md)
+The best part is that **the LLM proposes, TypeScript decides.** Retrieval is deterministic SQL + cosine similarity, built by TypeScript before the model ever sees it. On the write side, an LLM proposes candidate facts and contradictions, but a 6-step TypeScript validator decides what actually gets stored. Facts have timestamps and get auto-retired when the truth changes ("works at Acme" → "left Acme"). [Deep dive on the architecture →](./docs/ARCHITECTURE.md)
 You can run it three ways:
 - **Cloud LLMs (recommended for most people).** Add a Claude or OpenAI API key in Settings. Fast, smart, works on any computer. Your memory still stays local in your own Postgres database. Only the chat messages go to the provider.
-- **Local LLMs (recommended for privacy).** Run Gemma 4 via Ollama. Nothing leaves your machine, ever. Slower setup (~18 GB model download) and slower responses, but truly air-gappable.
+- **Local LLMs (recommended for privacy).** Run Gemma 4 via Ollama. Nothing leaves your machine, ever. Slower setup (~7-20 GB model download) and slower responses, but truly air-gappable.
 - **Both.** Use cloud for daily chat, switch to local for the sensitive stuff. The model dropdown lets you pick per-conversation.
 ## Features
@@ -58,13 +62,13 @@ Two options. Pick whichever fits your priority.
 ### Option A: Cloud LLM (Claude or OpenAI) — fastest, ~5 minutes
-You need Node.js 20+ and [Homebrew](https://brew.sh). Then:
+You need Node.js 20+ and [Homebrew](https://brew.sh). The installer uses Homebrew to set up Postgres + pgvector (where your memory and vector search live) and Ollama (for local AI models). Then:
 ```bash
 npx recallmem
 ```
-The installer sets up Postgres, pgvector, and Ollama (for the embedding model that powers memory). When the browser opens to `localhost:3000`:
+The installer sets up Postgres, pgvector, and Ollama (for the embedding model that powers memory). When the browser opens to `localhost:1337`:
 1. Click **Settings** in the top right
 2. Click **Providers**
@@ -78,9 +82,10 @@ The installer sets up Postgres, pgvector, and Ollama (for the embedding model th
 Same `npx recallmem` command. When the app opens, click **Settings → Manage models** and download one of these:
-- **Gemma 4 E4B** (4 GB, ~5 minute download) — fastest to test
-- **Gemma 4 26B** (18 GB, ~20-30 minute download) — recommended for daily use
-- **Gemma 4 31B** (19 GB, slower, best quality)
+- **Gemma 4 E2B** (~7 GB, fastest download) — good for a quick test or older laptops
+- **Gemma 4 E4B** (~10 GB) — good for most laptops
+- **Gemma 4 26B** (~18 GB, ~20-30 minute download) — recommended for daily use
+- **Gemma 4 31B** (~20 GB, slower, best quality)
 Then pick that model from the dropdown and chat. Nothing leaves your machine.

package/bin/commands/start.js CHANGED Viewed

@@ -33,15 +33,15 @@ async function startCommand(opts = {}) {
   const command = hasBuild ? "start" : "dev";
   info(hasBuild ? "Production build detected, running next start" : "No build found, running next dev");
-  info("Opening http://localhost:3000 in your browser...");
+  info("Opening http://localhost:1337 in your browser...");
   console.log("");
   console.log(color.dim("  (Press Ctrl+C to stop)"));
   console.log("");
   // Open the browser shortly after starting (give Next a moment to be ready)
-  setTimeout(() => openBrowser("http://localhost:3000"), 2000);
+  setTimeout(() => openBrowser("http://localhost:1337"), 2000);
-  const child = spawn("npx", ["next", command], {
+  const child = spawn("npx", ["next", command, "-p", "1337"], {
     cwd: installPath,
     stdio: "inherit",
     env: process.env,

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "recallmem",
-  "version": "0.1.3",
+  "version": "0.1.5",
   "description": "Private, local-first AI chatbot with persistent working memory. One command install via npx.",
   "license": "Apache-2.0",
   "author": "Chris Sean",