@aman_asmuei/aman-agent 0.4.0 → 0.5.1

package/README.md CHANGED
@@ -8,7 +8,7 @@
  <h1 align="center">aman-agent</h1>

  <p align="center">
- <strong>Your AI companion, running locally.</strong>
+ <strong>The AI companion that actually remembers you.</strong>
  </p>

  <p align="center">
@@ -24,14 +24,14 @@
  </p>

  <p align="center">
- Loads the full aman ecosystem and runs a streaming AI agent in your terminal —<br/>
- identity, memory, tools, workflows, guardrails, and skills in every conversation.
+ An AI companion that learns from every conversation, recalls relevant memories per message,<br/>
+ extracts knowledge silently, and adapts to your time of day — all running locally.
  </p>

  <p align="center">
  <a href="#-quick-start">Quick Start</a> &bull;
- <a href="#-what-it-loads">What It Loads</a> &bull;
- <a href="#-whats-new-in-v040">What's New</a> &bull;
+ <a href="#-intelligent-companion-features">Features</a> &bull;
+ <a href="#-how-it-works">How It Works</a> &bull;
  <a href="#-commands">Commands</a> &bull;
  <a href="#-supported-llms">LLMs</a> &bull;
  <a href="#-the-ecosystem">Ecosystem</a>
@@ -43,17 +43,17 @@

  AI coding assistants forget everything between sessions. You re-explain your stack, preferences, and boundaries every time. There's no single place where your AI loads its full context and just *works*.

+ Other "memory" solutions are just markdown files the AI reads on startup — they don't *learn* from conversation, they don't *recall* per-message, and they silently lose context when the window fills up.
+
  ## The Solution

- **aman-agent** loads your entire AI ecosystem into a local streaming agent. One command. Full context. Every session.
+ **aman-agent** is an open-source AI companion that genuinely learns from conversation. It doesn't just store memories — it recalls them per-message, extracts new knowledge automatically, and uses your LLM to intelligently compress context instead of truncating it.

  ```bash
  npx @aman_asmuei/aman-agent
  ```

- First run walks you through LLM configuration. After that, just run and talk.
-
- > **Your AI knows who it is, what it remembers, what tools it has, and what rules to follow — before you say a word.**
+ > **Your AI knows who it is, what it remembers, what tools it has, what rules to follow, what time it is, and what reminders are due — before you say a word.**

  ---

@@ -85,42 +85,146 @@ aman-agent --budget 12000

  ---

- ## What's New in v0.4.0
+ ## Intelligent Companion Features

- | Feature | Before | After |
- |---|---|---|
- | **Streaming with tools** | Blocked — no output until LLM finishes | Real-time streaming, even during tool calls |
- | **Conversation persistence** | 200-char resume, full history lost | Full conversation saved to amem on exit |
- | **Context management** | Messages grow forever, eventual crash | Auto-trims at 80K tokens, keeps recent context |
- | **`/save` command** | N/A | Manually save conversation mid-session |
- | **Reminders/Schedules** | Broken — lost on exit, no daemon | Removed (replaced with `/save`) |
+ ### Per-Message Memory Recall

- ---
+ Every message you send triggers a semantic search against your memory database. Relevant memories are injected into the AI's context for *that turn only* — so the AI always has the right context without bloating the conversation.

- ## What It Loads
+ ```
+ You > Let's set up the auth service

- On every session start, aman-agent assembles your full AI context:
+ Agent recalls:
+ - [decision] Auth service uses JWT tokens (confidence: 0.92)
+ - [preference] User prefers PostgreSQL (confidence: 0.88)
+ - [fact] Auth middleware rewrite driven by compliance (confidence: 0.75)

- | Layer | Source | What it provides |
- |:---|:---|:---|
- | **Identity** | `~/.acore/core.md` | AI personality, your preferences, relationship state |
- | **Memory** | `~/.amem/memory.db` | Past decisions, corrections, patterns, conversation history |
- | **Tools** | `~/.akit/kit.md` | Available capabilities (GitHub, search, databases) |
- | **Workflows** | `~/.aflow/flow.md` | Multi-step processes (code review, bug fix) |
- | **Guardrails** | `~/.arules/rules.md` | Safety boundaries and permissions |
- | **Skills** | `~/.askill/skills.md` | Deep domain expertise |
+ Aman > Based on our previous decisions, I'll set up JWT-based auth
+ with PostgreSQL, keeping the compliance requirements in mind...
+ ```
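To make the recall step concrete, here is a minimal TypeScript sketch — illustrative only, not the package's actual code. The `Memory` shape, the cosine-similarity ranking, and the prompt format are all assumptions:

```typescript
// Hypothetical memory shape; the real amem schema may differ.
interface Memory {
  type: "preference" | "fact" | "pattern" | "topology" | "decision" | "correction";
  text: string;
  confidence: number;
  embedding: number[];
}

// Cosine similarity between two embedding vectors.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb) || 1);
}

// Rank all stored memories against the message embedding, keep the top k.
function recall(queryEmbedding: number[], store: Memory[], k = 5): Memory[] {
  return [...store]
    .sort((x, y) => cosine(queryEmbedding, y.embedding) - cosine(queryEmbedding, x.embedding))
    .slice(0, k);
}

// Inject recalled memories into the system prompt for this turn only.
function augmentSystemPrompt(base: string, memories: Memory[]): string {
  if (memories.length === 0) return base;
  const lines = memories.map((m) => `- [${m.type}] ${m.text} (confidence: ${m.confidence})`);
  return `${base}\n\n## Relevant memories\n${lines.join("\n")}`;
}
```

Because the memories are appended to the system prompt rather than the message history, they cost tokens only on the turn that needs them.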

- All layers are optional — the agent works with whatever you've set up.
+ ### Hybrid Memory Extraction

- ### Token Budgeting
+ After every response, the agent analyzes the conversation and extracts memories worth keeping. Preferences, facts, patterns, and topology are stored silently. Decisions and corrections require your confirmation.

- Layers are included by priority when space is limited:
+ ```
+ You > I think we should go with microservices for the payment system
+
+ Aman > That makes sense given the compliance isolation requirements...

+ Remember: "Payment system will use microservices architecture"? (y/N) y
+ [1 memory stored]
  ```
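The "hybrid" part is just a routing decision over the extracted candidates. A minimal sketch, with assumed names (`Candidate`, `routeCandidates`) rather than the real API:

```typescript
// Hypothetical shape of a memory candidate extracted from one turn.
interface Candidate {
  type: "preference" | "fact" | "pattern" | "topology" | "decision" | "correction";
  text: string;
}

// High-stakes types get a (y/N) prompt; everything else is stored silently.
const NEEDS_CONFIRMATION = new Set(["decision", "correction"]);

// Split candidates into silently-stored and confirm-first buckets.
function routeCandidates(candidates: Candidate[]): { auto: Candidate[]; confirm: Candidate[] } {
  const auto: Candidate[] = [];
  const confirm: Candidate[] = [];
  for (const c of candidates) {
    (NEEDS_CONFIRMATION.has(c.type) ? confirm : auto).push(c);
  }
  return { auto, confirm };
}
```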
- Identity (always) → Guardrails → Workflows → Tools → Skills (can truncate)
+
+ ### LLM-Powered Context Summarization
+
+ When the conversation gets long, the agent uses your LLM to generate real summaries — preserving decisions, preferences, and action items. No more losing critical context to 150-character truncation.
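The shape of this trim step can be sketched as follows. The token estimate, the 50% cut point, and the `summarize` callback are illustrative assumptions; the real agent delegates to the configured provider:

```typescript
interface Msg { role: "system" | "user" | "assistant"; content: string }

// Stand-in for an LLM call that summarizes while preserving decisions,
// preferences, and action items.
type Summarize = (messages: Msg[]) => Promise<string>;

// Rough 4-characters-per-token estimate, just for the sketch.
const approxTokens = (m: Msg[]) => m.reduce((n, x) => n + Math.ceil(x.content.length / 4), 0);

// Over budget: replace the oldest half with a summary message instead of
// dropping it. Falls back to a short text preview if the LLM call fails.
async function trim(messages: Msg[], summarize: Summarize, limit = 80_000): Promise<Msg[]> {
  if (approxTokens(messages) <= limit) return messages;
  const cut = Math.floor(messages.length / 2);
  const old = messages.slice(0, cut);
  let summary: string;
  try {
    summary = await summarize(old);
  } catch {
    summary = old.map((m) => m.content).join(" ").slice(0, 150); // fallback preview
  }
  return [{ role: "system", content: `Summary of earlier conversation: ${summary}` }, ...messages.slice(cut)];
}
```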
+
+ ### Parallel Tool Execution
+
+ When the AI needs multiple tools, they run in parallel via `Promise.all` instead of sequentially. Faster responses, same guardrail checks.
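A minimal sketch of that pattern, with assumed names (`runTools`, the guardrail predicate) rather than the package's real API — each call still passes the guardrail check before executing, and `Promise.all` preserves result order:

```typescript
interface ToolCall { name: string; args: unknown }
type ToolFn = (args: unknown) => Promise<string>;
type Guardrail = (call: ToolCall) => boolean;

// Run every requested tool concurrently; blocked or unknown tools resolve
// to an explanatory string instead of executing.
async function runTools(
  calls: ToolCall[],
  tools: Record<string, ToolFn>,
  allowed: Guardrail,
): Promise<string[]> {
  return Promise.all(
    calls.map(async (call) => {
      if (!allowed(call)) return `[blocked by guardrails: ${call.name}]`;
      const fn = tools[call.name];
      if (!fn) return `[unknown tool: ${call.name}]`;
      return fn(call.args);
    }),
  );
}
```

Note that `Promise.all` rejects on the first failure; wrapping each call so it resolves to an error string keeps one bad tool from cancelling the rest.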
+
+ ### Retry with Backoff
+
+ LLM calls and MCP tool calls automatically retry on transient errors (rate limits, timeouts) with exponential backoff and jitter. Auth errors fail immediately.
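The standard shape of this is exponential backoff with full jitter. A hedged sketch — the error classification here is a simple message match, which is an assumption; real providers expose structured error codes:

```typescript
// Errors this sketch treats as retryable.
const RETRYABLE = /rate limit|timeout|overloaded/i;

const sleep = (ms: number) => new Promise<void>((resolve) => setTimeout(resolve, ms));

// Exponential backoff with full jitter: delay ~ random(0, base * 2^attempt).
// Non-retryable errors (e.g. an invalid API key) are re-thrown immediately.
async function withRetry<T>(fn: () => Promise<T>, attempts = 4, baseMs = 250): Promise<T> {
  let lastError: unknown;
  for (let attempt = 0; attempt < attempts; attempt++) {
    try {
      return await fn();
    } catch (err) {
      const message = err instanceof Error ? err.message : String(err);
      if (!RETRYABLE.test(message)) throw err; // fail fast on auth errors
      lastError = err;
      await sleep(Math.random() * baseMs * 2 ** attempt);
    }
  }
  throw lastError;
}
```

Full jitter spreads retries out so many clients hitting the same rate limit don't all come back at the same instant.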
+
+ ### Time-Aware Greetings
+
+ The agent knows the time of day and the day of the week. It adapts its tone naturally — you'll notice the difference between a morning and a late-night session.
+
+ ### Reminders
+
  ```
+ You > Remind me to review PR #42 by Thursday

- Default budget: 8,000 tokens. Override with `--budget`.
+ Aman > I'll set that reminder for you.
+ [Reminder set: "Review PR #42" — due 2026-03-27]
+ ```
+
+ Next session:
+ ```
+ [OVERDUE] Review PR #42 (was due 2026-03-27)
+ ```
+
+ Reminders persist in SQLite across sessions. Set them, forget them, get nudged.
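The startup banner's overdue/today/upcoming split reduces to a small date comparison. A sketch with an assumed row shape (the actual SQLite schema is not shown here):

```typescript
// A reminder row as it might come out of the store; `due` is an ISO date
// such as "2026-03-27". Shape assumed for illustration.
interface Reminder { text: string; due: string }

type Status = "overdue" | "today" | "upcoming";

// Classify a reminder relative to "now" for the startup banner.
function classify(reminder: Reminder, now: Date): Status {
  const endOfDue = new Date(`${reminder.due}T23:59:59`);
  const startOfDue = new Date(`${reminder.due}T00:00:00`);
  if (endOfDue < now) return "overdue";       // past the end of the due day
  if (startOfDue <= now) return "today";      // inside the due day
  return "upcoming";
}
```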
+
+ ### Memory Consolidation
+
+ On every startup, the agent automatically merges duplicate memories, prunes stale low-confidence ones, and promotes frequently-accessed entries.
+
+ ```
+ Memory health: 94% (merged 2 duplicates, pruned 1 stale)
+ ```
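One consolidation pass can be sketched like this. The thresholds, the exact-text duplicate match, and the confidence boost are all illustrative assumptions (the real pass presumably uses embedding similarity rather than exact text):

```typescript
interface Mem { text: string; confidence: number; accessCount: number; ageDays: number }

// Prune stale low-confidence entries, merge duplicates, promote hot entries.
function consolidate(memories: Mem[]): Mem[] {
  // 1. Prune: old AND low-confidence.
  const kept = memories.filter((m) => !(m.confidence < 0.3 && m.ageDays > 90));
  // 2. Merge duplicates (exact text match here; similarity in practice).
  const byText = new Map<string, Mem>();
  for (const m of kept) {
    const prev = byText.get(m.text);
    if (prev) {
      prev.accessCount += m.accessCount;
      prev.confidence = Math.max(prev.confidence, m.confidence);
    } else {
      byText.set(m.text, { ...m });
    }
  }
  // 3. Promote: frequently-accessed memories gain confidence.
  for (const m of byText.values()) {
    if (m.accessCount >= 5) m.confidence = Math.min(1, m.confidence + 0.1);
  }
  return [...byText.values()];
}
```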
+
+ ### Structured Debug Logging
+
+ Every operation that can fail logs to `~/.aman-agent/debug.log` with structured JSON. No more silent failures — use `/debug` to see what's happening under the hood.
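"Structured JSON" here most likely means one JSON object per line (JSON Lines), which is trivial to tail and grep. A sketch with assumed field names:

```typescript
import { appendFileSync, mkdirSync } from "node:fs";
import { homedir } from "node:os";
import { join } from "node:path";

// Build one JSON Lines entry; field names are illustrative.
function logEntry(level: "info" | "error", op: string, detail: Record<string, unknown>): string {
  return JSON.stringify({ ts: new Date().toISOString(), level, op, ...detail });
}

// Append an entry to ~/.aman-agent/debug.log, creating the directory if needed.
function writeDebugLog(line: string, dir = join(homedir(), ".aman-agent")): void {
  mkdirSync(dir, { recursive: true });
  appendFileSync(join(dir, "debug.log"), line + "\n");
}
```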
+
+ ---
+
+ ## How It Works
+
+ ```
+ ┌───────────────────────────────────────────────────────────┐
+ │ Your Terminal                                             │
+ │                                                           │
+ │ You > tell me about our auth decisions                    │
+ │                                                           │
+ │ [recalling memories...]                                   │
+ │ Agent > Based on your previous decisions:                 │
+ │ - OAuth2 with PKCE (decided 2 weeks ago)                  │
+ │ - JWT for API tokens...                                   │
+ │                                                           │
+ │ [1 memory stored]                                         │
+ └──────────────────────┬────────────────────────────────────┘
+
+ ┌──────────────────────▼────────────────────────────────────┐
+ │ aman-agent runtime                                        │
+ │                                                           │
+ │ On Startup                                                │
+ │ ┌────────────────────────────────────────────────┐        │
+ │ │ 1. Load ecosystem (identity, tools, rules...)  │        │
+ │ │ 2. Connect MCP servers (aman-mcp + amem)       │        │
+ │ │ 3. Consolidate memory (merge/prune/promote)    │        │
+ │ │ 4. Check reminders (overdue/today/upcoming)    │        │
+ │ │ 5. Inject time context (morning/evening/...)   │        │
+ │ │ 6. Recall session context from memory          │        │
+ │ └────────────────────────────────────────────────┘        │
+ │                                                           │
+ │ Per Message                                               │
+ │ ┌────────────────────────────────────────────────┐        │
+ │ │ 1. Semantic memory recall (top 5 relevant)     │        │
+ │ │ 2. Augment system prompt with memories         │        │
+ │ │ 3. Stream LLM response (with retry)            │        │
+ │ │ 4. Execute tools in parallel (with guardrails) │        │
+ │ │ 5. Extract memories from response              │        │
+ │ │    - Auto-store: preferences, facts, patterns  │        │
+ │ │    - Confirm: decisions, corrections           │        │
+ │ └────────────────────────────────────────────────┘        │
+ │                                                           │
+ │ Context Management                                        │
+ │ ┌────────────────────────────────────────────────┐        │
+ │ │ Auto-trim at 80K tokens                        │        │
+ │ │ LLM-powered summarization (not truncation)     │        │
+ │ │ Fallback to text preview if LLM call fails     │        │
+ │ └────────────────────────────────────────────────┘        │
+ │                                                           │
+ │ MCP Integration                                           │
+ │ ┌────────────────────────────────────────────────┐        │
+ │ │ aman-mcp → identity, tools, workflows, eval    │        │
+ │ │ amem → memory, knowledge graph, reminders      │        │
+ │ └────────────────────────────────────────────────┘        │
+ └───────────────────────────────────────────────────────────┘
+ ```
+
+ ### Session Lifecycle
+
+ | Phase | What happens |
+ |:---|:---|
+ | **Start** | Load ecosystem, connect MCP, consolidate memory, check reminders, inject time context |
+ | **Each turn** | Recall relevant memories, stream response, execute tools in parallel, extract new memories |
+ | **Auto-trim** | LLM-powered summarization when approaching 80K tokens |
+ | **Exit** | Save conversation to amem, update session resume, optional session rating |

  ---

@@ -136,6 +240,9 @@ Default budget: 8,000 tokens. Override with `--budget`.
  | `/skills` | View skills `[install\|uninstall ...]` |
  | `/eval` | View evaluation `[milestone ...]` |
  | `/memory` | View memories `[search\|clear ...]` |
+ | `/decisions` | View decision log `[<project>]` |
+ | `/export` | Export conversation to markdown |
+ | `/debug` | Show debug log (last 20 entries) |
  | `/status` | Ecosystem dashboard |
  | `/doctor` | Health check all layers |
  | `/save` | Save conversation to memory |
@@ -147,66 +254,42 @@ Default budget: 8,000 tokens. Override with `--budget`.

  ---

- ## Supported LLMs
+ ## What It Loads

- | Provider | Models | Tool Use | Streaming |
- |:---|:---|:---|:---|
- | **Anthropic** | Claude Sonnet 4.5, Opus 4.6, Haiku 4.5 | Full | Full (with tools) |
- | **OpenAI** | GPT-4o, GPT-4o Mini, o3 | Full | Full (with tools) |
- | **Ollama** | Llama, Mistral, Gemma, any local model | Text only | Full |
+ On every session start, aman-agent assembles your full AI context:

- ---
+ | Layer | Source | What it provides |
+ |:---|:---|:---|
+ | **Identity** | `~/.acore/core.md` | AI personality, your preferences, relationship state |
+ | **Memory** | `~/.amem/memory.db` | Past decisions, corrections, patterns, conversation history |
+ | **Reminders** | `~/.amem/memory.db` | Overdue, today, and upcoming reminders |
+ | **Tools** | `~/.akit/kit.md` | Available capabilities (GitHub, search, databases) |
+ | **Workflows** | `~/.aflow/flow.md` | Multi-step processes (code review, bug fix) |
+ | **Guardrails** | `~/.arules/rules.md` | Safety boundaries and permissions |
+ | **Skills** | `~/.askill/skills.md` | Deep domain expertise |
+ | **Time** | System clock | Time of day and day of week for tone adaptation |

- ## How It Works
+ All layers are optional — the agent works with whatever you've set up.
+
+ ### Token Budgeting
+
+ Layers are included by priority when space is limited:

  ```
- ┌──────────────────────────────────────────────┐
- │ Your Terminal                                │
- │                                              │
- │ You > tell me about our auth decisions       │
- │                                              │
- │ Agent > [using memory_recall...]             │
- │   Based on your previous decisions:          │
- │   - OAuth2 with PKCE (decided 2 weeks ago)   │
- │   - JWT for API tokens...                    │
- └─────────────────┬────────────────────────────┘
-
- ┌─────────────────▼────────────────────────────┐
- │ aman-agent runtime                           │
- │                                              │
- │ System Prompt Assembly                       │
- │ ┌─────────────────────────────────────┐      │
- │ │ Identity + Memory + Tools +         │      │
- │ │ Workflows + Guardrails + Skills     │      │
- │ │ (priority-based token budgeting)    │      │
- │ └─────────────────────────────────────┘      │
- │                                              │
- │ Streaming LLM Client                         │
- │ ┌─────────────────────────────────────┐      │
- │ │ Anthropic / OpenAI / Ollama         │      │
- │ │ Always streaming, even with tools   │      │
- │ └─────────────────────────────────────┘      │
- │                                              │
- │ Context Manager                              │
- │ ┌─────────────────────────────────────┐      │
- │ │ Auto-trim at 80K tokens             │      │
- │ │ Keep initial context + recent msgs  │      │
- │ └─────────────────────────────────────┘      │
- │                                              │
- │ MCP Integration                              │
- │ ┌─────────────────────────────────────┐      │
- │ │ aman-mcp → identity, tools, eval    │      │
- │ │ amem → memory, knowledge            │      │
- │ └─────────────────────────────────────┘      │
- └──────────────────────────────────────────────┘
+ Identity (always) → Guardrails → Workflows → Tools → Skills (can truncate)
  ```

- ### Session Lifecycle
+ Default budget: 8,000 tokens. Override with `--budget`.
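Priority-ordered budgeting amounts to a greedy fill. A sketch under stated assumptions: the `Layer` shape, the 4-chars-per-token estimate, and the truncation rule are illustrative, not the package's actual implementation:

```typescript
// Lower priority number = more important (Identity is 0 and always included).
interface Layer { name: string; content: string; priority: number }

// Rough 4-characters-per-token estimate, just for the sketch.
const tokens = (s: string) => Math.ceil(s.length / 4);

// Greedily include layers in priority order until the budget is exhausted;
// the first layer that doesn't fit is truncated to the remaining space.
function assembleContext(layers: Layer[], budget = 8000): string[] {
  const ordered = [...layers].sort((a, b) => a.priority - b.priority);
  const included: string[] = [];
  let used = 0;
  for (const layer of ordered) {
    const cost = tokens(layer.content);
    if (layer.name === "identity" || used + cost <= budget) {
      included.push(layer.content);
      used += cost;
    } else if (budget - used > 0) {
      included.push(layer.content.slice(0, (budget - used) * 4)); // truncate tail layer
      used = budget;
    }
  }
  return included;
}
```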

- 1. **Start** — Load ecosystem, connect MCP servers, recall memory context
- 2. **Chat** — Stream responses, execute tools with guardrail checks, match workflows
- 3. **Auto-trim** — Compress old messages when approaching token limits
- 4. **Exit** — Save conversation to amem, update session resume, rate session
+ ---
+
+ ## Supported LLMs
+
+ | Provider | Models | Tool Use | Streaming |
+ |:---|:---|:---|:---|
+ | **Anthropic** | Claude Sonnet 4.5, Opus 4.6, Haiku 4.5 | Full | Full (with tools) |
+ | **OpenAI** | GPT-4o, GPT-4o Mini, o3 | Full | Full (with tools) |
+ | **Ollama** | Llama, Mistral, Gemma, any local model | Text only | Full |

  ---

@@ -218,7 +301,16 @@ Config is stored in `~/.aman-agent/config.json`:
  {
    "provider": "anthropic",
    "apiKey": "sk-ant-...",
-   "model": "claude-sonnet-4-5-20250514"
+   "model": "claude-sonnet-4-5-20250514",
+   "hooks": {
+     "memoryRecall": true,
+     "sessionResume": true,
+     "rulesCheck": true,
+     "workflowSuggest": true,
+     "evalPrompt": true,
+     "autoSessionSave": true,
+     "extractMemories": true
+   }
  }
  ```

@@ -227,6 +319,20 @@ Config is stored in `~/.aman-agent/config.json`:
  | Model override | `--model <id>` | From config |
  | Token budget | `--budget <n>` | 8000 |

+ ### Hook Toggles
+
+ All hooks are on by default. Disable any in `config.json`:
+
+ | Hook | What it controls |
+ |:---|:---|
+ | `memoryRecall` | Load memory context on session start |
+ | `sessionResume` | Resume from last session state |
+ | `rulesCheck` | Pre-tool guardrail enforcement |
+ | `workflowSuggest` | Auto-detect matching workflows |
+ | `evalPrompt` | Session rating on exit |
+ | `autoSessionSave` | Save conversation to amem on exit |
+ | `extractMemories` | Auto-extract memories from conversation |
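The "on by default" behavior is a defaults-merge over the user's partial config. A sketch (the `Hooks` type mirrors the table above; the function name is illustrative):

```typescript
// All seven hooks, each a boolean toggle.
interface Hooks {
  memoryRecall: boolean;
  sessionResume: boolean;
  rulesCheck: boolean;
  workflowSuggest: boolean;
  evalPrompt: boolean;
  autoSessionSave: boolean;
  extractMemories: boolean;
}

const HOOK_DEFAULTS: Hooks = {
  memoryRecall: true,
  sessionResume: true,
  rulesCheck: true,
  workflowSuggest: true,
  evalPrompt: true,
  autoSessionSave: true,
  extractMemories: true,
};

// Merge the (possibly partial or absent) config over the defaults.
function resolveHooks(config: Partial<Hooks> = {}): Hooks {
  return { ...HOOK_DEFAULTS, ...config };
}
```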
+
  > Treat the config file like a credential — it contains your API key.

  ---
@@ -252,7 +358,7 @@ aman
  | Layer | Package | What it does |
  |:---|:---|:---|
  | Identity | [acore](https://github.com/amanasmuei/acore) | Personality, values, relationship memory |
- | Memory | [amem](https://github.com/amanasmuei/amem) | Persistent memory with knowledge graph (MCP) |
+ | Memory | [amem](https://github.com/amanasmuei/amem) | Persistent memory with knowledge graph + reminders (MCP) |
  | Tools | [akit](https://github.com/amanasmuei/akit) | Portable AI tools (MCP + manual fallback) |
  | Workflows | [aflow](https://github.com/amanasmuei/aflow) | Reusable AI workflows |
  | Guardrails | [arules](https://github.com/amanasmuei/arules) | Safety boundaries and permissions |
@@ -265,13 +371,51 @@ aman

  ---

+ ## What Makes This Different
+
+ ### aman-agent vs other companion runtimes
+
+ | Feature | aman-agent | Letta / MemGPT | Raw LLM CLI |
+ |:---|:---|:---|:---|
+ | Identity system | 7 portable layers | None | None |
+ | Memory | amem (SQLite + embeddings + graph) | Postgres + embeddings | None |
+ | Per-message recall | Progressive disclosure (~10x token savings) | Yes | No |
+ | Learns from conversation | Auto-extract (hybrid confirm) | Requires configuration | No |
+ | Guardrail enforcement | Runtime tool blocking | None | None |
+ | Reminders | Persistent, deadline-aware | None | None |
+ | Context compression | LLM-powered summarization | Archival system | Truncation |
+ | Tool observation capture | Passive logging of all tool calls | None | None |
+ | Token cost visibility | Shows memory injection cost per turn | None | None |
+ | Multi-LLM | Anthropic, OpenAI, Ollama | OpenAI-focused | Single provider |
+ | Tool execution | Parallel with guardrails | Sequential | None |
+
+ ### amem vs other memory layers
+
+ | Feature | amem | claude-mem (40K stars) | mem0 |
+ |:---|:---|:---|:---|
+ | Works with | Any MCP client | Claude Code only | OpenAI-focused |
+ | Storage | SQLite + local embeddings | SQLite + Chroma vectors | Cloud vector DB |
+ | Progressive disclosure | Compact index + on-demand detail | Yes (10x savings) | No |
+ | Memory types | 6 typed (correction > decision > fact) | Untyped observations | Untyped blobs |
+ | Knowledge graph | Typed relations between memories | None | None |
+ | Reminders | Persistent, deadline-aware | None | None |
+ | Scoring | relevance × recency × confidence × importance | Recency-based | Similarity only |
+ | Consolidation | Auto merge/prune/promote | None | None |
+ | Version history | Immutable snapshots | Immutable observations | None |
+ | Token cost visibility | Shown per recall | Shown per injection | None |
+ | License | MIT | AGPL-3.0 | Apache-2.0 |
+
+ > **claude-mem** excels at capturing what Claude Code *did*. **amem** is a structured memory system that works with *any* MCP client, with typed memories, a knowledge graph, reminders, progressive disclosure, and consolidation.
+
+ ---
+
  ## Contributing

  ```bash
  git clone https://github.com/amanasmuei/aman-agent.git
  cd aman-agent && npm install
  npm run build # zero errors
- npm test # 61 tests pass
+ npm test # 84 tests pass
  ```

  PRs welcome. See [Issues](https://github.com/amanasmuei/aman-agent/issues).