npm - memory-lancedb-pro - Versions diffs - 1.1.0-beta.6 → 1.1.0-beta.8 - Mend

memory-lancedb-pro 1.1.0-beta.6 → 1.1.0-beta.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (56) hide show

package/.github/workflows/auto-assign.yml +33 -0
package/.github/workflows/ci.yml +16 -0
package/.github/workflows/claude-code-review.yml +44 -0
package/.github/workflows/claude.yml +50 -0
package/CHANGELOG-v1.1.0.md +227 -0
package/CHANGELOG.md +13 -27
package/README.md +600 -711
package/README_CN.md +656 -683
package/cli.ts +139 -14
package/docs/CHANGELOG-v1.1.0.md +306 -0
package/docs/memory_architecture_analysis.md +832 -0
package/docs/openclaw-integration-playbook.md +334 -0
package/docs/openclaw-integration-playbook.zh-CN.md +353 -0
package/index.ts +1023 -581
package/openclaw.plugin.json +176 -177
package/package.json +5 -3
package/scripts/sync-plugin-version.mjs +100 -0
package/src/access-tracker.ts +13 -3
package/src/decay-engine.ts +228 -0
package/src/extraction-prompts.ts +213 -0
package/src/llm-client.ts +124 -0
package/src/memory-categories.ts +70 -0
package/src/memory-upgrader.ts +387 -0
package/src/noise-filter.ts +18 -0
package/src/noise-prototypes.ts +163 -0
package/src/reflection-metadata.ts +2 -1
package/src/reflection-slices.ts +4 -7
package/src/reflection-store.ts +177 -151
package/src/retriever.ts +180 -43
package/src/smart-extractor.ts +869 -0
package/src/smart-metadata.ts +415 -0
package/src/store.ts +298 -96
package/src/tier-manager.ts +188 -0
package/src/tools.ts +115 -27
package/test/cli-smoke.mjs +158 -15
package/test/config-session-strategy-migration.test.mjs +2 -2
package/test/context-support-e2e.mjs +266 -0
package/test/functional-e2e.mjs +323 -0
package/test/memory-reflection.test.mjs +119 -1168
package/test/openclaw-host-functional.mjs +317 -0
package/test/plugin-manifest-regression.mjs +263 -0
package/test/recall-text-cleanup.test.mjs +240 -0
package/test/retriever-rerank-regression.mjs +278 -0
package/test/smart-extractor-branches.mjs +955 -0
package/test/smart-memory-lifecycle.mjs +219 -0
package/test/smart-metadata-v2.mjs +121 -0
package/test/sync-plugin-version.test.mjs +46 -0
package/test/update-consistency-lancedb.test.mjs +201 -0
package/test/vector-search-cosine.test.mjs +89 -0
package/memory-lancedb-pro-1.1.0-beta.6.tgz +0 -0
package/src/recall-engine.ts +0 -233
package/src/reflection-aggregation.ts +0 -165
package/src/reflection-normalize.ts +0 -41
package/src/reflection-recall.ts +0 -170
package/src/reflection-selection.ts +0 -108
package/test/helpers/openclaw-extension-api-stub.mjs +0 -39

package/.github/workflows/auto-assign.yml ADDED Viewed

@@ -0,0 +1,33 @@
+name: Auto Assign
+on:
+  issues:
+    types: [opened]
+  pull_request:
+    types: [opened]
+jobs:
+  assign-issues:
+    if: github.event_name == 'issues'
+    runs-on: ubuntu-latest
+    permissions:
+      issues: write
+    steps:
+      - name: 'Auto-assign issue to AliceLJY'
+        uses: pozil/auto-assign-issue@v1
+        with:
+          repo-token: ${{ secrets.GITHUB_TOKEN }}
+          assignees: AliceLJY
+          numOfAssignee: 1
+  assign-prs:
+    if: github.event_name == 'pull_request'
+    runs-on: ubuntu-latest
+    permissions:
+      pull-requests: write
+    steps:
+      - name: 'Auto-assign PR to rwmjhb'
+        uses: pozil/auto-assign-issue@v1
+        with:
+          repo-token: ${{ secrets.GITHUB_TOKEN }}
+          assignees: rwmjhb
+          numOfAssignee: 1

package/.github/workflows/ci.yml CHANGED Viewed

@@ -5,6 +5,22 @@ on:
   pull_request:
 jobs:
+  version-sync:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+      - name: Check version consistency
+        run: |
+          pkg_ver=$(node -p "JSON.parse(require('fs').readFileSync('package.json','utf8')).version")
+          plugin_ver=$(node -p "JSON.parse(require('fs').readFileSync('openclaw.plugin.json','utf8')).version")
+          if [ "$pkg_ver" != "$plugin_ver" ]; then
+            echo "::error::Version mismatch: package.json=$pkg_ver, openclaw.plugin.json=$plugin_ver"
+            exit 1
+          fi
+          echo "Versions match: $pkg_ver"
   cli-smoke:
     runs-on: ubuntu-latest
     steps:

package/.github/workflows/claude-code-review.yml ADDED Viewed

@@ -0,0 +1,44 @@
+name: Claude Code Review
+on:
+  pull_request:
+    types: [opened, synchronize, ready_for_review, reopened]
+    # Optional: Only run on specific file changes
+    # paths:
+    #   - "src/**/*.ts"
+    #   - "src/**/*.tsx"
+    #   - "src/**/*.js"
+    #   - "src/**/*.jsx"
+jobs:
+  claude-review:
+    # Optional: Filter by PR author
+    # if: |
+    #   github.event.pull_request.user.login == 'external-contributor' ||
+    #   github.event.pull_request.user.login == 'new-developer' ||
+    #   github.event.pull_request.author_association == 'FIRST_TIME_CONTRIBUTOR'
+    runs-on: ubuntu-latest
+    permissions:
+      contents: read
+      pull-requests: read
+      issues: read
+      id-token: write
+    steps:
+      - name: Checkout repository
+        uses: actions/checkout@v4
+        with:
+          fetch-depth: 1
+      - name: Run Claude Code Review
+        id: claude-review
+        uses: anthropics/claude-code-action@v1
+        with:
+          claude_code_oauth_token: ${{ secrets.CLAUDE_CODE_OAUTH_TOKEN }}
+          plugin_marketplaces: 'https://github.com/anthropics/claude-code.git'
+          plugins: 'code-review@claude-code-plugins'
+          prompt: '/code-review:code-review ${{ github.repository }}/pull/${{ github.event.pull_request.number }}'
+          # See https://github.com/anthropics/claude-code-action/blob/main/docs/usage.md
+          # or https://code.claude.com/docs/en/cli-reference for available options

package/.github/workflows/claude.yml ADDED Viewed

@@ -0,0 +1,50 @@
+name: Claude Code
+on:
+  issue_comment:
+    types: [created]
+  pull_request_review_comment:
+    types: [created]
+  issues:
+    types: [opened, assigned]
+  pull_request_review:
+    types: [submitted]
+jobs:
+  claude:
+    if: |
+      (github.event_name == 'issue_comment' && contains(github.event.comment.body, '@claude')) ||
+      (github.event_name == 'pull_request_review_comment' && contains(github.event.comment.body, '@claude')) ||
+      (github.event_name == 'pull_request_review' && contains(github.event.review.body, '@claude')) ||
+      (github.event_name == 'issues' && (contains(github.event.issue.body, '@claude') || contains(github.event.issue.title, '@claude')))
+    runs-on: ubuntu-latest
+    permissions:
+      contents: read
+      pull-requests: read
+      issues: read
+      id-token: write
+      actions: read # Required for Claude to read CI results on PRs
+    steps:
+      - name: Checkout repository
+        uses: actions/checkout@v4
+        with:
+          fetch-depth: 1
+      - name: Run Claude Code
+        id: claude
+        uses: anthropics/claude-code-action@v1
+        with:
+          claude_code_oauth_token: ${{ secrets.CLAUDE_CODE_OAUTH_TOKEN }}
+          # This is an optional setting that allows Claude to read CI results on PRs
+          additional_permissions: |
+            actions: read
+          # Optional: Give a custom prompt to Claude. If this is not specified, Claude will perform the instructions specified in the comment that tagged it.
+          # prompt: 'Update the pull request description to include a summary of changes.'
+          # Optional: Add claude_args to customize behavior and configuration
+          # See https://github.com/anthropics/claude-code-action/blob/main/docs/usage.md
+          # or https://code.claude.com/docs/en/cli-reference for available options
+          # claude_args: '--allowed-tools Bash(gh pr:*)'

package/CHANGELOG-v1.1.0.md ADDED Viewed

@@ -0,0 +1,227 @@
+# memory-lancedb-pro v1.1.0 — 智能记忆增强
+> **日期**: 2026-03-03
+> **作者**: CJY
+> **概述**: 基于对 AI Agent 记忆系统的深入理解，对记忆的写入质量、生命周期管理和去重能力进行了全面改进与完善
+---
+## 一、改进动机
+原有记忆系统在**检索侧**表现优异（Vector+BM25 混合检索、cross-encoder 重排序、多维评分），但在以下方面存在提升空间：
+- **记忆写入质量**：依赖正则表达式触发捕获，容易漏捕有价值信息或误捕噪声
+- **记忆结构层次**：扁平文本存储，缺乏分层索引能力
+- **记忆生命周期**：简单时间衰减，无法模拟人类记忆的遗忘与强化规律
+- **去重能力**：仅基于向量相似度的粗粒度去重，缺乏语义级判断
+本次改进针对这三个维度进行了系统性增强。
+---
+## 二、变更摘要
+| 改进维度     | 核心变更                                  | 效果                               |
+| ------------ | ----------------------------------------- | ---------------------------------- |
+| 智能提取     | LLM 驱动的 6 类别提取 + L0/L1/L2 分层存储 | 记忆写入更精准、结构更丰富         |
+| 生命周期管理 | Weibull 衰减模型 + 三层晋升/降级          | 重要记忆持久保留，过时记忆自然淡化 |
+| 智能去重     | 向量预过滤 + LLM 语义决策                 | 避免冗余记忆，支持信息演化合并     |
+---
+## 三、新增文件
+### 1. `src/memory-categories.ts` — 6 类别分类系统
+设计了语义明确的记忆分类体系，将记忆分为两大类六小类：
+- **用户记忆**：`profile`（身份属性）、`preferences`（偏好习惯）、`entities`（持续存在的实体）、`events`（发生的事件）
+- **Agent 记忆**：`cases`（问题-解决方案对）、`patterns`（可复用的处理流程）
+每个类别有不同的合并策略：
+- `profile` → 始终合并（用户身份信息持续累积）
+- `preferences` / `entities` / `patterns` → 支持智能合并
+- `events` / `cases` → 仅新增或跳过（独立记录，保留历史完整性）
+---
+### 2. `src/llm-client.ts` — LLM 客户端
+封装了 LLM 调用接口，专注于结构化 JSON 输出：
+- 复用现有 OpenAI SDK 依赖，零新增包
+- 内置 JSON 容错解析：支持 markdown 代码块包裹和平衡大括号提取
+- 低温度 (0.1) 保证输出一致性
+- 30 秒超时保护，失败时优雅降级
+---
+### 3. `src/extraction-prompts.ts` — 记忆提取提示模板
+精心设计了 3 个提示模板：
+| 函数                      | 用途                                                |
+| ------------------------- | --------------------------------------------------- |
+| `buildExtractionPrompt()` | 从对话中提取 6 类别 L0/L1/L2 记忆，含 few-shot 示例 |
+| `buildDedupPrompt()`      | CREATE / MERGE / SKIP 去重决策                      |
+| `buildMergePrompt()`      | 将新旧记忆合并为三层结构                            |
+提取提示包含完整的记忆价值判断标准、类别决策逻辑表、常见混淆澄清规则和 6 个 few-shot 示例。
+---
+### 4. `src/smart-extractor.ts` — 智能提取管线
+实现了完整的 LLM 驱动提取流水线：
+```
+对话文本 → LLM 提取 → 候选记忆 → 向量去重 → LLM 决策 → 持久化
+```
+核心设计：
+- **两阶段去重**：先用向量相似度（阈值 0.7）快速筛选候选，再用 LLM 进行语义级判断
+- **类别感知合并**：不同类别应用不同合并策略
+- **L0/L1/L2 三层存储**：L0 一句话索引用于检索注入，L1 结构化摘要用于精读，L2 完整叙述用于深度回顾
+- **向后兼容**：新增的 6 类别自动映射到已有的 5 类别存储，L0/L1/L2 存储在 metadata JSON 中
+- **按类别设定重要度**：profile (0.9) > patterns (0.85) > cases/preferences (0.8) > entities (0.7) > events (0.6)
+---
+### 5. `src/decay-engine.ts` — Weibull 衰减引擎
+基于认知心理学中的记忆遗忘曲线研究，实现了复合衰减模型：
+**复合分数 = 时效权重 × 时效 + 频率权重 × 频率 + 内在权重 × 内在价值**
+三个分量：
+| 分量                     | 机制                              | 含义                   |
+| ------------------------ | --------------------------------- | ---------------------- |
+| **时效 (recency)**       | Weibull 拉伸指数衰减 `exp(-λt^β)` | 越久远的记忆衰减越快   |
+| **频率 (frequency)**     | 对数饱和曲线 + 时间加权           | 越常被访问的记忆越活跃 |
+| **内在价值 (intrinsic)** | `importance × confidence`         | 高价值记忆天然抵抗遗忘 |
+层级特定的衰减形状 (β 参数)：
+- **Core** (β=0.8)：亚指数衰减 → 遗忘极慢，衰减地板 0.9
+- **Working** (β=1.0)：标准指数衰减，衰减地板 0.7
+- **Peripheral** (β=1.3)：超指数衰减 → 遗忘加速，衰减地板 0.5
+关键特性：
+- **重要性调制半衰期**：`effectiveHL = halfLife × exp(μ × importance)`，重要记忆持续更久
+- **搜索结果加权**：检索时自动应用衰减加权，让活跃记忆排名更高
+- **过期识别**：识别 composite < 0.3 的过期记忆
+---
+### 6. `src/tier-manager.ts` — 三层晋升/降级管理器
+模拟人类记忆的多级存储模型：
+```
+Peripheral（外围） ⟷ Working（工作） ⟷ Core（核心）
+```
+**晋升条件**：
+| 方向                 | 条件                                            |
+| -------------------- | ----------------------------------------------- |
+| Peripheral → Working | 访问次数 ≥ 3 且 衰减分数 ≥ 0.4                  |
+| Working → Core       | 访问次数 ≥ 10 且 衰减分数 ≥ 0.7 且 重要度 ≥ 0.8 |
+**降级条件**：
+| 方向                 | 条件                                             |
+| -------------------- | ------------------------------------------------ |
+| Working → Peripheral | 衰减分数 < 0.15 或（年龄 > 60 天且访问次数 < 3） |
+| Core → Working       | 衰减分数 < 0.15 且 访问次数 < 3（极少触发）      |
+---
+## 四、修改文件
+### `index.ts` — 插件入口
+#### 新增配置项
+```typescript
+smartExtraction?: boolean;    // 是否启用 LLM 智能提取（默认 true）
+llm?: {
+  apiKey?: string;            // LLM API Key（默认复用 embedding.apiKey）
+  model?: string;             // LLM 模型（默认 gpt-4o-mini）
+  baseURL?: string;           // LLM API 端点
+};
+extractMinMessages?: number;  // 最少消息数才触发提取（默认 2）
+extractMaxChars?: number;     // 送入 LLM 的最大字符数（默认 8000）
+```
+#### `agent_end` 钩子改进
+- 当 `smartExtraction` 启用时，优先使用 SmartExtractor 进行 LLM 6 类别提取
+- 当消息数不足或 SmartExtractor 未初始化时，降级回原有正则触发逻辑
+- 提取完成后输出统计日志：`smart-extracted N created, M merged, K skipped`
+#### `before_agent_start` 钩子改进
+- 注入的记忆上下文现在显示 L0 摘要而非原始文本
+- 新增 6 类别标签（如 `[preferences:global]`）
+- 新增层级标记（`[C]`ore / `[W]`orking / `[P]`eripheral）
+---
+## 五、配置指南
+### 最简配置（复用已有 API Key）
+```json
+{
+  "embedding": {
+    "apiKey": "${OPENAI_API_KEY}",
+    "model": "text-embedding-3-small"
+  },
+  "smartExtraction": true
+}
+```
+### 完整配置
+```json
+{
+  "embedding": {
+    "apiKey": "${OPENAI_API_KEY}",
+    "model": "text-embedding-3-small"
+  },
+  "smartExtraction": true,
+  "llm": {
+    "apiKey": "${OPENAI_API_KEY}",
+    "model": "gpt-4o-mini",
+    "baseURL": "https://api.openai.com/v1"
+  },
+  "extractMinMessages": 2,
+  "extractMaxChars": 8000
+}
+```
+### 禁用智能提取
+```json
+{
+  "smartExtraction": false
+}
+```
+---
+## 六、向后兼容性
+| 方面           | 兼容方式                                       |
+| -------------- | ---------------------------------------------- |
+| LanceDB Schema | 新字段存储在 `metadata` JSON 中，不修改表结构  |
+| 记忆类别       | 新 6 类别自动映射到原有 5 类别                 |
+| 混合检索       | Vector+BM25 检索管线完全保留                   |
+| 去重逻辑       | 仅在 `smartExtraction: true` 时生效            |
+| 已有数据       | 旧记忆正常读取，新记忆额外携带 L0/L1/L2 元数据 |
+| 配置           | 全部新增配置项均有默认值，零配置即可使用       |

package/CHANGELOG.md CHANGED Viewed

@@ -1,42 +1,28 @@
 # Changelog
-## 1.1.0-beta.6
+## 1.1.0-beta.2 (Smart Memory Beta + Access Reinforcement)
-- Refactor: build reset/new reflection handoff note in `runMemoryReflection`.
-- Refactor: `<open-loops>` now comes from the fresh reflection run, while `<derived-focus>` comes from historical scored itemized derived rows.
-- Refactor: upgrade historical `<derived-focus>` ranking to Derived-Focus V2 (conservative strict/soft normalization, non-linear group scoring, diversity-aware shortlist up to 36 before final note injection capped at 13, with no hard `score > 0.3` gate).
-- Breaking: stop writing and stop reading legacy combined reflection rows (`type=memory-reflection`).
-- Docs: refresh README / README_CN for the new handoff-note behavior and remove old legacy combined guidance.
+This is a **beta** release published under the npm dist-tag **`beta`** (it does not affect the stable `latest` channel).
----
-## 1.1.0
+Highlights:
+- **Smart Extraction (LLM-powered)**: 6-category extraction with L0/L1/L2 metadata (falls back to regex capture when disabled or init fails)
+- **Lifecycle scoring integrated into retrieval**: decay-based score adjustment + tier floors
+- **Tier transitions (best-effort)**: bounded metadata write-backs for top results (tier / access stats)
+- **Access reinforcement for time decay**: frequently *manually recalled* memories decay more slowly (spaced-repetition style)
+  - Adds `AccessTracker` with debounced metadata write-back (accessCount / lastAccessedAt)
+  - Adds retrieval config: `reinforcementFactor` (default: 0.5) and `maxHalfLifeMultiplier` (default: 3)
-- Feat: add integrated self-improvement governance flow (`agent:bootstrap`, `command:new/reset`, governance tools, and `.learnings` file bootstrap).
-- Feat: add `memoryReflection` session strategy with inheritance/derived injection, reflection persistence, and dedicated reflection-agent support.
-- Fix: keep session-strategy compatibility by mapping legacy `sessionMemory.enabled` to `systemSessionMemory` / `none` and trimming reflection input toward recent conversation tail.
-- Fix: retry early transient upstream reflection failures once and broaden session recovery search paths to real OpenClaw agent session directories.
-- Docs: update README / README_CN for session strategy, self-improvement, memoryReflection, mdMirror, and reflection fallback behavior.
-- Tests: add targeted coverage for reflection retry classification and session recovery path resolution.
-PRs: #43, #2
+Notes:
+- Access reinforcement is gated to manual recall (`source: \"manual\"`) to avoid auto-recall strengthening noise.
 ---
-## 1.0.32
-- Fix: strip OpenClaw `Conversation info` / `Sender` metadata noise before auto-capture matching and adaptive retrieval normalization, reducing false captures and noisy retrieval triggers.
-- Fix: parse `autoRecallMinRepeated` from plugin config so repeated-memory suppression works when configured.
+## 1.1.0-beta.1 (Smart Memory Beta)
-PR: #50
+- Initial beta with Smart Extraction + lifecycle components (decay engine + tier manager)
 ---
-## 1.0.31
-- Fix: `memory-pro import` now preserves provided IDs and is idempotent (skips if ID already exists).
 ## 1.0.26
 **Access Reinforcement for Time Decay**