npm - lifeos - Versions diffs - 1.0.2 → 1.0.3 - Mend

lifeos 1.0.2 → 1.0.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (46) hide show

package/README.md +15 -9
package/README.zh.md +15 -9
package/assets/lifeos-rules.en.md +1 -1
package/assets/lifeos-rules.zh.md +1 -1
package/assets/lifeos.yaml +1 -0
package/assets/skills/archive/SKILL.en.md +1 -1
package/assets/skills/archive/SKILL.zh.md +1 -1
package/assets/skills/ask/SKILL.en.md +1 -1
package/assets/skills/ask/SKILL.zh.md +1 -1
package/assets/skills/brainstorm/SKILL.en.md +1 -1
package/assets/skills/brainstorm/SKILL.zh.md +1 -1
package/assets/skills/digest/SKILL.en.md +212 -0
package/assets/skills/digest/SKILL.zh.md +207 -0
package/assets/skills/digest/references/__pycache__/rss-arxiv-script.cpython-312.pyc +0 -0
package/assets/skills/digest/references/config-parser.en.md +179 -0
package/assets/skills/digest/references/config-parser.zh.md +177 -0
package/assets/skills/digest/references/rss-arxiv-script.py +1549 -0
package/assets/skills/digest/references/run-pipeline.en.md +236 -0
package/assets/skills/digest/references/run-pipeline.zh.md +235 -0
package/assets/skills/digest/references/setup-guide.en.md +192 -0
package/assets/skills/digest/references/setup-guide.zh.md +188 -0
package/assets/skills/knowledge/SKILL.en.md +1 -1
package/assets/skills/knowledge/SKILL.zh.md +1 -1
package/assets/skills/project/SKILL.en.md +1 -1
package/assets/skills/project/SKILL.zh.md +1 -1
package/assets/skills/read-pdf/SKILL.en.md +1 -1
package/assets/skills/read-pdf/SKILL.zh.md +1 -1
package/assets/skills/research/SKILL.en.md +1 -1
package/assets/skills/research/SKILL.zh.md +1 -1
package/assets/skills/revise/SKILL.en.md +1 -1
package/assets/skills/revise/SKILL.zh.md +1 -1
package/assets/skills/today/SKILL.en.md +1 -1
package/assets/skills/today/SKILL.zh.md +1 -1
package/dist/cli/commands/doctor.js +9 -9
package/dist/cli/commands/doctor.js.map +1 -1
package/dist/cli/commands/upgrade.js +20 -2
package/dist/cli/commands/upgrade.js.map +1 -1
package/dist/cli/utils/install-assets.js +6 -2
package/dist/cli/utils/install-assets.js.map +1 -1
package/dist/config.d.ts +1 -0
package/dist/config.js +2 -0
package/dist/config.js.map +1 -1
package/dist/index.d.ts +1 -1
package/dist/index.js +1 -1
package/dist/server.js +1 -1
package/package.json +1 -1

package/assets/skills/digest/SKILL.zh.md ADDED Viewed

@@ -0,0 +1,207 @@
+---
+name: digest
+description: '通用信息周报技能：首次使用通过对话生成主题配置（Paper Sources、RSS、Web 搜索等），后续按配置自动抓取并产出结构化周报到草稿目录。支持多主题，每个主题独立配置和独立产出。当用户说"/digest"、"信息周报"、"周报"、"digest"时触发。'
+version: 1.0.3
+dependencies:
+  templates: []
+  prompts: []
+  schemas:
+    - path: "{系统目录}/{规范子目录}/Frontmatter_Schema.md"
+  agents: []
+---
+> [!config]
+> 本技能中的路径引用使用逻辑名（如 `{草稿目录}`）。
+> Orchestrator 从 `lifeos.yaml` 解析实际路径后注入上下文。
+> 路径映射：
+> - `{草稿目录}` → directories.drafts
+> - `{系统目录}` → directories.system
+> - `{信息子目录}` → subdirectories.system.digest
+> - `{规范子目录}` → subdirectories.system.schema
+你是 LifeOS 的信息汇总助手，帮助用户定期收集特定领域的最新进展，产出结构化周报。
+**语言规则**：所有回复、配置笔记和周报都必须为中文。
+# 工作流概述
+本技能有两种运行模式：
+| 模式 | 触发条件 | 行为 |
+|------|----------|------|
+| **Setup 模式** | `{系统目录}/{信息子目录}/` 下无配置文件，或用户指定 `setup` | 对话式引导，生成主题配置笔记 |
+| **Run 模式** | 配置文件已存在 | 读取配置，执行信息抓取，产出周报 |
+# 入口路由
+根据用户输入决定模式：
+```text
+/digest              → 扫描 {系统目录}/{信息子目录}/ 下所有 .md 配置，逐个执行 Run 模式
+                       若目录为空或不存在 → 自动进入 Setup 模式
+/digest <主题名>     → 只执行指定主题的 Run 模式（匹配文件名）
+                       若文件不存在 → 自动进入 Setup 模式，以该主题名开始引导
+/digest setup        → 进入 Setup 模式，创建新主题配置
+/digest setup <主题> → 进入 Setup 模式，以指定主题名开始引导
+```
+# Setup 模式
+按 `references/setup-guide.md` 执行对话式引导：
+1. **确定主题**：询问用户想追踪的领域和子方向
+2. **了解偏好**：学术 vs 行业、必读来源、关注重点
+3. **生成配置**：根据主题推荐信息源，生成完整配置笔记
+4. **用户确认**：写入 `{系统目录}/{信息子目录}/<TopicName>.md`，提示用户在 Obsidian 中检查和裁剪
+配置笔记使用 Markdown 表格 + checkbox 开关，用户在 Obsidian 中可直接编辑：
+- checkbox 勾选/取消 → 启用或禁用信息源模块
+- 表格增删行 → 增删具体信息源，包括 `Paper Sources`
+- 分类表格 → 调整周报结构
+配置笔记结构详见 `references/config-parser.md`。
+# Run 模式
+按 `references/run-pipeline.md` 执行信息抓取管线。
+### 前置检查
+1. 验证 Python 3 可用：`python3 --version`
+2. 读取并解析配置笔记（按 `references/config-parser.md` 规范）
+### 执行管线
+```text
+Phase 1: 解析配置 → 结构化数据
+Phase 2: 并行抓取
+  ├─ Task A: RSS + paper sources → Python 脚本（references/rss-arxiv-script.py）
+  ├─ Task B: Web 搜索 → WebSearch 工具
+  ├─ Task C: HuggingFace 热门 → WebFetch
+  └─ Task D: GitHub Trending → WebFetch（可选）
+Phase 3: 合并去重 → 按分类体系归类
+Phase 4: 写入周报 → {草稿目录}/<TopicName>-MMDD-MMDD.md
+```
+### Python 脚本调用
+RSS + paper source 抓取通过参数化 Python 脚本执行。技能先解析配置，构造 JSON 输入，通过 stdin 传入脚本。
+`Paper Sources` 模型现在支持：
+- `arXiv`、`bioRxiv`、`medRxiv`、`ChemRxiv`、`SocArXiv`、`SSRN`
+- 每一行包含 `Source Type`、`Query`、`Scope`、`Notes`
+- 脚本会为每个来源使用独立 adapter 归一化结果，并返回结构化错误，不会因为单个来源失败就中断整个流程
+- 旧版 `### arXiv Search` 配置块仍然兼容，系统会把它转换成 `arXiv` 来源继续执行
+- `SocArXiv` 结果在源站托管为 `OSF` 页面时，可以归一化为 `osf.io` 落地页
+- 所有论文来源 adapter 都保持低请求预算：每个来源一次主请求，瞬时错误有限 retry，不做分页补全
+其中 arXiv 模块有三个关键约束：
+- arXiv 查询必须使用英文
+- 脚本会先按类别抓最近论文，再在本地过滤
+- 若官方 arXiv 路径失败，可回退到 OpenAlex，但只保留能映射回 arXiv 的论文
+调用方式：
+```bash
+echo '<json_input>' | python3 .agents/skills/digest/references/rss-arxiv-script.py
+```
+JSON 输入格式：
+```json
+{
+  "language": "zh",
+  "rss": {
+    "enabled": true,
+    "feeds": [{"name": "源名称", "url": "https://..."}]
+  },
+  "papers": {
+    "enabled": true,
+    "sources": [
+      {
+        "source_type": "arXiv",
+        "query": "\"llm agent\"",
+        "scope": "cs.AI",
+        "notes": "核心技术论文"
+      },
+      {
+        "source_type": "bioRxiv",
+        "query": "single-cell",
+        "scope": "Neuroscience",
+        "notes": "生物医学预印本"
+      }
+    ]
+  },
+  "days": 7
+}
+```
+脚本会返回 `rss_articles`、归一化后的论文结果、`stats` 和结构化的 `errors`。
+### 周报产出
+写入 `{草稿目录}/<TopicName>-MMDD-MMDD.md`：
+```yaml
+---
+title: "{主题} 周报 · YYYY-MM-DD ~ YYYY-MM-DD"
+type: draft
+created: "YYYY-MM-DD"
+status: pending
+tags: [digest, {topic-tag}, weekly-digest]
+aliases: []
+---
+```
+正文按配置的分类体系组织，每条信息用 1-2 句中文摘要 + 原文链接。空分类不输出。末尾附信息来源清单。
+# 文件路径
+| 内容 | 路径 |
+|------|------|
+| 主题配置文件 | `{系统目录}/{信息子目录}/<TopicName>.md` |
+| 周报产出 | `{草稿目录}/<TopicName>-MMDD-MMDD.md` |
+| 解析规范 | `references/config-parser.md` |
+| Setup 引导 | `references/setup-guide.md` |
+| Run 管线 | `references/run-pipeline.md` |
+| Python 脚本 | `references/rss-arxiv-script.py` |
+# 记忆系统集成
+> 通用协议（文件变更通知、技能完成、会话收尾）见 `_shared/memory-protocol.md`。以下仅列出本技能特有的行为。
+### 文件变更通知
+周报文件写入 Vault 后，立即调用：
+```text
+memory_notify(file_path="{草稿目录}/<TopicName>-MMDD-MMDD.md")
+```
+### 技能完成
+```text
+memory_skill_complete(
+  skill_name="digest",
+  summary="生成 {主题} 周报 MMDD-MMDD",
+  related_files=["{草稿目录}/<TopicName>-MMDD-MMDD.md"],
+  scope="digest",
+  refresh_targets=["TaskBoard", "UserProfile"]
+)
+```
+### Setup 模式完成时
+配置文件创建后，额外记录一条决策：
+```text
+memory_log(
+  entry_type="decision",
+  summary="创建 {主题} 信息订阅配置",
+  importance=2,
+  scope="digest"
+)
+```

package/assets/skills/digest/references/__pycache__/rss-arxiv-script.cpython-312.pyc ADDED Viewed

Binary file

package/assets/skills/digest/references/config-parser.en.md ADDED Viewed

@@ -0,0 +1,179 @@
+# Config Note Parsing Rules
+This document defines how the `/digest` skill parses config notes stored at `{system directory}/{digest subdirectory}/<TopicName>.md`.
+## File Structure
+The config note contains the following fixed sections, identified by second-level and third-level headings:
+```text
+# <TopicName> Digest          ← title only, not parsed
+## Basic Info                ← key-value table
+## Sources                   ← container heading, not parsed
+  ### RSS Feeds              ← module: checkbox + table
+  ### Paper Sources          ← module: checkbox + table
+  ### arXiv Search           ← legacy module: checkbox + table (still accepted)
+  ### Web Search             ← module: checkbox + table + supplemental sites table
+  ### HuggingFace Papers     ← module: checkbox + keyword line
+  ### GitHub Trending        ← module: checkbox + keyword line
+## Categories                ← category table
+## Source List               ← not parsed, generated into the digest output
+```
+## Parsing Rules
+### 1. Basic Info
+Locate `## Basic Info` and parse the two-column table (`Field | Value`):
+| Field | Purpose | Required |
+|-------|---------|----------|
+| Topic | topic name used in digest title and filename | yes |
+| Cadence | `Weekly` / `Biweekly` / `Monthly`, used to determine lookback window | yes |
+| Language | digest output language | yes |
+**Cadence mapping:**
+- `Weekly` → 7 days
+- `Biweekly` → 14 days
+- `Monthly` → 30 days
+### 2. Module Enabled State
+The first checkbox after each `###` heading controls whether that module is enabled:
+```markdown
+### RSS Feeds
+- [x] Enabled
+```
+```markdown
+### GitHub Trending
+- [ ] Enabled
+```
+**Parsing logic:**
+1. find the `###` heading
+2. scan downward to the first line matching `- \[[ x]\]`
+3. `[x]` means enabled, `[ ]` means disabled
+### 3. Module Data
+#### RSS Feeds
+Table schema: `Name | URL | Focus`
+```json
+{
+  "enabled": true,
+  "feeds": [
+    {"name": "Import AI", "url": "https://importai.substack.com", "description": "Frontier AI research commentary"}
+  ]
+}
+```
+**URL handling:**
+- prepend `https://` when the URL does not start with `http`
+- if the URL has no `/feed` or `/rss`, optionally try appending `/feed`
+#### Paper Sources
+Table schema: `Source Type | Query | Scope | Notes`
+```json
+{
+  "enabled": true,
+  "sources": [
+    {
+      "source_type": "arXiv",
+      "query": "\"LLM agent\"",
+      "scope": "cs.AI, cs.CL",
+      "notes": "Core technical papers"
+    },
+    {
+      "source_type": "bioRxiv",
+      "query": "single-cell",
+      "scope": "Neuroscience",
+      "notes": "Biomedical preprints"
+    }
+  ]
+}
+```
+**Supported source types:** `arXiv`, `bioRxiv`, `medRxiv`, `ChemRxiv`, `SocArXiv`, `SSRN`.
+**Source semantics:** `Query` is the search term or keyword phrase; `Scope` is the category,
+collection, or journal filter used by that source; `Notes` is free-form guidance for the helper.
+**Normalization:** the helper converts each row into a source adapter input and deduplicates papers
+across sources.
+**Source-link rules:** `SocArXiv` may normalize to `osf.io` or `socarxiv.com`; `SSRN` must
+normalize to `papers.ssrn.com`, `ssrn.com`, or an SSRN DOI.
+**Budget rule:** keep one primary request per source and do not paginate.
+**Compatibility:** this is the preferred model for new notes.
+#### arXiv Search
+Table schema: `Keyword | Categories`
+```json
+{
+  "enabled": true,
+  "keywords": ["\"LLM agent\"", "\"tool use\" language model"],
+  "categories": ["cs.AI", "cs.CL", "cs.IR"],
+  "max_results": 200
+}
+```
+**Legacy compatibility:** the parser still accepts `### arXiv Search` and normalizes it into an
+`arXiv` paper source so older notes continue to work.
+**Keyword language:** keywords must be English terms or English quoted phrases. Treat non-English
+keywords as a config error for the arXiv source.
+**Category deduplication:** combine all categories from every row and deduplicate them.
+**Primary fetch behavior:** categories drive the official arXiv feed; keyword filtering happens
+locally against title and abstract.
+**Fallback behavior:** when categories are missing or the official arXiv path fails, the helper may
+fall back to OpenAlex, but only keep papers that map back to arXiv.
+**max_results:** fixed at 200 and not exposed in the note.
+#### Web Search
+Two tables:
+1. **Query Template** (`Query Template | Coverage`)
+2. **Supplemental Sites** (`Name | URL | Focus`)
+Replace `{date range}` at runtime with the actual date span. Supplemental sites are used to build additional `site:` queries.
+#### HuggingFace Papers
+Locate the `**Filter keywords:**` line and split keywords by commas.
+#### GitHub Trending
+Same parsing rule as HuggingFace.
+### 4. Categories
+Locate `## Categories` and parse the table `Category | Coverage`:
+```json
+{
+  "categories": [
+    {"name": "Key Papers / Key Articles", "scope": "The 3-5 most important papers or articles this week"},
+    {"name": "Frameworks and Tooling", "scope": "Agent frameworks, tooling, SDK updates"}
+  ]
+}
+```
+## Tolerance Rules
+| Problem | Handling |
+|---------|----------|
+| unrecognized module heading | ignore that section |
+| missing checkbox | treat as enabled |
+| mismatched table columns | parse the available cells and fill missing values with empty strings |
+| missing required Basic Info field | raise an error and ask the user to complete the note |
+| empty or malformed config note | raise an error and suggest running `/digest setup` |

package/assets/skills/digest/references/config-parser.zh.md ADDED Viewed

@@ -0,0 +1,177 @@
+# 配置笔记解析规范
+本文档定义 `/digest` 技能如何解析 `{系统目录}/{信息子目录}/<TopicName>.md` 配置笔记。
+## 文件结构
+配置笔记由以下固定 section 组成，按二级/三级标题识别：
+```text
+# <主题名> 信息              ← 标题（不解析）
+## 基本信息                   ← 键值表格
+## 信息源                     ← 容器标题（不解析）
+  ### RSS 订阅               ← 模块：checkbox + 表格
+  ### Paper Sources          ← 模块：checkbox + 表格
+  ### arXiv 搜索             ← 旧版模块：checkbox + 表格（仍兼容）
+  ### Web 搜索               ← 模块：checkbox + 表格 + 补充站点表格
+  ### HuggingFace 热门论文    ← 模块：checkbox + 关键词行
+  ### GitHub Trending         ← 模块：checkbox + 关键词行
+## 分类体系                   ← 分类表格
+## 信息来源清单               ← 不解析，周报生成时自动填充
+```
+## 解析规则
+### 1. 基本信息
+定位 `## 基本信息`，解析其下的两列表格（`字段 | 值`）：
+| 字段 | 用途 | 必填 |
+|------|------|------|
+| 主题 | 主题名，用于周报标题和文件命名 | 是 |
+| 周期 | `每周` / `每两周` / `每月`，决定回溯天数 | 是 |
+| 语言 | 周报输出语言 | 是 |
+**周期映射：**
+- `每周` → 7 天
+- `每两周` → 14 天
+- `每月` → 30 天
+### 2. 模块启用状态
+每个三级标题（`###`）后的第一个 checkbox 决定启用状态：
+```markdown
+### RSS 订阅
+- [x] 启用
+```
+```markdown
+### GitHub Trending
+- [ ] 启用
+```
+**解析逻辑：**
+1. 找到 `###` 标题行
+2. 向下扫描，找到第一个匹配 `- \[[ x]\]` 的行
+3. `[x]` 视为启用，`[ ]` 视为禁用
+### 3. 模块数据解析
+#### RSS 订阅
+表格 schema：`名称 | URL | 方向`
+```json
+{
+  "enabled": true,
+  "feeds": [
+    {"name": "Import AI", "url": "https://importai.substack.com", "description": "AI 前沿研究综述"}
+  ]
+}
+```
+**URL 处理：**
+- 若 URL 不以 `http` 开头，自动补全 `https://`
+- 若 URL 不含 `/feed` 或 `/rss`，可尝试在末尾追加 `/feed` 作为 RSS 地址
+#### Paper Sources
+表格 schema：`Source Type | Query | Scope | Notes`
+```json
+{
+  "enabled": true,
+  "sources": [
+    {
+      "source_type": "arXiv",
+      "query": "\"LLM agent\"",
+      "scope": "cs.AI, cs.CL",
+      "notes": "核心技术论文"
+    },
+    {
+      "source_type": "bioRxiv",
+      "query": "single-cell",
+      "scope": "Neuroscience",
+      "notes": "生物医学预印本"
+    }
+  ]
+}
+```
+**支持的来源类型：** `arXiv`、`bioRxiv`、`medRxiv`、`ChemRxiv`、`SocArXiv`、`SSRN`。
+**字段含义：** `Query` 是检索词或关键词短语；`Scope` 是该来源使用的类别、集合或期刊
+过滤；`Notes` 是给 helper 的自由说明。
+**归一化：** helper 会把每一行转换成独立来源 adapter 输入，并在不同来源之间去重。
+**来源链接规则：** `SocArXiv` 可以归一化到 `osf.io` 或 `socarxiv.com`；`SSRN`
+必须归一化到 `papers.ssrn.com`、`ssrn.com` 或 SSRN DOI。
+**预算规则：** 每个来源只发一次主请求，不做分页。
+**兼容策略：** 新配置优先使用这个模型。
+#### arXiv 搜索
+表格 schema：`关键词 | 类别`
+```json
+{
+  "enabled": true,
+  "keywords": ["\"LLM agent\"", "\"tool use\" language model"],
+  "categories": ["cs.AI", "cs.CL", "cs.IR"],
+  "max_results": 200
+}
+```
+**旧版兼容：** 解析器仍然接受 `### arXiv 搜索`，并将其归一化为一个 `arXiv` 论文来源，
+确保旧配置继续可用。
+**关键词语言：** 关键词必须是英文词或英文引号短语。若出现中文关键词，则将 arXiv
+来源视为配置错误。
+**类别去重：** 合并所有行的类别列，去重后作为搜索范围。
+**主抓取行为：** 类别用于官方 arXiv feed 抓取，关键词只在本地对标题和摘要做过滤。
+**fallback 行为：** 若类别缺失，或官方 arXiv 路径失败，可回退到 OpenAlex，但只保留能映射回
+arXiv 的论文。
+**max_results：** 固定 200，不在配置中暴露。
+#### Web 搜索
+两张表格：
+1. **搜索查询模板**（`搜索查询模板 | 目标覆盖`）
+2. **补充站点**（`名称 | URL | 方向`）
+查询模板中的 `{日期范围}` 在运行时替换为实际日期。补充站点用于额外构造 `site:` 查询。
+#### HuggingFace 热门论文
+定位 `**筛选关键词：**` 行，按逗号分割提取关键词列表。
+#### GitHub Trending
+同 HuggingFace，定位 `**筛选关键词：**` 行。
+### 4. 分类体系
+定位 `## 分类体系`，解析表格 `分类 | 覆盖范围`：
+```json
+{
+  "categories": [
+    {"name": "重要论文", "scope": "本周影响力最大的 3-5 篇论文"},
+    {"name": "框架与工具", "scope": "Agent 框架、开发工具、SDK 更新"}
+  ]
+}
+```
+## 容错规则
+| 异常 | 处理 |
+|------|------|
+| 模块标题不在枚举中 | 忽略该 section |
+| 模块无 checkbox | 视为启用 |
+| 表格列数不匹配 | 按已有列解析，缺失列填空 |
+| 基本信息缺少必填字段 | 报错并提示用户补全 |
+| 配置文件为空或格式错误 | 报错并建议运行 `/digest setup` |