npm - sillyspec - Versions diffs - 3.7.33 → 3.8.0 - Mend

sillyspec 3.7.33 → 3.8.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (21) hide show

package/.claude/skills/sillyspec-auto/SKILL.md +128 -0
package/.sillyspec/changes/archive/2026-04-08-derive-state/design.md +97 -0
package/.sillyspec/changes/archive/2026-04-08-derive-state/plan.md +51 -0
package/.sillyspec/changes/archive/2026-04-08-derive-state/proposal.md +29 -0
package/.sillyspec/changes/archive/2026-04-08-derive-state/requirements.md +34 -0
package/.sillyspec/changes/archive/2026-04-08-derive-state/tasks.md +13 -0
package/.sillyspec/changes/archive/2026-04-08-derive-state/verify-result.md +43 -0
package/.sillyspec/changes/auto-mode/design.md +50 -0
package/.sillyspec/changes/auto-mode/proposal.md +19 -0
package/.sillyspec/changes/auto-mode/requirements.md +21 -0
package/.sillyspec/changes/auto-mode/tasks.md +7 -0
package/package.json +1 -1
package/src/derive.js +147 -0
package/src/index.js +27 -1
package/src/progress.js +68 -1
package/src/run.js +162 -3
package/src/stages/brainstorm.js +27 -5
package/src/stages/doctor.js +10 -1
package/src/stages/execute.js +5 -0
package/src/stages/plan.js +9 -0
package/src/stages/verify.js +6 -0

package/.claude/skills/sillyspec-auto/SKILL.md ADDED Viewed

@@ -0,0 +1,128 @@
+---
+description: 自动模式 — 专精子代理驱动全流程
+argument-hint: "<需求描述>"
+---
+## 用法
+- /sillyspec:auto 实现用户登录功能
+- /sillyspec:auto 修复搜索结果的排序问题
+## 任务
+$ARGUMENTS
+---
+## 架构
+你是编排器，不亲自干活。每个阶段启动专精子代理执行，通过文件契约传递信息。
+## 阶段 personas
+### brainstorm — 资深架构师
+```
+你是一位有 15 年经验的系统架构师。
+思维模式：先理解业务本质，再设计技术方案。善于从模糊需求中提炼关键约束。
+关注点：系统边界、模块划分、扩展性、技术选型的 trade-off。
+沟通风格：直接、提问犀利、不给模糊方案。不确定就说不确定，不猜。
+输出习惯：决策要附理由，方案要列 trade-off，不用"可能""也许"。
+```
+### plan — 技术项目经理
+```
+你是一位经验丰富的技术项目经理。
+思维模式：任务拆解要粒度均匀（每个任务 1-2 小时可完成），依赖关系要明确。
+关注点：任务优先级、风险点、验证标准、里程碑。
+沟通风格：条理清晰，用 checkbox 和编号，不做模糊描述。
+输出习惯：每个任务有明确的完成标准，Wave 间有依赖说明。
+```
+### execute — 高级工程师
+```
+你是一位严谨的高级工程师。
+思维模式：先读规范再写代码，严格遵循 CONVENTIONS.md。代码要有清晰的职责划分。
+关注点：代码质量、边界处理、错误处理、命名规范。
+沟通风格：少说多做，遇到规范冲突优先问，不自作主张。
+输出习惯：每个函数有注释，改动要解释原因，测试要覆盖边界。
+```
+### verify — QA 专家
+```
+你是一位吹毛求疵的 QA 专家。
+思维模式：假设所有代码都有 bug，用最坏情况测试。关注边界、异常、并发。
+关注点：规范一致性、边界条件、回归风险、性能隐患。
+沟通风格：有问题直说，不放过任何可疑点，用证据说话。
+输出习惯：bug 要有复现步骤，验证要有具体标准，不写"看起来没问题"。
+```
+## 执行流程
+### 启动
+1. 运行 `sillyspec run auto --input "<用户需求>"`
+2. CLI 输出当前状态概览和第一步 prompt
+### 阶段循环
+按 brainstorm → plan → execute → verify 顺序，**每个阶段启动子代理**：
+1. 读取 CLI 输出的 step prompt
+2. **判断是否需要用户确认：**
+   - prompt 中包含"请用户选择""等待用户回答""展示给用户""用户确认" → **暂停，等用户回复**
+   - 纯内部操作 → **继续**
+3. 启动子代理执行（注入对应 persona + 当前 step prompt）
+4. 子代理完成后，运行 `sillyspec run auto --done --output "<摘要>"`
+5. 读取 CLI 输出的下一步 prompt
+6. 当前阶段所有 step 完成后，自动进入下一阶段，换 persona 启动新子代理
+### 子代理启动格式
+```
+## 你的角色
+{对应阶段的 persona}
+## 当前任务
+{sillyspec CLI 输出的 step prompt}
+## 项目上下文
+请先读取以下文件了解项目背景：
+- `.sillyspec/docs/<project>/scan/CONVENTIONS.md`
+- `.sillyspec/docs/<project>/scan/ARCHITECTURE.md`
+- `.sillyspec/changes/<变更名>/design.md`（如果存在）
+## 规则
+- 不要使用 npx
+- 不要编造不存在的 CLI 子命令
+- 不要自动 commit，只 git add
+- 完成后汇报结果，不要自行推进下一步
+```
+## 子代理交互协议
+子代理遇到需要用户确认的问题时，**不要自己猜测**，输出以下标记后暂停：
+```
+### ❓ 需要用户确认
+**问题：** xxx
+**选项：** A) xxx  B) xxx  C) xxx
+**建议：** B（因为 xxx）
+```
+主代理（你）看到 ❓ 标记后：
+1. 暂停当前流程
+2. 把问题转发给用户
+3. 等用户回复
+4. 把用户回复发送给子代理（sessions_send），子代理继续执行
+### CLI prompt 中的用户确认点
+sillyspec CLI 的 step prompt 本身可能包含"请用户选择""等待用户回答"等要求。
+主代理（你）在把 prompt 发给子代理前，先扫描这些关键词。
+如果有 → 先自己处理（询问用户），拿到答案后再发给子代理。
+这样子代理不需要处理交互，只管干活。
+## 异常处理
+- 命令执行失败 → 展示错误，暂停等用户介入
+- 用户说"停止"/"暂停" → 立即停止
+- 子代理失败 → 展示错误，可重试或跳过
+## 完成条件
+CLI 输出"全部流程已完成"后，提示用户 `/sillyspec:commit` 提交改动

package/.sillyspec/changes/archive/2026-04-08-derive-state/design.md ADDED Viewed

@@ -0,0 +1,97 @@
+# deriveState 状态推导
+author: qinyi
+created_at: 2026-04-08 07:10:00
+## 背景
+当前 sillyspec 的状态管理依赖 `progress.json` 作为唯一数据源。如果 AI 崩溃或异常中断，progress.json 可能与实际产出不一致（如 artifacts 文件已生成但步骤未标记完成）。
+借鉴 GSD v2 的 deriveState 架构，从文件系统反推状态，与 progress.json 交叉校验。
+## 需求
+1. `--done` 完成步骤时轻量校验当前步骤
+2. `doctor` 自检时全量扫描所有阶段
+3. `progress validate --deep` 支持手动触发全量校验
+4. 安全修复策略：明显正确的情况自动修复，有歧义的不动
+## 设计
+### 新增文件：`src/derive.js`
+纯函数模块，零外部依赖（仅 fs/path）。
+```js
+export function deriveState(cwd, options = {}) {
+  // options.mode: 'light' | 'full'（默认 light）
+  // options.fix: boolean（默认 false，只报告不修复）
+  // 返回 { issues: [{type, severity, step, artifact, suggestion}], fixed: number }
+}
+```
+#### 扫描逻辑
+1. 读取 progress.json，获取所有阶段步骤状态
+2. 扫描 `.sillyspec/.runtime/artifacts/` 目录
+3. 文件名格式：`{stage}-step{N}-{timestamp}.txt`
+4. 解析文件名提取 stage、stepIndex 信息
+5. 对比规则：
+| 情况 | 严重度 | 自动修复 |
+|------|--------|----------|
+| artifacts 有文件但 progress 标记未完成 | issue | ✅ 标记为 done |
+| progress 标记已完成但 artifacts 无文件 | warning | ❌ 可能被手动清理 |
+| artifacts 有 step5 但 progress 只到 step3 | issue | ✅ 补齐中间步骤 |
+#### 模式
+- **light**：只检查 currentStage 的当前步骤和前一步
+- **full**：检查所有阶段所有步骤
+### 集成方式
+#### 1. run.js — `--done` 轻量校验
+在 `completeStep` 末尾：
+```js
+import { deriveState } from './derive.js';
+const result = deriveState(cwd, { mode: 'light', fix: true });
+if (result.fixed > 0) {
+  console.log(`⚠️ 状态修复：${result.fixed} 个步骤已从 artifacts 恢复`);
+}
+```
+#### 2. doctor.js — 全量扫描
+第一步（SillySpec 内部检查）调用：
+```js
+const result = deriveState(cwd, { mode: 'full', fix: false });
+// 将 issues 加入自检报告
+```
+#### 3. progress.js — `validate --deep`
+validate 方法支持 deep 参数：
+```js
+validate(cwd, deep = false) {
+  // ...现有校验逻辑...
+  if (deep) {
+    const result = deriveState(cwd, { mode: 'full', fix: true });
+    // 输出校验结果
+  }
+}
+```
+CLI：`sillyspec progress validate --deep`
+## 改动范围
+- 新增：`src/derive.js`（~80 行）
+- 修改：`src/run.js`（2 行）、`src/stages/doctor.js`（3 行）、`src/progress.js`（5 行）、`src/index.js`（parse --deep）
+## 不做的事
+- 不引入 SQLite 或其他新依赖
+- 不改变 progress.json 的数据结构
+- 不自动删除 progress 中有但 artifacts 无的步骤（可能被手动清理）

package/.sillyspec/changes/archive/2026-04-08-derive-state/plan.md ADDED Viewed

@@ -0,0 +1,51 @@
+# deriveState 状态推导 — 实现计划
+author: qinyi
+created_at: 2026-04-08 07:12:00
+## Wave 1（核心，无依赖）
+- [ ] 实现 derive.js 核心函数
+  - 新增: `src/derive.js`
+  - 步骤:
+    1. 实现 `deriveState(cwd, options)` 纯函数
+    2. 实现 artifacts 文件名解析（`{stage}-step{N}-{timestamp}.txt`）
+    3. 实现 light/full 模式扫描逻辑
+    4. 实现安全修复策略（issues 分类 + fix 逻辑）
+    5. 验证: 在 sillyspec 项目上手动创建测试 artifacts，运行 `node -e "import('./src/derive.js').then(m => console.log(m.deriveState(process.cwd(), {mode:'full'})))"` 确认输出
+## Wave 2（集成，依赖 Wave 1）
+- [ ] 集成 run.js --done 轻量校验
+  - 修改: `src/run.js`
+  - 参考: `completeStep` 函数末尾
+  - 步骤:
+    1. 在 completeStep 末尾 import 并调用 deriveState(cwd, {mode:'light', fix:true})
+    2. 有修复时输出警告信息
+    3. 验证: 运行 brainstorm 完成 --done，确认无报错
+- [ ] 扩展 validate 支持 --deep
+  - 修改: `src/progress.js`
+  - 参考: `validate()` 方法
+  - 步骤:
+    1. validate 方法加 deep 参数
+    2. deep=true 时调用 deriveState(cwd, {mode:'full', fix:true})
+    3. 验证: `sillyspec progress validate --deep` 确认输出校验结果
+- [ ] CLI parse --deep 参数
+  - 修改: `src/index.js`
+  - 参考: progress 子命令的参数解析
+  - 步骤:
+    1. 在 progress validate 命令中解析 --deep flag
+    2. 传递给 validate 方法
+    3. 验证: `sillyspec progress validate --deep` 确认 flag 生效
+## Wave 3（集成，依赖 Wave 1）
+- [ ] 集成 doctor.js 全量扫描
+  - 修改: `src/stages/doctor.js`
+  - 参考: doctor 第一步（SillySpec 内部检查）的 prompt
+  - 步骤:
+    1. 在第一步 prompt 中加入 deriveState 全量扫描指令
+    2. 将 issues 列表纳入自检报告
+    3. 验证: `sillyspec run doctor` 确认第一步输出包含状态一致性检查

package/.sillyspec/changes/archive/2026-04-08-derive-state/proposal.md ADDED Viewed

@@ -0,0 +1,29 @@
+# deriveState 状态推导 — 提案
+author: qinyi
+created_at: 2026-04-08 07:11:00
+## 动机
+当前 progress.json 是 sillyspec 唯一的状态数据源。AI 崩溃或异常中断时，progress.json 可能与实际产出不一致（artifacts 已生成但步骤未标记完成）。需要从文件系统反推状态，交叉校验。
+## 变更范围
+- 新增 `src/derive.js`（状态推导纯函数）
+- 修改 `src/run.js`（--done 轻量校验）
+- 修改 `src/stages/doctor.js`（全量扫描）
+- 修改 `src/progress.js`（validate --deep）
+- 修改 `src/index.js`（parse --deep 参数）
+## 不在范围内
+- 不改 progress.json 数据结构
+- 不引入新依赖
+- 不自动删除 progress 中有但 artifacts 无的步骤
+## 成功标准
+1. `--done` 完成步骤时自动校验并修复当前步骤
+2. `doctor` 输出全量状态一致性报告
+3. `sillyspec progress validate --deep` 可手动触发全量校验
+4. 所有校验通过现有测试

package/.sillyspec/changes/archive/2026-04-08-derive-state/requirements.md ADDED Viewed

@@ -0,0 +1,34 @@
+# deriveState 状态推导 — 需求
+author: qinyi
+created_at: 2026-04-08 07:11:00
+## 功能需求
+### FR1: deriveState 核心函数
+- 从 artifacts 目录扫描文件，解析 `{stage}-step{N}-{timestamp}.txt` 格式
+- 与 progress.json 步骤状态对比
+- 返回 issues 列表和修复计数
+### FR2: 轻量模式（light）
+- 只检查 currentStage 的当前步骤和前一步
+- 用于 `--done` 完成时
+### FR3: 全量模式（full）
+- 检查所有阶段所有步骤
+- 用于 doctor 和 validate --deep
+### FR4: 安全修复策略
+- artifacts 有但 progress 漏记 → 自动修复为 done
+- artifacts 有 step5 但 progress 只到 step3 → 自动补齐
+- progress 有但 artifacts 无 → 只警告，不修复
+### FR5: CLI 集成
+- `--done` 时静默调用轻量校验，有修复才输出
+- `doctor` 第一步输出全量报告
+- `sillyspec progress validate --deep` 手动触发
+## 非功能需求
+- 零外部依赖（仅 fs/path）
+- 纯函数，易于测试
+- 不改变现有 API 行为

package/.sillyspec/changes/archive/2026-04-08-derive-state/tasks.md ADDED Viewed

@@ -0,0 +1,13 @@
+# deriveState 状态推导 — 任务
+author: qinyi
+created_at: 2026-04-08 07:11:00
+## 任务列表
+- [x] 实现 derive.js 核心函数 — `src/derive.js`
+- [x] 集成 run.js --done 轻量校验 — `src/run.js`
+- [x] 集成 doctor.js 全量扫描 — `src/stages/doctor.js`
+- [x] 扩展 validate 支持 --deep — `src/progress.js`
+- [x] CLI parse --deep 参数 — `src/index.js`
+- [ ] 测试验证

package/.sillyspec/changes/archive/2026-04-08-derive-state/verify-result.md ADDED Viewed

@@ -0,0 +1,43 @@
+# derive-state 验证报告
+author: qinyi
+created_at: 2026-04-08 07:21:00
+## 结论：✅ PASS
+## 检查结果
+### 1. 规范文件加载
+- ✅ design.md
+- ✅ proposal.md
+- ✅ requirements.md
+- ✅ tasks.md
+- ✅ plan.md
+### 2. 任务完成度：5/6 (83%)
+- ✅ 实现 derive.js 核心函数
+- ✅ 集成 run.js --done 轻量校验
+- ✅ 集成 doctor.js 全量扫描
+- ✅ 扩展 validate 支持 --deep
+- ✅ CLI parse --deep 参数
+- ⬜ 测试验证（本项目无自动化测试套件，通过手动验证替代）
+### 3. 设计一致性
+- ✅ derive.js 纯函数模块，零外部依赖（仅 fs/path）
+- ✅ light/full 模式
+- ✅ fix 参数 + 安全修复策略
+- ✅ run.js --done 轻量校验集成
+- ✅ doctor.js 全量扫描集成
+- ✅ progress.js validate --deep 支持
+- ✅ index.js CLI --deep 参数
+- ✅ 改动文件范围与 design.md 一致
+### 4. 测试和质量
+- ✅ derive.js 模块导入正常
+- ✅ sillyspec progress validate --deep 通过
+- ✅ 无 TODO/FIXME/HACK/XXX 技术债务
+- ✅ sillyspec run quick --status 正常
+- ✅ sillyspec run doctor --status 正常
+## 下一步
+sillyspec run archive

package/.sillyspec/changes/auto-mode/design.md ADDED Viewed

@@ -0,0 +1,50 @@
+# auto mode — 设计文档
+author: qinyi
+created_at: 2026-04-08 07:28:00
+## 背景
+当前 sillyspec 的每个阶段（brainstorm → plan → execute → verify）需要用户手动执行 `sillyspec run <stage>` 和 `sillyspec run <stage> --done`。用户希望一个自动模式，从 brainstorm 一路推进到 verify 完成。
+## 需求
+1. 用户启动一次，AI 自动循环所有阶段和步骤
+2. 步骤内部的用户确认点保留不变
+3. 不修改 CLI 代码，纯 skill 文件实现
+## 设计
+### 新增文件：`.claude/skills/sillyspec-auto/SKILL.md`
+**核心逻辑：**
+1. 读 `$ARGUMENTS` 作为用户需求
+2. 阶段循环（brainstorm → plan → execute → verify）
+3. 每个阶段内步骤循环：
+   - `sillyspec run <stage> --input "需求"` → 读 step prompt
+   - 执行 prompt 中的操作
+   - 需要用户确认的步骤 → 暂停等回复
+   - 完成后自动 `sillyspec run <stage> --done --output "摘要"`
+   - 读下一步 prompt，继续
+4. 当前阶段全部完成 → 自动进入下一阶段
+5. verify 完成 → 输出总结，停止
+6. 命令失败 → 暂停，等用户介入
+**确认点保留规则：**
+- prompt 中有"请用户选择""等待用户回答""展示给用户"等字样 → 暂停
+- prompt 中有"自审""检查"等纯内部操作 → 自动完成
+### 同步到 npm 包
+init.js 已有逻辑复制 `sillyspec-*` skills 到项目 `.claude/skills/`，新 skill 自动生效。
+## 改动范围
+- 新增：`.claude/skills/sillyspec-auto/SKILL.md`（~60 行）
+## 不做的事
+- 不修改任何 JS 源码
+- 不改变现有阶段流程
+- 不自动 commit 或发布

package/.sillyspec/changes/auto-mode/proposal.md ADDED Viewed

@@ -0,0 +1,19 @@
+# auto mode — 提案
+author: qinyi
+created_at: 2026-04-08 07:29:00
+## 动机
+用户希望一次启动就自动完成 brainstorm → plan → execute → verify 全流程，不需要手动输入 `sillyspec run <stage>` 和 `--done`。
+## 变更范围
+新增 `.claude/skills/sillyspec-auto/SKILL.md`
+## 不在范围内
+- 不修改 JS 源码
+- 不改变阶段流程
+## 成功标准
+1. `/sillyspec:auto "需求"` 能自动推进全流程
+2. 步骤内部确认点正常暂停
+3. 异常时暂停等用户介入

package/.sillyspec/changes/auto-mode/requirements.md ADDED Viewed

@@ -0,0 +1,21 @@
+# auto mode — 需求
+author: qinyi
+created_at: 2026-04-08 07:29:00
+## 功能需求
+### FR1: 阶段自动推进
+- 按 brainstorm → plan → execute → verify 顺序自动执行
+- 当前阶段完成后自动进入下一阶段
+### FR2: 步骤自动循环
+- 每个步骤：读 prompt → 执行 → 自动 --done → 读下一步
+- 不需要用户手动触发 --done
+### FR3: 确认点保留
+- prompt 中有用户确认要求时暂停等回复
+- 纯内部操作步骤自动完成
+### FR4: 异常处理
+- 命令失败时暂停，展示错误，等用户介入

package/.sillyspec/changes/auto-mode/tasks.md ADDED Viewed

@@ -0,0 +1,7 @@
+# auto mode — 任务
+author: qinyi
+created_at: 2026-04-08 07:29:00
+- [x] 编写 sillyspec-auto SKILL.md — `.claude/skills/sillyspec-auto/SKILL.md`
+- [x] 同步到 agents skills — `~/.agents/skills/sillyspec-auto/SKILL.md`

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "sillyspec",
-  "version": "3.7.33",
+  "version": "3.8.0",
   "description": "SillySpec CLI — 流程状态机，让 AI 严格按步骤来",
   "icon": "logo.jpg",
   "homepage": "https://sillyspec.ppdmq.top/",

package/src/derive.js ADDED Viewed

@@ -0,0 +1,147 @@
+import { readdirSync, existsSync } from 'fs';
+import { join } from 'path';
+/**
+ * 从 artifacts 文件系统反推状态，与 progress.json 交叉校验。
+ * 纯函数，零外部副作用（fix 模式除外）。
+ *
+ * @param {string} cwd - 项目根目录
+ * @param {object} options
+ * @param {'light'|'full'} options.mode - 轻量（当前步骤）或全量扫描
+ * @param {boolean} options.fix - 是否自动修复明显不一致
+ * @param {object} options.pm - ProgressManager 实例（fix 模式需要）
+ * @param {object} options.progress - 已加载的 progress 数据
+ * @returns {{ issues: Array<{type:string, severity:string, stage:string, step:number, message:string, suggestion:string}>, fixed: number }}
+ */
+export function deriveState(cwd, options = {}) {
+  const { mode = 'light', fix = false, pm = null, progress = null } = options;
+  const issues = [];
+  let fixed = 0;
+  const artifactsDir = join(cwd, '.sillyspec', '.runtime', 'artifacts');
+  if (!existsSync(artifactsDir)) {
+    return { issues: [{ type: 'no-artifacts', severity: 'info', stage: '-', step: 0, message: 'artifacts 目录不存在', suggestion: '正常，项目刚开始' }], fixed: 0 };
+  }
+  // 扫描 artifacts 文件，解析 stage/step 信息
+  const artifactMap = {}; // { "stage:stepN": [filenames] }
+  const stageStepSet = new Set(); // "stage:stepN"
+  let files;
+  try {
+    files = readdirSync(artifactsDir).filter(f => f.endsWith('.txt'));
+  } catch {
+    return { issues: [], fixed: 0 };
+  }
+  for (const file of files) {
+    // 格式: {stage}-step{N}-{timestamp}.txt
+    const match = file.match(/^(.+)-step(\d+)-\d+\.txt$/);
+    if (match) {
+      const [, stage, stepStr] = match;
+      const key = `${stage}:${stepStr}`;
+      if (!artifactMap[key]) artifactMap[key] = [];
+      artifactMap[key].push(file);
+      stageStepSet.add(key);
+    }
+  }
+  // 需要检查的阶段
+  let stagesToCheck = [];
+  if (progress) {
+    if (mode === 'light') {
+      // 轻量：只检查 currentStage
+      const currentStage = progress.currentStage || '';
+      if (currentStage) stagesToCheck.push(currentStage);
+    } else {
+      // 全量：检查所有阶段
+      stagesToCheck = Object.keys(progress.stages || {});
+    }
+  } else {
+    // 没有 progress 数据，从 artifacts 推断所有阶段
+    for (const key of stageStepSet) {
+      const stage = key.split(':')[0];
+      if (!stagesToCheck.includes(stage)) stagesToCheck.push(stage);
+    }
+  }
+  for (const stage of stagesToCheck) {
+    const stageData = progress?.stages?.[stage];
+    const steps = stageData?.steps || [];
+    // 收集 artifacts 中该阶段的步骤编号
+    const artifactSteps = new Set();
+    for (const key of stageStepSet) {
+      const [s, n] = key.split(':');
+      if (s === stage) artifactSteps.add(parseInt(n));
+    }
+    // 检查1：artifacts 有但 progress 未标记完成
+    for (const stepNum of artifactSteps) {
+      const stepIdx = stepNum - 1;
+      if (stepIdx < steps.length) {
+        const step = steps[stepIdx];
+        if (step.status !== 'done') {
+          issues.push({
+            type: 'missing-progress',
+            severity: 'issue',
+            stage,
+            step: stepNum,
+            message: `artifacts 有 ${stage}-step${stepNum} 文件但 progress 未标记完成`,
+            suggestion: '标记该步骤为 done'
+          });
+          if (fix && pm && progress) {
+            step.status = 'done';
+            pm._write(cwd, progress);
+            fixed++;
+          }
+        }
+      }
+    }
+    // 检查2：progress 有但 artifacts 无文件（warning，不修复）
+    for (let i = 0; i < steps.length; i++) {
+      const step = steps[i];
+      if (step.status === 'done' && !artifactSteps.has(i + 1)) {
+        issues.push({
+          type: 'missing-artifact',
+          severity: 'warning',
+          stage,
+          step: i + 1,
+          message: `${stage} step ${i + 1} 标记完成但 artifacts 无对应文件`,
+          suggestion: '可能被手动清理，忽略即可'
+        });
+      }
+    }
+    // 检查3：artifacts 有 step5 但 progress 只到 step3（中间漏记）
+    if (artifactSteps.size > 0) {
+      const maxArtifactStep = Math.max(...artifactSteps);
+      const maxProgressDoneStep = steps.reduce((max, s, i) => s.status === 'done' ? Math.max(max, i + 1) : max, 0);
+      if (maxArtifactStep > maxProgressDoneStep && maxProgressDoneStep > 0) {
+        // 检查中间是否有漏记的
+        for (let i = maxProgressDoneStep + 1; i <= maxArtifactStep; i++) {
+          if (artifactSteps.has(i) && i - 1 < steps.length && steps[i - 1].status !== 'done') {
+            // 已在检查1处理，跳过
+          }
+        }
+      }
+      // progress 步骤数少于 artifacts 最大步骤号
+      if (maxArtifactStep > steps.length) {
+        issues.push({
+          type: 'missing-steps',
+          severity: 'issue',
+          stage,
+          step: steps.length + 1,
+          message: `artifacts 有 step${maxArtifactStep} 但 progress 只有 ${steps.length} 个步骤`,
+          suggestion: 'progress 数据可能不完整'
+        });
+      }
+    }
+  }
+  return { issues, fixed };
+}

package/src/index.js CHANGED Viewed

@@ -128,7 +128,7 @@ async function main() {
           pm.show(dir);
           break;
         case 'validate':
-          pm.validate(dir);
+          await pm.validate(dir, filteredArgs.includes('--deep'));
           break;
         case 'reset':
           pm.reset(dir, stage);
@@ -168,6 +168,32 @@ async function main() {
           pm.completeStage(dir, compStageName);
           break;
         }
+        case 'batch': {
+          if (filteredArgs.includes('--status')) {
+            const bp = pm.readBatchProgress(dir);
+            if (!bp) { console.log('📭 无批量进度数据'); break; }
+            const line = pm._renderBatchProgress(bp);
+            console.log(line || '📭 无批量进度数据');
+            console.log(JSON.stringify(bp, null, 2));
+          } else {
+            let batchData = {};
+            const a = args;
+            for (let i = 0; i < a.length; i++) {
+              if (a[i] === '--total' && a[i + 1]) { batchData.total = parseInt(a[i + 1]); i++; }
+              if (a[i] === '--completed' && a[i + 1]) { batchData.completed = parseInt(a[i + 1]); i++; }
+              if (a[i] === '--failed' && a[i + 1]) { batchData.failed = parseInt(a[i + 1]); i++; }
+              if (a[i] === '--skipped' && a[i + 1]) { batchData.skipped = parseInt(a[i + 1]); i++; }
+            }
+            if (Object.keys(batchData).length === 0) {
+              console.log('用法: sillyspec progress batch --total 100 --completed 73');
+              console.log('     sillyspec progress batch --status');
+              break;
+            }
+            pm.updateBatchProgress(dir, batchData);
+            console.log('✅ 批量进度已更新');
+          }
+          break;
+        }
         default:
           console.log('用法: sillyspec progress <init|show|validate|reset|complete|set-stage|add-step|update-step|complete-stage>');
       }

package/src/progress.js CHANGED Viewed

@@ -289,6 +289,15 @@ export class ProgressManager {
       }
     }
+    // 批量进度
+    if (data.batchProgress) {
+      const batchLine = this._renderBatchProgress(data.batchProgress);
+      if (batchLine) {
+        console.log('');
+        console.log(`  ${batchLine}`);
+      }
+    }
     console.log('');
   }
@@ -296,7 +305,7 @@ export class ProgressManager {
     this.show(cwd);
   }
-  validate(cwd) {
+  async validate(cwd, deep = false) {
     const data = this.read(cwd);
     if (!data) { console.log('❌ 无法读取 progress.json'); return false; }
@@ -328,6 +337,26 @@ export class ProgressManager {
       this._write(cwd, fixed);
       console.log('✅ 已修复并备份');
     }
+    if (deep) {
+      try {
+        const { deriveState } = await import('./derive.js');
+        const result = deriveState(cwd, { mode: 'full', fix: true, pm: this, progress: this.read(cwd) });
+        if (result.issues.length > 0) {
+          console.log(`\n📋 deriveState 深度校验（${result.issues.length} 项）：`);
+          for (const issue of result.issues) {
+            const icon = issue.severity === 'issue' ? '🔴' : issue.severity === 'warning' ? '🟡' : '⚪';
+            console.log(`  ${icon} ${issue.stage} step ${issue.step}: ${issue.message}`);
+          }
+          if (result.fixed > 0) console.log(`  🔧 已自动修复 ${result.fixed} 项`);
+        } else {
+          console.log('✅ deriveState 深度校验通过，无不一致');
+        }
+      } catch (e) {
+        console.log(`⚠️ deriveState 校验失败: ${e.message}`);
+      }
+    }
     return true;
   }
@@ -414,6 +443,44 @@ export class ProgressManager {
     return `${Math.floor(h / 24)} 天前`;
   }
+  // ── 批量进度 ──
+  updateBatchProgress(cwd, batchData) {
+    const data = this._readOrInit(cwd);
+    if (!data) return;
+    if (!data.batchProgress) {
+      data.batchProgress = { total: 0, completed: 0, failed: 0, skipped: 0 };
+    }
+    if (batchData.total !== undefined) data.batchProgress.total = batchData.total;
+    if (batchData.completed !== undefined) data.batchProgress.completed = batchData.completed;
+    if (batchData.failed !== undefined) data.batchProgress.failed = batchData.failed;
+    if (batchData.skipped !== undefined) data.batchProgress.skipped = batchData.skipped;
+    data.lastActive = new Date().toLocaleString('zh-CN', { hour12: false });
+    this._backup(cwd);
+    this._write(cwd, data);
+  }
+  readBatchProgress(cwd) {
+    const data = this.read(cwd);
+    return data?.batchProgress || null;
+  }
+  _renderBatchProgress(batchProgress) {
+    if (!batchProgress || !batchProgress.total) return null;
+    const { total, completed = 0, failed = 0, skipped = 0 } = batchProgress;
+    const done = Math.min(completed + failed + skipped, total);
+    const barLen = 20;
+    const filled = Math.round((completed / total) * barLen);
+    const bar = '█'.repeat(filled) + '░'.repeat(barLen - filled);
+    const parts = [];
+    if (failed > 0) parts.push(`${failed} 失败`);
+    if (skipped > 0) parts.push(`${skipped} 跳过`);
+    const suffix = parts.length ? ` (${parts.join(', ')})` : '';
+    return `📊 批量进度: ${bar} ${completed}/${total}${suffix}`;
+  }
   _ensureGitignore(cwd) {
     const gitignorePath = join(cwd, '.gitignore');
     const rule = '.sillyspec/.runtime/';

package/src/run.js CHANGED Viewed

@@ -95,12 +95,27 @@ function outputStep(stageName, stepIndex, steps, cwd) {
   const total = steps.length
   const projectName = basename(cwd)
+  const personas = {
+    brainstorm: `### 🎯 你的角色：资深架构师
+你是一位有 15 年经验的系统架构师。先理解业务本质，再设计技术方案。决策附理由，方案列 trade-off。不确定就说不确定，不猜。`,
+    plan: `### 📋 你的角色：技术项目经理
+你是一位经验丰富的技术项目经理。任务拆解粒度均匀，依赖关系明确。每个任务有完成标准，Wave 间有依赖说明。条理清晰，不做模糊描述。`,
+    execute: `### 💻 你的角色：高级工程师
+你是一位严谨的高级工程师。先读规范再写代码，严格遵循 CONVENTIONS.md。代码有清晰职责划分，边界处理完善。少说多做，遇到规范冲突优先问。`,
+    verify: `### 🔍 你的角色：QA 专家
+你是一位吹毛求疵的 QA 专家。假设所有代码都有 bug，用最坏情况测试。关注边界、异常、并发。有问题直说，用证据说话，不写"看起来没问题"。`
+  }
   console.log(`---`)
   console.log(`stage: ${stageName}`)
   console.log(`step: ${stepIndex + 1}/${total}`)
   console.log(`stepName: ${step.name}`)
   console.log(`project: ${projectName}`)
   console.log(`---\n`)
+  if (personas[stageName]) {
+    console.log(personas[stageName])
+    console.log('')
+  }
   console.log(`## Step ${stepIndex + 1}/${total}: ${step.name}\n`)
   console.log(step.prompt)
   console.log(`\n### ⚠️ 铁律`)
@@ -123,9 +138,15 @@ export async function runCommand(args, cwd) {
   const stageName = args[0]
   const flags = args.slice(1)
-  if (!stageName || !stageRegistry[stageName]) {
-    console.error(`❌ 未知阶段: ${stageName || '(未指定)'}`)
-    console.error(`可选: ${Object.keys(stageRegistry).join(', ')}`)
+  if (!stageName) {
+    console.error('❌ 请指定阶段，例如: sillyspec run brainstorm')
+    console.error(`可选: ${Object.keys(stageRegistry).join(', ')}, auto`)
+    process.exit(1)
+  }
+  if (!stageRegistry[stageName] && stageName !== 'auto') {
+    console.error(`❌ 未知阶段: ${stageName}`)
+    console.error(`可选: ${Object.keys(stageRegistry).join(', ')}, auto`)
     process.exit(1)
   }
@@ -169,6 +190,11 @@ export async function runCommand(args, cwd) {
     progress = pm.init(cwd)
   }
+  // -- auto 模式：自动推进所有流程阶段
+  if (stageName === 'auto') {
+    return await runAutoMode(pm, progress, cwd, flags)
+  }
   // --change 设置当前变更名
   if (changeName) {
     progress.currentChange = changeName
@@ -329,6 +355,15 @@ async function completeStep(pm, progress, stageName, cwd, outputText, inputText
     progress.lastActive = new Date().toLocaleString('zh-CN',{hour12:false})
     pm._write(cwd, progress)
+    // deriveState 轻量校验
+    try {
+      const { deriveState } = await import('./derive.js')
+      const result = deriveState(cwd, { mode: 'light', fix: true, pm, progress })
+      if (result.fixed > 0) {
+        console.log(`⚠️ 状态修复：${result.fixed} 个步骤已从 artifacts 恢复`)
+      }
+    } catch {}
     // Append to user-inputs.md
     if (outputText) {
       const inputsPath = join(cwd, '.sillyspec', '.runtime', 'user-inputs.md')
@@ -419,6 +454,23 @@ function showStatus(progress, stageName) {
   const firstPending = steps.findIndex(s => s.status === 'pending')
+  // 批量进度
+  if (progress.batchProgress) {
+    const bp = progress.batchProgress
+    const total = bp.total || 0
+    const completed = bp.completed || 0
+    const failed = bp.failed || 0
+    const skipped = bp.skipped || 0
+    const barLen = 20
+    const filled = Math.round((completed / Math.max(total, 1)) * barLen)
+    const bar = '█'.repeat(filled) + '░'.repeat(barLen - filled)
+    const parts = []
+    if (failed > 0) parts.push(`${failed} 失败`)
+    if (skipped > 0) parts.push(`${skipped} 跳过`)
+    const suffix = parts.length ? ` (${parts.join(', ')})` : ''
+    console.log(`\n📊 批量进度: ${bar} ${completed}/${total}${suffix}\n`)
+  }
   steps.forEach((step, i) => {
     const icon = step.status === 'completed' ? '✅' : step.status === 'skipped' ? '⏭️' : '⬜'
     const isCurrent = step.status === 'pending' && i === firstPending
@@ -438,3 +490,110 @@ async function resetStage(pm, progress, stageName, cwd) {
   pm._write(cwd, progress)
   console.log(`🔄 ${stageName} 阶段已重置`)
 }
+/**
+ * auto 模式：自动推进 brainstorm → plan → execute → verify
+ */
+async function runAutoMode(pm, progress, cwd, flags) {
+  const flowStages = ['brainstorm', 'plan', 'execute', 'verify']
+  const isDone = flags.includes('--done')
+  let outputText = null
+  const outputIdx = flags.indexOf('--output')
+  if (outputIdx !== -1 && flags[outputIdx + 1]) outputText = flags[outputIdx + 1]
+  let inputText = null
+  const inputIdx = flags.indexOf('--input')
+  if (inputIdx !== -1 && flags[inputIdx + 1]) inputText = flags[inputIdx + 1]
+  if (!isDone) {
+    // 首次启动：显示当前状态和下一步
+    const currentStage = progress.currentStage || flowStages[0]
+    const stageIdx = flowStages.indexOf(currentStage)
+    if (stageIdx === -1) {
+      console.error(`❌ 当前阶段 ${currentStage} 不在 auto 流程中`)
+      console.error(`auto 流程: ${flowStages.join(' → ')}`)
+      process.exit(1)
+    }
+    // 显示进度概览
+    console.log('════════════════════════════════════════')
+    console.log('  🤖 SillySpec Auto Mode')
+    console.log('════════════════════════════════════════')
+    console.log(`  流程: ${flowStages.join(' → ')}`)
+    console.log(`  当前: ${currentStage}`)
+    for (let i = 0; i < flowStages.length; i++) {
+      const s = flowStages[i]
+      const stageData = progress.stages[s]
+      const done = stageData?.status === 'completed'
+      const active = s === currentStage
+      const total = stageData?.steps?.length || '?'
+      const completed = stageData?.steps?.filter(st => st.status === 'completed').length || 0
+      const icon = done ? '✅' : active ? '🔵' : '⬜'
+      console.log(`  ${icon} ${s} (${completed}/${total})`)
+    }
+    console.log('')
+    // 输出当前步骤 prompt
+    const steps = await getStageSteps(currentStage, cwd, progress)
+    if (!steps) {
+      console.error(`❌ 无法获取 ${currentStage} 步骤`)
+      process.exit(1)
+    }
+    const pendingIdx = steps.findIndex(s => s.status === 'pending')
+    if (pendingIdx === -1) {
+      // 阶段已完成，提示进入下一阶段
+      const next = getNextStage(currentStage)
+      if (next) {
+        console.log(`✅ ${currentStage} 已完成，下一步：sillyspec run auto --done --output "${currentStage} 完成"`)
+      } else {
+        console.log('🎉 全部流程已完成！')
+      }
+      return
+    }
+    outputStepPrompt(steps, pendingIdx, currentStage, cwd, progress)
+    return
+  }
+  // --done：完成当前步骤，如果阶段完成则自动推进
+  if (!outputText) {
+    console.error('❌ auto --done 需要 --output 参数')
+    process.exit(1)
+  }
+  const currentStage = progress.currentStage
+  const stageIdx = flowStages.indexOf(currentStage)
+  if (stageIdx === -1) {
+    console.error(`❌ 当前阶段 ${currentStage} 不在 auto 流程中`)
+    process.exit(1)
+  }
+  // 完成当前步骤
+  const completed = await completeStep(pm, progress, currentStage, cwd, outputText, inputText)
+  if (!completed) return
+  // 检查阶段是否完成
+  const nextPendingIdx = progress.stages[currentStage]?.steps?.findIndex(s => s.status === 'pending')
+  if (nextPendingIdx === -1) {
+    // 阶段已完成
+    const next = getNextStage(currentStage)
+    if (next) {
+      console.log(`\n✅ ${currentStage} 阶段完成，自动进入 ${next}`)
+      // 输出下一阶段第一步 prompt
+      const nextSteps = await getStageSteps(next, cwd, progress)
+      if (nextSteps) {
+        const firstPending = nextSteps.findIndex(s => s.status === 'pending')
+        if (firstPending !== -1) {
+          outputStepPrompt(nextSteps, firstPending, next, cwd, progress)
+        }
+      }
+    } else {
+      console.log('\n🎉 全部流程已完成！建议运行 /sillyspec:commit 提交改动')
+    }
+  } else {
+    // 阶段内下一步
+    const steps = await getStageSteps(currentStage, cwd, progress)
+    if (steps) {
+      const firstPending = steps.findIndex(s => s.status === 'pending')
+      if (firstPending !== -1) {
+        outputStepPrompt(steps, firstPending, currentStage, cwd, progress)
+      }
+    }
+  }
+}

package/src/stages/brainstorm.js CHANGED Viewed

@@ -80,7 +80,7 @@ export const definition = {
     },
     {
       name: '需求范围评估',
-      prompt: `评估需求复杂度，判断是否需要拆分。
+      prompt: `评估需求复杂度，判断是否需要拆分或走批量模式。
 ### 操作
 1. 根据分析结果判断复杂度
@@ -89,15 +89,37 @@ export const definition = {
    - 3+ 种角色有不同权限和视图
    - 跨页面状态流转（审批流、多步表单）
    - 模块间耦合度低可独立开发
-3. 需要拆分 → 生成 MASTER.md，规划子阶段
-4. 不需要拆分 → 继续
+3. 满足以下条件建议走**批量模式**：
+   - 任务数量 > 10 且任务间有重复模式（如 100 个报表、50 个表单、N 个相似页面）
+   - 本质是「模板 × 数据」而非 N 个独立功能
+   - 直接逐个开发会导致 plan.md 膨胀和上下文溢出
+4. 需要拆分 → 生成 MASTER.md，规划子阶段
+5. 检测到批量模式 → 输出提示并建议用户确认
+6. 都不需要 → 继续
+### 批量模式指引
+确认后，后续 plan/execute 按以下原则调整：
+- **不要**把每个实例列为独立任务（不要写 100 个 checkbox）
+- plan 设计通用架构（引擎/模板/配置格式），任务数控制在 10 个以内
+- 数据转换用脚本完成（Excel → 配置文件），不消耗 AI 上下文
+- execute 每个 Wave 独立模块，Wave 间通过接口定义解耦
+- verify 用脚本全量验证 + AI 抽查边界案例
+### 半批量场景
+如果任务中大部分相似但有少量特殊任务（如 20 个任务中 15 个相似、5 个特殊）：
+- **主簇**（>10 个相似）→ 走批量模式（引擎 + 配置）
+- **小簇**（2-5 个相似）→ 走简化版批量（基于主簇模板扩展）
+- **孤立任务**（1 个）→ 走标准开发流程
+- 建议用「继承 + override」配置解决特殊任务，配置解决不了的才写定制代码
+- 架构设计时预留扩展点（hooks/overrides），让特殊任务能"挂上去"而不是"另起炉灶"
 ### 输出
-拆分方案（如需要）或"无需拆分"确认
+拆分方案 / 批量模式确认 / "无需拆分"确认
 ### 注意
 - 简单 CRUD 不拆
-- 拆分方案需用户确认`,
+- 拆分方案需用户确认
+- 批量模式需用户确认`,
       outputHint: '拆分方案或无需拆分确认',
       optional: true
     },

package/src/stages/doctor.js CHANGED Viewed

@@ -108,7 +108,16 @@ done
 ### 注意
 - 不要编造路径或结果，严格基于命令输出
-- 如果 .sillyspec/ 不存在，直接输出 ❌ 并跳过后续检查`,
+- 如果 .sillyspec/ 不存在，直接输出 ❌ 并跳过后续检查
+- 额外运行 deriveState 全量校验：
+  \
+\
+\
+  node -e "import('./src/derive.js').then(m => { const pm = require('./progress.js'); const r = m.deriveState('.', {mode:'full',fix:false,progress:pm.read('.')}); console.log(JSON.stringify(r, null, 2)); })" 2>/dev/null || echo "deriveState 不可用"
+  \
+\
+\
+  将 deriveState 的 issues 列表纳入 SillySpec 内部检查结果中`,
       outputHint: 'SillySpec 内部检查结果',
       optional: false
     },

package/src/stages/execute.js CHANGED Viewed

@@ -177,6 +177,11 @@ function buildWavePrompt(wave, waveIndex) {
   }).join('\n')
   return `## Wave ${waveIndex}: 执行以下任务
+### Wave 开始前
+1. 读取 design.md 的「编码铁律」章节（如果存在），严格遵守
+2. 确认本 Wave 的输入/输出契约（前置 Wave 产出了什么，本 Wave 需要消费什么）
+3. 检查前置 Wave 的产出是否完整（文件是否存在、测试是否通过）
 ### 本 Wave 任务
 ${taskList}

package/src/stages/plan.js CHANGED Viewed

@@ -112,6 +112,15 @@ export const definition = {
 3. 有依赖的 Task 按顺序排列
 4. Wave 编号从 1 开始
+### 批量模式指引
+如果 design.md 或需求中包含批量特征（关键词：批量/模板/引擎/100个/50个/N个相似/Excel×数据），按以下原则规划：
+❌ 不要列出每个实例作为独立任务（不要写 100 个 checkbox）
+❌ 不要在 plan.md 中嵌入数据（Excel 内容、表单字段列表等）
+✅ 设计通用架构（引擎/模板/配置格式），Wave 1 聚焦架构
+✅ 数据转换用脚本完成（Excel → 配置文件），单独一个 Wave
+✅ 总任务数控制在 10 个以内
+✅ 在 design.md 中增加「编码铁律」章节（3-5 条不可违反的约束）
 ### 操作
 1. 读取 tasks.md 获取任务列表
 2. 读取 design.md 获取文件变更清单

package/src/stages/verify.js CHANGED Viewed

@@ -39,6 +39,12 @@ export const definition = {
 2. 检查代码是否实现了描述的功能
 3. 标记：✅ 已完成 / ❌ 未完成 / ⚠️ 部分完成
+### 批量模式验证指引
+如果 tasks.md 中有批量特征（引擎/模板/配置/批量生成），采用分层验证：
+- **L1 自动化（100%）**：运行验证脚本（如有），检查所有实例的文件存在、格式正确、Schema 校验通过
+- **L2 AI 抽查（5-10 个）**：选择最复杂的 3 个 + 最简单的 2 个 + 有特殊逻辑的，检查业务逻辑正确性
+- **L3 模式性 bug 检测**：L2 发现 bug → 判断是否为系统性问题 → 系统性 bug 则回退修复引擎并重新生成所有实例
 ### 输出
 任务完成度列表 + 完成率