@hongmaple0820/scale-engine 0.49.0 → 0.50.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.en.md +2 -2
- package/README.md +2 -2
- package/dist/api/DashboardHttpConfig.d.ts +28 -0
- package/dist/api/DashboardHttpConfig.js +110 -0
- package/dist/api/DashboardHttpConfig.js.map +1 -0
- package/dist/api/cli.js +102 -11
- package/dist/api/cli.js.map +1 -1
- package/dist/api/http.d.ts +1 -0
- package/dist/api/http.js +52 -0
- package/dist/api/http.js.map +1 -0
- package/dist/artifact/types.d.ts +5 -0
- package/dist/artifact/types.js.map +1 -1
- package/dist/bootstrap/DependencyBootstrap.d.ts +1 -0
- package/dist/bootstrap/DependencyBootstrap.js +14 -3
- package/dist/bootstrap/DependencyBootstrap.js.map +1 -1
- package/dist/cli/cortexApplyCommand.d.ts +26 -0
- package/dist/cli/cortexApplyCommand.js +74 -0
- package/dist/cli/cortexApplyCommand.js.map +1 -0
- package/dist/cli/cortexCandidateCommands.d.ts +42 -0
- package/dist/cli/cortexCandidateCommands.js +119 -0
- package/dist/cli/cortexCandidateCommands.js.map +1 -0
- package/dist/cli/cortexCommands.d.ts +31 -0
- package/dist/cli/cortexCommands.js +102 -17
- package/dist/cli/cortexCommands.js.map +1 -1
- package/dist/cli/engineBootstrap.d.ts +1 -1
- package/dist/cli/engineBootstrap.js +2 -0
- package/dist/cli/engineBootstrap.js.map +1 -1
- package/dist/cli/evalCommands.js +1 -0
- package/dist/cli/evalCommands.js.map +1 -1
- package/dist/cli/phaseCommands.d.ts +28 -0
- package/dist/cli/phaseCommands.js +148 -9
- package/dist/cli/phaseCommands.js.map +1 -1
- package/dist/cli/runtimeSkillCommands.js +12 -2
- package/dist/cli/runtimeSkillCommands.js.map +1 -1
- package/dist/cli/shieldCommands.d.ts +1 -0
- package/dist/cli/shieldCommands.js +20 -7
- package/dist/cli/shieldCommands.js.map +1 -1
- package/dist/cli/workflowEvidenceCommands.d.ts +120 -0
- package/dist/cli/workflowEvidenceCommands.js +228 -2
- package/dist/cli/workflowEvidenceCommands.js.map +1 -1
- package/dist/cortex/AutoFixEventObservations.d.ts +11 -0
- package/dist/cortex/AutoFixEventObservations.js +72 -0
- package/dist/cortex/AutoFixEventObservations.js.map +1 -0
- package/dist/cortex/GateEvidenceObservations.d.ts +22 -0
- package/dist/cortex/GateEvidenceObservations.js +179 -0
- package/dist/cortex/GateEvidenceObservations.js.map +1 -0
- package/dist/cortex/GovernanceMetrics.d.ts +2 -0
- package/dist/cortex/GovernanceMetrics.js +112 -22
- package/dist/cortex/GovernanceMetrics.js.map +1 -1
- package/dist/cortex/InstinctApplicationRecorder.d.ts +28 -0
- package/dist/cortex/InstinctApplicationRecorder.js +145 -0
- package/dist/cortex/InstinctApplicationRecorder.js.map +1 -0
- package/dist/cortex/InstinctCandidateAudit.d.ts +3 -0
- package/dist/cortex/InstinctCandidateAudit.js +39 -0
- package/dist/cortex/InstinctCandidateAudit.js.map +1 -0
- package/dist/cortex/InstinctCandidateReview.d.ts +32 -0
- package/dist/cortex/InstinctCandidateReview.js +125 -0
- package/dist/cortex/InstinctCandidateReview.js.map +1 -0
- package/dist/cortex/InstinctExtractor.d.ts +1 -0
- package/dist/cortex/InstinctExtractor.js +24 -17
- package/dist/cortex/InstinctExtractor.js.map +1 -1
- package/dist/cortex/InstinctRuntimeEvidence.d.ts +14 -0
- package/dist/cortex/InstinctRuntimeEvidence.js +120 -0
- package/dist/cortex/InstinctRuntimeEvidence.js.map +1 -0
- package/dist/cortex/InstinctStore.d.ts +31 -4
- package/dist/cortex/InstinctStore.js +120 -20
- package/dist/cortex/InstinctStore.js.map +1 -1
- package/dist/cortex/SessionInjector.d.ts +1 -0
- package/dist/cortex/SessionInjector.js +54 -4
- package/dist/cortex/SessionInjector.js.map +1 -1
- package/dist/dashboard/DashboardServer.d.ts +237 -0
- package/dist/dashboard/DashboardServer.js +1083 -19
- package/dist/dashboard/DashboardServer.js.map +1 -1
- package/dist/dashboard/spa/assets/index-VYBCLBje.js +11 -0
- package/dist/dashboard/spa/assets/index-VhwY_ac1.css +1 -0
- package/dist/dashboard/spa/assets/naive-ui-BQy2AJkt.js +3340 -0
- package/dist/dashboard/spa/assets/vendor-BPU6aOYA.js +3 -0
- package/dist/dashboard/spa/assets/vue-CQQMb5Wi.js +17 -0
- package/dist/dashboard/spa/index.html +16 -0
- package/dist/env/EnvironmentDoctor.js +12 -7
- package/dist/env/EnvironmentDoctor.js.map +1 -1
- package/dist/eval/WorkflowEval.d.ts +9 -0
- package/dist/eval/WorkflowEval.js +348 -2
- package/dist/eval/WorkflowEval.js.map +1 -1
- package/dist/memory/MemoryBrain.d.ts +13 -0
- package/dist/memory/MemoryBrain.js +47 -0
- package/dist/memory/MemoryBrain.js.map +1 -1
- package/dist/memory/MemoryFabric.d.ts +14 -1
- package/dist/memory/MemoryFabric.js +72 -8
- package/dist/memory/MemoryFabric.js.map +1 -1
- package/dist/memory/MemoryLearning.d.ts +1 -0
- package/dist/memory/MemoryLearning.js +6 -3
- package/dist/memory/MemoryLearning.js.map +1 -1
- package/dist/memory/MemoryProviders.d.ts +8 -1
- package/dist/memory/MemoryProviders.js +143 -29
- package/dist/memory/MemoryProviders.js.map +1 -1
- package/dist/runtime/AiOsRuntime.d.ts +14 -1
- package/dist/runtime/AiOsRuntime.js +59 -3
- package/dist/runtime/AiOsRuntime.js.map +1 -1
- package/dist/runtime/RuntimeDoctor.js +3 -1
- package/dist/runtime/RuntimeDoctor.js.map +1 -1
- package/dist/runtime/RuntimeEvidenceLedger.d.ts +6 -0
- package/dist/runtime/RuntimeEvidenceLedger.js +52 -1
- package/dist/runtime/RuntimeEvidenceLedger.js.map +1 -1
- package/dist/runtime/SessionLedger.d.ts +2 -0
- package/dist/runtime/SessionLedger.js +4 -0
- package/dist/runtime/SessionLedger.js.map +1 -1
- package/dist/setup/SetupVerification.js +53 -5
- package/dist/setup/SetupVerification.js.map +1 -1
- package/dist/shield/PolicyCompiler.js +73 -12
- package/dist/shield/PolicyCompiler.js.map +1 -1
- package/dist/shield/ProtectedPaths.js +4 -2
- package/dist/shield/ProtectedPaths.js.map +1 -1
- package/dist/skills/SkillCatalog.d.ts +2 -0
- package/dist/skills/SkillCatalog.js +8 -0
- package/dist/skills/SkillCatalog.js.map +1 -1
- package/dist/skills/SkillDoctor.d.ts +19 -2
- package/dist/skills/SkillDoctor.js +163 -13
- package/dist/skills/SkillDoctor.js.map +1 -1
- package/dist/tools/SafeCommandRunner.d.ts +1 -0
- package/dist/tools/SafeCommandRunner.js +1 -0
- package/dist/tools/SafeCommandRunner.js.map +1 -1
- package/dist/tools/ToolCapabilityRegistry.js +25 -3
- package/dist/tools/ToolCapabilityRegistry.js.map +1 -1
- package/dist/tools/ToolOrchestrator.js +21 -0
- package/dist/tools/ToolOrchestrator.js.map +1 -1
- package/dist/version.d.ts +1 -1
- package/dist/version.js +1 -1
- package/dist/workflow/AgentLoopReadiness.d.ts +103 -0
- package/dist/workflow/AgentLoopReadiness.js +371 -0
- package/dist/workflow/AgentLoopReadiness.js.map +1 -0
- package/dist/workflow/EcosystemReadinessGate.d.ts +46 -0
- package/dist/workflow/EcosystemReadinessGate.js +126 -0
- package/dist/workflow/EcosystemReadinessGate.js.map +1 -0
- package/dist/workflow/EngineeringStandards.js +48 -3
- package/dist/workflow/EngineeringStandards.js.map +1 -1
- package/dist/workflow/GateCatalog.js +9 -0
- package/dist/workflow/GateCatalog.js.map +1 -1
- package/dist/workflow/GovernanceTemplatePacks.js +2 -26
- package/dist/workflow/GovernanceTemplatePacks.js.map +1 -1
- package/dist/workflow/GovernanceTemplates.js +8 -1
- package/dist/workflow/GovernanceTemplates.js.map +1 -1
- package/dist/workflow/ReleaseDeploymentLedger.d.ts +63 -0
- package/dist/workflow/ReleaseDeploymentLedger.js +154 -0
- package/dist/workflow/ReleaseDeploymentLedger.js.map +1 -0
- package/dist/workflow/ReviewAnalyzer.js +50 -3
- package/dist/workflow/ReviewAnalyzer.js.map +1 -1
- package/dist/workflow/SessionPreamble.d.ts +7 -0
- package/dist/workflow/SessionPreamble.js +48 -9
- package/dist/workflow/SessionPreamble.js.map +1 -1
- package/dist/workflow/VerificationCommands.d.ts +1 -0
- package/dist/workflow/VerificationCommands.js.map +1 -1
- package/dist/workflow/VerificationProfile.d.ts +5 -0
- package/dist/workflow/VerificationProfile.js +26 -0
- package/dist/workflow/VerificationProfile.js.map +1 -1
- package/dist/workflow/VerificationSchema.d.ts +3 -0
- package/dist/workflow/VerificationSchema.js +6 -0
- package/dist/workflow/VerificationSchema.js.map +1 -1
- package/dist/workflow/WorkflowEffectiveness.d.ts +97 -0
- package/dist/workflow/WorkflowEffectiveness.js +302 -0
- package/dist/workflow/WorkflowEffectiveness.js.map +1 -0
- package/dist/workflow/WorkflowEffectivenessRenderer.d.ts +2 -0
- package/dist/workflow/WorkflowEffectivenessRenderer.js +67 -0
- package/dist/workflow/WorkflowEffectivenessRenderer.js.map +1 -0
- package/dist/workflow/WorkflowEffectivenessScoring.d.ts +6 -0
- package/dist/workflow/WorkflowEffectivenessScoring.js +243 -0
- package/dist/workflow/WorkflowEffectivenessScoring.js.map +1 -0
- package/dist/workflow/gates/GateSystem.d.ts +16 -0
- package/dist/workflow/gates/GateSystem.js +208 -41
- package/dist/workflow/gates/GateSystem.js.map +1 -1
- package/dist/workflow/gates/MetaGovernanceGates.js +269 -8
- package/dist/workflow/gates/MetaGovernanceGates.js.map +1 -1
- package/docs/reference/cli.md +2 -1
- package/docs/start/agent-governance-demo.md +1 -1
- package/docs/workflow/ASSESSMENT_INDEX.md +326 -0
- package/docs/workflow/COMPARATIVE_ANALYSIS.md +422 -0
- package/docs/workflow/EXECUTIVE_SUMMARY.md +310 -0
- package/docs/workflow/IMPROVEMENT_CHECKLIST.md +518 -0
- package/docs/workflow/IMPROVEMENT_ROADMAP.md +707 -0
- package/docs/workflow/README.md +9 -1
- package/docs/workflow/templates/github-actions-scale-preflight.yml +4 -1
- package/package.json +10 -3
- package/scripts/workflow/run-vitest.mjs +123 -0
@@ -0,0 +1,518 @@
# SCALE Engine Improvement Implementation Checklist

**Version**: 1.0
**Generated**: 2026-06-03
**Purpose**: Track the implementation progress of each improvement item

---

## Quick Navigation

- [Short-term tasks (3 months)](#短期任务3个月)
- [Mid-term tasks (6 months)](#中期任务6个月)
- [Long-term tasks (1 year)](#长期任务1年)
- [Progress tracking](#进度追踪)

---

## Short-term tasks (3 months)

### 🚀 P0-1: Fast-lane mode (weeks 1-2)

**Goal**: S-level tasks pass verification in under 2 minutes (skipping G9-G22)

#### Tasks

- [ ] **Task card**: create GitHub Issue #XXX "Implement fast-lane profile"
- [ ] **Analysis**: investigate the existing profile implementation (the structure of verification.json)
- [ ] **Design**: draft the fast-lane profile configuration
  - [ ] Document: the profiles schema in `.scale/verification.json`
  - [ ] Decision: which 4 gates (G0, G3, G4, G5)?

- [ ] **Implementation**:
  - [ ] Modify `.scale/verification.json` to add fast-lane
  - [ ] Update `scripts/gates/all.sh` to support `--profile fast-lane`
  - [ ] Add `scripts/gates/fast-lane-verify.sh`

- [ ] **Testing**:
  - [ ] Manual test: typo fix via fast-lane
  - [ ] Manual test: comment-only change via fast-lane
  - [ ] Verify the run takes under 2 minutes

- [ ] **Documentation**:
  - [ ] Create `docs/guides/FAST_COMMIT_GUIDE.md`
  - [ ] Update section 3 of `DEVELOPMENT_WORKFLOW.md`
  - [ ] Update the README quick-start section

- [ ] **Review**: code review + documentation review
- [ ] **Release**: include in the v0.47.0 release

**Deliverables**:
```
✓ FAST_COMMIT_GUIDE.md
✓ .scale/verification.json (fast-lane configuration)
✓ scripts/gates/fast-lane-verify.sh
✓ Unit tests + integration tests
✓ PR with evidence.md
```

**Verification**:
```bash
make new-task NAME=typo LEVEL=S
make gate-workflow --profile fast-lane
# Expected: ✓ passes in under 120s
```
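
As a rough illustration of the fast-lane idea, the sketch below models profiles as named gate lists and selects the gates for a given profile. The `VerificationConfig` and `Profile` shapes, the `full` profile name, and both helper functions are assumptions made for this example; the real `.scale/verification.json` schema is not shown in this checklist and may differ. The four fast-lane gates are the candidates from the design item, which is still an open decision. With that candidate list, G1, G2 and G6-G8 end up skipped in addition to G9-G22.

```typescript
// Hypothetical shapes: the actual `.scale/verification.json` schema may differ.
type GateId = string;

interface Profile {
  gates: GateId[]; // gates this profile runs, in order
}

interface VerificationConfig {
  profiles: Record<string, Profile>;
}

// Example config. `full` covers G0..G22; `fast-lane` uses the candidate gates
// named in the checklist (G0, G3, G4, G5).
const exampleConfig: VerificationConfig = {
  profiles: {
    full: { gates: Array.from({ length: 23 }, (_, i) => `G${i}`) },
    "fast-lane": { gates: ["G0", "G3", "G4", "G5"] },
  },
};

// Return the gates a profile runs, failing loudly on an unknown profile name.
function selectGates(cfg: VerificationConfig, profileName: string): GateId[] {
  const profile = cfg.profiles[profileName];
  if (profile === undefined) {
    throw new Error(`unknown profile: ${profileName}`);
  }
  return [...profile.gates];
}

// Gates that the `full` profile runs but the given profile skips.
function skippedGates(cfg: VerificationConfig, profileName: string): GateId[] {
  const selected = new Set(selectGates(cfg, profileName));
  return selectGates(cfg, "full").filter((g) => !selected.has(g));
}

console.log(selectGates(exampleConfig, "fast-lane").join(", "));
console.log(skippedGates(exampleConfig, "fast-lane").join(", "));
```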

---

### 🚀 P1-5: Learning path and video tutorials (weeks 3-4)

**Goal**: newcomers grasp the basics in 15 minutes and learn day-to-day usage in 30 minutes

#### Tasks

- [ ] **Define learning levels**:
  - [ ] Level 1 (15 min): preflight / verify concepts
  - [ ] Level 2 (30 min): new-task / explore / plan
  - [ ] Level 3 (45 min): gate-workflow / gate-quality
  - [ ] Level 4 (60 min): scale orch / scale shield
  - [ ] Level 5 (90 min): scale cortex / scale ai-os

- [ ] **Write LEARNING_PATH.md**:
  - [ ] Learning objectives for each level
  - [ ] Links to the corresponding docs
  - [ ] 3-5 code examples
  - [ ] FAQ

- [ ] **Produce videos** (5-10 minutes each):
  - [ ] Video 1: 15-minute quick start for newcomers
  - [ ] Video 2: common troubleshooting (gate failures, stuck upgrades)
  - [ ] Video 3: full Cortex demo

  **Publishing platforms**: YouTube / Bilibili / GitHub Discussions

- [ ] **Build an interactive wizard**:
  - [ ] Add `src/commands/onboard.ts`
  - [ ] Implement `scale onboard --interactive`
  - [ ] Ask 3-5 questions to determine workflow needs
  - [ ] Recommend a profile + learning path

- [ ] **Update the landing docs**:
  - [ ] README.md: add a "Quick start" entry linking to LEARNING_PATH
  - [ ] Add a scorecard: usability from 6/10 → 7/10

**Deliverables**:
```
✓ docs/guides/LEARNING_PATH.md
✓ 3 video tutorials (YouTube/Bilibili links)
✓ src/commands/onboard.ts
✓ Corresponding unit tests
✓ README.md update
```

**Verification**:
```bash
# Test the interactive wizard
scale onboard --interactive
# Output: 3-5 questions, then a recommended profile

# Test the newcomer docs
# Invite 3 newcomers and record their completion times
# Target: avg time to first success < 30 min
```

---

### 🚀 P1-7: Performance baseline and documentation (weeks 5-6)

**Goal**: publish baseline gate latencies to show that the workflow overhead is acceptable

#### Tasks

- [ ] **Set up a performance test environment**:
  - [ ] Clean workspace (no other background processes)
  - [ ] Standardized test project (fixed repo size)
  - [ ] Fixed test machine configuration (CPU/RAM/network)

- [ ] **Measure each gate**:
  - [ ] G0 (Build): run 5 times, record avg/min/max
  - [ ] G1 (Explore): check logic only, no tests; record check time
  - [ ] G3 (Test coupling): check logic; record
  - [ ] G4 (Lint): ESLint scan; record
  - [ ] G5 (Tests): vitest run; record
  - [ ] G6-G8: measure each separately
  - [ ] G9-G22: newer gates, first measurement

- [ ] **Generate the baseline document**:
  ```markdown
  # Gate Latency Baseline
  Test environment: MacBook Pro M1, 16GB RAM, clean workspace
  Project: scale-engine (current)

  | Gate | Description | Avg | Min | Max | P95 |
  |------|------|--------|------|------|------|
  | G0 | Build | 45s | 42s | 52s | 50s |
  | G1 | Explore | <1s | - | - | - |
  | G4 | Lint | 8s | 6s | 11s | 10s |
  | G5 | Tests | 120s | 100s | 150s | 140s |
  | Total | Full run (G0-G8) | 200s | 180s | 230s | 220s |
  ```

- [ ] **Performance optimization recommendations**:
  - [ ] Identify the slowest gates
  - [ ] Propose a parallelization mechanism
  - [ ] Evaluate the benefit of asynchronous evidence recording

- [ ] **Set up continuous monitoring**:
  - [ ] Add a GitHub Actions workflow: performance-baseline.yml
  - [ ] Measure automatically on every release
  - [ ] Record the performance trend

**Deliverables**:
```
✓ docs/PERFORMANCE_BASELINE.md
✓ scripts/performance/measure-gates.sh
✓ .github/workflows/performance-baseline.yml
✓ Performance trend data (CSV)
```

**Verification**:
```bash
bash scripts/performance/measure-gates.sh
# Output: baseline.json
# {
#   "G0": {"avg": 45, "min": 42, "max": 52},
#   ...
# }

# Compare against expectation: total < 220s
```
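
One way the per-gate numbers could be derived from raw timings is sketched below. It computes avg/min/max and a nearest-rank P95 from a list of samples, then checks a summed total against a budget. The function names, the `LatencyStats` shape, and the sample timings are all hypothetical; `measure-gates.sh` does not exist yet and may work differently. Worth noting: with only five runs, a nearest-rank P95 is simply the maximum, so a meaningful P95 column probably needs more runs per gate than the five planned for G0.

```typescript
interface LatencyStats {
  avg: number;
  min: number;
  max: number;
  p95: number;
}

// Nearest-rank percentile on an ascending-sorted, non-empty array (0 < p <= 100).
function percentile(sorted: number[], p: number): number {
  const rank = Math.ceil((p / 100) * sorted.length);
  return sorted[Math.max(rank, 1) - 1];
}

// Summarize one gate's timing samples (in seconds).
function summarize(samplesSec: number[]): LatencyStats {
  if (samplesSec.length === 0) {
    throw new Error("no samples");
  }
  const sorted = [...samplesSec].sort((a, b) => a - b);
  const sum = sorted.reduce((acc, x) => acc + x, 0);
  return {
    avg: sum / sorted.length,
    min: sorted[0],
    max: sorted[sorted.length - 1],
    p95: percentile(sorted, 95),
  };
}

// True when the summed per-gate averages stay strictly under the budget.
function totalWithinBudget(perGateAvgSec: Record<string, number>, budgetSec: number): boolean {
  let total = 0;
  for (const gate of Object.keys(perGateAvgSec)) {
    total += perGateAvgSec[gate];
  }
  return total < budgetSec;
}

// Hypothetical G0 samples from five runs.
const g0Stats = summarize([42, 44, 45, 42, 52]);
console.log(g0Stats);
console.log(totalWithinBudget({ G0: 45, G4: 8, G5: 120 }, 220));
```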

---

### ✅ P3-10/11/12: Minor improvements (weeks 7-8)

Three small improvement items, implemented in parallel:

#### P3-10: Make the token budget gate (G21) blocking
- [ ] Modify `.scale/verification.json`: G21.blocking = true (L/CRITICAL)
- [ ] Update `scripts/gates/G21-verify.sh`: strengthen the blocking logic
- [ ] Document the change
- **Deliverable**: 1 file change + 1 script update

#### P3-11: Fine-grained signals for session health (G22)
- [ ] Add signals:
  - [ ] context window utilization trend
  - [ ] memory growth rate
  - [ ] cleanup suggestions
- [ ] Update `scripts/gates/G22-verify.sh`
- **Deliverable**: G22 script + docs

#### P3-12: Make doc link hygiene (G17) blocking
- [ ] Modify `.scale/verification.json`: G17.blocking = true
- [ ] Enhance `scripts/gates/G17-verify.sh`: check links in changed files
- **Deliverable**: script update + tests

---

### 🔬 P0-3: Cortex validation Phase A (weeks 9-12, in parallel)

**Goal**: run the full Cortex cycle on 5 real projects for 2 months and collect data

#### Project selection
- [ ] **Project 1**: small (< 10K LOC), single agent
- [ ] **Project 2**: medium (10-50K), multiple features
- [ ] **Project 3**: large (50K+), multiple agents
- [ ] **Project 4**: fast iteration (daily tasks)
- [ ] **Project 5**: strict standards (L/CRITICAL focus)

#### Data collection
- [ ] Establish a baseline (no Cortex, first 2 weeks)
  - [ ] Record the daily gate fail rate
  - [ ] Record the distribution of failure modes

- [ ] Enable Cortex (following 8 weeks)
  - [ ] `scale cortex evolve --project <name> --observe-mode on`
  - [ ] Collect the Instinct count weekly
  - [ ] Record how Instincts are applied

- [ ] Generate reports
  - [ ] 1 report per project (BEFORE/AFTER comparison)
  - [ ] Consolidated report (all 5 projects)

**Deliverables**:
```
✓ docs/case-studies/CORTEX_VALIDATION_REPORT.md (summary)
✓ docs/case-studies/cortex-project-{1..5}-report.md
✓ cortex_metrics_raw.json (raw data)
```

**Verification**:
```bash
scale cortex metrics --days 60 --projects 5 --compare-baseline
# Expected output:
# - gate fail rate: avg 12% → 8% (↓33%)
# - common patterns: Top 5 identified
# - Instinct applications: N times
```
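
The "↓33%" in the expected output is a relative reduction, not a difference in percentage points (12% to 8% is 4 points, or about one third of the baseline). A minimal sketch of that arithmetic follows; the helper names and the 12-of-100 and 8-of-100 counts are illustrative assumptions, not measured data or part of the planned `scale cortex metrics` command.

```typescript
// Fraction of gate runs that failed; defined as 0 when nothing ran.
function failRate(failed: number, total: number): number {
  return total === 0 ? 0 : failed / total;
}

// Relative change from baseline to current; negative values mean a reduction.
function relativeChange(baseline: number, current: number): number {
  if (baseline === 0) {
    throw new Error("baseline must be non-zero");
  }
  return (current - baseline) / baseline;
}

// Illustrative counts matching the expected output above.
const baselineRate = failRate(12, 100); // 12% before Cortex
const cortexRate = failRate(8, 100); // 8% with Cortex
const change = relativeChange(baselineRate, cortexRate);

// Prints the reduction as a rounded percentage.
console.log(`relative change: ${Math.round(change * 100)}%`);
```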

---

## Mid-term tasks (6 months)

### 🚀 P0-2: Upgrade automation (months 1-2)

**Goal**: fully automate upgrade-check → recommend → apply → verify

#### New commands
- [ ] `scale upgrade recommend`
  - [ ] Analyze breaking changes
  - [ ] Compute a risk score
  - [ ] Generate an automated apply plan

- [ ] `scale upgrade apply --auto-backup`
  - [ ] Automatically create a backup branch
  - [ ] Record the git state before and after the upgrade

- [ ] `scale upgrade verify --compare-baseline`
  - [ ] Benchmark performance after the upgrade
  - [ ] Roll back automatically when a threshold is exceeded

#### Implementation
- [ ] New file: `src/commands/upgrade-recommend.ts`
- [ ] Update: `src/commands/upgrade-apply.ts` to add --auto-backup
- [ ] Update: `src/commands/upgrade-verify.ts` to add --compare-baseline
- [ ] Integration tests (mock upgrade scenarios)

**Deliverables**:
```
✓ src/commands/upgrade-recommend.ts
✓ src/commands/upgrade-*.ts (updated)
✓ docs/guides/UPGRADE_AUTOMATION.md
✓ Upgrade troubleshooting checklist
✓ Integration tests
```

---

### 🚀 P0-3: Cortex validation Phase B (months 1-2, in parallel)

**Goal**: quantify the improvement delivered by Cortex (expected >20%)

#### Controlled experiment design
- [ ] Control group: 100 tasks (without Cortex)
- [ ] Treatment group: 100 tasks (with Cortex)
- [ ] Random assignment
- [ ] Metrics:
  - [ ] gate fail rate
  - [ ] time to first success
  - [ ] number of iterations

- [ ] Publish a first paper draft (optional)

**Deliverables**:
```
✓ docs/case-studies/CORTEX_EFFECTIVENESS_STUDY.md
✓ cortex_ab_test_data.csv
✓ Statistical analysis results (quantified effect)
```
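
For the gate fail rate metric, one plausible analysis is a two-sided two-proportion z-test, sketched below. This is an assumption about how the statistical analysis might be done, not the method the checklist commits to; the function names are hypothetical, and the 12/100 versus 8/100 counts are borrowed from the Phase A expected output purely as an illustration. With those numbers the test gives z of about 0.94 and a p-value of roughly 0.35, which hints that 100 tasks per group may be too few to confirm an effect of that size. The normal CDF uses the Abramowitz and Stegun 7.1.26 approximation of erf.

```typescript
// Abramowitz-Stegun 7.1.26 approximation of erf (absolute error below 1.5e-7).
function erfApprox(x: number): number {
  const sign = x < 0 ? -1 : 1;
  const ax = Math.abs(x);
  const t = 1 / (1 + 0.3275911 * ax);
  const poly =
    ((((1.061405429 * t - 1.453152027) * t + 1.421413741) * t - 0.284496736) * t +
      0.254829592) *
    t;
  return sign * (1 - poly * Math.exp(-ax * ax));
}

// Standard normal cumulative distribution function.
function normalCdf(z: number): number {
  return 0.5 * (1 + erfApprox(z / Math.SQRT2));
}

interface ZTestResult {
  z: number;
  pValue: number;
}

// Two-sided two-proportion z-test using the pooled standard error.
function twoProportionZTest(failA: number, nA: number, failB: number, nB: number): ZTestResult {
  const pA = failA / nA;
  const pB = failB / nB;
  const pooled = (failA + failB) / (nA + nB);
  const se = Math.sqrt(pooled * (1 - pooled) * (1 / nA + 1 / nB));
  if (se === 0) {
    throw new Error("standard error is zero; the test is undefined");
  }
  const z = (pA - pB) / se;
  return { z, pValue: 2 * (1 - normalCdf(Math.abs(z))) };
}

// Illustrative counts: 12 of 100 control tasks fail, 8 of 100 treatment tasks fail.
const abResult = twoProportionZTest(12, 100, 8, 100);
console.log(abResult.z.toFixed(2), abResult.pValue.toFixed(2));
```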

---

### 🚀 P1-6: Cross-platform unification (months 3-4)

**Goal**: retire PowerShell and run everything on Node.js + Bash

#### Migration work
- [ ] Migrate scripts:
  - [ ] `scripts/workflow/verify.ps1` → `src/commands/verify.ts`
  - [ ] `scripts/bootstrap-scale.ps1` → `src/commands/bootstrap.ts`
  - [ ] `scripts/gates/*.sh` → Node.js modules

- [ ] Test matrix:
  - [ ] Linux (GitHub Actions)
  - [ ] macOS (GitHub Actions)
  - [ ] Windows (WSL2)

- [ ] Performance benchmarking

**Deliverables**:
```
✓ Node.js script suite
✓ CI/CD test matrix (3 platforms)
✓ Migration docs + performance benchmarks
```

---

### 🚀 P1-4: Unified configuration DSL (months 5-6)

**Goal**: 7 JSON config files → 1 governance.yaml

#### Design and implementation
- [ ] Schema design: `src/schema/governance.schema.json`
- [ ] Automatic migration tool: `src/commands/config-migrate.ts`
- [ ] Backward compatibility: legacy JSON files remain readable (6-month transition period)
- [ ] Full test coverage: functional equivalence after migration

**Deliverables**:
```
✓ src/schema/governance.schema.json
✓ .scale/governance.yaml (new template)
✓ src/commands/config-migrate.ts
✓ docs/guides/GOVERNANCE_DSL.md
✓ Migration script + full tests
```

---

## Long-term tasks (1 year)

### 🚀 P1-3: Closing the Cortex loop (months 7-12)

**Goal**: fully automate observe → reflect → extract → store → apply

#### Milestones
- [ ] Month 6: Phase A/B data complete; publish case studies
- [ ] Month 8: first paper draft; submit to a conference
- [ ] Month 10: release production-grade Cortex
- [ ] Month 12: community feedback and continuous improvement

---

### 🚀 Academic paper publication (months 8-12)

**Goal**: publish a paper on engineering best practices

#### Paper direction
- [ ] "SCALE Engine: AI Agent Governance with Evidence-Driven Workflow"
- [ ] Main contributions:
  - [ ] Design of the 23-gate system
  - [ ] Evidence chain persistence scheme
  - [ ] Multi-agent coordination framework
  - [ ] Large-scale validation data (500+ agents)

#### Submission targets
- [ ] FSE / ICSE / ICSME (top-tier conferences)
- [ ] Submission expected to be completed in months 8-10

---

### 🚀 Plugin ecosystem (months 6-12)

**Goal**: open up 3-5 third-party skill plugins

#### Example plugins
- [ ] Slack notification skill
- [ ] Jira integration skill
- [ ] GitHub issue automation skill
- [ ] Report generation skill
- [ ] Knowledge base integration skill

#### Infrastructure
- [ ] Open skill API
- [ ] Skill marketplace (registry)
- [ ] Docs & examples
- [ ] Community recruitment (RFC)

---

### 🚀 Analytics dashboard (months 9-12)

**Goal**: show workflow health in real time

#### Metrics
- [ ] Gate pass rate by type
- [ ] Task score distribution
- [ ] Cortex ROI trend
- [ ] Average cycle time
- [ ] Defect rate trend

#### Implementation
- [ ] Backend: Node.js API
- [ ] Frontend: React dashboard (optional)
- [ ] Data source: evidence store

---

## Progress tracking

### Short-term schedule (3 months)

| Weeks | Item | Status | Owner | Notes |
|------|--------|------|-------|------|
| W1-W2 | #1 Fast-lane | ⏳ Not started | ? | Priority: P0 |
| W3-W4 | #5 Learning path | ⏳ Not started | ? | Priority: P1 |
| W5-W6 | #7 Performance baseline | ⏳ Not started | ? | Priority: P1 |
| W7-W8 | #10-12 Minor improvements | ⏳ Not started | ? | Priority: P3 |
| W9-W12 | #3 Cortex PA | ⏳ Not started | ? | Priority: P0 (in parallel) |

### Key dependencies

```
#1 Fast-lane
├─ Depends on: understanding .scale/verification.json
└─ Affects: #2 Upgrade (needs the fast-lane profile)

#5 Learning path
├─ Depends on: none
└─ Affects: overall usability score

#7 Performance baseline
├─ Depends on: none
└─ Affects: #2 Upgrade (basis for performance benchmarking)

#3 Cortex PA
├─ Depends on: 5 projects selected
└─ Affects: #3 PB / paper (after 12 months)
```

### Definition of success

| Item | Completion criterion | Verification method |
|--------|--------|--------|
| #1 | S-level tasks < 2 min | `make gate --profile fast-lane` takes < 120s |
| #3PA | Reports published for 5 projects | case-studies folder contains 5 reports |
| #5 | Newcomer completion time | Invite 3 newcomers; avg time < 30 min |
| #7 | Performance baseline document | PERFORMANCE_BASELINE.md published |

---

## Risks and mitigations

| Risk | Likelihood | Mitigation |
|------|------|--------|
| Fast-lane loosens standards | L | Explicit gate list; education |
| Cortex pilot projects do not cooperate | M | Prioritize friendly internal projects |
| Cross-platform performance regression | M | Benchmark early; fallback plan |
| Data loss during DSL migration | L | Backward compatibility + automated tests |
| Unstable performance tests | M | Take the median of multiple runs; control the environment |

---

## Resource requirements

### Staffing
- 1 PM (overall coordination)
- 2-3 engineers (parallel implementation)
- 1 QA (testing)
- 0.5 community operations (docs/videos)

**Total effort**: ~8-10 person-months

### Tools / infrastructure
- Performance test machines (1-2)
- Video recording tools (free, open source)
- YouTube / Bilibili accounts (already owned)

---

## Approval and activation

- [ ] Technical committee review and approval
- [ ] Assign resources & owners
- [ ] Create a tracking board (GitHub Projects / Jira)
- [ ] First-week kickoff meeting

---

**Maintainer**: @hongmaple0820
**Last updated**: 2026-06-03
**Next review**: 2026-09-03 (after the short-term phase completes)