@hongmaple0820/scale-engine 0.28.0 → 0.33.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (58) hide show
  1. package/README.en.md +3 -0
  2. package/README.md +2 -0
  3. package/dist/api/cli.js +12 -0
  4. package/dist/api/cli.js.map +1 -1
  5. package/dist/evolution/SessionLearnings.d.ts +70 -0
  6. package/dist/evolution/SessionLearnings.js +217 -0
  7. package/dist/evolution/SessionLearnings.js.map +1 -0
  8. package/dist/runtime/AiOsRuntime.d.ts +138 -0
  9. package/dist/runtime/AiOsRuntime.js +671 -14
  10. package/dist/runtime/AiOsRuntime.js.map +1 -1
  11. package/dist/skills/RoleSkills.d.ts +20 -0
  12. package/dist/skills/RoleSkills.js +154 -0
  13. package/dist/skills/RoleSkills.js.map +1 -0
  14. package/dist/skills/SkillDiscovery.d.ts +5 -0
  15. package/dist/skills/SkillDiscovery.js +15 -0
  16. package/dist/skills/SkillDiscovery.js.map +1 -1
  17. package/dist/skills/SkillFrontmatter.d.ts +28 -0
  18. package/dist/skills/SkillFrontmatter.js +152 -0
  19. package/dist/skills/SkillFrontmatter.js.map +1 -0
  20. package/dist/skills/SkillRegistry.d.ts +11 -0
  21. package/dist/skills/SkillRegistry.js +12 -0
  22. package/dist/skills/SkillRegistry.js.map +1 -1
  23. package/dist/skills/index.d.ts +1 -0
  24. package/dist/skills/index.js +1 -0
  25. package/dist/skills/index.js.map +1 -1
  26. package/dist/testing/DiffTestSelector.d.ts +22 -0
  27. package/dist/testing/DiffTestSelector.js +114 -0
  28. package/dist/testing/DiffTestSelector.js.map +1 -0
  29. package/dist/testing/index.d.ts +1 -0
  30. package/dist/testing/index.js +3 -0
  31. package/dist/testing/index.js.map +1 -0
  32. package/dist/workflow/AdaptiveWorkflowRouter.d.ts +37 -0
  33. package/dist/workflow/AdaptiveWorkflowRouter.js +211 -0
  34. package/dist/workflow/AdaptiveWorkflowRouter.js.map +1 -0
  35. package/dist/workflow/EvolutionShadowPromoter.d.ts +46 -0
  36. package/dist/workflow/EvolutionShadowPromoter.js +73 -0
  37. package/dist/workflow/EvolutionShadowPromoter.js.map +1 -0
  38. package/dist/workflow/ReviewAnalyzer.d.ts +15 -0
  39. package/dist/workflow/ReviewAnalyzer.js +82 -0
  40. package/dist/workflow/ReviewAnalyzer.js.map +1 -1
  41. package/dist/workflow/SecurityAudit.d.ts +27 -0
  42. package/dist/workflow/SecurityAudit.js +294 -0
  43. package/dist/workflow/SecurityAudit.js.map +1 -0
  44. package/dist/workflow/SessionPreamble.d.ts +19 -0
  45. package/dist/workflow/SessionPreamble.js +125 -0
  46. package/dist/workflow/SessionPreamble.js.map +1 -0
  47. package/dist/workflow/ShipPipeline.d.ts +30 -0
  48. package/dist/workflow/ShipPipeline.js +366 -0
  49. package/dist/workflow/ShipPipeline.js.map +1 -0
  50. package/dist/workflow/WorkflowGuidance.d.ts +5 -1
  51. package/dist/workflow/WorkflowGuidance.js +31 -0
  52. package/dist/workflow/WorkflowGuidance.js.map +1 -1
  53. package/dist/workflow/index.d.ts +5 -0
  54. package/dist/workflow/index.js +5 -0
  55. package/dist/workflow/index.js.map +1 -1
  56. package/docs/AI_ENGINEERING_OS_POSITIONING.md +15 -0
  57. package/docs/CONTEXT_BUDGET.md +1 -1
  58. package/package.json +2 -1
package/README.en.md CHANGED
@@ -35,6 +35,8 @@ This is not a claim that SCALE replaces human judgment. It is the first testable
35
35
 
36
36
  The near-term target is `0.28.0` as a usable closed-loop enhancement: connect `ai-os plan`, `ai-os run`, verification recommendations, failure learning, dashboard, benchmark, migration, and adoption into one verifiable loop. The long-range target is an AI Engineering OS beta in 8-12 weeks, a stable governance runtime in 3-6 months, and a cross-agent engineering operating layer in 6-12 months. See the full roadmap in [AI Engineering OS Strategic Positioning](docs/AI_ENGINEERING_OS_POSITIONING.md).
37
37
 
38
+ The first 0.30.0 governance-maturity slice adds Evaluator Intelligence and Tool Strategy Planner to the AI OS runtime. `scale ai-os plan` now detects reasoning-heavy architecture, root-cause, security, and release work, then adds critique, threat-model, release-readiness, and uncertainty decision-log gates to the adaptive workflow. It also turns skill/artifact/verification steps into a cost, retry, fallback, side-effect, and evidence graph. `scale ai-os status` surfaces evaluator gate count, uncertainty, tool-strategy cost, and fallback coverage so reviewers can see whether reasoning and tool risks were governed instead of hidden in prose.
39
+
38
40
  The current 0.27.0 beta runtime now includes the controlled run entry point: `scale ai-os run --dry-run` reuses the unified plan, produces execution steps, evidence requirements, next actions, and writes the run report to `.scale/ai-os/runs/`. When real verification is required, use guarded mode with explicit `--verify` commands. Commands run through the safe runner by default and are recorded as runtime evidence; failed verification returns a `blocked` JSON report and a non-zero CLI exit code.
39
41
 
40
42
  ```bash
@@ -66,6 +68,7 @@ scale ai-os dashboard --json
66
68
  ```
67
69
 
68
70
  `status` is the 0.28.0 closed-loop visibility entry point. It checks runtime directories, plan/run evidence, guarded verification, dashboard health, benchmark evidence, and the adoption report in one place.
71
+ For the 0.29.0 intelligence track, `status` also reports memory recall, memory quality, context savings, context compression risk, skill routing, and benchmark intelligence signals from persisted runs and benchmark evidence.
69
72
  When guarded verification evidence is missing, it also recommends concrete commands from `.scale/verification.json` or `package.json` scripts so an agent can choose the next governed `--verify` step without guessing.
70
73
 
71
74
  Before a release or milestone review, run the fixed benchmark scenarios to compare context, memory, skill, governance, and dashboard metrics:
package/README.md CHANGED
@@ -19,6 +19,8 @@ npm:https://www.npmjs.com/package/@hongmaple0820/scale-engine
19
19
 
20
20
  ## 0.27.0 AI OS Runtime
21
21
 
22
+ > 0.30.0 治理成熟度预览:AI OS Runtime 已加入 Evaluator Intelligence 和 Tool Strategy Planner。`scale ai-os plan` 会识别架构、根因、安全、发版等推理风险任务,并把 architecture critique、root-cause review、security threat model、release readiness 和 uncertainty decision log 加入 adaptive workflow;同时把 skill、artifact、verification 步骤编译成 cost、retry、fallback、side-effect 和 evidence graph。`scale ai-os status` 会展示 evaluator gate、uncertainty、tool strategy cost 和 fallback coverage,让评审者看到推理风险和工具风险是否被门禁治理,而不是只藏在文字说明里。
23
+
22
24
  0.27.0 把战略方向落成了一个可执行入口:`scale ai-os plan`。它会在一次命令里同时生成风险治理模式、Context Compiler 预算结果、Memory Provider 召回结果、Skill Routing 执行计划和 Governance ROI,让 Agent 在开始任务前就知道应该加载什么上下文、调用什么能力、补什么证据、哪些风险会升级门禁。
23
25
 
24
26
  ```bash
package/dist/api/cli.js CHANGED
@@ -3238,6 +3238,12 @@ const aiOsStatusCommand = defineCommand({
3238
3238
  console.log(` 检查: ${report.summary.ready} ready, ${report.summary.warning} warning, ${report.summary.blocked} blocked`);
3239
3239
  console.log(` Dashboard: ${report.dashboard.health.status} (${report.dashboard.health.score})`);
3240
3240
  console.log(` Doctor: ${report.doctor.status}`);
3241
+ console.log(` Intelligence: ${report.intelligence.status} (${report.intelligence.summary.ready} ready, ${report.intelligence.summary.warning} warning, ${report.intelligence.summary.blocked} blocked)`);
3242
+ console.log(` Context risk: ${report.intelligence.summary.contextQuality.compressionRisk}; omitted ${report.intelligence.summary.contextQuality.omittedSections} section(s), evidence warnings ${report.intelligence.summary.contextQuality.evidenceLossWarnings.length}`);
3243
+ console.log(` Evaluator gates: ${report.intelligence.summary.evaluatorQuality.requiredGates}; uncertainty ${report.intelligence.summary.evaluatorQuality.averageUncertainty}`);
3244
+ console.log(` Tool strategy: ${report.intelligence.summary.toolStrategyQuality.totalSteps} step(s), cost ${report.intelligence.summary.toolStrategyQuality.estimatedCostUnits}, fallback ${report.intelligence.summary.toolStrategyQuality.fallbackCoverage}`);
3245
+ for (const signal of report.intelligence.signals)
3246
+ console.log(` [${signal.status}] ${signal.id}: ${signal.summary}`);
3241
3247
  for (const check of report.checks)
3242
3248
  console.log(` [${check.status}] ${check.id}: ${check.summary}`);
3243
3249
  if (report.verificationRecommendations.length > 0) {
@@ -3257,6 +3263,12 @@ const aiOsStatusCommand = defineCommand({
3257
3263
  console.log(` Checks: ${report.summary.ready} ready, ${report.summary.warning} warning, ${report.summary.blocked} blocked`);
3258
3264
  console.log(` Dashboard: ${report.dashboard.health.status} (${report.dashboard.health.score})`);
3259
3265
  console.log(` Doctor: ${report.doctor.status}`);
3266
+ console.log(` Intelligence: ${report.intelligence.status} (${report.intelligence.summary.ready} ready, ${report.intelligence.summary.warning} warning, ${report.intelligence.summary.blocked} blocked)`);
3267
+ console.log(` Context risk: ${report.intelligence.summary.contextQuality.compressionRisk}; omitted ${report.intelligence.summary.contextQuality.omittedSections} section(s), evidence warnings ${report.intelligence.summary.contextQuality.evidenceLossWarnings.length}`);
3268
+ console.log(` Evaluator gates: ${report.intelligence.summary.evaluatorQuality.requiredGates}; uncertainty ${report.intelligence.summary.evaluatorQuality.averageUncertainty}`);
3269
+ console.log(` Tool strategy: ${report.intelligence.summary.toolStrategyQuality.totalSteps} step(s), cost ${report.intelligence.summary.toolStrategyQuality.estimatedCostUnits}, fallback ${report.intelligence.summary.toolStrategyQuality.fallbackCoverage}`);
3270
+ for (const signal of report.intelligence.signals)
3271
+ console.log(` [${signal.status}] ${signal.id}: ${signal.summary}`);
3260
3272
  for (const check of report.checks)
3261
3273
  console.log(` [${check.status}] ${check.id}: ${check.summary}`);
3262
3274
  if (report.verificationRecommendations.length > 0) {