npm - @hongmaple0820/scale-engine - Versions diffs - 0.25.0 → 0.27.0 - Mend

@hongmaple0820/scale-engine 0.25.0 → 0.27.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (171) hide show

package/LICENSE +15 -15
package/README.en.md +384 -346
package/README.md +564 -529
package/dist/adapters/AiderAdapter.js +52 -52
package/dist/adapters/AntigravityAdapter.d.ts +4 -0
package/dist/adapters/AntigravityAdapter.js +21 -0
package/dist/adapters/AntigravityAdapter.js.map +1 -0
package/dist/adapters/ClaudeCodeAdapter.d.ts +4 -1
package/dist/adapters/ClaudeCodeAdapter.js +34 -34
package/dist/adapters/ClaudeCodeAdapter.js.map +1 -1
package/dist/adapters/ClineAdapter.d.ts +4 -0
package/dist/adapters/ClineAdapter.js +20 -0
package/dist/adapters/ClineAdapter.js.map +1 -0
package/dist/adapters/CodexAdapter.js +28 -28
package/dist/adapters/CursorAdapter.js +26 -26
package/dist/adapters/DeepSeekTuiAdapter.js +97 -97
package/dist/adapters/DoubaoAdapter.js +33 -33
package/dist/adapters/GeminiAdapter.js +26 -26
package/dist/adapters/GenericProjectAgentAdapter.d.ts +29 -0
package/dist/adapters/GenericProjectAgentAdapter.js +204 -0
package/dist/adapters/GenericProjectAgentAdapter.js.map +1 -0
package/dist/adapters/HermesAdapter.js +26 -26
package/dist/adapters/JCodeAdapter.d.ts +4 -0
package/dist/adapters/JCodeAdapter.js +19 -0
package/dist/adapters/JCodeAdapter.js.map +1 -0
package/dist/adapters/KiloCodeAdapter.d.ts +4 -0
package/dist/adapters/KiloCodeAdapter.js +20 -0
package/dist/adapters/KiloCodeAdapter.js.map +1 -0
package/dist/adapters/KimiAdapter.js +32 -32
package/dist/adapters/KiroAdapter.js +26 -26
package/dist/adapters/OpenClawAdapter.js +26 -26
package/dist/adapters/OpenCodeAdapter.js +26 -26
package/dist/adapters/QCoderAdapter.js +26 -26
package/dist/adapters/QoderAdapter.d.ts +4 -0
package/dist/adapters/QoderAdapter.js +21 -0
package/dist/adapters/QoderAdapter.js.map +1 -0
package/dist/adapters/TraeAdapter.js +26 -26
package/dist/adapters/VSCAdapter.js +26 -26
package/dist/adapters/WindsurfAdapter.js +32 -32
package/dist/adapters/WorkBuddyAdapter.js +26 -26
package/dist/adapters/index.d.ts +5 -0
package/dist/adapters/index.js +15 -0
package/dist/adapters/index.js.map +1 -1
package/dist/api/cli.js +190 -56
package/dist/api/cli.js.map +1 -1
package/dist/api/doctor.js +10 -3
package/dist/api/doctor.js.map +1 -1
package/dist/api/quickstart.js +7 -1
package/dist/api/quickstart.js.map +1 -1
package/dist/artifact/sqliteStore.js +89 -89
package/dist/artifact/types.d.ts +1 -1
package/dist/cli/phaseCommands.js +53 -53
package/dist/cli/phaseCommands.js.map +1 -1
package/dist/context/AntiPatternRegistry.js +20 -20
package/dist/context/ContextBudget.d.ts +14 -0
package/dist/context/ContextBudget.js +50 -14
package/dist/context/ContextBudget.js.map +1 -1
package/dist/context/ContextBuilder.js +155 -155
package/dist/context/ContextCompiler.d.ts +34 -0
package/dist/context/ContextCompiler.js +120 -0
package/dist/context/ContextCompiler.js.map +1 -0
package/dist/eval/WorkflowEval.js +4 -6
package/dist/eval/WorkflowEval.js.map +1 -1
package/dist/evolution/EvolutionEngine.js +31 -31
package/dist/evolution/EvolutionEvaluator.d.ts +2 -0
package/dist/evolution/EvolutionEvaluator.js +7 -1
package/dist/evolution/EvolutionEvaluator.js.map +1 -1
package/dist/fsm/FSMAgentBridge.js +11 -11
package/dist/governance/GovernanceRoi.d.ts +6 -1
package/dist/governance/GovernanceRoi.js +32 -0
package/dist/governance/GovernanceRoi.js.map +1 -1
package/dist/guardrails/DependencyAuditor.js +38 -0
package/dist/guardrails/DependencyAuditor.js.map +1 -1
package/dist/hooks/HookGeneratorEnhanced.js +218 -218
package/dist/index.d.ts +2 -1
package/dist/index.js +3 -2
package/dist/index.js.map +1 -1
package/dist/knowledge/SQLiteKnowledgeBase.js +28 -28
package/dist/memory/MemoryBrain.js +52 -52
package/dist/output/GovernanceDashboard.js +44 -44
package/dist/output/HTMLArtifactLayer.js +31 -31
package/dist/prompts/VibeTemplateGallery.js +121 -121
package/dist/runtime/AiOsRuntime.d.ts +53 -0
package/dist/runtime/AiOsRuntime.js +142 -0
package/dist/runtime/AiOsRuntime.js.map +1 -0
package/dist/runtime/index.d.ts +1 -0
package/dist/runtime/index.js +1 -0
package/dist/runtime/index.js.map +1 -1
package/dist/skills/SkillDiscovery.js +12 -1
package/dist/skills/SkillDiscovery.js.map +1 -1
package/dist/skills/routing/SkillPlanner.js +128 -40
package/dist/skills/routing/SkillPlanner.js.map +1 -1
package/dist/skills/routing/SkillRoutingTypes.d.ts +17 -0
package/dist/tools/SafeCommandRunner.d.ts +16 -0
package/dist/tools/SafeCommandRunner.js +83 -0
package/dist/tools/SafeCommandRunner.js.map +1 -0
package/dist/workflow/EngineeringStandards.js +62 -62
package/dist/workflow/GovernanceTemplatePacks.d.ts +1 -1
package/dist/workflow/GovernanceTemplatePacks.js +1990 -162
package/dist/workflow/GovernanceTemplatePacks.js.map +1 -1
package/dist/workflow/GovernanceTemplates.d.ts +2 -0
package/dist/workflow/GovernanceTemplates.js +1012 -1001
package/dist/workflow/GovernanceTemplates.js.map +1 -1
package/dist/workflow/ResourceGovernance.js +16 -16
package/dist/workflow/TaskArtifactScaffolder.js +10 -10
package/dist/workflow/UpgradeManager.d.ts +3 -2
package/dist/workflow/UpgradeManager.js +134 -49
package/dist/workflow/UpgradeManager.js.map +1 -1
package/dist/workflow/WorkspaceTopology.js +18 -15
package/dist/workflow/WorkspaceTopology.js.map +1 -1
package/dist/workflow/gates/GateSystem.js +3 -9
package/dist/workflow/gates/GateSystem.js.map +1 -1
package/docs/ACTIVE_SECURITY_VISUAL_GATES.md +87 -87
package/docs/AI_ENGINEERING_OS_POSITIONING.md +462 -0
package/docs/BACKGROUND_HUNTER.md +62 -62
package/docs/CODE_INTELLIGENCE.md +138 -138
package/docs/CONTEXT_BUDGET.md +155 -113
package/docs/DEPENDENCY_AUDIT.md +118 -89
package/docs/EVOLUTION_SHADOW_MODE.md +63 -63
package/docs/EXTERNAL_REFERENCES.md +63 -58
package/docs/GITLAB_FLOW.md +125 -125
package/docs/GOVERNANCE_DASHBOARD.md +85 -85
package/docs/MEMORY_BRAIN.md +104 -104
package/docs/MEMORY_FABRIC.md +136 -134
package/docs/README.md +102 -92
package/docs/RUNTIME_EVIDENCE.md +101 -101
package/docs/SKILL-REPOSITORY.md +57 -57
package/docs/SKILL_RADAR.md +135 -122
package/docs/THIRD_PARTY_SKILLS.md +57 -57
package/docs/WORKFLOW_EVAL.md +151 -151
package/docs/guides/DEVELOPMENT_WORKFLOW.md +80 -0
package/docs/guides/GETTING_STARTED.md +50 -0
package/docs/start/README.md +78 -72
package/docs/start/agent-governance-demo.md +107 -107
package/docs/start/quickstart.md +137 -127
package/docs/start/workflow-upgrade.md +32 -8
package/docs/workflow/README.md +67 -0
package/docs/workflow/node-library.md +52 -0
package/docs/workflow/templates/api-contract.md +29 -0
package/docs/workflow/templates/architecture-review.md +23 -0
package/docs/workflow/templates/db-change-plan.md +20 -0
package/docs/workflow/templates/docs-impact.md +17 -0
package/docs/workflow/templates/e2e-plan.md +20 -0
package/docs/workflow/templates/explore.md +16 -0
package/docs/workflow/templates/github-actions-scale-preflight.yml +32 -0
package/docs/workflow/templates/mini-prd.md +16 -0
package/docs/workflow/templates/plan.md +37 -0
package/docs/workflow/templates/pre-push-scale-preflight.sh +8 -0
package/docs/workflow/templates/product-smoke.md +61 -0
package/docs/workflow/templates/reality-check.md +28 -0
package/docs/workflow/templates/resource-cleanup.md +17 -0
package/docs/workflow/templates/resource-impact.md +25 -0
package/docs/workflow/templates/review.md +12 -0
package/docs/workflow/templates/runtime.md +23 -0
package/docs/workflow/templates/security-review.md +26 -0
package/docs/workflow/templates/skill-evidence.md +33 -0
package/docs/workflow/templates/skill-plan.md +39 -0
package/docs/workflow/templates/spec.md +17 -0
package/docs/workflow/templates/standards-impact.md +28 -0
package/docs/workflow/templates/summary.md +16 -0
package/docs/workflow/templates/tasks.md +8 -0
package/docs/workflow/templates/ui-spec.md +29 -0
package/docs/workflow/templates/verification.md +20 -0
package/docs/workflow/templates/visual-review.md +20 -0
package/examples/demo-projects/agent-governance-demo/CONTEXT.md +14 -14
package/examples/demo-projects/agent-governance-demo/README.md +48 -48
package/examples/demo-projects/agent-governance-demo/docs/CONTEXT-MAP.md +14 -14
package/examples/demo-projects/agent-governance-demo/package.json +22 -21
package/examples/demo-projects/agent-governance-demo/src/oauth-state.ts +39 -39
package/examples/demo-projects/agent-governance-demo/tests/oauth-state.test.ts +52 -52
package/package.json +95 -78

package/README.en.md CHANGED Viewed

@@ -1,346 +1,384 @@
-<p align="center">
-  <img src="https://img.shields.io/badge/version-0.23.0-orange?style=flat-square" alt="version" />
-  <img src="https://img.shields.io/badge/platforms-16-blue?style=flat-square" alt="platforms" />
-  <img src="https://img.shields.io/badge/agents-12-blue?style=flat-square" alt="agents" />
-  <img src="https://img.shields.io/badge/workflows-10-green?style=flat-square" alt="workflows" />
-  <img src="https://img.shields.io/badge/detectors-19-red?style=flat-square" alt="detectors" />
-  <img src="https://img.shields.io/badge/tests-verified-brightgreen?style=flat-square" alt="tests" />
-  <img src="https://img.shields.io/badge/npm-0.23.0-cb3837?style=flat-square&logo=npm" alt="npm" />
-</p>
-# SCALE Engine v0.23.0
-SCALE Engine makes AI coding agents follow engineering rules through executable workflow gates, evidence files, and review constraints instead of relying on prompt discipline alone. It helps humans see what the agent explored, planned, verified, skipped, and why a task is or is not ready to ship.
-Repository: https://github.com/hongmaple0820/scale-engine
-Mirror: https://gitee.com/hongmaple/scale-engine
-npm: https://www.npmjs.com/package/@hongmaple0820/scale-engine
-Language: [English](README.en.md) | [Chinese](README.md)
-## Community
-SCALE Engine is an engineering workflow governance project for real AI-agent delivery. Contributions, issues, PRs, governance-pack ideas, and field reports are welcome through the source repositories. Chinese users can also follow the WeChat public account for updates, examples, and community entry points.
-| Platform | Link | Purpose |
-| --- | --- | --- |
-| GitHub | [https://github.com/hongmaple0820/scale-engine](https://github.com/hongmaple0820/scale-engine) | Source, issues, and PRs |
-| Gitee | [https://gitee.com/hongmaple/scale-engine](https://gitee.com/hongmaple/scale-engine) | China mirror and feedback |
-| npm | [https://www.npmjs.com/package/@hongmaple0820/scale-engine](https://www.npmjs.com/package/@hongmaple0820/scale-engine) | CLI package |
-<p align="center">
-  <img src="image/wechat-public.jpg" alt="SCALE Engine WeChat public account" width="220" />
-</p>
-## Sponsorship
-If SCALE Engine saves engineering governance time for your team, or helps move AI-agent work into a verifiable, reviewable, and releasable loop, voluntary sponsorship is welcome. Sponsorship supports maintenance, examples, documentation, test coverage, and community support. It is not a commercial support contract and does not change issue or PR priority.
-<p align="center">
-  <img src="image/wxPay.jpg" alt="Sponsor with WeChat Pay" width="220" />
-  &nbsp;&nbsp;
-  <img src="image/zfb.jpg" alt="Sponsor with Alipay" width="220" />
-</p>
-## What It Solves
-AI coding becomes hard when agents must behave consistently across real teams and real repositories:
-| Failure mode | SCALE mechanism |
-| --- | --- |
-| Agent says tests passed without running them | Verification profiles and evidence stores record actual commands and results |
-| Agent skips discovery, design, TDD, or review | `scale context`, `scale diagnose`, `scale tdd`, and `scale status` produce required next actions |
-| Agent stages unrelated files or edits the wrong repository | Review-gated shipping, MOE workspace rules, and child repository blockers control boundaries |
-| Docs, screenshots, reports, scripts, and temporary files become unmaintainable | Resource governance classifies maintained assets, task evidence, temporary outputs, and forbidden commits |
-| Noisy logs, secrets, ORM misuse, framework violations, or security risks slip through | Engineering standards and OWASP scans produce traceable findings |
-| Long Markdown reports are not read | `scale artifact` renders traceable HTML reports from maintained Markdown sources |
-## See It In 3 Minutes
-```bash
-npm install -g @hongmaple0820/scale-engine
-mkdir scale-demo && cd scale-demo
-scale init --governance-pack standard
-scale preflight --preflight-profile quick
-scale status
-```
-This generates governance files you can commit to a project:
-- `.scale/verification.json`: service matrix and verification profiles
-- `.scale/skills.json`: skill routing and evidence requirements
-- `.scale/tools.json`: CLI/MCP/browser/desktop orchestration policy
-- `docs/workflow/templates/`: Mini-PRD, plan, verification, review, and summary templates
-- `docs/standards/`: engineering, Git collaboration, and resource governance rules
-Continue with a full workflow loop:
-```bash
-scale context init --name "Scale Demo"
-scale context grill --task-id 2026-05-18-oauth-hardening --task "Harden OAuth callback"
-scale diagnose plan --task-id 2026-05-18-oauth-hardening --symptom "callback returns 500 when state expires"
-scale tdd slice --task-id 2026-05-18-oauth-hardening --behavior "reject expired OAuth state" --public-interface "GET /oauth/callback" --failing-test "expired state returns 401" --test-file tests/oauth.test.ts --impl-files src/oauth.ts
-scale artifact render --task-id 2026-05-18-oauth-hardening --artifact-dir .planning/tasks/2026-05-18-oauth-hardening
-scale artifact doctor --artifact-dir .planning/tasks/2026-05-18-oauth-hardening
-```
-Read [Quickstart](docs/start/quickstart.md) and [Agent Governance Demo](docs/start/agent-governance-demo.md) for the complete walkthrough.
-## Who It Is For
-- Teams using Codex, Claude Code, Cursor, Gemini CLI, OpenCode, Aider, or similar agents on real projects.
-- Teams with multi-service, multi-repository, MOE workspace, frontend/backend, or scaffold governance needs.
-- Teams that want agents to actively use skills, MCPs, CLIs, browser automation, E2E checks, and HTML reports with safety boundaries.
-- Project owners who feel AI code is fast but hard to review, verify, and maintain.
-It is not optimized for toy projects that only want one minimal prompt file and do not need gates, collaboration rules, or long-term maintainability.
-## Core Capabilities
-- Workflow Engine: `define -> plan -> build -> verify -> review -> ship` with persisted state.
-- GateSystem: build, lint, test, coverage, security, TDD, review, and tool evidence gates.
-- Governance Packs: `standard`, `project-scaffold`, `moe-workspace`, `resource-governance`, `go-service-matrix`, `node-library`, and `frontend-app`.
-- Resource Governance: docs, media, reports, test scripts, temporary scripts, HTML artifacts, and local config lifecycle rules.
-- Skill and Tool Orchestration: UI/UX, web research, browser E2E, Chrome DevTools MCP, desktop automation, and external agent CLIs.
-- Engineering Standards: noisy logs, sensitive data, injection risks, ORM/database usage, framework boundaries, test rigor, and deployment risk.
-- HTML Artifacts: Markdown remains the maintained source; HTML becomes the review, comparison, status, and release handoff layer.
-## Installation
-```bash
-npm install -g @hongmaple0820/scale-engine
-scale --version
-```
-Node.js 20 or newer is required.
-## Governance Packs
-Use `scale init` to install a governed workflow into an existing project:
-```bash
-scale init --governance-pack standard
-scale init --governance-pack project-scaffold
-scale init --governance-pack moe-workspace
-scale init --governance-pack resource-governance
-scale init --governance-pack go-service-matrix
-scale init --governance-pack node-library
-scale init --governance-pack frontend-app
-```
-Supported packs:
-| Pack | Best fit |
-| --- | --- |
-| `standard` | General project governance with task artifacts, verification, metrics, resources, standards, and skills policy |
-| `project-scaffold` | Reproducible engineering workflow scaffold and demo governance project |
-| `moe-workspace` | Parent workspace with independent child repositories or MOE-style multi-repo development |
-| `resource-governance` | Asset/document lifecycle policy for docs, reports, screenshots, scripts, media, and generated outputs |
-| `go-service-matrix` | Go backend services with service-aware build/lint/test/security verification |
-| `node-library` | Node/TypeScript package workflow, release, and verification governance |
-| `frontend-app` | UI/UX, browser evidence, responsive checks, E2E, and visual review governance |
-If you are unsure, start with `standard`. Use a specialized pack when the project shape is clear:
-See [Getting Started](docs/start/README.md) for runnable tutorials and demo paths.
-## Phase Workflow
-```bash
-scale define "Scoped release workflow" \
-  --description "Implement a TypeScript CLI workflow with verification evidence, review records, rollback constraints, and release safety checks." \
-  --success-criteria "verify evidence is persisted,review evidence is persisted,ship blocks unreviewed files"
-scale plan <spec-id> --rollback "Revert the release commit and remove generated artifacts"
-scale build <plan-id> --description "Implement scoped release workflow"
-scale verify <task-id>
-scale review <task-id>
-scale ship <task-id> --message "feat(workflow): add scoped release workflow"
-```
-Use `scale ship <task-id> --no-commit` to generate the delivery report without creating a Git commit.
-Strict TDD evidence can be enforced when needed:
-```bash
-scale verify <task-id> --tdd-strict --tdd-evidence .scale/tdd/<task-id>.json
-```
-The TDD evidence JSON must include `red`, `green`, `refactor`, and `testFirst` set to `true`.
-## Evolution Self-Improve Loop
-Extract lessons from session defects and promote to rules and hooks:
-```bash
-# Extract Lessons from session
-scale evolution extract <session-id>
-# Run self-improve loop: Defect → Lesson → Rule → Hook
-scale evolution improve <session-id>
-# Show self-improve report
-scale evolution report <session-id>
-# View generated Hooks config
-scale evolution hooks <session-id> --json
-```
-Thresholds:
-- Lesson → Rule: requires 3 verifications
-- Rule → Active: requires 10 hits
-- Rule → Hook: requires 20 hits
-## Safety Model
-SCALE Engine uses multiple enforcement layers:
-| Layer | Purpose |
-| --- | --- |
-| FSM | Prevents invalid artifact lifecycle transitions |
-| GateSystem | Runs build, lint, test, coverage, and security gates |
-| EvidenceStore | Persists verification evidence for audit and release gating |
-| ReviewStore | Persists deterministic review records |
-| ReviewAnalyzer | Scans diffs for high-risk code, process debt, and missing security evidence |
-| Detectors | Detects brute retry, premature completion, blame shifting, busy loops, and related failure modes |
-| Ship gate | Requires passing verification and review evidence before release |
-The `ship` command no longer stages the whole workspace. It stages only files covered by passing review records and blocks if new reviewable files appear after review.
-Git branch governance follows a GitLab Flow variant: short branches merge into `dev`, verified releases land on `master`, and production publishing is triggered by user-created `vX.Y.Z` tags on `master`. `scale ship` blocks direct governed commits on `dev`, `master`, `main`, or detached HEAD, and temporary worktree cleanup is blocked when the branch still has unpushed or unmerged commits. See [docs/GITLAB_FLOW.md](docs/GITLAB_FLOW.md).
-G7 `SecurityGate` includes a lightweight built-in scan for hardcoded secrets, private keys, disabled TLS verification, `eval`/`Function`, raw HTML injection, dangerous shell commands, shell execution, and empty `catch` blocks. Compatibility mode blocks CRITICAL findings; strict mode also blocks HIGH findings.
-## Skill and Tool Governance
-Skill Radar recommends skills, MCP servers, browser automation, desktop automation, planning workflows, memory providers, and external CLIs by task intent. It returns confidence, safety level, evidence requirements, attribution metadata, and fallback behavior.
-Third-party skills stay review-required until source, scripts, license, attribution, and pinned revision are checked. `OthmanAdi/planning-with-files` (MIT), `rohitg00/agentmemory` (Apache-2.0), and `garrytan/gbrain` (MIT) have explicit attribution records; other external skills, MCP servers, CLIs, adapters, and discovery candidates are tracked in the [External Reference Inventory](docs/EXTERNAL_REFERENCES.md) with unknown licenses kept `review-required`. SCALE records them as governed references, optional integrations, or adapted concepts; it does not vendor upstream source code.
-Memory is provider-routed rather than expanded as a built-in Memory OS. Agents can use `scale memory provider status` and `scale memory provider recall` to select `agentmemory`, `gbrain`, or `scale-local` under policy; external providers are read-only by default and fall back to local evidence-backed memory.
-See [Skill Radar](docs/SKILL_RADAR.md), [Third-Party Skills](docs/THIRD_PARTY_SKILLS.md), and [External Reference Inventory](docs/EXTERNAL_REFERENCES.md).
-## Supported Platforms
-SCALE Engine includes adapters for 16 agent platforms, including Claude Code, Codex CLI, OpenCode, Cursor, Gemini CLI, OpenClaw, Hermes, Trae, WorkBuddy, VS Code Copilot CLI, QCoder, DeepSeek-TUI, Aider, Windsurf, Kimi, and Doubao.
-It also includes 12 professional agent profiles:
-- frontend
-- backend
-- testing
-- UI design
-- operations
-- product
-- code review
-- security
-- database
-- performance
-- documentation
-- architecture
-## Project Layout
-```text
-src/api/cli.ts                 CLI entrypoint
-src/cli/phaseCommands.ts       DEFINE/PLAN/BUILD/VERIFY/REVIEW/SHIP
-src/cli/evolutionCommands.ts   L6 Evolution CLI commands
-src/workflow/gates/            Quality gates and persisted evidence
-src/workflow/ReviewAnalyzer.ts Deterministic review analysis
-src/workflow/ReviewStore.ts    Review record persistence
-src/workflow/EvidenceStore.ts  Gate evidence persistence
-src/workflow/evolution/        LessonExtractor + SelfImproveEngine
-src/workflow/qa/               BrowserQA + E2ETestRunner
-src/artifact/                  Artifact store and FSM definitions
-src/guardrails/                Detector and gateway logic
-src/guardrails/OWASPDetector.ts OWASP Top 10 security detection
-src/capabilities/BrowserQACapability.ts Playwright MCP wrapper
-src/evolution/                 Defect/Lesson/Rule/Hook evolution layer
-tests/                         Vitest test suites
-```
-## Development
-```bash
-npm install
-npm run build
-npx vitest run
-npm pack --dry-run
-```
-Targeted workflow tests:
-```bash
-npx vitest run tests/workflow/phaseCli.test.ts
-npx vitest run tests/workflow/reviewAnalyzer.test.ts tests/workflow/reviewStore.test.ts tests/workflow/gateSystem.test.ts
-```
-## Release Notes
-### v0.20.0
-- Added Context Budget and Progressive Governance so low-risk S tasks stay lightweight while auth, data, security, deployment, and cross-module changes escalate automatically.
-- Added Code Intelligence with adapter-first CodeGraph / Graphify support, explicit fallback, impact analysis, context recommendations, and exploration ROI.
-- Added Workflow Eval, Failure Replay, and improvement candidates with pass@k, fix iterations, tool-call counts, token estimates, and human-correction metrics.
-- Added Skill Radar for intent-based skills, MCP, browser, desktop automation, and external CLI recommendations with confidence, safety level, and evidence requirements.
-- Added Memory Brain for evidence-backed long-term memory candidates, contradiction detection, dream maintenance, explicit promotion, and failure replay ingestion.
-- Added Governance Dashboard to summarize runtime, eval, memory, resource, and HTML artifact evidence in a local HTML review surface.
-- Fixed new `--dir` aware commands so relative `.scale` state resolves inside the target project instead of the caller workspace.
-### v0.19.0
-- Added product smoke gates, runtime evidence learning settlement, memory context packs, workspace conflict blockers, and release-readiness demo coverage.
-### v0.18.0
-- Governed HTML artifacts: `scale artifact render/doctor/settle/open`.
-- Markdown remains the editable source of truth; generated HTML is traceable task evidence.
-- Governance packs now include output policy and HTML artifact resource classification.
-- Added tests for HTML artifact rendering, safety checks, settlement evidence, and generated template output.
-### v0.17.0
-- Added active workflow command gates: `scale context`, `scale diagnose`, `scale tdd`, and `scale status`.
-- Added required next-action queues so agents cannot silently skip context, debugging, TDD, or verification work.
-### v0.16.0
-- Added governed skill repository, skill recommendation, install-safety checks, visual Vibe templates, and leadership presets.
-- Strengthened tool orchestration and resource/engineering standards governance.
-### v0.15.1
-- Added UI/UX, web research, browser automation, desktop automation, and external Agent CLI routing contracts.
-- Added resource governance and engineering standards governance for generated project packs.
-### v0.11.1
-- Phase Commands FSM blocking: `canTransition` + `process.exit(1)` for guard failures
-- OWASP Top 10 Detector: 19 security detection patterns
-- Browser QA Capability: Playwright MCP wrapper for E2E testing
-- L6 Evolution: `Defect → Lesson → Rule → Hook` self-improve loop
-- Evolution CLI: `scale evolution extract/improve/report/hooks`
-- ReviewAnalyzer regex fix: avoid false positives on pattern definitions
-- Vitest suite covered in release verification
-### v0.10.1
-- Hardened `ship` so release commits stage only files covered by passing review records.
-- Added `ship --no-commit` delivery reports for reviewable output without creating a Git commit.
-- Added optional strict TDD evidence verification with `--tdd-evidence` and `--tdd-strict`.
-- Added richer command evidence metadata: working directory, timestamps, stdout/stderr tails, and output hashes.
-- Hardened deterministic review scanning for empty `catch`, `@ts-ignore`, focused tests, dangerous shell/Git commands, and security-sensitive changes without G7 evidence.
-- Hardened built-in G7 security scanning with explainable file/line evidence and compatibility vs strict blocking modes.
-- Added CLI/unit regression tests for `review -> ship`, unreviewed-file blocking, and security-scanner false-positive boundaries.
-- Verified `npm run build`, full Vitest suite, and `npm pack --dry-run` before release.
-### v0.10.0
-- Added phase-aligned workflow commands with FSM integration.
-- Added persisted verification evidence and review records.
-- Published `@hongmaple0820/scale-engine@0.10.0`.
-- Verified `npm run build`, full Vitest suite, and `npm pack --dry-run` before release.
-## License
-MIT
+<p align="center">
+  <img src="https://img.shields.io/badge/version-0.27.0-orange?style=flat-square" alt="version" />
+  <img src="https://img.shields.io/badge/platforms-22-blue?style=flat-square" alt="platforms" />
+  <img src="https://img.shields.io/badge/agents-12-blue?style=flat-square" alt="agents" />
+  <img src="https://img.shields.io/badge/workflows-10-green?style=flat-square" alt="workflows" />
+  <img src="https://img.shields.io/badge/detectors-19-red?style=flat-square" alt="detectors" />
+  <img src="https://img.shields.io/badge/tests-verified-brightgreen?style=flat-square" alt="tests" />
+  <img src="https://img.shields.io/badge/npm-0.27.0-cb3837?style=flat-square&logo=npm" alt="npm" />
+</p>
+# SCALE Engine v0.27.0
+SCALE Engine makes AI coding agents follow engineering rules through executable workflow gates, evidence files, and review constraints instead of relying on prompt discipline alone. It helps humans see what the agent explored, planned, verified, skipped, and why a task is or is not ready to ship.
+Repository: https://github.com/hongmaple0820/scale-engine
+Mirror: https://gitee.com/hongmaple/scale-engine
+npm: https://www.npmjs.com/package/@hongmaple0820/scale-engine
+Language: [English](README.en.md) | [Chinese](README.md)
+## 0.27.0 AI OS Runtime
+0.27.0 turns the AI Engineering OS direction into one executable entry point: `scale ai-os plan`. It creates a unified task plan with progressive governance mode, Context Compiler budget output, Memory Provider recall, Skill Routing execution steps, and Governance ROI. An agent can see which context to load, which capabilities to use, what evidence is required, and which risks escalate gates before it starts the task.
+```bash
+scale ai-os plan \
+  --task-id TASK-123 \
+  --task "Fix OAuth callback auth token handling and verify browser callback flow" \
+  --level L \
+  --files src/auth/oauth.ts,src/ui/callback.tsx \
+  --budget 8000 \
+  --json
+```
+This is not a claim that SCALE replaces human judgment. It is the first testable, explainable, and measurable runtime planning layer for the AI Engineering OS direction.
+## Community
+SCALE Engine is an engineering workflow governance project for real AI-agent delivery. Contributions, issues, PRs, governance-pack ideas, and field reports are welcome through the source repositories. Chinese users can also follow the WeChat public account for updates, examples, and community entry points.
+| Platform | Link | Purpose |
+| --- | --- | --- |
+| GitHub | [https://github.com/hongmaple0820/scale-engine](https://github.com/hongmaple0820/scale-engine) | Source, issues, and PRs |
+| Gitee | [https://gitee.com/hongmaple/scale-engine](https://gitee.com/hongmaple/scale-engine) | China mirror and feedback |
+| npm | [https://www.npmjs.com/package/@hongmaple0820/scale-engine](https://www.npmjs.com/package/@hongmaple0820/scale-engine) | CLI package |
+<p align="center">
+  <img src="image/wechat-public.jpg" alt="SCALE Engine WeChat public account" width="220" />
+</p>
+## Sponsorship
+If SCALE Engine saves engineering governance time for your team, or helps move AI-agent work into a verifiable, reviewable, and releasable loop, voluntary sponsorship is welcome. Sponsorship supports maintenance, examples, documentation, test coverage, and community support. It is not a commercial support contract and does not change issue or PR priority.
+<p align="center">
+  <img src="image/wxPay.jpg" alt="Sponsor with WeChat Pay" width="220" />
+  &nbsp;&nbsp;
+  <img src="image/zfb.jpg" alt="Sponsor with Alipay" width="220" />
+</p>
+## What It Solves
+AI coding becomes hard when agents must behave consistently across real teams and real repositories:
+| Failure mode | SCALE mechanism |
+| --- | --- |
+| Agent says tests passed without running them | Verification profiles and evidence stores record actual commands and results |
+| Agent skips discovery, design, TDD, or review | `scale context`, `scale diagnose`, `scale tdd`, and `scale status` produce required next actions |
+| Agent stages unrelated files or edits the wrong repository | Review-gated shipping, MOE workspace rules, and child repository blockers control boundaries |
+| Docs, screenshots, reports, scripts, and temporary files become unmaintainable | Resource governance classifies maintained assets, task evidence, temporary outputs, and forbidden commits |
+| Noisy logs, secrets, ORM misuse, framework violations, or security risks slip through | Engineering standards and OWASP scans produce traceable findings |
+| Long Markdown reports are not read | `scale artifact` renders traceable HTML reports from maintained Markdown sources |
+## See It In 3 Minutes
+```bash
+npm install -g @hongmaple0820/scale-engine
+mkdir scale-demo && cd scale-demo
+scale init --governance-pack standard
+scale preflight --preflight-profile quick
+scale status
+```
+This generates governance files you can commit to a project:
+- `.scale/verification.json`: service matrix and verification profiles
+- `.scale/skills.json`: skill routing and evidence requirements
+- `.scale/tools.json`: CLI/MCP/browser/desktop orchestration policy
+- `docs/workflow/templates/`: Mini-PRD, plan, verification, review, and summary templates
+- `docs/standards/`: engineering, Git collaboration, and resource governance rules
+Continue with a full workflow loop:
+```bash
+scale context init --name "Scale Demo"
+scale context grill --task-id 2026-05-18-oauth-hardening --task "Harden OAuth callback"
+scale diagnose plan --task-id 2026-05-18-oauth-hardening --symptom "callback returns 500 when state expires"
+scale tdd slice --task-id 2026-05-18-oauth-hardening --behavior "reject expired OAuth state" --public-interface "GET /oauth/callback" --failing-test "expired state returns 401" --test-file tests/oauth.test.ts --impl-files src/oauth.ts
+scale artifact render --task-id 2026-05-18-oauth-hardening --artifact-dir .planning/tasks/2026-05-18-oauth-hardening
+scale artifact doctor --artifact-dir .planning/tasks/2026-05-18-oauth-hardening
+```
+Read [Quickstart](docs/start/quickstart.md) and [Agent Governance Demo](docs/start/agent-governance-demo.md) for the complete walkthrough.
+## Who It Is For
+- Teams using Codex, Claude Code, Cursor, Gemini CLI, OpenCode, Aider, or similar agents on real projects.
+- Teams with multi-service, multi-repository, MOE workspace, frontend/backend, or scaffold governance needs.
+- Teams that want agents to actively use skills, MCPs, CLIs, browser automation, E2E checks, and HTML reports with safety boundaries.
+- Project owners who feel AI code is fast but hard to review, verify, and maintain.
+It is not optimized for toy projects that only want one minimal prompt file and do not need gates, collaboration rules, or long-term maintainability.
+## Core Capabilities
+- Workflow Engine: `define -> plan -> build -> verify -> review -> ship` with persisted state.
+- GateSystem: build, lint, test, coverage, security, TDD, review, and tool evidence gates.
+- Governance Packs: `standard`, `project-scaffold`, `moe-workspace`, `resource-governance`, `go-service-matrix`, `node-library`, and `frontend-app`.
+- Resource Governance: docs, media, reports, test scripts, temporary scripts, HTML artifacts, and local config lifecycle rules.
+- Skill and Tool Orchestration: UI/UX, web research, browser E2E, Chrome DevTools MCP, desktop automation, and external agent CLIs.
+- Engineering Standards: noisy logs, sensitive data, injection risks, ORM/database usage, framework boundaries, test rigor, and deployment risk.
+- HTML Artifacts: Markdown remains the maintained source; HTML becomes the review, comparison, status, and release handoff layer.
+## Installation
+```bash
+npm install -g @hongmaple0820/scale-engine
+scale --version
+```
+Node.js 20 or newer is required.
+## Governance Packs
+Use `scale init` to install a governed workflow into an existing project:
+```bash
+scale init --governance-pack standard
+scale init --governance-pack project-scaffold
+scale init --governance-pack moe-workspace
+scale init --governance-pack resource-governance
+scale init --governance-pack go-service-matrix
+scale init --governance-pack node-library
+scale init --governance-pack frontend-app
+```
+Supported packs:
+| Pack | Best fit |
+| --- | --- |
+| `standard` | General project governance with task artifacts, verification, metrics, resources, standards, and skills policy |
+| `project-scaffold` | Reproducible engineering workflow scaffold and demo governance project |
+| `moe-workspace` | Parent workspace with independent child repositories or MOE-style multi-repo development |
+| `resource-governance` | Asset/document lifecycle policy for docs, reports, screenshots, scripts, media, and generated outputs |
+| `go-service-matrix` | Go backend services with service-aware build/lint/test/security verification |
+| `node-library` | Node/TypeScript package workflow, release, and verification governance |
+| `frontend-app` | UI/UX, browser evidence, responsive checks, E2E, and visual review governance |
+If you are unsure, start with `standard`. Use a specialized pack when the project shape is clear:
+See [Getting Started](docs/start/README.md) for runnable tutorials and demo paths.
+## Workflow Upgrade
+Do not rerun `scale init` as a blind upgrade command in existing projects. Use the guarded upgrade flow:
+```bash
+scale upgrade check --dir . --lang en
+scale upgrade plan --dir . --html --lang en
+scale upgrade apply --dir . --confirm --lang en
+scale upgrade rollback --dir . --lang en
+```
+Chinese output is the default. Add `--lang en` for English prompts and English HTML plans.
+Upgrade rules:
+- Missing managed files can be restored automatically after plan review.
+- Clean managed files whose content still matches `.scale/governance.lock.json` can be refreshed when a governance pack version changes.
+- Locally edited managed files are marked `manual-review` and are not overwritten automatically.
+- Third-party skills, MCP servers, desktop automation, browser tools, and external CLIs are check-only; SCALE reports source and trust policy but does not auto-install them.
+See [Workflow Upgrade Guide](docs/start/workflow-upgrade.md) for the runnable path.
+## Phase Workflow
+```bash
+scale define "Scoped release workflow" \
+  --description "Implement a TypeScript CLI workflow with verification evidence, review records, rollback constraints, and release safety checks." \
+  --success-criteria "verify evidence is persisted,review evidence is persisted,ship blocks unreviewed files"
+scale plan <spec-id> --rollback "Revert the release commit and remove generated artifacts"
+scale build <plan-id> --description "Implement scoped release workflow"
+scale verify <task-id>
+scale review <task-id>
+scale ship <task-id> --message "feat(workflow): add scoped release workflow"
+```
+Use `scale ship <task-id> --no-commit` to generate the delivery report without creating a Git commit.
+Strict TDD evidence can be enforced when needed:
+```bash
+scale verify <task-id> --tdd-strict --tdd-evidence .scale/tdd/<task-id>.json
+```
+The TDD evidence JSON must include `red`, `green`, `refactor`, and `testFirst` set to `true`.
+## Evolution Self-Improve Loop
+Extract lessons from session defects and promote to rules and hooks:
+```bash
+# Extract Lessons from session
+scale evolution extract <session-id>
+# Run self-improve loop: Defect → Lesson → Rule → Hook
+scale evolution improve <session-id>
+# Show self-improve report
+scale evolution report <session-id>
+# View generated Hooks config
+scale evolution hooks <session-id> --json
+```
+Thresholds:
+- Lesson → Rule: requires 3 verifications
+- Rule → Active: requires 10 hits
+- Rule → Hook: requires 20 hits
+## Safety Model
+SCALE Engine uses multiple enforcement layers:
+| Layer | Purpose |
+| --- | --- |
+| FSM | Prevents invalid artifact lifecycle transitions |
+| GateSystem | Runs build, lint, test, coverage, and security gates |
+| EvidenceStore | Persists verification evidence for audit and release gating |
+| ReviewStore | Persists deterministic review records |
+| ReviewAnalyzer | Scans diffs for high-risk code, process debt, and missing security evidence |
+| Detectors | Detects brute retry, premature completion, blame shifting, busy loops, and related failure modes |
+| Ship gate | Requires passing verification and review evidence before release |
+The `ship` command no longer stages the whole workspace. It stages only files covered by passing review records and blocks if new reviewable files appear after review.
+Git branch governance follows a GitLab Flow variant: short branches merge into `dev`, verified releases land on `master`, and production publishing is triggered by user-created `vX.Y.Z` tags on `master`. `scale ship` blocks direct governed commits on `dev`, `master`, `main`, or detached HEAD, and temporary worktree cleanup is blocked when the branch still has unpushed or unmerged commits. See [docs/GITLAB_FLOW.md](docs/GITLAB_FLOW.md).
+G7 `SecurityGate` includes a lightweight built-in scan for hardcoded secrets, private keys, disabled TLS verification, `eval`/`Function`, raw HTML injection, dangerous shell commands, shell execution, and empty `catch` blocks. Compatibility mode blocks CRITICAL findings; strict mode also blocks HIGH findings.
+## Skill and Tool Governance
+Skill Radar recommends skills, MCP servers, browser automation, desktop automation, planning workflows, memory providers, and external CLIs by task intent. It returns confidence, safety level, evidence requirements, attribution metadata, and fallback behavior.
+Third-party skills stay review-required until source, scripts, license, attribution, and pinned revision are checked. `OthmanAdi/planning-with-files` (MIT), `rohitg00/agentmemory` (Apache-2.0), and `garrytan/gbrain` (MIT) have explicit attribution records; other external skills, MCP servers, CLIs, adapters, and discovery candidates are tracked in the [External Reference Inventory](docs/EXTERNAL_REFERENCES.md) with unknown licenses kept `review-required`. SCALE records them as governed references, optional integrations, or adapted concepts; it does not vendor upstream source code.
+Memory is provider-routed rather than expanded as a built-in Memory OS. Agents can use `scale memory provider status` and `scale memory provider recall` to select `agentmemory`, `gbrain`, or `scale-local` under policy; external providers are read-only by default and fall back to local evidence-backed memory.
+See [Skill Radar](docs/SKILL_RADAR.md), [Third-Party Skills](docs/THIRD_PARTY_SKILLS.md), and [External Reference Inventory](docs/EXTERNAL_REFERENCES.md).
+## Supported Platforms
+SCALE Engine includes adapters for 22 agent platforms, including Claude Code, Codex CLI, OpenCode, Cursor, Gemini CLI, OpenClaw, Hermes, Trae, WorkBuddy, VS Code Copilot CLI, QCoder, Qoder, JCode, DeepSeek-TUI, Aider, Windsurf, Kiro, Cline, Kilo Code, Antigravity, Kimi, and Doubao.
+It also includes 12 professional agent profiles:
+- frontend
+- backend
+- testing
+- UI design
+- operations
+- product
+- code review
+- security
+- database
+- performance
+- documentation
+- architecture
+## Project Layout
+```text
+src/api/cli.ts                 CLI entrypoint
+src/cli/phaseCommands.ts       DEFINE/PLAN/BUILD/VERIFY/REVIEW/SHIP
+src/cli/evolutionCommands.ts   L6 Evolution CLI commands
+src/workflow/gates/            Quality gates and persisted evidence
+src/workflow/ReviewAnalyzer.ts Deterministic review analysis
+src/workflow/ReviewStore.ts    Review record persistence
+src/workflow/EvidenceStore.ts  Gate evidence persistence
+src/workflow/evolution/        LessonExtractor + SelfImproveEngine
+src/workflow/qa/               BrowserQA + E2ETestRunner
+src/artifact/                  Artifact store and FSM definitions
+src/guardrails/                Detector and gateway logic
+src/guardrails/OWASPDetector.ts OWASP Top 10 security detection
+src/capabilities/BrowserQACapability.ts Playwright MCP wrapper
+src/evolution/                 Defect/Lesson/Rule/Hook evolution layer
+tests/                         Vitest test suites
+```
+## Development
+```bash
+npm install
+npm run build
+npx vitest run
+npm pack --dry-run
+```
+Targeted workflow tests:
+```bash
+npx vitest run tests/workflow/phaseCli.test.ts
+npx vitest run tests/workflow/reviewAnalyzer.test.ts tests/workflow/reviewStore.test.ts tests/workflow/gateSystem.test.ts
+```
+## Release Notes
+### v0.20.0
+- Added Context Budget and Progressive Governance so low-risk S tasks stay lightweight while auth, data, security, deployment, and cross-module changes escalate automatically.
+- Added Code Intelligence with adapter-first CodeGraph / Graphify support, explicit fallback, impact analysis, context recommendations, and exploration ROI.
+- Added Workflow Eval, Failure Replay, and improvement candidates with pass@k, fix iterations, tool-call counts, token estimates, and human-correction metrics.
+- Added Skill Radar for intent-based skills, MCP, browser, desktop automation, and external CLI recommendations with confidence, safety level, and evidence requirements.
+- Added Memory Brain for evidence-backed long-term memory candidates, contradiction detection, dream maintenance, explicit promotion, and failure replay ingestion.
+- Added Governance Dashboard to summarize runtime, eval, memory, resource, and HTML artifact evidence in a local HTML review surface.
+- Fixed new `--dir` aware commands so relative `.scale` state resolves inside the target project instead of the caller workspace.
+### v0.19.0
+- Added product smoke gates, runtime evidence learning settlement, memory context packs, workspace conflict blockers, and release-readiness demo coverage.
+### v0.18.0
+- Governed HTML artifacts: `scale artifact render/doctor/settle/open`.
+- Markdown remains the editable source of truth; generated HTML is traceable task evidence.
+- Governance packs now include output policy and HTML artifact resource classification.
+- Added tests for HTML artifact rendering, safety checks, settlement evidence, and generated template output.
+### v0.17.0
+- Added active workflow command gates: `scale context`, `scale diagnose`, `scale tdd`, and `scale status`.
+- Added required next-action queues so agents cannot silently skip context, debugging, TDD, or verification work.
+### v0.16.0
+- Added governed skill repository, skill recommendation, install-safety checks, visual Vibe templates, and leadership presets.
+- Strengthened tool orchestration and resource/engineering standards governance.
+### v0.15.1
+- Added UI/UX, web research, browser automation, desktop automation, and external Agent CLI routing contracts.
+- Added resource governance and engineering standards governance for generated project packs.
+### v0.11.1
+- Phase Commands FSM blocking: `canTransition` + `process.exit(1)` for guard failures
+- OWASP Top 10 Detector: 19 security detection patterns
+- Browser QA Capability: Playwright MCP wrapper for E2E testing
+- L6 Evolution: `Defect → Lesson → Rule → Hook` self-improve loop
+- Evolution CLI: `scale evolution extract/improve/report/hooks`
+- ReviewAnalyzer regex fix: avoid false positives on pattern definitions
+- Vitest suite covered in release verification
+### v0.10.1
+- Hardened `ship` so release commits stage only files covered by passing review records.
+- Added `ship --no-commit` delivery reports for reviewable output without creating a Git commit.
+- Added optional strict TDD evidence verification with `--tdd-evidence` and `--tdd-strict`.
+- Added richer command evidence metadata: working directory, timestamps, stdout/stderr tails, and output hashes.
+- Hardened deterministic review scanning for empty `catch`, `@ts-ignore`, focused tests, dangerous shell/Git commands, and security-sensitive changes without G7 evidence.
+- Hardened built-in G7 security scanning with explainable file/line evidence and compatibility vs strict blocking modes.
+- Added CLI/unit regression tests for `review -> ship`, unreviewed-file blocking, and security-scanner false-positive boundaries.
+- Verified `npm run build`, full Vitest suite, and `npm pack --dry-run` before release.
+### v0.10.0
+- Added phase-aligned workflow commands with FSM integration.
+- Added persisted verification evidence and review records.
+- Published `@hongmaple0820/scale-engine@0.10.0`.
+- Verified `npm run build`, full Vitest suite, and `npm pack --dry-run` before release.
+## License
+MIT