npm - @ps-neko/nekowork - Versions diffs - 0.1.0-alpha.8 → 0.2.0-alpha.0 - Mend

@ps-neko/nekowork 0.1.0-alpha.8 → 0.2.0-alpha.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (243) hide show

package/README.md +45 -481
package/package.json +23 -83
package/scripts/benchmark/capture-live-ai-diff.js +230 -0
package/scripts/benchmark/rules.js +214 -0
package/scripts/benchmark/scrape-oss-positives.js +237 -0
package/scripts/benchmark/verify-candidates.js +110 -0
package/scripts/check.js +126 -0
package/scripts/cli.js +169 -1213
package/scripts/lib/decision.js +336 -0
package/scripts/lib/diff-parser.js +344 -0
package/scripts/lib/project-detector.js +309 -0
package/scripts/lib/rules/_helpers.js +149 -0
package/scripts/lib/rules/auto-apply-commit-push.js +94 -0
package/scripts/lib/rules/hardcoded-credential.js +103 -0
package/scripts/lib/rules/package-lockfile-risk.js +92 -0
package/scripts/lib/rules/secret-fallback.js +259 -0
package/scripts/lib/rules/test-or-security-disable.js +91 -0
package/scripts/lib/session-resolver.js +28 -14
package/scripts/orchestrators/_handoff-utils.js +27 -0
package/scripts/orchestrators/apply.js +4 -23
package/scripts/orchestrators/gate.js +17 -2
package/scripts/orchestrators/report.js +180 -9
package/scripts/orchestrators/verify-pr.js +476 -0
package/AGENTS.md +0 -112
package/CLAUDE.md +0 -84
package/REVIEW.md +0 -96
package/RULES.md +0 -51
package/SOUL.md +0 -21
package/WORKING-CONTEXT.md +0 -52
package/agent.yaml +0 -222
package/agents/architect.md +0 -57
package/agents/code-reviewer.md +0 -60
package/agents/codex-challenger.md +0 -53
package/agents/codex-reviewer.md +0 -56
package/agents/debugger.md +0 -33
package/agents/doc-writer.md +0 -51
package/agents/executor.md +0 -41
package/agents/planner.md +0 -49
package/agents/research.md +0 -50
package/agents/security-reviewer.md +0 -47
package/agents/test-engineer.md +0 -41
package/bridge/mcp-server.js +0 -301
package/commands/claude-led-codex-review.md +0 -29
package/docs/ADVANCED.md +0 -374
package/docs/AI-DEVELOPMENT-LIFECYCLE.md +0 -120
package/docs/ARCHITECTURE.md +0 -213
package/docs/AUDIT.md +0 -123
package/docs/AUTH-MIGRATION.md +0 -282
package/docs/AUTONOMY.md +0 -92
package/docs/BUILD.md +0 -165
package/docs/CATALOG-PACKS.md +0 -89
package/docs/CHANGELOG.md +0 -186
package/docs/CLI-STAGES.md +0 -101
package/docs/CODEMAPS/README.md +0 -15
package/docs/CODEMAPS/agents.md +0 -22
package/docs/CODEMAPS/bridge.md +0 -18
package/docs/CODEMAPS/hooks.md +0 -28
package/docs/CODEMAPS/manifests.md +0 -15
package/docs/CODEMAPS/rules.md +0 -22
package/docs/CODEMAPS/schemas.md +0 -22
package/docs/CODEMAPS/scripts.md +0 -178
package/docs/CODEMAPS/skills.md +0 -31
package/docs/CODEMAPS/tests.md +0 -112
package/docs/CORE-INVARIANTS.md +0 -39
package/docs/DEMO-REPORT.md +0 -113
package/docs/DEMO.md +0 -151
package/docs/EXAMPLE-PROJECT.md +0 -92
package/docs/FAILURE-MODES.md +0 -94
package/docs/FEEDBACK-TRIAGE.md +0 -144
package/docs/INTERNAL-PROVIDER.md +0 -85
package/docs/NAMING.md +0 -46
package/docs/PARALLEL-CANDIDATES.md +0 -58
package/docs/PORTING.md +0 -164
package/docs/PR-PREP.md +0 -35
package/docs/PRODUCT-PRINCIPLES.md +0 -344
package/docs/PUBLISH-ALPHA.md +0 -217
package/docs/QUICKSTART.md +0 -411
package/docs/RELEASE-READINESS.md +0 -201
package/docs/RISK-CLASSIFIER.md +0 -50
package/docs/ROADMAP.md +0 -128
package/docs/RUNBOOK.md +0 -153
package/docs/SAFETY-GUARANTEES.md +0 -54
package/docs/SECURITY.md +0 -79
package/docs/SETUP.md +0 -143
package/docs/TRUST-MODEL.md +0 -46
package/docs/WHY-NEKOWORK.md +0 -99
package/docs/WHY-NOT-AUTOPILOT.md +0 -37
package/docs/assets/demo-terminal.svg +0 -41
package/docs/case-studies/JSHTTP-BASIC-AUTH.md +0 -168
package/docs/case-studies/MOTDOTLA-DOTENV.md +0 -191
package/docs/case-studies/PYTHON-HYPER-H11.md +0 -168
package/docs/case-studies/README.md +0 -19
package/docs/case-studies/SINDRESORHUS-IS-PLAIN-OBJ.md +0 -141
package/docs/dev-log/2026-04-29-p1-recovery.md +0 -142
package/docs/dev-log/2026-04-29-week1-4.md +0 -81
package/docs/examples/GITHUB-ACTIONS-HARDENING.md +0 -86
package/docs/examples/QUALITY-LIFECYCLE-SMOKE.md +0 -32
package/docs/examples/TRADING-DASHBOARD-MOCK.md +0 -65
package/docs/workflows-stash/README.md +0 -32
package/docs/workflows-stash/harness-review.yml +0 -166
package/docs/workflows-stash/harness-validate.yml +0 -98
package/examples/github-actions-hardening/.github/workflows/hardened-validate.yml +0 -38
package/examples/github-actions-hardening/README.md +0 -31
package/examples/github-actions-hardening/case-study/ASK.md +0 -26
package/examples/github-actions-hardening/case-study/GATE_STATUS.md +0 -28
package/examples/github-actions-hardening/case-study/PLAN.md +0 -25
package/examples/github-actions-hardening/case-study/SHIP_READY.md +0 -21
package/examples/github-actions-hardening/case-study/TASK.md +0 -30
package/examples/github-actions-hardening/case-study/TEAM_HANDOFFS.md +0 -37
package/examples/github-actions-hardening/case-study/VERIFY_SUMMARY.md +0 -35
package/examples/github-actions-hardening/case-study/WORK_SUMMARY.md +0 -24
package/examples/github-actions-hardening/package.json +0 -12
package/examples/github-actions-hardening/scripts/check.mjs +0 -43
package/examples/quality-lifecycle-smoke/README.md +0 -30
package/examples/quality-lifecycle-smoke/case-study/ASK.md +0 -24
package/examples/quality-lifecycle-smoke/case-study/GATE_STATUS.md +0 -10
package/examples/quality-lifecycle-smoke/case-study/PLAN.md +0 -19
package/examples/quality-lifecycle-smoke/case-study/SHIP_READY.md +0 -11
package/examples/quality-lifecycle-smoke/case-study/TASK.md +0 -19
package/examples/quality-lifecycle-smoke/case-study/TEAM_HANDOFFS.md +0 -21
package/examples/quality-lifecycle-smoke/case-study/VERIFY_SUMMARY.md +0 -44
package/examples/quality-lifecycle-smoke/case-study/WORK_SUMMARY.md +0 -19
package/examples/quality-lifecycle-smoke/package.json +0 -8
package/examples/quality-lifecycle-smoke/scripts/check.mjs +0 -44
package/examples/trading-dashboard-mock/README.md +0 -33
package/examples/trading-dashboard-mock/case-study/ASK.md +0 -24
package/examples/trading-dashboard-mock/case-study/GATE_STATUS.md +0 -28
package/examples/trading-dashboard-mock/case-study/PLAN.md +0 -23
package/examples/trading-dashboard-mock/case-study/SHIP_READY.md +0 -21
package/examples/trading-dashboard-mock/case-study/TASK.md +0 -29
package/examples/trading-dashboard-mock/case-study/TEAM_HANDOFFS.md +0 -49
package/examples/trading-dashboard-mock/case-study/VERIFY_SUMMARY.md +0 -35
package/examples/trading-dashboard-mock/case-study/WORK_SUMMARY.md +0 -27
package/examples/trading-dashboard-mock/fixtures/market.json +0 -9
package/examples/trading-dashboard-mock/index.html +0 -76
package/examples/trading-dashboard-mock/package.json +0 -9
package/examples/trading-dashboard-mock/scripts/check.mjs +0 -54
package/examples/trading-dashboard-mock/src/app.js +0 -83
package/examples/trading-dashboard-mock/src/styles.css +0 -227
package/hooks/hooks.json +0 -44
package/hooks/scripts/config-protection.js +0 -34
package/hooks/scripts/gateguard-fact-force.js +0 -146
package/hooks/scripts/persistent-mode.mjs +0 -27
package/hooks/scripts/pre-bash-dispatcher.js +0 -63
package/hooks/scripts/quality-gate.js +0 -106
package/manifests/build-modes.json +0 -61
package/manifests/install-components.json +0 -200
package/manifests/install-modules.json +0 -102
package/manifests/install-profiles.json +0 -265
package/rules/common/coding-style.md +0 -71
package/rules/common/security.md +0 -69
package/rules/common/testing.md +0 -58
package/rules/python/coding-style.md +0 -80
package/rules/python/testing.md +0 -86
package/rules/typescript/coding-style.md +0 -97
package/rules/typescript/security.md +0 -67
package/rules/typescript/testing.md +0 -78
package/schemas/agent-yaml.schema.json +0 -168
package/schemas/agent.schema.json +0 -32
package/schemas/build-modes.schema.json +0 -42
package/schemas/handoff.schema.json +0 -105
package/schemas/hooks.schema.json +0 -35
package/schemas/install-components.schema.json +0 -46
package/schemas/install-modules.schema.json +0 -39
package/schemas/install-profiles.schema.json +0 -46
package/schemas/install-state.schema.json +0 -42
package/schemas/routing.schema.json +0 -42
package/schemas/skill.schema.json +0 -19
package/scripts/agents/dispatch.js +0 -148
package/scripts/agents/runners/claude.js +0 -214
package/scripts/agents/runners/codex.js +0 -233
package/scripts/agents/runners/gemini.js +0 -92
package/scripts/agents/runners/internal.js +0 -91
package/scripts/agents/runners/mock.js +0 -107
package/scripts/auth/github-import-gh.js +0 -52
package/scripts/auth/github-login.js +0 -79
package/scripts/auth/github-logout.js +0 -21
package/scripts/auth/github-status.js +0 -46
package/scripts/build-claude.js +0 -101
package/scripts/build-codemaps.js +0 -286
package/scripts/build-codex.js +0 -93
package/scripts/build-cursor.js +0 -132
package/scripts/build-gemini.js +0 -117
package/scripts/build-opencode.js +0 -117
package/scripts/ci/catalog.js +0 -127
package/scripts/ci/check-markers.js +0 -48
package/scripts/ci/security-hardening.js +0 -270
package/scripts/ci/validate-agents.js +0 -88
package/scripts/ci/validate-hooks.js +0 -99
package/scripts/ci/validate-manifests.js +0 -158
package/scripts/ci/validate-skills.js +0 -93
package/scripts/cli/commands/auto-command.js +0 -198
package/scripts/cli/commands/build-command.js +0 -259
package/scripts/core/auth-guard.js +0 -22
package/scripts/core/build-roots.js +0 -11
package/scripts/core/cli-resolver.js +0 -64
package/scripts/core/install-state.js +0 -125
package/scripts/core/json-extractor.js +0 -32
package/scripts/core/subprocess.js +0 -74
package/scripts/daemon/wait.js +0 -278
package/scripts/demo-external-project.js +0 -222
package/scripts/demo-quick-run.js +0 -212
package/scripts/demo-review.js +0 -204
package/scripts/doctor.js +0 -296
package/scripts/install-apply.js +0 -198
package/scripts/install-plan.js +0 -451
package/scripts/lib/build-intelligence.js +0 -188
package/scripts/lib/build-modes.js +0 -38
package/scripts/lib/costs.js +0 -82
package/scripts/lib/instincts.js +0 -194
package/scripts/lib/keychain.js +0 -85
package/scripts/lib/profile-policy.js +0 -134
package/scripts/lib/profile-safety.js +0 -81
package/scripts/lib/router.js +0 -138
package/scripts/lib/token-vault.js +0 -136
package/scripts/orchestrators/ask.js +0 -143
package/scripts/orchestrators/auto.js +0 -263
package/scripts/orchestrators/build.js +0 -379
package/scripts/orchestrators/ralph.js +0 -179
package/scripts/orchestrators/review.js +0 -452
package/scripts/orchestrators/run.js +0 -151
package/scripts/orchestrators/ship.js +0 -339
package/scripts/orchestrators/team-lite.js +0 -270
package/scripts/orchestrators/team.js +0 -244
package/scripts/orchestrators/verify.js +0 -306
package/scripts/orchestrators/work.js +0 -207
package/scripts/portability/simulate-port.js +0 -220
package/scripts/repair.js +0 -184
package/scripts/sync-claude-md.js +0 -224
package/scripts/verify/claude-live.js +0 -30
package/scripts/verify/codex-live.js +0 -60
package/scripts/verify/gemini-live.js +0 -48
package/scripts/verify/runtime.js +0 -105
package/skills/acceptance-coverage/SKILL.md +0 -37
package/skills/claude-led-codex-review/SKILL.md +0 -133
package/skills/plan-eng-review/SKILL.md +0 -51
package/skills/porting/SKILL.md +0 -69
package/skills/ralph/SKILL.md +0 -48
package/skills/release-readiness/SKILL.md +0 -62
package/skills/review/SKILL.md +0 -42
package/skills/security-hardening/SKILL.md +0 -59
package/skills/ship/SKILL.md +0 -44
package/skills/tdd-workflow/SKILL.md +0 -42

package/README.md CHANGED Viewed

@@ -1,502 +1,66 @@
-# NEKOWORK
-Verified Autopilot for AI code changes.
-[![harness-validate](https://github.com/Ps-Neko/NEKOWORK/actions/workflows/harness-validate.yml/badge.svg)](https://github.com/Ps-Neko/NEKOWORK/actions/workflows/harness-validate.yml)
-AI builds. Codex verifies. You approve the boundary.
+# @ps-neko/nekowork
-NEKOWORK plans, edits, verifies, repairs, and prepares ship-ready AI code changes. Final apply remains human-controlled.
+**Local verification gate for AI-written code diffs.**
-It runs:
+AI can write 100 lines in 10 seconds. Who checks them before they hit `main`?
-1. Autonomous planning and build
-2. Independent Codex verification
-3. Bounded repair when findings are fixable
-4. Report, ship/no-ship, and Human Gate
-5. Explicit apply only when the human chooses it
+This package reviews every change your AI tool makes, flags the risky parts with
+deterministic rules, and lets **you** make the final call. It never commits,
+pushes, or deploys on its own.
-No auto-commit. No auto-push. No surprise deploy.
-Product principle:
-```text
-NEKOWORK = verified autopilot -> Codex verification -> Human Gate -> explicit apply
-```
-```text
-Autonomous until apply.
-Verified before ship.
-Human-controlled at the boundary.
-```
-NEKOWORK packages a local runtime with one source catalog, `agent.yaml`, projected into Claude Code, Codex CLI, Cursor, Gemini CLI, and OpenCode surfaces. The `harness` CLI remains a legacy/internal alias for `nekowork`.
-NEKOWORK is intentionally not a 100-agent pack. Every agent, skill, hook, profile, module, and pack must:
-1. improve verification,
-2. preserve one-executor writes,
-3. produce auditable evidence,
-4. respect Human Gate.
-**Public alpha evidence:** 14 packs / 11 profiles / 36 components / 5 harness targets / 7 case-study flows / 290 tests / 0 moderate+ npm audit issues / fresh `npx @alpha` smoke
-NEKOWORK does not automatically commit, push, publish, deploy, or apply diffs. `apply` is explicit and requires verified ship-ready evidence.
+## Status
-For bounded autonomy before that boundary, use `auto`: it can route, build, verify, repair fixable findings within a budget, write a report, and then stop before apply.
+**Phase A skeleton** (2026-05-27). The 4 public verbs work via delegation to
+`@ps-neko/nekowork-cli` in the monorepo. To publish this package
+independently, the verify-pr code path needs to be moved into this package —
+see [HANDOFF-PACKAGE-SPLIT.md](./HANDOFF-PACKAGE-SPLIT.md).
-Next track: `auto --parallel-candidates N` will let isolated candidate workers propose patches, then NEKOWORK will compare them into one canonical ship candidate before Codex verification and Human Gate.
-**Latest alpha evidence:** [CI badge](https://github.com/Ps-Neko/NEKOWORK/actions/workflows/harness-validate.yml) / [npm package](https://www.npmjs.com/package/@ps-neko/nekowork) / [smoke transcript](docs/DEMO.md#one-minute-terminal-transcript) / [report artifact](docs/DEMO-REPORT.md)
-**One-minute demo:** [terminal transcript](docs/DEMO.md#one-minute-terminal-transcript) / [full report example](docs/DEMO-REPORT.md) / [alpha feedback](https://github.com/Ps-Neko/NEKOWORK/issues/new?template=alpha-feedback.yml) / [roadmap](docs/ROADMAP.md)
-![NEKOWORK one-minute terminal demo](docs/assets/demo-terminal.svg)
-## 30-Second First Run
-Use the current npm alpha for the fastest proof of the workflow:
+For the full alpha-stage product today, install:
 ```bash
-npx -y @ps-neko/nekowork@alpha check
-npx -y @ps-neko/nekowork@alpha auto "fix failing tests safely" --session first-auto
-npx -y @ps-neko/nekowork@alpha report --session latest
+npm i -g @ps-neko/nekowork-cli@alpha
 ```
-Start with `auto` when you want NEKOWORK to keep going until report/gate. Use `build` when you want one build pass. Drop down to `work`, `verify`, and `ship` only when you need phase-level control.
-Preview the route before running providers or writing session state:
+## Quickstart (once Phase A is complete)
 ```bash
-npx -y @ps-neko/nekowork@alpha auto "fix failing tests safely" --dry-run
-npx -y @ps-neko/nekowork@alpha build "fix this safely" --dry-run
-```
-Use a source checkout for local development:
-```bash
-node scripts/cli.js check
-node scripts/cli.js auto "implement this safely" --session first-auto
-node scripts/cli.js report --session latest
-node scripts/cli.js gate status --session latest
-```
-Or use the decomposed beginner path directly:
-```bash
-node scripts/cli.js check
-node scripts/cli.js run "implement this safely" --session first-run
-node scripts/cli.js report --session first-run
-node scripts/cli.js gate status --session first-run
-```
-The simple paths map to the evidence loop: `check = doctor --quick`, `build = auto routing plus mode presets over run`, `auto = bounded build/verify/repair/report before apply`, and `run = work -> verify -> ship`.
-Use `build --dry-run` when you want to preview auto routing, mode, profile, workers, stages, and apply policy before running providers or writing session state. Use `build --explain` when you want the same routing rationale and evidence list after a real build.
-To add generated harness surfaces to another local repository:
-```bash
-cd /path/to/my-project
-npx -y @ps-neko/nekowork@alpha init --profile developer --project-root .
-```
-## Example Report
-`report` is the main trust surface. It turns session evidence into a readable `REPORT.md`:
-```text
-Verdict: approve_with_fixes
-Ship ready: false
-Human gate: required
-Applied: false
-Profile: quality
-Strict quality: enabled
-Acceptance coverage: 4/5
-Quality warnings: 2
-Evidence:
-- work-summary.json
-- verify-summary.json
-- ship-summary.json
-- gate-summary.json
+# right after your AI tool changes some files:
+npx -y @ps-neko/nekowork check        # 30-sec environment check
+npx -y @ps-neko/nekowork verify-pr    # scan the diff → get a verdict
 ```
-The first screen of `REPORT.md` is the trust card: work produced, independent verification, Human Gate, ship readiness, apply state, and whether the target project was mutated.
+`verify-pr` reads the diff, writes a plain-English `REPORT.md`, and tells you
+whether the change is safe to merge.
-See the full report contract and example artifact in [docs/DEMO-REPORT.md](docs/DEMO-REPORT.md), and the one-minute terminal transcript in [docs/DEMO.md](docs/DEMO.md).
-## Human Gate Example
-```text
-Risk: security-sensitive auth parser change
-Codex verdict: approve_with_fixes
-Ship ready: false
-Required before apply:
-[ ] Add parser boundary test
-[ ] Remove long-lived API key env fallback
-[ ] Re-run verify --strict-quality
-Decision:
-- approve
-- block
-- request fixes
-```
-Human Gate is the point where NEKOWORK stops being an autopilot and becomes an approval system.
-## Apply Preview
-Before `apply`, NEKOWORK expects the human to inspect the evidence surface:
-```text
-Session: first-work
-Diff source: captured live-work diff
-Files changed: 3
-Verifier verdict: approve
-Human gate: clear
-Ship ready: true
-Apply command: node scripts/cli.js apply --session first-work
-```
-`apply` still does not commit, push, publish, deploy, or create a PR. It only applies the verified `SHIP_READY` diff when gates are clear and the target worktree is clean.
-## Compared With Agent Packs
-| Tool pattern | Optimizes for | NEKOWORK optimizes for |
-|---|---|---|
-| Large Claude Code packs | More agents, commands, skills | Curated verification loop |
-| Team simulation | More specialist perspectives | Read-only team plus one executor |
-| Autopilot | Fast autonomous execution | verified autonomy until apply, report, gate, explicit apply |
-| Discipline workflows | Better development habits | Evidence-backed ship decision |
-## When To Choose NEKOWORK
+## The 4 verbs
-| Use case | NEKOWORK fit |
+| Verb | What it does |
 |---|---|
-| You want one command to keep working until report/gate | `auto` routes, builds, verifies, repairs, and stops before apply |
-| You want one build pass with safe routing | `build` routes the task into safe mode presets |
-| You want daily planning, TDD, debugging, and finish checks | use the `productivity` pack |
-| You want team-style review before implementation | use the `team` pack; handoffs stay read-only |
-| You need PR or release evidence | use `pr` or `release` before ship/apply |
-| You need sensitive-change control | use `security` and keep Human Gate active |
-| You need explicit apply instead of autopilot mutation | keep the default `report -> gate -> apply` path |
+| `check` | Probe environment readiness (Node version, git repo, etc.) |
+| `verify-pr` | Scan working-tree diff. Produce REPORT.md + .nekowork/decision.json |
+| `report` | Render an existing decision.json to a human-readable REPORT.md |
+| `apply` | Apply a stored .diff iff decision.json says `apply_allowed: true` |
-Use other AI development tools when they fit your preferred authoring flow. Use NEKOWORK when AI work needs to become verified, reportable, gated, and explicitly applied.
-## Three Paths
-Most users should start with the Beginner path. The other paths are for explicit phase control or legacy compatibility.
-1. Beginner verified autopilot: `check -> auto -> report -> gate`
-2. One-pass safe build: `check -> build -> report -> gate`
-3. Advanced: `ask -> plan -> team -> work -> verify -> gate -> ship -> report -> apply`
-4. Legacy: `review` / `review-cycle`
-## Why NEKOWORK
-NEKOWORK is for teams that want AI-assisted development without making the agent catalog the product. The default path keeps local auth, inspectable handoffs, single-executor writes, independent Codex verification, and Human Gate decisions in front of risky ship/apply steps.
-## Status
-- Current repository version: `0.1.0-alpha.8` alpha candidate
-- Current package name: `@ps-neko/nekowork`
-- Published CLI names: `nekowork` and `harness`
-- Current npm alpha: `@ps-neko/nekowork@0.1.0-alpha.7`
-- Current npm alpha.8 status: repository candidate; public publish is pending owner OTP/web auth
-- Supported install path today: npm alpha, clone, submodule, or local repository integration
-- Dist-tag note: use `@alpha` until a stable release; `latest` still points at the first alpha line
-- Default mode: mock providers, no API keys, no provider CLI calls
-Current local verification:
-- `npm run lint`: pass
-- `npm test`: 290 tests pass
-- `npm audit --audit-level=moderate`: 0 vulnerabilities
-- `npm pack --dry-run --json`: pass
-- `npx -y @ps-neko/nekowork@alpha check`: pass with warnings only
-## Case-study Evidence
-| Flow | Risk type | Evidence produced |
-|---|---|---|
-| Financial UI mock | UI/product risk | report + Human Gate |
-| GitHub Actions hardening | CI/security risk | security findings + no-ship/ship evidence |
-| Quality lifecycle smoke | quality risk | strict-quality + acceptance coverage |
-| npm package boundary | package/release risk | pack/audit evidence |
-| Auth parser boundary | auth/security risk | parser boundary evidence |
-| Python protocol parser | protocol correctness risk | test-backed verification |
-| Dotenv configuration boundary | config/security risk | no-secret parser evidence |
-## Official Packs
-| Pack | Adds | Use when |
-|---|---|---|
-| `core` | minimal verification runtime | first install or repo smoke |
-| `builder` | safe build modes entrypoint | one-command build with verification and gates |
-| `productivity` | planning, TDD, debugging, finish routines | daily AI-assisted development |
-| `team` | read-only role handoffs | you want team-style review before one executor writes |
-| `debugging` | failing-test and regression triage | the task starts from a bug or unclear root cause |
-| `maintenance` | dependency, refactor, migration, cleanup routines | routine upkeep still needs verification |
-| `pr` | diff review, test evidence, changelog, risk notes | preparing or reviewing a PR |
-| `catalog-plus` | richest curated catalog surface | evaluating the full NEKOWORK catalog |
-| `quality` | acceptance coverage, strict evidence prompts | feature work needs proof |
-| `security` | auth/secrets/deploy risk prompts | sensitive changes |
-| `frontend` | UI mockup, component review, accessibility checks | product-facing UI work |
-| `testing` | regression planning and coverage handoffs | test confidence is the main risk |
-| `release` | ship/no-ship evidence | pre-release checks |
-| `enterprise` | full catalog with all gates | high-control teams |
-## Quick Start Details
-Requirements: Node.js 22+, npm, and git.
-For a repository-pinned local demo:
-```bash
-git clone https://github.com/Ps-Neko/NEKOWORK.git harness
-cd harness
-npm ci
-npm run demo:quick -- --cleanup
-```
-This creates a disposable target project and runs `doctor -> build -> report -> gate status`. It uses mock providers and does not call Claude, Codex, Gemini, or paid APIs.
-To initialize another local repository with the published alpha:
-```bash
-cd /path/to/my-project
-npx -y @ps-neko/nekowork@alpha init --profile developer --project-root .
-```
-For the fuller first-run guide, see [docs/QUICKSTART.md](docs/QUICKSTART.md).
-For the trust and recovery model, see [Safety Guarantees](docs/SAFETY-GUARANTEES.md), [Failure Modes](docs/FAILURE-MODES.md), [Trust Model](docs/TRUST-MODEL.md), and [Why Not Autopilot](docs/WHY-NOT-AUTOPILOT.md).
-To see the repository-based external project flow end to end:
-```bash
-npm run demo:external
-```
-To inspect small case-study targets, see [examples/trading-dashboard-mock](examples/trading-dashboard-mock), [examples/github-actions-hardening](examples/github-actions-hardening), [examples/quality-lifecycle-smoke](examples/quality-lifecycle-smoke), and [docs/case-studies](docs/case-studies). They demonstrate financial UI, CI workflow, quality lifecycle, npm package, auth parser, Python protocol library, and environment configuration flows while still preserving Codex verification, Human Gate policy, and explicit apply control.
-## Output Shape
-```text
-doctor ... OK
-build workflow ... OK
-report ... OK
-gate status ... OK
-Demo completed: verdict=approve_with_fixes, ship_ready=false, applied=false
-```
-Outputs are written under:
-```text
-.harness/state/sessions/<session-id>/handoffs/
-.harness/state/sessions/<session-id>/REPORT.md
-```
-## Repository-Pinned Install
-```bash
-cd <target-project>
-git submodule add https://github.com/Ps-Neko/NEKOWORK.git .harness-tool
-node .harness-tool/scripts/portability/simulate-port.js . --profile developer --verbose
-node .harness-tool/scripts/install-apply.js --profile developer --project-root .
-node .harness-tool/scripts/cli.js check --project-root .
-```
-The NEKOWORK tool root stays in `.harness-tool/`. Session state, generated runtime files, and git work happen in the target project root.
-For a disposable external-project walkthrough, see [docs/EXAMPLE-PROJECT.md](docs/EXAMPLE-PROJECT.md).
-## Live Provider Auth
-Live mode delegates auth to local CLI sessions:
-```bash
-claude auth status
-codex login
-gemini
-node scripts/cli.js review "live local smoke" --live --no-ship
-```
-Long-lived API key environment variables are blocked by default before provider CLI calls:
-- Claude: `ANTHROPIC_API_KEY`
-- Codex: `OPENAI_API_KEY`
-- Gemini: `GEMINI_API_KEY`, `GOOGLE_API_KEY`
-Use API-key paths only with explicit opt-in, for example `HARNESS_AUTH_ALLOW_ENV_OVERRIDE=1`.
-## Main Surface
-The public alpha surface is intentionally small:
-- `doctor`: inspect local readiness
-- `ask`: clarify goal, scope, risk, and success criteria without provider calls
-- `plan`: create a planning handoff
-- `team`: create read-only handoffs from multiple worker perspectives
-- `work`: let a single executor produce an implement handoff and isolated diff
-- `verify`: run Codex-only verification on a prior work handoff
-- `gate`: inspect, approve, or block a human gate for a session
-- `ship`: produce a ship/no-ship readiness handoff after Codex verification
-- `apply`: apply a verified `SHIP_READY` live-work diff to the target project
-- `run`: execute the decomposed wrapper, `work -> verify -> ship`, with optional apply
-- `build`: one-command builder wrapper with default `auto` routing, explicit `fast`, `safe`, `team`, `tdd`, `release`, and `--dry-run` preview
-- `auto`: bounded autonomy wrapper that can repair fixable no-ship findings within budget, then report and stop before apply
-- `report`: summarize session evidence into `REPORT.md` without project mutation
-- `review`: run the legacy full Claude-led/Codex-reviewed workflow
-- `review-cycle`: explicit compatibility alias for the legacy full review workflow
-- `install --plan` / `install --apply`: project generated harness surfaces
-Advanced features such as `team-lite`, `ralph`, `wait`, instincts, cost tracking, and the Rust supervisor are documented in [docs/ADVANCED.md](docs/ADVANCED.md).
-`plan` is recommended before `work` for larger changes. The current `run` command intentionally stays compact: it runs `work -> verify -> ship`, records acceptance criteria through `work`, and applies only when `--apply` is explicitly provided.
-Use `build "<task>"` when NEKOWORK should be the single entrypoint. It defaults to `--mode auto`, classifies the task, selects `fast`, `safe`, `team`, `tdd`, or `release`, records build intelligence, and still uses one executor for writes, Codex verification before ship, and explicit apply only. The mode safety ordering is manifest-backed in `manifests/build-modes.json`. Use an explicit `--mode` when you need to override the router.
+Anything else (`ask`, `plan`, `team`, `work`, `ship`, `build`, `auto`,
+`pr-prep`, `review`, ...) belongs to `@ps-neko/nekowork-harness` (legacy and
+power-user surface). The slim package rejects those verbs with a redirect.
-Risky explicit overrides are protected. For example, `build "change OAuth token validation" --mode fast` is blocked because auto routing recommends `safe`, and `build "prepare npm package publish release notes" --mode fast` is blocked because auto routing recommends the higher-safety `release` mode. Use the recommended mode or add `--force-mode` only when you intentionally accept that downgrade.
+## How it works
-Use `auto "<task>"` when NEKOWORK should continue before the apply boundary. `auto` routes through the same build intelligence, runs `build`, repeats fixable no-ship work within `--level cautious|normal|aggressive` budgets, writes `auto-summary.json`, generates `REPORT.md`, and never accepts `--apply`.
-Use `--profile quality` or `--profile security` on `work`, `verify`, and `run` when a task needs stronger evidence prompts. Add `--strict-quality` to `verify`, `run`, or `build` when missing evidence or acceptance coverage should become a fix-required verdict before ship.
-Use official packs when choosing an install shape:
-```bash
-node scripts/install-plan.js --list
-node scripts/install-plan.js --pack productivity
-node scripts/install-plan.js --pack team
-node scripts/install-plan.js --pack pr
-node scripts/install-plan.js --pack builder
-node scripts/install-plan.js --pack quality
-node scripts/install-plan.js --pack security --target codex --json
-```
+1. Your AI tool writes the code. `nekowork` never writes it for you.
+2. `verify-pr` runs a fixed set of risk rules over the diff — same diff, same
+   verdict, every time. **No LLM gets to "vote" the result.**
+3. It saves the evidence into a `REPORT.md` you can read.
+4. You decide at the Human Gate — approve, or don't.
+5. Only then can `apply` apply the diff. No auto-commit. No auto-push.
+## Docs
+- [Quickstart](../nekowork-cli/docs/QUICKSTART.md)
+- [How verification works](../nekowork-cli/docs/SCOPE-1.0.md)
+- [Benchmark](../nekowork-cli/docs/BENCHMARK.md) — 73/74 (99%) recall, 0/47 FP, 38 real OSS positives
+- [Integration](../nekowork-cli/docs/INTEGRATION.md)
+## License
-Packs are aliases over validated profiles. They add clearer product packaging without weakening the core gates. `productivity` is the shortest daily discipline pack: brainstorm, plan, TDD, debug, execute, verify, report, and finish over the same safe build loop. `team`, `debugging`, `maintenance`, `pr`, and `catalog-plus` make the catalog feel richer while still resolving to safety-checked profiles.
-## Catalog
-- Agents: 11
-- Skills: 10
-- Hooks: 5
-- Modules: 7
-- Profiles: `core`, `developer`, `builder`, `productivity`, `security`, `product`, `quality`, `frontend`, `testing`, `research`, `full`
-- Official packs: `core`, `builder`, `productivity`, `team`, `debugging`, `maintenance`, `pr`, `catalog-plus`, `quality`, `security`, `frontend`, `testing`, `release`, `enterprise`
-- Harness targets: `claude`, `codex`, `cursor`, `gemini`, `opencode`
-Key skills:
-- `claude-led-codex-review`
-- `plan-eng-review`
-- `tdd-workflow`
-- `acceptance-coverage`
-- `review`
-- `ship`
-- `ralph`
-- `security-hardening`
-- `release-readiness`
-- `porting`
-## Common Commands
-```bash
-node scripts/cli.js doctor
-node scripts/cli.js doctor --quick --gemini-smoke
-npm run demo:quick
-node scripts/cli.js build "builder smoke" --mode team --session build-smoke
-node scripts/cli.js auto "fix failing tests safely" --level normal --dry-run
-node scripts/cli.js report --session latest
-node scripts/install-plan.js --list
-node scripts/install-plan.js --pack quality
-node scripts/install-plan.js --profile developer
-node scripts/install-apply.js --profile developer --project-root <target>
-node scripts/cli.js ask "clarify a risky or ambiguous request"
-node scripts/cli.js plan "draft a safe implementation plan"
-node scripts/cli.js team "collect read-only worker handoffs" --workers planner,research,security,test --no-write
-node scripts/cli.js work "implement the planned change with one executor" --single-executor --session work-smoke
-node scripts/cli.js verify "verify the implemented change" --session work-smoke
-node scripts/cli.js verify "verify quality evidence" --profile quality --strict-quality --session work-smoke
-node scripts/cli.js gate status --session work-smoke
-node scripts/cli.js ship "prepare ship readiness" --require-clean-gates --session work-smoke
-node scripts/cli.js report --session work-smoke
-node scripts/cli.js apply --session work-smoke
-node scripts/cli.js run "implement, verify, and prepare ship readiness" --session run-smoke
-node scripts/cli.js report --session run-smoke
-node scripts/cli.js review "implement and review this change" --no-ship
-node scripts/cli.js review-cycle "legacy full-cycle compatibility smoke" --no-ship
-node scripts/cli.js review "security-sensitive change" --secure --no-ship
-npm run lint
-npm test
-npm audit --audit-level=moderate
-node scripts/repair.js --check
-node scripts/sync-claude-md.js --check
-node scripts/build-codemaps.js --check
-```
-## Release Gates
-Before any tag or public npm decision, run:
-```bash
-npm run lint
-npm test
-npm audit --audit-level=moderate
-node scripts/repair.js --check
-node scripts/sync-claude-md.js --check
-node scripts/build-codemaps.js --check
-npm run security:hardening
-npm pack --dry-run --json
-```
-`npm pack --dry-run --json` currently produces a package named like `ps-neko-nekowork-0.1.0-alpha.8.tgz`. It does not publish.
-## Documentation
-- [docs/QUICKSTART.md](docs/QUICKSTART.md) - first run and common paths
-- [docs/BUILD.md](docs/BUILD.md) - build command modes and invariants
-- [docs/AUTONOMY.md](docs/AUTONOMY.md) - bounded autonomy, repair budgets, and the apply boundary
-- [docs/PARALLEL-CANDIDATES.md](docs/PARALLEL-CANDIDATES.md) - planned isolated candidate writer contract
-- [docs/PR-PREP.md](docs/PR-PREP.md) - planned PR prep artifact contract
-- [docs/WHY-NEKOWORK.md](docs/WHY-NEKOWORK.md) - comparison and product positioning
-- [docs/CATALOG-PACKS.md](docs/CATALOG-PACKS.md) - curated catalog, official packs, and case-study evidence
-- [docs/PUBLISH-ALPHA.md](docs/PUBLISH-ALPHA.md) - public npm alpha release plan
-- [docs/ROADMAP.md](docs/ROADMAP.md) - small alpha roadmap and non-goals
-- [docs/FEEDBACK-TRIAGE.md](docs/FEEDBACK-TRIAGE.md) - alpha feedback classification and response guide
-- [docs/INTERNAL-PROVIDER.md](docs/INTERNAL-PROVIDER.md) - private command adapter protocol
-- [docs/DEMO.md](docs/DEMO.md) - sample command output and generated files
-- [docs/DEMO-REPORT.md](docs/DEMO-REPORT.md) - readable session report UX
-- [docs/EXAMPLE-PROJECT.md](docs/EXAMPLE-PROJECT.md) - repository-based external project demo
-- [docs/case-studies](docs/case-studies) - real external project run evidence
-- [examples/trading-dashboard-mock](examples/trading-dashboard-mock) - standalone financial UI mock target and case-study evidence
-- [examples/quality-lifecycle-smoke](examples/quality-lifecycle-smoke) - standalone quality profile and strict-quality case-study evidence
-- [docs/SECURITY.md](docs/SECURITY.md) - local-first auth and safety model
-- [docs/ADVANCED.md](docs/ADVANCED.md) - advanced workflows and runtime features
-- [docs/SETUP.md](docs/SETUP.md) - local contributor setup and live provider smoke
-- [docs/PORTING.md](docs/PORTING.md) - using NEKOWORK in an external project
-- [docs/RELEASE-READINESS.md](docs/RELEASE-READINESS.md) - release and publish gates
-- [docs/RUNBOOK.md](docs/RUNBOOK.md) - operations guide
-- [docs/ARCHITECTURE.md](docs/ARCHITECTURE.md) - system architecture
-- [docs/PRODUCT-PRINCIPLES.md](docs/PRODUCT-PRINCIPLES.md) - product position, invariants, CLI phase semantics
-- [docs/AI-DEVELOPMENT-LIFECYCLE.md](docs/AI-DEVELOPMENT-LIFECYCLE.md) - safe build modes, quality runtime, and disciplined lifecycle
-- [docs/NAMING.md](docs/NAMING.md) - product, CLI, pack, and legacy alias naming contract
-- [docs/CORE-INVARIANTS.md](docs/CORE-INVARIANTS.md) - non-negotiable runtime safety rules
-- [docs/CLI-STAGES.md](docs/CLI-STAGES.md) - stage contract and compatibility transition
-- [docs/RISK-CLASSIFIER.md](docs/RISK-CLASSIFIER.md) - shared risk tags, challenge, and gate policy
-- [docs/examples/TRADING-DASHBOARD-MOCK.md](docs/examples/TRADING-DASHBOARD-MOCK.md) - financial mockup flow with Human Gate
-- [docs/examples/GITHUB-ACTIONS-HARDENING.md](docs/examples/GITHUB-ACTIONS-HARDENING.md) - CI workflow hardening flow with Human Gate
-- [docs/examples/QUALITY-LIFECYCLE-SMOKE.md](docs/examples/QUALITY-LIFECYCLE-SMOKE.md) - quality profile flow with evidence and acceptance coverage
-- [docs/AUDIT.md](docs/AUDIT.md) - readiness and remaining debt
-- [docs/CHANGELOG.md](docs/CHANGELOG.md) - project history
-- [SOUL.md](SOUL.md), [RULES.md](RULES.md), [AGENTS.md](AGENTS.md) - project principles and agent rules
-## License
-MIT
+MIT