npm - @ikunin/sprintpilot - Versions diffs - 2.0.10 → 2.1.0 - Mend

@ikunin/sprintpilot 2.0.10 → 2.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (42) hide show

package/README.md +245 -10
package/_Sprintpilot/Sprintpilot.md +1 -1
package/_Sprintpilot/bin/autopilot.js +581 -0
package/_Sprintpilot/lib/orchestrator/action-ledger.js +148 -0
package/_Sprintpilot/lib/orchestrator/adapt.js +502 -0
package/_Sprintpilot/lib/orchestrator/decision-log.js +224 -0
package/_Sprintpilot/lib/orchestrator/divergence.js +201 -0
package/_Sprintpilot/lib/orchestrator/git-plan.js +259 -0
package/_Sprintpilot/lib/orchestrator/impact-classifier.js +108 -0
package/_Sprintpilot/lib/orchestrator/land.js +155 -0
package/_Sprintpilot/lib/orchestrator/parallel-batch.js +99 -0
package/_Sprintpilot/lib/orchestrator/profile-rules.js +167 -0
package/_Sprintpilot/lib/orchestrator/report.js +95 -0
package/_Sprintpilot/lib/orchestrator/state-machine.js +402 -0
package/_Sprintpilot/lib/orchestrator/state-store.js +260 -0
package/_Sprintpilot/lib/orchestrator/user-command-applier.js +157 -0
package/_Sprintpilot/lib/orchestrator/user-commands.js +115 -0
package/_Sprintpilot/lib/orchestrator/verify.js +397 -0
package/_Sprintpilot/manifest.yaml +1 -1
package/_Sprintpilot/modules/git/config.yaml +26 -0
package/_Sprintpilot/scripts/agent-adapter.js +4 -5
package/_Sprintpilot/scripts/auto-merge-bmad-docs.js +112 -0
package/_Sprintpilot/scripts/dispatch-layer.js +12 -8
package/_Sprintpilot/scripts/infer-dependencies.js +78 -21
package/_Sprintpilot/scripts/inject-tasks-section.js +4 -3
package/_Sprintpilot/scripts/land-this-pr.js +110 -0
package/_Sprintpilot/scripts/lint-test-pitfalls.js +133 -0
package/_Sprintpilot/scripts/list-remaining-stories.js +1 -1
package/_Sprintpilot/scripts/log-timing.js +12 -3
package/_Sprintpilot/scripts/merge-shards.js +32 -12
package/_Sprintpilot/scripts/post-green-gates.js +187 -0
package/_Sprintpilot/scripts/preflight-merge.js +2 -1
package/_Sprintpilot/scripts/resolve-dag.js +3 -1
package/_Sprintpilot/scripts/stack-snapshot.js +128 -0
package/_Sprintpilot/scripts/state-shard.js +8 -1
package/_Sprintpilot/scripts/summarize-timings.js +30 -12
package/_Sprintpilot/scripts/with-retry.js +17 -5
package/_Sprintpilot/skills/sprint-autopilot-on/SKILL.md +23 -1
package/_Sprintpilot/skills/sprint-autopilot-on/workflow.orchestrator.md +148 -0
package/lib/core/update-check.js +11 -1
package/package.json +1 -1
package/_Sprintpilot/skills/sprint-autopilot-on/workflow.md +0 -1388

package/_Sprintpilot/skills/sprint-autopilot-on/workflow.orchestrator.md ADDED Viewed

@@ -0,0 +1,148 @@
+# Sprintpilot — ON (Orchestrator Mode)
+You are running under the **orchestrator-driven** autopilot. Flow control is
+owned by `_Sprintpilot/bin/autopilot.js` — a deterministic Node.js
+state machine that enforces the BMad 7-step sequence. You own the
+*in-skill execution, diagnosis, triage, and small-judgment decisions* —
+not the flow.
+This file is the **≤150-line** authoritative workflow. It is the only
+workflow shipped — the v2.0.x prose `workflow.md` is gone.
+## The loop
+Repeat until the orchestrator emits `halt`:
+1. `node _Sprintpilot/bin/autopilot.js next` → JSON Action.
+2. Execute the Action per the dispatch table below.
+3. While executing, scan the host chat for user interjections. If
+   present, record them via a `user_input` signal (step 4) and re-loop.
+4. `node _Sprintpilot/bin/autopilot.js record --signal <json>` → JSON
+   `{ action, verdict, phase, profile }`. Use the new action.
+5. If `verdict: prompted` → ask the user the question in `action.prompt`.
+   Apply their answer as a `user_input` signal and re-loop.
+6. If `action.type: halt` → STOP. The orchestrator already wrote
+   resume state and (when relevant) the fresh-context handoff flag.
+Never improvise. Never skip a step. Never invent a next action: the
+orchestrator emits it.
+## Action dispatch
+| `action.type`     | What you do                                                                                      |
+|-------------------|--------------------------------------------------------------------------------------------------|
+| `invoke_skill`    | Run the named BMad skill **verbatim from its own body** (e.g. `bmad-create-story`, `bmad-quick-dev`, `bmad-code-review`). `action.template_slots` is a parameter bag (story_key, prior_diagnosis, relevant_decisions, prior_signals_summary, …) — it's input context for BMad's skill, NOT a replacement for the skill's instructions. When `implementation_flow=quick`, you'll receive `invoke_skill: bmad-quick-dev` per story — follow BMad's `step-oneshot.md`. |
+| `run_script`      | Execute `action.command` via the host's shell-equivalent. Argv-only — no shell interpolation.    |
+| `git_op`          | Execute `action.steps` in order. The orchestrator pre-plans every git op (commit_and_push_story, merge_epic, push, fetch, create_branch) via `git-plan.js` and inlines the resulting argv sequence — each step has `args: [cmd, ...argv]`, a `description`, and an optional `retry` policy. Run each step's argv verbatim (NO shell interpolation), halt on first non-retryable failure. Never improvise the git commands or skip a step — `git push` lives in `steps`, not in `op`. |
+| `parallel_batch`  | Dispatch each child action concurrently (M6+ hosts only — fall back to sequential otherwise).    |
+| `user_prompt`     | Ask the user `action.prompt`. Pass the answer back via `user_input` signal.                      |
+| `halt`            | Stop. Honor `action.handoff: 'sprint_finalize_pending'` by ending the session cleanly.           |
+| `noop`            | Re-loop (state machine advancing without an external effect).                                    |
+## Signals you emit
+Wrap everything in `{ "status": "...", ... }` and pass to
+`autopilot record --signal '<json>'`. Optional `decisions[]` on any signal.
+| `status`              | Required fields                                                                                  |
+|-----------------------|--------------------------------------------------------------------------------------------------|
+| `success`             | `output?: object` (for `bmad-code-review` MUST include `findings[]` with `action: 'block'\|'patch'\|'defer'`); `next_skill_hint?` |
+| `failure`             | `reason`, `diagnosis` (first-class — fed back into next retry), `recoverable: boolean`           |
+| `blocked`             | `blocker_kind` (one of the 5 TRUE BLOCKERS or recoverable kinds), `details`, `user_input_needed`, `consecutive_count?` |
+| `propose_alternative` | `reason`, `alternative` (full Action object), `urgency_hint?` (raises impact only)               |
+| `user_input`          | `commands: UserCommand[]` (validated server-side; see user-commands.js)                          |
+| `verify_override`     | `evidence: { decision_log_ref?, explanation, expected_paths? }` — used when verify.js is wrong   |
+## TRUE BLOCKER kinds (per AGENTS.md)
+`creative_user_input_required`, `new_external_dependency`,
+`consecutive_test_failures` (carry `consecutive_count`),
+`security_architectural_decision`, `contradictory_acceptance_criteria`.
+Plus recoverable kinds the orchestrator handles deterministically:
+`missing_dependency`, `failed_invariant`, `external_service`, `unknown`.
+## Decision audit channel
+Include `decisions[]` on ANY signal to log small judgment calls without
+round-tripping through `propose_alternative`. Each entry has
+`category` (one of: architecture, test-strategy, dependency,
+review-triage, review-accept, halt-recovery, scope, workaround), `impact`
+(low/medium/high), `phase` (e.g. `dev-story:RED`), `decision`, and
+`rationale`. The orchestrator stamps id + timestamp + story
+automatically and appends to `decision-log.yaml`.
+## Code-review triage
+When `bmad-code-review` completes, `success.output.findings[]` MUST
+classify each finding:
+- `action: 'block'`   → orchestrator pauses the autopilot. Manual decision required.
+- `action: 'patch'`   → orchestrator enters PATCH_APPLY (step 6a) then PATCH_RETEST (step 6b).
+- `action: 'defer'`   → recorded but not blocking; story proceeds.
+This is LLM intelligence the orchestrator routes on — you own the triage.
+## BMad bookkeeping is enforced
+`verify.js` checks more than artifact existence — it enforces the BMad
+bookkeeping you'd otherwise be tempted to skip:
+| Phase | Bookkeeping that MUST be true before you signal `success` |
+|---|---|
+| `create_story` | Story file has `## Acceptance Criteria` (≥1 bullet) AND a `## Tasks` (or `## Tasks/Subtasks`) section with at least one `[ ]` or `[x]` checkbox. |
+| `dev_red` / `dev_green` | Test files exist on disk; runner exit codes match the phase contract; `tests_run` matches the runner's count. |
+| `code_review` | `_bmad-output/reviews/<story_key>.md` exists; `findings[]` carries `{id, severity, category, action: 'block'\|'patch'\|'defer', rationale}` for every finding. |
+| `patch_apply` | Every `patch_finding` id present in `state.patch_findings` is included in `applied_finding_ids`. |
+| `story_done` (and `nano_quick_dev`) | sprint-status.yaml shows this story as `done` (under `development_status.<story_key>` or inline). Story file has zero remaining `[ ]` task boxes — dev-story is responsible for flipping them to `[x]`. `commit_sha` and `branch` reported; `story_key` matches. **`git_steps_completed: true` in success output** — set this ONLY after every step in `action.steps` (the orchestrator's decorated git plan: `git add`, `git commit`, `git push -u origin <branch>`) has exited 0. Skipping `git push` and reporting success leaves the branch unpushed and trips this check. |
+| `retrospective` | `_bmad-output/retrospectives/<epic>.md` exists. |
+Skipping any of these — even when the code is "obviously done" — produces
+`verify_rejected` in the ledger and the orchestrator re-emits the same
+action with the verifier's issues threaded into the template slot. After
+the per-profile `verify_reject_budget` is exhausted, the session pauses
+for the user.
+If you're confident verify is wrong (e.g. you renamed a test file per a
+logged decision), emit `verify_override` with `evidence.expected_paths`
+and a `decision_log_ref`. The orchestrator re-runs verify with augmented
+expectations.
+After N consecutive verify rejections on the same state (profile-
+configured budget), the orchestrator escalates to `user_prompt`.
+## Git workflow knobs
+These knobs in `_Sprintpilot/modules/git/config.yaml` change what the
+orchestrator emits as `git_op` / `run_script` actions. Always read them
+from the action payload — do NOT improvise.
+| Knob | Values | Behavior |
+|---|---|---|
+| `granularity` | `story` (default) / `epic` | Per-unit branch creation (default). Suppressed when `reuse_user_branch=true`. |
+| `reuse_user_branch` | `false` (default) / `true` | If `true`, autopilot detects the current non-base branch on boot and commits **every** story onto it. No `story/*` or `epic/*` branches are created. PR is opened from this branch at sprint-end. |
+| `merge_strategy` | `stacked` (default) / `land_as_you_go` | `stacked` keeps every story-branch open until sprint-end. `land_as_you_go` runs the new `STORY_LAND` state right after STORY_DONE to merge the PR immediately. |
+| `land_when` | `no_wait` / `ci_pass` (default) / `ci_and_review` | Under `land_as_you_go`, when to merge: synchronously, after CI is green, or after CI + an approved review. |
+| `land_wait_minutes` | int (default 30) | Max wait for CI / review under `land_as_you_go`. After this the orchestrator halts and prompts. |
+On `STORY_LAND` rebase conflicts (base moved during the story), the
+orchestrator auto-rebases the story branch onto latest base. If the
+rebase has conflicts, the orchestrator halts with a `user_prompt`. You
+resolve conflicts manually, then resume autopilot — it retries the
+land step from `state.land_pending`.
+## Resume
+On the next `autopilot start`, the orchestrator fingerprints
+`_bmad-output/`, sprint-status.yaml, and per-story branch HEADs against
+the fingerprint recorded at the last halt. Any divergence is emitted as
+`{ kind: 'resume_divergence', differences: ... }`. Forward the diff to
+the user; let them resolve via `user_input` (`force_continue` or
+`override_decision`).
+## What you must NEVER do
+- Decide the next BMad step yourself; skip step-6 patch loop; auto-accept
+  your own `propose_alternative`; write `autopilot-state.yaml` /
+  `ledger.jsonl` directly; commit secrets; use `git push --no-verify`;
+  drift from the typed Signal schema.

package/lib/core/update-check.js CHANGED Viewed

@@ -5,7 +5,17 @@ const PACKAGE_NAME = '@ikunin/sprintpilot';
 function spawnCapture(cmd, args, { timeoutMs = 7000 } = {}) {
   return new Promise((resolve) => {
-    const child = execFile(cmd, args, { timeout: timeoutMs, windowsHide: true }, (err, stdout) => {
+    // On Windows, npm ships as `npm.cmd` (a batch shim). Node's `execFile`
+    // doesn't resolve `.cmd` / `.bat` extensions through PATHEXT — pass
+    // `shell: true` so cmd.exe handles the lookup. Args are hardcoded
+    // here so there's no injection risk. (Linux/macOS execFile is fine
+    // as-is.)
+    const opts = {
+      timeout: timeoutMs,
+      windowsHide: true,
+      shell: process.platform === 'win32',
+    };
+    const child = execFile(cmd, args, opts, (err, stdout) => {
       if (err) return resolve(null);
       resolve(String(stdout || '').trim());
     });

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@ikunin/sprintpilot",
-  "version": "2.0.10",
+  "version": "2.1.0",
   "description": "Sprintpilot — autopilot and multi-agent addon for BMad Method v6: git workflow, parallel agents, autonomous story execution",
   "license": "Apache-2.0",
   "repository": {