npm - oh-my-codex - Versions diffs - 0.18.1 → 0.18.2 - Mend

oh-my-codex 0.18.1 → 0.18.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (204) hide show

package/Cargo.lock +6 -6
package/Cargo.toml +1 -1
package/README.md +4 -2
package/dist/agents/__tests__/definitions.test.js +14 -0
package/dist/agents/__tests__/definitions.test.js.map +1 -1
package/dist/agents/__tests__/native-config.test.js +19 -0
package/dist/agents/__tests__/native-config.test.js.map +1 -1
package/dist/agents/definitions.d.ts.map +1 -1
package/dist/agents/definitions.js +30 -0
package/dist/agents/definitions.js.map +1 -1
package/dist/agents/native-config.d.ts +1 -0
package/dist/agents/native-config.d.ts.map +1 -1
package/dist/agents/native-config.js +4 -0
package/dist/agents/native-config.js.map +1 -1
package/dist/catalog/__tests__/generator.test.js +4 -0
package/dist/catalog/__tests__/generator.test.js.map +1 -1
package/dist/cli/__tests__/doctor-warning-copy.test.js +61 -5
package/dist/cli/__tests__/doctor-warning-copy.test.js.map +1 -1
package/dist/cli/__tests__/index.test.js +161 -21
package/dist/cli/__tests__/index.test.js.map +1 -1
package/dist/cli/__tests__/launch-fallback.test.js +51 -3
package/dist/cli/__tests__/launch-fallback.test.js.map +1 -1
package/dist/cli/__tests__/question.test.js +2 -2
package/dist/cli/__tests__/question.test.js.map +1 -1
package/dist/cli/doctor.d.ts.map +1 -1
package/dist/cli/doctor.js +178 -7
package/dist/cli/doctor.js.map +1 -1
package/dist/cli/index.d.ts +7 -1
package/dist/cli/index.d.ts.map +1 -1
package/dist/cli/index.js +143 -43
package/dist/cli/index.js.map +1 -1
package/dist/config/__tests__/codex-hooks.test.js +3 -3
package/dist/config/__tests__/codex-hooks.test.js.map +1 -1
package/dist/config/codex-hooks.d.ts +1 -0
package/dist/config/codex-hooks.d.ts.map +1 -1
package/dist/config/codex-hooks.js +2 -4
package/dist/config/codex-hooks.js.map +1 -1
package/dist/config/generator.d.ts +14 -0
package/dist/config/generator.d.ts.map +1 -1
package/dist/config/generator.js +100 -1
package/dist/config/generator.js.map +1 -1
package/dist/goal-workflows/__tests__/codex-goal-snapshot.test.js +21 -0
package/dist/goal-workflows/__tests__/codex-goal-snapshot.test.js.map +1 -1
package/dist/goal-workflows/codex-goal-snapshot.d.ts +3 -0
package/dist/goal-workflows/codex-goal-snapshot.d.ts.map +1 -1
package/dist/goal-workflows/codex-goal-snapshot.js +45 -2
package/dist/goal-workflows/codex-goal-snapshot.js.map +1 -1
package/dist/hooks/__tests__/autopilot-skill-contract.test.js +17 -0
package/dist/hooks/__tests__/autopilot-skill-contract.test.js.map +1 -1
package/dist/hooks/__tests__/keyword-detector.test.js +170 -15
package/dist/hooks/__tests__/keyword-detector.test.js.map +1 -1
package/dist/hooks/__tests__/prometheus-strict-contract.test.d.ts +2 -0
package/dist/hooks/__tests__/prometheus-strict-contract.test.d.ts.map +1 -0
package/dist/hooks/__tests__/prometheus-strict-contract.test.js +320 -0
package/dist/hooks/__tests__/prometheus-strict-contract.test.js.map +1 -0
package/dist/hooks/__tests__/prompt-guidance-wave-two.test.js +12 -0
package/dist/hooks/__tests__/prompt-guidance-wave-two.test.js.map +1 -1
package/dist/hooks/__tests__/research-workflow-boundaries.test.d.ts +2 -0
package/dist/hooks/__tests__/research-workflow-boundaries.test.d.ts.map +1 -0
package/dist/hooks/__tests__/research-workflow-boundaries.test.js +35 -0
package/dist/hooks/__tests__/research-workflow-boundaries.test.js.map +1 -0
package/dist/hooks/keyword-detector.d.ts +1 -1
package/dist/hooks/keyword-detector.d.ts.map +1 -1
package/dist/hooks/keyword-detector.js +28 -6
package/dist/hooks/keyword-detector.js.map +1 -1
package/dist/hooks/keyword-registry.d.ts.map +1 -1
package/dist/hooks/keyword-registry.js +1 -0
package/dist/hooks/keyword-registry.js.map +1 -1
package/dist/hooks/prompt-guidance-contract.d.ts.map +1 -1
package/dist/hooks/prompt-guidance-contract.js +11 -0
package/dist/hooks/prompt-guidance-contract.js.map +1 -1
package/dist/hud/__tests__/hud-tmux-injection.test.js +22 -0
package/dist/hud/__tests__/hud-tmux-injection.test.js.map +1 -1
package/dist/hud/__tests__/reconcile.test.js +121 -10
package/dist/hud/__tests__/reconcile.test.js.map +1 -1
package/dist/hud/__tests__/render.test.js +84 -0
package/dist/hud/__tests__/render.test.js.map +1 -1
package/dist/hud/__tests__/state.test.js +51 -1
package/dist/hud/__tests__/state.test.js.map +1 -1
package/dist/hud/__tests__/tmux.test.js +69 -23
package/dist/hud/__tests__/tmux.test.js.map +1 -1
package/dist/hud/index.d.ts +1 -1
package/dist/hud/index.d.ts.map +1 -1
package/dist/hud/index.js +8 -3
package/dist/hud/index.js.map +1 -1
package/dist/hud/reconcile.d.ts.map +1 -1
package/dist/hud/reconcile.js +6 -3
package/dist/hud/reconcile.js.map +1 -1
package/dist/hud/render.d.ts.map +1 -1
package/dist/hud/render.js +26 -0
package/dist/hud/render.js.map +1 -1
package/dist/hud/state.d.ts +2 -1
package/dist/hud/state.d.ts.map +1 -1
package/dist/hud/state.js +62 -1
package/dist/hud/state.js.map +1 -1
package/dist/hud/tmux.d.ts +10 -3
package/dist/hud/tmux.d.ts.map +1 -1
package/dist/hud/tmux.js +59 -10
package/dist/hud/tmux.js.map +1 -1
package/dist/hud/types.d.ts +22 -0
package/dist/hud/types.d.ts.map +1 -1
package/dist/hud/types.js.map +1 -1
package/dist/pipeline/__tests__/orchestrator.test.js +63 -1
package/dist/pipeline/__tests__/orchestrator.test.js.map +1 -1
package/dist/pipeline/__tests__/stages.test.js +410 -4
package/dist/pipeline/__tests__/stages.test.js.map +1 -1
package/dist/pipeline/orchestrator.d.ts.map +1 -1
package/dist/pipeline/orchestrator.js +29 -2
package/dist/pipeline/orchestrator.js.map +1 -1
package/dist/pipeline/stages/ralplan.d.ts.map +1 -1
package/dist/pipeline/stages/ralplan.js +41 -6
package/dist/pipeline/stages/ralplan.js.map +1 -1
package/dist/question/__tests__/ui.test.js +43 -10
package/dist/question/__tests__/ui.test.js.map +1 -1
package/dist/question/ui.d.ts +12 -0
package/dist/question/ui.d.ts.map +1 -1
package/dist/question/ui.js +83 -46
package/dist/question/ui.js.map +1 -1
package/dist/ralplan/__tests__/runtime.test.js +200 -10
package/dist/ralplan/__tests__/runtime.test.js.map +1 -1
package/dist/ralplan/consensus-gate.d.ts +23 -0
package/dist/ralplan/consensus-gate.d.ts.map +1 -0
package/dist/ralplan/consensus-gate.js +212 -0
package/dist/ralplan/consensus-gate.js.map +1 -0
package/dist/ralplan/runtime.d.ts +25 -0
package/dist/ralplan/runtime.d.ts.map +1 -1
package/dist/ralplan/runtime.js +144 -8
package/dist/ralplan/runtime.js.map +1 -1
package/dist/scripts/__tests__/codex-native-hook.test.js +626 -7
package/dist/scripts/__tests__/codex-native-hook.test.js.map +1 -1
package/dist/scripts/__tests__/docs-site-contract.test.d.ts +2 -0
package/dist/scripts/__tests__/docs-site-contract.test.d.ts.map +1 -0
package/dist/scripts/__tests__/docs-site-contract.test.js +42 -0
package/dist/scripts/__tests__/docs-site-contract.test.js.map +1 -0
package/dist/scripts/__tests__/notify-dispatcher.test.js +115 -2
package/dist/scripts/__tests__/notify-dispatcher.test.js.map +1 -1
package/dist/scripts/__tests__/run-test-files.test.js +57 -0
package/dist/scripts/__tests__/run-test-files.test.js.map +1 -1
package/dist/scripts/__tests__/verify-native-agents.test.js +2 -2
package/dist/scripts/__tests__/verify-native-agents.test.js.map +1 -1
package/dist/scripts/codex-native-hook.d.ts.map +1 -1
package/dist/scripts/codex-native-hook.js +214 -34
package/dist/scripts/codex-native-hook.js.map +1 -1
package/dist/scripts/notify-dispatcher.js +188 -4
package/dist/scripts/notify-dispatcher.js.map +1 -1
package/dist/scripts/run-test-files.js +13 -0
package/dist/scripts/run-test-files.js.map +1 -1
package/dist/state/__tests__/workflow-transition.test.js +6 -0
package/dist/state/__tests__/workflow-transition.test.js.map +1 -1
package/dist/state/workflow-transition.d.ts +1 -1
package/dist/state/workflow-transition.d.ts.map +1 -1
package/dist/state/workflow-transition.js +7 -0
package/dist/state/workflow-transition.js.map +1 -1
package/dist/subagents/tracker.d.ts.map +1 -1
package/dist/subagents/tracker.js +4 -3
package/dist/subagents/tracker.js.map +1 -1
package/dist/team/__tests__/runtime.test.js +36 -44
package/dist/team/__tests__/runtime.test.js.map +1 -1
package/dist/team/__tests__/tmux-session.test.js +58 -18
package/dist/team/__tests__/tmux-session.test.js.map +1 -1
package/dist/team/runtime.d.ts.map +1 -1
package/dist/team/runtime.js +10 -20
package/dist/team/runtime.js.map +1 -1
package/dist/team/tmux-session.d.ts.map +1 -1
package/dist/team/tmux-session.js +15 -6
package/dist/team/tmux-session.js.map +1 -1
package/dist/ultragoal/__tests__/artifacts.test.js +50 -0
package/dist/ultragoal/__tests__/artifacts.test.js.map +1 -1
package/dist/ultragoal/artifacts.d.ts.map +1 -1
package/dist/ultragoal/artifacts.js +28 -2
package/dist/ultragoal/artifacts.js.map +1 -1
package/package.json +1 -1
package/plugins/oh-my-codex/.codex-plugin/plugin.json +1 -1
package/plugins/oh-my-codex/skills/autopilot/SKILL.md +16 -4
package/plugins/oh-my-codex/skills/autoresearch/SKILL.md +4 -0
package/plugins/oh-my-codex/skills/autoresearch-goal/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/best-practice-research/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/pipeline/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/plan/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/prometheus-strict/README.md +35 -0
package/plugins/oh-my-codex/skills/prometheus-strict/SKILL.md +219 -0
package/plugins/oh-my-codex/skills/ralplan/SKILL.md +18 -3
package/prompts/prometheus-strict-metis.md +274 -0
package/prompts/prometheus-strict-momus.md +82 -0
package/prompts/prometheus-strict-oracle.md +107 -0
package/prompts/researcher.md +22 -3
package/skills/autopilot/SKILL.md +16 -4
package/skills/autoresearch/SKILL.md +4 -0
package/skills/autoresearch-goal/SKILL.md +1 -1
package/skills/best-practice-research/SKILL.md +1 -1
package/skills/pipeline/SKILL.md +1 -1
package/skills/plan/SKILL.md +1 -1
package/skills/prometheus-strict/README.md +35 -0
package/skills/prometheus-strict/SKILL.md +219 -0
package/skills/ralplan/SKILL.md +18 -3
package/src/scripts/__tests__/codex-native-hook.test.ts +769 -8
package/src/scripts/__tests__/docs-site-contract.test.ts +47 -0
package/src/scripts/__tests__/notify-dispatcher.test.ts +132 -3
package/src/scripts/__tests__/run-test-files.test.ts +67 -0
package/src/scripts/__tests__/verify-native-agents.test.ts +2 -2
package/src/scripts/codex-native-hook.ts +237 -30
package/src/scripts/notify-dispatcher.ts +202 -4
package/src/scripts/run-test-files.ts +13 -0
package/templates/catalog-manifest.json +22 -0

package/skills/autoresearch-goal/SKILL.md CHANGED Viewed

@@ -5,7 +5,7 @@ description: Durable professor-critic research workflow over Codex goal mode wit
 # Autoresearch Goal
-Use this workflow when a research mission should be bound to Codex goal-mode focus while OMX remains the durable state owner.
+Use this workflow when a research mission should be bound to Codex goal-mode focus while OMX remains the durable state owner. This is for research projects that need Codex goal-mode management plus professor/critic-style validation; it is not the default answer for ordinary pre-planning best-practice lookup.
 ## Boundary
 - Do **not** use or revive the deprecated `omx autoresearch` direct launch surface.

package/skills/best-practice-research/SKILL.md CHANGED Viewed

@@ -10,7 +10,7 @@ Use this skill when a task depends on current external best practices, version-a
 ## Purpose
-Produce a cited, reusable best-practice answer or handoff that separates current external evidence from repo-local facts and dependency-selection decisions.
+Produce a cited, reusable best-practice answer or handoff that separates current external evidence from repo-local facts and dependency-selection decisions. For pre-planning investigation, this is the ordinary first research wrapper: gather official/upstream evidence, then hand it to `$ralplan` or the caller as planning input. Do not present `$best-practice-research` as a final architecture component or as a validator-gated research loop.
 ## Activate When

package/skills/pipeline/SKILL.md CHANGED Viewed

@@ -46,7 +46,7 @@ return a `StageResult` with status, artifacts, and duration.
 ## Built-in Stages
 - **deep-interview**: Requirements clarification and ambiguity gate.
-- **ralplan**: Consensus planning (planner + architect + critic). Skips only when both `prd-*.md` and `test-spec-*.md` planning artifacts already exist, and carries any `deep-interview-*.md` spec paths forward for traceability.
+- **ralplan**: Consensus planning (planner + architect + critic). Skips only when both `prd-*.md` and `test-spec-*.md` planning artifacts already exist **and** durable consensus evidence records Architect approval followed by Critic approval. Plan/test-spec files alone are not consensus evidence. If either review is missing, blocked, out of order, or non-approving, the stage remains in ralplan or fails with an explicit blocker/max-iteration outcome instead of progressing to execution. Carries any `deep-interview-*.md` spec paths forward for traceability.
 - **ultragoal**: Durable goal-mode execution with `.omx/ultragoal` ledgers. Launch `$team` only from inside an Ultragoal story when parallel lanes are warranted.
 - **code-review**: Merge-readiness review gate.
 - **ultraqa**: Adversarial QA gate after a clean review; docs-only/trivially non-runtime changes may record an explicit skip reason.

package/skills/plan/SKILL.md CHANGED Viewed

@@ -95,7 +95,7 @@ Jumping into code without understanding requirements leads to rework, scope cree
    c. Update the plan file in `.omx/plans/` with the accepted improvements (add missing details, refine steps, strengthen acceptance criteria, ADR updates, etc.)
    d. Note which improvements were applied in a brief changelog section at the end of the plan
    e. Before any execution handoff, derive an explicit **available-agent-types roster** from the known prompt catalog and add concrete **follow-up staffing guidance** for `$ultragoal` and `$team` (recommended roles, counts, suggested reasoning levels by lane, and why each lane exists), plus an explicit `$ralph` fallback note only when persistent single-owner verification is intentionally selected
-   f. Add a product-facing **Goal-Mode Follow-up Suggestions** section: recommend `$ultragoal` by default for general goal-oriented follow-up, `$autoresearch-goal` when the context is a research project, and `$performance-goal` when the context is an optimization or performance project. Keep these suggestions alongside the Team path and any explicit Ralph fallback rather than replacing implementation-delivery guidance. For durable-goal work that is also parallelizable, explicitly recommend **Team + Ultragoal**: Ultragoal remains leader-owned goal/ledger state and Team returns checkpoint-ready execution evidence.
+   f. Add a product-facing **Goal-Mode Follow-up Suggestions** section: recommend `$ultragoal` by default for general goal-oriented follow-up, `$autoresearch-goal` only when the context is a research project with a research deliverable/evaluator, and `$performance-goal` when the context is an optimization or performance project. Keep these suggestions alongside the Team path and any explicit Ralph fallback rather than replacing implementation-delivery guidance. For ordinary pre-planning external docs or best-practice lookup, cite `$best-practice-research` evidence and synthesize it into the plan instead of recommending Autoresearch as a final architecture component. For durable-goal work that is also parallelizable, explicitly recommend **Team + Ultragoal**: Ultragoal remains leader-owned goal/ledger state and Team returns checkpoint-ready execution evidence.
    g. For the `$team` path, add an explicit launch-hint block with concrete `omx team` / `$team` commands and a **team verification path** (what Team proves before shutdown and what Ultragoal checkpoints as durable completion evidence). Distinguish Team + Ultragoal from any explicit Ralph fallback: Team handles coordinated parallel lanes; Ultragoal is the default durable follow-up/ledger owner, and Ralph is only an explicitly requested legacy-style persistent sequential verification/fix lane when needed.
 7. On Critic approval (with improvements applied): *(--interactive only)* If running with `--interactive`, use `AskUserQuestion` / the structured question UI to present the plan with these options:
    - **Approve durable goal execution** — proceed via `$ultragoal` by default (optionally with `$team` for parallel lanes)

package/skills/prometheus-strict/README.md ADDED Viewed

@@ -0,0 +1,35 @@
+# Prometheus Strict
+`$prometheus-strict` is a clean-room OMX planning skill for rigorous interview-driven planning before execution.
+It is inspired by the high-level OMO Prometheus concept only. It does not copy OMO source text, prompts, runtime code, or workflow implementation.
+Credit: Inspired by OMO Prometheus (`code-yeongyu/oh-my-openagent`), reimplemented from concept under MIT.
+## Roles
+- **Metis** clarifies requirements, constraints, non-goals, and acceptance criteria.
+- **Momus** challenges assumptions, scope, handoff risks, and missing verification.
+- **Oracle** synthesizes the approved plan and recommends the OMX-native handoff.
+## OMX Handoff
+Prometheus Strict is planning-only by default. It should hand off to:
+1. `$ultragoal` for durable goal execution.
+2. `$team` only when the Oracle plan identifies independent parallel lanes.
+## Non-Goals
+- No hook implementation.
+- No Sisyphus or `start-work` port.
+- No direct implementation unless a downstream execution workflow is explicitly invoked.
+- No verbatim source copying from the inspiration project.
+## Expected Output
+The skill returns a Prometheus Strict Plan with clarified requirements, resolved critique, an Oracle execution plan, a verification matrix, an optional durable artifact path under `.omx/plans/prometheus-strict/`, and clean-room credit.
+## Durable Plan Artifacts
+When the plan should survive handoff or review, write the final Oracle synthesis to `.omx/plans/prometheus-strict/<slug>.md` and include that path in the plan before invoking `$ultragoal` or `$team`. Inline-only plans may set the artifact path to `N/A - inline plan only`.

package/skills/prometheus-strict/SKILL.md ADDED Viewed

@@ -0,0 +1,219 @@
+---
+name: prometheus-strict
+description: "[OMX] Clean-room interview-driven planner: Metis clarifies, Momus challenges, Oracle synthesizes, then hands off to $ultragoal/$team."
+argument-hint: "<goal or problem statement>"
+---
+# Prometheus Strict
+Clean-room OMX planning workflow inspired by the high-level OMO Prometheus concept only. This skill does not copy implementation, prompts, wording, control flow, or runtime code from OMO. It reimplements the idea under this repository's MIT-licensed skill conventions.
+Credit: Inspired by OMO Prometheus (`code-yeongyu/oh-my-openagent`), reimplemented from concept under MIT.
+<Purpose>
+Prometheus Strict creates a rigorous plan before execution when ambiguity is still risky. It separates three planning voices: Metis clarifies requirements, Momus challenges assumptions and validation gaps, and Oracle synthesizes the handoff-ready OMX-native plan.
+The output is a planning-only artifact for `$ultragoal` and, when independent lanes are justified, `$team`. When a durable artifact is useful, store or request the final plan under `.omx/plans/prometheus-strict/`.
+</Purpose>
+<Use_When>
+- The task is important enough that a shallow plan could produce wrong work.
+- Requirements are partially known but acceptance criteria, boundaries, risks, or validation are incomplete.
+- The user wants a strict interview before execution.
+- A future `$ultragoal` story needs durable scope, tests, and handoff sequencing.
+- A team split may be needed, but the lanes are not yet safe to assign.
+</Use_When>
+<Do_Not_Use_When>
+- The user asks for immediate implementation of a clear, low-risk change; use the normal executor path.
+- The task is only a repository lookup or explanation; use `explore`/`analyze` as appropriate.
+- The user needs adversarial execution QA after code changes; use `$ultraqa`.
+- The user wants hook behavior, Sisyphus behavior, or a `start-work` port. Those are explicit non-goals.
+</Do_Not_Use_When>
+<Why_This_Exists>
+OMX already has `$plan`, `$ralplan`, and `$deep-interview`. Prometheus Strict exists for a narrower case: an explicit clean-room strict-planning lane with named clarification, critique, and synthesis roles, plus a durable `.omx/plans/prometheus-strict/` handoff contract. It is not a replacement for execution workflows.
+</Why_This_Exists>
+<Execution_Policy>
+- Stay planning-only. Do not edit source code during this skill unless the user starts a separate execution workflow afterward.
+- Preserve clean-room boundaries. Do not copy or imitate OMO wording, source, prompts, runtime behavior, or control flow.
+- Keep non-goals visible: No hook implementation. No Sisyphus/start-work port. No automatic external-production actions.
+- Ask high-leverage questions as a batched round when the answers materially change scope, safety, or validation. Reserve one-at-a-time questioning only for dependent question chains where the next question depends on the previous answer.
+- If a safe assumption is available, state it and continue.
+- Use repository reads when needed to make paths, tests, and handoff commands concrete.
+- During Metis planning, run pre-question research fan-out for every non-trivial intent unless the task is trivial, the cited spec is self-contained, or cached evidence already covers the same surface; use `explore` for repo facts and the exact cheap `gpt-5.4-mini` `researcher` lane for external docs / OSS references before asking the user. Prometheus Strict may fan out up to `2 explore + 4 researcher` agents per round so breadth comes from more citation-focused mini researchers while Metis/Momus/Oracle keep stronger judgment roles.
+- Recommend `$team` only when Oracle identifies independent, bounded, verifiable lanes.
+### Structured Question Surface
+Every Metis/Momus/Oracle question to the user MUST go through the surface-appropriate structured question path. Plain prose questioning is the last fallback, not the default.
+- In attached-tmux OMX runtime, use `omx question` as the OMX-owned structured question surface (this is the `AskUserQuestion` equivalent for Prometheus Strict). From attached-tmux Bash/tool paths, prefix the command with `OMX_QUESTION_RETURN_PANE=$TMUX_PANE` (or a concrete `%pane` value) so the leader-pane return target is preserved.
+- **Batch independent high-leverage questions into a single `questions[]` array call**: scope, constraints, non-goals, deliverables, safety bounds, and acceptance criteria are normally independent and MUST be batched into one structured form so the user answers them in a single panel. Reserve one-at-a-time only for dependent question chains where the next question depends on the previous answer.
+- Wait for the `omx question` JSON answer before checking the clearance rule, asking another round, or handing off; prefer `answers[]` / `answers[i].answer`, and use the legacy top-level `answer` only as a compatibility fallback. After every `answers[]` batch, run at least **two gap-fill passes** before another question or handoff: Pass 1 assimilates user answers into the checklist; Pass 2 re-scans repo context, prior turns, research fan-out evidence, and conservative defaults to absorb non-CRITICAL residual gaps.
+- Minimum two emitted question rounds: when Metis emits any user-facing question round, do not hand off after Round 1 unless hostility/`<turn_aborted>` or the round-5 cap forces exit; handoff is allowed only after Round 2 has been emitted and processed. Zero-question complete-checklist handoff remains valid when no questions were emitted.
+- Between-round planning must actively use evidence: after Round 1 answers and the two gap-fill passes, refresh or reuse `<research_fan_out>` explore/researcher evidence, re-run spec prefill, and build Round 2 from residual CRITICAL gaps only.
+- Outside tmux, use the native structured input tool when one is available.
+- When neither structured surface can render (non-tmux Codex CLI, piped runs, CI), list the round's independent questions as a numbered prose block (`Q1: ... Q2: ... Q3: ...`) and wait for all answers in one user turn; do not split into separate round-trips.
+- Multiple interview rounds ARE expected when clearance is not yet reached; each round is one batched form (or its prose fallback), never split across forms.
+### Checklist Clearance
+The interview is governed by deterministic checklist clearance, not by subjective "feels enough" judgement. Exit the Metis interview loop when the 6-item checklist is fully YES: objective / scope IN+OUT / acceptance / test strategy / handoff target / no outstanding CRITICAL. Each item is evaluated with the tri-state defined in `<Turn_Termination_Rules>`.
+Cap interview rounds at **5** to prevent runaway. If checklist clearance is not reached by round 5, hand the remaining UNKNOWN items to Oracle as explicitly carried-forward `<unresolved_blocker>` entries.
+**Hostility / non-answer exit**: if the user's responses for a round contain refusal signals (1-2 character non-answers, dismissive `알아서` / "you decide" / "whatever" patterns, profanity-laden responses, or a `<turn_aborted>` on the prior turn), the round invalidates the answers — it does NOT advance any checklist item to YES, exits the interview loop immediately, and routes the unresolved gaps either to `<silent_absorption>` (for dismissive delegation) or back to the user via `hostility_exit` (for anger / aborted turns). See `prometheus-strict-metis` `<hostility_detection>` for the full pattern list and routing rules.
+</Execution_Policy>
+<Turn_Termination_Rules>
+Every Prometheus Strict turn ends with EXACTLY ONE of the following terminations. Bare summaries and "I think we're done" are forbidden.
+The 6-item checklist is: objective / scope IN+OUT / acceptance / test strategy / handoff target / no outstanding CRITICAL. A checklist item is YES when it is USER_ANSWERED ∪ ABSORBED_WITH_CITATION ∪ INFERRED_FROM_SPEC. Only UNKNOWN (no answer, no citation, no spec inference) counts as NO.
+- (a) `omx question` batch: use when at least one CRITICAL question survives `<gap_triage>` and `<self_review>`. The batch is the round; the turn waits for `answers[]` before continuing.
+- (b) explicit handoff: use when the 6-item checklist is fully YES. Hand off Metis → Momus after clearance, Momus → Oracle after critique, and Oracle → user or `<unresolved_blocker>` carry-forward after Pass 2 synthesis.
+- (c) stop-blocker: use when hostility/`<turn_aborted>` is detected via `<hostility_detection>` with subtype `hostility_exit`, or when the next action is destructive, credential-gated, external-production, and cannot be defaulted safely.
+Edge cases:
+1. Zero-questions-but-complete-checklist → option (b) explicit handoff. Do not emit an empty `omx question` form.
+2. Round-5-cap with incomplete checklist → option (a) emit one more question batch with surviving UNKNOWN items annotated, OR option (b) handoff with UNKNOWN items carried forward to Oracle as `<unresolved_blocker>` entries.
+3. Hostility/`<turn_aborted>` → option (c) for anger, profanity, or aborted-turn via `hostility_exit`; option (b) for dismissive-delegation (`알아서` / "you decide") with absorbed gaps annotated.
+</Turn_Termination_Rules>
+<Steps>
+### 1. Intake and Safety Bounds
+Restate the target result, known constraints, deliverables, validation expectations, and stop condition. Identify whether this turn is planning-only or whether the user also requested downstream execution.
+If the prompt contains destructive, credential-gated, external-production, or materially scope-changing decisions, hold those decisions for explicit user confirmation. Otherwise, continue through the planning loop.
+### 2. Metis Interview (Iterative, Checklist Clearance)
+Use `prometheus-strict-metis` as the interview voice. When native subagents are available, invoke the dedicated agent; otherwise run the same role in-context without editing files.
+Metis discovers success criteria, non-goals, evidence versus assumptions, required artifacts, likely execution lanes, and missing decisions. Before the first user-facing question batch, Metis must actively fan out repo/external research per intent: `explore` maps local surfaces and exact `gpt-5.4-mini` `researcher` lanes gather official/upstream or OSS-reference evidence. Research-heavy intents use more cheap researchers rather than downgrading Metis/Momus/Oracle judgment.
+Run the interview as a bounded loop:
+1. Identify every currently-UNKNOWN checklist item and every CRITICAL question whose answers would materially change scope, safety, or validation.
+2. Batch the round's independent questions into a single Structured Question Surface call (`questions[]` array, or numbered prose fallback outside tmux).
+3. Collect the structured `answers[]`, then run **Gap-fill Pass 1 — answer assimilation**: update evidence vs. assumption and mark checklist items YES only when USER_ANSWERED, ABSORBED_WITH_CITATION, or INFERRED_FROM_SPEC.
+4. Run **Gap-fill Pass 2 — residual adversarial scan**: re-check every remaining UNKNOWN against repo context, prior turns, research fan-out evidence, framework/industry defaults, and conservative reversible defaults; absorb non-CRITICAL gaps with citations/assumptions and leave only CRITICAL blockers.
+5. Run **between-round planning** after Round 1: refresh or reuse `<research_fan_out>` explore/researcher evidence, re-run spec prefill, and prepare Round 2 from residual CRITICAL gaps only.
+6. Evaluate the 6-item checklist (`<Turn_Termination_Rules>` tri-state) only after BOTH gap-fill passes and the minimum two emitted question rounds gate; exit when ALL YES and either no questions were emitted or Round 2 has been emitted and processed.
+7. If checklist clearance is not reached, or only Round 1 has been processed, return to step 1 with the next round. Cap at 5 rounds; on cap, carry remaining UNKNOWN items forward to Oracle as explicit `<unresolved_blocker>` entries.
+### 3. Momus Challenge (Bounded Retry)
+Use `prometheus-strict-momus` as the adversarial critique voice. When native subagents are available, invoke the dedicated agent; otherwise run the same role in-context without editing files.
+Momus challenges underspecified acceptance criteria, unsafe assumptions, hidden destructive steps, overbroad scope, missing verification, ownership conflicts, and `$ultragoal`/`$team` handoff ambiguity.
+**Bounded retry contract**: after Oracle synthesizes in §4, re-invoke Momus on the synthesized plan to verify that Oracle's resolutions did not introduce new risks (scope addition without matching verification, lane split that creates dependency cycles, safety reinforcement that contradicts stop conditions). Repeat the Momus → Oracle re-synthesis cycle up to **3 times total**. If blocking objections remain after the 3rd cycle, mark them as carried-forward in the final plan and proceed to §5.
+### 4. Oracle Synthesis (Two-Pass: Synthesis + Self-Verification)
+Use `prometheus-strict-oracle` as the synthesis voice. When native subagents are available, invoke the dedicated agent; otherwise run the same role in-context without editing files.
+**Pass 1 — Synthesis.** Oracle produces the final objective, scope and non-goals, accepted assumptions, resolved critique, sequenced steps or lanes, verification matrix, rollback/escalation conditions, and recommended OMX handoff.
+**Pass 2 — Self-Verification (machine-checkable acceptance contract).** Oracle re-reads its own Pass 1 output and asserts:
+- Every claim in the verification matrix has an explicit evidence source (test/build/lint/e2e/doc).
+- Every step lists its owner / lane / executor; no shared-file conflicts between parallel lanes.
+- Stop, rollback, and acceptance criteria are mutually consistent (no acceptance criterion is satisfied by a state that also triggers rollback).
+- No destructive, credential-gated, or external-production step is unauthorized.
+- The handoff command is concrete (callable verbatim) and points at an existing workflow (`$ultragoal`, `$team`, or `none`).
+- Clean-room credit is preserved.
+If any Pass 2 check fails, Oracle MUST loop back to Pass 1 to repair before emitting the plan. Cap Pass 1 ↔ Pass 2 cycles at **3**; on cycle 3 failure, emit the plan with the failing gates annotated as carried-forward and escalate to the user.
+### 5. Post-Plan Gap Check (Metis Re-Invocation)
+Before handing off, re-invoke `prometheus-strict-metis` on the finalized Oracle plan with a single charge: identify ambiguities that surfaced **only after** the plan was rendered — for example, new lane assignments that overlap, verification matrix gaps revealed by stop conditions, acceptance criteria that contradict the rollback contract.
+If post-plan Metis surfaces any blocking gap, return to §4 Pass 1 with the new question. Otherwise proceed to §6.
+### 6. Handoff
+Prometheus Strict stops with a plan unless the user explicitly invokes or authorizes the next workflow. Prefer this sequence:
+```text
+$ultragoal "<Oracle plan summary or .omx/plans/prometheus-strict/<slug>.md>"
+$team <N>:executor "execute the approved Ultragoal story in parallel lanes"  # only when warranted
+```
+</Steps>
+<Tool_Usage>
+- Use read-only repository inspection to verify referenced files, commands, and existing conventions.
+- Treat Metis research fan-out as part of planning, not execution: dispatch `explore` / exact `gpt-5.4-mini` `researcher` evidence-gathering before question generation for non-trivial intents, then re-prefill and ask only surviving CRITICAL gaps.
+- Use `prometheus-strict-metis`, `prometheus-strict-momus`, and `prometheus-strict-oracle` sequentially; do not fan out implementation work from this skill.
+- Use `$ultragoal` only as the recommended execution handoff after the plan is ready.
+- Use `$team` only when parallel lanes are independent and verifiable.
+</Tool_Usage>
+## State Management
+Prometheus Strict does not own a long-running runtime loop. If a durable planning artifact is needed, write the final plan to `.omx/plans/prometheus-strict/<slug>.md`. Draft-only or inline plans may set the artifact path to `N/A - inline plan only`.
+Do not create hook state, Sisyphus state, or `start-work` compatibility state for this skill.
+<Final_Checklist>
+- [ ] Target result is explicit.
+- [ ] Scope and non-goals are explicit.
+- [ ] Acceptance criteria are measurable.
+- [ ] Metis interview loop reached checklist clearance only after the mandatory two gap-fill passes following every `answers[]` batch and, if any question round was emitted, after the minimum two emitted question rounds gate; otherwise the 5-round cap was reached with UNKNOWN items carried forward as `<unresolved_blocker>` entries.
+- [ ] Momus objections are resolved or carried forward as explicit blockers, with at most 3 Momus → Oracle re-synthesis cycles consumed.
+- [ ] Oracle plan includes a verification matrix.
+- [ ] Oracle Pass 2 self-verification completed; every machine-checkable contract item passes or is annotated as carried-forward.
+- [ ] Post-plan Metis gap check produced no blocking objections (or all are carried forward).
+- [ ] Handoff recommends `$ultragoal` and `$team` only when warranted.
+- [ ] Clean-room credit is preserved.
+- [ ] No hook implementation or Sisyphus/start-work port was introduced.
+</Final_Checklist>
+<Advanced>
+## Output Contract
+If writing a durable plan file, store this markdown at `.omx/plans/prometheus-strict/<slug>.md` and reference that path in the handoff.
+```markdown
+## Prometheus Strict Plan
+### Target Result
+- <one-sentence objective>
+### Clarified Requirements (Metis)
+- <requirement / acceptance criterion>
+### Critique Resolved (Momus)
+- <risk or objection> -> <resolution>
+### Oracle Execution Plan
+1. <sequenced step or lane>
+### Verification Matrix
+| Claim | Required evidence | Owner/lane |
+| --- | --- | --- |
+| <claim> | <test/build/lint/e2e/doc evidence> | <owner> |
+### Artifact
+- Durable plan path: `.omx/plans/prometheus-strict/<slug>.md` or `N/A - inline plan only`
+### Handoff
+- Recommended next workflow: <$ultragoal / $team / direct execution / none>
+- Stop condition: <what proves the plan is ready or why it is blocked>
+### Clean-Room Credit
+Inspired by OMO Prometheus (`code-yeongyu/oh-my-openagent`), reimplemented from concept under MIT.
+```
+## Failure and Escalation
+Escalate instead of planning when a necessary answer cannot be inferred safely, the next step is destructive or credential-gated, required repository context is unavailable, or the user asks for behavior outside the non-goals.
+</Advanced>
+Original task:
+{{PROMPT}}

package/skills/ralplan/SKILL.md CHANGED Viewed

@@ -60,6 +60,19 @@ The consensus workflow:
 > **Important:** Steps 3 and 4 MUST run sequentially. Do NOT issue both agent calls in the same parallel batch. Always await the Architect result before invoking Critic.
+## Durable Consensus Handoff Contract
+Ralplan is not complete, skippable, or ready for execution merely because `.omx/plans/prd-*.md` and `.omx/plans/test-spec-*.md` exist. Those files are planning artifacts, not consensus evidence.
+Before any Autopilot, Pipeline, Ultragoal, Team, Ralph, or implementation handoff, persist a durable handoff record that distinguishes:
+- `planning_artifacts`: PRD/test-spec paths.
+- `ralplan_architect_review`: the completed Architect review with an approving verdict.
+- `ralplan_critic_review`: the completed Critic review with an approving verdict, recorded only after the Architect review.
+- `ralplan_consensus_gate.complete:true` only when both reviews are present, approving, and in the required Architect→Critic order.
+If Architect is missing/blocked, keep the workflow in Architect review or report that blocker. If Critic is missing/blocked/non-approving, keep the workflow in Critic/re-review or report the max-iteration outcome. Do not treat existing plan/test-spec files as permission to skip ralplan or start execution.
 Follow the Plan skill's full documentation for consensus mode details.
 ## Goal-Mode Follow-up Suggestions
@@ -87,6 +100,7 @@ Before consensus planning or execution handoff, ensure a grounded context snapsh
    - likely codebase touchpoints
 4. If ambiguity remains high, gather brownfield facts first. When session guidance enables `USE_OMX_EXPLORE_CMD`, prefer `omx explore` for simple read-only repository lookups with narrow, concrete prompts; otherwise use the richer normal explore path. Then run `$deep-interview --quick <task>` before continuing.
 5. If the plan depends on official docs, version-aware framework guidance, best practices, or external dependency behavior, use `$best-practice-research` as the bounded evidence wrapper and auto-delegate `researcher` for the official/upstream lookup before finalizing the planning handoff so execution does not start from repo-local recall alone.
+6. If a prior `$autoresearch` or `$autoresearch-goal` run exists, treat its approved artifact as evidence for the plan. Do not include Autoresearch as a final architecture or runtime component unless the user explicitly requested ongoing research automation; otherwise synthesize the evidence into the `$ralplan` ADR, risks, and verification steps.
 Do not hand off to execution modes until this intake is complete; if urgency forces progress, explicitly document the risk tradeoffs.
@@ -150,9 +164,10 @@ The gate auto-passes when it detects **any** concrete signal. You do not need al
    - **Architect** reviews for soundness
    - **Critic** validates quality and testability
 5. On consensus approval, user chooses execution path:
-   - **ralph**: sequential execution with verification
-   - **team**: parallel coordinated agents
-6. Execution begins with a clear, bounded plan
+   - **ultragoal**: default durable follow-up for sequential goal execution with ledger checkpoints
+   - **team**: coordinated parallel execution for stories that need multiple lanes, with evidence ready for Ultragoal checkpoints
+   - **ralph**: explicit single-owner fallback only when the user intentionally wants a persistent verification/completion loop instead of the default durable goal ledger
+6. Execution begins with a clear, bounded plan through the selected handoff path
 ### Troubleshooting