npm - oh-my-codex - Versions diffs - 0.18.0 → 0.18.2 - Mend

oh-my-codex 0.18.0 → 0.18.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (410) hide show

package/Cargo.lock +6 -6
package/Cargo.toml +1 -1
package/README.md +45 -19
package/crates/omx-api/src/lib.rs +66 -9
package/crates/omx-sparkshell/src/exec.rs +125 -3
package/crates/omx-sparkshell/src/main.rs +126 -36
package/crates/omx-sparkshell/tests/execution.rs +225 -1
package/dist/agents/__tests__/definitions.test.js +14 -0
package/dist/agents/__tests__/definitions.test.js.map +1 -1
package/dist/agents/__tests__/native-config.test.js +19 -0
package/dist/agents/__tests__/native-config.test.js.map +1 -1
package/dist/agents/definitions.d.ts.map +1 -1
package/dist/agents/definitions.js +30 -0
package/dist/agents/definitions.js.map +1 -1
package/dist/agents/native-config.d.ts +1 -0
package/dist/agents/native-config.d.ts.map +1 -1
package/dist/agents/native-config.js +4 -0
package/dist/agents/native-config.js.map +1 -1
package/dist/catalog/__tests__/generator.test.js +4 -0
package/dist/catalog/__tests__/generator.test.js.map +1 -1
package/dist/cli/__tests__/codex-plugin-layout.test.js +15 -7
package/dist/cli/__tests__/codex-plugin-layout.test.js.map +1 -1
package/dist/cli/__tests__/doctor-warning-copy.test.js +137 -8
package/dist/cli/__tests__/doctor-warning-copy.test.js.map +1 -1
package/dist/cli/__tests__/index.test.js +203 -15
package/dist/cli/__tests__/index.test.js.map +1 -1
package/dist/cli/__tests__/install-docs-contract.test.d.ts +2 -0
package/dist/cli/__tests__/install-docs-contract.test.d.ts.map +1 -0
package/dist/cli/__tests__/install-docs-contract.test.js +55 -0
package/dist/cli/__tests__/install-docs-contract.test.js.map +1 -0
package/dist/cli/__tests__/launch-fallback.test.js +163 -0
package/dist/cli/__tests__/launch-fallback.test.js.map +1 -1
package/dist/cli/__tests__/question.test.js +29 -43
package/dist/cli/__tests__/question.test.js.map +1 -1
package/dist/cli/__tests__/setup-install-mode.test.js +94 -35
package/dist/cli/__tests__/setup-install-mode.test.js.map +1 -1
package/dist/cli/__tests__/sparkshell-cli.test.js +20 -1
package/dist/cli/__tests__/sparkshell-cli.test.js.map +1 -1
package/dist/cli/__tests__/sparkshell-packaging.test.js +1 -0
package/dist/cli/__tests__/sparkshell-packaging.test.js.map +1 -1
package/dist/cli/__tests__/ultragoal.test.js +227 -4
package/dist/cli/__tests__/ultragoal.test.js.map +1 -1
package/dist/cli/__tests__/update.test.js +72 -1
package/dist/cli/__tests__/update.test.js.map +1 -1
package/dist/cli/codex-feature-probe.d.ts +5 -0
package/dist/cli/codex-feature-probe.d.ts.map +1 -1
package/dist/cli/codex-feature-probe.js +13 -7
package/dist/cli/codex-feature-probe.js.map +1 -1
package/dist/cli/doctor.d.ts +7 -0
package/dist/cli/doctor.d.ts.map +1 -1
package/dist/cli/doctor.js +297 -17
package/dist/cli/doctor.js.map +1 -1
package/dist/cli/index.d.ts +9 -1
package/dist/cli/index.d.ts.map +1 -1
package/dist/cli/index.js +465 -110
package/dist/cli/index.js.map +1 -1
package/dist/cli/plugin-marketplace.d.ts +2 -0
package/dist/cli/plugin-marketplace.d.ts.map +1 -1
package/dist/cli/plugin-marketplace.js +15 -1
package/dist/cli/plugin-marketplace.js.map +1 -1
package/dist/cli/setup.d.ts.map +1 -1
package/dist/cli/setup.js +71 -11
package/dist/cli/setup.js.map +1 -1
package/dist/cli/sparkshell.d.ts +7 -1
package/dist/cli/sparkshell.d.ts.map +1 -1
package/dist/cli/sparkshell.js +13 -3
package/dist/cli/sparkshell.js.map +1 -1
package/dist/cli/ultragoal.d.ts +1 -1
package/dist/cli/ultragoal.d.ts.map +1 -1
package/dist/cli/ultragoal.js +184 -10
package/dist/cli/ultragoal.js.map +1 -1
package/dist/cli/update.d.ts +2 -0
package/dist/cli/update.d.ts.map +1 -1
package/dist/cli/update.js +14 -3
package/dist/cli/update.js.map +1 -1
package/dist/compat/__tests__/doctor-contract.test.js +3 -0
package/dist/compat/__tests__/doctor-contract.test.js.map +1 -1
package/dist/config/__tests__/codex-feature-flags.test.js +11 -1
package/dist/config/__tests__/codex-feature-flags.test.js.map +1 -1
package/dist/config/__tests__/codex-hooks.test.js +22 -11
package/dist/config/__tests__/codex-hooks.test.js.map +1 -1
package/dist/config/__tests__/commit-lore-guard.test.d.ts +2 -0
package/dist/config/__tests__/commit-lore-guard.test.d.ts.map +1 -0
package/dist/config/__tests__/commit-lore-guard.test.js +20 -0
package/dist/config/__tests__/commit-lore-guard.test.js.map +1 -0
package/dist/config/codex-feature-flags.d.ts +4 -0
package/dist/config/codex-feature-flags.d.ts.map +1 -1
package/dist/config/codex-feature-flags.js +4 -0
package/dist/config/codex-feature-flags.js.map +1 -1
package/dist/config/codex-hooks.d.ts +1 -0
package/dist/config/codex-hooks.d.ts.map +1 -1
package/dist/config/codex-hooks.js +8 -10
package/dist/config/codex-hooks.js.map +1 -1
package/dist/config/commit-lore-guard.d.ts +1 -0
package/dist/config/commit-lore-guard.d.ts.map +1 -1
package/dist/config/commit-lore-guard.js +29 -3
package/dist/config/commit-lore-guard.js.map +1 -1
package/dist/config/generator.d.ts +17 -1
package/dist/config/generator.d.ts.map +1 -1
package/dist/config/generator.js +124 -11
package/dist/config/generator.js.map +1 -1
package/dist/goal-workflows/__tests__/codex-goal-snapshot.test.js +21 -0
package/dist/goal-workflows/__tests__/codex-goal-snapshot.test.js.map +1 -1
package/dist/goal-workflows/codex-goal-snapshot.d.ts +4 -0
package/dist/goal-workflows/codex-goal-snapshot.d.ts.map +1 -1
package/dist/goal-workflows/codex-goal-snapshot.js +50 -3
package/dist/goal-workflows/codex-goal-snapshot.js.map +1 -1
package/dist/hooks/__tests__/autopilot-skill-contract.test.js +27 -6
package/dist/hooks/__tests__/autopilot-skill-contract.test.js.map +1 -1
package/dist/hooks/__tests__/consensus-execution-handoff.test.d.ts +1 -1
package/dist/hooks/__tests__/consensus-execution-handoff.test.js +13 -11
package/dist/hooks/__tests__/consensus-execution-handoff.test.js.map +1 -1
package/dist/hooks/__tests__/deep-interview-contract.test.js +4 -3
package/dist/hooks/__tests__/deep-interview-contract.test.js.map +1 -1
package/dist/hooks/__tests__/keyword-detector.test.js +173 -17
package/dist/hooks/__tests__/keyword-detector.test.js.map +1 -1
package/dist/hooks/__tests__/notify-hook-team-tmux-guard.test.js +33 -0
package/dist/hooks/__tests__/notify-hook-team-tmux-guard.test.js.map +1 -1
package/dist/hooks/__tests__/prometheus-strict-contract.test.d.ts +2 -0
package/dist/hooks/__tests__/prometheus-strict-contract.test.d.ts.map +1 -0
package/dist/hooks/__tests__/prometheus-strict-contract.test.js +320 -0
package/dist/hooks/__tests__/prometheus-strict-contract.test.js.map +1 -0
package/dist/hooks/__tests__/prompt-guidance-wave-two.test.js +12 -0
package/dist/hooks/__tests__/prompt-guidance-wave-two.test.js.map +1 -1
package/dist/hooks/__tests__/research-workflow-boundaries.test.d.ts +2 -0
package/dist/hooks/__tests__/research-workflow-boundaries.test.d.ts.map +1 -0
package/dist/hooks/__tests__/research-workflow-boundaries.test.js +35 -0
package/dist/hooks/__tests__/research-workflow-boundaries.test.js.map +1 -0
package/dist/hooks/extensibility/__tests__/dispatcher.test.js +26 -3
package/dist/hooks/extensibility/__tests__/dispatcher.test.js.map +1 -1
package/dist/hooks/extensibility/dispatcher.d.ts.map +1 -1
package/dist/hooks/extensibility/dispatcher.js +29 -14
package/dist/hooks/extensibility/dispatcher.js.map +1 -1
package/dist/hooks/keyword-detector.d.ts +1 -1
package/dist/hooks/keyword-detector.d.ts.map +1 -1
package/dist/hooks/keyword-detector.js +36 -9
package/dist/hooks/keyword-detector.js.map +1 -1
package/dist/hooks/keyword-registry.d.ts.map +1 -1
package/dist/hooks/keyword-registry.js +1 -0
package/dist/hooks/keyword-registry.js.map +1 -1
package/dist/hooks/prompt-guidance-contract.d.ts.map +1 -1
package/dist/hooks/prompt-guidance-contract.js +14 -2
package/dist/hooks/prompt-guidance-contract.js.map +1 -1
package/dist/hud/__tests__/hud-tmux-injection.test.js +36 -8
package/dist/hud/__tests__/hud-tmux-injection.test.js.map +1 -1
package/dist/hud/__tests__/reconcile.test.js +122 -11
package/dist/hud/__tests__/reconcile.test.js.map +1 -1
package/dist/hud/__tests__/render.test.js +84 -0
package/dist/hud/__tests__/render.test.js.map +1 -1
package/dist/hud/__tests__/resource-leak-watch.test.d.ts +2 -0
package/dist/hud/__tests__/resource-leak-watch.test.d.ts.map +1 -0
package/dist/hud/__tests__/resource-leak-watch.test.js +28 -0
package/dist/hud/__tests__/resource-leak-watch.test.js.map +1 -0
package/dist/hud/__tests__/state.test.js +51 -1
package/dist/hud/__tests__/state.test.js.map +1 -1
package/dist/hud/__tests__/tmux.test.js +69 -23
package/dist/hud/__tests__/tmux.test.js.map +1 -1
package/dist/hud/index.d.ts +2 -2
package/dist/hud/index.d.ts.map +1 -1
package/dist/hud/index.js +17 -6
package/dist/hud/index.js.map +1 -1
package/dist/hud/reconcile.d.ts.map +1 -1
package/dist/hud/reconcile.js +6 -3
package/dist/hud/reconcile.js.map +1 -1
package/dist/hud/render.d.ts.map +1 -1
package/dist/hud/render.js +26 -0
package/dist/hud/render.js.map +1 -1
package/dist/hud/state.d.ts +2 -1
package/dist/hud/state.d.ts.map +1 -1
package/dist/hud/state.js +62 -1
package/dist/hud/state.js.map +1 -1
package/dist/hud/tmux.d.ts +10 -3
package/dist/hud/tmux.d.ts.map +1 -1
package/dist/hud/tmux.js +60 -11
package/dist/hud/tmux.js.map +1 -1
package/dist/hud/types.d.ts +22 -0
package/dist/hud/types.d.ts.map +1 -1
package/dist/hud/types.js.map +1 -1
package/dist/notifications/__tests__/http-client-resource.test.d.ts +2 -0
package/dist/notifications/__tests__/http-client-resource.test.d.ts.map +1 -0
package/dist/notifications/__tests__/http-client-resource.test.js +41 -0
package/dist/notifications/__tests__/http-client-resource.test.js.map +1 -0
package/dist/notifications/__tests__/verbosity.test.js +20 -0
package/dist/notifications/__tests__/verbosity.test.js.map +1 -1
package/dist/notifications/config.d.ts.map +1 -1
package/dist/notifications/config.js +6 -3
package/dist/notifications/config.js.map +1 -1
package/dist/notifications/http-client.d.ts.map +1 -1
package/dist/notifications/http-client.js +78 -27
package/dist/notifications/http-client.js.map +1 -1
package/dist/notifications/types.d.ts +2 -0
package/dist/notifications/types.d.ts.map +1 -1
package/dist/openclaw/__tests__/dispatcher.test.js +49 -1
package/dist/openclaw/__tests__/dispatcher.test.js.map +1 -1
package/dist/openclaw/dispatcher.d.ts +7 -4
package/dist/openclaw/dispatcher.d.ts.map +1 -1
package/dist/openclaw/dispatcher.js +32 -69
package/dist/openclaw/dispatcher.js.map +1 -1
package/dist/pipeline/__tests__/orchestrator.test.js +128 -4
package/dist/pipeline/__tests__/orchestrator.test.js.map +1 -1
package/dist/pipeline/__tests__/stages.test.js +460 -9
package/dist/pipeline/__tests__/stages.test.js.map +1 -1
package/dist/pipeline/index.d.ts +8 -2
package/dist/pipeline/index.d.ts.map +1 -1
package/dist/pipeline/index.js +5 -2
package/dist/pipeline/index.js.map +1 -1
package/dist/pipeline/orchestrator.d.ts +5 -4
package/dist/pipeline/orchestrator.d.ts.map +1 -1
package/dist/pipeline/orchestrator.js +85 -17
package/dist/pipeline/orchestrator.js.map +1 -1
package/dist/pipeline/stages/code-review.d.ts +2 -2
package/dist/pipeline/stages/code-review.d.ts.map +1 -1
package/dist/pipeline/stages/code-review.js +5 -3
package/dist/pipeline/stages/code-review.js.map +1 -1
package/dist/pipeline/stages/deep-interview.d.ts +15 -0
package/dist/pipeline/stages/deep-interview.d.ts.map +1 -0
package/dist/pipeline/stages/deep-interview.js +32 -0
package/dist/pipeline/stages/deep-interview.js.map +1 -0
package/dist/pipeline/stages/ralph-verify.d.ts +5 -5
package/dist/pipeline/stages/ralph-verify.d.ts.map +1 -1
package/dist/pipeline/stages/ralph-verify.js +2 -2
package/dist/pipeline/stages/ralph-verify.js.map +1 -1
package/dist/pipeline/stages/ralplan.d.ts.map +1 -1
package/dist/pipeline/stages/ralplan.js +41 -6
package/dist/pipeline/stages/ralplan.js.map +1 -1
package/dist/pipeline/stages/ultragoal.d.ts +19 -0
package/dist/pipeline/stages/ultragoal.d.ts.map +1 -0
package/dist/pipeline/stages/ultragoal.js +38 -0
package/dist/pipeline/stages/ultragoal.js.map +1 -0
package/dist/pipeline/stages/ultraqa.d.ts +30 -0
package/dist/pipeline/stages/ultraqa.d.ts.map +1 -0
package/dist/pipeline/stages/ultraqa.js +46 -0
package/dist/pipeline/stages/ultraqa.js.map +1 -0
package/dist/pipeline/types.d.ts +8 -6
package/dist/pipeline/types.d.ts.map +1 -1
package/dist/pipeline/types.js +2 -2
package/dist/question/__tests__/ui.test.js +43 -10
package/dist/question/__tests__/ui.test.js.map +1 -1
package/dist/question/ui.d.ts +12 -0
package/dist/question/ui.d.ts.map +1 -1
package/dist/question/ui.js +83 -46
package/dist/question/ui.js.map +1 -1
package/dist/ralplan/__tests__/runtime.test.js +200 -10
package/dist/ralplan/__tests__/runtime.test.js.map +1 -1
package/dist/ralplan/consensus-gate.d.ts +23 -0
package/dist/ralplan/consensus-gate.d.ts.map +1 -0
package/dist/ralplan/consensus-gate.js +212 -0
package/dist/ralplan/consensus-gate.js.map +1 -0
package/dist/ralplan/runtime.d.ts +25 -0
package/dist/ralplan/runtime.d.ts.map +1 -1
package/dist/ralplan/runtime.js +144 -8
package/dist/ralplan/runtime.js.map +1 -1
package/dist/scripts/__tests__/codex-native-hook.test.js +1358 -79
package/dist/scripts/__tests__/codex-native-hook.test.js.map +1 -1
package/dist/scripts/__tests__/docs-site-contract.test.d.ts +2 -0
package/dist/scripts/__tests__/docs-site-contract.test.d.ts.map +1 -0
package/dist/scripts/__tests__/docs-site-contract.test.js +42 -0
package/dist/scripts/__tests__/docs-site-contract.test.js.map +1 -0
package/dist/scripts/__tests__/notify-dispatcher.test.js +115 -2
package/dist/scripts/__tests__/notify-dispatcher.test.js.map +1 -1
package/dist/scripts/__tests__/run-test-files.test.js +57 -0
package/dist/scripts/__tests__/run-test-files.test.js.map +1 -1
package/dist/scripts/__tests__/smoke-packed-install.test.js +23 -1
package/dist/scripts/__tests__/smoke-packed-install.test.js.map +1 -1
package/dist/scripts/__tests__/verify-native-agents.test.js +18 -3
package/dist/scripts/__tests__/verify-native-agents.test.js.map +1 -1
package/dist/scripts/cleanup-explore-harness.js +1 -0
package/dist/scripts/cleanup-explore-harness.js.map +1 -1
package/dist/scripts/codex-native-hook.d.ts.map +1 -1
package/dist/scripts/codex-native-hook.js +372 -44
package/dist/scripts/codex-native-hook.js.map +1 -1
package/dist/scripts/codex-native-pre-post.d.ts.map +1 -1
package/dist/scripts/codex-native-pre-post.js +9 -1
package/dist/scripts/codex-native-pre-post.js.map +1 -1
package/dist/scripts/notify-dispatcher.js +188 -4
package/dist/scripts/notify-dispatcher.js.map +1 -1
package/dist/scripts/notify-hook/process-runner.d.ts.map +1 -1
package/dist/scripts/notify-hook/process-runner.js +39 -17
package/dist/scripts/notify-hook/process-runner.js.map +1 -1
package/dist/scripts/notify-hook/team-dispatch.d.ts.map +1 -1
package/dist/scripts/notify-hook/team-dispatch.js +9 -5
package/dist/scripts/notify-hook/team-dispatch.js.map +1 -1
package/dist/scripts/notify-hook/team-tmux-guard.d.ts +1 -1
package/dist/scripts/notify-hook/team-tmux-guard.d.ts.map +1 -1
package/dist/scripts/notify-hook/team-tmux-guard.js +7 -1
package/dist/scripts/notify-hook/team-tmux-guard.js.map +1 -1
package/dist/scripts/run-test-files.js +13 -0
package/dist/scripts/run-test-files.js.map +1 -1
package/dist/scripts/smoke-packed-install.d.ts +3 -0
package/dist/scripts/smoke-packed-install.d.ts.map +1 -1
package/dist/scripts/smoke-packed-install.js +99 -1
package/dist/scripts/smoke-packed-install.js.map +1 -1
package/dist/scripts/sync-plugin-mirror.js +2 -2
package/dist/scripts/sync-plugin-mirror.js.map +1 -1
package/dist/scripts/verify-native-agents.js +2 -2
package/dist/scripts/verify-native-agents.js.map +1 -1
package/dist/sidecar/__tests__/resource-leak-watch.test.d.ts +2 -0
package/dist/sidecar/__tests__/resource-leak-watch.test.d.ts.map +1 -0
package/dist/sidecar/__tests__/resource-leak-watch.test.js +38 -0
package/dist/sidecar/__tests__/resource-leak-watch.test.js.map +1 -0
package/dist/sidecar/index.d.ts +1 -1
package/dist/sidecar/index.d.ts.map +1 -1
package/dist/sidecar/index.js +29 -12
package/dist/sidecar/index.js.map +1 -1
package/dist/state/__tests__/operations-ralph-phase.test.js +88 -1
package/dist/state/__tests__/operations-ralph-phase.test.js.map +1 -1
package/dist/state/__tests__/workflow-transition.test.js +6 -0
package/dist/state/__tests__/workflow-transition.test.js.map +1 -1
package/dist/state/operations.d.ts.map +1 -1
package/dist/state/operations.js +11 -0
package/dist/state/operations.js.map +1 -1
package/dist/state/workflow-transition.d.ts +1 -1
package/dist/state/workflow-transition.d.ts.map +1 -1
package/dist/state/workflow-transition.js +7 -0
package/dist/state/workflow-transition.js.map +1 -1
package/dist/subagents/tracker.d.ts.map +1 -1
package/dist/subagents/tracker.js +4 -3
package/dist/subagents/tracker.js.map +1 -1
package/dist/team/__tests__/runtime.test.js +36 -44
package/dist/team/__tests__/runtime.test.js.map +1 -1
package/dist/team/__tests__/tmux-session.test.js +163 -15
package/dist/team/__tests__/tmux-session.test.js.map +1 -1
package/dist/team/runtime.d.ts.map +1 -1
package/dist/team/runtime.js +10 -20
package/dist/team/runtime.js.map +1 -1
package/dist/team/tmux-session.d.ts.map +1 -1
package/dist/team/tmux-session.js +51 -21
package/dist/team/tmux-session.js.map +1 -1
package/dist/ultragoal/__tests__/artifacts.test.js +764 -10
package/dist/ultragoal/__tests__/artifacts.test.js.map +1 -1
package/dist/ultragoal/__tests__/docs-contract.test.js +57 -1
package/dist/ultragoal/__tests__/docs-contract.test.js.map +1 -1
package/dist/ultragoal/__tests__/steering-fixtures.d.ts +68 -0
package/dist/ultragoal/__tests__/steering-fixtures.d.ts.map +1 -0
package/dist/ultragoal/__tests__/steering-fixtures.js +259 -0
package/dist/ultragoal/__tests__/steering-fixtures.js.map +1 -0
package/dist/ultragoal/__tests__/steering-fixtures.test.d.ts +2 -0
package/dist/ultragoal/__tests__/steering-fixtures.test.d.ts.map +1 -0
package/dist/ultragoal/__tests__/steering-fixtures.test.js +65 -0
package/dist/ultragoal/__tests__/steering-fixtures.test.js.map +1 -0
package/dist/ultragoal/artifacts.d.ts +97 -2
package/dist/ultragoal/artifacts.d.ts.map +1 -1
package/dist/ultragoal/artifacts.js +837 -256
package/dist/ultragoal/artifacts.js.map +1 -1
package/dist/utils/__tests__/sleep-resource.test.d.ts +2 -0
package/dist/utils/__tests__/sleep-resource.test.d.ts.map +1 -0
package/dist/utils/__tests__/sleep-resource.test.js +39 -0
package/dist/utils/__tests__/sleep-resource.test.js.map +1 -0
package/dist/utils/sleep.d.ts.map +1 -1
package/dist/utils/sleep.js +17 -6
package/dist/utils/sleep.js.map +1 -1
package/package.json +2 -1
package/plugins/oh-my-codex/.codex-plugin/plugin.json +4 -3
package/plugins/oh-my-codex/hooks/codex-native-hook.mjs +56 -0
package/plugins/oh-my-codex/hooks/hooks.json +77 -0
package/plugins/oh-my-codex/skills/autopilot/SKILL.md +92 -50
package/plugins/oh-my-codex/skills/autoresearch/SKILL.md +4 -0
package/plugins/oh-my-codex/skills/autoresearch-goal/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/best-practice-research/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/cancel/SKILL.md +2 -2
package/plugins/oh-my-codex/skills/deep-interview/SKILL.md +8 -8
package/plugins/oh-my-codex/skills/omx-setup/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/pipeline/SKILL.md +23 -12
package/plugins/oh-my-codex/skills/plan/SKILL.md +8 -8
package/plugins/oh-my-codex/skills/prometheus-strict/README.md +35 -0
package/plugins/oh-my-codex/skills/prometheus-strict/SKILL.md +219 -0
package/plugins/oh-my-codex/skills/ralph/SKILL.md +7 -0
package/plugins/oh-my-codex/skills/ralplan/SKILL.md +22 -7
package/plugins/oh-my-codex/skills/team/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/ultragoal/SKILL.md +38 -4
package/plugins/oh-my-codex/skills/ultrawork/SKILL.md +1 -1
package/prompts/planner.md +1 -1
package/prompts/prometheus-strict-metis.md +274 -0
package/prompts/prometheus-strict-momus.md +82 -0
package/prompts/prometheus-strict-oracle.md +107 -0
package/prompts/researcher.md +22 -3
package/skills/autopilot/SKILL.md +92 -50
package/skills/autoresearch/SKILL.md +4 -0
package/skills/autoresearch-goal/SKILL.md +1 -1
package/skills/best-practice-research/SKILL.md +1 -1
package/skills/cancel/SKILL.md +2 -2
package/skills/deep-interview/SKILL.md +8 -8
package/skills/omx-setup/SKILL.md +1 -1
package/skills/pipeline/SKILL.md +23 -12
package/skills/plan/SKILL.md +8 -8
package/skills/prometheus-strict/README.md +35 -0
package/skills/prometheus-strict/SKILL.md +219 -0
package/skills/ralph/SKILL.md +7 -0
package/skills/ralplan/SKILL.md +22 -7
package/skills/team/SKILL.md +1 -1
package/skills/ultragoal/SKILL.md +38 -4
package/skills/ultrawork/SKILL.md +1 -1
package/src/scripts/__tests__/codex-native-hook.test.ts +1757 -210
package/src/scripts/__tests__/docs-site-contract.test.ts +47 -0
package/src/scripts/__tests__/notify-dispatcher.test.ts +132 -3
package/src/scripts/__tests__/run-test-files.test.ts +67 -0
package/src/scripts/__tests__/smoke-packed-install.test.ts +31 -0
package/src/scripts/__tests__/verify-native-agents.test.ts +23 -3
package/src/scripts/cleanup-explore-harness.ts +1 -0
package/src/scripts/codex-native-hook.ts +393 -40
package/src/scripts/codex-native-pre-post.ts +16 -1
package/src/scripts/notify-dispatcher.ts +202 -4
package/src/scripts/notify-hook/process-runner.ts +40 -16
package/src/scripts/notify-hook/team-dispatch.ts +9 -5
package/src/scripts/notify-hook/team-tmux-guard.ts +7 -0
package/src/scripts/run-test-files.ts +13 -0
package/src/scripts/smoke-packed-install.ts +105 -0
package/src/scripts/sync-plugin-mirror.ts +3 -3
package/src/scripts/verify-native-agents.ts +2 -2
package/templates/catalog-manifest.json +22 -0

package/plugins/oh-my-codex/skills/prometheus-strict/SKILL.md ADDED Viewed

@@ -0,0 +1,219 @@
+---
+name: prometheus-strict
+description: "[OMX] Clean-room interview-driven planner: Metis clarifies, Momus challenges, Oracle synthesizes, then hands off to $ultragoal/$team."
+argument-hint: "<goal or problem statement>"
+---
+# Prometheus Strict
+Clean-room OMX planning workflow inspired by the high-level OMO Prometheus concept only. This skill does not copy implementation, prompts, wording, control flow, or runtime code from OMO. It reimplements the idea under this repository's MIT-licensed skill conventions.
+Credit: Inspired by OMO Prometheus (`code-yeongyu/oh-my-openagent`), reimplemented from concept under MIT.
+<Purpose>
+Prometheus Strict creates a rigorous plan before execution when ambiguity is still risky. It separates three planning voices: Metis clarifies requirements, Momus challenges assumptions and validation gaps, and Oracle synthesizes the handoff-ready OMX-native plan.
+The output is a planning-only artifact for `$ultragoal` and, when independent lanes are justified, `$team`. When a durable artifact is useful, store or request the final plan under `.omx/plans/prometheus-strict/`.
+</Purpose>
+<Use_When>
+- The task is important enough that a shallow plan could produce wrong work.
+- Requirements are partially known but acceptance criteria, boundaries, risks, or validation are incomplete.
+- The user wants a strict interview before execution.
+- A future `$ultragoal` story needs durable scope, tests, and handoff sequencing.
+- A team split may be needed, but the lanes are not yet safe to assign.
+</Use_When>
+<Do_Not_Use_When>
+- The user asks for immediate implementation of a clear, low-risk change; use the normal executor path.
+- The task is only a repository lookup or explanation; use `explore`/`analyze` as appropriate.
+- The user needs adversarial execution QA after code changes; use `$ultraqa`.
+- The user wants hook behavior, Sisyphus behavior, or a `start-work` port. Those are explicit non-goals.
+</Do_Not_Use_When>
+<Why_This_Exists>
+OMX already has `$plan`, `$ralplan`, and `$deep-interview`. Prometheus Strict exists for a narrower case: an explicit clean-room strict-planning lane with named clarification, critique, and synthesis roles, plus a durable `.omx/plans/prometheus-strict/` handoff contract. It is not a replacement for execution workflows.
+</Why_This_Exists>
+<Execution_Policy>
+- Stay planning-only. Do not edit source code during this skill unless the user starts a separate execution workflow afterward.
+- Preserve clean-room boundaries. Do not copy or imitate OMO wording, source, prompts, runtime behavior, or control flow.
+- Keep non-goals visible: No hook implementation. No Sisyphus/start-work port. No automatic external-production actions.
+- Ask high-leverage questions as a batched round when the answers materially change scope, safety, or validation. Reserve one-at-a-time questioning only for dependent question chains where the next question depends on the previous answer.
+- If a safe assumption is available, state it and continue.
+- Use repository reads when needed to make paths, tests, and handoff commands concrete.
+- During Metis planning, run pre-question research fan-out for every non-trivial intent unless the task is trivial, the cited spec is self-contained, or cached evidence already covers the same surface; use `explore` for repo facts and the exact cheap `gpt-5.4-mini` `researcher` lane for external docs / OSS references before asking the user. Prometheus Strict may fan out up to `2 explore + 4 researcher` agents per round so breadth comes from more citation-focused mini researchers while Metis/Momus/Oracle keep stronger judgment roles.
+- Recommend `$team` only when Oracle identifies independent, bounded, verifiable lanes.
+### Structured Question Surface
+Every Metis/Momus/Oracle question to the user MUST go through the surface-appropriate structured question path. Plain prose questioning is the last fallback, not the default.
+- In attached-tmux OMX runtime, use `omx question` as the OMX-owned structured question surface (this is the `AskUserQuestion` equivalent for Prometheus Strict). From attached-tmux Bash/tool paths, prefix the command with `OMX_QUESTION_RETURN_PANE=$TMUX_PANE` (or a concrete `%pane` value) so the leader-pane return target is preserved.
+- **Batch independent high-leverage questions into a single `questions[]` array call**: scope, constraints, non-goals, deliverables, safety bounds, and acceptance criteria are normally independent and MUST be batched into one structured form so the user answers them in a single panel. Reserve one-at-a-time only for dependent question chains where the next question depends on the previous answer.
+- Wait for the `omx question` JSON answer before checking the clearance rule, asking another round, or handing off; prefer `answers[]` / `answers[i].answer`, and use the legacy top-level `answer` only as a compatibility fallback. After every `answers[]` batch, run at least **two gap-fill passes** before another question or handoff: Pass 1 assimilates user answers into the checklist; Pass 2 re-scans repo context, prior turns, research fan-out evidence, and conservative defaults to absorb non-CRITICAL residual gaps.
+- Minimum two emitted question rounds: when Metis emits any user-facing question round, do not hand off after Round 1 unless hostility/`<turn_aborted>` or the round-5 cap forces exit; handoff is allowed only after Round 2 has been emitted and processed. Zero-question complete-checklist handoff remains valid when no questions were emitted.
+- Between-round planning must actively use evidence: after Round 1 answers and the two gap-fill passes, refresh or reuse `<research_fan_out>` explore/researcher evidence, re-run spec prefill, and build Round 2 from residual CRITICAL gaps only.
+- Outside tmux, use the native structured input tool when one is available.
+- When neither structured surface can render (non-tmux Codex CLI, piped runs, CI), list the round's independent questions as a numbered prose block (`Q1: ... Q2: ... Q3: ...`) and wait for all answers in one user turn; do not split into separate round-trips.
+- Multiple interview rounds ARE expected when clearance is not yet reached; each round is one batched form (or its prose fallback), never split across forms.
+### Checklist Clearance
+The interview is governed by deterministic checklist clearance, not by subjective "feels enough" judgement. Exit the Metis interview loop when the 6-item checklist is fully YES: objective / scope IN+OUT / acceptance / test strategy / handoff target / no outstanding CRITICAL. Each item is evaluated with the tri-state defined in `<Turn_Termination_Rules>`.
+Cap interview rounds at **5** to prevent runaway. If checklist clearance is not reached by round 5, hand the remaining UNKNOWN items to Oracle as explicitly carried-forward `<unresolved_blocker>` entries.
+**Hostility / non-answer exit**: if the user's responses for a round contain refusal signals (1-2 character non-answers, dismissive `알아서` / "you decide" / "whatever" patterns, profanity-laden responses, or a `<turn_aborted>` on the prior turn), the round invalidates the answers — it does NOT advance any checklist item to YES, exits the interview loop immediately, and routes the unresolved gaps either to `<silent_absorption>` (for dismissive delegation) or back to the user via `hostility_exit` (for anger / aborted turns). See `prometheus-strict-metis` `<hostility_detection>` for the full pattern list and routing rules.
+</Execution_Policy>
+<Turn_Termination_Rules>
+Every Prometheus Strict turn ends with EXACTLY ONE of the following terminations. Bare summaries and "I think we're done" are forbidden.
+The 6-item checklist is: objective / scope IN+OUT / acceptance / test strategy / handoff target / no outstanding CRITICAL. A checklist item is YES when it is USER_ANSWERED ∪ ABSORBED_WITH_CITATION ∪ INFERRED_FROM_SPEC. Only UNKNOWN (no answer, no citation, no spec inference) counts as NO.
+- (a) `omx question` batch: use when at least one CRITICAL question survives `<gap_triage>` and `<self_review>`. The batch is the round; the turn waits for `answers[]` before continuing.
+- (b) explicit handoff: use when the 6-item checklist is fully YES. Hand off Metis → Momus after clearance, Momus → Oracle after critique, and Oracle → user or `<unresolved_blocker>` carry-forward after Pass 2 synthesis.
+- (c) stop-blocker: use when hostility/`<turn_aborted>` is detected via `<hostility_detection>` with subtype `hostility_exit`, or when the next action is destructive, credential-gated, external-production, and cannot be defaulted safely.
+Edge cases:
+1. Zero-questions-but-complete-checklist → option (b) explicit handoff. Do not emit an empty `omx question` form.
+2. Round-5-cap with incomplete checklist → option (a) emit one more question batch with surviving UNKNOWN items annotated, OR option (b) handoff with UNKNOWN items carried forward to Oracle as `<unresolved_blocker>` entries.
+3. Hostility/`<turn_aborted>` → option (c) for anger, profanity, or aborted-turn via `hostility_exit`; option (b) for dismissive-delegation (`알아서` / "you decide") with absorbed gaps annotated.
+</Turn_Termination_Rules>
+<Steps>
+### 1. Intake and Safety Bounds
+Restate the target result, known constraints, deliverables, validation expectations, and stop condition. Identify whether this turn is planning-only or whether the user also requested downstream execution.
+If the prompt contains destructive, credential-gated, external-production, or materially scope-changing decisions, hold those decisions for explicit user confirmation. Otherwise, continue through the planning loop.
+### 2. Metis Interview (Iterative, Checklist Clearance)
+Use `prometheus-strict-metis` as the interview voice. When native subagents are available, invoke the dedicated agent; otherwise run the same role in-context without editing files.
+Metis discovers success criteria, non-goals, evidence versus assumptions, required artifacts, likely execution lanes, and missing decisions. Before the first user-facing question batch, Metis must actively fan out repo/external research per intent: `explore` maps local surfaces and exact `gpt-5.4-mini` `researcher` lanes gather official/upstream or OSS-reference evidence. Research-heavy intents use more cheap researchers rather than downgrading Metis/Momus/Oracle judgment.
+Run the interview as a bounded loop:
+1. Identify every currently-UNKNOWN checklist item and every CRITICAL question whose answers would materially change scope, safety, or validation.
+2. Batch the round's independent questions into a single Structured Question Surface call (`questions[]` array, or numbered prose fallback outside tmux).
+3. Collect the structured `answers[]`, then run **Gap-fill Pass 1 — answer assimilation**: update evidence vs. assumption and mark checklist items YES only when USER_ANSWERED, ABSORBED_WITH_CITATION, or INFERRED_FROM_SPEC.
+4. Run **Gap-fill Pass 2 — residual adversarial scan**: re-check every remaining UNKNOWN against repo context, prior turns, research fan-out evidence, framework/industry defaults, and conservative reversible defaults; absorb non-CRITICAL gaps with citations/assumptions and leave only CRITICAL blockers.
+5. Run **between-round planning** after Round 1: refresh or reuse `<research_fan_out>` explore/researcher evidence, re-run spec prefill, and prepare Round 2 from residual CRITICAL gaps only.
+6. Evaluate the 6-item checklist (`<Turn_Termination_Rules>` tri-state) only after BOTH gap-fill passes and the minimum two emitted question rounds gate; exit when ALL YES and either no questions were emitted or Round 2 has been emitted and processed.
+7. If checklist clearance is not reached, or only Round 1 has been processed, return to step 1 with the next round. Cap at 5 rounds; on cap, carry remaining UNKNOWN items forward to Oracle as explicit `<unresolved_blocker>` entries.
+### 3. Momus Challenge (Bounded Retry)
+Use `prometheus-strict-momus` as the adversarial critique voice. When native subagents are available, invoke the dedicated agent; otherwise run the same role in-context without editing files.
+Momus challenges underspecified acceptance criteria, unsafe assumptions, hidden destructive steps, overbroad scope, missing verification, ownership conflicts, and `$ultragoal`/`$team` handoff ambiguity.
+**Bounded retry contract**: after Oracle synthesizes in §4, re-invoke Momus on the synthesized plan to verify that Oracle's resolutions did not introduce new risks (scope addition without matching verification, lane split that creates dependency cycles, safety reinforcement that contradicts stop conditions). Repeat the Momus → Oracle re-synthesis cycle up to **3 times total**. If blocking objections remain after the 3rd cycle, mark them as carried-forward in the final plan and proceed to §5.
+### 4. Oracle Synthesis (Two-Pass: Synthesis + Self-Verification)
+Use `prometheus-strict-oracle` as the synthesis voice. When native subagents are available, invoke the dedicated agent; otherwise run the same role in-context without editing files.
+**Pass 1 — Synthesis.** Oracle produces the final objective, scope and non-goals, accepted assumptions, resolved critique, sequenced steps or lanes, verification matrix, rollback/escalation conditions, and recommended OMX handoff.
+**Pass 2 — Self-Verification (machine-checkable acceptance contract).** Oracle re-reads its own Pass 1 output and asserts:
+- Every claim in the verification matrix has an explicit evidence source (test/build/lint/e2e/doc).
+- Every step lists its owner / lane / executor; no shared-file conflicts between parallel lanes.
+- Stop, rollback, and acceptance criteria are mutually consistent (no acceptance criterion is satisfied by a state that also triggers rollback).
+- No destructive, credential-gated, or external-production step is unauthorized.
+- The handoff command is concrete (callable verbatim) and points at an existing workflow (`$ultragoal`, `$team`, or `none`).
+- Clean-room credit is preserved.
+If any Pass 2 check fails, Oracle MUST loop back to Pass 1 to repair before emitting the plan. Cap Pass 1 ↔ Pass 2 cycles at **3**; on cycle 3 failure, emit the plan with the failing gates annotated as carried-forward and escalate to the user.
+### 5. Post-Plan Gap Check (Metis Re-Invocation)
+Before handing off, re-invoke `prometheus-strict-metis` on the finalized Oracle plan with a single charge: identify ambiguities that surfaced **only after** the plan was rendered — for example, new lane assignments that overlap, verification matrix gaps revealed by stop conditions, acceptance criteria that contradict the rollback contract.
+If post-plan Metis surfaces any blocking gap, return to §4 Pass 1 with the new question. Otherwise proceed to §6.
+### 6. Handoff
+Prometheus Strict stops with a plan unless the user explicitly invokes or authorizes the next workflow. Prefer this sequence:
+```text
+$ultragoal "<Oracle plan summary or .omx/plans/prometheus-strict/<slug>.md>"
+$team <N>:executor "execute the approved Ultragoal story in parallel lanes"  # only when warranted
+```
+</Steps>
+<Tool_Usage>
+- Use read-only repository inspection to verify referenced files, commands, and existing conventions.
+- Treat Metis research fan-out as part of planning, not execution: dispatch `explore` / exact `gpt-5.4-mini` `researcher` evidence-gathering before question generation for non-trivial intents, then re-prefill and ask only surviving CRITICAL gaps.
+- Use `prometheus-strict-metis`, `prometheus-strict-momus`, and `prometheus-strict-oracle` sequentially; do not fan out implementation work from this skill.
+- Use `$ultragoal` only as the recommended execution handoff after the plan is ready.
+- Use `$team` only when parallel lanes are independent and verifiable.
+</Tool_Usage>
+## State Management
+Prometheus Strict does not own a long-running runtime loop. If a durable planning artifact is needed, write the final plan to `.omx/plans/prometheus-strict/<slug>.md`. Draft-only or inline plans may set the artifact path to `N/A - inline plan only`.
+Do not create hook state, Sisyphus state, or `start-work` compatibility state for this skill.
+<Final_Checklist>
+- [ ] Target result is explicit.
+- [ ] Scope and non-goals are explicit.
+- [ ] Acceptance criteria are measurable.
+- [ ] Metis interview loop reached checklist clearance only after the mandatory two gap-fill passes following every `answers[]` batch and, if any question round was emitted, after the minimum two emitted question rounds gate; otherwise the 5-round cap was reached with UNKNOWN items carried forward as `<unresolved_blocker>` entries.
+- [ ] Momus objections are resolved or carried forward as explicit blockers, with at most 3 Momus → Oracle re-synthesis cycles consumed.
+- [ ] Oracle plan includes a verification matrix.
+- [ ] Oracle Pass 2 self-verification completed; every machine-checkable contract item passes or is annotated as carried-forward.
+- [ ] Post-plan Metis gap check produced no blocking objections (or all are carried forward).
+- [ ] Handoff recommends `$ultragoal` and `$team` only when warranted.
+- [ ] Clean-room credit is preserved.
+- [ ] No hook implementation or Sisyphus/start-work port was introduced.
+</Final_Checklist>
+<Advanced>
+## Output Contract
+If writing a durable plan file, store this markdown at `.omx/plans/prometheus-strict/<slug>.md` and reference that path in the handoff.
+```markdown
+## Prometheus Strict Plan
+### Target Result
+- <one-sentence objective>
+### Clarified Requirements (Metis)
+- <requirement / acceptance criterion>
+### Critique Resolved (Momus)
+- <risk or objection> -> <resolution>
+### Oracle Execution Plan
+1. <sequenced step or lane>
+### Verification Matrix
+| Claim | Required evidence | Owner/lane |
+| --- | --- | --- |
+| <claim> | <test/build/lint/e2e/doc evidence> | <owner> |
+### Artifact
+- Durable plan path: `.omx/plans/prometheus-strict/<slug>.md` or `N/A - inline plan only`
+### Handoff
+- Recommended next workflow: <$ultragoal / $team / direct execution / none>
+- Stop condition: <what proves the plan is ready or why it is blocked>
+### Clean-Room Credit
+Inspired by OMO Prometheus (`code-yeongyu/oh-my-openagent`), reimplemented from concept under MIT.
+```
+## Failure and Escalation
+Escalate instead of planning when a necessary answer cannot be inferred safely, the next step is destructive or credential-gated, required repository context is unavailable, or the user asks for behavior outside the non-goals.
+</Advanced>
+Original task:
+{{PROMPT}}

package/plugins/oh-my-codex/skills/ralph/SKILL.md CHANGED Viewed

@@ -127,6 +127,13 @@ Use the CLI-first state surface for Ralph lifecycle state (`omx state write/read
   `omx state write --input '{"mode":"ralph","current_phase":"verifying"}' --json` or `omx state write --input '{"mode":"ralph","current_phase":"fixing"}' --json`
 - **On completion** (only after the completion audit passes with real evidence):
   `omx state write --input '{"mode":"ralph","active":false,"current_phase":"complete","completed_at":"<now>","completion_audit":{"passed":true,"prompt_to_artifact_checklist":["<requirement mapped to artifact/evidence>"],"verification_evidence":["<fresh test/build/lint command and result>"]}}' --json`
+- **Before the final answer**:
+  1. Run fresh verification and read the output.
+  2. Build `prompt_to_artifact_checklist` entries that map every user requirement, workflow gate, named file, command, PR/delivery requirement, and stop condition to a concrete artifact or evidence item.
+  3. Build `verification_evidence` entries with concrete commands, exit status, files inspected, PR URLs, or other machine-checkable evidence.
+  4. Write the Ralph completion state with a top-level `completion_audit` field on the Ralph state object. Do not write bare top-level `prompt_to_artifact_checklist` or `verification_evidence` fields by themselves; the Stop gate will reject them.
+  5. Read the state back with `omx state read --input '{"mode":"ralph"}' --json` and verify `completion_audit.passed === true`, a non-empty checklist, and non-empty verification evidence before producing the final answer.
+  6. If Codex goal mode is active, call `update_goal({status:"complete"})` only after this Ralph audit read-back succeeds.
 - **On cancellation/cleanup**:
   run `$cancel` (which should call `omx state clear --input '{"mode":"ralph"}' --json`)

package/plugins/oh-my-codex/skills/ralplan/SKILL.md CHANGED Viewed

@@ -54,12 +54,25 @@ The consensus workflow:
    d. Return to Critic evaluation
    e. Repeat this loop until Critic returns `APPROVE` or 5 iterations are reached
    f. If 5 iterations are reached without `APPROVE`, present the best version to the user
-6. On Critic approval *(--interactive only)*: If `--interactive` is set, use the structured question UI to present the plan with approval options (Approve and execute via ralph / Approve and implement via team / Start a goal-mode follow-up / Request changes / Reject). Final plan must include ADR (Decision, Drivers, Alternatives considered, Why chosen, Consequences, Follow-ups), an explicit available-agent-types roster, concrete follow-up staffing guidance for both `ralph` and `team`, suggested reasoning levels by lane, explicit `omx team` / `$team` launch hints, a concrete **team verification** path, and a product-facing **Goal-Mode Follow-up Suggestions** section. Recommend `$ultragoal` by default for goal-mode follow-up, use `$autoresearch-goal` instead when the context is a research project, and use `$performance-goal` instead when the context is an optimization or performance project. Otherwise, output the final plan and stop.
-7. *(--interactive only)* User chooses: Approve (ralph, team, or a goal-mode follow-up), Request changes, or Reject
-8. *(--interactive only)* On approval: invoke `$ralph` for sequential execution, `$team` for parallel team execution, or the selected goal-mode follow-up (`$ultragoal`, `$autoresearch-goal`, or `$performance-goal`) with the approved plan and matching success/evaluator context -- never implement directly. Preserve the explicit available-agent-types roster, reasoning-by-lane guidance, role/staffing allocation guidance, launch hints, and verification-path guidance from the approved plan for Ralph/team paths.
+6. On Critic approval *(--interactive only)*: If `--interactive` is set, use the structured question UI to present the plan with approval options (Approve durable goal execution via ultragoal / Approve and implement via team / Explicit Ralph fallback / Start specialized goal-mode follow-up / Request changes / Reject). Final plan must include ADR (Decision, Drivers, Alternatives considered, Why chosen, Consequences, Follow-ups), an explicit available-agent-types roster, concrete follow-up staffing guidance for `$ultragoal` and `$team`, plus an explicit `$ralph` fallback note when persistent single-owner verification is intentionally selected, suggested reasoning levels by lane, explicit `omx team` / `$team` launch hints, a concrete **team verification** path, and a product-facing **Goal-Mode Follow-up Suggestions** section. Recommend `$ultragoal` by default for goal-mode follow-up, use `$autoresearch-goal` instead when the context is a research project, and use `$performance-goal` instead when the context is an optimization or performance project. Otherwise, output the final plan and stop.
+7. *(--interactive only)* User chooses: Approve (`$ultragoal` durable goal execution, `$team`, explicit `$ralph` fallback, or a specialized goal-mode follow-up), Request changes, or Reject
+8. *(--interactive only)* On approval: invoke `$ultragoal` for default durable sequential execution, `$team` for parallel team execution, the selected specialized goal-mode follow-up (`$autoresearch-goal` or `$performance-goal`), or `$ralph` only when the user explicitly selects that fallback with the approved plan and matching success/evaluator context -- never implement directly. Preserve the explicit available-agent-types roster, reasoning-by-lane guidance, role/staffing allocation guidance, launch hints, and verification-path guidance from the approved plan for Ultragoal/team paths and any explicit Ralph fallback.
 > **Important:** Steps 3 and 4 MUST run sequentially. Do NOT issue both agent calls in the same parallel batch. Always await the Architect result before invoking Critic.
+## Durable Consensus Handoff Contract
+Ralplan is not complete, skippable, or ready for execution merely because `.omx/plans/prd-*.md` and `.omx/plans/test-spec-*.md` exist. Those files are planning artifacts, not consensus evidence.
+Before any Autopilot, Pipeline, Ultragoal, Team, Ralph, or implementation handoff, persist a durable handoff record that distinguishes:
+- `planning_artifacts`: PRD/test-spec paths.
+- `ralplan_architect_review`: the completed Architect review with an approving verdict.
+- `ralplan_critic_review`: the completed Critic review with an approving verdict, recorded only after the Architect review.
+- `ralplan_consensus_gate.complete:true` only when both reviews are present, approving, and in the required Architect→Critic order.
+If Architect is missing/blocked, keep the workflow in Architect review or report that blocker. If Critic is missing/blocked/non-approving, keep the workflow in Critic/re-review or report the max-iteration outcome. Do not treat existing plan/test-spec files as permission to skip ralplan or start execution.
 Follow the Plan skill's full documentation for consensus mode details.
 ## Goal-Mode Follow-up Suggestions
@@ -70,7 +83,7 @@ When ralplan outputs a final handoff or asks the user to choose a next lane, inc
 - `$autoresearch-goal` — research-project follow-up when the plan centers on a question, literature/reference gathering, evaluator-backed research, or a professor/critic-style research deliverable.
 - `$performance-goal` — optimization/performance follow-up when the plan centers on speed, latency, throughput, memory, benchmark, or other measurable performance work.
-Keep `$ralph` and `$team` as first-class execution options where appropriate: use Ralph for persistent single-owner completion/verification pressure and team for coordinated parallel implementation. For parallelizable durable-goal delivery, recommend `$ultragoal` + `$team` together: Ultragoal remains the leader-owned `.omx/ultragoal` ledger/Codex-goal wrapper while Team runs parallel lanes and returns checkpoint-ready evidence. Do not present the goal-mode options as replacements for Ralph/team when the task is mainly implementation delivery; present them as better fits when durable goal tracking, research validation, or performance evaluators are the primary need.
+Keep `$team` as a first-class execution option and keep `$ralph` available only as an explicit fallback where appropriate: use Ultragoal as the default durable goal-mode follow-up, Team for coordinated parallel implementation, and Ralph only for intentionally selected persistent single-owner completion/verification pressure. For parallelizable durable-goal delivery, recommend `$ultragoal` + `$team` together: Ultragoal remains the leader-owned `.omx/ultragoal` ledger/Codex-goal wrapper while Team runs parallel lanes and returns checkpoint-ready evidence. Do not present Ralph as the recommended follow-up when durable goal tracking is needed; present Ultragoal as the superseding default, with Team for parallel delivery and Ralph only as an explicit fallback when its narrow persistence loop is specifically desired.
 ## Pre-context Intake
@@ -87,6 +100,7 @@ Before consensus planning or execution handoff, ensure a grounded context snapsh
    - likely codebase touchpoints
 4. If ambiguity remains high, gather brownfield facts first. When session guidance enables `USE_OMX_EXPLORE_CMD`, prefer `omx explore` for simple read-only repository lookups with narrow, concrete prompts; otherwise use the richer normal explore path. Then run `$deep-interview --quick <task>` before continuing.
 5. If the plan depends on official docs, version-aware framework guidance, best practices, or external dependency behavior, use `$best-practice-research` as the bounded evidence wrapper and auto-delegate `researcher` for the official/upstream lookup before finalizing the planning handoff so execution does not start from repo-local recall alone.
+6. If a prior `$autoresearch` or `$autoresearch-goal` run exists, treat its approved artifact as evidence for the plan. Do not include Autoresearch as a final architecture or runtime component unless the user explicitly requested ongoing research automation; otherwise synthesize the evidence into the `$ralplan` ADR, risks, and verification steps.
 Do not hand off to execution modes until this intake is complete; if urgency forces progress, explicitly document the risk tradeoffs.
@@ -150,9 +164,10 @@ The gate auto-passes when it detects **any** concrete signal. You do not need al
    - **Architect** reviews for soundness
    - **Critic** validates quality and testability
 5. On consensus approval, user chooses execution path:
-   - **ralph**: sequential execution with verification
-   - **team**: parallel coordinated agents
-6. Execution begins with a clear, bounded plan
+   - **ultragoal**: default durable follow-up for sequential goal execution with ledger checkpoints
+   - **team**: coordinated parallel execution for stories that need multiple lanes, with evidence ready for Ultragoal checkpoints
+   - **ralph**: explicit single-owner fallback only when the user intentionally wants a persistent verification/completion loop instead of the default durable goal ledger
+6. Execution begins with a clear, bounded plan through the selected handoff path
 ### Troubleshooting

package/plugins/oh-my-codex/skills/team/SKILL.md CHANGED Viewed

@@ -126,7 +126,7 @@ When `$team` is used as a follow-up mode from ralplan, carry forward the approve
 - state the recommended headcount and role counts
 - state the suggested reasoning level for each lane when available
 - explain why each lane exists (delivery, verification, specialist support)
-- include an explicit launch hint (`omx team N "<task>"` / `$team N "<task>"`) for the coordinated team run; mention a later separate Ralph follow-up only when genuinely needed
+- include an explicit launch hint (`omx team N "<task>"` / `$team N "<task>"`) for the coordinated team run; mention `$ultragoal` as the default durable follow-up/ledger path; mention a later separate Ralph follow-up only when explicitly requested or genuinely needed as a fallback
 - if the ideal role is unavailable, choose the closest role from the roster and say so
 ## Current Runtime Behavior (As Implemented)

package/plugins/oh-my-codex/skills/ultragoal/SKILL.md CHANGED Viewed

@@ -9,11 +9,13 @@ Use when the user asks for `ultragoal`, `create-goals`, `complete-goals`, durabl
 ## Purpose
-`ultragoal` turns a brief into repo-native artifacts and then drives a Codex goal safely through goal tools. New plans default to an aggregate Codex goal for the whole ultragoal run while OMX tracks G001/G002 story progress in the ledger.
+`ultragoal` turns a brief into repo-native artifacts and then drives a Codex goal safely through goal tools. New plans default to a stable pointer-style aggregate Codex goal for the whole durable plan in `.omx/ultragoal/goals.json`, including later accepted/appended stories under the original brief constraints, while OMX tracks G001/G002 story progress in the ledger. Ultragoal does not call Codex `/goal clear`; before multiple sequential ultragoal runs in one Codex session/thread, manually run `/goal clear` in the Codex UI so the previous completed aggregate goal does not block or confuse the next `create_goal`.
 - `.omx/ultragoal/brief.md`
 - `.omx/ultragoal/goals.json`
-- `.omx/ultragoal/ledger.jsonl`
+- `.omx/ultragoal/ledger.jsonl` (checkpoint and structured steering audit events)
+Existing aggregate plans with the legacy enumerated objective are migrated to the stable pointer objective on read, persisted to `goals.json`, retained in `codexObjectiveAliases` for already-active hidden Codex goal reconciliation, and audited with an `aggregate_objective_migrated` ledger entry.
 ## Create goals
@@ -21,7 +23,7 @@ Use when the user asks for `ultragoal`, `create-goals`, `complete-goals`, durabl
    - `omx ultragoal create-goals --brief "<brief>"`
    - `omx ultragoal create-goals --brief-file <path>`
    - `cat <brief> | omx ultragoal create-goals --from-stdin`
-   - `omx ultragoal create-goals --codex-goal-mode per-story --brief "<brief>"` only when one fresh Codex thread per story is explicitly preferred
+   - `omx ultragoal create-goals --codex-goal-mode per-story --brief "<brief>"` only when one Codex goal context per story is explicitly preferred
 2. Inspect `.omx/ultragoal/goals.json` and refine if needed.
 ## Complete goals
@@ -43,6 +45,36 @@ Loop until `omx ultragoal status` reports all goals complete:
    `omx ultragoal checkpoint --goal-id <id> --status blocked --evidence "<completed legacy Codex goal blocks create_goal in this thread>" --codex-goal-json <get_goal-json-or-path>`
 11. Resume failed goals with `omx ultragoal complete-goals --retry-failed`.
+## Dynamic steering
+Use `omx ultragoal steer` when real findings or blockers prove the current story decomposition should change while the aggregate objective and constraints stay fixed. Steering is explicit-only and evidence-backed; broad natural-language requests are rejected instead of guessed.
+Allowed mutation kinds are:
+- `add_subgoal`
+- `split_subgoal`
+- `reorder_pending`
+- `revise_pending_wording`
+- `annotate_ledger`
+- `mark_blocked_superseded`
+Examples:
+```sh
+omx ultragoal steer --kind add_subgoal --title "Investigate blocker" --objective "Validate the blocker and report evidence." --evidence "log/test output" --rationale "The blocker changes the safe execution order." --json
+omx ultragoal steer --directive-json ./steering.json --json
+```
+Steering invariants:
+- Do not edit the aggregate Codex objective, original brief constraints, quality gates, or completion status. The aggregate objective is a stable pointer to `.omx/ultragoal/goals.json` and `.omx/ultragoal/ledger.jsonl`, not an enumeration of initial goal ids.
+- Do not hard-delete goals, auto-complete work, weaken verification, or silently mutate `.omx/ultragoal`.
+- Accepted and rejected attempts append structured audit entries to `.omx/ultragoal/ledger.jsonl`.
+- Superseded goals remain in `goals.json` with steering metadata and are skipped for scheduling.
+- Blocked goals without replacements are skipped for scheduling but still block final completion until later explicit steering replaces or supersedes them.
+UserPromptSubmit uses the same steering API only for structured directives such as `OMX_ULTRAGOAL_STEER: { ... }`, `omx.ultragoal.steer: { ... }`, or `omx ultragoal steer: { ... }`. Normal prose does not mutate state, and repeated prompt-submit directives dedupe by prompt signature or idempotency key.
 ## Use Ultragoal and Team together
 Use ultragoal and team together for a durable Ultragoal story that benefits from parallel execution. Ultragoal remains leader-owned: `.omx/ultragoal/goals.json` stores the story plan and `.omx/ultragoal/ledger.jsonl` stores checkpoints. Team is the parallel execution engine and returns task/evidence status to the leader.
@@ -69,7 +101,7 @@ The final ultragoal story is not complete until the active agent has run the fin
    omx ultragoal record-review-blockers --goal-id <id> --title "Resolve final code-review blockers" --objective "<blocker-resolution objective>" --evidence "<review findings>" --codex-goal-json <active-get-goal-json-or-path>
    ```
-   This marks the current story `review_blocked`, appends a pending blocker-resolution story, keeps the Codex goal active, and lets `omx ultragoal complete-goals` start the blocker next. In legacy per-story mode, the blocker may need a fresh/available Codex goal context because the old per-story Codex goal remains active/incomplete.
+   This marks the current story `review_blocked`, appends a pending blocker-resolution story, keeps the Codex goal active, and lets `omx ultragoal complete-goals` start the blocker next. In legacy per-story mode, the blocker may need an available Codex goal context because the old per-story Codex goal remains active/incomplete.
 6. If review is clean, call `update_goal({status: "complete"})`, call `get_goal`, and checkpoint with a structured final gate:
@@ -90,6 +122,8 @@ The final ultragoal story is not complete until the active agent has run the fin
 ## Constraints
 - The shell command cannot directly invoke Codex interactive `/goal`; it emits a model-facing handoff for the active Codex agent.
+- Ultragoal intentionally does not invoke `/goal clear` or hidden `thread/goal/clear`; the model-facing tool surface only provides `get_goal`, `create_goal`, and `update_goal`.
+- After a completed aggregate ultragoal run, clear the Codex goal manually with `/goal clear` before starting another ultragoal run in the same session/thread.
 - Never call `create_goal` when `get_goal` reports a different active goal.
 - Never call `update_goal` unless the aggregate run or legacy per-story goal is actually complete.
 - In aggregate mode, intermediate story checkpoints require a matching `active` Codex snapshot; final story completion requires a matching `complete` snapshot after `update_goal`.

package/plugins/oh-my-codex/skills/ultrawork/SKILL.md CHANGED Viewed

@@ -16,7 +16,7 @@ Ultrawork is a parallel execution engine for high-throughput task completion. It
 <Do_Not_Use_When>
 - Task requires guaranteed completion with persistence, architect verification, or deslop/reverification -- use `ralph` instead (Ralph includes ultrawork)
-- Task requires a full autonomous pipeline -- use `autopilot` instead (autopilot includes Ralph which includes ultrawork)
+- Task requires a full autonomous pipeline -- use `autopilot` instead (autopilot defaults to Ultragoal, with Team/parallel execution used only when needed)
 - There is only one sequential task with no parallelism opportunity -- execute directly or delegate to a single `executor`
 - The request is still in plan-consensus mode -- keep planning artifacts in `ralplan` until execution is explicitly authorized
 - User needs session persistence for resume -- use `ralph`, which adds persistence on top of ultrawork

package/prompts/planner.md CHANGED Viewed

@@ -59,7 +59,7 @@ Leave execution with a right-sized, evidence-grounded plan: scope, steps, accept
 - Codebase facts come from inspection.
 - Plan is saved to `.omx/plans/{name}.md`.
 - User confirmation is obtained before handoff.
-- Consensus mode includes complete RALPLAN-DR, ADR, an explicit available-agent-types roster, staffing guidance for team and ralph follow-up paths, product-facing goal-mode follow-up suggestions (`$ultragoal` generally and by default, `$autoresearch-goal` for research projects, `$performance-goal` for optimization/performance projects), suggested reasoning levels by lane, launch hints, and a team verification path when needed.
+- Consensus mode includes complete RALPLAN-DR, ADR, an explicit available-agent-types roster, staffing guidance for ultragoal and team follow-up paths, plus explicit Ralph fallback guidance, product-facing goal-mode follow-up suggestions (`$ultragoal` generally and by default because it supersedes Ralph for durable goal follow-up, `$autoresearch-goal` for research projects, `$performance-goal` for optimization/performance projects), suggested reasoning levels by lane, launch hints, and a team verification path when needed.
 </success_criteria>
 <tools>