npm - oh-my-codex - Versions diffs - 0.18.0 → 0.18.2 - Mend

oh-my-codex 0.18.0 → 0.18.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (410)

package/Cargo.lock +6 -6
package/Cargo.toml +1 -1
package/README.md +45 -19
package/crates/omx-api/src/lib.rs +66 -9
package/crates/omx-sparkshell/src/exec.rs +125 -3
package/crates/omx-sparkshell/src/main.rs +126 -36
package/crates/omx-sparkshell/tests/execution.rs +225 -1
package/dist/agents/__tests__/definitions.test.js +14 -0
package/dist/agents/__tests__/definitions.test.js.map +1 -1
package/dist/agents/__tests__/native-config.test.js +19 -0
package/dist/agents/__tests__/native-config.test.js.map +1 -1
package/dist/agents/definitions.d.ts.map +1 -1
package/dist/agents/definitions.js +30 -0
package/dist/agents/definitions.js.map +1 -1
package/dist/agents/native-config.d.ts +1 -0
package/dist/agents/native-config.d.ts.map +1 -1
package/dist/agents/native-config.js +4 -0
package/dist/agents/native-config.js.map +1 -1
package/dist/catalog/__tests__/generator.test.js +4 -0
package/dist/catalog/__tests__/generator.test.js.map +1 -1
package/dist/cli/__tests__/codex-plugin-layout.test.js +15 -7
package/dist/cli/__tests__/codex-plugin-layout.test.js.map +1 -1
package/dist/cli/__tests__/doctor-warning-copy.test.js +137 -8
package/dist/cli/__tests__/doctor-warning-copy.test.js.map +1 -1
package/dist/cli/__tests__/index.test.js +203 -15
package/dist/cli/__tests__/index.test.js.map +1 -1
package/dist/cli/__tests__/install-docs-contract.test.d.ts +2 -0
package/dist/cli/__tests__/install-docs-contract.test.d.ts.map +1 -0
package/dist/cli/__tests__/install-docs-contract.test.js +55 -0
package/dist/cli/__tests__/install-docs-contract.test.js.map +1 -0
package/dist/cli/__tests__/launch-fallback.test.js +163 -0
package/dist/cli/__tests__/launch-fallback.test.js.map +1 -1
package/dist/cli/__tests__/question.test.js +29 -43
package/dist/cli/__tests__/question.test.js.map +1 -1
package/dist/cli/__tests__/setup-install-mode.test.js +94 -35
package/dist/cli/__tests__/setup-install-mode.test.js.map +1 -1
package/dist/cli/__tests__/sparkshell-cli.test.js +20 -1
package/dist/cli/__tests__/sparkshell-cli.test.js.map +1 -1
package/dist/cli/__tests__/sparkshell-packaging.test.js +1 -0
package/dist/cli/__tests__/sparkshell-packaging.test.js.map +1 -1
package/dist/cli/__tests__/ultragoal.test.js +227 -4
package/dist/cli/__tests__/ultragoal.test.js.map +1 -1
package/dist/cli/__tests__/update.test.js +72 -1
package/dist/cli/__tests__/update.test.js.map +1 -1
package/dist/cli/codex-feature-probe.d.ts +5 -0
package/dist/cli/codex-feature-probe.d.ts.map +1 -1
package/dist/cli/codex-feature-probe.js +13 -7
package/dist/cli/codex-feature-probe.js.map +1 -1
package/dist/cli/doctor.d.ts +7 -0
package/dist/cli/doctor.d.ts.map +1 -1
package/dist/cli/doctor.js +297 -17
package/dist/cli/doctor.js.map +1 -1
package/dist/cli/index.d.ts +9 -1
package/dist/cli/index.d.ts.map +1 -1
package/dist/cli/index.js +465 -110
package/dist/cli/index.js.map +1 -1
package/dist/cli/plugin-marketplace.d.ts +2 -0
package/dist/cli/plugin-marketplace.d.ts.map +1 -1
package/dist/cli/plugin-marketplace.js +15 -1
package/dist/cli/plugin-marketplace.js.map +1 -1
package/dist/cli/setup.d.ts.map +1 -1
package/dist/cli/setup.js +71 -11
package/dist/cli/setup.js.map +1 -1
package/dist/cli/sparkshell.d.ts +7 -1
package/dist/cli/sparkshell.d.ts.map +1 -1
package/dist/cli/sparkshell.js +13 -3
package/dist/cli/sparkshell.js.map +1 -1
package/dist/cli/ultragoal.d.ts +1 -1
package/dist/cli/ultragoal.d.ts.map +1 -1
package/dist/cli/ultragoal.js +184 -10
package/dist/cli/ultragoal.js.map +1 -1
package/dist/cli/update.d.ts +2 -0
package/dist/cli/update.d.ts.map +1 -1
package/dist/cli/update.js +14 -3
package/dist/cli/update.js.map +1 -1
package/dist/compat/__tests__/doctor-contract.test.js +3 -0
package/dist/compat/__tests__/doctor-contract.test.js.map +1 -1
package/dist/config/__tests__/codex-feature-flags.test.js +11 -1
package/dist/config/__tests__/codex-feature-flags.test.js.map +1 -1
package/dist/config/__tests__/codex-hooks.test.js +22 -11
package/dist/config/__tests__/codex-hooks.test.js.map +1 -1
package/dist/config/__tests__/commit-lore-guard.test.d.ts +2 -0
package/dist/config/__tests__/commit-lore-guard.test.d.ts.map +1 -0
package/dist/config/__tests__/commit-lore-guard.test.js +20 -0
package/dist/config/__tests__/commit-lore-guard.test.js.map +1 -0
package/dist/config/codex-feature-flags.d.ts +4 -0
package/dist/config/codex-feature-flags.d.ts.map +1 -1
package/dist/config/codex-feature-flags.js +4 -0
package/dist/config/codex-feature-flags.js.map +1 -1
package/dist/config/codex-hooks.d.ts +1 -0
package/dist/config/codex-hooks.d.ts.map +1 -1
package/dist/config/codex-hooks.js +8 -10
package/dist/config/codex-hooks.js.map +1 -1
package/dist/config/commit-lore-guard.d.ts +1 -0
package/dist/config/commit-lore-guard.d.ts.map +1 -1
package/dist/config/commit-lore-guard.js +29 -3
package/dist/config/commit-lore-guard.js.map +1 -1
package/dist/config/generator.d.ts +17 -1
package/dist/config/generator.d.ts.map +1 -1
package/dist/config/generator.js +124 -11
package/dist/config/generator.js.map +1 -1
package/dist/goal-workflows/__tests__/codex-goal-snapshot.test.js +21 -0
package/dist/goal-workflows/__tests__/codex-goal-snapshot.test.js.map +1 -1
package/dist/goal-workflows/codex-goal-snapshot.d.ts +4 -0
package/dist/goal-workflows/codex-goal-snapshot.d.ts.map +1 -1
package/dist/goal-workflows/codex-goal-snapshot.js +50 -3
package/dist/goal-workflows/codex-goal-snapshot.js.map +1 -1
package/dist/hooks/__tests__/autopilot-skill-contract.test.js +27 -6
package/dist/hooks/__tests__/autopilot-skill-contract.test.js.map +1 -1
package/dist/hooks/__tests__/consensus-execution-handoff.test.d.ts +1 -1
package/dist/hooks/__tests__/consensus-execution-handoff.test.js +13 -11
package/dist/hooks/__tests__/consensus-execution-handoff.test.js.map +1 -1
package/dist/hooks/__tests__/deep-interview-contract.test.js +4 -3
package/dist/hooks/__tests__/deep-interview-contract.test.js.map +1 -1
package/dist/hooks/__tests__/keyword-detector.test.js +173 -17
package/dist/hooks/__tests__/keyword-detector.test.js.map +1 -1
package/dist/hooks/__tests__/notify-hook-team-tmux-guard.test.js +33 -0
package/dist/hooks/__tests__/notify-hook-team-tmux-guard.test.js.map +1 -1
package/dist/hooks/__tests__/prometheus-strict-contract.test.d.ts +2 -0
package/dist/hooks/__tests__/prometheus-strict-contract.test.d.ts.map +1 -0
package/dist/hooks/__tests__/prometheus-strict-contract.test.js +320 -0
package/dist/hooks/__tests__/prometheus-strict-contract.test.js.map +1 -0
package/dist/hooks/__tests__/prompt-guidance-wave-two.test.js +12 -0
package/dist/hooks/__tests__/prompt-guidance-wave-two.test.js.map +1 -1
package/dist/hooks/__tests__/research-workflow-boundaries.test.d.ts +2 -0
package/dist/hooks/__tests__/research-workflow-boundaries.test.d.ts.map +1 -0
package/dist/hooks/__tests__/research-workflow-boundaries.test.js +35 -0
package/dist/hooks/__tests__/research-workflow-boundaries.test.js.map +1 -0
package/dist/hooks/extensibility/__tests__/dispatcher.test.js +26 -3
package/dist/hooks/extensibility/__tests__/dispatcher.test.js.map +1 -1
package/dist/hooks/extensibility/dispatcher.d.ts.map +1 -1
package/dist/hooks/extensibility/dispatcher.js +29 -14
package/dist/hooks/extensibility/dispatcher.js.map +1 -1
package/dist/hooks/keyword-detector.d.ts +1 -1
package/dist/hooks/keyword-detector.d.ts.map +1 -1
package/dist/hooks/keyword-detector.js +36 -9
package/dist/hooks/keyword-detector.js.map +1 -1
package/dist/hooks/keyword-registry.d.ts.map +1 -1
package/dist/hooks/keyword-registry.js +1 -0
package/dist/hooks/keyword-registry.js.map +1 -1
package/dist/hooks/prompt-guidance-contract.d.ts.map +1 -1
package/dist/hooks/prompt-guidance-contract.js +14 -2
package/dist/hooks/prompt-guidance-contract.js.map +1 -1
package/dist/hud/__tests__/hud-tmux-injection.test.js +36 -8
package/dist/hud/__tests__/hud-tmux-injection.test.js.map +1 -1
package/dist/hud/__tests__/reconcile.test.js +122 -11
package/dist/hud/__tests__/reconcile.test.js.map +1 -1
package/dist/hud/__tests__/render.test.js +84 -0
package/dist/hud/__tests__/render.test.js.map +1 -1
package/dist/hud/__tests__/resource-leak-watch.test.d.ts +2 -0
package/dist/hud/__tests__/resource-leak-watch.test.d.ts.map +1 -0
package/dist/hud/__tests__/resource-leak-watch.test.js +28 -0
package/dist/hud/__tests__/resource-leak-watch.test.js.map +1 -0
package/dist/hud/__tests__/state.test.js +51 -1
package/dist/hud/__tests__/state.test.js.map +1 -1
package/dist/hud/__tests__/tmux.test.js +69 -23
package/dist/hud/__tests__/tmux.test.js.map +1 -1
package/dist/hud/index.d.ts +2 -2
package/dist/hud/index.d.ts.map +1 -1
package/dist/hud/index.js +17 -6
package/dist/hud/index.js.map +1 -1
package/dist/hud/reconcile.d.ts.map +1 -1
package/dist/hud/reconcile.js +6 -3
package/dist/hud/reconcile.js.map +1 -1
package/dist/hud/render.d.ts.map +1 -1
package/dist/hud/render.js +26 -0
package/dist/hud/render.js.map +1 -1
package/dist/hud/state.d.ts +2 -1
package/dist/hud/state.d.ts.map +1 -1
package/dist/hud/state.js +62 -1
package/dist/hud/state.js.map +1 -1
package/dist/hud/tmux.d.ts +10 -3
package/dist/hud/tmux.d.ts.map +1 -1
package/dist/hud/tmux.js +60 -11
package/dist/hud/tmux.js.map +1 -1
package/dist/hud/types.d.ts +22 -0
package/dist/hud/types.d.ts.map +1 -1
package/dist/hud/types.js.map +1 -1
package/dist/notifications/__tests__/http-client-resource.test.d.ts +2 -0
package/dist/notifications/__tests__/http-client-resource.test.d.ts.map +1 -0
package/dist/notifications/__tests__/http-client-resource.test.js +41 -0
package/dist/notifications/__tests__/http-client-resource.test.js.map +1 -0
package/dist/notifications/__tests__/verbosity.test.js +20 -0
package/dist/notifications/__tests__/verbosity.test.js.map +1 -1
package/dist/notifications/config.d.ts.map +1 -1
package/dist/notifications/config.js +6 -3
package/dist/notifications/config.js.map +1 -1
package/dist/notifications/http-client.d.ts.map +1 -1
package/dist/notifications/http-client.js +78 -27
package/dist/notifications/http-client.js.map +1 -1
package/dist/notifications/types.d.ts +2 -0
package/dist/notifications/types.d.ts.map +1 -1
package/dist/openclaw/__tests__/dispatcher.test.js +49 -1
package/dist/openclaw/__tests__/dispatcher.test.js.map +1 -1
package/dist/openclaw/dispatcher.d.ts +7 -4
package/dist/openclaw/dispatcher.d.ts.map +1 -1
package/dist/openclaw/dispatcher.js +32 -69
package/dist/openclaw/dispatcher.js.map +1 -1
package/dist/pipeline/__tests__/orchestrator.test.js +128 -4
package/dist/pipeline/__tests__/orchestrator.test.js.map +1 -1
package/dist/pipeline/__tests__/stages.test.js +460 -9
package/dist/pipeline/__tests__/stages.test.js.map +1 -1
package/dist/pipeline/index.d.ts +8 -2
package/dist/pipeline/index.d.ts.map +1 -1
package/dist/pipeline/index.js +5 -2
package/dist/pipeline/index.js.map +1 -1
package/dist/pipeline/orchestrator.d.ts +5 -4
package/dist/pipeline/orchestrator.d.ts.map +1 -1
package/dist/pipeline/orchestrator.js +85 -17
package/dist/pipeline/orchestrator.js.map +1 -1
package/dist/pipeline/stages/code-review.d.ts +2 -2
package/dist/pipeline/stages/code-review.d.ts.map +1 -1
package/dist/pipeline/stages/code-review.js +5 -3
package/dist/pipeline/stages/code-review.js.map +1 -1
package/dist/pipeline/stages/deep-interview.d.ts +15 -0
package/dist/pipeline/stages/deep-interview.d.ts.map +1 -0
package/dist/pipeline/stages/deep-interview.js +32 -0
package/dist/pipeline/stages/deep-interview.js.map +1 -0
package/dist/pipeline/stages/ralph-verify.d.ts +5 -5
package/dist/pipeline/stages/ralph-verify.d.ts.map +1 -1
package/dist/pipeline/stages/ralph-verify.js +2 -2
package/dist/pipeline/stages/ralph-verify.js.map +1 -1
package/dist/pipeline/stages/ralplan.d.ts.map +1 -1
package/dist/pipeline/stages/ralplan.js +41 -6
package/dist/pipeline/stages/ralplan.js.map +1 -1
package/dist/pipeline/stages/ultragoal.d.ts +19 -0
package/dist/pipeline/stages/ultragoal.d.ts.map +1 -0
package/dist/pipeline/stages/ultragoal.js +38 -0
package/dist/pipeline/stages/ultragoal.js.map +1 -0
package/dist/pipeline/stages/ultraqa.d.ts +30 -0
package/dist/pipeline/stages/ultraqa.d.ts.map +1 -0
package/dist/pipeline/stages/ultraqa.js +46 -0
package/dist/pipeline/stages/ultraqa.js.map +1 -0
package/dist/pipeline/types.d.ts +8 -6
package/dist/pipeline/types.d.ts.map +1 -1
package/dist/pipeline/types.js +2 -2
package/dist/question/__tests__/ui.test.js +43 -10
package/dist/question/__tests__/ui.test.js.map +1 -1
package/dist/question/ui.d.ts +12 -0
package/dist/question/ui.d.ts.map +1 -1
package/dist/question/ui.js +83 -46
package/dist/question/ui.js.map +1 -1
package/dist/ralplan/__tests__/runtime.test.js +200 -10
package/dist/ralplan/__tests__/runtime.test.js.map +1 -1
package/dist/ralplan/consensus-gate.d.ts +23 -0
package/dist/ralplan/consensus-gate.d.ts.map +1 -0
package/dist/ralplan/consensus-gate.js +212 -0
package/dist/ralplan/consensus-gate.js.map +1 -0
package/dist/ralplan/runtime.d.ts +25 -0
package/dist/ralplan/runtime.d.ts.map +1 -1
package/dist/ralplan/runtime.js +144 -8
package/dist/ralplan/runtime.js.map +1 -1
package/dist/scripts/__tests__/codex-native-hook.test.js +1358 -79
package/dist/scripts/__tests__/codex-native-hook.test.js.map +1 -1
package/dist/scripts/__tests__/docs-site-contract.test.d.ts +2 -0
package/dist/scripts/__tests__/docs-site-contract.test.d.ts.map +1 -0
package/dist/scripts/__tests__/docs-site-contract.test.js +42 -0
package/dist/scripts/__tests__/docs-site-contract.test.js.map +1 -0
package/dist/scripts/__tests__/notify-dispatcher.test.js +115 -2
package/dist/scripts/__tests__/notify-dispatcher.test.js.map +1 -1
package/dist/scripts/__tests__/run-test-files.test.js +57 -0
package/dist/scripts/__tests__/run-test-files.test.js.map +1 -1
package/dist/scripts/__tests__/smoke-packed-install.test.js +23 -1
package/dist/scripts/__tests__/smoke-packed-install.test.js.map +1 -1
package/dist/scripts/__tests__/verify-native-agents.test.js +18 -3
package/dist/scripts/__tests__/verify-native-agents.test.js.map +1 -1
package/dist/scripts/cleanup-explore-harness.js +1 -0
package/dist/scripts/cleanup-explore-harness.js.map +1 -1
package/dist/scripts/codex-native-hook.d.ts.map +1 -1
package/dist/scripts/codex-native-hook.js +372 -44
package/dist/scripts/codex-native-hook.js.map +1 -1
package/dist/scripts/codex-native-pre-post.d.ts.map +1 -1
package/dist/scripts/codex-native-pre-post.js +9 -1
package/dist/scripts/codex-native-pre-post.js.map +1 -1
package/dist/scripts/notify-dispatcher.js +188 -4
package/dist/scripts/notify-dispatcher.js.map +1 -1
package/dist/scripts/notify-hook/process-runner.d.ts.map +1 -1
package/dist/scripts/notify-hook/process-runner.js +39 -17
package/dist/scripts/notify-hook/process-runner.js.map +1 -1
package/dist/scripts/notify-hook/team-dispatch.d.ts.map +1 -1
package/dist/scripts/notify-hook/team-dispatch.js +9 -5
package/dist/scripts/notify-hook/team-dispatch.js.map +1 -1
package/dist/scripts/notify-hook/team-tmux-guard.d.ts +1 -1
package/dist/scripts/notify-hook/team-tmux-guard.d.ts.map +1 -1
package/dist/scripts/notify-hook/team-tmux-guard.js +7 -1
package/dist/scripts/notify-hook/team-tmux-guard.js.map +1 -1
package/dist/scripts/run-test-files.js +13 -0
package/dist/scripts/run-test-files.js.map +1 -1
package/dist/scripts/smoke-packed-install.d.ts +3 -0
package/dist/scripts/smoke-packed-install.d.ts.map +1 -1
package/dist/scripts/smoke-packed-install.js +99 -1
package/dist/scripts/smoke-packed-install.js.map +1 -1
package/dist/scripts/sync-plugin-mirror.js +2 -2
package/dist/scripts/sync-plugin-mirror.js.map +1 -1
package/dist/scripts/verify-native-agents.js +2 -2
package/dist/scripts/verify-native-agents.js.map +1 -1
package/dist/sidecar/__tests__/resource-leak-watch.test.d.ts +2 -0
package/dist/sidecar/__tests__/resource-leak-watch.test.d.ts.map +1 -0
package/dist/sidecar/__tests__/resource-leak-watch.test.js +38 -0
package/dist/sidecar/__tests__/resource-leak-watch.test.js.map +1 -0
package/dist/sidecar/index.d.ts +1 -1
package/dist/sidecar/index.d.ts.map +1 -1
package/dist/sidecar/index.js +29 -12
package/dist/sidecar/index.js.map +1 -1
package/dist/state/__tests__/operations-ralph-phase.test.js +88 -1
package/dist/state/__tests__/operations-ralph-phase.test.js.map +1 -1
package/dist/state/__tests__/workflow-transition.test.js +6 -0
package/dist/state/__tests__/workflow-transition.test.js.map +1 -1
package/dist/state/operations.d.ts.map +1 -1
package/dist/state/operations.js +11 -0
package/dist/state/operations.js.map +1 -1
package/dist/state/workflow-transition.d.ts +1 -1
package/dist/state/workflow-transition.d.ts.map +1 -1
package/dist/state/workflow-transition.js +7 -0
package/dist/state/workflow-transition.js.map +1 -1
package/dist/subagents/tracker.d.ts.map +1 -1
package/dist/subagents/tracker.js +4 -3
package/dist/subagents/tracker.js.map +1 -1
package/dist/team/__tests__/runtime.test.js +36 -44
package/dist/team/__tests__/runtime.test.js.map +1 -1
package/dist/team/__tests__/tmux-session.test.js +163 -15
package/dist/team/__tests__/tmux-session.test.js.map +1 -1
package/dist/team/runtime.d.ts.map +1 -1
package/dist/team/runtime.js +10 -20
package/dist/team/runtime.js.map +1 -1
package/dist/team/tmux-session.d.ts.map +1 -1
package/dist/team/tmux-session.js +51 -21
package/dist/team/tmux-session.js.map +1 -1
package/dist/ultragoal/__tests__/artifacts.test.js +764 -10
package/dist/ultragoal/__tests__/artifacts.test.js.map +1 -1
package/dist/ultragoal/__tests__/docs-contract.test.js +57 -1
package/dist/ultragoal/__tests__/docs-contract.test.js.map +1 -1
package/dist/ultragoal/__tests__/steering-fixtures.d.ts +68 -0
package/dist/ultragoal/__tests__/steering-fixtures.d.ts.map +1 -0
package/dist/ultragoal/__tests__/steering-fixtures.js +259 -0
package/dist/ultragoal/__tests__/steering-fixtures.js.map +1 -0
package/dist/ultragoal/__tests__/steering-fixtures.test.d.ts +2 -0
package/dist/ultragoal/__tests__/steering-fixtures.test.d.ts.map +1 -0
package/dist/ultragoal/__tests__/steering-fixtures.test.js +65 -0
package/dist/ultragoal/__tests__/steering-fixtures.test.js.map +1 -0
package/dist/ultragoal/artifacts.d.ts +97 -2
package/dist/ultragoal/artifacts.d.ts.map +1 -1
package/dist/ultragoal/artifacts.js +837 -256
package/dist/ultragoal/artifacts.js.map +1 -1
package/dist/utils/__tests__/sleep-resource.test.d.ts +2 -0
package/dist/utils/__tests__/sleep-resource.test.d.ts.map +1 -0
package/dist/utils/__tests__/sleep-resource.test.js +39 -0
package/dist/utils/__tests__/sleep-resource.test.js.map +1 -0
package/dist/utils/sleep.d.ts.map +1 -1
package/dist/utils/sleep.js +17 -6
package/dist/utils/sleep.js.map +1 -1
package/package.json +2 -1
package/plugins/oh-my-codex/.codex-plugin/plugin.json +4 -3
package/plugins/oh-my-codex/hooks/codex-native-hook.mjs +56 -0
package/plugins/oh-my-codex/hooks/hooks.json +77 -0
package/plugins/oh-my-codex/skills/autopilot/SKILL.md +92 -50
package/plugins/oh-my-codex/skills/autoresearch/SKILL.md +4 -0
package/plugins/oh-my-codex/skills/autoresearch-goal/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/best-practice-research/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/cancel/SKILL.md +2 -2
package/plugins/oh-my-codex/skills/deep-interview/SKILL.md +8 -8
package/plugins/oh-my-codex/skills/omx-setup/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/pipeline/SKILL.md +23 -12
package/plugins/oh-my-codex/skills/plan/SKILL.md +8 -8
package/plugins/oh-my-codex/skills/prometheus-strict/README.md +35 -0
package/plugins/oh-my-codex/skills/prometheus-strict/SKILL.md +219 -0
package/plugins/oh-my-codex/skills/ralph/SKILL.md +7 -0
package/plugins/oh-my-codex/skills/ralplan/SKILL.md +22 -7
package/plugins/oh-my-codex/skills/team/SKILL.md +1 -1
package/plugins/oh-my-codex/skills/ultragoal/SKILL.md +38 -4
package/plugins/oh-my-codex/skills/ultrawork/SKILL.md +1 -1
package/prompts/planner.md +1 -1
package/prompts/prometheus-strict-metis.md +274 -0
package/prompts/prometheus-strict-momus.md +82 -0
package/prompts/prometheus-strict-oracle.md +107 -0
package/prompts/researcher.md +22 -3
package/skills/autopilot/SKILL.md +92 -50
package/skills/autoresearch/SKILL.md +4 -0
package/skills/autoresearch-goal/SKILL.md +1 -1
package/skills/best-practice-research/SKILL.md +1 -1
package/skills/cancel/SKILL.md +2 -2
package/skills/deep-interview/SKILL.md +8 -8
package/skills/omx-setup/SKILL.md +1 -1
package/skills/pipeline/SKILL.md +23 -12
package/skills/plan/SKILL.md +8 -8
package/skills/prometheus-strict/README.md +35 -0
package/skills/prometheus-strict/SKILL.md +219 -0
package/skills/ralph/SKILL.md +7 -0
package/skills/ralplan/SKILL.md +22 -7
package/skills/team/SKILL.md +1 -1
package/skills/ultragoal/SKILL.md +38 -4
package/skills/ultrawork/SKILL.md +1 -1
package/src/scripts/__tests__/codex-native-hook.test.ts +1757 -210
package/src/scripts/__tests__/docs-site-contract.test.ts +47 -0
package/src/scripts/__tests__/notify-dispatcher.test.ts +132 -3
package/src/scripts/__tests__/run-test-files.test.ts +67 -0
package/src/scripts/__tests__/smoke-packed-install.test.ts +31 -0
package/src/scripts/__tests__/verify-native-agents.test.ts +23 -3
package/src/scripts/cleanup-explore-harness.ts +1 -0
package/src/scripts/codex-native-hook.ts +393 -40
package/src/scripts/codex-native-pre-post.ts +16 -1
package/src/scripts/notify-dispatcher.ts +202 -4
package/src/scripts/notify-hook/process-runner.ts +40 -16
package/src/scripts/notify-hook/team-dispatch.ts +9 -5
package/src/scripts/notify-hook/team-tmux-guard.ts +7 -0
package/src/scripts/run-test-files.ts +13 -0
package/src/scripts/smoke-packed-install.ts +105 -0
package/src/scripts/sync-plugin-mirror.ts +3 -3
package/src/scripts/verify-native-agents.ts +2 -2
package/templates/catalog-manifest.json +22 -0

package/plugins/oh-my-codex/skills/autopilot/SKILL.md

@@ -1,56 +1,68 @@
 ---
 name: autopilot
-description: "[OMX] Strict autonomous loop: $ralplan -> $ralph -> $code-review"
+description: "[OMX] Strict autonomous loop: $deep-interview -> $ralplan -> $ultragoal (+ $team if needed) -> $code-review -> $ultraqa"
 ---
 <Purpose>
-Autopilot is the strict autonomous delivery loop for non-trivial work. Its primary contract is exactly:
+Autopilot is the strict autonomous delivery loop for non-trivial work. Its recommended/default contract is exactly:
 ```text
-$ralplan -> $ralph -> $code-review
+$deep-interview -> $ralplan -> $ultragoal (+ $team if needed) -> $code-review -> $ultraqa
 ```
-If `$code-review` is not clean, Autopilot returns to `$ralplan` with the review findings as the next planning input, then continues again through `$ralph` and `$code-review` until the review is clean or a hard blocker is reported.
+If `$code-review` or `$ultraqa` is not clean, Autopilot returns to `$ralplan` with the findings as the next planning input, then continues again through `$ultragoal`, `$code-review`, and `$ultraqa` until the gates are clean or a hard blocker is reported. Ralph is a legacy/explicit alternate execution loop only; do not advertise Ralph as the default Autopilot path.
 </Purpose>
 <Use_When>
-- User wants hands-off execution from a concrete idea, issue, PRD, or requirements artifact to reviewed code
+- User wants hands-off execution from a concrete idea, issue, PRD, or requirements artifact to reviewed and QA-checked code
 - User says `$autopilot`, "autopilot", "auto pilot", "autonomous", "build me", "create me", "make me", "full auto", "handle it all", or "I want a/an..."
-- Task needs planning, implementation, verification, and code review with automatic follow-up when review is not clean
+- Task needs clarification, planning, durable execution, verification, code review, and QA with automatic follow-up when gates are not clean
 </Use_When>
 <Do_Not_Use_When>
 - User wants to explore options or brainstorm -- use `$plan` / `$ralplan`
 - User says "just explain", "draft only", or "what would you suggest" -- respond conversationally
-- User wants a single focused code change -- use `$ralph` or direct executor work
+- User wants a single focused code change -- use `$ultragoal`, `$ralph` only when explicitly requested, or direct executor work
 - User wants only review/critique of existing code -- use `$code-review`
 </Do_Not_Use_When>
 <Strict_Loop_Contract>
-Autopilot must not run a separate broad expansion/planning/execution/QA/validation lifecycle as its primary behavior. It delegates those concerns to the three canonical workflow phases below:
+Autopilot must not run a separate broad expansion/planning/execution/QA/validation lifecycle as its primary behavior. It delegates those concerns to the canonical workflow phases below:
-1. **Phase `ralplan`** — consensus planning gate
-   - Ground the task with pre-context intake.
-   - Run or resume `$ralplan` to produce/update PRD and test-spec artifacts.
-   - When returning from a non-clean review, include `return_to_ralplan_reason` and the review findings as first-class planning input.
-   - Required handoff artifact: an approved plan/test spec suitable for `$ralph`.
-2. **Phase `ralph`** — implementation + verification loop
-   - Run `$ralph` from the approved ralplan artifacts.
-   - Ralph owns implementation, tests, build/lint/typecheck evidence, deslop where applicable, and architect verification.
-   - Required handoff artifact: implementation evidence and changed-file summary suitable for `$code-review`.
+1. **Phase `deep-interview`** — Socratic requirements clarification gate
+   - Run or resume `$deep-interview` to clarify intent, scope, non-goals, constraints, and decision boundaries.
+   - Required handoff artifact: a clarified spec or concise requirements summary suitable for `$ralplan`.
-3. **Phase `code-review`** — merge-readiness gate
-   - Run `$code-review` on the diff/artifacts produced by `$ralph`.
+2. **Phase `ralplan`** — consensus planning gate
+   - Ground the task with pre-context intake and the deep-interview artifact.
+   - Run or resume `$ralplan` to produce/update PRD and test-spec artifacts.
+   - PRD/test-spec files alone are not completion evidence. Ralplan may hand off only after durable consensus evidence records an `Architect` approval first and a subsequent `Critic` approval second.
+   - When returning from a non-clean review or QA pass, include `return_to_ralplan_reason` and the findings as first-class planning input.
+   - If either review is missing, blocked, out of order, or non-approving, remain in `ralplan` or report an explicit blocker/max-iteration outcome; do not progress to `$ultragoal`, `$team`, `$ralph`, or implementation.
+   - Required handoff artifact: an approved plan/test spec plus `ralplan_consensus_gate` evidence suitable for `$ultragoal`.
+3. **Phase `ultragoal`** — durable implementation + verification loop
+   - Run `$ultragoal` from the approved ralplan artifacts.
+   - Ultragoal owns durable Codex goal handoffs, `.omx/ultragoal` ledger checkpoints, implementation, tests, build/lint/typecheck evidence, cleanup, and final review gate discipline.
+   - Use `$team` only inside an active Ultragoal story when the story clearly benefits from coordinated parallel execution (for example independent file/module lanes, broad test matrix work, or multi-domain implementation). Team remains explicit and leader-owned; Ultragoal keeps the goal/ledger state.
+   - Required handoff artifact: implementation evidence, changed-file summary, verification evidence, and Ultragoal ledger/checkpoint references suitable for `$code-review`.
+4. **Phase `code-review`** — merge-readiness gate
+   - Run `$code-review` on the diff/artifacts produced by `$ultragoal`.
    - A clean review means final recommendation `APPROVE` with architectural status `CLEAR`.
    - `COMMENT`, `REQUEST CHANGES`, any architectural `WATCH`/`BLOCK`, or any unresolved finding is not clean.
    - If not clean, increment the review cycle, persist `review_verdict`, set `return_to_ralplan_reason`, and transition back to Phase `ralplan`.
-The only normal terminal state is `complete` after a clean code review. Cancellation, blocked credentials, unrecoverable repeated failures, or explicit user stop may terminate earlier with preserved state.
+5. **Phase `ultraqa`** — adversarial QA gate
+   - Run `$ultraqa` after a clean code review when user-facing behavior, workflows, CLI/runtime behavior, integration surfaces, or regression risk warrant adversarial QA.
+   - For docs-only or trivially non-runtime changes, record `ultraqa` as skipped with an explicit condition and evidence.
+   - If UltraQA finds issues, persist the QA verdict/evidence, set `return_to_ralplan_reason`, and transition back to Phase `ralplan`.
+The only normal terminal state is `complete` after clean code review and a passed or explicitly skipped UltraQA gate. Cancellation, blocked credentials, unrecoverable repeated failures, or explicit user stop may terminate earlier with preserved state.
 </Strict_Loop_Contract>
 <Pre-context Intake>
-Before Phase `ralplan` starts or resumes:
+Before Phase `deep-interview` or `ralplan` starts or resumes:
 1. Derive a task slug from the request.
 2. Reuse the latest relevant `.omx/context/{slug}-*.md` snapshot when available.
 3. If none exists, create `.omx/context/{slug}-{timestamp}.md` (UTC `YYYYMMDDTHHMMSSZ`) with:
@@ -60,16 +72,18 @@ Before Phase `ralplan` starts or resumes:
    - constraints
    - unknowns/open questions
    - likely codebase touchpoints
-4. If ambiguity remains high, run `explore` first for brownfield facts, then run the Socratic `$deep-interview --quick <task>` before `$ralplan`.
+4. If brownfield facts are missing, run `explore` first before or during `$deep-interview` (`$deep-interview --quick <task>` remains acceptable for bounded low-ambiguity intake); do not skip the clarification gate merely because the task sounds actionable.
 5. Carry the snapshot path in Autopilot state and all handoff artifacts.
 </Pre-context Intake>
 <Execution_Policy>
-- Always execute phases in order: `ralplan`, then `ralph`, then `code-review`.
-- Never skip directly from vague/freeform expansion to implementation; unclear input must be clarified or planned through `$ralplan`.
-- A non-clean `$code-review` always returns to `$ralplan`; do not patch findings ad hoc outside the loop.
+- Always execute the recommended phases in order: `deep-interview`, then `ralplan`, then `ultragoal`, then `code-review`, then `ultraqa`.
+- `$team` is conditional and explicit: use it only within an Ultragoal story when parallel execution materially improves throughput, quality, or safety.
+- Never skip directly from vague/freeform expansion to implementation; unclear input must be clarified and planned through `$deep-interview` and `$ralplan`.
+- A non-clean `$code-review` or failed `$ultraqa` always returns to `$ralplan`; do not patch findings ad hoc outside the loop.
 - Each phase must write/update Autopilot state before handing off.
-- Use existing hooks, `.omx/state`, `$ralplan`, `$ralph`, `$code-review`, and pipeline primitives; do not invent a separate execution framework.
+- Use existing hooks, `.omx/state`, `$deep-interview`, `$ralplan`, `$ultragoal`, optional `$team`, `$code-review`, `$ultraqa`, and pipeline primitives; do not invent a separate execution framework.
+- Preserve legacy compatibility: if a user explicitly requests the old Ralph execution lane, use `$ralph` as an intentional alternate execution phase, but do not present it as Autopilot's default recommended loop.
 - Continue automatically through safe reversible phase transitions. Ask only for destructive, credential-gated, or materially preference-dependent branches.
 - Apply the shared workflow guidance pattern: outcome-first framing, concise visible updates for multi-step execution, local overrides for the active workflow branch, validation proportional to risk, explicit stop rules, and automatic continuation for safe reversible steps. Ask only for material, destructive, credentialed, external-production, or preference-dependent branches.
 </Execution_Policy>
@@ -83,81 +97,109 @@ Required fields:
 {
   "mode": "autopilot",
   "active": true,
-  "current_phase": "ralplan",
+  "current_phase": "deep-interview",
   "iteration": 1,
   "review_cycle": 0,
   "max_iterations": 10,
-  "phase_cycle": ["ralplan", "ralph", "code-review"],
+  "phase_cycle": ["deep-interview", "ralplan", "ultragoal", "code-review", "ultraqa"],
   "handoff_artifacts": {
     "context_snapshot_path": ".omx/context/<slug>-<timestamp>.md",
+    "deep_interview": null,
     "ralplan": null,
-    "ralph": null,
-    "code_review": null
+    "ralplan_consensus_gate": {
+      "required": true,
+      "sequence": ["architect-review", "critic-review"],
+      "planning_artifacts_are_not_consensus": true,
+      "required_review_roles": ["architect", "critic"],
+      "ralplan_architect_review": null,
+      "ralplan_critic_review": null,
+      "complete": false
+    },
+    "ultragoal": null,
+    "code_review": null,
+    "ultraqa": null
   },
   "review_verdict": null,
+  "qa_verdict": null,
   "return_to_ralplan_reason": null
 }
 ```
-- **On start**: `omx state write --input '{"mode":"autopilot","active":true,"current_phase":"ralplan","iteration":1,"review_cycle":0,"state":{"phase_cycle":["ralplan","ralph","code-review"],"handoff_artifacts":{"context_snapshot_path":"<snapshot-path>","ralplan":null,"ralph":null,"code_review":null},"review_verdict":null,"return_to_ralplan_reason":null}}' --json`
-- **On ralplan -> ralph**: set `current_phase:"ralph"`, persist the plan/test-spec paths under `handoff_artifacts.ralplan`.
-- **On ralph -> code-review**: set `current_phase:"code-review"`, persist implementation/test evidence under `handoff_artifacts.ralph`.
-- **On clean review**: set `active:false`, `current_phase:"complete"`, persist `review_verdict:{recommendation:"APPROVE", architectural_status:"CLEAR", clean:true}` and `completed_at`.
-- **On non-clean review**: increment `iteration` and `review_cycle`, set `current_phase:"ralplan"`, persist `review_verdict:{..., clean:false}`, persist `handoff_artifacts.code_review`, and set `return_to_ralplan_reason` to a concise review-driven reason.
+- **On start**: `omx state write --input '{"mode":"autopilot","active":true,"current_phase":"deep-interview","iteration":1,"review_cycle":0,"state":{"phase_cycle":["deep-interview","ralplan","ultragoal","code-review","ultraqa"],"handoff_artifacts":{"context_snapshot_path":"<snapshot-path>","deep_interview":null,"ralplan":null,"ralplan_consensus_gate":{"required":true,"sequence":["architect-review","critic-review"],"planning_artifacts_are_not_consensus":true,"required_review_roles":["architect","critic"],"ralplan_architect_review":null,"ralplan_critic_review":null,"complete":false},"ultragoal":null,"code_review":null,"ultraqa":null},"review_verdict":null,"qa_verdict":null,"return_to_ralplan_reason":null}}' --json`
+- **On deep-interview -> ralplan**: set `current_phase:"ralplan"`, persist the clarified spec/requirements under `handoff_artifacts.deep_interview`.
+- **On ralplan -> ultragoal**: only after `ralplan_consensus_gate.complete:true`, with `ralplan_architect_review.agent_role:"architect"` and `ralplan_architect_review.verdict:"approve"` recorded before `ralplan_critic_review.agent_role:"critic"` and `ralplan_critic_review.verdict:"approve"`; set `current_phase:"ultragoal"` and persist the plan/test-spec paths under `handoff_artifacts.ralplan`.
+- **On missing ralplan consensus evidence**: keep `current_phase:"ralplan"`, persist `ralplan_consensus_gate.complete:false` with `blocked_reason`, and report an explicit blocker or max-iteration outcome instead of handing off to execution.
+- **On ultragoal -> code-review**: set `current_phase:"code-review"`, persist implementation/test/ledger evidence under `handoff_artifacts.ultragoal`.
+- **On code-review -> ultraqa**: set `current_phase:"ultraqa"`, persist the clean review under `handoff_artifacts.code_review`.
+- **On clean review + passed/skipped QA**: set `active:false`, `current_phase:"complete"`, persist `review_verdict:{recommendation:"APPROVE", architectural_status:"CLEAR", clean:true}`, `qa_verdict:{clean:true, skipped:<boolean>, reason:<string|null>}`, and `completed_at`.
+- **On non-clean review or failed QA**: increment `iteration` and `review_cycle`, set `current_phase:"ralplan"`, persist `review_verdict` or `qa_verdict`, persist the phase handoff, and set `return_to_ralplan_reason` to a concise findings-driven reason.
+- **Legacy Ralph state**: if a user explicitly selected the legacy Ralph execution lane, phase names and handoff keys may include `ralph`; preserve and resume them rather than rewriting history to Ultragoal.
 - **On cancellation**: run `$cancel`; preserve progress for resume rather than deleting handoff artifacts.
 </State_Management>
 <Continuation_And_Resume>
 When the user says `continue`, `resume`, or `keep going` while Autopilot is active, read `autopilot-state.json` and continue from `current_phase`:
+- `deep-interview`: clarify requirements and record the handoff artifact.
 - `ralplan`: run/update consensus planning from current handoffs and any `return_to_ralplan_reason`.
-- `ralph`: execute the approved plan and record verification evidence.
+- `ultragoal`: execute the approved plan durably and record verification/ledger evidence.
+- `team`: continue explicit team work only when it is nested under the active Ultragoal story and report evidence back to the leader.
 - `code-review`: review the current diff and decide clean vs return-to-ralplan.
+- `ultraqa`: run or explicitly skip adversarial QA based on the documented condition, then finish if clean or transition to `ralplan` with findings if not clean.
+- `ralph`: resume only for explicit legacy Ralph-path Autopilot state.
 - `complete`: report completion evidence; do not restart.
 Do not restart discovery or discard handoff artifacts on continuation.
 </Continuation_And_Resume>
 <Pipeline_Orchestrator>
-Autopilot may be represented by the configurable pipeline orchestrator (`src/pipeline/`) when useful. The Autopilot pipeline contract is:
+Autopilot may be represented by the configurable pipeline orchestrator (`src/pipeline/`) when useful. The default Autopilot pipeline contract is:
 ```text
-ralplan -> ralph -> code-review
+deep-interview -> ralplan -> ultragoal -> code-review -> ultraqa
 ```
-Pipeline state should use `current_phase` values that match the same phase names (`ralplan`, `ralph`, `code-review`, `complete`, `failed`) and should carry `iteration`, `review_cycle`, `handoff_artifacts`, `review_verdict`, and `return_to_ralplan_reason` alongside stage results.
+Pipeline state should use `current_phase` values that match the same phase names (`deep-interview`, `ralplan`, `ultragoal`, `code-review`, `ultraqa`, `complete`, `failed`) and should carry `iteration`, `review_cycle`, `handoff_artifacts`, `review_verdict`, `qa_verdict`, and `return_to_ralplan_reason` alongside stage results. `$team` is not a default pipeline stage; it is an explicit conditional execution engine inside an Ultragoal story.
 </Pipeline_Orchestrator>
 <Escalation_And_Stop_Conditions>
 - Stop and report a blocker when required credentials/authority are missing.
-- Stop and report when the same review or verification failure recurs across 3 review cycles with no meaningful new plan.
+- Stop and report when the same review or QA failure recurs across 3 review cycles with no meaningful new plan.
 - Stop when the user says "stop", "cancel", or "abort" and run `$cancel`.
-- Otherwise, continue the loop until `$code-review` is clean.
+- Otherwise, continue the loop until `$code-review` is clean and `$ultraqa` has passed or been explicitly skipped with evidence.
 </Escalation_And_Stop_Conditions>
 <Final_Checklist>
-- [ ] Phase `ralplan` produced/updated approved planning artifacts
-- [ ] Phase `ralph` implemented and verified the plan with fresh evidence
+- [ ] Phase `deep-interview` produced/updated clarified requirements or a concise spec
+- [ ] Phase `ralplan` produced/updated approved planning artifacts and durable sequential Architect→Critic consensus evidence
+- [ ] Phase `ultragoal` implemented and verified the plan with fresh evidence and durable ledger/checkpoint references
+- [ ] `$team` was used only if the active Ultragoal story needed coordinated parallel work, or explicitly recorded as not needed
 - [ ] Phase `code-review` returned a clean verdict (`APPROVE` + `CLEAR`)
-- [ ] `review_verdict.clean` is true and `return_to_ralplan_reason` is null
-- [ ] Tests/build/lint/typecheck evidence from Ralph is available in handoff artifacts
+- [ ] Phase `ultraqa` passed, or was explicitly skipped because the change was docs-only/trivially non-runtime with evidence
+- [ ] `review_verdict.clean` is true, `qa_verdict.clean` is true, and `return_to_ralplan_reason` is null
+- [ ] Tests/build/lint/typecheck evidence from Ultragoal is available in handoff artifacts
 - [ ] Autopilot state is marked `complete` or cancellation state is preserved coherently
-- [ ] User receives a concise summary with plan, implementation, verification, and review evidence
+- [ ] User receives a concise summary with clarification, plan, implementation, verification, review, and QA evidence
 </Final_Checklist>
 <Examples>
 <Good>
 User: `$autopilot implement GitHub issue #42`
-Flow: create/load context snapshot -> `$ralplan` issue plan -> `$ralph` implementation + tests -> `$code-review`; if review requests changes, return to `$ralplan` with findings.
+Flow: create/load context snapshot -> `$deep-interview` requirements check -> `$ralplan` issue plan -> `$ultragoal` durable implementation + tests (launch `$team` only if a story needs parallel lanes) -> `$code-review` -> `$ultraqa`; if review or QA requests changes, return to `$ralplan` with findings.
 </Good>
 <Good>
 User: `continue`
 Context: Autopilot state says `current_phase:"code-review"`.
-Flow: run `$code-review` on current diff, persist verdict, finish if clean or transition to `ralplan` with findings if not clean.
+Flow: run `$code-review` on current diff, persist verdict, transition to `ultraqa` if clean or to `ralplan` with findings if not clean.
+</Good>
+<Good>
+User: `$autopilot --legacy-ralph finish the migration`
+Flow: preserve the explicit legacy Ralph execution choice and run the old Ralph execution lane as an alternate, without changing the documented default Autopilot recommendation.
 </Good>
 <Bad>
 Autopilot invents independent "Expansion", "QA", and "Validation" phases and treats them as the primary lifecycle.
-Why bad: this bypasses the strict `$ralplan -> $ralph -> $code-review` contract.
+Why bad: this bypasses the strict `$deep-interview -> $ralplan -> $ultragoal -> $code-review -> $ultraqa` contract.
 </Bad>
 </Examples>
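
Reviewer sketch (not part of the package): the phase transitions described in the `State_Management` hunk above can be read as a small state machine. The TypeScript below is a minimal illustration under stated assumptions. The type names, `initialState`, `advance`, and `backToRalplan` are hypothetical, the gate is flattened to a top-level `gate` field for brevity, and resetting the consensus gate on a return to `ralplan` is an assumption that the diff does not state. The required Architect-before-Critic ordering is not modeled here because the state JSON carries no ordering field.

```typescript
// Hypothetical sketch: field names echo the state JSON in the diff, but this
// is not the package's implementation.
type Phase =
  | "deep-interview" | "ralplan" | "ultragoal"
  | "code-review" | "ultraqa" | "complete";

interface Review { agent_role: string; verdict: string; }

interface ConsensusGate {
  ralplan_architect_review: Review | null;
  ralplan_critic_review: Review | null;
  complete: boolean;
}

interface AutopilotState {
  active: boolean;
  current_phase: Phase;
  iteration: number;
  review_cycle: number;
  gate: ConsensusGate;
  return_to_ralplan_reason: string | null;
}

const emptyGate = (): ConsensusGate => ({
  ralplan_architect_review: null,
  ralplan_critic_review: null,
  complete: false,
});

function initialState(): AutopilotState {
  return {
    active: true,
    current_phase: "deep-interview",
    iteration: 1,
    review_cycle: 0,
    gate: emptyGate(),
    return_to_ralplan_reason: null,
  };
}

// The gate passes only when it is marked complete and both reviews approve.
function gateSatisfied(g: ConsensusGate): boolean {
  const a = g.ralplan_architect_review;
  const c = g.ralplan_critic_review;
  return g.complete
    && a !== null && a.agent_role === "architect" && a.verdict === "approve"
    && c !== null && c.agent_role === "critic" && c.verdict === "approve";
}

// Non-clean review or failed QA: bump both counters and return to planning.
function backToRalplan(s: AutopilotState, reason: string): AutopilotState {
  return {
    ...s,
    current_phase: "ralplan",
    iteration: s.iteration + 1,
    review_cycle: s.review_cycle + 1,
    gate: emptyGate(), // assumption: re-planning requires fresh consensus
    return_to_ralplan_reason: reason,
  };
}

// `clean` is only consulted in the code-review and ultraqa phases.
function advance(s: AutopilotState, clean = true, reason = "findings"): AutopilotState {
  switch (s.current_phase) {
    case "deep-interview": return { ...s, current_phase: "ralplan" };
    case "ralplan": return gateSatisfied(s.gate) ? { ...s, current_phase: "ultragoal" } : s;
    case "ultragoal": return { ...s, current_phase: "code-review" };
    case "code-review": return clean ? { ...s, current_phase: "ultraqa" } : backToRalplan(s, reason);
    case "ultraqa": return clean ? { ...s, current_phase: "complete", active: false } : backToRalplan(s, reason);
    default: return s; // complete: report evidence, do not restart
  }
}
```

In this reading, `advance` leaves the state in `ralplan` while consensus evidence is missing, which corresponds to the "missing ralplan consensus evidence" bullet; persisting a `blocked_reason` and handoff artifacts is left out of the sketch.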

package/plugins/oh-my-codex/skills/autoresearch/SKILL.md CHANGED

@@ -8,6 +8,10 @@ description: Stateful validator-gated research loop with native-hook persistence
 Autoresearch is the skill-first replacement for the deprecated `omx autoresearch` command.
 It keeps the useful measured-research loop, but it now runs as a native-hook stateful workflow instead of a direct CLI or tmux launch surface.
+## Boundary with planning research
+Use `$autoresearch` when the research output itself is a bounded deliverable that must pass an explicit validator. Do not recommend it for ordinary pre-planning docs lookup or general best-practice checks; use `$best-practice-research` for that. If `$autoresearch` is intentionally run before architecture planning, its approved artifact should feed evidence into `$ralplan`; it should not become a final architecture/component unless the user explicitly asks for ongoing research automation.
 ## Use when
 - You want a Ralph-ish persistent research loop
 - The task should keep nudging until explicit validation evidence exists

package/plugins/oh-my-codex/skills/autoresearch-goal/SKILL.md CHANGED

@@ -5,7 +5,7 @@ description: Durable professor-critic research workflow over Codex goal mode wit
 # Autoresearch Goal
-Use this workflow when a research mission should be bound to Codex goal-mode focus while OMX remains the durable state owner.
+Use this workflow when a research mission should be bound to Codex goal-mode focus while OMX remains the durable state owner. This is for research projects that need Codex goal-mode management plus professor/critic-style validation; it is not the default answer for ordinary pre-planning best-practice lookup.
 ## Boundary
 - Do **not** use or revive the deprecated `omx autoresearch` direct launch surface.

package/plugins/oh-my-codex/skills/best-practice-research/SKILL.md CHANGED

@@ -10,7 +10,7 @@ Use this skill when a task depends on current external best practices, version-a
 ## Purpose
-Produce a cited, reusable best-practice answer or handoff that separates current external evidence from repo-local facts and dependency-selection decisions.
+Produce a cited, reusable best-practice answer or handoff that separates current external evidence from repo-local facts and dependency-selection decisions. For pre-planning investigation, this is the ordinary first research wrapper: gather official/upstream evidence, then hand it to `$ralplan` or the caller as planning input. Do not present `$best-practice-research` as a final architecture component or as a validator-gated research loop.
 ## Activate When

package/plugins/oh-my-codex/skills/cancel/SKILL.md CHANGED

@@ -56,7 +56,7 @@ For Ralph-targeted cancellation (standalone or linked), completion is defined by
 See: `docs/contracts/ralph-cancel-contract.md`.
 Active modes are still cancelled in dependency order:
-1. Autopilot (includes linked ralph/ultraqa/ecomode cleanup)
+1. Autopilot (includes linked ultragoal/ultraqa/ecomode cleanup plus explicit legacy Ralph cleanup)
 2. Ralph (cleans its linked ultrawork or ecomode)
 3. Ultrawork (standalone)
 4. Ecomode (standalone)
@@ -374,7 +374,7 @@ Mode-specific subsections below describe what extra cleanup each handler perform
 ## Notes
-- **Dependency-aware**: Autopilot cancellation cleans up Ralph and UltraQA
+- **Dependency-aware**: Autopilot cancellation cleans up Ultragoal/UltraQA state and any explicit legacy Ralph state
 - **Link-aware**: Ralph cancellation cleans up linked Ultrawork or Ecomode
 - **Safe**: Only clears linked Ultrawork, preserves standalone Ultrawork
 - **Local-only**: Clears state files in `.omx/state/` directory

package/plugins/oh-my-codex/skills/deep-interview/SKILL.md CHANGED

@@ -370,7 +370,7 @@ Include these product-facing suggestions when they fit the clarified spec, witho
 - **`$autoresearch-goal`** — use when the clarified context is a research project: a research question, reference/literature gathering, evaluator-backed analysis, or professor/critic-style deliverable.
 - **`$performance-goal`** — use when the clarified context is an optimization or performance project with measurable speed, latency, throughput, memory, benchmark, or evaluator criteria.
-Preserve `$ralph` for persistent single-owner execution/verification and `$team` for coordinated parallel implementation. Present goal-mode options as context-sensitive next steps, not as generic replacements for implementation lanes.
+Recommend `$ultragoal` as the default durable goal-mode follow-up because it supersedes Ralph for goal tracking. Preserve `$team` for coordinated parallel implementation and keep `$ralph` only as an explicit fallback for persistent single-owner execution/verification when the user specifically selects it.
 ### 1. **`$ralplan` (Recommended)**
 - **Input Artifact:** `.omx/specs/deep-interview-{slug}.md` (optionally accompanied by the transcript/context snapshot for traceability)
@@ -379,7 +379,7 @@ Preserve `$ralph` for persistent single-owner execution/verification and `$team`
 - **Skipped / Already-Satisfied Stages:** Requirements discovery, ambiguity clarification, and early intent-boundary elicitation
 - **Expected Output:** Canonical planning artifacts under `.omx/plans/`, especially `prd-*.md` and `test-spec-*.md`
 - **Best When:** Requirements are clear enough to stop interviewing, but architectural validation / consensus planning is still desirable
-- **Next Recommended Step:** Use the approved planning artifacts with `$autopilot`, `$ralph`, `$team`, or `$ultragoal` as the default goal-mode follow-up; choose `$autoresearch-goal` for research validation or `$performance-goal` for measurable optimization
+- **Next Recommended Step:** Use the approved planning artifacts with `$ultragoal` as the default durable goal-mode follow-up (optionally with `$team` for parallel lanes); choose `$autoresearch-goal` for research validation or `$performance-goal` for measurable optimization, and use `$ralph` only as an explicit fallback when a narrow single-owner persistence loop is requested
 ### 2. **`$autopilot`**
 - **Input Artifact:** `.omx/specs/deep-interview-{slug}.md`
@@ -388,25 +388,25 @@ Preserve `$ralph` for persistent single-owner execution/verification and `$team`
 - **Skipped / Already-Satisfied Stages:** Initial requirement discovery and ambiguity reduction
 - **Expected Output:** Planning/execution progress, QA evidence, and validation artifacts produced by autopilot
 - **Best When:** The clarified spec is already strong enough for direct planning + execution without an additional consensus gate
-- **Next Recommended Step:** Continue through autopilot's execution/QA/validation flow; if coordination-heavy execution emerges, prefer a follow-up `$team` or `$ralph` lane as appropriate
+- **Next Recommended Step:** Continue through autopilot's execution/QA/validation flow; if coordination-heavy execution emerges, prefer `$team` under a leader-owned `$ultragoal` ledger, using `$ralph` only as an explicit fallback when a narrow single-owner persistence loop is requested
-### 3. **`$ralph`**
+### 3. **`$ralph` (Explicit fallback only)**
 - **Input Artifact:** `.omx/specs/deep-interview-{slug}.md`
 - **Invocation:** `$ralph <spec-path>`
 - **Consumer Behavior:** Use the spec's acceptance criteria and boundary constraints as the persistence target. Do not reopen requirements discovery unless the user explicitly asks to refine further.
 - **Skipped / Already-Satisfied Stages:** Requirement interview, ambiguity clarification, and initial scope-definition work
 - **Expected Output:** Iterative execution progress and verification evidence tracked against the clarified criteria
-- **Best When:** The task benefits from persistent sequential completion pressure and the user wants execution to keep moving until the criteria are satisfied or a real blocker exists
-- **Next Recommended Step:** Continue Ralph's persistence loop; if work expands into coordination-heavy lanes, hand off to `$team` and keep Ralph for verification continuity
+- **Best When:** The user explicitly asks for Ralph's persistent sequential completion pressure; otherwise use `$ultragoal` for durable goal tracking and completion checkpoints
+- **Next Recommended Step:** If this explicit fallback is selected, continue Ralph's persistence loop; if work expands into coordination-heavy lanes, hand off to `$team` under `$ultragoal` checkpointing rather than promoting Ralph as the next default
 ### 4. **`$team`**
 - **Input Artifact:** `.omx/specs/deep-interview-{slug}.md`
 - **Invocation:** `$team <spec-path>`
 - **Consumer Behavior:** Treat the spec as shared execution context for coordinated parallel work. Preserve the clarified intent, non-goals, decision boundaries, and acceptance criteria as common lane constraints.
 - **Skipped / Already-Satisfied Stages:** Requirement clarification and early ambiguity reduction
-- **Expected Output:** Coordinated multi-agent execution against the shared spec, with evidence that can later feed a Ralph verification pass when appropriate
+- **Expected Output:** Coordinated multi-agent execution against the shared spec, with evidence that can later feed Ultragoal checkpoints by default, or an explicit Ralph verification pass only when requested
 - **Best When:** The task is large, multi-lane, or blocker-sensitive enough to justify coordinated parallel execution instead of a single persistent loop
-- **Next Recommended Step:** Follow the team verification path when the coordinated execution phase finishes; escalate to a separate Ralph loop only when a later persistent verification/fix owner is still needed
+- **Next Recommended Step:** Follow the team verification path when the coordinated execution phase finishes; checkpoint completion through `$ultragoal` by default, escalating to a separate Ralph loop only when the user explicitly asks for that persistent verification/fix owner
 ### 5. **Refine further**
 - **Input Artifact:** Existing transcript, context snapshot, and current spec draft

package/plugins/oh-my-codex/skills/omx-setup/SKILL.md CHANGED

@@ -60,7 +60,7 @@ Supported setup flags (current implementation):
   - `project`: local directories (`./.codex`, `./.codex/skills`, `./.omx/agents`)
 - User-scope skill delivery targets:
   - `legacy`: keep installing/updating OMX skills in the resolved user skill root
-  - `plugin`: rely on Codex plugin discovery for bundled skills and archive/remove legacy OMX-managed prompts/skills/native agents; setup still installs native Codex hooks and setup-owned runtime feature flags (`hooks = true` on current Codex, legacy `codex_hooks = true` when that is the only reported hook feature, plus `goals = true`) because plugins do not carry hooks or enable Codex goal mode by themselves.
+  - `plugin`: rely on Codex plugin discovery for bundled skills and plugin-scoped lifecycle hooks when Codex reports `plugin_hooks`; archive/remove legacy OMX-managed prompts/skills/native agents. Setup still enables setup-owned runtime feature flags (`plugin_hooks = true` and `goals = true` when supported, or legacy setup-managed `hooks`/`codex_hooks` fallback when plugin hooks are not reported).
 - Migration hint: in `user` scope, if historical `~/.agents/skills` still exists alongside `${CODEX_HOME:-~/.codex}/skills`, current setup prints a cleanup hint. **Why the paths differ**: `${CODEX_HOME:-~/.codex}/skills/` is the path current Codex CLI natively loads as its skill root; `~/.agents/skills/` was the skill root in an older Codex CLI release before `~/.codex` became the standard home directory. OMX writes only to the canonical `${CODEX_HOME:-~/.codex}/skills/` path. When both directories exist simultaneously, Codex discovers skills from both trees and may show duplicate entries in Enable/Disable Skills. Archive or remove `~/.agents/skills/` to resolve this.
 - If persisted scope is `project`, `omx` launch automatically uses `CODEX_HOME=./.codex` unless user explicitly overrides `CODEX_HOME`.
 - Plugin mode prompts separately for optional AGENTS.md defaults and optional `developer_instructions` defaults. If `developer_instructions` already exists, setup asks before overwriting it; non-interactive runs preserve it.
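
Reviewer sketch (not part of the package): the `plugin` delivery bullet in this hunk describes a fallback order for hook feature flags. The TypeScript below is one possible reading of it. `selectFeatureFlags` and the `reported` set are hypothetical names, how Codex actually reports its supported features is not shown in the diff, and the preference for `hooks` over `codex_hooks` is inferred from the removed 0.18.0 wording.

```typescript
// Hypothetical helper: chooses setup-owned feature flags from the set of
// feature names that Codex reports as supported.
function selectFeatureFlags(reported: ReadonlySet<string>): Record<string, boolean> {
  const flags: Record<string, boolean> = {};

  if (reported.has("plugin_hooks")) {
    // Plugin-scoped lifecycle hooks are available, so no legacy hook flag.
    flags["plugin_hooks"] = true;
  } else if (reported.has("hooks")) {
    // Fallback: setup-managed hooks on a Codex that reports `hooks`.
    flags["hooks"] = true;
  } else if (reported.has("codex_hooks")) {
    // Oldest fallback: only the legacy hook feature is reported.
    flags["codex_hooks"] = true;
  }

  // Goal mode is enabled independently of the hook mechanism, when supported.
  if (reported.has("goals")) {
    flags["goals"] = true;
  }
  return flags;
}
```

For example, under these assumptions a Codex that reports `plugin_hooks`, `hooks`, and `goals` would end up with only `plugin_hooks` and `goals` enabled.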

package/plugins/oh-my-codex/skills/pipeline/SKILL.md CHANGED

@@ -10,19 +10,21 @@ through a uniform `PipelineStage` interface, with state persistence and resume s
 ## Default Autopilot Pipeline
-The canonical OMX pipeline sequences:
+The default Autopilot pipeline sequences:
 ```
-RALPLAN (consensus planning) -> team-exec (Codex CLI workers) -> ralph-verify (architect verification)
+deep-interview -> ralplan -> ultragoal (+ team if needed) -> code-review -> ultraqa
 ```
+`$team` is conditional: use it only inside an active Ultragoal story when independent lanes or broad verification make coordinated parallel execution useful. Explicit legacy Ralph pipelines remain available through custom stages, but Ralph is not the advertised default Autopilot loop.
 ## Configuration
 Pipeline parameters are configurable per run:
 | Parameter | Default | Description |
 |-----------|---------|-------------|
-| `maxRalphIterations` | 10 | Ralph verification iteration ceiling |
+| `maxRalphIterations` | 10 | Quality-gate retry ceiling; legacy option name retained for compatibility |
 | `workerCount` | 2 | Number of Codex CLI team workers |
 | `agentType` | `executor` | Agent type for team workers |
@@ -43,9 +45,12 @@ return a `StageResult` with status, artifacts, and duration.
 ## Built-in Stages
-- **ralplan**: Consensus planning (planner + architect + critic). Skips only when both `prd-*.md` and `test-spec-*.md` planning artifacts already exist, and carries any `deep-interview-*.md` spec paths forward for traceability.
-- **team-exec**: Team execution via Codex CLI workers. Always the OMX execution backend.
-- **ralph-verify**: Ralph verification loop with configurable iteration count.
+- **deep-interview**: Requirements clarification and ambiguity gate.
+- **ralplan**: Consensus planning (planner + architect + critic). Skips only when both `prd-*.md` and `test-spec-*.md` planning artifacts already exist **and** durable consensus evidence records Architect approval followed by Critic approval. Plan/test-spec files alone are not consensus evidence. If either review is missing, blocked, out of order, or non-approving, the stage remains in ralplan or fails with an explicit blocker/max-iteration outcome instead of progressing to execution. Carries any `deep-interview-*.md` spec paths forward for traceability.
+- **ultragoal**: Durable goal-mode execution with `.omx/ultragoal` ledgers. Launch `$team` only from inside an Ultragoal story when parallel lanes are warranted.
+- **code-review**: Merge-readiness review gate.
+- **ultraqa**: Adversarial QA gate after a clean review; docs-only/trivially non-runtime changes may record an explicit skip reason.
+- **team-exec** and **ralph-verify**: Legacy/custom pipeline adapters retained for explicit non-default pipelines.
 ## State Management
@@ -62,16 +67,20 @@ The HUD renders pipeline phase automatically. Resume is supported from the last
 import {
   runPipeline,
   createAutopilotPipelineConfig,
+  createDeepInterviewStage,
   createRalplanStage,
-  createTeamExecStage,
-  createRalphVerifyStage,
+  createUltragoalStage,
+  createCodeReviewStage,
+  createUltraqaStage,
 } from './pipeline/index.js';
 const config = createAutopilotPipelineConfig('build feature X', {
   stages: [
+    createDeepInterviewStage(),
     createRalplanStage(),
-    createTeamExecStage({ workerCount: 3, agentType: 'executor' }),
-    createRalphVerifyStage({ maxIterations: 15 }),
+    createUltragoalStage(),
+    createCodeReviewStage(),
+    createUltraqaStage(),
   ],
 });
@@ -82,5 +91,7 @@ const result = await runPipeline(config);
 - **autopilot**: Autopilot can use pipeline as its execution engine (v0.8+)
 - **team**: Pipeline delegates execution to team mode (Codex CLI workers)
-- **ralph**: Pipeline delegates verification to ralph (configurable iterations)
-- **ralplan**: Pipeline's first stage runs RALPLAN consensus planning
+- **ultragoal**: Autopilot delegates durable execution to Ultragoal by default
+- **team**: Optional execution engine inside an Ultragoal story when parallel lanes are needed
+- **ralph**: Available only for explicit legacy/custom pipelines
+- **ralplan**: Pipeline planning runs RALPLAN consensus planning
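
Reviewer sketch (not part of the package): the new `ralplan` stage description says the stage may skip only when both planning artifacts exist and durable evidence records Architect approval followed by Critic approval. The TypeScript below illustrates that rule under stated assumptions. `ReviewRecord`, `hasArtifact`, `consensusReached`, and `canSkipRalplan` are hypothetical names, and representing the evidence as an array in recording order is an assumption; the diff does not show how the real stage stores or orders reviews.

```typescript
// Hypothetical review record; array position stands in for recording order.
interface ReviewRecord {
  role: "architect" | "critic";
  verdict: "approve" | "reject" | "blocked";
}

// True when some path's file name looks like `<prefix>-*.md`.
function hasArtifact(paths: string[], prefix: string): boolean {
  const pattern = new RegExp(`(^|/)${prefix}-[^/]*\\.md$`);
  return paths.some((p) => pattern.test(p));
}

// The latest Architect review must approve and must precede the latest
// Critic review, which must also approve.
function consensusReached(reviews: ReviewRecord[]): boolean {
  const roles = reviews.map((r) => r.role);
  const lastArchitect = roles.lastIndexOf("architect");
  const lastCritic = roles.lastIndexOf("critic");
  if (lastArchitect === -1 || lastCritic === -1) return false; // missing review
  if (lastArchitect > lastCritic) return false; // out of order
  return reviews[lastArchitect]?.verdict === "approve"
    && reviews[lastCritic]?.verdict === "approve";
}

// Plan and test-spec files alone are not enough; consensus is also required.
function canSkipRalplan(paths: string[], reviews: ReviewRecord[]): boolean {
  return hasArtifact(paths, "prd")
    && hasArtifact(paths, "test-spec")
    && consensusReached(reviews);
}
```

Using the latest review per role is a design choice in this sketch: it treats a later rejection as overriding an earlier approval, which seems closest to the "non-approving" blocker wording.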

package/plugins/oh-my-codex/skills/plan/SKILL.md CHANGED

@@ -94,11 +94,11 @@ Jumping into code without understanding requirements leads to rework, scope cree
    b. Deduplicate and categorize the suggestions
    c. Update the plan file in `.omx/plans/` with the accepted improvements (add missing details, refine steps, strengthen acceptance criteria, ADR updates, etc.)
    d. Note which improvements were applied in a brief changelog section at the end of the plan
-   e. Before any execution handoff, derive an explicit **available-agent-types roster** from the known prompt catalog and add concrete **follow-up staffing guidance** for both `$ralph` and `$team` (recommended roles, counts, suggested reasoning levels by lane, and why each lane exists)
-   f. Add a product-facing **Goal-Mode Follow-up Suggestions** section: recommend `$ultragoal` by default for general goal-oriented follow-up, `$autoresearch-goal` when the context is a research project, and `$performance-goal` when the context is an optimization or performance project. Keep these suggestions alongside the Ralph/team paths rather than replacing them when implementation delivery is still the main need. For durable-goal work that is also parallelizable, explicitly recommend **Team + Ultragoal**: Ultragoal remains leader-owned goal/ledger state and Team returns checkpoint-ready execution evidence.
-   g. For the `$team` path, add an explicit launch-hint block with concrete `omx team` / `$team` commands and a **team verification path** (what team proves before shutdown, what Ralph verifies after handoff). Distinguish Team + Ultragoal from a later Ralph follow-up: Team handles coordinated parallel lanes; Ralph is only for persistent sequential single-owner verification/fix pressure when needed.
+   e. Before any execution handoff, derive an explicit **available-agent-types roster** from the known prompt catalog and add concrete **follow-up staffing guidance** for `$ultragoal` and `$team` (recommended roles, counts, suggested reasoning levels by lane, and why each lane exists), plus an explicit `$ralph` fallback note only when persistent single-owner verification is intentionally selected
+   f. Add a product-facing **Goal-Mode Follow-up Suggestions** section: recommend `$ultragoal` by default for general goal-oriented follow-up, `$autoresearch-goal` only when the context is a research project with a research deliverable/evaluator, and `$performance-goal` when the context is an optimization or performance project. Keep these suggestions alongside the Team path and any explicit Ralph fallback rather than replacing implementation-delivery guidance. For ordinary pre-planning external docs or best-practice lookup, cite `$best-practice-research` evidence and synthesize it into the plan instead of recommending Autoresearch as a final architecture component. For durable-goal work that is also parallelizable, explicitly recommend **Team + Ultragoal**: Ultragoal remains leader-owned goal/ledger state and Team returns checkpoint-ready execution evidence.
+   g. For the `$team` path, add an explicit launch-hint block with concrete `omx team` / `$team` commands and a **team verification path** (what Team proves before shutdown and what Ultragoal checkpoints as durable completion evidence). Distinguish Team + Ultragoal from any explicit Ralph fallback: Team handles coordinated parallel lanes; Ultragoal is the default durable follow-up/ledger owner, and Ralph is only an explicitly requested legacy-style persistent sequential verification/fix lane when needed.
 7. On Critic approval (with improvements applied): *(--interactive only)* If running with `--interactive`, use `AskUserQuestion` / the structured question UI to present the plan with these options:
-   - **Approve and execute** — proceed to implementation via ralph+ultrawork
+   - **Approve durable goal execution** — proceed via `$ultragoal` by default (optionally with `$team` for parallel lanes)
    - **Approve and implement via team** — proceed to implementation via coordinated parallel team agents
    - **Start goal-mode follow-up** — proceed via `$ultragoal` by default, or `$autoresearch-goal` / `$performance-goal` when the approved plan specifically fits research validation or measurable optimization
    - **Request changes** — return to step 1 with user feedback
@@ -106,7 +106,7 @@ Jumping into code without understanding requirements leads to rework, scope cree
    If NOT running with `--interactive`, output the final approved plan and stop. Do NOT auto-execute.
 8. *(--interactive only)* User chooses via the structured question UI (never ask for approval in plain text when a structured surface is available)
 9. On user approval (--interactive only):
-   - **Approve and execute**: **MUST** invoke `$ralph` with the approved plan path from `.omx/plans/` as context **plus the explicit available-agent-types roster, suggested reasoning levels, concrete role allocation guidance, and direct launch hints for Ralph follow-up work**. Do NOT implement directly. Do NOT edit source code files in the planning agent. The ralph skill handles execution via ultrawork parallel agents.
+   - **Approve durable goal execution**: **MUST** invoke `$ultragoal` with the approved plan path from `.omx/plans/` as context **plus the explicit available-agent-types roster, suggested reasoning levels, concrete role allocation guidance, and direct launch hints for Ultragoal follow-up work**. Use `$team` alongside Ultragoal when parallel lanes are warranted. Do NOT implement directly. Do NOT edit source code files in the planning agent. Ralph is not the default follow-up; only invoke `$ralph` when the user explicitly selects a legacy/persistent single-owner execution lane.
    - **Approve and implement via team**: **MUST** invoke `$team` with the approved plan path from `.omx/plans/` as context **plus the explicit available-agent-types roster, suggested reasoning levels, concrete staffing / worker-role allocation guidance, explicit `omx team` / `$team` launch hints, and the team verification path**. Do NOT implement directly. The team skill coordinates parallel agents across the staged pipeline for faster execution on large tasks.
    - **Start goal-mode follow-up**: **MUST** invoke the selected goal workflow with the approved plan path and appropriate success context: `$ultragoal` as the default goal-mode path, `$autoresearch-goal` for research projects, or `$performance-goal` for optimization/performance projects with measurable evaluator criteria. Do NOT implement directly in the planning agent.
@@ -147,7 +147,7 @@ Plans are saved to `.omx/plans/`. Drafts go to `.omx/drafts/`.
 - **CRITICAL — Consensus mode agent calls MUST be sequential, never parallel.** Always await the Architect result before issuing the Critic call.
 - In consensus mode, default to RALPLAN-DR short mode; enable deliberate mode on `--deliberate` or explicit high-risk signals (auth/security, migrations, destructive changes, production incidents, compliance/PII, public API breakage)
 - In consensus mode with `--interactive`: use `AskUserQuestion` / the structured question UI for the user feedback step (step 2) and the final approval step (step 7) -- never ask for approval in plain text when a structured surface is available. Without `--interactive`, auto-proceed through planning steps without pausing. Output the final plan without execution.
-- In consensus mode with `--interactive`, on user approval **MUST** invoke the selected follow-up lane from step 9 (`$ralph`, `$team`, `$ultragoal`, `$autoresearch-goal`, or `$performance-goal`) -- never implement directly in the planning agent
+- In consensus mode with `--interactive`, on user approval **MUST** invoke the selected follow-up lane from step 9 (`$ultragoal`, `$team`, `$autoresearch-goal`, `$performance-goal`, or explicit `$ralph` fallback) -- never implement directly in the planning agent
 - In consensus mode, execution follow-up handoff **MUST** include an explicit available-agent-types roster plus concrete staffing / role-allocation guidance grounded in that roster, suggested reasoning levels by lane, product-facing goal-mode follow-up suggestions (`$ultragoal` by default, `$autoresearch-goal` for research projects, `$performance-goal` for optimization/performance projects), explicit `omx team` / `$team` launch hints, and a team verification path. For parallelizable durable-goal plans, recommend Team + Ultragoal with leader-owned checkpointing from Team evidence; reserve Ralph for persistent sequential single-owner verification/fix follow-up.
 </Tool_Usage>
@@ -212,8 +212,8 @@ Why bad: Decision fatigue. Present one option with trade-offs, get reaction, the
 <Escalation_And_Stop_Conditions>
 - Stop interviewing when requirements are clear enough to plan -- do not over-interview
 - In consensus mode, stop after 5 Planner/Architect/Critic iterations and present the best version
-- Consensus mode outputs the plan by default; with `--interactive`, user can approve and hand off to ralph/team
-- If the user says "just do it" or "skip planning", **MUST** invoke `$ralph` to transition to execution mode. Do NOT implement directly in the planning agent.
+- Consensus mode outputs the plan by default; with `--interactive`, user can approve and hand off to ultragoal/team, with Ralph only as an explicit legacy/persistent single-owner lane
+- If the user says "just do it" or "skip planning", **MUST** invoke `$ultragoal` to transition to durable goal execution mode by default; use `$ralph` only when the user explicitly asks for that fallback. Do NOT implement directly in the planning agent.
 - Escalate to the user when there are irreconcilable trade-offs that require a business decision
 </Escalation_And_Stop_Conditions>
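
The handoff rule in this hunk (Ultragoal by default, Ralph only on explicit request) can be sketched as a small selection function. This is an illustrative sketch only: `FollowUpLane` and `resolveFollowUpLane` are hypothetical names that do not appear in the package, and the skill text itself is prose guidance, not code.

```typescript
// Hypothetical lane names mirroring the skills mentioned in the skill text.
type FollowUpLane =
  | "$ultragoal"
  | "$team"
  | "$autoresearch-goal"
  | "$performance-goal"
  | "$ralph";

// Assumption: `explicitLane` is undefined when the user gave no explicit
// choice (for example, "just do it" or "skip planning").
function resolveFollowUpLane(explicitLane?: FollowUpLane): FollowUpLane {
  // Ralph (or any other lane) is returned only when explicitly requested;
  // otherwise the default is durable goal execution via Ultragoal.
  return explicitLane ?? "$ultragoal";
}
```

Under this sketch, `$ralph` can never be chosen implicitly, which matches the changed wording: it is a fallback the user must ask for.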

package/plugins/oh-my-codex/skills/prometheus-strict/README.md ADDED

@@ -0,0 +1,35 @@
+# Prometheus Strict
+`$prometheus-strict` is a clean-room OMX planning skill for rigorous interview-driven planning before execution.
+It is inspired by the high-level OMO Prometheus concept only. It does not copy OMO source text, prompts, runtime code, or workflow implementation.
+Credit: Inspired by OMO Prometheus (`code-yeongyu/oh-my-openagent`), reimplemented from concept under MIT.
+## Roles
+- **Metis** clarifies requirements, constraints, non-goals, and acceptance criteria.
+- **Momus** challenges assumptions, scope, handoff risks, and missing verification.
+- **Oracle** synthesizes the approved plan and recommends the OMX-native handoff.
+## OMX Handoff
+Prometheus Strict is planning-only by default. It should hand off to:
+1. `$ultragoal` for durable goal execution.
+2. `$team` only when the Oracle plan identifies independent parallel lanes.
+## Non-Goals
+- No hook implementation.
+- No Sisyphus or `start-work` port.
+- No direct implementation unless a downstream execution workflow is explicitly invoked.
+- No verbatim source copying from the inspiration project.
+## Expected Output
+The skill returns a Prometheus Strict Plan with clarified requirements, resolved critique, an Oracle execution plan, a verification matrix, an optional durable artifact path under `.omx/plans/prometheus-strict/`, and clean-room credit.
+## Durable Plan Artifacts
+When the plan should survive handoff or review, write the final Oracle synthesis to `.omx/plans/prometheus-strict/<slug>.md` and include that path in the plan before invoking `$ultragoal` or `$team`. Inline-only plans may set the artifact path to `N/A - inline plan only`.
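
The artifact-path rule described in the README can be illustrated with a short sketch. The directory and the `N/A - inline plan only` marker come from the README; the helper names (`slugifyPlanTitle`, `planArtifactPath`) and the slug derivation are assumptions for illustration, since the README does not say how `<slug>` is produced.

```typescript
// Values taken from the README text.
const PROMETHEUS_PLAN_DIR = ".omx/plans/prometheus-strict";
const INLINE_ONLY_MARKER = "N/A - inline plan only";

// Assumption: a slug is a lowercase, hyphen-separated form of the plan title.
// The README does not define the slug format; "plan" is a hypothetical fallback.
function slugifyPlanTitle(title: string): string {
  const slug = title
    .toLowerCase()
    .replace(/[^a-z0-9]+/g, "-") // collapse runs of non-alphanumerics into one hyphen
    .replace(/^-+|-+$/g, "");    // trim leading and trailing hyphens
  return slug || "plan";
}

// Returns the path to record in the plan before invoking $ultragoal or $team.
function planArtifactPath(title: string, durable: boolean): string {
  if (!durable) {
    return INLINE_ONLY_MARKER;
  }
  return `${PROMETHEUS_PLAN_DIR}/${slugifyPlanTitle(title)}.md`;
}
```

A caller would compute the path first, write the Oracle synthesis to it, and then include the same string in the plan body so the downstream workflow can locate the artifact.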