npm - oh-my-workflow - Versions diffs - 0.2.0 → 0.4.0 - Mend

oh-my-workflow 0.2.0 → 0.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (27) hide show

package/README.md +191 -106
package/conformance/budget-loop.ts +16 -0
package/conformance/fanout.ts +20 -0
package/conformance/pipeline.ts +21 -0
package/conformance/schema-gate.ts +21 -0
package/conformance/strict-throws.ts +13 -0
package/docs/launch/show-hn.md +41 -0
package/docs/site/index.html +540 -0
package/docs/site/robots.txt +2 -0
package/examples/deep-research/workflow.ts +11 -11
package/package.json +11 -3
package/scripts/build-docs.ts +10 -0
package/scripts/check-docs.ts +58 -0
package/skill/SKILL.md +247 -137
package/src/adapters/claude.ts +31 -5
package/src/adapters/codex.ts +5 -3
package/src/adapters/exec.ts +103 -0
package/src/adapters/fake.ts +4 -4
package/src/adapters/hermes.ts +24 -0
package/src/adapters/types.ts +33 -3
package/src/cli/codemod.ts +99 -0
package/src/cli/omw.ts +7 -2
package/src/cli/run.ts +222 -13
package/src/cli/skill.ts +32 -10
package/src/runtime.ts +171 -11
package/src/worktree.ts +72 -0
package/vercel.json +5 -0

package/README.md CHANGED Viewed

@@ -1,115 +1,184 @@
 # oh-my-workflow
-> Run the coding-agent CLIs you already have — `claude -p`, `codex exec` — as
-> nodes in a plain-JS workflow your host agent writes. omw is the thin glue: it
-> runs the script, schema-gates each node's output, and journals every step so the
-> agent can read its own failure and repair its own script. (What's
-> "deterministic" is scoped honestly below — the engine and `--agent fake`, not
-> your script.)
+> **Workflow mode for coding agents** — add `/omw`, describe a multi-step job,
+> and let your agent write and run the workflow. Under the hood it is the open,
+> portable twin of Claude Code's native dynamic Workflow: a plain-JS script whose
+> nodes are whole coding-agent CLIs (`claude -p`, `codex exec`). omw is the thin
+> glue: schema gates, bounded concurrency, repair, resume, and a JSONL journal
+> your authoring agent can read to fix its own script.
-## Try it now — free, no API key
+## Quickstart — add `/omw` to your coding agent
 ```sh
-git clone https://github.com/domuk-k/oh-my-workflow && cd oh-my-workflow
-bun install
-bun src/cli/omw.ts run examples/deep-research --agent fake
+npx skills add domuk-k/oh-my-workflow --skill omw
 ```
-```json
-{"confirmed":[{"topic":"a","hits":3,"verified":true},{"topic":"c","hits":5,"verified":true}],"summary":{"summary":"done","count":2}}
+Then ask from Claude Code, Codex, opencode, or any agent that reads skills:
+```text
+/omw write a workflow that reviews this repo, fans out three implementation plans,
+verifies them independently, and returns the best one with evidence.
 ```
-That's the whole spine in one pass — a `--pretty` tree shows it:
+That is the intended onboarding: install one skill, then ask your own coding
+agent to author and run the workflow. The skill teaches it to create a plain-JS
+workflow file, run `omw run`, read `.omw/<run>.jsonl`, and repair the script from
+structured failures.
+Today the same skill is bundled in the package:
 ```sh
-bun src/cli/omw.ts run examples/deep-research --agent fake --pretty
+bunx github:domuk-k/oh-my-workflow skill install          # Claude Code
+bunx github:domuk-k/oh-my-workflow skill install --codex  # Codex
+bunx github:domuk-k/oh-my-workflow skill install --opencode
 ```
+Prefer `npx skills add domuk-k/oh-my-workflow --skill omw` for the shared skills
+CLI; use `bunx github:domuk-k/oh-my-workflow skill install` when you want the
+bundled copy directly from the repo.
+## Try the runtime without a live agent
+```sh
+bunx github:domuk-k/oh-my-workflow run examples/deep-research --agent fake --pretty
 ```
-run r-… (examples/deep-research)
-  ▸ Scope
-    • call#1 [fake]
-      ✓ call#1
-  ▸ Search
-    • search:a [fake]
-    • search:b [fake]
-    • search:c [fake]
-      ✗ timeout call#3
-      ✓ call#4
-      ✓ call#2
-  ▸ Verify
-    • call#5 [fake]
-    • call#6 [fake]
-      ✓ call#5
-      ✓ call#6
-  ▸ Synthesize
-    • call#7 [fake]
-      ✓ call#7
-run_end ok=true · 6 ok / 1 failed
+`--agent fake` is a built-in deterministic adapter: no API key, no network, no
+cost. It exercises the same spine the skill asks your agent to use for real work:
+fan-out, pipeline, schema-fail to self-repair, timeout to `null`, and one final
+result JSON.
+## The twin framing
+Claude Code has a built-in **dynamic Workflow** tool: when a task is big enough,
+the model writes a deterministic JS orchestration script on the fly and the
+harness runs it — `agent()` to spawn subagents, `parallel`/`pipeline` to fan out,
+`budget` to bound the run, `workflow()` to nest. That surface is excellent and
+**closed** (it only runs inside Claude Code, and its nodes are in-harness
+subagents).
+**omw is the open twin of that surface.** Same authoring shape, same vocabulary —
+but the nodes are *whole external coding-agent CLIs* you already have, it runs
+from any host (Claude Code, Codex, opencode, a cron job), and the whole thing is
+a few hundred lines of standard TypeScript you can read.
+```ts
+// native dynamic Workflow — inside Claude Code
+export default async function ({ agent, parallel }) {
+  const found = await parallel(topics.map((t) => () => agent(`research ${t}`)));
+  return { found };
+}
 ```
-`search:a` (call#2) returns invalid JSON first and self-repairs to `✓`; `search:b`
-(call#3) times out and is dropped by `filter(Boolean)` — the run still ends green.
+```ts
+// omw — anywhere, same shape
+export default async function ({ agent, parallel }, args) {
+  const found = await parallel(topics.map((t) => () => agent(`research ${t}`)));
+  return { found: found.filter(Boolean) };
+}
+```
+The script is nearly the same; what differs is what a node *is* and where it runs.
-`--agent fake` is a built-in deterministic adapter — no API key, no network. A
-stranger runs the full fan-out + pipeline + a scripted schema-fail→self-repair +
-a scripted timeout→drop, and gets a stable result JSON. Swap `--agent claude`
-(after `claude login`) to run it for real.
+## Why this matters now
-> Once on npm this is `bunx oh-my-workflow run examples/deep-research --agent fake`
-> — the example ships inside the package and resolves from there, so it runs from
-> any directory. omw runs under **bun**; `npx` (Node) won't execute the TS bin.
+Coding agents have crossed from chat surfaces into everyday CLIs. Claude Code,
+Codex, opencode-style hosts, and smaller agent CLIs can all run meaningful work
+from a shell, but multi-agent jobs still need the same missing glue: fan-out,
+schema validation, retries, bounded concurrency, run journals, resume, and a
+single result for the caller.
+omw packages that glue without claiming to be a new agent framework. A node is a
+whole external coding-agent CLI. The workflow is plain JavaScript. The journal is
+JSONL you can inspect. That makes the pattern portable enough for a local script,
+a CI job, another coding agent, or a launch demo someone can try in 30 seconds
+with no key.
 ## What it is
-You write a plain-JS orchestration script. Its nodes are **whole coding agents**
-(`claude -p`, `codex exec`) — not single LLM calls. The runtime hands your script
-**five hooks** and nothing else:
+You write a plain-JS orchestration script. Its default export takes the **hooks**
+as a destructured first arg and your `args` as the second — exactly the native
+shape:
 ```ts
-export default async function (rt, args) {
-  rt.phase("Search");
-  const hits = (await rt.parallel(
-    args.queries.map((q) => () => rt.agent(`SEARCH: ${q}`, { schema: HIT, label: q })),
+export const meta = { name: "research", phases: [{ title: "Search" }] };
+export default async function ({ agent, parallel, phase, budget }, args) {
+  phase("Search");
+  const hits = (await parallel(
+    args.queries.map((q) => () => agent(`SEARCH: ${q}`, { schema: HIT, label: q })),
   )).filter(Boolean);                 // agent() returns null on failure, never throws
   return { hits, count: hits.length };
 }
 ```
-- `rt.agent(prompt, opts?)` — run one coding-agent CLI node. With a `schema`, omw
-  extracts JSON, validates it (ajv), and **re-prompts the node with the
-  validation errors** up to 2 times before giving up. Returns the validated
-  object, or `null`. **Never throws** — the load-bearing *null-contract*.
-- `rt.parallel(thunks)` — concurrent, barrier; failures become `null`.
-- `rt.pipeline(items, …stages)` — each item flows through all stages independently.
-- `rt.phase(title)` / `rt.log(msg)` — journal / `--pretty` side-channel only.
+The hooks — destructure only what you use:
+- **`agent(prompt, opts?)`** — run one coding-agent CLI node. With a `schema`, omw
+  extracts JSON, validates it (ajv), and **re-prompts the node with the validation
+  errors** up to 2 times before giving up. Returns the validated object, or
+  `null`. **Never throws** — the load-bearing *null-contract* (the one exception
+  is `budget` exhaustion; see below). `opts`: `schema`, `model`, `label`, `phase`,
+  `effort`, `agentType`, `isolation: 'worktree'`, `cwd`, `inheritMcp`, `timeoutMs`.
+- **`parallel(thunks)`** — concurrent, barrier; failures become `null`.
+- **`pipeline(items, …stages)`** — each item flows through all stages independently.
+- **`workflow(ref, args?)`** — run another workflow inline as a sub-step (one level
+  deep), sharing the adapter, journal, and budget pool.
+- **`budget`** — `{ total, spent(), remaining() }`. Set a ceiling with `--budget N`;
+  once spent reaches it, `agent()` throws `BudgetExceededError` (the documented
+  null-contract exception) so a bounded loop terminates instead of spinning.
+- **`phase(title)` / `log(msg)`** — journal / `--pretty` side-channel only.
+`export const meta` is optional — `{ name, description, whenToUse, model, phases }`,
+matching native. A per-phase or default `model` resolves along
+`opts.model > phase model > meta.model`.
 Concurrency is bounded at the `agent()` boundary (default 4, `--concurrency N`).
-Every step is recorded to the journal file `.omw/<runId>.jsonl`, so when a node
-fails you read the `kind` (`timeout` / `nonzero_exit` / `schema_violation` / …)
-and fix your script. stdout is one result JSON; the `--pretty` tree and a
-`journal: <path>` pointer go to stderr.
+Every step is recorded to `.omw/<runId>.jsonl`, so when a node fails you read the
+`kind` (`timeout` / `nonzero_exit` / `refusal` / `schema_violation` / …) and fix
+your script. stdout is one result JSON; the `--pretty` tree and a `journal: <path>`
+pointer go to stderr.
 **The full agent-facing guide is [`skill/SKILL.md`](skill/SKILL.md)** — patterns
 (fan-out / verify-vote / pipeline / loop-until-dry), the debug loop, and the
 conventions. That skill is the primary product; this README is the human intro.
+## No magic (the differentiator)
+The native tool can lean on the harness; an open twin has to earn portability.
+omw's deliberate choice: **be boring standard JS.**
+- **No source transform.** Your script is run as-is (`await import`), not rewritten.
+  What you write is what executes.
+- **No ambient globals.** Hooks arrive as a destructured argument, not injected
+  into scope. No hidden `agent` global to explain.
+- **No sandbox by default.** Determinism is a *convention you keep*; opt into
+  enforcement with `--strict` (freezes `Date`/`Math.random` to throw) when you
+  want a reproducible run.
+That's why the same script reads cleanly in a README, a repo, or another agent's
+context — there's nothing non-standard to carry along.
 ## Install the skill (the primary product)
 omw's primary product is an **agent-authoring skill** (`skill/SKILL.md`) — it
-teaches a coding agent to write, run, and repair omw workflows. After the package
-is installed, wire the skill into your agent in one step:
+teaches a coding agent to write, run, and repair omw workflows. The short path is:
 ```sh
-omw skill install            # → ~/.claude/skills/oh-my-workflow  (Claude Code auto-discovers it)
-omw skill install --project  # → ./.claude/skills/oh-my-workflow  (this repo only)
-omw skill path               # print the bundled SKILL.md path (cat / pipe / point an agent at it)
+npx skills add domuk-k/oh-my-workflow --skill omw
 ```
-Then ask your coding agent: *"use oh-my-workflow to &lt;task&gt;"* — it authors a
-`workflow.ts` and runs it with `omw run`. (The skill is Claude-Code-flavored;
-for other hosts use `omw skill path` and feed the file in however that host loads
-context.)
+For hosts that do not yet read that registry, install the bundled copy:
+```sh
+bunx github:domuk-k/oh-my-workflow skill install              # → ~/.claude/skills/omw
+bunx github:domuk-k/oh-my-workflow skill install --codex      # → ~/.codex/skills/omw
+bunx github:domuk-k/oh-my-workflow skill install --opencode   # → ~/.config/opencode/skills/omw
+bunx github:domuk-k/oh-my-workflow skill install --project    # → ./.claude/skills/omw
+```
+Then ask your coding agent with `/omw <task>`. It authors a workflow file, runs it
+with `omw run`, and uses the JSONL journal to repair failures.
 ## Adapters
@@ -117,17 +186,32 @@ A node is a coding agent driven through its headless prompt→result CLI.
 | adapter | status | notes |
 |---|---|---|
+| **auto** | default | honors `OMW_AGENT`, then host hints, then installed CLIs in order: `claude`, `codex`, `hermes` |
 | **fake** | built-in, free, deterministic | the no-key demo engine and test double |
-| **claude** | **full** (live-verified, 2.1.177) | `claude -p --output-format json`; `--resume` powers in-session schema self-repair |
-| **codex** | **experimental** (live-verified, 0.137.0) | `codex exec --json`; **no cost field**; tolerates malformed JSONL ([openai/codex#15451](https://github.com/openai/codex/issues/15451)) and fails *actionably* |
+| **claude** | **full** (live-verified, 2.1.x) | `claude -p --output-format json --strict-mcp-config` (nodes isolated from host MCP by default; opt in per call with `inheritMcp`); `--resume` (same cwd) powers in-session schema self-repair. `effort`/`agentType` have no faithful CLI flag yet → dropped with a one-time warn (honest-scope) |
+| **codex** | **experimental** (live-verified, 0.137.x) | `codex exec --json`; **no cost field**; tolerates malformed JSONL ([openai/codex#15451](https://github.com/openai/codex/issues/15451)) and fails *actionably* |
+| **hermes** | **experimental** | `hermes -z <prompt> --yolo` (one-shot, prints only the response); no in-session followUp (schema retries go fresh); no cost field |
 | **pi** | planned | not wired yet (`--agent pi` → exit 3 + install hint) |
-| **kiro** | not a fit | its CLI is an IDE launcher (open files/diffs), no headless prompt→result interface |
 A missing CLI exits `3` with an `install_hint` instead of failing mid-run. A node
 that hits `internal_error` (e.g. an invalid JSON Schema) escalates the run to exit
 `4` (result still on stdout) so an author bug doesn't hide behind the null-contract.
 `omw validate <wf>` is a pre-flight load + fake-fixture lint that spawns no agents.
+## Migrating from 0.3 (`(rt, args)` → destructured DI)
+0.3 scripts wrote `export default async function (rt, args) { rt.agent(…) }`. That
+**still works** in 0.4 — the same object is passed as the first arg, so a legacy
+`rt` script and a new `({ agent })` script both run. You'll get a one-time
+deprecation notice on legacy scripts; the bridge is removed in 0.5.
+Migrate mechanically:
+```sh
+omw codemod path/to/workflow.ts            # prints the destructured-DI version
+omw codemod path/to/workflow.ts --write    # rewrites in place
+```
 ## Honest scope (read before you judge the novelty)
 omw externalizes a pattern Claude Code uses internally for dynamic workflows
@@ -137,59 +221,60 @@ omw externalizes a pattern Claude Code uses internally for dynamic workflows
 - **"deterministic"** means: the engine's guarantees (stable resume keys, JSONL
   recording, schema-gate) **and** the `--agent fake` demo. Your *script's*
-  determinism is a **convention you keep** — there is **no sandbox**, so omw
-  can't stop a workflow from calling `Date.now()`.
+  determinism is a **convention you keep** unless you pass `--strict` (opt-in
+  freeze-throw on `Date`/`Math.random`).
 - **resume**: the journal format and resume key `(callIndex, promptHash,
-  optsHash)` (journaled as `call`) are **frozen and proven byte-stable** (identical re-run = 100% key
-  hits; edit the last node = hits up to the first change, then a miss). **Live
-  resume has landed**: `omw run <wf> --resume <journal>` reuses any node whose
-  `(callIndex, promptHash, optsHash)` key hits (adapter not invoked,
-  `agent_end{cached:true}`) and re-runs the rest — verified end-to-end on
-  `--agent fake`. Resume is **per-node key match, not dependency-aware**: it
-  behaves as longest-unchanged-prefix only when upstream outputs flow into
-  downstream prompts (the usual data-flow shape). When nodes instead pass state
-  through the **filesystem** (the normal coding-agent idiom — node 1 writes files
-  node 2 reads), an upstream edit re-runs node 1 but a cached node 2 serves a
-  **stale** result; re-run fresh, or thread a file digest into the downstream
-  prompt. Keeping per-node preserves parallel/pipeline sibling cache; a
-  `--strict-resume` prefix-truncation opt-in and dependency-aware cascade are v2.
-  It holds **only for deterministic workflows**: omw can't *enforce* determinism
-  (no sandbox), so that stays a convention you keep (enforcement is v2). `omw replay` remains a
-  read-only **fixture replay** (reconstructing a recorded run's view), a separate
-  command — not the resume path.
+  optsHash)` are **frozen and byte-stable** (identical re-run = 100% key hits;
+  edit the last node = hits up to the first change, then a miss). `omw run <wf>
+  --resume <journal|runId>` reuses any node whose key hits (adapter not invoked,
+  `agent_end{cached:true}`) and re-runs the rest. The key is **semantic** —
+  cosmetic `label`/`phase` changes don't bust the cache; `model`/`schema`/`effort`/
+  `isolation` do. Resume is **per-node key match, not dependency-aware**: when
+  nodes pass state through the **filesystem** (node 1 writes, node 2 reads), an
+  upstream edit re-runs node 1 but a cached node 2 can serve a **stale** result —
+  re-run fresh, or thread a file digest into the downstream prompt. A
+  dependency-aware cascade is v2.
+- **`budget`** bounds *output-token spend the adapter reports*. A token-less
+  failure (a killed timeout reports no `usage`) can't be counted, so a loop on a
+  purely-timing-out node isn't bounded by `--budget` alone — pair it with your own
+  iteration cap.
 - an omw node is a **whole external coding-agent CLI**, heavier than a single
-  in-harness subagent.
-- **not in v1** (the CC dynamic-workflow surface has these; omw doesn't yet):
-  `budget`, nested `workflow()`, a `meta`/`phases` block, custom `agentType`,
-  `run_in_background`, worktree isolation.
+  in-harness subagent. JSON is extracted heuristically; cross-node state often
+  flows through the **filesystem** (a real side-channel, not modeled by resume).
 The one genuinely novel piece of code is the **schema-gate self-repair loop** —
 the part a "subprocess + for-loop" comparison misses. Everything else is honest
-glue. The fuller positioning (4-way prior-art table, resemblance ledger) lives in
-[`skill/SKILL.md`](skill/SKILL.md) and the
-[launch strategy](https://github.com/domuk-k/oh-my-workflow/blob/main/docs/specs/2026-06-14-omw-launch-strategy.md).
+glue.
 ## Develop
 ```sh
 bun install
-bun test            # 136 pass / 2 skip (live adapters, OMW_LIVE=1) / 0 fail
-bun test --coverage # ~99% lines on the pure core
+bun test            # green (live adapters run only under OMW_LIVE=1) / 0 fail
 bun run typecheck   # tsc --noEmit, clean
 ```
-`test/spine.test.ts` is the gate: one full `scope → search → verify → synthesize`
-pass against the fake adapter, including the scripted schema-fail → self-repair →
-`filter(Boolean)` survival cycle. Live adapter tests run only under `OMW_LIVE=1`
-(they spend real tokens) and are skipped by default.
+The conformance suite (`conformance/*.ts` + `test/conformance.test.ts`) is the
+drop-in proof: native-shaped, destructured-DI scripts — fan-out, pipeline,
+schema-gate, budget-loop, `--strict` — all run green under `--agent fake`. Live
+adapter tests run only under `OMW_LIVE=1` (they spend real tokens).
 ## Docs
+- **Official docs site source**: [`docs/site/index.html`](docs/site/index.html)
+- Launch note: [`docs/launch/show-hn.md`](docs/launch/show-hn.md)
 - **Skill (primary product)**: [`skill/SKILL.md`](skill/SKILL.md)
+- Open-twin design: [`docs/specs/2026-06-23-omw-open-dynamic-workflow-twin-design.md`](https://github.com/domuk-k/oh-my-workflow/blob/main/docs/specs/2026-06-23-omw-open-dynamic-workflow-twin-design.md)
 - Product spec: [`docs/specs/2026-06-12-oh-my-workflow-design.md`](https://github.com/domuk-k/oh-my-workflow/blob/main/docs/specs/2026-06-12-oh-my-workflow-design.md)
-- Launch strategy + scorecard: [`docs/specs/2026-06-14-omw-launch-strategy.md`](https://github.com/domuk-k/oh-my-workflow/blob/main/docs/specs/2026-06-14-omw-launch-strategy.md)
 - Resume / determinism internals: [`docs/specs/2026-06-15-resume-internals-deepdive.md`](https://github.com/domuk-k/oh-my-workflow/blob/main/docs/specs/2026-06-15-resume-internals-deepdive.md)
+Deploy the docs with Vercel:
+```sh
+bun run docs:build
+bunx vercel deploy --prod
+```
 ## License
 MIT

package/conformance/budget-loop.ts ADDED Viewed

@@ -0,0 +1,16 @@
+// Conformance: a budget-bounded loop terminates when the ceiling is hit. Run
+// with `--budget 100`; each node spends 40, so agent() throws BudgetExceededError
+// on the third call and the run halts (exit 1, "budget exhausted").
+export const meta = { name: "budget-loop" };
+export default async function ({ agent }, _args) {
+  const done = [];
+  // No budget.remaining() guard on purpose: the ceiling itself must halt the
+  // loop (agent() throws BudgetExceededError once spent >= total).
+  while (true) {
+    done.push(await agent(`step ${done.length}`));
+  }
+}
+export const fake = { default: { text: "x", outputTokens: 40 } };

package/conformance/fanout.ts ADDED Viewed

@@ -0,0 +1,20 @@
+// Conformance: a fan-out over parallel(), survivors through filter(Boolean).
+// Native-shaped destructured-DI authoring; runs green under `--agent fake`.
+export const meta = { name: "fanout", phases: [{ title: "Search" }] };
+export default async function ({ agent, parallel, phase }, _args) {
+  phase("Search");
+  const found = await parallel(
+    ["a", "b", "c"].map((t) => () => agent(`SEARCH ${t}`, { label: `search:${t}` })),
+  );
+  return { found: found.filter(Boolean) };
+}
+export const fake = {
+  rules: [
+    { match: (p) => p.includes("SEARCH a"), responses: [{ text: "A" }] },
+    { match: (p) => p.includes("SEARCH b"), responses: [{ fail: "timeout" }] },
+    { match: (p) => p.includes("SEARCH c"), responses: [{ text: "C" }] },
+  ],
+};

package/conformance/pipeline.ts ADDED Viewed

@@ -0,0 +1,21 @@
+// Conformance: a two-stage pipeline() — each item flows through both stages
+// independently. Destructured-DI; runs green under `--agent fake`.
+export const meta = { name: "pipeline", phases: [{ title: "Map" }] };
+export default async function ({ agent, pipeline, phase }, _args) {
+  phase("Map");
+  const out = await pipeline(
+    ["x", "y"],
+    (item) => agent(`STAGE1 ${item}`),
+    (prev) => agent(`STAGE2 ${prev}`),
+  );
+  return { out };
+}
+export const fake = {
+  rules: [
+    { match: (p) => p.startsWith("STAGE1"), responses: [{ text: "s1" }] },
+    { match: (p) => p.startsWith("STAGE2"), responses: [{ text: "s2" }] },
+  ],
+};

package/conformance/schema-gate.ts ADDED Viewed

@@ -0,0 +1,21 @@
+// Conformance: the schema gate repairs a first non-conforming response, then
+// returns validated JSON. Destructured-DI; runs green under `--agent fake`.
+export const meta = { name: "schema-gate" };
+const numSchema = { type: "object", properties: { n: { type: "number" } }, required: ["n"], additionalProperties: false };
+export default async function ({ agent }, _args) {
+  const got = await agent("COMPUTE", { schema: numSchema });
+  return { got };
+}
+export const fake = {
+  rules: [
+    {
+      match: (p) => p.includes("COMPUTE"),
+      // First response is invalid (string n), gate feeds the error back, second is valid.
+      responses: [{ text: '{"n":"not-a-number"}' }, { text: '{"n":7}' }],
+    },
+  ],
+};

package/conformance/strict-throws.ts ADDED Viewed

@@ -0,0 +1,13 @@
+// Conformance: a script that reaches for wall-clock time fails under `--strict`.
+// Run with strict:true → exit 1 (script_error mentioning strict). The same script
+// without --strict completes. Proves the determinism sandbox is enforced.
+export const meta = { name: "strict-throws" };
+export default async function ({ agent }, _args) {
+  const t = Date.now(); // forbidden under --strict
+  const x = await agent("go");
+  return { t, x };
+}
+export const fake = { default: { text: "ok" } };

package/docs/launch/show-hn.md ADDED Viewed

@@ -0,0 +1,41 @@
+# Show HN / GeekNews launch note
+Title:
+> Show HN: oh-my-workflow - add /omw workflow mode to your coding agent
+Short post:
+I built `oh-my-workflow`, a tiny Bun/TypeScript runtime plus agent skill for giving your coding agent a workflow mode. You add `/omw`, describe a multi-step job, and the agent writes and runs a plain-JS workflow whose nodes are whole coding-agent CLIs like `claude -p`, `codex exec`, and `hermes`.
+The shape mirrors the dynamic workflow vocabulary people already reach for inside coding agents: `agent`, `parallel`, `pipeline`, `workflow`, `budget`, `phase`, and `log`. The difference is that omw runs outside any one host. It gives you a JSONL journal, bounded concurrency, schema-gated node output, automatic schema repair, run-level resume, and a deterministic fake adapter so anyone can try the full flow without an API key.
+Try it from your coding agent:
+```sh
+npx skills add domuk-k/oh-my-workflow --skill omw
+```
+Then ask:
+```text
+/omw review this repo, fan out three bug-finding passes, verify the strongest claims, and return fixes with evidence.
+```
+The CLI defaults to `--agent auto`, so the generated workflow can run without making you choose Claude vs Codex in the onboarding path. For a no-key runtime demo, use `bunx github:domuk-k/oh-my-workflow run examples/deep-research --agent fake --pretty`.
+Why now:
+- Coding agents have become real CLIs, not just chat UIs.
+- Multi-agent work keeps reimplementing the same glue: fan-out, verify, retry, resume, budget.
+- Closed host-native workflow tools are useful, but the pattern should be portable across Claude Code, Codex, opencode, cron jobs, and CI.
+What is deliberately small:
+- No DSL.
+- No source transform.
+- No ambient globals.
+- No sandbox by default.
+- A node is a whole coding-agent CLI, not a raw LLM call.
+Repo: https://github.com/domuk-k/oh-my-workflow