@a5c-ai/babysitter-opencode 5.0.1-staging.9e5052f8bc95 → 5.0.1-staging.a2865ee1a2da

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -29,7 +29,7 @@ function writeFileIfChanged(filePath, contents) {
29
29
  try {
30
30
  const existing = fs.readFileSync(filePath, 'utf8');
31
31
  if (existing === contents) return false;
32
- } catch {}
32
+ } catch (e) { process.stderr.write('[extension-mux] file read failed for ' + filePath + ', overwriting: ' + (e instanceof Error ? e.message : String(e)) + '\n'); }
33
33
  fs.mkdirSync(path.dirname(filePath), { recursive: true });
34
34
  fs.writeFileSync(filePath, contents);
35
35
  return true;
@@ -82,7 +82,7 @@ function writeJson(filePath, value) {
82
82
  function ensureExecutable(filePath) {
83
83
  try {
84
84
  fs.chmodSync(filePath, 0o755);
85
- } catch {}
85
+ } catch (e) { process.stderr.write('[extension-mux] chmod failed for ' + filePath + ': ' + (e instanceof Error ? e.message : String(e)) + '\n'); }
86
86
  }
87
87
 
88
88
  function normalizeMarketplaceSourcePath(source, marketplacePath) {
@@ -104,7 +104,7 @@ function ensureMarketplaceEntry(marketplacePath, pluginRoot) {
104
104
  name: PLUGIN_NAME,
105
105
  source: relSource,
106
106
  description: "Orchestrate complex, multi-step workflows with event-sourced state management, hook-based extensibility, and human-in-the-loop approval",
107
- version: "5.0.1-staging.9e5052f8bc95",
107
+ version: "5.0.1-staging.a2865ee1a2da",
108
108
  author: { name: "a5c.ai" },
109
109
  };
110
110
  if (idx >= 0) marketplace.plugins[idx] = entry;
@@ -147,7 +147,7 @@ function resolveCliCommand(packageRoot) {
147
147
  const versionsPath = path.join(packageRoot, 'versions.json');
148
148
  const versions = readJson(versionsPath) || {};
149
149
  const ver = versions.sdkVersion || 'latest';
150
- return `npx -y @a5c-ai/babysitter-sdk@${ver}`;
150
+ return `npm exec --yes --package @a5c-ai/babysitter-sdk@${ver} -- babysitter`;
151
151
  }
152
152
 
153
153
  function runCli(packageRoot, cliArgs, options = {}) {
@@ -0,0 +1,68 @@
1
+ ---
2
+ description: Pre-deploy gate that scans built JS chunks for forbidden substring markers (saga-era / obsolete code paths) listed in a project-local forbidden-markers.txt
3
+ argument-hint: "[--markers-file <path>] [--chunks-dir <path>] [--json] Optional overrides; defaults are project-relative."
4
+ allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
5
+ ---
6
+
7
+ Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). Compose the gate from the shared helper at `library/processes/shared/forbidden-markers-scanner.js` (issue #477).
8
+
9
+ ## What this gate does
10
+
11
+ Reads a list of literal substring markers from `scripts/forbidden-markers.txt` (blank lines and `#`-prefixed comments stripped) and greps every `.js` chunk under `.vercel/output/static/_next/static/chunks/` (Next.js / Vercel default; configurable) for any occurrence. Reports structured hits per `(marker, chunk)` pair with occurrence counts. Designed to chain between `vercel build --prod` and `vercel deploy --prod`.
12
+
13
+ Use this gate when a refactor or restart-from-baseline replaced load-bearing code paths and you need a structural guarantee the obsolete symbols never re-ship. Burned-in evidence: cookbook VI-9 / VI-12 near-miss revivals during the 2026-05 iOS-Safari saga; the prototype lives at `cookbook/scripts/check-no-forbidden.mjs` and shipped two upstream contributions before being generalized as this gate.
14
+
15
+ ## When to use
16
+
17
+ - **Pre-deploy.** Insert after build, before deploy. Block the deploy when `ok: false`.
18
+ - **Post-restart.** After a baseline rollback + step-by-step re-add, snapshot the saga-era markers in `forbidden-markers.txt` and let CI hold the line.
19
+ - **Post-refactor.** When old helper / handler / module names must not coexist with the new ones in the same bundle.
20
+
21
+ ## Expected config locations
22
+
23
+ - `scripts/forbidden-markers.txt` — one marker per line, `#` for comments. The list is the contract; the gate is mechanical. Commit this file to source control.
24
+ - `.vercel/output/static/_next/static/chunks/` — default scan target. Override for non-Vercel frameworks via the `--chunks-dir` flag or the `chunksDir` task input.
25
+
26
+ A missing markers file is a no-op (`ok: true`, `reason: 'missing-markers-file'`) — misconfiguration is never a deploy block. A missing chunks directory is likewise a no-op (`reason: 'missing-chunks-dir'`) so the gate is safe to chain into `check:all` before the build runs.
27
+
28
+ ## Exit semantics
29
+
30
+ | Reason | `ok` | Deploy decision |
31
+ |-------------------------|--------|--------------------------------|
32
+ | `missing-markers-file` | true | Pass (no gate active) |
33
+ | `missing-chunks-dir` | true | Pass (run before build) |
34
+ | `empty-markers` | true | Pass (list is empty) |
35
+ | `no-chunks` | true | Pass (nothing to scan) |
36
+ | `clean` | true | Pass — proceed to deploy |
37
+ | `hits` | false | **BLOCK** — surface hits, ask for triage |
38
+
39
+ For each hit, the gate emits `{ marker, chunk, count }` so the operator sees the exact marker string, the absolute chunk path, and the number of occurrences in that chunk. Multiple hits across chunks for the same marker are reported separately.
40
+
41
+ ## Programmatic surface
42
+
43
+ ```js
44
+ import { scanForbiddenMarkers, checkForbiddenMarkersTask } from '@a5c-ai/babysitter-library/processes/shared';
45
+
46
+ // Direct call:
47
+ const result = await scanForbiddenMarkers({
48
+ markersFile: 'scripts/forbidden-markers.txt',
49
+ chunksDir: '.vercel/output/static/_next/static/chunks',
50
+ });
51
+ if (!result.ok) {
52
+ // result.hits: Array<{ marker, chunk, count }>
53
+ // result.reason === 'hits'
54
+ process.exit(1);
55
+ }
56
+
57
+ // Or dispatched as a babysitter task:
58
+ const gate = await ctx.task(checkForbiddenMarkersTask, {
59
+ projectDir: '.',
60
+ // markersFile / chunksDir are inferred from projectDir if omitted
61
+ });
62
+ ```
63
+
64
+ ## Reference
65
+
66
+ - Issue: https://github.com/a5c-ai/babysitter/issues/477
67
+ - Helper module: `library/processes/shared/forbidden-markers-scanner.js`
68
+ - Origin (cookbook prototype): `cookbook/scripts/check-no-forbidden.mjs` (81 lines)
@@ -6,7 +6,13 @@ allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSea
6
6
 
7
7
  Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
8
8
 
9
- Create and run a cleanup process using the process at `skills\babysit\process\cradle\cleanup-runs.js/processes/cleanup-runs.js`.
9
+ Resolve the active process library with:
10
+
11
+ ```bash
12
+ babysitter process-library:active --json
13
+ ```
14
+
15
+ Read `binding.dir` from that JSON and create/run the cleanup process from `cradle/cleanup-runs.js#process` relative to that active library root. Do not use plugin-cache-relative cradle paths.
10
16
 
11
17
  Implementation notes (for the process):
12
18
  - Parse arguments for `--dry-run` flag (if present, set dryRun: true in inputs) and `--keep-days N` (default: 7)
package/commands/help.md CHANGED
@@ -233,7 +233,8 @@ SECONDARY COMMANDS
233
233
  How it works: Runs npx @a5c-ai/babysitter-observer-dashboard@latest which watches
234
234
  the .a5c/runs/ directory (or a parent directory containing multiple projects) and
235
235
  serves a live dashboard. The process is blocking -- it runs until you stop it, and
236
- it prints the local URL to share with the user.
236
+ it prints the local URL to share with the user. Do not use `babysitter observe`
237
+ as a fallback; the core Babysitter CLI does not expose that subcommand.
237
238
 
238
239
  Example: /babysitter:observe
239
240
  (opens browser showing all runs with live-updating task
@@ -7,6 +7,11 @@ allowed-tools: Read, Grep, Write, Task, Bash
7
7
  Run the babysitter observer dashboard:
8
8
 
9
9
  1. Determine the watch directory — this is usually the project's container directory (the parent of the project dir), or the current working directory if not specified.
10
- 2. Launch the dashboard: `npx -y @a5c-ai/babysitter-observer-dashboard@latest --watch-dir <dir>`
10
+ 2. Launch the standalone dashboard package: `npx -y @a5c-ai/babysitter-observer-dashboard@latest --watch-dir <dir>`.
11
11
  3. This is a blocking process — it will keep running until stopped.
12
12
  4. Report the URL printed by the dashboard to the user, then open it in the browser.
13
+
14
+ Do not fall back to `babysitter observe`; the core Babysitter CLI does not expose
15
+ that subcommand. Some harness runtimes may provide a separate
16
+ `agent-platform observe` surface, but this skill uses the verified standalone
17
+ dashboard package.
package/commands/plan.md CHANGED
@@ -4,4 +4,14 @@ argument-hint: Specific instructions for the run.
4
4
  allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
5
5
  ---
6
6
 
7
- Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). focus on creating the best process possible, but without creating and running the actual run.
7
+ Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). Focus on creating the best process possible, but without creating and running the actual run.
8
+
9
+ Before drafting the process, run Phase 0 -- REUSE-AUDIT: extract keyword nouns and verbs from the request, scan for matching existing migrations, API routes, environment variables, SDK dependencies, and imports, honor `.a5c/reuse-audit.json` when present, and put a `Reuse-audit findings (REVIEW BEFORE PROCEEDING)` block before Phase 1 of the plan.
10
+
11
+ ## Process Shape Selection
12
+
13
+ Choose the process shape before authoring `process.js`:
14
+
15
+ - Use a flat phase list when the spec is well-defined, the work is wiring or composition, the bug class is already known if this is a fix, and execution should proceed sequentially through clear phases.
16
+ - Use a HYPOTHESES tree when the bug class is unknown, forensics are required, multiple causal models compete, and each hypothesis needs its own observations, falsifying observations, and follow-up phases.
17
+ - Rule of thumb: if the first phase is "investigate", use HYPOTHESES-tree mode. If the first phase is "implement X", use flat-phase-list mode.
package/commands/yolo.md CHANGED
@@ -4,7 +4,7 @@ argument-hint: Specific instructions for the run.
4
4
  allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
5
5
  ---
6
6
 
7
- Start the Babysitter run directly through the CLI, without any user interaction or breakpoints. Do not invoke the Skill tool and do not run an instructions-only command. In Claude Code, use Bash to run `babysitter harness:yolo --harness claude-code --workspace "$PWD" --prompt "<user arguments>" --json`; in Codex, run `babysitter harness:yolo --harness codex --workspace "$PWD" --prompt "<user arguments>" --json`; in other harnesses, use the same command with that harness id. Replace `<user arguments>` with the arguments shown below, wait for the command to finish, and treat the CLI completion proof as the result.
7
+ Run the Babysitter orchestration instructions directly through the CLI, without any user interaction or breakpoints. In Claude Code, use Bash to run `babysitter instructions:babysit-skill --harness claude-code --no-interactive`; in Codex, run `babysitter instructions:babysit-skill --harness codex --no-interactive`; in other harnesses, use the same command with that harness id. Then follow the returned instructions in this same turn until completion proof is produced. Do not stop after reading the instructions, do not invoke the Skill tool first, and use the non-interactive/no-breakpoints path when the instructions offer a mode choice.
8
8
 
9
9
  User arguments for this command:
10
10
 
@@ -6,7 +6,7 @@ var readFileSync = require("fs").readFileSync;
6
6
 
7
7
  var PLUGIN_ROOT = process.env.PLUGIN_ROOT || process.env.PLUGIN_ROOT || path.resolve(__dirname, "..");
8
8
  var stdin = "";
9
- try { stdin = readFileSync(0, "utf8"); } catch {}
9
+ try { stdin = readFileSync(0, "utf8"); } catch (e) { process.stderr.write("[extension-mux] stdin read failed: " + (e instanceof Error ? e.message : String(e)) + "\n"); }
10
10
  try {
11
11
  var result = execSync("bash " + JSON.stringify(path.join(PLUGIN_ROOT, "hooks/session-start.sh")), {
12
12
  input: stdin,
@@ -20,5 +20,7 @@ try {
20
20
  });
21
21
  process.stdout.write(result);
22
22
  } catch (e) {
23
- process.stdout.write("{}\n");
23
+ process.stderr.write("[extension-mux] hook execution failed: " + (e instanceof Error ? e.message : String(e)) + "\n");
24
+ process.stdout.write(JSON.stringify({ error: e instanceof Error ? e.message : String(e) }) + "\n");
25
+ process.exit(1);
24
26
  }
@@ -6,7 +6,7 @@ var readFileSync = require("fs").readFileSync;
6
6
 
7
7
  var PLUGIN_ROOT = process.env.PLUGIN_ROOT || process.env.PLUGIN_ROOT || path.resolve(__dirname, "..");
8
8
  var stdin = "";
9
- try { stdin = readFileSync(0, "utf8"); } catch {}
9
+ try { stdin = readFileSync(0, "utf8"); } catch (e) { process.stderr.write("[extension-mux] stdin read failed: " + (e instanceof Error ? e.message : String(e)) + "\n"); }
10
10
  try {
11
11
  var result = execSync("bash " + JSON.stringify(path.join(PLUGIN_ROOT, "hooks/shell-env.sh")), {
12
12
  input: stdin,
@@ -20,5 +20,7 @@ try {
20
20
  });
21
21
  process.stdout.write(result);
22
22
  } catch (e) {
23
- process.stdout.write("{}\n");
23
+ process.stderr.write("[extension-mux] hook execution failed: " + (e instanceof Error ? e.message : String(e)) + "\n");
24
+ process.stdout.write(JSON.stringify({ error: e instanceof Error ? e.message : String(e) }) + "\n");
25
+ process.exit(1);
24
26
  }
@@ -6,7 +6,7 @@ var readFileSync = require("fs").readFileSync;
6
6
 
7
7
  var PLUGIN_ROOT = process.env.PLUGIN_ROOT || process.env.PLUGIN_ROOT || path.resolve(__dirname, "..");
8
8
  var stdin = "";
9
- try { stdin = readFileSync(0, "utf8"); } catch {}
9
+ try { stdin = readFileSync(0, "utf8"); } catch (e) { process.stderr.write("[extension-mux] stdin read failed: " + (e instanceof Error ? e.message : String(e)) + "\n"); }
10
10
  try {
11
11
  var result = execSync("bash " + JSON.stringify(path.join(PLUGIN_ROOT, "hooks/post-tool-use.sh")), {
12
12
  input: stdin,
@@ -20,5 +20,7 @@ try {
20
20
  });
21
21
  process.stdout.write(result);
22
22
  } catch (e) {
23
- process.stdout.write("{}\n");
23
+ process.stderr.write("[extension-mux] hook execution failed: " + (e instanceof Error ? e.message : String(e)) + "\n");
24
+ process.stdout.write(JSON.stringify({ error: e instanceof Error ? e.message : String(e) }) + "\n");
25
+ process.exit(1);
24
26
  }
@@ -6,7 +6,7 @@ var readFileSync = require("fs").readFileSync;
6
6
 
7
7
  var PLUGIN_ROOT = process.env.PLUGIN_ROOT || process.env.PLUGIN_ROOT || path.resolve(__dirname, "..");
8
8
  var stdin = "";
9
- try { stdin = readFileSync(0, "utf8"); } catch {}
9
+ try { stdin = readFileSync(0, "utf8"); } catch (e) { process.stderr.write("[extension-mux] stdin read failed: " + (e instanceof Error ? e.message : String(e)) + "\n"); }
10
10
  try {
11
11
  var result = execSync("bash " + JSON.stringify(path.join(PLUGIN_ROOT, "hooks/pre-tool-use.sh")), {
12
12
  input: stdin,
@@ -20,5 +20,7 @@ try {
20
20
  });
21
21
  process.stdout.write(result);
22
22
  } catch (e) {
23
- process.stdout.write("{}\n");
23
+ process.stderr.write("[extension-mux] hook execution failed: " + (e instanceof Error ? e.message : String(e)) + "\n");
24
+ process.stdout.write(JSON.stringify({ error: e instanceof Error ? e.message : String(e) }) + "\n");
25
+ process.exit(1);
24
26
  }
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@a5c-ai/babysitter-opencode",
3
- "version": "5.0.1-staging.9e5052f8bc95",
3
+ "version": "5.0.1-staging.a2865ee1a2da",
4
4
  "description": "Orchestrate complex, multi-step workflows with event-sourced state management, hook-based extensibility, and human-in-the-loop approval",
5
5
  "scripts": {
6
6
  "deploy": "npm publish --access public",
@@ -35,7 +35,7 @@
35
35
  "access": "public"
36
36
  },
37
37
  "dependencies": {
38
- "@a5c-ai/babysitter-sdk": "5.0.1-staging.9e5052f8bc95"
38
+ "@a5c-ai/babysitter-sdk": "5.0.1-staging.a2865ee1a2da"
39
39
  },
40
40
  "repository": {
41
41
  "type": "git",
package/plugin.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "babysitter",
3
- "version": "5.0.1-staging.9e5052f8bc95",
3
+ "version": "5.0.1-staging.a2865ee1a2da",
4
4
  "description": "Orchestrate complex, multi-step workflows with event-sourced state management, hook-based extensibility, and human-in-the-loop approval",
5
5
  "author": "a5c.ai",
6
6
  "license": "MIT",
@@ -32,9 +32,15 @@ Read the SDK version from `versions.json` to ensure version compatibility:
32
32
  SDK_VERSION=$(node -e "try{const fs=require('fs');const probes=['./plugins/babysitter-unified/versions.json','./node_modules/@a5c-ai/babysitter-opencode/versions.json'];for(const probe of probes){if(fs.existsSync(probe)){console.log(JSON.parse(fs.readFileSync(probe,'utf8')).sdkVersion||'latest');process.exit(0)}}console.log('latest')}catch{console.log('latest')}")
33
33
  npm i -g @a5c-ai/babysitter-sdk@$SDK_VERSION
34
34
 
35
- CLI="npx -y @a5c-ai/babysitter-sdk@$SDK_VERSION"
35
+ if command -v babysitter >/dev/null 2>&1 && babysitter --version >/dev/null 2>&1; then
36
+ CLI="babysitter"
37
+ else
38
+ CLI="npm exec --yes --package @a5c-ai/babysitter-sdk@$SDK_VERSION -- babysitter"
39
+ fi
36
40
  ```
37
41
 
42
+ If a stale or broken global shim fails with `MODULE_NOT_FOUND`, repair it with `npm rm -g @a5c-ai/babysitter @a5c-ai/babysitter-sdk && npm i -g @a5c-ai/babysitter-sdk@$SDK_VERSION`, then re-run `babysitter --version`.
43
+
38
44
  ## Instructions
39
45
 
40
46
  ### 1. Resolve the status directory
@@ -1,11 +1,12 @@
1
1
  ---
2
2
  name: babysit
3
- description: Orchestrate via @babysitter. Use this skill when asked to babysit a run, orchestrate a process or whenever it is called explicitly. (babysit, babysitter, orchestrate, orchestrate a run, workflow, etc.)
3
+ description: >-
4
+ Execute via @babysitter. Use this skill when asked to babysit a task, do anything that is structured process-driven (even a loop) or whenever it is called explicitly. (babysit, babysitter, with a process, orchestrate, orchestrate a run, workflow, loop until, etc.)
4
5
  ---
5
6
 
6
7
  # babysit
7
8
 
8
- Orchestrate `.a5c/runs/<runId>/` through iterative execution.
9
+ Follow the commands below exactly. but apply judgement and creativity to get the job done. no matter how small or simple the task may seem, you must use the babysitter SDK and run lifecycle to execute it. This is non-negotiable. If you are asked to do something that seems like it could be done with a simple tool call, you must still use the babysitter SDK to create a run, define tasks, execute them, and complete the run. This is how you will learn and demonstrate mastery of the babysitter system. Always follow the full process, even for trivial tasks.
9
10
 
10
11
  ## Dependencies
11
12
 
@@ -17,21 +18,27 @@ Read the SDK version from `versions.json` to ensure version compatibility:
17
18
  SDK_VERSION=$(node -e "try{const fs=require('fs');const probes=['./plugins/babysitter-unified/versions.json','./node_modules/@a5c-ai/babysitter-opencode/versions.json'];for(const probe of probes){if(fs.existsSync(probe)){console.log(JSON.parse(fs.readFileSync(probe,'utf8')).sdkVersion||'latest');process.exit(0)}}console.log('latest')}catch{console.log('latest')}")
18
19
 
19
20
  npm i -g @a5c-ai/babysitter-sdk@$SDK_VERSION
20
- CLI="npx -y @a5c-ai/babysitter-sdk@$SDK_VERSION"
21
+ if command -v babysitter >/dev/null 2>&1 && babysitter --version >/dev/null 2>&1; then
22
+ CLI="babysitter"
23
+ else
24
+ CLI="npm exec --yes --package @a5c-ai/babysitter-sdk@$SDK_VERSION -- babysitter"
25
+ fi
21
26
  ```
22
27
 
28
+ If a stale or broken global shim fails with `MODULE_NOT_FOUND`, repair it with `npm rm -g @a5c-ai/babysitter @a5c-ai/babysitter-sdk && npm i -g @a5c-ai/babysitter-sdk@$SDK_VERSION`, then re-run `babysitter --version`.
29
+
23
30
  ## Instructions
24
31
 
25
32
  Run the following command to get full orchestration instructions:
26
33
 
27
34
  ```bash
28
- babysitter instructions:babysit-skill --harness opencode --interactive
35
+ $CLI instructions:babysit-skill --harness opencode --interactive
29
36
  ```
30
37
 
31
38
  For non-interactive mode:
32
39
 
33
40
  ```bash
34
- babysitter instructions:babysit-skill --harness opencode --no-interactive
41
+ $CLI instructions:babysit-skill --harness opencode --no-interactive
35
42
  ```
36
43
 
37
44
  Follow the instructions returned by the command above to orchestrate the run.
@@ -0,0 +1,69 @@
1
+ ---
2
+ name: check-forbidden-markers
3
+ description: Pre-deploy gate that scans built JS chunks for forbidden substring markers (saga-era / obsolete code paths) listed in a project-local forbidden-markers.txt
4
+ ---
5
+
6
+ # check-forbidden-markers
7
+
8
+ Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). Compose the gate from the shared helper at `library/processes/shared/forbidden-markers-scanner.js` (issue #477).
9
+
10
+ ## What this gate does
11
+
12
+ Reads a list of literal substring markers from `scripts/forbidden-markers.txt` (blank lines and `#`-prefixed comments stripped) and greps every `.js` chunk under `.vercel/output/static/_next/static/chunks/` (Next.js / Vercel default; configurable) for any occurrence. Reports structured hits per `(marker, chunk)` pair with occurrence counts. Designed to chain between `vercel build --prod` and `vercel deploy --prod`.
13
+
14
+ Use this gate when a refactor or restart-from-baseline replaced load-bearing code paths and you need a structural guarantee the obsolete symbols never re-ship. Burned-in evidence: cookbook VI-9 / VI-12 near-miss revivals during the 2026-05 iOS-Safari saga; the prototype lives at `cookbook/scripts/check-no-forbidden.mjs` and shipped two upstream contributions before being generalized as this gate.
15
+
16
+ ## When to use
17
+
18
+ - **Pre-deploy.** Insert after build, before deploy. Block the deploy when `ok: false`.
19
+ - **Post-restart.** After a baseline rollback + step-by-step re-add, snapshot the saga-era markers in `forbidden-markers.txt` and let CI hold the line.
20
+ - **Post-refactor.** When old helper / handler / module names must not coexist with the new ones in the same bundle.
21
+
22
+ ## Expected config locations
23
+
24
+ - `scripts/forbidden-markers.txt` — one marker per line, `#` for comments. The list is the contract; the gate is mechanical. Commit this file to source control.
25
+ - `.vercel/output/static/_next/static/chunks/` — default scan target. Override for non-Vercel frameworks via the `--chunks-dir` flag or the `chunksDir` task input.
26
+
27
+ A missing markers file is a no-op (`ok: true`, `reason: 'missing-markers-file'`) — misconfiguration is never a deploy block. A missing chunks directory is likewise a no-op (`reason: 'missing-chunks-dir'`) so the gate is safe to chain into `check:all` before the build runs.
28
+
29
+ ## Exit semantics
30
+
31
+ | Reason | `ok` | Deploy decision |
32
+ |-------------------------|--------|--------------------------------|
33
+ | `missing-markers-file` | true | Pass (no gate active) |
34
+ | `missing-chunks-dir` | true | Pass (run before build) |
35
+ | `empty-markers` | true | Pass (list is empty) |
36
+ | `no-chunks` | true | Pass (nothing to scan) |
37
+ | `clean` | true | Pass — proceed to deploy |
38
+ | `hits` | false | **BLOCK** — surface hits, ask for triage |
39
+
40
+ For each hit, the gate emits `{ marker, chunk, count }` so the operator sees the exact marker string, the absolute chunk path, and the number of occurrences in that chunk. Multiple hits across chunks for the same marker are reported separately.
41
+
42
+ ## Programmatic surface
43
+
44
+ ```js
45
+ import { scanForbiddenMarkers, checkForbiddenMarkersTask } from '@a5c-ai/babysitter-library/processes/shared';
46
+
47
+ // Direct call:
48
+ const result = await scanForbiddenMarkers({
49
+ markersFile: 'scripts/forbidden-markers.txt',
50
+ chunksDir: '.vercel/output/static/_next/static/chunks',
51
+ });
52
+ if (!result.ok) {
53
+ // result.hits: Array<{ marker, chunk, count }>
54
+ // result.reason === 'hits'
55
+ process.exit(1);
56
+ }
57
+
58
+ // Or dispatched as a babysitter task:
59
+ const gate = await ctx.task(checkForbiddenMarkersTask, {
60
+ projectDir: '.',
61
+ // markersFile / chunksDir are inferred from projectDir if omitted
62
+ });
63
+ ```
64
+
65
+ ## Reference
66
+
67
+ - Issue: https://github.com/a5c-ai/babysitter/issues/477
68
+ - Helper module: `library/processes/shared/forbidden-markers-scanner.js`
69
+ - Origin (cookbook prototype): `cookbook/scripts/check-no-forbidden.mjs` (81 lines)
@@ -7,7 +7,13 @@ description: Clean up .a5c/runs and .a5c/processes directories. Aggregates insig
7
7
 
8
8
  Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
9
9
 
10
- Create and run a cleanup process using the process at `skills\babysit\process\cradle\cleanup-runs.js/processes/cleanup-runs.js`.
10
+ Resolve the active process library with:
11
+
12
+ ```bash
13
+ babysitter process-library:active --json
14
+ ```
15
+
16
+ Read `binding.dir` from that JSON and create/run the cleanup process from `cradle/cleanup-runs.js#process` relative to that active library root. Do not use plugin-cache-relative cradle paths.
11
17
 
12
18
  Implementation notes (for the process):
13
19
  - Parse arguments for `--dry-run` flag (if present, set dryRun: true in inputs) and `--keep-days N` (default: 7)
@@ -234,7 +234,8 @@ SECONDARY COMMANDS
234
234
  How it works: Runs npx @a5c-ai/babysitter-observer-dashboard@latest which watches
235
235
  the .a5c/runs/ directory (or a parent directory containing multiple projects) and
236
236
  serves a live dashboard. The process is blocking -- it runs until you stop it, and
237
- it prints the local URL to share with the user.
237
+ it prints the local URL to share with the user. Do not use `babysitter observe`
238
+ as a fallback; the core Babysitter CLI does not expose that subcommand.
238
239
 
239
240
  Example: /babysitter:observe
240
241
  (opens browser showing all runs with live-updating task
@@ -8,6 +8,11 @@ description: Launch the babysitter observer dashboard. Installs and runs the rea
8
8
  Run the babysitter observer dashboard:
9
9
 
10
10
  1. Determine the watch directory — this is usually the project's container directory (the parent of the project dir), or the current working directory if not specified.
11
- 2. Launch the dashboard: `npx -y @a5c-ai/babysitter-observer-dashboard@latest --watch-dir <dir>`
11
+ 2. Launch the standalone dashboard package: `npx -y @a5c-ai/babysitter-observer-dashboard@latest --watch-dir <dir>`.
12
12
  3. This is a blocking process — it will keep running until stopped.
13
13
  4. Report the URL printed by the dashboard to the user, then open it in the browser.
14
+
15
+ Do not fall back to `babysitter observe`; the core Babysitter CLI does not expose
16
+ that subcommand. Some harness runtimes may provide a separate
17
+ `agent-platform observe` surface, but this skill uses the verified standalone
18
+ dashboard package.
@@ -5,4 +5,14 @@ description: Plan a babysitter run. use this command to plan a complex workflow,
5
5
 
6
6
  # plan
7
7
 
8
- Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). focus on creating the best process possible, but without creating and running the actual run.
8
+ Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). Focus on creating the best process possible, but without creating and running the actual run.
9
+
10
+ Before drafting the process, run Phase 0 -- REUSE-AUDIT: extract keyword nouns and verbs from the request, scan for matching existing migrations, API routes, environment variables, SDK dependencies, and imports, honor `.a5c/reuse-audit.json` when present, and put a `Reuse-audit findings (REVIEW BEFORE PROCEEDING)` block before Phase 1 of the plan.
11
+
12
+ ## Process Shape Selection
13
+
14
+ Choose the process shape before authoring `process.js`:
15
+
16
+ - Use a flat phase list when the spec is well-defined, the work is wiring or composition, the bug class is already known if this is a fix, and execution should proceed sequentially through clear phases.
17
+ - Use a HYPOTHESES tree when the bug class is unknown, forensics are required, multiple causal models compete, and each hypothesis needs its own observations, falsifying observations, and follow-up phases.
18
+ - Rule of thumb: if the first phase is "investigate", use HYPOTHESES-tree mode. If the first phase is "implement X", use flat-phase-list mode.
@@ -5,7 +5,7 @@ description: Orchestrate a babysitter run. use this command to start babysitting
5
5
 
6
6
  # yolo
7
7
 
8
- Start the Babysitter run directly through the CLI, without any user interaction or breakpoints. Do not invoke the Skill tool and do not run an instructions-only command. In Claude Code, use Bash to run `babysitter harness:yolo --harness claude-code --workspace "$PWD" --prompt "<user arguments>" --json`; in Codex, run `babysitter harness:yolo --harness codex --workspace "$PWD" --prompt "<user arguments>" --json`; in other harnesses, use the same command with that harness id. Replace `<user arguments>` with the arguments shown below, wait for the command to finish, and treat the CLI completion proof as the result.
8
+ Run the Babysitter orchestration instructions directly through the CLI, without any user interaction or breakpoints. In Claude Code, use Bash to run `babysitter instructions:babysit-skill --harness claude-code --no-interactive`; in Codex, run `babysitter instructions:babysit-skill --harness codex --no-interactive`; in other harnesses, use the same command with that harness id. Then follow the returned instructions in this same turn until completion proof is produced. Do not stop after reading the instructions, do not invoke the Skill tool first, and use the non-interactive/no-breakpoints path when the instructions offer a mode choice.
9
9
 
10
10
  User arguments for this command:
11
11
 
package/versions.json CHANGED
@@ -1,4 +1,4 @@
1
1
  {
2
- "sdkVersion": "5.0.1-staging.9e5052f8bc95",
3
- "extensionVersion": "5.0.1-staging.9e5052f8bc95"
2
+ "sdkVersion": "5.0.1-staging.a2865ee1a2da",
3
+ "extensionVersion": "5.0.1-staging.a2865ee1a2da"
4
4
  }