@a5c-ai/babysitter-opencode 5.0.0 → 5.0.1-staging.04ca6ab00d21
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +17 -19
- package/bin/install-shared.js +1 -1
- package/commands/call.md +10 -6
- package/commands/cleanup.md +30 -8
- package/commands/contrib.md +31 -31
- package/commands/doctor.md +2 -3
- package/commands/forever.md +6 -6
- package/commands/help.md +2 -1
- package/commands/observe.md +6 -1
- package/commands/plan.md +7 -7
- package/commands/plugins.md +249 -249
- package/commands/project-install.md +10 -10
- package/commands/resume.md +8 -8
- package/commands/retrospect.md +55 -55
- package/commands/user-install.md +10 -10
- package/commands/yolo.md +5 -1
- package/hooks/babysitter-proxied-stop-hook.sh +3 -0
- package/hooks/hooks.json +0 -8
- package/package.json +3 -4
- package/plugin.json +1 -1
- package/scripts/publish-from-tag.mjs +32 -3
- package/skills/accomplish-status/SKILL.md +1 -1
- package/skills/babysit/SKILL.md +4 -3
- package/skills/call/SKILL.md +5 -1
- package/skills/cleanup/SKILL.md +30 -8
- package/skills/contrib/SKILL.md +25 -25
- package/skills/doctor/SKILL.md +2 -3
- package/skills/help/SKILL.md +2 -1
- package/skills/observe/SKILL.md +6 -1
- package/skills/plugins/SKILL.md +243 -243
- package/skills/project-install/SKILL.md +3 -3
- package/skills/resume/SKILL.md +1 -1
- package/skills/retrospect/SKILL.md +48 -48
- package/skills/user-install/SKILL.md +3 -3
- package/skills/yolo/SKILL.md +5 -1
- package/versions.json +2 -2
- package/hooks/babysitter-proxied-session-idle.js +0 -24
- package/hooks/babysitter-proxied-session-idle.sh +0 -3
package/commands/retrospect.md
CHANGED
|
@@ -1,55 +1,55 @@
|
|
|
1
|
-
---
|
|
2
|
-
description: Analysis for a run and its results, process, suggestions for process improvements, process optimizations, fixes, etc. for the next runs.
|
|
3
|
-
argument-hint: "[run-id...] [--all] Specific run IDs, --all for all runs, or defaults to latest"
|
|
4
|
-
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
-
---
|
|
6
|
-
|
|
7
|
-
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
-
|
|
9
|
-
create and run a retrospect process:
|
|
10
|
-
|
|
11
|
-
### Run Selection
|
|
12
|
-
|
|
13
|
-
- `--all` or "all runs": list all completed/failed runs and analyze collectively
|
|
14
|
-
- Multiple run IDs: analyze each specified run
|
|
15
|
-
- Single run ID or no ID: existing behavior (latest run)
|
|
16
|
-
- In interactive mode with no run specified: ask user whether to analyze latest, select specific runs, or all runs
|
|
17
|
-
|
|
18
|
-
### Cross-Run Analysis (multi-run mode)
|
|
19
|
-
|
|
20
|
-
When analyzing multiple runs, the retrospect process should additionally cover:
|
|
21
|
-
- Common failure patterns across runs
|
|
22
|
-
- Velocity trends (tasks/time across runs)
|
|
23
|
-
- Process evolution (how processes changed)
|
|
24
|
-
- Repeated breakpoint patterns
|
|
25
|
-
- Aggregate quality metrics
|
|
26
|
-
|
|
27
|
-
implementations notes (for the process):
|
|
28
|
-
- The process should analyze the run, the process that was followed, and provide suggestions for improvements, optimizations, and fixes.
|
|
29
|
-
- The process should such have many breakpoints where the user can steer the process, provide feedback, and make decisions about how to proceed with the retrospect.
|
|
30
|
-
- The process should be designed to be flexible and adaptable to different types of runs, projects, and goals, and should be able to provide insights and suggestions that are relevant and actionable for the user. (modification to the process, skills, etc.)
|
|
31
|
-
- The process should be designed to be iterative, allowing the user to go through multiple rounds of analysis and improvement, and should be able to track the changes and improvements made over time.
|
|
32
|
-
- The process should cover:
|
|
33
|
-
- Analysis of the run and its results, including what went well, what didn't go well, and what could be improved.
|
|
34
|
-
- Analysis of the process that was followed, including what steps were taken, what tools were used, and how effective they were.
|
|
35
|
-
- Suggestions for improvements, optimizations, and fixes for both the run and the process.
|
|
36
|
-
- Implementing the improvements, optimizations, and fixes, and tracking the changes made over time.
|
|
37
|
-
### Cleanup Suggestion
|
|
38
|
-
|
|
39
|
-
After retrospect analysis, suggest running `/babysitter:cleanup` to clean up old run data and reclaim disk space.
|
|
40
|
-
|
|
41
|
-
- Ending by explicitly prompting the user to contribute back -- even just reporting an issue is valuable, they don't need to implement the fix themselves. After analysis, display a clear call-to-action:
|
|
42
|
-
|
|
43
|
-
"You've identified [specific insight/improvement]. This could help other babysitter users too. Run `/babysitter:contrib` to share it upstream -- you can either report it as an issue or submit a PR with the fix."
|
|
44
|
-
|
|
45
|
-
Route to the specific contrib workflow based on what the user wants to do:
|
|
46
|
-
|
|
47
|
-
**Just reporting (no code changes needed):**
|
|
48
|
-
- Found a bug or weakness in a process -> `/babysitter:contrib bug report: [description of what went wrong]`
|
|
49
|
-
- Found missing or confusing documentation -> `/babysitter:contrib documentation question: [what was unclear]`
|
|
50
|
-
- Have an idea for improvement but don't want to implement it -> `/babysitter:contrib feature request: [description]`
|
|
51
|
-
|
|
52
|
-
**Contributing code changes:**
|
|
53
|
-
- Process/skill/agent improvements -> `/babysitter:contrib library contribution: [description]`
|
|
54
|
-
- Bug fixes in SDK or CLI -> `/babysitter:contrib bugfix: [description]`
|
|
55
|
-
- Plugin instruction improvements -> `/babysitter:contrib library contribution: improved [plugin-name] [install|configure|uninstall] instructions`
|
|
1
|
+
---
|
|
2
|
+
description: Analysis for a run and its results, process, suggestions for process improvements, process optimizations, fixes, etc. for the next runs.
|
|
3
|
+
argument-hint: "[run-id...] [--all] Specific run IDs, --all for all runs, or defaults to latest"
|
|
4
|
+
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
+
|
|
9
|
+
create and run a retrospect process:
|
|
10
|
+
|
|
11
|
+
### Run Selection
|
|
12
|
+
|
|
13
|
+
- `--all` or "all runs": list all completed/failed runs and analyze collectively
|
|
14
|
+
- Multiple run IDs: analyze each specified run
|
|
15
|
+
- Single run ID or no ID: existing behavior (latest run)
|
|
16
|
+
- In interactive mode with no run specified: ask user whether to analyze latest, select specific runs, or all runs
|
|
17
|
+
|
|
18
|
+
### Cross-Run Analysis (multi-run mode)
|
|
19
|
+
|
|
20
|
+
When analyzing multiple runs, the retrospect process should additionally cover:
|
|
21
|
+
- Common failure patterns across runs
|
|
22
|
+
- Velocity trends (tasks/time across runs)
|
|
23
|
+
- Process evolution (how processes changed)
|
|
24
|
+
- Repeated breakpoint patterns
|
|
25
|
+
- Aggregate quality metrics
|
|
26
|
+
|
|
27
|
+
implementations notes (for the process):
|
|
28
|
+
- The process should analyze the run, the process that was followed, and provide suggestions for improvements, optimizations, and fixes.
|
|
29
|
+
- The process should such have many breakpoints where the user can steer the process, provide feedback, and make decisions about how to proceed with the retrospect.
|
|
30
|
+
- The process should be designed to be flexible and adaptable to different types of runs, projects, and goals, and should be able to provide insights and suggestions that are relevant and actionable for the user. (modification to the process, skills, etc.)
|
|
31
|
+
- The process should be designed to be iterative, allowing the user to go through multiple rounds of analysis and improvement, and should be able to track the changes and improvements made over time.
|
|
32
|
+
- The process should cover:
|
|
33
|
+
- Analysis of the run and its results, including what went well, what didn't go well, and what could be improved.
|
|
34
|
+
- Analysis of the process that was followed, including what steps were taken, what tools were used, and how effective they were.
|
|
35
|
+
- Suggestions for improvements, optimizations, and fixes for both the run and the process.
|
|
36
|
+
- Implementing the improvements, optimizations, and fixes, and tracking the changes made over time.
|
|
37
|
+
### Cleanup Suggestion
|
|
38
|
+
|
|
39
|
+
After retrospect analysis, suggest running `/babysitter:cleanup` to clean up old run data and reclaim disk space.
|
|
40
|
+
|
|
41
|
+
- Ending by explicitly prompting the user to contribute back -- even just reporting an issue is valuable, they don't need to implement the fix themselves. After analysis, display a clear call-to-action:
|
|
42
|
+
|
|
43
|
+
"You've identified [specific insight/improvement]. This could help other babysitter users too. Run `/babysitter:contrib` to share it upstream -- you can either report it as an issue or submit a PR with the fix."
|
|
44
|
+
|
|
45
|
+
Route to the specific contrib workflow based on what the user wants to do:
|
|
46
|
+
|
|
47
|
+
**Just reporting (no code changes needed):**
|
|
48
|
+
- Found a bug or weakness in a process -> `/babysitter:contrib bug report: [description of what went wrong]`
|
|
49
|
+
- Found missing or confusing documentation -> `/babysitter:contrib documentation question: [what was unclear]`
|
|
50
|
+
- Have an idea for improvement but don't want to implement it -> `/babysitter:contrib feature request: [description]`
|
|
51
|
+
|
|
52
|
+
**Contributing code changes:**
|
|
53
|
+
- Process/skill/agent improvements -> `/babysitter:contrib library contribution: [description]`
|
|
54
|
+
- Bug fixes in SDK or CLI -> `/babysitter:contrib bugfix: [description]`
|
|
55
|
+
- Plugin instruction improvements -> `/babysitter:contrib library contribution: improved [plugin-name] [install|configure|uninstall] instructions`
|
package/commands/user-install.md
CHANGED
|
@@ -1,11 +1,11 @@
|
|
|
1
|
-
---
|
|
2
|
-
description: Set up babysitter for yourself. Guides you through onboarding — installs dependencies, interviews you about your specialties and preferences, builds your user profile, and configures the best tools for your workflow.
|
|
3
|
-
argument-hint: Specific instructions for the run.
|
|
4
|
-
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
-
---
|
|
6
|
-
|
|
7
|
-
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
-
|
|
1
|
+
---
|
|
2
|
+
description: Set up babysitter for yourself. Guides you through onboarding — installs dependencies, interviews you about your specialties and preferences, builds your user profile, and configures the best tools for your workflow.
|
|
3
|
+
argument-hint: Specific instructions for the run.
|
|
4
|
+
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
+
|
|
9
9
|
Before using the process library, resolve the active library root through the SDK CLI. If no binding exists yet, initialize the shared global SDK binding with:
|
|
10
10
|
|
|
11
11
|
```bash
|
|
@@ -13,5 +13,5 @@ babysitter process-library:active --json
|
|
|
13
13
|
```
|
|
14
14
|
|
|
15
15
|
Then use the `cradle/user-install` process from the active process library.
|
|
16
|
-
|
|
17
|
-
When the run completes, end with a friendly message that includes a polite and humorous ask to star the repo on GitHub: https://github.com/a5c-ai/babysitter
|
|
16
|
+
|
|
17
|
+
When the run completes, end with a friendly message that includes a polite and humorous ask to star the repo on GitHub: https://github.com/a5c-ai/babysitter
|
package/commands/yolo.md
CHANGED
|
@@ -4,4 +4,8 @@ argument-hint: Specific instructions for the run.
|
|
|
4
4
|
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
5
|
---
|
|
6
6
|
|
|
7
|
-
|
|
7
|
+
Run the Babysitter orchestration instructions directly through the CLI, without any user interaction or breakpoints. In Claude Code, use Bash to run `babysitter instructions:babysit-skill --harness claude-code --no-interactive`; in Codex, run `babysitter instructions:babysit-skill --harness codex --no-interactive`; in other harnesses, use the same command with that harness id. Then follow the returned instructions in this same turn until completion proof is produced. Do not stop after reading the instructions, do not invoke the Skill tool first, and use the non-interactive/no-breakpoints path when the instructions offer a mode choice.
|
|
8
|
+
|
|
9
|
+
User arguments for this command:
|
|
10
|
+
|
|
11
|
+
$ARGUMENTS
|
package/hooks/hooks.json
CHANGED
|
@@ -26,14 +26,6 @@
|
|
|
26
26
|
"timeoutMs": 30000
|
|
27
27
|
}
|
|
28
28
|
],
|
|
29
|
-
"session.idle": [
|
|
30
|
-
{
|
|
31
|
-
"type": "command",
|
|
32
|
-
"script": "./hooks/babysitter-proxied-session-idle.js",
|
|
33
|
-
"description": "babysitter SessionIdle hook",
|
|
34
|
-
"timeoutMs": 30000
|
|
35
|
-
}
|
|
36
|
-
],
|
|
37
29
|
"shell.env": [
|
|
38
30
|
{
|
|
39
31
|
"type": "command",
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "@a5c-ai/babysitter-opencode",
|
|
3
|
-
"version": "5.0.
|
|
3
|
+
"version": "5.0.1-staging.04ca6ab00d21",
|
|
4
4
|
"description": "Orchestrate complex, multi-step workflows with event-sourced state management, hook-based extensibility, and human-in-the-loop approval",
|
|
5
5
|
"scripts": {
|
|
6
6
|
"deploy": "npm publish --access public",
|
|
@@ -8,8 +8,7 @@
|
|
|
8
8
|
"postinstall": "node bin/install.js",
|
|
9
9
|
"preuninstall": "node bin/uninstall.js",
|
|
10
10
|
"team:install": "node scripts/team-install.js",
|
|
11
|
-
"test": "node test/integration.test.js"
|
|
12
|
-
"sync:commands": "node scripts/sync-command-docs.cjs"
|
|
11
|
+
"test": "node test/integration.test.js"
|
|
13
12
|
},
|
|
14
13
|
"bin": {
|
|
15
14
|
"babysitter-opencode": "bin/cli.js"
|
|
@@ -36,7 +35,7 @@
|
|
|
36
35
|
"access": "public"
|
|
37
36
|
},
|
|
38
37
|
"dependencies": {
|
|
39
|
-
"@a5c-ai/babysitter-sdk": "5.0.
|
|
38
|
+
"@a5c-ai/babysitter-sdk": "5.0.1-staging.04ca6ab00d21"
|
|
40
39
|
},
|
|
41
40
|
"repository": {
|
|
42
41
|
"type": "git",
|
package/plugin.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "babysitter",
|
|
3
|
-
"version": "5.0.
|
|
3
|
+
"version": "5.0.1-staging.04ca6ab00d21",
|
|
4
4
|
"description": "Orchestrate complex, multi-step workflows with event-sourced state management, hook-based extensibility, and human-in-the-loop approval",
|
|
5
5
|
"author": "a5c.ai",
|
|
6
6
|
"license": "MIT",
|
|
@@ -1,12 +1,41 @@
|
|
|
1
1
|
#!/usr/bin/env node
|
|
2
2
|
import { spawnSync } from 'node:child_process';
|
|
3
|
+
import { readFileSync } from 'node:fs';
|
|
3
4
|
|
|
4
|
-
function run(command, args) {
|
|
5
|
-
const result = spawnSync(command, args, { stdio: 'inherit' });
|
|
6
|
-
if (result.status !== 0) process.exit(result.status || 1);
|
|
5
|
+
function run(command, args, options = {}) {
|
|
6
|
+
const result = spawnSync(command, args, { stdio: options.stdio || 'inherit', encoding: options.encoding });
|
|
7
|
+
if (result.status !== 0 && !options.allowFailure) process.exit(result.status || 1);
|
|
8
|
+
return result;
|
|
7
9
|
}
|
|
8
10
|
|
|
11
|
+
function npmView(packageSpec) {
|
|
12
|
+
return run('npm', ['view', packageSpec, 'version'], { allowFailure: true, stdio: 'pipe', encoding: 'utf8' }).status === 0;
|
|
13
|
+
}
|
|
14
|
+
|
|
15
|
+
const pkg = JSON.parse(readFileSync('package.json', 'utf8'));
|
|
9
16
|
const ref = process.env.GITHUB_REF_NAME || '';
|
|
10
17
|
const branch = ref.split('/')[1] || 'develop';
|
|
11
18
|
const tag = branch === 'main' ? 'latest' : branch;
|
|
19
|
+
|
|
20
|
+
if (!process.env.NODE_AUTH_TOKEN) {
|
|
21
|
+
console.log('NODE_AUTH_TOKEN is not configured; skipping npm publish.');
|
|
22
|
+
process.exit(0);
|
|
23
|
+
}
|
|
24
|
+
|
|
25
|
+
if (npmView(pkg.name + '@' + pkg.version)) {
|
|
26
|
+
console.log(pkg.name + '@' + pkg.version + ' already exists; ensuring dist-tag ' + tag + '.');
|
|
27
|
+
run('npm', ['dist-tag', 'add', pkg.name + '@' + pkg.version, tag], { allowFailure: true });
|
|
28
|
+
process.exit(0);
|
|
29
|
+
}
|
|
30
|
+
|
|
31
|
+
for (const field of ['dependencies', 'peerDependencies', 'optionalDependencies']) {
|
|
32
|
+
for (const [name, version] of Object.entries(pkg[field] || {})) {
|
|
33
|
+
if (!name.startsWith('@a5c-ai/') || version.startsWith('^') || version.startsWith('~') || version === '*' || version.startsWith('workspace:')) continue;
|
|
34
|
+
if (!npmView(name + '@' + version)) {
|
|
35
|
+
console.log('Required internal dependency ' + name + '@' + version + ' is not published yet; skipping npm publish.');
|
|
36
|
+
process.exit(0);
|
|
37
|
+
}
|
|
38
|
+
}
|
|
39
|
+
}
|
|
40
|
+
|
|
12
41
|
run('npm', ['publish', '--access', 'public', '--tag', tag]);
|
|
@@ -29,7 +29,7 @@ Where `OPENCODE_CONFIG_DIR` is resolved from the `OPENCODE_CONFIG_DIR` environme
|
|
|
29
29
|
Read the SDK version from `versions.json` to ensure version compatibility:
|
|
30
30
|
|
|
31
31
|
```bash
|
|
32
|
-
SDK_VERSION=$(node -e "try{const fs=require('fs');const probes=['./plugins/babysitter-
|
|
32
|
+
SDK_VERSION=$(node -e "try{const fs=require('fs');const probes=['./plugins/babysitter-unified/versions.json','./node_modules/@a5c-ai/babysitter-opencode/versions.json'];for(const probe of probes){if(fs.existsSync(probe)){console.log(JSON.parse(fs.readFileSync(probe,'utf8')).sdkVersion||'latest');process.exit(0)}}console.log('latest')}catch{console.log('latest')}")
|
|
33
33
|
npm i -g @a5c-ai/babysitter-sdk@$SDK_VERSION
|
|
34
34
|
|
|
35
35
|
CLI="npx -y @a5c-ai/babysitter-sdk@$SDK_VERSION"
|
package/skills/babysit/SKILL.md
CHANGED
|
@@ -1,11 +1,12 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: babysit
|
|
3
|
-
description:
|
|
3
|
+
description: >-
|
|
4
|
+
Execute via @babysitter. Use this skill when asked to babysit a task, do anything that is structured process-driven (even a loop) or whenever it is called explicitly. (babysit, babysitter, with a process, orchestrate, orchestrate a run, workflow, loop until, etc.)
|
|
4
5
|
---
|
|
5
6
|
|
|
6
7
|
# babysit
|
|
7
8
|
|
|
8
|
-
|
|
9
|
+
Follow the commands below exactly. but apply judgement and creativity to get the job done. no matter how small or simple the task may seem, you must use the babysitter SDK and run lifecycle to execute it. This is non-negotiable. If you are asked to do something that seems like it could be done with a simple tool call, you must still use the babysitter SDK to create a run, define tasks, execute them, and complete the run. This is how you will learn and demonstrate mastery of the babysitter system. Always follow the full process, even for trivial tasks.
|
|
9
10
|
|
|
10
11
|
## Dependencies
|
|
11
12
|
|
|
@@ -14,7 +15,7 @@ Orchestrate `.a5c/runs/<runId>/` through iterative execution.
|
|
|
14
15
|
Read the SDK version from `versions.json` to ensure version compatibility:
|
|
15
16
|
|
|
16
17
|
```bash
|
|
17
|
-
SDK_VERSION=$(node -e "try{const fs=require('fs');const probes=['./plugins/babysitter-
|
|
18
|
+
SDK_VERSION=$(node -e "try{const fs=require('fs');const probes=['./plugins/babysitter-unified/versions.json','./node_modules/@a5c-ai/babysitter-opencode/versions.json'];for(const probe of probes){if(fs.existsSync(probe)){console.log(JSON.parse(fs.readFileSync(probe,'utf8')).sdkVersion||'latest');process.exit(0)}}console.log('latest')}catch{console.log('latest')}")
|
|
18
19
|
|
|
19
20
|
npm i -g @a5c-ai/babysitter-sdk@$SDK_VERSION
|
|
20
21
|
CLI="npx -y @a5c-ai/babysitter-sdk@$SDK_VERSION"
|
package/skills/call/SKILL.md
CHANGED
|
@@ -5,4 +5,8 @@ description: Orchestrate a babysitter run. use this command to start babysitting
|
|
|
5
5
|
|
|
6
6
|
# call
|
|
7
7
|
|
|
8
|
-
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). Then continue executing the returned instructions in this same turn. Do not stop after the Skill tool returns; carry the requested run through to completion proof.
|
|
9
|
+
|
|
10
|
+
User arguments for this command:
|
|
11
|
+
|
|
12
|
+
$ARGUMENTS
|
package/skills/cleanup/SKILL.md
CHANGED
|
@@ -11,11 +11,33 @@ Create and run a cleanup process using the process at `skills\babysit\process\cr
|
|
|
11
11
|
|
|
12
12
|
Implementation notes (for the process):
|
|
13
13
|
- Parse arguments for `--dry-run` flag (if present, set dryRun: true in inputs) and `--keep-days N` (default: 7)
|
|
14
|
-
|
|
15
|
-
|
|
16
|
-
|
|
17
|
-
|
|
18
|
-
-
|
|
19
|
-
-
|
|
20
|
-
-
|
|
21
|
-
-
|
|
14
|
+
|
|
15
|
+
CRITICAL: The cleanup MUST follow this exact phase order. Do NOT delete any run before Phase 2 completes.
|
|
16
|
+
|
|
17
|
+
Phase 1 — Scan:
|
|
18
|
+
- Scan .a5c/runs/ for all runs
|
|
19
|
+
- Classify each as terminal (completed/failed) or active (in-progress/created)
|
|
20
|
+
- Identify terminal runs older than the keep-days threshold as removal candidates
|
|
21
|
+
- Never mark active/in-progress runs for removal
|
|
22
|
+
- Count and report: total runs, terminal, active, removal candidates, disk usage
|
|
23
|
+
|
|
24
|
+
Phase 2 — Aggregate insights (BEFORE any deletion):
|
|
25
|
+
- For EVERY removal candidate, read its run.json and journal/ events
|
|
26
|
+
- Extract: processId, prompt, status, event count, created date, task summaries
|
|
27
|
+
- Group by process type and extract patterns (retry counts, convergence behavior, failure modes)
|
|
28
|
+
- Append a new dated section to docs/run-history-insights.md with:
|
|
29
|
+
- Summary statistics (runs removed, disk freed, runs retained)
|
|
30
|
+
- Run categories with counts and descriptions
|
|
31
|
+
- Key patterns observed (multi-batch convergence, retry behavior, etc.)
|
|
32
|
+
- What worked well / what didn't from the run data
|
|
33
|
+
- This file MUST be written and verified before proceeding to Phase 3
|
|
34
|
+
|
|
35
|
+
Phase 3 — Confirm removal:
|
|
36
|
+
- In interactive mode, show the user what will be removed via a breakpoint
|
|
37
|
+
- In non-interactive mode (yolo), proceed with defaults
|
|
38
|
+
- In dry-run mode, stop here and show what would be removed
|
|
39
|
+
|
|
40
|
+
Phase 4 — Remove:
|
|
41
|
+
- Delete the terminal runs older than keep-days threshold
|
|
42
|
+
- Identify and remove orphaned process files not referenced by remaining runs
|
|
43
|
+
- Show remaining run count and disk usage after cleanup
|
package/skills/contrib/SKILL.md
CHANGED
|
@@ -5,30 +5,30 @@ description: Submit feedback or contribute to babysitter project
|
|
|
5
5
|
|
|
6
6
|
# contrib
|
|
7
7
|
|
|
8
|
-
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
9
|
-
|
|
10
|
-
## Process Routing
|
|
11
|
-
|
|
8
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
9
|
+
|
|
10
|
+
## Process Routing
|
|
11
|
+
|
|
12
12
|
Contribution processes live under the active process library's `cradle/` directory. Resolve the active library root with `babysitter process-library:active --json` and route based on arguments:
|
|
13
|
-
|
|
14
|
-
### Issue-based (opens a GitHub issue in a5c-ai/babysitter)
|
|
15
|
-
* **Bug report** → `cradle/bug-report.js#process` — Report a bug in the SDK, CLI, process library, etc.
|
|
16
|
-
* **Feature request** → `cradle/feature-request.js#process` — Request a new feature or enhancement
|
|
17
|
-
* **Documentation question** → `cradle/documentation-question.js#process` — Ask about undocumented behavior or missing docs
|
|
18
|
-
|
|
19
|
-
### PR-based (forks repo, creates branch, submits PR to a5c-ai/babysitter)
|
|
20
|
-
* **Bugfix** → `cradle/bugfix.js#process` — User already has the fix for a bug
|
|
21
|
-
* **Feature implementation** → `cradle/feature-implementation-contribute.js#process` — User already has a feature implementation
|
|
22
|
-
* **Harness integration** → `cradle/feature-harness-integration-contribute.js#process` — User has a harness (CI/CD, IDE, editor) integration
|
|
23
|
-
* **Library contribution** → `cradle/library-contribution.js#process` — New or improved process/skill/subagent for the library
|
|
24
|
-
* **Documentation answer** → `cradle/documentation-contribute-answer.js#process` — User has an answer for an unanswered docs question
|
|
25
|
-
|
|
26
|
-
### Router (when arguments are empty or general)
|
|
27
|
-
* **Contribute** → `cradle/contribute.js#process` — Explains contribution types and routes to the specific process
|
|
28
|
-
|
|
29
|
-
## Contribution Rules
|
|
30
|
-
|
|
31
|
-
* PR-based contributions: fork the babysitter repo (a5c-ai/babysitter) for the user, ask to star if not already starred, perform changes, submit PR
|
|
32
|
-
* Issue-based contributions: gather details, search for duplicates, review, then open an issue in a5c-ai/babysitter
|
|
33
|
-
* Add breakpoints (permissions) before ALL gh actions (fork, star, submit PR/issue) to allow user review and cancellation
|
|
13
|
+
|
|
14
|
+
### Issue-based (opens a GitHub issue in a5c-ai/babysitter)
|
|
15
|
+
* **Bug report** → `cradle/bug-report.js#process` — Report a bug in the SDK, CLI, process library, etc.
|
|
16
|
+
* **Feature request** → `cradle/feature-request.js#process` — Request a new feature or enhancement
|
|
17
|
+
* **Documentation question** → `cradle/documentation-question.js#process` — Ask about undocumented behavior or missing docs
|
|
18
|
+
|
|
19
|
+
### PR-based (forks repo, creates branch, submits PR to a5c-ai/babysitter)
|
|
20
|
+
* **Bugfix** → `cradle/bugfix.js#process` — User already has the fix for a bug
|
|
21
|
+
* **Feature implementation** → `cradle/feature-implementation-contribute.js#process` — User already has a feature implementation
|
|
22
|
+
* **Harness integration** → `cradle/feature-harness-integration-contribute.js#process` — User has a harness (CI/CD, IDE, editor) integration
|
|
23
|
+
* **Library contribution** → `cradle/library-contribution.js#process` — New or improved process/skill/subagent for the library
|
|
24
|
+
* **Documentation answer** → `cradle/documentation-contribute-answer.js#process` — User has an answer for an unanswered docs question
|
|
25
|
+
|
|
26
|
+
### Router (when arguments are empty or general)
|
|
27
|
+
* **Contribute** → `cradle/contribute.js#process` — Explains contribution types and routes to the specific process
|
|
28
|
+
|
|
29
|
+
## Contribution Rules
|
|
30
|
+
|
|
31
|
+
* PR-based contributions: fork the babysitter repo (a5c-ai/babysitter) for the user, ask to star if not already starred, perform changes, submit PR
|
|
32
|
+
* Issue-based contributions: gather details, search for duplicates, review, then open an issue in a5c-ai/babysitter
|
|
33
|
+
* Add breakpoints (permissions) before ALL gh actions (fork, star, submit PR/issue) to allow user review and cancellation
|
|
34
34
|
* If arguments are empty: use the `contribute.js` router process to show options and route accordingly
|
package/skills/doctor/SKILL.md
CHANGED
|
@@ -157,7 +157,6 @@ If it exists:
|
|
|
157
157
|
**Goal:** Inspect babysitter session files for health and detect runaway loops.
|
|
158
158
|
|
|
159
159
|
- Search for session state files using Glob:
|
|
160
|
-
- `plugins/babysitter/skills/babysit/state/*.md`
|
|
161
160
|
- `.a5c/state/*.md`
|
|
162
161
|
- `.a5c/state/*.json`
|
|
163
162
|
- For each session state file found:
|
|
@@ -261,7 +260,7 @@ Mark as PASS if total size < 500MB and no files > 10MB. Mark as WARN if total si
|
|
|
261
260
|
|
|
262
261
|
### 10a. Hook Registration
|
|
263
262
|
|
|
264
|
-
- Locate the plugin root. Check for `CLAUDE_PLUGIN_ROOT` env var, or search for `
|
|
263
|
+
- Locate the plugin root. Check for `CLAUDE_PLUGIN_ROOT` env var first, or search for a babysitter `hooks.json` by walking up from the current directory.
|
|
265
264
|
- If found, read `hooks.json` and verify:
|
|
266
265
|
- A `Stop` hook entry exists with a command referencing `babysitter-stop-hook.sh`.
|
|
267
266
|
- A `SessionStart` hook entry exists with a command referencing `babysitter-session-start-hook.sh`.
|
|
@@ -316,7 +315,7 @@ If the stop hook shows NO evidence of execution (no log entries, no journal even
|
|
|
316
315
|
|
|
317
316
|
Perform these diagnostic steps in order and report the first failure found:
|
|
318
317
|
|
|
319
|
-
1. **Plugin not installed**: Check if `
|
|
318
|
+
1. **Plugin not installed**: Check if `CLAUDE_PLUGIN_ROOT` is set or if a babysitter plugin directory exists relative to the project root. If neither exists, report: "Plugin not installed — the babysitter plugin directory is missing."
|
|
320
319
|
|
|
321
320
|
2. **Plugin not enabled**: Check for Claude settings files:
|
|
322
321
|
- `~/.claude/settings.json` — look for `babysitter` in `enabledPlugins`.
|
package/skills/help/SKILL.md
CHANGED
|
@@ -234,7 +234,8 @@ SECONDARY COMMANDS
|
|
|
234
234
|
How it works: Runs npx @a5c-ai/babysitter-observer-dashboard@latest which watches
|
|
235
235
|
the .a5c/runs/ directory (or a parent directory containing multiple projects) and
|
|
236
236
|
serves a live dashboard. The process is blocking -- it runs until you stop it, and
|
|
237
|
-
it prints the local URL to share with the user.
|
|
237
|
+
it prints the local URL to share with the user. Do not use `babysitter observe`
|
|
238
|
+
as a fallback; the core Babysitter CLI does not expose that subcommand.
|
|
238
239
|
|
|
239
240
|
Example: /babysitter:observe
|
|
240
241
|
(opens browser showing all runs with live-updating task
|
package/skills/observe/SKILL.md
CHANGED
|
@@ -8,6 +8,11 @@ description: Launch the babysitter observer dashboard. Installs and runs the rea
|
|
|
8
8
|
Run the babysitter observer dashboard:
|
|
9
9
|
|
|
10
10
|
1. Determine the watch directory — this is usually the project's container directory (the parent of the project dir), or the current working directory if not specified.
|
|
11
|
-
2. Launch the dashboard: `npx -y @a5c-ai/babysitter-observer-dashboard@latest --watch-dir <dir
|
|
11
|
+
2. Launch the standalone dashboard package: `npx -y @a5c-ai/babysitter-observer-dashboard@latest --watch-dir <dir>`.
|
|
12
12
|
3. This is a blocking process — it will keep running until stopped.
|
|
13
13
|
4. Report the URL printed by the dashboard to the user, then open it in the browser.
|
|
14
|
+
|
|
15
|
+
Do not fall back to `babysitter observe`; the core Babysitter CLI does not expose
|
|
16
|
+
that subcommand. Some harness runtimes may provide a separate
|
|
17
|
+
`agent-platform observe` surface, but this skill uses the verified standalone
|
|
18
|
+
dashboard package.
|