@a5c-ai/babysitter-github 5.0.1-staging.d8bdfcceaf4a → 5.0.1-staging.daf8e165bc4a
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/install-shared.js +4 -4
- package/commands/blueprints.md +64 -0
- package/commands/call.md +6 -6
- package/commands/check-forbidden-markers.md +68 -0
- package/commands/cleanup.md +7 -1
- package/commands/contrib.md +31 -31
- package/commands/forever.md +6 -6
- package/commands/help.md +9 -9
- package/commands/plan.md +17 -7
- package/commands/plugins.md +22 -255
- package/commands/project-install.md +10 -10
- package/commands/resume.md +8 -8
- package/commands/retrospect.md +55 -55
- package/commands/user-install.md +10 -10
- package/commands/yolo.md +1 -1
- package/package.json +2 -2
- package/plugin.json +1 -1
- package/skills/babysit/SKILL.md +31 -43
- package/skills/blueprints/SKILL.md +66 -0
- package/skills/check-forbidden-markers/SKILL.md +69 -0
- package/skills/cleanup/SKILL.md +7 -1
- package/skills/contrib/SKILL.md +25 -25
- package/skills/help/SKILL.md +9 -9
- package/skills/plan/SKILL.md +11 -1
- package/skills/plugins/SKILL.md +18 -251
- package/skills/project-install/SKILL.md +3 -3
- package/skills/resume/SKILL.md +1 -1
- package/skills/retrospect/SKILL.md +48 -48
- package/skills/user-install/SKILL.md +3 -3
- package/skills/yolo/SKILL.md +1 -1
- package/versions.json +2 -2
|
@@ -1,11 +1,11 @@
|
|
|
1
|
-
---
|
|
2
|
-
description: Set up a project for babysitting. Guides you through onboarding a new or existing project — researches the codebase, interviews you about goals and workflows, builds the project profile, installs the best tools, and optionally configures CI/CD integration.
|
|
3
|
-
argument-hint: Specific instructions for the run.
|
|
4
|
-
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
-
---
|
|
6
|
-
|
|
7
|
-
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
-
|
|
1
|
+
---
|
|
2
|
+
description: Set up a project for babysitting. Guides you through onboarding a new or existing project — researches the codebase, interviews you about goals and workflows, builds the project profile, installs the best tools, and optionally configures CI/CD integration.
|
|
3
|
+
argument-hint: Specific instructions for the run.
|
|
4
|
+
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
+
|
|
9
9
|
Before using the process library, resolve the active library root through the SDK CLI. If no binding exists yet, initialize the shared global SDK binding with:
|
|
10
10
|
|
|
11
11
|
```bash
|
|
@@ -13,5 +13,5 @@ babysitter process-library:active --json
|
|
|
13
13
|
```
|
|
14
14
|
|
|
15
15
|
Then use the `cradle/project-install` process from the active process library.
|
|
16
|
-
|
|
17
|
-
When the run completes, end with a friendly message that includes a polite and humorous ask to star the repo on GitHub: https://github.com/a5c-ai/babysitter
|
|
16
|
+
|
|
17
|
+
When the run completes, end with a friendly message that includes a polite and humorous ask to star the repo on GitHub: https://github.com/a5c-ai/babysitter
|
package/commands/resume.md
CHANGED
|
@@ -1,8 +1,8 @@
|
|
|
1
|
-
---
|
|
2
|
-
description: Resume orchestrating of a babysitter run. use this command to resume babysitting a complex workflow.
|
|
3
|
-
argument-hint: Specific run to resume
|
|
4
|
-
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
-
---
|
|
6
|
-
|
|
7
|
-
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). to resume a run.
|
|
8
|
-
if no run was given, discover the runs and suggest which incomplete run to resume based on the run's status, inputs, process , etc.
|
|
1
|
+
---
|
|
2
|
+
description: Resume orchestrating of a babysitter run. use this command to resume babysitting a complex workflow.
|
|
3
|
+
argument-hint: Specific run to resume
|
|
4
|
+
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). to resume a run.
|
|
8
|
+
if no run was given, discover the runs and suggest which incomplete run to resume based on the run's status, inputs, process , etc.
|
package/commands/retrospect.md
CHANGED
|
@@ -1,55 +1,55 @@
|
|
|
1
|
-
---
|
|
2
|
-
description: Analysis for a run and its results, process, suggestions for process improvements, process optimizations, fixes, etc. for the next runs.
|
|
3
|
-
argument-hint: "[run-id...] [--all] Specific run IDs, --all for all runs, or defaults to latest"
|
|
4
|
-
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
-
---
|
|
6
|
-
|
|
7
|
-
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
-
|
|
9
|
-
create and run a retrospect process:
|
|
10
|
-
|
|
11
|
-
### Run Selection
|
|
12
|
-
|
|
13
|
-
- `--all` or "all runs": list all completed/failed runs and analyze collectively
|
|
14
|
-
- Multiple run IDs: analyze each specified run
|
|
15
|
-
- Single run ID or no ID: existing behavior (latest run)
|
|
16
|
-
- In interactive mode with no run specified: ask user whether to analyze latest, select specific runs, or all runs
|
|
17
|
-
|
|
18
|
-
### Cross-Run Analysis (multi-run mode)
|
|
19
|
-
|
|
20
|
-
When analyzing multiple runs, the retrospect process should additionally cover:
|
|
21
|
-
- Common failure patterns across runs
|
|
22
|
-
- Velocity trends (tasks/time across runs)
|
|
23
|
-
- Process evolution (how processes changed)
|
|
24
|
-
- Repeated breakpoint patterns
|
|
25
|
-
- Aggregate quality metrics
|
|
26
|
-
|
|
27
|
-
implementations notes (for the process):
|
|
28
|
-
- The process should analyze the run, the process that was followed, and provide suggestions for improvements, optimizations, and fixes.
|
|
29
|
-
- The process should such have many breakpoints where the user can steer the process, provide feedback, and make decisions about how to proceed with the retrospect.
|
|
30
|
-
- The process should be designed to be flexible and adaptable to different types of runs, projects, and goals, and should be able to provide insights and suggestions that are relevant and actionable for the user. (modification to the process, skills, etc.)
|
|
31
|
-
- The process should be designed to be iterative, allowing the user to go through multiple rounds of analysis and improvement, and should be able to track the changes and improvements made over time.
|
|
32
|
-
- The process should cover:
|
|
33
|
-
- Analysis of the run and its results, including what went well, what didn't go well, and what could be improved.
|
|
34
|
-
- Analysis of the process that was followed, including what steps were taken, what tools were used, and how effective they were.
|
|
35
|
-
- Suggestions for improvements, optimizations, and fixes for both the run and the process.
|
|
36
|
-
- Implementing the improvements, optimizations, and fixes, and tracking the changes made over time.
|
|
37
|
-
### Cleanup Suggestion
|
|
38
|
-
|
|
39
|
-
After retrospect analysis, suggest running `/babysitter:cleanup` to clean up old run data and reclaim disk space.
|
|
40
|
-
|
|
41
|
-
- Ending by explicitly prompting the user to contribute back -- even just reporting an issue is valuable, they don't need to implement the fix themselves. After analysis, display a clear call-to-action:
|
|
42
|
-
|
|
43
|
-
"You've identified [specific insight/improvement]. This could help other babysitter users too. Run `/babysitter:contrib` to share it upstream -- you can either report it as an issue or submit a PR with the fix."
|
|
44
|
-
|
|
45
|
-
Route to the specific contrib workflow based on what the user wants to do:
|
|
46
|
-
|
|
47
|
-
**Just reporting (no code changes needed):**
|
|
48
|
-
- Found a bug or weakness in a process -> `/babysitter:contrib bug report: [description of what went wrong]`
|
|
49
|
-
- Found missing or confusing documentation -> `/babysitter:contrib documentation question: [what was unclear]`
|
|
50
|
-
- Have an idea for improvement but don't want to implement it -> `/babysitter:contrib feature request: [description]`
|
|
51
|
-
|
|
52
|
-
**Contributing code changes:**
|
|
53
|
-
- Process/skill/agent improvements -> `/babysitter:contrib library contribution: [description]`
|
|
54
|
-
- Bug fixes in SDK or CLI -> `/babysitter:contrib bugfix: [description]`
|
|
55
|
-
- Plugin instruction improvements -> `/babysitter:contrib library contribution: improved [plugin-name] [install|configure|uninstall] instructions`
|
|
1
|
+
---
|
|
2
|
+
description: Analysis for a run and its results, process, suggestions for process improvements, process optimizations, fixes, etc. for the next runs.
|
|
3
|
+
argument-hint: "[run-id...] [--all] Specific run IDs, --all for all runs, or defaults to latest"
|
|
4
|
+
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
+
|
|
9
|
+
create and run a retrospect process:
|
|
10
|
+
|
|
11
|
+
### Run Selection
|
|
12
|
+
|
|
13
|
+
- `--all` or "all runs": list all completed/failed runs and analyze collectively
|
|
14
|
+
- Multiple run IDs: analyze each specified run
|
|
15
|
+
- Single run ID or no ID: existing behavior (latest run)
|
|
16
|
+
- In interactive mode with no run specified: ask user whether to analyze latest, select specific runs, or all runs
|
|
17
|
+
|
|
18
|
+
### Cross-Run Analysis (multi-run mode)
|
|
19
|
+
|
|
20
|
+
When analyzing multiple runs, the retrospect process should additionally cover:
|
|
21
|
+
- Common failure patterns across runs
|
|
22
|
+
- Velocity trends (tasks/time across runs)
|
|
23
|
+
- Process evolution (how processes changed)
|
|
24
|
+
- Repeated breakpoint patterns
|
|
25
|
+
- Aggregate quality metrics
|
|
26
|
+
|
|
27
|
+
implementations notes (for the process):
|
|
28
|
+
- The process should analyze the run, the process that was followed, and provide suggestions for improvements, optimizations, and fixes.
|
|
29
|
+
- The process should such have many breakpoints where the user can steer the process, provide feedback, and make decisions about how to proceed with the retrospect.
|
|
30
|
+
- The process should be designed to be flexible and adaptable to different types of runs, projects, and goals, and should be able to provide insights and suggestions that are relevant and actionable for the user. (modification to the process, skills, etc.)
|
|
31
|
+
- The process should be designed to be iterative, allowing the user to go through multiple rounds of analysis and improvement, and should be able to track the changes and improvements made over time.
|
|
32
|
+
- The process should cover:
|
|
33
|
+
- Analysis of the run and its results, including what went well, what didn't go well, and what could be improved.
|
|
34
|
+
- Analysis of the process that was followed, including what steps were taken, what tools were used, and how effective they were.
|
|
35
|
+
- Suggestions for improvements, optimizations, and fixes for both the run and the process.
|
|
36
|
+
- Implementing the improvements, optimizations, and fixes, and tracking the changes made over time.
|
|
37
|
+
### Cleanup Suggestion
|
|
38
|
+
|
|
39
|
+
After retrospect analysis, suggest running `/babysitter:cleanup` to clean up old run data and reclaim disk space.
|
|
40
|
+
|
|
41
|
+
- Ending by explicitly prompting the user to contribute back -- even just reporting an issue is valuable, they don't need to implement the fix themselves. After analysis, display a clear call-to-action:
|
|
42
|
+
|
|
43
|
+
"You've identified [specific insight/improvement]. This could help other babysitter users too. Run `/babysitter:contrib` to share it upstream -- you can either report it as an issue or submit a PR with the fix."
|
|
44
|
+
|
|
45
|
+
Route to the specific contrib workflow based on what the user wants to do:
|
|
46
|
+
|
|
47
|
+
**Just reporting (no code changes needed):**
|
|
48
|
+
- Found a bug or weakness in a process -> `/babysitter:contrib bug report: [description of what went wrong]`
|
|
49
|
+
- Found missing or confusing documentation -> `/babysitter:contrib documentation question: [what was unclear]`
|
|
50
|
+
- Have an idea for improvement but don't want to implement it -> `/babysitter:contrib feature request: [description]`
|
|
51
|
+
|
|
52
|
+
**Contributing code changes:**
|
|
53
|
+
- Process/skill/agent improvements -> `/babysitter:contrib library contribution: [description]`
|
|
54
|
+
- Bug fixes in SDK or CLI -> `/babysitter:contrib bugfix: [description]`
|
|
55
|
+
- Plugin instruction improvements -> `/babysitter:contrib library contribution: improved [plugin-name] [install|configure|uninstall] instructions`
|
package/commands/user-install.md
CHANGED
|
@@ -1,11 +1,11 @@
|
|
|
1
|
-
---
|
|
2
|
-
description: Set up babysitter for yourself. Guides you through onboarding — installs dependencies, interviews you about your specialties and preferences, builds your user profile, and configures the best tools for your workflow.
|
|
3
|
-
argument-hint: Specific instructions for the run.
|
|
4
|
-
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
-
---
|
|
6
|
-
|
|
7
|
-
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
-
|
|
1
|
+
---
|
|
2
|
+
description: Set up babysitter for yourself. Guides you through onboarding — installs dependencies, interviews you about your specialties and preferences, builds your user profile, and configures the best tools for your workflow.
|
|
3
|
+
argument-hint: Specific instructions for the run.
|
|
4
|
+
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
+
|
|
9
9
|
Before using the process library, resolve the active library root through the SDK CLI. If no binding exists yet, initialize the shared global SDK binding with:
|
|
10
10
|
|
|
11
11
|
```bash
|
|
@@ -13,5 +13,5 @@ babysitter process-library:active --json
|
|
|
13
13
|
```
|
|
14
14
|
|
|
15
15
|
Then use the `cradle/user-install` process from the active process library.
|
|
16
|
-
|
|
17
|
-
When the run completes, end with a friendly message that includes a polite and humorous ask to star the repo on GitHub: https://github.com/a5c-ai/babysitter
|
|
16
|
+
|
|
17
|
+
When the run completes, end with a friendly message that includes a polite and humorous ask to star the repo on GitHub: https://github.com/a5c-ai/babysitter
|
package/commands/yolo.md
CHANGED
|
@@ -4,7 +4,7 @@ argument-hint: Specific instructions for the run.
|
|
|
4
4
|
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
5
|
---
|
|
6
6
|
|
|
7
|
-
Run the Babysitter orchestration instructions directly through the CLI, without any user interaction or breakpoints.
|
|
7
|
+
Run the Babysitter orchestration instructions directly through the CLI, without any user interaction or breakpoints. Use Bash to run `babysitter instructions:babysit-skill --harness github-copilot --no-interactive`, then follow the returned instructions in this same turn until completion proof is produced. Do not stop after reading the instructions, do not invoke the Skill tool first, and use the non-interactive/no-breakpoints path when the instructions offer a mode choice.
|
|
8
8
|
|
|
9
9
|
User arguments for this command:
|
|
10
10
|
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "@a5c-ai/babysitter-github",
|
|
3
|
-
"version": "5.0.1-staging.
|
|
3
|
+
"version": "5.0.1-staging.daf8e165bc4a",
|
|
4
4
|
"description": "Orchestrate complex, multi-step workflows with event-sourced state management, hook-based extensibility, and human-in-the-loop approval",
|
|
5
5
|
"scripts": {
|
|
6
6
|
"deploy": "npm publish --access public",
|
|
@@ -36,7 +36,7 @@
|
|
|
36
36
|
"access": "public"
|
|
37
37
|
},
|
|
38
38
|
"dependencies": {
|
|
39
|
-
"@a5c-ai/babysitter-sdk": "5.0.1-staging.
|
|
39
|
+
"@a5c-ai/babysitter-sdk": "5.0.1-staging.daf8e165bc4a"
|
|
40
40
|
},
|
|
41
41
|
"repository": {
|
|
42
42
|
"type": "git",
|
package/plugin.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "babysitter",
|
|
3
|
-
"version": "5.0.1-staging.
|
|
3
|
+
"version": "5.0.1-staging.daf8e165bc4a",
|
|
4
4
|
"description": "Orchestrate complex, multi-step workflows with event-sourced state management, hook-based extensibility, and human-in-the-loop approval",
|
|
5
5
|
"author": {
|
|
6
6
|
"name": "a5c.ai"
|
package/skills/babysit/SKILL.md
CHANGED
|
@@ -1,13 +1,25 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: babysit
|
|
3
|
-
description:
|
|
4
|
-
|
|
3
|
+
description: Execute via @babysitter. Use this skill when asked to babysit a task, do anything that is structured process-driven (even a loop) or whenever it is called explicitly. (babysit, babysitter, with a process, orchestrate, orchestrate a run, workflow, loop until, etc.)
|
|
4
|
+
allowed-tools: Read, Grep, Write, Task, Bash, Edit, Grep, Glob, WebFetch, WebSearch, Search, AskUserQuestion, TodoWrite, TodoRead, Skill, BashOutput, KillShell, MultiEdit, LS
|
|
5
|
+
version: 0.1.3
|
|
5
6
|
---
|
|
6
7
|
|
|
7
8
|
# babysit
|
|
8
9
|
|
|
9
10
|
Follow the commands below exactly. but apply judgement and creativity to get the job done. no matter how small or simple the task may seem, you must use the babysitter SDK and run lifecycle to execute it. This is non-negotiable. If you are asked to do something that seems like it could be done with a simple tool call, you must still use the babysitter SDK to create a run, define tasks, execute them, and complete the run. This is how you will learn and demonstrate mastery of the babysitter system. Always follow the full process, even for trivial tasks.
|
|
10
11
|
|
|
12
|
+
Subagents that need a scratch checkout or working directory must create it under
|
|
13
|
+
`/tmp/<descriptive-name>/`, not under `.a5c/runs/<runId>/work`. Before returning
|
|
14
|
+
deliverables, validate that no run-dir worktree was left behind, for example:
|
|
15
|
+
|
|
16
|
+
```bash
|
|
17
|
+
find .a5c/runs -maxdepth 3 -name work -type d -print
|
|
18
|
+
```
|
|
19
|
+
|
|
20
|
+
That command should print nothing. If it prints a non-empty work directory, move
|
|
21
|
+
or remove only the scratch data you created before returning.
|
|
22
|
+
|
|
11
23
|
## Dependencies
|
|
12
24
|
|
|
13
25
|
### Babysitter SDK and CLI
|
|
@@ -15,13 +27,19 @@ Follow the commands below exactly. but apply judgement and creativity to get the
|
|
|
15
27
|
Read the SDK version from `versions.json` to ensure version compatibility:
|
|
16
28
|
|
|
17
29
|
```bash
|
|
18
|
-
SDK_VERSION=$(node -e "try{console.log(JSON.parse(require('fs').readFileSync('${
|
|
19
|
-
npm i -g @a5c-ai/babysitter-sdk@$SDK_VERSION
|
|
20
|
-
|
|
21
|
-
|
|
30
|
+
SDK_VERSION=$(node -e "try{console.log(JSON.parse(require('fs').readFileSync('${COPILOT_PLUGIN_DIR}/versions.json','utf8')).sdkVersion||'latest')}catch{console.log('latest')}")
|
|
31
|
+
npm i -g @a5c-ai/babysitter-sdk@$SDK_VERSION || npm i -g @a5c-ai/babysitter-sdk@latest
|
|
32
|
+
|
|
33
|
+
if command -v babysitter >/dev/null 2>&1 && babysitter --version >/dev/null 2>&1; then
|
|
34
|
+
CLI="babysitter"
|
|
35
|
+
else
|
|
36
|
+
CLI="npm exec --yes --package @a5c-ai/babysitter-sdk@$SDK_VERSION -- babysitter"
|
|
37
|
+
fi
|
|
22
38
|
```
|
|
23
39
|
|
|
24
|
-
If
|
|
40
|
+
If the pinned version fails to install (e.g. not yet published), the fallback installs `latest`.
|
|
41
|
+
|
|
42
|
+
If a stale or broken global shim fails with `MODULE_NOT_FOUND`, repair it with `npm rm -g @a5c-ai/babysitter @a5c-ai/babysitter-sdk && npm i -g @a5c-ai/babysitter-sdk@$SDK_VERSION`, then re-run `babysitter --version`.
|
|
25
43
|
|
|
26
44
|
### jq
|
|
27
45
|
|
|
@@ -29,46 +47,16 @@ Make sure `jq` is installed and available in the path. If not, install it.
|
|
|
29
47
|
|
|
30
48
|
## Instructions
|
|
31
49
|
|
|
32
|
-
Run the following command to get full
|
|
50
|
+
Run the following command to get full instructions:
|
|
33
51
|
|
|
34
52
|
```bash
|
|
35
|
-
$CLI instructions:babysit-skill --harness github-copilot --
|
|
53
|
+
$CLI instructions:babysit-skill --harness github-copilot --interactive
|
|
36
54
|
```
|
|
37
55
|
|
|
38
|
-
|
|
39
|
-
|
|
40
|
-
## GitHub Copilot CLI -- In-Turn Loop Model
|
|
41
|
-
|
|
42
|
-
**IMPORTANT**: Unlike Claude Code, GitHub Copilot CLI does NOT have a Stop hook
|
|
43
|
-
that can drive the orchestration loop between turns. The `sessionEnd` hook output
|
|
44
|
-
is ignored and cannot block or restart the session.
|
|
45
|
-
|
|
46
|
-
Therefore, you MUST use **in-turn iteration**: run the full orchestration loop
|
|
47
|
-
within a single session turn. The pattern is:
|
|
48
|
-
|
|
49
|
-
1. `$CLI run:iterate --json` -- get pending actions
|
|
50
|
-
2. For each pending action: execute it (run tasks, post results via `task:post`)
|
|
51
|
-
3. `$CLI run:iterate --json` -- check for more pending actions
|
|
52
|
-
4. Repeat steps 2-3 until run completes or reaches a breakpoint requiring user input
|
|
53
|
-
5. If a breakpoint requires user input, ask the user and post the response, then continue iterating
|
|
54
|
-
|
|
55
|
-
All iteration happens within the same turn -- do NOT rely on hooks to re-enter
|
|
56
|
-
the orchestration loop. The agent drives the loop directly by calling
|
|
57
|
-
`run:iterate` repeatedly until completion.
|
|
58
|
-
|
|
59
|
-
### Loop Example
|
|
56
|
+
For non-interactive mode (running with `-p` flag or no AskUserQuestion tool):
|
|
60
57
|
|
|
61
58
|
```bash
|
|
62
|
-
|
|
63
|
-
RESULT=$($CLI run:iterate --run-id "$RUN_ID" --json)
|
|
64
|
-
STATUS=$(echo "$RESULT" | jq -r '.status')
|
|
65
|
-
|
|
66
|
-
while [ "$STATUS" != "completed" ] && [ "$STATUS" != "failed" ]; do
|
|
67
|
-
# Process pending actions from RESULT
|
|
68
|
-
# ... execute tasks, post results ...
|
|
69
|
-
|
|
70
|
-
# Iterate again
|
|
71
|
-
RESULT=$($CLI run:iterate --run-id "$RUN_ID" --json)
|
|
72
|
-
STATUS=$(echo "$RESULT" | jq -r '.status')
|
|
73
|
-
done
|
|
59
|
+
$CLI instructions:babysit-skill --harness github-copilot --no-interactive
|
|
74
60
|
```
|
|
61
|
+
|
|
62
|
+
Follow the instructions returned by the command above to orchestrate the run.
|
|
@@ -0,0 +1,66 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: blueprints
|
|
3
|
+
description: manage Babysitter blueprints. Use this command to list installed blueprints, browse marketplaces, install, update, uninstall, configure, or create a new blueprint.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# blueprints
|
|
7
|
+
|
|
8
|
+
This command installs and manages Babysitter blueprints. A blueprint is a version-managed package of contextual instructions or deterministic Babysitter processes, not a conventional software plugin.
|
|
9
|
+
|
|
10
|
+
If the command is run without arguments, list installed blueprints with their name, version, marketplace, installation date, and last update date. Also list configured marketplaces and show how to add the default marketplace when none exist.
|
|
11
|
+
|
|
12
|
+
Blueprints can be installed at two scopes:
|
|
13
|
+
|
|
14
|
+
- **global** (`--global`): stored under `~/.a5c/`, available for all projects
|
|
15
|
+
- **project** (`--project`): stored under `<projectDir>/.a5c/`, project-specific
|
|
16
|
+
|
|
17
|
+
## Marketplace Management
|
|
18
|
+
|
|
19
|
+
Marketplaces are git repositories containing a `marketplace.json` manifest and blueprint package directories. The SDK clones new marketplaces to `.a5c/blueprints/marketplaces/` for the selected scope and reads legacy `.a5c/marketplaces/` clones for compatibility.
|
|
20
|
+
|
|
21
|
+
### Add a marketplace
|
|
22
|
+
|
|
23
|
+
```bash
|
|
24
|
+
babysitter blueprint:add-marketplace --marketplace-url <url> [--marketplace-path <relative-path>] [--marketplace-branch <ref>] [--force] --global|--project [--json]
|
|
25
|
+
```
|
|
26
|
+
|
|
27
|
+
### Update a marketplace
|
|
28
|
+
|
|
29
|
+
```bash
|
|
30
|
+
babysitter blueprint:update-marketplace --marketplace-name <name> [--marketplace-branch <ref>] --global|--project [--json]
|
|
31
|
+
```
|
|
32
|
+
|
|
33
|
+
### List blueprints in a marketplace
|
|
34
|
+
|
|
35
|
+
```bash
|
|
36
|
+
babysitter blueprint:list-plugins --marketplace-name <name> --global|--project [--json]
|
|
37
|
+
```
|
|
38
|
+
|
|
39
|
+
## Blueprint Lifecycle
|
|
40
|
+
|
|
41
|
+
For `blueprint:install`, `blueprint:update`, `blueprint:configure`, and `blueprint:list-plugins`, the `--marketplace-name` flag is auto-detected when only one marketplace is cloned for the selected scope.
|
|
42
|
+
|
|
43
|
+
```bash
|
|
44
|
+
babysitter blueprint:install --plugin-name <name> [--marketplace-name <mp>] --global|--project [--json]
|
|
45
|
+
babysitter blueprint:update --plugin-name <name> --marketplace-name <mp> --global|--project [--json]
|
|
46
|
+
babysitter blueprint:configure --plugin-name <name> --marketplace-name <mp> --global|--project [--json]
|
|
47
|
+
babysitter blueprint:uninstall --plugin-name <name> --marketplace-name <mp> --global|--project [--json]
|
|
48
|
+
```
|
|
49
|
+
|
|
50
|
+
The `--plugin-name` flag is preserved for CLI compatibility with existing marketplace manifests. User-facing docs should call the installable a blueprint.
|
|
51
|
+
|
|
52
|
+
## Registry Management
|
|
53
|
+
|
|
54
|
+
```bash
|
|
55
|
+
babysitter blueprint:list-installed --global|--project [--json]
|
|
56
|
+
babysitter blueprint:update-registry --plugin-name <name> --plugin-version <ver> --marketplace-name <mp> --global|--project [--json]
|
|
57
|
+
babysitter blueprint:remove-from-registry --plugin-name <name> --global|--project [--json]
|
|
58
|
+
```
|
|
59
|
+
|
|
60
|
+
## Deprecated Aliases
|
|
61
|
+
|
|
62
|
+
The old `plugin:*` commands remain available as deprecated aliases for one release. Prefer `blueprint:*` in new docs, skills, and process instructions.
|
|
63
|
+
|
|
64
|
+
## Agent Plugins Are Separate
|
|
65
|
+
|
|
66
|
+
Do not rename or reinterpret agent harness plugins while handling blueprints. `CLAUDE_PLUGIN_ROOT`, `PI_PLUGIN_ROOT`, `.claude/plugins/`, hooks-mux, extension-mux, and agent plugin manifests stay plugin-specific.
|
|
@@ -0,0 +1,69 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: check-forbidden-markers
|
|
3
|
+
description: Pre-deploy gate that scans built JS chunks for forbidden substring markers (saga-era / obsolete code paths) listed in a project-local forbidden-markers.txt
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# check-forbidden-markers
|
|
7
|
+
|
|
8
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). Compose the gate from the shared helper at `library/processes/shared/forbidden-markers-scanner.js` (issue #477).
|
|
9
|
+
|
|
10
|
+
## What this gate does
|
|
11
|
+
|
|
12
|
+
Reads a list of literal substring markers from `scripts/forbidden-markers.txt` (blank lines and `#`-prefixed comments stripped) and greps every `.js` chunk under `.vercel/output/static/_next/static/chunks/` (Next.js / Vercel default; configurable) for any occurrence. Reports structured hits per `(marker, chunk)` pair with occurrence counts. Designed to chain between `vercel build --prod` and `vercel deploy --prod`.
|
|
13
|
+
|
|
14
|
+
Use this gate when a refactor or restart-from-baseline replaced load-bearing code paths and you need a structural guarantee the obsolete symbols never re-ship. Burned-in evidence: cookbook VI-9 / VI-12 near-miss revivals during the 2026-05 iOS-Safari saga; the prototype lives at `cookbook/scripts/check-no-forbidden.mjs` and shipped two upstream contributions before being generalized as this gate.
|
|
15
|
+
|
|
16
|
+
## When to use
|
|
17
|
+
|
|
18
|
+
- **Pre-deploy.** Insert after build, before deploy. Block the deploy when `ok: false`.
|
|
19
|
+
- **Post-restart.** After a baseline rollback + step-by-step re-add, snapshot the saga-era markers in `forbidden-markers.txt` and let CI hold the line.
|
|
20
|
+
- **Post-refactor.** When old helper / handler / module names must not coexist with the new ones in the same bundle.
|
|
21
|
+
|
|
22
|
+
## Expected config locations
|
|
23
|
+
|
|
24
|
+
- `scripts/forbidden-markers.txt` — one marker per line, `#` for comments. The list is the contract; the gate is mechanical. Commit this file to source control.
|
|
25
|
+
- `.vercel/output/static/_next/static/chunks/` — default scan target. Override for non-Vercel frameworks via the `--chunks-dir` flag or the `chunksDir` task input.
|
|
26
|
+
|
|
27
|
+
A missing markers file is a no-op (`ok: true`, `reason: 'missing-markers-file'`) — misconfiguration is never a deploy block. A missing chunks directory is likewise a no-op (`reason: 'missing-chunks-dir'`) so the gate is safe to chain into `check:all` before the build runs.
|
|
28
|
+
|
|
29
|
+
## Exit semantics
|
|
30
|
+
|
|
31
|
+
| Reason | `ok` | Deploy decision |
|
|
32
|
+
|-------------------------|--------|--------------------------------|
|
|
33
|
+
| `missing-markers-file` | true | Pass (no gate active) |
|
|
34
|
+
| `missing-chunks-dir` | true | Pass (run before build) |
|
|
35
|
+
| `empty-markers` | true | Pass (list is empty) |
|
|
36
|
+
| `no-chunks` | true | Pass (nothing to scan) |
|
|
37
|
+
| `clean` | true | Pass — proceed to deploy |
|
|
38
|
+
| `hits` | false | **BLOCK** — surface hits, ask for triage |
|
|
39
|
+
|
|
40
|
+
For each hit, the gate emits `{ marker, chunk, count }` so the operator sees the exact marker string, the absolute chunk path, and the number of occurrences in that chunk. Multiple hits across chunks for the same marker are reported separately.
|
|
41
|
+
|
|
42
|
+
## Programmatic surface
|
|
43
|
+
|
|
44
|
+
```js
|
|
45
|
+
import { scanForbiddenMarkers, checkForbiddenMarkersTask } from '@a5c-ai/babysitter-library/processes/shared';
|
|
46
|
+
|
|
47
|
+
// Direct call:
|
|
48
|
+
const result = await scanForbiddenMarkers({
|
|
49
|
+
markersFile: 'scripts/forbidden-markers.txt',
|
|
50
|
+
chunksDir: '.vercel/output/static/_next/static/chunks',
|
|
51
|
+
});
|
|
52
|
+
if (!result.ok) {
|
|
53
|
+
// result.hits: Array<{ marker, chunk, count }>
|
|
54
|
+
// result.reason === 'hits'
|
|
55
|
+
process.exit(1);
|
|
56
|
+
}
|
|
57
|
+
|
|
58
|
+
// Or dispatched as a babysitter task:
|
|
59
|
+
const gate = await ctx.task(checkForbiddenMarkersTask, {
|
|
60
|
+
projectDir: '.',
|
|
61
|
+
// markersFile / chunksDir are inferred from projectDir if omitted
|
|
62
|
+
});
|
|
63
|
+
```
|
|
64
|
+
|
|
65
|
+
## Reference
|
|
66
|
+
|
|
67
|
+
- Issue: https://github.com/a5c-ai/babysitter/issues/477
|
|
68
|
+
- Helper module: `library/processes/shared/forbidden-markers-scanner.js`
|
|
69
|
+
- Origin (cookbook prototype): `cookbook/scripts/check-no-forbidden.mjs` (81 lines)
|
package/skills/cleanup/SKILL.md
CHANGED
|
@@ -7,7 +7,13 @@ description: Clean up .a5c/runs and .a5c/processes directories. Aggregates insig
|
|
|
7
7
|
|
|
8
8
|
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
9
9
|
|
|
10
|
-
|
|
10
|
+
Resolve the active process library with:
|
|
11
|
+
|
|
12
|
+
```bash
|
|
13
|
+
babysitter process-library:active --json
|
|
14
|
+
```
|
|
15
|
+
|
|
16
|
+
Read `binding.dir` from that JSON and create/run the cleanup process from `cradle/cleanup-runs.js#process` relative to that active library root. Do not use plugin-cache-relative cradle paths.
|
|
11
17
|
|
|
12
18
|
Implementation notes (for the process):
|
|
13
19
|
- Parse arguments for `--dry-run` flag (if present, set dryRun: true in inputs) and `--keep-days N` (default: 7)
|
package/skills/contrib/SKILL.md
CHANGED
|
@@ -5,30 +5,30 @@ description: Submit feedback or contribute to babysitter project
|
|
|
5
5
|
|
|
6
6
|
# contrib
|
|
7
7
|
|
|
8
|
-
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
9
|
-
|
|
10
|
-
## Process Routing
|
|
11
|
-
|
|
8
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
9
|
+
|
|
10
|
+
## Process Routing
|
|
11
|
+
|
|
12
12
|
Contribution processes live under the active process library's `cradle/` directory. Resolve the active library root with `babysitter process-library:active --json` and route based on arguments:
|
|
13
|
-
|
|
14
|
-
### Issue-based (opens a GitHub issue in a5c-ai/babysitter)
|
|
15
|
-
* **Bug report** → `cradle/bug-report.js#process` — Report a bug in the SDK, CLI, process library, etc.
|
|
16
|
-
* **Feature request** → `cradle/feature-request.js#process` — Request a new feature or enhancement
|
|
17
|
-
* **Documentation question** → `cradle/documentation-question.js#process` — Ask about undocumented behavior or missing docs
|
|
18
|
-
|
|
19
|
-
### PR-based (forks repo, creates branch, submits PR to a5c-ai/babysitter)
|
|
20
|
-
* **Bugfix** → `cradle/bugfix.js#process` — User already has the fix for a bug
|
|
21
|
-
* **Feature implementation** → `cradle/feature-implementation-contribute.js#process` — User already has a feature implementation
|
|
22
|
-
* **Harness integration** → `cradle/feature-harness-integration-contribute.js#process` — User has a harness (CI/CD, IDE, editor) integration
|
|
23
|
-
* **Library contribution** → `cradle/library-contribution.js#process` — New or improved process/skill/subagent for the library
|
|
24
|
-
* **Documentation answer** → `cradle/documentation-contribute-answer.js#process` — User has an answer for an unanswered docs question
|
|
25
|
-
|
|
26
|
-
### Router (when arguments are empty or general)
|
|
27
|
-
* **Contribute** → `cradle/contribute.js#process` — Explains contribution types and routes to the specific process
|
|
28
|
-
|
|
29
|
-
## Contribution Rules
|
|
30
|
-
|
|
31
|
-
* PR-based contributions: fork the babysitter repo (a5c-ai/babysitter) for the user, ask to star if not already starred, perform changes, submit PR
|
|
32
|
-
* Issue-based contributions: gather details, search for duplicates, review, then open an issue in a5c-ai/babysitter
|
|
33
|
-
* Add breakpoints (permissions) before ALL gh actions (fork, star, submit PR/issue) to allow user review and cancellation
|
|
13
|
+
|
|
14
|
+
### Issue-based (opens a GitHub issue in a5c-ai/babysitter)
|
|
15
|
+
* **Bug report** → `cradle/bug-report.js#process` — Report a bug in the SDK, CLI, process library, etc.
|
|
16
|
+
* **Feature request** → `cradle/feature-request.js#process` — Request a new feature or enhancement
|
|
17
|
+
* **Documentation question** → `cradle/documentation-question.js#process` — Ask about undocumented behavior or missing docs
|
|
18
|
+
|
|
19
|
+
### PR-based (forks repo, creates branch, submits PR to a5c-ai/babysitter)
|
|
20
|
+
* **Bugfix** → `cradle/bugfix.js#process` — User already has the fix for a bug
|
|
21
|
+
* **Feature implementation** → `cradle/feature-implementation-contribute.js#process` — User already has a feature implementation
|
|
22
|
+
* **Harness integration** → `cradle/feature-harness-integration-contribute.js#process` — User has a harness (CI/CD, IDE, editor) integration
|
|
23
|
+
* **Library contribution** → `cradle/library-contribution.js#process` — New or improved process/skill/subagent for the library
|
|
24
|
+
* **Documentation answer** → `cradle/documentation-contribute-answer.js#process` — User has an answer for an unanswered docs question
|
|
25
|
+
|
|
26
|
+
### Router (when arguments are empty or general)
|
|
27
|
+
* **Contribute** → `cradle/contribute.js#process` — Explains contribution types and routes to the specific process
|
|
28
|
+
|
|
29
|
+
## Contribution Rules
|
|
30
|
+
|
|
31
|
+
* PR-based contributions: fork the babysitter repo (a5c-ai/babysitter) for the user, ask to star if not already starred, perform changes, submit PR
|
|
32
|
+
* Issue-based contributions: gather details, search for duplicates, review, then open an issue in a5c-ai/babysitter
|
|
33
|
+
* Add breakpoints (permissions) before ALL gh actions (fork, star, submit PR/issue) to allow user review and cancellation
|
|
34
34
|
* If arguments are empty: use the `contribute.js` router process to show options and route accordingly
|
package/skills/help/SKILL.md
CHANGED
|
@@ -180,13 +180,13 @@ SECONDARY COMMANDS
|
|
|
180
180
|
a fuzzy comparison step before strict assertion. Implement this fix?")
|
|
181
181
|
|
|
182
182
|
|
|
183
|
-
/babysitter:
|
|
184
|
-
Manage
|
|
185
|
-
update, configure, uninstall, or create new
|
|
186
|
-
instruction packages
|
|
187
|
-
configure, and uninstall steps
|
|
183
|
+
/babysitter:blueprints [action]
|
|
184
|
+
Manage Babysitter blueprints: list installed blueprints, browse marketplaces,
|
|
185
|
+
install, update, configure, uninstall, or create new blueprints. Blueprints are
|
|
186
|
+
version-managed instruction packages or process bundles that guide the agent
|
|
187
|
+
through install, configure, and uninstall steps.
|
|
188
188
|
|
|
189
|
-
Without arguments: shows installed
|
|
189
|
+
Without arguments: shows installed blueprints (name, version, marketplace, dates) and
|
|
190
190
|
available marketplaces. With arguments: routes to the specific action.
|
|
191
191
|
|
|
192
192
|
Key actions:
|
|
@@ -194,11 +194,11 @@ SECONDARY COMMANDS
|
|
|
194
194
|
- configure <name> --global|--project: fetch configure.md and walk through options
|
|
195
195
|
- update <name> --global|--project: resolve migration chain via BFS and apply steps
|
|
196
196
|
- uninstall <name> --global|--project: fetch uninstall.md and execute removal
|
|
197
|
-
- create: scaffold a new
|
|
197
|
+
- create: scaffold a new blueprint package
|
|
198
198
|
|
|
199
|
-
Example: /babysitter:
|
|
199
|
+
Example: /babysitter:blueprints install sound-hooks --project
|
|
200
200
|
(fetches sound-hooks from marketplace, reads install.md, walks you through player
|
|
201
|
-
detection, sound selection, hook configuration, and registers
|
|
201
|
+
detection, sound selection, hook configuration, and registers the blueprint)
|
|
202
202
|
|
|
203
203
|
|
|
204
204
|
/babysitter:contrib [feedback]
|
package/skills/plan/SKILL.md
CHANGED
|
@@ -5,4 +5,14 @@ description: Plan a babysitter run. use this command to plan a complex workflow,
|
|
|
5
5
|
|
|
6
6
|
# plan
|
|
7
7
|
|
|
8
|
-
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md).
|
|
8
|
+
Invoke the babysitter:babysit skill (using the Skill tool) and follow its instructions (SKILL.md). Focus on creating the best process possible, but without creating and running the actual run.
|
|
9
|
+
|
|
10
|
+
Before drafting the process, run Phase 0 -- REUSE-AUDIT: extract keyword nouns and verbs from the request, scan for matching existing migrations, API routes, environment variables, SDK dependencies, and imports, honor `.a5c/reuse-audit.json` when present, and put a `Reuse-audit findings (REVIEW BEFORE PROCEEDING)` block before Phase 1 of the plan.
|
|
11
|
+
|
|
12
|
+
## Process Shape Selection
|
|
13
|
+
|
|
14
|
+
Choose the process shape before authoring `process.js`:
|
|
15
|
+
|
|
16
|
+
- Use a flat phase list when the spec is well-defined, the work is wiring or composition, the bug class is already known if this is a fix, and execution should proceed sequentially through clear phases.
|
|
17
|
+
- Use a HYPOTHESES tree when the bug class is unknown, forensics are required, multiple causal models compete, and each hypothesis needs its own observations, falsifying observations, and follow-up phases.
|
|
18
|
+
- Rule of thumb: if the first phase is "investigate", use HYPOTHESES-tree mode. If the first phase is "implement X", use flat-phase-list mode.
|