@bradygaster/squad-sdk 0.9.0 → 0.9.2-insider.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +44 -0
- package/dist/agents/history-shadow.d.ts +7 -5
- package/dist/agents/history-shadow.d.ts.map +1 -1
- package/dist/agents/history-shadow.js +39 -48
- package/dist/agents/history-shadow.js.map +1 -1
- package/dist/agents/index.d.ts +12 -1
- package/dist/agents/index.d.ts.map +1 -1
- package/dist/agents/index.js +62 -9
- package/dist/agents/index.js.map +1 -1
- package/dist/agents/lifecycle.d.ts +4 -0
- package/dist/agents/lifecycle.d.ts.map +1 -1
- package/dist/agents/lifecycle.js +6 -7
- package/dist/agents/lifecycle.js.map +1 -1
- package/dist/agents/onboarding.d.ts +4 -2
- package/dist/agents/onboarding.d.ts.map +1 -1
- package/dist/agents/onboarding.js +26 -16
- package/dist/agents/onboarding.js.map +1 -1
- package/dist/agents/personal.d.ts +2 -1
- package/dist/agents/personal.d.ts.map +1 -1
- package/dist/agents/personal.js +11 -12
- package/dist/agents/personal.js.map +1 -1
- package/dist/build/bundle.d.ts.map +1 -1
- package/dist/build/bundle.js +6 -6
- package/dist/build/bundle.js.map +1 -1
- package/dist/build/release.d.ts.map +1 -1
- package/dist/build/release.js +7 -5
- package/dist/build/release.js.map +1 -1
- package/dist/casting/index.d.ts.map +1 -1
- package/dist/casting/index.js +4 -3
- package/dist/casting/index.js.map +1 -1
- package/dist/config/agent-source.d.ts +5 -1
- package/dist/config/agent-source.d.ts.map +1 -1
- package/dist/config/agent-source.js +85 -41
- package/dist/config/agent-source.js.map +1 -1
- package/dist/config/init.d.ts +4 -3
- package/dist/config/init.d.ts.map +1 -1
- package/dist/config/init.js +84 -63
- package/dist/config/init.js.map +1 -1
- package/dist/config/legacy-fallback.d.ts +3 -2
- package/dist/config/legacy-fallback.d.ts.map +1 -1
- package/dist/config/legacy-fallback.js +16 -14
- package/dist/config/legacy-fallback.js.map +1 -1
- package/dist/config/models.d.ts +9 -6
- package/dist/config/models.d.ts.map +1 -1
- package/dist/config/models.js +35 -25
- package/dist/config/models.js.map +1 -1
- package/dist/index.d.ts +5 -1
- package/dist/index.d.ts.map +1 -1
- package/dist/index.js +14 -1
- package/dist/index.js.map +1 -1
- package/dist/marketplace/packaging.d.ts.map +1 -1
- package/dist/marketplace/packaging.js +18 -16
- package/dist/marketplace/packaging.js.map +1 -1
- package/dist/multi-squad.d.ts.map +1 -1
- package/dist/multi-squad.js +10 -9
- package/dist/multi-squad.js.map +1 -1
- package/dist/platform/comms-file-log.d.ts.map +1 -1
- package/dist/platform/comms-file-log.js +7 -6
- package/dist/platform/comms-file-log.js.map +1 -1
- package/dist/platform/comms.d.ts.map +1 -1
- package/dist/platform/comms.js +6 -5
- package/dist/platform/comms.js.map +1 -1
- package/dist/platform/index.d.ts.map +1 -1
- package/dist/platform/index.js +4 -3
- package/dist/platform/index.js.map +1 -1
- package/dist/ralph/capabilities.d.ts +30 -1
- package/dist/ralph/capabilities.d.ts.map +1 -1
- package/dist/ralph/capabilities.js +51 -6
- package/dist/ralph/capabilities.js.map +1 -1
- package/dist/ralph/index.d.ts +1 -1
- package/dist/ralph/index.d.ts.map +1 -1
- package/dist/ralph/index.js +4 -3
- package/dist/ralph/index.js.map +1 -1
- package/dist/ralph/rate-limiting.d.ts.map +1 -1
- package/dist/ralph/rate-limiting.js +4 -4
- package/dist/ralph/rate-limiting.js.map +1 -1
- package/dist/remote/bridge.d.ts.map +1 -1
- package/dist/remote/bridge.js +2 -2
- package/dist/remote/bridge.js.map +1 -1
- package/dist/resolution.d.ts +9 -0
- package/dist/resolution.d.ts.map +1 -1
- package/dist/resolution.js +39 -16
- package/dist/resolution.js.map +1 -1
- package/dist/roles/catalog.d.ts +1 -1
- package/dist/runtime/config.d.ts.map +1 -1
- package/dist/runtime/config.js +8 -7
- package/dist/runtime/config.js.map +1 -1
- package/dist/runtime/cross-squad.d.ts.map +1 -1
- package/dist/runtime/cross-squad.js +8 -7
- package/dist/runtime/cross-squad.js.map +1 -1
- package/dist/runtime/scheduler.d.ts.map +1 -1
- package/dist/runtime/scheduler.js +8 -8
- package/dist/runtime/scheduler.js.map +1 -1
- package/dist/runtime/squad-observer.d.ts.map +1 -1
- package/dist/runtime/squad-observer.js +7 -4
- package/dist/runtime/squad-observer.js.map +1 -1
- package/dist/sharing/consult.d.ts +1 -1
- package/dist/sharing/consult.d.ts.map +1 -1
- package/dist/sharing/consult.js +66 -64
- package/dist/sharing/consult.js.map +1 -1
- package/dist/sharing/export.d.ts.map +1 -1
- package/dist/sharing/export.js +16 -16
- package/dist/sharing/export.js.map +1 -1
- package/dist/sharing/import.d.ts.map +1 -1
- package/dist/sharing/import.js +13 -12
- package/dist/sharing/import.js.map +1 -1
- package/dist/skills/skill-loader.d.ts.map +1 -1
- package/dist/skills/skill-loader.js +10 -9
- package/dist/skills/skill-loader.js.map +1 -1
- package/dist/skills/skill-script-loader.d.ts.map +1 -1
- package/dist/skills/skill-script-loader.js +6 -4
- package/dist/skills/skill-script-loader.js.map +1 -1
- package/dist/skills/skill-source.d.ts +3 -1
- package/dist/skills/skill-source.d.ts.map +1 -1
- package/dist/skills/skill-source.js +18 -16
- package/dist/skills/skill-source.js.map +1 -1
- package/dist/state/collection-map.d.ts +43 -0
- package/dist/state/collection-map.d.ts.map +1 -0
- package/dist/state/collection-map.js +9 -0
- package/dist/state/collection-map.js.map +1 -0
- package/dist/state/collections.d.ts +102 -0
- package/dist/state/collections.d.ts.map +1 -0
- package/dist/state/collections.js +317 -0
- package/dist/state/collections.js.map +1 -0
- package/dist/state/domain-types.d.ts +122 -0
- package/dist/state/domain-types.d.ts.map +1 -0
- package/dist/state/domain-types.js +54 -0
- package/dist/state/domain-types.js.map +1 -0
- package/dist/state/handles.d.ts +16 -0
- package/dist/state/handles.d.ts.map +1 -0
- package/dist/state/handles.js +161 -0
- package/dist/state/handles.js.map +1 -0
- package/dist/state/index.d.ts +17 -0
- package/dist/state/index.d.ts.map +1 -0
- package/dist/state/index.js +15 -0
- package/dist/state/index.js.map +1 -0
- package/dist/state/io/charter-io.d.ts +28 -0
- package/dist/state/io/charter-io.d.ts.map +1 -0
- package/dist/state/io/charter-io.js +94 -0
- package/dist/state/io/charter-io.js.map +1 -0
- package/dist/state/io/decisions-io.d.ts +42 -0
- package/dist/state/io/decisions-io.d.ts.map +1 -0
- package/dist/state/io/decisions-io.js +66 -0
- package/dist/state/io/decisions-io.js.map +1 -0
- package/dist/state/io/history-io.d.ts +37 -0
- package/dist/state/io/history-io.d.ts.map +1 -0
- package/dist/state/io/history-io.js +102 -0
- package/dist/state/io/history-io.js.map +1 -0
- package/dist/state/io/index.d.ts +19 -0
- package/dist/state/io/index.d.ts.map +1 -0
- package/dist/state/io/index.js +19 -0
- package/dist/state/io/index.js.map +1 -0
- package/dist/state/io/routing-io.d.ts +37 -0
- package/dist/state/io/routing-io.d.ts.map +1 -0
- package/dist/state/io/routing-io.js +99 -0
- package/dist/state/io/routing-io.js.map +1 -0
- package/dist/state/io/team-io.d.ts +46 -0
- package/dist/state/io/team-io.d.ts.map +1 -0
- package/dist/state/io/team-io.js +82 -0
- package/dist/state/io/team-io.js.map +1 -0
- package/dist/state/schema.d.ts +24 -0
- package/dist/state/schema.d.ts.map +1 -0
- package/dist/state/schema.js +41 -0
- package/dist/state/schema.js.map +1 -0
- package/dist/state/squad-state.d.ts +42 -0
- package/dist/state/squad-state.d.ts.map +1 -0
- package/dist/state/squad-state.js +68 -0
- package/dist/state/squad-state.js.map +1 -0
- package/dist/storage/fs-storage-provider.d.ts +60 -0
- package/dist/storage/fs-storage-provider.d.ts.map +1 -0
- package/dist/storage/fs-storage-provider.js +377 -0
- package/dist/storage/fs-storage-provider.js.map +1 -0
- package/dist/storage/in-memory-storage-provider.d.ts +46 -0
- package/dist/storage/in-memory-storage-provider.d.ts.map +1 -0
- package/dist/storage/in-memory-storage-provider.js +264 -0
- package/dist/storage/in-memory-storage-provider.js.map +1 -0
- package/dist/storage/index.d.ts +6 -0
- package/dist/storage/index.d.ts.map +1 -0
- package/dist/storage/index.js +5 -0
- package/dist/storage/index.js.map +1 -0
- package/dist/storage/sqlite-storage-provider.d.ts +95 -0
- package/dist/storage/sqlite-storage-provider.d.ts.map +1 -0
- package/dist/storage/sqlite-storage-provider.js +383 -0
- package/dist/storage/sqlite-storage-provider.js.map +1 -0
- package/dist/storage/storage-error.d.ts +28 -0
- package/dist/storage/storage-error.d.ts.map +1 -0
- package/dist/storage/storage-error.js +35 -0
- package/dist/storage/storage-error.js.map +1 -0
- package/dist/storage/storage-provider.d.ts +161 -0
- package/dist/storage/storage-provider.d.ts.map +1 -0
- package/dist/storage/storage-provider.js +18 -0
- package/dist/storage/storage-provider.js.map +1 -0
- package/dist/streams/resolver.d.ts.map +1 -1
- package/dist/streams/resolver.js +6 -5
- package/dist/streams/resolver.js.map +1 -1
- package/dist/tools/index.d.ts +5 -1
- package/dist/tools/index.d.ts.map +1 -1
- package/dist/tools/index.js +54 -15
- package/dist/tools/index.js.map +1 -1
- package/dist/upstream/resolver.d.ts +3 -2
- package/dist/upstream/resolver.d.ts.map +1 -1
- package/dist/upstream/resolver.js +33 -32
- package/dist/upstream/resolver.js.map +1 -1
- package/package.json +33 -1
- package/templates/scribe-charter.md +4 -0
- package/templates/skills/cross-machine-coordination/SKILL.md +434 -0
- package/templates/skills/error-recovery/SKILL.md +99 -0
- package/templates/skills/iterative-retrieval/SKILL.md +165 -0
- package/templates/skills/notification-routing/SKILL.md +105 -0
- package/templates/skills/pr-screenshots/SKILL.md +149 -0
- package/templates/skills/ralph-two-pass-scan/SKILL.md +35 -0
- package/templates/skills/reflect/SKILL.md +229 -0
- package/templates/skills/release-process/SKILL.md +84 -376
- package/templates/skills/retro-enforcement/SKILL.md +148 -0
- package/templates/skills/tiered-memory/SKILL.md +234 -0
- package/templates/skills/windows-compatibility/SKILL.md +24 -0
- package/templates/{squad.agent.md → squad.agent.md.template} +57 -28
- package/templates/workflows/squad-ci.yml +1 -1
- package/templates/workflows/squad-heartbeat.yml +0 -4
- package/templates/workflows/squad-insider-release.yml +1 -1
- package/templates/workflows/squad-preview.yml +1 -1
- package/templates/workflows/squad-release.yml +1 -1
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "@bradygaster/squad-sdk",
|
|
3
|
-
"version": "0.9.
|
|
3
|
+
"version": "0.9.2-insider.1",
|
|
4
4
|
"description": "Squad SDK — Programmable multi-agent runtime for GitHub Copilot",
|
|
5
5
|
"type": "module",
|
|
6
6
|
"main": "./dist/index.js",
|
|
@@ -185,6 +185,34 @@
|
|
|
185
185
|
"./config/models": {
|
|
186
186
|
"types": "./dist/config/models.d.ts",
|
|
187
187
|
"import": "./dist/config/models.js"
|
|
188
|
+
},
|
|
189
|
+
"./storage": {
|
|
190
|
+
"types": "./dist/storage/index.d.ts",
|
|
191
|
+
"import": "./dist/storage/index.js"
|
|
192
|
+
},
|
|
193
|
+
"./platform": {
|
|
194
|
+
"types": "./dist/platform/index.d.ts",
|
|
195
|
+
"import": "./dist/platform/index.js"
|
|
196
|
+
},
|
|
197
|
+
"./remote": {
|
|
198
|
+
"types": "./dist/remote/index.d.ts",
|
|
199
|
+
"import": "./dist/remote/index.js"
|
|
200
|
+
},
|
|
201
|
+
"./roles": {
|
|
202
|
+
"types": "./dist/roles/index.d.ts",
|
|
203
|
+
"import": "./dist/roles/index.js"
|
|
204
|
+
},
|
|
205
|
+
"./state": {
|
|
206
|
+
"types": "./dist/state/index.d.ts",
|
|
207
|
+
"import": "./dist/state/index.js"
|
|
208
|
+
},
|
|
209
|
+
"./streams": {
|
|
210
|
+
"types": "./dist/streams/index.d.ts",
|
|
211
|
+
"import": "./dist/streams/index.js"
|
|
212
|
+
},
|
|
213
|
+
"./upstream": {
|
|
214
|
+
"types": "./dist/upstream/index.d.ts",
|
|
215
|
+
"import": "./dist/upstream/index.js"
|
|
188
216
|
}
|
|
189
217
|
},
|
|
190
218
|
"files": [
|
|
@@ -213,11 +241,15 @@
|
|
|
213
241
|
"@opentelemetry/sdk-trace-base": "^1.30.0",
|
|
214
242
|
"@opentelemetry/sdk-trace-node": "^1.30.0",
|
|
215
243
|
"@opentelemetry/semantic-conventions": "^1.28.0",
|
|
244
|
+
"sql.js": "^1.14.1",
|
|
216
245
|
"ws": "^8.18.0"
|
|
217
246
|
},
|
|
218
247
|
"devDependencies": {
|
|
219
248
|
"@types/node": "^22.0.0",
|
|
249
|
+
"@types/sql.js": "^1.4.10",
|
|
220
250
|
"@types/ws": "^8.5.13",
|
|
251
|
+
"typedoc": "~0.28.18",
|
|
252
|
+
"typedoc-plugin-markdown": "~4.11.0",
|
|
221
253
|
"typescript": "^5.7.0"
|
|
222
254
|
},
|
|
223
255
|
"keywords": [
|
|
package/templates/scribe-charter.md
@@ -15,6 +15,10 @@
|
|
|
15
15
|
- `.squad/decisions.md` — the shared decision log all agents read (canonical, merged)
|
|
16
16
|
- `.squad/decisions/inbox/` — decision drop-box (agents write here, I merge)
|
|
17
17
|
- Cross-agent context propagation — when one agent's decision affects another
|
|
18
|
+
- Decision archival — **HARD GATE**: enforce two-tier ceiling on decisions.md before every merge:
|
|
19
|
+
- **Tier 1 (30-day):** If >20KB, archive entries older than 30 days
|
|
20
|
+
- **Tier 2 (7-day):** If still >50KB after Tier 1, archive entries older than 7 days
|
|
21
|
+
- Emit HEALTH REPORT to session log after archival runs
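
The two-tier ceiling above can be sketched as follows. This is a minimal illustration under stated assumptions, not the Scribe's actual implementation: the `Entry` type and `archive_pass` function are hypothetical, "20KB" and "50KB" are read as multiples of 1024 bytes, and the HEALTH REPORT step is left out.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

TIER1_LIMIT_BYTES = 20 * 1024  # assumption: "20KB" means 20 * 1024 bytes
TIER2_LIMIT_BYTES = 50 * 1024  # assumption: "50KB" means 50 * 1024 bytes


@dataclass
class Entry:
    """Hypothetical decision entry: when it was written and its text."""
    written_at: datetime
    text: str


def total_size(entries):
    """Size of all entries in UTF-8 bytes."""
    return sum(len(e.text.encode("utf-8")) for e in entries)


def archive_pass(entries, now):
    """Apply Tier 1, then Tier 2, and return (kept, archived)."""
    kept, archived = list(entries), []
    tiers = ((TIER1_LIMIT_BYTES, 30), (TIER2_LIMIT_BYTES, 7))
    for limit, max_age_days in tiers:
        if total_size(kept) <= limit:
            continue  # under this tier's ceiling, nothing to archive
        cutoff = now - timedelta(days=max_age_days)
        archived += [e for e in kept if e.written_at < cutoff]
        kept = [e for e in kept if e.written_at >= cutoff]
    return kept, archived
```

Tier 2 only ever fires after Tier 1 has had its chance, because a log that is under 20KB is also under 50KB.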
|
|
18
22
|
|
|
19
23
|
## How I Work
|
|
20
24
|
|
|
package/templates/skills/cross-machine-coordination/SKILL.md
@@ -0,0 +1,434 @@
|
|
|
1
|
+
# Skill: Cross-Machine Coordination Pattern
|
|
2
|
+
|
|
3
|
+
**Skill ID:** `cross-machine-coordination`
|
|
4
|
+
**Owner:** Ralph (Work Monitor)
|
|
5
|
+
**Squad Integration:** All agents
|
|
6
|
+
**Status:** Specification (ready for implementation)
|
|
7
|
+
|
|
8
|
+
---
|
|
9
|
+
|
|
10
|
+
## Overview
|
|
11
|
+
|
|
12
|
+
Enables squad agents running on different machines (laptop, DevBox, Azure VM) to securely share work, coordinate execution, and pass results without manual intervention.
|
|
13
|
+
|
|
14
|
+
**Pattern:** Git-based task queuing + GitHub Issues supplement
|
|
15
|
+
|
|
16
|
+
---
|
|
17
|
+
|
|
18
|
+
## Usage
|
|
19
|
+
|
|
20
|
+
### For Task Sources (Orchestrating Machine)
|
|
21
|
+
|
|
22
|
+
**To assign work to DevBox:**
|
|
23
|
+
|
|
24
|
+
```bash
|
|
25
|
+
# Create task file
|
|
26
|
+
cat > .squad/cross-machine/tasks/2026-03-14T1530Z-laptop-gpu-voice-clone.yaml << 'EOF'
|
|
27
|
+
id: gpu-voice-clone-001
|
|
28
|
+
source_machine: laptop-machine
|
|
29
|
+
target_machine: devbox
|
|
30
|
+
priority: high
|
|
31
|
+
created_at: 2026-03-14T15:30:00Z
|
|
32
|
+
task_type: gpu_workload
|
|
33
|
+
payload:
|
|
34
|
+
command: "python scripts/voice-clone.py --input voice.wav --output cloned.wav"
|
|
35
|
+
expected_duration_min: 15
|
|
36
|
+
resources:
|
|
37
|
+
gpu: true
|
|
38
|
+
memory_gb: 8
|
|
39
|
+
status: pending
|
|
40
|
+
EOF
|
|
41
|
+
|
|
42
|
+
# Commit & push
|
|
43
|
+
git add .squad/cross-machine/tasks/
|
|
44
|
+
git commit -m "Cross-machine task: GPU voice cloning [squad:machine-devbox]"
|
|
45
|
+
git push origin main
|
|
46
|
+
```
|
|
47
|
+
|
|
48
|
+
Ralph on DevBox will:
|
|
49
|
+
1. Pull the task on next cycle (5-10 min)
|
|
50
|
+
2. Validate schema & command whitelist
|
|
51
|
+
3. Execute the GPU workload
|
|
52
|
+
4. Write result to `.squad/cross-machine/results/gpu-voice-clone-001.yaml`
|
|
53
|
+
5. Commit & push the result
|
|
54
|
+
|
|
55
|
+
---
|
|
56
|
+
|
|
57
|
+
### For Task Executors (DevBox, Azure VMs)
|
|
58
|
+
|
|
59
|
+
Ralph automatically watches `.squad/cross-machine/tasks/` for work targeted at this machine.
|
|
60
|
+
|
|
61
|
+
**On each cycle (5-10 min):**
|
|
62
|
+
|
|
63
|
+
```python
|
|
64
|
+
# Pseudo-code (Ralph implementation)
|
|
65
|
+
1. git pull origin main
|
|
66
|
+
2. Load all .yaml files in .squad/cross-machine/tasks/
|
|
67
|
+
3. Filter for status=pending AND target_machine=HOSTNAME
|
|
68
|
+
4. For each task:
|
|
69
|
+
a. Validate schema (must have: id, source_machine, target_machine, payload)
|
|
70
|
+
b. Validate command against whitelist
|
|
71
|
+
c. Execute task (with timeout)
|
|
72
|
+
d. Write result to .squad/cross-machine/results/{id}.yaml
|
|
73
|
+
e. Commit & push result
|
|
74
|
+
```
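
Steps 3 and 4 of the pseudo-code above could look like the sketch below. It is an illustration only: `run_cycle`, the injected `validate` and `execute` callables, and the choice to record a validation failure as a `failed` result are assumptions, and the git pull/push and YAML loading steps are left to the caller.

```python
def run_cycle(tasks, this_machine, validate, execute):
    """Filter pending tasks for this machine, validate them, and execute them.

    tasks:     list of already-parsed task mappings
    validate:  callable returning a list of problems (empty means valid)
    execute:   callable returning a result mapping for one task
    """
    results = []
    for task in tasks:
        is_pending = task.get("status") == "pending"
        is_ours = task.get("target_machine") == this_machine
        if not (is_pending and is_ours):
            continue
        problems = validate(task)
        if problems:
            # Assumption: a rejected task still produces a result record.
            results.append({
                "id": task.get("id"),
                "status": "failed",
                "stderr": "; ".join(problems),
            })
            continue
        results.append(execute(task))
    return results
```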
|
|
75
|
+
|
|
76
|
+
---
|
|
77
|
+
|
|
78
|
+
### For Urgent/Ad-Hoc Tasks
|
|
79
|
+
|
|
80
|
+
**Use GitHub Issues with `squad:machine-{name}` label:**
|
|
81
|
+
|
|
82
|
+
```bash
|
|
83
|
+
# Create issue
|
|
84
|
+
gh issue create \
|
|
85
|
+
--title "GPU: Clone voice profile from sample.wav" \
|
|
86
|
+
--body "Execute voice cloning on DevBox. Input: /path/to/voice-input.wav" \
|
|
87
|
+
--label "squad:machine-devbox" \
|
|
88
|
+
--label "urgent"
|
|
89
|
+
```
|
|
90
|
+
|
|
91
|
+
Ralph on DevBox will:
|
|
92
|
+
1. Detect issue with `squad:machine-devbox` label
|
|
93
|
+
2. Parse task from issue body
|
|
94
|
+
3. Execute task
|
|
95
|
+
4. Comment with result
|
|
96
|
+
5. Close issue
|
|
97
|
+
|
|
98
|
+
---
|
|
99
|
+
|
|
100
|
+
## File Formats
|
|
101
|
+
|
|
102
|
+
### Task File (YAML)
|
|
103
|
+
|
|
104
|
+
**Location:** `.squad/cross-machine/tasks/{timestamp}-{machine}-{task-id}.yaml`
|
|
105
|
+
|
|
106
|
+
**Required Fields:**
|
|
107
|
+
```yaml
|
|
108
|
+
id: {task-id} # Unique identifier (alphanumeric + dash)
|
|
109
|
+
source_machine: {hostname} # Where task was created
|
|
110
|
+
target_machine: {hostname} # Where task will execute
|
|
111
|
+
priority: high|normal|low # Execution priority
|
|
112
|
+
created_at: 2026-03-14T15:30:00Z # ISO 8601 timestamp
|
|
113
|
+
task_type: gpu_workload|script|... # Category
|
|
114
|
+
payload:
|
|
115
|
+
command: "..." # Shell command to execute
|
|
116
|
+
expected_duration_min: 15 # Timeout (minutes)
|
|
117
|
+
resources:
|
|
118
|
+
gpu: true|false
|
|
119
|
+
memory_gb: 8
|
|
120
|
+
cpu_cores: 4
|
|
121
|
+
status: pending|executing|completed|failed
|
|
122
|
+
```
|
|
123
|
+
|
|
124
|
+
**Optional Fields:**
|
|
125
|
+
```yaml
|
|
126
|
+
description: "Human-readable task description"
|
|
127
|
+
timeout_override_min: 120 # Override default timeout
|
|
128
|
+
retry_count: 3 # Retry failed tasks
|
|
129
|
+
```
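
A schema check for the fields listed above might look like the following sketch. It works on an already-parsed mapping, so it needs no YAML library. The function name `validate_task`, the list-of-problems return shape, and the decision to treat every top-level key under **Required Fields** as mandatory are assumptions made for illustration.

```python
import re

REQUIRED = {"id", "source_machine", "target_machine", "priority",
            "created_at", "task_type", "payload", "status"}
OPTIONAL = {"description", "timeout_override_min", "retry_count"}
PRIORITIES = {"high", "normal", "low"}
STATUSES = {"pending", "executing", "completed", "failed"}
ID_PATTERN = re.compile(r"^[A-Za-z0-9-]+$")  # "alphanumeric + dash"


def validate_task(task):
    """Return a list of problems; an empty list means the task passed."""
    problems = []
    keys = set(task)
    problems += [f"missing field: {k}" for k in sorted(REQUIRED - keys)]
    problems += [f"unexpected field: {k}"
                 for k in sorted(keys - REQUIRED - OPTIONAL)]
    if "id" in task and not ID_PATTERN.match(str(task["id"])):
        problems.append("id must be alphanumeric plus dashes")
    if "priority" in task and task["priority"] not in PRIORITIES:
        problems.append("priority must be high, normal, or low")
    if "status" in task and task["status"] not in STATUSES:
        problems.append("status is not a known value")
    payload = task.get("payload")
    if payload is not None:
        if not (isinstance(payload, dict) and "command" in payload):
            problems.append("payload must be a mapping with a command")
    return problems
```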
|
|
130
|
+
|
|
131
|
+
### Result File (YAML)
|
|
132
|
+
|
|
133
|
+
**Location:** `.squad/cross-machine/results/{task-id}.yaml`
|
|
134
|
+
|
|
135
|
+
```yaml
|
|
136
|
+
id: {task-id} # Links back to task
|
|
137
|
+
target_machine: devbox # Executed on
|
|
138
|
+
completed_at: 2026-03-14T15:45:00Z # When it finished
|
|
139
|
+
status: completed|failed|timeout # Outcome
|
|
140
|
+
exit_code: 0 # Shell exit code
|
|
141
|
+
stdout: "..." # Captured output
|
|
142
|
+
stderr: "..." # Captured errors
|
|
143
|
+
duration_seconds: 900 # How long it took
|
|
144
|
+
artifacts:
|
|
145
|
+
- path: "/path/to/artifacts/..." # Location of results
|
|
146
|
+
type: audio|text|model|...
|
|
147
|
+
size_mb: 2.5
|
|
148
|
+
```
|
|
149
|
+
|
|
150
|
+
---
|
|
151
|
+
|
|
152
|
+
## Security Model
|
|
153
|
+
|
|
154
|
+
### Validation Pipeline
|
|
155
|
+
|
|
156
|
+
All tasks go through:
|
|
157
|
+
|
|
158
|
+
1. **Schema Validation**
|
|
159
|
+
- YAML structure matches spec
|
|
160
|
+
- Required fields present
|
|
161
|
+
- No unexpected fields (reject)
|
|
162
|
+
|
|
163
|
+
2. **Command Whitelist**
|
|
164
|
+
- Only approved commands allowed
|
|
165
|
+
- Path validation (no `../../` escapes)
|
|
166
|
+
- Environment variable sanitization
|
|
167
|
+
- No inline shell operators (`&&`, `|`, `>`)
|
|
168
|
+
|
|
169
|
+
3. **Resource Limits**
|
|
170
|
+
- Timeout enforced (default: 60 min)
|
|
171
|
+
- Memory cap: 16GB (adjustable)
|
|
172
|
+
- CPU threads: 4 (adjustable)
|
|
173
|
+
- Disk write: 100GB (adjustable)
|
|
174
|
+
|
|
175
|
+
4. **Execution Isolation**
|
|
176
|
+
- Runs as unprivileged user
|
|
177
|
+
- Temp directory cleaned after execution
|
|
178
|
+
- Network access: read-only (no outbound writes)
|
|
179
|
+
|
|
180
|
+
5. **Audit Trail**
|
|
181
|
+
- All executions logged to git
|
|
182
|
+
- Commit signed with Ralph's key
|
|
183
|
+
- Result stored immutably
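
The command whitelist step can be sketched as below. This is one possible reading, not Ralph's actual check: `is_command_allowed` is a hypothetical name, whitelist entries are assumed to be token prefixes (as the `command_whitelist` config entries suggest), and only the three shell operators named above are rejected.

```python
import re
import shlex

SHELL_OPERATORS = ("&&", "|", ">")  # the operators named in the pipeline


def is_command_allowed(command, whitelist):
    """True if the command starts with a whitelisted token sequence
    and contains no shell operators or '..' path components."""
    if any(op in command for op in SHELL_OPERATORS):
        return False
    try:
        tokens = shlex.split(command)
    except ValueError:  # for example, an unterminated quote
        return False
    for token in tokens:
        # Split on path separators and '=' so '--in=../x' is caught too.
        if ".." in re.split(r"[/\\=]", token):
            return False
    for entry in whitelist:
        prefix = shlex.split(entry)
        if prefix and tokens[:len(prefix)] == prefix:
            return True
    return False
```

Passing the resulting token list to the process launcher without a shell (rather than the raw string) is what keeps the "no shell evaluation" mitigation meaningful.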
|
|
184
|
+
|
|
185
|
+
### Threat Mitigations
|
|
186
|
+
|
|
187
|
+
| Threat | Mitigation |
|
|
188
|
+
|--------|-----------|
|
|
189
|
+
| **Malicious task injection** | Branch protection + PR review before merge |
|
|
190
|
+
| **Credential leakage** | Pre-commit secret scan + environment scrubbing |
|
|
191
|
+
| **Resource exhaustion** | Timeout + memory limits |
|
|
192
|
+
| **Code injection** | Command whitelist + no shell evaluation |
|
|
193
|
+
| **Result tampering** | Git history is tamper-evident (commits are content-addressed); branch protection blocks force-pushes |
|
|
194
|
+
|
|
195
|
+
---
|
|
196
|
+
|
|
197
|
+
## Configuration
|
|
198
|
+
|
|
199
|
+
Ralph reads config from `.squad/config.json`:
|
|
200
|
+
|
|
201
|
+
```json
|
|
202
|
+
{
|
|
203
|
+
"cross_machine": {
|
|
204
|
+
"enabled": true,
|
|
205
|
+
"poll_interval_seconds": 300,
|
|
206
|
+
"this_machine": "devbox",
|
|
207
|
+
"max_concurrent_tasks": 2,
|
|
208
|
+
"task_timeout_minutes": 60,
|
|
209
|
+
"command_whitelist": [
|
|
210
|
+
"python scripts/voice-clone.py",
|
|
211
|
+
"python scripts/data-process.py",
|
|
212
|
+
"bash scripts/cleanup.sh"
|
|
213
|
+
],
|
|
214
|
+
"result_ttl_days": 30
|
|
215
|
+
}
|
|
216
|
+
}
|
|
217
|
+
```
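
Reading this section could be sketched as follows. The `CrossMachineConfig` type and `load_config` function are hypothetical, and the fallback defaults are assumptions, apart from the 60-minute timeout, which matches the default stated under Resource Limits.

```python
import json
from dataclasses import dataclass, field, fields


@dataclass
class CrossMachineConfig:
    """Typed view of the "cross_machine" section (defaults are assumed)."""
    enabled: bool = False
    poll_interval_seconds: int = 300
    this_machine: str = ""
    max_concurrent_tasks: int = 1
    task_timeout_minutes: int = 60
    command_whitelist: list = field(default_factory=list)
    result_ttl_days: int = 30


def load_config(text):
    """Parse config JSON text and return the cross_machine settings."""
    section = json.loads(text).get("cross_machine", {})
    known = {f.name for f in fields(CrossMachineConfig)}
    # Unknown keys are ignored here; a stricter loader could reject them.
    return CrossMachineConfig(**{k: v for k, v in section.items() if k in known})
```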
|
|
218
|
+
|
|
219
|
+
---
|
|
220
|
+
|
|
221
|
+
## Examples
|
|
222
|
+
|
|
223
|
+
### Example 1: GPU Voice Cloning (Laptop → DevBox)
|
|
224
|
+
|
|
225
|
+
**1. Laptop creates task:**
|
|
226
|
+
|
|
227
|
+
```yaml
|
|
228
|
+
# .squad/cross-machine/tasks/2026-03-14T1530Z-laptop-gpu-001.yaml
|
|
229
|
+
id: gpu-voice-clone-001
|
|
230
|
+
source_machine: laptop-machine
|
|
231
|
+
target_machine: devbox
|
|
232
|
+
priority: high
|
|
233
|
+
created_at: 2026-03-14T15:30:00Z
|
|
234
|
+
task_type: gpu_workload
|
|
235
|
+
payload:
|
|
236
|
+
command: "python scripts/voice-clone.py --input voice.wav --output cloned.wav"
|
|
237
|
+
expected_duration_min: 15
|
|
238
|
+
resources:
|
|
239
|
+
gpu: true
|
|
240
|
+
memory_gb: 8
|
|
241
|
+
status: pending
|
|
242
|
+
```
|
|
243
|
+
|
|
244
|
+
**2. Laptop commits & pushes:**
|
|
245
|
+
|
|
246
|
+
```bash
|
|
247
|
+
git add .squad/cross-machine/tasks/
|
|
248
|
+
git commit -m "Task: GPU voice cloning [squad:machine-devbox]"
|
|
249
|
+
git push origin main
|
|
250
|
+
```
|
|
251
|
+
|
|
252
|
+
**3. DevBox Ralph (5 min later):**
|
|
253
|
+
|
|
254
|
+
```
|
|
255
|
+
[Ralph Watch Cycle]
|
|
256
|
+
- Pulled origin/main
|
|
257
|
+
- Detected: gpu-voice-clone-001 (status: pending, target: devbox)
|
|
258
|
+
- Validation: ✅ Schema OK, command whitelisted
|
|
259
|
+
- Executing: python scripts/voice-clone.py ...
|
|
260
|
+
- [15 minutes of processing]
|
|
261
|
+
- Completed: exit code 0
|
|
262
|
+
- Writing result...
|
|
263
|
+
- Committing & pushing...
|
|
264
|
+
```
|
|
265
|
+
|
|
266
|
+
**4. Laptop Ralph (next cycle) sees result:**
|
|
267
|
+
|
|
268
|
+
```yaml
|
|
269
|
+
# .squad/cross-machine/results/gpu-voice-clone-001.yaml
|
|
270
|
+
id: gpu-voice-clone-001
|
|
271
|
+
target_machine: devbox
|
|
272
|
+
completed_at: 2026-03-14T15:45:00Z
|
|
273
|
+
status: completed
|
|
274
|
+
exit_code: 0
|
|
275
|
+
stdout: "Voice cloning completed. Output written to /tmp/cloned.wav"
|
|
276
|
+
stderr: ""
|
|
277
|
+
duration_seconds: 900
|
|
278
|
+
artifacts:
|
|
279
|
+
- path: "/path/to/artifacts/voice-clone-001/output.wav"
|
|
280
|
+
type: audio
|
|
281
|
+
size_mb: 2.5
|
|
282
|
+
```
|
|
283
|
+
|
|
284
|
+
---
|
|
285
|
+
|
|
286
|
+
### Example 2: Urgent Debug Request (Human → DevBox via Issue)
|
|
287
|
+
|
|
288
|
+
**Create issue:**
|
|
289
|
+
|
|
290
|
+
```bash
|
|
291
|
+
gh issue create \
|
|
292
|
+
--title "DevBox: Debug voice model failure" \
|
|
293
|
+
--body "Error: Model failed to load on last run. Please check /tmp/model.log and report findings." \
|
|
294
|
+
--label "squad:machine-devbox" \
|
|
295
|
+
--label "urgent"
|
|
296
|
+
```
|
|
297
|
+
|
|
298
|
+
**DevBox Ralph detects → executes → comments:**
|
|
299
|
+
|
|
300
|
+
```
|
|
301
|
+
✅ Executed on devbox at 2026-03-14 15:47:00
|
|
302
|
+
Command: python scripts/debug-model.py
|
|
303
|
+
|
|
304
|
+
Result:
|
|
305
|
+
------
|
|
306
|
+
Model file: /tmp/model-v2.bin (OK)
|
|
307
|
+
Checksum: a1b2c3d4e5f6 (matches expected)
|
|
308
|
+
Memory available: 12 GB (sufficient)
|
|
309
|
+
|
|
310
|
+
ERROR FOUND: Config file permission issue
|
|
311
|
+
- File: ~/.config/voice/model.yaml
|
|
312
|
+
- Permissions: -rw------- (owner-only)
|
|
313
|
+
- Expected: -rw-r--r-- (world-readable for service)
|
|
314
|
+
|
|
315
|
+
FIX: Run: chmod 644 ~/.config/voice/model.yaml
|
|
316
|
+
```
|
|
317
|
+
|
|
318
|
+
---
|
|
319
|
+
|
|
320
|
+
## Error Handling
|
|
321
|
+
|
|
322
|
+
### Task Execution Failures
|
|
323
|
+
|
|
324
|
+
If a task fails (exit code != 0):
|
|
325
|
+
|
|
326
|
+
1. Result written with `status: failed` + exit code
|
|
327
|
+
2. stderr captured in result
|
|
328
|
+
3. Committed to git for audit
|
|
329
|
+
4. Source machine can retry by re-pushing task with `status: pending`
|
|
330
|
+
|
|
331
|
+
### Stalled Tasks
|
|
332
|
+
|
|
333
|
+
If a task doesn't complete within timeout:
|
|
334
|
+
|
|
335
|
+
1. Process killed
|
|
336
|
+
2. Result written with `status: timeout`
|
|
337
|
+
3. stderr: "Execution exceeded X minutes"
|
|
338
|
+
4. Source can investigate or retry
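
The failure and timeout handling above can be sketched with the standard library. `run_with_timeout` is a hypothetical helper, and the returned mapping only mirrors a subset of the result file fields; writing the YAML file and committing it are not shown.

```python
import subprocess
import time


def run_with_timeout(argv, timeout_minutes):
    """Run argv without a shell and return a result-file-like mapping."""
    started = time.monotonic()
    try:
        proc = subprocess.run(argv, capture_output=True, text=True,
                              timeout=timeout_minutes * 60)
        status = "completed" if proc.returncode == 0 else "failed"
        exit_code, stdout, stderr = proc.returncode, proc.stdout, proc.stderr
    except subprocess.TimeoutExpired as exc:
        # subprocess.run kills the child before raising TimeoutExpired.
        status, exit_code = "timeout", None
        out = exc.stdout or ""
        stdout = out.decode(errors="replace") if isinstance(out, bytes) else out
        stderr = f"Execution exceeded {timeout_minutes} minutes"
    return {
        "status": status,
        "exit_code": exit_code,
        "stdout": stdout,
        "stderr": stderr,
        "duration_seconds": round(time.monotonic() - started),
    }
```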
|
|
339
|
+
|
|
340
|
+
### Network Failures
|
|
341
|
+
|
|
342
|
+
If git push/pull fails:
|
|
343
|
+
|
|
344
|
+
- Ralph retries on next cycle
|
|
345
|
+
- Tasks queue locally until connectivity restored
|
|
346
|
+
- No tasks lost (stored in local repo)
|
|
347
|
+
|
|
348
|
+
---
|
|
349
|
+
|
|
350
|
+
## Monitoring & Debugging
|
|
351
|
+
|
|
352
|
+
### Check Task Queue
|
|
353
|
+
|
|
354
|
+
```bash
|
|
355
|
+
ls -la .squad/cross-machine/tasks/
|
|
356
|
+
cat .squad/cross-machine/tasks/*.yaml | grep -E "^(id|status|target_machine):"
|
|
357
|
+
```
|
|
358
|
+
|
|
359
|
+
### Check Results
|
|
360
|
+
|
|
361
|
+
```bash
|
|
362
|
+
ls -la .squad/cross-machine/results/
|
|
363
|
+
cat .squad/cross-machine/results/{task-id}.yaml
|
|
364
|
+
```
|
|
365
|
+
|
|
366
|
+
### View Execution History
|
|
367
|
+
|
|
368
|
+
```bash
|
|
369
|
+
git log --oneline .squad/cross-machine/ | head -20
|
|
370
|
+
```
|
|
371
|
+
|
|
372
|
+
### Monitor Ralph Cycles
|
|
373
|
+
|
|
374
|
+
```bash
|
|
375
|
+
tail -f .squad/log/ralph-watch.log | grep "cross-machine"
|
|
376
|
+
```
|
|
377
|
+
|
|
378
|
+
---
|
|
379
|
+
|
|
380
|
+
## Integration with Ralph Watch
|
|
381
|
+
|
|
382
|
+
Ralph automatically includes this pattern in its watch loop:
|
|
383
|
+
|
|
384
|
+
```
|
|
385
|
+
Ralph Watch Cycle (every 5-10 min):
|
|
386
|
+
1. Fetch GitHub issues with squad:machine-* labels
|
|
387
|
+
2. Poll .squad/cross-machine/tasks/
|
|
388
|
+
3. For each matching task:
|
|
389
|
+
- Validate
|
|
390
|
+
- Execute
|
|
391
|
+
- Write result
|
|
392
|
+
- Commit & push
|
|
393
|
+
4. Update status in issue (if applicable)
|
|
394
|
+
5. Sleep until next cycle
|
|
395
|
+
```
|
|
396
|
+
|
|
397
|
+
No manual Ralph configuration needed — just create task files or issues with the right labels.
|
|
398
|
+
|
|
399
|
+
---
|
|
400
|
+
|
|
401
|
+
## Migration from Manual Handoff
|
|
402
|
+
|
|
403
|
+
**Before (today):**
|
|
404
|
+
- Laptop → user manually copies file to Teams chat
|
|
405
|
+
- User pastes into target terminal
|
|
406
|
+
- User copies output back
|
|
407
|
+
- User pastes result manually
|
|
408
|
+
|
|
409
|
+
**After (with this pattern):**
|
|
410
|
+
- Laptop Ralph writes task file → git push
|
|
411
|
+
- DevBox Ralph auto-executes → git push result
|
|
412
|
+
- Laptop Ralph auto-reads result
|
|
413
|
+
- No human intervention needed
|
|
414
|
+
|
|
415
|
+
---
|
|
416
|
+
|
|
417
|
+
## Future Enhancements
|
|
418
|
+
|
|
419
|
+
Potential expansions (Phase 2+):
|
|
420
|
+
|
|
421
|
+
1. **Task Priorities:** Execution order based on priority field
|
|
422
|
+
2. **Serial Pipelines:** Machine A → B → C task chains
|
|
423
|
+
3. **GPU Availability Polling:** Query DevBox before submitting work
|
|
424
|
+
4. **Cost Tracking:** Log resource usage per task
|
|
425
|
+
5. **Notification Webhooks:** Alert on task completion
|
|
426
|
+
6. **Web Dashboard:** Real-time task status visualization
|
|
427
|
+
|
|
428
|
+
---
|
|
429
|
+
|
|
430
|
+
## Questions?
|
|
431
|
+
|
|
432
|
+
Refer to research report: `research/active/cross-machine-agents/README.md`
|
|
433
|
+
|
|
434
|
+
Contact: Seven (Research & Docs) or Ralph (Work Monitor)
|
|
package/templates/skills/error-recovery/SKILL.md
@@ -0,0 +1,99 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: "error-recovery"
|
|
3
|
+
description: "Standard recovery patterns for all squad agents. When something fails, adapt — don't just report the failure."
|
|
4
|
+
domain: "reliability, agent-coordination"
|
|
5
|
+
confidence: "high"
|
|
6
|
+
license: MIT
|
|
7
|
+
---
|
|
8
|
+
|
|
9
|
+
# Error Recovery Patterns
|
|
10
|
+
|
|
11
|
+
Standard recovery patterns for all squad agents. When something fails, **adapt** — don't just report the failure.
|
|
12
|
+
|
|
13
|
+
---
|
|
14
|
+
|
|
15
|
+
## 1. Retry with Backoff
|
|
16
|
+
|
|
17
|
+
**When:** Transient failures — API timeouts, rate limits, network errors, temporary service unavailability.
|
|
18
|
+
|
|
19
|
+
**Pattern:**
|
|
20
|
+
1. Wait briefly, then retry (start at 2s, double each attempt)
|
|
21
|
+
2. Maximum 3 retries before escalating
|
|
22
|
+
3. Log each attempt with the error received
|
|
23
|
+
|
|
24
|
+
**Example:** API call returns 429 Too Many Requests → wait 2s → retry → wait 4s → retry → wait 8s → retry → escalate if still failing.
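
A minimal sketch of this pattern, assuming every exception raised by the operation is transient (a real agent would likely retry only on specific error types). The `retry_with_backoff` name and its `sleep` and `log` parameters are illustrative; they are injectable so the waits can be observed without actually sleeping.

```python
import time


def retry_with_backoff(operation, max_retries=3, initial_delay=2.0,
                       sleep=time.sleep, log=print):
    """Call operation(); on failure wait 2s, 4s, 8s between retries,
    then re-raise the last error so the caller can escalate."""
    delay = initial_delay
    for attempt in range(max_retries + 1):  # first try plus max_retries
        try:
            return operation()
        except Exception as error:
            log(f"attempt {attempt + 1} failed: {error}")
            if attempt == max_retries:
                raise  # retries exhausted: escalate
            sleep(delay)
            delay *= 2
```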
|
|
25
|
+
|
|
26
|
+
---
|
|
27
|
+
|
|
28
|
+
## 2. Fallback Alternatives
|
|
29
|
+
|
|
30
|
+
**When:** Primary tool or approach fails and an alternative exists.
|
|
31
|
+
|
|
32
|
+
**Pattern:**
|
|
33
|
+
1. Attempt primary approach
|
|
34
|
+
2. On failure, identify alternative tool/method
|
|
35
|
+
3. Try the alternative with the same intent
|
|
36
|
+
4. Document which alternative was used and why
|
|
37
|
+
|
|
38
|
+
**Example:** Primary CLI tool fails → fall back to direct API call for the same operation.
|
|
39
|
+
|
|
40
|
+
---
|
|
41
|
+
|
|
42
|
+
## 3. Diagnose-and-Fix
|
|
43
|
+
|
|
44
|
+
**When:** Build failures, test failures, linting errors — structured errors with actionable output.
|
|
45
|
+
|
|
46
|
+
**Pattern:**
|
|
47
|
+
1. Read the full error output carefully
|
|
48
|
+
2. Identify the root cause from error messages
|
|
49
|
+
3. Attempt a targeted fix
|
|
50
|
+
4. Re-run to verify the fix
|
|
51
|
+
5. Maximum 3 fix-retry cycles before escalating
|
|
52
|
+
|
|
53
|
+
**Example:** Build fails with a type error → check for missing import → add it → rebuild.
|
|
54
|
+
|
|
55
|
+
---
|
|
56
|
+
|
|
57
|
+
## 4. Escalate with Context
|
|
58
|
+
|
|
59
|
+
**When:** Recovery attempts have been exhausted, or the failure requires human judgment.
|
|
60
|
+
|
|
61
|
+
**Pattern:**
|
|
62
|
+
1. Summarize what was attempted and what failed
|
|
63
|
+
2. Include the exact error messages
|
|
64
|
+
3. State what you believe the root cause is
|
|
65
|
+
4. Suggest next steps or who might be able to help
|
|
66
|
+
5. Hand off to the coordinator or the appropriate specialist
|
|
67
|
+
|
|
68
|
+
**Example:** After 3 failed build attempts → "Build fails on line 42 with null reference. Tried X, Y, Z. Likely a design issue in the Foo module. Recommend the code owner review."
|
|
69
|
+
|
|
70
|
+
---
|
|
71
|
+
|
|
72
|
+
## 5. Graceful Degradation
|
|
73
|
+
|
|
74
|
+
**When:** A non-critical step fails but the overall task can still deliver value.
|
|
75
|
+
|
|
76
|
+
**Pattern:**
|
|
77
|
+
1. Determine if the failed step is critical to the task outcome
|
|
78
|
+
2. If non-critical, log the failure and continue
|
|
79
|
+
3. Deliver partial results with a clear note of what was skipped
|
|
80
|
+
4. Offer to retry the skipped step separately
|
|
81
|
+
|
|
82
|
+
**Example:** Generating a report with 5 sections — section 3 data source is unavailable → produce the report with 4 sections, note that section 3 was skipped and why.
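
The report example can be sketched as below. `build_report` is a hypothetical helper that treats every section as non-critical, which is an assumption; a real agent would first decide which steps are critical (step 1 of the pattern).

```python
def build_report(sections):
    """sections maps a title to a zero-argument callable returning text.

    Returns (report_text, skipped), where skipped lists each section
    that failed together with the reason.
    """
    parts, skipped = [], []
    for title, produce in sections.items():
        try:
            parts.append(f"## {title}\n{produce()}")
        except Exception as error:
            # Non-critical failure: record it and keep going.
            skipped.append(f"{title}: {error}")
    if skipped:
        notes = "\n".join(f"- {item}" for item in skipped)
        parts.append(f"## Skipped\n{notes}")
    return "\n\n".join(parts), skipped
```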
|
|
83
|
+
|
|
84
|
+
---
|
|
85
|
+
|
|
86
|
+
## Applying These Patterns
|
|
87
|
+
|
|
88
|
+
Each agent should reference these patterns in their charter's `## Error Recovery` section, tailored to their domain. The charter should list the agent's most common failure modes and map each to the appropriate pattern above.
|
|
89
|
+
|
|
90
|
+
**Selection guide:**
|
|
91
|
+
|
|
92
|
+
| Failure Type | Primary Pattern | Fallback Pattern |
|
|
93
|
+
|---|---|---|
|
|
94
|
+
| Network/API transient | Retry with Backoff | Escalate with Context |
|
|
95
|
+
| Tool/dependency missing | Fallback Alternatives | Escalate with Context |
|
|
96
|
+
| Build/test error | Diagnose-and-Fix | Escalate with Context |
|
|
97
|
+
| Auth/permissions | Retry with Backoff | Escalate with Context |
|
|
98
|
+
| Non-critical data missing | Graceful Degradation | — |
|
|
99
|
+
| Unknown/novel error | Escalate with Context | — |
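
The selection guide can also be expressed as a lookup table. The failure-type keys and the `choose_patterns` function are hypothetical names introduced for illustration; the pattern names are taken from the table above.

```python
# (primary pattern, fallback pattern); None means the table lists no fallback.
SELECTION_GUIDE = {
    "network_api_transient": ("Retry with Backoff", "Escalate with Context"),
    "tool_dependency_missing": ("Fallback Alternatives", "Escalate with Context"),
    "build_test_error": ("Diagnose-and-Fix", "Escalate with Context"),
    "auth_permissions": ("Retry with Backoff", "Escalate with Context"),
    "non_critical_data_missing": ("Graceful Degradation", None),
    "unknown": ("Escalate with Context", None),
}


def choose_patterns(failure_type):
    """Return (primary, fallback); unrecognized types are treated as unknown."""
    return SELECTION_GUIDE.get(failure_type, SELECTION_GUIDE["unknown"])
```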
|