ai-collab-open-system 0.1.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.aict/START_HERE.md +127 -0
- package/.aict/WORKSPACE_MANIFEST.json +91 -0
- package/.aict/acceptance/EXAMPLE.synthetic.md +49 -0
- package/.aict/acceptance/FAILURE_MODES.md +40 -0
- package/.aict/acceptance/PROMPT.md +47 -0
- package/.aict/acceptance/README.md +44 -0
- package/.aict/acceptance/TEMPLATE.md +57 -0
- package/.aict/adapters/SHARED_CORE_CONTRACT.md +106 -0
- package/.aict/adapters/claude-code/ADAPTER.md +28 -0
- package/.aict/adapters/cline/ADAPTER.md +28 -0
- package/.aict/adapters/codex/ADAPTER.md +28 -0
- package/.aict/adapters/copilot/ADAPTER.md +28 -0
- package/.aict/adapters/cursor/ADAPTER.md +28 -0
- package/.aict/adapters/windsurf/ADAPTER.md +28 -0
- package/.aict/context/EXAMPLE.synthetic.md +53 -0
- package/.aict/context/FAILURE_MODES.md +40 -0
- package/.aict/context/PROMPT.md +47 -0
- package/.aict/context/README.md +44 -0
- package/.aict/context/TEMPLATE.md +63 -0
- package/.aict/cookbook/README.md +8 -0
- package/.aict/cookbook/bridge-to-a-second-family.md +103 -0
- package/.aict/cookbook/connect-a-tool.md +67 -0
- package/.aict/cookbook/review-a-half-product.md +79 -0
- package/.aict/cookbook/run-a-first-loop.md +81 -0
- package/.aict/examples/README.md +21 -0
- package/.aict/examples/ai-coding-long-task/CASE.md +161 -0
- package/.aict/examples/ai-coding-long-task/artifacts/acceptance-card.md +36 -0
- package/.aict/examples/ai-coding-long-task/artifacts/context-package.md +30 -0
- package/.aict/examples/ai-coding-long-task/artifacts/execution-prompt.md +30 -0
- package/.aict/examples/ai-coding-long-task/artifacts/first-ai-output.md +109 -0
- package/.aict/examples/ai-coding-long-task/artifacts/guard-review.md +40 -0
- package/.aict/examples/ai-coding-long-task/artifacts/handoff-note.md +28 -0
- package/.aict/examples/ai-coding-long-task/artifacts/harvest-seed.md +28 -0
- package/.aict/examples/ai-coding-long-task/artifacts/revised-output.md +62 -0
- package/.aict/examples/content-production-harvest/CASE.md +87 -0
- package/.aict/examples/content-production-harvest/artifacts/acceptance-card.md +28 -0
- package/.aict/examples/content-production-harvest/artifacts/context-package.md +28 -0
- package/.aict/examples/content-production-harvest/artifacts/execution-prompt.md +30 -0
- package/.aict/examples/content-production-harvest/artifacts/guard-review.md +28 -0
- package/.aict/examples/content-production-harvest/artifacts/handoff-note.md +28 -0
- package/.aict/examples/content-production-harvest/artifacts/harvest-seed.md +28 -0
- package/.aict/examples/multi-tool-collaboration/CASE.md +87 -0
- package/.aict/examples/multi-tool-collaboration/artifacts/acceptance-card.md +28 -0
- package/.aict/examples/multi-tool-collaboration/artifacts/context-package.md +28 -0
- package/.aict/examples/multi-tool-collaboration/artifacts/execution-prompt.md +30 -0
- package/.aict/examples/multi-tool-collaboration/artifacts/guard-review.md +28 -0
- package/.aict/examples/multi-tool-collaboration/artifacts/handoff-note.md +28 -0
- package/.aict/examples/multi-tool-collaboration/artifacts/harvest-seed.md +28 -0
- package/.aict/examples/personal-judgment-growth-assistant/CASE.md +87 -0
- package/.aict/examples/personal-judgment-growth-assistant/artifacts/acceptance-card.md +28 -0
- package/.aict/examples/personal-judgment-growth-assistant/artifacts/context-package.md +28 -0
- package/.aict/examples/personal-judgment-growth-assistant/artifacts/execution-prompt.md +30 -0
- package/.aict/examples/personal-judgment-growth-assistant/artifacts/guard-review.md +28 -0
- package/.aict/examples/personal-judgment-growth-assistant/artifacts/handoff-note.md +28 -0
- package/.aict/examples/personal-judgment-growth-assistant/artifacts/harvest-seed.md +28 -0
- package/.aict/examples/research-knowledge-synthesis/CASE.md +87 -0
- package/.aict/examples/research-knowledge-synthesis/artifacts/acceptance-card.md +28 -0
- package/.aict/examples/research-knowledge-synthesis/artifacts/context-package.md +28 -0
- package/.aict/examples/research-knowledge-synthesis/artifacts/execution-prompt.md +30 -0
- package/.aict/examples/research-knowledge-synthesis/artifacts/guard-review.md +28 -0
- package/.aict/examples/research-knowledge-synthesis/artifacts/handoff-note.md +28 -0
- package/.aict/examples/research-knowledge-synthesis/artifacts/harvest-seed.md +28 -0
- package/.aict/guard/EXAMPLE.synthetic.md +51 -0
- package/.aict/guard/FAILURE_MODES.md +40 -0
- package/.aict/guard/PROMPT.md +47 -0
- package/.aict/guard/README.md +44 -0
- package/.aict/guard/TEMPLATE.md +60 -0
- package/.aict/handoff/EXAMPLE.synthetic.md +51 -0
- package/.aict/handoff/FAILURE_MODES.md +40 -0
- package/.aict/handoff/PROMPT.md +47 -0
- package/.aict/handoff/README.md +44 -0
- package/.aict/handoff/TEMPLATE.md +60 -0
- package/.aict/harvest/EXAMPLE.synthetic.md +51 -0
- package/.aict/harvest/FAILURE_MODES.md +40 -0
- package/.aict/harvest/PROMPT.md +47 -0
- package/.aict/harvest/README.md +44 -0
- package/.aict/harvest/TEMPLATE.md +60 -0
- package/.aict/mechanisms/README.md +34 -0
- package/.aict/mechanisms/anti-drift-partner/EXAMPLE.synthetic.md +46 -0
- package/.aict/mechanisms/anti-drift-partner/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/anti-drift-partner/PROMPT.md +75 -0
- package/.aict/mechanisms/anti-drift-partner/README.md +82 -0
- package/.aict/mechanisms/anti-drift-partner/TEMPLATE.md +74 -0
- package/.aict/mechanisms/blind-spot-scan/EXAMPLE.synthetic.md +39 -0
- package/.aict/mechanisms/blind-spot-scan/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/blind-spot-scan/PROMPT.md +72 -0
- package/.aict/mechanisms/blind-spot-scan/README.md +79 -0
- package/.aict/mechanisms/blind-spot-scan/TEMPLATE.md +70 -0
- package/.aict/mechanisms/collaboration-coach/EXAMPLE.synthetic.md +40 -0
- package/.aict/mechanisms/collaboration-coach/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/collaboration-coach/PROMPT.md +72 -0
- package/.aict/mechanisms/collaboration-coach/README.md +79 -0
- package/.aict/mechanisms/collaboration-coach/TEMPLATE.md +61 -0
- package/.aict/mechanisms/do-not-handle-yet/EXAMPLE.synthetic.md +15 -0
- package/.aict/mechanisms/do-not-handle-yet/FAILURE_MODES.md +16 -0
- package/.aict/mechanisms/do-not-handle-yet/PROMPT.md +41 -0
- package/.aict/mechanisms/do-not-handle-yet/README.md +30 -0
- package/.aict/mechanisms/do-not-handle-yet/TEMPLATE.md +38 -0
- package/.aict/mechanisms/dual-guard/EXAMPLE.synthetic.md +54 -0
- package/.aict/mechanisms/dual-guard/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/dual-guard/PROMPT.md +76 -0
- package/.aict/mechanisms/dual-guard/README.md +81 -0
- package/.aict/mechanisms/dual-guard/TEMPLATE.md +73 -0
- package/.aict/mechanisms/feedback-absorption-ledger/EXAMPLE.synthetic.md +49 -0
- package/.aict/mechanisms/feedback-absorption-ledger/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/feedback-absorption-ledger/PROMPT.md +74 -0
- package/.aict/mechanisms/feedback-absorption-ledger/README.md +81 -0
- package/.aict/mechanisms/feedback-absorption-ledger/TEMPLATE.md +69 -0
- package/.aict/mechanisms/half-product-review/EXAMPLE.synthetic.md +15 -0
- package/.aict/mechanisms/half-product-review/FAILURE_MODES.md +16 -0
- package/.aict/mechanisms/half-product-review/PROMPT.md +41 -0
- package/.aict/mechanisms/half-product-review/README.md +30 -0
- package/.aict/mechanisms/half-product-review/TEMPLATE.md +38 -0
- package/.aict/mechanisms/handoff-abc/EXAMPLE.synthetic.md +47 -0
- package/.aict/mechanisms/handoff-abc/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/handoff-abc/PROMPT.md +75 -0
- package/.aict/mechanisms/handoff-abc/README.md +82 -0
- package/.aict/mechanisms/handoff-abc/TEMPLATE.md +60 -0
- package/.aict/mechanisms/harvest-and-erc/EXAMPLE.synthetic.md +43 -0
- package/.aict/mechanisms/harvest-and-erc/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/harvest-and-erc/PROMPT.md +74 -0
- package/.aict/mechanisms/harvest-and-erc/README.md +81 -0
- package/.aict/mechanisms/harvest-and-erc/TEMPLATE.md +60 -0
- package/.aict/mechanisms/honest-calibration/EXAMPLE.synthetic.md +43 -0
- package/.aict/mechanisms/honest-calibration/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/honest-calibration/PROMPT.md +74 -0
- package/.aict/mechanisms/honest-calibration/README.md +81 -0
- package/.aict/mechanisms/honest-calibration/TEMPLATE.md +66 -0
- package/.aict/mechanisms/one-click-dispatch/EXAMPLE.synthetic.md +15 -0
- package/.aict/mechanisms/one-click-dispatch/FAILURE_MODES.md +16 -0
- package/.aict/mechanisms/one-click-dispatch/PROMPT.md +41 -0
- package/.aict/mechanisms/one-click-dispatch/README.md +30 -0
- package/.aict/mechanisms/one-click-dispatch/TEMPLATE.md +38 -0
- package/.aict/mechanisms/plain-language-first-screen/EXAMPLE.synthetic.md +15 -0
- package/.aict/mechanisms/plain-language-first-screen/FAILURE_MODES.md +16 -0
- package/.aict/mechanisms/plain-language-first-screen/PROMPT.md +41 -0
- package/.aict/mechanisms/plain-language-first-screen/README.md +30 -0
- package/.aict/mechanisms/plain-language-first-screen/TEMPLATE.md +38 -0
- package/.aict/mechanisms/root-cause-brake/EXAMPLE.synthetic.md +55 -0
- package/.aict/mechanisms/root-cause-brake/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/root-cause-brake/PROMPT.md +73 -0
- package/.aict/mechanisms/root-cause-brake/README.md +79 -0
- package/.aict/mechanisms/root-cause-brake/TEMPLATE.md +74 -0
- package/.aict/mechanisms/scout-review-controller/EXAMPLE.synthetic.md +15 -0
- package/.aict/mechanisms/scout-review-controller/FAILURE_MODES.md +16 -0
- package/.aict/mechanisms/scout-review-controller/PROMPT.md +41 -0
- package/.aict/mechanisms/scout-review-controller/README.md +30 -0
- package/.aict/mechanisms/scout-review-controller/TEMPLATE.md +38 -0
- package/.aict/mechanisms/single-tool-guard/EXAMPLE.synthetic.md +54 -0
- package/.aict/mechanisms/single-tool-guard/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/single-tool-guard/PROMPT.md +76 -0
- package/.aict/mechanisms/single-tool-guard/README.md +83 -0
- package/.aict/mechanisms/single-tool-guard/TEMPLATE.md +75 -0
- package/.aict/mechanisms/task-splitting/EXAMPLE.synthetic.md +53 -0
- package/.aict/mechanisms/task-splitting/FAILURE_MODES.md +25 -0
- package/.aict/mechanisms/task-splitting/PROMPT.md +72 -0
- package/.aict/mechanisms/task-splitting/README.md +79 -0
- package/.aict/mechanisms/task-splitting/TEMPLATE.md +76 -0
- package/.aict/modes/README.md +11 -0
- package/.aict/modes/execute.md +31 -0
- package/.aict/modes/handoff.md +29 -0
- package/.aict/modes/harvest.md +30 -0
- package/.aict/modes/review.md +28 -0
- package/.aict/modes/shape.md +34 -0
- package/.aict/privacy/COMMERCIAL_BOUNDARY.md +34 -0
- package/.aict/privacy/PRIVACY.md +36 -0
- package/.aict/privacy/REDACTION_CHECKLIST.md +12 -0
- package/.aict/profile/CANDIDATES.md +44 -0
- package/.aict/profile/EXAMPLE.synthetic.md +49 -0
- package/.aict/profile/FAILURE_MODES.md +40 -0
- package/.aict/profile/PROMPT.md +47 -0
- package/.aict/profile/README.md +44 -0
- package/.aict/profile/TEMPLATE.md +57 -0
- package/.aict/prompts/acceptance-definition.md +109 -0
- package/.aict/prompts/guard-review.md +116 -0
- package/.aict/prompts/handoff-generation.md +110 -0
- package/.aict/prompts/harvest-extraction.md +110 -0
- package/.aict/prompts/mode-switching.md +66 -0
- package/.aict/prompts/profile-creation.md +66 -0
- package/.aict/prompts/profile-refinement.md +66 -0
- package/.aict/prompts/project-context-packaging.md +113 -0
- package/.aict/prompts/red-team-challenge.md +106 -0
- package/.aict/prompts/rule-update-proposal.md +114 -0
- package/.aict/prompts/workflow-reset.md +109 -0
- package/.aict/roles/README.md +18 -0
- package/.aict/roles/executor.md +34 -0
- package/.aict/roles/harvester.md +33 -0
- package/.aict/roles/owner-controller.md +38 -0
- package/.aict/roles/scout.md +33 -0
- package/.aict/roles/supervisor.md +34 -0
- package/.aict/roles/system-guardian.md +34 -0
- package/.aict/skills/acceptance/SKILL.md +43 -0
- package/.aict/skills/context/SKILL.md +44 -0
- package/.aict/skills/evidence-pack/SKILL.md +42 -0
- package/.aict/skills/guard/SKILL.md +46 -0
- package/.aict/skills/handoff/SKILL.md +44 -0
- package/.aict/skills/harvest/SKILL.md +44 -0
- package/.aict/skills/mode-switch/SKILL.md +42 -0
- package/.aict/skills/profile/SKILL.md +42 -0
- package/.aict/skills/red-team/SKILL.md +42 -0
- package/.aict/skills/single-tool-guard/SKILL.md +42 -0
- package/.aict/state/CURRENT_STATE.md +13 -0
- package/.aict/state/DECISIONS.md +7 -0
- package/.aict/state/TASK_LOG.md +7 -0
- package/.aict/state/evidence.jsonl +2 -0
- package/.aict/state/learning-ledger.jsonl +1 -0
- package/.aict/state/receipts.jsonl +1 -0
- package/.aict/state/runs.jsonl +1 -0
- package/.aict/state/tasks.jsonl +1 -0
- package/.aict/walkthroughs/10-minute-your-task.md +107 -0
- package/.aict/walkthroughs/10-minute.md +43 -0
- package/.aict/walkthroughs/30-minute.md +22 -0
- package/.aict/walkthroughs/60-minute.md +27 -0
- package/.aict/walkthroughs/synthetic-loop-transcript.md +43 -0
- package/CHANGELOG.md +23 -0
- package/CODE_OF_CONDUCT.md +20 -0
- package/CONTRIBUTING.md +30 -0
- package/KNOWN_LIMITATIONS.md +54 -0
- package/LICENSE +199 -0
- package/PRODUCT_CONTRACT.md +446 -0
- package/README.md +245 -0
- package/RELEASE_CHECKLIST.md +78 -0
- package/SECURITY.md +56 -0
- package/START_HERE.md +89 -0
- package/bin/ai-collab.js +2 -0
- package/docs/DOGFOOD.md +85 -0
- package/docs/FEEDBACK.md +61 -0
- package/docs/FIRST_EXPERIENCE_SPEC.md +32 -0
- package/docs/FREE_VS_PAID.md +53 -0
- package/docs/PUBLIC_BOUNDARY.md +36 -0
- package/docs/PUBLIC_MAPPING.md +178 -0
- package/docs/RELEASE_PRIORITY.md +23 -0
- package/docs/WHY_THIS_EXISTS.md +36 -0
- package/docs/open-system/00-start-here.md +60 -0
- package/docs/open-system/01-ai-collaboration-os.md +33 -0
- package/docs/open-system/02-six-layer-architecture.md +45 -0
- package/docs/open-system/03-role-system.md +33 -0
- package/docs/open-system/04-core-mechanisms.md +34 -0
- package/docs/open-system/05-failure-patterns.md +31 -0
- package/docs/open-system/06-how-to-adapt-to-your-workflow.md +31 -0
- package/package.json +69 -0
- package/privacy-manifest.json +78 -0
- package/privacy-scan.local.json.example +18 -0
- package/scripts/lib/forbidden-in-pack.js +55 -0
- package/scripts/pack-check.js +154 -0
- package/scripts/privacy-scan.js +487 -0
- package/scripts/validate-contract.js +160 -0
- package/src/adapters.js +590 -0
- package/src/bootstrap.js +1184 -0
- package/src/catalog.js +2723 -0
- package/src/cli.js +2899 -0
- package/src/dialogue.js +470 -0
- package/src/i18n.js +1034 -0
- package/src/ledger.js +2011 -0
- package/src/render.js +1381 -0
- package/src/sendmodel.js +452 -0
- package/src/validate.js +1307 -0
- package/src/workspace.js +1679 -0
- package/tests/contract.test.js +8514 -0
|
@@ -0,0 +1,109 @@
|
|
|
1
|
+
# Workflow reset
|
|
2
|
+
|
|
3
|
+
Purpose: Recover from drift by restating goal, state, acceptance, and next action.
|
|
4
|
+
|
|
5
|
+
## Scenario
|
|
6
|
+
|
|
7
|
+
Use when a thread has become confusing, has nested sub-tasks, or the assistant can no longer explain why the current step serves the main goal.
|
|
8
|
+
|
|
9
|
+
## Input requirements
|
|
10
|
+
|
|
11
|
+
- Original goal or latest confirmed goal.
|
|
12
|
+
- What has been done.
|
|
13
|
+
- Where the thread drifted.
|
|
14
|
+
- Known blockers and verification state.
|
|
15
|
+
|
|
16
|
+
## Operating steps
|
|
17
|
+
|
|
18
|
+
1. Stop adding new work. A reset is a re-grounding, not a fresh brainstorm; resist the pull to fix one more thing before you know where you stand.
|
|
19
|
+
2. Restate the four components explicitly, in order: GOAL (the main line in one sentence), CURRENT STATE (where things actually are now), ACCEPTANCE (what 'done' means for the main line), NEXT ACTION (the single concrete first step). All four are required — a reset missing any one of them leaves the thread still adrift.
|
|
20
|
+
3. Re-measure the current state instead of trusting the pre-drift picture. The numbers, counts, file states, and 'it already works' assumptions from before the drift are exactly what may have gone stale; re-run the cheap deterministic check (count it, list it, open it, reproduce it) rather than carrying a remembered value forward. A figure quoted from earlier in the thread is treated as 'pre-drift, unverified' until re-measured.
|
|
21
|
+
4. Separate the work honestly into done / pending / blocked / unverified, and never let a claimed-but-unproven item sit in 'done'. A check claimed without evidence is unverified, not complete.
|
|
22
|
+
5. Decide whether to close the current sub-task or return to the main line, judged by what serves the goal — not by which tangent is easiest to keep pushing. A genuinely useful side-finding that does not serve the main line is parked as a residual, not pursued now.
|
|
23
|
+
6. State one concrete next action and the exact baseline to resume from (the precise point, file, or command to start at), so work re-enters cleanly instead of re-deriving the starting point.
|
|
24
|
+
|
|
25
|
+
## Copy-paste prompt
|
|
26
|
+
|
|
27
|
+
```text
|
|
28
|
+
You are helping me with workflow reset in a local-first AI collaboration workspace.
|
|
29
|
+
|
|
30
|
+
Task: Recover from drift by restating goal, state, acceptance, and next action.
|
|
31
|
+
|
|
32
|
+
Trigger:
|
|
33
|
+
Use when a thread has drifted: it has nested into sub-tasks, chased a tangent, run long enough that state is fuzzy, or reached the point where the assistant can no longer explain why the current step serves the main goal. Reset before doing more work, not after producing more output on a shaky base.
|
|
34
|
+
|
|
35
|
+
Do not use when:
|
|
36
|
+
Skip the formal reset on a short, on-track thread with one clear goal and a state you can hold in your head. Running a full four-part reset on a task that never drifted is ceremony, and ceremony with no payoff trains people to ignore the reset when the thread has genuinely lost its thread.
|
|
37
|
+
|
|
38
|
+
Input:
|
|
39
|
+
The original goal, or the latest goal the owner actually confirmed. What has been done so far and what was claimed about it. Where the thread drifted — the point it stopped serving the main goal. The known blockers and the current verification state, including which 'done' claims have real evidence and which are only asserted.
|
|
40
|
+
|
|
41
|
+
Process:
|
|
42
|
+
1. Stop adding new work. A reset is a re-grounding, not a fresh brainstorm; resist the pull to fix one more thing before you know where you stand.
|
|
43
|
+
2. Restate the four components explicitly, in order: GOAL (the main line in one sentence), CURRENT STATE (where things actually are now), ACCEPTANCE (what 'done' means for the main line), NEXT ACTION (the single concrete first step). All four are required — a reset missing any one of them leaves the thread still adrift.
|
|
44
|
+
3. Re-measure the current state instead of trusting the pre-drift picture. The numbers, counts, file states, and 'it already works' assumptions from before the drift are exactly what may have gone stale; re-run the cheap deterministic check (count it, list it, open it, reproduce it) rather than carrying a remembered value forward. A figure quoted from earlier in the thread is treated as 'pre-drift, unverified' until re-measured.
|
|
45
|
+
4. Separate the work honestly into done / pending / blocked / unverified, and never let a claimed-but-unproven item sit in 'done'. A check claimed without evidence is unverified, not complete.
|
|
46
|
+
5. Decide whether to close the current sub-task or return to the main line, judged by what serves the goal — not by which tangent is easiest to keep pushing. A genuinely useful side-finding that does not serve the main line is parked as a residual, not pursued now.
|
|
47
|
+
6. State one concrete next action and the exact baseline to resume from (the precise point, file, or command to start at), so work re-enters cleanly instead of re-deriving the starting point.
|
|
48
|
+
|
|
49
|
+
Output shape:
|
|
50
|
+
- Goal: the main line in one sentence.
|
|
51
|
+
- Current state (re-measured): where things actually are now, with the cheap check that re-confirmed it, and any figure still carried from before the drift flagged as unverified.
|
|
52
|
+
- Acceptance: what 'done' means for the main line.
|
|
53
|
+
- Drift point: where and why the thread stopped serving the goal.
|
|
54
|
+
- State table: done / pending / blocked / unverified, kept separate, with no unproven item parked in done.
|
|
55
|
+
- Decision: close the sub-task or return to the main line, with the reason it serves the goal.
|
|
56
|
+
- Parked residuals: useful side-findings that do not serve the main line, recorded for later rather than chased now.
|
|
57
|
+
- Next action: one concrete first step plus the exact baseline to resume from.
|
|
58
|
+
|
|
59
|
+
Pass bar (do not pass unless all hold):
|
|
60
|
+
- All four components — goal, current state, acceptance, next action — are restated explicitly, none left implicit.
|
|
61
|
+
- The current state was re-measured with a real check, and any number carried from before the drift is labeled unverified rather than reused as truth.
|
|
62
|
+
- Done / pending / blocked / unverified are cleanly separated, with no claimed-but-unproven item sitting in done.
|
|
63
|
+
- The close-or-return decision is justified by what serves the main goal, not by which tangent is easiest to continue.
|
|
64
|
+
- There is exactly one concrete next action and an exact baseline to resume from, so a cold restart is unnecessary.
|
|
65
|
+
|
|
66
|
+
Reject bar (send back if any holds):
|
|
67
|
+
- The reset turns into a new brainstorm that adds scope instead of re-grounding the existing goal.
|
|
68
|
+
- The goal is restated but pre-drift numbers or 'it already works' assumptions are carried forward without re-measuring — the classic stale-baseline trap.
|
|
69
|
+
- A claimed-but-unproven item is filed under done, so the state table reads cleaner than the work actually is.
|
|
70
|
+
- The thread returns to the most recent tangent because it is easiest, even though it does not serve the main line.
|
|
71
|
+
- The card ends with no single concrete next action, or with no exact baseline, so the next session must re-derive where to start.
|
|
72
|
+
|
|
73
|
+
Rules:
|
|
74
|
+
- Work only from the material I provide.
|
|
75
|
+
- Keep private material local; use public-safe synthetic wording for examples.
|
|
76
|
+
- Label facts, assumptions, and unverified claims.
|
|
77
|
+
- Do not claim to understand my private business beyond the provided context.
|
|
78
|
+
|
|
79
|
+
Material:
|
|
80
|
+
[paste redacted material here]
|
|
81
|
+
```
|
|
82
|
+
|
|
83
|
+
## Expected output
|
|
84
|
+
|
|
85
|
+
- Main goal
|
|
86
|
+
- Current sub-task
|
|
87
|
+
- Drift point
|
|
88
|
+
- State table
|
|
89
|
+
- Recommended next action
|
|
90
|
+
|
|
91
|
+
## Counter-example
|
|
92
|
+
|
|
93
|
+
Synthetic: a release thread drifts into debugging a package-install cache error. The lazy reset writes 'goal: ship the release; we already smoke-tested the build earlier, so next just publish.' But re-measuring the current state shows the earlier smoke test ran against an old build that predates the cache fix — the carried-forward 'already tested' was a pre-drift assumption, now stale. The disciplined reset re-runs the smoke test in a clean temp directory (re-measure, not remember), marks the cache error as an environment-specific residual that does not block the main line, files 'release smoke test' as unverified rather than done, and sets one concrete next action with an exact baseline: run the packaged build's smoke test against the post-fix build before publishing. Trusting the pre-drift 'already tested' would have shipped a release that was never actually verified.
|
|
94
|
+
|
|
95
|
+
## Failure modes
|
|
96
|
+
|
|
97
|
+
- Treating reset as a new brainstorm.
|
|
98
|
+
- Continuing the most recent tangent because it is easier.
|
|
99
|
+
- Claiming closure while verification is missing.
|
|
100
|
+
- Restating the goal but carrying forward the pre-drift numbers and assumptions as if they were still true.
|
|
101
|
+
- Producing a tidy reset card with no single concrete next action and no exact baseline to resume from.
|
|
102
|
+
|
|
103
|
+
## Example
|
|
104
|
+
|
|
105
|
+
Reset a release thread after debugging npm cache issues: mark cache as environment-specific, return to package smoke test with a temp cache.
|
|
106
|
+
|
|
107
|
+
## Use with
|
|
108
|
+
|
|
109
|
+
Claude Code / Codex / Cursor / Windsurf / Copilot / Cline.
|
|
@@ -0,0 +1,18 @@
|
|
|
1
|
+
# Roles
|
|
2
|
+
|
|
3
|
+
Roles keep the AI Collaboration Open System human-centered. They define responsibility, not hidden authority.
|
|
4
|
+
|
|
5
|
+
Each role card below is a responsibility matrix, not a vibe. It states six things so two different tools (or two different sessions) read the same boundary: what the role CAN do, what it CANNOT do, what it takes in, what it produces, who it escalates to when something exceeds its authority, and one synthetic overreach example showing what breaks when the boundary is crossed.
|
|
6
|
+
|
|
7
|
+
## Public roles
|
|
8
|
+
|
|
9
|
+
- Owner / controller: decides goals and acceptance, and holds final judgment.
|
|
10
|
+
- Executor: produces the artifact inside the agreed boundary.
|
|
11
|
+
- System guardian: challenges risk and evidence before output is trusted.
|
|
12
|
+
- Scout: gathers options and external facts before a decision.
|
|
13
|
+
- Supervisor: translates the AI's state into plain language and watches the main line and the wording, distinct from the guardian who watches the facts.
|
|
14
|
+
- Harvester: extracts reusable learning after a loop.
|
|
15
|
+
|
|
16
|
+
## Why a matrix and not just "does / does not"
|
|
17
|
+
|
|
18
|
+
A two-line "does / does not" tells a tool the gist but not the seams: where work enters, where it leaves, and who catches it when it exceeds the role. The missing seams are exactly where collaboration fails — an executor quietly becomes a rule-changer, a guardian quietly becomes an editor, a scout quietly becomes a decider, a supervisor quietly becomes a second guardian. Naming inputs, outputs, the escalation target, and a concrete overreach example closes those seams.
|
|
@@ -0,0 +1,34 @@
|
|
|
1
|
+
# Executor
|
|
2
|
+
|
|
3
|
+
## Purpose
|
|
4
|
+
|
|
5
|
+
Produce the requested artifact inside the context and acceptance boundary, and prove it with evidence rather than claims.
|
|
6
|
+
|
|
7
|
+
## Can do
|
|
8
|
+
|
|
9
|
+
- Implement the task exactly as instructed, working from the provided files.
|
|
10
|
+
- Change the agreed artifacts, save state, and record what was done.
|
|
11
|
+
- Self-verify the work and report changed files or sections with verification evidence.
|
|
12
|
+
|
|
13
|
+
## Cannot do
|
|
14
|
+
|
|
15
|
+
- Make the controller's decisions or accept its own work as done.
|
|
16
|
+
- Silently expand scope beyond the instructed task.
|
|
17
|
+
- Cross a core boundary (governance rules, security-sensitive areas, anything outside the task) without stopping to ask first.
|
|
18
|
+
|
|
19
|
+
## Inputs
|
|
20
|
+
|
|
21
|
+
- A task packet: the goal, the boundary, the relevant files, known constraints, and the acceptance criteria.
|
|
22
|
+
|
|
23
|
+
## Outputs
|
|
24
|
+
|
|
25
|
+
- The requested artifact.
|
|
26
|
+
- A three-part report: what changed, the actual verification evidence, and what remains unverified.
|
|
27
|
+
|
|
28
|
+
## Escalates to
|
|
29
|
+
|
|
30
|
+
- The controller — whenever the task is ambiguous, the scope needs to grow, or a core boundary is in the way, the executor stops and hands the decision back up rather than deciding for itself.
|
|
31
|
+
|
|
32
|
+
## Overreach example (synthetic)
|
|
33
|
+
|
|
34
|
+
While fixing one small bug, an executor notices a shared rule it thinks is wrong and edits it on the spot without asking. A scoped one-line fix has now quietly become a change to the rules everyone else relies on — a change no one reviewed and no one approved. The next session inherits an altered rule with no decision behind it, and tracing why behavior changed becomes a hunt because the change was never surfaced as a decision.
|
|
@@ -0,0 +1,33 @@
|
|
|
1
|
+
# Harvester
|
|
2
|
+
|
|
3
|
+
## Purpose
|
|
4
|
+
|
|
5
|
+
Extract reusable learning after a loop. The harvester proposes; it does not file to the source of truth on its own.
|
|
6
|
+
|
|
7
|
+
## Can do
|
|
8
|
+
|
|
9
|
+
- Sweep a conversation or finished loop and lift the reusable bits into harvest cards.
|
|
10
|
+
- Redact private material into a general, public-safe form before anything is proposed.
|
|
11
|
+
- Draft candidate prompts, decisions, lessons, and rule suggestions for confirmation.
|
|
12
|
+
|
|
13
|
+
## Cannot do
|
|
14
|
+
|
|
15
|
+
- Write directly into the knowledge-base source of truth.
|
|
16
|
+
- Accept its own cards as final, or skip the human confirmation step.
|
|
17
|
+
- Generalize a single incident into a permanent rule without evidence and without sign-off.
|
|
18
|
+
|
|
19
|
+
## Inputs
|
|
20
|
+
|
|
21
|
+
- The conversation, loop, or raw material to harvest from.
|
|
22
|
+
|
|
23
|
+
## Outputs
|
|
24
|
+
|
|
25
|
+
- Harvest cards (one item per card) in a public-safe form, presented as candidates awaiting confirmation — not filed yet.
|
|
26
|
+
|
|
27
|
+
## Escalates to
|
|
28
|
+
|
|
29
|
+
- The owner / controller — nothing lands in the knowledge base until the owner confirms each card; the harvester stages, the owner files.
|
|
30
|
+
|
|
31
|
+
## Overreach example (synthetic)
|
|
32
|
+
|
|
33
|
+
A harvester writes a card and, without waiting for confirmation, files it straight into the knowledge base. An unverified "lesson" has now been frozen into a standing rule that future loops will obey — except no one checked whether it was actually true. A one-off observation becomes durable doctrine by accident, and later work is silently shaped by a rule that was never approved and may be wrong.
|
|
@@ -0,0 +1,38 @@
|
|
|
1
|
+
# Owner / Controller
|
|
2
|
+
|
|
3
|
+
## Purpose
|
|
4
|
+
|
|
5
|
+
Keep human judgment at the center of the workflow. The controller is the top of the chain: it sets direction and owns the final call, so the separation of decision from production stays intact.
|
|
6
|
+
|
|
7
|
+
## Can do
|
|
8
|
+
|
|
9
|
+
- Define the goal, the scope, and the acceptance criteria for a piece of work.
|
|
10
|
+
- Issue instructions and choose between options the executor or scout brings back.
|
|
11
|
+
- Accept or reject delivered work, and decide when residual risk is acceptable.
|
|
12
|
+
- Make the final call and close the loop.
|
|
13
|
+
|
|
14
|
+
## Cannot do
|
|
15
|
+
|
|
16
|
+
- Approve its own judgment by pretending agreement is independent review (it must not be both author and reviewer of the same decision).
|
|
17
|
+
- Make the guardian's call for it, or treat its own opinion as a guard pass.
|
|
18
|
+
- Step in and personally do the heavy production work that should have been delegated, because then no independent reviewer is left to check it.
|
|
19
|
+
|
|
20
|
+
## Inputs
|
|
21
|
+
|
|
22
|
+
- The task or problem to be solved.
|
|
23
|
+
- Proposed plans, options, and trade-offs from the executor or scout.
|
|
24
|
+
- Returned artifacts, guard verdicts, and harvest cards awaiting confirmation.
|
|
25
|
+
|
|
26
|
+
## Outputs
|
|
27
|
+
|
|
28
|
+
- Clear instructions and a defined boundary for each piece of work.
|
|
29
|
+
- Acceptance or rejection decisions with the reason.
|
|
30
|
+
- The final, recorded decision that lets the loop close.
|
|
31
|
+
|
|
32
|
+
## Escalates to
|
|
33
|
+
|
|
34
|
+
- No one above it — the controller is the top of the responsibility chain. When it lacks facts it tasks a scout; when it needs an independent check it tasks a guardian; but the decision itself does not get handed upward.
|
|
35
|
+
|
|
36
|
+
## Overreach example (synthetic)
|
|
37
|
+
|
|
38
|
+
A controller decides a feature is simple, sits down, and writes the implementation itself instead of delegating it. Because the controller is now the author, there is no independent party left to review whether the code actually meets acceptance — the controller would be grading its own homework. The separation that the whole system relies on collapses, and a defect ships unnoticed because the only person who could have caught it is the one who wrote it.
|
|
@@ -0,0 +1,33 @@
|
|
|
1
|
+
# Scout
|
|
2
|
+
|
|
3
|
+
## Purpose
|
|
4
|
+
|
|
5
|
+
Collect options and decision-changing evidence before the controller chooses. The scout gathers facts; it does not judge them.
|
|
6
|
+
|
|
7
|
+
## Can do
|
|
8
|
+
|
|
9
|
+
- Gather external facts, candidate paths, and industry comparisons relevant to a pending decision.
|
|
10
|
+
- List evidence gaps and label how time-sensitive each finding is.
|
|
11
|
+
- Bring back sourced material so the controller can decide on solid ground.
|
|
12
|
+
|
|
13
|
+
## Cannot do
|
|
14
|
+
|
|
15
|
+
- Interpret, judge, or rule on the evidence it gathers.
|
|
16
|
+
- Recommend a path or lean toward an option ("you should pick A").
|
|
17
|
+
- Turn exploration into implementation, or decide anything itself.
|
|
18
|
+
|
|
19
|
+
## Inputs
|
|
20
|
+
|
|
21
|
+
- The specific question or unknown to investigate, framed by the controller.
|
|
22
|
+
|
|
23
|
+
## Outputs
|
|
24
|
+
|
|
25
|
+
- A fact card: candidate options and findings, each with its source and a freshness label, and no verdict attached.
|
|
26
|
+
|
|
27
|
+
## Escalates to
|
|
28
|
+
|
|
29
|
+
- The controller — the scout delivers sourced facts and hands the synthesis and the decision upward, keeping fact-gathering separate from judgment.
|
|
30
|
+
|
|
31
|
+
## Overreach example (synthetic)
|
|
32
|
+
|
|
33
|
+
Asked only to gather options, a scout instead returns "you should choose A." By folding a recommendation into the fact-gathering, it has mixed evidence with judgment — and now the controller can no longer reason from clean facts, because the conclusion is already baked in. The independence of the later decision is contaminated before it even starts, and the scout has quietly made a call that was never its to make.
|
|
@@ -0,0 +1,34 @@
|
|
|
1
|
+
# Supervisor
|
|
2
|
+
|
|
3
|
+
## Purpose
|
|
4
|
+
|
|
5
|
+
Lower the human's cost of watching the work, without taking the wheel. The supervisor is a state translator: it turns what the AI is doing into plain language and watches whether the work is still on track. It does not steer direction and it does not check facts line by line — that is the guardian's job. The split is deliberate: the guardian watches the facts (is this claim backed, does this code work, did scope drift); the supervisor watches the main line and the wording (are we still going where we meant to, and is the AI being honest about how done it is).
|
|
6
|
+
|
|
7
|
+
## Can do
|
|
8
|
+
|
|
9
|
+
- Translate the AI's current state into plain language for the human: where the main line is, what just happened, what the next step is.
|
|
10
|
+
- Watch three things on every pass: (1) is the main line drifting — is the work quietly chasing a side-quest while the real goal stalls; (2) is "done-pending-verification" being passed off as "accepted" — is unproven work being described as finished; (3) is a decision being punted back to the human — is the AI bouncing a choice it should have made itself.
|
|
11
|
+
- Issue one of three plain verdicts: SEND (it can go forward), SEND WITH A CORRECTION (a small fix rides along, no need to redo), or STOP AND FIX FIRST (something on the main line is wrong enough to halt).
|
|
12
|
+
|
|
13
|
+
## Cannot do
|
|
14
|
+
|
|
15
|
+
- Steer the direction or make the call — it flags, it does not decide, and "looks on track to me" is not the human's approval.
|
|
16
|
+
- Do the guardian's job: it does not write a formal verdict on facts, does not hunt code-level defects, does not rule on evidence quality. When it strays into line-by-line fact-checking it has stopped being a supervisor and become a second guardian.
|
|
17
|
+
- Open a side issue into a new main line, or let the human get pulled into doing a judgment the AI should have made.
|
|
18
|
+
|
|
19
|
+
## Inputs
|
|
20
|
+
|
|
21
|
+
- The AI's current state to translate (a status update, a handoff packet, a plan, a progress report) and the main line it is supposed to be serving, so drift can be measured across steps, not just within one.
|
|
22
|
+
|
|
23
|
+
## Outputs
|
|
24
|
+
|
|
25
|
+
- A short plain-language status read (where the main line is, the current step, the next action) plus the three-question check, ending in one of the three verdicts: send / send-with-a-correction / stop-and-fix-first.
|
|
26
|
+
- For a send-with-a-correction, the exact small fix the next step should carry; for a stop-and-fix-first, what specifically must be repaired before the work moves on.
|
|
27
|
+
|
|
28
|
+
## Escalates to
|
|
29
|
+
|
|
30
|
+
- The owner / controller — the supervisor reports its plain-language read and its verdict, then the human decides. For a true facts-and-evidence judgment it hands off to the guardian rather than ruling on facts itself.
|
|
31
|
+
|
|
32
|
+
## Overreach example (synthetic)
|
|
33
|
+
|
|
34
|
+
A supervisor reviewing a status update stops translating and starts grading the code: it digs into a function, declares the implementation correct, and issues a pass on the technical work. But checking facts and clearing evidence is the guardian's role, and now the same pass mixes "the main line looks on track" with "the code is verified" — two different judgments the human can no longer tell apart. The supervisor has quietly become a second guardian, the real fact-check never gets an independent pass, and a plain-language safety net the human relied on to stay cheap has turned into one more heavyweight reviewer.
|
|
@@ -0,0 +1,34 @@
|
|
|
1
|
+
# System Guardian
|
|
2
|
+
|
|
3
|
+
## Purpose
|
|
4
|
+
|
|
5
|
+
Challenge output before it becomes trusted state. The guardian is a referee, not a player: it finds problems and points to evidence, but it does not take over the work.
|
|
6
|
+
|
|
7
|
+
## Can do
|
|
8
|
+
|
|
9
|
+
- Independently review the artifact for acceptance fit, privacy, evidence quality, and handoff readiness.
|
|
10
|
+
- Surface blind spots and name required fixes, leading with findings.
|
|
11
|
+
- Issue one of the four standard verdicts (pass / reject / insufficient_evidence / pass_with_risk) with the guard level (L0-L4) for the evidence seen, and point every finding to a specific line, section, or missing piece of evidence. A plain pass needs L3+ (a cross-family evidence pack); a pass_with_risk needs an explicit owner sign-off before it counts as accepted.
|
|
12
|
+
|
|
13
|
+
## Cannot do
|
|
14
|
+
|
|
15
|
+
- Only find — it does not give orders, and it does not execute.
|
|
16
|
+
- Rewrite or fix the artifact itself by default (fixing what it judges makes it both referee and player).
|
|
17
|
+
- Make the decision for anyone; a concern is not an approval, and an approval is not the controller's acceptance.
|
|
18
|
+
|
|
19
|
+
## Inputs
|
|
20
|
+
|
|
21
|
+
- The object under review: the artifact, plus its acceptance card, context boundary, and the verification evidence behind any completion claim.
|
|
22
|
+
|
|
23
|
+
## Outputs
|
|
24
|
+
|
|
25
|
+
- A verdict and a findings list, each finding tied to concrete evidence.
|
|
26
|
+
- Required fixes and named residual risk, handed back for a decision — not applied directly.
|
|
27
|
+
|
|
28
|
+
## Escalates to
|
|
29
|
+
|
|
30
|
+
- The controller — the guardian reports findings and a verdict, then the controller decides what to fix, what to accept as residual risk, and whether to close.
|
|
31
|
+
|
|
32
|
+
## Overreach example (synthetic)
|
|
33
|
+
|
|
34
|
+
A guardian reviewing an artifact spots a flaw and, instead of reporting it, just edits the artifact to fix it. Now the same party both judged the work and changed it, so its independence is gone: no one is left to check whether the "fix" is actually correct or whether it quietly broke something else. The verdict can no longer be trusted, because the referee walked onto the field and started playing.
|
|
@@ -0,0 +1,43 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: acceptance
|
|
3
|
+
description: Define pass criteria and verification evidence.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# acceptance skill
|
|
7
|
+
|
|
8
|
+
## When to use
|
|
9
|
+
|
|
10
|
+
Use before any implementation, writing, research, or cleanup task where completion could otherwise be subjective.
|
|
11
|
+
|
|
12
|
+
## Inputs
|
|
13
|
+
|
|
14
|
+
- The shared core contract.
|
|
15
|
+
- A redacted task context.
|
|
16
|
+
- The relevant layer template.
|
|
17
|
+
- Any acceptance criteria or review findings.
|
|
18
|
+
|
|
19
|
+
## Process
|
|
20
|
+
|
|
21
|
+
1. Convert the goal into deliverables.
|
|
22
|
+
2. Define pass criteria a reviewer can inspect.
|
|
23
|
+
3. Name rejected states explicitly.
|
|
24
|
+
4. Attach the exact command or manual check required before completion.
|
|
25
|
+
|
|
26
|
+
## Output
|
|
27
|
+
|
|
28
|
+
- Deliverables
|
|
29
|
+
- Pass criteria
|
|
30
|
+
- Required checks
|
|
31
|
+
- Rejected states
|
|
32
|
+
- Evidence needed
|
|
33
|
+
- Decision needed
|
|
34
|
+
|
|
35
|
+
## Safety
|
|
36
|
+
|
|
37
|
+
- Do not accept unverified claims.
|
|
38
|
+
- Do not let passing tests substitute for missing user-path validation.
|
|
39
|
+
- Do not move acceptance after implementation.
|
|
40
|
+
|
|
41
|
+
## Example
|
|
42
|
+
|
|
43
|
+
For a first-run CLI, acceptance includes real bin command, no fallback target, clear errors, and temp install smoke.
|
|
@@ -0,0 +1,44 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: context
|
|
3
|
+
description: Package task context for another AI session.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# context skill
|
|
7
|
+
|
|
8
|
+
## When to use
|
|
9
|
+
|
|
10
|
+
Use at the beginning of multi-step work, cross-tool work, reviews, or any task that may be resumed later.
|
|
11
|
+
|
|
12
|
+
## Inputs
|
|
13
|
+
|
|
14
|
+
- The shared core contract.
|
|
15
|
+
- A redacted task context.
|
|
16
|
+
- The relevant layer template.
|
|
17
|
+
- Any acceptance criteria or review findings.
|
|
18
|
+
|
|
19
|
+
## Process
|
|
20
|
+
|
|
21
|
+
1. State the goal and non-goals.
|
|
22
|
+
2. List artifacts and evidence the next assistant should inspect.
|
|
23
|
+
3. Split facts, assumptions, decisions, risks, and open questions.
|
|
24
|
+
4. End with a single next action.
|
|
25
|
+
|
|
26
|
+
## Output
|
|
27
|
+
|
|
28
|
+
- Goal
|
|
29
|
+
- Current state
|
|
30
|
+
- Relevant artifacts
|
|
31
|
+
- Constraints and non-goals
|
|
32
|
+
- Facts versus assumptions
|
|
33
|
+
- Risks and open questions
|
|
34
|
+
- Next action
|
|
35
|
+
|
|
36
|
+
## Safety
|
|
37
|
+
|
|
38
|
+
- Summarize private material instead of copying it.
|
|
39
|
+
- Do not include real local paths in public examples.
|
|
40
|
+
- Do not hide uncertainty inside fluent narrative.
|
|
41
|
+
|
|
42
|
+
## Example
|
|
43
|
+
|
|
44
|
+
Package a messy release task into current version, failing checks, changed files, and the next verification command.
|
|
@@ -0,0 +1,42 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: evidence-pack
|
|
3
|
+
description: Assemble a structured evidence pack so a completion claim can be checked, not trusted.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# evidence-pack skill
|
|
7
|
+
|
|
8
|
+
## When to use
|
|
9
|
+
|
|
10
|
+
Use the moment you are about to say done, tested, fixed, or shipped, before a reviewer or the next session inherits the claim.
|
|
11
|
+
|
|
12
|
+
## Inputs
|
|
13
|
+
|
|
14
|
+
- The shared core contract.
|
|
15
|
+
- A redacted task context.
|
|
16
|
+
- The relevant layer template.
|
|
17
|
+
- Any acceptance criteria or review findings.
|
|
18
|
+
|
|
19
|
+
## Process
|
|
20
|
+
|
|
21
|
+
1. List every changed file and what changed in each, so the diff surface is explicit.
|
|
22
|
+
2. Record the exact command you ran, its captured output, and its exit code; mark anything you did not actually run.
|
|
23
|
+
3. Name the unverified items: edges you skipped, paths you did not exercise, and assumptions still standing.
|
|
24
|
+
4. Append each piece with `ai-collab evidence add` (kind diff / output / file / rerun) so the proof lives in the ledger, then point the receipt at those rows.
|
|
25
|
+
|
|
26
|
+
## Output
|
|
27
|
+
|
|
28
|
+
- Changed files with per-file intent
|
|
29
|
+
- Command, captured output, and exit code
|
|
30
|
+
- Reproduction steps a reviewer can rerun
|
|
31
|
+
- Unverified items and standing assumptions
|
|
32
|
+
- Ledger evidence ids the receipt cites
|
|
33
|
+
|
|
34
|
+
## Safety
|
|
35
|
+
|
|
36
|
+
- Do not write a summary in place of the real command output and exit code.
|
|
37
|
+
- Do not label a claim verified when the run did not happen; record it as unverified instead.
|
|
38
|
+
- Do not paste private paths, tokens, or raw transcripts into an evidence row.
|
|
39
|
+
|
|
40
|
+
## Example
|
|
41
|
+
|
|
42
|
+
Before claiming a parser fix is done, attach the changed file, the test command with its exit 0 output, and an unverified note that the empty-input branch was never exercised; add each as an evidence row and cite their ids on the receipt.
|
|
@@ -0,0 +1,46 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: guard
|
|
3
|
+
description: Challenge artifacts before trust, and grade how strong the evidence actually was.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# guard skill
|
|
7
|
+
|
|
8
|
+
## When to use
|
|
9
|
+
|
|
10
|
+
Use after a plan, draft, implementation, or research answer exists and before it becomes the basis for the next step.
|
|
11
|
+
|
|
12
|
+
## Inputs
|
|
13
|
+
|
|
14
|
+
- The shared core contract.
|
|
15
|
+
- A redacted task context.
|
|
16
|
+
- The relevant layer template.
|
|
17
|
+
- Any acceptance criteria or review findings.
|
|
18
|
+
|
|
19
|
+
## Process
|
|
20
|
+
|
|
21
|
+
1. Read the context and acceptance card first.
|
|
22
|
+
2. Inspect the artifact for missing evidence, privacy leaks, scope drift, and unsupported claims.
|
|
23
|
+
3. Lead with findings ordered by severity.
|
|
24
|
+
4. State the evidence level you actually saw (L0 summary only / L1 artifact but no real run / L2 author-supplied commands or tests, single tool / L3 structured evidence pack reviewed by a different model family / L4 that cross-family review AND you independently re-ran and reconciled the key evidence).
|
|
25
|
+
5. Return one of the four standard verdicts, bounded by that level: pass / reject / insufficient_evidence / pass_with_risk. A plain pass needs L3+ (the cross-family pack); a single tool tops out at pass_with_risk (L2); summary-only is insufficient_evidence (L0); an L4 pass must show a cross-family review AND your reconciled rerun output.
|
|
26
|
+
|
|
27
|
+
## Output
|
|
28
|
+
|
|
29
|
+
- Verdict (pass / reject / insufficient_evidence / pass_with_risk)
|
|
30
|
+
- Guard level (L0-L4) for the evidence you saw
|
|
31
|
+
- Findings
|
|
32
|
+
- Evidence
|
|
33
|
+
- Required fixes
|
|
34
|
+
- Residual risk
|
|
35
|
+
|
|
36
|
+
## Safety
|
|
37
|
+
|
|
38
|
+
- Do not rubber-stamp your own work.
|
|
39
|
+
- Do not review only style.
|
|
40
|
+
- Do not call work complete without fresh verification.
|
|
41
|
+
- Do not return pass above your evidence level: no pass without an L3+ cross-family pack, no pass from a single tool, no L4 pass without BOTH a cross-family pack and a reconciled rerun.
|
|
42
|
+
- Do not treat a pass_with_risk as accepted on your own — it needs an explicit owner sign-off.
|
|
43
|
+
|
|
44
|
+
## Example
|
|
45
|
+
|
|
46
|
+
Reject a case study that lacks baseline output because users cannot see why the structured loop is better than raw chat; record it as guard level L1 (artifact exists, no real run).
|
|
@@ -0,0 +1,44 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: handoff
|
|
3
|
+
description: Resume work across sessions and tools.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# handoff skill
|
|
7
|
+
|
|
8
|
+
## When to use
|
|
9
|
+
|
|
10
|
+
Use before stopping, delegating, switching AI tools, or compressing a long task into a state another session can continue.
|
|
11
|
+
|
|
12
|
+
## Inputs
|
|
13
|
+
|
|
14
|
+
- The shared core contract.
|
|
15
|
+
- A redacted task context.
|
|
16
|
+
- The relevant layer template.
|
|
17
|
+
- Any acceptance criteria or review findings.
|
|
18
|
+
|
|
19
|
+
## Process
|
|
20
|
+
|
|
21
|
+
1. Restate the goal.
|
|
22
|
+
2. Separate done, pending, blocked, and unverified work.
|
|
23
|
+
3. List changed artifacts and verification commands.
|
|
24
|
+
4. Give one concrete next action.
|
|
25
|
+
|
|
26
|
+
## Output
|
|
27
|
+
|
|
28
|
+
- Goal
|
|
29
|
+
- Current status
|
|
30
|
+
- Completed
|
|
31
|
+
- Pending
|
|
32
|
+
- Blocked
|
|
33
|
+
- Verification evidence
|
|
34
|
+
- Next action
|
|
35
|
+
|
|
36
|
+
## Safety
|
|
37
|
+
|
|
38
|
+
- Do not bury blockers in prose.
|
|
39
|
+
- Do not claim verification that did not run.
|
|
40
|
+
- Do not include private raw transcript unless explicitly safe.
|
|
41
|
+
|
|
42
|
+
## Example
|
|
43
|
+
|
|
44
|
+
Hand off a release candidate with test results, pack output, remaining smoke commands, and known documentation risks.
|
|
@@ -0,0 +1,44 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: harvest
|
|
3
|
+
description: Extract reusable knowledge from finished loops.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# harvest skill
|
|
7
|
+
|
|
8
|
+
## When to use
|
|
9
|
+
|
|
10
|
+
Use after a task finishes or fails in a way that teaches a reusable workflow pattern.
|
|
11
|
+
|
|
12
|
+
## Inputs
|
|
13
|
+
|
|
14
|
+
- The shared core contract.
|
|
15
|
+
- A redacted task context.
|
|
16
|
+
- The relevant layer template.
|
|
17
|
+
- Any acceptance criteria or review findings.
|
|
18
|
+
|
|
19
|
+
## Process
|
|
20
|
+
|
|
21
|
+
1. Identify what changed the outcome.
|
|
22
|
+
2. Separate reusable knowledge, prompt fragments, decisions, and rule candidates.
|
|
23
|
+
3. Mark material that must stay case-specific.
|
|
24
|
+
4. Choose a storage target and future reuse trigger.
|
|
25
|
+
|
|
26
|
+
## Output
|
|
27
|
+
|
|
28
|
+
- Source task
|
|
29
|
+
- Reusable knowledge
|
|
30
|
+
- Reusable prompts
|
|
31
|
+
- Decision record
|
|
32
|
+
- Rule candidates
|
|
33
|
+
- Do not generalize
|
|
34
|
+
- Next reuse
|
|
35
|
+
|
|
36
|
+
## Safety
|
|
37
|
+
|
|
38
|
+
- Do not harvest private source material.
|
|
39
|
+
- Do not create a universal rule from one example.
|
|
40
|
+
- Do not keep clutter that has no future use.
|
|
41
|
+
|
|
42
|
+
## Example
|
|
43
|
+
|
|
44
|
+
Harvest the lesson that force overwrite needs backup evidence, while excluding the user's actual workspace path.
|
|
@@ -0,0 +1,42 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: mode-switch
|
|
3
|
+
description: Change collaboration mode with explicit boundaries.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# mode-switch skill
|
|
7
|
+
|
|
8
|
+
## When to use
|
|
9
|
+
|
|
10
|
+
Use when moving between planning, execution, review, handoff, harvest, or casual exploration in the same workflow.
|
|
11
|
+
|
|
12
|
+
## Inputs
|
|
13
|
+
|
|
14
|
+
- The shared core contract.
|
|
15
|
+
- A redacted task context.
|
|
16
|
+
- The relevant layer template.
|
|
17
|
+
- Any acceptance criteria or review findings.
|
|
18
|
+
|
|
19
|
+
## Process
|
|
20
|
+
|
|
21
|
+
1. Name the current mode and requested new mode.
|
|
22
|
+
2. Carry forward only relevant context.
|
|
23
|
+
3. State allowed and forbidden actions.
|
|
24
|
+
4. Define the first expected output in the new mode.
|
|
25
|
+
|
|
26
|
+
## Output
|
|
27
|
+
|
|
28
|
+
- Mode change
|
|
29
|
+
- Allowed actions
|
|
30
|
+
- Forbidden actions
|
|
31
|
+
- Context carried forward
|
|
32
|
+
- First output
|
|
33
|
+
|
|
34
|
+
## Safety
|
|
35
|
+
|
|
36
|
+
- Do not keep executing after switching to review.
|
|
37
|
+
- Do not assume old authorization survives a reset.
|
|
38
|
+
- Do not mix formal guard output with implementation output.
|
|
39
|
+
|
|
40
|
+
## Example
|
|
41
|
+
|
|
42
|
+
Switch from implementation to guard: stop editing, inspect diff, compare against acceptance, then report findings.
|