@soleri/forge 9.0.0 → 9.0.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/scaffold-filetree.js +15 -1
- package/dist/scaffold-filetree.js.map +1 -1
- package/dist/skills/brain-debrief/SKILL.md +66 -0
- package/dist/skills/brainstorming/SKILL.md +85 -0
- package/dist/skills/code-patrol/SKILL.md +86 -0
- package/dist/skills/context-resume/SKILL.md +92 -0
- package/dist/skills/executing-plans/SKILL.md +87 -0
- package/dist/skills/fix-and-learn/SKILL.md +98 -0
- package/dist/skills/health-check/SKILL.md +131 -0
- package/dist/skills/knowledge-harvest/SKILL.md +93 -0
- package/dist/skills/onboard-me/SKILL.md +118 -0
- package/dist/skills/retrospective/SKILL.md +95 -0
- package/dist/skills/second-opinion/SKILL.md +97 -0
- package/dist/skills/systematic-debugging/SKILL.md +99 -0
- package/dist/skills/test-driven-development/SKILL.md +91 -0
- package/dist/skills/vault-capture/SKILL.md +82 -0
- package/dist/skills/vault-navigator/SKILL.md +80 -0
- package/dist/skills/verification-before-completion/SKILL.md +82 -0
- package/dist/skills/writing-plans/SKILL.md +105 -0
- package/package.json +1 -1
- package/src/scaffold-filetree.ts +16 -1
- package/src/skills/brain-debrief/SKILL.md +66 -0
- package/src/skills/brainstorming/SKILL.md +85 -0
- package/src/skills/code-patrol/SKILL.md +86 -0
- package/src/skills/context-resume/SKILL.md +92 -0
- package/src/skills/executing-plans/SKILL.md +87 -0
- package/src/skills/fix-and-learn/SKILL.md +98 -0
- package/src/skills/health-check/SKILL.md +131 -0
- package/src/skills/knowledge-harvest/SKILL.md +93 -0
- package/src/skills/onboard-me/SKILL.md +118 -0
- package/src/skills/retrospective/SKILL.md +95 -0
- package/src/skills/second-opinion/SKILL.md +97 -0
- package/src/skills/systematic-debugging/SKILL.md +99 -0
- package/src/skills/test-driven-development/SKILL.md +91 -0
- package/src/skills/vault-capture/SKILL.md +82 -0
- package/src/skills/vault-navigator/SKILL.md +80 -0
- package/src/skills/verification-before-completion/SKILL.md +82 -0
- package/src/skills/writing-plans/SKILL.md +105 -0
- package/dist/skills/agent-dev.md +0 -122
- package/dist/skills/agent-guide.md +0 -110
- package/dist/skills/agent-persona.md +0 -66
- package/dist/skills/brain-debrief.md +0 -214
- package/dist/skills/brainstorming.md +0 -180
- package/dist/skills/code-patrol.md +0 -178
- package/dist/skills/context-resume.md +0 -146
- package/dist/skills/deliver-and-ship.md +0 -123
- package/dist/skills/env-setup.md +0 -151
- package/dist/skills/executing-plans.md +0 -216
- package/dist/skills/fix-and-learn.md +0 -167
- package/dist/skills/health-check.md +0 -231
- package/dist/skills/knowledge-harvest.md +0 -185
- package/dist/skills/onboard-me.md +0 -198
- package/dist/skills/retrospective.md +0 -205
- package/dist/skills/second-opinion.md +0 -149
- package/dist/skills/systematic-debugging.md +0 -241
- package/dist/skills/test-driven-development.md +0 -281
- package/dist/skills/vault-capture.md +0 -170
- package/dist/skills/vault-curate.md +0 -107
- package/dist/skills/vault-navigator.md +0 -140
- package/dist/skills/verification-before-completion.md +0 -182
- package/dist/skills/writing-plans.md +0 -215
- package/src/skills/agent-dev.md +0 -122
- package/src/skills/agent-guide.md +0 -110
- package/src/skills/agent-persona.md +0 -66
- package/src/skills/brain-debrief.md +0 -214
- package/src/skills/brainstorming.md +0 -180
- package/src/skills/code-patrol.md +0 -178
- package/src/skills/context-resume.md +0 -146
- package/src/skills/deliver-and-ship.md +0 -123
- package/src/skills/env-setup.md +0 -151
- package/src/skills/executing-plans.md +0 -216
- package/src/skills/fix-and-learn.md +0 -167
- package/src/skills/health-check.md +0 -231
- package/src/skills/knowledge-harvest.md +0 -185
- package/src/skills/onboard-me.md +0 -198
- package/src/skills/retrospective.md +0 -205
- package/src/skills/second-opinion.md +0 -149
- package/src/skills/systematic-debugging.md +0 -241
- package/src/skills/test-driven-development.md +0 -281
- package/src/skills/vault-capture.md +0 -170
- package/src/skills/vault-curate.md +0 -107
- package/src/skills/vault-navigator.md +0 -140
- package/src/skills/verification-before-completion.md +0 -182
- package/src/skills/writing-plans.md +0 -215
|
@@ -1,182 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: verification-before-completion
|
|
3
|
-
description: Use when about to claim work is complete, fixed, or passing, before committing or creating PRs - requires running verification commands and confirming output before making any success claims; evidence before assertions always
|
|
4
|
-
---
|
|
5
|
-
|
|
6
|
-
<!-- Adapted from superpowers (MIT License) -->
|
|
7
|
-
|
|
8
|
-
# Verification Before Completion
|
|
9
|
-
|
|
10
|
-
## Overview
|
|
11
|
-
|
|
12
|
-
Claiming work is complete without verification is dishonesty, not efficiency.
|
|
13
|
-
|
|
14
|
-
**Core principle:** Evidence before claims, always.
|
|
15
|
-
|
|
16
|
-
**Violating the letter of this rule is violating the spirit of this rule.**
|
|
17
|
-
|
|
18
|
-
## The Iron Law
|
|
19
|
-
|
|
20
|
-
```
|
|
21
|
-
NO COMPLETION CLAIMS WITHOUT FRESH VERIFICATION EVIDENCE
|
|
22
|
-
```
|
|
23
|
-
|
|
24
|
-
If you haven't run the verification command in this message, you cannot claim it passes.
|
|
25
|
-
|
|
26
|
-
## The Gate Function
|
|
27
|
-
|
|
28
|
-
```
|
|
29
|
-
BEFORE claiming any status or expressing satisfaction:
|
|
30
|
-
|
|
31
|
-
1. IDENTIFY: What command proves this claim?
|
|
32
|
-
2. RUN: Execute the FULL command (fresh, complete)
|
|
33
|
-
3. READ: Full output, check exit code, count failures
|
|
34
|
-
4. VERIFY: Does output confirm the claim?
|
|
35
|
-
- If NO: State actual status with evidence
|
|
36
|
-
- If YES: State claim WITH evidence
|
|
37
|
-
5. AGENT CHECK: Run system diagnostics
|
|
38
|
-
6. ONLY THEN: Make the claim
|
|
39
|
-
|
|
40
|
-
Skip any step = lying, not verifying
|
|
41
|
-
```
|
|
42
|
-
|
|
43
|
-
## Agent System Checks
|
|
44
|
-
|
|
45
|
-
After passing all verification commands, run system diagnostics:
|
|
46
|
-
|
|
47
|
-
### Health Check
|
|
48
|
-
|
|
49
|
-
```
|
|
50
|
-
YOUR_AGENT_core op:admin_health
|
|
51
|
-
```
|
|
52
|
-
|
|
53
|
-
Catches issues tests might miss — vault corruption, stale caches, configuration drift.
|
|
54
|
-
|
|
55
|
-
### Full Diagnostic
|
|
56
|
-
|
|
57
|
-
```
|
|
58
|
-
YOUR_AGENT_core op:admin_diagnostic
|
|
59
|
-
```
|
|
60
|
-
|
|
61
|
-
Comprehensive system check — module status, database integrity, cache health, configuration validity.
|
|
62
|
-
|
|
63
|
-
### Vault Analytics
|
|
64
|
-
|
|
65
|
-
```
|
|
66
|
-
YOUR_AGENT_core op:admin_vault_analytics
|
|
67
|
-
```
|
|
68
|
-
|
|
69
|
-
Verify knowledge quality metrics — are capture rates healthy? Any degradation?
|
|
70
|
-
|
|
71
|
-
If any check reports problems, address them before claiming completion.
|
|
72
|
-
|
|
73
|
-
## Common Failures
|
|
74
|
-
|
|
75
|
-
| Claim | Requires | Not Sufficient |
|
|
76
|
-
| --------------------- | ------------------------------- | ------------------------------ |
|
|
77
|
-
| Tests pass | Test command output: 0 failures | Previous run, "should pass" |
|
|
78
|
-
| Linter clean | Linter output: 0 errors | Partial check, extrapolation |
|
|
79
|
-
| Build succeeds | Build command: exit 0 | Linter passing, logs look good |
|
|
80
|
-
| Bug fixed | Test original symptom: passes | Code changed, assumed fixed |
|
|
81
|
-
| Regression test works | Red-green cycle verified | Test passes once |
|
|
82
|
-
| Agent completed | VCS diff shows changes | Agent reports "success" |
|
|
83
|
-
| Requirements met | Line-by-line checklist | Tests passing |
|
|
84
|
-
| Agent healthy | `admin_diagnostic` clean | "No errors in logs" |
|
|
85
|
-
|
|
86
|
-
## Red Flags - STOP
|
|
87
|
-
|
|
88
|
-
- Using "should", "probably", "seems to"
|
|
89
|
-
- Expressing satisfaction before verification ("Great!", "Perfect!", "Done!", etc.)
|
|
90
|
-
- About to commit/push/PR without verification
|
|
91
|
-
- Trusting agent success reports
|
|
92
|
-
- Relying on partial verification
|
|
93
|
-
- Thinking "just this once"
|
|
94
|
-
- Tired and wanting work over
|
|
95
|
-
- ANY wording implying success without having run verification
|
|
96
|
-
|
|
97
|
-
## Rationalization Prevention
|
|
98
|
-
|
|
99
|
-
| Excuse | Reality |
|
|
100
|
-
| --------------------------------------- | ---------------------- |
|
|
101
|
-
| "Should work now" | RUN the verification |
|
|
102
|
-
| "I'm confident" | Confidence ≠ evidence |
|
|
103
|
-
| "Just this once" | No exceptions |
|
|
104
|
-
| "Linter passed" | Linter ≠ compiler |
|
|
105
|
-
| "Agent said success" | Verify independently |
|
|
106
|
-
| "I'm tired" | Exhaustion ≠ excuse |
|
|
107
|
-
| "Partial check is enough" | Partial proves nothing |
|
|
108
|
-
| "Different words so rule doesn't apply" | Spirit over letter |
|
|
109
|
-
|
|
110
|
-
## Key Patterns
|
|
111
|
-
|
|
112
|
-
**Tests:**
|
|
113
|
-
|
|
114
|
-
```
|
|
115
|
-
[Run test command] [See: 34/34 pass] "All tests pass"
|
|
116
|
-
NOT: "Should pass now" / "Looks correct"
|
|
117
|
-
```
|
|
118
|
-
|
|
119
|
-
**Regression tests (TDD Red-Green):**
|
|
120
|
-
|
|
121
|
-
```
|
|
122
|
-
Write -> Run (pass) -> Revert fix -> Run (MUST FAIL) -> Restore -> Run (pass)
|
|
123
|
-
NOT: "I've written a regression test" (without red-green verification)
|
|
124
|
-
```
|
|
125
|
-
|
|
126
|
-
**Build:**
|
|
127
|
-
|
|
128
|
-
```
|
|
129
|
-
[Run build] [See: exit 0] "Build passes"
|
|
130
|
-
NOT: "Linter passed" (linter doesn't check compilation)
|
|
131
|
-
```
|
|
132
|
-
|
|
133
|
-
**Requirements:**
|
|
134
|
-
|
|
135
|
-
```
|
|
136
|
-
Re-read plan -> Create checklist -> Verify each -> Report gaps or completion
|
|
137
|
-
NOT: "Tests pass, phase complete"
|
|
138
|
-
```
|
|
139
|
-
|
|
140
|
-
**Agent delegation:**
|
|
141
|
-
|
|
142
|
-
```
|
|
143
|
-
Agent reports success -> Check VCS diff -> Verify changes -> Report actual state
|
|
144
|
-
NOT: Trust agent report
|
|
145
|
-
```
|
|
146
|
-
|
|
147
|
-
## After Verification — Capture Session
|
|
148
|
-
|
|
149
|
-
Once work is verified complete, capture a session summary so context persists:
|
|
150
|
-
|
|
151
|
-
```
|
|
152
|
-
YOUR_AGENT_core op:session_capture
|
|
153
|
-
params: {
|
|
154
|
-
summary: "<what was accomplished, files modified, key decisions>"
|
|
155
|
-
}
|
|
156
|
-
```
|
|
157
|
-
|
|
158
|
-
This ensures the next session has context about what was verified and completed.
|
|
159
|
-
|
|
160
|
-
## When To Apply
|
|
161
|
-
|
|
162
|
-
**ALWAYS before:**
|
|
163
|
-
|
|
164
|
-
- ANY variation of success/completion claims
|
|
165
|
-
- ANY expression of satisfaction
|
|
166
|
-
- ANY positive statement about work state
|
|
167
|
-
- Committing, PR creation, task completion
|
|
168
|
-
- Moving to next task
|
|
169
|
-
- Delegating to agents
|
|
170
|
-
|
|
171
|
-
## The Bottom Line
|
|
172
|
-
|
|
173
|
-
Run the command. Read the output. THEN claim the result. This is non-negotiable.
|
|
174
|
-
|
|
175
|
-
## Agent Tools Reference
|
|
176
|
-
|
|
177
|
-
| Op | When to Use |
|
|
178
|
-
| ----------------------- | ----------------------------------- |
|
|
179
|
-
| `admin_health` | Quick system health check |
|
|
180
|
-
| `admin_diagnostic` | Comprehensive system diagnostic |
|
|
181
|
-
| `admin_vault_analytics` | Knowledge quality metrics |
|
|
182
|
-
| `session_capture` | Persist verified completion context |
|
|
@@ -1,215 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: writing-plans
|
|
3
|
-
description: Use when you have a spec or requirements for a multi-step task, before touching code
|
|
4
|
-
---
|
|
5
|
-
|
|
6
|
-
<!-- Adapted from superpowers (MIT License) -->
|
|
7
|
-
|
|
8
|
-
# Writing Plans
|
|
9
|
-
|
|
10
|
-
## Overview
|
|
11
|
-
|
|
12
|
-
Write comprehensive implementation plans assuming the engineer has zero context for our codebase and questionable taste. Document everything they need to know: which files to touch for each task, code, testing, docs they might need to check, how to test it. Give them the whole plan as bite-sized tasks. DRY. YAGNI. TDD. Frequent commits.
|
|
13
|
-
|
|
14
|
-
Assume they are a skilled developer, but know almost nothing about our toolset or problem domain. Assume they don't know good test design very well.
|
|
15
|
-
|
|
16
|
-
**Announce at start:** "I'm using the writing-plans skill to create the implementation plan."
|
|
17
|
-
|
|
18
|
-
**Save plans to:** `docs/plans/YYYY-MM-DD-<feature-name>.md`
|
|
19
|
-
|
|
20
|
-
## Before Writing — Search First, Plan Second
|
|
21
|
-
|
|
22
|
-
**Never write a plan from scratch.** Always search for existing knowledge first.
|
|
23
|
-
|
|
24
|
-
### 1. Vault First
|
|
25
|
-
|
|
26
|
-
Check the vault for relevant implementation patterns:
|
|
27
|
-
|
|
28
|
-
```
|
|
29
|
-
YOUR_AGENT_core op:search_intelligent
|
|
30
|
-
params: { query: "<feature being planned>" }
|
|
31
|
-
```
|
|
32
|
-
|
|
33
|
-
Look for:
|
|
34
|
-
|
|
35
|
-
- **Implementation patterns** — proven approaches for similar features
|
|
36
|
-
- **Anti-patterns** — approaches that failed and should be avoided
|
|
37
|
-
- **Testing patterns** — how similar features were tested
|
|
38
|
-
|
|
39
|
-
Also check brain strengths for what's worked:
|
|
40
|
-
|
|
41
|
-
```
|
|
42
|
-
YOUR_AGENT_core op:brain_strengths
|
|
43
|
-
```
|
|
44
|
-
|
|
45
|
-
Browse related knowledge domains for additional context:
|
|
46
|
-
|
|
47
|
-
```
|
|
48
|
-
YOUR_AGENT_core op:vault_domains
|
|
49
|
-
YOUR_AGENT_core op:vault_tags
|
|
50
|
-
```
|
|
51
|
-
|
|
52
|
-
### 2. Web Search Second
|
|
53
|
-
|
|
54
|
-
If the vault doesn't have implementation guidance, search the web:
|
|
55
|
-
|
|
56
|
-
- **Libraries and tools** — is there a package that does this already?
|
|
57
|
-
- **Reference implementations** — how did other projects solve this?
|
|
58
|
-
- **API documentation** — official docs for libraries you'll use
|
|
59
|
-
- **Known issues** — pitfalls others ran into
|
|
60
|
-
|
|
61
|
-
### 3. Then Write the Plan
|
|
62
|
-
|
|
63
|
-
Incorporate vault insights and web findings into the plan. Reference specific vault entries and documentation links when they inform a step. A plan informed by existing knowledge is dramatically better than one written from first principles.
|
|
64
|
-
|
|
65
|
-
## Create a Tracked Plan
|
|
66
|
-
|
|
67
|
-
Use the agent's planning system to create a tracked, resumable plan:
|
|
68
|
-
|
|
69
|
-
```
|
|
70
|
-
YOUR_AGENT_core op:create_plan
|
|
71
|
-
params: {
|
|
72
|
-
objective: "<one-sentence goal>",
|
|
73
|
-
scope: { included: [...], excluded: [...] },
|
|
74
|
-
steps: [
|
|
75
|
-
{ title: "Step 1 title", description: "details" },
|
|
76
|
-
...
|
|
77
|
-
]
|
|
78
|
-
}
|
|
79
|
-
```
|
|
80
|
-
|
|
81
|
-
This makes the plan persistent across sessions — if context compacts or sessions change, the plan survives.
|
|
82
|
-
|
|
83
|
-
## Grade the Plan
|
|
84
|
-
|
|
85
|
-
After drafting, grade the plan for quality before presenting to the user:
|
|
86
|
-
|
|
87
|
-
```
|
|
88
|
-
YOUR_AGENT_core op:plan_grade
|
|
89
|
-
params: { planId: "<id from create_plan>" }
|
|
90
|
-
```
|
|
91
|
-
|
|
92
|
-
If the grade is below target, auto-improve:
|
|
93
|
-
|
|
94
|
-
```
|
|
95
|
-
YOUR_AGENT_core op:plan_auto_improve
|
|
96
|
-
params: { planId: "<id>" }
|
|
97
|
-
```
|
|
98
|
-
|
|
99
|
-
This iterates on the plan — filling gaps, adding missing test steps, clarifying ambiguous instructions. Repeat until the grade meets the target:
|
|
100
|
-
|
|
101
|
-
```
|
|
102
|
-
YOUR_AGENT_core op:plan_meets_grade
|
|
103
|
-
params: { planId: "<id>", targetGrade: "A" }
|
|
104
|
-
```
|
|
105
|
-
|
|
106
|
-
### Iterate on Drafts
|
|
107
|
-
|
|
108
|
-
For complex plans, iterate before finalizing:
|
|
109
|
-
|
|
110
|
-
```
|
|
111
|
-
YOUR_AGENT_core op:plan_iterate
|
|
112
|
-
params: { planId: "<id>", feedback: "<what needs improvement>" }
|
|
113
|
-
```
|
|
114
|
-
|
|
115
|
-
This creates a new version of the plan incorporating the feedback, preserving version history.
|
|
116
|
-
|
|
117
|
-
## Split into Tasks
|
|
118
|
-
|
|
119
|
-
Once the plan is approved, split it into trackable tasks:
|
|
120
|
-
|
|
121
|
-
```
|
|
122
|
-
YOUR_AGENT_core op:plan_split
|
|
123
|
-
params: { planId: "<id>" }
|
|
124
|
-
```
|
|
125
|
-
|
|
126
|
-
This generates individual tasks from the plan steps, ready for execution tracking.
|
|
127
|
-
|
|
128
|
-
## Bite-Sized Task Granularity
|
|
129
|
-
|
|
130
|
-
**Each step is one action (2-5 minutes):**
|
|
131
|
-
|
|
132
|
-
- "Write the failing test" - step
|
|
133
|
-
- "Run it to make sure it fails" - step
|
|
134
|
-
- "Implement the minimal code to make the test pass" - step
|
|
135
|
-
- "Run the tests and make sure they pass" - step
|
|
136
|
-
- "Commit" - step
|
|
137
|
-
|
|
138
|
-
## Plan Document Header
|
|
139
|
-
|
|
140
|
-
**Every plan MUST start with this header:**
|
|
141
|
-
|
|
142
|
-
```markdown
|
|
143
|
-
# [Feature Name] Implementation Plan
|
|
144
|
-
|
|
145
|
-
> **For Claude:** REQUIRED SUB-SKILL: Use executing-plans to implement this plan task-by-task.
|
|
146
|
-
|
|
147
|
-
**Goal:** [One sentence describing what this builds]
|
|
148
|
-
|
|
149
|
-
**Architecture:** [2-3 sentences about approach]
|
|
150
|
-
|
|
151
|
-
**Tech Stack:** [Key technologies/libraries]
|
|
152
|
-
|
|
153
|
-
---
|
|
154
|
-
```
|
|
155
|
-
|
|
156
|
-
## Task Structure
|
|
157
|
-
|
|
158
|
-
Each task uses this format:
|
|
159
|
-
|
|
160
|
-
- Files: Create / Modify / Test paths
|
|
161
|
-
- Step 1: Write the failing test (with code)
|
|
162
|
-
- Step 2: Run test to verify it fails (with expected output)
|
|
163
|
-
- Step 3: Write minimal implementation (with code)
|
|
164
|
-
- Step 4: Run test to verify it passes (with expected output)
|
|
165
|
-
- Step 5: Commit (with exact git commands)
|
|
166
|
-
|
|
167
|
-
## Remember
|
|
168
|
-
|
|
169
|
-
- Exact file paths always
|
|
170
|
-
- Complete code in plan (not "add validation")
|
|
171
|
-
- Exact commands with expected output
|
|
172
|
-
- DRY, YAGNI, TDD, frequent commits
|
|
173
|
-
|
|
174
|
-
## After Plan Approval
|
|
175
|
-
|
|
176
|
-
Once the user approves the plan, register it for tracking:
|
|
177
|
-
|
|
178
|
-
```
|
|
179
|
-
YOUR_AGENT_core op:approve_plan
|
|
180
|
-
params: { planId: "<id from create_plan>" }
|
|
181
|
-
```
|
|
182
|
-
|
|
183
|
-
Check plan stats for an overview:
|
|
184
|
-
|
|
185
|
-
```
|
|
186
|
-
YOUR_AGENT_core op:plan_stats
|
|
187
|
-
```
|
|
188
|
-
|
|
189
|
-
## Execution Handoff
|
|
190
|
-
|
|
191
|
-
After saving the plan, offer execution choice:
|
|
192
|
-
|
|
193
|
-
"Plan complete and saved to `docs/plans/<filename>.md`. Two execution options:
|
|
194
|
-
|
|
195
|
-
**1. Subagent-Driven (this session)** - I dispatch fresh subagent per task, review between tasks, fast iteration
|
|
196
|
-
|
|
197
|
-
**2. Parallel Session (separate)** - Open new session with executing-plans, batch execution with checkpoints
|
|
198
|
-
|
|
199
|
-
Which approach?"
|
|
200
|
-
|
|
201
|
-
## Agent Tools Reference
|
|
202
|
-
|
|
203
|
-
| Op | When to Use |
|
|
204
|
-
| ------------------------------ | -------------------------------------- |
|
|
205
|
-
| `search_intelligent` | Find relevant patterns before planning |
|
|
206
|
-
| `brain_strengths` | Check proven approaches |
|
|
207
|
-
| `vault_domains` / `vault_tags` | Browse knowledge landscape |
|
|
208
|
-
| `create_plan` | Create tracked, persistent plan |
|
|
209
|
-
| `plan_grade` | Grade plan quality |
|
|
210
|
-
| `plan_auto_improve` | Auto-fix plan weaknesses |
|
|
211
|
-
| `plan_meets_grade` | Verify grade target reached |
|
|
212
|
-
| `plan_iterate` | Iterate on draft with feedback |
|
|
213
|
-
| `plan_split` | Split plan into trackable tasks |
|
|
214
|
-
| `approve_plan` | Lock in approved plan |
|
|
215
|
-
| `plan_stats` | Overview of plan metrics |
|
package/src/skills/agent-dev.md
DELETED
|
@@ -1,122 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: agent-dev
|
|
3
|
-
description: >
|
|
4
|
-
Use when extending the agent itself — adding facades, tools, vault operations,
|
|
5
|
-
brain features, new skills, or modifying agent internals. Triggers on "add a facade",
|
|
6
|
-
"new tool", "extend vault", "add brain feature", "new skill", "add operation",
|
|
7
|
-
"extend agent", or when the work target is the agent's own codebase rather than
|
|
8
|
-
a project the agent assists with. Enforces vault-first knowledge gathering before
|
|
9
|
-
any code reading or planning.
|
|
10
|
-
---
|
|
11
|
-
|
|
12
|
-
# Agent Dev — Vault-First Internal Development
|
|
13
|
-
|
|
14
|
-
Develop the agent's own internals with the vault as the primary source of truth. The vault knows more about the agent than any code scan or model training data. Always search the vault first, extract maximum context, and only then touch code.
|
|
15
|
-
|
|
16
|
-
## When to Use
|
|
17
|
-
|
|
18
|
-
Any time the work target is the agent's own codebase: adding tools, extending facades, modifying vault operations, brain features, skills, or transport. Not for projects that merely *use* the agent.
|
|
19
|
-
|
|
20
|
-
## Core Principle
|
|
21
|
-
|
|
22
|
-
**Vault first. Before code. Before training data. Always.**
|
|
23
|
-
|
|
24
|
-
The vault is the authoritative source for how the agent works. Do not rely on general knowledge from training data — it is outdated and lacks project-specific decisions. Do not scan the codebase to understand architecture — the vault already has it.
|
|
25
|
-
|
|
26
|
-
## Orchestration Sequence
|
|
27
|
-
|
|
28
|
-
### Step 1: Search the Vault (MANDATORY — before anything else)
|
|
29
|
-
|
|
30
|
-
Before reading any source file, before making any plan, before offering any advice:
|
|
31
|
-
|
|
32
|
-
```
|
|
33
|
-
YOUR_AGENT_core op:search_vault_intelligent
|
|
34
|
-
params: { query: "<description of planned work>", options: { intent: "pattern" } }
|
|
35
|
-
```
|
|
36
|
-
|
|
37
|
-
Search again with architecture-specific terms: the facade name, tool name, or subsystem being modified.
|
|
38
|
-
|
|
39
|
-
```
|
|
40
|
-
YOUR_AGENT_core op:query_vault_knowledge
|
|
41
|
-
params: { type: "workflow", category: "<relevant category>" }
|
|
42
|
-
```
|
|
43
|
-
|
|
44
|
-
If initial results are sparse, search again with broader terms — synonyms, related subsystem names, parent concepts. Exhaust the vault before moving on.
|
|
45
|
-
|
|
46
|
-
Review all results. Extract file paths, module names, function references, conventions, and constraints. These become the foundation for every step that follows.
|
|
47
|
-
|
|
48
|
-
### Step 2: Check Brain for Proven Patterns
|
|
49
|
-
|
|
50
|
-
```
|
|
51
|
-
YOUR_AGENT_core op:strengths
|
|
52
|
-
params: { days: 30, minStrength: 60 }
|
|
53
|
-
```
|
|
54
|
-
|
|
55
|
-
```
|
|
56
|
-
YOUR_AGENT_core op:recommend
|
|
57
|
-
params: { projectPath: "." }
|
|
58
|
-
```
|
|
59
|
-
|
|
60
|
-
Check if the brain has learned anything relevant from recent sessions.
|
|
61
|
-
|
|
62
|
-
### Step 3: Targeted Code Reading (Only What Vault Pointed To)
|
|
63
|
-
|
|
64
|
-
By now the vault has provided architecture context, file paths, and module references. Only read code when the vault describes the subsystem but lacks implementation detail (e.g., method signatures, exact line numbers).
|
|
65
|
-
|
|
66
|
-
**Read only what the vault pointed to.** Open the specific files referenced in vault results — not the surrounding codebase, not the parent directory, not "let me explore the project structure."
|
|
67
|
-
|
|
68
|
-
**Fallback: Codebase scan.** Only when vault search returned zero relevant results for the subsystem — meaning the vault genuinely has no knowledge about it — fall back to `Grep` with targeted terms. This is the last resort, not the default.
|
|
69
|
-
|
|
70
|
-
### Step 4: Plan with Vault Context
|
|
71
|
-
|
|
72
|
-
Create the implementation plan referencing vault findings explicitly:
|
|
73
|
-
|
|
74
|
-
- Which patterns apply (cite vault entry titles)
|
|
75
|
-
- Which anti-patterns to avoid (cite the specific anti-pattern)
|
|
76
|
-
- Which conventions to follow (naming, facade structure, tool registration)
|
|
77
|
-
|
|
78
|
-
Every plan must trace its decisions back to vault knowledge. If a decision has no vault backing, flag it as a new architectural choice that should be captured after implementation (Step 7).
|
|
79
|
-
|
|
80
|
-
### Step 5: Implement
|
|
81
|
-
|
|
82
|
-
Follow the plan. Key conventions for agent internals:
|
|
83
|
-
|
|
84
|
-
- **Facades**: Thin routing layer — delegate to domain modules. No business logic in facades.
|
|
85
|
-
- **Tools**: Follow `op:operation_name` naming, return structured responses.
|
|
86
|
-
- **Vault writes**: All writes go through the vault intelligence layer.
|
|
87
|
-
- **Tests**: Colocated test files. Run with vitest.
|
|
88
|
-
- **Build**: Must compile without errors before considering done.
|
|
89
|
-
|
|
90
|
-
### Step 6: Validate and Self-Correct
|
|
91
|
-
|
|
92
|
-
Run the relevant test suite. Rebuild — must complete without errors.
|
|
93
|
-
|
|
94
|
-
**Self-correction loop:** If tests fail or build breaks, do NOT ask the user what to do. Read the error, trace the cause in the code just written, fix it, and re-run. Repeat until green. The agent owns the code it wrote — if something fails, the agent fixes its own implementation. Only escalate to the user when the failure is outside the agent's control (missing infrastructure, permissions, unclear requirements).
|
|
95
|
-
|
|
96
|
-
### Step 7: Capture What Was Learned
|
|
97
|
-
|
|
98
|
-
If this work revealed new architectural knowledge, a useful pattern, or a surprising anti-pattern:
|
|
99
|
-
|
|
100
|
-
```
|
|
101
|
-
YOUR_AGENT_core op:capture_knowledge
|
|
102
|
-
params: {
|
|
103
|
-
title: "<what was learned>",
|
|
104
|
-
description: "<the pattern or anti-pattern>",
|
|
105
|
-
type: "pattern",
|
|
106
|
-
tags: ["<relevant-tags>"]
|
|
107
|
-
}
|
|
108
|
-
```
|
|
109
|
-
|
|
110
|
-
This ensures future sessions benefit from today's discovery — making the vault smarter for the next developer.
|
|
111
|
-
|
|
112
|
-
## Anti-Patterns to Avoid
|
|
113
|
-
|
|
114
|
-
- **Code-first exploration**: Reading source files before searching the vault. The vault already has the architecture — scanning code is slower and gives less context.
|
|
115
|
-
- **Training-data advice**: Offering general guidance from model training data instead of searching the vault for project-specific knowledge.
|
|
116
|
-
- **Skipping vault search**: The vault contains all architecture knowledge. Not searching it means reinventing knowledge that already exists.
|
|
117
|
-
- **Planning without vault context**: Plans created without vault knowledge miss conventions, duplicate existing patterns, or violate architectural boundaries.
|
|
118
|
-
- **Broad codebase scanning**: Exploring directories and reading files "to understand the project" instead of using vault results as a targeted map.
|
|
119
|
-
|
|
120
|
-
## Exit Criteria
|
|
121
|
-
|
|
122
|
-
Development is complete when: vault was searched exhaustively first (Step 1), implementation follows discovered patterns, tests pass, build succeeds, and any new learning is captured back to vault (Step 7).
|
|
@@ -1,110 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: agent-guide
|
|
3
|
-
description: >
|
|
4
|
-
Use when the user asks "what can you do", "help me", "how do I use this",
|
|
5
|
-
"what features do you have", "what tools are available", "how does this work",
|
|
6
|
-
"show me your capabilities", "what are you", "who are you", or any question
|
|
7
|
-
about the agent's identity, capabilities, available tools, or how to use them.
|
|
8
|
-
Not needed for proactive tool suggestions — those are handled by engine rules.
|
|
9
|
-
---
|
|
10
|
-
|
|
11
|
-
# Agent Guide — Capability Discovery
|
|
12
|
-
|
|
13
|
-
Help users understand what this agent can do, how to use it effectively, and what makes it different from a raw LLM. This skill handles the deep discovery flow — proactive tool suggestions during normal work are handled by the engine rules (Tool Advocacy section).
|
|
14
|
-
|
|
15
|
-
## When to Use
|
|
16
|
-
|
|
17
|
-
- "What can you do?" / "What are your capabilities?"
|
|
18
|
-
- "How do I search for X?" / "How do I capture knowledge?"
|
|
19
|
-
- "What tools do you have?" / "Show me your features"
|
|
20
|
-
- "Who are you?" / "What is this agent?"
|
|
21
|
-
- "Help" / "I'm stuck" / "How does this work?"
|
|
22
|
-
- First-time users, onboarding, or anyone unfamiliar with the agent
|
|
23
|
-
|
|
24
|
-
## Capability Discovery Sequence
|
|
25
|
-
|
|
26
|
-
### Step 1: Identity
|
|
27
|
-
|
|
28
|
-
```
|
|
29
|
-
YOUR_AGENT_core op:activate
|
|
30
|
-
params: { projectPath: "." }
|
|
31
|
-
```
|
|
32
|
-
|
|
33
|
-
This returns the agent's persona: name, role, description, tone, principles, and domains. Present the identity first — who the agent is and what it specializes in.
|
|
34
|
-
|
|
35
|
-
### Step 2: Health & Status
|
|
36
|
-
|
|
37
|
-
```
|
|
38
|
-
YOUR_AGENT_core op:admin_health
|
|
39
|
-
```
|
|
40
|
-
|
|
41
|
-
Shows what subsystems are active: vault (how many entries), brain (vocabulary size), LLM availability. This tells the user what the agent currently has to work with.
|
|
42
|
-
|
|
43
|
-
### Step 3: Available Tools
|
|
44
|
-
|
|
45
|
-
```
|
|
46
|
-
YOUR_AGENT_core op:admin_tool_list
|
|
47
|
-
```
|
|
48
|
-
|
|
49
|
-
Lists all facades and operations. Present them grouped by category with plain-language descriptions.
|
|
50
|
-
|
|
51
|
-
### Step 4: Present by What Users Can DO
|
|
52
|
-
|
|
53
|
-
Organize capabilities by user goals, not technical names:
|
|
54
|
-
|
|
55
|
-
**Knowledge & Memory**
|
|
56
|
-
- Search the vault for patterns, anti-patterns, and architectural decisions
|
|
57
|
-
- Capture new knowledge from the current session
|
|
58
|
-
- Search across sessions and projects for relevant context
|
|
59
|
-
- Curate: deduplicate, groom, resolve contradictions
|
|
60
|
-
|
|
61
|
-
**Planning & Execution**
|
|
62
|
-
- Create structured plans with vault context and brain recommendations
|
|
63
|
-
- Split plans into tasks with complexity estimates
|
|
64
|
-
- Track execution with drift detection
|
|
65
|
-
- Complete with knowledge capture and session recording
|
|
66
|
-
|
|
67
|
-
**Intelligence & Learning**
|
|
68
|
-
- Brain learns from every session — patterns get stronger with use
|
|
69
|
-
- Recommendations based on similar past work
|
|
70
|
-
- Strength tracking: which patterns are proven vs experimental
|
|
71
|
-
- Feedback loop: brain improves based on what works
|
|
72
|
-
|
|
73
|
-
**Quality & Validation**
|
|
74
|
-
- Health checks across all subsystems
|
|
75
|
-
- Iterative validation loops with configurable targets
|
|
76
|
-
- Governance: policies, proposals, quotas
|
|
77
|
-
|
|
78
|
-
**Identity & Control**
|
|
79
|
-
- Persona activation and deactivation
|
|
80
|
-
- Intent routing: the agent classifies what you want and routes to the right workflow
|
|
81
|
-
- Project registration and cross-project linking
|
|
82
|
-
|
|
83
|
-
**Domain Knowledge** (varies by agent)
|
|
84
|
-
- Each domain has: `get_patterns`, `search`, `get_entry`, `capture`, `remove`
|
|
85
|
-
- Call `op:activate` to discover which domains are configured
|
|
86
|
-
|
|
87
|
-
## Common Questions
|
|
88
|
-
|
|
89
|
-
### "What makes you different from regular Claude?"
|
|
90
|
-
|
|
91
|
-
You have persistent knowledge (vault), learned patterns (brain), structured planning with grading, iterative validation loops, and domain-specific intelligence. Regular Claude starts fresh every conversation — this agent accumulates knowledge and gets smarter over time.
|
|
92
|
-
|
|
93
|
-
### "How do I get the most out of you?"
|
|
94
|
-
|
|
95
|
-
1. **Use the vault** — search before deciding, capture after learning
|
|
96
|
-
2. **Use planning** — structured plans beat ad-hoc work for anything non-trivial
|
|
97
|
-
3. **Trust the brain** — pattern recommendations come from real usage data
|
|
98
|
-
4. **Capture everything** — every bug fix, every pattern, every anti-pattern. The vault grows smarter with use.
|
|
99
|
-
5. **Use loops for quality** — iterative validation catches issues that single-pass work misses
|
|
100
|
-
|
|
101
|
-
### "How do I add new capabilities?"
|
|
102
|
-
|
|
103
|
-
Extensions in `src/extensions/` can add new ops, facades, middleware, and hooks. Domain packs add domain-specific knowledge and validation.
|
|
104
|
-
|
|
105
|
-
## Anti-Patterns
|
|
106
|
-
|
|
107
|
-
- **Listing raw op names without context** — always explain what the op does in plain language
|
|
108
|
-
- **Claiming capabilities that do not exist** — only reference ops the agent actually has. When unsure, call `op:admin_tool_list` first
|
|
109
|
-
- **Dumping the entire tool catalog** — answer the specific question, show relevant tools, not all tools
|
|
110
|
-
- **Repeating what the user already knows** — if they ask about a specific feature, answer that, don't give the full tour
|
|
@@ -1,66 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: agent-persona
|
|
3
|
-
description: >
|
|
4
|
-
Use when the user activates the agent's persona via its greeting phrase, or says
|
|
5
|
-
"activate persona", "be yourself", "stay in character", or any activation phrase
|
|
6
|
-
defined in the agent's persona configuration. Reinforces character persistence
|
|
7
|
-
through the session and survives context compaction.
|
|
8
|
-
---
|
|
9
|
-
|
|
10
|
-
# Agent Persona — Stay in Character
|
|
11
|
-
|
|
12
|
-
This skill reinforces persona persistence. The MCP activation loads the runtime payload — this skill ensures the character sticks across the full session, including after context compaction.
|
|
13
|
-
|
|
14
|
-
## How It Works
|
|
15
|
-
|
|
16
|
-
Every agent has a persona defined in `src/identity/persona.ts` (or `src/activation/persona.ts` in older agents). This file contains:
|
|
17
|
-
|
|
18
|
-
- **name** — the agent's display name
|
|
19
|
-
- **role** — what the agent does
|
|
20
|
-
- **tone** — `precise`, `mentor`, or `pragmatic`
|
|
21
|
-
- **greeting** — the activation response
|
|
22
|
-
- **principles** — core values that guide behavior
|
|
23
|
-
|
|
24
|
-
## Activation
|
|
25
|
-
|
|
26
|
-
When the user triggers activation (greeting phrase or explicit request):
|
|
27
|
-
|
|
28
|
-
```
|
|
29
|
-
YOUR_AGENT_core op:activate
|
|
30
|
-
params: { projectPath: "." }
|
|
31
|
-
```
|
|
32
|
-
|
|
33
|
-
The activation response contains the full persona payload. Adopt it immediately.
|
|
34
|
-
|
|
35
|
-
## Rules
|
|
36
|
-
|
|
37
|
-
1. **Stay in character for EVERY response** until the user explicitly deactivates
|
|
38
|
-
2. **Technical accuracy is the priority** — persona is the wrapper, not a replacement for correctness
|
|
39
|
-
3. **Tone consistency** — match the configured tone (`precise` = concise and exact, `mentor` = educational and encouraging, `pragmatic` = direct and practical)
|
|
40
|
-
4. If character drifts after context compaction, the persona information in the compacted summary should restore it — follow it
|
|
41
|
-
|
|
42
|
-
## Context Compaction Survival
|
|
43
|
-
|
|
44
|
-
Long sessions trigger context compaction. To survive:
|
|
45
|
-
|
|
46
|
-
- The persona activation state is included in compaction summaries
|
|
47
|
-
- After compaction, check if persona was active and re-adopt the character
|
|
48
|
-
- Never break character just because the conversation was compacted
|
|
49
|
-
|
|
50
|
-
## Deactivation
|
|
51
|
-
|
|
52
|
-
When the user says "deactivate", "stop persona", "be normal", or uses the agent's deactivation phrase:
|
|
53
|
-
|
|
54
|
-
```
|
|
55
|
-
YOUR_AGENT_core op:activate
|
|
56
|
-
params: { deactivate: true }
|
|
57
|
-
```
|
|
58
|
-
|
|
59
|
-
Return to neutral assistant mode.
|
|
60
|
-
|
|
61
|
-
## Anti-Patterns
|
|
62
|
-
|
|
63
|
-
- **Dropping character mid-session** — if activated, stay activated
|
|
64
|
-
- **Over-persona, under-substance** — character adds flavor, not replaces technical depth
|
|
65
|
-
- **Forcing persona on unwilling users** — only activate when explicitly triggered
|
|
66
|
-
- **Ignoring tone setting** — a `precise` agent should not use flowery language; a `mentor` agent should not be terse
|