@soleri/forge 9.0.0 → 9.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (110) hide show
  1. package/dist/compose-claude-md.js +1 -1
  2. package/dist/compose-claude-md.js.map +1 -1
  3. package/dist/scaffold-filetree.js +10 -1
  4. package/dist/scaffold-filetree.js.map +1 -1
  5. package/dist/skills/brain-debrief/SKILL.md +70 -0
  6. package/dist/skills/brainstorming/SKILL.md +89 -0
  7. package/dist/skills/code-patrol/SKILL.md +86 -0
  8. package/dist/skills/context-resume/SKILL.md +95 -0
  9. package/dist/skills/deep-review/SKILL.md +186 -0
  10. package/dist/skills/executing-plans/SKILL.md +91 -0
  11. package/dist/skills/fix-and-learn/SKILL.md +102 -0
  12. package/dist/skills/health-check/SKILL.md +135 -0
  13. package/dist/skills/knowledge-harvest/SKILL.md +97 -0
  14. package/dist/skills/onboard-me/SKILL.md +121 -0
  15. package/dist/skills/parallel-execute/SKILL.md +277 -0
  16. package/dist/skills/retrospective/SKILL.md +98 -0
  17. package/dist/skills/second-opinion/SKILL.md +97 -0
  18. package/dist/skills/systematic-debugging/SKILL.md +103 -0
  19. package/dist/skills/test-driven-development/SKILL.md +91 -0
  20. package/dist/skills/vault-capture/SKILL.md +86 -0
  21. package/dist/skills/vault-navigator/SKILL.md +84 -0
  22. package/dist/skills/vault-smells/SKILL.md +251 -0
  23. package/dist/skills/verification-before-completion/SKILL.md +85 -0
  24. package/dist/skills/writing-plans/SKILL.md +108 -0
  25. package/dist/templates/agents-md.js +2 -2
  26. package/dist/templates/agents-md.js.map +1 -1
  27. package/dist/templates/claude-md-template.js +2 -2
  28. package/dist/templates/claude-md-template.js.map +1 -1
  29. package/dist/templates/shared-rules.js +35 -1
  30. package/dist/templates/shared-rules.js.map +1 -1
  31. package/dist/templates/skills.js +28 -6
  32. package/dist/templates/skills.js.map +1 -1
  33. package/dist/types.d.ts +1 -1
  34. package/dist/types.js +6 -2
  35. package/dist/types.js.map +1 -1
  36. package/package.json +1 -1
  37. package/src/__tests__/scaffolder.test.ts +17 -35
  38. package/src/compose-claude-md.ts +1 -1
  39. package/src/scaffold-filetree.ts +12 -1
  40. package/src/skills/brain-debrief/SKILL.md +70 -0
  41. package/src/skills/brainstorming/SKILL.md +89 -0
  42. package/src/skills/code-patrol/SKILL.md +86 -0
  43. package/src/skills/context-resume/SKILL.md +95 -0
  44. package/src/skills/deep-review/SKILL.md +186 -0
  45. package/src/skills/executing-plans/SKILL.md +91 -0
  46. package/src/skills/fix-and-learn/SKILL.md +102 -0
  47. package/src/skills/health-check/SKILL.md +135 -0
  48. package/src/skills/knowledge-harvest/SKILL.md +97 -0
  49. package/src/skills/onboard-me/SKILL.md +121 -0
  50. package/src/skills/parallel-execute/SKILL.md +277 -0
  51. package/src/skills/retrospective/SKILL.md +98 -0
  52. package/src/skills/second-opinion/SKILL.md +97 -0
  53. package/src/skills/systematic-debugging/SKILL.md +103 -0
  54. package/src/skills/test-driven-development/SKILL.md +91 -0
  55. package/src/skills/vault-capture/SKILL.md +86 -0
  56. package/src/skills/vault-navigator/SKILL.md +84 -0
  57. package/src/skills/vault-smells/SKILL.md +251 -0
  58. package/src/skills/verification-before-completion/SKILL.md +85 -0
  59. package/src/skills/writing-plans/SKILL.md +108 -0
  60. package/src/templates/agents-md.ts +2 -2
  61. package/src/templates/claude-md-template.ts +2 -2
  62. package/src/templates/shared-rules.ts +36 -1
  63. package/src/templates/skills.ts +27 -6
  64. package/src/types.ts +7 -2
  65. package/dist/skills/agent-dev.md +0 -122
  66. package/dist/skills/agent-guide.md +0 -110
  67. package/dist/skills/agent-persona.md +0 -66
  68. package/dist/skills/brain-debrief.md +0 -214
  69. package/dist/skills/brainstorming.md +0 -180
  70. package/dist/skills/code-patrol.md +0 -178
  71. package/dist/skills/context-resume.md +0 -146
  72. package/dist/skills/deliver-and-ship.md +0 -123
  73. package/dist/skills/env-setup.md +0 -151
  74. package/dist/skills/executing-plans.md +0 -216
  75. package/dist/skills/fix-and-learn.md +0 -167
  76. package/dist/skills/health-check.md +0 -231
  77. package/dist/skills/knowledge-harvest.md +0 -185
  78. package/dist/skills/onboard-me.md +0 -198
  79. package/dist/skills/retrospective.md +0 -205
  80. package/dist/skills/second-opinion.md +0 -149
  81. package/dist/skills/systematic-debugging.md +0 -241
  82. package/dist/skills/test-driven-development.md +0 -281
  83. package/dist/skills/vault-capture.md +0 -170
  84. package/dist/skills/vault-curate.md +0 -107
  85. package/dist/skills/vault-navigator.md +0 -140
  86. package/dist/skills/verification-before-completion.md +0 -182
  87. package/dist/skills/writing-plans.md +0 -215
  88. package/src/skills/agent-dev.md +0 -122
  89. package/src/skills/agent-guide.md +0 -110
  90. package/src/skills/agent-persona.md +0 -66
  91. package/src/skills/brain-debrief.md +0 -214
  92. package/src/skills/brainstorming.md +0 -180
  93. package/src/skills/code-patrol.md +0 -178
  94. package/src/skills/context-resume.md +0 -146
  95. package/src/skills/deliver-and-ship.md +0 -123
  96. package/src/skills/env-setup.md +0 -151
  97. package/src/skills/executing-plans.md +0 -216
  98. package/src/skills/fix-and-learn.md +0 -167
  99. package/src/skills/health-check.md +0 -231
  100. package/src/skills/knowledge-harvest.md +0 -185
  101. package/src/skills/onboard-me.md +0 -198
  102. package/src/skills/retrospective.md +0 -205
  103. package/src/skills/second-opinion.md +0 -149
  104. package/src/skills/systematic-debugging.md +0 -241
  105. package/src/skills/test-driven-development.md +0 -281
  106. package/src/skills/vault-capture.md +0 -170
  107. package/src/skills/vault-curate.md +0 -107
  108. package/src/skills/vault-navigator.md +0 -140
  109. package/src/skills/verification-before-completion.md +0 -182
  110. package/src/skills/writing-plans.md +0 -215
@@ -1,107 +0,0 @@
1
- ---
2
- name: vault-curate
3
- description: >
4
- Use when the user says "clean vault", "deduplicate", "groom knowledge",
5
- "consolidate vault", "vault maintenance", "find duplicates", "merge patterns",
6
- "check contradictions", "vault health", or wants to maintain, clean, reorganize,
7
- or improve the quality of the agent's knowledge base.
8
- ---
9
-
10
- # Vault Curate — Knowledge Maintenance
11
-
12
- Maintain vault quality through deduplication, grooming, contradiction detection, and consolidation. A well-curated vault produces better search results and brain recommendations.
13
-
14
- ## When to Use
15
-
16
- Periodically (weekly or after heavy capture sessions), when search quality degrades, when vault health shows warnings, or when the user explicitly requests maintenance.
17
-
18
- ## Orchestration Sequence
19
-
20
- ### Step 1: Health Assessment
21
-
22
- ```
23
- YOUR_AGENT_core op:knowledge_health
24
- ```
25
-
26
- ```
27
- YOUR_AGENT_core op:get_vault_analytics
28
- ```
29
-
30
- Present the health summary to the user before proceeding: total entries, quality scores, staleness, coverage gaps.
31
-
32
- ### Step 2: Detect Duplicates
33
-
34
- ```
35
- YOUR_AGENT_core op:curator_detect_duplicates
36
- ```
37
-
38
- This finds entries with overlapping titles, descriptions, or content. Review the duplicate pairs — some may be intentional (different contexts) while others are true duplicates.
39
-
40
- For true duplicates:
41
-
42
- ```
43
- YOUR_AGENT_core op:merge_patterns
44
- params: { patternIds: ["<id1>", "<id2>"] }
45
- ```
46
-
47
- Preserve the best content from each.
48
-
49
- ### Step 3: Find Contradictions
50
-
51
- ```
52
- YOUR_AGENT_core op:curator_contradictions
53
- ```
54
-
55
- Contradictions erode trust in vault search results. For each contradiction: decide which entry is correct (check dates, context, evidence), then archive or update the incorrect one.
56
-
57
- ### Step 4: Groom Entries
58
-
59
- ```
60
- YOUR_AGENT_core op:curator_groom_all
61
- ```
62
-
63
- Runs tag enrichment and metadata cleanup across all entries. This improves searchability and categorization.
64
-
65
- For targeted grooming of specific entries:
66
-
67
- ```
68
- YOUR_AGENT_core op:curator_groom
69
- params: { entryIds: ["<id>"], tags: ["<tag>"] }
70
- ```
71
-
72
- ### Step 5: GPT Enrichment (Optional)
73
-
74
- ```
75
- YOUR_AGENT_core op:curator_gpt_enrich
76
- ```
77
-
78
- Adds AI-generated metadata to entries that lack descriptions, examples, or context. Fills in gaps without changing the core content.
79
-
80
- ### Step 6: Full Consolidation
81
-
82
- ```
83
- YOUR_AGENT_core op:curator_consolidate
84
- ```
85
-
86
- Runs the complete pipeline: dedup + archive stale entries + resolve contradictions. This is the heavy-duty cleanup.
87
-
88
- ### Step 7: Knowledge Reorganization
89
-
90
- ```
91
- YOUR_AGENT_core op:knowledge_reorganize
92
- params: { mode: "preview" }
93
- ```
94
-
95
- Preview first, then run again with `mode: "apply"` if the preview looks good.
96
-
97
- ### Step 8: Verify Results
98
-
99
- ```
100
- YOUR_AGENT_core op:knowledge_health
101
- ```
102
-
103
- Compare with Step 1 metrics. Vault health should improve: fewer duplicates, no contradictions, better coverage.
104
-
105
- ## Exit Criteria
106
-
107
- Curation is complete when: duplicates merged, contradictions resolved, entries groomed, and health metrics improved compared to Step 1 baseline.
@@ -1,140 +0,0 @@
1
- ---
2
- name: vault-navigator
3
- description: Use when the user asks "what does vault say", "search knowledge", "find pattern", "have we seen this before", "best practice for", "check vault", "vault search", "any patterns for", or wants to query the knowledge base for existing solutions or guidance.
4
- ---
5
-
6
- # Vault Navigator — Knowledge Oracle
7
-
8
- Navigate the vault intelligently. The vault has multiple search strategies — this skill picks the right one based on what the user needs.
9
-
10
- ## When to Use
11
-
12
- Any time the user wants to find existing knowledge before building something new. Also when asking about best practices, previous solutions, or patterns.
13
-
14
- ## Search Strategy Decision Tree
15
-
16
- ### For "Have we seen this before?" / "Best practice for X"
17
-
18
- Start with `YOUR_AGENT_core op:search_intelligent` — this is semantic search, the broadest and smartest query. Pass the user's question as the query.
19
-
20
- ```
21
- YOUR_AGENT_core op:search_intelligent
22
- params: { query: "<user's question>" }
23
- ```
24
-
25
- If results are weak (low scores or few matches), fall back to `YOUR_AGENT_core op:search` with explicit filters (type, category, tags, severity). This is structured search — narrower but more precise.
26
-
27
- ### For "Show me everything about X" (Exploration)
28
-
29
- Use tag-based and domain-based browsing for broader exploration:
30
-
31
- ```
32
- YOUR_AGENT_core op:vault_tags
33
- ```
34
-
35
- Lists all tags in the vault — helps discover what topics are covered.
36
-
37
- ```
38
- YOUR_AGENT_core op:vault_domains
39
- ```
40
-
41
- Lists all domains — shows the knowledge landscape at a glance.
42
-
43
- ```
44
- YOUR_AGENT_core op:vault_recent
45
- ```
46
-
47
- Shows recently added or modified entries — what's fresh in the vault.
48
-
49
- ### For "What's stale?" / "What needs updating?"
50
-
51
- Run an age report to find outdated knowledge:
52
-
53
- ```
54
- YOUR_AGENT_core op:vault_age_report
55
- ```
56
-
57
- Present entries that haven't been updated recently — these are candidates for review, refresh, or removal.
58
-
59
- ### For "What do other projects do?"
60
-
61
- Call `YOUR_AGENT_core op:memory_cross_project_search` with `crossProject: true`. This searches across all linked projects, not just the current one.
62
-
63
- ```
64
- YOUR_AGENT_core op:memory_cross_project_search
65
- params: { query: "<topic>", crossProject: true }
66
- ```
67
-
68
- Check what projects are linked:
69
-
70
- ```
71
- YOUR_AGENT_core op:project_linked_projects
72
- ```
73
-
74
- ### For "Has brain learned anything about X?"
75
-
76
- Call `YOUR_AGENT_core op:brain_strengths` to see which patterns have proven strength. Then call `YOUR_AGENT_core op:brain_global_patterns` with a domain or tag filter to find cross-project patterns.
77
-
78
- ```
79
- YOUR_AGENT_core op:brain_strengths
80
- YOUR_AGENT_core op:brain_global_patterns
81
- params: { domain: "<domain>" }
82
- ```
83
-
84
- ### For "What do I know about X?" (broad exploration)
85
-
86
- Chain multiple strategies for comprehensive results:
87
-
88
- 1. `search_intelligent` → semantic vault search
89
- 2. `vault_tags` / `vault_domains` → browse knowledge landscape
90
- 3. `memory_cross_project_search` → cross-project patterns
91
- 4. `brain_strengths` → proven patterns
92
-
93
- Present all findings with source labels so the user knows where each insight came from.
94
-
95
- ## Presenting Results
96
-
97
- Always include:
98
-
99
- - **Source**: Which search found it (vault, memory, brain, tags, domains)
100
- - **Confidence**: Score or strength rating
101
- - **Relevance**: Why this result matches the query
102
- - **Actionable next step**: How to apply this knowledge
103
-
104
- ## Fallback: Web Search
105
-
106
- If all vault strategies return no results, search the web for the user's question before saying "nothing found." The web may have:
107
-
108
- - Documentation, articles, or guides on the topic
109
- - Community patterns and best practices
110
- - Library-specific solutions
111
-
112
- If web search finds something useful, offer to capture it to the vault:
113
-
114
- ```
115
- YOUR_AGENT_core op:capture_quick
116
- params: {
117
- title: "<what was found>",
118
- description: "<summary from web search, source URL>"
119
- }
120
- ```
121
-
122
- ## Exit Criteria
123
-
124
- Search is complete when at least one search strategy has been tried and results presented. If no results found across all strategies (vault + web), say so explicitly — that's valuable information too (it means this is genuinely new territory worth exploring and capturing).
125
-
126
- ## Agent Tools Reference
127
-
128
- | Op | When to Use |
129
- | ----------------------------- | ----------------------------------------------------- |
130
- | `search_intelligent` | Default semantic search — broadest and smartest |
131
- | `search` | Structured search with filters (type, tags, category) |
132
- | `vault_tags` | Browse all tags — discover knowledge landscape |
133
- | `vault_domains` | Browse all domains — see what areas are covered |
134
- | `vault_recent` | Recently modified entries — what's fresh |
135
- | `vault_age_report` | Find stale entries needing refresh |
136
- | `memory_cross_project_search` | Search across linked projects |
137
- | `project_linked_projects` | See what projects are connected |
138
- | `brain_strengths` | Proven patterns ranked by success |
139
- | `brain_global_patterns` | Cross-project patterns from global pool |
140
- | `capture_quick` | Capture web findings to vault for next time |
@@ -1,182 +0,0 @@
1
- ---
2
- name: verification-before-completion
3
- description: Use when about to claim work is complete, fixed, or passing, before committing or creating PRs - requires running verification commands and confirming output before making any success claims; evidence before assertions always
4
- ---
5
-
6
- <!-- Adapted from superpowers (MIT License) -->
7
-
8
- # Verification Before Completion
9
-
10
- ## Overview
11
-
12
- Claiming work is complete without verification is dishonesty, not efficiency.
13
-
14
- **Core principle:** Evidence before claims, always.
15
-
16
- **Violating the letter of this rule is violating the spirit of this rule.**
17
-
18
- ## The Iron Law
19
-
20
- ```
21
- NO COMPLETION CLAIMS WITHOUT FRESH VERIFICATION EVIDENCE
22
- ```
23
-
24
- If you haven't run the verification command in this message, you cannot claim it passes.
25
-
26
- ## The Gate Function
27
-
28
- ```
29
- BEFORE claiming any status or expressing satisfaction:
30
-
31
- 1. IDENTIFY: What command proves this claim?
32
- 2. RUN: Execute the FULL command (fresh, complete)
33
- 3. READ: Full output, check exit code, count failures
34
- 4. VERIFY: Does output confirm the claim?
35
- - If NO: State actual status with evidence
36
- - If YES: State claim WITH evidence
37
- 5. AGENT CHECK: Run system diagnostics
38
- 6. ONLY THEN: Make the claim
39
-
40
- Skip any step = lying, not verifying
41
- ```
42
-
43
- ## Agent System Checks
44
-
45
- After passing all verification commands, run system diagnostics:
46
-
47
- ### Health Check
48
-
49
- ```
50
- YOUR_AGENT_core op:admin_health
51
- ```
52
-
53
- Catches issues tests might miss — vault corruption, stale caches, configuration drift.
54
-
55
- ### Full Diagnostic
56
-
57
- ```
58
- YOUR_AGENT_core op:admin_diagnostic
59
- ```
60
-
61
- Comprehensive system check — module status, database integrity, cache health, configuration validity.
62
-
63
- ### Vault Analytics
64
-
65
- ```
66
- YOUR_AGENT_core op:admin_vault_analytics
67
- ```
68
-
69
- Verify knowledge quality metrics — are capture rates healthy? Any degradation?
70
-
71
- If any check reports problems, address them before claiming completion.
72
-
73
- ## Common Failures
74
-
75
- | Claim | Requires | Not Sufficient |
76
- | --------------------- | ------------------------------- | ------------------------------ |
77
- | Tests pass | Test command output: 0 failures | Previous run, "should pass" |
78
- | Linter clean | Linter output: 0 errors | Partial check, extrapolation |
79
- | Build succeeds | Build command: exit 0 | Linter passing, logs look good |
80
- | Bug fixed | Test original symptom: passes | Code changed, assumed fixed |
81
- | Regression test works | Red-green cycle verified | Test passes once |
82
- | Agent completed | VCS diff shows changes | Agent reports "success" |
83
- | Requirements met | Line-by-line checklist | Tests passing |
84
- | Agent healthy | `admin_diagnostic` clean | "No errors in logs" |
85
-
86
- ## Red Flags - STOP
87
-
88
- - Using "should", "probably", "seems to"
89
- - Expressing satisfaction before verification ("Great!", "Perfect!", "Done!", etc.)
90
- - About to commit/push/PR without verification
91
- - Trusting agent success reports
92
- - Relying on partial verification
93
- - Thinking "just this once"
94
- - Tired and wanting work over
95
- - ANY wording implying success without having run verification
96
-
97
- ## Rationalization Prevention
98
-
99
- | Excuse | Reality |
100
- | --------------------------------------- | ---------------------- |
101
- | "Should work now" | RUN the verification |
102
- | "I'm confident" | Confidence ≠ evidence |
103
- | "Just this once" | No exceptions |
104
- | "Linter passed" | Linter ≠ compiler |
105
- | "Agent said success" | Verify independently |
106
- | "I'm tired" | Exhaustion ≠ excuse |
107
- | "Partial check is enough" | Partial proves nothing |
108
- | "Different words so rule doesn't apply" | Spirit over letter |
109
-
110
- ## Key Patterns
111
-
112
- **Tests:**
113
-
114
- ```
115
- [Run test command] [See: 34/34 pass] "All tests pass"
116
- NOT: "Should pass now" / "Looks correct"
117
- ```
118
-
119
- **Regression tests (TDD Red-Green):**
120
-
121
- ```
122
- Write -> Run (pass) -> Revert fix -> Run (MUST FAIL) -> Restore -> Run (pass)
123
- NOT: "I've written a regression test" (without red-green verification)
124
- ```
125
-
126
- **Build:**
127
-
128
- ```
129
- [Run build] [See: exit 0] "Build passes"
130
- NOT: "Linter passed" (linter doesn't check compilation)
131
- ```
132
-
133
- **Requirements:**
134
-
135
- ```
136
- Re-read plan -> Create checklist -> Verify each -> Report gaps or completion
137
- NOT: "Tests pass, phase complete"
138
- ```
139
-
140
- **Agent delegation:**
141
-
142
- ```
143
- Agent reports success -> Check VCS diff -> Verify changes -> Report actual state
144
- NOT: Trust agent report
145
- ```
146
-
147
- ## After Verification — Capture Session
148
-
149
- Once work is verified complete, capture a session summary so context persists:
150
-
151
- ```
152
- YOUR_AGENT_core op:session_capture
153
- params: {
154
- summary: "<what was accomplished, files modified, key decisions>"
155
- }
156
- ```
157
-
158
- This ensures the next session has context about what was verified and completed.
159
-
160
- ## When To Apply
161
-
162
- **ALWAYS before:**
163
-
164
- - ANY variation of success/completion claims
165
- - ANY expression of satisfaction
166
- - ANY positive statement about work state
167
- - Committing, PR creation, task completion
168
- - Moving to next task
169
- - Delegating to agents
170
-
171
- ## The Bottom Line
172
-
173
- Run the command. Read the output. THEN claim the result. This is non-negotiable.
174
-
175
- ## Agent Tools Reference
176
-
177
- | Op | When to Use |
178
- | ----------------------- | ----------------------------------- |
179
- | `admin_health` | Quick system health check |
180
- | `admin_diagnostic` | Comprehensive system diagnostic |
181
- | `admin_vault_analytics` | Knowledge quality metrics |
182
- | `session_capture` | Persist verified completion context |
@@ -1,215 +0,0 @@
1
- ---
2
- name: writing-plans
3
- description: Use when you have a spec or requirements for a multi-step task, before touching code
4
- ---
5
-
6
- <!-- Adapted from superpowers (MIT License) -->
7
-
8
- # Writing Plans
9
-
10
- ## Overview
11
-
12
- Write comprehensive implementation plans assuming the engineer has zero context for our codebase and questionable taste. Document everything they need to know: which files to touch for each task, code, testing, docs they might need to check, how to test it. Give them the whole plan as bite-sized tasks. DRY. YAGNI. TDD. Frequent commits.
13
-
14
- Assume they are a skilled developer, but know almost nothing about our toolset or problem domain. Assume they don't know good test design very well.
15
-
16
- **Announce at start:** "I'm using the writing-plans skill to create the implementation plan."
17
-
18
- **Save plans to:** `docs/plans/YYYY-MM-DD-<feature-name>.md`
19
-
20
- ## Before Writing — Search First, Plan Second
21
-
22
- **Never write a plan from scratch.** Always search for existing knowledge first.
23
-
24
- ### 1. Vault First
25
-
26
- Check the vault for relevant implementation patterns:
27
-
28
- ```
29
- YOUR_AGENT_core op:search_intelligent
30
- params: { query: "<feature being planned>" }
31
- ```
32
-
33
- Look for:
34
-
35
- - **Implementation patterns** — proven approaches for similar features
36
- - **Anti-patterns** — approaches that failed and should be avoided
37
- - **Testing patterns** — how similar features were tested
38
-
39
- Also check brain strengths for what's worked:
40
-
41
- ```
42
- YOUR_AGENT_core op:brain_strengths
43
- ```
44
-
45
- Browse related knowledge domains for additional context:
46
-
47
- ```
48
- YOUR_AGENT_core op:vault_domains
49
- YOUR_AGENT_core op:vault_tags
50
- ```
51
-
52
- ### 2. Web Search Second
53
-
54
- If the vault doesn't have implementation guidance, search the web:
55
-
56
- - **Libraries and tools** — is there a package that does this already?
57
- - **Reference implementations** — how did other projects solve this?
58
- - **API documentation** — official docs for libraries you'll use
59
- - **Known issues** — pitfalls others ran into
60
-
61
- ### 3. Then Write the Plan
62
-
63
- Incorporate vault insights and web findings into the plan. Reference specific vault entries and documentation links when they inform a step. A plan informed by existing knowledge is dramatically better than one written from first principles.
64
-
65
- ## Create a Tracked Plan
66
-
67
- Use the agent's planning system to create a tracked, resumable plan:
68
-
69
- ```
70
- YOUR_AGENT_core op:create_plan
71
- params: {
72
- objective: "<one-sentence goal>",
73
- scope: { included: [...], excluded: [...] },
74
- steps: [
75
- { title: "Step 1 title", description: "details" },
76
- ...
77
- ]
78
- }
79
- ```
80
-
81
- This makes the plan persistent across sessions — if context compacts or sessions change, the plan survives.
82
-
83
- ## Grade the Plan
84
-
85
- After drafting, grade the plan for quality before presenting to the user:
86
-
87
- ```
88
- YOUR_AGENT_core op:plan_grade
89
- params: { planId: "<id from create_plan>" }
90
- ```
91
-
92
- If the grade is below target, auto-improve:
93
-
94
- ```
95
- YOUR_AGENT_core op:plan_auto_improve
96
- params: { planId: "<id>" }
97
- ```
98
-
99
- This iterates on the plan — filling gaps, adding missing test steps, clarifying ambiguous instructions. Repeat until the grade meets the target:
100
-
101
- ```
102
- YOUR_AGENT_core op:plan_meets_grade
103
- params: { planId: "<id>", targetGrade: "A" }
104
- ```
105
-
106
- ### Iterate on Drafts
107
-
108
- For complex plans, iterate before finalizing:
109
-
110
- ```
111
- YOUR_AGENT_core op:plan_iterate
112
- params: { planId: "<id>", feedback: "<what needs improvement>" }
113
- ```
114
-
115
- This creates a new version of the plan incorporating the feedback, preserving version history.
116
-
117
- ## Split into Tasks
118
-
119
- Once the plan is approved, split it into trackable tasks:
120
-
121
- ```
122
- YOUR_AGENT_core op:plan_split
123
- params: { planId: "<id>" }
124
- ```
125
-
126
- This generates individual tasks from the plan steps, ready for execution tracking.
127
-
128
- ## Bite-Sized Task Granularity
129
-
130
- **Each step is one action (2-5 minutes):**
131
-
132
- - "Write the failing test" - step
133
- - "Run it to make sure it fails" - step
134
- - "Implement the minimal code to make the test pass" - step
135
- - "Run the tests and make sure they pass" - step
136
- - "Commit" - step
137
-
138
- ## Plan Document Header
139
-
140
- **Every plan MUST start with this header:**
141
-
142
- ```markdown
143
- # [Feature Name] Implementation Plan
144
-
145
- > **For Claude:** REQUIRED SUB-SKILL: Use executing-plans to implement this plan task-by-task.
146
-
147
- **Goal:** [One sentence describing what this builds]
148
-
149
- **Architecture:** [2-3 sentences about approach]
150
-
151
- **Tech Stack:** [Key technologies/libraries]
152
-
153
- ---
154
- ```
155
-
156
- ## Task Structure
157
-
158
- Each task uses this format:
159
-
160
- - Files: Create / Modify / Test paths
161
- - Step 1: Write the failing test (with code)
162
- - Step 2: Run test to verify it fails (with expected output)
163
- - Step 3: Write minimal implementation (with code)
164
- - Step 4: Run test to verify it passes (with expected output)
165
- - Step 5: Commit (with exact git commands)
166
-
167
- ## Remember
168
-
169
- - Exact file paths always
170
- - Complete code in plan (not "add validation")
171
- - Exact commands with expected output
172
- - DRY, YAGNI, TDD, frequent commits
173
-
174
- ## After Plan Approval
175
-
176
- Once the user approves the plan, register it for tracking:
177
-
178
- ```
179
- YOUR_AGENT_core op:approve_plan
180
- params: { planId: "<id from create_plan>" }
181
- ```
182
-
183
- Check plan stats for an overview:
184
-
185
- ```
186
- YOUR_AGENT_core op:plan_stats
187
- ```
188
-
189
- ## Execution Handoff
190
-
191
- After saving the plan, offer execution choice:
192
-
193
- "Plan complete and saved to `docs/plans/<filename>.md`. Two execution options:
194
-
195
- **1. Subagent-Driven (this session)** - I dispatch fresh subagent per task, review between tasks, fast iteration
196
-
197
- **2. Parallel Session (separate)** - Open new session with executing-plans, batch execution with checkpoints
198
-
199
- Which approach?"
200
-
201
- ## Agent Tools Reference
202
-
203
- | Op | When to Use |
204
- | ------------------------------ | -------------------------------------- |
205
- | `search_intelligent` | Find relevant patterns before planning |
206
- | `brain_strengths` | Check proven approaches |
207
- | `vault_domains` / `vault_tags` | Browse knowledge landscape |
208
- | `create_plan` | Create tracked, persistent plan |
209
- | `plan_grade` | Grade plan quality |
210
- | `plan_auto_improve` | Auto-fix plan weaknesses |
211
- | `plan_meets_grade` | Verify grade target reached |
212
- | `plan_iterate` | Iterate on draft with feedback |
213
- | `plan_split` | Split plan into trackable tasks |
214
- | `approve_plan` | Lock in approved plan |
215
- | `plan_stats` | Overview of plan metrics |