@soleri/forge 8.1.0 → 9.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (111) hide show
  1. package/dist/agent-schema.d.ts +16 -18
  2. package/dist/agent-schema.js +6 -6
  3. package/dist/agent-schema.js.map +1 -1
  4. package/dist/scaffold-filetree.js +19 -3
  5. package/dist/scaffold-filetree.js.map +1 -1
  6. package/dist/scaffolder.js +0 -4
  7. package/dist/scaffolder.js.map +1 -1
  8. package/dist/skills/brain-debrief/SKILL.md +66 -0
  9. package/dist/skills/brainstorming/SKILL.md +85 -0
  10. package/dist/skills/code-patrol/SKILL.md +86 -0
  11. package/dist/skills/context-resume/SKILL.md +92 -0
  12. package/dist/skills/executing-plans/SKILL.md +87 -0
  13. package/dist/skills/fix-and-learn/SKILL.md +98 -0
  14. package/dist/skills/health-check/SKILL.md +131 -0
  15. package/dist/skills/knowledge-harvest/SKILL.md +93 -0
  16. package/dist/skills/onboard-me/SKILL.md +118 -0
  17. package/dist/skills/retrospective/SKILL.md +95 -0
  18. package/dist/skills/second-opinion/SKILL.md +97 -0
  19. package/dist/skills/systematic-debugging/SKILL.md +99 -0
  20. package/dist/skills/test-driven-development/SKILL.md +91 -0
  21. package/dist/skills/vault-capture/SKILL.md +82 -0
  22. package/dist/skills/vault-navigator/SKILL.md +80 -0
  23. package/dist/skills/verification-before-completion/SKILL.md +82 -0
  24. package/dist/skills/writing-plans/SKILL.md +105 -0
  25. package/dist/templates/agents-md.js +0 -1
  26. package/dist/templates/agents-md.js.map +1 -1
  27. package/dist/templates/claude-md-template.js +2 -2
  28. package/dist/templates/claude-md-template.js.map +1 -1
  29. package/dist/templates/entry-point.js +1 -1
  30. package/dist/templates/entry-point.js.map +1 -1
  31. package/dist/templates/shared-rules.js +19 -0
  32. package/dist/templates/shared-rules.js.map +1 -1
  33. package/dist/templates/test-facades.js +0 -11
  34. package/dist/templates/test-facades.js.map +1 -1
  35. package/dist/types.d.ts +20 -20
  36. package/dist/types.js +6 -6
  37. package/dist/types.js.map +1 -1
  38. package/package.json +2 -2
  39. package/src/__tests__/scaffold-filetree.test.ts +0 -2
  40. package/src/agent-schema.ts +6 -6
  41. package/src/scaffold-filetree.ts +21 -2
  42. package/src/scaffolder.ts +0 -4
  43. package/src/skills/brain-debrief/SKILL.md +66 -0
  44. package/src/skills/brainstorming/SKILL.md +85 -0
  45. package/src/skills/code-patrol/SKILL.md +86 -0
  46. package/src/skills/context-resume/SKILL.md +92 -0
  47. package/src/skills/executing-plans/SKILL.md +87 -0
  48. package/src/skills/fix-and-learn/SKILL.md +98 -0
  49. package/src/skills/health-check/SKILL.md +131 -0
  50. package/src/skills/knowledge-harvest/SKILL.md +93 -0
  51. package/src/skills/onboard-me/SKILL.md +118 -0
  52. package/src/skills/retrospective/SKILL.md +95 -0
  53. package/src/skills/second-opinion/SKILL.md +97 -0
  54. package/src/skills/systematic-debugging/SKILL.md +99 -0
  55. package/src/skills/test-driven-development/SKILL.md +91 -0
  56. package/src/skills/vault-capture/SKILL.md +82 -0
  57. package/src/skills/vault-navigator/SKILL.md +80 -0
  58. package/src/skills/verification-before-completion/SKILL.md +82 -0
  59. package/src/skills/writing-plans/SKILL.md +105 -0
  60. package/src/templates/agents-md.ts +0 -1
  61. package/src/templates/claude-md-template.ts +1 -2
  62. package/src/templates/entry-point.ts +1 -1
  63. package/src/templates/shared-rules.ts +20 -0
  64. package/src/templates/test-facades.ts +0 -11
  65. package/src/types.ts +6 -6
  66. package/dist/skills/agent-dev.md +0 -122
  67. package/dist/skills/agent-guide.md +0 -110
  68. package/dist/skills/agent-persona.md +0 -66
  69. package/dist/skills/brain-debrief.md +0 -214
  70. package/dist/skills/brainstorming.md +0 -180
  71. package/dist/skills/code-patrol.md +0 -178
  72. package/dist/skills/context-resume.md +0 -146
  73. package/dist/skills/deliver-and-ship.md +0 -123
  74. package/dist/skills/env-setup.md +0 -151
  75. package/dist/skills/executing-plans.md +0 -216
  76. package/dist/skills/fix-and-learn.md +0 -167
  77. package/dist/skills/health-check.md +0 -231
  78. package/dist/skills/knowledge-harvest.md +0 -185
  79. package/dist/skills/onboard-me.md +0 -198
  80. package/dist/skills/retrospective.md +0 -205
  81. package/dist/skills/second-opinion.md +0 -149
  82. package/dist/skills/systematic-debugging.md +0 -241
  83. package/dist/skills/test-driven-development.md +0 -281
  84. package/dist/skills/vault-capture.md +0 -170
  85. package/dist/skills/vault-curate.md +0 -107
  86. package/dist/skills/vault-navigator.md +0 -140
  87. package/dist/skills/verification-before-completion.md +0 -182
  88. package/dist/skills/writing-plans.md +0 -215
  89. package/src/skills/agent-dev.md +0 -122
  90. package/src/skills/agent-guide.md +0 -110
  91. package/src/skills/agent-persona.md +0 -66
  92. package/src/skills/brain-debrief.md +0 -214
  93. package/src/skills/brainstorming.md +0 -180
  94. package/src/skills/code-patrol.md +0 -178
  95. package/src/skills/context-resume.md +0 -146
  96. package/src/skills/deliver-and-ship.md +0 -123
  97. package/src/skills/env-setup.md +0 -151
  98. package/src/skills/executing-plans.md +0 -216
  99. package/src/skills/fix-and-learn.md +0 -167
  100. package/src/skills/health-check.md +0 -231
  101. package/src/skills/knowledge-harvest.md +0 -185
  102. package/src/skills/onboard-me.md +0 -198
  103. package/src/skills/retrospective.md +0 -205
  104. package/src/skills/second-opinion.md +0 -149
  105. package/src/skills/systematic-debugging.md +0 -241
  106. package/src/skills/test-driven-development.md +0 -281
  107. package/src/skills/vault-capture.md +0 -170
  108. package/src/skills/vault-curate.md +0 -107
  109. package/src/skills/vault-navigator.md +0 -140
  110. package/src/skills/verification-before-completion.md +0 -182
  111. package/src/skills/writing-plans.md +0 -215
@@ -1,170 +0,0 @@
1
- ---
2
- name: vault-capture
3
- description: Use when the user says "capture this", "save to vault", "remember this pattern", "log this anti-pattern", "store this knowledge", "add to vault", "capture what we learned", or wants to persist a pattern, anti-pattern, workflow, or principle to the knowledge base.
4
- ---
5
-
6
- # Vault Capture — Persist Knowledge
7
-
8
- Capture patterns, anti-patterns, workflows, and principles to the vault. Captured knowledge compounds — it informs future vault searches, brain recommendations, and team reviews.
9
-
10
- ## When to Use
11
-
12
- After discovering something worth remembering: a solution that worked, a mistake to avoid, a workflow that proved effective, or a principle that should guide future work.
13
-
14
- ## Orchestration Sequence
15
-
16
- ### Step 1: Check for Duplicates
17
-
18
- Call `YOUR_AGENT_core op:search_intelligent` with the knowledge title or description. If a similar entry exists, consider updating it instead of creating a duplicate.
19
-
20
- ```
21
- YOUR_AGENT_core op:search_intelligent
22
- params: { query: "<knowledge title or description>" }
23
- ```
24
-
25
- Also run duplicate detection explicitly:
26
-
27
- ```
28
- YOUR_AGENT_core op:curator_detect_duplicates
29
- ```
30
-
31
- If duplicates are found, decide: update the existing entry or merge them.
32
-
33
- ### Step 2: Classify the Knowledge
34
-
35
- Determine the entry type:
36
-
37
- - **pattern** — Something that works and should be repeated
38
- - **anti-pattern** — Something that fails and should be avoided
39
- - **workflow** — A sequence of steps for a specific task
40
- - **principle** — A guiding rule or heuristic
41
- - **decision** — An architectural or design choice with rationale
42
-
43
- Use intent routing to help classify:
44
-
45
- ```
46
- YOUR_AGENT_core op:route_intent
47
- params: { prompt: "<description of the knowledge>" }
48
- ```
49
-
50
- ### Step 3: Capture
51
-
52
- For quick, single-entry captures:
53
- Call `YOUR_AGENT_core op:capture_knowledge` with:
54
-
55
- - **title**: Clear, searchable name
56
- - **description**: What it is and when it applies
57
- - **type**: From Step 2 classification
58
- - **category**: Domain area (e.g., "component-patterns", "api-design", "infrastructure")
59
- - **tags**: Searchable keywords
60
- - **example**: Code snippet or before/after if applicable
61
- - **why**: The reasoning — this is what makes the entry actionable
62
-
63
- ```
64
- YOUR_AGENT_core op:capture_knowledge
65
- params: {
66
- title: "<clear, searchable name>",
67
- description: "<what it is and when it applies>",
68
- type: "<pattern|anti-pattern|workflow|principle|decision>",
69
- category: "<domain area>",
70
- tags: ["<tag1>", "<tag2>"],
71
- example: "<code or before/after>",
72
- why: "<reasoning>"
73
- }
74
- ```
75
-
76
- For quick captures:
77
-
78
- ```
79
- YOUR_AGENT_core op:capture_quick
80
- params: { title: "<name>", description: "<details>" }
81
- ```
82
-
83
- ### Step 4: Post-Capture Quality
84
-
85
- After capturing, run the curator to ensure quality:
86
-
87
- **Groom the entry** — normalize tags, fix metadata:
88
-
89
- ```
90
- YOUR_AGENT_core op:curator_groom
91
- params: { entryId: "<captured entry id>" }
92
- ```
93
-
94
- **Enrich the entry** — use LLM to add context, improve description:
95
-
96
- ```
97
- YOUR_AGENT_core op:curator_enrich
98
- params: { entryId: "<captured entry id>" }
99
- ```
100
-
101
- **Check for contradictions** — does this conflict with existing knowledge?
102
-
103
- ```
104
- YOUR_AGENT_core op:curator_contradictions
105
- ```
106
-
107
- If contradictions found, resolve them:
108
-
109
- ```
110
- YOUR_AGENT_core op:curator_resolve_contradiction
111
- params: { contradictionId: "<id>" }
112
- ```
113
-
114
- ### Step 5: Handle Governance (if enabled)
115
-
116
- If governance policy requires review, the capture returns a `proposalId`. The entry is queued for approval.
117
-
118
- ```
119
- YOUR_AGENT_core op:governance_proposals
120
- params: { action: "list" }
121
- ```
122
-
123
- Present pending proposals to the user for approval.
124
-
125
- ### Step 6: Promote to Global (Optional)
126
-
127
- If the knowledge applies across projects (not project-specific):
128
-
129
- ```
130
- YOUR_AGENT_core op:memory_promote_to_global
131
- params: { entryId: "<entry id>" }
132
- ```
133
-
134
- This makes it available in cross-project searches and brain recommendations.
135
-
136
- ### Step 7: Verify Health
137
-
138
- Confirm the capture was stored and vault health is maintained:
139
-
140
- ```
141
- YOUR_AGENT_core op:admin_health
142
- ```
143
-
144
- Check vault analytics for overall knowledge quality:
145
-
146
- ```
147
- YOUR_AGENT_core op:admin_vault_analytics
148
- ```
149
-
150
- ## Exit Criteria
151
-
152
- Capture is complete when: the entry is stored (or queued for review), categorized, tagged, groomed, and vault health confirmed. If promoted to global, cross-project availability is verified.
153
-
154
- ## Agent Tools Reference
155
-
156
- | Op | When to Use |
157
- | ------------------------------- | ----------------------------------- |
158
- | `search_intelligent` | Check for duplicates before capture |
159
- | `curator_detect_duplicates` | Explicit duplicate detection |
160
- | `route_intent` | Help classify knowledge type |
161
- | `capture_knowledge` | Full-metadata capture |
162
- | `capture_quick` | Fast capture for simple entries |
163
- | `curator_groom` | Normalize tags and metadata |
164
- | `curator_enrich` | LLM-powered metadata enrichment |
165
- | `curator_contradictions` | Find conflicting entries |
166
- | `curator_resolve_contradiction` | Resolve conflicts |
167
- | `governance_proposals` | Check/manage approval queue |
168
- | `memory_promote_to_global` | Share across projects |
169
- | `admin_health` | Verify system health |
170
- | `admin_vault_analytics` | Overall knowledge quality metrics |
@@ -1,107 +0,0 @@
1
- ---
2
- name: vault-curate
3
- description: >
4
- Use when the user says "clean vault", "deduplicate", "groom knowledge",
5
- "consolidate vault", "vault maintenance", "find duplicates", "merge patterns",
6
- "check contradictions", "vault health", or wants to maintain, clean, reorganize,
7
- or improve the quality of the agent's knowledge base.
8
- ---
9
-
10
- # Vault Curate — Knowledge Maintenance
11
-
12
- Maintain vault quality through deduplication, grooming, contradiction detection, and consolidation. A well-curated vault produces better search results and brain recommendations.
13
-
14
- ## When to Use
15
-
16
- Periodically (weekly or after heavy capture sessions), when search quality degrades, when vault health shows warnings, or when the user explicitly requests maintenance.
17
-
18
- ## Orchestration Sequence
19
-
20
- ### Step 1: Health Assessment
21
-
22
- ```
23
- YOUR_AGENT_core op:knowledge_health
24
- ```
25
-
26
- ```
27
- YOUR_AGENT_core op:get_vault_analytics
28
- ```
29
-
30
- Present the health summary to the user before proceeding: total entries, quality scores, staleness, coverage gaps.
31
-
32
- ### Step 2: Detect Duplicates
33
-
34
- ```
35
- YOUR_AGENT_core op:curator_detect_duplicates
36
- ```
37
-
38
- This finds entries with overlapping titles, descriptions, or content. Review the duplicate pairs — some may be intentional (different contexts) while others are true duplicates.
39
-
40
- For true duplicates:
41
-
42
- ```
43
- YOUR_AGENT_core op:merge_patterns
44
- params: { patternIds: ["<id1>", "<id2>"] }
45
- ```
46
-
47
- Preserve the best content from each.
48
-
49
- ### Step 3: Find Contradictions
50
-
51
- ```
52
- YOUR_AGENT_core op:curator_contradictions
53
- ```
54
-
55
- Contradictions erode trust in vault search results. For each contradiction: decide which entry is correct (check dates, context, evidence), then archive or update the incorrect one.
56
-
57
- ### Step 4: Groom Entries
58
-
59
- ```
60
- YOUR_AGENT_core op:curator_groom_all
61
- ```
62
-
63
- Runs tag enrichment and metadata cleanup across all entries. This improves searchability and categorization.
64
-
65
- For targeted grooming of specific entries:
66
-
67
- ```
68
- YOUR_AGENT_core op:curator_groom
69
- params: { entryIds: ["<id>"], tags: ["<tag>"] }
70
- ```
71
-
72
- ### Step 5: GPT Enrichment (Optional)
73
-
74
- ```
75
- YOUR_AGENT_core op:curator_gpt_enrich
76
- ```
77
-
78
- Adds AI-generated metadata to entries that lack descriptions, examples, or context. Fills in gaps without changing the core content.
79
-
80
- ### Step 6: Full Consolidation
81
-
82
- ```
83
- YOUR_AGENT_core op:curator_consolidate
84
- ```
85
-
86
- Runs the complete pipeline: dedup + archive stale entries + resolve contradictions. This is the heavy-duty cleanup.
87
-
88
- ### Step 7: Knowledge Reorganization
89
-
90
- ```
91
- YOUR_AGENT_core op:knowledge_reorganize
92
- params: { mode: "preview" }
93
- ```
94
-
95
- Preview first, then run again with `mode: "apply"` if the preview looks good.
96
-
97
- ### Step 8: Verify Results
98
-
99
- ```
100
- YOUR_AGENT_core op:knowledge_health
101
- ```
102
-
103
- Compare with Step 1 metrics. Vault health should improve: fewer duplicates, no contradictions, better coverage.
104
-
105
- ## Exit Criteria
106
-
107
- Curation is complete when: duplicates merged, contradictions resolved, entries groomed, and health metrics improved compared to Step 1 baseline.
@@ -1,140 +0,0 @@
1
- ---
2
- name: vault-navigator
3
- description: Use when the user asks "what does vault say", "search knowledge", "find pattern", "have we seen this before", "best practice for", "check vault", "vault search", "any patterns for", or wants to query the knowledge base for existing solutions or guidance.
4
- ---
5
-
6
- # Vault Navigator — Knowledge Oracle
7
-
8
- Navigate the vault intelligently. The vault has multiple search strategies — this skill picks the right one based on what the user needs.
9
-
10
- ## When to Use
11
-
12
- Any time the user wants to find existing knowledge before building something new. Also when asking about best practices, previous solutions, or patterns.
13
-
14
- ## Search Strategy Decision Tree
15
-
16
- ### For "Have we seen this before?" / "Best practice for X"
17
-
18
- Start with `YOUR_AGENT_core op:search_intelligent` — this is semantic search, the broadest and smartest query. Pass the user's question as the query.
19
-
20
- ```
21
- YOUR_AGENT_core op:search_intelligent
22
- params: { query: "<user's question>" }
23
- ```
24
-
25
- If results are weak (low scores or few matches), fall back to `YOUR_AGENT_core op:search` with explicit filters (type, category, tags, severity). This is structured search — narrower but more precise.
26
-
27
- ### For "Show me everything about X" (Exploration)
28
-
29
- Use tag-based and domain-based browsing for broader exploration:
30
-
31
- ```
32
- YOUR_AGENT_core op:vault_tags
33
- ```
34
-
35
- Lists all tags in the vault — helps discover what topics are covered.
36
-
37
- ```
38
- YOUR_AGENT_core op:vault_domains
39
- ```
40
-
41
- Lists all domains — shows the knowledge landscape at a glance.
42
-
43
- ```
44
- YOUR_AGENT_core op:vault_recent
45
- ```
46
-
47
- Shows recently added or modified entries — what's fresh in the vault.
48
-
49
- ### For "What's stale?" / "What needs updating?"
50
-
51
- Run an age report to find outdated knowledge:
52
-
53
- ```
54
- YOUR_AGENT_core op:vault_age_report
55
- ```
56
-
57
- Present entries that haven't been updated recently — these are candidates for review, refresh, or removal.
58
-
59
- ### For "What do other projects do?"
60
-
61
- Call `YOUR_AGENT_core op:memory_cross_project_search` with `crossProject: true`. This searches across all linked projects, not just the current one.
62
-
63
- ```
64
- YOUR_AGENT_core op:memory_cross_project_search
65
- params: { query: "<topic>", crossProject: true }
66
- ```
67
-
68
- Check what projects are linked:
69
-
70
- ```
71
- YOUR_AGENT_core op:project_linked_projects
72
- ```
73
-
74
- ### For "Has brain learned anything about X?"
75
-
76
- Call `YOUR_AGENT_core op:brain_strengths` to see which patterns have proven strength. Then call `YOUR_AGENT_core op:brain_global_patterns` with a domain or tag filter to find cross-project patterns.
77
-
78
- ```
79
- YOUR_AGENT_core op:brain_strengths
80
- YOUR_AGENT_core op:brain_global_patterns
81
- params: { domain: "<domain>" }
82
- ```
83
-
84
- ### For "What do I know about X?" (broad exploration)
85
-
86
- Chain multiple strategies for comprehensive results:
87
-
88
- 1. `search_intelligent` → semantic vault search
89
- 2. `vault_tags` / `vault_domains` → browse knowledge landscape
90
- 3. `memory_cross_project_search` → cross-project patterns
91
- 4. `brain_strengths` → proven patterns
92
-
93
- Present all findings with source labels so the user knows where each insight came from.
94
-
95
- ## Presenting Results
96
-
97
- Always include:
98
-
99
- - **Source**: Which search found it (vault, memory, brain, tags, domains)
100
- - **Confidence**: Score or strength rating
101
- - **Relevance**: Why this result matches the query
102
- - **Actionable next step**: How to apply this knowledge
103
-
104
- ## Fallback: Web Search
105
-
106
- If all vault strategies return no results, search the web for the user's question before saying "nothing found." The web may have:
107
-
108
- - Documentation, articles, or guides on the topic
109
- - Community patterns and best practices
110
- - Library-specific solutions
111
-
112
- If web search finds something useful, offer to capture it to the vault:
113
-
114
- ```
115
- YOUR_AGENT_core op:capture_quick
116
- params: {
117
- title: "<what was found>",
118
- description: "<summary from web search, source URL>"
119
- }
120
- ```
121
-
122
- ## Exit Criteria
123
-
124
- Search is complete when at least one search strategy has been tried and results presented. If no results found across all strategies (vault + web), say so explicitly — that's valuable information too (it means this is genuinely new territory worth exploring and capturing).
125
-
126
- ## Agent Tools Reference
127
-
128
- | Op | When to Use |
129
- | ----------------------------- | ----------------------------------------------------- |
130
- | `search_intelligent` | Default semantic search — broadest and smartest |
131
- | `search` | Structured search with filters (type, tags, category) |
132
- | `vault_tags` | Browse all tags — discover knowledge landscape |
133
- | `vault_domains` | Browse all domains — see what areas are covered |
134
- | `vault_recent` | Recently modified entries — what's fresh |
135
- | `vault_age_report` | Find stale entries needing refresh |
136
- | `memory_cross_project_search` | Search across linked projects |
137
- | `project_linked_projects` | See what projects are connected |
138
- | `brain_strengths` | Proven patterns ranked by success |
139
- | `brain_global_patterns` | Cross-project patterns from global pool |
140
- | `capture_quick` | Capture web findings to vault for next time |
@@ -1,182 +0,0 @@
1
- ---
2
- name: verification-before-completion
3
- description: Use when about to claim work is complete, fixed, or passing, before committing or creating PRs - requires running verification commands and confirming output before making any success claims; evidence before assertions always
4
- ---
5
-
6
- <!-- Adapted from superpowers (MIT License) -->
7
-
8
- # Verification Before Completion
9
-
10
- ## Overview
11
-
12
- Claiming work is complete without verification is dishonesty, not efficiency.
13
-
14
- **Core principle:** Evidence before claims, always.
15
-
16
- **Violating the letter of this rule is violating the spirit of this rule.**
17
-
18
- ## The Iron Law
19
-
20
- ```
21
- NO COMPLETION CLAIMS WITHOUT FRESH VERIFICATION EVIDENCE
22
- ```
23
-
24
- If you haven't run the verification command in this message, you cannot claim it passes.
25
-
26
- ## The Gate Function
27
-
28
- ```
29
- BEFORE claiming any status or expressing satisfaction:
30
-
31
- 1. IDENTIFY: What command proves this claim?
32
- 2. RUN: Execute the FULL command (fresh, complete)
33
- 3. READ: Full output, check exit code, count failures
34
- 4. VERIFY: Does output confirm the claim?
35
- - If NO: State actual status with evidence
36
- - If YES: State claim WITH evidence
37
- 5. AGENT CHECK: Run system diagnostics
38
- 6. ONLY THEN: Make the claim
39
-
40
- Skip any step = lying, not verifying
41
- ```
42
-
43
- ## Agent System Checks
44
-
45
- After passing all verification commands, run system diagnostics:
46
-
47
- ### Health Check
48
-
49
- ```
50
- YOUR_AGENT_core op:admin_health
51
- ```
52
-
53
- Catches issues tests might miss — vault corruption, stale caches, configuration drift.
54
-
55
- ### Full Diagnostic
56
-
57
- ```
58
- YOUR_AGENT_core op:admin_diagnostic
59
- ```
60
-
61
- Comprehensive system check — module status, database integrity, cache health, configuration validity.
62
-
63
- ### Vault Analytics
64
-
65
- ```
66
- YOUR_AGENT_core op:admin_vault_analytics
67
- ```
68
-
69
- Verify knowledge quality metrics — are capture rates healthy? Any degradation?
70
-
71
- If any check reports problems, address them before claiming completion.
72
-
73
- ## Common Failures
74
-
75
- | Claim | Requires | Not Sufficient |
76
- | --------------------- | ------------------------------- | ------------------------------ |
77
- | Tests pass | Test command output: 0 failures | Previous run, "should pass" |
78
- | Linter clean | Linter output: 0 errors | Partial check, extrapolation |
79
- | Build succeeds | Build command: exit 0 | Linter passing, logs look good |
80
- | Bug fixed | Test original symptom: passes | Code changed, assumed fixed |
81
- | Regression test works | Red-green cycle verified | Test passes once |
82
- | Agent completed | VCS diff shows changes | Agent reports "success" |
83
- | Requirements met | Line-by-line checklist | Tests passing |
84
- | Agent healthy | `admin_diagnostic` clean | "No errors in logs" |
85
-
86
- ## Red Flags - STOP
87
-
88
- - Using "should", "probably", "seems to"
89
- - Expressing satisfaction before verification ("Great!", "Perfect!", "Done!", etc.)
90
- - About to commit/push/PR without verification
91
- - Trusting agent success reports
92
- - Relying on partial verification
93
- - Thinking "just this once"
94
- - Tired and wanting work over
95
- - ANY wording implying success without having run verification
96
-
97
- ## Rationalization Prevention
98
-
99
- | Excuse | Reality |
100
- | --------------------------------------- | ---------------------- |
101
- | "Should work now" | RUN the verification |
102
- | "I'm confident" | Confidence ≠ evidence |
103
- | "Just this once" | No exceptions |
104
- | "Linter passed" | Linter ≠ compiler |
105
- | "Agent said success" | Verify independently |
106
- | "I'm tired" | Exhaustion ≠ excuse |
107
- | "Partial check is enough" | Partial proves nothing |
108
- | "Different words so rule doesn't apply" | Spirit over letter |
109
-
110
- ## Key Patterns
111
-
112
- **Tests:**
113
-
114
- ```
115
- [Run test command] [See: 34/34 pass] "All tests pass"
116
- NOT: "Should pass now" / "Looks correct"
117
- ```
118
-
119
- **Regression tests (TDD Red-Green):**
120
-
121
- ```
122
- Write -> Run (pass) -> Revert fix -> Run (MUST FAIL) -> Restore -> Run (pass)
123
- NOT: "I've written a regression test" (without red-green verification)
124
- ```
125
-
126
- **Build:**
127
-
128
- ```
129
- [Run build] [See: exit 0] "Build passes"
130
- NOT: "Linter passed" (linter doesn't check compilation)
131
- ```
132
-
133
- **Requirements:**
134
-
135
- ```
136
- Re-read plan -> Create checklist -> Verify each -> Report gaps or completion
137
- NOT: "Tests pass, phase complete"
138
- ```
139
-
140
- **Agent delegation:**
141
-
142
- ```
143
- Agent reports success -> Check VCS diff -> Verify changes -> Report actual state
144
- NOT: Trust agent report
145
- ```
146
-
147
- ## After Verification — Capture Session
148
-
149
- Once work is verified complete, capture a session summary so context persists:
150
-
151
- ```
152
- YOUR_AGENT_core op:session_capture
153
- params: {
154
- summary: "<what was accomplished, files modified, key decisions>"
155
- }
156
- ```
157
-
158
- This ensures the next session has context about what was verified and completed.
159
-
160
- ## When To Apply
161
-
162
- **ALWAYS before:**
163
-
164
- - ANY variation of success/completion claims
165
- - ANY expression of satisfaction
166
- - ANY positive statement about work state
167
- - Committing, PR creation, task completion
168
- - Moving to next task
169
- - Delegating to agents
170
-
171
- ## The Bottom Line
172
-
173
- Run the command. Read the output. THEN claim the result. This is non-negotiable.
174
-
175
- ## Agent Tools Reference
176
-
177
- | Op | When to Use |
178
- | ----------------------- | ----------------------------------- |
179
- | `admin_health` | Quick system health check |
180
- | `admin_diagnostic` | Comprehensive system diagnostic |
181
- | `admin_vault_analytics` | Knowledge quality metrics |
182
- | `session_capture` | Persist verified completion context |