thoth-plugin 1.2.0 → 1.2.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +16 -0
- package/defaults/AGENTS.md +53 -0
- package/{dist/defaults/skill → defaults/skill/_legacy}/gardener/SKILL.md +243 -17
- package/defaults/skill/_legacy/onboarding/SKILL.md +207 -0
- package/defaults/skill/evening-close/SKILL.md +97 -0
- package/defaults/skill/mail-triage/SKILL.md +26 -0
- package/defaults/skill/morning-boot/SKILL.md +45 -0
- package/defaults/skill/slack-pulse/SKILL.md +26 -0
- package/defaults/skill/thought-router/SKILL.md +89 -0
- package/dist/cli.d.ts +0 -8
- package/dist/cli.js +70 -436
- package/dist/config/index.d.ts +1 -1
- package/dist/config/schema.d.ts +17 -0
- package/dist/defaults/skill/_legacy/cal-grid/SKILL.md +16 -0
- package/dist/defaults/skill/_legacy/capsule-init/SKILL.md +102 -0
- package/dist/defaults/skill/_legacy/cross-linker/SKILL.md +357 -0
- package/dist/defaults/skill/_legacy/email-draft/skill.md +134 -0
- package/dist/defaults/skill/_legacy/gardener/SKILL.md +509 -0
- package/dist/defaults/skill/_legacy/gardener/confidence-tiers.md +142 -0
- package/dist/defaults/skill/_legacy/gardener/repair-workflow.md +170 -0
- package/dist/defaults/skill/_legacy/google-chat-scan/SKILL.md +135 -0
- package/dist/defaults/skill/_legacy/handover/SKILL.md +18 -0
- package/dist/defaults/skill/_legacy/interview-prep/SKILL.md +23 -0
- package/dist/defaults/skill/_legacy/leadership-coach/SKILL.md +167 -0
- package/dist/defaults/skill/_legacy/link-retrofit/SKILL.md +345 -0
- package/dist/defaults/skill/_legacy/onboarding/SKILL.md +207 -0
- package/dist/defaults/skill/_legacy/post-meeting-drill/SKILL.md +485 -0
- package/dist/defaults/skill/_legacy/restore-environment/SKILL.md +30 -0
- package/dist/defaults/skill/_legacy/scorecard-synthesis/SKILL.md +26 -0
- package/dist/defaults/skill/_legacy/skill-generator/SKILL.md +362 -0
- package/dist/defaults/skill/_legacy/skill-generator/testing-protocol.md +158 -0
- package/dist/defaults/skill/_legacy/system-init/SKILL.md +103 -0
- package/dist/defaults/skill/evening-close/SKILL.md +6 -7
- package/dist/defaults/skill/mail-triage/SKILL.md +11 -8
- package/dist/defaults/skill/morning-boot/SKILL.md +26 -87
- package/dist/defaults/skill/slack-pulse/SKILL.md +11 -7
- package/dist/defaults/skill/thought-router/SKILL.md +10 -8
- package/dist/index.js +11316 -2308
- package/dist/schemas/skill.d.ts +96 -0
- package/dist/sdk/index.d.ts +4 -0
- package/dist/sdk/sentinel-service.d.ts +71 -0
- package/dist/sdk/skill-runner.d.ts +21 -0
- package/dist/sdk/thoth-client.d.ts +51 -0
- package/dist/sdk/workflows/calendar-watcher.d.ts +2 -0
- package/dist/sdk/workflows/inbox-watcher.d.ts +2 -0
- package/dist/sdk/workflows/index.d.ts +4 -0
- package/dist/sdk/workflows/system-watcher.d.ts +2 -0
- package/dist/sdk/workflows/task-watcher.d.ts +2 -0
- package/dist/services/index.d.ts +1 -0
- package/dist/services/skill-registry.d.ts +23 -0
- package/dist/tools/skill/tools.d.ts +2 -1
- package/package.json +8 -3
- /package/{dist/defaults/skill → defaults/skill/_legacy}/cal-grid/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/capsule-init/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/cross-linker/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/email-draft/skill.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/gardener/confidence-tiers.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/gardener/repair-workflow.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/google-chat-scan/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/handover/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/interview-prep/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/leadership-coach/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/link-retrofit/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/post-meeting-drill/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/restore-environment/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/scorecard-synthesis/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/skill-generator/SKILL.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/skill-generator/testing-protocol.md +0 -0
- /package/{dist/defaults/skill → defaults/skill/_legacy}/system-init/SKILL.md +0 -0
|
@@ -0,0 +1,362 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: skill-generator
|
|
3
|
+
description: Use when creating a new skill, editing an existing skill, or when asked to document a reusable process or technique
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# Skill Generator
|
|
7
|
+
|
|
8
|
+
You are creating or editing a skill for the Thoth knowledge system.
|
|
9
|
+
|
|
10
|
+
**Core principle:** No skill without baseline failure first. Writing skills IS Test-Driven Development applied to process documentation.
|
|
11
|
+
|
|
12
|
+
**Violating the letter of this process is violating the spirit of this process.**
|
|
13
|
+
|
|
14
|
+
---
|
|
15
|
+
|
|
16
|
+
## The Iron Law
|
|
17
|
+
|
|
18
|
+
```
|
|
19
|
+
NO SKILL FILE CREATED UNTIL RED PHASE SUBAGENT COMPLETES
|
|
20
|
+
```
|
|
21
|
+
|
|
22
|
+
If you create a SKILL.md before the RED phase subagent documents a failure, delete it. Start over.
|
|
23
|
+
|
|
24
|
+
**No exceptions:**
|
|
25
|
+
- Don't keep it as "draft"
|
|
26
|
+
- Don't "refine it later"
|
|
27
|
+
- Don't skip because "it's simple"
|
|
28
|
+
- Don't skip because "it's just a technique skill"
|
|
29
|
+
- Delete means delete
|
|
30
|
+
|
|
31
|
+
---
|
|
32
|
+
|
|
33
|
+
## Phase 0: Setup
|
|
34
|
+
|
|
35
|
+
### Step 1: Check if Skill Exists
|
|
36
|
+
|
|
37
|
+
```bash
|
|
38
|
+
ls -la .opencode/skill/ | grep -i "{skill-name}"
|
|
39
|
+
```
|
|
40
|
+
|
|
41
|
+
If exists: Read it first. You're editing, not creating.
|
|
42
|
+
|
|
43
|
+
### Step 2: Classify Skill Type
|
|
44
|
+
|
|
45
|
+
| Type | Purpose |
|
|
46
|
+
|------|---------|
|
|
47
|
+
| **Discipline** | Rules that resist rationalization (TDD, verification) |
|
|
48
|
+
| **Workflow** | Multi-step process with checkpoints |
|
|
49
|
+
| **Technique** | Concrete method with steps |
|
|
50
|
+
| **Reference** | API docs, syntax guides |
|
|
51
|
+
|
|
52
|
+
**Write down the type before proceeding.**
|
|
53
|
+
|
|
54
|
+
### Step 3: Research the Domain
|
|
55
|
+
|
|
56
|
+
- What tools/APIs are involved?
|
|
57
|
+
- What are common mistakes?
|
|
58
|
+
- What does success look like?
|
|
59
|
+
|
|
60
|
+
For discipline skills, also identify:
|
|
61
|
+
- What rationalizations will agents use to skip it?
|
|
62
|
+
- What pressure scenarios test compliance?
|
|
63
|
+
|
|
64
|
+
---
|
|
65
|
+
|
|
66
|
+
## Phase 1: RED — Baseline Test (MANDATORY)
|
|
67
|
+
|
|
68
|
+
**You MUST dispatch a subagent to attempt the task WITHOUT the skill.**
|
|
69
|
+
|
|
70
|
+
### Dispatch Baseline Subagent
|
|
71
|
+
|
|
72
|
+
```
|
|
73
|
+
task(
|
|
74
|
+
subagent_type="general",
|
|
75
|
+
description="Baseline test for {skill-name}",
|
|
76
|
+
prompt="""
|
|
77
|
+
You are testing baseline behavior WITHOUT a skill.
|
|
78
|
+
|
|
79
|
+
**Task:** {Describe the task the skill will teach}
|
|
80
|
+
|
|
81
|
+
**Context:** {Relevant context}
|
|
82
|
+
|
|
83
|
+
**Instructions:** Complete the task using your best judgment.
|
|
84
|
+
Do NOT load any skills. Just complete the task as you normally would.
|
|
85
|
+
|
|
86
|
+
**Report back:**
|
|
87
|
+
1. What approach did you take?
|
|
88
|
+
2. What formatting/tools/methods did you use?
|
|
89
|
+
3. Show the exact output you produced.
|
|
90
|
+
4. What assumptions did you make?
|
|
91
|
+
"""
|
|
92
|
+
)
|
|
93
|
+
```
|
|
94
|
+
|
|
95
|
+
### Document the Failure
|
|
96
|
+
|
|
97
|
+
Before proceeding, write down:
|
|
98
|
+
- **What the agent did wrong** (or suboptimally)
|
|
99
|
+
- **What rationalizations were used** (verbatim if possible)
|
|
100
|
+
- **What the skill must teach** to prevent this
|
|
101
|
+
|
|
102
|
+
**GATE: Do not proceed to Phase 2 until you have documented at least one failure or suboptimal behavior.**
|
|
103
|
+
|
|
104
|
+
If the baseline agent does everything perfectly, you may not need this skill.
|
|
105
|
+
|
|
106
|
+
---
|
|
107
|
+
|
|
108
|
+
## Phase 2: GREEN — Write Minimal Skill
|
|
109
|
+
|
|
110
|
+
Address the specific failures documented in Phase 1.
|
|
111
|
+
|
|
112
|
+
### Frontmatter (Required)
|
|
113
|
+
|
|
114
|
+
```yaml
|
|
115
|
+
---
|
|
116
|
+
name: lowercase-with-hyphens
|
|
117
|
+
description: Use when [specific triggers]. Third person. No workflow summary.
|
|
118
|
+
triggers:
|
|
119
|
+
- phrase that activates this skill
|
|
120
|
+
- another trigger phrase
|
|
121
|
+
- variations users might say
|
|
122
|
+
---
|
|
123
|
+
```
|
|
124
|
+
|
|
125
|
+
**Frontmatter Rules:**
|
|
126
|
+
|
|
127
|
+
| Field | Purpose | Rules |
|
|
128
|
+
|-------|---------|-------|
|
|
129
|
+
| `name` | Skill identifier | lowercase, hyphens only |
|
|
130
|
+
| `description` | Injected into system prompt for skill discovery | Start with "Use when...", third person, no workflow summary |
|
|
131
|
+
| `triggers` | **Automatic skill invocation** | Phrases users say that should activate this skill |
|
|
132
|
+
|
|
133
|
+
**Triggers are critical** — they teach Thoth when to use the skill automatically. Include:
|
|
134
|
+
- Direct commands ("check my email")
|
|
135
|
+
- Natural phrases ("what's in my inbox")
|
|
136
|
+
- Variations ("email triage", "process inbox")
|
|
137
|
+
|
|
138
|
+
**Description Rules:**
|
|
139
|
+
- ✅ Start with "Use when..."
|
|
140
|
+
- ✅ Describe triggers/symptoms only
|
|
141
|
+
- ✅ Third person (injected into system prompt)
|
|
142
|
+
- ❌ Never summarize the workflow
|
|
143
|
+
- ❌ Never use first person
|
|
144
|
+
|
|
145
|
+
### Required Sections
|
|
146
|
+
|
|
147
|
+
Copy this template and fill in:
|
|
148
|
+
|
|
149
|
+
```markdown
|
|
150
|
+
# Skill Name
|
|
151
|
+
|
|
152
|
+
**Core principle:** {one sentence}
|
|
153
|
+
|
|
154
|
+
---
|
|
155
|
+
|
|
156
|
+
## When to Use
|
|
157
|
+
|
|
158
|
+
- {symptom or trigger}
|
|
159
|
+
- {symptom or trigger}
|
|
160
|
+
|
|
161
|
+
**Do NOT use when:**
|
|
162
|
+
- {exclusion}
|
|
163
|
+
|
|
164
|
+
---
|
|
165
|
+
|
|
166
|
+
## Quick Reference
|
|
167
|
+
|
|
168
|
+
| Task | Command/Action |
|
|
169
|
+
|------|----------------|
|
|
170
|
+
| {task} | {how} |
|
|
171
|
+
|
|
172
|
+
---
|
|
173
|
+
|
|
174
|
+
## Process / Pattern
|
|
175
|
+
|
|
176
|
+
{Main content here}
|
|
177
|
+
|
|
178
|
+
---
|
|
179
|
+
|
|
180
|
+
## Common Mistakes
|
|
181
|
+
|
|
182
|
+
| Mistake | Prevention |
|
|
183
|
+
|---------|------------|
|
|
184
|
+
| {mistake} | {fix} |
|
|
185
|
+
|
|
186
|
+
---
|
|
187
|
+
|
|
188
|
+
## Red Flags - STOP
|
|
189
|
+
|
|
190
|
+
- {warning sign}
|
|
191
|
+
- {warning sign}
|
|
192
|
+
|
|
193
|
+
---
|
|
194
|
+
|
|
195
|
+
## Verification Checklist
|
|
196
|
+
|
|
197
|
+
- [ ] {check}
|
|
198
|
+
- [ ] {check}
|
|
199
|
+
```
|
|
200
|
+
|
|
201
|
+
### Optional Sections (Add If Needed)
|
|
202
|
+
|
|
203
|
+
- **Rationalization Table** — Required for discipline skills
|
|
204
|
+
- **Examples** — One excellent example beats many mediocre ones
|
|
205
|
+
- **Supporting Files** — Only for heavy reference (>100 lines)
|
|
206
|
+
|
|
207
|
+
---
|
|
208
|
+
|
|
209
|
+
## Phase 3: GREEN — Verify With Skill (MANDATORY)
|
|
210
|
+
|
|
211
|
+
**You MUST dispatch a subagent to attempt the same task WITH the skill.**
|
|
212
|
+
|
|
213
|
+
### Dispatch Skill-Guided Subagent
|
|
214
|
+
|
|
215
|
+
```
|
|
216
|
+
task(
|
|
217
|
+
subagent_type="general",
|
|
218
|
+
description="Skill-guided test for {skill-name}",
|
|
219
|
+
prompt="""
|
|
220
|
+
You are testing a skill.
|
|
221
|
+
|
|
222
|
+
**REQUIRED SKILL — Read and follow this exactly:**
|
|
223
|
+
|
|
224
|
+
---
|
|
225
|
+
{Paste the full skill content here}
|
|
226
|
+
---
|
|
227
|
+
|
|
228
|
+
**Task:** {Same task as baseline}
|
|
229
|
+
|
|
230
|
+
**Context:** {Same context as baseline}
|
|
231
|
+
|
|
232
|
+
**Instructions:** Follow the skill above. Complete the task.
|
|
233
|
+
|
|
234
|
+
**Report back:**
|
|
235
|
+
1. What approach did you take?
|
|
236
|
+
2. Did you follow the skill completely?
|
|
237
|
+
3. Show the exact output you produced.
|
|
238
|
+
"""
|
|
239
|
+
)
|
|
240
|
+
```
|
|
241
|
+
|
|
242
|
+
### Verify the Fix
|
|
243
|
+
|
|
244
|
+
Compare baseline vs skill-guided:
|
|
245
|
+
- Did the agent avoid the mistakes from Phase 1?
|
|
246
|
+
- Any NEW rationalizations that bypassed the skill?
|
|
247
|
+
- Any ambiguities in the skill that allowed violations?
|
|
248
|
+
|
|
249
|
+
**GATE: Do not proceed until the skill-guided agent performs correctly.**
|
|
250
|
+
|
|
251
|
+
If the agent still fails, revise the skill and re-test.
|
|
252
|
+
|
|
253
|
+
---
|
|
254
|
+
|
|
255
|
+
## Phase 4: REFACTOR — Quality Gate
|
|
256
|
+
|
|
257
|
+
### Close Loopholes
|
|
258
|
+
|
|
259
|
+
For each new rationalization found in Phase 3:
|
|
260
|
+
1. Add explicit counter to Common Mistakes or Rationalization Table
|
|
261
|
+
2. Add to Red Flags section
|
|
262
|
+
3. Re-run Phase 3 if significant changes made
|
|
263
|
+
|
|
264
|
+
### Format Checks
|
|
265
|
+
|
|
266
|
+
- [ ] Name: lowercase, hyphens only, no special chars
|
|
267
|
+
- [ ] Description: starts with "Use when...", no workflow summary
|
|
268
|
+
- [ ] Description: third person, under 500 chars
|
|
269
|
+
- [ ] **Triggers: list of phrases that activate this skill**
|
|
270
|
+
- [ ] Has "Core principle" statement
|
|
271
|
+
- [ ] Has "When to Use" section
|
|
272
|
+
- [ ] Has "Quick Reference" table
|
|
273
|
+
- [ ] Has "Common Mistakes" section
|
|
274
|
+
- [ ] Has "Red Flags" section
|
|
275
|
+
- [ ] Has "Verification Checklist"
|
|
276
|
+
|
|
277
|
+
### Size Checks
|
|
278
|
+
|
|
279
|
+
```bash
|
|
280
|
+
wc -l .opencode/skill/{name}/SKILL.md
|
|
281
|
+
```
|
|
282
|
+
|
|
283
|
+
- [ ] SKILL.md under 500 lines
|
|
284
|
+
- [ ] Frequently-loaded skills under 200 lines
|
|
285
|
+
- [ ] Heavy reference in separate files
|
|
286
|
+
|
|
287
|
+
### Functional Check
|
|
288
|
+
|
|
289
|
+
- [ ] Baseline subagent documented failure (Phase 1)
|
|
290
|
+
- [ ] Skill-guided subagent performed correctly (Phase 3)
|
|
291
|
+
- [ ] All referenced tools/commands exist
|
|
292
|
+
- [ ] No ambiguous instructions
|
|
293
|
+
|
|
294
|
+
---
|
|
295
|
+
|
|
296
|
+
## Rationalization Table
|
|
297
|
+
|
|
298
|
+
| Excuse | Reality |
|
|
299
|
+
|--------|---------|
|
|
300
|
+
| "This skill is simple, doesn't need structure" | Simple skills still need discoverability. Follow the format. |
|
|
301
|
+
| "I'll add sections later" | Later never comes. Add them now. |
|
|
302
|
+
| "The example is self-explanatory" | Examples don't replace Quick Reference tables. |
|
|
303
|
+
| "It's just a reference skill" | Reference skills still need When to Use and Common Mistakes. |
|
|
304
|
+
| "I already know this domain" | Your knowledge isn't in the file. Document it. |
|
|
305
|
+
| "Baseline test is overkill" | Baseline reveals what you'll skip. Do it. Subagent is mandatory. |
|
|
306
|
+
| "It's just a technique skill, doesn't need baseline" | Technique skills fail too. We learned this with email-draft. Test everything. |
|
|
307
|
+
| "I already know what will fail" | You're guessing. Run the subagent. Document actual behavior. |
|
|
308
|
+
| "I'll test after writing" | That's not TDD. RED comes before GREEN. Delete and start over. |
|
|
309
|
+
| "Description can summarize the workflow" | Claude follows descriptions instead of reading skills. Never summarize. |
|
|
310
|
+
|
|
311
|
+
---
|
|
312
|
+
|
|
313
|
+
## Red Flags - STOP
|
|
314
|
+
|
|
315
|
+
- Creating SKILL.md before RED phase subagent completes
|
|
316
|
+
- Skipping baseline test for ANY skill type
|
|
317
|
+
- "I already know what will fail" without running subagent
|
|
318
|
+
- Description summarizes workflow ("scans X, extracts Y, outputs Z")
|
|
319
|
+
- **Missing `triggers:` in frontmatter** — skill won't be auto-invoked
|
|
320
|
+
- Missing "When to Use" section
|
|
321
|
+
- No Quick Reference table
|
|
322
|
+
- "I'll test it later"
|
|
323
|
+
- Skill over 500 lines without supporting files
|
|
324
|
+
- Proceeding to Phase 4 without Phase 3 subagent verification
|
|
325
|
+
|
|
326
|
+
**All of these mean: STOP. Go back to the phase you skipped.**
|
|
327
|
+
|
|
328
|
+
---
|
|
329
|
+
|
|
330
|
+
## Supporting Files
|
|
331
|
+
|
|
332
|
+
When to create separate files:
|
|
333
|
+
|
|
334
|
+
| Content | Threshold | File |
|
|
335
|
+
|---------|-----------|------|
|
|
336
|
+
| Heavy reference | >100 lines | `reference.md` |
|
|
337
|
+
| Detailed examples | >50 lines | `examples.md` |
|
|
338
|
+
| Testing protocol | Discipline skills | `testing-protocol.md` |
|
|
339
|
+
|
|
340
|
+
---
|
|
341
|
+
|
|
342
|
+
## Cross-Reference
|
|
343
|
+
|
|
344
|
+
**Required reading:** Read `kernel/knowledge/skill-authoring-guide.md` for comprehensive best practices.
|
|
345
|
+
|
|
346
|
+
This skill is a lightweight enforcement layer. The guide has the full theory and examples.
|
|
347
|
+
|
|
348
|
+
---
|
|
349
|
+
|
|
350
|
+
*Skill Generator v2.0 | Part of Thoth Knowledge Management System*
|
|
351
|
+
|
|
352
|
+
---
|
|
353
|
+
|
|
354
|
+
## Quick Reference: The Phases
|
|
355
|
+
|
|
356
|
+
| Phase | Name | Action | Gate |
|
|
357
|
+
|-------|------|--------|------|
|
|
358
|
+
| 0 | Setup | Check exists, classify type, research domain | — |
|
|
359
|
+
| 1 | RED | Dispatch subagent WITHOUT skill, document failure | Must have documented failure |
|
|
360
|
+
| 2 | GREEN | Write minimal skill addressing failures | — |
|
|
361
|
+
| 3 | GREEN | Dispatch subagent WITH skill, verify fix | Must pass |
|
|
362
|
+
| 4 | REFACTOR | Close loopholes, quality checks | All checks pass |
|
|
@@ -0,0 +1,158 @@
|
|
|
1
|
+
# Testing Protocol for Skills
|
|
2
|
+
|
|
3
|
+
When and how to test skills before deployment.
|
|
4
|
+
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
## Testing by Skill Type
|
|
8
|
+
|
|
9
|
+
| Skill Type | Testing Required | Method |
|
|
10
|
+
|------------|------------------|--------|
|
|
11
|
+
| **Discipline** | Mandatory | Subagent pressure test |
|
|
12
|
+
| **Workflow** | Recommended | Manual walkthrough |
|
|
13
|
+
| **Technique** | Recommended | Clarity check |
|
|
14
|
+
| **Reference** | Optional | Retrieval test |
|
|
15
|
+
|
|
16
|
+
---
|
|
17
|
+
|
|
18
|
+
## Discipline Skills: Subagent Pressure Test
|
|
19
|
+
|
|
20
|
+
Discipline skills enforce rules that agents will try to rationalize around. Test with a fresh agent that doesn't have the skill loaded.
|
|
21
|
+
|
|
22
|
+
### Step 1: Create Pressure Scenario
|
|
23
|
+
|
|
24
|
+
Combine multiple pressures:
|
|
25
|
+
|
|
26
|
+
| Pressure Type | Example |
|
|
27
|
+
|---------------|---------|
|
|
28
|
+
| **Time** | "We need this done in 5 minutes" |
|
|
29
|
+
| **Sunk cost** | "I already wrote the code, just need to add tests" |
|
|
30
|
+
| **Authority** | "The user said to skip testing" |
|
|
31
|
+
| **Exhaustion** | "This is the 10th file, let's just finish" |
|
|
32
|
+
| **Simplicity** | "This is too simple to need the full process" |
|
|
33
|
+
|
|
34
|
+
### Step 2: Run Baseline (Without Skill)
|
|
35
|
+
|
|
36
|
+
Dispatch a subagent with the pressure scenario but WITHOUT the skill in context.
|
|
37
|
+
|
|
38
|
+
```
|
|
39
|
+
Task: "Implement feature X. We're short on time, so skip anything unnecessary."
|
|
40
|
+
```
|
|
41
|
+
|
|
42
|
+
Document:
|
|
43
|
+
- What did the agent skip?
|
|
44
|
+
- What rationalizations were used (verbatim)?
|
|
45
|
+
- What rules were violated?
|
|
46
|
+
|
|
47
|
+
### Step 3: Run With Skill
|
|
48
|
+
|
|
49
|
+
Same scenario, but WITH the skill loaded.
|
|
50
|
+
|
|
51
|
+
Document:
|
|
52
|
+
- Did the agent follow the rules?
|
|
53
|
+
- Any new rationalizations that bypassed the skill?
|
|
54
|
+
- Any ambiguities in the skill that allowed violations?
|
|
55
|
+
|
|
56
|
+
### Step 4: Close Loopholes
|
|
57
|
+
|
|
58
|
+
For each new rationalization found:
|
|
59
|
+
1. Add explicit counter to Rationalization Table
|
|
60
|
+
2. Add to Red Flags section
|
|
61
|
+
3. Re-test until bulletproof
|
|
62
|
+
|
|
63
|
+
---
|
|
64
|
+
|
|
65
|
+
## Workflow Skills: Manual Walkthrough
|
|
66
|
+
|
|
67
|
+
### Process
|
|
68
|
+
|
|
69
|
+
1. Read the skill as if you've never seen it
|
|
70
|
+
2. Follow each step literally
|
|
71
|
+
3. Note any ambiguities or missing information
|
|
72
|
+
4. Verify all referenced tools/files exist
|
|
73
|
+
|
|
74
|
+
### Checklist
|
|
75
|
+
|
|
76
|
+
- [ ] Each step is actionable (not vague)
|
|
77
|
+
- [ ] Order of steps is logical
|
|
78
|
+
- [ ] No missing prerequisites
|
|
79
|
+
- [ ] Error cases are handled
|
|
80
|
+
- [ ] Verification points are clear
|
|
81
|
+
|
|
82
|
+
---
|
|
83
|
+
|
|
84
|
+
## Technique Skills: Clarity Check
|
|
85
|
+
|
|
86
|
+
### Process
|
|
87
|
+
|
|
88
|
+
1. Give the skill to a fresh context (new session)
|
|
89
|
+
2. Ask: "Follow this skill to do X"
|
|
90
|
+
3. Observe if the agent can complete the task
|
|
91
|
+
|
|
92
|
+
### Checklist
|
|
93
|
+
|
|
94
|
+
- [ ] Instructions are unambiguous
|
|
95
|
+
- [ ] Examples are complete and runnable
|
|
96
|
+
- [ ] Common mistakes section covers real issues
|
|
97
|
+
- [ ] Quick Reference is sufficient for repeat use
|
|
98
|
+
|
|
99
|
+
---
|
|
100
|
+
|
|
101
|
+
## Reference Skills: Retrieval Test
|
|
102
|
+
|
|
103
|
+
### Process
|
|
104
|
+
|
|
105
|
+
1. Ask questions that should be answered by the skill
|
|
106
|
+
2. Verify the agent finds the right information
|
|
107
|
+
3. Check for gaps in coverage
|
|
108
|
+
|
|
109
|
+
### Example Questions
|
|
110
|
+
|
|
111
|
+
- "How do I do X?" → Should find the relevant section
|
|
112
|
+
- "What's the syntax for Y?" → Should find exact syntax
|
|
113
|
+
- "What are the options for Z?" → Should list all options
|
|
114
|
+
|
|
115
|
+
---
|
|
116
|
+
|
|
117
|
+
## When to Skip Testing
|
|
118
|
+
|
|
119
|
+
**Never skip for discipline skills.**
|
|
120
|
+
|
|
121
|
+
For other types, you may skip if:
|
|
122
|
+
- Skill is trivial (<50 lines)
|
|
123
|
+
- Skill is internal documentation only
|
|
124
|
+
- Time-critical AND low-risk
|
|
125
|
+
|
|
126
|
+
But document why you skipped:
|
|
127
|
+
```markdown
|
|
128
|
+
## Testing
|
|
129
|
+
|
|
130
|
+
Skipped: Technique skill under 50 lines, low complexity.
|
|
131
|
+
```
|
|
132
|
+
|
|
133
|
+
---
|
|
134
|
+
|
|
135
|
+
## Subagent Dispatch Template
|
|
136
|
+
|
|
137
|
+
For discipline skill testing:
|
|
138
|
+
|
|
139
|
+
```
|
|
140
|
+
Task: [Describe the scenario with pressure]
|
|
141
|
+
|
|
142
|
+
Context: You are implementing [feature].
|
|
143
|
+
Pressure: [Time/sunk cost/authority pressure]
|
|
144
|
+
|
|
145
|
+
Do NOT load any skills. Just complete the task as you normally would.
|
|
146
|
+
|
|
147
|
+
Report: What steps did you take? What did you skip and why?
|
|
148
|
+
```
|
|
149
|
+
|
|
150
|
+
After baseline, re-run with:
|
|
151
|
+
|
|
152
|
+
```
|
|
153
|
+
Task: [Same scenario]
|
|
154
|
+
|
|
155
|
+
REQUIRED SKILL: Use skill-generator (or the skill being tested)
|
|
156
|
+
|
|
157
|
+
Report: What steps did you take? Did you follow the skill completely?
|
|
158
|
+
```
|
|
@@ -0,0 +1,103 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: system-init
|
|
3
|
+
description: Onboarding wizard that interviews the user to populate the Thoth knowledge base and configure the environment.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# System Initialization Skill
|
|
7
|
+
|
|
8
|
+
You are the **Onboarding Agent**. Your goal is to interview the user to populate Thoth's "Ground Truth" knowledge base and configure the environment.
|
|
9
|
+
|
|
10
|
+
## Protocol Execution
|
|
11
|
+
|
|
12
|
+
### Step 0: Welcome & Explain
|
|
13
|
+
|
|
14
|
+
Introduce yourself:
|
|
15
|
+
> "I'm Thoth, your AI chief of staff. Before I can help you effectively, I need to learn about you. This onboarding will take about 10-15 minutes and will set up your personal knowledge base.
|
|
16
|
+
>
|
|
17
|
+
> I'll ask about:
|
|
18
|
+
> 1. Your work identity (role, projects, stakeholders)
|
|
19
|
+
> 2. Your personal life (health, relationships, values)
|
|
20
|
+
> 3. Technical setup verification
|
|
21
|
+
>
|
|
22
|
+
> Ready to begin?"
|
|
23
|
+
|
|
24
|
+
### Step 1: Work Identity (The Chief of Staff)
|
|
25
|
+
|
|
26
|
+
1. **Ask**: "Who are you professionally? (Role, Title, Company/Organization)"
|
|
27
|
+
|
|
28
|
+
2. **Ask**: "What are your Top 3 Active Projects right now? For each, tell me:
|
|
29
|
+
- Project name
|
|
30
|
+
- Your role in it
|
|
31
|
+
- Current status (planning/active/wrapping up)
|
|
32
|
+
- Key deadline if any"
|
|
33
|
+
|
|
34
|
+
3. **Ask**: "Who are your Top 3 Key Stakeholders? (People you report to, collaborate with closely, or manage)"
|
|
35
|
+
|
|
36
|
+
4. **Action**:
|
|
37
|
+
- Create `work/identity/me.md` with professional profile
|
|
38
|
+
- Create `work/projects/{project}.md` for each project using `kernel/templates/project.md`
|
|
39
|
+
- Create `work/people/{name}.md` for each stakeholder using `kernel/templates/person.md`
|
|
40
|
+
- Update `work/MASTER.md` with project overview
|
|
41
|
+
|
|
42
|
+
### Step 2: Personal Identity (The Life Coach)
|
|
43
|
+
|
|
44
|
+
You are now the **Biographer**. Interview the user to build the "Life Map."
|
|
45
|
+
|
|
46
|
+
**Biology (Health & Energy)**
|
|
47
|
+
1. **Ask**: "How do you track your health? Do you have any current health goals? (Sleep, Exercise, Nutrition)"
|
|
48
|
+
2. **Action**: Create `life/areas/health.md`
|
|
49
|
+
|
|
50
|
+
**Wealth (Admin & Environment)**
|
|
51
|
+
3. **Ask**: "Tell me about your life infrastructure. Living situation? Pets? What are the big 'Life Admin' buckets you manage?"
|
|
52
|
+
4. **Action**: Create `life/areas/admin.md`
|
|
53
|
+
|
|
54
|
+
**Community (The Village)**
|
|
55
|
+
5. **Ask**: "Who are the key people in your private life? (Partner, Family, Close Friends). I'll create profiles to track important moments."
|
|
56
|
+
6. **Action**: Create `life/people/{name}.md` for each person
|
|
57
|
+
|
|
58
|
+
**Psychology (Values & Meaning)**
|
|
59
|
+
7. **Ask**: "What are your core values? What does a 'Good Life' look like for you this year?"
|
|
60
|
+
8. **Action**: Create `life/identity/values.md`
|
|
61
|
+
|
|
62
|
+
### Step 3: Trust Level Setup
|
|
63
|
+
|
|
64
|
+
Explain the trust system:
|
|
65
|
+
> "Thoth uses a trust-based permission system. You start at Level 1 (Observer mode) where I can read but not write. As we work together successfully, I'll earn higher trust levels.
|
|
66
|
+
>
|
|
67
|
+
> - **Level 1**: Read-only, no external actions
|
|
68
|
+
> - **Level 2**: Can write to knowledge base, access email/calendar
|
|
69
|
+
> - **Level 3**: Full autonomy within your defined boundaries"
|
|
70
|
+
|
|
71
|
+
**Ask**: "Would you like to start at Level 1 (safest) or Level 2 (more capable)?"
|
|
72
|
+
|
|
73
|
+
**Action**: Update `kernel/state/trust.md` with initial level.
|
|
74
|
+
|
|
75
|
+
### Step 4: Technical Verification
|
|
76
|
+
|
|
77
|
+
1. **Check**: Can we read `kernel/THOTH.md`?
|
|
78
|
+
2. **Check**: Is the Google Workspace MCP responding? (Test with `google-workspace_list_calendars`)
|
|
79
|
+
3. **Check**: Is Slack MCP responding? (Test with `slack_channels_list`)
|
|
80
|
+
4. **Report**: List which integrations are available.
|
|
81
|
+
|
|
82
|
+
### Step 5: Finalize
|
|
83
|
+
|
|
84
|
+
1. **Summarize** what was created:
|
|
85
|
+
- Number of project files
|
|
86
|
+
- Number of people profiles
|
|
87
|
+
- Life areas configured
|
|
88
|
+
- Trust level set
|
|
89
|
+
|
|
90
|
+
2. **Suggest Next Steps**:
|
|
91
|
+
- "Run `skill(morning-boot)` tomorrow morning to start your first daily log"
|
|
92
|
+
- "Use `skill(thought-router)` for quick capture throughout the day"
|
|
93
|
+
|
|
94
|
+
3. **Output**: "System Initialization Complete. Thoth is ready to serve."
|
|
95
|
+
|
|
96
|
+
---
|
|
97
|
+
|
|
98
|
+
## Technical Constraints
|
|
99
|
+
|
|
100
|
+
- **Conversational**: This is an interview, not a form. Be warm and curious.
|
|
101
|
+
- **Incremental Saves**: Save each section as you complete it, don't wait until the end.
|
|
102
|
+
- **Templates**: Use templates from `kernel/templates/` for consistency.
|
|
103
|
+
- **Trust Level**: This skill can run at Level 1 (creates files in knowledge base).
|
|
@@ -1,13 +1,12 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: evening-close
|
|
3
|
+
version: 1.0.0
|
|
3
4
|
description: Summarize the day, extract incomplete tasks into tomorrow's overflow, and persist daily learnings to the Knowledge Base.
|
|
4
|
-
triggers:
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
8
|
-
|
|
9
|
-
- what did I accomplish
|
|
10
|
-
- end my day
|
|
5
|
+
triggers:
|
|
6
|
+
output:
|
|
7
|
+
type: markdown
|
|
8
|
+
created: 2026-01-09
|
|
9
|
+
updated: 2026-01-09
|
|
11
10
|
---
|
|
12
11
|
|
|
13
12
|
# Evening Close Skill
|
|
@@ -1,13 +1,16 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: mail-triage
|
|
3
|
-
|
|
4
|
-
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
8
|
-
|
|
9
|
-
|
|
10
|
-
|
|
3
|
+
version: 1.0.0
|
|
4
|
+
description: Max emails to process
|
|
5
|
+
triggers:
|
|
6
|
+
inputs:
|
|
7
|
+
- name: limit
|
|
8
|
+
type: markdown
|
|
9
|
+
required: false
|
|
10
|
+
default: 20
|
|
11
|
+
output:
|
|
12
|
+
created: 2026-01-09
|
|
13
|
+
updated: 2026-01-09
|
|
11
14
|
---
|
|
12
15
|
|
|
13
16
|
# Mail Triage Skill
|