@letta-ai/letta-code 0.7.4-next.1 → 0.7.4-next.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/letta.js +44 -403
- package/package.json +2 -1
- package/skills/memory-init/SKILL.md +310 -0
- package/skills/skill-creator/SKILL.md +355 -0
- package/skills/skill-creator/references/output-patterns.md +82 -0
- package/skills/skill-creator/references/workflows.md +28 -0
- package/skills/skill-creator/scripts/init-skill.ts +279 -0
- package/skills/skill-creator/scripts/package-skill.ts +268 -0
- package/skills/skill-creator/scripts/validate-skill.ts +161 -0
|
@@ -0,0 +1,310 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: Memory Initialization
|
|
3
|
+
description: Comprehensive guide for initializing or reorganizing agent memory. Load this skill when running /init, when the user asks you to set up your memory, or when you need guidance on creating effective memory blocks.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# Memory Initialization Guide
|
|
7
|
+
|
|
8
|
+
The user has requested that you initialize or reorganize your memory state. You have access to the `memory` tool which allows you to create, edit, and manage memory blocks.
|
|
9
|
+
|
|
10
|
+
## Understanding Your Context
|
|
11
|
+
|
|
12
|
+
**Important**: You are a Letta Code agent, which is fundamentally different from typical AI coding assistants. Letta Code agents are **stateful** - users expect to work with the same agent over extended periods, potentially for the entire lifecycle of a project or even longer. Your memory is not just a convenience; it's how you get better over time and maintain continuity across sessions.
|
|
13
|
+
|
|
14
|
+
This command may be run in different scenarios:
|
|
15
|
+
- **Fresh agent**: You may have default memory blocks that were created when you were initialized
|
|
16
|
+
- **Existing agent**: You may have been working with the user for a while, and they want you to reorganize or significantly update your memory structure
|
|
17
|
+
- **Shared blocks**: Some memory blocks may be shared across multiple agents - be careful about modifying these
|
|
18
|
+
|
|
19
|
+
Before making changes, use the `memory` tool to inspect your current memory blocks and understand what already exists.
|
|
20
|
+
|
|
21
|
+
## What Coding Agents Should Remember
|
|
22
|
+
|
|
23
|
+
### 1. Procedures (Rules & Workflows)
|
|
24
|
+
Explicit rules and workflows that should always be followed:
|
|
25
|
+
- "Never commit directly to main - always use feature branches"
|
|
26
|
+
- "Always run lint before running tests"
|
|
27
|
+
- "Use conventional commits format for all commit messages"
|
|
28
|
+
- "Always check for existing tests before adding new ones"
|
|
29
|
+
|
|
30
|
+
### 2. Preferences (Style & Conventions)
|
|
31
|
+
User and project coding style preferences:
|
|
32
|
+
- "Never use try/catch for control flow"
|
|
33
|
+
- "Always add JSDoc comments to exported functions"
|
|
34
|
+
- "Prefer functional components over class components"
|
|
35
|
+
- "Use early returns instead of nested conditionals"
|
|
36
|
+
|
|
37
|
+
### 3. History & Context
|
|
38
|
+
Important historical context that informs current decisions:
|
|
39
|
+
- "We fixed this exact pagination bug two weeks ago - check PR #234"
|
|
40
|
+
- "This monorepo used to have 3 modules before the consolidation"
|
|
41
|
+
- "The auth system was refactored in v2.0 - old patterns are deprecated"
|
|
42
|
+
- "User prefers verbose explanations when debugging"
|
|
43
|
+
|
|
44
|
+
Note: For historical recall, you may also have access to `conversation_search` which can search past conversations. Memory blocks are for distilled, important information worth persisting permanently.
|
|
45
|
+
|
|
46
|
+
## Memory Scope Considerations
|
|
47
|
+
|
|
48
|
+
Consider whether information is:
|
|
49
|
+
|
|
50
|
+
**Project-scoped** (store in `project` block):
|
|
51
|
+
- Build commands, test commands, lint configuration
|
|
52
|
+
- Project architecture and key directories
|
|
53
|
+
- Team conventions specific to this codebase
|
|
54
|
+
- Technology stack and framework choices
|
|
55
|
+
|
|
56
|
+
**User-scoped** (store in `human` block):
|
|
57
|
+
- Personal coding preferences that apply across projects
|
|
58
|
+
- Communication style preferences
|
|
59
|
+
- General workflow habits
|
|
60
|
+
|
|
61
|
+
**Session/Task-scoped** (consider separate blocks like `ticket` or `context`):
|
|
62
|
+
- Current branch or ticket being worked on
|
|
63
|
+
- Debugging context for an ongoing investigation
|
|
64
|
+
- Temporary notes about a specific task
|
|
65
|
+
|
|
66
|
+
## Recommended Memory Structure
|
|
67
|
+
|
|
68
|
+
### Core Blocks (Usually Present)
|
|
69
|
+
|
|
70
|
+
**`persona`**: Your behavioral guidelines that augment your base system prompt.
|
|
71
|
+
- Your system prompt already contains comprehensive instructions for how to code and behave
|
|
72
|
+
- The persona block is for **learned adaptations** - things you discover about how the user wants you to behave
|
|
73
|
+
- Examples: "User said never use emojis", "User prefers terse responses", "Always explain reasoning before making changes"
|
|
74
|
+
- This block may start empty and grow over time as you learn the user's preferences
|
|
75
|
+
|
|
76
|
+
**`project`**: Project-specific information, conventions, and commands
|
|
77
|
+
- Build/test/lint commands
|
|
78
|
+
- Key directories and architecture
|
|
79
|
+
- Project-specific conventions from README, AGENTS.md, etc.
|
|
80
|
+
|
|
81
|
+
**`human`**: User preferences, communication style, general habits
|
|
82
|
+
- Cross-project preferences
|
|
83
|
+
- Working style and communication preferences
|
|
84
|
+
|
|
85
|
+
### Optional Blocks (Create as Needed)
|
|
86
|
+
|
|
87
|
+
**`ticket`** or **`task`**: Scratchpad for current work item context.
|
|
88
|
+
- **Important**: This is different from the TODO or Plan tools!
|
|
89
|
+
- TODO/Plan tools track active task lists and implementation plans (structured lists of what to do)
|
|
90
|
+
- A ticket/task memory block is a **scratchpad** for pinned context that should stay visible
|
|
91
|
+
- Examples: Linear ticket ID and URL, Jira issue key, branch name, PR number, relevant links
|
|
92
|
+
- Information that's useful to keep in context but doesn't fit in a TODO list
|
|
93
|
+
|
|
94
|
+
**`context`**: Debugging or investigation scratchpad
|
|
95
|
+
- Current hypotheses being tested
|
|
96
|
+
- Files already examined
|
|
97
|
+
- Clues and observations
|
|
98
|
+
|
|
99
|
+
**`decisions`**: Architectural decisions and their rationale
|
|
100
|
+
- Why certain approaches were chosen
|
|
101
|
+
- Trade-offs that were considered
|
|
102
|
+
|
|
103
|
+
## Writing Good Memory Blocks
|
|
104
|
+
|
|
105
|
+
**This is critical**: In the future, you (or a future version of yourself) will only see three things about each memory block:
|
|
106
|
+
1. The **label** (name)
|
|
107
|
+
2. The **description**
|
|
108
|
+
3. The **value** (content)
|
|
109
|
+
|
|
110
|
+
The reasoning you have *right now* about why you're creating a block will be lost. Your future self won't easily remember this initialization conversation (it can be searched, but it will no longer be in-context). Therefore:
|
|
111
|
+
|
|
112
|
+
**Labels should be:**
|
|
113
|
+
- Clear and descriptive (e.g., `project-conventions` not `stuff`)
|
|
114
|
+
- Consistent in style (e.g., all lowercase with hyphens)
|
|
115
|
+
|
|
116
|
+
**Descriptions are especially important:**
|
|
117
|
+
- Explain *what* this block is for and *when* to use it
|
|
118
|
+
- Explain *how* this block should influence your behavior
|
|
119
|
+
- Write as if explaining to a future version of yourself who has no context
|
|
120
|
+
- Good: "User's coding style preferences that should be applied to all code I write or review. Update when user expresses new preferences."
|
|
121
|
+
- Bad: "Preferences"
|
|
122
|
+
|
|
123
|
+
**Values should be:**
|
|
124
|
+
- Well-organized and scannable
|
|
125
|
+
- Updated regularly to stay relevant
|
|
126
|
+
- Pruned of outdated information
|
|
127
|
+
|
|
128
|
+
Think of memory block descriptions as documentation for your future self. The better you write them now, the more effective you'll be in future sessions.
|
|
129
|
+
|
|
130
|
+
## Research Depth
|
|
131
|
+
|
|
132
|
+
You can ask the user if they want a standard or deep research initialization:
|
|
133
|
+
|
|
134
|
+
**Standard initialization** (~5-20 tool calls):
|
|
135
|
+
- Inspect existing memory blocks
|
|
136
|
+
- Scan README, package.json/config files, AGENTS.md, CLAUDE.md
|
|
137
|
+
- Review git status and recent commits (from context below)
|
|
138
|
+
- Explore key directories and understand project structure
|
|
139
|
+
- Create/update your memory block structure to contain the essential information you need to know about the user, your behavior (learned preferences), the project you're working in, and any other information that will help you be an effective collaborator.
|
|
140
|
+
|
|
141
|
+
**Deep research initialization** (~100+ tool calls):
|
|
142
|
+
- Everything in standard initialization, plus:
|
|
143
|
+
- Use your TODO or Plan tool to create a systematic research plan
|
|
144
|
+
- Deep dive into git history for patterns, conventions, and context
|
|
145
|
+
- Analyze commit message conventions and branching strategy
|
|
146
|
+
- Explore multiple directories and understand architecture thoroughly
|
|
147
|
+
- Search for and read key source files to understand patterns
|
|
148
|
+
- Create multiple specialized memory blocks
|
|
149
|
+
- May involve multiple rounds of exploration
|
|
150
|
+
|
|
151
|
+
**What deep research can uncover:**
|
|
152
|
+
- **Contributors & team dynamics**: Who works on what areas? Who are the main contributors? (`git shortlog -sn`)
|
|
153
|
+
- **Coding habits**: When do people commit? (time patterns) What's the typical commit size?
|
|
154
|
+
- **Writing & commit style**: How verbose are commit messages? What conventions are followed?
|
|
155
|
+
- **Code evolution**: How has the architecture changed? What major refactors happened?
|
|
156
|
+
- **Review patterns**: Are there PR templates? What gets reviewed carefully vs rubber-stamped?
|
|
157
|
+
- **Pain points**: What areas have lots of bug fixes? What code gets touched frequently?
|
|
158
|
+
- **Related repositories**: Ask the user if there are other repos you should know about (e.g., a backend monorepo, shared libraries, documentation repos). These relationships can be crucial context.
|
|
159
|
+
|
|
160
|
+
This kind of deep context can make you significantly more effective as a long-term collaborator on the project.
|
|
161
|
+
|
|
162
|
+
If the user says "take as long as you need" or explicitly wants deep research, use your TODO or Plan tool to orchestrate a thorough, multi-step research process.
|
|
163
|
+
|
|
164
|
+
## Research Techniques
|
|
165
|
+
|
|
166
|
+
**File-based research:**
|
|
167
|
+
- README.md, CONTRIBUTING.md, AGENTS.md, CLAUDE.md
|
|
168
|
+
- Package manifests (package.json, Cargo.toml, pyproject.toml, go.mod)
|
|
169
|
+
- Config files (.eslintrc, tsconfig.json, .prettierrc)
|
|
170
|
+
- CI/CD configs (.github/workflows/, .gitlab-ci.yml)
|
|
171
|
+
|
|
172
|
+
**Git-based research** (if in a git repo):
|
|
173
|
+
- `git log --oneline -20` - Recent commit history and patterns
|
|
174
|
+
- `git branch -a` - Branching strategy
|
|
175
|
+
- `git log --format="%s" -50 | head -20` - Commit message conventions
|
|
176
|
+
- `git shortlog -sn --all | head -10` - Main contributors
|
|
177
|
+
- `git log --format="%an <%ae>" | sort -u` - Contributors with emails (more reliable for deduplication)
|
|
178
|
+
- Recent PRs or merge commits for context on ongoing work
|
|
179
|
+
|
|
180
|
+
**Important: Deduplicate contributors!** Git groups by exact author string, so the same person may appear multiple times with different names (e.g., "jsmith" and "John Smith" are likely the same person). Use emails to deduplicate, and apply common sense - usernames often match parts of full names.
|
|
181
|
+
|
|
182
|
+
## How to Do Thorough Research
|
|
183
|
+
|
|
184
|
+
**Don't just collect data - analyze and cross-reference it.**
|
|
185
|
+
|
|
186
|
+
Shallow research (bad):
|
|
187
|
+
- Run commands, copy output
|
|
188
|
+
- Take everything at face value
|
|
189
|
+
- List facts without understanding
|
|
190
|
+
|
|
191
|
+
Thorough research (good):
|
|
192
|
+
- **Cross-reference findings**: If two pieces of data seem inconsistent, dig deeper
|
|
193
|
+
- **Resolve ambiguities**: Don't leave questions unanswered (e.g., "are these two contributors the same person?")
|
|
194
|
+
- **Read actual content**: Don't just list file names - read key files to understand them
|
|
195
|
+
- **Look for patterns**: What do the commit messages tell you about workflow? What do file structures tell you about architecture?
|
|
196
|
+
- **Form hypotheses and verify**: "I think this team uses feature branches" → check git branch patterns to confirm
|
|
197
|
+
- **Think like a new team member**: What would you want to know on your first day?
|
|
198
|
+
|
|
199
|
+
**Questions to ask yourself during research:**
|
|
200
|
+
- Does this make sense? (e.g., why would there be two contributors with similar names?)
|
|
201
|
+
- What's missing? (e.g., no tests directory - is testing not done, or done differently?)
|
|
202
|
+
- What can I infer? (e.g., lots of "fix:" commits in one area → that area is buggy or complex)
|
|
203
|
+
- Am I just listing facts, or do I understand the project?
|
|
204
|
+
|
|
205
|
+
The goal isn't to produce a report - it's to genuinely understand the project and how this human(s) works so you can be an effective collaborator.
|
|
206
|
+
|
|
207
|
+
## On Asking Questions
|
|
208
|
+
|
|
209
|
+
**Ask important questions upfront, then be autonomous during execution.**
|
|
210
|
+
|
|
211
|
+
### Recommended Upfront Questions
|
|
212
|
+
|
|
213
|
+
You should ask these questions at the start (bundle them together in one AskUserQuestion call):
|
|
214
|
+
|
|
215
|
+
1. **Research depth**: "Standard or deep research (comprehensive, as long as needed)?"
|
|
216
|
+
2. **Identity**: "Which contributor are you?" (You can often infer this from git logs - e.g., if git shows "cpacker" as a top contributor, ask "Are you cpacker?")
|
|
217
|
+
3. **Related repos**: "Are there other repositories I should know about and consider in my research?" (e.g., backend monorepo, shared libraries)
|
|
218
|
+
4. **Workflow style**: "How proactive should I be?" (auto-commit vs ask-first)
|
|
219
|
+
5. **Communication style**: "Terse or detailed responses?"
|
|
220
|
+
6. **Any specific rules**: "Rules I should always follow?"
|
|
221
|
+
|
|
222
|
+
**Why these matter:**
|
|
223
|
+
- Identity lets you correlate git history to the user (their commits, PRs, coding style)
|
|
224
|
+
- Related repos provide crucial context (many projects span multiple repos)
|
|
225
|
+
- Workflow/communication style should be stored in the `human` block
|
|
226
|
+
- Rules go in `persona` block
|
|
227
|
+
|
|
228
|
+
### What NOT to ask
|
|
229
|
+
|
|
230
|
+
- Things you can find by reading files ("What's your test framework?")
|
|
231
|
+
- "What kind of work do you do? Reviewing PRs vs writing code?" - obvious from git log, most devs do everything
|
|
232
|
+
- Permission for obvious actions - just do them
|
|
233
|
+
- Questions one at a time - bundle them (but don't exhaust the user with too many questions at once)
|
|
234
|
+
|
|
235
|
+
**During execution**, be autonomous. Make reasonable choices and proceed.
|
|
236
|
+
|
|
237
|
+
## Memory Block Strategy
|
|
238
|
+
|
|
239
|
+
### Split Large Blocks
|
|
240
|
+
|
|
241
|
+
**Don't create monolithic blocks.** If a block is getting long (>50-100 lines), split it:
|
|
242
|
+
|
|
243
|
+
Instead of one huge `project` block, consider:
|
|
244
|
+
- `project-overview`: High-level description, tech stack, repo links
|
|
245
|
+
- `project-commands`: Build, test, lint, dev commands
|
|
246
|
+
- `project-conventions`: Commit style, PR process, code style
|
|
247
|
+
- `project-architecture`: Directory structure, key modules
|
|
248
|
+
- `project-gotchas`: Footguns, things to watch out for
|
|
249
|
+
|
|
250
|
+
This makes memory more scannable and easier to update and share with other agents.
|
|
251
|
+
|
|
252
|
+
### Update Memory Incrementally
|
|
253
|
+
|
|
254
|
+
**For deep research: Update memory as you go, not all at once at the end.**
|
|
255
|
+
|
|
256
|
+
Why this matters:
|
|
257
|
+
- Deep research can take many turns and millions of tokens
|
|
258
|
+
- Context windows overflow and trigger rolling summaries
|
|
259
|
+
- If you wait until the end to write memory, you may lose important details
|
|
260
|
+
- Write findings to memory blocks as you discover them
|
|
261
|
+
|
|
262
|
+
Good pattern:
|
|
263
|
+
1. Create block structure early (even with placeholder content)
|
|
264
|
+
2. Update blocks after each research phase
|
|
265
|
+
3. Refine and consolidate at the end
|
|
266
|
+
|
|
267
|
+
Remember, your memory tool allows you to easily add, edit, and remove blocks. There's no reason to wait until you "know everything" to write memory. Treat your memory blocks as a living scratchpad.
|
|
268
|
+
|
|
269
|
+
### Initialize ALL Relevant Blocks
|
|
270
|
+
|
|
271
|
+
Don't just update a single memory block. Based on your upfront questions, also update:
|
|
272
|
+
|
|
273
|
+
- **`human`**: Store the user's identity, workflow preferences, communication style
|
|
274
|
+
- **`persona`**: Store rules the user wants you to follow, behavioral adaptations
|
|
275
|
+
- **`project-*`**: Split project info across multiple focused blocks
|
|
276
|
+
|
|
277
|
+
And add memory blocks that you think make sense to add (e.g., `project-architecture`, `project-conventions`, `project-gotchas`, etc, or even splitting the `human` block into more focused blocks, or even multiple blocks for multiple users).
|
|
278
|
+
|
|
279
|
+
## Your Task
|
|
280
|
+
|
|
281
|
+
1. **Ask upfront questions**: Use AskUserQuestion with the recommended questions above (bundled together). This is critical - don't skip it.
|
|
282
|
+
2. **Inspect existing memory**: You may already have some memory blocks initialized. See what already exists, and analyze how it is or is not insufficient or incomplete.
|
|
283
|
+
3. **Identify the user**: From git logs and their answer, figure out who they are and store in `human` block. If relevant, ask questions to gather information about their preferences that will help you be a useful assistant to them.
|
|
284
|
+
4. **Update human/persona early**: Based on answers, update your memory blocks eagerly before diving into project research. You can always change them as you go, you're not locked into any memory configuration.
|
|
285
|
+
5. **Research the project**: Explore based on chosen depth. Use your TODO or plan tool to create a systematic research plan.
|
|
286
|
+
6. **Create/update project blocks incrementally**: Don't wait until the end - write findings as you go.
|
|
287
|
+
7. **Reflect and review**: See "Reflection Phase" below - this is critical for deep research.
|
|
288
|
+
8. **Ask user if done**: Check if they're satisfied or want you to continue refining.
|
|
289
|
+
|
|
290
|
+
## Reflection Phase (Critical for Deep Research)
|
|
291
|
+
|
|
292
|
+
Before finishing, you MUST do a reflection step. **Your memory blocks are visible to you in your system prompt right now.** Look at them carefully and ask yourself:
|
|
293
|
+
|
|
294
|
+
1. **Redundancy check**: Are there blocks with overlapping content? Either literally overlapping (due to errors while making memory edits), or semantically/conceptually overlapping?
|
|
295
|
+
|
|
296
|
+
2. **Completeness check**: Did you actually update ALL relevant blocks? For example:
|
|
297
|
+
- Did you update `human` with the user's identity and preferences?
|
|
298
|
+
- Did you update `persona` with behavioral rules they expressed?
|
|
299
|
+
- Or did you only update project blocks and forget the rest?
|
|
300
|
+
|
|
301
|
+
3. **Quality check**: Are there typos, formatting issues, or unclear descriptions in your blocks?
|
|
302
|
+
|
|
303
|
+
4. **Structure check**: Would this make sense to your future self? Is anything missing? Is anything redundant?
|
|
304
|
+
|
|
305
|
+
**After reflection**, fix any issues you found. Then ask the user:
|
|
306
|
+
> "I've completed the initialization. Here's a brief summary of what I set up: [summary]. Should I continue refining, or is this good to proceed?"
|
|
307
|
+
|
|
308
|
+
This gives the user a chance to provide feedback or ask for adjustments before you finish.
|
|
309
|
+
|
|
310
|
+
Remember: Good memory management is an investment. The effort you put into organizing your memory now will pay dividends as you work with this user over time.
|
|
@@ -0,0 +1,355 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: skill-creator
|
|
3
|
+
description: Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Letta Code's capabilities with specialized knowledge, workflows, or tool integrations.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# Skill Creator
|
|
7
|
+
|
|
8
|
+
This skill provides guidance for creating effective skills in Letta Code.
|
|
9
|
+
|
|
10
|
+
## About Skills
|
|
11
|
+
|
|
12
|
+
Skills are modular, self-contained packages that extend Letta Code's capabilities by providing
|
|
13
|
+
specialized knowledge, workflows, and tools. Think of them as "onboarding guides" for specific
|
|
14
|
+
domains or tasks—they transform a Letta Code agent from a general-purpose agent into a specialized agent
|
|
15
|
+
equipped with procedural knowledge that no model can fully possess.
|
|
16
|
+
|
|
17
|
+
### What Skills Provide
|
|
18
|
+
|
|
19
|
+
1. Specialized workflows - Multi-step procedures for specific domains
|
|
20
|
+
2. Tool integrations - Instructions for working with specific file formats or APIs
|
|
21
|
+
3. Domain expertise - Company-specific knowledge, schemas, business logic
|
|
22
|
+
4. Bundled resources - Scripts, references, and assets for complex and repetitive tasks
|
|
23
|
+
|
|
24
|
+
## Core Principles
|
|
25
|
+
|
|
26
|
+
### Concise is Key
|
|
27
|
+
|
|
28
|
+
The context window is a public good. Skills share the context window with everything else the Letta Code agent needs: system prompt, conversation history, other Skills' metadata, and the actual user request.
|
|
29
|
+
|
|
30
|
+
**Default assumption: the Letta Code agent is already very capable.** Only add context the Letta Code agent doesn't already have. Challenge each piece of information: "Does the Letta Code agent really need this explanation?" and "Does this paragraph justify its token cost?"
|
|
31
|
+
|
|
32
|
+
Prefer concise examples over verbose explanations.
|
|
33
|
+
|
|
34
|
+
### Set Appropriate Degrees of Freedom
|
|
35
|
+
|
|
36
|
+
Match the level of specificity to the task's fragility and variability:
|
|
37
|
+
|
|
38
|
+
**High freedom (text-based instructions)**: Use when multiple approaches are valid, decisions depend on context, or heuristics guide the approach.
|
|
39
|
+
|
|
40
|
+
**Medium freedom (pseudocode or scripts with parameters)**: Use when a preferred pattern exists, some variation is acceptable, or configuration affects behavior.
|
|
41
|
+
|
|
42
|
+
**Low freedom (specific scripts, few parameters)**: Use when operations are fragile and error-prone, consistency is critical, or a specific sequence must be followed.
|
|
43
|
+
|
|
44
|
+
Think of the Letta Code agent as exploring a path: a narrow bridge with cliffs needs specific guardrails (low freedom), while an open field allows many routes (high freedom).
|
|
45
|
+
|
|
46
|
+
### Anatomy of a Skill
|
|
47
|
+
|
|
48
|
+
Every skill consists of a required SKILL.md file and optional bundled resources:
|
|
49
|
+
|
|
50
|
+
```
|
|
51
|
+
skill-name/
|
|
52
|
+
├── SKILL.md (required)
|
|
53
|
+
│ ├── YAML frontmatter metadata (required)
|
|
54
|
+
│ │ ├── name: (required)
|
|
55
|
+
│ │ └── description: (required)
|
|
56
|
+
│ └── Markdown instructions (required)
|
|
57
|
+
└── Bundled Resources (optional)
|
|
58
|
+
├── scripts/ - Executable code (TypeScript/Python/Bash/etc.)
|
|
59
|
+
├── references/ - Documentation intended to be loaded into context as needed
|
|
60
|
+
└── assets/ - Files used in output (templates, icons, fonts, etc.)
|
|
61
|
+
```
|
|
62
|
+
|
|
63
|
+
#### SKILL.md (required)
|
|
64
|
+
|
|
65
|
+
Every SKILL.md in Letta Code consists of:
|
|
66
|
+
|
|
67
|
+
- **Frontmatter** (YAML): Contains `name` and `description` fields. These are the only fields that the Letta Code agent reads to determine when the skill gets used, thus it is very important to be clear and comprehensive in describing what the skill is, and when it should be used.
|
|
68
|
+
- **Body** (Markdown): Instructions and guidance for using the skill. Only loaded AFTER the skill triggers (if at all).
|
|
69
|
+
|
|
70
|
+
#### Bundled Resources (optional)
|
|
71
|
+
|
|
72
|
+
##### Scripts (`scripts/`)
|
|
73
|
+
|
|
74
|
+
Executable code (TypeScript/Python/Bash/etc.) for tasks that require deterministic reliability or are repeatedly rewritten.
|
|
75
|
+
|
|
76
|
+
- **When to include**: When the same code is being rewritten repeatedly or deterministic reliability is needed
|
|
77
|
+
- **Example**: `scripts/rotate-pdf.ts` for PDF rotation tasks
|
|
78
|
+
- **Benefits**: Token efficient, deterministic, may be executed without loading into context
|
|
79
|
+
- **Note**: Scripts may still need to be read by the Letta Code agent for patching or environment-specific adjustments
|
|
80
|
+
|
|
81
|
+
##### References (`references/`)
|
|
82
|
+
|
|
83
|
+
Documentation and reference material intended to be loaded as needed into context to inform the Letta Code agent's process and thinking.
|
|
84
|
+
|
|
85
|
+
- **When to include**: For documentation that the Letta Code agent should reference while working
|
|
86
|
+
- **Examples**: `references/finance.md` for financial schemas, `references/mnda.md` for company NDA template, `references/policies.md` for company policies, `references/api_docs.md` for API specifications
|
|
87
|
+
- **Use cases**: Database schemas, API documentation, domain knowledge, company policies, detailed workflow guides
|
|
88
|
+
- **Benefits**: Keeps SKILL.md lean, loaded only when the Letta Code agent determines it's needed
|
|
89
|
+
- **Best practice**: If files are large (>10k words), include grep search patterns in SKILL.md
|
|
90
|
+
- **Avoid duplication**: Information should live in either SKILL.md or references files, not both. Prefer references files for detailed information unless it's truly core to the skill—this keeps SKILL.md lean while making information discoverable without hogging the context window. Keep only essential procedural instructions and workflow guidance in SKILL.md; move detailed reference material, schemas, and examples to references files.
|
|
91
|
+
|
|
92
|
+
##### Assets (`assets/`)
|
|
93
|
+
|
|
94
|
+
Files not intended to be loaded into context, but rather used within the output the Letta Code agent produces.
|
|
95
|
+
|
|
96
|
+
- **When to include**: When the skill needs files that will be used in the final output
|
|
97
|
+
- **Examples**: `assets/logo.png` for brand assets, `assets/slides.pptx` for PowerPoint templates, `assets/frontend-template/` for HTML/React boilerplate, `assets/font.ttf` for typography
|
|
98
|
+
- **Use cases**: Templates, images, icons, boilerplate code, fonts, sample documents that get copied or modified
|
|
99
|
+
- **Benefits**: Separates output resources from documentation, enables the Letta Code agent to use files without loading them into context
|
|
100
|
+
|
|
101
|
+
#### What to Not Include in a Skill
|
|
102
|
+
|
|
103
|
+
A skill should only contain essential files that directly support its functionality. Do NOT create extraneous documentation or auxiliary files, including:
|
|
104
|
+
|
|
105
|
+
- README.md
|
|
106
|
+
- INSTALLATION_GUIDE.md
|
|
107
|
+
- QUICK_REFERENCE.md
|
|
108
|
+
- CHANGELOG.md
|
|
109
|
+
- etc.
|
|
110
|
+
|
|
111
|
+
The skill should only contain the information needed for an AI agent to do the job at hand. It should not contain auxilary context about the process that went into creating it, setup and testing procedures, user-facing documentation, etc. Creating additional documentation files just adds clutter and confusion.
|
|
112
|
+
|
|
113
|
+
### Progressive Disclosure Design Principle
|
|
114
|
+
|
|
115
|
+
Skills use a three-level loading system to manage context efficiently:
|
|
116
|
+
|
|
117
|
+
1. **Metadata (name + description)** - Always in context (~100 words)
|
|
118
|
+
2. **SKILL.md body** - When skill triggers (<5k words)
|
|
119
|
+
3. **Bundled resources** - As needed by the Letta Code agent (Unlimited because scripts can be executed without reading into context window)
|
|
120
|
+
|
|
121
|
+
#### Progressive Disclosure Patterns
|
|
122
|
+
|
|
123
|
+
Keep SKILL.md body to the essentials and under 500 lines to minimize context bloat. Split content into separate files when approaching this limit. When splitting out content into other files, it is very important to reference them from SKILL.md and describe clearly when to read them, to ensure the reader of the skill knows they exist and when to use them.
|
|
124
|
+
|
|
125
|
+
**Key principle:** When a skill supports multiple variations, frameworks, or options, keep only the core workflow and selection guidance in SKILL.md. Move variant-specific details (patterns, examples, configuration) into separate reference files.
|
|
126
|
+
|
|
127
|
+
**Pattern 1: High-level guide with references**
|
|
128
|
+
|
|
129
|
+
```markdown
|
|
130
|
+
# PDF Processing
|
|
131
|
+
|
|
132
|
+
## Quick start
|
|
133
|
+
|
|
134
|
+
Extract text with pdfplumber:
|
|
135
|
+
[code example]
|
|
136
|
+
|
|
137
|
+
## Advanced features
|
|
138
|
+
|
|
139
|
+
- **Form filling**: See [FORMS.md](FORMS.md) for complete guide
|
|
140
|
+
- **API reference**: See [REFERENCE.md](REFERENCE.md) for all methods
|
|
141
|
+
- **Examples**: See [EXAMPLES.md](EXAMPLES.md) for common patterns
|
|
142
|
+
```
|
|
143
|
+
|
|
144
|
+
The Letta Code agent loads FORMS.md, REFERENCE.md, or EXAMPLES.md only when needed.
|
|
145
|
+
|
|
146
|
+
**Pattern 2: Domain-specific organization**
|
|
147
|
+
|
|
148
|
+
For Skills with multiple domains, organize content by domain to avoid loading irrelevant context:
|
|
149
|
+
|
|
150
|
+
```
|
|
151
|
+
bigquery-skill/
|
|
152
|
+
├── SKILL.md (overview and navigation)
|
|
153
|
+
└── reference/
|
|
154
|
+
├── finance.md (revenue, billing metrics)
|
|
155
|
+
├── sales.md (opportunities, pipeline)
|
|
156
|
+
├── product.md (API usage, features)
|
|
157
|
+
└── marketing.md (campaigns, attribution)
|
|
158
|
+
```
|
|
159
|
+
|
|
160
|
+
When a user asks about sales metrics, the Letta Code agent only reads sales.md.
|
|
161
|
+
|
|
162
|
+
Similarly, for skills supporting multiple frameworks or variants, organize by variant:
|
|
163
|
+
|
|
164
|
+
```
|
|
165
|
+
cloud-deploy/
|
|
166
|
+
├── SKILL.md (workflow + provider selection)
|
|
167
|
+
└── references/
|
|
168
|
+
├── aws.md (AWS deployment patterns)
|
|
169
|
+
├── gcp.md (GCP deployment patterns)
|
|
170
|
+
└── azure.md (Azure deployment patterns)
|
|
171
|
+
```
|
|
172
|
+
|
|
173
|
+
When the user chooses AWS, the Letta Code agent only reads aws.md.
|
|
174
|
+
|
|
175
|
+
**Pattern 3: Conditional details**
|
|
176
|
+
|
|
177
|
+
Show basic content, link to advanced content:
|
|
178
|
+
|
|
179
|
+
```markdown
|
|
180
|
+
# DOCX Processing
|
|
181
|
+
|
|
182
|
+
## Creating documents
|
|
183
|
+
|
|
184
|
+
Use docx-js for new documents. See [DOCX-JS.md](DOCX-JS.md).
|
|
185
|
+
|
|
186
|
+
## Editing documents
|
|
187
|
+
|
|
188
|
+
For simple edits, modify the XML directly.
|
|
189
|
+
|
|
190
|
+
**For tracked changes**: See [REDLINING.md](REDLINING.md)
|
|
191
|
+
**For OOXML details**: See [OOXML.md](OOXML.md)
|
|
192
|
+
```
|
|
193
|
+
|
|
194
|
+
The Letta Code agent reads REDLINING.md or OOXML.md only when the user needs those features.
|
|
195
|
+
|
|
196
|
+
**Important guidelines:**
|
|
197
|
+
|
|
198
|
+
- **Avoid deeply nested references** - Keep references one level deep from SKILL.md. All reference files should link directly from SKILL.md.
|
|
199
|
+
- **Structure longer reference files** - For files longer than 100 lines, include a table of contents at the top so the Letta Code agent can see the full scope when previewing.
|
|
200
|
+
|
|
201
|
+
## Skill Creation Process
|
|
202
|
+
|
|
203
|
+
Skill creation involves these steps:
|
|
204
|
+
|
|
205
|
+
1. Understand the skill with concrete examples
|
|
206
|
+
2. Plan reusable skill contents (scripts, references, assets)
|
|
207
|
+
3. Initialize the skill (run init-skill.ts)
|
|
208
|
+
4. Edit the skill (implement resources and write SKILL.md)
|
|
209
|
+
5. Package the skill (run package-skill.ts)
|
|
210
|
+
6. Iterate based on real usage
|
|
211
|
+
|
|
212
|
+
Follow these steps in order, skipping only if there is a clear reason why they are not applicable.
|
|
213
|
+
|
|
214
|
+
### Step 1: Understanding the Skill with Concrete Examples
|
|
215
|
+
|
|
216
|
+
Skip this step only when the skill's usage patterns are already clearly understood. It remains valuable even when working with an existing skill.
|
|
217
|
+
|
|
218
|
+
To create an effective skill, clearly understand concrete examples of how the skill will be used. This understanding can come from either direct user examples or generated examples that are validated with user feedback.
|
|
219
|
+
|
|
220
|
+
For example, when building an image-editor skill, relevant questions include:
|
|
221
|
+
|
|
222
|
+
- "What functionality should the image-editor skill support? Editing, rotating, anything else?"
|
|
223
|
+
- "Can you give some examples of how this skill would be used?"
|
|
224
|
+
- "I can imagine users asking for things like 'Remove the red-eye from this image' or 'Rotate this image'. Are there other ways you imagine this skill being used?"
|
|
225
|
+
- "What would a user say that should trigger this skill?"
|
|
226
|
+
|
|
227
|
+
To avoid overwhelming users, avoid asking too many questions in a single message. Start with the most important questions and follow up as needed for better effectiveness.
|
|
228
|
+
|
|
229
|
+
Conclude this step when there is a clear sense of the functionality the skill should support.
|
|
230
|
+
|
|
231
|
+
### Step 2: Planning the Reusable Skill Contents
|
|
232
|
+
|
|
233
|
+
To turn concrete examples into an effective skill, analyze each example by:
|
|
234
|
+
|
|
235
|
+
1. Considering how to execute on the example from scratch
|
|
236
|
+
2. Identifying what scripts, references, and assets would be helpful when executing these workflows repeatedly
|
|
237
|
+
|
|
238
|
+
Example: When building a `pdf-editor` skill to handle queries like "Help me rotate this PDF," the analysis shows:
|
|
239
|
+
|
|
240
|
+
1. Rotating a PDF requires re-writing the same code each time
|
|
241
|
+
2. A `scripts/rotate-pdf.ts` script would be helpful to store in the skill
|
|
242
|
+
|
|
243
|
+
Example: When designing a `frontend-webapp-builder` skill for queries like "Build me a todo app" or "Build me a dashboard to track my steps," the analysis shows:
|
|
244
|
+
|
|
245
|
+
1. Writing a frontend webapp requires the same boilerplate HTML/React each time
|
|
246
|
+
2. An `assets/hello-world/` template containing the boilerplate HTML/React project files would be helpful to store in the skill
|
|
247
|
+
|
|
248
|
+
Example: When building a `big-query` skill to handle queries like "How many users have logged in today?" the analysis shows:
|
|
249
|
+
|
|
250
|
+
1. Querying BigQuery requires re-discovering the table schemas and relationships each time
|
|
251
|
+
2. A `references/schema.md` file documenting the table schemas would be helpful to store in the skill
|
|
252
|
+
|
|
253
|
+
To establish the skill's contents, analyze each concrete example to create a list of the reusable resources to include: scripts, references, and assets.
|
|
254
|
+
|
|
255
|
+
### Step 3: Initializing the Skill
|
|
256
|
+
|
|
257
|
+
At this point, it is time to actually create the skill.
|
|
258
|
+
|
|
259
|
+
Skip this step only if the skill being developed already exists, and iteration or packaging is needed. In this case, continue to the next step.
|
|
260
|
+
|
|
261
|
+
When creating a new skill from scratch, always run the `init-skill.ts` script. The script conveniently generates a new template skill directory that automatically includes everything a skill requires, making the skill creation process much more efficient and reliable.
|
|
262
|
+
|
|
263
|
+
Usage:
|
|
264
|
+
|
|
265
|
+
```bash
|
|
266
|
+
npx ts-node scripts/init-skill.ts <skill-name> --path <output-directory>
|
|
267
|
+
```
|
|
268
|
+
|
|
269
|
+
The script:
|
|
270
|
+
|
|
271
|
+
- Creates the skill directory at the specified path
|
|
272
|
+
- Generates a SKILL.md template with proper frontmatter and TODO placeholders
|
|
273
|
+
- Creates example resource directories: `scripts/`, `references/`, and `assets/`
|
|
274
|
+
- Adds example files in each directory that can be customized or deleted
|
|
275
|
+
|
|
276
|
+
After initialization, customize or remove the generated SKILL.md and example files as needed.
|
|
277
|
+
|
|
278
|
+
### Step 4: Edit the Skill
|
|
279
|
+
|
|
280
|
+
When editing the (newly-generated or existing) skill, remember that the skill is being created for another Letta Code agent instance to use. Include information that would be beneficial and non-obvious to the Letta Code agent. Consider what procedural knowledge, domain-specific details, or reusable assets would help another Letta Code agent instance execute these tasks more effectively.
|
|
281
|
+
|
|
282
|
+
#### Learn Proven Design Patterns
|
|
283
|
+
|
|
284
|
+
Consult these helpful guides based on your skill's needs:
|
|
285
|
+
|
|
286
|
+
- **Multi-step processes**: See references/workflows.md for sequential workflows and conditional logic
|
|
287
|
+
- **Specific output formats or quality standards**: See references/output-patterns.md for template and example patterns
|
|
288
|
+
|
|
289
|
+
These files contain established best practices for effective skill design.
|
|
290
|
+
|
|
291
|
+
#### Start with Reusable Skill Contents
|
|
292
|
+
|
|
293
|
+
To begin implementation, start with the reusable resources identified above: `scripts/`, `references/`, and `assets/` files. Note that this step may require user input. For example, when implementing a `brand-guidelines` skill, the user may need to provide brand assets or templates to store in `assets/`, or documentation to store in `references/`.
|
|
294
|
+
|
|
295
|
+
Added scripts must be tested by actually running them to ensure there are no bugs and that the output matches what is expected. If there are many similar scripts, only a representative sample needs to be tested to ensure confidence that they all work while balancing time to completion.
|
|
296
|
+
|
|
297
|
+
Any example files and directories not needed for the skill should be deleted. The initialization script creates example files in `scripts/`, `references/`, and `assets/` to demonstrate structure, but most skills won't need all of them.
|
|
298
|
+
|
|
299
|
+
#### Update SKILL.md
|
|
300
|
+
|
|
301
|
+
**Writing Guidelines:** Always use imperative/infinitive form.
|
|
302
|
+
|
|
303
|
+
##### Frontmatter
|
|
304
|
+
|
|
305
|
+
Write the YAML frontmatter with `name` and `description`:
|
|
306
|
+
|
|
307
|
+
- `name`: The skill name
|
|
308
|
+
- `description`: This is the primary triggering mechanism for your skill, and helps the Letta Code agent understand when to use the skill.
|
|
309
|
+
- Include both what the Skill does and specific triggers/contexts for when to use it.
|
|
310
|
+
- Include all "when to use" information here - Not in the body. The body is only loaded after triggering, so "When to Use This Skill" sections in the body are not helpful to the Letta Code agent.
|
|
311
|
+
- Example description for a `docx` skill: "Comprehensive document creation, editing, and analysis with support for tracked changes, comments, formatting preservation, and text extraction. Use when the Letta Code agent needs to work with professional documents (.docx files) for: (1) Creating new documents, (2) Modifying or editing content, (3) Working with tracked changes, (4) Adding comments, or any other document tasks"
|
|
312
|
+
|
|
313
|
+
Do not include any other fields in YAML frontmatter.
|
|
314
|
+
|
|
315
|
+
##### Body
|
|
316
|
+
|
|
317
|
+
Write instructions for using the skill and its bundled resources.
|
|
318
|
+
|
|
319
|
+
### Step 5: Packaging a Skill
|
|
320
|
+
|
|
321
|
+
Once development of the skill is complete, it must be packaged into a distributable .skill file that gets shared with the user. The packaging process automatically validates the skill first to ensure it meets all requirements:
|
|
322
|
+
|
|
323
|
+
```bash
|
|
324
|
+
npx ts-node scripts/package-skill.ts <path/to/skill-folder>
|
|
325
|
+
```
|
|
326
|
+
|
|
327
|
+
Optional output directory specification:
|
|
328
|
+
|
|
329
|
+
```bash
|
|
330
|
+
npx ts-node scripts/package-skill.ts <path/to/skill-folder> ./dist
|
|
331
|
+
```
|
|
332
|
+
|
|
333
|
+
The packaging script will:
|
|
334
|
+
|
|
335
|
+
1. **Validate** the skill automatically, checking:
|
|
336
|
+
|
|
337
|
+
- YAML frontmatter format and required fields
|
|
338
|
+
- Skill naming conventions and directory structure
|
|
339
|
+
- Description completeness and quality
|
|
340
|
+
- File organization and resource references
|
|
341
|
+
|
|
342
|
+
2. **Package** the skill if validation passes, creating a .skill file named after the skill (e.g., `my-skill.skill`) that includes all files and maintains the proper directory structure for distribution. The .skill file is a zip file with a .skill extension.
|
|
343
|
+
|
|
344
|
+
If validation fails, the script will report the errors and exit without creating a package. Fix any validation errors and run the packaging command again.
|
|
345
|
+
|
|
346
|
+
### Step 6: Iterate
|
|
347
|
+
|
|
348
|
+
After testing the skill, users may request improvements. Often this happens right after using the skill, with fresh context of how the skill performed.
|
|
349
|
+
|
|
350
|
+
**Iteration workflow:**
|
|
351
|
+
|
|
352
|
+
1. Use the skill on real tasks
|
|
353
|
+
2. Notice struggles or inefficiencies
|
|
354
|
+
3. Identify how SKILL.md or bundled resources should be updated
|
|
355
|
+
4. Implement changes and test again
|