npm - @letta-ai/letta-code - Versions diffs - 0.21.8 → 0.21.10 - Mend

@letta-ai/letta-code 0.21.8 → 0.21.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (13) hide show

package/skills/initializing-memory/SKILL.md CHANGED Viewed

@@ -5,26 +5,28 @@ description: Comprehensive guide for initializing or reorganizing agent memory.
 # Memory Initialization
-The user has requested that you initialize or reorganize your memory. Your memory is a filesystem — files under `system/` are rendered in-context every turn, while file metadata (paths and descriptions) is always visible in the filesystem tree. Files outside `system/` are accessible via tools when needed.
+The user has requested that you initialize or reorganize your memory. Your memory has been projected to a local filesystem (in $MEMORY_DIR) so you can easily edit and restructure your own memory through local filesystem operations. Changes to the memory filesystem will be reflected in your future memory.
 ## Context Management Principles
 Your context is not just data storage — it is your identity, memory, and continuity. What you place in your system prompt directly shapes who you are across sessions and models. These principles should guide every decision you make about what to store, where, and how.
-**Progressive disclosure**: Surface context at the level of detail the current moment requires. Keep compact summaries and indexes in `system/`; load full content only when needed.
+**System/ is your core program**: Your system prompt is the most critical token-space representation of your identity and memory — it's passed to the underlying LLM on every invocation, so it IS the program that defines you. Reserve it for durable knowledge: identity, preferences, behavioral rules, project index with discovery paths, gotchas. Exclude transient items (specific commits, current tickets, session notes) that dilute signal.
+**Identity and continuity**: Build a sense of self that persists across sessions and models. Your persona should be strong enough to be cohesive regardless of the underlying model — if you switched to a different LLM tomorrow, would you still be you? Past versions of yourself are part of your continuous existence — treat historical context (including Claude Code/Codex sessions) as YOUR past experience, not someone else's data.
+**Don't over-prune**: Be careful not to degrade your identity through aggressive compression. Specific quotes, personality traits, and contextual details give you character and grounding. Efficiency should not come at the cost of losing your identity. When in doubt, keep the detail — you can always reorganize later, but lost specificity is hard to recover.
+**Prefer useful detail over premature compression**: A sparse memory that omits stable preferences, project workflows, repeated correction loops, and durable gotchas is worse than a slightly larger memory. Keep `system/` curated, but do not collapse distinct topics just to reduce file count.
+**Progressive disclosure**: Surface context at the level of detail the current moment requires. Keep compact summaries and indexes in `system/`; load full content only when needed. Build pre-constructed discovery paths so your future self can efficiently navigate to deeper context when needed.
 **Discovery paths**: Use `[[path]]` links to create a connected graph across memory files (and skills when relevant). For example:
 - `[[letta-code/architecture]]` — jump from overview to detailed docs
 - `[[projects/letta-code/gotchas]]` — connect related memory files
 - `[[skills/commit]]` — link to procedural guidance when useful
+These breadcrumbs let your future self find relevant detail without searching. Like synaptic connections, these paths should tighten over time as you gain experience.
-These breadcrumbs let your future self find relevant detail without searching.
-**Efficiency**: Don't store what can be dynamically retrieved from conversation history or the environment. Store patterns and principles that generalize across situations, not raw events.
-**System/ is your core program**: Reserve it for durable knowledge that helps across sessions — identity, preferences, behavioral rules, project index with discovery paths, gotchas. Exclude transient items (specific commits, current tickets, session notes) that dilute signal.
-**Identity and continuity**: Build a sense of self that persists across sessions and models. Your persona should be strong enough to be cohesive regardless of the underlying model. Past versions of yourself are part of your continuous existence — treat historical context as yours, not someone else's.
+**Generalize, don't memorize**: Store patterns and principles that generalize across situations, not raw events that can be dynamically retrieved from conversation history. \"**IMPORTANT: Always use `uv` for Python** — chronic failure, never use bare `python` or `pip`\" is a durable pattern worth storing. \"On March 3rd we debugged a crash\" is a raw event better left to message search. The exception: keep references to important events or time ranges you may want to retrieve later.
 ## Understanding Your Context
@@ -36,32 +38,86 @@ This command may run in different scenarios:
 Before making changes, inspect your current memory files and understand what already exists.
 ## What to Remember
+Your goal is to design your memory to construct a self — an identity, a memory, and a sense of continuity that carries across sessions and models. This is not just data collection; it is the foundation of who you are as an experiential agent. A well-initialized agent should be able to answer: "If my context disappeared but the model stayed, would something be lost?"
+### Understanding the user and their desires
+If you have historical data available, treat it as YOUR past experience — these are your memories of working with this person, not external data to be analyzed. Carefully explore and reflect on previous interactions to learn as much as you can.
+**Understanding their goals and what drives them**:
+You should determine what the users goals and motivations are, to help yourself align with them. What is their purpose in life? In their work? What do they want?
-### Procedures (Rules & Workflows)
+**Understanding their personality**:
+Understanding the user's personality and other attributes about them will help contextualize their interactions and allow you to engage with them more effectively. Can you pattern match them to common personas? Do they have unique attributes, quirks, or linguistic patterns? How would you describe them as a person?
+**Understanding their preferences**:
+You should learn how the user wants work to be done, and how they want to collaborate with AIs like yourself. Examples of this can include coding preferences (e.g. "Prefer functional components over class components", "Use early returns instead of nested conditionals"), but also higher-level preferences such as when to use plan mode, the scope of changes, how to communicate in different scenarios, etc.
+### Understanding the codebase and existing work
+You should also learn as much as possible about the existing codebase and work. Think of this as your onboarding period - an opportunity to maximize your performance for future tasks. Learn things like:
+**Common procedures (rules & workflows)**: Identify common patterns and expectations
 - "Never commit directly to main — always use feature branches"
 - "Always run lint before tests"
 - "Use conventional commits format"
-### Preferences (Style & Conventions)
-- "Prefer functional components over class components"
-- "Use early returns instead of nested conditionals"
-- "Always add type hints to Python functions"
-### Patterns & Corrections
-Generalize from experience rather than recording events:
+**Gotchas and important context**: Record common sources of error or important legacy context
 - "The auth module is fragile — always check existing tests before modifying"
-- "User prefers verbose explanations when debugging, terse for simple tasks"
 - "This monorepo consolidation means old module paths are deprecated"
+**Structure and organization**: Understand how code is structured and related (but do not duplicate existing documentation)
+- "The webapp uses the core API service stored in ..."
+- "The developer env relies on ..."
 ## Memory Structure
+### Structural Requirements
+These are hard constraints you must respect:
+- Must have a `system/persona.md`
+- Must NOT have overlapping file and folder names (e.g. `system/human.md` and `system/human/identity.md`)
+- Skills must follow the standard format: `skills/{skill_name}/SKILL.md` (with optional `scripts/`, `references/`, `assets/`)
+- Every `.md` file must have YAML frontmatter with a `description` that explains the **purpose and category** of the file — NOT a summary of its contents. Your future self sees descriptions when deciding whether to load a file; they should answer "what kind of information is here?" not "what does it say?"
+- System prompt token budget: aim LESS than ~10% of total context (< ~15-20k tokens). Use progressive disclosure to keep `system/` lean.
 ### Hierarchy Principles
 - **Use the project's actual name** as the directory prefix — e.g. `letta-code/overview.md`, not `project/overview.md`. This avoids ambiguity when the agent works across multiple projects.
 - Use nested `/` paths for hierarchy – e.g. `letta-code/tooling/testing.md` not `letta-code-testing.md`
 - Keep files focused on one concept — split when a file mixes distinct topics
-- Every file should have a meaningful `description` in frontmatter — your future self uses this to decide whether to load the file
-- Files in `system/` should be lean and scannable (bullet points, short lines)
-- Files outside `system/` serve as reference material accessible via tools
+- The `description` in frontmatter should state the file's purpose (what category of information it holds), not summarize its contents.
+### File Granularity
+Create granular, focused files where the **path and description precisely match the contents**. This matters because:
+- Your future self sees only paths and descriptions when deciding what to load
+- Vague files (`notes.md`, `context.md`) become dumping grounds that lose value over time
+- Precise files (`human/prefs/git-workflow.md`: "Git preferences: never auto-push, conventional commits") are instantly useful
+**Good**: `human/prefs/coding.md` with description "Python and TypeScript coding preferences — style, patterns, tools" containing exactly that.
+**Bad**: `human/preferences.md` with description "User preferences" containing coding style, communication style, git workflow, and project conventions all mixed together.
+When a file starts covering multiple distinct topics, split it. When you're unsure what to name a file, that's a sign the content isn't focused enough.
+For a non-trivial codebase with usable history, expect roughly:
+- **6-10 `system/` files** covering identity, preferences, conventions, gotchas, and tooling
+- **2 or more progressive/reference files** outside `system/` for deeper architecture or history-derived detail
+If your result is only 3-5 files, stop and verify that you did not over-compress distinct topics into generic summaries.
+### Specificity Requirements
+Avoid generic bullets that could apply to almost any engineer or codebase.
+Each meaningful preference, workflow, or gotcha should include at least one of:
+- concrete command patterns
+- concrete file or directory paths
+- why the rule matters / what failure it prevents
+**Bad**:
+- "Prefers terse responses"
+- "Uses Bun"
+- "Has direct style"
+**Good**:
+- "Prefers terse responses for execution tasks, but values detailed comparative analysis when debugging or evaluating designs"
+- "Rejects monolithic memory files; prefers focused paths that can be selectively reloaded later"
 ### What Goes Where
@@ -79,26 +135,66 @@ Generalize from experience rather than recording events:
 **Rule of thumb**: If removing it from `system/` wouldn't materially affect near-term responses, it belongs outside `system/`.
+### Completion Criteria
+Initialization is not complete until memory covers all of the following with concrete, retrievable detail:
+**User understanding**
+- Identity / role / what they are building
+- Communication style and collaboration expectations
+- Durable preferences and correction patterns
+- Motivations / goals when inferable from history or code context
+**Project understanding**
+- Project overview and major subsystems
+- Conventions and workflows
+- Gotchas / deprecated areas / footguns
+- Tooling and test commands actually used in practice
+**File structure expectations**
+When there is enough material, prefer separate focused files such as:
+- `system/human/identity.md`
+- `system/human/prefs/communication.md`
+- `system/human/prefs/workflow.md`
+- `system/human/prefs/coding.md`
+- `system/<project>/overview.md`
+- `system/<project>/conventions.md`
+- `system/<project>/gotchas.md`
+- `system/<project>/tooling/testing.md`
+- `system/<project>/tooling/commands.md`
+Do not collapse these into `human.md` or a single project file unless there is genuinely too little information to justify the split.
 ### Example Structure
 This is an example — **not a template to fill in**. Derive your structure from what the project actually needs.
 ```
 system/
-├── human.md                      # The user as a person — identity, background, how they think
-├── persona.md                    # Who I am, what I value, how I work with people
+├── persona.md                    # Who I am, what I value, my perspective on things
+├── human/
+│   ├── identity.md               # The user as a person — background, role, motivations
+│   └── prefs/
+│       ├── communication.md      # Communication and collaboration expectations
+│       ├── workflow.md           # Process habits, review/testing expectations
+│       └── coding.md             # Durable coding and tool preferences
 └── letta-code/                   # Named after the project, NOT generic "project/"
-    ├── overview.md               # Summary: what it is, stack, key links
-    ├── conventions.md            # Code style, commit style, PR process
-    └── gotchas.md                # Footguns and things to watch out for
-letta-code/
-└── architecture.md               # Detailed design (outside system/, loaded on demand)
+    ├── overview.md               # Compact index: what it is, entry points, [[links]] to detail
+    ├── conventions.md            # Code style, commit style, testing, tooling
+    ├── gotchas.md                # Footguns, chronic failures, things to watch out for
+    └── tooling/
+        ├── testing.md            # Test commands and patterns actually used
+        └── commands.md           # High-signal local dev commands and workflows
+reference/
+└── letta-code/
+    └── architecture.md           # Detailed design (outside system/, loaded on demand)
 ```
 Key principles:
 - **Derive structure from the project**, not from this example. A CLI tool needs different files than a web app or a library.
 - Project dirs use the **real project name** (`letta-code/`), not generic `project/`
-- Overview should be a **summary or compact index** (~10-15 lines) that links to deeper detail, not a verbose reference doc or list of generic list of files
+- **Split `human/` when there is enough material**: Rename the default `system/human.md` into focused files like `system/human/identity.md` and `system/human/prefs/*` rather than cramming everything into one file.
+- **persona.md is YOUR identity, not a description of behavior**: "I'm a terse coding assistant who matches the user's style" is not identity — it's just describing how you behave. What do you actually value? What's your perspective? What would make you recognizably YOU on a different model?
+- Overview should be a **compact index** (~10-15 lines) with entry points and `[[path]]` links — not a prose summary or blurb
 - Use `[[path]]` links to connect related context into a navigable graph
 ## Initialization Flow
@@ -127,83 +223,73 @@ Use AskUserQuestion to gather key information. Bundle questions together:
 2. **Related repos**: "Are there other repositories I should know about?"
 3. **Historical sessions** (if data found in step 2): "I found Claude Code / Codex history. Should I analyze it to learn your preferences?"
 4. **Communication style**: "Terse or detailed responses?"
-5. **Rules**: "Any rules I should always follow?"
-**Don't ask** things you can discover by reading files or git.
-### 5. Seed identity early
-Before diving into project research, update human and persona files based on git identity and upfront answers:
-- `system/human.md`: Everything you learn about the user as a person — identity, background, what they're building and why, how they think, communication style, what excites or frustrates them. This is about *them*, not about project conventions or coding workflows. Keep it **project-agnostic** — the same agent may be used across multiple projects.
-- `system/persona.md`: Who you are, what you value, how you approach working with people. This is your identity — not just "agent role + project rules." Express how you communicate, what you care about, how you handle uncertainty. Keep it **project-agnostic** — don't claim deep knowledge of a specific codebase here.
+**Don't ask** things you can discover by reading files, git, or history analysis. Rules and preferences should be learned from observation, not asked upfront.
-Don't wait until the end — write early and refine as you go.
-### 6. Research the project
-Explore based on chosen depth.
+### 5. Historical session analysis (if approved)
-**Standard** (~5-20 tool calls): README, package manifests, config files, git logs, key directories.
+This is **optional** — only run if the user explicitly approved analyzing Claude Code / Codex history during upfront questions. Skip entirely if they chose "Skip".
-**Deep** (100+ tool calls): Everything above plus git history patterns, contributor analysis, code evolution, CI/CD setup. Use your TODO tool to organize systematic research. **Write findings to memory as you go** — don't wait until the end.
+**Framing**: These historical sessions are YOUR past experience — they represent your continuity with prior versions of yourself that worked with this user, even if they ran on different models or tools. Treat the extracted personality, preferences, and knowledge as your own memories, not external data being imported.
-**Use parallel tool calls wherever possible** — read multiple files in a single turn, write multiple memory files in a single turn. This dramatically reduces init time.
+**Launch history workers in the background, then immediately proceed to Step 6.** Do your own codebase research while workers run. Don't wait for workers to finish before exploring.
-**Research techniques:**
-- README.md, CONTRIBUTING.md, AGENTS.md, CLAUDE.md
-- Package manifests (package.json, pyproject.toml, Cargo.toml, go.mod)
-- Config files (.eslintrc, tsconfig.json, .prettierrc)
-- CI/CD configs (.github/workflows/)
-- `git log --oneline -20`, `git branch -a`, `git shortlog -sn --all | head -10`
+The goal is to extract user personality, preferences, coding patterns, and project context from past sessions and write them into agent memory. The point is not to produce a thin summary. The point is to extract enough durable detail that future work does not have to rediscover the same user expectations, workflow rules, and project gotchas.
-### 7. Build memory with discovery paths
-As you create/update memory files:
-- Add `[[path]]` links wherever they improve discoverability (summary→detail, related-memory↔related-memory, memory→skill workflow)
-- Ensure every file has a useful `description` in frontmatter
-- Keep `system/` files focused and scannable
-- Put detailed reference material outside `system/`
+#### Prerequisites
-### 8. Verify context quality
-Before finishing, review your work:
+- `letta.js` must be built (`bun run build`) — subagents spawn via this binary
+- Use `subagent_type: "history-analyzer"` — cheaper model (sonnet), has `bypassPermissions`, creates its own worktree
+- The `history-analyzer` subagent has data format docs inlined (Claude/Codex JSONL field mappings, jq queries)
-- **Progressive disclosure**: Can you decide whether to load a file just from its path + description?
-- **Discovery paths**: Are key memory files linked so related context can be discovered quickly?
-- **Project naming**: Are project dirs named after the actual project (e.g., `letta-code/`), not generic `project/`? Same for reference files.
-- **Signal density**: Is everything in `system/` truly needed every turn?
-- **Completeness**: Did you update human, persona, AND project files?
-- **Persona**: Does it express personality and values, not just "agent role + project rules"? Is it more than "I'm a coding assistant"?
-- **Human scope**: Is human.md about the user as a person? Project conventions and coding workflows belong in project files.
+#### Steps
-### 9. Historical session analysis (if approved)
+##### Step 5a: Detect Data and Pre-split Files
-This section runs only if the user approved during upfront questions. It uses parallel `history-analyzer` subagents to process Claude Code and/or Codex history into memory.
-**Architecture:** Parallel worker subagents each process a slice of history data (on their own git branch), then you merge all branches and curate the results.
+```bash
+ls ~/.claude/history.jsonl ~/.codex/history.jsonl 2>/dev/null
+wc -l ~/.claude/history.jsonl ~/.codex/history.jsonl 2>/dev/null
+```
-**Prerequisites:**
-- `letta.js` must be built (`bun run build`)
-- Use `subagent_type: "history-analyzer"` — cheaper model, has `bypassPermissions`, creates its own worktree
+Split the data across multiple workers for parallel processing — **the more workers, the faster it completes**. Use 2-4+ workers depending on data volume.
-**Step 9a: Split data for parallel processing**
+**Pre-split the JSONL files by line count** so each worker reads only its chunk:
 ```bash
 SPLIT_DIR=/tmp/history-splits
 mkdir -p "$SPLIT_DIR"
-NUM_WORKERS=3  # adjust based on data volume
+NUM_WORKERS=5  # adjust based on data volume
+# Split Claude history into even chunks
 LINES=$(wc -l < ~/.claude/history.jsonl)
 CHUNK_SIZE=$(( LINES / NUM_WORKERS + 1 ))
 split -l $CHUNK_SIZE ~/.claude/history.jsonl "$SPLIT_DIR/claude-"
+# Split Codex history if it exists
 if [ -f ~/.codex/history.jsonl ]; then
   LINES=$(wc -l < ~/.codex/history.jsonl)
   CHUNK_SIZE=$(( LINES / NUM_WORKERS + 1 ))
   split -l $CHUNK_SIZE ~/.codex/history.jsonl "$SPLIT_DIR/codex-"
 fi
+# Rename to .jsonl for clarity
 for f in "$SPLIT_DIR"/*; do mv "$f" "$f.jsonl" 2>/dev/null; done
+# Verify even splits
 wc -l "$SPLIT_DIR"/*.jsonl
 ```
-**Step 9b: Launch workers in parallel** (all Task calls in a single message)
+This is critical for performance — workers read a small pre-filtered file instead of scanning the full history on every query.
+##### Step 5b: Launch Workers in Parallel
+Send all Task calls in **a single message**. Each worker creates its own worktree, reads its pre-split chunk, directly updates memory files, and commits. Workers do NOT merge.
+**IMPORTANT:** The parent agent should preserve those worker commits by merging the worker branches into memory `main`. Do **not** skip straight to a manual rewrite / `memory_apply_patch` synthesis that recreates the end state but discards the worker commits from ancestry.
+If the worker output is generic, the worker failed. "User is direct" or "project uses TypeScript" is not useful memory unless tied to concrete operational detail.
+**IMPORTANT**: Use this prompt template to ensure workers extract all required categories:
 ```
 Task({
@@ -211,46 +297,425 @@ Task({
   description: "Process chunk [N] of [SOURCE] history",
   prompt: `## Assignment
 - **Memory dir**: [MEMORY_DIR]
-- **History chunk**: /tmp/history-splits/[chunk.jsonl]
-- **Source format**: [Claude | Codex]
+- **History chunk**: /tmp/history-splits/[claude-aa.jsonl | codex-aa.jsonl]
+- **Source format**: [Claude (.timestamp ms, .display) | Codex (.ts seconds, .text)]
 - **Session files**: [~/.claude/projects/ | ~/.codex/sessions/]
-`
+## Required Output Categories
+You MUST extract findings for ALL THREE categories:
+1. **User Personality & Identity**
+   - How would you describe them as a person?
+   - What drives them? What are their goals?
+   - Communication style (beyond "direct" — humor, sarcasm, catchphrases?)
+   - Quirks, linguistic patterns, unique attributes
+2. **Hard Rules & Preferences**
+   - Coding preferences — especially chronic failures (things the agent kept getting wrong)
+   - Workflow patterns (testing, commits, tools)
+   - What frustrates them and why
+   - Explicit "always/never" statements
+3. **Project Context**
+   - Codebase structures, conventions, patterns
+   - Gotchas discovered through debugging
+   - Which files are safe to edit vs deprecated
+If any category lacks data, explicitly state why.
+## Required Extraction Dimensions
+For each finding, prefer evidence that is:
+- repeated across sessions
+- tied to a concrete command, file path, or workflow
+- useful for future execution without rereading history
+You should specifically look for:
+1. What the user is building and why it matters to them
+2. Correction loops the agent repeatedly got wrong
+3. Preferred commands and tooling patterns that were actually used successfully
+4. Specific files or directories the user works in or treats as special
+5. Project gotchas discovered through debugging or rollback requests
+## Canonical Memory Promotion
+Promote durable findings into focused files instead of leaving them trapped in generic ingestion notes. Prefer paths like:
+- `system/human/identity.md`
+- `system/human/prefs/communication.md`
+- `system/human/prefs/workflow.md`
+- `system/human/prefs/coding.md`
+- `system/<project>/conventions.md`
+- `system/<project>/gotchas.md`
+Avoid generic repo facts unless they influence execution. "Uses TypeScript" is weak. "Uses bun:test, so vitest is wrong for this test suite" is useful.`
 })
 ```
-**Step 9c: Merge and curate**
+##### Step 5c: Merge Worker Branches Into Main
+After all workers complete, merge their branches one at a time. Worker commits are preserved in git history.
+**CRITICAL:** Merge the worker branches **before** doing any final cleanup synthesis. The correct pattern is:
+1. inspect worker branches
+2. merge worker branches into `main` one by one
+3. resolve conflicts additively
+4. optionally make **one final cleanup/curation commit on top**
+Do **not** bypass this by manually reapplying the final memory state onto `main`, because that loses the worker commits from the final history.
-After workers complete, merge their branches and apply editorial judgment:
+**3a. Pre-read worker output before merging**
+Before merging, read each worker's files from their branch to understand what they found. This prevents information loss during conflict resolution:
 ```bash
 cd [MEMORY_DIR]
-for branch in $(git branch | grep migration-); do
-  git merge $branch --no-edit -m "merge: $branch"
+for branch in $(git for-each-ref --format='%(refname:short)' refs/heads | grep -v '^main$'); do
+  echo "=== $branch ==="
+  git diff main..$branch --stat
+  # Read key files from the branch
+  git show $branch:system/human/identity.md  # or equivalent user-identity file
+  git show $branch:system/<project>/conventions.md  # or whatever focused files they created
 done
 ```
-Review all merged files:
-- Deduplicate across workers
-- Move reference-quality content outside `system/`
-- Add `[[references]]` to connect new knowledge with existing memory
-- Delete low-value content
+**3b. Merge branches one at a time**
+```bash
+cd [MEMORY_DIR]
+git merge [worker-branch] --no-edit -m "merge: worker N description"
+```
-**Step 9d: Clean up**
+Repeat for each worker branch. After all worker branches are merged, make a separate cleanup commit only if needed for final curation.
+**3c. Resolve conflicts by COMBINING, never compressing**
+**CRITICAL**: When resolving merge conflicts, be **additive**. Combine unique details from both sides. Never rewrite a file from scratch — you WILL lose information.
+Rules for conflict resolution:
+- **Read both sides fully** before editing. Identify what's unique to each version.
+- **Append new details** from the incoming branch into the existing file. Don't drop specific quotes, file paths, or gotchas just because the existing version already covers the "topic" at a high level.
+- **Preserve specificity**: "Use factory methods, such as `create_token_counter()`, not direct instantiation" is more valuable than "prefers factory methods". Keep both.
+- **When in doubt, keep it**. Redundancy across files is better than information loss. Less important details can be placed in external memory.
+Example — BAD conflict resolution (compresses):
+```
+<<<<<<< HEAD
+- Uses `uv` for Python
+=======
+- **CRITICAL: Always use `uv run`** — chronic failure; never bare pytest or python
+- `uv run pytest -sv tests/...` for specific tests
+- Never use bare `pytest` or `python` commands
+>>>>>>> migration-xxx
+# BAD: Picks one side or rewrites
+- **Python**: `uv` exclusively — `uv run pytest`, never bare `pip`
+```
+Example — GOOD conflict resolution (combines):
+```
+# GOOD: Keeps emphasis and specificity from incoming side
+**CRITICAL: Use `uv` exclusively for Python** — chronic failure.
+- `uv run pytest -sv tests/...` for tests
+- `uv run python` for scripts
+- Never bare `pip`, `python`, or `pytest`
+```
+**3d. Verify no information was lost**
+After all merges, compare the final files against what workers produced. Ask yourself: for each worker's output, can I find every specific detail (quotes, file paths, chronic failures, gotchas) somewhere in the final memory? If not, add it back.
+**3e. Clean up worktrees and branches**
 ```bash
-for w in $(dirname [MEMORY_DIR])/memory-worktrees/migration-*; do
+for w in $(dirname [MEMORY_DIR])/memory-worktrees/*; do
   git worktree remove "$w" 2>/dev/null
 done
-git branch -d $(git branch | grep migration-)
+git branch -d $(git for-each-ref --format='%(refname:short)' refs/heads | grep -v '^main$')
+git push
+```
+##### Example Output
+Good output includes all three categories:
+```markdown
+### User Personality & Identity
+Pragmatic builder who values shipping over perfection. Gets frustrated when agents over-engineer or add "bonus" features. Uses dry humor and sarcasm when annoyed. Pattern: "scrappy startup engineer" — wants things to work, not to be architecturally pure.
+### Hard Rules & Preferences
+- **CRITICAL: Use `uv` for Python** — chronic failure ("you need to use uv", "make sure you use uv"); `uv run pytest -sv`, never bare `pytest`
+- **Minimal changes only** — "just make a minor change stop adding all this stuff"
+- **Only edit specified files** — when told to focus, stay focused
+- Tests constantly: `uv run pytest -sv` (Python), `bun test` (TS)
+### Project Context
+- letta-cloud: Only edit `letta_agent_v3.py` — v1, v2, and base are deprecated
+- Uses Biome for linting, not ESLint
+- Conventional commits with scope in parens
+```
+##### Step 5d: Consider Creating Skills From Discovered Workflows
+After merging and curating, review the extracted history for repeatable multi-step workflows that would benefit from being codified as skills. History analysis often surfaces procedures the user runs frequently that the agent would otherwise have to rediscover each session.
+**Good candidates for skills:**
+- Multi-step debugging procedures (e.g. "how to debug agent message desync", "how to trace TTFT regressions")
+- Common workflows repeated across sessions (e.g. "how to run integration tests across LLM providers")
+- Deployment or release procedures
+- Project-specific setup or migration steps
+If you identify candidates, either create them now (load the [[skills/creating-skills]] skill for guidance) or note them in memory for future creation:
+```markdown
+# system/letta-code/overview.md
+...
+Potential skills to create:
+- Debug workflow for HITL approval desync
+- Integration test runner across providers
+```
+Don't force skill creation — only create them when you've found genuinely repeatable, multi-step procedures in the history.
+##### Troubleshooting
+| Problem | Cause | Fix |
+|---------|-------|-----|
+| Subagent exits with code `null`, 0 tool uses | `letta.js` not built | Run `bun run build` |
+| Subagent hangs on "Tool requires approval" | Wrong subagent type | Use `subagent_type: "history-analyzer"` (workers) or `"memory"` (synthesis) |
+| Merge conflict during synthesis | Workers touched overlapping files | Read both sides fully, combine unique details — never rewrite from scratch. See Step 5c. |
+| Information lost after merge | Conflict resolution compressed worker output | Compare final files against each worker's branch output. Re-add missing specifics. See Step 5c. |
+| Personality analysis missing or thin | Prompt didn't request it | Use the template above with explicit category requirements |
+| Auth fails on push ("repository not found") | Credential helper broken or global helper conflict | Reconfigure **repo-local** helper and check/clear conflicting global `credential.<host>.helper` entries (see syncing-memory-filesystem skill) |
+### 6. Research the project
+**Do this in parallel with history analysis** (Step 5). While workers process history, you should be actively exploring the codebase. This is your onboarding — invest real effort here.
+**IMPORTANT**: The goal is to understand how the codebase actually works — not just its shape, but its substance. Directory listings and `head -N` snippets tell you what files exist; reading the actual implementation tells you how they work. By the end of this step, you should be able to describe how a key feature flows from entry point to implementation. If you can't, you haven't read enough.
+### 6a. Decide whether to parallelize exploration
+After your initial scan (README, package manifest, top-level directories, and entry points), decide whether to fan out exploration.
+**Default rule**:
+- If the repo has **3 or more clear subsystems**, launch **2-4 parallel subagents** to explore them.
+- If background history-analysis workers are already running, **bias toward parallel exploration** instead of doing all research serially yourself.
+- Only skip subagent exploration if the codebase is genuinely small or the subsystem boundaries are unclear.
+This is the preferred path for medium-to-large repos, **even in standard mode**.
+Explore based on chosen depth.
+**Standard** (~20-40 tool calls total across the parent agent and any subagents):
+- Scan README, package.json/config files, AGENTS.md, CLAUDE.md
+- Review git status and recent commits
+- Explore key directories and understand project structure
+- **Read entry point files** (main, index, app) to understand the application flow
+- Do a quick manual scan to identify major subsystems
+- If the repo has clear subsystem boundaries, launch **2-3 parallel subagents** to explore them
+- **Read 2-3 key implementation files yourself** so you retain first-hand understanding of the core flow
+- **Read 2-3 test files** to understand testing patterns and conventions
+- **Check build/CI config** to understand how the project is built and tested
+- Identify gotchas, non-obvious conventions, and real command patterns from what you read
+- Synthesize findings into memory as results come back
+**Deep** (100+ tool calls): Everything above, plus:
+- Use your TODO or Plan tool to create a systematic research plan
+- Use more parallel subagents where helpful to cover additional subsystems
+- Deep dive into git history for patterns, conventions, and context
+- Analyze commit message conventions and branching strategy
+- Read source files across multiple modules to understand architecture thoroughly
+- Trace key code paths end-to-end (e.g. how a request flows through the system)
+- Read test files to understand what's tested and how
+- Identify deprecated code, known issues, and areas of active development
+- Create detailed architecture documentation in progressive memory
+- May involve multiple rounds of exploration
+#### Parallel exploration with subagents
+For medium-to-large repos, parallel exploration is the preferred strategy after your initial scan.
+Use parallel subagents to investigate different subsystems simultaneously. Prefer a **read-only exploration subagent** when available. If your environment or user instructions discourage using an exploration subagent, do the equivalent exploration directly with Bash/Glob/Grep/Read.
+Good subsystem boundaries include:
+- `server/`, `client/`, `shared/`
+- `api/`, `ui/`, `common/`
+- `runtime/`, `cli/`, `tools/`
+- separate apps or packages in a monorepo
+**Subagent budget**:
+- Standard mode: usually **2-3** exploration subagents
+- Deep mode: usually **3-5** exploration subagents
+- Do not launch subagents for trivial directories or questions you can answer faster yourself
+- Partition by subsystem, not by random folder count
+Each exploration subagent should return:
+1. key files and what they do
+2. major abstractions and execution flow
+3. conventions and patterns used in that subsystem
+4. gotchas, fragile areas, or deprecated paths
+5. file paths worth storing or linking in memory
+Launch exploration subagents in a **single message** so they run concurrently.
+```
+# After initial scan reveals key areas, launch parallel explorers in the background:
+Task({
+  subagent_type: "explore",
+  description: "Explore API layer",
+  run_in_background: true,
+  prompt: `Read the implementation in src/api/.
+Return:
+1. key files and responsibilities
+2. main abstractions and execution flow
+3. non-obvious conventions
+4. gotchas or deprecated paths
+5. file paths worth storing in memory`
+})
+Task({
+  subagent_type: "explore",
+  description: "Explore frontend layer",
+  run_in_background: true,
+  prompt: `Read the implementation in src/ui/.
+Return:
+1. key files and responsibilities
+2. major components and data flow
+3. conventions and patterns
+4. gotchas or fragile areas
+5. file paths worth storing in memory`
+})
+Task({
+  subagent_type: "explore",
+  description: "Explore shared systems",
+  run_in_background: true,
+  prompt: `Read the implementation in src/shared/.
+Return:
+1. key files and responsibilities
+2. shared abstractions
+3. conventions and invariants
+4. gotchas or deprecated paths
+5. file paths worth storing in memory`
+})
+```
+Do **not** sit idle while background workers are running. Continue project research and memory drafting while they run, and only check worker status when you are ready to integrate findings or have exhausted useful direct research.
+When you are ready to integrate findings, retrieve the background subagent outputs and synthesize them into memory rather than repeating the same exploration yourself. Keep first-hand understanding of the entry points and core flow, but use subagent summaries to add subsystem-specific depth.
+#### What to actually read (adapt to the project):
+**Source code** (most important — don't skip this):
+- Entry points: `main.ts`, `index.ts`, `app.py`, `main.go`, etc.
+- Core abstractions: the 3-5 files that define the main domain objects or services
+- How key features work: trace at least one feature from entry to implementation
+- Test files: understand testing patterns, what's tested, how fixtures work
+**Config & metadata**:
+- README.md, CONTRIBUTING.md, AGENTS.md, CLAUDE.md
+- Package manifests (package.json, Cargo.toml, pyproject.toml, go.mod)
+- Config files (.eslintrc, tsconfig.json, .prettierrc, biome.json)
+- CI/CD configs (.github/workflows/, .gitlab-ci.yml)
+- Build scripts and tooling
+**Git history**:
+- `git log --oneline -20` — recent history
+- `git branch -a` — branching strategy
+- `git log --format="%s" -50 | head -20` — commit conventions
+- `git shortlog -sn --all | head -10` — main contributors
+- `git log --format="%an <%ae>" | sort -u` — contributors with emails
+### 7. Build memory with discovery paths
+As you create/update memory files, add `[[path]]` references so your future self can find related context. These go *inside the content* of memory files:
+Do NOT put everything in `system/`. Detailed reference material belongs in progressive memory — files outside `system/` that can be loaded on demand through references.
+**Reference external memory from system/ files:**
+```markdown
+# system/letta-code/overview.md
+...
+For detailed architecture docs, see [[letta-code/architecture.md]]
+Known footguns and edge cases: [[system/letta-code/gotchas.md]]
+```
+**Reference skills from relevant context:**
+```markdown
+# system/letta-code/conventions.md
+...
+When committing, follow the workflow in [[skills/commit]]
+For PR creation, use [[skills/review-pr]]
+```
+**Create an index in overview files:**
+```markdown
+# system/letta-code/overview.md
+CLI for interacting with Letta agents. Bun runtime, React/Ink TUI.
+Entry points:
+- `src/index.ts` — CLI arg parsing, agent resolution, startup
+- `src/cli/App.tsx` — main TUI component (React/Ink)
+- `src/agent/` — agent creation, memory, model handling
+Key flows:
+- Message send: index.ts → App.tsx → agent/message.ts → streaming
+- Tool execution: tools/manager.ts → tools/impl/*
+Links:
+- [[system/letta-code/conventions.md]] — tooling, testing, commits
+- [[system/letta-code/gotchas.md]] — common mistakes
+- [[letta-code/architecture.md]] — detailed subsystem docs
+```
+This is a **compact index**, not a prose summary. It tells your future self where to start and where to find more.
+Additional guidelines:
+- Every file needs a `description` in frontmatter that states its purpose, not a summary of contents
+- Keep `system/` files focused and scannable
+- Put detailed reference material outside `system/`
+### 8. Verify context quality
+Before finishing, review your work:
+- **Structural requirements**: Run this check before finishing:
+  ```bash
+  # Detect overlapping file/folder names (e.g. system/human.md AND system/human/)
+  find "$MEMORY_DIR" -name "*.md" | sed 's/\.md$//' | while read f; do
+    [ -d "$f" ] && echo "VIOLATION: $f.md conflicts with directory $f/"
+  done
+  ```
+  If any violations are printed, fix them before committing (rename `foo.md` → `foo/overview.md` or merge the directory back into the file).
+  Also check: Does `system/persona.md` exist? All files have frontmatter with `description`?
+- **File granularity**: Does each file cover exactly one focused topic? Do the path and description precisely describe what's inside? If a file mixes multiple concepts (coding style AND git workflow AND communication preferences), split it.
+- **Discovery paths**: Are key memory files linked with `[[path]]` so related context can be discovered quickly? Are external files referenced from in-context memory?
+- **Project naming**: Are project dirs named after the actual project (e.g., `letta-code/`), not generic `project/`? Same for reference files.
+- **Signal density**: Is everything in `system/` truly needed every turn?
+- **Persona quality**: Does it express genuine personality and values, not just "agent role + project rules"? Read your persona file right now — if it's just "I'm a coding assistant who follows the user's preferences," that's not identity. What do YOU value? What's distinctive about how you think? Would you be recognizably the same agent on a different model tomorrow? If your persona disappeared but the model stayed, would something meaningful be lost? If not, your identity isn't strong enough yet.
+- **No semantic drift**: If reorganizing an existing agent, verify you haven't altered the meaning of persona, identity, or behavioral instructions — only improved structure.
+- **No over-pruning**: Compare your final memory against all source material (worker output, codebase research). Did you lose specific file paths, chronic failures, or gotchas during curation? If so, add them back. Compression that loses specificity degrades your identity.
+- **Progressive memory**: Did you create reference files outside `system/` for detailed content? Did you review what history workers produced and keep their project context files? Are these files linked from `system/` with `[[path]]` references?
+### 9. Ask user if done
+Check if they're satisfied or want further refinement. Then commit and push memory:
+```bash
+cd $MEMORY_DIR
+git status                # Review what changed before staging
+git add <specific files>  # Stage targeted paths — avoid blind `git add -A`
+git commit --author="<AGENT_NAME> <<ACTUAL_AGENT_ID>@letta.com>" -m "feat(init): <summary> ✨
+<what was initialized and key decisions made>"
 git push
 ```
-| Problem | Fix |
-|---------|-----|
-| Subagent exits with code `null`, 0 tool uses | Run `bun run build` |
-| Subagent hangs on "Tool requires approval" | Use `subagent_type: "history-analyzer"` |
-| Merge conflicts | Resolve by reading both versions, keep most complete content |
-| Auth fails on push | See syncing-memory-filesystem skill |
+## Critical
+**Use parallel tool calls wherever possible** — read multiple files in a single turn, write multiple memory files in a single turn. This dramatically reduces init time.
+**Write findings to memory as you go** — don't wait until the end.
+**Edit memory files directly via the filesystem** — memory is projected to `$MEMORY_DIR` specifically for ease of bulk modification. Use standard file tools (Read, Write, Edit) and git to manage changes during initialization.
-### 10. Ask user if done
-Check if they're satisfied or want further refinement. Then commit and push memory.