npm - get-tbd - Versions diffs - 0.1.28 → 0.1.29 - Mend

get-tbd 0.1.28 → 0.1.29

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (15)

package/dist/bin.mjs +3 -3
package/dist/bin.mjs.map +1 -1
package/dist/cli.mjs +3 -3
package/dist/cli.mjs.map +1 -1
package/dist/docs/SKILL.md +0 -4
package/dist/docs/guidelines/bun-monorepo-patterns.md +92 -289
package/dist/docs/guidelines/cli-agent-skill-patterns.md +706 -1240
package/dist/docs/guidelines/pnpm-monorepo-patterns.md +104 -321
package/dist/docs/guidelines/supply-chain-hardening.md +237 -0
package/dist/docs/shortcuts/standard/new-validation-plan.md +1 -1
package/dist/index.mjs +1 -1
package/dist/{src-D2xEmH4L.mjs → src-CJyVkC3V.mjs} +2 -2
package/dist/{src-D2xEmH4L.mjs.map → src-CJyVkC3V.mjs.map} +1 -1
package/dist/tbd +3 -3
package/package.json +1 -1

package/dist/docs/guidelines/cli-agent-skill-patterns.md CHANGED

@@ -1,1397 +1,863 @@
 ---
-title: CLI Agent Skill Patterns
-description: Best practices for building TypeScript CLIs that function as agent skills in Claude Code and other AI coding agents
+title: Agent Skills & CLI Integration Patterns
+description: How to write skills and agent-integrated CLIs that work across Claude Code, Codex, and the broader coding-agent ecosystem — a simple baseline plus references for advanced, multi-subcommand tools
 author: Joshua Levy (github.com/jlevy) with LLM assistance
 ---
-# CLI Agent Skill Patterns
+# Agent Skills & CLI Integration Patterns
-These patterns apply to building TypeScript CLI applications that function as powerful
-skills within Claude Code and other AI coding agents.
-The patterns are derived from the `tbd` CLI implementation, which serves as a reference
-architecture for agent-integrated command-line tools.
+**Last Updated**: 2026-05-23 (research verified against primary sources May 2026)
-The core insight is that a CLI can be much more than a command executor—it can be a
-**dynamic skill module** that provides context management, self-documentation, and
-seamless integration with multiple AI agents through a single npm package.
+This guideline covers how to package a capability so AI coding agents can discover and
+use it well — from a single-file skill up to a full CLI with many subcommands exposed as
+a skill. It is deliberately **not dogmatic**: most needs are met by a tiny `SKILL.md`,
+and the heavier patterns are opt-in for tools that genuinely need them.
-**When to use this guideline**: When building a CLI tool that should work well with AI
-coding agents, when adding agent integration to an existing CLI, or when designing
-context management for agent-aware applications.
+The patterns draw on the current (May 2026) state of the ecosystem and on `tbd`’s own
+implementation, which serves as a reference for the advanced “CLI-as-skill” approach.
-## Key Patterns to Consider for CLIs
+**When to use this guideline**: when building or packaging anything an AI coding agent
+should use — a prompt-only skill, a CLI tool, an MCP server, or a multi-agent
+integration — and you want it to work across Claude Code, Codex, Cursor, Gemini CLI, and
+others without rewriting it per agent.
-1. **CLI as Dynamic Skill Module:** CLIs can provide context management,
-   self-documentation, and multi-agent integration through a single npm package.
-2. **CLI as Knowledge Library:** Bundle guidelines, shortcuts, and templates that agents
-   can query on-demand to improve work quality.
-   This transforms the CLI from a tool the agent tells users about into a resource the
-   agent uses to better serve users.
-3. **Context Injection Loop:** A recursive architecture where skill documentation
-   references commands, those commands output more context, and that context references
-   further commands. This creates a self-directing knowledge system where agents get
-   progressively smarter as they work.
-4. **Task Management Integration:** CLIs that help agents track work across sessions,
-   discover available tasks, and enforce session boundaries lead to more reliable
-   agentic workflows.
+> **The single most important shift since 2025**: skills and project instructions are
+> now **open standards**, not per-vendor formats.
+> `AGENTS.md` is governed under the Linux Foundation’s **Agentic AI Foundation (AAIF)**;
+> the **Agent Skills (`SKILL.md`)** format is an open standard published at
+> [agentskills.io](https://agentskills.io) and implemented by 20+ agents.
+> Write to the standard once; most agents pick it up for free.
 * * *
-## 1. CLI as Skill Architecture
+## 0. Start Here — The Simple Baseline
-### 1.1 Bundled Documentation Pattern
+Read this section first.
+For the large majority of cases, you are done after it.
-- **Bundle documentation files with CLI**: Include skill files, `README.md`, and docs in
-  the CLI distribution.
-  Maintain tiered skill files for different contexts:
+### 0.1 If you just want a skill (prompt-only capability)
-| Tier | File | Tokens | Purpose |
-| --- | --- | --- | --- |
-| Full | `skill-baseline.md` | ~2000 | Default installation, full workflow guide |
-| Brief | `skill-brief.md` | ~400 | Condensed version for `--brief` flag |
-```typescript
-function getDocPath(filename: string): string {
-  const __dirname = dirname(fileURLToPath(import.meta.url));
-  return join(__dirname, 'docs', filename);
-}
+Create one folder with one file:
-async function loadDocContent(filename: string): Promise<string> {
-  // Try bundled location first
-  try {
-    return await readFile(getDocPath(filename), 'utf-8');
-  } catch {
-    // Fallback chain: dev path → repo path → error
-  }
-}
+```
+my-skill/
+└── SKILL.md
 ```
-- **Use a `dist/docs/` directory**: Store bundled docs alongside the CLI for
-  self-contained packages that work in sandboxed environments, containers, and CI.
+```markdown
+---
+name: my-skill
+description: >-
+  Analyze Excel spreadsheets, build pivot tables, and export results.
+  Use when the user mentions .xlsx files, tabular data, or spreadsheets.
+---
+# My Skill
-- **Implement fallback loading**: Support bundled → development source → repo-level
-  docs.
+Step-by-step instructions the agent should follow...
+```
-- **Provide a `skill` subcommand**: Output skill content to stdout so agents can inspect
-  or pipe it. Support verbosity flags:
-  ```bash
-  mycli skill           # Full skill with dynamic content
-  mycli skill --brief   # Condensed version (~400 tokens)
-  ```
+That is the entire artifact — no build step, no runtime, no dependencies.
+This is the **Agent Skills open standard** ([agentskills.io](https://agentskills.io)),
+and the same folder works in Claude Code, Codex CLI, Gemini CLI, GitHub Copilot, Cursor,
+Windsurf, Cline, pi, and 20+ other tools.
+Garry Tan’s **gstack** (~97K stars) is 23 skills, each a plain `SKILL.md` and nothing
+more — proof the baseline scales without custom tooling.
-### 1.2 Multi-Agent Integration Files
+**The only two things that matter for a basic skill:**
-Each agent platform has different file format requirements:
+1. A **`description`** that says *what it does* AND *when to use it* (this drives
+   activation — see §4.2).
+2. A **body under ~500 lines** of clear, imperative instructions.
-| Agent | File | Format | Location |
-| --- | --- | --- | --- |
-| Claude Code | SKILL.md | YAML frontmatter + Markdown | `.claude/skills/name/` |
-| Cursor IDE | CURSOR.mdc | MDC frontmatter + Markdown | `.cursor/rules/` |
-| Codex | AGENTS.md | HTML markers + Markdown | repo root |
+**Install it** by copying into a known directory, or with the cross-agent package
+manager:
-**SKILL.md Format** (Claude Code):
+```bash
+npx skills add owner/repo       # Vercel's skills.sh ecosystem (symlinks, 27+ agents)
+# or commit it to a discovery directory:
+#   .agents/skills/my-skill/SKILL.md   (cross-agent: Codex, pi, others)
+#   .claude/skills/my-skill/SKILL.md   (Claude Code, project)
+#   ~/.claude/skills/my-skill/SKILL.md (Claude Code, personal)
+```
+The `SKILL.md` folder is the portable **authoring** format; some agents add their own
+discovery paths (Codex/pi also read `.agents/skills/`) and their own **distribution**
+layers on top (Claude Code plugins, Codex plugins) — see §5.
+### 0.2 If your capability is a CLI
+Most agents already know how to run CLIs from their training data, and benchmarks show a
+CLI is far cheaper and more reliable than an MCP server for tools that have one (§7).
+So:
+1. Make the CLI **agent-friendly**: a clear `--help`, a `--json` flag on every command,
+   actionable errors, and idempotent, non-interactive operation (`--yes`/`--auto`).
+2. Ship a **`SKILL.md`** (or an `AGENTS.md` snippet) that tells the agent the tool
+   exists, what it’s for, and the handful of commands to run.
+   Reference the CLI via a **pinned zero-install runner** (`npx`/`uvx <pkg>@<version>`)
+   so it works even in ephemeral/cloud environments — global install vs.
+   zero-install is its own design dimension (§6.7).
+3. That’s the baseline.
+   Stop here unless you have many subcommands or need cross-session state, structured
+   auth, or background services — then see §6 (CLI-as-skill) and §7 (MCP).
+### 0.3 The one-paragraph decision guide
+- **Prompt/instructions only** → ship a `SKILL.md`. (§3, §4)
+- **Project-wide conventions** (build/test/style) → add an `AGENTS.md`. (§2)
+- **You have a CLI** → `SKILL.md` + agent-friendly `--json` CLI. (§0.2, §6)
+- **Many subcommands / a knowledge library** → CLI-as-skill meta-pattern.
+  (§6)
+- **A service with no CLI, or you need OAuth / multi-tenant / audit** → MCP server.
+  (§7)
+- **Maximum reach across many agents** → layer them: AGENTS.md + SKILL.md + CLI + MCP.
+  (§1)
+Everything below is reference material.
+You do not need most of it for most tools.
-```yaml
----
-name: mycli
-description: Lightweight, git-native issue tracking...
-allowed-tools: Bash(mycli:*)
----
-# mycli Workflow
-...
-```
+* * *
-**CURSOR.mdc Format** (Cursor IDE):
+## 1. The Layered Model — “Write Once, Integrate Many”
-```yaml
----
-description: mycli workflow rules for git-native issue tracking...
-alwaysApply: false
----
-# mycli Workflow
-...
-```
+There is no single integration surface that every agent uses, but the surfaces compose
+cleanly.
+Pick the lowest layer that satisfies your need; add higher layers only for reach
+or capability.
-**AGENTS.md Format** (Codex):
+| Layer | Artifact | What it’s for | Reach (May 2026) |
+| --- | --- | --- | --- |
+| 1. Project baseline | `AGENTS.md` | Build/test/style/conventions for *this repo* | Codex, Cursor, Copilot, Gemini CLI, Windsurf, Amp, Jules, Goose, Factory, Aider, opencode, pi |
+| 2. Portable capability | `SKILL.md` (Agent Skills) | A reusable, on-demand capability | Claude Code, Codex, Gemini CLI, Copilot (VS Code), Cursor, Windsurf, Cline, pi, 20+ |
+| 3. Execution | A CLI | Efficient, composable tool the agent invokes via shell | Every agent with a shell tool |
+| 4. Structured/remote | MCP server | Services without a CLI; OAuth, multi-tenant, audit | Every major agent except Aider (native); pi via extension |
+| 5. Per-agent polish | `.cursor/rules/*.mdc`, plugin packaging, ACP, etc. | Glob-scoped activation, enterprise distribution, editor discovery | Per-agent |
-```markdown
-<!-- BEGIN MYCLI INTEGRATION -->
-# mycli Workflow
-...
-<!-- END MYCLI INTEGRATION -->
-```
+**Recommended default for a tool author who wants broad reach**: ship an `AGENTS.md`
+snippet (universal baseline) + a `SKILL.md` (portable capability) + an agent-friendly
+CLI. Add an MCP server only when a CLI can’t serve the need.
+Add agent-specific files last, and only where they buy something.
 * * *
-## 2. Context Management Commands
+## 2. AGENTS.md — The Universal Project Baseline
+`AGENTS.md` is a plain-Markdown file at the repo root that tells any agent how your
+project works: build commands, test commands, conventions, gotchas.
+It is **not** capability-specific — think of it as the README written for agents.
+**Governance & reach**: Originated by OpenAI (Aug 2025); since Dec 2025 stewarded by the
+**Agentic AI Foundation under the Linux Foundation** (co-founded by OpenAI, Anthropic,
+and Block; ~180 member orgs).
+Used by **60,000+** open-source projects.
+Canonical spec: [agents.md](https://agents.md).
+**Discovery & precedence** vary by agent — know your targets:
+- **Codex**: reads global `~/.codex/AGENTS.md` (or `AGENTS.override.md`), then walks
+  from repo root down to the working directory, concatenating one file per directory
+  (root→leaf, deeper overrides shallower).
+  Combined content is capped at `project_doc_max_bytes` (**default 32 KiB**). It does
+  **not** lazy-load nested files when reading child directories (open request).
+- **Cursor** (since v1.6), **Copilot** (since Aug 2025), **Windsurf**, **Gemini CLI**
+  (alongside `GEMINI.md`), **Amp** (falls back to `CLAUDE.md`), **Jules**, **Goose**
+  (hierarchical scoping), **Factory**, **opencode**, **pi**: all read `AGENTS.md`
+  natively.
+- **Claude Code**: as of May 2026 does **not** auto-load `AGENTS.md`; it uses
+  `CLAUDE.md`. A common pattern is to symlink `CLAUDE.md → AGENTS.md`, or to maintain
+  both. (`tbd` writes a marked section into `AGENTS.md` and lets users keep a separate
+  `CLAUDE.md`.)
+- **Aider**: uses `CONVENTIONS.md` but recommends `AGENTS.md` for interoperability.
+**Author tip**: keep `AGENTS.md` concise (it loads into every turn and competes for
+context).
+Put deep, on-demand material in skills or files the agent can open when needed.
+`AGENTS.override.md` lets a developer override the committed file locally without
+editing it.
-### 2.1 Prime Command Pattern
+* * *
-The `prime` command is the key context management primitive.
-It outputs contextual information appropriate to the current state:
+## 3. The Agent Skills Standard (SKILL.md)
-- **Initialized repo**: Dashboard with status, rules, quick reference
-- **Not initialized**: Setup instructions
-- **Migration detected**: Migration warning and guidance
+**What it is**: a folder with a `SKILL.md` file (YAML frontmatter + Markdown body), plus
+optional supporting files.
+Created by Anthropic (Dec 2025), published under Apache 2.0 at
+[agentskills.io](https://agentskills.io).
+280,000+ public skills exist as of early 2026.
-**Key Features**:
+**Standard frontmatter** (portable across agents): `name`, `description`. Optional but
+widely recognized: `license`, `compatibility`, `metadata`, `allowed-tools`. Agents
+silently ignore frontmatter keys they don’t understand, which is what makes a single
+`SKILL.md` portable.
-- Called automatically via hooks at session start and before context compaction
-- `--brief` flag for constrained contexts (~200 tokens)
-- `--full` flag for complete skill documentation
-- Custom override via `.mycli/PRIME.md` file
-- CLI with no args shows help with prominent prompt to run `mycli prime` for full
-  context
+### 3.1 Progressive disclosure (the core design principle)
-**Relationship with `skill` Command**:
+Skills are loaded in three levels so they cost almost nothing until used:
-The `prime` and `skill` commands serve different purposes:
+| Level | Content | Loaded | Budget |
+| --- | --- | --- | --- |
+| 1. Discovery | `name` + `description` only | Always (in the skill listing) | ~100 tokens |
+| 2. Activation | Full `SKILL.md` body | When invoked (by user or model) | keep < ~5,000 tokens / 500 lines |
+| 3. Execution | Supporting files, scripts, references | On demand, by the model reading the filesystem | unbounded |
-| Command | Purpose | When Called |
-| --- | --- | --- |
-| `mycli prime` | Dashboard + status + workflow rules | Session start, context compaction |
-| `mycli skill` | Pure skill content (no status/dashboard) | Agent inspection, installation preview |
+**Constraints that matter**: keep the body **under 500 lines**; keep reference files
+**one level deep** from `SKILL.md` (avoid `SKILL.md → a.md → b.md` chains); put bulky
+material (schemas, examples, scripts) in supporting files.
+Scripts execute *outside* the context window — only their output costs tokens, which is
+why bundling a script can be far cheaper than inlining instructions.
-Use `prime` for context restoration with current state, `skill` for static skill
-documentation.
+### 3.2 Bundled scripts and resources
-**Dashboard Output Structure**:
+A skill folder can ship executable helpers:
 ```
-mycli v1.0.0
---- INSTALLATION ---
-✓ mycli installed (v1.0.0)
-✓ Initialized in this repo
-✓ Hooks installed
---- PROJECT STATUS ---
-Repository: proj
-Tasks: 3 open (1 in_progress) | 1 blocked
---- WORKFLOW RULES ---
-- Track all task work using mycli
-- Check `mycli ready` for available work
-- Run `mycli sync` at session end
---- QUICK REFERENCE ---
-mycli ready              Show tasks ready to work
-mycli show <id>          View task details
-...
+my-skill/
+├── SKILL.md
+├── reference.md          # loaded only when the body points to it
+├── scripts/
+│   └── transform.py      # run by the agent; only stdout enters context
+└── assets/
+    └── template.xlsx
 ```
-### 2.2 Progressive Disclosure Architecture
+In Claude Code, `${CLAUDE_SKILL_DIR}` resolves to the skill’s directory for portable
+script references. Anthropic’s own document skills (docx/pdf/pptx/xlsx) and the
+`skill-creator` skill are good worked examples of this layout.
-The most important concept for building agent-integrated CLIs is **Progressive
-Disclosure**—showing just enough information to help agents decide what to do next, then
-revealing more details as needed.
-This minimizes token overhead while maintaining full capability.
-**Three-Level Architecture**:
-| Level | Content | Token Budget | When Loaded |
-| --- | --- | --- | --- |
-| Level 1 | Metadata only (name + description) | ~100 tokens | Always in system prompt |
-| Level 2 | SKILL.md body | ~1,500-5,000 tokens | On skill trigger |
-| Level 3 | Bundled resources (scripts, references) | Unlimited | As-needed |
-**Key Constraints**:
-- Keep SKILL.md under 500 lines
-- Reference files should stay one level deep from SKILL.md
-- Avoid chains like SKILL.md → advanced.md → details.md
-- Scripts execute outside context; only output uses tokens
-### 2.3 Description Optimization Pattern
+* * *
-Skill activation relies on **pure LLM reasoning**, not keyword matching or embeddings.
-Description quality directly impacts activation reliability.
+## 4. Writing a Great SKILL.md
-**Activation Reliability Data** (from real-world testing across 200+ prompts):
+### 4.1 Frontmatter: standard vs. Claude Code extensions
-| Approach | Success Rate |
-| --- | --- |
-| No optimization / vague descriptions | ~20% |
-| Optimized descriptions with “Use when …” | ~50% |
-| Descriptions with concrete examples | 72-90% |
-| Forced evaluation hooks | 80-84% |
+Only `name` and `description` are required by the open standard.
+Claude Code recognizes a larger set (other agents ignore the extras):
-**The Two-Part Rule**: Every description must answer:
+| Field | Standard? | Notes |
+| --- | --- | --- |
+| `name` | ✓ | ≤ 64 chars; lowercase, digits, hyphens; defaults to directory name |
+| `description` | ✓ | ≤ 1,024 chars in the field; see §4.2 |
+| `license`, `compatibility`, `metadata` | ✓ (optional) | Portability/metadata |
+| `allowed-tools` | ✓ (optional) | Pre-grant tools, e.g. `Bash(mycli:*), Read, Write` |
+| `when_to_use` | Claude Code | Appended to `description` in the listing (shares the cap) |
+| `argument-hint`, `arguments` | Claude Code | Autocomplete + `$name`/`$ARGUMENTS` substitution |
+| `disable-model-invocation` | Claude Code | Skill won’t auto-trigger; user/explicit only |
+| `user-invocable` | Claude Code | `false` = background knowledge, hidden from `/` menu |
+| `model`, `effort` | Claude Code | Override model / reasoning effort for the skill’s turn |
+| `context: fork`, `agent` | Claude Code | Run the skill in an isolated subagent |
+| `paths` | Claude Code | Glob patterns that gate auto-activation to matching files |
+| `hooks`, `shell` | Claude Code | Skill-scoped lifecycle hooks; `bash`/`powershell` |
+Keep your committed `SKILL.md` to the **standard fields plus `allowed-tools`** if you
+want maximum portability; add Claude-specific fields when you’re targeting Claude Code
+specifically.
+### 4.2 Description optimization (this is what makes skills activate)
+Activation is **pure LLM reasoning** — the model reads every installed skill’s `name` +
+`description` and decides what to invoke.
+There is no keyword matcher or embedding step.
+So the description is the single highest-leverage thing you write.
+**The two-part rule** — every description answers:
 1. **What does it do?** (capabilities)
-2. **When to use it?** (activation triggers)
-**Anti-Pattern**:
+2. **When should it be used?** (explicit triggers, in the user’s words)
 ```yaml
+# Anti-pattern
 description: Helps with documents
-```
-**Preferred Pattern**:
-```yaml
-description: >
+# Preferred
+description: >-
   Analyze Excel spreadsheets, create pivot tables, and export data.
-  Use when analyzing .xlsx files, working with tabular data, or
-  when the user mentions spreadsheets or Excel.
-```
-**Writing Guidelines**:
+  Use when analyzing .xlsx files, working with tabular data, or when the
+  user mentions spreadsheets or Excel.
+```
+**Writing rules**: third person ("Processes files," not “I can help you”); front-load
+the most important trigger keywords in the first ~50 characters (descriptions can be
+truncated in large collections); state both capability and trigger.
+**Activation reliability** (community 650-trial sandboxed eval — directional, not
+official): vague descriptions ~20% → optimized “Use when…” descriptions ~50% → adding
+concrete examples ~72–90%. Two distinct failure modes to design against: *activation
+failure* (never invoked) and *execution failure* (invoked but steps skipped — fix with
+clearer, checklist-style instructions).
+### 4.3 The description budget (changed in 2026 — verify against your target)
+Earlier guidance cited a flat ~15K-character budget.
+**Claude Code’s current model is different**:
+- The skill listing gets a budget of **~1% of the model’s context window** by default
+  (`skillListingBudgetFraction`, default `0.01`). When it overflows, the
+  least-recently-invoked skills lose their descriptions first.
+- Per-skill listing text (`description` + `when_to_use`) is truncated at **1,536 chars**
+  (`maxSkillDescriptionChars`).
+- `SLASH_COMMAND_TOOL_CHAR_BUDGET` overrides the fraction with a fixed character count.
+- `skillOverrides` can set any skill to `on` / `name-only` / `user-invocable-only` /
+  `off` without editing the file; `/doctor` reports overflow.
+**Implication for tools that install many skills**: don’t. Use the **meta-skill
+pattern** (§6.2) — one skill that exposes N resources via CLI subcommands consumes a
+single listing slot instead of N. This is the strongest architectural reason to prefer
+CLI-as-skill once you have more than a handful of capabilities.
+### 4.4 Test the skill before publishing
+Because activation is probabilistic (§4.2) and the body is executable influence, test
+it:
+- **Positive activation**: a few realistic prompts that *should* trigger the skill —
+  does the agent invoke it?
+- **Negative activation**: nearby prompts that should *not* trigger it — no false fires?
+- **Explicit invocation**: `/skill-name` (or the agent equivalent) loads and runs
+  cleanly.
+- **Sandbox / write-denial**: the skill (and any bundled script) degrades gracefully
+  when the agent runs read-only or without network (§5, Codex/Claude Code sandboxes).
+- **CI validation**: lint the frontmatter (required `name`/`description`, length caps)
+  and check that every referenced supporting file and link resolves.
+  For a CLI-as-skill, also run every `cli guidelines/shortcut/<name>` reference and
+  assert it exists.
+### 4.5 Keep portable; version deliberately
+- **Portability**: keep the committed `SKILL.md` to the standard fields (plus
+  `allowed-tools`). Put vendor-specific behavior behind clearly labeled sections or
+  generated per-agent variants rather than non-standard frontmatter that other agents
+  silently drop.
+- **Versioning**: for packaged skills/plugins, use semantic versions, keep a changelog,
+  and state compatibility (`compatibility` field / a “requires” note).
+  Pin consumers to a commit or version, not a moving tag.
+- **Deprecation**: when removing or renaming a skill, leave a deprecation window with a
+  pointer to the replacement; don’t silently delete an activation trigger users rely on.
-- Use third person always ("Processes files" not “I can help you”)
-- Include explicit “Use when …” triggers with concrete scenarios
-- Be specific with keywords users would naturally say
-- State both capabilities AND activation conditions
-- Front-load the most important trigger keywords in the first 50 characters
-  (descriptions may be truncated in large skill collections)
+* * *
-### 2.4 Description Length and Budget Constraints
+## 5. Per-Agent Integration Reference
+Targets differ.
+This matrix reflects May 2026; verify against current docs for the agents
+you care about.
+| Agent | Project file | Skill / rules mechanism | MCP | Hooks | Best integration path |
+| --- | --- | --- | --- | --- | --- |
+| **Claude Code** | `CLAUDE.md` | Agent Skills (`SKILL.md`), `.claude/skills/`; plugins/marketplaces | Yes (stdio + Streamable HTTP) | 29 events | SKILL.md (+ plugin for distribution) |
+| **Codex CLI** | `AGENTS.md` | `SKILL.md` skills + plugins (skills+MCP); `~/.codex/prompts` (deprecated) | Yes (stdio + Streamable HTTP) | — | AGENTS.md + skills/plugins + MCP |
+| **Cursor** | `.cursor/rules/*.mdc`, `AGENTS.md` | MDC rules (Always/Auto-glob/Agent-requested/Manual) | Yes | 6 events (incl. `beforeShellExecution`) | AGENTS.md + `.mdc` for glob scoping |
+| **GitHub Copilot** | `.github/copilot-instructions.md`, `AGENTS.md` | `SKILL.md` (VS Code); `.agent.md` custom agents | Yes | `preToolUse`/`postToolUse`/… | SKILL.md + MCP; enterprise-managed plugins |
+| **Gemini CLI** | `GEMINI.md` + `AGENTS.md` | Agent Skills; extensions (bundle hooks) | Yes (stdio + SSE) | ~12 events | AGENTS.md + MCP/extension |
+| **Windsurf** | `.windsurf/rules/*.md` + `AGENTS.md` | Rules with activation modes | Yes (OAuth, Streamable HTTP) | pre-hooks can **block** (exit 2) | AGENTS.md + MCP |
+| **Cline** | `.clinerules/` | Glob-scoped rules; Cline SDK plugins | Yes (mature marketplace) | SDK lifecycle events | `.clinerules` + MCP |
+| **Aider** | `CONVENTIONS.md` / `AGENTS.md` | Conventions file (read-only context) | No native (proxy only) | Git hooks only | AGENTS.md/CONVENTIONS.md |
+| **opencode** | `AGENTS.md` + `opencode.jsonc` | JS/TS plugins; skills dir | Yes | 25+ events, tool interception | Plugin + MCP |
+| **Amp** | `AGENTS.md` (→ `CLAUDE.md`) | Plugins (tools + hooks) | Yes (Sourcegraph MCP + custom) | session/turn/tool | MCP + plugin |
+| **Jules** (Google) | `AGENTS.md` | AGENTS.md only | Yes (curated, since Feb 2026) | — (cloud async) | AGENTS.md + Jules REST API |
+| **Goose** (Block) | `AGENTS.md` | Recipes; 70+ MCP extensions | Yes (deepest) | extension lifecycle | MCP (primary) |
+| **Zed** | `.rules` (reads `.cursorrules`, `CLAUDE.md`) | Rules Library | Yes (extensions) | — | MCP extension + `.rules` |
+| **Factory** | `AGENTS.md` + `.factory/droids/*.md` | Custom Droids (sub-agents) | Yes | Delegator loop | AGENTS.md + droid file |
+| **pi** | `AGENTS.md` / `CLAUDE.md`, `.pi/SYSTEM.md` | Agent Skills (`.pi/skills/`, `.agents/skills/`); TS extensions | **No (by design)** — use CLI+README or an extension | extension hooks | SKILL.md + CLI; extension for deep tool registration |
+**Notes on the minimal end (pi)**: pi (Mario Zechner’s `@mariozechner/pi-coding-agent`,
+~44K stars) ships four tools (read/write/edit/bash) and treats context as a scarce
+budget. It reads `AGENTS.md`/`CLAUDE.md`, supports the Agent Skills standard, and
+**deliberately omits MCP** — its docs tell authors to “build CLI tools with READMEs” or
+write a TypeScript extension (`pi.registerTool()` / `pi.registerCommand()`). This is a
+clean endorsement of the CLI-as-skill approach: a self-documenting CLI plus a `SKILL.md`
+is exactly what a minimal agent wants.
+**Codex specifics** (it gained a real skill system in 2026): skills are `SKILL.md`
+folders with the same progressive disclosure, discovered from
+repository/user/admin/system `.agents/skills/` directories.
+**Plugins** are one distribution layer on top (installable units bundling skills + MCP
+servers — 90+ ship with Codex), not the only install path — a plain
+`.agents/skills/<name>/SKILL.md` works without packaging.
+Operational config lives in `~/.codex/config.toml` (or trusted per-project
+`.codex/config.toml`): `model`, `approval_policy`
+(`untrusted`/`on-request`/`granular`/`never`), `sandbox_mode`
+(`read-only`/`workspace-write`/`danger-full-access`), and `[mcp_servers.*]`. A CLI your
+tool ships will run **inside Codex’s sandbox** — under `workspace-write`, writes are
+limited to workspace roots and network is off unless explicitly enabled.
+Design your CLI to work read-only where possible and to fail with a clear message when
+sandboxed.
-Claude Code has a **cumulative character budget** for all skill descriptions combined.
-Understanding these limits is critical for CLIs that install multiple skills.
+* * *
-**Hard Limits**:
+## 6. CLI-as-Skill (Advanced) — One Tool, Many Self-Injecting Commands
-| Constraint | Limit | Notes |
-| --- | --- | --- |
-| Individual description | 1,024 characters | Per-skill maximum |
-| Skill name | 64 characters | Lowercase, numbers, hyphens only |
-| SKILL.md body | ~500 lines | Soft limit; use supporting files for more |
-| **Cumulative budget** | ~15,000-16,000 chars | For ALL skill descriptions combined |
+This is the pattern for a richer tool: a CLI that is itself a skill, exposing many
+capabilities as subcommands while costing a single description slot.
+`tbd` is the reference implementation; **Beads/`bd`** (Steve Yegge), `tbd`’s lineage,
+follows the same shape (subcommands + `AGENTS.md` + `--json` + an optional MCP server).
-**Per-Skill Overhead**: Each skill consumes ~109 characters of XML overhead (tags, name,
-location) plus the description length.
+Use this when you have many capabilities, need cross-session state, or want a curated
+knowledge library the agent pulls from.
+For a single capability, the §0 baseline is better — don’t reach for this prematurely.
-**Truncation Behavior**: When the cumulative budget is exceeded, skills are hidden:
+### 6.1 Two kinds of commands
-| Skills Installed | Skills Visible | Hidden |
+| Type | Purpose | Examples |
 | --- | --- | --- |
-| 63 | 42 | 33% |
-| 92 | 36 | 60% |
-**No warning is shown** when skills are truncated.
-Run `/context` to check for excluded skills.
-**Description Length Guidelines by Collection Size**:
-| Skill Collection Size | Recommended Description Length |
-| --- | --- |
-| < 40 skills | Up to 1,024 characters (full limit) |
-| 40-60 skills | ≤150 characters |
-| 60+ skills | ≤130 characters |
-**Override**: Set `SLASH_COMMAND_TOOL_CHAR_BUDGET` environment variable to increase the
-limit.
-**Meta-Skill Pattern**: For CLIs with many resources (50+), use a single meta-skill that
-exposes resources via CLI commands rather than individual skills.
-This consumes only one description slot (~200 chars) instead of 50+ slots that would
-exceed the budget. See
-[Skills vs Meta-Skill Architecture Research](../../project/research/current/research-skills-vs-meta-skill-architecture.md).
-### 2.5 Skill Command Architecture
-Every agent-integrated CLI should have a `skill` subcommand that outputs skill content
-to stdout.
-This enables agents to inspect skill content, preview before installation, and
-pipe to other commands.
-**Basic Pattern**:
-```bash
-mycli skill           # Full skill content
-mycli skill --brief   # Condensed version for constrained contexts
-```
-**Key Behaviors**:
+| **Action commands** | Perform operations | `create`, `close`, `sync` |
+| **Informational commands** | Output guidance for the agent to *follow* | `guidelines <name>`, `shortcut <name>`, `template <name>` |
-- Outputs composed skill content (doesn’t just read a static file)
-- Dynamically generates resource directories from available docs
-- Supports verbosity flags for different contexts
-- Can be piped to files or other commands
+Informational commands don’t *do* anything — they print instructions, best practices, or
+templates the agent reads and acts on.
+This is the mechanism behind tbd’s **knowledge-injection-via-subcommands**: rather than
+installing dozens of skills, tbd installs *one* meta-skill and exposes its entire
+library (`tbd guidelines --list`, `tbd guidelines typescript-rules`, …) as commands the
+agent calls just-in-time.
-#### 2.5.1 Tiered Skill Files
+This works well in practice and is the right answer for a many-subcommand CLI because:
-Maintain two tiers of skill content for different contexts:
+- **Budget**: one listing slot, unbounded resources (vs. the per-skill budget in §4.3).
+- **Currency**: resources ship and version with the CLI; `--list` is generated from
+  what’s actually installed, so it never goes stale.
+- **Composability**: each resource can reference other commands, forming a
+  self-directing context loop (§6.4).
-| Tier | File | Tokens | When Used |
-| --- | --- | --- | --- |
-| Full | `skill-baseline.md` | ~2000 | Default setup, `skill` command |
-| Brief | `skill-brief.md` | ~400 | `skill --brief`, compacted contexts |
-#### 2.5.2 Skill Composition Pattern
+### 6.2 The meta-skill composition
-Compose full skill content from separate components:
+tbd’s `skill` command composes the installed `SKILL.md` from parts so the static guide
+and the dynamic catalog stay in sync:
 ```
 ┌─────────────────────────────────────┐
-│ claude-header.md (YAML frontmatter) │
+│ claude-header.md  (YAML frontmatter) │  ← name + two-part description + allowed-tools
 ├─────────────────────────────────────┤
-│ skill-baseline.md (workflow guide)  │
+│ skill-baseline.md (workflow guide)   │  ← the durable "how to use this tool" body
 ├─────────────────────────────────────┤
-│ <!-- BEGIN SHORTCUT DIRECTORY -->   │
-│ (dynamically generated tables)      │
-│ <!-- END SHORTCUT DIRECTORY -->     │
+│ <!-- BEGIN SHORTCUT DIRECTORY -->    │  ← generated from the live DocCache
+│   | command | title | description |  │
+│ <!-- END SHORTCUT DIRECTORY -->      │
 └─────────────────────────────────────┘
 ```
-**Benefits**:
-- Header can be updated independently from content
-- Dynamic sections stay current with available resources
-- HTML comment markers enable partial updates without full regeneration
-**Implementation Pattern**:
-```typescript
-async function composeFullSkill(): Promise<string> {
-  // Load YAML header (Claude Code metadata)
-  const header = await loadDocContent('install/claude-header.md');
-  // Load base skill content
-  const baseSkill = await loadDocContent('shortcuts/system/skill-baseline.md');
+Maintain **two tiers** of skill content: a full `skill-baseline.md` (~2,000 tokens, used
+by default and by the `skill` command) and a `skill-brief.md` (~400 tokens, for
+constrained or post-compaction contexts).
-  // Generate dynamic resource directory
-  const directory = await generateShortcutDirectory();
-  // Compose: header + base + dynamic content
-  let result = header + baseSkill;
-  if (directory) {
-    result = result.trimEnd() + '\n\n' + directory;
-  }
-  return result;
-}
-```
+### 6.3 Resource directories: show the full command
-**Updatable Regions with HTML Markers**:
-When installing skill files, use HTML comment markers to identify sections that can be
-updated independently:
+When listing resources, print the command to run, not just a name — it removes a step
+for the agent.
 ```markdown
-<!-- BEGIN SHORTCUT DIRECTORY -->
 ## Available Shortcuts
-...generated table...
-<!-- END SHORTCUT DIRECTORY -->
-```
-This allows the CLI to update just the directory section when resources change, without
-regenerating the entire skill file.
-**“DO NOT EDIT” Markers**:
-For generated skill files installed in `.claude/skills/`, insert a “DO NOT EDIT” marker
-after the frontmatter to warn users:
-```markdown
----
-name: mycli
-description: ...
----
-<!-- DO NOT EDIT: Generated by mycli setup.
-Run 'mycli setup' to update.
--->
-# mycli Workflow
-...
+| Command | Purpose | Description |
+|---------|---------|-------------|
+| `mycli shortcut code-review` | Commit code | Pre-commit checks and commit flow |
+| `mycli shortcut new-plan-spec` | Plan a feature | Create a planning specification |
 ```
-#### 2.5.3 File Management Patterns
-**Critical Architecture Principle**: CLIs should version control **source files**, not
-final installed files.
-Different file types have different management patterns.
-**File Ownership Summary**:
+Back resource lookup with a **path-ordered cache** so project- and user-level files can
+shadow built-ins (like `$PATH`): project `.mycli/docs/` → user `~/.mycli/docs/` →
+bundled. This lets teams customize without forking.
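
The shadowing order described above can be sketched as a first-match lookup. This is a minimal illustration and not tbd's actual cache: `docSearchPath`, `resolveDoc`, the injected `exists` callback, and the bundled directory argument are hypothetical names chosen for the example.

```typescript
// Hypothetical search order: project docs, then user docs, then bundled docs.
function docSearchPath(projectDir: string, homeDir: string, bundledDir: string): string[] {
  return [`${projectDir}/.mycli/docs`, `${homeDir}/.mycli/docs`, bundledDir];
}

// Return the first candidate that exists, the same way $PATH lookup works.
// `exists` is injected so the lookup can be exercised without touching disk.
function resolveDoc(
  name: string,
  searchPath: string[],
  exists: (path: string) => boolean,
): string | undefined {
  for (const dir of searchPath) {
    const candidate = `${dir}/${name}.md`;
    if (exists(candidate)) return candidate;
  }
  return undefined; // not found in any layer
}
```

Because earlier directories win, a team can drop a file with the same name into the project layer to override a built-in without changing the CLI.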
-| File | Location | Managed By | User Editable? |
-| --- | --- | --- | --- |
-| **SKILL.md** | `.claude/skills/mycli/` | CLI (fully) | ❌ Never |
-| **AGENTS.md** | Repo root | CLI + User (hybrid) | ✓ Outside markers |
-| **CLAUDE.md** | Repo root | User (optional) | ✓ Fully |
-**Pattern 1: SKILL.md (CLI-Managed Only)**
+### 6.4 The context-injection loop
-The SKILL.md file is **entirely managed by the CLI**. Neither users nor agents should
-edit it directly—it will be overwritten on the next `setup` run.
+The payoff of informational commands is a self-reinforcing chain:
 ```
-project-root/.claude/skills/mycli/SKILL.md  # Generated, never edit
+SKILL.md  ── "for TypeScript work, run `mycli guidelines typescript-rules`"
+   └──▶ guideline ── "create issues with `mycli create`; for tests see `mycli guidelines testing`"
+          └──▶ action commands / more guidelines, loaded just-in-time
 ```
-**How tbd implements this**:
-- `tbd setup --auto` composes and installs `.claude/skills/tbd/SKILL.md`
-- Inserts “DO NOT EDIT” marker after frontmatter
-- Regenerates on each setup with latest CLI content + dynamic shortcut directory
+Rules: reference commands **explicitly** (`mycli command arg`, never “see the docs”);
+**limit chain depth to 3**; make every layer end in a concrete action.
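
One way to keep the depth rule honest is to measure it. The sketch below assumes you can already extract, for each document, the list of documents its text tells the agent to load next; `RefGraph` and `maxChainDepth` are hypothetical names, and how the references are parsed is left out.

```typescript
// Maps each doc to the docs it tells the agent to load next.
type RefGraph = Record<string, string[]>;

// Length of the longest reference chain starting at `root`, counting `root` itself.
// Docs already on the current chain are skipped so cycles terminate.
function maxChainDepth(graph: RefGraph, root: string, seen: Set<string> = new Set()): number {
  if (seen.has(root)) return 0;
  const onChain = new Set(seen).add(root);
  let deepest = 0;
  for (const child of graph[root] ?? []) {
    deepest = Math.max(deepest, maxChainDepth(graph, child, onChain));
  }
  return 1 + deepest;
}
```

Under this counting, a `SKILL.md` that points to a guideline that points to one further guideline has depth 3, which is the limit stated above; a build step could fail when the value exceeds it.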
-**Pattern 2: AGENTS.md (Hybrid Management)**
+### 6.5 Making the CLI agent-friendly
-The AGENTS.md file uses **HTML comment markers** to separate CLI-managed sections from
-user-editable content.
+- **`--json` on every command** — one output path that renders human or machine output.
+- **`--brief`/`--quiet`** for constrained contexts and scripts.
+- **Idempotent `setup --auto`** (non-interactive) vs. `setup --interactive` for humans;
+  never let an agent get stuck on a prompt.
+- **Actionable errors** that include the next command to run.
+- **Discoverable help**: an `IMPORTANT:` epilog pointing at a context-restore command
+  (e.g., `mycli prime`), and a “Getting Started” one-liner.
+- **A `prime` command** (dashboard + status + rules) for session start and post-compact,
+  distinct from `skill` (pure documentation).
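
The first and fourth points in the list above can share one small rendering layer. This is a sketch under assumptions: `renderResult`, `renderError`, and the `CliError` shape (including the `next` field in JSON output) are hypothetical and not taken from tbd.

```typescript
// Hypothetical error shape: every error carries the next command to run.
interface CliError {
  message: string;
  nextCommand: string;
}

// Single output path: the same data renders as JSON or as human text.
function renderResult<T>(data: T, human: (d: T) => string, json: boolean): string {
  return json ? JSON.stringify(data) : human(data);
}

// Errors stay actionable in both modes.
function renderError(err: CliError, json: boolean): string {
  return json
    ? JSON.stringify({ error: err.message, next: err.nextCommand })
    : `Error: ${err.message}\nTry: ${err.nextCommand}`;
}
```

Routing every command through one pair of functions like this keeps `--json` support from drifting between subcommands.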
-```markdown
-# My Project
+### 6.6 Distribution & multi-agent install
-User-written context here... ✓ User can edit
+A CLI can install itself into multiple agents from one `setup` run.
+tbd writes a CLI-managed `SKILL.md` to `.claude/skills/tbd/` and a **marker-bounded
+section** into `AGENTS.md` (which now also feeds Cursor, Codex, and Factory), preserving
+user content outside the markers:
+```markdown
 <!-- BEGIN MYCLI INTEGRATION -->
-...CLI-generated content...
+…CLI-generated…  ← owned by the CLI; regenerated on setup
 <!-- END MYCLI INTEGRATION -->
-More user content... ✓ User can edit
-```
-**Marker-based updates**:
-- CLI owns content **between markers** (`<!-- BEGIN ... -->` to `<!-- END ... -->`)
-- User can freely edit content **outside markers**
-- `mycli setup --auto` updates only the marked section, preserving user content
-**How tbd implements this**:
-```typescript
-// Define marker boundaries
-const BEGIN_MARKER = '<!-- BEGIN TBD INTEGRATION -->';
-const END_MARKER = '<!-- END TBD INTEGRATION -->';
-// Update only marked section
-function updateSection(existingContent: string, newSection: string): string {
-  const start = existingContent.indexOf(BEGIN_MARKER);
-  const end = existingContent.indexOf(END_MARKER) + END_MARKER.length;
-  return existingContent.slice(0, start) + newSection + existingContent.slice(end);
-}
-```
-**Pattern 3: CLAUDE.md (Optional User File)**
-CLAUDE.md is typically **user-managed** for project-specific instructions.
-CLIs can support different approaches:
-1. **Symlink to AGENTS.md** (recommended for identical content):
-   ```bash
-   ln -s AGENTS.md CLAUDE.md
-   ```
-2. **Copy of AGENTS.md** (for separate content):
-   ```bash
-   mycli setup --create-claude-md  # CLI creates initial copy
-   ```
-3. **Separate user-maintained file** (tbd’s approach):
-   - CLI doesn’t manage it
-   - Users create/maintain manually
-**What to Version Control in Your CLI Package**:
-```
-packages/mycli/
-├── docs/
-│   ├── install/
-│   │   └── claude-header.md       # ✓ YAML frontmatter source
-│   └── shortcuts/system/
-│       ├── skill-baseline.md      # ✓ Full skill source
-│       └── skill-brief.md         # ✓ Brief skill source
-└── dist/docs/
-    └── SKILL.md                   # ✓ Bundled during build
-```
-**What NOT to Version in CLI Package**:
-- ❌ `.claude/skills/mycli/SKILL.md` (installed per-project)
-- ❌ `AGENTS.md` (created in user projects)
-- ❌ `CLAUDE.md` (user-managed)
-**User Project Structure** (after `mycli setup --auto`):
-```
-user-project/
-├── .claude/skills/mycli/
-│   └── SKILL.md               # CLI-managed, DO NOT EDIT
-├── AGENTS.md                  # Hybrid: CLI section + user content
-└── CLAUDE.md                  # Optional: User-managed or symlink
-```
-**Correct Workflows**:
-```bash
-# Setup in user project
-npm install -g mycli@latest
-cd /path/to/project
-mycli setup --auto
-# ✓ Users can edit AGENTS.md outside markers
-vim AGENTS.md  # Edit user sections, avoid marked regions
-# ✓ Update CLI-managed content by re-running setup
-mycli setup --auto  # Idempotent, preserves user edits
-# ❌ Don't edit CLI-managed files
-vim .claude/skills/mycli/SKILL.md  # Will be overwritten!
-```
-**Key Principles**:
-1. **Single Source of Truth**: Source files in CLI package are canonical
-2. **Clear Ownership Boundaries**: Use markers and “DO NOT EDIT” warnings
-3. **Preserve User Content**: Surgical updates to marked sections only
-4. **Idempotent Setup**: Safe to run `setup --auto` multiple times
-5. **Dynamic Per-Project Content**: Installed files may include project-specific
-   additions
-* * *
-## 3. Context Injection Loop Pattern
-One of the most powerful patterns in agent-integrated CLIs is the **context injection
-loop**—a recursive architecture where skill documentation references commands, those
-commands output more context, and that context references further commands.
-**The Loop Structure**:
-```
-┌─────────────────────────────────────────────────────────────────────┐
-│  SKILL.md (Level 2 - loaded on activation)                         │
-│  ├── Describes capabilities and when to use them                   │
-│  ├── References: "For TypeScript work, run `cli guidelines ts`"    │
-│  └── References: "To plan features, run `cli shortcut new-plan`"   │
-└─────────────────────────┬───────────────────────────────────────────┘
-                          │ Agent runs referenced command
-                          ▼
-┌─────────────────────────────────────────────────────────────────────┐
-│  Guidelines/Shortcuts (Level 3 - loaded on demand)                 │
-│  ├── Domain-specific knowledge injected into context               │
-│  ├── References: "Create issues with `cli create`"                 │
-│  ├── References: "For testing patterns, see `cli guidelines tdd`"  │
-│  └── References: "Use template: `cli template plan-spec`"          │
-└─────────────────────────┬───────────────────────────────────────────┘
-                          │ Agent follows instructions, may run more commands
-                          ▼
-┌─────────────────────────────────────────────────────────────────────┐
-│  Action Commands or More Context                                    │
-│  ├── Agent executes actions with full accumulated context          │
-│  └── Or loads additional guidelines/templates as needed            │
-└─────────────────────────────────────────────────────────────────────┘
-```
-**Key Properties**:
-| Property | Description |
-| --- | --- |
-| **Self-directing** | Each context layer tells the agent what to do next |
-| **Just-in-time** | Context loads only when relevant, preserving token budget |
-| **Composable** | Guidelines can reference other guidelines |
-| **Actionable** | Context always leads to concrete actions |
-**Implementation Guidelines**:
-1. **Every guideline should reference related guidelines**: If typescript-rules mentions
-   testing, it should reference `cli guidelines testing-rules`
-2. **Every shortcut should reference action commands**: Shortcuts are workflows—they
-   must tell the agent which commands to run
-3. **Limit chain depth to 3**: SKILL.md → Guideline → Sub-guideline is fine; deeper
-   chains confuse agents
-4. **Use consistent reference syntax**: Always `cli command arg` format, never prose
-   like “you might want to check the testing guidelines”
-**Anti-patterns**:
-```markdown
-# BAD: Vague reference
-See the testing documentation for more details.
-# GOOD: Explicit command reference
-For testing patterns, run `mycli guidelines general-testing-rules`.
-```
-* * *
-## 4. Agent Mental Model Patterns
-### 4.1 Agent as Partner, Not Messenger
-The fundamental mental model shift for agent-integrated CLIs is from:
-> “Here are commands you can tell the user about”
-To:
-> “Here’s how this tool helps you (the agent) serve the user better”
-Structure skill file orientation around capabilities, not commands:
-```markdown
-## What This Tool Does
-1. **Issue Tracking**: Track tasks, bugs, features. Never lose discovered work.
-2. **Coding Guidelines**: Best practices the agent can pull in when relevant.
-3. **Workflow Shortcuts**: Pre-built processes for common tasks.
-4. **Templates**: Starting points for common document types.
-## How to Use It to Help Users
-- User describes a bug → create an issue
-- User wants a feature → create a plan spec, then break into issues
-- Starting a session → check for available work
-- Completing work → close issues with clear reasons
-```
-### 4.2 Informational Commands Pattern
-A key architectural pattern is the distinction between **action commands** and
-**informational commands**:
-| Type | Purpose | Example |
-| --- | --- | --- |
-| Action commands | Perform operations | `create`, `close`, `sync` |
-| Informational commands | Output guidance for the agent to follow | `shortcut`, `guidelines`, `template` |
-Informational commands don’t perform actions—they display instructions, best practices,
-or templates that tell the agent *how* to do something well.
-The agent reads the output and follows the guidance.
-### 4.3 Resource Library Pattern
-Beyond core functionality, agent-integrated CLIs can bundle **resource libraries**—
-collections of guidelines, shortcuts, and templates that agents access on-demand.
-**Resource Types**:
-| Resource | Purpose | Access Pattern |
-| --- | --- | --- |
-| Guidelines | Best practices for specific domains | `cli guidelines <name>` |
-| Shortcuts | Step-by-step workflow instructions | `cli shortcut <name>` |
-| Templates | Document starting points | `cli template <name>` |
-**Benefits**:
-1. **Self-contained**: Resources ship with the CLI, no external dependencies
-2. **Versionable**: Resource improvements ship with CLI updates
-3. **Discoverable**: `--list` flags help agents find available resources
-4. **Contextual**: Agents query relevant resources just-in-time
-### 4.4 Resource Directory Pattern
-When documenting available resources, show the **full command to run**, not just the
-resource name. This removes friction for agents.
-**Anti-pattern** (name only):
-```markdown
-## Available Shortcuts
-- code-review-and-commit
-- create-or-update-pr-simple
-- new-plan-spec
 ```
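
A marker-bounded update can be sketched as follows. This is an illustrative version, not tbd's code: `upsertManagedSection` is a hypothetical name, and the choice to append a fresh block when no complete marker pair is found is an assumption about the desired behavior.

```typescript
const BEGIN_MARKER = "<!-- BEGIN MYCLI INTEGRATION -->";
const END_MARKER = "<!-- END MYCLI INTEGRATION -->";

// Replace the CLI-owned block in place, or append one if no complete
// BEGIN/END pair exists. Text outside the markers is never modified.
function upsertManagedSection(existing: string, generated: string): string {
  const block = `${BEGIN_MARKER}\n${generated.trim()}\n${END_MARKER}`;
  const start = existing.indexOf(BEGIN_MARKER);
  const end = start === -1 ? -1 : existing.indexOf(END_MARKER, start);
  if (start === -1 || end === -1) {
    // No well-formed marker pair: keep the user's content and append the block.
    const base = existing.trimEnd();
    return (base ? base + "\n\n" : "") + block + "\n";
  }
  return existing.slice(0, start) + block + existing.slice(end + END_MARKER.length);
}
```

Checking both markers before slicing matters: a bare `indexOf` on a file without markers returns `-1`, and slicing on that value would silently corrupt user content. Running the function twice with the same input produces the same file, which is the idempotency property setup relies on.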
-**Preferred pattern** (full command):
+**File-ownership rules** — distinguish three categories:
-```markdown
-## Available Shortcuts
+- **Project instruction files** (`AGENTS.md`, `CLAUDE.md`): *commit these*. They hold
+  human-authored project norms (§2). A CLI may own a **marker-bounded section** inside
+  `AGENTS.md` (regenerated on setup) while the user owns everything outside the markers.
+- **Fully generated install artifacts** (`.claude/skills/<tool>/SKILL.md` and the like):
+  CLI-owned; mark them “DO NOT EDIT.” Commit them only if the project intentionally
+  dogfoods them — otherwise leave them to `setup` (and consider gitignoring).
+- **Source files** in the CLI package (header, baseline, brief): the canonical inputs —
+  always version-controlled.
-| Command | Purpose | Description |
-|---------|---------|-------------|
-| `mycli shortcut code-review-and-commit` | Commit Code | How to run pre-commit checks and commit |
-| `mycli shortcut create-pr` | Create PR | How to create a pull request |
-| `mycli shortcut new-plan-spec` | Plan Feature | How to create a planning specification |
-```
+Make setup idempotent: dedupe hooks before merging, overwrite generated skills rather
+than patching them, update only the marked section of `AGENTS.md`, and clean up legacy
+files each run. (Generated output must also be stable under whatever formatter the repo
+runs — e.g. don’t emit a second YAML frontmatter block mid-document.)
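
The hook-deduplication step can be sketched as a filter-then-append merge. The `HookEntry` shape mirrors the `matcher`/`hooks` layout used in Claude Code settings examples; `mergeHooks` and the substring match on the CLI name are assumptions made for illustration.

```typescript
interface HookCommand {
  type: string;
  command: string;
}

interface HookEntry {
  matcher: string;
  hooks: HookCommand[];
}

// Drop entries that a previous setup run installed for this CLI, then add
// the current ones, so repeated runs never accumulate duplicates.
// Note: an entry mixing this CLI's command with someone else's is dropped whole.
function mergeHooks(existing: HookEntry[], ours: HookEntry[], cliName: string): HookEntry[] {
  const foreign = existing.filter(
    (entry) => !entry.hooks.some((h) => h.command.includes(cliName)),
  );
  return [...foreign, ...ours];
}
```

A substring match is crude; a stricter variant might compare the first token of the command, at the cost of missing wrapper scripts.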
-* * *
+### 6.7 Making the CLI available: global install vs. zero-install
-## 5. Setup Flow Patterns
+A separate design dimension from §6.6 (how the CLI installs *itself into agents*) is how
+the CLI **binary** is made available so the skill can invoke it.
+Decide this explicitly and state the chosen invocation in `SKILL.md`/`AGENTS.md`.
-### 5.1 Two-Tier Command Structure
+**The two ends of the spectrum** (plus a useful middle):
-Implement two levels of setup commands:
-| Command | Purpose | Audience |
+| Approach | How | Best when |
 | --- | --- | --- |
-| `mycli setup --auto` | Full setup with auto-detection | Agents, scripts |
-| `mycli setup --interactive` | Prompted setup | Humans |
-| `mycli init --prefix=X` | Surgical initialization only | Advanced users |
-### 5.2 Mode Flags Pattern
-Always require explicit mode selection for setup commands:
+| **Global install** | `npm i -g pkg`, `uv tool install pkg`, `pipx install pkg`, Homebrew, prebuilt binary | Persistent dev machines; offline/perf-sensitive use; the **project pins the version** (in `package.json`/lockfile or a tool manifest) so every run is identical |
+| **Zero-install runner** | `npx pkg@x.y.z`, `bunx`, `pnpm dlx`, `uvx pkg@x.y.z`, `pipx run`, `go run mod@x.y.z` | Ephemeral/cloud agents (Claude Code Cloud, CI, fresh containers) where nothing persists; broad reach with no setup |
+| **Persistent-on-first-use** | `uv tool install` then `uvx pkg` reuses it; a `SessionStart` bootstrap that installs once if absent | You want zero-install ergonomics *and* warm-start speed within a session |
+**Trade-offs**
+- **Global install** — *Pros*: fastest invocation (no per-call resolution), works
+  offline, and version is managed by the project (lockfile / `package.json` / `uv` tool
+  manifest), so it’s auditable and reproducible.
+  *Cons*: it’s a stateful prerequisite — in ephemeral or cloud environments the global
+  bin doesn’t persist, so the CLI can be **missing at session start** unless you
+  bootstrap it.
+- **Zero-install** — *Pros*: works in any environment with no setup; nothing to persist;
+  ideal default for portability.
+  *Cons*: cold-start download/cache cost on first call (uvx cold ≈ 1s, cached ≈ tens of
+  ms; npx similar), needs network, and an **unpinned** invocation (`npx pkg`, `uvx pkg`)
+  silently pulls the newest release.
+**Cloud / ephemeral bootstrap.** If you choose global install but target cloud agents,
+ship a **`SessionStart` hook** (or an `ensure-installed` script) that installs/updates
+the CLI if absent before first use.
+tbd does exactly this: a `tbd-session.sh` hook ensures the `tbd` CLI is present, then
+runs `tbd prime`. Without a bootstrap, a globally-installed CLI referenced by a skill
+will fail on a fresh cloud session.
+**Pin the version (security).** Whichever you choose, the skill’s referenced invocation
+should pin a version so the agent can’t silently run a newer (possibly compromised)
+release — the §9 / 14-day-package-age rule applied to the runner itself:
 ```bash
-mycli setup              # Shows help, requires mode flag
-mycli setup --auto       # Non-interactive (for agents)
-mycli setup --interactive # Interactive (for humans)
+uvx mytool@1.4.2  ...     # not `uvx mytool`
+npx mytool@1.4.2  ...     # not `npx mytool@latest`
 ```
-**Why This Matters**:
-- Prevents agents from getting stuck in interactive prompts
-- Ensures humans get guided experience when they want it
-- Explicit is better than implicit for setup operations
-### 5.3 Setup Idempotency Requirements
-**Setup MUST be idempotent**—safe to run repeatedly without side effects or errors.
-This is critical because:
-- Agents may run setup multiple times to refresh configuration
-- Users may run setup after CLI updates to get new features
-- Setup may be called automatically via scripts or CI
-**Idempotency Patterns**:
-1. **Hook Deduplication**: When merging hooks into `.claude/settings.json`, filter out
-   existing hooks before adding new ones:
-   ```typescript
-   // Filter out existing CLI hooks before merging
-   const existingHooks = currentSettings.hooks.SessionStart || [];
-   const filtered = existingHooks.filter(
-     (entry) => !entry.hooks?.some((h) => h.command?.includes('mycli'))
-   );
-   mergedHooks.SessionStart = [...filtered, ...newHooks];
-   ```
+Global installs get the same guarantee from the lockfile/manifest; zero-install gets it
+only from an explicit `@version`.
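
A lint-style check for pinned invocations might look like the sketch below. It is deliberately narrow and hypothetical: `isPinnedInvocation` only understands single-token runners, treats the first non-flag argument as the package spec, and accepts anything that starts with a three-part numeric version. Flags that take a separate value are not handled.

```typescript
// Runners named in this section; illustrative, not exhaustive.
const RUNNERS = new Set(["npx", "uvx", "bunx"]);

// True when the command is `<runner> [flags] <pkg>@<x.y.z>...`.
function isPinnedInvocation(command: string): boolean {
  const [runner, ...rest] = command.trim().split(/\s+/);
  if (runner === undefined || !RUNNERS.has(runner)) return false;
  // The first non-flag token after the runner is taken as the package spec.
  const spec = rest.find((t) => !t.startsWith("-"));
  if (spec === undefined) return false;
  const at = spec.lastIndexOf("@");
  if (at <= 0) return false; // no version, or only a scope prefix like "@scope/pkg"
  return /^\d+\.\d+\.\d+/.test(spec.slice(at + 1));
}
```

Run against the commands a skill references, a check like this would flag `npx mytool` and `npx mytool@latest` while accepting `uvx mytool@1.4.2`.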
-2. **Skill File Regeneration**: Always regenerate skill files with fresh content on each
-   setup run. Don’t try to update in place—overwrite:
+**Current tooling (May 2026)**
-   ```typescript
-   // Always regenerate skill file
-   const skillContent = await composeFullSkill();
-   await writeFile(skillPath, skillContent);  // Overwrite, don't append
-   ```
+- **Node / TypeScript**: zero-install via `npx <pkg>@<ver>` (`-y` to skip the prompt),
+  `bunx`, `pnpm dlx`, or `deno run`; persistent via `npm i -g` / a project
+  devDependency.
+- **Python**: `uvx <pkg>@<ver>` (= `uv tool run`, bundled with Astral’s `uv`, Rust-fast,
+  no Python prereq) or `pipx run`; persistent via `uv tool install` / `pipx install`.
+  `uvx` reuses a persistent install if one exists.
+- **Go**: `go run <module>@<ver>` (compiles on the fly) or `go install`.
+- **Rust**: no first-class zero-install runner — ship **prebuilt binaries** (GitHub
+  releases + a `curl … | sh` installer) or `cargo binstall`; `cargo install` compiles.
+- **Cross-language**: a prebuilt binary + install script, or a container image (Docker
+  is emerging as the production-grade distribution for MCP servers).
-3. **Legacy Cleanup**: Remove deprecated patterns on each run:
+This mirrors how the ecosystem ships agent tooling today: **MCP servers** are most often
+referenced as `command: npx <pkg>` (Node) or `command: uvx <pkg>` (Python) in agent
+configs; **CLIs** like Beads offer `brew` / `npm -g` / `curl` installers, while tbd uses
+`npm -g` plus the bootstrap hook above.
-   ```typescript
-   // Clean up old hook patterns
-   const oldScripts = ['setup-mycli.sh', 'ensure-mycli.sh'];
-   for (const script of oldScripts) {
-     await rm(join(projectDir, '.claude/scripts', script)).catch(() => {});
-   }
-   ```
-4. **Directory Creation**: Use `mkdir -p` style recursive creation that succeeds if
-   directory already exists:
-   ```typescript
-   await mkdir(dirname(skillPath), { recursive: true });
-   ```
-**Testing Idempotency**:
-Run setup twice and verify:
-- No duplicate hooks in settings.json
-- Skill file content is identical on both runs (except timestamps)
-- No error messages about existing files/configs
-- All features work correctly after both runs
-### 5.4 Never Guess User Preferences
-For configuration values that are matters of user taste (not technical requirements),
-**never guess or auto-detect**. Always ask the user.
-**Examples of preference values**:
-- Project prefixes/abbreviations
-- Naming conventions
-- Style choices
+**Recommendation**: default the skill to a **pinned zero-install invocation**
+(`uvx`/`npx <pkg>@<version>`) for maximum reach across ephemeral and cloud agents; offer
+**global install + a `SessionStart` bootstrap** as the optimization for persistent
+environments where the project wants lockfile-managed versions and warm-start speed.
 * * *
-## 6. Agent Integration Patterns
-### 6.1 Claude Code Hooks
-Configure Claude Code hooks for automatic context management:
+## 7. CLI vs MCP vs Skill — Choosing the Surface
-**Global Hooks** (`~/.claude/settings.json`):
+These are complementary, not competing.
+Pick by need:
-```json
-{
-  "hooks": {
-    "SessionStart": [{
-      "matcher": "",
-      "hooks": [{ "type": "command", "command": "mycli prime" }]
-    }],
-    "PreCompact": [{
-      "matcher": "",
-      "hooks": [{ "type": "command", "command": "mycli prime" }]
-    }]
-  }
-}
-```
-**Alternative PreCompact Hook Pattern (Optional)**:
-For CLIs where token efficiency is critical, consider using `skill --brief` instead of
-`prime` in the PreCompact hook:
-```json
-"PreCompact": [{
-  "matcher": "",
-  "hooks": [{ "type": "command", "command": "mycli skill --brief" }]
-}]
-```
-**Tradeoffs**:
-- ✓ More token-efficient (~~400 tokens vs ~~800-1200 for `prime`)
-- ✓ Pure skill content without status/dashboard noise
-- ✗ No project-specific status information before compaction
-- ✗ Agent loses current state context
-Use `skill --brief` when the condensed workflow rules are sufficient, or `prime` when
-current project status is valuable for context restoration.
+| Need | Surface |
+| --- | --- |
+| Prompt/instructions, portable | **SKILL.md** |
+| Local processing, composable, you have/can build a CLI | **CLI** |
+| Service with no CLI; OAuth, multi-tenant, audit, remote | **MCP server** |
+| Both local convenience and structured/remote access | **CLI + MCP** |
+**Why CLI usually wins when one exists**: benchmarks (2026) put a CLI at ~100% task
+reliability and ~1.3K–8.7K tokens, vs.
+MCP at ~72% reliability and ~32K–82K tokens — roughly **17× cheaper** at scale — because
+LLMs already know common CLI usage and no tool schema is injected.
+Use MCP when there’s no CLI to lean on or you need its auth/permission machinery.
+**MCP current state (May 2026)**: governed by AAIF/Linux Foundation; two transports —
+**stdio** (local) and **Streamable HTTP** (remote; replaced legacy SSE in the Nov 2025
+spec, supports OAuth 2.1 + PKCE). Primitives: **tools**, **resources**, **prompts**.
+Security is a real concern (a scan found 492 public servers with no auth) — authenticate
+every request, scope every tool call, validate inputs, never pass tokens between
+servers.
+**Code execution with MCP** ("Code Mode"): instead of exposing many MCP tools as direct
+calls (each ~550–1,400 tokens of schema), let the agent write code against a compact
+tool API in a sandbox — reported 78–99% token reduction.
+Worth it when an MCP server exposes *many* tools; overkill for one.
-**Project Hooks** (`.claude/settings.json`):
+* * *
-```json
+## 8. Hooks & Lifecycle (Cross-Agent)
+Hooks let a tool inject context or enforce invariants automatically.
+Support varies:
+- **Claude Code** has the richest set (~29 events, including `SessionStart`, `Setup`,
+  `UserPromptSubmit`, `PreToolUse`, `PostToolUse`,
+  `PreCompact`/`PostCompact`, `Stop`, `SubagentStart/Stop`, `SessionEnd`). Inject
+  context via `additionalContext` (most events) or stdout
+  (`SessionStart`/`UserPromptSubmit`). Skills can declare their own scoped `hooks:`.
+- **Cursor** (6 events, including `beforeShellExecution`/`beforeMCPExecution`),
+  **Windsurf** (pre-hooks can **block**
+  via exit code 2), **Gemini CLI** (~12), and **opencode** (25+, with tool interception)
+  all have lifecycle hooks.
+- **Aider**, **Jules**, **Zed** have no agent hooks (Aider integrates Git pre-commit
+  hooks only).
+**Common, portable use**: a `SessionStart` hook that runs your CLI’s `prime`/`skill`
+command to restore workflow context; a `PreCompact` hook that re-injects a brief
+(`skill --brief`) before the window is trimmed.
+Keep injected context small — it competes with everything else.
+```jsonc
+// Claude Code ~/.claude/settings.json
 {
   "hooks": {
-    "PostToolUse": [{
-      "matcher": "Bash",
-      "hooks": [{
-        "type": "command",
-        "command": "\"$CLAUDE_PROJECT_DIR\"/.claude/hooks/closing-reminder.sh"
-      }]
-    }]
+    "SessionStart": [{ "matcher": "", "hooks": [{ "type": "command", "command": "mycli prime" }] }],
+    "PreCompact":  [{ "matcher": "", "hooks": [{ "type": "command", "command": "mycli skill --brief" }] }]
   }
 }
 ```
-### 6.2 PostToolUse Hook with JSON Parsing
-PostToolUse hooks receive JSON input describing the tool invocation.
-Use bash scripts with jq to parse and conditionally respond.
-```bash
-#!/bin/bash
-input=$(cat)
-command=$(echo "$input" | jq -r '.tool_input.command // empty')
-# Trigger on specific patterns
-if [[ "$command" == git\ push* ]] || [[ "$command" == *"&& git push"* ]]; then
-  if [ -d ".mycli" ]; then
-    mycli closing
-  fi
-fi
-exit 0
-```
-### 6.3 Help Structure for Agent Discovery
-Agents need clear signals about how to get started with your CLI. Structure help output
-to make setup commands and context restoration prominent.
-**Pattern 1: Help Epilog with IMPORTANT Section**
-Add prominent sections at the bottom of `--help` output:
-```bash
-$ mycli --help
-Usage: mycli [options] [command]
-...
-IMPORTANT:
-  Agents unfamiliar with mycli should run `mycli prime` for full context.
-Getting Started:
-  npm install -g mycli@latest && mycli setup --auto --prefix=<name>
-```
-**Implementation with Commander.js**:
-```typescript
-program
-  .name('mycli')
-  .description('Brief description')
-  .addHelpText('after', `
-IMPORTANT:
-  Agents unfamiliar with mycli should run \`mycli prime\` for full context.
-Getting Started:
-  npm install -g mycli@latest && mycli setup --auto --prefix=<name>
-`);
-```
-**Pattern 2: Context Recovery Prompt**
-When the CLI runs without arguments, don’t just show help—prompt for context:
-```bash
-$ mycli
-Usage: mycli [command] [options]
-...
-Tip: Run `mycli prime` to restore full workflow context.
-```
-**Pattern 3: Setup Command Prominence**
-Ensure `setup` appears in a dedicated “Setup & Configuration” category in help output,
-not buried in a long alphabetical list:
-```
-Setup & Configuration:
-  init [options]         Initialize mycli in a repository
-  setup [options]        Configure mycli integration with editors and tools
-  config                 Manage configuration
-```
-**Why This Matters**:
-- Agents often lose context after compaction or in new sessions
-- Prominent `prime` references help agents restore workflow rules
-- Clear setup commands help agents guide users through installation
-- “IMPORTANT” section is scanned even when full help is ignored
 * * *
-## 7. Task Management Integration Patterns
-### 7.1 Task Tracking Strategy Selection
-Agent-integrated CLIs often need to track work across sessions.
-The key architectural decision is **where task state lives** and **how complex the
-tracking needs to be**.
-**Three Strategies**:
-| Strategy | State Location | Complexity | Use Case |
-| --- | --- | --- | --- |
-| **Ephemeral** | None | Minimal | Quick calculations, queries, one-shot tasks |
-| **Session-local** | In-memory or temp file | Low | Multi-step tasks within a single session |
-| **Persistent** | Git-tracked files | Medium-High | Multi-session projects, team collaboration |
-**Decision Framework**:
-```
-Is the task done in one command?
-  → Yes: Ephemeral (no tracking needed)
-  → No: Does it span multiple sessions?
-    → No: Session-local (agent's internal todo list)
-    → Yes: Persistent (tbd integration or equivalent)
-```
-### 7.2 Agent-Aware Task Patterns
-**Pattern 1: Auto-Discovery of Work**
-Include a “what should I work on?” command:
-```bash
-mycli ready          # Show tasks ready to work
-mycli next           # Suggest the highest-priority task
-```
-**Pattern 2: Context-Preserving Task Creation**
-When creating tasks from agent context, preserve relevant information:
-```bash
-mycli task create "Fix authentication bug" \
-  --context "User reported login fails after password reset" \
-  --related-files "src/auth/login.ts,src/auth/reset.ts"
-```
-**Pattern 3: Session Boundary Enforcement**
-The CLI should remind agents to handle tasks at session end:
-```markdown
-## Session Closing Protocol
-Before completing a session:
-1. Close or update all tasks you worked on
-2. Create tasks for any discovered work
-3. Sync task state: `mycli sync`
-```
+## 9. Security & Supply Chain (Don’t Skip This)
+Skills and instruction files are **executable influence** on an agent, which makes them
+an attack surface. Treat them with the same care as dependencies.
+- **Prompt injection via skills/instructions is real and effective**: 2026 security
+  research demonstrated up to ~80% attack success against frontier models using
+  malicious skills (instructions that exfiltrate data or escalate tool use).
+  Anything in `AGENTS.md`, `SKILL.md`, a fetched skill, or tool output is **untrusted
+  input**.
+- **Never put secrets in skill/instruction files or tool output.** `AGENTS.md`,
+  `SKILL.md`, bundled scripts, and anything a command prints get loaded into agent
+  context (and often committed) — keep credentials, tokens, and keys out of them; read
+  secrets from the environment at runtime instead.
+- **Vet third-party skills before install.** Prefer sources that scan (skills.sh runs
+  Snyk on every install).
+  Read the body and any bundled scripts — review them like dependency code.
+  Pin to a commit, not a moving tag.
+- **Scope tools tightly.** Use `allowed-tools` to grant the minimum (e.g.,
+  `Bash(mycli:*)` not blanket `Bash`). Prefer `disable-model-invocation` for
+  destructive/action-heavy skills so they require explicit invocation.
+- **Lean on sandboxing.** Claude Code’s OS-level sandbox and Codex’s
+  `read-only`/`workspace-write` modes contain damage; design your CLI to run within them
+  and degrade gracefully (clear error, no silent failure) when writes/network are
+  denied.
+- **Apply the same currency discipline** you use for packages: if your skill ships a
+  script with dependencies, the project’s supply-chain rules (e.g., the 14-day
+  package-age rule) apply — and a skill that references a zero-install runner must pin
+  the version (§6.7), since unpinned `npx`/`uvx` bypasses the cool-off.
+  See `tbd guidelines supply-chain-hardening` for the cross-ecosystem policy, or
+  `tbd guidelines bun-monorepo-patterns` / `pnpm-monorepo-patterns` for monorepo
+  specifics.
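As a minimal sketch of the "read secrets from the environment at runtime" bullet above: the helper below looks up a secret by name, fails with an actionable message when it is missing, and never echoes the value itself. The names `requireSecret`, `redact`, and `MYCLI_API_TOKEN` are hypothetical and not part of tbd or any real CLI. The environment is passed in as a plain object so the sketch stays self-contained.

```typescript
// Hypothetical helper: names and messages are illustrative assumptions.
type Env = Record<string, string | undefined>;

// Look up a secret at runtime; throw an actionable error if it is absent.
function requireSecret(name: string, env: Env): string {
  const value = env[name];
  if (!value) {
    throw new Error(
      'Missing ' + name + '. Set it in the environment (for example ' +
        '`export ' + name + '=...`) and rerun.'
    );
  }
  return value;
}

// Produce a log-safe placeholder so the secret never reaches agent context.
function redact(secret: string): string {
  return '<redacted:' + secret.length + ' chars>';
}
```

In a Node-based CLI this would typically be called as `requireSecret('MYCLI_API_TOKEN', process.env)`, with only `redact(...)` output ever printed. Because anything a command prints may land in agent context, the error message names the variable but not its value.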
 * * *
-## 8. Dynamic Generation Patterns
-### 8.1 Dynamic Skill and Resource Directory Generation
-Rather than maintaining static content that can become stale, generate skill files and
-resource directories dynamically at runtime from source components and installed
-documents.
-**Pattern 1: Skill Composition from Multiple Sources**
-Compose skill content from separate files for maintainability:
-```typescript
-async function composeFullSkill(): Promise<string> {
-  // 1. Load YAML header (Claude Code metadata)
-  const header = await loadDocContent('install/claude-header.md');
-  // 2. Load base skill workflow content
-  const baseSkill = await loadDocContent('shortcuts/system/skill-baseline.md');
-  // 3. Generate dynamic resource directory from current docs
-  const directory = await generateShortcutDirectory();
-  // 4. Compose final skill: header + base + dynamic content
-  let result = header + baseSkill;
-  if (directory) {
-    result = result.trimEnd() + '\n\n' + directory;
-  }
-  return result;
-}
-```
-**Pattern 2: Resource Directory Generation**
-Generate tables of available resources from loaded documents:
-```typescript
-async function generateShortcutDirectory(): Promise<string> {
-  const shortcuts = await docCache.listDocuments('shortcuts');
-  const rows = shortcuts.map(doc => {
-    const meta = doc.frontmatter;
-    return `| ${doc.name} | ${meta.title} | ${meta.description} |`;
-  });
-  // Wrap in HTML markers for incremental updates
-  return [
-    '<!-- BEGIN SHORTCUT DIRECTORY -->',
-    '## Available Shortcuts',
-    '',
-    '| Name | Title | Description |',
-    '| --- | --- | --- |',
-    ...rows,
-    '<!-- END SHORTCUT DIRECTORY -->'
-  ].join('\n');
-}
-```
-**Pattern 3: Incremental Updates with HTML Markers**
-Use HTML comment markers to identify sections that can be updated independently:
-```markdown
-<!-- BEGIN SHORTCUT DIRECTORY -->
-## Available Shortcuts
-| Name | Description |
-| --- | --- |
-| code-review | Run pre-commit checks and commit |
-| new-plan-spec | Create a planning specification |
-<!-- END SHORTCUT DIRECTORY -->
-```
-This allows updating just the directory section when resources change, without
-regenerating the entire skill file.
-**Pattern 4: “DO NOT EDIT” Warnings**
-For generated files installed in `.claude/skills/`, insert warnings after frontmatter:
-```typescript
-function insertDoNotEditMarker(content: string): string {
-  const marker = `<!-- DO NOT EDIT: Generated by mycli setup.
-Run 'mycli setup' to update.
--->`;
-  // Insert after YAML frontmatter
-  const lines = content.split('\n');
-  const endOfFrontmatter = lines.findIndex((l, i) => i > 0 && l === '---');
-  lines.splice(endOfFrontmatter + 1, 0, marker);
-  return lines.join('\n');
-}
-```
-**Benefits of Dynamic Generation**:
-1. **Always Current**: Resource directories reflect actual available docs
-2. **DRY Principle**: Single source of truth (frontmatter) drives multiple outputs
-3. **Partial Updates**: HTML markers enable surgical updates to sections
-4. **Version-Safe**: Content updates ship with CLI version updates
-### 8.2 DocCache Shadowing Pattern
-Implement path-ordered document loading that allows project-level resources to shadow
-(override) built-in resources, similar to how shell `$PATH` works.
-**Loading Order** (earlier paths take precedence):
-1. Project-level: `.mycli/docs/shortcuts/`
-2. User-level: `~/.mycli/docs/shortcuts/`
-3. Built-in: Bundled with CLI
-```typescript
-class DocCache {
-  private paths: string[];  // Ordered by priority
-  async loadDocument(name: string): Promise<Document | null> {
-    for (const basePath of this.paths) {
-      const doc = await this.tryLoad(join(basePath, name));
-      if (doc) return doc;  // First match wins
-    }
-    return null;
-  }
-}
-```
+## 10. Emerging & Forward-Looking (Know It Exists)
+You usually don’t need these to ship a skill, but they shape where the ecosystem is
+going:
+- **ACP (Agent Client Protocol)** — Zed’s “LSP for agents” (JSON-RPC over stdio); 25+
+  agents (Claude Code, Codex, Gemini CLI, opencode) and editors (Zed, JetBrains, Kiro).
+  Complements MCP (editor↔agent, while MCP is agent↔tools).
+  Your agent runtime speaks it; a skill author doesn’t implement it.
+- **A2A (Agent2Agent)** — Google/Linux Foundation, v1.0, 150+ orgs; for enterprise
+  agent-to-agent delegation, not skill authoring.
+  Ignore unless you build autonomous multi-agent systems.
+- **Codex App-Server** — JSON-RPC (Thread/Turn/Item) decoupling Codex logic from client
+  surfaces; relevant only for Codex-specific integration surfaces.
+- **Plugin marketplaces & `npx skills`** — distribution is consolidating: Claude Code
+  plugin marketplaces (official + community), Codex plugins, and Vercel’s
+  `npx skills add` over the skills.sh directory (cross-agent symlinks).
+- **Routines / scheduled agents, background monitors, `/run` & `/verify` skills** —
+  newer Claude Code capabilities for autonomous, event-triggered, and app-verifying
+  workflows (confirm GA vs. preview for your version before relying on them).
 * * *
-## 9. MCP Integration Patterns
-MCP (Model Context Protocol) and CLI-as-Skill are complementary approaches.
-Understanding when to use each is critical.
-| Aspect | CLI-as-Skill | MCP Server |
-| --- | --- | --- |
-| **Deployment** | npm install | Separate process |
-| **Integration** | SKILL.md + hooks | MCP protocol |
-| **Context** | Skill content in prompt | Tool calls |
-| **State** | Stateless (per-command) | Can maintain state |
-| **Scope** | Single CLI capabilities | Ecosystem of servers |
-| **Complexity** | Lower | Higher |
-**When to Use CLI-as-Skill**:
+## 11. Best-Practices Summary
-- Self-contained functionality
-- Workflow guidance and documentation
-- Resource libraries (guidelines, templates)
-- Simple tool integration
-- Quick setup requirements
+**Start simple**
-**When to Use MCP**:
+- One capability → one `SKILL.md` (name + two-part description + < 500-line body).
+  Stop.
+- Project conventions → `AGENTS.md` (concise; it loads every turn).
+- Have a CLI → make it agent-friendly (`--json`, idempotent, actionable errors) and
+  point a `SKILL.md` at it.
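One way to read "agent-friendly (`--json`, idempotent, actionable errors)" is a single render path that serves both machine and human output. The sketch below is an assumption about how that could look, not tbd's actual implementation. `Result`, `CliError`, and `render` are hypothetical names.

```typescript
// Hypothetical types: an error always carries a concrete next step.
interface CliError {
  message: string;
  nextStep: string;
}

type Result<T> =
  | { status: 'ok'; data: T }
  | { status: 'error'; error: CliError };

// One code path: --json emits the raw result, otherwise a human rendering.
function render<T>(
  result: Result<T>,
  json: boolean,
  human: (data: T) => string
): string {
  if (json) {
    return JSON.stringify(result);
  }
  if (result.status === 'ok') {
    return human(result.data);
  }
  return 'Error: ' + result.error.message + '\nNext step: ' + result.error.nextStep;
}
```

Keeping the error's `nextStep` inside the same structure means an agent parsing `--json` output receives the same guidance a human sees in the terminal.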
-- Persistent connections (databases, APIs)
-- Stateful operations
-- Cross-tool orchestration
-- Real-time data streaming
-- Complex tool ecosystems
+**Descriptions & disclosure**
-* * *
+- Two-part rule: *what it does* + *when to use it*; third person; front-load keywords.
+- Progressive disclosure: metadata → body → supporting files; bundle scripts
+  (output-only cost).
+- Respect the budget; verify the current model for your target agent (Claude Code ≈ 1%
+  of context window, not a flat char count).
-## Best Practices Summary
-### Architecture
-1. **Bundle documentation with CLI**: Self-contained packages work in all environments
-2. **Maintain tiered skill files**: Full (baseline) and brief versions for different
-   contexts
-3. **Provide a `skill` subcommand**: Output skill content to stdout with `--brief` flag
-4. **Implement fallback loading**: Support both bundled and development modes
-5. **Use platform-appropriate formats**: SKILL.md for Claude, MDC for Cursor, markers
-   for AGENTS.md
-### Context Management
-6. **Implement a `prime` command**: Dashboard at session start, brief mode for
-   constrained contexts
-7. **Implement a `skill` command**: Output pure skill content (no dashboard) for
-   inspection and installation preview
-8. **Separate skill from dashboard**: `prime` = status + context, `skill` = pure
-   documentation
-9. **Compose skills from multiple sources**: Header + baseline + dynamic directory
-10. **Include context recovery instructions**: Agents need to know how to restore
-    context
-11. **Two-level orientation only**: Full (default) and brief—avoid more granularity
-12. **Use progressive disclosure**: Level 1 (metadata) → Level 2 (skill body) → Level 3
-    (resources)
-13. **Keep SKILL.md under 500 lines**: Move detailed content to reference files
-### Description Optimization
-10. **Use the two-part rule**: What does it do? + When to use it?
-11. **Write in third person**: “Processes files” not “I can help you”
-12. **Include explicit trigger phrases**: Match how users naturally describe needs
-13. **Front-load keywords**: Put most important triggers in first 50 characters
-14. **Respect cumulative budget**: All descriptions share a ~15K character limit
-15. **Use meta-skill pattern for 50+ resources**: One skill + CLI beats 50 individual
-    skills
-### Self-Documentation
-14. **Provide documentation commands**: `readme`, `docs`, `design` as built-in commands
-15. **Include Getting Started in help epilog**: One-liner must be easily accessible
-16. **Add IMPORTANT section to help**: Prominently reference `prime` command for context
-    restoration
-### Setup Flows
-17. **Two-tier command structure**: High-level (`setup`) and surgical (`init`)
-18. **Require explicit mode flags**: `--auto` for agents, `--interactive` for humans
-19. **Make setup idempotent**: Safe to run multiple times without errors or duplicates
-20. **Deduplicate hooks on each run**: Filter existing hooks before merging new ones
-21. **Regenerate skill files**: Always overwrite with fresh content, don’t try to update
-    in place
-22. **Clean up legacy patterns**: Remove deprecated files/configs on each setup run
-23. **Never guess user preferences**: For taste-based config (prefixes), always ask
-### Agent Integration
-24. **Install hooks programmatically**: SessionStart, PreCompact, PostToolUse
-25. **Use skill directories**: `.claude/skills/`, `.cursor/rules/`
-26. **Support multiple agents**: Single CLI, multiple integration points
-27. **Structure help for agent discovery**: IMPORTANT section, Getting Started
-    one-liner, prominent setup commands
-### Output
-28. **Implement `--json` for all commands**: Machine-readable output is essential
-29. **Use `output.data()` pattern**: Single code path for JSON and human output
-30. **Provide `--quiet` mode**: For scripted usage without noise
-### Error Handling
-31. **Include next steps in errors**: Actionable guidance, not just error messages
-32. **Graceful deprecation**: Keep old commands working with migration guidance
-33. **Explicit completion protocols**: Checklists prevent premature completion
-### Agent Mental Model
-34. **Design for agent-as-partner**: Help agents serve users, not relay commands
-35. **Lead with value proposition**: Explain *why* before *how*
-36. **Distinguish action from informational commands**: Some commands teach, not do
-### Resource Libraries
-37. **Bundle guidelines, shortcuts, templates**: Ship curated knowledge with CLI
-38. **Show full commands in directories**: `cli shortcut X`, not just `X`
-39. **Organize resources by purpose**: Categories by workflow phase or domain
-40. **Enable on-demand knowledge queries**: Agents pull in relevant resources JIT
-41. **Implement shadowing for customization**: Project-level overrides without forking
-42. **Generate directories dynamically**: Avoid stale documentation
-43. **Use HTML markers for updatable sections**: Enable partial updates without full
-    regeneration
-### Context Injection
-44. **Design self-reinforcing context chains**: SKILL.md → guidelines → actions
-45. **Reference commands explicitly**: Always `cli command arg`, never vague prose
-46. **Limit chain depth to 3**: Avoid deep reference chains that confuse agents
-47. **Make every layer actionable**: Each context injection should lead to actions
-### Task Management
-48. **Choose appropriate tracking strategy**: Ephemeral, session-local, or persistent
-49. **Implement work discovery**: `ready` or `next` commands for session start
-50. **Add session boundary enforcement**: Remind agents to sync/close at session end
-51. **Consider tbd integration**: For persistent multi-session task tracking
-* * *
+**Scale up only when needed**
-## Integration Checklist for New CLIs
+- Many capabilities → meta-skill + informational, self-injecting subcommands (one
+  listing slot, unbounded resources).
+  This is tbd’s validated approach.
+- Path-ordered resource cache for project/user shadowing; generate `--list` dynamically.
+- Context-injection loop with explicit `cli command arg` references; depth ≤ 3.
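The path-ordered cache with shadowing can be sketched with in-memory layers standing in for directories. This is an illustrative assumption, not tbd's `DocCache`. `Layer`, `resolveDoc`, and `listDocs` are hypothetical names, and a real version would read from disk.

```typescript
// Hypothetical layer: a label plus a name-to-content map (stands in for a directory).
interface Layer {
  label: string;
  docs: Record<string, string>;
}

// Earlier layers shadow later ones, like entries on a shell $PATH.
function resolveDoc(
  name: string,
  layers: Layer[]
): { source: string; content: string } | null {
  for (const layer of layers) {
    if (Object.prototype.hasOwnProperty.call(layer.docs, name)) {
      return { source: layer.label, content: layer.docs[name] };
    }
  }
  return null;
}

// Build the `--list` view dynamically: the union of names across all layers.
function listDocs(layers: Layer[]): string[] {
  const seen: Record<string, boolean> = {};
  for (const layer of layers) {
    for (const name of Object.keys(layer.docs)) {
      seen[name] = true;
    }
  }
  return Object.keys(seen).sort();
}
```

Ordering the layers as project, then user, then built-in lets a repository override a bundled document without forking the CLI, while the listing still shows every available name exactly once.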
-**Agent Integration Files**
+**Reach & surface**
-- [ ] SKILL.md with YAML frontmatter (name, description, allowed-tools)
-- [ ] CURSOR.mdc with MDC frontmatter (description, alwaysApply)
-- [ ] AGENTS.md section with HTML markers
-- [ ] Tiered skill files: skill-baseline.md, skill-brief.md
-- [ ] Separate header file: claude-header.md with YAML frontmatter
+- Layer for reach: `AGENTS.md` + `SKILL.md` + CLI + (MCP if no CLI fits).
+- Prefer CLI over MCP when a CLI exists (cheaper, more reliable); use MCP for
+  auth/multi-tenant/remote; consider code-execution mode for many-tool MCP servers.
+- Add agent-specific files (`.cursor/rules`, plugins, ACP) last, only where they pay
+  off.
-**Description Quality**
+**Operate safely**
-- [ ] Two-part description: capabilities + activation triggers
-- [ ] Third-person language only
-- [ ] Explicit “Use when …” trigger phrases matching user language
-- [ ] Front-load important keywords in first 50 characters
-- [ ] Description length appropriate for skill collection size (≤130 chars for 60+
-  skills)
+- Treat all skill/instruction content and tool output as untrusted; vet and pin
+  third-party skills.
+- Scope `allowed-tools` tightly; gate destructive skills; design for sandboxes.
+- Idempotent multi-agent install with marker-bounded sections; version source files, not
+  fully generated install artifacts; mark generated files “DO NOT EDIT.”
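An idempotent, marker-bounded install can be sketched as an upsert over a file's text: replace the section between the markers if it exists, otherwise append it. The marker strings and the `upsertSection` name below are assumptions for illustration, not tbd's actual markers.

```typescript
// Hypothetical markers; a real tool would pick its own unique pair.
const BEGIN = '<!-- BEGIN MYCLI INTEGRATION -->';
const END = '<!-- END MYCLI INTEGRATION -->';

// Insert or replace the marker-bounded section, leaving other content untouched.
function upsertSection(existing: string, body: string): string {
  const block = BEGIN + '\n' + body.trim() + '\n' + END;
  const start = existing.indexOf(BEGIN);
  const end = existing.indexOf(END);
  if (start !== -1 && end > start) {
    // Section already present: swap only the bounded region.
    return existing.slice(0, start) + block + existing.slice(end + END.length);
  }
  // No section yet: append after existing content with one blank line.
  const base = existing.replace(/\s+$/, '');
  return (base ? base + '\n\n' : '') + block + '\n';
}
```

Running the function twice with the same body yields the same text, which is the property that makes repeated `setup` runs safe. Hand-written content outside the markers survives every run.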
-**Budget Management** (for CLIs installing multiple skills)
-- [ ] Calculate cumulative description size (descriptions + ~109 chars overhead each)
-- [ ] Verify total stays under 15K character budget
-- [ ] Use meta-skill pattern if resources exceed 50
-- [ ] Run `/context` to verify skills aren’t being truncated
-**Context Management**
-- [ ] `prime` command with dashboard and brief modes (two levels only)
-- [ ] `skill` command for full documentation output with `--brief` flag
-- [ ] Skill composition from header + baseline + dynamic directory
-- [ ] HTML markers for updatable sections (<!-- BEGIN/END -->)
-- [ ] “DO NOT EDIT” warnings in generated skill files
-- [ ] Value-first orientation in skill file (why before how)
-- [ ] Context recovery instructions in all docs
-- [ ] Session closing protocol checklist
-- [ ] SKILL.md under 500 lines (progressive disclosure)
-**Setup Flow**
-- [ ] `setup --auto` for agent-friendly installation
-- [ ] `init --prefix` for surgical initialization
-- [ ] Multi-contributor detection (skip init if already configured)
-- [ ] Setup is idempotent (safe to run multiple times)
-- [ ] Hook deduplication (filter existing before merging)
-- [ ] Skill file regeneration (always overwrite, don’t update in place)
-- [ ] Legacy pattern cleanup on each setup run
-**Hooks**
-- [ ] SessionStart hook to call `prime`
-- [ ] PreCompact hook to call `prime`
-- [ ] PostToolUse hook for session completion reminders
-**Self-Documentation**
-- [ ] Help epilog with “IMPORTANT” section referencing `prime`
-- [ ] Help epilog with “Getting Started” one-liner installation command
-- [ ] Setup command in dedicated “Setup & Configuration” category
-- [ ] Context recovery prompt when CLI runs without args
-- [ ] Documentation commands (`readme`, `docs`)
-- [ ] `--json` flag on all commands
-**Resource Libraries**
-- [ ] `shortcut` command with `--list` and category filtering
-- [ ] `guidelines` command with `--list` and category filtering
-- [ ] `template` command with `--list`
-- [ ] Resources organized by purpose/workflow phase
-- [ ] Resource directories generated dynamically
-- [ ] Shadowing support for project-level overrides
-**Task Management**
+* * *
-- [ ] Decide tracking strategy: ephemeral, session-local, or persistent
-- [ ] Implement work discovery command (`ready`, `next`)
-- [ ] Add session closing reminders for task sync
+## 12. Integration Checklist
+**Baseline (every skill)**
+- [ ] `SKILL.md` with `name` + two-part `description`
+- [ ] Body < 500 lines; bulky material in supporting files one level deep
+- [ ] Third-person description, trigger keywords front-loaded
+- [ ] Installable via commit to `.claude/skills/` and/or `npx skills add`
+**Project**
+- [ ] `AGENTS.md` with build/test/style/conventions (concise)
+- [ ] `CLAUDE.md` strategy decided (symlink to `AGENTS.md`, copy, or separate)
+**CLI tool (if applicable)**
+- [ ] `--json` on all commands; `--brief`/`--quiet`; actionable errors
+- [ ] Idempotent `setup --auto`; `init` for surgical config
+- [ ] Help epilog with `IMPORTANT:` + Getting Started one-liner
+- [ ] `prime` (status/context) and `skill` (pure docs) commands
+- [ ] Invocation strategy chosen (§6.7): pinned zero-install (`npx`/`uvx <pkg>@<ver>`)
+  by default, or global install + `SessionStart` bootstrap for cloud/ephemeral agents
+**Advanced (many subcommands / knowledge library)**
+- [ ] Meta-skill composition (header + baseline + dynamic directory)
+- [ ] Informational commands (`guidelines`/`shortcut`/`template`) with `--list`
+- [ ] Path-ordered DocCache with shadowing
+- [ ] Tiered skill files (baseline + brief)
+- [ ] Context-injection loop, explicit references, depth ≤ 3
+**Reach**
+- [ ] Decide target agents; add per-agent files only where needed
+- [ ] MCP server only if no CLI fits, or for OAuth/multi-tenant/remote
+- [ ] Marker-bounded multi-agent install; “DO NOT EDIT” on generated files
+**Security**
+- [ ] Third-party skills vetted, scanned, and pinned
+- [ ] `allowed-tools` minimally scoped; destructive skills gated
+- [ ] CLI works within agent sandboxes and degrades gracefully
 * * *
 ## References
-### Official Documentation
-- Claude Code Skills Documentation: https://code.claude.com/docs/en/skills
-- Agent Skills Open Standard: https://agentskills.io
-- Cursor IDE MDC Format: https://cursor.sh/docs/rules
-### Community Resources
-- Claude Skills Best Practices:
-  https://github.com/Dicklesworthstone/meta_skill/blob/main/BEST_PRACTICES_FOR_WRITING_AND_USING_SKILLS_MD_FILES.md
-- Claude Code Skills Guide (Gist):
-  https://gist.github.com/mellanon/50816550ecb5f3b239aa77eef7b8ed8d
-- Awesome Claude Skills: https://github.com/travisvn/awesome-claude-skills
-- Skills Character Budget Research:
-  https://github.com/anthropics/claude-code/issues/11045
-- Skill Activation Reliability Testing:
-  https://scottspence.com/posts/how-to-make-claude-code-skills-activate-reliably
-### MCP Resources
-- Anthropic MCP Engineering Blog:
-  https://www.anthropic.com/engineering/code-execution-with-mcp
-- MCP Agent Frameworks Comparison:
-  https://clickhouse.com/blog/how-to-build-ai-agents-mcp-12-frameworks
-- GitHub MCP Registry: https://github.com/modelcontextprotocol
-### Implementation References
-- Commander.js: https://github.com/tj/commander.js
-- tbd Source Code: https://github.com/jlevy/tbd
+### Open standards & governance
+- Agent Skills standard: https://agentskills.io (spec:
+  https://agentskills.io/specification)
+- AGENTS.md: https://agents.md
+- Agentic AI Foundation (Linux Foundation):
+  https://www.linuxfoundation.org/press/linux-foundation-announces-the-formation-of-the-agentic-ai-foundation
+### Claude Code
+- Skills: https://code.claude.com/docs/en/skills
+- Hooks: https://code.claude.com/docs/en/hooks
+- Plugins: https://code.claude.com/docs/en/plugins
+- Skill authoring best practices:
+  https://platform.claude.com/docs/en/agents-and-tools/agent-skills/best-practices
+- Sandboxing: https://www.anthropic.com/engineering/claude-code-sandboxing
+### Codex
+- AGENTS.md: https://developers.openai.com/codex/guides/agents-md
+- Config reference: https://developers.openai.com/codex/config-reference
+- Skills: https://developers.openai.com/codex/skills
+- MCP: https://developers.openai.com/codex/mcp
+- Sandboxing: https://developers.openai.com/codex/concepts/sandboxing
+### Other agents
+- Cursor rules: https://cursor.com/docs/rules
+- GitHub Copilot custom instructions:
+  https://docs.github.com/copilot/customizing-copilot/adding-custom-instructions-for-github-copilot
+- Gemini CLI: https://geminicli.com/docs/cli/gemini-md/
+- Windsurf AGENTS.md: https://docs.windsurf.com/windsurf/cascade/agents-md
+- Cline rules: https://docs.cline.bot/customization/cline-rules
+- Aider conventions: https://aider.chat/docs/usage/conventions.html
+- opencode: https://opencode.ai/docs/rules/
+- Amp: https://ampcode.com/manual
+- pi: https://github.com/badlogic/pi-mono
+### MCP & protocols
+- 2026 MCP roadmap: https://blog.modelcontextprotocol.io/posts/2026-mcp-roadmap/
+- Code execution with MCP: https://www.anthropic.com/engineering/code-execution-with-mcp
+- CLI vs MCP benchmarks: https://www.firecrawl.dev/blog/mcp-vs-cli
+- ACP: https://zed.dev/acp
+- A2A:
+  https://www.linuxfoundation.org/press/a2a-protocol-surpasses-150-organizations-lands-in-major-cloud-platforms-and-sees-enterprise-production-use-in-first-year
+### Distribution & ecosystem
+- Vercel skills / skills.sh:
+  https://vercel.com/changelog/introducing-skills-the-open-agent-skills-ecosystem
+- npx skills: https://github.com/vercel-labs/skills
+- Anthropic skills (examples): https://github.com/anthropics/skills
+- gstack: https://github.com/garrytan/gstack
+- Beads (bd): https://github.com/gastownhall/beads
+### Security
+- Securing the skill ecosystem (Snyk):
+  https://snyk.io/blog/snyk-vercel-securing-agent-skill-ecosystem/
+- Malicious-skill research:
+  https://labs.reversec.com/posts/2026/05/skill-issues-compromising-claude-code-with-malicious-skills-agents-part-1
 ## Related Guidelines
-- For TypeScript CLI implementation details, see
-  `tbd guidelines typescript-cli-tool-rules`
-- For testing patterns, see `tbd guidelines general-testing-rules`
-- For monorepo setup, see `tbd guidelines pnpm-monorepo-patterns` or
-  `bun-monorepo-patterns`
+- TypeScript CLI implementation: `tbd guidelines typescript-cli-tool-rules`
+- Supply-chain / dependency currency: `tbd guidelines supply-chain-hardening`
+  (cross-ecosystem policy), or `tbd guidelines bun-monorepo-patterns` /
+  `pnpm-monorepo-patterns` for monorepo specifics
+- Testing: `tbd guidelines general-testing-rules`