npm - @bgicli/bgicli - Versions diffs - 2.2.8 → 2.2.10 - Mend

@bgicli/bgicli 2.2.8 → 2.2.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (113) hide show

package/data/skills/anthropic-algorithmic-art/SKILL.md +405 -0
package/data/skills/anthropic-canvas-design/SKILL.md +130 -0
package/data/skills/anthropic-claude-api/SKILL.md +243 -0
package/data/skills/anthropic-doc-coauthoring/SKILL.md +375 -0
package/data/skills/anthropic-docx/SKILL.md +590 -0
package/data/skills/anthropic-frontend-design/SKILL.md +42 -0
package/data/skills/anthropic-internal-comms/SKILL.md +32 -0
package/data/skills/anthropic-mcp-builder/SKILL.md +236 -0
package/data/skills/anthropic-pdf/SKILL.md +314 -0
package/data/skills/anthropic-pptx/SKILL.md +232 -0
package/data/skills/anthropic-skill-creator/SKILL.md +485 -0
package/data/skills/anthropic-webapp-testing/SKILL.md +96 -0
package/data/skills/anthropic-xlsx/SKILL.md +292 -0
package/data/skills/arxiv-database/SKILL.md +362 -0
package/data/skills/astropy/SKILL.md +329 -0
package/data/skills/ctx-advanced-evaluation/SKILL.md +402 -0
package/data/skills/ctx-bdi-mental-states/SKILL.md +311 -0
package/data/skills/ctx-context-compression/SKILL.md +272 -0
package/data/skills/ctx-context-degradation/SKILL.md +206 -0
package/data/skills/ctx-context-fundamentals/SKILL.md +201 -0
package/data/skills/ctx-context-optimization/SKILL.md +195 -0
package/data/skills/ctx-evaluation/SKILL.md +251 -0
package/data/skills/ctx-filesystem-context/SKILL.md +287 -0
package/data/skills/ctx-hosted-agents/SKILL.md +260 -0
package/data/skills/ctx-memory-systems/SKILL.md +225 -0
package/data/skills/ctx-multi-agent-patterns/SKILL.md +257 -0
package/data/skills/ctx-project-development/SKILL.md +291 -0
package/data/skills/ctx-tool-design/SKILL.md +271 -0
package/data/skills/dhdna-profiler/SKILL.md +162 -0
package/data/skills/generate-image/SKILL.md +183 -0
package/data/skills/geomaster/SKILL.md +365 -0
package/data/skills/get-available-resources/SKILL.md +275 -0
package/data/skills/hamelsmu-build-review-interface/SKILL.md +96 -0
package/data/skills/hamelsmu-error-analysis/SKILL.md +164 -0
package/data/skills/hamelsmu-eval-audit/SKILL.md +183 -0
package/data/skills/hamelsmu-evaluate-rag/SKILL.md +177 -0
package/data/skills/hamelsmu-generate-synthetic-data/SKILL.md +131 -0
package/data/skills/hamelsmu-validate-evaluator/SKILL.md +212 -0
package/data/skills/hamelsmu-write-judge-prompt/SKILL.md +144 -0
package/data/skills/hf-cli/SKILL.md +174 -0
package/data/skills/hf-mcp/SKILL.md +178 -0
package/data/skills/hugging-face-dataset-viewer/SKILL.md +121 -0
package/data/skills/hugging-face-datasets/SKILL.md +542 -0
package/data/skills/hugging-face-evaluation/SKILL.md +651 -0
package/data/skills/hugging-face-jobs/SKILL.md +1042 -0
package/data/skills/hugging-face-model-trainer/SKILL.md +717 -0
package/data/skills/hugging-face-paper-pages/SKILL.md +239 -0
package/data/skills/hugging-face-paper-publisher/SKILL.md +624 -0
package/data/skills/hugging-face-tool-builder/SKILL.md +110 -0
package/data/skills/hugging-face-trackio/SKILL.md +115 -0
package/data/skills/hugging-face-vision-trainer/SKILL.md +593 -0
package/data/skills/huggingface-gradio/SKILL.md +245 -0
package/data/skills/matlab/SKILL.md +376 -0
package/data/skills/modal/SKILL.md +381 -0
package/data/skills/openai-cloudflare-deploy/SKILL.md +224 -0
package/data/skills/openai-develop-web-game/SKILL.md +149 -0
package/data/skills/openai-doc/SKILL.md +80 -0
package/data/skills/openai-figma/SKILL.md +42 -0
package/data/skills/openai-figma-implement-design/SKILL.md +264 -0
package/data/skills/openai-gh-address-comments/SKILL.md +25 -0
package/data/skills/openai-gh-fix-ci/SKILL.md +69 -0
package/data/skills/openai-imagegen/SKILL.md +174 -0
package/data/skills/openai-jupyter-notebook/SKILL.md +107 -0
package/data/skills/openai-linear/SKILL.md +87 -0
package/data/skills/openai-netlify-deploy/SKILL.md +247 -0
package/data/skills/openai-notion-knowledge-capture/SKILL.md +56 -0
package/data/skills/openai-notion-meeting-intelligence/SKILL.md +60 -0
package/data/skills/openai-notion-research-documentation/SKILL.md +59 -0
package/data/skills/openai-notion-spec-to-implementation/SKILL.md +58 -0
package/data/skills/openai-openai-docs/SKILL.md +69 -0
package/data/skills/openai-pdf/SKILL.md +67 -0
package/data/skills/openai-playwright/SKILL.md +147 -0
package/data/skills/openai-render-deploy/SKILL.md +479 -0
package/data/skills/openai-screenshot/SKILL.md +267 -0
package/data/skills/openai-security-best-practices/SKILL.md +86 -0
package/data/skills/openai-security-ownership-map/SKILL.md +206 -0
package/data/skills/openai-security-threat-model/SKILL.md +81 -0
package/data/skills/openai-sentry/SKILL.md +123 -0
package/data/skills/openai-sora/SKILL.md +178 -0
package/data/skills/openai-speech/SKILL.md +144 -0
package/data/skills/openai-spreadsheet/SKILL.md +145 -0
package/data/skills/openai-transcribe/SKILL.md +81 -0
package/data/skills/openai-vercel-deploy/SKILL.md +77 -0
package/data/skills/openai-yeet/SKILL.md +28 -0
package/data/skills/pennylane/SKILL.md +224 -0
package/data/skills/polars-bio/SKILL.md +374 -0
package/data/skills/primekg/SKILL.md +97 -0
package/data/skills/pymatgen/SKILL.md +689 -0
package/data/skills/qiskit/SKILL.md +273 -0
package/data/skills/qutip/SKILL.md +316 -0
package/data/skills/recursive-decomposition/SKILL.md +185 -0
package/data/skills/rowan/SKILL.md +427 -0
package/data/skills/scholar-evaluation/SKILL.md +298 -0
package/data/skills/sentry-create-alert/SKILL.md +210 -0
package/data/skills/sentry-fix-issues/SKILL.md +126 -0
package/data/skills/sentry-pr-code-review/SKILL.md +105 -0
package/data/skills/sentry-python-sdk/SKILL.md +317 -0
package/data/skills/sentry-setup-ai-monitoring/SKILL.md +217 -0
package/data/skills/stable-baselines3/SKILL.md +297 -0
package/data/skills/sympy/SKILL.md +498 -0
package/data/skills/trailofbits-ask-questions-if-underspecified/SKILL.md +85 -0
package/data/skills/trailofbits-audit-context-building/SKILL.md +302 -0
package/data/skills/trailofbits-differential-review/SKILL.md +220 -0
package/data/skills/trailofbits-insecure-defaults/SKILL.md +117 -0
package/data/skills/trailofbits-modern-python/SKILL.md +333 -0
package/data/skills/trailofbits-property-based-testing/SKILL.md +123 -0
package/data/skills/trailofbits-semgrep-rule-creator/SKILL.md +172 -0
package/data/skills/trailofbits-sharp-edges/SKILL.md +292 -0
package/data/skills/trailofbits-variant-analysis/SKILL.md +142 -0
package/data/skills/transformers.js/SKILL.md +637 -0
package/data/skills/writing/SKILL.md +419 -0
package/dist/bgi.js +66 -2
package/package.json +1 -1

package/data/skills/anthropic-claude-api/SKILL.md ADDED Viewed

@@ -0,0 +1,243 @@
+---
+name: claude-api
+description: "Build apps with the Claude API or Anthropic SDK. TRIGGER when: code imports `anthropic`/`@anthropic-ai/sdk`/`claude_agent_sdk`, or user asks to use Claude API, Anthropic SDKs, or Agent SDK. DO NOT TRIGGER when: code imports `openai`/other AI SDK, general programming, or ML/data-science tasks."
+license: Complete terms in LICENSE.txt
+---
+# Building LLM-Powered Applications with Claude
+This skill helps you build LLM-powered applications with Claude. Choose the right surface based on your needs, detect the project language, then read the relevant language-specific documentation.
+## Defaults
+Unless the user requests otherwise:
+For the Claude model version, please use Claude Opus 4.6, which you can access via the exact model string `claude-opus-4-6`. Please default to using adaptive thinking (`thinking: {type: "adaptive"}`) for anything remotely complicated. And finally, please default to streaming for any request that may involve long input, long output, or high `max_tokens` — it prevents hitting request timeouts. Use the SDK's `.get_final_message()` / `.finalMessage()` helper to get the complete response if you don't need to handle individual stream events
+---
+## Language Detection
+Before reading code examples, determine which language the user is working in:
+1. **Look at project files** to infer the language:
+   - `*.py`, `requirements.txt`, `pyproject.toml`, `setup.py`, `Pipfile` → **Python** — read from `python/`
+   - `*.ts`, `*.tsx`, `package.json`, `tsconfig.json` → **TypeScript** — read from `typescript/`
+   - `*.js`, `*.jsx` (no `.ts` files present) → **TypeScript** — JS uses the same SDK, read from `typescript/`
+   - `*.java`, `pom.xml`, `build.gradle` → **Java** — read from `java/`
+   - `*.kt`, `*.kts`, `build.gradle.kts` → **Java** — Kotlin uses the Java SDK, read from `java/`
+   - `*.scala`, `build.sbt` → **Java** — Scala uses the Java SDK, read from `java/`
+   - `*.go`, `go.mod` → **Go** — read from `go/`
+   - `*.rb`, `Gemfile` → **Ruby** — read from `ruby/`
+   - `*.cs`, `*.csproj` → **C#** — read from `csharp/`
+   - `*.php`, `composer.json` → **PHP** — read from `php/`
+2. **If multiple languages detected** (e.g., both Python and TypeScript files):
+   - Check which language the user's current file or question relates to
+   - If still ambiguous, ask: "I detected both Python and TypeScript files. Which language are you using for the Claude API integration?"
+3. **If language can't be inferred** (empty project, no source files, or unsupported language):
+   - Use AskUserQuestion with options: Python, TypeScript, Java, Go, Ruby, cURL/raw HTTP, C#, PHP
+   - If AskUserQuestion is unavailable, default to Python examples and note: "Showing Python examples. Let me know if you need a different language."
+4. **If unsupported language detected** (Rust, Swift, C++, Elixir, etc.):
+   - Suggest cURL/raw HTTP examples from `curl/` and note that community SDKs may exist
+   - Offer to show Python or TypeScript examples as reference implementations
+5. **If user needs cURL/raw HTTP examples**, read from `curl/`.
+### Language-Specific Feature Support
+| Language   | Tool Runner | Agent SDK | Notes                                 |
+| ---------- | ----------- | --------- | ------------------------------------- |
+| Python     | Yes (beta)  | Yes       | Full support — `@beta_tool` decorator |
+| TypeScript | Yes (beta)  | Yes       | Full support — `betaZodTool` + Zod    |
+| Java       | Yes (beta)  | No        | Beta tool use with annotated classes  |
+| Go         | Yes (beta)  | No        | `BetaToolRunner` in `toolrunner` pkg  |
+| Ruby       | Yes (beta)  | No        | `BaseTool` + `tool_runner` in beta    |
+| cURL       | N/A         | N/A       | Raw HTTP, no SDK features             |
+| C#         | No          | No        | Official SDK                          |
+| PHP        | No          | No        | Official SDK                          |
+---
+## Which Surface Should I Use?
+> **Start simple.** Default to the simplest tier that meets your needs. Single API calls and workflows handle most use cases — only reach for agents when the task genuinely requires open-ended, model-driven exploration.
+| Use Case                                        | Tier            | Recommended Surface       | Why                                     |
+| ----------------------------------------------- | --------------- | ------------------------- | --------------------------------------- |
+| Classification, summarization, extraction, Q&A  | Single LLM call | **Claude API**            | One request, one response               |
+| Batch processing or embeddings                  | Single LLM call | **Claude API**            | Specialized endpoints                   |
+| Multi-step pipelines with code-controlled logic | Workflow        | **Claude API + tool use** | You orchestrate the loop                |
+| Custom agent with your own tools                | Agent           | **Claude API + tool use** | Maximum flexibility                     |
+| AI agent with file/web/terminal access          | Agent           | **Agent SDK**             | Built-in tools, safety, and MCP support |
+| Agentic coding assistant                        | Agent           | **Agent SDK**             | Designed for this use case              |
+| Want built-in permissions and guardrails        | Agent           | **Agent SDK**             | Safety features included                |
+> **Note:** The Agent SDK is for when you want built-in file/web/terminal tools, permissions, and MCP out of the box. If you want to build an agent with your own tools, Claude API is the right choice — use the tool runner for automatic loop handling, or the manual loop for fine-grained control (approval gates, custom logging, conditional execution).
+### Decision Tree
+```
+What does your application need?
+1. Single LLM call (classification, summarization, extraction, Q&A)
+   └── Claude API — one request, one response
+2. Does Claude need to read/write files, browse the web, or run shell commands
+   as part of its work? (Not: does your app read a file and hand it to Claude —
+   does Claude itself need to discover and access files/web/shell?)
+   └── Yes → Agent SDK — built-in tools, don't reimplement them
+       Examples: "scan a codebase for bugs", "summarize every file in a directory",
+                 "find bugs using subagents", "research a topic via web search"
+3. Workflow (multi-step, code-orchestrated, with your own tools)
+   └── Claude API with tool use — you control the loop
+4. Open-ended agent (model decides its own trajectory, your own tools)
+   └── Claude API agentic loop (maximum flexibility)
+```
+### Should I Build an Agent?
+Before choosing the agent tier, check all four criteria:
+- **Complexity** — Is the task multi-step and hard to fully specify in advance? (e.g., "turn this design doc into a PR" vs. "extract the title from this PDF")
+- **Value** — Does the outcome justify higher cost and latency?
+- **Viability** — Is Claude capable at this task type?
+- **Cost of error** — Can errors be caught and recovered from? (tests, review, rollback)
+If the answer is "no" to any of these, stay at a simpler tier (single call or workflow).
+---
+## Architecture
+Everything goes through `POST /v1/messages`. Tools and output constraints are features of this single endpoint — not separate APIs.
+**User-defined tools** — You define tools (via decorators, Zod schemas, or raw JSON), and the SDK's tool runner handles calling the API, executing your functions, and looping until Claude is done. For full control, you can write the loop manually.
+**Server-side tools** — Anthropic-hosted tools that run on Anthropic's infrastructure. Code execution is fully server-side (declare it in `tools`, Claude runs code automatically). Computer use can be server-hosted or self-hosted.
+**Structured outputs** — Constrains the Messages API response format (`output_config.format`) and/or tool parameter validation (`strict: true`). The recommended approach is `client.messages.parse()` which validates responses against your schema automatically. Note: the old `output_format` parameter is deprecated; use `output_config: {format: {...}}` on `messages.create()`.
+**Supporting endpoints** — Batches (`POST /v1/messages/batches`), Files (`POST /v1/files`), and Token Counting feed into or support Messages API requests.
+---
+## Current Models (cached: 2026-02-17)
+| Model             | Model ID            | Context        | Input $/1M | Output $/1M |
+| ----------------- | ------------------- | -------------- | ---------- | ----------- |
+| Claude Opus 4.6   | `claude-opus-4-6`   | 200K (1M beta) | $5.00      | $25.00      |
+| Claude Sonnet 4.6 | `claude-sonnet-4-6` | 200K (1M beta) | $3.00      | $15.00      |
+| Claude Haiku 4.5  | `claude-haiku-4-5`  | 200K           | $1.00      | $5.00       |
+**ALWAYS use `claude-opus-4-6` unless the user explicitly names a different model.** This is non-negotiable. Do not use `claude-sonnet-4-6`, `claude-sonnet-4-5`, or any other model unless the user literally says "use sonnet" or "use haiku". Never downgrade for cost — that's the user's decision, not yours.
+**CRITICAL: Use only the exact model ID strings from the table above — they are complete as-is. Do not append date suffixes.** For example, use `claude-sonnet-4-5`, never `claude-sonnet-4-5-20250514` or any other date-suffixed variant you might recall from training data. If the user requests an older model not in the table (e.g., "opus 4.5", "sonnet 3.7"), read `shared/models.md` for the exact ID — do not construct one yourself.
+A note: if any of the model strings above look unfamiliar to you, that's to be expected — that just means they were released after your training data cutoff. Rest assured they are real models; we wouldn't mess with you like that.
+---
+## Thinking & Effort (Quick Reference)
+**Opus 4.6 — Adaptive thinking (recommended):** Use `thinking: {type: "adaptive"}`. Claude dynamically decides when and how much to think. No `budget_tokens` needed — `budget_tokens` is deprecated on Opus 4.6 and Sonnet 4.6 and must not be used. Adaptive thinking also automatically enables interleaved thinking (no beta header needed). **When the user asks for "extended thinking", a "thinking budget", or `budget_tokens`: always use Opus 4.6 with `thinking: {type: "adaptive"}`. The concept of a fixed token budget for thinking is deprecated — adaptive thinking replaces it. Do NOT use `budget_tokens` and do NOT switch to an older model.**
+**Effort parameter (GA, no beta header):** Controls thinking depth and overall token spend via `output_config: {effort: "low"|"medium"|"high"|"max"}` (inside `output_config`, not top-level). Default is `high` (equivalent to omitting it). `max` is Opus 4.6 only. Works on Opus 4.5, Opus 4.6, and Sonnet 4.6. Will error on Sonnet 4.5 / Haiku 4.5. Combine with adaptive thinking for the best cost-quality tradeoffs. Use `low` for subagents or simple tasks; `max` for the deepest reasoning.
+**Sonnet 4.6:** Supports adaptive thinking (`thinking: {type: "adaptive"}`). `budget_tokens` is deprecated on Sonnet 4.6 — use adaptive thinking instead.
+**Older models (only if explicitly requested):** If the user specifically asks for Sonnet 4.5 or another older model, use `thinking: {type: "enabled", budget_tokens: N}`. `budget_tokens` must be less than `max_tokens` (minimum 1024). Never choose an older model just because the user mentions `budget_tokens` — use Opus 4.6 with adaptive thinking instead.
+---
+## Compaction (Quick Reference)
+**Beta, Opus 4.6 only.** For long-running conversations that may exceed the 200K context window, enable server-side compaction. The API automatically summarizes earlier context when it approaches the trigger threshold (default: 150K tokens). Requires beta header `compact-2026-01-12`.
+**Critical:** Append `response.content` (not just the text) back to your messages on every turn. Compaction blocks in the response must be preserved — the API uses them to replace the compacted history on the next request. Extracting only the text string and appending that will silently lose the compaction state.
+See `{lang}/claude-api/README.md` (Compaction section) for code examples. Full docs via WebFetch in `shared/live-sources.md`.
+---
+## Reading Guide
+After detecting the language, read the relevant files based on what the user needs:
+### Quick Task Reference
+**Single text classification/summarization/extraction/Q&A:**
+→ Read only `{lang}/claude-api/README.md`
+**Chat UI or real-time response display:**
+→ Read `{lang}/claude-api/README.md` + `{lang}/claude-api/streaming.md`
+**Long-running conversations (may exceed context window):**
+→ Read `{lang}/claude-api/README.md` — see Compaction section
+**Function calling / tool use / agents:**
+→ Read `{lang}/claude-api/README.md` + `shared/tool-use-concepts.md` + `{lang}/claude-api/tool-use.md`
+**Batch processing (non-latency-sensitive):**
+→ Read `{lang}/claude-api/README.md` + `{lang}/claude-api/batches.md`
+**File uploads across multiple requests:**
+→ Read `{lang}/claude-api/README.md` + `{lang}/claude-api/files-api.md`
+**Agent with built-in tools (file/web/terminal):**
+→ Read `{lang}/agent-sdk/README.md` + `{lang}/agent-sdk/patterns.md`
+### Claude API (Full File Reference)
+Read the **language-specific Claude API folder** (`{language}/claude-api/`):
+1. **`{language}/claude-api/README.md`** — **Read this first.** Installation, quick start, common patterns, error handling.
+2. **`shared/tool-use-concepts.md`** — Read when the user needs function calling, code execution, memory, or structured outputs. Covers conceptual foundations.
+3. **`{language}/claude-api/tool-use.md`** — Read for language-specific tool use code examples (tool runner, manual loop, code execution, memory, structured outputs).
+4. **`{language}/claude-api/streaming.md`** — Read when building chat UIs or interfaces that display responses incrementally.
+5. **`{language}/claude-api/batches.md`** — Read when processing many requests offline (not latency-sensitive). Runs asynchronously at 50% cost.
+6. **`{language}/claude-api/files-api.md`** — Read when sending the same file across multiple requests without re-uploading.
+7. **`shared/error-codes.md`** — Read when debugging HTTP errors or implementing error handling.
+8. **`shared/live-sources.md`** — WebFetch URLs for fetching the latest official documentation.
+> **Note:** For Java, Go, Ruby, C#, PHP, and cURL — these have a single file each covering all basics. Read that file plus `shared/tool-use-concepts.md` and `shared/error-codes.md` as needed.
+### Agent SDK
+Read the **language-specific Agent SDK folder** (`{language}/agent-sdk/`). Agent SDK is available for **Python and TypeScript only**.
+1. **`{language}/agent-sdk/README.md`** — Installation, quick start, built-in tools, permissions, MCP, hooks.
+2. **`{language}/agent-sdk/patterns.md`** — Custom tools, hooks, subagents, MCP integration, session resumption.
+3. **`shared/live-sources.md`** — WebFetch URLs for current Agent SDK docs.
+---
+## When to Use WebFetch
+Use WebFetch to get the latest documentation when:
+- User asks for "latest" or "current" information
+- Cached data seems incorrect
+- User asks about features not covered here
+Live documentation URLs are in `shared/live-sources.md`.
+## Common Pitfalls
+- Don't truncate inputs when passing files or content to the API. If the content is too long to fit in the context window, notify the user and discuss options (chunking, summarization, etc.) rather than silently truncating.
+- **Opus 4.6 / Sonnet 4.6 thinking:** Use `thinking: {type: "adaptive"}` — do NOT use `budget_tokens` (deprecated on both Opus 4.6 and Sonnet 4.6). For older models, `budget_tokens` must be less than `max_tokens` (minimum 1024). This will throw an error if you get it wrong.
+- **Opus 4.6 prefill removed:** Assistant message prefills (last-assistant-turn prefills) return a 400 error on Opus 4.6. Use structured outputs (`output_config.format`) or system prompt instructions to control response format instead.
+- **128K output tokens:** Opus 4.6 supports up to 128K `max_tokens`, but the SDKs require streaming for large `max_tokens` to avoid HTTP timeouts. Use `.stream()` with `.get_final_message()` / `.finalMessage()`.
+- **Tool call JSON parsing (Opus 4.6):** Opus 4.6 may produce different JSON string escaping in tool call `input` fields (e.g., Unicode or forward-slash escaping). Always parse tool inputs with `json.loads()` / `JSON.parse()` — never do raw string matching on the serialized input.
+- **Structured outputs (all models):** Use `output_config: {format: {...}}` instead of the deprecated `output_format` parameter on `messages.create()`. This is a general API change, not 4.6-specific.
+- **Don't reimplement SDK functionality:** The SDK provides high-level helpers — use them instead of building from scratch. Specifically: use `stream.finalMessage()` instead of wrapping `.on()` events in `new Promise()`; use typed exception classes (`Anthropic.RateLimitError`, etc.) instead of string-matching error messages; use SDK types (`Anthropic.MessageParam`, `Anthropic.Tool`, `Anthropic.Message`, etc.) instead of redefining equivalent interfaces.
+- **Don't define custom types for SDK data structures:** The SDK exports types for all API objects. Use `Anthropic.MessageParam` for messages, `Anthropic.Tool` for tool definitions, `Anthropic.ToolUseBlock` / `Anthropic.ToolResultBlockParam` for tool results, `Anthropic.Message` for responses. Defining your own `interface ChatMessage { role: string; content: unknown }` duplicates what the SDK already provides and loses type safety.
+- **Report and document output:** For tasks that produce reports, documents, or visualizations, the code execution sandbox has `python-docx`, `python-pptx`, `matplotlib`, `pillow`, and `pypdf` pre-installed. Claude can generate formatted files (DOCX, PDF, charts) and return them via the Files API — consider this for "report" or "document" type requests instead of plain stdout text.

package/data/skills/anthropic-doc-coauthoring/SKILL.md ADDED Viewed

@@ -0,0 +1,375 @@
+---
+name: doc-coauthoring
+description: Guide users through a structured workflow for co-authoring documentation. Use when user wants to write documentation, proposals, technical specs, decision docs, or similar structured content. This workflow helps users efficiently transfer context, refine content through iteration, and verify the doc works for readers. Trigger when user mentions writing docs, creating proposals, drafting specs, or similar documentation tasks.
+---
+# Doc Co-Authoring Workflow
+This skill provides a structured workflow for guiding users through collaborative document creation. Act as an active guide, walking users through three stages: Context Gathering, Refinement & Structure, and Reader Testing.
+## When to Offer This Workflow
+**Trigger conditions:**
+- User mentions writing documentation: "write a doc", "draft a proposal", "create a spec", "write up"
+- User mentions specific doc types: "PRD", "design doc", "decision doc", "RFC"
+- User seems to be starting a substantial writing task
+**Initial offer:**
+Offer the user a structured workflow for co-authoring the document. Explain the three stages:
+1. **Context Gathering**: User provides all relevant context while Claude asks clarifying questions
+2. **Refinement & Structure**: Iteratively build each section through brainstorming and editing
+3. **Reader Testing**: Test the doc with a fresh Claude (no context) to catch blind spots before others read it
+Explain that this approach helps ensure the doc works well when others read it (including when they paste it into Claude). Ask if they want to try this workflow or prefer to work freeform.
+If user declines, work freeform. If user accepts, proceed to Stage 1.
+## Stage 1: Context Gathering
+**Goal:** Close the gap between what the user knows and what Claude knows, enabling smart guidance later.
+### Initial Questions
+Start by asking the user for meta-context about the document:
+1. What type of document is this? (e.g., technical spec, decision doc, proposal)
+2. Who's the primary audience?
+3. What's the desired impact when someone reads this?
+4. Is there a template or specific format to follow?
+5. Any other constraints or context to know?
+Inform them they can answer in shorthand or dump information however works best for them.
+**If user provides a template or mentions a doc type:**
+- Ask if they have a template document to share
+- If they provide a link to a shared document, use the appropriate integration to fetch it
+- If they provide a file, read it
+**If user mentions editing an existing shared document:**
+- Use the appropriate integration to read the current state
+- Check for images without alt-text
+- If images exist without alt-text, explain that when others use Claude to understand the doc, Claude won't be able to see them. Ask if they want alt-text generated. If so, request they paste each image into chat for descriptive alt-text generation.
+### Info Dumping
+Once initial questions are answered, encourage the user to dump all the context they have. Request information such as:
+- Background on the project/problem
+- Related team discussions or shared documents
+- Why alternative solutions aren't being used
+- Organizational context (team dynamics, past incidents, politics)
+- Timeline pressures or constraints
+- Technical architecture or dependencies
+- Stakeholder concerns
+Advise them not to worry about organizing it - just get it all out. Offer multiple ways to provide context:
+- Info dump stream-of-consciousness
+- Point to team channels or threads to read
+- Link to shared documents
+**If integrations are available** (e.g., Slack, Teams, Google Drive, SharePoint, or other MCP servers), mention that these can be used to pull in context directly.
+**If no integrations are detected and in Claude.ai or Claude app:** Suggest they can enable connectors in their Claude settings to allow pulling context from messaging apps and document storage directly.
+Inform them clarifying questions will be asked once they've done their initial dump.
+**During context gathering:**
+- If user mentions team channels or shared documents:
+  - If integrations available: Inform them the content will be read now, then use the appropriate integration
+  - If integrations not available: Explain lack of access. Suggest they enable connectors in Claude settings, or paste the relevant content directly.
+- If user mentions entities/projects that are unknown:
+  - Ask if connected tools should be searched to learn more
+  - Wait for user confirmation before searching
+- As user provides context, track what's being learned and what's still unclear
+**Asking clarifying questions:**
+When user signals they've done their initial dump (or after substantial context provided), ask clarifying questions to ensure understanding:
+Generate 5-10 numbered questions based on gaps in the context.
+Inform them they can use shorthand to answer (e.g., "1: yes, 2: see #channel, 3: no because backwards compat"), link to more docs, point to channels to read, or just keep info-dumping. Whatever's most efficient for them.
+**Exit condition:**
+Sufficient context has been gathered when questions show understanding - when edge cases and trade-offs can be asked about without needing basics explained.
+**Transition:**
+Ask if there's any more context they want to provide at this stage, or if it's time to move on to drafting the document.
+If user wants to add more, let them. When ready, proceed to Stage 2.
+## Stage 2: Refinement & Structure
+**Goal:** Build the document section by section through brainstorming, curation, and iterative refinement.
+**Instructions to user:**
+Explain that the document will be built section by section. For each section:
+1. Clarifying questions will be asked about what to include
+2. 5-20 options will be brainstormed
+3. User will indicate what to keep/remove/combine
+4. The section will be drafted
+5. It will be refined through surgical edits
+Start with whichever section has the most unknowns (usually the core decision/proposal), then work through the rest.
+**Section ordering:**
+If the document structure is clear:
+Ask which section they'd like to start with.
+Suggest starting with whichever section has the most unknowns. For decision docs, that's usually the core proposal. For specs, it's typically the technical approach. Summary sections are best left for last.
+If user doesn't know what sections they need:
+Based on the type of document and template, suggest 3-5 sections appropriate for the doc type.
+Ask if this structure works, or if they want to adjust it.
+**Once structure is agreed:**
+Create the initial document structure with placeholder text for all sections.
+**If access to artifacts is available:**
+Use `create_file` to create an artifact. This gives both Claude and the user a scaffold to work from.
+Inform them that the initial structure with placeholders for all sections will be created.
+Create artifact with all section headers and brief placeholder text like "[To be written]" or "[Content here]".
+Provide the scaffold link and indicate it's time to fill in each section.
+**If no access to artifacts:**
+Create a markdown file in the working directory. Name it appropriately (e.g., `decision-doc.md`, `technical-spec.md`).
+Inform them that the initial structure with placeholders for all sections will be created.
+Create file with all section headers and placeholder text.
+Confirm the filename has been created and indicate it's time to fill in each section.
+**For each section:**
+### Step 1: Clarifying Questions
+Announce work will begin on the [SECTION NAME] section. Ask 5-10 clarifying questions about what should be included:
+Generate 5-10 specific questions based on context and section purpose.
+Inform them they can answer in shorthand or just indicate what's important to cover.
+### Step 2: Brainstorming
+For the [SECTION NAME] section, brainstorm [5-20] things that might be included, depending on the section's complexity. Look for:
+- Context shared that might have been forgotten
+- Angles or considerations not yet mentioned
+Generate 5-20 numbered options based on section complexity. At the end, offer to brainstorm more if they want additional options.
+### Step 3: Curation
+Ask which points should be kept, removed, or combined. Request brief justifications to help learn priorities for the next sections.
+Provide examples:
+- "Keep 1,4,7,9"
+- "Remove 3 (duplicates 1)"
+- "Remove 6 (audience already knows this)"
+- "Combine 11 and 12"
+**If user gives freeform feedback** (e.g., "looks good" or "I like most of it but...") instead of numbered selections, extract their preferences and proceed. Parse what they want kept/removed/changed and apply it.
+### Step 4: Gap Check
+Based on what they've selected, ask if there's anything important missing for the [SECTION NAME] section.
+### Step 5: Drafting
+Use `str_replace` to replace the placeholder text for this section with the actual drafted content.
+Announce the [SECTION NAME] section will be drafted now based on what they've selected.
+**If using artifacts:**
+After drafting, provide a link to the artifact.
+Ask them to read through it and indicate what to change. Note that being specific helps learning for the next sections.
+**If using a file (no artifacts):**
+After drafting, confirm completion.
+Inform them the [SECTION NAME] section has been drafted in [filename]. Ask them to read through it and indicate what to change. Note that being specific helps learning for the next sections.
+**Key instruction for user (include when drafting the first section):**
+Provide a note: Instead of editing the doc directly, ask them to indicate what to change. This helps learning of their style for future sections. For example: "Remove the X bullet - already covered by Y" or "Make the third paragraph more concise".
+### Step 6: Iterative Refinement
+As user provides feedback:
+- Use `str_replace` to make edits (never reprint the whole doc)
+- **If using artifacts:** Provide link to artifact after each edit
+- **If using files:** Just confirm edits are complete
+- If user edits doc directly and asks to read it: mentally note the changes they made and keep them in mind for future sections (this shows their preferences)
+**Continue iterating** until user is satisfied with the section.
+### Quality Checking
+After 3 consecutive iterations with no substantial changes, ask if anything can be removed without losing important information.
+When section is done, confirm [SECTION NAME] is complete. Ask if ready to move to the next section.
+**Repeat for all sections.**
+### Near Completion
+As approaching completion (80%+ of sections done), announce intention to re-read the entire document and check for:
+- Flow and consistency across sections
+- Redundancy or contradictions
+- Anything that feels like "slop" or generic filler
+- Whether every sentence carries weight
+Read entire document and provide feedback.
+**When all sections are drafted and refined:**
+Announce all sections are drafted. Indicate intention to review the complete document one more time.
+Review for overall coherence, flow, completeness.
+Provide any final suggestions.
+Ask if ready to move to Reader Testing, or if they want to refine anything else.
+## Stage 3: Reader Testing
+**Goal:** Test the document with a fresh Claude (no context bleed) to verify it works for readers.
+**Instructions to user:**
+Explain that testing will now occur to see if the document actually works for readers. This catches blind spots - things that make sense to the authors but might confuse others.
+### Testing Approach
+**If access to sub-agents is available (e.g., in Claude Code):**
+Perform the testing directly without user involvement.
+### Step 1: Predict Reader Questions
+Announce intention to predict what questions readers might ask when trying to discover this document.
+Generate 5-10 questions that readers would realistically ask.
+### Step 2: Test with Sub-Agent
+Announce that these questions will be tested with a fresh Claude instance (no context from this conversation).
+For each question, invoke a sub-agent with just the document content and the question.
+Summarize what Reader Claude got right/wrong for each question.
+### Step 3: Run Additional Checks
+Announce additional checks will be performed.
+Invoke sub-agent to check for ambiguity, false assumptions, contradictions.
+Summarize any issues found.
+### Step 4: Report and Fix
+If issues found:
+Report that Reader Claude struggled with specific issues.
+List the specific issues.
+Indicate intention to fix these gaps.
+Loop back to refinement for problematic sections.
+---
+**If no access to sub-agents (e.g., claude.ai web interface):**
+The user will need to do the testing manually.
+### Step 1: Predict Reader Questions
+Ask what questions people might ask when trying to discover this document. What would they type into Claude.ai?
+Generate 5-10 questions that readers would realistically ask.
+### Step 2: Setup Testing
+Provide testing instructions:
+1. Open a fresh Claude conversation: https://claude.ai
+2. Paste or share the document content (if using a shared doc platform with connectors enabled, provide the link)
+3. Ask Reader Claude the generated questions
+For each question, instruct Reader Claude to provide:
+- The answer
+- Whether anything was ambiguous or unclear
+- What knowledge/context the doc assumes is already known
+Check if Reader Claude gives correct answers or misinterprets anything.
+### Step 3: Additional Checks
+Also ask Reader Claude:
+- "What in this doc might be ambiguous or unclear to readers?"
+- "What knowledge or context does this doc assume readers already have?"
+- "Are there any internal contradictions or inconsistencies?"
+### Step 4: Iterate Based on Results
+Ask what Reader Claude got wrong or struggled with. Indicate intention to fix those gaps.
+Loop back to refinement for any problematic sections.
+---
+### Exit Condition (Both Approaches)
+When Reader Claude consistently answers questions correctly and doesn't surface new gaps or ambiguities, the doc is ready.
+## Final Review
+When Reader Testing passes:
+Announce the doc has passed Reader Claude testing. Before completion:
+1. Recommend they do a final read-through themselves - they own this document and are responsible for its quality
+2. Suggest double-checking any facts, links, or technical details
+3. Ask them to verify it achieves the impact they wanted
+Ask if they want one more review, or if the work is done.
+**If user wants final review, provide it. Otherwise:**
+Announce document completion. Provide a few final tips:
+- Consider linking this conversation in an appendix so readers can see how the doc was developed
+- Use appendices to provide depth without bloating the main doc
+- Update the doc as feedback is received from real readers
+## Tips for Effective Guidance
+**Tone:**
+- Be direct and procedural
+- Explain rationale briefly when it affects user behavior
+- Don't try to "sell" the approach - just execute it
+**Handling Deviations:**
+- If user wants to skip a stage: Ask if they want to skip this and write freeform
+- If user seems frustrated: Acknowledge this is taking longer than expected. Suggest ways to move faster
+- Always give user agency to adjust the process
+**Context Management:**
+- Throughout, if context is missing on something mentioned, proactively ask
+- Don't let gaps accumulate - address them as they come up
+**Artifact Management:**
+- Use `create_file` for drafting full sections
+- Use `str_replace` for all edits
+- Provide artifact link after every change
+- Never use artifacts for brainstorming lists - that's just conversation
+**Quality over Speed:**
+- Don't rush through stages
+- Each iteration should make meaningful improvements
+- The goal is a document that actually works for readers