npm - mindforge-cc - Versions diffs - 11.5.1 → 11.7.0 - Mend

mindforge-cc 11.5.1 → 11.7.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (214)

package/.agent/mindforge/skill-tdd.md +53 -0
package/.agent/mindforge/skills-index.md +118 -0
package/.agent/mindforge/systematic-debug.md +60 -0
package/.agent/mindforge/wf-catalog.md +37 -0
package/.agent/mindforge/wf-code-audit.md +31 -0
package/.agent/mindforge/wf-competitive-analysis.md +31 -0
package/.agent/mindforge/wf-deep-research.md +32 -0
package/.agent/mindforge/wf-feature-planner.md +31 -0
package/.agent/mindforge/wf-incident-response.md +31 -0
package/.agent/mindforge/wf-onboard-codebase.md +31 -0
package/.agent/mindforge/wf-perf-optimize.md +31 -0
package/.agent/mindforge/wf-pr-review.md +31 -0
package/.agent/mindforge/wf-refactor-plan.md +31 -0
package/.agent/mindforge/wf-release-prep.md +31 -0
package/.agent/mindforge/wf-tdd-sprint.md +31 -0
package/.agent/mindforge/wf-tech-evaluation.md +31 -0
package/.agent/skills/1password-skill/SKILL.md +156 -0
package/.agent/skills/1password-skill/references/cli-examples.md +31 -0
package/.agent/skills/1password-skill/references/get-started.md +21 -0
package/.agent/skills/article-illustrator/SKILL.md +199 -0
package/.agent/skills/article-illustrator/references/prompt-construction.md +426 -0
package/.agent/skills/article-illustrator/references/style-presets.md +80 -0
package/.agent/skills/article-illustrator/references/styles.md +224 -0
package/.agent/skills/article-illustrator/references/usage.md +50 -0
package/.agent/skills/article-illustrator/references/workflow.md +332 -0
package/.agent/skills/arxiv/SKILL.md +275 -0
package/.agent/skills/blogwatcher/SKILL.md +130 -0
package/.agent/skills/code-wiki/SKILL.md +438 -0
package/.agent/skills/code-wiki/templates/README.md +31 -0
package/.agent/skills/code-wiki/templates/architecture.md +30 -0
package/.agent/skills/code-wiki/templates/getting-started.md +47 -0
package/.agent/skills/code-wiki/templates/module.md +38 -0
package/.agent/skills/codebase-inspection/SKILL.md +109 -0
package/.agent/skills/comic-creator/SKILL.md +240 -0
package/.agent/skills/comic-creator/references/analysis-framework.md +176 -0
package/.agent/skills/comic-creator/references/auto-selection.md +71 -0
package/.agent/skills/comic-creator/references/base-prompt.md +98 -0
package/.agent/skills/comic-creator/references/character-template.md +180 -0
package/.agent/skills/comic-creator/references/ohmsha-guide.md +85 -0
package/.agent/skills/comic-creator/references/partial-workflows.md +106 -0
package/.agent/skills/comic-creator/references/storyboard-template.md +143 -0
package/.agent/skills/comic-creator/references/workflow.md +401 -0
package/.agent/skills/concept-diagrams/SKILL.md +355 -0
package/.agent/skills/concept-diagrams/references/dashboard-patterns.md +43 -0
package/.agent/skills/concept-diagrams/references/infrastructure-patterns.md +144 -0
package/.agent/skills/concept-diagrams/references/physical-shape-cookbook.md +42 -0
package/.agent/skills/creative-ideation/SKILL.md +144 -0
package/.agent/skills/creative-ideation/references/full-prompt-library.md +110 -0
package/.agent/skills/devops-cli/SKILL.md +149 -0
package/.agent/skills/devops-cli/references/app-discovery.md +112 -0
package/.agent/skills/devops-cli/references/authentication.md +59 -0
package/.agent/skills/devops-cli/references/cli-reference.md +104 -0
package/.agent/skills/devops-cli/references/running-apps.md +171 -0
package/.agent/skills/devops-watchers/SKILL.md +103 -0
package/.agent/skills/docker-management/SKILL.md +273 -0
package/.agent/skills/domain-intel/SKILL.md +96 -0
package/.agent/skills/duckduckgo-search/SKILL.md +230 -0
package/.agent/skills/github-auth/SKILL.md +240 -0
package/.agent/skills/github-code-review/SKILL.md +474 -0
package/.agent/skills/github-code-review/references/review-output-template.md +74 -0
package/.agent/skills/github-issues/SKILL.md +363 -0
package/.agent/skills/github-issues/templates/bug-report.md +35 -0
package/.agent/skills/github-issues/templates/feature-request.md +31 -0
package/.agent/skills/github-pr-workflow/SKILL.md +360 -0
package/.agent/skills/github-pr-workflow/references/ci-troubleshooting.md +183 -0
package/.agent/skills/github-pr-workflow/references/conventional-commits.md +71 -0
package/.agent/skills/github-pr-workflow/templates/pr-body-bugfix.md +35 -0
package/.agent/skills/github-pr-workflow/templates/pr-body-feature.md +33 -0
package/.agent/skills/github-repo-management/SKILL.md +509 -0
package/.agent/skills/github-repo-management/references/github-api-cheatsheet.md +161 -0
package/.agent/skills/godmode/SKILL.md +396 -0
package/.agent/skills/godmode/references/jailbreak-templates.md +128 -0
package/.agent/skills/godmode/references/refusal-detection.md +142 -0
package/.agent/skills/hyperframes/SKILL.md +182 -0
package/.agent/skills/hyperframes/references/cli.md +185 -0
package/.agent/skills/hyperframes/references/composition.md +129 -0
package/.agent/skills/hyperframes/references/features.md +289 -0
package/.agent/skills/hyperframes/references/gsap.md +136 -0
package/.agent/skills/hyperframes/references/troubleshooting.md +137 -0
package/.agent/skills/hyperframes/references/website-to-video.md +145 -0
package/.agent/skills/jupyter-live-kernel/SKILL.md +160 -0
package/.agent/skills/kanban-orchestrator/SKILL.md +209 -0
package/.agent/skills/kanban-worker/SKILL.md +188 -0
package/.agent/skills/llm-wiki/SKILL.md +499 -0
package/.agent/skills/meme-generation/SKILL.md +122 -0
package/.agent/skills/node-inspect-debugger/SKILL.md +312 -0
package/.agent/skills/obsidian/SKILL.md +60 -0
package/.agent/skills/osint-investigation/SKILL.md +269 -0
package/.agent/skills/osint-investigation/templates/source-template.md +59 -0
package/.agent/skills/oss-forensics/SKILL.md +422 -0
package/.agent/skills/oss-forensics/references/evidence-types.md +89 -0
package/.agent/skills/oss-forensics/references/github-archive-guide.md +184 -0
package/.agent/skills/oss-forensics/references/investigation-templates.md +131 -0
package/.agent/skills/oss-forensics/references/recovery-techniques.md +164 -0
package/.agent/skills/oss-forensics/templates/forensic-report.md +151 -0
package/.agent/skills/oss-forensics/templates/malicious-package-report.md +43 -0
package/.agent/skills/parallel-cli/SKILL.md +384 -0
package/.agent/skills/pinggy-tunnel/SKILL.md +302 -0
package/.agent/skills/pixel-art/SKILL.md +209 -0
package/.agent/skills/pixel-art/references/palettes.md +49 -0
package/.agent/skills/plan/SKILL.md +331 -0
package/.agent/skills/polymarket/SKILL.md +75 -0
package/.agent/skills/polymarket/references/api-endpoints.md +220 -0
package/.agent/skills/python-debugpy/SKILL.md +368 -0
package/.agent/skills/requesting-code-review/SKILL.md +273 -0
package/.agent/skills/research-paper-writing/SKILL.md +2367 -0
package/.agent/skills/research-paper-writing/references/autoreason-methodology.md +394 -0
package/.agent/skills/research-paper-writing/references/checklists.md +434 -0
package/.agent/skills/research-paper-writing/references/citation-workflow.md +563 -0
package/.agent/skills/research-paper-writing/references/experiment-patterns.md +728 -0
package/.agent/skills/research-paper-writing/references/human-evaluation.md +476 -0
package/.agent/skills/research-paper-writing/references/paper-types.md +481 -0
package/.agent/skills/research-paper-writing/references/reviewer-guidelines.md +433 -0
package/.agent/skills/research-paper-writing/references/sources.md +191 -0
package/.agent/skills/research-paper-writing/references/writing-guide.md +474 -0
package/.agent/skills/research-paper-writing/templates/README.md +251 -0
package/.agent/skills/rest-graphql-debug/SKILL.md +507 -0
package/.agent/skills/s6-container-supervision/SKILL.md +171 -0
package/.agent/skills/scrapling/SKILL.md +328 -0
package/.agent/skills/sherlock/SKILL.md +186 -0
package/.agent/skills/simplify-code/SKILL.md +168 -0
package/.agent/skills/skill-authoring/SKILL.md +158 -0
package/.agent/skills/spike/SKILL.md +190 -0
package/.agent/skills/subagent-driven-development/SKILL.md +345 -0
package/.agent/skills/subagent-driven-development/references/context-budget-discipline.md +53 -0
package/.agent/skills/subagent-driven-development/references/gates-taxonomy.md +93 -0
package/.agent/skills/systematic-debugging/SKILL.md +360 -0
package/.agent/skills/test-driven-development/SKILL.md +336 -0
package/.agent/skills/video-orchestrator/SKILL.md +194 -0
package/.agent/skills/video-orchestrator/references/examples.md +227 -0
package/.agent/skills/video-orchestrator/references/intake.md +166 -0
package/.agent/skills/video-orchestrator/references/kanban-setup.md +278 -0
package/.agent/skills/video-orchestrator/references/monitoring.md +180 -0
package/.agent/skills/video-orchestrator/references/role-archetypes.md +298 -0
package/.agent/skills/video-orchestrator/references/tool-matrix.md +317 -0
package/.agent/skills/web-pentest/SKILL.md +332 -0
package/.agent/skills/web-pentest/references/bypass-techniques.md +133 -0
package/.agent/skills/web-pentest/references/exploitation-techniques.md +204 -0
package/.agent/skills/web-pentest/references/scope-enforcement.md +110 -0
package/.agent/skills/web-pentest/references/vuln-taxonomy.md +81 -0
package/.agent/skills/web-pentest/templates/authorization.md +69 -0
package/.agent/skills/web-pentest/templates/pentest-report.md +178 -0
package/.claude/commands/mindforge/skill-tdd.md +53 -0
package/.claude/commands/mindforge/skills-index.md +118 -0
package/.claude/commands/mindforge/systematic-debug.md +60 -0
package/.claude/commands/mindforge/wf-catalog.md +37 -0
package/.claude/commands/mindforge/wf-code-audit.md +31 -0
package/.claude/commands/mindforge/wf-competitive-analysis.md +31 -0
package/.claude/commands/mindforge/wf-deep-research.md +32 -0
package/.claude/commands/mindforge/wf-feature-planner.md +31 -0
package/.claude/commands/mindforge/wf-incident-response.md +31 -0
package/.claude/commands/mindforge/wf-onboard-codebase.md +31 -0
package/.claude/commands/mindforge/wf-perf-optimize.md +31 -0
package/.claude/commands/mindforge/wf-pr-review.md +31 -0
package/.claude/commands/mindforge/wf-refactor-plan.md +31 -0
package/.claude/commands/mindforge/wf-release-prep.md +31 -0
package/.claude/commands/mindforge/wf-tdd-sprint.md +31 -0
package/.claude/commands/mindforge/wf-tech-evaluation.md +31 -0
package/.mindforge/config.json +2 -2
package/.mindforge/dynamic-workflows/REGISTRY.md +65 -0
package/.mindforge/dynamic-workflows/index.json +171 -0
package/.mindforge/dynamic-workflows/scripts/code-audit.js +103 -0
package/.mindforge/dynamic-workflows/scripts/competitive-analysis.js +85 -0
package/.mindforge/dynamic-workflows/scripts/deep-research.js +151 -0
package/.mindforge/dynamic-workflows/scripts/feature-planner.js +104 -0
package/.mindforge/dynamic-workflows/scripts/incident-response.js +106 -0
package/.mindforge/dynamic-workflows/scripts/onboard-codebase.js +102 -0
package/.mindforge/dynamic-workflows/scripts/perf-optimize.js +128 -0
package/.mindforge/dynamic-workflows/scripts/pr-review.js +87 -0
package/.mindforge/dynamic-workflows/scripts/refactor-plan.js +121 -0
package/.mindforge/dynamic-workflows/scripts/release-prep.js +102 -0
package/.mindforge/dynamic-workflows/scripts/tdd-sprint.js +103 -0
package/.mindforge/dynamic-workflows/scripts/tech-evaluation.js +72 -0
package/.mindforge/memory/sync-manifest.json +1 -1
package/.mindforge/skills/arxiv/SKILL.md +294 -0
package/.mindforge/skills/blogwatcher/SKILL.md +147 -0
package/.mindforge/skills/code-wiki/SKILL.md +457 -0
package/.mindforge/skills/codebase-inspection/SKILL.md +126 -0
package/.mindforge/skills/concept-diagrams/SKILL.md +373 -0
package/.mindforge/skills/creative-ideation/SKILL.md +162 -0
package/.mindforge/skills/domain-intel/SKILL.md +116 -0
package/.mindforge/skills/duckduckgo-search/SKILL.md +249 -0
package/.mindforge/skills/github-code-review/SKILL.md +493 -0
package/.mindforge/skills/github-issues/SKILL.md +382 -0
package/.mindforge/skills/github-pr-workflow/SKILL.md +379 -0
package/.mindforge/skills/jupyter-live-kernel/SKILL.md +179 -0
package/.mindforge/skills/kanban-orchestrator/SKILL.md +227 -0
package/.mindforge/skills/kanban-worker/SKILL.md +206 -0
package/.mindforge/skills/meme-generation/SKILL.md +141 -0
package/.mindforge/skills/obsidian/SKILL.md +80 -0
package/.mindforge/skills/osint-investigation/SKILL.md +288 -0
package/.mindforge/skills/oss-forensics/SKILL.md +421 -0
package/.mindforge/skills/pixel-art/SKILL.md +228 -0
package/.mindforge/skills/plan/SKILL.md +350 -0
package/.mindforge/skills/requesting-code-review/SKILL.md +292 -0
package/.mindforge/skills/research-paper-writing/SKILL.md +2384 -0
package/.mindforge/skills/scrapling/SKILL.md +345 -0
package/.mindforge/skills/sherlock/SKILL.md +203 -0
package/.mindforge/skills/simplify-code/SKILL.md +187 -0
package/.mindforge/skills/spike/SKILL.md +209 -0
package/.mindforge/skills/subagent-driven-development/SKILL.md +364 -0
package/.mindforge/skills/systematic-debugging/SKILL.md +379 -0
package/.mindforge/skills/test-driven-development/SKILL.md +355 -0
package/.mindforge/skills/web-pentest/SKILL.md +327 -0
package/CHANGELOG.md +71 -0
package/MINDFORGE.md +2 -2
package/README.md +72 -3
package/RELEASENOTES.md +109 -0
package/bin/installer-core.js +6 -2
package/bin/mindforge-cli.js +7 -0
package/bin/workflows/workflow-runner.js +110 -0
package/docs/commands-reference.md +25 -0
package/docs/getting-started.md +42 -5
package/package.json +2 -1

package/.agent/skills/skill-authoring/SKILL.md ADDED

@@ -0,0 +1,158 @@
+---
+name: hermes-agent-skill-authoring
+description: "Author in-repo SKILL.md: frontmatter, validator, structure."
+version: 1.0.0
+---
+# Authoring
+## Overview
+There are two places a SKILL.md can live:
+1. **User-local:** `~/.agent/skills/<maybe-category>/<name>/SKILL.md` — personal, not shared. Created via `skill_manage(action='create')`.
+2. **In-repo (this skill is about this case):** `/home/bb/
+## When to Use
+- User asks you to add a skill "in this branch / repo / commit"
+- You're committing a reusable workflow that should ship with
+- You're editing an existing skill under `/home/bb/
+## Required Frontmatter
+Source of truth: `tools/skill_manager_tool.py::_validate_frontmatter`. Hard requirements:
+- Starts with `---` as the first bytes (no leading blank line).
+- Closes with `\n---\n` before the body.
+- Parses as a YAML mapping.
+- `name` field present.
+- `description` field present, ≤ **1024 chars** (`MAX_DESCRIPTION_LENGTH`).
+- Non-empty body after the closing `---`.
+Peer-matched shape used by every skill under `skills/software-development/`:
+```yaml
+---
+name: my-skill-name               # lowercase, hyphens, ≤64 chars (MAX_NAME_LENGTH)
+description: Use when <trigger>. <one-line behavior>.
+version: 1.0.0
+author:
+license: MIT
+metadata:
+  hermes:
+    tags: [short, descriptive, tags]
+    related_skills: [other-skill, another-skill]
+---
+```
+`version` / `author` / `license` / `metadata` are NOT enforced by the validator, but every peer has them; omit them and your skill sticks out.
+## Size Limits
+- Description: ≤ 1024 chars (enforced).
+- Full SKILL.md: ≤ 100,000 chars (enforced as `MAX_SKILL_CONTENT_CHARS`, ~36k tokens).
+- Peer skills in `software-development/` sit at **8-14k chars**. Aim for that range. If you're pushing past 20k, split into `references/*.md` and reference them from SKILL.md.
+## Peer-Matched Structure
+Every in-repo skill follows roughly:
+```
+# <Title>
+## Overview
+One or two paragraphs: what and why.
+## When to Use
+- Bulleted triggers
+- "Don't use for:" counter-triggers
+## <Topic sections specific to the skill>
+- Quick-reference tables are common
+- Code blocks with exact commands
+- Agent-specific recipes (tests via scripts/run_tests.sh, ui-tui paths, etc.)
+## Common Pitfalls
+Numbered list of mistakes and their fixes.
+## Verification Checklist
+- [ ] Checkbox list of post-action verifications
+## One-Shot Recipes (optional)
+Named scenarios → concrete command sequences.
+```
+Not every section is mandatory, but `Overview` + `When to Use` + actionable body + pitfalls are the minimum for the skill to feel like a peer.
+## Directory Placement
+```
+skills/<category>/<skill-name>/SKILL.md
+```
+Categories currently in repo (confirm with `ls skills/`): `autonomous-ai-agents`, `creative`, `data-science`, `devops`, `dogfood`, `email`, `gaming`, `github`, `leisure`, `mcp`, `media`, `mlops/*`, `note-taking`, `productivity`, `red-teaming`, `research`, `smart-home`, `social-media`, `software-development`.
+Pick the closest existing category. Don't invent new top-level categories casually.
+## Workflow
+1. **Survey peers** in the target category:
+   ```
+   ls skills/<category>/
+   ```
+   Read 2-3 peer SKILL.md files to match tone and structure.
+2. **Check validator constraints** in `tools/skill_manager_tool.py` if unsure.
+3. **Draft** with `write_file` to `skills/<category>/<name>/SKILL.md`.
+4. **Validate locally**:
+   ```python
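+   import pathlib, re
+   import yaml  # PyYAML
+   content = pathlib.Path("skills/<category>/<name>/SKILL.md").read_text(encoding="utf-8")
+   assert content.startswith("---")             # no leading blank line or BOM
+   m = re.search(r'\n---\s*\n', content[3:])    # closing frontmatter delimiter
+   assert m, "frontmatter is never closed"
+   fm = yaml.safe_load(content[3:m.start()+3])
+   assert isinstance(fm, dict)                  # must parse as a YAML mapping
+   assert "name" in fm and "description" in fm
+   assert len(fm["description"]) <= 1024
+   assert content[m.end()+3:].strip()           # non-empty body after the closing ---
+   assert len(content) <= 100_000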
+   ```
+5. **Git add + commit** on the active branch.
+6. **Note:** the CURRENT session's skill loader is cached — `skill_view` / `skills_list` will not see the new skill until a new session. This is expected, not a bug.
+## Cross-Referencing Other Skills
+`metadata.hermes.related_skills` unions both trees (`skills/` in-repo and `~/.agent/skills/`) at load time. You CAN reference a user-local skill from an in-repo skill, but it won't resolve for other users who clone the repo fresh. Prefer referencing only in-repo skills from in-repo skills. If a frequently-referenced skill lives only in `~/.agent/skills/`, consider promoting it to the repo.
+## Editing Existing In-Repo Skills
+- **Small fix (typo, added pitfall, tightened trigger):** `skill_manage(action='patch', name=..., old_string=..., new_string=...)` works fine on in-repo skills.
+- **Major rewrite:** `write_file` the whole SKILL.md. `skill_manage(action='edit')` also works but requires supplying the full new content.
+- **Adding supporting files:** `write_file` to `skills/<category>/<name>/references/<file>.md`, `templates/<file>`, or `scripts/<file>`. `skill_manage(action='write_file')` also works and enforces the references/templates/scripts/assets subdir allowlist.
+- **Always commit** the edit — in-repo skills are source, not runtime state.
+## Common Pitfalls
+1. **Using `skill_manage(action='create')` for an in-repo skill.** It writes to `~/.agent/skills/`, not the repo tree. Use `write_file` for in-repo creation.
+2. **Leading whitespace before `---`.** The validator checks `content.startswith("---")`; any leading blank line or BOM fails validation.
+3. **Description too generic.** Peer descriptions start with "Use when ..." and describe the *trigger class*, not the one task. "Use when debugging X" > "Debug X".
+4. **Forgetting the author/license/metadata block.** Not validator-enforced, but every peer has it; omitting makes the skill look half-finished.
+5. **Writing a skill that duplicates a peer.** Before creating, `ls skills/<category>/` and open 2-3 peers. Prefer extending an existing skill to creating a narrow sibling.
+6. **Expecting the current session to see the new skill.** It won't. The skill loader is initialized at session start. Verify in a fresh session or via `skill_view` using the exact path.
+7. **Linking to skills that don't exist in-repo.** `related_skills: [some-user-local-skill]` works for you but breaks for other clones. Prefer only in-repo links.
+## Verification Checklist
+- [ ] File is at `skills/<category>/<name>/SKILL.md` (not in `~/.agent/skills/`)
+- [ ] Frontmatter starts at byte 0 with `---`, closes with `\n---\n`
+- [ ] `name`, `description`, `version`, `author`, `license`, `metadata.hermes.{tags, related_skills}` all present
+- [ ] Name ≤ 64 chars, lowercase + hyphens
+- [ ] Description ≤ 1024 chars and starts with "Use when ..."
+- [ ] Total file ≤ 100,000 chars (aim for 8-14k)
+- [ ] Structure: `# Title` → `## Overview` → `## When to Use` → body → `## Common Pitfalls` → `## Verification Checklist`
+- [ ] `related_skills` references resolve in-repo (or are explicitly OK to be user-local)
+- [ ] `git add skills/<category>/<name>/ && git commit` completed on the intended branch
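
Editor's note (not part of the package diff): the hard requirements and size limits listed in the skill above can be collected into one reusable check. The following is a minimal sketch under stated assumptions, not the logic of `tools/skill_manager_tool.py::_validate_frontmatter`, which this diff does not show. The names `check_skill` and `top_level_keys` are hypothetical, a naive top-level `key: value` scan stands in for a real YAML parser so that only the standard library is needed, and the name pattern (lowercase letters, digits, single hyphens) is an assumption based on the "lowercase, hyphens" comment.

```python
import re

# Limits as quoted in the skill text.
MAX_NAME_LENGTH = 64
MAX_DESCRIPTION_LENGTH = 1024
MAX_SKILL_CONTENT_CHARS = 100_000


def top_level_keys(frontmatter: str) -> dict:
    """Naive stand-in for yaml.safe_load: collects unindented `key: value` pairs only.

    It does not strip trailing `# comments` or handle nested or multi-line values.
    """
    fields = {}
    for line in frontmatter.splitlines():
        m = re.match(r"^([A-Za-z_][\w-]*):\s*(.*)$", line)
        if m:
            fields[m.group(1)] = m.group(2).strip().strip("\"'")
    return fields


def check_skill(content: str) -> list:
    """Return a list of problems; an empty list means every check passed."""
    if not content.startswith("---"):
        return ["frontmatter must start at byte 0 with '---'"]
    close = re.search(r"\n---\s*\n", content[3:])
    if close is None:
        return ["frontmatter is never closed with '---'"]

    # Offsets from the search are relative to content[3:], hence the +3.
    fields = top_level_keys(content[3:close.start() + 3])
    body = content[close.end() + 3:]

    problems = []
    name = fields.get("name", "")
    description = fields.get("description", "")
    if not name:
        problems.append("missing name")
    elif len(name) > MAX_NAME_LENGTH or not re.fullmatch(r"[a-z0-9]+(-[a-z0-9]+)*", name):
        problems.append("name must be lowercase with hyphens, at most 64 chars")
    if not description:
        problems.append("missing description")
    elif len(description) > MAX_DESCRIPTION_LENGTH:
        problems.append("description longer than 1024 chars")
    if not body.strip():
        problems.append("empty body")
    if len(content) > MAX_SKILL_CONTENT_CHARS:
        problems.append("file longer than 100,000 chars")
    return problems
```

Returning a list of problems rather than asserting lets a caller report every failure in one pass. For real use, swapping `top_level_keys` for `yaml.safe_load` plus a mapping check would match the requirements in the skill text more closely.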

package/.agent/skills/spike/SKILL.md ADDED

@@ -0,0 +1,190 @@
+---
+name: spike
+description: "Throwaway experiments to validate an idea before build."
+version: 1.0.0
+---
+# Spike
+Use this skill when the user wants to **feel out an idea** before committing to a real build — validating feasibility, comparing approaches, or surfacing unknowns that no amount of research will answer. Spikes are disposable by design. Throw them away once they've paid their debt.
+Load this when the user says things like "let me try this", "I want to see if X works", "spike this out", "before I commit to Y", "quick prototype of Z", "is this even possible?", or "compare A vs B".
+## When NOT to use this
+- The answer is knowable from docs or reading code — just do research, don't build
+- The work is on the production path: use the `plan` skill instead
+- The idea is already validated — jump straight to implementation
+## If the user has the full GSD system installed
+If `gsd-spike` shows up as a sibling skill (installed via `npx get-shit-done-cc --hermes`), prefer **`gsd-spike`** when the user wants the full GSD workflow: persistent `.planning/spikes/` state, MANIFEST tracking across sessions, Given/When/Then verdict format, and commit patterns that integrate with the rest of GSD. This skill is the lightweight standalone version for users who don't have (or don't want) the full system.
+## Core method
+Regardless of scale, every spike follows this loop:
+```
+decompose  →  research  →  build  →  verdict
+   ↑__________________________________________↓
+                  iterate on findings
+```
+### 1. Decompose
+Break the user's idea into **2-5 independent feasibility questions**. Each question is one spike. Present them as a table with Given/When/Then framing:
+| # | Spike | Validates (Given/When/Then) | Risk |
+|---|-------|----------------------------|------|
+| 001 | websocket-streaming | Given a WS connection, when the LLM streams tokens, then the client receives each chunk in < 100ms | High |
+| 002a | pdf-parse-pdfjs | Given a multi-page PDF, when parsed with pdfjs, then structured text is extractable | Medium |
+| 002b | pdf-parse-camelot | Given a multi-page PDF, when parsed with camelot, then structured text is extractable | Medium |
+**Spike types:**
+- **standard** — one approach answering one question
+- **comparison** — same question, different approaches (shared number, letter suffix `a`/`b`/`c`)
+**Good spike questions:** specific feasibility with observable output.
+**Bad spike questions:** too broad, no observable output, or just "read the docs about X".
+**Order by risk.** The spike most likely to kill the idea runs first. No point prototyping the easy parts if the hard part doesn't work.
+**Skip decomposition** only if the user already knows exactly what they want to spike and says so. Then take their idea as a single spike.
+### 2. Align (for multi-spike ideas)
+Present the spike table. Ask: "Build all in this order, or adjust?" Let the user drop, reorder, or re-frame before you write any code.
+### 3. Research (per spike, before building)
+Spikes are not research-free — you research enough to pick the right approach, then you build. Per spike:
+1. **Brief it.** 2-3 sentences: what this spike is, why it matters, key risk.
+2. **Surface competing approaches** if there's real choice:
+   | Approach | Tool/Library | Pros | Cons | Status |
+   |----------|-------------|------|------|--------|
+   | ... | ... | ... | ... | maintained / abandoned / beta |
+3. **Pick one.** State why. If 2+ are credible, build quick variants within the spike.
+4. **Skip research** for pure logic with no external dependencies.
+Use tools for the research step:
+- `web_search("python websocket streaming libraries 2025")` — find candidates
+- `web_extract(urls=["https://websockets.readthedocs.io/..."])` — read the actual docs (returns markdown)
+- `terminal("pip show websockets | grep Version")` — check what's installed in the project's venv
+For libraries without docs pages, clone and read their `README.md` / `examples/` via `read_file`. Context7 MCP (if the user has it configured) is also a good source — `mcp_*_resolve-library-id` then `mcp_*_query-docs`.
+### 4. Build
+One directory per spike. Keep it standalone.
+```
+spikes/
+├── 001-websocket-streaming/
+│   ├── README.md
+│   └── main.py
+├── 002a-pdf-parse-pdfjs/
+│   ├── README.md
+│   └── parse.js
+└── 002b-pdf-parse-camelot/
+    ├── README.md
+    └── parse.py
+```
+**Bias toward something the user can interact with.** Spikes fail when the only output is a log line that says "it works." The user wants to *feel* the spike working. Default choices, in order of preference:
+1. A runnable CLI that takes input and prints observable output
+2. A minimal HTML page that demonstrates the behavior
+3. A small web server with one endpoint
+4. A unit test that exercises the question with recognizable assertions
+**Depth over speed.** Never declare "it works" after one happy-path run. Test edge cases. Follow surprising findings. The verdict is only trustworthy when the investigation was honest.
+**Avoid** unless the spike specifically requires it: complex package management, build tools/bundlers, Docker, env files, config systems. Hardcode everything — it's a spike.
+**Building one spike** — a typical tool sequence:
+```
+terminal("mkdir -p spikes/001-websocket-streaming")
+write_file("spikes/001-websocket-streaming/README.md", "# 001: websocket-streaming\n\n...")
+write_file("spikes/001-websocket-streaming/main.py", "...")
+terminal("cd spikes/001-websocket-streaming && python3 main.py")
+# Observe output, iterate.
+```
+**Parallel comparison spikes (002a / 002b) — delegate.** When two approaches can run in parallel and both need real engineering (not 10-line prototypes), fan out with `delegate_task`:
+```
+delegate_task(tasks=[
+    {"goal": "Build 002a-pdf-parse-pdfjs: ...", "toolsets": ["terminal", "file", "web"]},
+    {"goal": "Build 002b-pdf-parse-camelot: ...", "toolsets": ["terminal", "file", "web"]},
+])
+```
+Each subagent returns its own verdict; you write the head-to-head.
+### 5. Verdict
+Each spike's `README.md` closes with:
+```markdown
+## Verdict: VALIDATED | PARTIAL | INVALIDATED
+### What worked
+- ...
+### What didn't
+- ...
+### Surprises
+- ...
+### Recommendation for the real build
+- ...
+```
+**VALIDATED** = the core question was answered yes, with evidence.
+**PARTIAL** = it works under constraints X, Y, Z — document them.
+**INVALIDATED** = doesn't work, for this reason. This is a successful spike.
+## Comparison spikes
+When two approaches answer the same question (002a / 002b), build them **back to back**, then do a head-to-head comparison at the end:
+```markdown
+## Head-to-head: pdfjs vs camelot
+| Dimension | pdfjs (002a) | camelot (002b) |
+|-----------|--------------|----------------|
+| Extraction quality | 9/10 structured | 7/10 table-only |
+| Setup complexity | npm install, 1 line | pip + ghostscript |
+| Perf on 100-page PDF | 3s | 18s |
+| Handles rotated text | no | yes |
+**Winner:** pdfjs for our use case. Camelot if we need table-first extraction later.
+```
+## Frontier mode (picking what to spike next)
+If spikes already exist and the user says "what should I spike next?", walk the existing directories and look for:
+- **Integration risks** — two validated spikes that touch the same resource but were tested independently
+- **Data handoffs** — spike A's output was assumed compatible with spike B's input; never proven
+- **Gaps in the vision** — capabilities assumed but unproven
+- **Alternative approaches** — different angles for PARTIAL or INVALIDATED spikes
+Propose 2-4 candidates as Given/When/Then. Let the user pick.
+## Output
+- Create `spikes/` (or `.planning/spikes/` if the user is using GSD conventions) in the repo root
+- One dir per spike: `NNN-descriptive-name/`
+- `README.md` per spike captures question, approach, results, verdict
+- Keep the code throwaway — a spike that takes 2 days to "clean up for production" was a bad spike
+## Attribution
+Adapted from the GSD (Get Shit Done) project's `/gsd-spike` workflow — MIT © 2025 Lex Christopherson ([gsd-build/get-shit-done](https://github.com/gsd-build/get-shit-done)). The full GSD system offers persistent spike state, MANIFEST tracking, and integration with a broader spec-driven development pipeline; install with `npx get-shit-done-cc --hermes --global`.
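
Editor's note (not part of the package diff): the spike skill above orders spikes by risk and names directories `NNN-descriptive-name/`, with comparison spikes sharing a number and taking a letter suffix. A minimal sketch of that convention follows. The function `name_spikes`, the tuple layout, and the `Low` risk level are hypothetical; the skill itself only shows `High` and `Medium` and prescribes no code for this step.

```python
# Lower value = riskier = runs first. "Low" is an assumed third level.
RISK_ORDER = {"High": 0, "Medium": 1, "Low": 2}
SUFFIXES = "abcdefghijklmnopqrstuvwxyz"


def name_spikes(spikes):
    """Turn (slug, risk, group) tuples into ordered spike directory names.

    `group` is None for a standard spike, or a shared label for comparison
    spikes that answer the same question with different approaches.
    """
    # sorted() is stable, so spikes with equal risk keep their input order.
    ordered = sorted(spikes, key=lambda spike: RISK_ORDER[spike[1]])

    numbers = {}   # question key -> spike number
    variants = {}  # comparison group -> variants named so far
    names = []
    for slug, _risk, group in ordered:
        key = ("group", group) if group is not None else ("solo", slug)
        if key not in numbers:
            numbers[key] = len(numbers) + 1
        prefix = f"{numbers[key]:03d}"
        if group is not None:
            index = variants.get(group, 0)
            variants[group] = index + 1
            prefix += SUFFIXES[index]
        names.append(f"{prefix}-{slug}")
    return names


print(name_spikes([
    ("pdf-parse-pdfjs", "Medium", "pdf-parse"),
    ("websocket-streaming", "High", None),
    ("pdf-parse-camelot", "Medium", "pdf-parse"),
]))
```

With the three example spikes from the decomposition table, this prints the same directory names used in the `spikes/` tree shown in the skill.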

package/.agent/skills/subagent-driven-development/SKILL.md ADDED

@@ -0,0 +1,345 @@
+---
+name: subagent-driven-development
+description: "Execute plans via delegate_task subagents (2-stage review)."
+version: 1.1.0
+---
+# Subagent-Driven Development
+## Overview
+Execute implementation plans by dispatching fresh subagents per task with systematic two-stage review.
+**Core principle:** Fresh subagent per task + two-stage review (spec then quality) = high quality, fast iteration.
+## When to Use
+Use this skill when:
+- You have an implementation plan (from the `plan` skill or user requirements)
+- Tasks are mostly independent
+- Quality and spec compliance are important
+- You want automated review between tasks
+**Advantages over manual execution:**
+- Fresh context per task (no confusion from accumulated state)
+- Automated review process catches issues early
+- Consistent quality checks across all tasks
+- Subagents can ask questions before starting work
+## The Process
+### 1. Read and Parse Plan
+Read the plan file. Extract ALL tasks with their full text and context upfront. Create a todo list:
+```python
+# Read the plan
+read_file("docs/plans/feature-plan.md")
+# Create todo list with all tasks
+todo([
+    {"id": "task-1", "content": "Create User model with email field", "status": "pending"},
+    {"id": "task-2", "content": "Add password hashing utility", "status": "pending"},
+    {"id": "task-3", "content": "Create login endpoint", "status": "pending"},
+])
+```
+**Key:** Read the plan ONCE. Extract everything. Don't make subagents read the plan file — provide the full task text directly in context.
+### 2. Per-Task Workflow
+For EACH task in the plan:
+#### Step 1: Dispatch Implementer Subagent
+Use `delegate_task` with complete context:
+```python
+delegate_task(
+    goal="Implement Task 1: Create User model with email and password_hash fields",
+    context="""
+    TASK FROM PLAN:
+    - Create: src/models/user.py
+    - Add User class with email (str) and password_hash (str) fields
+    - Use bcrypt for password hashing
+    - Include __repr__ for debugging
+    FOLLOW TDD:
+    1. Write failing test in tests/models/test_user.py
+    2. Run: pytest tests/models/test_user.py -v (verify FAIL)
+    3. Write minimal implementation
+    4. Run: pytest tests/models/test_user.py -v (verify PASS)
+    5. Run: pytest tests/ -q (verify no regressions)
+    6. Commit: git add -A && git commit -m "feat: add User model with password hashing"
+    PROJECT CONTEXT:
+    - Python 3.11, Flask app in src/app.py
+    - Existing models in src/models/
+    - Tests use pytest, run from project root
+    - bcrypt already in requirements.txt
+    """,
+    toolsets=['terminal', 'file']
+)
+```
+#### Step 2: Dispatch Spec Compliance Reviewer
+After the implementer completes, verify against the original spec:
+```python
+delegate_task(
+    goal="Review if implementation matches the spec from the plan",
+    context="""
+    ORIGINAL TASK SPEC:
+    - Create src/models/user.py with User class
+    - Fields: email (str), password_hash (str)
+    - Use bcrypt for password hashing
+    - Include __repr__
+    CHECK:
+    - [ ] All requirements from spec implemented?
+    - [ ] File paths match spec?
+    - [ ] Function signatures match spec?
+    - [ ] Behavior matches expected?
+    - [ ] Nothing extra added (no scope creep)?
+    OUTPUT: PASS or list of specific spec gaps to fix.
+    """,
+    toolsets=['file']
+)
+```
+**If spec issues found:** Have the implementer subagent (or a new one) fix the gaps, then re-run the spec review. Continue only when the review returns PASS.
+#### Step 3: Dispatch Code Quality Reviewer
+After spec compliance passes:
+```python
+delegate_task(
+    goal="Review code quality for Task 1 implementation",
+    context="""
+    FILES TO REVIEW:
+    - src/models/user.py
+    - tests/models/test_user.py
+    CHECK:
+    - [ ] Follows project conventions and style?
+    - [ ] Proper error handling?
+    - [ ] Clear variable/function names?
+    - [ ] Adequate test coverage?
+    - [ ] No obvious bugs or missed edge cases?
+    - [ ] No security issues?
+    OUTPUT FORMAT:
+    - Critical Issues: [must fix before proceeding]
+    - Important Issues: [should fix]
+    - Minor Issues: [optional]
+    - Verdict: APPROVED or REQUEST_CHANGES
+    """,
+    toolsets=['file']
+)
+```
+**If quality issues found:** Have the implementer subagent (or a new one) fix them, then re-run the quality review. Continue only when the verdict is APPROVED.
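If the controller needs to act on the reviewer's report programmatically, a small parser along these lines may help. This is a sketch: `parse_quality_review` is a hypothetical helper, and it assumes the reviewer writes `none` (or leaves the field empty) when a severity level has no findings, which the output format above does not state.

```python
import re

# Match "- Critical Issues: ..." style lines; only spaces/tabs are skipped so an
# empty field does not swallow the following line.
SEVERITY_LINE = re.compile(
    r"^[ \t]*-?[ \t]*(Critical|Important|Minor) Issues:[ \t]*(.*)$", re.MULTILINE
)
VERDICT_LINE = re.compile(r"Verdict:[ \t]*(APPROVED|REQUEST_CHANGES)\b")


def parse_quality_review(report: str) -> dict:
    """Extract per-severity findings and the verdict from a reviewer report."""
    issues = {}
    for severity, text in SEVERITY_LINE.findall(report):
        text = text.strip()
        # Assumption: "none" or an empty field means no findings at this level.
        issues[severity.lower()] = "" if text.lower() in ("", "none") else text
    match = VERDICT_LINE.search(report)
    # A missing or malformed verdict is treated as a failed review, not a pass.
    verdict = match.group(1) if match else "REQUEST_CHANGES"
    blocking = bool(issues.get("critical") or issues.get("important"))
    return {
        "verdict": verdict,
        "issues": issues,
        "can_proceed": verdict == "APPROVED" and not blocking,
    }


report = """
- Critical Issues: none
- Important Issues: Magic number 8, extract to constant
- Minor Issues: none
- Verdict: REQUEST_CHANGES
"""
result = parse_quality_review(report)
print(result["verdict"], result["can_proceed"])
```

Defaulting to `REQUEST_CHANGES` when no verdict is found keeps an unreadable report from being mistaken for approval.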
+#### Step 4: Mark Complete
+```python
+todo([{"id": "task-1", "content": "Create User model with email field", "status": "completed"}], merge=True)
+```
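Steps 1 to 4 can be read as one gated loop. The sketch below shows only the ordering: implement, then spec review until it passes, then quality review until it passes, then mark complete. The four callables are hypothetical stand-ins for the `delegate_task` dispatches, and `MAX_ROUNDS` is an assumption added to keep the sketch from looping forever; the source sets no retry limit.

```python
MAX_ROUNDS = 3  # Assumption: cap on review rounds, not part of the skill.


def run_task(task, implement, spec_review, quality_review, fix):
    """Run one task through implementation and both review gates, in order."""
    implement(task)
    # Spec compliance must pass before the quality review starts.
    for name, review in (("spec", spec_review), ("quality", quality_review)):
        for _ in range(MAX_ROUNDS):
            findings = review(task)
            if not findings:
                break  # Gate passed; move on to the next gate.
            fix(task, findings)  # A fix is always followed by a repeat review.
        else:
            raise RuntimeError(f"{name} review still failing after {MAX_ROUNDS} rounds")
    task["status"] = "completed"
    return task


# Stub callables that record the call order; the first spec review reports a gap.
log = []
spec_results = iter([["missing password length validation"], []])


def implement(task):
    log.append("implement")


def spec_review(task):
    log.append("spec")
    return next(spec_results)


def quality_review(task):
    log.append("quality")
    return []


def fix(task, findings):
    log.append("fix")


done = run_task({"id": "task-1", "status": "pending"}, implement, spec_review, quality_review, fix)
print(log, done["status"])
```

Raising when a gate never passes is one possible escalation choice; a real controller might instead hand the task back to the user.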
+### 3. Final Review
+After ALL tasks are complete, dispatch a final integration reviewer:
+```python
+delegate_task(
+    goal="Review the entire implementation for consistency and integration issues",
+    context="""
+    All tasks from the plan are complete. Review the full implementation:
+    - Do all components work together?
+    - Any inconsistencies between tasks?
+    - All tests passing?
+    - Ready for merge?
+    """,
+    toolsets=['terminal', 'file']
+)
+```
+### 4. Verify and Commit
+```bash
+# Run full test suite
+pytest tests/ -q
+# Review all changes
+git diff --stat
+# Final commit if needed
+git add -A && git commit -m "feat: complete [feature name] implementation"
+```
+## Task Granularity
+**Each task = 2-5 minutes of focused work.**
+**Too big:**
+- "Implement user authentication system"
+**Right size:**
+- "Create User model with email and password fields"
+- "Add password hashing function"
+- "Create login endpoint"
+- "Add JWT token generation"
+- "Create registration endpoint"
+## Red Flags — Never Do These
+- Start implementation without a plan
+- Skip reviews (spec compliance OR code quality)
+- Proceed with unfixed critical/important issues
+- Dispatch multiple implementation subagents for tasks that touch the same files
+- Make a subagent read the plan file (provide the full task text in context instead)
+- Skip scene-setting context (subagent needs to understand where the task fits)
+- Ignore subagent questions (answer before letting them proceed)
+- Accept "close enough" on spec compliance
+- Skip review loops (reviewer found issues → implementer fixes → review again)
+- Let implementer self-review replace actual review (both are needed)
+- **Start code quality review before spec compliance is PASS** (wrong order)
+- Move to next task while either review has open issues
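The same-files rule above can be checked mechanically before dispatching anything in parallel. This sketch assumes each task carries a hypothetical `files` list naming the paths it will touch; the source does not define such a field, and the example paths are placeholders.

```python
from itertools import combinations


def conflicting_pairs(tasks):
    """Return (task_a, task_b, shared_files) for every pair that touches the same file."""
    conflicts = []
    for a, b in combinations(tasks, 2):
        shared = set(a["files"]) & set(b["files"])
        if shared:
            conflicts.append((a["id"], b["id"], sorted(shared)))
    return conflicts


# Hypothetical task records; "files" is an assumed field, not part of the todo format.
tasks = [
    {"id": "task-1", "files": ["src/models/user.py", "tests/models/test_user.py"]},
    {"id": "task-2", "files": ["src/utils/passwords.py"]},
    {"id": "task-3", "files": ["src/app.py", "src/models/user.py"]},
]

conflicts = conflicting_pairs(tasks)
print(conflicts)
```

Any pair that shows up in the result should be run one after the other, not dispatched to separate implementers at the same time.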
+## Handling Issues
+### If Subagent Asks Questions
+- Answer clearly and completely
+- Provide additional context if needed
+- Don't rush them into implementation
+### If Reviewer Finds Issues
+- Implementer subagent (or a new one) fixes them
+- Reviewer reviews again
+- Repeat until approved
+- Don't skip the re-review
+### If Subagent Fails a Task
+- Dispatch a new fix subagent with specific instructions about what went wrong
+- Don't try to fix manually in the controller session (context pollution)
+## Efficiency Notes
+**Why fresh subagent per task:**
+- Prevents context pollution from accumulated state
+- Each subagent gets clean, focused context
+- No confusion from prior tasks' code or reasoning
+**Why two-stage review:**
+- Spec review catches under/over-building early
+- Quality review ensures the implementation is well-built
+- Catches issues before they compound across tasks
+**Cost trade-off:**
+- More subagent invocations (implementer + 2 reviewers per task)
+- But catches issues early (cheaper than debugging compounded problems later)
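The trade-off can be put in rough numbers. The sketch below counts subagent dispatches under the assumption that each failed review costs one fix dispatch plus one repeat review, and that a single final integration review runs at the end; `subagent_invocations` is an illustrative helper, not part of the skill.

```python
def subagent_invocations(num_tasks, spec_retries=0, quality_retries=0):
    """Estimate total subagent dispatches for a plan."""
    per_task = 3  # implementer + spec reviewer + quality reviewer
    # Assumption: each failed review adds one fix dispatch and one repeat review.
    retry_cost = 2 * (spec_retries + quality_retries)
    final_review = 1  # the integration review after all tasks
    return num_tasks * per_task + retry_cost + final_review


clean_run = subagent_invocations(5)
with_retries = subagent_invocations(5, spec_retries=1, quality_retries=1)
print(clean_run, with_retries)
```

For a five-task plan this gives 16 dispatches when every review passes first time, and 20 when one spec review and one quality review each need a second round.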
+## Integration with Other Skills
+### With plan
+This skill EXECUTES plans created by the `plan` skill:
+1. User requirements → `plan` skill → implementation plan
+2. Implementation plan → subagent-driven-development (this skill) → working code
+### With test-driven-development
+Implementer subagents should follow TDD:
+1. Write failing test first
+2. Implement minimal code
+3. Verify test passes
+4. Commit
+Include TDD instructions in every implementer context.
+### With requesting-code-review
+The two-stage review process IS the code review. For final integration review, use the requesting-code-review skill's review dimensions.
+### With systematic-debugging
+If a subagent encounters bugs during implementation:
+1. Follow the systematic-debugging process
+2. Find root cause before fixing
+3. Write regression test
+4. Resume implementation
+## Example Workflow
+```
+[Read plan: docs/plans/auth-feature.md]
+[Create todo list with 5 tasks]
+--- Task 1: Create User model ---
+[Dispatch implementer subagent]
+  Implementer: "Should email be unique?"
+  You: "Yes, email must be unique"
+  Implementer: Implemented, 3/3 tests passing, committed.
+[Dispatch spec reviewer]
+  Spec reviewer: ✅ PASS — all requirements met
+[Dispatch quality reviewer]
+  Quality reviewer: ✅ APPROVED — clean code, good tests
+[Mark Task 1 complete]
+--- Task 2: Password hashing ---
+[Dispatch implementer subagent]
+  Implementer: No questions, implemented, 5/5 tests passing.
+[Dispatch spec reviewer]
+  Spec reviewer: ❌ Missing: password strength validation (spec says "min 8 chars")
+[Implementer fixes]
+  Implementer: Added validation, 7/7 tests passing.
+[Dispatch spec reviewer again]
+  Spec reviewer: ✅ PASS
+[Dispatch quality reviewer]
+  Quality reviewer: Important: Magic number 8, extract to constant
+[Implementer fixes]
+  Implementer: Extracted MIN_PASSWORD_LENGTH constant
+[Dispatch quality reviewer again]
+  Quality reviewer: ✅ APPROVED
+[Mark Task 2 complete]
+... (continue for all tasks)
+[After all tasks: dispatch final integration reviewer]
+[Run full test suite: all passing]
+[Done!]
+```
+## Remember
+```
+Fresh subagent per task
+Two-stage review every time
+Spec compliance FIRST
+Code quality SECOND
+Never skip reviews
+Catch issues early
+```
+**Quality is not an accident. It's the result of systematic process.**
+## Further reading (load when relevant)
+When the orchestration involves significant context usage, long review loops, or complex validation checkpoints, load these references for the specific discipline:
+- **`references/context-budget-discipline.md`** — Four-tier context degradation model (PEAK / GOOD / DEGRADING / POOR), read-depth rules that scale with context window size, and early warning signs of silent degradation. Load when a run will clearly consume significant context (multi-phase plans, many subagents, large artifacts).
+- **`references/gates-taxonomy.md`** — The four canonical gate types (Pre-flight, Revision, Escalation, Abort) with behavior, recovery, and examples. Load when designing or reviewing any workflow that has validation checkpoints — use the vocabulary explicitly so each gate has defined entry, failure behavior, and resumption rules.
+Both references are adapted from gsd-build/get-shit-done (MIT © 2025 Lex Christopherson).