deepflow 0.1.111 → 0.1.113

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,7 +1,7 @@
  ---
  name: df:discover
  description: Explore a problem space deeply through structured questioning to surface requirements and constraints
- allowed-tools: [AskUserQuestion, Read, Agent]
+ allowed-tools: [AskUserQuestion, Agent]
  ---
 
  # /df:discover — Deep Problem Exploration
@@ -148,7 +148,7 @@ Context ≥50% → checkpoint and exit. Before spawning: `TaskUpdate(status: "in
 
  **Token tracking start:** Store `start_percentage` (from context.json) and `start_timestamp` (ISO 8601) keyed by task_id. Omit if unavailable.
 
- **NEVER use `isolation: "worktree"`.** Deepflow manages one worktree **per spec** (§1.5). Tasks from the same spec commit to the same branch so wave 2 sees wave 1 commits; tasks from different specs commit to different branches and never interleave. **Spawn ALL ready tasks in ONE message** except file conflicts.
+ **Intra-wave isolation:** Each task in a wave runs with `isolation: "worktree"`. Tasks from the same spec share that spec's worktree branch, so wave 2 sees wave 1 commits; tasks from different specs run in different worktrees and never interleave. **Spawn ALL ready tasks in ONE message** except file conflicts.
 
  **Per-spec routing (CRITICAL):** Each task in `WAVE_JSON` carries a `spec` field (from `bin/wave-runner.js`). When building the agent prompt (§6), you MUST set `Working directory: ${SPEC_WORKTREES[task.spec].path}` — the worktree for that task's spec, NOT the first spec in the map. Cross-spec contamination (spawning a task from spec B into spec A's worktree) corrupts branch history and breaks `/df:verify`. If `task.spec` is absent from the JSON, fall back to deriving it from the task's mini-plan file `.deepflow/plans/doing-{specName}.md`; if still unresolvable, defer the task and log `"⚠ T{N} deferred — cannot resolve spec"`.
 
@@ -156,11 +156,26 @@ Context ≥50% → checkpoint and exit. Before spawning: `TaskUpdate(status: "in
 
  **≥2 [SPIKE] tasks same problem →** Parallel Spike Probes (§5.7). **[OPTIMIZE] tasks →** Optimize Cycle (§5.9), one at a time. **[INTEGRATION] tasks** (`task.isIntegration === true` in WAVE_JSON) **→** use the Integration Task prompt template (§6 Integration Task), not the Standard Task template. Integration tasks always land in the final wave via `Blocked by:` — wave-runner guarantees this, so they execute after all producer/consumer implementation tasks have committed. Route them to the **consumer spec's** worktree via `SPEC_WORKTREES[task.spec].path` (plan.md §4.8.2 places the integration task under the consumer's section header, so `task.spec` is already the consumer).
 
+ ### 5.1. INTRA-WAVE CHERRY-PICK MERGE
+
+ After ALL wave-N agents complete, cherry-pick each wave-N commit back to the main branch BEFORE wave N+1 begins. This ensures wave N+1 agents see all wave-N changes regardless of which worktree they run in.
+
+ **Wave gate:** Wave N+1 MUST NOT start until all wave-N cherry-picks complete.
+
+ **Ordering:** Apply cherry-picks in ascending task-number order (e.g., T1 before T2 before T3) for determinism.
+
+ **Steps (per wave completion):**
+ 1. Collect all task commits from wave N (from ratchet PASS records).
+ 2. Sort commits by ascending task-number order.
+ 3. For each commit, spawn haiku context-fork (§5.8): `git cherry-pick {sha}`. Receive one-line summary.
+ 4. On conflict: log `"⚠ cherry-pick conflict: {sha} — {file}"`, abort cherry-pick, mark task as needing manual resolution.
+ 5. Only after all wave-N cherry-picks finish → proceed to spawn wave N+1 agents.
+
 
  ### 5.5. RATCHET CHECK
 
- Run `node "${HOME}/.claude/bin/ratchet.js"` in the **task's spec worktree** after each agent completes, using that spec's snapshot file:
+ Run `node bin/ratchet.js` in the **task's spec worktree** after each agent completes, using that spec's snapshot file:
  ```bash
- node "${HOME}/.claude/bin/ratchet.js" --worktree ${SPEC_WORKTREES[task.spec].path} --snapshot .deepflow/auto-snapshot-{task.spec}.txt --task T{N}
+ node bin/ratchet.js --worktree ${SPEC_WORKTREES[task.spec].path} --snapshot .deepflow/auto-snapshot-{task.spec}.txt --task T{N}
  ```
 
  The script handles all health checks internally and outputs structured JSON:
@@ -187,7 +202,7 @@ The script handles all health checks internally and outputs structured JSON:
  ```
  (Fall back to text mode if `--json` is unavailable: `node "${HOME}/.claude/bin/wave-runner.js" --plan PLAN.md --recalc --failed T{N}`)
  Report: `"✗ T{n}: reverted"`.
- - **Exit 2 (SALVAGEABLE):** Spawn `Agent(model="sonnet")` to fix lint/typecheck issues. Re-run `node "${HOME}/.claude/bin/ratchet.js"`. If still non-zero → revert both commits, set status pending.
+ - **Exit 2 (SALVAGEABLE):** Spawn `Agent(model="sonnet")` to fix lint/typecheck issues. Re-run `node bin/ratchet.js`. If still non-zero → revert both commits, set status pending.
 
  #### 5.5.1. AC COVERAGE CHECK (after ratchet pass)
 
@@ -207,11 +222,12 @@ where `{spec_path}` is the path to `specs/doing-{spec_name}.md` and `{agent_outp
 
  Parse the agent's response for `DECISIONS:` line. If present:
  1. Split by ` | ` to get individual decisions
- 2. Each decision has format `[TAG] description — rationale` where TAG ∈ {APPROACH, PROVISIONAL, ASSUMPTION, FUTURE, UPDATE}
- 3. Append to `.deepflow/decisions.md` under `### {date}{spec_name}` header (create header if first decision for this spec today, reuse if exists)
- 4. Format: `- [TAG] description — rationale`
+ 2. If any entry does not start with `[TAG]` where TAG ∈ {APPROACH, PROVISIONAL, ASSUMPTION, FUTURE, UPDATE}, emit SALVAGEABLE and skip writing that entry to decisions.md (valid entries still get written).
+ 3. Each decision has format `[TAG] description — rationale` where TAG ∈ {APPROACH, PROVISIONAL, ASSUMPTION, FUTURE, UPDATE}
+ 4. Append to `.deepflow/decisions.md` under `### {date} {spec_name}` header (create header if first decision for this spec today, reuse if exists)
+ 5. Format: `- [TAG] description — rationale`
 
- If no `DECISIONS:` line in agent output → skip silently (mechanical tasks don't produce decisions).
+ If no `DECISIONS:` line in agent output and the task effort is not `low` → emit SALVAGEABLE (non-trivial tasks without a decision line may indicate the agent skipped documenting architectural choices). For tasks with effort `low`, skip silently (mechanical tasks don't produce decisions).
 
  **This runs on every ratchet pass, not just at verify time.** Decisions are captured incrementally as tasks complete, so they're never lost even if verify fails or merge is manual.
 
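The parse-and-validate flow above might look like this — a minimal sketch; the function name is illustrative, not part of the package:

```javascript
const VALID_TAGS = ["APPROACH", "PROVISIONAL", "ASSUMPTION", "FUTURE", "UPDATE"];

// Split a DECISIONS: line into entries; keep well-formed `[TAG] ...` entries
// and flag SALVAGEABLE if any entry is malformed.
function parseDecisionsLine(line) {
  const entries = line.replace(/^DECISIONS:\s*/, "").split(" | ");
  const valid = [];
  let salvageable = false;
  for (const entry of entries) {
    const m = entry.match(/^\[([A-Z]+)\]\s/);
    if (m && VALID_TAGS.includes(m[1])) {
      valid.push(`- ${entry.trim()}`); // formatted for decisions.md
    } else {
      salvageable = true; // malformed entry: skipped, not written
    }
  }
  return { valid, salvageable };
}
```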
@@ -232,6 +248,20 @@ tokens:
  ```
  Omit if context.json/token-history.jsonl/awk unavailable. Never fail ratchet for tracking errors.
 
+ ### 5.6. WAVE TEST AGENT
+
+ Trigger: task type is [TEST] or orchestrator spawns a dedicated test-writing agent for a wave.
+
+ Before spawning the test agent, collect context:
+ ```bash
+ SNAPSHOT_FILES=!`cat .deepflow/auto-snapshot.txt 2>/dev/null || echo ''`
+ EXISTING_TEST_NAMES=!`grep -h -E "^\s*(it|test|describe)\(" ${SNAPSHOT_FILES} 2>/dev/null | sed "s/^[[:space:]]*//" || echo ''`
+ ```
+
+ Pass `SNAPSHOT_FILES` and `EXISTING_TEST_NAMES` into the agent prompt so it can avoid duplication.
+
+ **Implementation diff:** The wave test agent reads the implementation diff itself using the `Read` tool or `git diff` — do NOT capture or pass the raw diff to the wave test prompt inline. Injecting large diffs inflates context and causes rot.
+
  ### 5.7. PARALLEL SPIKE PROBES
 
  Trigger: ≥2 [SPIKE] tasks with same blocker or identical hypothesis.
@@ -399,7 +429,7 @@ Success criteria: {ACs from spec relevant to this task}
  {If spec contains ## Domain Model section:
  --- CONTEXT: Domain Model ---
  {Domain Model section content from doing-*.md, extracted via shell injection:
- DOMAIN_MODEL=!`sed -n '/^## Domain Model$/,/^## [^D]/p' specs/doing-{spec_name}.md | head -n -1 2>/dev/null || echo 'NOT_FOUND'`
+ DOMAIN_MODEL=!`sed -n '/^## Domain Model$/,/^## /p' specs/doing-{spec_name}.md | head -n -1 2>/dev/null || echo 'NOT_FOUND'`
  }
  }
  {If EXISTING_TYPES is non-empty:
@@ -421,7 +451,7 @@ AC-2:skip:reason here (if applicable)
  AC_COVERAGE_END
  ```
  Format: one line per AC with either `AC-N:done` or `AC-N:skip:reason`. Omit this block if the spec has no acceptance criteria.
- DECISIONS: If you made non-obvious choices, append to the LAST LINE BEFORE TASK_STATUS:
+ DECISIONS: If you made non-obvious choices, cite with [APPROACH]. Append to the LAST LINE BEFORE TASK_STATUS:
  DECISIONS: [TAG] {decision} — {rationale} | [TAG] {decision2} — {rationale2}
  Tags:
  [APPROACH] — chose X over Y (architectural/design choice)
@@ -430,6 +460,7 @@ Tags:
  [FUTURE] — deferred X because Y; revisit when Z
  [UPDATE] — changed prior decision from X to Y because Z
  Skip for trivial/mechanical changes.
+ Files: List every file you modified or created on a single line, comma-separated, in the format `Files: path/to/file.ts, path/to/other.ts`. This is required so the orchestrator can detect file conflicts across concurrent tasks.
  Last line of your response MUST be: TASK_STATUS:pass (if successful) or TASK_STATUS:fail (if failed) or TASK_STATUS:revert (if reverted)
  ```
 
@@ -442,6 +473,7 @@ Integration ACs: {list from PLAN.md}
  Specs involved: {spec file paths}
  Interface Map: {from integration task detail}
  Contract Risks: {from integration task detail}
+ LSP documentSymbol on Impact files → Read with offset/limit on relevant ranges only (never read full files)
  --- END ---
  RULES:
  - Fix the CONSUMER to match the PRODUCER's declared interface. Never weaken the producer.
@@ -464,7 +496,28 @@ Last line: TASK_STATUS:pass or TASK_STATUS:fail
 
  **Bootstrap:** `BOOTSTRAP: Write tests for edit_scope files. Do NOT change implementation. Commit as test({spec}): bootstrap. Last line: TASK_STATUS:pass or TASK_STATUS:fail`
 
- **Spike:** `{task_id} [SPIKE]: {hypothesis}. Files+Spec. {reverted warnings}. Minimal spike. Commit as spike({spec}): {desc}. If you discovered constraints, rejected approaches, or made assumptions, report: DECISIONS: [TAG] {finding} — {why it matters} (use PROVISIONAL for "works but needs revisit", ASSUMPTION for "assumed X; if wrong Y breaks", APPROACH for definitive choices). Last line: TASK_STATUS:pass or TASK_STATUS:fail`
+ **Wave Test** (`Agent(model="sonnet")`):
+ ```
+ --- START ---
+ {task_id} [TEST]: Write tests for {spec_name}. Files+Spec.
+ Pre-existing test files:
+ {SNAPSHOT_FILES}
+
+ Existing test function names (do NOT duplicate these):
+ {EXISTING_TEST_NAMES}
+ --- MIDDLE ---
+ Spec: {spec_path}
+ Edit scope: {edit_scope}
+ --- END ---
+ RULES:
+ - Use the `Read` tool (or `git diff HEAD~1`) to inspect what the implementation changed before writing tests.
+ - Do not duplicate tests that already exist in the pre-existing test files listed above.
+ - Do not modify pre-existing test files — write new test files only.
+ - Commit as test({spec}): {description}.
+ Last line of your response MUST be: TASK_STATUS:pass (if successful) or TASK_STATUS:fail (if failed)
+ ```
+
+ **Spike**: `{task_id} [SPIKE]: {hypothesis}. Files+Spec. {reverted warnings}. Minimal spike. Commit as spike({spec}): {desc}. If you discovered constraints, rejected approaches, or made assumptions, report: DECISIONS: [TAG] {finding} — {why it matters} (use PROVISIONAL for "works but needs revisit", ASSUMPTION for "assumed X; if wrong Y breaks", APPROACH for definitive choices). Last line: TASK_STATUS:pass or TASK_STATUS:fail`
 
  **Optimize Task** (`Agent(model="opus")`):
  ```
@@ -474,6 +527,7 @@ Current: {val} (baseline: {b}, best: {best}). Target: {t} ({dir}). Metric: {cmd}
  CONSTRAINT: ONE atomic change.
  --- MIDDLE ---
  Last 5 cycles + failed hypotheses + Impact/deps.
+ LSP documentSymbol on Impact files → Read with offset/limit on relevant ranges only (never read full files)
  --- END ---
  {Learnings}. ONE change + commit. No metric run, no multiple changes.
  Last line of your response MUST be: TASK_STATUS:pass or TASK_STATUS:fail or TASK_STATUS:revert
@@ -489,6 +543,7 @@ Current/Target. Role instruction:
  ingenua: "Ignore prior. Fresh approach."
  --- MIDDLE ---
  Full history + all failed hypotheses.
+ LSP documentSymbol on Impact files → Read with offset/limit on relevant ranges only (never read full files)
  --- END ---
  ONE atomic change. Commit. STOP.
  Last line of your response MUST be: TASK_STATUS:pass or TASK_STATUS:fail or TASK_STATUS:revert
@@ -230,7 +230,7 @@ You are a spec planner. Your job is to independently analyze a spec and produce
  2. **Compute spec layer** — determine L0–L3 based on sections present (see layer rules below)
  3. **Check experiments** — glob `.deepflow/experiments/{topic}--*` for past spikes
  4. **Explore the codebase** — detect code style, patterns, integration points relevant to this spec
- 5. **Impact analysis** (L3 only) — LSP-first blast radius for files in scope
+ 5. **Impact analysis** (L3 only) — LSP documentSymbol on impact files → Read with offset/limit on relevant ranges only (never read full files)
  6. **Targeted exploration** — follow `templates/explore-agent.md` spawn rules for post-LSP gaps
  7. **Generate tasks** — produce a mini-plan following the output format below
 
@@ -402,10 +402,9 @@ The reasoner prompt:
  ```
  You are the plan reasoner. Analyze this spec and produce a prioritized task plan.
 
- ## Spec file path
- {spec_path}
-
- Read the spec using the Read tool on the path above. Do NOT read any implementation files.
+ ## Spec content
+ <!-- {spec_content} — injected by orchestrator before spawning; do NOT use Read tool on the spec -->
+ {spec_content}
 
  ## Agent summaries (from §3 parallel agents)
 
@@ -55,6 +55,7 @@ Spawn reasoner agent (`subagent_type: "reasoner"`, `model: "opus"`). The reasone
  - Flags conflicts with existing code
  - Verifies every REQ-N has a corresponding AC; flags uncovered requirements
  - Flags vague/untestable requirements (e.g., "should be fast" without a metric)
+ - If Explore agents found type definitions or interfaces relevant to this spec, include a ## Domain Model section with Key Types (signatures only) and Ubiquitous Language (domain terms). Omit if no relevant types found.
 
  ### 4. GENERATE SPEC
 
@@ -0,0 +1,205 @@
+ ---
+ name: repo-inspect
+ description: Produces structured JSON intelligence for a remote GitHub repo — fetches metadata and file tree via gh api, reads key files via WebFetch. No local clone. Use when evaluating an unfamiliar repo before planning integration work.
+ context: fork
+ allowed-tools: [Bash, WebFetch]
+ ---
+
+ # Repo-Inspect
+
+ Inspect a GitHub repository and emit a single JSON object describing its architecture. No clones, no tmpdir, no local filesystem writes.
+
+ **Input:** `{owner}/{repo}` or a full GitHub URL (e.g., `https://github.com/owner/repo`).
+ **Output:** Raw JSON only — no markdown, no commentary.
+
+ ---
+
+ ## Protocol
+
+ ### Step 0 — Parse Input
+
+ Strip `https://github.com/` prefix if present. Extract `{owner}` and `{repo}` from the remaining `owner/repo` string.
+
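Step 0 amounts to a few string operations; a minimal sketch (the function name is illustrative, not part of the package):

```javascript
// Normalize "{owner}/{repo}" or a full GitHub URL into its two components.
function parseRepoInput(input) {
  const cleaned = input.trim().replace(/^https?:\/\/github\.com\//, "");
  const [owner, repo] = cleaned.split("/");
  if (!owner || !repo) throw new Error(`cannot parse repo input: ${input}`);
  return { owner, repo };
}
```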
+ ### Step 1 — Fetch Repo Metadata (1 Bash call)
+
+ ```bash
+ gh api repos/{owner}/{repo}
+ ```
+
+ Extract: `description`, `language`, `topics`, `default_branch`, `stargazers_count`, `forks_count`.
+
+ On error (non-zero exit or JSON with `message` field indicating 404/403):
+ ```json
+ {"error": "api_failed", "message": "<gh api error text>"}
+ ```
+ Stop and return this error JSON immediately.
+
+ ### Step 2 — Fetch Full File Tree (1 Bash call)
+
+ ```bash
+ gh api "repos/{owner}/{repo}/git/trees/{default_branch}?recursive=1"
+ ```
+
+ Parse `tree[]` array. Each item has: `path`, `type` (`blob`|`tree`), `size`.
+
+ If tree is truncated (`truncated: true`), note it but proceed — the tree API returns up to ~100K entries which covers virtually all repos.
+
+ ### Step 3 — Language Detection
+
+ Scan tree paths for manifest files in priority order:
+
+ | Manifest | Language |
+ |---|---|
+ | `Cargo.toml` | Rust |
+ | `package.json` | JavaScript/TypeScript |
+ | `pyproject.toml` or `setup.py` or `requirements.txt` | Python |
+ | `go.mod` | Go |
+ | `pom.xml` or `build.gradle` | Java |
+ | `mix.exs` | Elixir |
+ | `Gemfile` | Ruby |
+ | `build.zig` | Zig |
+ | `CMakeLists.txt` | C/C++ |
+
+ Use the **first match** (highest priority). If no manifest found, fall back to `language` field from Step 1 metadata.
+
+ Record: `detected_language`, `manifest_path` (path of matched manifest, or null).
+
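The priority scan above can be sketched as a first-match lookup. A minimal sketch; matching nested manifests (not just root-level ones) is an assumption, since the step doesn't specify:

```javascript
// Priority-ordered manifest -> language pairs, mirroring the table above.
const MANIFEST_PRIORITY = [
  ["Cargo.toml", "Rust"],
  ["package.json", "JavaScript/TypeScript"],
  ["pyproject.toml", "Python"],
  ["setup.py", "Python"],
  ["requirements.txt", "Python"],
  ["go.mod", "Go"],
  ["pom.xml", "Java"],
  ["build.gradle", "Java"],
  ["mix.exs", "Elixir"],
  ["Gemfile", "Ruby"],
  ["build.zig", "Zig"],
  ["CMakeLists.txt", "C/C++"],
];

// First match wins; fall back to the Step 1 `language` field.
function detectLanguage(treePaths, fallbackLanguage) {
  for (const [manifest, language] of MANIFEST_PRIORITY) {
    const match = treePaths.find((p) => p === manifest || p.endsWith(`/${manifest}`));
    if (match) return { detected_language: language, manifest_path: match };
  }
  return { detected_language: fallbackLanguage, manifest_path: null };
}
```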
+ ### Step 4 — File Selection (3–6 files)
+
+ Build a prioritized list of files to fetch. Select 3–6 total:
+
+ 1. **README** — find `README.md` or `README.rst` or `README` in tree root (depth 0). Always include if present.
+ 2. **Manifest** — the manifest file detected in Step 3. Always include if present.
+ 3. **Primary entry point** — search tree for (in order): `src/main.*`, `src/lib.*`, `src/index.*`, `index.*`, `app.*`, `main.*`. Pick the first match at the shallowest depth.
+ 4. **Supplemental files** — from remaining blobs: prefer shallowest paths, then largest `size`. Pick source files (`.rs`, `.ts`, `.js`, `.py`, `.go`, `.java`, `.ex`, `.rb`, `.zig`, `.c`, `.cpp`, `.h`). Fill up to 6 total.
+
+ For monorepos (detected when tree contains `packages/*/`, `crates/*/`, `apps/*/` directories, or manifest workspace field): select 1–2 representative sub-package manifests/entry points instead of generic supplemental files.
+
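The supplemental ranking in item 4 (shallowest path first, then largest size) might look like this sketch; `rankSupplemental` is an illustrative name:

```javascript
const SOURCE_EXTS = [".rs", ".ts", ".js", ".py", ".go", ".java", ".ex", ".rb", ".zig", ".c", ".cpp", ".h"];

// Rank remaining blobs for supplemental selection: source files only,
// shallowest paths first, then largest size.
function rankSupplemental(blobs) {
  const depth = (p) => p.split("/").length;
  return blobs
    .filter((b) => SOURCE_EXTS.some((ext) => b.path.endsWith(ext)))
    .sort((a, b) => depth(a.path) - depth(b.path) || (b.size || 0) - (a.size || 0));
}
```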
+ ### Step 5 — Fetch File Contents (3–6 WebFetch calls)
+
+ For each selected file path, fetch:
+
+ ```
+ https://raw.githubusercontent.com/{owner}/{repo}/{default_branch}/{path}
+ ```
+
+ Use WebFetch. If a fetch fails (404 or network error), skip that file and note it. Do not retry.
+
+ Collect: list of `{path, content}` pairs for all successfully fetched files.
+
+ ### Step 6 — Extract Intelligence from Fetched Content
+
+ From manifest content (if fetched):
+
+ - **dependencies_count**: Count entries in `[dependencies]` (Cargo.toml), `dependencies` + `devDependencies` keys (package.json), `[tool.poetry.dependencies]` (pyproject.toml), `require` directives (go.mod/Gemfile), `<dependency>` tags (pom.xml). Use 0 if manifest not fetched.
+ - **test_framework**: Check dev-dependencies for known test frameworks:
+   - JS/TS: `jest`, `vitest`, `mocha`, `jasmine`, `tap`, `ava`
+   - Python: `pytest`, `unittest` (stdlib), `nose`
+   - Rust: built-in (`#[test]`), `rstest`, `proptest`
+   - Go: built-in (`testing` package)
+   - Java: `junit`, `testng`
+   - Ruby: `rspec`, `minitest`
+   - Elixir: built-in (`ExUnit`)
+   Also check tree for `test/`, `tests/`, `spec/`, `__tests__/` directories as corroboration.
+ - **monorepo**: true if tree contains at least 2 of `packages/`, `crates/`, `apps/`, `libs/` top-level dirs, OR if manifest has workspace/workspaces field.
+
+ From README content (if fetched):
+ - Extract the first non-heading paragraph as a candidate for `purpose`. Trim to ≤ 200 chars.
+
+ Fallback for `purpose`: use repo `description` from Step 1 metadata.
+
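For one of the manifest formats, the dependency counting described above can be sketched as follows (package.json case only; the function name is illustrative):

```javascript
// Count dependencies for a package.json manifest (dependencies +
// devDependencies keys, as described above); 0 when not fetched.
function countPackageJsonDeps(manifestContent) {
  if (!manifestContent) return 0;
  try {
    const pkg = JSON.parse(manifestContent);
    return (
      Object.keys(pkg.dependencies || {}).length +
      Object.keys(pkg.devDependencies || {}).length
    );
  } catch (err) {
    return 0; // unparseable manifest: treat as not fetched
  }
}
```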
+ ### Step 7 — Derive key_modules
+
+ From the tree blob paths, identify directories containing 2+ source files (files with extensions `.rs`, `.ts`, `.js`, `.tsx`, `.jsx`, `.py`, `.go`, `.java`, `.ex`, `.rb`, `.zig`, `.c`, `.cpp`, `.h`, `.swift`, `.kt`).
+
+ Algorithm:
+ 1. For each blob, extract parent directory path.
+ 2. Count source files per directory.
+ 3. Keep directories with count >= 2.
+ 4. Sort by file count descending, then by path depth ascending (shallower = more significant).
+ 5. Take up to 10 modules.
+ 6. Strip common prefixes (e.g., if all modules share `src/`, keep `src/` as a module too).
+
+ Return directory names (last path segment) for the `key_modules` array. If fewer than 3 candidate directories exist, include directories with 1 source file to reach 3, or return what's available.
+
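Steps 1–5 of the algorithm above can be sketched as follows (a minimal sketch; prefix stripping and the fewer-than-3 fallback are omitted, and the function name is illustrative):

```javascript
const SRC_EXTS = [".rs", ".ts", ".js", ".tsx", ".jsx", ".py", ".go", ".java", ".ex", ".rb", ".zig", ".c", ".cpp", ".h", ".swift", ".kt"];

// Count source files per parent directory, keep dirs with >= 2,
// sort by count desc then depth asc, take up to 10 last segments.
function deriveKeyModules(blobPaths) {
  const counts = new Map();
  for (const p of blobPaths) {
    if (!SRC_EXTS.some((ext) => p.endsWith(ext))) continue;
    const dir = p.includes("/") ? p.slice(0, p.lastIndexOf("/")) : ".";
    counts.set(dir, (counts.get(dir) || 0) + 1);
  }
  return [...counts.entries()]
    .filter(([, n]) => n >= 2)
    .sort(([dirA, nA], [dirB, nB]) =>
      nB - nA || dirA.split("/").length - dirB.split("/").length
    )
    .slice(0, 10)
    .map(([dir]) => dir.split("/").pop()); // last path segment
}
```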
+ ### Step 8 — Derive concepts_applicable
+
+ Based on language, test framework, monorepo status, and key module names, suggest applicable engineering concepts. Examples:
+
+ - Monorepo → `"workspace-management"`, `"cross-package-testing"`
+ - Rust → `"ownership-model"`, `"cargo-workspace"` (if monorepo)
+ - TypeScript → `"type-safety"`, `"module-resolution"`
+ - Has `auth` module → `"authentication-patterns"`
+ - Has `db` or `models` module → `"data-modeling"`
+ - Has `api` or `routes` module → `"rest-api-design"`
+ - Has tests → `"tdd"` or `"bdd"` (if rspec/jasmine)
+
+ Limit to 3–7 concepts. These are suggestions for the caller — not exhaustive.
+
+ ### Step 9 — Confidence Score
+
+ Set `confidence` based on data quality:
+
+ | Condition | Confidence |
+ |---|---|
+ | README + manifest + entry point all fetched | `high` |
+ | README or manifest fetched, but not both | `medium` |
+ | Neither README nor manifest fetched | `low` |
+
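The table above maps directly to a small function. One sketch — note the table leaves "README and manifest both fetched but no entry point" unspecified; this sketch treats that case as `medium`, which is an assumption:

```javascript
// Map fetch outcomes to the confidence tiers in the table above.
function confidenceScore({ readme, manifest, entryPoint }) {
  if (readme && manifest && entryPoint) return "high";
  if (readme || manifest) return "medium"; // assumption: covers both-but-no-entry-point too
  return "low";
}
```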
+ ### Step 10 — Emit JSON Output
+
+ Output **exactly one JSON object** with no surrounding text, no markdown code fences, no comments:
+
+ ```json
+ {
+   "repo": "{owner}/{repo}",
+   "purpose": "<first non-heading README paragraph or repo description, ≤200 chars>",
+   "architecture": {
+     "language": "<detected language>",
+     "entry_points": ["<relative paths of main/lib/index files>"],
+     "key_modules": ["<directory names with 2+ source files>"],
+     "dependencies_count": 0,
+     "test_framework": "<framework name or 'unknown'>"
+   },
+   "concepts_applicable": ["<concept1>", "<concept2>"],
+   "files_inspected": ["<path1>", "<path2>"],
+   "confidence": "high|medium|low"
+ }
+ ```
+
+ **Critical:** The very last thing you output must be this JSON object and nothing else. Do not wrap in code blocks. Do not add explanation.
+
+ ---
+
+ ## Error Handling
+
+ | Scenario | Action |
+ |---|---|
+ | `gh api` returns non-zero exit for metadata | Return `{"error": "api_failed", "message": "<stderr>"}` and stop |
+ | `gh api` returns 404 JSON | Return `{"error": "api_failed", "message": "Repository not found or not accessible"}` |
+ | Tree fetch fails | Return `{"error": "tree_failed", "message": "<stderr>"}` and stop |
+ | All WebFetch calls fail | Set confidence to "low", proceed with tree-only analysis |
+ | Single WebFetch fails | Skip file, continue |
+
+ ---
+
+ ## Efficiency Budget
+
+ - `gh api` calls: exactly 2 (metadata + tree)
+ - WebFetch calls: 3–6 (selected files)
+ - Analysis steps: ~5 (no extra Bash calls needed)
+ - **Total tool calls: ≤ 20**
+ - **Wall time: ≤ 60s**
+ - **Tokens: ≤ 30K**
+
+ Do not make extra `gh api` calls. Do not fetch files not in the selection list. The tree endpoint returns all paths in one call — no Glob, no Read, no additional listing needed.
+
+ ---
+
+ ## Rules
+
+ - Never write to local filesystem (no `> file`, no `mktemp`, no `git clone`).
+ - Never use Read, Glob, or Grep tools — this skill operates on remote data only.
+ - Output raw JSON only — the caller parses it, not reads it as prose.
+ - Private repos work automatically via `gh auth` stored token.
+ - `context: fork` means this skill's token usage doesn't pollute the caller's context.
@@ -96,6 +96,9 @@ quality:
  # Timeout in seconds to wait for the dev server to become ready (default: 30)
  browser_timeout: 30
 
+ # Minimum quality score threshold for harness verification (0.0-1.0, default: 0.6)
+ harness_min_score: 0.6
+
  # Ratchet configuration for /df:verify health gate
  # Ratchet snapshots baseline metrics (tests passing, coverage, type checks) before execution
  # and ensures subsequent runs don't regress. These overrides control which commands ratchet monitors.
@@ -43,6 +43,23 @@
 
  - [Explicitly excluded: e.g., "Video upload is NOT included"]
 
+ ## Domain Model
+
+ <!-- Optional. Define the core entities and vocabulary. -->
+
+ ### Key Types
+
+ ```typescript
+ // Core domain types and entities
+ ```
+
+ ### Ubiquitous Language
+
+ - **Term**: Definition
+ - **Term**: Definition
+
+ _Note: Keep to max 15 terms for clarity._
+
 
  ## Acceptance Criteria
 
  - [ ] [Testable criterion: e.g., "User can upload jpg/png/webp files"]