@tekyzinc/gsd-t 2.46.11 → 2.50.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (40)
  1. package/CHANGELOG.md +11 -0
  2. package/README.md +22 -2
  3. package/bin/debug-ledger.js +193 -0
  4. package/bin/gsd-t.js +259 -1
  5. package/commands/gsd-t-debug.md +26 -1
  6. package/commands/gsd-t-execute.md +31 -3
  7. package/commands/gsd-t-help.md +18 -2
  8. package/commands/gsd-t-integrate.md +16 -0
  9. package/commands/gsd-t-quick.md +18 -1
  10. package/commands/gsd-t-test-sync.md +5 -1
  11. package/commands/gsd-t-verify.md +6 -1
  12. package/commands/gsd-t-wave.md +26 -0
  13. package/docs/GSD-T-README.md +83 -1
  14. package/docs/architecture.md +9 -1
  15. package/docs/requirements.md +30 -0
  16. package/package.json +1 -1
  17. package/templates/CLAUDE-global.md +19 -2
  18. package/templates/stacks/_security.md +243 -0
  19. package/templates/stacks/desktop.ini +2 -0
  20. package/templates/stacks/docker.md +202 -0
  21. package/templates/stacks/firebase.md +166 -0
  22. package/templates/stacks/flutter.md +205 -0
  23. package/templates/stacks/github-actions.md +201 -0
  24. package/templates/stacks/graphql.md +216 -0
  25. package/templates/stacks/neo4j.md +218 -0
  26. package/templates/stacks/nextjs.md +184 -0
  27. package/templates/stacks/node-api.md +196 -0
  28. package/templates/stacks/playwright.md +528 -0
  29. package/templates/stacks/postgresql.md +225 -0
  30. package/templates/stacks/python.md +243 -0
  31. package/templates/stacks/react-native.md +216 -0
  32. package/templates/stacks/react.md +293 -0
  33. package/templates/stacks/redux.md +193 -0
  34. package/templates/stacks/rest-api.md +202 -0
  35. package/templates/stacks/supabase.md +188 -0
  36. package/templates/stacks/tailwind.md +169 -0
  37. package/templates/stacks/typescript.md +176 -0
  38. package/templates/stacks/vite.md +176 -0
  39. package/templates/stacks/vue.md +189 -0
  40. package/templates/stacks/zustand.md +203 -0
package/CHANGELOG.md CHANGED
@@ -2,6 +2,17 @@
 
 All notable changes to GSD-T are documented here. Updated with each release.
 
+## [2.50.10] - 2026-03-25
+
+### Added
+- **18 new stack rule files** — python, flutter, tailwind, react-native, vite, nextjs, vue, docker, postgresql (with graph-in-SQL section), github-actions, rest-api, supabase, firebase, graphql, zustand, redux, neo4j, playwright. Total: 22 stack rules (was 4).
+- **Playwright best practices** — coverage matrix per feature, pairwise combinatorial testing, state transition testing, multi-step workflow testing, Page Object Model, API mocking patterns. Enforces rigorous test depth across permutations.
+- **react.md expanded** — added state management decision table, form management (react-hook-form + zod), React naming conventions (3 new sections from external best practices review).
+
+### Changed
+- Stack detection in execute, quick, and debug commands updated to cover all 22 stack files with conditional detection per project dependencies.
+- PostgreSQL graph-in-SQL patterns (adjacency lists, junction tables, recursive CTEs) added to postgresql.md based on real project analysis.
+
 ## [2.46.11] - 2026-03-24
 
 ### Added
package/README.md CHANGED
@@ -3,6 +3,7 @@
 A methodology for reliable, parallelizable development using Claude Code with optional Agent Teams support.
 
 **Eliminates context rot** — task-level fresh dispatch (one subagent per task, ~10-20% context each) means compaction never triggers.
+**Compaction-proof debug loops** — `gsd-t headless --debug-loop` runs test-fix-retest cycles as separate `claude -p` sessions. A JSONL debug ledger persists all hypothesis/fix/learning history across fresh sessions. Anti-repetition preamble injection prevents retrying failed hypotheses. Escalation tiers (sonnet → opus → human) and a hard iteration ceiling are enforced externally.
 **Safe parallel execution** — worktree isolation gives each domain agent its own filesystem; sequential atomic merges prevent conflicts.
 **Maintains test coverage** — automatically keeps tests aligned with code changes.
 **Catches downstream effects** — analyzes impact before changes break things.
@@ -11,6 +12,7 @@ A methodology for reliable, parallelizable development using Claude Code with op
 **Generates visual scan reports** — every `/gsd-t-scan` produces a self-contained HTML report with 6 live architectural diagrams, a tech debt register, and domain health scores; optional DOCX/PDF export via `--export docx|pdf`.
 **Self-learning rule engine** — declarative rules in rules.jsonl detect failure patterns from task metrics. Candidate patches progress through a 5-stage lifecycle (candidate, applied, measured, promoted, graduated) with >55% improvement gates before becoming permanent methodology artifacts.
 **Cross-project learning** — proven rules propagate to `~/.claude/metrics/` and sync across all registered projects via `update-all`. Rules validated in 3+ projects become universal; 5+ projects qualify for npm distribution. Cross-project signal comparison and global ELO rankings available via `gsd-t-metrics --cross-project` and `gsd-t-status`.
+**Stack Rules Engine** — auto-detects the project tech stack (React, TypeScript, Node API, Python, Go, Rust) from manifest files and injects mandatory best-practice rules into subagent prompts at execute time. Universal security rules always apply; stack-specific rules layer on top. Extensible: drop a `.md` file in `templates/stacks/` to add a new stack.
 
 ---
 
@@ -83,8 +85,21 @@ npx @tekyzinc/gsd-t uninstall # Remove commands (keeps project files)
 gsd-t headless verify --json --timeout=1200 # Run verify non-interactively
 gsd-t headless query status # Get project state (no LLM, <100ms)
 gsd-t headless query domains # List domains (no LLM)
+
+# Headless debug-loop (compaction-proof automated test-fix-retest)
+gsd-t headless --debug-loop # Auto-detect test cmd, up to 20 iterations
+gsd-t headless --debug-loop --max-iterations=10 # Cap at 10 iterations
+gsd-t headless --debug-loop --test-cmd="npm test" # Override test command
+gsd-t headless --debug-loop --fix-scope="src/auth/**" # Limit fix scope
+gsd-t headless --debug-loop --json --log # Structured output + per-iteration logs
 ```
 
+Each iteration runs as a fresh `claude -p` session. A cumulative debug ledger (`.gsd-t/debug-state.jsonl`) preserves hypothesis/fix/learning history across sessions. An anti-repetition preamble prevents retrying failed approaches.
+
+**Escalation tiers**: sonnet (iterations 1–5) → opus (6–15) → STOP with diagnostic summary (16–20)
+
+**Exit codes**: `0` all tests pass · `1` max iterations reached · `2` compaction error · `3` process error · `4` needs human decision
+
 ### Updating
 
 When a new version is published:
@@ -321,7 +336,7 @@ get-stuff-done-teams/
 │ ├── branch.md # Git branch helper
 │ ├── checkin.md # Auto-version + commit/push helper
 │ └── Claude-md.md # Reload CLAUDE.md directives
-├── templates/ # Document templates
+├── templates/ # Document templates (9 base + stacks/)
 │ ├── CLAUDE-global.md
 │ ├── CLAUDE-project.md
 │ ├── requirements.md
@@ -330,7 +345,12 @@ get-stuff-done-teams/
 │ ├── infrastructure.md
 │ ├── progress.md
 │ ├── backlog.md
-└── backlog-settings.md
+├── backlog-settings.md
+│ └── stacks/ # Stack Rules Engine templates
+│ ├── _security.md # Universal — always injected
+│ ├── react.md
+│ ├── typescript.md
+│ └── node-api.md
 ├── scripts/ # Runtime utility scripts (installed to ~/.claude/scripts/)
 │ ├── gsd-t-tools.js # State CLI (get/set/validate/list)
 │ ├── gsd-t-statusline.js # Context usage bar
package/bin/debug-ledger.js ADDED
@@ -0,0 +1,193 @@
+#!/usr/bin/env node
+
+/**
+ * GSD-T Debug Ledger — Persistent debug iteration store
+ *
+ * Reads and writes debug iteration records to .gsd-t/debug-state.jsonl.
+ * Supports compaction detection and ledger lifecycle management.
+ *
+ * Zero external dependencies (Node.js built-ins only).
+ */
+
+const fs = require("fs");
+const path = require("path");
+
+// ── Constants ─────────────────────────────────────────────────────────────────
+
+const COMPACTION_THRESHOLD = 51200; // 50KB
+
+const REQUIRED_FIELDS = [
+  "iteration", "timestamp", "test", "error",
+  "hypothesis", "fix", "fixFiles", "result",
+  "learning", "model", "duration",
+];
+
+const VALID_RESULTS = new Set(["PASS", "STILL_FAILS"]);
+
+// ── Exports ───────────────────────────────────────────────────────────────────
+
+module.exports = {
+  readLedger, appendEntry, getLedgerStats, clearLedger,
+  compactLedger, generateAntiRepetitionPreamble,
+};
+
+// ── readLedger ────────────────────────────────────────────────────────────────
+
+/**
+ * Read all entries from the debug ledger.
+ * @param {string} projectDir - Root directory of the project
+ * @returns {object[]} Array of parsed ledger entry objects
+ */
+function readLedger(projectDir) {
+  const fp = ledgerPath(projectDir);
+  if (!fs.existsSync(fp)) return [];
+  const content = fs.readFileSync(fp, "utf8").trim();
+  if (!content) return [];
+  return content.split("\n").map(safeParse).filter(Boolean);
+}
+
+// ── appendEntry ───────────────────────────────────────────────────────────────
+
+/**
+ * Validate and append one debug iteration entry to the ledger.
+ * Creates the file and parent directories if they do not exist.
+ * @param {string} projectDir - Root directory of the project
+ * @param {object} entry - Debug iteration record (see Required Fields)
+ * @throws {Error} If required fields are missing or invalid
+ */
+function appendEntry(projectDir, entry) {
+  const err = validateEntry(entry);
+  if (err) throw new Error(err);
+  const fp = ledgerPath(projectDir);
+  ensureDir(path.dirname(fp));
+  fs.appendFileSync(fp, JSON.stringify(entry) + "\n");
+}
+
+// ── getLedgerStats ────────────────────────────────────────────────────────────
+
+/**
+ * Return summary statistics for the current ledger.
+ * @param {string} projectDir - Root directory of the project
+ * @returns {{ entryCount: number, sizeBytes: number, needsCompaction: boolean, failedHypotheses: string[], passCount: number, failCount: number }}
+ */
+function getLedgerStats(projectDir) {
+  const fp = ledgerPath(projectDir);
+  const entries = readLedger(projectDir);
+  const sizeBytes = fs.existsSync(fp) ? fs.statSync(fp).size : 0;
+  const failedHypotheses = entries
+    .filter((e) => e.result === "STILL_FAILS" && e.hypothesis)
+    .map((e) => e.hypothesis);
+  const passCount = entries.filter((e) => e.result === "PASS").length;
+  const failCount = entries.filter((e) => e.result === "STILL_FAILS").length;
+  return {
+    entryCount: entries.length,
+    sizeBytes,
+    needsCompaction: sizeBytes > COMPACTION_THRESHOLD,
+    failedHypotheses,
+    passCount,
+    failCount,
+  };
+}
+
+// ── clearLedger ───────────────────────────────────────────────────────────────
+
+/**
+ * Delete the debug ledger file. Called when all tests pass.
+ * No-op if the file does not exist.
+ * @param {string} projectDir - Root directory of the project
+ */
+function clearLedger(projectDir) {
+  const fp = ledgerPath(projectDir);
+  if (fs.existsSync(fp)) fs.unlinkSync(fp);
+}
+
+// ── compactLedger ─────────────────────────────────────────────────────────────
+
+/**
+ * Compact the ledger by replacing all but the last 5 entries with a summary.
+ * @param {string} projectDir - Root directory of the project
+ * @param {string} summary - Summarization of compacted entries
+ */
+function compactLedger(projectDir, summary) {
+  const entries = readLedger(projectDir);
+  const tail = entries.slice(-5);
+  const compactedEntry = {
+    compacted: true,
+    learning: summary,
+    iteration: 0,
+    timestamp: new Date().toISOString(),
+    test: "compacted",
+    error: "see summary",
+    hypothesis: "compacted",
+    fix: "compacted",
+    fixFiles: [],
+    result: "compacted",
+    model: "haiku",
+    duration: 0,
+  };
+  const fp = ledgerPath(projectDir);
+  ensureDir(path.dirname(fp));
+  const lines = [compactedEntry, ...tail].map((e) => JSON.stringify(e)).join("\n") + "\n";
+  fs.writeFileSync(fp, lines);
+}
+
+// ── generateAntiRepetitionPreamble ────────────────────────────────────────────
+
+/**
+ * Build a preamble string listing failed hypotheses and the current narrowing
+ * direction. Injected into each claude -p session to prevent repeated attempts.
+ * @param {string} projectDir - Root directory of the project
+ * @returns {string} Formatted preamble, or empty string if ledger is empty
+ */
+function generateAntiRepetitionPreamble(projectDir) {
+  const entries = readLedger(projectDir);
+  if (!entries.length) return "";
+  const failed = entries.filter((e) => e.result === "STILL_FAILS");
+  const learnings = entries.filter((e) => e.learning && !e.compacted);
+  const lastLearning = learnings.length ? learnings[learnings.length - 1].learning : null;
+  const failLines = failed
+    .map((e, i) => `${i + 1}. [iteration ${e.iteration}] "${e.hypothesis}" — FAILED: ${e.error}`)
+    .join("\n");
+  const stillFailing = failed.map((e) => `- ${e.test}: ${e.error}`).join("\n");
+  const direction = lastLearning
+    ? `Based on ${entries.length} iterations, the evidence points to: ${lastLearning}`
+    : "No narrowing direction established yet.";
+  return [
+    "## Debug Ledger Context (DO NOT retry failed approaches)",
+    "",
+    "### Failed Hypotheses (DO NOT retry these):",
+    failLines || "(none yet)",
+    "",
+    "### Current Narrowing Direction:",
+    direction,
+    "",
+    "### Tests Still Failing:",
+    stillFailing || "(none recorded)",
+  ].join("\n");
+}
+
+// ── Internal helpers ──────────────────────────────────────────────────────────
+
+function ledgerPath(projectDir) {
+  return path.join(projectDir || process.cwd(), ".gsd-t", "debug-state.jsonl");
+}
+
+function ensureDir(dir) {
+  if (!fs.existsSync(dir)) fs.mkdirSync(dir, { recursive: true });
+}
+
+function safeParse(line) {
+  try { return JSON.parse(line); } catch { return null; }
+}
+
+function validateEntry(entry) {
+  if (!entry || typeof entry !== "object") return "Entry must be an object";
+  for (const f of REQUIRED_FIELDS) {
+    if (entry[f] === undefined || entry[f] === null) return `Missing required field: ${f}`;
+  }
+  if (typeof entry.iteration !== "number") return "iteration must be a number";
+  if (typeof entry.duration !== "number") return "duration must be a number";
+  if (!Array.isArray(entry.fixFiles)) return "fixFiles must be an array";
+  if (!VALID_RESULTS.has(entry.result)) return `result must be "PASS" or "STILL_FAILS"`;
+  return null;
+}
package/bin/gsd-t.js CHANGED
@@ -19,6 +19,7 @@ const fs = require("fs");
 const path = require("path");
 const os = require("os");
 const { execFileSync, spawn: cpSpawn } = require("child_process");
+const debugLedger = require(path.join(__dirname, "debug-ledger.js"));
 
 // ─── Configuration ───────────────────────────────────────────────────────────
 
@@ -2174,6 +2175,236 @@ function doHeadlessQuery(type) {
   process.stdout.write(JSON.stringify(result) + "\n");
 }
 
+/**
+ * Parse debug-loop flags from args array.
+ * Extracts --max-iterations, --test-cmd, --fix-scope, --json, --log from args.
+ */
+function parseDebugLoopFlags(args) {
+  const flags = { maxIterations: 20, testCmd: null, fixScope: null, json: false, log: false };
+  const positional = [];
+  for (const arg of args) {
+    if (arg.startsWith("--max-iterations=")) {
+      const n = parseInt(arg.slice("--max-iterations=".length), 10);
+      if (!isNaN(n) && n > 0) flags.maxIterations = n;
+    } else if (arg.startsWith("--test-cmd=")) {
+      flags.testCmd = arg.slice("--test-cmd=".length);
+    } else if (arg.startsWith("--fix-scope=")) {
+      flags.fixScope = arg.slice("--fix-scope=".length);
+    } else if (arg === "--json") {
+      flags.json = true;
+    } else if (arg === "--log") {
+      flags.log = true;
+    } else {
+      positional.push(arg);
+    }
+  }
+  return { flags, positional };
+}
+
+/**
+ * Return the escalation model for a given iteration number.
+ * Tiers: 1-5 → sonnet, 6-15 → opus, 16+ → null (stop)
+ */
+function getEscalationModel(iteration) {
+  if (iteration >= 1 && iteration <= 5) return "sonnet";
+  if (iteration >= 6 && iteration <= 15) return "opus";
+  return null;
+}
+
+/**
+ * Spawn a single `claude -p` session and return stdout as a string.
+ * Returns null if the process fails.
+ */
+function spawnClaudeSession(prompt, model) {
+  try {
+    return execFileSync("claude", ["-p", prompt, "--model", model], {
+      encoding: "utf8", timeout: 300000,
+      stdio: ["pipe", "pipe", "pipe"],
+    });
+  } catch (e) {
+    return (e.stdout || "") + (e.stderr || "") || null;
+  }
+}
+
+/**
+ * Parse test pass/fail from claude output.
+ * Returns { passed: bool, summary: string }.
+ */
+function parseTestResult(output) {
+  const out = (output || "").toLowerCase();
+  const passed =
+    /\ball tests? pass(ed|ing)?\b/.test(out) ||
+    /\ball \d+ tests? pass/.test(out) ||
+    /\bno (test )?failures?\b/.test(out) ||
+    /\btests? (all )?pass(ed)?\b/.test(out);
+  const failed =
+    /\bfail(ed|ing|ure)?\b/.test(out) ||
+    /\berror\b/.test(out) ||
+    /\bnot ok\b/.test(out);
+  const summary = (output || "").slice(0, 500).replace(/\n/g, " ").trim();
+  return { passed: passed && !failed, summary };
+}
+
+/**
+ * Run ledger compaction: spawn haiku to summarize, then compact.
+ */
+function runLedgerCompaction(projectDir, jsonMode) {
+  const entries = debugLedger.readLedger(projectDir);
+  const compactPrompt =
+    "Read this debug ledger. Produce a condensed summary of what has been tried, " +
+    "what failed, and what the evidence suggests. Be concise.\n\n" +
+    JSON.stringify(entries, null, 2);
+  let summary = "Compacted — see previous entries.";
+  try {
+    const out = execFileSync("claude", ["-p", compactPrompt, "--model", "haiku"], {
+      encoding: "utf8", timeout: 120000, stdio: ["pipe", "pipe", "pipe"],
+    });
+    summary = (out || "").trim() || summary;
+  } catch (e) {
+    if (!jsonMode) warn("Compaction haiku session failed — using default summary");
+  }
+  debugLedger.compactLedger(projectDir, summary);
+}
+
+/**
+ * Write a per-iteration log file under .gsd-t/.
+ */
+function writeIterationLog(projectDir, ts, iteration, entry, rawOutput) {
+  const logDir = path.join(projectDir, ".gsd-t");
+  if (!fs.existsSync(logDir)) fs.mkdirSync(logDir, { recursive: true });
+  const fname = `headless-debug-${ts}-iter-${iteration}.log`;
+  const content = [
+    `Iteration: ${iteration}`,
+    `Timestamp: ${entry.timestamp}`,
+    `Model: ${entry.model}`,
+    `Result: ${entry.result}`,
+    `Fix: ${entry.fix}`,
+    `Learning: ${entry.learning}`,
+    `---`,
+    rawOutput || "",
+  ].join("\n");
+  fs.writeFileSync(path.join(logDir, fname), content);
+}
+
+/**
+ * Full debug-loop: validate flags, check claude CLI, run iteration cycle.
+ */
+function doHeadlessDebugLoop(flags) {
+  const opts = flags || {};
+  const jsonMode = opts.json || false;
+  const projectDir = process.cwd();
+
+  if (opts.maxIterations < 1) {
+    const msg = "--max-iterations must be >= 1";
+    if (jsonMode) process.stdout.write(JSON.stringify({ success: false, exitCode: 3, error: msg }) + "\n");
+    else error(msg);
+    process.exit(3);
+  }
+
+  try {
+    execFileSync("claude", ["--version"], { encoding: "utf8", timeout: 5000, stdio: ["pipe", "pipe", "pipe"] });
+  } catch {
+    const msg = "claude CLI not found. Install with: npm install -g @anthropic-ai/claude-code";
+    if (jsonMode) process.stdout.write(JSON.stringify({ success: false, exitCode: 3, error: msg }) + "\n");
+    else error(msg);
+    process.exit(3);
+  }
+
+  if (!jsonMode) {
+    heading("GSD-T Headless — Debug Loop");
+    info(`Max iterations: ${opts.maxIterations}`);
+    if (opts.testCmd) info(`Test command: ${opts.testCmd}`);
+    if (opts.fixScope) info(`Fix scope: ${opts.fixScope}`);
+    if (opts.log) info(`Logging: enabled`);
+    log("");
+  }
+
+  const ts = Date.now();
+
+  for (let iteration = 1; iteration <= opts.maxIterations; iteration++) {
+    const model = getEscalationModel(iteration);
+
+    // STOP tier: escalation stop
+    if (model === null) {
+      const entries = debugLedger.readLedger(projectDir);
+      const stats = debugLedger.getLedgerStats(projectDir);
+      const diagMsg = `ESCALATION STOP at iteration ${iteration}. ` +
+        `Entries: ${stats.entryCount}, Failures: ${stats.failCount}. ` +
+        `Failed hypotheses:\n${stats.failedHypotheses.map((h, i) => ` ${i + 1}. ${h}`).join("\n")}`;
+      if (jsonMode) {
+        process.stdout.write(JSON.stringify({ success: false, exitCode: 4, iteration, diagnostic: diagMsg, entries }) + "\n");
+      } else {
+        log("");
+        warn(diagMsg);
+      }
+      process.exit(4);
+    }
+
+    // Check compaction
+    const stats = debugLedger.getLedgerStats(projectDir);
+    if (stats.needsCompaction) {
+      if (!jsonMode) info("Ledger compaction triggered...");
+      try { runLedgerCompaction(projectDir, jsonMode); }
+      catch { process.exit(2); }
+    }
+
+    // Generate preamble and build prompt
+    const preamble = debugLedger.generateAntiRepetitionPreamble(projectDir);
+    const scopeHint = opts.fixScope ? `\nFix scope: ${opts.fixScope}` : "";
+    const testHint = opts.testCmd ? `\nRun tests with: ${opts.testCmd}` : "";
+    const prompt = [preamble, `Fix the failing test(s). Write your fix, then run the test suite. Report results.${scopeHint}${testHint}`]
+      .filter(Boolean).join("\n\n");
+
+    if (!jsonMode) info(`Iteration ${iteration}/${opts.maxIterations} [${model}]...`);
+
+    const iterStart = Date.now();
+    let rawOutput = null;
+    try { rawOutput = spawnClaudeSession(prompt, model); }
+    catch (e) {
+      if (jsonMode) process.stdout.write(JSON.stringify({ success: false, exitCode: 3, iteration, error: String(e) }) + "\n");
+      else error(`Process error at iteration ${iteration}: ${e.message}`);
+      process.exit(3);
+    }
+    const duration = Math.round((Date.now() - iterStart) / 1000);
+
+    const { passed, summary } = parseTestResult(rawOutput);
+    const result = passed ? "PASS" : "STILL_FAILS";
+
+    // Extract fix description from output (first substantial line, capped at 200 chars)
+    const fixDesc = (rawOutput || "").split("\n").find((l) => l.trim().length > 20) || "see output";
+    const entry = {
+      iteration, timestamp: new Date().toISOString(),
+      test: opts.testCmd || "unspecified", error: passed ? "" : summary,
+      hypothesis: `iteration-${iteration}`, fix: fixDesc.trim().slice(0, 200),
+      fixFiles: [], result, learning: summary.slice(0, 300),
+      model, duration,
+    };
+
+    try { debugLedger.appendEntry(projectDir, entry); }
+    catch (e) {
+      if (!jsonMode) warn(`Failed to append ledger entry: ${e.message}`);
+    }
+
+    if (opts.log) writeIterationLog(projectDir, ts, iteration, entry, rawOutput);
+
+    if (jsonMode) {
+      process.stdout.write(JSON.stringify({ success: passed, exitCode: passed ? 0 : 1, iteration, result, model, duration, summary }) + "\n");
+    } else {
+      info(` Result: ${result}`);
+    }
+
+    if (passed) {
+      debugLedger.clearLedger(projectDir);
+      if (!jsonMode) log(`\n${GREEN}All tests pass — debug loop complete.${RESET}`);
+      process.exit(0);
+    }
+  }
+
+  // Max iterations reached
+  if (!jsonMode) warn(`Max iterations (${opts.maxIterations}) reached without all tests passing.`);
+  process.exit(1);
+}
+
 function doHeadless(args) {
   const sub = args[0];
   if (!sub || sub === "--help" || sub === "-h") {
@@ -2181,6 +2412,12 @@ function doHeadless(args) {
     return;
   }
 
+  if (sub === "--debug-loop") {
+    const { flags } = parseDebugLoopFlags(args.slice(1));
+    doHeadlessDebugLoop(flags);
+    return;
+  }
+
  if (sub === "query") {
    const type = args[1];
    doHeadlessQuery(type);
@@ -2196,7 +2433,24 @@ function showHeadlessHelp() {
   log(`\n${BOLD}GSD-T Headless Mode${RESET}\n`);
   log(`${BOLD}Usage:${RESET}`);
   log(` ${CYAN}gsd-t headless${RESET} <command> [args] [--json] [--timeout=N] [--log]`);
-  log(` ${CYAN}gsd-t headless query${RESET} <type>\n`);
+  log(` ${CYAN}gsd-t headless query${RESET} <type>`);
+  log(` ${CYAN}gsd-t headless --debug-loop${RESET} [--max-iterations=N] [--test-cmd=CMD] [--fix-scope=SCOPE] [--json] [--log]\n`);
+  log(`${BOLD}Debug-loop flags:${RESET}`);
+  log(` ${CYAN}--max-iterations=N${RESET} Hard ceiling on iterations (default: 20)`);
+  log(` ${CYAN}--test-cmd=CMD${RESET} Override test command`);
+  log(` ${CYAN}--fix-scope=SCOPE${RESET} Limit fix scope to specific files or test patterns`);
+  log(` ${CYAN}--json${RESET} Structured JSON output per iteration`);
+  log(` ${CYAN}--log${RESET} Write per-iteration logs to .gsd-t/\n`);
+  log(`${BOLD}Debug-loop escalation tiers:${RESET}`);
+  log(` Iterations 1-5: sonnet (standard debug)`);
+  log(` Iterations 6-15: opus (deeper reasoning)`);
+  log(` Iterations 16-20: STOP (exit code 4 — needs human)\n`);
+  log(`${BOLD}Debug-loop exit codes:${RESET}`);
+  log(` 0 all tests pass`);
+  log(` 1 max iterations reached`);
+  log(` 2 ledger compaction error`);
+  log(` 3 process error`);
+  log(` 4 escalation stop — needs human\n`);
   log(`${BOLD}Exec flags:${RESET}`);
   log(` ${CYAN}--json${RESET} Structured JSON output`);
   log(` ${CYAN}--timeout=N${RESET} Kill after N seconds (default: 300)`);
@@ -2304,6 +2558,10 @@ module.exports = {
   doHeadlessExec,
   doHeadlessQuery,
   doHeadless,
+  // Headless debug-loop
+  parseDebugLoopFlags,
+  getEscalationModel,
+  doHeadlessDebugLoop,
   queryStatus,
   queryDomains,
   queryContracts,
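The pass/fail detection in `parseTestResult` is a regex heuristic over session output: a run counts as passing only when a pass phrase appears and no failure phrase appears. A condensed mirror of that heuristic (a sketch using a subset of the package's regexes, not the full set):

```javascript
// Condensed mirror of the parseTestResult() heuristic: a run counts as passing
// only when a pass phrase appears AND no failure/error phrase appears.
function looksLikePass(output) {
  const out = (output || "").toLowerCase();
  const passed =
    /\ball tests? pass(ed|ing)?\b/.test(out) ||
    /\bno (test )?failures?\b/.test(out);
  const failed = /\bfail(ed|ing|ure)?\b/.test(out) || /\berror\b/.test(out);
  return passed && !failed;
}

console.log(looksLikePass("All tests passed (12/12)")); // true
console.log(looksLikePass("All tests passed, 1 error in teardown")); // false
```

Note the conservative bias: any mention of "error" anywhere in the output vetoes a pass verdict, so ambiguous output is recorded as STILL_FAILS rather than falsely clearing the loop.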
package/commands/gsd-t-debug.md CHANGED
@@ -8,6 +8,22 @@ To give this debug session a fresh context window and prevent compaction, always
 
 **If you are the orchestrating agent** (you received the slash command directly):
 
+**Stack Rules Detection (before spawning subagent):**
+Run via Bash to detect project stack and collect matching rules:
+`GSD_T_DIR=$(npm root -g 2>/dev/null)/@tekyzinc/gsd-t; STACKS_DIR="$GSD_T_DIR/templates/stacks"; STACK_RULES=""; if [ -d "$STACKS_DIR" ]; then for f in "$STACKS_DIR"/_*.md; do [ -f "$f" ] && STACK_RULES="${STACK_RULES}$(cat "$f")"$'\n\n'; done; if [ -f "package.json" ]; then grep -q '"react-native"' package.json 2>/dev/null && [ -f "$STACKS_DIR/react-native.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/react-native.md")"$'\n\n'; grep -q '"react"' package.json 2>/dev/null && ! grep -q '"react-native"' package.json 2>/dev/null && [ -f "$STACKS_DIR/react.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/react.md")"$'\n\n'; grep -q '"next"' package.json 2>/dev/null && [ -f "$STACKS_DIR/nextjs.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/nextjs.md")"$'\n\n'; grep -q '"vue"' package.json 2>/dev/null && [ -f "$STACKS_DIR/vue.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/vue.md")"$'\n\n'; (grep -q '"typescript"' package.json 2>/dev/null || [ -f "tsconfig.json" ]) && [ -f "$STACKS_DIR/typescript.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/typescript.md")"$'\n\n'; grep -qE '"(express|fastify|hono|koa)"' package.json 2>/dev/null && [ -f "$STACKS_DIR/node-api.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/node-api.md")"$'\n\n'; grep -q '"tailwindcss"' package.json 2>/dev/null && [ -f "$STACKS_DIR/tailwind.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/tailwind.md")"$'\n\n'; grep -q '"vite"' package.json 2>/dev/null && [ -f "$STACKS_DIR/vite.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/vite.md")"$'\n\n'; grep -q '"@supabase/supabase-js"' package.json 2>/dev/null && [ -f "$STACKS_DIR/supabase.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/supabase.md")"$'\n\n'; grep -q '"firebase"' package.json 2>/dev/null && [ -f "$STACKS_DIR/firebase.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/firebase.md")"$'\n\n'; grep -qE '"(graphql|@apollo/client|urql)"' package.json 2>/dev/null && [ -f "$STACKS_DIR/graphql.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/graphql.md")"$'\n\n'; grep -q '"zustand"' package.json 2>/dev/null && [ -f "$STACKS_DIR/zustand.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/zustand.md")"$'\n\n'; grep -q '"@reduxjs/toolkit"' package.json 2>/dev/null && [ -f "$STACKS_DIR/redux.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/redux.md")"$'\n\n'; grep -q '"neo4j-driver"' package.json 2>/dev/null && [ -f "$STACKS_DIR/neo4j.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/neo4j.md")"$'\n\n'; grep -qE '"(pg|prisma|drizzle-orm|knex)"' package.json 2>/dev/null && [ -f "$STACKS_DIR/postgresql.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/postgresql.md")"$'\n\n'; grep -qE '"(express|fastify|hono|koa)"' package.json 2>/dev/null && [ -f "$STACKS_DIR/rest-api.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/rest-api.md")"$'\n\n'; fi; ([ -f "requirements.txt" ] || [ -f "pyproject.toml" ] || [ -f "Pipfile" ]) && [ -f "$STACKS_DIR/python.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/python.md")"$'\n\n'; ([ -f "requirements.txt" ] && grep -q "psycopg" requirements.txt 2>/dev/null || [ -f "pyproject.toml" ] && grep -q "psycopg" pyproject.toml 2>/dev/null) && [ -f "$STACKS_DIR/postgresql.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/postgresql.md")"$'\n\n'; ([ -f "requirements.txt" ] && grep -q "neo4j" requirements.txt 2>/dev/null) && [ -f "$STACKS_DIR/neo4j.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/neo4j.md")"$'\n\n'; [ -f "pubspec.yaml" ] && [ -f "$STACKS_DIR/flutter.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/flutter.md")"$'\n\n'; [ -f "Dockerfile" ] && [ -f "$STACKS_DIR/docker.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/docker.md")"$'\n\n'; [ -d ".github/workflows" ] && [ -f "$STACKS_DIR/github-actions.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/github-actions.md")"$'\n\n'; ([ -f "playwright.config.ts" ] || [ -f "playwright.config.js" ]) && [ -f "$STACKS_DIR/playwright.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/playwright.md")"$'\n\n'; [ -f "go.mod" ] && [ -f "$STACKS_DIR/go.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/go.md")"$'\n\n'; [ -f "Cargo.toml" ] && [ -f "$STACKS_DIR/rust.md" ] && STACK_RULES="${STACK_RULES}$(cat "$STACKS_DIR/rust.md")"$'\n\n'; fi`
+
+If STACK_RULES is non-empty, append to the subagent prompt:
+```
+## Stack Rules (MANDATORY — violations fail this task)
+
+{STACK_RULES}
+
+These standards have the same enforcement weight as contract compliance.
+Violations are task failures, not warnings.
+```
+
+If STACK_RULES is empty (no templates/stacks/ dir or no matches), skip silently.
+
 **OBSERVABILITY LOGGING (MANDATORY):**
 Before spawning — run via Bash:
 `T_START=$(date +%s) && DT_START=$(date +"%Y-%m-%d %H:%M") && TOK_START=${CLAUDE_CONTEXT_TOKENS_USED:-0} && TOK_MAX=${CLAUDE_CONTEXT_TOKENS_MAX:-200000}`
@@ -215,7 +231,16 @@ When you encounter unexpected situations during the fix:
 3. **Blocker (missing file, wrong API response)** → Fix blocker and continue. Log if non-trivial.
 4. **Architectural change required to fix correctly** → STOP. Explain what exists, what needs to change, what breaks, and a migration path. Wait for user approval. Never self-approve.
 
-**3-attempt limit**: If your fix doesn't work after 3 attempts within this session, treat it as a loop. Do NOT keep trying the same approach. Log the attempt to `.gsd-t/progress.md` Decision Log with a `[failure]` prefix, then return to Step 1.5 and run Deep Research Mode before any further attempts. Present findings and options to the user before proceeding.
+**3-attempt limit**: If your fix doesn't work after 3 attempts within this session, treat it as a loop. Do NOT keep trying the same approach. Before entering Deep Research Mode, first try the headless debug-loop:
+1. Write current failure context to `.gsd-t/debug-state.jsonl` via appendEntry
+2. Log: "Delegating to headless debug-loop (3 in-context attempts exhausted)"
+3. Run: `gsd-t headless --debug-loop --max-iterations=10`
+4. Check exit code:
+   - 0: Tests pass, continue
+   - 1/4: Log to `.gsd-t/deferred-items.md`, then enter Deep Research Mode
+   - 3: Report error, stop
+
+If the debug-loop also fails (exit 1/4), log the attempt to `.gsd-t/progress.md` Decision Log with a `[failure]` prefix, return to Step 1.5 and run Deep Research Mode before any further attempts. Present findings and options to the user before proceeding.
 
 ### Solo Mode
 1. Reproduce the issue — **reproduction script must exist before step 2** (see Step 2.5)
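The stack-detection one-liner above boils down to one idea: map manifest evidence (package.json dependencies, marker files like `tsconfig.json` or `Dockerfile`) to rule files under `templates/stacks/`, with `_security.md` always included. A hypothetical Node sketch of that mapping (`detectStacks` and its truncated dependency table are illustrative, not the package's API):

```javascript
// Hypothetical re-implementation of the detection idea: manifest evidence in,
// stack rule filenames out. Only a few of the 22 stacks are shown here.
function detectStacks(pkg, extraFiles = []) {
  const deps = { ...(pkg.dependencies || {}), ...(pkg.devDependencies || {}) };
  const stacks = ["_security.md"]; // universal security rules always apply
  if (deps["react-native"]) stacks.push("react-native.md");
  else if (deps["react"]) stacks.push("react.md"); // react-native wins over react
  if (deps["next"]) stacks.push("nextjs.md");
  if (deps["typescript"] || extraFiles.includes("tsconfig.json")) stacks.push("typescript.md");
  if (["express", "fastify", "hono", "koa"].some((d) => deps[d])) stacks.push("node-api.md");
  if (extraFiles.includes("Dockerfile")) stacks.push("docker.md");
  return stacks;
}

console.log(detectStacks({ dependencies: { react: "^18.0.0", express: "^4.18.0" } }, ["tsconfig.json"]));
// [ '_security.md', 'react.md', 'typescript.md', 'node-api.md' ]
```

The shell version performs the same layering: universal `_*.md` files first, then one guarded `cat` per detected stack, concatenated into STACK_RULES.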