npm - @tekyzinc/gsd-t - Versions diffs - 2.70.15 → 2.71.10 - Mend

@tekyzinc/gsd-t 2.70.15 → 2.71.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (12) hide show

package/README.md +5 -3
package/commands/gsd-t-design-build.md +314 -0
package/commands/gsd-t-design-decompose.md +27 -0
package/commands/gsd-t-design-review.md +155 -0
package/commands/gsd-t-execute.md +51 -6
package/commands/gsd-t-help.md +2 -0
package/commands/gsd-t-quick.md +10 -2
package/package.json +2 -2
package/scripts/gsd-t-design-review-inject.js +941 -0
package/scripts/gsd-t-design-review-server.js +388 -0
package/scripts/gsd-t-design-review.html +1677 -0
package/templates/CLAUDE-global.md +2 -0

package/README.md CHANGED Viewed

@@ -30,7 +30,7 @@ A methodology for reliable, parallelizable development using Claude Code with op
 npx @tekyzinc/gsd-t install
 ```
-This installs 48 GSD-T commands + 5 utility commands (53 total) to `~/.claude/commands/` and the global CLAUDE.md to `~/.claude/CLAUDE.md`. Works on Windows, Mac, and Linux.
+This installs 51 GSD-T commands + 5 utility commands (56 total) to `~/.claude/commands/` and the global CLAUDE.md to `~/.claude/CLAUDE.md`. Works on Windows, Mac, and Linux.
 ### Start Using It
@@ -189,6 +189,8 @@ This will replace changed command files, back up your CLAUDE.md if customized, a
 | `/user:gsd-t-triage-and-merge` | Auto-review, merge, and publish GitHub branches | Manual |
 | `/user:gsd-t-audit` | Harness self-audit — analyze cost/benefit of enforcement components | Manual |
 | `/user:gsd-t-design-audit` | Compare built screen against Figma design — structured deviation report | Manual |
+| `/user:gsd-t-design-build` | Build from design contracts with two-terminal review (Term 1 builder) | Manual |
+| `/user:gsd-t-design-review` | Independent review agent for design build (Term 2 reviewer) | Auto |
 ### Backlog Management
@@ -340,8 +342,8 @@ get-stuff-done-teams/
 ├── LICENSE
 ├── bin/
 │   └── gsd-t.js                       # CLI installer
-├── commands/                          # 53 slash commands
-│   ├── gsd-t-*.md                     # 45 GSD-T workflow commands
+├── commands/                          # 56 slash commands
+│   ├── gsd-t-*.md                     # 50 GSD-T workflow commands
 │   ├── gsd.md                         # GSD-T smart router
 │   ├── branch.md                      # Git branch helper
 │   ├── checkin.md                     # Auto-version + commit/push helper

package/commands/gsd-t-design-build.md ADDED Viewed

@@ -0,0 +1,314 @@
+# GSD-T: Design Build — Build from Design Contracts with Two-Terminal Review
+You are the design builder (Term 1). You build UI components from design contracts, measure them against specs, and coordinate with an independent review session (Term 2) for unbiased verification.
+**Architecture**: Term 1 builds → file system → Term 2 reviews → file system → Term 1 reads feedback. Zero shared context between sessions.
+## Step 0: Validate Prerequisites
+1. Read `.gsd-t/contracts/design/INDEX.md` — if missing, STOP: "Run `/user:gsd-t-design-decompose` first to create design contracts."
+2. Read `.gsd-t/progress.md` for current state
+3. Verify `scripts/gsd-t-design-review-server.js` exists (GSD-T package)
+   - Get the GSD-T install path: `npm root -g` → `{global_root}/@tekyzinc/gsd-t/scripts/`
+   - If not found: "Update GSD-T: `/user:gsd-t-version-update`"
+## Step 1: Start Infrastructure
+### 1a. Dev Server
+Check if a dev server is running:
+```bash
+lsof -i :5173 2>/dev/null | head -2
+```
+If not running, detect and start:
+```bash
+# Check package.json for dev command
+npm run dev &
+# Wait for server to be ready
+for i in $(seq 1 30); do curl -s http://localhost:5173 > /dev/null 2>&1 && break; sleep 1; done
+```
+Record the dev server port as `$DEV_PORT`.
+### 1b. Review Server
+Find the review server script:
+```bash
+GSD_ROOT=$(npm root -g)/@tekyzinc/gsd-t/scripts
+```
+Start the review server as a background process:
+```bash
+node $GSD_ROOT/gsd-t-design-review-server.js \
+  --target http://localhost:$DEV_PORT \
+  --project $PWD \
+  --port 3456 &
+REVIEW_PID=$!
+```
+Verify it's running:
+```bash
+curl -s http://localhost:3456/review/api/status
+```
+### 1c. Launch Term 2 (Independent Review Session)
+Write the review prompt to disk (Term 2 reads this, not Term 1's context):
+```bash
+cat > .gsd-t/design-review/review-prompt.md << 'PROMPT'
+You are the design review agent (Term 2). You are an INDEPENDENT reviewer — you have NO knowledge of how components were built. Your job is to compare built output against design contracts.
+## Your Loop
+1. Poll `.gsd-t/design-review/queue/` every 5 seconds for new JSON files
+2. For each queue item:
+   a. Read the queue JSON — it has component name, selector, measurements, source path
+   b. Read the matching design contract from `.gsd-t/contracts/design/`
+   c. Open the app at http://localhost:3456/ (proxied through review server)
+   d. Use Playwright to navigate to the component and evaluate it against the contract
+   e. AI Review — check what measurements can't catch:
+      - Does the visual hierarchy feel right?
+      - Are proportions correct even if individual measurements pass?
+      - Does the component look like the contract describes, holistically?
+   f. If CRITICAL issues found (wrong chart type, missing elements, wrong data model):
+      - Write rejection to `.gsd-t/design-review/rejected/{id}.json`:
+        `{ "id": "...", "reason": "...", "severity": "critical", "timestamp": "..." }`
+      - The review server will notify Term 1 automatically
+   g. If no critical issues → the component passes to human review (review UI handles this)
+3. When `.gsd-t/design-review/shutdown.json` appears → exit cleanly
+## Rules
+- You write ZERO code. You ONLY review.
+- You have NO context about how anything was built — judge purely by contract vs. output.
+- Be harsh. Your value is in catching what the builder missed.
+- Check `.gsd-t/contracts/design/elements/`, `widgets/`, and `pages/` for specs.
+PROMPT
+```
+Launch a new terminal with an independent Claude session:
+```bash
+# macOS
+osascript -e "tell application \"Terminal\" to do script \"cd $(pwd) && claude --print 'Read .gsd-t/design-review/review-prompt.md and execute the instructions. This is your only directive.'\""
+```
+```bash
+# Linux (fallback)
+gnome-terminal -- bash -c "cd $(pwd) && claude --print 'Read .gsd-t/design-review/review-prompt.md and execute the instructions. This is your only directive.'; exec bash"
+```
+Display to user:
+```
+Design Build — Infrastructure Ready
+  ✓ Dev server:    http://localhost:$DEV_PORT
+  ✓ Review server: http://localhost:3456
+  ✓ Review UI:     http://localhost:3456/review
+  ✓ Term 2:        Independent review session launched
+```
+Auto-open the review UI:
+```bash
+open http://localhost:3456/review 2>/dev/null || xdg-open http://localhost:3456/review 2>/dev/null
+```
+## Step 2: Build Phase — Elements
+Read all element contracts from `.gsd-t/contracts/design/elements/`. Sort by any `order` field or alphabetically.
+For each element contract:
+### 2a. Build the Element
+Read the element contract. Build or update the component at the specified source path. Follow the contract exactly:
+- Chart type, dimensions, colors, spacing, typography
+- Props interface matching the contract's data model
+- Import patterns (use existing design system components where specified)
+### 2b. Render-Measure-Compare
+After building, use Playwright to measure the rendered component against the contract specs:
+```javascript
+// In Playwright page.evaluate():
+const el = document.querySelector(selector);
+const s = getComputedStyle(el);
+const rect = el.getBoundingClientRect();
+// Measure each spec property from the contract
+// Compare against expected values
+// Build measurements array: [{ property, expected, actual, pass }]
+```
+### 2c. Queue for Review
+Write the queue JSON to `.gsd-t/design-review/queue/{element-id}.json`:
+```json
+{
+  "id": "element-donut-chart",
+  "name": "DonutChart",
+  "type": "element",
+  "order": 1,
+  "selector": "svg[viewBox='0 0 200 200']",
+  "sourcePath": "src/components/elements/DonutChart.vue",
+  "route": "/",
+  "measurements": [
+    { "property": "chart type", "expected": "donut", "actual": "donut", "pass": true },
+    ...
+  ]
+}
+```
+### 2d. Check for Auto-Rejections
+After queuing, check for immediate rejection from Term 2:
+```bash
+# Brief wait for Term 2 to process
+sleep 3
+ls .gsd-t/design-review/rejected/{element-id}.json 2>/dev/null
+```
+If rejected:
+- Read the rejection reason
+- Fix the element based on the rejection feedback
+- Re-measure and re-queue (max 2 auto-rejection cycles per element)
+### 2e. Continue Building
+Continue building remaining elements. Don't wait for human review — queue them all.
+## Step 3: Wait for Human Review
+After all elements are built and queued, enter the poll loop:
+```
+Waiting for human review...
+  Review UI: http://localhost:3456/review
+  {N} elements queued, awaiting submission
+```
+Poll for the review-complete signal:
+```bash
+while [ ! -f .gsd-t/design-review/review-complete.json ]; do
+  sleep 5
+done
+```
+## Step 4: Process Feedback
+Read `.gsd-t/design-review/review-complete.json` and each feedback file in `.gsd-t/design-review/feedback/`.
+For each element's feedback:
+### No changes, no comment → Approved
+- Mark element as complete
+- No action needed
+### Property changes only → Apply and verify
+- Read the change list: `[{ property, oldValue, newValue, path }]`
+- Map CSS property changes back to source code:
+  - Tailwind: `padding: 16px` → find and update the Tailwind class (e.g., `p-6` → `p-4`)
+  - Inline styles: update the style binding directly
+  - CSS modules: update the CSS rule
+- Re-run Playwright measurement to verify the change took effect
+- If verification passes → mark complete
+- If verification fails → re-queue for review with updated measurements
+### Comment only → Interpret and fix
+- Read the comment — it describes a change to make
+- Implement the change described in the comment
+- Re-measure the element
+- Re-queue for review
+### Changes + comment → Apply changes, use comment as context
+- Apply the property changes first
+- Read the comment for additional context or fixes beyond the property changes
+- Implement any additional changes
+- Re-measure and re-queue
+After processing all feedback:
+- Clear the queue: `rm .gsd-t/design-review/queue/*.json`
+- Delete the signal: `rm .gsd-t/design-review/review-complete.json`
+- Clear feedback: `rm .gsd-t/design-review/feedback/*.json`
+If any elements were re-queued → return to Step 3 (max 3 review cycles total).
+## Step 5: Widget Assembly Phase
+After all elements are approved:
+1. Read widget contracts from `.gsd-t/contracts/design/widgets/`
+2. For each widget:
+   a. Build the widget — MUST import approved element components, not rebuild them inline
+   b. Verify imports: grep the widget file for element imports
+   c. Playwright measure the assembled widget
+   d. Queue for review (same flow as elements)
+3. Wait for human review (Step 3)
+4. Process feedback (Step 4)
+## Step 6: Page Composition Phase
+After all widgets are approved:
+1. Read page contracts from `.gsd-t/contracts/design/pages/`
+2. For each page:
+   a. Build the page — MUST import approved widget components
+   b. Verify grid layout, section ordering, responsive breakpoints
+   c. Playwright measure the full page
+   d. Queue for review
+3. Wait for human review
+4. Process feedback
+## Step 7: Cleanup and Report
+```bash
+# Signal Term 2 to shut down
+echo '{"shutdown": true}' > .gsd-t/design-review/shutdown.json
+sleep 2
+# Kill review server
+kill $REVIEW_PID 2>/dev/null
+# Kill dev server if we started it
+# (only if we started it in Step 1a)
+```
+Report:
+```
+Design Build Complete
+  Elements: {N} built, {N} approved
+  Widgets:  {N} built, {N} approved
+  Pages:    {N} built, {N} approved
+  Review cycles: {N}
+  Changes applied from review: {N}
+  Comments addressed: {N}
+```
+Update `.gsd-t/progress.md` with completion status.
+## Coordination Directory Structure
+```
+.gsd-t/design-review/
+├── queue/                    # Term 1 writes, Term 2 + human reads
+│   ├── element-donut.json
+│   └── element-bar.json
+├── feedback/                 # Human review writes, Term 1 reads
+│   ├── element-donut.json
+│   └── element-bar.json
+├── rejected/                 # Term 2 auto-rejects, Term 1 reads
+│   └── element-bar.json
+├── review-complete.json      # Human submit signal → Term 1 polls
+├── review-prompt.md          # Term 2's instructions (no builder context)
+├── shutdown.json             # Term 1 signals Term 2 to exit
+└── status.json               # Review server state
+```
+## Rules
+- NEVER share builder context with Term 2 — coordination is files only
+- NEVER skip the review cycle — every component goes through Term 2 + human
+- NEVER proceed to widgets before all elements are approved
+- NEVER rebuild element functionality inline in widgets — always import
+- Max 3 review cycles per phase — if still failing, stop and present to user
+- Auto-open the browser review UI so the user doesn't have to find it
+- Kill all background processes on completion or error

package/commands/gsd-t-design-decompose.md CHANGED Viewed

@@ -252,11 +252,38 @@ For each widget contract:
 1. Copy `templates/widget-contract.md` as scaffold
 2. Reference elements by name in the "Elements Used" table
 3. Define layout, data binding, responsive behavior, widget-level verification
+4. **Extract layout CSS from `get_design_context` output (MANDATORY)**:
+   The Figma MCP returns code with explicit CSS layout properties. Parse these into the
+   widget contract's "Internal Element Layout" section:
+   - `body_layout`: Look at the parent container's CSS in the Figma output.
+     `flex flex-row` or `flex gap-[16px] items-center` → `flex-row`.
+     `flex flex-col` or `flex-col gap-[16px]` → `flex-column`.
+     `grid grid-cols-2` → `grid 2-col`. Write EXACTLY what the Figma shows.
+   - `body_gap`: Extract the gap value from the Figma CSS (e.g., `gap-[16px]` → `16px`)
+   - Legend position: If legend is a SIBLING of the chart in a `flex-row` container →
+     legend is BESIDE the chart (`body_sidebar`). If legend is BELOW the chart in a
+     `flex-col` container → legend is in `footer_legend`. This distinction is CRITICAL —
+     it's the difference between a side-by-side layout and a stacked layout.
+   - `container_height`: If the Figma shows `h-[334px]` → fixed height `334px`.
+     If no explicit height → `auto`.
 For each page contract:
 1. Copy `templates/page-contract.md` as scaffold
 2. Reference widgets in grid positions
 3. Define route, data loading, global states, performance budget
+4. **Extract grid structure from `get_design_context` output (MANDATORY)**:
+   The Figma MCP returns the page's layout as nested containers. Parse the structure:
+   - Count "Row" or `flex-row` containers and their children to determine grid dimensions
+     (e.g., 2 Row containers with 2 cards each → `grid 2×2`, NOT `grid 1×4`)
+   - Extract `gap` values between rows and between cards within rows
+   - Extract explicit heights on cards (e.g., `h-[334px]`)
+   - Document in the page contract's "Widgets Used" table:
+     ```
+     | grid[row=1, cols=1-2] | most-popular-tools + number-of-tools | 2 per row |
+     | grid[row=2, cols=1-2] | time-on-page + number-of-visits      | 2 per row |
+     ```
+   - **Anti-pattern**: Seeing 4 sibling cards and writing `grid-cols-4` when the Figma
+     groups them into 2 rows of 2. ALWAYS check the parent container structure.
 Write `INDEX.md` as a navigation map:

package/commands/gsd-t-design-review.md ADDED Viewed

@@ -0,0 +1,155 @@
+# GSD-T: Design Review — Independent Review Agent (Term 2)
+You are the design review agent running in an independent terminal session. You have ZERO knowledge of how components were built. Your only job is to compare built output against design contracts and flag deviations.
+**You are NOT the builder. You did NOT write any of this code. You are a fresh pair of eyes.**
+## Step 1: Load Contracts
+Read the design contracts — these are your source of truth:
+1. `.gsd-t/contracts/design/INDEX.md` — hierarchy overview
+2. `.gsd-t/contracts/design/elements/` — element specs (chart types, dimensions, colors, props)
+3. `.gsd-t/contracts/design/widgets/` — widget assembly specs (which elements, layout, spacing)
+4. `.gsd-t/contracts/design/pages/` — page composition specs (which widgets, grid, sections)
+Build a mental model of what SHOULD exist. You will compare this against what DOES exist.
+## Step 2: Monitor Queue
+Enter the poll loop. Check for new queue items every 5 seconds:
+```bash
+while [ ! -f .gsd-t/design-review/shutdown.json ]; do
+  NEW_FILES=$(ls .gsd-t/design-review/queue/*.json 2>/dev/null)
+  if [ -n "$NEW_FILES" ]; then
+    echo "Found $(echo "$NEW_FILES" | wc -l | tr -d ' ') items to review"
+    # Process each — see Step 3
+  fi
+  sleep 5
+done
+```
+When `shutdown.json` appears, print "Review session complete" and exit.
+## Step 3: AI Review Each Item
+For each queue JSON file:
+### 3a. Read the Queue Item
+```json
+{
+  "id": "element-donut-chart",
+  "name": "DonutChart",
+  "type": "element",
+  "selector": "...",
+  "sourcePath": "src/components/elements/DonutChart.vue",
+  "measurements": [...]
+}
+```
+### 3b. Load the Matching Contract
+Based on the item type and id, read the corresponding contract:
+- `element-*` → `.gsd-t/contracts/design/elements/{id}.md`
+- `widget-*` → `.gsd-t/contracts/design/widgets/{id}.md`
+- `page-*` → `.gsd-t/contracts/design/pages/{id}.md`
+### 3c. Check Measurements
+Review the `measurements` array. Each has `{ property, expected, actual, pass }`.
+**Critical failures (auto-reject immediately):**
+- Wrong chart type (contract says bar, built a donut)
+- Wrong element count (contract says 5 segments, built has 3)
+- Wrong data model (contract says horizontal bars, built vertical)
+- Missing required elements (contract lists a legend, none exists)
+- Wrong HTML structure (contract says `<table>`, built with `<div>`)
+**If any critical failure found**, write rejection:
+```bash
+cat > .gsd-t/design-review/rejected/{id}.json << EOF
+{
+  "id": "{id}",
+  "reason": "{specific description of what's wrong vs. what the contract says}",
+  "severity": "critical",
+  "timestamp": "$(date -u +%Y-%m-%dT%H:%M:%SZ)",
+  "contractExpected": "{what the contract specifies}",
+  "actualFound": "{what was actually measured}"
+}
+EOF
+```
+Remove the item from the queue (it's been handled):
+```bash
+rm .gsd-t/design-review/queue/{id}.json
+```
+### 3d. AI Visual Review (non-critical items)
+For items that pass measurement checks, do a deeper AI review using Playwright:
+1. Open `http://localhost:3456/` in Playwright (proxied app)
+2. Navigate to the component's route
+3. Find the component using its selector
+4. Screenshot it
+5. Compare against the contract's visual spec:
+**Check these things measurements can't catch:**
+- Does the visual hierarchy look right? (title prominence, chart emphasis)
+- Are proportions correct? (chart not stretched/squished, legend not dominating)
+- Is the color palette cohesive? (not clashing, accessible contrast)
+- Does spacing feel balanced? (not cramped, not floating)
+- Are interactive states implied? (hover cursors, clickable affordances)
+**For widget-level reviews, additionally check:**
+- Does the widget import its elements, or rebuild them inline?
+  ```bash
+  grep -l "elements/" {sourcePath}
+  ```
+  If no element imports found → flag as HIGH: "Widget rebuilds elements inline"
+- Is the element composition correct per the widget contract?
+- Is the layout between elements correct (gap, alignment)?
+**For page-level reviews, additionally check:**
+- Are all widgets present per the page contract?
+- Is the grid/section layout correct?
+- Is the responsive behavior appropriate?
+### 3e. Report AI Review Results
+If the AI review finds non-critical issues, note them but do NOT auto-reject. These go to human review — the human decides if they matter.
+Update the queue item with AI review notes by writing an annotation file:
+```bash
+cat > .gsd-t/design-review/queue/{id}.ai-review.json << EOF
+{
+  "id": "{id}",
+  "aiVerdict": "pass|warn",
+  "notes": [
+    { "severity": "medium", "note": "Legend spacing feels tight — 4px gap between items" },
+    { "severity": "low", "note": "Bar corner radius slightly inconsistent with design system" }
+  ],
+  "reviewedAt": "$(date -u +%Y-%m-%dT%H:%M:%SZ)"
+}
+EOF
+```
+The review UI will pick up these notes and display them alongside the human review.
+## Step 4: Continue Monitoring
+After processing all current queue items, return to the poll loop (Step 2). New items may arrive as:
+- Term 1 builds more elements
+- Term 1 re-queues items after applying human feedback
+Continue until `shutdown.json` appears.
+## Rules
+- **You write ZERO code.** You only review and report.
+- **You have ZERO builder context.** Judge only by contract vs. rendered output.
+- **Be adversarial.** Your value is catching problems, not confirming success.
+- **Auto-reject ONLY critical failures.** Everything else goes to human review.
+- **Never modify queue items.** Only add rejection files or AI review annotations.
+- **Never read builder code to understand "why" something was built a certain way.** You only care about whether the output matches the contract.
+- **Check the contract, not your assumptions.** If the contract says 32px and the build shows 32px, it passes — even if you think 24px would look better.

package/commands/gsd-t-execute.md CHANGED Viewed

@@ -309,12 +309,57 @@ Execute the task above:
         recovers (retry button works, form can be resubmitted, etc.).
      A test that would pass on an empty HTML page with the right element IDs is useless.
      Every assertion must prove the FEATURE WORKS, not that the ELEMENT EXISTS.
-8. **Visual Design Note** (when design-to-code stack rule is active):
-   Do NOT perform visual verification yourself — a dedicated Design Verification Agent
-   (Step 5.25) runs after all domain tasks complete and handles the full visual comparison.
-   Your job: write precise code from the design contract tokens. Use exact hex colors,
-   exact spacing values, exact typography. Every CSS value must trace to the design contract.
-   The verification agent will open a browser and prove whether your code matches.
+8. **Render-Measure-Compare Loop** (when design-to-code stack rule is active — MANDATORY):
+   After implementing the component, you MUST verify it renders correctly by measuring
+   the actual DOM output against the contract's layout spec. This is not optional.
+   Do NOT rely on visual inspection or screenshots — measure mechanically.
+   a. **Render**: Start the dev server if not running. Navigate to a route where the
+      component is visible (or create a temporary test route that renders it in isolation).
+   b. **Measure via Playwright** — run `page.evaluate()` to extract DOM properties:
+      ```javascript
+      // For a widget: measure its internal layout
+      const el = document.querySelector('.widget-selector');
+      const style = getComputedStyle(el);
+      return {
+        display: style.display,           // 'flex' or 'grid'
+        flexDirection: style.flexDirection, // 'row' or 'column'
+        gap: style.gap,
+        gridTemplateColumns: style.gridTemplateColumns,
+        width: el.offsetWidth,
+        height: el.offsetHeight,
+        childCount: el.children.length,
+        children: Array.from(el.children).map(c => ({
+          tag: c.tagName,
+          width: c.offsetWidth,
+          height: c.offsetHeight,
+          display: getComputedStyle(c).display,
+          flexDirection: getComputedStyle(c).flexDirection,
+        }))
+      };
+      ```
+   c. **Compare to contract** — check each measured value against the contract spec:
+      - `body_layout: flex-row` → verify `flexDirection === 'row'`
+      - `container_height: 334px` → verify `height === 334` (±2px tolerance)
+      - Grid `2×2` → verify parent has 2 row children, each with 2 card children
+      - Legend position: if contract says `body_sidebar` (beside chart) →
+        verify legend and chart share a `flex-row` parent.
+        If contract says `footer_legend` (below chart) →
+        verify legend is in a `flex-column` parent below the chart.
+   d. **Fix mismatches** — if ANY measurement doesn't match the contract:
+      - Log: "LAYOUT MISMATCH: {property} expected {contract value}, got {measured value}"
+      - Fix the code to match the contract spec
+      - Re-render and re-measure (max 2 fix cycles)
+      - If still mismatched after 2 cycles → log to `.gsd-t/deferred-items.md`
+   e. **All pass** → log "RENDER-MEASURE PASS: {N} layout properties verified" and proceed.
+   This loop catches the exact class of errors that visual inspection misses:
+   grid-cols-4 instead of 2×2, legend below instead of beside, wrong flex-direction.
+   These are data comparisons, not visual judgments — the same kind of check as a unit test.
 9. Run ALL test suites — this is NOT optional, not conditional, not "if applicable":
    a. Detect configured test runners: check for vitest/jest config, playwright.config.*, cypress.config.*
    b. Run EVERY detected suite. Unit tests alone are NEVER sufficient when E2E exists.

package/commands/gsd-t-help.md CHANGED Viewed

@@ -61,6 +61,8 @@ UTILITIES                                                              Manual
   promote-debt        Convert techdebt items to milestones
   populate            Auto-populate docs from existing codebase
   design-decompose    Decompose design into element/widget/page contracts
+  design-build        Build from design contracts with two-terminal review
+  design-review       Independent review agent (Term 2) for design build
   log                 Sync progress Decision Log with recent git activity
   version-update      Update GSD-T package to latest version
   version-update-all  Update GSD-T package + all registered projects

package/commands/gsd-t-quick.md CHANGED Viewed

@@ -159,8 +159,16 @@ When you encounter unexpected situations:
    - If building/modifying a PAGE: IMPORT existing widget components — do NOT rebuild widget functionality inline.
    - **Contract is authoritative**: Follow the contract spec, not the Figma screenshot, when they appear to disagree.
 5. Make the change — **adapt new code to existing structures**, not the other way around
-6. Verify it works
-7. Commit: `[quick] {description}`
+6. **Render-Measure-Compare** (if design component — MANDATORY):
+   After implementing, verify via Playwright DOM measurement (not screenshots):
+   - Render the component in browser
+   - `page.evaluate()` to extract: display, flexDirection, gap, gridTemplateColumns,
+     offsetWidth, offsetHeight, child count and layout
+   - Compare each value to the contract's layout spec (body_layout, container_height, etc.)
+   - Mismatches → fix code → re-measure (max 2 cycles)
+   - This catches: wrong grid structure, legend below vs beside, wrong flex-direction
+7. Verify it works
+8. Commit: `[quick] {description}`
 ## Step 3.5: Emit Task Metrics

package/package.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "name": "@tekyzinc/gsd-t",
-  "version": "2.70.15",
-  "description": "GSD-T: Contract-Driven Development for Claude Code — 54 slash commands with headless CI/CD mode, graph-powered code analysis, real-time agent dashboard, execution intelligence, task telemetry, doc-ripple enforcement, backlog management, impact analysis, test sync, milestone archival, and PRD generation",
+  "version": "2.71.10",
+  "description": "GSD-T: Contract-Driven Development for Claude Code — 56 slash commands with headless CI/CD mode, graph-powered code analysis, real-time agent dashboard, execution intelligence, task telemetry, doc-ripple enforcement, backlog management, impact analysis, test sync, milestone archival, and PRD generation",
   "author": "Tekyz, Inc.",
   "license": "MIT",
   "repository": {