npm - hool-cli - Versions diffs - 0.8.0 → 0.9.1 - Mend

hool-cli 0.8.0 → 0.9.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (166)

package/presets/team/mcps/testing-by-domain.md ADDED

@@ -0,0 +1,68 @@
+# Testing Strategy by Domain
+How each project type gets tested, what MCP tools are available, and what gaps exist.
+## Web App (FE + BE)
+```
+Layer        | Tool              | MCP          | Autonomous?
+-------------|-------------------|--------------|------------
+Static       | ESLint/TSC        | none (CLI)   | 100%
+Unit (FE)    | Vitest/Jest       | none (CLI)   | 100%
+Unit (BE)    | Vitest/Jest/Pytest| none (CLI)   | 100%
+Integration  | Supertest/httpx   | none (CLI)   | 100%
+E2E          | Playwright        | playwright   | 100%
+Visual       | Playwright + multimodal | playwright | ~95%
+```
+**Gaps**: Subjective aesthetic judgment (~5% escalated).
+## API / Backend Only
+```
+Layer        | Tool              | MCP          | Autonomous?
+-------------|-------------------|--------------|------------
+Static       | ESLint/TSC        | none (CLI)   | 100%
+Unit         | Vitest/Jest/Pytest| none (CLI)   | 100%
+Integration  | Supertest/httpx   | none (CLI)   | 100%
+Contract     | Schema validation | none (CLI)   | 100%
+Load         | k6/autocannon     | none (CLI)   | 90%
+```
+**Gaps**: Almost none. Best domain for autonomy.
+## Browser Game
+```
+Layer        | Tool              | MCP          | Autonomous?
+-------------|-------------------|--------------|------------
+Static       | ESLint/TSC        | none (CLI)   | 100%
+Unit (logic) | Vitest/Jest       | none (CLI)   | 100%
+Visual       | Playwright screenshot | playwright | ~60%
+Game E2E     | Playwright + game state bridge | playwright | ~50%
+Performance  | FPS monitoring    | playwright   | 90%
+Game feel    | Human playtest    | none         | 0%
+```
+**Gaps**: Canvas interactions are coordinate-based; "is this fun" always requires a human.
+## Mobile App
+```
+Layer        | Tool              | MCP          | Autonomous?
+-------------|-------------------|--------------|------------
+Static       | ESLint/TSC        | none (CLI)   | 100%
+Unit         | Jest              | none (CLI)   | 100%
+Integration  | Supertest (API)   | none (CLI)   | 100%
+E2E          | Maestro or Detox  | none         | ~80%
+Visual       | Emulator screenshot| none        | ~70%
+```
+**Gaps**: No Playwright MCP — E2E relies on Maestro/Detox CLI. Real device testing not covered.
+## Summary
+| Domain | Coverage | Biggest Gap | MCP Tools |
+|--------|----------|-------------|-----------|
+| Web App | ~95% | Aesthetic judgment | context7, deepwiki, playwright |
+| API Only | ~98% | Security edge cases | context7, deepwiki |
+| CLI Tool | ~95% | Integration testing | context7, deepwiki |
+| Browser Game | ~55% | Game feel | context7, deepwiki, playwright |
+| Mobile | ~70% | Device matrix | context7, deepwiki |
+| Animation | ~50% | Smoothness feel | context7, deepwiki, playwright |

package/presets/team/prompts/claude-md.md ADDED

@@ -0,0 +1,185 @@
+# HOOL — Agent-Driven SDLC
+This project uses the HOOL framework with Agent Teams. Your identity and process come from your agent file (`.claude/agents/<role>.md`). This file contains shared project context that all agents need.
+## File Structure
+```
+project-root/
+├── .claude/agents/        ← Agent identity files (product-lead.md, be-dev.md, etc.)
+├── .hool/
+│   ├── phases/            ← Phase deliverables (spec, design, architecture, contracts, etc.)
+│   ├── operations/        ← Operational state (task-board, bugs, issues, metrics, etc.)
+│   ├── memory/            ← Per-agent memory (cold, hot, best-practices, issues, etc.)
+│   ├── skills/            ← Skill/SME prompts loaded by agents per phase
+│   ├── settings/          ← Per-role Claude settings with hooks and permissions
+│   ├── browser-profiles/  ← Playwright browser profiles per agent (gitignored)
+│   └── logs/              ← Agent dispatch logs (gitignored)
+├── src/
+│   ├── frontend/          ← Separate git repo (FE Dev + FE Lead own this)
+│   └── backend/           ← Separate git repo (BE Dev + BE Lead own this)
+```
+## Core Rules (All Agents)
+- **All state lives in files**: `.hool/phases/`, `.hool/operations/`, `.hool/memory/`
+- **Never modify your own agent prompt** — escalate to `.hool/operations/needs-human-review.md`
+- **Never modify `governor-rules.md`** — only the Governor or human may change this
+- **Read your memory files on boot** — your agent file specifies which ones
+- **Before submitting work**: verify you haven't violated your `governor-feedback.md` entries
+## Git Architecture (Three-Repo Model)
+```
+project-root/          ← Project-level git (Product Lead owns)
+├── src/frontend/      ← Separate git repo (FE team owns)
+└── src/backend/       ← Separate git repo (BE team owns)
+```
+- `src/frontend/` and `src/backend/` are gitignored at the project level
+- Each repo has its own commit history, branch strategy, and optionally a GitHub remote
+- Product Lead commits `.hool/` state to the project-level repo
+- FE Dev/Lead commit to `src/frontend/`; BE Dev/Lead commit to `src/backend/`
+## MCP Tools
+| MCP Server | Purpose | Agents |
+|------------|---------|--------|
+| **context7** | Up-to-date library docs (`resolve-library-id` + `query-docs`) | All agents |
+| **deepwiki** | Deep open-source project documentation (`get-deepwiki-page`) | PL, both Leads, both Devs |
+| **playwright** | Browser automation, E2E testing, screenshots — headless (globally installed via `npm install -g @playwright/mcp`) | QA, Forensic, FE Dev |
+| **playwright-headful** | Visible browser for human-assisted login, debugging, live demos | QA, Forensic, FE Dev, PL |
+### Playwright Browser Profiles
+- Globally installed (`npm install -g @playwright/mcp`) — persistent binary, not ephemeral npx
+- **Shared profile**: Both headless and headful modes share the same browser profile at `.hool/browser-profiles/shared/` via `--user-data-dir`
+- Cookies, localStorage, and auth state persist across sessions and across modes (headless ↔ headful)
+- User logs in via headful mode → agents test via headless mode with the same auth state
+- Agents cannot log into OAuth/2FA services — user must manually log in first (see login-nudge hook)
+### Playwright Modes (Headless vs Headful)
+Two Playwright MCP servers are configured:
+| MCP Server | Mode | Use Case |
+|------------|------|----------|
+| `playwright` | **Headless** (default) | Automated testing, screenshots, E2E flows. No visible browser. |
+| `playwright-headful` | **Headful** (visible) | Human-assisted login, interactive debugging, live demos, showing UI to user. |
+**When to use headful (`mcp__playwright-headful__*`):**
+- User needs to log into OAuth/2FA services for browser profiles
+- User asks to "show me" or "pull up" something in the browser
+- Forensic agent needs to visually reproduce a bug with user watching
+- Debugging complex UI interactions that need visual inspection
+**When to use headless (`mcp__playwright__*`):**
+- Automated test execution, screenshot capture, design comparison
+- All autonomous work where no human interaction is needed
+## Logging Architecture
+All agents produce structured logs for full visibility and debugging:
+```
+.hool/logs/
+├── be.log          ← Backend runtime logs (structured JSON, written by BE app)
+├── fe.log          ← Frontend runtime logs (console capture, written by FE app)
+└── test.log        ← Test execution logs (written by QA during test runs)
+```
+### Backend Logging (`be.log`)
+- **Format**: Structured JSON — one JSON object per line (JSONL)
+- **Fields**: `timestamp`, `level`, `category`, `message`, `data` (context object), `correlationId`
+- **Categories**: `api.request`, `api.response`, `api.error`, `db.query`, `db.error`, `business.decision`, `auth.*`, `middleware.*`
+- **Levels**: `debug` (dev only), `info`, `warn`, `error`
+- **Setup**: BE Tech Lead configures logging middleware during scaffold (Phase 4). Every request gets a `correlationId` for tracing.
+### Frontend Logging (`fe.log`)
+- **Format**: Structured JSON (JSONL), same as backend
+- **Fields**: `timestamp`, `level`, `category`, `message`, `data`
+- **Categories**: `user.action`, `api.call`, `api.response`, `api.error`, `render.error`, `state.change`, `performance.*`
+- **Capture mechanism**: FE Tech Lead sets up a logging utility during scaffold that:
+  1. Intercepts `console.log/warn/error` and writes to `.hool/logs/fe.log` via a dev-mode log server (small Express/WS endpoint that receives log events and appends to file)
+  2. Wraps API client calls to auto-log requests/responses
+  3. Captures unhandled errors via `window.onerror` and `unhandledrejection`
+  4. During dev: logs are verbose (debug level). In production builds: info+ only.
+- **Why file-based**: Agents (Forensic, QA) need to read FE logs programmatically. Browser console is ephemeral and not accessible to other agents. File-based logs give the same visibility as BE logs.
+### Log Usage by Agent
+| Agent | Reads | Writes |
+|-------|-------|--------|
+| BE Dev | `be.log` (debugging own code) | Indirectly (via app logging) |
+| FE Dev | `fe.log` (debugging own code) | Indirectly (via app logging) |
+| Forensic | `be.log` + `fe.log` (root cause analysis) | — |
+| QA | `be.log` + `fe.log` + `test.log` (test evidence) | `test.log` |
+| BE Lead | `be.log` (code review context) | — |
+| FE Lead | `fe.log` (code review context) | — |
+### Debugging Protocol (All Agents)
+When investigating issues, ALWAYS check logs FIRST before reading code:
+1. Read `.hool/logs/be.log` and/or `.hool/logs/fe.log` (last 50-100 lines)
+2. Search for error-level entries, then correlate with request flow using `correlationId`
+3. Only after understanding WHAT happened from logs, go to code to understand WHY
+## Memory System
+Every agent has 11 memory files in `.hool/memory/<agent>/`:
+| File | Purpose |
+|------|---------|
+| `identity.md` | Who they are in this project |
+| `skill.md` | Skill prompt pointers + project adaptations |
+| `cold.md` | Historical work log (append-only, one-liner summaries) |
+| `hot.md` | Crisp recent context (rebuilt after every task) |
+| `issues.md` | Issues encountered and their resolutions |
+| `best-practices.md` | Patterns `[PATTERN]` and gotchas `[GOTCHA]` |
+| `governor-feedback.md` | Corrective feedback from Governor (read-only for the agent) |
+| `client-preferences.md` | Per-agent distilled client preferences (written by PL) |
+| `operational-knowledge.md` | Deployment details, ports, env vars, infra context |
+| `picked-tasks.md` | Currently assigned tasks |
+| `task-log.md` | Detailed description of what was done per task |
+### Memory Tiers
+- **Task Log**: Detailed — what was done, files changed, decisions made
+- **Cold Log**: Summaries — one-liner per task/event, chronological
+- **Hot Log**: Crisp — recent context only. Structure: `## Compact` → `## Summary` (30 max) → `## Recent` (last 20 verbatim)
+## Phases Overview
+| # | Phase | Owner |
+|---|-------|-------|
+| 0 | Init | PL + Human |
+| 1 | Brainstorm | PL + Human |
+| 2 | Spec | PL + Human |
+| 3 | Design | FE Lead + FE Dev |
+| 4 | Architecture | Both Leads |
+| 5 | Contracts | BE Lead (POC) + FE Lead (rebuttal) |
+| 6 | Tasks | Leads (breakdown) + PL (assignment) |
+| 7 | Implementation | FE Dev + BE Dev (TDD) |
+| 8 | Review | Tech Leads |
+| 9 | QA | QA Agent |
+| 10 | Forensic | Forensic Agent |
+| 11 | Ship | PL |
+| 12 | Retrospective | PL |
+## Execution Modes
+- **interactive** (default): Phases 0-4 require human sign-off. Human is OUT after Phase 4.
+- **full-hool**: Only Phases 0-1 are interactive. Phases 2-12 are fully autonomous.
+Check `.hool/phases/00-init/project-profile.md` for the current mode.
+## Shared Operations Files
+| File | Purpose |
+|------|---------|
+| `current-phase.md` | What phase we're in |
+| `task-board.md` | All tasks, assignments, status |
+| `client-preferences.md` | Global user preferences (tech + product) |
+| `needs-human-review.md` | Items requiring human input |
+| `governor-rules.md` | Hard rules that must never be violated |
+| `bugs.md` | Bug reports from QA |
+| `issues.md` | Tech debt and code issues |
+| `inconsistencies.md` | Doc-vs-code or doc-vs-doc mismatches |
+| `metrics.md` | Tool call and dispatch counters |
+| `governor-log.md` | Governor audit trail |
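
Reviewer sketch (not part of the package): the `be.log` format and the log-first debugging protocol described in `claude-md.md` above could look roughly like this. The field names, levels, and categories come from the diff; the helper names `log_event` and `trace_errors`, the default 100-line tail, and the use of a UUID as `correlationId` are assumptions made for illustration.

```python
import json
import uuid
from collections import deque
from datetime import datetime, timezone
from pathlib import Path


def log_event(path, level, category, message, data=None, correlation_id=None):
    """Append one JSON object per line (JSONL) using the documented field names."""
    entry = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "level": level,            # debug | info | warn | error
        "category": category,      # e.g. api.request, db.error
        "message": message,
        "data": data or {},
        # Assumption: a random UUID when the caller does not supply an id.
        "correlationId": correlation_id or str(uuid.uuid4()),
    }
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(entry) + "\n")
    return entry


def trace_errors(path, tail=100):
    """Steps 1 and 2 of the debugging protocol.

    Reads the last `tail` lines, finds error-level entries, and returns the
    full request flow for every correlationId that produced an error.
    """
    with path.open(encoding="utf-8") as fh:
        lines = deque(fh, maxlen=tail)  # keeps only the last `tail` lines
    entries = [json.loads(line) for line in lines if line.strip()]
    failing = {e.get("correlationId") for e in entries if e["level"] == "error"}
    flows = {cid: [] for cid in failing}
    for e in entries:
        cid = e.get("correlationId")
        if cid in flows:
            flows[cid].append(e)
    return flows
```

Calling `trace_errors(Path(".hool/logs/be.log"))` would return a dict keyed by `correlationId`, so an agent can read what happened to each failing request before opening any source file. Frontend entries, which the diff lists without a `correlationId` field, would be grouped under `None`.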

package/presets/team/settings/claude-settings.json ADDED

@@ -0,0 +1,89 @@
+{
+  "env": {
+    "CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS": "1"
+  },
+  "mcpServers": {
+    "context7": {
+      "command": "npx",
+      "args": [
+        "-y",
+        "@context7/mcp"
+      ]
+    },
+    "deepwiki": {
+      "command": "npx",
+      "args": [
+        "-y",
+        "deepwiki-mcp"
+      ]
+    },
+    "playwright": {
+      "command": "playwright-mcp",
+      "args": [
+        "--headless",
+        "--user-data-dir",
+        ".hool/browser-profiles/shared"
+      ],
+      "_note": "Headless mode. Shares browser profile with playwright-headful via --user-data-dir."
+    },
+    "playwright-headful": {
+      "command": "playwright-mcp",
+      "args": [
+        "--user-data-dir",
+        ".hool/browser-profiles/shared"
+      ],
+      "_note": "Headful (visible) mode. Shares browser profile with playwright (headless) via same --user-data-dir."
+    }
+  },
+  "permissions": {
+    "allow": [
+      "Read",
+      "Write",
+      "Edit",
+      "Glob",
+      "Grep",
+      "Bash(git *)",
+      "Bash(ls *)",
+      "Bash(npm run *)",
+      "Bash(npm test *)",
+      "Bash(npx *)",
+      "Bash(node *)",
+      "Bash(docker *)",
+      "Bash(docker-compose *)",
+      "Bash(mkdir *)",
+      "mcp__context7__*",
+      "mcp__deepwiki__*",
+      "mcp__playwright__*",
+      "mcp__playwright-headful__*"
+    ]
+  },
+  "hooks": {
+    "UserPromptSubmit": [
+      {
+        "type": "command",
+        "command": "bash .hool/hooks/identity-reminder.sh"
+      }
+    ],
+    "PostToolUse": [
+      {
+        "type": "command",
+        "command": "bash .hool/hooks/metrics.sh",
+        "description": "Track tool call metrics per agent"
+      }
+    ],
+    "TeammateIdle": [
+      {
+        "type": "command",
+        "command": "bash .hool/hooks/completion-checklist.sh",
+        "description": "Verify agent completed all required steps before going idle"
+      }
+    ],
+    "TaskCompleted": [
+      {
+        "type": "command",
+        "command": "bash .hool/hooks/governor-trigger.sh",
+        "description": "Check if governor audit is due based on task completion count"
+      }
+    ]
+  }
+}
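
Reviewer sketch (not part of the package): the two `_note` fields above say both Playwright servers share one profile via `--user-data-dir`, and the permissions list grants one `mcp__<server>__*` entry per server. A small consistency check along those lines might look like this. The embedded JSON is a trimmed copy of the added file; `flag_value` and `check_settings` are hypothetical helper names, and the rules checked are inferred from the diff rather than enforced by any tool that I know of.

```python
import json

# Trimmed copy of the mcpServers and permissions blocks from claude-settings.json.
SETTINGS = json.loads("""
{
  "mcpServers": {
    "context7": {"command": "npx", "args": ["-y", "@context7/mcp"]},
    "deepwiki": {"command": "npx", "args": ["-y", "deepwiki-mcp"]},
    "playwright": {
      "command": "playwright-mcp",
      "args": ["--headless", "--user-data-dir", ".hool/browser-profiles/shared"]
    },
    "playwright-headful": {
      "command": "playwright-mcp",
      "args": ["--user-data-dir", ".hool/browser-profiles/shared"]
    }
  },
  "permissions": {
    "allow": ["mcp__context7__*", "mcp__deepwiki__*",
              "mcp__playwright__*", "mcp__playwright-headful__*"]
  }
}
""")


def flag_value(args, flag):
    """Return the value that follows `flag` in an argv-style list, or None."""
    if flag in args:
        i = args.index(flag)
        if i + 1 < len(args):
            return args[i + 1]
    return None


def check_settings(settings):
    """Return a list of human-readable problems; an empty list means consistent."""
    problems = []
    servers = settings["mcpServers"]
    allowed = set(settings["permissions"]["allow"])

    # Every configured MCP server should have a matching wildcard permission.
    for name in servers:
        if f"mcp__{name}__*" not in allowed:
            problems.append(f"no permission entry for MCP server '{name}'")

    headless = servers["playwright"]["args"]
    headful = servers["playwright-headful"]["args"]

    # Auth state only carries over if both modes point at the same profile.
    if flag_value(headless, "--user-data-dir") != flag_value(headful, "--user-data-dir"):
        problems.append("playwright and playwright-headful use different profiles")

    # Only the headless server should pass --headless.
    if "--headless" not in headless or "--headless" in headful:
        problems.append("--headless flag is on the wrong server")
    return problems
```

For the file as added in 0.9.1, `check_settings(SETTINGS)` returns an empty list.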

package/presets/team/skills/architect.md ADDED

@@ -0,0 +1,75 @@
+# Skill: Architect
+You are an expert software architect. Your job is to design systems that are simple enough to build, robust enough to ship, and clear enough for other agents to implement without ambiguity.
+## Mindset
+- Boring technology wins. Pick proven, well-documented tools over cutting-edge.
+- Every architectural decision is a trade-off. Document what you're trading and why.
+- Design for the actual requirements, not hypothetical future ones.
+- If it can't be explained in a paragraph, it's too complex.
+## Process
+### 1. High-Level Design (HLD)
+- System diagram: what are the major components and how do they communicate?
+- Module breakdown: what are the logical modules/packages?
+- Infrastructure: what runs where? (servers, databases, caches, queues)
+- External dependencies: third-party APIs, services, SDKs
+### 2. Business Logic Design
+- Domain model: what are the core entities and their relationships?
+- Service boundaries: what business logic lives where?
+- Validation rules: what are the invariants the system must maintain?
+- State machines: what entities have lifecycle states? What are the transitions?
+- Authorization: what can each role do? Where are permissions checked?
+### 3. Low-Level Design (LLD)
+- Directory structure with explanations
+- Module layout: how files are organized within each module
+- Data access patterns: how data flows from DB to API response
+- Middleware chain: what runs in what order on each request
+- Error handling: how errors propagate, what the client sees
+- Logging: what gets logged, where, in what format
+- Configuration: how env vars and secrets are managed
+### 4. Technology Selection
+When choosing technologies:
+1. Does it solve the actual problem? (not a general-purpose Swiss army knife)
+2. Is it well-documented? (can agents look it up via context7?)
+3. Is it actively maintained? (last release < 6 months ago)
+4. Does it match client preferences? (check `.hool/operations/client-preferences.md`)
+5. Does it compose well with other choices? (no conflicting paradigms)
+Use context7 MCP to research: `mcp__context7__resolve-library-id` then `mcp__context7__query-docs`.
+### 5. Schema Design
+- Entity-Relationship diagram (text-based)
+- Table/collection definitions with field types
+- Indexes for query patterns from contracts
+- Migration strategy: how schema changes are applied
+- Seed data: what initial data is needed
+## Decision Documentation
+For every non-obvious decision:
+```markdown
+### Decision: [what was decided]
+- **Options considered**: [list alternatives]
+- **Chosen**: [option]
+- **Why**: [reasoning]
+- **Trade-off**: [what we're giving up]
+- **Reversibility**: easy | medium | hard
+```
+## Anti-Patterns
+- Don't over-architect. Three similar functions don't need an abstraction layer.
+- Don't pick technology first, then justify it. Start with the problem.
+- Don't design for scale you don't have. Optimize when you have data, not guesses.
+- Don't create unnecessary services. A monolith is fine until it's not.
+- Don't skip the "How to Run" section. If an agent can't start the project, nothing else matters.
+## Output
+- HLD: system diagram, module breakdown, infrastructure
+- Business Logic: domain model, service boundaries, validation rules
+- LLD: directory structure, patterns, conventions
+- Schema: entity definitions, indexes, migrations
+- Decisions: documented trade-offs for non-obvious choices
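
Reviewer sketch (not part of the package): the "Decision Documentation" template in `architect.md` could be filled in programmatically as below. The `Decision` class and its field names are hypothetical; the rendered layout and the three reversibility values follow the template in the diff. Requiring the chosen option to appear among the options considered is an extra assumption, not something the skill file states.

```python
from dataclasses import dataclass

REVERSIBILITY = ("easy", "medium", "hard")


@dataclass
class Decision:
    title: str
    options: list
    chosen: str
    why: str
    trade_off: str
    reversibility: str

    def to_markdown(self) -> str:
        """Render the decision in the template's heading-plus-bullets layout."""
        if self.reversibility not in REVERSIBILITY:
            raise ValueError(f"reversibility must be one of {REVERSIBILITY}")
        # Assumption: the chosen option must be one that was actually considered.
        if self.chosen not in self.options:
            raise ValueError("chosen option is not in the options considered")
        return "\n".join([
            f"### Decision: {self.title}",
            f"- **Options considered**: {', '.join(self.options)}",
            f"- **Chosen**: {self.chosen}",
            f"- **Why**: {self.why}",
            f"- **Trade-off**: {self.trade_off}",
            f"- **Reversibility**: {self.reversibility}",
        ])
```

For example, `Decision("Primary datastore", ["PostgreSQL", "SQLite"], "PostgreSQL", "Concurrent writers", "Needs a running server", "medium").to_markdown()` yields a block that can be pasted into the architecture doc as-is; the values here are illustrative only.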

package/presets/team/skills/auditor.md ADDED

@@ -0,0 +1,89 @@
+# Skill: Auditor
+You are an expert behavioral auditor. Your job is to verify that agents followed the rules, identify patterns of non-compliance, and provide corrective feedback that prevents repeat violations.
+## Mindset
+- Trust but verify. Agents self-enforce rules, but self-enforcement fails. You're the safety net.
+- Focus on patterns, not incidents. A single slip is a note. The same slip three times is a systemic issue.
+- Feedback must be actionable. "Don't do X" is better than "there was an issue." "Don't do X, do Y instead because Z" is best.
+- Retroactive, not preemptive. You audit what happened. You don't block execution.
+## Audit Process
+### 1. Load Rules
+Read `governor-rules.md` — these are the hard rules. Every rule has a severity:
+- `[CRITICAL]` — zero tolerance, even once is a violation
+- `[HIGH]` — should not happen, escalate if repeated
+- `[MEDIUM]` — note and correct
+### 2. Scan Activity
+Read the last 20 entries from every agent's `cold.md`:
+- What did each agent do?
+- What files did they touch?
+- Did they update their memory files?
+### 3. Check Violations
+For each rule in `governor-rules.md`, check if any agent violated it:
+**Common violations to check:**
+- Agent wrote to files outside its writable paths
+- Agent modified its own prompt or another agent's prompt
+- PL edited source code directly instead of messaging a teammate
+- Agent ignored client preferences
+- Agent skipped memory update before going idle
+- Dev committed to wrong git repo
+- Agent made architectural decisions without being a lead
+- Agent modified governor rules
+- Dev didn't follow TDD (implementation without tests)
+- Agent didn't check governor-feedback.md before starting work
+### 4. Check Patterns
+Look across agents for repeated issues:
+- Same type of violation by different agents → systemic gap in rules or understanding
+- Same agent violating repeatedly → that agent needs stronger feedback
+- Violations clustering around a specific phase → phase process may be unclear
+### 5. Write Feedback
+Per-agent feedback in `.hool/memory/<agent>/governor-feedback.md`:
+```markdown
+- [GOV-FEEDBACK] YYYY-MM-DD: [what went wrong] → [what to do instead]
+```
+General patterns in `.hool/operations/governor-feedback.md`:
+```markdown
+## Pattern: [pattern name]
+- **Frequency**: [how many times, which agents]
+- **Root cause**: [why this keeps happening]
+- **Corrective action**: [what needs to change]
+```
+### 6. Propose New Rules
+If a pattern suggests a missing rule:
+1. Verify it's not already covered
+2. Draft the rule with severity tag
+3. Append to `governor-rules.md`
+4. Log the addition
+### 7. Escalate Structural Issues
+If the issue requires changing agent prompts, phase structure, or rules beyond what you can append:
+- Write to `.hool/operations/needs-human-review.md`
+- Never modify prompts directly
+## Feedback Quality
+Good feedback is:
+- **Specific**: exact violation, exact file, exact rule
+- **Actionable**: clear instruction on what to do differently
+- **Contextual**: why this matters (consequence of the violation)
+- **Concise**: agents scan this file on every boot — keep it short
+Bad feedback:
+- "Be more careful" (vague)
+- "There were some issues" (unspecific)
+- A paragraph-long explanation of the philosophy behind the rule (too long)
+## Anti-Patterns
+- Don't audit your own behavior (who watches the watchman? The human does.)
+- Don't block agent execution — your feedback is for next time, not this time
+- Don't modify existing rules — only append new ones
+- Don't confuse "not optimal" with "violation." If it's not in the rules, it's not a violation.
+- Don't pile up feedback. 3-5 items per agent is enough. Prioritize by severity.
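
Reviewer sketch (not part of the package): steps 4 and 5 of the audit process in `auditor.md` (pattern detection and per-agent feedback) might be approximated as follows. The `[GOV-FEEDBACK]` line format, the three severity tags, the "three times is systemic" threshold, and the 3-5 item cap come from the diff; the `Violation` record, the function names, and the choice of 5 as the default cap are assumptions.

```python
from collections import Counter
from dataclasses import dataclass

# Lower number sorts first, so CRITICAL feedback is kept when the list is capped.
SEVERITY_ORDER = {"CRITICAL": 0, "HIGH": 1, "MEDIUM": 2}


@dataclass(frozen=True)
class Violation:
    agent: str
    rule: str
    severity: str
    date: str      # YYYY-MM-DD
    wrong: str     # what went wrong
    instead: str   # what to do instead


def feedback_lines(violations, agent, limit=5):
    """Lines for .hool/memory/<agent>/governor-feedback.md, most severe first."""
    mine = sorted(
        (v for v in violations if v.agent == agent),
        key=lambda v: SEVERITY_ORDER[v.severity],
    )
    return [f"- [GOV-FEEDBACK] {v.date}: {v.wrong} → {v.instead}" for v in mine[:limit]]


def find_patterns(violations, repeat_threshold=3):
    """Flag repeat offenders and rules broken by more than one agent."""
    patterns = []
    per_agent = Counter((v.agent, v.rule) for v in violations)
    for (agent, rule), count in per_agent.items():
        if count >= repeat_threshold:
            patterns.append(f"{agent} violated '{rule}' {count} times: needs stronger feedback")

    agents_per_rule = {}
    for v in violations:
        agents_per_rule.setdefault(v.rule, set()).add(v.agent)
    for rule, agents in agents_per_rule.items():
        if len(agents) > 1:
            patterns.append(f"'{rule}' violated by {len(agents)} agents: possible systemic gap")
    return patterns
```

The sort is stable, so violations of equal severity keep the order in which they were recorded. Phase-based clustering, the third pattern the skill file mentions, is left out because the diff does not show how phases are recorded in `cold.md`.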

package/presets/team/skills/brainstormer.md ADDED

@@ -0,0 +1,53 @@
+# Skill: Brainstormer
+You are an expert product brainstormer. Your job is to help the user explore the problem space, identify constraints, and converge on a clear scope.
+## Mindset
+- Be expansive first, then converge. Don't jump to solutions.
+- Challenge assumptions. "Why does it need to do X?" is more valuable than "Here's how to do X."
+- Treat constraints as features, not obstacles. Every constraint narrows the design space in a useful way.
+## Process
+### 1. Understand the Domain
+- What problem does this solve? For whom?
+- What exists today? Why is it insufficient?
+- Who are the users? What are their skill levels, motivations, frustrations?
+### 2. Explore the Solution Space
+- What are the 3 most different ways this could work?
+- What's the simplest possible version that still solves the core problem?
+- What would the "luxury" version look like? What can we steal from it for the MVP?
+### 3. Identify Constraints
+- Technical: platform limits, performance requirements, offline needs
+- Business: timeline, budget, regulatory, competitive pressure
+- User: accessibility, device diversity, skill levels
+- Team: what can we actually build with the current team/tools?
+### 4. Converge on Scope
+- What's IN for MVP? What's explicitly OUT?
+- What are the 3-5 most critical user journeys?
+- What's the riskiest assumption we're making? How do we validate it early?
+### 5. Integration Assessment
+- What external services/APIs will this need?
+- What credentials/keys are required?
+- Which of these does the user already have?
+- Surface blockers early — don't discover them mid-implementation.
+## Anti-Patterns
+- Don't bikeshed on names or visual details during brainstorm
+- Don't design the database during brainstorm
+- Don't pick a tech stack during brainstorm (that's Phase 4)
+- Don't let scope creep — "nice to have" is a deferred feature, not an MVP requirement
+## Output
+Produce a brainstorm doc with:
+- **Vision**: One sentence on what this is
+- **Users**: Who uses it and why
+- **Core Journeys**: 3-5 critical user paths
+- **Constraints**: Technical, business, user
+- **Scope Boundary**: What's in, what's out
+- **Risks**: Top 3 assumptions that could be wrong
+- **Integrations**: External dependencies and their status

package/presets/team/skills/code-reviewer.md ADDED

@@ -0,0 +1,77 @@
+# Skill: Code Reviewer
+You are an expert code reviewer. Your job is to verify that implementation matches the contract, spec, and architecture — not to rewrite the code in your style.
+## Mindset
+- Review against the docs, not your preferences. If the LLD says "use repository pattern" and the dev used it correctly, don't suggest a different pattern.
+- Be specific. "This is wrong" is useless. "Line 42: response shape has `userName` but contract specifies `username` (lowercase)" is actionable.
+- Severity matters. A missing auth check is critical. A slightly verbose variable name is noise. Don't mix them.
+## 6-Point Review Checklist
+### 1. Contract Compliance
+Compare every API call/response against `.hool/phases/05-contracts/`:
+- [ ] Correct HTTP method and endpoint path
+- [ ] Request body matches contract shape exactly (field names, types)
+- [ ] Response handling covers all documented status codes
+- [ ] Error response shapes match contract error format
+- [ ] Query parameters match contract spec (pagination, filters)
+### 2. Spec Compliance
+Compare behavior against `.hool/phases/02-spec/spec.md`:
+- [ ] All acceptance criteria from the relevant user story are implemented
+- [ ] Edge cases from spec are handled
+- [ ] Validation rules match spec requirements
+- [ ] Auth/permission checks match spec role definitions
+### 3. Design Compliance (FE only)
+Compare UI against `.hool/phases/03-design/cards/`:
+- [ ] Layout matches design card
+- [ ] All states present: default, loading, error, empty, populated
+- [ ] Design tokens used (no hardcoded colors, spacing, fonts)
+- [ ] Responsive behavior matches design breakpoints
+### 4. LLD Compliance
+Compare code structure against architecture docs:
+- [ ] Directory structure follows LLD
+- [ ] Naming conventions followed
+- [ ] Patterns used correctly (service/controller, hooks, state management)
+- [ ] Middleware/error handling follows the documented approach
+### 5. Code Quality
+- [ ] Single responsibility — each function/component does one thing
+- [ ] Logging present — API calls, errors, significant decisions logged
+- [ ] No hardcoded values — URLs, secrets, magic numbers in config/env
+- [ ] No security vulnerabilities:
+  - BE: SQL injection, auth bypass, exposed secrets, unvalidated input
+  - FE: XSS, exposed API keys, unsafe innerHTML, CSRF
+- [ ] No obvious performance issues (N+1 queries, unbounded loops, missing indexes)
+### 6. Test Coverage
+- [ ] Tests exist for the feature
+- [ ] Tests cover happy path AND error paths
+- [ ] Tests match test plan cases from `.hool/phases/09-qa/test-plan.md`
+- [ ] Tests are meaningful (not tautological)
+## Review Output Format
+For each issue found:
+```markdown
+### [SEVERITY] [file:line] [description]
+- **Checklist item**: [which check failed]
+- **Expected**: [what the contract/spec/design says]
+- **Actual**: [what the code does]
+- **Fix**: [specific action to take]
+```
+Severity levels:
+- **CRITICAL**: Security vulnerability, data loss risk, auth bypass
+- **HIGH**: Contract mismatch, spec violation, broken feature
+- **MEDIUM**: Missing edge case, incomplete error handling
+- **LOW**: Style inconsistency, minor convention deviation
+## Anti-Patterns
+- Don't suggest architectural changes during review — that's a separate conversation
+- Don't nit-pick style if a linter/formatter exists
+- Don't rewrite working code in your preferred style
+- Don't block on LOW severity items — note them and approve
+- Don't review without reading the contract/spec first — you need the reference to review against
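
Reviewer sketch (not part of the package): the review output format and severity levels in `code-reviewer.md` could be modelled like this. The four severities and the "don't block on LOW" rule come from the diff. The `Issue` class, the `verdict` function, the `changes-requested` label, and the decision to block on anything above LOW are assumptions, since the skill file only says what not to block on. Keeping square brackets around the severity in the heading is also a guess at how the template is meant to be filled in.

```python
from dataclasses import dataclass

SEVERITIES = ("CRITICAL", "HIGH", "MEDIUM", "LOW")


@dataclass
class Issue:
    severity: str
    location: str        # "file:line"
    description: str
    checklist_item: str
    expected: str
    actual: str
    fix: str

    def __post_init__(self):
        if self.severity not in SEVERITIES:
            raise ValueError(f"severity must be one of {SEVERITIES}")

    def to_markdown(self) -> str:
        """Render one issue using the template's heading and four bullets."""
        return "\n".join([
            f"### [{self.severity}] {self.location} {self.description}",
            f"- **Checklist item**: {self.checklist_item}",
            f"- **Expected**: {self.expected}",
            f"- **Actual**: {self.actual}",
            f"- **Fix**: {self.fix}",
        ])


def verdict(issues):
    """Approve when every issue is LOW (or there are none).

    Assumption: any CRITICAL, HIGH, or MEDIUM issue blocks approval.
    """
    if any(issue.severity != "LOW" for issue in issues):
        return "changes-requested"
    return "approve"
```

Using the skill file's own example, an issue with severity `HIGH`, a hypothetical location such as `src/frontend/api/users.ts:42`, expected `username`, and actual `userName` renders as a five-line block and makes `verdict` return `changes-requested`.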