npm - sequant - Versions diffs - 2.0.1 → 2.1.0 - Mend

sequant 2.0.1 → 2.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (58) hide show

package/.claude-plugin/marketplace.json +1 -1
package/.claude-plugin/plugin.json +1 -1
package/dist/bin/cli.js +2 -1
package/dist/marketplace/external_plugins/sequant/.claude-plugin/plugin.json +1 -1
package/dist/marketplace/external_plugins/sequant/.mcp.json +6 -0
package/dist/marketplace/external_plugins/sequant/README.md +58 -8
package/dist/marketplace/external_plugins/sequant/hooks/post-tool.sh +19 -8
package/dist/marketplace/external_plugins/sequant/hooks/pre-tool.sh +36 -49
package/dist/marketplace/external_plugins/sequant/skills/_shared/references/subagent-types.md +158 -48
package/dist/marketplace/external_plugins/sequant/skills/assess/SKILL.md +354 -352
package/dist/marketplace/external_plugins/sequant/skills/exec/SKILL.md +1155 -33
package/dist/marketplace/external_plugins/sequant/skills/fullsolve/SKILL.md +35 -4
package/dist/marketplace/external_plugins/sequant/skills/qa/SKILL.md +2157 -104
package/dist/marketplace/external_plugins/sequant/skills/qa/scripts/quality-checks.sh +1 -1
package/dist/marketplace/external_plugins/sequant/skills/setup/SKILL.md +386 -0
package/dist/marketplace/external_plugins/sequant/skills/solve/SKILL.md +38 -664
package/dist/marketplace/external_plugins/sequant/skills/spec/SKILL.md +505 -120
package/dist/marketplace/external_plugins/sequant/skills/test/SKILL.md +246 -1
package/dist/marketplace/external_plugins/sequant/skills/testgen/SKILL.md +138 -1
package/dist/src/commands/dashboard.js +1 -1
package/dist/src/commands/doctor.js +1 -1
package/dist/src/commands/init.js +10 -10
package/dist/src/commands/logs.js +1 -1
package/dist/src/commands/run.js +49 -39
package/dist/src/commands/state.js +3 -3
package/dist/src/commands/status.js +5 -5
package/dist/src/commands/sync.js +8 -8
package/dist/src/commands/update.js +16 -16
package/dist/src/lib/cli-ui.js +20 -19
package/dist/src/lib/merge-check/index.js +2 -2
package/dist/src/lib/settings.d.ts +8 -0
package/dist/src/lib/settings.js +1 -0
package/dist/src/lib/shutdown.js +1 -1
package/dist/src/lib/templates.js +2 -0
package/dist/src/lib/wizard.js +6 -4
package/dist/src/lib/workflow/batch-executor.js +1 -1
package/dist/src/lib/workflow/log-writer.js +6 -6
package/dist/src/lib/workflow/metrics-writer.js +5 -3
package/dist/src/lib/workflow/phase-executor.js +5 -1
package/dist/src/lib/workflow/platforms/github.js +5 -1
package/dist/src/lib/workflow/state-cleanup.js +1 -1
package/dist/src/lib/workflow/state-manager.js +15 -13
package/dist/src/lib/workflow/state-rebuild.js +2 -2
package/dist/src/lib/workflow/types.d.ts +11 -0
package/dist/src/lib/workflow/worktree-manager.js +40 -41
package/dist/src/lib/worktree-isolation.d.ts +130 -0
package/dist/src/lib/worktree-isolation.js +310 -0
package/package.json +8 -8
package/templates/agents/sequant-explorer.md +23 -0
package/templates/agents/sequant-implementer.md +18 -0
package/templates/agents/sequant-qa-checker.md +24 -0
package/templates/agents/sequant-testgen.md +25 -0
package/templates/scripts/cleanup-worktree.sh +18 -0
package/templates/skills/_shared/references/subagent-types.md +158 -48
package/templates/skills/exec/SKILL.md +72 -6
package/templates/skills/qa/SKILL.md +8 -217
package/templates/skills/spec/SKILL.md +446 -120
package/templates/skills/testgen/SKILL.md +138 -1

package/dist/marketplace/external_plugins/sequant/skills/spec/SKILL.md CHANGED Viewed

@@ -13,7 +13,7 @@ allowed-tools:
   - Bash(gh label:*)
   - Bash(git worktree:*)
   - Bash(git -C:*)
-  - Task(Explore)
+  - Agent(sequant-explorer)
   - AgentOutputTool
 ---
@@ -84,6 +84,11 @@ When called like `/spec <freeform description>`:
 - Effect: Skips the AC Quality Check step
 - Use when: AC are intentionally high-level or you want to defer linting
+**Flag:** `--skip-scope-check`
+- Usage: `/spec 123 --skip-scope-check`
+- Effect: Skips the Scope Assessment step
+- Use when: Issue scope is intentionally complex or you want to defer assessment
 ### AC Extraction and Storage — REQUIRED
 **After fetching the issue body**, extract and store acceptance criteria in workflow state:
@@ -190,6 +195,103 @@ console.log(formatACLintResults(lintResults));
 - Output: `AC Quality Check: Skipped (--skip-ac-lint flag set)`
 - Continue directly to plan generation
+### Scope Assessment — REQUIRED (unless --skip-scope-check)
+**After AC Quality Check**, run scope assessment to detect overscoped issues:
+```bash
+# Run scope assessment (skip if --skip-scope-check flag is set)
+npx tsx -e "
+import { parseAcceptanceCriteria } from './src/lib/ac-parser.ts';
+import { performScopeAssessment, formatScopeAssessment, convertSettingsToConfig } from './src/lib/scope/index.ts';
+import { getSettings } from './src/lib/settings.ts';
+const issueBody = \`<ISSUE_BODY_HERE>\`;
+const issueTitle = '<ISSUE_TITLE>';
+(async () => {
+  const settings = await getSettings();
+  const config = convertSettingsToConfig(settings.scopeAssessment);
+  const criteria = parseAcceptanceCriteria(issueBody);
+  const assessment = performScopeAssessment(criteria, issueBody, issueTitle, config);
+  console.log(formatScopeAssessment(assessment));
+})();
+"
+```
+**Why this matters:**
+- Bundled features (3+ distinct features) should be separate issues
+- Missing non-goals lead to scope creep during implementation
+- High AC counts increase complexity and error rates
+**Scope Metrics:**
+| Metric | Green | Yellow | Red |
+|--------|-------|--------|-----|
+| Feature count | 1 | 2 | 3+ |
+| AC items | 1-5 | 6-8 | 9+ |
+| Directory spread | 1-2 | 3-4 | 5+ |
+**Non-Goals Section:**
+Every `/spec` output MUST include a Non-Goals section. If the issue lacks one, output a warning:
+```markdown
+## Non-Goals
+⚠️ **Non-Goals section not found.** Consider adding scope boundaries.
+Example format:
+- [ ] [Adjacent feature we're deferring]
+- [ ] [Scope boundary we're respecting]
+- [ ] [Future work that's out of scope]
+```
+**Scope Verdicts:**
+| Verdict | Meaning | Action |
+|---------|---------|--------|
+| ✅ SCOPE_OK | Single focused feature | Proceed normally |
+| ⚠️ SCOPE_WARNING | Moderate complexity | Consider narrowing; quality loop auto-enabled |
+| ❌ SCOPE_SPLIT_RECOMMENDED | Multiple features bundled | Strongly recommend splitting |
+**Quality Loop Auto-Enable:**
+If scope verdict is SCOPE_WARNING or SCOPE_SPLIT_RECOMMENDED:
+- Quality loop is automatically enabled
+- Include note in Recommended Workflow section:
+  ```markdown
+  **Quality Loop:** enabled (auto-enabled due to scope concerns)
+  ```
+**If `--skip-scope-check` flag is set:**
+- Output: `Scope Assessment: Skipped (--skip-scope-check flag set)`
+- Continue to plan generation
+**Store in State:**
+After assessment, store results in workflow state for analytics:
+```bash
+npx tsx -e "
+import { StateManager } from './src/lib/workflow/state-manager.ts';
+import { performScopeAssessment, convertSettingsToConfig } from './src/lib/scope/index.ts';
+import { getSettings } from './src/lib/settings.ts';
+(async () => {
+  const settings = await getSettings();
+  const config = convertSettingsToConfig(settings.scopeAssessment);
+  // ... perform assessment with config ...
+  // const assessment = performScopeAssessment(criteria, issueBody, issueTitle, config);
+  const manager = new StateManager();
+  await manager.updateScopeAssessment(issueNumber, assessment);
+})();
+"
+```
 ### Feature Worktree Workflow
 **Planning Phase:** No worktree needed. Planning happens in the main repository directory. The worktree will be created during the execution phase (`/exec`).
@@ -209,21 +311,21 @@ Read(file_path=".sequant/settings.json")
 **Spawn ALL THREE agents in a SINGLE message:**
-1. `Task(subagent_type="Explore", model="haiku", prompt="Find similar features for [FEATURE]. Check components/admin/, lib/queries/, docs/patterns/. Report: file paths, patterns, recommendations.")`
+1. `Agent(subagent_type="sequant-explorer", prompt="Find similar features for [FEATURE]. Check components/admin/, lib/queries/, docs/patterns/. Report: file paths, patterns, recommendations.")`
-2. `Task(subagent_type="Explore", model="haiku", prompt="Explore [CODEBASE AREA] for [FEATURE]. Find: main components, data flow, key files. Report structure.")`
+2. `Agent(subagent_type="sequant-explorer", prompt="Explore [CODEBASE AREA] for [FEATURE]. Find: main components, data flow, key files. Report structure.")`
-3. `Task(subagent_type="Explore", model="haiku", prompt="Inspect database for [FEATURE]. Check: table schema, RLS policies, existing queries. Report findings.")`
+3. `Agent(subagent_type="sequant-explorer", prompt="Inspect database for [FEATURE]. Check: table schema, RLS policies, existing queries. Report findings.")`
 #### If sequential mode (default):
 **Spawn each agent ONE AT A TIME, waiting for each to complete:**
-1. **First:** `Task(subagent_type="Explore", model="haiku", prompt="Find similar features for [FEATURE]. Check components/admin/, lib/queries/, docs/patterns/. Report: file paths, patterns, recommendations.")`
+1. **First:** `Agent(subagent_type="sequant-explorer", prompt="Find similar features for [FEATURE]. Check components/admin/, lib/queries/, docs/patterns/. Report: file paths, patterns, recommendations.")`
-2. **After #1 completes:** `Task(subagent_type="Explore", model="haiku", prompt="Explore [CODEBASE AREA] for [FEATURE]. Find: main components, data flow, key files. Report structure.")`
+2. **After #1 completes:** `Agent(subagent_type="sequant-explorer", prompt="Explore [CODEBASE AREA] for [FEATURE]. Find: main components, data flow, key files. Report structure.")`
-3. **After #2 completes:** `Task(subagent_type="Explore", model="haiku", prompt="Inspect database for [FEATURE]. Check: table schema, RLS policies, existing queries. Report findings.")`
+3. **After #2 completes:** `Agent(subagent_type="sequant-explorer", prompt="Inspect database for [FEATURE]. Check: table schema, RLS policies, existing queries. Report findings.")`
 ### Feature Branch Context Detection
@@ -348,9 +450,9 @@ Before creating the implementation plan, scan for potential conflicts with in-fl
 ## Output Structure
-### 1. AC Checklist with Verification Criteria
+### 1. AC Checklist with Verification Criteria (REQUIRED)
-Restate AC as a checklist with verification for each:
+**Every AC MUST have an explicit Verification Method.** Restate AC as a checklist with verification for each:
 ```markdown
 ### AC-1: [Description]
@@ -369,6 +471,64 @@ Restate AC as a checklist with verification for each:
 - [ ] [Assumption that must be true]
 ```
+#### Verification Method Decision Framework
+**REQUIRED:** Choose the most appropriate verification method for each AC:
+| AC Type | Verification Method | When to Use |
+|---------|---------------------|-------------|
+| Pure logic/calculation | **Unit Test** | Functions with clear input/output, no side effects |
+| API endpoint | **Integration Test** | HTTP handlers, database queries, external service calls |
+| User workflow | **Browser Test** | Multi-step UI interactions, form submissions |
+| Visual appearance | **Manual Test** | Styling, layout, animations (hard to automate) |
+| CLI command | **Integration Test** | Script execution, file operations, stdout verification |
+| Error handling | **Unit Test** + **Integration Test** | Both isolated behavior and realistic scenarios |
+| Performance | **Manual Test** + **Integration Test** | Timing thresholds, load testing |
+#### Verification Method Examples
+**Good (specific and testable):**
+```markdown
+**AC-1:** User can submit the registration form
+**Verification Method:** Browser Test
+**Test Scenario:**
+- Given: User on /register page
+- When: Fill form fields, click Submit
+- Then: Redirect to /dashboard, success toast appears
+```
+**Bad (vague, no clear verification):**
+```markdown
+**AC-1:** Registration should work properly
+**Verification Method:** ??? (cannot determine)
+```
+#### Flags for Missing Verification Methods
+If you cannot determine a verification method for an AC:
+1. **Flag the AC as unclear:**
+   ```markdown
+   **AC-3:** System handles errors gracefully
+   **Verification Method:** ⚠️ UNCLEAR - needs specific error scenarios
+   **Suggested Refinement:** List specific error types and expected responses
+   ```
+2. **Include in Open Questions:**
+   ```markdown
+   ## Open Questions
+   1. **AC-3 verification method unclear**
+      - Question: What specific error scenarios should be tested?
+      - Recommendation: Define 3-5 error types with expected behavior
+      - Impact: Without this, QA cannot objectively validate
+   ```
+**Why this matters:** AC without verification methods:
+- Cannot be objectively validated in `/qa`
+- Lead to subjective "does it work?" assessments
+- Cause rework when expectations don't match implementation
 See [verification-criteria.md](references/verification-criteria.md) for detailed examples including the #452 hooks failure case.
 ### 2. Implementation Plan
@@ -391,142 +551,222 @@ For each major decision:
 See [parallel-groups.md](references/parallel-groups.md) for parallelization format.
-### 3. Plan Review
+### 2.5. Design Review (REQUIRED)
-Ask the user to confirm or adjust:
-- The AC checklist (with verification criteria)
-- The implementation plan
-- The assumptions to validate
-**Do NOT start implementation** - this is planning-only.
+**Purpose:** Make design decisions explicit before implementation starts. This forces evaluation of *how* to build, not just *what* to build — catching wrong-layer, over-engineered, or pattern-mismatched approaches before code is written.
-### 4. Content Analysis (AC-1, AC-2, AC-3, AC-4)
+**Why this matters:** Repeated pattern across issues: implementation passes QA functionally but uses the wrong design — wrong layer, hacky shortcut, over-engineered approach. The fix cycle is expensive. Making design explicit here prevents it.
-**Before** determining the recommended workflow, analyze the issue content for phase-relevant signals:
+**Complexity Scaling:**
+- **Simple issues** (`simple-fix`, `typo`, `docs-only` labels): Abbreviated — answer Q1 and Q3 only
+- **Standard issues**: Answer all 4 questions
+- **Complex issues** (`complex`, `refactor`, `breaking` labels): Answer all 4 questions with detailed rationale
-#### Step 1: Check for Solve Comment (AC-4)
+**Answer these questions:**
-First, check if a `/solve` comment already exists for this issue:
+1. **Where does this logic belong?** — Which module/layer owns this change? Name the specific directory, file, or abstraction layer. If it spans layers, explain why.
-```bash
-# Check issue comments for solve workflow
-gh issue view <issue-number> --json comments --jq '.comments[].body' | grep -l "## Solve Workflow for Issues:" || true
-```
+2. **What's the simplest correct approach?** — Actively reject over-engineering. What's the minimum implementation that satisfies all AC? What alternatives did you consider and reject?
-**If solve comment found:**
-- Extract phases from the solve workflow (e.g., `spec → exec → test → qa`)
-- Use solve recommendations as the primary source (after labels)
-- Skip content analysis for phases (solve already analyzed)
-- Include in output: `"Solve comment found - using /solve workflow recommendations"`
+3. **What existing pattern does this follow?** — Name the specific pattern in the codebase. Confirm it fits this use case. If no existing pattern fits, explain why a new approach is needed.
-#### Step 2: Analyze Title for Keywords (AC-1)
+4. **What would a senior reviewer challenge?** — Anticipate design pushback. What's the most likely "why didn't you just...?" or "this should be in X instead" feedback?
-If no solve comment, analyze the issue title for phase-relevant keywords:
+**Example (standard issue):**
-| Pattern | Detection | Suggested Phase |
-|---------|-----------|-----------------|
-| `extract`, `component` | UI work | Add `/test` |
-| `refactor.*ui`, `ui refactor` | UI work | Add `/test` |
-| `frontend`, `dashboard` | UI work | Add `/test` |
-| `auth`, `permission`, `security` | Security-sensitive | Add `/security-review` |
-| `password`, `credential`, `token` | Security-sensitive | Add `/security-review` |
-| `refactor`, `migration`, `restructure` | Complex work | Enable quality loop |
-| `breaking change` | Complex work | Enable quality loop |
+```markdown
+## Design Review
-#### Step 3: Analyze Body for Patterns (AC-2)
+1. **Where does this logic belong?** Spec skill's SKILL.md prompt files — purely prompt engineering, not TypeScript. Same layer as Feature Quality Planning and AC Checklist sections.
-Analyze the issue body for file references and keywords:
+2. **What's the simplest correct approach?** Add markdown prompt text to SKILL.md. Reuse the existing complexity scaling pattern from Feature Quality Planning. No code, no new files, no abstractions.
-| Pattern | Detection | Suggested Phase |
-|---------|-----------|-----------------|
-| References `.tsx` or `.jsx` files | UI work likely | Add `/test` |
-| References `components/` directory | UI work | Add `/test` |
-| References `scripts/` or `bin/` | CLI work | May need `/verify` |
-| References `auth/` directory | Security-sensitive | Add `/security-review` |
-| References `middleware.ts` | May be auth-related | Consider `/security-review` |
-| Contains "breaking change" | Complex work | Enable quality loop |
+3. **What existing pattern does this follow?** Mirrors Feature Quality Planning exactly: required section, purpose statement, complexity scaling table, question format.
-#### Step 3a: Browser Testing Label Suggestion
+4. **What would a senior reviewer challenge?** (1) "Why after Implementation Plan, not before?" — Design review needs the plan as input to evaluate. (2) "Won't this bloat output?" — Adds ~15 lines for standard, ~5 for simple-fix.
+```
-**When `.tsx` or `.jsx` file references are detected** in the issue body AND the issue does NOT have `ui`, `frontend`, or `admin` labels, include this warning in the spec output:
+**Example (simple-fix issue, abbreviated):**
 ```markdown
-> **Component files detected** — Issue body references `.tsx`/`.jsx` files or `components/` directory, but no `ui`/`frontend`/`admin` label is present.
-> - To enable browser testing: add the `ui` label → `gh issue edit <N> --add-label ui`
-> - To explicitly skip browser testing: add `no-browser-test` label → `gh issue edit <N> --add-label no-browser-test`
-> - Without either label, QA will note the missing browser test coverage.
-```
+## Design Review
-**When NOT to show this warning:**
-- Issue already has `ui`, `frontend`, or `admin` label (browser testing already enabled)
-- Issue has `no-browser-test` label (explicit opt-out)
-- No `.tsx`/`.jsx`/`components/` references detected
+1. **Where does this logic belong?** `src/lib/utils.ts` — utility function, same layer as existing helpers.
+3. **What existing pattern does this follow?** Same pattern as `formatDuration()` in the same file — pure function, no side effects, single responsibility.
+```
-#### Step 4: Merge Signals (AC-3)
+### 3. Feature Quality Planning (REQUIRED)
-Content analysis **supplements** label detection - it can only ADD phases, never remove them.
+**Purpose:** Systematically consider professional implementation requirements beyond the minimum AC. This prevents gaps that slip through exec and QA because they were never planned.
-**Priority order (highest first):**
-1. **Labels** (explicit, highest priority)
-2. **Solve comment** (if exists)
-3. **Title keywords**
-4. **Body patterns** (lowest priority)
+**Why this matters:** Spec currently plans the "minimum to satisfy AC" rather than "complete professional implementation." Gaps found in manual review are omissions from incomplete planning, not failures.
-**Output format:**
+**Complexity Scaling:**
+- **Simple issues** (`simple-fix`, `typo`, `docs-only` labels): Use abbreviated checklist (Completeness + one relevant section)
+- **Standard issues**: Complete all applicable sections
+- **Complex issues** (`complex`, `refactor`, `breaking` labels): Complete all sections with detailed items
 ```markdown
-## Content Analysis
+## Feature Quality Planning
+### Completeness Check
+- [ ] All AC items have corresponding implementation steps
+- [ ] Integration points with existing features identified
+- [ ] No partial implementations or TODOs planned
+- [ ] State management considered (if applicable)
+- [ ] Data flow is complete end-to-end
+### Error Handling
+- [ ] Invalid input scenarios identified
+- [ ] API/external service failures handled
+- [ ] Edge cases documented (empty, null, max values)
+- [ ] Error messages are user-friendly
+- [ ] Graceful degradation planned
+### Code Quality
+- [ ] Types fully defined (no `any` planned)
+- [ ] Follows existing patterns in codebase
+- [ ] Error boundaries where needed
+- [ ] No magic strings/numbers
+- [ ] Consistent naming conventions
+### Test Coverage Plan
+- [ ] Unit tests for business logic
+- [ ] Integration tests for data flow
+- [ ] Edge case tests identified
+- [ ] Mocking strategy appropriate
+- [ ] Critical paths have test coverage
+### Best Practices
+- [ ] Logging for debugging/observability
+- [ ] Accessibility considerations (if UI)
+- [ ] Performance implications considered
+- [ ] Security reviewed (auth, validation, sanitization)
+- [ ] Documentation updated (if behavior changes)
+### Polish (UI features only)
+- [ ] Loading states planned
+- [ ] Error states have UI
+- [ ] Empty states handled
+- [ ] Responsive design considered
+- [ ] Keyboard navigation works
+### Derived ACs
+Based on quality planning, identify additional ACs needed:
+| Source | Derived AC | Priority |
+|--------|-----------|----------|
+| Error Handling | AC-N: Handle [specific error] with [specific response] | High/Medium/Low |
+| Test Coverage | AC-N+1: Add tests for [specific scenario] | High/Medium/Low |
+| Best Practices | AC-N+2: Add logging for [specific operation] | High/Medium/Low |
+**Note:** Derived ACs are numbered sequentially after original ACs and follow the same format.
+```
-### Signal Sources
+**Section Applicability:**
-| Phase | Source | Confidence | Reason |
-|-------|--------|------------|--------|
-| /test | title | high | "Extract component" detected |
-| /security-review | body | medium | References auth/ directory |
+| Issue Type | Sections Required |
+|------------|-------------------|
+| Bug fix | Completeness, Error Handling, Test Coverage |
+| New feature | All sections |
+| Refactor | Completeness, Code Quality, Test Coverage |
+| UI change | All sections including Polish |
+| Backend/API | Completeness, Error Handling, Code Quality, Test Coverage, Best Practices |
+| CLI/Script | Completeness, Error Handling, Test Coverage, Best Practices |
+| Docs only | Completeness only |
-### Merged Recommendations
+**Example (API endpoint feature):**
-**From labels:** /test (ui label)
-**From content:** /security-review (added)
-**Final phases:** spec → exec → test → security-review → qa
+```markdown
+## Feature Quality Planning
+### Completeness Check
+- [x] All AC items have corresponding implementation steps
+- [x] Integration points: Auth middleware, database queries, response serializer
+- [x] No partial implementations planned
+- [ ] State management: N/A (stateless API)
+- [x] Data flow: Request → Validate → Query → Transform → Response
+### Error Handling
+- [x] Invalid input: Return 400 with validation errors
+- [x] Auth failure: Return 401 with "Unauthorized" message
+- [x] Not found: Return 404 with resource ID
+- [x] Server error: Return 500, log full error, return generic message
+- [x] Rate limit: Return 429 with retry-after header
+### Code Quality
+- [x] Types: Define RequestDTO, ResponseDTO, ErrorResponse
+- [x] Patterns: Follow existing controller pattern in `src/api/`
+- [ ] Error boundaries: N/A (API, not UI)
+- [x] No magic strings: Use constants for error messages
+### Test Coverage Plan
+- [x] Unit: Validation logic, data transformation
+- [x] Integration: Full request/response cycle
+- [x] Edge cases: Empty results, max pagination, invalid IDs
+- [x] Mocking: Mock database, not HTTP layer
+### Best Practices
+- [x] Logging: Log request ID, duration, status code
+- [ ] Accessibility: N/A (API)
+- [x] Performance: Add database index for query field
+- [x] Security: Validate input, sanitize output, check auth
+### Derived ACs
+| Source | Derived AC | Priority |
+|--------|-----------|----------|
+| Error Handling | AC-6: Return 429 with retry-after header on rate limit | Medium |
+| Best Practices | AC-7: Log request ID and duration for observability | High |
+| Test Coverage | AC-8: Add integration test for auth failure path | High |
 ```
-### 4.5. Solve Comment Detection (AC-4, AC-5)
+### 4. Plan Review
-**Before making your own phase recommendation**, check if `/solve` has already posted an analysis comment to this issue:
+Ask the user to confirm or adjust:
+- The AC checklist (with verification criteria)
+- The implementation plan
+- The assumptions to validate
+**Do NOT start implementation** - this is planning-only.
+### 4.5. Assess Comment Detection (AC-4, AC-5)
+**Before making your own phase recommendation**, check if `/assess` has already posted an analysis comment to this issue:
 ```bash
-# Check for solve analysis comment on the issue
-solve_comment=$(gh issue view <issue-number> --json comments \
-  --jq '[.comments[].body | select(test("## Solve Analysis|<!-- solve:phases="))] | last // empty')
+# Check for assess analysis comment on the issue (includes legacy /solve format for backward compat)
+assess_comment=$(gh issue view <issue-number> --json comments \
+  --jq '[.comments[].body | select(test("## Assess Analysis|## Solve Analysis|<!-- assess:phases=|<!-- solve:phases="))] | last // empty')
 ```
-**If a solve analysis comment exists:**
+**If an assess analysis comment exists:**
 1. Parse the HTML comment markers to extract the recommended phases:
    ```
-   <!-- solve:phases=exec,qa -->        → phases: ["exec", "qa"]
-   <!-- solve:skip-spec=true -->        → skip spec phase
-   <!-- solve:browser-test=false -->    → no browser testing needed
-   <!-- solve:quality-loop=true -->     → enable quality loop
+   <!-- assess:phases=exec,qa -->        → phases: ["exec", "qa"] (spec not included = skip spec)
+   <!-- assess:browser-test=false -->    → no browser testing needed
+   <!-- assess:quality-loop=true -->     → enable quality loop
    ```
-2. **Use the solve recommendation as your starting point** for the phase recommendation in step 5.
+2. **Use the assess recommendation as your starting point** for the phase recommendation in step 5.
-3. **You may override the solve recommendation**, but you MUST document why:
+3. **You may override the assess recommendation**, but you MUST document why:
    ```markdown
    ## Recommended Workflow
    **Phases:** spec → exec → test → qa
    **Quality Loop:** enabled
-   **Reasoning:** Solve recommended `exec → qa`, but codebase analysis reveals UI components
+   **Reasoning:** Assess recommended `exec → qa`, but codebase analysis reveals UI components
    are affected (found `.tsx` files in change scope), so browser testing is needed.
-   Overriding solve recommendation with explanation.
+   Overriding assess recommendation with explanation.
    ```
-4. If the solve comment recommends `skip-spec=true`, acknowledge this in your output but proceed with spec since `/spec` was explicitly invoked.
+4. If the assess comment recommends phases that don't include `spec` (e.g., `phases=exec,qa`), acknowledge this in your output but proceed with spec since `/spec` was explicitly invoked.
-**If no solve analysis comment exists:** Proceed with your own analysis as normal (step 5).
+**If no assess analysis comment exists:** Proceed with your own analysis as normal (step 5).
 ### 5. Recommended Workflow
@@ -537,20 +777,90 @@ Analyze the issue and recommend the optimal workflow phases:
 **Phases:** spec → exec → qa
 **Quality Loop:** disabled
-**Signal Sources:** [labels | solve | content]
 **Reasoning:** [Brief explanation of why these phases were chosen]
 ```
 **Phase Selection Logic:**
 - **UI/Frontend changes** → Add `test` phase (browser testing)
+- **`no-browser-test` label** → Skip `test` phase (explicit opt-out, overrides UI labels)
 - **Bug fixes** → Skip `spec` if already well-defined
 - **Complex refactors** → Enable quality loop
 - **Security-sensitive** → Add `security-review` phase
 - **Documentation only** → Skip `spec`, just `exec → qa`
+- **New features with testable ACs** → Add `testgen` phase after spec
+- **Refactors needing regression tests** → Add `testgen` phase
+#### Browser Testing Label Suggestion
+**When `.tsx` or `.jsx` file references are detected** in the issue body AND the issue does NOT have `ui`, `frontend`, or `admin` labels, include this warning in the spec output:
+```markdown
+> **Component files detected** — Issue body references `.tsx`/`.jsx` files or `components/` directory, but no `ui`/`frontend`/`admin` label is present.
+> - To enable browser testing: add the `ui` label → `gh issue edit <N> --add-label ui`
+> - To explicitly skip browser testing: add `no-browser-test` label → `gh issue edit <N> --add-label no-browser-test`
+> - Without either label, QA will note the missing browser test coverage.
+```
+**When NOT to show this warning:**
+- Issue already has `ui`, `frontend`, or `admin` label (browser testing already enabled)
+- Issue has `no-browser-test` label (explicit opt-out)
+- No `.tsx`/`.jsx`/`components/` references detected
+#### Testgen Phase Auto-Detection
+**When to recommend `testgen` phase:**
+| Condition | Recommend testgen? | Reasoning |
+|-----------|-------------------|-----------|
+| ACs have "Unit Test" verification method | ✅ Yes | Tests should be stubbed before implementation |
+| ACs have "Integration Test" verification method | ✅ Yes | Complex integration tests benefit from early structure |
+| Issue is a new feature (not bug fix) with >2 AC items | ✅ Yes | Features need test coverage |
+| Issue has `enhancement` or `feature` label | ✅ Yes | New functionality needs tests |
+| Project has test framework (Jest, Vitest, etc.) | ✅ Yes | Infrastructure exists to run tests |
+| Issue is a simple bug fix (`bug` label only) | ❌ No | Bug fixes typically have targeted tests |
+| Issue is docs-only (`docs` label) | ❌ No | Documentation doesn't need unit tests |
+| All ACs have "Manual Test" or "Browser Test" verification | ❌ No | These don't generate code stubs |
+**Detection Logic:**
+1. **Check verification methods in AC items:**
+   - Count ACs with "Unit Test" → If >0, recommend testgen
+   - Count ACs with "Integration Test" → If >0, recommend testgen
+2. **Check issue labels:**
+   ```bash
+   gh issue view <issue> --json labels --jq '.labels[].name'
+   ```
+   - If `bug` or `fix` is the ONLY label → Skip testgen
+   - If `docs` is present → Skip testgen
+   - If `enhancement`, `feature`, `refactor` → Consider testgen
+3. **Check project test infrastructure:**
+   ```bash
+   # Check for test framework in package.json
+   grep -E "jest|vitest|mocha" package.json || true
+   ```
+   - If no test framework detected → Skip testgen (no infrastructure)
+**Example output when testgen is recommended:**
+```markdown
+## Recommended Workflow
+**Phases:** spec → testgen → exec → qa
+**Quality Loop:** disabled
+**Reasoning:** ACs include Unit Test verification methods; testgen will create stubs before implementation
+```
+**Example output when testgen is NOT recommended:**
+```markdown
+## Recommended Workflow
-**Content Analysis Integration:**
-- Include content-detected phases in the workflow
-- Note signal source in reasoning (e.g., "Added /test based on title keyword 'extract component'")
+**Phases:** spec → exec → qa
+**Quality Loop:** disabled
+**Reasoning:** Bug fix with targeted scope; existing tests sufficient
+```
 ### 6. Label Review
@@ -650,16 +960,26 @@ npx tsx scripts/state/update.ts fail <issue-number> spec "Error description"
 **Before responding, verify your output includes ALL of these:**
 - [ ] **AC Quality Check** - Lint results (or "Skipped" if --skip-ac-lint)
+- [ ] **Scope Assessment** - Verdict and metrics (or "Skipped" if --skip-scope-check)
+- [ ] **Non-Goals Section** - Listed or warning if missing
 - [ ] **AC Checklist** - Numbered AC items (AC-1, AC-2, etc.) with descriptions
-- [ ] **Verification Criteria** - Each AC has Verification Method and Test Scenario
+- [ ] **Verification Criteria (REQUIRED)** - Each AC MUST have:
+  - Explicit Verification Method (Unit Test, Integration Test, Browser Test, or Manual Test)
+  - Test Scenario with Given/When/Then format
+  - If unclear, flag as "⚠️ UNCLEAR" and add to Open Questions
 - [ ] **Conflict Risk Analysis** - Check for in-flight work, include if conflicts found
 - [ ] **Implementation Plan** - 3-7 concrete steps with codebase references
-- [ ] **Content Analysis** - Title/body analysis results (or "Solve comment found" if using /solve)
-- [ ] **Recommended Workflow** - Phases, Quality Loop setting, Signal Sources, and Reasoning
+- [ ] **Design Review** - All 4 questions answered (abbreviated to Q1+Q3 for simple-fix/typo/docs-only labels)
+- [ ] **Feature Quality Planning** - Quality dimensions checklist completed (abbreviated for simple-fix/typo/docs-only labels)
+- [ ] **Recommended Workflow** - Phases, Quality Loop setting, and Reasoning (auto-enable quality loop if scope is yellow/red)
 - [ ] **Label Review** - Current vs recommended labels based on plan analysis
-- [ ] **Open Questions** - Any ambiguities with recommended defaults
+- [ ] **Open Questions** - Any ambiguities with recommended defaults (including unclear verification methods)
 - [ ] **Issue Comment Draft** - Formatted for GitHub posting
+**CRITICAL:** Do NOT output AC items without verification methods. Either:
+1. Assign a verification method from the decision framework, or
+2. Flag as "⚠️ UNCLEAR" and include in Open Questions
 **DO NOT respond until all items are verified.**
 ## Output Template
@@ -673,6 +993,26 @@ You MUST include these sections in order:
 ---
+## Scope Assessment
+### Non-Goals (Required)
+[List non-goals from issue, or warning if missing]
+### Scope Metrics
+| Metric | Value | Status |
+|--------|-------|--------|
+| Feature count | [N] | [✅/⚠️/❌] |
+| AC items | [N] | [✅/⚠️/❌] |
+| Directory spread | [N] | [✅/⚠️/❌] |
+### Scope Verdict
+[✅/⚠️/❌] **[SCOPE_OK/SCOPE_WARNING/SCOPE_SPLIT_RECOMMENDED]** - [Recommendation]
+---
 ## Acceptance Criteria
 ### AC-1: [Description]
@@ -685,6 +1025,20 @@ You MUST include these sections in order:
 - Then: [Expected outcome]
 ### AC-2: [Description]
+**Verification Method:** [Choose from decision framework]
+**Test Scenario:**
+- Given: [Initial state]
+- When: [Action]
+- Then: [Expected outcome]
+### AC-N: [Unclear AC example]
+**Verification Method:** ⚠️ UNCLEAR - [reason why verification is unclear]
+**Suggested Refinement:** [How to make this AC testable]
 <!-- Continue for all AC items -->
 ---
@@ -700,31 +1054,63 @@ You MUST include these sections in order:
 ---
-## Open Questions
+## Design Review
-1. **[Question]**
-   - Recommendation: [Default choice]
-   - Impact: [What happens if wrong]
+1. **Where does this logic belong?** [Module/layer that owns this change]
+2. **What's the simplest correct approach?** [Minimum implementation, rejected alternatives]
+3. **What existing pattern does this follow?** [Named pattern, confirmation it fits]
+4. **What would a senior reviewer challenge?** [Anticipated pushback]
+<!-- For simple-fix/typo/docs-only: only Q1 and Q3 required -->
 ---
-## Content Analysis
+## Feature Quality Planning
-<!-- If solve comment found: -->
-**Source:** Solve comment found - using /solve workflow recommendations
+### Completeness Check
+- [ ] All AC items have corresponding implementation steps
+- [ ] Integration points identified
+- [ ] No partial implementations planned
-<!-- If no solve comment, show analysis: -->
-### Signal Sources
+### Error Handling
+- [ ] Invalid input scenarios identified
+- [ ] External service failures handled
+- [ ] Edge cases documented
-| Phase | Source | Confidence | Reason |
-|-------|--------|------------|--------|
-| /test | title | high | "[matched keyword]" detected |
-| /security-review | body | medium | References [pattern] |
+### Code Quality
+- [ ] Types fully defined (no `any`)
+- [ ] Follows existing patterns
+- [ ] No magic strings/numbers
-### Merged Recommendations
+### Test Coverage Plan
+- [ ] Unit tests for business logic
+- [ ] Edge case tests identified
+- [ ] Critical paths covered
-**From labels:** [label-detected phases]
-**From content:** [content-detected phases]
+### Best Practices
+- [ ] Logging for observability
+- [ ] Security reviewed
+- [ ] Documentation updated
+### Polish (UI only)
+- [ ] Loading/error/empty states
+- [ ] Responsive design
+### Derived ACs
+| Source | Derived AC | Priority |
+|--------|-----------|----------|
+| [Section] | AC-N: [Description] | High/Medium/Low |
+---
+## Open Questions
+1. **[Question]**
+   - Recommendation: [Default choice]
+   - Impact: [What happens if wrong]
 ---
@@ -732,7 +1118,6 @@ You MUST include these sections in order:
 **Phases:** exec → qa
 **Quality Loop:** disabled
-**Signal Sources:** [labels | solve | content]
 **Reasoning:** [Why these phases based on issue analysis]
 ---