npm - all-for-claudecode - Versions diffs - 2.10.0 → 2.12.0 - Mend

all-for-claudecode 2.10.0 → 2.12.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (32)

package/.claude-plugin/marketplace.json +2 -2
package/.claude-plugin/plugin.json +1 -1
package/MIGRATION.md +2 -2
package/README.md +12 -4
package/bin/cli.mjs +1 -0
package/package.json +1 -1
package/scripts/afc-consistency-check.sh +8 -6
package/scripts/afc-doctor.sh +18 -4
package/scripts/session-start-context.sh +1 -1
package/skills/analyze/SKILL.md +10 -8
package/skills/architect/SKILL.md +4 -4
package/skills/auto/SKILL.md +664 -93
package/skills/clarify/SKILL.md +4 -3
package/skills/clean/SKILL.md +17 -16
package/skills/consult/SKILL.md +19 -18
package/skills/debug/SKILL.md +1 -1
package/skills/doctor/SKILL.md +23 -19
package/skills/implement/SKILL.md +36 -23
package/skills/init/SKILL.md +24 -177
package/skills/learner/SKILL.md +4 -4
package/skills/plan/SKILL.md +1 -1
package/skills/pr-comment/SKILL.md +4 -4
package/skills/principles/SKILL.md +1 -1
package/skills/qa/SKILL.md +2 -2
package/skills/release-notes/SKILL.md +8 -4
package/skills/review/SKILL.md +12 -12
package/skills/security/SKILL.md +19 -4
package/skills/setup/SKILL.md +217 -0
package/skills/spec/SKILL.md +2 -2
package/skills/tasks/SKILL.md +4 -4
package/skills/test/SKILL.md +1 -1
package/skills/triage/SKILL.md +7 -8

package/skills/auto/SKILL.md CHANGED

@@ -10,6 +10,7 @@ argument-hint: "[feature description in natural language]"
 > Tasks are generated automatically at implement start (no separate tasks phase).
 > Critic Loop runs at each phase with unified safety cap (5). Convergence terminates early when quality is sufficient.
 > Pre-implementation gates (clarify, TDD pre-gen, blast-radius) run conditionally within the implement phase.
+> **Skill Advisor**: 5 checkpoints (A–E) at phase boundaries dynamically invoke auxiliary skills (ideate, consult, architect, security, analyze, test, qa, learner) based on signal detection. Budget-controlled (max 5 per pipeline).
 ## Arguments
@@ -35,9 +36,59 @@ If config file is missing:
 ---
+## Skill Advisor System
+> Auxiliary skills (ideate, consult, architect, security, analyze, test, qa, learner) are dynamically invoked at phase boundaries based on **intent-based evaluation**. Each checkpoint uses LLM semantic judgment — not keyword counting — to determine whether auxiliary skills would add value.
+### Core Principle: Intent-Based Evaluation
+Each checkpoint contains a **structured evaluation prompt** that the orchestrator answers by reading the actual artifact content (not scanning for keywords). The evaluation produces a 1–5 score per signal. Score >= 3 triggers the corresponding skill.
+**Why not keywords**: Keyword matching produces false positives (e.g., "token" in CSS vs auth context) and misses implicit intent (e.g., "user upload feature" implies security concerns without mentioning "XSS"). The orchestrator is an LLM — it should use semantic understanding.
+### Execution Modes
+| Mode | Description | Context cost | Example |
+|------|-------------|-------------|---------|
+| **Transform** | Skill output **replaces or restructures** the next phase's input | High (blocking) | ideate → $ARGUMENTS restructured |
+| **Enrich** | Skill output **appends context** to the next phase's input | Low (fork/Task) | consult → domain constraints section added |
+| **Observe** | Skill output is **metadata only** (logged, flags set) | Low (fork) | qa → quality score recorded |
+### Budget Control
+| Constraint | Limit | Rationale |
+|-----------|-------|-----------|
+| Per checkpoint | max 2 skills | Phase transition delay cap |
+| Pipeline total | max 5 auxiliary invocations | Total execution time cap |
+| Transform mode | max 1 per pipeline | Main context pollution prevention |
+| Concurrent fork | max 3 per checkpoint | Agent resource limit |
+Track auxiliary invocations in `ADVISOR_COUNT` (starts at 0, increments per invocation). If `ADVISOR_COUNT >= 5`, skip remaining checkpoints. Transform invocations tracked in `ADVISOR_TRANSFORM_USED` (boolean). **Every increment** must persist to pipeline state: `afc_state_write "advisorCount" "$ADVISOR_COUNT"`. On context recovery, restore from state: `ADVISOR_COUNT = afc_state_read "advisorCount"`.
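Reviewer note: the budget rules added in this hunk can be read as a small admission check. The sketch below is illustrative Python, not code from the package. `AdvisorBudget`, `select`, and `record` are hypothetical names, and persistence via `afc_state_write` is deliberately left out. It assumes a candidate is a `(skill, mode, score)` tuple and that the Transform slot is only consumed once an invocation succeeds, as Checkpoint A describes for a failed `ideate`.

```python
from dataclasses import dataclass

PIPELINE_MAX = 5     # max auxiliary invocations per pipeline
CHECKPOINT_MAX = 2   # max skills per checkpoint
SCORE_THRESHOLD = 3  # score >= 3 triggers the skill


@dataclass
class AdvisorBudget:
    """Hypothetical in-memory stand-in for ADVISOR_COUNT / ADVISOR_TRANSFORM_USED."""

    advisor_count: int = 0
    transform_used: bool = False

    def select(self, candidates):
        """Return the (skill, mode) pairs allowed to run at one checkpoint.

        Low-scoring candidates are skipped, then the per-checkpoint,
        per-pipeline, and single-Transform limits are applied in order.
        """
        chosen = []
        transform_reserved = self.transform_used
        for skill, mode, score in candidates:
            if score < SCORE_THRESHOLD:
                continue
            if len(chosen) >= CHECKPOINT_MAX:
                break
            if self.advisor_count + len(chosen) >= PIPELINE_MAX:
                break
            if mode == "transform":
                if transform_reserved:
                    continue  # only one Transform per pipeline
                transform_reserved = True
            chosen.append((skill, mode))
        return chosen

    def record(self, mode):
        """Count one successful invocation; the real skill also persists it."""
        self.advisor_count += 1
        if mode == "transform":
            self.transform_used = True
```

Splitting `select` from `record` keeps a failed Transform from burning the one-per-pipeline slot, which matches the "Do NOT set `ADVISOR_TRANSFORM_USED = true`" rule for a failed ideate run.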
+### Expert Agent Routing
+When a checkpoint determines that domain expertise is needed, route to the appropriate expert agent:
+| Domain | Agent ID | When to route |
+|--------|----------|---------------|
+| backend | `afc-backend-expert` | API design, database schema, server architecture, auth flows |
+| infra | `afc-infra-expert` | Deployment, CI/CD, cloud infrastructure, containerization, scaling |
+| pm | `afc-pm-expert` | Product decisions, user stories, prioritization, metrics |
+| design | `afc-design-expert` | UI/UX, accessibility, component design, visual hierarchy |
+| marketing | `afc-marketing-expert` | SEO, analytics, growth, conversion optimization |
+| legal | `afc-legal-expert` | Privacy regulations, licensing, compliance, data protection |
+| security | `afc-appsec-expert` | Application security, vulnerability patterns, secure coding |
+| advisor | `afc-tech-advisor` | Technology selection, library comparison, stack decisions |
+Route based on **what expertise the feature actually needs**, not keyword presence. Consider the project's `{config.architecture}` and tech stack — skip domains irrelevant to the project.
+**Agent ID lookup**: Use the Agent ID column directly as the `subagent_type` value (e.g., `subagent_type: "afc:afc-backend-expert"`). Do NOT construct agent names from the domain name — `security` maps to `afc-appsec-expert` (not `afc-security-expert`), and `advisor` maps to `afc-tech-advisor` (not `afc-advisor-expert`).
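Reviewer note: the lookup rule above amounts to a fixed table rather than string construction. A minimal sketch in Python, for illustration only: the mapping is copied from the routing table in this hunk, the `afc:` prefix is taken from the `subagent_type` example in the same paragraph, and the function name `subagent_type` is hypothetical.

```python
# Domain -> Agent ID, copied from the Expert Agent Routing table.
AGENT_IDS = {
    "backend": "afc-backend-expert",
    "infra": "afc-infra-expert",
    "pm": "afc-pm-expert",
    "design": "afc-design-expert",
    "marketing": "afc-marketing-expert",
    "legal": "afc-legal-expert",
    "security": "afc-appsec-expert",
    "advisor": "afc-tech-advisor",
}


def subagent_type(domain: str) -> str:
    """Return the `afc:`-prefixed subagent type for a routing-table domain.

    Unknown domains raise KeyError instead of producing a guessed name.
    """
    return f"afc:{AGENT_IDS[domain]}"
```

Failing on an unknown domain, rather than falling back to an `afc-{domain}-expert` pattern, is what rules out the two constructed names the text warns about.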
+---
 ## Critic Loop Rules (common to all phases)
-> **Always** read `${CLAUDE_PLUGIN_ROOT}/docs/critic-loop-rules.md` first and follow it.
+> **Always** read `${CLAUDE_SKILL_DIR}/../../docs/critic-loop-rules.md` first and follow it.
 > Core: minimum 1 concern per criterion + mandatory Adversarial failure scenario each pass + quantitative evidence required. "PASS" as a single word is prohibited. Uses convergence-based termination with 4 verdicts (PASS/FAIL/ESCALATE/DEFER). On ESCALATE: pause and present options to user even in auto mode.
 ---
@@ -51,20 +102,25 @@ If config file is missing:
 3. Determine feature name (2-3 keywords → kebab-case)
 3.5. **Preflight Check**:
    ```bash
-   "${CLAUDE_PLUGIN_ROOT}/scripts/afc-preflight-check.sh"
+   "${CLAUDE_SKILL_DIR}/../../scripts/afc-preflight-check.sh"
    ```
    - If exit 1 (hard failure) → print error and **abort**
    - If warnings only (exit 0) → print warnings and continue
 4. **Activate Pipeline Flag** (hook integration):
    ```bash
-   "${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" start {feature}
+   "${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" start {feature}
    ```
    - Safety Snapshot created automatically (`afc/pre-auto` git tag)
    - Stop Gate Hook activated (blocks response termination on CI failure)
    - File change tracking started
-   - Timeline log: `"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" log pipeline-start "Auto pipeline: {feature}"`
+   - Timeline log: `"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" log pipeline-start "Auto pipeline: {feature}"`
 5. Create `.claude/afc/specs/{feature}/` directory → **record path as `PIPELINE_ARTIFACT_DIR`** (for Clean scope)
-6. Start notification:
+6. **Initialize Skill Advisor**: `ADVISOR_COUNT = 0`, `ADVISOR_TRANSFORM_USED = false`. Persist to pipeline state for context-loss resilience:
+   ```bash
+   afc_state_write "advisorCount" "0"
+   afc_state_write "advisorTransformUsed" "false"
+   ```
+7. Start notification:
    ```
    Auto pipeline started: {feature}
    ├─ Clarify? → 1/5 Spec → 2/5 Plan → 3/5 Implement → 4/5 Review → 5/5 Clean
@@ -78,7 +134,7 @@ Before investing pipeline resources, evaluate whether the request warrants execu
 1. **Necessity check**: Explore codebase for existing implementations related to `$ARGUMENTS`.
    - If the feature substantially exists → ask user via AskUserQuestion:
      - "This feature appears to already exist at {path}. (1) Enhance existing (2) Replace entirely (3) Abort"
-   - If user chooses abort → release pipeline flag (`"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" end`), end with: `"Pipeline aborted — feature already exists."`
+   - If user chooses abort → release pipeline flag (`"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" end`), end with: `"Pipeline aborted — feature already exists."`
 2. **Scope check**: Estimate the scope of `$ARGUMENTS`:
    - If description implies 10+ files or multiple unrelated concerns → warn:
@@ -115,8 +171,8 @@ If all checks pass, proceed to Phase 0.8.
 3. If change touches > 2 files OR modifies any `.sh` script: **rollback fast-path changes** (`git reset --hard afc/pre-auto`), then restart with full pipeline
 4. **Checkpoint**:
    ```bash
-   "${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" phase fast-path
-   "${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" ci-pass
+   "${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" phase fast-path
+   "${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" ci-pass
    ```
 5. Run `/afc:review` logic inline (mini-review only — single Critic pass)
 6. Run Phase 5 Clean logic (artifact cleanup, CI gate, pipeline flag release)
@@ -131,7 +187,7 @@ If all checks pass, proceed to Phase 0.8.
 ### Phase 0.5: Auto-Clarify Gate (conditional)
-`"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" phase clarify`
+`"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" phase clarify`
 **Trigger condition**: Score `$ARGUMENTS` on 5 ambiguity signals. If score >= 3, trigger clarification.
@@ -152,9 +208,73 @@ If all checks pass, proceed to Phase 0.8.
 **If score < 3** (clear): skip silently, proceed to Phase 1.
+### Skill Advisor Checkpoint A (Pre-Spec)
+> Evaluate auxiliary skill triggers BEFORE entering Phase 1. Budget: max 2 skills, max 1 Transform. Skip all if `ADVISOR_COUNT >= 5`.
+**Intent evaluation** — Read `$ARGUMENTS` and answer these questions semantically (not by keyword scanning):
+| # | Question | Score 1–5 | If >= 3 | Skill | Mode |
+|---|----------|-----------|---------|-------|------|
+| A1 | Is this request at the **idea/vision level** rather than a concrete feature? (e.g., "make onboarding better" vs "add email verification to signup flow") Does it lack specific technical scope — no file paths, no API endpoints, no component names? | 1=concrete spec-ready, 5=pure vision | `ideate` | Transform |
+| A2 | Does implementing this feature require **specialized domain knowledge** that a generalist developer wouldn't have? Consider: regulatory requirements, industry-specific patterns, domain-specific anti-patterns, compliance rules. Which domain from the Expert Agent Routing table would add the most value? | 1=general programming, 5=deep domain expertise essential | `consult({domain})` | Enrich (fork) |
+**If A1 >= 3** (Transform — skip if `ADVISOR_TRANSFORM_USED`):
+1. Execute `/afc:ideate` inline with `$ARGUMENTS`
+2. If ideate fails or produces no output:
+   - Do NOT set `ADVISOR_TRANSFORM_USED = true`
+   - Proceed with original `$ARGUMENTS`
+   - Log: `"Skill Advisor [A]: ideate failed, proceeding with original input"`
+3. On success: read generated `ideate.md` → extract `## Problem Statement` + `## Value Proposition` + `## Core Features (MoSCoW)` sections
+4. Construct enriched spec input:
+   ```
+   SPEC_INPUT = "$ARGUMENTS
+   ## Ideation Context (auto-generated)
+   {extracted Problem Statement section}
+   {extracted Value Proposition section}
+   {extracted Core Features (MoSCoW) — Must Have items only}"
+   ```
+5. Replace `$ARGUMENTS` with `SPEC_INPUT` for Phase 1
+6. Set `ADVISOR_TRANSFORM_USED = true`, increment `ADVISOR_COUNT`, persist: `afc_state_write "advisorCount" "$ADVISOR_COUNT"` and `afc_state_write "advisorTransformUsed" "true"`
+7. Progress: `  ├─ Skill Advisor [A]: ideate (score: {N}/5, input restructured from idea to structured brief)`
+**If A2 >= 3** (Enrich):
+1. Determine which domain from Expert Agent Routing table best matches the **actual expertise gap** (not keyword presence)
+2. Verify domain relevance: does this project's `{config.architecture}` and tech stack make this domain applicable? (e.g., skip `design` for a CLI tool, skip `infra` if the project has no deployment config)
+3. Invoke expert agent (look up the agent-id from the Expert Agent Routing table — do NOT construct from domain name):
+   ```
+   Task("Domain pre-consultation: {domain}", subagent_type: "afc:{agent-id-from-routing-table}",
+     prompt: "You are being consulted automatically during pipeline spec preparation.
+     ## Feature Context
+     {$ARGUMENTS}
+     ## Why You Were Consulted
+     {1-sentence explanation of what domain expertise gap was identified}
+     ## Instructions
+     1. Read your MEMORY.md for prior project context
+     2. Read .claude/afc/project-profile.md if it exists
+     3. Provide domain-specific constraints, regulations, and anti-patterns that MUST be reflected in the spec
+     4. Format your response EXACTLY as:
+        ## Domain Constraints ({domain})
+        - [MUST] {constraint}: {rationale}
+        - [MUST NOT] {anti-pattern}: {risk}
+        - [CONSIDER] {best practice}: {benefit}
+     5. Keep to max 10 items. Prioritize by risk severity.
+     6. Update your MEMORY.md with the consultation context")
+   ```
+4. Store output as `DOMAIN_CONSTRAINTS` → injected into Phase 1 spec context
+5. Spec phase MUST include a `## Domain Constraints` section reflecting these items
+6. Increment `ADVISOR_COUNT`
+7. Progress: `  ├─ Skill Advisor [A]: consult({domain}) (score: {N}/5, {M} constraints injected)`
+**If all scores < 3**: proceed silently to Phase 1.
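Reviewer note: steps 3 to 5 of the A1 branch (extract sections from `ideate.md`, build `SPEC_INPUT`) could look roughly like the Python sketch below. This is an assumption-laden illustration, not the package's code: `extract_section` and `build_spec_input` are hypothetical helpers, sections are assumed to be delimited by `## ` headings, and the Core Features section is passed through whole because the diff does not show how Must Have items are laid out inside it.

```python
import re

# Section headings named in step 3 of the A1 branch.
HEADINGS = ("Problem Statement", "Value Proposition", "Core Features (MoSCoW)")


def extract_section(markdown: str, heading: str) -> str:
    """Return the `## heading` section (heading line included), or "" if absent."""
    captured = []
    capturing = False
    for line in markdown.splitlines():
        match = re.match(r"^(#{1,2}) (.*)$", line)
        if match:
            # Any level-1 or level-2 heading ends the current section.
            capturing = match.group(1) == "##" and match.group(2).strip() == heading
        if capturing:
            captured.append(line)
    return "\n".join(captured).strip()


def build_spec_input(arguments: str, ideate_md: str) -> str:
    """Append extracted ideation sections to the original arguments.

    Assumption: Must Have filtering is omitted; the whole section is kept.
    """
    sections = [extract_section(ideate_md, h) for h in HEADINGS]
    sections = [s for s in sections if s]
    if not sections:
        return arguments  # nothing usable: proceed with the original input
    return "\n".join([arguments, "## Ideation Context (auto-generated)", *sections])
```

Returning the original arguments when no section is found mirrors the fallback in step 2, where a failed or empty ideate run leaves `$ARGUMENTS` unchanged.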
 ### Phase 1: Spec (1/5)
-`"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" phase spec`
+`"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" phase spec`
 Execute `/afc:spec` logic inline:
@@ -178,18 +298,100 @@ Execute `/afc:spec` logic inline:
    - MEASURABILITY: are success criteria measurable, not subjective? **Is quantitative evidence provided for numerical targets?**
    - INDEPENDENCE: are implementation details (code, library names) absent from the spec?
    - EDGE_CASES: are at least 2 identified? Any missing boundary conditions?
+   - TESTABILITY: Does every System Requirement follow one of the 5 EARS patterns (WHEN/WHILE/IF/WHERE/SHALL)? Does each EARS requirement have a mapped TC (`→ TC: should_...`)? If not → FAIL and auto-fix: rewrite to EARS + generate TC mapping.
    - FAIL → auto-fix and continue. ESCALATE → pause, present options, resume after response. DEFER → record reason, mark clean.
 7. **Checkpoint**: phase transition already recorded by `afc-pipeline-manage.sh phase spec` at phase start
 8. Progress: `✓ 1/5 Spec complete (US: {N}, FR: {N}, researched: {N}, Critic: converged ({N} passes, {M} fixes, {E} escalations))`
+### Skill Advisor Checkpoint B (Post-Spec)
+> Evaluate auxiliary skill triggers AFTER spec completion, BEFORE plan creation. Budget: max 2 skills. Skip all if `ADVISOR_COUNT >= 5`.
+**Intent evaluation** — Read the completed spec.md and answer these questions:
+| # | Question | Score 1–5 | If >= 3 | Skill | Mode |
+|---|----------|-----------|---------|-------|------|
+| B1 | Does this feature **handle, store, or transmit sensitive data or trust boundaries**? Consider: user authentication/authorization, cryptographic operations, PII/financial data processing, external input that reaches internal systems, session/token lifecycle. Judge by the feature's actual behavior, not by whether security-related words appear. | 1=no trust boundary touched, 5=core security feature | `security` | Enrich (fork) |
+| B2 | Does this feature **cross multiple architectural boundaries** or introduce a new structural pattern? Consider: does it touch 3+ layers (e.g., API + service + data + external), create a new component type not seen in the codebase, or require coordination between independently-deployable units? | 1=single-layer change, 5=cross-cutting architectural change | `architect` | Enrich (fork) |
+**If B1 >= 3** (Enrich):
+1. Invoke security agent for pre-plan threat modeling:
+   ```
+   Task("Threat Model: {feature}", subagent_type: "afc:afc-security",
+     prompt: "Generate a threat model BEFORE implementation planning begins.
+     ## Spec Summary
+     {spec.md FR/NFR/Key Entities — security-relevant items only}
+     ## Why This Was Triggered
+     {1-sentence explanation of which trust boundary or sensitive data flow was identified}
+     ## Instructions
+     1. Read your MEMORY.md for known vulnerability patterns in this project
+     2. Identify attack surfaces from the spec requirements
+     3. For each threat, specify the mitigation that MUST appear in the plan
+     4. Format your response EXACTLY as:
+        ## Threat Model (pre-scan)
+        | Threat | Attack Surface | Mitigation Required | Priority |
+        |--------|---------------|-------------------|----------|
+     5. Max 8 threats. Prioritize by exploitability and impact.")
+   ```
+2. Store output as `THREAT_MODEL` → injected into Phase 2 plan context
+3. Plan phase MUST address each mitigation in its Risk & Mitigation section
+4. Plan Critic RISK criterion MUST verify: `{M}/{N} threat mitigations addressed`
+5. Increment `ADVISOR_COUNT`
+6. Progress: `  ├─ Skill Advisor [B]: security (score: {N}/5, threat model: {M} threats identified)`
+**If B2 >= 3** (Enrich):
+1. Invoke architect agent for pre-plan guidance:
+   ```
+   Task("Architecture Advisory: {feature}", subagent_type: "afc:afc-architect",
+     prompt: "Provide architecture guidance BEFORE plan creation.
+     ## Spec Summary
+     {spec.md Key Entities + layer analysis from {config.architecture}}
+     ## Why This Was Triggered
+     {1-sentence explanation of which architectural boundary crossing was identified}
+     ## Instructions
+     1. Read your MEMORY.md for prior ADRs and architecture patterns
+     2. Recommend: component placement, layer boundaries, interface contracts
+     3. Flag conflicts with existing architecture patterns
+     4. Format your response EXACTLY as:
+        ## Architecture Advisory (pre-plan)
+        - [PLACE] {component} → {layer/module}: {rationale}
+        - [BOUNDARY] {interface}: {contract description}
+        - [CONFLICT] {existing} ↔ {new}: {resolution recommendation}
+        - [PATTERN] {recommended pattern}: {why it fits}
+     5. Max 10 items.
+     6. Update your MEMORY.md if new patterns are identified")
+   ```
+2. Store output as `ARCH_ADVISORY` → injected into Phase 2 plan context
+3. Plan Critic ARCHITECTURE criterion MUST validate against this advisory
+4. Increment `ADVISOR_COUNT`
+5. Progress: `  ├─ Skill Advisor [B]: architect (score: {N}/5, advisory: {M} recommendations, {K} conflicts)`
+**If both B1 and B2 >= 3**: launch both agents in a **single message** (parallel fork). Both count toward budget. After both return:
+1. Apply `THREAT_MODEL` to plan Risk & Mitigation section
+2. Apply `ARCH_ADVISORY` to plan Architecture Decision section
+3. If security mitigations conflict with architecture proposals (e.g., "encrypt at rest" vs "use in-memory cache") → **ESCALATE** to user with conflict details
+**If all scores < 3**: proceed silently to Phase 2.
 ### Phase 2: Plan (2/5)
-`"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" phase plan`
+`"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" phase plan`
 Execute `/afc:plan` logic inline:
 1. Load spec.md
-2. If technical uncertainties exist → auto-resolve via WebSearch/code exploration → create research.md
+2. **Research (ReWOO pattern, if needed)**:
+   Extract technical uncertainties from spec.md (libraries/APIs not yet used, unverified performance requirements, unclear integration approach). If no uncertain items: skip.
+   If there are uncertain items, follow the 3-step ReWOO flow:
+   - **Step 1 — Plan**: List all research topics as a numbered list (NO execution yet): `1. {topic} — {what we need to know}`
+   - **Step 2 — Execute**: If topics are independent → launch parallel Task() calls in a **single message**: `Task("Research: {topic1}", subagent_type: "general-purpose")`. If a topic depends on another's result → execute sequentially. For 1-2 topics → resolve directly via WebSearch/codebase exploration (no delegation).
+   - **Step 3 — Solve**: Collect all results and record in `.claude/afc/specs/{feature}/research.md` with: Decision, Rationale, Alternatives, Source per topic.
 3. **Memory loading** (skip gracefully if directories are empty or absent):
    - **Quality history**: if `.claude/afc/memory/quality-history/*.json` exists, load the **most recent 10 files** (sorted by filename descending) and display trend summary: "Last {N} pipelines: avg critic_fixes {X}, avg ci_failures {Y}, avg escalations {Z}". Use trends to inform plan risk assessment.
    - **Decisions**: if `.claude/afc/memory/decisions/` exists, load the **most recent 30 files** (sorted by filename descending) and check for conflicts with the current feature's design direction. Flag any contradictions.
@@ -248,35 +450,140 @@ Execute `/afc:plan` logic inline:
 9. **Checkpoint**: phase transition already recorded by `afc-pipeline-manage.sh phase plan` at phase start
 10. Progress: `✓ 2/5 Plan complete (Critic: converged ({N} passes, {M} fixes, {E} escalations), files: {N}, ADR: {N} recorded, Implementation Context: {W} words)`
+### Skill Advisor Checkpoint C (Post-Plan)
+> Evaluate auxiliary skill triggers AFTER plan completion, BEFORE implementation. Budget: max 2 skills. Skip all if `ADVISOR_COUNT >= 5`.
+**Intent evaluation** — Read the completed plan.md and answer these questions:
+| # | Question | Score 1–5 | If >= 3 | Skill | Mode |
+|---|----------|-----------|---------|-------|------|
+| C1 | Is the **implementation risk high enough** that a dependency pre-analysis would catch problems the plan missed? Consider: are there files in the File Change Map that import each other (potential circular dependency)? Are there shared utility files that many other files depend on (high fan-out risk)? Are the declared `Depends On` relationships complete, or could there be hidden coupling? | 1=isolated changes, 5=deeply interconnected change set | dependency analysis (general-purpose fork) | Observe |
+| C2 | Does the plan contain **unresolved domain uncertainties** — items tagged `[UNCERTAIN]`, open questions in Implementation Context, or design decisions that assume domain knowledge the team may not have? | 1=all decisions are well-grounded, 5=critical domain questions remain open | `consult({domain})` expert agent | Enrich (fork) |
+**If C1 >= 3** (Observe):
+1. Invoke analysis in fork context:
+   ```
+   Task("Complexity Analysis: {feature}", subagent_type: "general-purpose",
+     prompt: "Analyze the dependency graph of files listed in the plan's File Change Map.
+     ## File Change Map
+     {paste File Change Map table from plan.md}
+     ## Instructions
+     1. For each file in the map, check its imports/dependencies in the codebase (Grep for import/require/source patterns)
+     2. Identify:
+        - Circular dependencies between planned files
+        - High fan-out files (>5 dependents outside the change set)
+        - Hidden coupling not captured in the Depends On column
+        - Files that are imported by many other files (risk of breakage)
+     3. Format your response EXACTLY as:
+        ## Complexity Analysis
+        - [CIRCULAR] {file A} ↔ {file B}: {description}
+        - [FAN-OUT] {file} → {N} dependents: {list top 5}
+        - [COUPLING] {file A} → {file B}: {not in Depends On column}
+        - [HIGH-RISK] {file}: {reason — most impactful if broken}
+        ## Risk Summary
+        Circular: {N}, High fan-out: {N}, Hidden coupling: {N}
+     4. If no issues found, return: '## Complexity Analysis\nNo significant risks detected.'")
+   ```
+2. Store output to `.claude/afc/specs/{feature}/complexity-analysis.md`
+3. Implement phase reads this file → high-risk files get extra verification after modification
+4. If circular dependencies found → **ESCALATE** to user (circular deps in implementation plan are a design flaw)
+5. Increment `ADVISOR_COUNT`
+6. Progress: `  ├─ Skill Advisor [C]: analyze (score: {N}/5, circular: {C}, fan-out: {F}, coupling: {H})`
+**If C2 >= 3** (Enrich):
+1. Determine which domain expert can best resolve the uncertainties (based on the nature of the open questions, not keywords)
+2. Invoke expert agent (look up the agent-id from the Expert Agent Routing table — do NOT construct from domain name):
+   ```
+   Task("Domain gap resolution: {domain}", subagent_type: "afc:{agent-id-from-routing-table}",
+     prompt: "Resolve domain uncertainties found during planning.
+     ## Uncertain Items
+     {extract all [UNCERTAIN] tagged items and open questions from plan.md}
+     ## Plan Context
+     {Implementation Context section from plan.md}
+     ## Instructions
+     1. For each uncertain item, provide a definitive answer with rationale
+     2. Format your response EXACTLY as:
+        ## Domain Resolutions
+        - [RESOLVED] {item}: {answer} — {rationale}
+        - [NEEDS-USER] {item}: {why this requires human judgment}
+     3. Update your MEMORY.md with the resolution context")
+   ```
+3. Apply resolutions to plan.md Implementation Context (replace `[UNCERTAIN]` with `[RESOLVED: {answer}]`)
+4. `[NEEDS-USER]` items → **ESCALATE** to user via AskUserQuestion
+5. Increment `ADVISOR_COUNT`
+6. Progress: `  ├─ Skill Advisor [C]: consult({domain}) (score: {N}/5, {M} resolved, {K} needs-user)`
+**If all scores < 3**: proceed silently to Phase 3.
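Reviewer note: the C1 prompt delegates dependency analysis to an agent, but two of its checks have a straightforward graph reading. The Python sketch below is illustrative only. It assumes the import graph has already been collected as a mapping from each file to the set of files it imports. `find_cycle` and `high_fan_out` are hypothetical names, and the default threshold of 5 is taken from the ">5 dependents outside the change set" wording in the prompt.

```python
def find_cycle(deps):
    """Return one dependency cycle as a path [a, b, ..., a], or [] if acyclic.

    `deps` maps a file to the set of files it imports (assumed input shape).
    """
    WHITE, GRAY, BLACK = 0, 1, 2
    color = {}
    stack = []

    def visit(node):
        color[node] = GRAY
        stack.append(node)
        for nxt in sorted(deps.get(node, ())):
            state = color.get(nxt, WHITE)
            if state == GRAY:
                # Back edge: nxt is on the current DFS path, so a cycle exists.
                return stack[stack.index(nxt):] + [nxt]
            if state == WHITE:
                cycle = visit(nxt)
                if cycle:
                    return cycle
        stack.pop()
        color[node] = BLACK
        return []

    for node in sorted(deps):
        if color.get(node, WHITE) == WHITE:
            cycle = visit(node)
            if cycle:
                return cycle
    return []


def high_fan_out(deps, change_set, threshold=5):
    """Map each changed file to its dependents outside the change set.

    Only files with more than `threshold` such dependents are reported.
    """
    result = {}
    for target in sorted(change_set):
        dependents = sorted(
            f for f, imports in deps.items()
            if target in imports and f not in change_set
        )
        if len(dependents) > threshold:
            result[target] = dependents
    return result
```

A non-empty result from `find_cycle` corresponds to a `[CIRCULAR]` finding, which the checkpoint escalates to the user. `high_fan_out` entries correspond to `[FAN-OUT]` findings that only flag files for extra verification.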
 ### Phase 3: Implement (3/5)
-`"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" phase implement`
+`"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" phase implement`
 **Session context reload**: At implement start, read `.claude/afc/specs/{feature}/context.md` if it exists. This restores key decisions and constraints from Plan phase (resilient to context compaction).
-Execute `/afc:implement` logic inline — **follow all orchestration rules defined in `commands/implement.md`** (task generation, mode selection, batch/swarm execution, failure recovery, task execution pattern). The implement command is the single source of truth for orchestration details.
+**Advisor context reload**: If Checkpoint C produced `.claude/afc/specs/{feature}/complexity-analysis.md`, read it and flag high-risk files (circular dependencies, high fan-out) for extra verification after modification during task execution.
+Execute `/afc:implement` logic inline — **follow all orchestration rules defined in `skills/implement/SKILL.md`** (task generation, mode selection, batch/swarm execution, failure recovery, task execution pattern). The implement skill is the single source of truth for orchestration details.
 **Auto-specific additions** (beyond implement.md):
 #### Step 3.1: Task Generation + Validation
-1. Generate tasks.md from plan.md File Change Map (as defined in implement.md Step 1.3)
+1. Generate tasks.md from plan.md File Change Map using the following format and principles:
+   **Task Format** (required):
+   ```markdown
+   - [ ] T{NNN} {[P]} {[US*]} {description} `{file path}` {depends: [TXXX, TXXX]}
+   ```
+   | Component | Required | Description |
+   |-----------|----------|-------------|
+   | `T{NNN}` | Yes | 3-digit sequential ID (T001, T002, ...) |
+   | `[P]` | No | **Mandatory parallel execution** — task MUST run in parallel with other [P] tasks in the same phase. Requires no file overlap. |
+   | `[US*]` | No | User Story label from spec.md |
+   | description | Yes | Clear task description (start with a verb) |
+   | file path | Yes | Primary target file (wrapped in backticks) |
+   | `depends:` | No | Explicit dependency list — task cannot start until all listed complete |
+   **Decomposition Principles**:
+   - **1 task = 1 file** principle (where possible)
+   - **Same file = sequential**, **different files = [P] candidate**
+   - **Explicit dependencies**: Use `depends: [T001, T002]` for blocking dependencies
+   - **Test tasks**: Include a verification task for each testable unit
+   - **Phase gate**: Add a `{config.gate}` validation task at the end of each Phase
+   **Phase Structure**: Group tasks by Phase (Setup → Core → UI → Integration & Polish)
+   **Coverage Mapping** (append after tasks):
+   ```markdown
+   ## Coverage Mapping
+   | Requirement | Tasks |
+   |-------------|-------|
+   | FR-001 | T003, T007 |
+   ```
+   Every FR-*/NFR-* must be mapped to at least one task.
 2. **Retrospective check**: if `.claude/afc/memory/retrospectives/` exists, load the **most recent 10 files** (sorted by filename descending) and check:
    - Were there previous parallel conflict issues ([P] file overlaps)? Flag similar file patterns.
    - Were there tasks that were over-decomposed or under-decomposed? Adjust granularity.
-3. Script validation (DAG + parallel overlap) — no critic loop, script-based only
-4. Progress: `  ├─ Tasks generated: {N} ({P} parallelizable)`
+3. **Script validation**: Run DAG validation (`afc-dag-validate.sh`) and parallel overlap validation (`afc-parallel-validate.sh`) — no critic loop, script-based only. Fix any conflicts before proceeding.
+4. Progress: `  ├─ Tasks generated: {N} ({P} parallelizable), Coverage: FR {M}%, NFR {K}%`
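Reviewer note: the task format and the "every FR-*/NFR-* must be mapped" rule in Step 3.1 are both mechanically checkable. The Python sketch below is one possible reading, not the package's validator (the diff names `afc-dag-validate.sh` and `afc-parallel-validate.sh` for DAG and overlap checks, whose internals are not shown). `TASK_RE`, `parse_task`, and `unmapped_requirements` are hypothetical, and the requirement ID pattern `FR-<digits>` / `NFR-<digits>` is an assumption based on the `FR-001` example.

```python
import re

# One task line: - [ ] T001 [P] [US1] Description `path` depends: [T000]
TASK_RE = re.compile(
    r"^- \[(?P<done>[ x])\] (?P<id>T\d{3})"
    r"(?P<parallel> \[P\])?"
    r"(?: \[(?P<story>US[^\]]*)\])?"
    r" (?P<description>.+?) `(?P<path>[^`]+)`"
    r"(?: depends: \[(?P<depends>[^\]]*)\])?$"
)


def parse_task(line):
    """Parse one tasks.md line into a dict, or return None if it is not a task."""
    match = TASK_RE.match(line.strip())
    if match is None:
        return None
    depends = match.group("depends")
    return {
        "id": match.group("id"),
        "done": match.group("done") == "x",
        "parallel": match.group("parallel") is not None,
        "story": match.group("story"),
        "description": match.group("description"),
        "path": match.group("path"),
        "depends": [d.strip() for d in depends.split(",") if d.strip()] if depends else [],
    }


def unmapped_requirements(requirements, coverage_md):
    """Return requirement IDs with no task listed in the Coverage Mapping table."""
    mapped = set()
    for line in coverage_md.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if (
            len(cells) == 2
            and re.fullmatch(r"N?FR-\d+", cells[0])  # assumed ID pattern
            and re.search(r"T\d{3}", cells[1])
        ):
            mapped.add(cells[0])
    return sorted(set(requirements) - mapped)
```

An empty list from `unmapped_requirements` would correspond to 100% FR and NFR coverage in the progress line. Any returned ID marks a requirement that still needs at least one task.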
 #### Step 3.2: TDD Pre-Generation (conditional)
-`"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" phase test-pre-gen`
+`"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" phase test-pre-gen`
 **Trigger condition**: tasks.md contains at least 1 task targeting a `.sh` file in `scripts/`.
 **If triggered**:
 1. Run the test pre-generation script:
    ```bash
-   "${CLAUDE_PLUGIN_ROOT}/scripts/afc-test-pre-gen.sh" ".claude/afc/specs/{feature}/tasks.md" "spec/"
+   "${CLAUDE_SKILL_DIR}/../../scripts/afc-test-pre-gen.sh" ".claude/afc/specs/{feature}/tasks.md" "spec/"
    ```
 2. Review generated skeleton files — verify they are parseable:
    ```bash
@@ -291,14 +598,14 @@ Execute `/afc:implement` logic inline — **follow all orchestration rules defin
 #### Step 3.3: Blast Radius Analysis (conditional)
-`"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" phase blast-radius`
+`"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" phase blast-radius`
 **Trigger condition**: plan.md File Change Map lists >= 3 files to change.
 **If triggered**:
 1. Run the blast radius analysis:
    ```bash
-   "${CLAUDE_PLUGIN_ROOT}/scripts/afc-blast-radius.sh" ".claude/afc/specs/{feature}/plan.md" "${CLAUDE_PROJECT_DIR}"
+   "${CLAUDE_SKILL_DIR}/../../scripts/afc-blast-radius.sh" ".claude/afc/specs/{feature}/plan.md" "${CLAUDE_PROJECT_DIR}"
    ```
 2. If exit 1 (cycle detected): **ESCALATE** — present the cycle to user with options:
    - Option 1: Refactor plan to break the cycle
@@ -314,11 +621,11 @@ Execute `/afc:implement` logic inline — **follow all orchestration rules defin
 0. **Baseline test** (follows implement.md Step 1, item 5): if `{config.test}` is non-empty, run `{config.test}` before starting task execution. On failure, report pre-existing test failures to user and ask: "(1) Proceed anyway (2) Fix first (3) Abort". On pass or empty config, continue.
 1. Execute tasks phase by phase using implement.md orchestration rules (sequential/batch/swarm based on [P] count)
 2. **Implementation Context injection**: Every sub-agent prompt includes the `## Implementation Context` section from plan.md **and relevant FR/AC items from spec.md** (ensures spec intent propagates to workers)
-3. Perform **3-step gate** on each Implementation Phase completion — **always** read `${CLAUDE_PLUGIN_ROOT}/docs/phase-gate-protocol.md` first. Cannot advance to next phase without passing the gate.
-   - On gate pass: create phase rollback point `"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" phase-tag {phase_number}`
+3. Perform **3-4 step gate** on each Implementation Phase completion — **always** read `${CLAUDE_SKILL_DIR}/../../docs/phase-gate-protocol.md` first. Cannot advance to next phase without passing the gate.
+   - On gate pass: create phase rollback point `"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" phase-tag {phase_number}`
 4. Real-time `[x]` updates in tasks.md
 5. After full completion, run `{config.ci}` final verification
-   - On pass: `"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" ci-pass` (releases Stop Gate)
+   - On pass: `"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" ci-pass` (releases Stop Gate)
    - **On fail: Debug-based RCA** (replaces blind retry):
      1. Execute `/afc:debug` logic inline with the CI error output as input
      2. Debug performs RCA: error trace → data flow → hypothesis → targeted fix
@@ -349,7 +656,7 @@ Execute `/afc:implement` logic inline — **follow all orchestration rules defin
 #### Step 3.6: Implement Critic Loop
-> **Always** read `${CLAUDE_PLUGIN_ROOT}/docs/critic-loop-rules.md` first and follow it.
+> **Always** read `${CLAUDE_SKILL_DIR}/../../docs/critic-loop-rules.md` first and follow it.
 **Critic Loop until convergence** (safety cap: 5, follow Critic Loop rules):
 - **SCOPE_ADHERENCE**: Compare `git diff` changed files against plan.md File Change Map. Flag any file modified that is NOT in the plan. Flag any planned file NOT modified. Provide "M of N files match" count.
@@ -367,77 +674,338 @@ Execute `/afc:implement` logic inline — **follow all orchestration rules defin
 7. **Checkpoint**: phase transition already recorded by `afc-pipeline-manage.sh phase implement` at phase start
 8. Progress: `✓ 3/5 Implement complete ({completed}/{total} tasks, CI: ✓, Critic: converged ({N} passes, {M} fixes, {E} escalations))`
-### Phase 4: Review (4/5)
+### Skill Advisor Checkpoint D (Post-Implement)
-`"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" phase review`
+> Evaluate auxiliary skill triggers AFTER implementation, BEFORE review. Budget: max 2 skills. Skip all if `ADVISOR_COUNT >= 5`.
-Execute `/afc:review` logic inline — **follow all review perspectives defined in `commands/review.md`** (A through H). The review command is the single source of truth for review criteria.
+**Intent evaluation** — Examine the implementation results and answer these questions:
-**Context reload**: Re-read `.claude/afc/specs/{feature}/context.md` (contains full AC) and `.claude/afc/specs/{feature}/spec.md` to ensure spec context is available for SPEC_ALIGNMENT validation (these may have been compacted since Phase 1).
+| # | Question | Score 1–5 | Skill (if >= 3) | Mode |
+|---|----------|-----------|-----------------|------|
+| D1 | Were **testable source files changed without corresponding test coverage**? Look at `git diff --name-only` — for each changed source file, does a test file covering its behavior also appear in the diff? Consider the project's test convention and whether the changed files contain logic that should be tested (skip config files, types-only files, static assets). Only evaluate if `{config.test}` is non-empty. | 1=all changes have test coverage, 5=critical logic changed with zero tests | test generation (general-purpose fork) | Enrich |
+| D2 | Based on **past pipeline quality data**, is there reason to believe this implementation has hidden quality issues? Check `.claude/afc/memory/quality-history/*.json` (if exists) — have recent pipelines shown elevated critical findings? Are there recurring problem categories that this feature's changed files might be susceptible to? | 1=clean history or no history, 5=strong pattern of recurring issues in similar areas | pre-review QA (general-purpose fork) | Observe |
-1. Review implemented changed files (`git diff HEAD`)
-2. **Specialist agent delegation** (parallel, perspectives B and C):
-   Launch architect and security agents in a **single message** to leverage their persistent memory:
+**If D1 >= 3 AND `{config.test}` is non-empty** (Enrich — skip if no test framework configured):
+1. Identify which changed source files lack test coverage — focus on files with meaningful logic (not config, not types, not assets):
    ```
-   Task("Architecture Review: {feature}", subagent_type: "afc:afc-architect",
-     prompt: "Review the following changed files for architecture compliance.
-     ## Changed Files
-     {list of changed files from git diff}
+   For each changed source file:
+   - Does the project have a test file for it? (check test directory patterns)
+   - Was that test file also modified in this diff?
+   - Does the source file contain testable exports? (functions, classes, handlers)
+   → List files that have testable logic but no test coverage in this diff
+   ```
+2. Invoke test generation (fork):
+   ```
+   Task("Coverage boost: {feature}", subagent_type: "general-purpose",
+     prompt: "Generate missing tests for recently implemented files.
-     ## Architecture Rules
-     {config.architecture}
+     ## Uncovered Files (testable logic, no test changes in this diff)
+     {list of uncovered source files with their full paths}
      ## Instructions
-     1. Read your MEMORY.md for prior architecture patterns and ADRs
-     2. Check each file against architecture rules (layer boundaries, naming, placement)
-     3. Cross-reference with ADRs recorded during Plan phase — any violations?
-     4. Return findings as: severity (Critical/Warning/Info), file:line, issue, suggested fix
-     5. Update your MEMORY.md with any new architecture patterns discovered")
+     1. Read each uncovered file to understand its exports and behavior
+     2. Read existing test files in the project for pattern reference
+     3. Generate unit tests targeting:
+        - Exported functions/classes
+        - Edge cases and error paths
+        - Integration points (if the file calls other changed files)
+     4. Follow the project's test framework: {config.test framework}
+     5. Place test files following project convention
+     6. Run {config.test} to verify tests pass
+     7. Return: files created, test count, pass/fail status")
+   ```
+3. New test files automatically enter review scope (Phase 4)
+4. Increment `ADVISOR_COUNT`
+5. Progress: `  ├─ Skill Advisor [D]: test (score: {N}/5, {M} uncovered files → {K} test files generated)`
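The D1 coverage check above can be approximated in shell. This is a minimal sketch, not the plugin's implementation: `uncovered_files` is a hypothetical helper, and it assumes a `name.test.ext` naming convention next to the source file, which may not match a given project. It reads the changed paths (for example the output of `git diff --name-only`) on stdin and prints the source files whose test file is not part of the same diff.

```shell
# Hypothetical helper: list changed source files with no matching test file in the diff.
# Assumption: tests live beside the source as <name>.test.<ext>.
uncovered_files() {
  changed=$(cat)
  printf '%s\n' "$changed" | while IFS= read -r file; do
    case "$file" in
      ""|*.test.*) continue ;;   # skip blank lines and the test files themselves
    esac
    base=${file%.*}
    ext=${file##*.}
    test_file="$base.test.$ext"
    # Exact, whole-line match against the list of changed paths.
    if ! printf '%s\n' "$changed" | grep -qxF "$test_file"; then
      printf '%s\n' "$file"
    fi
  done
}
```

The result is only a candidate list; the question in the table still asks for a judgment about whether each file contains testable logic.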
-   Task("Security Review: {feature}", subagent_type: "afc:afc-security",
-     prompt: "Scan the following changed files for security vulnerabilities.
+**If D2 >= 3** (Observe):
+1. Load `.claude/afc/memory/quality-history/*.json` (most recent 3 files, sorted by filename descending)
+2. Identify recurring problem categories and which changed files are most at risk:
+   ```
+   Task("Pre-review QA: {feature}", subagent_type: "general-purpose",
+     prompt: "Perform a pre-review quality audit focused on historically problematic areas.
      ## Changed Files
-     {list of changed files from git diff}
+     {git diff --name-only}
+     ## Quality History Context
+     {summary of patterns from recent quality-history reports — categories, frequencies, affected file types}
      ## Instructions
-     1. Read your MEMORY.md for known vulnerability patterns and false positives
-     2. Check for: command injection, path traversal, unvalidated input, sensitive data exposure
-     3. Skip patterns recorded as false positives in your memory
-     4. Return findings as: severity (Critical/Warning/Info), file:line, issue, suggested fix
-     5. Update your MEMORY.md with new patterns or confirmed false positives")
-   ```
-   - Collect agent outputs and merge into the consolidated review
-   - Agent findings inherit their severity classification directly
-3. Check across **8 perspectives** (A-H as defined in review.md):
-   - A. Code Quality — `{config.code_style}` compliance (direct review)
-   - B. Architecture — **delegated to afc-architect agent** (persistent memory, ADR-aware)
-   - C. Security — **delegated to afc-security agent** (persistent memory, false-positive-aware)
-   - D. Performance — framework-specific patterns from Project Context (direct review)
-   - E. Project Pattern Compliance — conventions and idioms (direct review)
-   - **F. Reusability** — DRY, shared utilities, abstraction level (direct review)
-   - **G. Maintainability** — AI/human comprehension, naming clarity, self-contained files (direct review)
-   - **H. Extensibility** — extension points, OCP, future modification cost (direct review)
-4. **Auto-resolved validation**: Check all `[AUTO-RESOLVED]` items from spec phase — does the implementation match the guess? Flag mismatches as Critical.
-5. **Past reviews check**: if `.claude/afc/memory/reviews/` exists, load the **most recent 15 files** (sorted by filename descending) and scan for recurring finding patterns across past review reports. Prioritize those areas.
-6. **Retrospective check**: if `.claude/afc/memory/retrospectives/` exists, load the **most recent 10 files** (sorted by filename descending) and check:
+     1. Focus on the recurring problem categories identified above
+     2. Check: error handling completeness, input validation, resource cleanup
+     3. Format your response EXACTLY as:
+        ## Pre-Review QA Findings
+        - [{severity}] {file}:{line} — {issue}: {suggested fix}
+        ## Priority Hints for Review
+        - {file}: focus on {area} (historically problematic)
+     4. Read-only — do NOT modify any files")
+   ```
+3. Store output as `QA_FINDINGS` → injected into Phase 4 review context
+4. Review phase uses "Priority Hints" to focus attention
+5. Increment `ADVISOR_COUNT`
+6. Progress: `  ├─ Skill Advisor [D]: qa (score: {N}/5, {M} priority hints for review)`
+**If all scores < 3**: proceed silently to Phase 4.
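The gating rule for a single advisor question can be sketched as follows. This is an illustration under assumptions: `advisor_decision` is a hypothetical name, and the score and `ADVISOR_COUNT` are assumed to be plain integers. It covers the global cap (`ADVISOR_COUNT >= 5`) and the score threshold (`>= 3`), but not the per-checkpoint budget.

```shell
# Hypothetical helper: decide whether one advisor question triggers its skill.
# $1 = score (1-5), $2 = current ADVISOR_COUNT
advisor_decision() {
  score=$1
  advisor_count=$2
  if [ "$advisor_count" -ge 5 ]; then
    echo "skip"   # global cap reached: skip all advisor skills
  elif [ "$score" -ge 3 ]; then
    echo "run"
  else
    echo "skip"   # below threshold: proceed silently
  fi
}
```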
+### Phase 4: Review (4/5)
+`"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" phase review`
+Execute `/afc:review` logic inline — **follow all review perspectives defined in `skills/review/SKILL.md`** (A through H). The review skill is the single source of truth for review criteria.
+**Context reload**: Re-read `.claude/afc/specs/{feature}/context.md` (contains full AC) and `.claude/afc/specs/{feature}/spec.md` to ensure spec context is available for SPEC_ALIGNMENT validation (these may have been compacted since Phase 1).
+#### Step 4.1: Collect Review Targets
+1. Collect changed files via `git diff HEAD`
+2. Read **full content** of each changed file (not just the diff — full context needed for review)
+#### Step 4.2: Reverse Impact Analysis
+Before reviewing, identify **files affected by the changes** (not just the changed files themselves):
+1. **For each changed file**, find files that depend on it:
+   - **LSP (preferred)**: `LSP(findReferences)` on exported symbols — tracks type references, function calls, re-exports
+   - **Grep (fallback)**: `Grep` for `import.*{filename}`, `require.*{filename}`, `source.*{filename}` patterns across the codebase
+   - LSP and Grep are complementary — use both when LSP is available
+2. **Build impact map**:
+   ```
+   Impact Map:
+   ├─ src/auth/login.ts (changed)
+   │  └─ affected: src/pages/LoginPage.tsx, src/middleware/auth.ts
+   └─ Total: {N} changed files → {M} affected files
+   ```
+3. **Scope decision**: Affected files are NOT full review targets. Include them as **cross-reference context** in review and cross-boundary verification. If an affected file has >3 references to a changed symbol → flag for closer inspection.
+4. **Limitations** (include in review output):
+   > ⚠ Dynamic dependencies not covered: runtime dispatch, reflection, cross-language calls, config/env-driven branching.
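The Grep fallback from Step 4.2 can be sketched in shell. This is a rough illustration, not the skill's actual tooling: `find_dependents` is a hypothetical helper, it assumes a `grep` that supports `-r`, and the name is interpolated into the pattern as a regular expression, so names containing regex metacharacters would need escaping. It does not replace the preferred LSP lookup.

```shell
# Hypothetical helper: list files under a root that import/require/source a given name.
# $1 = module or file name (without extension), $2 = directory to search
find_dependents() {
  name=$1
  root=$2
  grep -rlE "(import|require|source).*$name" "$root" 2>/dev/null | sort
}
```

Each path it prints is a candidate "affected" entry for the Impact Map; as the limitations note says, dynamic dependencies will not show up in a textual search like this.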
+#### Step 4.3: Scaled Review Orchestration
+Choose review orchestration based on the number of changed files:
+**Pre-scan: Call Chain Context** (for Parallel Batch and Review Swarm modes only):
+Before distributing files to review agents, collect cross-boundary context:
+1. For each changed file, identify **outbound calls** to other changed files (imports + function calls)
+2. For each outbound call target, extract: function signature + 1-line side-effect summary
+3. Include the **Impact Map** from Step 4.2 — each agent receives the list of affected files
+4. Include this context in each review agent's prompt as `## Cross-File Context`
+For Direct review mode (≤5 files): skip pre-scan — orchestrator already has full context.
+**5 or fewer files**: Direct review — review all files directly in the current context (no delegation).
+**6–10 files**: Parallel Batch — distribute to parallel review agents (2–3 files per agent) in a **single message**:
+```
+Task("Review: {file1, file2}", subagent_type: "general-purpose")
+Task("Review: {file3, file4}", subagent_type: "general-purpose")
+```
+**11+ files**: Review Swarm — group files into batches (2-3 per worker), spawn N review workers in a **single message** (N = min(5, file count / 2)). Review is read-only — no write race conditions.
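The mode thresholds in Step 4.3 amount to a small decision function. The sketch below uses hypothetical helper names (`review_mode`, `swarm_workers`) and assumes integer division for "file count / 2", which the text does not specify.

```shell
# Hypothetical helper: map a changed-file count to the review mode from Step 4.3.
review_mode() {
  count=$1
  if [ "$count" -le 5 ]; then
    echo "direct"
  elif [ "$count" -le 10 ]; then
    echo "parallel-batch"
  else
    echo "swarm"
  fi
}

# Worker count for Review Swarm: N = min(5, file count / 2).
# Assumption: the division is integer division.
swarm_workers() {
  n=$(( $1 / 2 ))
  if [ "$n" -gt 5 ]; then
    n=5
  fi
  echo "$n"
}
```

For example, `review_mode 8` prints `parallel-batch`, and `swarm_workers 12` prints `5` because the cap applies.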
+#### Step 4.4: Specialist Agent Delegation (parallel, perspectives B and C)
+Launch architect and security agents in a **single message** to leverage their persistent memory:
+```
+Task("Architecture Review: {feature}", subagent_type: "afc:afc-architect",
+  prompt: "Review the following changed files for architecture compliance.
+  ## Changed Files
+  {list of changed files from git diff}
+  ## Architecture Rules
+  {config.architecture}
+  ## Instructions
+  1. Read your MEMORY.md for prior architecture patterns and ADRs
+  2. Check each file against architecture rules (layer boundaries, naming, placement)
+  3. Cross-reference with ADRs recorded during Plan phase — any violations?
+  4. Return findings as: severity (Critical/Warning/Info), file:line, issue, suggested fix
+  5. Update your MEMORY.md with any new architecture patterns discovered")
+Task("Security Review: {feature}", subagent_type: "afc:afc-security",
+  prompt: "Scan the following changed files for security vulnerabilities.
+  ## Changed Files
+  {list of changed files from git diff}
+  ## Instructions
+  1. Read your MEMORY.md for known vulnerability patterns and false positives
+  2. Check for: command injection, path traversal, unvalidated input, sensitive data exposure
+  3. Skip patterns recorded as false positives in your memory
+  4. Return findings as: severity (Critical/Warning/Info), file:line, issue, suggested fix
+  5. Include any new vulnerability patterns or confirmed false positives in your response (orchestrator will record them)")
+```
+- Collect agent outputs and merge into the consolidated review
+- Agent findings inherit their severity classification directly
+#### Step 4.5: Perform Review (8 perspectives)
+Check across **8 perspectives** (A-H as defined in `skills/review/SKILL.md`):
+- A. Code Quality — `{config.code_style}` compliance (direct review)
+- B. Architecture — **delegated to afc-architect agent** (persistent memory, ADR-aware)
+- C. Security — **delegated to afc-security agent** (persistent memory, false-positive-aware)
+- D. Performance — framework-specific patterns from Project Context (direct review)
+- E. Project Pattern Compliance — conventions and idioms (direct review)
+- **F. Reusability** — DRY, shared utilities, abstraction level (direct review)
+- **G. Maintainability** — AI/human comprehension, naming clarity, self-contained files (direct review)
+- **H. Extensibility** — extension points, OCP, future modification cost (direct review)
+#### Step 4.6: Cross-Boundary Verification (MANDATORY)
+After individual/parallel reviews and specialist agents complete, the **orchestrator** MUST perform a cross-boundary check. This is a required step, not optional — skipping it is a review defect.
+**For High complexity (Review Swarm) reviews**: This is especially critical because individual review agents cannot see cross-file interactions. The orchestrator MUST read callee implementations directly.
+0. **Impact Map integration**: Use the Impact Map from Step 4.2 to prioritize verification. Affected files with significant coupling to changed symbols (behavioral call references, not just type imports, especially in critical code paths) should be read and checked for breakage — even if no finding was raised against them.
+1. **Filter**: From all collected findings, select those involving:
+   - Call order changes (function A now calls B before C)
+   - Error handling modifications (try/catch scope changes, error propagation changes)
+   - State mutation changes (new writes to shared state, removed cleanup)
+2. **Verify**: For each behavioral finding rated Critical or Warning:
+   - **Read the callee's implementation** (the function/method being called) — this read is mandatory, not optional
+   - **Skip external dependencies**: If the callee is in `node_modules/`, `vendor/`, or other third-party directories, verify against type definitions or documented API contract instead. Note: "verified against types/docs, not source"
+   - Check: does the callee's internal behavior (side effects, state changes, return values) actually conflict with the change?
+   - If no conflict → downgrade: Critical → Info, Warning → Info (append "verified: no cross-boundary impact")
+   - If confirmed conflict → keep severity, enrich description with callee behavior details
+3. **False positive reference** (security-related findings only): Check `afc-security` agent's MEMORY.md `## False Positives` section if it exists. Known false positive patterns should be noted in findings.
+4. **Output**: Append verification summary before Review Output:
+   ```
+   Cross-Boundary Check: {N} behavioral findings verified
+   ├─ Confirmed: {M} (severity kept)
+   ├─ Downgraded: {K} (false positive — callee compatible)
+   └─ Skipped: {J} (no behavioral change)
+   ```
+This step runs in the orchestrator context (not delegated), as it requires reading code across file boundaries.
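The downgrade rule in the Verify step can be written as a small lookup. This is a sketch only: `verified_severity` is a hypothetical name, and the `yes`/`no` conflict flag stands in for the outcome of actually reading the callee, which is the part that cannot be automated here.

```shell
# Hypothetical helper: apply the Step 4.6 downgrade rule to one behavioral finding.
# $1 = original severity (Critical | Warning | Info)
# $2 = "yes" if reading the callee confirmed a conflict, "no" otherwise
verified_severity() {
  severity=$1
  conflict=$2
  case "$severity:$conflict" in
    Critical:no|Warning:no)
      echo "Info (verified: no cross-boundary impact)" ;;
    *)
      echo "$severity" ;;   # confirmed conflict, or already Info: keep severity
  esac
}
```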
+#### Step 4.7: Inject Advisor Context
+If Checkpoint D produced outputs, inject them into the review context:
+- **`QA_FINDINGS`** (from D2): read the stored findings and include as "Priority Hints for Review" — focus review attention on historically problematic areas and pre-identified quality concerns.
+- **New test files** (from D1): include in the review scope alongside implementation changes.
+#### Step 4.8: Auto-specific Validations
+1. **Auto-resolved validation**: Check all `[AUTO-RESOLVED]` items from spec phase — does the implementation match the guess? Flag mismatches as Critical.
+2. **Past reviews check**: if `.claude/afc/memory/reviews/` exists, load the **most recent 15 files** (sorted by filename descending) and scan for recurring finding patterns across past review reports. Prioritize those areas.
+3. **Retrospective check**: if `.claude/afc/memory/retrospectives/` exists, load the **most recent 10 files** (sorted by filename descending) and check:
    - Were there recurring Critical finding categories in past reviews? Prioritize those perspectives.
    - Were there false positives that wasted effort? Reduce sensitivity for those patterns.
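The "most recent N files, sorted by filename descending" selection used for the `reviews/` and `retrospectives/` directories can be sketched as below. `recent_files` is a hypothetical helper, and it assumes filenames sort chronologically (for example a date-based name), which the text implies but does not guarantee.

```shell
# Hypothetical helper: print the N most recent entries of a memory directory.
# $1 = directory, $2 = maximum number of files
# Prints nothing if the directory does not exist.
recent_files() {
  dir=$1
  limit=$2
  [ -d "$dir" ] || return 0
  ls -1 "$dir" | sort -r | head -n "$limit"
}
```

A call such as `recent_files .claude/afc/memory/reviews 15` would give the file names to load for the past reviews check.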
-7. **Critic Loop until convergence** (safety cap: 5, follow Critic Loop rules):
-   - COMPLETENESS: were all changed files reviewed across all 8 perspectives (A-H)?
-   - SPEC_ALIGNMENT: cross-check implementation against spec.md — (1) every SC verified with `{M}/{N}` count, (2) every acceptance scenario (GWT) has corresponding code path, (3) no spec constraint is violated
-   - PRECISION: are there unnecessary changes? Are there out-of-scope modifications?
-   - FAIL → auto-fix and continue. ESCALATE → pause, present options, resume after response. DEFER → record reason, mark clean.
-8. **Handling SC shortfalls**:
-   - Fixable → attempt auto-fix → re-run `{config.ci}` verification
-   - Not fixable → state in final report with reason (no post-hoc rationalization; record as Plan-phase target-setting error)
-9. **Checkpoint**: phase transition already recorded by `afc-pipeline-manage.sh phase review` at phase start
-10. Progress: `✓ 4/5 Review complete (Critical:{N} Warning:{N} Info:{N}, SC shortfalls: {N})`
+#### Step 4.9: Critic Loop
+> **Always** read `${CLAUDE_SKILL_DIR}/../../docs/critic-loop-rules.md` first and follow it.
+**Critic Loop until convergence** (safety cap: 5, follow Critic Loop rules):
+- COMPLETENESS: were all changed files reviewed across all 8 perspectives (A-H)?
+- SPEC_ALIGNMENT: cross-check implementation against spec.md — (1) every SC verified with `{M}/{N}` count, (2) every acceptance scenario (GWT) has corresponding code path, (3) no spec constraint is violated
+- SIDE_EFFECT_AWARENESS: For findings involving call order changes, error handling modifications, or state mutation changes: did the reviewer verify the callee's internal behavior? If a Critical finding assumes a side effect without reading the target implementation → auto-downgrade to Info with note "cross-boundary unverified". Provide "{M} of {N} behavioral findings verified" count.
+- PRECISION: are there unnecessary changes? Are there out-of-scope modifications? Are findings actual issues, not false positives?
+- FAIL → auto-fix and continue. ESCALATE → pause, present options, resume after response. DEFER → record reason, mark clean.
+#### Step 4.10: Handling SC shortfalls
+- Fixable → attempt auto-fix → re-run `{config.ci}` verification
+- Not fixable → state in final report with reason (no post-hoc rationalization; record as Plan-phase target-setting error)
+#### Step 4.11: Retrospective Entry (if new pattern found)
+If this review reveals a recurring pattern not previously documented in `.claude/afc/memory/retrospectives/`:
+Append to `.claude/afc/memory/retrospectives/{YYYY-MM-DD}.md`:
+```markdown
+## Pattern: {category}
+**What happened**: {concrete description}
+**Root cause**: {why this keeps occurring}
+**Prevention rule**: {actionable rule — usable in future plan/implement phases}
+**Severity**: Critical | Warning
+```
+Only write if the pattern is new and actionable. Generic observations are prohibited.
+#### Step 4.12: Archive Review Report
+Persist the review results for memory:
+1. Write full review output (Summary table + Impact Analysis + Detailed Findings + Positives + Cross-Boundary Check) to `.claude/afc/specs/{feature}/review-report.md`
+2. Include metadata header:
+   ```markdown
+   # Review Report: {feature name}
+   > Date: {YYYY-MM-DD}
+   > Files reviewed: {count}
+   > Findings: Critical {N} / Warning {N} / Info {N}
+   ```
+3. This file is copied to `.claude/afc/memory/reviews/{feature}-{date}.md` during Clean phase before .claude/afc/specs/ deletion.
+#### Step 4.13: Checkpoint & Progress
+- **Checkpoint**: phase transition already recorded by `afc-pipeline-manage.sh phase review` at phase start
+- Progress: `✓ 4/5 Review complete (Critical:{N} Warning:{N} Info:{N}, Cross-boundary: {M} verified, SC shortfalls: {N})`
+### Skill Advisor Checkpoint E (Post-Review)
+> Evaluate auxiliary skill triggers AFTER review, BEFORE clean. Budget: max 1 skill. Skip all if `ADVISOR_COUNT >= 5`.
+**Intent evaluation** — Examine review findings and retrospective history:
+| # | Question | Score 1–5 | Skill (if >= 3) | Mode |
+|---|----------|-----------|-----------------|------|
+| E1 | Are there **recurring problem patterns** across this and past pipelines that should be codified as project rules? Check `.claude/afc/memory/retrospectives/` — do the same types of issues (e.g., "missing error handling in hooks", "forgotten spec file updates") keep appearing? Also consider: did this pipeline's review reveal issues that match past retrospective patterns? | 1=no retrospective history or no patterns, 5=same issue type recurred 3+ times and is not yet a project rule | pattern promotion (general-purpose fork, with learner guardrails) | Observe |
+**If E1 >= 3** (Observe):
+1. Read retrospective files and identify recurring pattern categories:
+   - What types of issues keep recurring?
+   - Are they already covered by existing rules in `.claude/rules/afc-learned.md`?
+   - Would a project rule have prevented the recurrence?
+2. Invoke learner:
+   ```
+   Task("Pattern promotion: {feature}", subagent_type: "general-purpose",
+     prompt: "Review recurring patterns for potential promotion to project rules.
+     ## Recurring Patterns
+     {list each pattern with: category, occurrence count, concrete examples from retrospective entries}
+     ## Current Review Findings
+     {summary of this pipeline's review findings that match retrospective patterns}
+     ## Current Rules
+     {read .claude/rules/afc-learned.md if it exists, else 'No learned rules yet'}
+     ## Instructions
+     1. For each recurring pattern, evaluate:
+        - Is it actionable? (specific enough to enforce)
+        - Is it already covered by existing rules?
+        - Would enforcing it have prevented the recurrence?
+     2. For patterns worth promoting, write a rule in this format:
+        ### {Category}
+        - **Rule**: {concise, enforceable statement}
+        - **Rationale**: {why — based on {N} occurrences across pipelines}
+        - **Enforcement**: {how to check — linter, review criterion, or convention}
+     3. **Safety guardrails** (mandatory):
+        - Do NOT create rules about: permissions, security policies, hook behavior, tool access
+        - Do NOT create rules that contradict existing CLAUDE.md or .claude/rules/ content
+        - Each rule must be scoped to code conventions only (naming, style, workflow, testing, architecture)
+        - Verify no duplicate or contradictory rule exists before appending
+     4. Append new rules to .claude/rules/afc-learned.md (create if absent)
+     5. Do NOT duplicate existing rules
+     6. Return: {N} patterns evaluated, {M} promoted, {K} already covered")
+   ```
+3. Increment `ADVISOR_COUNT`
+4. Progress: `  ├─ Skill Advisor [E]: learner (score: {N}/5, {M} patterns evaluated, {K} promoted to rules)`
+**If score < 3**: proceed silently to Phase 5.
 ### Phase 5: Clean (5/5)
-`"${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" phase clean`
+`"${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" phase clean`
 Artifact cleanup and codebase hygiene check after implementation and review:
@@ -460,23 +1028,24 @@ Artifact cleanup and codebase hygiene check after implementation and review:
    - **If retrospective.md exists** → record as patterns missed by the Plan phase Critic Loop in `.claude/afc/memory/retrospectives/` (reuse as RISK checklist items in future runs)
    - **If review-report.md exists** → copy to `.claude/afc/memory/reviews/{feature}-{date}.md` before .claude/afc/specs/ deletion
    - **If research.md exists** and was not already persisted in Plan phase → copy to `.claude/afc/memory/research/{feature}.md`
-   - **Agent memory consolidation**: architect and security agents have already updated their persistent MEMORY.md during Review phase. **Size enforcement**: check each agent's MEMORY.md line count — if either exceeds 100 lines, invoke the respective agent to self-prune:
+   - **Agent memory consolidation**: Check each agent's MEMORY.md for bloat — if it contains redundant, obsolete, or superseded entries that reduce signal-to-noise ratio, invoke the agent to self-prune:
      ```
      Task("Memory cleanup: afc-architect", subagent_type: "afc:afc-architect",
-       prompt: "Your MEMORY.md exceeds 100 lines. Read it, prune old/redundant entries, and rewrite to under 100 lines following your size limit rules.")
+       prompt: "Review your MEMORY.md. Read it, identify and prune old/redundant/obsolete entries, and rewrite it keeping only entries that are still relevant and non-overlapping.")
      ```
-     (Same pattern for afc-security if needed. Skip if both are under 100 lines.)
-   - **Memory rotation**: for each memory subdirectory, check file count and prune oldest files if over threshold:
-     | Directory | Threshold | Action |
-     |-----------|-----------|--------|
-     | `quality-history/` | 30 files | Delete oldest files beyond threshold |
-     | `reviews/` | 40 files | Delete oldest files beyond threshold |
-     | `retrospectives/` | 30 files | Delete oldest files beyond threshold |
-     | `research/` | 50 files | Delete oldest files beyond threshold |
-     | `decisions/` | 60 files | Delete oldest files beyond threshold |
-     - Sort by filename ascending (oldest first), delete excess
+     Use semantic assessment (are entries still relevant? do entries overlap?) rather than a line-count threshold. (Same pattern for afc-security if needed.)
+   - **Memory rotation**: For each memory subdirectory, assess whether the oldest files still provide value. Prune files that are superseded by newer entries, reference features/code that no longer exists, or overlap with other files. As a practical guideline, keep the most recent and relevant entries — if a directory has grown large enough that scanning it would be slow (roughly 30+ files), prioritize pruning the least relevant entries:
+     | Directory | Pruning Intent | Soft Guideline |
+     |-----------|---------------|----------------|
+     | `quality-history/` | Remove superseded or redundant quality records | ~30 files |
+     | `reviews/` | Remove reviews for features no longer in the codebase | ~40 files |
+     | `retrospectives/` | Remove retrospectives whose learnings are already captured elsewhere | ~30 files |
+     | `research/` | Remove research for libraries/patterns no longer used | ~50 files |
+     | `decisions/` | Remove decisions that have been reversed or are no longer relevant | ~60 files |
+     - These numbers are soft guidelines, not hard cutoffs — use judgment based on relevance
+     - Sort by filename ascending (oldest first) when pruning by recency
      - Log: `"Memory rotation: {dir} pruned {N} files"`
-     - Skip directories that do not exist or are under threshold
+     - Skip directories that do not exist or clearly do not need pruning
 5. **Quality report** (structured pipeline metrics):
    - Generate `.claude/afc/memory/quality-history/{feature}-{date}.json` with the following structure:
      ```json
@@ -509,11 +1078,11 @@ Artifact cleanup and codebase hygiene check after implementation and review:
    - Clear `.claude/afc/memory/checkpoint.md` **and** `~/.claude/projects/{ENCODED_PATH}/memory/checkpoint.md` (pipeline complete = session goal achieved, dual-delete prevents stale checkpoint in either location; `ENCODED_PATH` = project path with `/` replaced by `-`)
 7. **Timeline finalize**:
    ```bash
-   "${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" log pipeline-end "Pipeline complete: {feature}"
+   "${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" log pipeline-end "Pipeline complete: {feature}"
    ```
 8. **Release Pipeline Flag** (hook integration):
    ```bash
-   "${CLAUDE_PLUGIN_ROOT}/scripts/afc-pipeline-manage.sh" end
+   "${CLAUDE_SKILL_DIR}/../../scripts/afc-pipeline-manage.sh" end
    ```
    - Stop Gate Hook deactivated
    - Change tracking log deleted
@@ -536,6 +1105,8 @@ Auto pipeline complete: {feature}
 │   ├─ Perspectives: Quality, Architecture*, Security*, Performance, Patterns, Reusability, Maintainability, Extensibility
 │   └─ (* = delegated to persistent-memory agent)
 ├─ 5/5 Clean: {N} artifacts deleted, {N} dead code removed
+├─ Skill Advisor: {ADVISOR_COUNT} auxiliary skills invoked
+│   {for each invoked: ├─ [{checkpoint}] {skill}: {summary}}
 ├─ Changed files: {N}
 ├─ Auto-resolved: {N} ({M} validated in review)
 ├─ Agent memory: architect {updated/skipped}, security {updated/skipped}