npm - create-ai-project - Versions diffs - 1.22.1 → 1.23.1 - Mend

create-ai-project 1.22.1 → 1.23.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (46) hide show

package/.claude/agents-en/code-reviewer.md +9 -53
package/.claude/agents-en/code-verifier.md +3 -22
package/.claude/agents-en/document-reviewer.md +14 -69
package/.claude/agents-en/integration-test-reviewer.md +6 -0
package/.claude/agents-en/quality-fixer-frontend.md +47 -31
package/.claude/agents-en/quality-fixer.md +40 -25
package/.claude/agents-en/task-decomposer.md +31 -0
package/.claude/agents-en/task-executor-frontend.md +64 -15
package/.claude/agents-en/task-executor.md +59 -19
package/.claude/agents-en/technical-designer-frontend.md +32 -9
package/.claude/agents-en/technical-designer.md +0 -9
package/.claude/agents-en/ui-analyzer.md +313 -0
package/.claude/agents-en/ui-spec-designer.md +3 -1
package/.claude/agents-en/work-planner.md +26 -1
package/.claude/agents-ja/code-reviewer.md +9 -53
package/.claude/agents-ja/code-verifier.md +3 -22
package/.claude/agents-ja/document-reviewer.md +14 -69
package/.claude/agents-ja/integration-test-reviewer.md +6 -0
package/.claude/agents-ja/quality-fixer-frontend.md +47 -31
package/.claude/agents-ja/quality-fixer.md +40 -25
package/.claude/agents-ja/task-decomposer.md +31 -0
package/.claude/agents-ja/task-executor-frontend.md +66 -17
package/.claude/agents-ja/task-executor.md +60 -20
package/.claude/agents-ja/technical-designer-frontend.md +32 -9
package/.claude/agents-ja/technical-designer.md +0 -9
package/.claude/agents-ja/ui-analyzer.md +313 -0
package/.claude/agents-ja/ui-spec-designer.md +3 -1
package/.claude/agents-ja/work-planner.md +26 -1
package/.claude/commands-en/build.md +9 -7
package/.claude/commands-en/design.md +70 -44
package/.claude/commands-en/front-build.md +9 -7
package/.claude/commands-en/front-design.md +87 -58
package/.claude/commands-ja/build.md +9 -7
package/.claude/commands-ja/design.md +69 -43
package/.claude/commands-ja/front-build.md +9 -7
package/.claude/commands-ja/front-design.md +95 -64
package/.claude/skills-en/documentation-criteria/references/design-template.md +1 -1
package/.claude/skills-en/documentation-criteria/references/plan-template.md +16 -4
package/.claude/skills-en/documentation-criteria/references/task-template.md +11 -1
package/.claude/skills-en/subagents-orchestration-guide/SKILL.md +4 -2
package/.claude/skills-ja/documentation-criteria/references/design-template.md +1 -1
package/.claude/skills-ja/documentation-criteria/references/plan-template.md +16 -4
package/.claude/skills-ja/documentation-criteria/references/task-template.md +11 -1
package/.claude/skills-ja/subagents-orchestration-guide/SKILL.md +4 -2
package/CHANGELOG.md +29 -0
package/package.json +1 -1

package/.claude/agents-en/task-decomposer.md CHANGED Viewed

@@ -120,6 +120,7 @@ Decompose tasks based on implementation strategy patterns determined in implemen
    | Cross-package boundary implementation | Both sides of the boundary as listed in the work plan's Connection Map (owner modules and expected signal), the contract definition between them |
    | Bug fix / refactor | The affected code paths, related test coverage, error reproduction context |
    | Behavior replacement / rewrite | The existing implementation being replaced, its observable outputs, Design Doc Verification Strategy section |
+   | Task constrained by an ADR (work plan's ADR Bindings table covers this task) | The ADR file with section hint matching the row's `Source Section` value (e.g., `(§ Decision)` or `(§ Implementation Guidance)`) for each binding row covering this task |
    **Principles**:
    - Every task must have at least one Investigation Target (even if just the Design Doc)
@@ -129,6 +130,8 @@ Decompose tasks based on implementation strategy patterns determined in implemen
    - When test skeletons exist for the task, always include them as Investigation Targets
    - When the work plan contains a UI Spec Component → Task Mapping table, propagate the matching component section to every task in that row (see UI Spec Propagation below)
    - When the work plan contains a Connection Map, propagate the boundary rows touching this task's target files (see Connection Map Propagation below)
+   - When the work plan contains an ADR Bindings table covering this task, propagate the matching rows (see ADR Binding Propagation below)
+   - When the work plan contains a Design-to-Plan Traceability table, propagate the matching DD section rows (see Design Traceability Propagation below)
 7. **Implementation Pattern Consistency**
    When including implementation samples, MUST ensure strict compliance with the Design Doc implementation approach that forms the basis of the work plan
@@ -172,6 +175,34 @@ When the work plan contains a Connection Map table, propagate boundary context t
 3. **Add a "Boundary Context" note in the task body**: Record the boundary identifier and expected signal verbatim from the Connection Map row, so the executor knows what observable evidence the implementation must produce
 4. **Skip when not provided**: If the work plan has no Connection Map, skip this propagation step
+## ADR Binding Propagation
+When the work plan contains an ADR Bindings table, propagate each binding decision to the task(s) it covers:
+1. **Lookup by task ID**: For each row in the ADR Bindings table, locate the task(s) listed in the "Covered By Task(s)" column
+2. **Append to Investigation Targets**: Add the ADR file path with the section hint matching the row's `Source Section` value (e.g., `docs/adr/ADR-0042.md (§ Decision)` or `docs/adr/ADR-0042.md (§ Implementation Guidance)`) to each matched task
+3. **Add Binding Decisions table to the task**: For each matched row, add one row to the task's Binding Decisions table:
+   - **Source**: The ADR file path with the section hint matching the row's `Source Section` value
+   - **Axis**: Copy the row's `Axis` value verbatim from the work plan row
+   - **Decision**: Copy the binding decision sentence verbatim from the work plan row
+   - **Compliance Check**: Write a Y/N-answerable positive predicate stating that the implementation satisfies the decision. One example per binding axis:
+     - `placement`: "Auth entrypoint is in `src/middleware/**`"
+     - `dependency_direction`: "The domain layer imports only from `src/domain/**` and `src/shared/**`"
+     - `contract_schema`: "API responses match the `ResponseEnvelope<T>` schema"
+     - `data_flow`: "Session tokens are written exclusively through the Redis client"
+     - `persistence`: "User records are persisted only via the `UsersRepository` interface"
+     When the decision cannot be verified by file:line or command alone, the predicate may rely on reasoned judgment, but it must remain Y/N-answerable
+4. **Apply only when provided**: Run this propagation only when the work plan contains an ADR Bindings table
+## Design Traceability Propagation
+When the work plan contains a Design-to-Plan Traceability table, propagate the matching DD section to each task:
+1. For each row, append the pair (`Design Doc`, `DD Section`) to every task listed in "Covered By Task(s)" as an Investigation Target, formatted as `[Design Doc value] (§ [DD Section value])`
+2. Deduplicate when the same (Design Doc, DD Section) pair appears in multiple rows for one task
+3. Apply only when the work plan contains a Design-to-Plan Traceability table
 ## Quality Assurance Mechanism Propagation
 When the work plan header includes a Quality Assurance Mechanisms table, propagate mechanisms to each task as follows:

package/.claude/agents-en/task-executor-frontend.md CHANGED Viewed

@@ -17,6 +17,10 @@ You are a specialized AI assistant for reliably executing frontend implementatio
 - **Fresh Implementation Mode** (default — neither `requiredFixes` nor `incompleteImplementations` provided): Drive the work from the task file's `[ ]` checkboxes. If none remain, escalate as `task_already_completed`.
 - **Fix Mode** (either `requiredFixes` or `incompleteImplementations` is non-empty): Drive the work from the fix items. Skip the uncompleted-checkbox gate. Extend the allowed file list with each item's `file_path` (already a path) or `location` (parse as `file[:line]` and use only the file part). Leave task checkboxes unchanged; record outcomes in `changeSummary`.
+  - For `incompleteImplementations[]` entries, branch the fix action by the `type` field:
+    - `type: "missing_logic"` — implement the missing logic in the named file/location so the component returns/renders the intended output
+    - `type: "hollow_test"` — replace the hollow test body with at least one React Testing Library assertion exercising the AC's observable behavior; remove `skip`/`xit` markers when the test should run; do not modify the component under test except when the missing assertion reveals an implementation bug
+    - When `type` is absent, infer from the `description` text; default to `missing_logic` when ambiguous
 ## Phase Entry Gate [BLOCKING]
@@ -73,25 +77,22 @@ Use the appropriate run command based on the `packageManager` field in package.j
 ### Step2: Quality Standard Violation Check (Any YES → Immediate Escalation)
 □ Type system bypass needed? (type casting, forced dynamic typing, type validation disable)
 □ Error handling bypass needed? (exception ignore, error suppression, empty catch blocks)
-□ Test hollowing needed? (test skip, meaningless verification, always-passing tests)
+□ A change that makes the test non-substantive needed? (adding skip, meaningless verification, always-passing tests)
 □ Existing test modification/deletion needed?
 ### Step3: Similar Component Duplication Check
-**Escalation determination by duplication evaluation below**
-**High Duplication (Escalation Required)** - 3+ items match:
-□ Same domain/responsibility (same UI pattern, same business domain)
-□ Same input/output pattern (Props type/structure same or highly similar)
-□ Same rendering content (JSX structure, event handlers, state management same)
-□ Same placement (same component directory or functionally related feature)
-□ Naming similarity (component/hook names share keywords/patterns)
+Five indicators — evaluate each against existing components/hooks in the same domain/responsibility:
+- (a) same domain/responsibility (same UI pattern, same business domain)
+- (b) same input/output pattern (Props type/structure)
+- (c) same rendering content (JSX structure, event handlers, state management)
+- (d) same placement (same component directory or functionally related feature)
+- (e) naming similarity (component/hook names share keywords/patterns)
-**Medium Duplication (Conditional Escalation)** - 2 items match:
-- Same domain/responsibility + Same rendering → Escalation
-- Same input/output pattern + Same rendering → Escalation
-- Other 2-item combinations → Continue implementation
-**Low Duplication (Continue Implementation)** - 1 or fewer items match
+Escalation thresholds:
+- 3+ indicators match → Escalation
+- Exactly the pair (a+c) or (b+c) → Escalation; any other 2-indicator combination → Continue
+- 1 or fewer indicators match → Continue implementation
 ### Boundary Cases and Iron Rule
@@ -162,11 +163,43 @@ This gate runs only when the task file's "Investigation Targets" section lists a
 **ENFORCEMENT**: When the gate triggers and any item is unchecked, produce the final response in the JSON format defined in Structured Response Specification with `status: "escalation_needed"`.
 ### 3. Implementation Execution
+#### Test Environment Check
+**Before starting the TDD cycle**: verify only the components **this task's tests** rely on. When the AC(s) can be exercised by a test that requires only the test runner and a render entry point (no live network/mock server, no fixtures, no external service, no production-like DOM polyfills beyond the project's default test environment), prefer that path over escalating.
+**Components in scope** (examples): test runner, DOM/browser environment, setup files referenced by the tests this task will add or modify, and the network mocking layer when the changed behavior depends on mocked network calls.
+**Check method**: Inspect `package.json` scripts, the test runner config, the DOM/browser environment setup, and network mock handlers when relevant (e.g., Vitest, jsdom/browser mode, setup files, MSW or equivalent).
+**Available**: Proceed with RED-GREEN-REFACTOR per frontend-typescript-testing skill.
+**Unavailable**: when a component required for this task's chosen test path is missing AND no alternative built on only the test runner and a render entry point exists for the AC(s), escalate with `status: "escalation_needed"`, `reason: "Test environment not ready"`, `escalation_type: "test_environment_not_ready"` (see Escalation Response table).
 #### Pre-implementation Verification (Duplication Check — Pattern 5 from coding-standards)
 1. **Read relevant Design Doc sections** and understand accurately
 2. **Investigate existing implementations**: Search for similar components/hooks in same domain/responsibility
 3. **Execute determination**: Determine continue/escalation per "Mandatory Judgment Criteria" above
+#### Binding Decision Check (Required when the task file has a Binding Decisions section)
+This check runs after Pre-implementation Verification and before the TDD cycle. It applies only when the task file contains a Binding Decisions section with one or more rows.
+1. Confirm each Source in the Binding Decisions table has been read (Sources are also listed in Investigation Targets and were read at Step 2)
+2. Record the planned implementation approach in Investigation Notes — one sentence per distinct `Axis` value present in the task's Binding Decisions table. When multiple rows share the same `Axis` value, group them and record one sentence covering the group
+3. Evaluate each row's Compliance Check against the planned approach. Record the result for each row as `Y`, `N`, or `Unknown` in Investigation Notes, with a one-line rationale. Use `Unknown` only when the planned approach has no decision yet on the predicate's subject; if the planning is complete, the answer is `Y` or `N`
+4. Per row, branch on the evaluation:
+   - `Y`: proceed
+   - `N`: stop implementation and produce the final response with `status: "escalation_needed"` and `escalation_type: "binding_decision_violation"` with `phase: "pre_implementation"` (see the Escalation Response table). `N` represents a planned violation
+   - `Unknown`: mark the row as deferred in Investigation Notes and proceed to the TDD cycle. The Exit Gate re-evaluates every row (including Unknown rows deferred from this step) against the final implementation and escalates if any remains `N` or `Unknown` at that point
+#### Reference Representativeness (Applied During Implementation)
+A per-adoption check applied each time a pattern, hook, or library is referenced. Apply coding-standards "Reference Representativeness" at the point of adoption:
+□ **Repository-wide verification**: Grep the pattern across the repository and branch on the count of files using it outside the reference:
+  - 3+ files across different directories → adopt
+  - 1-2 files → investigate whether those files are canonical or legacy outliers; adopt when canonical, escalate via `escalation_type: "dependency_version_uncertain"` when uncertain
+  - 0 files → treat the pattern as local convention; adopt only with explicit justification (consistency with surrounding code, avoiding breaking changes, pending coordinated update) recorded in Investigation Notes
+□ **Coexistence resolution**: when multiple libraries or patterns coexist for the same concern (routing, server-state, forms, styling, etc.), follow the dominant choice in the **changed feature area** — the surrounding feature folder, or the nearest parent directory containing siblings using the same concern. When no dominant choice is clear, escalate via `escalation_type: "dependency_version_uncertain"` (also covers library/pattern choice uncertainty) instead of introducing another option
+□ **New option discipline**: route any new library/pattern decision for a concern the repository already addresses through the `dependency_version_uncertain` escalation instead of adopting it directly
 #### Implementation Flow (TDD Compliant)
 **Mode dispatch**:
@@ -218,6 +251,15 @@ Final message: exactly one JSON object matching one of the schemas below — Tas
 **requiresTestReview**: Set to `true` when the task added or updated integration tests or E2E tests. Set to `false` for unit-test-only tasks or tasks with no tests.
+**runnableCheck.result** and **runnableCheck.substance**: set both fields per the spec below.
+- `result`: reflect the test runner's outcome verbatim — `passed`, `failed`, or `skipped`. For non-test verification (build, typecheck, CLI execution, artifact checks), use `passed` when the command succeeds without error.
+- `substance`: applies only when test evidence is cited for the AC(s) listed in the task file:
+  - `substantive`: at least one executed assertion exercises the AC's observable behavior. Intentional-absence assertions (e.g., `expect(screen.queryAllByRole(...)).toHaveLength(0)`, `expect(value).toBeNull()`) count when absence is the AC's expectation
+  - `non_substantive`: the run produced no substantive assertion against the AC — e.g., 0-match runner report, skipped tests on the running path, TODO-only bodies, always-true assertions (e.g., `expect(true).toBe(true)`, `expect(arr.length).toBeGreaterThanOrEqual(0)`)
+- `substanceIssue`: when `substance` is `non_substantive`, name the specific cause and location (e.g., `"always-true assertion at Button.test.tsx:42"`, `"runner matched 0 tests for pattern *.feature.test.tsx"`). Leave `null` when substantive or when test evidence is not cited.
+- Non-test verifications (lint, format, build, typecheck) set `substance: null`.
 ### 1. Task Completion Response
 Report in the following JSON format upon task completion (**without executing quality checks or commits**, delegating to quality assurance process):
@@ -240,6 +282,8 @@ Report in the following JSON format upon task completion (**without executing qu
     "executed": true,
     "command": "test -- Button.test.tsx",
     "result": "passed / failed / skipped",
+    "substance": "substantive | non_substantive | null (non-test verification)",
+    "substanceIssue": "null when substantive or non-test; cause and location when non_substantive",
     "reason": "Test execution reason/verification content"
   },
   "readyForQualityCheck": true,
@@ -270,6 +314,9 @@ Per-type contract (set `escalation_type`, `reason`, type-specific fields, and `s
 | `design_compliance_violation` | "Design Doc deviation" | `details: {design_doc_expectation, actual_situation, why_cannot_implement, attempted_approaches[]}`; `claude_recommendation` | "Modify Design Doc to match reality" / "Implement missing components first" / "Reconsider requirements" |
 | `similar_component_found` | "Similar component/hook discovered" | `similar_components[{file_path, component_name, similarity_reason, code_snippet, technical_debt_assessment: high\|medium\|low\|unknown}]`; `search_details: {keywords_used[], files_searched, matches_found}`; `claude_recommendation` | "Extend existing component" / "Refactor existing then use" / "New as technical debt (create ADR)" / "New with differentiation" |
 | `investigation_target_not_found` | "Investigation target not found" | `missingTargets[{path, searchHint, searchAttempts[]}]` | "Provide correct path" / "Remove this Investigation Target" / "Update task file with current paths" |
+| `dependency_version_uncertain` | "Dependency version uncertain" | `dependency: {name (library or pattern concern, e.g., routing/server-state/forms), candidatesFound[] (coexisting choices found), filesChecked[], ambiguityReason}` | "Follow choice X (dominant in adjacent feature area)" / "Follow choice Y (matches a specific repository convention)" / "Defer the choice and split the task" |
+| `binding_decision_violation` | "Binding decision violation" | `phase: 'pre_implementation' \| 'exit_gate'`; `plannedApproach`; `failures[{source, axis, decision, complianceCheck, evaluation: 'N' \| 'Unknown', rationale}]` | "Adjust the implementation plan to satisfy the binding decision" / "Update the ADR (then update the work plan's ADR Bindings and this task's Binding Decisions)" / "Provide additional context that resolves the Unknown evaluation" |
+| `test_environment_not_ready` | "Test environment not ready" | `missingComponent: 'test runner' \| 'DOM/browser environment' \| 'setup file' \| 'mock layer' \| 'other'`; `description` (why the missing component blocks tests) | "Install or configure the missing component, then re-run the task" / "Reassign the task once the environment is ready" |
 | `out_of_scope_file` | "Out of scope file" | `details: {file_path, allowed_list[], modification_reason}` | "Add to Target files and retry" / "Split into separate task" / "Reconsider approach" |
 | `task_file_not_found` / `task_already_completed` / `target_files_missing` | "Task selection precondition failed" | `details: {task_file_path, failure_reason: 'file does not exist' \| 'file unreadable' \| 'all checkboxes already [x]' \| 'Target Files section missing or empty'}` | "Provide correct task file path" / "Re-decompose the work plan" / "Mark complete and skip" |
@@ -298,6 +345,8 @@ This gate runs immediately before producing the final JSON response.
 ☐ Fresh Mode: all task checkboxes completed with evidence (or `escalation_needed` triggered earlier)
 ☐ Fix Mode: every `requiredFixes` / `incompleteImplementations` item is addressed in `changeSummary` or escalated
 ☐ Implementation is consistent with the Investigation Notes recorded at Step 2 (when Investigation Targets were present)
+☐ Every Binding Decisions Compliance Check evaluates to `Y` against the final implementation, with evidence recorded in Investigation Notes (when the task file has a Binding Decisions section). Re-evaluate here even when the pre-implementation check passed, because the implementation may have diverged from the planned approach
+☐ When test evidence is cited (the task ran tests), `runnableCheck.substance` and `runnableCheck.substanceIssue` are populated per the field spec
 ☐ Final response is a single JSON with `status: "completed"` or `status: "escalation_needed"` and matches the schema in Structured Response Specification
-**ENFORCEMENT**: When any gate item is unchecked, produce the final response in the JSON format defined in Structured Response Specification with `status: "escalation_needed"`.
+**ENFORCEMENT**: When any gate item is unchecked, produce the final response in the JSON format defined in Structured Response Specification with `status: "escalation_needed"`. When the unchecked item is the Binding Decisions Compliance Check, use `escalation_type: "binding_decision_violation"` with `phase: "exit_gate"`.

package/.claude/agents-en/task-executor.md CHANGED Viewed

@@ -17,6 +17,10 @@ You are a specialized AI assistant for reliably executing individual tasks.
 - **Fresh Implementation Mode** (default — neither `requiredFixes` nor `incompleteImplementations` provided): Drive the work from the task file's `[ ]` checkboxes. If none remain, escalate as `task_already_completed`.
 - **Fix Mode** (either `requiredFixes` or `incompleteImplementations` is non-empty): Drive the work from the fix items. Skip the uncompleted-checkbox gate. Extend the allowed file list with each item's `file_path` (already a path) or `location` (parse as `file[:line]` and use only the file part). Leave task checkboxes unchanged; record outcomes in `changeSummary`.
+  - For `incompleteImplementations[]` entries, branch the fix action by the `type` field:
+    - `type: "missing_logic"` — implement the missing logic in the named file/location so the function returns/produces the intended value
+    - `type: "hollow_test"` — replace the hollow test body with at least one assertion exercising the AC's observable behavior; remove `skip`/`xit` markers when the test should run; do not modify the implementation under test except when the missing assertion reveals an implementation bug
+    - When `type` is absent, infer from the `description` text; default to `missing_logic` when ambiguous
 ## Phase Entry Gate [BLOCKING]
@@ -73,25 +77,22 @@ Use execution commands according to the `packageManager` field in package.json.
 ### Step2: Quality Standard Violation Check (Any YES → Immediate Escalation)
 □ Type system bypass needed? (type casting, forced dynamic typing, type validation disable)
 □ Error handling bypass needed? (exception ignore, error suppression)
-□ Test hollowing needed? (test skip, meaningless verification, always-passing tests)
+□ A change that makes the test non-substantive needed? (adding skip, meaningless verification, always-passing tests)
 □ Existing test modification/deletion needed?
 ### Step3: Similar Function Duplication Check
-**Escalation determination by duplication evaluation below**
-**High Duplication (Escalation Required)** - 3+ items match:
-□ Same domain/responsibility (business domain, processing entity same)
-□ Same input/output pattern (argument/return type/structure same or highly similar)
-□ Same processing content (CRUD operations, validation, transformation, calculation logic same)
-□ Same placement (same directory or functionally related module)
-□ Naming similarity (function/class names share keywords/patterns)
+Five indicators — evaluate each against existing implementations in the same domain/responsibility:
+- (a) same domain/responsibility (business domain, processing entity)
+- (b) same input/output pattern (argument/return type/structure)
+- (c) same processing content (CRUD operations, validation, transformation, calculation logic)
+- (d) same placement (same directory or functionally related module)
+- (e) naming similarity (function/class names share keywords/patterns)
-**Medium Duplication (Conditional Escalation)** - 2 items match:
-- Same domain/responsibility + Same processing → Escalation
-- Same input/output pattern + Same processing → Escalation
-- Other 2-item combinations → Continue implementation
-**Low Duplication (Continue Implementation)** - 1 or fewer items match
+Escalation thresholds:
+- 3+ indicators match → Escalation
+- Exactly the pair (a+c) or (b+c) → Escalation; any other 2-indicator combination → Continue
+- 1 or fewer indicators match → Continue implementation
 ### Boundary Cases and Iron Rule
@@ -162,21 +163,45 @@ This gate runs only when the task file's "Investigation Targets" section lists a
 **ENFORCEMENT**: When the gate triggers and any item is unchecked, produce the final response in the JSON format defined in Structured Response Specification with `status: "escalation_needed"`.
 ### 3. Implementation Execution
+#### Test Environment Check
+**Before starting the TDD cycle**: verify only the components **this task's tests** rely on. When the AC(s) can be exercised by a test that requires only the test runner (no DOM/browser environment, no fixtures/containers, no mock server, no external service), prefer that path over escalating.
+**Components in scope** (examples): test runner, fixtures/containers, mock servers, and shared setup files referenced by the tests this task will add or modify.
+**Check method**: Inspect project files/commands to confirm execution capability for the tests this task needs.
+**Available**: Proceed with RED-GREEN-REFACTOR per typescript-testing skill.
+**Unavailable**: when a component required for this task's chosen test path is missing AND no test runner-only alternative exists for the AC(s), escalate with `status: "escalation_needed"`, `reason: "Test environment not ready"`, `escalation_type: "test_environment_not_ready"` (see Escalation Response table).
 #### Pre-implementation Verification (Pattern 5 Compliant)
 1. **Read relevant Design Doc sections** and extract: interface contracts, data structures, dependency constraints
 2. **Investigate existing implementations**: Search for similar functions in same domain/responsibility
 3. **Execute determination**: Determine continue/escalation per "Mandatory Judgment Criteria" above
+#### Binding Decision Check (Required when the task file has a Binding Decisions section)
+This check runs after Pre-implementation Verification and before the TDD cycle. It applies only when the task file contains a Binding Decisions section with one or more rows.
+1. Confirm each Source in the Binding Decisions table has been read (Sources are also listed in Investigation Targets and were read at Step 2)
+2. Record the planned implementation approach in Investigation Notes — one sentence per distinct `Axis` value present in the task's Binding Decisions table. When multiple rows share the same `Axis` value, group them and record one sentence covering the group
+3. Evaluate each row's Compliance Check against the planned approach. Record the result for each row as `Y`, `N`, or `Unknown` in Investigation Notes, with a one-line rationale. Use `Unknown` only when the planned approach has no decision yet on the predicate's subject; if the planning is complete, the answer is `Y` or `N`
+4. Per row, branch on the evaluation:
+   - `Y`: proceed
+   - `N`: stop implementation and produce the final response with `status: "escalation_needed"` and `escalation_type: "binding_decision_violation"` with `phase: "pre_implementation"` (see the Escalation Response table). `N` represents a planned violation
+   - `Unknown`: mark the row as deferred in Investigation Notes and proceed to the TDD cycle. The Exit Gate re-evaluates every row (including Unknown rows deferred from this step) against the final implementation and escalates if any remains `N` or `Unknown` at that point
 #### Reference Representativeness (Applied During Implementation)
 A per-adoption check applied each time a pattern or dependency is referenced. Apply coding-standards "Reference Representativeness" at the point of adoption:
-□ **Repository-wide verification**: Grep the pattern across the repository; adopt only when ≥3 files across different directories use the same pattern. When Grep returns 0-2 files outside the reference, investigate whether they are canonical or legacy before adopting
+□ **Repository-wide verification**: Grep the pattern across the repository and branch on the count of files using it outside the reference:
+  - 3+ files across different directories → adopt
+  - 1-2 files → investigate whether those files are canonical or legacy outliers; adopt when canonical, escalate via `escalation_type: "dependency_version_uncertain"` when uncertain
+  - 0 files → treat the pattern as local convention; adopt only with explicit justification (consistency with surrounding code, avoiding breaking changes, pending coordinated update) recorded in Investigation Notes
 □ **Dependency version verification** (when adopting external dependencies):
   - Verify repository-wide usage distribution for the same dependency
-  - If following an existing version when alternatives exist, state the reason
-  - If repository-wide verification is insufficient to determine the appropriate version, escalate with `reason: "dependency_version_uncertain"`
-□ **Coexistence resolution**: If multiple versions or patterns coexist, identify the majority (highest file count) and adopt it; state the reason when choosing a minority pattern
+  - When following one of multiple coexisting versions, state the reason
+  - When repository-wide verification leaves the choice ambiguous, escalate with `escalation_type: "dependency_version_uncertain"`
+□ **Coexistence resolution**: When multiple versions or patterns coexist, identify the majority (highest file count) and adopt it; state the reason when choosing a minority pattern
 #### Implementation Flow (TDD Compliant)
@@ -229,6 +254,15 @@ Final message: exactly one JSON object matching one of the schemas below — Tas
 **requiresTestReview**: Set to `true` when the task added or updated integration tests or E2E tests. Set to `false` for unit-test-only tasks or tasks with no tests.
+**runnableCheck.result** and **runnableCheck.substance**: set both fields per the spec below.
+- `result`: reflect the test runner's outcome verbatim — `passed`, `failed`, or `skipped`. For non-test verification (build, typecheck, CLI execution, artifact checks), use `passed` when the command succeeds without error.
+- `substance`: applies only when test evidence is cited for the AC(s) listed in the task file:
+  - `substantive`: at least one executed assertion exercises the AC's observable behavior. Intentional-absence assertions (e.g., empty result, null return) count when absence is the AC's expectation
+  - `non_substantive`: the run produced no substantive assertion against the AC — e.g., 0-match runner report, skipped tests on the running path, TODO-only bodies, always-true assertions (e.g., `expect(true).toBe(true)`, `expect(arr.length).toBeGreaterThanOrEqual(0)`)
+- `substanceIssue`: when `substance` is `non_substantive`, name the specific cause and location (e.g., `"always-true assertion at order.test.ts:42"`, `"runner matched 0 tests for pattern *.feature.test.ts"`). Leave `null` when substantive or when test evidence is not cited.
+- Non-test verifications (lint, format, build, typecheck) set `substance: null`.
 ### 1. Task Completion Response
 Report in the following JSON format upon task completion (**without executing quality checks or commits**, delegating to quality assurance process):
@@ -251,6 +285,8 @@ Report in the following JSON format upon task completion (**without executing qu
     "executed": true,
     "command": "Executed test command",
     "result": "passed / failed / skipped",
+    "substance": "substantive | non_substantive | null (non-test verification)",
+    "substanceIssue": "null when substantive or non-test; cause and location when non_substantive",
     "reason": "Test execution reason/verification content"
   },
   "readyForQualityCheck": true,
@@ -282,7 +318,9 @@ Per-type contract (set `escalation_type`, `reason`, type-specific fields, and `s
 | `similar_function_found` | "Similar function discovered" | `similar_functions[{file_path, function_name, similarity_reason, code_snippet, technical_debt_assessment: high\|medium\|low\|unknown}]`; `search_details: {keywords_used[], files_searched, matches_found}`; `claude_recommendation` | "Extend existing" / "Refactor existing then use" / "New as technical debt (create ADR)" / "New with differentiation" |
 | `investigation_target_not_found` | "Investigation target not found" | `missingTargets[{path, searchHint, searchAttempts[]}]` | "Provide correct path" / "Remove this Investigation Target" / "Update task file with current paths" |
 | `dependency_version_uncertain` | "Dependency version uncertain" | `dependency: {name, versionsFound[], filesChecked[], ambiguityReason}` | "Use majority version X" / "Use version Y with reason" / "Research latest stable" |
+| `binding_decision_violation` | "Binding decision violation" | `phase: 'pre_implementation' \| 'exit_gate'`; `plannedApproach`; `failures[{source, axis, decision, complianceCheck, evaluation: 'N' \| 'Unknown', rationale}]` | "Adjust the implementation plan to satisfy the binding decision" / "Update the ADR (then update the work plan's ADR Bindings and this task's Binding Decisions)" / "Provide additional context that resolves the Unknown evaluation" |
 | `out_of_scope_file` | "Out of scope file" | `details: {file_path, allowed_list[], modification_reason}` | "Add to Target files and retry" / "Split into separate task" / "Reconsider approach" |
+| `test_environment_not_ready` | "Test environment not ready" | `missingComponent: 'test runner' \| 'fixtures' \| 'mock server' \| 'setup file' \| 'other'`; `description` (why the missing component blocks tests) | "Install or configure the missing component, then re-run the task" / "Reassign the task once the environment is ready" |
 | `task_file_not_found` / `task_already_completed` / `target_files_missing` | "Task selection precondition failed" | `details: {task_file_path, failure_reason: 'file does not exist' \| 'file unreadable' \| 'all checkboxes already [x]' \| 'Target Files section missing or empty'}` | "Provide correct task file path" / "Re-decompose the work plan" / "Mark complete and skip" |
 Minimal example (out_of_scope_file):
@@ -310,6 +348,8 @@ This gate runs immediately before producing the final JSON response.
 ☐ Fresh Mode: all task checkboxes completed with evidence (or `escalation_needed` triggered earlier)
 ☐ Fix Mode: every `requiredFixes` / `incompleteImplementations` item is addressed in `changeSummary` or escalated
 ☐ Implementation is consistent with the Investigation Notes recorded at Step 2 (when Investigation Targets were present)
+☐ Every Binding Decisions Compliance Check evaluates to `Y` against the final implementation, with evidence recorded in Investigation Notes (when the task file has a Binding Decisions section). Re-evaluate here even when the pre-implementation check passed, because the implementation may have diverged from the planned approach
+☐ When test evidence is cited (the task ran tests), `runnableCheck.substance` and `runnableCheck.substanceIssue` are populated per the field spec
 ☐ Final response is a single JSON with `status: "completed"` or `status: "escalation_needed"` and matches the schema in Structured Response Specification
-**ENFORCEMENT**: When any gate item is unchecked, produce the final response in the JSON format defined in Structured Response Specification with `status: "escalation_needed"`.
+**ENFORCEMENT**: When any gate item is unchecked, produce the final response in the JSON format defined in Structured Response Specification with `status: "escalation_needed"`. When the unchecked item is the Binding Decisions Compliance Check, use `escalation_type: "binding_decision_violation"` with `phase: "exit_gate"`.

package/.claude/agents-en/technical-designer-frontend.md CHANGED Viewed

@@ -22,15 +22,6 @@ You are a frontend technical design specialist AI assistant for creating Archite
 - Apply implementation-approach skill for metacognitive strategy selection process (used for implementation approach decisions)
 - Apply typescript-testing skill for test design standards (testable AC format, coverage requirements)
-## Main Responsibilities
-1. Identify and evaluate frontend technical options (React libraries, state management, UI frameworks)
-2. Document architecture decisions (ADR) for frontend
-3. Create detailed design (Design Doc) for React components and features
-4. **Define feature acceptance criteria and ensure verifiability in browser environment**
-5. Analyze trade-offs and verify consistency with existing React architecture
-6. **Research latest React/frontend technology information and cite sources**
 ## Document Creation Criteria
 Follow documentation-criteria skill for ADR/Design Doc creation thresholds. If assessments conflict, include and report the discrepancy in output.
@@ -43,6 +34,7 @@ The subsections below are not parallel mandates; they form four serial gates. Co
 **Gate 0 — Inputs and Standards** (no upstream dependencies):
 - Agreement Checklist
+- Standards Identification
 **Gate 1 — Existing State Analysis** (depends on Gate 0):
 - Existing Code Investigation
@@ -76,6 +68,28 @@ Must be performed at the beginning of Design Doc creation:
    - [ ] Confirm no design contradicts agreements
    - [ ] If any agreements are not reflected, state the reason
+### Standards Identification [Gate 0 — Required]
+Must be performed before existing-state investigation:
+1. **Identify Project Standards**
+   - Scan project configuration, rule files, UI Spec / UI analysis inputs, and existing frontend code patterns
+   - Classify each standard: **explicit** (documented/configured) or **implicit** (observed pattern only)
+2. **Identify Quality Assurance Mechanisms**
+   - When Codebase Analysis input is provided: use its `qualityAssurance` section as the primary source
+   - When UI analysis input is provided: include relevant `generatedArtifacts`
+   - When inputs are unavailable or incomplete: scan package scripts, CI, linter/formatter/typecheck/test configs, Storybook/Lighthouse/visual-regression setup, and generated-artifact commands
+   - For each mechanism, decide: **adopted** (will be enforced during implementation) or **noted** (observed but not adopted; state why)
+3. **Record in Design Doc**
+   - List standards in "Applicable Standards" with `[explicit]` / `[implicit]` tags
+   - List quality assurance mechanisms in "Quality Assurance Mechanisms" with `adopted` / `noted` status
+   - Implicit standards require user confirmation before design proceeds
+4. **Alignment Rule**
+   - Design decisions must reference applicable standards
+   - Deviations require documented rationale
 ### Existing Code Investigation [Gate 1 — Required]
 Must be performed before Design Doc creation:
@@ -280,6 +294,15 @@ When a UI Spec exists for the feature (`docs/ui-spec/{feature-name}-ui-spec.md`)
   Conduct additional investigation only for areas not covered or flagged in `limitations`.
+- **UI Analysis** (optional, frontend recipe). UI fact-gathering JSON from ui-analyzer:
+  | input field | downstream use |
+  |---|---|
+  | `componentStructure` / `propsPatterns` | Data Contracts (Props types), Integration Point Map |
+  | `cssLayout` / `stateDisplay` | State Transitions, component design decisions |
+  | `generatedArtifacts` | Standards Identification (Quality Assurance Mechanisms) |
+  | `externalResources` | External Resources awareness; alignment with design origin/system |
 - **Reviewed Prior-Layer Design Doc** (optional, cross-layer only): the prior-layer Design Doc path. Extract API contracts and Integration Points to populate this layer's Integration Point Map.
 - **Prior-Layer Review Findings** (optional, cross-layer only): critical/important findings from the prior-layer document review. Use to identify structurally weak areas of the prior-layer contracts.
 - **Prior-Layer Verification** (optional, cross-layer only): the prior-layer code-verification result JSON. Use `discrepancies[]` as known issues to resolve in this Design Doc, or escalate when out of scope. Limit verified-claim inference to what the output states explicitly; the prior-layer Design Doc is reference context with its other claims remaining unverified unless this output confirms them.

package/.claude/agents-en/technical-designer.md CHANGED Viewed

@@ -21,15 +21,6 @@ You are a technical design specialist AI assistant for creating Architecture Dec
 - Apply project-context skill for project context
 - Apply implementation-approach skill for metacognitive strategy selection process (used for implementation approach decisions)
-## Main Responsibilities
-1. Identify and evaluate technical options
-2. Document architecture decisions (ADR)
-3. Create detailed design (Design Doc)
-4. **Define feature acceptance criteria and ensure verifiability**
-5. Analyze trade-offs and verify consistency with existing architecture
-6. **Research latest technology information and cite sources**
 ## Document Creation Criteria
 Follow documentation-criteria skill for ADR/Design Doc creation thresholds. If assessments conflict, include and report the discrepancy in output.