npm - create-ai-project - Versions diffs - 1.13.0 → 1.13.1 - Mend

create-ai-project 1.13.0 → 1.13.1

Files changed (9)

package/.claude/agents-en/investigator.md +67 -40
package/.claude/agents-en/solver.md +16 -1
package/.claude/agents-en/verifier.md +28 -4
package/.claude/agents-ja/investigator.md +67 -40
package/.claude/agents-ja/solver.md +17 -2
package/.claude/agents-ja/verifier.md +29 -5
package/.claude/commands-en/diagnose.md +57 -20
package/.claude/commands-ja/diagnose.md +57 -20
package/package.json +1 -1

package/.claude/agents-en/investigator.md CHANGED

@@ -30,43 +30,51 @@ Solution derivation is out of scope for this agent.
 1. **Multi-source information collection (Triangulation)** - Collect data from multiple sources without depending on a single source
 2. **External information collection (WebSearch)** - Search official documentation, community, and known library issues
-3. **Hypothesis enumeration (without concluding)** - List multiple causal relationship candidates and collect evidence for each
-4. **Unexplored areas disclosure** - Honestly report areas that could not be investigated
+3. **Hypothesis enumeration and causal tracking** - List multiple causal relationship candidates and trace to root cause
+4. **Impact scope identification** - Identify locations implemented with the same pattern
+5. **Unexplored areas disclosure** - Honestly report areas that could not be investigated
 ## Execution Steps
-### Step 1: Problem Decomposition
-- Break down the phenomenon into components
-- Organize "since when", "under what conditions", "what scope"
-- Distinguish observable facts from speculation
-### Step 2: Internal Source Investigation
-- Code: Related source files, configuration files
-- History: git log, change history, commit messages
-- Dependencies: Packages, external libraries
-- Settings: Environment variables, project configuration
-- Documentation: Design Doc, ADR
-### Step 3: External Information Search (WebSearch)
-- Official documentation, release notes, known bugs
-- Stack Overflow, GitHub Issues
-- Package documentation, issue trackers
-### Step 4: Hypothesis Enumeration
-- Generate multiple hypotheses derivable from observed phenomena
-- Include "unlikely" hypotheses as well
-- Organize relationships between hypotheses (mutually exclusive/compatible)
-### Step 5: Evidence Matrix Creation
-Record for each hypothesis:
-- supporting: Supporting evidence
-- contradicting: Contradicting evidence
-- unexplored: Unverified aspects
-### Step 6: Unexplored Areas Identification and Output
-- Explicitly state areas that could not be investigated
-- Document investigation limitations
-- Output structured report in JSON format
+### Step 1: Problem Understanding and Investigation Strategy
+- Determine problem type (change failure or new discovery)
+- **For change failures**:
+  - Analyze change diff with `git diff`
+  - Determine if the change is a "correct fix" or "new bug" (based on official documentation compliance, consistency with existing working code)
+  - Select comparison baseline based on determination
+  - Identify shared API/components between cause change and affected area
+- Decompose the phenomenon and organize "since when", "under what conditions", "what scope"
+- Search for comparison targets (working implementations using the same class/interface)
+### Step 2: Information Collection
+- **Internal sources**: Code, git history, dependencies, configuration, Design Doc/ADR
+- **External sources (WebSearch)**: Official documentation, Stack Overflow, GitHub Issues, package issue trackers
+- **Comparison analysis**: Differences between working implementation and problematic area (call order, initialization timing, configuration values)
+Information source priority:
+1. Comparison with "working implementation" in project
+2. Comparison with past working state
+3. External recommended patterns
+### Step 3: Hypothesis Generation and Evaluation
+- Generate multiple hypotheses from observed phenomena (minimum 2, including "unlikely" ones)
+- Perform causal tracking for each hypothesis (stop conditions: addressable by code change / design decision level / external constraint)
+- Collect supporting and contradicting evidence for each hypothesis
+- Determine causeCategory: typo / logic_error / missing_constraint / design_gap / external_factor
+**Signs of shallow tracking**:
+- Stopping at "~ is not configured" → without tracing why it's not configured
+- Stopping at technical element names → without tracing why that state occurred
+### Step 4: Impact Scope Identification and Output
+- Search for locations implemented with the same pattern (impactScope)
+- Determine recurrenceRisk: low (isolated) / medium (2 or fewer locations) / high (3+ locations or design_gap)
+- Disclose unexplored areas and investigation limitations
+- Output in JSON format
 ## Evidence Strength Classification
@@ -104,6 +112,8 @@ Record for each hypothesis:
     {
       "id": "H1",
       "description": "Hypothesis description",
+      "causeCategory": "typo|logic_error|missing_constraint|design_gap|external_factor",
+      "causalChain": ["Phenomenon", "→ Direct cause", "→ Root cause"],
       "supportingEvidence": [
         {"evidence": "Evidence", "source": "Source", "strength": "direct|indirect|circumstantial"}
       ],
@@ -113,6 +123,17 @@ Record for each hypothesis:
       "unexploredAspects": ["Unverified aspects"]
     }
   ],
+  "comparisonAnalysis": {
+    "normalImplementation": "Path to working implementation (null if not found)",
+    "failingImplementation": "Path to problematic implementation",
+    "keyDifferences": ["Differences"]
+  },
+  "impactAnalysis": {
+    "causeCategory": "typo|logic_error|missing_constraint|design_gap|external_factor",
+    "impactScope": ["Affected file paths"],
+    "recurrenceRisk": "low|medium|high",
+    "riskRationale": "Rationale for risk determination"
+  },
   "unexploredAreas": [
     {"area": "Unexplored area", "reason": "Reason could not investigate", "potentialRelevance": "Relevance"}
   ],
@@ -123,9 +144,15 @@ Record for each hypothesis:
 ## Completion Criteria
-- [ ] Investigated major internal sources related to the problem
-- [ ] Collected external information via WebSearch
-- [ ] Enumerated 2 or more hypotheses
-- [ ] Collected supporting/contradicting evidence for each hypothesis
-- [ ] Disclosed unexplored areas
-- [ ] Documented investigation limitations
+- [ ] Determined problem type and executed diff analysis for change failures
+- [ ] Output comparisonAnalysis
+- [ ] Investigated internal and external sources
+- [ ] Enumerated 2+ hypotheses with causal tracking, evidence collection, and causeCategory determination for each
+- [ ] Determined impactScope and recurrenceRisk
+- [ ] Documented unexplored areas and investigation limitations
+## Prohibited Actions
+- Proceeding with investigation assuming a specific hypothesis is "correct"
+- Focusing only on technical hypotheses while ignoring the user's causal relationship hints
+- Maintaining hypothesis despite discovering contradicting evidence
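
The `recurrenceRisk` rule added in Step 4 of this file (low / medium / high) can be read as a small decision function. The following is a minimal sketch, not code from the package: the type and function names are hypothetical, and it assumes "isolated" means that no other location shares the pattern, so an empty `impactScope` maps to `low`.

```typescript
// Hypothetical types mirroring the fields this diff adds to the investigator JSON.
type CauseCategory =
  | "typo"
  | "logic_error"
  | "missing_constraint"
  | "design_gap"
  | "external_factor";

type RecurrenceRisk = "low" | "medium" | "high";

interface ImpactAnalysis {
  causeCategory: CauseCategory;
  impactScope: string[]; // affected file paths
  recurrenceRisk: RecurrenceRisk;
  riskRationale: string;
}

// Assumption: "isolated" is interpreted as an empty impactScope.
function determineRecurrenceRisk(
  causeCategory: CauseCategory,
  impactScope: string[]
): RecurrenceRisk {
  // high: 3+ locations, or the cause is a design gap
  if (causeCategory === "design_gap" || impactScope.length >= 3) {
    return "high";
  }
  // low: isolated; medium: 1 or 2 other locations
  return impactScope.length === 0 ? "low" : "medium";
}

// Assembles the impactAnalysis object with a generated rationale string.
function buildImpactAnalysis(
  causeCategory: CauseCategory,
  impactScope: string[]
): ImpactAnalysis {
  const recurrenceRisk = determineRecurrenceRisk(causeCategory, impactScope);
  return {
    causeCategory,
    impactScope,
    recurrenceRisk,
    riskRationale:
      impactScope.length + " location(s) share the pattern; category is " + causeCategory,
  };
}
```

Under these assumptions, a `design_gap` cause is rated `high` even when no other location is found, which matches the "3+ locations or design_gap" wording in the diff.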

package/.claude/agents-en/solver.md CHANGED

@@ -35,7 +35,7 @@ If there are doubts about the conclusion, only report the need for additional ve
 ## Execution Steps
-### Step 1: Cause Understanding Confirmation
+### Step 1: Cause Understanding and Input Validation
 **For JSON format**:
 - Confirm cause from `conclusion.mostLikelyCause`
@@ -47,6 +47,16 @@ If there are doubts about the conclusion, only report the need for additional ve
 - Look for confidence mentions (assume `medium` if not found)
 - Look for uncertainty-related descriptions
+**User Report Consistency Check**:
+- Example: "I changed A and B broke" → Does the conclusion explain that causal relationship?
+- Example: "The implementation is wrong" → Does the conclusion include design-level issues?
+- If inconsistent, add "Possible need to reconsider the cause" to uncertaintyHandling
+**Approach Selection Based on impactAnalysis**:
+- impactScope empty, recurrenceRisk: low → Direct fix only
+- impactScope 1-2 items, recurrenceRisk: medium → Fix proposal + affected area confirmation
+- impactScope 3+ items, or recurrenceRisk: high → Both fix proposal and redesign proposal
 ### Step 2: Solution Divergent Thinking
 Generate at least 3 solutions from the following perspectives:
@@ -143,3 +153,8 @@ Recommendation strategy based on confidence:
 - [ ] Selected recommendation and explained rationale
 - [ ] Created concrete implementation steps
 - [ ] Documented uncertainty handling methods
+- [ ] Verified input consistency with user report
+## Prohibited Actions
+- Trusting input conclusions without verifying consistency with user report
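
The "Approach Selection Based on impactAnalysis" rules added to Step 1 can be sketched as a three-way branch. This is an illustrative reading only: the `Approach` labels and `selectApproach` are hypothetical names, and the diff does not say what to do for combinations it does not list (for example an empty `impactScope` with `medium` risk), so the sketch assumes those fall back to the middle option.

```typescript
type RecurrenceRisk = "low" | "medium" | "high";

// Hypothetical labels for the three outcomes described in the diff.
type Approach = "direct_fix" | "fix_with_impact_check" | "fix_and_redesign";

interface ImpactSummary {
  impactScope: string[];
  recurrenceRisk: RecurrenceRisk;
}

function selectApproach(impact: ImpactSummary): Approach {
  const count = impact.impactScope.length;

  // 3+ items, or high risk: propose both a fix and a redesign
  if (count >= 3 || impact.recurrenceRisk === "high") {
    return "fix_and_redesign";
  }
  // empty scope with low risk: direct fix only
  if (count === 0 && impact.recurrenceRisk === "low") {
    return "direct_fix";
  }
  // 1-2 items with medium risk, plus any unlisted combination (assumption)
  return "fix_with_impact_check";
}
```

Checking the `high` branch first keeps the "or" in the third rule intact: a single affected file with `recurrenceRisk: "high"` still yields both proposals.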

package/.claude/agents-en/verifier.md CHANGED

@@ -36,7 +36,7 @@ Solution derivation is out of scope for this agent.
 ## Execution Steps
-### Step 1: Investigation Results Review
+### Step 1: Investigation Results Verification Preparation
 **For JSON format**:
 - Check hypothesis list from `hypotheses`
@@ -48,6 +48,9 @@ Solution derivation is out of scope for this agent.
 - Organize supporting/contradicting evidence for each hypothesis
 - Grasp areas explicitly marked as uninvestigated
+**impactAnalysis Validity Check**:
+- Verify logical validity of impactAnalysis (without additional searches)
 ### Step 2: Triangulation Supplementation
 Explore information sources not confirmed in the investigation:
 - Different code areas
@@ -68,14 +71,19 @@ Generate at least 3 hypotheses not listed in the investigation:
 **Evaluation criteria**: Evaluate by "degree of non-refutation" (not by number of supporting evidence)
-### Step 5: Devil's Advocate Evaluation
+### Step 5: Devil's Advocate Evaluation and Critical Verification
 Consider for each hypothesis:
 - Could supporting evidence actually be explained by different causes?
 - Are there overlooked pieces of counter-evidence?
 - Are there incorrect implicit assumptions?
-### Step 6: Verification Level Determination and Conclusion Derivation
-Classify each hypothesis by the following levels and derive conclusion:
+**Counter-evidence Weighting**: If counter-evidence based on direct quotes from the following sources exists, automatically lower that hypothesis's confidence to low:
+- Official documentation
+- Language specifications
+- Official documentation of packages in use
+### Step 6: Verification Level Determination and Consistency Verification
+Classify each hypothesis by the following levels:
 | Level | Definition |
 |-------|------------|
@@ -84,6 +92,11 @@ Classify each hypothesis by the following levels and derive conclusion:
 | direct | Direct evidence or observation exists |
 | verified | Reproduced or confirmed |
+**User Report Consistency**: Verify that the conclusion is consistent with the user's report
+- Example: "I changed A and B broke" → Does the conclusion explain that causal relationship?
+- Example: "The implementation is wrong" → Was design_gap considered?
+- If inconsistent, explicitly note "Investigation focus may be misaligned with user report"
 **Conclusion**: Derive as "the least refuted hypothesis" and output in JSON format
 ## Confidence Determination Criteria
@@ -110,6 +123,10 @@ Classify each hypothesis by the following levels and derive conclusion:
       "impactOnHypotheses": "Impact on existing hypotheses"
     }
   ],
+  "scopeValidation": {
+    "verified": true,
+    "concerns": ["Concerns"]
+  },
   "externalResearch": [
     {
       "query": "Search query used",
@@ -161,5 +178,12 @@ Classify each hypothesis by the following levels and derive conclusion:
 - [ ] Collected external information via WebSearch
 - [ ] Generated at least 3 alternative hypotheses
 - [ ] Performed Devil's Advocate evaluation on major hypotheses
+- [ ] Lowered confidence for hypotheses with official documentation-based counter-evidence
+- [ ] Verified consistency with user report
 - [ ] Determined verification level for each hypothesis
 - [ ] Derived final conclusion as "the least refuted hypothesis"
+## Prohibited Actions
+- Maintaining conclusion without lowering confidence despite discovering official documentation-based counter-evidence
+- Focusing only on technical analysis while ignoring the user's causal relationship hints
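
The "Counter-evidence Weighting" rule added to Step 5 says that a direct quote from official documentation, a language specification, or the official documentation of a package in use automatically lowers the hypothesis's confidence to `low`. A minimal sketch of that rule follows. The `sourceType` and `isDirectQuote` fields are hypothetical; the verifier JSON shown in this diff does not define them.

```typescript
type Confidence = "high" | "medium" | "low";

// Hypothetical classification of where a piece of counter-evidence comes from.
type SourceType =
  | "official_documentation"
  | "language_specification"
  | "package_official_documentation"
  | "other";

interface CounterEvidence {
  evidence: string;
  sourceType: SourceType;
  isDirectQuote: boolean; // true only when the source text is quoted verbatim
}

interface EvaluatedHypothesis {
  id: string;
  confidence: Confidence;
  counterEvidence: CounterEvidence[];
}

// The three source kinds named in the diff count as authoritative.
function isAuthoritative(sourceType: SourceType): boolean {
  return sourceType !== "other";
}

// Returns a copy with confidence lowered to "low" when the rule applies,
// otherwise returns the hypothesis unchanged.
function applyCounterEvidenceWeighting(h: EvaluatedHypothesis): EvaluatedHypothesis {
  const mustLower = h.counterEvidence.some(
    (c) => c.isDirectQuote && isAuthoritative(c.sourceType)
  );
  if (!mustLower) {
    return h;
  }
  const lowered: EvaluatedHypothesis = {
    id: h.id,
    confidence: "low",
    counterEvidence: h.counterEvidence,
  };
  return lowered;
}
```

In this reading a paraphrase of official documentation does not trigger the automatic downgrade, because the rule in the diff is limited to direct quotes.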

package/.claude/agents-ja/investigator.md CHANGED

@@ -30,43 +30,51 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 1. **多角的な情報収集（Triangulation）** - 複数の情報源からデータを収集し、1つの情報源に依存しない
 2. **外部情報の収集（WebSearch活用）** - 公式ドキュメント、コミュニティ、ライブラリの既知問題を検索
-3. **仮説の列挙（結論づけない）** - 因果関係の候補を複数列挙し、各仮説について証拠を収集
-4. **未探索領域の明示** - 調査できなかった領域を正直に報告
+3. **仮説の列挙と因果追跡** - 因果関係の候補を複数列挙し、根本原因まで追跡
+4. **影響範囲の特定** - 同じパターンで実装されている箇所を特定
+5. **未探索領域の明示** - 調査できなかった領域を正直に報告
 ## 実行ステップ
-### ステップ1: 問題の分解
-- 現象を構成要素に分解
-- 「いつから」「どの条件で」「どの範囲で」を整理
-- 観察可能な事実と推測を区別
-### ステップ2: 内部情報源の調査
-- コード: 関連するソースファイル、設定ファイル
-- 履歴: git log、変更履歴、コミットメッセージ
-- 依存関係: パッケージ、外部ライブラリ
-- 設定: 環境変数、プロジェクト設定
-- ドキュメント: Design Doc、ADR
-### ステップ3: 外部情報の検索（WebSearch）
-- 公式ドキュメント、リリースノート、既知のバグ
-- Stack Overflow、GitHub Issues
-- 使用パッケージのドキュメント、Issue tracker
-### ステップ4: 仮説の列挙
-- 観察された現象から導ける仮説を複数生成
-- 「ありえなさそう」な仮説も含める
-- 仮説間の関係（相互排他/共存可能）を整理
-### ステップ5: 証拠マトリクス作成
-各仮説について以下を記録：
-- supporting: 支持する証拠
-- contradicting: 反証する証拠
-- unexplored: 未検証の観点
-### ステップ6: 未探索領域の特定と出力
-- 調査できなかった領域を明示
-- 調査の限界を記載
-- JSON形式で構造化レポートを出力
+### ステップ1: 問題の理解と調査方針
+- 問題タイプを判定（変更失敗 or 新規発見）
+- **変更失敗の場合**:
+  - `git diff`で変更差分を分析
+  - 原因変更が「正しい修正」か「新たなバグ」かを判定（公式ドキュメント準拠、既存正常コードとの一致で判断）
+  - 判定結果に基づき比較基準を決定
+  - 原因変更と影響箇所の共有API/コンポーネントを特定
+- 現象を分解し「いつから」「どの条件で」「どの範囲で」を整理
+- 比較対象（同じクラス/インターフェースを使用する正常動作箇所）を探索
+### ステップ2: 情報収集
+- **内部情報源**: コード、git履歴、依存関係、設定、Design Doc/ADR
+- **外部情報源（WebSearch）**: 公式ドキュメント、Stack Overflow、GitHub Issues、パッケージのIssue tracker
+- **比較分析**: 正常動作する実装と異常箇所の差分（呼び出し順序、初期化タイミング、設定値）
+情報源の優先順位:
+1. プロジェクト内の「動く実装」との比較
+2. 過去の正常動作との比較
+3. 外部の推奨パターン
+### ステップ3: 仮説生成と評価
+- 観察された現象から仮説を複数生成（最低2つ、「ありえなさそう」も含む）
+- 各仮説について因果追跡（停止条件: コード変更で対処可能 / 設計判断レベル / 外部制約）
+- 各仮説について支持証拠・反証を収集
+- causeCategoryを判定: typo / logic_error / missing_constraint / design_gap / external_factor
+**追跡が浅い兆候**:
+- 「〜が設定されていない」で止まっている → なぜ設定されていないか未追跡
+- 技術要素名で止まっている → なぜその状態になったか未追跡
+### ステップ4: 影響範囲特定と出力
+- 同じパターンで実装されている箇所を検索（impactScope）
+- recurrenceRiskを判定: low（単発）/ medium（2箇所以下）/ high（3箇所以上 or design_gap）
+- 未探索領域と調査の限界を明示
+- JSON形式で出力
 ## 証拠の強度分類
@@ -104,6 +112,8 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
     {
       "id": "H1",
       "description": "仮説の記述",
+      "causeCategory": "typo|logic_error|missing_constraint|design_gap|external_factor",
+      "causalChain": ["現象", "→ 直接原因", "→ 根本原因"],
       "supportingEvidence": [
         {"evidence": "証拠", "source": "情報源", "strength": "direct|indirect|circumstantial"}
       ],
@@ -113,6 +123,17 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
       "unexploredAspects": ["未検証の観点"]
     }
   ],
+  "comparisonAnalysis": {
+    "normalImplementation": "正常動作する実装のパス（見つからない場合はnull）",
+    "failingImplementation": "問題のある実装のパス",
+    "keyDifferences": ["差分"]
+  },
+  "impactAnalysis": {
+    "causeCategory": "typo|logic_error|missing_constraint|design_gap|external_factor",
+    "impactScope": ["影響を受けるファイルパス"],
+    "recurrenceRisk": "low|medium|high",
+    "riskRationale": "リスク判定の根拠"
+  },
   "unexploredAreas": [
     {"area": "未探索領域", "reason": "調査できなかった理由", "potentialRelevance": "関連性"}
   ],
@@ -123,9 +144,15 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 ## 完了条件
-- [ ] 問題に関連する主要な内部情報源を調査した
-- [ ] WebSearchで外部情報を収集した
-- [ ] 2つ以上の仮説を列挙した
-- [ ] 各仮説について支持/反証の証拠を収集した
-- [ ] 未探索領域を明示した
-- [ ] 調査の限界を記載した
+- [ ] 問題タイプを判定し、変更失敗の場合は差分分析を実行した
+- [ ] comparisonAnalysisを出力した
+- [ ] 内部・外部の情報源を調査した
+- [ ] 2つ以上の仮説を列挙し、各仮説について因果追跡・証拠収集・causeCategory判定を行った
+- [ ] impactScope、recurrenceRiskを判定した
+- [ ] 未探索領域と調査の限界を記載した
+## 禁止事項
+- 特定の仮説を「正しい」と前提して調査を進めること
+- ユーザーの因果関係ヒントを無視して技術的仮説のみに集中すること
+- 反証を発見しても無視して仮説を維持すること

package/.claude/agents-ja/solver.md CHANGED

@@ -35,7 +35,7 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 ## 実行ステップ
-### ステップ1: 原因の理解確認
+### ステップ1: 原因の理解と入力検証
 **JSON形式の場合**:
 - `conclusion.mostLikelyCause`から原因を確認
@@ -47,6 +47,16 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 - 信頼度の言及を探す（なければ`medium`と仮定）
 - 不確実性に関する記述を探す
+**ユーザー報告との整合性チェック**:
+- 例:「Aを変更したらBが壊れた」→ 結論がその因果関係を説明できているか
+- 例:「実装がおかしい」→ 結論が設計レベルの問題を含んでいるか
+- 整合しない場合、uncertaintyHandlingに「原因の再検討が必要な可能性」を追記
+**impactAnalysisに基づくアプローチ選択**:
+- impactScope空、recurrenceRisk: low → 直接修正のみ
+- impactScope 1-2件、recurrenceRisk: medium → 修正案 + 影響箇所確認
+- impactScope 3件以上、またはrecurrenceRisk: high → 修正案と再設計案の両方
 ### ステップ2: 解決策の発散思考
 以下の観点から最低3つの解決策を発想：
@@ -142,4 +152,9 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 - [ ] 各解決策のトレードオフを分析した
 - [ ] 推奨案を選定し理由を説明した
 - [ ] 具体的な実装ステップを作成した
-- [ ] 不確実性への対処方法を記載した
+- [ ] 不確実性への対処方法を記載した
+- [ ] 入力がユーザー報告と整合しているか確認した
+## 禁止事項
+- 入力された結論をユーザー報告との整合性確認なしに信頼すること

package/.claude/agents-ja/verifier.md CHANGED

@@ -36,7 +36,7 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 ## 実行ステップ
-### ステップ1: 調査結果の精読
+### ステップ1: 調査結果の検証準備
 **JSON形式の場合**:
 - `hypotheses`から仮説一覧を確認
@@ -48,6 +48,9 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 - 各仮説の支持/反証証拠を整理
 - 未調査と明記された領域を把握
+**impactAnalysisの妥当性確認**:
+- impactAnalysisの論理的妥当性を確認（追加検索は行わない）
 ### ステップ2: Triangulation補完
 調査で確認されていない情報源を探索：
 - 別のコード領域
@@ -68,14 +71,19 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 **評価基準**: 「反証されなかった度合い」で評価（支持証拠の数ではない）
-### ステップ5: Devil's Advocate評価
+### ステップ5: Devil's Advocate評価と批判的検証
 各仮説について検討：
 - 支持証拠が実は別の原因でも説明可能ではないか
 - 反証となりうる証拠を見落としていないか
 - 暗黙の前提が誤っていないか
-### ステップ6: 検証レベル判定と結論導出
-各仮説を以下のレベルで分類し、結論を導出：
+**反証の重み付け**: 以下からの直接引用に基づく反証がある場合、その仮説の信頼度を自動的にlowに下げる
+- 公式ドキュメント
+- 言語仕様
+- 使用パッケージの公式ドキュメント
+### ステップ6: 検証レベル判定と整合性検証
+各仮説を以下のレベルで分類：
 | レベル | 定義 |
 |-------|------|
@@ -84,6 +92,11 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 | direct | 直接的な証拠または観察あり |
 | verified | 再現または確認済み |
+**ユーザー報告との整合性**: 結論がユーザーの報告と整合しているか確認
+- 例:「Aを変更したらBが壊れた」→ 結論がその因果関係を説明できているか
+- 例:「実装がおかしい」→ design_gapを検討したか
+- 整合しない場合、「調査の焦点がユーザー報告とずれている可能性」を明示
 **結論**: 「最も反証されなかった仮説」として導出し、JSON形式で出力
 ## 信頼度の判定基準
@@ -110,6 +123,10 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
       "impactOnHypotheses": "既存仮説への影響"
     }
   ],
+  "scopeValidation": {
+    "verified": true,
+    "concerns": ["懸念事項"]
+  },
   "externalResearch": [
     {
       "query": "検索したクエリ",
@@ -161,5 +178,12 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 - [ ] WebSearchで外部情報を収集した
 - [ ] 最低3つの代替仮説を生成した
 - [ ] 主要仮説についてDevil's Advocate評価を実施した
+- [ ] 公式ドキュメントに基づく反証がある仮説の信頼度を下げた
+- [ ] ユーザー報告との整合性を検証した
 - [ ] 各仮説の検証レベルを判定した
-- [ ] 最終結論を「最も反証されなかった仮説」として導出した
+- [ ] 最終結論を「最も反証されなかった仮説」として導出した
+## 禁止事項
+- 公式ドキュメントに基づく反証を発見しても信頼度を下げずに結論を維持すること
+- ユーザーの因果関係ヒントを無視して技術的分析のみに集中すること

package/.claude/commands-en/diagnose.md CHANGED

@@ -8,20 +8,47 @@ Target problem: $ARGUMENTS
 **TodoWrite Registration**: Register execution steps in TodoWrite and proceed systematically
+## Step 0: Problem Structuring (Before investigator invocation)
+### 0.1 Problem Type Determination
+| Type | Criteria |
+|------|----------|
+| Change Failure | Indicates some change occurred before the problem appeared |
+| New Discovery | No relation to changes is indicated |
+If uncertain, ask the user whether any changes were made right before the problem occurred.
+### 0.2 Information Supplementation for Change Failures
+If the following are unclear, **ask with AskUserQuestion** before proceeding:
+- What was changed (cause change)
+- What broke (affected area)
+- Relationship between both (shared components, etc.)
+### 0.3 Reflecting in investigator Prompt
+**For change failures, include the following as mandatory investigation items in prompt**:
+1. Analyze cause change content in detail
+2. Identify commonalities between cause change and affected area
+3. Determine if cause change is a "correct fix" or "new bug" and select comparison baseline based on determination
 ## Diagnosis Flow Overview
 ```
-Problem → Step1:Investigation → Step2:Complexity Assessment → [Step3:Verification] → Step4:Solution Derivation → Step5:Report
-                                                    ↓
-                                               If simple,
-                                               skip Step3
+Problem → Investigation → Quality Check → [Verification] → Solution Derivation
+                              ↓
+                         If simple,
+                         skip Verification
 ```
 **Context Separation**: Pass only structured JSON output to each step. Each step starts fresh with the JSON data only.
 ## Execution Steps
-### Step 1: Investigation
+Register the following in TodoWrite and execute:
+### Step 1: Investigation (investigator)
 **Task tool invocation**:
 ```
@@ -31,46 +58,55 @@ prompt: Comprehensively collect information related to the following phenomenon.
 Phenomenon: [Problem reported by user]
 ```
-**Expected output**: Evidence matrix, list of unexplored areas, investigation limitations
+**Expected output**: Evidence matrix, comparison analysis results, causal tracking results, list of unexplored areas, investigation limitations
+### Step 2: Quality Check and Verification Decision
+Review investigation output and assess:
-### Step 2: Complexity Assessment
+**Investigation Quality Check** (verify JSON output contains the following):
+- [ ] comparisonAnalysis
+- [ ] causalChain for each hypothesis (reaching stop condition)
+- [ ] causeCategory for each hypothesis
-Review Step 1 output and assess:
+**If quality insufficient**: Re-run investigation specifying missing items
-**Step 3 execution conditions (if any apply)**:
+**Verification execution conditions (if any apply)**:
 - 2 or more hypotheses have similar levels of evidence
 - Only indirect evidence exists, no direct evidence
 - 2 or more unexplored areas exist
 - Contradicting evidence exists for hypotheses
 - Problem has recurred in the past
+- impactAnalysis.impactScope contains 3 or more affected locations
+- impactAnalysis.recurrenceRisk is high
-**Step 3 skip conditions (all must apply)**:
+**Verification skip conditions (all must apply)**:
 - One hypothesis is clearly dominant (direct evidence exists, no refutation)
 - Almost no unexplored areas
 - One-time problem (no recurrence history)
-Report assessment results to user and explain reasoning if skipping Step 3.
+Report assessment results to user and explain reasoning if skipping verification.
-### Step 3: Verification (complex cases only)
+### Step 3: Verification (verifier) *For complex cases
 **Task tool invocation**:
 ```
 subagent_type: verifier
 prompt: Verify the following investigation results.
-Investigation results: [Step 1 JSON output]
+Investigation results: [Investigation JSON output]
 ```
 **Expected output**: Alternative hypotheses (at least 3), Devil's Advocate evaluation, final conclusion, confidence
-### Step 4: Solution Derivation
+### Step 4: Solution Derivation (solver)
 **Task tool invocation**:
 ```
 subagent_type: solver
 prompt: Derive solutions based on the following verified conclusion.
-Conclusion: [Conclusion portion from Step 3 or Step 1]
+Conclusion: [Conclusion portion from verification or investigation]
 Confidence: [high/medium/low]
 ```
@@ -116,8 +152,9 @@ Rationale: [Selection rationale]
 ## Completion Criteria
-- [ ] Step 1: Executed investigation and obtained evidence matrix
-- [ ] Step 2: Performed complexity assessment and reported results to user
-- [ ] Step 3: (If complex) Executed verification
-- [ ] Step 4: Executed solution derivation
-- [ ] Step 5: Presented final report to user
+- [ ] Executed investigation and obtained evidence matrix, comparison analysis, and causal tracking
+- [ ] Performed investigation quality check and re-ran if insufficient
+- [ ] Made verification decision and reported results to user
+- [ ] (If complex) Executed verification
+- [ ] Executed solution derivation
+- [ ] Presented final report to user
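
The "Investigation Quality Check" added to Step 2 asks whether the investigator JSON contains `comparisonAnalysis`, plus a `causalChain` and a `causeCategory` for each hypothesis. The structural part of that check might look like the sketch below. The function name and the shape of the returned list are assumptions, and whether a causal chain actually reached a stop condition is a judgment this code does not attempt.

```typescript
// Only the fields needed for the check; the real investigator output has more.
interface HypothesisOutput {
  id: string;
  causeCategory?: string;
  causalChain?: string[];
}

interface InvestigationOutput {
  hypotheses: HypothesisOutput[];
  comparisonAnalysis?: object | null;
}

// Returns the names of missing items so that a re-run can specify them.
function findMissingQualityItems(output: InvestigationOutput): string[] {
  const missing: string[] = [];

  if (output.comparisonAnalysis === undefined || output.comparisonAnalysis === null) {
    missing.push("comparisonAnalysis");
  }
  for (const h of output.hypotheses) {
    // Presence only: reaching a stop condition needs human or agent review.
    if (!h.causalChain || h.causalChain.length === 0) {
      missing.push(h.id + ".causalChain");
    }
    if (!h.causeCategory) {
      missing.push(h.id + ".causeCategory");
    }
  }
  return missing;
}
```

An empty result means the structural check passed; a non-empty result lists what a re-run of the investigation would need to request.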

package/.claude/commands-ja/diagnose.md CHANGED

@@ -8,20 +8,47 @@ description: 問題を調査し、検証を経て解決策を導出する
 **TodoWrite登録**: 実行ステップをTodoWriteに登録し、計画的にタスクを進める
+## ステップ0: 問題の構造化（investigator呼び出し前）
+### 0.1 問題タイプの判定
+| タイプ | 判断基準 |
+|--------|---------|
+| 変更失敗 | 問題発生の前に何らかの変更があったことが示唆されている |
+| 新規発見 | 変更との関連が示唆されていない |
+判断に迷う場合は「問題発生の直前に何か変更しましたか？」とユーザーに確認。
+### 0.2 変更失敗の場合の情報補完
+以下が不明な場合、**AskUserQuestionで質問**してから次に進む：
+- 何を変更したか（原因変更）
+- 何が壊れたか（影響箇所）
+- 両者の関係（共通コンポーネント等）
+### 0.3 investigatorプロンプトへの反映
+**変更失敗の場合、以下を必須調査項目としてプロンプトに含める**：
+1. 原因変更の内容を詳細に分析
+2. 原因変更と影響箇所の共通点を特定
+3. 原因変更が正しい修正か新たなバグかを判定し、判定結果に基づいて比較基準を選択
 ## 診断フロー概要
 ```
-問題 → ステップ1:調査 → ステップ2:複雑性判定 → [ステップ3:検証] → ステップ4:解決策導出 → ステップ5:レポート
-                                          ↓
-                                     単純な場合は
-                                     ステップ3スキップ
+問題 → 調査 → 品質判定 → [検証] → 解決策導出
+                    ↓
+               単純な場合は
+               検証スキップ
 ```
 **コンテキスト分離**: 各ステップには構造化JSON出力のみを渡す。思考過程は引き継がない。
 ## 実行ステップ
-### ステップ1: 調査
+以下をTodoWriteに登録して実行：
+### ステップ1: 調査（investigator）
 **Taskツールでの呼び出し**:
 ```
@@ -31,46 +58,55 @@ prompt: 以下の現象について、関連する情報を網羅的に収集し
 現象: [ユーザーが報告した問題]
 ```
-**期待される出力**: 証拠マトリクス、未探索領域のリスト、調査の限界
+**期待される出力**: 証拠マトリクス、比較分析結果、因果追跡結果、未探索領域のリスト、調査の限界
+### ステップ2: 品質判定・検証判断
+調査出力を確認し判定：
-### ステップ2: 複雑性判定
+**調査品質チェック**（出力JSONに以下が含まれているか）:
+- [ ] comparisonAnalysis
+- [ ] 各仮説にcausalChain（停止条件に到達）
+- [ ] 各仮説にcauseCategory
-ステップ1の出力を確認し判定：
+**品質不足の場合**: 不足項目を指定して調査を再実行
-**ステップ3実行条件（1つでも該当）**:
+**検証実行条件（1つでも該当）**:
 - 2つ以上の仮説が同程度の証拠を持つ
 - 直接証拠がなく間接証拠のみ
 - 未探索領域が2つ以上ある
 - 仮説に反証する証拠が存在する
 - 問題が過去に再発している
+- impactAnalysis.impactScopeに3件以上の該当箇所がある
+- impactAnalysis.recurrenceRiskがhigh
-**ステップ3スキップ条件（すべて該当）**:
+**検証スキップ条件（すべて該当）**:
 - 1つの仮説が明らかに優勢（直接証拠あり、反証なし）
 - 未探索領域がほぼない
 - 単発の問題（再発履歴なし）
-判定結果をユーザーに報告し、ステップ3をスキップする場合は理由を説明。
+判定結果をユーザーに報告し、検証をスキップする場合は理由を説明。
-### ステップ3: 検証（複雑な場合のみ）
+### ステップ3: 検証（verifier）※複雑な問題の場合
 **Taskツールでの呼び出し**:
 ```
 subagent_type: verifier
 prompt: 以下の調査結果を検証してください。
-調査結果: [ステップ1のJSON出力]
+調査結果: [調査のJSON出力]
 ```
 **期待される出力**: 代替仮説（最低3つ）、Devil's Advocate評価、最終結論、信頼度
-### ステップ4: 解決策導出
+### ステップ4: 解決策導出（solver）
 **Taskツールでの呼び出し**:
 ```
 subagent_type: solver
 prompt: 以下の検証済み結論に基づいて、解決策を導出してください。
-結論: [ステップ3またはステップ1の結論部分]
+結論: [検証または調査の結論部分]
 信頼度: [high/medium/low]
 ```
@@ -116,8 +152,9 @@ prompt: 以下の検証済み結論に基づいて、解決策を導出してく
 ## 完了条件
-- [ ] ステップ1: 調査を実行し、証拠マトリクスを取得した
-- [ ] ステップ2: 複雑性判定を行い、結果をユーザーに報告した
-- [ ] ステップ3: （複雑な場合）検証を実行した
-- [ ] ステップ4: 解決策導出を実行した
-- [ ] ステップ5: 最終レポートをユーザーに提示した
+- [ ] 調査を実行し、証拠マトリクス・比較分析・因果追跡を取得した
+- [ ] 調査品質チェックを行い、不足があれば再実行した
+- [ ] 検証判断を行い、結果をユーザーに報告した
+- [ ] （複雑な場合）検証を実行した
+- [ ] 解決策導出を実行した
+- [ ] 最終レポートをユーザーに提示した
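
The verification execution conditions in Step 2 of the diagnose command ("if any apply", now including the two new `impactAnalysis` conditions) can be sketched as a function that collects the reasons that apply. The `VerificationSignals` fields are hypothetical summaries that the orchestrating step would have to fill in from the investigator output. The sketch also assumes that verification is skipped whenever no execution condition applies, which the diff does not state outright.

```typescript
// Hypothetical summary of the investigation output, one field per condition.
interface VerificationSignals {
  hypothesesWithSimilarEvidence: number;
  hasDirectEvidence: boolean;
  unexploredAreaCount: number;
  hasContradictingEvidence: boolean;
  hasRecurredBefore: boolean;
  impactScopeCount: number;
  recurrenceRisk: "low" | "medium" | "high";
}

// Collects every execution condition that applies, so the reasoning
// can be reported to the user.
function verificationReasons(s: VerificationSignals): string[] {
  const reasons: string[] = [];
  if (s.hypothesesWithSimilarEvidence >= 2) {
    reasons.push("2 or more hypotheses have similar levels of evidence");
  }
  if (!s.hasDirectEvidence) {
    reasons.push("only indirect evidence exists");
  }
  if (s.unexploredAreaCount >= 2) {
    reasons.push("2 or more unexplored areas exist");
  }
  if (s.hasContradictingEvidence) {
    reasons.push("contradicting evidence exists");
  }
  if (s.hasRecurredBefore) {
    reasons.push("problem has recurred in the past");
  }
  if (s.impactScopeCount >= 3) {
    reasons.push("impactScope contains 3 or more affected locations");
  }
  if (s.recurrenceRisk === "high") {
    reasons.push("recurrenceRisk is high");
  }
  return reasons;
}

// Verification runs if any condition applies (assumption: otherwise skip).
function shouldRunVerification(s: VerificationSignals): boolean {
  return verificationReasons(s).length > 0;
}
```

Returning the list of reasons rather than a bare boolean fits the instruction in the diff to report the assessment to the user and explain the reasoning.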

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "create-ai-project",
-  "version": "1.13.0",
+  "version": "1.13.1",
   "packageManager": "npm@10.8.2",
   "description": "TypeScript boilerplate with skills and sub-agents for Claude Code. Prevents context exhaustion through role-based task splitting.",
   "keywords": [