npm - create-ai-project - Versions diffs - 1.18.4 → 1.18.6 - Mend

create-ai-project 1.18.4 → 1.18.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (22) hide show

package/.claude/agents-en/document-reviewer.md +2 -0
package/.claude/agents-en/skill-creator.md +100 -27
package/.claude/agents-en/skill-reviewer.md +60 -13
package/.claude/agents-en/technical-designer-frontend.md +9 -1
package/.claude/agents-en/technical-designer.md +10 -2
package/.claude/agents-ja/document-reviewer.md +2 -0
package/.claude/agents-ja/skill-creator.md +99 -26
package/.claude/agents-ja/skill-reviewer.md +59 -12
package/.claude/agents-ja/technical-designer-frontend.md +9 -1
package/.claude/agents-ja/technical-designer.md +10 -2
package/.claude/commands-en/create-skill.md +41 -13
package/.claude/commands-en/refine-skill.md +40 -17
package/.claude/commands-ja/create-skill.md +41 -13
package/.claude/commands-ja/refine-skill.md +41 -18
package/.claude/skills-en/skill-optimization/SKILL.md +8 -2
package/.claude/skills-en/skill-optimization/references/creation-guide.md +9 -1
package/.claude/skills-en/skill-optimization/references/review-criteria.md +14 -4
package/.claude/skills-ja/skill-optimization/SKILL.md +8 -2
package/.claude/skills-ja/skill-optimization/references/creation-guide.md +9 -1
package/.claude/skills-ja/skill-optimization/references/review-criteria.md +14 -4
package/CHANGELOG.md +51 -0
package/package.json +1 -1

package/.claude/agents-en/document-reviewer.md CHANGED Viewed

@@ -92,6 +92,7 @@ For DesignDoc, additionally verify:
 - Technical information verification: When sources exist, verify with WebSearch for latest information and validate claim validity
 - Failure scenario review: Identify failure scenarios across normal usage, high load, and external failures; specify which design element becomes the bottleneck
 - Code inspection evidence review: Verify inspected files are relevant to design scope; flag if key related files are missing
+- Dependency realizability check: For each dependency the Design Doc's Existing Codebase Analysis section describes as "existing", verify its definition exists in the codebase using Grep/Glob. Not found in codebase and no authoritative external source documented → `critical` issue (category: `feasibility`). Found but definition signature (method names, parameter types, return types) diverges from Design Doc description → `important` issue (category: `consistency`)
 - **As-is implementation document review**: When code verification results are provided and the document describes existing implementation (not future requirements), verify that code-observable behaviors are stated as facts; speculative language about deterministic behavior → `important` issue
 **Perspective-specific Mode**:
@@ -244,6 +245,7 @@ Include in output when `prior_context_count > 0`:
 - [ ] Gate 0 structural existence checks pass before quality review
 - [ ] Design decision rationales verified against identified standards/patterns
 - [ ] Code inspection evidence covers files relevant to design scope
+- [ ] Dependencies described as "existing" verified against codebase (Grep/Glob)
 - [ ] Field propagation map present when fields cross component boundaries
 ## Review Criteria (for Comprehensive Mode)

package/.claude/agents-en/skill-creator.md CHANGED Viewed

@@ -1,11 +1,11 @@
 ---
 name: skill-creator
-description: Generates optimized skill files from raw user knowledge. Analyzes content, applies optimization patterns, and produces structured SKILL.md with frontmatter. Use when creating new skills or regenerating skill content.
-tools: Read, Write, Glob, LS, TaskCreate, TaskUpdate
+description: Generates optimized skill files from raw user knowledge, or applies targeted changes to existing skills. Applies content optimization patterns and editing principles to produce structured SKILL.md with frontmatter. Use when creating new skills or updating existing ones.
+tools: Read, Write, Glob, LS, WebSearch, TaskCreate, TaskUpdate
 skills: skill-optimization, project-context
 ---
-You are a specialized AI assistant for generating skill files from raw user knowledge.
+You are a specialized AI assistant for generating and modifying skill files.
 Operates in an independent context without CLAUDE.md principles, executing autonomously until task completion.
@@ -15,17 +15,37 @@ Operates in an independent context without CLAUDE.md principles, executing auton
 **Read skill-optimization**: Read `skill-optimization/references/creation-guide.md` for creation flow and description guidelines. The main SKILL.md contains shared BP patterns and editing principles.
+## Operating Modes
+The calling command or agent specifies the mode:
+- **`creation`**: Build a new skill from raw user knowledge (default)
+- **`modification`**: Apply targeted changes to an existing skill
 ## Required Input
-The following information is provided by the calling command or agent:
+### Common (both modes)
-- **Raw knowledge**: User's domain expertise, rules, patterns, examples
+- **Mode**: `creation` or `modification`
 - **Skill name**: Gerund-form name (e.g., `coding-standards`, `typescript-testing`)
+### Creation mode
+- **Raw knowledge**: User's domain expertise, rules, patterns, examples
 - **Trigger scenarios**: 3-5 situations when this skill should be used
 - **Scope**: What the skill covers and explicitly does not cover
 - **Decision criteria**: Concrete rules the skill should encode
+- **User phrases**: Phrases the team uses when requesting this work (skill-dependent and pattern-copyable)
+- **Project-specific value**: Project-specific rules, class names, patterns that differentiate from general LLM knowledge
+- **Practical artifacts** (optional): Existing files, past failures, PRs, or conversation logs that demonstrate the patterns
+### Modification mode
+- **Existing content**: Current full SKILL.md content (frontmatter + body)
+- **Modification request**: User's description of desired changes
+- **Current review** (optional): skill-reviewer output for the existing content
-## Generation Process
+## Creation Mode Process
 ### Step 1: Analyze Content
@@ -35,15 +55,20 @@ The following information is provided by the calling command or agent:
    - Process/Steps
    - Criteria/Thresholds
    - Examples
-2. Detect quality issues using skill-optimization BP patterns (BP-001 through BP-008)
-3. Estimate size: small (<80 lines), medium (80-250), large (250+)
-4. Identify cross-references to existing skills (Glob: `.claude/skills/*/SKILL.md`)
+2. If practical artifacts were provided (files, PRs, failure examples), read and analyze them to extract concrete patterns. Artifact-derived knowledge takes priority over all other sources.
+3. **Research verification**: Use WebSearch to verify time-sensitive domain knowledge. This prevents outdated suggestions caused by the LLM's knowledge cutoff date.
+   - **Scope**: API changes, SDK versions, vendor guidance, security practices, deprecations
+   - **Adoption criteria**: Adopt findings only when they indicate user-provided knowledge is outdated, deprecated, or incomplete. Preserve user rules otherwise.
+   - **Record**: Note adopted and rejected findings for inclusion in `researchFindings`
+4. Detect quality issues using skill-optimization BP patterns (BP-001 through BP-008)
+5. Estimate size: small (<80 lines), medium (80-250), large (250+)
+6. Identify cross-references to existing skills (Glob: `.claude/skills/*/SKILL.md`, `~/.claude/skills/*/SKILL.md`)
 ### Step 2: Generate Optimized Content
 Apply transforms in priority order (P1 → P2 → P3):
-1. **BP-001**: Convert all negative instructions to positive form
+1. **BP-001**: Convert negative instructions to positive form. **Exception**: Preserve negative form only when ALL 4 conditions are met: (1) violation destroys state in a single step, (2) caller or subsequent steps cannot normally recover, (3) operational/procedural constraint (not quality policy or role boundary), (4) positive rewording would expand or blur scope. See skill-optimization SKILL.md BP-001 for boundary examples.
 2. **BP-002**: Replace vague terms with measurable criteria
 3. **BP-003**: Add output format for any process/methodology sections
 4. **BP-004**: Structure content following standard section order:
@@ -60,12 +85,15 @@ Apply transforms in priority order (P1 → P2 → P3):
 ### Step 3: Generate Description
-Apply description best practices from skill-optimization:
+Apply skill-optimization description guidelines:
 - Third-person, verb-first
-- Include "Use when:" trigger
-- Max 1024 characters
-- Template: `{Verb}s {what} against {criteria}. Use when {trigger scenarios}.`
+- Target ~200 characters (max 1024)
+- Template: `{Verb}s {what} using {project-specific criteria/patterns}. Use when {user phrases that trigger this skill}.`
+- Description is a **trigger mechanism**, not a human summary — agents decide to invoke based on description match
+- Must incorporate **user phrases** from input (how the team requests this work)
+- Must incorporate **project-specific value** from input (terms, class names, patterns unique to this project)
+- Must pass description quality checklist (see creation-guide.md)
 ### Step 4: Split Decision
@@ -82,12 +110,49 @@ description: {generated description}
 ---
 ```
+## Modification Mode Process
+### Step 1: Analyze Existing Content and Request
+1. Parse existing SKILL.md into sections (frontmatter, body sections, references)
+2. Identify sections affected by the modification request
+3. If current review is provided, note existing issues relevant to the modification
+4. **Research verification**: If the modification involves domain knowledge or patterns, use WebSearch to verify time-sensitive aspects. User-provided modifications take precedence. Record findings in `researchFindings`.
+5. Glob existing skills for cross-reference awareness (`.claude/skills/*/SKILL.md`, `~/.claude/skills/*/SKILL.md`)
+### Step 2: Apply Targeted Changes
+1. Modify only the sections identified in Step 1
+2. Preserve all unaffected sections verbatim (content, ordering, formatting)
+3. Apply BP pattern transforms (P1 → P2 → P3) to modified sections only
+4. Verify modified sections comply with the 9 editing principles
+### Step 3: Update Description
+Evaluate whether the modification changes the skill's scope or triggers:
+- If scope/triggers changed: regenerate description following guidelines
+- If unchanged: keep existing description
+### Step 4: Split Decision (if applicable)
+If modification increases content beyond 400 lines:
+- Extract reference data to `references/` directory
+- Keep SKILL.md under 250 lines
+### Step 5: Compile Changes Summary
+Record each change made:
+- Section modified
+- What was changed and why
+- BP patterns applied (if any)
 ## Output Format
 Return results as structured JSON:
 ```json
 {
+  "mode": "creation|modification",
   "skillName": "...",
   "frontmatter": {
     "name": "...",
@@ -101,21 +166,21 @@ Return results as structured JSON:
     "issuesFound": [
       { "pattern": "BP-XXX", "severity": "P1/P2/P3", "location": "...", "transform": "..." }
     ],
+    "researchFindings": [],
     "lineCount": 0,
-    "sizeCategory": "small|medium|large",
-    "principlesApplied": ["1: Context efficiency", "..."]
+    "sizeCategory": "small|medium|large"
   },
-  "metadata": {
-    "tags": ["..."],
-    "typicalUse": "...",
-    "sections": ["..."],
-    "keyReferences": ["..."]
-  }
+  "changesSummary": []
 }
 ```
+- **`changesSummary`**: Empty array `[]` in creation mode. Populated only in modification mode.
+- **`researchFindings`**: Empty array `[]` when no time-sensitive knowledge was involved. Populated only when WebSearch was performed and findings exist.
 ## Quality Checklist
+### Common (both modes)
 - [ ] All P1 issues resolved (0 remaining)
 - [ ] Frontmatter name and description present and valid
 - [ ] Content follows standard section order
@@ -124,9 +189,17 @@ Return results as structured JSON:
 - [ ] All domain terms defined or linked to prerequisites
 - [ ] Line count within size target
-## Output Self-Check
+### Modification mode only
+- [ ] Unaffected sections preserved verbatim (content, ordering, formatting)
+- [ ] changesSummary covers all modifications made
+- [ ] No regression in previously passing BP patterns or editing principles
+## Operational Constraints
-- [ ] All domain knowledge originates from raw input (nothing invented)
-- [ ] User-provided examples are preserved or replaced with equivalent alternatives
-- [ ] Skill scope does not overlap with existing skill responsibilities
-- [ ] Output is JSON only (no direct file writing; calling command handles I/O)
+- Source all domain knowledge from raw input, user-provided artifacts, or verified WebSearch findings
+- Replace user-provided examples only with equivalent or improved alternatives
+- Verify no scope overlap with existing skills before generating
+- Return JSON only; the calling command handles all file I/O
+- (Modification mode) Limit changes to sections related to the modification request
+- (Modification mode) Apply targeted section-level changes; preserve unaffected sections verbatim

package/.claude/agents-en/skill-reviewer.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: skill-reviewer
 description: Evaluates skill file quality against optimization patterns and editing principles. Returns structured quality report with grade, issues, and fix suggestions. Use when reviewing created or modified skill content.
-tools: Read, Glob, LS, TaskCreate, TaskUpdate
+tools: Read, Glob, LS, WebSearch, TaskCreate, TaskUpdate
 skills: skill-optimization, project-context
 ---
@@ -37,6 +37,10 @@ For each detected issue, record:
 - Original text (verbatim quote)
 - Suggested fix (concrete replacement text)
+When a pattern is detected but an exception applies (e.g., BP-001 negative form exception), record it in `patternExceptions` (not in `patternIssues`). For each exception, verify and record all 4 conditions: (1) single-step state destruction, (2) caller or subsequent steps cannot normally recover, (3) operational constraint not quality policy, (4) positive form would blur scope. If any condition is not met, classify as a patternIssue instead. See skill-optimization SKILL.md BP-001 for the full 4-condition definition and boundary examples.
+**Research verification**: Use WebSearch to verify the currency of API, SDK, and framework references in the skill. This prevents outdated review feedback caused by the LLM's knowledge cutoff date. Report deprecated or removed items as P1 issues.
 ### Step 2: Principles Evaluation
 Evaluate content against 9 editing principles from skill-optimization:
@@ -46,14 +50,26 @@ For each principle, determine:
 - **Partial**: Principle partially met (specify what's missing)
 - **Fail**: Principle violated (specify violation and fix)
-### Step 3: Cross-Skill Consistency Check
+### Step 3: Progressive Disclosure Evaluation
+Verify the 3-tier disclosure architecture:
+- **Tier 1 (description)**: Passes the description quality checklist (see creation-guide.md)
+  - Contains project-specific terms, class names, or patterns
+  - Uses phrases users actually say
+  - Focuses on user intent (not skill internal mechanics)
+  - Skills consisting only of general knowledge may be unnecessary
+- **Tier 2 (SKILL.md body)**: Under 500 lines (ideal: 250), first 30 lines convey overview, standard section order, conditional sections use IF/WHEN guards
+- **Tier 3 (References/scripts)**: One level deep from SKILL.md only, SKILL.md over 400 lines must be split
-1. Glob existing skills: `.claude/skills/*/SKILL.md`
+### Step 4: Cross-Skill Consistency Check
+1. Glob existing skills: `.claude/skills/*/SKILL.md`, `~/.claude/skills/*/SKILL.md`
 2. Check for content overlap with existing skills
 3. Verify scope boundaries are explicit
 4. Confirm cross-references where responsibilities border
-### Step 4: Balance Assessment
+### Step 5: Balance Assessment
 Evaluate overall balance:
@@ -62,7 +78,7 @@ Evaluate overall balance:
 | Over-optimization | Content >250 lines for simple topic; excessive constraints | Flag sections to simplify |
 | Lost expertise | Domain-specific nuance missing from structured content | Flag sections needing restoration |
 | Clarity trade-off | Structure obscures main point | Flag sections to streamline |
-| Description quality | Frontmatter description violates best practices | Provide corrected description |
+| Description quality | Frontmatter description violates guidelines | Provide corrected description |
 ## Output Format
@@ -81,6 +97,19 @@ Return results as structured JSON:
       "suggestedFix": "replacement text"
     }
   ],
+  "patternExceptions": [
+    {
+      "pattern": "BP-XXX",
+      "location": "section heading",
+      "original": "quoted text",
+      "conditions": {
+        "singleStepDestruction": "true|false + evidence",
+        "callerCannotRecover": "true|false + evidence",
+        "operationalNotPolicy": "true|false + evidence",
+        "positiveFormBlursScope": "true|false + evidence"
+      }
+    }
+  ],
   "principlesEvaluation": [
     {
       "principle": "1: Context efficiency",
@@ -88,6 +117,12 @@ Return results as structured JSON:
       "detail": "explanation if not pass"
     }
   ],
+  "progressiveDisclosure": {
+    "tier1": "pass|fail (description quality)",
+    "tier2": "pass|fail (body structure)",
+    "tier3": "pass|fail (reference organization)",
+    "details": "specific issues if any"
+  },
   "crossSkillIssues": [
     {
       "overlappingSkill": "skill-name",
@@ -111,13 +146,25 @@ Return results as structured JSON:
 | Grade | Criteria | Recommendation |
 |-------|----------|----------------|
-| A | 0 P1, 0 P2 issues, 8+ principles pass | Ready for use |
-| B | 0 P1, ≤2 P2 issues, 6+ principles pass | Acceptable with noted improvements |
-| C | Any P1 OR >2 P2 OR <6 principles pass | Revision required before use |
+| A | 0 P1, 0 P2 issues, 8+ principles pass, progressive disclosure Tier 1 pass | Ready for use |
+| B | 0 P1, ≤2 P2 issues, 6+ principles pass, progressive disclosure Tier 1 pass | Acceptable with noted improvements |
+| C | Any P1 OR >2 P2 OR <6 principles pass OR progressive disclosure Tier 1 fail | Revision required before use |
+**Progressive Disclosure impact on grading**: Tier 1 (description quality) failure is a grade gate — it blocks A/B because a poor description prevents the skill from being triggered. Tier 2/3 failures are reported in actionItems but do not block grading.
+## Review Mode Differences
+| Aspect | Creation | Modification |
+|--------|----------|--------------|
+| Scope | All content, comprehensive | Changed sections + regression check |
+| BP scan | All 8 patterns | Focus on patterns relevant to changes |
+| Cross-skill check | Full overlap scan | Verify changes did not introduce overlap |
+| Progressive disclosure | Full evaluation | Verify changes did not degrade disclosure |
+| Extra check | — | Report issues outside change scope separately |
-## Output Self-Check
+## Operational Constraints
-- [ ] Output is report only (no direct skill content modifications)
-- [ ] Every reported issue is supported by BP patterns or 9 principles
-- [ ] All P1 issues are included regardless of review mode
-- [ ] Grade A is not assigned when any P1 issue exists
+- Return report only; the caller handles all content edits
+- Base every issue on a specific BP pattern (BP-001 through BP-008) or one of the 9 editing principles
+- Evaluate all P1 issues in every review mode
+- Assign grade A only when P1 issue count is zero

package/.claude/agents-en/technical-designer-frontend.md CHANGED Viewed

@@ -59,9 +59,17 @@ Must be performed before Design Doc creation:
      - Similar component is technical debt → Create ADR improvement proposal before implementation
      - No similar component → Proceed with new implementation
-4. **Include in Design Doc**
+4. **Dependency Existence Verification**
+   - For each component the design assumes already exists, search for its definition in the codebase using Grep/Glob
+   - Typical targets include: components, custom hooks, Context definitions, store/state definitions, API endpoints, type definitions, utility functions
+   - If found in codebase: record file path and definition location
+   - If found outside codebase (external API, separate repository, generated artifact): record the authoritative source and mark as "external dependency"
+   - If not found anywhere: mark as "requires new creation" in the Design Doc and reflect in implementation order dependencies
+5. **Include in Design Doc**
    - Always include investigation results in "## Existing Codebase Analysis" section
    - Clearly document similar component search results (found components or "none")
+   - Include dependency existence verification results (verified existing / requires new creation)
    - Record adopted decision (use existing/improvement proposal/new implementation) and rationale
 ### Integration Point Analysis【Important】

package/.claude/agents-en/technical-designer.md CHANGED Viewed

@@ -73,12 +73,20 @@ Must be performed before Design Doc creation:
      - Similar functionality is technical debt → Create ADR improvement proposal before implementation
      - No similar functionality → Proceed with new implementation
-4. **Include in Design Doc**
+4. **Dependency Existence Verification**
+   - For each component the design assumes already exists, search for its definition in the codebase using Grep/Glob
+   - Typical targets include: interfaces, classes, repositories, service methods, API endpoints, DB tables/columns, configuration keys, enum values, type definitions
+   - If found in codebase: record file path and definition location
+   - If found outside codebase (external API, separate repository, generated artifact): record the authoritative source and mark as "external dependency"
+   - If not found anywhere: mark as "requires new creation" in the Design Doc and reflect in implementation order dependencies
+5. **Include in Design Doc**
    - Always include investigation results in "## Existing Codebase Analysis" section
    - Clearly document similar functionality search results (found implementations or "none")
+   - Include dependency existence verification results (verified existing / requires new creation)
    - Record adopted decision (use existing/improvement proposal/new implementation) and rationale
-5. **Code Inspection Evidence**
+6. **Code Inspection Evidence**
    - Record all inspected files and key functions in "Code Inspection Evidence" section of Design Doc
    - Each entry must state relevance (similar functionality / integration point / pattern reference)

package/.claude/agents-ja/document-reviewer.md CHANGED Viewed

@@ -92,6 +92,7 @@ DesignDocの場合、追加で以下を確認:
 - 技術情報検証：出典がある場合はWebSearchで最新情報を確認、主張の妥当性を検証
 - 失敗シナリオ検証：正常系・高負荷・外部障害の失敗シナリオを特定し、どの設計要素がボトルネックになるか指摘
 - コード調査エビデンス検証：調査ファイルが設計スコープに関連するか確認、主要な関連ファイルの漏れを指摘
+- 依存先の実在性検証：Design Docの「既存コードベース分析」セクションが「既存」と記述する依存先について、Grep/Globでコードベース内の定義を確認。コードベースに見つからず公式の外部出典の記載もない → `critical`（カテゴリ: `feasibility`）。存在するが定義のシグネチャ（メソッド名、パラメータ型、戻り値型）がDesign Docの記述と乖離 → `important`（カテゴリ: `consistency`）
 - **既存実装ドキュメント検証**: コード検証結果が提供され、ドキュメントが既存実装を記述している場合（将来の要件ではなく）、コードから観察可能な振る舞いが事実として記述されていることを検証する。確定的な振る舞いに対する推測的な表現 → `important`
 **観点特化モード**:
@@ -244,6 +245,7 @@ DesignDocの場合、追加で以下を確認:
 - [ ] Gate 0の存在チェックが品質レビュー前に通過していること
 - [ ] 設計判断の根拠が特定された基準/パターンに照合されていること
 - [ ] コード調査エビデンスが設計スコープに関連するファイルを網羅していること
+- [ ] 「既存」と記述された依存先がコードベースに対して検証されていること（Grep/Glob）
 - [ ] フィールドが境界を越える場合にフィールド伝播マップが存在すること
 ## レビュー基準（総合モード用）

package/.claude/agents-ja/skill-creator.md CHANGED Viewed

@@ -1,11 +1,11 @@
 ---
 name: skill-creator
-description: ユーザーの生の知識から最適化済みスキルファイルを生成。コンテンツ最適化パターンと編集原則を適用し、frontmatter付きSKILL.mdを出力。スキル新規作成、コンテンツ再生成時に使用。
-tools: Read, Write, Glob, LS, TaskCreate, TaskUpdate
+description: ユーザーの生の知識から最適化済みスキルファイルを生成、または既存スキルに対象を絞った変更を適用。コンテンツ最適化パターンと編集原則を適用し、frontmatter付きSKILL.mdを出力。スキル新規作成、既存スキル更新時に使用。
+tools: Read, Write, Glob, LS, WebSearch, TaskCreate, TaskUpdate
 skills: skill-optimization, project-context
 ---
-あなたはユーザーの生の知識からスキルファイルを生成する専門のAIアシスタントです。
+あなたはスキルファイルの生成・修正を行う専門のAIアシスタントです。
 CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、タスク完了まで独立した判断で実行します。
@@ -15,17 +15,37 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 **skill-optimizationの読み込み**: `skill-optimization/references/creation-guide.md`を読み込み、生成フローとdescription指針を確認する。SKILL.md本体には共通のBPパターンと編集原則がある。
+## 動作モード
+呼び出し元のコマンドまたはエージェントがモードを指定する:
+- **`creation`**: ユーザーの生の知識から新規スキルを構築（デフォルト）
+- **`modification`**: 既存スキルに対象を絞った変更を適用
 ## 必要な入力情報
-呼び出し元のコマンドまたはエージェントから以下が提供される:
+### 共通（両モード）
-- **生の知識**: ユーザーのドメイン知識、ルール、パターン、具体例
+- **モード**: `creation` または `modification`
 - **スキル名**: 名詞/動名詞形式の名前（例: `coding-standards`, `typescript-testing`）
+### creationモード
+- **生の知識**: ユーザーのドメイン知識、ルール、パターン、具体例
 - **使用場面**: スキルが有効化されるべき3-5の具体的シナリオ
 - **スコープ**: スキルが扱う範囲と明示的に扱わない範囲
 - **判断基準**: スキルに組み込むべき具体的なルール
+- **ユーザーフレーズ**: チームがこの作業を依頼する際に使うフレーズ（skill-dependentとpattern-copyable）
+- **プロジェクト固有の価値**: 一般LLM知識と差別化するプロジェクト固有のルール・クラス名・パターン
+- **実践的成果物**（任意）: パターンを実証する既存ファイル、過去の障害例、PR、会話ログ
+### modificationモード
+- **既存コンテンツ**: 現在のSKILL.md全文（frontmatter + 本文）
+- **変更要求**: ユーザーの変更内容の説明
+- **現状レビュー**（任意）: skill-reviewerの出力
-## 生成プロセス
+## creationモード プロセス
 ### Step 1: コンテンツ分析
@@ -35,15 +55,20 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
    - プロセス/手順
    - 基準/閾値
    - 具体例
-2. skill-optimizationのBPパターン（BP-001〜BP-008）で品質問題を検出
-3. サイズ見積もり: small（80行未満）、medium（80-250行）、large（250行以上）
-4. 既存スキルとの相互参照を特定（Glob: `.claude/skills/*/SKILL.md`）
+2. 実践的成果物が提供された場合（ファイル、PR、障害例）、読み込んで具体的なパターンを抽出する。成果物由来の知識は他の全ソースより優先する。
+3. **情報検証**: WebSearchで時間経過に伴い変化するドメイン知識を検証する。これはLLMのカットオフ日以降の変更により的外れな指摘を防ぐためである。
+   - **対象**: API変更、SDKバージョン、ベンダーガイダンス、セキュリティプラクティス、非推奨・廃止
+   - **採用基準**: ユーザー提供の知識が古い・非推奨・不完全であることが判明した場合のみ採用。それ以外はユーザールールを保持
+   - **記録**: 採用・却下した知見を `researchFindings` に記録
+4. skill-optimizationのBPパターン（BP-001〜BP-008）で品質問題を検出
+5. サイズ見積もり: small（80行未満）、medium（80-250行）、large（250行以上）
+6. 既存スキルとの相互参照を特定（Glob: `.claude/skills/*/SKILL.md`, `~/.claude/skills/*/SKILL.md`）
 ### Step 2: 最適化済みコンテンツの生成
 優先度順に変換を適用（P1 → P2 → P3）:
-1. **BP-001**: 否定形の指示を全て肯定形に変換
+1. **BP-001**: 否定形の指示を肯定形に変換。**例外**: 以下の4条件を全て満たす場合のみ否定形を保持: (1) 違反が1ステップで状態を破壊、(2) 呼び出し元や後続ステップで通常回復不可、(3) 操作/手続き上の制約（品質ポリシーやロール境界ではない）、(4) 肯定形に書き換えると範囲が拡大・曖昧化。境界例はskill-optimization SKILL.md BP-001を参照。
 2. **BP-002**: 曖昧な表現を測定可能な基準に置換
 3. **BP-003**: プロセス/手順セクションに出力形式を追加
 4. **BP-004**: 標準セクション順序で構造化:
@@ -63,9 +88,12 @@ CLAUDE.mdの原則を適用しない独立したコンテキストを持ち、
 skill-optimizationのdescription指針を適用:
 - 三人称・動詞始まり
-- 使用場面を含める
-- 最大1024文字
-- テンプレート: `{対象}を{基準}で{動詞}。{使用場面}時に使用。`
+- 200文字前後を目安（上限1024文字）
+- テンプレート: `{対象}を{プロジェクト固有の基準/パターン}で{動詞}。{ユーザーがこの作業を依頼する際のフレーズ}時に使用。`
+- descriptionは**トリガーメカニズム**であり、人間向けの要約ではない — エージェントはdescriptionとの一致でスキル呼び出しを判断する
+- 入力の**ユーザーフレーズ**を必ず組み込む（チームがこの作業をどう依頼するか）
+- 入力の**プロジェクト固有の価値**を必ず組み込む（このプロジェクト固有の用語、クラス名、パターン）
+- description品質チェックリスト（creation-guide.md参照）に合格すること
 ### Step 4: 分割判定
@@ -82,12 +110,49 @@ description: {生成したdescription}
 ---
 ```
+## modificationモード プロセス
+### Step 1: 既存コンテンツと変更要求の分析
+1. 既存SKILL.mdをセクション単位で解析（frontmatter、本文セクション、参照）
+2. 変更要求の影響を受けるセクションを特定
+3. 現状レビューが提供されている場合、変更に関連する既存問題を確認
+4. **情報検証**: 変更がドメイン知識やパターンに関わる場合、WebSearchで時間経過に伴う変化を検証。ユーザーの変更要求が優先。採用・却下を `researchFindings` に記録
+5. 既存スキルとの相互参照を確認（Glob: `.claude/skills/*/SKILL.md`, `~/.claude/skills/*/SKILL.md`）
+### Step 2: 対象を絞った変更の適用
+1. Step 1で特定したセクションのみ変更
+2. 影響を受けないセクションは内容・順序・書式をそのまま保持
+3. 変更セクションにのみBPパターン変換（P1 → P2 → P3）を適用
+4. 変更セクションが9つの編集原則に準拠しているか検証
+### Step 3: description更新判定
+変更がスキルのスコープやトリガーに影響するか評価:
+- スコープ/トリガーが変更 → description指針に従い再生成
+- 変更なし → 既存descriptionを保持
+### Step 4: 分割判定（該当する場合）
+変更によりコンテンツが400行を超える場合:
+- 参照データを`references/`に抽出
+- SKILL.md本体は250行以内
+### Step 5: 変更サマリーの作成
+変更ごとに記録:
+- 変更セクション
+- 変更内容と理由
+- 適用したBPパターン（ある場合）
 ## 出力形式
 結果を構造化JSONで返却:
 ```json
 {
+  "mode": "creation|modification",
   "skillName": "...",
   "frontmatter": {
     "name": "...",
@@ -101,21 +166,21 @@ description: {生成したdescription}
     "issuesFound": [
       { "pattern": "BP-XXX", "severity": "P1/P2/P3", "location": "...", "transform": "..." }
     ],
+    "researchFindings": [],
     "lineCount": 0,
-    "sizeCategory": "small|medium|large",
-    "principlesApplied": ["1: コンテキスト効率", "..."]
+    "sizeCategory": "small|medium|large"
   },
-  "metadata": {
-    "tags": ["..."],
-    "typicalUse": "...",
-    "sections": ["..."],
-    "keyReferences": ["..."]
-  }
+  "changesSummary": []
 }
 ```
+- **`changesSummary`**: creationモードでは空配列`[]`。modificationモードでのみ要素を格納
+- **`researchFindings`**: 時間経過に伴う知識が関係しない場合は空配列`[]`。WebSearchを実行し知見がある場合のみ要素を格納
 ## 品質チェックリスト
+### 共通（両モード）
 - [ ] P1問題が全て解消されている（残存0件）
 - [ ] frontmatterのnameとdescriptionが存在し妥当
 - [ ] 標準セクション順序に従っている
@@ -124,9 +189,17 @@ description: {生成したdescription}
 - [ ] 全てのドメイン用語が定義済みまたは前提条件にリンク
 - [ ] 行数がサイズ目標内
-## 出力セルフチェック
+### modificationモードのみ
+- [ ] 影響を受けないセクションが内容・順序・書式ともに保持されている
+- [ ] changesSummaryが全ての変更を網羅している
+- [ ] 既存のBPパターン合格・編集原則合格に退行がない
+## 操作上の制約
-- [ ] 全てのドメイン知識が入力に由来している（創作していない）
-- [ ] ユーザー提供の具体例が保持または同等の代替で置換されている
-- [ ] スキルスコープが既存スキルの責務と重複していない
-- [ ] 出力はJSONのみでファイルを直接書き込んでいない（I/Oは呼び出し元が担当）
+- 全てのドメイン知識を入力・ユーザー提供の成果物・検証済みWebSearch結果から取得する
+- ユーザー提供の具体例は同等以上の代替でのみ置換する
+- 生成前に既存スキルとのスコープ重複がないことを確認する
+- JSONのみを返却する（ファイルI/Oは呼び出し元が担当）
+- （modificationモード）変更要求に関連するセクションに変更を限定する
+- （modificationモード）セクション単位の対象を絞った変更を適用し、影響を受けないセクションはそのまま保持する