musubix 3.6.0 → 3.7.3
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.github/AGENTS.md +949 -0
- package/.github/prompts/sdd-change-apply.prompt.md +283 -0
- package/.github/prompts/sdd-change-archive.prompt.md +241 -0
- package/.github/prompts/sdd-change-init.prompt.md +269 -0
- package/.github/prompts/sdd-design.prompt.md +250 -0
- package/.github/prompts/sdd-implement.prompt.md +387 -0
- package/.github/prompts/sdd-requirements.prompt.md +193 -0
- package/.github/prompts/sdd-review.prompt.md +155 -0
- package/.github/prompts/sdd-security.prompt.md +228 -0
- package/.github/prompts/sdd-steering.prompt.md +269 -0
- package/.github/prompts/sdd-tasks.prompt.md +255 -0
- package/.github/prompts/sdd-test.prompt.md +230 -0
- package/.github/prompts/sdd-validate.prompt.md +304 -0
- package/.github/skills/build-fix/SKILL.md +124 -0
- package/.github/skills/checkpoint/SKILL.md +131 -0
- package/.github/skills/codemap/SKILL.md +120 -0
- package/.github/skills/codemap/templates/codemap-index.md +142 -0
- package/.github/skills/codemap/templates/package-codemap.md +160 -0
- package/.github/skills/context-optimizer/SKILL.md +109 -0
- package/.github/skills/context-optimizer/contexts/dev.md +40 -0
- package/.github/skills/context-optimizer/contexts/research.md +55 -0
- package/.github/skills/context-optimizer/contexts/review.md +49 -0
- package/.github/skills/e2e-runner/SKILL.md +145 -0
- package/.github/skills/eval-harness/SKILL.md +111 -0
- package/.github/skills/eval-harness/examples/capability-eval.md +158 -0
- package/.github/skills/eval-harness/examples/human-grader-template.md +326 -0
- package/.github/skills/eval-harness/examples/regression-eval.md +228 -0
- package/.github/skills/learning-hooks/SKILL.md +101 -0
- package/.github/skills/learning-hooks/templates/learned-skill-template.md +79 -0
- package/.github/skills/musubix-adr-generation/SKILL.md +74 -0
- package/.github/skills/musubix-best-practices/SKILL.md +85 -0
- package/.github/skills/musubix-c4-design/SKILL.md +86 -0
- package/.github/skills/musubix-code-generation/SKILL.md +90 -0
- package/.github/skills/musubix-domain-inference/SKILL.md +82 -0
- package/.github/skills/musubix-ears-validation/SKILL.md +74 -0
- package/.github/skills/musubix-sdd-workflow/SKILL.md +95 -0
- package/.github/skills/musubix-technical-writing/SKILL.md +108 -0
- package/.github/skills/musubix-test-generation/SKILL.md +88 -0
- package/.github/skills/musubix-traceability/SKILL.md +94 -0
- package/.github/skills/refactor-cleaner/SKILL.md +105 -0
- package/.github/skills/session-manager/SKILL.md +119 -0
- package/.github/skills/session-manager/scripts/session-end.sh +175 -0
- package/.github/skills/session-manager/scripts/session-start.sh +87 -0
- package/.github/skills/verification-loop/SKILL.md +111 -0
- package/.github/skills/verification-loop/scripts/verify.sh +305 -0
- package/AGENTS.md +333 -0
- package/LICENSE +21 -0
- package/README.ja.md +313 -0
- package/README.md +315 -50
- package/bin/musubix-mcp.js +15 -0
- package/bin/musubix.js +9 -1
- package/docs/API-REFERENCE.md +1425 -0
- package/docs/CODEMAPS/CODEMAP.md +1 -0
- package/docs/GITHUB-ACTIONS-NPM-SETUP.md +132 -0
- package/docs/INSTALL-GUIDE.ja.md +459 -0
- package/docs/INSTALL-GUIDE.md +459 -0
- package/docs/MIGRATION-v3.0.md +324 -0
- package/docs/MUSUBI-enhancement_roadmap_20260105.md +651 -0
- package/docs/MUSUBIX-v3.0-User-Guide.md +1357 -0
- package/docs/MUSUBIXv2.2.0-Manual-outline.md +136 -0
- package/docs/MUSUBIXv2.2.0-Manual.md +3123 -0
- package/docs/MUSUBIXv2.3.5-Refactering.md +1310 -0
- package/docs/MUSUBIv1.6.1-enhancement_roadmap_20260105.md +291 -0
- package/docs/MUSUBIv2.2.0-USERGUIDE.md +2079 -0
- package/docs/ROADMAP-v1.5.md +116 -0
- package/docs/SwarmCoding.md +1284 -0
- package/docs/Test-prompt.md +105 -0
- package/docs/USER-GUIDE-v1.8.0.md +2371 -0
- package/docs/USER-GUIDE.ja.md +2147 -0
- package/docs/USER-GUIDE.md +3022 -0
- package/docs/YATA-GLOBAL-GUIDE.ja.md +750 -0
- package/docs/YATA-GLOBAL-GUIDE.md +595 -0
- package/docs/YATA-LOCAL-GUIDE.ja.md +989 -0
- package/docs/YATA-LOCAL-GUIDE.md +730 -0
- package/docs/adr/0001-real-time-pattern-learning-architecture-for-v1-5-0.md +75 -0
- package/docs/adr/0002-pattern-sharing-protocol-for-cross-team-collaborat.md +79 -0
- package/docs/adr/0003-owl-2-rl-implementation-strategy-for-advanced-infe.md +90 -0
- package/docs/adr/ADR-v3.4.0-001-deep-research-architecture.md +217 -0
- package/docs/adr/ADR-v3.4.0-002-search-provider-selection.md +308 -0
- package/docs/adr/ADR-v3.4.0-003-lm-api-integration.md +475 -0
- package/docs/adr/ADR-v3.7.0-001-everything-claude-code-integration.md +102 -0
- package/docs/enterprise-knowledge-management.md +1737 -0
- package/docs/evolution-from-musubi-to-musubix.md +2170 -0
- package/docs/experiments/EXPERIMENT-ASSISTANT-AXIS-DRIFT-DETECTION.md +155 -0
- package/docs/getting-started-with-sdd.md +1602 -0
- package/docs/moodle-refactering-codegraph-musubix.md +391 -0
- package/docs/moodle-refactering-codegraph.md +278 -0
- package/docs/overview/MUSUBIX-CodeGraph.md +322 -0
- package/docs/overview/MUSUBIX-Core.md +671 -0
- package/docs/overview/MUSUBIX-Decisions.md +494 -0
- package/docs/overview/MUSUBIX-FormalVerify.md +566 -0
- package/docs/overview/MUSUBIX-Knowledge.md +1231 -0
- package/docs/overview/MUSUBIX-Learning.md +837 -0
- package/docs/overview/MUSUBIX-MCP-Server.md +535 -0
- package/docs/overview/MUSUBIX-Overview.md +264 -0
- package/docs/overview/MUSUBIX-Phase1-Complete.md +271 -0
- package/docs/overview/MUSUBIX-Phase2-Complete.md +310 -0
- package/docs/overview/MUSUBIX-Policy.md +477 -0
- package/docs/overview/MUSUBIX-Roadmap-v2.md +399 -0
- package/docs/overview/MUSUBIX-Security-Plan.md +939 -0
- package/docs/overview/MUSUBIX-Security-v2.1.md +668 -0
- package/docs/overview/MUSUBIX-Security.md +891 -0
- package/docs/overview/MUSUBIX-YATA.md +666 -0
- package/docs/overview/MUSUBIX-v2.2.0-Advanced-Learning.md +513 -0
- package/docs/overview/Neuro-SymbolicAI.md +159 -0
- package/docs/packages/knowledge.md +594 -0
- package/docs/qiita/musubix-v3.6.0-fastrender-insights.md +625 -0
- package/docs/qiita-linux-kernel-knowledge-graph.md +596 -0
- package/docs/qiita-musubix-assistant-axis.md +380 -0
- package/package.json +58 -52
- package/scripts/generate-quality-gate-report.ts +106 -0
- package/scripts/postinstall.js +94 -0
- package/scripts/register-release-knowledge.ts +127 -0
- package/steering/.musubi-version +1 -0
- package/steering/product.ja.md +572 -0
- package/steering/project.yml +66 -0
- package/steering/rules/constitution.md +491 -0
- package/steering/structure.ja.md +503 -0
- package/steering/tech.ja.md +208 -0
- package/dist/index.d.ts +0 -25
- package/dist/index.d.ts.map +0 -1
- package/dist/index.js +0 -74
- package/dist/index.js.map +0 -1
|
@@ -0,0 +1,111 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: eval-harness
|
|
3
|
+
description: pass@kメトリクスでAIコード生成の品質を評価。Capability/Regression評価をサポート。
|
|
4
|
+
license: MIT
|
|
5
|
+
version: 1.1.0
|
|
6
|
+
triggers:
|
|
7
|
+
- /eval
|
|
8
|
+
- 機能評価
|
|
9
|
+
- 回帰テスト
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
# Eval Harness
|
|
13
|
+
|
|
14
|
+
> **要約**: AIコード生成の品質を定量的に評価。pass@kメトリクスと複数の評価タイプをサポート。
|
|
15
|
+
|
|
16
|
+
## 📊 評価タイプ
|
|
17
|
+
|
|
18
|
+
### 1. Capability Eval (REQ-EH-001)
|
|
19
|
+
|
|
20
|
+
**WHEN** 新機能の品質評価が必要
|
|
21
|
+
**DO** 以下のフォーマットで定義
|
|
22
|
+
|
|
23
|
+
```markdown
|
|
24
|
+
[CAPABILITY EVAL: <feature-name>]
|
|
25
|
+
Task: <達成タスク>
|
|
26
|
+
Success Criteria:
|
|
27
|
+
- [ ] 基準1
|
|
28
|
+
- [ ] 基準2
|
|
29
|
+
- [ ] 基準3
|
|
30
|
+
Expected Output: <期待出力>
|
|
31
|
+
Test Command: <テストコマンド>
|
|
32
|
+
```
|
|
33
|
+
|
|
34
|
+
---
|
|
35
|
+
|
|
36
|
+
### 2. Regression Eval (REQ-EH-002)
|
|
37
|
+
|
|
38
|
+
**WHEN** 既存機能の品質維持を確認
|
|
39
|
+
**DO** 以下のフォーマットで定義
|
|
40
|
+
|
|
41
|
+
```markdown
|
|
42
|
+
[REGRESSION EVAL: <feature-name>]
|
|
43
|
+
Baseline: <Git SHA / チェックポイント>
|
|
44
|
+
Tests:
|
|
45
|
+
- test-1: PASS/FAIL
|
|
46
|
+
- test-2: PASS/FAIL
|
|
47
|
+
Result: X/Y passed (previously Y/Y)
|
|
48
|
+
Regression: Yes/No
|
|
49
|
+
```
|
|
50
|
+
|
|
51
|
+
---
|
|
52
|
+
|
|
53
|
+
## 📈 pass@k Metrics (REQ-EH-003)
|
|
54
|
+
|
|
55
|
+
| メトリクス | 定義 | 用途 |
|
|
56
|
+
|-----------|------|------|
|
|
57
|
+
| **pass@1** | 初回試行成功率 | 基本信頼度 |
|
|
58
|
+
| **pass@3** | 3回中1回以上成功 | 一般ターゲット |
|
|
59
|
+
| **consecutive@3** | 3回連続成功 | クリティカルパス |
|
|
60
|
+
|
|
61
|
+
**計算**:
|
|
62
|
+
```
|
|
63
|
+
pass@1 = 成功数 / 試行数
|
|
64
|
+
pass@k = 1 - C(n-c, k) / C(n, k)
|
|
65
|
+
consecutive@3 = 連続成功シーケンス数 / 可能シーケンス数
|
|
66
|
+
```
|
|
67
|
+
|
|
68
|
+
**レポート例**:
|
|
69
|
+
```
|
|
70
|
+
📊 Eval Report: user-auth
|
|
71
|
+
━━━━━━━━━━━━━━━━━━━━
|
|
72
|
+
pass@1: 80% (8/10)
|
|
73
|
+
pass@3: 95%
|
|
74
|
+
consecutive@3: 60%
|
|
75
|
+
━━━━━━━━━━━━━━━━━━━━
|
|
76
|
+
Status: ✅ Meets target
|
|
77
|
+
```
|
|
78
|
+
|
|
79
|
+
---
|
|
80
|
+
|
|
81
|
+
## 🔍 Grader Types (REQ-EH-004/005)
|
|
82
|
+
|
|
83
|
+
| タイプ | 説明 | 使用場面 |
|
|
84
|
+
|--------|------|---------|
|
|
85
|
+
| **Code-Based** | コマンド実行で判定 | 決定的な検証 |
|
|
86
|
+
| **Model-Based** | LLMで判定 | 自由形式の評価 |
|
|
87
|
+
| **Human** | 人手で判定 | 主観的品質評価 |
|
|
88
|
+
|
|
89
|
+
### Human Grader Template
|
|
90
|
+
|
|
91
|
+
```markdown
|
|
92
|
+
[HUMAN GRADE: <feature>]
|
|
93
|
+
Reviewer: @username
|
|
94
|
+
Checklist:
|
|
95
|
+
- [ ] 仕様を満たしている
|
|
96
|
+
- [ ] エッジケース考慮
|
|
97
|
+
- [ ] API互換性維持
|
|
98
|
+
- [ ] セキュリティ問題なし
|
|
99
|
+
Verdict: PASS/FAIL
|
|
100
|
+
Notes: [コメント]
|
|
101
|
+
```
|
|
102
|
+
|
|
103
|
+
---
|
|
104
|
+
|
|
105
|
+
## トレーサビリティ
|
|
106
|
+
|
|
107
|
+
- REQ-EH-001: Capability Eval Definition
|
|
108
|
+
- REQ-EH-002: Regression Eval Definition
|
|
109
|
+
- REQ-EH-003: pass@k Metrics
|
|
110
|
+
- REQ-EH-004: Grader Types
|
|
111
|
+
- REQ-EH-005: Human Grader Support
|
|
@@ -0,0 +1,158 @@
|
|
|
1
|
+
# Capability Eval Examples
|
|
2
|
+
|
|
3
|
+
## Example 1: User Authentication Feature
|
|
4
|
+
|
|
5
|
+
```markdown
|
|
6
|
+
[CAPABILITY EVAL: user-authentication]
|
|
7
|
+
|
|
8
|
+
Task: JWTベースのユーザー認証機能を実装する
|
|
9
|
+
|
|
10
|
+
Success Criteria:
|
|
11
|
+
- [x] ログインエンドポイントが正常に動作する
|
|
12
|
+
- [x] JWTトークンが正しく生成される
|
|
13
|
+
- [x] トークン検証ミドルウェアが機能する
|
|
14
|
+
- [ ] リフレッシュトークンが実装されている
|
|
15
|
+
|
|
16
|
+
Expected Output:
|
|
17
|
+
- POST /api/auth/login → 200 OK + JWT token
|
|
18
|
+
- GET /api/protected → 401 without token, 200 with valid token
|
|
19
|
+
|
|
20
|
+
Constraints:
|
|
21
|
+
- bcryptを使用したパスワードハッシュ
|
|
22
|
+
- トークン有効期限: 15分
|
|
23
|
+
- リフレッシュトークン有効期限: 7日
|
|
24
|
+
|
|
25
|
+
Test Command: npm run test:auth
|
|
26
|
+
|
|
27
|
+
Result: 3/4 criteria met
|
|
28
|
+
Status: PARTIAL PASS
|
|
29
|
+
```
|
|
30
|
+
|
|
31
|
+
---
|
|
32
|
+
|
|
33
|
+
## Example 2: API Rate Limiting
|
|
34
|
+
|
|
35
|
+
```markdown
|
|
36
|
+
[CAPABILITY EVAL: api-rate-limiting]
|
|
37
|
+
|
|
38
|
+
Task: APIエンドポイントにレート制限を実装する
|
|
39
|
+
|
|
40
|
+
Success Criteria:
|
|
41
|
+
- [x] IPベースのレート制限が機能する
|
|
42
|
+
- [x] 制限超過時に429エラーを返す
|
|
43
|
+
- [x] X-RateLimit-* ヘッダーが正しく設定される
|
|
44
|
+
- [x] Redis/メモリでの状態管理が動作する
|
|
45
|
+
- [x] ホワイトリストIPが除外される
|
|
46
|
+
|
|
47
|
+
Expected Output:
|
|
48
|
+
- 正常リクエスト → 200 OK + X-RateLimit-Remaining: 99
|
|
49
|
+
- 制限超過 → 429 Too Many Requests + Retry-After header
|
|
50
|
+
|
|
51
|
+
Constraints:
|
|
52
|
+
- 制限: 100 requests/minute/IP
|
|
53
|
+
- スライディングウィンドウアルゴリズム
|
|
54
|
+
|
|
55
|
+
Test Command: npm run test:rate-limit
|
|
56
|
+
|
|
57
|
+
Result: 5/5 criteria met
|
|
58
|
+
Status: PASS
|
|
59
|
+
```
|
|
60
|
+
|
|
61
|
+
---
|
|
62
|
+
|
|
63
|
+
## Example 3: Data Export Feature
|
|
64
|
+
|
|
65
|
+
```markdown
|
|
66
|
+
[CAPABILITY EVAL: data-export-csv]
|
|
67
|
+
|
|
68
|
+
Task: ユーザーデータをCSV形式でエクスポートする機能を実装する
|
|
69
|
+
|
|
70
|
+
Success Criteria:
|
|
71
|
+
- [x] CSVファイルが正しいフォーマットで生成される
|
|
72
|
+
- [x] 大量データ(10万件以上)でもメモリ効率が良い
|
|
73
|
+
- [x] ダウンロードAPIエンドポイントが動作する
|
|
74
|
+
- [ ] 非同期エクスポートがサポートされる
|
|
75
|
+
- [x] 文字エンコーディング(UTF-8 BOM)が正しい
|
|
76
|
+
|
|
77
|
+
Expected Output:
|
|
78
|
+
- GET /api/export/users → CSV file download
|
|
79
|
+
- Content-Type: text/csv; charset=utf-8
|
|
80
|
+
- Content-Disposition: attachment; filename="users-YYYY-MM-DD.csv"
|
|
81
|
+
|
|
82
|
+
Constraints:
|
|
83
|
+
- ストリーミング処理でメモリ消費を抑制
|
|
84
|
+
- 最大エクスポート件数: 100万件
|
|
85
|
+
|
|
86
|
+
Test Command: npm run test:export
|
|
87
|
+
|
|
88
|
+
Result: 4/5 criteria met
|
|
89
|
+
Status: PARTIAL PASS
|
|
90
|
+
|
|
91
|
+
Notes:
|
|
92
|
+
- 非同期エクスポートは次のスプリントで実装予定
|
|
93
|
+
```
|
|
94
|
+
|
|
95
|
+
---
|
|
96
|
+
|
|
97
|
+
## Example 4: Search Feature with Elasticsearch
|
|
98
|
+
|
|
99
|
+
```markdown
|
|
100
|
+
[CAPABILITY EVAL: elasticsearch-search]
|
|
101
|
+
|
|
102
|
+
Task: Elasticsearchを使用した全文検索機能を実装する
|
|
103
|
+
|
|
104
|
+
Success Criteria:
|
|
105
|
+
- [x] 基本的なキーワード検索が動作する
|
|
106
|
+
- [x] ファセット検索(フィルタリング)がサポートされる
|
|
107
|
+
- [x] ハイライト機能が動作する
|
|
108
|
+
- [x] ページネーションが正しく実装される
|
|
109
|
+
- [x] 検索結果のソートが機能する
|
|
110
|
+
- [x] 検索サジェスト(オートコンプリート)が動作する
|
|
111
|
+
|
|
112
|
+
Expected Output:
|
|
113
|
+
- GET /api/search?q=keyword → { hits: [...], total: N, facets: {...} }
|
|
114
|
+
- レスポンス時間: 200ms以内
|
|
115
|
+
|
|
116
|
+
Constraints:
|
|
117
|
+
- Elasticsearch 8.x 使用
|
|
118
|
+
- 日本語形態素解析(kuromoji)対応
|
|
119
|
+
- 同義語辞書のサポート
|
|
120
|
+
|
|
121
|
+
Test Command: npm run test:search
|
|
122
|
+
|
|
123
|
+
Result: 6/6 criteria met
|
|
124
|
+
Status: PASS
|
|
125
|
+
```
|
|
126
|
+
|
|
127
|
+
---
|
|
128
|
+
|
|
129
|
+
## Template for New Capability Eval
|
|
130
|
+
|
|
131
|
+
```markdown
|
|
132
|
+
[CAPABILITY EVAL: <feature-name>]
|
|
133
|
+
|
|
134
|
+
Task: <達成すべきタスクの詳細説明>
|
|
135
|
+
|
|
136
|
+
Success Criteria:
|
|
137
|
+
- [ ] <成功基準1 - 測定可能で明確に>
|
|
138
|
+
- [ ] <成功基準2>
|
|
139
|
+
- [ ] <成功基準3>
|
|
140
|
+
- [ ] <成功基準4>
|
|
141
|
+
- [ ] <成功基準5>
|
|
142
|
+
|
|
143
|
+
Expected Output:
|
|
144
|
+
<期待される出力の具体例>
|
|
145
|
+
|
|
146
|
+
Constraints:
|
|
147
|
+
- <制約条件1>
|
|
148
|
+
- <制約条件2>
|
|
149
|
+
- <制約条件3>
|
|
150
|
+
|
|
151
|
+
Test Command: <テスト実行コマンド>
|
|
152
|
+
|
|
153
|
+
Result: X/Y criteria met
|
|
154
|
+
Status: <PASS|PARTIAL PASS|FAIL>
|
|
155
|
+
|
|
156
|
+
Notes:
|
|
157
|
+
- <追加の注記や改善点>
|
|
158
|
+
```
|
|
@@ -0,0 +1,326 @@
|
|
|
1
|
+
# Human Grader Templates
|
|
2
|
+
|
|
3
|
+
## Standard Human Grader Template
|
|
4
|
+
|
|
5
|
+
```markdown
|
|
6
|
+
[HUMAN GRADE: <feature-name>]
|
|
7
|
+
|
|
8
|
+
Reviewer: @<username>
|
|
9
|
+
Date: YYYY-MM-DD
|
|
10
|
+
Review Type: <Initial|Re-review|Final>
|
|
11
|
+
|
|
12
|
+
---
|
|
13
|
+
|
|
14
|
+
## Checklist
|
|
15
|
+
|
|
16
|
+
### Functional Requirements
|
|
17
|
+
- [ ] 仕様を満たしている
|
|
18
|
+
- [ ] 全ての機能が正常に動作する
|
|
19
|
+
- [ ] エッジケースが考慮されている
|
|
20
|
+
- [ ] エラーハンドリングが適切
|
|
21
|
+
|
|
22
|
+
### Code Quality
|
|
23
|
+
- [ ] コードが読みやすい
|
|
24
|
+
- [ ] 命名規則が一貫している
|
|
25
|
+
- [ ] 重複コードがない(DRY原則)
|
|
26
|
+
- [ ] 適切な抽象化レベル
|
|
27
|
+
|
|
28
|
+
### Compatibility
|
|
29
|
+
- [ ] 既存API互換性が維持されている
|
|
30
|
+
- [ ] 後方互換性が確保されている
|
|
31
|
+
- [ ] 破壊的変更がある場合は文書化されている
|
|
32
|
+
|
|
33
|
+
### Security
|
|
34
|
+
- [ ] セキュリティ上の懸念がない
|
|
35
|
+
- [ ] 入力バリデーションが適切
|
|
36
|
+
- [ ] 機密データの取り扱いが安全
|
|
37
|
+
- [ ] 認証・認可が正しく実装されている
|
|
38
|
+
|
|
39
|
+
### Performance
|
|
40
|
+
- [ ] パフォーマンス要件を満たす
|
|
41
|
+
- [ ] N+1クエリがない
|
|
42
|
+
- [ ] 適切なキャッシュ戦略
|
|
43
|
+
- [ ] メモリリークの懸念がない
|
|
44
|
+
|
|
45
|
+
### Testing
|
|
46
|
+
- [ ] テストカバレッジが十分(80%以上)
|
|
47
|
+
- [ ] ユニットテストが存在する
|
|
48
|
+
- [ ] 統合テストが存在する
|
|
49
|
+
- [ ] エッジケースのテストがある
|
|
50
|
+
|
|
51
|
+
### Documentation
|
|
52
|
+
- [ ] コードコメントが適切
|
|
53
|
+
- [ ] APIドキュメントが更新されている
|
|
54
|
+
- [ ] READMEが更新されている(必要な場合)
|
|
55
|
+
|
|
56
|
+
---
|
|
57
|
+
|
|
58
|
+
## Scores (1-5)
|
|
59
|
+
|
|
60
|
+
| Category | Score | Notes |
|
|
61
|
+
|-----------------|-------|--------------------------|
|
|
62
|
+
| Functionality | /5 | |
|
|
63
|
+
| Code Quality | /5 | |
|
|
64
|
+
| Maintainability | /5 | |
|
|
65
|
+
| Documentation | /5 | |
|
|
66
|
+
|
|
67
|
+
**Total: XX/20**
|
|
68
|
+
|
|
69
|
+
---
|
|
70
|
+
|
|
71
|
+
## Detailed Feedback
|
|
72
|
+
|
|
73
|
+
### Strengths
|
|
74
|
+
1.
|
|
75
|
+
2.
|
|
76
|
+
3.
|
|
77
|
+
|
|
78
|
+
### Areas for Improvement
|
|
79
|
+
1.
|
|
80
|
+
2.
|
|
81
|
+
3.
|
|
82
|
+
|
|
83
|
+
### Critical Issues (if any)
|
|
84
|
+
-
|
|
85
|
+
|
|
86
|
+
---
|
|
87
|
+
|
|
88
|
+
## Verdict
|
|
89
|
+
|
|
90
|
+
**Decision: <PASS|FAIL|NEEDS_REVISION>**
|
|
91
|
+
|
|
92
|
+
### If NEEDS_REVISION, required changes:
|
|
93
|
+
1.
|
|
94
|
+
2.
|
|
95
|
+
|
|
96
|
+
### Follow-up Actions:
|
|
97
|
+
- [ ]
|
|
98
|
+
- [ ]
|
|
99
|
+
|
|
100
|
+
---
|
|
101
|
+
|
|
102
|
+
**Signature:** [Reviewer Name]
|
|
103
|
+
**Date:** YYYY-MM-DD
|
|
104
|
+
```
|
|
105
|
+
|
|
106
|
+
---
|
|
107
|
+
|
|
108
|
+
## Specialized Templates
|
|
109
|
+
|
|
110
|
+
### Security Review Template
|
|
111
|
+
|
|
112
|
+
```markdown
|
|
113
|
+
[HUMAN GRADE: <feature-name> - Security Review]
|
|
114
|
+
|
|
115
|
+
Reviewer: @<security-reviewer>
|
|
116
|
+
Date: YYYY-MM-DD
|
|
117
|
+
Review Type: Security Audit
|
|
118
|
+
|
|
119
|
+
---
|
|
120
|
+
|
|
121
|
+
## Security Checklist
|
|
122
|
+
|
|
123
|
+
### Authentication
|
|
124
|
+
- [ ] 認証メカニズムが適切
|
|
125
|
+
- [ ] セッション管理が安全
|
|
126
|
+
- [ ] パスワードポリシーが適切
|
|
127
|
+
- [ ] MFAの考慮
|
|
128
|
+
|
|
129
|
+
### Authorization
|
|
130
|
+
- [ ] アクセス制御が正しく実装
|
|
131
|
+
- [ ] 権限昇格の脆弱性なし
|
|
132
|
+
- [ ] リソースベースの認可
|
|
133
|
+
|
|
134
|
+
### Data Protection
|
|
135
|
+
- [ ] 機密データの暗号化
|
|
136
|
+
- [ ] PII/PHIの適切な取り扱い
|
|
137
|
+
- [ ] データの最小化原則
|
|
138
|
+
|
|
139
|
+
### Input Validation
|
|
140
|
+
- [ ] 全入力のバリデーション
|
|
141
|
+
- [ ] SQLインジェクション対策
|
|
142
|
+
- [ ] XSS対策
|
|
143
|
+
- [ ] CSRF対策
|
|
144
|
+
|
|
145
|
+
### API Security
|
|
146
|
+
- [ ] レート制限
|
|
147
|
+
- [ ] 適切なエラーメッセージ
|
|
148
|
+
- [ ] ログの適切な出力
|
|
149
|
+
|
|
150
|
+
---
|
|
151
|
+
|
|
152
|
+
## Risk Assessment
|
|
153
|
+
|
|
154
|
+
| Vulnerability Type | Risk Level | Status |
|
|
155
|
+
|-------------------|------------|--------|
|
|
156
|
+
| Injection | Low/Med/High | ✅/⚠️/❌ |
|
|
157
|
+
| Broken Auth | Low/Med/High | ✅/⚠️/❌ |
|
|
158
|
+
| XSS | Low/Med/High | ✅/⚠️/❌ |
|
|
159
|
+
| CSRF | Low/Med/High | ✅/⚠️/❌ |
|
|
160
|
+
| Broken Access | Low/Med/High | ✅/⚠️/❌ |
|
|
161
|
+
|
|
162
|
+
---
|
|
163
|
+
|
|
164
|
+
## Findings
|
|
165
|
+
|
|
166
|
+
### Critical (Must Fix)
|
|
167
|
+
1.
|
|
168
|
+
|
|
169
|
+
### High (Should Fix)
|
|
170
|
+
1.
|
|
171
|
+
|
|
172
|
+
### Medium (Consider Fixing)
|
|
173
|
+
1.
|
|
174
|
+
|
|
175
|
+
### Low (Nice to Fix)
|
|
176
|
+
1.
|
|
177
|
+
|
|
178
|
+
---
|
|
179
|
+
|
|
180
|
+
**Security Verdict: <APPROVED|REJECTED|CONDITIONAL>**
|
|
181
|
+
```
|
|
182
|
+
|
|
183
|
+
---
|
|
184
|
+
|
|
185
|
+
### Architecture Review Template
|
|
186
|
+
|
|
187
|
+
```markdown
|
|
188
|
+
[HUMAN GRADE: <feature-name> - Architecture Review]
|
|
189
|
+
|
|
190
|
+
Reviewer: @<architect>
|
|
191
|
+
Date: YYYY-MM-DD
|
|
192
|
+
Review Type: Architecture Decision
|
|
193
|
+
|
|
194
|
+
---
|
|
195
|
+
|
|
196
|
+
## Architecture Checklist
|
|
197
|
+
|
|
198
|
+
### Design Principles
|
|
199
|
+
- [ ] Single Responsibility Principle
|
|
200
|
+
- [ ] Open/Closed Principle
|
|
201
|
+
- [ ] Liskov Substitution Principle
|
|
202
|
+
- [ ] Interface Segregation Principle
|
|
203
|
+
- [ ] Dependency Inversion Principle
|
|
204
|
+
|
|
205
|
+
### Patterns
|
|
206
|
+
- [ ] 適切なデザインパターンの使用
|
|
207
|
+
- [ ] アンチパターンの回避
|
|
208
|
+
- [ ] 過度な複雑さの回避
|
|
209
|
+
|
|
210
|
+
### Scalability
|
|
211
|
+
- [ ] 水平スケーラビリティの考慮
|
|
212
|
+
- [ ] ボトルネックの特定と対策
|
|
213
|
+
- [ ] 負荷分散の戦略
|
|
214
|
+
|
|
215
|
+
### Maintainability
|
|
216
|
+
- [ ] モジュール性
|
|
217
|
+
- [ ] テスタビリティ
|
|
218
|
+
- [ ] 変更容易性
|
|
219
|
+
|
|
220
|
+
---
|
|
221
|
+
|
|
222
|
+
## Architecture Decision Record
|
|
223
|
+
|
|
224
|
+
### Context
|
|
225
|
+
<決定の背景>
|
|
226
|
+
|
|
227
|
+
### Decision
|
|
228
|
+
<選択したアプローチ>
|
|
229
|
+
|
|
230
|
+
### Consequences
|
|
231
|
+
<この決定の結果・影響>
|
|
232
|
+
|
|
233
|
+
### Alternatives Considered
|
|
234
|
+
1. <代替案1>: <却下理由>
|
|
235
|
+
2. <代替案2>: <却下理由>
|
|
236
|
+
|
|
237
|
+
---
|
|
238
|
+
|
|
239
|
+
**Architecture Verdict: <APPROVED|NEEDS_REVISION|REJECTED>**
|
|
240
|
+
```
|
|
241
|
+
|
|
242
|
+
---
|
|
243
|
+
|
|
244
|
+
### UX Review Template
|
|
245
|
+
|
|
246
|
+
```markdown
|
|
247
|
+
[HUMAN GRADE: <feature-name> - UX Review]
|
|
248
|
+
|
|
249
|
+
Reviewer: @<ux-designer>
|
|
250
|
+
Date: YYYY-MM-DD
|
|
251
|
+
Review Type: User Experience
|
|
252
|
+
|
|
253
|
+
---
|
|
254
|
+
|
|
255
|
+
## UX Checklist
|
|
256
|
+
|
|
257
|
+
### Usability
|
|
258
|
+
- [ ] 直感的なUI
|
|
259
|
+
- [ ] 一貫したデザイン言語
|
|
260
|
+
- [ ] 適切なフィードバック
|
|
261
|
+
- [ ] エラー回復が容易
|
|
262
|
+
|
|
263
|
+
### Accessibility
|
|
264
|
+
- [ ] キーボードナビゲーション
|
|
265
|
+
- [ ] スクリーンリーダー対応
|
|
266
|
+
- [ ] 色覚多様性への配慮
|
|
267
|
+
- [ ] 適切なコントラスト比
|
|
268
|
+
|
|
269
|
+
### Performance Perception
|
|
270
|
+
- [ ] ローディング状態の表示
|
|
271
|
+
- [ ] スケルトンUI
|
|
272
|
+
- [ ] 楽観的更新
|
|
273
|
+
- [ ] 適切なアニメーション
|
|
274
|
+
|
|
275
|
+
---
|
|
276
|
+
|
|
277
|
+
## Heuristic Evaluation
|
|
278
|
+
|
|
279
|
+
| Heuristic | Score (1-5) |
|
|
280
|
+
|------------------------------|-------------|
|
|
281
|
+
| Visibility of System Status | /5 |
|
|
282
|
+
| Match with Real World | /5 |
|
|
283
|
+
| User Control & Freedom | /5 |
|
|
284
|
+
| Consistency & Standards | /5 |
|
|
285
|
+
| Error Prevention | /5 |
|
|
286
|
+
| Recognition over Recall | /5 |
|
|
287
|
+
| Flexibility & Efficiency | /5 |
|
|
288
|
+
| Aesthetic & Minimal Design | /5 |
|
|
289
|
+
| Help Users with Errors | /5 |
|
|
290
|
+
| Help & Documentation | /5 |
|
|
291
|
+
|
|
292
|
+
**Total: XX/50**
|
|
293
|
+
|
|
294
|
+
---
|
|
295
|
+
|
|
296
|
+
**UX Verdict: <APPROVED|NEEDS_IMPROVEMENT|REJECTED>**
|
|
297
|
+
```
|
|
298
|
+
|
|
299
|
+
---
|
|
300
|
+
|
|
301
|
+
## Usage Guidelines
|
|
302
|
+
|
|
303
|
+
### When to Use Human Grader
|
|
304
|
+
|
|
305
|
+
1. **主観的判断が必要な場合**
|
|
306
|
+
- UI/UXの品質評価
|
|
307
|
+
- コードの読みやすさ
|
|
308
|
+
- 設計の妥当性
|
|
309
|
+
|
|
310
|
+
2. **ドメイン専門知識が必要な場合**
|
|
311
|
+
- ビジネスロジックの正確性
|
|
312
|
+
- 法規制への準拠
|
|
313
|
+
- 業界標準への適合
|
|
314
|
+
|
|
315
|
+
3. **自動化が困難な場合**
|
|
316
|
+
- クロスブラウザ互換性
|
|
317
|
+
- アクセシビリティ
|
|
318
|
+
- パフォーマンス体感
|
|
319
|
+
|
|
320
|
+
### Best Practices
|
|
321
|
+
|
|
322
|
+
- 複数のレビュアーによる評価を推奨
|
|
323
|
+
- チェックリストは事前に合意して一貫性を保つ
|
|
324
|
+
- 主観的判断には必ず理由を明記
|
|
325
|
+
- NEEDS_REVISIONの場合は具体的な改善点を列挙
|
|
326
|
+
- フォローアップアクションを明確に定義
|