opencode-swarm 7.57.0 → 7.58.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.opencode/skills/brainstorm/SKILL.md +142 -0
- package/.opencode/skills/clarify/SKILL.md +103 -0
- package/.opencode/skills/clarify-spec/SKILL.md +58 -0
- package/.opencode/skills/codebase-review-swarm/INSTALL.md +75 -0
- package/.opencode/skills/codebase-review-swarm/README.md +44 -0
- package/.opencode/skills/codebase-review-swarm/SKILL.md +65 -0
- package/.opencode/skills/codebase-review-swarm/agents/openai.yaml +6 -0
- package/.opencode/skills/codebase-review-swarm/assets/jsonl-schemas.md +239 -0
- package/.opencode/skills/codebase-review-swarm/assets/review-report-template.md +244 -0
- package/.opencode/skills/codebase-review-swarm/references/compatibility-and-research-notes.md +25 -0
- package/.opencode/skills/codebase-review-swarm/references/full-v7-source-prompt.md +2373 -0
- package/.opencode/skills/codebase-review-swarm/references/review-protocol-v8.2.md +310 -0
- package/.opencode/skills/codebase-review-swarm/scripts/init-review-run.py +134 -0
- package/.opencode/skills/codebase-review-swarm/scripts/validate-skill-package.py +62 -0
- package/.opencode/skills/consult/SKILL.md +16 -0
- package/.opencode/skills/council/SKILL.md +147 -0
- package/.opencode/skills/critic-gate/SKILL.md +59 -0
- package/.opencode/skills/deep-dive/SKILL.md +142 -0
- package/.opencode/skills/design-docs/SKILL.md +81 -0
- package/.opencode/skills/discover/SKILL.md +20 -0
- package/.opencode/skills/execute/SKILL.md +191 -0
- package/.opencode/skills/issue-ingest/SKILL.md +64 -0
- package/.opencode/skills/phase-wrap/SKILL.md +123 -0
- package/.opencode/skills/plan/SKILL.md +293 -0
- package/.opencode/skills/pre-phase-briefing/SKILL.md +69 -0
- package/.opencode/skills/resume/SKILL.md +23 -0
- package/.opencode/skills/specify/SKILL.md +175 -0
- package/.opencode/skills/swarm-pr-feedback/SKILL.md +192 -0
- package/.opencode/skills/swarm-pr-review/SKILL.md +884 -0
- package/dist/cli/index.js +1350 -1158
- package/dist/commands/command-dispatch.d.ts +1 -0
- package/dist/commands/index.d.ts +1 -0
- package/dist/commands/registry.d.ts +15 -14
- package/dist/config/bundled-skills.d.ts +25 -0
- package/dist/index.js +2815 -2605
- package/dist/tools/index.d.ts +3 -1
- package/dist/tools/tool-metadata.d.ts +1 -1
- package/dist/tools/web-search.d.ts +2 -1
- package/package.json +20 -1
|
@@ -0,0 +1,239 @@
|
|
|
1
|
+
# JSONL and Structured Block Schemas
|
|
2
|
+
|
|
3
|
+
Use these exact fields unless a field is not applicable, in which case write `N/A` or an explicit reason. Prefer one block per record in markdown ledgers and JSON object per line in `.jsonl` artifacts.
|
|
4
|
+
|
|
5
|
+
## Coverage unit
|
|
6
|
+
|
|
7
|
+
```json
|
|
8
|
+
{"unit_id":"COV-001","track":"security","unit_type":"trust_boundary","path_or_id":"BOUNDARY-001","status":"UNREVIEWED","depth_tier":"focused|multi_track|complete_integrated|custom","passes_required":["candidate","deterministic_tool","caller_callee_trace","test_or_guard_check","reviewer_validation","critic_if_required"],"passes_completed":[],"evidence_refs":[],"deterministic_checks":[],"runtime_checks_or_reason":"","validation_refs":[],"remaining_uncertainty":"","reason":"","updated_at":"<iso8601>"}
|
|
9
|
+
```
|
|
10
|
+
|
|
11
|
+
Terminal `status` values: `REVIEWED`, `NOT_APPLICABLE`, `SKIPPED_WITH_REASON`, `BLOCKED`. Final report is forbidden for selected tracks while any unit remains `UNASSIGNED` or `UNREVIEWED`. `REVIEWED` is valid only when `passes_completed` satisfies the selected track's `TRACK_DEPTH_PLAN`.
|
|
12
|
+
|
|
13
|
+
## Track depth plan
|
|
14
|
+
|
|
15
|
+
Write one block per selected track to `ledgers/review-depth-plan.md` after track selection and before Phase 1.
|
|
16
|
+
|
|
17
|
+
```text
|
|
18
|
+
TRACK_DEPTH_PLAN
|
|
19
|
+
track: <A|B|C|D|E|F|G|1X>
|
|
20
|
+
mode: focused | multi_track | complete_integrated | custom
|
|
21
|
+
coverage_unit_basis: <public_surface | trust_boundary | test_cluster | ui_component_family | hot_path | dependency_family | ai_surface | domain_component | cross_boundary_pair>
|
|
22
|
+
expected_units: <count or unknown_until_inventory>
|
|
23
|
+
granularity_rule: <how complex units are split>
|
|
24
|
+
required_passes: <inventory excerpts, candidate pass, deterministic tool pass, caller/callee trace, tests/claims check, validation, critic>
|
|
25
|
+
deterministic_tools_to_attempt: <commands/tools or N/A with reason>
|
|
26
|
+
runtime_validation_policy: <when to run, when to mark UNVERIFIED>
|
|
27
|
+
reviewer_batch_rule: <local reasoning unit definition>
|
|
28
|
+
critic_rule: <inline/final/enhancement/systemic>
|
|
29
|
+
non_dilution_check: <why this track is not shallower because of selected breadth>
|
|
30
|
+
END
|
|
31
|
+
```
|
|
32
|
+
|
|
33
|
+
## Candidate finding
|
|
34
|
+
|
|
35
|
+
```text
|
|
36
|
+
CANDIDATE_FINDING
|
|
37
|
+
id: <track>-<scope>-<sequence>
|
|
38
|
+
track: functionality | security | supply_chain | testing | ui_ux | performance | observability | ai_slop | docs_claims | cross_platform | cross_boundary
|
|
39
|
+
group: <short category>
|
|
40
|
+
provisional_severity: CRITICAL | HIGH | MEDIUM | LOW | INFO
|
|
41
|
+
confidence: HIGH | MEDIUM
|
|
42
|
+
grounding_assessment: HIGH | MEDIUM
|
|
43
|
+
file: <relative path>
|
|
44
|
+
line: <line or range>
|
|
45
|
+
exact_quote: <verbatim evidence>
|
|
46
|
+
title: <specific one-line title>
|
|
47
|
+
problem: <factual description>
|
|
48
|
+
impact: <why it matters>
|
|
49
|
+
likely_fix: <concrete likely remediation>
|
|
50
|
+
evidence_checked: <files, callers, configs, tests, docs, manifests, runtime paths checked>
|
|
51
|
+
alternative_interpretation: <what could make this wrong>
|
|
52
|
+
disproof_attempt: <required for CRITICAL/HIGH; recommended for all>
|
|
53
|
+
linked_claims: <claim ids or N/A>
|
|
54
|
+
linked_surfaces: <surface ids or N/A>
|
|
55
|
+
linked_boundaries: <boundary ids or N/A>
|
|
56
|
+
ai_pattern: <optional or N/A>
|
|
57
|
+
needs_runtime_validation: yes | no
|
|
58
|
+
size: S | M | L
|
|
59
|
+
END
|
|
60
|
+
```
|
|
61
|
+
|
|
62
|
+
## Enhancement candidate
|
|
63
|
+
|
|
64
|
+
```text
|
|
65
|
+
ENHANCEMENT_CANDIDATE
|
|
66
|
+
id: ENH-<track>-<sequence>
|
|
67
|
+
track: enhancement | architecture | code_quality | testing | ui_ux | performance | observability | resilience | developer_experience
|
|
68
|
+
domain: <specific subsystem or component family>
|
|
69
|
+
category: architecture | code_quality | simplification | developer_experience | performance | resilience | observability | ui_hierarchy | ui_interaction | ui_accessibility | ui_typography | ui_performance | ui_consistency | testing
|
|
70
|
+
value_level: high | medium | low
|
|
71
|
+
confidence: HIGH | MEDIUM
|
|
72
|
+
grounding_assessment: HIGH | MEDIUM
|
|
73
|
+
file: <relative path>
|
|
74
|
+
line: <line or range>
|
|
75
|
+
exact_quote: <verbatim current-state evidence>
|
|
76
|
+
title: <specific one-line title>
|
|
77
|
+
current_state: <what exists now, without calling it broken>
|
|
78
|
+
confirms_current_code_is_working: yes | no
|
|
79
|
+
enhancement: <specific implementable improvement>
|
|
80
|
+
expected_impact: <what improves>
|
|
81
|
+
effort: S | M | L
|
|
82
|
+
dependencies: <other enhancement ids or N/A>
|
|
83
|
+
alternative_interpretation: <why current design might be intentional>
|
|
84
|
+
disproof_attempt: <required for high-value; recommended for all>
|
|
85
|
+
rejection_risk: <what would make this a bad suggestion>
|
|
86
|
+
END
|
|
87
|
+
```
|
|
88
|
+
|
|
89
|
+
## Validated finding
|
|
90
|
+
|
|
91
|
+
```text
|
|
92
|
+
VALIDATED_FINDING
|
|
93
|
+
candidate_id:
|
|
94
|
+
status: CONFIRMED | DISPROVED | UNVERIFIED | PRE_EXISTING
|
|
95
|
+
final_severity: CRITICAL | HIGH | MEDIUM | LOW | INFO
|
|
96
|
+
confidence: HIGH | MEDIUM
|
|
97
|
+
grounding_assessment: HIGH | MEDIUM | LOW
|
|
98
|
+
file:
|
|
99
|
+
line:
|
|
100
|
+
exact_quote:
|
|
101
|
+
title:
|
|
102
|
+
problem:
|
|
103
|
+
impact:
|
|
104
|
+
fix:
|
|
105
|
+
validation_evidence:
|
|
106
|
+
disproof_reason: <required if DISPROVED>
|
|
107
|
+
verification_mode: STATIC | STATIC_PLUS_RUNTIME
|
|
108
|
+
runtime_validation: <command or N/A>
|
|
109
|
+
linked_claims:
|
|
110
|
+
linked_surfaces:
|
|
111
|
+
linked_boundaries:
|
|
112
|
+
ai_pattern: <same value from candidate or N/A>
|
|
113
|
+
inline_routing: CRITIC_REQUIRED | REVIEWER_FINALIZED | REVIEWER_DOWNGRADED
|
|
114
|
+
finalization_status: FINALIZED | DOWNGRADED | N/A
|
|
115
|
+
size: S | M | L
|
|
116
|
+
END
|
|
117
|
+
```
|
|
118
|
+
|
|
119
|
+
## Validated enhancement
|
|
120
|
+
|
|
121
|
+
```text
|
|
122
|
+
VALIDATED_ENHANCEMENT
|
|
123
|
+
candidate_id:
|
|
124
|
+
status: CONFIRMED_HIGH_VALUE | CONFIRMED_MEDIUM_VALUE | REJECTED | UNVERIFIED
|
|
125
|
+
track:
|
|
126
|
+
domain:
|
|
127
|
+
category:
|
|
128
|
+
confidence: HIGH | MEDIUM
|
|
129
|
+
grounding_assessment: HIGH | MEDIUM | LOW
|
|
130
|
+
file:
|
|
131
|
+
line:
|
|
132
|
+
exact_quote:
|
|
133
|
+
title:
|
|
134
|
+
current_state:
|
|
135
|
+
confirms_current_code_is_working: yes | no
|
|
136
|
+
enhancement:
|
|
137
|
+
expected_impact:
|
|
138
|
+
effort: S | M | L
|
|
139
|
+
validation_evidence:
|
|
140
|
+
dependency_map:
|
|
141
|
+
rejection_reason: <required if REJECTED>
|
|
142
|
+
END
|
|
143
|
+
```
|
|
144
|
+
|
|
145
|
+
## Critic result
|
|
146
|
+
|
|
147
|
+
```text
|
|
148
|
+
CRITIC_RESULT
|
|
149
|
+
finding_id:
|
|
150
|
+
verdict: UPHELD | REFINED | DOWNGRADED | OVERTURNED
|
|
151
|
+
original_severity: CRITICAL | HIGH
|
|
152
|
+
final_severity:
|
|
153
|
+
grounding_assessment: HIGH | MEDIUM | LOW
|
|
154
|
+
file:
|
|
155
|
+
line:
|
|
156
|
+
exact_quote:
|
|
157
|
+
title:
|
|
158
|
+
final_problem:
|
|
159
|
+
final_fix:
|
|
160
|
+
ai_pattern: <same value from validated finding or N/A>
|
|
161
|
+
verdict_reason:
|
|
162
|
+
coverage_gap:
|
|
163
|
+
END
|
|
164
|
+
```
|
|
165
|
+
|
|
166
|
+
## Enhancement critic result
|
|
167
|
+
|
|
168
|
+
```text
|
|
169
|
+
ENHANCEMENT_CRITIC_RESULT
|
|
170
|
+
enhancement_id:
|
|
171
|
+
verdict: UPHELD_HIGH_VALUE | UPHELD_MEDIUM_VALUE | REFINED | MERGED | DOWNGRADED | REJECTED
|
|
172
|
+
final_category:
|
|
173
|
+
final_title:
|
|
174
|
+
grounding_assessment: HIGH | MEDIUM | LOW
|
|
175
|
+
file:
|
|
176
|
+
line:
|
|
177
|
+
exact_quote:
|
|
178
|
+
final_enhancement:
|
|
179
|
+
expected_impact:
|
|
180
|
+
effort: S | M | L
|
|
181
|
+
dependencies:
|
|
182
|
+
verdict_reason:
|
|
183
|
+
END
|
|
184
|
+
```
|
|
185
|
+
|
|
186
|
+
## Test drift review
|
|
187
|
+
|
|
188
|
+
```text
|
|
189
|
+
TEST_DRIFT_REVIEW
|
|
190
|
+
related_findings:
|
|
191
|
+
commands_run:
|
|
192
|
+
behavior_assertions_verified:
|
|
193
|
+
stale_tests_found:
|
|
194
|
+
weak_assertions_found:
|
|
195
|
+
property_based_opportunities:
|
|
196
|
+
mutation_resilience_gaps:
|
|
197
|
+
remaining_uncertainty:
|
|
198
|
+
END
|
|
199
|
+
```
|
|
200
|
+
|
|
201
|
+
## Final critic check
|
|
202
|
+
|
|
203
|
+
```text
|
|
204
|
+
FINAL_CRITIC_CHECK
|
|
205
|
+
verdict: PASS | REVISE
|
|
206
|
+
required_revisions:
|
|
207
|
+
severity_adjustments:
|
|
208
|
+
findings_to_drop:
|
|
209
|
+
findings_to_reclassify_as_enhancements:
|
|
210
|
+
enhancements_to_reclassify_as_defects:
|
|
211
|
+
unsupported_report_claims:
|
|
212
|
+
missing_or_empty_ledgers:
|
|
213
|
+
unsupported_strengths:
|
|
214
|
+
coverage_note_fixes:
|
|
215
|
+
count_mismatches:
|
|
216
|
+
coverage_closure_failures:
|
|
217
|
+
depth_plan_failures:
|
|
218
|
+
selected_track_dilution_detected: yes | no
|
|
219
|
+
END
|
|
220
|
+
```
|
|
221
|
+
|
|
222
|
+
## Source-of-truth packet outline
|
|
223
|
+
|
|
224
|
+
```markdown
|
|
225
|
+
# Source of Truth Packet
|
|
226
|
+
|
|
227
|
+
## Repo Identity
|
|
228
|
+
## Tech Stack
|
|
229
|
+
## Commands
|
|
230
|
+
## Public Surfaces
|
|
231
|
+
## Trust Boundaries
|
|
232
|
+
## MCP and Agent Surfaces
|
|
233
|
+
## Claims Needing Verification
|
|
234
|
+
## Test and Quality Gates
|
|
235
|
+
## UI Applicability
|
|
236
|
+
## AI/Agent Applicability
|
|
237
|
+
## Review Track Recommendation
|
|
238
|
+
## Prohibited Assumptions
|
|
239
|
+
```
|
|
@@ -0,0 +1,244 @@
|
|
|
1
|
+
# Codebase Review Report
|
|
2
|
+
|
|
3
|
+
Generated: [timestamp]
|
|
4
|
+
Repository: [name/path]
|
|
5
|
+
Git HEAD: [SHA]
|
|
6
|
+
Selected Review Tracks: [tracks]
|
|
7
|
+
Skipped Tracks: [tracks and why]
|
|
8
|
+
Review Mode: [complete integrated | defect-focused | focused | enhancement-only | custom]
|
|
9
|
+
|
|
10
|
+
## Executive Summary
|
|
11
|
+
|
|
12
|
+
[2-5 sentences. Strongest confirmed themes only. No unvalidated or unquoted claims.]
|
|
13
|
+
|
|
14
|
+
## Review Scope and Method
|
|
15
|
+
|
|
16
|
+
- Phase 0 inventory completed: yes
|
|
17
|
+
- User-selected tracks:
|
|
18
|
+
- Explorer candidates generated:
|
|
19
|
+
- Reviewer validation completed:
|
|
20
|
+
- Inline critic used for CRITICAL/HIGH:
|
|
21
|
+
- Reviewer finalization used for MEDIUM/LOW:
|
|
22
|
+
- Enhancement critic used:
|
|
23
|
+
- Final whole-report critic verdict:
|
|
24
|
+
- Coverage closure verified: yes (N units reviewed, 0 unreviewed)
|
|
25
|
+
- Runtime validation commands run:
|
|
26
|
+
|
|
27
|
+
## Findings Count
|
|
28
|
+
|
|
29
|
+
```text
|
|
30
|
+
Defect Findings by Track:
|
|
31
|
+
functionality_correctness: C / H / M / L / I
|
|
32
|
+
security_privacy: C / H / M / L / I
|
|
33
|
+
llm_ai_security: C / H / M / L / I
|
|
34
|
+
supply_chain: C / H / M / L / I
|
|
35
|
+
testing_quality: C / H / M / L / I
|
|
36
|
+
ui_ux_accessibility: C / H / M / L / I
|
|
37
|
+
performance: C / H / M / L / I
|
|
38
|
+
observability: C / H / M / L / I
|
|
39
|
+
ai_slop_provenance: C / H / M / L / I
|
|
40
|
+
docs_claims_drift: C / H / M / L / I
|
|
41
|
+
cross_platform: C / H / M / L / I
|
|
42
|
+
cross_boundary: C / H / M / L / I
|
|
43
|
+
total: C / H / M / L / I
|
|
44
|
+
|
|
45
|
+
Validation Outcomes:
|
|
46
|
+
candidates_generated:
|
|
47
|
+
confirmed:
|
|
48
|
+
pre_existing:
|
|
49
|
+
disproved:
|
|
50
|
+
unverified:
|
|
51
|
+
reviewer_downgraded:
|
|
52
|
+
critic_upheld:
|
|
53
|
+
critic_refined:
|
|
54
|
+
critic_downgraded:
|
|
55
|
+
critic_overturned:
|
|
56
|
+
|
|
57
|
+
Enhancement Outcomes:
|
|
58
|
+
candidates_generated:
|
|
59
|
+
upheld_high_value:
|
|
60
|
+
upheld_medium_value:
|
|
61
|
+
refined:
|
|
62
|
+
merged:
|
|
63
|
+
downgraded:
|
|
64
|
+
rejected:
|
|
65
|
+
unverified:
|
|
66
|
+
|
|
67
|
+
Claim Ledger:
|
|
68
|
+
supported:
|
|
69
|
+
partially_supported:
|
|
70
|
+
unsupported:
|
|
71
|
+
contradicted:
|
|
72
|
+
stealth_change:
|
|
73
|
+
unverified:
|
|
74
|
+
|
|
75
|
+
Coverage Closure:
|
|
76
|
+
total_coverage_units:
|
|
77
|
+
reviewed:
|
|
78
|
+
not_applicable:
|
|
79
|
+
skipped_with_reason:
|
|
80
|
+
blocked:
|
|
81
|
+
unreviewed: 0
|
|
82
|
+
```
|
|
83
|
+
|
|
84
|
+
## Critical and High Confirmed Defect Findings
|
|
85
|
+
|
|
86
|
+
[Full details. Do not include PRE_EXISTING here.]
|
|
87
|
+
|
|
88
|
+
## High-Severity Pre-Existing Findings
|
|
89
|
+
|
|
90
|
+
[Required if any CRITICAL/HIGH PRE_EXISTING findings exist.]
|
|
91
|
+
|
|
92
|
+
## Medium Defect Findings
|
|
93
|
+
|
|
94
|
+
[Full details or grouped details.]
|
|
95
|
+
|
|
96
|
+
## Low and Info Defect Findings
|
|
97
|
+
|
|
98
|
+
[Condensed but evidence-grounded.]
|
|
99
|
+
|
|
100
|
+
## Security, Privacy, LLM/MCP, and Supply Chain Notes
|
|
101
|
+
|
|
102
|
+
[Include only if selected or relevant.]
|
|
103
|
+
|
|
104
|
+
## Unsupported, Contradicted, or Partially Supported Claims
|
|
105
|
+
|
|
106
|
+
[Claim ledger outcomes.]
|
|
107
|
+
|
|
108
|
+
## AI Slop and Code Provenance Patterns
|
|
109
|
+
|
|
110
|
+
[Evidence-based patterns only. Never vibe-based.]
|
|
111
|
+
|
|
112
|
+
## Testing and Test Drift Findings
|
|
113
|
+
|
|
114
|
+
[Test-quality and drift results.]
|
|
115
|
+
|
|
116
|
+
## UI/UX and Accessibility Findings
|
|
117
|
+
|
|
118
|
+
[Include only if selected and UI exists.]
|
|
119
|
+
|
|
120
|
+
## Performance and Observability Findings
|
|
121
|
+
|
|
122
|
+
[Include only if selected.]
|
|
123
|
+
|
|
124
|
+
## Systemic Themes
|
|
125
|
+
|
|
126
|
+
[Themes synthesized from validated findings only.]
|
|
127
|
+
|
|
128
|
+
## Enhancement Opportunities
|
|
129
|
+
|
|
130
|
+
[Include only if selected.]
|
|
131
|
+
|
|
132
|
+
### Top 10 Highest-Impact Enhancements
|
|
133
|
+
|
|
134
|
+
[Top validated high-value opportunities, ranked by impact.]
|
|
135
|
+
|
|
136
|
+
### Full Enhancement Catalog
|
|
137
|
+
|
|
138
|
+
#### Architecture Enhancements (ARCH-*)
|
|
139
|
+
#### Code Quality Enhancements (QUAL-*)
|
|
140
|
+
#### Performance Enhancements (PERF-*)
|
|
141
|
+
#### Resilience and Observability Enhancements (RES-*)
|
|
142
|
+
#### Testing Enhancements (TEST-*)
|
|
143
|
+
#### UI/UX — Visual Hierarchy and Layout (UI-HIER-*)
|
|
144
|
+
#### UI/UX — Interaction Design and Feedback (UI-INT-*)
|
|
145
|
+
#### UI/UX — Accessibility and Inclusivity (UI-A11Y-*)
|
|
146
|
+
#### UI/UX — Typography and Visual Polish (UI-VIS-*)
|
|
147
|
+
#### UI/UX — Performance and Perceived Performance (UI-PERF-*)
|
|
148
|
+
#### UI/UX — Consistency and Design System Alignment (UI-CON-*)
|
|
149
|
+
|
|
150
|
+
### Implementation Roadmap
|
|
151
|
+
|
|
152
|
+
#### Phase 1 — Quick Wins
|
|
153
|
+
|
|
154
|
+
Low effort, high clarity. List by ID with one-line description.
|
|
155
|
+
|
|
156
|
+
#### Phase 2 — Meaningful Improvements
|
|
157
|
+
|
|
158
|
+
Medium effort, clear payoff. List by ID with dependencies noted.
|
|
159
|
+
|
|
160
|
+
#### Phase 3 — Architectural Investments
|
|
161
|
+
|
|
162
|
+
High effort, transformational impact. List by ID.
|
|
163
|
+
|
|
164
|
+
### Codebase Strengths
|
|
165
|
+
|
|
166
|
+
[Specific patterns worth preserving. Each strength must cite file and line range and include exact quote evidence.]
|
|
167
|
+
|
|
168
|
+
## Recommended Remediation Order
|
|
169
|
+
|
|
170
|
+
1. Security, supply-chain, data-loss, and broken shipped functionality.
|
|
171
|
+
2. Unsupported public claims and stealth behavior changes.
|
|
172
|
+
3. Trust-boundary and authorization defects.
|
|
173
|
+
4. Test gaps that allow confirmed defects to recur.
|
|
174
|
+
5. Performance and observability gaps affecting production diagnosis.
|
|
175
|
+
6. AI slop and provenance cleanup by repeated pattern.
|
|
176
|
+
7. Validated enhancement opportunities by dependency order.
|
|
177
|
+
|
|
178
|
+
## Coverage and Depth Notes
|
|
179
|
+
|
|
180
|
+
- Tracks not run:
|
|
181
|
+
- Areas inventoried but not deeply reviewed:
|
|
182
|
+
- Runtime validations not run and why:
|
|
183
|
+
- UNVERIFIED findings worth future attention:
|
|
184
|
+
- Files or generated artifacts intentionally excluded:
|
|
185
|
+
|
|
186
|
+
## Validation Notes
|
|
187
|
+
|
|
188
|
+
- candidates generated:
|
|
189
|
+
- reviewer confirmed:
|
|
190
|
+
- reviewer disproved:
|
|
191
|
+
- reviewer unverified:
|
|
192
|
+
- critic upheld/refined/downgraded/overturned:
|
|
193
|
+
- enhancements upheld/rejected:
|
|
194
|
+
- final critic verdict:
|
|
195
|
+
- coverage units: total / reviewed / not_applicable / skipped / blocked / unreviewed
|
|
196
|
+
- depth plan failures: none or list
|
|
197
|
+
- selected-track dilution detected: yes/no
|
|
198
|
+
|
|
199
|
+
## Per-Finding Format
|
|
200
|
+
|
|
201
|
+
### [SEVERITY] [Title]
|
|
202
|
+
|
|
203
|
+
Location: `path:line`
|
|
204
|
+
Track: [track]
|
|
205
|
+
Status: CONFIRMED | PRE_EXISTING
|
|
206
|
+
Confidence: HIGH | MEDIUM
|
|
207
|
+
Grounding: HIGH | MEDIUM
|
|
208
|
+
|
|
209
|
+
Evidence:
|
|
210
|
+
> [exact quote]
|
|
211
|
+
|
|
212
|
+
Problem:
|
|
213
|
+
[factual issue]
|
|
214
|
+
|
|
215
|
+
Impact:
|
|
216
|
+
[specific impact]
|
|
217
|
+
|
|
218
|
+
Validation:
|
|
219
|
+
[what reviewer checked, runtime command if any, critic outcome if high severity]
|
|
220
|
+
|
|
221
|
+
Recommended Fix:
|
|
222
|
+
[actionable remediation]
|
|
223
|
+
|
|
224
|
+
## Per-Enhancement Format
|
|
225
|
+
|
|
226
|
+
### [ENHANCEMENT-ID] [Title]
|
|
227
|
+
|
|
228
|
+
Location: `path:line`
|
|
229
|
+
Category: [category]
|
|
230
|
+
Value: High | Medium
|
|
231
|
+
Effort: S | M | L
|
|
232
|
+
Grounding: HIGH | MEDIUM
|
|
233
|
+
|
|
234
|
+
Current State:
|
|
235
|
+
> [exact quote]
|
|
236
|
+
|
|
237
|
+
Opportunity:
|
|
238
|
+
[specific improvement]
|
|
239
|
+
|
|
240
|
+
Expected Impact:
|
|
241
|
+
[what improves]
|
|
242
|
+
|
|
243
|
+
Validation:
|
|
244
|
+
[critic result and dependencies]
|
|
@@ -0,0 +1,25 @@
|
|
|
1
|
+
# Compatibility and Research Notes
|
|
2
|
+
|
|
3
|
+
This package targets the shared Agent Skills shape: a directory containing `SKILL.md`, plus optional `references/`, `assets/`, `scripts/`, and Codex-specific `agents/openai.yaml` metadata.
|
|
4
|
+
|
|
5
|
+
## Compatibility decisions
|
|
6
|
+
|
|
7
|
+
- Canonical opencode-swarm repo install path: `.opencode/skills/codebase-review-swarm/`.
|
|
8
|
+
- Claude Code repo adapter path: `.claude/skills/codebase-review-swarm/`.
|
|
9
|
+
- Codex repo adapter path: `.agents/skills/codebase-review-swarm/`.
|
|
10
|
+
- Portable OpenCode install paths for other repositories: `.opencode/skills/codebase-review-swarm/`, `.claude/skills/codebase-review-swarm/`, or `.agents/skills/codebase-review-swarm/`.
|
|
11
|
+
- Frontmatter is intentionally minimal and portable: `name`, `description`, `license`, `compatibility`, and `metadata`.
|
|
12
|
+
- Long operational content is progressively disclosed via `references/` and `assets/` rather than packed only into `SKILL.md`.
|
|
13
|
+
- The full v7 source is retained verbatim in `references/full-v7-source-prompt.md` for long checklists and provenance.
|
|
14
|
+
|
|
15
|
+
## Standards updates in v8.2
|
|
16
|
+
|
|
17
|
+
- OWASP ASVS: use 5.0.0 as the stable baseline. The source v7 prompt referenced 4.0.3 with v5.0 draft; this package supersedes that for current reviews.
|
|
18
|
+
- OWASP Top 10 for LLM Applications: use 2025 categories, including system prompt leakage and vector/embedding weaknesses.
|
|
19
|
+
- SLSA: use v1.2 terminology for provenance, build levels/tracks, and attestation expectations.
|
|
20
|
+
- UI accessibility: use WCAG 2.2 AA unless repository policy requires stricter.
|
|
21
|
+
- Observability: use OpenTelemetry traces, metrics, logs, and context propagation as the default model.
|
|
22
|
+
|
|
23
|
+
## Invocation policy
|
|
24
|
+
|
|
25
|
+
This review is heavy and can run many read-only commands. Codex-specific `agents/openai.yaml` sets `allow_implicit_invocation: false` to prefer explicit `$codebase-review-swarm` usage. Other hosts may still suggest it based on the `description`.
|