@fro.bot/systematic 2.0.1 → 2.0.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (57) hide show
  1. package/agents/design/figma-design-sync.md +1 -1
  2. package/agents/document-review/coherence-reviewer.md +40 -0
  3. package/agents/document-review/design-lens-reviewer.md +46 -0
  4. package/agents/document-review/feasibility-reviewer.md +42 -0
  5. package/agents/document-review/product-lens-reviewer.md +50 -0
  6. package/agents/document-review/scope-guardian-reviewer.md +54 -0
  7. package/agents/document-review/security-lens-reviewer.md +38 -0
  8. package/agents/research/best-practices-researcher.md +2 -1
  9. package/agents/research/git-history-analyzer.md +1 -1
  10. package/agents/research/repo-research-analyst.md +164 -9
  11. package/agents/review/api-contract-reviewer.md +49 -0
  12. package/agents/review/correctness-reviewer.md +49 -0
  13. package/agents/review/data-migrations-reviewer.md +53 -0
  14. package/agents/review/maintainability-reviewer.md +49 -0
  15. package/agents/review/pattern-recognition-specialist.md +2 -1
  16. package/agents/review/performance-reviewer.md +51 -0
  17. package/agents/review/reliability-reviewer.md +49 -0
  18. package/agents/review/schema-drift-detector.md +12 -10
  19. package/agents/review/security-reviewer.md +51 -0
  20. package/agents/review/testing-reviewer.md +48 -0
  21. package/agents/workflow/pr-comment-resolver.md +1 -1
  22. package/agents/workflow/spec-flow-analyzer.md +60 -89
  23. package/dist/index.js +3 -3
  24. package/package.json +1 -1
  25. package/skills/agent-browser/SKILL.md +69 -48
  26. package/skills/ce-brainstorm/SKILL.md +2 -1
  27. package/skills/ce-compound/SKILL.md +26 -1
  28. package/skills/ce-compound-refresh/SKILL.md +11 -1
  29. package/skills/ce-ideate/SKILL.md +2 -1
  30. package/skills/ce-plan/SKILL.md +424 -414
  31. package/skills/ce-review/SKILL.md +12 -13
  32. package/skills/ce-review-beta/SKILL.md +506 -0
  33. package/skills/ce-review-beta/references/diff-scope.md +31 -0
  34. package/skills/ce-review-beta/references/findings-schema.json +128 -0
  35. package/skills/ce-review-beta/references/persona-catalog.md +50 -0
  36. package/skills/ce-review-beta/references/review-output-template.md +115 -0
  37. package/skills/ce-review-beta/references/subagent-template.md +56 -0
  38. package/skills/ce-work/SKILL.md +14 -6
  39. package/skills/ce-work-beta/SKILL.md +14 -8
  40. package/skills/claude-permissions-optimizer/SKILL.md +15 -14
  41. package/skills/deepen-plan/SKILL.md +348 -483
  42. package/skills/document-review/SKILL.md +160 -52
  43. package/skills/feature-video/SKILL.md +209 -178
  44. package/skills/file-todos/SKILL.md +72 -94
  45. package/skills/frontend-design/SKILL.md +243 -27
  46. package/skills/git-worktree/SKILL.md +37 -28
  47. package/skills/lfg/SKILL.md +7 -7
  48. package/skills/reproduce-bug/SKILL.md +154 -60
  49. package/skills/resolve-pr-parallel/SKILL.md +19 -12
  50. package/skills/resolve-todo-parallel/SKILL.md +9 -6
  51. package/skills/setup/SKILL.md +33 -56
  52. package/skills/slfg/SKILL.md +5 -5
  53. package/skills/test-browser/SKILL.md +69 -145
  54. package/skills/test-xcode/SKILL.md +61 -183
  55. package/skills/triage/SKILL.md +10 -10
  56. package/skills/ce-plan-beta/SKILL.md +0 -571
  57. package/skills/deepen-plan-beta/SKILL.md +0 -323
@@ -1,323 +0,0 @@
1
- ---
2
- name: deepen-plan-beta
3
- description: '[BETA] Stress-test an existing implementation plan and selectively strengthen weak sections with targeted research. Use when a plan needs more confidence around decisions, sequencing, system-wide impact, risks, or verification. Best for Standard or Deep plans, or high-risk topics such as auth, payments, migrations, external APIs, and security. For structural or clarity improvements, prefer document-review instead.'
4
- argument-hint: '[path to plan file]'
5
- disable-model-invocation: true
6
- ---
7
-
8
- # Deepen Plan
9
-
10
- ## Introduction
11
-
12
- **Note: The current year is 2026.** Use this when searching for recent documentation and best practices.
13
-
14
- `ce:plan-beta` does the first planning pass. `deepen-plan-beta` is a second-pass confidence check.
15
-
16
- Use this skill when the plan already exists and the question is not "Is this document clear?" but rather "Is this plan grounded enough for the complexity and risk involved?"
17
-
18
- This skill does **not** turn plans into implementation scripts. It identifies weak sections, runs targeted research only for those sections, and strengthens the plan in place.
19
-
20
- `document-review` and `deepen-plan-beta` are different:
21
- - Use the `document-review` skill when the document needs clarity, simplification, completeness, or scope control
22
- - Use `deepen-plan-beta` when the document is structurally sound but still needs stronger rationale, sequencing, risk treatment, or system-wide thinking
23
-
24
- ## Interaction Method
25
-
26
- Use the platform's question tool when available. When asking the user a question, prefer the platform's blocking question tool if one exists (`question` in OpenCode, `request_user_input` in Codex, `ask_user` in Gemini). Otherwise, present numbered options in chat and wait for the user's reply before proceeding.
27
-
28
- Ask one question at a time. Prefer a concise single-select choice when natural options exist.
29
-
30
- ## Plan File
31
-
32
- <plan_path> #$ARGUMENTS </plan_path>
33
-
34
- If the plan path above is empty:
35
- 1. Check `docs/plans/` for recent files
36
- 2. Ask the user which plan to deepen using the platform's blocking question tool when available (see Interaction Method). Otherwise, present numbered options in chat and wait for the user's reply before proceeding
37
-
38
- Do not proceed until you have a valid plan file path.
39
-
40
- ## Core Principles
41
-
42
- 1. **Stress-test, do not inflate** - Deepening should increase justified confidence, not make the plan longer for its own sake.
43
- 2. **Selective depth only** - Focus on the weakest 2-5 sections rather than enriching everything.
44
- 3. **Preserve the planning boundary** - No implementation code, no git command choreography, no exact test command recipes.
45
- 4. **Use artifact-contained evidence** - Work from the written plan, its `Context & Research`, `Sources & References`, and its origin document when present.
46
- 5. **Respect product boundaries** - Do not invent new product requirements. If deepening reveals a product-level gap, surface it as an open question or route back to `ce:brainstorm`.
47
- 6. **Prioritize risk and cross-cutting impact** - The more dangerous or interconnected the work, the more valuable another planning pass becomes.
48
-
49
- ## Workflow
50
-
51
- ### Phase 0: Load the Plan and Decide Whether Deepening Is Warranted
52
-
53
- #### 0.1 Read the Plan and Supporting Inputs
54
-
55
- Read the plan file completely.
56
-
57
- If the plan frontmatter includes an `origin:` path:
58
- - Read the origin document too
59
- - Use it to check whether the plan still reflects the product intent, scope boundaries, and success criteria
60
-
61
- #### 0.2 Classify Plan Depth and Topic Risk
62
-
63
- Determine the plan depth from the document:
64
- - **Lightweight** - small, bounded, low ambiguity, usually 2-4 implementation units
65
- - **Standard** - moderate complexity, some technical decisions, usually 3-6 units
66
- - **Deep** - cross-cutting, high-risk, or strategically important work, usually 4-8 units or phased delivery
67
-
68
- Also build a risk profile. Treat these as high-risk signals:
69
- - Authentication, authorization, or security-sensitive behavior
70
- - Payments, billing, or financial flows
71
- - Data migrations, backfills, or persistent data changes
72
- - External APIs or third-party integrations
73
- - Privacy, compliance, or user data handling
74
- - Cross-interface parity or multi-surface behavior
75
- - Significant rollout, monitoring, or operational concerns
76
-
77
- #### 0.3 Decide Whether to Deepen
78
-
79
- Use this default:
80
- - **Lightweight** plans usually do not need deepening unless they are high-risk or the user explicitly requests it
81
- - **Standard** plans often benefit when one or more important sections still look thin
82
- - **Deep** or high-risk plans often benefit from a targeted second pass
83
-
84
- If the plan already appears sufficiently grounded:
85
- - Say so briefly
86
- - Recommend moving to `/ce:work` or the `document-review` skill
87
- - If the user explicitly asked to deepen anyway, continue with a light pass and deepen at most 1-2 sections
88
-
89
- ### Phase 1: Parse the Current `ce:plan-beta` Structure
90
-
91
- Map the plan into the current template. Look for these sections, or their nearest equivalents:
92
- - `Overview`
93
- - `Problem Frame`
94
- - `Requirements Trace`
95
- - `Scope Boundaries`
96
- - `Context & Research`
97
- - `Key Technical Decisions`
98
- - `Open Questions`
99
- - `Implementation Units`
100
- - `System-Wide Impact`
101
- - `Risks & Dependencies`
102
- - `Documentation / Operational Notes`
103
- - `Sources & References`
104
- - Optional deep-plan sections such as `Alternative Approaches Considered`, `Success Metrics`, `Phased Delivery`, `Risk Analysis & Mitigation`, and `Operational / Rollout Notes`
105
-
106
- If the plan was written manually or uses different headings:
107
- - Map sections by intent rather than exact heading names
108
- - If a section is structurally present but titled differently, treat it as the equivalent section
109
- - If the plan truly lacks a section, decide whether that absence is intentional for the plan depth or a confidence gap worth scoring
110
-
111
- Also collect:
112
- - Frontmatter, including existing `deepened:` date if present
113
- - Number of implementation units
114
- - Which files and test files are named
115
- - Which learnings, patterns, or external references are cited
116
- - Which sections appear omitted because they were unnecessary versus omitted because they are missing
117
-
118
- ### Phase 2: Score Confidence Gaps
119
-
120
- Use a checklist-first, risk-weighted scoring pass.
121
-
122
- For each section, compute:
123
- - **Trigger count** - number of checklist problems that apply
124
- - **Risk bonus** - add 1 if the topic is high-risk and this section is materially relevant to that risk
125
- - **Critical-section bonus** - add 1 for `Key Technical Decisions`, `Implementation Units`, `System-Wide Impact`, `Risks & Dependencies`, or `Open Questions` in `Standard` or `Deep` plans
126
-
127
- Treat a section as a candidate if:
128
- - it hits **2+ total points**, or
129
- - it hits **1+ point** in a high-risk domain and the section is materially important
130
-
131
- Choose only the top **2-5** sections by score. If the user explicitly asked to deepen a lightweight plan, cap at **1-2** sections unless the topic is high-risk.
132
-
133
- Example:
134
- - A `Key Technical Decisions` section with 1 checklist trigger and the critical-section bonus scores **2 points** and is a candidate
135
- - A `Risks & Dependencies` section with 1 checklist trigger in a high-risk migration plan also becomes a candidate because the risk bonus applies
136
-
137
- If the plan already has a `deepened:` date:
138
- - Prefer sections that have not yet been substantially strengthened, if their scores are comparable
139
- - Revisit an already-deepened section only when it still scores clearly higher than alternatives or the user explicitly asks for another pass on it
140
-
141
- #### 2.1 Section Checklists
142
-
143
- Use these triggers.
144
-
145
- **Requirements Trace**
146
- - Requirements are vague or disconnected from implementation units
147
- - Success criteria are missing or not reflected downstream
148
- - Units do not clearly advance the traced requirements
149
- - Origin requirements are not clearly carried forward
150
-
151
- **Context & Research / Sources & References**
152
- - Relevant repo patterns are named but never used in decisions or implementation units
153
- - Cited learnings or references do not materially shape the plan
154
- - High-risk work lacks appropriate external or internal grounding
155
- - Research is generic instead of tied to this repo or this plan
156
-
157
- **Key Technical Decisions**
158
- - A decision is stated without rationale
159
- - Rationale does not explain tradeoffs or rejected alternatives
160
- - The decision does not connect back to scope, requirements, or origin context
161
- - An obvious design fork exists but the plan never addresses why one path won
162
-
163
- **Open Questions**
164
- - Product blockers are hidden as assumptions
165
- - Planning-owned questions are incorrectly deferred to implementation
166
- - Resolved questions have no clear basis in repo context, research, or origin decisions
167
- - Deferred items are too vague to be useful later
168
-
169
- **Implementation Units**
170
- - Dependency order is unclear or likely wrong
171
- - File paths or test file paths are missing where they should be explicit
172
- - Units are too large, too vague, or broken into micro-steps
173
- - Approach notes are thin or do not name the pattern to follow
174
- - Test scenarios or verification outcomes are vague
175
-
176
- **System-Wide Impact**
177
- - Affected interfaces, callbacks, middleware, entry points, or parity surfaces are missing
178
- - Failure propagation is underexplored
179
- - State lifecycle, caching, or data integrity risks are absent where relevant
180
- - Integration coverage is weak for cross-layer work
181
-
182
- **Risks & Dependencies / Documentation / Operational Notes**
183
- - Risks are listed without mitigation
184
- - Rollout, monitoring, migration, or support implications are missing when warranted
185
- - External dependency assumptions are weak or unstated
186
- - Security, privacy, performance, or data risks are absent where they obviously apply
187
-
188
- Use the plan's own `Context & Research` and `Sources & References` as evidence. If those sections cite a pattern, learning, or risk that never affects decisions, implementation units, or verification, treat that as a confidence gap.
189
-
190
- ### Phase 3: Select Targeted Research Agents
191
-
192
- For each selected section, choose the smallest useful agent set. Do **not** run every agent. Use at most **1-3 agents per section** and usually no more than **8 agents total**.
193
-
194
- Use fully-qualified agent names inside Task calls.
195
-
196
- #### 3.1 Deterministic Section-to-Agent Mapping
197
-
198
- **Requirements Trace / Open Questions classification**
199
- - `systematic:workflow:spec-flow-analyzer` for missing user flows, edge cases, and handoff gaps
200
- - `systematic:research:repo-research-analyst` for repo-grounded patterns, conventions, and implementation reality checks
201
-
202
- **Context & Research / Sources & References gaps**
203
- - `systematic:research:learnings-researcher` for institutional knowledge and past solved problems
204
- - `systematic:research:framework-docs-researcher` for official framework or library behavior
205
- - `systematic:research:best-practices-researcher` for current external patterns and industry guidance
206
- - Add `systematic:research:git-history-analyzer` only when historical rationale or prior art is materially missing
207
-
208
- **Key Technical Decisions**
209
- - `systematic:review:architecture-strategist` for design integrity, boundaries, and architectural tradeoffs
210
- - Add `systematic:research:framework-docs-researcher` or `systematic:research:best-practices-researcher` when the decision needs external grounding beyond repo evidence
211
-
212
- **Implementation Units / Verification**
213
- - `systematic:research:repo-research-analyst` for concrete file targets, patterns to follow, and repo-specific sequencing clues
214
- - `systematic:review:pattern-recognition-specialist` for consistency, duplication risks, and alignment with existing patterns
215
- - Add `systematic:workflow:spec-flow-analyzer` when sequencing depends on user flow or handoff completeness
216
-
217
- **System-Wide Impact**
218
- - `systematic:review:architecture-strategist` for cross-boundary effects, interface surfaces, and architectural knock-on impact
219
- - Add the specific specialist that matches the risk:
220
- - `systematic:review:performance-oracle` for scalability, latency, throughput, and resource-risk analysis
221
- - `systematic:review:security-sentinel` for auth, validation, exploit surfaces, and security boundary review
222
- - `systematic:review:data-integrity-guardian` for migrations, persistent state safety, consistency, and data lifecycle risks
223
-
224
- **Risks & Dependencies / Operational Notes**
225
- - Use the specialist that matches the actual risk:
226
- - `systematic:review:security-sentinel` for security, auth, privacy, and exploit risk
227
- - `systematic:review:data-integrity-guardian` for persistent data safety, constraints, and transaction boundaries
228
- - `systematic:review:data-migration-expert` for migration realism, backfills, and production data transformation risk
229
- - `systematic:review:deployment-verification-agent` for rollout checklists, rollback planning, and launch verification
230
- - `systematic:review:performance-oracle` for capacity, latency, and scaling concerns
231
-
232
- #### 3.2 Agent Prompt Shape
233
-
234
- For each selected section, pass:
235
- - A short plan summary
236
- - The exact section text
237
- - Why the section was selected, including which checklist triggers fired
238
- - The plan depth and risk profile
239
- - A specific question to answer
240
-
241
- Instruct the agent to return:
242
- - findings that change planning quality
243
- - stronger rationale, sequencing, verification, risk treatment, or references
244
- - no implementation code
245
- - no shell commands
246
-
247
- ### Phase 4: Run Targeted Research and Review
248
-
249
- Launch the selected agents in parallel.
250
-
251
- Prefer local repo and institutional evidence first. Use external research only when the gap cannot be closed responsibly from repo context or already-cited sources.
252
-
253
- If a selected section can be improved by reading the origin document more carefully, do that before dispatching external agents.
254
-
255
- If agent outputs conflict:
256
- - Prefer repo-grounded and origin-grounded evidence over generic advice
257
- - Prefer official framework documentation over secondary best-practice summaries when the conflict is about library behavior
258
- - If a real tradeoff remains, record it explicitly in the plan rather than pretending the conflict does not exist
259
-
260
- ### Phase 5: Synthesize and Rewrite the Plan
261
-
262
- Strengthen only the selected sections. Keep the plan coherent and preserve its overall structure.
263
-
264
- Allowed changes:
265
- - Clarify or strengthen decision rationale
266
- - Tighten requirements trace or origin fidelity
267
- - Reorder or split implementation units when sequencing is weak
268
- - Add missing pattern references, file/test paths, or verification outcomes
269
- - Expand system-wide impact, risks, or rollout treatment where justified
270
- - Reclassify open questions between `Resolved During Planning` and `Deferred to Implementation` when evidence supports the change
271
- - Add an optional deep-plan section only when it materially improves execution quality
272
- - Add or update `deepened: YYYY-MM-DD` in frontmatter when the plan was substantively improved
273
-
274
- Do **not**:
275
- - Add fenced implementation code blocks unless the plan itself is about code shape as a design artifact
276
- - Add git commands, commit choreography, or exact test command recipes
277
- - Add generic `Research Insights` subsections everywhere
278
- - Rewrite the entire plan from scratch
279
- - Invent new product requirements, scope changes, or success criteria without surfacing them explicitly
280
-
281
- If research reveals a product-level ambiguity that should change behavior or scope:
282
- - Do not silently decide it here
283
- - Record it under `Open Questions`
284
- - Recommend `ce:brainstorm` if the gap is truly product-defining
285
-
286
- ### Phase 6: Final Checks and Write the File
287
-
288
- Before writing:
289
- - Confirm the plan is stronger in specific ways, not merely longer
290
- - Confirm the planning boundary is intact
291
- - Confirm the selected sections were actually the weakest ones
292
- - Confirm origin decisions were preserved when an origin document exists
293
- - Confirm the final plan still feels right-sized for its depth
294
-
295
- Update the plan file in place by default.
296
-
297
- If the user explicitly requests a separate file, append `-deepened` before `.md`, for example:
298
- - `docs/plans/2026-03-15-001-feat-example-plan-deepened.md`
299
-
300
- ## Post-Enhancement Options
301
-
302
- If substantive changes were made, present next steps using the platform's blocking question tool when available (see Interaction Method). Otherwise, present numbered options in chat and wait for the user's reply before proceeding.
303
-
304
- **Question:** "Plan deepened at `[plan_path]`. What would you like to do next?"
305
-
306
- **Options:**
307
- 1. **View diff** - Show what changed
308
- 2. **Run `document-review` skill** - Improve the updated plan through structured document review
309
- 3. **Start `ce:work` skill** - Begin implementing the plan
310
- 4. **Deepen specific sections further** - Run another targeted deepening pass on named sections
311
-
312
- Based on selection:
313
- - **View diff** -> Show the important additions and changed sections
314
- - **`document-review` skill** -> Load the `document-review` skill with the plan path
315
- - **Start `ce:work` skill** -> Call the `ce:work` skill with the plan path
316
- - **Deepen specific sections further** -> Ask which sections still feel weak and run another targeted pass only for those sections
317
-
318
- If no substantive changes were warranted:
319
- - Say that the plan already appears sufficiently grounded
320
- - Offer the `document-review` skill or `/ce:work` as the next step instead
321
-
322
- NEVER CODE! Research, challenge, and strengthen the plan.
323
-