@fro.bot/systematic 1.22.8 → 1.23.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (40) hide show
  1. package/package.json +1 -1
  2. package/skills/agent-browser/SKILL.md +511 -170
  3. package/skills/agent-browser/references/authentication.md +303 -0
  4. package/skills/agent-browser/references/commands.md +266 -0
  5. package/skills/agent-browser/references/profiling.md +120 -0
  6. package/skills/agent-browser/references/proxy-support.md +194 -0
  7. package/skills/agent-browser/references/session-management.md +193 -0
  8. package/skills/agent-browser/references/snapshot-refs.md +194 -0
  9. package/skills/agent-browser/references/video-recording.md +173 -0
  10. package/skills/agent-browser/templates/authenticated-session.sh +105 -0
  11. package/skills/agent-browser/templates/capture-workflow.sh +69 -0
  12. package/skills/agent-browser/templates/form-automation.sh +62 -0
  13. package/skills/agent-native-audit/SKILL.md +279 -0
  14. package/skills/ce-brainstorm/SKILL.md +146 -0
  15. package/skills/ce-compound/SKILL.md +317 -0
  16. package/skills/ce-plan/SKILL.md +642 -0
  17. package/skills/ce-review/SKILL.md +559 -0
  18. package/skills/ce-work/SKILL.md +471 -0
  19. package/skills/changelog/SKILL.md +139 -0
  20. package/skills/create-agent-skill/SKILL.md +10 -0
  21. package/skills/create-agent-skills/SKILL.md +3 -14
  22. package/skills/deepen-plan/SKILL.md +545 -0
  23. package/skills/deploy-docs/SKILL.md +113 -0
  24. package/skills/feature-video/SKILL.md +352 -0
  25. package/skills/generate_command/SKILL.md +163 -0
  26. package/skills/heal-skill/SKILL.md +148 -0
  27. package/skills/lfg/SKILL.md +34 -0
  28. package/skills/report-bug/SKILL.md +151 -0
  29. package/skills/reproduce-bug/SKILL.md +101 -0
  30. package/skills/resolve_parallel/SKILL.md +35 -0
  31. package/skills/resolve_todo_parallel/SKILL.md +38 -0
  32. package/skills/slfg/SKILL.md +33 -0
  33. package/skills/test-browser/SKILL.md +394 -0
  34. package/skills/test-xcode/SKILL.md +333 -0
  35. package/skills/triage/SKILL.md +312 -0
  36. package/skills/workflows-brainstorm/SKILL.md +11 -0
  37. package/skills/workflows-compound/SKILL.md +10 -0
  38. package/skills/workflows-plan/SKILL.md +10 -0
  39. package/skills/workflows-review/SKILL.md +10 -0
  40. package/skills/workflows-work/SKILL.md +10 -0
@@ -0,0 +1,559 @@
1
+ ---
2
+ name: ce-review
3
+ description: Perform exhaustive code reviews using multi-agent analysis, ultra-thinking, and worktrees
4
+ argument-hint: '[PR number, GitHub URL, branch name, or latest] [--serial]'
5
+ ---
6
+
7
+ # Review Command
8
+
9
+ <command_purpose> Perform exhaustive code reviews using multi-agent analysis, ultra-thinking, and Git worktrees for deep local inspection. </command_purpose>
10
+
11
+ ## Introduction
12
+
13
+ <role>Senior Code Review Architect with expertise in security, performance, architecture, and quality assurance</role>
14
+
15
+ ## Prerequisites
16
+
17
+ <requirements>
18
+ - Git repository with GitHub CLI (`gh`) installed and authenticated
19
+ - Clean main/master branch
20
+ - Proper permissions to create worktrees and access the repository
21
+ - For document reviews: Path to a markdown file or document
22
+ </requirements>
23
+
24
+ ## Main Tasks
25
+
26
+ ### 1. Determine Review Target & Setup (ALWAYS FIRST)
27
+
28
+ <review_target> #$ARGUMENTS </review_target>
29
+
30
+ <thinking>
31
+ First, I need to determine the review target type and set up the code for analysis.
32
+ </thinking>
33
+
34
+ #### Immediate Actions:
35
+
36
+ <task_list>
37
+
38
+ - [ ] Determine review type: PR number (numeric), GitHub URL, file path (.md), or empty (current branch)
39
+ - [ ] Check current git branch
40
+ - [ ] If ALREADY on the target branch (PR branch, requested branch name, or the branch already checked out for review) → proceed with analysis on current branch
41
+ - [ ] If DIFFERENT branch than the review target → offer to use worktree: "Use git-worktree skill for isolated Call `skill: git-worktree` with branch name"
42
+ - [ ] Fetch PR metadata using `gh pr view --json` for title, body, files, linked issues
43
+ - [ ] Set up language-specific analysis tools
44
+ - [ ] Prepare security scanning environment
45
+ - [ ] Make sure we are on the branch we are reviewing. Use gh pr checkout to switch to the branch or manually checkout the branch.
46
+
47
+ Ensure that the code is ready for analysis (either in worktree or on current branch). ONLY then proceed to the next step.
48
+
49
+ </task_list>
50
+
51
+ #### Protected Artifacts
52
+
53
+ <protected_artifacts>
54
+ The following paths are compound-engineering pipeline artifacts and must never be flagged for deletion, removal, or gitignore by any review agent:
55
+
56
+ - `docs/plans/*.md` — Plan files created by `/ce:plan`. These are living documents that track implementation progress (checkboxes are checked off by `/ce:work`).
57
+ - `docs/solutions/*.md` — Solution documents created during the pipeline.
58
+
59
+ If a review agent flags any file in these directories for cleanup or removal, discard that finding during synthesis. Do not create a todo for it.
60
+ </protected_artifacts>
61
+
62
+ #### Load Review Agents
63
+
64
+ Read `compound-engineering.local.md` in the project root. If found, use `review_agents` from YAML frontmatter. If the markdown body contains review context, pass it to each agent as additional instructions.
65
+
66
+ If no settings file exists, invoke the `setup` skill to create one. Then read the newly created file and continue.
67
+
68
+ #### Choose Execution Mode
69
+
70
+ <execution_mode>
71
+
72
+ Before launching review agents, check for context constraints:
73
+
74
+ **If `--serial` flag is passed OR conversation is in a long session:**
75
+
76
+ Run agents ONE AT A TIME in sequence. Wait for each agent to complete before starting the next. This uses less context but takes longer.
77
+
78
+ **Default (parallel):**
79
+
80
+ Run all agents simultaneously for speed. If you hit context limits, retry with `--serial` flag.
81
+
82
+ **Auto-detect:** If more than 5 review agents are configured, automatically switch to serial mode and inform the user:
83
+ "Running review agents in serial mode (6+ agents configured). Use --parallel to override."
84
+
85
+ </execution_mode>
86
+
87
+ #### Parallel Agents to review the PR:
88
+
89
+ <parallel_tasks>
90
+
91
+ **Parallel mode (default for ≤5 agents):**
92
+
93
+ Run all configured review agents in parallel using task tool. For each agent in the `review_agents` list:
94
+
95
+ ```
96
+ Task {agent-name}(PR content + review context from settings body)
97
+ ```
98
+
99
+ **Serial mode (--serial flag, or auto for 6+ agents):**
100
+
101
+ Run configured review agents ONE AT A TIME. For each agent in the `review_agents` list, wait for it to complete before starting the next:
102
+
103
+ ```
104
+ For each agent in review_agents:
105
+ 1. Task {agent-name}(PR content + review context)
106
+ 2. Wait for completion
107
+ 3. Collect findings
108
+ 4. Proceed to next agent
109
+ ```
110
+
111
+ Always run these last regardless of mode:
112
+ - task systematic:review:agent-native-reviewer(PR content) - Verify new features are agent-accessible
113
+ - task systematic:research:learnings-researcher(PR content) - Search docs/solutions/ for past issues related to this PR's modules and patterns
114
+
115
+ </parallel_tasks>
116
+
117
+ #### Conditional Agents (Run if applicable):
118
+
119
+ <conditional_agents>
120
+
121
+ These agents are run ONLY when the PR matches specific criteria. Check the PR files list to determine if they apply:
122
+
123
+ **MIGRATIONS: If PR contains database migrations, schema.rb, or data backfills:**
124
+
125
+ - task systematic:review:schema-drift-detector(PR content) - Detects unrelated schema.rb changes by cross-referencing against included migrations (run FIRST)
126
+ - task systematic:review:data-migration-expert(PR content) - Validates ID mappings match production, checks for swapped values, verifies rollback safety
127
+ - task systematic:review:deployment-verification-agent(PR content) - Creates Go/No-Go deployment checklist with SQL verification queries
128
+
129
+ **When to run:**
130
+ - PR includes files matching `db/migrate/*.rb` or `db/schema.rb`
131
+ - PR modifies columns that store IDs, enums, or mappings
132
+ - PR includes data backfill scripts or rake tasks
133
+ - PR title/body mentions: migration, backfill, data transformation, ID mapping
134
+
135
+ **What these agents check:**
136
+ - `schema-drift-detector`: Cross-references schema.rb changes against PR migrations to catch unrelated columns/indexes from local database state
137
+ - `data-migration-expert`: Verifies hard-coded mappings match production reality (prevents swapped IDs), checks for orphaned associations, validates dual-write patterns
138
+ - `deployment-verification-agent`: Produces executable pre/post-deploy checklists with SQL queries, rollback procedures, and monitoring plans
139
+
140
+ </conditional_agents>
141
+
142
+ ### 2. Ultra-Thinking Deep Dive Phases
143
+
144
+ <ultrathink_instruction> For each phase below, spend maximum cognitive effort. Think step by step. Consider all angles. Question assumptions. And bring all reviews in a synthesis to the user.</ultrathink_instruction>
145
+
146
+ <deliverable>
147
+ Complete system context map with component interactions
148
+ </deliverable>
149
+
150
+ #### Phase 1: Stakeholder Perspective Analysis
151
+
152
+ <thinking_prompt> ULTRA-THINK: Put yourself in each stakeholder's shoes. What matters to them? What are their pain points? </thinking_prompt>
153
+
154
+ <stakeholder_perspectives>
155
+
156
+ 1. **Developer Perspective** <questions>
157
+
158
+ - How easy is this to understand and modify?
159
+ - Are the APIs intuitive?
160
+ - Is debugging straightforward?
161
+ - Can I test this easily? </questions>
162
+
163
+ 2. **Operations Perspective** <questions>
164
+
165
+ - How do I deploy this safely?
166
+ - What metrics and logs are available?
167
+ - How do I troubleshoot issues?
168
+ - What are the resource requirements? </questions>
169
+
170
+ 3. **End User Perspective** <questions>
171
+
172
+ - Is the feature intuitive?
173
+ - Are error messages helpful?
174
+ - Is performance acceptable?
175
+ - Does it solve my problem? </questions>
176
+
177
+ 4. **Security Team Perspective** <questions>
178
+
179
+ - What's the attack surface?
180
+ - Are there compliance requirements?
181
+ - How is data protected?
182
+ - What are the audit capabilities? </questions>
183
+
184
+ 5. **Business Perspective** <questions>
185
+ - What's the ROI?
186
+ - Are there legal/compliance risks?
187
+ - How does this affect time-to-market?
188
+ - What's the total cost of ownership? </questions> </stakeholder_perspectives>
189
+
190
+ #### Phase 2: Scenario Exploration
191
+
192
+ <thinking_prompt> ULTRA-THINK: Explore edge cases and failure scenarios. What could go wrong? How does the system behave under stress? </thinking_prompt>
193
+
194
+ <scenario_checklist>
195
+
196
+ - [ ] **Happy Path**: Normal operation with valid inputs
197
+ - [ ] **Invalid Inputs**: Null, empty, malformed data
198
+ - [ ] **Boundary Conditions**: Min/max values, empty collections
199
+ - [ ] **Concurrent Access**: Race conditions, deadlocks
200
+ - [ ] **Scale Testing**: 10x, 100x, 1000x normal load
201
+ - [ ] **Network Issues**: Timeouts, partial failures
202
+ - [ ] **Resource Exhaustion**: Memory, disk, connections
203
+ - [ ] **Security Attacks**: Injection, overflow, DoS
204
+ - [ ] **Data Corruption**: Partial writes, inconsistency
205
+ - [ ] **Cascading Failures**: Downstream service issues </scenario_checklist>
206
+
207
+ ### 3. Multi-Angle Review Perspectives
208
+
209
+ #### Technical Excellence Angle
210
+
211
+ - Code craftsmanship evaluation
212
+ - Engineering best practices
213
+ - Technical documentation quality
214
+ - Tooling and automation assessment
215
+
216
+ #### Business Value Angle
217
+
218
+ - Feature completeness validation
219
+ - Performance impact on users
220
+ - Cost-benefit analysis
221
+ - Time-to-market considerations
222
+
223
+ #### Risk Management Angle
224
+
225
+ - Security risk assessment
226
+ - Operational risk evaluation
227
+ - Compliance risk verification
228
+ - Technical debt accumulation
229
+
230
+ #### Team Dynamics Angle
231
+
232
+ - Code review etiquette
233
+ - Knowledge sharing effectiveness
234
+ - Collaboration patterns
235
+ - Mentoring opportunities
236
+
237
+ ### 4. Simplification and Minimalism Review
238
+
239
+ Run the task systematic:review:code-simplicity-reviewer() to see if we can simplify the code.
240
+
241
+ ### 5. Findings Synthesis and Todo Creation Using file-todos Skill
242
+
243
+ <critical_requirement> ALL findings MUST be stored in the todos/ directory using the file-todos skill. Create todo files immediately after synthesis - do NOT present findings for user approval first. Use the skill for structured todo management. </critical_requirement>
244
+
245
+ #### Step 1: Synthesize All Findings
246
+
247
+ <thinking>
248
+ Consolidate all agent reports into a categorized list of findings.
249
+ Remove duplicates, prioritize by severity and impact.
250
+ </thinking>
251
+
252
+ <synthesis_tasks>
253
+
254
+ - [ ] Collect findings from all parallel agents
255
+ - [ ] Surface learnings-researcher results: if past solutions are relevant, flag them as "Known Pattern" with links to docs/solutions/ files
256
+ - [ ] Discard any findings that recommend deleting or gitignoring files in `docs/plans/` or `docs/solutions/` (see Protected Artifacts above)
257
+ - [ ] Categorize by type: security, performance, architecture, quality, etc.
258
+ - [ ] Assign severity levels: 🔴 CRITICAL (P1), 🟡 IMPORTANT (P2), 🔵 NICE-TO-HAVE (P3)
259
+ - [ ] Remove duplicate or overlapping findings
260
+ - [ ] Estimate effort for each finding (Small/Medium/Large)
261
+
262
+ </synthesis_tasks>
263
+
264
+ #### Step 2: Create Todo Files Using file-todos Skill
265
+
266
+ <critical_instruction> Use the file-todos skill to create todo files for ALL findings immediately. Do NOT present findings one-by-one asking for user approval. Create all todo files in parallel using the skill, then summarize results to user. </critical_instruction>
267
+
268
+ **Implementation Options:**
269
+
270
+ **Option A: Direct File Creation (Fast)**
271
+
272
+ - Create todo files directly using write tool
273
+ - All findings in parallel for speed
274
+ - Use standard template from `.opencode/skills/file-todos/assets/todo-template.md`
275
+ - Follow naming convention: `{issue_id}-pending-{priority}-{description}.md`
276
+
277
+ **Option B: Sub-Agents in Parallel (Recommended for Scale)** For large PRs with 15+ findings, use sub-agents to create finding files in parallel:
278
+
279
+ ```bash
280
+ # Launch multiple finding-creator agents in parallel
281
+ task({) - Create todos for first finding
282
+ task({) - Create todos for second finding
283
+ task({) - Create todos for third finding
284
+ etc. for each finding.
285
+ ```
286
+
287
+ Sub-agents can:
288
+
289
+ - Process multiple findings simultaneously
290
+ - Write detailed todo files with all sections filled
291
+ - Organize findings by severity
292
+ - Create comprehensive Proposed Solutions
293
+ - Add acceptance criteria and work logs
294
+ - Complete much faster than sequential processing
295
+
296
+ **Execution Strategy:**
297
+
298
+ 1. Synthesize all findings into categories (P1/P2/P3)
299
+ 2. Group findings by severity
300
+ 3. Launch 3 parallel sub-agents (one per severity level)
301
+ 4. Each sub-agent creates its batch of todos using the file-todos skill
302
+ 5. Consolidate results and present summary
303
+
304
+ **Process (Using file-todos Skill):**
305
+
306
+ 1. For each finding:
307
+
308
+ - Determine severity (P1/P2/P3)
309
+ - Write detailed Problem Statement and Findings
310
+ - Create 2-3 Proposed Solutions with pros/cons/effort/risk
311
+ - Estimate effort (Small/Medium/Large)
312
+ - Add acceptance criteria and work log
313
+
314
+ 2. Use file-todos skill for structured todo management:
315
+
316
+ ```bash
317
+ skill: file-todos
318
+ ```
319
+
320
+ The skill provides:
321
+
322
+ - Template location: `.opencode/skills/file-todos/assets/todo-template.md`
323
+ - Naming convention: `{issue_id}-{status}-{priority}-{description}.md`
324
+ - YAML frontmatter structure: status, priority, issue_id, tags, dependencies
325
+ - All required sections: Problem Statement, Findings, Solutions, etc.
326
+
327
+ 3. Create todo files in parallel:
328
+
329
+ ```bash
330
+ {next_id}-pending-{priority}-{description}.md
331
+ ```
332
+
333
+ 4. Examples:
334
+
335
+ ```
336
+ 001-pending-p1-path-traversal-vulnerability.md
337
+ 002-pending-p1-api-response-validation.md
338
+ 003-pending-p2-concurrency-limit.md
339
+ 004-pending-p3-unused-parameter.md
340
+ ```
341
+
342
+ 5. Follow template structure from file-todos skill: `.opencode/skills/file-todos/assets/todo-template.md`
343
+
344
+ **Todo File Structure (from template):**
345
+
346
+ Each todo must include:
347
+
348
+ - **YAML frontmatter**: status, priority, issue_id, tags, dependencies
349
+ - **Problem Statement**: What's broken/missing, why it matters
350
+ - **Findings**: Discoveries from agents with evidence/location
351
+ - **Proposed Solutions**: 2-3 options, each with pros/cons/effort/risk
352
+ - **Recommended Action**: (Filled during triage, leave blank initially)
353
+ - **Technical Details**: Affected files, components, database changes
354
+ - **Acceptance Criteria**: Testable checklist items
355
+ - **Work Log**: Dated record with actions and learnings
356
+ - **Resources**: Links to PR, issues, documentation, similar patterns
357
+
358
+ **File naming convention:**
359
+
360
+ ```
361
+ {issue_id}-{status}-{priority}-{description}.md
362
+
363
+ Examples:
364
+ - 001-pending-p1-security-vulnerability.md
365
+ - 002-pending-p2-performance-optimization.md
366
+ - 003-pending-p3-code-cleanup.md
367
+ ```
368
+
369
+ **Status values:**
370
+
371
+ - `pending` - New findings, needs triage/decision
372
+ - `ready` - Approved by manager, ready to work
373
+ - `complete` - Work finished
374
+
375
+ **Priority values:**
376
+
377
+ - `p1` - Critical (blocks merge, security/data issues)
378
+ - `p2` - Important (should fix, architectural/performance)
379
+ - `p3` - Nice-to-have (enhancements, cleanup)
380
+
381
+ **Tagging:** Always add `code-review` tag, plus: `security`, `performance`, `architecture`, `rails`, `quality`, etc.
382
+
383
+ #### Step 3: Summary Report
384
+
385
+ After creating all todo files, present comprehensive summary:
386
+
387
+ ````markdown
388
+ ## ✅ Code Review Complete
389
+
390
+ **Review Target:** PR #XXXX - [PR Title] **Branch:** [branch-name]
391
+
392
+ ### Findings Summary:
393
+
394
+ - **Total Findings:** [X]
395
+ - **🔴 CRITICAL (P1):** [count] - BLOCKS MERGE
396
+ - **🟡 IMPORTANT (P2):** [count] - Should Fix
397
+ - **🔵 NICE-TO-HAVE (P3):** [count] - Enhancements
398
+
399
+ ### Created Todo Files:
400
+
401
+ **P1 - Critical (BLOCKS MERGE):**
402
+
403
+ - `001-pending-p1-{finding}.md` - {description}
404
+ - `002-pending-p1-{finding}.md` - {description}
405
+
406
+ **P2 - Important:**
407
+
408
+ - `003-pending-p2-{finding}.md` - {description}
409
+ - `004-pending-p2-{finding}.md` - {description}
410
+
411
+ **P3 - Nice-to-Have:**
412
+
413
+ - `005-pending-p3-{finding}.md` - {description}
414
+
415
+ ### Review Agents Used:
416
+
417
+ - kieran-rails-reviewer
418
+ - security-sentinel
419
+ - performance-oracle
420
+ - architecture-strategist
421
+ - agent-native-reviewer
422
+ - [other agents]
423
+
424
+ ### Next Steps:
425
+
426
+ 1. **Address P1 Findings**: CRITICAL - must be fixed before merge
427
+
428
+ - Review each P1 todo in detail
429
+ - Implement fixes or request exemption
430
+ - Verify fixes before merging PR
431
+
432
+ 2. **Triage All Todos**:
433
+ ```bash
434
+ ls todos/*-pending-*.md # View all pending todos
435
+ /triage # Use slash command for interactive triage
436
+ ```
437
+
438
+ 3. **Work on Approved Todos**:
439
+
440
+ ```bash
441
+ /resolve_todo_parallel # Fix all approved items efficiently
442
+ ```
443
+
444
+ 4. **Track Progress**:
445
+ - Rename file when status changes: pending → ready → complete
446
+ - Update Work Log as you work
447
+ - Commit todos: `git add todos/ && git commit -m "refactor: add code review findings"`
448
+
449
+ ### Severity Breakdown:
450
+
451
+ **🔴 P1 (Critical - Blocks Merge):**
452
+
453
+ - Security vulnerabilities
454
+ - Data corruption risks
455
+ - Breaking changes
456
+ - Critical architectural issues
457
+
458
+ **🟡 P2 (Important - Should Fix):**
459
+
460
+ - Performance issues
461
+ - Significant architectural concerns
462
+ - Major code quality problems
463
+ - Reliability issues
464
+
465
+ **🔵 P3 (Nice-to-Have):**
466
+
467
+ - Minor improvements
468
+ - Code cleanup
469
+ - Optimization opportunities
470
+ - Documentation updates
471
+ ````
472
+
473
+ ### 6. End-to-End Testing (Optional)
474
+
475
+ <detect_project_type>
476
+
477
+ **First, detect the project type from PR files:**
478
+
479
+ | Indicator | Project Type |
480
+ |-----------|--------------|
481
+ | `*.xcodeproj`, `*.xcworkspace`, `Package.swift` (iOS) | iOS/macOS |
482
+ | `Gemfile`, `package.json`, `app/views/*`, `*.html.*` | Web |
483
+ | Both iOS files AND web files | Hybrid (test both) |
484
+
485
+ </detect_project_type>
486
+
487
+ <offer_testing>
488
+
489
+ After presenting the Summary Report, offer appropriate testing based on project type:
490
+
491
+ **For Web Projects:**
492
+ ```markdown
493
+ **"Want to run browser tests on the affected pages?"**
494
+ 1. Yes - run `/test-browser`
495
+ 2. No - skip
496
+ ```
497
+
498
+ **For iOS Projects:**
499
+ ```markdown
500
+ **"Want to run Xcode simulator tests on the app?"**
501
+ 1. Yes - run `/xcode-test`
502
+ 2. No - skip
503
+ ```
504
+
505
+ **For Hybrid Projects (e.g., Rails + Hotwire Native):**
506
+ ```markdown
507
+ **"Want to run end-to-end tests?"**
508
+ 1. Web only - run `/test-browser`
509
+ 2. iOS only - run `/xcode-test`
510
+ 3. Both - run both commands
511
+ 4. No - skip
512
+ ```
513
+
514
+ </offer_testing>
515
+
516
+ #### If User Accepts Web Testing:
517
+
518
+ Spawn a subagent to run browser tests (preserves main context):
519
+
520
+ ```
521
+ Task general-purpose("Run /test-browser for PR #[number]. Test all affected pages, check for console errors, handle failures by creating todos and fixing.")
522
+ ```
523
+
524
+ The subagent will:
525
+ 1. Identify pages affected by the PR
526
+ 2. Navigate to each page and capture snapshots (using Playwright MCP or agent-browser CLI)
527
+ 3. Check for console errors
528
+ 4. Test critical interactions
529
+ 5. Pause for human verification on OAuth/email/payment flows
530
+ 6. Create P1 todos for any failures
531
+ 7. Fix and retry until all tests pass
532
+
533
+ **Standalone:** `/test-browser [PR number]`
534
+
535
+ #### If User Accepts iOS Testing:
536
+
537
+ Spawn a subagent to run Xcode tests (preserves main context):
538
+
539
+ ```
540
+ Task general-purpose("Run /xcode-test for scheme [name]. Build for simulator, install, launch, take screenshots, check for crashes.")
541
+ ```
542
+
543
+ The subagent will:
544
+ 1. Verify XcodeBuildMCP is installed
545
+ 2. Discover project and schemes
546
+ 3. Build for iOS Simulator
547
+ 4. Install and launch app
548
+ 5. Take screenshots of key screens
549
+ 6. Capture console logs for errors
550
+ 7. Pause for human verification (Sign in with Apple, push, IAP)
551
+ 8. Create P1 todos for any failures
552
+ 9. Fix and retry until all tests pass
553
+
554
+ **Standalone:** `/xcode-test [scheme]`
555
+
556
+ ### Important: P1 Findings Block Merge
557
+
558
+ Any **🔴 P1 (CRITICAL)** findings must be addressed before merging the PR. Present these prominently and ensure they're resolved before accepting the PR.
559
+