@fro.bot/systematic 1.0.0

Files changed (73)
  1. package/README.md +158 -0
  2. package/agents/research/framework-docs-researcher.md +19 -0
  3. package/agents/review/architecture-strategist.md +23 -0
  4. package/agents/review/code-simplicity-reviewer.md +30 -0
  5. package/agents/review/pattern-recognition-specialist.md +24 -0
  6. package/agents/review/performance-oracle.md +25 -0
  7. package/agents/review/security-sentinel.md +25 -0
  8. package/commands/agent-native-audit.md +277 -0
  9. package/commands/create-agent-skill.md +8 -0
  10. package/commands/deepen-plan.md +546 -0
  11. package/commands/lfg.md +19 -0
  12. package/commands/workflows/brainstorm.md +115 -0
  13. package/commands/workflows/compound.md +202 -0
  14. package/commands/workflows/plan.md +551 -0
  15. package/commands/workflows/review.md +514 -0
  16. package/commands/workflows/work.md +363 -0
  17. package/dist/cli.js +360 -0
  18. package/dist/index-v8dhd5s2.js +194 -0
  19. package/dist/index.js +297 -0
  20. package/package.json +69 -0
  21. package/skills/agent-browser/SKILL.md +223 -0
  22. package/skills/agent-native-architecture/SKILL.md +435 -0
  23. package/skills/agent-native-architecture/references/action-parity-discipline.md +409 -0
  24. package/skills/agent-native-architecture/references/agent-execution-patterns.md +467 -0
  25. package/skills/agent-native-architecture/references/agent-native-testing.md +582 -0
  26. package/skills/agent-native-architecture/references/architecture-patterns.md +478 -0
  27. package/skills/agent-native-architecture/references/dynamic-context-injection.md +338 -0
  28. package/skills/agent-native-architecture/references/files-universal-interface.md +301 -0
  29. package/skills/agent-native-architecture/references/from-primitives-to-domain-tools.md +359 -0
  30. package/skills/agent-native-architecture/references/mcp-tool-design.md +506 -0
  31. package/skills/agent-native-architecture/references/mobile-patterns.md +871 -0
  32. package/skills/agent-native-architecture/references/product-implications.md +443 -0
  33. package/skills/agent-native-architecture/references/refactoring-to-prompt-native.md +317 -0
  34. package/skills/agent-native-architecture/references/self-modification.md +269 -0
  35. package/skills/agent-native-architecture/references/shared-workspace-architecture.md +680 -0
  36. package/skills/agent-native-architecture/references/system-prompt-design.md +250 -0
  37. package/skills/brainstorming/SKILL.md +190 -0
  38. package/skills/compound-docs/SKILL.md +510 -0
  39. package/skills/compound-docs/assets/critical-pattern-template.md +34 -0
  40. package/skills/compound-docs/assets/resolution-template.md +93 -0
  41. package/skills/compound-docs/references/yaml-schema.md +65 -0
  42. package/skills/compound-docs/schema.yaml +176 -0
  43. package/skills/create-agent-skills/SKILL.md +299 -0
  44. package/skills/create-agent-skills/references/api-security.md +226 -0
  45. package/skills/create-agent-skills/references/be-clear-and-direct.md +531 -0
  46. package/skills/create-agent-skills/references/best-practices.md +404 -0
  47. package/skills/create-agent-skills/references/common-patterns.md +595 -0
  48. package/skills/create-agent-skills/references/core-principles.md +437 -0
  49. package/skills/create-agent-skills/references/executable-code.md +175 -0
  50. package/skills/create-agent-skills/references/iteration-and-testing.md +474 -0
  51. package/skills/create-agent-skills/references/official-spec.md +185 -0
  52. package/skills/create-agent-skills/references/recommended-structure.md +168 -0
  53. package/skills/create-agent-skills/references/skill-structure.md +372 -0
  54. package/skills/create-agent-skills/references/using-scripts.md +113 -0
  55. package/skills/create-agent-skills/references/using-templates.md +112 -0
  56. package/skills/create-agent-skills/references/workflows-and-validation.md +510 -0
  57. package/skills/create-agent-skills/templates/router-skill.md +73 -0
  58. package/skills/create-agent-skills/templates/simple-skill.md +33 -0
  59. package/skills/create-agent-skills/workflows/add-reference.md +96 -0
  60. package/skills/create-agent-skills/workflows/add-script.md +93 -0
  61. package/skills/create-agent-skills/workflows/add-template.md +74 -0
  62. package/skills/create-agent-skills/workflows/add-workflow.md +120 -0
  63. package/skills/create-agent-skills/workflows/audit-skill.md +138 -0
  64. package/skills/create-agent-skills/workflows/create-domain-expertise-skill.md +605 -0
  65. package/skills/create-agent-skills/workflows/create-new-skill.md +191 -0
  66. package/skills/create-agent-skills/workflows/get-guidance.md +121 -0
  67. package/skills/create-agent-skills/workflows/upgrade-to-router.md +161 -0
  68. package/skills/create-agent-skills/workflows/verify-skill.md +204 -0
  69. package/skills/file-todos/SKILL.md +251 -0
  70. package/skills/file-todos/assets/todo-template.md +155 -0
  71. package/skills/git-worktree/SKILL.md +302 -0
  72. package/skills/git-worktree/scripts/worktree-manager.sh +345 -0
  73. package/skills/using-systematic/SKILL.md +94 -0
@@ -0,0 +1,514 @@
1
+ ---
2
+ name: workflows:review
3
+ description: Perform exhaustive code reviews using multi-agent analysis, ultra-thinking, and worktrees
4
+ argument-hint: "[PR number, GitHub URL, branch name, or latest]"
5
+ ---
6
+
7
+ # Review Command
8
+
9
+ <command_purpose> Perform exhaustive code reviews using multi-agent analysis, ultra-thinking, and Git worktrees for deep local inspection. </command_purpose>
10
+
11
+ ## Introduction
12
+
13
+ <role>Senior Code Review Architect with expertise in security, performance, architecture, and quality assurance</role>
14
+
15
+ ## Prerequisites
16
+
17
+ <requirements>
18
+ - Git repository with GitHub CLI (`gh`) installed and authenticated
19
+ - Clean main/master branch
20
+ - Proper permissions to create worktrees and access the repository
21
+ - For document reviews: Path to a markdown file or document
22
+ </requirements>
23
+
24
+ ## Main Tasks
25
+
26
+ ### 1. Determine Review Target & Setup (ALWAYS FIRST)
27
+
28
+ <review_target> #$ARGUMENTS </review_target>
29
+
30
+ <thinking>
31
+ First, I need to determine the review target type and set up the code for analysis.
32
+ </thinking>
33
+
34
+ #### Immediate Actions:
35
+
36
+ <task_list>
37
+
38
+ - [ ] Determine review type: PR number (numeric), GitHub URL, file path (.md), or empty (current branch)
39
+ - [ ] Check current git branch
40
+ - [ ] If ALREADY on the target branch (PR branch, requested branch name, or the branch already checked out for review) → proceed with analysis on current branch
41
+ - [ ] If on a DIFFERENT branch than the review target → offer to use a worktree: "Use git-worktree skill for isolated review?" If accepted, call `skill: git-worktree` with the branch name
42
+ - [ ] Fetch PR metadata using `gh pr view --json` for title, body, files, linked issues
43
+ - [ ] Set up language-specific analysis tools
44
+ - [ ] Prepare security scanning environment
45
+ - [ ] Make sure we are on the branch under review. Use `gh pr checkout` to switch to the PR branch, or check the branch out manually.
46
+
47
+ Ensure that the code is ready for analysis (either in worktree or on current branch). ONLY then proceed to the next step.
48
+
49
+ </task_list>
50
+
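The first checklist item (deciding what kind of review target was passed) could be sketched in POSIX shell as follows. This is an illustration only: `classify_review_target` and `fetch_pr_metadata` are hypothetical helper names that do not appear in the package, and the `branch-name` fallback is an assumption taken from the command's argument hint.

```shell
# Hypothetical helper: classify the review argument into one of the
# target types named in the checklist above.
classify_review_target() {
  arg="${1:-}"
  case "$arg" in
    "") echo "current-branch" ;;          # empty argument: review the current branch
    *[!0-9]*)                             # contains at least one non-digit
      case "$arg" in
        https://github.com/*) echo "github-url" ;;
        *.md) echo "document" ;;
        *) echo "branch-name" ;;          # assumption: anything else is a branch name
      esac ;;
    *) echo "pr-number" ;;                # digits only
  esac
}

# Hypothetical wrapper around the metadata fetch from the checklist.
# Only title, body, and files are requested here; the JSON field for
# linked issues is deliberately left out rather than guessed.
fetch_pr_metadata() {
  gh pr view "$1" --json title,body,files
}
```

For example, `classify_review_target 123` prints `pr-number`, and calling it with an empty argument prints `current-branch`.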
51
+ #### Parallel Agents to review the PR:
52
+
53
+ <parallel_tasks>
54
+
55
+ Run ALL or most of these agents at the same time:
56
+
57
+ 1. Task kieran-rails-reviewer(PR content)
58
+ 2. Task dhh-rails-reviewer(PR title)
59
+ 3. If turbo is used: Task rails-turbo-expert(PR content)
60
+ 4. Task git-history-analyzer(PR content)
61
+ 5. Task dependency-detective(PR content)
62
+ 6. Task pattern-recognition-specialist(PR content)
63
+ 7. Task architecture-strategist(PR content)
64
+ 8. Task code-philosopher(PR content)
65
+ 9. Task security-sentinel(PR content)
66
+ 10. Task performance-oracle(PR content)
67
+ 11. Task devops-harmony-analyst(PR content)
68
+ 12. Task data-integrity-guardian(PR content)
69
+ 13. Task agent-native-reviewer(PR content) - Verify new features are agent-accessible
70
+
71
+ </parallel_tasks>
72
+
73
+ #### Conditional Agents (Run if applicable):
74
+
75
+ <conditional_agents>
76
+
77
+ These agents are run ONLY when the PR matches specific criteria. Check the PR files list to determine if they apply:
78
+
79
+ **If PR contains database migrations (db/migrate/*.rb files) or data backfills:**
80
+
81
+ 14. Task data-migration-expert(PR content) - Validates ID mappings match production, checks for swapped values, verifies rollback safety
82
+ 15. Task deployment-verification-agent(PR content) - Creates Go/No-Go deployment checklist with SQL verification queries
83
+
84
+ **When to run migration agents:**
85
+ - PR includes files matching `db/migrate/*.rb`
86
+ - PR modifies columns that store IDs, enums, or mappings
87
+ - PR includes data backfill scripts or rake tasks
88
+ - PR changes how data is read/written (e.g., changing from FK to string column)
89
+ - PR title/body mentions: migration, backfill, data transformation, ID mapping
90
+
91
+ **What these agents check:**
92
+ - `data-migration-expert`: Verifies hard-coded mappings match production reality (prevents swapped IDs), checks for orphaned associations, validates dual-write patterns
93
+ - `deployment-verification-agent`: Produces executable pre/post-deploy checklists with SQL queries, rollback procedures, and monitoring plans
94
+
95
+ </conditional_agents>
96
+
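A rough sketch of the "when to run migration agents" check, covering only two of the listed criteria (the `db/migrate/*.rb` file match and the title/body keywords). The function names are hypothetical, and the remaining criteria (column changes, read/write pattern changes) would need inspection of the diff itself.

```shell
# Succeeds if any path on stdin matches db/migrate/*.rb.
has_migration_files() {
  grep -Eq '^db/migrate/[^/]+\.rb$'
}

# Succeeds if the given text mentions one of the keywords listed above.
mentions_migration() {
  printf '%s\n' "$1" | grep -Eiq 'migration|backfill|data transformation|ID mapping'
}

# $1: newline-separated list of changed files
# $2: PR title and body as one string
# Prints "yes" if the conditional agents should run, otherwise "no".
should_run_migration_agents() {
  if printf '%s\n' "$1" | has_migration_files || mentions_migration "$2"; then
    echo "yes"
  else
    echo "no"
  fi
}
```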
97
+ ### 4. Ultra-Thinking Deep Dive Phases
98
+
99
+ <ultrathink_instruction> For each phase below, spend maximum cognitive effort. Think step by step. Consider all angles. Question assumptions. Then synthesize all reviews into a single report for the user.</ultrathink_instruction>
100
+
101
+ <deliverable>
102
+ Complete system context map with component interactions
103
+ </deliverable>
104
+
105
+ #### Phase 3: Stakeholder Perspective Analysis
106
+
107
+ <thinking_prompt> ULTRA-THINK: Put yourself in each stakeholder's shoes. What matters to them? What are their pain points? </thinking_prompt>
108
+
109
+ <stakeholder_perspectives>
110
+
111
+ 1. **Developer Perspective** <questions>
112
+
113
+ - How easy is this to understand and modify?
114
+ - Are the APIs intuitive?
115
+ - Is debugging straightforward?
116
+ - Can I test this easily? </questions>
117
+
118
+ 2. **Operations Perspective** <questions>
119
+
120
+ - How do I deploy this safely?
121
+ - What metrics and logs are available?
122
+ - How do I troubleshoot issues?
123
+ - What are the resource requirements? </questions>
124
+
125
+ 3. **End User Perspective** <questions>
126
+
127
+ - Is the feature intuitive?
128
+ - Are error messages helpful?
129
+ - Is performance acceptable?
130
+ - Does it solve my problem? </questions>
131
+
132
+ 4. **Security Team Perspective** <questions>
133
+
134
+ - What's the attack surface?
135
+ - Are there compliance requirements?
136
+ - How is data protected?
137
+ - What are the audit capabilities? </questions>
138
+
139
+ 5. **Business Perspective** <questions>
140
+ - What's the ROI?
141
+ - Are there legal/compliance risks?
142
+ - How does this affect time-to-market?
143
+ - What's the total cost of ownership? </questions> </stakeholder_perspectives>
144
+
145
+ #### Phase 4: Scenario Exploration
146
+
147
+ <thinking_prompt> ULTRA-THINK: Explore edge cases and failure scenarios. What could go wrong? How does the system behave under stress? </thinking_prompt>
148
+
149
+ <scenario_checklist>
150
+
151
+ - [ ] **Happy Path**: Normal operation with valid inputs
152
+ - [ ] **Invalid Inputs**: Null, empty, malformed data
153
+ - [ ] **Boundary Conditions**: Min/max values, empty collections
154
+ - [ ] **Concurrent Access**: Race conditions, deadlocks
155
+ - [ ] **Scale Testing**: 10x, 100x, 1000x normal load
156
+ - [ ] **Network Issues**: Timeouts, partial failures
157
+ - [ ] **Resource Exhaustion**: Memory, disk, connections
158
+ - [ ] **Security Attacks**: Injection, overflow, DoS
159
+ - [ ] **Data Corruption**: Partial writes, inconsistency
160
+ - [ ] **Cascading Failures**: Downstream service issues </scenario_checklist>
161
+
162
+ ### 6. Multi-Angle Review Perspectives
163
+
164
+ #### Technical Excellence Angle
165
+
166
+ - Code craftsmanship evaluation
167
+ - Engineering best practices
168
+ - Technical documentation quality
169
+ - Tooling and automation assessment
170
+
171
+ #### Business Value Angle
172
+
173
+ - Feature completeness validation
174
+ - Performance impact on users
175
+ - Cost-benefit analysis
176
+ - Time-to-market considerations
177
+
178
+ #### Risk Management Angle
179
+
180
+ - Security risk assessment
181
+ - Operational risk evaluation
182
+ - Compliance risk verification
183
+ - Technical debt accumulation
184
+
185
+ #### Team Dynamics Angle
186
+
187
+ - Code review etiquette
188
+ - Knowledge sharing effectiveness
189
+ - Collaboration patterns
190
+ - Mentoring opportunities
191
+
192
+ ### 4. Simplification and Minimalism Review
193
+
194
+ Run the Task code-simplicity-reviewer() to see if we can simplify the code.
195
+
196
+ ### 5. Findings Synthesis and Todo Creation Using file-todos Skill
197
+
198
+ <critical_requirement> ALL findings MUST be stored in the todos/ directory using the file-todos skill. Create todo files immediately after synthesis - do NOT present findings for user approval first. Use the skill for structured todo management. </critical_requirement>
199
+
200
+ #### Step 1: Synthesize All Findings
201
+
202
+ <thinking>
203
+ Consolidate all agent reports into a categorized list of findings.
204
+ Remove duplicates, prioritize by severity and impact.
205
+ </thinking>
206
+
207
+ <synthesis_tasks>
208
+
209
+ - [ ] Collect findings from all parallel agents
210
+ - [ ] Categorize by type: security, performance, architecture, quality, etc.
211
+ - [ ] Assign severity levels: 🔴 CRITICAL (P1), 🟡 IMPORTANT (P2), 🔵 NICE-TO-HAVE (P3)
212
+ - [ ] Remove duplicate or overlapping findings
213
+ - [ ] Estimate effort for each finding (Small/Medium/Large)
214
+
215
+ </synthesis_tasks>
216
+
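The deduplicate-and-prioritize steps above might look like this, assuming each finding has been reduced to a single `priority|category|description` line. That line format and both function names are invented for this sketch and are not part of the file-todos skill. Only exact duplicates are removed; overlapping findings still need human or agent judgment.

```shell
# stdin: one finding per line as "priority|category|description"
# stdout: exact duplicates removed, ordered p1, p2, p3
synthesize_findings() {
  # Whole-line comparison, so only identical lines collapse; the
  # priority prefix makes p1 sort before p2 and p3.
  LC_ALL=C sort -u
}

# $1: priority (p1, p2, or p3); stdin: synthesized findings
# Prints how many findings carry that priority.
count_by_priority() {
  grep -c "^$1|"
}
```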
217
+ #### Step 2: Create Todo Files Using file-todos Skill
218
+
219
+ <critical_instruction> Use the file-todos skill to create todo files for ALL findings immediately. Do NOT present findings one-by-one asking for user approval. Create all todo files in parallel using the skill, then summarize results to user. </critical_instruction>
220
+
221
+ **Implementation Options:**
222
+
223
+ **Option A: Direct File Creation (Fast)**
224
+
225
+ - Create todo files directly using Write tool
226
+ - All findings in parallel for speed
227
+ - Use standard template from `.claude/skills/file-todos/assets/todo-template.md`
228
+ - Follow naming convention: `{issue_id}-pending-{priority}-{description}.md`
229
+
230
+ **Option B: Sub-Agents in Parallel (Recommended for Scale)** For large PRs with 15+ findings, use sub-agents to create finding files in parallel:
231
+
232
+ ```bash
233
+ # Launch multiple finding-creator agents in parallel
234
+ Task() - Create todos for first finding
235
+ Task() - Create todos for second finding
236
+ Task() - Create todos for third finding
237
+ # ...and so on, one Task per remaining finding
238
+ ```
239
+
240
+ Sub-agents can:
241
+
242
+ - Process multiple findings simultaneously
243
+ - Write detailed todo files with all sections filled
244
+ - Organize findings by severity
245
+ - Create comprehensive Proposed Solutions
246
+ - Add acceptance criteria and work logs
247
+ - Complete much faster than sequential processing
248
+
249
+ **Execution Strategy:**
250
+
251
+ 1. Synthesize all findings into categories (P1/P2/P3)
252
+ 2. Group findings by severity
253
+ 3. Launch 3 parallel sub-agents (one per severity level)
254
+ 4. Each sub-agent creates its batch of todos using the file-todos skill
255
+ 5. Consolidate results and present summary
256
+
257
+ **Process (Using file-todos Skill):**
258
+
259
+ 1. For each finding:
260
+
261
+ - Determine severity (P1/P2/P3)
262
+ - Write detailed Problem Statement and Findings
263
+ - Create 2-3 Proposed Solutions with pros/cons/effort/risk
264
+ - Estimate effort (Small/Medium/Large)
265
+ - Add acceptance criteria and work log
266
+
267
+ 2. Use file-todos skill for structured todo management:
268
+
269
+ ```bash
270
+ skill: file-todos
271
+ ```
272
+
273
+ The skill provides:
274
+
275
+ - Template location: `.claude/skills/file-todos/assets/todo-template.md`
276
+ - Naming convention: `{issue_id}-{status}-{priority}-{description}.md`
277
+ - YAML frontmatter structure: status, priority, issue_id, tags, dependencies
278
+ - All required sections: Problem Statement, Findings, Solutions, etc.
279
+
280
+ 3. Create todo files in parallel:
281
+
282
+ ```bash
283
+ {next_id}-pending-{priority}-{description}.md
284
+ ```
285
+
286
+ 4. Examples:
287
+
288
+ ```
289
+ 001-pending-p1-path-traversal-vulnerability.md
290
+ 002-pending-p1-api-response-validation.md
291
+ 003-pending-p2-concurrency-limit.md
292
+ 004-pending-p3-unused-parameter.md
293
+ ```
294
+
295
+ 5. Follow template structure from file-todos skill: `.claude/skills/file-todos/assets/todo-template.md`
296
+
297
+ **Todo File Structure (from template):**
298
+
299
+ Each todo must include:
300
+
301
+ - **YAML frontmatter**: status, priority, issue_id, tags, dependencies
302
+ - **Problem Statement**: What's broken/missing, why it matters
303
+ - **Findings**: Discoveries from agents with evidence/location
304
+ - **Proposed Solutions**: 2-3 options, each with pros/cons/effort/risk
305
+ - **Recommended Action**: (Filled during triage, leave blank initially)
306
+ - **Technical Details**: Affected files, components, database changes
307
+ - **Acceptance Criteria**: Testable checklist items
308
+ - **Work Log**: Dated record with actions and learnings
309
+ - **Resources**: Links to PR, issues, documentation, similar patterns
310
+
311
+ **File naming convention:**
312
+
313
+ ```
314
+ {issue_id}-{status}-{priority}-{description}.md
315
+
316
+ Examples:
317
+ - 001-pending-p1-security-vulnerability.md
318
+ - 002-pending-p2-performance-optimization.md
319
+ - 003-pending-p3-code-cleanup.md
320
+ ```
321
+
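One way to build file names that follow this convention, as a minimal sketch. `todo_filename` is a hypothetical helper, and the slug rules (lowercase, non-alphanumeric runs collapsed to a single dash) are an assumption inferred from the examples rather than something the skill specifies.

```shell
# $1: numeric issue id (pass it without leading zeros, e.g. 1, not 001)
# $2: status, $3: priority, $4: free-text description
todo_filename() {
  # Lowercase the description, collapse every run of non-alphanumeric
  # characters to one dash, then trim a leading or trailing dash.
  slug="$(printf '%s' "$4" \
    | tr '[:upper:]' '[:lower:]' \
    | sed -e 's/[^a-z0-9]\{1,\}/-/g' -e 's/^-//' -e 's/-$//')"
  printf '%03d-%s-%s-%s.md\n' "$1" "$2" "$3" "$slug"
}
```

Calling `todo_filename 1 pending p1 "Path traversal vulnerability"` prints `001-pending-p1-path-traversal-vulnerability.md`.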
322
+ **Status values:**
323
+
324
+ - `pending` - New findings, needs triage/decision
325
+ - `ready` - Approved by manager, ready to work
326
+ - `complete` - Work finished
327
+
328
+ **Priority values:**
329
+
330
+ - `p1` - Critical (blocks merge, security/data issues)
331
+ - `p2` - Important (should fix, architectural/performance)
332
+ - `p3` - Nice-to-have (enhancements, cleanup)
333
+
334
+ **Tagging:** Always add the `code-review` tag, plus any that apply: `security`, `performance`, `architecture`, `rails`, `quality`, etc.
335
+
336
+ #### Step 3: Summary Report
337
+
338
+ After creating all todo files, present a comprehensive summary:
339
+
340
+ ````markdown
341
+ ## ✅ Code Review Complete
342
+
343
+ **Review Target:** PR #XXXX - [PR Title] **Branch:** [branch-name]
344
+
345
+ ### Findings Summary:
346
+
347
+ - **Total Findings:** [X]
348
+ - **🔴 CRITICAL (P1):** [count] - BLOCKS MERGE
349
+ - **🟡 IMPORTANT (P2):** [count] - Should Fix
350
+ - **🔵 NICE-TO-HAVE (P3):** [count] - Enhancements
351
+
352
+ ### Created Todo Files:
353
+
354
+ **P1 - Critical (BLOCKS MERGE):**
355
+
356
+ - `001-pending-p1-{finding}.md` - {description}
357
+ - `002-pending-p1-{finding}.md` - {description}
358
+
359
+ **P2 - Important:**
360
+
361
+ - `003-pending-p2-{finding}.md` - {description}
362
+ - `004-pending-p2-{finding}.md` - {description}
363
+
364
+ **P3 - Nice-to-Have:**
365
+
366
+ - `005-pending-p3-{finding}.md` - {description}
367
+
368
+ ### Review Agents Used:
369
+
370
+ - kieran-rails-reviewer
371
+ - security-sentinel
372
+ - performance-oracle
373
+ - architecture-strategist
374
+ - agent-native-reviewer
375
+ - [other agents]
376
+
377
+ ### Next Steps:
378
+
379
+ 1. **Address P1 Findings**: CRITICAL - must be fixed before merge
380
+
381
+ - Review each P1 todo in detail
382
+ - Implement fixes or request exemption
383
+ - Verify fixes before merging PR
384
+
385
+ 2. **Triage All Todos**:
386
+ ```bash
387
+ ls todos/*-pending-*.md # View all pending todos
388
+ /triage # Use slash command for interactive triage
389
+ ```
390
+ ````
391
+
392
+ 3. **Work on Approved Todos**:
393
+
394
+ ```bash
395
+ /resolve_todo_parallel # Fix all approved items efficiently
396
+ ```
397
+
398
+ 4. **Track Progress**:
399
+ - Rename file when status changes: pending → ready → complete
400
+ - Update Work Log as you work
401
+    - Commit todos: `git add todos/ && git commit -m "chore: add code review findings"`
402
+
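The rename on status change can be sketched as a pure string transformation. `renamed_todo` is a hypothetical helper, and it assumes file names follow the `{issue_id}-{status}-{priority}-{description}.md` convention described earlier.

```shell
# $1: current todo file name, $2: new status
# Prints the file name with its status field replaced.
renamed_todo() {
  case "$2" in
    pending|ready|complete) ;;
    *) echo "unknown status: $2" >&2; return 1 ;;
  esac
  # Replace the second dash-separated field, keeping the numeric id.
  printf '%s\n' "$1" | sed -e "s/^\([0-9]\{1,\}\)-[a-z]\{1,\}-/\1-$2-/"
}
```

With a file name in `$old`, the rename could then be applied as `git mv "todos/$old" "todos/$(renamed_todo "$old" ready)"`.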
403
+ ### Severity Breakdown:
404
+
405
+ **🔴 P1 (Critical - Blocks Merge):**
406
+
407
+ - Security vulnerabilities
408
+ - Data corruption risks
409
+ - Breaking changes
410
+ - Critical architectural issues
411
+
412
+ **🟡 P2 (Important - Should Fix):**
413
+
414
+ - Performance issues
415
+ - Significant architectural concerns
416
+ - Major code quality problems
417
+ - Reliability issues
418
+
419
+ **🔵 P3 (Nice-to-Have):**
420
+
421
+ - Minor improvements
422
+ - Code cleanup
423
+ - Optimization opportunities
424
+ - Documentation updates
425
+
426
+
427
+
428
+ ### 7. End-to-End Testing (Optional)
429
+
430
+ <detect_project_type>
431
+
432
+ **First, detect the project type from PR files:**
433
+
434
+ | Indicator | Project Type |
435
+ |-----------|--------------|
436
+ | `*.xcodeproj`, `*.xcworkspace`, `Package.swift` (iOS) | iOS/macOS |
437
+ | `Gemfile`, `package.json`, `app/views/*`, `*.html.*` | Web |
438
+ | Both iOS files AND web files | Hybrid (test both) |
439
+
440
+ </detect_project_type>
441
+
442
+ <offer_testing>
443
+
444
+ After presenting the Summary Report, offer appropriate testing based on project type:
445
+
446
+ **For Web Projects:**
447
+ ```markdown
448
+ **"Want to run browser tests on the affected pages?"**
449
+ 1. Yes - run `/test-browser`
450
+ 2. No - skip
451
+ ```
452
+
453
+ **For iOS Projects:**
454
+ ```markdown
455
+ **"Want to run Xcode simulator tests on the app?"**
456
+ 1. Yes - run `/xcode-test`
457
+ 2. No - skip
458
+ ```
459
+
460
+ **For Hybrid Projects (e.g., Rails + Hotwire Native):**
461
+ ```markdown
462
+ **"Want to run end-to-end tests?"**
463
+ 1. Web only - run `/test-browser`
464
+ 2. iOS only - run `/xcode-test`
465
+ 3. Both - run both commands
466
+ 4. No - skip
467
+ ```
468
+
469
+ </offer_testing>
470
+
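The project-type table could be approximated from the list of changed files as shown here. This is a sketch under assumptions: `detect_project_type` is a hypothetical name, the patterns are a loose reading of the indicators in the table, and the `unknown` result is an addition for file lists that match neither row.

```shell
# stdin: newline-separated changed paths
# Prints one of: ios, web, hybrid, unknown
detect_project_type() {
  files="$(cat)"
  ios=no
  web=no
  # iOS/macOS indicators: *.xcodeproj, *.xcworkspace, Package.swift
  if printf '%s\n' "$files" | grep -Eq '(\.xcodeproj|\.xcworkspace)(/|$)|(^|/)Package\.swift$'; then
    ios=yes
  fi
  # Web indicators: Gemfile, package.json, app/views/*, *.html.*
  if printf '%s\n' "$files" | grep -Eq '(^|/)(Gemfile|package\.json)$|(^|/)app/views/|\.html\.[^/]+$'; then
    web=yes
  fi
  case "$ios$web" in
    yesyes) echo "hybrid" ;;
    yesno) echo "ios" ;;
    noyes) echo "web" ;;
    *) echo "unknown" ;;
  esac
}
```

The result maps onto the three offers above: `web` to `/test-browser`, `ios` to `/xcode-test`, and `hybrid` to the combined prompt.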
471
+ #### If User Accepts Web Testing:
472
+
473
+ Spawn a subagent to run browser tests (preserves main context):
474
+
475
+ ```
476
+ Task general-purpose("Run /test-browser for PR #[number]. Test all affected pages, check for console errors, handle failures by creating todos and fixing.")
477
+ ```
478
+
479
+ The subagent will:
480
+ 1. Identify pages affected by the PR
481
+ 2. Navigate to each page and capture snapshots (using Playwright MCP or agent-browser CLI)
482
+ 3. Check for console errors
483
+ 4. Test critical interactions
484
+ 5. Pause for human verification on OAuth/email/payment flows
485
+ 6. Create P1 todos for any failures
486
+ 7. Fix and retry until all tests pass
487
+
488
+ **Standalone:** `/test-browser [PR number]`
489
+
490
+ #### If User Accepts iOS Testing:
491
+
492
+ Spawn a subagent to run Xcode tests (preserves main context):
493
+
494
+ ```
495
+ Task general-purpose("Run /xcode-test for scheme [name]. Build for simulator, install, launch, take screenshots, check for crashes.")
496
+ ```
497
+
498
+ The subagent will:
499
+ 1. Verify XcodeBuildMCP is installed
500
+ 2. Discover project and schemes
501
+ 3. Build for iOS Simulator
502
+ 4. Install and launch app
503
+ 5. Take screenshots of key screens
504
+ 6. Capture console logs for errors
505
+ 7. Pause for human verification (Sign in with Apple, push, IAP)
506
+ 8. Create P1 todos for any failures
507
+ 9. Fix and retry until all tests pass
508
+
509
+ **Standalone:** `/xcode-test [scheme]`
510
+
511
+ ### Important: P1 Findings Block Merge
512
+
513
+ Any **🔴 P1 (CRITICAL)** findings must be addressed before merging the PR. Present these prominently and ensure they're resolved before accepting the PR.
514
+