mobile-debug-mcp 0.26.0 → 0.26.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/docs/ROADMAP.md CHANGED
@@ -1,14 +1,26 @@
1
- # Mobile Debug MCP Prioritized Roadmap
1
+ # Mobile Debug MCP Roadmap
2
2
 
3
- ## Prioritization Criteria
3
+ ## Planning Principles
4
4
 
5
5
  Ordered by:
6
6
 
7
+
7
8
  1. Impact on agent reliability
8
9
  2. Reduction in retries / brittleness
9
10
  3. Breadth of app coverage improved
10
11
  4. Implementation complexity vs payoff
11
12
 
13
+ ## Capability Status Definitions
14
+
15
+ - **Completed**
16
+ Capability implemented and considered part of the baseline platform.
17
+
18
+ - **Spec Ready**
19
+ Capability design or RFC is mature and implementation-ready, but not yet delivered.
20
+
21
+ - **Planned**
22
+ Capability is prioritized on the roadmap, but detailed specification and/or implementation work remains ahead.
23
+
12
24
  ## Program-Level Success Metrics
13
25
  Track roadmap impact across releases using:
14
26
 
@@ -26,23 +38,35 @@ Higher task success with fewer retries.
26
38
 
27
39
  ---
28
40
 
29
- # Completed
41
+ # Roadmap Status Overview
42
+
43
+ ## Completed Capabilities
44
+
45
+ - Stronger State Verification — Complete (Foundational verification layer shipped)
46
+ - Richer Element Identity — Complete (Identity and selector confidence foundations shipped)
30
47
 
31
- These priorities are done and kept here for history:
48
+ ## Current Focus
32
49
 
33
- - Priority 1 Stronger State Verification
34
- - Priority 2 — Richer Element Identity
50
+ - Wait and Synchronization Reliability
51
+ - Actionability Resolution
52
+
53
+ ## Upcoming Work
35
54
 
36
- Completion notes:
55
+ - Adjustable Control Support
56
+ - Signal-Oriented Diagnostic Filtering
57
+ - Long Press Gesture
58
+ - Better Compose / Custom Control Semantics
37
59
 
38
- - State-aware verification is now implemented and wired through the tool surface.
39
- - Platform-native element metadata and selector-confidence hints are now part of the runtime contract.
60
+ ## Later Horizon
61
+
62
+ - Pinch to Zoom
63
+ - Action Trace Correlation
40
64
 
41
65
  ---
42
66
 
43
- # Priority 1 — Stronger State Verification
67
+ # Stronger State Verification
44
68
 
45
- ## Why first
69
+ ## Rationale
46
70
  Highest leverage improvement.
47
71
 
48
72
  **Status:** Completed
@@ -52,7 +76,7 @@ Most failures are not “can’t act,” they’re:
52
76
  - weak verification
53
77
  - retry loops caused by inference
54
78
 
55
- ## Deliver
79
+ ## Scope
56
80
  - Direct readable control values
57
81
  - Expanded `expect_*` verification
58
82
  - Move from inference to state introspection
@@ -60,7 +84,7 @@ Most failures are not “can’t act,” they’re:
60
84
  ## Expected Impact
61
85
  Very high.
62
86
 
63
- ## Done Criteria
87
+ ## Exit Criteria
64
88
  - Control state readable for core widgets (toggle, slider, input, dropdown)
65
89
  - New expect_* state verifiers implemented
66
90
  - Agents can verify state without visual inference in representative flows
@@ -73,15 +97,15 @@ Very high.
73
97
 
74
98
  ## Dependencies
75
99
  Blocks or strengthens:
76
- - Priority 5 — Better Compose / Custom Control Semantics
77
- - Priority 6 — Pinch to Zoom verification
78
- - Priority 7 — Action Trace Correlation
100
+ - Better Compose / Custom Control Semantics
101
+ - Pinch to Zoom
102
+ - Action Trace Correlation
79
103
 
80
104
  ---
81
105
 
82
- # Priority 2 — Richer Element Identity
106
+ # Richer Element Identity
83
107
 
84
- ## Why second
108
+ ## Rationale
85
109
  Directly reduces selector brittleness.
86
110
 
87
111
  **Status:** Completed
@@ -91,7 +115,7 @@ Improves:
91
115
  - repeatability
92
116
  - agent confidence
93
117
 
94
- ## Deliver
118
+ ## Scope
95
119
  - Stable IDs / test tags prioritization
96
120
  - Selector confidence metadata
97
121
  - Preferred selector hierarchy
@@ -99,7 +123,7 @@ Improves:
99
123
  ## Expected Impact
100
124
  Very high.
101
125
 
102
- ## Done Criteria
126
+ ## Exit Criteria
103
127
  - Stable selector preference order implemented
104
128
  - Test tags/resource IDs surfaced where available
105
129
  - Selector confidence metadata available
@@ -112,24 +136,26 @@ Very high.
112
136
 
113
137
  ## Dependencies
114
138
  Blocks or strengthens:
115
- - Priority 4 — Long Press targeting reliability
116
- - Priority 5 — Better Compose / Custom Control Semantics
117
- - Priority 6 — Pinch to Zoom targeting
139
+ - Long Press Gesture
140
+ - Better Compose / Custom Control Semantics
141
+ - Pinch to Zoom
118
142
 
119
143
  ---
120
144
 
121
- # Priority 3 — Wait and Synchronization Reliability
145
+ # Wait and Synchronization Reliability
122
146
 
123
- ## Why third
147
+ ## Rationale
124
148
  Reliable async synchronization is foundational for agent success and should precede gesture expansion.
125
149
 
150
+ **Status:** Spec Ready
151
+
126
152
  Addresses failures where agents:
127
153
  - skip UI waits after actions
128
154
  - rely on network/log signals too early
129
155
  - struggle with in-place UI updates
130
156
  - misread stale UI snapshots
131
157
 
132
- ## Deliver
158
+ ## Scope
133
159
  - UI-first synchronization policy guidance
134
160
  - wait_for_ui_change (hierarchy diff based waiting)
135
161
  - Structured loading state detection
@@ -139,7 +165,7 @@ Addresses failures where agents:
139
165
  ## Expected Impact
140
166
  Very high.
141
167
 
142
- ## Done Criteria
168
+ ## Exit Criteria
143
169
  - wait_for_ui_change implemented
144
170
  - Loading state detection available for representative controls
145
171
  - Snapshot revision or staleness metadata exposed
@@ -154,20 +180,159 @@ Very high.
154
180
 
155
181
  ## Dependencies
156
182
  Depends on:
157
- - Priority 1 — Stronger State Verification
158
- - Priority 2 — Richer Element Identity
183
+ - Stronger State Verification
184
+ - Richer Element Identity
185
+
186
+ Blocks or strengthens:
187
+ - Better Compose / Custom Control Semantics
188
+ - Action Trace Correlation
189
+
190
+ ---
191
+
192
+ # Actionability Resolution
193
+
194
+ ## Rationale
195
+ Reduces failures caused by interacting with discoverable but non-actionable UI nodes.
196
+
197
+ **Status:** Planned
198
+
199
+ Addresses cases where:
200
+ - visible text is not the true click target
201
+ - child nodes differ from actionable containers
202
+ - affordance exists but handler ownership is ambiguous
203
+
204
+ ## Scope
205
+ - Actionable container resolution
206
+ - Executable-target preference rules
207
+ - Actionability confidence metadata
208
+ - Post-action state verification integration
209
+
210
+ ## Expected Impact
211
+ High.
212
+
213
+ ## Exit Criteria
214
+ - Actionable target resolution implemented
215
+ - Preference rules defined for executable containers over leaf nodes
216
+ - Actionability confidence surfaced
217
+ - Benchmark flows show reduced false taps and submit ambiguity
218
+
219
+ ## Success Metrics
220
+ - Reduced mis-targeted action failures
221
+ - Lower retarget retries
222
+ - Higher first-attempt action success
223
+
224
+ ## Dependencies
225
+ Depends on:
226
+ - Stronger State Verification
227
+ - Richer Element Identity
228
+ - Wait and Synchronization Reliability
229
+
230
+ Blocks or strengthens:
231
+ - Adjustable Control Support
232
+ - Better Compose / Custom Control Semantics
233
+
234
+ ---
235
+
236
+ # Adjustable Control Support
237
+
238
+ ## Rationale
239
+ High leverage improvement for sliders and parameterized controls.
240
+
241
+ **Status:** Planned
242
+
243
+ Addresses friction around:
244
+ - coordinate-calibrated slider interaction
245
+ - snapping and quantized controls
246
+ - weak state confirmation after adjustment
247
+
248
+ ## Scope
249
+ New semantic control support:
250
+
251
+ ```json
252
+ set_slider_value(target, value, tolerance?)
253
+ ```
254
+
255
+ Includes:
256
+ - semantic adjustable control manipulation
257
+ - read-back verification loop
258
+ - tolerance-aware value setting
259
+ - fallback coordinate calibration only when needed
260
+
261
+ ## Expected Impact
262
+ High.
263
+
264
+ ## Exit Criteria
265
+ - Adjustable control primitive implemented
266
+ - Verification loop reads and confirms resulting values
267
+ - Tolerance model defined
268
+ - Benchmark slider/custom control flows validated
269
+
270
+ ## Success Metrics
271
+ - Higher custom control interaction success rate
272
+ - Fewer retries adjusting controls
273
+ - Reduced coordinate-guessing failures
274
+
275
+ ## Dependencies
276
+ Depends on:
277
+ - Stronger State Verification
278
+ - Richer Element Identity
279
+ - Actionability Resolution
159
280
 
160
281
  Blocks or strengthens:
161
- - Priority 5 — Better Compose / Custom Control Semantics
162
- - Priority 7 — Action Trace Correlation
282
+ - Better Compose / Custom Control Semantics
283
+ - Pinch to Zoom
284
+
285
+ ---
286
+
287
+ # Signal-Oriented Diagnostic Filtering
288
+
289
+ ## Rationale
290
+ Improves observability by separating causal signals from diagnostic noise.
291
+
292
+ **Status:** Planned
293
+
294
+ Addresses friction from:
295
+ - noisy log streams
296
+ - weak signal extraction
297
+ - difficult action-to-signal attribution
298
+
299
+ ## Scope
300
+ - Structured diagnostic classification
301
+ - Noise filtering heuristics
302
+ - Signal relevance scoring
303
+ - App vs system event tagging
304
+
305
+ ## Expected Impact
306
+ High.
307
+
308
+ ## Exit Criteria
309
+ - Diagnostic signal classification model defined
310
+ - Noise filtering available in representative flows
311
+ - Relevant action-linked signals surfaced separately from background noise
312
+ - Debug workflows validated with filtered signals
313
+
314
+ ## Success Metrics
315
+ - Lower time-to-root-cause
316
+ - Faster identification of relevant action signals
317
+ - Reduced diagnostic ambiguity
318
+
319
+ ## Dependencies
320
+ Depends on:
321
+ - Stronger State Verification
322
+ - Wait and Synchronization Reliability
323
+
324
+ Strengthens:
325
+ - Action Trace Correlation
163
326
 
164
327
  ---
165
328
 
166
- # Priority 4 — Long Press Gesture
329
+ # Long Press Gesture
167
330
 
168
- ## Why fourth
331
+ ## Rationale
169
332
  High utility, relatively low complexity.
170
333
 
334
+ **Status:** Planned
335
+
171
336
  Unlocks many currently awkward interactions:
172
337
 
173
338
  - context menus
@@ -177,7 +342,7 @@ Unlocks many currently awkward interactions:
177
342
 
178
343
  Broad usefulness.
179
344
 
180
- ## Deliver
345
+ ## Scope
181
346
  New tool:
182
347
 
183
348
  ```json
@@ -191,7 +356,7 @@ Verification alignment:
191
356
  ## Expected Impact
192
357
  High.
193
358
 
194
- ## Done Criteria
359
+ ## Exit Criteria
195
360
  - long_press tool implemented across supported platforms
196
361
  - Duration defaults and overrides supported
197
362
  - Verification patterns for long press outcomes defined
@@ -204,25 +369,28 @@ High.
204
369
 
205
370
  ## Dependencies
206
371
  Depends on:
207
- - Priority 2 — Richer Element Identity
372
+ - Richer Element Identity
208
373
 
209
374
  Strengthens:
210
- - Priority 5 semantics interaction contracts
375
+ - Better Compose / Custom Control Semantics
211
376
 
212
377
  ---
213
378
 
214
- # Priority 5 — Better Compose / Custom Control Semantics
379
+ # Better Compose / Custom Control Semantics
380
+
381
+ ## Rationale
382
+ Important, but strengthened by earlier capabilities first.
215
383
 
216
- ## Why fifth
217
- Important, but strengthened by priorities 1–4 first.
384
+ **Status:** Planned
218
385
 
219
386
  Semantics become more useful once:
220
387
  - identity is stronger
221
388
  - verification is stronger
222
389
  - gestures are richer
223
390
  - synchronization is more reliable
391
+ - action execution is more precise
224
392
 
225
- ## Deliver
393
+ ## Scope
226
394
  - Composite control traits
227
395
  - Control role enrichment (adjustable, expandable, selectable_group)
228
396
  - Interaction contracts metadata
@@ -233,7 +401,7 @@ Semantics become more useful once:
233
401
  ## Expected Impact
234
402
  High.
235
403
 
236
- ## Done Criteria
404
+ ## Exit Criteria
237
405
  - Semantic traits implemented for major custom control classes
238
406
  - Interaction contracts surfaced in snapshot model
239
407
  - Confidence model defined for derived semantics
@@ -246,18 +414,23 @@ High.
246
414
 
247
415
  ## Dependencies
248
416
  Depends on:
249
- - Priority 1 — Stronger State Verification
250
- - Priority 2 — Richer Element Identity
251
- - Priority 3 — Wait and Synchronization Reliability
252
- - Priority 4 — Long Press
417
+ - Stronger State Verification
418
+ - Richer Element Identity
419
+ - Wait and Synchronization Reliability
420
+ - Actionability Resolution
421
+ - Adjustable Control Support
422
+ - Signal-Oriented Diagnostic Filtering
423
+ - Long Press Gesture
253
424
 
254
425
  ---
255
426
 
256
- # Priority 6 — Pinch to Zoom
427
+ # Pinch to Zoom
257
428
 
258
- ## Why sixth
429
+ ## Rationale
259
430
  Valuable, but narrower than long press.
260
431
 
432
+ **Status:** Planned
433
+
261
434
  Applies mainly to:
262
435
  - maps
263
436
  - images
@@ -266,7 +439,7 @@ Applies mainly to:
266
439
 
267
440
  Useful, but less universal.
268
441
 
269
- ## Deliver
442
+ ## Scope
270
443
 
271
444
  ```json
272
445
  pinch_to_zoom(target, scale, center?)
@@ -279,7 +452,7 @@ Verification:
279
452
  ## Expected Impact
280
453
  Medium-high.
281
454
 
282
- ## Done Criteria
455
+ ## Exit Criteria
283
456
  - pinch_to_zoom implemented
284
457
  - Zoom in/out flows supported
285
458
  - Verification primitives for viewport or zoom state available
@@ -292,27 +465,29 @@ Medium-high.
292
465
 
293
466
  ## Dependencies
294
467
  Depends on:
295
- - Priority 1 — Stronger State Verification
296
- - Priority 2 — Richer Element Identity
468
+ - Stronger State Verification
469
+ - Richer Element Identity
297
470
 
298
471
  ---
299
472
 
300
- # Priority 7 — Action Trace Correlation
473
+ # Action Trace Correlation
301
474
 
302
- ## Why seventh
475
+ ## Rationale
303
476
  Very valuable for debugging,
304
477
  but less critical than improving control success first.
305
478
 
479
+ **Status:** Planned
480
+
306
481
  Improves diagnosis more than task completion.
307
482
 
308
- ## Deliver
483
+ ## Scope
309
484
  - Action correlation metadata
310
485
  - UI/network/log linkage
311
486
 
312
487
  ## Expected Impact
313
488
  Medium-high.
314
489
 
315
- ## Done Criteria
490
+ ## Exit Criteria
316
491
  - Action correlation model defined
317
492
  - UI/network/log linkage captured for representative actions
318
493
  - Correlation metadata exposed to agents
@@ -325,79 +500,97 @@ Medium-high.
325
500
 
326
501
  ## Dependencies
327
502
  Depends on:
328
- - Priority 1 — Stronger State Verification
329
- - Priority 2 — Richer Element Identity
330
- - Priority 3 — Wait and Synchronization Reliability
503
+ - Stronger State Verification
504
+ - Richer Element Identity
505
+ - Wait and Synchronization Reliability
331
506
 
332
507
  ---
333
508
 
334
- # Delivery Waves
509
+ # Roadmap Sequence
335
510
 
336
511
  ## Dependency Summary
337
- Foundational sequence:
338
512
 
339
- Layer 1 (Foundations)
340
- - Priority 1
341
- - Priority 2
513
+ Foundation
514
+ - Stronger State Verification
515
+ - Richer Element Identity
516
+
517
+ Synchronization & Actionability
518
+ - Wait and Synchronization Reliability
519
+ - Actionability Resolution
342
520
 
343
- Layer 2 (Synchronization)
344
- - Priority 3 depends on 1,2
521
+ Control Precision & Observability
522
+ - Adjustable Control Support
523
+ - Signal-Oriented Diagnostic Filtering
345
524
 
346
- Layer 3 (Interaction Expansion)
347
- - Priority 4 depends on 2
348
- - Priority 5 depends on 1,2,3,4
349
- - Priority 6 depends on 1,2
525
+ Interaction Expansion
526
+ - Long Press Gesture
527
+ - Better Compose / Custom Control Semantics
528
+ - Pinch to Zoom
350
529
 
351
- Layer 4 (Observability)
352
- - Priority 7 depends on 1,2,3
530
+ Deep Observability
531
+ - Action Trace Correlation
353
532
 
354
- ## Wave 1 (Immediate)
533
+ ## Wave 1 (Current Focus)
355
534
  - Stronger State Verification
356
535
  - Richer Element Identity
357
536
  - Wait and Synchronization Reliability
537
+ - Actionability Resolution
358
538
 
359
539
  Focus:
360
540
  Make core loop more reliable.
361
541
 
362
542
  ---
363
543
 
364
- ## Wave 2
365
- - Long Press
366
- - Better Compose Semantics
544
+ ## Wave 2 (Control Precision + Diagnostics)
545
+ - Adjustable Control Support
546
+ - Signal-Oriented Diagnostic Filtering
547
+
548
+ Focus:
549
+ Improve control precision and signal observability.
550
+
551
+ ---
552
+
553
+ ## Wave 3 (Interaction Expansion)
554
+ - Long Press Gesture
555
+ - Better Compose / Custom Control Semantics
367
556
 
368
557
  Focus:
369
558
  Expand interaction capability.
370
559
 
371
560
  ---
372
561
 
373
- ## Wave 3
562
+ ## Wave 4 (Advanced Gestures + Deep Observability)
374
563
  - Pinch to Zoom
375
564
  - Action Trace Correlation
376
565
 
377
566
  Focus:
378
- Advanced gestures + observability.
567
+ Advanced gestures + deep observability.
379
568
 
380
569
  ---
381
570
 
382
- # Priority Stack Summary
571
+ # Roadmap Ordering
383
572
 
384
- Execution Order:
573
+ Roadmap Ordering:
385
574
  1. Stronger State Verification
386
575
  2. Richer Element Identity
387
576
  3. Wait and Synchronization Reliability
388
- 4. Long Press
389
- 5. Better Compose / Custom Control Semantics
390
- 6. Pinch to Zoom
391
- 7. Action Trace Correlation
577
+ 4. Actionability Resolution
578
+ 5. Adjustable Control Support
579
+ 6. Signal-Oriented Diagnostic Filtering
580
+ 7. Long Press Gesture
581
+ 8. Better Compose / Custom Control Semantics
582
+ 9. Pinch to Zoom
583
+ 10. Action Trace Correlation
392
584
 
393
585
  Rationale:
394
- - Priorities 1–3 harden control, verification, and synchronization.
395
- - Priorities 4–6 expand interaction capability.
396
- - Priority 7 adds observability once control reliability matures.
586
+ - Early roadmap items harden state, targeting, synchronization, action execution.
587
+ - Mid roadmap items improve control precision and signal observability.
588
+ - Later interaction-focused items expand interaction coverage.
589
+ - Final observability work deepens debugging observability.
397
590
 
398
591
  ---
399
592
 
400
- ## Explicitly Deferred
593
+ ## Future Considerations
401
594
  Still out of scope:
402
595
 
403
596
  - Recovery planning logic