npm - @ulpi/browse - Versions diffs - 2.3.3 → 2.3.4 - Mend

@ulpi/browse 2.3.3 → 2.3.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7)

package/dist/browse.cjs +1044 -859
package/package.json +1 -1
package/skill/{SKILL.md → browse/SKILL.md} +87 -8
package/skill/browse-qa/SKILL.md +299 -0
package/skill/{references → browse/references}/commands.md +0 -0
package/skill/{references → browse/references}/guides.md +0 -0
package/skill/{references → browse/references}/permissions.md +0 -0

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "@ulpi/browse",
-  "version": "2.3.3",
+  "version": "2.3.4",
   "homepage": "https://browse.ulpi.io",
   "repository": {
     "type": "git",

package/skill/{SKILL.md → browse/SKILL.md} RENAMED

@@ -2,19 +2,96 @@
 name: browse
 version: 3.5.0
 description: |
-  Fast web browsing and web app testing for AI coding agents via persistent headless Chromium daemon.
-  Browse any URL, read page content, click elements, fill forms, run JavaScript, take screenshots,
-  inspect CSS/DOM, capture console/network logs, and more. Ideal for verifying local dev servers,
-  testing UI changes, and validating web app behavior end-to-end. ~100ms per command after
-  first call. Works with Claude Code, Cursor, Cline, Windsurf, and any agent that can run Bash.
-  No MCP, no Chrome extension — just fast CLI.
+  Fast web browsing, web app testing, and native app automation for AI coding agents.
+  Persistent headless Chromium for web. Android, iOS, and macOS app automation via accessibility APIs.
+  Browse URLs, read content, click elements, fill forms, run JavaScript, take screenshots,
+  automate native apps — all through the same CLI and @ref workflow. ~100ms per command.
+  Auto-installs Android toolchain. Works with Claude Code, Cursor, Cline, Windsurf, and any agent that can run Bash.
 allowed-tools:
   - Bash
   - Read
 ---
-# browse: Persistent Browser for AI Coding Agents
+# browse: Browser & Native App Automation for AI Agents
+## Target Decision — ALWAYS check this first
+Before running any browse command, decide the correct target:
+| User wants to... | Target | Command pattern |
+|---|---|---|
+| Open a URL, test a website, scrape web content | **Browser** (default) | `browse goto <url>` |
+| Test a local dev server (`localhost`) | **Browser** | `browse goto http://localhost:3000` |
+| Interact with an iOS app (Settings, Safari, custom app) | **iOS Simulator** | `browse --platform ios --app <bundleId> <cmd>` |
+| Interact with an Android app (Settings, Chrome, custom app) | **Android Emulator** | `browse --platform android --app <package> <cmd>` |
+| Interact with a macOS desktop app (System Settings, TextEdit) | **macOS App** | `browse --app <name> <cmd>` |
+| Install and test an iOS .app or .ipa file | **iOS Simulator** | `browse sim start --platform ios --app ./MyApp.app --visible` |
+| Install and test an Android .apk file | **Android Emulator** | `browse sim start --platform android --app ./app.apk --visible` |
+**Key rules:**
+- **No `--platform` or `--app` flag** → browser target (Chromium). Use `goto` to navigate.
+- **`--app` without `--platform`** → macOS app automation. App must be running.
+- **`--platform ios --app`** → iOS Simulator. Use `browse sim start` first if not running.
+- **`--platform android --app`** → Android Emulator. Use `browse sim start` first if not running.
+- **Native app targets do NOT support**: `goto`, `js`, `eval`, `tabs`, `cookies`, `route`, `har`. These are browser-only.
+- **All targets support**: `snapshot`, `text`, `tap`, `fill`, `type`, `press`, `swipe`, `screenshot`.
+- **If unsure which target to use, ASK the user.** "Should I open this in the browser, or do you want me to use the iOS/Android/macOS app?" Don't guess — wrong target = wasted work.
+### Native App Quick Start
+```bash
+# iOS — boot simulator, open Settings, interact
+browse sim start --platform ios --app com.apple.Preferences --visible
+browse --platform ios --app com.apple.Preferences snapshot -i
+browse --platform ios --app com.apple.Preferences tap @e3
+# Android — boot emulator (auto-installs toolchain), open Settings, interact
+browse sim start --platform android --app com.android.settings --visible
+browse --platform android --app com.android.settings snapshot -i
+browse --platform android --app com.android.settings tap @e3
+# macOS — no sim needed, app must be running
+browse --app "System Settings" snapshot -i
+browse --app "System Settings" tap @e5
+# Install and test your own app from a file
+browse sim start --platform ios --app ./build/MyApp.app --visible   # .app bundle
+browse sim start --platform ios --app ./MyApp.ipa --visible          # .ipa archive
+browse sim start --platform android --app ./app-debug.apk --visible  # .apk file
+# Switch app targets (iOS: instant reconfigure, Android: driver restart)
+browse --platform ios --app com.apple.mobilesafari snapshot -i
+browse sim start --platform android --app com.google.android.dialer --visible
+# Enable platforms (install dependencies, build drivers — run once)
+browse enable android    # Auto-installs adb, JDK, SDK, emulator, driver
+browse enable ios        # Builds iOS runner (needs Xcode)
+browse enable macos      # Builds browse-ax bridge
+browse enable all        # All platforms
+```
+### Common iOS Bundle IDs
+| App | Bundle ID |
+|-----|-----------|
+| Settings | `com.apple.Preferences` |
+| Safari | `com.apple.mobilesafari` |
+| Maps | `com.apple.Maps` |
+| Photos | `com.apple.mobileslideshow` |
+| Calendar | `com.apple.mobilecal` |
+### Common Android Package Names
+| App | Package Name |
+|-----|-------------|
+| Settings | `com.android.settings` |
+| Chrome | `com.android.chrome` |
+| Dialer | `com.google.android.dialer` |
+| Messages | `com.google.android.apps.messaging` |
+| Calculator | `com.google.android.calculator` |
+---
+## Browser Quick Start
 Persistent headless Chromium daemon. First call auto-starts the server (~3s).
 Every subsequent call: ~100-200ms. Auto-shuts down after 30 min idle.
@@ -190,7 +267,7 @@ Refs are invalidated on navigation — run `snapshot` again after `goto`.
 ### Native App Automation
 ```
-browse sim start --platform ios|android --app <id> [--visible]  Start simulator/emulator + app
+browse sim start --platform ios|android --app <id-or-path> [--visible]  Start + install/launch app
 browse sim stop --platform ios|android                          Stop simulator/emulator
 browse sim status --platform ios|android                        Check runner status
 browse --platform ios --app <bundleId> <command>                Target iOS app
@@ -198,6 +275,8 @@ browse --platform android --app <package> <command>             Target Android a
 browse --app <name> <command>                                   Target macOS app
 ```
+The `--app` flag accepts a bundle ID, package name, **or file path** (.app/.ipa/.apk). File paths auto-install the app into the simulator/emulator.
 Supported commands on all app platforms: `snapshot`, `text`, `tap`, `fill`, `type`, `press`, `swipe`, `screenshot`.
 macOS also supports modifier combos: `browse --app TextEdit press "cmd+n"`.
 Android auto-installs adb, Java, SDK, and emulator on first use (macOS via Homebrew).
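
Reviewer note: the "Key rules" added to browse/SKILL.md amount to a small decision table over the `--platform` and `--app` flags, plus a list of commands that every target supports. The sketch below restates both as shell, using only what the diff text says. `browse_target` and `native_supports` are hypothetical helper names for illustration, not part of the browse CLI, and the `ask-user` result is an assumption for flag combinations the rules do not cover.

```shell
#!/bin/sh
# Sketch of the target decision rules from the added SKILL.md text.
# browse_target and native_supports are hypothetical helpers, NOT browse CLI commands.

# Print the target selected by a --platform / --app combination.
#   $1 = value of --platform ("" when the flag is absent)
#   $2 = value of --app      ("" when the flag is absent)
browse_target() {
  platform=$1
  app=$2
  if [ -z "$platform" ] && [ -z "$app" ]; then
    echo "browser"              # no flags: Chromium, navigate with goto
  elif [ -z "$platform" ]; then
    echo "macos"                # --app alone: macOS app automation
  elif [ -z "$app" ]; then
    echo "ask-user"             # assumption: the rules do not cover this case
  else
    case $platform in
      ios)     echo "ios-simulator" ;;
      android) echo "android-emulator" ;;
      *)       echo "ask-user" ;;   # assumption: unknown platform value
    esac
  fi
}

# Succeed (exit status 0) if the command is in the "all targets support" list.
native_supports() {
  case $1 in
    snapshot|text|tap|fill|type|press|swipe|screenshot) return 0 ;;
    *) return 1 ;;
  esac
}
```

For example, `browse_target ios com.apple.Preferences` prints `ios-simulator`, and `native_supports goto` fails because `goto` is listed as browser-only.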

package/skill/browse-qa/SKILL.md ADDED

@@ -0,0 +1,299 @@
+---
+name: browse-qa
+description: |
+  AI QA agent powered by browse. Validates features, runs exploratory testing, and generates
+  automated regression tests across web, iOS, Android, and macOS. Accepts work from any source:
+  Jira stories, Linear tickets, GitHub issues, plain English descriptions, or acceptance criteria.
+  Outputs a reusable browse flow file that can be rerun for regression testing.
+  Use when the user says "QA this", "test this feature", "validate this story", "check this works",
+  or provides a ticket ID/URL to verify.
+allowed-tools:
+  - Bash
+  - Read
+  - Write
+---
+# browse-qa: AI QA Agent
+QA any feature across web, iOS, Android, and macOS. Accept a spec from anywhere, test it live, generate automated regression tests.
+## When to Use
+- User says "QA this", "test this feature", "validate this story", "check this works"
+- User provides a Jira ticket, Linear issue, GitHub issue, or any feature description
+- User wants to verify acceptance criteria against a live app or website
+- User wants to generate automated tests from a manual QA session
+- User asks for exploratory testing of a feature or page
+## Input Sources
+The spec can come from:
+1. **Jira MCP** — if `mcp__jira__*` tools are available, fetch the ticket:
+   ```
+   mcp__jira__get_issue({ issueKey: "PROJ-123" })
+   ```
+2. **Linear MCP** — if `mcp__linear__*` tools are available
+3. **GitHub issue** — use browse to read the issue page
+4. **Plain text** — user describes what to test
+5. **File** — user points to a spec file
+6. **URL** — user provides a ticket URL (use browse to read it)
+## Workflow
+### Phase 1: Understand What to QA
+1. **Get the spec** — from MCP, user text, or URL
+2. **Break it down** into concrete, testable scenarios:
+   - What is the feature?
+   - Where does it live? (URL, app, platform)
+   - What are the acceptance criteria?
+   - What should happen? What should NOT happen?
+   - Edge cases: empty input, invalid data, boundary values
+3. **Present the QA plan**:
+   ```
+   QA plan for [feature]:
+   Target: [URL / app / platform]
+   Scenarios:
+   1. [Happy path] — expected: [outcome]
+   2. [Edge case] — expected: [outcome]
+   3. [Error case] — expected: [outcome]
+   4. [Boundary] — expected: [outcome]
+   ```
+4. **Confirm with user** before executing
+### Phase 2: Decide the Target
+| Spec mentions... | Target | Setup |
+|---|---|---|
+| URL, website, web page, localhost | **Browser** | `browse --headed goto <url>` |
+| iOS app, iPhone, iPad, Swift | **iOS Simulator** | `browse sim start --platform ios --app <id> --visible` |
+| Android app, Kotlin, phone | **Android Emulator** | `browse sim start --platform android --app <id> --visible` |
+| macOS app, desktop app | **macOS** | `browse --app <name>` |
+| User has a .app/.ipa build to test | **iOS Simulator** | `browse sim start --platform ios --app ./path/to/App.app --visible` |
+| User has an .apk build to test | **Android Emulator** | `browse sim start --platform android --app ./path/to/app.apk --visible` |
+| Ambiguous ("our app", "the app") | **Ask the user** | "Which platform? Do you have a build file (.app/.ipa/.apk)?" |
+**If unsure, always ask.** Don't guess the platform. If the user has a build artifact, ask for the file path.
+### Phase 3: Execute QA
+1. **Start recording** — every command becomes part of the regression test:
+   ```bash
+   browse record start
+   ```
+2. **For each scenario**, follow this pattern:
+   **Navigate:**
+   ```bash
+   browse goto <url>                    # web
+   browse wait --network-idle           # web
+   browse --platform ios --app <id> snapshot -i  # native
+   ```
+   **Interact:**
+   ```bash
+   browse snapshot -i                   # find elements
+   browse click @e3                     # click
+   browse fill @e4 "test value"         # fill
+   browse press Enter                   # submit
+   browse swipe up                      # scroll (native)
+   ```
+   **Verify — this is the critical part:**
+   ```bash
+   browse text                          # check visible text contains expected content
+   browse snapshot -i                   # check element exists/state
+   browse js "document.querySelector('.success')?.textContent"  # specific check
+   browse count ".error-message"        # check error count
+   browse screenshot .browse/sessions/default/scenario-1.png
+   ```
+3. **Report each scenario immediately:**
+   ```
+   ✓ Scenario 1: Valid discount code
+     Verified: price updated from AED 500 to AED 400
+     Screenshot: .browse/sessions/default/scenario-1.png
+   ✗ Scenario 2: Invalid discount code
+     Expected: error message "Invalid code"
+     Actual: no error shown, page unchanged
+     Screenshot: .browse/sessions/default/scenario-2.png
+   ```
+### Phase 4: Exploratory Testing
+After testing the stated scenarios, do exploratory testing:
+- **Try unexpected inputs** — special characters, very long text, HTML injection
+- **Test responsiveness** — `browse emulate "iPhone 14"` then recheck
+- **Check edge states** — empty cart, logged out, slow network (`browse offline on`)
+- **Verify error handling** — what happens when things go wrong?
+- **Check accessibility** — `browse a11y-audit` on key pages
+Report any additional findings as bugs, not scenario failures.
+### Phase 5: Generate Regression Tests
+For complex features, generate **multiple flow files** — one per logical scenario or user journey. Each should be independently runnable.
+**Strategy:** Start recording before each scenario group, stop and save after it completes. Then start a new recording for the next group.
+```bash
+# Scenario group 1: Happy path
+browse record start
+# ... execute happy path steps ...
+browse record stop
+browse flow save checkout-happy-path
+# Scenario group 2: Validation errors
+browse record start
+# ... execute validation error steps ...
+browse record stop
+browse flow save checkout-validation-errors
+# Scenario group 3: Discount codes
+browse record start
+# ... execute discount code steps ...
+browse record stop
+browse flow save checkout-discount-codes
+```
+**When to split into multiple flows:**
+- Different user journeys (login → browse → checkout is 3 flows, not 1)
+- Different personas (guest checkout vs logged-in checkout)
+- Different error scenarios (each error type gets its own flow)
+- Different platforms (web flow + iOS flow for the same feature)
+- When a single flow would exceed ~20 steps
+**When to keep as one flow:**
+- Simple feature with 1-5 steps
+- Linear flow with no branching
+- Single acceptance criterion
+Tell the user how to rerun:
+```
+Regression tests saved:
+  .browse/flows/checkout-happy-path.yaml
+  .browse/flows/checkout-validation-errors.yaml
+  .browse/flows/checkout-discount-codes.yaml
+Rerun all:
+  browse flow run checkout-happy-path
+  browse flow run checkout-validation-errors
+  browse flow run checkout-discount-codes
+Or run individually:
+  browse flow run checkout-happy-path
+```
+### Phase 6: QA Report
+```
+## QA Report: [Feature Name]
+**Source:** [Jira PROJ-123 / user description]
+**Target:** [https://example.com / com.example.app on iOS]
+**Tested:** [date]
+### Acceptance Criteria
+| # | Criteria | Result | Evidence |
+|---|----------|--------|----------|
+| 1 | [criteria from spec] | ✓ PASS | [what was verified] |
+| 2 | [criteria from spec] | ✗ FAIL | [expected vs actual] |
+### Exploratory Findings
+| # | Finding | Severity | Details |
+|---|---------|----------|---------|
+| 1 | [issue found] | High | [description + screenshot] |
+| 2 | [edge case] | Low | [description] |
+### Regression Tests
+| Flow | Steps | Rerun |
+|------|-------|-------|
+| .browse/flows/<name-1>.yaml | [N] | browse flow run <name-1> |
+| .browse/flows/<name-2>.yaml | [N] | browse flow run <name-2> |
+### Screenshots
+[list of screenshots taken during QA]
+### Verdict
+[PASS / FAIL / PASS WITH ISSUES]
+[Summary of what works, what doesn't, and recommended actions]
+```
+## Rules
+1. **Always record** — `browse record start` before any testing. Every QA session should produce a regression test.
+2. **Verify, don't just navigate** — the value of QA is checking outcomes, not clicking around. Use `browse text`, `browse count`, `browse js` to verify.
+3. **Screenshot at every checkpoint** — evidence for the report and useful for debugging failures.
+4. **Test the unhappy path** — edge cases, errors, and boundaries are where bugs hide.
+5. **Ask when ambiguous** — if the spec doesn't say where to test or what "correct" looks like, ask.
+6. **Name flows after the feature** — `checkout-discount`, `user-registration`, `search-filters`. Not `test1`.
+7. **Split complex features into multiple flows** — one flow per logical scenario or user journey. A checkout feature might produce `checkout-happy-path.yaml`, `checkout-discount-code.yaml`, `checkout-validation-errors.yaml`. Each flow should be independently runnable.
+8. **Report honestly** — if something doesn't work, say so clearly. Don't gloss over failures.
+## Native App QA
+Same workflow, native commands:
+```bash
+# iOS — from bundle ID (app already installed)
+browse sim start --platform ios --app com.example.myapp --visible
+# iOS — install from .app or .ipa build file
+browse sim start --platform ios --app ./build/MyApp.app --visible
+browse sim start --platform ios --app ./MyApp.ipa --visible
+# Android — install from .apk build file
+browse sim start --platform android --app ./app/build/outputs/apk/debug/app-debug.apk --visible
+# Then QA as normal
+browse record start
+browse --platform ios --app com.example.myapp snapshot -i
+browse --platform ios --app com.example.myapp tap @e3
+browse --platform ios --app com.example.myapp fill @e5 "test"
+browse --platform ios --app com.example.myapp press return
+browse --platform ios --app com.example.myapp screenshot .browse/sessions/default/ios-test.png
+browse record stop
+browse flow save ios-feature-qa
+```
+## Examples
+### Example 1: Jira Story
+**User:** "QA SHOP-789"
+**Agent:**
+1. `mcp__jira__get_issue({ issueKey: "SHOP-789" })` → "Add discount code field to checkout. Valid codes reduce price. Invalid codes show error."
+2. Plans: happy path (valid code), error path (invalid), edge case (empty submit), boundary (expired code)
+3. Opens checkout page in headed browser
+4. Tests each scenario, screenshots at each step
+5. Finds: valid code works, invalid code works, BUT empty submit crashes the page
+6. Reports: 3/4 PASS, 1 FAIL (empty submit bug), flow saved as `checkout-discount-qa`
+### Example 2: Plain English
+**User:** "QA the search on mumzworld.com — make sure filters work"
+**Agent:**
+1. Plans: search text, apply price filter, apply brand filter, clear filters, empty search
+2. Opens mumzworld.com in headed browser
+3. Tests each filter combination
+4. Reports results with screenshots
+5. Saves flow: `mumzworld-search-filters-qa`
+### Example 3: Mobile App
+**User:** "QA the login flow on our iOS app, bundle is com.ourcompany.app"
+**Agent:**
+1. Boots iOS Simulator with the app
+2. Plans: valid login, invalid password, empty fields, forgot password link
+3. Tests each scenario in the simulator
+4. Reports with screenshots
+5. Saves flow: `ios-login-qa`
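
Reviewer note: Phase 5 of the new browse-qa skill tells the user to rerun saved flows one `browse flow run <name>` call at a time. A minimal wrapper for rerunning several flows and tallying the results might look like the sketch below. It assumes that `browse flow run` exits non-zero when a flow fails, which the diff does not state. `rerun_flows` and the `BROWSE_BIN` override are hypothetical names introduced here so the loop can be dry-run without the CLI installed.

```shell
#!/bin/sh
# Sketch: rerun saved regression flows and summarize pass/fail counts.
# ASSUMPTION: `browse flow run <name>` exits non-zero on failure (not confirmed by the source).
# rerun_flows and BROWSE_BIN are hypothetical, introduced for illustration only.

rerun_flows() {
  browse_bin=${BROWSE_BIN:-browse}   # override to stub out the CLI in a dry run
  passed=0
  failed=0
  for flow in "$@"; do
    if "$browse_bin" flow run "$flow"; then
      passed=$((passed + 1))
    else
      failed=$((failed + 1))
      echo "FAILED: $flow"
    fi
  done
  echo "flows passed: $passed, failed: $failed"
  # Exit status reflects whether every flow passed.
  [ "$failed" -eq 0 ]
}
```

Usage with the flow names from the diff: `rerun_flows checkout-happy-path checkout-validation-errors checkout-discount-codes`. The non-zero exit status on any failure is a design choice that would let a CI job gate on the regression run.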

package/skill/{references → browse/references}/commands.md RENAMED

File without changes

package/skill/{references → browse/references}/guides.md RENAMED

File without changes

package/skill/{references → browse/references}/permissions.md RENAMED

File without changes