npm - chromeflow - Versions diffs - 0.1.32 → 0.1.34 - Mend

chromeflow 0.1.32 → 0.1.34


Files changed (5)

package/CLAUDE.md CHANGED

@@ -22,11 +22,11 @@ Do NOT ask "should I open the browser?" — just do it. The user expects seamles
    `scroll_page` then retry, or use `highlight_region` to show the user. Never use
    `osascript`, `applescript`, or any shell command to control the browser.
-2. **Never use `take_screenshot` to find element positions or confirm actions.**
-   `get_elements` returns exact DOM coordinates — always use that first. `get_page_text`
-   tells you what happened after an action — always use that before reaching for a screenshot.
-   `take_screenshot` is only for when you genuinely have no idea what the page looks like
-   and DOM queries can't help. It is a last resort, not a routine check.
+2. **Never use `take_screenshot` to read page content.** After `scroll_page`, after
+   `click_element`, after navigation — always call `get_page_text`, not `take_screenshot`.
+   `get_page_text` returns up to 20,000 characters; if truncated it tells you the next
+   `startIndex` to paginate. Screenshots are only for locating an element's pixel position
+   when DOM queries have already failed. Never take more than 1–2 screenshots in a row.
 3. **Use `wait_for_selector` to wait for async page changes** (build completion, modals,
    toasts). Never poll with repeated `take_screenshot` calls.

package/dist/tools/browser.js CHANGED

@@ -6,7 +6,7 @@ import { execSync } from "child_process";
 function registerBrowserTools(server, bridge) {
   server.tool(
     "open_page",
-    "Navigate to a URL. By default reuses the active tab. Set new_tab=true to open alongside the current tab without losing it.",
+    "Navigate to a URL. By default reuses the active tab. Set new_tab=true to open alongside the current tab without losing it. After navigating, call get_page_text to read the page \u2014 do NOT take a screenshot.",
     {
       url: z.string().url().describe("The URL to navigate to"),
       new_tab: z.boolean().optional().describe("Open in a new tab instead of replacing the current one (default false)")
@@ -50,7 +50,7 @@ ${lines.join("\n")}` }]
   );
   server.tool(
     "take_screenshot",
-    "Capture a screenshot and return it to Claude only \u2014 no file is saved, nothing goes to the clipboard. Use ONLY when you need to visually inspect the page layout or get pixel coordinates for highlight_region. DO NOT use to check page state or confirm actions \u2014 use get_page_text for that. To also save or copy the image, use take_and_copy_screenshot instead.",
+    "Capture a screenshot of the current page. IMPORTANT: Do NOT use this to read page content or check what is on the page \u2014 call get_page_text instead, which is faster and returns searchable text. Screenshots are ONLY for locating a specific element's pixel coordinates when get_elements has already failed. Never take a screenshot immediately after open_page, scroll_page, or click_element \u2014 always use get_page_text after those actions. Never take more than 1-2 screenshots in a row. To also save or copy the image, use take_and_copy_screenshot instead.",
     {},
     async () => {
       const response = await bridge.request({ type: "screenshot" });

package/dist/tools/capture.js CHANGED

@@ -60,14 +60,18 @@ After filling, call wait_for_click only if the user needs to review/confirm; oth
     "get_page_text",
     `Get the visible text content of the current page without taking a screenshot.
 Use this instead of take_screenshot whenever you need to read what's on the page \u2014 errors, build status, form labels, confirmation messages, etc.
-Only use take_screenshot when you need to locate an element's pixel position for highlight_region.`,
+Returns up to 20,000 characters at a time. If the response ends with "... (N more characters)", call again with startIndex to read the next chunk.
+Never use take_screenshot just to read page content \u2014 paginate with startIndex instead.`,
     {
       selector: z.string().optional().describe(
         `CSS selector to scope the extraction (e.g. 'main', '.error-toast', '[data-testid="status"]'). Omit to auto-extract from the main content area.`
+      ),
+      startIndex: z.number().optional().describe(
+        "Character offset to start from. Use this to read past the first 20,000 characters \u2014 the response will tell you the next startIndex when more content exists."
       )
     },
-    async ({ selector }) => {
-      const response = await bridge.request({ type: "get_page_text", selector });
+    async ({ selector, startIndex }) => {
+      const response = await bridge.request({ type: "get_page_text", selector, startIndex });
       if (response.type !== "page_text_response") throw new Error("Unexpected response");
       const text = response.text;
       return {
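
Note on the `startIndex` change above: the diff shows only the tool description and the request plumbing, not the code that slices the page text. The sketch below is one way the 20,000-character pagination could behave, under stated assumptions. `paginateText`, `readAll`, and the exact trailer wording are hypothetical names and formats chosen for illustration; they are not taken from the chromeflow source.

```javascript
// Hypothetical sketch, NOT the chromeflow implementation.
// The 20,000-character limit comes from the tool description in the diff.
const PAGE_LIMIT = 20000;

// Return one chunk of `text` beginning at `startIndex`.
// When more text remains, `rendered` carries a trailer naming the next
// startIndex. The trailer wording here is an assumption.
function paginateText(text, startIndex = 0, limit = PAGE_LIMIT) {
  if (limit <= 0) throw new RangeError("limit must be positive");
  // Clamp the offset so out-of-range values yield an empty final chunk.
  const start = Math.max(0, Math.min(startIndex, text.length));
  const end = Math.min(start + limit, text.length);
  const remaining = text.length - end;
  const chunk = text.slice(start, end);
  return {
    chunk,
    remaining,
    nextStartIndex: remaining > 0 ? end : null,
    rendered: remaining > 0
      ? `${chunk}\n... (${remaining} more characters; next startIndex: ${end})`
      : chunk,
  };
}

// Caller-side loop: request chunks until no nextStartIndex is reported.
function readAll(text, limit = PAGE_LIMIT) {
  const parts = [];
  let startIndex = 0;
  while (startIndex !== null) {
    const page = paginateText(text, startIndex, limit);
    parts.push(page.chunk);
    startIndex = page.nextStartIndex;
  }
  return parts.join("");
}
```

In this sketch a caller reads a long page by repeating the request with the reported `nextStartIndex` rather than falling back to `take_screenshot`, which is the behavior the updated description asks for.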

package/dist/tools/flow.js CHANGED

@@ -2,7 +2,7 @@ import { z } from "zod";
 function registerFlowTools(server, bridge) {
   server.tool(
     "scroll_page",
-    "Scroll the page or the focused panel up or down. Use this when the target location is unknown. If you know which field or element you need, use scroll_to_element instead \u2014 it scrolls precisely without guessing. After scrolling, retry click_element or fill_input.",
+    "Scroll the page or the focused panel up or down. Use this when the target location is unknown. If you know which field or element you need, use scroll_to_element instead \u2014 it scrolls precisely without guessing. After scrolling, call get_page_text to read the new content \u2014 NEVER call take_screenshot after scrolling.",
     {
       direction: z.enum(["down", "up"]).describe("Scroll direction"),
       amount: z.number().optional().describe("Pixels to scroll (default 400)")

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "chromeflow",
-  "version": "0.1.32",
+  "version": "0.1.34",
   "description": "Browser guidance MCP server for Claude Code — highlights, clicks, fills, and captures from the web so you don't have to.",
   "type": "module",
   "bin": {