tabctl 0.6.0-rc.6 → 0.6.0-rc.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -149,7 +149,7 @@ tabctl schema
149
149
 
150
150
  Read-only browser features include:
151
151
  - `inspectTabs` for page metadata and selector-based reads
152
- - `readTabs` for page HTML → Markdown conversion with best-effort extraction
152
+ - `readTabs` for page HTML → Markdown conversion with Kreuzberg preprocessing and per-tab diagnostics
153
153
  - `reportTabs`, `captureScreenshots`, and browser-state history queries for summaries and context
154
154
 
155
155
  See [CLI.md](CLI.md) for the full command reference, options, and examples.
@@ -169,8 +169,8 @@ tabctl query 'query { inspectTabs(tabIds: [456], signals: ["page-meta"]) { entri
169
169
  # Inspect selectors with typed attrs and filters
170
170
  tabctl query 'query { inspectTabs(windowId: 123, selectors: [{ name: "prices", selector: ".price", attr: "text", all: true }, { name: "buy_now_visible", selector: "button.buy-now", attr: "visible" }, { name: "buy_now_style", selector: "button.buy-now", attr: "styles", styleProps: ["color", "background-color"] }, { name: "review_count", selector: ".review", attr: "count" }, { name: "email_value", selector: "input[type=email]", attr: "value" }]) { entries { tabId url signals { name valueJson } } } }'
171
171
 
172
- # Read tab content as Markdown (best-effort extraction by default)
173
- tabctl query 'query { readTabs(windowId: 123, extract: true, maxChars: 30000) { entries { tabId title url markdown chars truncated extracted error } } }'
172
+ # Read tab content as Markdown (Kreuzberg preprocessing by default)
173
+ tabctl query 'query { readTabs(windowId: 123, extract: true, maxChars: 30000) { entries { tabId title url markdown chars truncated extracted cached status emptyReason diagnostics { source sourceHtmlChars sourceTextChars documentReadyState truncatedHtml cachedAt cacheAgeMs } error } } }'
174
174
 
175
175
  # Generate reports
176
176
  tabctl query '{ reportTabs(windowId: 123) { entries { tabId title url description } } }'
@@ -427,7 +427,8 @@ Notes:
427
427
  - Browser reads and mutations now go through GraphQL via `tabctl query`.
428
428
  - Reports include short descriptions from page metadata and a fallback snippet.
429
429
  - `inspectTabs` supports `page-meta` plus selector reads with `all`, `text`, `textMode`, `styleProps`, and attrs such as `html`, `value`, `count`, `box`, `styles`, `visible`, `enabled`, and `checked`.
430
- - `readTabs` converts main-frame page HTML to Markdown, with best-effort content extraction enabled by default and status surfaced through `extracted`/`error` fields.
430
+ - `readTabs` converts main-frame page HTML to Markdown with Kreuzberg `html-to-markdown`; per-tab `status`, `emptyReason`, `diagnostics`, and `error` distinguish empty pages, unsupported URLs, injection failures, conversion failures, timeouts, and cached fallbacks (`status: CACHED`, `cached: true`, `diagnostics.source: "cache"`).
431
+ - Cached fallbacks use a bounded profile-local open-tab HTML cache refreshed after successful reads, active tab switches, and quiescent active pages; diagnostics include cache age and match mode when cached content is used.
431
432
  - `captureScreenshots` returns tile metadata and image data from GraphQL.
432
433
  - `undoAction` accepts either an explicit `txid` or `latest: true`.
433
434
  - `tabctl history --json` returns a top-level JSON array.