npm - pi-web-access - Versions diffs - 0.7.1 → 0.7.3 - Mend

pi-web-access 0.7.1 → 0.7.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (12) hide show

package/CHANGELOG.md CHANGED Viewed

@@ -2,6 +2,36 @@
 All notable changes to this project will be documented in this file.
+## [0.7.3] - 2026-02-05
+### Added
+- Jina Reader fallback for JS-rendered pages. When Readability returns insufficient content (cookie notices, consent walls, SPA shells), the extraction chain now tries Jina Reader (`r.jina.ai`) before falling back to Gemini. Jina handles JavaScript rendering server-side and returns clean markdown. No API key required.
+- JS-render detection heuristic (`isLikelyJSRendered`) produces more specific error messages when pages appear to load content dynamically.
+- Actionable guidance when all extraction methods fail, listing steps to configure Gemini API or use `web_search` instead.
+### Changed
+- HTTP fetch headers now mimic Chrome (realistic `User-Agent`, `Sec-Fetch-*`, `Accept-Language`) instead of the default Node.js user agent. Reduces blocks from bot-detection systems.
+- Short Readability output (< 500 chars) is now treated as a content failure, triggering the fallback chain. Previously, a 266-char cookie notice was returned as "successful" content.
+- Extraction fallback order is now: HTTP+Readability → RSC → Jina Reader → Gemini URL Context → Gemini Web → error with guidance.
+### Fixed
+- `parseTimestamp` now rejects negative values in colon-separated format (`-1:30`, `1:-30`). Previously only the numeric path (`-90`) rejected negatives, while the colon path computed and returned negative seconds.
+## [0.7.2] - 2026-02-03
+### Added
+- `model` parameter on `fetch_content` to override the Gemini model per-request (e.g. `model: "gemini-2.5-flash"`)
+- Collapsed TUI results now show a 200-char text preview instead of just the status line
+- LICENSE file (MIT)
+### Changed
+- Default Gemini model updated from `gemini-2.5-flash` to `gemini-3-flash-preview` across all API, search, URL context, YouTube, and video paths. Gemini Web gracefully falls back to `gemini-2.5-flash` when the model header isn't available.
+- README rewritten: added tagline, badges, "Why" section, Quick Start, corrected "How It Works" routing order, fixed inaccurate env var precedence claim, added missing `/v/` YouTube format, restored `/search` command docs, collapsible Files table
+### Fixed
+- `PERPLEXITY_API_KEY` env var now takes precedence over config file value, matching `GEMINI_API_KEY` behavior and README documentation (was reversed)
+- `package.json` now includes `repository`, `homepage`, `bugs`, and `description` fields (repo link was missing from pi packages site)
 ## [0.7.0] - 2026-02-03
 ### Added

package/LICENSE ADDED Viewed

@@ -0,0 +1,21 @@
+MIT License
+Copyright (c) 2025 Nico Bailon
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

package/README.md CHANGED Viewed

@@ -4,14 +4,23 @@
 # Pi Web Access
-An extension for [Pi coding agent](https://github.com/badlogic/pi-mono/) that gives Pi web capabilities: search via Perplexity AI or Gemini, fetch and extract content from URLs, clone GitHub repos for local exploration, read PDFs, understand YouTube videos, and analyze local video files.
+**Web search, content extraction, and video understanding for Pi agent. Zero config with Chrome, or bring your own API keys.**
+[![npm version](https://img.shields.io/npm/v/pi-web-access?style=for-the-badge)](https://www.npmjs.com/package/pi-web-access)
+[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg?style=for-the-badge)](https://opensource.org/licenses/MIT)
+[![Platform](https://img.shields.io/badge/Platform-macOS%20%7C%20Linux%20%7C%20Windows*-blue?style=for-the-badge)]()
 https://github.com/user-attachments/assets/cac6a17a-1eeb-4dde-9818-cdf85d8ea98f
-```typescript
-web_search({ query: "TypeScript best practices 2025" })
-fetch_content({ url: "https://docs.example.com/guide" })
-```
+## Why Pi Web Access
+**Zero Config** — Signed into Google in Chrome? That's it. The extension reads your Chrome session cookies to access Gemini directly. No API keys, no setup, no subscriptions.
+**Video Understanding** — Point it at a YouTube video or local screen recording and ask questions about what's on screen. Full transcripts, visual descriptions, and frame extraction at exact timestamps.
+**Smart Fallbacks** — Every capability has a fallback chain. Search tries Perplexity, then Gemini API, then Gemini Web. YouTube tries Gemini Web, then API, then Perplexity. Blocked pages retry through Jina Reader and Gemini extraction. Something always works.
+**GitHub Cloning** — GitHub URLs are cloned locally instead of scraped. The agent gets real file contents and a local path to explore, not rendered HTML.
 ## Install
@@ -19,281 +28,183 @@ fetch_content({ url: "https://docs.example.com/guide" })
 pi install npm:pi-web-access
 ```
-**Zero config if you're signed into Google in Chrome.** The extension reads your Chrome session cookies to access Gemini — no API keys needed. This gives you web search, YouTube video understanding, page extraction fallbacks, and local video analysis for free.
-If you're not signed into Chrome, or want to use a different provider, add API keys to `~/.pi/web-search.json`:
-```json
-{ "geminiApiKey": "AIza..." }
-```
+If you're not signed into Chrome, or prefer a different provider, add API keys to `~/.pi/web-search.json`:
 ```json
-{ "perplexityApiKey": "pplx-..." }
+{
+  "perplexityApiKey": "pplx-...",
+  "geminiApiKey": "AIza..."
+}
 ```
-You can configure both. In `auto` mode (default), the extension tries Perplexity first (if configured), then Gemini API, then Gemini Web via Chrome cookies.
-**Requires:** Pi v0.37.3+
+You can configure one or both. In `auto` mode (default), `web_search` tries Perplexity first, then Gemini API, then Gemini Web.
-**Optional dependencies** for video frame extraction:
+Optional dependencies for video frame extraction:
 ```bash
 brew install ffmpeg   # frame extraction, video thumbnails, local video duration
-brew install yt-dlp   # YouTube frame extraction (stream URL + duration lookup)
+brew install yt-dlp   # YouTube stream URLs for frame extraction
 ```
-Without these, video content analysis (transcripts via Gemini) still works. The binaries are only needed for extracting visual frames from videos. `ffprobe` (bundled with ffmpeg) is used for local video duration lookup when sampling frames across an entire video.
-## Tools
+Without these, video content analysis (transcripts, visual descriptions via Gemini) still works. The binaries are only needed for extracting individual frames as images.
-### web_search
+Requires Pi v0.37.3+.
-Search the web via Perplexity AI or Gemini. Returns synthesized answer with source citations.
+## Quick Start
 ```typescript
-// Single query
-web_search({ query: "rust async programming" })
+// Search the web
+web_search({ query: "TypeScript best practices 2025" })
-// Multiple queries (batch)
-web_search({ queries: ["query 1", "query 2"] })
+// Fetch a page
+fetch_content({ url: "https://docs.example.com/guide" })
-// With options
-web_search({
-  query: "latest news",
-  numResults: 10,              // Default: 5, max: 20
-  recencyFilter: "week",       // day, week, month, year
-  domainFilter: ["github.com"] // Prefix with - to exclude
-})
+// Clone a GitHub repo
+fetch_content({ url: "https://github.com/owner/repo" })
-// Explicit provider
-web_search({ query: "...", provider: "gemini" })  // auto, perplexity, gemini
+// Understand a YouTube video
+fetch_content({ url: "https://youtube.com/watch?v=abc", prompt: "What libraries are shown?" })
-// Fetch full page content (async)
-web_search({ query: "...", includeContent: true })
+// Analyze a screen recording
+fetch_content({ url: "/path/to/recording.mp4", prompt: "What error appears on screen?" })
 ```
-When `includeContent: true`, sources are fetched in the background. Agent receives notification when ready.
-Provider selection in `auto` mode: Perplexity (if key configured) → Gemini API (if key configured, uses Google Search grounding) → Gemini Web (if signed into Chrome). Gemini API returns structured citations with source mappings. Gemini Web returns markdown with embedded links.
+## Tools
-### fetch_content
+### web_search
-Fetch URL(s) and extract readable content as markdown.
+Search the web via Perplexity AI or Gemini. Returns a synthesized answer with source citations.
 ```typescript
-// Single URL - returns content directly (also stored for retrieval)
-fetch_content({ url: "https://example.com/article" })
-// Multiple URLs - returns summary (content stored for retrieval)
-fetch_content({ urls: ["url1", "url2", "url3"] })
-// PDFs - extracted and saved to ~/Downloads/
-fetch_content({ url: "https://arxiv.org/pdf/1706.03762" })
-// → "PDF extracted and saved to: ~/Downloads/arxiv-170603762.md"
+web_search({ query: "rust async programming" })
+web_search({ queries: ["query 1", "query 2"] })
+web_search({ query: "latest news", numResults: 10, recencyFilter: "week" })
+web_search({ query: "...", domainFilter: ["github.com"] })
+web_search({ query: "...", provider: "gemini" })
+web_search({ query: "...", includeContent: true })
 ```
-**GitHub repos:** GitHub code URLs are automatically detected and cloned locally instead of scraping HTML. The agent gets actual file contents and a local path to explore with `read` and `bash`.
-```typescript
-// Clone a repo - returns structure + README
-fetch_content({ url: "https://github.com/owner/repo" })
-// → "Repository cloned to: /tmp/pi-github-repos/owner/repo"
-// Specific file - returns file contents
-fetch_content({ url: "https://github.com/owner/repo/blob/main/src/index.ts" })
+| Parameter | Description |
+|-----------|-------------|
+| `query` / `queries` | Single query or batch of queries |
+| `numResults` | Results per query (default: 5, max: 20) |
+| `recencyFilter` | `day`, `week`, `month`, or `year` |
+| `domainFilter` | Limit to domains (prefix with `-` to exclude) |
+| `provider` | `auto` (default), `perplexity`, or `gemini` |
+| `includeContent` | Fetch full page content from sources in background |
-// Directory - returns listing
-fetch_content({ url: "https://github.com/owner/repo/tree/main/src" })
-// Force-clone a large repo that exceeds the size threshold
-fetch_content({ url: "https://github.com/big/repo", forceClone: true })
-```
-Repos over 350MB get a lightweight API-based view instead of a full clone. Commit SHA URLs are also handled via the API. Clones are cached for the session -- multiple files from the same repo share one clone, but clones are wiped on session change/shutdown and re-cloned as needed.
+### fetch_content
-**YouTube videos:** YouTube URLs are automatically detected and processed via Gemini for full video understanding (visual + audio + transcript). Three-tier fallback:
+Fetch URL(s) and extract readable content as markdown. Automatically detects and handles GitHub repos, YouTube videos, PDFs, local video files, and regular web pages.
 ```typescript
-// Returns transcript with timestamps, visual descriptions, chapter markers
-fetch_content({ url: "https://youtube.com/watch?v=dQw4w9WgXcQ" })
-// Ask a specific question about the video
-fetch_content({ url: "https://youtube.com/watch?v=abc", prompt: "What libraries are imported?" })
+fetch_content({ url: "https://example.com/article" })
+fetch_content({ urls: ["url1", "url2", "url3"] })
+fetch_content({ url: "https://github.com/owner/repo" })
+fetch_content({ url: "https://youtube.com/watch?v=abc", prompt: "What libraries are shown?" })
+fetch_content({ url: "/path/to/recording.mp4", prompt: "What error appears on screen?" })
+fetch_content({ url: "https://youtube.com/watch?v=abc", timestamp: "23:41-25:00", frames: 4 })
 ```
-1. **Gemini Web** (primary) -- reads your Chrome session cookies. Zero config if you're signed into Google.
-2. **Gemini API** (secondary) -- uses `GEMINI_API_KEY` env var or `geminiApiKey` in config.
-3. **Perplexity** (fallback) -- topic summary when neither Gemini path is available.
-YouTube results include the video thumbnail as an image content part, so the agent receives visual context alongside the transcript.
+| Parameter | Description |
+|-----------|-------------|
+| `url` / `urls` | Single URL/path or multiple URLs |
+| `prompt` | Question to ask about a YouTube video or local video file |
+| `timestamp` | Extract frame(s) — single (`"23:41"`), range (`"23:41-25:00"`), or seconds (`"85"`) |
+| `frames` | Number of frames to extract (max 12) |
+| `forceClone` | Clone GitHub repos that exceed the 350MB size threshold |
-Handles all YouTube URL formats: `/watch?v=`, `youtu.be/`, `/shorts/`, `/live/`, `/embed/`, `/v/`, `m.youtube.com`. Playlist-only URLs fall through to normal extraction.
+### get_search_content
-**Local video files:** Pass a file path to analyze video content via Gemini. Supports MP4, MOV, WebM, AVI, and other common formats. Max 50MB (configurable).
+Retrieve stored content from previous searches or fetches. Content over 30,000 chars is truncated in tool responses but stored in full for retrieval here.
 ```typescript
-// Analyze a screen recording
-fetch_content({ url: "/path/to/recording.mp4" })
-// Ask about specific content in the video
-fetch_content({ url: "./demo.mov", prompt: "What error message appears on screen?" })
+get_search_content({ responseId: "abc123", urlIndex: 0 })
+get_search_content({ responseId: "abc123", url: "https://..." })
+get_search_content({ responseId: "abc123", query: "original query" })
 ```
-Two-tier fallback: Gemini API (needs key, proper Files API with MIME types) → Gemini Web (free, needs Chrome login). File paths are detected by prefix (`/`, `./`, `../`, `file://`). If ffmpeg is installed, a frame from the video is included as a thumbnail image alongside the analysis.
-**Video frame extraction (YouTube + local):** Use `timestamp` and/or `frames` to pull visuals for scanning.
-```typescript
-// Single frame at an exact time
-fetch_content({ url: "https://youtube.com/watch?v=abc", timestamp: "23:41" })
+## Capabilities
-// Range scan (default 6 frames)
-fetch_content({ url: "https://youtube.com/watch?v=abc", timestamp: "23:41-25:00" })
+### GitHub repos
-// Custom density across a range
-fetch_content({ url: "https://youtube.com/watch?v=abc", timestamp: "23:41-25:00", frames: 3 })
+GitHub URLs are cloned locally instead of scraped. The agent gets real file contents and a local path to explore with `read` and `bash`. Root URLs return the repo tree + README, `/tree/` paths return directory listings, `/blob/` paths return file contents.
-// N frames at 5s intervals starting from a single timestamp
-fetch_content({ url: "https://youtube.com/watch?v=abc", timestamp: "23:41", frames: 5 })
+Repos over 350MB get a lightweight API-based view instead of a full clone (override with `forceClone: true`). Commit SHA URLs are handled via the API. Clones are cached for the session and wiped on session change. Private repos require the `gh` CLI.
-// Whole-video sampling (no timestamp)
-fetch_content({ url: "https://youtube.com/watch?v=abc", frames: 6 })
-```
+### YouTube videos
-The same `timestamp`/`frames` syntax works with local file paths (e.g. `/path/to/video.mp4`).
+YouTube URLs are processed via Gemini for full video understanding — visual descriptions, transcripts with timestamps, and chapter markers. Pass a `prompt` to ask specific questions about the video. Results include the video thumbnail so the agent gets visual context alongside the transcript.
-Requirements: YouTube frame extraction needs `yt-dlp` + `ffmpeg`. Local video frames need `ffmpeg` (and `ffprobe`, bundled with ffmpeg, for whole-video sampling).
+Fallback: Gemini Web → Gemini API → Perplexity (text summary only). Handles all URL formats: `/watch?v=`, `youtu.be/`, `/shorts/`, `/live/`, `/embed/`, `/v/`.
-Common errors include missing binaries, private/age-restricted videos, region blocks, live streams, expired stream URLs (403), and timestamps beyond the video duration.
+### Local video files
-**Gemini extraction fallback:** When Readability fails or a site blocks bot traffic (403, 429), the extension automatically retries via Gemini URL Context (API) or Gemini Web. This handles SPAs, JS-heavy pages, and anti-bot protections that the HTTP pipeline can't.
+Pass a file path (`/`, `./`, `../`, or `file://` prefix) to analyze video content via Gemini. Supports MP4, MOV, WebM, AVI, and other common formats up to 50MB. Pass a `prompt` to ask about specific content. If ffmpeg is installed, a thumbnail frame is included alongside the analysis.
-**PDF handling:** When fetching a PDF URL, the extension extracts text and saves it as a markdown file in `~/Downloads/`. The agent can then use `read` to access specific sections without loading 200K+ chars into context.
+Fallback: Gemini API (Files API upload) → Gemini Web.
-### get_search_content
+### Video frame extraction
-Retrieve stored content from previous searches or fetches.
+Use `timestamp` and/or `frames` on any YouTube URL or local video file to extract visual frames as images.
 ```typescript
-// By response ID (from web_search or fetch_content)
-get_search_content({ responseId: "abc123", urlIndex: 0 })
-// By URL
-get_search_content({ responseId: "abc123", url: "https://..." })
-// By query (for search results)
-get_search_content({ responseId: "abc123", query: "original query" })
+fetch_content({ url: "...", timestamp: "23:41" })                       // single frame
+fetch_content({ url: "...", timestamp: "23:41-25:00" })                 // range, 6 frames
+fetch_content({ url: "...", timestamp: "23:41-25:00", frames: 3 })      // range, custom count
+fetch_content({ url: "...", timestamp: "23:41", frames: 5 })            // 5 frames at 5s intervals
+fetch_content({ url: "...", frames: 6 })                                // sample whole video
 ```
-## Features
+Requires `ffmpeg` (and `yt-dlp` for YouTube). Timestamps accept `H:MM:SS`, `MM:SS`, or bare seconds.
-### Activity Monitor (Ctrl+Shift+W)
+### PDFs
-Toggle live request/response activity:
+PDF URLs are extracted as text and saved to `~/Downloads/` as markdown. The agent can then `read` specific sections without loading the full document into context. Text-based extraction only — no OCR.
-```
-─── Web Search Activity ────────────────────────────────────
-  API  "typescript best practices"     200    2.1s ✓
-  GET  docs.example.com/article        200    0.8s ✓
-  GET  blog.example.com/post           404    0.3s ✗
-  GET  news.example.com/latest         ...    1.2s ⋯
-────────────────────────────────────────────────────────────
-Rate: 3/10 (resets in 42s)
-```
-### RSC Content Extraction
+### Blocked pages
-Next.js App Router pages embed content as RSC (React Server Components) flight data in script tags. When Readability fails, the extension parses these JSON payloads directly, reconstructing markdown with headings, tables, code blocks, and links.
+When Readability fails or returns only a cookie notice, the extension retries via Jina Reader (handles JS rendering server-side, no API key needed), then Gemini URL Context API, then Gemini Web extraction. Handles SPAs, JS-heavy pages, and anti-bot protections transparently. Also parses Next.js RSC flight data when present.
-### TUI Rendering
-Tool calls render with real-time progress:
+## How It Works
 ```
-┌─ search "TypeScript best practices 2025" ─────────────────────────┐
-│ [████████░░] searching                                            │
-└───────────────────────────────────────────────────────────────────┘
+fetch_content(url)
+  → Video file?  Gemini API (Files API) → Gemini Web
+  → GitHub URL?  Clone repo, return file contents + local path
+  → YouTube URL? Gemini Web → Gemini API → Perplexity
+  → HTTP fetch → PDF? Extract text, save to ~/Downloads/
+               → HTML? Readability → RSC parser → Jina Reader → Gemini fallback
+               → Text/JSON/Markdown? Return directly
 ```
 ## Skills
-Skills are bundled with the extension and available automatically after install -- no extra setup needed.
 ### librarian
-Structured research workflow for open-source libraries with evidence-backed answers and GitHub permalinks. Loaded automatically when the task involves understanding library internals, finding implementation details, or tracing code history.
-Combines `fetch_content` (GitHub cloning), `web_search` (recent info), and git operations (blame, log, show). Pi auto-detects when to load it based on your prompt. If you have [pi-skill-palette](https://github.com/nicobailon/pi-skill-palette) installed, you can also load it explicitly via `/skill:librarian`.
+Bundled research workflow for investigating open-source libraries. Combines GitHub cloning, web search, and git operations (blame, log, show) to produce evidence-backed answers with permalinks. Pi loads it automatically based on your prompt. Also available via `/skill:librarian` with [pi-skill-palette](https://github.com/nicobailon/pi-skill-palette).
 ## Commands
 ### /search
-Browse stored search results interactively.
-## How It Works
-### fetch_content routing
+Browse stored search results interactively. Lists all results from the current session with their response IDs for easy retrieval.
-```
-fetch_content(url_or_path, prompt?)
-       │
-       ├── Local video file? ──→ Gemini API → Gemini Web
-       │                              ↓
-       │                         Video analysis (prompt forwarded)
-       │
-       ├── github.com code URL? ──→ Clone repo (gh/git --depth 1)
-       │                                    │
-       │                            ┌───────┼───────┐
-       │                            ↓       ↓       ↓
-       │                          root    tree     blob
-       │                            ↓       ↓       ↓
-       │                         tree +   dir     file
-       │                         README  listing  contents
-       │                            │       │       │
-       │                            └───────┼───────┘
-       │                                    ↓
-       │                           Return content + local
-       │                           path for read/bash
-       │
-       ├── YouTube URL? ──→ Gemini Web → Gemini API → Perplexity
-       │                         ↓              (prompt forwarded)
-       │                    Transcript + visual descriptions
-       │
-       ├── PDF? ──→ unpdf → Save to ~/Downloads/
-       │
-       ├── Plain text/markdown/JSON? ──→ Return directly
-       │
-       └── HTML ──→ Readability → Markdown
-                         │
-                    [if fails]
-                         ↓
-                    RSC Parser → Markdown
-                         │
-                    [if all fail]
-                         ↓
-                    Gemini URL Context → Gemini Web extraction
-```
+## Activity Monitor
-### web_search routing
+Toggle with **Ctrl+Shift+W** to see live request/response activity:
 ```
-web_search(query, provider?)
-       │
-       ├── provider = "perplexity" ──→ Perplexity API
-       ├── provider = "gemini"     ──→ Gemini API → Gemini Web
-       └── provider = "auto"
-              ├── Perplexity key? ──→ Perplexity API
-              ├── Gemini API key? ──→ Gemini API (grounded search)
-              ├── Chrome cookies? ──→ Gemini Web (grounded search)
-              └── Error
+─── Web Search Activity ────────────────────────────────────
+  API  "typescript best practices"     200    2.1s ✓
+  GET  docs.example.com/article        200    0.8s ✓
+  GET  blog.example.com/post           404    0.3s ✗
+────────────────────────────────────────────────────────────
 ```
-When `includeContent: true`, sources are fetched in the background using the fetch_content routing above, and the agent receives a notification when ready.
 ## Configuration
-All config lives in `~/.pi/web-search.json`:
+All config lives in `~/.pi/web-search.json`. Every field is optional.
 ```json
 {
@@ -308,61 +219,51 @@ All config lives in `~/.pi/web-search.json`:
   },
   "youtube": {
     "enabled": true,
-    "preferredModel": "gemini-2.5-flash"
+    "preferredModel": "gemini-3-flash-preview"
   },
   "video": {
     "enabled": true,
-    "preferredModel": "gemini-2.5-flash",
+    "preferredModel": "gemini-3-flash-preview",
     "maxSizeMB": 50
   }
 }
 ```
-All fields are optional. `GEMINI_API_KEY` and `PERPLEXITY_API_KEY` env vars take precedence over config file values. Set `"enabled": false` under `githubClone`, `youtube`, or `video` to disable those features.
+`GEMINI_API_KEY` and `PERPLEXITY_API_KEY` env vars take precedence over config file values. `searchProvider` sets the `web_search` default: `"auto"`, `"perplexity"`, or `"gemini"`. Set `"enabled": false` under any feature to disable it. Config changes require a Pi restart.
-`searchProvider` controls `web_search` default: `"auto"` (Perplexity → Gemini API → Gemini Web), `"perplexity"`, or `"gemini"` (API → Web).
+Rate limits: Perplexity is capped at 10 requests/minute (client-side). Content fetches run 3 concurrent with a 30s timeout per URL.
-## Rate Limits
+## Limitations
-- **Perplexity API**: 10 requests/minute (enforced client-side)
-- **Content Fetch**: 3 concurrent requests, 30s timeout per URL
-- **Cache TTL**: 1 hour
+- Chrome cookie extraction is macOS-only — other platforms fall through to API keys. First-time access may trigger a Keychain dialog.
+- YouTube private/age-restricted videos may fail on all extraction paths.
+- Gemini can process videos up to ~1 hour; longer videos may be truncated.
+- PDFs are text-extracted only (no OCR for scanned documents).
+- GitHub branch names with slashes may misresolve file paths; the clone still works and the agent can navigate manually.
+- Non-code GitHub URLs (issues, PRs, wiki) fall through to normal web extraction.
-## Files
+<details>
+<summary>Files</summary>
 | File | Purpose |
 |------|---------|
 | `index.ts` | Extension entry, tool definitions, commands, widget |
-| `perplexity.ts` | Perplexity API client, rate limiting |
-| `gemini-search.ts` | Gemini search providers (Web + API with grounding), search routing |
-| `extract.ts` | URL/file path routing, HTTP extraction, Gemini fallback orchestration |
+| `extract.ts` | URL/file path routing, HTTP extraction, fallback orchestration |
+| `gemini-search.ts` | Search routing across Perplexity, Gemini API, Gemini Web |
 | `gemini-url-context.ts` | Gemini URL Context + Web extraction fallbacks |
-| `video-extract.ts` | Local video file detection, upload, Gemini Web/API analysis |
-| `youtube-extract.ts` | YouTube URL detection, three-tier extraction orchestrator |
-| `chrome-cookies.ts` | macOS Chrome cookie extraction (Keychain + SQLite) |
 | `gemini-web.ts` | Gemini Web client (cookie auth, StreamGenerate) |
-| `gemini-api.ts` | Gemini REST API client (generateContent, file upload) |
-| `utils.ts` | Shared formatting (`formatSeconds`) and error helpers for frame extraction |
-| `github-extract.ts` | GitHub URL parser, clone cache, content generation |
-| `github-api.ts` | GitHub API fallback for oversized repos and commit SHAs |
+| `gemini-api.ts` | Gemini REST API client (generateContent) |
+| `chrome-cookies.ts` | macOS Chrome cookie extraction (Keychain + SQLite) |
+| `youtube-extract.ts` | YouTube detection, three-tier extraction, frame extraction |
+| `video-extract.ts` | Local video detection, Files API upload, Gemini analysis |
+| `github-extract.ts` | GitHub URL parsing, clone cache, content generation |
+| `github-api.ts` | GitHub API fallback for large repos and commit SHAs |
+| `perplexity.ts` | Perplexity API client with rate limiting |
 | `pdf-extract.ts` | PDF text extraction, saves to markdown |
 | `rsc-extract.ts` | RSC flight data parser for Next.js pages |
+| `utils.ts` | Shared formatting and error helpers |
 | `storage.ts` | Session-aware result storage |
-| `activity.ts` | Activity tracking for observability widget |
-| `skills/librarian/` | Bundled skill for library research with permalinks |
-## Limitations
+| `activity.ts` | Activity tracking for the observability widget |
+| `skills/librarian/` | Bundled skill for library research |
-- Content extraction works best on article-style pages; JS-heavy sites fall back to Gemini extraction when available
-- Gemini extraction fallback requires either a Gemini API key or Chrome login to Google
-- PDFs are extracted as text (no OCR for scanned documents)
-- Max response size: 20MB for PDFs, 5MB for HTML
-- Max inline content: 30,000 chars per URL (larger content stored for retrieval via get_search_content)
-- GitHub cloning requires `gh` CLI for private repos (public repos fall back to `git clone`)
-- GitHub branch names with slashes (e.g. `feature/foo`) may resolve the wrong file path; the clone still succeeds and the agent can navigate manually
-- Non-code GitHub URLs (issues, PRs, wiki, etc.) fall through to normal Readability extraction
-- YouTube extraction via Gemini Web requires macOS (Chrome cookie decryption is OS-specific); other platforms fall through to Gemini API or Perplexity
-- YouTube private/age-restricted videos may fail on all paths
-- Gemini can process videos up to ~1 hour at default resolution; longer videos may be truncated
-- First-time Chrome cookie access may trigger a macOS Keychain permission dialog
-- Requires Pi restart after config file changes
+</details>

package/extract.ts CHANGED Viewed

@@ -15,6 +15,7 @@ const DEFAULT_TIMEOUT_MS = 30000;
 const CONCURRENT_LIMIT = 3;
 const NON_RECOVERABLE_ERRORS = ["Unsupported content type", "Response too large"];
+const MIN_USEFUL_CONTENT = 500;
 const turndown = new TurndownService({
 	headingStyle: "atx",
@@ -48,13 +49,72 @@ export interface ExtractOptions {
 	prompt?: string;
 	timestamp?: string;
 	frames?: number;
+	model?: string;
+}
+const JINA_READER_BASE = "https://r.jina.ai/";
+const JINA_TIMEOUT_MS = 30000;
+async function extractWithJinaReader(
+	url: string,
+	signal?: AbortSignal,
+): Promise<ExtractedContent | null> {
+	const jinaUrl = JINA_READER_BASE + url;
+	const activityId = activityMonitor.logStart({ type: "api", query: `jina: ${url}` });
+	try {
+		const res = await fetch(jinaUrl, {
+			headers: {
+				"Accept": "text/markdown",
+				"X-No-Cache": "true",
+			},
+			signal: AbortSignal.any([
+				AbortSignal.timeout(JINA_TIMEOUT_MS),
+				...(signal ? [signal] : []),
+			]),
+		});
+		if (!res.ok) {
+			activityMonitor.logComplete(activityId, res.status);
+			return null;
+		}
+		const content = await res.text();
+		activityMonitor.logComplete(activityId, res.status);
+		const contentStart = content.indexOf("Markdown Content:");
+		if (contentStart < 0) {
+			return null;
+		}
+		const markdownPart = content.slice(contentStart + 17).trim(); // 17 = "Markdown Content:".length
+		// Check for failed JS rendering or minimal content
+		if (markdownPart.length < 100 ||
+			markdownPart.startsWith("Loading...") ||
+			markdownPart.startsWith("Please enable JavaScript")) {
+			return null;
+		}
+		const title = extractHeadingTitle(markdownPart) ?? (new URL(url).pathname.split("/").pop() || url);
+		return { url, title, content: markdownPart, error: null };
+	} catch (err) {
+		const message = err instanceof Error ? err.message : String(err);
+		if (message.toLowerCase().includes("abort")) {
+			activityMonitor.logComplete(activityId, 0);
+		} else {
+			activityMonitor.logError(activityId, message);
+		}
+		return null;
+	}
 }
 function parseTimestamp(ts: string): number | null {
 	const num = Number(ts);
 	if (!isNaN(num) && num >= 0) return Math.floor(num);
 	const parts = ts.split(":").map(Number);
-	if (parts.some(isNaN)) return null;
+	if (parts.some(p => isNaN(p) || p < 0)) return null;
 	if (parts.length === 3) return Math.floor(parts[0] * 3600 + parts[1] * 60 + parts[2]);
 	if (parts.length === 2) return Math.floor(parts[0] * 60 + parts[1]);
 	return null;
@@ -269,7 +329,7 @@ export async function extractContent(
 	const ytInfo = isYouTubeURL(url);
 	if (ytInfo.isYouTube && isYouTubeEnabled()) {
 		try {
-			const ytResult = await extractYouTube(url, signal, options?.prompt);
+			const ytResult = await extractYouTube(url, signal, options?.prompt, options?.model);
 			if (ytResult) return ytResult;
 		} catch {}
 		return {
@@ -285,10 +345,45 @@ export async function extractContent(
 	if (!httpResult.error || signal?.aborted) return httpResult;
 	if (NON_RECOVERABLE_ERRORS.some(prefix => httpResult.error!.startsWith(prefix))) return httpResult;
+	const jinaResult = await extractWithJinaReader(url, signal);
+	if (jinaResult) return jinaResult;
 	const geminiResult = await extractWithUrlContext(url, signal)
 		?? await extractWithGeminiWeb(url, signal);
-	return geminiResult ?? httpResult;
+	if (geminiResult) return geminiResult;
+	const guidance = [
+		httpResult.error,
+		"",
+		"Fallback options:",
+		"  \u2022 Set GEMINI_API_KEY in ~/.pi/web-search.json",
+		"  \u2022 Sign into gemini.google.com in Chrome",
+		"  \u2022 Use web_search to find content about this topic",
+	].join("\n");
+	return { ...httpResult, error: guidance };
+}
+function isLikelyJSRendered(html: string): boolean {
+	// Extract body content
+	const bodyMatch = html.match(/<body[^>]*>([\s\S]*?)<\/body>/i);
+	if (!bodyMatch) return false;
+	const bodyHtml = bodyMatch[1];
+	// Strip tags to get text content
+	const textContent = bodyHtml
+		.replace(/<script[\s\S]*?<\/script>/gi, "")
+		.replace(/<style[\s\S]*?<\/style>/gi, "")
+		.replace(/<[^>]+>/g, "")
+		.replace(/\s+/g, " ")
+		.trim();
+	// Count scripts
+	const scriptCount = (html.match(/<script/gi) || []).length;
+	// Heuristic: little text content but many scripts suggests JS rendering
+	return textContent.length < 500 && scriptCount > 3;
 }
 async function extractViaHttp(
@@ -309,8 +404,15 @@ async function extractViaHttp(
 		const response = await fetch(url, {
 			signal: controller.signal,
 			headers: {
-				"User-Agent": "Mozilla/5.0 (compatible; pi-agent/1.0)",
-				Accept: "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8",
+				"User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.0.0 Safari/537.36",
+				"Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8",
+				"Accept-Language": "en-US,en;q=0.9",
+				"Cache-Control": "no-cache",
+				"Sec-Fetch-Dest": "document",
+				"Sec-Fetch-Mode": "navigate",
+				"Sec-Fetch-Site": "none",
+				"Sec-Fetch-User": "?1",
+				"Upgrade-Insecure-Requests": "1",
 			},
 		});
@@ -394,16 +496,35 @@ async function extractViaHttp(
 			}
 			activityMonitor.logComplete(activityId, response.status);
+			// Provide more specific error message
+			const jsRendered = isLikelyJSRendered(text);
+			const errorMsg = jsRendered
+				? "Page appears to be JavaScript-rendered (content loads dynamically)"
+				: "Could not extract readable content from HTML structure";
 			return {
 				url,
 				title: "",
 				content: "",
-				error: "Could not extract readable content",
+				error: errorMsg,
 			};
 		}
 		const markdown = turndown.turndown(article.content);
 		activityMonitor.logComplete(activityId, response.status);
+		if (markdown.length < MIN_USEFUL_CONTENT) {
+			return {
+				url,
+				title: article.title || "",
+				content: markdown,
+				error: isLikelyJSRendered(text)
+					? "Page appears to be JavaScript-rendered (content loads dynamically)"
+					: "Extracted content appears incomplete",
+			};
+		}
 		return { url, title: article.title || "", content: markdown, error: null };
 	} catch (err) {
 		const message = err instanceof Error ? err.message : String(err);

package/gemini-api.ts CHANGED Viewed

@@ -4,7 +4,7 @@ import { join } from "node:path";
 export const API_BASE = "https://generativelanguage.googleapis.com/v1beta";
 const CONFIG_PATH = join(homedir(), ".pi", "web-search.json");
-export const DEFAULT_MODEL = "gemini-2.5-flash";
+export const DEFAULT_MODEL = "gemini-3-flash-preview";
 interface GeminiApiConfig {
 	geminiApiKey?: string;

package/gemini-search.ts CHANGED Viewed

@@ -123,7 +123,7 @@ async function searchWithGeminiWeb(query: string, options: SearchOptions = {}):
 	try {
 		const text = await queryWithCookies(prompt, cookies, {
-			model: "gemini-2.5-flash",
+			model: "gemini-3-flash-preview",
 			signal: options.signal,
 			timeoutMs: 60000,
 		});

package/gemini-url-context.ts CHANGED Viewed

@@ -80,7 +80,7 @@ export async function extractWithGeminiWeb(
 	try {
 		const text = await queryWithCookies(EXTRACTION_PROMPT + url, cookies, {
-			model: "gemini-2.5-flash",
+			model: "gemini-3-flash-preview",
 			signal,
 			timeoutMs: 60000,
 		});

package/index.ts CHANGED Viewed

@@ -420,6 +420,9 @@ export default function (pi: ExtensionAPI) {
 				maximum: 12,
 				description: "Number of frames to extract. Use with timestamp range for custom density, with single timestamp to get N frames at 5s intervals, or alone to sample across the entire video. Requires yt-dlp + ffmpeg for YouTube, ffmpeg for local video.",
 			})),
+			model: Type.Optional(Type.String({
+				description: "Override the Gemini model for video/YouTube analysis (e.g. 'gemini-2.5-flash', 'gemini-3-flash-preview'). Defaults to config or gemini-3-flash-preview.",
+			})),
 		}),
 		async execute(_toolCallId, params, signal, onUpdate, _ctx) {
@@ -441,6 +444,7 @@ export default function (pi: ExtensionAPI) {
 				prompt: params.prompt,
 				timestamp: params.timestamp,
 				frames: params.frames,
+				model: params.model,
 			});
 			const successful = fetchResults.filter((r) => !r.error).length;
 			const totalChars = fetchResults.reduce((sum, r) => sum + r.content.length, 0);
@@ -527,7 +531,7 @@ export default function (pi: ExtensionAPI) {
 		},
 		renderCall(args, theme) {
-			const { url, urls, prompt, timestamp, frames } = args as { url?: string; urls?: string[]; prompt?: string; timestamp?: string; frames?: number };
+			const { url, urls, prompt, timestamp, frames, model } = args as { url?: string; urls?: string[]; prompt?: string; timestamp?: string; frames?: number; model?: string };
 			const urlList = urls ?? (url ? [url] : []);
 			if (urlList.length === 0) {
 				return new Text(theme.fg("toolTitle", theme.bold("fetch ")) + theme.fg("error", "(no URL)"), 0, 0);
@@ -556,6 +560,9 @@ export default function (pi: ExtensionAPI) {
 				const display = prompt.length > 250 ? prompt.slice(0, 247) + "..." : prompt;
 				lines.push(theme.fg("dim", "  prompt: ") + theme.fg("muted", `"${display}"`));
 			}
+			if (model) {
+				lines.push(theme.fg("dim", "  model: ") + theme.fg("warning", model));
+			}
 			return new Text(lines.join("\n"), 0, 0);
 		},
@@ -603,8 +610,10 @@ export default function (pi: ExtensionAPI) {
 				if (typeof details?.duration === "number") {
 					statusLine += theme.fg("muted", ` | ${formatSeconds(Math.floor(details.duration))} total`);
 				}
+				const textContent = result.content.find((c) => c.type === "text")?.text || "";
 				if (!expanded) {
-					return new Text(statusLine, 0, 0);
+					const brief = textContent.length > 200 ? textContent.slice(0, 200) + "..." : textContent;
+					return new Text(statusLine + "\n" + theme.fg("dim", brief), 0, 0);
 				}
 				const lines = [statusLine];
 				if (details?.prompt) {
@@ -617,7 +626,6 @@ export default function (pi: ExtensionAPI) {
 				if (typeof details?.frames === "number") {
 					lines.push(theme.fg("dim", `  frames: ${details.frames}`));
 				}
-				const textContent = result.content.find((c) => c.type === "text")?.text || "";
 				const preview = textContent.length > 500 ? textContent.slice(0, 500) + "..." : textContent;
 				lines.push(theme.fg("dim", preview));
 				return new Text(lines.join("\n"), 0, 0);

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "pi-web-access",
-  "version": "0.7.1",
+  "version": "0.7.3",
   "description": "Web search, URL fetching, GitHub repo cloning, PDF extraction, YouTube video understanding, and local video analysis for Pi coding agent",
   "type": "module",
   "keywords": [

package/perplexity.ts CHANGED Viewed

@@ -56,7 +56,7 @@ function loadConfig(): WebSearchConfig {
 function getApiKey(): string {
 	const config = loadConfig();
-	const key = config.perplexityApiKey || process.env.PERPLEXITY_API_KEY;
+	const key = process.env.PERPLEXITY_API_KEY || config.perplexityApiKey;
 	if (!key) {
 		throw new Error(
 			"Perplexity API key not found. Either:\n" +
@@ -93,7 +93,7 @@ function validateDomainFilter(domains: string[]): string[] {
 export function isPerplexityAvailable(): boolean {
 	const config = loadConfig();
-	return Boolean(config.perplexityApiKey || process.env.PERPLEXITY_API_KEY);
+	return Boolean(process.env.PERPLEXITY_API_KEY || config.perplexityApiKey);
 }
 export async function searchWithPerplexity(query: string, options: SearchOptions = {}): Promise<SearchResponse> {

package/video-extract.ts CHANGED Viewed

@@ -46,7 +46,7 @@ interface VideoConfig {
 const VIDEO_CONFIG_DEFAULTS: VideoConfig = {
 	enabled: true,
-	preferredModel: "gemini-2.5-flash",
+	preferredModel: "gemini-3-flash-preview",
 	maxSizeMB: 50,
 };
@@ -123,11 +123,12 @@ export async function extractVideo(
 ): Promise<ExtractedContent | null> {
 	const config = loadVideoConfig();
 	const effectivePrompt = options?.prompt ?? DEFAULT_VIDEO_PROMPT;
+	const effectiveModel = options?.model ?? config.preferredModel;
 	const displayName = basename(info.absolutePath);
 	const activityId = activityMonitor.logStart({ type: "fetch", url: `video:${displayName}` });
-	const result = await tryVideoGeminiApi(info, effectivePrompt, config, signal)
-		?? await tryVideoGeminiWeb(info, effectivePrompt, config, signal);
+	const result = await tryVideoGeminiApi(info, effectivePrompt, effectiveModel, signal)
+		?? await tryVideoGeminiWeb(info, effectivePrompt, effectiveModel, signal);
 	if (result) {
 		const thumbnail = await extractVideoFrame(info.absolutePath);
@@ -183,7 +184,7 @@ export async function getLocalVideoDuration(filePath: string): Promise<number |
 async function tryVideoGeminiWeb(
 	info: VideoFileInfo,
 	prompt: string,
-	config: VideoConfig,
+	model: string,
 	signal?: AbortSignal,
 ): Promise<ExtractedContent | null> {
 	try {
@@ -193,7 +194,7 @@ async function tryVideoGeminiWeb(
 		const text = await queryWithCookies(prompt, cookies, {
 			files: [info.absolutePath],
-			model: config.preferredModel,
+			model,
 			signal,
 			timeoutMs: 180000,
 		});
@@ -212,7 +213,7 @@ async function tryVideoGeminiWeb(
 async function tryVideoGeminiApi(
 	info: VideoFileInfo,
 	prompt: string,
-	config: VideoConfig,
+	model: string,
 	signal?: AbortSignal,
 ): Promise<ExtractedContent | null> {
 	const apiKey = getApiKey();
@@ -227,7 +228,7 @@ async function tryVideoGeminiApi(
 		await pollFileState(fileName, apiKey, signal, 120000);
 		const text = await queryGeminiApiWithVideo(prompt, uploaded.uri, {
-			model: config.preferredModel,
+			model,
 			mimeType: info.mimeType,
 			signal,
 			timeoutMs: 120000,

package/youtube-extract.ts CHANGED Viewed

@@ -26,7 +26,7 @@ interface YouTubeConfig {
 	preferredModel: string;
 }
-const defaults: YouTubeConfig = { enabled: true, preferredModel: "gemini-2.5-flash" };
+const defaults: YouTubeConfig = { enabled: true, preferredModel: "gemini-3-flash-preview" };
 let cachedConfig: YouTubeConfig | null = null;
 function loadYouTubeConfig(): YouTubeConfig {
@@ -69,6 +69,7 @@ export async function extractYouTube(
 	url: string,
 	signal?: AbortSignal,
 	prompt?: string,
+	model?: string,
 ): Promise<ExtractedContent | null> {
 	const config = loadYouTubeConfig();
 	const { videoId } = isYouTubeURL(url);
@@ -76,11 +77,12 @@ export async function extractYouTube(
 		? `https://www.youtube.com/watch?v=${videoId}`
 		: url;
 	const effectivePrompt = prompt ?? YOUTUBE_PROMPT;
+	const effectiveModel = model ?? config.preferredModel;
 	const activityId = activityMonitor.logStart({ type: "fetch", url: `youtube.com/${videoId ?? "video"}` });
-	const result = await tryGeminiWeb(canonicalUrl, effectivePrompt, config, signal)
-		?? await tryGeminiApi(canonicalUrl, effectivePrompt, config, signal)
+	const result = await tryGeminiWeb(canonicalUrl, effectivePrompt, effectiveModel, signal)
+		?? await tryGeminiApi(canonicalUrl, effectivePrompt, effectiveModel, signal)
 		?? await tryPerplexity(url, effectivePrompt, signal);
 	if (result) {
@@ -190,7 +192,7 @@ export async function fetchYouTubeThumbnail(videoId: string): Promise<{ data: st
 async function tryGeminiWeb(
 	url: string,
 	prompt: string,
-	config: YouTubeConfig,
+	model: string,
 	signal?: AbortSignal,
 ): Promise<ExtractedContent | null> {
 	try {
@@ -201,7 +203,7 @@ async function tryGeminiWeb(
 		const text = await queryWithCookies(prompt, cookies, {
 			youtubeUrl: url,
-			model: config.preferredModel,
+			model,
 			signal,
 			timeoutMs: 120000,
 		});
@@ -220,7 +222,7 @@ async function tryGeminiWeb(
 async function tryGeminiApi(
 	url: string,
 	prompt: string,
-	config: YouTubeConfig,
+	model: string,
 	signal?: AbortSignal,
 ): Promise<ExtractedContent | null> {
 	try {
@@ -229,7 +231,7 @@ async function tryGeminiApi(
 		if (signal?.aborted) return null;
 		const text = await queryGeminiApiWithVideo(prompt, url, {
-			model: config.preferredModel,
+			model,
 			signal,
 			timeoutMs: 120000,
 		});