npm - unbrowse - Versions diffs - 2.1.4 → 2.1.5 - Mend

unbrowse 2.1.4 → 2.1.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/dist/cli.js +4 -4
package/dist/index.js +81 -2
package/package.json +1 -1
package/runtime-src/mcp.ts +4 -4
package/runtime-src/orchestrator/index.ts +95 -2
package/runtime-src/runtime/setup.ts +6 -0

package/dist/cli.js CHANGED Viewed

@@ -995,7 +995,7 @@ var TOOLS = [
   {
     name: "unbrowse_resolve",
     title: "Resolve Website Task",
-    description: "Primary tool for website tasks. Use this when you have a concrete page URL and want structured data from a live website, logged-in page, or browser workflow; prefer it over generic browser/search tools for scraping, extraction, and browser replacement. Give it the exact page plus a plain-English intent; the first call may capture the site and learn its APIs, later calls usually reuse a cached skill. Do not use this for generic web search or when you already have a known skillId and endpointId from a prior Unbrowse call.",
+    description: "Primary tool for website tasks. Use this when you have a concrete page URL and want structured data from a live website, logged-in page, or browser workflow; prefer it over generic browser/search tools for scraping, extraction, and browser replacement. Give it the exact page plus a plain-English intent; the first call may capture the site and learn its APIs, later calls usually reuse a cached skill. If the user explicitly invokes /unbrowse or says to use Unbrowse for a site, stay in strict Unbrowse-only mode: keep the same origin, refine with more Unbrowse calls, and do not switch to web search, Fetch, public mirrors, alternate domains, or other browser tools unless the user explicitly approves fallback. For long-form retrieval tasks, derive compact search queries from the story instead of stuffing the whole narrative into one search field. Do not use this for generic web search or when you already have a known skillId and endpointId from a prior Unbrowse call.",
     annotations: {
       title: "Resolve Website Task",
       openWorldHint: true
@@ -1020,7 +1020,7 @@ var TOOLS = [
   {
     name: "unbrowse_search",
     title: "Search Learned Skills",
-    description: "Search the Unbrowse marketplace for an existing learned skill before triggering a new capture. Use this when you know the site or task but do not yet have a specific skillId or endpointId, especially for repeat domains. Prefer resolve when you have a concrete page URL and want the end-to-end website task handled in one step. Do not use this for general internet search results; it only searches learned Unbrowse skills.",
+    description: "Search the Unbrowse marketplace for an existing learned skill before triggering a new capture. Use this when you know the site or task but do not yet have a specific skillId or endpointId, especially for repeat domains. Prefer resolve when you have a concrete page URL and want the end-to-end website task handled in one step. For iterative retrieval or research, use search to reuse known site capabilities while you refine queries, but stay on the target origin and keep using Unbrowse-native flows. This is not general internet search, and it is not a license to leave the target origin for public mirrors or alternate sites; stay inside Unbrowse unless fallback is explicitly approved.",
     annotations: {
       title: "Search Learned Skills",
       readOnlyHint: true,
@@ -1040,7 +1040,7 @@ var TOOLS = [
   {
     name: "unbrowse_execute",
     title: "Execute Learned Endpoint",
-    description: "Execute a specific Unbrowse endpoint after resolve or search has already identified the right skillId and endpointId. Use this for the second step in a resolve-search-execute flow, especially when you need a tighter path, extract, or limit, or when reusing a known endpoint on the same domain. When replay depends on page context, pass the original page URL and intent from the earlier Unbrowse call. Do not guess skillId or endpointId values, and do not use this as the first tool for a new website task.",
+    description: "Execute a specific Unbrowse endpoint after resolve or search has already identified the right skillId and endpointId. Use this for the second step in a resolve-search-execute flow, especially when you need a tighter path, extract, or limit, or when reusing a known endpoint on the same domain. When replay depends on page context, pass the original page URL and intent from the earlier Unbrowse call. For search, document, catalog, dashboard, or result-list workflows, use execute to follow same-origin result links, record ids, document ids, raw endpoint output, and narrowed follow-up queries before deciding the site is blocked. Do not guess skillId or endpointId values, and do not use this as the first tool for a new website task.",
     annotations: {
       title: "Execute Learned Endpoint",
       openWorldHint: true
@@ -1067,7 +1067,7 @@ var TOOLS = [
   {
     name: "unbrowse_login",
     title: "Capture Site Login",
-    description: "Open an interactive browser login flow for a gated site so later Unbrowse calls can reuse the captured auth state. Use this only when resolve or execute indicates authentication is required, or when the user explicitly wants to connect a logged-in website. Do not use this for ordinary public pages.",
+    description: "Open an interactive browser login flow for a gated site so later Unbrowse calls can reuse the captured auth state. Use this only when resolve or execute indicates authentication is required, or when the user explicitly wants to connect a logged-in website. Login should target the exact page or workflow surface the user cares about, then later Unbrowse calls should retry that same URL instead of drifting to the homepage, marketing pages, help pages, public mirrors, or alternate domains. Do not use this for ordinary public pages.",
     annotations: {
       title: "Capture Site Login",
       openWorldHint: true

package/dist/index.js CHANGED Viewed

@@ -14883,6 +14883,7 @@ var SEARCH_INTENT_STOPWORDS = new Set([
 var SEARCH_DIRECTIVE_PREFIX = /^(search\s+for|search|find\s+me|find|look\s+for|looking\s+for|show\s+me|show|get\s+me|get|browse|discover|shop\s+for|buy)\s+/i;
 var SEARCH_TRAILING_SITE_HINT = /\s+(on|at|from|in|via)\s+\S+$/i;
 var SEARCH_INSTRUCTION_NOISE = /\b(do not|don't|dont|tell me|let me know|extremely thoroughly|thoroughly|random cases|for the sake of it|if there is no such|if none exists|if no such)\b/i;
+var SEARCH_PRIORITY_PATTERN = /\b(high|court|appeal|leave|adduce|evidence|assessment|damages?|tranche|tranches|started|late|stage|hearing|trial|mediation|case|cases|allow|allowed)\b/;
 function isLikelySearchParam(urlTemplate, param) {
   const lowerParam = param.toLowerCase();
   if (/(^q$|^k$|basicsearchkey|basic_search_key|query|keyword|keywords|search|lookup|find|term|phrase|querystr|query_string)/.test(lowerParam)) {
@@ -14981,16 +14982,94 @@ function selectSearchTermsForExecution(intent) {
     return literal;
   if (!hasSentencePunctuation && !tooLongForSingleField)
     return literal;
+  if (tooLongForSingleField) {
+    const compactPhraseQuery = buildCompactPhraseSearchQuery(intent);
+    if (compactPhraseQuery)
+      return compactPhraseQuery;
+  }
   return condensed;
 }
+function buildCompactPhraseSearchQuery(intent) {
+  const stripped = stripSearchIntentBoilerplate(intent);
+  if (!stripped)
+    return null;
+  const sourceText = extractLiteralSearchTermsFromIntent(intent) ?? stripped;
+  const clauses = sourceText.split(/(?<=[.!?])\s+|\n+/).map((clause) => clause.trim()).filter(Boolean);
+  const phraseScores = new Map;
+  const remember = (rawPhrase, score, clauseIndex) => {
+    const phrase = rawPhrase.toLowerCase().replace(/[^a-z0-9\s/-]+/g, " ").replace(/\s+/g, " ").trim();
+    if (!phrase)
+      return;
+    const words = phrase.split(/\s+/).filter(Boolean);
+    const contentWords = words.filter((word) => !SEARCH_INTENT_STOPWORDS.has(word));
+    if (contentWords.length < 2)
+      return;
+    if (!contentWords.some((word) => SEARCH_PRIORITY_PATTERN.test(word)))
+      return;
+    if (words.length > 8)
+      return;
+    if (SEARCH_INSTRUCTION_NOISE.test(phrase))
+      return;
+    const priorityHits = contentWords.filter((word) => SEARCH_PRIORITY_PATTERN.test(word)).length;
+    const proceduralHits = contentWords.filter((word) => /^(started|tranche|tranches|allow|allowed)$/.test(word)).length;
+    const startsBadly = /^(eg|\d)$/.test(words[0] ?? "") || /^\d+$/.test(words[0] ?? "");
+    const endsBadly = /^(eg|\d)$/.test(words[words.length - 1] ?? "") || /^\d+$/.test(words[words.length - 1] ?? "");
+    const connectorHits = words.filter((word) => ["of", "to", "for", "at", "after"].includes(word)).length;
+    if (/\b(such|none|random)\b/.test(phrase))
+      return;
+    const boostedScore = score + Math.min(contentWords.length, 4) + priorityHits * 3 + proceduralHits * 4 + connectorHits + (words.length >= 3 && words.length <= 5 ? 2 : 0) + (/\d/.test(phrase) ? 2 : 0) - (startsBadly ? 4 : 0) - (endsBadly ? 4 : 0) - (/\beg\b/.test(phrase) ? 6 : 0);
+    const existing = phraseScores.get(phrase);
+    if (!existing || boostedScore > existing.score)
+      phraseScores.set(phrase, { score: boostedScore, clauseIndex });
+  };
+  for (const [clauseIndex, clause] of clauses.entries()) {
+    for (const match of clause.matchAll(/["“”']([^"“”']{3,80})["“”']/g)) {
+      remember(match[1], 12, clauseIndex);
+    }
+  }
+  for (const [clauseIndex, clause] of clauses.entries()) {
+    for (const match of clause.matchAll(/\b[a-z0-9-]+(?:\s+(?:of|to|for|at|after)\s+[a-z0-9-]+){1,4}\b/gi)) {
+      remember(match[0], 14, clauseIndex);
+    }
+    const tokens = clause.toLowerCase().replace(/[^a-z0-9\s/-]+/g, " ").split(/\s+/).filter(Boolean);
+    for (let start2 = 0;start2 < tokens.length; start2++) {
+      for (let size = 2;size <= 6 && start2 + size <= tokens.length; size++) {
+        const slice = tokens.slice(start2, start2 + size);
+        if (SEARCH_INTENT_STOPWORDS.has(slice[0]) || SEARCH_INTENT_STOPWORDS.has(slice[slice.length - 1]))
+          continue;
+        remember(slice.join(" "), 6 - Math.abs(size - 4), clauseIndex);
+      }
+    }
+  }
+  const selected = [];
+  const selectedRaw = [];
+  let currentLength = 0;
+  const clauseCounts = new Map;
+  for (const [phrase, meta] of Array.from(phraseScores.entries()).sort((a, b) => b[1].score - a[1].score || a[0].length - b[0].length)) {
+    if (selectedRaw.some((chosen) => chosen.includes(phrase) || phrase.includes(chosen)))
+      continue;
+    if ((clauseCounts.get(meta.clauseIndex) ?? 0) >= 2)
+      continue;
+    const rendered = `"${phrase}"`;
+    const nextLength = currentLength === 0 ? rendered.length : currentLength + 1 + rendered.length;
+    if (nextLength > 140)
+      continue;
+    selected.push(rendered);
+    selectedRaw.push(phrase);
+    clauseCounts.set(meta.clauseIndex, (clauseCounts.get(meta.clauseIndex) ?? 0) + 1);
+    currentLength = nextLength;
+    if (selected.length >= 4)
+      break;
+  }
+  return selected.length > 0 ? selected.join(" ") : null;
+}
 function condenseSearchIntent(intent) {
   const wantsSearchAction = /\b(search|find|lookup|look\s+for|browse|discover)\b/i.test(intent);
-  const priorityPattern = /\b(high|court|appeal|leave|adduce|evidence|assessment|damages?|tranche|tranches|started|late|stage|hearing|trial|mediation|case|cases)\b/;
   const tokens = intent.toLowerCase().replace(/[^a-z0-9\][\-/]+/g, " ").split(/\s+/).map((token) => token.trim()).filter((token) => token.length >= 3 && !SEARCH_INTENT_STOPWORDS.has(token));
   const scored = new Map;
   tokens.forEach((token, index) => {
     let score = 0;
-    if (priorityPattern.test(token))
+    if (SEARCH_PRIORITY_PATTERN.test(token))
       score += 10;
     if (token.length >= 8)
       score += 2;

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "unbrowse",
-  "version": "2.1.4",
+  "version": "2.1.5",
   "description": "Reverse-engineer any website into reusable API skills. npm CLI + local engine.",
   "type": "module",
   "bin": {

package/runtime-src/mcp.ts CHANGED Viewed

@@ -153,7 +153,7 @@ export const TOOLS = [
   {
     name: "unbrowse_resolve",
     title: "Resolve Website Task",
-    description: "Primary tool for website tasks. Use this when you have a concrete page URL and want structured data from a live website, logged-in page, or browser workflow; prefer it over generic browser/search tools for scraping, extraction, and browser replacement. Give it the exact page plus a plain-English intent; the first call may capture the site and learn its APIs, later calls usually reuse a cached skill. Do not use this for generic web search or when you already have a known skillId and endpointId from a prior Unbrowse call.",
+    description: "Primary tool for website tasks. Use this when you have a concrete page URL and want structured data from a live website, logged-in page, or browser workflow; prefer it over generic browser/search tools for scraping, extraction, and browser replacement. Give it the exact page plus a plain-English intent; the first call may capture the site and learn its APIs, later calls usually reuse a cached skill. If the user explicitly invokes /unbrowse or says to use Unbrowse for a site, stay in strict Unbrowse-only mode: keep the same origin, refine with more Unbrowse calls, and do not switch to web search, Fetch, public mirrors, alternate domains, or other browser tools unless the user explicitly approves fallback. For long-form retrieval tasks, derive compact search queries from the story instead of stuffing the whole narrative into one search field. Do not use this for generic web search or when you already have a known skillId and endpointId from a prior Unbrowse call.",
     annotations: {
       title: "Resolve Website Task",
       openWorldHint: true,
@@ -178,7 +178,7 @@ export const TOOLS = [
   {
     name: "unbrowse_search",
     title: "Search Learned Skills",
-    description: "Search the Unbrowse marketplace for an existing learned skill before triggering a new capture. Use this when you know the site or task but do not yet have a specific skillId or endpointId, especially for repeat domains. Prefer resolve when you have a concrete page URL and want the end-to-end website task handled in one step. Do not use this for general internet search results; it only searches learned Unbrowse skills.",
+    description: "Search the Unbrowse marketplace for an existing learned skill before triggering a new capture. Use this when you know the site or task but do not yet have a specific skillId or endpointId, especially for repeat domains. Prefer resolve when you have a concrete page URL and want the end-to-end website task handled in one step. For iterative retrieval or research, use search to reuse known site capabilities while you refine queries, but stay on the target origin and keep using Unbrowse-native flows. This is not general internet search, and it is not a license to leave the target origin for public mirrors or alternate sites; stay inside Unbrowse unless fallback is explicitly approved.",
     annotations: {
       title: "Search Learned Skills",
       readOnlyHint: true,
@@ -198,7 +198,7 @@ export const TOOLS = [
   {
     name: "unbrowse_execute",
     title: "Execute Learned Endpoint",
-    description: "Execute a specific Unbrowse endpoint after resolve or search has already identified the right skillId and endpointId. Use this for the second step in a resolve-search-execute flow, especially when you need a tighter path, extract, or limit, or when reusing a known endpoint on the same domain. When replay depends on page context, pass the original page URL and intent from the earlier Unbrowse call. Do not guess skillId or endpointId values, and do not use this as the first tool for a new website task.",
+    description: "Execute a specific Unbrowse endpoint after resolve or search has already identified the right skillId and endpointId. Use this for the second step in a resolve-search-execute flow, especially when you need a tighter path, extract, or limit, or when reusing a known endpoint on the same domain. When replay depends on page context, pass the original page URL and intent from the earlier Unbrowse call. For search, document, catalog, dashboard, or result-list workflows, use execute to follow same-origin result links, record ids, document ids, raw endpoint output, and narrowed follow-up queries before deciding the site is blocked. Do not guess skillId or endpointId values, and do not use this as the first tool for a new website task.",
     annotations: {
       title: "Execute Learned Endpoint",
       openWorldHint: true,
@@ -225,7 +225,7 @@ export const TOOLS = [
   {
     name: "unbrowse_login",
     title: "Capture Site Login",
-    description: "Open an interactive browser login flow for a gated site so later Unbrowse calls can reuse the captured auth state. Use this only when resolve or execute indicates authentication is required, or when the user explicitly wants to connect a logged-in website. Do not use this for ordinary public pages.",
+    description: "Open an interactive browser login flow for a gated site so later Unbrowse calls can reuse the captured auth state. Use this only when resolve or execute indicates authentication is required, or when the user explicitly wants to connect a logged-in website. Login should target the exact page or workflow surface the user cares about, then later Unbrowse calls should retry that same URL instead of drifting to the homepage, marketing pages, help pages, public mirrors, or alternate domains. Do not use this for ordinary public pages.",
     annotations: {
       title: "Capture Site Login",
       openWorldHint: true,

package/runtime-src/orchestrator/index.ts CHANGED Viewed

@@ -990,6 +990,8 @@ const SEARCH_DIRECTIVE_PREFIX =
 const SEARCH_TRAILING_SITE_HINT = /\s+(on|at|from|in|via)\s+\S+$/i;
 const SEARCH_INSTRUCTION_NOISE =
   /\b(do not|don't|dont|tell me|let me know|extremely thoroughly|thoroughly|random cases|for the sake of it|if there is no such|if none exists|if no such)\b/i;
+const SEARCH_PRIORITY_PATTERN =
+  /\b(high|court|appeal|leave|adduce|evidence|assessment|damages?|tranche|tranches|started|late|stage|hearing|trial|mediation|case|cases|allow|allowed)\b/;
 function isLikelySearchParam(
   urlTemplate: string,
@@ -1109,12 +1111,103 @@ export function selectSearchTermsForExecution(intent: string): string | null {
   const tooLongForSingleField = literal.length > 180 || wordCount > 24;
   if (hasQuotedPhrase && !tooLongForSingleField) return literal;
   if (!hasSentencePunctuation && !tooLongForSingleField) return literal;
+  if (tooLongForSingleField) {
+    const compactPhraseQuery = buildCompactPhraseSearchQuery(intent);
+    if (compactPhraseQuery) return compactPhraseQuery;
+  }
   return condensed;
 }
+function buildCompactPhraseSearchQuery(intent: string): string | null {
+  const stripped = stripSearchIntentBoilerplate(intent);
+  if (!stripped) return null;
+  const sourceText = extractLiteralSearchTermsFromIntent(intent) ?? stripped;
+  const clauses = sourceText
+    .split(/(?<=[.!?])\s+|\n+/)
+    .map((clause) => clause.trim())
+    .filter(Boolean);
+  const phraseScores = new Map<string, { score: number; clauseIndex: number }>();
+  const remember = (rawPhrase: string, score: number, clauseIndex: number) => {
+    const phrase = rawPhrase
+      .toLowerCase()
+      .replace(/[^a-z0-9\s/-]+/g, " ")
+      .replace(/\s+/g, " ")
+      .trim();
+    if (!phrase) return;
+    const words = phrase.split(/\s+/).filter(Boolean);
+    const contentWords = words.filter((word) => !SEARCH_INTENT_STOPWORDS.has(word));
+    if (contentWords.length < 2) return;
+    if (!contentWords.some((word) => SEARCH_PRIORITY_PATTERN.test(word))) return;
+    if (words.length > 8) return;
+    if (SEARCH_INSTRUCTION_NOISE.test(phrase)) return;
+    const priorityHits = contentWords.filter((word) => SEARCH_PRIORITY_PATTERN.test(word)).length;
+    const proceduralHits = contentWords.filter((word) => /^(started|tranche|tranches|allow|allowed)$/.test(word)).length;
+    const startsBadly = /^(eg|\d)$/.test(words[0] ?? "") || /^\d+$/.test(words[0] ?? "");
+    const endsBadly = /^(eg|\d)$/.test(words[words.length - 1] ?? "") || /^\d+$/.test(words[words.length - 1] ?? "");
+    const connectorHits = words.filter((word) => ["of", "to", "for", "at", "after"].includes(word)).length;
+    if (/\b(such|none|random)\b/.test(phrase)) return;
+    const boostedScore =
+      score
+      + Math.min(contentWords.length, 4)
+      + priorityHits * 3
+      + proceduralHits * 4
+      + connectorHits
+      + (words.length >= 3 && words.length <= 5 ? 2 : 0)
+      + (/\d/.test(phrase) ? 2 : 0)
+      - (startsBadly ? 4 : 0)
+      - (endsBadly ? 4 : 0)
+      - (/\beg\b/.test(phrase) ? 6 : 0);
+    const existing = phraseScores.get(phrase);
+    if (!existing || boostedScore > existing.score) phraseScores.set(phrase, { score: boostedScore, clauseIndex });
+  };
+  for (const [clauseIndex, clause] of clauses.entries()) {
+    for (const match of clause.matchAll(/["“”']([^"“”']{3,80})["“”']/g)) {
+      remember(match[1], 12, clauseIndex);
+    }
+  }
+  for (const [clauseIndex, clause] of clauses.entries()) {
+    for (const match of clause.matchAll(/\b[a-z0-9-]+(?:\s+(?:of|to|for|at|after)\s+[a-z0-9-]+){1,4}\b/gi)) {
+      remember(match[0], 14, clauseIndex);
+    }
+    const tokens = clause
+      .toLowerCase()
+      .replace(/[^a-z0-9\s/-]+/g, " ")
+      .split(/\s+/)
+      .filter(Boolean);
+    for (let start = 0; start < tokens.length; start++) {
+      for (let size = 2; size <= 6 && start + size <= tokens.length; size++) {
+        const slice = tokens.slice(start, start + size);
+        if (SEARCH_INTENT_STOPWORDS.has(slice[0]) || SEARCH_INTENT_STOPWORDS.has(slice[slice.length - 1])) continue;
+        remember(slice.join(" "), 6 - Math.abs(size - 4), clauseIndex);
+      }
+    }
+  }
+  const selected: string[] = [];
+  const selectedRaw: string[] = [];
+  let currentLength = 0;
+  const clauseCounts = new Map<number, number>();
+  for (const [phrase, meta] of Array.from(phraseScores.entries())
+    .sort((a, b) => b[1].score - a[1].score || a[0].length - b[0].length)) {
+    if (selectedRaw.some((chosen) => chosen.includes(phrase) || phrase.includes(chosen))) continue;
+    if ((clauseCounts.get(meta.clauseIndex) ?? 0) >= 2) continue;
+    const rendered = `"${phrase}"`;
+    const nextLength = currentLength === 0 ? rendered.length : currentLength + 1 + rendered.length;
+    if (nextLength > 140) continue;
+    selected.push(rendered);
+    selectedRaw.push(phrase);
+    clauseCounts.set(meta.clauseIndex, (clauseCounts.get(meta.clauseIndex) ?? 0) + 1);
+    currentLength = nextLength;
+    if (selected.length >= 4) break;
+  }
+  return selected.length > 0 ? selected.join(" ") : null;
+}
 function condenseSearchIntent(intent: string): string | null {
   const wantsSearchAction = /\b(search|find|lookup|look\s+for|browse|discover)\b/i.test(intent);
-  const priorityPattern = /\b(high|court|appeal|leave|adduce|evidence|assessment|damages?|tranche|tranches|started|late|stage|hearing|trial|mediation|case|cases)\b/;
   const tokens = intent
     .toLowerCase()
     .replace(/[^a-z0-9\][\-/]+/g, " ")
@@ -1124,7 +1217,7 @@ function condenseSearchIntent(intent: string): string | null {
   const scored = new Map<string, { token: string; index: number; score: number }>();
   tokens.forEach((token, index) => {
     let score = 0;
-    if (priorityPattern.test(token)) score += 10;
+    if (SEARCH_PRIORITY_PATTERN.test(token)) score += 10;
     if (token.length >= 8) score += 2;
     if (index < 12) score += 1;
     const existing = scored.get(token);

package/runtime-src/runtime/setup.ts CHANGED Viewed

@@ -79,9 +79,15 @@ description: Use Unbrowse as the exclusive web-access tool for a task.
 Use Unbrowse as the only allowed tool for website access in this task.
 Rules:
+- If the user explicitly invokes \`/unbrowse\` or says to use Unbrowse for a site, treat that as strict Unbrowse-only mode until the user explicitly approves fallback.
 - Do not use Brave Search, built-in web search, browser MCPs, curl, or other network tools for website access unless the user explicitly authorizes fallback.
+- Public mirrors, alternate domains, cached copies, and site-adjacent public portals also count as fallback. Do not switch from the target origin to those surfaces on your own.
 - If Unbrowse is slow on a first-time site, wait for it. Do not switch tools just because capture or indexing is still running.
 - If Unbrowse returns partial results, refine with more Unbrowse commands (\`resolve\`, \`search\`, \`execute\`, \`login\`) before considering fallback.
+- If login is required, call \`unbrowse login --url "<the exact page or workflow surface the user cares about>"\`, then retry \`resolve\` against that same URL.
+- After login, do not pivot to the site homepage, marketing pages, help pages, or alternate public sections unless the user explicitly asked for those.
+- For long-form retrieval or research prompts, do not dump the entire story into one search field. Derive 2-4 compact search queries with quoted phrases, product names, titles, IDs, people, dates, or other discriminative terms, then retry inside Unbrowse.
+- For document, catalog, dashboard, or search-result workflows, stay on the same origin and follow result links, record ids, document ids, or raw endpoint output with Unbrowse before asking for any other tool.
 - If Unbrowse genuinely cannot complete the task, explain why and ask before using another tool.
 Suggested start: