@pencil-agent/nano-pencil 2.0.1 → 2.0.2
- package/README.md +267 -267
- package/dist/build-meta.json +3 -3
- package/dist/core/export-html/AGENT.md +11 -11
- package/dist/core/export-html/template.css +971 -971
- package/dist/core/export-html/template.html +54 -54
- package/dist/core/model/custom-providers.js +1 -1
- package/dist/core/model-registry.js +5 -5
- package/dist/extensions/builtin/AGENT.md +115 -115
- package/dist/extensions/builtin/browser/AGENT.md +17 -17
- package/dist/extensions/builtin/browser/agent-workspace/agent_helpers.py +12 -12
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/amazon/product-search.md +198 -198
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/archive-org/scraping.md +341 -341
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/arxiv/scraping.md +311 -311
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/arxiv-bulk/scraping.md +333 -333
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/atlas/overview.md +70 -70
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/booking-com/scraping.md +578 -578
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/capterra/scraping.md +440 -440
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/centilebrain/generate-estimates.md +110 -110
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/coingecko/scraping.md +325 -325
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/coinmarketcap/scraping.md +463 -463
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/coursera/scraping.md +360 -360
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/craigslist/scraping.md +390 -390
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/crossref/scraping.md +568 -568
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/dev-to/scraping.md +323 -323
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/duckduckgo/scraping.md +349 -349
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/ebay/scraping.md +435 -435
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/etsy/scraping.md +506 -506
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/eventbrite/scraping.md +363 -363
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/expedia/automation.md +168 -168
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/facebook/groups.md +236 -236
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/facebook/pages.md +295 -295
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/framer/editor.md +108 -108
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/fred/scraping.md +493 -493
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/g2/scraping.md +580 -580
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/genius/scraping.md +511 -511
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/github/repo-actions.md +65 -65
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/github/scraping.md +184 -184
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/glassdoor/scraping.md +543 -543
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/gmail/compose.md +122 -122
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/goodreads/scraping.md +461 -461
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/gutenberg/scraping.md +383 -383
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/hackernews/scraping.md +243 -243
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/howlongtobeat/scraping.md +473 -473
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/imdb/scraping.md +271 -271
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/itch-io/scraping.md +436 -436
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/job-boards/indeed-glassdoor.md +1021 -1021
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/letterboxd/scraping.md +349 -349
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/linkedin/invitation-manager.md +109 -109
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/loom/folder-enumeration.md +170 -170
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/macrotrends/scraping.md +537 -537
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/medium/article-hydration.md +120 -120
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/medium/scraping.md +414 -414
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/metacritic/scraping.md +477 -477
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/musicbrainz/scraping.md +478 -478
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/nasa/scraping.md +339 -339
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/news-aggregation/multi-source.md +205 -205
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/open-library/scraping.md +472 -472
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/openalex/scraping.md +470 -470
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/openstreetmap/scraping.md +490 -490
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/package-registries/npm-pypi.md +478 -478
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/polymarket/scraping.md +234 -234
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/producthunt/scraping.md +307 -307
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/pubmed/scraping.md +421 -421
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/quora/scraping.md +364 -364
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/rawg/scraping.md +352 -352
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/reddit/scraping.md +124 -124
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/rest-countries/scraping.md +233 -233
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/sec-edgar/scraping.md +361 -361
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/README.md +36 -36
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/embedded-apps.md +72 -72
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/knowledge-base.md +109 -109
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/polaris-inputs.md +137 -137
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/soundcloud/scraping.md +362 -362
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/spotify/scraping.md +339 -339
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/stackoverflow/scraping.md +435 -435
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/steam/scraping.md +575 -575
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/substack/scraping.md +338 -338
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/thetechgeeks/pricing.md +52 -52
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/tiktok/upload.md +107 -107
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/tradingview/scraping.md +309 -309
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/trello/boards-and-lists.md +88 -88
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/trustpilot/scraping.md +375 -375
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/walmart/scraping.md +444 -444
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/wayback-machine/scraping.md +306 -306
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/weather/scraping.md +398 -398
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/wellfound/scraping.md +596 -596
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/world-bank/scraping.md +356 -356
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/xiaohongshu/scraping.md +84 -84
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/youtube/scraping.md +418 -418
- package/dist/extensions/builtin/browser/agent-workspace/domain-skills/zillow/scraping.md +433 -433
- package/dist/extensions/builtin/browser/browser.md +73 -73
- package/dist/extensions/builtin/browser/install.md +142 -142
- package/dist/extensions/builtin/browser/interaction-skills/connection.md +48 -48
- package/dist/extensions/builtin/browser/interaction-skills/cookies.md +3 -3
- package/dist/extensions/builtin/browser/interaction-skills/cross-origin-iframes.md +3 -3
- package/dist/extensions/builtin/browser/interaction-skills/dialogs.md +64 -64
- package/dist/extensions/builtin/browser/interaction-skills/downloads.md +3 -3
- package/dist/extensions/builtin/browser/interaction-skills/drag-and-drop.md +3 -3
- package/dist/extensions/builtin/browser/interaction-skills/dropdowns.md +3 -3
- package/dist/extensions/builtin/browser/interaction-skills/iframes.md +3 -3
- package/dist/extensions/builtin/browser/interaction-skills/network-requests.md +3 -3
- package/dist/extensions/builtin/browser/interaction-skills/print-as-pdf.md +3 -3
- package/dist/extensions/builtin/browser/interaction-skills/profile-sync.md +90 -90
- package/dist/extensions/builtin/browser/interaction-skills/screenshots.md +17 -17
- package/dist/extensions/builtin/browser/interaction-skills/scrolling.md +3 -3
- package/dist/extensions/builtin/browser/interaction-skills/shadow-dom.md +3 -3
- package/dist/extensions/builtin/browser/interaction-skills/tabs.md +69 -69
- package/dist/extensions/builtin/browser/interaction-skills/uploads.md +1 -1
- package/dist/extensions/builtin/browser/interaction-skills/viewport.md +3 -3
- package/dist/extensions/builtin/browser/src/browser_harness/AGENT.md +15 -15
- package/dist/extensions/builtin/browser/src/browser_harness/__init__.py +8 -8
- package/dist/extensions/builtin/browser/src/browser_harness/_ipc.py +90 -90
- package/dist/extensions/builtin/browser/src/browser_harness/admin.py +722 -722
- package/dist/extensions/builtin/browser/src/browser_harness/daemon.py +328 -328
- package/dist/extensions/builtin/browser/src/browser_harness/helpers.py +396 -396
- package/dist/extensions/builtin/browser/src/browser_harness/run.py +103 -103
- package/dist/extensions/builtin/discipline/skills/brainstorming/SKILL.md +33 -33
- package/dist/extensions/builtin/discipline/skills/executing-plans/SKILL.md +25 -25
- package/dist/extensions/builtin/discipline/skills/finishing-development-branch/SKILL.md +25 -25
- package/dist/extensions/builtin/discipline/skills/receiving-code-review/SKILL.md +22 -22
- package/dist/extensions/builtin/discipline/skills/requesting-code-review/SKILL.md +31 -31
- package/dist/extensions/builtin/discipline/skills/systematic-debugging/SKILL.md +28 -28
- package/dist/extensions/builtin/discipline/skills/test-driven-development/SKILL.md +32 -32
- package/dist/extensions/builtin/discipline/skills/using-git-worktrees/SKILL.md +25 -25
- package/dist/extensions/builtin/discipline/skills/verification-before-completion/SKILL.md +27 -27
- package/dist/extensions/builtin/discipline/skills/writing-plans/SKILL.md +26 -26
- package/dist/extensions/builtin/goal/README.md +67 -67
- package/dist/extensions/builtin/grub/README.md +112 -112
- package/dist/extensions/builtin/link-world/agent-workspace/README.md +16 -16
- package/dist/extensions/builtin/link-world/internet-search/internet-search.md +65 -65
- package/dist/extensions/builtin/link-world/link-world-agent.md +82 -82
- package/dist/extensions/builtin/link-world/linkworld.md +313 -313
- package/dist/extensions/builtin/link-world/network-routing/network-routing.md +67 -67
- package/dist/extensions/builtin/loop/README.md +92 -92
- package/dist/extensions/builtin/mcp/figma-design.md +68 -68
- package/dist/extensions/builtin/mcp/mcp-management.md +85 -85
- package/dist/extensions/builtin/recap/AGENT.md +15 -15
- package/dist/extensions/builtin/sal/README.md +72 -72
- package/dist/extensions/builtin/security-audit/README.md +289 -289
- package/dist/extensions/builtin/team/AGENT.md +112 -112
- package/dist/extensions/builtin/team/TESTING.md +299 -299
- package/dist/extensions/builtin/token-save/README.md +56 -56
- package/dist/extensions/optional/AGENT.md +10 -10
- package/dist/modes/interactive/controllers/input-submit-controller.js +2 -2
- package/dist/modes/interactive/controllers/stream-render-controller.js +2 -2
- package/dist/modes/interactive/interactive-mode.js +19 -19
- package/dist/modes/interactive/theme/dark.json +85 -85
- package/dist/modes/interactive/theme/light.json +84 -84
- package/dist/modes/interactive/theme/theme-schema.json +335 -335
- package/dist/modes/interactive/theme/warm.json +81 -81
- package/dist/node_modules/@pencil-agent/ai/dist/cli.js +0 -0
- package/dist/node_modules/@pencil-agent/ai/dist/models.generated.js +1 -1
- package/docs/ACP/345/215/217/350/256/256/351/233/206/346/210/220/345/274/200/345/217/221/346/226/207/346/241/243.md +851 -0
- package/docs/SDK-TESTING.md +364 -0
- package/docs/codex-goal-command-impl.md +1055 -1055
- package/docs/codex-goal-vs-grub.md +500 -500
- package/docs/custom-provider.md +27 -27
- package/docs/extensions.md +27 -27
- package/docs/keybindings.md +27 -27
- package/docs/loop 重构完成总结.md +250 -250
- package/docs/loop 重构完成报告.md +122 -122
- package/docs/loop 重构方案.md +1222 -1222
- package/docs/loop 重构方案实现报告.md +158 -158
- package/docs/loop 重构方案对比分析.md +128 -128
- package/docs/loop 重构计划.md +320 -320
- package/docs/loop-usage-examples.md +214 -214
- package/docs/mem-core/346/212/200/346/234/257/346/226/207/346/241/243.md +593 -0
- package/docs/models.md +27 -27
- package/docs/packages.md +27 -27
- package/docs/pi-design-philosophy.md +457 -457
- package/docs/planmode.md +1987 -1987
- package/docs/prompt-templates.md +27 -27
- package/docs/providers.md +27 -27
- package/docs/sdk.md +27 -27
- package/docs/skills.md +27 -27
- package/docs/startup-performance-optimization.md +301 -0
- package/docs/themes.md +27 -27
- package/docs/tui.md +27 -27
- package/docs/认知地图.md +47 -0
- package/package.json +190 -190
- package/docs/cc-agent-design.md +0 -1297
- package/docs/cc-tui-design.md +0 -1333
- package/docs/nanoPencil-学习计划.md +0 -170
- package/docs/scan-report.md +0 -3820
- package/docs//345/257/271/346/240/207Claude-Code.md +0 -1775
- package/docs//351/230/277/351/207/214/345/267/264/345/267/264/350/264/242/346/212/245/345/210/206/346/236/220/344/271/246.md +0 -261
package/dist/extensions/builtin/browser/agent-workspace/domain-skills/linkedin/invitation-manager.md
CHANGED
@@ -1,109 +1,109 @@
-# LinkedIn — Invitation Manager
-
-Accept or ignore pending connection invitations in bulk from
-`https://www.linkedin.com/mynetwork/invitation-manager/received/<FILTER>/`.
-
-## URL filters
-
-The trailing slug pre-filters the received invitations. Observed slugs:
-
-- `PEOPLE_WITH_MUTUAL_CONNECTION` — people who share a mutual connection
-- `PEOPLE_WITH_MUTUAL_SCHOOL` — people who share a school
-- omit the slug (`.../received/`) for all pending invitations
-
-The filter chip at the top of the page mirrors the URL and also renders
-`All (N)`, `Mutual Connections (N)`, `Your School (N)` — the `(N)` is the
-authoritative remaining-count for the active filter and is what you loop on.
-
-## Button selectors
-
-Each pending-invitation card contains an Accept and an Ignore control.
-**The aria-label formats are different** for the two buttons — don't derive
-one from the other:
-
-- Accept: `aria-label = "Accept <Name>'s invitation"` (note: curly `’`, not ASCII `'`)
-- Ignore: `aria-label = "Ignore an invitation to connect from <Name>"`
-
-```python
-# Match either — both are unique per card
-accepts = js("Array.from(document.querySelectorAll('button, a')).filter(b => (b.getAttribute('aria-label')||'').startsWith('Accept ')).length")
-ignores = js("Array.from(document.querySelectorAll('button')).filter(b => (b.getAttribute('aria-label')||'').toLowerCase().startsWith('ignore')).length")
-```
-
-## Trap: "follows you" cards render Accept as `<a>`, not `<button>`
-
-For invitations labeled `<Name> follows you and is inviting you to connect`
-(typically Premium users' auto-invites), the Accept control is an `<a href>`,
-not a `<button>` — and the `href` points at the **current page URL**.
-
-`<a>.click()` follows the href → same-URL soft-nav → accept never fires.
-Dispatched `MouseEvent`s and coordinate `Input.dispatchMouseEvent` clicks
-also land on the element (you can see the focus ring appear) but do not
-trigger the underlying accept handler. **There is no known way to accept
-these via CDP.** Click the Ignore button instead (Ignore is always a
-`<button>` and works with a normal coordinate click), or skip the row.
-
-Detect with `element.tagName === 'A'` on the Accept element.
-
-```python
-# In your extractor, capture the tag so downstream logic can route these
-rows = js(r"""
-(() => {
-  const accepts = Array.from(document.querySelectorAll('button, a'))
-    .filter(b => (b.getAttribute('aria-label')||'').startsWith('Accept ') && !b.disabled);
-  return accepts.map(a => ({aria: a.getAttribute('aria-label'), tag: a.tagName}));
-})()
-""")
-```
-
-## Pagination — reload, don't scroll
-
-The list only renders ~10 cards at a time. After you click Accept on the
-visible batch, LinkedIn replaces the pending section with a "X is now a
-connection" acknowledgment list + "Suggestions for you" — the next batch of
-pending invites does **not** auto-mount. Window-scroll does not trigger
-lazy-load either.
-
-Pattern:
-
-1. Navigate to the filter URL, `wait_for_load()`, sleep ~2.5s.
-2. Extract visible rows, decide, click Accept/Ignore for each (`.click()` via
-   JS works for `<button>` Accept and Ignore; coordinate click via
-   `Input.dispatchMouseEvent` also works).
-3. Reload the URL (`cdp("Page.navigate", url=...)`). Do **not** rely on
-   scrolling or clicking a "show more" control.
-4. Repeat until the filter chip shows `(0)` or no Accept buttons remain.
-
-Chip count decreases by the number of successful accepts + ignores per
-cycle — use it as the loop guard.
-
-## Safety modal: "Take care when connecting"
-
-LinkedIn occasionally interposes a `"Take care when connecting"` dialog
-when you click Accept on a connection it considers unfamiliar. The dialog
-has `View profile` and `Accept invite` buttons — click `Accept invite` to
-proceed. Watch for it between accepts; it's intermittent, not per-row.
-
-## Quick sketch
-
-```python
-import time
-
-def chip():
-    return js(r"""(() => {
-      const el = Array.from(document.querySelectorAll('button, a')).map(e => (e.textContent||'').trim())
-        .find(t => /^Mutual Connections \(/.test(t));
-      return el || '';
-    })()""")
-
-while True:
-    cdp("Page.navigate", url="https://www.linkedin.com/mynetwork/invitation-manager/received/PEOPLE_WITH_MUTUAL_CONNECTION/")
-    wait_for_load()
-    time.sleep(2.5)
-    n = int(js(r"""(() => Array.from(document.querySelectorAll('button, a'))
-      .filter(b => (b.getAttribute('aria-label')||'').startsWith('Accept ') && !b.disabled).length)()"""))
-    if n == 0:
-        break
-    # click each Accept (route tag === 'A' rows to Ignore — see trap above)
-    ...
-```
+# LinkedIn — Invitation Manager
+
+Accept or ignore pending connection invitations in bulk from
+`https://www.linkedin.com/mynetwork/invitation-manager/received/<FILTER>/`.
+
+## URL filters
+
+The trailing slug pre-filters the received invitations. Observed slugs:
+
+- `PEOPLE_WITH_MUTUAL_CONNECTION` — people who share a mutual connection
+- `PEOPLE_WITH_MUTUAL_SCHOOL` — people who share a school
+- omit the slug (`.../received/`) for all pending invitations
+
+The filter chip at the top of the page mirrors the URL and also renders
+`All (N)`, `Mutual Connections (N)`, `Your School (N)` — the `(N)` is the
+authoritative remaining-count for the active filter and is what you loop on.
+
+## Button selectors
+
+Each pending-invitation card contains an Accept and an Ignore control.
+**The aria-label formats are different** for the two buttons — don't derive
+one from the other:
+
+- Accept: `aria-label = "Accept <Name>'s invitation"` (note: curly `’`, not ASCII `'`)
+- Ignore: `aria-label = "Ignore an invitation to connect from <Name>"`
+
+```python
+# Match either — both are unique per card
+accepts = js("Array.from(document.querySelectorAll('button, a')).filter(b => (b.getAttribute('aria-label')||'').startsWith('Accept ')).length")
+ignores = js("Array.from(document.querySelectorAll('button')).filter(b => (b.getAttribute('aria-label')||'').toLowerCase().startsWith('ignore')).length")
+```
+
+## Trap: "follows you" cards render Accept as `<a>`, not `<button>`
+
+For invitations labeled `<Name> follows you and is inviting you to connect`
+(typically Premium users' auto-invites), the Accept control is an `<a href>`,
+not a `<button>` — and the `href` points at the **current page URL**.
+
+`<a>.click()` follows the href → same-URL soft-nav → accept never fires.
+Dispatched `MouseEvent`s and coordinate `Input.dispatchMouseEvent` clicks
+also land on the element (you can see the focus ring appear) but do not
+trigger the underlying accept handler. **There is no known way to accept
+these via CDP.** Click the Ignore button instead (Ignore is always a
+`<button>` and works with a normal coordinate click), or skip the row.
+
+Detect with `element.tagName === 'A'` on the Accept element.
+
+```python
+# In your extractor, capture the tag so downstream logic can route these
+rows = js(r"""
+(() => {
+  const accepts = Array.from(document.querySelectorAll('button, a'))
+    .filter(b => (b.getAttribute('aria-label')||'').startsWith('Accept ') && !b.disabled);
+  return accepts.map(a => ({aria: a.getAttribute('aria-label'), tag: a.tagName}));
+})()
+""")
+```
+
+## Pagination — reload, don't scroll
+
+The list only renders ~10 cards at a time. After you click Accept on the
+visible batch, LinkedIn replaces the pending section with a "X is now a
+connection" acknowledgment list + "Suggestions for you" — the next batch of
+pending invites does **not** auto-mount. Window-scroll does not trigger
+lazy-load either.
+
+Pattern:
+
+1. Navigate to the filter URL, `wait_for_load()`, sleep ~2.5s.
+2. Extract visible rows, decide, click Accept/Ignore for each (`.click()` via
+   JS works for `<button>` Accept and Ignore; coordinate click via
+   `Input.dispatchMouseEvent` also works).
+3. Reload the URL (`cdp("Page.navigate", url=...)`). Do **not** rely on
+   scrolling or clicking a "show more" control.
+4. Repeat until the filter chip shows `(0)` or no Accept buttons remain.
+
+Chip count decreases by the number of successful accepts + ignores per
+cycle — use it as the loop guard.
+
+## Safety modal: "Take care when connecting"
+
+LinkedIn occasionally interposes a `"Take care when connecting"` dialog
+when you click Accept on a connection it considers unfamiliar. The dialog
+has `View profile` and `Accept invite` buttons — click `Accept invite` to
+proceed. Watch for it between accepts; it's intermittent, not per-row.
+
+## Quick sketch
+
+```python
+import time
+
+def chip():
+    return js(r"""(() => {
+      const el = Array.from(document.querySelectorAll('button, a')).map(e => (e.textContent||'').trim())
+        .find(t => /^Mutual Connections \(/.test(t));
+      return el || '';
+    })()""")
+
+while True:
+    cdp("Page.navigate", url="https://www.linkedin.com/mynetwork/invitation-manager/received/PEOPLE_WITH_MUTUAL_CONNECTION/")
+    wait_for_load()
+    time.sleep(2.5)
+    n = int(js(r"""(() => Array.from(document.querySelectorAll('button, a'))
+      .filter(b => (b.getAttribute('aria-label')||'').startsWith('Accept ') && !b.disabled).length)()"""))
+    if n == 0:
+        break
+    # click each Accept (route tag === 'A' rows to Ignore — see trap above)
+    ...
+```
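Editor's sketch (not part of the package): the routing rule described in the LinkedIn skill file, where Accept controls rendered as `<a>` cannot be accepted via CDP and are sent to Ignore or skipped, can be illustrated with two pure helpers. This is a minimal sketch under stated assumptions: `route_rows` and `chip_count` are hypothetical names, the row shape `{aria, tag}` follows the extractor shown in the file, and the chip text is assumed to end in a plain integer in parentheses such as `Mutual Connections (12)`.

```python
import re


def route_rows(rows):
    """Split extractor rows into normally clickable rows and <a>-rendered rows.

    Hypothetical helper: `rows` is assumed to be a list of dicts with the
    keys `aria` and `tag`, as returned by the extractor in the skill file.
    """
    clickable, anchor_rows = [], []
    for row in rows:
        # DOM tagName is upper-case ("A", "BUTTON"); normalise defensively.
        if (row.get("tag") or "").upper() == "A":
            anchor_rows.append(row)  # Accept will not fire: Ignore or skip
        else:
            clickable.append(row)
    return clickable, anchor_rows


def chip_count(text):
    """Parse the trailing "(N)" of a filter chip label; None if absent.

    Assumption: N is a plain integer with no thousands separator.
    """
    match = re.search(r"\((\d+)\)\s*$", text or "")
    return int(match.group(1)) if match else None
```

In a loop like the one in the file's quick sketch, rows in `clickable` would get the normal click, rows in `anchor_rows` would be routed to their Ignore button or left alone, and `chip_count` could serve as the loop guard.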
package/dist/extensions/builtin/browser/agent-workspace/domain-skills/loom/folder-enumeration.md
CHANGED
|
@@ -1,170 +1,170 @@
|
|
|
1
|
-
# Loom — Library Folder Enumeration
|
|
2
|
-
|
|
3
|
-
Field-tested against loom.com on 2026-04-26.
|
|
4
|
-
For private workspace library folders that require an authenticated session.
|
|
5
|
-
|
|
6
|
-
## TL;DR — When to use this skill vs yt-dlp
|
|
7
|
-
|
|
8
|
-
Loom has **two** kinds of folder URLs and they need different tools:
|
|
9
|
-
|
|
10
|
-
| URL pattern | Visibility | Tool |
|
|
11
|
-
|---|---|---|
|
|
12
|
-
| `loom.com/share/folder/<32-hex>` | Public-shared (anyone with link) | **yt-dlp** — `LoomFolderIE` already supports this. Skip browser-harness entirely. |
|
|
13
|
-
| `loom.com/looms/videos/<slug>-<32-hex>` | Private workspace library | **browser-harness** (this skill). yt-dlp doesn't support library folders, and the underlying `/v1/folders/<id>` endpoint returns `Forbidden` even with cookies. |
|
|
14
|
-
|
|
15
|
-
The library variant is what every Loom user sees in their own workspace sidebar. There is no public read API for it; the only programmatic route is the authenticated session in the user's browser. That's why this skill exists.
|
|
16
|
-
|
|
17
|
-
For the **download** itself (after enumeration), `yt-dlp --cookies-from-browser chrome -f http-transcoded` is the fast path — a single HTTP MP4 stream rather than the ~125 HLS fragments the default selection grabs. Substantially faster for bulk runs. See "Pipe to yt-dlp" at the bottom.
|
|
18
|
-
|
|
19
|
-
---
|
|
20
|
-
|
|
21
|
-
## 1. Attach to the user's open Loom tab
|
|
22
|
-
|
|
23
|
-
Always attach to the existing tab. **Do not** call `new_tab()` for `loom.com` — it spawns duplicate tabs in the user's Chrome profile (observed: four duplicate Loom tabs accumulated in one debugging session). The user has to clean those up manually afterwards.
|
|
24
|
-
|
|
25
|
-
```python
|
|
26
|
-
import time
|
|
27
|
-
tabs = cdp("Target.getTargets")
|
|
28
|
-
loom_tid = next(
|
|
29
|
-
(t["targetId"] for t in tabs.get("targetInfos", [])
|
|
30
|
-
if "loom.com/looms/videos/" in t.get("url", "")),
|
|
31
|
-
None,
|
|
32
|
-
)
|
|
33
|
-
if not loom_tid:
|
|
34
|
-
raise SystemExit("User must open the Loom library folder in Chrome first.")
|
|
35
|
-
switch_tab(loom_tid)
|
|
36
|
-
time.sleep(0.3)
|
|
37
|
-
```
|
|
38
|
-
|
|
39
|
-
If multiple Loom tabs of the same folder are already open (common after a few sessions), pick the freshest one and close the others with `cdp("Target.closeTarget", targetId=tid)` before scrolling — keeps the user's Chrome tidy and avoids future ambiguity in `Target.getTargets`.
|
|
40
|
-
|
|
41
|
-
---
|
|
42
|
-
|
|
43
|
-
## 2. Selector — `[data-videoid]`
|
|
44
|
-
|
|
45
|
-
Each video card is an `<article data-videoid="<32-hex>">`. Inside, the first text line is the title (with two ARIA prefix/suffix strings to strip):
|
|
46
|
-
|
|
47
|
-
```python
|
|
48
|
-
items = js("""
|
|
49
|
-
Array.from(document.querySelectorAll("[data-videoid]")).map(e => [
|
|
50
|
-
e.getAttribute("data-videoid"),
|
|
51
|
-
(e.innerText || "")
|
|
52
|
-
.split("\\n")[0]
|
|
53
|
-
.replace(/^Add /, "")
|
|
54
|
-
.replace(/ for bulk actions$/, "")
|
|
55
|
-
.trim()
|
|
56
|
-
])
|
|
57
|
-
""")
|
|
58
|
-
# items: [[id, title], ...] for the cards currently rendered
|
|
59
|
-
```
|
|
60
|
-
|
|
61
|
-
The visible `<a href>` on the card points at `loom.com/share/<id>`, so once you have the ID you can hand it straight to yt-dlp.
|
|
62
|
-
|
|
63
|
-
---
|
|
64
|
-
|
|
65
|
-
## 3. The virtualization quirk — `scrollIntoView`, NOT `scrollTop`
|
|
66
|
-
|
|
67
|
-
Loom's library uses an aggressive virtual scroller that:
|
|
68
|
-
- Renders ~30–60 cards at a time
|
|
69
|
-
- **Caps `document.scrollingElement.scrollTop` to a value far smaller than `scrollHeight`** as long as the bottom of the list isn't the bottom of the viewport
|
|
70
|
-
- Unmounts cards above the viewport once you scroll past them
|
|
71
|
-
|
|
72
|
-
Setting `scrollTop = N` directly silently fails (the value snaps back) once you hit the cap. `window.scrollTo` behaves the same way. Mouse-wheel and PageDown via CDP weren't fully tested in our run, but given they end up at the same `scrollingElement` they're unlikely to escape the cap either.
|
|
73
|
-
|
|
74
|
-
The reliable mechanic is to take the **last currently rendered card** and scroll it into view at the bottom — the virtual scroller responds by mounting the next batch below it:
|
|
75
|
-
|
|
76
|
-
```python
|
|
77
|
-
import time
|
|
78
|
-
ids_seen = {}
|
|
79
|
-
prev = -1; stuck = 0
|
|
80
|
-
js("document.scrollingElement.scrollTop = 0")
|
|
81
|
-
time.sleep(0.8)
|
|
82
|
-
|
|
83
|
-
for i in range(80):
|
|
84
|
-
items = js("""
|
|
85
|
-
Array.from(document.querySelectorAll("[data-videoid]")).map(e => [
|
|
86
|
-
e.getAttribute("data-videoid"),
|
|
87
|
-
(e.innerText||"").split("\\n")[0]
|
|
88
|
-
.replace(/^Add /, "").replace(/ for bulk actions$/, "").trim()
|
|
89
|
-
])
|
|
90
|
-
""")
|
|
91
|
-
for id_, title in (items or []):
|
|
92
|
-
ids_seen[id_] = title
|
|
93
|
-
|
|
94
|
-
js("""
|
|
95
|
-
(() => {
|
|
96
|
-
const a = document.querySelectorAll("[data-videoid]");
|
|
97
|
-
if (a.length) a[a.length - 1].scrollIntoView({block: "end"});
|
|
98
|
-
})()
|
|
99
|
-
""")
|
|
100
|
-
time.sleep(0.6)
|
|
101
|
-
|
|
102
|
-
if len(ids_seen) == prev:
|
|
103
|
-
stuck += 1
|
|
104
|
-
else:
|
|
105
|
-
stuck = 0
|
|
106
|
-
prev = len(ids_seen)
|
|
107
|
-
if stuck > 12:
|
|
108
|
-
break
|
|
109
|
-
|
|
110
|
-
print(f"collected {len(ids_seen)} videos")
|
|
111
|
-
```
|
|
112
|
-
|
|
113
|
-
Empirical numbers from one test run on a 78-video folder:
|
|
114
|
-
- `scrollTop`-based scrolling: stuck at 60 of 78 (cap hit at `scrollTop ≈ 2967` while `scrollHeight` was `4529`).
|
|
115
|
-
- `scrollIntoView`-based scrolling: 78 of 78 in a single pass; `scrollHeight` grew to `5884` as the virtualizer extended.
|
|
116
|
-
|
|
117
|
-
The `stuck` counter is the signal for "done": the loop exits once more than 12 consecutive iterations add no new IDs. `paging.total`-style metadata is not exposed in the DOM, and the visible "78 videos" header at the top is a separate widget that was not observed to refresh after scrolling (its sync with the rendered list was not fully verified).
|
|
118
|
-
|
|
119
|
-
---
|
|
120
|
-
|
|
121
|
-
## 4. Endpoints that look promising but don't help
|
|
122
|
-
|
|
123
|
-
For completeness — a few dead ends so the next agent doesn't waste time:
|
|
124
|
-
|
|
125
|
-
- `https://www.loom.com/v1/folders/<id>?limit=10000` — works for `/share/folder/...` IDs (this is what `LoomFolderIE` uses), returns `Forbidden` for library folder IDs even with the user's cookies.
|
|
126
|
-
- `https://www.loom.com/graphql` — fires hundreds of times during page load. A folder-listing operation almost certainly lives in there, but the `query` strings come from the bundled React app and would have to be reverse-engineered from the JS bundle. Likely brittle long-term. Reading the rendered DOM is more durable.
|
|
127
|
-
- `performance.getEntriesByType("resource")` — useful for proving these endpoints exist, but only returns URLs/timings, not request bodies.
|
|
128
|
-
|
|
129
|
-
---
|
|
130
|
-
|
|
131
|
-
## 5. Pipe to yt-dlp for the actual download
|
|
132
|
-
|
|
133
|
-
The DOM scrape gives you IDs. Hand them to yt-dlp for the bytes — don't try to grab MP4 URLs yourself. yt-dlp already knows the GraphQL flow for single videos (`LoomIE`), handles CDN signature URLs, and merges audio + video tracks.
|
|
134
|
-
|
|
135
|
-
```bash
|
|
136
|
-
# One-time: cache cookies from Chrome (saves ~2s/video on bulk runs)
|
|
137
|
-
yt-dlp --cookies-from-browser chrome --cookies /tmp/loom_cookies.txt \
|
|
138
|
-
--skip-download --no-warnings \
|
|
139
|
-
"https://www.loom.com/share/<any-known-id>" >/dev/null
|
|
140
|
-
|
|
141
|
-
# Bulk: 16 videos in parallel, single-stream 1080p MP4 (~10× faster than HLS default)
|
|
142
|
-
download_one() {
|
|
143
|
-
yt-dlp --cookies /tmp/loom_cookies.txt \
|
|
144
|
-
-f http-transcoded \
|
|
145
|
-
-o "%(title)s.%(ext)s" \
|
|
146
|
-
--no-progress --no-warnings --no-mtime --no-overwrites \
|
|
147
|
-
"https://www.loom.com/share/$1"
|
|
148
|
-
}
|
|
149
|
-
export -f download_one
|
|
150
|
-
|
|
151
|
-
cat /tmp/loom_ids.json \
|
|
152
|
-
| python3 -c "import sys, json; [print(k) for k in json.load(sys.stdin)]" \
|
|
153
|
-
| xargs -P 16 -I {} bash -c 'download_one "$@"' _ {}
|
|
154
|
-
```
|
|
155
|
-
|
|
156
|
-
Format notes:
|
|
157
|
-
- `-f http-transcoded` is a **single HTTP MP4 stream at 1920×1080**. The default selection picks `hls-raw-3200` + `hls-raw-audio-audio`, which is also 1080p but split into ~125 fragments per video. For bulk runs the single-stream form is dramatically faster (one TCP connection per video at full bandwidth, no per-fragment overhead). For a single video the difference is negligible.
|
|
158
|
-
- Loom does not currently expose anything above 1080p for transcoded videos.
|
|
159
|
-
- `--no-overwrites` makes the bulk job idempotent: re-running picks up only what's missing.
|
|
160
|
-
|
|
161
|
-
---
|
|
162
|
-
|
|
163
|
-
## Gotchas
|
|
164
|
-
|
|
165
|
-
- **Two different folder URL families.** `loom.com/share/folder/<id>` (public) is yt-dlp territory. `loom.com/looms/videos/<slug>-<id>` (library) needs this skill. Don't mix them up — it's the difference between a one-line yt-dlp call and a DOM scrape.
|
|
166
|
-
- **`scrollTop` is silently capped.** Always use `scrollIntoView({block: "end"})` on the last rendered card. Setting `scrollTop` plateaus before the bottom of the list and gives you a partial enumeration that *looks* complete because the loop hits its idle threshold.
|
|
167
|
-
- **Never `new_tab()` for Loom.** Attach to the user's existing tab via `Target.getTargets`. New tabs accumulate in the user's Chrome profile across sessions, and the user has to clean them up.
|
|
168
|
-
- **The idle counter is the reliable end-of-list signal.** The "N videos" count in the page header may or may not be in sync with what's actually rendered (we didn't fully verify either direction). Stop once `len(ids_seen)` has gone N consecutive iterations without growing, rather than reading the header.
|
|
169
|
-
- **Title strings are wrapped in ARIA noise.** The first line of `innerText` is `Add <title> for bulk actions`. Strip both prefix and suffix before using as a filename.
|
|
170
|
-
- **One MP4 per ID via `loom.com/share/<id>`.** Library-internal share links are valid and yt-dlp accepts them — no need to transform IDs into anything fancier.
|
|
1
|
+
# Loom — Library Folder Enumeration
|
|
2
|
+
|
|
3
|
+
Field-tested against loom.com on 2026-04-26.
|
|
4
|
+
For private workspace library folders that require an authenticated session.
|
|
5
|
+
|
|
6
|
+
## TL;DR — When to use this skill vs yt-dlp
|
|
7
|
+
|
|
8
|
+
Loom has **two** kinds of folder URLs and they need different tools:
|
|
9
|
+
|
|
10
|
+
| URL pattern | Visibility | Tool |
|
|
11
|
+
|---|---|---|
|
|
12
|
+
| `loom.com/share/folder/<32-hex>` | Public-shared (anyone with link) | **yt-dlp** — `LoomFolderIE` already supports this. Skip browser-harness entirely. |
|
|
13
|
+
| `loom.com/looms/videos/<slug>-<32-hex>` | Private workspace library | **browser-harness** (this skill). yt-dlp doesn't support library folders, and the underlying `/v1/folders/<id>` endpoint returns `Forbidden` even with cookies. |
|
|
14
|
+
|
|
15
|
+
The library variant is what every Loom user sees in their own workspace sidebar. There is no public read API for it; the only programmatic route is the authenticated session in the user's browser. That's why this skill exists.
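The routing decision in the table can be made mechanically from the URL. The following is a minimal sketch, assuming the two URL shapes shown above and a 32-character hex ID; the function name and the returned labels are illustrative and not part of Loom, yt-dlp, or the harness.

```python
import re

# Patterns mirror the two rows of the table; hex case-insensitivity is an assumption.
_SHARE_FOLDER = re.compile(
    r"loom\.com/share/folder/([0-9a-fA-F]{32})(?:[/?#]|$)"
)
_LIBRARY_FOLDER = re.compile(
    r"loom\.com/looms/videos/[^/?#]*-([0-9a-fA-F]{32})(?:[/?#]|$)"
)


def classify_folder_url(url):
    """Return (tool, folder_id) for a Loom folder URL, or (None, None) if unrecognized."""
    m = _SHARE_FOLDER.search(url)
    if m:
        return "yt-dlp", m.group(1)
    m = _LIBRARY_FOLDER.search(url)
    if m:
        return "browser-harness", m.group(1)
    return None, None
```

A single-video link such as `loom.com/share/<id>` matches neither pattern and returns `(None, None)`, since it is not a folder URL.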
|
|
16
|
+
|
|
17
|
+
For the **download** itself (after enumeration), `yt-dlp --cookies-from-browser chrome -f http-transcoded` is the fast path — a single HTTP MP4 stream rather than the ~125 HLS fragments the default selection grabs. Substantially faster for bulk runs. See "Pipe to yt-dlp" at the bottom.
|
|
18
|
+
|
|
19
|
+
---
|
|
20
|
+
|
|
21
|
+
## 1. Attach to the user's open Loom tab
|
|
22
|
+
|
|
23
|
+
Always attach to the existing tab. **Do not** call `new_tab()` for `loom.com` — it spawns duplicate tabs in the user's Chrome profile (observed: four duplicate Loom tabs accumulated in one debugging session). The user has to clean those up manually afterwards.
|
|
24
|
+
|
|
25
|
+
```python
|
|
26
|
+
import time
|
|
27
|
+
tabs = cdp("Target.getTargets")
|
|
28
|
+
loom_tid = next(
|
|
29
|
+
(t["targetId"] for t in tabs.get("targetInfos", [])
|
|
30
|
+
if "loom.com/looms/videos/" in t.get("url", "")),
|
|
31
|
+
None,
|
|
32
|
+
)
|
|
33
|
+
if not loom_tid:
|
|
34
|
+
raise SystemExit("User must open the Loom library folder in Chrome first.")
|
|
35
|
+
switch_tab(loom_tid)
|
|
36
|
+
time.sleep(0.3)
|
|
37
|
+
```
|
|
38
|
+
|
|
39
|
+
If multiple Loom tabs of the same folder are already open (common after a few sessions), pick the freshest one and close the others with `cdp("Target.closeTarget", targetId=tid)` before scrolling — keeps the user's Chrome tidy and avoids future ambiguity in `Target.getTargets`.
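A minimal sketch of the duplicate-tab cleanup follows. It is a pure helper over the `targetInfos` list, so the selection logic can be checked without a browser. The helper name is hypothetical, and "freshest" is not defined by the data used here: this sketch simply keeps the first matching page target and treats other page targets with the same URL as duplicates, which is an assumption.

```python
def pick_tab_and_duplicates(target_infos, needle="loom.com/looms/videos/"):
    """Return (keep_id, duplicate_ids) among page targets whose URL contains `needle`.

    Keeps the first match (an assumption, not a real 'freshest' test) and
    reports other page targets with the exact same URL as duplicates.
    """
    pages = [
        t for t in target_infos
        if t.get("type", "page") == "page" and needle in t.get("url", "")
    ]
    if not pages:
        return None, []
    keep = pages[0]
    dupes = [t["targetId"] for t in pages[1:] if t.get("url") == keep.get("url")]
    return keep["targetId"], dupes


# Usage inside the harness (cdp and switch_tab are the helpers used above):
#   keep, dupes = pick_tab_and_duplicates(cdp("Target.getTargets").get("targetInfos", []))
#   for tid in dupes:
#       cdp("Target.closeTarget", targetId=tid)
#   switch_tab(keep)
```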
|
|
40
|
+
|
|
41
|
+
---
|
|
42
|
+
|
|
43
|
+
## 2. Selector — `[data-videoid]`
|
|
44
|
+
|
|
45
|
+
Each video card is an `<article data-videoid="<32-hex>">`. Inside, the first text line is the title (with two ARIA prefix/suffix strings to strip):
|
|
46
|
+
|
|
47
|
+
```python
|
|
48
|
+
items = js("""
|
|
49
|
+
Array.from(document.querySelectorAll("[data-videoid]")).map(e => [
|
|
50
|
+
e.getAttribute("data-videoid"),
|
|
51
|
+
(e.innerText || "")
|
|
52
|
+
.split("\\n")[0]
|
|
53
|
+
.replace(/^Add /, "")
|
|
54
|
+
.replace(/ for bulk actions$/, "")
|
|
55
|
+
.trim()
|
|
56
|
+
])
|
|
57
|
+
""")
|
|
58
|
+
# items: [[id, title], ...] for the cards currently rendered
|
|
59
|
+
```
|
|
60
|
+
|
|
61
|
+
The visible `<a href>` on the card points at `loom.com/share/<id>`, so once you have the ID you can hand it straight to yt-dlp.
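If the raw `innerText` is collected first and cleaned later, the same prefix/suffix stripping can be done on the Python side. This is a small sketch that mirrors the JavaScript above; the function name is illustrative.

```python
import re


def clean_title(inner_text):
    """Return the card title: first line, minus 'Add ' prefix and ' for bulk actions' suffix."""
    first_line = (inner_text or "").split("\n")[0]
    first_line = re.sub(r"^Add ", "", first_line)            # leading ARIA prefix
    first_line = re.sub(r" for bulk actions$", "", first_line)  # trailing ARIA suffix
    return first_line.strip()
```

Only the leading `Add ` is removed, so a title that itself starts with "Add" keeps its own text.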
|
|
62
|
+
|
|
63
|
+
---
|
|
64
|
+
|
|
65
|
+
## 3. The virtualization quirk — `scrollIntoView`, NOT `scrollTop`
|
|
66
|
+
|
|
67
|
+
Loom's library uses an aggressive virtual scroller that:
|
|
68
|
+
- Renders ~30–60 cards at a time
|
|
69
|
+
- **Caps `document.scrollingElement.scrollTop` to a value far smaller than `scrollHeight`** as long as the bottom of the list isn't the bottom of the viewport
|
|
70
|
+
- Unmounts cards above the viewport once you scroll past them
|
|
71
|
+
|
|
72
|
+
Setting `scrollTop = N` directly silently fails (the value snaps back) once you hit the cap. `window.scrollTo` behaves the same way. Mouse-wheel and PageDown via CDP weren't fully tested in our run, but given they end up at the same `scrollingElement` they're unlikely to escape the cap either.
|
|
73
|
+
|
|
74
|
+
The reliable mechanic is to take the **last currently rendered card** and scroll it into view at the bottom — the virtual scroller responds by mounting the next batch below it:
|
|
75
|
+
|
|
76
|
+
```python
|
|
77
|
+
import time
|
|
78
|
+
ids_seen = {}
|
|
79
|
+
prev = -1; stuck = 0
|
|
80
|
+
js("document.scrollingElement.scrollTop = 0")
|
|
81
|
+
time.sleep(0.8)
|
|
82
|
+
|
|
83
|
+
for i in range(80):
|
|
84
|
+
items = js("""
|
|
85
|
+
Array.from(document.querySelectorAll("[data-videoid]")).map(e => [
|
|
86
|
+
e.getAttribute("data-videoid"),
|
|
87
|
+
(e.innerText||"").split("\\n")[0]
|
|
88
|
+
.replace(/^Add /, "").replace(/ for bulk actions$/, "").trim()
|
|
89
|
+
])
|
|
90
|
+
""")
|
|
91
|
+
for id_, title in (items or []):
|
|
92
|
+
ids_seen[id_] = title
|
|
93
|
+
|
|
94
|
+
js("""
|
|
95
|
+
(() => {
|
|
96
|
+
const a = document.querySelectorAll("[data-videoid]");
|
|
97
|
+
if (a.length) a[a.length - 1].scrollIntoView({block: "end"});
|
|
98
|
+
})()
|
|
99
|
+
""")
|
|
100
|
+
time.sleep(0.6)
|
|
101
|
+
|
|
102
|
+
if len(ids_seen) == prev:
|
|
103
|
+
stuck += 1
|
|
104
|
+
else:
|
|
105
|
+
stuck = 0
|
|
106
|
+
prev = len(ids_seen)
|
|
107
|
+
if stuck > 12:
|
|
108
|
+
break
|
|
109
|
+
|
|
110
|
+
print(f"collected {len(ids_seen)} videos")
|
|
111
|
+
```
|
|
112
|
+
|
|
113
|
+
Empirical numbers from one test run on a 78-video folder:
|
|
114
|
+
- `scrollTop`-based scrolling: stuck at 60 of 78 (cap hit at `scrollTop ≈ 2967` while `scrollHeight` was `4529`).
|
|
115
|
+
- `scrollIntoView`-based scrolling: 78 of 78 in a single pass; `scrollHeight` grew to `5884` as the virtualizer extended.
|
|
116
|
+
|
|
117
|
+
The `stuck` counter is the signal for "done": the loop exits once more than 12 consecutive iterations add no new IDs. `paging.total`-style metadata is not exposed in the DOM, and the visible "78 videos" header at the top is a separate widget that was not observed to refresh after scrolling (its sync with the rendered list was not fully verified).
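The stop condition can be isolated from the browser calls to make its behavior easier to reason about. The sketch below is an illustrative restructuring of the loop above, not the skill's actual code: `read_batch` and `advance` are hypothetical callables standing in for the two `js(...)` calls, and the extra boolean return value distinguishes "went idle" from "ran out of iterations", since the fixed `range(80)` bound could otherwise end a large enumeration silently.

```python
def enumerate_until_idle(read_batch, advance, max_iters=80, idle_limit=12):
    """Collect {id: title} until more than `idle_limit` consecutive passes add nothing.

    Returns (ids_seen, went_idle). went_idle is False when the loop stopped
    only because max_iters was exhausted, so the result may be incomplete.
    """
    ids_seen = {}
    prev, stuck = -1, 0
    for _ in range(max_iters):
        for id_, title in (read_batch() or []):
            ids_seen[id_] = title
        advance()  # e.g. scrollIntoView on the last rendered card, then sleep
        if len(ids_seen) == prev:
            stuck += 1
        else:
            stuck = 0
        prev = len(ids_seen)
        if stuck > idle_limit:
            return ids_seen, True
    return ids_seen, False
```

If `went_idle` comes back `False`, raising `max_iters` and re-running is safer than trusting the partial list.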
|
|
118
|
+
|
|
119
|
+
---
|
|
120
|
+
|
|
121
|
+
## 4. Endpoints that look promising but don't help
|
|
122
|
+
|
|
123
|
+
For completeness — a few dead ends so the next agent doesn't waste time:
|
|
124
|
+
|
|
125
|
+
- `https://www.loom.com/v1/folders/<id>?limit=10000` — works for `/share/folder/...` IDs (this is what `LoomFolderIE` uses), returns `Forbidden` for library folder IDs even with the user's cookies.
|
|
126
|
+
- `https://www.loom.com/graphql` — fires hundreds of times during page load. A folder-listing operation almost certainly lives in there, but the `query` strings come from the bundled React app and would have to be reverse-engineered from the JS bundle. Likely brittle long-term. Reading the rendered DOM is more durable.
|
|
127
|
+
- `performance.getEntriesByType("resource")` — useful for proving these endpoints exist, but only returns URLs/timings, not request bodies.
|
|
128
|
+
|
|
129
|
+
---
|
|
130
|
+
|
|
131
|
+
## 5. Pipe to yt-dlp for the actual download
|
|
132
|
+
|
|
133
|
+
The DOM scrape gives you IDs. Hand them to yt-dlp for the bytes — don't try to grab MP4 URLs yourself. yt-dlp already knows the GraphQL flow for single videos (`LoomIE`), handles CDN signature URLs, and merges audio + video tracks.
|
|
134
|
+
|
|
135
|
+
```bash
|
|
136
|
+
# One-time: cache cookies from Chrome (saves ~2s/video on bulk runs)
|
|
137
|
+
yt-dlp --cookies-from-browser chrome --cookies /tmp/loom_cookies.txt \
|
|
138
|
+
--skip-download --no-warnings \
|
|
139
|
+
"https://www.loom.com/share/<any-known-id>" >/dev/null
|
|
140
|
+
|
|
141
|
+
# Bulk: 16 videos in parallel, single-stream 1080p MP4 (~10× faster than HLS default)
|
|
142
|
+
download_one() {
|
|
143
|
+
yt-dlp --cookies /tmp/loom_cookies.txt \
|
|
144
|
+
-f http-transcoded \
|
|
145
|
+
-o "%(title)s.%(ext)s" \
|
|
146
|
+
--no-progress --no-warnings --no-mtime --no-overwrites \
|
|
147
|
+
"https://www.loom.com/share/$1"
|
|
148
|
+
}
|
|
149
|
+
export -f download_one
|
|
150
|
+
|
|
151
|
+
cat /tmp/loom_ids.json \
|
|
152
|
+
| python3 -c "import sys, json; [print(k) for k in json.load(sys.stdin)]" \
|
|
153
|
+
| xargs -P 16 -I {} bash -c 'download_one "$@"' _ {}
|
|
154
|
+
```
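The pipeline above reads `/tmp/loom_ids.json` and prints its top-level keys, but the enumeration loop in section 3 only prints a count and never writes that file. A minimal sketch of the missing step, assuming the file is just the `ids_seen` dict (`{id: title}`) serialized as JSON; the helper name is illustrative.

```python
import json


def save_ids(ids_seen, path="/tmp/loom_ids.json"):
    """Write the {id: title} mapping as a JSON object whose keys are the video IDs."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(ids_seen, fh, ensure_ascii=False, indent=2)
    return path


# Usage after the enumeration loop:
#   save_ids(ids_seen)
```

Iterating a loaded JSON object yields its keys, which is why the `python3 -c` one-liner in the pipeline prints one ID per line.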
|
|
155
|
+
|
|
156
|
+
Format notes:
|
|
157
|
+
- `-f http-transcoded` is a **single HTTP MP4 stream at 1920×1080**. The default selection picks `hls-raw-3200` + `hls-raw-audio-audio`, which is also 1080p but split into ~125 fragments per video. For bulk runs the single-stream form is dramatically faster (one TCP connection per video at full bandwidth, no per-fragment overhead). For a single video the difference is negligible.
|
|
158
|
+
- Loom does not currently expose anything above 1080p for transcoded videos.
|
|
159
|
+
- `--no-overwrites` makes the bulk job idempotent: re-running picks up only what's missing.
|
|
160
|
+
|
|
161
|
+
---
|
|
162
|
+
|
|
163
|
+
## Gotchas
|
|
164
|
+
|
|
165
|
+
- **Two different folder URL families.** `loom.com/share/folder/<id>` (public) is yt-dlp territory. `loom.com/looms/videos/<slug>-<id>` (library) needs this skill. Don't mix them up — it's the difference between a one-line yt-dlp call and a DOM scrape.
|
|
166
|
+
- **`scrollTop` is silently capped.** Always use `scrollIntoView({block: "end"})` on the last rendered card. Setting `scrollTop` plateaus before the bottom of the list and gives you a partial enumeration that *looks* complete because the loop hits its idle threshold.
|
|
167
|
+
- **Never `new_tab()` for Loom.** Attach to the user's existing tab via `Target.getTargets`. New tabs accumulate in the user's Chrome profile across sessions, and the user has to clean them up.
|
|
168
|
+
- **The idle counter is the reliable end-of-list signal.** The "N videos" count in the page header may or may not be in sync with what's actually rendered (we didn't fully verify either direction). Stop once `len(ids_seen)` has gone N consecutive iterations without growing, rather than reading the header.
|
|
169
|
+
- **Title strings are wrapped in ARIA noise.** The first line of `innerText` is `Add <title> for bulk actions`. Strip both prefix and suffix before using as a filename.
|
|
170
|
+
- **One MP4 per ID via `loom.com/share/<id>`.** Library-internal share links are valid and yt-dlp accepts them — no need to transform IDs into anything fancier.
|