imperium-crawl 2.5.2 → 2.5.3
- package/README.md +3 -2
- package/package.json +1 -1
package/README.md
CHANGED
@@ -177,7 +177,7 @@ Headers → TLS fingerprinting → headless browser + CAPTCHA solving. Automatic
 🧠 **Self-Improving**
 Adaptive learning engine remembers what works per domain. Second visit is 3x faster. The more you use it, the smarter it gets.
 
-🧰 **
+🧰 **39 Tools, 2 Modes**
 CLI tool or interactive TUI. Scraping, crawling, search, extraction, API discovery, WebSocket monitoring, browser automation, batch processing.
 
 📜 **14 Built-in Recipes**
@@ -193,7 +193,7 @@ Teach it once, run forever. Auto-detect patterns on any page, save as reusable s
 | Feature | **imperium-crawl** | Firecrawl | Crawl4AI | Browserbase | Puppeteer |
 |---------|:------------------:|:---------:|:--------:|:-----------:|:---------:|
 | Price | **Free forever** | $19+/month | Free | $0.01/min | Free |
-| Total tools | **
+| Total tools | **39** | 5 | 2 | 4 | N/A |
 | Stealth levels | **3 (auto-escalate)** | Cloud-based | 1 | Cloud-based | None |
 | Anti-bot detection | **7 systems** | Partial | Partial | Partial | None |
 | TLS fingerprinting | **JA3/JA4** | No | No | No | No |
@@ -350,6 +350,7 @@ Second visit to cloudflare.com:
 | Tool | What It Does |
 |------|-------------|
 | **download** | Download media files from any URL — images, video, YouTube, TikTok, bulk. **v2.5.1**: Browser-based image extraction with 100% coverage (lazy-load, shadow DOM, iframes, JSON-LD, CSS backgrounds). Target specific images via `--selector`, `--index`, `--alt-match`. Auto-click "load more" buttons. Referer injection fixes 403 on CDNs. |
+| **batch_download** | Download multiple files (PDFs, images, documents) in parallel with session cookie support. Uses L1 HTTP fetch — 10x faster than browser-based downloads. Ideal for bulk file retrieval from authenticated sessions. |
 | **rss** | Fetch and parse RSS/Atom feeds. Filter by date, output as JSON or Markdown. |
 
 ### 📦 Batch Processing (no API key needed)
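The `batch_download` row added in the hunk above describes parallel downloads over plain HTTP that reuse a session cookie. The general idea can be sketched as follows. This is a minimal illustration in Python and not imperium-crawl's implementation: the function names `http_fetch` and `batch_download`, the `fetch` and `max_workers` parameters, and the way the cookie is passed are all assumptions made for the example.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Iterable
from urllib.request import Request, urlopen


def http_fetch(url: str, cookie: str) -> bytes:
    """Plain HTTP GET that forwards an existing session cookie (no browser)."""
    request = Request(url, headers={"Cookie": cookie})
    with urlopen(request, timeout=30) as response:
        return response.read()


def batch_download(
    urls: Iterable[str],
    cookie: str,
    fetch: Callable[[str, str], bytes] = http_fetch,  # hypothetical injection point
    max_workers: int = 8,                              # hypothetical default
) -> Dict[str, bytes]:
    """Fetch every URL concurrently and return a mapping of url -> body."""
    url_list = list(urls)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so bodies line up with url_list.
        bodies = pool.map(lambda u: fetch(u, cookie), url_list)
        return dict(zip(url_list, bodies))
```

A call such as `batch_download(["https://example.com/a.pdf", "https://example.com/b.pdf"], "session=abc123")` would fetch both files in worker threads; the URLs and cookie value here are placeholders. Keeping the fetcher injectable makes the concurrency logic testable without network access.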
package/package.json
CHANGED
@@ -1,6 +1,6 @@
 {
 "name": "imperium-crawl",
-"version": "2.5.
+"version": "2.5.3",
 "description": "39-tool open-source CLI for web scraping, PDF extraction, content monitoring, reusable browser flows, RSS aggregation, and custom skills. Zero API keys for core tools.",
 "type": "module",
 "bin": {