@monostate/node-scraper 1.6.0 → 1.7.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +13 -9
- package/package.json +2 -3
- package/bin/lightpanda +0 -0
package/README.md
CHANGED
|
@@ -7,7 +7,7 @@
|
|
|
7
7
|
[](../../LICENSE)
|
|
8
8
|
[](https://nodejs.org/)
|
|
9
9
|
|
|
10
|
-
##
|
|
10
|
+
## Quick Start
|
|
11
11
|
|
|
12
12
|
### Installation
|
|
13
13
|
|
|
@@ -19,18 +19,22 @@ yarn add @monostate/node-scraper
|
|
|
19
19
|
pnpm add @monostate/node-scraper
|
|
20
20
|
```
|
|
21
21
|
|
|
22
|
-
|
|
22
|
+
**Fixed in v1.7.0**: Critical cross-platform compatibility fix - binaries are now correctly downloaded per platform instead of being bundled.
|
|
23
23
|
|
|
24
|
-
|
|
24
|
+
**New in v1.6.0**: Method override support! Force specific scraping methods with `method` parameter for testing and optimization.
|
|
25
25
|
|
|
26
|
-
|
|
26
|
+
**New in v1.5.0**: AI-powered Q&A! Ask questions about any website using OpenRouter, OpenAI, or built-in AI.
|
|
27
|
+
|
|
28
|
+
**Also in v1.3.0**: PDF parsing support added! Automatically extracts text, metadata, and page count from PDF documents.
|
|
29
|
+
|
|
30
|
+
**Also in v1.2.0**: Lightpanda binary is now automatically downloaded and configured during installation! No manual setup required.
|
|
27
31
|
|
|
28
32
|
### Zero-Configuration Setup
|
|
29
33
|
|
|
30
34
|
The package now automatically:
|
|
31
|
-
-
|
|
32
|
-
-
|
|
33
|
-
-
|
|
35
|
+
- Downloads the correct Lightpanda binary for your platform (macOS, Linux, Windows/WSL)
|
|
36
|
+
- Configures binary paths and permissions
|
|
37
|
+
- Validates installation health on first use
|
|
34
38
|
|
|
35
39
|
### Basic Usage
|
|
36
40
|
|
|
@@ -106,7 +110,7 @@ const result = await smartScrape('https://example.com', { method: 'auto' });
|
|
|
106
110
|
}
|
|
107
111
|
```
|
|
108
112
|
|
|
109
|
-
##
|
|
113
|
+
## How It Works
|
|
110
114
|
|
|
111
115
|
BNCA uses a sophisticated multi-tier system with intelligent detection:
|
|
112
116
|
|
|
@@ -235,7 +239,7 @@ Clean up resources (close browser instances).
|
|
|
235
239
|
await scraper.cleanup();
|
|
236
240
|
```
|
|
237
241
|
|
|
238
|
-
###
|
|
242
|
+
### AI-Powered Q&A
|
|
239
243
|
|
|
240
244
|
Ask questions about any website and get AI-generated answers:
|
|
241
245
|
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "@monostate/node-scraper",
|
|
3
|
-
"version": "1.
|
|
3
|
+
"version": "1.7.0",
|
|
4
4
|
"description": "Intelligent web scraping with AI Q&A, PDF support and multi-level fallback system - 11x faster than traditional scrapers",
|
|
5
5
|
"type": "module",
|
|
6
6
|
"main": "index.js",
|
|
@@ -16,8 +16,7 @@
|
|
|
16
16
|
"index.d.ts",
|
|
17
17
|
"README.md",
|
|
18
18
|
"package.json",
|
|
19
|
-
"scripts/"
|
|
20
|
-
"bin/"
|
|
19
|
+
"scripts/"
|
|
21
20
|
],
|
|
22
21
|
"scripts": {
|
|
23
22
|
"postinstall": "node scripts/install-lightpanda.js"
|
package/bin/lightpanda
DELETED
|
Binary file
|