mineru-open-sdk 0.2.0 → 0.2.5
- package/README.md +5 -2
- package/README.zh-CN.md +5 -2
- package/package.json +1 -1
package/README.md
CHANGED
@@ -31,7 +31,7 @@ npm install mineru-open-sdk
 
 ## 🛠️ Quick Start
 
-### 1. Flash Extract (Fast, No Auth
+### 1. Flash Extract (Fast, No Auth)
 Ideal for quick previews. No token required.
 
 ```typescript
@@ -119,7 +119,7 @@ console.log(result.images);
 | **File Limit** | Max 10 MB | Max 200 MB |
 | **Page Limit** | Max 20 Pages | Max 600 Pages |
 | **Formats** | PDF, Images, Docx, PPTx, Excel | PDF, Images, Doc/x, Ppt/x, Html |
-| **Content** | Markdown
+| **Content** | Markdown (formula & table on by default, OCR off) | Full assets (Images, Tables, Formulas) |
 | **Output** | Markdown | MD, Docx, LaTeX, HTML, JSON |
 
 ---
@@ -165,6 +165,9 @@ These defaults apply to `extract()`, `submit()`, `extractBatch()`, and `submitBa
 | :--- | :--- | :--- |
 | `language` | `"ch"` | Chinese is the default |
 | `pageRange` | `undefined` | Full page range allowed by the flash API |
+| `ocr` | `undefined` | OCR is off (API default) |
+| `formula` | `undefined` | Formula recognition is on (API default) |
+| `table` | `undefined` | Table recognition is on (API default) |
 | `timeout` | `300` seconds | Max total polling time |
 
 ---
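The three added rows document options that the SDK leaves as `undefined`, so the API's own default decides the behavior. A minimal sketch of that pattern follows. It assumes a hypothetical `FlashOptions` shape and two hypothetical helpers, `buildFlashPayload` and `effectiveFlashSettings`; none of these names come from the SDK, and the `string` type for `pageRange` is also an assumption.

```typescript
// Hypothetical option shape mirroring the table rows in this hunk.
// This is NOT the SDK's real type; field types are assumptions.
interface FlashOptions {
  language?: string;
  pageRange?: string; // assumed type
  ocr?: boolean;
  formula?: boolean;
  table?: boolean;
  timeout?: number; // seconds, client-side polling limit
}

// Hypothetical helper: only forward options the caller actually set,
// so that an omitted field falls back to the API default.
function buildFlashPayload(opts: FlashOptions = {}): Record<string, unknown> {
  const payload: Record<string, unknown> = {
    language: opts.language ?? "ch", // documented default
  };
  if (opts.pageRange !== undefined) payload.pageRange = opts.pageRange;
  if (opts.ocr !== undefined) payload.ocr = opts.ocr;
  if (opts.formula !== undefined) payload.formula = opts.formula;
  if (opts.table !== undefined) payload.table = opts.table;
  return payload;
}

// Hypothetical helper: the behavior a caller should expect, taken from
// the "API default" column of the table (OCR off, formula on, table on).
function effectiveFlashSettings(opts: FlashOptions = {}) {
  return {
    language: opts.language ?? "ch",
    ocr: opts.ocr ?? false,
    formula: opts.formula ?? true,
    table: opts.table ?? true,
    timeoutSeconds: opts.timeout ?? 300,
  };
}
```

Using `??` rather than `||` matters here: an explicit `false` for `formula` or `table` has to override the documented default of on, and `||` would discard it.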
package/README.zh-CN.md
CHANGED
@@ -31,7 +31,7 @@ npm install mineru-open-sdk
 
 ## 🛠️ Quick Start
 
-### 1. Agent Lightweight Parsing (Flash Extract -
+### 1. Agent Lightweight Parsing (Flash Extract - No Login)
 Ideal for quick previews. No token configuration required.
 
 ```typescript
@@ -119,7 +119,7 @@ console.log(result.images);
 | **File Size Limit** | Max 10 MB | Max 200 MB |
 | **Page Limit** | Max 20 pages | Max 600 pages |
 | **Supported Formats** | PDF, Images, Docx, PPTx, Excel | PDF, Images, Doc/x, Ppt/x, Html |
-| **Content Completeness** |
+| **Content Completeness** | Markdown (formulas and tables on by default, OCR off by default) | Full assets (images, tables, and formulas all preserved) |
 | **Output Format** | Markdown | MD, Docx, LaTeX, HTML, JSON |
 
 ---
@@ -165,6 +165,9 @@ console.log(result.images);
 | :--- | :--- | :--- |
 | `language` | `"ch"` | Chinese by default |
 | `pageRange` | `undefined` | Processes the full page range allowed by the flash API by default |
+| `ocr` | `undefined` | OCR is off by default (API default behavior) |
+| `formula` | `undefined` | Formula recognition is on by default (API default behavior) |
+| `table` | `undefined` | Table recognition is on by default (API default behavior) |
 | `timeout` | `300` seconds | Total polling timeout |
 
 ---