npm - @opendataloader/pdf - Versions diffs - 2.0.1 → 2.0.2 - Mend

@opendataloader/pdf 2.0.1 → 2.0.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

package/README.md +1 -1
package/lib/opendataloader-pdf-cli.jar +0 -0
package/package.json +1 -1

package/README.md CHANGED Viewed

@@ -494,7 +494,7 @@ Yes. For digital PDFs, text extraction works out of the box. For scanned PDFs, u
 ### How fast is it?
-Local mode processes 20+ pages per second on CPU (0.05s/page). Hybrid mode processes 2+ pages per second (0.43s/page) with significantly higher accuracy for complex documents. No GPU required. Benchmarked on Apple M4. [Full benchmark details](https://github.com/opendataloader-project/opendataloader-bench)
+Local mode processes 20+ pages per second on CPU (0.05s/page). Hybrid mode processes 2+ pages per second (0.43s/page) with significantly higher accuracy for complex documents. No GPU required. Benchmarked on Apple M4. [Full benchmark details](https://github.com/opendataloader-project/opendataloader-bench). With multi-process batch processing, throughput exceeds 100 pages per second on 8+ core machines.
 ### Does it handle multi-column layouts?

package/lib/opendataloader-pdf-cli.jar CHANGED Viewed

Binary file

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@opendataloader/pdf",
-  "version": "2.0.1",
+  "version": "2.0.2",
   "description": "A Node.js wrapper for the opendataloader-pdf Java CLI.",
   "main": "./dist/index.cjs",
   "module": "./dist/index.js",