mistocr 0.2.4__tar.gz → 0.2.5__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {mistocr-0.2.4/mistocr.egg-info → mistocr-0.2.5}/PKG-INFO +9 -1
- {mistocr-0.2.4 → mistocr-0.2.5}/README.md +8 -0
- mistocr-0.2.5/mistocr/__init__.py +1 -0
- {mistocr-0.2.4 → mistocr-0.2.5/mistocr.egg-info}/PKG-INFO +9 -1
- {mistocr-0.2.4 → mistocr-0.2.5}/settings.ini +1 -1
- mistocr-0.2.4/mistocr/__init__.py +0 -1
- {mistocr-0.2.4 → mistocr-0.2.5}/LICENSE +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/MANIFEST.in +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/mistocr/_modidx.py +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/mistocr/core.py +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/mistocr/pipeline.py +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/mistocr/refine.py +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/mistocr.egg-info/SOURCES.txt +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/mistocr.egg-info/dependency_links.txt +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/mistocr.egg-info/entry_points.txt +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/mistocr.egg-info/not-zip-safe +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/mistocr.egg-info/requires.txt +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/mistocr.egg-info/top_level.txt +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/pyproject.toml +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/setup.cfg +0 -0
- {mistocr-0.2.4 → mistocr-0.2.5}/setup.py +0 -0
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.4
|
|
2
2
|
Name: mistocr
|
|
3
|
-
Version: 0.2.
|
|
3
|
+
Version: 0.2.5
|
|
4
4
|
Summary: Batch OCR for PDFs with heading restoration and visual content integration
|
|
5
5
|
Home-page: https://github.com/franckalbinet/mistocr
|
|
6
6
|
Author: Solveit
|
|
@@ -76,6 +76,14 @@ fundamental challenges that raw OCR output leaves unsolved:
|
|
|
76
76
|
**In short**: Complete PDF OCR with heading hierarchy fixes and image
|
|
77
77
|
descriptions for RAG and LLM pipelines.
|
|
78
78
|
|
|
79
|
+
> [!NOTE]
|
|
80
|
+
>
|
|
81
|
+
> **Want to see mistocr in action?** This
|
|
82
|
+
> [tutorial](https://share.solve.it.com/d/97f75412ca949af76a5945b4dfc443c7)
|
|
83
|
+
> demonstrates real-world PDF processing and shows how clean markdown
|
|
84
|
+
> enables structure-aware navigation through long documents—letting you
|
|
85
|
+
> find exactly what you need, fast.
|
|
86
|
+
|
|
79
87
|
## Get Started
|
|
80
88
|
|
|
81
89
|
Install latest from [pypi](https://pypi.org/project/mistocr), then:
|
|
@@ -36,6 +36,14 @@ fundamental challenges that raw OCR output leaves unsolved:
|
|
|
36
36
|
**In short**: Complete PDF OCR with heading hierarchy fixes and image
|
|
37
37
|
descriptions for RAG and LLM pipelines.
|
|
38
38
|
|
|
39
|
+
> [!NOTE]
|
|
40
|
+
>
|
|
41
|
+
> **Want to see mistocr in action?** This
|
|
42
|
+
> [tutorial](https://share.solve.it.com/d/97f75412ca949af76a5945b4dfc443c7)
|
|
43
|
+
> demonstrates real-world PDF processing and shows how clean markdown
|
|
44
|
+
> enables structure-aware navigation through long documents—letting you
|
|
45
|
+
> find exactly what you need, fast.
|
|
46
|
+
|
|
39
47
|
## Get Started
|
|
40
48
|
|
|
41
49
|
Install latest from [pypi](https://pypi.org/project/mistocr), then:
|
|
@@ -0,0 +1 @@
|
|
|
1
|
+
__version__ = "0.2.5"
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.4
|
|
2
2
|
Name: mistocr
|
|
3
|
-
Version: 0.2.
|
|
3
|
+
Version: 0.2.5
|
|
4
4
|
Summary: Batch OCR for PDFs with heading restoration and visual content integration
|
|
5
5
|
Home-page: https://github.com/franckalbinet/mistocr
|
|
6
6
|
Author: Solveit
|
|
@@ -76,6 +76,14 @@ fundamental challenges that raw OCR output leaves unsolved:
|
|
|
76
76
|
**In short**: Complete PDF OCR with heading hierarchy fixes and image
|
|
77
77
|
descriptions for RAG and LLM pipelines.
|
|
78
78
|
|
|
79
|
+
> [!NOTE]
|
|
80
|
+
>
|
|
81
|
+
> **Want to see mistocr in action?** This
|
|
82
|
+
> [tutorial](https://share.solve.it.com/d/97f75412ca949af76a5945b4dfc443c7)
|
|
83
|
+
> demonstrates real-world PDF processing and shows how clean markdown
|
|
84
|
+
> enables structure-aware navigation through long documents—letting you
|
|
85
|
+
> find exactly what you need, fast.
|
|
86
|
+
|
|
79
87
|
## Get Started
|
|
80
88
|
|
|
81
89
|
Install latest from [pypi](https://pypi.org/project/mistocr), then:
|
|
@@ -1 +0,0 @@
|
|
|
1
|
-
__version__ = "0.2.4"
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|