PyPI - mistral-ai-ocr - Versions diffs - 1.0__tar.gz → 1.2__tar.gz - Mend

mistral-ai-ocr 1.0tar.gz → 1.2tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (12) hide show

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/PKG-INFO RENAMED Viewed

@@ -1,10 +1,46 @@
 Metadata-Version: 2.1
 Name: mistral-ai-ocr
-Version: 1.0
+Version: 1.2
 Description-Content-Type: text/markdown
 # Mistral AI OCR
-This is a simple script that uses the Mistral AI OCR API to extract text from a PDF or image file
+This is a simple script that uses the Mistral AI OCR API to get the Markdown text from a PDF or image file
+# Usage
+## Install the Requirements
+To install the necessary requirements, run the following command:
+```sh
+pip install mistral-ai-ocr
+```
+## Typical Usage
+```sh
+mistral-ai-ocr paper.pdf
+mistral-ai-ocr paper.pdf --api-key jrWjJE5lFketfB2sA6vvhQK2SoHQ6R39
+mistral-ai-ocr paper.pdf -o revision
+mistral-ai-ocr paper.pdf -e
+mistral-ai-ocr paper.pdf -m FULL
+mistral-ai-ocr page74.jpg -e
+mistral-ai-ocr -j paper.json
+mistral-ai-ocr -j paper.json -m TEXT_NO_PAGES -n
+```
+## Arguments
+| Argument || Description |
+|-|-|-|
+| | | input PDF or image file |
+| -k API_KEY | --api-key API_KEY | Mistral API key, can be set via the **MISTRAL_API_KEY** environment variable |
+| -o OUTPUT | --output OUTPUT | output directory path. If not set, a directory will be created in the current working directory using the same stem (filename without extension) as the input file |
+| -j JSON_OCR_RESPONSE | --json-ocr-response JSON_OCR_RESPONSE | path from which to load a pre-existing JSON OCR response (any input file will be ignored) |
+| -m MODE | --mode MODE | mode of operation: either the name or numerical value of the mode. _Defaults to FULL_NO_PAGES_ |
+| -s PAGE_SEPARATOR | --page-separator PAGE_SEPARATOR | page separator to use when writing the Markdown file. _Defaults to `\n`_ |
+| -n | --no-json | do not write the JSON OCR response to a file. By default, the response is written |
+| -e | --load-dot-env | load the .env file from the current directory using [`python-dotenv`](https://pypi.org/project/python-dotenv/), to retrieve the Mistral API key |
 ## Modes
@@ -106,42 +142,6 @@ paper
 By default, the JSON response from the Mistral AI OCR API is saved in the output directory. To disable JSON output, use the `-n` or `--no-json` argument. To experiment with a different **mode** without using additional API calls, reuse an existing JSON response instead of the original input file
-# Usage
-## Install the Requirements
-To install the necessary requirements, run the following command:
-```sh
-pip install mistral-ai-ocr
-```
-## Typical Usage
-```sh
-mistral-ai-ocr paper.pdf
-mistral-ai-ocr paper.pdf --api-key jrWjJE5lFketfB2sA6vvhQK2SoHQ6R39
-mistral-ai-ocr paper.pdf -o revision
-mistral-ai-ocr paper.pdf -e
-mistral-ai-ocr paper.pdf -m FULL
-mistral-ai-ocr page74.jpg -e
-mistral-ai-ocr -j paper.json
-mistral-ai-ocr -j paper.json -m TEXT_NO_PAGES -n
-```
-## Arguments
-| Argument || Description |
-|-|-|-|
-| | | input PDF or image file |
-| -k API_KEY | --api-key API_KEY | Mistral API key, can be set via the **MISTRAL_API_KEY** environment variable |
-| -o OUTPUT | --output OUTPUT | output directory path. If not set, a directory will be created in the current working directory using the same stem (filename without extension) as the input file |
-| -j JSON_OCR_RESPONSE | --json-ocr-response JSON_OCR_RESPONSE | path from which to load a pre-existing JSON OCR response (any input file will be ignored) |
-| -m MODE | --mode MODE | mode of operation: either the name or numerical value of the mode. _Defaults to FULL_NO_PAGES_ |
-| -s PAGE_SEPARATOR | --page-separator PAGE_SEPARATOR | page separator to use when writing the Markdown file. _Defaults to `\n`_ |
-| -n | --no-json | do not write the JSON OCR response to a file. By default, the response is written |
-| -e | --load-dot-env | load the .env file from the current directory using [`python-dotenv`](https://pypi.org/project/python-dotenv/), to retrieve the Mistral API key |
 ### Mistral AI API Key
 To obtain an API key, you need a [Mistral AI](https://auth.mistral.ai/ui/registration) account. Then visit [https://admin.mistral.ai/organization/api-keys](https://admin.mistral.ai/organization/api-keys) and click the **Create new key** button

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/README.md RENAMED Viewed

@@ -1,5 +1,41 @@
 # Mistral AI OCR
-This is a simple script that uses the Mistral AI OCR API to extract text from a PDF or image file
+This is a simple script that uses the Mistral AI OCR API to get the Markdown text from a PDF or image file
+# Usage
+## Install the Requirements
+To install the necessary requirements, run the following command:
+```sh
+pip install mistral-ai-ocr
+```
+## Typical Usage
+```sh
+mistral-ai-ocr paper.pdf
+mistral-ai-ocr paper.pdf --api-key jrWjJE5lFketfB2sA6vvhQK2SoHQ6R39
+mistral-ai-ocr paper.pdf -o revision
+mistral-ai-ocr paper.pdf -e
+mistral-ai-ocr paper.pdf -m FULL
+mistral-ai-ocr page74.jpg -e
+mistral-ai-ocr -j paper.json
+mistral-ai-ocr -j paper.json -m TEXT_NO_PAGES -n
+```
+## Arguments
+| Argument || Description |
+|-|-|-|
+| | | input PDF or image file |
+| -k API_KEY | --api-key API_KEY | Mistral API key, can be set via the **MISTRAL_API_KEY** environment variable |
+| -o OUTPUT | --output OUTPUT | output directory path. If not set, a directory will be created in the current working directory using the same stem (filename without extension) as the input file |
+| -j JSON_OCR_RESPONSE | --json-ocr-response JSON_OCR_RESPONSE | path from which to load a pre-existing JSON OCR response (any input file will be ignored) |
+| -m MODE | --mode MODE | mode of operation: either the name or numerical value of the mode. _Defaults to FULL_NO_PAGES_ |
+| -s PAGE_SEPARATOR | --page-separator PAGE_SEPARATOR | page separator to use when writing the Markdown file. _Defaults to `\n`_ |
+| -n | --no-json | do not write the JSON OCR response to a file. By default, the response is written |
+| -e | --load-dot-env | load the .env file from the current directory using [`python-dotenv`](https://pypi.org/project/python-dotenv/), to retrieve the Mistral API key |
 ## Modes
@@ -101,42 +137,6 @@ paper
 By default, the JSON response from the Mistral AI OCR API is saved in the output directory. To disable JSON output, use the `-n` or `--no-json` argument. To experiment with a different **mode** without using additional API calls, reuse an existing JSON response instead of the original input file
-# Usage
-## Install the Requirements
-To install the necessary requirements, run the following command:
-```sh
-pip install mistral-ai-ocr
-```
-## Typical Usage
-```sh
-mistral-ai-ocr paper.pdf
-mistral-ai-ocr paper.pdf --api-key jrWjJE5lFketfB2sA6vvhQK2SoHQ6R39
-mistral-ai-ocr paper.pdf -o revision
-mistral-ai-ocr paper.pdf -e
-mistral-ai-ocr paper.pdf -m FULL
-mistral-ai-ocr page74.jpg -e
-mistral-ai-ocr -j paper.json
-mistral-ai-ocr -j paper.json -m TEXT_NO_PAGES -n
-```
-## Arguments
-| Argument || Description |
-|-|-|-|
-| | | input PDF or image file |
-| -k API_KEY | --api-key API_KEY | Mistral API key, can be set via the **MISTRAL_API_KEY** environment variable |
-| -o OUTPUT | --output OUTPUT | output directory path. If not set, a directory will be created in the current working directory using the same stem (filename without extension) as the input file |
-| -j JSON_OCR_RESPONSE | --json-ocr-response JSON_OCR_RESPONSE | path from which to load a pre-existing JSON OCR response (any input file will be ignored) |
-| -m MODE | --mode MODE | mode of operation: either the name or numerical value of the mode. _Defaults to FULL_NO_PAGES_ |
-| -s PAGE_SEPARATOR | --page-separator PAGE_SEPARATOR | page separator to use when writing the Markdown file. _Defaults to `\n`_ |
-| -n | --no-json | do not write the JSON OCR response to a file. By default, the response is written |
-| -e | --load-dot-env | load the .env file from the current directory using [`python-dotenv`](https://pypi.org/project/python-dotenv/), to retrieve the Mistral API key |
 ### Mistral AI API Key
 To obtain an API key, you need a [Mistral AI](https://auth.mistral.ai/ui/registration) account. Then visit [https://admin.mistral.ai/organization/api-keys](https://admin.mistral.ai/organization/api-keys) and click the **Create new key** button

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/mistral_ai_ocr/__main__.py RENAMED Viewed

@@ -49,6 +49,7 @@ def main():
   if args.load_dot_env:
     load_dotenv()
+    load_dotenv(".env")
   if args.api_key is None:
     args.api_key = getenv("MISTRAL_API_KEY")

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/mistral_ai_ocr.egg-info/PKG-INFO RENAMED Viewed

@@ -1,10 +1,46 @@
 Metadata-Version: 2.1
 Name: mistral-ai-ocr
-Version: 1.0
+Version: 1.2
 Description-Content-Type: text/markdown
 # Mistral AI OCR
-This is a simple script that uses the Mistral AI OCR API to extract text from a PDF or image file
+This is a simple script that uses the Mistral AI OCR API to get the Markdown text from a PDF or image file
+# Usage
+## Install the Requirements
+To install the necessary requirements, run the following command:
+```sh
+pip install mistral-ai-ocr
+```
+## Typical Usage
+```sh
+mistral-ai-ocr paper.pdf
+mistral-ai-ocr paper.pdf --api-key jrWjJE5lFketfB2sA6vvhQK2SoHQ6R39
+mistral-ai-ocr paper.pdf -o revision
+mistral-ai-ocr paper.pdf -e
+mistral-ai-ocr paper.pdf -m FULL
+mistral-ai-ocr page74.jpg -e
+mistral-ai-ocr -j paper.json
+mistral-ai-ocr -j paper.json -m TEXT_NO_PAGES -n
+```
+## Arguments
+| Argument || Description |
+|-|-|-|
+| | | input PDF or image file |
+| -k API_KEY | --api-key API_KEY | Mistral API key, can be set via the **MISTRAL_API_KEY** environment variable |
+| -o OUTPUT | --output OUTPUT | output directory path. If not set, a directory will be created in the current working directory using the same stem (filename without extension) as the input file |
+| -j JSON_OCR_RESPONSE | --json-ocr-response JSON_OCR_RESPONSE | path from which to load a pre-existing JSON OCR response (any input file will be ignored) |
+| -m MODE | --mode MODE | mode of operation: either the name or numerical value of the mode. _Defaults to FULL_NO_PAGES_ |
+| -s PAGE_SEPARATOR | --page-separator PAGE_SEPARATOR | page separator to use when writing the Markdown file. _Defaults to `\n`_ |
+| -n | --no-json | do not write the JSON OCR response to a file. By default, the response is written |
+| -e | --load-dot-env | load the .env file from the current directory using [`python-dotenv`](https://pypi.org/project/python-dotenv/), to retrieve the Mistral API key |
 ## Modes
@@ -106,42 +142,6 @@ paper
 By default, the JSON response from the Mistral AI OCR API is saved in the output directory. To disable JSON output, use the `-n` or `--no-json` argument. To experiment with a different **mode** without using additional API calls, reuse an existing JSON response instead of the original input file
-# Usage
-## Install the Requirements
-To install the necessary requirements, run the following command:
-```sh
-pip install mistral-ai-ocr
-```
-## Typical Usage
-```sh
-mistral-ai-ocr paper.pdf
-mistral-ai-ocr paper.pdf --api-key jrWjJE5lFketfB2sA6vvhQK2SoHQ6R39
-mistral-ai-ocr paper.pdf -o revision
-mistral-ai-ocr paper.pdf -e
-mistral-ai-ocr paper.pdf -m FULL
-mistral-ai-ocr page74.jpg -e
-mistral-ai-ocr -j paper.json
-mistral-ai-ocr -j paper.json -m TEXT_NO_PAGES -n
-```
-## Arguments
-| Argument || Description |
-|-|-|-|
-| | | input PDF or image file |
-| -k API_KEY | --api-key API_KEY | Mistral API key, can be set via the **MISTRAL_API_KEY** environment variable |
-| -o OUTPUT | --output OUTPUT | output directory path. If not set, a directory will be created in the current working directory using the same stem (filename without extension) as the input file |
-| -j JSON_OCR_RESPONSE | --json-ocr-response JSON_OCR_RESPONSE | path from which to load a pre-existing JSON OCR response (any input file will be ignored) |
-| -m MODE | --mode MODE | mode of operation: either the name or numerical value of the mode. _Defaults to FULL_NO_PAGES_ |
-| -s PAGE_SEPARATOR | --page-separator PAGE_SEPARATOR | page separator to use when writing the Markdown file. _Defaults to `\n`_ |
-| -n | --no-json | do not write the JSON OCR response to a file. By default, the response is written |
-| -e | --load-dot-env | load the .env file from the current directory using [`python-dotenv`](https://pypi.org/project/python-dotenv/), to retrieve the Mistral API key |
 ### Mistral AI API Key
 To obtain an API key, you need a [Mistral AI](https://auth.mistral.ai/ui/registration) account. Then visit [https://admin.mistral.ai/organization/api-keys](https://admin.mistral.ai/organization/api-keys) and click the **Create new key** button

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/setup.py RENAMED Viewed

@@ -6,7 +6,7 @@ with open("README.md", "r", encoding="utf-8") as fh:
 setup(
   name="mistral-ai-ocr",
-  version="1.0",
+  version="1.2",
   packages=find_packages(),
   entry_points={
     'console_scripts': [

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/mistral_ai_ocr/__init__.py RENAMED Viewed

File without changes

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/mistral_ai_ocr.egg-info/SOURCES.txt RENAMED Viewed

File without changes

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/mistral_ai_ocr.egg-info/dependency_links.txt RENAMED Viewed

File without changes

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/mistral_ai_ocr.egg-info/entry_points.txt RENAMED Viewed

File without changes

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/mistral_ai_ocr.egg-info/requires.txt RENAMED Viewed

File without changes

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/mistral_ai_ocr.egg-info/top_level.txt RENAMED Viewed

File without changes

{mistral-ai-ocr-1.0 → mistral-ai-ocr-1.2}/setup.cfg RENAMED Viewed

File without changes

mistral-ai-ocr 1.0__tar.gz → 1.2__tar.gz

mistral-ai-ocr 1.0tar.gz → 1.2tar.gz