PyPI - cnocr - Versions diffs - 2.2.4.2__tar.gz → 2.3.0.1__tar.gz - Mend

cnocr 2.2.4.2tar.gz → 2.3.0.1tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (61) hide show

{cnocr-2.2.4.2 → cnocr-2.3.0.1}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.1
 Name: cnocr
-Version: 2.2.4.2
+Version: 2.3.0.1
 Summary: Python3 package for Chinese/English OCR, with small pretrained models
 Home-page: https://github.com/breezedeus/cnocr
 Author: breezedeus
@@ -33,6 +33,7 @@ License-File: LICENSE
   <div>&nbsp;</div>
 [![Downloads](https://static.pepy.tech/personalized-badge/cnocr?period=total&units=international_system&left_color=grey&right_color=orange&left_text=Downloads)](https://pepy.tech/project/cnocr)
+[![Visitors](https://api.visitorbadge.io/api/visitors?path=https%3A%2F%2Fgithub.com%2Fbreezedeus%2FCnOCR&label=Visitors&countColor=%23f5c791&style=flat&labelStyle=none)](https://visitorbadge.io/status?path=https%3A%2F%2Fgithub.com%2Fbreezedeus%2FCnOCR)
 [![license](https://img.shields.io/github/license/breezedeus/cnocr)](./LICENSE)
 [![Docs](https://readthedocs.org/projects/cnocr/badge/?version=latest)](https://cnocr.readthedocs.io/zh/latest/?badge=latest)
 [![PyPI version](https://badge.fury.io/py/cnocr.svg)](https://badge.fury.io/py/cnocr)
@@ -67,15 +68,23 @@ License-File: LICENSE
 ---
 </div>
-### [Update 2023.09.27]：发布 V2.2.4
+### [Update 2023.12.24]：发布 V2.3
 主要变更：
-* 加入了纯数字识别系列模型 `number-*`（见 [识别模型列表](#可使用的识别模型)），可用于纯数字识别场景，如银行卡识别、身份证识别、硬币年份识别等；
-* 对各个包的新版做了接口适配，如 `pytorch_lightning`、`onnxruntime`、`pillow`等；
-* 优化了训练过程使用的数据增强方式，并借鉴了**Nougat** 中的数据增强方法；
-* 增加了对更大模型的支持，如 `densenet-lite-666`、`gru_large` 等；
-* 以前的 `*-gru` 系列模型，现在也有 ONNX 版了；
-* 修复了一堆的bugs，如 `val-complete_match-epoch` 训练过程一直为 `0` 等。
+* 重新训练了所有的模型，比上一版精度更高。
+* 按使用场景把模型分为几大类场景（见 [识别模型列表](#可使用的识别模型)）：
+  * `scene`：场景图片，适合识别一般拍照图片中的文字。此类模型以 `scene-` 开头，如模型 `scene-densenet_lite_136-gru`。
+  * `doc`：文档图片，适合识别规则文档的截图图片，如书籍扫描件等。此类模型以 `doc-` 开头，如模型 `doc-densenet_lite_136-gru`。
+  * `number`：仅识别**纯数字**（只能识别 `0~9` 十个数字）图片，适合银行卡号、身份证号等场景。此类模型以 `number-` 开头，如模型 `number-densenet_lite_136-gru`。
+  * `general`: 通用场景，适合图片无明显倾向的一般图片。此类模型无特定开头，与旧版模型名称保持一致，如模型 `densenet_lite_136-gru`。
+  > 注意 ⚠️：以上说明仅为参考，具体选择模型时建议以实际效果为准。
+* 加入了两个更大的系列模型：
+  * `*-densenet_lite_246-gru_base`：优先供 **知识星球** [**CnOCR/CnSTD私享群**](https://t.zsxq.com/FEYZRJQ) 会员使用，一个月后会免费开源。
+  * `*-densenet_lite_666-gru_large`：Pro 模型，购买后可使用。
+更多细节请参考：[CnOCR V2.3 新版发布：模型更好、更多、更大 | Breezedeus.com](https://www.breezedeus.com/article/cnocr-v2.3-better-more)。
 [**CnOCR**](https://github.com/breezedeus/cnocr) 是 **Python 3** 下的**文字识别**（**Optical Character Recognition**，简称**OCR**）工具包，支持**简体中文**、**繁体中文**（部分模型）、**英文**和**数字**的常见字符识别，支持竖排文字的识别。自带了**20+个** [训练好的模型](https://cnocr.readthedocs.io/zh/latest/models/)，适用于不同应用场景，安装后即可直接使用。同时，CnOCR也提供简单的[训练命令](https://cnocr.readthedocs.io/zh/latest/train/)供使用者训练自己的模型。欢迎扫码加小助手为好友，备注 `ocr`，小助手会定期统一邀请大家入群：
@@ -85,7 +94,16 @@ License-File: LICENSE
 </div>
-作者也维护 **知识星球** [**CnOCR/CnSTD私享群**](https://t.zsxq.com/FEYZRJQ) ，这里面的提问会较快得到作者的回复，欢迎加入。**知识星球私享群**也会陆续发布一些CnOCR/CnSTD相关的私有资料，包括[**更详细的训练教程**](https://articles.zsxq.com/id_u6b4u0wrf46e.html)，**未公开的模型**，**不同应用场景的调用代码**，使用过程中遇到的难题解答等。本群也会发布OCR/STD相关的最新研究资料。此外，**私享群中作者每月提供两次免费特有数据的训练服务**。
+作者也维护 **知识星球** [**CnOCR/CnSTD私享群**](https://t.zsxq.com/FEYZRJQ) ，这里面的提问会较快得到作者的回复，欢迎加入。**知识星球会员** 可享受以下福利：
+- 可免费下载部分**未开源的付费模型**；
+- 购买其他所有的付费模型一律八折优化；
+- 作者快速回复使用过程中遇到的各种困难；
+- 作者每月提供两次免费特有数据的训练服务。
+- 星球会陆续发布一些CnOCR/CnSTD相关的私有资料；
+- 星球会持续发布 OCR/STD/CV 等相关的最新研究资料。
 ## 详细文档
@@ -278,7 +296,15 @@ $ pip install cnocr[ort-gpu]
-安装速度慢的话，可以指定国内的安装源，如使用豆瓣源：
+如果要训练自己的模型，，可以使用以下命令安装：
+```bash
+$ pip install cnocr[dev]
+```
+安装速度慢的话，可以指定国内的安装源，如使用阿里云的安装源：
 ```bash
 $ pip install cnocr[ort-cpu] -i https://mirrors.aliyun.com/pypi/simple
@@ -363,7 +389,7 @@ print(ocr_out)
 ### 可使用的检测模型
-参考CnSTD的下载方式。
+具体参考 [CnSTD的下载说明](https://github.com/breezedeus/CnSTD?tab=readme-ov-file#%E5%B7%B2%E6%9C%89std%E6%A8%A1%E5%9E%8B)。
 | `det_model_name`                                             | PyTorch 版本 | ONNX 版本 | 模型原始来源 | 模型文件大小 | 支持语言                       | 是否支持竖排文字识别 |
 | ------------------------------------------------------------ | ------------ | --------- | ------------ | ------------ | ------------------------------ | -------------------- |
@@ -382,17 +408,36 @@ print(ocr_out)
 ### 可使用的识别模型
+相比于 CnOCR V2.2.* 版本，**V2.3** 中的大部分模型都经过了重新训练和精调，精度比旧版模型更高。同时，加入了两个参数量更多的模型系列：
+  * `*-densenet_lite_246-gru_base`：优先供 **知识星球** [**CnOCR/CnSTD私享群**](https://t.zsxq.com/FEYZRJQ) 会员使用，2024 年 2 月都会免费开源。
+  * `*-densenet_lite_666-gru_large`：**Pro 模型**，购买后可使用。购买链接见文档：
+**V2.3** 中的模型按使用场景可以分为以下几大类：
+* `scene`：场景图片，适合识别一般拍照图片中的文字。此类模型以 `scene-` 开头，如模型 `scene-densenet_lite_136-gru`。
+* `doc`：文档图片，适合识别规则文档的截图图片，如书籍扫描件等。此类模型以 `doc-` 开头，如模型 `doc-densenet_lite_136-gru`。
+* `number`：仅识别**纯数字**（只能识别 `0~9` 十个数字）图片，适合银行卡号、身份证号等场景。此类模型以 `number-` 开头，如模型 `number-densenet_lite_136-gru`。
+* `general`: 通用场景，适合图片无明显倾向的一般图片。此类模型无特定开头，与旧版模型名称保持一致，如模型 `densenet_lite_136-gru`。
+> 注意 ⚠️：以上说明仅供参考，具体选择模型时建议以实际效果为准。
+更多说明见：[可用模型](https://cnocr.readthedocs.io/zh/latest/models/)。
 | `rec_model_name`                                             | PyTorch 版本 | ONNX 版本 | 模型原始来源 | 模型文件大小 | 支持语言                            | 是否支持竖排文字识别 |
 | ------------------------------------------------------------ | ------------ | --------- | ------------ | ------------ | ----------------------------------- | -------------------- |
+| **densenet_lite_136-gru** 🆕                                  | √            | √         | cnocr        | 12 M         | 简体中文、英文、数字                | X                    |
+| **scene-densenet_lite_136-gru** 🆕                            | √            | √         | cnocr        | 12 M         | 简体中文、英文、数字                | X                    |
+| **doc-densenet_lite_136-gru** 🆕                              | √            | √         | cnocr        | 12 M         | 简体中文、英文、数字                | X                    |
+| **densenet_lite_246-gru_base** 🆕 <br /> ([星球会员](https://t.zsxq.com/FEYZRJQ)专享，2 月开源) | √            | √         | cnocr        | 25 M         | 简体中文、英文、数字                | X                    |
+| **scene-densenet_lite_246-gru_base** 🆕 <br /> ([星球会员](https://t.zsxq.com/FEYZRJQ)专享，2 月开源) | √            | √         | cnocr        | 25 M         | 简体中文、英文、数字                | X                    |
+| **doc-densenet_lite_246-gru_base** 🆕 <br /> ([星球会员](https://t.zsxq.com/FEYZRJQ)专享，2 月开源) | √            | √         | cnocr        | 25 M         | 简体中文、英文、数字                | X                    |
+| **densenet_lite_666-gru_large** 🆕 <br />（购买链接：[B站](https://gf.bilibili.com/item/detail/1104812055)、[Lemon Squeezy](https://ocr.lemonsqueezy.com/)） | √            | √         | cnocr        | 82 M         | 简体中文、英文、数字                | X                    |
+| **scene-densenet_lite_666-gru_large** 🆕 <br />（购买链接：[B站](https://gf.bilibili.com/item/detail/1104815055)、[Lemon Squeezy](https://ocr.lemonsqueezy.com/)） | √            | √         | cnocr        | 82 M         | 简体中文、英文、数字                | X                    |
+| **doc-densenet_lite_666-gru_large** 🆕 <br />（购买链接：[B站](https://gf.bilibili.com/item/detail/1104820055)、[Lemon Squeezy](https://ocr.lemonsqueezy.com/)） | √            | √         | cnocr        | 82 M         | 简体中文、英文、数字                | X                    |
 | **number-densenet_lite_136-fc** 🆕                            | √            | √         | cnocr        | 2.7 M        | **纯数字**（仅包含 `0~9` 十个数字） | X                    |
-| **number-densenet_lite_136-gru**  🆕 <br /> ([星球会员](https://t.zsxq.com/FEYZRJQ)专享) | √            | √         | cnocr        | 5.5 M        | **纯数字**（仅包含 `0~9` 十个数字） | X                    |
-| **number-densenet_lite_666-gru_large** 🆕 <br />（[购买链接](https://gf.bilibili.com/item/detail/1104055055)） | √            | √         | cnocr        | M            | **纯数字**（仅包含 `0~9` 十个数字） | X                    |
-| densenet_lite_114-fc                                         | √            | √         | cnocr        | 4.9 M        | 简体中文、英文、数字                | X                    |
-| densenet_lite_124-fc                                         | √            | √         | cnocr        | 5.1 M        | 简体中文、英文、数字                | X                    |
-| densenet_lite_134-fc                                         | √            | √         | cnocr        | 5.4 M        | 简体中文、英文、数字                | X                    |
-| densenet_lite_136-fc                                         | √            | √         | cnocr        | 5.9 M        | 简体中文、英文、数字                | X                    |
-| densenet_lite_134-gru                                        | √            | √         | cnocr        | 11 M         | 简体中文、英文、数字                | X                    |
-| densenet_lite_136-gru                                        | √            | √         | cnocr        | 12 M         | 简体中文、英文、数字                | X                    |
+| **number-densenet_lite_136-gru**  🆕 <br /> ([星球会员](https://t.zsxq.com/FEYZRJQ)专享，2 月开源) | √            | √         | cnocr        | 5.5 M        | **纯数字**（仅包含 `0~9` 十个数字） | X                    |
+| **number-densenet_lite_666-gru_large** 🆕 <br />（购买链接：[B站](https://gf.bilibili.com/item/detail/1104055055)、[Lemon Squeezy](https://ocr.lemonsqueezy.com/)） | √            | √         | cnocr        | 55 M         | **纯数字**（仅包含 `0~9` 十个数字） | X                    |
 | ch_PP-OCRv3                                                  | X            | √         | ppocr        | 10 M         | 简体中文、英文、数字                | √                    |
 | ch_ppocr_mobile_v2.0                                         | X            | √         | ppocr        | 4.2 M        | 简体中文、英文、数字                | √                    |
 | en_PP-OCRv3                                                  | X            | √         | ppocr        | 8.5 M        | **英文**、数字                      | √                    |
@@ -414,7 +459,8 @@ print(ocr_out)
 * [x] 基于 PyTorch 训练更高效的模型
 * [x] 支持列格式的文字识别
 * [x] 打通与 [CnSTD](https://github.com/breezedeus/cnstd) 的无缝衔接（since `V2.2`）
-* [ ] 支持更多的应用场景，如公式识别、表格识别、版面分析等
+* [ ] 模型精度进一步优化
+* [ ] 支持更多的应用场景
@@ -426,3 +472,5 @@ print(ocr_out)
 官方代码库：[https://github.com/breezedeus/cnocr](https://github.com/breezedeus/cnocr)。

{cnocr-2.2.4.2 → cnocr-2.3.0.1}/README.md RENAMED Viewed

@@ -3,6 +3,7 @@
   <div>&nbsp;</div>
 [![Downloads](https://static.pepy.tech/personalized-badge/cnocr?period=total&units=international_system&left_color=grey&right_color=orange&left_text=Downloads)](https://pepy.tech/project/cnocr)
+[![Visitors](https://api.visitorbadge.io/api/visitors?path=https%3A%2F%2Fgithub.com%2Fbreezedeus%2FCnOCR&label=Visitors&countColor=%23f5c791&style=flat&labelStyle=none)](https://visitorbadge.io/status?path=https%3A%2F%2Fgithub.com%2Fbreezedeus%2FCnOCR)
 [![license](https://img.shields.io/github/license/breezedeus/cnocr)](./LICENSE)
 [![Docs](https://readthedocs.org/projects/cnocr/badge/?version=latest)](https://cnocr.readthedocs.io/zh/latest/?badge=latest)
 [![PyPI version](https://badge.fury.io/py/cnocr.svg)](https://badge.fury.io/py/cnocr)
@@ -37,15 +38,23 @@
 ---
 </div>
-### [Update 2023.09.27]：发布 V2.2.4
+### [Update 2023.12.24]：发布 V2.3
 主要变更：
-* 加入了纯数字识别系列模型 `number-*`（见 [识别模型列表](#可使用的识别模型)），可用于纯数字识别场景，如银行卡识别、身份证识别、硬币年份识别等；
-* 对各个包的新版做了接口适配，如 `pytorch_lightning`、`onnxruntime`、`pillow`等；
-* 优化了训练过程使用的数据增强方式，并借鉴了**Nougat** 中的数据增强方法；
-* 增加了对更大模型的支持，如 `densenet-lite-666`、`gru_large` 等；
-* 以前的 `*-gru` 系列模型，现在也有 ONNX 版了；
-* 修复了一堆的bugs，如 `val-complete_match-epoch` 训练过程一直为 `0` 等。
+* 重新训练了所有的模型，比上一版精度更高。
+* 按使用场景把模型分为几大类场景（见 [识别模型列表](#可使用的识别模型)）：
+  * `scene`：场景图片，适合识别一般拍照图片中的文字。此类模型以 `scene-` 开头，如模型 `scene-densenet_lite_136-gru`。
+  * `doc`：文档图片，适合识别规则文档的截图图片，如书籍扫描件等。此类模型以 `doc-` 开头，如模型 `doc-densenet_lite_136-gru`。
+  * `number`：仅识别**纯数字**（只能识别 `0~9` 十个数字）图片，适合银行卡号、身份证号等场景。此类模型以 `number-` 开头，如模型 `number-densenet_lite_136-gru`。
+  * `general`: 通用场景，适合图片无明显倾向的一般图片。此类模型无特定开头，与旧版模型名称保持一致，如模型 `densenet_lite_136-gru`。
+  > 注意 ⚠️：以上说明仅为参考，具体选择模型时建议以实际效果为准。
+* 加入了两个更大的系列模型：
+  * `*-densenet_lite_246-gru_base`：优先供 **知识星球** [**CnOCR/CnSTD私享群**](https://t.zsxq.com/FEYZRJQ) 会员使用，一个月后会免费开源。
+  * `*-densenet_lite_666-gru_large`：Pro 模型，购买后可使用。
+更多细节请参考：[CnOCR V2.3 新版发布：模型更好、更多、更大 | Breezedeus.com](https://www.breezedeus.com/article/cnocr-v2.3-better-more)。
 [**CnOCR**](https://github.com/breezedeus/cnocr) 是 **Python 3** 下的**文字识别**（**Optical Character Recognition**，简称**OCR**）工具包，支持**简体中文**、**繁体中文**（部分模型）、**英文**和**数字**的常见字符识别，支持竖排文字的识别。自带了**20+个** [训练好的模型](https://cnocr.readthedocs.io/zh/latest/models/)，适用于不同应用场景，安装后即可直接使用。同时，CnOCR也提供简单的[训练命令](https://cnocr.readthedocs.io/zh/latest/train/)供使用者训练自己的模型。欢迎扫码加小助手为好友，备注 `ocr`，小助手会定期统一邀请大家入群：
@@ -55,7 +64,16 @@
 </div>
-作者也维护 **知识星球** [**CnOCR/CnSTD私享群**](https://t.zsxq.com/FEYZRJQ) ，这里面的提问会较快得到作者的回复，欢迎加入。**知识星球私享群**也会陆续发布一些CnOCR/CnSTD相关的私有资料，包括[**更详细的训练教程**](https://articles.zsxq.com/id_u6b4u0wrf46e.html)，**未公开的模型**，**不同应用场景的调用代码**，使用过程中遇到的难题解答等。本群也会发布OCR/STD相关的最新研究资料。此外，**私享群中作者每月提供两次免费特有数据的训练服务**。
+作者也维护 **知识星球** [**CnOCR/CnSTD私享群**](https://t.zsxq.com/FEYZRJQ) ，这里面的提问会较快得到作者的回复，欢迎加入。**知识星球会员** 可享受以下福利：
+- 可免费下载部分**未开源的付费模型**；
+- 购买其他所有的付费模型一律八折优化；
+- 作者快速回复使用过程中遇到的各种困难；
+- 作者每月提供两次免费特有数据的训练服务。
+- 星球会陆续发布一些CnOCR/CnSTD相关的私有资料；
+- 星球会持续发布 OCR/STD/CV 等相关的最新研究资料。
 ## 详细文档
@@ -248,7 +266,15 @@ $ pip install cnocr[ort-gpu]
-安装速度慢的话，可以指定国内的安装源，如使用豆瓣源：
+如果要训练自己的模型，，可以使用以下命令安装：
+```bash
+$ pip install cnocr[dev]
+```
+安装速度慢的话，可以指定国内的安装源，如使用阿里云的安装源：
 ```bash
 $ pip install cnocr[ort-cpu] -i https://mirrors.aliyun.com/pypi/simple
@@ -333,7 +359,7 @@ print(ocr_out)
 ### 可使用的检测模型
-参考CnSTD的下载方式。
+具体参考 [CnSTD的下载说明](https://github.com/breezedeus/CnSTD?tab=readme-ov-file#%E5%B7%B2%E6%9C%89std%E6%A8%A1%E5%9E%8B)。
 | `det_model_name`                                             | PyTorch 版本 | ONNX 版本 | 模型原始来源 | 模型文件大小 | 支持语言                       | 是否支持竖排文字识别 |
 | ------------------------------------------------------------ | ------------ | --------- | ------------ | ------------ | ------------------------------ | -------------------- |
@@ -352,17 +378,36 @@ print(ocr_out)
 ### 可使用的识别模型
+相比于 CnOCR V2.2.* 版本，**V2.3** 中的大部分模型都经过了重新训练和精调，精度比旧版模型更高。同时，加入了两个参数量更多的模型系列：
+  * `*-densenet_lite_246-gru_base`：优先供 **知识星球** [**CnOCR/CnSTD私享群**](https://t.zsxq.com/FEYZRJQ) 会员使用，2024 年 2 月都会免费开源。
+  * `*-densenet_lite_666-gru_large`：**Pro 模型**，购买后可使用。购买链接见文档：
+**V2.3** 中的模型按使用场景可以分为以下几大类：
+* `scene`：场景图片，适合识别一般拍照图片中的文字。此类模型以 `scene-` 开头，如模型 `scene-densenet_lite_136-gru`。
+* `doc`：文档图片，适合识别规则文档的截图图片，如书籍扫描件等。此类模型以 `doc-` 开头，如模型 `doc-densenet_lite_136-gru`。
+* `number`：仅识别**纯数字**（只能识别 `0~9` 十个数字）图片，适合银行卡号、身份证号等场景。此类模型以 `number-` 开头，如模型 `number-densenet_lite_136-gru`。
+* `general`: 通用场景，适合图片无明显倾向的一般图片。此类模型无特定开头，与旧版模型名称保持一致，如模型 `densenet_lite_136-gru`。
+> 注意 ⚠️：以上说明仅供参考，具体选择模型时建议以实际效果为准。
+更多说明见：[可用模型](https://cnocr.readthedocs.io/zh/latest/models/)。
 | `rec_model_name`                                             | PyTorch 版本 | ONNX 版本 | 模型原始来源 | 模型文件大小 | 支持语言                            | 是否支持竖排文字识别 |
 | ------------------------------------------------------------ | ------------ | --------- | ------------ | ------------ | ----------------------------------- | -------------------- |
+| **densenet_lite_136-gru** 🆕                                  | √            | √         | cnocr        | 12 M         | 简体中文、英文、数字                | X                    |
+| **scene-densenet_lite_136-gru** 🆕                            | √            | √         | cnocr        | 12 M         | 简体中文、英文、数字                | X                    |
+| **doc-densenet_lite_136-gru** 🆕                              | √            | √         | cnocr        | 12 M         | 简体中文、英文、数字                | X                    |
+| **densenet_lite_246-gru_base** 🆕 <br /> ([星球会员](https://t.zsxq.com/FEYZRJQ)专享，2 月开源) | √            | √         | cnocr        | 25 M         | 简体中文、英文、数字                | X                    |
+| **scene-densenet_lite_246-gru_base** 🆕 <br /> ([星球会员](https://t.zsxq.com/FEYZRJQ)专享，2 月开源) | √            | √         | cnocr        | 25 M         | 简体中文、英文、数字                | X                    |
+| **doc-densenet_lite_246-gru_base** 🆕 <br /> ([星球会员](https://t.zsxq.com/FEYZRJQ)专享，2 月开源) | √            | √         | cnocr        | 25 M         | 简体中文、英文、数字                | X                    |
+| **densenet_lite_666-gru_large** 🆕 <br />（购买链接：[B站](https://gf.bilibili.com/item/detail/1104812055)、[Lemon Squeezy](https://ocr.lemonsqueezy.com/)） | √            | √         | cnocr        | 82 M         | 简体中文、英文、数字                | X                    |
+| **scene-densenet_lite_666-gru_large** 🆕 <br />（购买链接：[B站](https://gf.bilibili.com/item/detail/1104815055)、[Lemon Squeezy](https://ocr.lemonsqueezy.com/)） | √            | √         | cnocr        | 82 M         | 简体中文、英文、数字                | X                    |
+| **doc-densenet_lite_666-gru_large** 🆕 <br />（购买链接：[B站](https://gf.bilibili.com/item/detail/1104820055)、[Lemon Squeezy](https://ocr.lemonsqueezy.com/)） | √            | √         | cnocr        | 82 M         | 简体中文、英文、数字                | X                    |
 | **number-densenet_lite_136-fc** 🆕                            | √            | √         | cnocr        | 2.7 M        | **纯数字**（仅包含 `0~9` 十个数字） | X                    |
-| **number-densenet_lite_136-gru**  🆕 <br /> ([星球会员](https://t.zsxq.com/FEYZRJQ)专享) | √            | √         | cnocr        | 5.5 M        | **纯数字**（仅包含 `0~9` 十个数字） | X                    |
-| **number-densenet_lite_666-gru_large** 🆕 <br />（[购买链接](https://gf.bilibili.com/item/detail/1104055055)） | √            | √         | cnocr        | M            | **纯数字**（仅包含 `0~9` 十个数字） | X                    |
-| densenet_lite_114-fc                                         | √            | √         | cnocr        | 4.9 M        | 简体中文、英文、数字                | X                    |
-| densenet_lite_124-fc                                         | √            | √         | cnocr        | 5.1 M        | 简体中文、英文、数字                | X                    |
-| densenet_lite_134-fc                                         | √            | √         | cnocr        | 5.4 M        | 简体中文、英文、数字                | X                    |
-| densenet_lite_136-fc                                         | √            | √         | cnocr        | 5.9 M        | 简体中文、英文、数字                | X                    |
-| densenet_lite_134-gru                                        | √            | √         | cnocr        | 11 M         | 简体中文、英文、数字                | X                    |
-| densenet_lite_136-gru                                        | √            | √         | cnocr        | 12 M         | 简体中文、英文、数字                | X                    |
+| **number-densenet_lite_136-gru**  🆕 <br /> ([星球会员](https://t.zsxq.com/FEYZRJQ)专享，2 月开源) | √            | √         | cnocr        | 5.5 M        | **纯数字**（仅包含 `0~9` 十个数字） | X                    |
+| **number-densenet_lite_666-gru_large** 🆕 <br />（购买链接：[B站](https://gf.bilibili.com/item/detail/1104055055)、[Lemon Squeezy](https://ocr.lemonsqueezy.com/)） | √            | √         | cnocr        | 55 M         | **纯数字**（仅包含 `0~9` 十个数字） | X                    |
 | ch_PP-OCRv3                                                  | X            | √         | ppocr        | 10 M         | 简体中文、英文、数字                | √                    |
 | ch_ppocr_mobile_v2.0                                         | X            | √         | ppocr        | 4.2 M        | 简体中文、英文、数字                | √                    |
 | en_PP-OCRv3                                                  | X            | √         | ppocr        | 8.5 M        | **英文**、数字                      | √                    |
@@ -384,7 +429,8 @@ print(ocr_out)
 * [x] 基于 PyTorch 训练更高效的模型
 * [x] 支持列格式的文字识别
 * [x] 打通与 [CnSTD](https://github.com/breezedeus/cnstd) 的无缝衔接（since `V2.2`）
-* [ ] 支持更多的应用场景，如公式识别、表格识别、版面分析等
+* [ ] 模型精度进一步优化
+* [ ] 支持更多的应用场景

{cnocr-2.2.4.2 → cnocr-2.3.0.1}/cnocr/__init__.py RENAMED Viewed

@@ -1,5 +1,5 @@
 # coding: utf-8
-# Copyright (C) 2021, [Breezedeus](https://github.com/breezedeus).
+# Copyright (C) 2021-2023, [Breezedeus](https://github.com/breezedeus).
 # Licensed to the Apache Software Foundation (ASF) under one
 # or more contributor license agreements.  See the NOTICE file
 # distributed with this work for additional information
@@ -26,7 +26,7 @@ from .consts import (
     NUMBERS,
     ENG_LETTERS,
 )
-from .utils import read_img
+from .utils import read_img, set_logger
 from .cn_ocr import CnOcr
 from .recognizer import gen_model
 from .line_split import line_split

{cnocr-2.2.4.2 → cnocr-2.3.0.1}/cnocr/__version__.py RENAMED Viewed

@@ -17,4 +17,4 @@
 # specific language governing permissions and limitations
 # under the License.
-__version__ = '2.2.4.2'
+__version__ = '2.3.0.1'

{cnocr-2.2.4.2 → cnocr-2.3.0.1}/cnocr/app.py RENAMED Viewed

@@ -134,7 +134,7 @@ def main():
     all_models = list(REC_AVAILABLE_MODELS.all_models())
     all_models.sort()
-    idx = all_models.index(('densenet_lite_136-fc', 'onnx'))
+    idx = all_models.index(('densenet_lite_136-gru', 'onnx'))
     rec_model_name = st.sidebar.selectbox('选择识别模型', all_models, index=idx)
     st.sidebar.subheader('检测参数')

{cnocr-2.2.4.2 → cnocr-2.3.0.1}/cnocr/cli.py RENAMED Viewed

@@ -30,6 +30,7 @@ from multiprocessing import Process
 import subprocess
 import click
+import numpy as np
 import torchmetrics
 from torchvision import transforms as T
 import torch
@@ -42,9 +43,9 @@ from cnocr.utils import (
     save_img,
     read_img,
     draw_ocr_results,
+    read_charset,
 )
 from cnocr.data_utils.aug import NormalizeAug
-from cnocr.dataset import OcrDataModule
 from cnocr.trainer import PlTrainer, resave_model, Metrics
 from cnocr import CnOcr, gen_model
 from cnocr.recognizer import Recognizer
@@ -52,7 +53,7 @@ from cnocr.recognizer import Recognizer
 _CONTEXT_SETTINGS = {"help_option_names": ['-h', '--help']}
 logger = set_logger(log_level=logging.INFO)
-DEFAULT_MODEL_NAME = 'densenet_lite_136-fc'
+DEFAULT_MODEL_NAME = 'densenet_lite_136-gru'
 LEGAL_MODEL_NAMES = {
     enc_name + '-' + dec_name
     for enc_name in ENCODER_CONFIGS.keys()
@@ -86,6 +87,9 @@ def cli():
     required=True,
     help='识别模型训练使用的json配置文件，参考 `docs/examples/train_config.json`',
 )
+@click.option(
+    "--finetuning", is_flag=True, help="是否为精调模式（精调模式使用更温柔的transform）。默认为 `False`",
+)
 @click.option(
     '-r',
     '--resume-from-checkpoint',
@@ -105,11 +109,19 @@ def train(
     rec_model_name,
     index_dir,
     train_config_fp,
+    finetuning,
     resume_from_checkpoint,
     pretrained_model_fp,
 ):
     """训练识别模型"""
-    from cnocr.data_utils.transforms import train_transform, test_transform
+    from cnocr.dataset import OcrDataModule
+    from cnocr.data_utils.transforms import (
+        train_transform,
+        ft_transform,
+        test_transform,
+    )
     check_model_name(rec_model_name)
     # train_transform = T.Compose(
     #     [
@@ -135,7 +147,7 @@ def train(
         index_dir=index_dir,
         vocab_fp=train_config['vocab_fp'],
         img_folder=train_config['img_folder'],
-        train_transforms=train_transform,
+        train_transforms=train_transform if not finetuning else ft_transform,
         val_transforms=val_transform,
         batch_size=train_config['batch_size'],
         train_bucket_size=train_config.get('train_bucket_size'),
@@ -239,6 +251,9 @@ def visualize_example(example, fp_prefix):
 @click.option(
     "--draw-font-path", default='./docs/fonts/simfang.ttf', help="画出检测与识别效果图时使用的字体文件",
 )
+@click.option(
+    "--verbose", is_flag=True, default=False, help="是否打印详细日志信息。默认值为 `False`",
+)
 def predict(
     rec_model_name,
     rec_model_backend,
@@ -251,8 +266,14 @@ def predict(
     single_line,
     draw_results_dir,
     draw_font_path,
+    verbose,
 ):
-    """模型预测"""
+    """模型预测""",
+    if verbose:
+        logger = set_logger(log_level=logging.DEBUG)
+    else:
+        logger = set_logger(log_level=logging.INFO)
     fp_list = []
     if os.path.isfile(img_file_or_dir):
         fp_list.append(img_file_or_dir)
@@ -381,6 +402,11 @@ def evaluate(
     verbose,
 ):
     """评估模型效果。检测模型使用 `det_model_name='naive_det'` 。"""
+    if verbose:
+        logger = set_logger(log_level=logging.DEBUG)
+    else:
+        logger = set_logger(log_level=logging.INFO)
     ocr = CnOcr(
         rec_model_name=rec_model_name,
         rec_model_backend=rec_model_backend,
@@ -392,10 +418,7 @@ def evaluate(
     fn_labels_list = read_input_file(eval_index_fp)
-    metrics_config = {
-        "complete_match": {},
-        "cer": {}
-    }
+    metrics_config = {"complete_match": {}, "cer": {}}
     metrics = Metrics.from_config(metrics_config)
     cer = torchmetrics.text.CharErrorRate()
     miss_cnt, redundant_cnt = Counter(), Counter()

{cnocr-2.2.4.2 → cnocr-2.3.0.1}/cnocr/cn_ocr.py RENAMED Viewed

@@ -62,7 +62,7 @@ class OcrResult(object):
 class CnOcr(object):
     def __init__(
         self,
-        rec_model_name: str = 'densenet_lite_136-fc',
+        rec_model_name: str = 'densenet_lite_136-gru',
         *,
         det_model_name: str = 'ch_PP-OCRv3_det',
         cand_alphabet: Optional[Union[Collection, str]] = None,
@@ -82,7 +82,7 @@ class CnOcr(object):
         识别模型初始化函数。
         Args:
-            rec_model_name (str): 识别模型名称。默认为 `densenet_lite_136-fc`
+            rec_model_name (str): 识别模型名称。默认为 `densenet_lite_136-gru`
             det_model_name (str): 检测模型名称。默认为 `ch_PP-OCRv3_det`
             cand_alphabet (Optional[Union[Collection, str]]): 待识别字符所在的候选集合。默认为 `None`，表示不限定识别字符范围
             context (str): 'cpu', or 'gpu'。表明预测时是使用CPU还是GPU。默认为 `cpu`。
@@ -94,7 +94,7 @@ class CnOcr(object):
                 若训练的自有模型更改了字符集，看通过此参数传入新的字符集文件路径。
             rec_more_configs (Optional[Dict[str, Any]]): 识别模型初始化时传入的其他参数。
             rec_root (Union[str, Path]): 识别模型文件所在的根目录。
-                Linux/Mac下默认值为 `~/.cnocr`，表示模型文件所处文件夹类似 `~/.cnocr/2.2/densenet_lite_136-fc`。
+                Linux/Mac下默认值为 `~/.cnocr`，表示模型文件所处文件夹类似 `~/.cnocr/2.3/densenet_lite_136-gru`。
                 Windows下默认值为 `C:/Users/<username>/AppData/Roaming/cnocr`。
             det_model_fp (Optional[str]): 如果不使用系统自带的检测模型，可以通过此参数直接指定所使用的模型文件（'.ckpt' 文件）
             det_model_backend (str): 'pytorch', or 'onnx'。表明检测时是使用 PyTorch 版本模型，还是使用 ONNX 版本模型。
@@ -110,13 +110,13 @@ class CnOcr(object):
             >>> ocr = CnOcr()
             使用指定模型：
-            >>> ocr = CnOcr('densenet_lite_136-fc')
+            >>> ocr = CnOcr('densenet_lite_136-gru')
             识别时只考虑数字：
-            >>> ocr = CnOcr(rec_model_name='densenet_lite_136-fc', det_model_name='naive_det', cand_alphabet='0123456789')
+            >>> ocr = CnOcr(rec_model_name='densenet_lite_136-gru', det_model_name='naive_det', cand_alphabet='0123456789')
             只检测和识别水平文字：
-            >>> ocr = CnOcr(rec_model_name='densenet_lite_136-fc', det_model_name='db_shufflenet_v2_small', det_more_configs={'rotated_bbox': False})
+            >>> ocr = CnOcr(rec_model_name='densenet_lite_136-gru', det_model_name='db_shufflenet_v2_small', det_more_configs={'rotated_bbox': False})
         """
         if kwargs.get('model_name') is not None and rec_model_name is None:

cnocr 2.2.4.2__tar.gz → 2.3.0.1__tar.gz

cnocr 2.2.4.2tar.gz → 2.3.0.1tar.gz