onnx2tf 1.23.3__py3-none-any.whl → 1.25.8__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,6 +1,6 @@
  Metadata-Version: 2.1
  Name: onnx2tf
- Version: 1.23.3
+ Version: 1.25.8
  Summary: Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf).
  Home-page: https://github.com/PINTO0309/onnx2tf
  Author: Katsuya Hyodo
@@ -16,6 +16,8 @@ License-File: LICENSE_onnx-tensorflow
  # onnx2tf
  Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in [onnx-tensorflow](https://github.com/onnx/onnx-tensorflow) ([onnx-tf](https://pypi.org/project/onnx-tf/)). I don't need a Star, but give me a pull request. Since I am adding challenging model optimizations and fixing bugs almost daily, I frequently embed potential bugs that slip through CI's regression testing. Therefore, if you encounter new problems, I recommend that you try a package that is a few versions older, or try the latest package that will be released in a few days.
 
+ Incidentally, I have never used this tool in practice myself since I started working on it. It doesn't matter.
+
  <p align="center">
  <img src="https://user-images.githubusercontent.com/33194443/193840307-fa69eace-05a9-4d93-9c5d-999cf88af28e.png" />
  </p>
@@ -269,12 +271,12 @@ Video speed is adjusted approximately 50 times slower than actual speed.
 
  ## Environment
  - Linux / Windows
- - onnx==1.15.0
- - onnxruntime==1.17.1
+ - onnx==1.16.1
+ - onnxruntime==1.18.1
  - onnx-simplifier==0.4.33 or 0.4.30 `(onnx.onnx_cpp2py_export.shape_inference.InferenceError: [ShapeInferenceError] (op_type:Slice, node name: /xxxx/Slice): [ShapeInferenceError] Inferred shape and existing shape differ in rank: (x) vs (y))`
  - onnx_graphsurgeon
  - simple_onnx_processing_tools
- - tensorflow==2.16.1, Special bugs: [#436](https://github.com/PINTO0309/onnx2tf/issues/436)
+ - tensorflow==2.17.0, Special bugs: [#436](https://github.com/PINTO0309/onnx2tf/issues/436)
  - psutil==5.9.5
  - ml_dtypes==0.3.2
  - flatbuffers-compiler (Optional, Only when using the `-coion` option. Executable file named `flatc`.)
@@ -290,10 +292,14 @@ Video speed is adjusted approximately 50 times slower than actual speed.
 
  ## Sample Usage
  ### 1. Install
+ #### Note:
+ **1. If you are using TensorFlow v2.13.0 or earlier, use a version older than onnx2tf v1.17.5. onnx2tf v1.17.6 or later will not work properly due to changes in TensorFlow's API.**
 
- **Note: If you are using TensorFlow v2.13.0 or earlier, use a version older than onnx2tf v1.17.5. onnx2tf v1.17.6 or later will not work properly due to changes in TensorFlow's API.**
+ **2. The latest onnx2tf implementation is based on Keras API 3 and will not work properly if you install TensorFlow v2.15.0 or earlier.**
 
  - HostPC
+ <details><summary>Click to expand</summary><div>
+
  - When using GHCR, see `Authenticating to the Container registry`
 
  https://docs.github.com/en/packages/working-with-a-github-packages-registry/working-with-the-container-registry#authenticating-to-the-container-registry
@@ -308,7 +314,7 @@ Video speed is adjusted approximately 50 times slower than actual speed.
  docker run --rm -it \
  -v `pwd`:/workdir \
  -w /workdir \
- ghcr.io/pinto0309/onnx2tf:1.23.3
+ ghcr.io/pinto0309/onnx2tf:1.25.8
 
  or
 
@@ -316,19 +322,19 @@ Video speed is adjusted approximately 50 times slower than actual speed.
  docker run --rm -it \
  -v `pwd`:/workdir \
  -w /workdir \
- docker.io/pinto0309/onnx2tf:1.23.3
+ docker.io/pinto0309/onnx2tf:1.25.8
 
  or
 
- pip install -U onnx==1.15.0 \
+ pip install -U onnx==1.16.1 \
  && pip install -U nvidia-pyindex \
  && pip install -U onnx-graphsurgeon \
- && pip install -U onnxruntime==1.17.1 \
+ && pip install -U onnxruntime==1.18.1 \
  && pip install -U onnxsim==0.4.33 \
  && pip install -U simple_onnx_processing_tools \
  && pip install -U sne4onnx>=1.0.13 \
  && pip install -U sng4onnx>=1.0.4 \
- && pip install -U tensorflow==2.16.1 \
+ && pip install -U tensorflow==2.17.0 \
  && pip install -U protobuf==3.20.3 \
  && pip install -U onnx2tf \
  && pip install -U h5py==3.11.0 \
@@ -342,9 +348,13 @@ Video speed is adjusted approximately 50 times slower than actual speed.
  pip install -e .
  ```
 
+ </div></details>
+
  or
 
  - Google Colaboratory Python3.10
+ <details><summary>Click to expand</summary><div>
+
  ```
  !sudo apt-get -y update
  !sudo apt-get -y install python3-pip
@@ -354,11 +364,11 @@ or
  && sudo chmod +x flatc \
  && sudo mv flatc /usr/bin/
  !pip install -U pip \
- && pip install tensorflow==2.16.1 \
- && pip install -U onnx==1.15.0 \
+ && pip install tensorflow==2.17.0 \
+ && pip install -U onnx==1.16.1 \
  && python -m pip install onnx_graphsurgeon \
  --index-url https://pypi.ngc.nvidia.com \
- && pip install -U onnxruntime==1.17.1 \
+ && pip install -U onnxruntime==1.18.1 \
  && pip install -U onnxsim==0.4.33 \
  && pip install -U simple_onnx_processing_tools \
  && pip install -U onnx2tf \
@@ -370,24 +380,48 @@ or
  && pip install flatbuffers>=23.5.26
  ```
 
+ </div></details>
+
  ### 2. Run test
  Only patterns that are considered to be used particularly frequently are described. In addition, there are several other options, such as disabling Flex OP and additional options to improve inference performance. See: [CLI Parameter](#cli-parameter)
  ```bash
  # Float32, Float16
- # This is the fastest way to generate tflite,
- # but the accompanying saved_model will not have a signature.
- # "ValueError: Only support at least one signature key."
- # If you are having trouble with this error, please use the `-osd` option.
+ # This is the fastest way to generate tflite.
+ # Improved to automatically generate `signature` without `-osd` starting from v1.25.3.
+ # Also, starting from v1.24.0, efficient TFLite can be generated
+ # without unrolling `GroupConvolution`. e.g. YOLOv9, YOLOvN
+ # Conversion to other frameworks. e.g. TensorFlow.js, CoreML, etc
+ # https://github.com/PINTO0309/onnx2tf#19-conversion-to-tensorflowjs
+ # https://github.com/PINTO0309/onnx2tf#20-conversion-to-coreml
  wget https://github.com/PINTO0309/onnx2tf/releases/download/0.0.2/resnet18-v1-7.onnx
  onnx2tf -i resnet18-v1-7.onnx
 
- # saved_model with signaturedefs added.
- # Output in the form of saved_model that can be used for serving
- # or conversion to other frameworks. e.g. TensorFlow.js, CoreML, etc
- # https://github.com/PINTO0309/onnx2tf#17-conversion-to-tensorflowjs
- # https://github.com/PINTO0309/onnx2tf#18-conversion-to-coreml
- wget https://github.com/PINTO0309/onnx2tf/releases/download/0.0.2/resnet18-v1-7.onnx
- onnx2tf -i resnet18-v1-7.onnx -osd
+ ls -lh saved_model/
+
+ assets
+ fingerprint.pb
+ resnet18-v1-7_float16.tflite
+ resnet18-v1-7_float32.tflite
+ saved_model.pb
+ variables
+
+ TF_CPP_MIN_LOG_LEVEL=3 \
+ saved_model_cli show \
+ --dir saved_model \
+ --signature_def serving_default \
+ --tag_set serve
+
+ The given SavedModel SignatureDef contains the following input(s):
+ inputs['data'] tensor_info:
+ dtype: DT_FLOAT
+ shape: (-1, 224, 224, 3)
+ name: serving_default_data:0
+ The given SavedModel SignatureDef contains the following output(s):
+ outputs['output_0'] tensor_info:
+ dtype: DT_FLOAT
+ shape: (1, 1000) # <-- Model design bug in resnet18-v1-7.onnx
+ name: PartitionedCall:0
+ Method name is: tensorflow/serving/predict
 
  # In the interest of efficiency for my development and debugging of onnx2tf,
  # the default configuration shows a large amount of debug level logs.
@@ -451,9 +485,20 @@ onnx2tf -i emotion-ferplus-8.onnx -oiqt
  # INT8 Quantization (per-tensor)
  onnx2tf -i emotion-ferplus-8.onnx -oiqt -qt per-tensor
 
+ # Split the model at the middle position for debugging
+ # Specify the input name of the OP
+ wget https://github.com/PINTO0309/onnx2tf/releases/download/1.25.0/cf_fus.onnx
+ onnx2tf -i cf_fus.onnx -inimc 448
+
  # Split the model at the middle position for debugging
  # Specify the output name of the OP
- onnx2tf -i resnet18-v1-7.onnx -onimc resnetv15_stage2_conv1_fwd resnetv15_stage2_conv2_fwd
+ wget https://github.com/PINTO0309/onnx2tf/releases/download/1.25.0/cf_fus.onnx
+ onnx2tf -i cf_fus.onnx -onimc dep_sec
+
+ # Split the model at the middle position for debugging
+ # Specify the input/output name of the OP
+ wget https://github.com/PINTO0309/onnx2tf/releases/download/1.25.0/cf_fus.onnx
+ onnx2tf -i cf_fus.onnx -inimc 448 -onimc velocity
 
  # Suppress generation of Flex OP and replace with Pseudo-Function
  # [
@@ -502,6 +547,9 @@ onnx2tf -i human_segmentation_pphumanseg_2021oct.onnx -prf replace.json
  ```
 
  ### 3. Accuracy check
+
+ <details><summary>Click to expand</summary><div>
+
  Perform error checking of ONNX output and TensorFlow output. Verify that the error of all outputs, one operation at a time, is below a certain threshold. Automatically determines before and after which OPs the tool's automatic conversion of the model failed. Know where dimensional compression, dimensional expansion, and dimensional transposition by `Reshape` and `Transpose` are failing. Once you have identified the problem area, you can refer to the tutorial on [Parameter replacement](#parameter-replacement) to modify the tool's behavior.
 
  After many upgrades, the need for JSON parameter correction has become much less common, but there are still some edge cases where JSON correction is required. If the PC has sufficient free space in its RAM, onnx2tf will convert the model while carefully performing accuracy checks on all OPs. Thus, in exchange for a higher conversion success rate, the conversion speed is a little slower. If the amount of RAM required for the accuracy check is expected to exceed 80% of the total available RAM capacity of the entire PC, the conversion operation will be performed without an accuracy check. Therefore, if the accuracy of the converted model is found to be significantly degraded, the accuracy may be automatically corrected by re-conversion on a PC with a large amount of RAM. For example, my PC has 128GB of RAM, but the StableDiffusion v1.5 model is too complex in its structure and consumed about 180GB of RAM in total with 50GB of SWAP space.
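The same check can also be run from a script. A minimal sketch, assuming the `check_onnx_tf_outputs_elementwise_close_full` parameter listed under In-script Usage below (the `-cotof` equivalent); verify the exact name with `help(convert)`:

```python
from onnx2tf import convert

# Minimal sketch: full element-wise accuracy check during conversion.
# Parameter name assumed from the In-script Usage section; verify with help(convert).
convert(
    input_onnx_file_path="mobilenetv2-12.onnx",
    output_folder_path="saved_model",
    check_onnx_tf_outputs_elementwise_close_full=True,
)
```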
@@ -527,8 +575,16 @@ onnx2tf -i mobilenetv2-12.onnx -cotof -cotoa 1e-1 -cind "input" "/your/path/x.np
 
  ![Kazam_screencast_00108_](https://user-images.githubusercontent.com/33194443/212460284-f3480105-4d94-4519-94dc-320d641f5647.gif)
 
+ </div></details>
+
  ### 4. Match tflite input/output names and input/output order to ONNX
+
+ <details><summary>Click to expand</summary><div>
+
  If you want to match tflite's input/output OP names and the order of input/output OPs with ONNX, you can use the `interpreter.get_signature_runner()` to infer this after using the `-coion` / `--copy_onnx_input_output_names_to_tflite` option to output the tflite file. See: https://github.com/PINTO0309/onnx2tf/issues/228
+
+ onnx2tf automatically compares the final input/output shapes of ONNX and the generated TFLite and tries to automatically correct the input/output order as much as possible if there is a difference. However, if INT8 quantization is used and there are multiple inputs and outputs with the same shape, automatic correction may fail. This is because TFLiteConverter shuffles the input-output order by itself only when INT8 quantization is performed.
+
  ```python
  import torch
  import onnxruntime
@@ -607,7 +663,12 @@ print("[TFLite] Model Predictions:", tf_lite_output)
  ```
  ![image](https://user-images.githubusercontent.com/33194443/223318437-b89e56c1-4376-4e91-8c0c-08d29a604637.png)
 
+ </div></details>
+
  ### 5. Rewriting of tflite input/output OP names and `signature_defs`
+
+ <details><summary>Click to expand</summary><div>
+
  If you do not like tflite input/output names such as `serving_default_*:0` or `StatefulPartitionedCall:0`, you can rewrite them using the following tools and procedures. It can be rewritten from any name to any name, so it does not have to be `serving_default_*:0` or `StatefulPartitionedCall:0`.
 
  https://github.com/PINTO0309/tflite-input-output-rewriter
@@ -643,8 +704,12 @@ pip install -U tfliteiorewriter
 
  ![03](https://github.com/PINTO0309/onnx2tf/assets/33194443/f7b7be16-c69c-4593-b8b5-e1cc23e61be9)
 
+ </div></details>
 
  ### 6. Embed metadata in tflite
+
+ <details><summary>Click to expand</summary><div>
+
  If you want to embed label maps, quantization parameters, descriptions, etc. into your tflite file, you can refer to the official tutorial and try it yourself. For now, this tool does not plan to implement the ability to append metadata, as I do not want to write byte arrays to the tflite file that are not essential to its operation.
 
  - Adding metadata to TensorFlow Lite models
@@ -652,7 +717,12 @@ If you want to embed label maps, quantization parameters, descriptions, etc. int
  https://www.tensorflow.org/lite/models/convert/metadata
  ![image](https://user-images.githubusercontent.com/33194443/221345428-639ffa41-a03c-4d0b-bd72-9c23fb3847f3.png)
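For reference, the official tutorial attaches metadata with the `tflite-support` package. A minimal sketch along those lines (the model and label file names are placeholders, not files produced by this README):

```python
from tflite_support import flatbuffers
from tflite_support import metadata as _metadata
from tflite_support import metadata_schema_py_generated as _metadata_fb

# Build a minimal metadata flatbuffer.
model_meta = _metadata_fb.ModelMetadataT()
model_meta.name = "my_model"
model_meta.description = "Short description of what the model does."

b = flatbuffers.Builder(0)
b.Finish(model_meta.Pack(b), _metadata.MetadataPopulator.METADATA_FILE_IDENTIFIER)

# Write the metadata and a label file into the tflite file.
populator = _metadata.MetadataPopulator.with_model_file("model_float32.tflite")
populator.load_metadata_buffer(b.Output())
populator.load_associated_files(["labels.txt"])
populator.populate()
```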
 
+ </div></details>
+
  ### 7. If the accuracy of the INT8 quantized model degrades significantly
+
+ <details><summary>Click to expand</summary><div>
+
  It is a matter of model structure. The activation function (`SiLU`/`Swish`), kernel size and stride for `Pooling`, and kernel size and stride for `Conv` should be completely revised. See: https://github.com/PINTO0309/onnx2tf/issues/269
 
  If you want to see the difference in quantization error between `SiLU` and `ReLU`, please check this Gist by [@motokimura](https://gist.github.com/motokimura) who helped us in our research. Thanks Motoki!
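If the original model is defined in PyTorch, one way to experiment with this revision is to swap the activations before exporting to ONNX. A minimal sketch (not a feature of onnx2tf; the model usually needs re-training or fine-tuning after the swap):

```python
import torch.nn as nn

def replace_silu_with_relu(module: nn.Module) -> None:
    """Recursively replace SiLU/Swish activations with quantization-friendly ReLU."""
    for name, child in module.named_children():
        if isinstance(child, nn.SiLU):
            setattr(module, name, nn.ReLU(inplace=True))
        else:
            replace_silu_with_relu(child)

# model = YourModel()            # hypothetical model definition
# replace_silu_with_relu(model)  # swap activations, fine-tune, then export to ONNX
```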
@@ -716,7 +786,12 @@ The accuracy error rates after quantization for different activation functions a
  2. Pattern with fixed value `-128.0` padded on 4 sides of tensor
  ![image](https://github.com/PINTO0309/onnx2tf/assets/33194443/35c7d540-b304-4662-894a-af0e053642d7)
 
+ </div></details>
+
  ### 8. Calibration data creation for INT8 quantization
+
+ <details><summary>Click to expand</summary><div>
+
  Calibration data (.npy) for INT8 quantization (`-cind`) is generated as follows. This is a sample when the data used for training is image data. See: https://github.com/PINTO0309/onnx2tf/issues/222
 
  https://www.tensorflow.org/lite/performance/post_training_quantization
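A minimal sketch of assembling such a `.npy` file from a folder of images (the folder name and input size are placeholders; normalization can instead be supplied through the mean/std arguments of `-cind`):

```python
import glob
import cv2
import numpy as np

# Stack N calibration images into a single (N, H, W, C) float32 array.
files = sorted(glob.glob("calib_images/*.jpg"))  # hypothetical folder
imgs = []
for f in files:
    img = cv2.imread(f)                # BGR, uint8
    img = cv2.resize(img, (224, 224))  # match the model input resolution
    img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
    imgs.append(img.astype(np.float32))
np.save("calibdata.npy", np.stack(imgs, axis=0))
```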
@@ -768,7 +843,12 @@ e.g. How to specify calibration data in CLI or Script respectively.
  """
  ```
 
+ </div></details>
+
  ### 9. INT8 quantization of models with multiple inputs requiring non-image data
+
+ <details><summary>Click to expand</summary><div>
+
  If you do not need to perform INT8 quantization with this tool alone, the following method is the easiest.
 
  The `-osd` option will output a `saved_model.pb` in the `saved_model` folder with the full size required for quantization. That is, a default signature named `serving_default` is embedded in `.pb`. The `-b` option is used to convert the batch size by rewriting it as a static integer.
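Once the `saved_model` exists, the quantization itself is plain TensorFlow. A minimal sketch with the standard TFLiteConverter (the input shapes below are placeholders; replace the random arrays with real training samples):

```python
import numpy as np
import tensorflow as tf

def representative_dataset():
    # Yield one list per sample, in the same order as the model's inputs.
    for _ in range(100):
        yield [
            np.random.rand(1, 64).astype(np.float32),   # hypothetical input 1
            np.random.rand(1, 128).astype(np.float32),  # hypothetical input 2
        ]

converter = tf.lite.TFLiteConverter.from_saved_model("saved_model")
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
with open("model_int8.tflite", "wb") as f:
    f.write(converter.convert())
```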
@@ -848,7 +928,12 @@ https://www.tensorflow.org/lite/performance/post_training_quantization
 
  See: https://github.com/PINTO0309/onnx2tf/issues/248
 
+ </div></details>
+
  ### 10. Fixing the output of NonMaxSuppression (NMS)
+
+ <details><summary>Click to expand</summary><div>
+
  PyTorch's `NonMaxSuppression (torchvision.ops.nms)` and ONNX's `NonMaxSuppression` are not fully compatible. TorchVision's NMS is very inefficient. Therefore, ONNX generated from object detection and other models that use NMS is inevitably very redundant, and is converted with a structure that TensorFlow.js and TFLite models have difficulty taking advantage of on devices. This is due to the indefinite number of tensors output by the NMS. In this chapter, I share how to easily tune the ONNX generated using TorchVision's redundant NMS to generate an optimized NMS.
 
  1. There are multiple issues with TorchVision's NMS. First, the batch size specification is not supported; second, the `max_output_boxes_per_class` parameter cannot be specified. Please see the NMS sample ONNX part I generated. The `max_output_boxes_per_class` has been changed to `896` instead of `-Infinity`. The biggest problem with TorchVision NMS is that it generates ONNX with `max_output_boxes_per_class` set to `-Infinity` or `9223372036854775807 (Maximum value of INT64)`, resulting in a variable number of NMS outputs from zero to infinite. Thus, by rewriting `-Infinity` or `9223372036854775807 (Maximum value of INT64)` to a constant value, it is possible to output an NMS that can be effortlessly inferred by TFJS or TFLite. A sketch of this rewrite follows below.
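A minimal sketch of that rewrite with the plain `onnx` API (file names are placeholders, and it assumes `max_output_boxes_per_class` is stored as a graph initializer at NMS input index 2 rather than as a `Constant` node):

```python
import numpy as np
import onnx
from onnx import numpy_helper

model = onnx.load("nms_model.onnx")  # hypothetical file name
graph = model.graph

for node in graph.node:
    if node.op_type == "NonMaxSuppression" and len(node.input) > 2:
        target = node.input[2]  # max_output_boxes_per_class
        for i, init in enumerate(graph.initializer):
            if init.name == target:
                # Rewrite -Infinity / INT64_MAX to a fixed constant such as 896.
                graph.initializer[i].CopyFrom(
                    numpy_helper.from_array(np.array([896], dtype=np.int64), target)
                )

onnx.save(model, "nms_model_fixed.onnx")
```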
@@ -885,7 +970,12 @@ PyTorch's `NonMaxSuppression (torchvision.ops.nms)` and ONNX's `NonMaxSuppressio
  I would be happy if this is a reference for Android + Java or TFJS implementations. There are tons more tricky model optimization techniques described in my blog posts, so you'll have to find them yourself. I don't dare to list the URL here because it is annoying to see so many `issues` being posted. And unfortunately, all articles are in Japanese.
  ![image](https://user-images.githubusercontent.com/33194443/230780749-9967a34b-abf6-47fe-827d-92e0f6bddf46.png)
 
+ </div></details>
+
  ### 11. RNN (RNN, GRU, LSTM) Inference Acceleration
+
+ <details><summary>Click to expand</summary><div>
+
  TensorFlow's RNN has a speedup option called `unroll`. The network will be unrolled, else a symbolic loop will be used. Unrolling can speed up an RNN, although it tends to be more memory-intensive. Unrolling is only suitable for short sequences. onnx2tf allows you to deploy RNNs into memory-intensive operations by specifying the `--enable_rnn_unroll` or `-eru` options. The `--enable_rnn_unroll` option is available for `RNN`, `GRU`, and `LSTM`.
 
  - Keras https://keras.io/api/layers/recurrent_layers/lstm/
@@ -907,7 +997,12 @@ An example of `BidirectionalLSTM` conversion with the `--enable_rnn_unroll` opti
 
  ![image](https://user-images.githubusercontent.com/33194443/234149995-7b68b550-90d9-4070-abd0-158d1e824315.png)
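For reference, `unroll` is an ordinary argument of the Keras recurrent layers, so the trade-off is easy to inspect in isolation. A minimal sketch (layer sizes are placeholders):

```python
import tensorflow as tf

# unroll=True removes the symbolic loop from the graph: faster for short,
# fixed-length sequences, at the cost of a larger, more memory-intensive graph.
seq_len, feat = 16, 32
inputs = tf.keras.Input(shape=(seq_len, feat))
outputs = tf.keras.layers.LSTM(64, unroll=True)(inputs)
model = tf.keras.Model(inputs, outputs)
model.summary()
```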
 
+ </div></details>
+
  ### 12. If the accuracy of the Float32 model degrades significantly
+
+ <details><summary>Click to expand</summary><div>
+
  The pattern of accuracy degradation of the converted model does not only occur when INT8 quantization is performed. A special edge case is when there is a problem with the implementation of a particular OP on the TFLite runtime side. Below, I will reproduce the problem by means of a very simple CNN model and further explain its workaround. Here is the issue that prompted me to add this explanation. [[Conv-TasNet] Facing issue in converting Conv-TasNet model #447](https://github.com/PINTO0309/onnx2tf/issues/447)
 
  Download a sample model for validation.
@@ -1008,7 +1103,12 @@ Again, run the test code to check the inference results. The figure below shows
 
  ![20230817175701](https://github.com/PINTO0309/onnx2tf/assets/33194443/cf1f7b8c-de8f-4f66-a9a7-02fa01506391)
 
+ </div></details>
+
  ### 13. Problem of extremely large calculation error in `InstanceNormalization`
+
+ <details><summary>Click to expand</summary><div>
+
  Even if the conversion is successful, `InstanceNormalization` tends to have very large errors. This is an ONNX specification.
 
  - See.1: https://discuss.pytorch.org/t/understanding-instance-normalization-2d-with-running-mean-and-running-var/144139
@@ -1020,7 +1120,12 @@ I verified this with a very simple sample model. There are more than 8 million e
 
  ![image](https://github.com/PINTO0309/onnx2tf/assets/33194443/61d1fd6e-1703-4c0b-8112-e66e3001fc13)
 
+ </div></details>
+
  ### 14. Inference with dynamic tensors in TFLite
+
+ <details><summary>Click to expand</summary><div>
+
  For some time now, TFLite runtime has supported inference with dynamic tensors. However, the existence of this important function is not widely recognized. In this chapter, I will show how to convert an ONNX file with a dynamic batch size directly into a TFLite file that keeps the dynamic shape, and then infer it under variable batch conditions. The issues that inspired me to add this tutorial are here. [[Dynamic batch / Dynamic shape] onnx model with dynamic input is converted to tflite with static input 1 #441](https://github.com/PINTO0309/onnx2tf/issues/441), or [Cannot use converted model with dynamic input shape #521](https://github.com/PINTO0309/onnx2tf/issues/521)
 
 
@@ -1105,6 +1210,8 @@ If you want to infer in variable batches, you need to infer using `signature`. I
 
  https://github.com/PINTO0309/onnx2tf#4-match-tflite-inputoutput-names-and-inputoutput-order-to-onnx
 
+ You can use `signature_runner` to handle dynamic input tensors by performing inference using `signature`. Below I show that both `batch_size=5` and `batch_size=3` tensors can be inferred with the same model.
+
  - `test.py` - Batch size: `5`
  ```python
  import numpy as np
@@ -1164,7 +1271,12 @@ https://github.com/PINTO0309/onnx2tf#4-match-tflite-inputoutput-names-and-inputo
  3.7874976e-01, 0.0000000e+00]], dtype=float32)}
  ```
 
+ </div></details>
+
  ### 15. Significant optimization of the entire model through `Einsum` and `OneHot` optimizations
+
+ <details><summary>Click to expand</summary><div>
+
  `Einsum` and `OneHot` are not optimized to the maximum by the standard behavior of onnx-optimizer. Therefore, pre-optimizing the `Einsum` OP and `OneHot` OP using my original method can significantly improve the success rate of model conversion, and the input ONNX model itself can be significantly optimized compared to optimizing with onnxsim alone. See: https://github.com/PINTO0309/onnx2tf/issues/569
 
  - I have made a few unique customizations to the cited model structure.
@@ -1192,7 +1304,12 @@ onnx2tf -i sjy_fused_static_spo.onnx
 
  ![image](https://github.com/PINTO0309/onnx2tf/assets/33194443/35adb529-58cc-4f10-96b3-f6ecf4f31db1)
 
+ </div></details>
+
  ### 16. Add constant outputs to the model that are not connected to the model body
+
+ <details><summary>Click to expand</summary><div>
+
  Sometimes you want to always output constants that are not connected to the model body. See: [https://github.com/PINTO0309/onnx2tf/issues/627](https://github.com/PINTO0309/onnx2tf/issues/627). For example, in the case of ONNX as shown in the figure below. You may want to keep scaling parameters and other parameters as fixed values inside the model and always include the same value in the output.
 
  ![image](https://github.com/PINTO0309/onnx2tf/assets/33194443/38080d00-8048-4a2e-8df4-90378487cebc)
@@ -1253,7 +1370,96 @@ Constant Output:
  array([1., 2., 3., 4., 5.], dtype=float32)
  ```
 
- ### 17. Conversion to TensorFlow.js
+ </div></details>
+
+ ### 17. Conversion of models that use variable length tokens and embedding, such as LLM and sound models
+
+ <details><summary>Click to expand</summary><div>
+
+ This refers to a model with undefined dimensions, either all dimensions or multiple dimensions including batch size, as shown in the figure below.
+
+ - Sample model
+
+ https://github.com/PINTO0309/onnx2tf/releases/download/1.24.0/bge-m3.onnx
+
+ - Structure
+
+ ![image](https://github.com/PINTO0309/onnx2tf/assets/33194443/2235bf5e-b8d8-458a-8f1c-65d2c6de5249)
+
+ If such a model is converted without any options, TensorFlow/Keras will abort. This is an internal TensorFlow/Keras implementation issue rather than an onnx2tf issue. TensorFlow/Keras does not allow more than one undefined dimension in the `shape` attribute of `Reshape` due to the specification, so an error occurs during the internal transformation operation of the `Reshape` OP as shown below. This has been an inherent problem in TensorFlow/Keras since long ago and has not been resolved to this day. See: [RuntimeError: tensorflow/lite/kernels/range.cc:39 (start > limit && delta < 0) || (start < limit && delta > 0) was not true.Node number 3 (RANGE) failed to invoke. Node number 393 (WHILE) failed to invoke. current error :RuntimeError: tensorflow/lite/kernels/reshape.cc:55 stretch_dim != -1 (0 != -1)Node number 83 (RESHAPE) failed to prepare. #40504](https://github.com/tensorflow/tensorflow/issues/40504)
+
+ - OP where the problem occurs
+
+ ![image](https://github.com/PINTO0309/onnx2tf/assets/33194443/ffb16181-9d60-432a-94f0-fafc5d19c512)
+
+ - Error message
+ ```
+ error: 'tf.Reshape' op requires 'shape' to have at most one dynamic dimension, but got multiple dynamic dimensions at indices 0 and 3
+ ```
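The underlying restriction is easy to reproduce directly in TensorFlow; a minimal sketch:

```python
import tensorflow as tf

x = tf.zeros([2, 3, 4])
print(tf.reshape(x, [-1, 4]).shape)  # OK: a single -1 is inferred as 6
try:
    tf.reshape(x, [-1, -1])          # two unknown dimensions are rejected
except tf.errors.InvalidArgumentError as e:
    print(e)                         # "Only one input size may be -1 ..."
```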
+
+ Thus, for models such as this, where all dimensions, including batch size, are dynamic shapes, it is often possible to convert by fixing the batch size to `1` with the `-b 1` or `--batch_size 1` option.
+
+ ```
+ onnx2tf -i model.onnx -b 1 -osd
+ ```
+
+ - Results
+
+ ![image](https://github.com/PINTO0309/onnx2tf/assets/33194443/213d9bdc-eec6-4289-9fa9-2b7f43f49720)
+
+ When the converted tflite is displayed in Netron, all the dimensions of the dynamic shape are displayed as `1`, but this is a display problem in Netron, and the shape is actually converted to `-1` or `None`.
+
+ ![image](https://github.com/PINTO0309/onnx2tf/assets/33194443/ccad4eaa-ce1d-46aa-80e9-9720467a3afb)
+
+ Click here to see how to perform inference using the dynamic shape tensor.
+
+ https://github.com/PINTO0309/onnx2tf/tree/main?tab=readme-ov-file#14-inference-with-dynamic-tensors-in-tflite
+
+ </div></details>
+
+ ### 18. Convert only the intermediate structural part of the ONNX model
+
+ <details><summary>Click to expand</summary><div>
+
+ By specifying ONNX input or output names, only the middle part of the model can be converted. This is useful when you want to see what output is obtained in what part of the model after conversion, or when debugging the model conversion operation itself.
+
+ For example, take a model with multiple inputs and multiple outputs as shown in the figure below to try a partial conversion.
+
+ ![image](https://github.com/user-attachments/assets/2bfd01e4-3476-47fe-b0d0-d422dafe78bd)
+
+ - To convert by specifying only the input name to start the conversion
+
+ ```
+ wget https://github.com/PINTO0309/onnx2tf/releases/download/1.25.0/cf_fus.onnx
+ onnx2tf -i cf_fus.onnx -inimc 448 -coion
+ ```
+
+ ![image](https://github.com/user-attachments/assets/de873481-3104-4a81-9240-3cfbd0baaf2f)
+
+ - To convert by specifying only the output name to end the conversion
+
+ ```
+ wget https://github.com/PINTO0309/onnx2tf/releases/download/1.25.0/cf_fus.onnx
+ onnx2tf -i cf_fus.onnx -onimc dep_sec -coion
+ ```
+
+ ![image](https://github.com/user-attachments/assets/9f1f78b8-0334-43ea-a358-35dc76619891)
+
+ - To perform a conversion by specifying the input name to start the conversion and the output name to end the conversion
+
+ ```
+ wget https://github.com/PINTO0309/onnx2tf/releases/download/1.25.0/cf_fus.onnx
+ onnx2tf -i cf_fus.onnx -inimc 448 -onimc velocity -coion
+ ```
+
+ ![image](https://github.com/user-attachments/assets/fd42a258-4338-4260-a6e6-5e108a926bad)
+
+ </div></details>
+
+ ### 19. Conversion to TensorFlow.js
+
+ <details><summary>Click to expand</summary><div>
+
  When converting to TensorFlow.js, process as follows.
 
  ```bash
@@ -1272,7 +1478,12 @@ See: https://github.com/tensorflow/tfjs/tree/master/tfjs-converter
 
  ![image](https://user-images.githubusercontent.com/33194443/224186149-0b9ce9dc-fe09-48d4-b430-6cc3d0687140.png)
 
- ### 18. Conversion to CoreML
+ </div></details>
+
+ ### 20. Conversion to CoreML
+
+ <details><summary>Click to expand</summary><div>
+
  When converting to CoreML, process as follows. The `-k` option is for conversion while maintaining the input channel order in ONNX's NCHW format.
 
  ```bash
@@ -1296,9 +1507,12 @@ See: https://github.com/apple/coremltools
 
  ![image](https://user-images.githubusercontent.com/33194443/224185761-bd0c086c-65e8-4de7-a500-f49b666eea0a.png)
 
+ </div></details>
+
  ## CLI Parameter
- ```
+ <details><summary>Click to expand</summary><div>
 
+ ```
  onnx2tf -h
 
  usage: onnx2tf
@@ -1325,6 +1539,7 @@ usage: onnx2tf
  [-k KEEP_NCW_OR_NCHW_OR_NCDHW_INPUT_NAMES [KEEP_NCW_OR_NCHW_OR_NCDHW_INPUT_NAMES ...]]
  [-kt KEEP_NWC_OR_NHWC_OR_NDHWC_INPUT_NAMES [KEEP_NWC_OR_NHWC_OR_NDHWC_INPUT_NAMES ...]]
  [-kat KEEP_SHAPE_ABSOLUTELY_INPUT_NAMES [KEEP_SHAPE_ABSOLUTELY_INPUT_NAMES ...]]
+ [-inimc INPUT_NAMES [INPUT_NAMES ...]]
  [-onimc OUTPUT_NAMES [OUTPUT_NAMES ...]]
  [-dgc]
  [-eatfp16]
@@ -1541,6 +1756,13 @@ optional arguments:
  If a nonexistent INPUT OP name is specified, it is ignored.
  e.g. --keep_shape_absolutely_input_names "input0" "input1" "input2"
 
+ -inimc INPUT_NAMES [INPUT_NAMES ...], \
+ --input_names_to_interrupt_model_conversion INPUT_NAMES [INPUT_NAMES ...]
+ Input names of ONNX that interrupt model conversion.
+ Interrupts the model conversion at the specified input names and converts
+ the model partitioned into subgraphs.
+ e.g. --input_names_to_interrupt_model_conversion "input0" "input1" "input2"
+
  -onimc OUTPUT_NAMES [OUTPUT_NAMES ...], \
  --output_names_to_interrupt_model_conversion OUTPUT_NAMES [OUTPUT_NAMES ...]
  Output names of ONNX that interrupt model conversion.
@@ -1761,7 +1983,12 @@ optional arguments:
  Default: "debug" (for backwards compatibility)
  ```
 
+ </div></details>
+
  ## In-script Usage
+
+ <details><summary>Click to expand</summary><div>
+
  ```python
  >>> from onnx2tf import convert
  >>> help(convert)
@@ -1791,6 +2018,7 @@ convert(
  keep_ncw_or_nchw_or_ncdhw_input_names: Union[List[str], NoneType] = None,
  keep_nwc_or_nhwc_or_ndhwc_input_names: Union[List[str], NoneType] = None,
  keep_shape_absolutely_input_names: Optional[List[str]] = None,
+ input_names_to_interrupt_model_conversion: Union[List[str], NoneType] = None,
  output_names_to_interrupt_model_conversion: Union[List[str], NoneType] = None,
  disable_group_convolution: Union[bool, NoneType] = False,
  enable_batchmatmul_unfold: Optional[bool] = False,
@@ -2010,6 +2238,13 @@ convert(
  e.g.
  keep_shape_absolutely_input_names=['input0','input1','input2']
 
+ input_names_to_interrupt_model_conversion: Optional[List[str]]
+ Input names of ONNX that interrupt model conversion.
+ Interrupts the model conversion at the specified input names
+ and converts the model partitioned into subgraphs.
+ e.g.
+ input_names_to_interrupt_model_conversion=['input0','input1','input2']
+
  output_names_to_interrupt_model_conversion: Optional[List[str]]
  Output names of ONNX that interrupt model conversion.
  Interrupts the model conversion at the specified output names
@@ -2239,9 +2474,13 @@ convert(
  Model
  ```
 
+ </div></details>
+
  ## Parameter replacement
  This tool is used to convert `NCW` to `NWC`, `NCHW` to `NHWC`, `NCDHW` to `NDHWC`, `NCDDHW` to `NDDHWC`, `NCDDDDDDHW` to `NDDDDDDHWC`. Therefore, as stated in the Key Concepts, the conversion will inevitably break down at some point in the model. You need to look at the entire conversion log to see which OP transpositions are failing and correct them yourself. I dare to explain very little because I know that no matter how much detail I put in the README, you guys will not read it at all. `attribute` or `INPUT constant` or `INPUT Initializer` can be replaced with the specified value.
 
+ <details><summary>Click to expand</summary><div>
+
  Starting from `v1.3.0`, almost all OPs except for some special OPs support pre- and post-transposition by `pre_process_transpose` and `post_process_transpose`.
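For orientation before the steps below, a minimal sketch of what such a replacement file can look like (the `op_name` and `values` are placeholders; check the documented schema before use). It is then passed to onnx2tf with `-prf replace.json`:

```python
import json

# Hypothetical replace.json: overwrite the `perm` attribute of one Transpose OP.
replacement = {
    "format_version": 1,
    "operations": [
        {
            "op_name": "/backbone/Transpose",  # placeholder ONNX OP name
            "param_target": "attributes",      # attributes / inputs / outputs
            "param_name": "perm",
            "values": [0, 2, 3, 1],
        },
    ],
}
with open("replace.json", "w") as f:
    json.dump(replacement, f, indent=2)
```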
 
  1. "A conversion error occurs."
@@ -2370,6 +2609,8 @@ Do not submit an issue that only contains an amount of information that cannot b
 
  </div></details>
 
+ </div></details>
+
  ## Generated Model
  - YOLOv7-tiny with Post-Process (NMS) ONNX to TFLite Float32
  https://github.com/PINTO0309/onnx2tf/releases/download/0.0.33/yolov7_tiny_head_0.768_post_480x640.onnx