imb 1.0.0.tar.gz → 1.0.1.tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
imb-1.0.1/PKG-INFO ADDED
@@ -0,0 +1,105 @@
+ Metadata-Version: 2.2
+ Name: imb
+ Version: 1.0.1
+ Summary: Python library for run inference of deep learning models in different backends
+ Home-page: https://github.com/TheConstant3/InferenceMultiBackend
+ Author: p-constant
+ Author-email: nikshorop@gmail.com
+ Classifier: Programming Language :: Python :: 3.8
+ Classifier: License :: OSI Approved :: MIT License
+ Classifier: Operating System :: OS Independent
+ Requires-Python: >=3.8
+ Description-Content-Type: text/markdown
+ License-File: LICENSE
+ Requires-Dist: numpy
+ Provides-Extra: triton
+ Requires-Dist: tritonclient[all]>=2.38.0; extra == "triton"
+ Provides-Extra: onnxcpu
+ Requires-Dist: onnxruntime>=1.16.0; extra == "onnxcpu"
+ Provides-Extra: onnxgpu
+ Requires-Dist: onnxruntime-gpu>=1.16.0; extra == "onnxgpu"
+ Provides-Extra: all
+ Requires-Dist: tritonclient[all]>=2.38.0; extra == "all"
+ Requires-Dist: onnxruntime>=1.16.0; extra == "all"
+ Requires-Dist: onnxruntime-gpu>=1.16.0; extra == "all"
+ Dynamic: author
+ Dynamic: author-email
+ Dynamic: classifier
+ Dynamic: description
+ Dynamic: description-content-type
+ Dynamic: home-page
+ Dynamic: provides-extra
+ Dynamic: requires-dist
+ Dynamic: requires-python
+ Dynamic: summary
+
+ # InferenceMultiBackend
+
+ Python library for running inference of deep learning models on different backends
+
+ ## Installation
+
+ To use the Triton inference client:
+ ```pip install imb[triton]```
+
+ To use the onnxruntime-gpu client:
+ ```pip install imb[onnxgpu]```
+
+ To use the onnxruntime (CPU) client:
+ ```pip install imb[onnxcpu]```
+
+ To install all implemented clients:
+ ```pip install imb[all]```
+
+ ## Usage
+
+ OnnxClient usage example:
+ ```python
+ onnx_client = OnnxClient(
+     model_path='model.onnx',
+     model_name='any name',
+     providers=['CUDAExecutionProvider', 'CPUExecutionProvider'],
+     max_batch_size=16,
+     return_dict=True,
+     fixed_batch=True,
+     warmup=True
+ )
+ # if the model has a fixed input size (except the batch size), sample_inputs will be created
+ sample_inputs = onnx_client.sample_inputs
+ print('inputs shapes', [o.shape for o in sample_inputs])
+ outputs = onnx_client(*sample_inputs)
+ print('outputs shapes', [(o_name, o_value.shape) for o_name, o_value in outputs.items()])
+ ```
+
+ TritonClient usage example:
+ ```python
+ triton_client = TritonClient(
+     url='localhost:8000',
+     model_name='arcface',
+     max_batch_size=16,
+     timeout=10,
+     resend_count=10,
+     fixed_batch=True,
+     is_async=False,
+     cuda_shm=False,
+     max_shm_regions=2,
+     scheme='http',
+     return_dict=True,
+     warmup=False
+ )
+ # if the model has a fixed input size (except the batch size), sample_inputs will be created
+ sample_inputs = triton_client.sample_inputs
+ print('inputs shapes', [o.shape for o in sample_inputs])
+ outputs = triton_client(*sample_inputs)
+ print('outputs shapes', [(o_name, o_value.shape) for o_name, o_value in outputs.items()])
+ ```
+
+ ## Notes
+
+ max_batch_size - maximum batch size for inference. If the input data is larger than max_batch_size, it will be split into several batches.
+
+ fixed_batch - if True, each batch will have a fixed size (the smallest batch is padded up to max_batch_size).
+
+ warmup - if True, the model will run several calls on sample_inputs during initialization.
+
+ return_dict - if True, __call__ returns a dict {'output_name1': output_value1, ...}; otherwise it returns a list [output_value1, ...]
imb-1.0.1/README.md ADDED
@@ -0,0 +1,70 @@
+ # InferenceMultiBackend
+
+ Python library for running inference of deep learning models on different backends
+
+ ## Installation
+
+ To use the Triton inference client:
+ ```pip install imb[triton]```
+
+ To use the onnxruntime-gpu client:
+ ```pip install imb[onnxgpu]```
+
+ To use the onnxruntime (CPU) client:
+ ```pip install imb[onnxcpu]```
+
+ To install all implemented clients:
+ ```pip install imb[all]```
+
+ ## Usage
+
+ OnnxClient usage example:
+ ```python
+ onnx_client = OnnxClient(
+     model_path='model.onnx',
+     model_name='any name',
+     providers=['CUDAExecutionProvider', 'CPUExecutionProvider'],
+     max_batch_size=16,
+     return_dict=True,
+     fixed_batch=True,
+     warmup=True
+ )
+ # if the model has a fixed input size (except the batch size), sample_inputs will be created
+ sample_inputs = onnx_client.sample_inputs
+ print('inputs shapes', [o.shape for o in sample_inputs])
+ outputs = onnx_client(*sample_inputs)
+ print('outputs shapes', [(o_name, o_value.shape) for o_name, o_value in outputs.items()])
+ ```
+
+ TritonClient usage example:
+ ```python
+ triton_client = TritonClient(
+     url='localhost:8000',
+     model_name='arcface',
+     max_batch_size=16,
+     timeout=10,
+     resend_count=10,
+     fixed_batch=True,
+     is_async=False,
+     cuda_shm=False,
+     max_shm_regions=2,
+     scheme='http',
+     return_dict=True,
+     warmup=False
+ )
+ # if the model has a fixed input size (except the batch size), sample_inputs will be created
+ sample_inputs = triton_client.sample_inputs
+ print('inputs shapes', [o.shape for o in sample_inputs])
+ outputs = triton_client(*sample_inputs)
+ print('outputs shapes', [(o_name, o_value.shape) for o_name, o_value in outputs.items()])
+ ```
+
+ ## Notes
+
+ max_batch_size - maximum batch size for inference. If the input data is larger than max_batch_size, it will be split into several batches.
+
+ fixed_batch - if True, each batch will have a fixed size (the smallest batch is padded up to max_batch_size).
+
+ warmup - if True, the model will run several calls on sample_inputs during initialization.
+
+ return_dict - if True, __call__ returns a dict {'output_name1': output_value1, ...}; otherwise it returns a list [output_value1, ...]
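A minimal sketch of the batching behaviour described in the Notes above: input larger than max_batch_size is split into chunks and, with fixed_batch=True, the smallest chunk is padded up to max_batch_size. The helper below is illustrative only (it assumes NumPy array inputs) and is not the library's internal implementation.

```python
import numpy as np

def split_into_batches(inputs, max_batch_size, fixed_batch):
    """Illustrative only: split a (N, ...) array into batches of at most max_batch_size.

    With fixed_batch=True the smaller trailing batch is zero-padded up to
    max_batch_size, mirroring the behaviour described in the Notes section.
    """
    batches = []
    for start in range(0, len(inputs), max_batch_size):
        batch = inputs[start:start + max_batch_size]
        if fixed_batch and len(batch) < max_batch_size:
            pad = np.zeros((max_batch_size - len(batch), *batch.shape[1:]), dtype=batch.dtype)
            batch = np.concatenate([batch, pad], axis=0)
        batches.append(batch)
    return batches

# 10 samples with max_batch_size=4 -> three batches of shape (4, 3, 224, 224),
# the last one padded from 2 samples up to 4
data = np.random.rand(10, 3, 224, 224).astype(np.float32)
for b in split_into_batches(data, max_batch_size=4, fixed_batch=True):
    print(b.shape)
```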
File without changes
imb-1.0.1/imb.egg-info/PKG-INFO ADDED (content identical to imb-1.0.1/PKG-INFO above)
imb.egg-info/SOURCES.txt
@@ -3,12 +3,11 @@ README.md
  setup.cfg
  setup.py
  imb/__init__.py
+ imb/base.py
+ imb/onnx.py
+ imb/triton.py
  imb.egg-info/PKG-INFO
  imb.egg-info/SOURCES.txt
  imb.egg-info/dependency_links.txt
  imb.egg-info/requires.txt
- imb.egg-info/top_level.txt
- imb/inference_clients/__init__.py
- imb/inference_clients/base.py
- imb/inference_clients/onnx.py
- imb/inference_clients/triton.py
+ imb.egg-info/top_level.txt
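The SOURCES.txt change shows the imb/inference_clients/ subpackage being flattened into imb/base.py, imb/onnx.py and imb/triton.py. A minimal import sketch, assuming the package keeps re-exporting the clients at the top level as it did in 1.0.0 (the 1.0.1 __init__.py is not shown in this diff):

```python
# Package-level imports: imb/__init__.py re-exported both clients in 1.0.0
# ("from .inference_clients import OnnxClient, TritonClient"); 1.0.1 is assumed
# to keep equivalent re-exports from the new flat modules.
from imb import OnnxClient, TritonClient

# Direct module imports inferred from the new file layout (an assumption):
# from imb.onnx import OnnxClient
# from imb.triton import TritonClient
```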
imb.egg-info/requires.txt
@@ -0,0 +1,15 @@
+ numpy
+
+ [all]
+ tritonclient[all]>=2.38.0
+ onnxruntime>=1.16.0
+ onnxruntime-gpu>=1.16.0
+
+ [onnxcpu]
+ onnxruntime>=1.16.0
+
+ [onnxgpu]
+ onnxruntime-gpu>=1.16.0
+
+ [triton]
+ tritonclient[all]>=2.38.0
setup.py
@@ -1,4 +1,5 @@
  from setuptools import setup, find_packages
+ from itertools import chain
  import os


@@ -11,9 +12,14 @@ def readme():
      with open('README.md', 'r') as f:
          return f.read()

+ extras = ['triton', 'onnxcpu', 'onnxgpu']
+ extras_require = {extra: req_file(f"requirements_{extra}.txt") for extra in extras}
+ extras_require["all"] = list(chain(extras_require.values()))
+
+
  setup(
      name='imb',
-     version='1.0.0',
+     version='1.0.1',
      author='p-constant',
      author_email='nikshorop@gmail.com',
      description='Python library for run inference of deep learning models in different backends',
@@ -22,6 +28,7 @@ setup(
      url='https://github.com/TheConstant3/InferenceMultiBackend',
      packages=find_packages(),
      install_requires=req_file(),
+     extras_require=extras_require,
      classifiers=[
          "Programming Language :: Python :: 3.8",
          "License :: OSI Approved :: MIT License",
imb-1.0.0/PKG-INFO DELETED
@@ -1,30 +0,0 @@
- Metadata-Version: 2.2
- Name: imb
- Version: 1.0.0
- Summary: Python library for run inference of deep learning models in different backends
- Home-page: https://github.com/TheConstant3/InferenceMultiBackend
- Author: p-constant
- Author-email: nikshorop@gmail.com
- Classifier: Programming Language :: Python :: 3.8
- Classifier: License :: OSI Approved :: MIT License
- Classifier: Operating System :: OS Independent
- Requires-Python: >=3.8
- Description-Content-Type: text/markdown
- License-File: LICENSE
- Requires-Dist: onnxruntime-gpu>=1.16.0
- Requires-Dist: tritonclient[all]>=2.38.0
- Requires-Dist: numpy>=1.19.4
- Dynamic: author
- Dynamic: author-email
- Dynamic: classifier
- Dynamic: description
- Dynamic: description-content-type
- Dynamic: home-page
- Dynamic: requires-dist
- Dynamic: requires-python
- Dynamic: summary
-
- # InferenceMultiBackend
-
- Python library for run inference of deep learning models in different backends
-
imb-1.0.0/README.md DELETED
@@ -1,4 +0,0 @@
- # InferenceMultiBackend
-
- Python library for run inference of deep learning models in different backends
-
imb-1.0.0/imb/__init__.py DELETED
@@ -1 +0,0 @@
- from .inference_clients import OnnxClient, TritonClient
imb-1.0.0/imb/inference_clients/__init__.py DELETED
@@ -1,2 +0,0 @@
- from .onnx import OnnxClient
- from .triton import TritonClient
imb-1.0.0/imb.egg-info/PKG-INFO DELETED (content identical to imb-1.0.0/PKG-INFO above)
imb-1.0.0/imb.egg-info/requires.txt DELETED
@@ -1,3 +0,0 @@
- onnxruntime-gpu>=1.16.0
- tritonclient[all]>=2.38.0
- numpy>=1.19.4
File without changes
File without changes
File without changes