PyPI - compressed-tensors-nightly - Versions diffs - 0.5.0.20240812__py3-none-any.whl → 0.5.0.20240813__py3-none-any.whl - Mend

compressed-tensors-nightly 0.5.0.20240812__py3-none-any.whl → 0.5.0.20240813__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5)

{compressed_tensors_nightly-0.5.0.20240812.dist-info → compressed_tensors_nightly-0.5.0.20240813.dist-info}/METADATA RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.1
 Name: compressed-tensors-nightly
-Version: 0.5.0.20240812
+Version: 0.5.0.20240813
 Summary: Library for utilization of compressed safetensors of neural network models
 Home-page: https://github.com/neuralmagic/compressed-tensors
 Author: Neuralmagic, Inc.
@@ -20,32 +20,43 @@ Requires-Dist: flake8>=3.8.3; extra == "dev"
 Requires-Dist: pytest>=6.0.0; extra == "dev"
 Requires-Dist: nbconvert>=7.16.3; extra == "dev"
-# compressed_tensors
+# compressed-tensors
-This repository extends a [safetensors](https://github.com/huggingface/safetensors) format to efficiently store sparse and/or quantized tensors on disk. `compressed-tensors` format supports multiple compression types to minimize the disk space and facilitate the tensor manipulation.
+The `compressed-tensors` library extends the [safetensors](https://github.com/huggingface/safetensors) format, providing a versatile and efficient way to store and manage compressed tensor data. This library supports various quantization and sparsity schemes, making it a unified format for handling different model optimizations like GPTQ, AWQ, SmoothQuant, INT8, FP8, SparseGPT, and more.
-## Motivation
+## Why `compressed-tensors`?
-### Reduce disk space by saving sparse tensors in a compressed format
+As model compression becomes increasingly important for efficient deployment of LLMs, the landscape of quantization and compression techniques has become increasingly fragmented.
+Each method often comes with its own storage format and loading procedures, making it challenging to work with multiple techniques or switch between them.
+`compressed-tensors` addresses this by providing a single, extensible format that can represent a wide variety of compression schemes.
-The compressed format stores the data much more efficiently by taking advantage of two properties of tensors:
+* **Unified Checkpoint Format**: Supports various compression schemes in a single, consistent format.
+* **Wide Compatibility**: Works with popular quantization methods like GPTQ, SmoothQuant, and FP8. See [llm-compressor](https://github.com/vllm-project/llm-compressor)
+* **Flexible Quantization Support**:
+  * Weight-only quantization (e.g., W4A16, W8A16, WnA16)
+  * Activation quantization (e.g., W8A8)
+  * KV cache quantization
+  * Non-uniform schemes (different layers can be quantized in different ways!)
+* **Sparsity Support**: Handles both unstructured and semi-structured (e.g., 2:4) sparsity patterns.
+* **Open-Source Integration**: Designed to work seamlessly with Hugging Face models and PyTorch.
-- Sparse tensors -> due to a large number of entries that are equal to zero.
-- Quantized -> due to their low precision representation.
-### Introduce an elegant interface to save/load compressed tensors
-The library provides the user with the ability to compress/decompress tensors. The properties of tensors are defined by human-readable configs, allowing the users to understand the compression format at a quick glance.
+This allows developers and researchers to easily experiment with composing different quantization methods, simplify model deployment pipelines, and reduce the overhead of supporting multiple compression formats in inference engines.
 ## Installation
-### Pip
+### From [PyPI](https://pypi.org/project/compressed-tensors)
+Stable release:
 ```bash
 pip install compressed-tensors
 ```
-### From source
+Nightly release:
+```bash
+pip install compressed-tensors-nightly
+```
+### From Source
 ```bash
 git clone https://github.com/neuralmagic/compressed-tensors

{compressed_tensors_nightly-0.5.0.20240812.dist-info → compressed_tensors_nightly-0.5.0.20240813.dist-info}/RECORD RENAMED

@@ -41,8 +41,8 @@ compressed_tensors/utils/offload.py,sha256=qAMwoFT3WEQ9nB_SegE12ob8ghDugddQseE6z
 compressed_tensors/utils/permutations_24.py,sha256=kx6fsfDHebx94zsSzhXGyCyuC9sVyah6BUUir_StT28,2530
 compressed_tensors/utils/safetensors_load.py,sha256=0MheXwx1jeY12PeISppiSIZHs6rmN2YddwPpFb9V67I,8527
 compressed_tensors/utils/semi_structured_conversions.py,sha256=g1EZHzdv-ko7ufPX430dp7wE33o6FWJXuSP4zZydCu0,13488
-compressed_tensors_nightly-0.5.0.20240812.dist-info/LICENSE,sha256=xx0jnfkXJvxRnG63LTGOxlggYnIysveWIZ6H3PNdCrQ,11357
-compressed_tensors_nightly-0.5.0.20240812.dist-info/METADATA,sha256=hm6LdxXI04P0BWQv4mhejSipHT1WIeu3blhiGHBWfJQ,5680
-compressed_tensors_nightly-0.5.0.20240812.dist-info/WHEEL,sha256=eOLhNAGa2EW3wWl_TU484h7q1UNgy0JXjjoqKoxAAQc,92
-compressed_tensors_nightly-0.5.0.20240812.dist-info/top_level.txt,sha256=w2i-GyPs2s1UwVxvutSvN_lM22SXC2hQFBmoMcPnV7Y,19
-compressed_tensors_nightly-0.5.0.20240812.dist-info/RECORD,,
+compressed_tensors_nightly-0.5.0.20240813.dist-info/LICENSE,sha256=xx0jnfkXJvxRnG63LTGOxlggYnIysveWIZ6H3PNdCrQ,11357
+compressed_tensors_nightly-0.5.0.20240813.dist-info/METADATA,sha256=N-3hz80sjo8Y-9KQiKWyhdSophoXCBWKHZH3KXwJwZE,6749
+compressed_tensors_nightly-0.5.0.20240813.dist-info/WHEEL,sha256=eOLhNAGa2EW3wWl_TU484h7q1UNgy0JXjjoqKoxAAQc,92
+compressed_tensors_nightly-0.5.0.20240813.dist-info/top_level.txt,sha256=w2i-GyPs2s1UwVxvutSvN_lM22SXC2hQFBmoMcPnV7Y,19
+compressed_tensors_nightly-0.5.0.20240813.dist-info/RECORD,,
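
Each RECORD row above has the form `path,sha256=<digest>,size`, where the digest is the URL-safe base64 encoding of the file's SHA-256 hash with the trailing `=` padding removed, and the row for RECORD itself leaves both fields empty. The following is a minimal sketch of how such a row could be parsed and its hash recomputed; the helper names `record_hash` and `parse_record_line` are hypothetical and are not part of `compressed-tensors` or of any packaging tool.

```python
import base64
import hashlib
from typing import Optional, Tuple


def record_hash(data: bytes) -> str:
    """Return a RECORD-style digest: 'sha256=' + URL-safe base64, no '=' padding."""
    digest = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(digest).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


def parse_record_line(line: str) -> Tuple[str, str, Optional[int]]:
    """Split one RECORD row into (path, hash, size).

    The row describing RECORD itself has an empty hash and an empty size,
    so size is returned as None in that case.
    """
    # Split from the right so the path stays intact even if it contains commas.
    path, hash_field, size_field = line.strip().rsplit(",", 2)
    size = int(size_field) if size_field else None
    return path, hash_field, size
```

To check an unpacked wheel, one would read each listed file as bytes, compare `record_hash(data)` with the hash field, and compare `len(data)` with the size field.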

{compressed_tensors_nightly-0.5.0.20240812.dist-info → compressed_tensors_nightly-0.5.0.20240813.dist-info}/LICENSE RENAMED

File without changes

{compressed_tensors_nightly-0.5.0.20240812.dist-info → compressed_tensors_nightly-0.5.0.20240813.dist-info}/WHEEL RENAMED

File without changes

{compressed_tensors_nightly-0.5.0.20240812.dist-info → compressed_tensors_nightly-0.5.0.20240813.dist-info}/top_level.txt RENAMED

File without changes
