compressed-tensors 0.11.1a20250828__tar.gz → 0.11.1a20250903__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/PKG-INFO +1 -1
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/examples/llama_1.1b/ex_llmcompressor_quantization.py +1 -1
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/version.py +1 -1
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors.egg-info/PKG-INFO +1 -1
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/lifecycle/test_forward.py +18 -30
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/.gitkeep +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/actions/test/action.yml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/scripts/step-status +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/workflows/build-test.yml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/workflows/build.yml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/workflows/quality-check.yaml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/workflows/report.yml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/workflows/test-check.yaml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/workflows/test.yml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/workflows/trigger-all.yml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/workflows/upload.yml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.gitignore +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/LICENSE +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/Makefile +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/README.md +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/examples/bit_packing/ex_quantize_and_pack.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/examples/bit_packing/int4_config.json +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/examples/bitmask_compression.ipynb +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/examples/llama_1.1b/ex_config_quantization.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/examples/llama_1.1b/example_quant_config.json +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/examples/llama_1.1b/example_quant_recipe.yaml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/examples/quantize_and_pack_int4.ipynb +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/pyproject.toml +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/setup.cfg +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/setup.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/README.md +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/base.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/base.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/helpers.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/model_compressors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/model_compressors/model_compressor.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/quantized_compressors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/quantized_compressors/base.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/quantized_compressors/naive_quantized.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/quantized_compressors/nvfp4_quantized.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/quantized_compressors/pack_quantized.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/sparse_compressors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/sparse_compressors/base.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/sparse_compressors/dense.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/sparse_compressors/sparse_24_bitmask.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/sparse_compressors/sparse_bitmask.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/sparse_quantized_compressors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/compressors/sparse_quantized_compressors/marlin_24.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/config/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/config/base.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/config/dense.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/config/sparse_24_bitmask.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/config/sparse_bitmask.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/linear/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/linear/compressed_linear.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/lifecycle/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/lifecycle/apply.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/lifecycle/compressed.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/lifecycle/forward.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/lifecycle/helpers.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/lifecycle/initialize.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/quant_args.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/quant_config.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/quant_scheme.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/utils/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/quantization/utils/helpers.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/registry/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/registry/registry.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/apply.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/factory/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/factory/base.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/factory/hadamard.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/factory/matrix_multiply.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/factory/random_hadamard.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/transform_args.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/transform_config.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/transform_scheme.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/utils/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/utils/hadamard.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/utils/hadamards.safetensors +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/transform/utils/matrix.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/utils/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/utils/helpers.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/utils/internal.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/utils/match.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/utils/offload.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/utils/permutations_24.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/utils/permute.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/utils/safetensors_load.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/utils/semi_structured_conversions.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors/utils/type.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors.egg-info/SOURCES.txt +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors.egg-info/dependency_links.txt +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors.egg-info/requires.txt +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/src/compressed_tensors.egg-info/top_level.txt +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/conftest.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/model_compressors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/model_compressors/test_model_compressor.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/quantized_compressors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/quantized_compressors/test_fp8_quant.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/quantized_compressors/test_int_quant.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/quantized_compressors/test_nvfp4_quant.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/quantized_compressors/test_pack_quant.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/sparse_compressors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/sparse_compressors/test_bitmask.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/sparse_compressors/test_sparse_24_bitmask.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/sparse_quantized_compressors/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_compressors/sparse_quantized_compressors/test_marlin_24.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_configs/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_configs/test_base.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_examples/test_bitmask_compression_ipynb.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_linear/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_linear/test_compressed_linear.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/lifecycle/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/lifecycle/conftest.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/lifecycle/test_apply.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/lifecycle/test_dynamic_lifecycle.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/lifecycle/test_enabled.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/lifecycle/test_helpers.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/lifecycle/test_initialize.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/lifecycle/test_lifecycle.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/test_configs/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/test_configs/test_bit_depths.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/test_configs/test_strategies.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/test_quant_args.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/test_quant_config.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/test_quant_scheme.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_quantization/test_utils/test_helpers.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_registry.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_transform/conftest.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_transform/factory/test_correctness.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_transform/factory/test_memory.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_transform/factory/test_serialization.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_transform/test_transform_args.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_transform/test_transform_config.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_transform/test_transform_scheme.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_transform/utils/test_hadamard.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_utils/__init__.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_utils/test_helpers.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_utils/test_match.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_utils/test_offload.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_utils/test_safetensors_load.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_utils/test_type.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/testing_utils.py +0 -0
- {compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/utils/copyright.py +0 -0
@@ -1,6 +1,6 @@
|
|
1
1
|
Metadata-Version: 2.4
|
2
2
|
Name: compressed-tensors
|
3
|
-
Version: 0.11.
|
3
|
+
Version: 0.11.1a20250903
|
4
4
|
Summary: Library for utilization of compressed safetensors of neural network models
|
5
5
|
Home-page: https://github.com/neuralmagic/compressed-tensors
|
6
6
|
Author: Neuralmagic, Inc.
|
@@ -1,6 +1,6 @@
|
|
1
1
|
Metadata-Version: 2.4
|
2
2
|
Name: compressed-tensors
|
3
|
-
Version: 0.11.
|
3
|
+
Version: 0.11.1a20250903
|
4
4
|
Summary: Library for utilization of compressed safetensors of neural network models
|
5
5
|
Home-page: https://github.com/neuralmagic/compressed-tensors
|
6
6
|
Author: Neuralmagic, Inc.
|
@@ -19,9 +19,8 @@ import pytest
|
|
19
19
|
import torch
|
20
20
|
from compressed_tensors.quantization.lifecycle.forward import (
|
21
21
|
_process_quantization,
|
22
|
-
|
22
|
+
fake_quantize,
|
23
23
|
forward_quantize,
|
24
|
-
quantize,
|
25
24
|
wrap_module_forward_quantized,
|
26
25
|
)
|
27
26
|
from compressed_tensors.quantization.lifecycle.initialize import (
|
@@ -96,7 +95,7 @@ def test_forward_quantize(
|
|
96
95
|
|
97
96
|
|
98
97
|
@pytest.mark.parametrize(
|
99
|
-
"num_bits,type,strategy,group_size,scale,zero_point,g_idx",
|
98
|
+
"num_bits,type,strategy,group_size,scale,zero_point,g_idx,global_scale",
|
100
99
|
[
|
101
100
|
(
|
102
101
|
4,
|
@@ -106,6 +105,7 @@ def test_forward_quantize(
|
|
106
105
|
torch.rand((1,)) * 0.01,
|
107
106
|
torch.zeros((1,)),
|
108
107
|
None,
|
108
|
+
None,
|
109
109
|
),
|
110
110
|
(
|
111
111
|
4,
|
@@ -115,6 +115,7 @@ def test_forward_quantize(
|
|
115
115
|
torch.rand((512, 8)) * 0.01,
|
116
116
|
torch.zeros((512, 8)),
|
117
117
|
None,
|
118
|
+
None,
|
118
119
|
),
|
119
120
|
(
|
120
121
|
4,
|
@@ -124,6 +125,7 @@ def test_forward_quantize(
|
|
124
125
|
torch.rand((512, 8)) * 0.01,
|
125
126
|
torch.zeros((512, 8)),
|
126
127
|
make_dummy_g_idx(1024, 128),
|
128
|
+
None,
|
127
129
|
),
|
128
130
|
(
|
129
131
|
8,
|
@@ -133,6 +135,7 @@ def test_forward_quantize(
|
|
133
135
|
torch.rand((1,)) * 0.01,
|
134
136
|
torch.zeros((1,)),
|
135
137
|
None,
|
138
|
+
None,
|
136
139
|
),
|
137
140
|
(
|
138
141
|
8,
|
@@ -142,6 +145,7 @@ def test_forward_quantize(
|
|
142
145
|
torch.rand((512, 8)) * 0.01,
|
143
146
|
torch.zeros((512, 8)),
|
144
147
|
None,
|
148
|
+
None,
|
145
149
|
),
|
146
150
|
(
|
147
151
|
8,
|
@@ -151,28 +155,8 @@ def test_forward_quantize(
|
|
151
155
|
torch.rand((512, 8)) * 0.01,
|
152
156
|
torch.zeros((512, 8)),
|
153
157
|
make_dummy_g_idx(1024, 128),
|
158
|
+
None,
|
154
159
|
),
|
155
|
-
],
|
156
|
-
)
|
157
|
-
def test_quantize(num_bits, type, strategy, group_size, scale, zero_point, g_idx):
|
158
|
-
args = QuantizationArgs(
|
159
|
-
num_bits=num_bits, type=type, strategy=strategy, group_size=group_size
|
160
|
-
)
|
161
|
-
|
162
|
-
x = torch.rand((512, 1024))
|
163
|
-
quantize(
|
164
|
-
x=x,
|
165
|
-
scale=scale,
|
166
|
-
zero_point=zero_point,
|
167
|
-
args=args,
|
168
|
-
dtype=args.pytorch_dtype(),
|
169
|
-
g_idx=g_idx,
|
170
|
-
)
|
171
|
-
|
172
|
-
|
173
|
-
@pytest.mark.parametrize(
|
174
|
-
"num_bits,type,strategy,group_size,scale,zero_point,g_idx",
|
175
|
-
[
|
176
160
|
(
|
177
161
|
8,
|
178
162
|
"int",
|
@@ -181,6 +165,7 @@ def test_quantize(num_bits, type, strategy, group_size, scale, zero_point, g_idx
|
|
181
165
|
torch.rand((512, 8)) * 0.01,
|
182
166
|
torch.zeros((512, 8)),
|
183
167
|
None,
|
168
|
+
None,
|
184
169
|
),
|
185
170
|
(
|
186
171
|
8,
|
@@ -190,23 +175,26 @@ def test_quantize(num_bits, type, strategy, group_size, scale, zero_point, g_idx
|
|
190
175
|
torch.rand((512, 8)) * 0.01,
|
191
176
|
torch.zeros((512, 8)),
|
192
177
|
make_dummy_g_idx(1024, 128),
|
178
|
+
None,
|
193
179
|
),
|
194
180
|
],
|
195
181
|
)
|
196
|
-
def
|
182
|
+
def test_fake_quantize_2d(
|
183
|
+
num_bits, type, strategy, group_size, scale, zero_point, g_idx, global_scale
|
184
|
+
):
|
197
185
|
args = QuantizationArgs(
|
198
186
|
num_bits=num_bits, type=type, strategy=strategy, group_size=group_size
|
199
187
|
)
|
200
188
|
|
201
|
-
|
202
|
-
|
203
|
-
|
189
|
+
x = torch.rand((512, 1024))
|
190
|
+
fake_quantize(
|
191
|
+
x=x,
|
204
192
|
scale=scale,
|
205
193
|
zero_point=zero_point,
|
206
194
|
args=args,
|
207
|
-
dtype=None,
|
208
195
|
g_idx=g_idx,
|
209
|
-
|
196
|
+
global_scale=global_scale,
|
197
|
+
) # note that reconstruction loss is bad for uncalibrated scales
|
210
198
|
|
211
199
|
|
212
200
|
def test_process_quantization_block_static():
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
{compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/.github/workflows/test.yml
RENAMED
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
{compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/test_registry.py
RENAMED
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
{compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/tests/testing_utils.py
RENAMED
File without changes
|
{compressed_tensors-0.11.1a20250828 → compressed_tensors-0.11.1a20250903}/utils/copyright.py
RENAMED
File without changes
|