compressed-tensors 0.10.3a20250709__tar.gz → 0.10.3a20250711__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/actions/test/action.yml +5 -5
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/workflows/build-test.yml +2 -1
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/workflows/test.yml +18 -4
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/workflows/trigger-all.yml +1 -1
- {compressed_tensors-0.10.3a20250709/src/compressed_tensors.egg-info → compressed_tensors-0.10.3a20250711}/PKG-INFO +1 -1
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/model_compressors/model_compressor.py +8 -4
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/sparse_compressors/sparse_24_bitmask.py +6 -2
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/version.py +1 -1
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711/src/compressed_tensors.egg-info}/PKG-INFO +1 -1
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/model_compressors/test_model_compressor.py +1 -4
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/.gitkeep +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/scripts/step-status +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/workflows/build.yml +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/workflows/report.yml +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/workflows/test-check.yaml +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/workflows/upload.yml +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.gitignore +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/LICENSE +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/Makefile +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/README.md +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/examples/bit_packing/ex_quantize_and_pack.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/examples/bit_packing/int4_config.json +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/examples/bitmask_compression.ipynb +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/examples/llama_1.1b/ex_config_quantization.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/examples/llama_1.1b/ex_llmcompressor_quantization.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/examples/llama_1.1b/example_quant_config.json +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/examples/llama_1.1b/example_quant_recipe.yaml +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/examples/quantize_and_pack_int4.ipynb +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/pyproject.toml +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/setup.cfg +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/setup.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/README.md +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/base.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/base.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/helpers.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/model_compressors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/quantized_compressors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/quantized_compressors/base.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/quantized_compressors/naive_quantized.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/quantized_compressors/nvfp4_quantized.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/quantized_compressors/pack_quantized.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/sparse_compressors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/sparse_compressors/base.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/sparse_compressors/dense.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/sparse_compressors/sparse_bitmask.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/sparse_quantized_compressors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/compressors/sparse_quantized_compressors/marlin_24.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/config/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/config/base.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/config/dense.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/config/sparse_24_bitmask.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/config/sparse_bitmask.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/linear/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/linear/compressed_linear.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/lifecycle/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/lifecycle/apply.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/lifecycle/compressed.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/lifecycle/forward.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/lifecycle/helpers.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/lifecycle/initialize.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/quant_args.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/quant_config.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/quant_scheme.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/utils/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/quantization/utils/helpers.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/registry/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/registry/registry.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/apply.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/factory/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/factory/base.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/factory/hadamard.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/factory/matrix_multiply.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/factory/random_hadamard.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/transform_args.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/transform_config.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/transform_scheme.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/utils/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/utils/hadamard.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/utils/hadamards.safetensors +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/transform/utils/utils.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/utils/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/utils/helpers.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/utils/internal.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/utils/offload.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/utils/permutations_24.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/utils/permute.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/utils/safetensors_load.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors/utils/semi_structured_conversions.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors.egg-info/SOURCES.txt +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors.egg-info/dependency_links.txt +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors.egg-info/requires.txt +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/src/compressed_tensors.egg-info/top_level.txt +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/conftest.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/model_compressors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/quantized_compressors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/quantized_compressors/test_fp8_quant.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/quantized_compressors/test_int_quant.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/quantized_compressors/test_nvfp4_quant.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/quantized_compressors/test_pack_quant.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/sparse_compressors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/sparse_compressors/test_bitmask.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/sparse_compressors/test_sparse_24_bitmask.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/sparse_quantized_compressors/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_compressors/sparse_quantized_compressors/test_marlin_24.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_configs/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_configs/test_base.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_examples/test_bitmask_compression_ipynb.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_linear/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_linear/test_compressed_linear.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/lifecycle/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/lifecycle/conftest.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/lifecycle/test_apply.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/lifecycle/test_dynamic_lifecycle.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/lifecycle/test_enabled.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/lifecycle/test_forward.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/lifecycle/test_helpers.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/lifecycle/test_initialize.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/lifecycle/test_lifecycle.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/test_configs/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/test_configs/test_bit_depths.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/test_configs/test_strategies.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/test_quant_args.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/test_quant_config.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/test_quant_scheme.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_quantization/test_utils/test_helpers.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_registry.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_transform/conftest.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_transform/factory/test_correctness.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_transform/factory/test_memory.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_transform/test_transform_args.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_transform/test_transform_config.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_transform/test_transform_scheme.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_transform/utils/test_hadamard.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_utils/__init__.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_utils/test_helpers.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_utils/test_offload.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_utils/test_safetensors_load.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/testing_utils.py +0 -0
- {compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/utils/copyright.py +0 -0
@@ -69,11 +69,11 @@ runs:
|
|
69
69
|
echo "::endgroup::"
|
70
70
|
|
71
71
|
if [[ "${ENABLE_COVERAGE}" == "true" ]]; then
|
72
|
-
echo "::group::
|
73
|
-
|
74
|
-
|
75
|
-
|
76
|
-
|
72
|
+
echo "::group::check coverage reports"
|
73
|
+
if [ ! -d coverage-html ]; then
|
74
|
+
echo "ERROR: coverage-html folder not found"
|
75
|
+
exit 1
|
76
|
+
fi
|
77
77
|
echo "::endgroup::"
|
78
78
|
fi
|
79
79
|
|
@@ -25,7 +25,7 @@ on:
|
|
25
25
|
|
26
26
|
# test related parameters
|
27
27
|
test_configs:
|
28
|
-
description: "python, label, timeout"
|
28
|
+
description: "python, label, timeout, etc"
|
29
29
|
type: string
|
30
30
|
required: true
|
31
31
|
|
@@ -53,6 +53,7 @@ jobs:
|
|
53
53
|
python: ${{ matrix.test_config.python }}
|
54
54
|
timeout: ${{ matrix.test_config.timeout }}
|
55
55
|
whl: ${{ needs.BUILD.outputs.whl }}
|
56
|
+
code_coverage: ${{ matrix.test_config.code_coverage || false }}
|
56
57
|
secrets: inherit
|
57
58
|
|
58
59
|
UPLOAD:
|
{compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/.github/workflows/test.yml
RENAMED
@@ -70,6 +70,10 @@ jobs:
|
|
70
70
|
permissions:
|
71
71
|
contents: 'read'
|
72
72
|
id-token: 'write'
|
73
|
+
pages: 'write'
|
74
|
+
environment:
|
75
|
+
name: github-pages
|
76
|
+
url: ${{ steps.coverage.outputs.page_url }}
|
73
77
|
|
74
78
|
steps:
|
75
79
|
|
@@ -134,6 +138,11 @@ jobs:
|
|
134
138
|
suitename: test-${{ inputs.python }}-${{ inputs.test_label }}
|
135
139
|
code_coverage: ${{ inputs.code_coverage }}
|
136
140
|
|
141
|
+
- name: extra info for summary
|
142
|
+
if: ${{ inputs.code_coverage }}
|
143
|
+
run: |
|
144
|
+
echo "EXTRA='Code Coverage: https://neuralmagic.github.io/compressed-tensors/'" >> $GITHUB_ENV
|
145
|
+
|
137
146
|
- name: summary
|
138
147
|
uses: neuralmagic/nm-actions/actions/summary-test@v1.13.0
|
139
148
|
if: success() || failure()
|
@@ -143,6 +152,7 @@ jobs:
|
|
143
152
|
python: ${{ inputs.python }}
|
144
153
|
whl: ${{ inputs.whl }}
|
145
154
|
test_status: ${{ steps.test.outputs.status }}
|
155
|
+
extra: ${{ env.EXTRA }}
|
146
156
|
|
147
157
|
- name: copy results to GCP
|
148
158
|
run: |
|
@@ -157,9 +167,13 @@ jobs:
|
|
157
167
|
retention-days: 5
|
158
168
|
|
159
169
|
- name: upload coverage report
|
160
|
-
uses: actions/upload-artifact@
|
161
|
-
if:
|
170
|
+
uses: actions/upload-pages-artifact@v3
|
171
|
+
if: ${{ inputs.code_coverage }}
|
162
172
|
with:
|
163
|
-
|
164
|
-
path: coverage-results/*
|
173
|
+
path: coverage-html
|
165
174
|
retention-days: 5
|
175
|
+
|
176
|
+
- name: deploy to Github Pages
|
177
|
+
id: coverage
|
178
|
+
uses: actions/deploy-pages@v4
|
179
|
+
if: ${{ inputs.code_coverage }}
|
@@ -32,7 +32,7 @@ jobs:
|
|
32
32
|
wf_category: ${{ inputs.wf_category || 'NIGHTLY' }}
|
33
33
|
gitref: ${{ inputs.gitref || 'main' }}
|
34
34
|
push_to_pypi: ${{ (github.event.schedule == '30 0 * * *') || inputs.push_to_pypi || false }}
|
35
|
-
test_configs: '[{"python":"3.11.4","label":"ubuntu-24.04","timeout":"40"},
|
35
|
+
test_configs: '[{"python":"3.11.4","label":"ubuntu-24.04","timeout":"40","code_coverage":true},
|
36
36
|
{"python":"3.10.12","label":"ubuntu-22.04","timeout":"40"},
|
37
37
|
{"python":"3.9.17","label":"k8s-h100-solo","timeout":"40"},
|
38
38
|
{"python":"3.12.6","label":"k8s-a100-duo","timeout":"40"}]'
|
@@ -1,6 +1,6 @@
|
|
1
1
|
Metadata-Version: 2.4
|
2
2
|
Name: compressed-tensors
|
3
|
-
Version: 0.10.
|
3
|
+
Version: 0.10.3a20250711
|
4
4
|
Summary: Library for utilization of compressed safetensors of neural network models
|
5
5
|
Home-page: https://github.com/neuralmagic/compressed-tensors
|
6
6
|
Author: Neuralmagic, Inc.
|
@@ -392,8 +392,8 @@ class ModelCompressor:
|
|
392
392
|
for prefix, module in tqdm(model.named_modules(), desc="Compressing model"):
|
393
393
|
|
394
394
|
if prefix in module_to_scheme or prefix in sparse_compression_targets:
|
395
|
-
module_device = get_execution_device(module)
|
396
|
-
is_meta =
|
395
|
+
module_device = get_execution_device(module)
|
396
|
+
is_meta = module_device.type == "meta"
|
397
397
|
|
398
398
|
exec_device = "meta" if is_meta else "cpu"
|
399
399
|
onloading_device = "meta" if is_meta else module_device
|
@@ -747,12 +747,16 @@ class ModelCompressor:
|
|
747
747
|
|
748
748
|
def map_module_to_scheme(model: Module) -> Dict[str, QuantizationScheme]:
|
749
749
|
"""
|
750
|
-
Returns a dictionary which maps quantized module names to their quantization
|
750
|
+
Returns a dictionary which maps quantized module names to their quantization
|
751
|
+
schemes. Only includes modules with weight quantization
|
751
752
|
"""
|
752
753
|
return {
|
753
754
|
fix_fsdp_module_name(name): module.quantization_scheme
|
754
755
|
for name, module in model.named_modules()
|
755
|
-
if
|
756
|
+
if (
|
757
|
+
hasattr(module, "quantization_scheme") and
|
758
|
+
module.quantization_scheme.weights is not None
|
759
|
+
)
|
756
760
|
}
|
757
761
|
|
758
762
|
|
@@ -178,9 +178,13 @@ def sparse24_bitmask_compress(
|
|
178
178
|
|
179
179
|
if tensor.is_meta:
|
180
180
|
num_rows, num_cols = tensor.shape
|
181
|
-
compressed_values = torch.empty(
|
181
|
+
compressed_values = torch.empty(
|
182
|
+
(num_rows, num_cols // 2), dtype=tensor.dtype, device="meta"
|
183
|
+
)
|
182
184
|
packed_cols = (num_cols + 7) // 8
|
183
|
-
bitmasks_packed = torch.empty(
|
185
|
+
bitmasks_packed = torch.empty(
|
186
|
+
(num_rows, packed_cols), dtype=torch.uint8, device="meta"
|
187
|
+
)
|
184
188
|
return compressed_values, bitmasks_packed
|
185
189
|
|
186
190
|
bytemasks = get_24_bytemasks(tensor=tensor)
|
@@ -1,6 +1,6 @@
|
|
1
1
|
Metadata-Version: 2.4
|
2
2
|
Name: compressed-tensors
|
3
|
-
Version: 0.10.
|
3
|
+
Version: 0.10.3a20250711
|
4
4
|
Summary: Library for utilization of compressed safetensors of neural network models
|
5
5
|
Home-page: https://github.com/neuralmagic/compressed-tensors
|
6
6
|
Author: Neuralmagic, Inc.
|
@@ -446,10 +446,7 @@ def test_compress_model_meta(model_stub, q_format, s_config):
|
|
446
446
|
cpu_model, s_config, q_format
|
447
447
|
)
|
448
448
|
# Only stores dtype because meta model does not store values
|
449
|
-
expected = {
|
450
|
-
k: v.dtype
|
451
|
-
for k, v in reference_compressor.compress(cpu_model).items()
|
452
|
-
}
|
449
|
+
expected = {k: v.dtype for k, v in reference_compressor.compress(cpu_model).items()}
|
453
450
|
|
454
451
|
# Load model on meta device
|
455
452
|
meta_model = AutoModelForCausalLM.from_pretrained(
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
{compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/test_registry.py
RENAMED
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
{compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/tests/testing_utils.py
RENAMED
File without changes
|
{compressed_tensors-0.10.3a20250709 → compressed_tensors-0.10.3a20250711}/utils/copyright.py
RENAMED
File without changes
|