datachain 0.37.7__tar.gz → 0.37.8__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Potentially problematic release.
This version of datachain might be problematic. Click here for more details.
- {datachain-0.37.7 → datachain-0.37.8}/PKG-INFO +1 -1
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/datachain.py +19 -3
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/query/dataset.py +22 -5
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/toolkit/split.py +30 -8
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain.egg-info/PKG-INFO +1 -1
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_toolkit.py +34 -4
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_datachain.py +14 -0
- {datachain-0.37.7 → datachain-0.37.8}/.cruft.json +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.gitattributes +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.github/ISSUE_TEMPLATE/bug_report.yml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.github/ISSUE_TEMPLATE/empty_issue.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.github/ISSUE_TEMPLATE/feature_request.yml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.github/codecov.yaml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.github/dependabot.yml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.github/workflows/benchmarks.yml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.github/workflows/release.yml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.github/workflows/tests-studio.yml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.github/workflows/tests.yml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.github/workflows/update-template.yaml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.gitignore +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/.pre-commit-config.yaml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/CODE_OF_CONDUCT.rst +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/LICENSE +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/README.rst +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/api_hooks.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/assets/captioned_cartoons.png +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/assets/datachain-white.svg +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/assets/datachain.svg +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/assets/webhook_dialog.png +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/assets/webhook_list.png +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/commands/auth/login.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/commands/auth/logout.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/commands/auth/team.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/commands/auth/token.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/commands/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/commands/job/cancel.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/commands/job/clusters.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/commands/job/logs.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/commands/job/ls.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/commands/job/run.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/contributing.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/css/github-permalink-style.css +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/examples.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/guide/checkpoints.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/guide/db_migrations.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/guide/delta.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/guide/env.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/guide/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/guide/namespaces.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/guide/processing.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/guide/remotes.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/guide/retry.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/overrides/main.html +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/quick-start.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/data-types/arrowrow.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/data-types/bbox.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/data-types/file.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/data-types/imagefile.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/data-types/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/data-types/pose.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/data-types/segment.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/data-types/tarvfile.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/data-types/textfile.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/data-types/videofile.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/datachain.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/func.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/functions/aggregate.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/functions/array.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/functions/conditional.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/functions/numeric.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/functions/path.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/functions/random.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/functions/string.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/functions/window.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/toolkit.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/torch.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/references/udf.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/api/.gitkeep +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/ca-certificates.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/git-forges/bitbucket.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/git-forges/github.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/git-forges/gitlab.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/git-forges/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/ssl-tls.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/installation/aws-ami.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/installation/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/installation/k8s-helm.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/troubleshooting/502-errors.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/troubleshooting/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/troubleshooting/support-bundle.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/upgrading/airgap-procedure.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/upgrading/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/upgrading/regular-procedure.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/account-management.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/authentication/openid-connect.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/authentication/single-sign-on.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/configure-a-project.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/create-a-project.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/explore-ml-experiments.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/live-metrics-and-plots.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/run-experiments.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/share-a-project.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/visualize-and-compare.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/git-connections/custom-gitlab-server.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/git-connections/github-app.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/git-connections/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/jobs/create-and-run.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/jobs/index.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/jobs/monitor-jobs.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/model-registry/add-a-model.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/model-registry/assign-stage.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/model-registry/register-version.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/model-registry/remove-a-model-or-its-details.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/model-registry/use-models.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/model-registry/view-and-compare-models.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/team-collaboration.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/troubleshooting.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/studio/webhooks.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/templates/main.dot +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/templates/operation.dot +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/templates/responses.def +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/docs/tutorials.md +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/computer_vision/iptc_exif_xmp_lib.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/computer_vision/llava2_image_desc_lib.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/computer_vision/openimage-detect.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/computer_vision/ultralytics-bbox.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/computer_vision/ultralytics-pose.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/computer_vision/ultralytics-segment.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/get_started/common_sql_functions.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/get_started/json-csv-reader.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/get_started/nested_datamodel.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/get_started/torch-loader.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/get_started/udfs/parallel.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/get_started/udfs/simple.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/get_started/udfs/stateful.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/incremental_processing/delta.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/incremental_processing/retry.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/incremental_processing/utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/llm_and_nlp/claude-query.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/llm_and_nlp/hf-dataset-llm-eval.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/multimodal/audio-to-text.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/multimodal/clip_inference.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/multimodal/hf_pipeline.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/multimodal/openai_image_desc_lib.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/multimodal/wds.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/examples/multimodal/wds_filtered.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/mkdocs.yml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/noxfile.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/pyproject.toml +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/setup.cfg +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/__main__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/asyn.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cache.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/catalog/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/catalog/catalog.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/catalog/datasource.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/catalog/dependency.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/catalog/loader.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/checkpoint.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/commands/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/commands/datasets.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/commands/du.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/commands/index.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/commands/ls.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/commands/misc.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/commands/query.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/commands/show.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/parser/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/parser/job.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/parser/studio.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/parser/utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/cli/utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/client/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/client/azure.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/client/fileslice.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/client/fsspec.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/client/gcs.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/client/hf.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/client/http.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/client/local.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/client/s3.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/config.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/data_storage/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/data_storage/db_engine.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/data_storage/job.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/data_storage/metastore.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/data_storage/schema.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/data_storage/serializer.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/data_storage/sqlite.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/data_storage/warehouse.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/dataset.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/delta.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/diff/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/error.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/fs/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/fs/reference.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/fs/utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/aggregate.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/array.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/base.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/conditional.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/func.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/numeric.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/path.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/random.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/string.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/func/window.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/hash_utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/job.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/arrow.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/audio.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/clip.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/convert/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/convert/flatten.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/convert/python_to_sql.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/convert/sql_to_python.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/convert/unflatten.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/convert/values_to_tuples.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/data_model.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dataset_info.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/csv.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/database.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/datasets.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/hf.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/json.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/listings.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/pandas.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/parquet.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/records.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/storage.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/storage_pattern.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/dc/values.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/file.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/hf.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/image.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/listing.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/listing_info.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/meta_formats.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/model_store.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/namespaces.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/projects.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/pytorch.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/settings.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/signal_schema.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/tar.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/text.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/udf.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/udf_signature.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/video.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/webdataset.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/lib/webdataset_laion.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/listing.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/model/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/model/bbox.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/model/pose.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/model/segment.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/model/ultralytics/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/model/ultralytics/bbox.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/model/ultralytics/pose.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/model/ultralytics/segment.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/model/utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/namespace.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/node.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/nodes_fetcher.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/nodes_thread_pool.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/plugins.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/progress.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/project.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/py.typed +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/query/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/query/batch.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/query/dispatch.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/query/metrics.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/query/params.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/query/queue.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/query/schema.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/query/session.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/query/udf.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/remote/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/remote/studio.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/script_meta.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/semver.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/default/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/default/base.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/functions/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/functions/aggregate.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/functions/array.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/functions/conditional.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/functions/numeric.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/functions/path.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/functions/random.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/functions/string.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/postgresql_dialect.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/postgresql_types.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/selectable.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/sqlite/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/sqlite/base.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/sqlite/types.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/sqlite/vector.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/types.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/sql/utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/studio.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/telemetry.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/toolkit/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/torch/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain/utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain.egg-info/SOURCES.txt +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain.egg-info/dependency_links.txt +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain.egg-info/entry_points.txt +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain.egg-info/requires.txt +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/src/datachain.egg-info/top_level.txt +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/benchmarks/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/benchmarks/conftest.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/benchmarks/datasets/.dvc/.gitignore +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/benchmarks/datasets/.dvc/config +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/benchmarks/datasets/.gitignore +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/benchmarks/datasets/laion-tiny.npz.dvc +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/benchmarks/test_datachain.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/benchmarks/test_ls.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/benchmarks/test_version.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/conftest.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/data.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/examples/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/examples/test_examples.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/examples/test_wds_e2e.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/examples/wds_data.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/data/Big_Buck_Bunny_360_10s_1MB.mp4 +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/data/lena.jpg +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/fake-service-account-credentials.json +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/functions/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/functions/test_aggregate.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/functions/test_array.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/functions/test_conditional.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/functions/test_numeric.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/functions/test_path.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/functions/test_random.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/functions/test_string.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/model/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/model/data/running-mask0.png +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/model/data/running-mask1.png +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/model/data/running.jpg +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/model/data/ships.jpg +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/model/test_yolo.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_audio.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_catalog.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_checkpoints.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_client.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_cloud_transfer.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_data_storage.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_datachain.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_datachain_merge.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_dataset_query.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_datasets.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_delta.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_feature_pickling.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_file.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_hf.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_hidden_field.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_image.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_listing.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_ls.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_meta_formats.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_metastore.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_metrics.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_mutate.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_pull.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_pytorch.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_query.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_read_database.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_read_dataset_remote.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_read_dataset_version_specifiers.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_retry.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_session.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_storage_pattern.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_studio_datetime_parsing.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_temp_table_tracking.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_to_database.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_udf.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_union.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_video.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/func/test_warehouse.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/scripts/feature_class.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/scripts/feature_class_exception.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/scripts/feature_class_parallel.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/scripts/feature_class_parallel_data_model.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/scripts/name_len_slow.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/test_atomicity.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/test_cli_e2e.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/test_cli_studio.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/test_import_time.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/test_job_management_e2e.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/test_query_e2e.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/test_telemetry.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/conftest.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_arrow.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_audio.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_checkpoints.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_clip.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_datachain_bootstrap.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_datachain_merge.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_diff.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_feature.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_feature_utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_file.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_hf.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_image.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_listing_info.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_namespace.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_partition_by.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_project.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_python_to_sql.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_schema.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_settings.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_signal_schema.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_sql_to_python.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_storage_pattern.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_text.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_udf.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_udf_signature.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/lib/test_webdataset.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/model/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/model/test_bbox.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/model/test_pose.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/model/test_segment.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/model/test_utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/sql/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/sql/sqlite/__init__.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/sql/sqlite/test_types.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/sql/sqlite/test_utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/sql/test_array.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/sql/test_conditional.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/sql/test_path.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/sql/test_random.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/sql/test_selectable.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/sql/test_string.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_asyn.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_batching.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_cache.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_catalog.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_catalog_loader.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_cli_datasets.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_cli_parsing.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_client.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_client_gcs.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_client_http.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_client_s3.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_config.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_data_storage.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_database_engine.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_datachain_hash.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_dataset.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_dispatch.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_fileslice.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_func.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_hash_utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_job_management.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_listing.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_metastore.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_module_exports.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_pytorch.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_query.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_query_metrics.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_query_params.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_query_steps_hash.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_script_meta.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_semver.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_serializer.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_session.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_utils.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/unit/test_warehouse.py +0 -0
- {datachain-0.37.7 → datachain-0.37.8}/tests/utils.py +0 -0
|
@@ -52,7 +52,11 @@ from datachain.lib.udf_signature import UdfSignature
|
|
|
52
52
|
from datachain.lib.utils import DataChainColumnError, DataChainParamsError
|
|
53
53
|
from datachain.project import Project
|
|
54
54
|
from datachain.query import Session
|
|
55
|
-
from datachain.query.dataset import
|
|
55
|
+
from datachain.query.dataset import (
|
|
56
|
+
DatasetQuery,
|
|
57
|
+
PartitionByType,
|
|
58
|
+
RegenerateSystemColumns,
|
|
59
|
+
)
|
|
56
60
|
from datachain.query.schema import DEFAULT_DELIMITER, Column
|
|
57
61
|
from datachain.sql.functions import path as pathfunc
|
|
58
62
|
from datachain.utils import batched_it, env2bool, inside_notebook, row_to_nested_dict
|
|
@@ -2740,8 +2744,20 @@ class DataChain:
|
|
|
2740
2744
|
)
|
|
2741
2745
|
|
|
2742
2746
|
def shuffle(self) -> "Self":
|
|
2743
|
-
"""Shuffle
|
|
2744
|
-
|
|
2747
|
+
"""Shuffle rows with a best-effort deterministic ordering.
|
|
2748
|
+
|
|
2749
|
+
This produces repeatable shuffles. Merge and union operations can
|
|
2750
|
+
lead to non-deterministic results. Use order by or save a dataset
|
|
2751
|
+
afterward to guarantee the same result.
|
|
2752
|
+
"""
|
|
2753
|
+
query = self._query.clone(new_table=False)
|
|
2754
|
+
query.steps.append(RegenerateSystemColumns(self._query.catalog))
|
|
2755
|
+
|
|
2756
|
+
chain = self._evolve(
|
|
2757
|
+
query=query,
|
|
2758
|
+
signal_schema=SignalSchema({"sys": Sys}) | self.signals_schema,
|
|
2759
|
+
)
|
|
2760
|
+
return chain.order_by("sys.rand")
|
|
2745
2761
|
|
|
2746
2762
|
def sample(self, n: int) -> "Self":
|
|
2747
2763
|
"""Return a random sample from the chain.
|
|
@@ -786,10 +786,31 @@ class SQLClause(Step, ABC):
|
|
|
786
786
|
return tuple(c.get_column() if isinstance(c, Function) else c for c in cols)
|
|
787
787
|
|
|
788
788
|
@abstractmethod
|
|
789
|
-
def apply_sql_clause(self, query):
|
|
789
|
+
def apply_sql_clause(self, query: Any) -> Any:
|
|
790
790
|
pass
|
|
791
791
|
|
|
792
792
|
|
|
793
|
+
@frozen
|
|
794
|
+
class RegenerateSystemColumns(Step):
|
|
795
|
+
catalog: "Catalog"
|
|
796
|
+
|
|
797
|
+
def hash_inputs(self) -> str:
|
|
798
|
+
return hashlib.sha256(b"regenerate_system_columns").hexdigest()
|
|
799
|
+
|
|
800
|
+
def apply(
|
|
801
|
+
self, query_generator: QueryGenerator, temp_tables: list[str]
|
|
802
|
+
) -> StepResult:
|
|
803
|
+
query = query_generator.select()
|
|
804
|
+
new_query = self.catalog.warehouse._regenerate_system_columns(
|
|
805
|
+
query, keep_existing_columns=True
|
|
806
|
+
)
|
|
807
|
+
|
|
808
|
+
def q(*columns):
|
|
809
|
+
return new_query.with_only_columns(*columns)
|
|
810
|
+
|
|
811
|
+
return step_result(q, new_query.selected_columns)
|
|
812
|
+
|
|
813
|
+
|
|
793
814
|
@frozen
|
|
794
815
|
class SQLSelect(SQLClause):
|
|
795
816
|
args: tuple[Function | ColumnElement, ...]
|
|
@@ -1488,10 +1509,6 @@ class DatasetQuery:
|
|
|
1488
1509
|
finally:
|
|
1489
1510
|
self.cleanup()
|
|
1490
1511
|
|
|
1491
|
-
def shuffle(self) -> "Self":
|
|
1492
|
-
# ToDo: implement shaffle based on seed and/or generating random column
|
|
1493
|
-
return self.order_by(C.sys__rand)
|
|
1494
|
-
|
|
1495
1512
|
def sample(self, n) -> "Self":
|
|
1496
1513
|
"""
|
|
1497
1514
|
Return a random sample from the dataset.
|
|
@@ -1,6 +1,7 @@
|
|
|
1
1
|
import random
|
|
2
2
|
|
|
3
3
|
from datachain import C, DataChain
|
|
4
|
+
from datachain.lib.signal_schema import SignalResolvingError
|
|
4
5
|
|
|
5
6
|
RESOLUTION = 2**31 - 1 # Maximum positive value for a 32-bit signed integer.
|
|
6
7
|
|
|
@@ -59,7 +60,10 @@ def train_test_split(
|
|
|
59
60
|
```
|
|
60
61
|
|
|
61
62
|
Note:
|
|
62
|
-
|
|
63
|
+
Splits reuse the same best-effort shuffle used by `DataChain.shuffle`. Results
|
|
64
|
+
are typically repeatable, but earlier operations such as `merge`, `union`, or
|
|
65
|
+
custom SQL that reshuffle rows can change the outcome between runs. Add order by
|
|
66
|
+
stable keys first when you need strict reproducibility.
|
|
63
67
|
"""
|
|
64
68
|
if len(weights) < 2:
|
|
65
69
|
raise ValueError("Weights should have at least two elements")
|
|
@@ -68,16 +72,34 @@ def train_test_split(
|
|
|
68
72
|
|
|
69
73
|
weights_normalized = [weight / sum(weights) for weight in weights]
|
|
70
74
|
|
|
75
|
+
try:
|
|
76
|
+
dc.signals_schema.resolve("sys.rand")
|
|
77
|
+
except SignalResolvingError:
|
|
78
|
+
dc = dc.persist()
|
|
79
|
+
|
|
71
80
|
rand_col = C("sys.rand")
|
|
72
81
|
if seed is not None:
|
|
73
82
|
uniform_seed = random.Random(seed).randrange(1, RESOLUTION) # noqa: S311
|
|
74
83
|
rand_col = (rand_col % RESOLUTION) * uniform_seed # type: ignore[assignment]
|
|
75
84
|
rand_col = rand_col % RESOLUTION # type: ignore[assignment]
|
|
76
85
|
|
|
77
|
-
|
|
78
|
-
|
|
79
|
-
|
|
80
|
-
|
|
81
|
-
)
|
|
82
|
-
|
|
83
|
-
|
|
86
|
+
boundaries: list[int] = [0]
|
|
87
|
+
cumulative = 0.0
|
|
88
|
+
for weight in weights_normalized[:-1]:
|
|
89
|
+
cumulative += weight
|
|
90
|
+
boundary = round(cumulative * RESOLUTION)
|
|
91
|
+
boundaries.append(min(boundary, RESOLUTION))
|
|
92
|
+
boundaries.append(RESOLUTION)
|
|
93
|
+
|
|
94
|
+
splits: list[DataChain] = []
|
|
95
|
+
last_index = len(weights_normalized) - 1
|
|
96
|
+
for index in range(len(weights_normalized)):
|
|
97
|
+
lower = boundaries[index]
|
|
98
|
+
if index == last_index:
|
|
99
|
+
condition = rand_col >= lower
|
|
100
|
+
else:
|
|
101
|
+
upper = boundaries[index + 1]
|
|
102
|
+
condition = (rand_col >= lower) & (rand_col < upper)
|
|
103
|
+
splits.append(dc.filter(condition))
|
|
104
|
+
|
|
105
|
+
return splits
|
|
@@ -1,5 +1,6 @@
|
|
|
1
1
|
import pytest
|
|
2
2
|
|
|
3
|
+
import datachain as dc
|
|
3
4
|
from datachain.toolkit import train_test_split
|
|
4
5
|
|
|
5
6
|
|
|
@@ -18,8 +19,8 @@ def test_train_test_split_not_random(not_random_ds, seed, weights, expected):
|
|
|
18
19
|
res = train_test_split(not_random_ds, weights, seed=seed)
|
|
19
20
|
assert len(res) == len(expected)
|
|
20
21
|
|
|
21
|
-
for i,
|
|
22
|
-
assert
|
|
22
|
+
for i, chain in enumerate(res):
|
|
23
|
+
assert chain.to_values("sys.id") == expected[i]
|
|
23
24
|
|
|
24
25
|
|
|
25
26
|
@pytest.mark.parametrize(
|
|
@@ -40,8 +41,8 @@ def test_train_test_split_random(pseudo_random_ds, seed, weights, expected):
|
|
|
40
41
|
res = train_test_split(pseudo_random_ds, weights, seed=seed)
|
|
41
42
|
assert len(res) == len(expected)
|
|
42
43
|
|
|
43
|
-
for i,
|
|
44
|
-
assert
|
|
44
|
+
for i, chain in enumerate(res):
|
|
45
|
+
assert chain.to_values("sys.id") == expected[i]
|
|
45
46
|
|
|
46
47
|
|
|
47
48
|
def test_train_test_split_errors(not_random_ds):
|
|
@@ -49,3 +50,32 @@ def test_train_test_split_errors(not_random_ds):
|
|
|
49
50
|
train_test_split(not_random_ds, [0.5])
|
|
50
51
|
with pytest.raises(ValueError, match="Weights should be non-negative"):
|
|
51
52
|
train_test_split(not_random_ds, [-1, 1])
|
|
53
|
+
|
|
54
|
+
|
|
55
|
+
def test_split_after_merge(test_session):
|
|
56
|
+
left = dc.read_values(ids=[1, 2, 3, 4], session=test_session)
|
|
57
|
+
right = dc.read_values(
|
|
58
|
+
ids=[1, 2, 3, 4],
|
|
59
|
+
extra=["a", "b", "c", "d"],
|
|
60
|
+
session=test_session,
|
|
61
|
+
)
|
|
62
|
+
|
|
63
|
+
merged = left.merge(right, on="ids")
|
|
64
|
+
|
|
65
|
+
train, test = train_test_split(merged, [0.5, 0.5])
|
|
66
|
+
|
|
67
|
+
for split in (train, test):
|
|
68
|
+
sys_schema = split.signals_schema.resolve("sys.id", "sys.rand").values
|
|
69
|
+
assert sys_schema["sys.id"] is int
|
|
70
|
+
assert sys_schema["sys.rand"] is int
|
|
71
|
+
|
|
72
|
+
combined_rows = set(train.to_list("ids", "extra")) | set(
|
|
73
|
+
test.to_list("ids", "extra")
|
|
74
|
+
)
|
|
75
|
+
|
|
76
|
+
assert combined_rows == {
|
|
77
|
+
(1, "a"),
|
|
78
|
+
(2, "b"),
|
|
79
|
+
(3, "c"),
|
|
80
|
+
(4, "d"),
|
|
81
|
+
}
|
|
@@ -1235,6 +1235,20 @@ def test_persist_restores_sys_signals_after_merge(test_session):
|
|
|
1235
1235
|
assert sys_schema["sys.rand"] is int
|
|
1236
1236
|
|
|
1237
1237
|
|
|
1238
|
+
def test_shuffle_after_merge(test_session):
|
|
1239
|
+
left = dc.read_values(ids=[1, 2], session=test_session)
|
|
1240
|
+
right = dc.read_values(ids=[1, 2], extra=["x", "y"], session=test_session)
|
|
1241
|
+
|
|
1242
|
+
shuffled = left.merge(right, on="ids").shuffle()
|
|
1243
|
+
|
|
1244
|
+
sys_schema = shuffled.signals_schema.resolve("sys.id", "sys.rand").values
|
|
1245
|
+
assert sys_schema["sys.id"] is int
|
|
1246
|
+
assert sys_schema["sys.rand"] is int
|
|
1247
|
+
|
|
1248
|
+
rows = set(shuffled.to_list("ids", "extra"))
|
|
1249
|
+
assert rows == {(1, "x"), (2, "y")}
|
|
1250
|
+
|
|
1251
|
+
|
|
1238
1252
|
def test_unsupported_output_type(test_session):
|
|
1239
1253
|
vector = [3.14, 2.72, 1.62]
|
|
1240
1254
|
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/ca-certificates.md
RENAMED
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/git-forges/bitbucket.md
RENAMED
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/git-forges/github.md
RENAMED
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/git-forges/gitlab.md
RENAMED
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/configuration/git-forges/index.md
RENAMED
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/troubleshooting/502-errors.md
RENAMED
|
File without changes
|
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/troubleshooting/support-bundle.md
RENAMED
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/upgrading/airgap-procedure.md
RENAMED
|
File without changes
|
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/self-hosting/upgrading/regular-procedure.md
RENAMED
|
File without changes
|
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/authentication/openid-connect.md
RENAMED
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/authentication/single-sign-on.md
RENAMED
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/configure-a-project.md
RENAMED
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/create-a-project.md
RENAMED
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/explore-ml-experiments.md
RENAMED
|
File without changes
|
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/live-metrics-and-plots.md
RENAMED
|
File without changes
|
|
File without changes
|
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/experiments/visualize-and-compare.md
RENAMED
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/git-connections/custom-gitlab-server.md
RENAMED
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
{datachain-0.37.7 → datachain-0.37.8}/docs/studio/user-guide/model-registry/register-version.md
RENAMED
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|