PyPI - hydraflow - Versions diffs - 0.15.1__tar.gz → 0.16.1__tar.gz - Mend

hydraflow 0.15.1tar.gz → 0.16.1tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (101) hide show

{hydraflow-0.15.1 → hydraflow-0.16.1}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: hydraflow
-Version: 0.15.1
+Version: 0.16.1
 Summary: HydraFlow seamlessly integrates Hydra and MLflow to streamline ML experiment management, combining Hydra's configuration management with MLflow's tracking capabilities.
 Project-URL: Documentation, https://daizutabi.github.io/hydraflow/
 Project-URL: Source, https://github.com/daizutabi/hydraflow
@@ -51,7 +51,7 @@ Requires-Dist: ruff>=0.11
 Requires-Dist: typer>=0.15
 Description-Content-Type: text/markdown
-# Hydraflow
+# HydraFlow
 [![PyPI Version][pypi-v-image]][pypi-v-link]
 [![Build Status][GHAction-image]][GHAction-link]
@@ -60,6 +60,7 @@ Description-Content-Type: text/markdown
 [![Python Version][python-v-image]][python-v-link]
 <!-- Badges -->
 [pypi-v-image]: https://img.shields.io/pypi/v/hydraflow.svg
 [pypi-v-link]: https://pypi.org/project/hydraflow/
 [GHAction-image]: https://github.com/daizutabi/hydraflow/actions/workflows/ci.yaml/badge.svg?branch=main&event=push
@@ -73,117 +74,125 @@ Description-Content-Type: text/markdown
 ## Overview
-Hydraflow is a library designed to seamlessly integrate
-[Hydra](https://hydra.cc/) and [MLflow](https://mlflow.org/), making it easier to
-manage and track machine learning experiments. By combining the flexibility of
-Hydra's configuration management with the robust experiment tracking capabilities
-of MLflow, Hydraflow provides a comprehensive solution for managing complex
-machine learning workflows.
+HydraFlow seamlessly integrates [Hydra](https://hydra.cc/) and [MLflow](https://mlflow.org/) to streamline machine learning experiment workflows. By combining Hydra's powerful configuration management with MLflow's robust experiment tracking, HydraFlow provides a comprehensive solution for defining, executing, and analyzing machine learning experiments.
+## Design Principles
+HydraFlow is built on the following design principles:
+1. **Type Safety** - Utilizing Python dataclasses for configuration type checking and IDE support
+2. **Reproducibility** - Automatically tracking all experiment configurations for fully reproducible experiments
+3. **Analysis Capabilities** - Providing powerful APIs for easily analyzing experiment results
+4. **Workflow Integration** - Creating a cohesive workflow by integrating Hydra's configuration management with MLflow's experiment tracking
 ## Key Features
-- **Configuration Management**: Utilize Hydra's advanced configuration management
-  to handle complex parameter sweeps and experiment setups.
-- **Experiment Tracking**: Leverage MLflow's tracking capabilities to log parameters,
-  metrics, and artifacts for each run.
-- **Artifact Management**: Automatically log and manage artifacts, such as model
-  checkpoints and configuration files, with MLflow.
-- **Seamless Integration**: Easily integrate Hydra and MLflow in your machine learning
-  projects with minimal setup.
-- **Rich CLI Interface**: Command-line tools for managing experiments and viewing results.
-- **Cross-Platform Support**: Works consistently across different operating systems.
+- **Type-safe Configuration Management** - Define experiment parameters using Python dataclasses with full IDE support and validation
+- **Seamless Hydra-MLflow Integration** - Automatically register configurations with Hydra and track experiments with MLflow
+- **Advanced Parameter Sweeps** - Define complex parameter spaces using extended sweep syntax for numerical ranges, combinations, and SI prefixes
+- **Workflow Automation** - Create reusable experiment workflows with YAML-based job definitions
+- **Powerful Analysis Tools** - Filter, group, and analyze experiment results with type-aware APIs
+- **Custom Implementation Support** - Extend experiment analysis with domain-specific functionality
 ## Installation
-You can install Hydraflow via pip:
 ```bash
 pip install hydraflow
 ```
 **Requirements:** Python 3.13+
-## Quick Start
-Here is a simple example to get you started with Hydraflow:
+## Quick Example
 ```python
-from __future__ import annotations
 from dataclasses import dataclass
-from typing import TYPE_CHECKING
+from mlflow.entities import Run
 import hydraflow
-import mlflow
-if TYPE_CHECKING:
-    from mlflow.entities import Run
+@dataclass
+class Config:
+    width: int = 1024
+    height: int = 768
+@hydraflow.main(Config)
+def app(run: Run, cfg: Config) -> None:
+    # Your experiment code here
+    print(f"Running with width={cfg.width}, height={cfg.height}")
+    # Log metrics
+    hydraflow.log_metric("area", cfg.width * cfg.height)
+if __name__ == "__main__":
+    app()
+```
+Execute a parameter sweep with:
+```bash
+python app.py -m width=800,1200 height=600,900
+```
+## Core Components
+HydraFlow consists of the following key components:
+### Configuration Management
+Define type-safe configurations using Python dataclasses:
+```python
 @dataclass
 class Config:
-    """Configuration for the ML training experiment."""
-    # Training hyperparameters
     learning_rate: float = 0.001
     batch_size: int = 32
     epochs: int = 10
+```
-    # Model architecture parameters
-    hidden_size: int = 128
-    dropout: float = 0.1
-    # Dataset parameters
-    train_size: float = 0.8
-    random_seed: int = 42
+### Main Decorator
+The `@hydraflow.main` decorator integrates Hydra and MLflow:
+```python
 @hydraflow.main(Config)
-def app(run: Run, cfg: Config):
-    """Train a model with the given configuration.
-    This example demonstrates how to:
+def train(run: Run, cfg: Config) -> None:
+    # Your experiment code
+```
-    1. Define a configuration using dataclasses
-    2. Use Hydraflow to integrate with MLflow
-    3. Track metrics and parameters automatically
+### Workflow Automation
-    Args:
-        run: MLflow run for the experiment corresponding to the Hydra app.
-            This `Run` instance is automatically created by Hydraflow.
-        cfg: Configuration for the experiment's run.
-            This `Config` instance is originally defined by Hydra, and then
-            automatically passed to the app by Hydraflow.
-    """
-    # Training loop
-    for epoch in range(cfg.epochs):
-        # Simulate training and validation
-        train_loss = 1.0 / (epoch + 1)
-        val_loss = 1.1 / (epoch + 1)
+Define reusable experiment workflows in YAML:
-        # Log metrics to MLflow
-        mlflow.log_metrics({
-            "train_loss": train_loss,
-            "val_loss": val_loss
-        }, step=epoch)
+```yaml
+jobs:
+  train_models:
+    run: python train.py
+    sets:
+      - each: model=small,medium,large
+        all: learning_rate=0.001,0.01,0.1
+```
-        print(f"Epoch {epoch}: train_loss={train_loss:.4f}, val_loss={val_loss:.4f}")
+### Analysis Tools
+Analyze experiment results with powerful APIs:
-if __name__ == "__main__":
-    app()
-```
+```python
+from hydraflow import Run, iter_run_dirs
-This example demonstrates:
+# Load runs
+runs = Run.load(iter_run_dirs("mlruns"))
-- Configuration management with Hydra
-- Automatic experiment tracking with MLflow
-- Parameter logging and metric tracking
-- Type-safe configuration with dataclasses
+# Filter and analyze
+best_runs = runs.filter(model_type="transformer").to_frame("learning_rate", "accuracy")
+```
 ## Documentation
-For detailed documentation, including advanced usage examples and API reference,
-visit our [documentation site](https://daizutabi.github.io/hydraflow/).
+For detailed documentation, visit our [documentation site](https://daizutabi.github.io/hydraflow/):
+- [Getting Started](https://daizutabi.github.io/hydraflow/getting-started/) - Installation and core concepts
+- [Practical Tutorials](https://daizutabi.github.io/hydraflow/practical-tutorials/) - Learn through hands-on examples
+- [User Guide](https://daizutabi.github.io/hydraflow/part1-applications/) - Detailed documentation of HydraFlow's capabilities
+- [API Reference](https://daizutabi.github.io/hydraflow/api/hydraflow/) - Complete API documentation
 ## Contributing
@@ -191,4 +200,4 @@ We welcome contributions! Please see our [contributing guide](CONTRIBUTING.md) f
 ## License
-This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
+This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

hydraflow-0.16.1/README.md ADDED Viewed

@@ -0,0 +1,150 @@
+# HydraFlow
+[![PyPI Version][pypi-v-image]][pypi-v-link]
+[![Build Status][GHAction-image]][GHAction-link]
+[![Coverage Status][codecov-image]][codecov-link]
+[![Documentation Status][docs-image]][docs-link]
+[![Python Version][python-v-image]][python-v-link]
+<!-- Badges -->
+[pypi-v-image]: https://img.shields.io/pypi/v/hydraflow.svg
+[pypi-v-link]: https://pypi.org/project/hydraflow/
+[GHAction-image]: https://github.com/daizutabi/hydraflow/actions/workflows/ci.yaml/badge.svg?branch=main&event=push
+[GHAction-link]: https://github.com/daizutabi/hydraflow/actions?query=event%3Apush+branch%3Amain
+[codecov-image]: https://codecov.io/github/daizutabi/hydraflow/coverage.svg?branch=main
+[codecov-link]: https://codecov.io/github/daizutabi/hydraflow?branch=main
+[docs-image]: https://img.shields.io/badge/docs-latest-blue.svg
+[docs-link]: https://daizutabi.github.io/hydraflow/
+[python-v-image]: https://img.shields.io/pypi/pyversions/hydraflow.svg
+[python-v-link]: https://pypi.org/project/hydraflow
+## Overview
+HydraFlow seamlessly integrates [Hydra](https://hydra.cc/) and [MLflow](https://mlflow.org/) to streamline machine learning experiment workflows. By combining Hydra's powerful configuration management with MLflow's robust experiment tracking, HydraFlow provides a comprehensive solution for defining, executing, and analyzing machine learning experiments.
+## Design Principles
+HydraFlow is built on the following design principles:
+1. **Type Safety** - Utilizing Python dataclasses for configuration type checking and IDE support
+2. **Reproducibility** - Automatically tracking all experiment configurations for fully reproducible experiments
+3. **Analysis Capabilities** - Providing powerful APIs for easily analyzing experiment results
+4. **Workflow Integration** - Creating a cohesive workflow by integrating Hydra's configuration management with MLflow's experiment tracking
+## Key Features
+- **Type-safe Configuration Management** - Define experiment parameters using Python dataclasses with full IDE support and validation
+- **Seamless Hydra-MLflow Integration** - Automatically register configurations with Hydra and track experiments with MLflow
+- **Advanced Parameter Sweeps** - Define complex parameter spaces using extended sweep syntax for numerical ranges, combinations, and SI prefixes
+- **Workflow Automation** - Create reusable experiment workflows with YAML-based job definitions
+- **Powerful Analysis Tools** - Filter, group, and analyze experiment results with type-aware APIs
+- **Custom Implementation Support** - Extend experiment analysis with domain-specific functionality
+## Installation
+```bash
+pip install hydraflow
+```
+**Requirements:** Python 3.13+
+## Quick Example
+```python
+from dataclasses import dataclass
+from mlflow.entities import Run
+import hydraflow
+@dataclass
+class Config:
+    width: int = 1024
+    height: int = 768
+@hydraflow.main(Config)
+def app(run: Run, cfg: Config) -> None:
+    # Your experiment code here
+    print(f"Running with width={cfg.width}, height={cfg.height}")
+    # Log metrics
+    hydraflow.log_metric("area", cfg.width * cfg.height)
+if __name__ == "__main__":
+    app()
+```
+Execute a parameter sweep with:
+```bash
+python app.py -m width=800,1200 height=600,900
+```
+## Core Components
+HydraFlow consists of the following key components:
+### Configuration Management
+Define type-safe configurations using Python dataclasses:
+```python
+@dataclass
+class Config:
+    learning_rate: float = 0.001
+    batch_size: int = 32
+    epochs: int = 10
+```
+### Main Decorator
+The `@hydraflow.main` decorator integrates Hydra and MLflow:
+```python
+@hydraflow.main(Config)
+def train(run: Run, cfg: Config) -> None:
+    # Your experiment code
+```
+### Workflow Automation
+Define reusable experiment workflows in YAML:
+```yaml
+jobs:
+  train_models:
+    run: python train.py
+    sets:
+      - each: model=small,medium,large
+        all: learning_rate=0.001,0.01,0.1
+```
+### Analysis Tools
+Analyze experiment results with powerful APIs:
+```python
+from hydraflow import Run, iter_run_dirs
+# Load runs
+runs = Run.load(iter_run_dirs("mlruns"))
+# Filter and analyze
+best_runs = runs.filter(model_type="transformer").to_frame("learning_rate", "accuracy")
+```
+## Documentation
+For detailed documentation, visit our [documentation site](https://daizutabi.github.io/hydraflow/):
+- [Getting Started](https://daizutabi.github.io/hydraflow/getting-started/) - Installation and core concepts
+- [Practical Tutorials](https://daizutabi.github.io/hydraflow/practical-tutorials/) - Learn through hands-on examples
+- [User Guide](https://daizutabi.github.io/hydraflow/part1-applications/) - Detailed documentation of HydraFlow's capabilities
+- [API Reference](https://daizutabi.github.io/hydraflow/api/hydraflow/) - Complete API documentation
+## Contributing
+We welcome contributions! Please see our [contributing guide](CONTRIBUTING.md) for details.
+## License
+This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

{hydraflow-0.15.1 → hydraflow-0.16.1}/docs/part1-applications/main-decorator.md RENAMED Viewed

@@ -242,6 +242,36 @@ This option is particularly useful when:
 - You're iterating on experiments with command-line variations
 - Your configuration contains volatile or automatically generated values
+### Dynamic Configuration Updates (`update`)
+Modify or enhance the configuration after it has been loaded by Hydra
+but before the run starts:
+```python
+def update_config(cfg: Config) -> Config:
+    # Calculate derived values or add runtime information
+    if cfg.width > 0 and cfg.height > 0:
+        cfg.area = cfg.width * cfg.height
+    return cfg
+@hydraflow.main(Config, update=update_config)
+def train(run: Run, cfg: Config) -> None:
+    # Configuration has been updated with calculated area
+    print(f"Area: {cfg.area}")
+```
+This option is powerful when you need to:
+- Calculate derived parameters based on existing configuration values
+- Apply conditional logic to adjust parameters based on their relationships
+- Ensure consistency between related parameters
+- Adapt configurations to the current environment (e.g., hardware capabilities)
+The `update` function should accept a configuration object and
+return the same object (or None).
+Any changes made to the configuration will be saved to the run's configuration file,
+ensuring that the stored configuration accurately reflects all updates.
 ## Best Practices
 1. **Keep Configuration Classes Focused**: Break down complex configurations

{hydraflow-0.15.1 → hydraflow-0.16.1}/docs/part3-analysis/run-class.md RENAMED Viewed

@@ -56,9 +56,22 @@ learning_rate = run.get("learning_rate")
 # Nested access with dot notation
 model_type = run.get("model.type")
+# Alternatively, use double underscore notation for nested access
+model_type = run.get("model__type")  # Equivalent to "model.type"
 # Access implementation attributes or run info
 metric_value = run.get("accuracy")  # From impl or cfg
 run_id = run.get("run_id")  # From RunInfo
+# Provide a default value if the key doesn't exist
+batch_size = run.get("batch_size", 32)
+# Use a callable as default to dynamically generate values based on the run
+# This is useful for derived parameters or conditional defaults
+lr = run.get("learning_rate", default=lambda r: r.get("base_lr", 0.01) / 10)
+# Complex default logic based on other parameters
+steps = run.get("steps", default=lambda r: r.get("epochs", 10) * r.get("steps_per_epoch", 100))
 ```
 The `get` method searches for values in the following order:
@@ -69,6 +82,20 @@ The `get` method searches for values in the following order:
 This provides a unified access interface regardless of where the data is stored.
+The double underscore notation (`__`) is automatically converted to dot notation (`.`) internally,
+making it useful for nested parameter access, especially when using keyword arguments in methods
+that don't allow dots in parameter names.
+When providing a default value, you can use either a static value or a callable function.
+If you provide a callable, it will receive the Run instance as an argument, allowing you to
+create context-dependent default values that can access other run parameters or properties.
+This is particularly useful for:
+- Creating derived parameters that don't exist in the original configuration
+- Handling schema evolution across different experiment iterations
+- Providing fallbacks that depend on other configuration values
+- Implementing conditional logic for parameter defaults
 ## Type-Safe Configuration Access
 For better IDE integration and type checking, you can specify the configuration

hydraflow 0.15.1__tar.gz → 0.16.1__tar.gz

hydraflow 0.15.1tar.gz → 0.16.1tar.gz