PyPI - congrads - Versions diffs - 0.1.0__py3-none-any.whl → 0.2.0__py3-none-any.whl - Mend

congrads 0.1.0py3-none-any.whl → 0.2.0py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (17) hide show

congrads/__init__.py +7 -6
congrads/constraints.py +182 -300
congrads/core.py +158 -144
congrads/datasets.py +12 -559
congrads/descriptor.py +20 -35
congrads/metrics.py +37 -52
congrads/networks.py +5 -6
congrads/utils.py +310 -0
congrads-0.2.0.dist-info/LICENSE +26 -0
congrads-0.2.0.dist-info/METADATA +222 -0
congrads-0.2.0.dist-info/RECORD +13 -0
congrads/learners.py +0 -233
congrads-0.1.0.dist-info/LICENSE +0 -34
congrads-0.1.0.dist-info/METADATA +0 -196
congrads-0.1.0.dist-info/RECORD +0 -13
{congrads-0.1.0.dist-info → congrads-0.2.0.dist-info}/WHEEL +0 -0
{congrads-0.1.0.dist-info → congrads-0.2.0.dist-info}/top_level.txt +0 -0

congrads-0.2.0.dist-info/METADATA ADDED Viewed

@@ -0,0 +1,222 @@
+Metadata-Version: 2.1
+Name: congrads
+Version: 0.2.0
+Summary: A toolbox for using Constraint Guided Gradient Descent when training neural networks.
+Author-email: Wout Rombouts <wout.rombouts@kuleuven.be>, Quinten Van Baelen <quinten.vanbaelen@kuleuven.be>, Peter Karsmakers <peter.karsmakers@kuleuven.be>
+License: Copyright 2024 DTAI - KU Leuven
+        Redistribution and use in source and binary forms, with or without modification,
+        are permitted provided that the following conditions are met:
+        1. Redistributions of source code must retain the above copyright notice,
+        this list of conditions and the following disclaimer.
+        2. Redistributions in binary form must reproduce the above copyright notice,
+        this list of conditions and the following disclaimer in the documentation
+        and/or other materials provided with the distribution.
+        3. Neither the name of the copyright holder nor the names of its
+        contributors may be used to endorse or promote products derived from
+        this software without specific prior written permission.
+        THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS “AS IS”
+        AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
+        IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
+        ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE
+        LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
+        DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
+        SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
+        CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
+        OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
+        OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
+Requires-Python: >=3.9
+Description-Content-Type: text/markdown
+License-File: LICENSE
+Requires-Dist: torch>=1.12.0
+Requires-Dist: pandas>=2.2.2
+Requires-Dist: numpy>=1.26.4
+# Congrads
+**Congrads** is a Python toolbox that brings **constraint-guided gradient descent** capabilities to your machine learning projects. Built with seamless integration into PyTorch, Congrads empowers you to enhance the training and optimization process by incorporating constraints into your training pipeline.
+Whether you're working with simple inequality constraints, combinations of input-output relations, or custom constraint formulations, Congrads provides the tools and flexibility needed to build more robust and generalized models.
+> **Note:** The Congrads toolbox is **currently in alpha phase**. Expect significant changes, potential bugs, and incomplete features as we continue to develop and improve the functionality. Feedback is highly appreciated during this phase to help us refine the toolbox and ensure its reliability in later stages.
+## Key Features
+- **Constraint-Guided Training**: Add constraints to guide the optimization process, ensuring that your model generalizes better by trying to satisfy the constraints.
+- **Flexible Constraint Definition**: Define constraints on inputs, outputs, or combinations thereof, using an intuitive and extendable interface. Make use of pre-programmed constraint classes or write your own.
+- **Seamless PyTorch Integration**: Use Congrads within your existing PyTorch workflows with minimal setup.
+- **Flexible and extendible**: Write your own custom networks, constraints and dataset classes to easily extend the functionality of the toolbox.
+## Installation
+Currently, the **Congrads** toolbox can only be installed using pip. We will later expand to other package managers such as conda.
+```bash
+pip install congrads
+```
+## Getting Started
+### 1. **Prerequisites**
+Before you can use **Congrads**, make sure you have the following installed:
+- Python 3.6+ (preffered version 3.11)
+- **PyTorch** (install with CUDA support for GPU training, refer to [PyTorch's getting started guide](https://pytorch.org/get-started/locally/))
+- **NumPy** (install with ```pip install numpy```, or refer to [NumPy's install guide](https://numpy.org/install/).)
+- **Pandas** (install with ```pip install pandas```, or refer to [Panda's install guide](https://pandas.pydata.org/docs/getting_started/install.html).)
+### 2. **Installation**
+Please install **Congrads** via pip:
+```bash
+pip install congrads
+```
+### 3. **Basic Usage**
+#### 1. Import necessary classes and functions from the toolbox
+To start using the toolbox, import the required modules and functions. This includes classes for defining constraints, data processing, network setup, and training utilities.
+```python
+from congrads.constraints import BinaryConstraint, ScalarConstraint, Constraint
+from congrads.core import CongradsCore
+from congrads.datasets import BiasCorrection
+from congrads.descriptor import Descriptor
+from congrads.metrics import MetricManager
+from congrads.networks import MLPNetwork
+from congrads.utils import preprocess_BiasCorrection, splitDataLoaders
+```
+#### 2. Set up data and preprocessing
+The toolbox works with various datasets, and for this example, we are using the **BiasCorrection** dataset. After loading the dataset, it is preprocessed using a utility function and split into train, validation, and test sets using DataLoader instances.
+```python
+# Load and preprocess data
+data = BiasCorrection("./datasets", preprocess_BiasCorrection)
+loaders = splitDataLoaders(
+    data, loader_args={"batch_size": 100, "shuffle": True, "num_workers": 6}
+)
+```
+#### 3. Configure the network
+The model architecture used here is a Multi-Layer Perceptron (MLP) with 25 input features, 2 output features, and 3 hidden layers, each containing 35 neurons. The network outputs are later mapped to meaningful names using the descriptor.
+```python
+# Instantiate network and push to correct device
+network = MLPNetwork(25, 2, n_hidden_layers=3, hidden_dim=35)
+network = network.to(device)
+```
+#### 4. Instantiate loss and optimizer
+Define the loss function and optimizer, which are critical for training the model. In this example, we use the Mean Squared Error (MSE) loss function and the Adam optimizer with a learning rate of 0.001.
+```python
+# Instantiate loss and optimizer
+criterion = MSELoss()
+optimizer = Adam(network.parameters(), lr=0.001)
+```
+#### 5. Set up the descriptor
+The descriptor serves as a mapping between network layers and their semantic meanings. For this example, the network's two outputs are named ```Tmax``` (maximum temperature) and ```Tmin``` (minimum temperature), which correspond to specific columns in the dataset.
+```python
+# Descriptor setup
+descriptor = Descriptor()
+descriptor.add("output", 0, "Tmax", output=True)
+descriptor.add("output", 1, "Tmin", output=True)
+```
+#### 6. Define constraints on your network
+Constraints are rules applied to the network's behavior, ensuring its outputs meet specific criteria. Using the descriptor, constraints can be defined for named outputs. In this case, constraints enforce bounds (e.g., ```0 <= Tmin <= 1```) and relationships (```Tmax > Tmin```) on the outputs.
+```python
+# Constraints definition
+Constraint.descriptor = descriptor
+constraints = [
+    ScalarConstraint("Tmin", ge, 0),   # Tmin >= 0
+    ScalarConstraint("Tmin", le, 1),   # Tmin <= 1
+    ScalarConstraint("Tmax", ge, 0),   # Tmax >= 0
+    ScalarConstraint("Tmax", le, 1),   # Tmax <= 1
+    BinaryConstraint("Tmax", gt, "Tmin"),  # Tmax > Tmin
+]
+```
+#### 7. Set up trainer
+Metrics are used to evaluate and track the model's performance during training. A ```MetricManager``` is instantiated with a TensorBoard writer to log metrics and visualize training progress.
+```python
+# Initialize metrics
+writer = SummaryWriter()
+metric_manager = MetricManager(writer, device)
+```
+#### 8. Initialize and configure the core learner
+The core of the toolbox is the ```CongradsCore``` class, which integrates the descriptor, constraints, data loaders, network, loss function, optimizer, and metrics to manage the learning process.
+```python
+# Instantiate core
+core = CongradsCore(
+    descriptor,
+    constraints,
+    loaders,
+    network,
+    criterion,
+    optimizer,
+    metric_manager,
+    device,
+)
+```
+#### 9. Start training
+The ```fit``` method of the core class starts the training loop for the specified number of epochs. At the end of training, the TensorBoard writer is closed to finalize the logs.
+```python
+# Start training
+core.fit(max_epochs=150)
+# Close writer
+writer.close()
+```
+## Example Use Cases
+- **Optimization with Domain Knowledge**: Ensure outputs meet real-world restrictions or safety standards.
+- **Physics-Informed Neural Networks (PINNs)**: Enforce physical laws as constraints in your models.
+- **Improve Training Process**: Inject domain knowledge in the training stage, increasing learning efficiency.
+## Roadmap
+- [ ] Documentation and Notebook examples
+- [ ] Add support for constraint parser that can interpret equations
+- [x] Add better handling of metric logging and visualization
+- [x] Revise if Pytorch Lightning is preferable over plain Pytorch
+- [ ] Determine if it is feasible to add unit and or functional tests
+## Contributing
+We welcome contributions to Congrads! Whether you want to report issues, suggest features, or contribute code via issues and pull requests.
+## License
+Congrads is licensed under the [The 3-Clause BSD License](LICENSE). We encourage companies that are interested in a collaboration for a specific topic to contact the authors for more information or to set up joint research projects.
+---
+Elevate your neural networks with Congrads! 🚀

congrads-0.2.0.dist-info/RECORD ADDED Viewed

@@ -0,0 +1,13 @@
+congrads/__init__.py,sha256=XnRKk4VTheZJj6Z1f8x5Iq2YPtd2fycUgQazOqiqOEw,458
+congrads/constraints.py,sha256=JgO8SUhSKHoBH-WvFdwYnhkVl_jO-RqwgHCVOR1_F-8,13488
+congrads/core.py,sha256=egFph4MhKncwOi6pcTzDFGqh9bIxb7-z68899TxcEQM,7696
+congrads/datasets.py,sha256=uTDwnjwA52wwT6Hv4Kw0WqKi2dMDJE3nP-xKB6AjCNw,5470
+congrads/descriptor.py,sha256=FzmFZBHZ3nhd8NS951EJqcM97C-XsoRIA9qK6rmeBU4,1520
+congrads/metrics.py,sha256=ct4wj8q-GL3lYXxBeNCsCvCLn0TPBbs_8ybiMe-Fw5w,1471
+congrads/networks.py,sha256=QpuEgHmkXDCrTbonHoXbbRblZIdpYsopqMST--Ki9i4,3256
+congrads/utils.py,sha256=Z4ElFFreacRN7qPXh7Gv5lIzdAs5gtvVloJnHag2E9g,13890
+congrads-0.2.0.dist-info/LICENSE,sha256=hDkSuSj1L5IpO9uhrag5zd29HicibbYX8tUbY3RXF40,1480
+congrads-0.2.0.dist-info/METADATA,sha256=jaWNoJ4AeWB4aCL47Tw4OsXwOb-q5w4rwYKxSE76CKM,9804
+congrads-0.2.0.dist-info/WHEEL,sha256=PZUExdf71Ui_so67QXpySuHtCi3-J3wvF4ORK6k_S8U,91
+congrads-0.2.0.dist-info/top_level.txt,sha256=B8M9NmtHbmzp-3APHe4C0oo7aRIWRHWoba9FIy9XeYM,9
+congrads-0.2.0.dist-info/RECORD,,

congrads/learners.py DELETED Viewed

@@ -1,233 +0,0 @@
-import logging
-from typing import Union
-from torch import Tensor
-from torch.nn import Module
-from torch.nn.modules.loss import _Loss
-from torch.optim import Optimizer
-from .core import CGGDModule
-from .constraints import Constraint
-from .descriptor import Descriptor
-class Learner(CGGDModule):
-    def __init__(
-        self,
-        network: Module,
-        descriptor: Descriptor,
-        constraints: list[Constraint],
-        loss_function: Union[_Loss, dict[str, _Loss]],
-        optimizer: Optimizer,
-    ):
-        """
-        A class that integrates a neural network with a training and validation loop,
-        supporting single or multi-output loss functions. The class manages the forward pass,
-        training step, and validation step while also configuring the optimizer.
-        Args:
-            network (Module): The neural network model to be trained.
-            descriptor (Descriptor): An object that defines the structure of the network,
-                                    including the output layers.
-            constraints (list[Constraint]): A list of constraints that can be applied during training.
-            loss_function (Union[_Loss, dict[str, _Loss]]): A loss function or a dictionary of loss functions
-                                                        for each output layer.
-            optimizer (Optimizer): The optimizer used for training the model.
-        Raises:
-            ValueError: If the descriptor does not contain any output layers or if the number of loss functions
-                        does not match the number of output layers when using a dictionary of loss functions.
-        """
-        # Init parent class
-        super().__init__(descriptor, constraints)
-        # Init object variables
-        self.network = network
-        self.descriptor = descriptor
-        self.loss_function = loss_function
-        self.optimizer = optimizer
-        # Perform checks
-        if len(self.descriptor.output_layers) == 0:
-            raise ValueError(
-                'The descriptor class must contain one or more output layers. Mark a layer as output by setting descriptor.add("layer", ..., output=True).'
-            )
-        if isinstance(loss_function, _Loss):
-            if len(self.descriptor.output_layers) > 1:
-                logging.warning(
-                    f"Multiple layers were marked as output, but only one loss function is defined. Only the loss of layer {list(self.descriptor.output_layers)[0]} will be calculated and used. To use the same loss function for all output layers, please specify then explicitly."
-                )
-        if isinstance(loss_function, dict):
-            if len(self.descriptor.output_layers) != len(loss_function):
-                raise ValueError(
-                    f"The number of marked output layers does not match the number of provided loss functions."
-                )
-        # Assign proper step function based on if one or multiple loss functions are assigned
-        if isinstance(loss_function, _Loss):
-            self.training_step = self.training_step_single
-            self.validation_step = self.validation_step_single
-        if isinstance(loss_function, dict):
-            self.training_step = self.training_step_multi
-            self.validation_step = self.validation_step_multi
-    def forward(self, x):
-        """
-        Perform a forward pass through the network.
-        Args:
-            x (Tensor): The input tensor to pass through the network.
-        Returns:
-            Tensor: The model's output for the given input.
-        """
-        return self.network(x)
-    def training_step_single(self, batch, batch_idx):
-        """
-        Perform a single training step using a single loss function.
-        Args:
-            batch (tuple): A tuple containing the input and target output tensors.
-            batch_idx (int): The index of the batch in the current epoch.
-        Returns:
-            Tensor: The loss value for the batch.
-        """
-        self.train()
-        inputs, outputs = batch
-        prediction: dict[str, Tensor] = self(inputs)
-        layer = list(self.descriptor.output_layers)[0]
-        loss = self.loss_function(prediction[layer], outputs)
-        self.log(
-            "train_loss",
-            loss,
-            on_step=False,
-            on_epoch=True,
-        )
-        return super().training_step(prediction, loss)
-    def training_step_multi(self, batch, batch_idx):
-        """
-        Perform a training step using multiple loss functions, one for each output layer.
-        Args:
-            batch (tuple): A tuple containing the input and target output tensors.
-            batch_idx (int): The index of the batch in the current epoch.
-        Returns:
-            Tensor: The total loss value for the batch, combining the losses from all output layers.
-        """
-        self.train()
-        inputs, outputs = batch
-        prediction: dict[str, Tensor] = self(inputs)
-        # TODO add hyperparameter to scale loss per function
-        loss = 0
-        for layer in self.descriptor.output_layers:
-            layer_loss = self.loss_function[layer](prediction[layer], outputs)
-            loss += layer_loss
-            self.log(
-                f"train_loss_{layer}",
-                layer_loss,
-                on_step=False,
-                on_epoch=True,
-            )
-        self.log(
-            "train_loss",
-            loss,
-            on_step=False,
-            on_epoch=True,
-        )
-        return super().training_step(prediction, loss)
-    def validation_step_single(self, batch, batch_idx):
-        """
-        Perform a single validation step using a single loss function.
-        Args:
-            batch (tuple): A tuple containing the input and target output tensors.
-            batch_idx (int): The index of the batch in the current epoch.
-        Returns:
-            Tensor: The validation loss for the batch.
-        """
-        self.eval()
-        inputs, outputs = batch
-        prediction: dict[str, Tensor] = self(inputs)
-        layer = list(self.descriptor.output_layers)[0]
-        loss = self.loss_function(prediction[layer], outputs)
-        self.log(
-            "valid_loss",
-            loss,
-            on_step=False,
-            on_epoch=True,
-        )
-        return super().validation_step(prediction, loss)
-    def validation_step_multi(self, batch, batch_idx):
-        """
-        Perform a validation step using multiple loss functions, one for each output layer.
-        Args:
-            batch (tuple): A tuple containing the input and target output tensors.
-            batch_idx (int): The index of the batch in the current epoch.
-        Returns:
-            Tensor: The total validation loss for the batch, combining the losses from all output layers.
-        """
-        self.eval()
-        inputs, outputs = batch
-        prediction: dict[str, Tensor] = self(inputs)
-        loss = 0
-        for layer in self.descriptor.output_layers:
-            layer_loss = self.loss_function[layer](prediction[layer], outputs)
-            loss += layer_loss
-            self.log(
-                f"valid_loss_{layer}",
-                layer_loss,
-                on_step=False,
-                on_epoch=True,
-            )
-        self.log(
-            "valid_loss",
-            loss,
-            on_step=False,
-            on_epoch=True,
-        )
-        return super().validation_step(prediction, loss)
-    def configure_optimizers(self):
-        """
-        Configure the optimizer for training.
-        Returns:
-            Optimizer: The optimizer used to update the model's parameters during training.
-        """
-        return self.optimizer

congrads-0.1.0.dist-info/LICENSE DELETED Viewed

@@ -1,34 +0,0 @@
-MIT License
-Copyright (c) 2024 DTAI - KU Leuven
-Permission is hereby granted, free of charge, to any person obtaining a copy
-of this software and associated documentation files (the "Software"), to deal
-in the Software without restriction, including without limitation the rights
-to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
-copies of the Software, and to permit persons to whom the Software is
-furnished to do so, subject to the following conditions:
-The above copyright notice and this permission notice shall be included in all
-copies or substantial portions of the Software.
-THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
-IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
-FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
-AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
-LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
-OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
-SOFTWARE.
-"Commons Clause" License Condition v1.0
-The Software is provided to you by the Licensor under the License, as defined below, subject to the following condition.
-Without limiting other conditions in the License, the grant of rights under the License will not include, and the License does not grant to you, the right to Sell the Software.
-For purposes of the foregoing, "Sell" means practicing any or all of the rights granted to you under the License to provide to third parties, for a fee or other consideration (including without limitation fees for hosting or consulting/ support services related to the Software), a product or service whose value derives, entirely or substantially, from the functionality of the Software. Any license notice or attribution required by the License must also include this Commons Clause License Condition notice.
-Software: All CGGD-Toolbox associated files.
-License: MIT
-Licensor: DTAI - KU Leuven

congrads 0.1.0__py3-none-any.whl → 0.2.0__py3-none-any.whl

congrads 0.1.0py3-none-any.whl → 0.2.0py3-none-any.whl