PyPI - gbrl - Versions diffs - 1.0.0.dev4__tar.gz → 1.0.0.dev5__tar.gz - Mend

gbrl 1.0.0.dev4tar.gz → 1.0.0.dev5tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (59) hide show

gbrl-1.0.0.dev5/PKG-INFO ADDED Viewed

@@ -0,0 +1,120 @@
+Metadata-Version: 2.1
+Name: gbrl
+Version: 1.0.0.dev5
+Summary: Gradient Boosted Trees for RL
+Author-email: Benjamin Fuhrer <bfuhrer@nvidia.com>, Chen Tessler <ctessler@nvidia.com>, Gal Dalal <galal@nvidia.com>
+Classifier: Development Status :: 4 - Beta
+Classifier: Intended Audience :: Developers
+Classifier: Intended Audience :: Science/Research
+Classifier: License :: Other/Proprietary License
+Classifier: Programming Language :: Python :: 3.9
+Classifier: Programming Language :: Python :: 3.10
+Classifier: Programming Language :: Python :: 3.11
+Classifier: Topic :: Software Development :: Libraries :: Python Modules
+Requires-Python: >=3.9
+Description-Content-Type: text/markdown
+License-File: LICENSE
+Requires-Dist: pybind11==2.11.1
+Requires-Dist: numpy
+Requires-Dist: torch
+Provides-Extra: sklearn
+Requires-Dist: scikit_learn==1.2.2; extra == "sklearn"
+# Gradient Boosting Reinforcement Learning (GBRL)
+GBRL is a Python-based GBT library designed and optimized for reinforcement learning (RL). GBRL is implemented in C++/CUDA aimed to seamlessly integrate within popular RL libraries.
+### Key Features:
+- GBTs Tailored for RL: GBRL adapts the power of Gradient Boosting Trees to the unique challenges of RL environments, including non-stationarity and delayed feedback.
+- Optimized Actor-Critic Architecture: GBRL features a shared tree-based structure for policy and value functions. This significantly reduces memory and computational overhead, enabling it to tackle complex, high-dimensional RL problems.
+- Hardware Acceleration: GBRL leverages CUDA for hardware-accelerated computation, ensuring efficiency and speed.
+- Seamless Integration: GBRL is designed for easy integration with popular RL libraries, making it readily accessible for practitioners.
+## Getting started
+### Dependencies
+llvm
+openmp
+#### MAC OS
+Make sure to run:
+```
+brew install libomp
+brew install llvm
+ ```
+xcode command line tools should be installed installed
+### Installation
+```
+pip install gbrl
+```
+For GPU support GBRL looks for `CUDA_PATH` or `CUDA_HOME` environment variables. Unless found, GBRL will automatically compile only for CPU.
+Verify that GPU is visible by running
+```
+import gbrl
+gbrl.cuda_available()
+```
+*OPTIONAL*
+For tree visualization make sure graphviz is installed before compilation.
+## Usage Example in an RL library
+```
+import torch as th
+from gbrl import ActorCriticGBRL
+from stable_baselines3.common.distributions import CategoricalDistribution
+### initialize model ###
+# define tree structure parameters
+tree_struct = {'max_depth': 4, 'min_data_in_leaf': 0, 'n_bins': 256, 'grow_policy': 'oblivious'}
+# define gbrl parameters
+gbrl_params = {'control_variates': False, 'split_score_func': 'cosine', 'generator_type': 'Quantile'}
+# actor and critic optimizers
+policy_optimizer = {'algo': 'SGD', 'lr': 0.05}
+value_optimizer = {'algo': 'SGD', lr: '0.25'}
+# Given action_space instance of gym.spaces.Space
+model = ActorCriticGBRL(tree_struct=tree_struct, output_dim=action_space.n, pg_optimizer=policy_optimizer, value_optimizer=value_optimizer, shared_tree_struct=True, gbrl_params=gbrl_params, device='cuda')
+model.init_model()
+action_dist = CategoricalDistribution(action_space.n)
+### Training loop ###
+logits, values = model(obs, requires_grad=True) # returns numpy arrays
+distribution = action_dist.proba_distribution(logits)
+log_prob = distribution.log_prob(actions)
+loss = ... calculate loss according to RL algorithm
+loss.backward()
+self.policy.step(obs)
+```
+## Current Supported Features
+### Tree Fitting
+- Greedy (Depth-wise) tree building - (CPU/GPU)
+- Oblivious (Symmetric) tree building - (CPU/GPU)
+- L2 split score - (CPU/GPU)
+- Cosine split score - (CPU/GPU)
+- Uniform based candidate generation - (CPU/GPU)
+- Quantile based candidate generation - (CPU/GPU)
+- Supervised learning fitting / Multi-iteration fitting - (CPU/GPU)
+    - MultiRMSE loss (only)
+- Categorical Inputs
+### GBT Inference
+- SGD optimizer - (CPU/GPU)
+- ADAM optimizer - (CPU only)
+- Control Variates (gradient variance reduction technique) - (CPU only)
+- Shared Tree for policy and value function - (CPU/GPU)
+- Linear and constant learning rate scheduler - (CPU/GPU only constant)
+- Support for up to two different optimizers (e.g, policy/value) - **(CPU/GPU if both are SGD)
+## Citing

gbrl-1.0.0.dev5/gbrl.egg-info/PKG-INFO ADDED Viewed

@@ -0,0 +1,120 @@
+Metadata-Version: 2.1
+Name: gbrl
+Version: 1.0.0.dev5
+Summary: Gradient Boosted Trees for RL
+Author-email: Benjamin Fuhrer <bfuhrer@nvidia.com>, Chen Tessler <ctessler@nvidia.com>, Gal Dalal <galal@nvidia.com>
+Classifier: Development Status :: 4 - Beta
+Classifier: Intended Audience :: Developers
+Classifier: Intended Audience :: Science/Research
+Classifier: License :: Other/Proprietary License
+Classifier: Programming Language :: Python :: 3.9
+Classifier: Programming Language :: Python :: 3.10
+Classifier: Programming Language :: Python :: 3.11
+Classifier: Topic :: Software Development :: Libraries :: Python Modules
+Requires-Python: >=3.9
+Description-Content-Type: text/markdown
+License-File: LICENSE
+Requires-Dist: pybind11==2.11.1
+Requires-Dist: numpy
+Requires-Dist: torch
+Provides-Extra: sklearn
+Requires-Dist: scikit_learn==1.2.2; extra == "sklearn"
+# Gradient Boosting Reinforcement Learning (GBRL)
+GBRL is a Python-based GBT library designed and optimized for reinforcement learning (RL). GBRL is implemented in C++/CUDA aimed to seamlessly integrate within popular RL libraries.
+### Key Features:
+- GBTs Tailored for RL: GBRL adapts the power of Gradient Boosting Trees to the unique challenges of RL environments, including non-stationarity and delayed feedback.
+- Optimized Actor-Critic Architecture: GBRL features a shared tree-based structure for policy and value functions. This significantly reduces memory and computational overhead, enabling it to tackle complex, high-dimensional RL problems.
+- Hardware Acceleration: GBRL leverages CUDA for hardware-accelerated computation, ensuring efficiency and speed.
+- Seamless Integration: GBRL is designed for easy integration with popular RL libraries, making it readily accessible for practitioners.
+## Getting started
+### Dependencies
+llvm
+openmp
+#### MAC OS
+Make sure to run:
+```
+brew install libomp
+brew install llvm
+ ```
+xcode command line tools should be installed installed
+### Installation
+```
+pip install gbrl
+```
+For GPU support GBRL looks for `CUDA_PATH` or `CUDA_HOME` environment variables. Unless found, GBRL will automatically compile only for CPU.
+Verify that GPU is visible by running
+```
+import gbrl
+gbrl.cuda_available()
+```
+*OPTIONAL*
+For tree visualization make sure graphviz is installed before compilation.
+## Usage Example in an RL library
+```
+import torch as th
+from gbrl import ActorCriticGBRL
+from stable_baselines3.common.distributions import CategoricalDistribution
+### initialize model ###
+# define tree structure parameters
+tree_struct = {'max_depth': 4, 'min_data_in_leaf': 0, 'n_bins': 256, 'grow_policy': 'oblivious'}
+# define gbrl parameters
+gbrl_params = {'control_variates': False, 'split_score_func': 'cosine', 'generator_type': 'Quantile'}
+# actor and critic optimizers
+policy_optimizer = {'algo': 'SGD', 'lr': 0.05}
+value_optimizer = {'algo': 'SGD', lr: '0.25'}
+# Given action_space instance of gym.spaces.Space
+model = ActorCriticGBRL(tree_struct=tree_struct, output_dim=action_space.n, pg_optimizer=policy_optimizer, value_optimizer=value_optimizer, shared_tree_struct=True, gbrl_params=gbrl_params, device='cuda')
+model.init_model()
+action_dist = CategoricalDistribution(action_space.n)
+### Training loop ###
+logits, values = model(obs, requires_grad=True) # returns numpy arrays
+distribution = action_dist.proba_distribution(logits)
+log_prob = distribution.log_prob(actions)
+loss = ... calculate loss according to RL algorithm
+loss.backward()
+self.policy.step(obs)
+```
+## Current Supported Features
+### Tree Fitting
+- Greedy (Depth-wise) tree building - (CPU/GPU)
+- Oblivious (Symmetric) tree building - (CPU/GPU)
+- L2 split score - (CPU/GPU)
+- Cosine split score - (CPU/GPU)
+- Uniform based candidate generation - (CPU/GPU)
+- Quantile based candidate generation - (CPU/GPU)
+- Supervised learning fitting / Multi-iteration fitting - (CPU/GPU)
+    - MultiRMSE loss (only)
+- Categorical Inputs
+### GBT Inference
+- SGD optimizer - (CPU/GPU)
+- ADAM optimizer - (CPU only)
+- Control Variates (gradient variance reduction technique) - (CPU only)
+- Shared Tree for policy and value function - (CPU/GPU)
+- Linear and constant learning rate scheduler - (CPU/GPU only constant)
+- Support for up to two different optimizers (e.g, policy/value) - **(CPU/GPU if both are SGD)
+## Citing

{gbrl-1.0.0.dev4 → gbrl-1.0.0.dev5}/pyproject.toml RENAMED Viewed

@@ -8,8 +8,9 @@ build-backend = "setuptools.build_meta"
 [project]
 name = "gbrl"
-version = "1.0.0.dev4"
+version = "1.0.0.dev5"
 description = "Gradient Boosted Trees for RL"
+readme = { file = "README.md", content-type = "text/markdown" }
 authors = [
     {name = "Benjamin Fuhrer", email = "bfuhrer@nvidia.com"},
     {name = "Chen Tessler", email = "ctessler@nvidia.com"},
@@ -20,6 +21,18 @@ dependencies = [
     "numpy",
     "torch"
 ]
+classifiers = [
+    "Development Status :: 4 - Beta",
+    "Intended Audience :: Developers",
+    "Intended Audience :: Science/Research",
+    "License :: Other/Proprietary License",
+    "Programming Language :: Python :: 3.9",
+    "Programming Language :: Python :: 3.10",
+    "Programming Language :: Python :: 3.11",
+    "Topic :: Software Development :: Libraries :: Python Modules"
+]
+requires-python = ">=3.9"
 [project.optional-dependencies]
 sklearn = ["scikit_learn==1.2.2"]

gbrl-1.0.0.dev4/PKG-INFO DELETED Viewed

@@ -1,11 +0,0 @@
-Metadata-Version: 2.1
-Name: gbrl
-Version: 1.0.0.dev4
-Summary: Gradient Boosted Trees for RL
-Author-email: Benjamin Fuhrer <bfuhrer@nvidia.com>, Chen Tessler <ctessler@nvidia.com>, Gal Dalal <galal@nvidia.com>
-License-File: LICENSE
-Requires-Dist: pybind11==2.11.1
-Requires-Dist: numpy
-Requires-Dist: torch
-Provides-Extra: sklearn
-Requires-Dist: scikit_learn==1.2.2; extra == "sklearn"

gbrl-1.0.0.dev4/gbrl.egg-info/PKG-INFO DELETED Viewed

@@ -1,11 +0,0 @@
-Metadata-Version: 2.1
-Name: gbrl
-Version: 1.0.0.dev4
-Summary: Gradient Boosted Trees for RL
-Author-email: Benjamin Fuhrer <bfuhrer@nvidia.com>, Chen Tessler <ctessler@nvidia.com>, Gal Dalal <galal@nvidia.com>
-License-File: LICENSE
-Requires-Dist: pybind11==2.11.1
-Requires-Dist: numpy
-Requires-Dist: torch
-Provides-Extra: sklearn
-Requires-Dist: scikit_learn==1.2.2; extra == "sklearn"