active-vision 0.0.3__tar.gz → 0.0.4__tar.gz

@@ -1,6 +1,6 @@
  Metadata-Version: 2.2
  Name: active-vision
- Version: 0.0.3
+ Version: 0.0.4
  Summary: Active learning for edge vision.
  Requires-Python: >=3.10
  Description-Content-Type: text/markdown
@@ -12,6 +12,7 @@ Requires-Dist: ipykernel>=6.29.5
  Requires-Dist: ipywidgets>=8.1.5
  Requires-Dist: loguru>=0.7.3
  Requires-Dist: seaborn>=0.13.2
+ Requires-Dist: timm>=1.0.13
  
  ![Python Version](https://img.shields.io/badge/python-3.10%2B-blue?style=for-the-badge)
  ![License](https://img.shields.io/badge/License-Apache%202.0-green.svg?style=for-the-badge)
@@ -26,16 +27,13 @@ Active learning at the edge for computer vision.
  
  The goal of this project is to create a framework for the active learning loop for computer vision deployed on edge devices.
  
- ## Installation
- I recommend using [uv](https://docs.astral.sh/uv/) to set up a virtual environment and install the package. You can also use other virtual env of your choice.
+ Supported tasks:
+ - [X] Image classification
+ - [ ] Object detection
+ - [ ] Segmentation
  
- If you're using uv:
  
- ```bash
- uv venv
- uv sync
- ```
- Once the virtual environment is created, you can install the package using pip.
+ ## Installation
  
  Get a release from PyPI
  ```bash
@@ -49,6 +47,16 @@ cd active-vision
  pip install -e .
  ```
  
+ I recommend using [uv](https://docs.astral.sh/uv/) to set up a virtual environment and install the package. You can also use other virtual env of your choice.
+ 
+ If you're using uv:
+ 
+ ```bash
+ uv venv
+ uv sync
+ ```
+ Once the virtual environment is created, you can install the package using pip.
+ 
  > [!TIP]
  > If you're using uv add a uv before the pip install command to install into your virtual environment. Eg:
  > ```bash
@@ -59,9 +67,11 @@ pip install -e .
  See the [notebook](./nbs/04_relabel_loop.ipynb) for a complete example.
  
  Be sure to prepared 3 datasets:
- - train: A dataframe of an existing labeled training dataset.
- - unlabeled: A dataframe of unlabeled data which we will sample from using active learning.
- - eval: A dataframe of labeled data which we will use to evaluate the performance of the model. (Optional)
+ - [initial_samples](./nbs/initial_samples.parquet): A dataframe of an existing labeled training dataset to seed the training set.
+ - [unlabeled](./nbs/unlabeled_samples.parquet): A dataframe of unlabeled data which we will sample from using active learning.
+ - [eval](./nbs/evaluation_samples.parquet): A dataframe of labeled data which we will use to evaluate the performance of the model.
+ 
+ As a toy example I created the above 3 datasets from the imagenette dataset.
  
  ```python
  from active_vision import ActiveLearner
@@ -102,6 +112,13 @@ al.add_to_train_set(labeled_df, output_filename="active_labeled")
  
  Repeat the process until the model is good enough. Use the dataset to train a larger model and deploy.
  
+ > [!TIP]
+ > For the toy dataset, I got to about 93% accuracy on the evaluation set with 200+ labeled images. The best performing model on the [leaderboard](https://github.com/fastai/imagenette) got 95.11% accuracy training on all 9469 labeled images.
+ >
+ > This took me about 6 iterations of relabeling. Each iteration took about 5 minutes to complete including labeling and model training (resnet18). See the [notebook](./nbs/04_relabel_loop.ipynb) for more details.
+ >
+ > But using the dataset of 200+ images, I trained a more capable model (convnext_small_in22k) and got 99.3% accuracy on the evaluation set. See the [notebook](./nbs/05_retrain_larger.ipynb) for more details.
+ 
  ## Workflow
  There are two workflows for active learning at the edge that we can use depending on the availability of labeled data.
  
@@ -109,10 +126,10 @@ There are two workflows for active learning at the edge that we can use dependin
  If we have no labeled data, we can use active learning to iteratively improve the model and build a labeled dataset.
  
  1. Load a small proxy model.
- 2. Label an initial dataset.
+ 2. Label an initial dataset. If there is none, you'll have to label some images.
  3. Train the proxy model on the labeled dataset.
  4. Run inference on the unlabeled dataset.
- 5. Evaluate the performance of the proxy model on the unlabeled dataset.
+ 5. Evaluate the performance of the proxy model.
  6. Is model good enough?
     - Yes: Save the proxy model and the dataset.
     - No: Select the most informative images to label using active learning.
@@ -164,7 +181,7 @@ graph TD
  ```
  
  
- ## Methodology
+ <!-- ## Methodology
  To test out the workflows we will use the [imagenette dataset](https://huggingface.co/datasets/frgfm/imagenette). But this will be applicable to any dataset.
  
  Imagenette is a subset of the ImageNet dataset with 10 classes. We will use this dataset to test out the workflows. Additionally, Imagenette has an existing leaderboard which we can use to evaluate the performance of the models.
@@ -215,4 +232,4 @@ After the first iteration we got 94.57% accuracy on the validation set. See the
  > [!TIP]
  > | Train Epochs | Number of Images | Validation Accuracy | Source |
  > |--------------|-----------------|----------------------|------------------|
- > | 10 | 200 | 94.57% | First relabeling [notebook](./nbs/03_retrain_model.ipynb) |
+ > | 10 | 200 | 94.57% | First relabeling [notebook](./nbs/03_retrain_model.ipynb) | -->
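The only new runtime dependency in 0.0.4 is `timm`. As a rough illustration (this is not code from the package itself), `timm.create_model` is timm's model factory, which loads pretrained backbones like the `resnet18` proxy model and `convnext_small_in22k` that the README mentions; `num_classes=10` here matches the imagenette toy example:

```python
# Sketch of what the new timm>=1.0.13 dependency provides -- not
# active-vision code, just timm's public model factory.
import timm

# "resnet18" is the small proxy model the README trains with;
# 10 output classes matches the imagenette toy dataset.
model = timm.create_model("resnet18", pretrained=True, num_classes=10)
print(sum(p.numel() for p in model.parameters()))  # rough parameter count
```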
@@ -11,16 +11,13 @@ Active learning at the edge for computer vision.
  
  The goal of this project is to create a framework for the active learning loop for computer vision deployed on edge devices.
  
- ## Installation
- I recommend using [uv](https://docs.astral.sh/uv/) to set up a virtual environment and install the package. You can also use other virtual env of your choice.
+ Supported tasks:
+ - [X] Image classification
+ - [ ] Object detection
+ - [ ] Segmentation
  
- If you're using uv:
  
- ```bash
- uv venv
- uv sync
- ```
- Once the virtual environment is created, you can install the package using pip.
+ ## Installation
  
  Get a release from PyPI
  ```bash
@@ -34,6 +31,16 @@ cd active-vision
  pip install -e .
  ```
  
+ I recommend using [uv](https://docs.astral.sh/uv/) to set up a virtual environment and install the package. You can also use other virtual env of your choice.
+ 
+ If you're using uv:
+ 
+ ```bash
+ uv venv
+ uv sync
+ ```
+ Once the virtual environment is created, you can install the package using pip.
+ 
  > [!TIP]
  > If you're using uv add a uv before the pip install command to install into your virtual environment. Eg:
  > ```bash
@@ -44,9 +51,11 @@ pip install -e .
  See the [notebook](./nbs/04_relabel_loop.ipynb) for a complete example.
  
  Be sure to prepared 3 datasets:
- - train: A dataframe of an existing labeled training dataset.
- - unlabeled: A dataframe of unlabeled data which we will sample from using active learning.
- - eval: A dataframe of labeled data which we will use to evaluate the performance of the model. (Optional)
+ - [initial_samples](./nbs/initial_samples.parquet): A dataframe of an existing labeled training dataset to seed the training set.
+ - [unlabeled](./nbs/unlabeled_samples.parquet): A dataframe of unlabeled data which we will sample from using active learning.
+ - [eval](./nbs/evaluation_samples.parquet): A dataframe of labeled data which we will use to evaluate the performance of the model.
+ 
+ As a toy example I created the above 3 datasets from the imagenette dataset.
  
  ```python
  from active_vision import ActiveLearner
@@ -87,6 +96,13 @@ al.add_to_train_set(labeled_df, output_filename="active_labeled")
  
  Repeat the process until the model is good enough. Use the dataset to train a larger model and deploy.
  
+ > [!TIP]
+ > For the toy dataset, I got to about 93% accuracy on the evaluation set with 200+ labeled images. The best performing model on the [leaderboard](https://github.com/fastai/imagenette) got 95.11% accuracy training on all 9469 labeled images.
+ >
+ > This took me about 6 iterations of relabeling. Each iteration took about 5 minutes to complete including labeling and model training (resnet18). See the [notebook](./nbs/04_relabel_loop.ipynb) for more details.
+ >
+ > But using the dataset of 200+ images, I trained a more capable model (convnext_small_in22k) and got 99.3% accuracy on the evaluation set. See the [notebook](./nbs/05_retrain_larger.ipynb) for more details.
+ 
  ## Workflow
  There are two workflows for active learning at the edge that we can use depending on the availability of labeled data.
  
@@ -94,10 +110,10 @@ There are two workflows for active learning at the edge that we can use dependin
  If we have no labeled data, we can use active learning to iteratively improve the model and build a labeled dataset.
  
  1. Load a small proxy model.
- 2. Label an initial dataset.
+ 2. Label an initial dataset. If there is none, you'll have to label some images.
  3. Train the proxy model on the labeled dataset.
  4. Run inference on the unlabeled dataset.
- 5. Evaluate the performance of the proxy model on the unlabeled dataset.
+ 5. Evaluate the performance of the proxy model.
  6. Is model good enough?
     - Yes: Save the proxy model and the dataset.
     - No: Select the most informative images to label using active learning.
@@ -149,7 +165,7 @@ graph TD
  ```
  
  
- ## Methodology
+ <!-- ## Methodology
  To test out the workflows we will use the [imagenette dataset](https://huggingface.co/datasets/frgfm/imagenette). But this will be applicable to any dataset.
  
  Imagenette is a subset of the ImageNet dataset with 10 classes. We will use this dataset to test out the workflows. Additionally, Imagenette has an existing leaderboard which we can use to evaluate the performance of the models.
@@ -200,4 +216,4 @@ After the first iteration we got 94.57% accuracy on the validation set. See the
  > [!TIP]
  > | Train Epochs | Number of Images | Validation Accuracy | Source |
  > |--------------|-----------------|----------------------|------------------|
- > | 10 | 200 | 94.57% | First relabeling [notebook](./nbs/03_retrain_model.ipynb) |
+ > | 10 | 200 | 94.57% | First relabeling [notebook](./nbs/03_retrain_model.ipynb) | -->
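The README's first workflow ends with "Select the most informative images to label using active learning," but this diff doesn't show which sampling strategy `active_vision` uses internally. The sketch below illustrates one common criterion, predictive entropy, under stated assumptions: `probs` holds softmax outputs from the proxy model's inference pass over the unlabeled set (step 4), and the `filepath` column name mirrors the parquet datasets above but is an assumption:

```python
# Hedged sketch: entropy-based uncertainty sampling, a common way to pick
# the "most informative" images. Not necessarily the package's own method.
import numpy as np
import pandas as pd

def top_k_uncertain(probs: np.ndarray, filepaths: pd.Series, k: int = 10) -> pd.Series:
    """Rank unlabeled images by entropy of the predicted class distribution.

    probs: (n_images, n_classes) softmax outputs from the proxy model.
    Returns the filepaths of the k highest-entropy (most uncertain) images.
    """
    entropy = -(probs * np.log(probs + 1e-12)).sum(axis=1)
    return filepaths.iloc[np.argsort(-entropy)[:k]]
```

High-entropy predictions are the ones the model is least sure about, so labeling them first tends to improve the model fastest per labeled image, which is the whole premise of the relabeling loop.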
@@ -1,6 +1,6 @@
  [project]
  name = "active-vision"
- version = "0.0.3"
+ version = "0.0.4"
  description = "Active learning for edge vision."
  readme = "README.md"
  requires-python = ">=3.10"
@@ -12,4 +12,5 @@ dependencies = [
  "ipywidgets>=8.1.5",
  "loguru>=0.7.3",
  "seaborn>=0.13.2",
- ]
+ "timm>=1.0.13",
+ ]
@@ -0,0 +1,3 @@
+ __version__ = "0.0.4"
+ 
+ from .core import *
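The new `__init__.py` above pins the package version and re-exports the core module, which is what makes the README's top-level import work. A quick sanity check, assuming the 0.0.4 release is installed:

```python
import active_vision

print(active_vision.__version__)  # "0.0.4"

# `from .core import *` re-exports the core API at the package root,
# so the class shown in the README is importable directly:
from active_vision import ActiveLearner
```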
@@ -5,3 +5,4 @@ ipykernel>=6.29.5
  ipywidgets>=8.1.5
  loguru>=0.7.3
  seaborn>=0.13.2
+ timm>=1.0.13
@@ -1,3 +0,0 @@
- __version__ = "0.0.3"
- 
- from .core import *