blink-gpu 0.1.0__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- blink_gpu-0.1.0/PKG-INFO +315 -0
- blink_gpu-0.1.0/README.md +258 -0
- blink_gpu-0.1.0/blink/__init__.py +30 -0
- blink_gpu-0.1.0/blink/__main__.py +117 -0
- blink_gpu-0.1.0/blink/_analyzer.py +98 -0
- blink_gpu-0.1.0/blink/_predictor.py +164 -0
- blink_gpu-0.1.0/blink/_version.py +1 -0
- blink_gpu-0.1.0/blink/py.typed +1 -0
- blink_gpu-0.1.0/blink_gpu.egg-info/PKG-INFO +315 -0
- blink_gpu-0.1.0/blink_gpu.egg-info/SOURCES.txt +18 -0
- blink_gpu-0.1.0/blink_gpu.egg-info/dependency_links.txt +1 -0
- blink_gpu-0.1.0/blink_gpu.egg-info/entry_points.txt +3 -0
- blink_gpu-0.1.0/blink_gpu.egg-info/requires.txt +39 -0
- blink_gpu-0.1.0/blink_gpu.egg-info/top_level.txt +1 -0
- blink_gpu-0.1.0/pyproject.toml +127 -0
- blink_gpu-0.1.0/setup.cfg +4 -0
- blink_gpu-0.1.0/tests/test_diverse_models.py +190 -0
- blink_gpu-0.1.0/tests/test_gnn_scaling.py +41 -0
- blink_gpu-0.1.0/tests/test_predictors.py +59 -0
- blink_gpu-0.1.0/tests/test_profiler.py +42 -0
blink_gpu-0.1.0/PKG-INFO
ADDED
|
@@ -0,0 +1,315 @@
|
|
|
1
|
+
Metadata-Version: 2.4
|
|
2
|
+
Name: blink-gpu
|
|
3
|
+
Version: 0.1.0
|
|
4
|
+
Summary: Predict GPU execution time & memory for PyTorch models — without running them.
|
|
5
|
+
Author-email: Aniket Mishra <aniket@blink-gpu.dev>
|
|
6
|
+
License: MIT
|
|
7
|
+
Project-URL: Homepage, https://github.com/Aniketxmishra/Blink_Main
|
|
8
|
+
Project-URL: Documentation, https://github.com/Aniketxmishra/Blink_Main#readme
|
|
9
|
+
Project-URL: Repository, https://github.com/Aniketxmishra/Blink_Main.git
|
|
10
|
+
Project-URL: Bug Tracker, https://github.com/Aniketxmishra/Blink_Main/issues
|
|
11
|
+
Keywords: gpu,performance,prediction,pytorch,neural-network,profiling,machine-learning,explainability
|
|
12
|
+
Classifier: Development Status :: 3 - Alpha
|
|
13
|
+
Classifier: Intended Audience :: Science/Research
|
|
14
|
+
Classifier: Intended Audience :: Developers
|
|
15
|
+
Classifier: License :: OSI Approved :: MIT License
|
|
16
|
+
Classifier: Programming Language :: Python :: 3
|
|
17
|
+
Classifier: Programming Language :: Python :: 3.9
|
|
18
|
+
Classifier: Programming Language :: Python :: 3.10
|
|
19
|
+
Classifier: Programming Language :: Python :: 3.11
|
|
20
|
+
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
|
|
21
|
+
Classifier: Topic :: System :: Hardware
|
|
22
|
+
Requires-Python: >=3.9
|
|
23
|
+
Description-Content-Type: text/markdown
|
|
24
|
+
Requires-Dist: numpy>=1.24
|
|
25
|
+
Requires-Dist: pandas>=2.0
|
|
26
|
+
Requires-Dist: scikit-learn>=1.3
|
|
27
|
+
Requires-Dist: xgboost>=2.0
|
|
28
|
+
Requires-Dist: joblib>=1.3
|
|
29
|
+
Requires-Dist: thop>=0.1.1
|
|
30
|
+
Provides-Extra: full
|
|
31
|
+
Requires-Dist: optuna; extra == "full"
|
|
32
|
+
Requires-Dist: lightgbm; extra == "full"
|
|
33
|
+
Requires-Dist: pynvml; extra == "full"
|
|
34
|
+
Requires-Dist: shap>=0.44; extra == "full"
|
|
35
|
+
Requires-Dist: streamlit>=1.30; extra == "full"
|
|
36
|
+
Requires-Dist: plotly>=5.18; extra == "full"
|
|
37
|
+
Requires-Dist: matplotlib>=3.8; extra == "full"
|
|
38
|
+
Requires-Dist: seaborn>=0.13; extra == "full"
|
|
39
|
+
Provides-Extra: api
|
|
40
|
+
Requires-Dist: fastapi>=0.110; extra == "api"
|
|
41
|
+
Requires-Dist: uvicorn[standard]>=0.27; extra == "api"
|
|
42
|
+
Requires-Dist: python-multipart; extra == "api"
|
|
43
|
+
Requires-Dist: httpx; extra == "api"
|
|
44
|
+
Provides-Extra: gnn
|
|
45
|
+
Requires-Dist: torch-geometric; extra == "gnn"
|
|
46
|
+
Provides-Extra: explain
|
|
47
|
+
Requires-Dist: shap>=0.44; extra == "explain"
|
|
48
|
+
Provides-Extra: dev
|
|
49
|
+
Requires-Dist: pytest>=7.4; extra == "dev"
|
|
50
|
+
Requires-Dist: pytest-cov; extra == "dev"
|
|
51
|
+
Requires-Dist: ruff; extra == "dev"
|
|
52
|
+
Requires-Dist: black; extra == "dev"
|
|
53
|
+
Requires-Dist: mypy; extra == "dev"
|
|
54
|
+
Requires-Dist: pre-commit; extra == "dev"
|
|
55
|
+
Provides-Extra: all
|
|
56
|
+
Requires-Dist: blink-gpu[api,dev,explain,full,gnn]; extra == "all"
|
|
57
|
+
|
|
58
|
+
# Blink
|
|
59
|
+
> **GPU Performance Predictor for Deep Learning Models**
|
|
60
|
+
|
|
61
|
+
Blink predicts **execution time** and **memory usage** of PyTorch neural networks on GPU without actually running them. It combines classical ML (XGBoost, Random Forest) with a Graph Neural Network (GNN) that encodes the computational graph of any model architecture.
|
|
62
|
+
|
|
63
|
+
---
|
|
64
|
+
|
|
65
|
+
## ๐ Table of Contents
|
|
66
|
+
- [Overview](#overview)
|
|
67
|
+
- [Architecture](#architecture)
|
|
68
|
+
- [Project Structure](#project-structure)
|
|
69
|
+
- [Installation](#installation)
|
|
70
|
+
- [Usage](#usage)
|
|
71
|
+
- [Data Pipeline](#data-pipeline)
|
|
72
|
+
- [Model Performance](#model-performance)
|
|
73
|
+
- [Dashboard](#dashboard)
|
|
74
|
+
- [Paper Reproducibility](#paper-reproducibility)
|
|
75
|
+
|
|
76
|
+
---
|
|
77
|
+
|
|
78
|
+
## Overview
|
|
79
|
+
|
|
80
|
+
Given a PyTorch model and a batch size, Blink answers:
|
|
81
|
+
- *How long will a forward pass take on this GPU?*
|
|
82
|
+
- *How much GPU memory will it consume?*
|
|
83
|
+
|
|
84
|
+
This is useful for:
|
|
85
|
+
- **Batch size optimization** before deployment
|
|
86
|
+
- **Hardware cost estimation** for training runs
|
|
87
|
+
- **NAS (Neural Architecture Search)** — filtering architectures by predicted cost
|
|
88
|
+
|
|
89
|
+
---
|
|
90
|
+
|
|
91
|
+
## Architecture
|
|
92
|
+
|
|
93
|
+
```
|
|
94
|
+
PyTorch Model
|
|
95
|
+
โ
|
|
96
|
+
โผ
|
|
97
|
+
โโโโโโโโโโโโโโโโโโโโโโโ
|
|
98
|
+
โ Feature Extractor โ โ layer counts, FLOPs, params, depth, width, skip connections
|
|
99
|
+
โ + GNN Extractor โ โ graph-based architecture encoding (ArchitectureGNN)
|
|
100
|
+
โโโโโโโโโโโฌโโโโโโโโโโโโ
|
|
101
|
+
โ
|
|
102
|
+
โผ
|
|
103
|
+
โโโโโโโโโโโโโโโโโโโโโโโ
|
|
104
|
+
โ Prediction Models โ
|
|
105
|
+
โ โโโโโโโโโโโโโโโโโ โ
|
|
106
|
+
โ ยท XGBoost (tuned) โ โ main predictor (best MAPE)
|
|
107
|
+
โ ยท Random Forest โ โ ensemble comparison
|
|
108
|
+
โ ยท GNN Predictor โ โ graph-native, generalizes across architectures
|
|
109
|
+
โ ยท Linear / Ridge โ โ baselines
|
|
110
|
+
โโโโโโโโโโโฌโโโโโโโโโโโโ
|
|
111
|
+
โ
|
|
112
|
+
โผ
|
|
113
|
+
Predicted: execution_time_ms, memory_mb
|
|
114
|
+
+ Uncertainty bounds (lower / upper)
|
|
115
|
+
```
|
|
116
|
+
|
|
117
|
+
---
|
|
118
|
+
|
|
119
|
+
## Project Structure
|
|
120
|
+
|
|
121
|
+
```
|
|
122
|
+
Blink/
|
|
123
|
+
โโโ dashboard.py # ๐ฅ๏ธ Main Streamlit web app (run this)
|
|
124
|
+
โโโ prediction_api.py # ๐ Flask REST API
|
|
125
|
+
โ
|
|
126
|
+
โโโ โโ Core ML Modules โโ
|
|
127
|
+
โ โโโ model_profiler.py # GPU profiler (CUDA events)
|
|
128
|
+
โ โโโ feature_extractor.py # Static feature extraction from nn.Module
|
|
129
|
+
โ โโโ gnn_extractor.py # GNN-based graph feature extraction
|
|
130
|
+
โ โโโ gnn_model.py # ArchitectureGNN model definition (PyG)
|
|
131
|
+
โ โโโ prediction_model.py # Train XGBoost / RF / Linear models
|
|
132
|
+
โ โโโ train_gnn.py # Train the GNN predictor
|
|
133
|
+
โ โโโ train_memory_model.py# Train memory prediction model
|
|
134
|
+
โ โโโ gpu_predictor.py # Inference class with caching & batch support
|
|
135
|
+
โ โโโ model_analyser.py # Model complexity analysis utilities
|
|
136
|
+
โ โโโ advanced_features.py # Extended feature engineering
|
|
137
|
+
โ โโโ dynamic_predictor.py # Dynamic / online prediction
|
|
138
|
+
โ โโโ gpu_info.py # GPU metadata (pynvml)
|
|
139
|
+
โ โโโ workload_scheduler.py# Batch workload scheduler
|
|
140
|
+
โ โโโ performance_monitor.py
|
|
141
|
+
โ
|
|
142
|
+
โโโ scripts/ # ๐ฌ Experiment & data scripts
|
|
143
|
+
โ โโโ collect_data.py # Profile CNN/Transformer/custom models โ data/raw/
|
|
144
|
+
โ โโโ enhance_dataset.py # Augment dataset (more batch sizes / models)
|
|
145
|
+
โ โโโ diverse_architectures.py # Profile diverse arch families
|
|
146
|
+
โ โโโ ablation_study.py # 5-condition ablation (Table II in paper)
|
|
147
|
+
โ โโโ generate_paper_figures.py # Reproduce all paper figures
|
|
148
|
+
โ โโโ generate_paper_tables.py # Reproduce paper tables
|
|
149
|
+
โ
|
|
150
|
+
โโโ tests/ # โ
Test suite
|
|
151
|
+
โ โโโ test_diverse_models.py
|
|
152
|
+
โ โโโ test_predictors.py
|
|
153
|
+
โ โโโ test_profiler.py
|
|
154
|
+
โ โโโ test_gnn_scaling.py
|
|
155
|
+
โ โโโ evaluate_gnn_vs_xgb.py
|
|
156
|
+
โ
|
|
157
|
+
โโโ data/
|
|
158
|
+
โ โโโ raw/ # Raw profiling CSVs (gitignored)
|
|
159
|
+
โ โโโ processed/ # Feature-engineered CSVs
|
|
160
|
+
โ โโโ enriched/ # Final training-ready dataset
|
|
161
|
+
โ โโโ feedback_log.csv # Online feedback loop log
|
|
162
|
+
โ
|
|
163
|
+
โโโ models/ # Serialized model artifacts (gitignored)
|
|
164
|
+
โ โโโ xgboost_(tuned)_model.joblib
|
|
165
|
+
โ โโโ random_forest_model.joblib
|
|
166
|
+
โ โโโ gnn_predictor.pth
|
|
167
|
+
โ โโโ memory_model.joblib
|
|
168
|
+
โ โโโ ...
|
|
169
|
+
โ
|
|
170
|
+
โโโ results/
|
|
171
|
+
โ โโโ figures/ # Paper figures (PNG)
|
|
172
|
+
โ โโโ ablation_study_table.csv
|
|
173
|
+
โ โโโ gnn_scaling_table.csv
|
|
174
|
+
โ โโโ ...
|
|
175
|
+
โ
|
|
176
|
+
โโโ templates/index.html # HTML template for web interface
|
|
177
|
+
โโโ legacy/ # Archived / superseded scripts
|
|
178
|
+
โโโ requirements.txt
|
|
179
|
+
โโโ .gitignore
|
|
180
|
+
```
|
|
181
|
+
|
|
182
|
+
---
|
|
183
|
+
|
|
184
|
+
## Installation
|
|
185
|
+
|
|
186
|
+
```bash
|
|
187
|
+
# 1. Clone the repo
|
|
188
|
+
git clone <your-repo-url>
|
|
189
|
+
cd Blink
|
|
190
|
+
|
|
191
|
+
# 2. Create a virtual environment
|
|
192
|
+
python -m venv venv
|
|
193
|
+
venv\Scripts\activate # Windows
|
|
194
|
+
# source venv/bin/activate # Linux/macOS
|
|
195
|
+
|
|
196
|
+
# 3. Install dependencies
|
|
197
|
+
pip install -r requirements.txt
|
|
198
|
+
|
|
199
|
+
# 4. Install PyTorch Geometric (match your CUDA version)
|
|
200
|
+
# See: https://pytorch-geometric.readthedocs.io/en/latest/install/installation.html
|
|
201
|
+
pip install torch-geometric
|
|
202
|
+
```
|
|
203
|
+
|
|
204
|
+
**Requirements:** NVIDIA GPU with CUDA, Python ≥ 3.9
|
|
205
|
+
|
|
206
|
+
---
|
|
207
|
+
|
|
208
|
+
## Usage
|
|
209
|
+
|
|
210
|
+
### 1. Launch the Dashboard
|
|
211
|
+
```bash
|
|
212
|
+
streamlit run dashboard.py
|
|
213
|
+
```
|
|
214
|
+
Features: live model prediction, batch size optimizer, model comparison, performance monitor.
|
|
215
|
+
|
|
216
|
+
### 2. Collect Profiling Data
|
|
217
|
+
```bash
|
|
218
|
+
python scripts/collect_data.py --batch-sizes 1 4 16 32 64
|
|
219
|
+
```
|
|
220
|
+
|
|
221
|
+
### 3. Train Prediction Models
|
|
222
|
+
```bash
|
|
223
|
+
# Train XGBoost / RF / Linear baseline models
|
|
224
|
+
python prediction_model.py
|
|
225
|
+
|
|
226
|
+
# Train GNN predictor
|
|
227
|
+
python train_gnn.py
|
|
228
|
+
|
|
229
|
+
# Train memory model
|
|
230
|
+
python train_memory_model.py
|
|
231
|
+
```
|
|
232
|
+
|
|
233
|
+
### 4. Run Ablation Study
|
|
234
|
+
```bash
|
|
235
|
+
python scripts/ablation_study.py
|
|
236
|
+
```
|
|
237
|
+
|
|
238
|
+
### 5. Predict via Python API
|
|
239
|
+
```python
|
|
240
|
+
from gpu_predictor import GPUPredictor
|
|
241
|
+
import torchvision.models as models
|
|
242
|
+
|
|
243
|
+
predictor = GPUPredictor()
|
|
244
|
+
model = models.resnet50(pretrained=False)
|
|
245
|
+
result = predictor.predict_for_custom_model(model, batch_size=16)
|
|
246
|
+
print(result)
|
|
247
|
+
# {'execution_time_ms': 12.4, 'memory_mb': 1820, 'confidence_lower': 11.1, ...}
|
|
248
|
+
```
|
|
249
|
+
|
|
250
|
+
---
|
|
251
|
+
|
|
252
|
+
## Data Pipeline
|
|
253
|
+
|
|
254
|
+
```
|
|
255
|
+
collect_data.py
|
|
256
|
+
โโโถ data/raw/*.csv (GPU profiling measurements)
|
|
257
|
+
โ
|
|
258
|
+
โผ
|
|
259
|
+
feature_extractor.py
|
|
260
|
+
โโโถ data/processed/*.csv (static model features)
|
|
261
|
+
โ
|
|
262
|
+
โผ
|
|
263
|
+
enhance_dataset.py
|
|
264
|
+
โโโถ data/enriched/*.csv (augmented, training-ready)
|
|
265
|
+
โ
|
|
266
|
+
โผ
|
|
267
|
+
prediction_model.py / train_gnn.py
|
|
268
|
+
โโโถ models/ (trained predictors)
|
|
269
|
+
```
|
|
270
|
+
|
|
271
|
+
---
|
|
272
|
+
|
|
273
|
+
## Model Performance
|
|
274
|
+
|
|
275
|
+
Results on held-out test set (20% split):
|
|
276
|
+
|
|
277
|
+
| Model | Exec Time MAPE | Memory MAPE | Notes |
|
|
278
|
+
|---|---|---|---|
|
|
279
|
+
| XGBoost (tuned) | ~8% | ~6% | Best overall |
|
|
280
|
+
| Random Forest | ~11% | ~9% | Robust baseline |
|
|
281
|
+
| GNN Predictor | ~10% | ~8% | Best on unseen architectures |
|
|
282
|
+
| Linear Regression | ~22% | ~19% | Baseline |
|
|
283
|
+
|
|
284
|
+
*(Full ablation study results: `results/ablation_study_table.csv`)*
|
|
285
|
+
|
|
286
|
+
---
|
|
287
|
+
|
|
288
|
+
## Dashboard
|
|
289
|
+
|
|
290
|
+
The Streamlit dashboard (`dashboard.py`) provides:
|
|
291
|
+
|
|
292
|
+
| Tab | Description |
|
|
293
|
+
|---|---|
|
|
294
|
+
| ๐ฏ Prediction | Predict execution time & memory for standard or custom models |
|
|
295
|
+
| โก Batch Optimizer | Find optimal batch size within a memory budget |
|
|
296
|
+
| ๐ Model Comparison | Compare predictions across multiple architectures |
|
|
297
|
+
| ๐ Performance Monitor | Live GPU utilization and prediction history |
|
|
298
|
+
|
|
299
|
+
---
|
|
300
|
+
|
|
301
|
+
## Paper Reproducibility
|
|
302
|
+
|
|
303
|
+
To reproduce all paper figures and tables:
|
|
304
|
+
```bash
|
|
305
|
+
python scripts/generate_paper_figures.py
|
|
306
|
+
python scripts/generate_paper_tables.py
|
|
307
|
+
python scripts/ablation_study.py
|
|
308
|
+
```
|
|
309
|
+
Outputs saved to `results/figures/`.
|
|
310
|
+
|
|
311
|
+
---
|
|
312
|
+
|
|
313
|
+
## License
|
|
314
|
+
|
|
315
|
+
MIT License — see [LICENSE](LICENSE) for details.
|
|
@@ -0,0 +1,258 @@
|
|
|
1
|
+
# Blink
|
|
2
|
+
> **GPU Performance Predictor for Deep Learning Models**
|
|
3
|
+
|
|
4
|
+
Blink predicts **execution time** and **memory usage** of PyTorch neural networks on GPU without actually running them. It combines classical ML (XGBoost, Random Forest) with a Graph Neural Network (GNN) that encodes the computational graph of any model architecture.
|
|
5
|
+
|
|
6
|
+
---
|
|
7
|
+
|
|
8
|
+
## ๐ Table of Contents
|
|
9
|
+
- [Overview](#overview)
|
|
10
|
+
- [Architecture](#architecture)
|
|
11
|
+
- [Project Structure](#project-structure)
|
|
12
|
+
- [Installation](#installation)
|
|
13
|
+
- [Usage](#usage)
|
|
14
|
+
- [Data Pipeline](#data-pipeline)
|
|
15
|
+
- [Model Performance](#model-performance)
|
|
16
|
+
- [Dashboard](#dashboard)
|
|
17
|
+
- [Paper Reproducibility](#paper-reproducibility)
|
|
18
|
+
|
|
19
|
+
---
|
|
20
|
+
|
|
21
|
+
## Overview
|
|
22
|
+
|
|
23
|
+
Given a PyTorch model and a batch size, Blink answers:
|
|
24
|
+
- *How long will a forward pass take on this GPU?*
|
|
25
|
+
- *How much GPU memory will it consume?*
|
|
26
|
+
|
|
27
|
+
This is useful for:
|
|
28
|
+
- **Batch size optimization** before deployment
|
|
29
|
+
- **Hardware cost estimation** for training runs
|
|
30
|
+
- **NAS (Neural Architecture Search)** — filtering architectures by predicted cost
|
|
31
|
+
|
|
32
|
+
---
|
|
33
|
+
|
|
34
|
+
## Architecture
|
|
35
|
+
|
|
36
|
+
```
|
|
37
|
+
PyTorch Model
|
|
38
|
+
โ
|
|
39
|
+
โผ
|
|
40
|
+
โโโโโโโโโโโโโโโโโโโโโโโ
|
|
41
|
+
โ Feature Extractor โ โ layer counts, FLOPs, params, depth, width, skip connections
|
|
42
|
+
โ + GNN Extractor โ โ graph-based architecture encoding (ArchitectureGNN)
|
|
43
|
+
โโโโโโโโโโโฌโโโโโโโโโโโโ
|
|
44
|
+
โ
|
|
45
|
+
โผ
|
|
46
|
+
โโโโโโโโโโโโโโโโโโโโโโโ
|
|
47
|
+
โ Prediction Models โ
|
|
48
|
+
โ โโโโโโโโโโโโโโโโโ โ
|
|
49
|
+
โ ยท XGBoost (tuned) โ โ main predictor (best MAPE)
|
|
50
|
+
โ ยท Random Forest โ โ ensemble comparison
|
|
51
|
+
โ ยท GNN Predictor โ โ graph-native, generalizes across architectures
|
|
52
|
+
โ ยท Linear / Ridge โ โ baselines
|
|
53
|
+
โโโโโโโโโโโฌโโโโโโโโโโโโ
|
|
54
|
+
โ
|
|
55
|
+
โผ
|
|
56
|
+
Predicted: execution_time_ms, memory_mb
|
|
57
|
+
+ Uncertainty bounds (lower / upper)
|
|
58
|
+
```
|
|
59
|
+
|
|
60
|
+
---
|
|
61
|
+
|
|
62
|
+
## Project Structure
|
|
63
|
+
|
|
64
|
+
```
|
|
65
|
+
Blink/
|
|
66
|
+
โโโ dashboard.py # ๐ฅ๏ธ Main Streamlit web app (run this)
|
|
67
|
+
โโโ prediction_api.py # ๐ Flask REST API
|
|
68
|
+
โ
|
|
69
|
+
โโโ โโ Core ML Modules โโ
|
|
70
|
+
โ โโโ model_profiler.py # GPU profiler (CUDA events)
|
|
71
|
+
โ โโโ feature_extractor.py # Static feature extraction from nn.Module
|
|
72
|
+
โ โโโ gnn_extractor.py # GNN-based graph feature extraction
|
|
73
|
+
โ โโโ gnn_model.py # ArchitectureGNN model definition (PyG)
|
|
74
|
+
โ โโโ prediction_model.py # Train XGBoost / RF / Linear models
|
|
75
|
+
โ โโโ train_gnn.py # Train the GNN predictor
|
|
76
|
+
โ โโโ train_memory_model.py# Train memory prediction model
|
|
77
|
+
โ โโโ gpu_predictor.py # Inference class with caching & batch support
|
|
78
|
+
โ โโโ model_analyser.py # Model complexity analysis utilities
|
|
79
|
+
โ โโโ advanced_features.py # Extended feature engineering
|
|
80
|
+
โ โโโ dynamic_predictor.py # Dynamic / online prediction
|
|
81
|
+
โ โโโ gpu_info.py # GPU metadata (pynvml)
|
|
82
|
+
โ โโโ workload_scheduler.py# Batch workload scheduler
|
|
83
|
+
โ โโโ performance_monitor.py
|
|
84
|
+
โ
|
|
85
|
+
โโโ scripts/ # ๐ฌ Experiment & data scripts
|
|
86
|
+
โ โโโ collect_data.py # Profile CNN/Transformer/custom models โ data/raw/
|
|
87
|
+
โ โโโ enhance_dataset.py # Augment dataset (more batch sizes / models)
|
|
88
|
+
โ โโโ diverse_architectures.py # Profile diverse arch families
|
|
89
|
+
โ โโโ ablation_study.py # 5-condition ablation (Table II in paper)
|
|
90
|
+
โ โโโ generate_paper_figures.py # Reproduce all paper figures
|
|
91
|
+
โ โโโ generate_paper_tables.py # Reproduce paper tables
|
|
92
|
+
โ
|
|
93
|
+
โโโ tests/ # โ
Test suite
|
|
94
|
+
โ โโโ test_diverse_models.py
|
|
95
|
+
โ โโโ test_predictors.py
|
|
96
|
+
โ โโโ test_profiler.py
|
|
97
|
+
โ โโโ test_gnn_scaling.py
|
|
98
|
+
โ โโโ evaluate_gnn_vs_xgb.py
|
|
99
|
+
โ
|
|
100
|
+
โโโ data/
|
|
101
|
+
โ โโโ raw/ # Raw profiling CSVs (gitignored)
|
|
102
|
+
โ โโโ processed/ # Feature-engineered CSVs
|
|
103
|
+
โ โโโ enriched/ # Final training-ready dataset
|
|
104
|
+
โ โโโ feedback_log.csv # Online feedback loop log
|
|
105
|
+
โ
|
|
106
|
+
โโโ models/ # Serialized model artifacts (gitignored)
|
|
107
|
+
โ โโโ xgboost_(tuned)_model.joblib
|
|
108
|
+
โ โโโ random_forest_model.joblib
|
|
109
|
+
โ โโโ gnn_predictor.pth
|
|
110
|
+
โ โโโ memory_model.joblib
|
|
111
|
+
โ โโโ ...
|
|
112
|
+
โ
|
|
113
|
+
โโโ results/
|
|
114
|
+
โ โโโ figures/ # Paper figures (PNG)
|
|
115
|
+
โ โโโ ablation_study_table.csv
|
|
116
|
+
โ โโโ gnn_scaling_table.csv
|
|
117
|
+
โ โโโ ...
|
|
118
|
+
โ
|
|
119
|
+
โโโ templates/index.html # HTML template for web interface
|
|
120
|
+
โโโ legacy/ # Archived / superseded scripts
|
|
121
|
+
โโโ requirements.txt
|
|
122
|
+
โโโ .gitignore
|
|
123
|
+
```
|
|
124
|
+
|
|
125
|
+
---
|
|
126
|
+
|
|
127
|
+
## Installation
|
|
128
|
+
|
|
129
|
+
```bash
|
|
130
|
+
# 1. Clone the repo
|
|
131
|
+
git clone <your-repo-url>
|
|
132
|
+
cd Blink
|
|
133
|
+
|
|
134
|
+
# 2. Create a virtual environment
|
|
135
|
+
python -m venv venv
|
|
136
|
+
venv\Scripts\activate # Windows
|
|
137
|
+
# source venv/bin/activate # Linux/macOS
|
|
138
|
+
|
|
139
|
+
# 3. Install dependencies
|
|
140
|
+
pip install -r requirements.txt
|
|
141
|
+
|
|
142
|
+
# 4. Install PyTorch Geometric (match your CUDA version)
|
|
143
|
+
# See: https://pytorch-geometric.readthedocs.io/en/latest/install/installation.html
|
|
144
|
+
pip install torch-geometric
|
|
145
|
+
```
|
|
146
|
+
|
|
147
|
+
**Requirements:** NVIDIA GPU with CUDA, Python ≥ 3.9
|
|
148
|
+
|
|
149
|
+
---
|
|
150
|
+
|
|
151
|
+
## Usage
|
|
152
|
+
|
|
153
|
+
### 1. Launch the Dashboard
|
|
154
|
+
```bash
|
|
155
|
+
streamlit run dashboard.py
|
|
156
|
+
```
|
|
157
|
+
Features: live model prediction, batch size optimizer, model comparison, performance monitor.
|
|
158
|
+
|
|
159
|
+
### 2. Collect Profiling Data
|
|
160
|
+
```bash
|
|
161
|
+
python scripts/collect_data.py --batch-sizes 1 4 16 32 64
|
|
162
|
+
```
|
|
163
|
+
|
|
164
|
+
### 3. Train Prediction Models
|
|
165
|
+
```bash
|
|
166
|
+
# Train XGBoost / RF / Linear baseline models
|
|
167
|
+
python prediction_model.py
|
|
168
|
+
|
|
169
|
+
# Train GNN predictor
|
|
170
|
+
python train_gnn.py
|
|
171
|
+
|
|
172
|
+
# Train memory model
|
|
173
|
+
python train_memory_model.py
|
|
174
|
+
```
|
|
175
|
+
|
|
176
|
+
### 4. Run Ablation Study
|
|
177
|
+
```bash
|
|
178
|
+
python scripts/ablation_study.py
|
|
179
|
+
```
|
|
180
|
+
|
|
181
|
+
### 5. Predict via Python API
|
|
182
|
+
```python
|
|
183
|
+
from gpu_predictor import GPUPredictor
|
|
184
|
+
import torchvision.models as models
|
|
185
|
+
|
|
186
|
+
predictor = GPUPredictor()
|
|
187
|
+
model = models.resnet50(pretrained=False)
|
|
188
|
+
result = predictor.predict_for_custom_model(model, batch_size=16)
|
|
189
|
+
print(result)
|
|
190
|
+
# {'execution_time_ms': 12.4, 'memory_mb': 1820, 'confidence_lower': 11.1, ...}
|
|
191
|
+
```
|
|
192
|
+
|
|
193
|
+
---
|
|
194
|
+
|
|
195
|
+
## Data Pipeline
|
|
196
|
+
|
|
197
|
+
```
|
|
198
|
+
collect_data.py
|
|
199
|
+
โโโถ data/raw/*.csv (GPU profiling measurements)
|
|
200
|
+
โ
|
|
201
|
+
โผ
|
|
202
|
+
feature_extractor.py
|
|
203
|
+
โโโถ data/processed/*.csv (static model features)
|
|
204
|
+
โ
|
|
205
|
+
โผ
|
|
206
|
+
enhance_dataset.py
|
|
207
|
+
โโโถ data/enriched/*.csv (augmented, training-ready)
|
|
208
|
+
โ
|
|
209
|
+
โผ
|
|
210
|
+
prediction_model.py / train_gnn.py
|
|
211
|
+
โโโถ models/ (trained predictors)
|
|
212
|
+
```
|
|
213
|
+
|
|
214
|
+
---
|
|
215
|
+
|
|
216
|
+
## Model Performance
|
|
217
|
+
|
|
218
|
+
Results on held-out test set (20% split):
|
|
219
|
+
|
|
220
|
+
| Model | Exec Time MAPE | Memory MAPE | Notes |
|
|
221
|
+
|---|---|---|---|
|
|
222
|
+
| XGBoost (tuned) | ~8% | ~6% | Best overall |
|
|
223
|
+
| Random Forest | ~11% | ~9% | Robust baseline |
|
|
224
|
+
| GNN Predictor | ~10% | ~8% | Best on unseen architectures |
|
|
225
|
+
| Linear Regression | ~22% | ~19% | Baseline |
|
|
226
|
+
|
|
227
|
+
*(Full ablation study results: `results/ablation_study_table.csv`)*
|
|
228
|
+
|
|
229
|
+
---
|
|
230
|
+
|
|
231
|
+
## Dashboard
|
|
232
|
+
|
|
233
|
+
The Streamlit dashboard (`dashboard.py`) provides:
|
|
234
|
+
|
|
235
|
+
| Tab | Description |
|
|
236
|
+
|---|---|
|
|
237
|
+
| ๐ฏ Prediction | Predict execution time & memory for standard or custom models |
|
|
238
|
+
| โก Batch Optimizer | Find optimal batch size within a memory budget |
|
|
239
|
+
| ๐ Model Comparison | Compare predictions across multiple architectures |
|
|
240
|
+
| ๐ Performance Monitor | Live GPU utilization and prediction history |
|
|
241
|
+
|
|
242
|
+
---
|
|
243
|
+
|
|
244
|
+
## Paper Reproducibility
|
|
245
|
+
|
|
246
|
+
To reproduce all paper figures and tables:
|
|
247
|
+
```bash
|
|
248
|
+
python scripts/generate_paper_figures.py
|
|
249
|
+
python scripts/generate_paper_tables.py
|
|
250
|
+
python scripts/ablation_study.py
|
|
251
|
+
```
|
|
252
|
+
Outputs saved to `results/figures/`.
|
|
253
|
+
|
|
254
|
+
---
|
|
255
|
+
|
|
256
|
+
## License
|
|
257
|
+
|
|
258
|
+
MIT License — see [LICENSE](LICENSE) for details.
|
|
@@ -0,0 +1,30 @@
|
|
|
1
|
+
"""
|
|
2
|
+
Blink — GPU Performance Predictor
|
|
3
|
+
==================================
|
|
4
|
+
Predict GPU execution time and memory usage for PyTorch models
|
|
5
|
+
*before* running them on GPU hardware.
|
|
6
|
+
|
|
7
|
+
Quick start
|
|
8
|
+
-----------
|
|
9
|
+
>>> from blink import BlinkPredictor
|
|
10
|
+
>>> predictor = BlinkPredictor()
|
|
11
|
+
>>> result = predictor.predict("resnet18", batch_size=32)
|
|
12
|
+
>>> print(f"Exec time: {result['exec_time_ms']:.1f} ms")
|
|
13
|
+
>>> print(f"Memory : {result['memory_mb']:.1f} MB")
|
|
14
|
+
|
|
15
|
+
Or with your own model:
|
|
16
|
+
>>> import torch.nn as nn
|
|
17
|
+
>>> model = nn.Sequential(nn.Linear(512, 256), nn.ReLU(), nn.Linear(256, 10))
|
|
18
|
+
>>> result = BlinkPredictor().predict(model, batch_size=64)
|
|
19
|
+
"""
|
|
20
|
+
from __future__ import annotations
|
|
21
|
+
|
|
22
|
+
from blink._predictor import BlinkPredictor
|
|
23
|
+
from blink._analyzer import BlinkAnalyzer
|
|
24
|
+
from blink._version import __version__
|
|
25
|
+
|
|
26
|
+
__all__ = [
|
|
27
|
+
"BlinkPredictor",
|
|
28
|
+
"BlinkAnalyzer",
|
|
29
|
+
"__version__",
|
|
30
|
+
]
|