univi 0.2.1__tar.gz → 0.2.5__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {univi-0.2.1/univi.egg-info → univi-0.2.5}/PKG-INFO +336 -108
- {univi-0.2.1 → univi-0.2.5}/README.md +335 -107
- {univi-0.2.1 → univi-0.2.5}/pyproject.toml +1 -1
- univi-0.2.5/univi/__init__.py +85 -0
- univi-0.2.5/univi/config.py +110 -0
- univi-0.2.5/univi/data.py +364 -0
- univi-0.2.5/univi/evaluation.py +377 -0
- {univi-0.2.1 → univi-0.2.5}/univi/models/__init__.py +4 -0
- univi-0.2.5/univi/models/decoders.py +249 -0
- {univi-0.2.1 → univi-0.2.5}/univi/models/encoders.py +3 -3
- {univi-0.2.1 → univi-0.2.5}/univi/models/mlp.py +11 -5
- univi-0.2.5/univi/models/univi.py +886 -0
- univi-0.2.5/univi/plotting.py +126 -0
- univi-0.2.5/univi/trainer.py +340 -0
- univi-0.2.5/univi/utils/io.py +222 -0
- {univi-0.2.1 → univi-0.2.5/univi.egg-info}/PKG-INFO +336 -108
- univi-0.2.1/univi/__init__.py +0 -35
- univi-0.2.1/univi/config.py +0 -71
- univi-0.2.1/univi/data.py +0 -190
- univi-0.2.1/univi/evaluation.py +0 -555
- univi-0.2.1/univi/models/decoders.py +0 -443
- univi-0.2.1/univi/models/univi.py +0 -440
- univi-0.2.1/univi/plotting.py +0 -129
- univi-0.2.1/univi/trainer.py +0 -294
- univi-0.2.1/univi/utils/io.py +0 -230
- {univi-0.2.1 → univi-0.2.5}/LICENSE +0 -0
- {univi-0.2.1 → univi-0.2.5}/setup.cfg +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/__main__.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/cli.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/diagnostics.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/hyperparam_optimization/__init__.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/hyperparam_optimization/common.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/hyperparam_optimization/run_adt_hparam_search.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/hyperparam_optimization/run_atac_hparam_search.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/hyperparam_optimization/run_citeseq_hparam_search.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/hyperparam_optimization/run_multiome_hparam_search.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/hyperparam_optimization/run_rna_hparam_search.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/hyperparam_optimization/run_teaseq_hparam_search.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/matching.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/objectives.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/pipeline.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/utils/__init__.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/utils/logging.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/utils/seed.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/utils/stats.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi/utils/torch_utils.py +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi.egg-info/SOURCES.txt +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi.egg-info/dependency_links.txt +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi.egg-info/entry_points.txt +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi.egg-info/requires.txt +0 -0
- {univi-0.2.1 → univi-0.2.5}/univi.egg-info/top_level.txt +0 -0
@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: univi
-Version: 0.2.1
+Version: 0.2.5
 Summary: UniVI: a scalable multi-modal variational autoencoder toolkit for seamless integration and analysis of multimodal single-cell data.
 Author-email: "Andrew J. Ashford" <ashforda@ohsu.edu>
 License: MIT License
@@ -57,33 +57,33 @@ Dynamic: license-file
 # UniVI

 [](https://pypi.org/project/univi/)
-[](https://pypi.org/project/univi/)

 <picture>
   <!-- Dark mode (GitHub supports this; PyPI may ignore <source>) -->
   <source media="(prefers-color-scheme: dark)"
-          srcset="https://raw.githubusercontent.com/Ashford-A/UniVI/v0.2.
+          srcset="https://raw.githubusercontent.com/Ashford-A/UniVI/v0.2.5/assets/figures/univi_overview_dark.png">
   <!-- Light mode / fallback (works on GitHub + PyPI) -->
-  <img src="https://raw.githubusercontent.com/Ashford-A/UniVI/v0.2.
+  <img src="https://raw.githubusercontent.com/Ashford-A/UniVI/v0.2.5/assets/figures/univi_overview_light.png"
        alt="UniVI overview and evaluation roadmap"
        width="100%">
 </picture>

-**UniVI overview and evaluation roadmap.**
-(a) Generic UniVI architecture schematic. (b) Core training objective (for UniVI v1 - see documentation for UniVI-lite training objective). (c) Example modality combinations beyond bi-modal data (e.g. TEA-seq (tri-modal RNA + ATAC + ADT)). (d) Evaluation roadmap spanning latent alignment (FOSCTTM
+**UniVI overview and evaluation roadmap.**
+(a) Generic UniVI architecture schematic. (b) Core training objective (for UniVI v1 - see documentation for UniVI-lite training objective). (c) Example modality combinations beyond bi-modal data (e.g. TEA-seq (tri-modal RNA + ATAC + ADT)). (d) Evaluation roadmap spanning latent alignment (FOSCTTM), modality mixing, label transfer, reconstruction/prediction NLL, and downstream biological consistency.

 ---

 UniVI is a **multi-modal variational autoencoder (VAE)** framework for aligning and integrating single-cell modalities such as RNA, ADT (CITE-seq), and ATAC. It’s built to support experiments like:

-
-
-
-
-
-
-
-
+* Joint embedding of RNA + ADT (CITE-seq)
+* RNA + ATAC (Multiome) integration
+* RNA + ADT + ATAC (TEA-seq) tri-modal data integration
+* Independent non-paired modalities from the same tissue type
+* Cross-modal reconstruction and imputation
+* Data denoising
+* Structured evaluation of alignment quality (FOSCTTM, modality mixing, label transfer, etc.)
+* Exploratory analysis of the relationships between heterogeneous molecular readouts that inform biological functional dimensions

 This repository contains the core UniVI code, training scripts, parameter files, and example notebooks.

@@ -93,8 +93,8 @@ This repository contains the core UniVI code, training scripts, parameter files,

 If you use UniVI in your work, please cite:

-> Ashford AJ, Enright T, Nikolova O, Demir E.
-> **Unifying Multimodal Single-Cell Data Using a Mixture of Experts β-Variational Autoencoder-Based Framework.**
+> Ashford AJ, Enright T, Nikolova O, Demir E.
+> **Unifying Multimodal Single-Cell Data Using a Mixture of Experts β-Variational Autoencoder-Based Framework.**
 > *bioRxiv* (2025). doi: [10.1101/2025.02.28.640429](https://www.biorxiv.org/content/10.1101/2025.02.28.640429v1.full)

 ```bibtex
@@ -106,11 +106,12 @@ If you use UniVI in your work, please cite:
   doi = {10.1101/2025.02.28.640429},
   url = {https://www.biorxiv.org/content/10.1101/2025.02.28.640429v1}
 }
-
+```

 ---

 ## License
+
 MIT License — see `LICENSE`.

 ---
@@ -145,7 +146,7 @@ UniVI/
 │   ├── evaluate_univi.py                  # Evaluate trained models (FOSCTTM, label transfer, etc.)
 │   ├── benchmark_univi_citeseq.py         # CITE-seq-specific benchmarking script
 │   ├── run_multiome_hparam_search.py
-│   ├── run_frequency_robustness.py
+│   ├── run_frequency_robustness.py        # Composition/frequency mismatch robustness
 │   ├── run_do_not_integrate_detection.py  # “Do-not-integrate” unmatched population demo
 │   ├── run_benchmarks.py                  # Unified wrapper (includes optional Harmony baseline)
 │   └── revision_reproduce_all.sh          # One-click: reproduces figures + supplemental tables
@@ -189,6 +190,8 @@ UniVI/

 ```

+---
+
 ## Generated outputs

 Most entry-point scripts write results into a user-specified output directory (commonly `runs/`), which is **not** tracked in git.
@@ -339,49 +342,252 @@ See the notebooks under `notebooks/` for end-to-end preprocessing examples for C

 ---

-##
+## Training modes & example recipes (v1 vs v2/lite + supervised options)

 UniVI supports two training regimes:

-* **UniVI v1**:
-* **UniVI-lite**:
+* **UniVI v1**: per-modality posteriors + reconstruction terms controlled by `v1_recon` (cross/self/avg/etc.) + posterior alignment across modality posteriors.
+* **UniVI-lite / v2**: fused latent posterior (precision-weighted MoE/PoE style) + per-modality reconstruction + β·KL(q_fused||p) + γ·pairwise alignment between modality posteriors. Scales cleanly to 3+ modalities and is the recommended default.
+
+### Which supervised option should I use?
+
+Use labels to “shape” the latent in one of three ways:
+
+1. **Classification head (decoder-only)** — `p(y|z)` (**recommended default**)
+   *Works for `loss_mode="lite"` and `loss_mode="v1"`.*
+   Best if you want the latent to be predictive/separable without changing how modalities reconstruct.
+
+2. **Label expert injected into fusion (encoder-side)** — `q(z|y)` (**lite/v2 only**)
+   *Works only for `loss_mode="lite"` / `v2`.*
+   Best for semi-supervised settings where labels should directly influence the **fused posterior**.
+
+3. **Labels as a full categorical “modality”** — `"celltype"` modality with likelihood `"categorical"`
+   *Best with `loss_mode="lite"`.*
+   Useful when you want cell types to behave like a first-class modality (encode/decode/reconstruct), but avoid `v1` cross-reconstruction unless you really know you want it.
+
+---
+
+## Supervised labels (three supported patterns)
+
+### A) Latent classification head (decoder-only): `p(y|z)` (works in **lite/v2** and **v1**)
+
+This is the simplest way to shape the latent. UniVI attaches a categorical head to the latent `z` and adds:
+
+```math
+\mathcal{L} \;+=\; \lambda \cdot \mathrm{CE}(\mathrm{logits}(z), y)
+```
+
+**How to enable:** initialize the model with:
+
+* `n_label_classes > 0`
+* `label_loss_weight` (default `1.0`)
+* `label_ignore_index` (default `-1`, used to mask unlabeled rows)
+
+```python
+import numpy as np
+import torch
+
+from univi import UniVIMultiModalVAE, UniVIConfig, ModalityConfig
+
+# Example labels (0..C-1) from AnnData
+y_codes = rna.obs["celltype"].astype("category").cat.codes.to_numpy()
+n_classes = int(y_codes.max() + 1)
+
+univi_cfg = UniVIConfig(
+    latent_dim=40,
+    beta=5.0,
+    gamma=40.0,
+    modalities=[
+        ModalityConfig("rna", rna.n_vars, [1024, 512], [512, 1024], likelihood="nb"),
+        ModalityConfig("adt", adt.n_vars, [256, 128], [128, 256], likelihood="nb"),
+    ],
+)
+
+model = UniVIMultiModalVAE(
+    univi_cfg,
+    loss_mode="lite",          # OR "v1"
+    n_label_classes=n_classes,
+    label_loss_weight=1.0,
+    label_ignore_index=-1,
+    classify_from_mu=True,
+).to("cuda")
+```
+
+During training your batch should provide `y`, and your loop should call:
+
+```python
+out = model(x_dict, y=y, epoch=epoch)
+loss = out["loss"]
+```
+
+Unlabeled cells are supported: set `y=-1` and CE is automatically masked.
+
+---
+
+### B) Label expert injected into fusion: `q(z|y)` (**lite/v2 only**)
+
+In **lite/v2**, UniVI can optionally add a **label encoder** as an additional expert into MoE fusion. Labeled cells get an extra “expert vote” in the fused posterior; unlabeled cells ignore it automatically.
+
+```python
+model = UniVIMultiModalVAE(
+    univi_cfg,
+    loss_mode="lite",
+
+    # Optional: keep the decoder-side classification head too
+    n_label_classes=n_classes,
+    label_loss_weight=1.0,
+
+    # Encoder-side label expert injected into fusion
+    use_label_encoder=True,
+    label_moe_weight=1.0,      # >1 => labels influence fusion more
+    unlabeled_logvar=20.0,     # very high => tiny precision => ignored in fusion
+    label_encoder_warmup=5,    # wait N epochs before injecting labels into fusion
+    label_ignore_index=-1,
+).to("cuda")
+```
+
+**Notes**
+
+* This pathway is **only used in `loss_mode="lite"` / `v2`**, because it is implemented as an extra expert inside fusion.
+* Unlabeled cells (`y=-1`) are automatically ignored in fusion via a huge log-variance.
+
+---
+
+### C) Treat labels as a categorical “modality” (best with **lite/v2**)
+
+Instead of providing `y` separately, you can represent labels as another modality (e.g. `"celltype"`) with likelihood `"categorical"`. This makes labels a first-class modality with its own encoder/decoder.
+
+**Recommended representation:** one-hot matrix `(B, C)` stored in `.X`.
+
+```python
+import numpy as np
+from anndata import AnnData
+
+# y codes (0..C-1)
+y_codes = rna.obs["celltype"].astype("category").cat.codes.to_numpy()
+C = int(y_codes.max() + 1)
+
+Y = np.eye(C, dtype=np.float32)[y_codes]    # (B, C) one-hot
+
+celltype = AnnData(X=Y)
+celltype.obs_names = rna.obs_names.copy()   # MUST match paired modalities
+celltype.var_names = [f"class_{i}" for i in range(C)]
+
+adata_dict = {"rna": rna, "adt": adt, "celltype": celltype}
+
+univi_cfg = UniVIConfig(
+    latent_dim=40,
+    beta=5.0,
+    gamma=40.0,
+    modalities=[
+        ModalityConfig("rna", rna.n_vars, [1024, 512], [512, 1024], likelihood="nb"),
+        ModalityConfig("adt", adt.n_vars, [256, 128], [128, 256], likelihood="nb"),
+        ModalityConfig("celltype", C, [128], [128], likelihood="categorical"),
+    ],
+)
+
+model = UniVIMultiModalVAE(univi_cfg, loss_mode="lite").to("cuda")
+```
+
+**Important caveat for `loss_mode="v1"`**
+`v1` can perform cross-reconstruction across all modalities. If you include `"celltype"` as a modality, you typically **do not** want cross-recon terms like `celltype → RNA`. If you must run `v1` with label-as-modality, prefer:
+
+```python
+model = UniVIMultiModalVAE(univi_cfg, loss_mode="v1", v1_recon="self").to("cuda")
+```
+
+If you want full `v1` cross-reconstruction and label shaping, prefer **Pattern A (classification head)** instead.
+
+---
+
+## Running a minimal training script (UniVI v1 vs UniVI-lite)

 ### 0) Choose the training objective (`loss_mode`) in your config JSON

-In `parameter_files/*.json`, set a single switch that controls the objective
+In `parameter_files/*.json`, set a single switch that controls the objective.

-**Paper objective (v1;
+**Paper objective (v1; `"avg"` trains with 50% weight on self-reconstruction and 50% weight on cross-reconstruction, with weights automatically adjusted so this stays true for any number of modalities):**

 ```json5
 {
   "model": {
     "loss_mode": "v1",
-    "v1_recon": "
-    "v1_recon_mix": 0.0,        // optional extra averaged-z recon weight
+    "v1_recon": "avg",
     "normalize_v1_terms": true
+  }
 }
-
+```

 **UniVI-lite objective (v2; lightweight / fusion-based):**

 ```json5
 {
   "model": {
-    "loss_mode": "lite"
-
-    "v1_recon_mix": 0.0,        // doesn't get used if loss_mode="lite"
-    "normalize_v1_terms": true  // doesn't get used if loss_mode="lite"
+    "loss_mode": "lite"
+  }
 }
 ```

 > **Note**
 > `loss_mode: "lite"` is an alias for `loss_mode: "v2"` (they run the same objective in the current code).

+### 0b) (Optional) Enable supervised labels from config JSON
+
+**Classification head (decoder-only):**
+
+```json5
+{
+  "model": {
+    "loss_mode": "lite",
+    "n_label_classes": 30,
+    "label_loss_weight": 1.0,
+    "label_ignore_index": -1,
+    "classify_from_mu": true
+  }
+}
+```
+
+**Lite + label expert injected into fusion (encoder-side):**
+
+```json5
+{
+  "model": {
+    "loss_mode": "lite",
+    "n_label_classes": 30,
+    "label_loss_weight": 1.0,
+
+    "use_label_encoder": true,
+    "label_moe_weight": 1.0,
+    "unlabeled_logvar": 20.0,
+    "label_encoder_warmup": 5,
+    "label_ignore_index": -1
+  }
+}
+```
+
+**Labels as a categorical modality:** add an additional `"celltype"` modality in `"data.modalities"` and provide a matching AnnData on disk (or build it in Python).
+
+```json5
+{
+  "model": { "loss_mode": "lite" },
+  "data": {
+    "modalities": [
+      { "name": "rna", "likelihood": "nb", "X_key": "X", "layer": "counts" },
+      { "name": "adt", "likelihood": "nb", "X_key": "X", "layer": "counts" },
+      { "name": "celltype", "likelihood": "categorical", "X_key": "X", "layer": null }
+    ]
+  }
+}
+```
+
 ### 1) Normalization / representation switch (counts vs continuous)

-
+**Important note on selectors:**
+
+* `layer` selects `.layers[layer]` (if `X_key == "X"`).
+* `X_key == "X"` selects `.X`/`.layers[layer]`; otherwise `X_key` selects `.obsm[X_key]`.

-
+Correct pattern:

 ```json5
 {
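The UniVI-lite objective added above is described in prose (per-modality reconstruction, a β-weighted KL on the fused posterior, and a γ-weighted pairwise alignment between modality posteriors). A rough symbolic form of that description, written here as a reading aid and not as an excerpt from `univi/models/univi.py`:

```math
\mathcal{L}_{\text{lite}}
= \sum_{m=1}^{M} \mathrm{NLL}\big(x_m \mid z \sim q_{\text{fused}}\big)
+ \beta \, \mathrm{KL}\big(q_{\text{fused}}(z \mid x_{1:M}) \,\|\, p(z)\big)
+ \gamma \sum_{m < m'} D\big(q_m(z \mid x_m)\,,\, q_{m'}(z \mid x_{m'})\big)
```

Here `q_fused` is the precision-weighted MoE/PoE combination of the per-modality posteriors and `D` stands for the pairwise alignment divergence mentioned in the bullet; the exact divergence and weighting conventions are defined by the package itself.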
@@ -389,35 +595,34 @@ Recommended pattern (example showing several preprocessing options for the diffe
     "modalities": [
       {
         "name": "rna",
-        "layer": "log1p",
+        "layer": "log1p",          // uses adata.layers["log1p"] (since X_key=="X")
         "X_key": "X",
-        "assume_log1p": true,
+        "assume_log1p": true,
         "likelihood": "gaussian"
       },
       {
         "name": "adt",
-        "layer": "counts",
-        "X_key": "
-        "assume_log1p": false,
+        "layer": "counts",         // uses adata.layers["counts"] (since X_key=="X")
+        "X_key": "X",
+        "assume_log1p": false,
        "likelihood": "zinb"
       },
       {
         "name": "atac",
-        "layer":
-        "X_key": "X_lsi",
+        "layer": null,             // ignored because X_key != "X"
+        "X_key": "X_lsi",          // uses adata.obsm["X_lsi"]
         "assume_log1p": false,
         "likelihood": "gaussian"
       }
     ]
   }
 }
-
 ```

 * Use `.layers["counts"]` when you want NB/ZINB/Poisson decoders.
-* Use continuous `.X`
+* Use continuous `.X` or `.obsm["X_lsi"]` when you want Gaussian/MSE decoders.

-> Jupyter
+> Jupyter notebooks in this repository (UniVI/notebooks/) show recommended preprocessing per dataset for different data types and analyses. Depending on your research goals, you can use several different methods of preprocessing. The model is robust when it comes to learning underlying biology regardless of preprocessing; the key is that the decoder likelihood should roughly match the input distribution per-modality.

 ### 2) Train (CLI)

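The `layer` / `X_key` selection rule quoted above reduces to a small lookup. A minimal sketch of that rule, using a hypothetical helper name (`pick_matrix`) that is not part of the univi API:

```python
from typing import Optional

import numpy as np
from anndata import AnnData


def pick_matrix(adata: AnnData, X_key: str = "X", layer: Optional[str] = None) -> np.ndarray:
    """Sketch of the selector behaviour described in the README:
    X_key == "X"  -> use .layers[layer] if a layer is named, else .X
    X_key != "X"  -> use .obsm[X_key] (layer is ignored)."""
    if X_key == "X":
        X = adata.layers[layer] if layer else adata.X
    else:
        X = adata.obsm[X_key]  # e.g. "X_lsi" for ATAC LSI factors
    # Densify sparse matrices so downstream code always sees an ndarray.
    return np.asarray(X.toarray()) if hasattr(X, "toarray") else np.asarray(X)
```

Under that rule, the `rna` and `adt` blocks in the JSON above read count/log1p layers, while the `atac` block pulls LSI factors from `.obsm["X_lsi"]`.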
@@ -456,8 +661,8 @@ python scripts/train_univi.py \

 ```bash
 python scripts/train_univi.py \
-  --config parameter_files/ \
-  --outdir saved_models/
+  --config parameter_files/defaults_multiome_lite.json \
+  --outdir saved_models/multiome_lite_run1 \
   --data-root /path/to/your/data
 ```

@@ -481,7 +686,9 @@ python scripts/train_univi.py \
   --data-root /path/to/your/data
 ```

-
+---
+
+## Quickstart: run UniVI from Python / Jupyter

 If you prefer to stay inside a notebook or a Python script instead of calling the CLI, you can build the configs, model, and trainer directly.

@@ -500,33 +707,29 @@ from univi import (
     UniVIConfig,
     TrainingConfig,
 )
-from univi.data import MultiModalDataset
+from univi.data import MultiModalDataset, align_paired_obs_names
 from univi.trainer import UniVITrainer
-
+```

-
+### 1) Load preprocessed AnnData (paired cells)

 ```python
-# Example: CITE-seq with RNA + ADT
 rna = sc.read_h5ad("path/to/rna_citeseq.h5ad")
 adt = sc.read_h5ad("path/to/adt_citeseq.h5ad")

-
-adata_dict =
-    "rna": rna,
-    "adt": adt,
-}
+adata_dict = {"rna": rna, "adt": adt}
+adata_dict = align_paired_obs_names(adata_dict)   # ensures same obs_names/order
 ```

-
+### 2) Build `MultiModalDataset` and DataLoaders (unsupervised)

 ```python
 device = "cuda" if torch.cuda.is_available() else "cpu"

 dataset = MultiModalDataset(
     adata_dict=adata_dict,
-    X_key="X",
-    device=
+    X_key="X",
+    device=None,        # "cpu" or "cuda"
 )

 n_cells = rna.n_obs
@@ -542,29 +745,35 @@ val_ds = Subset(dataset, val_idx)

 batch_size = 256

-train_loader = DataLoader(
-
-
-    shuffle=True,
-    num_workers=0,
-)
+train_loader = DataLoader(train_ds, batch_size=batch_size, shuffle=True, num_workers=0)
+val_loader = DataLoader(val_ds, batch_size=batch_size, shuffle=False, num_workers=0)
+```

-
-
-
-
-
-)
+### 2b) (Optional) Supervised batches for Pattern A/B (`(x_dict, y)`)
+
+If you use the classification head and/or label expert injection, supply `y` as integer class indices and mask unlabeled with `-1`.
+
+```python
+y_codes = rna.obs["celltype"].astype("category").cat.codes.to_numpy()
+
+dataset_sup = MultiModalDataset(adata_dict=adata_dict, X_key="X", labels=y_codes)
+
+def collate_xy(batch):
+    xs, ys = zip(*batch)
+    x = {k: torch.stack([d[k] for d in xs], 0) for k in xs[0].keys()}
+    y = torch.as_tensor(ys, dtype=torch.long)
+    return x, y
+
+train_loader = DataLoader(dataset_sup, batch_size=batch_size, shuffle=True, collate_fn=collate_xy)
 ```

-
+### 3) Define UniVI configs (v1 vs UniVI-lite)

 ```python
-# UniVI model config (architecture + regularization)
 univi_cfg = UniVIConfig(
     latent_dim=40,
-    beta=5.0,
-    gamma=40.0,
+    beta=5.0,
+    gamma=40.0,
     encoder_dropout=0.1,
     decoder_dropout=0.0,
     encoder_batchnorm=True,
@@ -574,24 +783,11 @@ univi_cfg = UniVIConfig(
     align_anneal_start=0,
     align_anneal_end=25,
     modalities=[
-        ModalityConfig(
-
-            input_dim=rna.n_vars,
-            encoder_hidden=[1024, 512],
-            decoder_hidden=[512, 1024],
-            likelihood="nb",   # counts-like RNA
-        ),
-        ModalityConfig(
-            name="adt",
-            input_dim=adt.n_vars,
-            encoder_hidden=[256, 128],
-            decoder_hidden=[128, 256],
-            likelihood="nb",   # counts-like ADT
-        ),
+        ModalityConfig("rna", rna.n_vars, [1024, 512], [512, 1024], likelihood="nb"),
+        ModalityConfig("adt", adt.n_vars, [256, 128], [128, 256], likelihood="nb"),
     ],
 )

-# Training config (epochs, LR, device, etc.)
 train_cfg = TrainingConfig(
     n_epochs=200,
     batch_size=batch_size,
@@ -608,27 +804,47 @@ train_cfg = TrainingConfig(
 )
 ```

-
-
-* **v1** (paper objective): cross-reconstruction + cross-posterior alignment.
-  * Best when batches are paired/pseudo-paired and you want explicit cross-prediction.
-* **lite** (aka `"v2"`): missing-modality friendly; trains even if some modalities are absent in a batch.
+### 4) Choose the objective + supervised option

 ```python
-# Option A: UniVI v1 (
+# Option A: UniVI v1 (unsupervised)
 model = UniVIMultiModalVAE(
     univi_cfg,
     loss_mode="v1",
-    v1_recon="
-    v1_recon_mix=0.0,
+    v1_recon="avg",
+    v1_recon_mix=0.0,
     normalize_v1_terms=True,
 ).to(device)

-# Option B: UniVI-lite (
+# Option B: UniVI-lite / v2 (unsupervised)
 # model = UniVIMultiModalVAE(univi_cfg, loss_mode="lite").to(device)
+
+# Option C: Add classification head (Pattern A; works in lite/v2 AND v1)
+# n_classes = int(y_codes.max() + 1)
+# model = UniVIMultiModalVAE(
+#     univi_cfg,
+#     loss_mode="lite",
+#     n_label_classes=n_classes,
+#     label_loss_weight=1.0,
+#     label_ignore_index=-1,
+#     classify_from_mu=True,
+# ).to(device)
+
+# Option D: Add label expert injection into fusion (Pattern B; lite/v2 ONLY)
+# model = UniVIMultiModalVAE(
+#     univi_cfg,
+#     loss_mode="lite",
+#     n_label_classes=n_classes,
+#     label_loss_weight=1.0,
+#     use_label_encoder=True,
+#     label_moe_weight=1.0,
+#     unlabeled_logvar=20.0,
+#     label_encoder_warmup=5,
+#     label_ignore_index=-1,
+# ).to(device)
 ```

-
+### 5) Train inside Python / Jupyter

 ```python
 trainer = UniVITrainer(
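If you drive training yourself instead of using `UniVITrainer`, the supervised batches from `collate_xy` plug directly into the `out = model(x_dict, y=y, epoch=epoch)` call shown under Pattern A. A minimal sketch of such a loop; the optimizer choice and device handling are illustrative assumptions, not something the package prescribes:

```python
import torch

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

for epoch in range(train_cfg.n_epochs):
    model.train()
    for x_dict, y in train_loader:
        # Move the per-modality tensors and labels to the training device.
        x_dict = {k: v.to(device) for k, v in x_dict.items()}
        y = y.to(device)

        out = model(x_dict, y=y, epoch=epoch)   # CE is masked wherever y == -1
        loss = out["loss"]

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```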
@@ -639,26 +855,36 @@ trainer = UniVITrainer(
     device=device,
 )

-history = trainer.fit()
+history = trainer.fit()
 ```

-
-
-* Computing latent embeddings (`encode_modalities` / `mixture_of_experts`)
-* Cross-modal reconstruction (forward passes with different modality subsets)
-* Exporting `z` to AnnData or NumPy for downstream analysis (UMAP, clustering, DE, etc.)
-
-#### 6) Write latent `z` into AnnData `.obsm["X_univi"]`
+### 6) Write latent `z` into AnnData `.obsm["X_univi"]`

 ```python
 from univi import write_univi_latent

-Z = write_univi_latent(model, adata_dict, obsm_key="X_univi", device=device)
+Z = write_univi_latent(model, adata_dict, obsm_key="X_univi", device=device, use_mean=True)
 print("Embedding shape:", Z.shape)
 ```

 > **Tip**
->
+> Use `use_mean=True` for deterministic plotting/UMAP. Sampling (`use_mean=False`) is stochastic and useful for generative behavior.
+
+---
+
+## Evaluating / encoding: choosing the latent representation
+
+Some utilities (e.g., `encode_adata`) support selecting what embedding to return:
+
+* `"moe_mean"` / `"moe_sample"`: fused latent (MoE/PoE)
+* `"modality_mean"` / `"modality_sample"`: per-modality latent
+
+```python
+from univi.evaluation import encode_adata
+
+Z_rna = encode_adata(model, rna, modality="rna", device=device, layer="counts", latent="modality_mean")
+Z_moe = encode_adata(model, rna, modality="rna", device=device, layer="counts", latent="moe_mean")
+```

 ---

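The fused (`"moe_mean"`) latent above refers to the precision-weighted MoE/PoE combination of per-modality Gaussian posteriors described for UniVI-lite. A standalone sketch of that fusion rule, written under that description rather than copied from the package's internal function; it also shows why `unlabeled_logvar=20.0` effectively removes the label expert from fusion:

```python
from typing import List, Tuple

import torch


def precision_weighted_fusion(
    mus: List[torch.Tensor], logvars: List[torch.Tensor]
) -> Tuple[torch.Tensor, torch.Tensor]:
    """Fuse per-modality posteriors N(mu_m, var_m) by precision weighting (PoE-style)."""
    precisions = [torch.exp(-lv) for lv in logvars]        # 1 / var_m
    total_prec = torch.stack(precisions, dim=0).sum(dim=0)
    fused_var = 1.0 / total_prec
    fused_mu = fused_var * torch.stack(
        [p * mu for p, mu in zip(precisions, mus)], dim=0
    ).sum(dim=0)
    return fused_mu, torch.log(fused_var)


# An expert with logvar = 20 has precision exp(-20) ≈ 2e-9, so it contributes
# essentially nothing to the fused mean or variance — the "ignored in fusion"
# behaviour used for unlabeled cells.
```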
@@ -819,3 +1045,5 @@ Typical evaluation outputs include:

 For richer, exploratory workflows (TEA-seq tri-modal integration, Multiome RNA+ATAC, non-paired matching, etc.), see the notebooks in `notebooks/`.

+---
+