PyPI - pylocuszoom - Versions diffs - 0.5.0__tar.gz → 0.8.0__tar.gz - Mend

pylocuszoom 0.5.0tar.gz → 0.8.0tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (83) hide show

{pylocuszoom-0.5.0 → pylocuszoom-0.8.0}/.github/workflows/ci.yml RENAMED Viewed

@@ -44,14 +44,7 @@ jobs:
         run: uv sync --extra dev --extra all
       - name: Run tests
-        run: uv run pytest --cov=pylocuszoom --cov-report=xml
-      - name: Upload coverage
-        uses: codecov/codecov-action@v4
-        if: matrix.python-version == '3.11'
-        with:
-          files: ./coverage.xml
-          fail_ci_if_error: false
+        run: uv run pytest --cov=pylocuszoom --cov-report=term-missing
   build:
     runs-on: ubuntu-latest

pylocuszoom-0.8.0/.github/workflows/publish.yml ADDED Viewed

@@ -0,0 +1,110 @@
+name: Publish to PyPI
+on:
+  release:
+    types: [published]
+jobs:
+  publish:
+    runs-on: ubuntu-latest
+    environment: pypi
+    permissions:
+      id-token: write
+    outputs:
+      version: ${{ steps.version.outputs.version }}
+      sha256: ${{ steps.sha256.outputs.sha256 }}
+    steps:
+      - uses: actions/checkout@v4
+      - name: Install uv
+        uses: astral-sh/setup-uv@v5
+      - name: Get version
+        id: version
+        run: |
+          VERSION=$(grep '^version = ' pyproject.toml | sed 's/version = "\(.*\)"/\1/')
+          echo "version=$VERSION" >> "$GITHUB_OUTPUT"
+      - name: Build package
+        run: uv build
+      - name: Publish to PyPI
+        uses: pypa/gh-action-pypi-publish@release/v1
+      - name: Wait for PyPI availability
+        env:
+          PKG_VERSION: ${{ steps.version.outputs.version }}
+        run: |
+          echo "Waiting for pylocuszoom $PKG_VERSION to be available on PyPI..."
+          for i in {1..30}; do
+            if curl -s "https://pypi.org/pypi/pylocuszoom/$PKG_VERSION/json" | grep -q '"version"'; then
+              echo "Package available on PyPI"
+              break
+            fi
+            echo "Attempt $i: Package not yet available, waiting 10s..."
+            sleep 10
+          done
+      - name: Get SHA256
+        id: sha256
+        env:
+          PKG_VERSION: ${{ steps.version.outputs.version }}
+        run: |
+          URL="https://pypi.io/packages/source/p/pylocuszoom/pylocuszoom-$PKG_VERSION.tar.gz"
+          SHA256=$(curl -sL "$URL" | sha256sum | cut -d' ' -f1)
+          echo "sha256=$SHA256" >> "$GITHUB_OUTPUT"
+          echo "SHA256: $SHA256"
+  update-bioconda:
+    needs: publish
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+        with:
+          ref: main
+      - name: Update bioconda/meta.yaml
+        env:
+          PKG_VERSION: ${{ needs.publish.outputs.version }}
+          PKG_SHA256: ${{ needs.publish.outputs.sha256 }}
+        run: |
+          # Update version
+          sed -i "s/{% set version = \".*\" %}/{% set version = \"$PKG_VERSION\" %}/" bioconda/meta.yaml
+          # Update sha256
+          sed -i "s/sha256: .*/sha256: $PKG_SHA256/" bioconda/meta.yaml
+          # Update plotly version requirement
+          sed -i "s/plotly >=5.0.0/plotly >=5.15.0/" bioconda/meta.yaml
+          # Add new dependencies if missing
+          if ! grep -q "pydantic" bioconda/meta.yaml; then
+            sed -i '/adjusttext/a\    - pydantic >=2.0.0' bioconda/meta.yaml
+          fi
+          if ! grep -q "requests" bioconda/meta.yaml; then
+            sed -i '/pydantic/a\    - requests >=2.25.0' bioconda/meta.yaml
+          fi
+          if ! grep -q "tqdm" bioconda/meta.yaml; then
+            sed -i '/requests/a\    - tqdm >=4.60.0' bioconda/meta.yaml
+          fi
+          cat bioconda/meta.yaml
+      - name: Create Pull Request
+        uses: peter-evans/create-pull-request@v6
+        with:
+          token: ${{ secrets.GITHUB_TOKEN }}
+          commit-message: "chore: update bioconda recipe for new release"
+          branch: bioconda-update
+          title: "Update bioconda recipe"
+          body: |
+            Automated update of bioconda/meta.yaml after PyPI release.
+            **Next steps:**
+            1. Review this PR
+            2. Merge to main
+            3. Copy `bioconda/meta.yaml` to your fork of bioconda-recipes
+            4. Submit PR to bioconda-recipes
+          labels: |
+            bioconda
+            automated

{pylocuszoom-0.5.0 → pylocuszoom-0.8.0}/.gitignore RENAMED Viewed

@@ -26,3 +26,5 @@ htmlcov/
 # Project instructions (private)
 CLAUDE.md
+docs/plans/
+.planning/

pylocuszoom-0.8.0/.pre-commit-config.yaml ADDED Viewed

@@ -0,0 +1,17 @@
+repos:
+  - repo: https://github.com/astral-sh/ruff-pre-commit
+    rev: v0.9.1
+    hooks:
+      - id: ruff
+        args: [--fix]
+      - id: ruff-format
+  - repo: local
+    hooks:
+      - id: pytest-cov
+        name: pytest with coverage
+        entry: uv run python -m pytest -q
+        language: system
+        types: [python]
+        pass_filenames: false
+        always_run: true

{pylocuszoom-0.5.0 → pylocuszoom-0.8.0}/CHANGELOG.md RENAMED Viewed

@@ -7,6 +7,69 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 ## [Unreleased]
+## [0.8.0] - 2026-01-28
+### Added
+- `set_yticks()` backend method for consistent y-axis labels across all backends
+- Shared `convert_latex_to_unicode()` utility for interactive backends
+- Automatic gene annotation fetching from Ensembl REST API (`auto_genes=True`)
+- `get_genes_for_region()` function to fetch genes from Ensembl with disk caching
+- `fetch_genes_from_ensembl()` and `fetch_exons_from_ensembl()` low-level API functions
+- `clear_ensembl_cache()` utility to clear cached Ensembl data
+- Support for human, mouse, rat, and any Ensembl species
+- Retry logic with exponential backoff for Ensembl API resilience
+- 5Mb region size validation (Ensembl API limit)
+- `DataFrameValidator` builder class for consistent validation across modules
+- `filter_by_region()` shared utility for chromosome/position filtering
+- `HoverDataBuilder` for constructing hover tooltips across backends
+- Backend capability system with `supports_*` properties for feature detection
+- Backend registration system with `get_backend()` and automatic fallback
+- Pre-commit hook for pytest with coverage enforcement (70% minimum)
+### Changed
+- Forest plot example now uses odds ratios with `null_value=1.0` (more representative)
+- PheWAS and forest plot y-axis labels now work correctly in Plotly and Bokeh backends
+- Gene track styling: arrows now 75% height and 10% wider for better proportions
+- Gene track labels increased from 5.5pt to 7pt for improved readability
+- Migrated eQTL, finemapping, phewas, and forest validation to `DataFrameValidator`
+- Plotter now uses capability-based dispatch instead of backend name checks
+- Removed empty `__init__` methods from backend classes
+- Removed unused matplotlib imports from plotter (now backend-agnostic)
+### Fixed
+- `load_gwas()` now forwards `**kwargs` to format-specific loaders
+- Forest plot validator now checks that effect and CI columns are numeric
+- PheWAS validator now checks that p-values are numeric and within (0, 1] range
+### Security
+- Tar extraction now includes path traversal protection for recombination map downloads
+## [0.7.0] - 2026-01-27
+## [0.6.0] - 2026-01-27
+### Added
+- `plot_phewas()` method for phenome-wide association study plots
+- `plot_forest()` method for forest plots (meta-analysis visualization)
+- PheWAS category color palette with 12 distinct colors
+- Forest plot and PheWAS validation utilities
+- Backend methods: `axvline()`, `hbar()`, `errorbar_h()` for new plot types
+- Example plots for PheWAS and forest plots
+- Progress bars (tqdm) for recombination map and liftover chain downloads
+- `requests` and `tqdm` as core dependencies for reliable downloads with progress
+- `pytest-randomly` and `pytest-xdist` as dev dependencies for test randomization and parallel execution
+### Changed
+- Bumped minimum Plotly version to 5.15.0 (required for multiple legends feature)
+- eQTL loaders now output `effect_size` column instead of `effect` for plotter compatibility
+- Download functions now use `requests` with streaming and progress bars instead of `urllib`
+### Fixed
+- SAIGE loader now prefers SPA-adjusted p-values (`p.value.NA`) over raw p-values when both present
+- BED loader now handles BED12 format and files with more than 6 columns
+- eQTL panel in `plot_stacked()` now filters by chromosome in addition to position
+- Validation errors for non-numeric p-values or positions now show clear "must be numeric" message instead of runtime errors
 ## [0.5.0] - 2026-01-27
 ### Added
@@ -118,7 +181,10 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 - bokeh >= 3.8.2
 - kaleido >= 0.2.0
-[Unreleased]: https://github.com/michael-denyer/pyLocusZoom/compare/v0.5.0...HEAD
+[Unreleased]: https://github.com/michael-denyer/pyLocusZoom/compare/v0.8.0...HEAD
+[0.8.0]: https://github.com/michael-denyer/pyLocusZoom/compare/v0.7.0...v0.8.0
+[0.7.0]: https://github.com/michael-denyer/pyLocusZoom/compare/v0.6.0...v0.7.0
+[0.6.0]: https://github.com/michael-denyer/pyLocusZoom/compare/v0.5.0...v0.6.0
 [0.5.0]: https://github.com/michael-denyer/pyLocusZoom/compare/v0.4.0...v0.5.0
 [0.4.0]: https://github.com/michael-denyer/pyLocusZoom/compare/v0.3.0...v0.4.0
 [0.3.0]: https://github.com/michael-denyer/pyLocusZoom/compare/v0.2.0...v0.3.0

{pylocuszoom-0.5.0 → pylocuszoom-0.8.0}/PKG-INFO RENAMED Viewed

@@ -1,15 +1,15 @@
 Metadata-Version: 2.4
 Name: pylocuszoom
-Version: 0.5.0
+Version: 0.8.0
 Summary: Publication-ready regional association plots with LD coloring, gene tracks, and recombination overlays
 Project-URL: Homepage, https://github.com/michael-denyer/pylocuszoom
 Project-URL: Documentation, https://github.com/michael-denyer/pylocuszoom#readme
 Project-URL: Repository, https://github.com/michael-denyer/pylocuszoom
-Author: Michael Denyer
+Author-email: Michael Denyer <code.denyer@gmail.com>
 License-Expression: GPL-3.0-or-later
 License-File: LICENSE.md
 Keywords: genetics,gwas,locus-zoom,locuszoom,regional-plot,visualization
-Classifier: Development Status :: 3 - Alpha
+Classifier: Development Status :: 4 - Beta
 Classifier: Intended Audience :: Science/Research
 Classifier: License :: OSI Approved :: GNU General Public License v3 or later (GPLv3+)
 Classifier: Programming Language :: Python :: 3
@@ -26,13 +26,17 @@ Requires-Dist: loguru>=0.7.0
 Requires-Dist: matplotlib>=3.5.0
 Requires-Dist: numpy>=1.21.0
 Requires-Dist: pandas>=1.4.0
-Requires-Dist: plotly>=5.0.0
+Requires-Dist: plotly>=5.15.0
 Requires-Dist: pydantic>=2.0.0
 Requires-Dist: pyliftover>=0.4
+Requires-Dist: requests>=2.25.0
+Requires-Dist: tqdm>=4.60.0
 Provides-Extra: all
 Requires-Dist: pyspark>=3.0.0; extra == 'all'
 Provides-Extra: dev
 Requires-Dist: pytest-cov>=4.0.0; extra == 'dev'
+Requires-Dist: pytest-randomly>=3.0.0; extra == 'dev'
+Requires-Dist: pytest-xdist>=3.0.0; extra == 'dev'
 Requires-Dist: pytest>=7.0.0; extra == 'dev'
 Requires-Dist: ruff>=0.1.0; extra == 'dev'
 Provides-Extra: spark
@@ -40,20 +44,18 @@ Requires-Dist: pyspark>=3.0.0; extra == 'spark'
 Description-Content-Type: text/markdown
 [![CI](https://github.com/michael-denyer/pyLocusZoom/actions/workflows/ci.yml/badge.svg)](https://github.com/michael-denyer/pyLocusZoom/actions/workflows/ci.yml)
-[![codecov](https://codecov.io/gh/michael-denyer/pyLocusZoom/graph/badge.svg)](https://codecov.io/gh/michael-denyer/pyLocusZoom)
 [![PyPI](https://img.shields.io/pypi/v/pylocuszoom)](https://pypi.org/project/pylocuszoom/)
-[![Bioconda](https://img.shields.io/conda/vn/bioconda/pylocuszoom)](https://anaconda.org/bioconda/pylocuszoom)
 [![License: GPL v3](https://img.shields.io/badge/License-GPLv3-red.svg)](https://www.gnu.org/licenses/gpl-3.0)
 [![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)
 [![Ruff](https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/astral-sh/ruff/main/assets/badge/v2.json)](https://github.com/astral-sh/ruff)
 [![Matplotlib](https://img.shields.io/badge/Matplotlib-3.5+-11557c.svg)](https://matplotlib.org/)
-[![Plotly](https://img.shields.io/badge/Plotly-5.0+-3F4F75.svg)](https://plotly.com/python/)
+[![Plotly](https://img.shields.io/badge/Plotly-5.15+-3F4F75.svg)](https://plotly.com/python/)
 [![Bokeh](https://img.shields.io/badge/Bokeh-3.8+-E6526F.svg)](https://bokeh.org/)
 [![Pandas](https://img.shields.io/badge/Pandas-1.4+-150458.svg)](https://pandas.pydata.org/)
 <img src="logo.svg" alt="pyLocusZoom logo" width="120" align="right">
 # pyLocusZoom
-Publication-ready regional association plots with LD coloring, gene tracks, and recombination overlays.
+Designed for publication-ready GWAS visualization with regional association plots, gene tracks, eQTL, PheWAS, fine-mapping, and forest plots.
 Inspired by [LocusZoom](http://locuszoom.org/) and [locuszoomr](https://github.com/myles-lewis/locuszoomr).
@@ -64,18 +66,22 @@ Inspired by [LocusZoom](http://locuszoom.org/) and [locuszoomr](https://github.c
     - **Multi-species support**: Built-in reference data for *Canis lupus familiaris* (CanFam3.1/CanFam4) and *Felis catus* (FelCat9), or optionally provide your own for any species
     - **LD coloring**: SNPs colored by linkage disequilibrium (R²) with lead variant
     - **Gene tracks**: Annotated gene/exon positions below the association plot
-    - **Recombination rate**: Overlay showing recombination rate across region (*Canis lupus familiaris* only)
-    - **SNP labels (matplotlib)**: Automatic labeling of lead SNPs with RS ID
-    - **Tooltips (Bokeh and Plotly)**: Mouseover for detailed SNP data
+    - **Recombination rate**: Optional overlay across region (*Canis lupus familiaris* built-in, not shown in example image)
+    - **SNP labels (matplotlib)**: Automatic labeling of top SNPs by p-value (RS IDs)
+    - **Hover tooltips (Plotly and Bokeh)**: Detailed SNP data on hover
-![Example regional association plot](examples/regional_plot.png)
+![Example regional association plot with LD coloring and gene track](examples/regional_plot.png)
+*Regional association plot with LD coloring, gene/exon track, and top SNP labels (recombination overlay disabled in example).*
 2. **Stacked plots**: Compare multiple GWAS/phenotypes vertically
 3. **eQTL plot**: Expression QTL data aligned with association plots and gene tracks
 4. **Fine-mapping plots**: Visualize SuSiE credible sets with posterior inclusion probabilities
-5. **Multiple charting libraries**: matplotlib (static), plotly (interactive), bokeh (dashboards)
-6. **Pandas and PySpark support**: Works with both Pandas and PySpark DataFrames for large-scale genomics data
-7. **Convenience data file loaders**: Load and validate common GWAS, eQTL and fine-mapping file formats
+5. **PheWAS plots**: Phenome-wide association study visualization across multiple phenotypes
+6. **Forest plots**: Meta-analysis effect size visualization with confidence intervals
+7. **Multiple backends**: matplotlib (publication-ready), plotly (interactive), bokeh (dashboard integration)
+8. **Pandas and PySpark support**: Works with both Pandas and PySpark DataFrames for large-scale genomics data
+9. **Convenience data file loaders**: Load and validate common GWAS, eQTL and fine-mapping file formats
+10. **Automatic gene annotations**: Fetch gene/exon data from Ensembl REST API with caching (human, mouse, rat, canine, feline, and any Ensembl species)
 ## Installation
@@ -175,28 +181,46 @@ fig = plotter.plot(
 )
 ```
+## Automatic Gene Annotations
+pyLocusZoom can automatically fetch gene annotations from Ensembl for any species:
+```python
+# Enable automatic gene fetching
+plotter = LocusZoomPlotter(species="human", auto_genes=True)
+# No need to provide genes_df - fetched automatically
+fig = plotter.plot(gwas_df, chrom=13, start=32000000, end=33000000)
+```
+Supported species aliases: `human`, `mouse`, `rat`, `canine`/`dog`, `feline`/`cat`, or any Ensembl species name.
+Data is cached locally for fast subsequent plots. Maximum region size is 5Mb (Ensembl API limit).
 ## Backends
-pyLocusZoom supports multiple rendering backends:
+pyLocusZoom supports multiple rendering backends (set at initialization):
 ```python
 # Static publication-quality plot (default)
-fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000, backend="matplotlib")
+plotter = LocusZoomPlotter(species="canine", backend="matplotlib")
+fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000)
 fig.savefig("plot.png", dpi=150)
 # Interactive Plotly (hover tooltips, pan/zoom)
-fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000, backend="plotly")
+plotter = LocusZoomPlotter(species="canine", backend="plotly")
+fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000)
 fig.write_html("plot.html")
 # Interactive Bokeh (dashboard-ready)
-fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000, backend="bokeh")
+plotter = LocusZoomPlotter(species="canine", backend="bokeh")
+fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000)
 ```
 | Backend | Output | Best For | Features |
 |---------|--------|----------|----------|
-| `matplotlib` | Static PNG/PDF/SVG | Publications, presentations | Full feature set with SNP labels |
-| `plotly` | Interactive HTML | Web reports, data exploration | Hover tooltips, pan/zoom |
-| `bokeh` | Interactive HTML | Dashboards, web apps | Hover tooltips, pan/zoom |
+| `matplotlib` | Static PNG/PDF/SVG | Publication-ready figures | Full feature set with SNP labels |
+| `plotly` | Interactive HTML | Web reports, exploration | Hover tooltips, pan/zoom |
+| `bokeh` | Interactive HTML | Dashboard integration | Hover tooltips, pan/zoom |
 > **Note:** All backends support scatter plots, gene tracks, recombination overlay, and LD legend. SNP labels (auto-positioned with adjustText) are matplotlib-only; interactive backends use hover tooltips instead.
@@ -215,7 +239,8 @@ fig = plotter.plot_stacked(
 )
 ```
-![Example stacked plot](examples/stacked_plot.png)
+![Example stacked plot comparing two phenotypes](examples/stacked_plot.png)
+*Stacked plot comparing two phenotypes with LD coloring and shared gene track.*
 ## eQTL Overlay
@@ -238,6 +263,7 @@ fig = plotter.plot_stacked(
 ```
 ![Example eQTL overlay plot](examples/eqtl_overlay.png)
+*eQTL overlay with effect direction (up/down triangles) and magnitude binning.*
 ## Fine-mapping Visualization
@@ -260,19 +286,62 @@ fig = plotter.plot_stacked(
 ```
 ![Example fine-mapping plot](examples/finemapping_plot.png)
+*Fine-mapping visualization with PIP line and credible set coloring (CS1/CS2).*
+## PheWAS Plots
+Visualize associations of a single variant across multiple phenotypes:
+```python
+phewas_df = pd.DataFrame({
+    "phenotype": ["Height", "BMI", "T2D", "CAD", "HDL"],
+    "p_value": [1e-15, 0.05, 1e-8, 1e-3, 1e-10],
+    "category": ["Anthropometric", "Anthropometric", "Metabolic", "Cardiovascular", "Lipids"],
+})
+fig = plotter.plot_phewas(
+    phewas_df,
+    variant_id="rs12345",
+    category_col="category",
+)
+```
+![Example PheWAS plot](examples/phewas_plot.png)
+*PheWAS plot showing associations across phenotype categories with significance threshold.*
+## Forest Plots
+Create forest plots for meta-analysis visualization:
+```python
+forest_df = pd.DataFrame({
+    "study": ["Study A", "Study B", "Study C", "Meta-analysis"],
+    "effect": [0.45, 0.52, 0.38, 0.46],
+    "ci_lower": [0.30, 0.35, 0.20, 0.40],
+    "ci_upper": [0.60, 0.69, 0.56, 0.52],
+    "weight": [25, 35, 20, 100],
+})
+fig = plotter.plot_forest(
+    forest_df,
+    variant_id="rs12345",
+    weight_col="weight",
+)
+```
+![Example forest plot](examples/forest_plot.png)
+*Forest plot with effect sizes, confidence intervals, and weight-proportional markers.*
 ## PySpark Support
-For large-scale genomics data, pass PySpark DataFrames directly:
+For large-scale genomics data, convert PySpark DataFrames with `to_pandas()` before plotting:
 ```python
 from pylocuszoom import LocusZoomPlotter, to_pandas
-# PySpark DataFrame (automatically converted)
-fig = plotter.plot(spark_gwas_df, chrom=1, start=1000000, end=2000000)
-# Or convert manually with sampling for very large data
+# Convert PySpark DataFrame (optionally sampled for very large data)
 pandas_df = to_pandas(spark_gwas_df, sample_size=100000)
+fig = plotter.plot(pandas_df, chrom=1, start=1000000, end=2000000)
 ```
 Install PySpark support: `uv add pylocuszoom[spark]`

{pylocuszoom-0.5.0 → pylocuszoom-0.8.0}/README.md RENAMED Viewed

@@ -1,18 +1,16 @@
 [![CI](https://github.com/michael-denyer/pyLocusZoom/actions/workflows/ci.yml/badge.svg)](https://github.com/michael-denyer/pyLocusZoom/actions/workflows/ci.yml)
-[![codecov](https://codecov.io/gh/michael-denyer/pyLocusZoom/graph/badge.svg)](https://codecov.io/gh/michael-denyer/pyLocusZoom)
 [![PyPI](https://img.shields.io/pypi/v/pylocuszoom)](https://pypi.org/project/pylocuszoom/)
-[![Bioconda](https://img.shields.io/conda/vn/bioconda/pylocuszoom)](https://anaconda.org/bioconda/pylocuszoom)
 [![License: GPL v3](https://img.shields.io/badge/License-GPLv3-red.svg)](https://www.gnu.org/licenses/gpl-3.0)
 [![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)
 [![Ruff](https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/astral-sh/ruff/main/assets/badge/v2.json)](https://github.com/astral-sh/ruff)
 [![Matplotlib](https://img.shields.io/badge/Matplotlib-3.5+-11557c.svg)](https://matplotlib.org/)
-[![Plotly](https://img.shields.io/badge/Plotly-5.0+-3F4F75.svg)](https://plotly.com/python/)
+[![Plotly](https://img.shields.io/badge/Plotly-5.15+-3F4F75.svg)](https://plotly.com/python/)
 [![Bokeh](https://img.shields.io/badge/Bokeh-3.8+-E6526F.svg)](https://bokeh.org/)
 [![Pandas](https://img.shields.io/badge/Pandas-1.4+-150458.svg)](https://pandas.pydata.org/)
 <img src="logo.svg" alt="pyLocusZoom logo" width="120" align="right">
 # pyLocusZoom
-Publication-ready regional association plots with LD coloring, gene tracks, and recombination overlays.
+Designed for publication-ready GWAS visualization with regional association plots, gene tracks, eQTL, PheWAS, fine-mapping, and forest plots.
 Inspired by [LocusZoom](http://locuszoom.org/) and [locuszoomr](https://github.com/myles-lewis/locuszoomr).
@@ -23,18 +21,22 @@ Inspired by [LocusZoom](http://locuszoom.org/) and [locuszoomr](https://github.c
     - **Multi-species support**: Built-in reference data for *Canis lupus familiaris* (CanFam3.1/CanFam4) and *Felis catus* (FelCat9), or optionally provide your own for any species
     - **LD coloring**: SNPs colored by linkage disequilibrium (R²) with lead variant
     - **Gene tracks**: Annotated gene/exon positions below the association plot
-    - **Recombination rate**: Overlay showing recombination rate across region (*Canis lupus familiaris* only)
-    - **SNP labels (matplotlib)**: Automatic labeling of lead SNPs with RS ID
-    - **Tooltips (Bokeh and Plotly)**: Mouseover for detailed SNP data
+    - **Recombination rate**: Optional overlay across region (*Canis lupus familiaris* built-in, not shown in example image)
+    - **SNP labels (matplotlib)**: Automatic labeling of top SNPs by p-value (RS IDs)
+    - **Hover tooltips (Plotly and Bokeh)**: Detailed SNP data on hover
-![Example regional association plot](examples/regional_plot.png)
+![Example regional association plot with LD coloring and gene track](examples/regional_plot.png)
+*Regional association plot with LD coloring, gene/exon track, and top SNP labels (recombination overlay disabled in example).*
 2. **Stacked plots**: Compare multiple GWAS/phenotypes vertically
 3. **eQTL plot**: Expression QTL data aligned with association plots and gene tracks
 4. **Fine-mapping plots**: Visualize SuSiE credible sets with posterior inclusion probabilities
-5. **Multiple charting libraries**: matplotlib (static), plotly (interactive), bokeh (dashboards)
-6. **Pandas and PySpark support**: Works with both Pandas and PySpark DataFrames for large-scale genomics data
-7. **Convenience data file loaders**: Load and validate common GWAS, eQTL and fine-mapping file formats
+5. **PheWAS plots**: Phenome-wide association study visualization across multiple phenotypes
+6. **Forest plots**: Meta-analysis effect size visualization with confidence intervals
+7. **Multiple backends**: matplotlib (publication-ready), plotly (interactive), bokeh (dashboard integration)
+8. **Pandas and PySpark support**: Works with both Pandas and PySpark DataFrames for large-scale genomics data
+9. **Convenience data file loaders**: Load and validate common GWAS, eQTL and fine-mapping file formats
+10. **Automatic gene annotations**: Fetch gene/exon data from Ensembl REST API with caching (human, mouse, rat, canine, feline, and any Ensembl species)
 ## Installation
@@ -134,28 +136,46 @@ fig = plotter.plot(
 )
 ```
+## Automatic Gene Annotations
+pyLocusZoom can automatically fetch gene annotations from Ensembl for any species:
+```python
+# Enable automatic gene fetching
+plotter = LocusZoomPlotter(species="human", auto_genes=True)
+# No need to provide genes_df - fetched automatically
+fig = plotter.plot(gwas_df, chrom=13, start=32000000, end=33000000)
+```
+Supported species aliases: `human`, `mouse`, `rat`, `canine`/`dog`, `feline`/`cat`, or any Ensembl species name.
+Data is cached locally for fast subsequent plots. Maximum region size is 5Mb (Ensembl API limit).
 ## Backends
-pyLocusZoom supports multiple rendering backends:
+pyLocusZoom supports multiple rendering backends (set at initialization):
 ```python
 # Static publication-quality plot (default)
-fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000, backend="matplotlib")
+plotter = LocusZoomPlotter(species="canine", backend="matplotlib")
+fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000)
 fig.savefig("plot.png", dpi=150)
 # Interactive Plotly (hover tooltips, pan/zoom)
-fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000, backend="plotly")
+plotter = LocusZoomPlotter(species="canine", backend="plotly")
+fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000)
 fig.write_html("plot.html")
 # Interactive Bokeh (dashboard-ready)
-fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000, backend="bokeh")
+plotter = LocusZoomPlotter(species="canine", backend="bokeh")
+fig = plotter.plot(gwas_df, chrom=1, start=1000000, end=2000000)
 ```
 | Backend | Output | Best For | Features |
 |---------|--------|----------|----------|
-| `matplotlib` | Static PNG/PDF/SVG | Publications, presentations | Full feature set with SNP labels |
-| `plotly` | Interactive HTML | Web reports, data exploration | Hover tooltips, pan/zoom |
-| `bokeh` | Interactive HTML | Dashboards, web apps | Hover tooltips, pan/zoom |
+| `matplotlib` | Static PNG/PDF/SVG | Publication-ready figures | Full feature set with SNP labels |
+| `plotly` | Interactive HTML | Web reports, exploration | Hover tooltips, pan/zoom |
+| `bokeh` | Interactive HTML | Dashboard integration | Hover tooltips, pan/zoom |
 > **Note:** All backends support scatter plots, gene tracks, recombination overlay, and LD legend. SNP labels (auto-positioned with adjustText) are matplotlib-only; interactive backends use hover tooltips instead.
@@ -174,7 +194,8 @@ fig = plotter.plot_stacked(
 )
 ```
-![Example stacked plot](examples/stacked_plot.png)
+![Example stacked plot comparing two phenotypes](examples/stacked_plot.png)
+*Stacked plot comparing two phenotypes with LD coloring and shared gene track.*
 ## eQTL Overlay
@@ -197,6 +218,7 @@ fig = plotter.plot_stacked(
 ```
 ![Example eQTL overlay plot](examples/eqtl_overlay.png)
+*eQTL overlay with effect direction (up/down triangles) and magnitude binning.*
 ## Fine-mapping Visualization
@@ -219,19 +241,62 @@ fig = plotter.plot_stacked(
 ```
 ![Example fine-mapping plot](examples/finemapping_plot.png)
+*Fine-mapping visualization with PIP line and credible set coloring (CS1/CS2).*
+## PheWAS Plots
+Visualize associations of a single variant across multiple phenotypes:
+```python
+phewas_df = pd.DataFrame({
+    "phenotype": ["Height", "BMI", "T2D", "CAD", "HDL"],
+    "p_value": [1e-15, 0.05, 1e-8, 1e-3, 1e-10],
+    "category": ["Anthropometric", "Anthropometric", "Metabolic", "Cardiovascular", "Lipids"],
+})
+fig = plotter.plot_phewas(
+    phewas_df,
+    variant_id="rs12345",
+    category_col="category",
+)
+```
+![Example PheWAS plot](examples/phewas_plot.png)
+*PheWAS plot showing associations across phenotype categories with significance threshold.*
+## Forest Plots
+Create forest plots for meta-analysis visualization:
+```python
+forest_df = pd.DataFrame({
+    "study": ["Study A", "Study B", "Study C", "Meta-analysis"],
+    "effect": [0.45, 0.52, 0.38, 0.46],
+    "ci_lower": [0.30, 0.35, 0.20, 0.40],
+    "ci_upper": [0.60, 0.69, 0.56, 0.52],
+    "weight": [25, 35, 20, 100],
+})
+fig = plotter.plot_forest(
+    forest_df,
+    variant_id="rs12345",
+    weight_col="weight",
+)
+```
+![Example forest plot](examples/forest_plot.png)
+*Forest plot with effect sizes, confidence intervals, and weight-proportional markers.*
 ## PySpark Support
-For large-scale genomics data, pass PySpark DataFrames directly:
+For large-scale genomics data, convert PySpark DataFrames with `to_pandas()` before plotting:
 ```python
 from pylocuszoom import LocusZoomPlotter, to_pandas
-# PySpark DataFrame (automatically converted)
-fig = plotter.plot(spark_gwas_df, chrom=1, start=1000000, end=2000000)
-# Or convert manually with sampling for very large data
+# Convert PySpark DataFrame (optionally sampled for very large data)
 pandas_df = to_pandas(spark_gwas_df, sample_size=100000)
+fig = plotter.plot(pandas_df, chrom=1, start=1000000, end=2000000)
 ```
 Install PySpark support: `uv add pylocuszoom[spark]`

pylocuszoom 0.5.0__tar.gz → 0.8.0__tar.gz

pylocuszoom 0.5.0tar.gz → 0.8.0tar.gz