marm-behavior 0.1.0__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (44) hide show
  1. marm_behavior-0.1.0/LICENSE +21 -0
  2. marm_behavior-0.1.0/PKG-INFO +378 -0
  3. marm_behavior-0.1.0/README.md +332 -0
  4. marm_behavior-0.1.0/marm_behavior/__init__.py +25 -0
  5. marm_behavior-0.1.0/marm_behavior/__main__.py +435 -0
  6. marm_behavior-0.1.0/marm_behavior/_data_files.py +458 -0
  7. marm_behavior-0.1.0/marm_behavior/_tf_quiet.py +71 -0
  8. marm_behavior-0.1.0/marm_behavior/data/README.txt +60 -0
  9. marm_behavior-0.1.0/marm_behavior/data/__init__.py +1 -0
  10. marm_behavior-0.1.0/marm_behavior/data/nn_reference/README.txt +57 -0
  11. marm_behavior-0.1.0/marm_behavior/depths/__init__.py +0 -0
  12. marm_behavior-0.1.0/marm_behavior/depths/depths_1.py +291 -0
  13. marm_behavior-0.1.0/marm_behavior/dlc_inference.py +369 -0
  14. marm_behavior-0.1.0/marm_behavior/el_to_csv.py +131 -0
  15. marm_behavior-0.1.0/marm_behavior/extract/__init__.py +0 -0
  16. marm_behavior-0.1.0/marm_behavior/extract/extract_1.py +331 -0
  17. marm_behavior-0.1.0/marm_behavior/extract/extract_2.py +666 -0
  18. marm_behavior-0.1.0/marm_behavior/extract/extract_3.py +285 -0
  19. marm_behavior-0.1.0/marm_behavior/features/__init__.py +0 -0
  20. marm_behavior-0.1.0/marm_behavior/features/labels.py +1300 -0
  21. marm_behavior-0.1.0/marm_behavior/io/__init__.py +0 -0
  22. marm_behavior-0.1.0/marm_behavior/io/csv_io.py +43 -0
  23. marm_behavior-0.1.0/marm_behavior/io/mat_io.py +249 -0
  24. marm_behavior-0.1.0/marm_behavior/nn_postprocess.py +945 -0
  25. marm_behavior-0.1.0/marm_behavior/numerics/__init__.py +0 -0
  26. marm_behavior-0.1.0/marm_behavior/numerics/helpers.py +284 -0
  27. marm_behavior-0.1.0/marm_behavior/numerics/hull.py +252 -0
  28. marm_behavior-0.1.0/marm_behavior/pipeline/__init__.py +0 -0
  29. marm_behavior-0.1.0/marm_behavior/pipeline/orchestrators.py +508 -0
  30. marm_behavior-0.1.0/marm_behavior/process/__init__.py +0 -0
  31. marm_behavior-0.1.0/marm_behavior/process/postures.py +153 -0
  32. marm_behavior-0.1.0/marm_behavior/process/process_1.py +99 -0
  33. marm_behavior-0.1.0/marm_behavior/process/process_2.py +292 -0
  34. marm_behavior-0.1.0/marm_behavior/process/process_3.py +323 -0
  35. marm_behavior-0.1.0/marm_behavior/process/process_4.py +502 -0
  36. marm_behavior-0.1.0/marm_behavior/run.py +525 -0
  37. marm_behavior-0.1.0/marm_behavior.egg-info/PKG-INFO +378 -0
  38. marm_behavior-0.1.0/marm_behavior.egg-info/SOURCES.txt +42 -0
  39. marm_behavior-0.1.0/marm_behavior.egg-info/dependency_links.txt +1 -0
  40. marm_behavior-0.1.0/marm_behavior.egg-info/entry_points.txt +2 -0
  41. marm_behavior-0.1.0/marm_behavior.egg-info/requires.txt +23 -0
  42. marm_behavior-0.1.0/marm_behavior.egg-info/top_level.txt +1 -0
  43. marm_behavior-0.1.0/pyproject.toml +117 -0
  44. marm_behavior-0.1.0/setup.cfg +4 -0
@@ -0,0 +1,21 @@
1
+ MIT License
2
+
3
+ Copyright (c) 2026 William Menegas
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in all
13
+ copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21
+ SOFTWARE.
@@ -0,0 +1,378 @@
1
+ Metadata-Version: 2.1
2
+ Name: marm_behavior
3
+ Version: 0.1.0
4
+ Summary: Multi-animal marmoset behavioral analysis pipeline (DeepLabCut pose tracking + LSTM encoder + openTSNE clustering)
5
+ Author-email: William Menegas <william.s.menegas@gmail.com>
6
+ Maintainer-email: William Menegas <william.s.menegas@gmail.com>
7
+ License: MIT
8
+ Project-URL: Homepage, https://github.com/williammenegas/marm_behavior
9
+ Project-URL: Repository, https://github.com/williammenegas/marm_behavior
10
+ Project-URL: Issues, https://github.com/williammenegas/marm_behavior/issues
11
+ Project-URL: Reference Data, https://huggingface.co/datasets/williammenegas/data
12
+ Keywords: neuroscience,behavior,marmoset,primate,deeplabcut,pose-estimation,tsne,behavioral-clustering,computational-ethology
13
+ Classifier: Development Status :: 4 - Beta
14
+ Classifier: Intended Audience :: Science/Research
15
+ Classifier: License :: OSI Approved :: MIT License
16
+ Classifier: Operating System :: OS Independent
17
+ Classifier: Programming Language :: Python :: 3
18
+ Classifier: Programming Language :: Python :: 3.8
19
+ Classifier: Programming Language :: Python :: 3.9
20
+ Classifier: Programming Language :: Python :: 3.10
21
+ Classifier: Programming Language :: Python :: 3.11
22
+ Classifier: Topic :: Scientific/Engineering
23
+ Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
24
+ Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
25
+ Requires-Python: >=3.8
26
+ Description-Content-Type: text/markdown
27
+ License-File: LICENSE
28
+ Requires-Dist: numpy>=1.22
29
+ Requires-Dist: scipy>=1.9
30
+ Requires-Dist: h5py>=3.6
31
+ Requires-Dist: huggingface_hub>=0.20
32
+ Provides-Extra: video
33
+ Requires-Dist: opencv-python-headless>=4.5; extra == "video"
34
+ Provides-Extra: imageio
35
+ Requires-Dist: imageio>=2.25; extra == "imageio"
36
+ Requires-Dist: imageio-ffmpeg>=0.4.7; extra == "imageio"
37
+ Provides-Extra: dlc
38
+ Requires-Dist: deeplabcut>=2.2; extra == "dlc"
39
+ Provides-Extra: nn
40
+ Requires-Dist: tensorflow>=2.5; extra == "nn"
41
+ Requires-Dist: openTSNE>=0.7; extra == "nn"
42
+ Requires-Dist: annoy>=1.17; extra == "nn"
43
+ Requires-Dist: scikit-learn>=1.0; extra == "nn"
44
+ Provides-Extra: fast-download
45
+ Requires-Dist: hf_xet; extra == "fast-download"
46
+
47
+ # marm_behavior
48
+
49
+ A six-stage multi-animal marmoset behavioral analysis pipeline. Takes a
50
+ video of four differently-marked marmosets in a stereo RGB+depth arena,
51
+ runs DeepLabCut pose estimation, extracts per-animal body-part tracks,
52
+ computes per-frame behavioral features, and projects the features into a
53
+ learned behavioral cluster space.
54
+
55
+ Everything is packaged so a single command runs the full pipeline:
56
+
57
+ ```bash
58
+ python -m marm_behavior path/to/video.avi
59
+ ```
60
+
61
+ For a detailed reference covering every command-line flag, see
62
+ [USER_GUIDE.md](USER_GUIDE.md).
63
+
64
+ ## Install
65
+
66
+ The fastest way is to use the bundled conda environment file. It pins
67
+ the exact ML stack (TensorFlow 2.9.1, openTSNE 0.6.2, scikit-learn
68
+ 1.1.2, DeepLabCut 2.2.0.6, NumPy 1.22.4) the canonical reference
69
+ outputs were produced with — important because the nn stage's t-SNE
70
+ output is sensitive to openTSNE's version. Full install takes ~10
71
+ minutes on a clean machine.
72
+
73
+ ### 1. Create the conda environment
74
+
75
+ If you don't have conda, install
76
+ [Miniconda](https://docs.conda.io/en/latest/miniconda.html) first.
77
+ Then from the repo root:
78
+
79
+ ```bash
80
+ conda env create -f env/deep_learning.yml
81
+ conda activate deep_learning
82
+ ```
83
+
84
+ This creates an env named `deep_learning` (matching the lab's
85
+ canonical name) with everything the pipeline needs.
86
+
87
+ ### 2. (GPU users only) Install CUDA + cuDNN
88
+
89
+ For TensorFlow 2.9.1, CUDA 11.2 + cuDNN 8.1 is the supported
90
+ combination:
91
+
92
+ ```bash
93
+ conda install -c conda-forge cudatoolkit=11.2 cudnn=8.1 -y
94
+ ```
95
+
96
+ Verify the GPU is visible to TensorFlow:
97
+
98
+ ```bash
99
+ python -c "import tensorflow as tf; print(tf.config.list_physical_devices('GPU'))"
100
+ ```
101
+
102
+ You should see at least one `PhysicalDevice(name='/physical_device:GPU:0', ...)`.
103
+ If the list is empty, the `dlc` stage will still run on CPU but will be
104
+ ~50× slower.
105
+
106
+ ### 3. Install the marm_behavior package
107
+
108
+ Either clone and run in-place:
109
+
110
+ ```bash
111
+ git clone <repo-url> marm_behavior
112
+ cd marm_behavior
113
+ python -m marm_behavior path/to/video.avi
114
+ ```
115
+
116
+ ...or install it editable:
117
+
118
+ ```bash
119
+ cd marm_behavior
120
+ pip install -e .
121
+ marm-behavior path/to/video.avi
122
+ ```
123
+
124
+ On the first invocation, the bundled data (DLC model ~128 MB, NN
125
+ encoder ~2 MB, NN reference set ~190 MB) downloads from the Hugging
126
+ Face Hub and is cached under `~/.cache/huggingface/hub/`. Every
127
+ subsequent run reuses the cache.
128
+
129
+ ### 4. (Optional) Override the NN reference data
130
+
131
+ The default reference set is fetched from
132
+ [`williammenegas/data`](https://huggingface.co/datasets/williammenegas/data)
133
+ on first run. You only need to override this if you want to use your
134
+ own reference set (e.g. for a different cohort) — in that case, pass
135
+ `--nn-reference-dir /path/to/your/reference/folder`. See the
136
+ [Reference files for the NN stage](#reference-files-for-the-nn-stage)
137
+ section below.
138
+
139
+ ### Verify the install
140
+
141
+ ```bash
142
+ python -m marm_behavior --help
143
+ ```
144
+
145
+ This should print the full usage banner with all the per-colour and
146
+ per-stage flags.
147
+
148
+ ## The six stages
149
+
150
+ | Stage | Input | Output |
151
+ |---|---|---|
152
+ | **dlc** | video | `*DLC_..._el.picklesingle.csv`, `*DLC_..._el.picklemulti.csv` |
153
+ | **extract** | DLC CSVs | `tracks_<video>.mat` (per-animal body-part tracks) |
154
+ | **process** | `tracks_*.mat` | `edges_<video>.mat` (per-animal edge matrices) |
155
+ | **depths** | `edges_*.mat` + video | `depths_<video>.mat` (pixel-depth lookups) |
156
+ | **labels** | `edges_*.mat` + `depths_*.mat` | `{w,b,r,y}_description_<video>.csv` (30 behavioral features per frame) |
157
+ | **nn** | description CSVs | `hcoord_{Red,White,Blue,Yellow}_<video>.csv` (2D t-SNE coords) and `hlabel_*_<video>.csv` (cluster labels) |
158
+
159
+ ## Common invocations
160
+
161
+ **Run everything with defaults:**
162
+ ```bash
163
+ python -m marm_behavior path/to/video.avi
164
+ ```
165
+
166
+ **Skip DLC if the CSVs already exist:**
167
+ ```bash
168
+ python -m marm_behavior video.avi --stages extract process depths labels nn
169
+ ```
170
+
171
+ **Re-run just the labels stage** against existing `edges_*.mat` / `depths_*.mat`:
172
+ ```bash
173
+ python -m marm_behavior video.avi --stages labels
174
+ ```
175
+
176
+ **Point the NN stage at a custom reference folder:**
177
+ ```bash
178
+ python -m marm_behavior video.avi --nn-reference-dir /path/to/references
179
+ ```
180
+
181
+ **Declare one colour absent:**
182
+ ```bash
183
+ python -m marm_behavior video.avi --no-yellow
184
+ ```
185
+
186
+ **Use your own DLC project** instead of the bundled one:
187
+ ```bash
188
+ python -m marm_behavior video.avi --dlc-config /path/to/your/config.yaml
189
+ ```
190
+
191
+ See `python -m marm_behavior --help` for every flag.
192
+
193
+ ## Bundled data
194
+
195
+ The runtime data marm_behavior needs — the trained DeepLabCut project
196
+ (~128 MB), the LSTM encoder (~2 MB), the canonical NN reference set
197
+ (~190 MB), and a `ground_normalized.npz` body-part cloud (~570 KB) — is
198
+ **not shipped in the wheel**. It's hosted on the Hugging Face Hub at
199
+ [`williammenegas/data`](https://huggingface.co/datasets/williammenegas/data)
200
+ and downloaded lazily on first use, then cached under
201
+ `~/.cache/huggingface/hub/`. The first pipeline run on a fresh machine
202
+ incurs a one-time ~310 MB download; every run after that is instant.
203
+
204
+ To pre-warm the cache (e.g. before going offline):
205
+
206
+ ```bash
207
+ python -m marm_behavior --prefetch-data
208
+ ```
209
+
210
+ For shared cluster installs where every user should hit the same
211
+ on-disk copy, set `MARM_BEHAVIOR_DATA_DIR=/shared/path/to/data`. The
212
+ helper checks that path before falling back to the per-user HF cache.
213
+
214
+ For lab-internal mirrors of the data repo, set
215
+ `MARM_BEHAVIOR_HF_REPO=your-org/your-mirror` to override the source
216
+ repo.
217
+
218
+ The lookup logic lives in `marm_behavior/_data_files.py` if you want
219
+ to read it.
220
+
221
+ ## Reference files for the NN stage
222
+
223
+ The NN stage projects video features into a stable behavioral cluster
224
+ space. The canonical reference data is **fetched from the Hugging Face
225
+ Hub** as part of the bundled-data download above, so the stage works
226
+ out of the box. You can ignore this section unless you want to use a
227
+ different reference set.
228
+
229
+ The bundled folder contains:
230
+
231
+ ```
232
+ out_inner_mean1.csv (256,) normalization mean
233
+ out_inner_std1.csv (256,) normalization std
234
+ tsne_temp1_1.csv (N, 2) reference 2D coords
235
+ dbscan_temp1_1.csv (N,) reference cluster labels
236
+ embedding_train_coords.npy (n_train, 2) training 2D coords
237
+ embedding_train_annoy.bin (~180 MB) cached annoy k-NN index
238
+ embedding_train_meta.json cache metadata
239
+ embedding_train_optimizer_gains.npy (n_train, 2) optimizer state
240
+ ```
241
+
242
+ The 3 GB `out_inner1.csv` (raw training latents) is **not** shipped —
243
+ it isn't needed at runtime because the cache files above already encode
244
+ everything `transform()` needs.
245
+
246
+ **Using your own reference set.** Pass `--nn-reference-dir /path/to/dir`.
247
+ The folder needs the four small CSVs at minimum. If the
248
+ `embedding_train_*` cache files are missing, the stage will fit
249
+ openTSNE from `out_inner1.csv` (which must be present in that case;
250
+ takes ~6 min) and write the cache for future runs.
251
+
252
+ **Bootstrap mode.** If you don't have any reference data at all and
253
+ want to experiment, pass `--nn-bootstrap` to generate everything from
254
+ the current video's own description CSVs. Bootstrap cluster IDs are
255
+ **not comparable** across videos or to canonical references, so this
256
+ mode is opt-in and intended for initial exploration only.
257
+
258
+ **Buddy chains.** The NN stage pairs each animal's behavioral features
259
+ with one other animal's features before encoding. The default pairings
260
+ are Red→Yellow, White→Blue, Blue→White, and Yellow→Red (with a
261
+ secondary fallback if the primary's description CSV isn't present).
262
+ Override per-animal with the `--<color>-buddy` flags — each takes one
263
+ or more short color keys (`r`, `w`, `b`, `y`) in preference order:
264
+
265
+ ```bash
266
+ # Always pair Red with Blue instead of Yellow:
267
+ python -m marm_behavior video.avi --red-buddy b
268
+
269
+ # Pair Yellow with White first, falling back to Blue:
270
+ python -m marm_behavior video.avi --yellow-buddy w b
271
+
272
+ # Multiple overrides at once:
273
+ python -m marm_behavior video.avi --red-buddy b --blue-buddy y r
274
+ ```
275
+
276
+ From Python, pass `nn_buddies={'r': ['b'], 'y': ['w', 'b']}` to
277
+ `marm_behavior.run()`.
278
+
279
+ ## One-animal mode
280
+
281
+ When exactly one of the four animals is marked present (via three
282
+ `--no-<color>` flags), the pipeline automatically switches into
283
+ **one-animal mode**. Four stages change behaviour:
284
+
285
+ 1. **extract stage** — every multi-CSV tracklet is assigned to the
286
+ focal animal regardless of which colour DLC's head classifier
287
+ predicted. This is the correct behaviour because DLC's head
288
+ classifier was trained on four-animal data and routinely
289
+ mislabels a single animal across colours; the four-animal
290
+ proximity-based assignment would otherwise scatter the focal
291
+ animal's tracklets across whichever absent colours happened to
292
+ get mislabelled. For each frame, the highest-quality tracklet
293
+ (most surviving body parts after the confidence threshold) is
294
+ picked, with ties broken on the lower track id.
295
+ 2. **process stage** — non-focal animals get a constant body-length
296
+ `bh = 30` instead of the per-frame movmedian + clamp +
297
+ forward-fill used in four-animal mode. The focal animal still
298
+ gets the full computation. This avoids producing meaningless
299
+ body-length estimates from F matrices that are all-NaN because
300
+ the colour isn't actually in the video.
301
+ 3. **depths stage** — only the focal animal's per-frame depth lookup
302
+ runs. The other three colours' inner loops are skipped entirely,
303
+ roughly 4× faster than four-animal mode for a typical video.
304
+ 4. **nn stage** — skipped automatically. The NN encoder pairs each
305
+ self animal with a buddy animal's behavioural features, and in
306
+ one-animal mode there is no buddy. Pass `--force-nn` to override
307
+ if you want to run the NN stage anyway (e.g. for bootstrapping
308
+ a one-animal-specific reference space).
309
+
310
+ The mode is detected automatically — there's no separate flag to
311
+ enable it. The CLI prints a clear banner up front so you can see
312
+ what's about to happen:
313
+
314
+ ```
315
+ $ python -m marm_behavior video.avi --no-white --no-blue --no-yellow
316
+ [marm_behavior] ONE-ANIMAL MODE: only Red present
317
+ [marm_behavior] extract: all multi-CSV tracklets are assigned to the focal animal (no proximity matching to colour-classified heads)
318
+ [marm_behavior] process: non-focal animals will use constant bh = 30
319
+ [marm_behavior] depths: per-frame lookup runs only for the focal animal
320
+ [marm_behavior] nn: stage will be skipped (no buddy animal available; pass force_nn=True / --force-nn to override)
321
+ [marm_behavior] dlc: ...
322
+ ```
323
+
324
+ When zero, two, three, or four animals are present, behaviour is
325
+ unchanged from the standard four-animal pipeline.
326
+
327
+
328
+
329
+ Everything is also available as a Python function:
330
+
331
+ ```python
332
+ from marm_behavior import run
333
+
334
+ result = run("path/to/video.avi")
335
+
336
+ print(result["stages_run"]) # ['dlc', 'extract', 'process', 'depths', 'labels', 'nn']
337
+ print(result["descriptions"]) # {'w': Path(...), 'b': Path(...), ...}
338
+ print(result["nn"]) # {'Red': (hcoord_path, hlabel_path), ...}
339
+ ```
340
+
341
+ See `help(marm_behavior.run)` for every parameter.
342
+
343
+ ## Dependencies
344
+
345
+ **Runtime** (always required):
346
+ - numpy ≥ 1.22, scipy ≥ 1.9, h5py ≥ 3.6
347
+
348
+ **Per-stage** (install the ones you need):
349
+
350
+ | Stage | Needs |
351
+ |---|---|
352
+ | dlc | `deeplabcut[tf]` or `deeplabcut[pytorch]` |
353
+ | depths | `opencv-python-headless` or `imageio` + `imageio-ffmpeg` |
354
+ | nn | `tensorflow`, `openTSNE`, `scikit-learn` |
355
+
356
+ The `extract`, `process`, and `labels` stages need only the core runtime
357
+ deps.
358
+
359
+ ## Layout
360
+
361
+ ```
362
+ marm_behavior/ <- the Python package
363
+ ├── run.py <- pipeline entry point
364
+ ├── __main__.py <- CLI
365
+ ├── dlc_inference.py <- DLC shell-out
366
+ ├── el_to_csv.py <- tracklet-pickle to CSV converter
367
+ ├── nn_postprocess.py <- NN stage
368
+ ├── io/ <- .mat and .csv I/O
369
+ ├── numerics/ <- hull, rolling reductions, NaN helpers
370
+ ├── extract/ <- body-part track extraction
371
+ ├── process/ <- posture and edge computation
372
+ ├── depths/ <- per-frame depth lookup
373
+ ├── features/ <- behavioral feature labelling
374
+ ├── pipeline/ <- batch orchestrators
375
+ └── data/ <- bundled models + canonical NN reference data
376
+ pyproject.toml
377
+ README.md
378
+ ```