npm - stemit-cli - Versions diffs - 1.0.1 → 1.0.2 - Mend

stemit-cli 1.0.1 → 1.0.2

This diff compares the publicly released contents of two versions of the package as they appear in the registry. It is provided for informational purposes only.

Files changed (2)

package/README.md +40 -0
package/package.json +1 -1

package/README.md CHANGED

@@ -34,6 +34,7 @@
     - [Skip BPM/key analysis](#skip-bpmkey-analysis)
     - [All options](#all-options)
   - [Available Models](#available-models)
+  - [Separation Accuracy](#separation-accuracy)
   - [Output Structure](#output-structure)
   - [How It Works](#how-it-works)
   - [First-Run Setup](#first-run-setup)
@@ -259,6 +260,45 @@ Options:
 ---
+## Separation Accuracy
+stemit uses [Demucs](https://github.com/facebookresearch/demucs) — one of the highest-rated open-source source separation models, consistently ranking at the top of the [Music Demixing Challenge](https://mdx-workshop.github.io/) leaderboards.
+**What to expect:**
+| Genre / Instrument | Typical Quality |
+|---|---|
+| Vocals (pop/rock) | Excellent — clean isolation with minimal bleed |
+| Drums | Very good — kick, snare, and cymbals well preserved |
+| Bass | Good — works best when bass is prominent in the mix |
+| Guitar / Piano (`htdemucs_6s`) | Moderate — depends heavily on how prominent the instrument is |
+| Electronic / heavily layered music | Lower — harder to separate tightly mixed synths |
+| Vocals (rap/spoken word) | Good — works well when vocals are dry or lightly processed |
+**Factors that affect quality:**
+- **Production style** — heavily compressed or layered mixes are harder to separate
+- **Frequency overlap** — instruments sharing the same frequency range (e.g. bass guitar and kick drum) bleed into each other
+- **Reverb / effects** — wet, heavily reverbed sources are harder to isolate cleanly
+- **Model choice** — `htdemucs_ft` gives the best overall quality; `mdx_extra` is specifically tuned for vocals
+**Benchmark scores (SDR — Signal-to-Distortion Ratio, higher is better):**
+The `htdemucs_ft` model achieves approximately:
+| Stem | SDR |
+|---|---|
+| Vocals | ~8.4 dB |
+| Drums | ~8.6 dB |
+| Bass | ~8.8 dB |
+| Other | ~5.8 dB |
+> SDR is a standard metric for source separation. Scores above 6 dB are considered good; above 8 dB is excellent. For reference, an SDR of 0 means the output is no better than silence.
+These scores are competitive with commercial stem separation tools and are state-of-the-art for open-source models. Results on real-world music may vary.
+---
 ## Output Structure
 ```
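
Note on the SDR figures in the README hunk above: the remark that 0 dB is "no better than silence" follows directly from the basic energy-ratio definition of SDR, 10 * log10(sum(ref^2) / sum((ref - est)^2)). The sketch below illustrates that definition only. The helper name `sdrDb` and the sample arrays are hypothetical and are not part of stemit-cli or Demucs, and published benchmark numbers are typically computed with variants of this formula (for example, aggregated over chunks and tracks), so this will not reproduce the table values.

```javascript
// Hypothetical helper for illustration; not part of stemit-cli or Demucs.
// Computes SDR in dB using the basic energy-ratio definition:
//   SDR = 10 * log10( sum(ref^2) / sum((ref - est)^2) )
function sdrDb(reference, estimate) {
  if (reference.length !== estimate.length) {
    throw new RangeError("reference and estimate must have the same length");
  }
  let signalEnergy = 0;
  let errorEnergy = 0;
  for (let i = 0; i < reference.length; i++) {
    const diff = reference[i] - estimate[i];
    signalEnergy += reference[i] * reference[i];
    errorEnergy += diff * diff;
  }
  // A perfect estimate has zero error energy, which yields Infinity.
  return 10 * Math.log10(signalEnergy / errorEnergy);
}

// Made-up sample values standing in for one isolated stem.
const reference = [0.5, -0.5, 0.25, -0.25];

// An all-zero (silent) estimate: the error equals the reference itself,
// so the energy ratio is 1 and the SDR is 0 dB.
const silence = [0, 0, 0, 0];
console.log(sdrDb(reference, silence)); // 0

// An estimate that is 90% of the reference: the error energy is about
// 1% of the signal energy, so the SDR is close to 20 dB.
const scaled = reference.map((x) => 0.9 * x);
console.log(sdrDb(reference, scaled).toFixed(1));
```

Because the scale is logarithmic, each additional 10 dB corresponds to a tenfold reduction in error energy relative to the reference stem.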

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "stemit-cli",
-  "version": "1.0.1",
+  "version": "1.0.2",
   "description": "CLI tool to split audio into stems (vocals/drums/bass/other), analyze BPM & key, and mute/solo tracks",
   "type": "module",
   "bin": {