PyPI - XspecT - Versions diffs - 0.5.0__tar.gz → 0.5.1__tar.gz - Mend

XspecT 0.5.0tar.gz → 0.5.1tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Potentially problematic release.

This version of XspecT might be problematic. Click here for more details.

Files changed (119) hide show

{xspect-0.5.0 → xspect-0.5.1}/.github/workflows/test.yml RENAMED Viewed

@@ -28,6 +28,8 @@ jobs:
           run: |
             xspect models download
         - name: Test with pytest
+          env:
+            NCBI_API_KEY: ${{ secrets.NCBI_API_KEY }}
           run: |
             pytest --cov --retries 2 --retry-delay 5
         - name: Upload coverage reports to Codecov

{xspect-0.5.0 → xspect-0.5.1}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: XspecT
-Version: 0.5.0
+Version: 0.5.1
 Summary: Tool to monitor and characterize pathogens using Bloom filters.
 License: MIT License
@@ -55,7 +55,7 @@ Requires-Dist: pytest-retry; extra == "test"
 Requires-Dist: httpx; extra == "test"
 Dynamic: license-file
-# XspecT - Acinetobacter Species Assignment Tool
+# XspecT
 <!-- start intro -->
 ![Test](https://github.com/bionf/xspect2/actions/workflows/test.yml/badge.svg)
 [![linting: pylint](https://img.shields.io/badge/linting-pylint-yellowgreen)](https://github.com/pylint-dev/pylint)
@@ -63,7 +63,7 @@ Dynamic: license-file
 XspecT is a Python-based tool to taxonomically classify sequence-reads (or assembled genomes) on the species and/or MLST level using [kmer indices] and a [Support Vector Machine].
-XspecT utilizes the uniqueness of kmers and compares extracted kmers from the input-data to a reference database. Bloom Filter ensure a fast lookup in this process. For a final prediction, the results are classified using a Support Vector Machine.
+XspecT utilizes the uniqueness of kmers and compares extracted kmers from the input-data to a kmer index. Probablistic data structures ensure a fast lookup in this process. For a final prediction, the results are classified using a Support Vector Machine.
 The tool is available as a web-based application and as a command line interface.
@@ -91,16 +91,22 @@ xspect models train ncbi
 ```
 ### How to run the web app
-To run the web app, install and run [XspecT Web](https://github.com/aromberg/xspect-web). Additionally, run XspecT in API mode:
+To run the web app, simply execute:
 ```
 xspect web
 ```
+This will start a local web server. You can access the web app by navigating to `http://localhost:8000` in your web browser.
 ### How to use the XspecT command line interface
-Run XspecT with the configuration you want to run it with as arguments.
+To use the XspecT command line interface, execute `xspect` with the desired subcommand and parameters.
+**Example**:
 ```
 xspect classify species
 ```
+If you do not provide the required parameters, the command line interface will prompt you for them.
 For further instructions on how to use the command line interface, please refer to the [documentation] or execute:
 ```
 xspect --help

{xspect-0.5.0 → xspect-0.5.1}/README.md RENAMED Viewed

@@ -1,4 +1,4 @@
-# XspecT - Acinetobacter Species Assignment Tool
+# XspecT
 <!-- start intro -->
 ![Test](https://github.com/bionf/xspect2/actions/workflows/test.yml/badge.svg)
 [![linting: pylint](https://img.shields.io/badge/linting-pylint-yellowgreen)](https://github.com/pylint-dev/pylint)
@@ -6,7 +6,7 @@
 XspecT is a Python-based tool to taxonomically classify sequence-reads (or assembled genomes) on the species and/or MLST level using [kmer indices] and a [Support Vector Machine].
-XspecT utilizes the uniqueness of kmers and compares extracted kmers from the input-data to a reference database. Bloom Filter ensure a fast lookup in this process. For a final prediction, the results are classified using a Support Vector Machine.
+XspecT utilizes the uniqueness of kmers and compares extracted kmers from the input-data to a kmer index. Probablistic data structures ensure a fast lookup in this process. For a final prediction, the results are classified using a Support Vector Machine.
 The tool is available as a web-based application and as a command line interface.
@@ -34,16 +34,22 @@ xspect models train ncbi
 ```
 ### How to run the web app
-To run the web app, install and run [XspecT Web](https://github.com/aromberg/xspect-web). Additionally, run XspecT in API mode:
+To run the web app, simply execute:
 ```
 xspect web
 ```
+This will start a local web server. You can access the web app by navigating to `http://localhost:8000` in your web browser.
 ### How to use the XspecT command line interface
-Run XspecT with the configuration you want to run it with as arguments.
+To use the XspecT command line interface, execute `xspect` with the desired subcommand and parameters.
+**Example**:
 ```
 xspect classify species
 ```
+If you do not provide the required parameters, the command line interface will prompt you for them.
 For further instructions on how to use the command line interface, please refer to the [documentation] or execute:
 ```
 xspect --help

{xspect-0.5.0 → xspect-0.5.1}/pyproject.toml RENAMED Viewed

@@ -1,6 +1,6 @@
 [project]
 name = "XspecT"
-version = "0.5.0"
+version = "0.5.1"
 description = "Tool to monitor and characterize pathogens using Bloom filters."
 readme = {file = "README.md", content-type = "text/markdown"}
 license = {file = "LICENSE"}

{xspect-0.5.0 → xspect-0.5.1}/src/XspecT.egg-info/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: XspecT
-Version: 0.5.0
+Version: 0.5.1
 Summary: Tool to monitor and characterize pathogens using Bloom filters.
 License: MIT License
@@ -55,7 +55,7 @@ Requires-Dist: pytest-retry; extra == "test"
 Requires-Dist: httpx; extra == "test"
 Dynamic: license-file
-# XspecT - Acinetobacter Species Assignment Tool
+# XspecT
 <!-- start intro -->
 ![Test](https://github.com/bionf/xspect2/actions/workflows/test.yml/badge.svg)
 [![linting: pylint](https://img.shields.io/badge/linting-pylint-yellowgreen)](https://github.com/pylint-dev/pylint)
@@ -63,7 +63,7 @@ Dynamic: license-file
 XspecT is a Python-based tool to taxonomically classify sequence-reads (or assembled genomes) on the species and/or MLST level using [kmer indices] and a [Support Vector Machine].
-XspecT utilizes the uniqueness of kmers and compares extracted kmers from the input-data to a reference database. Bloom Filter ensure a fast lookup in this process. For a final prediction, the results are classified using a Support Vector Machine.
+XspecT utilizes the uniqueness of kmers and compares extracted kmers from the input-data to a kmer index. Probablistic data structures ensure a fast lookup in this process. For a final prediction, the results are classified using a Support Vector Machine.
 The tool is available as a web-based application and as a command line interface.
@@ -91,16 +91,22 @@ xspect models train ncbi
 ```
 ### How to run the web app
-To run the web app, install and run [XspecT Web](https://github.com/aromberg/xspect-web). Additionally, run XspecT in API mode:
+To run the web app, simply execute:
 ```
 xspect web
 ```
+This will start a local web server. You can access the web app by navigating to `http://localhost:8000` in your web browser.
 ### How to use the XspecT command line interface
-Run XspecT with the configuration you want to run it with as arguments.
+To use the XspecT command line interface, execute `xspect` with the desired subcommand and parameters.
+**Example**:
 ```
 xspect classify species
 ```
+If you do not provide the required parameters, the command line interface will prompt you for them.
 For further instructions on how to use the command line interface, please refer to the [documentation] or execute:
 ```
 xspect --help

xspect-0.5.1/src/xspect/classify.py ADDED Viewed

@@ -0,0 +1,67 @@
+from pathlib import Path
+from xspect.mlst_feature.mlst_helper import pick_scheme_from_models_dir
+import xspect.model_management as mm
+from xspect.models.probabilistic_filter_mlst_model import (
+    ProbabilisticFilterMlstSchemeModel,
+)
+from xspect.definitions import fasta_endings, fastq_endings
+def classify_genus(
+    model_genus: str, input_path: Path, output_path: Path, step: int = 1
+):
+    """Classify the input file using the genus model."""
+    model = mm.get_genus_model(model_genus)
+    input_paths = []
+    input_is_dir = input_path.is_dir()
+    ending_wildcards = [f"*.{ending}" for ending in fasta_endings + fastq_endings]
+    if input_is_dir:
+        input_paths = [p for e in ending_wildcards for p in input_path.glob(e)]
+    elif input_path.is_file():
+        input_paths = [input_path]
+    for idx, current_path in enumerate(input_paths):
+        result = model.predict(current_path, step=step)
+        result.input_source = current_path.name
+        output_name = (
+            f"{output_path.stem}_{idx+1}{output_path.suffix}"
+            if input_is_dir
+            else output_path.name
+        )
+        result.save(output_path.parent / output_name)
+        print(f"Saved result as {output_name}")
+def classify_species(model_genus, input_path, output_path, step=1):
+    """Classify the input file using the species model."""
+    model = mm.get_species_model(model_genus)
+    input_paths = []
+    input_is_dir = input_path.is_dir()
+    ending_wildcards = [f"*.{ending}" for ending in fasta_endings + fastq_endings]
+    if input_is_dir:
+        input_paths = [p for e in ending_wildcards for p in input_path.glob(e)]
+    elif input_path.is_file():
+        input_paths = [input_path]
+    for idx, current_path in enumerate(input_paths):
+        result = model.predict(current_path, step=step)
+        result.input_source = current_path.name
+        output_name = (
+            f"{output_path.stem}_{idx+1}{output_path.suffix}"
+            if input_is_dir
+            else output_path.name
+        )
+        result.save(output_path.parent / output_name)
+        print(f"Saved result as {output_name}")
+def classify_mlst(input_path, output_path):
+    """Classify the input file using the MLST model."""
+    scheme_path = pick_scheme_from_models_dir()
+    model = ProbabilisticFilterMlstSchemeModel.load(scheme_path)
+    result = model.predict(scheme_path, input_path)
+    result.save(output_path)

{xspect-0.5.0 → xspect-0.5.1}/src/xspect/definitions.py RENAMED Viewed

@@ -9,9 +9,17 @@ fastq_endings = ["fastq", "fq"]
 def get_xspect_root_path():
     """Return the root path for XspecT data."""
-    root_path = Path(getcwd()) / "xspect-data"
-    root_path.mkdir(exist_ok=True, parents=True)
-    return root_path
+    home_based_dir = Path.home() / "xspect-data"
+    if home_based_dir.exists():
+        return home_based_dir
+    cwd_based_dir = Path(getcwd()) / "xspect-data"
+    if cwd_based_dir.exists():
+        return cwd_based_dir
+    home_based_dir.mkdir(exist_ok=True, parents=True)
+    return home_based_dir
 def get_xspect_model_path():

xspect-0.5.1/src/xspect/filter_sequences.py ADDED Viewed

@@ -0,0 +1,138 @@
+from pathlib import Path
+from xspect.model_management import get_genus_model, get_species_model
+from xspect.file_io import filter_sequences
+from xspect.definitions import fasta_endings, fastq_endings
+def filter_species(
+    model_genus: str,
+    model_species: str,
+    input_path: Path,
+    output_path: Path,
+    threshold: float,
+    classification_output_path: Path | None = None,
+):
+    """Filter sequences by species.
+    This function filters sequences from the input file based on the species model.
+    It uses the genus model to identify the genus of the sequences and then applies
+    the species model to filter the sequences.
+    Args:
+        model_genus (str): The genus model slug.
+        model_species (str): The species model slug.
+        input_path (Path): The path to the input file containing sequences.
+        output_path (Path): The path to the output file where filtered sequences will be saved.
+            above this threshold will be included in the output file. A threshold of -1 will
+            include only sequences if the species score is the highest among the
+            available species scores.
+        classification_output_path (Path): Optional path to save the classification results.
+        threshold (float): The threshold for filtering sequences. Only sequences with a score
+            above this threshold will be included in the output file. A threshold of -1 will
+            include only sequences if the species score is the highest among the
+            available species scores.
+    """
+    species_model = get_species_model(model_genus)
+    input_paths = []
+    input_is_dir = input_path.is_dir()
+    ending_wildcards = [f"*.{ending}" for ending in fasta_endings + fastq_endings]
+    if input_is_dir:
+        input_paths = [p for e in ending_wildcards for p in input_path.glob(e)]
+    elif input_path.is_file():
+        input_paths = [input_path]
+    for idx, current_path in enumerate(input_paths):
+        result = species_model.predict(current_path)
+        result.input_source = current_path.name
+        if classification_output_path:
+            classification_output_name = (
+                f"{classification_output_path.stem}_{idx+1}{classification_output_path.suffix}"
+                if input_is_dir
+                else classification_output_path.name
+            )
+            result.save(classification_output_path.parent / classification_output_name)
+            print(
+                f"Saved classification results from {current_path.name} as {classification_output_name}"
+            )
+        included_ids = result.get_filtered_subsequence_labels(model_species, threshold)
+        if not included_ids:
+            print(f"No sequences found for the given species in {current_path.name}.")
+            continue
+        output_name = (
+            f"{output_path.stem}_{idx+1}{output_path.suffix}"
+            if input_is_dir
+            else output_path.name
+        )
+        filter_sequences(
+            current_path,
+            output_path.parent / output_name,
+            included_ids,
+        )
+        print(f"Saved filtered sequences from {current_path.name} as {output_name}")
+def filter_genus(
+    model_genus: str,
+    input_path: Path,
+    output_path: Path,
+    threshold: float,
+    classification_output_path: Path | None = None,
+):
+    """Filter sequences by genus.
+    This function filters sequences from the input file based on the genus model.
+    It uses the genus model to identify the genus of the sequences and then applies
+    the filtering based on the provided threshold.
+    Args:
+        model_genus (str): The genus model slug.
+        input_path (Path): The path to the input file containing sequences.
+        output_path (Path): The path to the output file where filtered sequences will be saved.
+        threshold (float): The threshold for filtering sequences. Only sequences with a score
+            above this threshold will be included in the output file.
+        classification_output_path (Path): Optional path to save the classification results.
+    """
+    genus_model = get_genus_model(model_genus)
+    input_paths = []
+    input_is_dir = input_path.is_dir()
+    ending_wildcards = [f"*.{ending}" for ending in fasta_endings + fastq_endings]
+    if input_is_dir:
+        input_paths = [p for e in ending_wildcards for p in input_path.glob(e)]
+    elif input_path.is_file():
+        input_paths = [input_path]
+    for idx, current_path in enumerate(input_paths):
+        result = genus_model.predict(current_path)
+        result.input_source = current_path.name
+        if classification_output_path:
+            classification_output_name = (
+                f"{classification_output_path.stem}_{idx+1}{classification_output_path.suffix}"
+                if input_is_dir
+                else classification_output_path.name
+            )
+            result.save(classification_output_path.parent / classification_output_name)
+            print(
+                f"Saved classification results from {current_path.name} as {classification_output_name}"
+            )
+        included_ids = result.get_filtered_subsequence_labels(model_genus, threshold)
+        if not included_ids:
+            print(f"No sequences found for the given genus in {current_path.name}.")
+            continue
+        output_name = (
+            f"{output_path.stem}_{idx+1}{output_path.suffix}"
+            if input_is_dir
+            else output_path.name
+        )
+        filter_sequences(
+            current_path,
+            output_path.parent / output_name,
+            included_ids,
+        )
+        print(f"Saved filtered sequences from {current_path.name} as {output_name}")

{xspect-0.5.0 → xspect-0.5.1}/src/xspect/main.py RENAMED Viewed

@@ -7,12 +7,12 @@ import uvicorn
 from xspect import classify
 from xspect.web import app
 from xspect.download_models import download_test_models
-from xspect.file_io import filter_sequences
+from xspect import filter_sequences
 from xspect.train import train_from_directory, train_from_ncbi
 from xspect.definitions import (
     get_xspect_model_path,
 )
-from xspect.mlst_feature.mlst_helper import pick_scheme, pick_scheme_from_models_dir
+from xspect.mlst_feature.mlst_helper import pick_scheme
 from xspect.mlst_feature.pub_mlst_handler import PubMLSTHandler
 from xspect.models.probabilistic_filter_mlst_model import (
     ProbabilisticFilterMlstSchemeModel,
@@ -211,19 +211,19 @@ def classify_seqs():
     help="Path to FASTA or FASTQ file for classification.",
     type=click.Path(exists=True, dir_okay=True, file_okay=True),
     prompt=True,
+    default=Path("."),
 )
 @click.option(
     "-o",
     "--output-path",
     help="Path to the output file.",
-    type=click.Path(dir_okay=True, file_okay=True),
+    type=click.Path(dir_okay=False, file_okay=True),
     default=Path(".") / f"result_{uuid4()}.json",
 )
 def classify_genus(model_genus, input_path, output_path):
     """Classify samples using a genus model."""
     click.echo("Classifying...")
     classify.classify_genus(model_genus, Path(input_path), Path(output_path))
-    click.echo(f"Result saved as {output_path}.")
 @classify_seqs.command(
@@ -244,12 +244,13 @@ def classify_genus(model_genus, input_path, output_path):
     help="Path to FASTA or FASTQ file for classification.",
     type=click.Path(exists=True, dir_okay=True, file_okay=True),
     prompt=True,
+    default=Path("."),
 )
 @click.option(
     "-o",
     "--output-path",
     help="Path to the output file.",
-    type=click.Path(dir_okay=True, file_okay=True),
+    type=click.Path(dir_okay=False, file_okay=True),
     default=Path(".") / f"result_{uuid4()}.json",
 )
 @click.option(
@@ -264,7 +265,6 @@ def classify_species(model_genus, input_path, output_path, sparse_sampling_step)
     classify.classify_species(
         model_genus, Path(input_path), Path(output_path), sparse_sampling_step
     )
-    click.echo(f"Result saved as {output_path}.")
 @classify_seqs.command(
@@ -275,15 +275,14 @@ def classify_species(model_genus, input_path, output_path, sparse_sampling_step)
     "-i",
     "--input-path",
     help="Path to FASTA-file for mlst identification.",
-    type=click.Path(exists=True, dir_okay=True, file_okay=True),
+    type=click.Path(exists=True, dir_okay=False, file_okay=True),
     prompt=True,
 )
 @click.option(
     "-o",
     "--output-path",
     help="Path to the output file.",
-    type=click.Path(dir_okay=True, file_okay=True),
-    default=Path(".") / f"result_{uuid4()}.json",
+    type=click.Path(dir_okay=False, file_okay=True),
 )
 def classify_mlst(input_path, output_path):
     """MLST classify a sample."""
@@ -321,37 +320,42 @@ def filter_seqs():
     help="Path to FASTA or FASTQ file for classification.",
     type=click.Path(exists=True, dir_okay=True, file_okay=True),
     prompt=True,
+    default=Path("."),
 )
 @click.option(
     "-o",
     "--output-path",
     help="Path to the output file.",
-    type=click.Path(dir_okay=True, file_okay=True),
+    type=click.Path(dir_okay=False, file_okay=True),
     prompt=True,
+    default=Path(".") / f"genus_filtered_{uuid4()}.fasta",
 )
 @click.option(
+    "--classification-output-path",
+    help="Optional path to the classification output file.",
+    type=click.Path(dir_okay=False, file_okay=True),
+)
+@click.option(
+    "-t",
     "--threshold",
-    type=float,
+    type=click.FloatRange(0, 1),
     help="Threshold for filtering (default: 0.7).",
     default=0.7,
     prompt=True,
 )
-def filter_genus(model_genus, input_path, output_path, threshold):
+def filter_genus(
+    model_genus, input_path, output_path, classification_output_path, threshold
+):
     """Filter samples using a genus model."""
     click.echo("Filtering...")
-    genus_model = get_genus_model(model_genus)
-    result = genus_model.predict(Path(input_path))
-    included_ids = result.get_filtered_subsequence_labels(model_genus, threshold)
-    if not included_ids:
-        click.echo("No sequences found for the given genus.")
-        return
-    filter_sequences(
+    filter_sequences.filter_genus(
+        model_genus,
         Path(input_path),
         Path(output_path),
-        included_ids=included_ids,
+        threshold,
+        Path(classification_output_path) if classification_output_path else None,
     )
-    click.echo(f"Filtered sequences saved at {output_path}.")
 @filter_seqs.command(
@@ -378,24 +382,44 @@ def filter_genus(model_genus, input_path, output_path, threshold):
     help="Path to FASTA or FASTQ file for classification.",
     type=click.Path(exists=True, dir_okay=True, file_okay=True),
     prompt=True,
+    default=Path("."),
 )
 @click.option(
     "-o",
     "--output-path",
     help="Path to the output file.",
-    type=click.Path(dir_okay=True, file_okay=True),
+    type=click.Path(dir_okay=False, file_okay=True),
     prompt=True,
+    default=Path(".") / f"species_filtered_{uuid4()}.fasta",
+)
+@click.option(
+    "--classification-output-path",
+    help="Optional path to the classification output file.",
+    type=click.Path(dir_okay=False, file_okay=True),
 )
 @click.option(
+    "-t",
     "--threshold",
     type=float,
     help="Threshold for filtering (default: 0.7). Use -1 to filter for the highest scoring species.",
     default=0.7,
     prompt=True,
 )
-def filter_species(model_genus, model_species, input_path, output_path, threshold):
+def filter_species(
+    model_genus,
+    model_species,
+    input_path,
+    output_path,
+    threshold,
+    classification_output_path,
+):
     """Filter a sample using the species model."""
+    if threshold != -1 and (threshold < 0 or threshold > 1):
+        raise click.BadParameter(
+            "Threshold must be between 0 and 1, or -1 for filtering by the highest scoring species."
+        )
     available_species = get_model_metadata(f"{model_genus}-species")["display_names"]
     available_species = {
         id: name.replace(f"{model_genus} ", "")
@@ -420,18 +444,14 @@ def filter_species(model_genus, model_species, input_path, output_path, threshol
     ][0]
     click.echo("Filtering...")
-    species_model = get_species_model(model_genus)
-    result = species_model.predict(Path(input_path))
-    included_ids = result.get_filtered_subsequence_labels(model_species, threshold)
-    if not included_ids:
-        click.echo("No sequences found for the given species.")
-        return
-    filter_sequences(
+    filter_sequences.filter_species(
+        model_genus,
+        model_species,
         Path(input_path),
         Path(output_path),
-        included_ids=included_ids,
+        threshold,
+        Path(classification_output_path) if classification_output_path else None,
     )
-    click.echo(f"Filtered sequences saved at {output_path}.")
 if __name__ == "__main__":

{xspect-0.5.0 → xspect-0.5.1}/src/xspect/models/probabilistic_filter_model.py RENAMED Viewed

@@ -20,13 +20,13 @@ class ProbabilisticFilterModel:
         self,
         k: int,
         model_display_name: str,
-        author: str,
-        author_email: str,
+        author: str | None,
+        author_email: str | None,
         model_type: str,
         base_path: Path,
         fpr: float = 0.01,
         num_hashes: int = 7,
-        training_accessions: dict[str, list[str]] = None,
+        training_accessions: dict[str, list[str]] | None = None,
     ) -> None:
         if k < 1:
             raise ValueError("Invalid k value, must be greater than 0")
@@ -49,7 +49,7 @@ class ProbabilisticFilterModel:
         self.index = None
         self.training_accessions = training_accessions
-    def get_cobs_index_path(self) -> Path:
+    def get_cobs_index_path(self) -> str:
         """Returns the path to the cobs index"""
         return str(self.base_path / self.slug() / "index.cobs_classic")
@@ -76,8 +76,8 @@ class ProbabilisticFilterModel:
     def fit(
         self,
         dir_path: Path,
-        display_names: dict = None,
-        training_accessions: dict[str, list[str]] = None,
+        display_names: dict | None = None,
+        training_accessions: dict[str, list[str]] | None = None,
     ) -> None:
         """Adds filters to the model"""
@@ -123,7 +123,7 @@ class ProbabilisticFilterModel:
         self.index = cobs.Search(self.get_cobs_index_path(), True)
     def calculate_hits(
-        self, sequence: Seq, filter_ids: list[str] = None, step: int = 1
+        self, sequence: Seq, filter_ids: list[str] | None = None, step: int = 1
     ) -> dict:
         """Calculates the hits for a sequence"""

{xspect-0.5.0 → xspect-0.5.1}/src/xspect/models/probabilistic_filter_svm_model.py RENAMED Viewed

@@ -21,16 +21,16 @@ class ProbabilisticFilterSVMModel(ProbabilisticFilterModel):
         self,
         k: int,
         model_display_name: str,
-        author: str,
-        author_email: str,
+        author: str | None,
+        author_email: str | None,
         model_type: str,
         base_path: Path,
         kernel: str,
         c: float,
         fpr: float = 0.01,
         num_hashes: int = 7,
-        training_accessions: dict[str, list[str]] = None,
-        svm_accessions: dict[str, list[str]] = None,
+        training_accessions: dict[str, list[str]] | None = None,
+        svm_accessions: dict[str, list[str]] | None = None,
     ) -> None:
         super().__init__(
             k=k,
@@ -64,10 +64,10 @@ class ProbabilisticFilterSVMModel(ProbabilisticFilterModel):
         self,
         dir_path: Path,
         svm_path: Path,
-        display_names: dict = None,
+        display_names: dict[str, str] | None = None,
         svm_step: int = 1,
-        training_accessions: list[str] = None,
-        svm_accessions: list[str] = None,
+        training_accessions: dict[str, list[str]] | None = None,
+        svm_accessions: dict[str, list[str]] | None = None,
     ) -> None:
         """Fit the SVM to the sequences and labels"""

XspecT 0.5.0__tar.gz → 0.5.1__tar.gz

Potentially problematic release.

XspecT 0.5.0tar.gz → 0.5.1tar.gz