PyamilySeq 0.5.0__tar.gz → 0.5.1__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {pyamilyseq-0.5.0/src/PyamilySeq.egg-info → pyamilyseq-0.5.1}/PKG-INFO +7 -7
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/README.md +6 -6
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/setup.cfg +1 -1
- pyamilyseq-0.5.1/src/PyamilySeq/Constants.py +2 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq/PyamilySeq.py +3 -3
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1/src/PyamilySeq.egg-info}/PKG-INFO +7 -7
- pyamilyseq-0.5.0/src/PyamilySeq/Constants.py +0 -2
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/LICENSE +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/pyproject.toml +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq/CD-Hit_StORF-Reporter_Cross-Genera_Builder.py +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq/PyamilySeq_Genus.py +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq/PyamilySeq_Species.py +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq/Seq_Combiner.py +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq/__init__.py +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq/utils.py +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq.egg-info/SOURCES.txt +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq.egg-info/dependency_links.txt +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq.egg-info/entry_points.txt +0 -0
- {pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq.egg-info/top_level.txt +0 -0
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.1
|
|
2
2
|
Name: PyamilySeq
|
|
3
|
-
Version: 0.5.
|
|
3
|
+
Version: 0.5.1
|
|
4
4
|
Summary: PyamilySeq - A a tool to look for sequence-based gene families identified by clustering methods such as CD-HIT, DIAMOND, BLAST or MMseqs2.
|
|
5
5
|
Home-page: https://github.com/NickJD/PyamilySeq
|
|
6
6
|
Author: Nicholas Dimonaco
|
|
@@ -47,7 +47,7 @@ usage: PyamilySeq.py [-h] -run_mode {Full,Partial} -group_mode {Species,Genus}
|
|
|
47
47
|
[-gpa GENE_PRESENCE_ABSENCE_OUT] [-verbose {True,False}]
|
|
48
48
|
[-v]
|
|
49
49
|
|
|
50
|
-
PyamilySeq v0.5.
|
|
50
|
+
PyamilySeq v0.5.1: PyamilySeq Run Parameters.
|
|
51
51
|
|
|
52
52
|
options:
|
|
53
53
|
-h, --help show this help message and exit
|
|
@@ -56,9 +56,9 @@ Required Arguments:
|
|
|
56
56
|
-run_mode {Full,Partial}
|
|
57
57
|
Run Mode: Should PyamilySeq be run in "Full" or
|
|
58
58
|
"Partial" mode?
|
|
59
|
-
-group_mode {Species
|
|
59
|
+
-group_mode {Species}
|
|
60
60
|
Group Mode: Should PyamilySeq be run in "Species" or
|
|
61
|
-
"Genus" mode?
|
|
61
|
+
"Genus" mode? - Genus mode not currently functioning
|
|
62
62
|
-clust_tool {CD-HIT} Clustering tool to use: CD-HIT, DIAMOND, BLAST or
|
|
63
63
|
MMseqs2.
|
|
64
64
|
-output_dir OUTPUT_DIR
|
|
@@ -119,14 +119,14 @@ Misc:
|
|
|
119
119
|
### Examples: Below are two examples of running PyamilySeq in its two main modes.
|
|
120
120
|
#### 'Full Mode': Will conduct clustering of sequences as part of PyamilySeq run
|
|
121
121
|
```bash
|
|
122
|
-
PyamilySeq -
|
|
122
|
+
PyamilySeq -run_mode Full -group_mode Species -output_dir ../../test_data/testing -input_type combined -input_dir .../test_data/genomes -name_split _combined.gff3 -pid 0.99 -len_diff 0.99 -clust_tool CD-HIT -gpa True -con True -w 99 -verbose True
|
|
123
123
|
```
|
|
124
124
|
#### 'Partial Mode': Will take the output of a sequence clustering
|
|
125
125
|
```bash
|
|
126
126
|
PyamilySeq -run_mode Partial -group_mode Species -output_dir .../test_data/testing -cluster_file .../test_data/CD-HIT/combined_Ensmbl_pep_CD_90_60.clstr -clust_tool CD-HIT -original_fasta .../test_data/combined_Ensmbl_cds.fasta -gpa True -con True -w 99 -verbose True
|
|
127
127
|
```
|
|
128
128
|
|
|
129
|
-
```
|
|
129
|
+
```bash
|
|
130
130
|
Calculating Groups
|
|
131
131
|
Gene Groups:
|
|
132
132
|
first_core_99: 3103
|
|
@@ -145,7 +145,7 @@ Seq-Combiner -input_dir .../test_data/genomes -name_split _combined.gff3 -output
|
|
|
145
145
|
```bash
|
|
146
146
|
usage: Seq_Combiner.py [-h] -input_dir INPUT_DIR -input_type {separate,combined} -name_split NAME_SPLIT -output_dir OUTPUT_DIR -output_name OUTPUT_FILE
|
|
147
147
|
|
|
148
|
-
Seq-Combiner v0.5.
|
|
148
|
+
Seq-Combiner v0.5.1: Seq-Combiner Run Parameters.
|
|
149
149
|
|
|
150
150
|
options:
|
|
151
151
|
-h, --help show this help message and exit
|
|
@@ -32,7 +32,7 @@ usage: PyamilySeq.py [-h] -run_mode {Full,Partial} -group_mode {Species,Genus}
|
|
|
32
32
|
[-gpa GENE_PRESENCE_ABSENCE_OUT] [-verbose {True,False}]
|
|
33
33
|
[-v]
|
|
34
34
|
|
|
35
|
-
PyamilySeq v0.5.
|
|
35
|
+
PyamilySeq v0.5.1: PyamilySeq Run Parameters.
|
|
36
36
|
|
|
37
37
|
options:
|
|
38
38
|
-h, --help show this help message and exit
|
|
@@ -41,9 +41,9 @@ Required Arguments:
|
|
|
41
41
|
-run_mode {Full,Partial}
|
|
42
42
|
Run Mode: Should PyamilySeq be run in "Full" or
|
|
43
43
|
"Partial" mode?
|
|
44
|
-
-group_mode {Species
|
|
44
|
+
-group_mode {Species}
|
|
45
45
|
Group Mode: Should PyamilySeq be run in "Species" or
|
|
46
|
-
"Genus" mode?
|
|
46
|
+
"Genus" mode? - Genus mode not currently functioning
|
|
47
47
|
-clust_tool {CD-HIT} Clustering tool to use: CD-HIT, DIAMOND, BLAST or
|
|
48
48
|
MMseqs2.
|
|
49
49
|
-output_dir OUTPUT_DIR
|
|
@@ -104,14 +104,14 @@ Misc:
|
|
|
104
104
|
### Examples: Below are two examples of running PyamilySeq in its two main modes.
|
|
105
105
|
#### 'Full Mode': Will conduct clustering of sequences as part of PyamilySeq run
|
|
106
106
|
```bash
|
|
107
|
-
PyamilySeq -
|
|
107
|
+
PyamilySeq -run_mode Full -group_mode Species -output_dir ../../test_data/testing -input_type combined -input_dir .../test_data/genomes -name_split _combined.gff3 -pid 0.99 -len_diff 0.99 -clust_tool CD-HIT -gpa True -con True -w 99 -verbose True
|
|
108
108
|
```
|
|
109
109
|
#### 'Partial Mode': Will take the output of a sequence clustering
|
|
110
110
|
```bash
|
|
111
111
|
PyamilySeq -run_mode Partial -group_mode Species -output_dir .../test_data/testing -cluster_file .../test_data/CD-HIT/combined_Ensmbl_pep_CD_90_60.clstr -clust_tool CD-HIT -original_fasta .../test_data/combined_Ensmbl_cds.fasta -gpa True -con True -w 99 -verbose True
|
|
112
112
|
```
|
|
113
113
|
|
|
114
|
-
```
|
|
114
|
+
```bash
|
|
115
115
|
Calculating Groups
|
|
116
116
|
Gene Groups:
|
|
117
117
|
first_core_99: 3103
|
|
@@ -130,7 +130,7 @@ Seq-Combiner -input_dir .../test_data/genomes -name_split _combined.gff3 -output
|
|
|
130
130
|
```bash
|
|
131
131
|
usage: Seq_Combiner.py [-h] -input_dir INPUT_DIR -input_type {separate,combined} -name_split NAME_SPLIT -output_dir OUTPUT_DIR -output_name OUTPUT_FILE
|
|
132
132
|
|
|
133
|
-
Seq-Combiner v0.5.
|
|
133
|
+
Seq-Combiner v0.5.1: Seq-Combiner Run Parameters.
|
|
134
134
|
|
|
135
135
|
options:
|
|
136
136
|
-h, --help show this help message and exit
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
[metadata]
|
|
2
2
|
name = PyamilySeq
|
|
3
|
-
version = v0.5.
|
|
3
|
+
version = v0.5.1
|
|
4
4
|
author = Nicholas Dimonaco
|
|
5
5
|
author_email = nicholas@dimonaco.co.uk
|
|
6
6
|
description = PyamilySeq - A a tool to look for sequence-based gene families identified by clustering methods such as CD-HIT, DIAMOND, BLAST or MMseqs2.
|
|
@@ -3,7 +3,7 @@ import collections
|
|
|
3
3
|
import os
|
|
4
4
|
import glob
|
|
5
5
|
import subprocess
|
|
6
|
-
|
|
6
|
+
|
|
7
7
|
|
|
8
8
|
|
|
9
9
|
try:
|
|
@@ -43,8 +43,8 @@ def main():
|
|
|
43
43
|
required.add_argument('-run_mode', action='store', dest='run_mode', choices=['Full','Partial'],
|
|
44
44
|
help='Run Mode: Should PyamilySeq be run in "Full" or "Partial" mode?',
|
|
45
45
|
required=True)
|
|
46
|
-
required.add_argument('-group_mode', action='store', dest='group_type', choices=['Species'
|
|
47
|
-
help='Group Mode: Should PyamilySeq be run in "Species" or "Genus" mode?',
|
|
46
|
+
required.add_argument('-group_mode', action='store', dest='group_type', choices=['Species'],
|
|
47
|
+
help='Group Mode: Should PyamilySeq be run in "Species" or "Genus" mode? - Genus mode not currently functioning',
|
|
48
48
|
required=True)
|
|
49
49
|
required.add_argument("-clust_tool", action="store", dest="clust_tool", choices=['CD-HIT'],
|
|
50
50
|
help="Clustering tool to use: CD-HIT, DIAMOND, BLAST or MMseqs2.",
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.1
|
|
2
2
|
Name: PyamilySeq
|
|
3
|
-
Version: 0.5.
|
|
3
|
+
Version: 0.5.1
|
|
4
4
|
Summary: PyamilySeq - A a tool to look for sequence-based gene families identified by clustering methods such as CD-HIT, DIAMOND, BLAST or MMseqs2.
|
|
5
5
|
Home-page: https://github.com/NickJD/PyamilySeq
|
|
6
6
|
Author: Nicholas Dimonaco
|
|
@@ -47,7 +47,7 @@ usage: PyamilySeq.py [-h] -run_mode {Full,Partial} -group_mode {Species,Genus}
|
|
|
47
47
|
[-gpa GENE_PRESENCE_ABSENCE_OUT] [-verbose {True,False}]
|
|
48
48
|
[-v]
|
|
49
49
|
|
|
50
|
-
PyamilySeq v0.5.
|
|
50
|
+
PyamilySeq v0.5.1: PyamilySeq Run Parameters.
|
|
51
51
|
|
|
52
52
|
options:
|
|
53
53
|
-h, --help show this help message and exit
|
|
@@ -56,9 +56,9 @@ Required Arguments:
|
|
|
56
56
|
-run_mode {Full,Partial}
|
|
57
57
|
Run Mode: Should PyamilySeq be run in "Full" or
|
|
58
58
|
"Partial" mode?
|
|
59
|
-
-group_mode {Species
|
|
59
|
+
-group_mode {Species}
|
|
60
60
|
Group Mode: Should PyamilySeq be run in "Species" or
|
|
61
|
-
"Genus" mode?
|
|
61
|
+
"Genus" mode? - Genus mode not currently functioning
|
|
62
62
|
-clust_tool {CD-HIT} Clustering tool to use: CD-HIT, DIAMOND, BLAST or
|
|
63
63
|
MMseqs2.
|
|
64
64
|
-output_dir OUTPUT_DIR
|
|
@@ -119,14 +119,14 @@ Misc:
|
|
|
119
119
|
### Examples: Below are two examples of running PyamilySeq in its two main modes.
|
|
120
120
|
#### 'Full Mode': Will conduct clustering of sequences as part of PyamilySeq run
|
|
121
121
|
```bash
|
|
122
|
-
PyamilySeq -
|
|
122
|
+
PyamilySeq -run_mode Full -group_mode Species -output_dir ../../test_data/testing -input_type combined -input_dir .../test_data/genomes -name_split _combined.gff3 -pid 0.99 -len_diff 0.99 -clust_tool CD-HIT -gpa True -con True -w 99 -verbose True
|
|
123
123
|
```
|
|
124
124
|
#### 'Partial Mode': Will take the output of a sequence clustering
|
|
125
125
|
```bash
|
|
126
126
|
PyamilySeq -run_mode Partial -group_mode Species -output_dir .../test_data/testing -cluster_file .../test_data/CD-HIT/combined_Ensmbl_pep_CD_90_60.clstr -clust_tool CD-HIT -original_fasta .../test_data/combined_Ensmbl_cds.fasta -gpa True -con True -w 99 -verbose True
|
|
127
127
|
```
|
|
128
128
|
|
|
129
|
-
```
|
|
129
|
+
```bash
|
|
130
130
|
Calculating Groups
|
|
131
131
|
Gene Groups:
|
|
132
132
|
first_core_99: 3103
|
|
@@ -145,7 +145,7 @@ Seq-Combiner -input_dir .../test_data/genomes -name_split _combined.gff3 -output
|
|
|
145
145
|
```bash
|
|
146
146
|
usage: Seq_Combiner.py [-h] -input_dir INPUT_DIR -input_type {separate,combined} -name_split NAME_SPLIT -output_dir OUTPUT_DIR -output_name OUTPUT_FILE
|
|
147
147
|
|
|
148
|
-
Seq-Combiner v0.5.
|
|
148
|
+
Seq-Combiner v0.5.1: Seq-Combiner Run Parameters.
|
|
149
149
|
|
|
150
150
|
options:
|
|
151
151
|
-h, --help show this help message and exit
|
|
File without changes
|
|
File without changes
|
{pyamilyseq-0.5.0 → pyamilyseq-0.5.1}/src/PyamilySeq/CD-Hit_StORF-Reporter_Cross-Genera_Builder.py
RENAMED
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|