npsearch 2.0.0 → 2.0.1
Sign up to get free protection for your applications and to get access to all the features.
- checksums.yaml +4 -4
- data/README.md +104 -53
- data/lib/npsearch/version.rb +1 -1
- data/npsearch.gemspec +2 -1
- metadata +5 -4
checksums.yaml
CHANGED
@@ -1,7 +1,7 @@
|
|
1
1
|
---
|
2
2
|
SHA1:
|
3
|
-
metadata.gz:
|
4
|
-
data.tar.gz:
|
3
|
+
metadata.gz: 7ede73d531ae96b5790db5dfe81704e44981470d
|
4
|
+
data.tar.gz: bbcbcff58473ba403044aded6e2882e150821a7b
|
5
5
|
SHA512:
|
6
|
-
metadata.gz:
|
7
|
-
data.tar.gz:
|
6
|
+
metadata.gz: 9c2a40a4ee188e2a3c159e66c92e4949b41775684ae4a4d87943e562ca361b2619534cc69d4c1ac6d263467edbb4b08aba982d9d48026d48882720b8e30b9a12
|
7
|
+
data.tar.gz: 3d258345776914e041dbb5dfd4447b88bd7feb6176a654f147598b41cb4fa011c4051d8934a8fe9df4e7ab324b229cf252d9c6c8cacfc7c9da1a02e34cff8c1d
|
data/README.md
CHANGED
@@ -1,68 +1,119 @@
|
|
1
|
-
#
|
2
|
-
[![
|
3
|
-
[![
|
4
|
-
[![Dependency Status](https://gemnasium.com/
|
5
|
-
[![Inline docs](http://inch-ci.org/github/IsmailM/NeuroPeptideSearch.png?branch=master)](http://inch-ci.org/github/IsmailM/NeuroPeptideSearch)
|
1
|
+
# NpSearch (NeuroPeptideSearch)
|
2
|
+
[![Build Status](https://travis-ci.org/wurmlab/NpSearch.svg?branch=master)](https://travis-ci.org/wurmlab/NpSearch)
|
3
|
+
[![Gem Version](https://badge.fury.io/rb/npsearch.svg)](http://badge.fury.io/rb/npsearch)
|
4
|
+
[![Dependency Status](https://gemnasium.com/wurmlab/NpSearch.svg)](https://gemnasium.com/wurmlab/NpSearch)
|
6
5
|
|
7
|
-
> A tool to identify noval Neuropeptides.
|
8
6
|
|
9
|
-
NpSearch (NeuroPeptideSearch) is a program that searches for potential neuropeptides precursors based on the motifs commonly found on a neuropeptide. Ideally, the input would be transcriptome or protein data since there are no introns to worry about and the signal peptide would be attached to the front of the precursor.
|
10
7
|
|
11
|
-
|
8
|
+
## Introduction
|
9
|
+
NpSearch is a tool that helps identify novel neuropeptides. As such it is not based on homology to existing neuropeptides - rather NpSearch is based on the common characteristics of neuropeptides and their precursors.
|
12
10
|
|
13
|
-
|
11
|
+
If you use this program, please cite us:
|
14
12
|
|
15
|
-
|
13
|
+
>Moghul I, Rowe M, Priyam A, ELphick M & Wurm Y <em>(in prep)</em> NpSearch: A Tool to Identify Novel Neuropeptides
|
16
14
|
|
17
|
-
|
15
|
+
NpSearch produces a fasta file and highly visual html file that are ordered by the likelihood of a sequence encoding a neuropeptide precursor.
|
18
16
|
|
19
|
-
|
17
|
+
NpSearch orders the results based on the following characteristics:
|
18
|
+
|
19
|
+
- **Signal peptide**: All neuropeptide precursors must have a signal peptide. This is due to the fact that the final bioactive neuropeptide has to be secreted from the cell of synthesis in order to be functionally active.
|
20
|
+
- **Cleavage sites**: Being derived from a precursor, the bioactive neuropeptide has to be cleaved out from the precursor. Prohormone convertase enzymes cleave these bioactive peptides at specific cleavage sites. Since certain cleavage motifs are more likely to be cleaved, NpSearch awards sequences with cleavage site motifs that are more likely to be cleaved with a higher score.
|
21
|
+
- **C-terminal Glycine**: A significant number of bioactive neuropeptides have a C-terminal glycine, that is amidated during post-translation modification. NpSearch awards sequences that have a potential neuropeptide with a C-terminal glycine a higher score.
|
22
|
+
- **Repeated peptides**: Some neuropeptide precursors contain numerous copies of the same neuropeptides (usually with slight sequence differences). NpSearch attempts to detect this by aligning all potential neuropeptides within a sequence. If a sequence is found to have multiple, similar predicted NPs, NpSearch awards it with a higher score.
|
23
|
+
- **Acidic spacer regions**: Neuropeptide precursors that contain multiple neuropeptide copies tend to have highly acidic spacer regions that separate the NP copies. If detected by NpSearch, the sequence is awarded with a higher score.
|
24
|
+
|
25
|
+
|
26
|
+
|
27
|
+
|
28
|
+
|
29
|
+
|
30
|
+
## Installation
|
20
31
|
|
21
|
-
|
32
|
+
### Installation Requirements
|
33
|
+
* Ruby (>= 2.0.0)
|
34
|
+
* SignalP 4.1 (Available from [here](http://www.cbs.dtu.dk/cgi-bin/nph-sw_request?signalp))
|
35
|
+
* USearch (Available from [here]())
|
36
|
+
* A web browser.
|
22
37
|
|
23
38
|
## Installation
|
39
|
+
Simply run the following command in the terminal.
|
24
40
|
|
25
|
-
|
41
|
+
```bash
|
42
|
+
gem install npsearch
|
26
43
|
```
|
27
|
-
|
44
|
+
|
45
|
+
If that doesn't work, try `sudo gem install npsearch` instead.
|
46
|
+
|
47
|
+
##### Running From Source (Not Recommended)
|
48
|
+
It is also possible to run from source. However, this is not recommended.
|
49
|
+
|
50
|
+
```bash
|
51
|
+
# Clone the repository.
|
52
|
+
git clone https://github.com/wurmlab/npsearch.git
|
53
|
+
|
54
|
+
# Move into NpSearch source directory.
|
55
|
+
cd NpSearch
|
56
|
+
|
57
|
+
# Install bundler
|
58
|
+
gem install bundler
|
59
|
+
|
60
|
+
# Use bundler to install dependencies
|
61
|
+
bundle install
|
62
|
+
|
63
|
+
# Optional: run tests, build documentation and build the gem from source
|
64
|
+
bundle exec rake
|
65
|
+
|
66
|
+
# Run NpSearch.
|
67
|
+
bundle exec npsearch -h
|
68
|
+
# note that `bundle exec` executes NpSearch in the context of the bundle
|
69
|
+
|
70
|
+
# Alternativaly, install NpSearch as a gem
|
71
|
+
bundle exec rake install
|
72
|
+
npsearch -h
|
28
73
|
```
|
74
|
+
|
75
|
+
|
76
|
+
|
77
|
+
|
29
78
|
## Usage
|
79
|
+
Verify NpSearch installed by running the following command in the terminal:
|
80
|
+
|
81
|
+
```bash
|
82
|
+
npsearch
|
83
|
+
```
|
84
|
+
|
85
|
+
You should see the following output.
|
86
|
+
|
87
|
+
```bash
|
88
|
+
* Usage: npsearch [Options] -i [Input File]
|
89
|
+
|
90
|
+
* Mandatory Options:
|
91
|
+
|
92
|
+
-i, --input [file] Path to the input fasta file
|
93
|
+
|
94
|
+
* Optional Options:
|
95
|
+
-s, --signalp_path The full path to the signalp script. This can be downloaded from
|
96
|
+
CBS. See https://www.github.com/wurmlab/NpSearch for more
|
97
|
+
information
|
98
|
+
-u, --usearch_path The full path to the usearch binary. This script can be downloaded
|
99
|
+
from .... See https://www.github.com/wurmlab/NpSearch for more
|
100
|
+
information
|
101
|
+
-n, --num_threads The number of threads to use when analysing the input file
|
102
|
+
-m, --orf_min_length N The minimum length of a potential neuropeptide precursor.
|
103
|
+
Default: 30
|
104
|
+
-h, --help Display this screen
|
105
|
+
-v, --version Shows version
|
106
|
+
|
107
|
+
```
|
108
|
+
|
109
|
+
|
110
|
+
### Example Usage Scenario
|
111
|
+
The following runs NpSearch on an input fasta dataset.
|
112
|
+
|
113
|
+
```bash
|
114
|
+
npsearch -i INPUT_FASTA_FILE -s /path/to/signalp -u /path/to/usearch -n NUM_THREADS
|
115
|
+
```
|
116
|
+
|
117
|
+
## Output
|
118
|
+
The output produced by NpSearch is presented in two manners. NpSearch produces a highly visual HTML file that can be open in any browsers (an example can seen [here]()) and a fasta file.
|
30
119
|
|
31
|
-
* Usage: npsearch [Options] -i [Input File] -o [Output Folder Name]
|
32
|
-
|
33
|
-
* Mandatory Options:
|
34
|
-
|
35
|
-
-i, --input [file] The input file (in fasta format). Can be a relative or a full
|
36
|
-
path.
|
37
|
-
-o, --output [folder name] The path to the output folder. This will be created if the
|
38
|
-
folder does not exist.
|
39
|
-
|
40
|
-
* Optional Options:
|
41
|
-
-m, --motif [Query Motif] By default NpSearch only searches for dibasic cleavage site
|
42
|
-
("KR", "RR" or "KK"). This option allows one to change the
|
43
|
-
set of cleavage sites to be searched.
|
44
|
-
The period "." can be used to denote any character. Multiple
|
45
|
-
motifs query can be used by using a pipeline character ("|")
|
46
|
-
between each query and putting the motif query in speech marks
|
47
|
-
e.g. "KR|RR|R..R"
|
48
|
-
Advanced Users: Regular expressions are supported.
|
49
|
-
-c, --cut_off N Changes the minimum Open Reading
|
50
|
-
Frame from the default 10 amino acid residues to N amino acid
|
51
|
-
residues.
|
52
|
-
-s, --signalp_file [file] Is used to supply the signal peptide results to the program.
|
53
|
-
These signal peptide results must be created using the SignalP
|
54
|
-
program (Version 4.x), downloadable from CBS. If this argument
|
55
|
-
isn't suplied, then NpSearch will try to run a local version
|
56
|
-
of the Signal P script.
|
57
|
-
-e, --extract_orf Only extracts the Open Reading Frames.
|
58
|
-
-v, --verbose Provides more information on each step taken in this program.
|
59
|
-
-h, --help Display this screen
|
60
|
-
--version Shows version
|
61
|
-
|
62
|
-
## Contributing
|
63
|
-
|
64
|
-
1. Fork it
|
65
|
-
2. Create your feature branch (`git checkout -b my-new-feature`)
|
66
|
-
3. Commit your changes (`git commit -am 'Add some feature'`)
|
67
|
-
4. Push to the branch (`git push origin my-new-feature`)
|
68
|
-
5. Create new Pull Request
|
data/lib/npsearch/version.rb
CHANGED
data/npsearch.gemspec
CHANGED
@@ -12,7 +12,8 @@ Gem::Specification.new do |s|
|
|
12
12
|
s.description = 'Search for Neuropeptides based solely on the common' \
|
13
13
|
' neuropeptide markers (e.g. signal peptide, dibasic' \
|
14
14
|
' cleavage sites etc.) i.e. not based on homology to' \
|
15
|
-
|
15
|
+
" known neuropeptides.\n\n" \
|
16
|
+
' For more information: https://github.com/wurmlab/npsearch'
|
16
17
|
s.summary = 'Search for neuropeptides based on the common' \
|
17
18
|
' neuropeptides markers'
|
18
19
|
s.homepage = 'https://github.com/IsmailM/NeuroPeptideSearch'
|
metadata
CHANGED
@@ -1,7 +1,7 @@
|
|
1
1
|
--- !ruby/object:Gem::Specification
|
2
2
|
name: npsearch
|
3
3
|
version: !ruby/object:Gem::Version
|
4
|
-
version: 2.0.
|
4
|
+
version: 2.0.1
|
5
5
|
platform: ruby
|
6
6
|
authors:
|
7
7
|
- Ismail Moghul
|
@@ -84,9 +84,10 @@ dependencies:
|
|
84
84
|
- - "~>"
|
85
85
|
- !ruby/object:Gem::Version
|
86
86
|
version: '3.0'
|
87
|
-
description:
|
88
|
-
(e.g. signal peptide, dibasic cleavage sites etc.) i.e. not based on homology to
|
89
|
-
|
87
|
+
description: |-
|
88
|
+
Search for Neuropeptides based solely on the common neuropeptide markers (e.g. signal peptide, dibasic cleavage sites etc.) i.e. not based on homology to known neuropeptides.
|
89
|
+
|
90
|
+
For more information: https://github.com/wurmlab/npsearch
|
90
91
|
email:
|
91
92
|
- y.wurm@qmul.ac.uk
|
92
93
|
executables:
|