lederhosen 1.3.0 → 1.3.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
data/lederhosen.gemspec CHANGED
@@ -5,11 +5,11 @@
5
5
 
6
6
  Gem::Specification.new do |s|
7
7
  s.name = "lederhosen"
8
- s.version = "1.3.0"
8
+ s.version = "1.3.1"
9
9
 
10
10
  s.required_rubygems_version = Gem::Requirement.new(">= 0") if s.respond_to? :required_rubygems_version=
11
11
  s.authors = ["Austin G. Davis-Richardson"]
12
- s.date = "2012-11-13"
12
+ s.date = "2012-11-26"
13
13
  s.description = "Various tools for OTU clustering"
14
14
  s.email = "harekrishna@gmail.com"
15
15
  s.executables = ["lederhosen"]
@@ -6,7 +6,7 @@ module Lederhosen
6
6
 
7
7
  method_option :input, :type => :string, :required => true
8
8
  method_option :database, :type => :string, :required => true
9
- method_option :threads, :type => :numeric, :default => 0
9
+ method_option :threads, :type => :numeric, :default => false
10
10
  method_option :identity, :type => :numeric, :required => true
11
11
  method_option :output, :type => :string, :required => true
12
12
  method_option :strand, :type => :string, :default => 'plus'
@@ -33,8 +33,8 @@ module Lederhosen
33
33
  "--strand #{strand}"
34
34
  ]
35
35
 
36
- # threads = 0 : use all threads (default)
37
- if threads != 0
36
+ # threads = False : use all threads (default)
37
+ if threads != false
38
38
  cmd << "--threads #{threads}"
39
39
  end
40
40
 
@@ -47,12 +47,15 @@ module Lederhosen
47
47
  pbar.inc line.unpack('*C').size
48
48
 
49
49
  dat = parse_usearch_line(line.strip)
50
- next if dat.nil?
51
50
 
52
- levels.each do |level|
53
- name = dat[level] rescue nil
54
- all_names[level] << name
55
- level_sample_cluster_count[level][input_file][name] += 1
51
+ if dat.nil? # unclassified
52
+ levels.each { |level| level_sample_cluster_count[level][input_file]['unclassified_reads'] += 1 }
53
+ else # classified
54
+ levels.each do |level|
55
+ name = dat[level] rescue nil
56
+ all_names[level] << name
57
+ level_sample_cluster_count[level][input_file][name] += 1
58
+ end
56
59
  end
57
60
 
58
61
  end
@@ -3,7 +3,7 @@ module Lederhosen
3
3
  MAJOR = 1
4
4
  MINOR = 3
5
5
  CODENAME = 'Dirndl' # changes for minor versions
6
- PATCH = 0
6
+ PATCH = 1
7
7
 
8
8
  STRING = [MAJOR, MINOR, PATCH].join('.')
9
9
  end
data/readme.md CHANGED
@@ -2,7 +2,11 @@
2
2
 
3
3
  # Lederhosen
4
4
 
5
- Cluster raw Illumina 16S rRNA amplicon data to generate OTUs.
5
+ OTU clustering for rRNA amplicons. Lederhosen is intended to be simple, robust and easy to use.
6
+
7
+ ### Why not QIIME?
8
+
9
+ QIIME is great but imagine for a moment, if you will, a world where there was only one web browser.
6
10
 
7
11
  ### About
8
12
 
@@ -13,24 +17,20 @@ using paired and non-paired end short reads such as those produced by Illumina (
13
17
  - Lederhosen is free and open source under the [MIT open source license](http://opensource.org/licenses/mit-license.php/).
14
18
  - Except for USEARCH which requires a license, Lederhosen is available for commercial use.
15
19
 
16
- ## How do I get Lederhosen?
17
-
18
- 0. Obtain & Install [USEARCH](http://www.drive5.com/) (32bit is fine for non-commercial use)
19
- 2. Get a copy of [TaxCollector](http://github.com/audy/taxcollector)
20
- 3. Install Lederhosen by typing:
21
-
22
- `sudo gem install lederhosen`
23
- 4. Check installation by typing `lederhosen`. You should see some help text.
24
-
25
- ## Features
20
+ ### Features
26
21
 
27
22
  - Sequence trimming (paired-end Illumina).
28
23
  - Parallel, referenced-based clustering to TaxCollector using USEARCH
29
24
  - Generation and filtering of OTU abundancy matrices.
30
25
 
31
- ## How do I use Lederhosen?
26
+ ### Installation
32
27
 
33
- Lederhosen is just a convenient wrapper for UCLUST and BLAT with some scripts for quality filtering, de-noising of data as well as creation of nice tables. It is similar to QIIME but meant for paired-end Illumina data rather than single-end 454. The basic lederhosen pipeline consists of: trimming, joining, sorting, filtering, clustering, more filtering, and output generation (OTU tables, representative reads, reads by cluster, and taxonomic descriptions for clusters). See the example pipeline in `pipeline.sh`.
28
+ 0. Obtain & Install [USEARCH](http://www.drive5.com/) (32bit is fine for non-commercial use)
29
+ 2. Get a copy of [TaxCollector](http://github.com/audy/taxcollector) or [GreenGenes](http://greengenes.lbl.gov) 16S database
30
+ 3. Install Lederhosen by typing:
31
+
32
+ `sudo gem install lederhosen`
33
+ 4. Check installation by typing `lederhosen`. You should see some help text.
34
34
 
35
35
  ## Tasks
36
36
 
@@ -50,8 +50,8 @@ Create UDB database required by usearch from TaxCollector
50
50
 
51
51
  ```bash
52
52
  lederhosen make_udb \
53
- --input=taxcollector.fa \
54
- --output=taxcollector.udb
53
+ --input=taxcollector.fa \
54
+ --output=taxcollector.udb
55
55
  ```
56
56
 
57
57
  ### Cluster Reads using USEARCH
@@ -60,10 +60,10 @@ Cluster reads using USEARCH. Output is a uc file.
60
60
 
61
61
  ```bash
62
62
  lederhosen cluster \
63
- --input=trimmed/*.fasta \
64
- --identity=0.95 \
65
- --output=clusters_95.uc \
66
- --database=taxcollector.udb
63
+ --input=trimmed/*.fasta \
64
+ --identity=0.95 \
65
+ --output=clusters_95.uc \
66
+ --database=taxcollector.udb
67
67
  ```
68
68
  ### Generate OTU table(s)
69
69
 
@@ -71,9 +71,9 @@ Create an OTU abundance table where rows are samples and columns are clusters. T
71
71
 
72
72
  ```bash
73
73
  lederhosen otu_table \
74
- --clusters=clusters_95.uc \
75
- --prefix=otu_table \
76
- --level=domain phylum class order family genus species
74
+ --clusters=clusters_95.uc \
75
+ --prefix=otu_table \
76
+ --level=domain phylum class order family genus species
77
77
  ```
78
78
 
79
79
  This will create the files:
metadata CHANGED
@@ -1,7 +1,7 @@
1
1
  --- !ruby/object:Gem::Specification
2
2
  name: lederhosen
3
3
  version: !ruby/object:Gem::Version
4
- version: 1.3.0
4
+ version: 1.3.1
5
5
  prerelease:
6
6
  platform: ruby
7
7
  authors:
@@ -9,7 +9,7 @@ authors:
9
9
  autorequire:
10
10
  bindir: bin
11
11
  cert_chain: []
12
- date: 2012-11-13 00:00:00.000000000 Z
12
+ date: 2012-11-26 00:00:00.000000000 Z
13
13
  dependencies:
14
14
  - !ruby/object:Gem::Dependency
15
15
  name: dna
@@ -160,7 +160,7 @@ required_ruby_version: !ruby/object:Gem::Requirement
160
160
  version: '0'
161
161
  segments:
162
162
  - 0
163
- hash: -3539547832258076669
163
+ hash: -2600038455412980670
164
164
  required_rubygems_version: !ruby/object:Gem::Requirement
165
165
  none: false
166
166
  requirements: