parse_fasta 1.8.1 → 1.8.2

Files changed (5)

checksums.yaml +8 -8
data/README.md +31 -39
data/lib/parse_fasta/fastq_file.rb +2 -2
data/lib/parse_fasta/version.rb +1 -1
metadata +2 -2

checksums.yaml CHANGED

@@ -1,15 +1,15 @@
 ---
 !binary "U0hBMQ==":
   metadata.gz: !binary |-
-    YzlkNTQ5NGQ5YTFlNzVkOTJjOTJkMTM2YmUwN2FlMjhmOTg2ZDZlMQ==
+    MDE1YThkNzUyNzI0MTMwZDMyYzBiNzFiODMzZGQzNzQ5ODU3ZTk1MA==
   data.tar.gz: !binary |-
-    OThhYTU5NTAzYzlkMTg2N2IxOWNjYTExMWEyODRiY2Q2OGFhMzQ4MQ==
+    NjBmZGUxZTdkM2UyZTQ4YWY1MDliMTI0OTJlYjA5ZDFmMzg4OWRlZQ==
 SHA512:
   metadata.gz: !binary |-
-    NzMyYTNmZmQ0YThlMThkZmE3ZjZhZjAzNDM2MGQ4ZTcwODhkODY3NzI2NzU1
-    NDQzYmU1ZDBiZjljYzVhZmNlMDIzZDMxMDc4Zjk3N2E1YTAxOTUzZTIyOGNj
-    NzdjOWJiODA2ZDA0NGNmMjFkOGI1ZjgxZWY3NTRmMTQ1MDc5MTU=
+    MjIxNGNlODdkNTk3ZWE1ZDk1Zjg4ZDY0ZWE3NzE0ZWI0ODQ4MDZjZTk1MDY1
+    OGNiOGViOWYwZDU5ODY0YjNmZWY1ODYwOGVlN2E5MTVmYzZlZmIwNzE4MjNi
+    YWFjMjgzNGQ4YmMzODdjYjZjNTBmYTM4MWFiYTcyYjlmZWFhYWM=
   data.tar.gz: !binary |-
-    ODU2NzMwZTk3ZmE0ZTIxYzMwOWVkMWUyY2U4MTE3YzAzMzI5MzU1ZDAzNWE3
-    OGQ3ODk2ZjQwYTNjNTJlZTVjYzg3MGU5YzliZjAyYjQ4ZDNmNjRlNzE2YmJk
-    MmY2OTRhYjI3NTM3ODFmYWYwNDk2ZjQ0YzI3YjIxMzI3MGU3MmE=
+    YTNiMTYzNmJhODkzMjEyMjBlOTgxOGIyMjFmMTFlOTE0NTEyOWZjNTgxMTRj
+    N2Y4YjE3NWUxMjYyNTRjNTYzZGE3MjBhNjJjZTNmNjRkYzY5ZGI2MGY0MjQz
+    N2RhMDUxN2E1MjY0NDZkOWQyMjEzYTU2ZDE4M2FlZDg3YzA0N2M=
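
The `!binary` values in this hunk are Base64 text, so the change is hard to read directly. The following is a minimal sketch, not part of the gem, of how one might decode a field for inspection. The helper name `decode_binary_field` is hypothetical, and the sample strings are copied from the hunk above.

```ruby
# Hypothetical helper: decode one Base64 `!binary` value from checksums.yaml.
# String#unpack1("m") is Ruby's built-in Base64 decoder; it also tolerates
# the line breaks found in the multi-line SHA512 entries.
def decode_binary_field(base64_text)
  base64_text.unpack1("m")
end

# The key of the first mapping in the hunk.
puts decode_binary_field("U0hBMQ==")

# The new metadata.gz value stored under that key.
new_metadata_digest = decode_binary_field(
  "MDE1YThkNzUyNzI0MTMwZDMyYzBiNzFiODMzZGQzNzQ5ODU3ZTk1MA=="
)
puts new_metadata_digest
puts new_metadata_digest.length
```

Decoding both the `-` and `+` values this way makes it easier to compare them against digests computed locally from the downloaded `.gem` archive.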

data/README.md CHANGED

@@ -66,14 +66,11 @@ Read fasta file into a hash.
 ## Versions ##
-### 1.8 ###
+### 1.8.2 ###
-Add `Sequence#rev_comp`. It can handle IUPAC characters. Since
-`parse_fasta` doesn't check whether the seq is AA or NA, if called on
-an amino acid string, things will get weird as it will complement the
-IUPAC characters in the AA string and leave others.
+Speed up `FastqFile#each_record`.
-#### 1.8.1 ####
+### 1.8.1 ###
 An error will be raised if a fasta file has a `>` in the
 sequence. Sometimes files are not terminated with a newline
@@ -93,12 +90,14 @@ This will raise `ParseFasta::SequenceFormatError`.
 Also, headers with lots of `>` within are fine now.
+### 1.8 ###
-### 1.7 ###
-Add `SeqFile#to_hash`, `FastaFile#to_hash` and `FastqFile#to_hash`.
+Add `Sequence#rev_comp`. It can handle IUPAC characters. Since
+`parse_fasta` doesn't check whether the seq is AA or NA, if called on
+an amino acid string, things will get weird as it will complement the
+IUPAC characters in the AA string and leave others.
-#### 1.7.2 ####
+### 1.7.2 ###
 Strip spaces (not all whitespace) from `Sequence` and `Quality` strings.
@@ -108,24 +107,28 @@ there are spaces that don't match in the quality and sequence in a
 fastQ file, then things will get messed up in the FastQ file. FastQ
 shouldn't have spaces though.
-### 1.6 ###
+### 1.7 ###
-Added `SeqFile` class, which accepts either fastA or fastQ files. It
-uses FastaFile and FastqFile internally. You can use this class if you
-want your scripts to accept either fastA or fastQ files.
+Add `SeqFile#to_hash`, `FastaFile#to_hash` and `FastqFile#to_hash`.
-If you need the description and quality string, you should use
-FastqFile instead.
+### 1.6.2 ###
+`FastaFile::open` now raises a `ParseFasta::DataFormatError` when passed files
+that don't begin with a `>`.
-#### 1.6.1 ####
+### 1.6.1 ###
 Better internal handling of empty sequences -- instead of raising
 errors, pass empty sequences.
-#### 1.6.2 ####
+### 1.6 ###
-`FastaFile::open` now raises a `ParseFasta::DataFormatError` when passed files
-that don't begin with a `>`.
+Added `SeqFile` class, which accepts either fastA or fastQ files. It
+uses FastaFile and FastqFile internally. You can use this class if you
+want your scripts to accept either fastA or fastQ files.
+If you need the description and quality string, you should use
+FastqFile instead.
 ### 1.5 ###
@@ -204,17 +207,16 @@ Last version with File monkey patch.
 ## Benchmark ##
-Perhaps this isn't exactly fair since `BioRuby` is a big module with
-lots of features and error checking, whereas `parse_fasta` is meant to
-be lightweight and easy to use for my own research. Oh well ;)
+**NOTE**: These benchmarks are against an older version of
+  `parse_fasta`.
+Some quick and dirty benchmarks against `BioRuby`.
 ### FastaFile#each_record ###
-You're probably wondering...How does it compare to BioRuby in some
-super accurate benchmarking tests? Lucky for you, I calculated
-sequence length for each fasta record with both the `each_record`
-method from this gem and using the `FastaFormat` class from
-BioRuby. You can see the test script in `benchmark.rb`.
+Calculating sequence length length for each fasta record with both the
+`each_record` method from this gem and using the `FastaFormat` class
+from BioRuby. You can see the test script in `benchmark.rb`.
 The test file contained 2,009,897 illumina reads and the file size
 was 1.1 gigabytes. Here are the results from Ruby's `Benchmark` class:
@@ -255,20 +257,10 @@ test 2 was 4,000,000 and test 3 was 8,000,000 bases.
 Nice!
-Troll: "But Ryan, when will you find the GC of an 8,000,000 base
-sequence?"
+Troll: "When will you find the GC of an 8,000,000 base sequence?"
 Me: "Step off, troll!"
-## Test suite & docs ##
-For a good time, you could clone this repo and run the test suite with
-rspec! Or if you just don't trust that it works like it should. The
-specs probably need a little clean up...so fork it and clean it up ;)
-Same with the docs. Clone the repo and build them yourself with `yard`
-if you are in need of some excitement.
 ## Notes ##
 Only the `SeqFile` class actually checks to make sure that you passed

data/lib/parse_fasta/fastq_file.rb CHANGED

@@ -80,11 +80,11 @@ class FastqFile < File
       case count % 4
       when 0
-        header = line.sub(/^@/, '')
+        header = line[1..-1]
       when 1
         sequence = Sequence.new(line)
       when 2
-        description = line.sub(/^\+/, '')
+        description = line[1..-1]
       when 3
         quality = Quality.new(line)
         yield(header, sequence, description, quality)
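
The hunk above swaps an anchored regex substitution for a plain slice, which appears to be the `FastqFile#each_record` speed-up noted in the README. The sketch below is illustrative only: the helper names `strip_with_sub`, `strip_with_slice`, and `time_it` are hypothetical and not part of `parse_fasta`. It shows that both forms agree on well-formed lines, that they differ when the marker is missing, and one way to time them locally.

```ruby
# Hypothetical helpers mirroring the two variants in the hunk above.
def strip_with_sub(line, marker_pattern)
  line.sub(marker_pattern, '')   # removes the marker only if it is present
end

def strip_with_slice(line)
  line[1..-1]                    # drops the first character unconditionally
end

# Hypothetical timing helper: runs the block `iterations` times and
# returns the elapsed wall-clock seconds.
def time_it(iterations)
  start = Process.clock_gettime(Process::CLOCK_MONOTONIC)
  iterations.times { yield }
  Process.clock_gettime(Process::CLOCK_MONOTONIC) - start
end

header = "@read_1 sample header"
# On a well-formed FASTQ header line the two variants agree.
puts strip_with_sub(header, /^@/) == strip_with_slice(header)

# On a line without the leading marker they do not.
malformed = "read_2"
puts strip_with_sub(malformed, /^@/)   # line comes back unchanged
puts strip_with_slice(malformed)       # first character is lost

n = 200_000
printf("sub:   %.3fs\n", time_it(n) { strip_with_sub(header, /^@/) })
printf("slice: %.3fs\n", time_it(n) { strip_with_slice(header) })
```

The slice assumes every header line starts with `@` and every description line with `+`. For input that follows the FASTQ layout the results are identical, but a malformed record would silently lose a character instead of passing through untouched.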

data/lib/parse_fasta/version.rb CHANGED

@@ -17,5 +17,5 @@
 # along with parse_fasta.  If not, see <http://www.gnu.org/licenses/>.
 module ParseFasta
-  VERSION = "1.8.1"
+  VERSION = "1.8.2"
 end

metadata CHANGED

@@ -1,14 +1,14 @@
 --- !ruby/object:Gem::Specification
 name: parse_fasta
 version: !ruby/object:Gem::Version
-  version: 1.8.1
+  version: 1.8.2
 platform: ruby
 authors:
 - Ryan Moore
 autorequire:
 bindir: bin
 cert_chain: []
-date: 2016-03-11 00:00:00.000000000 Z
+date: 2016-04-16 00:00:00.000000000 Z
 dependencies:
 - !ruby/object:Gem::Dependency
   name: bundler