RubyGems - bio-vcf - Versions diffs - 0.0.1 → 0.0.2 - Mend

bio-vcf 0.0.1 → 0.0.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (24)

checksums.yaml +7 -0
data/.travis.yml +12 -0
data/Gemfile +6 -6
data/Gemfile.lock +49 -54
data/README.md +83 -10
data/Rakefile +5 -5
data/VERSION +1 -1
data/bin/bio-vcf +24 -8
data/bio-vcf.gemspec +73 -0
data/features/diff_count.feature +30 -0
data/features/multisample.feature +37 -0
data/features/somaticsniper.feature +84 -0
data/features/step_definitions/diff_count.rb +41 -0
data/features/step_definitions/multisample.rb +73 -0
data/features/step_definitions/somaticsniper.rb +122 -0
data/features/support/env.rb +4 -0
data/lib/bio-vcf/variant.rb +38 -0
data/lib/bio-vcf/vcfgenotypefield.rb +118 -10
data/lib/bio-vcf/vcfheader.rb +5 -0
data/lib/bio-vcf/vcfrdf.rb +30 -0
data/lib/bio-vcf/vcfrecord.rb +68 -5
data/lib/bio-vcf.rb +1 -0
data/test/data/input/multisample.vcf +150 -0
metadata +28 -76

checksums.yaml ADDED

@@ -0,0 +1,7 @@
+---
+SHA1:
+  metadata.gz: 014f3f4adef8a533501e115027fc8a920487f949
+  data.tar.gz: 89e442176c21a0893c267dc324db403fd57e7577
+SHA512:
+  metadata.gz: 624f6cd5251384da85b824cd1d98e49be2efcb765d34b902b225ba8e8dc7ce7b6fa68d9a47b8bea5f7eb60b33e21182c7d941d7b0da70ce58228354021c596a0
+  data.tar.gz: 7ffe437d72368db97987fe53c8aa93bebdcff68002975f86e02cf45ec69b48f76e060e76140e27242122b3c533021eae4073a19c0c0a09089edac62b02394682

data/.travis.yml ADDED

@@ -0,0 +1,12 @@
+language: ruby
+rvm:
+  - 1.9.3
+  - 2.1.0
+  - jruby-head
+#  - jruby-19mode # JRuby in 1.9 mode
+#  - 1.8.7
+#  - jruby-18mode # JRuby in 1.8 mode
+#  - rbx-18mode
+# uncomment this line if your project needs to run something other than `rake`:
+# script: bundle exec rspec spec

data/Gemfile CHANGED

@@ -6,11 +6,11 @@ source "http://rubygems.org"
 # Add dependencies to develop your gem here.
 # Include everything needed to run rake, tests, features, etc.
 group :development do
-  gem "minitest", "~> 5.0.7"
-  gem "rspec", "~> 2.8.0"
-  gem "cucumber", ">= 0"
-  gem "jeweler", "~> 1.8.4", :git => "https://github.com/technicalpickles/jeweler.git"
-  gem "bundler", ">= 1.0.21"
+  # gem "minitest"
+  gem "rspec"
+  gem "cucumber"
+  gem "jeweler" # , "~> 1.8.4", :git => "https://github.com/technicalpickles/jeweler.git"
+  # gem "bundler", ">= 1.0.21"
   # gem "bio", ">= 1.4.2"
-  gem "rdoc", "~> 3.12"
+  # gem "rdoc", "~> 3.12"
 end

data/Gemfile.lock CHANGED

@@ -1,78 +1,73 @@
-GIT
-  remote: https://github.com/technicalpickles/jeweler.git
-  revision: f7e0a55a207d83f56637dd8fbabf26a803410faf
-  specs:
-    jeweler (1.8.7)
-      builder
-      bundler (~> 1.0)
-      git (>= 1.2.5)
-      github_api (= 0.10.1)
-      highline (>= 1.6.15)
-      nokogiri (= 1.5.10)
-      rake
-      rdoc
 GEM
   remote: http://rubygems.org/
   specs:
     addressable (2.3.5)
     builder (3.2.2)
-    cucumber (1.3.2)
+    cucumber (1.3.11)
       builder (>= 2.1.2)
       diff-lcs (>= 1.1.3)
-      gherkin (~> 2.12.0)
-      multi_json (~> 1.3)
-    diff-lcs (1.1.3)
-    faraday (0.8.8)
-      multipart-post (~> 1.2.0)
-    gherkin (2.12.1)
+      gherkin (~> 2.12)
+      multi_json (>= 1.7.5, < 2.0)
+      multi_test (>= 0.0.2)
+    descendants_tracker (0.0.3)
+    diff-lcs (1.2.5)
+    faraday (0.9.0)
+      multipart-post (>= 1.2, < 3)
+    gherkin (2.12.2)
       multi_json (~> 1.3)
     git (1.2.6)
-    github_api (0.10.1)
-      addressable
-      faraday (~> 0.8.1)
+    github_api (0.11.3)
+      addressable (~> 2.3)
+      descendants_tracker (~> 0.0.1)
+      faraday (~> 0.8, < 0.10)
       hashie (>= 1.2)
-      multi_json (~> 1.4)
-      nokogiri (~> 1.5.2)
+      multi_json (>= 1.7.5, < 2.0)
+      nokogiri (~> 1.6.0)
       oauth2
     hashie (2.0.5)
-    highline (1.6.19)
-    httpauth (0.2.0)
-    json (1.8.0)
-    jwt (0.1.8)
+    highline (1.6.21)
+    jeweler (2.0.1)
+      builder
+      bundler (>= 1.0)
+      git (>= 1.2.5)
+      github_api
+      highline (>= 1.6.15)
+      nokogiri (>= 1.5.10)
+      rake
+      rdoc
+    json (1.8.1)
+    jwt (0.1.11)
       multi_json (>= 1.5)
-    minitest (5.0.7)
-    multi_json (1.8.0)
+    mini_portile (0.5.2)
+    multi_json (1.9.0)
+    multi_test (0.0.3)
     multi_xml (0.5.5)
-    multipart-post (1.2.0)
-    nokogiri (1.5.10)
-    oauth2 (0.9.2)
-      faraday (~> 0.8)
-      httpauth (~> 0.2)
-      jwt (~> 0.1.4)
-      multi_json (~> 1.0)
+    multipart-post (2.0.0)
+    nokogiri (1.6.1)
+      mini_portile (~> 0.5.0)
+    oauth2 (0.9.3)
+      faraday (>= 0.8, < 0.10)
+      jwt (~> 0.1.8)
+      multi_json (~> 1.3)
       multi_xml (~> 0.5)
       rack (~> 1.2)
     rack (1.5.2)
-    rake (10.1.0)
-    rdoc (3.12.2)
+    rake (10.1.1)
+    rdoc (4.1.1)
       json (~> 1.4)
-    rspec (2.8.0)
-      rspec-core (~> 2.8.0)
-      rspec-expectations (~> 2.8.0)
-      rspec-mocks (~> 2.8.0)
-    rspec-core (2.8.0)
-    rspec-expectations (2.8.0)
-      diff-lcs (~> 1.1.2)
-    rspec-mocks (2.8.0)
+    rspec (2.14.1)
+      rspec-core (~> 2.14.0)
+      rspec-expectations (~> 2.14.0)
+      rspec-mocks (~> 2.14.0)
+    rspec-core (2.14.8)
+    rspec-expectations (2.14.5)
+      diff-lcs (>= 1.1.3, < 2.0)
+    rspec-mocks (2.14.6)
 PLATFORMS
   ruby
 DEPENDENCIES
-  bundler (>= 1.0.21)
   cucumber
-  jeweler (~> 1.8.4)!
-  minitest (~> 5.0.7)
-  rdoc (~> 3.12)
-  rspec (~> 2.8.0)
+  jeweler
+  rspec

data/README.md CHANGED

@@ -2,18 +2,24 @@
 [![Build Status](https://secure.travis-ci.org/pjotrp/bioruby-vcf.png)](http://travis-ci.org/pjotrp/bioruby-vcf)
-Yet another VCF parser. This one may give better performance and
-useful command line filtering.
+Yet another VCF parser. This one may give better performance because
+of lazy parsing and useful combinations of (fancy) command line
+filtering. For example, to filter somatic data
+```ruby
+  bio-vcf --filter 'rec.alt.size==1 and rec.tumor.bq[rec.alt]>30 and rec.tumor.mq>20' < file.vcf
+```
 The VCF format is commonly used for variant calling between NGS
 samples. The fast parser needs to carry some state, recorded for each
 file in VcfHeader, which contains the VCF file header. Individual
 lines (variant calls) first go through a raw parser returning an array
-of fields. Further (lazy) parsing is handled through VcfRecord.
+of fields. Further (lazy) parsing is handled through VcfRecord.
-Health warning: Early days, your mileage may vary because I add
-features as I go along! If something is not working, check out the
-code. It is easy to add features.
+At this point the filter is pretty generic with multi-sample support.
+If something is not working, check out the feature descriptions and
+the source code. It is not hard to add features. Otherwise, send me a short
+example of a VCF statement you need to work on.
 ## Installation
@@ -35,10 +41,17 @@ Get the version of the VCF file
 Get the column headers
 ```ruby
-  bio-vcf -q -eval-once 'header.column_names.join(",")' < file.vcf
+  bio-vcf -q --eval-once 'header.column_names.join(",")' < file.vcf
   CHROM,POS,ID,REF,ALT,QUAL,FILTER,INFO,FORMAT,NORMAL,TUMOR
 ```
+Get the sample names
+```ruby
+  bio-vcf -q --eval-once 'header.samples.join(",")' < file.vcf
+  NORMAL,TUMOR
+```
 The 'fields' array contains unprocessed data (strings).  Print first
 five raw fields
@@ -59,6 +72,12 @@ object named 'rec'. Position is a value, so we can filter a range
   bio-vcf --filter 'rec.chrom=="12" and rec.pos>96_641_270 and rec.pos<96_641_276' < file.vcf
 ```
+Info fields are referenced by
+```ruby
+  bio-vcf --filter 'rec.info.dp>100 and rec.info.readposranksum<=0.815' < file.vcf
+```
 With subfields defined by rec.format
 ```ruby
@@ -68,19 +87,23 @@ With subfields defined by rec.format
 Output
 ```ruby
-  bio-vcf --filter 'rec.tumor.gq>30' --eval '[rec.ref,rec.alt,rec.tumor.bcount,rec.tumor.gq,rec.normal.gq].join("\t")' < file.vcf
+  bio-vcf --filter 'rec.tumor.gq>30'
+    --eval '[rec.ref,rec.alt,rec.tumor.bcount,rec.tumor.gq,rec.normal.gq].join("\t")'
+    < file.vcf
 ```
 Show the count of the bases that were scored as somatic
 ```ruby
-  bio-vcf --eval 'rec.alt+"\t"+rec.tumor.bcount.split(",")[["A","C","G","T"].index(rec.alt)]+"\t"+rec.tumor.gq.to_s' < file.vcf
+  bio-vcf --eval 'rec.alt+"\t"+rec.tumor.bcount.split(",")[["A","C","G","T"].index(rec.alt)]+
+    "\t"+rec.tumor.gq.to_s' < file.vcf
 ```
 Actually, we have a convenience implementation for bcount, so this is the same
 ```ruby
-  bio-vcf --eval 'rec.alt+"\t"+rec.tumor.bcount[rec.alt].to_s+"\t"+rec.tumor.gq.to_s' < file.vcf
+  bio-vcf --eval 'rec.alt+"\t"+rec.tumor.bcount[rec.alt].to_s+"\t"+rec.tumor.gq.to_s'
+    < file.vcf
 ```
 Filter on the somatic results that were scored at least 4 times
@@ -95,6 +118,56 @@ Similar for base quality scores
   bio-vcf --filter 'rec.alt.size==1 and rec.tumor.amq[rec.alt]>30' < test.vcf
 ```
+If your samples have other names you can fetch genotypes for that
+sample with
+```sh
+  bio-vcf --eval "rec.sample['BIOPSY17513D'].gt" < file.vcf
+```
+Or read depth for another
+```sh
+  bio-vcf --eval "rec.sample['subclone46'].dp" < file.vcf
+```
+Better even, you can access samples directly with
+```sh
+  bio-vcf --eval "rec.sample.biopsy17513d.gt" < file.vcf
+  bio-vcf --eval "rec.sample.subclone46.dp" < file.vcf
+```
+For more examples see the feature [section](https://github.com/pjotrp/bioruby-vcf/tree/master/features).
+## API
+BioVcf can also be used as an API. The following code is basically
+what the command line interface uses (see ./bin/bio-vcf)
+```ruby
+  FILE.each_line do | line |
+    if line =~ /^##fileformat=/
+      # ---- We have a new file header
+      header = VcfHeader.new
+      header.add(line)
+      STDIN.each_line do | headerline |
+        if headerline !~ /^#/
+          line = headerline
+          break # end of header
+        end
+        header.add(headerline)
+      end
+    end
+    # ---- Parse VCF record line
+    # fields = VcfLine.parse(line,header.columns)
+    fields = VcfLine.parse(line)
+    rec = VcfRecord.new(fields,header)
+    #
+    # Do something with rec
+    #
+  end
+```
 ## Project home page
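
Note on the README changes above: the text describes lazy parsing and a `--filter` switch that evaluates a Ruby expression against an object named `rec`. A minimal sketch of both ideas follows. It is not the gem's implementation: `LazyRecord` and `passes?` are hypothetical names introduced here, and the real `VcfRecord` may be organised differently.

```ruby
# Hypothetical sketch, not bio-vcf code: a record that converts a field
# only when it is first asked for, and caches the result.
class LazyRecord
  def initialize(fields)
    @fields = fields # raw strings from splitting one VCF line on tabs
  end

  def chrom
    @fields[0]
  end

  # Converted to Integer on first access, then cached
  def pos
    @pos ||= @fields[1].to_i
  end

  # INFO column ("DP=120;MQ=40") parsed into a Hash on first access.
  # Keys are lower-cased; values stay strings in this sketch.
  def info
    @info ||= @fields[7].split(';').each_with_object({}) do |kv, h|
      key, value = kv.split('=', 2)
      h[key.downcase] = value
    end
  end
end

# Evaluate a filter expression: the string is plain Ruby and sees the
# method argument `rec` through the binding.
def passes?(rec, filter)
  eval(filter, binding)
end

line = "12\t96641273\t.\tA\tT\t50\t.\tDP=120;MQ=40\tGT\t0/1"
rec = LazyRecord.new(line.split("\t"))
puts passes?(rec, 'rec.chrom=="12" and rec.pos>96_641_270 and rec.pos<96_641_276')
```

Because the filter is passed to `eval`, it runs with the full privileges of the process, so a filter string should only come from a trusted source.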

data/Rakefile CHANGED

@@ -36,16 +36,16 @@ Jeweler::RubygemsDotOrgTasks.new
 #   spec.rcov = true
 # end
-require 'rake/testtask'
+# require 'rake/testtask'
-Rake::TestTask.new do |t|
-  t.pattern = "spec/*_spec.rb"
-end
+# Rake::TestTask.new do |t|
+#   t.pattern = "spec/*_spec.rb"
+# end
 require 'cucumber/rake/task'
 Cucumber::Rake::Task.new(:features)
-task :default => :spec
+task :default => :features
 require 'rdoc/task'
 Rake::RDocTask.new do |rdoc|

data/VERSION CHANGED

@@ -1 +1 @@
-0.0.1
+0.0.2

data/bin/bio-vcf CHANGED

@@ -25,21 +25,28 @@ require 'optparse'
 options = { show_help: false}
 opts = OptionParser.new do |o|
-  o.banner = "Usage: #{File.basename($0)} [options] filename\ne.g.  #{File.basename($0)} --rdf < test/data/input/somaticsniper.vcf"
+  o.banner = "Usage: #{File.basename($0)} [options] filename\ne.g.  #{File.basename($0)} < test/data/input/somaticsniper.vcf"
-  o.on_tail('--filter cmd',String, 'Evaluate filter on each record') do |cmd|
+  o.on('--filter cmd',String, 'Evaluate filter on each record') do |cmd|
     options[:filter] = cmd
   end
-  o.on_tail('-e cmd', '--eval cmd',String, 'Evaluate command on each record') do |cmd|
+  o.on('-e cmd', '--eval cmd',String, 'Evaluate command on each record') do |cmd|
     options[:eval] = cmd
   end
-  o.on_tail('--eval-once cmd',String, 'Evaluate command once') do |cmd|
+  o.on('--eval-once cmd',String, 'Evaluate command once (usually for header info)') do |cmd|
     options[:eval_once] = true
     options[:eval] = cmd
   end
-  o.on("--rdf", "Generate RDF") do |b|
+  o.on("--rdf", "Generate Turtle RDF") do |b|
+    require 'bio-vcf/vcfrdf'
     options[:rdf] = true
   end
+  o.on_tail("--id name", String, "Identifier") do |s|
+    options[:id] = s
+  end
+  o.on_tail("--tags list", String, "Add tags") do |s|
+    options[:tags] = eval(s)
+  end
   # Uncomment the following when using the bio-logger
   # o.separator ""
@@ -77,7 +84,6 @@ begin
   $stderr.print "vcf #{version} (biogem Ruby #{RUBY_VERSION}) by Pjotr Prins 2014\n" if !options[:quiet]
   if options[:show_help]
     print opts
     print USAGE
@@ -87,6 +93,7 @@ begin
   $stderr.print "Options: ",options,"\n" if !options[:quiet]
   header = VcfHeader.new
+  header_out = false
   STDIN.each_line do | line |
     if line =~ /^##fileformat=/
@@ -110,8 +117,17 @@ begin
         print eval(options[:eval])
         exit(1) if options[:eval_once]
       else
-        # Default behaviour
-        print fields.join("\t")
+        if options[:rdf]
+          # Output Turtle RDF
+          if not header_out
+            VcfRdf::header
+            header_out = true
+          end
+          VcfRdf::record(options[:id],rec,options[:tags])
+        else
+          # Default behaviour prints VCF line
+          print fields.join("\t")
+        end
       end
       print "\n"
     end
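
Note on the option changes above: the hunk moves `--filter`, `--eval` and `--eval-once` from `on_tail` to `on`, and registers the new `--id` and `--tags` with `on_tail`. In Ruby's standard `OptionParser`, `on_tail` only affects where a switch appears in the help summary (at the end); parsing is unchanged. A small sketch, where `build_parser` and the trimmed option set are illustrative and not the script's actual code:

```ruby
require 'optparse'

# Illustrative parser: regular switches use #on, trailing ones #on_tail.
# --id is registered first on purpose, to show that on_tail still places
# it after the #on switches in the help text.
def build_parser(options)
  OptionParser.new do |o|
    o.banner = "Usage: example [options]"
    o.on_tail('--id name', String, 'Identifier') { |s| options[:id] = s }
    o.on('--filter cmd', String, 'Evaluate filter on each record') do |cmd|
      options[:filter] = cmd
    end
    o.on('--rdf', 'Generate Turtle RDF') { options[:rdf] = true }
  end
end

options = {}
parser = build_parser(options)
parser.parse!(['--rdf', '--id', 'sample1', '--filter', 'rec.pos>10'])
puts parser.help # --filter and --rdf are listed before --id
p options
```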

data/bio-vcf.gemspec ADDED

@@ -0,0 +1,73 @@
+# Generated by jeweler
+# DO NOT EDIT THIS FILE DIRECTLY
+# Instead, edit Jeweler::Tasks in Rakefile, and run 'rake gemspec'
+# -*- encoding: utf-8 -*-
+Gem::Specification.new do |s|
+  s.name = "bio-vcf"
+  s.version = "0.0.2"
+  s.required_rubygems_version = Gem::Requirement.new(">= 0") if s.respond_to? :required_rubygems_version=
+  s.authors = ["Pjotr Prins"]
+  s.date = "2014-03-05"
+  s.description = "Smart parser for VCF format"
+  s.email = "pjotr.public01@thebird.nl"
+  s.executables = ["bio-vcf"]
+  s.extra_rdoc_files = [
+    "LICENSE.txt",
+    "README.md"
+  ]
+  s.files = [
+    ".travis.yml",
+    "Gemfile",
+    "Gemfile.lock",
+    "LICENSE.txt",
+    "README.md",
+    "Rakefile",
+    "VERSION",
+    "bin/bio-vcf",
+    "bio-vcf.gemspec",
+    "features/diff_count.feature",
+    "features/multisample.feature",
+    "features/somaticsniper.feature",
+    "features/step_definitions/bio-vcf_steps.rb",
+    "features/step_definitions/diff_count.rb",
+    "features/step_definitions/multisample.rb",
+    "features/step_definitions/somaticsniper.rb",
+    "features/support/env.rb",
+    "lib/bio-vcf.rb",
+    "lib/bio-vcf/variant.rb",
+    "lib/bio-vcf/vcf.rb",
+    "lib/bio-vcf/vcfgenotypefield.rb",
+    "lib/bio-vcf/vcfheader.rb",
+    "lib/bio-vcf/vcfline.rb",
+    "lib/bio-vcf/vcfrdf.rb",
+    "lib/bio-vcf/vcfrecord.rb",
+    "test/data/input/multisample.vcf",
+    "test/data/input/somaticsniper.vcf"
+  ]
+  s.homepage = "http://github.com/pjotrp/bioruby-vcf"
+  s.licenses = ["MIT"]
+  s.require_paths = ["lib"]
+  s.rubygems_version = "2.0.3"
+  s.summary = "VCF parser"
+  if s.respond_to? :specification_version then
+    s.specification_version = 4
+    if Gem::Version.new(Gem::VERSION) >= Gem::Version.new('1.2.0') then
+      s.add_development_dependency(%q<rspec>, [">= 0"])
+      s.add_development_dependency(%q<cucumber>, [">= 0"])
+      s.add_development_dependency(%q<jeweler>, [">= 0"])
+    else
+      s.add_dependency(%q<rspec>, [">= 0"])
+      s.add_dependency(%q<cucumber>, [">= 0"])
+      s.add_dependency(%q<jeweler>, [">= 0"])
+    end
+  else
+    s.add_dependency(%q<rspec>, [">= 0"])
+    s.add_dependency(%q<cucumber>, [">= 0"])
+    s.add_dependency(%q<jeweler>, [">= 0"])
+  end
+end

data/features/diff_count.feature ADDED

@@ -0,0 +1,30 @@
+@diff
+Feature: Variant calling (filters) - diffing nucleotide counts
+  Basic filtering happens on the command line with the --filter switch. To
+  support somewhat more advanced features the following features are
+  included.
+  When diffing nucleotide counts we want to find out which nucleotide defines
+  the tumor. The difference has to be larger than 0 and the relative difference
+  is the max. When a threshold is set only those nucleotides are included which
+  pass the threshold (i.e., no more than x supporting nucleotides in the
+  reference).
+  The advantage is that filtering is possible without actually looking at
+  the rec.alt and rec.ref values, i.e., no assumptions are being made
+  about the underlying nucleotides.
+  Scenario: Diffing nucleotide counts
+    Given normal and tumor counts [0,25,0,1] and [0,40,0,12]
+    When I look for the difference
+    Then I expect the diff to be [0,15,0,11]
+    And the relative diff to be [0,0.23,0,0.85]
+    And I expect the defining tumor nucleotide to be "T"
+    And I expect the tumor count to be 12
+    When I set an inclusion threshold for the reference
+    Then I expect the diff for threshold 2 to be [0,0,0,11]
+    And the relative diff to be [0,0,0,0.85]
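
Note on the scenario above: its numbers can be reproduced with a few lines of plain Ruby. The sketch below is not the code in `lib/bio-vcf/variant.rb` (that file's content is not shown here); the function names are hypothetical. The relative diff formula, diff divided by the combined normal and tumor count and rounded to two decimals, is an assumption inferred from the expected values (15/65 ≈ 0.23, 11/13 ≈ 0.85).

```ruby
NUCLEOTIDES = %w[A C G T].freeze

# Per-nucleotide difference, tumor minus normal
def diff(normal, tumor)
  normal.zip(tumor).map { |n, t| t - n }
end

# As diff, but a nucleotide is kept only when its difference is positive
# and the normal sample has at most `threshold` supporting counts
def threshold_diff(threshold, normal, tumor)
  normal.zip(tumor).map do |n, t|
    d = t - n
    d > 0 && n <= threshold ? d : 0
  end
end

# Assumed formula: positive diff divided by the total count at that
# nucleotide, rounded to two decimals; everything else becomes 0
def relative_diff(normal, tumor, diffs)
  diffs.each_with_index.map do |d, i|
    total = normal[i] + tumor[i]
    d > 0 && total > 0 ? (d.to_f / total).round(2) : 0
  end
end

# The nucleotide with the largest relative diff, or nil if none is positive
def defining_nucleotide(normal, tumor)
  rel = relative_diff(normal, tumor, diff(normal, tumor))
  best = rel.max
  best > 0 ? NUCLEOTIDES[rel.index(best)] : nil
end

normal = [0, 25, 0, 1]
tumor  = [0, 40, 0, 12]
d = diff(normal, tumor)
p d                                   # [0, 15, 0, 11]
p relative_diff(normal, tumor, d)     # [0, 0.23, 0, 0.85]
p defining_nucleotide(normal, tumor)  # "T"
t = threshold_diff(2, normal, tumor)
p t                                   # [0, 0, 0, 11]
p relative_diff(normal, tumor, t)     # [0, 0, 0, 0.85]
```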

data/features/multisample.feature ADDED

@@ -0,0 +1,37 @@
+@multi
+Feature: Multi-sample VCF
+  Here we take a VCF line and parse the information for multiple named
+  samples
+  Scenario: When parsing a record
+    Given the multi sample header line
+    """
+#CHROM  POS     ID      REF     ALT     QUAL    FILTER  INFO    FORMAT  BIOPSY17513D    clone10 clone3  clone4  subclone105     subclone33      subclone46
+    """
+    When I parse the header
+    Given multisample vcf line
+    """
+1       10321   .       C       T       106.30  .       AC=5;AF=0.357;AN=14;BaseQRankSum=3.045;DP=1537;Dels=0.01;FS=5.835;HaplotypeScore=220.1531;MLEAC=5;MLEAF=0.357;MQ=26.69;MQ0=258;MQRankSum=-4.870;QD=0.10;ReadPosRankSum=0.815    GT:AD:DP:GQ:PL  0/1:189,25:218:30:30,0,810      0/0:219,22:246:24:0,24,593      0/1:218,27:248:34:34,0,1134     0/0:220,22:248:56:0,56,1207     0/1:168,23:193:19:19,0,493      0/1:139,22:164:46:46,0,689      0/1:167,26:196:20:20,0,522
+    """
+    When I parse the record
+    Then I expect rec.chrom to contain "1"
+    Then I expect rec.pos to contain 10321
+    Then I expect rec.ref to contain "C"
+    And I expect multisample rec.alt to contain ["T"]
+    And I expect rec.qual to be 106.30
+    And I expect rec.info.ac to be 5
+    And I expect rec.info.af to be 0.357
+    And I expect rec.info.dp to be 1537
+    And I expect rec.info.readposranksum to be 0.815
+    And I expect rec.sample['BIOPSY17513D'].gt to be "0/1"
+    And I expect rec.sample['BIOPSY17513D'].ad to be [189,25]
+    And I expect rec.sample['subclone46'].ad to be [167,26]
+    And I expect rec.sample['subclone46'].dp to be 196
+    And I expect rec.sample['subclone46'].gq to be 20
+    And I expect rec.sample['subclone46'].pl to be [20,0,522]
+    # And the nicer self resolving
+    And I expect rec.sample.biopsy17513d.gt to be [0,1]
+    And I expect rec.sample.subclone46.pl to be [20,0,522]
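
Note on the scenario above: the mapping from the header's sample columns to per-sample FORMAT values can be illustrated without the gem. `parse_multisample` below is a hypothetical helper, not part of bio-vcf. It splits on any whitespace so that it accepts the lines as displayed here (a real VCF file is tab-separated), and it leaves all values as strings, whereas the gem converts them to numbers and arrays as the expectations show.

```ruby
# Hypothetical helper: pair each sample name from the header with its
# FORMAT subfields from one record line.
def parse_multisample(header_line, record_line)
  columns = header_line.sub(/^#/, '').split
  fields  = record_line.split
  sample_names = columns[9..-1]   # columns after FORMAT are samples
  format_keys  = fields[8].split(':')

  samples = {}
  sample_names.each_with_index do |name, i|
    values = fields[9 + i].split(':')
    samples[name] = Hash[format_keys.zip(values)]
  end

  info = {}
  fields[7].split(';').each do |kv|
    key, value = kv.split('=', 2) # flag-only keys get a nil value
    info[key] = value
  end

  { chrom: fields[0], pos: fields[1].to_i, ref: fields[3],
    alt: fields[4].split(','), info: info, samples: samples }
end

header = "#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT BIOPSY17513D clone10"
record = "1 10321 . C T 106.30 . AC=5;AF=0.357;DP=1537 GT:AD:DP:GQ:PL " \
         "0/1:189,25:218:30:30,0,810 0/0:219,22:246:24:0,24,593"
rec = parse_multisample(header, record)
p rec[:samples]['BIOPSY17513D']['GT']
p rec[:samples]['clone10']['DP']
p rec[:info]['DP']
```

The example uses a shortened copy of the scenario's header and record (two samples, three INFO keys) to keep the lines readable.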

data/features/somaticsniper.feature ADDED

@@ -0,0 +1,84 @@
+@sniper
+Feature: VCF for Somatic Sniper
+  Here we take a VCF line and parse the information given by Somatic Sniper.
+  At this position the reference contains: AAAGAAAAGAAAAA  (12A,2G)
+  At this position the tumor contains:     AAAAACACAA      (8A,2C)
+  rec.alt contains variants C,G.  rec.tumor.bcount reflects the contents of the
+  tumor (8A,2C) so rec.tumor.bcount[rec.alt] reflects the actual number of
+  variants in the tumor.
+  The mapping quality in the BAM file is 37/37 and base quality is 55/60 in normal
+  and tumor respectively.
+  For the second scenario:
+  At this position the reference contains: (15A)
+  At this position the tumor contains:     AAAAAAAAATATTA (13A, 3T)
+  Scenario: When parsing a record
+    Given the somatic sniper vcf line
+    """
+1       27691244        .       A       C,G     .       .       .       GT:IGT:DP:DP4:BCOUNT:GQ:JGQ:VAQ:BQ:MQ:AMQ:SS:SSC        0/2:0/2:14:0,12,0,2:12,0,2,0:14:35:14:14,35:37:37,37:1:.        0/1:0/1:10:0,8,0,2:8,2,0,0:18:35:18:20,51:37:37,37:2:33
+    """
+    When I parse the record
+    Then I expect rec.chrom to contain "1"
+    Then I expect rec.pos to contain 27691244
+    Then I expect rec.ref to contain "A"
+    And I expect rec.alt to contain ["C","G"]
+    And I expect rec.tumor.dp to be 10
+    And I expect rec.tumor.dp4 to be [0,8,0,2]
+    And I expect rec.tumor.bcount.to_ary to be [8,2,0,0]
+    And I expect rec.tumor.bcount[rec.alt] to be [2,0]
+    And I expect rec.tumor.bcount["G"] to be 0
+    And I expect rec.tumor.bcount[1] to be 2
+    And I expect rec.tumor.bcount[3] to be 0
+    And I expect rec.tumor.bcount.sum to be 2
+    And I expect rec.tumor.bcount.max to be 2
+    And I expect rec.tumor.bq.to_ary to be [20,51]
+    And I expect rec.tumor.bq["G"] to be 51
+    And I expect rec.tumor.bq[1] to be 51
+    And I expect rec.tumor.bq.min to be 20
+    And I expect rec.tumor.bq.max to be 51
+    And I expect rec.tumor.amq.to_ary to be [37,37]
+    And I expect rec.tumor.mq to be 37
+    And I expect rec.tumor.ss to be 2
+    # The following are additional functions
+    And I expect rec.call_diff to be [-4,2,-2,0]
+    And I expect rec.call_nuc to be "C"
+    And I expect rec.call_tumor_count to be 2
+    And I expect rec.call_normal_count to be 0
+    And I expect rec.call_tumor_relative_count to be 1.0
+    Given the somatic sniper vcf line
+    """
+1 27686841  . A T . . . GT:IGT:DP:DP4:BCOUNT:GQ:JGQ:VAQ:BQ:MQ:AMQ:SS:SSC  0/0:0/0:15:3,12,0,0:15,0,0,0:66:37:0:25:37:37:0:. 0/1:0/1:16:2,11,0,3:13,0,0,3:30:37:30:34,55:37:37,37:2:37
+    """
+    When I parse the record
+    Then I expect rec.chrom to contain "1"
+    Then I expect rec.pos to contain 27686841
+    Then I expect rec.ref to contain "A"
+    And I expect rec.alt to contain one ["T"]
+    And I expect rec.tumor.dp to be 16
+    And I expect rec.tumor.dp4 to be [2,11,0,3]
+    And I expect rec.tumor.bcount.to_ary to be [13,0,0,3]
+    And I expect rec.tumor.bcount[rec.alt] to be one [3]
+    And I expect rec.tumor.bcount["G"] to be 0
+    And I expect rec.tumor.bcount["T"] to be 3
+    And I expect rec.tumor.bcount[1] to be 0
+    And I expect rec.tumor.bcount[3] to be 3
+    And I expect rec.tumor.bcount.sum to be 3
+    And I expect rec.tumor.bcount.max to be 3
+    And I expect rec.tumor.bq.to_ary to be [34,55]
+    And I expect rec.tumor.bq["T"] to be 34
+    And I expect rec.tumor.bq[1] to be 55
+    And I expect rec.tumor.bq.min to be 34
+    And I expect rec.tumor.bq.max to be 55
+    And I expect rec.tumor.amq.to_ary to be [37,37]
+    And I expect rec.tumor.mq to be 37
+    And I expect rec.tumor.ss to be 2
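
Note on the `bcount` expectations above: they imply that BCOUNT holds counts in A,C,G,T order and can be indexed by position, by nucleotide, or by the list of ALT alleles, with `sum` and `max` taken over the ALT alleles only. The sketch below encodes that reading. `BaseCounts` is a hypothetical class, not the gem's; the actual implementation lives in `lib/bio-vcf/vcfgenotypefield.rb`, whose content is not shown here.

```ruby
# Hypothetical model of a BCOUNT field, inferred from the scenarios above
class BaseCounts
  NUCLEOTIDES = %w[A C G T].freeze

  attr_reader :counts

  # counts: four integers in A,C,G,T order; alt: list of ALT alleles
  def initialize(counts, alt)
    @counts = counts
    @alt = alt
  end

  # Index by position (Integer), nucleotide (String) or allele list (Array)
  def [](key)
    case key
    when Integer then @counts[key]
    when String  then @counts[NUCLEOTIDES.index(key)]
    when Array   then key.map { |nuc| self[nuc] }
    end
  end

  # Total count over the ALT alleles only
  def sum
    self[@alt].inject(0, :+)
  end

  # Largest count among the ALT alleles
  def max
    self[@alt].max
  end
end

bcount = BaseCounts.new([8, 2, 0, 0], %w[C G])
p bcount[%w[C G]]  # [2, 0]
p bcount["G"]      # 0
p bcount[1]        # 2
p bcount.sum       # 2
p bcount.max       # 2
```

The `bq` expectations follow a different rule (values appear to be looked up by the allele's position in the ALT list rather than in A,C,G,T order), so this sketch does not cover them.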