RubyGems - bio-grid - Versions diffs - 0.3.0 → 0.3.1 - Mend

bio-grid 0.3.0 → 0.3.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5) hide show

data/README.md CHANGED

@@ -25,9 +25,9 @@ What is happening here is the following:
 * the ```-i``` options specifies the input files or, as in this case, the location where to find input files based on a typical wildcard expression. You can actually specify as many input files/locations as you need using a comma separated list.
 * the ```-n``` specify the job name
-* the ```-c``` is the command line to be executed on the cluster / grid system. What BioGrid does is to fill in the ```<input1>```,```<input2>``` and ```<output>``` placeholders with the corresponding parameters passed on the command line. This is done for each input file (or each group of input files) and BioGrid will check if the ```<output>``` placeholder has an extension (like .sam, .out etc.) and will generate a unique output file name for each job.
+* the ```-c``` is the command line to be executed on the cluster / grid system. What BioGrid does is to fill in the ```<input1>```,```<input2>``` and ```<output>``` placeholders with the corresponding parameters passed on the command line. This is done for each input file (or each group of input files), taking care of generating a unique output name for each job submitted.
 * the ```-o``` set the location where output files for each job will be saved. Only provide the folder where you want to save the output file(s), BioGrid will take care of generating a unique file name for the output, if needed. Check the [Output management](https://github.com/fstrozzi/bioruby-grid#output-management) for more details.
-* the ```-s``` is a key parameter to specify the granularity of the jobs, setting the number of input files (or group of files, when more than one input placeholder is present in the command line) to be used for each job. So, going back to the FastQ example, if -s 1 is specified, each job will be run with exactly one FastQ R1 file and one FastQ R2 file. This gives you a great power in deciding how to split the entire dataset analysis across multiple computing nodes.
+* the ```-s``` is a key parameter to specify the granularity of the jobs, setting the number of input files (or group of files, when more than one input placeholder is present in the command line) to be used for each job. So, going back to the FastQ example, if ```-s 1``` is specified, each job will be run with exactly one FastQ R1 file and one FastQ R2 file (corresponding to the ```<input1>``` and ```<input2>``` placeholders). This gives you a great power in deciding how to split the entire input dataset across multiple computing nodes to carry on the analysis.
 * the ```-p``` parameter indicates how many processes we want to use for each job. This number needs to match with the actual number of threads / processes that our command or tool will use for the analysis.
 All of this is just turned into a submission script that will look like this:

data/VERSION CHANGED

	@@ -1 +1 @@
1	- 0.3.0
1	+ 0.3.1

data/bio-grid.gemspec CHANGED

@@ -5,11 +5,11 @@
 Gem::Specification.new do |s|
   s.name = "bio-grid"
-  s.version = "0.3.0"
+  s.version = "0.3.1"
   s.required_rubygems_version = Gem::Requirement.new(">= 0") if s.respond_to? :required_rubygems_version=
   s.authors = ["Francesco Strozzi"]
-  s.date = "2012-09-24"
+  s.date = "2012-09-25"
   s.description = "A BioGem to submit jobs on a queue system"
   s.email = "francesco.strozzi@gmail.com"
   s.executables = ["bio-grid"]

data/lib/bio/grid.rb CHANGED

@@ -10,7 +10,7 @@ module Bio
 		end
 		def self.run(options)
-			options[:number] = 1 unless options[:number]
+			options[:number] = "all" unless options[:number]
 			grid = self.new options[:input], options[:number]
 			options[:uuid] = grid.uuid
 			groups = grid.prepare_input_groups
@@ -41,13 +41,15 @@ module Bio
 			end
 		end
-		def	prepare_input_groups
+		def	prepare_input_groups
 			groups = Hash.new {|h,k| h[k] = [] }
 			self.input.each_with_index do |location,index|
+				list = Dir.glob(location).sort
+				raise ArgumentError,"Input file or folder #{location} do not exist!" if list.empty?
 				if self.number == "all"
-					groups["input#{index+1}"] = [Dir.glob(location).sort]
+					groups["input#{index+1}"] = [list]
 				else
-					Dir.glob(location).sort.each_slice(self.number.to_i) {|subgroup| groups["input#{index+1}"] << subgroup}
+					list.each_slice(self.number.to_i) {|subgroup| groups["input#{index+1}"] << subgroup}
 				end
 			end
 			groups

metadata CHANGED

@@ -1,7 +1,7 @@
 --- !ruby/object:Gem::Specification
 name: bio-grid
 version: !ruby/object:Gem::Version
-  version: 0.3.0
+  version: 0.3.1
   prerelease:
 platform: ruby
 authors:
@@ -9,7 +9,7 @@ authors:
 autorequire:
 bindir: bin
 cert_chain: []
-date: 2012-09-24 00:00:00.000000000 Z
+date: 2012-09-25 00:00:00.000000000 Z
 dependencies:
 - !ruby/object:Gem::Dependency
   name: uuid
@@ -146,7 +146,7 @@ required_ruby_version: !ruby/object:Gem::Requirement
       version: '0'
       segments:
       - 0
-      hash: 2907219030788310971
+      hash: -4026493835135905673
 required_rubygems_version: !ruby/object:Gem::Requirement
   none: false
   requirements: