RubyGems: better-benchmark 0.7.0 → 0.8.1


Files changed (8)

data/README.md +115 -0
data/bin/bbench +6 -0
data/example.rb +1 -0
data/lib/better-benchmark.rb +37 -48
data/lib/better-benchmark/bencher.rb +95 -0
data/lib/better-benchmark/comparison-partial.rb +33 -0
metadata +45 -15
data/README +0 -34

data/README.md ADDED

@@ -0,0 +1,115 @@
+# Better Benchmark
+Statistically correct benchmarking for Ruby.
+## Dependencies
+* [The R Project](http://www.r-project.org/)
+* [rsruby](http://github.com/alexgutteridge/rsruby)
+## Usage
+### Comparing code blocks
+    result = Benchmark.compare_realtime {
+      do_something_one_way
+    }.with {
+      do_it_another_way
+    }
+    Benchmark.report_on result
+See also example.rb for a more comprehensive example.
+### Comparing git revisions
+#### With a test script (recommended)
+To test two revisions of a library, create a simple runner script:
+    # runner.rb
+    require 'mylib'
+    class TestQuick
+      def initialize
+        # initialization...
+      end
+      def run
+        Benchmark.write_realtime( '/home/pistos/tmp' ) do
+          5000.times do
+            # do something with your lib
+          end
+        end
+      end
+    end
+    t = TestQuick.new
+    t.run
+Then run the bbench script, passing two git revisions:
+    bbench -r 6e84dd5 -r ed1e7c6 -d ~/tmp -- -Ilib runner.rb
+#### Without altering or writing new code
+You can also test two revisions by running some already-existing script,
+such as a file in your test suite:
+    bbench -r 6e84dd5 -r ed1e7c6 -- -Itest -Ilib test/test_something.rb
+Be aware, however, that this may produce unnecessarily variant timings due to
+wide variance in the startup time of the Ruby interpreter and script.
+### Comparing git working copy
+You can also compare the current branch tip to the current (dirty) working copy:
+    bbench -w -d ~/tmp -- -Ilib runner.rb
+This lets you experiment without committing anything, and then only commit
+when you are confident that your changes result in a performance improvement.
+## Interpretation
+Considering two "things under test", U1 and U2:
+### Example 1
+    Set 1 mean: 0.216 s
+    Set 1 std dev: 0.023
+    Set 2 mean: 0.187 s
+    Set 2 std dev: 0.020
+    p.value: 0.00287947346770876
+    W: 88.0
+    The difference (-13.5%) IS statistically significant.
+This means that the results permit us to conclude that U2 performed 13.5%
+faster than U1.
+### Example 2
+    Set 1 mean: 10.968 s
+    Set 1 std dev: 4.294
+    Set 2 mean: 9.036 s
+    Set 2 std dev: 3.581
+    p.value: 0.217562623135379
+    W: 67.0
+    The difference (-17.6%) IS NOT statistically significant.
+This means that the results do not permit us to conclude that the performance
+of U1 and U2 differed.
+## Not just Ruby
+Technically, the bbench script can work with any script or program that writes
+a run time (in seconds) to the file bbench-run-time in the data dir.  Use the
+-e option to specify a different executable than "ruby".  e.g. perl, python,
+java, etc.
+## Help, etc.
+irc.freenode.net#mathetes or http://webchat.freenode.net?channels=mathetes .
+## Repository
+git clone git://github.com/Pistos/better-benchmark.git
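
The "Not just Ruby" section of the new README describes a file-based contract: the program under test writes its elapsed time, in seconds, to a file named `bbench-run-time` in the data directory, and `bbench` reads that number back. A minimal sketch of that round trip, using only the standard library, might look like the following. The module and method names are hypothetical stand-ins, not part of the gem; only the file name is taken from the diff.

```ruby
require 'benchmark'
require 'tmpdir'

# Hypothetical stand-in for the gem's data-file handshake.
module DataFileSketch
  # Same file name as Bencher::DATA_FILE in the diff.
  DATA_FILE = 'bbench-run-time'

  # Time the block and write the elapsed seconds to the data file,
  # roughly what Benchmark.write_realtime does in 0.8.1.
  def self.write_elapsed(data_dir, &block)
    elapsed = Benchmark.realtime(&block)
    File.open(File.join(data_dir, DATA_FILE), 'w') { |f| f.print elapsed }
    elapsed
  end

  # Read the elapsed seconds back as a Float, as the bencher side does.
  def self.read_elapsed(data_dir)
    File.read(File.join(data_dir, DATA_FILE)).to_f
  end
end

Dir.mktmpdir do |dir|
  written = DataFileSketch.write_elapsed(dir) { 1000.times { |i| i * i } }
  puts "wrote #{written} s, read back #{DataFileSketch.read_elapsed(dir)} s"
end
```

Any program in any language could take the place of `write_elapsed`, provided it leaves a plain decimal number of seconds in that file.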

data/bin/bbench ADDED

@@ -0,0 +1,6 @@
+#!/usr/bin/env ruby
+require 'better-benchmark'
+b = ::Benchmark::Bencher.new( ARGV.dup )
+b.run

data/example.rb CHANGED

@@ -1,5 +1,6 @@
 #!/usr/bin/env ruby
+require 'rubygems'
 require 'better-benchmark'
 # Provide two blocks of code to compare.  For example, two blocks that

data/lib/better-benchmark.rb CHANGED

@@ -1,59 +1,49 @@
 require 'benchmark'
 require 'rsruby'
+require 'better-benchmark/comparison-partial'
+require 'better-benchmark/bencher'
 module Benchmark
-  BETTER_BENCHMARK_VERSION = '0.7.0'
+  BETTER_BENCHMARK_VERSION = '0.8.0'
+  DEFAULT_REQUIRED_SIGNIFICANCE = 0.01
-  class ComparisonPartial
-    def initialize( block, options )
-      @block1 = block
-      @options = options
+  def self.write_realtime( data_dir, &block )
+    t = Benchmark.realtime( &block )
+    File.open( "#{data_dir}/#{Bencher::DATA_FILE}", 'w' ) do |f|
+      f.print t
     end
+  end
-    def with( &block2 )
-      times1 = []
-      times2 = []
-      (1..@options[ :iterations ]).each do |iteration|
-        if @options[ :verbose ]
-          $stdout.print "."; $stdout.flush
-        end
-        times1 << Benchmark.realtime do
-          @options[ :inner_iterations ].times do |i|
-            @block1.call( iteration )
-          end
-        end
-        times2 << Benchmark.realtime do
-          @options[ :inner_iterations ].times do |i|
-            block2.call( iteration )
-          end
-        end
-      end
-      r = RSRuby.instance
-      wilcox_result = r.wilcox_test( times1, times2 )
+  # The number of elements in times1 and times2 should be the same.
+  # @param [Array] times1
+  #   An Array of elapsed times in float form, measured in seconds
+  # @param [Array] times2
+  #   An Array of elapsed times in float form, measured in seconds
+  # @param [Fixnum] required_significance
+  #   The maximum p value needed to declare statistical significance
+  def self.compare_times( times1, times2, required_significance = DEFAULT_REQUIRED_SIGNIFICANCE )
+    r = RSRuby.instance
+    wilcox_result = r.wilcox_test( times1, times2 )
-      {
-        :results1 => {
-          :times => times1,
-          :mean => r.mean( times1 ),
-          :stddev => r.sd( times1 ),
-        },
-        :results2 => {
-          :times => times2,
-          :mean => r.mean( times2 ),
-          :stddev => r.sd( times2 ),
-        },
-        :p => wilcox_result[ 'p.value' ],
-        :W => wilcox_result[ 'statistic' ][ 'W' ],
-        :significant => (
-          wilcox_result[ 'p.value' ] < @options[ :required_significance ]
-        ),
-      }
-    end
-    alias to with
+    {
+      :results1 => {
+        :times => times1,
+        :mean => r.mean( times1 ),
+        :stddev => r.sd( times1 ),
+      },
+      :results2 => {
+        :times => times2,
+        :mean => r.mean( times2 ),
+        :stddev => r.sd( times2 ),
+      },
+      :p => wilcox_result[ 'p.value' ],
+      :W => wilcox_result[ 'statistic' ][ 'W' ],
+      :significant => (
+        wilcox_result[ 'p.value' ] < ( required_significance || DEFAULT_REQUIRED_SIGNIFICANCE )
+      ),
+    }
   end
   # Options:
@@ -85,7 +75,6 @@ module Benchmark
   def self.compare_realtime( options = {}, &block1 )
     options[ :iterations ] ||= 20
     options[ :inner_iterations ] ||= 1
-    options[ :required_significance ] ||= 0.01
     if options[ :iterations ] > 30
       warn "The number of iterations is set to #{options[ :iterations ]}.  " +
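
The significance decision that this hunk moves into `compare_times` is a plain threshold comparison: the result is significant when the p-value falls below the supplied threshold, falling back to `DEFAULT_REQUIRED_SIGNIFICANCE` (0.01) when none is given. The sketch below isolates that rule so it can be checked without R or rsruby. The module and method names are hypothetical; the p-values are the two from the README examples in this diff.

```ruby
# Hypothetical stand-in that isolates the threshold rule from compare_times.
module SignificanceSketch
  # Same default as DEFAULT_REQUIRED_SIGNIFICANCE in the diff.
  DEFAULT_REQUIRED_SIGNIFICANCE = 0.01

  # True when the p-value is below the threshold; a nil threshold
  # falls back to the default, mirroring the `||` in the diff.
  def self.significant?(p_value, required_significance = nil)
    p_value < (required_significance || DEFAULT_REQUIRED_SIGNIFICANCE)
  end
end

# p-values copied from README Example 1 and Example 2.
puts SignificanceSketch.significant?(0.00287947346770876)
puts SignificanceSketch.significant?(0.217562623135379)
```

With the default threshold, the first p-value passes and the second does not, which matches the "IS" and "IS NOT statistically significant" lines in the README examples.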

data/lib/better-benchmark/bencher.rb ADDED

@@ -0,0 +1,95 @@
+module Benchmark
+  class Bencher
+    DATA_FILE = 'bbench-run-time'
+    def print_usage
+      puts "#{$0} [-i <iterations>] [-w] [-r <revision 1> -r <revision 2>] [-p <max p-value>] [-d <data tmp dir>] [-e <executable/interpreter>] -- <executable's args...>"
+    end
+    # @param [Array] argv
+    #   The command line arguments passed to the bencher script
+    def initialize( argv )
+      @iterations = 10
+      @executable = 'ruby'
+      while argv.any?
+        arg = argv.shift
+        case arg
+        when '-d'
+          @data_dir = argv.shift
+          begin
+            if ! File.stat( @data_dir ).directory?
+              $stderr.puts "#{@data_dir} is not a directory."
+              exit 3
+            end
+          rescue Errno::ENOENT
+            $stderr.puts "#{@data_dir} does not exist."
+            exit 4
+          end
+        when '-e'
+          @executable = argv.shift
+        when '-i'
+          @iterations = argv.shift.to_i
+        when '-p'
+          @max_p = argv.shift
+        when '-r'
+          if @r1.nil?
+            @r1 = argv.shift
+          else
+            @r2 = argv.shift
+          end
+        when '-w'
+          @test_working_copy = true
+        when '--'
+          @executable_args = argv.dup
+          argv.clear
+        end
+      end
+      if ( ! @test_working_copy && ( @r1.nil? || @r2.nil? ) ) || @executable_args.nil?
+        print_usage
+        exit 2
+      end
+    end
+    def one_run
+      system "#{@executable} #{ @executable_args.join(' ') }"  or exit $?
+    end
+    def time_one_run
+      if @data_dir
+        one_run
+        File.read( "#{@data_dir}/#{DATA_FILE}" ).to_f
+      else
+        t0 = Time.now
+        one_run
+        Time.now.to_f - t0.to_f
+      end
+    end
+    def run
+      times1 = []
+      times2 = []
+      @iterations.times do
+        if @test_working_copy
+          system "git stash -q"  or exit $?
+        else
+          system "git checkout #{@r1}"  or exit $?
+        end
+        times1 << time_one_run
+        if @test_working_copy
+          system "git stash pop -q"  or exit $?
+        else
+          system "git checkout #{@r2}"  or exit $?
+        end
+        times2 << time_one_run
+      end
+      ::Benchmark.report_on(
+        ::Benchmark.compare_times( times1, times2, @max_p )
+      )
+    end
+  end
+end
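
One detail in this file is worth noting when reading the diff: `-i` is converted with `to_i`, but the `-p` value is stored exactly as it arrives from `ARGV`, which is a String, and is later passed to `compare_times`, where it is compared against a Float p-value. In Ruby, comparing a Float with a String raises `ArgumentError`. The sketch below demonstrates the failure and one possible conversion. The helper `max_p_from` is hypothetical and not part of the gem.

```ruby
# Hypothetical helper (not part of the gem): find the value following -p
# and convert it to a Float, failing loudly on malformed input.
def max_p_from(argv)
  index = argv.index('-p')
  return nil if index.nil?
  Float(argv[index + 1])  # ArgumentError for non-numeric text, TypeError for nil
end

p_value = 0.00287947346770876  # p-value from README Example 1

begin
  p_value < '0.05'  # what an unconverted ARGV string leads to
rescue ArgumentError => e
  puts "String threshold fails: #{e.message}"
end

# After conversion the comparison behaves as intended.
puts p_value < max_p_from(%w[-p 0.05 -- -Ilib runner.rb])
```

Using `Float()` rather than `to_f` is a design choice in this sketch: `to_f` silently turns unparseable text into `0.0`, which would make no result ever count as significant.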

data/lib/better-benchmark/comparison-partial.rb ADDED

@@ -0,0 +1,33 @@
+module Benchmark
+  class ComparisonPartial
+    def initialize( block, options )
+      @block1 = block
+      @options = options
+    end
+    def with( &block2 )
+      times1 = []
+      times2 = []
+      (1..@options[ :iterations ]).each do |iteration|
+        if @options[ :verbose ]
+          $stdout.print "."; $stdout.flush
+        end
+        times1 << Benchmark.realtime do
+          @options[ :inner_iterations ].times do |i|
+            @block1.call( iteration )
+          end
+        end
+        times2 << Benchmark.realtime do
+          @options[ :inner_iterations ].times do |i|
+            block2.call( iteration )
+          end
+        end
+      end
+      ::Benchmark.compare_times( times1, times2, @options[ :required_significance ] )
+    end
+    alias to with
+  end
+end
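
The extracted class keeps the two-step calling pattern from the README: the first block is captured in a partial object, and `with` supplies the second block, then times both in interleaved iterations. A minimal sketch of that pattern, with the R-backed statistics left out so it runs on the standard library alone, could look like this. `TimingPartial` and `compare_timings` are hypothetical names, and it is an assumption that `compare_realtime` builds its `ComparisonPartial` in a similar way, since that part of the file is not shown in the diff.

```ruby
require 'benchmark'

# Hypothetical, statistics-free analogue of Benchmark::ComparisonPartial.
class TimingPartial
  def initialize(block, iterations)
    @block1 = block
    @iterations = iterations
  end

  # Time both blocks once per iteration, alternating between them so
  # that slow drift in machine load affects both sets about equally.
  def with(&block2)
    times1 = []
    times2 = []
    (1..@iterations).each do |iteration|
      times1 << Benchmark.realtime { @block1.call(iteration) }
      times2 << Benchmark.realtime { block2.call(iteration) }
    end
    { :times1 => times1, :times2 => times2 }
  end
end

# Hypothetical entry point mirroring the compare_realtime { }.with { } chain.
def compare_timings(iterations = 5, &block1)
  TimingPartial.new(block1, iterations)
end

result = compare_timings(5) { |i|
  (1..1000).reduce(:+)
}.with { |i|
  (1..1000).inject { |sum, n| sum + n }
}
puts "collected #{result[:times1].size} and #{result[:times2].size} timings"
```

In the gem itself, the two arrays are handed to `compare_times` instead of being returned directly.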

metadata CHANGED

@@ -1,7 +1,13 @@
 --- !ruby/object:Gem::Specification
 name: better-benchmark
 version: !ruby/object:Gem::Version
-  version: 0.7.0
+  hash: 61
+  prerelease: false
+  segments:
+  - 0
+  - 8
+  - 1
+  version: 0.8.1
 platform: ruby
 authors:
 - Pistos
@@ -9,50 +15,74 @@ autorequire:
 bindir: bin
 cert_chain: []
-date: 2009-02-11 00:00:00 -05:00
+date: 2010-09-10 00:00:00 -04:00
 default_executable:
-dependencies: []
+dependencies:
+- !ruby/object:Gem::Dependency
+  name: rsruby
+  prerelease: false
+  requirement: &id001 !ruby/object:Gem::Requirement
+    none: false
+    requirements:
+    - - ">="
+      - !ruby/object:Gem::Version
+        hash: 3
+        segments:
+        - 0
+        version: "0"
+  type: :runtime
+  version_requirements: *id001
 description: Statistically correct benchmarking for Ruby.
-email: pistos at purepistos dot net
-executables: []
+email: betterbenchmark dot pistos at purepistos dot net
+executables:
+- bbench
 extensions: []
 extra_rdoc_files:
-- README
+- README.md
 - LICENCE
 files:
-- README
+- README.md
 - LICENCE
 - example.rb
 - run-example
 - lib/better-benchmark.rb
-has_rdoc: false
-homepage: http://github.com/Pistos/better-benchmark/tree
+- lib/better-benchmark/bencher.rb
+- lib/better-benchmark/comparison-partial.rb
+- bin/bbench
+has_rdoc: true
+homepage: http://github.com/Pistos/better-benchmark
+licenses: []
 post_install_message:
 rdoc_options: []
 require_paths:
 - lib
 required_ruby_version: !ruby/object:Gem::Requirement
+  none: false
   requirements:
   - - ">="
     - !ruby/object:Gem::Version
+      hash: 3
+      segments:
+      - 0
       version: "0"
-  version:
 required_rubygems_version: !ruby/object:Gem::Requirement
+  none: false
   requirements:
   - - ">="
     - !ruby/object:Gem::Version
+      hash: 3
+      segments:
+      - 0
       version: "0"
-  version:
 requirements:
 - "The R project: http://www.r-project.org/"
 rubyforge_project: better-benchmark
-rubygems_version: 1.3.1
+rubygems_version: 1.3.7
 signing_key:
-specification_version: 2
+specification_version: 3
 summary: Statistically correct benchmarking for Ruby.
 test_files: []

data/README DELETED

@@ -1,34 +0,0 @@
-## Dependencies
-The R Project: http://www.r-project.org/
-rsruby: http://web.kuicr.kyoto-u.ac.jp/~alexg/rsruby/
-## Usage
-result = Benchmark.compare_realtime {
-  do_something_one_way
-}.with {
-  do_it_another_way
-}
-Benchmark.report_on result
-See also example.rb for a more comprehensive example.
-## Example Output
-....................
-Set 1 mean: 0.484 s
-Set 1 std dev: 0.098
-Set 2 mean: 0.469 s
-Set 2 std dev: 0.088
-p.value: 0.601661885634415
-W: 220.0
-The difference (-3.2%) IS NOT statistically significant.
-## Help, etc.
-irc.freenode.net#mathetes or http://mibbit.com/?server=irc.freenode.net&channel=%23mathetes
-## Repository
-git clone git://github.com/Pistos/better-benchmark.git