RubyGems - lab_tech - Versions diffs - 0.1.0 - Mend

lab_tech 0.1.0

Files changed (65)

checksums.yaml +7 -0
data/MIT-LICENSE +20 -0
data/README.md +323 -0
data/Rakefile +30 -0
data/app/models/lab_tech/application_record.rb +5 -0
data/app/models/lab_tech/default_cleaner.rb +87 -0
data/app/models/lab_tech/experiment.rb +190 -0
data/app/models/lab_tech/observation.rb +40 -0
data/app/models/lab_tech/percentile.rb +41 -0
data/app/models/lab_tech/result.rb +130 -0
data/app/models/lab_tech/speedup.rb +65 -0
data/app/models/lab_tech/summary.rb +183 -0
data/config/routes.rb +2 -0
data/db/migrate/20190815192130_create_experiment_tables.rb +50 -0
data/lib/lab_tech.rb +176 -0
data/lib/lab_tech/engine.rb +6 -0
data/lib/lab_tech/version.rb +3 -0
data/lib/tasks/lab_tech_tasks.rake +4 -0
data/spec/dummy/Rakefile +6 -0
data/spec/dummy/app/assets/config/manifest.js +1 -0
data/spec/dummy/app/assets/javascripts/application.js +14 -0
data/spec/dummy/app/assets/stylesheets/application.css +15 -0
data/spec/dummy/app/controllers/application_controller.rb +2 -0
data/spec/dummy/app/jobs/application_job.rb +2 -0
data/spec/dummy/app/models/application_record.rb +3 -0
data/spec/dummy/bin/bundle +3 -0
data/spec/dummy/bin/rails +4 -0
data/spec/dummy/bin/rake +4 -0
data/spec/dummy/bin/setup +33 -0
data/spec/dummy/bin/update +28 -0
data/spec/dummy/config.ru +5 -0
data/spec/dummy/config/application.rb +35 -0
data/spec/dummy/config/boot.rb +5 -0
data/spec/dummy/config/database.yml +25 -0
data/spec/dummy/config/environment.rb +5 -0
data/spec/dummy/config/environments/development.rb +46 -0
data/spec/dummy/config/environments/production.rb +71 -0
data/spec/dummy/config/environments/test.rb +36 -0
data/spec/dummy/config/initializers/application_controller_renderer.rb +8 -0
data/spec/dummy/config/initializers/backtrace_silencers.rb +7 -0
data/spec/dummy/config/initializers/cors.rb +16 -0
data/spec/dummy/config/initializers/filter_parameter_logging.rb +4 -0
data/spec/dummy/config/initializers/inflections.rb +16 -0
data/spec/dummy/config/initializers/mime_types.rb +4 -0
data/spec/dummy/config/initializers/wrap_parameters.rb +14 -0
data/spec/dummy/config/locales/en.yml +33 -0
data/spec/dummy/config/puma.rb +34 -0
data/spec/dummy/config/routes.rb +3 -0
data/spec/dummy/config/spring.rb +6 -0
data/spec/dummy/db/schema.rb +52 -0
data/spec/dummy/db/test.sqlite3 +0 -0
data/spec/dummy/log/development.log +0 -0
data/spec/dummy/log/test.log +1519 -0
data/spec/examples.txt +79 -0
data/spec/models/lab_tech/default_cleaner_spec.rb +32 -0
data/spec/models/lab_tech/experiment_spec.rb +110 -0
data/spec/models/lab_tech/percentile_spec.rb +85 -0
data/spec/models/lab_tech/result_spec.rb +198 -0
data/spec/models/lab_tech/speedup_spec.rb +133 -0
data/spec/models/lab_tech/summary_spec.rb +325 -0
data/spec/models/lab_tech_spec.rb +23 -0
data/spec/rails_helper.rb +62 -0
data/spec/spec_helper.rb +98 -0
data/spec/support/misc_helpers.rb +7 -0
metadata +238 -0

checksums.yaml ADDED

@@ -0,0 +1,7 @@
+---
+SHA256:
+  metadata.gz: 2f5e6f3f09101904d4cf6fbc209768dba3f58bdb3ca9bbe157473e7d939fe124
+  data.tar.gz: 88dceb210ec735257a5f2cf329fe132085062b39dd2ad207dae2d0ff16392993
+SHA512:
+  metadata.gz: b52697112c48077ac53e85af052b0e20e5425f448fba77a8139344afad7bb39c4bf497dcf5987d794d5826e5a6dfd920975a0d509d44fb16779d92342d21ffbd
+  data.tar.gz: d6178294c5066e7117435237b7ba2a93a1bbf15bba7152ead2c6f4393a808c0af1d271478126e50a5948ee133c855fa28ca2589da140a305cc5698271ce0fd81

data/MIT-LICENSE ADDED

@@ -0,0 +1,20 @@
+Copyright 2019 Sam Livingston-Gray
+Permission is hereby granted, free of charge, to any person obtaining
+a copy of this software and associated documentation files (the
+"Software"), to deal in the Software without restriction, including
+without limitation the rights to use, copy, modify, merge, publish,
+distribute, sublicense, and/or sell copies of the Software, and to
+permit persons to whom the Software is furnished to do so, subject to
+the following conditions:
+The above copyright notice and this permission notice shall be
+included in all copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
+EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
+MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
+NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE
+LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
+OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION
+WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

data/README.md ADDED

@@ -0,0 +1,323 @@
+# LabTech!
+Rails engine for using GitHub's 'Scientist' library with ActiveRecord, for those of us not operating apps at ROFLscale.
+Please go read [Scientist's amazing
+README](https://github.com/github/scientist/blob/master/README.md).  This tool
+won't make any sense until you understand what Scientist is for and how it
+works.
+If conference videos are your thing, Jesse Toth's ["Easy Rewrites With Ruby and
+Science!"](http://www.confreaks.tv/videos/rubyconf2014-easy-rewrites-with-ruby-and-science)
+from RubyConf 2014 is well worth your time as well.
+**NOTE:  our examples assume that you have access to the Rails production
+console.**  If you work at a company that locks this down, you'll need to write
+an administrative back-end UI to enable and disable experiments and review them
+for accuracy and performance.  (Please feel free to send those back to us in a
+pull request; we simply haven't needed them for ourselves, so they don't
+exist yet.)
+## Usage
+Once you've installed the gem and run its migrations (as described in
+"Installation", below), you can start running experiments.
+For the purposes of this README, let's say we have a Customer Relationship
+Manager (CRM) that lets its users search for leads using some predefined set of
+criteria (name, location, favorite food, whatever).  (Any resemblance to an
+actual SaaS product we sell here at Real Geeks is... purely coincidental.)
+Let's say, too, that the code behind that search started out sort of okay, but
+it got worse and worse over time until someone decided it was time for a full
+rewrite.  The old search code lives in a method named `Lead.search`, but we've
+been working on a replacement that lives in `SpiffySearch.leads`.  The tests
+for SpiffySearch are in great shape, and we're confident that we won't be
+causing any 500 errors -- but we'd like to use a tool like Scientist to make
+sure that SpiffySearch returns the same results in the same order our users
+expect.
+Stand back -- we're going to try SCIENCE!
+The first thing we need is a name for our experiment -- let's just go with
+`"spiffy-search"`.
+### Deploying an Experiment
+```ruby
+LabTech.science "spiffy-search" do |exp|
+  exp.use { Lead.search(params[:search]) }        # control
+  exp.try { SpiffySearch.leads(params[:search]) } # candidate
+end
+```
+Within the block, `exp` is an object that includes the `Scientist::Experiment`
+module, so we can use any and all of the tools in the Scientist README.
+However, I want to call out a few specific ones as being extremely useful for
+this sort of thing.  When working with code that returns interesting objects,
+it's a Very Good Idea™ to make use of the `clean` method on the experiment.
+It's probably also good to override the default comparison and to provide some
+context for the experiment, so let's just redo that block with that in mind:
+```ruby
+LabTech.science "spiffy-search" do |exp|
+  exp.context params: params.to_h
+  exp.use { Lead.search(params[:search]) }        # control
+  exp.try { SpiffySearch.leads(params[:search]) } # candidate
+  exp.compare {|control, candidate| control.map(&:id) == candidate.map(&:id) }
+  exp.clean { |records| records.map(&:id) }
+end
+```
+Now that that's done, we can safely commit and deploy this code.  As soon as
+that code starts getting run, we'll see a new LabTech::Experiment record in the
+database.  However, **the candidate block will never run,** because
+LabTech::Experiment records are disabled by default.
+### Enabling an Experiment
+In order to enable the experiment, we'll need to go into the Rails production
+console and run:
+```ruby
+LabTech.enable "spiffy-search"
+```
+If we have particularly high search volume and we only want to run the
+experiment on a fraction of our requests, the `.enable` method takes an
+optional `:percent` keyword argument, which accepts an integer in the range
+`(0..100)`.  So to sample only, say, 3% of our searches (selected at random),
+we could run this instead:
+```ruby
+LabTech.enable "spiffy-search", percent: 3
+```
+### Summarizing Experimental Results
+At this point, if you have the [table_print gem](http://tableprintgem.com/)
+installed, you can get a quick overview of all of your experiments by running:
+```ruby
+tp LabTech::Experiment.all
+```
+Either way, if you want more details on how the experiment is running, you can run:
+```ruby
+tp LabTech.summarize_results "spiffy-search"
+```
+This will print a terminal-friendly summary of your experimental results.  I'll
+talk more about this summary later, but for now, let's say we were
+overconfident, and there's a bug in SpiffySearch that's raising an exception.
+The summary will have a line that looks like this:
+```
+22 of 22 (100.00%) raised errors
+```
+Ruh roh!
+### Summarizing Errors
+We run this to see what's up:
+```ruby
+LabTech.summarize_errors "spiffy-search"
+```
+And we see something that looks like this, only longer:
+```
+====================================================================================================
+Comparing results for smoke-test:
+----------------------------------------------------------------------------------------------------
+Result #1
+  * RuntimeError:  nope
+----------------------------------------------------------------------------------------------------
+----------------------------------------------------------------------------------------------------
+Result #2
+  * RuntimeError:  nope
+----------------------------------------------------------------------------------------------------
+====================================================================================================
+```
+If you want to see individual backtraces, you can do so by finding and
+inspecting individual records in the Rails console.  For now, though, let's
+say we know where the error is...
+### Disabling and Restarting an Experiment
+There's no point continuing to collect those exceptions, so we might as well
+turn the experiment back off:
+```ruby
+LabTech.disable "spiffy-search"
+```
+We fix the exception, deploy the new code, and now we want to start the
+experiment over again.  We don't want the previous exceptions cluttering up our
+new results, so let's clear out all those observations:
+```ruby
+exp = LabTech::Experiment.named("spiffy-search")
+exp.purge_data
+```
+(Yes, this is a slightly more cumbersome interface than enabling or summarizing
+an experiment.  While deleting data is sometimes necessary, we don't want to
+make it easy to do accidentally.)
+### Summarizing Experimental Results, Take Two
+This time, the output from `LabTech.summarize_results "spiffy-search"` looks
+more like this:
+```
+--------------------------------------------------------------------------------
+Experiment: smoke-test
+--------------------------------------------------------------------------------
+Earliest results: 2019-08-21T11:00:41-10:00
+Latest result:    2019-08-21T11:23:31-10:00 (23 minutes)
+103 of 106 (97.16%) correct
+2 of 106 (1.88%) mismatched
+1 of 106 (0.94%) timed out
+Median time delta: +0.000s  (90% of observations between -0.000s and +0.000s)
+Speedups (by percentiles):
+      0%  [           █             ·                         ]    -3.1x
+      5%  [             █           ·                         ]    -2.8x
+     10%  [             █           ·                         ]    -2.6x
+     15%  [              █          ·                         ]    -2.5x
+     20%  [              █          ·                         ]    -2.4x
+     25%  [               █         ·                         ]    -2.3x
+     30%  [               █         ·                         ]    -2.2x
+     35%  [                █        ·                         ]    -2.1x
+     40%  [                █        ·                         ]    -2.0x
+     45%  [                         ·    █                    ]    +1.2x faster
+     50%  [ · · · · · · · · · · · · · · · · █ · · · · · · · · ]    +1.8x faster
+     55%  [                         ·        █                ]    +2.0x faster
+     60%  [                         ·        █                ]    +2.1x faster
+     65%  [                         ·         █               ]    +2.2x faster
+     70%  [                         ·         █               ]    +2.4x faster
+     75%  [                         ·          █              ]    +2.6x faster
+     80%  [                         ·          █              ]    +2.6x faster
+     85%  [                         ·           █             ]    +2.7x faster
+     90%  [                         ·           █             ]    +2.8x faster
+     95%  [                         ·            █            ]    +3.0x faster
+    100%  [                         ·                        █]    +6.7x faster
+--------------------------------------------------------------------------------
+```
+First off, we see a summary of the time range represented in this experiment.
+This is a very simple "first result to last result" view that does not take
+into account when the experiment was enabled.
+Next, we see some counts.  An individual run of an experiment may have one of
+four outcomes:
+- "correct" means that the control and candidate were considered equivalent
+- "mismatched" means that the candidate returned a different value than the
+  control
+- "timed out" means that the experiment's run raised a `Timeout::Error`
+- "raised error" means that the experiment's run raised anything other than
+  `Timeout::Error`
+After the counts, we see a bunch of performance data, starting with a line that
+says "Median time delta" and includes the 5th and 95th percentile time deltas
+as well.  "Time delta" just means the difference in execution time between the
+control and the candidate:  negative values are faster, and positive values are
+slower.  (The 5th and 95th percentiles are deliberately chosen to keep us from
+worrying too much about extreme values that might be outliers.)
+The rest of the output is taken up by a chart that attempts to show at a glance
+whether the candidate is faster or slower than the control.  Because it can be
+hard to remember what the signs signify, this also includes the word "faster"
+when the candidate was faster than the control.
+### Comparing Mismatches
+At this point, we might be curious about any mismatches, and want to
+investigate those.  Unfortunately, the chart I showed above was edited by hand
+to show what the output might look like if mismatches were present, but as of
+this writing I don't actually have any mismatches to show you.  (I promise
+that's not a humblebrag.)
+However, you can get a quick, if EXTREMELY VERBOSE, listing of the first few
+mismatches by running:
+```ruby
+LabTech.compare_mismatches "spiffy-search", limit: 3
+```
+You have the ability to customize the output of this by passing a block that
+takes a "control" parameter followed by a "candidate" parameter; the return
+value of that block will be printed to the console.  How you do this will
+largely depend on the kind of data you're collecting to validate your
+experiments.  There are several examples in the `lib/lab_tech.rb` file; I
+encourage you to check them out.
+## Installation
+**NOTE: As this gem is a Rails engine, we assume you have a Rails application to
+include it in.**
+Add this line to your application's Gemfile:
+```ruby
+gem 'lab_tech'
+```
+And then execute:
+```bash
+$ bundle
+```
+Or install it yourself as:
+```bash
+$ gem install lab_tech
+```
+Once the gem is installed, run this from your application's root (possibly with
+the `bundle exec` or `bin/` prefix, or whatever else may be dictated by your
+local custom and practice):
+```bash
+rails lab_tech:install:migrations db:migrate
+```
+The output from that command should look like this:
+```
+Copied migration 20190822175815_create_experiment_tables.lab_tech.rb from lab_tech
+== 20190822175815 CreateExperimentTables: migrating ===========================
+-- create_table("lab_tech_experiments")
+-> 0.0147s
+-- create_table("lab_tech_results")
+-> 0.0152s
+-- create_table("lab_tech_observations")
+-> 0.0109s
+== 20190822175815 CreateExperimentTables: migrated (0.0410s) ==================
+```
+Once that's done, you should be good to go!  See the "Usage" section, above.
+## Contributing
+This gem was extracted just before its primary author left Real Geeks, so it's
+not quite clear who's going to take responsibility for the gem.  It's probably
+a good idea to open a GitHub issue to start a conversation before undertaking
+any great amount of work -- though, of course, you're perfectly welcome to fork
+the gem and use your modified version at any time.
+## License
+The gem is available as open source under the terms of the [MIT
+License](https://opensource.org/licenses/MIT).
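
Note on the README's "Median time delta" line (an illustrative sketch, not code from the gem): the README describes the time delta as the difference in execution time between control and candidate, with negative values meaning the candidate was faster, and reports the median alongside the 5th and 95th percentiles. The snippet below assumes the delta is `candidate - control` and uses a simple nearest-rank percentile. The `percentile` helper and the millisecond timings are hypothetical and may not match what `LabTech::Percentile` or `LabTech::Summary` actually do.

```ruby
# Hypothetical helper (not LabTech's implementation): nearest-rank percentile
# over an already-sorted, non-empty array.
def percentile(sorted, pct)
  return sorted.first if pct <= 0
  rank = (pct / 100.0 * sorted.length).ceil
  sorted[rank - 1]
end

# Made-up durations in milliseconds for five paired runs.
control_ms   = [200, 250, 300, 220, 400]
candidate_ms = [100, 300, 150, 110, 200]

# Assumed definition: delta = candidate - control, so a negative delta means
# the candidate finished sooner than the control.
deltas = control_ms.zip(candidate_ms)
                   .map { |control, candidate| candidate - control }
                   .sort

median = percentile(deltas, 50)
low    = percentile(deltas, 5)   # 5th percentile
high   = percentile(deltas, 95)  # 95th percentile

puts "Median time delta: #{median}ms (90% of observations between #{low}ms and #{high}ms)"
```

Reporting the 5th and 95th percentiles rather than the minimum and maximum follows the README's stated reasoning: it keeps a single extreme run from dominating the summary.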

data/Rakefile ADDED

@@ -0,0 +1,30 @@
+begin
+  require 'bundler/setup'
+rescue LoadError
+  puts 'You must `gem install bundler` and `bundle install` to run rake tasks'
+end
+require 'rdoc/task'
+RDoc::Task.new(:rdoc) do |rdoc|
+  rdoc.rdoc_dir = 'rdoc'
+  rdoc.title    = 'LabTech'
+  rdoc.options << '--line-numbers'
+  rdoc.rdoc_files.include('README.md')
+  rdoc.rdoc_files.include('lib/**/*.rb')
+end
+APP_RAKEFILE = File.expand_path("spec/dummy/Rakefile", __dir__)
+load 'rails/tasks/engine.rake'
+load 'rails/tasks/statistics.rake'
+require 'bundler/gem_tasks'
+require 'rspec/core'
+require 'rspec/core/rake_task'
+desc "Run all specs in spec directory (excluding plugin specs)"
+RSpec::Core::RakeTask.new(spec: "app:db:test:prepare")
+task default: :spec

data/app/models/lab_tech/application_record.rb ADDED

@@ -0,0 +1,5 @@
+module LabTech
+  class ApplicationRecord < ActiveRecord::Base
+    self.abstract_class = true
+  end
+end

data/app/models/lab_tech/default_cleaner.rb ADDED

@@ -0,0 +1,87 @@
+module LabTech
+  class DefaultCleaner
+    def self.call( value )
+      new.call( value )
+    end
+    def call( value )
+      clean( value, return_placeholders: false )
+    end
+    class RecordPlaceholder
+      attr_reader :class_name, :id
+      def initialize( record )
+        @class_name = record.class.to_s
+        @id         = record.id
+      end
+      def to_a
+        [ class_name, id ]
+      end
+      def inspect
+        "<#{class_name} ##{id}>"
+      end
+    end
+    private
+    # In the event of a Recursion Blunder, stop before we smash the stack, so we
+    # can actually get a useful stack trace that doesn't overwhelm our scrollback
+    # buffers
+    MAX_DEPTH = 10
+    def __push__
+      @depth ||= 0
+      @depth += 1
+      fail "wtf are you even doing?" if @depth > MAX_DEPTH
+    end
+    def __pop__
+      @depth -= 1
+    end
+    def clean( value, return_placeholders: )
+      __push__
+      case value
+      when RecordPlaceholder
+        clean_record_placeholder( value, return_placeholders: return_placeholders )
+      when ActiveRecord::Base
+        clean_record( value, return_placeholders: return_placeholders )
+      when Array
+        clean_array( value, return_placeholders: return_placeholders )
+      else
+        value
+      end
+    ensure
+      __pop__
+    end
+    def clean_array( value, return_placeholders: )
+      placeholders = value.map {|e| clean(e, return_placeholders: true) }
+      if placeholders.all? { |e| e.kind_of?(RecordPlaceholder) } && !return_placeholders
+        count_placeholders( placeholders )
+      else
+        placeholders.map {|e| clean(e, return_placeholders: return_placeholders) }
+      end
+    end
+    def clean_record( value, return_placeholders: )
+      placeholder = RecordPlaceholder.new( value )
+      clean_record_placeholder( placeholder, return_placeholders: return_placeholders )
+    end
+    def clean_record_placeholder( value, return_placeholders: )
+      return_placeholders \
+        ? value \
+        : value.to_a
+    end
+    def count_placeholders( placeholders )
+      counts = placeholders.group_by(&:class_name).map { |class_name, summs|
+        [ class_name, summs.map(&:id) ]
+      }
+      Hash[ counts ]
+    end
+  end
+end
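
Note on `DefaultCleaner` (an illustrative sketch, not code from the gem): reading the hunk above, a single record is reduced to `[class_name, id]`, and a flat array made up entirely of records is reduced to a hash of ids grouped by class name. The standalone snippet below imitates only that second case so it can run without Rails. `Lead`, `Agent`, and `summarize_records` are hypothetical stand-ins, with `Struct` taking the place of `ActiveRecord::Base` models.

```ruby
# Stand-ins for ActiveRecord models; only the class name and #id matter here.
Lead  = Struct.new(:id)
Agent = Struct.new(:id)

# Hypothetical helper imitating the result DefaultCleaner appears to produce
# for a flat array containing only records: ids grouped by class name.
def summarize_records(records)
  records.group_by { |record| record.class.to_s }
         .transform_values { |group| group.map(&:id) }
end

summary = summarize_records([Lead.new(1), Agent.new(7), Lead.new(3)])
puts summary.inspect
```

Storing class names and ids instead of whole records keeps the cleaned values small and easy to compare by eye, which is presumably why the gem takes this approach by default.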