RubyGems - pipes - Versions diffs - 0.1.0 - Mend

pipes 0.1.0

Files changed (24)

data/.gitignore +24 -0
data/.rspec +1 -0
data/.rvmrc +1 -0
data/Gemfile +3 -0
data/LICENSE.txt +22 -0
data/README.md +331 -0
data/Rakefile +8 -0
data/lib/pipes.rb +46 -0
data/lib/pipes/resque_hooks.rb +18 -0
data/lib/pipes/runner.rb +112 -0
data/lib/pipes/stage_parser.rb +152 -0
data/lib/pipes/store.rb +122 -0
data/lib/pipes/utils.rb +7 -0
data/lib/pipes/version.rb +3 -0
data/pipes.gemspec +24 -0
data/spec/mock_jobs.rb +58 -0
data/spec/pipes/resque_hooks_spec.rb +22 -0
data/spec/pipes/runner_spec.rb +110 -0
data/spec/pipes/stage_parser_spec.rb +169 -0
data/spec/pipes/store_spec.rb +181 -0
data/spec/pipes/utils_spec.rb +14 -0
data/spec/pipes_spec.rb +46 -0
data/spec/spec_helper.rb +13 -0
metadata +140 -0

data/.gitignore ADDED

@@ -0,0 +1,24 @@
+*.gem
+*.rbc
+.bundle
+.config
+.yardoc
+Gemfile.lock
+InstalledFiles
+_yardoc
+coverage
+doc/
+lib/bundler/man
+pkg
+rdoc
+spec/reports
+test/tmp
+test/version_tmp
+tmp
+log/*.log
+pkg/
+spec/dummy/db/*.sqlite3
+spec/dummy/log/*.log
+spec/dummy/tmp/
+spec/dummy/.sass-cache

data/.rspec ADDED

@@ -0,0 +1 @@
+--colour --order rand

data/.rvmrc ADDED

@@ -0,0 +1 @@
+rvm use --create 1.9.3@pipes

data/Gemfile ADDED

@@ -0,0 +1,3 @@
+source 'https://rubygems.org'
+gemspec

data/LICENSE.txt ADDED

@@ -0,0 +1,22 @@
+Copyright (c) 2012 Mike Pack
+MIT License
+Permission is hereby granted, free of charge, to any person obtaining
+a copy of this software and associated documentation files (the
+"Software"), to deal in the Software without restriction, including
+without limitation the rights to use, copy, modify, merge, publish,
+distribute, sublicense, and/or sell copies of the Software, and to
+permit persons to whom the Software is furnished to do so, subject to
+the following conditions:
+The above copyright notice and this permission notice shall be
+included in all copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
+EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
+MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
+NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE
+LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
+OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION
+WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

data/README.md ADDED

@@ -0,0 +1,331 @@
+# Pipes
+![Pipes](http://i.imgur.com/MND26.png)
+Pipes is a Redis-backed concurrency management system designed around Resque. It provides a DSL for defining "stages" of a process. Each (Resque) job in the stage can be run concurrently, but all must finish before subsequent stages are run.
+## Example
+At Factory Code Labs, we work on a system for which we must deploy static HTML files. We must render any number of HTML pages, assets, .htaccess files, etc., so the static HTML-based site can run on Apache.
+Here's a simplified look at our stages:
+**Stage 1**
+- Publish HTML files.
+- Publish assets.
+- Publish .htaccess.
+**Stage 2**
+- rsync files to another server.
+- Upload assets to a CDN.
+**Stage 3**
+- Activate rsynced files.
+- Email people about deploy.
+We want to ensure that all of **Stage 1** is finished before **Stage 2** begins, and likewise for **Stage 3**. However, the individual components of each stage can execute asynchronously; we just want to make sure they have all finished before the next stage begins.
+## Installation
+Add this line to your application's Gemfile:
+    gem 'pipes'
+And then execute:
+    $ bundle
+Or install it yourself as:
+    $ gem install pipes
+## Usage
+Pipes assumes you're conforming to the Resque API in your jobs, so you might have the following:
+```ruby
+module Writers
+  class HTMLWriter
+    @queue = :content_writers
+    def self.perform(url = 'http://localhost:3000/')
+      # ... fetch URL and save HTML ...
+    end
+  end
+end
+```
+You'll generally need to do two things when working with Pipes:
+1. Define a set of stages.
+2. Run the jobs.
+Let's look at these two steps individually.
+### Defining Stages
+As part of the configuration process, you'll want to define your stages:
+```ruby
+Pipes.configure do |config|
+  config.stages do
+    # Stage 1
+    content_writers [
+      Writers::HTMLWriter,
+      Writers::AssetWriter,
+      Writers::HtaccessWriter
+    ]
+    # Stage 2
+    publishers [
+      Publishers::Rsyncer,
+      Publishers::CDNUploader
+    ]
+    # Stage 3
+    notifiers [
+      Notifiers::FileActivator,
+      Notifiers::Emailer
+    ]
+  end
+end
+```
+There are more advanced ways of defining stages; more on that later.
+Stages are defined lexically. That is, the order in which you define your stages in the config determines the order they will be run.
+The name of the stage is arbitrary. Above, we have `content_writers`, `publishers` and `notifiers`, but the names carry no special meaning. The name of the stage can later be extracted and presented to the user or referenced as a symbol.
+### Running The Jobs
+Once your configuration is set up, you can fire off the jobs:
+```ruby
+Pipes::Runner.run([Writers::HTMLWriter, Publishers::Rsyncer])
+```
+The above line essentially says "here are the jobs I'm looking to run", at which point Pipes takes over to determine how to partition them into their appropriate stages. Pipes will break these two jobs up as you would expect:
+```ruby
+# Stage 1 (content_writers)
+Writers::HTMLWriter
+# Stage 2 (publishers)
+Publishers::Rsyncer
+```
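As a rough illustration of the partitioning described above, the sketch below groups the requested jobs by the stage that lists them. It is not the gem's implementation: `STAGES`, `partition`, and the empty job classes are hypothetical stand-ins, and the stage table is assumed to be a plain Hash kept in definition order.

```ruby
# Hypothetical stand-ins for the README's job classes.
module Writers;    class HTMLWriter; end; class AssetWriter; end; end
module Publishers; class Rsyncer;    end; class CDNUploader; end; end

# Assumed stage table: stage name => jobs, in definition order
# (Ruby hashes preserve insertion order).
STAGES = {
  content_writers: [Writers::HTMLWriter, Writers::AssetWriter],
  publishers:      [Publishers::Rsyncer, Publishers::CDNUploader]
}

# Keep only the requested jobs, grouped by the stage that defines them.
# Stages with no requested jobs are dropped from the result.
def partition(requested, stages = STAGES)
  stages.each_with_object({}) do |(name, jobs), result|
    selected = jobs & requested
    result[name] = selected unless selected.empty?
  end
end

partition([Publishers::Rsyncer, Writers::HTMLWriter])
# => {content_writers: [Writers::HTMLWriter], publishers: [Publishers::Rsyncer]}
```

Note that the order in which jobs are requested does not matter here; the order of the stage table decides which stage comes first.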
+You can also pass arguments to the jobs, just like Resque:
+```ruby
+Pipes::Runner.run([Writers::HTMLWriter], 'http://localhost:3000/page')
+```
+In the above case, all jobs' `.perform` methods would receive the `http://localhost:3000/page` argument. You can, of course, pass multiple arguments:
+```ruby
+module Writers
+  class HTMLWriter
+    @queue = :content_writers
+    def self.perform(host = 'localhost', port = 3000)
+      # ... fetch URL and save HTML ...
+    end
+  end
+end
+Pipes::Runner.run([Writers::HTMLWriter], 'google.com', 80)
+```
+## Defining Stage Dependencies
+Pipes makes it easy to define dependencies between jobs.
+Say you want the `Publishers::Rsyncer` to always run after `Writers::HTMLWriter`. You'll first want to modify your config:
+```ruby
+Pipes.configure do |config|
+  config.stages do
+    content_writers [
+      {Writers::HTMLWriter => Publishers::Rsyncer}
+    ]
+    publishers [
+      Publishers::Rsyncer,
+      Publishers::CDNUploader
+    ]
+  end
+end
+```
+By converting the individual job into a Hash, you can specify that you want `Publishers::Rsyncer` to always run after `Writers::HTMLWriter`. You can also specify multiple dependencies:
+```ruby
+Pipes.configure do |config|
+  config.stages do
+    content_writers [
+      {Writers::HTMLWriter => [Publishers::Rsyncer, Publishers::CDNUploader]}
+    ]
+    publishers [
+      Publishers::Rsyncer,
+      Publishers::CDNUploader
+    ]
+  end
+end
+```
+Defining arrays of dependencies is great, but if you're just reiterating all jobs in a particular stage, you can specify the stage instead:
+```ruby
+Pipes.configure do |config|
+  config.stages do
+    content_writers [
+      {Writers::HTMLWriter => :publishers}
+    ]
+    publishers [
+      Publishers::Rsyncer,
+      Publishers::CDNUploader
+    ]
+  end
+end
+```
+If you need to specify multiple dependent stages, you can provide an array of symbols:
+```ruby
+Pipes.configure do |config|
+  config.stages do
+    content_writers [
+      {Writers::HTMLWriter => [:publishers, :notifiers]}
+    ]
+    publishers [
+      Publishers::Rsyncer,
+      Publishers::CDNUploader
+    ]
+    notifiers [
+      Notifiers::FileActivator
+    ]
+  end
+end
+```
+Pipes will also resolve deep dependencies:
+```ruby
+Pipes.configure do |config|
+  config.stages do
+    content_writers [
+      {Writers::HTMLWriter => :publishers}
+    ]
+    publishers [
+      {Publishers::Rsyncer => Notifiers::FileActivator},
+      Publishers::CDNUploader
+    ]
+    notifiers [
+      Notifiers::FileActivator
+    ]
+  end
+end
+```
+In the above example, `Notifiers::FileActivator` will also be a dependency of `Writers::HTMLWriter` because it's a dependency of one of `Writers::HTMLWriter`'s dependencies (`:publishers`).
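One way to picture deep dependency resolution is a depth-first walk over the configured dependencies. The sketch below is an assumption about the general technique, not the gem's stage parser: `CONFIG`, `expand`, `dependents_of`, and `resolve` are hypothetical names, and the config is modelled as a nested Hash rather than the DSL.

```ruby
# Hypothetical stand-ins for the README's job classes.
module Writers;    class HTMLWriter; end; end
module Publishers; class Rsyncer; end; class CDNUploader; end; end
module Notifiers;  class FileActivator; end; end

# Assumed model of the config: stage name => { job => declared dependents }.
CONFIG = {
  content_writers: { Writers::HTMLWriter => [:publishers] },
  publishers:      { Publishers::Rsyncer => [Notifiers::FileActivator],
                     Publishers::CDNUploader => [] },
  notifiers:       { Notifiers::FileActivator => [] }
}

# Expand a stage symbol into its jobs; leave a job class as is.
def expand(dep, config)
  dep.is_a?(Symbol) ? config.fetch(dep).keys : [dep]
end

# Declared dependents of a job, looked up across all stages.
def dependents_of(job, config)
  config.values.flat_map { |jobs| jobs.fetch(job, []) }
end

# Depth-first walk collecting the job plus everything reachable from it.
# The `seen` list also guards against cycles.
def resolve(job, config = CONFIG, seen = [])
  return seen if seen.include?(job)
  seen << job
  dependents_of(job, config).each do |dep|
    expand(dep, config).each { |j| resolve(j, config, seen) }
  end
  seen
end

resolve(Writers::HTMLWriter)
```

With this model, resolving `Writers::HTMLWriter` reaches both publishers through `:publishers`, and `Notifiers::FileActivator` through `Publishers::Rsyncer`.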
+Running jobs with dependencies is the same as before:
+```ruby
+Pipes::Runner.run([Writers::HTMLWriter], 'http://localhost:3000/page')
+```
+The above code will run `Writers::HTMLWriter` in **Stage 1**, `Publishers::Rsyncer` and `Publishers::CDNUploader` in **Stage 2**, and `Notifiers::FileActivator` in **Stage 3**, all receiving the `http://localhost:3000/page` argument.
+You can turn off dependency resolution by passing in some additional Pipes options as the third argument:
+```ruby
+Pipes::Runner.run([Writers::HTMLWriter], 'http://localhost:3000/page', {resolve: false})
+```
+In the above code, only `Writers::HTMLWriter` will be run.
+## Acceptable Formats for Jobs
+Pipes allows you to specify your jobs in a variety of ways:
+```ruby
+# A single job
+Pipes::Runner.run(Writers::HTMLWriter)
+# A single job as a string. Might be helpful if accepting params from a form
+Pipes::Runner.run('Writers::HTMLWriter')
+# An entire stage
+Pipes::Runner.run(:content_writers)
+# You can pass an array of any of the above, intermixing types
+Pipes::Runner.run([:content_writers, 'Publishers::CDNUploader', Notifiers::FileActivator])
+```
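Accepting classes, strings, and stage symbols implies some normalization step before jobs are partitioned. The sketch below shows one way this could work; `STAGE_JOBS` and `normalize_jobs` are hypothetical, and the gem's own parsing may differ.

```ruby
# Hypothetical stand-ins for the README's job classes.
module Writers;    class HTMLWriter; end; class AssetWriter; end; end
module Publishers; class CDNUploader; end; end

# Assumed stage table: stage name => jobs.
STAGE_JOBS = {
  content_writers: [Writers::HTMLWriter, Writers::AssetWriter],
  publishers:      [Publishers::CDNUploader]
}

# Turn a class, a class name, a stage symbol, or an array mixing them
# into a flat list of job classes without duplicates.
def normalize_jobs(spec, stages = STAGE_JOBS)
  Array(spec).flat_map do |item|
    case item
    when Symbol
      stages.fetch(item) # an entire stage
    when String
      # Walk 'Writers::HTMLWriter' one constant at a time.
      [item.split('::').inject(Object) { |scope, name| scope.const_get(name) }]
    else
      [item] # already a job class
    end
  end.uniq
end

normalize_jobs([:content_writers, 'Publishers::CDNUploader', Writers::HTMLWriter])
# => [Writers::HTMLWriter, Writers::AssetWriter, Publishers::CDNUploader]
```

If strings come from user input such as form params, checking them against the configured jobs before calling `const_get` would be a sensible precaution.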
+## Configuring Pipes
+Pipes allows you to specify a variety of configuration options:
+```ruby
+Pipes.configure do |config|
+  # config.redis can be a string...
+  config.redis = 'localhost:6379'
+  # ...or a Redis connection (default $redis):
+  config.redis = REDIS
+  # config.namespace will specify a Redis namespace to use (default nil):
+  config.namespace = 'my_project'
+  # config.resolve tells Pipes to resolve dependencies when calling Pipes::Runner.run(...) (default true):
+  config.resolve = false
+  config.stages do
+    # ...
+  end
+end
+```
+If you're using Pipes in a Rails app, stick your configuration in `config/initializers/pipes.rb`.
+## Support
+Pipes is currently tested under Ruby 1.9.3.
+## Known Caveats
+If your job is expecting a hash as the last argument, you'll need to pass an additional hash so Pipes won't think your final argument is the options:
+```ruby
+# Pipes will assume {follow_links: true} is options for Pipes, not your job:
+Pipes::Runner.run([Writers::HTMLWriter], {follow_links: true})
+# So you should pass a trailing hash to denote that there are no Pipes options:
+Pipes::Runner.run([Writers::HTMLWriter], {follow_links: true}, {})
+# Of course, if you do specify options for Pipes, everything will work fine:
+Pipes::Runner.run([Writers::HTMLWriter], {follow_links: true}, {resolve: true})
+```
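The caveat follows from the common Ruby idiom of treating a trailing Hash as options. A minimal sketch, assuming the runner splits its arguments this way (`split_args` is a hypothetical helper, not part of the gem):

```ruby
# Split run arguments into job arguments and Pipes options.
# Assumption: a trailing Hash is always taken to be the options.
def split_args(*args)
  options = args.last.is_a?(Hash) ? args.pop : {}
  [args, options]
end

# The job's hash is mistaken for Pipes options:
split_args({ follow_links: true })
# => [[], {follow_links: true}]

# A trailing empty hash absorbs the options slot:
split_args({ follow_links: true }, {})
# => [[{follow_links: true}], {}]
```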
+## Future Improvements
+- Better atomicity
+- Represent jobs and stages as objects, instead of simple data structures
+- Support for runaway workers/jobs
+## Credits
+![Factory Code Labs](http://i.imgur.com/yV4u1.png)
+Pipes is maintained by [Factory Code Labs](http://www.factorycodelabs.com).
+## License
+Pipes is Copyright © 2012 Factory Code Labs. It is free software, and may be redistributed under the terms specified in the LICENSE.txt file.

data/Rakefile ADDED

@@ -0,0 +1,8 @@
+require 'rspec/core/rake_task'
+desc 'Run RSpec code examples'
+RSpec::Core::RakeTask.new do |t|
+  t.verbose = false
+end
+task default: :spec

data/lib/pipes.rb ADDED

@@ -0,0 +1,46 @@
+module Pipes
+  # Default options
+  @redis   = $redis
+  @resolve = true
+  class << self
+    attr_reader :redis
+    attr_accessor :namespace, :resolve
+  end
+  def self.configure(*args, &block)
+    yield self
+  end
+  # config.redis can be a string or a redis connection
+  #   eg: config.redis = 'localhost:6379'
+  #   or  config.redis = $MY_REDIS
+  def self.redis=(redis)
+    if redis.is_a? String
+      host, port = redis.split(':')
+      set_redis(Redis.new(host: host, port: port))
+    else
+      set_redis(redis)
+    end
+  end
+  def self.stages(*args, &block)
+    Abyss.configure(*args) do
+      stages &block
+    end
+  end
+  private
+  def self.set_redis(redis)
+    @redis        = redis
+    Resque.redis  = redis
+    Redis.current = redis
+  end
+end
+require 'pipes/utils'
+require 'pipes/stage_parser'
+require 'pipes/store'
+require 'pipes/runner'
+require 'pipes/resque_hooks'

data/lib/pipes/resque_hooks.rb ADDED

@@ -0,0 +1,18 @@
+require 'pipes'
+require 'resque'
+Resque.before_fork do |job|
+  job.payload_class.extend Pipes::ResqueHooks
+end
+module Pipes
+  module ResqueHooks
+    def after_perform_pipes(*args)
+      Pipes::Store.done
+    end
+    def on_failure_pipes(e, *args)
+      Pipes::Store.done
+    end
+  end
+end
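Both hooks call `Pipes::Store.done`, so a failed job counts as finished just like a successful one. `Pipes::Store` is not shown in this part of the diff; the sketch below is only one plausible reading, assuming `done` counts down the jobs left in the current stage and starts the next stage at zero. `StageCounter` is a hypothetical in-memory stand-in, not the Redis-backed store.

```ruby
# Hypothetical in-memory stand-in for a Redis-backed stage store.
# Assumes every stage contains at least one job.
class StageCounter
  attr_reader :started

  def initialize(stages)
    @pending   = stages.map(&:dup) # stages still to run, each a list of job names
    @remaining = 0                 # jobs not yet finished in the current stage
    @started   = []                # jobs handed off so far, in order
    start_next_stage
  end

  # Called once per finished job, whether it succeeded or failed.
  def done
    @remaining -= 1
    start_next_stage if @remaining.zero?
  end

  private

  # Hand off every job of the next stage, if there is one.
  def start_next_stage
    stage = @pending.shift
    return if stage.nil?
    @remaining = stage.size
    @started.concat(stage)
  end
end

counter = StageCounter.new([%w[html assets], %w[rsync]])
counter.done # one job of stage 1 finished; stage 2 still waits
counter.done # stage 1 complete; stage 2 starts
```

In a real deployment the countdown would need to be atomic across workers, which is presumably why the README lists "Better atomicity" under future improvements.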