ezpub-delayed_job 1.7.0

Sign up to get free protection for your applications and to get access to all the features.
@@ -0,0 +1,20 @@
1
+ Copyright (c) 2005 Tobias Luetke
2
+
3
+ Permission is hereby granted, free of charge, to any person obtaining
4
+ a copy of this software and associated documentation files (the
5
+ "Software"), to deal in the Software without restriction, including
6
+ without limitation the rights to use, copy, modify, merge, publish,
7
+ distribute, sublicense, and/or sell copies of the Software, and to
8
+ permit persons to whom the Software is furnished to do so, subject to
9
+ the following conditions:
10
+
11
+ The above copyright notice and this permission notice shall be
12
+ included in all copies or substantial portions of the Software.
13
+
14
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
15
+ EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
16
+ MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOa AND
17
+ NONINFRINGEMENT. IN NO EVENT SaALL THE AUTHORS OR COPYRIGHT HOLDERS BE
18
+ LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
19
+ OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION
20
+ WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
@@ -0,0 +1,124 @@
1
+ h1. Delayed::Job
2
+
3
+ Delated_job (or DJ) encapsulates the common pattern of asynchronously executing longer tasks in the background.
4
+
5
+ It is a direct extraction from Shopify where the job table is responsible for a multitude of core tasks. Amongst those tasks are:
6
+
7
+ * sending massive newsletters
8
+ * image resizing
9
+ * http downloads
10
+ * updating smart collections
11
+ * updating solr, our search server, after product changes
12
+ * batch imports
13
+ * spam checks
14
+
15
+ h2. Setup
16
+
17
+ The library evolves around a delayed_jobs table which looks as follows:
18
+
19
+ <pre><code>
20
+ create_table :delayed_jobs, :force => true do |table|
21
+ table.integer :priority, :default => 0 # Allows some jobs to jump to the front of the queue
22
+ table.integer :attempts, :default => 0 # Provides for retries, but still fail eventually.
23
+ table.text :handler # YAML-encoded string of the object that will do work
24
+ table.string :job_type # Class name of the job object, for type-specific workers
25
+ table.string :last_error # reason for last failure (See Note below)
26
+ table.datetime :run_at # When to run. Could be Time.now for immediately, or sometime in the future.
27
+ table.datetime :locked_at # Set when a client is working on this object
28
+ table.datetime :failed_at # Set when all retries have failed (actually, by default, the record is deleted instead)
29
+ table.string :locked_by # Who is working on this object (if locked)
30
+ table.timestamps
31
+ end
32
+ </code></pre>
33
+
34
+ On failure, the job is scheduled again in 5 seconds + N ** 4, where N is the number of retries.
35
+
36
+ The default MAX_ATTEMPTS is 25. After this, the job either deleted (default), or left in the database with "failed_at" set.
37
+ With the default of 25 attempts, the last retry will be 20 days later, with the last interval being almost 100 hours.
38
+
39
+ The default MAX_RUN_TIME is 4.hours. If your job takes longer than that, another computer could pick it up. It's up to you to
40
+ make sure your job doesn't exceed this time. You should set this to the longest time you think the job could take.
41
+
42
+ By default, it will delete failed jobs (and it always deletes successful jobs). If you want to keep failed jobs, set
43
+ Delayed::Job.destroy_failed_jobs = false. The failed jobs will be marked with non-null failed_at.
44
+
45
+ Here is an example of changing job parameters in Rails:
46
+
47
+ <pre><code>
48
+ # config/initializers/delayed_job_config.rb
49
+ Delayed::Job.destroy_failed_jobs = false
50
+ silence_warnings do
51
+ Delayed::Job.const_set("MAX_ATTEMPTS", 3)
52
+ Delayed::Job.const_set("MAX_RUN_TIME", 5.minutes)
53
+ end
54
+ </code></pre>
55
+
56
+ Note: If your error messages are long, consider changing last_error field to a :text instead of a :string (255 character limit).
57
+
58
+
59
+ h2. Usage
60
+
61
+ Jobs are simple ruby objects with a method called perform. Any object which responds to perform can be stuffed into the jobs table.
62
+ Job objects are serialized to yaml so that they can later be resurrected by the job runner.
63
+
64
+ <pre><code>
65
+ class NewsletterJob < Struct.new(:text, :emails)
66
+ def perform
67
+ emails.each { |e| NewsletterMailer.deliver_text_to_email(text, e) }
68
+ end
69
+ end
70
+
71
+ Delayed::Job.enqueue NewsletterJob.new('lorem ipsum...', Customers.find(:all).collect(&:email))
72
+ </code></pre>
73
+
74
+ There is also a second way to get jobs in the queue: send_later.
75
+
76
+ <pre><code>
77
+ BatchImporter.new(Shop.find(1)).send_later(:import_massive_csv, massive_csv)
78
+ </code></pre>
79
+
80
+ This will simply create a Delayed::PerformableMethod job in the jobs table which serializes all the parameters you pass to it. There are some special smarts for active record objects
81
+ which are stored as their text representation and loaded from the database fresh when the job is actually run later.
82
+
83
+
84
+ h2. Running the jobs
85
+
86
+ You can invoke @rake jobs:work@ which will start working off jobs. You can cancel the rake task with @CTRL-C@.
87
+
88
+ You can also run by writing a simple @script/job_runner@, and invoking it externally:
89
+
90
+ <pre><code>
91
+ #!/usr/bin/env ruby
92
+ require File.dirname(__FILE__) + '/../config/environment'
93
+
94
+ Delayed::Worker.new.start
95
+ </code></pre>
96
+
97
+ Workers can be running on any computer, as long as they have access to the database and their clock is in sync. You can even
98
+ run multiple workers on per computer, but you must give each one a unique name. (TODO: put in an example)
99
+ Keep in mind that each worker will check the database at least every 5 seconds.
100
+
101
+ Note: The rake task will exit if the database has any network connectivity problems.
102
+
103
+ If you only want to run specific types of jobs in a given worker, include them when initializing the worker:
104
+
105
+ <pre><code>
106
+ Delayed::Worker.new(:job_types => "SimpleJob").start
107
+ Delayed::Worker.new(:job_types => ["SimpleJob", "NewsletterJob"]).start
108
+ </pre></code>
109
+
110
+ h3. Cleaning up
111
+
112
+ You can invoke @rake jobs:clear@ to delete all jobs in the queue.
113
+
114
+ h3. Changes
115
+
116
+ * 1.7.0: Added failed_at column which can optionally be set after a certain amount of failed job attempts. By default failed job attempts are destroyed after about a month.
117
+
118
+ * 1.6.0: Renamed locked_until to locked_at. We now store when we start a given job instead of how long it will be locked by the worker. This allows us to get a reading on how long a job took to execute.
119
+
120
+ * 1.5.0: Job runners can now be run in parallel. Two new database columns are needed: locked_until and locked_by. This allows us to use pessimistic locking instead of relying on row level locks. This enables us to run as many worker processes as we need to speed up queue processing.
121
+
122
+ * 1.2.0: Added #send_later to Object for simpler job creation
123
+
124
+ * 1.0.0: Initial release
@@ -0,0 +1,41 @@
1
+ #version = File.read('README.textile').scan(/^\*\s+([\d\.]+)/).flatten
2
+
3
+ Gem::Specification.new do |s|
4
+ s.name = "delayed_job"
5
+ s.version = "1.7.0"
6
+ s.date = "2008-11-28"
7
+ s.summary = "Database-backed asynchronous priority queue system -- Extracted from Shopify"
8
+ s.email = "tobi@leetsoft.com"
9
+ s.homepage = "http://github.com/tobi/delayed_job/tree/master"
10
+ s.description = "Delated_job (or DJ) encapsulates the common pattern of asynchronously executing longer tasks in the background. It is a direct extraction from Shopify where the job table is responsible for a multitude of core tasks."
11
+ s.authors = ["Tobias Lütke"]
12
+
13
+ # s.bindir = "bin"
14
+ # s.executables = ["delayed_job"]
15
+ # s.default_executable = "delayed_job"
16
+
17
+ s.has_rdoc = false
18
+ s.rdoc_options = ["--main", "README.textile"]
19
+ s.extra_rdoc_files = ["README.textile"]
20
+
21
+ # run git ls-files to get an updated list
22
+ s.files = %w[
23
+ MIT-LICENSE
24
+ README.textile
25
+ delayed_job.gemspec
26
+ init.rb
27
+ lib/delayed/job.rb
28
+ lib/delayed/message_sending.rb
29
+ lib/delayed/performable_method.rb
30
+ lib/delayed/worker.rb
31
+ lib/delayed_job.rb
32
+ tasks/jobs.rake
33
+ tasks/tasks.rb
34
+ ]
35
+ s.test_files = %w[
36
+ spec/database.rb
37
+ spec/delayed_method_spec.rb
38
+ spec/job_spec.rb
39
+ spec/story_spec.rb
40
+ ]
41
+ end
data/init.rb ADDED
@@ -0,0 +1 @@
1
+ require File.dirname(__FILE__) + '/lib/delayed_job'
@@ -0,0 +1,279 @@
1
+ module Delayed
2
+
3
+ class DeserializationError < StandardError
4
+ end
5
+
6
+ # A job object that is persisted to the database.
7
+ # Contains the work object as a YAML field.
8
+ class Job < ActiveRecord::Base
9
+ MAX_ATTEMPTS = 25
10
+ MAX_RUN_TIME = 4.hours
11
+ set_table_name :delayed_jobs
12
+
13
+ # By default failed jobs are destroyed after too many attempts.
14
+ # If you want to keep them around (perhaps to inspect the reason
15
+ # for the failure), set this to false.
16
+ cattr_accessor :destroy_failed_jobs
17
+ self.destroy_failed_jobs = true
18
+
19
+ # Every worker has a unique name which by default is the pid of the process.
20
+ # There are some advantages to overriding this with something which survives worker retarts:
21
+ # Workers can safely resume working on tasks which are locked by themselves. The worker will assume that it crashed before.
22
+ cattr_accessor :worker_name
23
+ self.worker_name = "host:#{Socket.gethostname} pid:#{Process.pid}" rescue "pid:#{Process.pid}"
24
+
25
+ NextTaskSQL = '(run_at <= ? AND (locked_at IS NULL OR locked_at < ?) OR (locked_by = ?)) AND failed_at IS NULL'
26
+ NextTaskOrder = 'priority DESC, run_at ASC'
27
+
28
+ ParseObjectFromYaml = /\!ruby\/\w+\:([^\s]+)/
29
+
30
+ cattr_accessor :min_priority, :max_priority, :job_types
31
+ self.min_priority = nil
32
+ self.max_priority = nil
33
+ self.job_types = nil
34
+
35
+ # When a worker is exiting, make sure we don't have any locked jobs.
36
+ def self.clear_locks!
37
+ update_all("locked_by = null, locked_at = null", ["locked_by = ?", worker_name])
38
+ end
39
+
40
+ def failed?
41
+ failed_at
42
+ end
43
+ alias_method :failed, :failed?
44
+
45
+ def payload_object
46
+ @payload_object ||= deserialize(self['handler'])
47
+ end
48
+
49
+ def name
50
+ @name ||= begin
51
+ payload = payload_object
52
+ if payload.respond_to?(:display_name)
53
+ payload.display_name
54
+ else
55
+ payload.class.name
56
+ end
57
+ end
58
+ end
59
+
60
+ def payload_object=(object)
61
+ self['job_type'] = object.class.to_s
62
+ self['handler'] = object.to_yaml
63
+ end
64
+
65
+ # Reschedule the job in the future (when a job fails).
66
+ # Uses an exponential scale depending on the number of failed attempts.
67
+ def reschedule(message, backtrace = [], time = nil)
68
+ if self.attempts < MAX_ATTEMPTS
69
+ time ||= Job.db_time_now + (attempts ** 4) + 5
70
+
71
+ self.attempts += 1
72
+ self.run_at = time
73
+ self.last_error = message + "\n" + backtrace.join("\n")
74
+ self.unlock
75
+ save!
76
+ else
77
+ logger.info "* [JOB] PERMANENTLY removing #{self.name} because of #{attempts} consequetive failures."
78
+ destroy_failed_jobs ? destroy : update_attribute(:failed_at, Time.now)
79
+ end
80
+ end
81
+
82
+
83
+ # Try to run one job. Returns true/false (work done/work failed) or nil if job can't be locked.
84
+ def run_with_lock(max_run_time, worker_name)
85
+ logger.info "* [JOB] aquiring lock on #{name}"
86
+ unless lock_exclusively!(max_run_time, worker_name)
87
+ # We did not get the lock, some other worker process must have
88
+ logger.warn "* [JOB] failed to aquire exclusive lock for #{name}"
89
+ return nil # no work done
90
+ end
91
+
92
+ begin
93
+ runtime = Benchmark.realtime do
94
+ invoke_job # TODO: raise error if takes longer than max_run_time
95
+ destroy
96
+ end
97
+ # TODO: warn if runtime > max_run_time ?
98
+ logger.info "* [JOB] #{name} completed after %.4f" % runtime
99
+ return true # did work
100
+ rescue Exception => e
101
+ reschedule e.message, e.backtrace
102
+ log_exception(e)
103
+ return false # work failed
104
+ end
105
+ end
106
+
107
+ # Add a job to the queue
108
+ def self.enqueue(*args, &block)
109
+ object = block_given? ? EvaledJob.new(&block) : args.shift
110
+
111
+ unless object.respond_to?(:perform) || block_given?
112
+ raise ArgumentError, 'Cannot enqueue items which do not respond to perform'
113
+ end
114
+
115
+ priority = args.first || 0
116
+ run_at = args[1]
117
+
118
+ Job.create(:payload_object => object, :priority => priority.to_i, :run_at => run_at)
119
+ end
120
+
121
+ # Find a few candidate jobs to run (in case some immediately get locked by others).
122
+ # Return in random order prevent everyone trying to do same head job at once.
123
+ def self.find_available(limit = 5, max_run_time = MAX_RUN_TIME)
124
+
125
+ time_now = db_time_now
126
+
127
+ sql = NextTaskSQL.dup
128
+
129
+ conditions = [time_now, time_now - max_run_time, worker_name]
130
+
131
+ if self.min_priority
132
+ sql << ' AND (priority >= ?)'
133
+ conditions << min_priority
134
+ end
135
+
136
+ if self.max_priority
137
+ sql << ' AND (priority <= ?)'
138
+ conditions << max_priority
139
+ end
140
+
141
+ if self.job_types
142
+ sql << ' AND (job_type IN (?))'
143
+ conditions << job_types
144
+ end
145
+
146
+ conditions.unshift(sql)
147
+
148
+ records = ActiveRecord::Base.silence do
149
+ find(:all, :conditions => conditions, :order => NextTaskOrder, :limit => limit)
150
+ end
151
+
152
+ records.sort_by { rand() }
153
+ end
154
+
155
+ # Run the next job we can get an exclusive lock on.
156
+ # If no jobs are left we return nil
157
+ def self.reserve_and_run_one_job(max_run_time = MAX_RUN_TIME)
158
+
159
+ # We get up to 5 jobs from the db. In case we cannot get exclusive access to a job we try the next.
160
+ # this leads to a more even distribution of jobs across the worker processes
161
+ find_available(5, max_run_time).each do |job|
162
+ t = job.run_with_lock(max_run_time, worker_name)
163
+ return t unless t == nil # return if we did work (good or bad)
164
+ end
165
+
166
+ nil # we didn't do any work, all 5 were not lockable
167
+ end
168
+
169
+ # Lock this job for this worker.
170
+ # Returns true if we have the lock, false otherwise.
171
+ def lock_exclusively!(max_run_time, worker = worker_name)
172
+ now = self.class.db_time_now
173
+ affected_rows = if locked_by != worker
174
+ # We don't own this job so we will update the locked_by name and the locked_at
175
+ self.class.update_all(["locked_at = ?, locked_by = ?", now, worker], ["id = ? and (locked_at is null or locked_at < ?)", id, (now - max_run_time.to_i)])
176
+ else
177
+ # We already own this job, this may happen if the job queue crashes.
178
+ # Simply resume and update the locked_at
179
+ self.class.update_all(["locked_at = ?", now], ["id = ? and locked_by = ?", id, worker])
180
+ end
181
+ if affected_rows == 1
182
+ self.locked_at = now
183
+ self.locked_by = worker
184
+ return true
185
+ else
186
+ return false
187
+ end
188
+ end
189
+
190
+ # Unlock this job (note: not saved to DB)
191
+ def unlock
192
+ self.locked_at = nil
193
+ self.locked_by = nil
194
+ end
195
+
196
+ # This is a good hook if you need to report job processing errors in additional or different ways
197
+ def log_exception(error)
198
+ logger.error "* [JOB] #{name} failed with #{error.class.name}: #{error.message} - #{attempts} failed attempts"
199
+ logger.error(error)
200
+ end
201
+
202
+ # Do num jobs and return stats on success/failure.
203
+ # Exit early if interrupted.
204
+ def self.work_off(num = 100)
205
+ success, failure = 0, 0
206
+
207
+ num.times do
208
+ case self.reserve_and_run_one_job
209
+ when true
210
+ success += 1
211
+ when false
212
+ failure += 1
213
+ else
214
+ break # leave if no work could be done
215
+ end
216
+ break if $exit # leave if we're exiting
217
+ end
218
+
219
+ return [success, failure]
220
+ end
221
+
222
+ # Moved into its own method so that new_relic can trace it.
223
+ def invoke_job
224
+ payload_object.perform
225
+ end
226
+
227
+ private
228
+
229
+ def deserialize(source)
230
+ handler = YAML.load(source) rescue nil
231
+
232
+ unless handler.respond_to?(:perform)
233
+ if handler.nil? && source =~ ParseObjectFromYaml
234
+ handler_class = $1
235
+ end
236
+ attempt_to_load(handler_class || handler.class)
237
+ handler = YAML.load(source)
238
+ end
239
+
240
+ return handler if handler.respond_to?(:perform)
241
+
242
+ raise DeserializationError,
243
+ 'Job failed to load: Unknown handler. Try to manually require the appropiate file.'
244
+ rescue TypeError, LoadError, NameError => e
245
+ raise DeserializationError,
246
+ "Job failed to load: #{e.message}. Try to manually require the required file."
247
+ end
248
+
249
+ # Constantize the object so that ActiveSupport can attempt
250
+ # its auto loading magic. Will raise LoadError if not successful.
251
+ def attempt_to_load(klass)
252
+ klass.constantize
253
+ end
254
+
255
+ # Get the current time (GMT or local depending on DB)
256
+ # Note: This does not ping the DB to get the time, so all your clients
257
+ # must have syncronized clocks.
258
+ def self.db_time_now
259
+ (ActiveRecord::Base.default_timezone == :utc) ? Time.now.utc : Time.now
260
+ end
261
+
262
+ protected
263
+
264
+ def before_save
265
+ self.run_at ||= self.class.db_time_now
266
+ end
267
+
268
+ end
269
+
270
+ class EvaledJob
271
+ def initialize
272
+ @job = yield
273
+ end
274
+
275
+ def perform
276
+ eval(@job)
277
+ end
278
+ end
279
+ end
@@ -0,0 +1,17 @@
1
+ module Delayed
2
+ module MessageSending
3
+ def send_later(method, *args)
4
+ Delayed::Job.enqueue Delayed::PerformableMethod.new(self, method.to_sym, args)
5
+ end
6
+
7
+ module ClassMethods
8
+ def handle_asynchronously(method)
9
+ without_name = "#{method}_without_send_later"
10
+ define_method("#{method}_with_send_later") do |*args|
11
+ send_later(without_name, *args)
12
+ end
13
+ alias_method_chain method, :send_later
14
+ end
15
+ end
16
+ end
17
+ end
@@ -0,0 +1,55 @@
1
+ module Delayed
2
+ class PerformableMethod < Struct.new(:object, :method, :args)
3
+ CLASS_STRING_FORMAT = /^CLASS\:([A-Z][\w\:]+)$/
4
+ AR_STRING_FORMAT = /^AR\:([A-Z][\w\:]+)\:(\d+)$/
5
+
6
+ def initialize(object, method, args)
7
+ raise NoMethodError, "undefined method `#{method}' for #{self.inspect}" unless object.respond_to?(method)
8
+
9
+ self.object = dump(object)
10
+ self.args = args.map { |a| dump(a) }
11
+ self.method = method.to_sym
12
+ end
13
+
14
+ def display_name
15
+ case self.object
16
+ when CLASS_STRING_FORMAT then "#{$1}.#{method}"
17
+ when AR_STRING_FORMAT then "#{$1}##{method}"
18
+ else "Unknown##{method}"
19
+ end
20
+ end
21
+
22
+ def perform
23
+ load(object).send(method, *args.map{|a| load(a)})
24
+ rescue ActiveRecord::RecordNotFound
25
+ # We cannot do anything about objects which were deleted in the meantime
26
+ true
27
+ end
28
+
29
+ private
30
+
31
+ def load(arg)
32
+ case arg
33
+ when CLASS_STRING_FORMAT then $1.constantize
34
+ when AR_STRING_FORMAT then $1.constantize.find($2)
35
+ else arg
36
+ end
37
+ end
38
+
39
+ def dump(arg)
40
+ case arg
41
+ when Class then class_to_string(arg)
42
+ when ActiveRecord::Base then ar_to_string(arg)
43
+ else arg
44
+ end
45
+ end
46
+
47
+ def ar_to_string(obj)
48
+ "AR:#{obj.class}:#{obj.id}"
49
+ end
50
+
51
+ def class_to_string(obj)
52
+ "CLASS:#{obj.name}"
53
+ end
54
+ end
55
+ end
@@ -0,0 +1,55 @@
1
+ module Delayed
2
+ class Worker
3
+ SLEEP = 5
4
+
5
+ cattr_accessor :logger
6
+ self.logger = if defined?(Merb::Logger)
7
+ Merb.logger
8
+ elsif defined?(RAILS_DEFAULT_LOGGER)
9
+ RAILS_DEFAULT_LOGGER
10
+ end
11
+
12
+ def initialize(options={})
13
+ @quiet = options[:quiet]
14
+ Delayed::Job.min_priority = options[:min_priority] if options.has_key?(:min_priority)
15
+ Delayed::Job.max_priority = options[:max_priority] if options.has_key?(:max_priority)
16
+ Delayed::Job.job_types = options[:job_types] if options.has_key?(:job_types)
17
+ end
18
+
19
+ def start
20
+ say "*** Starting job worker #{Delayed::Job.worker_name}"
21
+
22
+ trap('TERM') { say 'Exiting...'; $exit = true }
23
+ trap('INT') { say 'Exiting...'; $exit = true }
24
+
25
+ loop do
26
+ result = nil
27
+
28
+ realtime = Benchmark.realtime do
29
+ result = Delayed::Job.work_off
30
+ end
31
+
32
+ count = result.sum
33
+
34
+ break if $exit
35
+
36
+ if count.zero?
37
+ sleep(SLEEP)
38
+ else
39
+ say "#{count} jobs processed at %.4f j/s, %d failed ..." % [count / realtime, result.last]
40
+ end
41
+
42
+ break if $exit
43
+ end
44
+
45
+ ensure
46
+ Delayed::Job.clear_locks!
47
+ end
48
+
49
+ def say(text)
50
+ puts text unless @quiet
51
+ logger.info text if logger
52
+ end
53
+
54
+ end
55
+ end
@@ -0,0 +1,13 @@
1
+ autoload :ActiveRecord, 'activerecord'
2
+
3
+ require File.dirname(__FILE__) + '/delayed/message_sending'
4
+ require File.dirname(__FILE__) + '/delayed/performable_method'
5
+ require File.dirname(__FILE__) + '/delayed/job'
6
+ require File.dirname(__FILE__) + '/delayed/worker'
7
+
8
+ Object.send(:include, Delayed::MessageSending)
9
+ Module.send(:include, Delayed::MessageSending::ClassMethods)
10
+
11
+ if defined?(Merb::Plugins)
12
+ Merb::Plugins.add_rakefiles File.dirname(__FILE__) / '..' / 'tasks' / 'tasks'
13
+ end
@@ -0,0 +1,43 @@
1
+ $:.unshift(File.dirname(__FILE__) + '/../lib')
2
+ $:.unshift(File.dirname(__FILE__) + '/../../rspec/lib')
3
+
4
+ require 'rubygems'
5
+ require 'active_record'
6
+ gem 'sqlite3-ruby'
7
+
8
+ require File.dirname(__FILE__) + '/../init'
9
+ require 'spec'
10
+
11
+ ActiveRecord::Base.logger = Logger.new('/tmp/dj.log')
12
+ ActiveRecord::Base.establish_connection(:adapter => 'sqlite3', :database => '/tmp/jobs.sqlite')
13
+ ActiveRecord::Migration.verbose = false
14
+
15
+ ActiveRecord::Schema.define do
16
+
17
+ create_table :delayed_jobs, :force => true do |table|
18
+ table.integer :priority, :default => 0
19
+ table.integer :attempts, :default => 0
20
+ table.text :handler
21
+ table.string :job_type
22
+ table.string :last_error
23
+ table.datetime :run_at
24
+ table.datetime :locked_at
25
+ table.string :locked_by
26
+ table.datetime :failed_at
27
+ table.timestamps
28
+ end
29
+
30
+ create_table :stories, :force => true do |table|
31
+ table.string :text
32
+ end
33
+
34
+ end
35
+
36
+
37
+ # Purely useful for test cases...
38
+ class Story < ActiveRecord::Base
39
+ def tell; text; end
40
+ def whatever(n, _); tell*n; end
41
+
42
+ handle_asynchronously :whatever
43
+ end
@@ -0,0 +1,128 @@
1
+ require File.dirname(__FILE__) + '/database'
2
+
3
+ class SimpleJob
4
+ cattr_accessor :runs; self.runs = 0
5
+ def perform; @@runs += 1; end
6
+ end
7
+
8
+ class RandomRubyObject
9
+ def say_hello
10
+ 'hello'
11
+ end
12
+ end
13
+
14
+ class ErrorObject
15
+
16
+ def throw
17
+ raise ActiveRecord::RecordNotFound, '...'
18
+ false
19
+ end
20
+
21
+ end
22
+
23
+ class StoryReader
24
+
25
+ def read(story)
26
+ "Epilog: #{story.tell}"
27
+ end
28
+
29
+ end
30
+
31
+ class StoryReader
32
+
33
+ def read(story)
34
+ "Epilog: #{story.tell}"
35
+ end
36
+
37
+ end
38
+
39
+ describe 'random ruby objects' do
40
+ before { Delayed::Job.delete_all }
41
+
42
+ it "should respond_to :send_later method" do
43
+
44
+ RandomRubyObject.new.respond_to?(:send_later)
45
+
46
+ end
47
+
48
+ it "should raise a ArgumentError if send_later is called but the target method doesn't exist" do
49
+ lambda { RandomRubyObject.new.send_later(:method_that_deos_not_exist) }.should raise_error(NoMethodError)
50
+ end
51
+
52
+ it "should add a new entry to the job table when send_later is called on it" do
53
+ Delayed::Job.count.should == 0
54
+
55
+ RandomRubyObject.new.send_later(:to_s)
56
+
57
+ Delayed::Job.count.should == 1
58
+ end
59
+
60
+ it "should add a new entry to the job table when send_later is called on the class" do
61
+ Delayed::Job.count.should == 0
62
+
63
+ RandomRubyObject.send_later(:to_s)
64
+
65
+ Delayed::Job.count.should == 1
66
+ end
67
+
68
+ it "should run get the original method executed when the job is performed" do
69
+
70
+ RandomRubyObject.new.send_later(:say_hello)
71
+
72
+ Delayed::Job.count.should == 1
73
+ end
74
+
75
+ it "should ignore ActiveRecord::RecordNotFound errors because they are permanent" do
76
+
77
+ ErrorObject.new.send_later(:throw)
78
+
79
+ Delayed::Job.count.should == 1
80
+
81
+ Delayed::Job.reserve_and_run_one_job
82
+
83
+ Delayed::Job.count.should == 0
84
+
85
+ end
86
+
87
+ it "should store the object as string if its an active record" do
88
+ story = Story.create :text => 'Once upon...'
89
+ story.send_later(:tell)
90
+
91
+ job = Delayed::Job.find(:first)
92
+ job.payload_object.class.should == Delayed::PerformableMethod
93
+ job.payload_object.object.should == "AR:Story:#{story.id}"
94
+ job.payload_object.method.should == :tell
95
+ job.payload_object.args.should == []
96
+ job.payload_object.perform.should == 'Once upon...'
97
+ end
98
+
99
+ it "should store arguments as string if they an active record" do
100
+
101
+ story = Story.create :text => 'Once upon...'
102
+
103
+ reader = StoryReader.new
104
+ reader.send_later(:read, story)
105
+
106
+ job = Delayed::Job.find(:first)
107
+ job.payload_object.class.should == Delayed::PerformableMethod
108
+ job.payload_object.method.should == :read
109
+ job.payload_object.args.should == ["AR:Story:#{story.id}"]
110
+ job.payload_object.perform.should == 'Epilog: Once upon...'
111
+ end
112
+
113
+ it "should call send later on methods which are wrapped with handle_asynchronously" do
114
+ story = Story.create :text => 'Once upon...'
115
+
116
+ Delayed::Job.count.should == 0
117
+
118
+ story.whatever(1, 5)
119
+
120
+ Delayed::Job.count.should == 1
121
+ job = Delayed::Job.find(:first)
122
+ job.payload_object.class.should == Delayed::PerformableMethod
123
+ job.payload_object.method.should == :whatever_without_send_later
124
+ job.payload_object.args.should == [1, 5]
125
+ job.payload_object.perform.should == 'Once upon...'
126
+ end
127
+
128
+ end
@@ -0,0 +1,368 @@
1
+ require File.dirname(__FILE__) + '/database'
2
+
3
+ class SimpleJob
4
+ cattr_accessor :runs; self.runs = 0
5
+ def perform; @@runs += 1; end
6
+ end
7
+
8
+ class ErrorJob
9
+ cattr_accessor :runs; self.runs = 0
10
+ def perform; raise 'did not work'; end
11
+ end
12
+
13
+ module M
14
+ class ModuleJob
15
+ cattr_accessor :runs; self.runs = 0
16
+ def perform; @@runs += 1; end
17
+ end
18
+
19
+ end
20
+
21
+ describe Delayed::Job do
22
+ before do
23
+ Delayed::Job.max_priority = nil
24
+ Delayed::Job.min_priority = nil
25
+
26
+ Delayed::Job.delete_all
27
+ end
28
+
29
+ before(:each) do
30
+ SimpleJob.runs = 0
31
+ end
32
+
33
+ it "should set run_at automatically if not set" do
34
+ Delayed::Job.create(:payload_object => ErrorJob.new ).run_at.should_not == nil
35
+ end
36
+
37
+ it "should not set run_at automatically if already set" do
38
+ later = 5.minutes.from_now
39
+ Delayed::Job.create(:payload_object => ErrorJob.new, :run_at => later).run_at.should == later
40
+ end
41
+
42
+ it "should raise ArgumentError when handler doesn't respond_to :perform" do
43
+ lambda { Delayed::Job.enqueue(Object.new) }.should raise_error(ArgumentError)
44
+ end
45
+
46
+ it "should increase count after enqueuing items" do
47
+ Delayed::Job.enqueue SimpleJob.new
48
+ Delayed::Job.count.should == 1
49
+ end
50
+
51
+ it "should be able to set priority when enqueuing items" do
52
+ Delayed::Job.enqueue SimpleJob.new, 5
53
+ Delayed::Job.first.priority.should == 5
54
+ end
55
+
56
+ it "should be able to set run_at when enqueuing items" do
57
+ later = 5.minutes.from_now
58
+ Delayed::Job.enqueue SimpleJob.new, 5, later
59
+
60
+ # use be close rather than equal to because millisecond values cn be lost in DB round trip
61
+ Delayed::Job.first.run_at.should be_close(later, 1)
62
+ end
63
+
64
+ it "should call perform on jobs when running work_off" do
65
+ SimpleJob.runs.should == 0
66
+
67
+ Delayed::Job.enqueue SimpleJob.new
68
+ Delayed::Job.work_off
69
+
70
+ SimpleJob.runs.should == 1
71
+ end
72
+
73
+ it "should work on specified job types" do
74
+ SimpleJob.runs.should == 0
75
+
76
+ Delayed::Job.job_types = "SimpleJob"
77
+ Delayed::Job.enqueue SimpleJob.new
78
+ Delayed::Job.work_off
79
+
80
+ SimpleJob.runs.should == 1
81
+
82
+ Delayed::Job.job_types = nil
83
+ end
84
+
85
+ it "should not work on unspecified job types" do
86
+ SimpleJob.runs.should == 0
87
+
88
+ Delayed::Job.job_types = "AnotherJob"
89
+ Delayed::Job.enqueue SimpleJob.new
90
+ Delayed::Job.work_off
91
+
92
+ SimpleJob.runs.should == 0
93
+
94
+ Delayed::Job.job_types = nil
95
+ end
96
+
97
+ it "should work with eval jobs" do
98
+ $eval_job_ran = false
99
+
100
+ Delayed::Job.enqueue do <<-JOB
101
+ $eval_job_ran = true
102
+ JOB
103
+ end
104
+
105
+ Delayed::Job.work_off
106
+
107
+ $eval_job_ran.should == true
108
+ end
109
+
110
+ it "should work with jobs in modules" do
111
+ M::ModuleJob.runs.should == 0
112
+
113
+ Delayed::Job.enqueue M::ModuleJob.new
114
+ Delayed::Job.work_off
115
+
116
+ M::ModuleJob.runs.should == 1
117
+ end
118
+
119
+ it "should re-schedule by about 1 second at first and increment this more and more minutes when it fails to execute properly" do
120
+ Delayed::Job.enqueue ErrorJob.new
121
+ Delayed::Job.work_off(1)
122
+
123
+ job = Delayed::Job.find(:first)
124
+
125
+ job.last_error.should =~ /did not work/
126
+ job.last_error.should =~ /job_spec.rb:10:in `perform'/
127
+ job.attempts.should == 1
128
+
129
+ job.run_at.should > Delayed::Job.db_time_now - 10.minutes
130
+ job.run_at.should < Delayed::Job.db_time_now + 10.minutes
131
+ end
132
+
133
+ it "should raise an DeserializationError when the job class is totally unknown" do
134
+
135
+ job = Delayed::Job.new
136
+ job['handler'] = "--- !ruby/object:JobThatDoesNotExist {}"
137
+
138
+ lambda { job.payload_object.perform }.should raise_error(Delayed::DeserializationError)
139
+ end
140
+
141
+ it "should try to load the class when it is unknown at the time of the deserialization" do
142
+ job = Delayed::Job.new
143
+ job['handler'] = "--- !ruby/object:JobThatDoesNotExist {}"
144
+
145
+ job.should_receive(:attempt_to_load).with('JobThatDoesNotExist').and_return(true)
146
+
147
+ lambda { job.payload_object.perform }.should raise_error(Delayed::DeserializationError)
148
+ end
149
+
150
+ it "should try include the namespace when loading unknown objects" do
151
+ job = Delayed::Job.new
152
+ job['handler'] = "--- !ruby/object:Delayed::JobThatDoesNotExist {}"
153
+ job.should_receive(:attempt_to_load).with('Delayed::JobThatDoesNotExist').and_return(true)
154
+ lambda { job.payload_object.perform }.should raise_error(Delayed::DeserializationError)
155
+ end
156
+
157
+ it "should also try to load structs when they are unknown (raises TypeError)" do
158
+ job = Delayed::Job.new
159
+ job['handler'] = "--- !ruby/struct:JobThatDoesNotExist {}"
160
+
161
+ job.should_receive(:attempt_to_load).with('JobThatDoesNotExist').and_return(true)
162
+
163
+ lambda { job.payload_object.perform }.should raise_error(Delayed::DeserializationError)
164
+ end
165
+
166
+ it "should try include the namespace when loading unknown structs" do
167
+ job = Delayed::Job.new
168
+ job['handler'] = "--- !ruby/struct:Delayed::JobThatDoesNotExist {}"
169
+
170
+ job.should_receive(:attempt_to_load).with('Delayed::JobThatDoesNotExist').and_return(true)
171
+ lambda { job.payload_object.perform }.should raise_error(Delayed::DeserializationError)
172
+ end
173
+
174
+ it "should be failed if it failed more than MAX_ATTEMPTS times and we don't want to destroy jobs" do
175
+ default = Delayed::Job.destroy_failed_jobs
176
+ Delayed::Job.destroy_failed_jobs = false
177
+
178
+ @job = Delayed::Job.create :payload_object => SimpleJob.new, :attempts => 50
179
+ @job.reload.failed_at.should == nil
180
+ @job.reschedule 'FAIL'
181
+ @job.reload.failed_at.should_not == nil
182
+
183
+ Delayed::Job.destroy_failed_jobs = default
184
+ end
185
+
186
+ it "should be destroyed if it failed more than MAX_ATTEMPTS times and we want to destroy jobs" do
187
+ default = Delayed::Job.destroy_failed_jobs
188
+ Delayed::Job.destroy_failed_jobs = true
189
+
190
+ @job = Delayed::Job.create :payload_object => SimpleJob.new, :attempts => 50
191
+ @job.should_receive(:destroy)
192
+ @job.reschedule 'FAIL'
193
+
194
+ Delayed::Job.destroy_failed_jobs = default
195
+ end
196
+
197
+ it "should never find failed jobs" do
198
+ @job = Delayed::Job.create :payload_object => SimpleJob.new, :attempts => 50, :failed_at => Time.now
199
+ Delayed::Job.find_available(1).length.should == 0
200
+ end
201
+
202
+ context "when another worker is already performing an task, it" do
203
+
204
+ before :each do
205
+ Delayed::Job.worker_name = 'worker1'
206
+ @job = Delayed::Job.create :payload_object => SimpleJob.new, :locked_by => 'worker1', :locked_at => Delayed::Job.db_time_now - 5.minutes
207
+ end
208
+
209
+ it "should not allow a second worker to get exclusive access" do
210
+ @job.lock_exclusively!(4.hours, 'worker2').should == false
211
+ end
212
+
213
+ it "should allow a second worker to get exclusive access if the timeout has passed" do
214
+ @job.lock_exclusively!(1.minute, 'worker2').should == true
215
+ end
216
+
217
+ it "should be able to get access to the task if it was started more then max_age ago" do
218
+ @job.locked_at = 5.hours.ago
219
+ @job.save
220
+
221
+ @job.lock_exclusively! 4.hours, 'worker2'
222
+ @job.reload
223
+ @job.locked_by.should == 'worker2'
224
+ @job.locked_at.should > 1.minute.ago
225
+ end
226
+
227
+ it "should not be found by another worker" do
228
+ Delayed::Job.worker_name = 'worker2'
229
+
230
+ Delayed::Job.find_available(1, 6.minutes).length.should == 0
231
+ end
232
+
233
+ it "should be found by another worker if the time has expired" do
234
+ Delayed::Job.worker_name = 'worker2'
235
+
236
+ Delayed::Job.find_available(1, 4.minutes).length.should == 1
237
+ end
238
+
239
+ it "should be able to get exclusive access again when the worker name is the same" do
240
+ @job.lock_exclusively! 5.minutes, 'worker1'
241
+ @job.lock_exclusively! 5.minutes, 'worker1'
242
+ @job.lock_exclusively! 5.minutes, 'worker1'
243
+ end
244
+ end
245
+
246
+ context "#name" do
247
+ it "should be the class name of the job that was enqueued" do
248
+ Delayed::Job.create(:payload_object => ErrorJob.new ).name.should == 'ErrorJob'
249
+ end
250
+
251
+ it "should be the method that will be called if its a performable method object" do
252
+ Delayed::Job.send_later(:clear_locks!)
253
+ Delayed::Job.last.name.should == 'Delayed::Job.clear_locks!'
254
+
255
+ end
256
+ it "should be the instance method that will be called if its a performable method object" do
257
+ story = Story.create :text => "..."
258
+
259
+ story.send_later(:save)
260
+
261
+ Delayed::Job.last.name.should == 'Story#save'
262
+ end
263
+ end
264
+
265
+ context "worker prioritization" do
266
+
267
+ before(:each) do
268
+ Delayed::Job.max_priority = nil
269
+ Delayed::Job.min_priority = nil
270
+ end
271
+
272
+ it "should only work_off jobs that are >= min_priority" do
273
+ Delayed::Job.min_priority = -5
274
+ Delayed::Job.max_priority = 5
275
+ SimpleJob.runs.should == 0
276
+
277
+ Delayed::Job.enqueue SimpleJob.new, -10
278
+ Delayed::Job.enqueue SimpleJob.new, 0
279
+ Delayed::Job.work_off
280
+
281
+ SimpleJob.runs.should == 1
282
+ end
283
+
284
+ it "should only work_off jobs that are <= max_priority" do
285
+ Delayed::Job.min_priority = -5
286
+ Delayed::Job.max_priority = 5
287
+ SimpleJob.runs.should == 0
288
+
289
+ Delayed::Job.enqueue SimpleJob.new, 10
290
+ Delayed::Job.enqueue SimpleJob.new, 0
291
+
292
+ Delayed::Job.work_off
293
+
294
+ SimpleJob.runs.should == 1
295
+ end
296
+
297
+ end
298
+
299
+ context "when pulling jobs off the queue for processing, it" do
300
+ before(:each) do
301
+ @job = Delayed::Job.create(
302
+ :payload_object => SimpleJob.new,
303
+ :locked_by => 'worker1',
304
+ :locked_at => Delayed::Job.db_time_now - 5.minutes)
305
+ end
306
+
307
+ it "should leave the queue in a consistent state and not run the job if locking fails" do
308
+ SimpleJob.runs.should == 0
309
+ @job.stub!(:lock_exclusively!).with(any_args).once.and_return(false)
310
+ Delayed::Job.should_receive(:find_available).once.and_return([@job])
311
+ Delayed::Job.work_off(1)
312
+ SimpleJob.runs.should == 0
313
+ end
314
+
315
+ end
316
+
317
+ context "while running alongside other workers that locked jobs, it" do
318
+ before(:each) do
319
+ Delayed::Job.worker_name = 'worker1'
320
+ Delayed::Job.create(:payload_object => SimpleJob.new, :locked_by => 'worker1', :locked_at => (Delayed::Job.db_time_now - 1.minutes))
321
+ Delayed::Job.create(:payload_object => SimpleJob.new, :locked_by => 'worker2', :locked_at => (Delayed::Job.db_time_now - 1.minutes))
322
+ Delayed::Job.create(:payload_object => SimpleJob.new)
323
+ Delayed::Job.create(:payload_object => SimpleJob.new, :locked_by => 'worker1', :locked_at => (Delayed::Job.db_time_now - 1.minutes))
324
+ end
325
+
326
+ it "should ingore locked jobs from other workers" do
327
+ Delayed::Job.worker_name = 'worker3'
328
+ SimpleJob.runs.should == 0
329
+ Delayed::Job.work_off
330
+ SimpleJob.runs.should == 1 # runs the one open job
331
+ end
332
+
333
+ it "should find our own jobs regardless of locks" do
334
+ Delayed::Job.worker_name = 'worker1'
335
+ SimpleJob.runs.should == 0
336
+ Delayed::Job.work_off
337
+ SimpleJob.runs.should == 3 # runs open job plus worker1 jobs that were already locked
338
+ end
339
+ end
340
+
341
+ context "while running with locked and expired jobs, it" do
342
+ before(:each) do
343
+ Delayed::Job.worker_name = 'worker1'
344
+ exp_time = Delayed::Job.db_time_now - (1.minutes + Delayed::Job::MAX_RUN_TIME)
345
+ Delayed::Job.create(:payload_object => SimpleJob.new, :locked_by => 'worker1', :locked_at => exp_time)
346
+ Delayed::Job.create(:payload_object => SimpleJob.new, :locked_by => 'worker2', :locked_at => (Delayed::Job.db_time_now - 1.minutes))
347
+ Delayed::Job.create(:payload_object => SimpleJob.new)
348
+ Delayed::Job.create(:payload_object => SimpleJob.new, :locked_by => 'worker1', :locked_at => (Delayed::Job.db_time_now - 1.minutes))
349
+ end
350
+
351
+ it "should only find unlocked and expired jobs" do
352
+ Delayed::Job.worker_name = 'worker3'
353
+ SimpleJob.runs.should == 0
354
+ Delayed::Job.work_off
355
+ SimpleJob.runs.should == 2 # runs the one open job and one expired job
356
+ end
357
+
358
+ it "should ignore locks when finding our own jobs" do
359
+ Delayed::Job.worker_name = 'worker1'
360
+ SimpleJob.runs.should == 0
361
+ Delayed::Job.work_off
362
+ SimpleJob.runs.should == 3 # runs open job plus worker1 jobs
363
+ # This is useful in the case of a crash/restart on worker1, but make sure multiple workers on the same host have unique names!
364
+ end
365
+
366
+ end
367
+
368
+ end
@@ -0,0 +1,17 @@
1
+ require File.dirname(__FILE__) + '/database'
2
+
3
+ describe "A story" do
4
+
5
+ before(:all) do
6
+ @story = Story.create :text => "Once upon a time..."
7
+ end
8
+
9
+ it "should be shared" do
10
+ @story.tell.should == 'Once upon a time...'
11
+ end
12
+
13
+ it "should not return its result if it storytelling is delayed" do
14
+ @story.send_later(:tell).should_not == 'Once upon a time...'
15
+ end
16
+
17
+ end
@@ -0,0 +1 @@
1
+ require File.join(File.dirname(__FILE__), 'tasks')
@@ -0,0 +1,15 @@
1
+ # Re-definitions are appended to existing tasks
2
+ task :environment
3
+ task :merb_env
4
+
5
+ namespace :jobs do
6
+ desc "Clear the delayed_job queue."
7
+ task :clear => [:merb_env, :environment] do
8
+ Delayed::Job.delete_all
9
+ end
10
+
11
+ desc "Start a delayed_job worker."
12
+ task :work => [:merb_env, :environment] do
13
+ Delayed::Worker.new(:min_priority => ENV['MIN_PRIORITY'], :max_priority => ENV['MAX_PRIORITY']).start
14
+ end
15
+ end
metadata ADDED
@@ -0,0 +1,68 @@
1
+ --- !ruby/object:Gem::Specification
2
+ name: ezpub-delayed_job
3
+ version: !ruby/object:Gem::Version
4
+ version: 1.7.0
5
+ platform: ruby
6
+ authors:
7
+ - "Tobias L\xC3\xBCtke"
8
+ autorequire:
9
+ bindir: bin
10
+ cert_chain: []
11
+
12
+ date: 2008-11-28 00:00:00 -08:00
13
+ default_executable:
14
+ dependencies: []
15
+
16
+ description: Delated_job (or DJ) encapsulates the common pattern of asynchronously executing longer tasks in the background. It is a direct extraction from Shopify where the job table is responsible for a multitude of core tasks.
17
+ email: tobi@leetsoft.com
18
+ executables: []
19
+
20
+ extensions: []
21
+
22
+ extra_rdoc_files:
23
+ - README.textile
24
+ files:
25
+ - MIT-LICENSE
26
+ - README.textile
27
+ - delayed_job.gemspec
28
+ - init.rb
29
+ - lib/delayed/job.rb
30
+ - lib/delayed/message_sending.rb
31
+ - lib/delayed/performable_method.rb
32
+ - lib/delayed/worker.rb
33
+ - lib/delayed_job.rb
34
+ - tasks/jobs.rake
35
+ - tasks/tasks.rb
36
+ has_rdoc: false
37
+ homepage: http://github.com/tobi/delayed_job/tree/master
38
+ licenses:
39
+ post_install_message:
40
+ rdoc_options:
41
+ - --main
42
+ - README.textile
43
+ require_paths:
44
+ - lib
45
+ required_ruby_version: !ruby/object:Gem::Requirement
46
+ requirements:
47
+ - - ">="
48
+ - !ruby/object:Gem::Version
49
+ version: "0"
50
+ version:
51
+ required_rubygems_version: !ruby/object:Gem::Requirement
52
+ requirements:
53
+ - - ">="
54
+ - !ruby/object:Gem::Version
55
+ version: "0"
56
+ version:
57
+ requirements: []
58
+
59
+ rubyforge_project:
60
+ rubygems_version: 1.3.5
61
+ signing_key:
62
+ specification_version: 2
63
+ summary: Database-backed asynchronous priority queue system -- Extracted from Shopify
64
+ test_files:
65
+ - spec/database.rb
66
+ - spec/delayed_method_spec.rb
67
+ - spec/job_spec.rb
68
+ - spec/story_spec.rb