Skip to content

Commit

Permalink
Initial extraction
Browse files Browse the repository at this point in the history
  • Loading branch information
Tobias Luetke (home) committed Feb 17, 2008
0 parents commit 75b49dc
Show file tree
Hide file tree
Showing 11 changed files with 600 additions and 0 deletions.
20 changes: 20 additions & 0 deletions MIT-LICENSE
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
Copyright (c) 2005 Tobias Luetke

Permission is hereby granted, free of charge, to any person obtaining
a copy of this software and associated documentation files (the
"Software"), to deal in the Software without restriction, including
without limitation the rights to use, copy, modify, merge, publish,
distribute, sublicense, and/or sell copies of the Software, and to
permit persons to whom the Software is furnished to do so, subject to
the following conditions:

The above copyright notice and this permission notice shall be
included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOa AND
NONINFRINGEMENT. IN NO EVENT SaALL THE AUTHORS OR COPYRIGHT HOLDERS BE
LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION
WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
103 changes: 103 additions & 0 deletions README
Original file line number Diff line number Diff line change
@@ -0,0 +1,103 @@
Delayed::Job
============

Delated_job (or DJ) encapsulates the common pattern of asynchronously executing longer tasks in the background.

It is a direct extraction from Shopify where the job table is responsible for a multitude of core tasks. Amongst those tasks are:

* sending massive newsletters
* image resizing
* http downloads
* updating smart collections
* updating solr, our search server, after product changes
* batch imports
* spam checks

== Setup ==

The library evolves around a delayed_jobs table which looks as follows:

create_table :delayed_jobs, :force => true do |table|
table.integer :priority, :default => 0
table.integer :attempts, :default => 0
table.text :handler
table.string :last_error
table.datetime :run_at
table.timestamps
end

== Usage ==

Jobs are simple ruby objects with a method called perform. Any object which responds to perform can be stuffed into the jobs table.
Job objects are serialized to yaml so that they can later be resurrected by the job runner.

class NewsletterJob < Struct.new(:text, :emails)
def perform
emails.each { |e| NewsletterMailer.deliver_text_to_email(text, e) }
end
end

Delayed::Job.enqueue NewsletterJob.new('lorem ipsum...', Customers.find(:all).collect(&:email))

There is also a second way to get jobs in the queue: send_later.


BatchImporter.new(Shop.find(1)).send_later(:import_massive_csv, massive_csv)


This will simply create a Delayed::PerformableMethod job in the jobs table which serializes all the parameters you pass to it. There are some special smarts for active record objects
which are stored as their text representation and loaded from the database fresh when the job is actually run later.


== Running the tasks ==

You can invoke rake jobs:work which will start working off jobs. You can cancel the rake task by CTRL-C.

At Shopify we run the the tasks from a simple script/job_runner which is being invoked by runnit:

#!/usr/bin/env ruby
require File.dirname(__FILE__) + '/../config/environment'

SLEEP = 15
RESTART_AFTER = 1000

trap('TERM') { puts 'Exiting...'; $exit = true }
trap('INT') { puts 'Exiting...'; $exit = true }

# this script dies after several runs to prevent memory leaks.
# runnit will immediately start it again.
count, runs_left = 0, RESTART_AFTER

loop do

count = 0

# this requires the locking plugin, also from jadedPixel
ActiveRecord::base.aquire_lock("jobs table worker", 10) do
puts 'got lock'

realtime = Benchmark.realtime do
count = Delayed::Job.work_off
end
end

runs_left -= 1

break if $exit

if count.zero?
sleep(SLEEP)
else
status = "#{count} jobs completed at %.2f j/s ..." % [count / realtime]
RAILS_DEFAULT_LOGGER.info status
puts status
end

if $exit or runs_left <= 0
break
end
end

== Todo ==

Work out a locking mechanism which would allow several job runners to run at the same time, spreading the load between them.
5 changes: 5 additions & 0 deletions init.rb
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
require File.dirname(__FILE__) + '/lib/delayed/message_sending'
require File.dirname(__FILE__) + '/lib/delayed/performable_method'
require File.dirname(__FILE__) + '/lib/delayed/job'

Object.send(:include, Delayed::MessageSending)
145 changes: 145 additions & 0 deletions lib/delayed/job.rb
Original file line number Diff line number Diff line change
@@ -0,0 +1,145 @@
module Delayed

class DeserializationError < StandardError
end

class Job < ActiveRecord::Base
ParseObjectFromYaml = /\!ruby\/\w+\:([^\s]+)/

set_table_name :delayed_jobs

class Runner
attr_accessor :logger, :jobs
attr_accessor :runs, :success, :failure

def initialize(jobs, logger = nil)
@jobs = jobs
@logger = logger
self.runs = self.success = self.failure = 0
end

def run

ActiveRecord::Base.cache do
ActiveRecord::Base.transaction do
@jobs.each do |job|
self.runs += 1
begin
time = Benchmark.measure do
job.perform
ActiveRecord::Base.uncached { job.destroy }
self.success += 1
end
logger.debug "Executed job in #{time.real}"
rescue DeserializationError, StandardError, RuntimeError => e
if logger
logger.error "Job #{job.id}: #{e.class} #{e.message}"
logger.error e.backtrace.join("\n")
end
ActiveRecord::Base.uncached { job.reshedule e.message }
self.failure += 1
end
end
end
end

self
end
end

def self.enqueue(object, priority = 0)
raise ArgumentError, 'Cannot enqueue items which do not respond to perform' unless object.respond_to?(:perform)

Job.create(:handler => object, :priority => priority)
end

def handler=(object)
self['handler'] = object.to_yaml
end

def handler
@handler ||= deserialize(self['handler'])
end

def perform
handler.perform
end

def reshedule(message)
self.attempts += 1
self.run_at = self.class.time_now + 5.minutes
self.last_error = message
save!
end

def self.peek(limit = 1)
if limit == 1
find(:first, :order => "priority DESC, run_at ASC", :conditions => ['run_at <= ?', time_now])
else
find(:all, :order => "priority DESC, run_at ASC", :limit => limit, :conditions => ['run_at <= ?', time_now])
end
end

def self.work_off(limit = 100)
jobs = Job.find(:all, :conditions => ['run_at <= ?', time_now], :order => "priority DESC, run_at ASC", :limit => limit)

Job::Runner.new(jobs, logger).run
end

protected

def self.time_now
(ActiveRecord::Base.default_timezone == :utc) ? Time.now.utc : Time.now
end

def before_save
self.run_at ||= self.class.time_now
end

private

def deserialize(source)
attempt_to_load_file = true

begin
handler = YAML.load(source) rescue nil
return handler if handler.respond_to?(:perform)

if handler.nil?
if source =~ ParseObjectFromYaml

# Constantize the object so that ActiveSupport can attempt
# its auto loading magic. Will raise LoadError if not successful.
attempt_to_load($1)

# If successful, retry the yaml.load
handler = YAML.load(source)
return handler if handler.respond_to?(:perform)
end
end

if handler.is_a?(YAML::Object)

# Constantize the object so that ActiveSupport can attempt
# its auto loading magic. Will raise LoadError if not successful.
attempt_to_load(handler.class)

# If successful, retry the yaml.load
handler = YAML.load(source)
return handler if handler.respond_to?(:perform)
end

raise DeserializationError, 'Job failed to load: Unknown handler. Try to manually require the appropiate file.'

rescue TypeError, LoadError, NameError => e

raise DeserializationError, "Job failed to load: #{e.message}. Try to manually require the required file."
end
end

def attempt_to_load(klass)
klass.constantize
end

end
end
7 changes: 7 additions & 0 deletions lib/delayed/message_sending.rb
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
module Delayed
module MessageSending
def send_later(method, *args)
Delayed::Job.enqueue Delayed::PerformableMethod.new(self, method.to_sym, args)
end
end
end
37 changes: 37 additions & 0 deletions lib/delayed/performable_method.rb
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
module Delayed
class PerformableMethod < Struct.new(:object, :method, :args)
AR_STRING_FORMAT = /^AR\:([A-Z]\w+)\:(\d+)$/

def initialize(object, method, args)
raise NoMethodError, "undefined method `#{method}' for #{self.inspect}" unless object.respond_to?(method)

self.object = dump(object)
self.args = args.map { |a| dump(a) }
self.method = method.to_sym
end

def perform
load(object).send(method, *args.map{|a| load(a)})
end

private

def load(arg)
case arg
when AR_STRING_FORMAT then $1.constantize.find($2)
else arg
end
end

def dump(arg)
case arg
when ActiveRecord::Base then ar_to_string(arg)
else arg
end
end

def ar_to_string(obj)
"AR:#{obj.class}:#{obj.id}"
end
end
end
33 changes: 33 additions & 0 deletions spec/database.rb
Original file line number Diff line number Diff line change
@@ -0,0 +1,33 @@
$:.unshift(File.dirname(__FILE__) + '/../lib')

require 'rubygems'
require 'active_record'
require File.dirname(__FILE__) + '/../init'

ActiveRecord::Base.logger = Logger.new(nil)
ActiveRecord::Base.establish_connection(:adapter => 'sqlite3', :database => '/tmp/jobs.sqlite')
ActiveRecord::Migration.verbose = false

def reset_db
ActiveRecord::Schema.define do

create_table :delayed_jobs, :force => true do |table|
table.integer :priority, :default => 0
table.integer :attempts, :default => 0
table.text :handler
table.string :last_error
table.datetime :run_at
table.timestamps
end

create_table :stories, :force => true do |table|
table.string :text
end

end
end

# Purely useful for test cases...
class Story < ActiveRecord::Base
def tell; text; end
end
Loading

0 comments on commit 75b49dc

Please sign in to comment.