RubyGems - bobik - Versions diffs - 0.0.1 → 0.0.2 - Mend

bobik 0.0.1 → 0.0.2

Files changed (3)

data/README.md CHANGED

@@ -1,6 +1,44 @@
-bobik_ruby_gem
-==============
+## Web Scraping in Ruby using Bobik
-Bobik SDK for Ruby
+This is a community-supported Bobik SDK for web scraping in Ruby.
-TODO: provide documentation on how to install and use the SDK
+### Installing
++ Either install directly and system-wide:
+  1. Run `gem install bobik` from the command line
+  2. Add `require 'bobik'` to your Ruby code
++ Or, add to bundler:
+  1. Add `gem 'bobik'` to your Gemfile
+  2. Unless you're using Rails (which includes all gems from Gemfile automatically), add `require 'bobik'` to your Ruby code
+### Using
+Here's a quick example to get you started.
+```ruby
+  client = Bobik::Client.new(:auth_token => YOUR_AUTH_TOKEN, :timeout_ms => 60000)
+  sample_data = {
+    urls:       ['amazon.com', 'zynga.com', 'http://finance.yahoo.com/'],
+    queries:    ["//th", "//img/@src", "return document.title", "return $('script').length"]
+  }
+  client.scrape(sample_data, true) do |results, errors|
+    puts "Errors: #{errors}"
+    results.each do |url, queries|
+      puts "Printing results for #{url}"
+      queries.each do |query, result|
+        puts " Result of query #{query}: #{result}"
+      end
+    end
+  end
+```
+Full API reference is available at http://usebobik.com/sdk/
+### Contributing
+Write to support@usebobik.com to become a collaborator.
+### Bugs?
+Submit them here on GitHub: https://github.com/emirkin/bobik_ruby_gem/issues
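
The nested `each` loops in the README example suggest that `results` is a hash keyed by URL whose values are hashes keyed by query. The sketch below walks a hand-written structure of that shape without calling the service. The sample data and the `format_results` helper are assumptions made for illustration, not part of the gem, and real responses may look different.

```ruby
# Hypothetical sample shaped like the README's block arguments; real
# responses from the service may look different.
results = {
  'http://example.com/' => {
    'return document.title' => ['Example Domain'],
    '//img/@src'            => []
  }
}
errors = []

# Hypothetical helper: collects the same lines the README example prints,
# so they can be inspected before being written out.
def format_results(results, errors)
  lines = ["Errors: #{errors}"]
  results.each do |url, queries|
    lines << "Printing results for #{url}"
    queries.each do |query, result|
      lines << " Result of query #{query}: #{result}"
    end
  end
  lines
end

puts format_results(results, errors)
```

Returning the lines instead of printing inside the loops keeps the traversal easy to check in isolation.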

data/lib/bobik/client.rb CHANGED

@@ -2,17 +2,26 @@ require 'json'
 require 'httparty'
 module Bobik
+  # Author::    Eugene Mirkin
+  # This is the main class for interacting with the Bobik platform.
   class Client
     include HTTParty
     base_uri 'https://usebobik.com/api/v1'
+    # Notable parameters:
+    # * :auth_token - [required] authentication token
+    # * :timeout_ms - [optional] how long to wait, in milliseconds, for the job to finish (default: 60000)
+    # * :logger - [optional] any logger that conforms to the Log4r interface
     def initialize(opts)
       @auth_token = opts[:auth_token] || raise(Error.new("'auth_token' was not provided"))
-      @timeout_ms = opts[:timeout_ms] || 30000
+      @timeout_ms = opts[:timeout_ms] || 60000
       @log = opts[:logger] || (defined?(Rails.logger) && Rails.logger)
     end
+    # Submit a scraping request.
+    # The callback block will be invoked when results arrive.
+    # If asynchronous mode is used, the method returns right away.
+    # Otherwise, it blocks until results arrive.
     def scrape(request, block_until_done, &block)
       request = Marshal.load(Marshal.dump(request))
       request[:auth_token] = @auth_token
@@ -55,7 +64,7 @@ module Bobik
       block.call(results, errors)
     end
+    # A single call to get a given job's status with or without results
     def get_job_data(job_id, with_results)
       job_response = self.class.get('/jobs.json', :body => {
         auth_token: @auth_token,
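
Two behaviours are visible in the `client.rb` hunks above: the constructor falls back to a 60000 ms timeout when `:timeout_ms` is omitted, and `scrape` deep-copies the request with a `Marshal` round trip before adding the auth token. The sketch below reproduces only those two pieces in isolation. `SketchClient`, `SketchError`, and `prepare` are hypothetical names introduced here; the gem's own classes are not shown in full in this diff, and nothing below performs an HTTP call.

```ruby
# Stand-in for the gem's own Error class, which this diff does not show.
class SketchError < StandardError; end

# Hypothetical class reproducing only the option handling and the request
# copy visible in the diff.
class SketchClient
  attr_reader :timeout_ms

  def initialize(opts)
    @auth_token = opts[:auth_token] || raise(SketchError, "'auth_token' was not provided")
    @timeout_ms = opts[:timeout_ms] || 60000 # default as of 0.0.2 (was 30000 in 0.0.1)
  end

  # Deep-copy the request via a Marshal round trip, then add the token to
  # the copy only, leaving the caller's hash untouched.
  def prepare(request)
    copy = Marshal.load(Marshal.dump(request))
    copy[:auth_token] = @auth_token
    copy
  end
end

client   = SketchClient.new(:auth_token => 'placeholder-token')
original = { urls: ['example.com'], queries: ['//th'] }
prepared = client.prepare(original)
prepared[:urls] << 'example.org' # mutates the copy's array, not the original's

puts client.timeout_ms            # prints 60000
puts original.key?(:auth_token)   # prints false
puts original[:urls].inspect      # prints ["example.com"]
```

A shallow `dup` would have shared the nested `urls` array between both hashes; the `Marshal` round trip copies nested objects as well, at the cost of requiring everything in the request to be serializable.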

metadata CHANGED

@@ -1,7 +1,7 @@
 --- !ruby/object:Gem::Specification
 name: bobik
 version: !ruby/object:Gem::Version
-  version: 0.0.1
+  version: 0.0.2
   prerelease:
 platform: ruby
 authors: