metacrunch 4.2.0 → 4.2.1 (RubyGems)

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (8)

checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
-SHA1:
-  metadata.gz: 1fc2560b2bb768757c384d71a806509d594e7610
-  data.tar.gz: 8cbb0e384582550d29f94daa2d7521942cbe14c9
+SHA256:
+  metadata.gz: 6c1facce15096151df3186f7d48245b1c06bebb231b9d6dabeb70d569c0bb06c
+  data.tar.gz: 41487b86683753e2f8eba95d1e9dbea47efa0af5c36b13c1bb343f0b59f714ab
 SHA512:
-  metadata.gz: eaf6d9b6b72b7cadc92dae0d4d9e1204f6d43fe909f2f5f60bda85ad0407aa6fffe567a119361d3cef3fa15ed2d83ca5abc44d314eafc3f59b7b936c49a07ab5
-  data.tar.gz: 0204e2e3284a53c007ea086b883ff52fbb87e0731616fe83c1c4d636134985506c39d885ec8ff4bc4cc8708e3a51d02c27e28fe79b732798f7f3bfc24905ac6d
+  metadata.gz: '04015927726756e1f5839d4b4bceac287400b729a1828a3652d15fc456720245ade702d8b1fddec83aaf41418df2c404006f0cd89805cdfc0aba8bd04e737579'
+  data.tar.gz: f6e8d9719618e8c1f6c8b68c8a28c008f9f710929b22d878352076cd3bf7f99e4503d7604e78472e8c70d052f3aa07eb8ccc0d778ed31dc19d040de893f4d484
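
The checksum file above switches from SHA1 to SHA256 entries while keeping SHA512. As a minimal sketch, digests in this shape can be computed for any local file with Ruby's standard `digest` library. The helper name `gem_checksums` and the temporary file are assumptions made for illustration; they are not part of metacrunch or RubyGems.

```ruby
require "digest"
require "tempfile"

# Hypothetical helper (not part of RubyGems): returns hex digests keyed
# like the sections of checksums.yaml.
def gem_checksums(path)
  {
    "SHA256" => Digest::SHA256.file(path).hexdigest,
    "SHA512" => Digest::SHA512.file(path).hexdigest
  }
end

# Demo on a throwaway file; a real check would point at the
# metadata.gz or data.tar.gz extracted from the .gem archive.
Tempfile.create("payload") do |file|
  file.write("example payload")
  file.flush
  sums = gem_checksums(file.path)
  puts sums["SHA256"].length # 64 hex characters
  puts sums["SHA512"].length # 128 hex characters
end
```

Comparing the computed strings with the values in `checksums.yaml` is then a plain string equality check.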

data/.circleci/config.yml ADDED

@@ -0,0 +1,35 @@
+version: 2.1
+orbs:
+  ruby: circleci/ruby@1.1.1
+jobs:
+  build:
+    docker:
+      - image: circleci/ruby:2.6-node-browsers
+    working_directory: ~/repo
+    steps:
+      - checkout
+      - run:
+          name: Install dependencies
+          command: bundle install --jobs=4 --retry=3 --path vendor/bundle
+      - run:
+          name: Install CodeClimate test coverage reporter
+          command: |
+            curl -L https://codeclimate.com/downloads/test-reporter/test-reporter-latest-linux-amd64 > ./cc-test-reporter
+            chmod +x ./cc-test-reporter
+            ./cc-test-reporter before-build
+      - run:
+          name: Run tests
+          command: |
+            mkdir /tmp/test-results
+            bundle exec rspec --format progress --format RspecJunitFormatter --out /tmp/test-results/rspec.xml
+      - run:
+          name: Upload test coverage report to CodeClimate
+          command: ./cc-test-reporter after-build --exit-code $?

data/Gemfile CHANGED

@@ -5,7 +5,6 @@ gemspec
 group :development do
   gem "bundler", ">= 1.15"
   gem "rake",    ">= 12.1"
-  gem "rspec",   ">= 3.5.0", "< 4.0.0"
   if !ENV["CI"]
     gem "pry-byebug", ">= 3.5.0"
@@ -13,5 +12,7 @@ group :development do
 end
 group :test do
-  gem "simplecov", ">= 0.15.0"
+  gem "rspec",                 ">= 3.5.0", "< 4.0.0"
+  gem "rspec_junit_formatter", ">= 0.3.0"
+  gem "simplecov",             "= 0.17.1"
 end

data/Readme.md CHANGED

@@ -3,7 +3,8 @@ metacrunch
 [![Gem Version](https://badge.fury.io/rb/metacrunch.svg)](http://badge.fury.io/rb/metacrunch)
 [![Code Climate](https://codeclimate.com/github/ubpb/metacrunch/badges/gpa.svg)](https://codeclimate.com/github/ubpb/metacrunch)
-[![Build Status](https://travis-ci.org/ubpb/metacrunch.svg)](https://travis-ci.org/ubpb/metacrunch)
+[![Test Coverage](https://codeclimate.com/github/ubpb/metacrunch/badges/coverage.svg)](https://codeclimate.com/github/ubpb/metacrunch/coverage)
+[![CircleCI](https://circleci.com/gh/ubpb/metacrunch.svg?style=svg)](https://circleci.com/gh/ubpb/metacrunch)
 metacrunch is a simple and lightweight data processing and ETL ([Extract-Transform-Load](http://en.wikipedia.org/wiki/Extract,_transform,_load))
 toolkit for Ruby.
@@ -28,7 +29,7 @@ metacrunch gives you a simple DSL ([Domain-specific language](https://en.wikiped
 Let's walk through the main steps of creating ETL jobs with metacrunch. For a collection of working examples check out our [metacrunch-demo](https://github.com/ubpb/metacrunch-demo) repository.
-#### It's Ruby
+### It's Ruby
 Every `.metacrunch` job is a regular Ruby file and you can use any valid Ruby code like declaring methods, classes, variables, requiring other Ruby
 files and so on.
@@ -50,12 +51,14 @@ require "SomeGem"
 require_relative "./some/other/ruby/file"
 ```
-#### Defining a source
+### Defining a source
-A source is an object that reads data (e.g. from a file or an external system) into the metacrunch processing pipeline. Implementing sources is easy – a source can be any Ruby object that responds to `#each`. For more information on how to implement sources [see notes below](#implementing-sources).
+A source is an object that emits data objects (e.g. from a file or an external system) into the metacrunch processing pipeline. Implementing sources is easy – a source is a Ruby `Enumerable` (any object that responds to the `#each` method). For more information on how to implement sources [see notes below](#implementing-sources).
 You must declare a source to allow a job to run.
+A source iterates over it's entries and emits every entry as a data object into the transformation pipeline, by passing it to the first transformation.
 ```ruby
 # File: my_etl_job.metacrunch
@@ -66,15 +69,15 @@ source Metacrunch::File::Source.new(ARGV)
 source MySource.new
 ```
-#### Defining transformations
+### Defining transformations
-To process, transform or manipulate data use the `#transformation` hook. A transformation is implemented with a `callable` object (any Ruby object that responds to `#call`. E.g. a lambda). To learn more about transformations check the section about [implementing transformations](#implementing-transformations) below.
+To process, transform or manipulate data use the `#transformation` hook. A transformation is implemented with a `callable` object (any Ruby object that responds to `#call`. E.g. a `Proc`). To learn more about transformations check the section about [implementing transformations](#implementing-transformations) below.
-The current data object (the last object yielded by the source) will be passed to the first transformation as a parameter. The return value of a transformation will then be passed to the next transformation and so on.
+The *current data object* (the current object emitted by the source) will be passed to the first transformation as a parameter. The return value of a transformation will then be passed to the next transformation and so on.
-There are two exceptions to that rule.
+There are two exceptions to that rule:
-* If you return `nil` the current data object will be dismissed and the next transformation won't be called.
+* If you return `nil` the current data object will be dismissed and the next transformation won't be called. The process continues with the next data object that will be emitted by the source and the first transformation.
 * If you return an `Enumerator` the object will be expanded and the following transformations will be called with each element of the `Enumerator`.
 ```ruby
@@ -85,27 +88,29 @@ source [1,2,3,4,5,6,7,8,9]
 # A transformation is implemented with a `callable` object (any
 # object that responds to #call).
-# Lambdas responds to #call
+# Proc responds to #call
 transformation ->(number) {
-  # Called for each data object that has been read by a source.
+  # Called for each data object that has been emitted by a source.
   # You must return the data to keep it in the pipeline. Dismiss the
   # data conditionally by returning nil.
   number if number.odd?
 }
+# Only called for odd numbers as even numbers gets dismissed in the previous
+# transformation.
 transformation ->(odd_number) {
   odd_number * 2
 }
-# MyTransformation implements #call
+# MyTransformation implements #call. Gets called with the prevous number times 2.
 transformation MyTransformation.new
 ```
-#### Using a transformation buffer
+### Using a transformation buffer
 Sometimes it is useful to buffer data between transformation steps to allow a transformation to work on larger bulks of data. metacrunch uses a simple transformation buffer to achieve this.
-To use a transformation buffer add the `:buffer` option to your transformation. You can pass a positive integer value as a buffer size, or as an advanced option you can pass a `Proc` object. The buffer flushes every time the buffer reaches the given size or if the `Proc` returns `true`.
+To use a transformation buffer add the `:buffer` option to your transformation. You can pass a positive integer value as a buffer size, or as an advanced option you can pass a `Proc` object. The buffer flushes every time the buffer reaches the given size or if the `Proc` returns `true`. The buffer also flushes after the last data object was emitted by the source.
 ```ruby
 # File: my_etl_job.metacrunch
@@ -128,11 +133,9 @@ transformation ->(bulk) {
 }
 ```
-#### Defining a destination
-A destination is an object that writes the transformed data to an external system. Implementing destinations is easy – [see notes below](#implementing-destinations). A destination receives the return value from the last transformation as a parameter if the return value from the last transformation was not `nil`.
+### Defining a destination
-Using destinations is optional. In most cases using the last transformation to write the data to an external system is fine. Destinations are useful if the required code is more complex.
+A destination is an object that writes the transformed data to an external system (e.g. a file, database etc.). Implementing destinations is easy – [see notes below](#implementing-destinations). A destination receives the return value from the last transformation as a parameter if the return value from the last transformation was not `nil`.
 ```ruby
 # File: my_etl_job.metacrunch
@@ -140,20 +143,20 @@ Using destinations is optional. In most cases using the last transformation to w
 destination MyDestination.new
 ```
-#### Pre/Post process
+### Pre/Post process
 To run arbitrary code before the first transformation is run on the first data object use the `#pre_process` hook. To run arbitrary code after the last transformation is run on the last data object use `#post_process`. Like transformations, `#post_process` and `#pre_process` must be implemented using a `callable` object.
 ```ruby
 pre_process -> {
-  # Lambdas responds to #call
+  # Proc responds to #call
 }
 # MyCallable class defines #call
 post_process MyCallable.new
 ```
-#### Defining job options
+### Defining job options
 metacrunch has build-in support to parameterize jobs. Using the `options` hook you can declare options that can be set/overridden by the CLI when [running your jobs](#running-etl-jobs).
@@ -191,9 +194,7 @@ Job options:
                                      REQUIRED
 ```
-To learn more about defining options take a look at the [reference below](#defining-job-options).
-#### Require non-option arguments
+### Require non-option arguments
 All non-option arguments that get passed to the job when running are available to the `ARGV` constant. If your job requires such arguments (e.g. if you work with a list of files) you can require it.
@@ -242,11 +243,11 @@ $ [bundle exec] metacrunch [options] JOB_FILE [job-options] [ARGS...]
 Implementing sources
 --------------------
-A metacrunch source is any Ruby object that responds to the `each` method that yields data objects one by one.
+A metacrunch source is any Ruby `Enumerable` object (an object that responds to the `#each` method) that yields data objects one by one.
 The data is usually a `Hash` instance, but could be other structures as long as the rest of your pipeline is expecting it.
-Any `enumerable` object (e.g. `Array`) responds to `each` and can be used as a source in metacrunch.
+Any `Enumerable` object (e.g. `Array`) responds to `#each` and can be used as a source in metacrunch.
 ```ruby
 # File: my_etl_job.metacrunch
@@ -288,9 +289,9 @@ source MyCsvSource.new("my_data.csv")
 Implementing transformations
 ----------------------------
-A metacrunch transformation is implemented as a `callable` object. A `callable` in Ruby is any object that responds to the `call` method.
+A metacrunch transformation is implemented as a `callable` object. A `callable` in Ruby is any object that responds to the `#call` method.
-Procs and Lambdas in Ruby respond to `call`. They can be used to implement transformations inline.
+`Proc`s in Ruby respond to `#call`. They can be used to implement transformations inline.
 ```ruby
 # File: my_etl_job.metacrunch
@@ -329,7 +330,7 @@ transformation MyTransformation.new
 Implementing destinations
 -------------------------
-A destination is any Ruby object that responds to `write(data)` and `close`.
+A destination is any Ruby object that responds to `#write(data)` and `#close`.
 Like sources you are encouraged to implement destinations as classes.

data/lib/metacrunch/cli.rb CHANGED

@@ -51,7 +51,7 @@ private
   def run!(job_file)
     if job_file.blank?
       error "You need to provide a job file."
-    elsif !File.exists?(job_file)
+    elsif !File.exist?(job_file)
       error "The file `#{job_file}` doesn't exist."
     else
       job_filename = File.expand_path(job_file)
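
The only code change in this file replaces `File.exists?` with `File.exist?`. The plural form was deprecated for a long time and removed in Ruby 3.2, so the old call raises `NoMethodError` on current Rubies. The sketch below mirrors the validation in plain Ruby. The helper name `job_file_error` is hypothetical, and it uses `nil?`/`strip.empty?` in place of ActiveSupport's `blank?` so the example stays self-contained.

```ruby
require "tempfile"

# Hypothetical helper mirroring the check in run!; returns an error
# message, or nil when the job file can be used.
def job_file_error(job_file)
  if job_file.nil? || job_file.strip.empty?
    "You need to provide a job file."
  elsif !File.exist?(job_file) # File.exists? no longer exists in Ruby 3.2+
    "The file `#{job_file}` doesn't exist."
  end
end

puts job_file_error("")
puts job_file_error("/no/such/job.metacrunch")

Tempfile.create("job") do |file|
  p job_file_error(file.path) # an existing file yields nil
end
```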

data/lib/metacrunch/version.rb CHANGED

@@ -1,3 +1,3 @@
 module Metacrunch
-  VERSION = "4.2.0"
+  VERSION = "4.2.1"
 end

metadata CHANGED

@@ -1,16 +1,15 @@
 --- !ruby/object:Gem::Specification
 name: metacrunch
 version: !ruby/object:Gem::Version
-  version: 4.2.0
+  version: 4.2.1
 platform: ruby
 authors:
 - René Sprotte
 - Michael Sievers
 - Marcel Otto
-autorequire:
 bindir: exe
 cert_chain: []
-date: 2017-10-10 00:00:00.000000000 Z
+date: 1980-01-02 00:00:00.000000000 Z
 dependencies:
 - !ruby/object:Gem::Dependency
   name: activesupport
@@ -40,16 +39,14 @@ dependencies:
     - - ">="
       - !ruby/object:Gem::Version
         version: 0.8.1
-description:
-email:
 executables:
 - metacrunch
 extensions: []
 extra_rdoc_files: []
 files:
+- ".circleci/config.yml"
 - ".gitignore"
 - ".rspec"
-- ".travis.yml"
 - Gemfile
 - License.txt
 - Rakefile
@@ -73,7 +70,6 @@ homepage: http://github.com/ubpb/metacrunch
 licenses:
 - MIT
 metadata: {}
-post_install_message:
 rdoc_options: []
 require_paths:
 - lib
@@ -88,9 +84,7 @@ required_rubygems_version: !ruby/object:Gem::Requirement
     - !ruby/object:Gem::Version
       version: '0'
 requirements: []
-rubyforge_project:
-rubygems_version: 2.6.11
-signing_key:
+rubygems_version: 3.6.9
 specification_version: 4
 summary: Data processing and ETL toolkit for Ruby
 test_files: []
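
The `date` field changing to 1980-01-02 is probably not a real release date. A likely explanation, stated here as an assumption since the diff itself does not say so, is that newer RubyGems versions stamp gems with a fixed default timestamp (Unix time 315619200) to support reproducible builds when `SOURCE_DATE_EPOCH` is not set. The snippet only verifies the arithmetic: that this epoch value corresponds to the date shown in the metadata.

```ruby
# Assumed default epoch used for reproducible gem builds; the value is
# taken as an assumption, not read from this diff.
assumed_epoch = 315_619_200

# Format it the same way the gemspec metadata prints the date field.
puts Time.at(assumed_epoch).utc.strftime("%Y-%m-%d %H:%M:%S.%N Z")
# => 1980-01-02 00:00:00.000000000 Z
```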

data/.travis.yml DELETED

@@ -1,4 +0,0 @@
-language: ruby
-rvm:
-  - ruby-2.3.5
-  - ruby-2.4.2