RubyGems - active_data_frame - Versions diffs - 0.1.3 → 0.1.5 - Mend

active_data_frame 0.1.3 → 0.1.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (11) hide show

checksums.yaml +4 -4
data/README.md +218 -3
data/active_data_frame.gemspec +1 -1
data/lib/active_data_frame/data_frame_proxy.rb +5 -1
data/lib/active_data_frame/database.rb +77 -62
data/lib/active_data_frame/row.rb +7 -3
data/lib/active_data_frame/table.rb +3 -4
data/lib/active_data_frame/version.rb +1 -1
data/lib/generators/active_data_frame/USAGE +20 -0
data/lib/generators/active_data_frame/install_generator.rb +3 -5
metadata +7 -6

checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
 SHA1:
-  metadata.gz: 9e1350ed7595307e7875b6430c24bab9a2fd90a2
-  data.tar.gz: 1d2f0c6eae0ddfb9ed4fa52d07526e7f9e47ec20
+  metadata.gz: 6e6c248d13e0f7f10933eca32158e4fb33a080e3
+  data.tar.gz: 28d52390deef35b8e582942612989f99e2026ed3
 SHA512:
-  metadata.gz: db812db474e0980059520b193b9c4fb67d36dbafaf865c28019ea247ab75b269ca26ccf4c6f146e1aec34897003c9f9a70be88550e002f2c514b4a3437ebce84
-  data.tar.gz: 2a2585b6f966cf5691f7d4d5155f8ea977f1f2f0213476ebdc88d3648d6511c71c743bc036ba2c4ba9b55e6610a817469e978b5c398dd6d642739fedfa2c8912
+  metadata.gz: 9fd95c152778f43ea9d3d3e09160a22ed355989b5fdc5e2cbbfb1f10b2290aab4db07d04344acb595f30f28a98eaf1282b4a461611ba6c5999b5a060fc60ae77
+  data.tar.gz: c0b32d8827258e8e8cf38e051758d8de8a1784bb2a4b19cdaada75af5b9b1541b3c45d15e6d2f033ca07ad1ec9422fc290c11b44e41fd2df5ef08b68bf805c47

data/README.md CHANGED

@@ -1,9 +1,9 @@
 # ActiveDataFrame
-ActiveDataFrame allows efficient writing, reading, and analytical queries on large tables of numerical data. You can think of it as a persistent NumPy or NArray with good support for slicing
-and aggregates without needing to load the entire dataset into memory.
+ActiveDataFrame allows efficient writing, reading, and analytical queries on large tables of numerical data. You can think of it as a persistent NumPy or NArray with good support for slicing and aggregates without needing the entire dataset in memory.
 The library depends on ActiveRecord and currently supports the following relational databases:
 * PostgreSQL
 * MySQL
 * SQLite
@@ -24,16 +24,231 @@ Or install it yourself as:
     $ gem install active_data_frame
+## Examples
+### Using the generator
+    # Generate a new data frame named Statistic, with a datapoint type of double, and a block size of 100
+    $ rails generate active_data_frame:install Statistic double 100
+    # Then run migrations to create the underlying table
+    $ rake db:migrate
 ## Usage
+### Generator
+The easiest way to get started is to use the in-built generator to generate a new
+`ActiveDataFrame`. This will generate the required migrations for the data frame
+and generate a new module that you can include inside an `ActiveRecord` model to give it access to the frame.
+```
+    # Generate a new MeterReading data frame type, with a block type of
+    # double and a block size of 48 data points
+    $ rails generate active_data_frame:install MeterReading double 48
+    # Generate a new Dimension data frame type, with a block type of
+    # float and a block size of 10 data points.
+    # Inject the data-type for use into the Iris model
+    $ rails generate active_data_frame:install Dimension float 10 Iris
+    #
+    # Generate a new status data frame type with an integer block type
+    #
+    $ rails generate active_data_frame:install Status integer
+```
+### Writing to a data frame
+When you include a data frame in an ActiveRecord model, each instance of the model corresponds to a single row in the data frame. The columns are a series of points that stretch towards infinity in each direction.
+By default columns are indexed by integers, but you can set a static or dynamic column map so that you can easily have columns indexed by time, enum columns or use any other data type that serves as a useful index.
+You can write any number of data points to a row in the dataframe using #[]=
+        #E.g.
+        # Write to the row called readings from index 0. Here Sensor is the ActiveRecord model, readings is the name of the row
+        Sensor.first.readings[0] = 1,2,3
+        # Write to the row called readings from an offset at 1_000_000
+        Sensor.first.readings[1_000_000] = -10, -9, -8
+        #Writing to a row which has a column mapping applied, mapping times on integer indexes
+        MeterChannel.first.readings['2001-01-01'] = [1.3, 3.4]
+        #If you have enum columns you can use the #[enum_name]= setter instead.
+        Iris.first.dimensions.sepal_length = 5.3
+        Iris.first.dimensions.petal_width  = 4.3
+        # You can set data for multiple rows at once, by using the frame accessor on the model's class instead of an instance.
+        E.g.
+        # This sets the reading at index 1 to 5 for ALL sensors
+        Sensor.readings[1] = 5
+        # You can use AR queries to refine which set of rows you are updating at once.
+        # E.g.
+        MeterChannel.where("created_at < ?", "2001-01-01").readings['2001-01-01'] = [5,6,7]
+ActiveDataFrame supports very quick writing of 1000's of values for a single row at a time. Don't be afraid to write large arrays of data like this.
+### Reading from a data frame
+Reading from a data frame is similar to writing and uses the #[] method.
+You can read individual values, a range of values, and sparse selections of columns.
+        #E.g.
+        # Read a single value
+        Sensor.first.readings[0] # => Matrix(1x1)[...]
+        # Read a range of 3 values values
+        Sensor.first.readings[0...3] # => Matrix(1x3)[...]
+        # Read some non contiguous values and ranges
+        Sensor.first.readings[5, 10, 4..7, 9..10] = Matrix(1x8)[...]
+        #Reading from a row which has a column mapping that uses times
+        MeterChannel.first.readings['2001-01-01'...'2002-01-01'] = Matrix(1xM)[....]
+        #If you have enum columns you can use the #[enum_name] getter for single columns
+        Iris.first.dimensions.sepal_length
+        Iris.first.dimensions.petal_width
+        # And use symbols as column indices (this assumes a specific ordering of enum columns)
+        Iris.first.dimensions[:sepal_length...:petal_width]
+Similar to when writing data, you can also read data from multiple rows at once.
+Just use the active data frame accessor on the model class instead of a model instance. E.g.
+        Sensor.readings[0..5] # => Matrix(Nx5)
+### Deleting
+    You can use #clear(range_or_indices) to delete data.
+    Deleting data is equivalent to setting all data points to zero.
+    So the operation row[index] = [0, 0, 0, 0.....0] is equivalent
+    to the operation row.clear(index...end_index). ActiveDataFrame
+    will automatically trim empty blocks.
-TODO: Write usage instructions here
+### Batching
+If performing many small reads and writes from a data frame in a single atomic operation
+it makes sense to do this in a single transaction. Active Data Frame provides the `ActiveDataFrame::Database.batch do ... end` method. This method will not only ensure your operations occur in a single transaction, but also that they are sent to the underlying database adapter as a single command.
+### Analytical Queries
+Any read of a dataframe returns an RMatrix instance. An RMatrix supports a large number of
+statistical methods and list methods. (See the RMatrix readme for more details).
+E.g.
+        cpu_loads = CPU.first.loads['2001-01-01'..'2005-01-01']
+        puts cpu_loads.avg
+        puts cpu_loads.stddev
+        puts cpu_loads.max
+        # ... and many more
+However in some cases you are dealing with so much data it is not possible, or too slow to retreive all the data at once and manipulate in-memory. ActiveDataFrame supports performing a number of aggregate methods directly in the database. These are #avg, #min, #max and #sum. The syntax for this is almost identical to an ordinary read.
+        CPU.loads.avg['2001-01-01'...'2005-01-01'] # The average CPU load per period over all CPUS
+        CPU.where(manufacturer: :intel).loads.min['2001-01-01'...'2005-01-01'] # The minimum CPU load per period over all intel CPUS
+### Categorical data
+ActiveDataFrame provides a very basic abstraction for storing categorical data. This is done by storing categories as an integer data frame, and providing a map from integers to categories. The library will then allow you to use the category names in place of the raw underlying integers.
+E.g.
+    module HasStatus
+      include ActiveDataFrame::HasDataFrame('status', Blocks::StatusBlock, value_map: {
+        actual: 2,
+        estimated: 1,
+        unknown: 0
+      })
+    end
+    class CPU < ApplicationRecord
+      include HasStatus
+    end
+The CPU model above includes a dataframe with a status mapping. We can now do things like
+    CPU.first.status[0]    # => :unknown
+    CPU.first.status[0..5] # => [:unknown,:unknown,:unknown,:unknown,:unknown]
+    CPU.first.status[0] = :actual, :estimated
+    CPU.first.status[0..5] # => [:actual,:estimated,:unknown,:unknown,:unknown]
+### Time-series data
+We can use any datatype we like to index into a dataframe, so long as we can map it to an integer index. This makes active dataframes very well suited to storing large streams of interval data over time.
+For example we might define a mapping such that every half hour period in time corresponds to a colum in our dataframe. In the below example we might be counting the number of arrivals at an airport every half-hour.
+    module HasArrivals
+      include ActiveDataFrame::HasDataFrame('arrivals', Blocks::ArrivalBlock)
+      module ColumnMaps
+        def self.included(base)
+          base.arrivals_column_map Hash.new{|hash, time| ((time.to_time - Time.at(0)) / 1.hour).to_i rescue time.to_i }
+        end
+      end
+    end
+    class Airport < ApplicationRecord
+      include HasArrivals::ColumnMaps, HasArrivals
+    end
+Now we can use any value that implements #to_time to index into our dataframe. This supports both single indexes and ranges (...).
+E.g.
+    Airport.first.arrivals['2001-01-01'...'2002-01-01'] = Matrix(1xM)[....]
+### Column Mappings
+We can use any datatype we like to index into a dataframe, so long as we can map it to an integer index. See the section on Time-series data for one example of this. Columns can also be aliases to categories. An example of this is using ActiveDataFrame to model the classic Iris dataset.
+    class Iris < ApplicationRecord
+      include HasDimensions
+      dimension_column_names %i(sepal_length sepal_width petal_length petal_width)
+    end
+Here we have mapped the first four columns of our data frame to sepal_length, sepal_width, petal_length and petal_width.
+When using symbols as column names ActiveDataFrame provides some syntactic sugar for easily slicing and dicing frames.
+We can do things like:
+* Extract a slice of data:
+    `iris_results = Iris.where(species: :setosa).dimension[:sepal_width..:petal_length]`
+* Extract an entire column from a data-set using the column name:
+    `iris_results.sepal_width => V[[...]`]
+* Extract an entire column from a data-set using the column name:
+    `iris_results.sepal_width => V[[...]`]
+* Extract a single value from an instance:
+    `Iris.first.dimension.sepal_width.to_f`
+* Set one or more values for an instance or row at once:
+    `Iris.first.dimension.sepal_width = 13`
+    `Iris.all.dimension.petal_length = 5.2,6.3,5.4,1.1`
+### Configuration
+ActiveDataFrame supports project-wide configuration using
+    ActiveDataFrame.config do |config|
+      config.[config_option_name] = [config_value]
+    end
+Currently the following configuration options are supported:
+* `suppress_logs` The queries generated by ActiveDataFrame are quite verbose. If you would like to supress ActiveRecord logging for these queries, set this option to `true`
 ## Development
 After checking out the repo, run `bin/setup` to install dependencies. You can also run `bin/console` for an interactive prompt that will allow you to experiment.
 To install this gem onto your local machine, run `bundle exec rake install`. To release a new version, update the version number in `version.rb`, and then run `bundle exec rake release`, which will create a git tag for the version, push git commits and tags, and push the `.gem` file to [rubygems.org](https://rubygems.org).
+## Testing
 ## Contributing
 Bug reports and pull requests are welcome on GitHub at https://github.com/[USERNAME]/active_data_frame. This project is intended to be a safe, welcoming space for collaboration, and contributors are expected to adhere to the [Contributor Covenant](http://contributor-covenant.org) code of conduct.

data/active_data_frame.gemspec CHANGED

@@ -31,5 +31,5 @@ Gem::Specification.new do |spec|
   spec.add_development_dependency 'minitest-reporters', '~> 1.1', '>= 1.1.0'
   spec.add_development_dependency 'minitest-around', '0.4.1'
   spec.add_runtime_dependency     'activerecord', '~> 5.0'
-  spec.add_runtime_dependency     'rmatrix', '~> 0.1.15', '>=0.1.15'
+  spec.add_runtime_dependency     'rmatrix', '~> 0.1.17', '>=0.1.17'
 end

data/lib/active_data_frame/data_frame_proxy.rb CHANGED

@@ -57,8 +57,12 @@ module ActiveDataFrame
     end
     def method_missing(name, *args, &block)
+      if name.to_s.ends_with?(?=)
+        is_assignment = true
+        name = name.to_s.gsub(/=$/,'').to_sym
+      end
       if column_name_map && column_map[name]
-        self[name]
+        is_assignment ? self.[]=(name, *args) : self[name]
       else
         super
       end

data/lib/active_data_frame/database.rb CHANGED

@@ -15,15 +15,17 @@ module ActiveDataFrame
       else
         unless sql.empty?
           ActiveRecord::Base.transaction do
-            case ActiveRecord::Base.connection_config[:adapter]
-            when 'sqlite3'.freeze
-              ActiveRecord::Base.connection.raw_connection.execute_batch sql
-            when 'mysql2'
-              sql.split(';').reject{|x| x.strip.empty?}.each do |stmt|
-                ActiveRecord::Base.connection.execute(stmt)
+            ActiveDataFrame::DataFrameProxy.suppress_logs do
+              case ActiveRecord::Base.connection_config[:adapter]
+              when 'sqlite3'.freeze
+                ActiveRecord::Base.connection.raw_connection.execute_batch sql
+              when 'mysql2'
+                sql.split(';').reject{|x| x.strip.empty?}.each do |stmt|
+                  ActiveRecord::Base.connection.execute(stmt)
+                end
+              else
+                ActiveRecord::Base.connection.execute(sql)
               end
-            else
-              ActiveRecord::Base.connection.execute(sql)
             end
           end
         end
@@ -60,56 +62,16 @@ module ActiveDataFrame
     # Update block data for all blocks in a single call
     ##
     def bulk_update(existing)
-      ActiveDataFrame::DataFrameProxy.suppress_logs do
-        case ActiveRecord::Base.connection_config[:adapter]
-        when 'postgresql'.freeze
-          # Fast bulk update
-          updates = ''
-          existing.each do |period_index, (values, df_id)|
-            updates <<  "(#{df_id}, #{period_index}, #{values.map{|v| v.inspect.gsub('"',"'") }.join(',')}),"
-          end
-          perform_update(updates)
-        else
-          ids = existing.map {|_, (_, id)| id}
-          updates = block_type::COLUMNS.map.with_index do |column, column_idx|
-            [column, "CASE period_index\n#{existing.map{|period_index, (values, _)| "WHEN #{period_index} then #{values[column_idx]}"}.join("\n")} \nEND\n"]
-          end.to_h
-          update_statement = updates.map{|cl, up| "#{cl} = #{up}" }.join(', ')
-          Database.execute("UPDATE #{block_type.table_name} SET #{update_statement} WHERE
-            #{block_type.table_name}.data_frame_id IN (#{ids.join(',')})
-            AND #{block_type.table_name}.data_frame_type = '#{data_frame_type.name}'
-            AND #{block_type.table_name}.period_index IN (#{existing.keys.join(', ')});
-            "
-          )
-        end
-      end
-    end
-    def bulk_delete(id, indices)
-      ActiveDataFrame::DataFrameProxy.suppress_logs do
-        block_type.where(data_frame_id: id, period_index: indices).delete_all
-      end
-    end
-    ##
-    # Insert block data for all blocks in a single call
-    ##
-    def bulk_insert(new_blocks, instance)
-      ActiveDataFrame::DataFrameProxy.suppress_logs do
-        inserts = ''
-        new_blocks.each do |period_index, (values)|
-          inserts << \
-          case ActiveRecord::Base.connection_config[:adapter]
-          when 'postgresql', 'mysql2' then "(#{values.map{|v| v.inspect.gsub('"',"'") }.join(',')}, #{instance.id}, #{period_index}, '#{data_frame_type.name}'),"
-          else "(#{values.map{|v| v.inspect.gsub('"',"'") }.join(',')}, #{instance.id}, #{period_index}, '#{data_frame_type.name}'),"
-          end
+      case ActiveRecord::Base.connection_config[:adapter]
+      when 'postgresql'.freeze
+        #
+        # PostgreSQL Supports the fast setting of multiple update values that differ
+        # per row from a temporary table.
+        #
+        updates = ''
+        existing.each do |period_index, (values, df_id)|
+          updates <<  "(#{df_id}, #{period_index}, #{values.map{|v| v.inspect.gsub('"',"'") }.join(',')}),"
         end
-        perform_insert(inserts)
-      end
-    end
-    def perform_update(updates)
-      ActiveDataFrame::DataFrameProxy.suppress_logs do
         Database.execute(
           <<-SQL
           UPDATE #{block_type.table_name}
@@ -121,15 +83,68 @@ module ActiveDataFrame
             AND #{block_type.table_name}.data_frame_type = '#{data_frame_type.name}'
           SQL
         )
-        true
+      #
+      # For MySQL we use the ON DUPLICATE KEY UPDATE functionality.
+      # This relies on there being a unique index dataframe and period index
+      # on the blocks table.
+      # This tends to be faster than the general CASE based solution below
+      # but slower than the PostgreSQL solution above
+      #
+      when 'mysql2'.freeze
+        # Fast bulk update
+        updates, on_duplicate = "", ""
+        existing.each do |period_index, (values, df_id)|
+          updates << "(#{values.map{|v| v.inspect.gsub('"',"'") }.join(',')}, #{df_id}, #{period_index}, '#{data_frame_type.name}'),"
+        end
+        on_duplicate = block_type::COLUMNS.map do |cname|
+          "#{cname}=VALUES(#{cname})"
+        end.join(", ")
+        stmt = <<-SQL
+          INSERT INTO #{block_type.table_name} (#{block_type::COLUMNS.join(',')},data_frame_id,period_index,data_frame_type)
+          VALUES #{updates[0..-2]}
+          ON DUPLICATE KEY UPDATE #{on_duplicate}
+        SQL
+        Database.execute(stmt)
+      else
+        #
+        # General CASE based solution for multiple differing updates
+        # set per row.
+        # We use a CASE statement per column which determines the column
+        # to set based on the period index
+        #
+        ids = existing.map {|_, (_, id)| id}
+        updates = block_type::COLUMNS.map.with_index do |column, column_idx|
+          [column, "CASE period_index\n#{existing.map{|period_index, (values, _)| "WHEN #{period_index} then #{values[column_idx]}"}.join("\n")} \nEND\n"]
+        end.to_h
+        update_statement = updates.map{|cl, up| "#{cl} = #{up}" }.join(', ')
+        Database.execute(<<-SQL
+          UPDATE #{block_type.table_name} SET #{update_statement} WHERE
+          #{block_type.table_name}.data_frame_id IN (#{ids.join(',')})
+          AND #{block_type.table_name}.data_frame_type = '#{data_frame_type.name}'
+          AND #{block_type.table_name}.period_index IN (#{existing.keys.join(', ')});
+        SQL
+        )
       end
     end
-    def perform_insert(inserts)
-      ActiveDataFrame::DataFrameProxy.suppress_logs do
-        sql = "INSERT INTO #{block_type.table_name} (#{block_type::COLUMNS.join(',')}, data_frame_id, period_index, data_frame_type) VALUES #{inserts[0..-2]}"
-        Database.execute sql
+    def bulk_delete(id, indices)
+      block_type.where(data_frame_id: id, period_index: indices).delete_all
+    end
+    ##
+    # Insert block data for all blocks in a single call
+    ##
+    def bulk_insert(new_blocks, instance)
+      inserts = ''
+      new_blocks.each do |period_index, (values)|
+        inserts << \
+        case ActiveRecord::Base.connection_config[:adapter]
+        when 'postgresql', 'mysql2' then "(#{values.map{|v| v.inspect.gsub('"',"'") }.join(',')}, #{instance.id}, #{period_index}, '#{data_frame_type.name}'),"
+        else "(#{values.map{|v| v.inspect.gsub('"',"'") }.join(',')}, #{instance.id}, #{period_index}, '#{data_frame_type.name}'),"
+        end
       end
+      sql = "INSERT INTO #{block_type.table_name} (#{block_type::COLUMNS.join(',')}, data_frame_id, period_index, data_frame_type) VALUES #{inserts[0..-2]}"
+      Database.execute sql
     end
   end
 end

data/lib/active_data_frame/row.rb CHANGED

@@ -21,7 +21,6 @@ module ActiveDataFrame
       end
       deleted_indices = []
       existing = blocks_between([bounds]).pluck(:data_frame_id, :period_index, *block_type::COLUMNS).map do |id, period_index, *block_values|
         [period_index, [block_values, id]]
       end.to_h
@@ -31,7 +30,10 @@ module ActiveDataFrame
         if existing[index]
           block = existing[index]
           block.first[left..right] = chunk.to_a
-          deleted_indices << index if block.first.all?(&:zero?)
+          if block.first.all?(&:zero?)
+            deleted_indices << index
+            existing.delete(index)
+          end
         elsif chunk.any?(&:nonzero?)
           new_blocks[index].first[left..right] = chunk.to_a
         end
@@ -49,7 +51,9 @@ module ActiveDataFrame
         get_bounds(range.first, range.exclude_end? ? range.end - 1 : range.end, index)
       end
-      existing = blocks_between(all_bounds).pluck(:period_index, *block_type::COLUMNS).map{|pi, *values| [pi, values]}.to_h
+      existing = self.class.suppress_logs{
+        blocks_between(all_bounds).pluck(:period_index, *block_type::COLUMNS).map{|pi, *values| [pi, values]}.to_h
+      }
       result   = M.blank(typecode: block_type::TYPECODE, columns: all_bounds.map(&:length).sum)
       iterate_bounds(all_bounds) do |index, left, right, cursor, size|

data/lib/active_data_frame/table.rb CHANGED

@@ -42,7 +42,6 @@ module ActiveDataFrame
         col_cases = cases[col].sort_by(&:begin).reduce([]) do |agg, col_case|
           if agg.empty?
             agg << col_case
-            agg
           else
             if agg[-1].end.succ == col_case.begin
               agg[-1] = (agg[-1].begin..col_case.end)
@@ -96,9 +95,9 @@ module ActiveDataFrame
         ids = data_frame_type.pluck(:id)
         as_sql = blocks_between(
           all_bounds,
-          block_scope: data_frame_type.unscoped
-                                    .joins("LEFT JOIN #{block_type.table_name} ON #{data_frame_type.table_name}.id = #{block_type.table_name}.data_frame_id")
+          block_scope: data_frame_type.unscoped.where(
+            "#{data_frame_type.table_name}.id IN (SELECT id FROM (#{data_frame_type.select(:id).to_sql}) airport_ids)"
+          ).joins("LEFT JOIN #{block_type.table_name} ON #{data_frame_type.table_name}.id = #{block_type.table_name}.data_frame_id")
         ).where(
           block_type.table_name => {data_frame_type: data_frame_type.name }
         ).select(:period_index, :data_frame_id, *column_cases(case_map)).to_sql

data/lib/active_data_frame/version.rb CHANGED

@@ -1,3 +1,3 @@
 module ActiveDataFrame
-  VERSION = "0.1.3"
+  VERSION = "0.1.5"
 end

data/lib/generators/active_data_frame/USAGE ADDED

@@ -0,0 +1,20 @@
+Description:
+    Generate a new data frame type, and optionally inject it into models that have such a data frame
+Example:
+    # Generate a new MeterReading data frame type, with a block type of
+    # double and a block size of 48 data points
+    rails generate active_data_frame:install MeterReading double 48
+    # Generate a new Dimension data frame type, with a block type of
+    # float and a block size of 10 data points.
+    # Inject the data-type for use into the Iris model
+    rails generate active_data_frame:install Dimension float 10 Iris
+    #
+    # Generate a new status data frame type with an integer block type
+    #
+    rails generate active_data_frame:install Status integer

data/lib/generators/active_data_frame/install_generator.rb CHANGED

@@ -2,13 +2,11 @@ require 'rails/generators/active_record'
 module ActiveDataFrame
   class InstallGenerator < ActiveRecord::Generators::Base
-    desc "Generates a new data_frame type"
     STREAM_TYPES = %w(bit byte integer long float double)
     # Commandline options can be defined here using Thor-like options:
-    argument :type,    :type => :string, :default => 'float', :desc => "DataFrame type. One of(#{STREAM_TYPES*" ,"})"
-    argument :columns, :type => :numeric, :default => 512, :desc => "Number of columns"
-    argument :inject,     type: :array, default: []
+    argument :type,     type: :string,  default: 'float', desc: "DataFrame type. One of(#{STREAM_TYPES*" ,"})"
+    argument :columns,  type: :numeric, default: 512,     desc: "Number of columns"
+    argument :inject,   type: :array,   default: []
     def self.source_root
       @source_root ||= File.join(File.dirname(__FILE__), 'templates')

metadata CHANGED

@@ -1,14 +1,14 @@
 --- !ruby/object:Gem::Specification
 name: active_data_frame
 version: !ruby/object:Gem::Version
-  version: 0.1.3
+  version: 0.1.5
 platform: ruby
 authors:
 - Wouter Coppieters
 autorequire:
 bindir: exe
 cert_chain: []
-date: 2018-04-24 00:00:00.000000000 Z
+date: 2018-06-19 00:00:00.000000000 Z
 dependencies:
 - !ruby/object:Gem::Dependency
   name: bundler
@@ -188,20 +188,20 @@ dependencies:
     requirements:
     - - "~>"
       - !ruby/object:Gem::Version
-        version: 0.1.15
+        version: 0.1.17
     - - ">="
       - !ruby/object:Gem::Version
-        version: 0.1.15
+        version: 0.1.17
   type: :runtime
   prerelease: false
   version_requirements: !ruby/object:Gem::Requirement
     requirements:
     - - "~>"
       - !ruby/object:Gem::Version
-        version: 0.1.15
+        version: 0.1.17
     - - ">="
       - !ruby/object:Gem::Version
-        version: 0.1.15
+        version: 0.1.17
 description: An active data frame helper
 email:
 - wc@pico.net.nz
@@ -230,6 +230,7 @@ files:
 - lib/active_data_frame/row.rb
 - lib/active_data_frame/table.rb
 - lib/active_data_frame/version.rb
+- lib/generators/active_data_frame/USAGE
 - lib/generators/active_data_frame/install_generator.rb
 - lib/generators/active_data_frame/templates/has_concern.rb
 - lib/generators/active_data_frame/templates/migration.rb