RubyGems - red_amber - Versions diffs - 0.2.0 → 0.2.2 - Mend

red_amber 0.2.0 → 0.2.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (43) hide show

checksums.yaml +4 -4
data/.rubocop.yml +5 -0
data/CHANGELOG.md +125 -0
data/README.md +86 -269
data/doc/DataFrame.md +427 -281
data/doc/Vector.md +35 -54
data/doc/image/basic_verbs.png +0 -0
data/doc/image/dataframe/assign.png +0 -0
data/doc/image/dataframe/assign_operation.png +0 -0
data/doc/image/dataframe/drop.png +0 -0
data/doc/image/dataframe/pick.png +0 -0
data/doc/image/dataframe/pick_operation.png +0 -0
data/doc/image/dataframe/remove.png +0 -0
data/doc/image/dataframe/rename.png +0 -0
data/doc/image/dataframe/rename_operation.png +0 -0
data/doc/image/dataframe/reshaping_DataFrames.png +0 -0
data/doc/image/dataframe/slice.png +0 -0
data/doc/image/dataframe/slice_operation.png +0 -0
data/doc/image/dataframe_model.png +0 -0
data/doc/image/group_operation.png +0 -0
data/doc/image/replace-if_then.png +0 -0
data/doc/image/reshaping_dataframe.png +0 -0
data/doc/image/screenshot.png +0 -0
data/doc/image/vector/binary_element_wise.png +0 -0
data/doc/image/vector/unary_aggregation.png +0 -0
data/doc/image/vector/unary_aggregation_w_option.png +0 -0
data/doc/image/vector/unary_element_wise.png +0 -0
data/lib/red_amber/data_frame.rb +33 -41
data/lib/red_amber/data_frame_displayable.rb +59 -6
data/lib/red_amber/data_frame_loadsave.rb +36 -0
data/lib/red_amber/data_frame_reshaping.rb +12 -10
data/lib/red_amber/data_frame_selectable.rb +53 -9
data/lib/red_amber/data_frame_variable_operation.rb +57 -20
data/lib/red_amber/group.rb +5 -3
data/lib/red_amber/helper.rb +20 -18
data/lib/red_amber/vector.rb +50 -31
data/lib/red_amber/vector_functions.rb +21 -24
data/lib/red_amber/vector_selectable.rb +18 -9
data/lib/red_amber/vector_updatable.rb +6 -3
data/lib/red_amber/version.rb +1 -1
data/lib/red_amber.rb +1 -0
metadata +13 -3
data/doc/examples_of_red_amber.ipynb +0 -6783

checksums.yaml CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz: 73459d02c921fcb0fcb742760e8c882b5491fa5316a79b9016233a516ada013e
-  data.tar.gz: ac25e808c5e5d4c13bb1877659550bba532cb5778371e39dfa1f3b9e5a91a4f8
+  metadata.gz: a16699a945f41bf98790f698998126cc6b4a5e916eccb805e78448ec029f9310
+  data.tar.gz: 5e7fa732f64567fd85e5a74b046e80861824f13d15dc910278b6c62359db9a22
 SHA512:
-  metadata.gz: 1bfa4200d440c338f496fe282816634d6a833e30e17edc87a2cf5ec63866e2bbbaf8796916f1b052ea66482c54a038bbf1445258c2526691e42c2b47be2c39c5
-  data.tar.gz: e324e480e6086f7017de58201783c857825b79d0b2e2c8fa2636089cd1c5531e22905a3c0d860f26b833eb6add6ed6017497632bd1ea8fcb932c2d2233b11812
+  metadata.gz: 6ae7a6e3a8015b6b9736fb934526d9dc96b43830f0890ccbc16e175e539a8df1053432a63dde84a31dbd3a170aa6256b681127c510117723427bce815568c981
+  data.tar.gz: a0e7d86a7bdc6be7ec493ef5331ced5ecf4e6b89458f4252f208435905a7e4e80a088a718098073fb0c65c86d76297c70c978cd4dec28b1eb1a0d915bb7e3608

data/.rubocop.yml CHANGED Viewed

@@ -63,6 +63,7 @@ Metrics/AbcSize:
     - 'lib/red_amber/data_frame_displayable.rb' # Max: 55
     - 'lib/red_amber/data_frame_reshaping.rb' # Max 40.91
     - 'lib/red_amber/data_frame_selectable.rb' # Max: 51
+    - 'lib/red_amber/data_frame_variable_operation.rb' # Max: 30.15
     - 'lib/red_amber/vector_updatable.rb' # Max: 36
     - 'lib/red_amber/vector_selectable.rb' # Max: 33
@@ -86,6 +87,7 @@ Metrics/CyclomaticComplexity:
   Exclude:
     - 'lib/red_amber/data_frame_displayable.rb' # Max: 18
     - 'lib/red_amber/data_frame_selectable.rb' # Max: 14
+    - 'lib/red_amber/helper.rb' # Max: 15
     - 'lib/red_amber/vector_selectable.rb' # Max: 13
     - 'lib/red_amber/vector_updatable.rb' # Max: 14
@@ -94,6 +96,8 @@ Metrics/MethodLength:
   Max: 30
   Exclude:
     - 'lib/red_amber/data_frame_displayable.rb' # Max: 33
+    - 'lib/red_amber/data_frame_selectable.rb' # Max: 38
+    - 'lib/red_amber/data_frame_variable_operation.rb' # Max: 35
 # Max: 100
 Metrics/ModuleLength:
@@ -109,6 +113,7 @@ Metrics/PerceivedComplexity:
   Max: 13
   Exclude:
     - 'lib/red_amber/data_frame_selectable.rb' # Max: 14
+    - 'lib/red_amber/helper.rb' # Max: 15
     - 'lib/red_amber/vector_updatable.rb' # Max: 15
     - 'lib/red_amber/data_frame_displayable.rb' # Max: 19

data/CHANGELOG.md CHANGED Viewed

@@ -1,3 +1,121 @@
+## [0.2.2] - 2022-10-04
+- Bug fixes
+  - Return self when no replacement happen in Vector#replace. (#92)
+  - Limit n-digits in to_iruby. (#111)
+  - Fix displaying space in to_iruby. (#111)
+  - Raise error if key is duplicated. (#113)
+  - Fix DataFrame#pick/#drop with endless Range. (#113)
+  - Change type from dictionary to string in DataFrame reshaping methods. (#113)
+  - Fix arguments parser to accept Enumerator. (#114)
+- New features and improvements
+  - Support to make a data frame from a to_arrow-responsible object. (#106) [Patch by Kenta Murata]
+  - Introduce DataFrame#auto_cast (experimental feature) (#105)
+  - Change default name in DataFrame#transpose, #to_long, #to_wide. (#110)
+  - Add Vector#dictionary? method. (#113)
+  - Add display mode 'Plain' and 'Minimum'. (#113)
+  - Refactor code
+    - Refine test_vector_selectable. (#92)
+    - Refine test_vector_updatable. (#92)
+    - Refine Vector.new. (#113)
+    - Refine DataFrame#pick, #drop. (#113)
+  - Documents
+    - Update images. (#90, #105, #113)
+    - Update README to use simpler examples. (#112)
+      - Update README with a new screenshot example. (#113)
+  - GitHub site
+    - Update Jupyter notebooks in Binder (#88, #115)
+      - Move binder support to heronshoes/docker-stacks repository.
+      - Update README notebook on binder.
+      - Add examples_of_RedAmber notebook on binder.
+    - Start to use discussions.
+- Thanks
+  - Kenta Murata
+## [0.2.1] - 2022-09-07
+- Bug fixes
+  - Fix `Vector#each` with block (#66)
+    `Vector#each` will return value of each element with block.
+  - Fix table format at size == 9 (#67)
+  - Fix to support Vector in `DataFrame#assign` (#77)
+  - Add `assert_delta` functionality for `assert_with_NaN` (#78)
+  - Fix Vector#is_in when self is chunked (#79)
+  - Fix Array type error (uint/int) (#79)
+- New features and improvements
+  - Refine `DataFrame#indices` method (#67)
+  - Update DataFrame reshaping methods (#73)
+    - Change default option value of DataFrame reshaping
+    - Change the order of import_cars example
+  - Add `DataFrame#method_missing` to get column vector by method (#75)
+    - Add `DataFrame#method_missing` to get column (#75)
+  - Accept both args and block in `DataFrame#assign` (#75)
+  - Accept indices in `DataFrame#pick` and `DataFrame#drop` (#76)
+  - Add `DataFrame#slice_by` method (#77)
+  - Add new Vector functions (#78)
+    - Add inverse trigonometric function for Vector
+      - `acos`
+      - `asin`
+    - Add logarithmic function for Vector
+      - `ln`
+      - `log10`
+      - `log1p`
+      - `log2`
+    - Add binary function `Vector#logb`
+  - Docker image and Jupyter Notebook [Thanks to Kenta Murata]
+    - Add link to RubyData in README
+    - Add link to interactive README by Binder
+  - Update Jupyter Notebook `71 examples of RedAmber`
+- Thanks
+  - Kenta Murata
 ## [0.2.0] - 2022-08-15
 - Bump version up to 0.2.0
@@ -236,6 +354,13 @@
   - Documentation
     - Fix typo in DataFrame.md
+  - Github site
+    - Add gem and status badges in README. (#42) [Patch by kojix2]
+- Thanks
+  - kojix2
 ## [0.1.5] - 2022-06-12 (experimental)
 - Bug fixes

data/README.md CHANGED Viewed

@@ -2,12 +2,15 @@
 [![Gem Version](https://badge.fury.io/rb/red_amber.svg)](https://badge.fury.io/rb/red_amber)
 [![Ruby](https://github.com/heronshoes/red_amber/actions/workflows/test.yml/badge.svg)](https://github.com/heronshoes/red_amber/actions/workflows/test.yml)
+[![Discussions](https://img.shields.io/github/discussions/heronshoes/red_amber)](https://github.com/heronshoes/red_amber/discussions)
 A simple dataframe library for Ruby.
 - Powered by [Red Arrow](https://github.com/apache/arrow/tree/master/ruby/red-arrow) [![Gitter Chat](https://badges.gitter.im/red-data-tools/en.svg)](https://gitter.im/red-data-tools/en)
 - Inspired by the dataframe library [Rover-df](https://github.com/ankane/rover)
+![screenshot from jupyterlab](doc/image/screenshot.png)
 ## Requirements
 Supported Ruby version is >= 2.7.
@@ -53,328 +56,136 @@ Or install it yourself as:
 gem install red_amber
 ```
-## `RedAmber::DataFrame`
+## Docker image and Jupyter Notebook
-Represents a set of data in 2D-shape. The entity is a Red Arrow's Table object.
+[RubyData Docker Stacks](https://github.com/RubyData/docker-stacks) is available as a ready-to-run Docker image containing Jupyter and useful data tools as well as RedAmber (Thanks to @mrkn).
-```ruby
-require 'red_amber' # require 'red-amber' is also OK.
-require 'datasets-arrow'
+Also you can try the contents of this README interactively by [Binder](https://mybinder.org/v2/gh/heronshoes/docker-stacks/RedAmber-binder?filepath=README.ipynb).
+[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/heronshoes/docker-stacks/RedAmber-binder?filepath=red-amber.ipynb)
-arrow = Datasets::Penguins.new.to_arrow
-penguins = RedAmber::DataFrame.new(arrow)
-# =>
-#<RedAmber::DataFrame : 344 x 8 Vectors, 0x0000000000013790>
-    species  island    bill_length_mm bill_depth_mm flipper_length_mm ...     year
-    <string> <string>        <double>      <double>           <uint8> ... <uint16>
-  1 Adelie   Torgersen           39.1          18.7               181 ...     2007
-  2 Adelie   Torgersen           39.5          17.4               186 ...     2007
-  3 Adelie   Torgersen           40.3          18.0               195 ...     2007
-  4 Adelie   Torgersen          (nil)         (nil)             (nil) ...     2007
-  5 Adelie   Torgersen           36.7          19.3               193 ...     2007
-  : :        :                      :             :                 : ...        :
-342 Gentoo   Biscoe              50.4          15.7               222 ...     2009
-343 Gentoo   Biscoe              45.2          14.8               212 ...     2009
-344 Gentoo   Biscoe              49.9          16.1               213 ...     2009
-```
+## Data frame in `RedAmber`
-### DataFrame model
-![dataframe model of RedAmber](doc/image/dataframe_model.png)
-For example, `DataFrame#pick` accepts keys as an argument and returns a sub DataFrame.
+Class `RedAmber::DataFrame` represents a set of data in 2D-shape.
+The entity is a Red Arrow's Table object.
-![pick method image](doc/image/dataframe/pick.png)
-```ruby
-penguins.keys
-# =>
-[:species,
- :island,
- :bill_length_mm,
- :bill_depth_mm,
- :flipper_length_mm,
- :body_mass_g,
- :sex,
- :year]
-df = penguins.pick(:species, :island, :body_mass_g)
-df
-# =>
-#<RedAmber::DataFrame : 344 x 3 Vectors, 0x000000000003cc1c>
-    species  island    body_mass_g
-    <string> <string>     <uint16>
-  1 Adelie   Torgersen        3750
-  2 Adelie   Torgersen        3800
-  3 Adelie   Torgersen        3250
-  4 Adelie   Torgersen       (nil)
-  5 Adelie   Torgersen        3450
-  : :        :                   :
-342 Gentoo   Biscoe           5750
-343 Gentoo   Biscoe           5200
-344 Gentoo   Biscoe           5400
-```
-`DataFrame#drop` drops some columns to create a remainer DataFrame.
-![drop method image](doc/image/dataframe/drop.png)
+![dataframe model of RedAmber](doc/image/dataframe_model.png)
-You can specify by keys or a boolean array (same size as n_keys).
+Load the library.
 ```ruby
-# Same as df.drop(:species, :island)
-df = df.drop(true, true, false)
-# =>
-#<RedAmber::DataFrame : 344 x 1 Vector, 0x0000000000048760>
-    body_mass_g
-       <uint16>
-  1        3750
-  2        3800
-  3        3250
-  4       (nil)
-  5        3450
-  :           :
-342        5750
-343        5200
-344        5400
+require 'red_amber' # require 'red-amber' is also OK.
+include RedAmber
 ```
-Arrow data is immutable, so these methods always return an new object.
-`DataFrame#assign` creates new columns or update existing columns.
-![assign method image](doc/image/dataframe/assign.png)
+### Example: diamonds dataset
 ```ruby
-# New column is created because ':body_mass_kg' is a new key.
-df.assign(:body_mass_kg => df[:body_mass_g] / 1000.0)
+require 'datasets-arrow' # to load sample data
-# =>
-#<RedAmber::DataFrame : 344 x 2 Vectors, 0x00000000000212f0>
-    body_mass_g body_mass_kg
-       <uint16>     <double>
-  1        3750          3.8
-  2        3800          3.8
-  3        3250          3.3
-  4       (nil)        (nil)
-  5        3450          3.5
-  :           :            :
-342        5750          5.8
-343        5200          5.2
-344        5400          5.4
-```
-`DataFrame#slice` selects rows (observations) to create a sub DataFrame.
-![slice method image](doc/image/dataframe/slice.png)
-```ruby
-# returns 5 rows at the start and 5 rows from the end
-penguins.slice(0...5, -5..-1)
+dataset = Datasets::Diamonds.new
+diamonds = DataFrame.new(dataset) # from v0.2.2, should be `dataset.to_arrow` if older.
 # =>
-#<RedAmber::DataFrame : 10 x 8 Vectors, 0x0000000000042be4>
-   species  island    bill_length_mm bill_depth_mm flipper_length_mm ...     year
-   <string> <string>        <double>      <double>           <uint8> ... <uint16>
- 1 Adelie   Torgersen           39.1          18.7               181 ...     2007
- 2 Adelie   Torgersen           39.5          17.4               186 ...     2007
- 3 Adelie   Torgersen           40.3          18.0               195 ...     2007
- 4 Adelie   Torgersen          (nil)         (nil)             (nil) ...     2007
- 5 Adelie   Torgersen           36.7          19.3               193 ...     2007
- : :        :                      :             :                 : ...        :
- 8 Gentoo   Biscoe              50.4          15.7               222 ...     2009
- 9 Gentoo   Biscoe              45.2          14.8               212 ...     2009
-10 Gentoo   Biscoe              49.9          16.1               213 ...     2009
+#<RedAmber::DataFrame : 53940 x 10 Vectors, 0x000000000000f668>
+         carat cut       color    clarity     depth    table    price        x ...        z
+      <double> <string>  <string> <string> <double> <double> <uint16> <double> ... <double>
+    0     0.23 Ideal     E        SI2          61.5     55.0      326     3.95 ...     2.43
+    1     0.21 Premium   E        SI1          59.8     61.0      326     3.89 ...     2.31
+    2     0.23 Good      E        VS1          56.9     65.0      327     4.05 ...     2.31
+    3     0.29 Premium   I        VS2          62.4     58.0      334      4.2 ...     2.63
+    4     0.31 Good      J        SI2          63.3     58.0      335     4.34 ...     2.75
+    :        : :         :        :               :        :        :        : ...        :
+53937      0.7 Very Good D        SI1          62.8     60.0     2757     5.66 ...     3.56
+53938     0.86 Premium   H        SI2          61.0     58.0     2757     6.15 ...     3.74
+53939     0.75 Ideal     D        SI2          62.2     55.0     2757     5.83 ...     3.64
 ```
-`DataFrame#remove` rejects rows (observations) to create a remainer DataFrame.
-![remove method image](doc/image/dataframe/remove.png)
+For example, we can compute mean prices per 'cut' for the data larger than 1 carat.
 ```ruby
-# penguins[:bill_length_mm] < 40 returns a boolean Vector
-penguins.remove(penguins[:bill_length_mm] < 40)
+df = diamonds
+  .slice { carat > 1 }
+  .group(:cut)
+  .mean(:price) # `pick` prior to `group` is not required if `:price` is specified here.
+  .sort('-mean(price)')
 # =>
-#<RedAmber::DataFrame : 244 x 8 Vectors, 0x000000000007d6f4>
-    species  island    bill_length_mm bill_depth_mm flipper_length_mm ...     year
-    <string> <string>        <double>      <double>           <uint8> ... <uint16>
-  1 Adelie   Torgersen           40.3          18.0               195 ...     2007
-  2 Adelie   Torgersen          (nil)         (nil)             (nil) ...     2007
-  3 Adelie   Torgersen           42.0          20.2               190 ...     2007
-  4 Adelie   Torgersen           41.1          17.6               182 ...     2007
-  5 Adelie   Torgersen           42.5          20.7               197 ...     2007
-  : :        :                      :             :                 : ...        :
-242 Gentoo   Biscoe              50.4          15.7               222 ...     2009
-243 Gentoo   Biscoe              45.2          14.8               212 ...     2009
-244 Gentoo   Biscoe              49.9          16.1               213 ...     2009
+#<RedAmber::DataFrame : 5 x 2 Vectors, 0x000000000000f67c>
+  cut       mean(price)
+  <string>     <double>
+0 Ideal         8674.23
+1 Premium       8487.25
+2 Very Good     8340.55
+3 Good           7753.6
+4 Fair          7177.86
 ```
-DataFrame manipulating methods like `pick`, `drop`, `slice`, `remove`, `rename` and `assign` accept a block.
-This example is usage of block to update a column.
+Arrow data is immutable, so these methods always return new objects.
+Next example will rename a column and create a new column by simple calcuration.
 ```ruby
-df = RedAmber::DataFrame.new(
-  integer: [0, 1, 2, 3, nil],
-  float:   [0.0, 1.1,  2.2, Float::NAN, nil],
-  string:  ['A', 'B', 'C', 'D', nil],
-  boolean: [true, false, true, false, nil])
-df
+usdjpy = 110.0
-# =>
-#<RedAmber::DataFrame : 5 x 4 Vectors, 0x000000000003131c>
-  integer    float string   boolean
-  <uint8> <double> <string> <boolean>
-1       0      0.0 A        true
-2       1      1.1 B        false
-3       2      2.2 C        true
-4       3      NaN D        false
-5   (nil)    (nil) (nil)    (nil)
-df.assign do
-  vectors.select(&:float?).map { |v| [v.key, -v] }
-  # => returns [[:float], [-0.0, -1.1, -2.2, NAN, nil]]
-end
+df.rename('mean(price)': :mean_price_USD)
+  .assign(:mean_price_JPY) { mean_price_USD * usdjpy }
 # =>
-#<RedAmber::DataFrame : 5 x 3 Vectors, 0x00000000000e270c>
-    index    float string
-  <uint8> <double> <string>
-1       0     -0.0 A
-2       1     -1.1 B
-3       2     -2.2 C
-4       3      NaN D
-5   (nil)    (nil) (nil)
+#<RedAmber::DataFrame : 5 x 3 Vectors, 0x000000000000f71c>
+  cut       mean_price_USD mean_price_JPY
+  <string>        <double>       <double>
+0 Ideal            8674.23      954164.93
+1 Premium          8487.25      933597.34
+2 Very Good        8340.55      917460.37
+3 Good              7753.6      852896.11
+4 Fair             7177.86      789564.12
 ```
-Next example is to eliminate rows containing nil.
+### Example: starwars dataset
-```ruby
-# remove all observations containing nil
-nil_removed = penguins.remove { vectors.map(&:is_nil).reduce(&:|) }
-nil_removed.tdr
-# =>
-RedAmber::DataFrame : 342 x 8 Vectors
-Vectors : 5 numeric, 3 strings
-# key                type   level data_preview
-1 :species           string     3 {"Adelie"=>151, "Chinstrap"=>68, "Gentoo"=>123}
-2 :island            string     3 {"Torgersen"=>51, "Biscoe"=>167, "Dream"=>124}
-3 :bill_length_mm    double   164 [39.1, 39.5, 40.3, 36.7, 39.3, ... ]
-4 :bill_depth_mm     double    80 [18.7, 17.4, 18.0, 19.3, 20.6, ... ]
-5 :flipper_length_mm int64     55 [181, 186, 195, 193, 190, ... ]
-6 :body_mass_g       int64     94 [3750, 3800, 3250, 3450, 3650, ... ]
-7 :sex               string     3 {"male"=>168, "female"=>165, ""=>9}
-8 :year              int64      3 {2007=>109, 2008=>114, 2009=>119}
-```
-For this frequently needed task, we can do it much simpler.
+Next example is `starwars` dataset reading from the downloaded CSV file. Followed by minimum data cleansing.
 ```ruby
-penguins.remove_nil # => same result as above
-```
+uri = URI('https://vincentarelbundock.github.io/Rdatasets/csv/dplyr/starwars.csv')
-`DataFrame#summary` shows summary statistics in a DataFrame.
+starwars = DataFrame.load(uri)
-```ruby
-puts penguins.summary.to_s(width: 82)
-# =>
-  variables            count     mean      std      min      25%   median      75%      max
-  <dictionary>      <uint16> <double> <double> <double> <double> <double> <double> <double>
-1 bill_length_mm         342    43.92     5.46     32.1    39.23    44.38     48.5     59.6
-2 bill_depth_mm          342    17.15     1.97     13.1     15.6    17.32     18.7     21.5
-3 flipper_length_mm      342   200.92    14.06    172.0    190.0    197.0    213.0    231.0
-4 body_mass_g            342  4201.75   801.95   2700.0   3550.0   4031.5   4750.0   6300.0
-5 year                   344  2008.03     0.82   2007.0   2007.0   2008.0   2009.0   2009.0
-```
-`DataFrame#group` method can be used for the grouping tasks.
-```ruby
-starwars = RedAmber::DataFrame.load(URI("https://vincentarelbundock.github.io/Rdatasets/csv/dplyr/starwars.csv"))
 starwars
+  .drop(0) # delete unnecessary index column
+  .remove { species == "NA" } # delete unnecessary rows
+  .group(:species) { [count(:species), mean(:height, :mass)] }
+  .slice { count > 1 }
 # =>
-#<RedAmber::DataFrame : 87 x 12 Vectors, 0x000000000000607c>
-   unnamed1 name            height     mass hair_color skin_color  eye_color ... species
-    <int64> <string>       <int64> <double> <string>   <string>    <string>  ... <string>
- 1        1 Luke Skywalker     172     77.0 blond      fair        blue      ... Human
- 2        2 C-3PO              167     75.0 NA         gold        yellow    ... Droid
- 3        3 R2-D2               96     32.0 NA         white, blue red       ... Droid
- 4        4 Darth Vader        202    136.0 none       white       yellow    ... Human
- 5        5 Leia Organa        150     49.0 brown      light       brown     ... Human
- :        : :                    :        : :          :           :         ... :
-85       85 BB8              (nil)    (nil) none       none        black     ... Droid
-86       86 Captain Phasma   (nil)    (nil) unknown    unknown     unknown   ... NA
-87       87 Padmé Amidala      165     45.0 brown      light       brown     ... Human
-grouped = starwars.group(:species) { [count(:species), mean(:height, :mass)] }
-grouped.slice { v(:count) > 1 }
-# =>
-#<RedAmber::DataFrame : 9 x 4 Vectors, 0x000000000006e848>
+#<RedAmber::DataFrame : 8 x 4 Vectors, 0x000000000000f848>
   species    count mean(height) mean(mass)
   <string> <int64>     <double>   <double>
-1 Human         35        176.6       82.8
-2 Droid          6        131.2       69.8
-3 Wookiee        2        231.0      124.0
-4 Gungan         3        208.7       74.0
-5 NA             4        181.3       48.0
-: :              :            :          :
-7 Twi'lek        2        179.0       55.0
-8 Mirialan       2        168.0       53.1
-9 Kaminoan       2        221.0       88.0
+0 Human         35       176.65      82.78
+1 Droid          6        131.2      69.75
+2 Wookiee        2        231.0      124.0
+3 Gungan         3       208.67       74.0
+4 Zabrak         2        173.0       80.0
+5 Twi'lek        2        179.0       55.0
+6 Mirialan       2        168.0       53.1
+7 Kaminoan       2        221.0       88.0
 ```
 See [DataFrame.md](doc/DataFrame.md) for other examples and details.
-## `RedAmber::Vector`
+### `Vector` for 1D data object in column
 Class `RedAmber::Vector` represents a series of data in the DataFrame.
-Method `RedAmber::DataFrame#[key]` returns a Vector with the key `key`.
-```ruby
-penguins[:bill_length_mm]
-# =>
-#<RedAmber::Vector(:double, size=344):0x000000000000f8fc>
-[39.1, 39.5, 40.3, nil, 36.7, 39.3, 38.9, 39.2, 34.1, 42.0, 37.8, 37.8, 41.1, ... ]
-```
-Vectors accepts some [functional methods from Arrow](https://arrow.apache.org/docs/cpp/compute.html).
-This is an element-wise comparison and returns a boolean Vector of same size.
-![unary element-wise](doc/image/vector/unary_element_wise.png)
-```ruby
-penguins[:bill_length_mm] < 40
-# =>
-#<RedAmber::Vector(:boolean, size=344):0x000000000007e7ac>
-[true, true, false, nil, true, true, true, true, true, false, true, true, false, ... ]
-```
-Next example returns aggregated result.
-![unary aggregation](doc/image/vector/unary_aggregation.png)
-```ruby
-penguins[:bill_length_mm].mean
-43.92192982456141
-# =>
-```
 See [Vector.md](doc/Vector.md) for details.
 ## Jupyter notebook
-[61 Examples of Red Amber](doc/examples_of_red_amber.ipynb) shows more examples in jupyter notebook.
+[73 Examples of Red Amber](binder/examples_of_red_amber.ipynb) shows more examples in jupyter notebook.
+You can try this notebook on [Binder](https://mybinder.org/v2/gh/heronshoes/docker-stacks/RedAmber-binder?filepath=examples_of_red_amber.ipynb).
+[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/heronshoes/docker-stacks/RedAmber-binder?filepath=examples_of_red_amber.ipynb)
 ## Development
@@ -385,8 +196,14 @@ bundle install
 bundle exec rake test
 ```
+## Community
 I will appreciate if you could help to improve this project. Here are a few ways you can help:
+- Let's talk in the [discussions](https://github.com/heronshoes/red_amber/discussions). [![Discussions](https://img.shields.io/github/discussions/heronshoes/red_amber)](https://github.com/heronshoes/red_amber/discussions)
+  - Browse Q and A, how to use, tips, etc.
+  - Ask questions you’re wondering about.
+  - Share ideas. The idea may be promoted to issues or pull requests.
 - [Report bugs or suggest new features](https://github.com/heronshoes/red_amber/issues)
 - Fix bugs and [submit pull requests](https://github.com/heronshoes/red_amber/pulls)
 - Write, clarify, or fix documentation