statsample-ekatena 2.0.2

Sign up to get free protection for your applications and to get access to all the features.
Files changed (156) hide show
  1. checksums.yaml +7 -0
  2. data/.gitignore +15 -0
  3. data/.travis.yml +23 -0
  4. data/CONTRIBUTING.md +17 -0
  5. data/Gemfile +2 -0
  6. data/History.txt +457 -0
  7. data/LICENSE.txt +12 -0
  8. data/README.md +175 -0
  9. data/Rakefile +44 -0
  10. data/benchmarks/correlation_matrix_15_variables.rb +32 -0
  11. data/benchmarks/correlation_matrix_5_variables.rb +33 -0
  12. data/benchmarks/correlation_matrix_methods/correlation_matrix.ds +0 -0
  13. data/benchmarks/correlation_matrix_methods/correlation_matrix.html +93 -0
  14. data/benchmarks/correlation_matrix_methods/correlation_matrix.rb +71 -0
  15. data/benchmarks/correlation_matrix_methods/correlation_matrix.xls +0 -0
  16. data/benchmarks/correlation_matrix_methods/correlation_matrix_gsl_ruby.ods +0 -0
  17. data/benchmarks/correlation_matrix_methods/correlation_matrix_with_graphics.ods +0 -0
  18. data/benchmarks/correlation_matrix_methods/results.ds +0 -0
  19. data/benchmarks/factor_map.rb +37 -0
  20. data/benchmarks/helpers_benchmark.rb +5 -0
  21. data/data/locale/es/LC_MESSAGES/statsample.mo +0 -0
  22. data/doc_latex/manual/equations.tex +78 -0
  23. data/examples/boxplot.rb +28 -0
  24. data/examples/chisquare_test.rb +23 -0
  25. data/examples/correlation_matrix.rb +32 -0
  26. data/examples/dataset.rb +30 -0
  27. data/examples/dominance_analysis.rb +33 -0
  28. data/examples/dominance_analysis_bootstrap.rb +32 -0
  29. data/examples/histogram.rb +26 -0
  30. data/examples/icc.rb +24 -0
  31. data/examples/levene.rb +29 -0
  32. data/examples/multiple_regression.rb +20 -0
  33. data/examples/multivariate_correlation.rb +33 -0
  34. data/examples/parallel_analysis.rb +40 -0
  35. data/examples/polychoric.rb +40 -0
  36. data/examples/principal_axis.rb +26 -0
  37. data/examples/reliability.rb +31 -0
  38. data/examples/scatterplot.rb +25 -0
  39. data/examples/t_test.rb +27 -0
  40. data/examples/tetrachoric.rb +17 -0
  41. data/examples/u_test.rb +24 -0
  42. data/examples/vector.rb +20 -0
  43. data/examples/velicer_map_test.rb +46 -0
  44. data/grab_references.rb +29 -0
  45. data/lib/spss.rb +134 -0
  46. data/lib/statsample-ekatena/analysis.rb +100 -0
  47. data/lib/statsample-ekatena/analysis/suite.rb +89 -0
  48. data/lib/statsample-ekatena/analysis/suitereportbuilder.rb +44 -0
  49. data/lib/statsample-ekatena/anova.rb +24 -0
  50. data/lib/statsample-ekatena/anova/contrast.rb +79 -0
  51. data/lib/statsample-ekatena/anova/oneway.rb +187 -0
  52. data/lib/statsample-ekatena/anova/twoway.rb +207 -0
  53. data/lib/statsample-ekatena/bivariate.rb +406 -0
  54. data/lib/statsample-ekatena/bivariate/pearson.rb +54 -0
  55. data/lib/statsample-ekatena/codification.rb +182 -0
  56. data/lib/statsample-ekatena/converter/csv.rb +28 -0
  57. data/lib/statsample-ekatena/converter/spss.rb +48 -0
  58. data/lib/statsample-ekatena/converters.rb +211 -0
  59. data/lib/statsample-ekatena/crosstab.rb +188 -0
  60. data/lib/statsample-ekatena/daru.rb +115 -0
  61. data/lib/statsample-ekatena/dataset.rb +10 -0
  62. data/lib/statsample-ekatena/dominanceanalysis.rb +425 -0
  63. data/lib/statsample-ekatena/dominanceanalysis/bootstrap.rb +232 -0
  64. data/lib/statsample-ekatena/factor.rb +104 -0
  65. data/lib/statsample-ekatena/factor/map.rb +124 -0
  66. data/lib/statsample-ekatena/factor/parallelanalysis.rb +166 -0
  67. data/lib/statsample-ekatena/factor/pca.rb +242 -0
  68. data/lib/statsample-ekatena/factor/principalaxis.rb +243 -0
  69. data/lib/statsample-ekatena/factor/rotation.rb +198 -0
  70. data/lib/statsample-ekatena/formula/fit_model.rb +46 -0
  71. data/lib/statsample-ekatena/formula/formula.rb +306 -0
  72. data/lib/statsample-ekatena/graph.rb +11 -0
  73. data/lib/statsample-ekatena/graph/boxplot.rb +236 -0
  74. data/lib/statsample-ekatena/graph/histogram.rb +198 -0
  75. data/lib/statsample-ekatena/graph/scatterplot.rb +213 -0
  76. data/lib/statsample-ekatena/histogram.rb +180 -0
  77. data/lib/statsample-ekatena/matrix.rb +329 -0
  78. data/lib/statsample-ekatena/multiset.rb +310 -0
  79. data/lib/statsample-ekatena/regression.rb +65 -0
  80. data/lib/statsample-ekatena/regression/multiple.rb +89 -0
  81. data/lib/statsample-ekatena/regression/multiple/alglibengine.rb +128 -0
  82. data/lib/statsample-ekatena/regression/multiple/baseengine.rb +251 -0
  83. data/lib/statsample-ekatena/regression/multiple/gslengine.rb +129 -0
  84. data/lib/statsample-ekatena/regression/multiple/matrixengine.rb +205 -0
  85. data/lib/statsample-ekatena/regression/multiple/rubyengine.rb +86 -0
  86. data/lib/statsample-ekatena/regression/simple.rb +121 -0
  87. data/lib/statsample-ekatena/reliability.rb +150 -0
  88. data/lib/statsample-ekatena/reliability/icc.rb +415 -0
  89. data/lib/statsample-ekatena/reliability/multiscaleanalysis.rb +181 -0
  90. data/lib/statsample-ekatena/reliability/scaleanalysis.rb +233 -0
  91. data/lib/statsample-ekatena/reliability/skillscaleanalysis.rb +114 -0
  92. data/lib/statsample-ekatena/resample.rb +15 -0
  93. data/lib/statsample-ekatena/shorthand.rb +125 -0
  94. data/lib/statsample-ekatena/srs.rb +169 -0
  95. data/lib/statsample-ekatena/test.rb +82 -0
  96. data/lib/statsample-ekatena/test/bartlettsphericity.rb +45 -0
  97. data/lib/statsample-ekatena/test/chisquare.rb +73 -0
  98. data/lib/statsample-ekatena/test/f.rb +52 -0
  99. data/lib/statsample-ekatena/test/kolmogorovsmirnov.rb +63 -0
  100. data/lib/statsample-ekatena/test/levene.rb +88 -0
  101. data/lib/statsample-ekatena/test/t.rb +309 -0
  102. data/lib/statsample-ekatena/test/umannwhitney.rb +208 -0
  103. data/lib/statsample-ekatena/test/wilcoxonsignedrank.rb +90 -0
  104. data/lib/statsample-ekatena/vector.rb +19 -0
  105. data/lib/statsample-ekatena/version.rb +3 -0
  106. data/lib/statsample.rb +282 -0
  107. data/po/es/statsample.mo +0 -0
  108. data/po/es/statsample.po +959 -0
  109. data/po/statsample.pot +947 -0
  110. data/references.txt +24 -0
  111. data/statsample-ekatena.gemspec +49 -0
  112. data/test/fixtures/bank2.dat +200 -0
  113. data/test/fixtures/correlation_matrix.rb +17 -0
  114. data/test/fixtures/df.csv +15 -0
  115. data/test/fixtures/hartman_23.matrix +9 -0
  116. data/test/fixtures/stock_data.csv +500 -0
  117. data/test/fixtures/tetmat_matrix.txt +5 -0
  118. data/test/fixtures/tetmat_test.txt +1001 -0
  119. data/test/helpers_tests.rb +83 -0
  120. data/test/test_analysis.rb +176 -0
  121. data/test/test_anova_contrast.rb +36 -0
  122. data/test/test_anovaoneway.rb +26 -0
  123. data/test/test_anovatwoway.rb +37 -0
  124. data/test/test_anovatwowaywithdataset.rb +47 -0
  125. data/test/test_anovawithvectors.rb +102 -0
  126. data/test/test_awesome_print_bug.rb +16 -0
  127. data/test/test_bartlettsphericity.rb +25 -0
  128. data/test/test_bivariate.rb +164 -0
  129. data/test/test_codification.rb +78 -0
  130. data/test/test_crosstab.rb +67 -0
  131. data/test/test_dominance_analysis.rb +39 -0
  132. data/test/test_factor.rb +228 -0
  133. data/test/test_factor_map.rb +38 -0
  134. data/test/test_factor_pa.rb +56 -0
  135. data/test/test_fit_model.rb +88 -0
  136. data/test/test_ggobi.rb +35 -0
  137. data/test/test_gsl.rb +15 -0
  138. data/test/test_histogram.rb +109 -0
  139. data/test/test_matrix.rb +48 -0
  140. data/test/test_multiset.rb +176 -0
  141. data/test/test_regression.rb +231 -0
  142. data/test/test_reliability.rb +223 -0
  143. data/test/test_reliability_icc.rb +198 -0
  144. data/test/test_reliability_skillscale.rb +57 -0
  145. data/test/test_resample.rb +24 -0
  146. data/test/test_srs.rb +9 -0
  147. data/test/test_statistics.rb +69 -0
  148. data/test/test_stest.rb +69 -0
  149. data/test/test_stratified.rb +17 -0
  150. data/test/test_test_f.rb +33 -0
  151. data/test/test_test_kolmogorovsmirnov.rb +34 -0
  152. data/test/test_test_t.rb +62 -0
  153. data/test/test_umannwhitney.rb +27 -0
  154. data/test/test_vector.rb +12 -0
  155. data/test/test_wilcoxonsignedrank.rb +64 -0
  156. metadata +570 -0
@@ -0,0 +1,7 @@
1
+ ---
2
+ SHA256:
3
+ metadata.gz: a9078d39532535ce985e33d0166e2b64a610e470556f94f83bc0839b1731c9ed
4
+ data.tar.gz: 7f273247ae596db0b832aafe329434ee752b3958e88f87e914c66a61e7de1c10
5
+ SHA512:
6
+ metadata.gz: 2e0b67eef26c46edc3823432a37a479ef1d5813aab2a3a758174b27b53a226ccf161d82c22fd8869b930d003e70ad0035ae0648447feca8cab0920d0ec911c2f
7
+ data.tar.gz: 04bb9c2fbaba94d547cb22e16aea3d2147a959fb86cb1ac28afa7cb81318c26462ccec5be6b2c3bab5b550962cb0a4b98de9440119b55ec1b66f3b21faf4c0fb
@@ -0,0 +1,15 @@
1
+ Gemfile.lock
2
+ doc.yaml
3
+ *.swp
4
+ *.rbc
5
+ coverage
6
+ *~
7
+ agregar_adsense_a_doc.rb
8
+ pkg
9
+ doc
10
+ .yardoc
11
+ examples/images/*
12
+ examples/*.html
13
+ web/upload_task.rb
14
+ .idea
15
+ *.gem
@@ -0,0 +1,23 @@
1
+ language:
2
+ ruby
3
+
4
+ rvm:
5
+ - '2.0'
6
+ - '2.1'
7
+ - '2.2'
8
+ - '2.3.0'
9
+
10
+ matrix:
11
+ fast_finish:
12
+ true
13
+
14
+ script: "bundle exec rake test"
15
+
16
+ install:
17
+ - gem install bundler
18
+ - bundle install
19
+
20
+ before_install:
21
+ - sudo apt-get update -qq
22
+ - sudo apt-get install -y libgsl0-dev r-base r-base-dev
23
+ - sudo Rscript -e "install.packages(c('Rserve','irr'),,'http://cran.us.r-project.org')"
@@ -0,0 +1,17 @@
1
+ # Contributing guide
2
+
3
+ ## Installing statsample development dependencies
4
+
5
+ Keep in mind that either nmatrix OR rb-gsl are NOT NECESSARY for using statsample. They are just required for an optional speed up.
6
+
7
+ Statsample also works with [rb-gsl](https://github.com/sciruby/rb-gsl).
8
+
9
+ Install dependencies:
10
+
11
+ `bundle install`
12
+
13
+ And run the test suite (should be all green):
14
+
15
+ `bundle exec rake test`
16
+
17
+ If you have problems installing nmatrix, please consult the [nmatrix installation wiki](https://github.com/SciRuby/nmatrix/wiki/Installation) or the [mailing list](https://groups.google.com/forum/#!forum/sciruby-dev).
data/Gemfile ADDED
@@ -0,0 +1,2 @@
1
+ source "https://www.rubygems.org"
2
+ gemspec
@@ -0,0 +1,457 @@
1
+ === 2.0.2 / 2016-03-11
2
+ * Update dependencies (spreadsheet, GSL)
3
+
4
+ === 2.0.1 / 2015-08-19
5
+ * Cleaned legacy containers in favor of `Daru::DataFrame` and `Daru::Vector`.
6
+
7
+ === 2.0.0 / 2015-06-20
8
+ * Added dependency on daru and replaced Statsample::Vector and Dataset with
9
+ Daru::Vector and Daru::DataFrame.
10
+ * NMatrix and gsl-nmatrix are used as development dependencies.
11
+
12
+ === 1.5.0 / 2015-06-11
13
+ * Made sure all methods work properly with and without GSL.
14
+ * Statsample works with either rb-gsl or gsl-nmatrix.
15
+ * Changed the data types of Statsample::Vector from :ordinal, :scale and
16
+ :nominal to only :numeric and :object. :numeric replaces :ordinal/:scale
17
+ and :object replaces :nominal. Methods for creating the older data types still
18
+ exist, but throw a warning prodding the user to use the new methods.
19
+
20
+ === 1.4.3 / 2015-04-27
21
+ * Removed rb-gsl dependency.
22
+
23
+ === 1.4.2 / 2015-04-07
24
+ * Statsample::CSV.read accepts numbers in scientific notation.
25
+ * Test on Ruby 2.2 via Travis CI.
26
+
27
+ === 1.4.1 / 2015-03-26
28
+ * Removed Hoe gem in order to use `statsample.gemspec`.
29
+ * Improved readability of some files by using rubocop.
30
+ * Removed a bad check in `cronbach_alpha` (#10).
31
+
32
+ === 1.4.0 / 2014-10-11
33
+ * Replaced README.txt for README.md
34
+ * Replace File.exists? for File.exist?
35
+ + New Dataset.join to join two dataset based on some fields
36
+ * Deleted MLE based regression (Probit and logistic). Now all GML methods are on statsample-glm
37
+
38
+ === 1.3.1 / 2014-06-26
39
+
40
+ * Example referred to a SimpleRegression class which doesn't exist. Updated to working example.
41
+ * Merge pull request #15 from Blahah/patch-1
42
+ * Updated Gemfile
43
+ * Updated README.txt for v1.3.0
44
+ * Updated to ruby 2.1.0
45
+
46
+ === 1.3.0 / 2013-09-19
47
+
48
+ * Merge remote-tracking branch 'vpereira/master' into vpereira
49
+ * New Wilcoxon Signed Rank test
50
+ * Remove TimeSeries class. Now is available on gem "bio-statsample-timeseries" [GSOC 2013 project :) ]
51
+ * Update shoulda support
52
+ * added Bundle depds
53
+ * improved the csv read method (requires tests)
54
+ * open svg on mac osx
55
+
56
+ === 1.2.0 / 2011-12-15
57
+
58
+ * Added support for time series (TimeSeries object): MA, EMA, MACD, acf, lag and delta. [Rob Britton]
59
+ * Changed summary attribute to properly display 'b' value for simple linear regression [hstove]
60
+ * Merge pull request #6 from hstove/patch-1Changed summary attribute to properly display 'b' value for simple linear regression [Claudio Bustos]
61
+ * fix example code for CovariateMatrix [James Kebinger]
62
+
63
+ === 1.1.0 / 2011-06-02
64
+
65
+ * New Statsample::Anova::Contrast
66
+ * Jacknife and bootstrap for Vector. Thanks to John Firebaugh for the idea
67
+ * Improved Statsample::Analysis API
68
+ * Updated CSV.read. Third argument is a Hash with options to CSV class
69
+ * Added restriction on Statsample::Excel.read
70
+ * Updated spanish po
71
+ * Better summary for Vector
72
+ * Improving summary of t related test (confidence interval and estimate output)
73
+ * Replaced c for vector on Statsample::Analysis examples
74
+ * Added Vector#median_absolute_deviation
75
+ * First implementation of Kolmogorov Smirnov test. Returns correct D value, but without Kolmogorov distribution isn't very useful.
76
+
77
+ === 1.0.1 / 2011-01-28
78
+
79
+ * Updated spanish po.
80
+ * Update distribution gem dependence. On Ruby 1.8.7, distribution 0.2.0 raises an error.
81
+
82
+ === 1.0.0 / 2011-01-27
83
+
84
+ * Added Statsample::Analysis, a beautiful DSL to perform fast statistical analysis using statsample. See directory /examples
85
+ * Created benchmarks directory
86
+ * Removed Distribution module from statsample and moved to a gem. Changes on code to reflect new API
87
+ * Optimized simple regression. Better library detection
88
+ * New 'should_with_gsl' to test methods with gsl. Refactored Factor::MAP
89
+ * Almost complete GSL cleanup on Vector
90
+ * Updated some doc on Vector
91
+ * Used GSL::Matrix on Factor classes when available
92
+ * SkillScaleAnalysis doesn't crash with one or more vectors with 0 variance
93
+ * Modified examples using Statsample::Analysis
94
+ * Simplified eigen calculations
95
+ * Updated some examples. Added correlation matrix speed suite
96
+ * Correlation matrix optimized. Better specs
97
+ * Optimized correlation matrix. Use gsl matrix algebra or pairwise correlations depending on empiric calculated equations. See benchmarks/correlation_matrix.rb to see implementation of calculation
98
+ * Moved tests fixtures from data to test/fixtures
99
+ * Fixed some errors on tests
100
+ * Bug fix: constant_se on binomial regression have an error
101
+ * All test should work on ruby 1.9.3
102
+ * New Vector.[] and Vector.new_scale
103
+ * Detect linearly dependent predictors on OLS.
104
+
105
+ === 0.18.0 / 2011-01-07
106
+ * New Statsample.load_excel
107
+ * New Statsample.load_csv
108
+ * Statsample::Dataset#[] accepts an array of fields and uses clone
109
+ * New Dataset#correlation_matrix and Statsample::Dataset#covariance_matrix
110
+ * Statsample::Dataset.filter add labels to vectors
111
+ * Principal Components generation complete on PCA (covariance matrix prefered)
112
+ * Added note on Statsample::Factor::PCA about erratic signs on eigenvalues,
113
+ * Statsample::Factor::PCA.component_matrix calculated different for covariance matrix
114
+ * Improved summary for PCA using covariance matrix
115
+ * New attribute :label_angle for Statsample::Graph::Boxplot
116
+ * Fixed Scatterplots scaling problems
117
+ * New attributes for Scatterplots: groups, minimum_x, minimum_y, maximum_x,
118
+ * New Statsample::Multiset#union allows to create a new dataset based on a m
119
+ * New Statsample::Multiset#each to traverse through datasets
120
+ * Bug fix: Vector#standarized and Vector#percentile crash on nil data
121
+ * Bug fix: Vector#mean and Vector#sd crash on data without valid values
122
+ * Modified methods names on Statsample::Factor::PCA : feature_vector to feature_matrix, data_transformation to principal_components
123
+ * Added Statsample::Vector.vector_centered
124
+ * Factor::MAP.with_dataset() implemented
125
+ * Bug fix: Factor::MAP with correlation matrix with non-real eigenvalues crashes * Added documentation for Graph::Histogram
126
+ * Added MPA to Reliability::MultiScaleAnalysis
127
+ * Added custom names for returned vectors and datasets
128
+ * Updated spanish traslation
129
+ * Graph::Histogram updated. Custom x and y max and min, optional normal distribution drawing
130
+ * Updated Histogram class, with several new methods compatibles with GSL::Histogram
131
+
132
+ === 0.17.0 / 2010-12-09
133
+ * Added Statsample::Graph::Histogram and Statsample::Graph::Boxplot
134
+ * Added Statsample::Reliability::SkillScaleAnalysis for analysis of skill based scales.
135
+ * Delete combination and permutation clases. Backport for ruby 1.8.7 widely available
136
+ * Deleted unused variables (thanks, ruby-head)
137
+
138
+ === 0.16.0 / 2010-11-13
139
+ * Works on ruby 1.9.2 and HEAD. Updated Rakefile and manifest
140
+ * Removed all graph based on Svg::Graph.
141
+ * First operative version of Graph with Rubyvis
142
+ * Corrected bug on Distribution::Normal.cdf.
143
+ * Added reference on references.txt
144
+ * Ruby-based random gaussian distribution generator when gsl not available
145
+ * Added population average deviation [Al Chou]
146
+
147
+ === 0.15.1 / 2010-10-20
148
+ * Statsample::Excel and Statsample::PlainText add name to vectors equal to field name
149
+ * Statsample::Dataset.delete_vector accept multiple fields.
150
+ * Statsample::Dataset.dup_only_valid allows duplication of specific fields
151
+ * ScaleAnalysis doesn't crash on one-item scales
152
+ * Updated references
153
+
154
+ === 0.15.0 / 2010-09-07
155
+ * Added class Statsample::Reliability::ICC for calculation of Intra-class correlation (Shrout & Fleiss, 1979; McGraw & Wong, 1996). Tested with SPSS and R values.
156
+ * References: Updated and standarized references on many classes. Added grab_references.rb script, to create a list of references for library
157
+ * Added Spearman-Brown prophecy on Reliability module
158
+ * Distribution::F uses Gsl when available
159
+ * Added mean r.p.b. and item sd on Scale Analysis
160
+ * Corrected bug on Vector.ary_method and example of Anova Two Way using vector.
161
+
162
+
163
+ === 0.14.1 / 2010-08-18
164
+
165
+ * Added extra information on $DEBUG=true.
166
+ * Changed ParallelAnalysis: with_random_data parameters, bootstrap_method options are data and random, resolve bug related to number of factors to preserve, resolved bug related to original eigenvalues, can support failed bootstrap of data for Tetrachoric correlation.
167
+ * Optimized eigenpairs on Matrix when GSL is available.
168
+ * Added test for parallel analysis using data bootstraping
169
+ * Updated .pot and Manifest.txt
170
+ * Added test for kmo(global and univariate), bartlett and anti-image. Kmo and Bartlett have test based on Dziuban and Shirkey with correct results
171
+ * Complete set of test to test if a correlation matrix is appropriate for factor analysis: test of sphericity, KMO and anti-image (see Dziuban and Shirkey, 1974)
172
+ * Updated Parallel Analysis to work on Principal Axis Analysis based on O'Connors formulae
173
+ * Added reference for Statsample::Factor::MAP
174
+
175
+ === 0.14.0 / 2010-08-16
176
+ * Added Statsample::Factor::MAP, to execute Velicer's (1976) MAP to determine the number of factors to retain on EFA
177
+ * Bug fix on test suite on Ruby 1.8.7
178
+ * Horn's Parallel Analysis operational and tested for pure random data
179
+ * Fixed bug on Excel writer on Ruby1.9 (frozen string on header raises an error).
180
+ * Extra information on Factorial Analysis on summaries
181
+ * Fixed bug on Factor::Rotation when used ::Matrix without field method.
182
+ * Added Vector#vector_percentil method
183
+ * Summaries for PCA, Rotation, MultiScale and ScaleAnalysis created or improved.
184
+ * Factor::PCA could have rotation and parallel analysis on summary.
185
+ * Cronbach's alpha from covariance matrix raise an error on size<2
186
+ * MultiScaleAnalysis could have Parallel Analysis on summary.
187
+ * Added Chi Square test
188
+ * Added new information on README.txt
189
+
190
+ === 0.13.1 / 2010-07-03
191
+
192
+ * Rserve extensions for dataset and vector operational
193
+ * On x86_64, variance from gsl is not exactly equal to sum of variance-covariance on Statsample::Reliability::Scale, but in delta 1e-10
194
+ * Updated README.txt
195
+ * Reliability::ScaleAnalysis uses covariance matrix for 'if deleted' calculations to optimize memory and speed. Test for 'if deleted' statistics
196
+ * More string translated. Added dependency on tetrachoric on parallel analysis
197
+
198
+ === 0.13.0 / 2010-06-13
199
+
200
+ * Polychoric and Tetrachoric moved to gem statsample-bivariate-extension
201
+ * All classes left with summary method include Summarizable now. Every method which return localizable string is now parsed with _()
202
+ * Correct implementation of Reliability::MultiScaleAnalysis.
203
+ * Spanish translation for Mann-Whitney's U
204
+ * Added example for Mann-Whitney's U test
205
+ * Better summary for Mann-Whitney's U Test
206
+ * Added Statsample::Bivariate::Pearson class to retrieve complete analysis for r correlations
207
+ * Bug fix on DominanceAnalysis::Bootstrap
208
+
209
+ === 0.12.0 / 2010-06-09
210
+
211
+ * Modified Rakefile to remove dependencies based on C extensions. These are moved to statsample-optimization
212
+ * T test with unequal variance fixed on i686
213
+ * API Change: Renamed Reliability::ItemAnalysis and moved to independent file
214
+ * New Reliability::MultiScaleAnalysis for easy analysis of scales on a same survey, includind reliability, correlation matrix and Factor Analysis
215
+ * Updated README to reflect changes on Reliability module
216
+ * SvgGraph works with reportbuilder.
217
+ * Added methods on Polychoric based on Olsson(1979): the idea is estimate using second derivatives.
218
+ * Distribution test changed (reduced precision on 32 bits system
219
+
220
+ === 0.11.2 / 2010-05-05
221
+ * Updated dependency for 'extendedmatrix' to 0.2 (Matrix#build method)
222
+
223
+ === 0.11.1 / 2010-05-04
224
+ * Removed Matrix almost all Matrix extensions and replaced by dependency on 'extendmatrix' gem
225
+ * Added dependency to gsl >=1.12.109. Polychoric with joint method fails without this explicit dependency
226
+ === 0.11.0 / 2010-04-16
227
+ <b>New features:</b>
228
+ * Added Statsample::Anova::TwoWay and Statsample::Anova::TwoWayWithVectors
229
+ * Added Statsample.clone_only valid and Statsample::Dataset.clone_only_valid, for cheap copy on already clean vectors
230
+ <b>Optimizations and bug fix</b>
231
+ * Removed library statistics2 from package. Used gem statistics2 instead, because have a extension version
232
+ * Added example for Reliability class
233
+ * Bug fix on Statsample::DominanceAnalysis
234
+
235
+ === 0.10.0 / 2010-04-13
236
+
237
+ <b>API modifications</b>
238
+ * Refactoring of Statsample::Anova module.
239
+ * Statsample::Anova::OneWay :implementation of generic ANOVA One-Way, used by Multiple Regression, for example.
240
+ * Statsample::Anova::OneWayWithVectors: implementation of ANOVA One-Way to test differences of means.
241
+
242
+ <b>New features</b>
243
+ * New Statsample::Factor::Parallel Analysis, to performs Horn's 'parallel analysis' to a PCA, to adjust for sample bias on retention of components.
244
+ * New Statsample.only_valid_clone and Statsample::Dataset.clone, which allows to create shallow copys of valid vector and datasets. Used by correlation matrix methods to optimize calculations
245
+ * New module Statsample::Summarizable, which add GetText and ReportBuilder support to classes. Better summaries for Vector, Dataset, Crosstab, PrincipalAxis, PCA and Regression::Multiple classes
246
+
247
+ <b>Optimizations and bug fix</b>
248
+
249
+ * Refactoring of Statsample::Regression::Multiple classes. Still needs works
250
+ * Bug fix on Statsample::Factor::PCA and Statsample::Factor::PrincipalAxis
251
+ * Bug fix on Statsample::Bivariate::Polychoric.new_with_vectors. Should be defined class method, no instance method.
252
+ * Optimized correlation and covariance matrix. Only calculates the half of matrix and the other half is returned from cache
253
+ * More tests coverage. RCOV Total: 82.51% , Code: 77.83%
254
+
255
+ === 0.9.0 / 2010-04-04
256
+ * New Statsample::Test::F. Anova::OneWay subclasses it and Regression classes uses it.
257
+ === 0.8.2 / 2010-04-01
258
+ * Statsample::PromiseAfter replaced by external package DirtyMemoize [http://rubygems.org/gems/dirty-memoize]
259
+ === 0.8.1 / 2010-03-29
260
+ * Fixed Regression summaries
261
+ === 0.8.0 / 2010-03-29
262
+ * New Statsample::Test::T module, with classes and methods to do Student's t tests for one and two samples.
263
+ * Statsample::PromiseAfter module to set a number of variables without explicitly call the compute or iterate method
264
+ * All tests ported to MiniUnit
265
+ * Directory 'demo' renamed to 'examples'
266
+ * Bug fix on report_building on Statsample::Regression::Multiple classes
267
+
268
+ === 0.7.0 / 2010-03-25
269
+ * Ported to ReportBuilder 1.x series
270
+ * Implementation of ruby based covariance and correlation changed to a clearer code
271
+ * Statsample::Vector#svggraph_frequencies accepts IO
272
+ * Some test ported to Miniunit
273
+ * CSV on Ruby1.8 uses FasterCSV
274
+
275
+ === 0.6.7 / 2010-03-23
276
+ * Bug fix: dependency on ReportBuilder should be set to "~>0.2.0", not "0.2"
277
+ === 0.6.6 / 2010-03-22
278
+ * Set ReportBuilder dependency to '0.2.~' version, because future API break
279
+ * Removed Alglib dependency
280
+ * Factor::PrincipalAxis and Factor::PCA reworked
281
+ * Standarization of documentation on almost every file
282
+ * New Statsample::Test::Levene, to test equality of variances
283
+ * Constant HAS_GSL replaced by Statsample.has_gsl?
284
+ * PCA and Principal Axis test based on R and SPSS results
285
+ * Bug fix on test_dataset.rb / test_saveload
286
+ * Added Rakefile
287
+ * Demos for levene, Principal Axis
288
+
289
+ === 0.6.5 / 2010-02-24
290
+
291
+ * Bug fix on test: Use tempfile instead of tempdir
292
+ * Multiple Regression: Calculation of constant standard error , using covariance matrix.
293
+ * Calculation of R^2_yx and P^2_yx for Regresion on Multiple Dependents variables
294
+ * Dominance Analysis could use Correlation or Covariance Matrix as input.
295
+ * Dominance Analysis extension to multiple dependent variables (Azen & Budescu, 2006)
296
+ * Two-step estimate of Polychoric correlation uses minimization gem, so could be executed without rb-gsl
297
+
298
+
299
+ === 0.6.4 / 2010-02-19
300
+ * Dominance Analysis and Dominance Analysis Bootstrap allows multivariate dependent analysis.
301
+ * Test suite for Dominance Analysis, using Azen and Budescu papers as references
302
+ * X^2 for polychoric correlation
303
+
304
+ === 0.6.3 / 2010-02-15
305
+ * Statsample::Bivariate::Polychoric have joint estimation.
306
+ * Some extra documentation and bug fixs
307
+
308
+ === 0.6.2 / 2010-02-11
309
+ * New Statsample::Bivariate::Polychoric. For implement: X2 and G2
310
+ * New matrix.rb, for faster development of Contingence Tables and Correlation Matrix
311
+
312
+ === 0.6.1 / 2010-02-08
313
+ * Bug fix on DominanceAnalysis summary for Ruby1.9
314
+ * Some extra documentation
315
+ === 0.6.0 / 2010-02-05
316
+ * New Statsample::Factor module. Include classes for extracting factors (Statsample::Factor::PCA and Statsample::Factor::PrincipalAxis) and rotate component matrix ( Statsample::Factor::Rotation subclasses). For now, only orthogonal rotations
317
+ * New Statsample::Dataset.crosstab_with_asignation, Statsample::Dataset.one_to_many
318
+ * New class Statsample::Permutation to produce permutations of a given array
319
+ * New class Statsample::Histogram, with same interface as GSL one
320
+ * New class Statsample::Test::UMannWhitney, to perform Mann-Whitney's U test. Gives z based and exact calculation of probability
321
+ * Improved support for ReportBuilder
322
+ * Statsample::Codification module reworked
323
+ * Fixed bugs on Dominance Analysis classes
324
+ * Fixed bugs on Statsample::Vector.kurtosis and Statsample::Vector.skew
325
+
326
+ === 0.5.1 / 2009-10-06
327
+
328
+ * New class Statsample::Bivariate::Tetrachoric, for calculation of tetrachoric correlations. See http://www.john-uebersax.com/stat/tetra.htm for information.
329
+ * New Statsample::Dataset.merge
330
+ * New Statsample::Vector.dichotomize
331
+ * New ItemReliability.item_difficulty_analysis
332
+ * New module Statsample::SPSS, to export information to SPSS. For now, only tetrachoric correlation matrix are provided
333
+ * All SpreadSheet based importers now accept repeated variable names and renames they on the fly
334
+ * MultipleRegression::BaseEngine moved to new file
335
+ * Bug fix for MultipleRegression::GslEngine checks for Alglib, not GSL
336
+
337
+ === 0.5.0 / 2009-09-26
338
+ * Vector now uses a Hash as a third argument
339
+ * Tested on Ruby 1.8.6, 1.8.7 and 1.9.1 with multiruby
340
+
341
+ === 0.4.1 / 2009-09-12
342
+ * More methods and usage documentation
343
+ * Logit tests
344
+ * Bug fix: rescue for requires doesn't specify LoadError
345
+ * Binomial::BaseEngine new methods: coeffs_se, coeffs, constant and constant_se
346
+
347
+ === 0.4.0 / 2009-09-10
348
+ * New Distribution module, based on statistics2.rb by Shin-ichiro HARA. Replaces all instances of GSL distributions pdf and cdf calculations for native calculation.
349
+ * New Maximum Likehood Estimation for Logit, Probit and Normal Distribution using Von Tessin(2005) algorithm. See MLE class and subclasses for more information.
350
+ * New Binomial regression subclasses (Logit and Probit), usign MLE class
351
+ * Added tests for gsl, Distribution, MLE and Logit
352
+ * Bug fix on svggraph.rb. Added check_type for scale graphics
353
+ * Bug fix on gdchart. Replaced old Nominal, Ordinal and Scale for Vector
354
+
355
+ === 0.3.4 / 2009-08-21
356
+ * Works with statsample-optimization 2.0.0
357
+ * Vector doesn't uses delegation. All methods are part of Vector
358
+ * Added Combination. Generates all combination of n elements taken r at a time
359
+ * Bivariate#prop_pearson now can uses as a second parameter :both, :left, :right, :positive or :negative
360
+ * Added LICENSE.txt
361
+
362
+ === 0.3.3 / 2009-08-11
363
+ * Added i18n support. For now, only spanish translation available
364
+ * Bug fix: Test now load libraries on ../lib path
365
+ * Excel and CSV importers automatically modify type of vector to Scale when all data are numbers or nils values
366
+
367
+ === 0.3.2 / 2009-08-04
368
+
369
+ * Added Regression::Multiple::GslEngine
370
+ * Added setup.rb
371
+ * Crosstab#row_label and #column_name
372
+ * DominanceAnalysis and DominanceAnalysisBootstrap uses Dataset#labels for Vector names.
373
+
374
+ === 0.3.1 / 2009-08-03
375
+
376
+ * Name and logic of Regression classes changed. Now, you have Regression::Simple class and Regression::Multiple module with two engines: RubyEngine and AlglibEngne
377
+ * New Crosstab#summary
378
+
379
+ === 0.3.0 / 2009-08-02
380
+
381
+ * Statsample renamed to Statsample
382
+ * Optimization extension goes to another gem: ruby-statsample-optimization
383
+
384
+ === 0.2.0 / 2009-08-01
385
+
386
+ * One Way Anova on Statsample::Anova::OneWay
387
+ * Dominance Analysis!!!! The one and only reason to develop a Multiple Regression on pure ruby.
388
+ * Multiple Regression on Multiple Regression module. Pairwise (pure ruby) or MultipleRegressionPairwise and Listwise (optimized) on MultipleRegressionAlglib and
389
+ * New Dataset#to_gsl_matrix, #from_to,#[..],#bootstrap,#vector_missing_values, #vector_count_characters, #each_with_index, #collect_with_index
390
+ * New Vector#box_cox_transformation
391
+ * Module Correlation renamed to Bivariate
392
+ * Some fancy methods and classes to create Summaries
393
+ * Some documentation about Algorithm used on doc_latex
394
+ * Deleted 'distributions' extension. Ruby/GSL has all the pdf and cdf you ever need.
395
+ * Tests work without any dependency. Only nags about missing deps.
396
+ * Test for MultipleRegression, Anova, Excel, Bivariate.correlation_matrix and many others
397
+
398
+ === 0.1.9 / 2009-05-22
399
+
400
+ * Class Vector: new method vector_standarized_pop, []=, min,max
401
+ * Class Dataset: global variable $RUBY_SS_ROW stores the row number on each() and related methods. dup() with argument returns a copy of the dataset only for given fields. New methods: standarize, vector_mean, collect, verify,collect_matrix
402
+ * Module Correlation: new methods covariance, t_pearson, t_r, prop_pearson, covariance_matrix, correlation_matrix, correlation_probability_matrix
403
+ * Module SRS: New methods estimation_n0 and estimation_n
404
+ * Module Reliability: new ItemCharacteristicCurve class
405
+ * New HtmlReport class
406
+ * New experimental SPSS Class.
407
+ * Converters: Module CSV with new options. Added write() method for GGobi module
408
+ * New Mx exporter (http://www.vcu.edu/mx/)
409
+ * Class SimpleRegression: new methods standard error
410
+
411
+ * Added tests for regression and reliability, Vector#vector_mean, Dataset#dup (partial) and Dataset#verify
412
+
413
+
414
+ === 0.1.8 / 2008-12-10
415
+ * Added Regression and Reliability modules
416
+ * Class Vector: added methods vector_standarized, recode, inspect, ranked
417
+ * Class Dataset: added methods vector_by_calculation, vector_sum, filter_field
418
+ * Module Correlation: added methods like spearman, point biserial and tau-b
419
+ * Added tests for Vector#ranked, Vector#vector_standarized, Vector#sum_of_squared_deviation, Dataset#vector_by_calculation, Dataset#vector_sum, Dataset#filter_field and various test for Correlation module
420
+ * Added demos: item_analysis and sample_test
421
+
422
+ === 0.1.7 / 2008-10-1
423
+ * New module for codification
424
+ * ...
425
+ === 0.1.6 / 2008-09-26
426
+ * New modules for SRS and stratified sampling
427
+ * Statsample::Database for read and write onto databases.
428
+ You could use Database and CSV on-tandem for mass-editing and reimport
429
+ of databases
430
+
431
+ === 0.1.5 / 2008-08-29
432
+ * New extension statsampleopt for optimizing some functions on Statsample submodules
433
+ * New submodules Correlation and Test
434
+
435
+ === 0.1.4 / 2008-08-27
436
+
437
+ * New extension, with cdf functions for
438
+ chi-square, t, gamma and normal distributions.
439
+ Based on dcdflib (http://www.netlib.org/random/)
440
+ Also, has a function to calculate the tail for a noncentral T distribution
441
+
442
+ === 0.1.3 / 2008-08-22
443
+
444
+ * Operational versions of Vector, Dataset, Crosstab and Resample
445
+ * Read and write CSV files
446
+ * Calculate chi-square for 2 matrixes
447
+
448
+ === 0.1.1 - 0.1.2 / 2008-08-18
449
+
450
+ * Included several methods on Ruby::Type classes
451
+ * Organized dirs with sow
452
+
453
+
454
+ === 0.1.0 / 2008-08-12
455
+
456
+ * First version.
457
+