regexp-examples 0.5.0 → 0.5.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7)

checksums.yaml +4 -4
data/README.md +8 -7
data/lib/regexp-examples/groups.rb +18 -16
data/lib/regexp-examples/parser.rb +21 -11
data/lib/regexp-examples/version.rb +1 -1
data/spec/regexp-examples_spec.rb +21 -3
metadata +2 -2

checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
 SHA1:
-  metadata.gz: 460970b9b7691fe163a9334542ccd57c71241c2a
-  data.tar.gz: 0c4ce5df7f59e72af6de41b8538d72b52c40939d
+  metadata.gz: ff205ed61ad0ca0c2dc2383afd17930110763b44
+  data.tar.gz: a13f23a7d56b9b7d7e4323380d705276793e6e4f
 SHA512:
-  metadata.gz: 5932e6ea64e3008dad054d67f906863244366324e28cf1d280a222966b77dc1bc816760d6af1d9f023518b40ccfbdd8ad9c634060d2811ca5d687e333604e268
-  data.tar.gz: fd2c136a9457aac2e953fe77c972925a6af9f420f025ca9f7500a8252636ebf1ba2b1104232d17719de5b1c53cf8e1a4316c52e789bed113b6589b8cf71c7dda
+  metadata.gz: 068c7dbd13945c0487b91c61e9cae304323775adc82cd4e80092ab32505eb46f31aaf98736f5a353eb1ad06a471864f9432ce3d203097f2dcfc6e696dba07e12
+  data.tar.gz: c2014a4055874af1edbdc3fbee60e58b3f182ba37278f5f0738f4699256a0669b84e8b7d731494c5493e04fb249c3135cb31e5a716e687cd677fd3f9fa1a8da9

data/README.md CHANGED

@@ -39,15 +39,16 @@ For more detail on this, see [configuration options](#configuration-options).
   * Groups work fine, even if nested or optional e.g. `/(even(this(works?))) \1 \2 \3/`, `/what about (this)? \1/`
   * Non-capture groups, e.g. `/(?:foo)/`
 * Control characters, e.g. `/\ca/`, `/\cZ/`, `/\C-9/`
-* Escape sequences, e.g. `/\x42/`, `/\x3D/`, `/\x5word/`, `/#{"\x80".force_encoding("ASCII-8BIT")}/`
+* Escape sequences, e.g. `/\x42/`, `/\x5word/`, `/#{"\x80".force_encoding("ASCII-8BIT")}/`
 * Unicode characters, e.g. `/\u0123/`, `/\uabcd/`, `/\u{789}/`
 * **Arbitrarily complex combinations of all the above!**
-## Bugs and Not-Yet-Supported syntax
+* Regexp options can also be used:
+  * Case insensitive examples: `/cool/i.examples #=> ["cool", "cooL", "coOl", "coOL", ...]`
+  * Multiline examples: `/./m.examples(max_group_results: 999) #=> ["a", "b", "c", ..., "\n"]`
+  * Extended form examples: `/line1 #comment \n line2/x.examples #=> ["line1line2"]`
-* Other options (besides ingnorecase), will currently just be ignored, for example:
-  * `/white  space/x.examples` will not strip out the whitespace from the pattern, i.e. this incorrectly returns `["white  space"]` rather than `["whitespace"]`
-  * `/./m.examples(max_group_results: 999)` will not include `"\n"`
+## Bugs and Not-Yet-Supported syntax
 * Nested character classes, and the use of set intersection ([See here](http://www.ruby-doc.org/core-2.2.0/Regexp.html#class-Regexp-label-Character+Classes) for the official documentation on this.) For example:
   * `/[[abc]]/.examples`  (which _should_ return `["a", "b", "c"]`)
@@ -60,14 +61,14 @@ For more detail on this, see [configuration options](#configuration-options).
 * The patterns: `/\10/` ... `/\77/` should match the octal representation of their character code, if there is no nth grouped subexpression. For example, `/\10/.examples` should return `["\x08"]`. Funnily enough, I did not think of this when writing my regexp parser.
-There are loads more (increasingly obscure) unsupported bits of syntax, which I cannot be bothered to write out here. Full documentation on all the various other obscurities in the ruby (version 2.x) regexp parser can be found [here](https://raw.githubusercontent.com/k-takata/Onigmo/master/doc/RE).
 Using any of the following will raise a RegexpExamples::UnsupportedSyntax exception (until such time as they are implemented!):
 * POSIX bracket expressions, e.g. `/[[:alnum:]]/`, `/[[:space:]]/`
 * Named properties, e.g. `/\p{L}/` ("Letter"), `/\p{Arabic}/` ("Arabic character"), `/\p{^Ll}/` ("Not a lowercase letter")
 * Subexpression calls, e.g. `/(?<name> ... \g<name>* )/` (Note: These could get _really_ ugly to implement, and may even be impossible, so I highly doubt it's worth the effort!)
+There are loads more (increasingly obscure) unsupported bits of syntax, which I cannot be bothered to write out here. Full documentation on all the various other obscurities in the ruby (version 2.x) regexp parser can be found [here](https://raw.githubusercontent.com/k-takata/Onigmo/master/doc/RE).
 ## Impossible features ("illegal syntax")
 The following features in the regex language can never be properly implemented into this gem because, put simply, they are not technically "regular"!

data/lib/regexp-examples/groups.rb CHANGED

@@ -23,11 +23,11 @@ module RegexpExamples
     end
   end
-  module GroupWithOptions
-    attr_reader :options
+  module GroupWithIgnoreCase
+    attr_reader :ignorecase
     def result
       group_result = super
-      if options[:ignorecase]
+      if ignorecase
         group_result
           .concat( group_result.map(&:swapcase) )
           .uniq
@@ -38,10 +38,10 @@ module RegexpExamples
   end
   class SingleCharGroup
-    prepend GroupWithOptions
-    def initialize(char, options)
+    prepend GroupWithIgnoreCase
+    def initialize(char, ignorecase)
       @char = char
-      @options = options
+      @ignorecase = ignorecase
     end
     def result
       [GroupResult.new(@char)]
@@ -49,10 +49,10 @@ module RegexpExamples
   end
   class CharGroup
-    prepend GroupWithOptions
-    def initialize(chars, options)
+    prepend GroupWithIgnoreCase
+    def initialize(chars, ignorecase)
       @chars = chars
-      @options = options
+      @ignorecase = ignorecase
       if chars[0] == "^"
         @negative = true
         @chars = @chars[1..-1]
@@ -119,25 +119,27 @@ module RegexpExamples
   end
   class DotGroup
-    prepend GroupWithOptions
-    def initialize(options={})
-      @options = options
+    attr_reader :multiline
+    def initialize(multiline)
+      @multiline = multiline
     end
     def result
-      CharSets::Any.map do |result|
+      chars = CharSets::Any
+      chars |= ["\n"] if multiline
+      chars.map do |result|
         GroupResult.new(result)
       end
     end
   end
   class MultiGroup
-    prepend GroupWithOptions
+    prepend GroupWithIgnoreCase
     attr_reader :group_id
-    def initialize(groups, group_id, options)
+    def initialize(groups, group_id, ignorecase)
       @groups = groups
       @group_id = group_id
-      @options = options
+      @ignorecase = ignorecase
     end
     # Generates the result of each contained group
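
The groups.rb change above relies on `prepend`, so that the module's `result` runs first and reaches the class's own `result` through `super`. The following is a minimal standalone sketch of that pattern, not the gem's code: it assumes plain strings in place of the gem's `GroupResult` objects and a three-letter stand-in for `CharSets::Any`. The names `IgnoreCaseSketch`, `CharSketch`, `DotSketch` and `ANY_SKETCH` are hypothetical.

```ruby
# Hypothetical stand-in for the gem's CharSets::Any (assumption: the real set is larger).
ANY_SKETCH = %w(a b c).freeze

# Prepended module: its #result wraps the class's own #result via super.
module IgnoreCaseSketch
  def result
    base = super
    return base unless @ignorecase
    # Add the case-swapped variants, then drop duplicates (e.g. for digits).
    (base + base.map(&:swapcase)).uniq
  end
end

class CharSketch
  prepend IgnoreCaseSketch

  def initialize(char, ignorecase)
    @char = char
    @ignorecase = ignorecase
  end

  def result
    [@char]
  end
end

# The dot group only cares about the multiline flag, so it takes that flag directly.
class DotSketch
  def initialize(multiline)
    @multiline = multiline
  end

  def result
    chars = ANY_SKETCH
    chars |= ["\n"] if @multiline # Array#| returns a new array; the constant is untouched
    chars
  end
end

p CharSketch.new("a", true).result
p DotSketch.new(true).result
```

Passing a single boolean, rather than an options hash, keeps each group aware only of the flag that actually affects it.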

data/lib/regexp-examples/parser.rb CHANGED

@@ -3,10 +3,12 @@ module RegexpExamples
     attr_reader :regexp_string
     def initialize(regexp_string, regexp_options, config_options={})
       @regexp_string = regexp_string
-      @ignorecase = ( regexp_options & Regexp::IGNORECASE == 1 )
+      @ignorecase = !(regexp_options & Regexp::IGNORECASE).zero?
+      @multiline = !(regexp_options & Regexp::MULTILINE).zero?
+      @extended = !(regexp_options & Regexp::EXTENDED).zero?
       @num_groups = 0
       @current_position = 0
-      RegexpExamples::ResultCountLimiters.configure!(
+      ResultCountLimiters.configure!(
         config_options[:max_repeater_variance],
         config_options[:max_group_results]
       )
@@ -28,10 +30,6 @@ module RegexpExamples
     private
-    def regexp_options
-      {ignorecase: @ignorecase}
-    end
     def parse_group(repeaters)
       case next_char
       when '('
@@ -58,12 +56,24 @@ module RegexpExamples
         else
           raise IllegalSyntaxError, "Anchors cannot be supported, as they are not regular"
         end
+      when /[#\s]/
+        if @extended
+          parse_extended_whitespace
+          group = parse_single_char_group('') # Ignore the whitespace/comment
+        else
+          group = parse_single_char_group(next_char)
+        end
       else
         group = parse_single_char_group(next_char)
       end
       group
     end
+    def parse_extended_whitespace
+      whitespace_chars = rest_of_string.match(/#.*|\s+/)[0]
+      @current_position += whitespace_chars.length - 1
+    end
     def parse_after_backslash_group
       @current_position += 1
       case
@@ -78,7 +88,7 @@ module RegexpExamples
           # Note: The `.dup` is important, as it prevents modifying the constant, in
           # CharGroup#init_ranges (where the '-' is moved to the front)
           BackslashCharMap[next_char].dup,
-          regexp_options
+          @ignorecase
         )
       when rest_of_string =~ /\A(c|C-)(.)/ # Control character
         @current_position += $1.length
@@ -153,7 +163,7 @@ module RegexpExamples
         end
       end
       groups = parse
-      MultiGroup.new(groups, group_id, regexp_options)
+      MultiGroup.new(groups, group_id, @ignorecase)
     end
     def parse_multi_end_group
@@ -181,11 +191,11 @@ module RegexpExamples
         chars << next_char
         @current_position += 1
       end
-      CharGroup.new(chars, regexp_options)
+      CharGroup.new(chars, @ignorecase)
     end
     def parse_dot_group
-      DotGroup.new(regexp_options)
+      DotGroup.new(@multiline)
     end
     def parse_or_group(left_repeaters)
@@ -196,7 +206,7 @@ module RegexpExamples
     def parse_single_char_group(char)
-      SingleCharGroup.new(char, regexp_options)
+      SingleCharGroup.new(char, @ignorecase)
     end
     def parse_backreference_group(match)
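
The parser.rb change above switches the flag checks from comparing against `1` to testing whether the masked bits are non-zero. `Regexp::IGNORECASE` is 1, `Regexp::EXTENDED` is 2 and `Regexp::MULTILINE` is 4, so a comparison with `1` can only ever succeed for the ignore-case flag. A small sketch of the same decoding applied to a `Regexp` object; the helper name `option_flags_sketch` is hypothetical and not part of the gem:

```ruby
# Decode the three standard option bits of a Regexp into booleans.
# Hypothetical helper for illustration; the gem stores these as instance variables.
def option_flags_sketch(regexp)
  opts = regexp.options
  {
    ignorecase: !(opts & Regexp::IGNORECASE).zero?,
    multiline:  !(opts & Regexp::MULTILINE).zero?,
    extended:   !(opts & Regexp::EXTENDED).zero?
  }
end

p option_flags_sketch(/a/i)
p option_flags_sketch(/a/mx)
```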

data/lib/regexp-examples/version.rb CHANGED

@@ -1,3 +1,3 @@
 module RegexpExamples
-  VERSION = '0.5.0'
+  VERSION = '0.5.1'
 end

data/spec/regexp-examples_spec.rb CHANGED

@@ -233,7 +233,7 @@ RSpec.describe Regexp, "#examples" do
     context "exact examples match" do
       # More rigorous tests to assert that ALL examples are being listed
-      context "default options" do
+      context "default config options" do
         # Simple examples
         it { expect(/[ab]{2}/.examples).to eq ["aa", "ab", "ba", "bb"] }
         it { expect(/(a|b){2}/.examples).to eq ["aa", "ab", "ba", "bb"] }
@@ -243,7 +243,7 @@ RSpec.describe Regexp, "#examples" do
         it { expect(/a{1}?/.examples).to eq ["", "a"] }
       end
-      context "max_repeater_variance option" do
+      context "max_repeater_variance config option" do
         it do
           expect(/a+/.examples(max_repeater_variance: 5))
             .to eq %w(a aa aaa aaaa aaaaa aaaaaa)
@@ -254,7 +254,7 @@ RSpec.describe Regexp, "#examples" do
         end
       end
-      context "max_group_results option" do
+      context "max_group_results config option" do
         it do
           expect(/\d/.examples(max_group_results: 10))
             .to eq %w(0 1 2 3 4 5 6 7 8 9)
@@ -266,6 +266,24 @@ RSpec.describe Regexp, "#examples" do
         it { expect(/a+/i.examples).to eq %w(a A aa aA Aa AA aaa aaA aAa aAA Aaa AaA AAa AAA) }
         it { expect(/([ab])\1/i.examples).to eq %w(aa bb AA BB) }
       end
+      context "multiline" do
+        it { expect(/./.examples(max_group_results: 999)).not_to include "\n" }
+        it { expect(/./m.examples(max_group_results: 999)).to include "\n" }
+      end
+      context "exteded form" do
+        it { expect(/a b c/x.examples).to eq %w(abc) }
+        it { expect(/a#comment/x.examples).to eq %w(a) }
+        it do
+          expect(
+            /
+              line1 #comment
+              line2 #comment
+            /x.examples
+          ).to eq %w(line1line2)
+        end
+      end
     end
   end
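
The extended-form specs above expect whitespace and `#` comments to vanish from the generated examples. The sketch below shows the idea using the same `/#.*|\s+/` pattern that `parse_extended_whitespace` matches against, applied to a whole string at once. This is a simplification under stated assumptions: the gem skips these characters one token at a time inside its parser, and this sketch ignores escaped characters such as `\#` and anything inside character classes. The name `strip_extended_sketch` is hypothetical.

```ruby
# Remove extended-mode comments (from '#' to end of line) and runs of whitespace.
# Simplified illustration only: escapes and character classes are not handled.
def strip_extended_sketch(source)
  source.gsub(/#.*|\s+/, "")
end

p strip_extended_sketch("a b c")
p strip_extended_sketch("a#comment")
p strip_extended_sketch("line1 #comment \n line2")
```

Because `.` does not match a newline by default, `#.*` stops at the end of the current line, which is what lets a comment on one line leave the next line's pattern intact.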

metadata CHANGED

@@ -1,14 +1,14 @@
 --- !ruby/object:Gem::Specification
 name: regexp-examples
 version: !ruby/object:Gem::Version
-  version: 0.5.0
+  version: 0.5.1
 platform: ruby
 authors:
 - Tom Lord
 autorequire:
 bindir: bin
 cert_chain: []
-date: 2015-02-04 00:00:00.000000000 Z
+date: 2015-02-08 00:00:00.000000000 Z
 dependencies:
 - !ruby/object:Gem::Dependency
   name: bundler