RubyGems - stanford-core-nlp - Versions diffs - 0.3.0 → 0.3.1 - Mend

stanford-core-nlp 0.3.0 → 0.3.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

data/README.md CHANGED

@@ -4,6 +4,8 @@
 This gem provides high-level Ruby bindings to the [Stanford Core NLP package](http://nlp.stanford.edu/software/corenlp.shtml), a set natural language processing tools for tokenization, part-of-speech tagging, lemmatization, and parsing of several languages, as well as named entity recognition and coreference resolution in English. This gem is compatible with Ruby 1.9.2 and above.
+If you are looking for an full-scale natural language processing framework in Ruby, have a look at [Treat](https://github.com/louismullie/treat).
 **Installing**
 First, install the gem: `gem install stanford-core-nlp`. Then, download the Stanford Core NLP JAR and model files. Three different packages are available:

data/lib/stanford-core-nlp.rb CHANGED

@@ -1,6 +1,6 @@
 module StanfordCoreNLP
-  VERSION = '0.3.0'
+  VERSION = '0.3.1'
   require 'bind-it'
   extend BindIt::Binding
@@ -9,9 +9,10 @@ module StanfordCoreNLP
   # BindIt Configuration Options #
   # ############################ #
-  # The path in which to look for the Stanford JAR files,
-  # with a trailing slash.
-  self.jar_path = File.dirname(__FILE__) + '/../bin/'
+  # The default path for the JAR files
+  # is the gem's bin folder.
+  self.jar_path = File.dirname(__FILE__).
+  gsub('/lib', '') + '/bin/'
   # Load the JVM with a minimum heap size of 512MB,
   # and a maximum heap size of 1024MB.
@@ -24,6 +25,7 @@ module StanfordCoreNLP
   self.default_jars = [
     'joda-time.jar',
     'xom.jar',
+    'stanford-parser.jar',
     'stanford-corenlp.jar',
     'bridge.jar'
   ]
@@ -42,7 +44,11 @@ module StanfordCoreNLP
   # Default namespace is the Stanford pipeline namespace.
   self.default_namespace = 'edu.stanford.nlp.pipeline'
+  # ########################### #
+  # Stanford Core NLP bindings  #
+  # ########################### #
   require 'stanford-core-nlp/config'
   require 'stanford-core-nlp/bridge'
@@ -51,6 +57,8 @@ module StanfordCoreNLP
     attr_accessor :model_files
     # The folder in which to look for models.
     attr_accessor :model_path
+    # Store the language currently being used.
+    attr_accessor :language
   end
   # The path to the main folder containing the folders
@@ -63,6 +71,7 @@ module StanfordCoreNLP
   # code (e.g. :english, :eng or :en will work).
   def self.use(language)
     lang = nil
+    self.language = language
     self.model_files = {}
     Config::LanguageCodes.each do |l,codes|
       lang = codes[2] if codes.include?(language)
@@ -99,8 +108,15 @@ module StanfordCoreNLP
   # properties.
   def self.load(*annotators)
+    # Take care of Windows users.
+    if self.running_on_windows?
+      self.jar_path.gsub!('/', '\\')
+      self.model_path.gsub!('/', '\\')
+    end
     # Make the bindings.
     self.bind
     # Prepend the JAR path to the model files.
     properties = {}
     self.model_files.each do |k,v|
@@ -119,8 +135,14 @@ module StanfordCoreNLP
       properties[k] = f
     end
+    # Bug fix for French parser
+    if self.language == :french
+      properties['parser.flags'] = ''
+    end
     properties['annotators'] =
     annotators.map { |x| x.to_s }.join(', ')
     CoreNLP.new(get_properties(properties))
   end
@@ -132,7 +154,7 @@ module StanfordCoreNLP
     end
     props
   end
   # Get a Java ArrayList binding to pass lists
   # of tokens to the Stanford Core NLP process.
   def self.get_list(tokens)
@@ -143,4 +165,9 @@ module StanfordCoreNLP
     list
   end
+  # Returns true if we're running on Windows.
+  def self.running_on_windows?
+    RUBY_PLATFORM.split("-")[1] == 'mswin32'
+  end
 end

metadata CHANGED

@@ -1,7 +1,7 @@
 --- !ruby/object:Gem::Specification
 name: stanford-core-nlp
 version: !ruby/object:Gem::Version
-  version: 0.3.0
+  version: 0.3.1
   prerelease:
 platform: ruby
 authors:
@@ -9,7 +9,7 @@ authors:
 autorequire:
 bindir: bin
 cert_chain: []
-date: 2012-04-05 00:00:00.000000000 Z
+date: 2012-05-15 00:00:00.000000000 Z
 dependencies:
 - !ruby/object:Gem::Dependency
   name: bind-it