RubyGems - stanford-core-nlp - Versions diffs - 0.1.4 → 0.1.5 - Mend

stanford-core-nlp 0.1.4 → 0.1.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7) hide show

data/README.markdown +22 -23
data/bin/INFO +1 -1
data/lib/stanford-core-nlp.rb +126 -52
data/lib/stanford-core-nlp/config.rb +453 -0
data/lib/stanford-core-nlp/java_wrapper.rb +27 -0
metadata +5 -5
data/lib/stanford-core-nlp/stanford_annotations.rb +0 -401

data/README.markdown CHANGED Viewed

@@ -1,12 +1,12 @@
 **About**
-This gem provides high-level Ruby bindings to the [Stanford Core NLP package](http://nlp.stanford.edu/software/corenlp.shtml), a set natural language processing tools for English, including tokenization, part-of-speech tagging, lemmatization, named entity recognition, parsing, and coreference resolution.
+This gem provides high-level Ruby bindings to the [Stanford Core NLP package](http://nlp.stanford.edu/software/corenlp.shtml), a set natural language processing tools that features tokenization, part-of-speech tagging, lemmatization, and parsing for five languages (English, French, German, Arabic and Chinese), as well as named entity recognition and coreference resolution for English.
 **Installing**
 1. Install the gem: `gem install stanford-core-nlp`.
-2. Download the Stanford Core NLP JAR and model files [here](http://louismullie.com/stanford-core-nlp-english.zip). Place the contents of the extracted archive inside the /bin/ folder of the stanford-core-nlp gem (typically this is /usr/local/lib/ruby/gems/1.9.1/gems/stanford-core-nlp-0.x/bin/). This package only includes model files for English; see below for information on adding model files for other languages.
+2. Download the Stanford Core NLP JAR and model files. Two package are available with the necessary files: a package for [English only](http://louismullie.com/stanford-core-nlp-english.zip), or a package with models for [all languages](http://louismullie.com/stanford-core-nlp-all.zip). Place the contents of the extracted archive inside the /bin/ folder of the stanford-core-nlp gem (typically this is /usr/local/lib/ruby/gems/1.9.1/gems/stanford-core-nlp-0.x/bin/).
 **Configuration**
@@ -23,18 +23,12 @@ After installing and requiring the gem (`require 'stanford-core-nlp'`), you may
     # Redirect VM output to log.txt
     StanfordCoreNLP.log_file = 'log.txt'
-You may also want to load your own classes from the Stanford NLP to do more specific tasks. The gem provides an API to do this:
-    # Default base class is edu.stanford.nlp.pipeline.
-    StanfordCoreNLP.load('PTBTokenizerAnnotator')
-    puts StanfordCoreNLP::PTBTokenizerAnnotator.inspect
-      # => #<Rjb::Edu_stanford_nlp_pipeline_PTBTokenizerAnnotator>
-    # Here, we specify another base class.
-    StanfordCoreNLP.load('MaxentTagger', 'edu.stanford.nlp.tagger')
-    puts StanfordCoreNLP::MaxentTagger.inspect
-      # => <Rjb::Edu_stanford_nlp_tagger_maxent_MaxentTagger:0x007f88491e2020>
+    # Use the model files for a different language than English.
+    StanfordCoreNLP.use(:french)
+	# Change a specific model file.
+ 	StanfordCoreNLP.set_model('pos.model', 'english-left3words-distsim.tagger')
 **Using the gem**
     text = 'Angela Merkel met Nicolas Sarkozy on January 25th in ' +
@@ -64,22 +58,27 @@ You may also want to load your own classes from the Stanford NLP to do more spec
         end
     end
-A good reference for names of annotations are the Stanford Javadocs for [CoreAnnotations](http://nlp.stanford.edu/nlp/javadoc/javanlp/edu/stanford/nlp/ling/CoreAnnotations.html), [CoreCorefAnnotations](http://nlp.stanford.edu/nlp/javadoc/javanlp/edu/stanford/nlp/dcoref/CorefCoreAnnotations.html), and [TreeCoreAnnotations](http://nlp.stanford.edu/nlp/javadoc/javanlp/edu/stanford/nlp/trees/TreeCoreAnnotations.html). For a full list of all possible annotations, see the 'stanford_annotations.rb' file inside the gem. The Ruby symbol (e.g. :named_entity_tag) corresponding ot a Java annotation class follows the simple un-camel-casing convention, with 'Annotation' at the end removed. For example, the annotation NamedEntityTagAnnotation translates to :named_entity_tag, PartOfSpeechAnnotation to :part_of_speech, etc.
+> Note: You need to load the StanfordCoreNLP pipeline before using the StanfordCoreNLP::Text class.
-**Adding models for other languages for the parser and tagger**
+A good reference for names of annotations are the Stanford Javadocs for [CoreAnnotations](http://nlp.stanford.edu/nlp/javadoc/javanlp/edu/stanford/nlp/ling/CoreAnnotations.html), [CoreCorefAnnotations](http://nlp.stanford.edu/nlp/javadoc/javanlp/edu/stanford/nlp/dcoref/CorefCoreAnnotations.html), and [TreeCoreAnnotations](http://nlp.stanford.edu/nlp/javadoc/javanlp/edu/stanford/nlp/trees/TreeCoreAnnotations.html). For a full list of all possible annotations, see the 'config.rb' file inside the gem. The Ruby symbol (e.g. :named_entity_tag) corresponding to a Java annotation class follows the simple un-camel-casing convention, with 'Annotation' at the end removed. For example, the annotation NamedEntityTagAnnotation translates to :named_entity_tag, PartOfSpeechAnnotation to :part_of_speech, etc.
-- For the Stanford Parser, download the [parser files](http://nlp.stanford.edu/software/lex-parser.shtml), and copy from the grammar/ directory the grammars you need into the gem's bin/grammar directory (e.g. /usr/local/lib/ruby/gems/1.9.1/gems/stanford-core-nlp-0.x/bin/grammar). Grammars are available for Arabic, Chinese, French, German and Xinhua.
-- For the Stanford Tagger, download the [tagger files](http://nlp.stanford.edu/software/tagger.shtml), and copy from the models/ directory the models you need into the gem's bin/models directory. Models are available for Arabic, Chinese, French and German.
+**Loading specific classes**
-Then, configure the gem to use your newly added files, e.g.:
-    StanfordCoreNLP.set_model('parser.model', '/path/to/gem/bin/grammar/chinesePCFG.ser.gz')
-    StanfordCoreNLP.set_model('tagger.model', '/path/to/gem/bin/grammar/chinese.tagger')
-    pipeline =  StanfordCoreNLP.load(:ssplit, :tokenize, :pos, :parse)
+You may also want to load your own classes from the Stanford NLP to do more specific tasks. The gem provides an API to do this:
+    # Default base class is edu.stanford.nlp.pipeline.
+    StanfordCoreNLP.load_class('PTBTokenizerAnnotator')
+    puts StanfordCoreNLP::PTBTokenizerAnnotator.inspect
+      # => #<Rjb::Edu_stanford_nlp_pipeline_PTBTokenizerAnnotator>
+    # Here, we specify another base class.
+    StanfordCoreNLP.load_class('MaxentTagger', 'edu.stanford.nlp.tagger')
+    puts StanfordCoreNLP::MaxentTagger.inspect
+      # => <Rjb::Edu_stanford_nlp_tagger_maxent_MaxentTagger:0x007f88491e2020>
 **Current known issues**
-The models included with the gem for the NER system are missing two files: "edu/stanford/nlp/models/dcoref/countries" and "edu/stanford/nlp/models/dcoref/statesandprovinces", which I couldn't find anywhere. I will be very grateful if somebody could add/e-mail me these files.
+The models included with the gem for the NER system are missing two files: "edu/stanford/nlp/models/dcoref/countries" and "edu/stanford/nlp/models/dcoref/statesandprovinces", which I couldn't find anywhere. I will be grateful if somebody could add/e-mail me these files.
 **Contributing**

data/bin/INFO CHANGED Viewed

	@@ -1 +1 @@
1	- This is where you should put the JAR files.
1	+ This is where you should put the JAR files and the folders with the model files.

data/lib/stanford-core-nlp.rb CHANGED Viewed

@@ -1,81 +1,135 @@
 module StanfordCoreNLP
-  VERSION = '0.1.4'
-  require 'stanford-core-nlp/jar_loader.rb'
+  VERSION = '0.1.5'
+  require 'stanford-core-nlp/jar_loader'
   require 'stanford-core-nlp/java_wrapper'
-  require 'stanford-core-nlp/stanford_annotations'
+  require 'stanford-core-nlp/config'
   class << self
-    # The path in which to look for the Stanford JAR files.
-    # This is passed to JarLoader.
+    # The path in which to look for the Stanford JAR files,
+    # with a trailing slash.
+    #
+    # The structure of the JAR folder must be as follows:
+    #
+    # Files:
+    #
+    #  /stanford-core-nlp.jar
+    #  /joda-time.jar
+    #  /xom.jar
+    #  /bridge.jar*
+    #
+    # Folders:
+    #
+    #  /classifiers         # Models for the NER system.
+    #  /dcoref              # Models for the coreference resolver.
+    #  /taggers             # Models for the POS tagger.
+    #  /grammar             # Models for the parser.
+    #
+    # *The file bridge.jar is a thin JAVA wrapper over the
+    # Stanford Core NLP get() function, which allows to
+    # retrieve annotations using static classes as names.
+    # This works around one of the lacunae of Rjb.
     attr_accessor :jar_path
-    # The flags for starting the JVM machine.
-    # Parser and named entity recognizer are very memory consuming.
+    # The flags for starting the JVM machine. The parser
+    # and named entity recognizer are very memory consuming.
     attr_accessor :jvm_args
     # A file to redirect JVM output to.
     attr_accessor :log_file
-    # The model files. Use #set_model to modify these.
+    # The model files for a given language.
     attr_accessor :model_files
   end
   # The default JAR path is the gem's bin folder.
   self.jar_path = File.dirname(__FILE__) + '/../bin/'
-  # Load the JVM with a minimum heap size of 512MB and a
+  # Load the JVM with a minimum heap size of 512MB and a
   # maximum heap size of 1024MB.
   self.jvm_args = ['-Xms512M', '-Xmx1024M']
   # Turn logging off by default.
   self.log_file = nil
-  # Default model files.
-  self.model_files = {
-    'pos.model' => 'taggers/english-left3words-distsim.tagger',
-    'ner.model.3class' => 'classifiers/all.3class.distsim.crf.ser.gz',
-    'ner.model.7class' => 'classifiers/muc.7class.distsim.crf.ser.gz',
-    'ner.model.MISCclass' => 'classifiers/conll.4class.distsim.crf.ser.gz',
-    'parser.model' => 'grammar/englishPCFG.ser.gz',
-    'dcoref.demonym' => 'dcoref/demonyms.txt',
-    'dcoref.animate' => 'dcoref/animate.unigrams.txt',
-    'dcoref.female' => 'dcoref/female.unigrams.txt',
-    'dcoref.inanimate' => 'dcoref/inanimate.unigrams.txt',
-    'dcoref.male' => 'dcoref/male.unigrams.txt',
-    'dcoref.neutral' => 'dcoref/neutral.unigrams.txt',
-    'dcoref.plural' => 'dcoref/plural.unigrams.txt',
-    'dcoref.singular' => 'dcoref/singular.unigrams.txt',
-    'dcoref.states' => 'dcoref/state-abbreviations.txt',
-    'dcoref.countries' => 'dcoref/unknown.txt',     # Fix - can somebody provide this file?
-    'dcoref.states.provinces' => 'dcoref/unknown.txt',   # Fix - can somebody provide this file?
-    'dcoref.extra.gender' => 'dcoref/namegender.combine.txt'
-  }
-  # Whether the classes are initialized or not.
-  @@initialized = false
-  # Whether the jars are loaded or not.
-  @@loaded = false
+  # Use models for a given language. Language can be
+  # supplied as full-length, or ISO-639 2 or 3 letter
+  # code (e.g. :english, :eng or :en will work).
+  def self.use(language)
+    lang = nil
+    self.model_files = {}
+    Config::LanguageCodes.each do |l,codes|
+      lang = codes[2] if codes.include?(language)
+    end
+    Config::Models.each do |n, languages|
+      models = languages[lang]
+      folder = Config::ModelFolders[n]
+      if models.is_a?(Hash)
+        n = n.to_s
+        n += '.model' if n == 'ner'
+        models.each do |m, file|
+          self.model_files["#{n}.#{m}"] =
+          folder + file
+        end
+      elsif models.is_a?(String)
+        self.model_files["#{n}.model"] =
+        folder + models
+      end
+    end
+  end
+  # Use english by default.
+  self.use(:english)
-  # Set a model file.
+  # Set a model file. Here are the default models for English:
+  #
+  #    'pos.model' => 'english-left3words-distsim.tagger',
+  #    'ner.model.3class' => 'all.3class.distsim.crf.ser.gz',
+  #    'ner.model.7class' => 'muc.7class.distsim.crf.ser.gz',
+  #    'ner.model.MISCclass' => 'conll.4class.distsim.crf.ser.gz',
+  #    'parser.model' => 'englishPCFG.ser.gz',
+  #    'dcoref.demonym' => 'demonyms.txt',
+  #    'dcoref.animate' => 'animate.unigrams.txt',
+  #    'dcoref.female' => 'female.unigrams.txt',
+  #    'dcoref.inanimate' => 'inanimate.unigrams.txt',
+  #    'dcoref.male' => 'male.unigrams.txt',
+  #    'dcoref.neutral' => 'neutral.unigrams.txt',
+  #    'dcoref.plural' => 'plural.unigrams.txt',
+  #    'dcoref.singular' => 'singular.unigrams.txt',
+  #    'dcoref.states' => 'state-abbreviations.txt',
+  #    'dcoref.extra.gender' => 'namegender.combine.txt'
+  #
   def self.set_model(name, file)
-    unless File.readable?(self.jar_path + file)
-      raise "JAR file #{self.jar_path + file} could not be found." +
-      "You may need to download this file manually and/or set paths properly."
-    end
-    self.model_files[name] = file
+    n = name.split('.')[0].intern
+    self.model_files[name] =
+    Config::ModelFolders[n] + file
   end
+  # Whether the classes are initialized or not.
+  @@initialized = false
+  # Whether the JAR files are loaded or not.
+  @@loaded = false
   # Load the JARs, create the classes.
   def self.init
     self.load_jars unless @@loaded
     self.create_classes
     @@initialized = true
   end
-  # Load a StanfordCoreNLP pipeline with the specified JVM flags and
-  # StanfordCoreNLP properties (hash of property => values).
+  # Load a StanfordCoreNLP pipeline with the
+  # specified JVM flags and StanfordCoreNLP
+  # properties.
   def self.load(*annotators)
     self.init unless @@initialized
     # Prepend the JAR path to the model files.
     properties = {}
-    self.model_files.each { |k,v| properties[k] = self.jar_path + v }
-    properties['annotators'] =
+    self.model_files.each do |k,v|
+      f = self.jar_path + v
+      unless File.readable?(f)
+        raise "Model file #{f} could not be found. " +
+        "You may need to download this file manually and/or set paths properly."
+      else
+        properties[k] = f
+      end
+    end
+    properties['annotators'] =
     annotators.map { |x| x.to_s }.join(', ')
     CoreNLP.new(get_properties(properties))
   end
@@ -101,17 +155,37 @@ module StanfordCoreNLP
     const_set(:Properties, Rjb::import('java.util.Properties'))
     const_set(:AnnotationBridge, Rjb::import('AnnotationBridge'))
   end
   # Load a class (e.g. PTBTokenizerAnnotator) in a specific
   # class path (default is 'edu.stanford.nlp.pipeline').
   # The class is then accessible under the StanfordCoreNLP
   # namespace, e.g. StanfordCoreNLP::PTBTokenizerAnnotator.
+  #
+  # List of annotators:
+  #
+  #  - PTBTokenizingAnnotator - tokenizes the text following Penn Treebank conventions.
+  #  - WordToSentenceAnnotator - splits a sequence of words into a sequence of sentences.
+  #  - POSTaggerAnnotator - annotates the text with part-of-speech tags.
+  #  - MorphaAnnotator - morphological normalizer (generates lemmas).
+  #  - NERAnnotator - annotates the text with named-entity labels.
+  #  - NERCombinerAnnotator - combines several NER models (use this instead of NERAnnotator!).
+  #  - TrueCaseAnnotator - detects the true case of words in free text (useful for all upper or lower case text).
+  #  - ParserAnnotator - generates constituent and dependency trees.
+  #  - NumberAnnotator - recognizes numerical entities such as numbers, money, times, and dates.
+  #  - TimeWordAnnotator - recognizes common temporal expressions, such as "teatime".
+  #  - QuantifiableEntityNormalizingAnnotator - normalizes the content of all numerical entities.
+  #  - SRLAnnotator - annotates predicates and their semantic roles.
+  #  - CorefAnnotator - implements pronominal anaphora resolution using a statistical model (deprecated!).
+  #  - DeterministicCorefAnnotator - implements anaphora resolution using a deterministic model (newer model, use this!).
+  #  - NFLAnnotator - implements entity and relation mention extraction for the NFL domain.
   def self.load_class(klass, base = 'edu.stanford.nlp.pipeline')
     self.load_jars unless @@loaded
     const_set(klass.intern, Rjb::import("#{base}.#{klass}"))
   end
-  # Create a java.util.Properties object from a hash.
+# Private helper functions.
+  private
+  # HCreate a java.util.Properties object from a hash.
   def self.get_properties(properties)
     props = Properties.new
     properties.each do |property, value|
@@ -119,10 +193,10 @@ module StanfordCoreNLP
     end
     props
   end
-  # Helper function: under_case -> CamelCase.
+  # Under_case -> CamelCase.
   def self.camel_case(text)
     text.to_s.gsub(/^[a-z]|_[a-z]/) { |a| a.upcase }.gsub('_', '')
   end
-end
+end

data/lib/stanford-core-nlp/config.rb ADDED Viewed

@@ -0,0 +1,453 @@
+module StanfordCoreNLP
+  class Config
+    # A hash of language codes in humanized,
+    # 2 and 3-letter ISO639 codes.
+    LanguageCodes = {
+      :english => [:en, :eng, :english],
+      :german => [:de, :ger, :german],
+      :french => [:fr, :fre, :french],
+      :arabic => [:ar, :ara, :arabic],
+      :chinese => [:ch, :chi, :chinese],
+      :xinhua => [:xi, :xin, :xinhua]
+    }
+    # Folders inside the JAR path for the models.
+    ModelFolders = {
+      :pos => 'taggers/',
+      :parser => 'grammar/',
+      :ner => 'classifiers/',
+      :dcoref => 'dcoref/'
+    }
+    # Default models for all languages.
+    Models = {
+      :pos => {
+        :english => 'english-left3words-distsim.tagger',
+        :german => 'german-fast.tagger',
+        :french  => 'french.tagger',
+        :arabic => 'arabic-fast.tagger',
+        :chinese  => 'chinese.tagger',
+        :xinhua   => nil
+      },
+      :parser => {
+        :english => 'englishPCFG.ser.gz',
+        :german => 'germanPCFG.ser.gz',
+        :french  => 'frenchFactored.ser.gz',
+        :arabic => 'arabicFactored.ser.gz',
+        :chinese  => 'chinesePCFG.ser.gz',
+        :xinhua   => 'xinhuaPCFG.ser.gz'
+      },
+      :ner => {
+        :english => {
+          '3class' => 'all.3class.distsim.crf.ser.gz',
+          '7class' => 'muc.7class.distsim.crf.ser.gz',
+          'MISCclass' => 'conll.4class.distsim.crf.ser.gz'
+        },
+        :german => {},
+        :french  => {},
+        :arabic => {},
+        :chinese  => {},
+        :xinhua   => {}
+      },
+      :dcoref => {
+        :english => {
+          'demonym' => 'demonyms.txt',
+          'animate' => 'animate.unigrams.txt',
+          'female' => 'female.unigrams.txt',
+          'inanimate' => 'inanimate.unigrams.txt',
+          'male' => 'male.unigrams.txt',
+          'neutral' => 'neutral.unigrams.txt',
+          'plural' => 'plural.unigrams.txt',
+          'singular' => 'singular.unigrams.txt',
+          'states' => 'state-abbreviations.txt',
+          'countries' => 'unknown.txt',          # Fix - can somebody provide this file?
+          'states.provinces' => 'unknown.txt',   # Fix - can somebody provide this file?
+          'extra.gender' => 'namegender.combine.txt'
+        },
+        :german => {},
+        :french  => {},
+        :arabic => {},
+        :chinese  => {},
+        :xinhua   => {}
+      }
+      # Models to add.
+      #"truecase.model" - path towards the true-casing model; default: StanfordCoreNLPModels/truecase/noUN.ser.gz
+      #"truecase.bias" - class bias of the true case model; default: INIT_UPPER:-0.7,UPPER:-0.7,O:0
+      #"truecase.mixedcasefile" - path towards the mixed case file; default: StanfordCoreNLPModels/truecase/MixDisambiguation.list
+      #"nfl.gazetteer" - path towards the gazetteer for the NFL domain
+      #"nfl.relation.model" - path towards the NFL relation extraction model
+    }
+    # List of annotations by JAVA class path.
+    Annotations = {
+      'nlp.trees.international.pennchinese.ChineseGrammaticalRelations' => [
+        'AdjectivalModifierGRAnnotation',
+        'AdverbialModifierGRAnnotation',
+        'ArgumentGRAnnotation',
+        'AspectMarkerGRAnnotation',
+        'AssociativeMarkerGRAnnotation',
+        'AssociativeModifierGRAnnotation',
+        'AttributiveGRAnnotation',
+        'AuxModifierGRAnnotation',
+        'AuxPassiveGRAnnotation',
+        'BaGRAnnotation',
+        'ClausalComplementGRAnnotation',
+        'ClausalSubjectGRAnnotation',
+        'ClauseModifierGRAnnotation',
+        'ComplementGRAnnotation',
+        'ComplementizerGRAnnotation',
+        'ControllingSubjectGRAnnotation',
+        'CoordinationGRAnnotation',
+        'DeterminerGRAnnotation',
+        'DirectObjectGRAnnotation',
+        'DvpMarkerGRAnnotation',
+        'DvpModifierGRAnnotation',
+        'EtcGRAnnotation',
+        'LocalizerComplementGRAnnotation',
+        'ModalGRAnnotation',
+        'ModifierGRAnnotation',
+        'NegationModifierGRAnnotation',
+        'NominalPassiveSubjectGRAnnotation',
+        'NominalSubjectGRAnnotation',
+        'NounCompoundModifierGRAnnotation',
+        'NumberModifierGRAnnotation',
+        'NumericModifierGRAnnotation',
+        'ObjectGRAnnotation',
+        'OrdNumberGRAnnotation',
+        'ParentheticalGRAnnotation',
+        'ParticipialModifierGRAnnotation',
+        'PreconjunctGRAnnotation',
+        'PrepositionalLocalizerModifierGRAnnotation',
+        'PrepositionalModifierGRAnnotation',
+        'PrepositionalObjectGRAnnotation',
+        'PunctuationGRAnnotation',
+        'RangeGRAnnotation',
+        'RelativeClauseModifierGRAnnotation',
+        'ResultativeComplementGRAnnotation',
+        'SemanticDependentGRAnnotation',
+        'SubjectGRAnnotation',
+        'TemporalClauseGRAnnotation',
+        'TemporalGRAnnotation',
+        'TimePostpositionGRAnnotation',
+        'TopicGRAnnotation',
+        'VerbCompoundGRAnnotation',
+        'VerbModifierGRAnnotation',
+        'XClausalComplementGRAnnotation'
+      ],
+      'nlp.dcoref.CoNLL2011DocumentReader' => [
+        'CorefMentionAnnotation',
+        'NamedEntityAnnotation'
+      ],
+      'nlp.ling.CoreAnnotations' => [
+        'AbbrAnnotation',
+        'AbgeneAnnotation',
+        'AbstrAnnotation',
+        'AfterAnnotation',
+        'AnswerAnnotation',
+        'AnswerObjectAnnotation',
+        'AntecedentAnnotation',
+        'ArgDescendentAnnotation',
+        'ArgumentAnnotation',
+        'BagOfWordsAnnotation',
+        'BeAnnotation',
+        'BeforeAnnotation',
+        'BeginIndexAnnotation',
+        'BestCliquesAnnotation',
+        'BestFullAnnotation',
+        'CalendarAnnotation',
+        'CategoryAnnotation',
+        'CategoryFunctionalTagAnnotation',
+        'CharacterOffsetBeginAnnotation',
+        'CharacterOffsetEndAnnotation',
+        'CharAnnotation',
+        'ChineseCharAnnotation',
+        'ChineseIsSegmentedAnnotation',
+        'ChineseOrigSegAnnotation',
+        'ChineseSegAnnotation',
+        'ChunkAnnotation',
+        'CoarseTagAnnotation',
+        'CommonWordsAnnotation',
+        'CoNLLDepAnnotation',
+        'CoNLLDepParentIndexAnnotation',
+        'CoNLLDepTypeAnnotation',
+        'CoNLLPredicateAnnotation',
+        'CoNLLSRLAnnotation',
+        'ContextsAnnotation',
+        'CopyAnnotation',
+        'CostMagnificationAnnotation',
+        'CovertIDAnnotation',
+        'D2_LBeginAnnotation',
+        'D2_LEndAnnotation',
+        'D2_LMiddleAnnotation',
+        'DayAnnotation',
+        'DependentsAnnotation',
+        'DictAnnotation',
+        'DistSimAnnotation',
+        'DoAnnotation',
+        'DocDateAnnotation',
+        'DocIDAnnotation',
+        'DomainAnnotation',
+        'EndIndexAnnotation',
+        'EntityClassAnnotation',
+        'EntityRuleAnnotation',
+        'EntityTypeAnnotation',
+        'FeaturesAnnotation',
+        'FemaleGazAnnotation',
+        'FirstChildAnnotation',
+        'ForcedSentenceEndAnnotation',
+        'FreqAnnotation',
+        'GazAnnotation',
+        'GazetteerAnnotation',
+        'GenericTokensAnnotation',
+        'GeniaAnnotation',
+        'GoldAnswerAnnotation',
+        'GovernorAnnotation',
+        'GrandparentAnnotation',
+        'HaveAnnotation',
+        'HeadWordStringAnnotation',
+        'HeightAnnotation',
+        'IDAnnotation',
+        'IDFAnnotation',
+        'INAnnotation',
+        'IndexAnnotation',
+        'InterpretationAnnotation',
+        'IsDateRangeAnnotation',
+        'IsURLAnnotation',
+        'LabelAnnotation',
+        'LastGazAnnotation',
+        'LastTaggedAnnotation',
+        'LBeginAnnotation',
+        'LeftChildrenNodeAnnotation',
+        'LeftTermAnnotation',
+        'LemmaAnnotation',
+        'LEndAnnotation',
+        'LengthAnnotation',
+        'LMiddleAnnotation',
+        'MaleGazAnnotation',
+        'MarkingAnnotation',
+        'MonthAnnotation',
+        'MorphoCaseAnnotation',
+        'MorphoGenAnnotation',
+        'MorphoNumAnnotation',
+        'MorphoPersAnnotation',
+        'NamedEntityTagAnnotation',
+        'NeighborsAnnotation',
+        'NERIDAnnotation',
+        'NormalizedNamedEntityTagAnnotation',
+        'NotAnnotation',
+        'NumericCompositeObjectAnnotation',
+        'NumericCompositeTypeAnnotation',
+        'NumericCompositeValueAnnotation',
+        'NumericObjectAnnotation',
+        'NumericTypeAnnotation',
+        'NumericValueAnnotation',
+        'NumerizedTokensAnnotation',
+        'NumTxtSentencesAnnotation',
+        'OriginalAnswerAnnotation',
+        'OriginalCharAnnotation',
+        'OriginalTextAnnotation',
+        'ParagraphAnnotation',
+        'ParagraphsAnnotation',
+        'ParaPositionAnnotation',
+        'ParentAnnotation',
+        'PartOfSpeechAnnotation',
+        'PercentAnnotation',
+        'PhraseWordsAnnotation',
+        'PhraseWordsTagAnnotation',
+        'PolarityAnnotation',
+        'PositionAnnotation',
+        'PossibleAnswersAnnotation',
+        'PredictedAnswerAnnotation',
+        'PrevChildAnnotation',
+        'PriorAnnotation',
+        'ProjectedCategoryAnnotation',
+        'ProtoAnnotation',
+        'RoleAnnotation',
+        'SectionAnnotation',
+        'SemanticHeadTagAnnotation',
+        'SemanticHeadWordAnnotation',
+        'SemanticTagAnnotation',
+        'SemanticWordAnnotation',
+        'SentenceIDAnnotation',
+        'SentenceIndexAnnotation',
+        'SentencePositionAnnotation',
+        'SentencesAnnotation',
+        'ShapeAnnotation',
+        'SpaceBeforeAnnotation',
+        'SpanAnnotation',
+        'SpeakerAnnotation',
+        'SRL_ID',
+        'SRLIDAnnotation',
+        'SRLInstancesAnnotation',
+        'StackedNamedEntityTagAnnotation',
+        'StateAnnotation',
+        'StemAnnotation',
+        'SubcategorizationAnnotation',
+        'TagLabelAnnotation',
+        'TextAnnotation',
+        'TokenBeginAnnotation',
+        'TokenEndAnnotation',
+        'TokensAnnotation',
+        'TopicAnnotation',
+        'TrueCaseAnnotation',
+        'TrueCaseTextAnnotation',
+        'TrueTagAnnotation',
+        'UBlockAnnotation',
+        'UnaryAnnotation',
+        'UnknownAnnotation',
+        'UtteranceAnnotation',
+        'UTypeAnnotation',
+        'ValueAnnotation',
+        'VerbSenseAnnotation',
+        'WebAnnotation',
+        'WordFormAnnotation',
+        'WordnetSynAnnotation',
+        'WordPositionAnnotation',
+        'WordSenseAnnotation',
+        'XmlContextAnnotation',
+        'XmlElementAnnotation',
+        'YearAnnotation'
+      ],
+      'nlp.dcoref.CorefCoreAnnotations' => [
+        'CorefAnnotation',
+        'CorefChainAnnotation',
+        'CorefClusterAnnotation',
+        'CorefClusterIdAnnotation',
+        'CorefDestAnnotation',
+        'CorefGraphAnnotation'
+      ],
+      'nlp.ling.CoreLabel' => [
+        'GenericAnnotation'
+      ],
+      'nlp.trees.EnglishGrammaticalRelations' => [
+        'AbbreviationModifierGRAnnotation',
+        'AdjectivalComplementGRAnnotation',
+        'AdjectivalModifierGRAnnotation',
+        'AdvClauseModifierGRAnnotation',
+        'AdverbialModifierGRAnnotation',
+        'AgentGRAnnotation',
+        'AppositionalModifierGRAnnotation',
+        'ArgumentGRAnnotation',
+        'AttributiveGRAnnotation',
+        'AuxModifierGRAnnotation',
+        'AuxPassiveGRAnnotation',
+        'ClausalComplementGRAnnotation',
+        'ClausalPassiveSubjectGRAnnotation',
+        'ClausalSubjectGRAnnotation',
+        'ComplementGRAnnotation',
+        'ComplementizerGRAnnotation',
+        'ConjunctGRAnnotation',
+        'ControllingSubjectGRAnnotation',
+        'CoordinationGRAnnotation',
+        'CopulaGRAnnotation',
+        'DeterminerGRAnnotation',
+        'DirectObjectGRAnnotation',
+        'ExpletiveGRAnnotation',
+        'IndirectObjectGRAnnotation',
+        'InfinitivalModifierGRAnnotation',
+        'MarkerGRAnnotation',
+        'ModifierGRAnnotation',
+        'MultiWordExpressionGRAnnotation',
+        'NegationModifierGRAnnotation',
+        'NominalPassiveSubjectGRAnnotation',
+        'NominalSubjectGRAnnotation',
+        'NounCompoundModifierGRAnnotation',
+        'NpAdverbialModifierGRAnnotation',
+        'NumberModifierGRAnnotation',
+        'NumericModifierGRAnnotation',
+        'ObjectGRAnnotation',
+        'ParataxisGRAnnotation',
+        'ParticipialModifierGRAnnotation',
+        'PhrasalVerbParticleGRAnnotation',
+        'PossessionModifierGRAnnotation',
+        'PossessiveModifierGRAnnotation',
+        'PreconjunctGRAnnotation',
+        'PredeterminerGRAnnotation',
+        'PredicateGRAnnotation',
+        'PrepositionalComplementGRAnnotation',
+        'PrepositionalModifierGRAnnotation',
+        'PrepositionalObjectGRAnnotation',
+        'PunctuationGRAnnotation',
+        'PurposeClauseModifierGRAnnotation',
+        'QuantifierModifierGRAnnotation',
+        'ReferentGRAnnotation',
+        'RelativeClauseModifierGRAnnotation',
+        'RelativeGRAnnotation',
+        'SemanticDependentGRAnnotation',
+        'SubjectGRAnnotation',
+        'TemporalModifierGRAnnotation',
+        'XClausalComplementGRAnnotation'
+      ],
+      'nlp.trees.GrammaticalRelation' => [
+        'DependentGRAnnotation',
+        'GovernorGRAnnotation',
+        'GrammaticalRelationAnnotation',
+        'KillGRAnnotation',
+        'Language',
+        'RootGRAnnotation'
+      ],
+      'nlp.ie.machinereading.structure.MachineReadingAnnotations' => [
+        'DependencyAnnotation',
+        'DocumentDirectoryAnnotation',
+        'DocumentIdAnnotation',
+        'EntityMentionsAnnotation',
+        'EventMentionsAnnotation',
+        'GenderAnnotation',
+        'RelationMentionsAnnotation',
+        'TriggerAnnotation'
+      ],
+      'nlp.parser.lexparser.ParserAnnotations' => [
+        'ConstraintAnnotation'
+      ],
+      'nlp.trees.semgraph.SemanticGraphCoreAnnotations' => [
+        'BasicDependenciesAnnotation',
+        'CollapsedCCProcessedDependenciesAnnotation',
+        'CollapsedDependenciesAnnotation'
+      ],
+      'nlp.time.TimeAnnotations' => [
+        'TimexAnnotation',
+        'TimexAnnotations'
+      ],
+      'nlp.time.TimeExpression' => [
+        'Annotation',
+        'ChildrenAnnotation'
+      ],
+      'nlp.trees.TreeCoreAnnotations' => [
+        'TreeHeadTagAnnotation',
+        'TreeHeadWordAnnotation',
+        'TreeAnnotation'
+      ]
+    }
+    # Create a list of annotation names => paths.
+    annotations_by_name = {}
+    Annotations.each do |base_class, annotation_classes|
+      annotation_classes.each do |annotation_class|
+        annotations_by_name[annotation_class] ||= []
+        annotations_by_name[annotation_class] << base_class
+      end
+    end
+    # Hash of name => path.
+    AnnotationsByName = annotations_by_name
+  end
+end

data/lib/stanford-core-nlp/java_wrapper.rb CHANGED Viewed

@@ -18,5 +18,32 @@ module StanfordCoreNLP
       end
     end
+    # Dynamically defined on all proxied annotation classes.
+    # Get an annotation using the annotation bridge.
+    def get(annotation, anno_base = nil)
+      if !java_methods.include?('get(Ljava.lang.Class;)')
+        raise'No annotation can be retrieved on this object.'
+      else
+        anno_class = "#{StanfordCoreNLP.camel_case(annotation)}Annotation"
+        if anno_base
+          raise "The path #{anno_base} doesn't exist." unless Annotations[anno_base]
+          anno_bases = [anno_base]
+        else
+          anno_bases = Config::AnnotationsByName[anno_class]
+          raise "The annotation #{anno_class} doesn't exist." unless anno_bases
+        end
+        if anno_bases.size > 1
+          msg = "There are many different annotations bearing the name #{anno_class}. "
+          msg << "Please specify one of the following base classes as second parameter to disambiguate: "
+          msg << anno_bases.join(',')
+          raise msg
+        else
+          base_class = anno_bases[0]
+        end
+        url = "edu.stanford.#{base_class}$#{anno_class}"
+        AnnotationBridge.getAnnotation(self, url)
+      end
+    end
   end
 end

metadata CHANGED Viewed

@@ -1,7 +1,7 @@
 --- !ruby/object:Gem::Specification
 name: stanford-core-nlp
 version: !ruby/object:Gem::Version
-  version: 0.1.4
+  version: 0.1.5
   prerelease:
 platform: ruby
 authors:
@@ -9,11 +9,11 @@ authors:
 autorequire:
 bindir: bin
 cert_chain: []
-date: 2012-01-31 00:00:00.000000000 Z
+date: 2012-02-04 00:00:00.000000000 Z
 dependencies:
 - !ruby/object:Gem::Dependency
   name: rjb
-  requirement: &70226234873780 !ruby/object:Gem::Requirement
+  requirement: &70191057037760 !ruby/object:Gem::Requirement
     none: false
     requirements:
     - - ! '>='
@@ -21,7 +21,7 @@ dependencies:
         version: '0'
   type: :runtime
   prerelease: false
-  version_requirements: *70226234873780
+  version_requirements: *70191057037760
 description: ! " High-level Ruby bindings to the Stanford CoreNLP package, a set natural
   language processing \ntools for English, including tokenization, part-of-speech
   tagging, lemmatization, named entity recognition,\nparsing, and coreference resolution. "
@@ -31,9 +31,9 @@ executables: []
 extensions: []
 extra_rdoc_files: []
 files:
+- lib/stanford-core-nlp/config.rb
 - lib/stanford-core-nlp/jar_loader.rb
 - lib/stanford-core-nlp/java_wrapper.rb
-- lib/stanford-core-nlp/stanford_annotations.rb
 - lib/stanford-core-nlp.rb
 - bin/bridge.jar
 - bin/INFO

data/lib/stanford-core-nlp/stanford_annotations.rb DELETED Viewed

@@ -1,401 +0,0 @@
-module StanfordCoreNLP
-  # @private
-  Annotations = {
-   'nlp.trees.international.pennchinese.ChineseGrammaticalRelations' => [
-     'AdjectivalModifierGRAnnotation',
-     'AdverbialModifierGRAnnotation',
-     'ArgumentGRAnnotation',
-     'AspectMarkerGRAnnotation',
-     'AssociativeMarkerGRAnnotation',
-     'AssociativeModifierGRAnnotation',
-     'AttributiveGRAnnotation',
-     'AuxModifierGRAnnotation',
-     'AuxPassiveGRAnnotation',
-     'BaGRAnnotation',
-     'ClausalComplementGRAnnotation',
-     'ClausalSubjectGRAnnotation',
-     'ClauseModifierGRAnnotation',
-     'ComplementGRAnnotation',
-     'ComplementizerGRAnnotation',
-     'ControllingSubjectGRAnnotation',
-     'CoordinationGRAnnotation',
-     'DeterminerGRAnnotation',
-     'DirectObjectGRAnnotation',
-     'DvpMarkerGRAnnotation',
-     'DvpModifierGRAnnotation',
-     'EtcGRAnnotation',
-     'LocalizerComplementGRAnnotation',
-     'ModalGRAnnotation',
-     'ModifierGRAnnotation',
-     'NegationModifierGRAnnotation',
-     'NominalPassiveSubjectGRAnnotation',
-     'NominalSubjectGRAnnotation',
-     'NounCompoundModifierGRAnnotation',
-     'NumberModifierGRAnnotation',
-     'NumericModifierGRAnnotation',
-     'ObjectGRAnnotation',
-     'OrdNumberGRAnnotation',
-     'ParentheticalGRAnnotation',
-     'ParticipialModifierGRAnnotation',
-     'PreconjunctGRAnnotation',
-     'PrepositionalLocalizerModifierGRAnnotation',
-     'PrepositionalModifierGRAnnotation',
-     'PrepositionalObjectGRAnnotation',
-     'PunctuationGRAnnotation',
-     'RangeGRAnnotation',
-     'RelativeClauseModifierGRAnnotation',
-     'ResultativeComplementGRAnnotation',
-     'SemanticDependentGRAnnotation',
-     'SubjectGRAnnotation',
-     'TemporalClauseGRAnnotation',
-     'TemporalGRAnnotation',
-     'TimePostpositionGRAnnotation',
-     'TopicGRAnnotation',
-     'VerbCompoundGRAnnotation',
-     'VerbModifierGRAnnotation',
-     'XClausalComplementGRAnnotation'
-    ],
-   'nlp.dcoref.CoNLL2011DocumentReader' => [
-     'CorefMentionAnnotation',
-     'NamedEntityAnnotation'
-    ],
-   'nlp.ling.CoreAnnotations' => [
-     'AbbrAnnotation',
-     'AbgeneAnnotation',
-     'AbstrAnnotation',
-     'AfterAnnotation',
-     'AnswerAnnotation',
-     'AnswerObjectAnnotation',
-     'AntecedentAnnotation',
-     'ArgDescendentAnnotation',
-     'ArgumentAnnotation',
-     'BagOfWordsAnnotation',
-     'BeAnnotation',
-     'BeforeAnnotation',
-     'BeginIndexAnnotation',
-     'BestCliquesAnnotation',
-     'BestFullAnnotation',
-     'CalendarAnnotation',
-     'CategoryAnnotation',
-     'CategoryFunctionalTagAnnotation',
-     'CharacterOffsetBeginAnnotation',
-     'CharacterOffsetEndAnnotation',
-     'CharAnnotation',
-     'ChineseCharAnnotation',
-     'ChineseIsSegmentedAnnotation',
-     'ChineseOrigSegAnnotation',
-     'ChineseSegAnnotation',
-     'ChunkAnnotation',
-     'CoarseTagAnnotation',
-     'CommonWordsAnnotation',
-     'CoNLLDepAnnotation',
-     'CoNLLDepParentIndexAnnotation',
-     'CoNLLDepTypeAnnotation',
-     'CoNLLPredicateAnnotation',
-     'CoNLLSRLAnnotation',
-     'ContextsAnnotation',
-     'CopyAnnotation',
-     'CostMagnificationAnnotation',
-     'CovertIDAnnotation',
-     'D2_LBeginAnnotation',
-     'D2_LEndAnnotation',
-     'D2_LMiddleAnnotation',
-     'DayAnnotation',
-     'DependentsAnnotation',
-     'DictAnnotation',
-     'DistSimAnnotation',
-     'DoAnnotation',
-     'DocDateAnnotation',
-     'DocIDAnnotation',
-     'DomainAnnotation',
-     'EndIndexAnnotation',
-     'EntityClassAnnotation',
-     'EntityRuleAnnotation',
-     'EntityTypeAnnotation',
-     'FeaturesAnnotation',
-     'FemaleGazAnnotation',
-     'FirstChildAnnotation',
-     'ForcedSentenceEndAnnotation',
-     'FreqAnnotation',
-     'GazAnnotation',
-     'GazetteerAnnotation',
-     'GenericTokensAnnotation',
-     'GeniaAnnotation',
-     'GoldAnswerAnnotation',
-     'GovernorAnnotation',
-     'GrandparentAnnotation',
-     'HaveAnnotation',
-     'HeadWordStringAnnotation',
-     'HeightAnnotation',
-     'IDAnnotation',
-     'IDFAnnotation',
-     'INAnnotation',
-     'IndexAnnotation',
-     'InterpretationAnnotation',
-     'IsDateRangeAnnotation',
-     'IsURLAnnotation',
-     'LabelAnnotation',
-     'LastGazAnnotation',
-     'LastTaggedAnnotation',
-     'LBeginAnnotation',
-     'LeftChildrenNodeAnnotation',
-     'LeftTermAnnotation',
-     'LemmaAnnotation',
-     'LEndAnnotation',
-     'LengthAnnotation',
-     'LMiddleAnnotation',
-     'MaleGazAnnotation',
-     'MarkingAnnotation',
-     'MonthAnnotation',
-     'MorphoCaseAnnotation',
-     'MorphoGenAnnotation',
-     'MorphoNumAnnotation',
-     'MorphoPersAnnotation',
-     'NamedEntityTagAnnotation',
-     'NeighborsAnnotation',
-     'NERIDAnnotation',
-     'NormalizedNamedEntityTagAnnotation',
-     'NotAnnotation',
-     'NumericCompositeObjectAnnotation',
-     'NumericCompositeTypeAnnotation',
-     'NumericCompositeValueAnnotation',
-     'NumericObjectAnnotation',
-     'NumericTypeAnnotation',
-     'NumericValueAnnotation',
-     'NumerizedTokensAnnotation',
-     'NumTxtSentencesAnnotation',
-     'OriginalAnswerAnnotation',
-     'OriginalCharAnnotation',
-     'OriginalTextAnnotation',
-     'ParagraphAnnotation',
-     'ParagraphsAnnotation',
-     'ParaPositionAnnotation',
-     'ParentAnnotation',
-     'PartOfSpeechAnnotation',
-     'PercentAnnotation',
-     'PhraseWordsAnnotation',
-     'PhraseWordsTagAnnotation',
-     'PolarityAnnotation',
-     'PositionAnnotation',
-     'PossibleAnswersAnnotation',
-     'PredictedAnswerAnnotation',
-     'PrevChildAnnotation',
-     'PriorAnnotation',
-     'ProjectedCategoryAnnotation',
-     'ProtoAnnotation',
-     'RoleAnnotation',
-     'SectionAnnotation',
-     'SemanticHeadTagAnnotation',
-     'SemanticHeadWordAnnotation',
-     'SemanticTagAnnotation',
-     'SemanticWordAnnotation',
-     'SentenceIDAnnotation',
-     'SentenceIndexAnnotation',
-     'SentencePositionAnnotation',
-     'SentencesAnnotation',
-     'ShapeAnnotation',
-     'SpaceBeforeAnnotation',
-     'SpanAnnotation',
-     'SpeakerAnnotation',
-     'SRL_ID',
-     'SRLIDAnnotation',
-     'SRLInstancesAnnotation',
-     'StackedNamedEntityTagAnnotation',
-     'StateAnnotation',
-     'StemAnnotation',
-     'SubcategorizationAnnotation',
-     'TagLabelAnnotation',
-     'TextAnnotation',
-     'TokenBeginAnnotation',
-     'TokenEndAnnotation',
-     'TokensAnnotation',
-     'TopicAnnotation',
-     'TrueCaseAnnotation',
-     'TrueCaseTextAnnotation',
-     'TrueTagAnnotation',
-     'UBlockAnnotation',
-     'UnaryAnnotation',
-     'UnknownAnnotation',
-     'UtteranceAnnotation',
-     'UTypeAnnotation',
-     'ValueAnnotation',
-     'VerbSenseAnnotation',
-     'WebAnnotation',
-     'WordFormAnnotation',
-     'WordnetSynAnnotation',
-     'WordPositionAnnotation',
-     'WordSenseAnnotation',
-     'XmlContextAnnotation',
-     'XmlElementAnnotation',
-     'YearAnnotation'
-    ],
-   'nlp.dcoref.CorefCoreAnnotations' => [
-     'CorefAnnotation',
-     'CorefChainAnnotation',
-     'CorefClusterAnnotation',
-     'CorefClusterIdAnnotation',
-     'CorefDestAnnotation',
-     'CorefGraphAnnotation'
-    ],
-   'nlp.ling.CoreLabel' => [
-     'GenericAnnotation'
-    ],
-   'nlp.trees.EnglishGrammaticalRelations' => [
-     'AbbreviationModifierGRAnnotation',
-     'AdjectivalComplementGRAnnotation',
-     'AdjectivalModifierGRAnnotation',
-     'AdvClauseModifierGRAnnotation',
-     'AdverbialModifierGRAnnotation',
-     'AgentGRAnnotation',
-     'AppositionalModifierGRAnnotation',
-     'ArgumentGRAnnotation',
-     'AttributiveGRAnnotation',
-     'AuxModifierGRAnnotation',
-     'AuxPassiveGRAnnotation',
-     'ClausalComplementGRAnnotation',
-     'ClausalPassiveSubjectGRAnnotation',
-     'ClausalSubjectGRAnnotation',
-     'ComplementGRAnnotation',
-     'ComplementizerGRAnnotation',
-     'ConjunctGRAnnotation',
-     'ControllingSubjectGRAnnotation',
-     'CoordinationGRAnnotation',
-     'CopulaGRAnnotation',
-     'DeterminerGRAnnotation',
-     'DirectObjectGRAnnotation',
-     'ExpletiveGRAnnotation',
-     'IndirectObjectGRAnnotation',
-     'InfinitivalModifierGRAnnotation',
-     'MarkerGRAnnotation',
-     'ModifierGRAnnotation',
-     'MultiWordExpressionGRAnnotation',
-     'NegationModifierGRAnnotation',
-     'NominalPassiveSubjectGRAnnotation',
-     'NominalSubjectGRAnnotation',
-     'NounCompoundModifierGRAnnotation',
-     'NpAdverbialModifierGRAnnotation',
-     'NumberModifierGRAnnotation',
-     'NumericModifierGRAnnotation',
-     'ObjectGRAnnotation',
-     'ParataxisGRAnnotation',
-     'ParticipialModifierGRAnnotation',
-     'PhrasalVerbParticleGRAnnotation',
-     'PossessionModifierGRAnnotation',
-     'PossessiveModifierGRAnnotation',
-     'PreconjunctGRAnnotation',
-     'PredeterminerGRAnnotation',
-     'PredicateGRAnnotation',
-     'PrepositionalComplementGRAnnotation',
-     'PrepositionalModifierGRAnnotation',
-     'PrepositionalObjectGRAnnotation',
-     'PunctuationGRAnnotation',
-     'PurposeClauseModifierGRAnnotation',
-     'QuantifierModifierGRAnnotation',
-     'ReferentGRAnnotation',
-     'RelativeClauseModifierGRAnnotation',
-     'RelativeGRAnnotation',
-     'SemanticDependentGRAnnotation',
-     'SubjectGRAnnotation',
-     'TemporalModifierGRAnnotation',
-     'XClausalComplementGRAnnotation'
-    ],
-   'nlp.trees.GrammaticalRelation' => [
-     'DependentGRAnnotation',
-     'GovernorGRAnnotation',
-     'GrammaticalRelationAnnotation',
-     'KillGRAnnotation',
-     'Language',
-     'RootGRAnnotation'
-    ],
-   'nlp.ie.machinereading.structure.MachineReadingAnnotations' => [
-     'DependencyAnnotation',
-     'DocumentDirectoryAnnotation',
-     'DocumentIdAnnotation',
-     'EntityMentionsAnnotation',
-     'EventMentionsAnnotation',
-     'GenderAnnotation',
-     'RelationMentionsAnnotation',
-     'TriggerAnnotation'
-    ],
-   'nlp.parser.lexparser.ParserAnnotations' => [
-     'ConstraintAnnotation'
-    ],
-   'nlp.trees.semgraph.SemanticGraphCoreAnnotations' => [
-     'BasicDependenciesAnnotation',
-     'CollapsedCCProcessedDependenciesAnnotation',
-     'CollapsedDependenciesAnnotation'
-    ],
-   'nlp.time.TimeAnnotations' => [
-     'TimexAnnotation',
-     'TimexAnnotations'
-    ],
-   'nlp.time.TimeExpression' => [
-     'Annotation',
-     'ChildrenAnnotation'
-    ],
-   'nlp.trees.TreeCoreAnnotations' => [
-     'TreeHeadTagAnnotation',
-     'TreeHeadWordAnnotation',
-     'TreeAnnotation'
-    ]
-  }
-  annotations_by_name = {}
-  Annotations.each do |base_class, annotation_classes|
-    annotation_classes.each do |annotation_class|
-      annotations_by_name[annotation_class] ||= []
-      annotations_by_name[annotation_class] << base_class
-    end
-  end
-  AnnotationsByName = annotations_by_name
-  # Modify the Rjb JavaProxy class to add our own method to get annotations.
-  Rjb::Rjb_JavaProxy.class_eval do
-    # Dynamically defined on all proxied annotation classes.
-    # Get an annotation using the annotation bridge.
-    def get(annotation, anno_base = nil)
-      if !java_methods.include?('get(Ljava.lang.Class;)')
-        raise'No annotation can be retrieved on this object.'
-      else
-        anno_class = "#{StanfordCoreNLP.camel_case(annotation)}Annotation"
-        if anno_base
-          raise "The path #{anno_base} doesn't exist." unless Annotations[anno_base]
-           anno_bases = [anno_base]
-        else
-          anno_bases = AnnotationsByName[anno_class]
-          raise "The annotation #{anno_class} doesn't exist." unless anno_bases
-        end
-        if anno_bases.size > 1
-          msg = "There are many different annotations bearing the name #{anno_class}. "
-          msg << "Please specify one of the following base classes as second parameter to disambiguate: "
-          msg << anno_bases.join(',')
-          raise msg
-        else
-          base_class = anno_bases[0]
-        end
-        url = "edu.stanford.#{base_class}$#{anno_class}"
-        AnnotationBridge.getAnnotation(self, url)
-      end
-    end
-  end
-end