informers 1.0.3 → 1.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
checksums.yaml CHANGED
@@ -1,7 +1,7 @@
1
1
  ---
2
2
  SHA256:
3
- metadata.gz: f5340da0bce9d55a0339fac6b8806f09119df3e89567ecb37a77e1a5921b8fa2
4
- data.tar.gz: 66a9d275cb2999ad14ba1cfd900bdcbf9fdc3d26ce29387acdd74452bf2050ef
3
+ metadata.gz: ab4f19adb4d6ca0289784cee6c6cb5235b73a5184abffbeaf44391768be1f0ac
4
+ data.tar.gz: '0880ce4dced5ce47ceaaa5fee8d10e6324b3fc0a23e05c3da3728414dcc273d9'
5
5
  SHA512:
6
- metadata.gz: a4a0c3da3d8a3555a6f2debca8f2939b6536ac76386cdd6c7264890b2d00842d537ecfca352021fa349ff9c4636ba49c189f652a66676746d9ec2a8d97eecc2a
7
- data.tar.gz: a06aa115b5966fd1b8da7a80d8481d3e61778f31c3bb0da143f329e81ae3f73d4a1d1b2ee01672f4e90742a35d68a23dd5c871c3b68ffad0c16d8e5de480a60f
6
+ metadata.gz: eb3ee6d16e4e20eca6fae3fae8f97d78ba6bb655d48e2012640d64538785e2a9ff2afb10269cf01db928553438e8fbd08584774ba3f3d08bc25f36cbb971a99a
7
+ data.tar.gz: '0008441293f2605ec8599135d715093053e21f67f56ba59b730a3bc1f46f04f4a7fabb7fef039f156cd4183011c93b7fc9cab6ba731bf78627244bc4dedcf18d'
data/CHANGELOG.md CHANGED
@@ -1,3 +1,7 @@
1
+ ## 1.1.0 (2024-09-17)
2
+
3
+ - Added more pipelines
4
+
1
5
  ## 1.0.3 (2024-08-29)
2
6
 
3
7
  - Added `model_output` option
data/README.md CHANGED
@@ -240,6 +240,12 @@ The model must include a `.onnx` file ([example](https://huggingface.co/Xenova/a
240
240
 
241
241
  ## Pipelines
242
242
 
243
+ - [Text](#text)
244
+ - [Vision](#vision)
245
+ - [Multimodal](#multimodal)
246
+
247
+ ### Text
248
+
243
249
  Embedding
244
250
 
245
251
  ```ruby
@@ -275,6 +281,48 @@ qa = Informers.pipeline("question-answering")
275
281
  qa.("Who invented Ruby?", "Ruby is a programming language created by Matz")
276
282
  ```
277
283
 
284
+ Zero-shot classification
285
+
286
+ ```ruby
287
+ classifier = Informers.pipeline("zero-shot-classification")
288
+ classifier.("text", ["label1", "label2", "label3"])
289
+ ```
290
+
291
+ Text generation
292
+
293
+ ```ruby
294
+ generator = Informers.pipeline("text-generation")
295
+ generator.("I enjoy walking with my cute dog,")
296
+ ```
297
+
298
+ Text-to-text generation
299
+
300
+ ```ruby
301
+ text2text = Informers.pipeline("text2text-generation")
302
+ text2text.("translate from English to French: I'm very happy")
303
+ ```
304
+
305
+ Translation
306
+
307
+ ```ruby
308
+ translator = Informers.pipeline("translation", "Xenova/nllb-200-distilled-600M")
309
+ translator.("जीवन एक चॉकलेट बॉक्स की तरह है।", src_lang: "hin_Deva", tgt_lang: "fra_Latn")
310
+ ```
311
+
312
+ Summarization
313
+
314
+ ```ruby
315
+ summarizer = Informers.pipeline("summarization")
316
+ summarizer.("Many paragraphs of text")
317
+ ```
318
+
319
+ Fill mask
320
+
321
+ ```ruby
322
+ unmasker = Informers.pipeline("fill-mask")
323
+ unmasker.("Paris is the [MASK] of France.")
324
+ ```
325
+
278
326
  Feature extraction
279
327
 
280
328
  ```ruby
@@ -282,6 +330,80 @@ extractor = Informers.pipeline("feature-extraction")
282
330
  extractor.("We are very happy to show you the 🤗 Transformers library.")
283
331
  ```
284
332
 
333
+ ### Vision
334
+
335
+ Image classification
336
+
337
+ ```ruby
338
+ classifier = Informers.pipeline("image-classification")
339
+ classifier.("image.jpg")
340
+ ```
341
+
342
+ Zero-shot image classification
343
+
344
+ ```ruby
345
+ classifier = Informers.pipeline("zero-shot-image-classification")
346
+ classifier.("image.jpg", ["label1", "label2", "label3"])
347
+ ```
348
+
349
+ Image segmentation
350
+
351
+ ```ruby
352
+ segmenter = Informers.pipeline("image-segmentation")
353
+ segmenter.("image.jpg")
354
+ ```
355
+
356
+ Object detection
357
+
358
+ ```ruby
359
+ detector = Informers.pipeline("object-detection")
360
+ detector.("image.jpg")
361
+ ```
362
+
363
+ Zero-shot object detection
364
+
365
+ ```ruby
366
+ detector = Informers.pipeline("zero-shot-object-detection")
367
+ detector.("image.jpg", ["label1", "label2", "label3"])
368
+ ```
369
+
370
+ Depth estimation
371
+
372
+ ```ruby
373
+ estimator = Informers.pipeline("depth-estimation")
374
+ estimator.("image.jpg")
375
+ ```
376
+
377
+ Image-to-image
378
+
379
+ ```ruby
380
+ upscaler = Informers.pipeline("image-to-image")
381
+ upscaler.("image.jpg")
382
+ ```
383
+
384
+ Image feature extraction
385
+
386
+ ```ruby
387
+ extractor = Informers.pipeline("image-feature-extraction")
388
+ extractor.("image.jpg")
389
+ ```
390
+
391
+ ### Multimodal
392
+
393
+ Image captioning
394
+
395
+ ```ruby
396
+ captioner = Informers.pipeline("image-to-text")
397
+ captioner.("image.jpg")
398
+ ```
399
+
400
+ Document question answering
401
+
402
+ ```ruby
403
+ qa = Informers.pipeline("document-question-answering")
404
+ qa.("image.jpg", "What is the invoice number?")
405
+ ```
406
+
285
407
  ## Credits
286
408
 
287
409
  This library was ported from [Transformers.js](https://github.com/xenova/transformers.js) and is available under the same license.
@@ -321,5 +443,6 @@ To get started with development:
321
443
  git clone https://github.com/ankane/informers.git
322
444
  cd informers
323
445
  bundle install
446
+ bundle exec rake download:files
324
447
  bundle exec rake test
325
448
  ```
@@ -1,17 +1,19 @@
1
1
  module Informers
2
2
  class PretrainedConfig
3
- attr_reader :model_type, :problem_type, :id2label
4
-
5
3
  def initialize(config_json)
6
- @is_encoder_decoder = false
7
-
8
- @model_type = config_json["model_type"]
9
- @problem_type = config_json["problem_type"]
10
- @id2label = config_json["id2label"]
4
+ @config_json = config_json.to_h
11
5
  end
12
6
 
13
7
  def [](key)
14
- instance_variable_get("@#{key}")
8
+ @config_json[key.to_s]
9
+ end
10
+
11
+ def []=(key, value)
12
+ @config_json[key.to_s] = value
13
+ end
14
+
15
+ def to_h
16
+ @config_json.to_h
15
17
  end
16
18
 
17
19
  def self.from_pretrained(
@@ -1,19 +1,12 @@
1
1
  module Informers
2
2
  class Model
3
3
  def initialize(model_id, quantized: false)
4
- @model_id = model_id
5
4
  @model = Informers.pipeline("embedding", model_id, quantized: quantized)
5
+ @options = model_id == "mixedbread-ai/mxbai-embed-large-v1" ? {pooling: "cls", normalize: false} : {}
6
6
  end
7
7
 
8
8
  def embed(texts)
9
- case @model_id
10
- when "sentence-transformers/all-MiniLM-L6-v2", "Xenova/all-MiniLM-L6-v2", "Xenova/multi-qa-MiniLM-L6-cos-v1", "Supabase/gte-small"
11
- @model.(texts)
12
- when "mixedbread-ai/mxbai-embed-large-v1"
13
- @model.(texts, pooling: "cls", normalize: false)
14
- else
15
- raise Error, "Use the embedding pipeline for this model: #{@model_id}"
16
- end
9
+ @model.(texts, **@options)
17
10
  end
18
11
  end
19
12
  end