RubyGems - scrapifier - Versions diffs - 0.0.5 → 0.0.6 - Mend

scrapifier 0.0.5 → 0.0.6


Files changed (5)

checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
 SHA1:
-  metadata.gz: e52f7f47695c7b16fed80a51f99a119d3f98a3b7
-  data.tar.gz: 24db438acd2fb2df421ab7b6c148e7cbed3331ee
+  metadata.gz: 66a555f6eaeccf042961100999adeeee7a4a084a
+  data.tar.gz: 0517ecc6004507c2c4a479c187d24d559d6cd2e0
 SHA512:
-  metadata.gz: 41e5f58a1760d61196ee3b4f429644025789d4432e7a1cef1a70d31473a26c12a8e38091527adacc78e7650759fc55cbdb80e365ce3f86954fcf493304f08d86
-  data.tar.gz: 248f23ec549f331a942d1847b9fc4d92935dea34993128fe5bd63fca9a4a4b5814ee531ef3c390e38502cd7f802c55a12dfeef6aa7eb434782d6f735a4d29e28
+  metadata.gz: 001a68b1afe7f2b88d7c8c6fd9155ef6754468b78bd66fc6be2c038e291e38ee7aac9a04e9d4f51ce87579a0590ec04b25dd07d6701b317ef37bb02b611b8a37
+  data.tar.gz: 7330af461a9585ecf840c054a7f96556036e1189af32198a86e2020cac90b9305f3596fdffc33a3d837a403129d31d085c424542502786da5c33f5e8fac13d70
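
The hashes above change because `metadata.gz` and `data.tar.gz` were rebuilt for 0.0.6. As a minimal sketch of how such a `checksums.yaml` could be checked against the archive members, the following compares recomputed digests with the recorded ones. The helper name `checksum_mismatches` and the `DIGESTS` table are hypothetical, and the sample bytes are stand-ins rather than the real gem contents.

```ruby
require 'digest/sha1'
require 'digest/sha2'

# Maps the algorithm names used in checksums.yaml to stdlib digest classes.
DIGESTS = { 'SHA1' => Digest::SHA1, 'SHA512' => Digest::SHA512 }.freeze

# Hypothetical helper: returns [algorithm, file name] pairs whose recomputed
# digest differs from the recorded one. An empty array means everything matched.
#   checksums: { 'SHA1' => { 'metadata.gz' => hex, ... }, 'SHA512' => { ... } }
#   files:     { 'metadata.gz' => raw bytes, 'data.tar.gz' => raw bytes }
def checksum_mismatches(checksums, files)
  mismatches = []
  checksums.each do |algo, entries|
    digest = DIGESTS.fetch(algo)
    entries.each do |name, expected|
      actual = digest.hexdigest(files.fetch(name))
      mismatches << [algo, name] unless actual == expected
    end
  end
  mismatches
end

# Stand-in bytes; a real check would read the members out of the .gem archive.
files = { 'metadata.gz' => 'stand-in metadata', 'data.tar.gz' => 'stand-in data' }
recorded = DIGESTS.each_with_object({}) do |(algo, digest), out|
  out[algo] = files.each_with_object({}) { |(name, bytes), h| h[name] = digest.hexdigest(bytes) }
end

puts checksum_mismatches(recorded, files).empty?
```

Returning the list of mismatches rather than a single boolean makes it easier to report which member and which algorithm disagreed.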

data/README.md CHANGED

@@ -1,94 +1,119 @@
-# Scrapifier
-[![Build Status](https://travis-ci.org/tiagopog/scrapifier.svg?branch=master)](https://travis-ci.org/tiagopog/scrapifier)
-[![Code Climate](https://codeclimate.com/github/tiagopog/scrapifier.png)](https://codeclimate.com/github/tiagopog/scrapifier)
-[![Dependency Status](https://gemnasium.com/tiagopog/scrapifier.svg)](https://gemnasium.com/tiagopog/scrapifier)
-[![Gem Version](https://badge.fury.io/rb/scrapifier.svg)](http://badge.fury.io/rb/scrapifier)
-It's a Ruby gem that brings a very simple way to extract meta information from URIs using the screen scraping technique.
-Note: This gem is mainly focused on screen scraping URLs (presence of protocol, such as: "http", "https" and "ftp"), but it also works with URIs which have the "www" without any protocol defined, like: "www.google.com".
-## Installation
-Compatible with Ruby 1.9.3+
-Add this line to your application's Gemfile:
-    gem 'scrapifier'
-And then execute:
-    $ bundle
-Or install it yourself as:
-    $ gem install scrapifier
-An then require the gem:
-    $ require 'scrapifier'
-## Usage
-The String#scrapify method finds URIs in a string and then gets their metadata, e.g., the page's title, description, images and URI. All the data is returned in a well-formatted hash.
-#### Default usage.
-``` ruby
-'Wow! What an awesome site: http://adtangerine.com!'.scrapify
-#=> {
-#   title:       "AdTangerine | Advertising Platform for Social Media",
-#   description: "AdTangerine is an advertising platform that uses the tangerine as a virtual currency for advertisers and publishers in order to share content on social networks.",
-#   images:      ["http://adtangerine.com/assets/logo_adt_og.png", "http://adtangerine.com/assets/logo_adt_og.png", "http://s3-us-west-2.amazonaws.com/adtangerine-prod/users/avatars/000/000/834/thumb/275747_1118382211_1929809351_n.jpg", "http://adtangerine.com/assets/foobar.gif"],
-#   uri:         "http://adtangerine.com"
-# }
-```
-#### Allow only certain image types.
-``` ruby
-'Wow! What an awesome site: http://adtangerine.com!'.scrapify(images: :jpg)
-#=> {
-#   title:       "AdTangerine | Advertising Platform for Social Media",
-#   description: "AdTangerine is an advertising platform that uses the tangerine as a virtual currency for advertisers and publishers in order to share content on social networks.",
-#   images:      ["http://s3-us-west-2.amazonaws.com/adtangerine-prod/users/avatars/000/000/834/thumb/275747_1118382211_1929809351_n.jpg"],
-#   uri:         "http://adtangerine.com"
-# }
-'Wow! What an awesome site: http://adtangerine.com!'.scrapify(images: [:png, :gif])
-#=> {
-#   title:       "AdTangerine | Advertising Platform for Social Media",
-#   description: "AdTangerine is an advertising platform that uses the tangerine as a virtual currency for advertisers and publishers in order to share content on social networks.",
-#   images:      ["http://adtangerine.com/assets/logo_adt_og.png", "http://adtangerine.com/assets/logo_adt_og.png", "http://adtangerine.com/assets/foobar.gif"],
-#   uri:         "http://adtangerine.com"
-# }
-```
-#### Choose which URI you want it to be scraped.
-``` ruby
-'Check out: http://adtangerine.com and www.twitflink.com'.scrapify(which: 1)
-#=> {
-#   title:       "TwitFlink | Find a link!",
-#   description: "TwitFlink is a very simple searching tool that allows people to find out links tweeted by any user from Twitter.",
-#   images:      ["http://www.twitflink.com//assets/tf_logo.png", "http://twitflink.com/assets/tf_logo.png"],
-#   uri:         "http://www.twitflink.com"
-# }
-'Check out: http://adtangerine.com and www.twitflink.com'.scrapify(which: 0, images: :gif)
-#=> {
-#   title:       "AdTangerine | Advertising Platform for Social Media",
-#   description: "AdTangerine is an advertising platform that uses the tangerine as a virtual currency for advertisers and publishers in order to share content on social networks.",
-#   images:      ["http://adtangerine.com/assets/foobar.gif"],
-#   uri:         "http://adtangerine.com"
-# }
-```
-## Contributing
-1. Fork it
-2. Create your feature branch (`git checkout -b my-new-feature`)
-3. Commit your changes (`git commit -am 'Add some feature'`)
-4. Push to the branch (`git push origin my-new-feature`)
-5. Create new Pull Request
+# Scrapifier
+[![Build Status](https://travis-ci.org/tiagopog/scrapifier.svg?branch=master)](https://travis-ci.org/tiagopog/scrapifier)
+[![Code Climate](https://codeclimate.com/github/tiagopog/scrapifier.png)](https://codeclimate.com/github/tiagopog/scrapifier)
+[![Dependency Status](https://gemnasium.com/tiagopog/scrapifier.svg)](https://gemnasium.com/tiagopog/scrapifier)
+[![Gem Version](https://badge.fury.io/rb/scrapifier.svg)](http://badge.fury.io/rb/scrapifier)
+It's a Ruby gem that brings a very simple way to extract meta information from URIs using the screen scraping technique.
+Note: This gem is mainly focused on screen scraping URLs (presence of protocol, such as: "http", "https" and "ftp"), but it also works with URIs which have the "www" without any protocol defined, like: "www.google.com".
+## Installation
+Compatible with Ruby 1.9.3+
+Add this line to your application's Gemfile:
+    gem 'scrapifier'
+And then execute:
+    $ bundle
+Or install it yourself as:
+    $ gem install scrapifier
+An then require the gem:
+    $ require 'scrapifier'
+## Usage
+The String#scrapify method finds URIs in a string and then gets their metadata, e.g., the page's title, description, images, keywords, language, encode, "reply to" email, author and URI. All the data is returned in a well-formatted hash.
+#### Default usage.
+``` ruby
+'Wow! What an awesome site: http://adtangerine.com!'.scrapify
+#=> {
+#   title: "AdTangerine | Boosting great ideas",
+#   description: "Advertising social network that uses tangerines as a virtual currency..." ,
+#   keywords: "ad network, ad, advertising, advertiser, publisher, social media",
+#   lang: "en-us",
+#   encode: "utf-8",
+#   reply_to: "sayhello@adtangerine.com",
+#   author: "Tiago Guedes, Jonatas de Paula, Raphael da Costa",
+#   images: ["http://adtangerine.com/assets/logo_adt_og.png", "http://adtangerine.com/assets/logo_adt_og.png", "http://s3-us-west-2.amazonaws.com/adtangerine-prod/users/avatars/000/000/834/thumb/275747_1118382211_1929809351_n.jpg", "http://adtangerine.com/assets/foobar.gif"],
+#   uri: "http://adtangerine.com"
+# }
+```
+#### Allow only certain image types.
+``` ruby
+'Wow! What an awesome site: http://adtangerine.com!'.scrapify(images: :jpg)
+#=> {
+#   title: "AdTangerine | Boosting great ideas",
+#   description: "Advertising social network that uses tangerines as a virtual currency..." ,
+#   keywords: "ad network, ad, advertising, advertiser, publisher, social media",
+#   lang: "en-us",
+#   encode: "utf-8",
+#   reply_to: "sayhello@adtangerine.com",
+#   author: "Tiago Guedes, Jonatas de Paula, Raphael da Costa",
+#   images: ["http://s3-us-west-2.amazonaws.com/adtangerine-prod/users/avatars/000/000/834/thumb/275747_1118382211_1929809351_n.jpg"],
+#   uri: "http://adtangerine.com"
+# }
+'Wow! What an awesome site: http://adtangerine.com!'.scrapify(images: [:png, :gif])
+#=> {
+#   title: "AdTangerine | Boosting great ideas",
+#   description: "Advertising social network that uses tangerines as a virtual currency..." ,
+#   keywords: "ad network, ad, advertising, advertiser, publisher, social media",
+#   lang: "en-us",
+#   encode: "utf-8",
+#   reply_to: "sayhello@adtangerine.com",
+#   author: "Tiago Guedes, Jonatas de Paula, Raphael da Costa",
+#   images: ["http://adtangerine.com/assets/logo_adt_og.png", "http://adtangerine.com/assets/logo_adt_og.png", "http://adtangerine.com/assets/foobar.gif"],
+#   uri: "http://adtangerine.com"
+# }
+```
+#### Choose which URI you want it to be scraped.
+``` ruby
+'Check out: http://adtangerine.com and www.twitflink.com'.scrapify(which: 1)
+#=> {
+#   title: "TwitFlink | Find a link!",
+#   description: "TwitFlink is a very simple searching tool that allows people to find out links tweeted...",
+#   keywords: "search, searching tool, link, twitter, social media",
+#   lang: "en-us",
+#   encode: "utf-8",
+#   reply_to: "sayhello@adtangerine.com",
+#   author: "Tiago Guedes",
+#   images: ["http://www.twitflink.com//assets/tf_logo.png", "http://twitflink.com/assets/tf_logo.png"],
+#   uri: "http://www.twitflink.com"
+# }
+'Check out: http://adtangerine.com and www.twitflink.com'.scrapify(which: 0, images: :gif)
+#=> {
+#   title: "AdTangerine | Boosting great ideas",
+#   description: "Advertising social network that uses tangerines as a virtual currency..." ,
+#   keywords: "ad network, ad, advertising, advertiser, publisher, social media",
+#   lang: "en-us",
+#   encode: "utf-8",
+#   reply_to: "sayhello@adtangerine.com",
+#   author: "Tiago Guedes, Jonatas de Paula, Raphael da Costa",
+#   images: ["http://adtangerine.com/assets/foobar.gif"],
+#   uri: "http://adtangerine.com"
+# }
+```
+## Contributing
+1. Fork it
+2. Create your feature branch (`git checkout -b my-new-feature`)
+3. Commit your changes (`git commit -am 'Add some feature'`)
+4. Push to the branch (`git push origin my-new-feature`)
+5. Create new Pull Request
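
The README's `images:` option accepts a single type or an array of types. The sketch below shows one way that kind of filter could work, by comparing the file extension of each URL path. It is an illustration only, not the gem's implementation, and the name `filter_images` is hypothetical.

```ruby
require 'uri'

# Hypothetical helper: keeps only the URLs whose path ends in one of the
# requested extensions. `types` may be a single symbol or an array of symbols,
# mirroring the `images: :jpg` and `images: [:png, :gif]` forms in the README.
def filter_images(urls, types)
  wanted = Array(types).map { |type| type.to_s.downcase }
  urls.select do |url|
    extension = File.extname(URI.parse(url).path).delete('.').downcase
    wanted.include?(extension)
  end
end

# Image URLs taken from the README's default-usage example.
images = [
  'http://adtangerine.com/assets/logo_adt_og.png',
  'http://adtangerine.com/assets/logo_adt_og.png',
  'http://s3-us-west-2.amazonaws.com/adtangerine-prod/users/avatars/000/000/834/thumb/275747_1118382211_1929809351_n.jpg',
  'http://adtangerine.com/assets/foobar.gif'
]

puts filter_images(images, :gif)
# prints http://adtangerine.com/assets/foobar.gif
```

In this sketch `:jpg` does not match a `.jpeg` extension, and duplicate URLs are kept, as they are in the README output.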

data/lib/scrapifier/version.rb CHANGED

@@ -1,3 +1,3 @@
 module Scrapifier
-  VERSION = '0.0.5'
+  VERSION = '0.0.6'
 end

data/lib/scrapifier/xpath.rb CHANGED

@@ -41,14 +41,14 @@ module Scrapifier
     REPLY_TO =
       <<-END.gsub(/^\s+\|/, '')
-        |//meta[@name="reply_to"]/@content
+        |//meta[@name="reply_to"]/@content|
+        |//meta[@name="Reply_to"]/@content
       END
     AUTHOR =
       <<-END.gsub(/^\s+\|/, '')
         |//meta[@name="author"]/@content|
-        |//meta[@name="Author"]/@content|
-        |//meta[@name="reply_to"]/@content
+        |//meta[@name="Author"]/@content
       END
     IMG =
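
In this hunk, the `reply_to` path is removed from `AUTHOR`, where it could have let an author lookup match reply-to content, and a capitalized `Reply_to` variant is added to `REPLY_TO`. The sketch below reproduces the 0.0.6 constants in a standalone module to show what the heredoc-plus-`gsub` idiom evaluates to. The module name `XPathSketch` and the `alternatives` helper are hypothetical and exist only for illustration.

```ruby
# Standalone reproduction of the two constants as they appear in 0.0.6.
module XPathSketch
  # The gsub strips each line's leading whitespace and margin pipe. A pipe at
  # the end of a line is left in place and acts as the XPath union operator.
  REPLY_TO =
    <<-END.gsub(/^\s+\|/, '')
      |//meta[@name="reply_to"]/@content|
      |//meta[@name="Reply_to"]/@content
    END

  AUTHOR =
    <<-END.gsub(/^\s+\|/, '')
      |//meta[@name="author"]/@content|
      |//meta[@name="Author"]/@content
    END

  # Hypothetical helper: splits a union expression into its individual paths.
  def self.alternatives(expression)
    expression.split('|').map(&:strip)
  end
end

puts XPathSketch.alternatives(XPathSketch::REPLY_TO)
puts XPathSketch.alternatives(XPathSketch::AUTHOR)
```

Each constant is a single string with a newline after every union pipe, which XPath treats as insignificant whitespace.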

metadata CHANGED

@@ -1,14 +1,14 @@
 --- !ruby/object:Gem::Specification
 name: scrapifier
 version: !ruby/object:Gem::Version
-  version: 0.0.5
+  version: 0.0.6
 platform: ruby
 authors:
 - Tiago Guedes
 autorequire:
 bindir: bin
 cert_chain: []
-date: 2014-06-28 00:00:00.000000000 Z
+date: 2014-06-29 00:00:00.000000000 Z
 dependencies:
 - !ruby/object:Gem::Dependency
   name: nokogiri