RubyGems - jazzez - Versions diffs - 1.1.1 - Mend

jazzez 1.1.1

Files changed (4) hide show

data/README +152 -0
data/doc/README.txt +152 -0
data/jazzez.rb +162 -0
metadata +57 -0

data/README ADDED

@@ -0,0 +1,152 @@
+Documentation for Jazzez Version 1.1.1 gem:
+1. Get the links from URL
+   Ex.
+   require 'jazzez'
+   output= Jazzez.new
+   puts output.links("google.com\")
+   Output:
+   http://images.google.com/imghp?hl=en&tab=wi
+   http://maps.google.com/maps?hl=en&tab=wl
+   http://news.google.com/nwshp?hl=en&tab=wn
+   http://video.google.com/?hl=en&tab=wv
+   http://mail.google.com/mail/?hl=en&tab=wm
+   http://www.google.com/intl/en/options/
+   https://www.google.com/accounts/Login?continue=http://66.249.89.44/&hl=en
+   http://google.com/advanced_search?hl=en
+   http://google.com/preferences?hl=en
+   http://google.com/language_tools?hl=en
+   http://google.com/intl/en/ads/
+   http://google.com/services/
+   http://google.com/intl/en/about.html
+   http://www.google.com/ncr
+   http://google.com/intl/en/privacy.html
+Usage:
+1. Get the URL from User.
+2. Make sure to check whether it is valid or not.
+3. If it is valid, then get the source code for that page with the help of Mechanize gem.
+4. Get all the <a> tags & collect only HREF Values in that page with the help of Mechanize gem
+5. If the href values not having the domains then add a URL(homepage) + Href value.
+6. return the results to User as an array
+2. Get the Second level links
+   Ex.
+   require 'jazzez'
+   output= Jazzez.new
+   puts output.links_level2("google.com\")
+   Output:
+   It gives the Second level outputs.
+   If you want to see the output of this code then just go to http://jazzez.wordpress.com
+3. Get the Html tags
+   Ex.
+   require 'jazzez'
+   output= Jazzez.new
+   puts output.tagdetails("google.com\")
+   Output:
+   1<html tag(s)
+   1</html> tag(s)
+   1<head tag(s)
+   1</head> tag(s)
+   1<body tag(s)
+   1</body> tag(s)
+   2<table tag(s)
+   2</table> tag(s)
+   3<tr tag(s)
+   3</tr> tag(s)
+   9<td tag(s)
+   9</td> tag(s)
+   0<th tag(s)
+   0</th> tag(s)
+   0<l  tag(s)
+   0</l> tag(s)
+   0<link tag(s)
+   1<p tag(s)
+   1</p> tag(s)
+   4<div tag(s)
+   4</div> tag(s)
+   0<span tag(s)
+   0</span> tag(s)
+   4<script tag(s)
+   4</script> tag(s)
+   0<ul tag(s)
+   0</ul> tag(s)
+   0<ol tag(s)
+   0</ol> tag(s)
+   16<a tag(s)
+   15</a> tag(s)
+   0<h1 tag(s)
+   0</h1> tag(s)
+   0<h2 tag(s)
+   0</h2> tag(s)
+   0<h3 tag(s)
+   0</h3> tag(s)
+   0<h4 tag(s)
+   0</h4> tag(s)
+   0<h5 tag(s)
+   0</h5> tag(s)
+   0<h6 tag(s)
+   0</h6> tag(s)
+   4<font tag(s)
+   4</font> tag(s)
+   0<select tag(s)
+   0</select> tag(s)
+   0<option tag(s)
+   0</option> tag(s)
+Usage:
+Easy to answer the below questions
+     How many tables in your code ?
+     How many table rows/coloums in your code ?
+     How Many div tags opened and how many div tags closed ?
+     Are you sure your html tags were properly closed ?
+More functions available in next version.
+Any queries just send a mail to jazzezravi@gmail.com.
+Thanks,
+P.Raveendran
+http://raveendran.wordpress.com
+http://jazzez.wordpress.com

data/doc/README.txt ADDED

@@ -0,0 +1,152 @@
+Documentation for Jazzez Version 1.1.1 gem:
+1. Get the links from URL
+   Ex.
+   require 'jazzez'
+   output= Jazzez.new
+   puts output.links("google.com\")
+   Output:
+   http://images.google.com/imghp?hl=en&tab=wi
+   http://maps.google.com/maps?hl=en&tab=wl
+   http://news.google.com/nwshp?hl=en&tab=wn
+   http://video.google.com/?hl=en&tab=wv
+   http://mail.google.com/mail/?hl=en&tab=wm
+   http://www.google.com/intl/en/options/
+   https://www.google.com/accounts/Login?continue=http://66.249.89.44/&hl=en
+   http://google.com/advanced_search?hl=en
+   http://google.com/preferences?hl=en
+   http://google.com/language_tools?hl=en
+   http://google.com/intl/en/ads/
+   http://google.com/services/
+   http://google.com/intl/en/about.html
+   http://www.google.com/ncr
+   http://google.com/intl/en/privacy.html
+Usage:
+1. Get the URL from User.
+2. Make sure to check whether it is valid or not.
+3. If it is valid, then get the source code for that page with the help of Mechanize gem.
+4. Get all the <a> tags & collect only HREF Values in that page with the help of Mechanize gem
+5. If the href values not having the domains then add a URL(homepage) + Href value.
+6. return the results to User as an array
+2. Get the Second level links
+   Ex.
+   require 'jazzez'
+   output= Jazzez.new
+   puts output.links_level2("google.com\")
+   Output:
+   It gives the Second level outputs.
+   If you want to see the output of this code then just go to http://jazzez.wordpress.com
+3. Get the Html tags
+   Ex.
+   require 'jazzez'
+   output= Jazzez.new
+   puts output.tagdetails("google.com\")
+   Output:
+   1<html tag(s)
+   1</html> tag(s)
+   1<head tag(s)
+   1</head> tag(s)
+   1<body tag(s)
+   1</body> tag(s)
+   2<table tag(s)
+   2</table> tag(s)
+   3<tr tag(s)
+   3</tr> tag(s)
+   9<td tag(s)
+   9</td> tag(s)
+   0<th tag(s)
+   0</th> tag(s)
+   0<l  tag(s)
+   0</l> tag(s)
+   0<link tag(s)
+   1<p tag(s)
+   1</p> tag(s)
+   4<div tag(s)
+   4</div> tag(s)
+   0<span tag(s)
+   0</span> tag(s)
+   4<script tag(s)
+   4</script> tag(s)
+   0<ul tag(s)
+   0</ul> tag(s)
+   0<ol tag(s)
+   0</ol> tag(s)
+   16<a tag(s)
+   15</a> tag(s)
+   0<h1 tag(s)
+   0</h1> tag(s)
+   0<h2 tag(s)
+   0</h2> tag(s)
+   0<h3 tag(s)
+   0</h3> tag(s)
+   0<h4 tag(s)
+   0</h4> tag(s)
+   0<h5 tag(s)
+   0</h5> tag(s)
+   0<h6 tag(s)
+   0</h6> tag(s)
+   4<font tag(s)
+   4</font> tag(s)
+   0<select tag(s)
+   0</select> tag(s)
+   0<option tag(s)
+   0</option> tag(s)
+Usage:
+Easy to answer the below questions
+     How many tables in your code ?
+     How many table rows/coloums in your code ?
+     How Many div tags opened and how many div tags closed ?
+     Are you sure your html tags were properly closed ?
+More functions available in next version.
+Any queries just send a mail to jazzezravi@gmail.com.
+Thanks,
+P.Raveendran
+http://raveendran.wordpress.com
+http://jazzez.wordpress.com

data/jazzez.rb ADDED

@@ -0,0 +1,162 @@
+require 'rubygems'
+class Jazzez
+    def check_http(url)
+      #Convert into String
+      @url=url.to_s
+      #Variable need when --> url without starting http://
+      @http="http://"
+      # Add http:// when url without starting http://
+      @url = @http+@url if @url[0,4] != "http"
+      # Get a homepage or domain
+      @homepage=@http+@url.split('/')[2]
+    end
+    def create_agent
+      #Require the Mechanize gem
+      require 'mechanize'
+      #create a new object for Mechanize class
+      @agent = WWW::Mechanize.new
+    end
+    def check_URL_length(url)
+      #Raise error when given URL length is less than 4 characters.
+      raise "The given URL is not a valid one.Please provide a valid URL"if url.strip.length < 4
+    end
+    def links(url)
+      # call method --> check_URL_length
+      check_URL_length(url)
+      # call method --> check_http
+      check_http(url)
+      # call method --> create_agent
+      create_agent
+      # output array
+      @level0=[]
+      #Get the source code for particular url or page
+      page = @agent.get(@url)  rescue  page = 1    #in case any error the assign page =1
+      if page!=1
+        # If the page has links then
+        if page.links !=nil
+          #Set of links available then
+          page.links.each do |one|
+            #Get the uri and convert into String
+            href=one.uri.to_s rescue next
+            #Add http:// when url without starting http://
+            href=@homepage+href if href[0,4] != "http"
+            # Push the output into the array
+            @level0 << href.to_s
+          end
+          # The array is empty then raise error
+          @empty=@level0.empty?
+          raise "Oops. Something went wrong. Check the given URL have any links inside or not" if @empty == true
+        end
+        #return the output
+        return @level0
+      else
+        #Otherwise raise this error
+        raise "Oops. Something went wrong.
+        1. Check whether the given URL is valid or not.
+        2. Check your internet connection.
+        Try again now.."
+      end
+    end
+    def levels(url)
+      # Dummy method for LEVEL 2 related links
+      check_http(url)
+      create_agent
+      @level0=[]
+      page = @agent.get(@url)  rescue  page = 1
+      if page!=1
+        if page.links !=nil
+          page.links.each do |one|
+            href=one.uri.to_s rescue next
+            href=@homepage+href if href[0,4] != "http"
+            @level0 << href.to_s
+          end
+        end
+      end
+      #return the output
+      return @level0
+    end
+    def array_links(links)
+      @final_output=[]
+      @arraylinks=[]
+      @arraylinks=links
+      @arraylinks.each do |link|
+        levels(link) if  (@url.split('/')[2]== link.split('/')[2]) == true
+        @final_output<<@level0
+      end
+    end
+    def backup
+      @level1_output << @level0
+    end
+    def links_level2(url)
+      # call method --> links
+      links(url)
+      #level1_output
+      @level1_output=[]
+      # call method -->backup
+      backup
+      # call method --> array_links
+      array_links(@level0)
+      @final_output=@final_output.flatten
+      @final_output=@final_output.uniq
+      @level1_output << @final_output
+      @level1_output=@level1_output.flatten
+      @level1_output=@level1_output.uniq
+      return @level1_output.sort # final output
+    end
+    def   tagdetails(url)
+      # call method --> check_URL_length
+      check_URL_length(url)
+       # call method --> check_http
+      check_http(url)
+       # call method --> create_agent
+      create_agent
+      page = @agent.get(@url)  rescue page =1
+      raise "oops. Something went wrong.
+          1. Check the given URL is valid or not.
+          2. Check your internet connection" if page ==1
+      #Get the body content
+      source=page.body
+      #What are the Tags we are going to count
+      search=["<html","</html>","<head","</head>","<body","</body>","<table","</table>","<tr","</tr>","<td","</td>","<th","</th>","<l ","</l>","<link","<p","</p>","<div","</div>","<span","</span>","<script","</script>","<ul","</ul>","<ol","</ol>","<a","</a>","<h1","</h1>","<h2","</h2>","<h3","</h3>","<h4","</h4>","<h5","</h5>","<h6","</h6>","<font","</font>","<select","</select>","<option","</option>"]
+      tag=[]
+      taghelp=[]
+      result=[]
+      source.each do |line|
+        i=0
+        while i < search.length do
+          # Search the terms
+          taghelp = line.downcase.scan(search[i]).to_a
+          taghelp.each do |result_tag|
+            #push the results
+            tag << result_tag.to_s
+          end
+          i+=1
+        end
+      end
+      j=0
+      while j< search.length do
+        #counting the times
+        count= tag.grep(search[j])
+        #Main result
+        result << count.length.to_s + search[j].to_s + " tag(s)"
+        j+=1
+      end
+      return result  # returns the result
+    end
+end

metadata ADDED

@@ -0,0 +1,57 @@
+--- !ruby/object:Gem::Specification
+rubygems_version: 0.9.4
+specification_version: 1
+name: jazzez
+version: !ruby/object:Gem::Version
+  version: 1.1.1
+date: 2009-04-14 00:00:00 +05:30
+summary: Get Links,level 2 links and Tag details from URL
+require_paths:
+- .
+email: jazzezravi@gmail.com
+homepage: http://jazzez.wordpress.com/
+rubyforge_project: jazzez
+description:
+autorequire:
+default_executable:
+bindir: bin
+has_rdoc: true
+required_ruby_version: !ruby/object:Gem::Version::Requirement
+  requirements:
+  - - ">"
+    - !ruby/object:Gem::Version
+      version: 0.0.0
+  version:
+platform: ruby
+signing_key:
+cert_chain:
+post_install_message:
+authors:
+- Jazzezravi
+files:
+- jazzez.rb
+- doc/README.txt
+- README
+test_files: []
+rdoc_options: []
+extra_rdoc_files:
+- README
+- doc/README.txt
+executables: []
+extensions: []
+requirements: []
+dependencies:
+- !ruby/object:Gem::Dependency
+  name: mechanize
+  version_requirement:
+  version_requirements: !ruby/object:Gem::Version::Requirement
+    requirements:
+    - - ">="
+      - !ruby/object:Gem::Version
+        version: 0.7.5
+    version: