RubyGems - html2md - Versions diffs - 0.1 → 0.1.1 - Mend

html2md 0.1 → 0.1.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

data/README.md CHANGED

@@ -0,0 +1,60 @@
+Description
+===========
+A basic library that converts HTML to Markdown. It is basic in that it only supports basic HTML formatting (No CSS Support [yet])
+Examples
+========
+``` ruby
+require 'html2md'
+require 'open-uri'
+html2md = Html2Md.new(open("Http://www.google.com").read)
+puts html2md.parse
+```
+``` markdown
+GoogleSearch [Images](http://www.google.com/imghp?hl=en&tab=wi) [Videos](http://video.google.com/?hl=en&tab=wv) [Maps](http://maps.google.com/maps?hl=en&tab=wl) [News](http://news.google.com/nwshp?hl=en&tab=wn) [Shopping](http://www.google.com/shopping?hl=en&tab=wf) [Gmail](https://mail.google.com/mail/?tab=wm) [More »](http://www.google.com/intl/en/options/)[iGoogle](/url?sa=p&pref=ig&pval=3&q=http://www.google.com/ig%3Fhl%3Den%26source%3Diglk&usg=AFQjCNFA18XPfgb7dKnXfKz7x7g1GDH1tg) | [Web History](http://www.google.com/history/optout?hl=en) | [Settings](/preferences?hl=en) | [Sign in](https://accounts.google.com/ServiceLogin?hl=en&continue=http://www.google.com/)
+<table><tr><td> </td><td>
+</td><td>[Advanced search](/advanced_search?hl=en)[Language tools](/language_tools?hl=en)</td></tr></table>
+[Advertising Programs](/intl/en/ads/)[Business Solutions](/services/)[+Google](https://plus.google.com/116899029375914044550)[About Google](/intl/en/about.html)© 2012 - [Privacy](/intl/en/privacy.html)
+```
+Build
+=====
+This gem is built with Travis-ci.org. http://travis-ci.org/#!/pmorton/html2md
+Compatibility
+==============
+Currently not compatiable with jruby, mainly because I am too lazy to fix the build issues. Compatiablity for jruby will be added in the near future.
+Contributing
+============
+1. Fork this repository
+2. Create a branch for your proposed changes
+3. Add tests for your code
+4. Make sure that all tests pass
+5. Update Documentation!
+6. Issue a pull request
+License and Author
+==================
+Author:: Paul Morton (<geeksitk@gmail.com>)
+Licensed under the Apache License, Version 2.0 (the "License");
+you may not use this file except in compliance with the License.
+You may obtain a copy of the License at
+    http://www.apache.org/licenses/LICENSE-2.0
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.

data/Rakefile CHANGED

@@ -4,13 +4,16 @@ lib = File.expand_path('../lib/', __FILE__)
 $:.unshift lib unless $:.include?(lib)
 require 'html2md'
+require 'open-uri'
 Cucumber::Rake::Task.new do |t|
   t.cucumber_opts = %w{--format pretty}
 end
+ task :default => [:cucumber]
 desc "Test"
 task :t, [] => [] do |taks,args|
-  t = Html2Md.new(File.read('test.html'))
+  t = Html2Md.new(open("http://loremipsum.net/about.html").read)
   puts t.parse
 end

data/features/markdown.feature CHANGED

@@ -79,4 +79,19 @@ Feature: Markdown
   Scenario: Character data should not have new lines
     * HTML This is character data \n
     * I say parse
-    * The markdown should be (This is character data \n\n)
+    * The markdown should be (This is character data \n\n)
+  Scenario: First level headers
+    * HTML <h1>This is a H1 Element</h1>
+    * I say parse
+    * The markdown should be (\nThis is a H1 Element\n====================\n)
+  Scenario: Second level headers
+    * HTML <h2>This is a H2 Element</h2>
+    * I say parse
+    * The markdown should be (\nThis is a H2 Element\n--------------------\n)
+  Scenario: Third level headers
+    * HTML <h3>This is a H3 Element</h3>
+    * I say parse
+    * The markdown should be (\n### This is a H3 Element\n)

data/lib/html2md/VERSION.rb CHANGED

@@ -1,3 +1,3 @@
 class Html2Md
-  VERSION = "0.1"
+  VERSION = "0.1.1"
 end

data/lib/html2md/document.rb CHANGED

@@ -13,6 +13,7 @@ class Html2Md
       @allowed_tags = ['tr','td','th','table']
       @current_list = -1
       @list_tree = []
+      @last_cdata_length = 0
     end
@@ -91,6 +92,38 @@ class Html2Md
       @markdown << "\n\n"
     end
+    def start_h1(attributes)
+      @markdown << "\n"
+    end
+    def end_h1(attributes)
+      @markdown << "\n"
+      @last_cdata_length.times do
+        @markdown << "="
+      end
+      @markdown << "\n"
+    end
+    def start_h2(attributes)
+      @markdown << "\n"
+    end
+    def end_h2(attributes)
+      @markdown << "\n"
+      @last_cdata_length.times do
+        @markdown << "-"
+      end
+      @markdown << "\n"
+    end
+    def start_h3(attributes)
+      @markdown << "\n### "
+    end
+    def end_h3(attributes)
+      @markdown << "\n"
+    end
     def start_a(attributes)
       attributes.each do | attrib |
         if attrib[0].downcase.eql? 'href'
@@ -163,6 +196,7 @@ class Html2Md
     end
     def characters c
+      @last_cdata_length = c.chomp.length
       if @list_tree[-1]
         @markdown << c.chomp.lstrip.rstrip
       else

metadata CHANGED

@@ -1,7 +1,7 @@
 --- !ruby/object:Gem::Specification
 name: html2md
 version: !ruby/object:Gem::Version
-  version: '0.1'
+  version: 0.1.1
   prerelease:
 platform: ruby
 authors: