RubyGems - html2tex - Versions diffs - 0.1.2 → 0.1.3 - Mend

html2tex 0.1.2 → 0.1.3

Files changed (4) hide show

data/README.md CHANGED Viewed

@@ -59,3 +59,22 @@ comprehensive.
 StringScanner is used to process the HTML, but cannot read from a stream
 directly, so the entire input document must be read into memory as a string
 first.
+UTF-8 is assumed everywhere; other character encodings will produce odd
+results. If the HTML file to be processed is not in UTF-8 encoding with unix
+line endings (at least, on Linux/OS X/etc.), _fix that first_. The usual
+suspects will help here:
+    iconv -f windows-1252 -t utf-8 < somefile-win1252.html > somefile-utf8.html
+    dos2unix somefile-utf8.html
+Next steps
+----------
+If you have XeLaTex, you can easily turn the generated `.tex` file into a PDF:
+    xelatex my-book.tex
+For better results, tweak the font settings or use a custom class like [this][ebook.cls].
+[ebook.cls]: http://github.com/threedaymonk/gutenberg2pdf/blob/master/ebook.cls

data/lib/html2tex/preamble_processor.rb CHANGED Viewed

@@ -21,6 +21,7 @@ class HTML2TeX
   private
     def read_html_head
       scanner.scan %r{\s*}
+      scanner.scan %r{<\?xml[^>]*?\?>\s*}i
       scanner.scan %r{<!doctype[^>]*>\s*}i
       scanner.scan %r{<html[^>]*>\s*}i
       if head = scanner.scan(%r{<head[^>]*>.*?</head>}im)

data/lib/html2tex/version.rb CHANGED Viewed

@@ -1,3 +1,3 @@
 class HTML2TeX
-  VERSION = "0.1.2"
+  VERSION = "0.1.3"
 end

metadata CHANGED Viewed

@@ -1,7 +1,7 @@
 --- !ruby/object:Gem::Specification
 name: html2tex
 version: !ruby/object:Gem::Version
-  version: 0.1.2
+  version: 0.1.3
 platform: ruby
 authors:
 - Paul Battley
@@ -9,7 +9,7 @@ autorequire:
 bindir: bin
 cert_chain: []
-date: 2010-05-09 00:00:00 +01:00
+date: 2010-05-10 00:00:00 +01:00
 default_executable:
 dependencies:
 - !ruby/object:Gem::Dependency