nokogiri 1.4.6-java → 1.4.7-java

Sign up to get free protection for your applications and to get access to all the features.

Potentially problematic release.


This version of nokogiri might be problematic. Click here for more details.

@@ -1,3 +1,9 @@
1
+ === 1.4.7 / 2011年7月1日
2
+
3
+ * バグの修正
4
+
5
+ * エンコーディング宣言のないHTMLファイルで部分的に重複したドキュメントが生成される問題を修正した. #478
6
+
1
7
  === 1.4.6 / 2011年6月19日
2
8
 
3
9
  * ノート
@@ -1,3 +1,11 @@
1
+ === 1.4.7 / 2011-07-01
2
+
3
+ * Bugfixes
4
+
5
+ * Fix a bug in advanced encoding detection that leads to partially
6
+ duplicated document when parsing an HTML file with unknown
7
+ encoding. Thanks, Timothy Elliott (@ender672)! #478
8
+
1
9
  === 1.4.6 / 2011-06-19
2
10
 
3
11
  * Notes
@@ -92,7 +92,7 @@ module Nokogiri
92
92
  if string_or_io.respond_to?(:read)
93
93
  url ||= string_or_io.respond_to?(:path) ? string_or_io.path : nil
94
94
  if !encoding
95
- # Perform further encoding detection that libxml2 does
95
+ # Perform advanced encoding detection that libxml2 does
96
96
  # not do.
97
97
  string_or_io = EncodingReader.new(string_or_io)
98
98
  begin
@@ -181,16 +181,13 @@ module Nokogiri
181
181
  if !@firstchunk
182
182
  @firstchunk = @io.read(len) or return nil
183
183
 
184
- # This implementation expects and assumes that the first
185
- # call from htmlReadIO() is made with a length long enough
186
- # (~1KB) to achieve further encoding detection that
187
- # libxml2 does not do.
184
+ # This implementation expects that the first call from
185
+ # htmlReadIO() is made with a length long enough (~1KB) to
186
+ # achieve advanced encoding detection.
188
187
  if encoding = EncodingReader.detect_encoding(@firstchunk)
188
+ # The first chunk is stored for the next read in retry.
189
189
  raise EncodingFoundException, encoding
190
190
  end
191
-
192
- # This chunk is stored for the next read in retry.
193
- return @firstchunk
194
191
  end
195
192
 
196
193
  ret = @firstchunk.slice!(0, len)
@@ -1,6 +1,6 @@
1
1
  module Nokogiri
2
2
  # The version of Nokogiri you are using
3
- VERSION = '1.4.6'
3
+ VERSION = '1.4.7'
4
4
 
5
5
  # More complete version information about libxml
6
6
  VERSION_INFO = {}
@@ -22,6 +22,7 @@ module Nokogiri
22
22
  SHIFT_JIS_HTML = File.join(ASSETS_DIR, 'shift_jis.html')
23
23
  ENCODING_XHTML_FILE = File.join(ASSETS_DIR, 'encoding.xhtml')
24
24
  ENCODING_HTML_FILE = File.join(ASSETS_DIR, 'encoding.html')
25
+ NOENCODING_FILE = File.join(ASSETS_DIR, 'noencoding.html')
25
26
  PO_XML_FILE = File.join(ASSETS_DIR, 'po.xml')
26
27
  PO_SCHEMA_FILE = File.join(ASSETS_DIR, 'po.xsd')
27
28
  ADDRESS_SCHEMA_FILE = File.join(ASSETS_DIR, 'address_book.rlx')
@@ -89,6 +89,13 @@ module Nokogiri
89
89
  File.open(file, 'rb')
90
90
  end
91
91
 
92
+ def test_document_html_noencoding
93
+ from_stream = Nokogiri::HTML(binopen(NOENCODING_FILE))
94
+ from_string = Nokogiri::HTML(binread(NOENCODING_FILE))
95
+
96
+ assert_equal from_string.to_s.size, from_stream.to_s.size
97
+ end
98
+
92
99
  def test_document_xhtml_enc
93
100
  [ENCODING_XHTML_FILE, ENCODING_HTML_FILE].each { |file|
94
101
  doc_from_string_enc = Nokogiri::HTML(binread(file), nil, 'Shift_JIS')
metadata CHANGED
@@ -2,7 +2,7 @@
2
2
  name: nokogiri
3
3
  version: !ruby/object:Gem::Version
4
4
  prerelease:
5
- version: 1.4.6
5
+ version: 1.4.7
6
6
  platform: java
7
7
  authors:
8
8
  - Aaron Patterson
@@ -11,7 +11,7 @@ autorequire:
11
11
  bindir: bin
12
12
  cert_chain: []
13
13
 
14
- date: 2011-06-19 00:00:00 -04:00
14
+ date: 2011-07-01 00:00:00 -04:00
15
15
  default_executable: nokogiri
16
16
  dependencies:
17
17
  - !ruby/object:Gem::Dependency