text-hyphen 1.0.2 → 1.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- data/.gemtest +0 -0
- data/History.rdoc +54 -0
- data/License.rdoc +159 -0
- data/Manifest.txt +67 -5
- data/README.rdoc +69 -0
- data/Rakefile +8 -4
- data/bin/{hyphen → ruby-hyphen} +0 -0
- data/lib/text-hyphen.rb +1 -0
- data/lib/text/hyphen.rb +74 -111
- data/lib/text/hyphen/language.rb +90 -26
- data/lib/text/hyphen/language/1.8/ca.rb +171 -0
- data/lib/text/hyphen/language/1.8/cs.rb +360 -0
- data/lib/text/hyphen/language/1.8/da.rb +117 -0
- data/lib/text/hyphen/language/1.8/de1.rb +718 -0
- data/lib/text/hyphen/language/1.8/de2.rb +680 -0
- data/lib/text/hyphen/language/1.8/en_uk.rb +789 -0
- data/lib/text/hyphen/language/1.8/en_us.rb +490 -0
- data/lib/text/hyphen/language/1.8/es.rb +287 -0
- data/lib/text/hyphen/language/1.8/et.rb +335 -0
- data/lib/text/hyphen/language/1.8/eu.rb +112 -0
- data/lib/text/hyphen/language/1.8/fi.rb +112 -0
- data/lib/text/hyphen/language/1.8/fr.rb +389 -0
- data/lib/text/hyphen/language/1.8/ga.rb +606 -0
- data/lib/text/hyphen/language/1.8/hr.rb +122 -0
- data/lib/text/hyphen/language/1.8/hsb.rb +179 -0
- data/lib/text/hyphen/language/1.8/hu1.rb +380 -0
- data/lib/text/hyphen/language/1.8/hu2.rb +1278 -0
- data/lib/text/hyphen/language/1.8/ia.rb +71 -0
- data/lib/text/hyphen/language/1.8/id.rb +91 -0
- data/lib/text/hyphen/language/1.8/is.rb +387 -0
- data/lib/text/hyphen/language/1.8/it.rb +133 -0
- data/lib/text/hyphen/language/1.8/la.rb +132 -0
- data/lib/text/hyphen/language/1.8/mn.rb +101 -0
- data/lib/text/hyphen/language/1.8/nl.rb +1250 -0
- data/lib/text/hyphen/language/1.8/no1.rb +299 -0
- data/lib/text/hyphen/language/1.8/no2.rb +134 -0
- data/lib/text/hyphen/language/1.8/pl.rb +478 -0
- data/lib/text/hyphen/language/1.8/pt.rb +54 -0
- data/lib/text/hyphen/language/1.8/sv.rb +447 -0
- data/lib/text/hyphen/language/1.9/ca.rb +174 -0
- data/lib/text/hyphen/language/1.9/cs.rb +361 -0
- data/lib/text/hyphen/language/1.9/da.rb +117 -0
- data/lib/text/hyphen/language/1.9/de1.rb +719 -0
- data/lib/text/hyphen/language/1.9/de2.rb +682 -0
- data/lib/text/hyphen/language/1.9/en_uk.rb +791 -0
- data/lib/text/hyphen/language/1.9/en_us.rb +492 -0
- data/lib/text/hyphen/language/1.9/es.rb +289 -0
- data/lib/text/hyphen/language/1.9/et.rb +336 -0
- data/lib/text/hyphen/language/1.9/eu.rb +114 -0
- data/lib/text/hyphen/language/1.9/fi.rb +113 -0
- data/lib/text/hyphen/language/1.9/fr.rb +391 -0
- data/lib/text/hyphen/language/1.9/ga.rb +608 -0
- data/lib/text/hyphen/language/1.9/hr.rb +123 -0
- data/lib/text/hyphen/language/1.9/hsb.rb +180 -0
- data/lib/text/hyphen/language/1.9/hu1.rb +382 -0
- data/lib/text/hyphen/language/1.9/hu2.rb +1280 -0
- data/lib/text/hyphen/language/1.9/ia.rb +73 -0
- data/lib/text/hyphen/language/1.9/id.rb +93 -0
- data/lib/text/hyphen/language/1.9/is.rb +388 -0
- data/lib/text/hyphen/language/1.9/it.rb +134 -0
- data/lib/text/hyphen/language/1.9/la.rb +134 -0
- data/lib/text/hyphen/language/1.9/mn.rb +102 -0
- data/lib/text/hyphen/language/1.9/nl.rb +1252 -0
- data/lib/text/hyphen/language/1.9/no1.rb +301 -0
- data/lib/text/hyphen/language/1.9/no2.rb +136 -0
- data/lib/text/hyphen/language/1.9/pl.rb +479 -0
- data/lib/text/hyphen/language/1.9/pt.rb +55 -0
- data/lib/text/hyphen/language/1.9/sv.rb +449 -0
- data/lib/text/hyphen/language/ca.rb +3 -173
- data/lib/text/hyphen/language/cs.rb +3 -362
- data/lib/text/hyphen/language/da.rb +3 -117
- data/lib/text/hyphen/language/de.rb +1 -0
- data/lib/text/hyphen/language/de1.rb +3 -724
- data/lib/text/hyphen/language/de2.rb +3 -685
- data/lib/text/hyphen/language/en_uk.rb +3 -790
- data/lib/text/hyphen/language/en_us.rb +3 -492
- data/lib/text/hyphen/language/es.rb +3 -288
- data/lib/text/hyphen/language/et.rb +3 -336
- data/lib/text/hyphen/language/eu.rb +3 -114
- data/lib/text/hyphen/language/fi.rb +3 -112
- data/lib/text/hyphen/language/fr.rb +3 -391
- data/lib/text/hyphen/language/ga.rb +3 -607
- data/lib/text/hyphen/language/hr.rb +3 -123
- data/lib/text/hyphen/language/hsb.rb +2 -179
- data/lib/text/hyphen/language/hu.rb +1 -0
- data/lib/text/hyphen/language/hu1.rb +3 -384
- data/lib/text/hyphen/language/hu2.rb +3 -1282
- data/lib/text/hyphen/language/ia.rb +3 -72
- data/lib/text/hyphen/language/id.rb +3 -96
- data/lib/text/hyphen/language/is.rb +3 -389
- data/lib/text/hyphen/language/it.rb +3 -134
- data/lib/text/hyphen/language/la.rb +3 -133
- data/lib/text/hyphen/language/mn.rb +3 -102
- data/lib/text/hyphen/language/ms.rb +9 -0
- data/lib/text/hyphen/language/nl.rb +3 -1252
- data/lib/text/hyphen/language/no.rb +1 -0
- data/lib/text/hyphen/language/no1.rb +3 -302
- data/lib/text/hyphen/language/no2.rb +3 -137
- data/lib/text/hyphen/language/pl.rb +3 -479
- data/lib/text/hyphen/language/pt.rb +3 -55
- data/lib/text/hyphen/language/sv.rb +3 -448
- data/test/data/bug_9807_latin1.rb +10 -0
- data/test/data/bug_9807_utf-8.rb +10 -0
- data/test/test_bugs.rb +14 -4
- data/test/test_text_hyphen.rb +3 -3
- data/text-hyphen.gemspec +29 -29
- metadata +101 -40
- data/COPYING.txt +0 -339
- data/History.txt +0 -23
- data/LICENCE.txt +0 -47
- data/README.txt +0 -82
data/History.txt
DELETED
|
@@ -1,23 +0,0 @@
|
|
|
1
|
-
== 1.0.2 / unreleased
|
|
2
|
-
* Moved to 'hoe' and GitHub.
|
|
3
|
-
* Preparing for 2.0 which will be Ruby 1.9-only for UTF-8.
|
|
4
|
-
* Fixing German support (RubyForge 28498):
|
|
5
|
-
* Choosing 'de' as a language will load 'de1'. Choosing 'de1' or 'de2' will
|
|
6
|
-
load properly now, but they will be reported with an ISO language code of
|
|
7
|
-
'de' (new optional #isocode attribute on a language definition that will
|
|
8
|
-
override the #iso_language setting of a Text::Hyphen instance if set).
|
|
9
|
-
* Both 'de1' and 'de2' can be loaded simultaneously now, but the first one
|
|
10
|
-
loaded will claim the Text::Hyphen::Language::DE constant.
|
|
11
|
-
* Added test cases for bugs:
|
|
12
|
-
* RubyForge 9807 (cannot reproduce)
|
|
13
|
-
* RubyForge 28128 (cannot reproduce)
|
|
14
|
-
* RubyForge 28498
|
|
15
|
-
|
|
16
|
-
== 1.0.1
|
|
17
|
-
* Minor modification to the RubyGem release of Text::Hyphen to enable the
|
|
18
|
-
hyphen command-line program.
|
|
19
|
-
|
|
20
|
-
== 1.0.0
|
|
21
|
-
* Initial version based on TeX::Hyphen 0.4.0 (some changes have been
|
|
22
|
-
backported to TeX::Hyphen 0.5.0).
|
|
23
|
-
* Incorporated many hyphenation pattern files from CTAN.
|
data/LICENCE.txt
DELETED
|
@@ -1,47 +0,0 @@
|
|
|
1
|
-
Text::Hyphen is copyright (c) 2004 - 2005 Austin Ziegler
|
|
2
|
-
|
|
3
|
-
Licensing for Text::Hyphen is unfortunately complex because of the various
|
|
4
|
-
copyrights and licences of the source hyphenation files. Some of these files
|
|
5
|
-
are available only under the TeX licence and others are available only under
|
|
6
|
-
the GNU GPL while others are public domain. Each language file has these
|
|
7
|
-
licences embedded within the file. Please consult each file's licence to
|
|
8
|
-
ensure that it is compatible with your application.
|
|
9
|
-
|
|
10
|
-
The copyright on the Text::Hyphen application/library and the Ruby
|
|
11
|
-
translations of hyphenation files belongs to Austin Ziegler. All other
|
|
12
|
-
copyrights on original versions still stand; Text::Hyphen is a derivative work
|
|
13
|
-
of these and other projects.
|
|
14
|
-
|
|
15
|
-
Application and Compilation Licences
|
|
16
|
-
------------------------------------
|
|
17
|
-
Text::Hyphen, the application/library is licensed under the same terms as
|
|
18
|
-
Ruby. Note that this specifically refers to the contents of bin/hyphen,
|
|
19
|
-
lib/text/hyphen.rb, and lib/text/hyphen/language.rb.
|
|
20
|
-
|
|
21
|
-
Individual language hyphenation files are NOT licensed under these terms, but
|
|
22
|
-
under the following MIT-style licence and the original hyphenation pattern
|
|
23
|
-
licenses. The copyright for the original TeX hyphenation files is held by the
|
|
24
|
-
original authors; any mistakes in conversion of these files to Ruby is
|
|
25
|
-
attributable to the contributors to the Text::Hyphen package only.
|
|
26
|
-
|
|
27
|
-
The compilation package Text::Hyphen is licensed under the same terms as Ruby.
|
|
28
|
-
|
|
29
|
-
Blanket Language Hyphenation File Licence
|
|
30
|
-
-----------------------------------------
|
|
31
|
-
Permission is hereby granted, free of charge, to any person obtaining a copy
|
|
32
|
-
of this software and associated documentation files (the "Software"), to deal
|
|
33
|
-
in the Software without restriction, including without limitation the rights
|
|
34
|
-
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
|
35
|
-
copies of the Software, and to permit persons to whom the Software is
|
|
36
|
-
furnished to do so, subject to the following conditions:
|
|
37
|
-
|
|
38
|
-
The above copyright notice and this permission notice shall be included in all
|
|
39
|
-
copies or substantial portions of the Software.
|
|
40
|
-
|
|
41
|
-
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
|
42
|
-
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
|
43
|
-
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
|
44
|
-
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
|
45
|
-
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
|
46
|
-
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
|
47
|
-
SOFTWARE.
|
data/README.txt
DELETED
|
@@ -1,82 +0,0 @@
|
|
|
1
|
-
== text-hyphen
|
|
2
|
-
|
|
3
|
-
http://rubyforge.org/projects/text-format/
|
|
4
|
-
http://github.com/halostatue/text-hyphen/
|
|
5
|
-
|
|
6
|
-
== DESCRIPTION:
|
|
7
|
-
|
|
8
|
-
Text::Hyphen will hyphenate words using modified versions of TeX hyphenation
|
|
9
|
-
patterns.
|
|
10
|
-
|
|
11
|
-
Text::Hyphen will properly hyphenate various words according to the rules of
|
|
12
|
-
the language the word is written in. The algorithm is based on that of the TeX
|
|
13
|
-
typesetting system by Donald E. Knuth. This is based on the Perl implementation
|
|
14
|
-
of TeX::Hyphen[1] and the Ruby port[2]. The language hyphenation pattern files
|
|
15
|
-
are based on the sources available from CTAN[3] as of 2004.12.19 and have been
|
|
16
|
-
translated by Austin Ziegler.
|
|
17
|
-
|
|
18
|
-
This release is 1.0.2. It is a minor bugfix for the RubyGem release of
|
|
19
|
-
Text::Hyphen to enable the hyphen command-line program. Text::Hyphen represents
|
|
20
|
-
a significant improvement over its predecessor, TeX::Hyphen.
|
|
21
|
-
|
|
22
|
-
== SYNOPSIS:
|
|
23
|
-
|
|
24
|
-
require 'text/hyphen'
|
|
25
|
-
hh = Text::Hyphen.new(:language => 'en_us', :left => 2, :right => 2)
|
|
26
|
-
# Defaults to the above
|
|
27
|
-
hh = TeX::Hyphen.new
|
|
28
|
-
|
|
29
|
-
word = "representation"
|
|
30
|
-
points = hyp.hyphenate(word) #=> [3, 5, 8, 10]
|
|
31
|
-
puts hyp.visualize(word) #=> rep-re-sen-ta-tion
|
|
32
|
-
|
|
33
|
-
Text::Hyphen is truly multilingual[4]. As an example, consider the difference
|
|
34
|
-
between the following:
|
|
35
|
-
|
|
36
|
-
require 'text/hyphen'
|
|
37
|
-
# Using left and right minimum values of 0 ensures that you will
|
|
38
|
-
# see all possible hyphenation points, not just those that meet
|
|
39
|
-
# the minimum width requirements.
|
|
40
|
-
en = Text::Hyphen.new(:left => 0, :right => 0)
|
|
41
|
-
fr = Text::Hyphen.new(:language = "fr", :left => 0, :right => 0)
|
|
42
|
-
|
|
43
|
-
puts en.visualise("organiser") #=> or-gan-iser
|
|
44
|
-
puts fr.visualise("organiser") #=> or-ga-ni-ser
|
|
45
|
-
|
|
46
|
-
As you can see, the hyphenation is distinct between the two hyphenators.
|
|
47
|
-
Additional improvements over TeX::Hyphen include thread safety (except for
|
|
48
|
-
debug control) and (minimal) support for UTF-8.
|
|
49
|
-
|
|
50
|
-
== FUTURE ENHANCEMENTS:
|
|
51
|
-
|
|
52
|
-
* Ruby 1.9 compatibility.
|
|
53
|
-
|
|
54
|
-
== INSTALL:
|
|
55
|
-
|
|
56
|
-
* This release of text-hyphen is only installed with RubyGems.
|
|
57
|
-
|
|
58
|
-
== DEVELOPERS:
|
|
59
|
-
|
|
60
|
-
After checking out the source, run:
|
|
61
|
-
|
|
62
|
-
$ rake newb
|
|
63
|
-
|
|
64
|
-
This task will install any missing dependencies, run the tests/specs,
|
|
65
|
-
and generate the RDoc.
|
|
66
|
-
|
|
67
|
-
== LICENSE:
|
|
68
|
-
|
|
69
|
-
The licensing for Text::Hyphen is complex and somewhat dependent upon the
|
|
70
|
-
languages being used during hyphenation; some languages are held under a more
|
|
71
|
-
strict licence than that granted in the LICENCE file.
|
|
72
|
-
|
|
73
|
-
Copyright 2004 - 2005 Austin Ziegler <austin@rubyforge.org>
|
|
74
|
-
See the LICENCE.txt file for more information.
|
|
75
|
-
|
|
76
|
-
[1] <http://search.cpan.org/author/JANPAZ/TeX-Hyphen-0.140/lib/TeX/Hyphen.pm>
|
|
77
|
-
Maintained by Jan Pazdziora.
|
|
78
|
-
[2] Available at <http://rubyforge.org/projects/text-format>.
|
|
79
|
-
[3] <http://www.ctan.org>
|
|
80
|
-
[4] There are some bugs and design decisions in the original Perl
|
|
81
|
-
implementation of TeX::Hyphen that make it unsuitable for most multilingual
|
|
82
|
-
implementations that carried over to the Ruby port of TeX::Hyphen.
|