text-hyphen 1.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (40) hide show
  1. data/ChangeLog +4 -0
  2. data/Changelog +4 -0
  3. data/INSTALL +6 -0
  4. data/LICENCE +47 -0
  5. data/README +56 -0
  6. data/Rakefile +116 -0
  7. data/bin/hyphen +107 -0
  8. data/lib/text/hyphen.rb +289 -0
  9. data/lib/text/hyphen/language.rb +112 -0
  10. data/lib/text/hyphen/language/ca.rb +174 -0
  11. data/lib/text/hyphen/language/cs.rb +363 -0
  12. data/lib/text/hyphen/language/da.rb +118 -0
  13. data/lib/text/hyphen/language/de1.rb +723 -0
  14. data/lib/text/hyphen/language/de2.rb +685 -0
  15. data/lib/text/hyphen/language/en_uk.rb +791 -0
  16. data/lib/text/hyphen/language/en_us.rb +493 -0
  17. data/lib/text/hyphen/language/es.rb +289 -0
  18. data/lib/text/hyphen/language/et.rb +337 -0
  19. data/lib/text/hyphen/language/eu.rb +115 -0
  20. data/lib/text/hyphen/language/fi.rb +113 -0
  21. data/lib/text/hyphen/language/fr.rb +392 -0
  22. data/lib/text/hyphen/language/ga.rb +608 -0
  23. data/lib/text/hyphen/language/hr.rb +124 -0
  24. data/lib/text/hyphen/language/hsb.rb +180 -0
  25. data/lib/text/hyphen/language/hu1.rb +385 -0
  26. data/lib/text/hyphen/language/hu2.rb +1283 -0
  27. data/lib/text/hyphen/language/ia.rb +73 -0
  28. data/lib/text/hyphen/language/id.rb +97 -0
  29. data/lib/text/hyphen/language/is.rb +390 -0
  30. data/lib/text/hyphen/language/it.rb +135 -0
  31. data/lib/text/hyphen/language/la.rb +134 -0
  32. data/lib/text/hyphen/language/mn.rb +103 -0
  33. data/lib/text/hyphen/language/nl.rb +1253 -0
  34. data/lib/text/hyphen/language/no1.rb +303 -0
  35. data/lib/text/hyphen/language/no2.rb +138 -0
  36. data/lib/text/hyphen/language/pl.rb +480 -0
  37. data/lib/text/hyphen/language/pt.rb +56 -0
  38. data/lib/text/hyphen/language/sv.rb +449 -0
  39. data/tests/tc_text_hyphen.rb +62 -0
  40. metadata +90 -0
@@ -0,0 +1,115 @@
1
+ # Hyphenation patterns for Text::Hyphen in Ruby: Basque
2
+ # Converted from the TeX hyphenation/bahyph.tex file, by Juan M.
3
+ # Aguirregabiria (wtpagagj@lg.ehu.es) in 1997 based on work by Julio
4
+ # Sanchez (jsanchez@gmv.es) in 1991.
5
+ #
6
+ # The original copyright holds and is reproduced in the source to this file.
7
+ # The Ruby version of these patterns are copyright 2004 Austin Ziegler and
8
+ # are available under an MIT license. See LICENCE for more information.
9
+ #--
10
+ # Hyphenation patterns for Basque.
11
+ # This file has been written by Juan M. Aguirregabiria (wtpagagj@lg.ehu.es)
12
+ # on February 1997 and is based on the shyphen.sh script that generates the
13
+ # Spanish patterns as compiled by Julio Sanchez (jsanchez@gmv.es) on
14
+ # September 1991. The original Copyright follows and applies also to this
15
+ # file whose last version will be always available by anonymous ftp from
16
+ # tp.lc.ehu.es or by poynting your Web browser to
17
+ # http://tp.lc.ehu.es/basque.html
18
+ #
19
+ # Hyphenation patterns for Spanish.
20
+ # Compiled by Julio Sanchez (jsanchez@gmv.es) on September 1991. These
21
+ # patterns have been derived from "On Word Division in Spanish", Jos'e A.
22
+ # Ma~nas, Communications of the ACM, and implemented in his package ftc. You
23
+ # can get ftc and a draft of the abovementioned paper from goya.dit.upm.es
24
+ # in src/text.proc/ftc.Z. FTP access may be available. Otherwise, send help
25
+ # to info@goya.dit.upm.es for details on use of the mail server.
26
+ #
27
+ # Rules mentioned below are those described in that paper. After several
28
+ # unsatisfactory attempts to pretend I knew better, these patterns closely
29
+ # follow that paper. Pattern 'tl' is not considered. It is conflictive and
30
+ # ftc does not use it either.
31
+ #
32
+ # These patterns have been generated by shyphen.sh version 1.0, shyphen.sh
33
+ # is a sh script that allows a number of choices. Full benefit from some of
34
+ # these options can only be obtained if appropriate fonts are available.
35
+ #
36
+ # Follows a copyright notice. This is not in the public domain, but the
37
+ # copyright is essentially a hold-harmless clause. That is, use it at will,
38
+ # but don't sue me if you don't like it.
39
+ #
40
+ # COPYRIGHT NOTICE
41
+ #
42
+ # These patterns and the generating sh script are Copyright (c) GMV 1991
43
+ # These patterns were developed for internal GMV use and are made public in
44
+ # the hope that they will benefit others. Also, spreading these patterns
45
+ # throughout the Spanish-language TeX community is expected to provide
46
+ # back-benefits to GMV in that it can help keeping GMV in the mainstream of
47
+ # spanish users. However, this is given for free and WITHOUT ANY WARRANTY.
48
+ # Under no circumstances can Julio Sanchez, GMV, Jos'e A. Ma~nas or any
49
+ # agents or representatives thereof be held responsible for any errors in
50
+ # this software nor for any damages derived from its use, even in case any
51
+ # of the above has been notified of the possibility of such damages. If any
52
+ # such situation arises, you responsible for repair. Use of this software is
53
+ # an explicit acceptance of these conditions.
54
+ #
55
+ # You can use this software for any purpose. You cannot delete this
56
+ # copyright notice. If you change this software, you must include comments
57
+ # explaining who, when and why. You are kindly requested to send any changes
58
+ # to tex@gmv.es. If you change the generating script, you must include code
59
+ # in it such that any output is clearly labeled as generated by a modified
60
+ # script.
61
+ #
62
+ # Despite the lack of warranty, we would like to hear about any problem you
63
+ # find. Please report problems to tex@gmv.es.
64
+ #
65
+ # END OF COPYRIGHT NOTICE
66
+ #
67
+ # Options included in this set: basic
68
+ # Open vowels: a e o
69
+ # Closed vowels: i u
70
+ # Consonants: b c d f g j k l m n p q r s t v w x y z
71
+ #
72
+ # Some of the patterns below represent combinations that never happen in
73
+ # Basque. Would they happen, they would be hyphenated according to the
74
+ # rules.
75
+ #
76
+ # This keeps {cat|lc}code changes, if any, local. Nice to users of
77
+ # multilingual versions. These are the minimum changes needed to process the
78
+ # patterns. These and other changes will have to be re-enacted when Basque
79
+ # be established as the current language. See the babel docs if you don't
80
+ # understand this.
81
+ #++
82
+ require 'text/hyphen/language'
83
+
84
+ Text::Hyphen::Language::EU = Text::Hyphen::Language.new do |lang|
85
+ lang.patterns <<-PATTERNS
86
+ % Rule SR1
87
+ % Vowels are kept together by the defaults
88
+
89
+ % Rule SR2
90
+ % Attach vowel groups to left consonant
91
+ 1ba 1be 1bo 1bi 1bu 1ca 1ce 1co 1ci 1cu 1da 1de 1do 1di 1du 1fa 1fe 1fo 1fi
92
+ 1fu 1ga 1ge 1go 1gi 1gu 1ja 1je 1jo 1ji 1ju 1ka 1ke 1ko 1ki 1ku 1la 1le 1lo
93
+ 1li 1lu 1ma 1me 1mo 1mi 1mu 1na 1ne 1no 1ni 1nu 1pa 1pe 1po 1pi 1pu 1qa 1qe
94
+ 1qo 1qi 1qu 1ra 1re 1ro 1ri 1ru 1sa 1se 1so 1si 1su 1ta 1te 1to 1ti 1tu 1va
95
+ 1ve 1vo 1vi 1vu 1wa 1we 1wo 1wi 1wu 1xa 1xe 1xo 1xi 1xu 1ya 1ye 1yo 1yi 1yu
96
+ 1za 1ze 1zo 1zi 1zu
97
+
98
+ % Rule SR3
99
+ % Build legal consonant groups, leave other consonants bound to
100
+ % the previous group. This overrides part of the SR2 pattern
101
+ % group.
102
+ 1l2la 1l2le 1l2lo 1l2li 1l2lu 1r2ra 1r2re 1r2ro 1r2ri 1r2ru 1t2sa 1t2se
103
+ 1t2so 1t2si 1t2su 1t2xa 1t2xe 1t2xo 1t2xi 1t2xu 1t2za 1t2ze 1t2zo 1t2zi
104
+ 1t2zu 1b2la 1b2le 1b2lo 1b2li 1b2lu 1b2ra 1b2re 1b2ro 1b2ri 1b2ru 1d2ra
105
+ 1d2re 1d2ro 1d2ri 1d2ru 1f2la 1f2le 1f2lo 1f2li 1f2lu 1f2ra 1f2re 1f2ro
106
+ 1f2ri 1f2ru 1g2la 1g2le 1g2lo 1g2li 1g2lu 1g2ra 1g2re 1g2ro 1g2ri 1g2ru
107
+ 1k2la 1k2le 1k2lo 1k2li 1k2lu 1k2ra 1k2re 1k2ro 1k2ri 1k2ru 1p2la 1p2le
108
+ 1p2lo 1p2li 1p2lu 1p2ra 1p2re 1p2ro 1p2ri 1p2ru 1t2ra 1t2re 1t2ro 1t2ri
109
+ 1t2ru
110
+ % We now avoid some problematic breaks.
111
+ su2b2r su2b2l
112
+ PATTERNS
113
+ end
114
+ Text::Hyphen::Language::BAQ = Text::Hyphen::Language::EU
115
+ Text::Hyphen::Language::EUS = Text::Hyphen::Language::EU
@@ -0,0 +1,113 @@
1
+ # Hyphenation patterns for Text::Hyphen in Ruby: Finnish
2
+ # Converted from the TeX hyphenation/fihyph.tex file, Kauko Saarinen,
3
+ # Computing Centre, University of Jyvaskyla, Finland, 1986 - 1989.
4
+ #
5
+ # The original copyright holds and is reproduced in the source to this file.
6
+ # The Ruby version of these patterns are copyright 2004 Austin Ziegler and
7
+ # are available under an MIT license. See LICENCE for more information.
8
+ #--
9
+ # -----> Finnish hyphenation patterns for MLPCTeX <------
10
+ # First release January -86 by Kauko Saarinen,
11
+ # Computing Centre, University of Jyvaskyla, Finland
12
+ #
13
+ # Completely rewritten January -88. The new patterns make
14
+ # much less mistakes with foreign and compound words.
15
+ # The article "Automatic Hyphenation of Finnish"
16
+ # by Professor Fred Karlsson is also referred
17
+ # ---------------------------------------------------------
18
+ #
19
+ # 8th March -89 (vers. 2.2), some vowel triples by Fred Karlsson added.
20
+ #
21
+ # ********* Patterns may be freely distributed **********
22
+ #++
23
+ require 'text/hyphen/language'
24
+
25
+ Text::Hyphen::Language::FI = Text::Hyphen::Language.new do |lang|
26
+ lang.patterns <<-PATTERNS
27
+ 1ba 1be 1bi 1bo 1bu 1by 1da 1de 1di 1do 1du 1dy 1d� 1d� 1fa 1fe 1fi 1fo 1fu
28
+ 1fy 1ga 1ge 1gi 1go 1gu 1gy 1g� 1g� 1ha 1he 1hi 1ho 1hu 1hy 1h� 1h� 1ja 1je
29
+ 1ji 1jo 1ju 1jy 1j� 1j� 1ka 1ke 1ki 1ko 1ku 1ky 1k� 1k� 1la 1le 1li 1lo 1lu
30
+ 1ly 1l� 1l� 1ma 1me 1mi 1mo 1mu 1my 1m� 1m� 1na 1ne 1ni 1no 1nu 1ny 1n� 1n�
31
+ 1pa 1pe 1pi 1po 1pu 1py 1p� 1p� 1ra 1re 1ri 1ro 1ru 1ry 1r� 1r� 1sa 1se 1si
32
+ 1so 1su 1sy 1s� 1s� 1ta 1te 1ti 1to 1tu 1ty 1t� 1t� 1va 1ve 1vi 1vo 1vu 1vy
33
+ 1v� 1v�
34
+
35
+ % ------- Some common words borrowed from other languages -------
36
+ % ------- This part could be updated from time to time -------
37
+ 1st2r % -stressi, -strategia etc.
38
+
39
+ % ------ Some special cases occuring with compound words only ----
40
+ % ------ There still remains well known problem as 'kaivos\-aukko' etc.
41
+ %a1y (common in borrowed words)
42
+ �2y y1a2 y1o2 o1y �2y u1y2 y1u2 �3a2 �3o2 �3a2 �3o2 �1u2 �1u2
43
+ a1� % (a1�2 ei mahdollinen!)
44
+ a1� o1� o1� u1�2 u1�2 �2� �2� �2� �2�
45
+ % lyhyt/pitka -vokaalipareja, tavallisesti sanarajalla
46
+ aa1i2 % maa-ikkuna
47
+ aa1e2 aa1o2 % maa-ottelu
48
+ aa1u2 % uraa-uurtava
49
+ ee1a2 % tee-astia
50
+ ee1i2 % tee-istutus
51
+ ee1u2 % varietee-uusinta
52
+ ee1y2 ii1a2 ii1e2 ii1o2 uu1a2 uu1e2 % puu-esine
53
+ uu1o2 % puu-osa
54
+ uu1i2 % puu-istutus
55
+ e1aa i1aa o1aa u1aa u1ee a1uu % kala-uuni
56
+ i1uu % ravi-uutiset
57
+ e1uu % virhe-uutinen
58
+ o1uu % radio-uutiset
59
+ ��1i ��1e ��3y i1�� e1�� y1�� i1�� % yhti-��n etc.
60
+ %i1eu % keski-eurooppalainen
61
+
62
+ % vokaalikolmikkoja etc. yhdyssanojen rajoissa
63
+ % -------- vowel triples by Fred Karlsson
64
+ a1ei a1oi e1ai % e1oi % ambiguous for ex. video-ilme (8.3.89)
65
+ i1au % u1oi % ambiguous (8.3.89)
66
+ y1ei ai1a ai1e ai1o ai1u au1a au1e eu1a ie1a ie1o %ie1u % ambiguous
67
+ ie1y io1a2 io1e2 iu1a iu1e iu1o oi1a oi1e oi1o oi1u
68
+ o1ui % veto-uistin, himo-uimari, etc.
69
+ ou1e ou1o ue1a ui1e uo1a % uo1i % ambiguous
70
+ uo1u
71
+ % ---------------- End of vowel triples --------------------
72
+ e1�2 �1e2 .�2 % don't hyphenate �-lyllinen etc.
73
+
74
+ % The following patterns contain no general scientific rule. They are
75
+ % selected more or less intuitively to solve problems with common and
76
+ % frequently appearing compound words. However, every pattern resolves more
77
+ % than only one hyphenation problem.
78
+ u2s % estaa virheita yhdyssanojen yhteydessa
79
+ yli1o2p % yli-opisto etc.
80
+ ali1a2v % ali-avaruus etc.
81
+ 1sp2li % kuutio-splini etc.
82
+ alous1 keus1 % oikeus-oppinut etc.
83
+ rtaus1 2s1ohje % -sohjelma etc.
84
+ 2s1a2sia % nais-asianainen etc.
85
+ 1a2sian % neuvottelu-asian etc.
86
+ 1a2siat % koti-asian (ei kotia-sian)
87
+ 1a2sioi % talous-asioita etc.
88
+ r2as l2as % tikku-rasia etc
89
+ 2s1o2pisk % xs-opiskelija etc
90
+ 2n1o2pet 2s1a2loi 2n1o2pist % kansan-opisto etc.
91
+ 2s1o2pist 2s1o2sa % xxs-osakas etc.
92
+ 2n1o2sa % asian-osainen etc.
93
+ alkei2s1 perus1 2s1i2dea. 2s1i2dean 2s1e2sity % xs-esitys etc
94
+ 2n1e2dus % kansan-edustaja etc.
95
+ 2s1ajatu % -ajatus etc.
96
+ 2s1ase 2s1apu 2s1y2rit % yhteis-yritys etc.
97
+ .ydi2n1 .suu2r1a2 % suur-ajot etc.
98
+ 2s1y2hti 2n1otto 2n1oton 2n1anto 2n1anno 2n1a2jan 2n1aika 2n1o2mai 2n1y2lit
99
+ 2s1a2len 2n1a2len 1a2siaka2s1 ulo2s1 % ulos-ajo
100
+ 2n1a2jo % kiven-ajo
101
+ 2s1a2jo
102
+
103
+ % *** The following rules may be used on user's responsibility ***
104
+ % *** for example, may be needed with narrow columns ***
105
+ % >>>>>>>>>>> a1e a1o e1o o1a u1a <<<<<<<<<<<
106
+
107
+ % ----- Some districting rules by Professor Fred Karlsson's ideas ------
108
+ b2l 1b2lo bib3li b2r 1b2ri 1b2ro 1b2ru d2r 1d2ra f2l 1f2la f2r 1f2ra 1f2re
109
+ g2l 1g2lo g2r 1g2ra k2l 1k2ra 1k2re 1k2ri 1k2v 1k2va p2l p2r 1p2ro c2l q2v
110
+ 1q2vi sc2h ts2h ch2r
111
+ PATTERNS
112
+ end
113
+ Text::Hyphen::Language::FIN = Text::Hyphen::Language::FI
@@ -0,0 +1,392 @@
1
+ # Hyphenation patterns for Text::Hyphen in Ruby: French
2
+ # Converted from the TeX hyphenation/frhyph.tex file, by Jacques
3
+ # Desarmenien (1984 - 1986), Norman Buckle, Michael Ferguson (1988),
4
+ # Justin Bur, and others.
5
+ #
6
+ # The original copyright holds and is reproduced in the source to this file.
7
+ # The Ruby version of these patterns are copyright 2004 Austin Ziegler and
8
+ # are available under an MIT license. See LICENCE for more information.
9
+ #--
10
+ # frhyph.tex % French hyphenation patterns
11
+ #%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
12
+ # This file is available for free and can used and redistributed
13
+ # asis for free. Modified versions should have another name.
14
+ #%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
15
+ #message{frhyph.tex - French hyphenation patterns (V2.11) <2002/01/16>}
16
+ #
17
+ #%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
18
+ # *BEFORE* using this file *PLEASE* run checksum on it:
19
+ # checksum -v frhyph.tex
20
+ # to make sure that it hasn't been damaged.
21
+ # Then if you notice anything wrong in french hyphenation please report to
22
+ # D. Flipo and B. Gaulle at the email address: cesure-l@gutenberg.eu.org
23
+ #%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
24
+ #% checksum = "37094 1457 3075 34744"
25
+ #%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
26
+ #%%%%%%% The most famous good guys who worked hard to obtain something usable.
27
+ # Jacques Desarmenien, Universite de Strasbourg :
28
+ # - << how to run TeX in a French environment: hyphenation, fonts,
29
+ # typography. >> in Tugboat, 5 (1984) 91-102. and TeX85 conference
30
+ # - << La division par ordinateur des mots francais :
31
+ # application a TeX >> in TSI vol. 5 No 4, 1986 (C) AFCET-
32
+ # Gauthier-Villars
33
+ # Norman Buckle, UQAH (nb; many additions)
34
+ # Michael Ferguson, INRS-Telecommunications (mjf) June 1988
35
+ # Justin Bur, Universite de Montreal (jbb; checked against original list)
36
+ # all patterns including apostrophe missing from nb list
37
+ # after that, GUTenberg and specially Daniel Flipo and Bernard Gaulle
38
+ # did their best effort to improve the list of patterns.
39
+ #
40
+ # -----------------------------------------------------------------
41
+ #
42
+ # Adaption of these patterns for
43
+ # - TeX Version 3.x and MLTeX 3.x (2.x)
44
+ # and
45
+ # - all fonts in T1/`Cork' and/or CM/OT1 encoding
46
+ # by Bernd Raichle 1996/08/28 (using ideas from `ghyph31.tex'
47
+ # as of 1994-02-13 maintained by Bernd Raichle).
48
+ # (An adaption for the old MLTeX 2.x exists but can not be
49
+ # tested in lack of an executable.)
50
+ #++
51
+ require 'text/hyphen/language'
52
+
53
+ Text::Hyphen::Language::FR = Text::Hyphen::Language.new do |lang|
54
+ lang.patterns <<-PATTERNS
55
+ % *, phonetic
56
+ 2'2
57
+
58
+ % a, phonetic
59
+ .a4 'a4 .�4 '�4
60
+ ab2h % df-bg 1998/02/07 for abhorrer
61
+ ad2h % df-bg 1998/02/07 for adh�sion & co
62
+ % a, etymological
63
+ .ab3r�a 'ab3r�a a1�2dre .ae3s4ch 'ae3s4ch 1alcool a2l1algi .amino1a2c
64
+ 'amino1a2c .ana3s4tr 'ana3s4tr 1a2nesth�si .anti1a2 'anti1a2 .anti1e2
65
+ 'anti1e2 .anti1�2 .anti2enne 'anti2enne 'anti1�2 .anti1s2 'anti1s2 .apo2s3ta
66
+ 'apo2s3ta apo2s3tr archi1�2pis .as2ta 'as2ta a2s3tro
67
+
68
+ % b, phonetic
69
+ 1ba 1b� 1be 1b� 1b� 1b� 4be. 4bes.
70
+ 2bent. % mute syllable: tombent (df) 22/02/94
71
+ 1bi 1b� 1b2l 4ble. 4bles.
72
+ 2blent. % mute syllable: troublent (df) 28/02/94
73
+ 1bo 1b� 1b2r 4bre. 4bres.
74
+ 2brent. % mute syllable: palabrent (df) 28/02/94
75
+ 1bu 1b� 1by
76
+ % b, etymological
77
+ .bai2se3main % hyphen disappeared from french 5/2/94
78
+ .bi1a2c .bi1a2t % like .tri1a2t for tri-athlon bg 12/27/93
79
+ .bi1au .bio1a2 .bi2s1a2 .bi1u2
80
+
81
+ % c, phonetic
82
+ 1� 1ca 1c� 1ce 1c� 1c� 1c� 4ce. 4ces.
83
+ % words ending with -cent (df) 22/02/94
84
+ 2cent. ja3cent. ac3cent. �3cent. munifi3cent. r�ti3cent. privatdo3cent.
85
+ inno3cent. es3cent. acquies4cent. is3cent. immis4cent. .ch4 1c2h 4ch. 2chb
86
+ 4che. 4ches. 2chent. % mute syllable: touchent (df) 22/02/94
87
+ ca3ou3t2 % for caoutchou... added 3/1/94 df-bg
88
+ 2chg ch2l 4chle. 4chles. 2chm 2chn 2chp ch2r 4chre. 4chres. 2chs 2cht 2chw
89
+ 1ci 1c� 1c2k 4ck. 2ckb 4cke. 4ckes.
90
+ 2ckent. % mute syllable: stockent (df) 22/02/94
91
+ 2ckf 2ckg 2ck3h 2ckp 2cks 2ckt 1c2l 4cle. 4cles.
92
+ 2clent. % mute syllable: encerclent (df) 28/02/94
93
+ 1co 1c� 1c�0 % final zero essential to terminate cs
94
+ 1c2r 4cre. 4cres. 2crent. % mute syllable: massacrent (df) 28/02/94
95
+ 1cu 1c� 1cy .cul4 % -- as .con4 .cons4 (march 92)
96
+ % c, etymological
97
+ .ch�2vre3feuille % hyphen disappeared from french 5/2/94
98
+ chlo2r3a2c chlo2r3�2t .ci2s1alp co1acc co1acq co1a2d co1ap co1ar co1assoc
99
+ co1assur co1au co1ax co1�2 co1ef co1en co1ex .con4 % missing from nb list
100
+ .cons4 % missing from nb list
101
+ .contre1s2c .contre3ma�tre % hyphen disappeared from french 5/2/94
102
+ co2nurb .co1o2 .co2o3lie
103
+
104
+ % d, phonetic
105
+ 1d' 1da 1d� d1d2h 1de 1d� 1d� 1d� 4de. 4des. 1di 1d�
106
+ 2dlent. % mute syllable: jodlent (df) 28/02/94
107
+ 1do 1d� 1d2r 4dre. 4dres.
108
+ 2drent. % mute syllable: engendrent (df) 28/02/94
109
+ d1s2 1du 1d� 1dy
110
+ % d, etymological
111
+ .dacryo1a2 .d�1a2 .d�1io .d�1o2
112
+ .d�2s % originaly in JD file
113
+ %.d�2s1a2 removed 09/17/92 because wrong for the original JD 500 words test
114
+ .d�3s2a3cr .d�s2a3m % .d�s2a2mi introduced 09/17/92 bec. i can't see why
115
+ % d�samidonner ran in JD. Moved to .d�s2a3m df 12/27/93.
116
+ .d�3s2a3tell .d�3s2astr .d�3s2c % 1 moved 3 due to .d�2s 09/17/92
117
+ %.d�2s1e2 removed 09/17/92 because wrong for the original JD 500 words test
118
+ .d�2s1�2 .d�3s2�3gr .d�3s2ensib .d�3s2ert .d�3s2exu
119
+ %.d�2s3h removed 09/17/92 because wrong for the original JD 500 words test
120
+ .d�2s1i2 .d�3s2i3d .d�3s2i3gn .d�3s2i3li .d�3s2i3nen .d�3s2invo .d�3s2i3r
121
+ .d�3s2ist
122
+ %.d�2s1o2 removed 09/17/92 because wrong for the original JD 500 words test
123
+ .d�3s2o3d� .d�2s1�0 % final zero essential to terminate cs
124
+ .d�3s2o3l .d�3s2o3pil .d�3s2orm .d�3s2orp .d�3s2oufr
125
+ .d�3s2p % 1 moved 3 due to .d�2s 09/17/92
126
+ .d�3s2t % 1 moved 3 due to .d�2s 09/17/92
127
+ .d�2s1u2n 3d2hal 3d2houd di2s3cop .di1a2c� .di1a2cid .di1ald .di1a2mi
128
+ .di1a2tom .di1e2n .di2s3h .dy2s3 .dy2s1a2 .dy2s1i2
129
+ .dy2s1o2 % missing from nb list
130
+ .dy2s1u2
131
+ % words ending with -dent (df) 22/02/94
132
+ 2dent. d�ca3dent. �3dent. cci3dent. inci3dent. confi3dent. tri3dent.
133
+ dissi3dent. chien3dent. .ar3dent. impu3dent. pru3dent.
134
+
135
+ % e, phonetic
136
+ .e4 'e4 .�4 '�4 .�4 '�4 .�4 '�4
137
+ % e, etymological
138
+ �d2hi % df-bg 1998/02/07 for r�dhibitoire
139
+ 1�2drie 1�2drique 1�2lectr 1�2l�ment .en1a2 'en1a2 1�2nerg e2n1i2vr .en1o2
140
+ 'en1o2 �pi2s3cop �pi3s4cope e2s3cop .eu2r1a2 'eu2r1a2 eu1s2tat extra1
141
+ extra2c extra2i
142
+
143
+ % f, phonetic
144
+ 1fa 1f� 1fe 1f� 1f� 1f� 4fe. 4fes.
145
+ 2fent. % mute syllable: agrafent chauffent (df) 22/02/94
146
+ 1fi 1f� 1f2l 4fle. 4fles.
147
+ 2flent. % mute syllable: gonflent (df) 28/02/94
148
+ 1fo 1f� 1f2r 4fre. 4fres.
149
+ 2frent. % mute syllable: balafrent (df) 28/02/94
150
+ f1s2 1fu 1f� 1fy
151
+ % g, phonetic
152
+ 1ga 1g� 1ge 1g� 1g� 1g� 4ge. 4ges. 1g2ha 1g2he 1g2hi 1g2ho 1g2hy 1gi 1g�
153
+ 1g2l 4gle. 4gles. 2glent. % mute syllable: meuglent (df) 28/02/94
154
+ 1g2n 'a2g3nat % (df) 16/01/02
155
+ .a2g3nat % (df) 16/01/02
156
+ a2g3nos % (df) 16/01/02 (pattern dia2g3n deleted)
157
+ co2g3niti % (df) 16/01/02
158
+ 'i2g3n� % (df) 16/01/02
159
+ .i2g3n� % (df) 16/01/02
160
+ 'i2g3ni % (df) 16/01/02
161
+ .i2g3ni % (df) 16/01/02
162
+ .ma2g3nicide % (df) 16/01/02
163
+ .ma2g3nificat % (df) 16/01/02
164
+ .ma2g3num % (df) 16/01/02
165
+ o2g3nomoni % (df) 16/01/02
166
+ o2g3nosi % (df) 16/01/02
167
+ .pro2g3nath % (df) 16/01/02
168
+ pu2g3nable % (df) 16/01/02
169
+ pu2g3nac % (df) 16/01/02
170
+ .sta2g3n .syn2g3nath % (df) 16/01/02
171
+ wa2g3n 4gne. 4gnes. 2gnent. % mute syllable: accompagnent (df) 28/02/94
172
+ 1go 1g� 1g2r 4gre. 4gres. 2grent. % mute syllable: immigrent (df) 28/02/94
173
+ 1gu 1g� g1s2 4gue. 4gues. 1gy
174
+ % words ending with -guent (df) 22/02/94
175
+ 2guent. .on3guent. 'on3guent.
176
+ % g, etymological
177
+ % words ending with -gent (df) 22/02/94
178
+ 2gent. r�3gent. entre3gent. indi3gent. dili3gent. intelli3gent. indul3gent.
179
+ tan3gent. rin3gent. contin3gent. .ar3gent. 'ar3gent. ser3gent. ter3gent.
180
+ r�sur3gent.
181
+
182
+ % h, phonetic
183
+ 1ha 1h� 1he 1h� 1h� 1h� 4he. 4hes. 1hi 1h� 1ho 1h� 1hu 1h� 1hy
184
+ % h, etymological
185
+ h�mi1� h�mo1p2t hypera2 hypere2 hyper�2 hyperi2 hypero2 hypers2 hype4r1
186
+ hyperu2 hypo1a2 hypo1e2 % missing from nb list
187
+ hypo1�2 hypo1i2 hypo1o2 hypo1s2 hypo1u2
188
+
189
+ % i, phonetic
190
+ .i4 'i4 .�4 '�4
191
+ % ill patterns missing from nb list
192
+ il2l cil3l rcil4l ucil4l vacil4l gil3l hil3l lil3l l3lion mil3l mil4let
193
+ �mil4l semil4l rmil4l armil5l capil3l papil3la papil3le papil3li papil3lom
194
+ pupil3l piril3l thril3l cyril3l ibril3l pusil3l .stil3l distil3l instil3l
195
+ fritil3l boutil3l vanil3lin vanil3lis vil3l avil4l chevil4l uevil4l uvil4l
196
+ xil3l
197
+ % end of ill patterns
198
+
199
+ % i, etymological
200
+ i1algi i1arthr i1�2dre 1informat % missing from nb list
201
+ .in1a2 'in1a2 .in2a3nit 'in2a3nit .in2augur 'in2augur .in1e2 'in1e2 .in1�2
202
+ 'in1�2 .in2effab % missing from nb list
203
+ 'in2effab .in2�3lucta 'in2�3lucta .in2�3narra 'in2�3narra .in2ept 'in2ept
204
+ .in2er 'in2er .in2exora % missing from nb list
205
+ 'in2exora .in1i2 'in1i2 .in2i3miti 'in2i3miti .in2i3q 'in2i3q .in2i3t
206
+ 'in2i3t .in1o2 'in1o2 .in2o3cul 'in2o3cul .in2ond 'in2ond .in1s2tab
207
+ 'in1s2tab 'inte4r3 .intera2 'intera2 .intere2 'intere2 .inter�2 'inter�2
208
+ .interi2 'interi2 .intero2 'intero2 .inte4r3 .interu2 'interu2 .inters2
209
+ 'inters2 .in1u2 'in1u2 .in2uit 'in2uit .in2u3l 'in2u3l io1a2ct i1oxy i1s2tat
210
+
211
+ % j, phonetic
212
+ 1j 2jk 4je. 4jes. 2jent. % mute syllable: gal�jent (df) 22/02/94
213
+
214
+ % k, phonetic
215
+ 1ka 1k� 1ke 1k� 1k� 1k� 4ke. 4kes.
216
+ 2kent. % mute syllable: jerkent (df) 22/02/94
217
+ 1k2h 4kh. .kh4 1ki 1k� 1ko 1k� 1k2r 1ku 1k� 1ky
218
+
219
+ % l, phonetic
220
+ 1la 1l� 1l� 1le 1l� 1l� 1l� 4le. 4les. 1li 1l� 1lo 1l� l1s2t 1lu 1l� 1ly
221
+
222
+ % l, etymological
223
+ la2w3re
224
+ % words ending with -lent (df) 22/02/94
225
+ 2lent. .ta3lent. iva3lent. �quiva4lent. monova3lent. polyva3lent. re3lent.
226
+ .do3lent. indo3lent. inso3lent. turbu3lent. succu3lent. f�cu3lent.
227
+ trucu3lent. opu3lent. corpu3lent. ru3lent. sporu4lent.
228
+
229
+ % m, phonetic
230
+ 1ma 1m� 1me 1m� 1m� 1m� 4me. 4mes. 1mi 1m� 1mo 1m�
231
+ 1m�0 % final zero essential to terminate cs
232
+ m1s2 1mu 1m� 1my
233
+ % m, etymological
234
+ .ma2c3k .macro1s2c .ma2l1a2dres .ma2l1a2dro .ma2l1ais� .ma2l1ap .ma2l1a2v
235
+ .ma2l1en .ma2l1int .ma2l1oc .ma2l1o2d .ma2r1x % nb (jbb: ?)
236
+ .m�2g1oh .m�2sa % missing from nb list
237
+ .m�3san % missing from nb list
238
+ .m�2s1es .m�2s1i .m�2s1u2s .m�ta1s2ta
239
+ % words ending with -ment (df) 22/02/94
240
+ �2ment. da2ment. fa2ment. amalga2ment. cla2ment. ra2ment. temp�ra3ment.
241
+ ta2ment. testa3ment. qua2ment. �2ment. car�2ment. diaphrag2ment. ryth2ment.
242
+ ai2ment. rai3ment. ab�2ment. �ci2ment. vidi2ment. subli2ment. �li2ment.
243
+ reli2ment. mi2ment. ani2ment. veni2ment. ri2ment. d�tri3ment. nutri3ment.
244
+ inti2ment. esti2ment. l2ment. flam2ment. gram2ment. .gem2ment. om2ment.
245
+ .com3ment. �2ment. slalo2ment. chro2ment. to2ment. ar2ment. .sar3ment.
246
+ er2ment. antifer3ment. .ser3ment. fir2ment. or2ment. as2ment. au2ment.
247
+ �cu2ment. fu2ment. hu2ment. fichu3ment. llu2ment. plu2ment. bou2ment.
248
+ bru2ment. su2ment. tu2ment. .milli1am 1m2n�mo 1m2n�s 1m2n�si .mono1a2
249
+ .mono1e2 .mono1�2 .mono1i2 .mono1�2d� .mono1o2 .mono1u2 .mono1s2
250
+ mon2t3r�al % missing from nb list
251
+ moye2n1�2g
252
+
253
+ % n, phonetic
254
+ 1na 1n� 1ne 1n� 1n� 1n� 4ne. 4nes. 1ni 1n� 1no 1n�
255
+ 1n�0 % final zero essential to terminate cs
256
+ 1nu 1n� n1x 1ny
257
+ % n, etymological
258
+ % words ending with -nent (df) 22/02/94
259
+ 2nent. % fric-tionnent - syllable muette - bg 27/12/93
260
+ r�ma3nent. imma3nent. perma3nent. .�mi3nent. pr��mi3nent. pro�mi3nent.
261
+ sur�mi3nent. immi3nent. conti3nent. perti3nent. absti3nent. .no2n1obs
262
+ n3s2at. n3s2ats.
263
+
264
+ % o, phonetic
265
+ .o4 'o4 '�4 .�4 %'�2 % mjf % deleted 3/1/94 df-bg
266
+ % o, etymological
267
+ o2b3long 1octet % missing from nb list
268
+ o1d2l o1�2dre o1ioni ombud2s3 omni1s2 o1s2tas o1s2tat o1s2t�ro o1s2tim
269
+ o1s2tom o1s2trad o1s2tratu o1s2triction .oua1ou 'oua1ou .ovi1s2c 'ovi1s2c
270
+ oxy1a2
271
+
272
+ % p, phonetic
273
+ 1pa 1p� 1pe 1p� 1p� 1p� 4pe. 4pes. 1p2h .ph4 4ph. 4phe. 4phes.
274
+ 2phent. % mute syllable: triomphent (df) 22/02/94
275
+ ph2l 4phle. 4phles. 2phn ph2r 4phre. 4phres. 2phs 2pht 1pi 1p� 1p2l 4ple.
276
+ 4ples. 2plent. % mute syllable: accouplent (df) 28/02/94
277
+ 1po 1p� 1p2r 4pre. 4pres. 2prent. % mute syllable: empourprent (df) 28/02/94
278
+ 1pu 1p� 1py
279
+ % p, etymological
280
+ pal�o1�2 .pa2n1a2f .pa2n1a2m� .pa2n1a2ra .pa2n1is .pa2n1o2ph .pa2n1opt
281
+ .pa2r1a2che .pa2r1a2ch� .para1s2 .pa2r3h�
282
+ % words ending with -pent (df) 22/02/94
283
+ 2pent. re3pent. .ar3pent. 'ar3pent. ser3pent.
284
+ .pen2ta % pent- or penta- but never pen-ta bg 12/27/93
285
+ per3h p�2nul % p�2n1ul moved back 09/17/92 to JD def.
286
+ .pe4r .per1a2 .per1e2 .per1�2 .per1i2 .per1o2 .per1u2
287
+ p�1r2�2q % 2r1 moved 09/17/92 to 1r2(it was a typo)
288
+ .p�ri1os .p�ri1s2 .p�ri2s3s .p�ri2s3ta .p�ri1u2 .phalan3s2t photo1s2
289
+ 3ph2tal� 3ph2tis
290
+ % Here is an example of a pb involving phonetic and etymologic patterns 5/94
291
+ % .phyto3ph2 % originaly, but wrong for phy-toph-thora 9/92
292
+ % .phy2topha % for -pharmacie but wrong for phyto-biol.. 5/94
293
+ .pluri1a 1p2n� 1p2neu po1astre poly1a2 poly1e2 poly1�2 poly1�2 poly1i2
294
+ poly1o2 poly1s2 poly1u2 .pon2tet % JD hypenated it asis 09/17/92, exception
295
+ .pos2t3h .pos2t1in .pos2t1o2 .pos2t3r .post1s2 .pr�1a2
296
+ .pr�2a3la % missing from nb list
297
+ .pr�2au .pr�1�2 .pr�1e2 .pr�1i2 .pr�1o2 .pr�1u2 .pr�1s2 .pro1�2 .pro1s2c�
298
+ pro2s3tat .prou3d2h 1p2sych .psycho1a2n 1p2t�r 1p2t�r .pud1d2l
299
+
300
+ % q
301
+ 1q 4que. 4ques.
302
+ % words ending with -quent (df) 22/02/94
303
+ 2quent. �3quent. �lo3quent. grandilo3quent.
304
+
305
+ % r, phonetic
306
+ 1ra 1r� 1re 1r� 1r� 1r� 1r2h 4rhe. 4rhes. 1ri 1r� 1ro 1r� 1ru 1r� 1ry
307
+ % r, etymological
308
+ radio1a2 % missing from nb list
309
+ .r�1a2 .r�2a3le .r�2a3lis .r�2a3lit .r�2aux .r�1�2 .r�1e2 .r�2el .r�2er
310
+ .r�2�r .r�1i2 .r�2i3fi .r�1o2 .re1s2 .re2s3cap
311
+ .re2s3cisi % for res-cision 09/17/92 (missing from nb)
312
+ .re2s3ciso % for res-cisoire 09/17/92(missing from nb)
313
+ .re2s3cou .re2s3cri .re2s3pect .re2s3pir .re2s3plend .re2s3pons .re2s3quil
314
+ .re2s3s .re2s3t .re3s4tab .re3s4tag .re3s4tand .re3s4tat .re3s4t�n .re3s4t�r
315
+ .re3s4tim .re3s4tip .re3s4toc .re3s4top .re3s4tr .re4s5trein .re4s5trict
316
+ .re4s5trin .re3s4tu .re3s4ty
317
+ .r�u2 %.r�1u2 % pattern rejected 12/2/92
318
+ % (don't hyphenate as r�-union nor r�u-nion)
319
+ .r�2uss .r�tro1a2 4re. 4res.
320
+ % words ending with -rent (df) 22/02/94
321
+ 2rent. % es-p�rent - syllable muette - bg 27/12/93
322
+ .pa3rent. appa3rent. transpa3rent. �3rent. tor3rent. cur3rent. 2r3heur
323
+ 2r3hydr
324
+
325
+ % s, phonetic
326
+ 1sa 1s� .sch4 1s2ch e2s3ch i2s3ch� i2s3chia i2s3chio 4sch. 4sche. 4sches.
327
+ 2schs 1se 1s� 1s� 1s� 4se. 4ses. .seu2le % jbb
328
+ .sh4 1s2h 4sh. 4she. 4shes. 2shent. % mute syllable: smashent (df) 22/02/94
329
+ 2shm 2shr 2shs 1si 1s� 1so 1s� 1s�0 % final zero essential to terminate cs
330
+ 1su 1s� 1sy
331
+ % s, etymological
332
+ 1s2caph 1s2cl�r 1s2cop sesqui1a2
333
+ % words ending with -sent (df) 22/02/94
334
+ 2sent. % massent - syllable muette - bg 27/12/93
335
+ ab3sent. pr�3sent. .res3sent. 2s3hom 1s2lav 1s2lov 1s2patia 1s2perm 1s2por
336
+ 1s2ph�r 1s2ph�r 1s2piel 1s2piros 1s2tandard 1s2tein st�r�o1s2 1s2tigm
337
+ 1s2tock 1s2tomos 1s2troph 1s2tructu 1s2tyle .su2b1a2 .su3b2alt .su2b1�2
338
+ .su3b2�3r .su2b1in .su2b3limin .su2b3lin .su2b3lu sub1s2 .su2b1ur supero2
339
+ supe4r1 supers2 .su2r1a2 su3r2ah .su3r2a3t .su2r1e2 .su3r2eau .su3r2ell
340
+ .su3r2et .su2r1�2 .su2r3h .su2r1i2m .su2r1inf .su2r1int .su2r1of .su2r1ox
341
+
342
+ % t, phonetic
343
+ 1ta 1t� 1t� 1te 1t� 1t� 1t� 4te. 4tes. .th4 1t2h 4th. 4the. 4thes.
344
+ 2thl % th2l was wrong for ...ath-lon (jd said 2thl) df 12/27/93
345
+ 2thm 2thn th2r 4thre. 4thres. 2ths 1ti 1t� 1to 1t� 1t2r 4tre. 4tres.
346
+ 2trent. % mute syllable: infiltrent (df) 28/02/94
347
+ 1tu 1t� tung2s3 1ty
348
+ % t, etymological
349
+ tachy1a2 tchin3t2 t�l�1e2 t�l�1i2 t�l�1o2b t�l�1o2p t�l�1s2
350
+ % words ending with -tent (df) 22/02/94
351
+ 2tent. % mentent - syllable muette - bg 27/12/93
352
+ .la3tent. .pa3tent. comp�3tent. �ni3tent. m�con3tent. omnipo3tent.
353
+ ventripo3tent. �quipo3tent. impo3tent. mit3tent.
354
+ thermo1s2 2t3heur tran2s1a2 tran3s2act tran3s2ats tran2s3h tran2s1o2
355
+ tran2s3p tran2s1u2 .tri1a2c .tri1a2n .tri1a2t .tri1o2n t1t2l
356
+
357
+ % u, phonetic
358
+ .u4 'u4 .�4 '�4
359
+ % u, etymological
360
+ uni1o2v uni1a2x u2s3tr
361
+
362
+ % v, phonetic
363
+ 1va 1v� 1ve 1v� 1v� 1v� 4ve. 4ves. 1vi 1v� 1vo 1v� 1v2r 4vre. 4vres.
364
+ 2vrent. % mute syllable: recouvrent (df) 28/02/94
365
+ 1vu 1v� 1vy
366
+
367
+ % v, etymological
368
+ v�lo1s2ki
369
+ % words ending with -vent (df) 22/02/94
370
+ 2vent. conni3vent. .sou3vent. vol2t1amp
371
+
372
+ % w, phonetic
373
+ 1wa 1we 4we. 4wes. 2went. % mute syllable: interviewent (df) 22/02/94
374
+ 1wi 1wo 1wu 1w2r
375
+
376
+ % x
377
+ 2xent. % mute syllable: malaxent (df) 22/02/94
378
+
379
+ % y, phonetic
380
+ .y4 'y4
381
+ % y, etymological
382
+ y1asth y1s2tom y1algi
383
+
384
+ % z, phonetic
385
+ 1za 1ze 1z� 1z� 4ze. 4zes. 1zi 1zo 1zu 1zy
386
+ % z, etymological
387
+ % words ending with -zent (df) 22/02/94
388
+ 2zent. privatdo3zent.
389
+ PATTERNS
390
+ end
391
+ Text::Hyphen::Language::FRA = Text::Hyphen::Language::FR
392
+ Text::Hyphen::Language::FRE = Text::Hyphen::Language::FR