text-hyphen 1.0.0

Sign up to get free protection for your applications and to get access to all the features.
Files changed (40) hide show
  1. data/ChangeLog +4 -0
  2. data/Changelog +4 -0
  3. data/INSTALL +6 -0
  4. data/LICENCE +47 -0
  5. data/README +56 -0
  6. data/Rakefile +116 -0
  7. data/bin/hyphen +107 -0
  8. data/lib/text/hyphen.rb +289 -0
  9. data/lib/text/hyphen/language.rb +112 -0
  10. data/lib/text/hyphen/language/ca.rb +174 -0
  11. data/lib/text/hyphen/language/cs.rb +363 -0
  12. data/lib/text/hyphen/language/da.rb +118 -0
  13. data/lib/text/hyphen/language/de1.rb +723 -0
  14. data/lib/text/hyphen/language/de2.rb +685 -0
  15. data/lib/text/hyphen/language/en_uk.rb +791 -0
  16. data/lib/text/hyphen/language/en_us.rb +493 -0
  17. data/lib/text/hyphen/language/es.rb +289 -0
  18. data/lib/text/hyphen/language/et.rb +337 -0
  19. data/lib/text/hyphen/language/eu.rb +115 -0
  20. data/lib/text/hyphen/language/fi.rb +113 -0
  21. data/lib/text/hyphen/language/fr.rb +392 -0
  22. data/lib/text/hyphen/language/ga.rb +608 -0
  23. data/lib/text/hyphen/language/hr.rb +124 -0
  24. data/lib/text/hyphen/language/hsb.rb +180 -0
  25. data/lib/text/hyphen/language/hu1.rb +385 -0
  26. data/lib/text/hyphen/language/hu2.rb +1283 -0
  27. data/lib/text/hyphen/language/ia.rb +73 -0
  28. data/lib/text/hyphen/language/id.rb +97 -0
  29. data/lib/text/hyphen/language/is.rb +390 -0
  30. data/lib/text/hyphen/language/it.rb +135 -0
  31. data/lib/text/hyphen/language/la.rb +134 -0
  32. data/lib/text/hyphen/language/mn.rb +103 -0
  33. data/lib/text/hyphen/language/nl.rb +1253 -0
  34. data/lib/text/hyphen/language/no1.rb +303 -0
  35. data/lib/text/hyphen/language/no2.rb +138 -0
  36. data/lib/text/hyphen/language/pl.rb +480 -0
  37. data/lib/text/hyphen/language/pt.rb +56 -0
  38. data/lib/text/hyphen/language/sv.rb +449 -0
  39. data/tests/tc_text_hyphen.rb +62 -0
  40. metadata +90 -0
@@ -0,0 +1,115 @@
1
+ # Hyphenation patterns for Text::Hyphen in Ruby: Basque
2
+ # Converted from the TeX hyphenation/bahyph.tex file, by Juan M.
3
+ # Aguirregabiria (wtpagagj@lg.ehu.es) in 1997 based on work by Julio
4
+ # Sanchez (jsanchez@gmv.es) in 1991.
5
+ #
6
+ # The original copyright holds and is reproduced in the source to this file.
7
+ # The Ruby version of these patterns are copyright 2004 Austin Ziegler and
8
+ # are available under an MIT license. See LICENCE for more information.
9
+ #--
10
+ # Hyphenation patterns for Basque.
11
+ # This file has been written by Juan M. Aguirregabiria (wtpagagj@lg.ehu.es)
12
+ # on February 1997 and is based on the shyphen.sh script that generates the
13
+ # Spanish patterns as compiled by Julio Sanchez (jsanchez@gmv.es) on
14
+ # September 1991. The original Copyright follows and applies also to this
15
+ # file whose last version will be always available by anonymous ftp from
16
+ # tp.lc.ehu.es or by poynting your Web browser to
17
+ # http://tp.lc.ehu.es/basque.html
18
+ #
19
+ # Hyphenation patterns for Spanish.
20
+ # Compiled by Julio Sanchez (jsanchez@gmv.es) on September 1991. These
21
+ # patterns have been derived from "On Word Division in Spanish", Jos'e A.
22
+ # Ma~nas, Communications of the ACM, and implemented in his package ftc. You
23
+ # can get ftc and a draft of the abovementioned paper from goya.dit.upm.es
24
+ # in src/text.proc/ftc.Z. FTP access may be available. Otherwise, send help
25
+ # to info@goya.dit.upm.es for details on use of the mail server.
26
+ #
27
+ # Rules mentioned below are those described in that paper. After several
28
+ # unsatisfactory attempts to pretend I knew better, these patterns closely
29
+ # follow that paper. Pattern 'tl' is not considered. It is conflictive and
30
+ # ftc does not use it either.
31
+ #
32
+ # These patterns have been generated by shyphen.sh version 1.0, shyphen.sh
33
+ # is a sh script that allows a number of choices. Full benefit from some of
34
+ # these options can only be obtained if appropriate fonts are available.
35
+ #
36
+ # Follows a copyright notice. This is not in the public domain, but the
37
+ # copyright is essentially a hold-harmless clause. That is, use it at will,
38
+ # but don't sue me if you don't like it.
39
+ #
40
+ # COPYRIGHT NOTICE
41
+ #
42
+ # These patterns and the generating sh script are Copyright (c) GMV 1991
43
+ # These patterns were developed for internal GMV use and are made public in
44
+ # the hope that they will benefit others. Also, spreading these patterns
45
+ # throughout the Spanish-language TeX community is expected to provide
46
+ # back-benefits to GMV in that it can help keeping GMV in the mainstream of
47
+ # spanish users. However, this is given for free and WITHOUT ANY WARRANTY.
48
+ # Under no circumstances can Julio Sanchez, GMV, Jos'e A. Ma~nas or any
49
+ # agents or representatives thereof be held responsible for any errors in
50
+ # this software nor for any damages derived from its use, even in case any
51
+ # of the above has been notified of the possibility of such damages. If any
52
+ # such situation arises, you responsible for repair. Use of this software is
53
+ # an explicit acceptance of these conditions.
54
+ #
55
+ # You can use this software for any purpose. You cannot delete this
56
+ # copyright notice. If you change this software, you must include comments
57
+ # explaining who, when and why. You are kindly requested to send any changes
58
+ # to tex@gmv.es. If you change the generating script, you must include code
59
+ # in it such that any output is clearly labeled as generated by a modified
60
+ # script.
61
+ #
62
+ # Despite the lack of warranty, we would like to hear about any problem you
63
+ # find. Please report problems to tex@gmv.es.
64
+ #
65
+ # END OF COPYRIGHT NOTICE
66
+ #
67
+ # Options included in this set: basic
68
+ # Open vowels: a e o
69
+ # Closed vowels: i u
70
+ # Consonants: b c d f g j k l m n p q r s t v w x y z
71
+ #
72
+ # Some of the patterns below represent combinations that never happen in
73
+ # Basque. Would they happen, they would be hyphenated according to the
74
+ # rules.
75
+ #
76
+ # This keeps {cat|lc}code changes, if any, local. Nice to users of
77
+ # multilingual versions. These are the minimum changes needed to process the
78
+ # patterns. These and other changes will have to be re-enacted when Basque
79
+ # be established as the current language. See the babel docs if you don't
80
+ # understand this.
81
+ #++
82
+ require 'text/hyphen/language'
83
+
84
+ Text::Hyphen::Language::EU = Text::Hyphen::Language.new do |lang|
85
+ lang.patterns <<-PATTERNS
86
+ % Rule SR1
87
+ % Vowels are kept together by the defaults
88
+
89
+ % Rule SR2
90
+ % Attach vowel groups to left consonant
91
+ 1ba 1be 1bo 1bi 1bu 1ca 1ce 1co 1ci 1cu 1da 1de 1do 1di 1du 1fa 1fe 1fo 1fi
92
+ 1fu 1ga 1ge 1go 1gi 1gu 1ja 1je 1jo 1ji 1ju 1ka 1ke 1ko 1ki 1ku 1la 1le 1lo
93
+ 1li 1lu 1ma 1me 1mo 1mi 1mu 1na 1ne 1no 1ni 1nu 1pa 1pe 1po 1pi 1pu 1qa 1qe
94
+ 1qo 1qi 1qu 1ra 1re 1ro 1ri 1ru 1sa 1se 1so 1si 1su 1ta 1te 1to 1ti 1tu 1va
95
+ 1ve 1vo 1vi 1vu 1wa 1we 1wo 1wi 1wu 1xa 1xe 1xo 1xi 1xu 1ya 1ye 1yo 1yi 1yu
96
+ 1za 1ze 1zo 1zi 1zu
97
+
98
+ % Rule SR3
99
+ % Build legal consonant groups, leave other consonants bound to
100
+ % the previous group. This overrides part of the SR2 pattern
101
+ % group.
102
+ 1l2la 1l2le 1l2lo 1l2li 1l2lu 1r2ra 1r2re 1r2ro 1r2ri 1r2ru 1t2sa 1t2se
103
+ 1t2so 1t2si 1t2su 1t2xa 1t2xe 1t2xo 1t2xi 1t2xu 1t2za 1t2ze 1t2zo 1t2zi
104
+ 1t2zu 1b2la 1b2le 1b2lo 1b2li 1b2lu 1b2ra 1b2re 1b2ro 1b2ri 1b2ru 1d2ra
105
+ 1d2re 1d2ro 1d2ri 1d2ru 1f2la 1f2le 1f2lo 1f2li 1f2lu 1f2ra 1f2re 1f2ro
106
+ 1f2ri 1f2ru 1g2la 1g2le 1g2lo 1g2li 1g2lu 1g2ra 1g2re 1g2ro 1g2ri 1g2ru
107
+ 1k2la 1k2le 1k2lo 1k2li 1k2lu 1k2ra 1k2re 1k2ro 1k2ri 1k2ru 1p2la 1p2le
108
+ 1p2lo 1p2li 1p2lu 1p2ra 1p2re 1p2ro 1p2ri 1p2ru 1t2ra 1t2re 1t2ro 1t2ri
109
+ 1t2ru
110
+ % We now avoid some problematic breaks.
111
+ su2b2r su2b2l
112
+ PATTERNS
113
+ end
114
+ Text::Hyphen::Language::BAQ = Text::Hyphen::Language::EU
115
+ Text::Hyphen::Language::EUS = Text::Hyphen::Language::EU
@@ -0,0 +1,113 @@
1
+ # Hyphenation patterns for Text::Hyphen in Ruby: Finnish
2
+ # Converted from the TeX hyphenation/fihyph.tex file, Kauko Saarinen,
3
+ # Computing Centre, University of Jyvaskyla, Finland, 1986 - 1989.
4
+ #
5
+ # The original copyright holds and is reproduced in the source to this file.
6
+ # The Ruby version of these patterns are copyright 2004 Austin Ziegler and
7
+ # are available under an MIT license. See LICENCE for more information.
8
+ #--
9
+ # -----> Finnish hyphenation patterns for MLPCTeX <------
10
+ # First release January -86 by Kauko Saarinen,
11
+ # Computing Centre, University of Jyvaskyla, Finland
12
+ #
13
+ # Completely rewritten January -88. The new patterns make
14
+ # much less mistakes with foreign and compound words.
15
+ # The article "Automatic Hyphenation of Finnish"
16
+ # by Professor Fred Karlsson is also referred
17
+ # ---------------------------------------------------------
18
+ #
19
+ # 8th March -89 (vers. 2.2), some vowel triples by Fred Karlsson added.
20
+ #
21
+ # ********* Patterns may be freely distributed **********
22
+ #++
23
+ require 'text/hyphen/language'
24
+
25
+ Text::Hyphen::Language::FI = Text::Hyphen::Language.new do |lang|
26
+ lang.patterns <<-PATTERNS
27
+ 1ba 1be 1bi 1bo 1bu 1by 1da 1de 1di 1do 1du 1dy 1d� 1d� 1fa 1fe 1fi 1fo 1fu
28
+ 1fy 1ga 1ge 1gi 1go 1gu 1gy 1g� 1g� 1ha 1he 1hi 1ho 1hu 1hy 1h� 1h� 1ja 1je
29
+ 1ji 1jo 1ju 1jy 1j� 1j� 1ka 1ke 1ki 1ko 1ku 1ky 1k� 1k� 1la 1le 1li 1lo 1lu
30
+ 1ly 1l� 1l� 1ma 1me 1mi 1mo 1mu 1my 1m� 1m� 1na 1ne 1ni 1no 1nu 1ny 1n� 1n�
31
+ 1pa 1pe 1pi 1po 1pu 1py 1p� 1p� 1ra 1re 1ri 1ro 1ru 1ry 1r� 1r� 1sa 1se 1si
32
+ 1so 1su 1sy 1s� 1s� 1ta 1te 1ti 1to 1tu 1ty 1t� 1t� 1va 1ve 1vi 1vo 1vu 1vy
33
+ 1v� 1v�
34
+
35
+ % ------- Some common words borrowed from other languages -------
36
+ % ------- This part could be updated from time to time -------
37
+ 1st2r % -stressi, -strategia etc.
38
+
39
+ % ------ Some special cases occuring with compound words only ----
40
+ % ------ There still remains well known problem as 'kaivos\-aukko' etc.
41
+ %a1y (common in borrowed words)
42
+ �2y y1a2 y1o2 o1y �2y u1y2 y1u2 �3a2 �3o2 �3a2 �3o2 �1u2 �1u2
43
+ a1� % (a1�2 ei mahdollinen!)
44
+ a1� o1� o1� u1�2 u1�2 �2� �2� �2� �2�
45
+ % lyhyt/pitka -vokaalipareja, tavallisesti sanarajalla
46
+ aa1i2 % maa-ikkuna
47
+ aa1e2 aa1o2 % maa-ottelu
48
+ aa1u2 % uraa-uurtava
49
+ ee1a2 % tee-astia
50
+ ee1i2 % tee-istutus
51
+ ee1u2 % varietee-uusinta
52
+ ee1y2 ii1a2 ii1e2 ii1o2 uu1a2 uu1e2 % puu-esine
53
+ uu1o2 % puu-osa
54
+ uu1i2 % puu-istutus
55
+ e1aa i1aa o1aa u1aa u1ee a1uu % kala-uuni
56
+ i1uu % ravi-uutiset
57
+ e1uu % virhe-uutinen
58
+ o1uu % radio-uutiset
59
+ ��1i ��1e ��3y i1�� e1�� y1�� i1�� % yhti-��n etc.
60
+ %i1eu % keski-eurooppalainen
61
+
62
+ % vokaalikolmikkoja etc. yhdyssanojen rajoissa
63
+ % -------- vowel triples by Fred Karlsson
64
+ a1ei a1oi e1ai % e1oi % ambiguous for ex. video-ilme (8.3.89)
65
+ i1au % u1oi % ambiguous (8.3.89)
66
+ y1ei ai1a ai1e ai1o ai1u au1a au1e eu1a ie1a ie1o %ie1u % ambiguous
67
+ ie1y io1a2 io1e2 iu1a iu1e iu1o oi1a oi1e oi1o oi1u
68
+ o1ui % veto-uistin, himo-uimari, etc.
69
+ ou1e ou1o ue1a ui1e uo1a % uo1i % ambiguous
70
+ uo1u
71
+ % ---------------- End of vowel triples --------------------
72
+ e1�2 �1e2 .�2 % don't hyphenate �-lyllinen etc.
73
+
74
+ % The following patterns contain no general scientific rule. They are
75
+ % selected more or less intuitively to solve problems with common and
76
+ % frequently appearing compound words. However, every pattern resolves more
77
+ % than only one hyphenation problem.
78
+ u2s % estaa virheita yhdyssanojen yhteydessa
79
+ yli1o2p % yli-opisto etc.
80
+ ali1a2v % ali-avaruus etc.
81
+ 1sp2li % kuutio-splini etc.
82
+ alous1 keus1 % oikeus-oppinut etc.
83
+ rtaus1 2s1ohje % -sohjelma etc.
84
+ 2s1a2sia % nais-asianainen etc.
85
+ 1a2sian % neuvottelu-asian etc.
86
+ 1a2siat % koti-asian (ei kotia-sian)
87
+ 1a2sioi % talous-asioita etc.
88
+ r2as l2as % tikku-rasia etc
89
+ 2s1o2pisk % xs-opiskelija etc
90
+ 2n1o2pet 2s1a2loi 2n1o2pist % kansan-opisto etc.
91
+ 2s1o2pist 2s1o2sa % xxs-osakas etc.
92
+ 2n1o2sa % asian-osainen etc.
93
+ alkei2s1 perus1 2s1i2dea. 2s1i2dean 2s1e2sity % xs-esitys etc
94
+ 2n1e2dus % kansan-edustaja etc.
95
+ 2s1ajatu % -ajatus etc.
96
+ 2s1ase 2s1apu 2s1y2rit % yhteis-yritys etc.
97
+ .ydi2n1 .suu2r1a2 % suur-ajot etc.
98
+ 2s1y2hti 2n1otto 2n1oton 2n1anto 2n1anno 2n1a2jan 2n1aika 2n1o2mai 2n1y2lit
99
+ 2s1a2len 2n1a2len 1a2siaka2s1 ulo2s1 % ulos-ajo
100
+ 2n1a2jo % kiven-ajo
101
+ 2s1a2jo
102
+
103
+ % *** The following rules may be used on user's responsibility ***
104
+ % *** for example, may be needed with narrow columns ***
105
+ % >>>>>>>>>>> a1e a1o e1o o1a u1a <<<<<<<<<<<
106
+
107
+ % ----- Some districting rules by Professor Fred Karlsson's ideas ------
108
+ b2l 1b2lo bib3li b2r 1b2ri 1b2ro 1b2ru d2r 1d2ra f2l 1f2la f2r 1f2ra 1f2re
109
+ g2l 1g2lo g2r 1g2ra k2l 1k2ra 1k2re 1k2ri 1k2v 1k2va p2l p2r 1p2ro c2l q2v
110
+ 1q2vi sc2h ts2h ch2r
111
+ PATTERNS
112
+ end
113
+ Text::Hyphen::Language::FIN = Text::Hyphen::Language::FI
@@ -0,0 +1,392 @@
1
+ # Hyphenation patterns for Text::Hyphen in Ruby: French
2
+ # Converted from the TeX hyphenation/frhyph.tex file, by Jacques
3
+ # Desarmenien (1984 - 1986), Norman Buckle, Michael Ferguson (1988),
4
+ # Justin Bur, and others.
5
+ #
6
+ # The original copyright holds and is reproduced in the source to this file.
7
+ # The Ruby version of these patterns are copyright 2004 Austin Ziegler and
8
+ # are available under an MIT license. See LICENCE for more information.
9
+ #--
10
+ # frhyph.tex % French hyphenation patterns
11
+ #%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
12
+ # This file is available for free and can used and redistributed
13
+ # asis for free. Modified versions should have another name.
14
+ #%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
15
+ #message{frhyph.tex - French hyphenation patterns (V2.11) <2002/01/16>}
16
+ #
17
+ #%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
18
+ # *BEFORE* using this file *PLEASE* run checksum on it:
19
+ # checksum -v frhyph.tex
20
+ # to make sure that it hasn't been damaged.
21
+ # Then if you notice anything wrong in french hyphenation please report to
22
+ # D. Flipo and B. Gaulle at the email address: cesure-l@gutenberg.eu.org
23
+ #%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
24
+ #% checksum = "37094 1457 3075 34744"
25
+ #%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
26
+ #%%%%%%% The most famous good guys who worked hard to obtain something usable.
27
+ # Jacques Desarmenien, Universite de Strasbourg :
28
+ # - << how to run TeX in a French environment: hyphenation, fonts,
29
+ # typography. >> in Tugboat, 5 (1984) 91-102. and TeX85 conference
30
+ # - << La division par ordinateur des mots francais :
31
+ # application a TeX >> in TSI vol. 5 No 4, 1986 (C) AFCET-
32
+ # Gauthier-Villars
33
+ # Norman Buckle, UQAH (nb; many additions)
34
+ # Michael Ferguson, INRS-Telecommunications (mjf) June 1988
35
+ # Justin Bur, Universite de Montreal (jbb; checked against original list)
36
+ # all patterns including apostrophe missing from nb list
37
+ # after that, GUTenberg and specially Daniel Flipo and Bernard Gaulle
38
+ # did their best effort to improve the list of patterns.
39
+ #
40
+ # -----------------------------------------------------------------
41
+ #
42
+ # Adaption of these patterns for
43
+ # - TeX Version 3.x and MLTeX 3.x (2.x)
44
+ # and
45
+ # - all fonts in T1/`Cork' and/or CM/OT1 encoding
46
+ # by Bernd Raichle 1996/08/28 (using ideas from `ghyph31.tex'
47
+ # as of 1994-02-13 maintained by Bernd Raichle).
48
+ # (An adaption for the old MLTeX 2.x exists but can not be
49
+ # tested in lack of an executable.)
50
+ #++
51
+ require 'text/hyphen/language'
52
+
53
+ Text::Hyphen::Language::FR = Text::Hyphen::Language.new do |lang|
54
+ lang.patterns <<-PATTERNS
55
+ % *, phonetic
56
+ 2'2
57
+
58
+ % a, phonetic
59
+ .a4 'a4 .�4 '�4
60
+ ab2h % df-bg 1998/02/07 for abhorrer
61
+ ad2h % df-bg 1998/02/07 for adh�sion & co
62
+ % a, etymological
63
+ .ab3r�a 'ab3r�a a1�2dre .ae3s4ch 'ae3s4ch 1alcool a2l1algi .amino1a2c
64
+ 'amino1a2c .ana3s4tr 'ana3s4tr 1a2nesth�si .anti1a2 'anti1a2 .anti1e2
65
+ 'anti1e2 .anti1�2 .anti2enne 'anti2enne 'anti1�2 .anti1s2 'anti1s2 .apo2s3ta
66
+ 'apo2s3ta apo2s3tr archi1�2pis .as2ta 'as2ta a2s3tro
67
+
68
+ % b, phonetic
69
+ 1ba 1b� 1be 1b� 1b� 1b� 4be. 4bes.
70
+ 2bent. % mute syllable: tombent (df) 22/02/94
71
+ 1bi 1b� 1b2l 4ble. 4bles.
72
+ 2blent. % mute syllable: troublent (df) 28/02/94
73
+ 1bo 1b� 1b2r 4bre. 4bres.
74
+ 2brent. % mute syllable: palabrent (df) 28/02/94
75
+ 1bu 1b� 1by
76
+ % b, etymological
77
+ .bai2se3main % hyphen disappeared from french 5/2/94
78
+ .bi1a2c .bi1a2t % like .tri1a2t for tri-athlon bg 12/27/93
79
+ .bi1au .bio1a2 .bi2s1a2 .bi1u2
80
+
81
+ % c, phonetic
82
+ 1� 1ca 1c� 1ce 1c� 1c� 1c� 4ce. 4ces.
83
+ % words ending with -cent (df) 22/02/94
84
+ 2cent. ja3cent. ac3cent. �3cent. munifi3cent. r�ti3cent. privatdo3cent.
85
+ inno3cent. es3cent. acquies4cent. is3cent. immis4cent. .ch4 1c2h 4ch. 2chb
86
+ 4che. 4ches. 2chent. % mute syllable: touchent (df) 22/02/94
87
+ ca3ou3t2 % for caoutchou... added 3/1/94 df-bg
88
+ 2chg ch2l 4chle. 4chles. 2chm 2chn 2chp ch2r 4chre. 4chres. 2chs 2cht 2chw
89
+ 1ci 1c� 1c2k 4ck. 2ckb 4cke. 4ckes.
90
+ 2ckent. % mute syllable: stockent (df) 22/02/94
91
+ 2ckf 2ckg 2ck3h 2ckp 2cks 2ckt 1c2l 4cle. 4cles.
92
+ 2clent. % mute syllable: encerclent (df) 28/02/94
93
+ 1co 1c� 1c�0 % final zero essential to terminate cs
94
+ 1c2r 4cre. 4cres. 2crent. % mute syllable: massacrent (df) 28/02/94
95
+ 1cu 1c� 1cy .cul4 % -- as .con4 .cons4 (march 92)
96
+ % c, etymological
97
+ .ch�2vre3feuille % hyphen disappeared from french 5/2/94
98
+ chlo2r3a2c chlo2r3�2t .ci2s1alp co1acc co1acq co1a2d co1ap co1ar co1assoc
99
+ co1assur co1au co1ax co1�2 co1ef co1en co1ex .con4 % missing from nb list
100
+ .cons4 % missing from nb list
101
+ .contre1s2c .contre3ma�tre % hyphen disappeared from french 5/2/94
102
+ co2nurb .co1o2 .co2o3lie
103
+
104
+ % d, phonetic
105
+ 1d' 1da 1d� d1d2h 1de 1d� 1d� 1d� 4de. 4des. 1di 1d�
106
+ 2dlent. % mute syllable: jodlent (df) 28/02/94
107
+ 1do 1d� 1d2r 4dre. 4dres.
108
+ 2drent. % mute syllable: engendrent (df) 28/02/94
109
+ d1s2 1du 1d� 1dy
110
+ % d, etymological
111
+ .dacryo1a2 .d�1a2 .d�1io .d�1o2
112
+ .d�2s % originaly in JD file
113
+ %.d�2s1a2 removed 09/17/92 because wrong for the original JD 500 words test
114
+ .d�3s2a3cr .d�s2a3m % .d�s2a2mi introduced 09/17/92 bec. i can't see why
115
+ % d�samidonner ran in JD. Moved to .d�s2a3m df 12/27/93.
116
+ .d�3s2a3tell .d�3s2astr .d�3s2c % 1 moved 3 due to .d�2s 09/17/92
117
+ %.d�2s1e2 removed 09/17/92 because wrong for the original JD 500 words test
118
+ .d�2s1�2 .d�3s2�3gr .d�3s2ensib .d�3s2ert .d�3s2exu
119
+ %.d�2s3h removed 09/17/92 because wrong for the original JD 500 words test
120
+ .d�2s1i2 .d�3s2i3d .d�3s2i3gn .d�3s2i3li .d�3s2i3nen .d�3s2invo .d�3s2i3r
121
+ .d�3s2ist
122
+ %.d�2s1o2 removed 09/17/92 because wrong for the original JD 500 words test
123
+ .d�3s2o3d� .d�2s1�0 % final zero essential to terminate cs
124
+ .d�3s2o3l .d�3s2o3pil .d�3s2orm .d�3s2orp .d�3s2oufr
125
+ .d�3s2p % 1 moved 3 due to .d�2s 09/17/92
126
+ .d�3s2t % 1 moved 3 due to .d�2s 09/17/92
127
+ .d�2s1u2n 3d2hal 3d2houd di2s3cop .di1a2c� .di1a2cid .di1ald .di1a2mi
128
+ .di1a2tom .di1e2n .di2s3h .dy2s3 .dy2s1a2 .dy2s1i2
129
+ .dy2s1o2 % missing from nb list
130
+ .dy2s1u2
131
+ % words ending with -dent (df) 22/02/94
132
+ 2dent. d�ca3dent. �3dent. cci3dent. inci3dent. confi3dent. tri3dent.
133
+ dissi3dent. chien3dent. .ar3dent. impu3dent. pru3dent.
134
+
135
+ % e, phonetic
136
+ .e4 'e4 .�4 '�4 .�4 '�4 .�4 '�4
137
+ % e, etymological
138
+ �d2hi % df-bg 1998/02/07 for r�dhibitoire
139
+ 1�2drie 1�2drique 1�2lectr 1�2l�ment .en1a2 'en1a2 1�2nerg e2n1i2vr .en1o2
140
+ 'en1o2 �pi2s3cop �pi3s4cope e2s3cop .eu2r1a2 'eu2r1a2 eu1s2tat extra1
141
+ extra2c extra2i
142
+
143
+ % f, phonetic
144
+ 1fa 1f� 1fe 1f� 1f� 1f� 4fe. 4fes.
145
+ 2fent. % mute syllable: agrafent chauffent (df) 22/02/94
146
+ 1fi 1f� 1f2l 4fle. 4fles.
147
+ 2flent. % mute syllable: gonflent (df) 28/02/94
148
+ 1fo 1f� 1f2r 4fre. 4fres.
149
+ 2frent. % mute syllable: balafrent (df) 28/02/94
150
+ f1s2 1fu 1f� 1fy
151
+ % g, phonetic
152
+ 1ga 1g� 1ge 1g� 1g� 1g� 4ge. 4ges. 1g2ha 1g2he 1g2hi 1g2ho 1g2hy 1gi 1g�
153
+ 1g2l 4gle. 4gles. 2glent. % mute syllable: meuglent (df) 28/02/94
154
+ 1g2n 'a2g3nat % (df) 16/01/02
155
+ .a2g3nat % (df) 16/01/02
156
+ a2g3nos % (df) 16/01/02 (pattern dia2g3n deleted)
157
+ co2g3niti % (df) 16/01/02
158
+ 'i2g3n� % (df) 16/01/02
159
+ .i2g3n� % (df) 16/01/02
160
+ 'i2g3ni % (df) 16/01/02
161
+ .i2g3ni % (df) 16/01/02
162
+ .ma2g3nicide % (df) 16/01/02
163
+ .ma2g3nificat % (df) 16/01/02
164
+ .ma2g3num % (df) 16/01/02
165
+ o2g3nomoni % (df) 16/01/02
166
+ o2g3nosi % (df) 16/01/02
167
+ .pro2g3nath % (df) 16/01/02
168
+ pu2g3nable % (df) 16/01/02
169
+ pu2g3nac % (df) 16/01/02
170
+ .sta2g3n .syn2g3nath % (df) 16/01/02
171
+ wa2g3n 4gne. 4gnes. 2gnent. % mute syllable: accompagnent (df) 28/02/94
172
+ 1go 1g� 1g2r 4gre. 4gres. 2grent. % mute syllable: immigrent (df) 28/02/94
173
+ 1gu 1g� g1s2 4gue. 4gues. 1gy
174
+ % words ending with -guent (df) 22/02/94
175
+ 2guent. .on3guent. 'on3guent.
176
+ % g, etymological
177
+ % words ending with -gent (df) 22/02/94
178
+ 2gent. r�3gent. entre3gent. indi3gent. dili3gent. intelli3gent. indul3gent.
179
+ tan3gent. rin3gent. contin3gent. .ar3gent. 'ar3gent. ser3gent. ter3gent.
180
+ r�sur3gent.
181
+
182
+ % h, phonetic
183
+ 1ha 1h� 1he 1h� 1h� 1h� 4he. 4hes. 1hi 1h� 1ho 1h� 1hu 1h� 1hy
184
+ % h, etymological
185
+ h�mi1� h�mo1p2t hypera2 hypere2 hyper�2 hyperi2 hypero2 hypers2 hype4r1
186
+ hyperu2 hypo1a2 hypo1e2 % missing from nb list
187
+ hypo1�2 hypo1i2 hypo1o2 hypo1s2 hypo1u2
188
+
189
+ % i, phonetic
190
+ .i4 'i4 .�4 '�4
191
+ % ill patterns missing from nb list
192
+ il2l cil3l rcil4l ucil4l vacil4l gil3l hil3l lil3l l3lion mil3l mil4let
193
+ �mil4l semil4l rmil4l armil5l capil3l papil3la papil3le papil3li papil3lom
194
+ pupil3l piril3l thril3l cyril3l ibril3l pusil3l .stil3l distil3l instil3l
195
+ fritil3l boutil3l vanil3lin vanil3lis vil3l avil4l chevil4l uevil4l uvil4l
196
+ xil3l
197
+ % end of ill patterns
198
+
199
+ % i, etymological
200
+ i1algi i1arthr i1�2dre 1informat % missing from nb list
201
+ .in1a2 'in1a2 .in2a3nit 'in2a3nit .in2augur 'in2augur .in1e2 'in1e2 .in1�2
202
+ 'in1�2 .in2effab % missing from nb list
203
+ 'in2effab .in2�3lucta 'in2�3lucta .in2�3narra 'in2�3narra .in2ept 'in2ept
204
+ .in2er 'in2er .in2exora % missing from nb list
205
+ 'in2exora .in1i2 'in1i2 .in2i3miti 'in2i3miti .in2i3q 'in2i3q .in2i3t
206
+ 'in2i3t .in1o2 'in1o2 .in2o3cul 'in2o3cul .in2ond 'in2ond .in1s2tab
207
+ 'in1s2tab 'inte4r3 .intera2 'intera2 .intere2 'intere2 .inter�2 'inter�2
208
+ .interi2 'interi2 .intero2 'intero2 .inte4r3 .interu2 'interu2 .inters2
209
+ 'inters2 .in1u2 'in1u2 .in2uit 'in2uit .in2u3l 'in2u3l io1a2ct i1oxy i1s2tat
210
+
211
+ % j, phonetic
212
+ 1j 2jk 4je. 4jes. 2jent. % mute syllable: gal�jent (df) 22/02/94
213
+
214
+ % k, phonetic
215
+ 1ka 1k� 1ke 1k� 1k� 1k� 4ke. 4kes.
216
+ 2kent. % mute syllable: jerkent (df) 22/02/94
217
+ 1k2h 4kh. .kh4 1ki 1k� 1ko 1k� 1k2r 1ku 1k� 1ky
218
+
219
+ % l, phonetic
220
+ 1la 1l� 1l� 1le 1l� 1l� 1l� 4le. 4les. 1li 1l� 1lo 1l� l1s2t 1lu 1l� 1ly
221
+
222
+ % l, etymological
223
+ la2w3re
224
+ % words ending with -lent (df) 22/02/94
225
+ 2lent. .ta3lent. iva3lent. �quiva4lent. monova3lent. polyva3lent. re3lent.
226
+ .do3lent. indo3lent. inso3lent. turbu3lent. succu3lent. f�cu3lent.
227
+ trucu3lent. opu3lent. corpu3lent. ru3lent. sporu4lent.
228
+
229
+ % m, phonetic
230
+ 1ma 1m� 1me 1m� 1m� 1m� 4me. 4mes. 1mi 1m� 1mo 1m�
231
+ 1m�0 % final zero essential to terminate cs
232
+ m1s2 1mu 1m� 1my
233
+ % m, etymological
234
+ .ma2c3k .macro1s2c .ma2l1a2dres .ma2l1a2dro .ma2l1ais� .ma2l1ap .ma2l1a2v
235
+ .ma2l1en .ma2l1int .ma2l1oc .ma2l1o2d .ma2r1x % nb (jbb: ?)
236
+ .m�2g1oh .m�2sa % missing from nb list
237
+ .m�3san % missing from nb list
238
+ .m�2s1es .m�2s1i .m�2s1u2s .m�ta1s2ta
239
+ % words ending with -ment (df) 22/02/94
240
+ �2ment. da2ment. fa2ment. amalga2ment. cla2ment. ra2ment. temp�ra3ment.
241
+ ta2ment. testa3ment. qua2ment. �2ment. car�2ment. diaphrag2ment. ryth2ment.
242
+ ai2ment. rai3ment. ab�2ment. �ci2ment. vidi2ment. subli2ment. �li2ment.
243
+ reli2ment. mi2ment. ani2ment. veni2ment. ri2ment. d�tri3ment. nutri3ment.
244
+ inti2ment. esti2ment. l2ment. flam2ment. gram2ment. .gem2ment. om2ment.
245
+ .com3ment. �2ment. slalo2ment. chro2ment. to2ment. ar2ment. .sar3ment.
246
+ er2ment. antifer3ment. .ser3ment. fir2ment. or2ment. as2ment. au2ment.
247
+ �cu2ment. fu2ment. hu2ment. fichu3ment. llu2ment. plu2ment. bou2ment.
248
+ bru2ment. su2ment. tu2ment. .milli1am 1m2n�mo 1m2n�s 1m2n�si .mono1a2
249
+ .mono1e2 .mono1�2 .mono1i2 .mono1�2d� .mono1o2 .mono1u2 .mono1s2
250
+ mon2t3r�al % missing from nb list
251
+ moye2n1�2g
252
+
253
+ % n, phonetic
254
+ 1na 1n� 1ne 1n� 1n� 1n� 4ne. 4nes. 1ni 1n� 1no 1n�
255
+ 1n�0 % final zero essential to terminate cs
256
+ 1nu 1n� n1x 1ny
257
+ % n, etymological
258
+ % words ending with -nent (df) 22/02/94
259
+ 2nent. % fric-tionnent - syllable muette - bg 27/12/93
260
+ r�ma3nent. imma3nent. perma3nent. .�mi3nent. pr��mi3nent. pro�mi3nent.
261
+ sur�mi3nent. immi3nent. conti3nent. perti3nent. absti3nent. .no2n1obs
262
+ n3s2at. n3s2ats.
263
+
264
+ % o, phonetic
265
+ .o4 'o4 '�4 .�4 %'�2 % mjf % deleted 3/1/94 df-bg
266
+ % o, etymological
267
+ o2b3long 1octet % missing from nb list
268
+ o1d2l o1�2dre o1ioni ombud2s3 omni1s2 o1s2tas o1s2tat o1s2t�ro o1s2tim
269
+ o1s2tom o1s2trad o1s2tratu o1s2triction .oua1ou 'oua1ou .ovi1s2c 'ovi1s2c
270
+ oxy1a2
271
+
272
+ % p, phonetic
273
+ 1pa 1p� 1pe 1p� 1p� 1p� 4pe. 4pes. 1p2h .ph4 4ph. 4phe. 4phes.
274
+ 2phent. % mute syllable: triomphent (df) 22/02/94
275
+ ph2l 4phle. 4phles. 2phn ph2r 4phre. 4phres. 2phs 2pht 1pi 1p� 1p2l 4ple.
276
+ 4ples. 2plent. % mute syllable: accouplent (df) 28/02/94
277
+ 1po 1p� 1p2r 4pre. 4pres. 2prent. % mute syllable: empourprent (df) 28/02/94
278
+ 1pu 1p� 1py
279
+ % p, etymological
280
+ pal�o1�2 .pa2n1a2f .pa2n1a2m� .pa2n1a2ra .pa2n1is .pa2n1o2ph .pa2n1opt
281
+ .pa2r1a2che .pa2r1a2ch� .para1s2 .pa2r3h�
282
+ % words ending with -pent (df) 22/02/94
283
+ 2pent. re3pent. .ar3pent. 'ar3pent. ser3pent.
284
+ .pen2ta % pent- or penta- but never pen-ta bg 12/27/93
285
+ per3h p�2nul % p�2n1ul moved back 09/17/92 to JD def.
286
+ .pe4r .per1a2 .per1e2 .per1�2 .per1i2 .per1o2 .per1u2
287
+ p�1r2�2q % 2r1 moved 09/17/92 to 1r2(it was a typo)
288
+ .p�ri1os .p�ri1s2 .p�ri2s3s .p�ri2s3ta .p�ri1u2 .phalan3s2t photo1s2
289
+ 3ph2tal� 3ph2tis
290
+ % Here is an example of a pb involving phonetic and etymologic patterns 5/94
291
+ % .phyto3ph2 % originaly, but wrong for phy-toph-thora 9/92
292
+ % .phy2topha % for -pharmacie but wrong for phyto-biol.. 5/94
293
+ .pluri1a 1p2n� 1p2neu po1astre poly1a2 poly1e2 poly1�2 poly1�2 poly1i2
294
+ poly1o2 poly1s2 poly1u2 .pon2tet % JD hypenated it asis 09/17/92, exception
295
+ .pos2t3h .pos2t1in .pos2t1o2 .pos2t3r .post1s2 .pr�1a2
296
+ .pr�2a3la % missing from nb list
297
+ .pr�2au .pr�1�2 .pr�1e2 .pr�1i2 .pr�1o2 .pr�1u2 .pr�1s2 .pro1�2 .pro1s2c�
298
+ pro2s3tat .prou3d2h 1p2sych .psycho1a2n 1p2t�r 1p2t�r .pud1d2l
299
+
300
+ % q
301
+ 1q 4que. 4ques.
302
+ % words ending with -quent (df) 22/02/94
303
+ 2quent. �3quent. �lo3quent. grandilo3quent.
304
+
305
+ % r, phonetic
306
+ 1ra 1r� 1re 1r� 1r� 1r� 1r2h 4rhe. 4rhes. 1ri 1r� 1ro 1r� 1ru 1r� 1ry
307
+ % r, etymological
308
+ radio1a2 % missing from nb list
309
+ .r�1a2 .r�2a3le .r�2a3lis .r�2a3lit .r�2aux .r�1�2 .r�1e2 .r�2el .r�2er
310
+ .r�2�r .r�1i2 .r�2i3fi .r�1o2 .re1s2 .re2s3cap
311
+ .re2s3cisi % for res-cision 09/17/92 (missing from nb)
312
+ .re2s3ciso % for res-cisoire 09/17/92(missing from nb)
313
+ .re2s3cou .re2s3cri .re2s3pect .re2s3pir .re2s3plend .re2s3pons .re2s3quil
314
+ .re2s3s .re2s3t .re3s4tab .re3s4tag .re3s4tand .re3s4tat .re3s4t�n .re3s4t�r
315
+ .re3s4tim .re3s4tip .re3s4toc .re3s4top .re3s4tr .re4s5trein .re4s5trict
316
+ .re4s5trin .re3s4tu .re3s4ty
317
+ .r�u2 %.r�1u2 % pattern rejected 12/2/92
318
+ % (don't hyphenate as r�-union nor r�u-nion)
319
+ .r�2uss .r�tro1a2 4re. 4res.
320
+ % words ending with -rent (df) 22/02/94
321
+ 2rent. % es-p�rent - syllable muette - bg 27/12/93
322
+ .pa3rent. appa3rent. transpa3rent. �3rent. tor3rent. cur3rent. 2r3heur
323
+ 2r3hydr
324
+
325
+ % s, phonetic
326
+ 1sa 1s� .sch4 1s2ch e2s3ch i2s3ch� i2s3chia i2s3chio 4sch. 4sche. 4sches.
327
+ 2schs 1se 1s� 1s� 1s� 4se. 4ses. .seu2le % jbb
328
+ .sh4 1s2h 4sh. 4she. 4shes. 2shent. % mute syllable: smashent (df) 22/02/94
329
+ 2shm 2shr 2shs 1si 1s� 1so 1s� 1s�0 % final zero essential to terminate cs
330
+ 1su 1s� 1sy
331
+ % s, etymological
332
+ 1s2caph 1s2cl�r 1s2cop sesqui1a2
333
+ % words ending with -sent (df) 22/02/94
334
+ 2sent. % massent - syllable muette - bg 27/12/93
335
+ ab3sent. pr�3sent. .res3sent. 2s3hom 1s2lav 1s2lov 1s2patia 1s2perm 1s2por
336
+ 1s2ph�r 1s2ph�r 1s2piel 1s2piros 1s2tandard 1s2tein st�r�o1s2 1s2tigm
337
+ 1s2tock 1s2tomos 1s2troph 1s2tructu 1s2tyle .su2b1a2 .su3b2alt .su2b1�2
338
+ .su3b2�3r .su2b1in .su2b3limin .su2b3lin .su2b3lu sub1s2 .su2b1ur supero2
339
+ supe4r1 supers2 .su2r1a2 su3r2ah .su3r2a3t .su2r1e2 .su3r2eau .su3r2ell
340
+ .su3r2et .su2r1�2 .su2r3h .su2r1i2m .su2r1inf .su2r1int .su2r1of .su2r1ox
341
+
342
+ % t, phonetic
343
+ 1ta 1t� 1t� 1te 1t� 1t� 1t� 4te. 4tes. .th4 1t2h 4th. 4the. 4thes.
344
+ 2thl % th2l was wrong for ...ath-lon (jd said 2thl) df 12/27/93
345
+ 2thm 2thn th2r 4thre. 4thres. 2ths 1ti 1t� 1to 1t� 1t2r 4tre. 4tres.
346
+ 2trent. % mute syllable: infiltrent (df) 28/02/94
347
+ 1tu 1t� tung2s3 1ty
348
+ % t, etymological
349
+ tachy1a2 tchin3t2 t�l�1e2 t�l�1i2 t�l�1o2b t�l�1o2p t�l�1s2
350
+ % words ending with -tent (df) 22/02/94
351
+ 2tent. % mentent - syllable muette - bg 27/12/93
352
+ .la3tent. .pa3tent. comp�3tent. �ni3tent. m�con3tent. omnipo3tent.
353
+ ventripo3tent. �quipo3tent. impo3tent. mit3tent.
354
+ thermo1s2 2t3heur tran2s1a2 tran3s2act tran3s2ats tran2s3h tran2s1o2
355
+ tran2s3p tran2s1u2 .tri1a2c .tri1a2n .tri1a2t .tri1o2n t1t2l
356
+
357
+ % u, phonetic
358
+ .u4 'u4 .�4 '�4
359
+ % u, etymological
360
+ uni1o2v uni1a2x u2s3tr
361
+
362
+ % v, phonetic
363
+ 1va 1v� 1ve 1v� 1v� 1v� 4ve. 4ves. 1vi 1v� 1vo 1v� 1v2r 4vre. 4vres.
364
+ 2vrent. % mute syllable: recouvrent (df) 28/02/94
365
+ 1vu 1v� 1vy
366
+
367
+ % v, etymological
368
+ v�lo1s2ki
369
+ % words ending with -vent (df) 22/02/94
370
+ 2vent. conni3vent. .sou3vent. vol2t1amp
371
+
372
+ % w, phonetic
373
+ 1wa 1we 4we. 4wes. 2went. % mute syllable: interviewent (df) 22/02/94
374
+ 1wi 1wo 1wu 1w2r
375
+
376
+ % x
377
+ 2xent. % mute syllable: malaxent (df) 22/02/94
378
+
379
+ % y, phonetic
380
+ .y4 'y4
381
+ % y, etymological
382
+ y1asth y1s2tom y1algi
383
+
384
+ % z, phonetic
385
+ 1za 1ze 1z� 1z� 4ze. 4zes. 1zi 1zo 1zu 1zy
386
+ % z, etymological
387
+ % words ending with -zent (df) 22/02/94
388
+ 2zent. privatdo3zent.
389
+ PATTERNS
390
+ end
391
+ Text::Hyphen::Language::FRA = Text::Hyphen::Language::FR
392
+ Text::Hyphen::Language::FRE = Text::Hyphen::Language::FR