@chr33s/pdf-codepoints 5.0.0
- package/LICENSE.md +21 -0
- package/README.md +81 -0
- package/data/ArabicShaping.txt +894 -0
- package/data/Blocks.txt +336 -0
- package/data/CaseFolding.txt +1581 -0
- package/data/CompositionExclusions.txt +208 -0
- package/data/DerivedNormalizationProps.txt +9803 -0
- package/data/EastAsianWidth.txt +2473 -0
- package/data/IndicPositionalCategory.txt +755 -0
- package/data/IndicSyllabicCategory.txt +1286 -0
- package/data/PropertyValueAliases.txt +1541 -0
- package/data/Scripts.txt +2837 -0
- package/data/SpecialCasing.txt +281 -0
- package/data/UnicodeData.txt +32840 -0
- package/data/extracted/DerivedNumericValues.txt +2537 -0
- package/dist/index.d.ts +5 -0
- package/dist/index.js +6 -0
- package/dist/index.js.map +1 -0
- package/dist/parser.d.ts +35 -0
- package/dist/parser.js +308 -0
- package/dist/parser.js.map +1 -0
- package/package.json +40 -0
- package/scripts/update-data.ts +64 -0
- package/src/index.ts +7 -0
- package/src/parser.ts +428 -0
- package/test/parser.test.ts +77 -0
- package/tsconfig.json +10 -0
- package/tsconfig.typecheck.json +14 -0
- package/vitest.config.ts +8 -0
|
@@ -0,0 +1,1286 @@
|
|
|
1
|
+
# IndicSyllabicCategory-12.0.0.txt
|
|
2
|
+
# Date: 2019-01-31, 02:26:00 GMT [KW, RP]
|
|
3
|
+
# © 2019 Unicode®, Inc.
|
|
4
|
+
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
|
|
5
|
+
# For terms of use, see http://www.unicode.org/terms_of_use.html
|
|
6
|
+
#
|
|
7
|
+
# For documentation, see UAX #44: Unicode Character Database,
|
|
8
|
+
# at http://www.unicode.org/reports/tr44/
|
|
9
|
+
#
|
|
10
|
+
# This file defines the following property:
|
|
11
|
+
#
|
|
12
|
+
# Indic_Syllabic_Category enumerated property
|
|
13
|
+
#
|
|
14
|
+
# Scope: This property is aimed at two general problem
|
|
15
|
+
# areas involving the analysis and processing of Indic scripts:
|
|
16
|
+
#
|
|
17
|
+
# 1. Specification of syllabic structure.
|
|
18
|
+
# 2. Specification of segmentation rules.
|
|
19
|
+
#
|
|
20
|
+
# Both of these problem areas may benefit from having defined subtypes
|
|
21
|
+
# of Indic script characters which are relevant to how Indic
|
|
22
|
+
# syllables (or aksaras) are constructed. Note that rules for
|
|
23
|
+
# syllabic structure in Indic scripts may differ significantly
|
|
24
|
+
# from how phonological syllables are defined.
|
|
25
|
+
#
|
|
26
|
+
# Format:
|
|
27
|
+
# Field 0 Unicode code point value or range of code point values
|
|
28
|
+
# Field 1 Indic_Syllabic_Category property value
|
|
29
|
+
#
|
|
30
|
+
# Field 1 is followed by a comment field, starting with the number sign '#',
|
|
31
|
+
# which shows the General_Category property value, the Unicode character name
|
|
32
|
+
# or names, and, in lines with ranges of code points, the code point count in
|
|
33
|
+
# square brackets.
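The field layout described above (Field 0, Field 1, then a `#` comment) can be read with a small line parser. The following is a minimal TypeScript sketch, not the code in this package's `parser.ts`; the names `InscEntry`, `parseInscLine` and `countMatches` are hypothetical and exist only for illustration.

```typescript
// Hypothetical types and helpers; not taken from the package's parser.ts.
interface InscEntry {
  start: number;    // first code point of the range (Field 0)
  end: number;      // last code point; equals start for a single value
  category: string; // Indic_Syllabic_Category property value (Field 1)
  comment: string;  // trailing comment text, without the leading '#'
}

function parseInscLine(line: string): InscEntry | null {
  // The comment field starts at the first '#'; everything before it is data.
  const hash = line.indexOf("#");
  const data = (hash === -1 ? line : line.slice(0, hash)).trim();
  const comment = hash === -1 ? "" : line.slice(hash + 1).trim();
  if (data === "") return null; // blank line or comment-only line

  const fields = data.split(";").map((f) => f.trim());
  if (fields.length < 2) return null;

  // Field 0 is either "XXXX" or "XXXX..YYYY" in hexadecimal.
  const parts = fields[0].split("..");
  const start = parseInt(parts[0], 16);
  const end = parts.length > 1 ? parseInt(parts[1], 16) : start;
  if (isNaN(start) || isNaN(end) || end < start) return null;

  return { start, end, category: fields[1], comment };
}

// For ranges, the comment carries the code point count in square brackets.
// Returns true when no count is present or when it matches the range size.
function countMatches(entry: InscEntry): boolean {
  const m = /\[(\d+)\]/.exec(entry.comment);
  return m === null || Number(m[1]) === entry.end - entry.start + 1;
}
```

Splitting on the first `#` before splitting on `;` keeps character names that contain `..` out of the range parsing. Header lines such as `# @missing: ...` come back as `null` here and would need separate handling.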
|
|
34
|
+
#
|
|
35
|
+
# The scripts assessed as Indic in the structural sense used for the
|
|
36
|
+
# Indic_Syllabic_Category are the following:
|
|
37
|
+
#
|
|
38
|
+
# Ahom, Balinese, Batak, Bengali, Bhaiksuki, Brahmi, Buginese, Buhid,
|
|
39
|
+
# Chakma, Cham, Devanagari, Dogra, Grantha, Gujarati, Gunjala Gondi,
|
|
40
|
+
# Gurmukhi, Hanunoo, Javanese, Kaithi, Kannada, Kayah Li, Kharoshthi,
|
|
41
|
+
# Khmer, Khojki, Khudawadi, Lao, Lepcha, Limbu, Mahajani, Makasar,
|
|
42
|
+
# Malayalam, Marchen, Masaram Gondi, Meetei Mayek, Modi, Multani,
|
|
43
|
+
# Myanmar, Nandinagari, Newa, New Tai Lue, Oriya, Phags-pa, Rejang,
|
|
44
|
+
# Saurashtra, Sharada, Siddham, Sinhala, Soyombo, Sundanese, Syloti
|
|
45
|
+
# Nagri, Tagalog, Tagbanwa, Tai Le, Tai Tham, Tai Viet, Takri, Tamil,
|
|
46
|
+
# Telugu, Thai, Tibetan, Tirhuta, and Zanabazar Square.
|
|
47
|
+
#
|
|
48
|
+
# All characters for all other scripts not in that list
|
|
49
|
+
# take the default value for this property, unless they
|
|
50
|
+
# are individually listed in this data file.
|
|
51
|
+
#
|
|
52
|
+
|
|
53
|
+
# ================================================
|
|
54
|
+
|
|
55
|
+
# Property: Indic_Syllabic_Category
|
|
56
|
+
#
|
|
57
|
+
# All code points not explicitly listed for Indic_Syllabic_Category
|
|
58
|
+
# have the value Other.
|
|
59
|
+
#
|
|
60
|
+
# @missing: 0000..10FFFF; Other
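The `@missing` line means a lookup must fall back to `Other` for any code point the file does not list. The sketch below shows one way to do that with a binary search over sorted ranges. It is an assumption-laden illustration: `indicSyllabicCategory` and `sampleRanges` are hypothetical names, and the table holds only a handful of ranges copied from this file, so code points listed elsewhere in the file also fall through to `Other` here.

```typescript
// Hypothetical names; the sample ranges are a small subset of this data file.
type InscRange = readonly [number, number, string]; // [start, end, category]

// Must be sorted by start and non-overlapping for the binary search below.
const sampleRanges: readonly InscRange[] = [
  [0x0900, 0x0902, "Bindu"],
  [0x0903, 0x0903, "Visarga"],
  [0x0904, 0x0914, "Vowel_Independent"],
  [0x093c, 0x093c, "Nukta"],
  [0x093d, 0x093d, "Avagraha"],
  [0x094d, 0x094d, "Virama"],
];

function indicSyllabicCategory(
  cp: number,
  ranges: readonly InscRange[] = sampleRanges,
): string {
  let lo = 0;
  let hi = ranges.length - 1;
  while (lo <= hi) {
    const mid = (lo + hi) >>> 1;
    const [start, end, category] = ranges[mid];
    if (cp < start) hi = mid - 1;
    else if (cp > end) lo = mid + 1;
    else return category;
  }
  // Not listed: apply the default from "# @missing: 0000..10FFFF; Other".
  return "Other";
}
```

Keeping the default in the lookup rather than in the table avoids storing an entry for every unlisted code point in 0000..10FFFF.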
|
|
61
|
+
|
|
62
|
+
# ================================================
|
|
63
|
+
|
|
64
|
+
# Indic_Syllabic_Category=Bindu
|
|
65
|
+
|
|
66
|
+
# Bindu/Anusvara (nasalization or -n)
|
|
67
|
+
|
|
68
|
+
# [Not derivable]
|
|
69
|
+
|
|
70
|
+
0900..0902 ; Bindu # Mn [3] DEVANAGARI SIGN INVERTED CANDRABINDU..DEVANAGARI SIGN ANUSVARA
|
|
71
|
+
0981 ; Bindu # Mn BENGALI SIGN CANDRABINDU
|
|
72
|
+
0982 ; Bindu # Mc BENGALI SIGN ANUSVARA
|
|
73
|
+
09FC ; Bindu # Lo BENGALI LETTER VEDIC ANUSVARA
|
|
74
|
+
0A01..0A02 ; Bindu # Mn [2] GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN BINDI
|
|
75
|
+
0A70 ; Bindu # Mn GURMUKHI TIPPI
|
|
76
|
+
0A81..0A82 ; Bindu # Mn [2] GUJARATI SIGN CANDRABINDU..GUJARATI SIGN ANUSVARA
|
|
77
|
+
0B01 ; Bindu # Mn ORIYA SIGN CANDRABINDU
|
|
78
|
+
0B02 ; Bindu # Mc ORIYA SIGN ANUSVARA
|
|
79
|
+
0B82 ; Bindu # Mn TAMIL SIGN ANUSVARA
|
|
80
|
+
0C00 ; Bindu # Mn TELUGU SIGN COMBINING CANDRABINDU ABOVE
|
|
81
|
+
0C01..0C02 ; Bindu # Mc [2] TELUGU SIGN CANDRABINDU..TELUGU SIGN ANUSVARA
|
|
82
|
+
0C04 ; Bindu # Mn TELUGU SIGN COMBINING ANUSVARA ABOVE
|
|
83
|
+
0C80 ; Bindu # Lo KANNADA SIGN SPACING CANDRABINDU
|
|
84
|
+
0C81 ; Bindu # Mn KANNADA SIGN CANDRABINDU
|
|
85
|
+
0C82 ; Bindu # Mc KANNADA SIGN ANUSVARA
|
|
86
|
+
0D00..0D01 ; Bindu # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
|
|
87
|
+
0D02 ; Bindu # Mc MALAYALAM SIGN ANUSVARA
|
|
88
|
+
0D82 ; Bindu # Mc SINHALA SIGN ANUSVARAYA
|
|
89
|
+
0E4D ; Bindu # Mn THAI CHARACTER NIKHAHIT
|
|
90
|
+
0ECD ; Bindu # Mn LAO NIGGAHITA
|
|
91
|
+
0F7E ; Bindu # Mn TIBETAN SIGN RJES SU NGA RO
|
|
92
|
+
0F82..0F83 ; Bindu # Mn [2] TIBETAN SIGN NYI ZLA NAA DA..TIBETAN SIGN SNA LDAN
|
|
93
|
+
1036 ; Bindu # Mn MYANMAR SIGN ANUSVARA
|
|
94
|
+
17C6 ; Bindu # Mn KHMER SIGN NIKAHIT
|
|
95
|
+
1932 ; Bindu # Mn LIMBU SMALL LETTER ANUSVARA
|
|
96
|
+
1A74 ; Bindu # Mn TAI THAM SIGN MAI KANG
|
|
97
|
+
1B00..1B02 ; Bindu # Mn [3] BALINESE SIGN ULU RICEM..BALINESE SIGN CECEK
|
|
98
|
+
1B80 ; Bindu # Mn SUNDANESE SIGN PANYECEK
|
|
99
|
+
1C34..1C35 ; Bindu # Mc [2] LEPCHA CONSONANT SIGN NYIN-DO..LEPCHA CONSONANT SIGN KANG
|
|
100
|
+
A80B ; Bindu # Mn SYLOTI NAGRI SIGN ANUSVARA
|
|
101
|
+
A873 ; Bindu # Lo PHAGS-PA LETTER CANDRABINDU
|
|
102
|
+
A880 ; Bindu # Mc SAURASHTRA SIGN ANUSVARA
|
|
103
|
+
A8C5 ; Bindu # Mn SAURASHTRA SIGN CANDRABINDU
|
|
104
|
+
A8F2..A8F3 ; Bindu # Lo [2] DEVANAGARI SIGN SPACING CANDRABINDU..DEVANAGARI SIGN CANDRABINDU VIRAMA
|
|
105
|
+
A980..A981 ; Bindu # Mn [2] JAVANESE SIGN PANYANGGA..JAVANESE SIGN CECAK
|
|
106
|
+
10A0E ; Bindu # Mn KHAROSHTHI SIGN ANUSVARA
|
|
107
|
+
11000 ; Bindu # Mc BRAHMI SIGN CANDRABINDU
|
|
108
|
+
11001 ; Bindu # Mn BRAHMI SIGN ANUSVARA
|
|
109
|
+
11080..11081 ; Bindu # Mn [2] KAITHI SIGN CANDRABINDU..KAITHI SIGN ANUSVARA
|
|
110
|
+
11100..11101 ; Bindu # Mn [2] CHAKMA SIGN CANDRABINDU..CHAKMA SIGN ANUSVARA
|
|
111
|
+
11180..11181 ; Bindu # Mn [2] SHARADA SIGN CANDRABINDU..SHARADA SIGN ANUSVARA
|
|
112
|
+
11234 ; Bindu # Mn KHOJKI SIGN ANUSVARA
|
|
113
|
+
112DF ; Bindu # Mn KHUDAWADI SIGN ANUSVARA
|
|
114
|
+
11300..11301 ; Bindu # Mn [2] GRANTHA SIGN COMBINING ANUSVARA ABOVE..GRANTHA SIGN CANDRABINDU
|
|
115
|
+
11302 ; Bindu # Mc GRANTHA SIGN ANUSVARA
|
|
116
|
+
1135E..1135F ; Bindu # Lo [2] GRANTHA LETTER VEDIC ANUSVARA..GRANTHA LETTER VEDIC DOUBLE ANUSVARA
|
|
117
|
+
11443..11444 ; Bindu # Mn [2] NEWA SIGN CANDRABINDU..NEWA SIGN ANUSVARA
|
|
118
|
+
1145F ; Bindu # Lo NEWA LETTER VEDIC ANUSVARA
|
|
119
|
+
114BF..114C0 ; Bindu # Mn [2] TIRHUTA SIGN CANDRABINDU..TIRHUTA SIGN ANUSVARA
|
|
120
|
+
115BC..115BD ; Bindu # Mn [2] SIDDHAM SIGN CANDRABINDU..SIDDHAM SIGN ANUSVARA
|
|
121
|
+
1163D ; Bindu # Mn MODI SIGN ANUSVARA
|
|
122
|
+
116AB ; Bindu # Mn TAKRI SIGN ANUSVARA
|
|
123
|
+
11837 ; Bindu # Mn DOGRA SIGN ANUSVARA
|
|
124
|
+
119DE ; Bindu # Mc NANDINAGARI SIGN ANUSVARA
|
|
125
|
+
11A35..11A38 ; Bindu # Mn [4] ZANABAZAR SQUARE SIGN CANDRABINDU..ZANABAZAR SQUARE SIGN ANUSVARA
|
|
126
|
+
11A96 ; Bindu # Mn SOYOMBO SIGN ANUSVARA
|
|
127
|
+
11C3C..11C3D ; Bindu # Mn [2] BHAIKSUKI SIGN CANDRABINDU..BHAIKSUKI SIGN ANUSVARA
|
|
128
|
+
11CB5..11CB6 ; Bindu # Mn [2] MARCHEN SIGN ANUSVARA..MARCHEN SIGN CANDRABINDU
|
|
129
|
+
11D40 ; Bindu # Mn MASARAM GONDI SIGN ANUSVARA
|
|
130
|
+
11D95 ; Bindu # Mn GUNJALA GONDI SIGN ANUSVARA
|
|
131
|
+
|
|
132
|
+
# ================================================
|
|
133
|
+
|
|
134
|
+
# Indic_Syllabic_Category=Visarga
|
|
135
|
+
|
|
136
|
+
# Visarga (-h)
|
|
137
|
+
# Excludes letters for jihvamuliya and upadhmaniya, which are
|
|
138
|
+
# related, but structured somewhat differently.
|
|
139
|
+
|
|
140
|
+
# [Not derivable]
|
|
141
|
+
|
|
142
|
+
0903 ; Visarga # Mc DEVANAGARI SIGN VISARGA
|
|
143
|
+
0983 ; Visarga # Mc BENGALI SIGN VISARGA
|
|
144
|
+
0A03 ; Visarga # Mc GURMUKHI SIGN VISARGA
|
|
145
|
+
0A83 ; Visarga # Mc GUJARATI SIGN VISARGA
|
|
146
|
+
0B03 ; Visarga # Mc ORIYA SIGN VISARGA
|
|
147
|
+
0C03 ; Visarga # Mc TELUGU SIGN VISARGA
|
|
148
|
+
0C83 ; Visarga # Mc KANNADA SIGN VISARGA
|
|
149
|
+
0D03 ; Visarga # Mc MALAYALAM SIGN VISARGA
|
|
150
|
+
0D83 ; Visarga # Mc SINHALA SIGN VISARGAYA
|
|
151
|
+
0F7F ; Visarga # Mc TIBETAN SIGN RNAM BCAD
|
|
152
|
+
1038 ; Visarga # Mc MYANMAR SIGN VISARGA
|
|
153
|
+
17C7 ; Visarga # Mc KHMER SIGN REAHMUK
|
|
154
|
+
1B04 ; Visarga # Mc BALINESE SIGN BISAH
|
|
155
|
+
1B82 ; Visarga # Mc SUNDANESE SIGN PANGWISAD
|
|
156
|
+
A881 ; Visarga # Mc SAURASHTRA SIGN VISARGA
|
|
157
|
+
A983 ; Visarga # Mc JAVANESE SIGN WIGNYAN
|
|
158
|
+
AAF5 ; Visarga # Mc MEETEI MAYEK VOWEL SIGN VISARGA
|
|
159
|
+
10A0F ; Visarga # Mn KHAROSHTHI SIGN VISARGA
|
|
160
|
+
11002 ; Visarga # Mc BRAHMI SIGN VISARGA
|
|
161
|
+
11082 ; Visarga # Mc KAITHI SIGN VISARGA
|
|
162
|
+
11102 ; Visarga # Mn CHAKMA SIGN VISARGA
|
|
163
|
+
11182 ; Visarga # Mc SHARADA SIGN VISARGA
|
|
164
|
+
11303 ; Visarga # Mc GRANTHA SIGN VISARGA
|
|
165
|
+
11445 ; Visarga # Mc NEWA SIGN VISARGA
|
|
166
|
+
114C1 ; Visarga # Mc TIRHUTA SIGN VISARGA
|
|
167
|
+
115BE ; Visarga # Mc SIDDHAM SIGN VISARGA
|
|
168
|
+
1163E ; Visarga # Mc MODI SIGN VISARGA
|
|
169
|
+
116AC ; Visarga # Mc TAKRI SIGN VISARGA
|
|
170
|
+
11838 ; Visarga # Mc DOGRA SIGN VISARGA
|
|
171
|
+
119DF ; Visarga # Mc NANDINAGARI SIGN VISARGA
|
|
172
|
+
11A39 ; Visarga # Mc ZANABAZAR SQUARE SIGN VISARGA
|
|
173
|
+
11A97 ; Visarga # Mc SOYOMBO SIGN VISARGA
|
|
174
|
+
11C3E ; Visarga # Mc BHAIKSUKI SIGN VISARGA
|
|
175
|
+
11D41 ; Visarga # Mn MASARAM GONDI SIGN VISARGA
|
|
176
|
+
11D96 ; Visarga # Mc GUNJALA GONDI SIGN VISARGA
|
|
177
|
+
|
|
178
|
+
# ================================================
|
|
179
|
+
|
|
180
|
+
# Indic_Syllabic_Category=Avagraha
|
|
181
|
+
|
|
182
|
+
# Avagraha (elision of initial a- in sandhi)
|
|
183
|
+
|
|
184
|
+
# [Not derivable]
|
|
185
|
+
|
|
186
|
+
093D ; Avagraha # Lo DEVANAGARI SIGN AVAGRAHA
|
|
187
|
+
09BD ; Avagraha # Lo BENGALI SIGN AVAGRAHA
|
|
188
|
+
0ABD ; Avagraha # Lo GUJARATI SIGN AVAGRAHA
|
|
189
|
+
0B3D ; Avagraha # Lo ORIYA SIGN AVAGRAHA
|
|
190
|
+
0C3D ; Avagraha # Lo TELUGU SIGN AVAGRAHA
|
|
191
|
+
0CBD ; Avagraha # Lo KANNADA SIGN AVAGRAHA
|
|
192
|
+
0D3D ; Avagraha # Lo MALAYALAM SIGN AVAGRAHA
|
|
193
|
+
0F85 ; Avagraha # Po TIBETAN MARK PALUTA
|
|
194
|
+
17DC ; Avagraha # Lo KHMER SIGN AVAKRAHASANYA
|
|
195
|
+
1BBA ; Avagraha # Lo SUNDANESE AVAGRAHA
|
|
196
|
+
111C1 ; Avagraha # Lo SHARADA SIGN AVAGRAHA
|
|
197
|
+
1133D ; Avagraha # Lo GRANTHA SIGN AVAGRAHA
|
|
198
|
+
11447 ; Avagraha # Lo NEWA SIGN AVAGRAHA
|
|
199
|
+
114C4 ; Avagraha # Lo TIRHUTA SIGN AVAGRAHA
|
|
200
|
+
119E1 ; Avagraha # Lo NANDINAGARI SIGN AVAGRAHA
|
|
201
|
+
11A9D ; Avagraha # Lo SOYOMBO MARK PLUTA
|
|
202
|
+
11C40 ; Avagraha # Lo BHAIKSUKI SIGN AVAGRAHA
|
|
203
|
+
|
|
204
|
+
# ================================================
|
|
205
|
+
|
|
206
|
+
# Indic_Syllabic_Category=Nukta
|
|
207
|
+
|
|
208
|
+
# Nukta (diacritic for borrowed consonants or other consonant
|
|
209
|
+
# modifications). Note that while the resulting sound is typically a
|
|
210
|
+
# consonant, the base letter a nukta follows may be an independent
|
|
211
|
+
# vowel. For example, <U+0A85 GUJARATI LETTER A, U+0AFD GUJARATI
|
|
212
|
+
# SIGN THREE-DOT NUKTA ABOVE> is used to transcribe ARABIC LETTER
|
|
213
|
+
# AIN.
|
|
214
|
+
|
|
215
|
+
# [Not derivable]
|
|
216
|
+
|
|
217
|
+
093C ; Nukta # Mn DEVANAGARI SIGN NUKTA
|
|
218
|
+
09BC ; Nukta # Mn BENGALI SIGN NUKTA
|
|
219
|
+
0A3C ; Nukta # Mn GURMUKHI SIGN NUKTA
|
|
220
|
+
0ABC ; Nukta # Mn GUJARATI SIGN NUKTA
|
|
221
|
+
0AFD..0AFF ; Nukta # Mn [3] GUJARATI SIGN THREE-DOT NUKTA ABOVE..GUJARATI SIGN TWO-CIRCLE NUKTA ABOVE
|
|
222
|
+
0B3C ; Nukta # Mn ORIYA SIGN NUKTA
|
|
223
|
+
0CBC ; Nukta # Mn KANNADA SIGN NUKTA
|
|
224
|
+
0F39 ; Nukta # Mn TIBETAN MARK TSA -PHRU
|
|
225
|
+
1B34 ; Nukta # Mn BALINESE SIGN REREKAN
|
|
226
|
+
1BE6 ; Nukta # Mn BATAK SIGN TOMPI
|
|
227
|
+
1C37 ; Nukta # Mn LEPCHA SIGN NUKTA
|
|
228
|
+
A9B3 ; Nukta # Mn JAVANESE SIGN CECAK TELU
|
|
229
|
+
10A38..10A3A ; Nukta # Mn [3] KHAROSHTHI SIGN BAR ABOVE..KHAROSHTHI SIGN DOT BELOW
|
|
230
|
+
110BA ; Nukta # Mn KAITHI SIGN NUKTA
|
|
231
|
+
11173 ; Nukta # Mn MAHAJANI SIGN NUKTA
|
|
232
|
+
111CA ; Nukta # Mn SHARADA SIGN NUKTA
|
|
233
|
+
11236 ; Nukta # Mn KHOJKI SIGN NUKTA
|
|
234
|
+
112E9 ; Nukta # Mn KHUDAWADI SIGN NUKTA
|
|
235
|
+
1133B..1133C ; Nukta # Mn [2] COMBINING BINDU BELOW..GRANTHA SIGN NUKTA
|
|
236
|
+
11446 ; Nukta # Mn NEWA SIGN NUKTA
|
|
237
|
+
114C3 ; Nukta # Mn TIRHUTA SIGN NUKTA
|
|
238
|
+
115C0 ; Nukta # Mn SIDDHAM SIGN NUKTA
|
|
239
|
+
116B7 ; Nukta # Mn TAKRI SIGN NUKTA
|
|
240
|
+
1183A ; Nukta # Mn DOGRA SIGN NUKTA
|
|
241
|
+
11D42 ; Nukta # Mn MASARAM GONDI SIGN NUKTA
|
|
242
|
+
|
|
243
|
+
# ================================================
|
|
244
|
+
|
|
245
|
+
# Indic_Syllabic_Category=Virama
|
|
246
|
+
|
|
247
|
+
# Virama (killing of inherent vowel in consonant sequence
|
|
248
|
+
# or consonant stacker)
|
|
249
|
+
# Only includes characters that can act both as visible killer viramas
|
|
250
|
+
# and consonant stackers. Separate property values exist for characters
|
|
251
|
+
# that can only act as pure killers or only as consonant stackers.
|
|
252
|
+
|
|
253
|
+
# [Derivation: (ccc=9) - (InSC=Pure_Killer) - (InSC=Invisible_Stacker)
|
|
254
|
+
# - (InSC=Number_Joiner) - 2D7F]
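The derivation above is a chain of set differences: start from the code points with canonical combining class 9, then remove the Pure_Killer, Invisible_Stacker and Number_Joiner sets and the single code point 2D7F. A minimal TypeScript sketch follows, using a tiny illustrative subset. The ccc=9 membership and the Number_Joiner entry (1107F) are assumptions not shown in this excerpt; a real check would read them from UnicodeData.txt and the rest of this file. The helper name `difference` is hypothetical.

```typescript
// Illustrative subset only; the ccc=9 membership is assumed for this example.
const ccc9 = [0x094d, 0x1039, 0x103a, 0x2d7f, 0x1107f];

const pureKiller = [0x103a];        // MYANMAR SIGN ASAT (listed in this file)
const invisibleStacker = [0x1039];  // MYANMAR SIGN VIRAMA (listed in this file)
const numberJoiner = [0x1107f];     // assumed Number_Joiner member, not in this excerpt
const explicitExclusions = [0x2d7f]; // the literal "- 2D7F" term of the derivation

// Returns the members of base that appear in none of the sets to remove.
function difference(base: number[], ...remove: number[][]): number[] {
  return base.filter((cp) => remove.every((set) => set.indexOf(cp) === -1));
}

// (ccc=9) - Pure_Killer - Invisible_Stacker - Number_Joiner - 2D7F
const virama = difference(
  ccc9,
  pureKiller,
  invisibleStacker,
  numberJoiner,
  explicitExclusions,
);
```

With this subset only 094D (DEVANAGARI SIGN VIRAMA) survives, which agrees with its Virama entry in the list that follows. Arrays with `indexOf` are used instead of `Set` to keep the sketch independent of compiler target settings; for the full data a `Set` would be the more natural choice.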
|
|
255
|
+
|
|
256
|
+
094D ; Virama # Mn DEVANAGARI SIGN VIRAMA
|
|
257
|
+
09CD ; Virama # Mn BENGALI SIGN VIRAMA
|
|
258
|
+
0A4D ; Virama # Mn GURMUKHI SIGN VIRAMA
|
|
259
|
+
0ACD ; Virama # Mn GUJARATI SIGN VIRAMA
|
|
260
|
+
0B4D ; Virama # Mn ORIYA SIGN VIRAMA
|
|
261
|
+
0BCD ; Virama # Mn TAMIL SIGN VIRAMA
|
|
262
|
+
0C4D ; Virama # Mn TELUGU SIGN VIRAMA
|
|
263
|
+
0CCD ; Virama # Mn KANNADA SIGN VIRAMA
|
|
264
|
+
0D4D ; Virama # Mn MALAYALAM SIGN VIRAMA
|
|
265
|
+
0DCA ; Virama # Mn SINHALA SIGN AL-LAKUNA
|
|
266
|
+
1B44 ; Virama # Mc BALINESE ADEG ADEG
|
|
267
|
+
A806 ; Virama # Mn SYLOTI NAGRI SIGN HASANTA
|
|
268
|
+
A8C4 ; Virama # Mn SAURASHTRA SIGN VIRAMA
|
|
269
|
+
A9C0 ; Virama # Mc JAVANESE PANGKON
|
|
270
|
+
11046 ; Virama # Mn BRAHMI VIRAMA
|
|
271
|
+
110B9 ; Virama # Mn KAITHI SIGN VIRAMA
|
|
272
|
+
111C0 ; Virama # Mc SHARADA SIGN VIRAMA
|
|
273
|
+
11235 ; Virama # Mc KHOJKI SIGN VIRAMA
|
|
274
|
+
1134D ; Virama # Mc GRANTHA SIGN VIRAMA
|
|
275
|
+
11442 ; Virama # Mn NEWA SIGN VIRAMA
|
|
276
|
+
114C2 ; Virama # Mn TIRHUTA SIGN VIRAMA
|
|
277
|
+
115BF ; Virama # Mn SIDDHAM SIGN VIRAMA
|
|
278
|
+
1163F ; Virama # Mn MODI SIGN VIRAMA
|
|
279
|
+
116B6 ; Virama # Mc TAKRI SIGN VIRAMA
|
|
280
|
+
11839 ; Virama # Mn DOGRA SIGN VIRAMA
|
|
281
|
+
119E0 ; Virama # Mn NANDINAGARI SIGN VIRAMA
|
|
282
|
+
11C3F ; Virama # Mn BHAIKSUKI SIGN VIRAMA
|
|
283
|
+
|
|
284
|
+
# ================================================
|
|
285
|
+
|
|
286
|
+
# Indic_Syllabic_Category=Pure_Killer
|
|
287
|
+
|
|
288
|
+
# Pure killer (killing of inherent vowel in consonant sequence,
|
|
289
|
+
# with no consonant stacking behavior)
|
|
290
|
+
|
|
291
|
+
# [Not derivable]
|
|
292
|
+
|
|
293
|
+
0D3B..0D3C ; Pure_Killer # Mn [2] MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALAM SIGN CIRCULAR VIRAMA
|
|
294
|
+
0E3A ; Pure_Killer # Mn THAI CHARACTER PHINTHU
|
|
295
|
+
0E4E ; Pure_Killer # Mn THAI CHARACTER YAMAKKAN
|
|
296
|
+
0EBA ; Pure_Killer # Mn LAO SIGN PALI VIRAMA
|
|
297
|
+
0F84 ; Pure_Killer # Mn TIBETAN MARK HALANTA
|
|
298
|
+
103A ; Pure_Killer # Mn MYANMAR SIGN ASAT
|
|
299
|
+
1714 ; Pure_Killer # Mn TAGALOG SIGN VIRAMA
|
|
300
|
+
1734 ; Pure_Killer # Mn HANUNOO SIGN PAMUDPOD
|
|
301
|
+
17D1 ; Pure_Killer # Mn KHMER SIGN VIRIAM
|
|
302
|
+
1A7A ; Pure_Killer # Mn TAI THAM SIGN RA HAAM
|
|
303
|
+
1BAA ; Pure_Killer # Mc SUNDANESE SIGN PAMAAEH
|
|
304
|
+
1BF2..1BF3 ; Pure_Killer # Mc [2] BATAK PANGOLAT..BATAK PANONGONAN
|
|
305
|
+
A953 ; Pure_Killer # Mc REJANG VIRAMA
|
|
306
|
+
ABED ; Pure_Killer # Mn MEETEI MAYEK APUN IYEK
|
|
307
|
+
11134 ; Pure_Killer # Mn CHAKMA MAAYYAA
|
|
308
|
+
112EA ; Pure_Killer # Mn KHUDAWADI SIGN VIRAMA
|
|
309
|
+
1172B ; Pure_Killer # Mn AHOM SIGN KILLER
|
|
310
|
+
11A34 ; Pure_Killer # Mn ZANABAZAR SQUARE SIGN VIRAMA
|
|
311
|
+
11D44 ; Pure_Killer # Mn MASARAM GONDI SIGN HALANTA
|
|
312
|
+
|
|
313
|
+
# ================================================
|
|
314
|
+
|
|
315
|
+
# Indic_Syllabic_Category=Invisible_Stacker
|
|
316
|
+
|
|
317
|
+
# Invisible stacker (invisible consonant stacker virama).
|
|
318
|
+
#
|
|
319
|
+
# Note that in some scripts, such as Kharoshthi and Masaram Gondi, an invisible
|
|
320
|
+
# stacker may have a second function, changing the shape and/or location of the
|
|
321
|
+
# consonant preceding it, even when there is no consonant following the
|
|
322
|
+
# invisible stacker.
|
|
323
|
+
|
|
324
|
+
# [Not derivable]
|
|
325
|
+
|
|
326
|
+
1039 ; Invisible_Stacker # Mn MYANMAR SIGN VIRAMA
|
|
327
|
+
17D2 ; Invisible_Stacker # Mn KHMER SIGN COENG
|
|
328
|
+
1A60 ; Invisible_Stacker # Mn TAI THAM SIGN SAKOT
|
|
329
|
+
1BAB ; Invisible_Stacker # Mn SUNDANESE SIGN VIRAMA
|
|
330
|
+
AAF6 ; Invisible_Stacker # Mn MEETEI MAYEK VIRAMA
|
|
331
|
+
10A3F ; Invisible_Stacker # Mn KHAROSHTHI VIRAMA
|
|
332
|
+
11133 ; Invisible_Stacker # Mn CHAKMA VIRAMA
|
|
333
|
+
11A47 ; Invisible_Stacker # Mn ZANABAZAR SQUARE SUBJOINER
|
|
334
|
+
11A99 ; Invisible_Stacker # Mn SOYOMBO SUBJOINER
|
|
335
|
+
11D45 ; Invisible_Stacker # Mn MASARAM GONDI VIRAMA
|
|
336
|
+
11D97 ; Invisible_Stacker # Mn GUNJALA GONDI VIRAMA
|
|
337
|
+
|
|
338
|
+
# ================================================
|
|
339
|
+
|
|
340
|
+
# Indic_Syllabic_Category=Vowel_Independent
|
|
341
|
+
|
|
342
|
+
# Independent Vowels (contrasted with matras)
|
|
343
|
+
|
|
344
|
+
# [Not derivable]
|
|
345
|
+
|
|
346
|
+
0904..0914 ; Vowel_Independent # Lo [17] DEVANAGARI LETTER SHORT A..DEVANAGARI LETTER AU
|
|
347
|
+
0960..0961 ; Vowel_Independent # Lo [2] DEVANAGARI LETTER VOCALIC RR..DEVANAGARI LETTER VOCALIC LL
|
|
348
|
+
0972..0977 ; Vowel_Independent # Lo [6] DEVANAGARI LETTER CANDRA A..DEVANAGARI LETTER UUE
|
|
349
|
+
0985..098C ; Vowel_Independent # Lo [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
|
|
350
|
+
098F..0990 ; Vowel_Independent # Lo [2] BENGALI LETTER E..BENGALI LETTER AI
|
|
351
|
+
0993..0994 ; Vowel_Independent # Lo [2] BENGALI LETTER O..BENGALI LETTER AU
|
|
352
|
+
09E0..09E1 ; Vowel_Independent # Lo [2] BENGALI LETTER VOCALIC RR..BENGALI LETTER VOCALIC LL
|
|
353
|
+
0A05..0A0A ; Vowel_Independent # Lo [6] GURMUKHI LETTER A..GURMUKHI LETTER UU
|
|
354
|
+
0A0F..0A10 ; Vowel_Independent # Lo [2] GURMUKHI LETTER EE..GURMUKHI LETTER AI
|
|
355
|
+
0A13..0A14 ; Vowel_Independent # Lo [2] GURMUKHI LETTER OO..GURMUKHI LETTER AU
|
|
356
|
+
0A85..0A8D ; Vowel_Independent # Lo [9] GUJARATI LETTER A..GUJARATI VOWEL CANDRA E
|
|
357
|
+
0A8F..0A91 ; Vowel_Independent # Lo [3] GUJARATI LETTER E..GUJARATI VOWEL CANDRA O
|
|
358
|
+
0A93..0A94 ; Vowel_Independent # Lo [2] GUJARATI LETTER O..GUJARATI LETTER AU
|
|
359
|
+
0AE0..0AE1 ; Vowel_Independent # Lo [2] GUJARATI LETTER VOCALIC RR..GUJARATI LETTER VOCALIC LL
|
|
360
|
+
0B05..0B0C ; Vowel_Independent # Lo [8] ORIYA LETTER A..ORIYA LETTER VOCALIC L
|
|
361
|
+
0B0F..0B10 ; Vowel_Independent # Lo [2] ORIYA LETTER E..ORIYA LETTER AI
|
|
362
|
+
0B13..0B14 ; Vowel_Independent # Lo [2] ORIYA LETTER O..ORIYA LETTER AU
|
|
363
|
+
0B60..0B61 ; Vowel_Independent # Lo [2] ORIYA LETTER VOCALIC RR..ORIYA LETTER VOCALIC LL
|
|
364
|
+
0B85..0B8A ; Vowel_Independent # Lo [6] TAMIL LETTER A..TAMIL LETTER UU
|
|
365
|
+
0B8E..0B90 ; Vowel_Independent # Lo [3] TAMIL LETTER E..TAMIL LETTER AI
|
|
366
|
+
0B92..0B94 ; Vowel_Independent # Lo [3] TAMIL LETTER O..TAMIL LETTER AU
|
|
367
|
+
0C05..0C0C ; Vowel_Independent # Lo [8] TELUGU LETTER A..TELUGU LETTER VOCALIC L
|
|
368
|
+
0C0E..0C10 ; Vowel_Independent # Lo [3] TELUGU LETTER E..TELUGU LETTER AI
|
|
369
|
+
0C12..0C14 ; Vowel_Independent # Lo [3] TELUGU LETTER O..TELUGU LETTER AU
|
|
370
|
+
0C60..0C61 ; Vowel_Independent # Lo [2] TELUGU LETTER VOCALIC RR..TELUGU LETTER VOCALIC LL
|
|
371
|
+
0C85..0C8C ; Vowel_Independent # Lo [8] KANNADA LETTER A..KANNADA LETTER VOCALIC L
|
|
372
|
+
0C8E..0C90 ; Vowel_Independent # Lo [3] KANNADA LETTER E..KANNADA LETTER AI
|
|
373
|
+
0C92..0C94 ; Vowel_Independent # Lo [3] KANNADA LETTER O..KANNADA LETTER AU
|
|
374
|
+
0CE0..0CE1 ; Vowel_Independent # Lo [2] KANNADA LETTER VOCALIC RR..KANNADA LETTER VOCALIC LL
|
|
375
|
+
0D05..0D0C ; Vowel_Independent # Lo [8] MALAYALAM LETTER A..MALAYALAM LETTER VOCALIC L
|
|
376
|
+
0D0E..0D10 ; Vowel_Independent # Lo [3] MALAYALAM LETTER E..MALAYALAM LETTER AI
|
|
377
|
+
0D12..0D14 ; Vowel_Independent # Lo [3] MALAYALAM LETTER O..MALAYALAM LETTER AU
|
|
378
|
+
0D5F..0D61 ; Vowel_Independent # Lo [3] MALAYALAM LETTER ARCHAIC II..MALAYALAM LETTER VOCALIC LL
|
|
379
|
+
0D85..0D96 ; Vowel_Independent # Lo [18] SINHALA LETTER AYANNA..SINHALA LETTER AUYANNA
|
|
380
|
+
1021..102A ; Vowel_Independent # Lo [10] MYANMAR LETTER A..MYANMAR LETTER AU
|
|
381
|
+
1052..1055 ; Vowel_Independent # Lo [4] MYANMAR LETTER VOCALIC R..MYANMAR LETTER VOCALIC LL
|
|
382
|
+
1700..1702 ; Vowel_Independent # Lo [3] TAGALOG LETTER A..TAGALOG LETTER U
|
|
383
|
+
1720..1722 ; Vowel_Independent # Lo [3] HANUNOO LETTER A..HANUNOO LETTER U
|
|
384
|
+
1740..1742 ; Vowel_Independent # Lo [3] BUHID LETTER A..BUHID LETTER U
|
|
385
|
+
1760..1762 ; Vowel_Independent # Lo [3] TAGBANWA LETTER A..TAGBANWA LETTER U
|
|
386
|
+
17A3..17B3 ; Vowel_Independent # Lo [17] KHMER INDEPENDENT VOWEL QAQ..KHMER INDEPENDENT VOWEL QAU
|
|
387
|
+
1A4D..1A52 ; Vowel_Independent # Lo [6] TAI THAM LETTER I..TAI THAM LETTER OO
|
|
388
|
+
1B05..1B12 ; Vowel_Independent # Lo [14] BALINESE LETTER AKARA..BALINESE LETTER OKARA TEDUNG
|
|
389
|
+
1B83..1B89 ; Vowel_Independent # Lo [7] SUNDANESE LETTER A..SUNDANESE LETTER EU
|
|
390
|
+
1BE4..1BE5 ; Vowel_Independent # Lo [2] BATAK LETTER I..BATAK LETTER U
|
|
391
|
+
A800..A801 ; Vowel_Independent # Lo [2] SYLOTI NAGRI LETTER A..SYLOTI NAGRI LETTER I
|
|
392
|
+
A803..A805 ; Vowel_Independent # Lo [3] SYLOTI NAGRI LETTER U..SYLOTI NAGRI LETTER O
|
|
393
|
+
A882..A891 ; Vowel_Independent # Lo [16] SAURASHTRA LETTER A..SAURASHTRA LETTER AU
|
|
394
|
+
A8FE ; Vowel_Independent # Lo DEVANAGARI LETTER AY
|
|
395
|
+
A984..A988 ; Vowel_Independent # Lo [5] JAVANESE LETTER A..JAVANESE LETTER U
|
|
396
|
+
A98C..A98E ; Vowel_Independent # Lo [3] JAVANESE LETTER E..JAVANESE LETTER O
|
|
397
|
+
AA00..AA05 ; Vowel_Independent # Lo [6] CHAM LETTER A..CHAM LETTER O
|
|
398
|
+
AAE0..AAE1 ; Vowel_Independent # Lo [2] MEETEI MAYEK LETTER E..MEETEI MAYEK LETTER O
|
|
399
|
+
ABCE..ABCF ; Vowel_Independent # Lo [2] MEETEI MAYEK LETTER UN..MEETEI MAYEK LETTER I
|
|
400
|
+
ABD1 ; Vowel_Independent # Lo MEETEI MAYEK LETTER ATIYA
|
|
401
|
+
11005..11012 ; Vowel_Independent # Lo [14] BRAHMI LETTER A..BRAHMI LETTER AU
|
|
402
|
+
11083..1108C ; Vowel_Independent # Lo [10] KAITHI LETTER A..KAITHI LETTER AU
|
|
403
|
+
11103..11106 ; Vowel_Independent # Lo [4] CHAKMA LETTER AA..CHAKMA LETTER E
|
|
404
|
+
11183..11190 ; Vowel_Independent # Lo [14] SHARADA LETTER A..SHARADA LETTER AU
|
|
405
|
+
11200..11207 ; Vowel_Independent # Lo [8] KHOJKI LETTER A..KHOJKI LETTER AU
|
|
406
|
+
11280..11283 ; Vowel_Independent # Lo [4] MULTANI LETTER A..MULTANI LETTER E
|
|
407
|
+
112B0..112B9 ; Vowel_Independent # Lo [10] KHUDAWADI LETTER A..KHUDAWADI LETTER AU
|
|
408
|
+
11305..1130C ; Vowel_Independent # Lo [8] GRANTHA LETTER A..GRANTHA LETTER VOCALIC L
|
|
409
|
+
1130F..11310 ; Vowel_Independent # Lo [2] GRANTHA LETTER EE..GRANTHA LETTER AI
|
|
410
|
+
11313..11314 ; Vowel_Independent # Lo [2] GRANTHA LETTER OO..GRANTHA LETTER AU
|
|
411
|
+
11360..11361 ; Vowel_Independent # Lo [2] GRANTHA LETTER VOCALIC RR..GRANTHA LETTER VOCALIC LL
|
|
412
|
+
11400..1140D ; Vowel_Independent # Lo [14] NEWA LETTER A..NEWA LETTER AU
|
|
413
|
+
11481..1148E ; Vowel_Independent # Lo [14] TIRHUTA LETTER A..TIRHUTA LETTER AU
|
|
414
|
+
11580..1158D ; Vowel_Independent # Lo [14] SIDDHAM LETTER A..SIDDHAM LETTER AU
|
|
415
|
+
115D8..115DB ; Vowel_Independent # Lo [4] SIDDHAM LETTER THREE-CIRCLE ALTERNATE I..SIDDHAM LETTER ALTERNATE U
|
|
416
|
+
11600..1160D ; Vowel_Independent # Lo [14] MODI LETTER A..MODI LETTER AU
|
|
417
|
+
11680..11689 ; Vowel_Independent # Lo [10] TAKRI LETTER A..TAKRI LETTER AU
|
|
418
|
+
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU
|
|
419
|
+
119A0..119A7 ; Vowel_Independent # Lo [8] NANDINAGARI LETTER A..NANDINAGARI LETTER VOCALIC RR
|
|
420
|
+
119AA..119AD ; Vowel_Independent # Lo [4] NANDINAGARI LETTER E..NANDINAGARI LETTER AU
|
|
421
|
+
11A00 ; Vowel_Independent # Lo ZANABAZAR SQUARE LETTER A
|
|
422
|
+
11A50 ; Vowel_Independent # Lo SOYOMBO LETTER A
|
|
423
|
+
11C00..11C08 ; Vowel_Independent # Lo [9] BHAIKSUKI LETTER A..BHAIKSUKI LETTER VOCALIC L
|
|
424
|
+
11C0A..11C0D ; Vowel_Independent # Lo [4] BHAIKSUKI LETTER E..BHAIKSUKI LETTER AU
|
|
425
|
+
11D00..11D06 ; Vowel_Independent # Lo [7] MASARAM GONDI LETTER A..MASARAM GONDI LETTER E
|
|
426
|
+
11D08..11D09 ; Vowel_Independent # Lo [2] MASARAM GONDI LETTER AI..MASARAM GONDI LETTER O
|
|
427
|
+
11D0B ; Vowel_Independent # Lo MASARAM GONDI LETTER AU
|
|
428
|
+
11D60..11D65 ; Vowel_Independent # Lo [6] GUNJALA GONDI LETTER A..GUNJALA GONDI LETTER UU
|
|
429
|
+
11D67..11D68 ; Vowel_Independent # Lo [2] GUNJALA GONDI LETTER EE..GUNJALA GONDI LETTER AI
|
|
430
|
+
11D6A..11D6B ; Vowel_Independent # Lo [2] GUNJALA GONDI LETTER OO..GUNJALA GONDI LETTER AU
|
|
431
|
+
|
|
432
|
+
# ================================================
|
|
433
|
+
|
|
434
|
+
# Indic_Syllabic_Category=Vowel_Dependent
|
|
435
|
+
|
|
436
|
+
# Dependent Vowels (contrasted with independent vowels and/or with
|
|
437
|
+
# complex placement). Known as matras in Indic scripts. Also
|
|
438
|
+
# includes vowel modifiers that follow dependent (and sometimes
|
|
439
|
+
# independent) vowels.
|
|
440
|
+
|
|
441
|
+
# [Not derivable]
|
|
442
|
+
|
|
443
|
+
093A ; Vowel_Dependent # Mn DEVANAGARI VOWEL SIGN OE
|
|
444
|
+
093B ; Vowel_Dependent # Mc DEVANAGARI VOWEL SIGN OOE
|
|
445
|
+
093E..0940 ; Vowel_Dependent # Mc [3] DEVANAGARI VOWEL SIGN AA..DEVANAGARI VOWEL SIGN II
|
|
446
|
+
0941..0948 ; Vowel_Dependent # Mn [8] DEVANAGARI VOWEL SIGN U..DEVANAGARI VOWEL SIGN AI
|
|
447
|
+
0949..094C ; Vowel_Dependent # Mc [4] DEVANAGARI VOWEL SIGN CANDRA O..DEVANAGARI VOWEL SIGN AU
|
|
448
|
+
094E..094F ; Vowel_Dependent # Mc [2] DEVANAGARI VOWEL SIGN PRISHTHAMATRA E..DEVANAGARI VOWEL SIGN AW
|
|
449
|
+
0955..0957 ; Vowel_Dependent # Mn [3] DEVANAGARI VOWEL SIGN CANDRA LONG E..DEVANAGARI VOWEL SIGN UUE
|
|
450
|
+
0962..0963 ; Vowel_Dependent # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
|
|
451
|
+
09BE..09C0 ; Vowel_Dependent # Mc [3] BENGALI VOWEL SIGN AA..BENGALI VOWEL SIGN II
|
|
452
|
+
09C1..09C4 ; Vowel_Dependent # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
|
|
453
|
+
09C7..09C8 ; Vowel_Dependent # Mc [2] BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
|
|
454
|
+
09CB..09CC ; Vowel_Dependent # Mc [2] BENGALI VOWEL SIGN O..BENGALI VOWEL SIGN AU
|
|
455
|
+
09D7 ; Vowel_Dependent # Mc BENGALI AU LENGTH MARK
|
|
456
|
+
09E2..09E3 ; Vowel_Dependent # Mn [2] BENGALI VOWEL SIGN VOCALIC L..BENGALI VOWEL SIGN VOCALIC LL
|
|
457
|
+
0A3E..0A40 ; Vowel_Dependent # Mc [3] GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN II
|
|
458
|
+
0A41..0A42 ; Vowel_Dependent # Mn [2] GURMUKHI VOWEL SIGN U..GURMUKHI VOWEL SIGN UU
|
|
459
|
+
0A47..0A48 ; Vowel_Dependent # Mn [2] GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN AI
|
|
460
|
+
0A4B..0A4C ; Vowel_Dependent # Mn [2] GURMUKHI VOWEL SIGN OO..GURMUKHI VOWEL SIGN AU
|
|
461
|
+
0ABE..0AC0 ; Vowel_Dependent # Mc [3] GUJARATI VOWEL SIGN AA..GUJARATI VOWEL SIGN II
|
|
462
|
+
0AC1..0AC5 ; Vowel_Dependent # Mn [5] GUJARATI VOWEL SIGN U..GUJARATI VOWEL SIGN CANDRA E
|
|
463
|
+
0AC7..0AC8 ; Vowel_Dependent # Mn [2] GUJARATI VOWEL SIGN E..GUJARATI VOWEL SIGN AI
|
|
464
|
+
0AC9 ; Vowel_Dependent # Mc GUJARATI VOWEL SIGN CANDRA O
|
|
465
|
+
0ACB..0ACC ; Vowel_Dependent # Mc [2] GUJARATI VOWEL SIGN O..GUJARATI VOWEL SIGN AU
|
|
466
|
+
0AE2..0AE3 ; Vowel_Dependent # Mn [2] GUJARATI VOWEL SIGN VOCALIC L..GUJARATI VOWEL SIGN VOCALIC LL
|
|
467
|
+
0B3E ; Vowel_Dependent # Mc ORIYA VOWEL SIGN AA
|
|
468
|
+
0B3F ; Vowel_Dependent # Mn ORIYA VOWEL SIGN I
|
|
469
|
+
0B40 ; Vowel_Dependent # Mc ORIYA VOWEL SIGN II
|
|
470
|
+
0B41..0B44 ; Vowel_Dependent # Mn [4] ORIYA VOWEL SIGN U..ORIYA VOWEL SIGN VOCALIC RR
|
|
471
|
+
0B47..0B48 ; Vowel_Dependent # Mc [2] ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
|
|
472
|
+
0B4B..0B4C ; Vowel_Dependent # Mc [2] ORIYA VOWEL SIGN O..ORIYA VOWEL SIGN AU
|
|
473
|
+
0B56 ; Vowel_Dependent # Mn ORIYA AI LENGTH MARK
|
|
474
|
+
0B57 ; Vowel_Dependent # Mc ORIYA AU LENGTH MARK
|
|
475
|
+
0B62..0B63 ; Vowel_Dependent # Mn [2] ORIYA VOWEL SIGN VOCALIC L..ORIYA VOWEL SIGN VOCALIC LL
|
|
476
|
+
0BBE..0BBF ; Vowel_Dependent # Mc [2] TAMIL VOWEL SIGN AA..TAMIL VOWEL SIGN I
|
|
477
|
+
0BC0 ; Vowel_Dependent # Mn TAMIL VOWEL SIGN II
|
|
478
|
+
0BC1..0BC2 ; Vowel_Dependent # Mc [2] TAMIL VOWEL SIGN U..TAMIL VOWEL SIGN UU
|
|
479
|
+
0BC6..0BC8 ; Vowel_Dependent # Mc [3] TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
|
|
480
|
+
0BCA..0BCC ; Vowel_Dependent # Mc [3] TAMIL VOWEL SIGN O..TAMIL VOWEL SIGN AU
|
|
481
|
+
0BD7 ; Vowel_Dependent # Mc TAMIL AU LENGTH MARK
|
|
482
|
+
0C3E..0C40 ; Vowel_Dependent # Mn [3] TELUGU VOWEL SIGN AA..TELUGU VOWEL SIGN II
|
|
483
|
+
0C41..0C44 ; Vowel_Dependent # Mc [4] TELUGU VOWEL SIGN U..TELUGU VOWEL SIGN VOCALIC RR
|
|
484
|
+
0C46..0C48 ; Vowel_Dependent # Mn [3] TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI
|
|
485
|
+
0C4A..0C4C ; Vowel_Dependent # Mn [3] TELUGU VOWEL SIGN O..TELUGU VOWEL SIGN AU
|
|
486
|
+
0C55..0C56 ; Vowel_Dependent # Mn [2] TELUGU LENGTH MARK..TELUGU AI LENGTH MARK
|
|
487
|
+
0C62..0C63 ; Vowel_Dependent # Mn [2] TELUGU VOWEL SIGN VOCALIC L..TELUGU VOWEL SIGN VOCALIC LL
|
|
488
|
+
0CBE ; Vowel_Dependent # Mc KANNADA VOWEL SIGN AA
|
|
489
|
+
0CBF ; Vowel_Dependent # Mn KANNADA VOWEL SIGN I
|
|
490
|
+
0CC0..0CC4 ; Vowel_Dependent # Mc [5] KANNADA VOWEL SIGN II..KANNADA VOWEL SIGN VOCALIC RR
|
|
491
|
+
0CC6 ; Vowel_Dependent # Mn KANNADA VOWEL SIGN E
|
|
492
|
+
0CC7..0CC8 ; Vowel_Dependent # Mc [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI
|
|
493
|
+
0CCA..0CCB ; Vowel_Dependent # Mc [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO
|
|
494
|
+
0CCC ; Vowel_Dependent # Mn KANNADA VOWEL SIGN AU
|
|
495
|
+
0CD5..0CD6 ; Vowel_Dependent # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
|
|
496
|
+
0CE2..0CE3 ; Vowel_Dependent # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
|
|
497
|
+
0D3E..0D40 ; Vowel_Dependent # Mc [3] MALAYALAM VOWEL SIGN AA..MALAYALAM VOWEL SIGN II
|
|
498
|
+
0D41..0D44 ; Vowel_Dependent # Mn [4] MALAYALAM VOWEL SIGN U..MALAYALAM VOWEL SIGN VOCALIC RR
|
|
499
|
+
0D46..0D48 ; Vowel_Dependent # Mc [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI
|
|
500
|
+
0D4A..0D4C ; Vowel_Dependent # Mc [3] MALAYALAM VOWEL SIGN O..MALAYALAM VOWEL SIGN AU
|
|
501
|
+
0D57 ; Vowel_Dependent # Mc MALAYALAM AU LENGTH MARK
|
|
502
|
+
0D62..0D63 ; Vowel_Dependent # Mn [2] MALAYALAM VOWEL SIGN VOCALIC L..MALAYALAM VOWEL SIGN VOCALIC LL
|
|
503
|
+
0DCF..0DD1 ; Vowel_Dependent # Mc [3] SINHALA VOWEL SIGN AELA-PILLA..SINHALA VOWEL SIGN DIGA AEDA-PILLA
|
|
504
|
+
0DD2..0DD4 ; Vowel_Dependent # Mn [3] SINHALA VOWEL SIGN KETTI IS-PILLA..SINHALA VOWEL SIGN KETTI PAA-PILLA
|
|
505
|
+
0DD6 ; Vowel_Dependent # Mn SINHALA VOWEL SIGN DIGA PAA-PILLA
|
|
506
|
+
0DD8..0DDF ; Vowel_Dependent # Mc [8] SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOWEL SIGN GAYANUKITTA
|
|
507
|
+
0DF2..0DF3 ; Vowel_Dependent # Mc [2] SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHALA VOWEL SIGN DIGA GAYANUKITTA
|
|
508
|
+
0E30 ; Vowel_Dependent # Lo THAI CHARACTER SARA A
|
|
509
|
+
0E31 ; Vowel_Dependent # Mn THAI CHARACTER MAI HAN-AKAT
|
|
510
|
+
0E32..0E33 ; Vowel_Dependent # Lo [2] THAI CHARACTER SARA AA..THAI CHARACTER SARA AM
|
|
511
|
+
0E34..0E39 ; Vowel_Dependent # Mn [6] THAI CHARACTER SARA I..THAI CHARACTER SARA UU
|
|
512
|
+
0E40..0E45 ; Vowel_Dependent # Lo [6] THAI CHARACTER SARA E..THAI CHARACTER LAKKHANGYAO
|
|
513
|
+
0E47 ; Vowel_Dependent # Mn THAI CHARACTER MAITAIKHU
|
|
514
|
+
0EB0 ; Vowel_Dependent # Lo LAO VOWEL SIGN A
|
|
515
|
+
0EB1 ; Vowel_Dependent # Mn LAO VOWEL SIGN MAI KAN
|
|
516
|
+
0EB2..0EB3 ; Vowel_Dependent # Lo [2] LAO VOWEL SIGN AA..LAO VOWEL SIGN AM
|
|
517
|
+
0EB4..0EB9 ; Vowel_Dependent # Mn [6] LAO VOWEL SIGN I..LAO VOWEL SIGN UU
|
|
518
|
+
0EBB ; Vowel_Dependent # Mn LAO VOWEL SIGN MAI KON
|
|
519
|
+
0EC0..0EC4 ; Vowel_Dependent # Lo [5] LAO VOWEL SIGN E..LAO VOWEL SIGN AI
|
|
520
|
+
0F71..0F7D ; Vowel_Dependent # Mn [13] TIBETAN VOWEL SIGN AA..TIBETAN VOWEL SIGN OO
|
|
521
|
+
0F80..0F81 ; Vowel_Dependent # Mn [2] TIBETAN VOWEL SIGN REVERSED I..TIBETAN VOWEL SIGN REVERSED II
|
|
522
|
+
102B..102C ; Vowel_Dependent # Mc [2] MYANMAR VOWEL SIGN TALL AA..MYANMAR VOWEL SIGN AA
|
|
523
|
+
102D..1030 ; Vowel_Dependent # Mn [4] MYANMAR VOWEL SIGN I..MYANMAR VOWEL SIGN UU
|
|
524
|
+
1031 ; Vowel_Dependent # Mc MYANMAR VOWEL SIGN E
|
|
525
|
+
1032..1035 ; Vowel_Dependent # Mn [4] MYANMAR VOWEL SIGN AI..MYANMAR VOWEL SIGN E ABOVE
|
|
526
|
+
1056..1057 ; Vowel_Dependent # Mc [2] MYANMAR VOWEL SIGN VOCALIC R..MYANMAR VOWEL SIGN VOCALIC RR
|
|
527
|
+
1058..1059 ; Vowel_Dependent # Mn [2] MYANMAR VOWEL SIGN VOCALIC L..MYANMAR VOWEL SIGN VOCALIC LL
|
|
528
|
+
1062 ; Vowel_Dependent # Mc MYANMAR VOWEL SIGN SGAW KAREN EU
|
|
529
|
+
1067..1068 ; Vowel_Dependent # Mc [2] MYANMAR VOWEL SIGN WESTERN PWO KAREN EU..MYANMAR VOWEL SIGN WESTERN PWO KAREN UE
|
|
530
|
+
1071..1074 ; Vowel_Dependent # Mn [4] MYANMAR VOWEL SIGN GEBA KAREN I..MYANMAR VOWEL SIGN KAYAH EE
|
|
531
|
+
1083..1084 ; Vowel_Dependent # Mc [2] MYANMAR VOWEL SIGN SHAN AA..MYANMAR VOWEL SIGN SHAN E
|
|
532
|
+
1085..1086 ; Vowel_Dependent # Mn [2] MYANMAR VOWEL SIGN SHAN E ABOVE..MYANMAR VOWEL SIGN SHAN FINAL Y
|
|
533
|
+
109C ; Vowel_Dependent # Mc MYANMAR VOWEL SIGN AITON A
|
|
534
|
+
109D ; Vowel_Dependent # Mn MYANMAR VOWEL SIGN AITON AI
|
|
535
|
+
1712..1713 ; Vowel_Dependent # Mn [2] TAGALOG VOWEL SIGN I..TAGALOG VOWEL SIGN U
|
|
536
|
+
1732..1733 ; Vowel_Dependent # Mn [2] HANUNOO VOWEL SIGN I..HANUNOO VOWEL SIGN U
|
|
537
|
+
1752..1753 ; Vowel_Dependent # Mn [2] BUHID VOWEL SIGN I..BUHID VOWEL SIGN U
|
|
538
|
+
1772..1773 ; Vowel_Dependent # Mn [2] TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U
|
|
539
|
+
17B6 ; Vowel_Dependent # Mc KHMER VOWEL SIGN AA
|
|
540
|
+
17B7..17BD ; Vowel_Dependent # Mn [7] KHMER VOWEL SIGN I..KHMER VOWEL SIGN UA
|
|
541
|
+
17BE..17C5 ; Vowel_Dependent # Mc [8] KHMER VOWEL SIGN OE..KHMER VOWEL SIGN AU
|
|
542
|
+
17C8 ; Vowel_Dependent # Mc KHMER SIGN YUUKALEAPINTU
|
|
543
|
+
1920..1922 ; Vowel_Dependent # Mn [3] LIMBU VOWEL SIGN A..LIMBU VOWEL SIGN U
|
|
544
|
+
1923..1926 ; Vowel_Dependent # Mc [4] LIMBU VOWEL SIGN EE..LIMBU VOWEL SIGN AU
|
|
545
|
+
1927..1928 ; Vowel_Dependent # Mn [2] LIMBU VOWEL SIGN E..LIMBU VOWEL SIGN O
|
|
546
|
+
193A ; Vowel_Dependent # Mn LIMBU SIGN KEMPHRENG
|
|
547
|
+
19B0..19C0 ; Vowel_Dependent # Lo [17] NEW TAI LUE VOWEL SIGN VOWEL SHORTENER..NEW TAI LUE VOWEL SIGN IY
|
|
548
|
+
1A17..1A18 ; Vowel_Dependent # Mn [2] BUGINESE VOWEL SIGN I..BUGINESE VOWEL SIGN U
|
|
549
|
+
1A19..1A1A ; Vowel_Dependent # Mc [2] BUGINESE VOWEL SIGN E..BUGINESE VOWEL SIGN O
|
|
550
|
+
1A1B ; Vowel_Dependent # Mn BUGINESE VOWEL SIGN AE
|
|
551
|
+
1A61 ; Vowel_Dependent # Mc TAI THAM VOWEL SIGN A
|
|
552
|
+
1A62 ; Vowel_Dependent # Mn TAI THAM VOWEL SIGN MAI SAT
|
|
553
|
+
1A63..1A64 ; Vowel_Dependent # Mc [2] TAI THAM VOWEL SIGN AA..TAI THAM VOWEL SIGN TALL AA
|
|
554
|
+
1A65..1A6C ; Vowel_Dependent # Mn [8] TAI THAM VOWEL SIGN I..TAI THAM VOWEL SIGN OA BELOW
|
|
555
|
+
1A6D..1A72 ; Vowel_Dependent # Mc [6] TAI THAM VOWEL SIGN OY..TAI THAM VOWEL SIGN THAM AI
|
|
556
|
+
1A73 ; Vowel_Dependent # Mn TAI THAM VOWEL SIGN OA ABOVE
|
|
557
|
+
1B35 ; Vowel_Dependent # Mc BALINESE VOWEL SIGN TEDUNG
|
|
558
|
+
1B36..1B3A ; Vowel_Dependent # Mn [5] BALINESE VOWEL SIGN ULU..BALINESE VOWEL SIGN RA REPA
|
|
559
|
+
1B3B ; Vowel_Dependent # Mc BALINESE VOWEL SIGN RA REPA TEDUNG
|
|
560
|
+
1B3C ; Vowel_Dependent # Mn BALINESE VOWEL SIGN LA LENGA
|
|
561
|
+
1B3D..1B41 ; Vowel_Dependent # Mc [5] BALINESE VOWEL SIGN LA LENGA TEDUNG..BALINESE VOWEL SIGN TALING REPA TEDUNG
|
|
562
|
+
1B42 ; Vowel_Dependent # Mn BALINESE VOWEL SIGN PEPET
|
|
563
|
+
1B43 ; Vowel_Dependent # Mc BALINESE VOWEL SIGN PEPET TEDUNG
|
|
564
|
+
1BA4..1BA5 ; Vowel_Dependent # Mn [2] SUNDANESE VOWEL SIGN PANGHULU..SUNDANESE VOWEL SIGN PANYUKU
|
|
565
|
+
1BA6..1BA7 ; Vowel_Dependent # Mc [2] SUNDANESE VOWEL SIGN PANAELAENG..SUNDANESE VOWEL SIGN PANOLONG
|
|
566
|
+
1BA8..1BA9 ; Vowel_Dependent # Mn [2] SUNDANESE VOWEL SIGN PAMEPET..SUNDANESE VOWEL SIGN PANEULEUNG
|
|
567
|
+
1BE7 ; Vowel_Dependent # Mc BATAK VOWEL SIGN E
|
|
568
|
+
1BE8..1BE9 ; Vowel_Dependent # Mn [2] BATAK VOWEL SIGN PAKPAK E..BATAK VOWEL SIGN EE
|
|
569
|
+
1BEA..1BEC ; Vowel_Dependent # Mc [3] BATAK VOWEL SIGN I..BATAK VOWEL SIGN O
|
|
570
|
+
1BED ; Vowel_Dependent # Mn BATAK VOWEL SIGN KARO O
|
|
571
|
+
1BEE ; Vowel_Dependent # Mc BATAK VOWEL SIGN U
|
|
572
|
+
1BEF ; Vowel_Dependent # Mn BATAK VOWEL SIGN U FOR SIMALUNGUN SA
|
|
573
|
+
1C26..1C2B ; Vowel_Dependent # Mc [6] LEPCHA VOWEL SIGN AA..LEPCHA VOWEL SIGN UU
|
|
574
|
+
1C2C ; Vowel_Dependent # Mn LEPCHA VOWEL SIGN E
|
|
575
|
+
A802 ; Vowel_Dependent # Mn SYLOTI NAGRI SIGN DVISVARA
|
|
576
|
+
A823..A824 ; Vowel_Dependent # Mc [2] SYLOTI NAGRI VOWEL SIGN A..SYLOTI NAGRI VOWEL SIGN I
|
|
577
|
+
A825..A826 ; Vowel_Dependent # Mn [2] SYLOTI NAGRI VOWEL SIGN U..SYLOTI NAGRI VOWEL SIGN E
|
|
578
|
+
A827 ; Vowel_Dependent # Mc SYLOTI NAGRI VOWEL SIGN OO
|
|
579
|
+
A8B5..A8C3 ; Vowel_Dependent # Mc [15] SAURASHTRA VOWEL SIGN AA..SAURASHTRA VOWEL SIGN AU
|
|
580
|
+
A8FF ; Vowel_Dependent # Mn DEVANAGARI VOWEL SIGN AY
|
|
581
|
+
A947..A94E ; Vowel_Dependent # Mn [8] REJANG VOWEL SIGN I..REJANG VOWEL SIGN EA
|
|
582
|
+
A9B4..A9B5 ; Vowel_Dependent # Mc [2] JAVANESE VOWEL SIGN TARUNG..JAVANESE VOWEL SIGN TOLONG
|
|
583
|
+
A9B6..A9B9 ; Vowel_Dependent # Mn [4] JAVANESE VOWEL SIGN WULU..JAVANESE VOWEL SIGN SUKU MENDUT
|
|
584
|
+
A9BA..A9BB ; Vowel_Dependent # Mc [2] JAVANESE VOWEL SIGN TALING..JAVANESE VOWEL SIGN DIRGA MURE
|
|
585
|
+
A9BC ; Vowel_Dependent # Mn JAVANESE VOWEL SIGN PEPET
|
|
586
|
+
A9E5 ; Vowel_Dependent # Mn MYANMAR SIGN SHAN SAW
|
|
587
|
+
AA29..AA2E ; Vowel_Dependent # Mn [6] CHAM VOWEL SIGN AA..CHAM VOWEL SIGN OE
|
|
588
|
+
AA2F..AA30 ; Vowel_Dependent # Mc [2] CHAM VOWEL SIGN O..CHAM VOWEL SIGN AI
|
|
589
|
+
AA31..AA32 ; Vowel_Dependent # Mn [2] CHAM VOWEL SIGN AU..CHAM VOWEL SIGN UE
|
|
590
|
+
AAB0 ; Vowel_Dependent # Mn TAI VIET MAI KANG
|
|
591
|
+
AAB1 ; Vowel_Dependent # Lo TAI VIET VOWEL AA
|
|
592
|
+
AAB2..AAB4 ; Vowel_Dependent # Mn [3] TAI VIET VOWEL I..TAI VIET VOWEL U
|
|
593
|
+
AAB5..AAB6 ; Vowel_Dependent # Lo [2] TAI VIET VOWEL E..TAI VIET VOWEL O
|
|
594
|
+
AAB7..AAB8 ; Vowel_Dependent # Mn [2] TAI VIET MAI KHIT..TAI VIET VOWEL IA
|
|
595
|
+
AAB9..AABD ; Vowel_Dependent # Lo [5] TAI VIET VOWEL UEA..TAI VIET VOWEL AN
|
|
596
|
+
AABE ; Vowel_Dependent # Mn TAI VIET VOWEL AM
|
|
597
|
+
AAEB ; Vowel_Dependent # Mc MEETEI MAYEK VOWEL SIGN II
|
|
598
|
+
AAEC..AAED ; Vowel_Dependent # Mn [2] MEETEI MAYEK VOWEL SIGN UU..MEETEI MAYEK VOWEL SIGN AAI
|
|
599
|
+
AAEE..AAEF ; Vowel_Dependent # Mc [2] MEETEI MAYEK VOWEL SIGN AU..MEETEI MAYEK VOWEL SIGN AAU
|
|
600
|
+
ABE3..ABE4 ; Vowel_Dependent # Mc [2] MEETEI MAYEK VOWEL SIGN ONAP..MEETEI MAYEK VOWEL SIGN INAP
|
|
601
|
+
ABE5 ; Vowel_Dependent # Mn MEETEI MAYEK VOWEL SIGN ANAP
|
|
602
|
+
ABE6..ABE7 ; Vowel_Dependent # Mc [2] MEETEI MAYEK VOWEL SIGN YENAP..MEETEI MAYEK VOWEL SIGN SOUNAP
|
|
603
|
+
ABE8 ; Vowel_Dependent # Mn MEETEI MAYEK VOWEL SIGN UNAP
|
|
604
|
+
ABE9..ABEA ; Vowel_Dependent # Mc [2] MEETEI MAYEK VOWEL SIGN CHEINAP..MEETEI MAYEK VOWEL SIGN NUNG
|
|
605
|
+
10A01..10A03 ; Vowel_Dependent # Mn [3] KHAROSHTHI VOWEL SIGN I..KHAROSHTHI VOWEL SIGN VOCALIC R
|
|
606
|
+
10A05..10A06 ; Vowel_Dependent # Mn [2] KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SIGN O
|
|
607
|
+
10A0C..10A0D ; Vowel_Dependent # Mn [2] KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI SIGN DOUBLE RING BELOW
|
|
608
|
+
11038..11045 ; Vowel_Dependent # Mn [14] BRAHMI VOWEL SIGN AA..BRAHMI VOWEL SIGN AU
|
|
609
|
+
110B0..110B2 ; Vowel_Dependent # Mc [3] KAITHI VOWEL SIGN AA..KAITHI VOWEL SIGN II
|
|
610
|
+
110B3..110B6 ; Vowel_Dependent # Mn [4] KAITHI VOWEL SIGN U..KAITHI VOWEL SIGN AI
|
|
611
|
+
110B7..110B8 ; Vowel_Dependent # Mc [2] KAITHI VOWEL SIGN O..KAITHI VOWEL SIGN AU
|
|
612
|
+
11127..1112B ; Vowel_Dependent # Mn [5] CHAKMA VOWEL SIGN A..CHAKMA VOWEL SIGN UU
|
|
613
|
+
1112C ; Vowel_Dependent # Mc CHAKMA VOWEL SIGN E
|
|
614
|
+
1112D..11132 ; Vowel_Dependent # Mn [6] CHAKMA VOWEL SIGN AI..CHAKMA AU MARK
|
|
615
|
+
11145..11146 ; Vowel_Dependent # Mc [2] CHAKMA VOWEL SIGN AA..CHAKMA VOWEL SIGN EI
|
|
616
|
+
111B3..111B5 ; Vowel_Dependent # Mc [3] SHARADA VOWEL SIGN AA..SHARADA VOWEL SIGN II
|
|
617
|
+
111B6..111BE ; Vowel_Dependent # Mn [9] SHARADA VOWEL SIGN U..SHARADA VOWEL SIGN O
|
|
618
|
+
111BF ; Vowel_Dependent # Mc SHARADA VOWEL SIGN AU
|
|
619
|
+
111CB..111CC ; Vowel_Dependent # Mn [2] SHARADA VOWEL MODIFIER MARK..SHARADA EXTRA SHORT VOWEL MARK
|
|
620
|
+
1122C..1122E ; Vowel_Dependent # Mc [3] KHOJKI VOWEL SIGN AA..KHOJKI VOWEL SIGN II
|
|
621
|
+
1122F..11231 ; Vowel_Dependent # Mn [3] KHOJKI VOWEL SIGN U..KHOJKI VOWEL SIGN AI
|
|
622
|
+
11232..11233 ; Vowel_Dependent # Mc [2] KHOJKI VOWEL SIGN O..KHOJKI VOWEL SIGN AU
|
|
623
|
+
112E0..112E2 ; Vowel_Dependent # Mc [3] KHUDAWADI VOWEL SIGN AA..KHUDAWADI VOWEL SIGN II
|
|
624
|
+
112E3..112E8 ; Vowel_Dependent # Mn [6] KHUDAWADI VOWEL SIGN U..KHUDAWADI VOWEL SIGN AU
|
|
625
|
+
1133E..1133F ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN AA..GRANTHA VOWEL SIGN I
|
|
626
|
+
11340 ; Vowel_Dependent # Mn GRANTHA VOWEL SIGN II
|
|
627
|
+
11341..11344 ; Vowel_Dependent # Mc [4] GRANTHA VOWEL SIGN U..GRANTHA VOWEL SIGN VOCALIC RR
|
|
628
|
+
11347..11348 ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN EE..GRANTHA VOWEL SIGN AI
|
|
629
|
+
1134B..1134C ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN OO..GRANTHA VOWEL SIGN AU
|
|
630
|
+
11357 ; Vowel_Dependent # Mc GRANTHA AU LENGTH MARK
|
|
631
|
+
11362..11363 ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN VOCALIC L..GRANTHA VOWEL SIGN VOCALIC LL
|
|
632
|
+
11435..11437 ; Vowel_Dependent # Mc [3] NEWA VOWEL SIGN AA..NEWA VOWEL SIGN II
|
|
633
|
+
11438..1143F ; Vowel_Dependent # Mn [8] NEWA VOWEL SIGN U..NEWA VOWEL SIGN AI
|
|
634
|
+
11440..11441 ; Vowel_Dependent # Mc [2] NEWA VOWEL SIGN O..NEWA VOWEL SIGN AU
|
|
635
|
+
114B0..114B2 ; Vowel_Dependent # Mc [3] TIRHUTA VOWEL SIGN AA..TIRHUTA VOWEL SIGN II
|
|
636
|
+
114B3..114B8 ; Vowel_Dependent # Mn [6] TIRHUTA VOWEL SIGN U..TIRHUTA VOWEL SIGN VOCALIC LL
|
|
637
|
+
114B9 ; Vowel_Dependent # Mc TIRHUTA VOWEL SIGN E
|
|
638
|
+
114BA ; Vowel_Dependent # Mn TIRHUTA VOWEL SIGN SHORT E
|
|
639
|
+
114BB..114BE ; Vowel_Dependent # Mc [4] TIRHUTA VOWEL SIGN AI..TIRHUTA VOWEL SIGN AU
|
|
640
|
+
115AF..115B1 ; Vowel_Dependent # Mc [3] SIDDHAM VOWEL SIGN AA..SIDDHAM VOWEL SIGN II
|
|
641
|
+
115B2..115B5 ; Vowel_Dependent # Mn [4] SIDDHAM VOWEL SIGN U..SIDDHAM VOWEL SIGN VOCALIC RR
|
|
642
|
+
115B8..115BB ; Vowel_Dependent # Mc [4] SIDDHAM VOWEL SIGN E..SIDDHAM VOWEL SIGN AU
|
|
643
|
+
115DC..115DD ; Vowel_Dependent # Mn [2] SIDDHAM VOWEL SIGN ALTERNATE U..SIDDHAM VOWEL SIGN ALTERNATE UU
|
|
644
|
+
11630..11632 ; Vowel_Dependent # Mc [3] MODI VOWEL SIGN AA..MODI VOWEL SIGN II
|
|
645
|
+
11633..1163A ; Vowel_Dependent # Mn [8] MODI VOWEL SIGN U..MODI VOWEL SIGN AI
|
|
646
|
+
1163B..1163C ; Vowel_Dependent # Mc [2] MODI VOWEL SIGN O..MODI VOWEL SIGN AU
|
|
647
|
+
11640 ; Vowel_Dependent # Mn MODI SIGN ARDHACANDRA
|
|
648
|
+
116AD ; Vowel_Dependent # Mn TAKRI VOWEL SIGN AA
|
|
649
|
+
116AE..116AF ; Vowel_Dependent # Mc [2] TAKRI VOWEL SIGN I..TAKRI VOWEL SIGN II
|
|
650
|
+
116B0..116B5 ; Vowel_Dependent # Mn [6] TAKRI VOWEL SIGN U..TAKRI VOWEL SIGN AU
|
|
651
|
+
11720..11721 ; Vowel_Dependent # Mc [2] AHOM VOWEL SIGN A..AHOM VOWEL SIGN AA
|
|
652
|
+
11722..11725 ; Vowel_Dependent # Mn [4] AHOM VOWEL SIGN I..AHOM VOWEL SIGN UU
|
|
653
|
+
11726 ; Vowel_Dependent # Mc AHOM VOWEL SIGN E
|
|
654
|
+
11727..1172A ; Vowel_Dependent # Mn [4] AHOM VOWEL SIGN AW..AHOM VOWEL SIGN AM
|
|
655
|
+
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
|
|
656
|
+
1182F..11836 ; Vowel_Dependent # Mn [8] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
|
|
657
|
+
119D1..119D3 ; Vowel_Dependent # Mc [3] NANDINAGARI VOWEL SIGN AA..NANDINAGARI VOWEL SIGN II
|
|
658
|
+
119D4..119D7 ; Vowel_Dependent # Mn [4] NANDINAGARI VOWEL SIGN U..NANDINAGARI VOWEL SIGN VOCALIC RR
|
|
659
|
+
119DA..119DB ; Vowel_Dependent # Mn [2] NANDINAGARI VOWEL SIGN E..NANDINAGARI VOWEL SIGN AI
|
|
660
|
+
119DC..119DD ; Vowel_Dependent # Mc [2] NANDINAGARI VOWEL SIGN O..NANDINAGARI VOWEL SIGN AU
|
|
661
|
+
119E4 ; Vowel_Dependent # Mc NANDINAGARI VOWEL SIGN PRISHTHAMATRA E
|
|
662
|
+
11A01..11A0A ; Vowel_Dependent # Mn [10] ZANABAZAR SQUARE VOWEL SIGN I..ZANABAZAR SQUARE VOWEL LENGTH MARK
|
|
663
|
+
11A51..11A56 ; Vowel_Dependent # Mn [6] SOYOMBO VOWEL SIGN I..SOYOMBO VOWEL SIGN OE
|
|
664
|
+
11A57..11A58 ; Vowel_Dependent # Mc [2] SOYOMBO VOWEL SIGN AI..SOYOMBO VOWEL SIGN AU
|
|
665
|
+
11A59..11A5B ; Vowel_Dependent # Mn [3] SOYOMBO VOWEL SIGN VOCALIC R..SOYOMBO VOWEL LENGTH MARK
|
|
666
|
+
11C2F ; Vowel_Dependent # Mc BHAIKSUKI VOWEL SIGN AA
|
|
667
|
+
11C30..11C36 ; Vowel_Dependent # Mn [7] BHAIKSUKI VOWEL SIGN I..BHAIKSUKI VOWEL SIGN VOCALIC L
|
|
668
|
+
11C38..11C3B ; Vowel_Dependent # Mn [4] BHAIKSUKI VOWEL SIGN E..BHAIKSUKI VOWEL SIGN AU
|
|
669
|
+
11CB0 ; Vowel_Dependent # Mn MARCHEN VOWEL SIGN AA
|
|
670
|
+
11CB1 ; Vowel_Dependent # Mc MARCHEN VOWEL SIGN I
|
|
671
|
+
11CB2..11CB3 ; Vowel_Dependent # Mn [2] MARCHEN VOWEL SIGN U..MARCHEN VOWEL SIGN E
|
|
672
|
+
11CB4 ; Vowel_Dependent # Mc MARCHEN VOWEL SIGN O
|
|
673
|
+
11D31..11D36 ; Vowel_Dependent # Mn [6] MASARAM GONDI VOWEL SIGN AA..MASARAM GONDI VOWEL SIGN VOCALIC R
|
|
674
|
+
11D3A ; Vowel_Dependent # Mn MASARAM GONDI VOWEL SIGN E
|
|
675
|
+
11D3C..11D3D ; Vowel_Dependent # Mn [2] MASARAM GONDI VOWEL SIGN AI..MASARAM GONDI VOWEL SIGN O
|
|
676
|
+
11D3F ; Vowel_Dependent # Mn MASARAM GONDI VOWEL SIGN AU
|
|
677
|
+
11D43 ; Vowel_Dependent # Mn MASARAM GONDI SIGN CANDRA
|
|
678
|
+
11D8A..11D8E ; Vowel_Dependent # Mc [5] GUNJALA GONDI VOWEL SIGN AA..GUNJALA GONDI VOWEL SIGN UU
|
|
679
|
+
11D90..11D91 ; Vowel_Dependent # Mn [2] GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI VOWEL SIGN AI
|
|
680
|
+
11D93..11D94 ; Vowel_Dependent # Mc [2] GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI VOWEL SIGN AU
|
|
681
|
+
11EF3..11EF4 ; Vowel_Dependent # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
|
|
682
|
+
11EF5..11EF6 ; Vowel_Dependent # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
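Each data line above follows the same shape: a code point or `start..end` range, a semicolon, the property value, and a `#` comment that begins with the General_Category abbreviation. The following is a minimal sketch of reading one such line, assuming that layout. The names `IscEntry` and `parseIscLine` are hypothetical and are not taken from the package's `src/parser.ts`.

```typescript
// Hypothetical helper for illustration only; not the package's parser.
interface IscEntry {
  start: number; // first code point of the range
  end: number; // last code point (equals start for a single code point)
  category: string; // Indic_Syllabic_Category value, e.g. "Vowel_Dependent"
  generalCategory: string; // General_Category abbreviation from the comment, e.g. "Mc"
}

function parseIscLine(line: string): IscEntry | null {
  // Blank lines and pure comment lines ("# ...") do not match and yield null.
  const match =
    /^([0-9A-F]{4,6})(?:\.\.([0-9A-F]{4,6}))?\s*;\s*(\w+)\s*#\s*(\w+)/.exec(line);
  if (match === null) return null;
  const [, startHex = "", endHex, category = "", generalCategory = ""] = match;
  const start = parseInt(startHex, 16);
  // A single code point has no "..end" part, so the range collapses to start.
  const end = endHex !== undefined ? parseInt(endHex, 16) : start;
  return { start, end, category, generalCategory };
}

const entry = parseIscLine(
  "0ACB..0ACC ; Vowel_Dependent # Mc [2] GUJARATI VOWEL SIGN O..GUJARATI VOWEL SIGN AU",
);
console.log(entry);
```

Returning `null` for non-data lines lets a caller filter comments and separators without a separate pre-pass.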
|
|
683
|
+
|
|
684
|
+
# ================================================
|
|
685
|
+
|
|
686
|
+
# Indic_Syllabic_Category=Vowel
|
|
687
|
+
|
|
688
|
+
# (Other) Vowels (reanalyzed as ordinary alphabetic letters or marks)
|
|
689
|
+
|
|
690
|
+
# [Not derivable]
|
|
691
|
+
|
|
692
|
+
1963..196D ; Vowel # Lo [11] TAI LE LETTER A..TAI LE LETTER AI
|
|
693
|
+
A85E..A861 ; Vowel # Lo [4] PHAGS-PA LETTER I..PHAGS-PA LETTER O
|
|
694
|
+
A866 ; Vowel # Lo PHAGS-PA LETTER EE
|
|
695
|
+
A922..A925 ; Vowel # Lo [4] KAYAH LI LETTER A..KAYAH LI LETTER OO
|
|
696
|
+
A926..A92A ; Vowel # Mn [5] KAYAH LI VOWEL UE..KAYAH LI VOWEL O
|
|
697
|
+
11150..11154 ; Vowel # Lo [5] MAHAJANI LETTER A..MAHAJANI LETTER O
|
|
698
|
+
|
|
699
|
+
# ================================================
|
|
700
|
+
|
|
701
|
+
# Indic_Syllabic_Category=Consonant_Placeholder
|
|
702
|
+
|
|
703
|
+
# Consonant Placeholder
|
|
704
|
+
# This includes generic placeholders used for
|
|
705
|
+
# Indic script layout (NBSP and dotted circle), as well as a few script-
|
|
706
|
+
# specific vowel-holder characters which are not technically
|
|
707
|
+
# consonants, but serve instead as bases for placement of vowel marks.
|
|
708
|
+
|
|
709
|
+
# [Not derivable]
|
|
710
|
+
|
|
711
|
+
002D ; Consonant_Placeholder # Pd HYPHEN-MINUS
|
|
712
|
+
00A0 ; Consonant_Placeholder # Zs NO-BREAK SPACE
|
|
713
|
+
00D7 ; Consonant_Placeholder # Sm MULTIPLICATION SIGN
|
|
714
|
+
0980 ; Consonant_Placeholder # Lo BENGALI ANJI
|
|
715
|
+
0A72..0A73 ; Consonant_Placeholder # Lo [2] GURMUKHI IRI..GURMUKHI URA
|
|
716
|
+
104B ; Consonant_Placeholder # Po MYANMAR SIGN SECTION
|
|
717
|
+
104E ; Consonant_Placeholder # Po MYANMAR SYMBOL AFOREMENTIONED
|
|
718
|
+
1900 ; Consonant_Placeholder # Lo LIMBU VOWEL-CARRIER LETTER
|
|
719
|
+
1CFA ; Consonant_Placeholder # Lo VEDIC SIGN DOUBLE ANUSVARA ANTARGOMUKHA
|
|
720
|
+
2010..2014 ; Consonant_Placeholder # Pd [5] HYPHEN..EM DASH
|
|
721
|
+
25CC ; Consonant_Placeholder # So DOTTED CIRCLE
|
|
722
|
+
AA74..AA76 ; Consonant_Placeholder # Lo [3] MYANMAR LOGOGRAM KHAMTI OAY..MYANMAR LOGOGRAM KHAMTI HM
|
|
723
|
+
11A3F ; Consonant_Placeholder # Po ZANABAZAR SQUARE INITIAL HEAD MARK
|
|
724
|
+
11A45 ; Consonant_Placeholder # Po ZANABAZAR SQUARE INITIAL DOUBLE-LINED HEAD MARK
|
|
725
|
+
11EF2 ; Consonant_Placeholder # Lo MAKASAR ANGKA
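The Consonant_Placeholder entries above can serve as bases for vowel marks even though most are not consonants. As a rough illustration, the sketch below checks whether a code point falls in that set by binary search over the ranges copied from this section. The constant and function names are hypothetical, and the table is hard-coded here only to keep the example self-contained.

```typescript
// Ranges copied from the Consonant_Placeholder section above, sorted by start.
const CONSONANT_PLACEHOLDER_RANGES: ReadonlyArray<readonly [number, number]> = [
  [0x002d, 0x002d], [0x00a0, 0x00a0], [0x00d7, 0x00d7], [0x0980, 0x0980],
  [0x0a72, 0x0a73], [0x104b, 0x104b], [0x104e, 0x104e], [0x1900, 0x1900],
  [0x1cfa, 0x1cfa], [0x2010, 0x2014], [0x25cc, 0x25cc], [0xaa74, 0xaa76],
  [0x11a3f, 0x11a3f], [0x11a45, 0x11a45], [0x11ef2, 0x11ef2],
];

// Hypothetical helper: binary search over sorted, non-overlapping ranges.
function isConsonantPlaceholder(codePoint: number): boolean {
  let lo = 0;
  let hi = CONSONANT_PLACEHOLDER_RANGES.length - 1;
  while (lo <= hi) {
    const mid = (lo + hi) >>> 1;
    const range = CONSONANT_PLACEHOLDER_RANGES[mid];
    if (range === undefined) return false; // unreachable; satisfies strict index checks
    const [start, end] = range;
    if (codePoint < start) hi = mid - 1;
    else if (codePoint > end) lo = mid + 1;
    else return true;
  }
  return false;
}

console.log(isConsonantPlaceholder(0x25cc)); // true: DOTTED CIRCLE
console.log(isConsonantPlaceholder(0x0915)); // false: DEVANAGARI LETTER KA is listed under Consonant
```

Binary search is used because the file lists ranges in ascending order without overlap; for a table this small a linear scan would work equally well.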
|
|
726
|
+
|
|
727
|
+
# ================================================
|
|
728
|
+
|
|
729
|
+
# Indic_Syllabic_Category=Consonant
|
|
730
|
+
|
|
731
|
+
# Consonant (ordinary abugida consonants, with inherent vowels)
|
|
732
|
+
|
|
733
|
+
# [Not derivable]
|
|
734
|
+
|
|
735
|
+
0915..0939 ; Consonant # Lo [37] DEVANAGARI LETTER KA..DEVANAGARI LETTER HA
|
|
736
|
+
0958..095F ; Consonant # Lo [8] DEVANAGARI LETTER QA..DEVANAGARI LETTER YYA
|
|
737
|
+
0978..097F ; Consonant # Lo [8] DEVANAGARI LETTER MARWARI DDA..DEVANAGARI LETTER BBA
|
|
738
|
+
0995..09A8 ; Consonant # Lo [20] BENGALI LETTER KA..BENGALI LETTER NA
|
|
739
|
+
09AA..09B0 ; Consonant # Lo [7] BENGALI LETTER PA..BENGALI LETTER RA
|
|
740
|
+
09B2 ; Consonant # Lo BENGALI LETTER LA
|
|
741
|
+
09B6..09B9 ; Consonant # Lo [4] BENGALI LETTER SHA..BENGALI LETTER HA
|
|
742
|
+
09DC..09DD ; Consonant # Lo [2] BENGALI LETTER RRA..BENGALI LETTER RHA
|
|
743
|
+
09DF ; Consonant # Lo BENGALI LETTER YYA
|
|
744
|
+
09F0..09F1 ; Consonant # Lo [2] BENGALI LETTER RA WITH MIDDLE DIAGONAL..BENGALI LETTER RA WITH LOWER DIAGONAL
|
|
745
|
+
0A15..0A28 ; Consonant # Lo [20] GURMUKHI LETTER KA..GURMUKHI LETTER NA
|
|
746
|
+
0A2A..0A30 ; Consonant # Lo [7] GURMUKHI LETTER PA..GURMUKHI LETTER RA
|
|
747
|
+
0A32..0A33 ; Consonant # Lo [2] GURMUKHI LETTER LA..GURMUKHI LETTER LLA
|
|
748
|
+
0A35..0A36 ; Consonant # Lo [2] GURMUKHI LETTER VA..GURMUKHI LETTER SHA
|
|
749
|
+
0A38..0A39 ; Consonant # Lo [2] GURMUKHI LETTER SA..GURMUKHI LETTER HA
|
|
750
|
+
0A59..0A5C ; Consonant # Lo [4] GURMUKHI LETTER KHHA..GURMUKHI LETTER RRA
|
|
751
|
+
0A5E ; Consonant # Lo GURMUKHI LETTER FA
|
|
752
|
+
0A95..0AA8 ; Consonant # Lo [20] GUJARATI LETTER KA..GUJARATI LETTER NA
|
|
753
|
+
0AAA..0AB0 ; Consonant # Lo [7] GUJARATI LETTER PA..GUJARATI LETTER RA
|
|
754
|
+
0AB2..0AB3 ; Consonant # Lo [2] GUJARATI LETTER LA..GUJARATI LETTER LLA
|
|
755
|
+
0AB5..0AB9 ; Consonant # Lo [5] GUJARATI LETTER VA..GUJARATI LETTER HA
|
|
756
|
+
0AF9 ; Consonant # Lo GUJARATI LETTER ZHA
|
|
757
|
+
0B15..0B28 ; Consonant # Lo [20] ORIYA LETTER KA..ORIYA LETTER NA
|
|
758
|
+
0B2A..0B30 ; Consonant # Lo [7] ORIYA LETTER PA..ORIYA LETTER RA
|
|
759
|
+
0B32..0B33 ; Consonant # Lo [2] ORIYA LETTER LA..ORIYA LETTER LLA
|
|
760
|
+
0B35..0B39 ; Consonant # Lo [5] ORIYA LETTER VA..ORIYA LETTER HA
|
|
761
|
+
0B5C..0B5D ; Consonant # Lo [2] ORIYA LETTER RRA..ORIYA LETTER RHA
|
|
762
|
+
0B5F ; Consonant # Lo ORIYA LETTER YYA
|
|
763
|
+
0B71 ; Consonant # Lo ORIYA LETTER WA
|
|
764
|
+
0B95 ; Consonant # Lo TAMIL LETTER KA
|
|
765
|
+
0B99..0B9A ; Consonant # Lo [2] TAMIL LETTER NGA..TAMIL LETTER CA
|
|
766
|
+
0B9C ; Consonant # Lo TAMIL LETTER JA
|
|
767
|
+
0B9E..0B9F ; Consonant # Lo [2] TAMIL LETTER NYA..TAMIL LETTER TTA
|
|
768
|
+
0BA3..0BA4 ; Consonant # Lo [2] TAMIL LETTER NNA..TAMIL LETTER TA
|
|
769
|
+
0BA8..0BAA ; Consonant # Lo [3] TAMIL LETTER NA..TAMIL LETTER PA
|
|
770
|
+
0BAE..0BB9 ; Consonant # Lo [12] TAMIL LETTER MA..TAMIL LETTER HA
|
|
771
|
+
0C15..0C28 ; Consonant # Lo [20] TELUGU LETTER KA..TELUGU LETTER NA
|
|
772
|
+
0C2A..0C39 ; Consonant # Lo [16] TELUGU LETTER PA..TELUGU LETTER HA
|
|
773
|
+
0C58..0C5A ; Consonant # Lo [3] TELUGU LETTER TSA..TELUGU LETTER RRRA
|
|
774
|
+
0C95..0CA8 ; Consonant # Lo [20] KANNADA LETTER KA..KANNADA LETTER NA
|
|
775
|
+
0CAA..0CB3 ; Consonant # Lo [10] KANNADA LETTER PA..KANNADA LETTER LLA
|
|
776
|
+
0CB5..0CB9 ; Consonant # Lo [5] KANNADA LETTER VA..KANNADA LETTER HA
|
|
777
|
+
0CDE ; Consonant # Lo KANNADA LETTER FA
|
|
778
|
+
0D15..0D3A ; Consonant # Lo [38] MALAYALAM LETTER KA..MALAYALAM LETTER TTTA
|
|
779
|
+
0D9A..0DB1 ; Consonant # Lo [24] SINHALA LETTER ALPAPRAANA KAYANNA..SINHALA LETTER DANTAJA NAYANNA
|
|
780
|
+
0DB3..0DBB ; Consonant # Lo [9] SINHALA LETTER SANYAKA DAYANNA..SINHALA LETTER RAYANNA
|
|
781
|
+
0DBD ; Consonant # Lo SINHALA LETTER DANTAJA LAYANNA
|
|
782
|
+
0DC0..0DC6 ; Consonant # Lo [7] SINHALA LETTER VAYANNA..SINHALA LETTER FAYANNA
|
|
783
|
+
0E01..0E2E ; Consonant # Lo [46] THAI CHARACTER KO KAI..THAI CHARACTER HO NOKHUK
|
|
784
|
+
0E81..0E82 ; Consonant # Lo [2] LAO LETTER KO..LAO LETTER KHO SUNG
|
|
785
|
+
0E84 ; Consonant # Lo LAO LETTER KHO TAM
|
|
786
|
+
0E86..0E8A ; Consonant # Lo [5] LAO LETTER PALI GHA..LAO LETTER SO TAM
|
|
787
|
+
0E8C..0EA3 ; Consonant # Lo [24] LAO LETTER PALI JHA..LAO LETTER LO LING
|
|
788
|
+
0EA5 ; Consonant # Lo LAO LETTER LO LOOT
|
|
789
|
+
0EA7..0EAE ; Consonant # Lo [8] LAO LETTER WO..LAO LETTER HO TAM
|
|
790
|
+
0EDC..0EDF ; Consonant # Lo [4] LAO HO NO..LAO LETTER KHMU NYO
|
|
791
|
+
0F40..0F47 ; Consonant # Lo [8] TIBETAN LETTER KA..TIBETAN LETTER JA
|
|
792
|
+
0F49..0F6C ; Consonant # Lo [36] TIBETAN LETTER NYA..TIBETAN LETTER RRA
|
|
793
|
+
1000..1020 ; Consonant # Lo [33] MYANMAR LETTER KA..MYANMAR LETTER LLA
|
|
794
|
+
103F ; Consonant # Lo MYANMAR LETTER GREAT SA
|
|
795
|
+
1050..1051 ; Consonant # Lo [2] MYANMAR LETTER SHA..MYANMAR LETTER SSA
|
|
796
|
+
105A..105D ; Consonant # Lo [4] MYANMAR LETTER MON NGA..MYANMAR LETTER MON BBE
|
|
797
|
+
1061 ; Consonant # Lo MYANMAR LETTER SGAW KAREN SHA
|
|
798
|
+
1065..1066 ; Consonant # Lo [2] MYANMAR LETTER WESTERN PWO KAREN THA..MYANMAR LETTER WESTERN PWO KAREN PWA
|
|
799
|
+
106E..1070 ; Consonant # Lo [3] MYANMAR LETTER EASTERN PWO KAREN NNA..MYANMAR LETTER EASTERN PWO KAREN GHWA
|
|
800
|
+
1075..1081 ; Consonant # Lo [13] MYANMAR LETTER SHAN KA..MYANMAR LETTER SHAN HA
|
|
801
|
+
108E ; Consonant # Lo MYANMAR LETTER RUMAI PALAUNG FA
|
|
802
|
+
1703..170C ; Consonant # Lo [10] TAGALOG LETTER KA..TAGALOG LETTER YA
|
|
803
|
+
170E..1711 ; Consonant # Lo [4] TAGALOG LETTER LA..TAGALOG LETTER HA
|
|
804
|
+
1723..1731 ; Consonant # Lo [15] HANUNOO LETTER KA..HANUNOO LETTER HA
|
|
805
|
+
1743..1751 ; Consonant # Lo [15] BUHID LETTER KA..BUHID LETTER HA
|
|
806
|
+
1763..176C ; Consonant # Lo [10] TAGBANWA LETTER KA..TAGBANWA LETTER YA
|
|
807
|
+
176E..1770 ; Consonant # Lo [3] TAGBANWA LETTER LA..TAGBANWA LETTER SA
|
|
808
|
+
1780..17A2 ; Consonant # Lo [35] KHMER LETTER KA..KHMER LETTER QA
|
|
809
|
+
1901..191E ; Consonant # Lo [30] LIMBU LETTER KA..LIMBU LETTER TRA
|
|
810
|
+
1950..1962 ; Consonant # Lo [19] TAI LE LETTER KA..TAI LE LETTER NA
|
|
811
|
+
1980..19AB ; Consonant # Lo [44] NEW TAI LUE LETTER HIGH QA..NEW TAI LUE LETTER LOW SUA
|
|
812
|
+
1A00..1A16 ; Consonant # Lo [23] BUGINESE LETTER KA..BUGINESE LETTER HA
|
|
813
|
+
1A20..1A4C ; Consonant # Lo [45] TAI THAM LETTER HIGH KA..TAI THAM LETTER LOW HA
|
|
814
|
+
1A53..1A54 ; Consonant # Lo [2] TAI THAM LETTER LAE..TAI THAM LETTER GREAT SA
|
|
815
|
+
1B13..1B33 ; Consonant # Lo [33] BALINESE LETTER KA..BALINESE LETTER HA
|
|
816
|
+
1B45..1B4B ; Consonant # Lo [7] BALINESE LETTER KAF SASAK..BALINESE LETTER ASYURA SASAK
|
|
817
|
+
1B8A..1BA0 ; Consonant # Lo [23] SUNDANESE LETTER KA..SUNDANESE LETTER HA
|
|
818
|
+
1BAE..1BAF ; Consonant # Lo [2] SUNDANESE LETTER KHA..SUNDANESE LETTER SYA
|
|
819
|
+
1BBB..1BBD ; Consonant # Lo [3] SUNDANESE LETTER REU..SUNDANESE LETTER BHA
|
|
820
|
+
1BC0..1BE3 ; Consonant # Lo [36] BATAK LETTER A..BATAK LETTER MBA
|
|
821
|
+
1C00..1C23 ; Consonant # Lo [36] LEPCHA LETTER KA..LEPCHA LETTER A
|
|
822
|
+
1C4D..1C4F ; Consonant # Lo [3] LEPCHA LETTER TTA..LEPCHA LETTER DDA
|
|
823
|
+
A807..A80A ; Consonant # Lo [4] SYLOTI NAGRI LETTER KO..SYLOTI NAGRI LETTER GHO
|
|
824
|
+
A80C..A822 ; Consonant # Lo [23] SYLOTI NAGRI LETTER CO..SYLOTI NAGRI LETTER HO
|
|
825
|
+
A840..A85D ; Consonant # Lo [30] PHAGS-PA LETTER KA..PHAGS-PA LETTER A
|
|
826
|
+
A862..A865 ; Consonant # Lo [4] PHAGS-PA LETTER QA..PHAGS-PA LETTER GGA
|
|
827
|
+
A869..A870 ; Consonant # Lo [8] PHAGS-PA LETTER TTA..PHAGS-PA LETTER ASPIRATED FA
|
|
828
|
+
A872 ; Consonant # Lo PHAGS-PA SUPERFIXED LETTER RA
|
|
829
|
+
A892..A8B3 ; Consonant # Lo [34] SAURASHTRA LETTER KA..SAURASHTRA LETTER LLA
|
|
830
|
+
A90A..A921 ; Consonant # Lo [24] KAYAH LI LETTER KA..KAYAH LI LETTER CA
|
|
831
|
+
A930..A946 ; Consonant # Lo [23] REJANG LETTER KA..REJANG LETTER A
|
|
832
|
+
A989..A98B ; Consonant # Lo [3] JAVANESE LETTER PA CEREK..JAVANESE LETTER NGA LELET RASWADI
|
|
833
|
+
A98F..A9B2 ; Consonant # Lo [36] JAVANESE LETTER KA..JAVANESE LETTER HA
|
|
834
|
+
A9E0..A9E4 ; Consonant # Lo [5] MYANMAR LETTER SHAN GHA..MYANMAR LETTER SHAN BHA
|
|
835
|
+
A9E7..A9EF ; Consonant # Lo [9] MYANMAR LETTER TAI LAING NYA..MYANMAR LETTER TAI LAING NNA
|
|
836
|
+
A9FA..A9FE ; Consonant # Lo [5] MYANMAR LETTER TAI LAING LLA..MYANMAR LETTER TAI LAING BHA
|
|
837
|
+
AA06..AA28 ; Consonant # Lo [35] CHAM LETTER KA..CHAM LETTER HA
|
|
838
|
+
AA60..AA6F ; Consonant # Lo [16] MYANMAR LETTER KHAMTI GA..MYANMAR LETTER KHAMTI FA
|
|
839
|
+
AA71..AA73 ; Consonant # Lo [3] MYANMAR LETTER KHAMTI XA..MYANMAR LETTER KHAMTI RA
|
|
840
|
+
AA7A ; Consonant # Lo MYANMAR LETTER AITON RA
|
|
841
|
+
AA7E..AA7F ; Consonant # Lo [2] MYANMAR LETTER SHWE PALAUNG CHA..MYANMAR LETTER SHWE PALAUNG SHA
|
|
842
|
+
AA80..AAAF ; Consonant # Lo [48] TAI VIET LETTER LOW KO..TAI VIET LETTER HIGH O
|
|
843
|
+
AAE2..AAEA ; Consonant # Lo [9] MEETEI MAYEK LETTER CHA..MEETEI MAYEK LETTER SSA
|
|
844
|
+
ABC0..ABCD ; Consonant # Lo [14] MEETEI MAYEK LETTER KOK..MEETEI MAYEK LETTER HUK
|
|
845
|
+
ABD0 ; Consonant # Lo MEETEI MAYEK LETTER PHAM
|
|
846
|
+
ABD2..ABDA ; Consonant # Lo [9] MEETEI MAYEK LETTER GOK..MEETEI MAYEK LETTER BHAM
|
|
847
|
+
10A00 ; Consonant # Lo KHAROSHTHI LETTER A
|
|
848
|
+
10A10..10A13 ; Consonant # Lo [4] KHAROSHTHI LETTER KA..KHAROSHTHI LETTER GHA
|
|
849
|
+
10A15..10A17 ; Consonant # Lo [3] KHAROSHTHI LETTER CA..KHAROSHTHI LETTER JA
|
|
850
|
+
10A19..10A35 ; Consonant # Lo [29] KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER VHA
|
|
851
|
+
11013..11037 ; Consonant # Lo [37] BRAHMI LETTER KA..BRAHMI LETTER OLD TAMIL NNNA
|
|
852
|
+
1108D..110AF ; Consonant # Lo [35] KAITHI LETTER KA..KAITHI LETTER HA
|
|
853
|
+
11107..11126 ; Consonant # Lo [32] CHAKMA LETTER KAA..CHAKMA LETTER HAA
|
|
854
|
+
11144 ; Consonant # Lo CHAKMA LETTER LHAA
|
|
855
|
+
11155..11172 ; Consonant # Lo [30] MAHAJANI LETTER KA..MAHAJANI LETTER RRA
|
|
856
|
+
11191..111B2 ; Consonant # Lo [34] SHARADA LETTER KA..SHARADA LETTER HA
|
|
857
|
+
11208..11211 ; Consonant # Lo [10] KHOJKI LETTER KA..KHOJKI LETTER JJA
|
|
858
|
+
11213..1122B ; Consonant # Lo [25] KHOJKI LETTER NYA..KHOJKI LETTER LLA
|
|
859
|
+
11284..11286 ; Consonant # Lo [3] MULTANI LETTER KA..MULTANI LETTER GA
|
|
860
|
+
11288 ; Consonant # Lo MULTANI LETTER GHA
|
|
861
|
+
1128A..1128D ; Consonant # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
|
|
862
|
+
1128F..1129D ; Consonant # Lo [15] MULTANI LETTER NYA..MULTANI LETTER BA
|
|
863
|
+
1129F..112A8 ; Consonant # Lo [10] MULTANI LETTER BHA..MULTANI LETTER RHA
|
|
864
|
+
112BA..112DE ; Consonant # Lo [37] KHUDAWADI LETTER KA..KHUDAWADI LETTER HA
|
|
865
|
+
11315..11328 ; Consonant # Lo [20] GRANTHA LETTER KA..GRANTHA LETTER NA
|
|
866
|
+
1132A..11330 ; Consonant # Lo [7] GRANTHA LETTER PA..GRANTHA LETTER RA
|
|
867
|
+
11332..11333 ; Consonant # Lo [2] GRANTHA LETTER LA..GRANTHA LETTER LLA
|
|
868
|
+
11335..11339 ; Consonant # Lo [5] GRANTHA LETTER VA..GRANTHA LETTER HA
|
|
869
|
+
1140E..11434 ; Consonant # Lo [39] NEWA LETTER KA..NEWA LETTER HA
|
|
870
|
+
1148F..114AF ; Consonant # Lo [33] TIRHUTA LETTER KA..TIRHUTA LETTER HA
|
|
871
|
+
1158E..115AE ; Consonant # Lo [33] SIDDHAM LETTER KA..SIDDHAM LETTER HA
|
|
872
|
+
1160E..1162F ; Consonant # Lo [34] MODI LETTER KA..MODI LETTER LLA
|
|
873
|
+
1168A..116AA ; Consonant # Lo [33] TAKRI LETTER KA..TAKRI LETTER RRA
|
|
874
|
+
116B8 ; Consonant # Lo TAKRI LETTER ARCHAIC KHA
|
|
875
|
+
11700..1171A ; Consonant # Lo [27] AHOM LETTER KA..AHOM LETTER ALTERNATE BA
|
|
876
|
+
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
|
|
877
|
+
119AE..119D0 ; Consonant # Lo [35] NANDINAGARI LETTER KA..NANDINAGARI LETTER RRA
|
|
878
|
+
11A0B..11A32 ; Consonant # Lo [40] ZANABAZAR SQUARE LETTER KA..ZANABAZAR SQUARE LETTER KSSA
|
|
879
|
+
11A5C..11A83 ; Consonant # Lo [40] SOYOMBO LETTER KA..SOYOMBO LETTER KSSA
|
|
880
|
+
11C0E..11C2E ; Consonant # Lo [33] BHAIKSUKI LETTER KA..BHAIKSUKI LETTER HA
|
|
881
|
+
11C72..11C8F ; Consonant # Lo [30] MARCHEN LETTER KA..MARCHEN LETTER A
|
|
882
|
+
11D0C..11D30 ; Consonant # Lo [37] MASARAM GONDI LETTER KA..MASARAM GONDI LETTER TRA
|
|
883
|
+
11D6C..11D89 ; Consonant # Lo [30] GUNJALA GONDI LETTER YA..GUNJALA GONDI LETTER SA
|
|
884
|
+
11EE0..11EF1 ; Consonant # Lo [18] MAKASAR LETTER KA..MAKASAR LETTER A
|
|
885
|
+
|
|
886
|
+
# ================================================
|
|
887
|
+
|
|
888
|
+
# Indic_Syllabic_Category=Consonant_Dead
|
|
889
|
+
|
|
890
|
+
# Dead Consonant (special consonant with killed vowel)
|
|
891
|
+
|
|
892
|
+
# [Not derivable]
|
|
893
|
+
|
|
894
|
+
09CE ; Consonant_Dead # Lo BENGALI LETTER KHANDA TA
|
|
895
|
+
0D54..0D56 ; Consonant_Dead # Lo [3] MALAYALAM LETTER CHILLU M..MALAYALAM LETTER CHILLU LLL
|
|
896
|
+
0D7A..0D7F ; Consonant_Dead # Lo [6] MALAYALAM LETTER CHILLU NN..MALAYALAM LETTER CHILLU K
|
|
897
|
+
1CF2..1CF3 ; Consonant_Dead # Lo [2] VEDIC SIGN ARDHAVISARGA..VEDIC SIGN ROTATED ARDHAVISARGA
|
|
898
|
+
|
|
899
|
+
# ================================================
|
|
900
|
+
|
|
901
|
+
# Indic_Syllabic_Category=Consonant_With_Stacker
|
|
902
|
+
|
|
903
|
+
# Consonants that may make stacked ligatures with the next consonant
|
|
904
|
+
# without the use of a virama
|
|
905
|
+
|
|
906
|
+
# [Not derivable]
|
|
907
|
+
|
|
908
|
+
0CF1..0CF2 ; Consonant_With_Stacker # Lo [2] KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADHMANIYA
|
|
909
|
+
1CF5..1CF6 ; Consonant_With_Stacker # Lo [2] VEDIC SIGN JIHVAMULIYA..VEDIC SIGN UPADHMANIYA
|
|
910
|
+
11003..11004 ; Consonant_With_Stacker # Lo [2] BRAHMI SIGN JIHVAMULIYA..BRAHMI SIGN UPADHMANIYA
|
|
911
|
+
|
|
912
|
+
# ================================================
|
|
913
|
+
|
|
914
|
+
# Indic_Syllabic_Category=Consonant_Prefixed
|
|
915
|
+
|
|
916
|
+
# Cluster-initial consonants
|
|
917
|
+
|
|
918
|
+
# [Not derivable]
|
|
919
|
+
|
|
920
|
+
111C2..111C3 ; Consonant_Prefixed # Lo [2] SHARADA SIGN JIHVAMULIYA..SHARADA SIGN UPADHMANIYA
|
|
921
|
+
11A3A ; Consonant_Prefixed # Lo ZANABAZAR SQUARE CLUSTER-INITIAL LETTER RA
|
|
922
|
+
11A84..11A89 ; Consonant_Prefixed # Lo [6] SOYOMBO SIGN JIHVAMULIYA..SOYOMBO CLUSTER-INITIAL LETTER SA
|
|
923
|
+
|
|
924
|
+
# ================================================
|
|
925
|
+
|
|
926
|
+
# Indic_Syllabic_Category=Consonant_Preceding_Repha
|
|
927
|
+
|
|
928
|
+
# Repha Form of RA (reanalyzed in some scripts), when preceding the main
|
|
929
|
+
# consonant.
|
|
930
|
+
|
|
931
|
+
# [Not derivable]
|
|
932
|
+
|
|
933
|
+
0D4E ; Consonant_Preceding_Repha # Lo MALAYALAM LETTER DOT REPH
|
|
934
|
+
11D46 ; Consonant_Preceding_Repha # Lo MASARAM GONDI REPHA
|
|
935
|
+
|
|
936
|
+
# ================================================
|
|
937
|
+
|
|
938
|
+
# Indic_Syllabic_Category=Consonant_Initial_Postfixed
|
|
939
|
+
|
|
940
|
+
# Consonants that succeed the main consonant in character sequences, but are
|
|
941
|
+
# pronounced before it.
|
|
942
|
+
|
|
943
|
+
# [Not derivable]
|
|
944
|
+
|
|
945
|
+
1A5A ; Consonant_Initial_Postfixed # Mn TAI THAM CONSONANT SIGN LOW PA
|
|
946
|
+
|
|
947
|
+
# ================================================
|
|
948
|
+
|
|
949
|
+
# Indic_Syllabic_Category=Consonant_Succeeding_Repha
|
|
950
|
+
|
|
951
|
+
# Repha Form of RA (reanalyzed in some scripts), when succeeding the main
|
|
952
|
+
# consonant.
|
|
953
|
+
|
|
954
|
+
# [Not derivable]
|
|
955
|
+
|
|
956
|
+
17CC ; Consonant_Succeeding_Repha # Mn KHMER SIGN ROBAT
|
|
957
|
+
1B03 ; Consonant_Succeeding_Repha # Mn BALINESE SIGN SURANG
|
|
958
|
+
1B81 ; Consonant_Succeeding_Repha # Mn SUNDANESE SIGN PANGLAYAR
|
|
959
|
+
A982 ; Consonant_Succeeding_Repha # Mn JAVANESE SIGN LAYAR
|
|
960
|
+
|
|
961
|
+
# ================================================
|
|
962
|
+
|
|
963
|
+
# Indic_Syllabic_Category=Consonant_Subjoined
|
|
964
|
+
|
|
965
|
+
# Subjoined Consonant (C2 form subtending a base consonant in Tibetan, etc.)
|
|
966
|
+
|
|
967
|
+
# [Not derivable]
|
|
968
|
+
|
|
969
|
+
0F8D..0F97 ; Consonant_Subjoined # Mn [11] TIBETAN SUBJOINED SIGN LCE TSA CAN..TIBETAN SUBJOINED LETTER JA
|
|
970
|
+
0F99..0FBC ; Consonant_Subjoined # Mn [36] TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOINED LETTER FIXED-FORM RA
|
|
971
|
+
1929..192B ; Consonant_Subjoined # Mc [3] LIMBU SUBJOINED LETTER YA..LIMBU SUBJOINED LETTER WA
|
|
972
|
+
1A57 ; Consonant_Subjoined # Mc TAI THAM CONSONANT SIGN LA TANG LAI
|
|
973
|
+
1A5B..1A5E ; Consonant_Subjoined # Mn [4] TAI THAM CONSONANT SIGN HIGH RATHA OR LOW PA..TAI THAM CONSONANT SIGN SA
|
|
974
|
+
1BA1 ; Consonant_Subjoined # Mc SUNDANESE CONSONANT SIGN PAMINGKAL
|
|
975
|
+
1BA2..1BA3 ; Consonant_Subjoined # Mn [2] SUNDANESE CONSONANT SIGN PANYAKRA..SUNDANESE CONSONANT SIGN PANYIKU
|
|
976
|
+
1BAC..1BAD ; Consonant_Subjoined # Mn [2] SUNDANESE CONSONANT SIGN PASANGAN MA..SUNDANESE CONSONANT SIGN PASANGAN WA
|
|
977
|
+
1C24..1C25 ; Consonant_Subjoined # Mc [2] LEPCHA SUBJOINED LETTER YA..LEPCHA SUBJOINED LETTER RA
|
|
978
|
+
A867..A868 ; Consonant_Subjoined # Lo [2] PHAGS-PA SUBJOINED LETTER WA..PHAGS-PA SUBJOINED LETTER YA
|
|
979
|
+
A871 ; Consonant_Subjoined # Lo PHAGS-PA SUBJOINED LETTER RA
|
|
980
|
+
11C92..11CA7 ; Consonant_Subjoined # Mn [22] MARCHEN SUBJOINED LETTER KA..MARCHEN SUBJOINED LETTER ZA
|
|
981
|
+
11CA9 ; Consonant_Subjoined # Mc MARCHEN SUBJOINED LETTER YA
|
|
982
|
+
11CAA..11CAF ; Consonant_Subjoined # Mn [6] MARCHEN SUBJOINED LETTER RA..MARCHEN SUBJOINED LETTER A
|
|
983
|
+
|
|
984
|
+
# ================================================
|
|
985
|
+
|
|
986
|
+
# Indic_Syllabic_Category=Consonant_Medial
|
|
987
|
+
|
|
988
|
+
# Medial Consonant (medial liquid, occurring in clusters)
|
|
989
|
+
|
|
990
|
+
# [Not derivable]
|
|
991
|
+
|
|
992
|
+
0A75 ; Consonant_Medial # Mn GURMUKHI SIGN YAKASH
|
|
993
|
+
0EBC ; Consonant_Medial # Mn LAO SEMIVOWEL SIGN LO
|
|
994
|
+
0EBD ; Consonant_Medial # Lo LAO SEMIVOWEL SIGN NYO
|
|
995
|
+
103B..103C ; Consonant_Medial # Mc [2] MYANMAR CONSONANT SIGN MEDIAL YA..MYANMAR CONSONANT SIGN MEDIAL RA
|
|
996
|
+
103D..103E ; Consonant_Medial # Mn [2] MYANMAR CONSONANT SIGN MEDIAL WA..MYANMAR CONSONANT SIGN MEDIAL HA
|
|
997
|
+
105E..1060 ; Consonant_Medial # Mn [3] MYANMAR CONSONANT SIGN MON MEDIAL NA..MYANMAR CONSONANT SIGN MON MEDIAL LA
|
|
998
|
+
1082 ; Consonant_Medial # Mn MYANMAR CONSONANT SIGN SHAN MEDIAL WA
|
|
999
|
+
1A55 ; Consonant_Medial # Mc TAI THAM CONSONANT SIGN MEDIAL RA
|
|
1000
|
+
1A56 ; Consonant_Medial # Mn TAI THAM CONSONANT SIGN MEDIAL LA
|
|
1001
|
+
A8B4 ; Consonant_Medial # Mc SAURASHTRA CONSONANT SIGN HAARU
|
|
1002
|
+
A9BD ; Consonant_Medial # Mn JAVANESE CONSONANT SIGN KERET
|
|
1003
|
+
A9BE..A9BF ; Consonant_Medial # Mc [2] JAVANESE CONSONANT SIGN PENGKAL..JAVANESE CONSONANT SIGN CAKRA
|
|
1004
|
+
AA33..AA34 ; Consonant_Medial # Mc [2] CHAM CONSONANT SIGN YA..CHAM CONSONANT SIGN RA
|
|
1005
|
+
AA35..AA36 ; Consonant_Medial # Mn [2] CHAM CONSONANT SIGN LA..CHAM CONSONANT SIGN WA
|
|
1006
|
+
1171D..1171F ; Consonant_Medial # Mn [3] AHOM CONSONANT SIGN MEDIAL LA..AHOM CONSONANT SIGN MEDIAL LIGATING RA
|
|
1007
|
+
11A3B..11A3E ; Consonant_Medial # Mn [4] ZANABAZAR SQUARE CLUSTER-FINAL LETTER YA..ZANABAZAR SQUARE CLUSTER-FINAL LETTER VA
|
|
1008
|
+
11D47 ; Consonant_Medial # Mn MASARAM GONDI RA-KARA
|
|
1009
|
+
|
|
1010
|
+
# ================================================
|
|
1011
|
+
|
|
1012
|
+
# Indic_Syllabic_Category=Consonant_Final
|
|
1013
|
+
|
|
1014
|
+
# Final Consonant (special final forms which do not take vowels)
|
|
1015
|
+
|
|
1016
|
+
# [Not derivable]
|
|
1017
|
+
|
|
1018
|
+
1930..1931 ; Consonant_Final # Mc [2] LIMBU SMALL LETTER KA..LIMBU SMALL LETTER NGA
|
|
1019
|
+
1933..1938 ; Consonant_Final # Mc [6] LIMBU SMALL LETTER TA..LIMBU SMALL LETTER LA
|
|
1020
|
+
1939 ; Consonant_Final # Mn LIMBU SIGN MUKPHRENG
|
|
1021
|
+
19C1..19C7 ; Consonant_Final # Lo [7] NEW TAI LUE LETTER FINAL V..NEW TAI LUE LETTER FINAL B
|
|
1022
|
+
1A58..1A59 ; Consonant_Final # Mn [2] TAI THAM SIGN MAI KANG LAI..TAI THAM CONSONANT SIGN FINAL NGA
|
|
1023
|
+
1BBE..1BBF ; Consonant_Final # Lo [2] SUNDANESE LETTER FINAL K..SUNDANESE LETTER FINAL M
|
|
1024
|
+
1BF0..1BF1 ; Consonant_Final # Mn [2] BATAK CONSONANT SIGN NG..BATAK CONSONANT SIGN H
|
|
1025
|
+
1C2D..1C33 ; Consonant_Final # Mn [7] LEPCHA CONSONANT SIGN K..LEPCHA CONSONANT SIGN T
|
|
1026
|
+
A94F..A951 ; Consonant_Final # Mn [3] REJANG CONSONANT SIGN NG..REJANG CONSONANT SIGN R
|
|
1027
|
+
A952 ; Consonant_Final # Mc REJANG CONSONANT SIGN H
|
|
1028
|
+
AA40..AA42 ; Consonant_Final # Lo [3] CHAM LETTER FINAL K..CHAM LETTER FINAL NG
|
|
1029
|
+
AA43 ; Consonant_Final # Mn CHAM CONSONANT SIGN FINAL NG
|
|
1030
|
+
AA44..AA4B ; Consonant_Final # Lo [8] CHAM LETTER FINAL CH..CHAM LETTER FINAL SS
|
|
1031
|
+
AA4C ; Consonant_Final # Mn CHAM CONSONANT SIGN FINAL M
|
|
1032
|
+
AA4D ; Consonant_Final # Mc CHAM CONSONANT SIGN FINAL H
|
|
1033
|
+
ABDB..ABE2 ; Consonant_Final # Lo [8] MEETEI MAYEK LETTER KOK LONSUM..MEETEI MAYEK LETTER I LONSUM
|
|
1034
|
+
11A8A..11A95 ; Consonant_Final # Mn [12] SOYOMBO FINAL CONSONANT SIGN G..SOYOMBO FINAL CONSONANT SIGN -A
|
|
1035
|
+
|
|
1036
|
+
# ================================================
|
|
1037
|
+
|
|
1038
|
+
# Indic_Syllabic_Category=Consonant_Head_Letter
|
|
1039
|
+
|
|
1040
|
+
# Head Letter (Tibetan)
|
|
1041
|
+
|
|
1042
|
+
# [Not derivable]
|
|
1043
|
+
|
|
1044
|
+
0F88..0F8C ; Consonant_Head_Letter # Lo [5] TIBETAN SIGN LCE TSA CAN..TIBETAN SIGN INVERTED MCHU CAN
|
|
1045
|
+
|
|
1046
|
+
# ================================================
|
|
1047
|
+
|
|
1048
|
+
# Indic_Syllabic_Category=Modifying_Letter
|
|
1049
|
+
|
|
1050
|
+
# Reanalyzed letters not participating in the abugida structure, but
|
|
1051
|
+
# serving to modify the sound of an adjacent vowel or consonant.
|
|
1052
|
+
# Note that this is not the same as General_Category=Modifier_Letter.
|
|
1053
|
+
|
|
1054
|
+
# [Not derivable]
|
|
1055
|
+
|
|
1056
|
+
0B83 ; Modifying_Letter # Lo TAMIL SIGN VISARGA
|
|
1057
|
+
|
|
1058
|
+
# ================================================
|
|
1059
|
+
|
|
1060
|
+
# Indic_Syllabic_Category=Tone_Letter
|
|
1061
|
+
|
|
1062
|
+
# Tone Letter (spacing lexical tone mark with status as a letter)
|
|
1063
|
+
|
|
1064
|
+
# [Not derivable]
|
|
1065
|
+
|
|
1066
|
+
1970..1974 ; Tone_Letter # Lo [5] TAI LE LETTER TONE-2..TAI LE LETTER TONE-6
|
|
1067
|
+
AAC0 ; Tone_Letter # Lo TAI VIET TONE MAI NUENG
|
|
1068
|
+
AAC2 ; Tone_Letter # Lo TAI VIET TONE MAI SONG
|
|
1069
|
+
|
|
1070
|
+
# ================================================
|
|
1071
|
+
|
|
1072
|
+
# Indic_Syllabic_Category=Tone_Mark
|
|
1073
|
+
|
|
1074
|
+
# Tone Mark (nonspacing or spacing lexical tone mark)
|
|
1075
|
+
|
|
1076
|
+
# [Not derivable]
|
|
1077
|
+
|
|
1078
|
+
0E48..0E4B ; Tone_Mark # Mn [4] THAI CHARACTER MAI EK..THAI CHARACTER MAI CHATTAWA
|
|
1079
|
+
0EC8..0ECB ; Tone_Mark # Mn [4] LAO TONE MAI EK..LAO TONE MAI CATAWA
|
|
1080
|
+
1037 ; Tone_Mark # Mn MYANMAR SIGN DOT BELOW
|
|
1081
|
+
1063..1064 ; Tone_Mark # Mc [2] MYANMAR TONE MARK SGAW KAREN HATHI..MYANMAR TONE MARK SGAW KAREN KE PHO
|
|
1082
|
+
1069..106D ; Tone_Mark # Mc [5] MYANMAR SIGN WESTERN PWO KAREN TONE-1..MYANMAR SIGN WESTERN PWO KAREN TONE-5
|
|
1083
|
+
1087..108C ; Tone_Mark # Mc [6] MYANMAR SIGN SHAN TONE-2..MYANMAR SIGN SHAN COUNCIL TONE-3
|
|
1084
|
+
108D ; Tone_Mark # Mn MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE
|
|
1085
|
+
108F ; Tone_Mark # Mc MYANMAR SIGN RUMAI PALAUNG TONE-5
|
|
1086
|
+
109A..109B ; Tone_Mark # Mc [2] MYANMAR SIGN KHAMTI TONE-1..MYANMAR SIGN KHAMTI TONE-3
|
|
1087
|
+
19C8..19C9 ; Tone_Mark # Lo [2] NEW TAI LUE TONE MARK-1..NEW TAI LUE TONE MARK-2
|
|
1088
|
+
1A75..1A79 ; Tone_Mark # Mn [5] TAI THAM SIGN TONE-1..TAI THAM SIGN KHUEN TONE-5
|
|
1089
|
+
A92B..A92D ; Tone_Mark # Mn [3] KAYAH LI TONE PLOPHU..KAYAH LI TONE CALYA PLOPHU
|
|
1090
|
+
AA7B ; Tone_Mark # Mc MYANMAR SIGN PAO KAREN TONE
|
|
1091
|
+
AA7C ; Tone_Mark # Mn MYANMAR SIGN TAI LAING TONE-2
|
|
1092
|
+
AA7D ; Tone_Mark # Mc MYANMAR SIGN TAI LAING TONE-5
|
|
1093
|
+
AABF ; Tone_Mark # Mn TAI VIET TONE MAI EK
|
|
1094
|
+
AAC1 ; Tone_Mark # Mn TAI VIET TONE MAI THO
|
|
1095
|
+
ABEC ; Tone_Mark # Mc MEETEI MAYEK LUM IYEK
|
|
1096
|
+
|
|
1097
|
+
# ================================================
|
|
1098
|
+
|
|
1099
|
+
# Indic_Syllabic_Category=Gemination_Mark
|
|
1100
|
+
|
|
1101
|
+
# Gemination Mark (doubling of the preceding or following consonant)
|
|
1102
|
+
|
|
1103
|
+
# [Not derivable]
|
|
1104
|
+
|
|
1105
|
+
0A71 ; Gemination_Mark # Mn GURMUKHI ADDAK
|
|
1106
|
+
11237 ; Gemination_Mark # Mn KHOJKI SIGN SHADDA
|
|
1107
|
+
11A98 ; Gemination_Mark # Mn SOYOMBO GEMINATION MARK
|
|
1108
|
+
|
|
1109
|
+
# ================================================
|
|
1110
|
+
|
|
1111
|
+
# Indic_Syllabic_Category=Cantillation_Mark
|
|
1112
|
+
|
|
1113
|
+
# Cantillation Mark (recitation marks, such as svara markers for the Samaveda)
|
|
1114
|
+
|
|
1115
|
+
# [Not derivable]
|
|
1116
|
+
|
|
1117
|
+
0951..0952 ; Cantillation_Mark # Mn [2] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI STRESS SIGN ANUDATTA
|
|
1118
|
+
0A51 ; Cantillation_Mark # Mn GURMUKHI SIGN UDAAT
|
|
1119
|
+
0AFA..0AFC ; Cantillation_Mark # Mn [3] GUJARATI SIGN SUKUN..GUJARATI SIGN MADDAH
|
|
1120
|
+
1CD0..1CD2 ; Cantillation_Mark # Mn [3] VEDIC TONE KARSHANA..VEDIC TONE PRENKHA
|
|
1121
|
+
1CD4..1CE0 ; Cantillation_Mark # Mn [13] VEDIC SIGN YAJURVEDIC MIDLINE SVARITA..VEDIC TONE RIGVEDIC KASHMIRI INDEPENDENT SVARITA
|
|
1122
|
+
1CE1 ; Cantillation_Mark # Mc VEDIC TONE ATHARVAVEDIC INDEPENDENT SVARITA
|
|
1123
|
+
1CF4 ; Cantillation_Mark # Mn VEDIC TONE CANDRA ABOVE
|
|
1124
|
+
1CF7 ; Cantillation_Mark # Mc VEDIC SIGN ATIKRAMA
|
|
1125
|
+
1CF8..1CF9 ; Cantillation_Mark # Mn [2] VEDIC TONE RING ABOVE..VEDIC TONE DOUBLE RING ABOVE
|
|
1126
|
+
20F0 ; Cantillation_Mark # Mn COMBINING ASTERISK ABOVE
|
|
1127
|
+
A8E0..A8F1 ; Cantillation_Mark # Mn [18] COMBINING DEVANAGARI DIGIT ZERO..COMBINING DEVANAGARI SIGN AVAGRAHA
|
|
1128
|
+
1123E ; Cantillation_Mark # Mn KHOJKI SIGN SUKUN
|
|
1129
|
+
11366..1136C ; Cantillation_Mark # Mn [7] COMBINING GRANTHA DIGIT ZERO..COMBINING GRANTHA DIGIT SIX
|
|
1130
|
+
11370..11374 ; Cantillation_Mark # Mn [5] COMBINING GRANTHA LETTER A..COMBINING GRANTHA LETTER PA
|
|
1131
|
+
|
|
1132
|
+
# ================================================
|
|
1133
|
+
|
|
1134
|
+
# Indic_Syllabic_Category=Register_Shifter
|
|
1135
|
+
|
|
1136
|
+
# Register Shifter (shifts register for consonants, akin to a tone mark)
|
|
1137
|
+
|
|
1138
|
+
# [Not derivable]
|
|
1139
|
+
|
|
1140
|
+
17C9..17CA ; Register_Shifter # Mn [2] KHMER SIGN MUUSIKATOAN..KHMER SIGN TRIISAP
|
|
1141
|
+
|
|
1142
|
+
# ================================================
|
|
1143
|
+
|
|
1144
|
+
# Indic_Syllabic_Category=Syllable_Modifier
|
|
1145
|
+
|
|
1146
|
+
# Syllable Modifier (miscellaneous combining characters that modify
|
|
1147
|
+
# something in the orthographic syllable they succeed)
|
|
1148
|
+
|
|
1149
|
+
# [Not derivable]
|
|
1150
|
+
|
|
1151
|
+
00B2..00B3 ; Syllable_Modifier # No [2] SUPERSCRIPT TWO..SUPERSCRIPT THREE
|
|
1152
|
+
09FE ; Syllable_Modifier # Mn BENGALI SANDHI MARK
|
|
1153
|
+
0F35 ; Syllable_Modifier # Mn TIBETAN MARK NGAS BZUNG NYI ZLA
|
|
1154
|
+
0F37 ; Syllable_Modifier # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS
|
|
1155
|
+
0FC6 ; Syllable_Modifier # Mn TIBETAN SYMBOL PADMA GDAN
|
|
1156
|
+
17CB ; Syllable_Modifier # Mn KHMER SIGN BANTOC
|
|
1157
|
+
17CE..17D0 ; Syllable_Modifier # Mn [3] KHMER SIGN KAKABAT..KHMER SIGN SAMYOK SANNYA
|
|
1158
|
+
17D3 ; Syllable_Modifier # Mn KHMER SIGN BATHAMASAT
|
|
1159
|
+
17DD ; Syllable_Modifier # Mn KHMER SIGN ATTHACAN
|
|
1160
|
+
193B ; Syllable_Modifier # Mn LIMBU SIGN SA-I
|
|
1161
|
+
1A7B..1A7C ; Syllable_Modifier # Mn [2] TAI THAM SIGN MAI SAM..TAI THAM SIGN KHUEN-LUE KARAN
|
|
1162
|
+
1A7F ; Syllable_Modifier # Mn TAI THAM COMBINING CRYPTOGRAMMIC DOT
|
|
1163
|
+
1C36 ; Syllable_Modifier # Mn LEPCHA SIGN RAN
|
|
1164
|
+
1DFB ; Syllable_Modifier # Mn COMBINING DELETION MARK
|
|
1165
|
+
2074 ; Syllable_Modifier # No SUPERSCRIPT FOUR
|
|
1166
|
+
2082..2084 ; Syllable_Modifier # No [3] SUBSCRIPT TWO..SUBSCRIPT FOUR
|
|
1167
|
+
111C9 ; Syllable_Modifier # Mn SHARADA SANDHI MARK
|
|
1168
|
+
1145E ; Syllable_Modifier # Mn NEWA SANDHI MARK
|
|
1169
|
+
11A33 ; Syllable_Modifier # Mn ZANABAZAR SQUARE FINAL CONSONANT MARK
|
|
1170
|
+
|
|
1171
|
+
# ================================================
|
|
1172
|
+
|
|
1173
|
+
# Indic_Syllabic_Category=Consonant_Killer
|
|
1174
|
+
|
|
1175
|
+
# Consonant Killer (signifies that the previous consonant or consonants are
|
|
1176
|
+
# not pronounced)
|
|
1177
|
+
|
|
1178
|
+
# [Not derivable]
|
|
1179
|
+
|
|
1180
|
+
0E4C ; Consonant_Killer # Mn THAI CHARACTER THANTHAKHAT
|
|
1181
|
+
17CD ; Consonant_Killer # Mn KHMER SIGN TOANDAKHIAT
|
|
1182
|
+
|
|
1183
|
+
# ================================================
|
|
1184
|
+
|
|
1185
|
+
# Indic_Syllabic_Category=Non_Joiner
|
|
1186
|
+
|
|
1187
|
+
# Non_Joiner (Zero Width Non-Joiner)
|
|
1188
|
+
|
|
1189
|
+
# [Not derivable]
|
|
1190
|
+
|
|
1191
|
+
200C ; Non_Joiner # Cf ZERO WIDTH NON-JOINER
|
|
1192
|
+
|
|
1193
|
+
# ================================================
|
|
1194
|
+
|
|
1195
|
+
# Indic_Syllabic_Category=Joiner
|
|
1196
|
+
|
|
1197
|
+
# Joiner (Zero Width Joiner)
|
|
1198
|
+
|
|
1199
|
+
# [Not derivable]
|
|
1200
|
+
|
|
1201
|
+
200D ; Joiner # Cf ZERO WIDTH JOINER
|
|
1202
|
+
|
|
1203
|
+
# ================================================
|
|
1204
|
+
|
|
1205
|
+
# Indic_Syllabic_Category=Number_Joiner
|
|
1206
|
+
|
|
1207
|
+
# Number_Joiner (forms ligatures between numbers for multiplication)
|
|
1208
|
+
|
|
1209
|
+
# [Not derivable]
|
|
1210
|
+
|
|
1211
|
+
1107F ; Number_Joiner # Mn BRAHMI NUMBER JOINER
|
|
1212
|
+
|
|
1213
|
+
# ================================================
|
|
1214
|
+
|
|
1215
|
+
# Indic_Syllabic_Category=Number
|
|
1216
|
+
|
|
1217
|
+
# Number (can be used as vowel-holders like consonant placeholders)
|
|
1218
|
+
# Note: A number may even hold subjoined consonants which may in turn
|
|
1219
|
+
# have been formed using a virama or a stacker, e.g. the sequence
|
|
1220
|
+
# <U+1A93, U+1A60, U+1A34> where TAI THAM LETTER LOW TA is subjoined to
|
|
1221
|
+
# TAI THAM THAM DIGIT THREE using an invisible stacker.
|
|
1222
|
+
|
|
1223
|
+
# [Not derivable]
|
|
1224
|
+
|
|
1225
|
+
0030..0039 ; Number # Nd [10] DIGIT ZERO..DIGIT NINE
|
|
1226
|
+
0966..096F ; Number # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
|
|
1227
|
+
09E6..09EF ; Number # Nd [10] BENGALI DIGIT ZERO..BENGALI DIGIT NINE
|
|
1228
|
+
0A66..0A6F ; Number # Nd [10] GURMUKHI DIGIT ZERO..GURMUKHI DIGIT NINE
|
|
1229
|
+
0AE6..0AEF ; Number # Nd [10] GUJARATI DIGIT ZERO..GUJARATI DIGIT NINE
|
|
1230
|
+
0B66..0B6F ; Number # Nd [10] ORIYA DIGIT ZERO..ORIYA DIGIT NINE
|
|
1231
|
+
0BE6..0BEF ; Number # Nd [10] TAMIL DIGIT ZERO..TAMIL DIGIT NINE
|
|
1232
|
+
0C66..0C6F ; Number # Nd [10] TELUGU DIGIT ZERO..TELUGU DIGIT NINE
|
|
1233
|
+
0CE6..0CEF ; Number # Nd [10] KANNADA DIGIT ZERO..KANNADA DIGIT NINE
|
|
1234
|
+
0D66..0D6F ; Number # Nd [10] MALAYALAM DIGIT ZERO..MALAYALAM DIGIT NINE
|
|
1235
|
+
0DE6..0DEF ; Number # Nd [10] SINHALA LITH DIGIT ZERO..SINHALA LITH DIGIT NINE
|
|
1236
|
+
0E50..0E59 ; Number # Nd [10] THAI DIGIT ZERO..THAI DIGIT NINE
|
|
1237
|
+
0ED0..0ED9 ; Number # Nd [10] LAO DIGIT ZERO..LAO DIGIT NINE
|
|
1238
|
+
0F20..0F29 ; Number # Nd [10] TIBETAN DIGIT ZERO..TIBETAN DIGIT NINE
|
|
1239
|
+
0F2A..0F33 ; Number # No [10] TIBETAN DIGIT HALF ONE..TIBETAN DIGIT HALF ZERO
|
|
1240
|
+
1040..1049 ; Number # Nd [10] MYANMAR DIGIT ZERO..MYANMAR DIGIT NINE
|
|
1241
|
+
1090..1099 ; Number # Nd [10] MYANMAR SHAN DIGIT ZERO..MYANMAR SHAN DIGIT NINE
|
|
1242
|
+
17E0..17E9 ; Number # Nd [10] KHMER DIGIT ZERO..KHMER DIGIT NINE
|
|
1243
|
+
1946..194F ; Number # Nd [10] LIMBU DIGIT ZERO..LIMBU DIGIT NINE
|
|
1244
|
+
19D0..19D9 ; Number # Nd [10] NEW TAI LUE DIGIT ZERO..NEW TAI LUE DIGIT NINE
|
|
1245
|
+
19DA ; Number # No NEW TAI LUE THAM DIGIT ONE
|
|
1246
|
+
1A80..1A89 ; Number # Nd [10] TAI THAM HORA DIGIT ZERO..TAI THAM HORA DIGIT NINE
|
|
1247
|
+
1A90..1A99 ; Number # Nd [10] TAI THAM THAM DIGIT ZERO..TAI THAM THAM DIGIT NINE
|
|
1248
|
+
1B50..1B59 ; Number # Nd [10] BALINESE DIGIT ZERO..BALINESE DIGIT NINE
|
|
1249
|
+
1BB0..1BB9 ; Number # Nd [10] SUNDANESE DIGIT ZERO..SUNDANESE DIGIT NINE
|
|
1250
|
+
1C40..1C49 ; Number # Nd [10] LEPCHA DIGIT ZERO..LEPCHA DIGIT NINE
|
|
1251
|
+
A8D0..A8D9 ; Number # Nd [10] SAURASHTRA DIGIT ZERO..SAURASHTRA DIGIT NINE
|
|
1252
|
+
A900..A909 ; Number # Nd [10] KAYAH LI DIGIT ZERO..KAYAH LI DIGIT NINE
|
|
1253
|
+
A9D0..A9D9 ; Number # Nd [10] JAVANESE DIGIT ZERO..JAVANESE DIGIT NINE
|
|
1254
|
+
A9F0..A9F9 ; Number # Nd [10] MYANMAR TAI LAING DIGIT ZERO..MYANMAR TAI LAING DIGIT NINE
|
|
1255
|
+
AA50..AA59 ; Number # Nd [10] CHAM DIGIT ZERO..CHAM DIGIT NINE
|
|
1256
|
+
ABF0..ABF9 ; Number # Nd [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NINE
|
|
1257
|
+
10A40..10A48 ; Number # No [9] KHAROSHTHI DIGIT ONE..KHAROSHTHI FRACTION ONE HALF
|
|
1258
|
+
11066..1106F ; Number # Nd [10] BRAHMI DIGIT ZERO..BRAHMI DIGIT NINE
|
|
1259
|
+
11136..1113F ; Number # Nd [10] CHAKMA DIGIT ZERO..CHAKMA DIGIT NINE
|
|
1260
|
+
111D0..111D9 ; Number # Nd [10] SHARADA DIGIT ZERO..SHARADA DIGIT NINE
|
|
1261
|
+
111E1..111F4 ; Number # No [20] SINHALA ARCHAIC DIGIT ONE..SINHALA ARCHAIC NUMBER ONE THOUSAND
|
|
1262
|
+
112F0..112F9 ; Number # Nd [10] KHUDAWADI DIGIT ZERO..KHUDAWADI DIGIT NINE
|
|
1263
|
+
11450..11459 ; Number # Nd [10] NEWA DIGIT ZERO..NEWA DIGIT NINE
|
|
1264
|
+
114D0..114D9 ; Number # Nd [10] TIRHUTA DIGIT ZERO..TIRHUTA DIGIT NINE
|
|
1265
|
+
11650..11659 ; Number # Nd [10] MODI DIGIT ZERO..MODI DIGIT NINE
|
|
1266
|
+
116C0..116C9 ; Number # Nd [10] TAKRI DIGIT ZERO..TAKRI DIGIT NINE
|
|
1267
|
+
11730..11739 ; Number # Nd [10] AHOM DIGIT ZERO..AHOM DIGIT NINE
|
|
1268
|
+
1173A..1173B ; Number # No [2] AHOM NUMBER TEN..AHOM NUMBER TWENTY
|
|
1269
|
+
11C50..11C59 ; Number # Nd [10] BHAIKSUKI DIGIT ZERO..BHAIKSUKI DIGIT NINE
|
|
1270
|
+
11C5A..11C6C ; Number # No [19] BHAIKSUKI NUMBER ONE..BHAIKSUKI HUNDREDS UNIT MARK
|
|
1271
|
+
11D50..11D59 ; Number # Nd [10] MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGIT NINE
|
|
1272
|
+
11DA0..11DA9 ; Number # Nd [10] GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT NINE
|
|
1273
|
+
|
|
1274
|
+
# ================================================
|
|
1275
|
+
|
|
1276
|
+
# Indic_Syllabic_Category=Brahmi_Joining_Number
|
|
1277
|
+
|
|
1278
|
+
# Brahmi Joining Number (similar to Number in that it can be used as
|
|
1279
|
+
# vowel-holders like Consonant_Placeholder, but may also be joined by
|
|
1280
|
+
# a Number_Joiner of the same script, e.g. in Brahmi)
|
|
1281
|
+
|
|
1282
|
+
# [Not derivable]
|
|
1283
|
+
|
|
1284
|
+
11052..11065 ; Brahmi_Joining_Number # No [20] BRAHMI NUMBER ONE..BRAHMI NUMBER ONE THOUSAND
|
|
1285
|
+
|
|
1286
|
+
# EOF
|
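The data lines in the file above share one shape: a code point or a `start..end` range, a semicolon, the Indic_Syllabic_Category value, and a trailing `#` comment carrying the General_Category, an optional `[count]`, and the character names. As a minimal sketch of how such lines could be read, the TypeScript below parses them into ranges and looks up a code point. The names `CategoryRange`, `parseCategoryLine`, and `lookupCategory` are hypothetical and are not taken from this package's `parser.ts`; the sample lines are copied from the file.

```typescript
// Hypothetical helper types and names for illustration only.
interface CategoryRange {
  start: number; // first code point in the range (inclusive)
  end: number; // last code point in the range (inclusive)
  category: string; // Indic_Syllabic_Category value, e.g. "Consonant_Dead"
}

// Matches "XXXX ; Value" or "XXXX..YYYY ; Value"; anything after the value
// (the "# ..." comment) is ignored.
const DATA_LINE = /^([0-9A-F]{4,6})(?:\.\.([0-9A-F]{4,6}))?\s*;\s*(\w+)/;

function parseCategoryLine(line: string): CategoryRange | null {
  const match = DATA_LINE.exec(line.trim());
  if (match === null) {
    return null; // blank line, comment-only line, or unrecognized shape
  }
  const [, first, last, category] = match;
  const start = parseInt(first, 16);
  // A single code point is treated as a range of length one.
  const end = last !== undefined ? parseInt(last, 16) : start;
  return { start, end, category };
}

function lookupCategory(
  ranges: CategoryRange[],
  codePoint: number,
): string | undefined {
  // Linear scan; a binary search over sorted ranges would suit larger tables.
  return ranges.find((r) => r.start <= codePoint && codePoint <= r.end)
    ?.category;
}

const sample: string[] = [
  "# Indic_Syllabic_Category=Consonant_Dead",
  "09CE ; Consonant_Dead # Lo BENGALI LETTER KHANDA TA",
  "0D7A..0D7F ; Consonant_Dead # Lo [6] MALAYALAM LETTER CHILLU NN..MALAYALAM LETTER CHILLU K",
  "200D ; Joiner # Cf ZERO WIDTH JOINER",
  "1A90..1A99 ; Number # Nd [10] TAI THAM THAM DIGIT ZERO..TAI THAM THAM DIGIT NINE",
];

const ranges: CategoryRange[] = sample
  .map(parseCategoryLine)
  .filter((r): r is CategoryRange => r !== null);
```

With these sample ranges, `lookupCategory(ranges, 0x1a93)` resolves U+1A93, the digit from the note in the Number section, through the `1A90..1A99` entry. Code points absent from the sample yield `undefined`; a fuller implementation would need to decide what default value to report for them.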