source: for-distributions/trunk/bin/windows/perl/lib/unicore/CompositionExclusions.txt@ 14489

Last change on this file since 14489 was 14489, checked in by oranfry, 17 years ago

upgrading to perl 5.8

File size: 7.3 KB
Line 
1# CompositionExclusions-4.1.0.txt
2# Date: 2005-03-17, 15:21:00 PST [KW]
3#
4# This file lists the characters from the UAX #15 Composition Exclusion Table.
5#
6# This file is a normative contributory data file in the
7# Unicode Character Database.
8#
9# Copyright (c) 1991-2005 Unicode, Inc.
10# For terms of use, see http://www.unicode.org/terms_of_use.html
11#
12# For more information, see
13# http://www.unicode.org/unicode/reports/tr15/#Primary Exclusion List Table
14# ================================================
15
16# (1) Script Specifics
17# This list of characters cannot be derived from the UnicodeData file.
18# ================================================
19
200958 # DEVANAGARI LETTER QA
210959 # DEVANAGARI LETTER KHHA
22095A # DEVANAGARI LETTER GHHA
23095B # DEVANAGARI LETTER ZA
24095C # DEVANAGARI LETTER DDDHA
25095D # DEVANAGARI LETTER RHA
26095E # DEVANAGARI LETTER FA
27095F # DEVANAGARI LETTER YYA
2809DC # BENGALI LETTER RRA
2909DD # BENGALI LETTER RHA
3009DF # BENGALI LETTER YYA
310A33 # GURMUKHI LETTER LLA
320A36 # GURMUKHI LETTER SHA
330A59 # GURMUKHI LETTER KHHA
340A5A # GURMUKHI LETTER GHHA
350A5B # GURMUKHI LETTER ZA
360A5E # GURMUKHI LETTER FA
370B5C # ORIYA LETTER RRA
380B5D # ORIYA LETTER RHA
390F43 # TIBETAN LETTER GHA
400F4D # TIBETAN LETTER DDHA
410F52 # TIBETAN LETTER DHA
420F57 # TIBETAN LETTER BHA
430F5C # TIBETAN LETTER DZHA
440F69 # TIBETAN LETTER KSSA
450F76 # TIBETAN VOWEL SIGN VOCALIC R
460F78 # TIBETAN VOWEL SIGN VOCALIC L
470F93 # TIBETAN SUBJOINED LETTER GHA
480F9D # TIBETAN SUBJOINED LETTER DDHA
490FA2 # TIBETAN SUBJOINED LETTER DHA
500FA7 # TIBETAN SUBJOINED LETTER BHA
510FAC # TIBETAN SUBJOINED LETTER DZHA
520FB9 # TIBETAN SUBJOINED LETTER KSSA
53FB1D # HEBREW LETTER YOD WITH HIRIQ
54FB1F # HEBREW LIGATURE YIDDISH YOD YOD PATAH
55FB2A # HEBREW LETTER SHIN WITH SHIN DOT
56FB2B # HEBREW LETTER SHIN WITH SIN DOT
57FB2C # HEBREW LETTER SHIN WITH DAGESH AND SHIN DOT
58FB2D # HEBREW LETTER SHIN WITH DAGESH AND SIN DOT
59FB2E # HEBREW LETTER ALEF WITH PATAH
60FB2F # HEBREW LETTER ALEF WITH QAMATS
61FB30 # HEBREW LETTER ALEF WITH MAPIQ
62FB31 # HEBREW LETTER BET WITH DAGESH
63FB32 # HEBREW LETTER GIMEL WITH DAGESH
64FB33 # HEBREW LETTER DALET WITH DAGESH
65FB34 # HEBREW LETTER HE WITH MAPIQ
66FB35 # HEBREW LETTER VAV WITH DAGESH
67FB36 # HEBREW LETTER ZAYIN WITH DAGESH
68FB38 # HEBREW LETTER TET WITH DAGESH
69FB39 # HEBREW LETTER YOD WITH DAGESH
70FB3A # HEBREW LETTER FINAL KAF WITH DAGESH
71FB3B # HEBREW LETTER KAF WITH DAGESH
72FB3C # HEBREW LETTER LAMED WITH DAGESH
73FB3E # HEBREW LETTER MEM WITH DAGESH
74FB40 # HEBREW LETTER NUN WITH DAGESH
75FB41 # HEBREW LETTER SAMEKH WITH DAGESH
76FB43 # HEBREW LETTER FINAL PE WITH DAGESH
77FB44 # HEBREW LETTER PE WITH DAGESH
78FB46 # HEBREW LETTER TSADI WITH DAGESH
79FB47 # HEBREW LETTER QOF WITH DAGESH
80FB48 # HEBREW LETTER RESH WITH DAGESH
81FB49 # HEBREW LETTER SHIN WITH DAGESH
82FB4A # HEBREW LETTER TAV WITH DAGESH
83FB4B # HEBREW LETTER VAV WITH HOLAM
84FB4C # HEBREW LETTER BET WITH RAFE
85FB4D # HEBREW LETTER KAF WITH RAFE
86FB4E # HEBREW LETTER PE WITH RAFE
87
88# Total code points: 67
89
90# ================================================
91# (2) Post Composition Version precomposed characters
92# These characters cannot be derived solely from the UnicodeData.txt file
93# in this version of Unicode.
94# ================================================
95
962ADC # FORKING
971D15E # MUSICAL SYMBOL HALF NOTE
981D15F # MUSICAL SYMBOL QUARTER NOTE
991D160 # MUSICAL SYMBOL EIGHTH NOTE
1001D161 # MUSICAL SYMBOL SIXTEENTH NOTE
1011D162 # MUSICAL SYMBOL THIRTY-SECOND NOTE
1021D163 # MUSICAL SYMBOL SIXTY-FOURTH NOTE
1031D164 # MUSICAL SYMBOL ONE HUNDRED TWENTY-EIGHTH NOTE
1041D1BB # MUSICAL SYMBOL MINIMA
1051D1BC # MUSICAL SYMBOL MINIMA BLACK
1061D1BD # MUSICAL SYMBOL SEMIMINIMA WHITE
1071D1BE # MUSICAL SYMBOL SEMIMINIMA BLACK
1081D1BF # MUSICAL SYMBOL FUSA WHITE
1091D1C0 # MUSICAL SYMBOL FUSA BLACK
110
111# Total code points: 14
112
113# ================================================
114# (3) Singleton Decompositions
115# These characters can be derived from the UnicodeData file
116# by including all characters whose canonical decomposition
117# consists of a single character.
118# These characters are simply quoted here for reference.
119# ================================================
120
121# 0340..0341 [2] COMBINING GRAVE TONE MARK..COMBINING ACUTE TONE MARK
122# 0343 COMBINING GREEK KORONIS
123# 0374 GREEK NUMERAL SIGN
124# 037E GREEK QUESTION MARK
125# 0387 GREEK ANO TELEIA
126# 1F71 GREEK SMALL LETTER ALPHA WITH OXIA
127# 1F73 GREEK SMALL LETTER EPSILON WITH OXIA
128# 1F75 GREEK SMALL LETTER ETA WITH OXIA
129# 1F77 GREEK SMALL LETTER IOTA WITH OXIA
130# 1F79 GREEK SMALL LETTER OMICRON WITH OXIA
131# 1F7B GREEK SMALL LETTER UPSILON WITH OXIA
132# 1F7D GREEK SMALL LETTER OMEGA WITH OXIA
133# 1FBB GREEK CAPITAL LETTER ALPHA WITH OXIA
134# 1FBE GREEK PROSGEGRAMMENI
135# 1FC9 GREEK CAPITAL LETTER EPSILON WITH OXIA
136# 1FCB GREEK CAPITAL LETTER ETA WITH OXIA
137# 1FD3 GREEK SMALL LETTER IOTA WITH DIALYTIKA AND OXIA
138# 1FDB GREEK CAPITAL LETTER IOTA WITH OXIA
139# 1FE3 GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND OXIA
140# 1FEB GREEK CAPITAL LETTER UPSILON WITH OXIA
141# 1FEE..1FEF [2] GREEK DIALYTIKA AND OXIA..GREEK VARIA
142# 1FF9 GREEK CAPITAL LETTER OMICRON WITH OXIA
143# 1FFB GREEK CAPITAL LETTER OMEGA WITH OXIA
144# 1FFD GREEK OXIA
145# 2000..2001 [2] EN QUAD..EM QUAD
146# 2126 OHM SIGN
147# 212A..212B [2] KELVIN SIGN..ANGSTROM SIGN
148# 2329 LEFT-POINTING ANGLE BRACKET
149# 232A RIGHT-POINTING ANGLE BRACKET
150# F900..FA0D [270] CJK COMPATIBILITY IDEOGRAPH-F900..CJK COMPATIBILITY IDEOGRAPH-FA0D
151# FA10 CJK COMPATIBILITY IDEOGRAPH-FA10
152# FA12 CJK COMPATIBILITY IDEOGRAPH-FA12
153# FA15..FA1E [10] CJK COMPATIBILITY IDEOGRAPH-FA15..CJK COMPATIBILITY IDEOGRAPH-FA1E
154# FA20 CJK COMPATIBILITY IDEOGRAPH-FA20
155# FA22 CJK COMPATIBILITY IDEOGRAPH-FA22
156# FA25..FA26 [2] CJK COMPATIBILITY IDEOGRAPH-FA25..CJK COMPATIBILITY IDEOGRAPH-FA26
157# FA2A..FA2D [4] CJK COMPATIBILITY IDEOGRAPH-FA2A..CJK COMPATIBILITY IDEOGRAPH-FA2D
158# FA30..FA6A [59] CJK COMPATIBILITY IDEOGRAPH-FA30..CJK COMPATIBILITY IDEOGRAPH-FA6A
159# FA70..FAD9 [106] CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COMPATIBILITY IDEOGRAPH-FAD9
160# 2F800..2FA1D [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
161
162# Total code points: 924
163
164# ================================================
165# (4) Non-Starter Decompositions
166# These characters can be derived from the UnicodeData file
167# by including all characters whose canonical decomposition consists
168# of a sequence of characters, the first of which has a non-zero
169# combining class.
170# These characters are simply quoted here for reference.
171# ================================================
172
173# 0344 COMBINING GREEK DIALYTIKA TONOS
174# 0F73 TIBETAN VOWEL SIGN II
175# 0F75 TIBETAN VOWEL SIGN UU
176# 0F81 TIBETAN VOWEL SIGN REVERSED II
177
178# Total code points: 4
179
Note: See TracBrowser for help on using the repository browser.