source: for-distributions/trunk/bin/windows/perl/lib/unicore/ArabicShaping.txt@ 14489

Last change on this file since 14489 was 14489, checked in by oranfry, 17 years ago

upgrading to perl 5.8

# ArabicShaping-4.1.0.txt
# Date: 2005-03-17, 15:21:00 PST [KW]
#
# This file is a normative contributory data file in the
# Unicode Character Database.
#
# Copyright (c) 1991-2005 Unicode, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# This file defines the shaping classes for Arabic and Syriac
# positional shaping, repeating in machine readable form the
# information printed in Tables 8-3, 8-7, 8-8, 8-11, 8-12, and
# 8-13 of The Unicode Standard, Version 4.0.
#
# See sections 8.2 and 8.3 of The Unicode Standard, Version 4.0
# for more information.
#
# Each line contains four fields, separated by a semicolon.
#
# Field 0: the code point, in 4-digit hexadecimal
#   form, of an Arabic or Syriac character.
# Field 1: gives a short schematic name for that character,
#   abbreviated from the normative Unicode character name.
# Field 2: defines the joining type (property name: Joining_Type)
#   R Right_Joining
#   L Left_Joining
#   D Dual_Joining
#   C Join_Causing
#   U Non_Joining
#   T Transparent
#   See the Arabic block description for more information on these types.
# Field 3: defines the joining group (property name: Joining_Group)
#
# The values of the joining group are based schematically on character
# names. Where a schematic character name consists of two or more parts separated
# by spaces, the formal Joining_Group property value, as specified in
# PropertyValueAliases.txt, consists of the same name parts joined by
# underscores. Hence, the entry:
#
#   0629; TEH MARBUTA; R; TEH MARBUTA
#
# corresponds to [Joining_Group = Teh_Marbuta].
#
# Note: For historical reasons, the property value [Joining_Group = Hamza_On_Heh_Goal]
# is anachronistically named. It used to apply to both of the following characters
# in earlier versions of the standard:
#
#   U+06C2 ARABIC LETTER HEH GOAL WITH HAMZA ABOVE
#   U+06C3 ARABIC LETTER TEH MARBUTA GOAL
#
# However, it currently applies only to U+06C3, and *not* to U+06C2.
# To avoid destabilizing existing Joining_Group property aliases, the
# value Hamza_On_Heh_Goal has not been changed, despite the fact that it
# no longer applies to Hamza On Heh Goal, but only to Teh Marbuta Goal.
#
# Note: Code points that are not explicitly listed in this file are
# either of joining type T or U:
#
# - Those not explicitly listed that are of General Category Mn, Me, or Cf
#   have joining type T.
# - All others not explicitly listed have type U.
#
# For an explicit listing of characters of joining type T, see
# the derived property file DerivedJoiningType.txt.
#
# There are currently no characters of type L defined in Unicode.
#
# #############################################################
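
# Illustrative sketch (not part of the Unicode data file): one possible way to
# read the four semicolon-separated fields described in the header, in Python.
# The names ShapingEntry, normalize_group, and parse_line are hypothetical.
# Title-casing each name part is an assumption inferred from the Teh_Marbuta
# example; PropertyValueAliases.txt remains the authority on the formal values.

```python
from typing import NamedTuple, Optional


class ShapingEntry(NamedTuple):
    code_point: int        # Field 0, parsed from hexadecimal
    schematic_name: str    # Field 1, kept verbatim
    joining_type: str      # Field 2: one of R, L, D, C, U, T
    joining_group: str     # Field 3, converted to underscore form


def normalize_group(name: str) -> str:
    """Join the space-separated name parts with underscores.

    Assumption: each part is title-cased, as in "TEH MARBUTA" -> "Teh_Marbuta".
    A value that already uses underscores (No_Joining_Group) round-trips.
    """
    parts = name.replace("_", " ").split()
    return "_".join(part.capitalize() for part in parts)


def parse_line(line: str) -> Optional[ShapingEntry]:
    """Parse one line; return None for blank lines and comment-only lines."""
    text = line.split("#", 1)[0].strip()
    if not text:
        return None
    fields = [field.strip() for field in text.split(";")]
    if len(fields) != 4:
        raise ValueError(f"expected 4 fields, got {len(fields)}: {line!r}")
    code, name, joining_type, group = fields
    return ShapingEntry(int(code, 16), name, joining_type, normalize_group(group))
```

# For instance, parse_line("06C3; TEH MARBUTA GOAL; R; HAMZA ON HEH GOAL") would
# carry the anachronistic group value Hamza_On_Heh_Goal discussed in the header.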

# Unicode; Schematic Name; Joining Type; Joining Group

# Arabic characters

0600; ARABIC NUMBER SIGN; U; No_Joining_Group
0601; ARABIC SIGN SANAH; U; No_Joining_Group
0602; ARABIC FOOTNOTE MARKER; U; No_Joining_Group
0603; ARABIC SIGN SAFHA; U; No_Joining_Group
060B; AFGHANI SIGN; U; No_Joining_Group
0621; HAMZA; U; No_Joining_Group
0622; MADDA ON ALEF; R; ALEF
0623; HAMZA ON ALEF; R; ALEF
0624; HAMZA ON WAW; R; WAW
0625; HAMZA UNDER ALEF; R; ALEF
0626; HAMZA ON YEH; D; YEH
0627; ALEF; R; ALEF
0628; BEH; D; BEH
0629; TEH MARBUTA; R; TEH MARBUTA
062A; TEH; D; BEH
062B; THEH; D; BEH
062C; JEEM; D; HAH
062D; HAH; D; HAH
062E; KHAH; D; HAH
062F; DAL; R; DAL
0630; THAL; R; DAL
0631; REH; R; REH
0632; ZAIN; R; REH
0633; SEEN; D; SEEN
0634; SHEEN; D; SEEN
0635; SAD; D; SAD
0636; DAD; D; SAD
0637; TAH; D; TAH
0638; ZAH; D; TAH
0639; AIN; D; AIN
063A; GHAIN; D; AIN
0640; TATWEEL; C; No_Joining_Group
0641; FEH; D; FEH
0642; QAF; D; QAF
0643; KAF; D; KAF
0644; LAM; D; LAM
0645; MEEM; D; MEEM
0646; NOON; D; NOON
0647; HEH; D; HEH
0648; WAW; R; WAW
0649; ALEF MAKSURA; D; YEH
064A; YEH; D; YEH
066E; DOTLESS BEH; D; BEH
066F; DOTLESS QAF; D; QAF
0671; HAMZAT WASL ON ALEF; R; ALEF
0672; WAVY HAMZA ON ALEF; R; ALEF
0673; WAVY HAMZA UNDER ALEF; R; ALEF
0674; HIGH HAMZA; U; No_Joining_Group
0675; HIGH HAMZA ALEF; R; ALEF
0676; HIGH HAMZA WAW; R; WAW
0677; HIGH HAMZA WAW WITH DAMMA; R; WAW
0678; HIGH HAMZA YEH; D; YEH
0679; TEH WITH SMALL TAH; D; BEH
067A; TEH WITH 2 DOTS VERTICAL ABOVE; D; BEH
067B; BEH WITH 2 DOTS VERTICAL BELOW; D; BEH
067C; TEH WITH RING; D; BEH
067D; TEH WITH 3 DOTS ABOVE DOWNWARD; D; BEH
067E; TEH WITH 3 DOTS BELOW; D; BEH
067F; TEH WITH 4 DOTS ABOVE; D; BEH
0680; BEH WITH 4 DOTS BELOW; D; BEH
0681; HAMZA ON HAH; D; HAH
0682; HAH WITH 2 DOTS VERTICAL ABOVE; D; HAH
0683; HAH WITH MIDDLE 2 DOTS; D; HAH
0684; HAH WITH MIDDLE 2 DOTS VERTICAL; D; HAH
0685; HAH WITH 3 DOTS ABOVE; D; HAH
0686; HAH WITH MIDDLE 3 DOTS DOWNWARD; D; HAH
0687; HAH WITH MIDDLE 4 DOTS; D; HAH
0688; DAL WITH SMALL TAH; R; DAL
0689; DAL WITH RING; R; DAL
068A; DAL WITH DOT BELOW; R; DAL
068B; DAL WITH DOT BELOW AND SMALL TAH; R; DAL
068C; DAL WITH 2 DOTS ABOVE; R; DAL
068D; DAL WITH 2 DOTS BELOW; R; DAL
068E; DAL WITH 3 DOTS ABOVE; R; DAL
068F; DAL WITH 3 DOTS ABOVE DOWNWARD; R; DAL
0690; DAL WITH 4 DOTS ABOVE; R; DAL
0691; REH WITH SMALL TAH; R; REH
0692; REH WITH SMALL V; R; REH
0693; REH WITH RING; R; REH
0694; REH WITH DOT BELOW; R; REH
0695; REH WITH SMALL V BELOW; R; REH
0696; REH WITH DOT BELOW AND DOT ABOVE; R; REH
0697; REH WITH 2 DOTS ABOVE; R; REH
0698; REH WITH 3 DOTS ABOVE; R; REH
0699; REH WITH 4 DOTS ABOVE; R; REH
069A; SEEN WITH DOT BELOW AND DOT ABOVE; D; SEEN
069B; SEEN WITH 3 DOTS BELOW; D; SEEN
069C; SEEN WITH 3 DOTS BELOW AND 3 DOTS ABOVE; D; SEEN
069D; SAD WITH 2 DOTS BELOW; D; SAD
069E; SAD WITH 3 DOTS ABOVE; D; SAD
069F; TAH WITH 3 DOTS ABOVE; D; TAH
06A0; AIN WITH 3 DOTS ABOVE; D; AIN
06A1; DOTLESS FEH; D; FEH
06A2; FEH WITH DOT MOVED BELOW; D; FEH
06A3; FEH WITH DOT BELOW; D; FEH
06A4; FEH WITH 3 DOTS ABOVE; D; FEH
06A5; FEH WITH 3 DOTS BELOW; D; FEH
06A6; FEH WITH 4 DOTS ABOVE; D; FEH
06A7; QAF WITH DOT ABOVE; D; QAF
06A8; QAF WITH 3 DOTS ABOVE; D; QAF
06A9; KEHEH; D; GAF
06AA; SWASH KAF; D; SWASH KAF
06AB; KAF WITH RING; D; GAF
06AC; KAF WITH DOT ABOVE; D; KAF
06AD; KAF WITH 3 DOTS ABOVE; D; KAF
06AE; KAF WITH 3 DOTS BELOW; D; KAF
06AF; GAF; D; GAF
06B0; GAF WITH RING; D; GAF
06B1; GAF WITH 2 DOTS ABOVE; D; GAF
06B2; GAF WITH 2 DOTS BELOW; D; GAF
06B3; GAF WITH 2 DOTS VERTICAL BELOW; D; GAF
06B4; GAF WITH 3 DOTS ABOVE; D; GAF
06B5; LAM WITH SMALL V; D; LAM
06B6; LAM WITH DOT ABOVE; D; LAM
06B7; LAM WITH 3 DOTS ABOVE; D; LAM
06B8; LAM WITH 3 DOTS BELOW; D; LAM
06B9; NOON WITH DOT BELOW; D; NOON
06BA; DOTLESS NOON; D; NOON
06BB; DOTLESS NOON WITH SMALL TAH; D; NOON
06BC; NOON WITH RING; D; NOON
06BD; NOON WITH 3 DOTS ABOVE; D; NOON
06BE; KNOTTED HEH; D; KNOTTED HEH
06BF; HAH WITH MIDDLE 3 DOTS DOWNWARD AND DOT ABOVE; D; HAH
06C0; HAMZA ON HEH; R; TEH MARBUTA
06C1; HEH GOAL; D; HEH GOAL
06C2; HAMZA ON HEH GOAL; D; HEH GOAL
06C3; TEH MARBUTA GOAL; R; HAMZA ON HEH GOAL
06C4; WAW WITH RING; R; WAW
06C5; WAW WITH BAR; R; WAW
06C6; WAW WITH SMALL V; R; WAW
06C7; WAW WITH DAMMA; R; WAW
06C8; WAW WITH ALEF ABOVE; R; WAW
06C9; WAW WITH INVERTED SMALL V; R; WAW
06CA; WAW WITH 2 DOTS ABOVE; R; WAW
06CB; WAW WITH 3 DOTS ABOVE; R; WAW
06CC; DOTLESS YEH; D; YEH
06CD; YEH WITH TAIL; R; YEH WITH TAIL
06CE; YEH WITH SMALL V; D; YEH
06CF; WAW WITH DOT ABOVE; R; WAW
06D0; YEH WITH 2 DOTS VERTICAL BELOW; D; YEH
06D1; YEH WITH 3 DOTS BELOW; D; YEH
06D2; YEH BARREE; R; YEH BARREE
06D3; HAMZA ON YEH BARREE; R; YEH BARREE
06D5; AE; R; TEH MARBUTA
06DD; ARABIC END OF AYAH; U; No_Joining_Group
06EE; DAL WITH INVERTED V; R; DAL
06EF; REH WITH INVERTED V; R; REH
06FA; SEEN WITH DOT BELOW AND 3 DOTS ABOVE; D; SEEN
06FB; DAD WITH DOT BELOW; D; SAD
06FC; GHAIN WITH DOT BELOW; D; AIN
06FF; HEH WITH INVERTED V; D; KNOTTED HEH

# Syriac characters

0710; ALAPH; R; ALAPH
0712; BETH; D; BETH
0713; GAMAL; D; GAMAL
0714; GAMAL GARSHUNI; D; GAMAL
0715; DALATH; R; DALATH RISH
0716; DOTLESS DALATH RISH; R; DALATH RISH
0717; HE; R; HE
0718; WAW; R; SYRIAC WAW
0719; ZAIN; R; ZAIN
071A; HETH; D; HETH
071B; TETH; D; TETH
071C; TETH GARSHUNI; D; TETH
071D; YUDH; D; YUDH
071E; YUDH HE; R; YUDH HE
071F; KAPH; D; KAPH
0720; LAMADH; D; LAMADH
0721; MIM; D; MIM
0722; NUN; D; NUN
0723; SEMKATH; D; SEMKATH
0724; FINAL SEMKATH; D; FINAL SEMKATH
0725; E; D; E
0726; PE; D; PE
0727; REVERSED PE; D; REVERSED PE
0728; SADHE; R; SADHE
0729; QAPH; D; QAPH
072A; RISH; R; DALATH RISH
072B; SHIN; D; SHIN
072C; TAW; R; TAW
072D; PERSIAN BHETH; D; BETH
072E; PERSIAN GHAMAL; D; GAMAL
072F; PERSIAN DHALATH; R; DALATH RISH
074D; SOGDIAN ZHAIN; R; ZHAIN
074E; SOGDIAN KHAPH; D; KHAPH
074F; SOGDIAN FE; D; FE

# Arabic supplement characters

0750; BEH WITH 3 DOTS HORIZONTALLY BELOW; D; BEH
0751; BEH WITH DOT BELOW AND 3 DOTS ABOVE; D; BEH
0752; BEH WITH 3 DOTS POINTING UPWARDS BELOW; D; BEH
0753; BEH WITH 3 DOTS POINTING UPWARDS BELOW AND 2 DOTS ABOVE; D; BEH
0754; BEH WITH 2 DOTS BELOW AND DOT ABOVE; D; BEH
0755; BEH WITH INVERTED SMALL V BELOW; D; BEH
0756; BEH WITH SMALL V; D; BEH
0757; HAH WITH 2 DOTS ABOVE; D; HAH
0758; HAH WITH 3 DOTS POINTING UPWARDS BELOW; D; HAH
0759; DAL WITH 2 DOTS VERTICALLY BELOW AND SMALL TAH; R; DAL
075A; DAL WITH INVERTED SMALL V BELOW; R; DAL
075B; REH WITH STROKE; R; REH
075C; SEEN WITH 4 DOTS ABOVE; D; SEEN
075D; AIN WITH 2 DOTS ABOVE; D; AIN
075E; AIN WITH 3 DOTS POINTING DOWNWARDS ABOVE; D; AIN
075F; AIN WITH 2 DOTS VERTICALLY ABOVE; D; AIN
0760; FEH WITH 2 DOTS BELOW; D; FEH
0761; FEH WITH 3 DOTS POINTING UPWARDS BELOW; D; FEH
0762; KEHEH WITH DOT ABOVE; D; GAF
0763; KEHEH WITH 3 DOTS ABOVE; D; GAF
0764; KEHEH WITH 3 DOTS POINTING UPWARDS BELOW; D; GAF
0765; MEEM WITH DOT ABOVE; D; MEEM
0766; MEEM WITH DOT BELOW; D; MEEM
0767; NOON WITH 2 DOTS BELOW; D; NOON
0768; NOON WITH SMALL TAH; D; NOON
0769; NOON WITH SMALL V; D; NOON
076A; LAM WITH BAR; D; LAM
076B; REH WITH 2 DOTS VERTICALLY ABOVE; R; REH
076C; REH WITH HAMZA ABOVE; R; REH
076D; SEEN WITH 2 DOTS VERTICALLY ABOVE; D; SEEN

# Other

200D; ZERO WIDTH JOINER; C; No_Joining_Group
200C; ZERO WIDTH NON-JOINER; U; No_Joining_Group
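
# Illustrative sketch (not part of the Unicode data file): the header note on
# code points that are not explicitly listed could be applied as below. LISTED
# is a tiny hand-copied excerpt of the table above and joining_type is a
# hypothetical name. Python's unicodedata module reflects whichever Unicode
# version ships with the interpreter, not 4.1.0, so General Category values
# may differ from this file's era for some code points.

```python
import unicodedata

# A few explicitly listed entries, copied from the table above for illustration.
LISTED = {
    0x0627: "R",  # ALEF
    0x0628: "D",  # BEH
    0x0640: "C",  # TATWEEL
    0x200D: "C",  # ZERO WIDTH JOINER
    0x200C: "U",  # ZERO WIDTH NON-JOINER
}


def joining_type(code_point: int, listed: dict = LISTED) -> str:
    """Return the joining type, applying the default rule for unlisted code points."""
    # Explicit entries always win, even for format characters such as U+200C.
    if code_point in listed:
        return listed[code_point]
    # Unlisted nonspacing marks, enclosing marks, and format characters are T.
    if unicodedata.category(chr(code_point)) in ("Mn", "Me", "Cf"):
        return "T"
    # Everything else that is unlisted is U.
    return "U"
```

# With this excerpt, U+064B ARABIC FATHATAN (General Category Mn, not listed)
# falls through to T, while an unlisted letter such as U+0041 falls through to U.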