source: for-distributions/trunk/bin/windows/perl/lib/Pod/perlcn.pod@ 14489

Last change on this file since 14489 was 14489, checked in by oranfry, 17 years ago

upgrading to perl 5.8

File size: 4.0 KB
Line 
1If you read this file _as_is_, just ignore the funny characters you
2see. It is written in the POD format (see perlpod manpage) which is
3specially designed to be readable as is.
4
5The following documentation is written in EUC-CN encoding.
6
7Èç¹ûÄãÓÃÒ»°ãµÄÎÄ×Ö±àŒ­Æ÷ÔÄÀÀÕâ·ÝÎÄŒþ, ÇëºöÂÔÎÄÖÐÆæÌصÄ×¢ŒÇ×Ö·û.
8Õâ·ÝÎÄŒþÊÇÒÔ POD (ŒòÃ÷ÎÄŒþžñÊœ) ÐŽ³É; ÕâÖÖžñÊœÊÇΪÁËÄÜÈÃÈËÖ±œÓÔĶÁ,
9¶øÌرðÉèŒÆµÄ. ¹ØÓÚŽËžñÊœµÄœøÒ»²œÐÅÏ¢, Çë²Î¿Œ perlpod ÏßÉÏÎÄŒþ.
10
11=head1 NAME
12
13perlcn - ŒòÌåÖÐÎÄ Perl ÖžÄÏ
14
15=head1 DESCRIPTION
16
17»¶Ó­ÀŽµœ Perl µÄÌìµØ!
18
19ŽÓ 5.8.0 °æ¿ªÊŒ, Perl Ÿß±žÁËÍêÉÆµÄ Unicode (ͳһÂë) Ö§Ô®,
20Ò²Á¬ŽøÖ§Ô®ÁËÐí¶àÀ­¶¡ÓïϵÒÔÍâµÄ±àÂ뷜ʜ; CJK (ÖÐÈÕº«) ±ãÊÇÆäÖеÄÒ»²¿·Ý.
21Unicode ÊǹúŒÊÐԵıê׌, ÊÔÍŒº­žÇÊÀœçÉÏËùÓеÄ×Ö·û: Î÷·œÊÀœç, ¶«·œÊÀœç,
22ÒÔŒ°ÁœÕߌäµÄÒ»ÇÐ (Ï£À°ÎÄ, ÐðÀûÑÇÎÄ, ÑÇÀ­²®ÎÄ, Ï£²®ÀŽÎÄ, Ó¡¶ÈÎÄ,
23Ó¡µØ°²ÎÄ, µÈµÈ). ËüÒ²ÈÝÄÉÁ˶àÖÖ×÷ҵϵͳÓëƜ̚ (Èç PC Œ°ÂóœðËþ).
24
25Perl ±ŸÉíÒÔ Unicode œøÐвÙ×÷. Õâ±íÊŸ Perl ÄÚ²¿µÄ×Ö·ûŽ®ÊýŸÝ¿ÉÓà Unicode
26±íÊŸ; Perl µÄº¯ÊœÓëËã·û (ÀýÈçÕý¹æ±íÊŸÊœ±È¶Ô) Ò²ÄÜ¶Ô Unicode œøÐвÙ×÷.
27ÔÚÊäÈ댰Êä³öʱ, ΪÁËŽŠÀíÒÔ Unicode ֮ǰµÄ±àÂ뷜ʜŽæ·ÅµÄÊýŸÝ, Perl
28ÌṩÁË Encode ÕâžöÄ£¿é, ¿ÉÒÔÈÃÄãÇáÒ׵ضÁÈ¡Œ°ÐŽÈëŸÉÓеıàÂëÊýŸÝ.
29
30Encode ÑÓÉìÄ£¿éÖ§Ô®ÏÂÁÐŒòÌåÖÐÎĵıàÂ뷜ʜ ('gb2312' ±íÊŸ 'euc-cn'):
31
32 euc-cn Unix ÑÓÉì×Ö·ûŒ¯, Ò²ŸÍÊÇË׳ƵĹú±êÂë
33 gb2312-raw ÎŽŸ­ŽŠÀíµÄ (µÍ±ÈÌØ) GB2312 ×Ö·û±í
34 gb12345 ÎŽŸ­ŽŠÀíµÄÖйúÓ÷±ÌåÖÐÎıàÂë
35 iso-ir-165 GB2312 + GB6345 + GB8565 + ÐÂÔö×Ö·û
36 cp936 ×ÖÂëÒ³ 936, Ò²¿ÉÒÔÓà 'GBK' (À©³ä¹ú±êÂë) ÖžÃ÷
37 hz 7 ±ÈÌØÒݳöÊœ GB2312 ±àÂë
38
39ŸÙÀýÀŽËµ, œ« EUC-CN ±àÂëµÄµµ°ž×ª³É Unicode, ìóÐèŒüÈëÏÂÁÐÖžÁî:
40
41 perl -Mencoding=euc-cn,STDOUT,utf8 -pe1 < file.euc-cn > file.utf8
42
43Perl Ò²ÄÚžœÁË "piconv", Ò»Ö§ÍêÈ«ÒÔ Perl ÐŽ³ÉµÄ×Ö·ûת»»¹€Ÿß³ÌÐò, Ó÷šÈçÏÂ:
44
45 piconv -f euc-cn -t utf8 < file.euc-cn > file.utf8
46 piconv -f utf8 -t euc-cn < file.utf8 > file.euc-cn
47
48ÁíÍâ, ÀûÓà encoding Ä£¿é, Äã¿ÉÒÔÇáÒ×ÐŽ³öÒÔ×Ö·ûΪµ¥Î»µÄ³ÌÐòÂë, ÈçÏÂËùÊŸ:
49
50 #!/usr/bin/env perl
51 # Æô¶¯ euc-cn ×ÖŽ®œâÎö; ±ê׌Êä³öÈ댰±ê׌ŽíÎó¶ŒÉèΪ euc-cn ±àÂë
52 use encoding 'euc-cn', STDIN => 'euc-cn', STDOUT => 'euc-cn';
53 print length("ÂæÍÕ"); # 2 (Ë«ÒýºÅ±íÊŸ×Ö·û)
54 print length('ÂæÍÕ'); # 4 (µ¥ÒýºÅ±íÊŸ×ÖœÚ)
55 print index("×»×»œÌ»å", "»×»œ"); # -1 (²»°üº¬ŽË×Ó×Ö·ûŽ®)
56 print index('×»×»œÌ»å', '»×»œ'); # 1 (ŽÓµÚ¶þžö×֜ڿªÊŒ)
57
58ÔÚ×îºóÒ»ÁÐÀý×ÓÀï, "×»" µÄµÚ¶þžö×ÖœÚÓë "×»" µÄµÚÒ»žö×֜ڜáºÏ³É EUC-CN
59ÂëµÄ "»×"; "×»" µÄµÚ¶þžö×ÖœÚÔòÓë "œÌ" µÄµÚÒ»žö×֜ڜáºÏ³É "»œ".
60ÕâœâŸöÁËÒÔÇ° EUC-CN Âë±È¶ÔŽŠÀíÉϳ£ŒûµÄÎÊÌâ.
61
62=head2 ¶îÍâµÄÖÐÎıàÂë
63
64Èç¹ûÐèÒªžü¶àµÄÖÐÎıàÂë, ¿ÉÒÔŽÓ CPAN (L<http://www.cpan.org/>) ÏÂÔØ
65Encode::HanExtra Ä£¿é. ËüÄ¿Ç°ÌṩÏÂÁбàÂ뷜ʜ:
66
67 gb18030 À©³ä¹ýµÄ¹ú±êÂë, °üº¬·±ÌåÖÐÎÄ
68
69ÁíÍâ, Encode::HanConvert Ä£¿éÔòÌṩÁËŒò·±×ª»»ÓõÄÁœÖÖ±àÂë:
70
71 big5-simp Big5 ·±ÌåÖÐÎÄÓë Unicode ŒòÌåÖÐÎÄ»¥×ª
72 gbk-trad GBK ŒòÌåÖÐÎÄÓë Unicode ·±ÌåÖÐÎÄ»¥×ª
73
74ÈôÏëÔÚ GBK Óë Big5 Ö®Œä»¥×ª, Çë²Î¿ŒžÃÄ£¿éÄÚžœµÄ b2g.pl Óë g2b.pl ÁœÖ§³ÌÐò,
75»òÔÚ³ÌÐòÄÚʹÓÃÏÂÁÐÐŽ·š:
76
77 use Encode::HanConvert;
78 $euc_cn = big5_to_gb($big5); # ŽÓ Big5 תΪ GBK
79 $big5 = gb_to_big5($euc_cn); # ŽÓ GBK תΪ Big5
80
81=head2 œøÒ»²œµÄÐÅÏ¢
82
83Çë²Î¿Œ Perl ÄÚžœµÄŽóÁ¿ËµÃ÷ÎÄŒþ (²»ÐÒÈ«ÊÇÓÃÓ¢ÎÄÐŽµÄ), ÀŽÑ§Ï°žü¶à¹ØÓÚ
84Perl µÄ֪ʶ, ÒÔŒ° Unicode µÄʹÓ÷œÊœ. ²»¹ý, ÍⲿµÄ×ÊÔŽÏ൱·áž»:
85
86=head2 Ìṩ Perl ×ÊÔŽµÄÍøÖ·
87
88=over 4
89
90=item L<http://www.perl.com/>
91
92Perl µÄÊ×Ò³ (ÓÉÅ·À³Àñ¹«ËŸÎ¬»€)
93
94=item L<http://www.cpan.org/>
95
96Perl ×ۺϵä²ØÍø (Comprehensive Perl Archive Network)
97
98=item L<http://lists.perl.org/>
99
100Perl ÓʵÝÂÛ̳һÀÀ
101
102=back
103
104=head2 ѧϰ Perl µÄÍøÖ·
105
106=over 4
107
108=item L<http://www.oreilly.com.cn/html/perl.html>
109
110ŒòÌåÖÐÎÄ°æµÄÅ·À³Àñ Perl Êéœå
111
112=back
113
114=head2 Perl ʹÓÃÕߌ¯»á
115
116=over 4
117
118=item L<http://www.pm.org/groups/asia.shtml#China>
119
120Öйú Perl Íƹã×éÒ»ÀÀ
121
122=back
123
124=head2 Unicode Ïà¹ØÍøÖ·
125
126=over 4
127
128=item L<http://www.unicode.org/>
129
130Unicode ѧÊõѧ»á (Unicode ±ê׌µÄÖƶšÕß)
131
132=item L<http://www.cl.cam.ac.uk/%7Emgk25/unicode.html>
133
134Unix/Linux É쵀 UTF-8 Œ° Unicode Žð¿ÍÎÊ
135
136=back
137
138=head1 SEE ALSO
139
140L<Encode>, L<Encode::CN>, L<encoding>, L<perluniintro>, L<perlunicode>
141
142=head1 AUTHORS
143
144Jarkko Hietaniemi E<lt>[email protected]<gt>
145
146Autrijus Tang (ÌÆ×Úºº) E<lt>[email protected]<gt>
147
148=cut
Note: See TracBrowser for help on using the repository browser.