source: for-distributions/trunk/bin/windows/perl/lib/Encode/CN.pm@ 14489

Last change on this file since 14489 was 14489, checked in by oranfry, 17 years ago

upgrading to perl 5.8

File size: 2.0 KB
Line 
1package Encode::CN;
2BEGIN {
3 if (ord("A") == 193) {
4 die "Encode::CN not supported on EBCDIC\n";
5 }
6}
7our $VERSION = do { my @r = (q$Revision: 2.0 $ =~ /\d+/g); sprintf "%d."."%02d" x $#r, @r };
8
9use Encode;
10use XSLoader;
11XSLoader::load(__PACKAGE__,$VERSION);
12
13# Relocated from Encode.pm
14
15use Encode::CN::HZ;
16# use Encode::CN::2022_CN;
17
181;
19__END__
20
21=head1 NAME
22
23Encode::CN - China-based Chinese Encodings
24
25=head1 SYNOPSIS
26
27 use Encode qw/encode decode/;
28 $euc_cn = encode("euc-cn", $utf8); # loads Encode::CN implicitly
29 $utf8 = decode("euc-cn", $euc_cn); # ditto
30
31=head1 DESCRIPTION
32
33This module implements China-based Chinese charset encodings.
34Encodings supported are as follows.
35
36 Canonical Alias Description
37 --------------------------------------------------------------------
38 euc-cn /\beuc.*cn$/i EUC (Extended Unix Character)
39 /\bcn.*euc$/i
40 /\bGB[-_ ]?2312(?:\D.*$|$)/i (see below)
41 gb2312-raw The raw (low-bit) GB2312 character map
42 gb12345-raw Traditional chinese counterpart to
43 GB2312 (raw)
44 iso-ir-165 GB2312 + GB6345 + GB8565 + additions
45 MacChineseSimp GB2312 + Apple Additions
46 cp936 Code Page 936, also known as GBK
47 (Extended GuoBiao)
48 hz 7-bit escaped GB2312 encoding
49 --------------------------------------------------------------------
50
51To find how to use this module in detail, see L<Encode>.
52
53=head1 NOTES
54
55Due to size concerns, C<GB 18030> (an extension to C<GBK>) is distributed
56separately on CPAN, under the name L<Encode::HanExtra>. That module
57also contains extra Taiwan-based encodings.
58
59=head1 BUGS
60
61When you see C<charset=gb2312> on mails and web pages, they really
62mean C<euc-cn> encodings. To fix that, C<gb2312> is aliased to C<euc-cn>.
63Use C<gb2312-raw> when you really mean it.
64
65The ASCII region (0x00-0x7f) is preserved for all encodings, even though
66this conflicts with mappings by the Unicode Consortium. See
67
68L<http://www.debian.or.jp/~kubota/unicode-symbols.html.en>
69
70to find out why it is implemented that way.
71
72=head1 SEE ALSO
73
74L<Encode>
75
76=cut
Note: See TracBrowser for help on using the repository browser.