Encode::CN(3pm)        Perl Programmers Reference Guide        Encode::CN(3pm)

NAME

       Encode::CN - China-based Chinese Encodings

SYNOPSIS

           use Encode qw/encode decode/;
           $euc_cn = encode("euc-cn", $utf8);   # loads Encode::CN implicitly
           $utf8   = decode("euc-cn", $euc_cn); # ditto

DESCRIPTION

       This module implements China-based Chinese charset encodings.
       The following encodings are supported.

         Canonical   Alias             Description
         --------------------------------------------------------------------
         euc-cn      /\beuc.*cn$/i     EUC (Extended Unix Code)
                     /\bcn.*euc$/i
                     /\bGB[-_ ]?2312(?:\D.*$|$)/i (see BUGS below)
         gb2312-raw                    The raw (low-bit) GB2312 character map
         gb12345-raw                   Traditional Chinese counterpart to
                                       GB2312 (raw)
         iso-ir-165                    GB2312 + GB6345 + GB8565 + additions
         MacChineseSimp                GB2312 + Apple Additions
         cp936                         Code Page 936, also known as GBK
                                       (Extended GuoBiao)
         hz                            7-bit escaped GB2312 encoding
         --------------------------------------------------------------------

       For details on how to use this module, see Encode.
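       The alias patterns in the table can be checked with
       Encode::find_encoding, which returns an encoding object whose name
       method reports the canonical name.  The following is a minimal sketch;
       the spellings in @names are illustrative samples, not an exhaustive
       list of what the patterns accept.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# Sample spellings (assumptions for illustration); each one should match
# one of the alias patterns listed for euc-cn.
my @names = ("EUC-CN", "cn-euc", "gb2312", "GB_2312-80");

my %canonical;
for my $name (@names) {
    my $enc = find_encoding($name);    # undef if the name is unknown
    $canonical{$name} = defined $enc ? $enc->name : "(unknown)";
    printf "%-12s => %s\n", $name, $canonical{$name};
}
```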

NOTES

       Due to size concerns, "GB 18030" (an extension to "GBK") is distributed
       separately on CPAN, under the name Encode::HanExtra. That module also
       contains extra Taiwan-based encodings.
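       Because Encode::HanExtra is optional, a script that prefers "gb18030"
       may want to test for it at run time.  The sketch below is one possible
       approach, not a recommended recipe: the fallback to "cp936" and the
       sample text are assumptions made for illustration, and the fallback
       only works for characters that GBK covers.

```perl
use strict;
use warnings;
use Encode qw(encode);

# Encode::HanExtra is a separate CPAN distribution; load it only if present.
my $have_hanextra = eval { require Encode::HanExtra; 1 } ? 1 : 0;

# Sample text (assumption): U+4E2D U+6587, both present in GB2312 and GBK.
my $text = "\x{4e2d}\x{6587}";

# Hypothetical policy: prefer gb18030, otherwise fall back to cp936 (GBK).
my $target = $have_hanextra ? "gb18030" : "cp936";
my $bytes  = encode($target, $text);

printf "%s: %s\n", $target, unpack("H*", $bytes);
```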

BUGS

       When you see "charset=gb2312" in mail and on web pages, it really
       means the "euc-cn" encoding.  To accommodate that, "gb2312" is aliased
       to "euc-cn".  Use "gb2312-raw" when you really mean the raw GB2312
       character map.

       The ASCII region (0x00-0x7f) is preserved for all encodings, even
       though this conflicts with mappings by the Unicode Consortium.  See

       <http://www.debian.or.jp/~kubota/unicode-symbols.html.en>

       to find out why it is implemented that way.
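       The difference between "gb2312" and "gb2312-raw", and the ASCII
       pass-through, can be seen by comparing encoded bytes.  This is a small
       sketch; the sample character U+4E2D and the ASCII test string are
       arbitrary choices, and only "euc-cn" is checked for ASCII here.

```perl
use strict;
use warnings;
use Encode qw(encode find_encoding);

my $sample = "\x{4e2d}";    # sample GB2312 character (assumption)

# "gb2312" is an alias, so it resolves to the euc-cn encoding object.
my $alias_target = find_encoding("gb2312")->name;

my $euc = encode("gb2312",     $sample);   # EUC form, high bit set
my $raw = encode("gb2312-raw", $sample);   # raw form, high bit clear

# Backslash and tilde are ASCII and are passed through unchanged.
my $ascii = encode("euc-cn", "A\\~");

printf "gb2312 resolves to: %s\n", $alias_target;
printf "gb2312 bytes:       %s\n", unpack("H*", $euc);
printf "gb2312-raw bytes:   %s\n", unpack("H*", $raw);
printf "ASCII bytes:        %s\n", unpack("H*", $ascii);
```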

SEE ALSO

       Encode



perl v5.10.1                      2009-02-12                   Encode::CN(3pm)