Changeset 1870 for trunk/gsdl/mappings/README
- Timestamp:
- 2001-01-29T14:54:58+13:00 (23 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
trunk/gsdl/mappings/README
r1868 r1870 1 This directory contains mapping files for converting various character 2 encodings to and from unicode. 1 This directory and its subdirectories contain .ump mapping files for 2 converting various character encodings to and from unicode. 3 4 To generate .ump files use a command like "makemapfile.pl -encoding 5 encodingname -mapfile textmapfile" where encodingname becomes the filename 6 of the two new .ump files and textmapfile is a plain text file containing a 7 tab separated list of the form: 8 0x8167 0x201C 9 where the first column is the hexadecimal value of the encoded character 10 and the second is the hexadecimal value of it's unicode equivalent. 11 12 13 14 The following .ump files were generated from their corresponding Microsoft 15 codepages. These codepages do, in some cases, differ very slightly from the 16 standards they were based on but we've used them anyway as they're so 17 extensively used on the web. 18 19 * gbk.ump: Simplified Chinese - generated from Microsoft's codepage 936 20 * shiftjis.ump: Japanese - generated from Microsoft's codepage 932 21 * uhc.ump: UHC Korean - generated from Microsoft's codepage 949 22 * big5.ump: Traditional Chinese - generated from Microsoft's codepage 950
Note:
See TracChangeset
for help on using the changeset viewer.