- The following is a list of character sets along with their widths:
- ------------------------------------------------------------------
-
- 1 octet, 8-bit:
- ---------------
- Windows 125* (CP125*)
- CP*
- ANSI
- ISO-8859-* (IEC-8859-*)
- Macintosh (Mac OS Roman)
- KOI8-U (potentially KOI8-*)
- KOI8-R
- MIK
- Cork (T1)
- ISCII
- VISCII
-
-
- 1 octet, 7-bit:
- ---------------
- US-ASCII
- KOI7
-
- 2 octets, 16-bit:
- -----------------
- UCS-2
- UTF-16* (UTF-16BE etc; 2 octets per code unit, 4 octets via surrogate pairs for characters outside the BMP)
-
- 4 octets, 32-bit:
- -----------------
- UCS-4
- UTF-32
-
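For the fixed-width sets listed above, the number of characters in a byte string can be worked out from its length alone, without decoding. The following is only a minimal sketch of that idea: `FIXED_WIDTHS` and `char_count` are hypothetical names introduced here, the table covers just a few of the listed charsets, and UTF-16 is left out on purpose because surrogate pairs break the fixed-width assumption.

```python
# Hypothetical lookup table: octets per character for a few fixed-width charsets.
FIXED_WIDTHS = {
    "us-ascii": 1,
    "iso-8859-1": 1,
    "windows-1252": 1,
    "koi8-r": 1,
    "ucs-2": 2,
    "ucs-4": 4,
    "utf-32be": 4,
}


def char_count(data: bytes, charset: str) -> int:
    """Return the number of characters in data without decoding it."""
    width = FIXED_WIDTHS[charset.lower()]
    if len(data) % width != 0:
        raise ValueError("byte length is not a multiple of the character width")
    return len(data) // width


# "h\u00e9llo" is five characters; UTF-32BE stores each one in four octets.
sample = "h\u00e9llo".encode("utf-32-be")
print(len(sample), char_count(sample, "utf-32be"))
```

The same division does not work for the variable-width sets, where the width has to be read from each character's lead byte.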
- Variable-width:
- ---------------
- Big5 - http://en.wikipedia.org/wiki/Big5 (1-2 bytes: lead byte 00-7f = 1 byte, lead byte 81-fe = 2 bytes)
- HKSCS - http://en.wikipedia.org/wiki/HKSCS (a Big5 variant, but some variants use ISO 10646)
- ISO-10646 (IEC-10646) - http://en.wikipedia.org/wiki/ISO_10646 (Unicode)
- UTF-8 (1-4 bytes)
- ISO-2022 (IEC-2022) - http://en.wikipedia.org/wiki/ISO_2022
- Shift-JIS - http://en.wikipedia.org/wiki/Shift-JIS
-
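For the variable-width sets, the width of each character is usually determined by its lead byte. The sketch below illustrates this for two entries in the list: the Big5 ranges are the ones given above (00-7f = 1 byte, 81-fe = 2 bytes), and the UTF-8 ranges follow RFC 3629, which limits sequences to 4 bytes. The function names are hypothetical, and trail bytes are not validated, so this is an illustration of the counting technique rather than a full decoder.

```python
def big5_width(lead: int) -> int:
    """Width in bytes of a Big5 character, judged from its lead byte."""
    if 0x00 <= lead <= 0x7F:
        return 1
    if 0x81 <= lead <= 0xFE:
        return 2
    raise ValueError(f"invalid Big5 lead byte: {lead:#04x}")


def utf8_width(lead: int) -> int:
    """Width in bytes of a UTF-8 sequence, judged from its lead byte."""
    if lead <= 0x7F:
        return 1
    if 0xC2 <= lead <= 0xDF:
        return 2
    if 0xE0 <= lead <= 0xEF:
        return 3
    if 0xF0 <= lead <= 0xF4:
        return 4
    raise ValueError(f"invalid UTF-8 lead byte: {lead:#04x}")


def count_chars(data: bytes, width_of) -> int:
    """Count characters by hopping from lead byte to lead byte."""
    count = 0
    i = 0
    while i < len(data):
        i += width_of(data[i])
        count += 1
    if i != len(data):
        raise ValueError("truncated multi-byte character")
    return count


# Two Chinese characters followed by three ASCII letters.
print(count_chars("\u4e2d\u6587abc".encode("big5"), big5_width))
print(count_chars("\u4e2d\u6587abc".encode("utf-8"), utf8_width))
```

Both calls count five characters, although the Big5 string is 7 bytes long and the UTF-8 string is 9.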
- A good resource:
- ----------------
- http://en.wikipedia.org/wiki/Character_encoding#Simple_character_sets
-