CHARSETS 968B

1234567891011121314151617181920212223242526272829303132333435363738394041424344454647
  1. Following is a list of character sets along with their widths:
  2. --------------------------------------------------------------
  3. 1 Octet 8bit:
  4. -------------
  5. Windows 125* (CP125*)
  6. CP*
  7. ANSI
  8. ISO-8859-* (IEC-8859-*)
  9. Macintosh (Mac OS Roman)
  10. KOI8-U (potentially KOI*8-*)
  11. KOI8-R
  12. MIK
  13. Cork (T1)
  14. ISCII
  15. VISCII
  16. 1 Octet 7bit:
  17. -------------
  18. US-ASCII
  19. K0I7
  20. 2 octets 16 bit:
  21. ----------------
  22. UCS-2
  23. UTF-16* (UTF-16BE etc)
  24. 4-octets 32 bit:
  25. ----------------
  26. UCS-4
  27. UTF-32
  28. Variable-width:
  29. ----------------------------
  30. Big5 - http://en.wikipedia.org/wiki/Big5 (1-2 bytes: 00-7f=1, 81-fe=2)
  31. HKSCS - http://en.wikipedia.org/wiki/HKSCS (a big5 variant, but some variants use 10646)
  32. ISO-10646 (IEC-10646) - http://en.wikipedia.org/wiki/ISO_10646 (unicode)
  33. UTF-8 (1-5 bytes)
  34. ISO-2022 (IEC-2022) - http://en.wikipedia.org/wiki/ISO_2022
  35. Shift-JIS - http://en.wikipedia.org/wiki/Shift-JIS
  36. A good resource:
  37. ----------------
  38. http://en.wikipedia.org/wiki/Character_encoding#Simple_character_sets