http://a4esl.org/c/charset.html
Character Sets
charset=
Some Basic Ones
charset=big5 - Chinese Traditional (Big5)
charset=euc-kr - Korean (EUC)
charset=iso-8859-1 - Western Alphabet
charset=iso-8859-2 - Central European Alphabet (ISO)
charset=iso-8859-3 - Latin 3 Alphabet (ISO)
charset=iso-8859-4 - Baltic Alphabet (ISO)
charset=iso-8859-5 - Cyrillic Alphabet (ISO)
charset=iso-8859-6 - Arabic Alphabet (ISO)
charset=iso-8859-7 - Greek Alphabet (ISO)
charset=iso-8859-8 - Hebrew Alphabet (ISO)
charset=koi8-r - Cyrillic Alphabet (KOI8-R)
charset=shift-jis - Japanese (Shift-JIS)
charset=x-euc - Japanese (EUC)
charset=utf-8 - Universal Alphabet (UTF-8)
charset=windows-1250 - Central European Alphabet (Windows)
charset=windows-1251 - Cyrillic Alphabet (Windows)
charset=windows-1252 - Western Alphabet (Windows)
charset=windows-1253 - Greek Alphabet (Windows)
charset=windows-1254 - Turkish Alphabet
charset=windows-1255 - Hebrew Alphabet (Windows)
charset=windows-1256 - Arabic Alphabet (Windows)
charset=windows-1257 - Baltic Alphabet (Windows)
charset=windows-1258 - Vietnamese Alphabet (Windows)
charset=windows-874 - Thai (Windows)
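As a rough illustration of how a charset= value is typically consumed, the sketch below pulls the charset parameter out of a Content-Type style string and uses it to decode bytes. The helper names (charset_from_content_type, decode_body) and the UTF-8 fallback are assumptions made for this example, not something the list above prescribes.

```python
from email.message import Message


def charset_from_content_type(header_value, default="utf-8"):
    """Return the charset parameter of a Content-Type value, lowercased.

    Falls back to `default` (an assumption of this sketch) when the
    header carries no charset parameter.
    """
    msg = Message()
    msg["Content-Type"] = header_value
    return msg.get_content_charset() or default


def decode_body(body, header_value):
    """Decode raw bytes using the charset named in the header value."""
    return body.decode(charset_from_content_type(header_value))


if __name__ == "__main__":
    raw = "Привет".encode("windows-1251")
    print(decode_body(raw, "text/html; charset=windows-1251"))
```

Python's codec lookup accepts many, though not all, of the names in this list, so a production version would likely want to handle LookupError for labels it does not know.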
A Longer List
Arabic (ASMO 708) - charset=ASMO-708
Arabic (DOS) - charset=DOS-720
Arabic (ISO) - charset=iso-8859-6
Arabic (Mac) - charset=x-mac-arabic
Arabic (Windows) - charset=windows-1256
Baltic (DOS) - charset=ibm775
Baltic (ISO) - charset=iso-8859-4
Baltic (Windows) - charset=windows-1257
Central European (DOS) - charset=ibm852
Central European (ISO) - charset=iso-8859-2
Central European (Mac) - charset=x-mac-ce
Central European (Windows) - charset=windows-1250
Chinese Simplified (EUC) - charset=EUC-CN
Chinese Simplified (GB2312) - charset=gb2312
Chinese Simplified (HZ) - charset=hz-gb-2312
Chinese Simplified (Mac) - charset=x-mac-chinesesimp
Chinese Traditional (Big5) - charset=big5
Chinese Traditional (CNS) - charset=x-Chinese-CNS
Chinese Traditional (Eten) - charset=x-Chinese-Eten
Chinese Traditional (Mac) - charset=x-mac-chinesetrad
charset=950
Cyrillic (DOS) - charset=cp866
Cyrillic (ISO) - charset=iso-8859-5
Cyrillic (KOI8-R) - charset=koi8-r
Cyrillic (KOI8-U) - charset=koi8-u
Cyrillic (Mac) - charset=x-mac-cyrillic
Cyrillic (Windows) - charset=windows-1251
Europa - charset=x-Europa
German (IA5) - charset=x-IA5-German
Greek (DOS) - charset=ibm737
Greek (ISO) - charset=iso-8859-7
Greek (Mac) - charset=x-mac-greek
Greek (Windows) - charset=windows-1253
Greek, Modern (DOS) - charset=ibm869
Hebrew (DOS) - charset=DOS-862
Hebrew (ISO-Logical) - charset=iso-8859-8-i
Hebrew (ISO-Visual) - charset=iso-8859-8
Hebrew (Mac) - charset=x-mac-hebrew
Hebrew (Windows) - charset=windows-1255
IBM EBCDIC (Arabic) - charset=x-EBCDIC-Arabic
IBM EBCDIC (Cyrillic Russian) - charset=x-EBCDIC-CyrillicRussian
IBM EBCDIC (Cyrillic Serbian-Bulgarian) - charset=x-EBCDIC-CyrillicSerbianBulgarian
IBM EBCDIC (Denmark-Norway) - charset=x-EBCDIC-DenmarkNorway
IBM EBCDIC (Denmark-Norway-Euro) - charset=x-ebcdic-denmarknorway-euro
IBM EBCDIC (Finland-Sweden) - charset=x-EBCDIC-FinlandSweden
IBM EBCDIC (Finland-Sweden-Euro) - charset=x-ebcdic-finlandsweden-euro
IBM EBCDIC (France-Euro) - charset=x-ebcdic-france-euro
IBM EBCDIC (Germany) - charset=x-EBCDIC-Germany
IBM EBCDIC (Germany-Euro) - charset=x-ebcdic-germany-euro
IBM EBCDIC (Greek Modern) - charset=x-EBCDIC-GreekModern
IBM EBCDIC (Greek) - charset=x-EBCDIC-Greek
IBM EBCDIC (Hebrew) - charset=x-EBCDIC-Hebrew
IBM EBCDIC (Icelandic) - charset=x-EBCDIC-Icelandic
IBM EBCDIC (Icelandic-Euro) - charset=x-ebcdic-icelandic-euro
IBM EBCDIC (International-Euro) - charset=x-ebcdic-international-euro
IBM EBCDIC (Italy) - charset=x-EBCDIC-Italy
IBM EBCDIC (Italy-Euro) - charset=x-ebcdic-italy-euro
IBM EBCDIC (Japanese and Japanese Katakana) - charset=x-EBCDIC-JapaneseAndKana
IBM EBCDIC (Japanese and Japanese-Latin) - charset=x-EBCDIC-JapaneseAndJapaneseLatin
IBM EBCDIC (Japanese and US-Canada) - charset=x-EBCDIC-JapaneseAndUSCanada
IBM EBCDIC (Japanese katakana) - charset=x-EBCDIC-JapaneseKatakana
IBM EBCDIC (Korean and Korean Extended) - charset=x-EBCDIC-KoreanAndKoreanExtended
IBM EBCDIC (Korean Extended) - charset=x-EBCDIC-KoreanExtended
IBM EBCDIC (Multilingual Latin-2) - charset=CP870
IBM EBCDIC (Simplified Chinese) - charset=x-EBCDIC-SimplifiedChinese
IBM EBCDIC (Spain) - charset=X-EBCDIC-Spain
IBM EBCDIC (Spain-Euro) - charset=x-ebcdic-spain-euro
IBM EBCDIC (Thai) - charset=x-EBCDIC-Thai
IBM EBCDIC (Traditional Chinese) - charset=x-EBCDIC-TraditionalChinese
IBM EBCDIC (Turkish Latin-5) - charset=CP1026
IBM EBCDIC (Turkish) - charset=x-EBCDIC-Turkish
IBM EBCDIC (UK) - charset=x-EBCDIC-UK
IBM EBCDIC (UK-Euro) - charset=x-ebcdic-uk-euro
IBM EBCDIC (US-Canada) - charset=ebcdic-cp-us
IBM EBCDIC (US-Canada-Euro) - charset=x-ebcdic-cp-us-euro
Icelandic (DOS) - charset=ibm861
Icelandic (Mac) - charset=x-mac-icelandic
ISCII Assamese - charset=x-iscii-as
ISCII Bengali - charset=x-iscii-be
ISCII Devanagari - charset=x-iscii-de
ISCII Gujarathi - charset=x-iscii-gu
ISCII Kannada - charset=x-iscii-ka
ISCII Malayalam - charset=x-iscii-ma
ISCII Oriya - charset=x-iscii-or
ISCII Panjabi - charset=x-iscii-pa
ISCII Tamil - charset=x-iscii-ta
ISCII Telugu - charset=x-iscii-te
Japanese (EUC) - charset=euc-jp
charset=x-euc-jp
Japanese (JIS) - charset=iso-2022-jp
Japanese (JIS-Allow 1 byte Kana - SO/SI) - charset=iso-2022-jp
Japanese (JIS-Allow 1 byte Kana) - charset=csISO2022JP
Japanese (Mac) - charset=x-mac-japanese
Japanese (Shift-JIS) - charset=shift_jis
Korean - charset=ks_c_5601-1987
Korean (EUC) - charset=euc-kr
Korean (ISO) - charset=iso-2022-kr
Korean (Johab) - charset=Johab
Korean (Mac) - charset=x-mac-korean
Latin 3 (ISO) - charset=iso-8859-3
Latin 9 (ISO) - charset=iso-8859-15
Norwegian (IA5) - charset=x-IA5-Norwegian
OEM United States - charset=IBM437
Swedish (IA5) - charset=x-IA5-Swedish
Thai (Windows) - charset=windows-874
Turkish (DOS) - charset=ibm857
Turkish (ISO) - charset=iso-8859-9
Turkish (Mac) - charset=x-mac-turkish
Turkish (Windows) - charset=windows-1254
Unicode - charset=unicode
Unicode (Big-Endian) - charset=unicodeFFFE
Unicode (UTF-7) - charset=utf-7
Unicode (UTF-8) - charset=utf-8
US-ASCII - charset=us-ascii
Vietnamese (Windows) - charset=windows-1258
Western European (DOS) - charset=ibm850
Western European (IA5) - charset=x-IA5
Western European (ISO) - charset=iso-8859-1
Western European (Mac) - charset=macintosh
Western European (Windows) - charset=Windows-1252
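Many names in the longer list carry an x- prefix, which conventionally marks a private or unregistered label, so a given runtime may not recognize all of them. The sketch below is one way to check which labels Python's standard codecs module accepts; the function name recognized_codecs is hypothetical, and the result reflects only Python's codec registry, not what browsers or other software support.

```python
import codecs


def recognized_codecs(labels):
    """Map each charset label to Python's canonical codec name, or None.

    None means codecs.lookup() raised LookupError for that label, i.e.
    this Python installation has no codec registered under that name.
    """
    result = {}
    for label in labels:
        try:
            result[label] = codecs.lookup(label).name
        except LookupError:
            result[label] = None
    return result


if __name__ == "__main__":
    sample = ["utf-8", "Windows-1252", "koi8-r", "x-EBCDIC-Thai"]
    for label, name in recognized_codecs(sample).items():
        print(label, "->", name)
```

Labels that map to None would need a different decoder or a manual mapping to an equivalent code page, if one exists.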
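This article lists character sets and their corresponding charset names in detail, covering Asian languages such as Simplified and Traditional Chinese, Japanese, and Korean, as well as the character sets used for Western languages. It serves as a quick reference for finding the charset that applies in a given scenario.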



