原文地址:http://www.chedong.com/tech/hello_unicode.html
The Unicode 2.0 Character Set
Characters |
Description |
---|---|
/u0000 - /u1FFF |
Alphabets |
/u0020 - /u007F |
Basic Latin |
/u0080 - /u00FF |
Latin-1 supplement |
/u0100 - /u017F |
Latin extended-A |
/u0180 - /u024F |
Latin extended-B |
/u0250 - /u02AF |
IPA extensions |
/u02B0 - /u02FF |
Spacing modifier letters |
/u0300 - /u036F |
Combining diacritical marks |
/u0370 - /u03FF |
Greek |
/u0400 - /u04FF |
Cyrillic |
/u0530 - /u058F |
Armenian |
/u0590 - /u05FF |
Hebrew |
/u0600 - /u06FF |
Arabic |
/u0900 - /u097F |
Devanagari |
/u0980 - /u09FF |
Bengali |
/u0A00 - /u0A7F |
Gurmukhi |
/u0A80 - /u0AFF |
Gujarati |
/u0B00 - /u0B7F |
Oriya |
/u0B80 - /u0BFF |
Tamil |
/u0C00 - /u0C7F |
Telugu |
/u0C80 - /u0CFF |
Kannada |
/u0D00 - /u0D7F |
Malayalam |
/u0E00 - /u0E7F |
Thai |
/u0E80 - /u0EFF |
Lao |
/u0F00 - /u0FBF |
Tibetan |
/u10A0 - /u10FF |
Georgian |
/u1100 - /u11FF |
Hangul Jamo |
/u1E00 - /u1EFF |
Latin extended additional |
/u1F00 - /u1FFF |
Greek extended |
/u2000 - /u2FFF |
Symbols and punctuation |
/u2000 - /u206F |
General punctuation |
/u2070 - /u209F |
Superscripts and subscripts |
/u20A0 - /u20CF |
Currency symbols |
/u20D0 - /u20FF |
Combining diacritical marks for symbols |
/u2100 - /u214F |
Letterlike symbols |
/u2150 - /u218F |
Number forms |
/u2190 - /u21FF |
Arrows |
/u2200 - /u22FF |
Mathematical operators |
/u2300 - /u23FF |
Miscellaneous technical |
/u2400 - /u243F |
Control pictures |
/u2440 - /u245F |
Optical character recognition |
/u2460 - /u24FF |
Enclosed alphanumerics |
/u2500 - /u257F |
Box drawing |
/u2580 - /u259F |
Block elements |
/u25A0 - /u25FF |
Geometric shapes |
/u2600 - /u26FF |
Miscellaneous symbols |
/u2700 - /u27BF |
Dingbats |
/u3000 - /u33FF |
CJK auxiliary |
/u3000 - /u303F |
CJK symbols and punctuation |
/u3040 - /u309F |
Hiragana |
/u30A0 - /u30FF |
Katakana |
/u3100 - /u312F |
Bopomofo |
/u3130 - /u318F |
Hangul compatibility Jamo |
/u3190 - /u319F |
Kanbun |
/u3200 - /u32FF |
Enclosed CJK letters and months |
/u3300 - /u33FF |
CJK compatibility |
/u4E00 - /u9FFF |
CJK unified ideographs: Han characters used in China, Japan, Korea, Taiwan, and Vietnam |
/uAC00 - /uD7A3 |
Hangul syllables |
/uD800 - /uDFFF |
Surrogates |
/uD800 - /uDB7F |
High surrogates |
/uDB80 - /uDBFF |
High private use surrogates |
/uDC00 - /uDFFF |
Low surrogates |
/uE000 - /uF8FF |
Private use |
/uF900 - /uFFFF |
Miscellaneous |
/uF900 - /uFAFF |
CJK compatibility ideographs |
/uFB00 - /uFB4F |
Alphabetic presentation forms |
/uFB50 - /uFDFF |
Arabic presentation forms-A |
/uFE20 - /uFE2F |
Combing half marks |
/uFE30 - /uFE4F |
CJK compatibility forms |
/uFE50 - /uFE6F |
Small form variants |
/uFE70 - /uFEFE |
Arabic presentation forms-B |
/uFEFF |
Specials |
/uFF00 - /uFFEF |
Halfwidth and fullwidth forms |
/uFFF0 - /uFFFF |
Special |