01. Detecting Chinese characters and punctuation in Java
private static boolean isChinese(char c) {
    // Look up the Unicode block that contains this character.
    Character.UnicodeBlock ub = Character.UnicodeBlock.of(c);
    // Treat the character as Chinese if it falls in any of the blocks below.
    return ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS
            || ub == Character.UnicodeBlock.CJK_COMPATIBILITY_IDEOGRAPHS
            || ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A
            || ub == Character.UnicodeBlock.GENERAL_PUNCTUATION
            || ub == Character.UnicodeBlock.CJK_SYMBOLS_AND_PUNCTUATION
            || ub == Character.UnicodeBlock.HALFWIDTH_AND_FULLWIDTH_FORMS;
}
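As a usage illustration, the sketch below applies the same block test to every char of a string. The class name ChineseTextCheck and the helper containsChinese are hypothetical names introduced here, not part of the original snippet, and the check works on UTF-16 chars only, so it assumes the text has no supplementary-plane ideographs.

```java
// Hypothetical wrapper class; the name is an assumption made for this sketch.
class ChineseTextCheck {

    // Same block test as the isChinese method above.
    static boolean isChinese(char c) {
        Character.UnicodeBlock ub = Character.UnicodeBlock.of(c);
        return ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS
                || ub == Character.UnicodeBlock.CJK_COMPATIBILITY_IDEOGRAPHS
                || ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A
                || ub == Character.UnicodeBlock.GENERAL_PUNCTUATION
                || ub == Character.UnicodeBlock.CJK_SYMBOLS_AND_PUNCTUATION
                || ub == Character.UnicodeBlock.HALFWIDTH_AND_FULLWIDTH_FORMS;
    }

    // Hypothetical helper: true if at least one char of the text passes isChinese.
    static boolean containsChinese(String text) {
        for (int i = 0; i < text.length(); i++) {
            if (isChinese(text.charAt(i))) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println(containsChinese("hello"));              // false
        System.out.println(containsChinese("hello \u4E16\u754C")); // true: two ideographs
        System.out.println(containsChinese("hello\uFF0C"));        // true: full-width comma U+FF0C
    }
}
```

Because GENERAL_PUNCTUATION is on the list, typographic marks that are not specific to Chinese, such as the curly quote U+201C, also count as a match. Whether that is desirable depends on the input being checked.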
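02. The CJK blocks in Character.UnicodeBlock
CJK is short for "Chinese, Japanese, Korean". In practice it refers to the Unicode encoding of the ideographs shared by the three languages.
The JDK API documents the following Character.UnicodeBlock constants used above: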

- Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS: U+4E00-U+9FFF, CJK Unified Ideographs
- Character.UnicodeBlock.CJK_COMPATIBILITY_IDEOGRAPHS: U+F900-U+FAFF, CJK Compatibility Ideographs
- Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A: U+3400-U+4DBF, CJK Unified Ideographs Extension A
- Character.UnicodeBlock.GENERAL_PUNCTUATION: U+2000-U+206F, General Punctuation
- Character.UnicodeBlock.CJK_SYMBOLS_AND_PUNCTUATION: U+3000-U+303F, CJK Symbols and Punctuation
- Character.UnicodeBlock.HALFWIDTH_AND_FULLWIDTH_FORMS: U+FF00-U+FFEF, Halfwidth and Fullwidth Forms
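To see how individual characters map onto the blocks listed above, a small sketch like the following can help. The class UnicodeBlockDemo and the method blockOf are hypothetical names chosen for this example, and the sample characters are arbitrary picks, one per block.

```java
// Hypothetical demo class; names are assumptions made for this sketch.
class UnicodeBlockDemo {

    // Thin wrapper around Character.UnicodeBlock.of for a single char.
    static Character.UnicodeBlock blockOf(char c) {
        return Character.UnicodeBlock.of(c);
    }

    public static void main(String[] args) {
        // An ideograph, an ideographic full stop, a full-width comma,
        // a dash from General Punctuation, and a plain Latin letter.
        char[] samples = {'\u4E2D', '\u3002', '\uFF0C', '\u2014', 'A'};
        for (char c : samples) {
            // Print the code point in hex next to the block it belongs to.
            System.out.printf("U+%04X -> %s%n", (int) c, blockOf(c));
        }
    }
}
```

Comparing the returned block with == against the constants, as the isChinese method does, is safe because each block is a single shared constant.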
Notes:
Character.isLetter(c): checks whether the character is a letter
Character.isDigit(c): checks whether the character is a digit
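A brief sketch of how these two methods behave on CJK input follows. The class LetterDigitDemo and the method classify are hypothetical names used only for illustration. Note that both methods follow Unicode categories, so an ideograph counts as a letter and a full-width digit counts as a digit.

```java
// Hypothetical demo class; names are assumptions made for this sketch.
class LetterDigitDemo {

    // Labels a char using only Character.isLetter and Character.isDigit.
    static String classify(char c) {
        if (Character.isLetter(c)) {
            return "letter";
        }
        if (Character.isDigit(c)) {
            return "digit";
        }
        return "other";
    }

    public static void main(String[] args) {
        System.out.println(classify('a'));      // letter
        System.out.println(classify('7'));      // digit
        System.out.println(classify('\u4E2D')); // letter: a CJK ideograph
        System.out.println(classify('\uFF15')); // digit: full-width digit five
        System.out.println(classify('\u3002')); // other: ideographic full stop
    }
}
```

For this reason Character.isLetter alone cannot separate Chinese text from Latin text, which is where the block-based check is useful.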
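This article shows how to use Character.UnicodeBlock in Java to detect Chinese characters and punctuation, covering CJK Unified Ideographs, CJK Compatibility Ideographs, and the related punctuation blocks, together with the code point range of each block. It also mentions Character.isLetter and Character.isDigit for checking whether a character is a letter or a digit.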



