Alex_ShengShen的博客[http://blog.youkuaiyun.com/shsalex/article/details/52104898] 有三篇文章。
1. 编码与乱码
Words and sentences in text are created from characters.
youtube视频Characters, Symbols and the Unicode Miracle
Unicode is not an encoding.
There are several ways to encode Unicode code points into bits.
-
Unicode is one large standard effort which has catalogued and specified a number ⟷ character relationship for virtually all characters and symbols of every major language in use, which is hundreds of thousands of characters
-
UTF-8, 16 and 32 are different sub-standards for how to encode this ginormous catalog of numbers to bytes, each with different size tradeoffs
Excusez-moi? = Excuse me?
所以Unicode只是character ⟷ number只是
Using Unicode, you can write a document containing virtually any language using any character you can type into a computer.
本文介绍了Unicode的基本概念及其在不同编码方式中的应用。解释了Unicode并非一种具体的编码方式,而是为几乎每种主要语言中使用的字符和符号建立了数字与字符之间的对应关系。此外,还讨论了UTF-8、UTF-16及UTF-32等标准如何将庞大的Unicode字符集转换为字节形式。
1749

被折叠的 条评论
为什么被折叠?



