Character Encoding Issues

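This article introduces the basic concepts of Unicode and how it relates to the different encodings. It explains that Unicode is not itself a specific encoding; rather, it establishes a number-to-character mapping for the characters and symbols used in almost every major language. It also discusses how standards such as UTF-8, UTF-16, and UTF-32 convert the huge Unicode character set into bytes.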

Alex_ShengShen's blog [http://blog.youkuaiyun.com/shsalex/article/details/52104898] has three articles.
1. Encoding and garbled text (mojibake)

Words and sentences in text are created from characters.


YouTube video: Characters, Symbols and the Unicode Miracle


Unicode is not an encoding.
There are several ways to encode Unicode code points into bits.

- Unicode is one large standard effort which has catalogued and specified a number ⟷ character relationship for virtually all characters and symbols of every major language in use, which amounts to well over a hundred thousand characters.

- UTF-8, UTF-16 and UTF-32 are different sub-standards for how to encode this ginormous catalog of numbers to bytes, each with different size tradeoffs.
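To make the size tradeoffs concrete, here is a minimal Python sketch that encodes a few characters under each of the three encodings and counts the resulting bytes. The helper name `encoded_sizes` and the sample characters are only illustrative choices, and the little-endian codec variants are used so that no byte order mark is included in the counts.

```python
# Compare how many bytes one character needs in UTF-8, UTF-16 and UTF-32.
# The "-le" codec variants are used so no byte order mark (BOM) is counted.

def encoded_sizes(ch):
    """Return the encoded byte length of `ch` for each encoding (illustrative helper)."""
    return {name: len(ch.encode(name)) for name in ("utf-8", "utf-16-le", "utf-32-le")}

# Sample characters: Latin "A", accented "e" (U+00E9), a CJK character (U+4E2D),
# and an emoji outside the Basic Multilingual Plane (U+1F600).
samples = ["A", "\u00e9", "\u4e2d", "\U0001F600"]

for ch in samples:
    print(f"U+{ord(ch):04X}", encoded_sizes(ch))
```

UTF-8 is the most compact for ASCII text but grows to 3 or 4 bytes for other scripts, UTF-16 uses 2 bytes for most common characters and 4 for the rest, and UTF-32 always uses 4 bytes.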

Excusez-moi? = Excuse me?

So Unicode is just a character ⟷ number mapping.

Using Unicode, you can write a document containing virtually any language using any character you can type into a computer.

Code point
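A code point is the number on the number side of that character ⟷ number mapping, conventionally written as U+ followed by hexadecimal digits. As a small sketch, Python's built-in `ord` and `chr` convert between a character and its code point; the helper name `code_points` is just an illustrative choice.

```python
# Look up the code point of each character, and go back from a number to a character.

def code_points(text):
    """Return each character's Unicode code point in U+XXXX notation (illustrative helper)."""
    return [f"U+{ord(ch):04X}" for ch in text]

print(code_points("moi"))   # ['U+006D', 'U+006F', 'U+0069']
print(chr(0x4E2D))          # the character assigned to code point U+4E2D
```

Note that none of this involves bytes yet: the code point only identifies the character, and an encoding such as UTF-8 decides how that number is stored.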
