字符编码问题

本文介绍了Unicode的基本概念及其在不同编码方式中的应用。解释了Unicode并非一种具体的编码方式,而是为几乎每种主要语言中使用的字符和符号建立了数字与字符之间的对应关系。此外,还讨论了UTF-8、UTF-16及UTF-32等标准如何将庞大的Unicode字符集转换为字节形式。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

Alex_ShengShen的博客[http://blog.youkuaiyun.com/shsalex/article/details/52104898] 有三篇文章。
1. 编码与乱码

Words and sentences in text are created from characters.


youtube视频Characters, Symbols and the Unicode Miracle


Unicode is not an encoding.
There are several ways to encode Unicode code points into bits.

-

Unicode is one large standard effort which has catalogued and specified a number ⟷ character relationship for virtually all characters and symbols of every major language in use, which is hundreds of thousands of characters

-

UTF-8, 16 and 32 are different sub-standards for how to encode this ginormous catalog of numbers to bytes, each with different size tradeoffs

Excusez-moi? = Excuse me?

所以Unicode只是character ⟷ number只是

Using Unicode, you can write a document containing virtually any language using any character you can type into a computer.

Code point

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值