字符编码在python中的处理_字符编码在python中用'u2019'替换'

本文探讨了在Python中使用内置函数处理字符串时,如何确保'字符在'BACKRUSHIN'这样的字符串中正确显示,重点介绍了unidecode库的应用。作者分享了尝试过的多种方法,并最终通过unidecode.unidecode()函数解决了问题。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

I have tried numerous ways to encode this to the end result "BACK RUSHIN'" with the most important character being the right apostrophe '.

I would like a way of getting to this end result using some of the built in functions Python has where there is no discrimination between a normal string and a unicode string.

This was the code I was using to retrieve the string: str(unicode(etree.tostring(root.xpath('path')[0],method='text', encoding='utf-8'),errors='ignore')).strip()

With the result being: 'BACK RUSHIN' the thing being the apostrophe ' is missing.

Another way was: root.xpath('path/text()')

And that result was: u'BACK RUSHIN\u2019' in python.

Lastly if I try: u'BACK RUSHIN\u2019'.encode('ascii', 'replace')

The result is: 'BACK RUSHIN?'

Please no replace functions, I would like to make use of pythons codec libraries.

Also no printing the string because it is being held in a variable.

Thanks

解决方案>>> import unidecode

>>> unidecode.unidecode(u'BACK RUSHIN\u2019')

"BACK RUSHIN'"

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值