1.打开中文 txt 文件,报错:‘gbk’................:
以二进制打开
open("threekingdoms.txt", "rb").read() 读出来了
open("threekingdoms.txt", "r", encoding='utf-8',errors='ignore').read() 报错
2.打开中文出现乱码:
encodeing = 'gbk'
3.decode()
http://www.runoob.com/python/att-string-decode.html
decode('ascii','igore') 解码遇到 ascii 是,忽略
4.SyntaxError: Non-UTF-8 code starting with '\xb6' in file
开头添加:
# -*- coding: gb2312 -*-
5.UnicodeDecodeError: 'gbk' codec can't decode byte 0xaa in position 6: illegal multibyte sequence
pandas 读取数据是报的错,使用 gbk 的超级 gb18030 即可读取
6.UnicodeEncodeError: 'gbk' codec can't encode character '\u3635' in position 19: illegal multibyte sequence
pandas to_csv() 保存时报的错误,可以使用编码 utf_8_sig,这样的话不管用 wps,还是 offica 都不是乱码