ValueError: Falling back to the 'python' engine because the separator encoded in utf-8 is > 1 char long

条件漫步

Published 2021-04-07 11:09:12

License: CC 4.0 BY-SA

Category: python

Article link: https://blog.youkuaiyun.com/chenhepg/article/details/115480926

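This article describes a ValueError raised when reading a CSV file with pandas. The cause was that the file is delimited by English (ASCII) semicolons, while the code passed a Chinese (full-width) semicolon as the separator. The fixes are to correct the semicolon character or, for separators that really are multi-character, to specify engine='python'. Keywords: pandas, CSV reading, UTF-8 encoding, engine setting.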

@Created: 20210407
@Modified: 20210407

1. Problem description

Reading a CSV file with pandas in Python raised the following error:

ValueError: Falling back to the 'python' engine because the separator encoded in utf-8 is > 1 char long, and the 'c' engine does not support such separators, but this causes 'low_memory' to be ignored as it is not supported by the 'python' engine.

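2. Solution

In my case, the CSV file is delimited by English (ASCII) semicolons, but I passed a Chinese full-width semicolon as the separator when reading it. The full-width semicolon (；, U+FF1B) takes 3 bytes in UTF-8, so pandas treats it as a separator longer than one character, which the 'c' engine does not support.

import os
import pandas as pd

if os.path.exists("data/test.csv"):
    df_day = pd.read_csv(filepath_or_buffer='data/test.csv', sep='；', header=0, low_memory=False)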
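The two semicolons look almost identical on screen, so it can help to inspect them directly. The following is a minimal sketch using only the standard library; the helper name `utf8_length` is an illustrative assumption, not part of pandas.

```python
# Compare the two look-alike separators by code point and UTF-8 length.
ascii_semicolon = ';'        # U+003B, the usual CSV delimiter
fullwidth_semicolon = '；'   # U+FF1B, typed with a Chinese input method


def utf8_length(sep: str) -> int:
    """Return the number of bytes `sep` occupies when encoded as UTF-8."""
    return len(sep.encode('utf-8'))


for name, sep in [('ASCII', ascii_semicolon), ('full-width', fullwidth_semicolon)]:
    print(f"{name}: U+{ord(sep):04X}, {utf8_length(sep)} byte(s) in UTF-8")
```

A separator that is a single character but more than one byte in UTF-8 is what the "> 1 char long" part of the error message refers to.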

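Change the Chinese semicolon (；) to the English semicolon (;):

if os.path.exists("data/test.csv"):
    # sep is now the ASCII semicolon, which the default 'c' engine supports
    df_day = pd.read_csv(filepath_or_buffer='data/test.csv', sep=';', header=0, low_memory=False)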
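To try both variants without a file on disk, here is a small sketch that reads from an in-memory buffer. The sample data in `csv_text` is made up for illustration and stands in for `data/test.csv`. The ValueError is what the pandas versions I am aware of raise when `low_memory=False` is combined with a separator that forces the fallback; other versions may behave differently, so the code only records the message rather than relying on it.

```python
import io

import pandas as pd

# Hypothetical stand-in for data/test.csv, delimited by ASCII semicolons.
csv_text = "date;open;close\n2021-04-06;10.0;10.5\n2021-04-07;10.5;10.2\n"

# Correct separator: one byte in UTF-8, so the default C engine handles it.
df_ok = pd.read_csv(io.StringIO(csv_text), sep=';', header=0, low_memory=False)

# Mistyped separator: the full-width semicolon forces a fallback to the
# python engine, which does not support low_memory, so an error is expected.
try:
    pd.read_csv(io.StringIO(csv_text), sep='；', header=0, low_memory=False)
    error_message = None  # no error raised on this pandas version
except ValueError as exc:
    error_message = str(exc)

print(df_ok.shape)
print(error_message)
```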

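3. Other solutions

Reference: python-碰到的问题 (Python: problems encountered)

If the file really does use a multi-character separator, such as '::', add the engine parameter engine='python' to the function call. This does not help when the separator is simply mistyped, because the file will still not be split on the right character. For example:

import pandas as pd

header = ['user_id', 'item_id', 'rating', 'timestamp']
df = pd.read_csv("D:/ratings.dat", sep='::', names=header, engine='python')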
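The difference between a real multi-character separator and a mistyped one can be sketched with in-memory data. The sample rows in `ratings_text` and the two-column string passed to the second call are invented for illustration; only `sep`, `names`, and `engine` come from the call above.

```python
import io

import pandas as pd

header = ['user_id', 'item_id', 'rating', 'timestamp']

# Hypothetical rows in the '::'-delimited layout of ratings.dat.
ratings_text = "1::1193::5::978300760\n1::661::3::978302109\n"

# A genuine multi-character separator: engine='python' parses it as intended.
df = pd.read_csv(io.StringIO(ratings_text), sep='::', names=header, engine='python')

# A mistyped separator: engine='python' avoids the error, but the
# full-width semicolon never occurs in the data, so nothing is split.
df_bad = pd.read_csv(io.StringIO("a;b\n1;2\n"), sep='；', engine='python')

print(df.shape)
print(list(df_bad.columns))
```

In the second call each row ends up in a single column, which is why correcting the separator is the right fix for the semicolon case.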