对ChatGLM-6进行训练时提示:DatasetGenerationError: An error occurred while generating the dataset

通过重新安装datasets依赖包,没有效果,经过和之前可以训练的脚本文件进行对比,发现新的脚本中缺少关键字(我训练的脚本中包含两个关键字类似key/value),补上关键字后训练可以正常运行。

以上是本人碰到的一种出现DatasetGenerationError: An error occurred while generating the dataset这个问题的情况。

Traceback (most recent call last): File "C:\Anaconda\envs\pytorch\lib\site-packages\datasets\builder.py", line 1855, in _prepare_split_single for _, table in generator: File "C:\Anaconda\envs\pytorch\lib\site-packages\datasets\packaged_modules\parquet\parquet.py", line 90, in _generate_tables if parquet_fragment.row_groups: File "pyarrow\\_dataset_parquet.pyx", line 386, in pyarrow._dataset_parquet.ParquetFileFragment.row_groups.__get__ File "pyarrow\\_dataset_parquet.pyx", line 393, in pyarrow._dataset_parquet.ParquetFileFragment.metadata.__get__ File "pyarrow\\_dataset_parquet.pyx", line 382, in pyarrow._dataset_parquet.ParquetFileFragment.ensure_complete_metadata File "pyarrow\\error.pxi", line 92, in pyarrow.lib.check_status pyarrow.lib.ArrowInvalid: Could not open Parquet input source '<Buffer>': Parquet file size is 0 bytes The above exception was the direct cause of the following exception: Traceback (most recent call last): File "C:\Users\31035\PycharmProjects\pythonProject1\main.py", line 12, in <module> dataset = load_dataset("imdb") File "C:\Anaconda\envs\pytorch\lib\site-packages\datasets\load.py", line 2084, in load_dataset builder_instance.download_and_prepare( File "C:\Anaconda\envs\pytorch\lib\site-packages\datasets\builder.py", line 925, in download_and_prepare self._download_and_prepare( File "C:\Anaconda\envs\pytorch\lib\site-packages\datasets\builder.py", line 1001, in _download_and_prepare self._prepare_split(split_generator, **prepare_split_kwargs) File "C:\Anaconda\envs\pytorch\lib\site-packages\datasets\builder.py", line 1742, in _prepare_split for job_id, done, content in self._prepare_split_single( File "C:\Anaconda\envs\pytorch\lib\site-packages\datasets\builder.py", line 1898, in _prepare_split_single raise DatasetGenerationError("An error occurred while generating the dataset") from e datasets.exceptions.DatasetGenerationError: An error occurred while generating the dataset Proce
03-29
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值