Solution: simply call .cpu() on the lengths tensor
packed = rnn.pack_padded_sequence(x, x_len.cpu(), batch_first=True)
Reason:
It's because of PR #41984, which preserves the device of the as_tensor argument if it's a torch tensor. pack_padded_sequence calls as_tensor on the lengths tensor: https://github.com/pytorch/pytorch/blob/master/torch/nn/utils/rnn.py#L234. This caused an implicit copy before, but does not now.
Given that the implementation does not do anything smart with the lengths on the GPU, and only copies and synchronizes behind the user's back, @myleott do you think we should restore the previous behavior, or can you call .cpu() on the lengths in your script before calling pack_padded_sequence?
If the lengths passed to pack_padded_sequence are given as a tensor, that tensor must reside on the CPU.
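For context, here is a minimal sketch of the pattern: the padded batch lives on the GPU while the lengths stay on the CPU. The shapes, hidden size, and variable names are illustrative assumptions, not from the original post.

import torch
from torch.nn.utils import rnn

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Padded batch: 3 sequences, max length 4, feature size 5 (illustrative shapes)
x = torch.randn(3, 4, 5, device=device)

# Actual sequence lengths, sorted in decreasing order (default enforce_sorted=True);
# keep them on the CPU, or call .cpu() as in the fix above
x_len = torch.tensor([4, 3, 2])

packed = rnn.pack_padded_sequence(x, x_len.cpu(), batch_first=True)

lstm = torch.nn.LSTM(input_size=5, hidden_size=8, batch_first=True).to(device)
output, (h_n, c_n) = lstm(packed)

# Unpack back to a padded tensor if needed
unpacked, lengths = rnn.pad_packed_sequence(output, batch_first=True)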
This article describes how to resolve a tensor device (CPU/GPU) mismatch encountered when using PyTorch's RNN utilities. When calling pack_padded_sequence, if the lengths argument is a Tensor, it must be on the CPU. The article gives a concrete fix and explains the underlying cause.