[解决]one of the variables needed for gradient computation has been modified by an inplace operation:

最新推荐文章于 2024-06-13 01:55:26 发布

越卡卡卡卡得要死

最新推荐文章于 2024-06-13 01:55:26 发布

阅读量291

点赞数

文章标签：机器学习 python pytorch

本文链接：https://blog.youkuaiyun.com/qq_34257368/article/details/129864578

版权

代码位置

for epoch in range(10):
    for vector,xyLoc in tqdm(train_loader):
        xyLoc = xyLoc.cuda()
        optimizer.zero_grad()
        outputAll,(outputh,outputc) = model(xyLoc)#这次只用loc进行预测
        try :
            registerY
        except NameError:
            registerY = outputAll[0]
            registerY = registerY.unsqueeze_(0)
        else:   
            registerY = torch.cat((registerY,outputAll[0].unsqueeze_(dim = 0)),dim = 0)

        loss = criterion(outputAll, xyLoc) 
        loss.backward()
        optimizer.step()
    if epoch%1==0:
        print("epoch "+str(epoch)+"  : \t "+str(loss))

在使用LSTM进行学习时出现

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation:
 [torch.cuda.FloatTensor [100, 12, 2]], which is output 0 of CudnnRnnBackward0, is at version 1; expected version 0 instead. 
Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).

排查下来是.unsqueeze_导致的，因为想将model输出的每次结果都记录下来所以进行了升维，结果他会直接保存升维后的形状，导致后面出错。

unsqueeze_()和unsqueeze()实现一样的功能, 区别在于unsqueeze_是in_place操作。

解决办法：将unsqueeze_()改为unsqueeze()