
cuda
zisuina_2
这个作者很懒,什么都没留下…
展开
-
Reducer buckets have been rebuilt in this iteration.
在跑torch多GPU报错“Reducer buckets have been rebuilt in this iteration.”原因是torch版本问题, torch1.7以上的distributed.py发生更改导致报错:这玩意是distributed.py里的前向函数报错def forward(self, inputs, *kwargs): if self.ddp_join_enabled: ones = torch.ones(原创 2021-11-05 11:13:33 · 13648 阅读 · 9 评论 -
command gcc failed with exit status 1报错
command ‘gcc’ failed with exit status 1`报错报错!头疼!econn/models/operators/nms/src/nms_cuda.cpp:4:23: error: ‘AT_CHECK’ was not declared in this scope #define CHECK_CUDA(x) AT_CHECK(x.type().is_cuda(), #x, " must be a CUDAtensor ") ^原创 2020-05-25 14:34:53 · 724 阅读 · 0 评论 -
ubuntu重启后nvidia消失-NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver.
之前能正常运行,重启后,机器开始矫情,运行程序报错说cuda出问题;那就nvidia-smi,发现链接失败nvidia-smi报错:(NVIDIA-SMI失败因为它没法链接NVIDIA设备。请确保最新版本的NVIDIA已经安装和运行)- 翻译过来也是屁话解决方案1.检查NVIDIA是否存在nvcc -V输出说明NVIDIA还在,(不在的亲(づ ̄3 ̄)づ╭❤~,请重新安装)2...原创 2019-12-04 10:00:16 · 965 阅读 · 2 评论