问题描述:
服务器关机后开机跑程序发现GPU突然不能用了,
>>> nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
去网上查了一下发现是无法与 nvidia driver 通信,nvcc -V
测试后看了一下驱动还在,于是按照nvidia-smi 报错:无法与 nvidia driver 通信的方法解决了。
解决方案:
>>> sudo apt install dkms
>>> sudo dkms inst