CUDA error: failed to run cuBLAS routine cublasSgemm_v2

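    While setting up a server, after installing CUDA 9.0 and cuDNN, I uploaded my code to run a training job and test the speed of the RTX 2080 Ti. Unexpectedly, it failed with the following error: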

     failed to run cuBLAS routine cublasSgemm_v2

    Environment: CUDA 9.0 + TensorFlow 1.12.0, Ubuntu 16.04 LTS
    GPU: RTX 2080 Ti

    Looking into the cause:

    This is reportedly a bug in NVIDIA CUDA 9.0 itself.
    The fix is to install the four CUDA 9.0 patches, which are the following files:

    cuda9.0.176.1_linux.run

    cuda9.0.176.2_linux.run

    cuda9.0.176.3_linux.run

    cuda9.0.176.4_linux.run

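    The patches can be downloaded from the Patch download link.

    After selecting the options there, the Patch download entries appear. Once downloaded, install them with the following commands.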

sudo sh cuda9.0.176.1_linux.run
sudo sh cuda9.0.176.2_linux.run
sudo sh cuda9.0.176.3_linux.run
sudo sh cuda9.0.176.4_linux.run

    During installation, keep pressing Enter, then type accept at the end to finish. So easy.
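    The four install commands above can also be wrapped in a small helper that runs the patches in order and stops at the first missing file or failed installer. This is only a sketch: the function name `install_cuda_patches`, its directory argument, and the `DRY_RUN` variable are assumptions made for illustration and are not part of the CUDA installers. Each installer still prompts interactively as described above.

```shell
#!/bin/sh
# Sketch: install the four CUDA 9.0 patches in numeric order.
# install_cuda_patches and DRY_RUN are hypothetical names for this example.

install_cuda_patches() {
    # Directory holding the downloaded .run files (defaults to the current one).
    patch_dir="${1:-.}"
    for n in 1 2 3 4; do
        patch="$patch_dir/cuda9.0.176.${n}_linux.run"
        # Stop early if a patch was not downloaded.
        if [ ! -f "$patch" ]; then
            echo "missing patch file: $patch" >&2
            return 1
        fi
        if [ "${DRY_RUN:-0}" = "1" ]; then
            # Only print the command that would be run.
            echo "sudo sh $patch"
        else
            # Run the interactive installer; abort if it fails.
            sudo sh "$patch" || return 1
        fi
    done
}
```

    With the function loaded in the current shell, `DRY_RUN=1 install_cuda_patches ~/Downloads` prints the four commands without running them; leaving `DRY_RUN` unset runs the installers.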

 

2025-03-11 10:44:54.222546: E T:\src\github\tensorflow\tensorflow\stream_executor\cuda\cuda_blas.cc:654] failed to run cuBLAS routine cublasSgemm_v2: CUBLAS_STATUS_EXECUTION_FAILED
Traceback (most recent call last):
  File "C:\Users\19124\anaconda3\envs\rl(tf1.x)\lib\site-packages\tensorflow\python\client\session.py", line 1322, in _do_call
    return fn(*args)
  File "C:\Users\19124\anaconda3\envs\rl(tf1.x)\lib\site-packages\tensorflow\python\client\session.py", line 1307, in _run_fn
    options, feed_dict, fetch_list, target_list, run_metadata)
  File "C:\Users\19124\anaconda3\envs\rl(tf1.x)\lib\site-packages\tensorflow\python\client\session.py", line 1409, in _call_tf_sessionrun
    run_metadata)
tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed : a.shape=(128, 7), b.shape=(7, 128), m=128, n=128, k=7
  [[Node: Critic/dense/MatMul = MatMul[T=DT_FLOAT, transpose_a=false, transpose_b=false, _device="/job:localhost/replica:0/task:0/device:GPU:0"](_arg_state_0_0/_5, Critic/dense/kernel/read)]]
  [[Node: Critic/dense_1/BiasAdd/_7 = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_21_Critic/dense_1/BiasAdd", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "D:\threeMotorsProject\threeMotorsProject\RL\PPO\ppo.py", line 157, in <module>
    train()
  File "D:\threeMotorsProject\threeMotorsProject\RL\PPO\ppo.py", line 122, in train
    ppo.update(np.vstack(buffer_s), np.vstack(buffer_a), np.array(discounted_r)[:, np.newaxis])
  File "D:\threeMotorsProject\threeMotorsProject\RL\PPO\ppo.py", line 80, in update
    adv = self.sess.run(self.v, {self.S: s}) - r
  File "C:\Users\19124\anaconda3\envs\rl(tf1.x)\lib\site-packages\tensorflow\python\client\session.py",