可能的原因:显卡计算能力不匹配,未使用CDNN
修改:
makefile:
GPU=1
CUDNN=1
OPENCV=0
OPENMP=0
DEBUG=0
ARCH= -gencode arch=compute_30,code=sm_30 \
-gencode arch=compute_35,code=sm_35 \
-gencode arch=compute_50,code=[sm_50,compute_50] \
-gencode arch=compute_61,code=[sm_61,compute_61]
#-gencode arch=compute_20,code=[sm_20,sm_21] #\ This one is deprecated?
ARCH中的数字对应显卡计算能力,获取显卡计算能力可参考链接:https://developer.nvidia.com/cuda-gpus