今天在跑自己设计的深度学习模型时出现了以下问题:
InternalError: 2 root error(s) found.
(0) Internal: Blas GEMM launch failed : a.shape=(12, 3), b.shape=(3, 32), m=12, n=32, k=3
[[node model/dense_3/MatMul (defined at <ipython-input-16-5a73e5716594>:10) ]]
[[gradient_tape/model/token_and_position_embedding/embedding_1/embedding_lookup/Reshape/_98]]
(1) Internal: Blas GEMM launch failed : a.shape=(12, 3), b.shape=(3, 32), m=12, n=32, k=3
[[node model/dense_3/MatMul (defined at <ipython-input-16-5a73e5716594>:10) ]]
0 successful operations.
0 derived errors ignored. [Op:__inference_train_function_3032]
Function call stack:
train_function -> train_function
出现该问题的原因可能是各种库之间的版本依赖问题,或者是GPU被占用了。
而我的情况是GPU被占用了,因此我把其他占用GPU的程度都关了,再次跑模型,OK了!