float16 matmul is way slower than float32 matmul on CPU #24738
Open
dchatterjee172 opened this issue on 7 Jan 2019 · 1 comment
Open
float16 matmul is way slower than float32 matmul on CPU#24738
dchatterjee172 opened this issue on 7 Jan 2019 · 1 comment
Comments
dchatterjee172 commented on 7 Jan 2019
| System information
You can collect some of this information using our environment capture script Describe the current behavior Code to reproduce the issue output |
jvishnuvardhan self-assigned this on 9 Jan 2019
jvishnuvardhan added type:support type:others comp:ops labels on 9 Jan 2019
jvishnuvardhan assigned rmlarsen and unassigned jvishnuvardhan on 9 Jan 2019
naisy commented on 9 Jan 2019
| It is simple, because Intel Architecture does not support FP16. |
👍 1
本文通过实验比较了在CPU上使用float16和float32进行矩阵乘法的性能差异,发现float16的运算速度远低于float32,这主要是由于Intel架构不支持FP16,导致float16的性能无法充分发挥。
1451

被折叠的 条评论
为什么被折叠?



