Armv8 - 64 Advanced SIMD Programming: Vector and Matrix Operations
1. Convolution Function Benchmark
The benchmark timing measurements for convolution functions are presented in Table 1. The size - optimized convolution functions Convolve1Ks5 and Convolve1Ks5_ are significantly faster than their non - size - optimized counterparts.
| Convolution Function | Mean Execution Time (microseconds) |
|---|---|
ConvolveKsN |
21714 |
ConvolveKsN_ |
8156 |
ConvolveKs5 |
4652 |
ConvolveKs5_ |
Armv8-64 SIMD编程:向量与矩阵运算优化
超级会员免费看
订阅专栏 解锁全文
1665

被折叠的 条评论
为什么被折叠?



