Sparse-PE: Revolutionizing Sparse CNN Acceleration
1. Introduction
In the realm of Convolutional Neural Network (CNN) inferences, acceleration has been a focal point of research. Various architectures have been proposed to enhance the efficiency of CNN computations, with a particular emphasis on exploiting sparsity to reduce compute and memory access volume. This blog explores the limitations of existing dense and sparse architectures and introduces a novel multithreaded Processing Element (PE) called Sparse - PE, which addresses these limitations and offers improved performance.
2. Related Work
2.1 Dense Architectures
- Compute Optimization : Some accelerators focus on optimizing compute operations. For
超级会员免费看
订阅专栏 解锁全文
1750

被折叠的 条评论
为什么被折叠?



