1. Motivation: Decomposing conv filters into multiple smaller (low-rank) filters speeds up the test-time computation of deep CNNs, but previous works only propose algorithms for linear filters, ignoring the nonlinearities (e.g., ReLU) between layers.
2. Approaches
Assumption: conv filter responses are low rank along certain dimensions; the filter input x is also low rank due to local correlations, so the response y exhibits low-rank behavior.
i. for linear filters:
each filter can be reshaped into a vector of length k^2 * c; stacking the d filters gives a matrix W (d by k^2c), so the layer response is y = Wx (y: d-dim vector)
rewrite y as: y = M (y - ybar) + ybar, where M is d by d with rank d' < d and ybar is the mean response
(such an M must exist if y is low rank; e.g., if some response channels are always 0, take M = the identity matrix with the 1s zeroed out at those rows)
decompose M = PQ^T (P and Q are d by d') and substitute into the equation above; with W' = Q^T W and b = ybar - M ybar, we get
y = PW'x + b
in this case the per-position forward complexity is reduced from O(dk^2c) to O(d'k^2c + dd') ==> nearly d'/d of the original when k^2c >> d (e.g., roughly half the cost for d' = d/2)
How to decompose M? ==> minimize the reconstruction error over rank-d' matrices: min_M sum_i ||y_i - M(y_i - ybar) - ybar||^2
a. by SVD of the centered response matrix; b. by PCA: keep the d' eigenvectors of the response covariance that contribute the most energy, giving M = UU^T (sketched below)
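A minimal numpy sketch of the PCA route. The shapes (d, k, c, d', n) are illustrative assumptions, and Y here is random stand-in data; in practice Y would be responses sampled by running the network on training images (which is what makes the low-rank assumption hold, so the error printed below will be large on random data):

```python
import numpy as np

# Hypothetical shapes: d filters of spatial size k x k over c input channels.
d, k, c, d_prime, n = 256, 3, 64, 64, 10000
rng = np.random.default_rng(0)
W = rng.standard_normal((d, k * k * c))  # d filters, each reshaped to a k^2*c row
Y = rng.standard_normal((n, d))          # n sampled responses y_i (random stand-ins for W x_i)

# PCA: M = U U^T, with U the top-d' eigenvectors of the response covariance,
# minimizes sum_i ||y_i - M(y_i - ybar) - ybar||^2 over rank-d' matrices.
ybar = Y.mean(axis=0)
cov = np.cov(Y, rowvar=False)            # np.cov centers the data itself
eigvals, eigvecs = np.linalg.eigh(cov)   # eigenvalues in ascending order
U = eigvecs[:, -d_prime:]                # top d' eigenvectors, shape (d, d')

# Factor M = P Q^T with P = Q = U, then fold Q^T into the filters.
P = Q = U
W_low = Q.T @ W                          # W' = Q^T W, shape (d', k^2*c)
b = ybar - P @ (Q.T @ ybar)              # b = ybar - M ybar

# Approximate forward pass for one input patch x: y ~ P W' x + b,
# costing d'*k^2*c + d*d' multiplies instead of d*k^2*c.
x = rng.standard_normal(k * k * c)
y_approx = P @ (W_low @ x) + b
print(np.linalg.norm(W @ x - y_approx))  # reconstruction error for this patch
```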
ii. for layers followed by ReLU, how to decompose M?
relax the reconstruction error with auxiliary variables z_i: min over M, b, {z_i} of sum_i ||r(y_i) - r(z_i)||^2 + lambda * ||z_i - (M y_i + b)||^2, where r is the ReLU
when lambda -> infinity, the relaxed result converges to the original error.
solve this optimization by alternating: fix {z_i} and solve for M, b (rank-constrained least squares); then fix M, b and solve each z_i elementwise in closed form; meanwhile anneal lambda from 0.01 to 1 (see the sketch after this item)
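A sketch of the alternating solver under the relaxed objective above. The names fit_nonlinear and solve_z are mine; the z-step closed form follows from minimizing each element over the two ReLU branches, while the (M, b)-step uses plain least squares plus SVD truncation as a simplified stand-in for the paper's rank-constrained solver:

```python
import numpy as np

def relu(v):
    return np.maximum(v, 0.0)

def solve_z(a, p, lam):
    """Elementwise closed-form minimizer of (a - relu(z))^2 + lam * (z - p)^2."""
    z_pos = np.maximum((a + lam * p) / (1.0 + lam), 0.0)  # candidate on branch z >= 0
    f_pos = (a - z_pos) ** 2 + lam * (z_pos - p) ** 2
    z_neg = np.minimum(p, 0.0)                            # candidate on branch z <= 0 (relu(z) = 0)
    f_neg = a ** 2 + lam * (z_neg - p) ** 2
    return np.where(f_pos <= f_neg, z_pos, z_neg)

def fit_nonlinear(Y, d_prime, n_iter=10):
    """Alternate between the z-step and the (M, b)-step while annealing lambda."""
    A = relu(Y)                           # targets r(y_i)
    ybar = Y.mean(axis=0)
    Yc = Y - ybar
    M = np.eye(Y.shape[1])                # initialize at the identity (no compression)
    b = np.zeros(Y.shape[1])
    for lam in np.geomspace(0.01, 1.0, n_iter):   # anneal lambda as in the notes
        # z-step: closed form per element, given current M and b
        Z = solve_z(A, Y @ M.T + b, lam)
        # (M, b)-step: unconstrained least squares, then SVD truncation to rank d'
        zbar = Z.mean(axis=0)
        M_ls, *_ = np.linalg.lstsq(Yc, Z - zbar, rcond=None)
        U, s, Vt = np.linalg.svd(M_ls.T)
        M = (U[:, :d_prime] * s[:d_prime]) @ Vt[:d_prime]
        b = zbar - M @ ybar
    return M, b
```

After fitting, M can be factored as PQ^T (e.g., by SVD) and folded into the filters exactly as in the linear case.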
iii. for the whole model: given a desired speedup ratio, how to determine the d' for each layer?
observation: the drop in model accuracy is negatively correlated with the product of the PCA energies (sum of retained eigenvalues over the total) of all layers ==>
maximize this product, under the constraint that total complexity is reduced by the desired ratio
the authors used a greedy strategy to solve this optimization problem (sketched below)
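One plausible reading of the greedy strategy as a sketch: repeatedly drop one eigenvalue from whichever layer hurts the energy product least, until the complexity budget is met. greedy_ranks, unit_costs, and the linear-in-d' cost model (d'*k^2*c + d*d' = d'*(k^2*c + d) per position) are illustrative assumptions, not the paper's exact procedure, and the demo spectra are fabricated stand-ins:

```python
import numpy as np

def greedy_ranks(eigvals_per_layer, unit_costs, speedup):
    """Pick per-layer ranks d'_l to (approximately) maximize the product of
    PCA energies subject to a total complexity budget.

    eigvals_per_layer : list of descending eigenvalue arrays, one per layer
    unit_costs        : complexity per unit of rank for each layer,
                        e.g. proportional to (k^2*c + d) * output height * width
    speedup           : desired overall speedup ratio
    """
    ranks = [len(e) for e in eigvals_per_layer]           # start at full rank
    totals = [e.sum() for e in eigvals_per_layer]
    cost = lambda: sum(u * r for u, r in zip(unit_costs, ranks))
    budget = cost() / speedup

    def energy(layer, r):
        return eigvals_per_layer[layer][:r].sum() / totals[layer]

    while cost() > budget:
        # drop one eigenvalue from the layer where the energy product drops least
        best_layer, best_keep = None, -1.0
        for layer in range(len(ranks)):
            if ranks[layer] > 1:
                keep = energy(layer, ranks[layer] - 1) / energy(layer, ranks[layer])
                if keep > best_keep:
                    best_layer, best_keep = layer, keep
        if best_layer is None:                            # cannot reduce any further
            break
        ranks[best_layer] -= 1
    return ranks

# Demo on fabricated descending spectra for three layers:
rng = np.random.default_rng(0)
eigs = [np.sort(rng.random(256))[::-1] for _ in range(3)]
print(greedy_ranks(eigs, unit_costs=[1.0, 2.0, 4.0], speedup=3.0))
```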
3. Performance