Log-Sum-Exp Pooling

最新推荐文章于 2024-11-03 21:15:56 发布

原创

最新推荐文章于 2024-11-03 21:15:56 发布 · 3.6k 阅读

14 ·

CC 4.0 BY-SA版权

Log-Sum-Exp Pooling

Papers

From Image-level to Pixel-level Labeling with Convolutional Networks
ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classiﬁcation and Localization of Common Thorax Diseases

LSE Pooling

在阅读这两篇文章之前，我印象中常用的 Pooling 有 Max Pooling 和 Average Pooling，而这两篇文章中用到了 Log-Sum-Exp Pooling，其定义为：

$x_p=\frac{1}{r}\cdot log[\frac{1}{S}\cdot \sum_{(i,j)\in\mathbf{S}}exp(r\cdot x_{ij})]$

其中， $x_{ij}$ 表示在 $(i, j)$ 的激活值， $(i, j)$ 是池化区域 $\mathbf{S}$ 的一点并且 $S=s\times s$ 是池化区域 $\mathbf{S}$ 总点数， $r$ 是超参数。

在第一篇文章中，作者提到 LSE Pooling 的作用为：

The hyper-parameter r controls how smooth one wants the approximation to be: high r values implies having an effect similar to the max, very low values will have an effect similar to the score averaging. The advantage of this aggregation is that pixels having similar scores will have a similar weight in the training procedure, r controlling this notion of “similarity”.

在第二篇文章中，作者提到 LSE Pooling 的作用为：