1. Training on whole images is more efficient and equally effective.
When the receptive fields of the sampled patches overlap significantly, both
feedforward computation and backpropagation are much
more efficient when computed layer-by-layer over an entire
image than when computed independently patch-by-patch.
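A minimal numpy sketch of this point for a single convolution layer (the array sizes and the 5x5 filter are illustrative assumptions, not values from the paper): the whole-image pass computes each output unit once, while per-patch evaluation redoes the work shared by overlapping patches, yet the results are identical.

```python
import numpy as np

def conv2d_valid(image, kernel):
    """Plain 'valid' 2-D cross-correlation, stride 1, no padding."""
    kh, kw = kernel.shape
    H, W = image.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

rng = np.random.default_rng(0)
image = rng.standard_normal((32, 32))
kernel = rng.standard_normal((5, 5))

# Whole-image pass: each output value is computed exactly once.
dense = conv2d_valid(image, kernel)

# Patch-by-patch: one overlapping 5x5 patch per output unit, each convolved
# separately, so the shared products are recomputed many times over.
patchwise = np.array([[conv2d_valid(image[i:i + 5, j:j + 5], kernel)[0, 0]
                       for j in range(dense.shape[1])]
                      for i in range(dense.shape[0])])

assert np.allclose(dense, patchwise)  # same predictions, very different cost
```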
2. Shift-and-stitch and filter dilation
Dense predictions can be obtained from coarse outputs by
stitching together outputs from shifted versions of the
input. If the output is downsampled by a factor of f, shift
the input x pixels to the right and y pixels down, once for
every (x, y) such that 0 <= x, y < f. Process each of these f^2
inputs, and interlace the outputs so that the predictions correspond
to the pixels at the centers of their receptive fields.
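A small numpy sketch of this interlacing, assuming a toy stand-in coarse_net for the downsampling network (a 3x3 box filter followed by stride-f subsampling) and an input whose sides are multiples of f; each "shift" is realized by moving the stride-f sampling grid to offset (y, x), which is equivalent to shifting the input.

```python
import numpy as np

def coarse_net(image, f):
    """Toy stand-in for a net whose output is downsampled by factor f:
    a 3x3 box filter followed by stride-f subsampling."""
    H, W = image.shape
    padded = np.pad(image, 1, mode="edge")
    blurred = np.zeros((H, W))
    for i in range(H):
        for j in range(W):
            blurred[i, j] = padded[i:i + 3, j:j + 3].mean()
    return blurred[::f, ::f]

def shift_and_stitch(image, f):
    """Dense output by interlacing the coarse outputs of f*f shifted inputs."""
    H, W = image.shape                              # assume H, W are multiples of f
    padded = np.pad(image, ((0, f), (0, f)), mode="edge")
    dense = np.zeros((H, W))
    for y in range(f):
        for x in range(f):
            shifted = padded[y:y + H, x:x + W]      # shifted copy of the input
            # Its coarse output samples the grid at offset (y, x), so it fills
            # every f-th position of the dense map starting there.
            dense[y::f, x::f] = coarse_net(shifted, f)
    return dense

img = np.random.default_rng(0).standard_normal((32, 32))
out = shift_and_stitch(img, f=4)
assert out.shape == img.shape                       # one prediction per pixel
```

Note that this requires f^2 forward passes of the coarse net; the filter-dilation trick described next reproduces the same dense output more directly.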
A trick to do so:
Consider a layer (convolution or pooling) with input stride s, and a subsequent convolution layer with filter
weights fij (eliding the irrelevant feature dimensions). Setting
the earlier layer’s input stride to one upsamples its output
by a factor of s. However, convolving the original filter
with the upsampled output does not produce the same
result as shift-and-stitch, because the original filter only sees
a reduced portion of its (now upsampled) input. To produce
the same result, dilate (or "rarefy") the filter by forming
f'_{ij} = f_{i/s, j/s} if s divides both i and j, and 0 otherwise
(with i and j zero-based). Reproducing the full net output of
shift-and-stitch involves repeating this filter enlargement
layer-by-layer until all subsampling is removed. (In practice,
this can be done efficiently by processing subsampled
versions of the upsampled input.)
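A minimal numpy sketch of this rarefication for one convolution layer (sizes and the stride s = 2 are illustrative assumptions): the dilated filter applied to the stride-1 (un-subsampled) input reproduces, at every s-th location, exactly what the original filter computes on the subsampled input. Modern frameworks expose the same idea as the dilation argument of their convolution layers.

```python
import numpy as np

def dilate_filter(f, s):
    """Rarefy a filter: f'[i, j] = f[i//s, j//s] if s divides both i and j, else 0."""
    kh, kw = f.shape
    f_dil = np.zeros(((kh - 1) * s + 1, (kw - 1) * s + 1))
    f_dil[::s, ::s] = f
    return f_dil

def conv2d_valid(x, k):
    """Plain 'valid' 2-D cross-correlation, stride 1, no padding."""
    kh, kw = k.shape
    H, W = x.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(x[i:i + kh, j:j + kw] * k)
    return out

rng = np.random.default_rng(0)
x = rng.standard_normal((16, 16))   # earlier layer's output with its stride set to 1
f = rng.standard_normal((3, 3))     # original filter of the subsequent conv layer
s = 2                               # the stride that was removed

# Original pipeline: subsample by s, then convolve with the original filter.
coarse = conv2d_valid(x[::s, ::s], f)

# Trick: keep the full-resolution input and convolve with the dilated filter.
dense = conv2d_valid(x, dilate_filter(f, s))

# The dense output agrees with the coarse output at every s-th location.
assert np.allclose(dense[::s, ::s], coarse)
```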
Using dilated (à trous) convolutions keeps the generated feature maps dense [1].
[1] F. Yu and V. Koltun, "Multi-scale context aggregation by dilated convolutions," in Proc. Int. Conf. Learn. Represent., 2016.