### 构建和训练基于PyTorch的光流估计网络
要构建和训练一个基于 PyTorch 的光流估计神经网络(如 FlowNet),可以按照以下方式设计模型架构、加载数据集以及定义损失函数。
#### 1. 定义模型架构
FlowNet 系列通常由卷积层、池化层、反卷积层组成,用于提取特征并预测光流场。以下是 FlowNetS 的基本实现:
```python
import torch
import torch.nn as nn
class FlowNetS(nn.Module):
    """A minimal FlowNetS-style encoder/decoder for optical-flow estimation.

    Input:  (N, 6, H, W) — two RGB frames stacked along the channel axis.
    Output: (N, 2, H, W) — a per-pixel (u, v) flow field, restored to the
            input resolution when H and W are multiples of 32.
    """

    def __init__(self):
        super(FlowNetS, self).__init__()
        slope = 0.1  # leak factor shared by every activation in FlowNetS

        # Encoder: six convolutions; five of them halve the spatial size,
        # giving an overall 32x downsampling at the bottleneck.
        self.conv_layers = nn.Sequential(
            nn.Conv2d(6, 64, 7, stride=2, padding=3),
            nn.LeakyReLU(slope),
            nn.Conv2d(64, 128, 5, stride=2, padding=2),
            nn.LeakyReLU(slope),
            nn.Conv2d(128, 256, 5, stride=2, padding=2),
            nn.LeakyReLU(slope),
            nn.Conv2d(256, 512, 3, stride=2, padding=1),
            nn.LeakyReLU(slope),
            nn.Conv2d(512, 512, 3, stride=1, padding=1),
            nn.LeakyReLU(slope),
            nn.Conv2d(512, 1024, 3, stride=2, padding=1),
            nn.LeakyReLU(slope),
        )

        # Decoder: five stride-2 transposed convolutions undo the 32x
        # downsampling; the final layer emits the 2-channel flow map.
        self.deconv_layers = nn.Sequential(
            nn.ConvTranspose2d(1024, 512, 4, stride=2, padding=1),
            nn.LeakyReLU(slope),
            nn.ConvTranspose2d(512, 256, 4, stride=2, padding=1),
            nn.LeakyReLU(slope),
            nn.ConvTranspose2d(256, 128, 4, stride=2, padding=1),
            nn.LeakyReLU(slope),
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1),
            nn.LeakyReLU(slope),
            nn.ConvTranspose2d(64, 2, 4, stride=2, padding=1),
        )

    def forward(self, x):
        """Encode the stacked frame pair, then decode it into a flow map."""
        return self.deconv_layers(self.conv_layers(x))
```
上述代码展示了如何创建一个简单的 FlowNetS 模型[^3]。
---
#### 2. 数据预处理与加载
为了训练该模型,需要准备包含连续帧的数据集(例如 FlyingChairs 或 MPI-Sintel)。可以通过 `torch.utils.data.DataLoader` 加载数据。
```python
from torchvision import transforms
from torch.utils.data import DataLoader
# Custom dataset pairing stacked image frames with ground-truth flow fields.
class OpticalFlowDataset(torch.utils.data.Dataset):
    """Wraps parallel sequences of image pairs and flow maps.

    Each item is the tuple ``(image_pair, flow)``. If ``transform`` is
    given, it is applied to the image pair only — never to the flow.
    """

    def __init__(self, image_pairs, flows, transform=None):
        self.image_pairs = image_pairs
        self.flows = flows
        self.transform = transform

    def __len__(self):
        return len(self.image_pairs)

    def __getitem__(self, idx):
        sample = self.image_pairs[idx]
        target = self.flows[idx]
        if self.transform:
            sample = self.transform(sample)
        return sample, target
# Data transforms applied to each image pair before it reaches the model.
# NOTE(review): Normalize uses 3-channel ImageNet statistics, but the model
# expects a 6-channel stacked pair — verify how/when the two frames are
# stacked relative to this transform.
transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225])
])
# NOTE(review): image_pairs and flows are assumed to be prepared earlier
# (e.g. loaded from FlyingChairs / MPI-Sintel) — not defined in this snippet.
dataset = OpticalFlowDataset(image_pairs=image_pairs, flows=flows, transform=transform)
data_loader = DataLoader(dataset, batch_size=4, shuffle=True)
```
---
#### 3. 训练流程
在训练过程中,需定义损失函数(如欧氏距离损失)来衡量预测光流与真实光流之间的差异。
```python
# Select GPU when available; the model and every batch are moved to it.
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
model = FlowNetS().to(device)
# MSE between predicted and ground-truth flow (Euclidean-style penalty).
criterion = nn.MSELoss()
optimizer = torch.optim.Adam(model.parameters(), lr=0.0001)
# NOTE(review): num_epochs is assumed to be defined earlier — confirm.
for epoch in range(num_epochs):
    model.train()  # enable training-mode behavior (dropout/batch-norm)
    total_loss = 0
    for images, true_flows in data_loader:
        images = images.to(device)
        true_flows = true_flows.to(device)
        optimizer.zero_grad()  # clear gradients accumulated by the last step
        predicted_flows = model(images)
        loss = criterion(predicted_flows, true_flows)
        loss.backward()
        optimizer.step()
        total_loss += loss.item()  # .item() detaches to a Python float
    # Average loss over batches for a per-epoch progress report.
    avg_loss = total_loss / len(data_loader)
    print(f"Epoch [{epoch+1}/{num_epochs}], Loss: {avg_loss:.4f}")
以上代码片段描述了完整的训练循环[^5]。
---
#### 4. 测试与评估
完成训练后,可使用测试集验证模型性能,并可视化预测结果。
```python
def visualize_flow(flow_map):
    """Render a 2-channel optical-flow tensor as an HSV-encoded color image.

    Args:
        flow_map: tensor of shape (2, H, W); channel 0 is the horizontal
            displacement and channel 1 the vertical displacement
            (assumed from the indexing below — confirm against the model).

    Hue encodes flow direction, value encodes flow magnitude.
    """
    # Detach and move to CPU once, so the function also accepts CUDA tensors
    # (the original called .numpy() per channel, which fails on GPU tensors).
    flow = flow_map.detach().cpu().numpy()
    hsv = np.zeros((flow.shape[1], flow.shape[2], 3), dtype=np.uint8)
    mag, ang = cv2.cartToPolar(flow[0], flow[1])
    hsv[..., 0] = ang * 180 / np.pi / 2  # OpenCV hue range is [0, 180)
    hsv[..., 1] = 255                    # full saturation everywhere
    hsv[..., 2] = cv2.normalize(mag, None, 0, 255, cv2.NORM_MINMAX)
    # Bug fix: matplotlib's imshow expects RGB. The original converted with
    # COLOR_HSV2BGR, which swaps the red and blue channels on display.
    rgb = cv2.cvtColor(hsv, cv2.COLOR_HSV2RGB)
    plt.imshow(rgb)
    plt.show()
# Run one test batch through the trained model and visualize the first
# predicted flow field; gradients are disabled since this is inference only.
with torch.no_grad():
    model.eval()  # switch off training-mode layers (dropout/batch-norm)
    # NOTE(review): test_data_loader is assumed to be built like data_loader
    # above but over a held-out test split — confirm.
    test_images, _ = next(iter(test_data_loader))
    predictions = model(test_images.to(device)).cpu()
    visualize_flow(predictions[0])  # show the flow map for the first sample
```
---