keras Conv2D参数详解

最新推荐文章于 2025-04-28 16:07:48 发布

koala_cola

最新推荐文章于 2025-04-28 16:07:48 发布

阅读量3.2w

点赞数 19

本文链接：https://blog.youkuaiyun.com/koala_cola/article/details/106883961

版权

Conv2D layer 二维卷积层

本文是对keras的英文API DOC的一个尽可能保留原意的翻译和一些个人的见解，会补充一些对个人对卷积层的理解。这篇博客写作时本人正大二，可能理解不充分。

`Conv2D` class

tf.keras.layers.Conv2D(
    filters,
    kernel_size,
    strides=(1, 1),
    padding="valid",
    data_format=None,
    dilation_rate=(1, 1),
    groups=1,
    activation=None,
    use_bias=True,
    kernel_initializer="glorot_uniform",
    bias_initializer="zeros",
    kernel_regularizer=None,
    bias_regularizer=None,
    activity_regularizer=None,
    kernel_constraint=None,
    bias_constraint=None,
    **kwargs
)

二维卷积层(例如空间卷积图像)。

这一层创建了一个卷积核，它与这一层的输入卷积以产生一个输出张量。
如果 use_bias为真，则创建一个偏差向量并添加到输出中。
最后，如果activation不是None，它也应用于输出。

当使用此层作为模型的第一层时，提供关键字参数input_shape(整数元组，不包括样本轴（不需要写batch_size）)，例如。
input_shape=(128, 128, 3)表示 128x128的 RGB 图像data_format="channels_last"

这个data_format参数是这样影响input_shape工作的如果不填写，默认是channels_last，否则可以填写channels_first。前者的会把input_shape这个三元组给识别成(batch_size, height, width, channels)，后者则会识别成(batch_size, channels, height, width)不过样本轴不需要自己填写（不然反而会报错）

Examples

>>> # The inputs are 28x28 RGB images with `channels_last` and the batch   
>>> # size is 4.  
>>> input_shape = (4, 28, 28, 3) 
>>> x = tf.random.normal(input_shape) 
>>> y = tf.keras.layers.Conv2D( ...
                               2,3,activation='relu',input_shape=input_shape[1:])(x) 
>>> print(y.shape) (4, 26, 26, 2)

>>> # With `dilation_rate` as 2.   
>>> input_shape = (4, 28, 28, 3) 
>>> x = tf.random.normal(input_shape) 
>>> y = tf.keras.layers.Conv2D( ... 
                   2,3,activation='relu',dilation_rate=2,input_shape=input_shape[1:])(x) 
>>> print(y.shape) (4, 24, 24, 2)

>>> # With `padding` as "same".   
>>> input_shape = (4, 28, 28, 3) 
>>> x = tf.random.normal(input_shape) 
>>> y = tf.keras.layers.Conv2D( ... 
2, 3, activation='relu', padding="same", input_shape=input_shape[1:])(x) 
>>> print(y.shape) (4, 28, 28, 2)

>>> # With extended batch shape [4, 7]:   
>>> input_shape = (4, 7, 28, 28, 3) 
>>> x = tf.random.normal(input_shape) 
>>> y = tf.keras.layers.Conv2D( ... 
2, 3, activation='relu', input_shape=input_shape[2:])(x) 
>>> print(y.shape) (4, 7, 26, 26, 2)

参数[以下方框内注释内容为个人理解，仅供参考]

filters: 整数，输出空间的维数(即在卷积中输出滤波器的数量)。[注：此处认为是与input_shape的通道数一样，比如是RGB就是3，是灰度图就是1]
kernel_size:一个整数或2个整数的元组/列表，指定二维卷积窗口的高度和宽度。
可以是单个整数，为所有空间维度指定相同的值。
strides: 一个整数或两个整数的元组/列表，指定沿高度和宽度的卷积的步长。可以是单个整数，为所有空间维度指定相同的值。
指定任何strides值!= 1与指定任何dilation_rate值!= 1是不兼容的。
padding: one of "valid" or "same" (不区分大小写).[注：卷积会导致输出图像越来越小，图像边界信息丢失，若想保持卷积后的图像大小不变，需要设置padding参数为same]
data_format: 一个字符串参数，要么是 channels_last (默认) ，要么就是 channels_first. 是输入的维度顺序排列理解。 channels_last 对应着 (batch_size, height, width, channels) ，而 channels_first 对应的输入为 (batch_size, channels, height, width). 他默认为在~/.keras/keras.json.中的image_data_format 的值如果你不设置这个地方，它就会是channels_last.
dilation_rate: 一个整数或两个整数的元组/列表，指定用于扩展卷积的扩展率。可以是单个整数，为所有空间维度指定相同的值。该参数定义了卷积核处理数据时各值的间距。在相同的计算条件下，该参数提供了更大的感受野。该参数经常用在实时图像分割中。当网络层需要较大的感受野，但计算资源有限而无法提高卷积核数量或大小时，可以考虑使用。

下图为卷积核为3，扩展率为2的和没有padding的二维卷积
groups: A positive integer specifying the number of groups in which the input is split along the channel axis. Each group is convolved separately with filters / groups filters. The output is the concatenation of all the groups results along the channel axis. Input channels and filters must both be divisible by groups.
activation: 使用激活函数。如果不特别指定，将不会使用任何的激活函数 ( 具体的可选项可以参考keras.activations).
use_bias: Boolean类型, 这一层是否有bias单元.
kernel_initializer: 默认是GlorotUniform ，通过输入和输出单元个数来推演权重矩阵尺寸 ( 可选项在 keras.initializers).
bias_initializer: kernel bias单元的初始化器，默认是0 ( 可选项在 keras.initializers).
kernel_regularizer: Regularizer function applied to the kernel weights matrix (see keras.regularizers).
bias_regularizer: Regularizer function applied to the bias vector ( see keras.regularizers).
activity_regularizer: Regularizer function applied to the output of the layer (its “activation”) ( see keras.regularizers).
kernel_constraint: Constraint function applied to the kernel matrix ( see keras.constraints).
bias_constraint: Constraint function applied to the bias vector ( see keras.constraints).