我们常用的卷积操作——Conv2d里的参数

长毛三花

已于 2023-11-01 21:52:58 修改

阅读量250

点赞数

分类专栏：神经网络基础文章标签：深度学习

于 2023-10-21 19:33:51 首次发布

本文链接：https://blog.youkuaiyun.com/weixin_42238112/article/details/133965027

版权

神经网络基础专栏收录该内容

3 篇文章

订阅专栏

不仔细看代码是真不知道以前的代码都是白跑了，里面的东西根本就没搞懂。

self.conv1 = nn.Conv2d(
                in_planes, planes, kernel_size=3, stride=stride, padding=1, bias=False)
# 我们常用的这句代码，里面的参数到底都代表什么意思呢？

Args:
    in_channels (int): Number of channels in the input image
    out_channels (int): Number of channels produced by the convolution  # 卷积产生的通道数
    kernel_size (int or tuple): Size of the convolving kernel
    stride (int or tuple, optional): Stride of the convolution. Default: 1
    #           整数型 or 元组，可选   ： 
    padding (int, tuple or str, optional): Padding added to all four sides of
            the input. Default: 0
    padding_mode (string, optional): ``'zeros'``, ``'reflect'``,
            ``'replicate'`` or ``'circular'``. Default: ``'zeros'``
    # 填充模式：默认以0填充
    dilation (int or tuple, optional): Spacing between kernel elements. Default: 1
    # 控制膨胀卷积                     内核元素之间的间距 
    groups (int, optional): Number of blocked connections from input
            channels to output channels. Default: 1
    # 控制分组卷积      默认不分组，groups=1
    bias (bool, optional): If ``True``, adds a learnable bias to the
            output. Default: ``True``

参数 kernel_size，stride，padding，dilation 都可以是一个整数或者是一个元组，一个值的情况将会同时作用于高和宽 两个维度，两个值的元组情况代表分别作用于高或宽维度。

dilation：控制卷积核选的元素之间的间距【可选】默认为1，好像叫膨胀卷积？

1.dilation=1的话（默认情况），效果如图：