【pytorch】nn.conv1d的使用

最新推荐文章于 2025-02-25 11:26:09 发布

安安爸Chris

最新推荐文章于 2025-02-25 11:26:09 发布

阅读量3.3k

点赞数

分类专栏： pytorch 文章标签： pytorch 深度学习 python

本文链接：https://blog.youkuaiyun.com/mimiduck/article/details/119993470

版权

pytorch 专栏收录该内容

24 篇文章

订阅专栏

官方文档在这里。

conv1d具体不做介绍了，本篇只做pytorch的API使用介绍.

torch.nn.Conv1d(in_channels, out_channels, kernel_size, stride=1, padding=0, dilation=1, groups=1, bias=True, padding_mode=‘zeros’, device=None, dtype=None)

计算公式

输入张量的Shape一般为 $N,C_{in}, L)$ ,其中N为batch_size,一般也可用B代替；
$C_{in}$ 输入张量倒数第二维，表示Channel的数量；
L是输入信号序列的长度；

输出张量的shape一般为 $N,C_{out},L_{out})$ , 下一节会介绍怎么计算来的。

公式截图如下，

星号表示卷积运算，torch的运算这里使用的是cross-correlation,即互相关运算。

运算过程

假设输入的序列为 [1,5,7,3,2,1,6,9], 卷积核为[2,4,6,1,3]

那么对应nn.conv1d初始化就是 in_channel=1， out_channel=1, stride=1, padding=0

计算过程中，每计算一次（kernel_size），移动一步(stride),
这个过程就是上图公式中的 $weight(C_{out},k) * input(N_i,k)$ ,其中k是out_channel的，
如果out_channel个数大于1，则这样的过程会有多次。
运算过程

入参

in_channels
Number of channels in the input image
输入的Channel数，对应的是输入数据的倒数第二维
out_channels
Number of channels produced by the convolution
输出的Channel数，对应输出数据的倒数第二维度
kernel_size
Size of the convolving kernel
即卷积核长度
它可以是一个数字也可以是一个tuple（但是conv1d下，tuple是否有意义？）
stride
Stride of the convolution. Default: 1
卷积核步长
padding
Padding added to both sides of the input. Default: 0
padding_mode
‘zeros’, ‘reflect’, ‘replicate’ or ‘circular’. Default: ‘zeros’
dilation
Spacing between kernel elements. Default: 1
groups
Number of blocked connections from input channels to output channels. Default: 1
bias
If True, adds a learnable bias to the output. Default: True
是否需要bias

输出结果的shape计算

计算output的shape

代码举例

1

net = nn.Conv1d(in_channels=1, out_channels=8, kernel_size=4, stride=1, bias=False)
x = torch.linspace(1,10,10).view(1,1,10)

y = net(x)
print(y.shape)

计算结果

torch.Size([1, 8, 7])

2. 修改stride

net = nn.Conv1d(in_channels=1, out_channels=8, kernel_size=4, stride=2, bias=False)
x = torch.linspace(1,10,10).view(1,1,10)

y = net(x)
print(y.shape)

计算结果

torch.Size([1, 8, 4])

3 添加padding

net = nn.Conv1d(in_channels=1, out_channels=8, kernel_size=4, stride=1, padding=1, padding_mode='zeros',bias=False)
x = torch.linspace(1,10,10).view(1,1,10)

y = net(x)
print(y.shape)

计算结果

torch.Size([1, 8, 5])

4 修改dilation

net = nn.Conv1d(in_channels=1, out_channels=8, kernel_size=4, stride=2, dilation=2,bias=False)
x = torch.linspace(1,10,10).view(1,1,10)

y = net(x)
print(y.shape)

计算结果

torch.Size([1, 8, 2])