The MLP Module of the Vision Transformer (ViT)

O_o381

Last modified: 2024-11-29 15:48:50

Tags: transformer, deep learning, artificial intelligence

First published: 2024-11-29 15:44:08

Article link: https://blog.youkuaiyun.com/qq_61706514/article/details/144138525

Diagram:

Code:

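import torch.nn as nn


class Mlp(nn.Module):
    """
    MLP as used in Vision Transformer, MLP-Mixer and related networks
    """
    def __init__(self,
                 in_features,               # dimension of the input features
                 hidden_features=None,      # dimension of the hidden layer; defaults to None
                 out_features=None,         # dimension of the output features; defaults to None
                 act_layer=nn.GELU,         # activation layer class; defaults to nn.GELU
                 drop=0.):                  # dropout rate; the default 0 disables dropout
        super().__init__()
        # If out_features is not given, it defaults to in_features.
        out_features = out_features or in_features
        # If hidden_features is not given, it also defaults to in_features.
        hidden_features = hidden_features or in_features
        self.fc1 = nn.Linear(in_features, hidden_features)
        # Instantiate the activation (nn.GELU by default).
        self.act = act_layer()
        self.fc2 = nn.Linear(hidden_features, out_features)
        # A single Dropout module, applied after the activation and after fc2.
        self.drop = nn.Dropout(drop)

    def forward(self, x):
        x = self.fc1(x)   # expand: in_features -> hidden_features
        x = self.act(x)   # non-linearity
        x = self.drop(x)
        x = self.fc2(x)   # project: hidden_features -> out_features
        x = self.drop(x)
        return x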
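
To see how the class behaves on token-shaped input, here is a minimal usage sketch. The sizes are assumptions chosen for illustration and do not come from the code above: a batch of 2 sequences, 197 tokens each, an embedding dimension of 768, and a hidden width of 4x the embedding dimension, which is a common choice in ViT-style models. The class is repeated in compact form only so that the snippet runs on its own.

```python
import torch
import torch.nn as nn


class Mlp(nn.Module):
    """Same two-layer MLP as above, repeated so this snippet is self-contained."""

    def __init__(self, in_features, hidden_features=None, out_features=None,
                 act_layer=nn.GELU, drop=0.):
        super().__init__()
        out_features = out_features or in_features
        hidden_features = hidden_features or in_features
        self.fc1 = nn.Linear(in_features, hidden_features)
        self.act = act_layer()
        self.fc2 = nn.Linear(hidden_features, out_features)
        self.drop = nn.Dropout(drop)

    def forward(self, x):
        x = self.drop(self.act(self.fc1(x)))
        x = self.drop(self.fc2(x))
        return x


# Assumed, illustrative sizes (not taken from the article).
embed_dim = 768
mlp = Mlp(in_features=embed_dim, hidden_features=4 * embed_dim, drop=0.1)
mlp.eval()  # eval mode turns dropout off, so the forward pass is deterministic

tokens = torch.randn(2, 197, embed_dim)  # (batch, tokens, embed_dim)
with torch.no_grad():
    out = mlp(tokens)

# nn.Linear acts on the last dimension only, so batch and token dims are kept.
print(out.shape)  # torch.Size([2, 197, 768])

# When hidden_features is omitted, the hidden width falls back to in_features.
default_mlp = Mlp(in_features=embed_dim)
print(default_mlp.fc1.out_features)  # 768

# Total learnable parameters: weights and biases of fc1 and fc2.
n_params = sum(p.numel() for p in mlp.parameters())
print(n_params)
```

Because `nn.Linear` is applied to the last dimension, the same MLP is applied independently to every token. Note also that the hidden width only grows beyond the input width when `hidden_features` is passed explicitly; with the defaults, both layers are `in_features` wide.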

Advantages of the GELU function: