What nn.init.xavier_uniform_() does: fill a Tensor with values drawn from a uniform distribution

Published 2023-04-21 16:57:12

Article link: https://blog.youkuaiyun.com/qq_34740277/article/details/130292644

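This article introduces Glorot initialization, also known as Xavier uniform initialization, which fills neural-network weights so as to mitigate vanishing or exploding gradients when training deep models. The sampling bound is given by `a = gain * sqrt(6 / (fan_in + fan_out))`, which depends on both the input and output dimensions of a layer so that the variance of the signal stays roughly balanced in the forward and backward passes. The example shows how to initialize weights with PyTorch's `nn.init.xavier_uniform_` function, and the method traces back to Glorot and Bengio's 2010 paper.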

The official documentation reads as follows:
Signature: nn.init.xavier_uniform_(tensor: torch.Tensor, gain: float = 1.0) -> torch.Tensor
Docstring:
Fills the input Tensor with values according to the method
described in Understanding the difficulty of training deep feedforward neural networks - Glorot, X. & Bengio, Y. (2010), using a uniform
distribution. The resulting tensor will have values sampled from
:math:`\mathcal{U}(-a, a)` where

.. math::
    a = \text{gain} \times \sqrt{\frac{6}{\text{fan\_in} + \text{fan\_out}}}

Also known as Glorot initialization.

Args:
tensor: an n-dimensional torch.Tensor
gain: an optional scaling factor

Examples:
>>> w = torch.empty(3, 5)
>>> nn.init.xavier_uniform_(w, gain=nn.init.calculate_gain('relu'))
File: c:\users\administrator\appdata\roaming\python\python37\site-packages\torch\nn\init.py
Type: function
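
As a quick sanity check on the docstring, the sketch below recomputes the bound `a` by hand and confirms that every sampled value falls inside `(-a, a)`. It assumes a 2-D tensor, for which PyTorch takes `fan_out` from the first dimension and `fan_in` from the second. The variable name `bound` and the fixed seed are illustrative choices, not part of the API.

```python
import math

import torch
import torch.nn as nn

torch.manual_seed(0)  # fixed seed only so the check is repeatable

# Same tensor shape as the docstring example.
w = torch.empty(3, 5)
gain = nn.init.calculate_gain('relu')  # sqrt(2) for ReLU
nn.init.xavier_uniform_(w, gain=gain)

# For a 2-D tensor, fan_out is the number of rows and fan_in the number of columns.
fan_out, fan_in = w.shape
bound = gain * math.sqrt(6.0 / (fan_in + fan_out))

print("bound a:", bound)
print("largest |w|:", w.abs().max().item())
```

The largest absolute value printed should not exceed the bound, up to float32 rounding.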

Here $\mathcal{U}(-a, a)$ denotes the uniform distribution on the interval $(-a, a)$.
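
To see why this bound is chosen, it helps to work out the variance it produces. The short derivation below uses only the standard variance of a symmetric uniform distribution; the numeric case reuses the 3 × 5 tensor and ReLU gain from the docstring example.

$$
W \sim \mathcal{U}(-a, a) \;\Rightarrow\; \operatorname{Var}(W) = \frac{a^2}{3} = \frac{2\,\text{gain}^2}{\text{fan\_in} + \text{fan\_out}}
$$

With $\text{gain} = 1$ this gives a weight variance of $2 / (\text{fan\_in} + \text{fan\_out})$. For the 3 × 5 example with $\text{gain} = \sqrt{2}$, we get $a = \sqrt{2} \times \sqrt{6/8} = \sqrt{1.5} \approx 1.2247$ and $\operatorname{Var}(W) = 1.5 / 3 = 0.5$.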

See also:
Glorot, X. & Bengio, Y. (2010). Understanding the difficulty of training deep feedforward neural networks.