pytorch一些用到的函数记录

最新推荐文章于 2025-06-17 16:20:32 发布

yywxl

最新推荐文章于 2025-06-17 16:20:32 发布

阅读量336

点赞数

CC 4.0 BY-SA版权

分类专栏： pytorch

本文链接：https://blog.youkuaiyun.com/yywxl/article/details/102952177

pytorch 专栏收录该内容

10 篇文章

订阅专栏

本文深入探讨了PyTorch中的关键操作，包括torch.range、torch.arange、torch.meshgrid、torch.cat、torch.stack等张量操作，以及torch.tensor与NumPy、PIL之间的转换。同时，介绍了torch.nn.functional.interpolate插值方法、torchvision.transforms.functional模块的使用，ModuleList和Sequential的对比，以及梯度裁剪torch.nn.utils.clip_grad_norm()的实现。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

文章目录

torch.range troch.arange

import torch
x = torch.range(0, 4)
y = torch.arange(0, 4)
print(x, x.dtype)
print(y, y.dtype)

tensor([0., 1., 2., 3., 4.]) torch.float32
tensor([0, 1, 2, 3]) torch.int64
__main__:2: UserWarning: torch.range is deprecated in favor of torch.arange and will be removed in 0.5. Note that arange generates values in [start; end), not [start; end].

torch.range将被弃用，建议使用torch.arange
torch.arange类似于range,范围为[start, end), dtype为torch.int64,也可以指定类型dtype=torch.float32

torch.meshgrid

先看代码

import torch
x = torch.arange(0,2) #[H]
y = torch.arange(0, 4) #[W]
z = torch.meshgrid(x, y) #([H,W], [H, W])
print(x)
print(y)
print(z)

'''out
tensor([0, 1])
tensor([0, 1, 2, 3])
(tensor([[0, 0, 0, 0],
        [1, 1, 1, 1]]), tensor([[0, 1, 2, 3],
        [0, 1, 2, 3]]))
'''

以上代码生成的grid为两个相同size的矩阵,grid的坐标矩阵即为两个矩阵对应的点组成
rows为行坐标的列向量，cols为列坐标的列向量，他们能够构成的网格大小即为 $H\times W$
生成网格坐标即为两个矩阵想对应点构成的坐标

torch.cat和torch.stack

torch.cat(tensor, dim),拼接，不会增加维度，只会增大tensor的大小,拼接的维度大小需要一致
torch.stack(tensor, dim),堆叠，在维度上进行堆叠,会增加矩阵维度
看代码

import torch
x = torch.arange(0, 4)
y = torch.arange(0, 4)
z = torch.meshgrid(x, y)
a = torch.cat((x, y))
b = torch.stack((x, y))
print(a.shape)
print(b.shape)

'''out
torch.Size([8])
torch.Size([2, 4])
'''

torch.tensor&numpy&PIL

torch.from_numpy(array): numpy转为tensor
tensor.numpy(): tensor转为numpy
np.array(img): pil图像变为numpy
Image.fromarray(img_array): numpy转为PIL图像

torch插值

torch.nn.functional.interpolate(input, size=None, scale_factor=None, mode='nearest', align_corners=None)

input 必须是一个 $(n, c, h, w)$ ，四维矩阵

Parameters
- input (Tensor) – the input tensor

- size (int or Tuple[int] or Tuple[int, int] or Tuple[int, int, int]) – output spatial size.

- scale_factor (float or Tuple[float]) – multiplier for spatial size. Has to match input size if it is a tuple.

- mode (str) – algorithm used for upsampling: 'nearest' | 'linear' | 'bilinear' | 'bicubic' | 'trilinear' | 'area'. Default: 'nearest'

- align_corners (bool, optional) – Geometrically, we consider the pixels of the input and output as squares rather than points. If set to True, the input and output tensors are aligned by the center points of their corner pixels, preserving the values at the corner pixels. If set to False, the input and output tensors are aligned by the corner points of their corner pixels, and the interpolation uses edge value padding for out-of-boundary values, making this operation independent of input size when scale_factor is kept the same. This only has an effect when mode is 'linear', 'bilinear', 'bicubic' or 'trilinear'. Default: False

`torchvison.transforms.functional` PIL和tensor转换

先看torch官网描述

import torchvision.transforms.functional as tf
pil_image = tf.to_pil_image(tensor)
tensor= tf.to_tensor(pil_img)

在数据预处理和训练结果保存时使用很方便

ModuleList 和 Sequential

PyTorch 中的 ModuleList 和 Sequential 转自知乎

torch.nn.utils.clip_grad_norm()

source code
防止梯度爆炸

def clip_grad_norm_(parameters, max_norm, norm_type=2):
    r"""Clips gradient norm of an iterable of parameters.

    The norm is computed over all gradients together, as if they were
    concatenated into a single vector. Gradients are modified in-place.

    Arguments:
        parameters (Iterable[Tensor] or Tensor): an iterable of Tensors or a
            single Tensor that will have gradients normalized
        max_norm (float or int): max norm of the gradients
        norm_type (float or int): type of the used p-norm. Can be ``'inf'`` for
            infinity norm.

    Returns:
        Total norm of the parameters (viewed as a single vector).
    """
    if isinstance(parameters, torch.Tensor):
        parameters = [parameters]
    parameters = list(filter(lambda p: p.grad is not None, parameters))
    max_norm = float(max_norm)
    norm_type = float(norm_type)
    if norm_type == inf:
        total_norm = max(p.grad.data.abs().max() for p in parameters)
    else:
        total_norm = 0
        for p in parameters:
            param_norm = p.grad.data.norm(norm_type)
            total_norm += param_norm.item() ** norm_type
        total_norm = total_norm ** (1. / norm_type)
    clip_coef = max_norm / (total_norm + 1e-6)
    if clip_coef < 1:
        for p in parameters:
            p.grad.data.mul_(clip_coef)
    return total_norm