einops基础用法

最新推荐文章于 2024-08-09 08:18:04 发布

YawenLuo

最新推荐文章于 2024-08-09 08:18:04 发布

阅读量252

点赞数

分类专栏： python pytorch 安装配置 debug 计算机视觉文章标签： python 深度学习人工智能

本文链接：https://blog.youkuaiyun.com/qq_52106152/article/details/133796340

版权

python 同时被 3 个专栏收录

19 篇文章

订阅专栏

pytorch 安装配置 debug

3 篇文章

订阅专栏

计算机视觉

1 篇文章

订阅专栏

本文介绍了PyTorch中的torch.einsum函数，详细讲解了如何使用equation进行张量乘法、求和、对角线操作、维度变换以及重复和缩减操作，如rearrange、repeat和reduce。实例展示了如何利用这些功能处理多维数据和进行高效的计算.

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

基本概念

自由索引：出现在箭头右边的索引，可以遍历的索引。
求和索引：只出现在箭头左边的索引，表示中间计算结果需要这个维度上求和之后才能得到输出。

基础规则

规则一，equation 箭头左边，在不同输入之间重复出现的索引表示，把输入张量沿着该维度做乘法操作
规则二，只出现在 equation 箭头左边的索引，表示中间计算结果需要在这个维度上求和，也就是上面提到的求和索引
规则三，equation 箭头右边的索引顺序可以是任意的，比如上面的 “ik,kj->ij” 如果写成 “ik,kj->ji”，那么就是返回输出结果的转置

特殊规则

equation 可以不写包括箭头在内的右边部分，那么在这种情况下，输出张量的维度会根据默认规则推导。就是把输入中只出现一次的索引取出来，然后按字母表顺序排列，比如上面的矩阵乘法 “ik,kj->ij” 也可以简化为 “ik,kj”，根据默认规则，输出就是 “ij” 与原来一样。
equation 中支持 “…” 省略号，用于表示用户并不关心的索引，比如只对一个高维张量的最后两维做转置可以这么写

a = torch.zeros([2, 3, 4, 5, 6])
a = torch.einsum('...ij -> ...ji', a)
print(f"the dealed a shape is {a.shape}")
# the dealed a shape is torch.Size([2, 3, 4, 6, 5])

实例

# 取出对角线元素
b = torch.arange(16).reshape(4, 4)
print(b)
b = torch.einsum('ii->i', b)
print(b)
# tensor([[ 0,  1,  2,  3],
#         [ 4,  5,  6,  7],
#         [ 8,  9, 10, 11],
#         [12, 13, 14, 15]])
# tensor([ 0,  5, 10, 15])

# get sum
a = torch.arange(6).reshape(2, 3)
print(a)
a = torch.einsum('ij ->', a)
print(a)
# tensor([[0, 1, 2],
#         [3, 4, 5]])
# tensor(15)

# get sum by row、clo
a = torch.arange(6).reshape(2, 3)
print(f"sum example : \na = {a}")
print(torch.einsum('ij -> i', a))
print(torch.einsum('ij -> j', a))
# sum example : 
# a = tensor([[0, 1, 2],
#         [3, 4, 5]])
# tensor([ 3, 12])
# tensor([3, 5, 7])

用法表格

在这里插入图片描述

rearange

import torch
from einops import rearrange
 
images = torch.randn((32,30,40,3))
# (32, 30, 40, 3)
print(rearrange(images, 'b h w c -> b h w c').shape)
 
# (960, 40, 3)
print(rearrange(images, 'b h w c -> (b h) w c').shape)
 
# (30, 1280, 3)
print(rearrange(images, 'b h w c -> h (b w) c').shape)
 
# (32, 3, 30, 40)
print(rearrange(images, 'b h w c -> b c h w').shape)
 
# (32, 3600)
print(rearrange(images, 'b h w c -> b (c h w)').shape)
 
# ---------------------------------------------
# 这里(h h1) (w w1)就相当于h与w变为原来的1/h1,1/w1倍
 
# (128, 15, 20, 3)
print(rearrange(images, 'b (h h1) (w w1) c -> (b h1 w1) h w c', h1=2, w1=2).shape)
 
# (32, 15, 20, 12)
print(rearrange(images, 'b (h h1) (w w1) c -> b h w (c h1 w1)', h1=2, w1=2).shape)

repeat

import torch
from einops import repeat
 
image = torch.randn((30,40))
 
# 整体复制 (30, 40, 3)
print(repeat(image, 'h w -> h w c', c=3).shape)
 
# 按行复制 (60, 40)
print(repeat(image, 'h w -> (repeat h) w', repeat=2).shape)
 
# 按列复制 (30, 120) 注意：(repeat w)与(w repeat)结果是不同的
print(repeat(image, 'h w -> h (repeat w)', repeat=3).shape)
 
# (60, 80)
print(repeat(image, 'h w -> (h h2) (w w2)', h2=2, w2=2).shape)

reduce

import torch
from einops import reduce
 
x = torch.randn(3, 5, 5)
# (5, 5)
print(reduce(x, 'c h w -> h w', 'max').shape)
 
x = torch.randn(1, 3, 6, 6)
# (1, 3, 3, 3) 注意：如果不是整除会报错
y1 = reduce(x, 'b c (h h1) (w w1) -> b c h w', 'max', h1=2, w1=2)
print(y1.shape)
 
# Adaptive max-pooling:(1, 3, 3, 2)
print(reduce(x, 'b c (h h1) (w w1) -> b c h1 w1', 'max', h1=3, w1=2).shape)
 
# Global average pooling:(1, 3)
print(reduce(x, 'b c h w -> b c', 'mean').shape)