A source-level look at the shape of weight in PyTorch's Linear class

License: CC 4.0 BY-SA


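import torch
from torch import nn

# A linear layer mapping 20 input features to 30 output features
m = nn.Linear(20, 30)
input = torch.randn(128, 20)  # a batch of 128 samples with 20 features each
output = m(input)

print(output.size())   # torch.Size([128, 30])
print(m.weight.shape)  # torch.Size([30, 20]), i.e. [out_features, in_features]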
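The relation between the layer's output and its stored weight can be checked directly. The following is a minimal sketch, assuming the same layer sizes as the snippet above; the variable names (`x`, `out_manual`, `out_functional`) and the tolerance are illustrative choices, not part of the original post. It compares the module's output with an explicit `x @ weight.t() + bias` and with `torch.nn.functional.linear`.

```python
import torch
from torch import nn
import torch.nn.functional as F

torch.manual_seed(0)

m = nn.Linear(20, 30)
x = torch.randn(128, 20)

with torch.no_grad():
    out_module = m(x)                                # what the layer returns
    out_manual = x @ m.weight.t() + m.bias           # explicit transpose of the [30, 20] weight
    out_functional = F.linear(x, m.weight, m.bias)   # functional form of the same operation

print(m.weight.shape)  # weight is stored as [out_features, in_features]
print(m.bias.shape)    # bias has one entry per output feature

# Loose tolerance, since the two paths may accumulate floating-point error differently
manual_matches = torch.allclose(out_module, out_manual, atol=1e-5)
functional_matches = torch.allclose(out_module, out_functional, atol=1e-5)
print(manual_matches, functional_matches)
```

The weight has to be transposed before the matrix product because each row of `x` has `in_features` entries, while `weight` has `in_features` columns rather than rows.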

4 comments

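Hello.Monica 2020.03.24
A quick question for the author: why is weight first defined as [out_features, in_features] and then transposed inside linear() via matmul(weight.t())? Couldn't it be defined directly as [in_features, out_features], so that linear() simply calls matmul(weight)?
- UESTC_KingTheon replied to Hello.Monica 2020.08.13
  "It's historical weight layout, changing it is backward-incompatible. Unless there is some BIG benefit in terms of speed or convenience, we wont break userland." In other words, the layout was presumably chosen this way from the start, and changing it now would break compatibility. Unless a change brought a large improvement, it stays as it is.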
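The alternative layout raised in the question can be tried out numerically. The sketch below is a hypothetical illustration, not how PyTorch stores the parameter: `w_alt` is a made-up name for a copy of the weight kept as [in_features, out_features], and the tolerance is an arbitrary choice. It suggests that the two layouts give the same result, which fits the reply's point that the existing layout is kept for compatibility rather than for mathematical reasons.

```python
import torch
from torch import nn

torch.manual_seed(0)

m = nn.Linear(20, 30)
x = torch.randn(128, 20)

with torch.no_grad():
    # Hypothetical alternative storage: [in_features, out_features]
    w_alt = m.weight.t().contiguous()

    out_current = m(x)               # uses the [out_features, in_features] weight
    out_alt = x @ w_alt + m.bias     # no transpose needed with the alternative layout

print(w_alt.shape)  # the alternative layout is [20, 30] here

layouts_agree = torch.allclose(out_current, out_alt, atol=1e-5)
print(layouts_agree)
```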