pytroch、tensorflow对比学习—使用GPU训练模型

原创

已于 2022-07-24 11:49:29 修改 · 1.4k 阅读

22 ·

CC 4.0 BY-SA版权

文章标签：

#tensorflow #学习 #深度学习 #pytorch #人工智能

于 2022-07-24 00:19:52 首次发布

本文详细对比了PyTorch和TensorFlow在使用GPU进行模型训练的API接口及语法，助你理解模型优化流程。涵盖单GPU和多GPU部署，以及TPU训练的实战教程。

使用GPU训练模型

前言

本文是《pytorch-tensorflow-Comparative study》，pytorch和tensorflow对比学习专栏，第五章—— 使用GPU训练模型。

虽然说这两个框架在语法和接口的命名上有很多地方是不同的，但是深度学习的建模过程确实基本上都是一个套路的。

所以该笔记的笔记方式是：在使用相同的处理功能模块上，对比记录pytorch和tensorflow两者的API接口，和语法。
在这里插入图片描述
1，有利于深入理解深度学习建模过程流程。

2，有利于理解pytorch，和tensorflow设计上的不同，更加灵活的使用在自己的项目中。

3，有利于深入理解各个功能模块的使用。

本章节主要对比学习pytorch 和tensorflow有关使用GPU训练模型部分的API接口，和语法。

在这里插入图片描述

使用GPU训练模型

深度学习的训练过程常常非常耗时，一个模型训练几个小时是家常便饭，训练几天也是常有的事情，有时候甚至要训练几十天。

训练过程的耗时主要来自于两个部分，一部分来自数据准备，另一部分来自参数迭代。

当数据准备过程还是模型训练时间的主要瓶颈时，我们可以使用更多进程来准备数据。

当参数迭代过程成为训练时间的主要瓶颈时，我们通常的方法是应用GPU或者Google的TPU来进行加速。
在这里插入图片描述

使用单个GPU

在这里插入图片描述

pytorch

**Pytorch中：**使用GPU加速模型非常简单，只要将模型和数据移动到GPU上。核心代码只有几行。

# 定义模型
... 
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
model.to(device) # 移动模型到cuda
# 训练模型
...
features = features.to(device) # 移动数据到cuda
labels = labels.to(device) # 或者  labels = labels.cuda() if torch.cuda.is_available() else labels
...

以下是一些和GPU有关的基本操作汇总

import torch 
from torch import nn 
# 1，查看gpu信息
if_cuda = torch.cuda.is_available()
print("if_cuda=",if_cuda)

gpu_count = torch.cuda.device_count()
print("gpu_count=",gpu_count)
# if_cuda= True
# gpu_count= 1

# 2，将张量在gpu和cpu间移动
tensor = torch.rand((100,100))
tensor_gpu = tensor.to("cuda:0") # 或者 tensor_gpu = tensor.cuda()
print(tensor_gpu.device)
print(tensor_gpu.is_cuda)

tensor_cpu = tensor_gpu.to("cpu") # 或者 tensor_cpu = tensor_gpu.cpu() 
print(tensor_cpu.device)
# cuda:0
# True
# cpu

# 5，清空cuda缓存

# 该方法在cuda超内存时十分有用
torch.cuda.empty_cache()

下面分别使用CPU和GPU作一个矩阵乘法，并比较其计算效率。

import time
import torch 
from torch import nn
# 使用cpu
a = torch.rand((10000,200))
b = torch.rand((200,10000))
tic = time.time()
c = torch.matmul(a,b)
toc = time.time()

print(toc-tic)
print(a.device)
print(b.device)
# 0.6454010009765625
# cpu
# cpu

# 使用gpu
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu"

最低0.47元/天解锁文章