CS 20SI | Lecture 1: Introduction to TensorFlow

This post summarizes the TensorFlow material from the course CS 20SI (TensorFlow for Deep Learning Research), including understanding the computation-graph approach, exploring built-in functions, and building structured models. It introduces high-level APIs such as TF Learn, TF Slim, and Keras, explains how computation graphs and sessions work, and shows how graphs enable distributed computation.

This post is a summary of the course CS 20SI: TensorFlow for Deep Learning Research.

Course Goals

  • Understand TF's computation-graph approach
  • Explore TF's built-in functions
  • Learn how to build structured models suited to deep learning projects

Reference Books

  • TensorFlow for Machine Intelligence (TFFMI)
  • Hands-On Machine Learning with Scikit-Learn and TensorFlow. Chapter 9: Up and running with TensorFlow
  • Fundamentals of Deep Learning. Chapter 3: Implementing Neural Networks in TensorFlow (FODL)
    TensorFlow is iterating quickly, so the books above may become outdated; refer to the official website directly.

Getting Started

High-level APIs

  1. TF Learn
  2. TF Slim
  3. Keras (officially announced as a supported high-level API)
  4. TensorLayer

Graphs and Sessions

Data Flow Graphs

TensorFlow separates the definition of a computation graph from its execution. A TF program works in two phases:
Phase 1: define the computation graph
Phase 2: use a session to execute the operations in the graph
In a computation graph, nodes represent mathematical operations; edges represent the multidimensional tensors communicated between nodes.

Tensor

A tensor is an n-dimensional array, the TF counterpart of numpy's ndarray.
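
As a small illustrative sketch (not part of the original notes; the variable names are hypothetical), tensors of different ranks can be created with tf.constant:

import tensorflow as tf
scalar = tf.constant(3)                  # 0-d tensor (a scalar)
vector = tf.constant([1.0, 2.0, 3.0])    # 1-d tensor (a vector)
matrix = tf.constant([[1, 2], [3, 4]])   # 2-d tensor (a matrix)
print(scalar.shape, vector.shape, matrix.shape)
>> () (3,) (2, 2)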

import tensorflow as tf
a = tf.add(3, 5)

[Figure: the data flow graph for a = tf.add(3, 5)]
The code above builds the data flow graph shown in the figure. In a data flow graph, nodes can be operators, variables, and constants; edges are tensors.
"TensorFlow" literally means tensors (data) flowing through the graph.

import tensorflow as tf
a = tf.add(3, 5)
print(a)
>> Tensor("Add:0", shape=(), dtype=int32)

Note that print(a) does not output the value 8; it prints a Tensor object. The next section shows how to get the actual value of a.

Session

A Session object encapsulates the environment in which Operation objects are executed and Tensor objects are evaluated.

import tensorflow as tf
a = tf.add(3, 5)
sess = tf.Session()
print(sess.run(a))
sess.close()
>> 8

The session looks at the graph, figures out which nodes must be evaluated to obtain the value of a, and computes those nodes along the way.
Using a context manager means you don't have to close the session explicitly each time.

import tensorflow as tf
a = tf.add(3, 5)
with tf.Session() as sess:
    print(sess.run(a))

More (sub)graphs

tf.Session.run(fetches, feed_dict=None, options=None, run_metadata=None)
Pass the variables you want to compute as a list to fetches:

import tensorflow as tf
x = 2
y = 3
op1 = tf.add(x, y)
op2 = tf.multiply(x, y)       # tf.mul was renamed tf.multiply
useless = tf.multiply(x, op1)
op3 = tf.pow(op2, op1)
with tf.Session() as sess:
    op3_val, not_useless = sess.run([op3, useless])
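
The feed_dict argument in the signature above is not used here; as a minimal sketch (the placeholder p and the fed value are illustrative assumptions), it injects values into the graph at run time:

import tensorflow as tf
# a placeholder is a node whose value is supplied when the graph is run
p = tf.placeholder(tf.int32)
q = tf.add(p, 5)
with tf.Session() as sess:
    # feed_dict maps placeholders (or other tensors) to concrete values
    print(sess.run(q, feed_dict={p: 3}))
>> 8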

Distributed Computation

Splitting the graph
Splitting the computation graph into subgraphs makes it well suited to parallel computation across multiple machines and cores.
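
As a minimal sketch of this idea (not from the original notes; the device string '/gpu:0' is an assumption, and allow_soft_placement lets TF fall back to the CPU), tf.device places part of the graph on a specific device:

import tensorflow as tf
# pin these ops to a specific device; TF falls back to CPU if it is unavailable
with tf.device('/gpu:0'):
    a = tf.constant([1.0, 2.0, 3.0], name='a')
    b = tf.constant([4.0, 5.0, 6.0], name='b')
    c = tf.multiply(a, b)
config = tf.ConfigProto(allow_soft_placement=True, log_device_placement=True)
with tf.Session(config=config) as sess:
    print(sess.run(c))   # logs which device each op was placed on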

Another Graph

  • Multiple graphs require multiple sessions, each will try to use all available resources by default
  • Can’t pass data between them without passing them through python/numpy, which doesn’t work in distributed
  • It’s better to have disconnected subgraphs within one graph

import tensorflow as tf
g = tf.Graph()
# to add operators to a graph, set it as default:
with g.as_default():
    x = tf.add(3, 5)
# the session must be created with the graph that contains x
with tf.Session(graph=g) as sess:
    print(sess.run(x))

Be careful not to mix up the default graph with user-defined graphs; before adding operations to any graph, set that graph as the default first.

import tensorflow as tf
g1 = tf.get_default_graph()
g2 = tf.Graph()
# add ops to the default graph
with g1.as_default():
    a = tf.constant(3)
# add ops to the user-created graph
with g2.as_default():
    b = tf.constant(5)
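
Continuing the example above (a minimal sketch, not in the original notes), each graph needs a session bound to it in order to evaluate its tensors:

# a lives in g1 and b lives in g2, so each needs its own session
with tf.Session(graph=g1) as sess1:
    print(sess1.run(a))   # >> 3
with tf.Session(graph=g2) as sess2:
    print(sess2.run(b))   # >> 5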

Why graphs

  1. Save computation (only run subgraphs that lead to the values you want to fetch)
  2. Break computation into small, differential pieces to facilitate auto-differentiation
  3. Facilitate distributed computation, spread the work across multiple CPUs, GPUs, or devices
  4. Many common machine learning models are commonly taught and visualized as directed graphs already