TensorFlow Study Notes (1): Basic Concepts and Program Structure

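This article introduces the basic concepts of TensorFlow, including graphs, sessions, and tensors, shows how to build and run a computation graph through plane-fitting and matrix-multiplication examples, and touches on the feed and fetch mechanism.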

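1. Concepts

graph: the graph, which describes a specific computation task.

session: a graph must be launched in a session to be executed. A session is bound to a single graph, while the same graph can be launched in more than one session.

tensor: represents the data. In the Python API, tensor values fetched from a session are returned as numpy ndarray objects.

variable: a variable in the ordinary sense. It holds state such as model parameters across runs and is an important part of the graph.

operation: op for short. An op is a computation node in the graph that takes tensors as input and produces tensors as output.

feed, fetch: feeding supplies data to the graph and fetching retrieves data from it. They exist because some data has to be provided or read dynamically during training.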

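Execution:

Given the performance of arithmetic in pure Python, an external numerical library is a must, but switching back and forth between Python and that external environment is itself a large overhead. Like other mainstream machine learning tools, TF therefore usually organizes a program into a construction phase and an execution phase. Construction declares what network model is needed. Execution trains the model with the specified optimizer and also covers operations such as validation and output. You can think of it as building the model in Python first and then running all of it outside Python.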
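The concepts above (graph, op, session, feed, fetch, and the split between construction and execution) can be mimicked in a few lines of plain Python. The sketch below is a toy model under stated assumptions, not TensorFlow's implementation: the names `Node`, `ToySession`, `placeholder`, `constant`, `add`, and `mul` are hypothetical and exist only for this illustration.

```python
# Toy illustration only: this mimics the build-then-run workflow in plain
# Python and is NOT how TensorFlow is implemented.

class Node:
    """One node in the toy graph: an op plus the nodes it takes as input."""
    def __init__(self, name, func=None, inputs=()):
        self.name = name
        self.func = func            # None marks a placeholder that must be fed
        self.inputs = tuple(inputs)

def placeholder(name):
    return Node(name)

def constant(name, value):
    return Node(name, func=lambda: value)

def add(name, a, b):
    return Node(name, func=lambda x, y: x + y, inputs=(a, b))

def mul(name, a, b):
    return Node(name, func=lambda x, y: x * y, inputs=(a, b))

class ToySession:
    """Evaluates requested nodes; nothing is computed before run() is called."""
    def run(self, fetches, feed_dict=None):
        feed_dict = feed_dict or {}
        cache = {}

        def evaluate(node):
            if node in feed_dict:       # a fed value replaces the node's output
                return feed_dict[node]
            if node not in cache:
                if node.func is None:
                    raise ValueError("placeholder %r was not fed" % node.name)
                args = [evaluate(i) for i in node.inputs]
                cache[node] = node.func(*args)
            return cache[node]

        return [evaluate(n) for n in fetches]

# Construction phase: describe y = w * x + b without computing anything.
x = placeholder("x")
w = constant("w", 10.0)
b = constant("b", 3.0)
wx = mul("wx", w, x)
y = add("y", wx, b)

# Execution phase: feed a value for x, fetch y and the intermediate product.
sess = ToySession()
y_val, wx_val = sess.run([y, wx], feed_dict={x: 2.0})
print(y_val, wx_val)
```

The point of the design is that the lines in the construction phase only record what to compute; values appear only when `run` is called with the nodes to fetch and the data to feed.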

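2. Examples

2.1 Plane fitting

The plane to fit is y = W1 * x1_data + W2 * x2_data + b, where x1_data, x2_data, and y are known. In the program, x1_data and x2_data are random samples and a small amount of noise is added to y.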

Program:

    import tensorflow as tf  # TensorFlow 1.x graph-mode API
    import numpy as np

    # Create 100 phony data points in NumPy: y = x1 * 10 + x2 * 5 + 3, plus noise.
    x1_data = np.random.rand(100).astype(np.float32)
    x2_data = np.random.rand(100).astype(np.float32)
    # Draw the noise once with NumPy so that y_data is a fixed array; a
    # tf.random_uniform op here would be re-sampled on every sess.run call.
    noise = np.random.uniform(-0.1, 0.1, 100).astype(np.float32)
    y_data = x1_data * 10 + x2_data * 5 + 3 + noise

    # Try to find values for W1, W2 and b that compute
    # y_data = W1 * x1_data + W2 * x2_data + b.
    # (We know that W1 should be 10, W2 5 and b 3, but TensorFlow will
    # figure that out for us.)
    # Note: W1, W2, b and y are only declarations until they are initialized.
    W1 = tf.Variable(tf.random_uniform([1], -1.0, 1.0))
    W2 = tf.Variable(tf.random_uniform([1], -1.0, 1.0))
    b = tf.Variable(tf.zeros([1]))
    y = W1 * x1_data + W2 * x2_data + b

    # Minimize the mean squared error.
    loss = tf.reduce_mean(tf.square(y - y_data))
    optimizer = tf.train.AdagradOptimizer(0.6)
    train = optimizer.minimize(loss)

    # Before starting, initialize the variables. We will 'run' this first.
    # (tf.global_variables_initializer replaces the deprecated
    # tf.initialize_all_variables.)
    init = tf.global_variables_initializer()

    # Launch the graph.
    sess = tf.Session()
    sess.run(init)

    # Fit the plane, printing progress every 2000 steps.
    for step in range(20001):
        sess.run(train)
        if step % 2000 == 0:
            print(step, sess.run([W1, W2, b, loss]))
    # The fit should approach W1: [10], W2: [5], b: [3].

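The program first generates the data to fit from random numbers, then defines the error term and the optimization method, and finally trains the model and prints the result. Many optimizers are available besides AdagradOptimizer().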
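As a sanity check on what the training loop should converge to, the same plane can also be fitted in closed form. The sketch below swaps in a different technique, ordinary least squares via NumPy's `np.linalg.lstsq`, rather than the Adagrad optimization used in the program. The fixed seed and the variable names are assumptions made here for repeatability.

```python
import numpy as np

# Fixed seed chosen only so that this sketch is repeatable.
rng = np.random.default_rng(0)

# Same data recipe as the program: y = 10 * x1 + 5 * x2 + 3, plus small noise.
x1 = rng.random(100).astype(np.float32)
x2 = rng.random(100).astype(np.float32)
noise = rng.uniform(-0.1, 0.1, 100).astype(np.float32)
y = x1 * 10 + x2 * 5 + 3 + noise

# Design matrix: one column per input plus a column of ones for the bias b.
A = np.column_stack([x1, x2, np.ones_like(x1)])

# Solve min ||A @ [w1, w2, b] - y||^2 directly.
(w1, w2, b), *_ = np.linalg.lstsq(A, y, rcond=None)
print(w1, w2, b)
```

If the trained W1, W2, and b differ noticeably from the least-squares values on the same data, the learning rate or the number of steps is a likely place to look.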

2.2 Matrix multiplication

Multiply two fairly large matrices on the GPU and on the CPU, and compare the run times.

    import tensorflow as tf  # TensorFlow 1.x graph-mode API
    import numpy as np

    # If the ops are created here, outside any tf.device block, TensorFlow
    # chooses the device automatically, so switching the device string below
    # between "/cpu:0" and "/gpu:0" makes no difference to the run time.
    #matrix1 = np.random.rand(20000,1500).astype(np.float32)
    #matrix2 = np.random.rand(1500,20000).astype(np.float32)
    #product = tf.matmul(matrix1, matrix2)

    with tf.Session() as sess3:
        # Author's measurement: 11.6 s on the GPU, 20.2 s on the CPU.
        with tf.device("/gpu:0"):
            matrix1 = np.random.rand(20000, 1500).astype(np.float32)
            matrix2 = np.random.rand(1500, 20000).astype(np.float32)
            product = tf.matmul(matrix1, matrix2)
        result = sess3.run(product)
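The listing above does not show how the run times were measured. One possible way to time a call is sketched below. The helper `best_time` is hypothetical and not part of the original program. The small NumPy matrices are stand-ins so that the sketch runs quickly on any machine.

```python
import time

import numpy as np

def best_time(fn, repeats=3):
    """Hypothetical helper: call fn() several times, return (best_seconds, last_result)."""
    best = float("inf")
    result = None
    for _ in range(repeats):
        start = time.perf_counter()
        result = fn()
        best = min(best, time.perf_counter() - start)
    return best, result

# Small stand-in matrices with the same shape pattern as the example.
a = np.random.rand(200, 150).astype(np.float32)
b = np.random.rand(150, 200).astype(np.float32)
seconds, product = best_time(lambda: a @ b)
print(product.shape, "best of 3: %.6f s" % seconds)

# Size of the result in the original example: 20000 x 20000 float32 values.
result_bytes = 20000 * 20000 * np.dtype(np.float32).itemsize
print(result_bytes / 1e9, "GB")
```

In the TensorFlow example, the function passed to the helper would be something like `lambda: sess3.run(product)`. Repeating the call matters because the first run of a session typically includes one-time setup cost, which can distort a single measurement.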