Easy-TensorFlow项目中的TensorBoard嵌入可视化教程-优快云博客

本文链接：https://blog.youkuaiyun.com/gitblog_00701/article/details/148758054

Easy-TensorFlow项目中的TensorBoard嵌入可视化教程

easy-tensorflow Simple and comprehensive tutorials in TensorFlow 项目地址: https://gitcode.com/gh_mirrors/ea/easy-tensorflow

高维数据可视化的重要性

在深度学习领域，我们经常需要处理高维数据。以MNIST手写数字数据集为例，每张28×28像素的图像实际上是一个784维空间中的点。人类作为三维空间的生物，很难直观理解如此高维度的数据分布。

TensorBoard的嵌入可视化工具为我们提供了两种强大的降维方法：

主成分分析(PCA)：线性降维方法，通过保留最大方差方向来降低维度
t-SNE：非线性降维方法，特别擅长保留局部结构，适合可视化高维数据

环境准备与数据加载

首先导入必要的库：

import os
import tensorflow as tf
import numpy as np
import matplotlib.pyplot as plt

加载MNIST数据集，这是深度学习领域的经典基准数据集：

from tensorflow.examples.tutorials.mnist import input_data
mnist = input_data.read_data_sets("MNIST_data/", one_hot=True)

数据集包含：

训练集：55,000个样本
测试集：10,000个样本
验证集：5,000个样本

模型构建与训练

超参数设置

logs_path = "./logs/embedding/"  # TensorBoard日志路径
learning_rate = 0.001  # 学习率
epochs = 10  # 训练轮数
batch_size = 100  # 批次大小
display_freq = 100  # 显示频率

# 网络参数
img_h = img_w = 28  # 图像尺寸
img_size_flat = img_h * img_w  # 展平后的尺寸
n_classes = 10  # 类别数(0-9)
h1 = 200  # 第一隐藏层单元数

辅助函数定义

def weight_variable(name, shape):
    """创建权重变量"""
    initer = tf.truncated_normal_initializer(stddev=0.01)
    return tf.get_variable('W_' + name, shape=shape, initializer=initer)

def bias_variable(name, shape):
    """创建偏置变量"""
    initial = tf.constant(0., shape=shape, dtype=tf.float32)
    return tf.get_variable('b_' + name, initializer=initial)

def fc_layer(x, num_units, name, use_relu=True):
    """全连接层"""
    with tf.variable_scope(name):
        in_dim = x.get_shape()[1]
        W = weight_variable(name, shape=[in_dim, num_units])
        tf.summary.histogram('W', W)  # 记录权重直方图
        b = bias_variable(name, [num_units])
        tf.summary.histogram('b', b)  # 记录偏置直方图
        layer = tf.matmul(x, W) + b
        return tf.nn.relu(layer) if use_relu else layer

构建计算图

with tf.variable_scope('Input'):
    x = tf.placeholder(tf.float32, shape=[None, img_size_flat], name='X')
    tf.summary.image('input_image', tf.reshape(x, (-1, img_w, img_h, 1)), max_outputs=5)
    y = tf.placeholder(tf.float32, shape=[None, n_classes], name='Y')

# 网络结构
fc1 = fc_layer(x, h1, 'Hidden_layer', use_relu=True)
output_logits = fc_layer(fc1, n_classes, 'Output_layer', use_relu=False)

# 损失函数和优化器
with tf.variable_scope('Train'):
    loss = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(labels=y, logits=output_logits))
    tf.summary.scalar('loss', loss)
    optimizer = tf.train.AdamOptimizer(learning_rate=learning_rate).minimize(loss)
    accuracy = tf.reduce_mean(tf.cast(tf.equal(tf.argmax(output_logits, 1), tf.argmax(y, 1)), tf.float32))
    tf.summary.scalar('accuracy', accuracy)
    cls_prediction = tf.argmax(output_logits, axis=1, name='predictions')

模型训练与验证

训练过程中，我们记录了损失和准确率的变化：

sess = tf.InteractiveSession()
sess.run(tf.global_variables_initializer())
train_writer = tf.summary.FileWriter(logs_path, sess.graph)

for epoch in range(epochs):
    # 训练过程
    for iteration in range(num_tr_iter):
        batch_x, batch_y = mnist.train.next_batch(batch_size)
        _, summary_tr = sess.run([optimizer, merged], feed_dict={x: batch_x, y: batch_y})
        train_writer.add_summary(summary_tr, global_step)
    
    # 验证过程
    loss_valid, acc_valid = sess.run([loss, accuracy], 
                                   feed_dict={x: mnist.validation.images, y: mnist.validation.labels})

经过10轮训练后，模型在验证集上的准确率达到了98.1%，表现出色。

嵌入可视化实现

准备嵌入数据

我们关注隐藏层(fc1)的激活情况，该层有200个神经元，测试集有10,000个样本：

x_test = mnist.test.images
y_test = mnist.test.labels

# 创建嵌入变量
tensor_shape = (x_test.shape[0], fc1.get_shape()[1].value)  # [10000, 200]
embedding_var = tf.Variable(tf.zeros(tensor_shape), name='fc1_embedding')

# 将隐藏层激活赋值给嵌入变量
embedding_assign = embedding_var.assign(fc1)

配置嵌入可视化

为了在TensorBoard中正确显示，我们需要：

创建嵌入变量并分配隐藏层激活
配置元数据（如图像和标签）
将配置写入日志目录

# 创建嵌入配置
config = tf.contrib.tensorboard.plugins.projector.ProjectorConfig()
embedding = config.embeddings.add()
embedding.tensor_name = embedding_var.name

# 添加元数据
embedding.metadata_path = os.path.join(logs_path, 'metadata.tsv')  # 标签文件
embedding.sprite.image_path = os.path.join(logs_path, 'sprite.png')  # 图像精灵
embedding.sprite.single_image_dim.extend([img_w, img_h])  # 单图尺寸

# 保存配置
tf.contrib.tensorboard.plugins.projector.visualize_embeddings(train_writer, config)

生成元数据

创建标签文件(metadata.tsv)
创建图像精灵(sprite.png) - 将所有测试图像拼接成一个大图

# 生成标签文件
with open(os.path.join(logs_path, 'metadata.tsv'), 'w') as f:
    f.write("Index\tLabel\n")
    for i in range(x_test.shape[0]):
        f.write("%d\t%d\n" % (i, np.argmax(y_test[i])))
        
# 生成图像精灵
rows = int(math.ceil(math.sqrt(x_test.shape[0])))
sprite_dim = rows * img_h
sprite_image = np.zeros((sprite_dim, sprite_dim))

for i in range(x_test.shape[0]):
    row = i // rows
    col = i % rows
    sprite_image[row*img_h:(row+1)*img_h, col*img_w:(col+1)*img_w] = \
        x_test[i].reshape(img_h, img_w)
        
plt.imsave(os.path.join(logs_path, 'sprite.png'), sprite_image, cmap='gray')