TensorFlow-Examples项目解析：使用全连接神经网络实现MNIST分类-优快云博客

本文链接：https://blog.youkuaiyun.com/gitblog_01181/article/details/148325606

TensorFlow-Examples项目解析：使用全连接神经网络实现MNIST分类

TensorFlow-Examples TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2) 项目地址: https://gitcode.com/gh_mirrors/te/TensorFlow-Examples

本文将深入解析一个基于TensorFlow实现的全连接神经网络（Multilayer Perceptron）示例，该示例来自TensorFlow-Examples项目，用于MNIST手写数字分类任务。

神经网络基础概念

全连接神经网络（Fully Connected Neural Network），也称为多层感知机（MLP），是最基础的神经网络结构之一。它由多个全连接层组成，每个神经元都与前一层的所有神经元相连。在本示例中，我们构建的是一个具有两个隐藏层的神经网络。

环境准备与数据加载

首先需要导入必要的TensorFlow模块和MNIST数据集：

from tensorflow.examples.tutorials.mnist import input_data
mnist = input_data.read_data_sets("/tmp/data/", one_hot=False)

MNIST数据集包含60,000个训练样本和10,000个测试样本，每个样本是28x28像素的手写数字图像，标签为0-9的数字。

网络参数设置

learning_rate = 0.1
num_steps = 1000
batch_size = 128
display_step = 100

n_hidden_1 = 256  # 第一隐藏层神经元数量
n_hidden_2 = 256  # 第二隐藏层神经元数量
num_input = 784   # MNIST数据输入(28*28=784)
num_classes = 10  # MNIST类别数(0-9)

这些参数控制了网络的结构和训练过程，适当调整这些参数可以影响模型的性能和训练速度。

神经网络架构定义

示例中定义了一个三层的神经网络结构：

def neural_net(x_dict):
    x = x_dict['images']
    layer_1 = tf.layers.dense(x, n_hidden_1)
    layer_2 = tf.layers.dense(layer_1, n_hidden_2)
    out_layer = tf.layers.dense(layer_2, num_classes)
    return out_layer

这里使用了TensorFlow的高级API tf.layers.dense 来创建全连接层，它自动处理了权重矩阵和偏置向量的创建，以及激活函数的应用（默认使用线性激活）。

模型函数定义

使用TensorFlow Estimator API需要定义一个模型函数：

def model_fn(features, labels, mode):
    logits = neural_net(features)
    
    pred_classes = tf.argmax(logits, axis=1)
    pred_probas = tf.nn.softmax(logits)
    
    if mode == tf.estimator.ModeKeys.PREDICT:
        return tf.estimator.EstimatorSpec(mode, predictions=pred_classes)
    
    loss_op = tf.reduce_mean(tf.nn.sparse_softmax_cross_entropy_with_logits(
        logits=logits, labels=tf.cast(labels, dtype=tf.int32)))
    optimizer = tf.train.GradientDescentOptimizer(learning_rate=learning_rate)
    train_op = optimizer.minimize(loss_op, global_step=tf.train.get_global_step())
    
    acc_op = tf.metrics.accuracy(labels=labels, predictions=pred_classes)
    
    estim_specs = tf.estimator.EstimatorSpec(
        mode=mode,
        predictions=pred_classes,
        loss=loss_op,
        train_op=train_op,
        eval_metric_ops={'accuracy': acc_op})
    
    return estim_specs

这个函数定义了模型在不同模式（训练、评估、预测）下的行为，包括：

网络前向传播
预测结果计算
损失函数定义（交叉熵损失）
优化器配置（梯度下降）
评估指标（准确率）

模型训练与评估

创建Estimator并训练模型：

model = tf.estimator.Estimator(model_fn)

input_fn = tf.estimator.inputs.numpy_input_fn(
    x={'images': mnist.train.images}, y=mnist.train.labels,
    batch_size=batch_size, num_epochs=None, shuffle=True)
model.train(input_fn, steps=num_steps)

评估模型性能：

input_fn = tf.estimator.inputs.numpy_input_fn(
    x={'images': mnist.test.images}, y=mnist.test.labels,
    batch_size=batch_size, shuffle=False)
e = model.evaluate(input_fn)
print("Testing Accuracy:", e['accuracy'])