keras深度学习框架通过简单神经网络实现手写数字识别

原创

已于 2023-08-27 16:36:57 修改 · 748 阅读

3 ·

CC 4.0 BY-SA版权

文章标签：

#深度学习 #keras #神经网络 #Sequential #Dense

于 2023-08-27 16:25:34 首次发布

背景

keras深度学习框架，并不是一个独立的深度学习框架，它后台依赖tensorflow或者theano。大部分开发者应该使用的是tensorflow。keras可以很方便的像搭积木一样根据模型搭出我们需要的神经网络，然后进行编译，训练，测试，预测。

今天介绍的手写数字识别实验，主要是熟悉keras搭建神经网络的流程，以及大体的思路。现如今，手写数字识别实验的代码各种各样，对于初学者而言，我们需要的是类似helloworld那样简单的示例。通过示例，我们可以了解神经网络的搭建过程。

这里使用的手写数字识别，通过搭建网络，构建模型，最后保存模型，然后我们加载模型，通过真实的图片来预测，也检验一下神经网络的能力。

这里手写数字识别数据来源于官方自带mnist数据集，这个数据集包含60000个训练集和10000个测试集。每个数据是由28 * 28 = 784个矩阵元素组成。所以我们自己用来测试的图片最后应该也要按照这个28*28的尺寸来制作，并且最后进行预测predict的时候，也要像训练集或者测试集一样，把图片转为一个784元素的数组。

准备代码

import keras
import numpy as np
import tensorflow as tf
from keras.models import Sequential
from keras.layers import Dense, Activation
from tensorflow.keras import datasets, utils
import matplotlib.pyplot as plt


(x_train, y_train), (x_test, y_test) = datasets.mnist.load_data()
x_train = x_train.reshape((-1, 28*28))
x_train = x_train.astype('float32')/255
x_test = x_test.reshape((-1, 28*28))
x_test = x_test.astype('float32')/255

y_train = utils.to_categorical(y_train, num_classes=10)
y_test = utils.to_categorical(y_test, num_classes=10)

print('x_train.shape', x_train.shape)
print('x_test.shape', x_test.shape)
print('y_train.shape', y_train.shape)
print('y_test.shape', y_test.shape)
"""
layer = [Dense(32, input_shape=(784,)),
         Activation('relu'),
         Dense(10),
         Activation('softmax')]

model = Sequential(layer)
"""
model = Sequential()
# model.add(Dense(units=784, activation="relu", input_dim=784))
model.add(Dense(512, activation="relu", input_shape=(28*28, )))
model.add(Dense(10, activation="softmax"))

model.compile(loss="categorical_crossentropy

最低0.47元/天解锁文章