18、基于神经网络实现面部识别系统

最新推荐文章于 2026-01-08 20:27:01 发布

原创最新推荐文章于 2026-01-08 20:27:01 发布 · 23 阅读

0 ·

CC 4.0 BY-SA版权

文章标签：

#面部识别 #孪生神经网络 #深度学习

神经网络实战六项目专栏收录该内容

20 篇文章 ¥499.90

订阅专栏¥69.90

会员秒杀 ¥9.9 重磅福利

超级会员免费看

基于神经网络实现面部识别系统

1. 创建孪生神经网络

1.1 构建共享卷积网络

首先，我们需要创建一个共享卷积网络。以下是使用 Keras 的 Sequential 类构建共享卷积网络的代码：

from keras.models import Sequential, Input
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense

def create_shared_network(input_shape):
    model = Sequential()
    model.add(Conv2D(filters=128, kernel_size=(3,3), activation='relu',
                     input_shape=input_shape))
    model.add(MaxPooling2D())
    model.add(Conv2D(filters=64, kernel_size=(3,3), activation='relu'))
    model.add(Flatten())
    model.add(Dense(units=128, activation='sigmoid'))
    return model

1.2 构建孪生神经网络

接下来，我们使用上述定义的共享卷积网络来构建孪生神经网络。具体步骤如下：
1. 创建共享网络的单个实例：

input_shape = X_train.shape[1:]
shared_network = create_shared_network(input_shape)

指定顶部和底部层的输入：

input_top = Input(shape=input_shape)
input_bottom = Input(shape=input_shape)

使用 Keras 的函数式方法将共享网络堆叠在输入层的右侧：

output_top = shared_network(input_top)
output_bottom = shared_network(input_bottom)

定义计算欧几里得距离的函数：

from keras import backend as K

def euclidean_distance(vectors):
    vector1, vector2 = vectors
    sum_square = K.sum(K.square(vector1 - vector2), axis=1, keepdims=True)
    return K.sqrt(K.maximum(sum_square, K.epsilon()))

将欧几里得距离函数包装在 Lambda 层中：

from keras.layers import Lambda
distance = Lambda(euclidean_distance, output_shape=(1,))([output_top, output_bottom])

组合距离层和输入以完成模型：

from keras.models import Model
model = Model(inputs=[input_top, input_bottom], outputs=distance)

验证模型结构：

print(model.summary())

1.3 模型结构总结

层类型	输入形状	输出形状
输入层（顶部）	112 x 92 x 1	112 x 92 x 1
输入层（底部）	112 x 92 x 1	112 x 92 x 1
共享卷积网络	112 x 92 x 1	128
Lambda 层（欧几里得距离）	128, 128	1

1.4 模型构建流程

graph LR
    A[定义共享卷积网络] --> B[创建共享网络实例]
    B --> C[指定输入层]
    C --> D[堆叠共享网络]
    D --> E[定义欧几里得距离函数]
    E --> F[包装欧几里得距离函数]
    F --> G[组合距离层和输入]
    G --> H[验证模型结构]

2. 模型训练

2.1 准备训练数据

训练孪生神经网络需要使用图像对及其对应的类别标签。我们可以使用以下函数创建这些图像对：

import random
import numpy as np

def create_pairs(X,Y, num_classes):
    pairs, labels = [], []
    # index of images in X and Y for each class
    class_idx = [np.where(Y==i)[0] for i in range(num_classes)]
    # The minimum number of images across all classes
    min_images = min(len(class_idx[i]) for i in range(num_classes)) - 1
    for c in range(num_classes):
        for n in range(min_images):
            # create positive pair
            img1 = X[class_idx[c][n]]
            img2 = X[class_idx[c][n+1]]
            pairs.append((img1, img2))
            labels.append(1)
            # create negative pair
            # list of classes that are different from the current class
            neg_list = list(range(num_classes))
            neg_list.remove(c)
            # select a random class from the negative list.
            # This class will be used to form the negative pair.
            neg_c = random.sample(neg_list,1)[0]
            img1 = X[class_idx[c][n]]
            img2 = X[class_idx[neg_c][n]]
            pairs.append((img1,img2))
            labels.append(0)
    return np.array(pairs), np.array(labels)

num_classes = len(np.unique(Y_train))
training_pairs, training_labels = create_pairs(X_train, Y_train, len(np.unique(Y_train)))
test_pairs, test_labels = create_pairs(X_test, Y_test, len(np.unique(Y_test)))

2.2 定义对比损失函数

由于 Keras 中没有默认的对比损失函数，我们需要自己定义：

def contrastive_loss(Y_true, D):
    margin = 1
    return K.mean(Y_true*K.square(D)+(1 - Y_true)*K.maximum((margin-D),0))

2.3 编译和训练模型

model.compile(loss=contrastive_loss, optimizer='adam')
model.fit([training_pairs[:, 0], training_pairs[:, 1]], training_labels,
          batch_size=64, epochs=10)

3. 分析结果

3.1 测试相同主体的图像对

import matplotlib.pyplot as plt

idx1, idx2 = 21, 29
img1 = np.expand_dims(X_test[idx1], axis=0)
img2 = np.expand_dims(X_test[idx2], axis=0)

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10,7))
ax1.imshow(np.squeeze(img1), cmap='gray')
ax2.imshow(np.squeeze(img2), cmap='gray')
for ax in [ax1, ax2]:
    ax.grid(False)
    ax.set_xticks([])
    ax.set_yticks([])
dissimilarity = model.predict([img1, img2])[0][0]
fig.suptitle("Dissimilarity Score = {:.3f}".format(dissimilarity), size=30)
plt.tight_layout()
plt.show()

3.2 测试不同主体的图像对

idx1, idx2 = 1, 39
img1 = np.expand_dims(X_test[idx1], axis=0)
img2 = np.expand_dims(X_test[idx2], axis=0)

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10,7))
ax1.imshow(np.squeeze(img1), cmap='gray')
ax2.imshow(np.squeeze(img2), cmap='gray')
for ax in [ax1, ax2]:
    ax.grid(False)
    ax.set_xticks([])
    ax.set_yticks([])
dissimilarity = model.predict([img1, img2])[0][0]
fig.suptitle("Dissimilarity Score = {:.3f}".format(dissimilarity), size=30)
plt.tight_layout()
plt.show()

3.3 确定阈值

通过对更多正样本和负样本对进行测试，我们发现阈值分数大约为 0.5。即分数低于 0.5 应分类为正样本对（面部匹配），分数高于 0.5 应分类为负样本对。

3.4 测试流程

graph LR
    A[选择图像对] --> B[预测欧几里得距离]
    B --> C[判断是否为相同主体]
    C --> D[输出结果]

4. 代码整合

4.1 创建 `utils.py` 文件

import numpy as np
import random
import os
import cv2
from keras.models import Sequential
from keras.layers import Flatten, Dense, Conv2D, MaxPooling2D
from keras import backend as K
from keras.preprocessing.image import load_img, img_to_array

def euclidean_distance(vectors):
    vector1, vector2 = vectors
    sum_square = K.sum(K.square(vector1 - vector2), axis=1, keepdims=True)
    return K.sqrt(K.maximum(sum_square, K.epsilon()))

def contrastive_loss(Y_true, D):
    margin = 1
    return K.mean(Y_true*K.square(D)+(1 - Y_true)*K.maximum((margin-D),0))

def accuracy(y_true, y_pred):
    return K.mean(K.equal(y_true, K.cast(y_pred < 0.5, y_true.dtype)))

def create_pairs(X,Y, num_classes):
    pairs, labels = [], []
    # index of images in X and Y for each class
    class_idx = [np.where(Y==i)[0] for i in range(num_classes)]
    # The minimum number of images across all classes
    min_images = min(len(class_idx[i]) for i in range(num_classes)) - 1
    for c in range(num_classes):
        for n in range(min_images):
            # create positive pair
            img1 = X[class_idx[c][n]]
            img2 = X[class_idx[c][n+1]]
            pairs.append((img1, img2))
            labels.append(1)
            # create negative pair
            neg_list = list(range(num_classes))
            neg_list.remove(c)
            # select a random class from the negative list.
            # this class will be used to form the negative pair
            neg_c = random.sample(neg_list,1)[0]
            img1 = X[class_idx[c][n]]
            img2 = X[class_idx[neg_c][n]]
            pairs.append((img1,img2))
            labels.append(0)
    return np.array(pairs), np.array(labels)

def create_shared_network(input_shape):
    model = Sequential(name='Shared_Conv_Network')
    model.add(Conv2D(filters=64, kernel_size=(3,3), activation='relu',
                     input_shape=input_shape))
    model.add(MaxPooling2D())
    model.add(Conv2D(filters=64, kernel_size=(3,3), activation='relu'))
    model.add(Flatten())
    model.add(Dense(units=128, activation='sigmoid'))
    return model

def get_data(dir):
    X_train, Y_train = [], []
    X_test, Y_test = [], []
    subfolders = sorted([file.path for file in os.scandir(dir) if
                         file.is_dir()])
    for idx, folder in enumerate(subfolders):
        for file in sorted(os.listdir(folder)):
            img = load_img(folder+"/"+file, color_mode='grayscale')
            img = img_to_array(img).astype('float32')/255
            img = img.reshape(img.shape[0], img.shape[1],1)
            if idx < 35:
                X_train.append(img)
                Y_train.append(idx)
            else:
                X_test.append(img)
                Y_test.append(idx-35)
    X_train = np.array(X_train)
    X_test = np.array(X_test)
    Y_train = np.array(Y_train)
    Y_test = np.array(Y_test)
    return (X_train, Y_train), (X_test, Y_test)

4.2 创建 `siamese_nn.py` 文件

'''
Main code for training a Siamese neural network for face recognition
'''
import utils
import numpy as np
from keras.layers import Input, Lambda
from keras.models import Model

faces_dir = 'att_faces/'
# Import Training and Testing Data
(X_train, Y_train), (X_test, Y_test) = utils.get_data(faces_dir)
num_classes = len(np.unique(Y_train))
# Create Siamese Neural Network
input_shape = X_train.shape[1:]
shared_network = utils.create_shared_network(input_shape)

input_top = Input(shape=input_shape)
input_bottom = Input(shape=input_shape)
output_top = shared_network(input_top)
output_bottom = shared_network(input_bottom)
distance = Lambda(utils.euclidean_distance, output_shape=(1,))([output_top, output_bottom])
model = Model(inputs=[input_top, input_bottom], outputs=distance)
# Train the model
training_pairs, training_labels = utils.create_pairs(X_train, Y_train,
                                num_classes=num_classes)
model.compile(loss=utils.contrastive_loss, optimizer='adam',
              metrics=[utils.accuracy])
model.fit([training_pairs[:, 0], training_pairs[:, 1]], training_labels,
          batch_size=128,
          epochs=10)
# Save the model
model.save('siamese_nn.h5')

5. 创建实时面部识别程序

我们可以将之前编写的代码整合起来，创建一个实时面部识别程序。该程序将使用计算机的网络摄像头进行面部识别，并验证坐在摄像头前的人是否是你。具体实现步骤如下：
1. 加载训练好的模型。
2. 打开网络摄像头。
3. 捕获视频帧。
4. 对视频帧中的面部进行识别。
5. 根据识别结果进行身份验证。

以下是一个简单的示例代码：

import cv2
import numpy as np
from keras.models import load_model

# 加载训练好的模型
model = load_model('siamese_nn.h5')

# 打开网络摄像头
cap = cv2.VideoCapture(0)

while True:
    ret, frame = cap.read()
    if not ret:
        break

    # 在这里添加面部检测和预处理代码
    # ...

    # 进行面部识别
    # ...

    # 显示结果
    cv2.imshow('Face Recognition', frame)

    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

# 释放摄像头并关闭窗口
cap.release()
cv2.destroyAllWindows()

通过以上步骤，我们可以实现一个基于孪生神经网络的面部识别系统，并将其应用于实时面部识别场景。

6. 实时面部识别程序的详细实现

6.1 面部检测和预处理

在实时面部识别程序中，我们需要对捕获的视频帧进行面部检测和预处理。这里我们使用 OpenCV 的 Haar 级联分类器进行面部检测，并将检测到的面部图像调整为模型所需的输入尺寸。

import cv2
import numpy as np
from keras.models import load_model

# 加载训练好的模型
model = load_model('siamese_nn.h5')

# 加载 Haar 级联分类器
face_cascade = cv2.CascadeClassifier(cv2.data.haarcascades + 'haarcascade_frontalface_default.xml')

# 打开网络摄像头
cap = cv2.VideoCapture(0)

while True:
    ret, frame = cap.read()
    if not ret:
        break

    # 将帧转换为灰度图像
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)

    # 检测面部
    faces = face_cascade.detectMultiScale(gray, 1.3, 5)

    for (x, y, w, h) in faces:
        # 提取面部区域
        face = gray[y:y + h, x:x + w]

        # 调整面部图像尺寸
        face = cv2.resize(face, (112, 92))
        face = face.astype('float32') / 255
        face = np.expand_dims(face, axis=0)
        face = np.expand_dims(face, axis=-1)

        # 这里可以添加与已知面部图像进行对比的代码
        # ...

        # 在帧上绘制矩形框
        cv2.rectangle(frame, (x, y), (x + w, y + h), (255, 0, 0), 2)

    # 显示结果
    cv2.imshow('Face Recognition', frame)

    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

# 释放摄像头并关闭窗口
cap.release()
cv2.destroyAllWindows()

6.2 与已知面部图像进行对比

在检测到面部后，我们需要将其与已知的面部图像进行对比，以确定是否匹配。这里我们可以选择一个已知的面部图像作为参考，将检测到的面部图像与参考图像进行对比。

# 加载参考面部图像
reference_face = cv2.imread('reference_face.jpg', cv2.IMREAD_GRAYSCALE)
reference_face = cv2.resize(reference_face, (112, 92))
reference_face = reference_face.astype('float32') / 255
reference_face = np.expand_dims(reference_face, axis=0)
reference_face = np.expand_dims(reference_face, axis=-1)

for (x, y, w, h) in faces:
    # 提取面部区域
    face = gray[y:y + h, x:x + w]

    # 调整面部图像尺寸
    face = cv2.resize(face, (112, 92))
    face = face.astype('float32') / 255
    face = np.expand_dims(face, axis=0)
    face = np.expand_dims(face, axis=-1)

    # 进行面部对比
    dissimilarity = model.predict([face, reference_face])[0][0]

    if dissimilarity < 0.5:
        cv2.putText(frame, 'Match', (x, y - 10), cv2.FONT_HERSHEY_SIMPLEX, 0.9, (0, 255, 0), 2)
    else:
        cv2.putText(frame, 'No Match', (x, y - 10), cv2.FONT_HERSHEY_SIMPLEX, 0.9, (0, 0, 255), 2)

    # 在帧上绘制矩形框
    cv2.rectangle(frame, (x, y), (x + w, y + h), (255, 0, 0), 2)

6.3 实时面部识别流程

graph LR
    A[打开网络摄像头] --> B[捕获视频帧]
    B --> C[面部检测]
    C --> D[面部预处理]
    D --> E[与参考面部对比]
    E --> F[判断是否匹配]
    F --> G[显示结果]
    G --> B

7. 优化建议

7.1 模型优化

增加训练数据 ：可以收集更多的面部图像数据进行训练，以提高模型的泛化能力。
调整模型结构 ：尝试不同的卷积层、池化层和全连接层的组合，以找到更适合的模型结构。
使用预训练模型 ：可以使用预训练的卷积神经网络（如 VGG、ResNet 等）作为基础模型，进行微调训练。

7.2 面部检测优化

使用更先进的面部检测算法 ：如 MTCNN、RetinaFace 等，以提高面部检测的准确性和速度。
多尺度检测 ：在不同的尺度上进行面部检测，以检测到不同大小的面部。

7.3 实时性能优化

降低分辨率 ：可以降低视频帧的分辨率，以减少计算量，提高实时性能。
并行计算 ：使用 GPU 进行并行计算，以加速模型的推理过程。

8. 总结

通过本文的介绍，我们学习了如何基于神经网络实现一个面部识别系统。具体步骤如下：
1. 创建孪生神经网络，包括构建共享卷积网络和组合网络。
2. 准备训练数据，定义对比损失函数，并编译和训练模型。
3. 分析模型的测试结果，确定阈值。
4. 整合代码，创建实时面部识别程序。

同时，我们还介绍了实时面部识别程序的详细实现和优化建议。通过不断优化模型和算法，我们可以提高面部识别系统的准确性和实时性能，将其应用于更多的实际场景中。

8.1 步骤总结表格

步骤	描述
创建孪生神经网络	构建共享卷积网络，使用函数式方法组合网络
模型训练	准备训练数据，定义对比损失函数，编译和训练模型
分析结果	测试相同和不同主体的图像对，确定阈值
代码整合	创建 `utils.py` 和 `siamese_nn.py` 文件
实时面部识别	加载模型，打开摄像头，进行面部检测和识别