Speech Emotion Analyzer Tutorial

Article link: https://blog.youkuaiyun.com/gitblog_00407/article/details/142197920

Speech-Emotion-Analyzer: The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python) Project URL: https://gitcode.com/gh_mirrors/spe/Speech-Emotion-Analyzer

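1. Project Overview

Speech Emotion Analyzer is an open-source deep learning project that detects and classifies emotions in speech. Developed by Mitesh Puthran and built with Python and Keras, it recognizes five emotions for both male and female speakers: calm, happy, sad, angry, and fearful.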

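Main Features

Emotion recognition: identifies the emotion expressed in an audio file.
Gender recognition: distinguishes male voices from female voices.
High accuracy: the project reports 100% accuracy in distinguishing male from female voices and more than 70% accuracy in recognizing emotions.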
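Because the model's output classes combine gender and emotion in a single label (for example `female_angry`, as in the label list of the run example), both features come from one prediction. The following is a minimal sketch of separating the two parts; the helper name `split_label` is hypothetical and not part of the project.

```python
from typing import Tuple


def split_label(label: str) -> Tuple[str, str]:
    """Split a combined label such as 'female_angry' into (gender, emotion)."""
    gender, _, emotion = label.partition("_")
    if gender not in ("female", "male") or not emotion:
        raise ValueError(f"unexpected label format: {label!r}")
    return gender, emotion


print(split_label("female_angry"))  # ('female', 'angry')
```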

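Use Cases

Marketing: recommend products that suit a consumer's emotional state to improve purchase conversion.
Automotive: in autonomous vehicles, detect the driver's emotional state and adjust vehicle speed accordingly to keep driving safe.
Mental health monitoring: monitor and analyze a user's emotional state to provide personalized services and advice.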

2. Quick Start

Prerequisites

Python 3.6 or later
Keras 2.2.4 or later
LibROSA 0.7.2 or later
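To check an installed environment against these minimums, version strings can be compared as integer tuples. This is a minimal sketch using only the standard library; `parse_version` and `meets_minimum` are hypothetical helper names, and the parsing deliberately ignores suffixes such as `rc1`.

```python
import re
import sys
from typing import Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn '2.2.4' into (2, 2, 4), stopping at the first non-numeric part."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(installed: str, minimum: str) -> bool:
    """Return True if the installed version is at least the minimum."""
    return parse_version(installed) >= parse_version(minimum)


print(meets_minimum("0.8.1", "0.7.2"))  # True
print(meets_minimum("2.2.0", "2.2.4"))  # False
print(sys.version_info >= (3, 6))       # Python interpreter check
```

In a real environment, the installed versions can be read from `keras.__version__` and `librosa.__version__` and passed to `meets_minimum`.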

Install Dependencies

pip install keras librosa

Clone the Project

git clone https://github.com/MiteshPuthran/Speech-Emotion-Analyzer.git
cd Speech-Emotion-Analyzer

Run the Example

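import librosa
import numpy as np
from keras.models import model_from_json

# Load the model architecture from JSON
with open('model.json', 'r') as json_file:
    model = model_from_json(json_file.read())

# Load the trained weights (adjust the path to wherever the .h5 file sits in your clone)
model.load_weights("Emotion_Voice_Detection_Model.h5")

# Load the audio file.
# Assumption: the clip settings below (2.5 s starting at 0.5 s, 44.1 kHz) mirror
# the project's notebook, whose model takes one averaged MFCC value per frame.
# Check model.input_shape if your model differs.
audio_path = 'path_to_your_audio_file.wav'
audio, sample_rate = librosa.load(audio_path, res_type='kaiser_fast',
                                  duration=2.5, sr=22050 * 2, offset=0.5)
mfccs = librosa.feature.mfcc(y=audio, sr=sample_rate, n_mfcc=13)
mfccs_processed = np.mean(mfccs, axis=0)  # average the 13 coefficients per frame

# Pad with zeros or truncate to the number of frames the model expects
n_frames = model.input_shape[1]
pad_width = max(0, n_frames - len(mfccs_processed))
mfccs_processed = np.pad(mfccs_processed, (0, pad_width), mode='constant')[:n_frames]

# Predict the emotion; the input shape is (batch, frames, 1)
emotion_prediction = model.predict(mfccs_processed.reshape(1, n_frames, 1))
emotion_label = int(np.argmax(emotion_prediction))

# Map class indices to labels
emotion_labels = ['female_angry', 'female_calm', 'female_fearful', 'female_happy', 'female_sad',
                  'male_angry', 'male_calm', 'male_fearful', 'male_happy', 'male_sad']

print(f"Detected Emotion: {emotion_labels[emotion_label]}")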
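The example above reports only the single most likely class. When the model outputs one score per class, it can also help to look at the runner-up classes. The following is a minimal sketch, assuming a 1-D sequence of scores aligned with `emotion_labels`; `top_k` is a hypothetical helper, and the scores are made-up values for illustration, not real model output.

```python
from typing import List, Sequence, Tuple

emotion_labels = ['female_angry', 'female_calm', 'female_fearful', 'female_happy', 'female_sad',
                  'male_angry', 'male_calm', 'male_fearful', 'male_happy', 'male_sad']


def top_k(scores: Sequence[float], labels: Sequence[str], k: int = 3) -> List[Tuple[str, float]]:
    """Return the k highest-scoring (label, score) pairs, best first."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [(labels[i], float(scores[i])) for i in order[:k]]


# Made-up scores, one per class, used only to demonstrate the helper
fake_scores = [0.05, 0.02, 0.03, 0.10, 0.05, 0.40, 0.05, 0.20, 0.05, 0.05]
for label, score in top_k(fake_scores, emotion_labels):
    print(f"{label}: {score:.2f}")
```

With a loaded model, the same helper could be called as `top_k(emotion_prediction[0], emotion_labels)`, since `model.predict` returns one row of scores per input sample.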