[Speech Recognition] Voiceprint Recognition Based on MFCC and Mel Cepstral Coefficients, with MATLAB Code

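This post proposes a speech recognition method that combines Mel-frequency cepstral coefficients (MFCC) and mel cepstral coefficients for feature extraction and uses KNN as the classifier. Experiments show that the method achieves high recognition accuracy on easily confused English words.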


1 Introduction

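The method uses Mel-frequency cepstral coefficients (Mel Frequency Cepstrum Coefficients, MFCC) and mel cepstral coefficients for feature extraction and KNN as the classifier. Experimental results show that it achieves high recognition accuracy on easily confused English words.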
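The post does not list its classifier code, so the following is only a minimal sketch of how a KNN decision over feature vectors could look. The function name `knn_classify`, the Euclidean distance, and the majority vote via `mode` are assumptions made for illustration, not the author's implementation.

```matlab
function label = knn_classify(trainX, trainY, x, k)
%KNN_CLASSIFY  Hypothetical k-nearest-neighbour vote (illustrative sketch).
%   trainX : N-by-D matrix, one feature vector (e.g. averaged MFCCs) per row
%   trainY : N-by-1 vector of numeric class labels
%   x      : 1-by-D feature vector to classify
%   k      : number of neighbours that vote (assumed to be <= N)

% Squared Euclidean distance from x to every training vector
d = sum(bsxfun(@minus, trainX, x).^2, 2);

% Indices of the training vectors sorted from nearest to farthest
[~, idx] = sort(d);

% Majority vote among the k nearest labels
% (on a tie, mode returns the smallest label value)
label = mode(trainY(idx(1:k)));
end
```

For example, `knn_classify([0 0; 0 1; 5 5; 5 6], [1; 1; 2; 2], [0.2 0.3], 3)` picks two neighbours of class 1 and one of class 2, so it returns 1.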

2 Partial Code

function f=enframe(x,win,inc)
%ENFRAME split signal up into (overlapping) frames: one per row. F=(X,WIN,INC)
% F = ENFRAME(X,LEN) splits the vector X up into
% frames. Each frame is of length LEN and occupies
% one row of the output matrix. The last few frames of X
% will be ignored if its length is not divisible by LEN.
% It is an error if X is shorter than LEN.
% F = ENFRAME(X,LEN,INC) has frames beginning at increments of INC
% The centre of frame I is X((I-1)*INC+(LEN+1)/2) for I=1,2,...
% The number of frames is fix((length(X)-LEN+INC)/INC)
% F = ENFRAME(X,WINDOW) or ENFRAME(X,WINDOW,INC) multiplies
% each frame by WINDOW(:)
%    Copyright (C) Mike Brookes 1997
%      Version: $Id: enframe.m,v 1.3 2005/02/21 15:22:12 dmb Exp $
%   VOICEBOX is a MATLAB toolbox for speech processing.
%   Home page: http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%   This program is free software; you can redistribute it and/or modify
%   it under the terms of the GNU General Public License as published by
%   the Free Software Foundation; either version 2 of the License, or
%   (at your option) any later version.
%   This program is distributed in the hope that it will be useful,
%   but WITHOUT ANY WARRANTY; without even the implied warranty of
%   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
%   GNU General Public License for more details.
%   You can obtain a copy of the GNU General Public License from
%   ftp://prep.ai.mit.edu/pub/gnu/COPYING-2.0 or by writing to
%   Free Software Foundation, Inc.,675 Mass Ave, Cambridge, MA 02139, USA.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

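nx=length(x);
nwin=length(win);
if (nwin == 1)
   len = win;                    % scalar WIN is the frame length
else
   len = nwin;                   % vector WIN is a window; its length sets the frame length
end
if (nargin < 3)
   inc = len;                    % default: non-overlapping frames
end
nf = fix((nx-len+inc)/inc);      % number of complete frames
f=zeros(nf,len);
indf= inc*(0:(nf-1)).';          % zero-based start offset of each frame
inds = (1:len);                  % sample positions within one frame
f(:) = x(indf(:,ones(1,len))+inds(ones(nf,1),:));   % gather all frames in one indexing step
if (nwin > 1)
    w = win(:)';
    f = f .* w(ones(nf,1),:);    % multiply every frame by the window
end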
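The listing above only covers framing. As a rough sketch of how the frames could feed an MFCC computation, the script below frames a synthetic tone with `enframe` and then applies a power spectrum, a triangular mel filterbank, a logarithm, and a DCT. It assumes `enframe.m` from the listing is on the MATLAB path. The sampling rate, frame length, hop, filter count, number of coefficients, and the mel-scale formula are illustrative assumptions, not values taken from the post.

```matlab
% Assumed parameters (not from the post)
fs   = 8000;                 % sampling rate in Hz
len  = 256;                  % frame length in samples
inc  = 128;                  % hop between frame starts in samples
nfft = 256;                  % FFT size
nfilt = 24;                  % number of mel filters
ncep  = 12;                  % number of cepstral coefficients kept

% Synthetic one-second tone standing in for a speech recording
t = (0:fs-1)/fs;
x = sin(2*pi*440*t);

% Hamming window written out by hand to avoid a toolbox dependency
win = 0.54 - 0.46*cos(2*pi*(0:len-1).'/(len-1));

% Framing: fix((8000-256+128)/128) = 61 frames of 256 samples, one per row
frames = enframe(x, win, inc);

% Power spectrum of each frame, keeping the non-negative frequencies
P = abs(fft(frames, nfft, 2)).^2;
P = P(:, 1:nfft/2+1);

% Triangular mel filterbank (a common mel formula, assumed here)
mel  = @(f) 2595*log10(1 + f/700);
imel = @(m) 700*(10.^(m/2595) - 1);
edges = imel(linspace(0, mel(fs/2), nfilt+2));   % filter edge frequencies in Hz
fbin  = (0:nfft/2)*fs/nfft;                      % frequency of each FFT bin in Hz
H = zeros(nfilt, nfft/2+1);
for m = 1:nfilt
    up   = (fbin - edges(m))   / (edges(m+1) - edges(m));    % rising slope
    down = (edges(m+2) - fbin) / (edges(m+2) - edges(m+1));  % falling slope
    H(m,:) = max(0, min(up, down));
end

% Log filterbank energies (eps guards against log(0))
E = log(P*H.' + eps);

% DCT-II basis applied to the log energies; coefficient 0 is skipped
D = cos(pi*(1:ncep).'*((0:nfilt-1) + 0.5)/nfilt);
C = E*D.';                   % one row of ncep coefficients per frame
```

With these settings `frames` is 61-by-256 and `C` is 61-by-12; each row of `C` could serve as a per-frame feature vector for a classifier.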

3 Results

4 References

[1] Li Lingli. A speech recognition method based on MFCC and SVM [J]. Software Guide (软件导刊), 2012, 11(3): 2.

