ABSTRACT
An open-source Mandarin speech corpus called AISHELL-1 is released. It is by far the largest corpus which is suitable for conducting the speech recognition research and building speech recognition systems for Mandarin. The recording procedure, including audio capturing devices and environments are presented in details. The preparation of the related resources, including transcriptions and lexicon are described. The corpus is released with a Kaldi recipe. Experimental results implies that the quality of audio recordings and transcriptions are promising.
Index Terms— Speech Recognition, Mandarin Corpus, Open-Source Data
INTRODUCTION
Automatic Speech Recognition(ASR) has been an active research topic for several decades. Most state-of-the-art ASR systems benefit from powerful statistical models, such as Gaussian Mixture Models(GMM), Hidden Markov Models(HMM) and Deep Neural Networks(DNN) . Th

本文介绍了由希尔贝壳科技有限公司发布的开源 Mandarin 语音识别数据集 AISHELL-1,该数据集包含178小时的普通话录音,涉及400位不同地区的说话人,旨在促进中文语音识别研究。数据集质量高,转写准确率超过95%,可用于构建和评估语音识别系统。AISHELL-1 是迄今为止最大的开源中文语音识别语料库,对学术界和工业界开放,有助于弥合研究与产业之间的差距。
最低0.47元/天 解锁文章
1115

被折叠的 条评论
为什么被折叠?



