AISHELL-ASR0009-OS1 开源中文语音数据库

本文介绍了由希尔贝壳科技有限公司发布的开源 Mandarin 语音识别数据集 AISHELL-1,该数据集包含178小时的普通话录音,涉及400位不同地区的说话人,旨在促进中文语音识别研究。数据集质量高,转写准确率超过95%,可用于构建和评估语音识别系统。AISHELL-1 是迄今为止最大的开源中文语音识别语料库,对学术界和工业界开放,有助于弥合研究与产业之间的差距。

ABSTRACT

An open-source Mandarin speech corpus called AISHELL-1 is released. It is by far the largest corpus which is suitable for conducting the speech recognition research and building speech recognition systems for Mandarin. The recording procedure, including audio capturing devices and environments are presented in details. The preparation of the related resources, including transcriptions and lexicon are described. The corpus is released with a Kaldi recipe. Experimental results implies that the quality of audio recordings and transcriptions are promising.

Index Terms— Speech Recognition, Mandarin Corpus, Open-Source Data

INTRODUCTION

Automatic Speech Recognition(ASR) has been an active research topic for several decades. Most state-of-the-art ASR systems benefit from powerful statistical models, such as Gaussian Mixture Models(GMM), Hidden Markov Models(HMM) and Deep Neural Networks(DNN) . Th

评论
成就一亿技术人!
拼手气红包6.0元
还能输入1000个字符
 
红包 添加红包
表情包 插入表情
 条评论被折叠 查看
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值