The AISHELL-DMASH dataset was recorded in real smart home scenarios in two different rooms. It contains 30,000 hours of speech data. The recording setup consists of one close-talking microphone plus seven groups of devices placed at seven different positions in the room. Each group includes one iPhone, one Android phone, one iPad, one microphone, and one circular microphone array with a radius of 5 cm. The dataset covers 511 speakers, and each speaker was recorded on three visits spaced 7 to 15 days apart. AISHELL-DMASH was transcribed by professional speech annotators under a strict QA process, with a word-level accuracy of 98%. It can be used for research on voiceprint recognition, speech recognition, wake-up word recognition, and related tasks.
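
The device layout described above can be sketched as a small enumeration, to make the counts concrete. This is only an illustration under stated assumptions: the identifiers (`close_talk`, `pos1_iphone`, and so on) are hypothetical labels, not the dataset's actual file or channel naming, and the sketch counts devices rather than audio channels, since the number of microphones in each circular array is not given here.

```python
from itertools import product

# Figures taken from the dataset description above.
NUM_POSITIONS = 7          # seven device groups, one per room position
NUM_SPEAKERS = 511
VISITS_PER_SPEAKER = 3

# Hypothetical labels for the five device types in each group.
DEVICE_TYPES = ["iphone", "android", "ipad", "mic", "circular_array"]


def recording_devices():
    """List every recording device in one room: the close-talking
    microphone plus five devices at each of the seven positions."""
    devices = ["close_talk"]
    devices += [
        f"pos{position}_{device}"
        for position, device in product(range(1, NUM_POSITIONS + 1), DEVICE_TYPES)
    ]
    return devices


devices = recording_devices()
total_visits = NUM_SPEAKERS * VISITS_PER_SPEAKER

print(len(devices))   # 1 + 7 * 5 = 36 devices
print(total_visits)   # 511 * 3 = 1533 speaker visits
```

Counting this way gives 36 recording devices per room and 1,533 speaker visits in total, which follows directly from the numbers in the description.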

The setup of th

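This article introduces the AISHELL-DMASH dataset, which contains 30,000 hours of speech recordings and is suited to speech recognition research. The FFSVC20 challenge focuses on speaker verification with far-field distributed microphone arrays in real environments and provides large-scale far-field data. 希尔贝壳科技 (AISHELL) contributes solutions and technical services in this field.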



