audio to text

最新推荐文章于 2025-09-15 09:12:41 发布

转载最新推荐文章于 2025-09-15 09:12:41 发布 · 1.2k 阅读

SpeechRecognition 专栏收录该内容

3 篇文章

订阅专栏

VoxForge是一个致力于收集转录语音的项目，旨在为自由及开放源代码的语音识别引擎提供支持。该项目将收集到的音频文件发布在GPL许可下，并将其整合成可用于CMUSphinx、ISIP等开源语音识别引擎的声学模型。

VoxForge

Languages

Български
Catalan
Deutsch
Ελληνικά
Español
Français
עברית
Hrvatski
Italiano
Netherlands
فارسی
Português
Русский
Shqip
Türkçe
Українська

VoxForge was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines (on Linux, Windows and Mac).

We willmake available all submitted audio files under the GPL license, and then 'compile' them into acoustic models for usewith Open Source speech recognition engines such as CMU Sphinx, ISIP, Julius and HTK (note: HTK has distribution restrictions).

Why Do We Need Free GPL Speech Audio?

Most acoustic models used by 'Open Source' speech recognition(or Speech-to-Text) engines are closed source. They do not give you access to the speechaudio and transcriptions (i.e. the speechcorpus) used to create the acoustic model.

The reason for this is that Free and Open Source ('FOSS') projects arerequired to purchase large speechcorpora with restrictive licensing. Although there are afew instances of small FOSS speech corpora that could be used tocreate acoustic models, the vast majority of corpora (especiallylarge corpora best suited to building good acoustic models) must bepurchased under restrictive licenses.

How Can You Help?

Record yourself reading some text and upload your recordings to VoxForge.

Other Options.

News

Open Speech Data Corpus for German

By kmaclean-4/28/2015VoxForge is now mirroring the LT and the Teleccoperation group Open Speech Data Corpus for German with 35 hours of speech from about 180 speakers.

Java Code Signing Certificate from Thawte

By kmaclean-6/17/2014We would like to thank Thawte for renewing the code signing certificate for the VoxForge speech submission applet for another 2 years.

New VoxForge Language: Albanian

By kmaclean-2/27/2014Many thanks to ajashari for the VoxForge Albanian translations