public marks

PUBLIC MARKS with tags HTK & corpus

April 2007

Donate your speech to VoxForge using your telephone

by kmaclean
VoxForge ( http://www.voxforge.org ) is a open source project that collects speech recordings for use in the creation of Acoustic Models. Speech recognition engines need an acoustic model to recognize speech. To create an acoustic model, you take a very large number of speech audio recordings and 'compile' them into statistical representations of the sounds that make up each word. Most open source speech recognition engines use 'closed source' acoustic models. VoxForge hopes to address this problem by creating a free gpl speech corpus, and generating acoustic models from this corpus. You can now use your telephone to your donate your speech. Click this link: http://www.voxforge.org/home/s… to get the number, and the Interactive Voice Response system will guide you through the process.

February 2007

Improving Open Source Speech Recognition

by kmaclean
Speech Recognition Engines require two types of files to recognize speech: an Acoustic Model, created by 'compiling' a lots of transcribed speech into statistical models, and a Language Model (for Dictation) or Grammar file (for Command and Control). Most Acoustic Models used by 'Open Source' Speech Recognition engines are 'Closed Source'. They do not give you access to the speech audio (the 'Source') used to create the Acoustic Model. The reason for this is that there is no free Speech Corpus in a form that can readily be used to create Acoustic Models for Speech Recognition Engines. Open Source projects are thus required to purchase a Speech Corpus which has restrictive licensing in order to create their Acoustic Models. VoxForge (http://www.voxforge.org) was set up to address this problem. The site collects GPL transcribed speech audio from users which is then used to create Acoustic Models. These can then be used with Free and Open Source Speech Recognition Engines such as Sphinx, ISIP, Julius and/or HTK.

PUBLIC TAGS related to tag HTK

acoustic +   acoustic model +   copora +   corpora +   corpus +   isip +   Julius +   model +   recognition +   speech +   speech recognition +   sphinx +   transcribed +   transcribed speech +  

Active users

kmaclean
last mark : 26/04/2007 17:13