DSpace
 

Researchspace >
General science, engineering & technology >
General science, engineering & technology >
General science, engineering & technology >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10204/5576

Title: ASR corpus design for resource-scarce languages
Authors: Barnard, E
Davel, M
Van Heerden, C
Keywords: Accurate speech recognition (ASR)
Lwazi
Resource scarce languages
Corpus design
Issue Date: Sep-2009
Publisher: ISCA
Citation: Barnard, E, Davel, M and Van Heerden, C. ASR corpus design for resource-scarce languages. 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UK, 6-10 September 2009, pp 2847-2850
Abstract: The authors investigate the number of speakers and the amount of data that is required for the development of useable speaker-independent speech-recognition systems in resource-scarce languages. Their experiments employ the Lwazi corpus, which contains speech in the eleven official languages of South Africa. They find that a surprisingly small number of speakers (fewer than 50) and around 10 to 20 hours of speech per language are sufficient for the purposes of acceptable phone-based recognition.
Description: 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UK, 6-10 September 2009
URI: http://hdl.handle.net/10204/5576
Appears in Collections:Human language technologies
General science, engineering & technology

Files in This Item:

File Description SizeFormat
Barnard3_2009.pdf247.35 kBAdobe PDFView/Open
View Statistics

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback