
Please use this identifier to cite or link to this item: http://hdl.handle.net/10204/3649

Title: Basic speech recognition for spoken dialogues
Authors: Van Heerden, C
Barnard, E
Davel, M
Keywords: Speech recognition
Spoken dialogue systems
SDS
Accurate speech recognisers
ASR
Resource scarce languages
Human language technologies
Interspeech 2009
Speech communication
Small-vocabulary speaker-independent ASR systems
Cross-language transfer
Issue Date: Sep-2009
Publisher: International Speech Communication Association
Citation: Van Heerden, C, Barnard, E and Davel, M. 2009. Basic speech recognition for spoken dialogues. 10th Annual Conference of the International Speech Communication Association (Interspeech 2009). Brighton, UK, 6-10 September, 2009. pp 3003-3006
Abstract: Spoken dialogue systems (SDSs) have great potential for information access in the developing world. However, realising that potential requires solving several challenging problems, including the development of sufficiently accurate speech recognisers for a diverse multitude of languages. The paper investigates the feasibility of developing small-vocabulary speaker-independent ASR systems designed for use in a telephone-based information system, using ten resource-scarce languages spoken in South Africa as a case study. The researchers contrast a cross-language transfer approach (using a well-trained system from a different language) with the development of new language-specific corpora and systems, and evaluate the effectiveness of both approaches. It was found that limited speech corpora (3 to 8 hours of data from around 200 speakers) are sufficient for the development of reasonably accurate recognisers. Error rates are in the range of 2% to 12% for a ten-word task, where vocabulary words are excluded from training to simulate vocabulary-independent performance. This approach is substantially more accurate than cross-language transfer, and sufficient for the development of basic spoken dialogue systems.
Description: 10th Annual Conference of the International Speech Communication Association (Interspeech 2009). Brighton, UK, 6-10 September 2009
URI: http://hdl.handle.net/10204/3649
ISSN: 1990-9772
Appears in Collections: Human language technologies
General science, engineering & technology

Files in This Item:

File: Van Heerden_d2_2009.pdf
Size: 231.66 kB
Format: Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 
