|
Researchspace >
General science, engineering & technology >
General science, engineering & technology >
General science, engineering & technology >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10204/5576
|
| Title: | ASR corpus design for resource-scarce languages |
| Authors: | Barnard, E Davel, M Van Heerden, C |
| Keywords: | Accurate speech recognition (ASR) Lwazi Resource scarce languages Corpus design |
| Issue Date: | Sep-2009 |
| Publisher: | ISCA |
| Citation: | Barnard, E, Davel, M and Van Heerden, C. ASR corpus design for resource-scarce languages. 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UK, 6-10 September 2009, pp 2847-2850 |
| Abstract: | The authors investigate the number of speakers and the amount of data that is required for the development of useable speaker-independent speech-recognition systems in resource-scarce languages. Their experiments employ the Lwazi corpus, which contains speech in the eleven official languages of South Africa. They find that a surprisingly small number of speakers (fewer than 50) and around 10 to 20 hours of speech per language are sufficient for the purposes of acceptable phone-based recognition. |
| Description: | 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UK, 6-10 September 2009 |
| URI: | http://hdl.handle.net/10204/5576 |
| Appears in Collections: | Human language technologies General science, engineering & technology
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|