dc.contributor.author Molapo, R
dc.contributor.author Barnard, E
dc.contributor.author De Wet, Febe
dc.date.accessioned 2014-08-25T10:09:51Z
dc.date.available 2014-08-25T10:09:51Z
dc.date.issued 2014-05
dc.identifier.citation Molapo, R and Barnard, E and De Wet, F. 2014. Speech data collection in an under-resourced language within a multilingual context. In: 4th International Workshop on Spoken Language Technologies for Under-resourced Languages, St Petersburg, Russia, 14-16 May 2014 en_US
dc.identifier.uri http://hdl.handle.net/10204/7621
dc.description 4th International Workshop on Spoken Language Technologies for Under-resourced Languages, St Petersburg, Russia, 14-16 May 2014 en_US
dc.description.abstract In this paper, we present an end-to-end solution to the development of an automatic speech recognition (ASR) system in typical under-resourced languages, where the target language is likely to be influenced by one or more embedded foreign languages. We first describe the collection and processing of the text corpus crawled from the World Wide Web using the Rapid Language Adaptation Toolkit. In particular, we highlight the challenges faced when foreign languages are embedded within the matrix language. Thereafter, we discuss our speech data collection efforts in under-resourced environments. Finally, we report on a strategy called transliteration that helps improve the recognition results of our grapheme-based automatic speech recognition system in the presence of embedded language words. en_US
dc.language.iso en en_US
dc.publisher International Research Institute en_US
dc.relation.ispartofseries Workflow;13146
dc.subject Under-resourced languages en_US
dc.subject Transliteration en_US
dc.subject Matrix language en_US
dc.subject Grapheme-based ASR en_US
dc.title Speech data collection in an under-resourced language within a multilingual context en_US
dc.type Presentation en_US