dc.contributor.author |
De Vries, NJ
|
|
dc.contributor.author |
Badenhorst, J
|
|
dc.contributor.author |
Davel, MH
|
|
dc.contributor.author |
Barnard, E
|
|
dc.contributor.author |
De Waal, A
|
|
dc.date.accessioned |
2011-09-21T12:46:49Z |
|
dc.date.available |
2011-09-21T12:46:49Z |
|
dc.date.issued |
2011-08 |
|
dc.identifier.citation |
De Vries, NJ, Badenhorst, J, Davel, MH et al. 2011. Woefzela - An open-source platform for ASR data collection in the developing world. INTERSPEECH 2011, Florence, Italy, 27-31 August 2011 |
en_US |
dc.identifier.uri |
http://hdl.handle.net/10204/5149
|
|
dc.description |
INTERSPEECH 2011, Florence, Italy, 27-31 August 2011 |
en_US |
dc.description.abstract |
Building transcribed speech corpora for under-resourced languages plays a pivotal role in developing speech technologies for such languages. The authors have developed an open-source tool for devices running the Android operating system to facilitate the efficient collection of speech data for Automatic Speech Recognition system development. The tool was designed for use in typical developing-world conditions; they present the relevant design choices and analyse the effectiveness of this tool by means of a case study. In particular, they introduce a novel semi-real-time quality monitoring system, which increases the efficiency of the data collection process. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
Conference paper |
en_US |
dc.relation.ispartofseries |
Workflow request;7186 |
|
dc.subject |
Speech resource collection |
en_US |
dc.subject |
Under resourced languages |
en_US |
dc.subject |
Automatic speech recognition |
en_US |
dc.subject |
Developing world |
en_US |
dc.subject |
Resource scarce environment |
en_US |
dc.subject |
Android |
en_US |
dc.subject |
Open source |
en_US |
dc.subject |
ASR data |
en_US |
dc.subject |
Interspeech 2011 |
en_US |
dc.title |
Woefzela - An open-source platform for ASR data collection in the developing world |
en_US |
dc.type |
Conference Presentation |
en_US |
dc.identifier.apacitation |
De Vries, N., Badenhorst, J., Davel, M., Barnard, E., & De Waal, A. (2011). Woefzela - An open-source platform for ASR data collection in the developing world. Conference paper. http://hdl.handle.net/10204/5149 |
en_ZA |
dc.identifier.chicagocitation |
De Vries, NJ, J Badenhorst, MH Davel, E Barnard, and A De Waal. "Woefzela - An open-source platform for ASR data collection in the developing world." (2011): http://hdl.handle.net/10204/5149 |
en_ZA |
dc.identifier.vancouvercitation |
De Vries N, Badenhorst J, Davel M, Barnard E, De Waal A, Woefzela - An open-source platform for ASR data collection in the developing world; Conference paper; 2011. http://hdl.handle.net/10204/5149 . |
en_ZA |
dc.identifier.ris |
TY - Conference Presentation
AU - De Vries, NJ
AU - Badenhorst, J
AU - Davel, MH
AU - Barnard, E
AU - De Waal, A
AB - Building transcribed speech corpora for under-resourced languages plays a pivotal role in developing speech technologies for such languages. The authors have developed an open-source tool for devices running the Android operating system to facilitate the efficient collection of speech data for Automatic Speech Recognition system development. The tool was designed for use in typical developing-world conditions; they present the relevant design choices and analyse the effectiveness of this tool by means of a case study. In particular, they introduce a novel semi-real-time quality monitoring system, which increases the efficiency of the data collection process.
DA - 2011-08
DB - ResearchSpace
DP - CSIR
KW - Speech resource collection
KW - Under resourced languages
KW - Automatic speech recognition
KW - Developing world
KW - Resource scarce environment
KW - Android
KW - Open source
KW - ASR data
KW - Interspeech 2011
LK - https://researchspace.csir.co.za
PY - 2011
T1 - Woefzela - An open-source platform for ASR data collection in the developing world
TI - Woefzela - An open-source platform for ASR data collection in the developing world
UR - http://hdl.handle.net/10204/5149
ER -
|
en_ZA |