Phonotactic spoken language identification with limited training data

Peche, M; Davel, MH; Barnard, E

dc.contributor.author	Peche, M
dc.contributor.author	Davel, MH
dc.contributor.author	Barnard, E
dc.date.accessioned	2008-12-04T13:41:23Z
dc.date.available	2008-12-04T13:41:23Z
dc.date.issued	2007-08
dc.identifier.citation	Peche, M, Davel, M and Barnard, E. 2007. Phonotactic spoken language identification with limited training data. Interspeech, Antwerp, Belgium, 27-31 August 2007, pp 4.	en
dc.identifier.uri	http://hdl.handle.net/10204/2657
dc.description.abstract	The authors investigate the addition of a new language, for which limited resources are available, to a phonotactic language identification system. Two classes of approaches are studied: in the first class, only existing phonetic recognizers are employed, whereas an additional phonetic recognizer in the new language is created for the second class. It is found that the number of acoustic recognizers employed plays a crucial role in determining the recognition accuracy for the new language. The authors study different approaches to incorporating a language for which audio-only data is available (no pronunciation dictionaries or transcriptions) and find that if more than about 2 000 training utterances are available, a bootstrapped acoustic model for the new language can improve accuracy substantially	en
dc.language.iso	en	en
dc.subject	Spoken language identification	en
dc.subject	Acoustic recognizers	en
dc.subject	Resource scarce languages	en
dc.title	Phonotactic spoken language identification with limited training data	en
dc.type	Article	en
dc.identifier.apacitation	Peche, M., Davel, M., & Barnard, E. (2007). Phonotactic spoken language identification with limited training data. http://hdl.handle.net/10204/2657	en_ZA
dc.identifier.chicagocitation	Peche, M, MH Davel, and E Barnard "Phonotactic spoken language identification with limited training data." (2007) http://hdl.handle.net/10204/2657	en_ZA
dc.identifier.vancouvercitation	Peche M, Davel M, Barnard E. Phonotactic spoken language identification with limited training data. 2007; http://hdl.handle.net/10204/2657.	en_ZA
dc.identifier.ris	TY - Article AU - Peche, M AU - Davel, MH AU - Barnard, E AB - The authors investigate the addition of a new language, for which limited resources are available, to a phonotactic language identification system. Two classes of approaches are studied: in the first class, only existing phonetic recognizers are employed, whereas an additional phonetic recognizer in the new language is created for the second class. It is found that the number of acoustic recognizers employed plays a crucial role in determining the recognition accuracy for the new language. The authors study different approaches to incorporating a language for which audio-only data is available (no pronunciation dictionaries or transcriptions) and find that if more than about 2 000 training utterances are available, a bootstrapped acoustic model for the new language can improve accuracy substantially DA - 2007-08 DB - ResearchSpace DP - CSIR KW - Spoken language identification KW - Acoustic recognizers KW - Resource scarce languages LK - https://researchspace.csir.co.za PY - 2007 T1 - Phonotactic spoken language identification with limited training data TI - Phonotactic spoken language identification with limited training data UR - http://hdl.handle.net/10204/2657 ER -	en_ZA

Files in this item

Name: Peche1_2007.pdf

Size: 102.6Kb

Format: PDF

View/Open

This item appears in the following Collection(s)

Journal Articles

Show simple item record

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.