Phonotactic spoken language identification with limited training data

Peche, M; Davel, MH; Barnard, E

Phonotactic spoken language identification with limited training data

http://hdl.handle.net/10204/2657

Abstract:

The authors investigate the addition of a new language, for which limited resources are available, to a phonotactic language identification system. Two classes of approaches are studied: in the first class, only existing phonetic recognizers are employed, whereas an additional phonetic recognizer in the new language is created for the second class. It is found that the number of acoustic recognizers employed plays a crucial role in determining the recognition accuracy for the new language. The authors study different approaches to incorporating a language for which audio-only data is available (no pronunciation dictionaries or transcriptions) and find that if more than about 2 000 training utterances are available, a bootstrapped acoustic model for the new language can improve accuracy substantially

Reference:

Peche, M, Davel, M and Barnard, E. 2007. Phonotactic spoken language identification with limited training data. Interspeech, Antwerp, Belgium, 27-31 August 2007, pp 4.

Peche, M., Davel, M., & Barnard, E. (2007). Phonotactic spoken language identification with limited training data. http://hdl.handle.net/10204/2657

Peche, M, MH Davel, and E Barnard "Phonotactic spoken language identification with limited training data." (2007) http://hdl.handle.net/10204/2657

Peche M, Davel M, Barnard E. Phonotactic spoken language identification with limited training data. 2007; http://hdl.handle.net/10204/2657.

Download RIS

Peche, M
Davel, MH
Barnard, E

Aug 2007

Spoken language identification
Acoustic recognizers
Resource scarce languages

Show full item record

Files in this item

Peche1_2007.pdf

This item appears in the following Collection(s)

Journal Articles

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.

Phonotactic spoken language identification with limited training data

Phonotactic spoken language identification with limited training data

This item appears in the following Collection(s)

Browse

All of ResearchSpace

This Collection

Quick Links

Legislation and compliance

General Enquiries

Social Connect