Multilingual speaker age recognition: regression analyses on the Lwazi corpus

Feld, M; Barnard, E; Van Heerden, C; Muller, C

http://hdl.handle.net/10204/5506

Abstract:

Multilinguality represents an area of significant opportunity for automatic speech-processing systems: whereas multilingual societies are commonplace, the majority of speech-processing systems are developed with a single language in mind. As a step towards an improved understanding of multilingual speech processing, this contribution investigates how an important para-linguistic aspect of speech, namely speaker age, depends on the language spoken. In particular, we study how certain speech features affect the performance of an age recognition system for different South African languages in the Lwazi corpus. By optimizing our feature set and performing language-specific tuning, we are working towards true multilingual classifiers. Because ASR and dialog systems are closely related to speaker classification, they are likely to benefit from improved classification of the speaker. In a comprehensive corpus analysis of long-term features, we have identified features that exhibit characteristic behaviors for particular languages. In a follow-up regression experiment, we confirm the suitability of our feature selection for age recognition and present cross-language error rates. The mean absolute error ranges between 7.7 and 12.8 years for same-language predictors and rises to 14.5 years for cross-language predictors.
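The error figures in the abstract are mean absolute errors (MAE), that is, the average absolute difference in years between a speaker's true age and the age predicted by the regressor. The following is a minimal sketch of that metric only, not the authors' evaluation code. The function name and all age values are hypothetical and chosen purely for illustration; none of them come from the Lwazi corpus.

```python
from statistics import mean


def mean_absolute_error(true_ages, predicted_ages):
    """Average absolute difference, in years, between true and predicted ages."""
    if len(true_ages) != len(predicted_ages):
        raise ValueError("inputs must have the same length")
    if not true_ages:
        raise ValueError("inputs must be non-empty")
    return mean(abs(t - p) for t, p in zip(true_ages, predicted_ages))


# Hypothetical ages and predictions (assumptions, not corpus data).
true_ages = [20, 30, 40, 50]
same_language_predictions = [28, 25, 47, 40]   # predictor trained on the test language
cross_language_predictions = [35, 44, 25, 36]  # predictor trained on another language

same_mae = mean_absolute_error(true_ages, same_language_predictions)
cross_mae = mean_absolute_error(true_ages, cross_language_predictions)
```

Under this reading, a same-language result is the MAE of a predictor evaluated on the language it was trained on, and a cross-language result is the MAE of the same kind of predictor evaluated on a different language.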

Reference:

Feld, M, Barnard, E, Van Heerden, C and Muller, C. 2009. Multilingual speaker age recognition: regression analyses on the Lwazi corpus. 2009 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU-09), Merano, Italy, 13-17 December 2009
