Synthetic triphones from trajectory-based feature distributions

Badenhorst, J; Davel, MH

Synthetic triphones from trajectory-based feature distributions

http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7359509&tag=1
http://hdl.handle.net/10204/8737

Abstract:

We experiment with a new method to create synthetic models of rare and unseen triphones in order to supplement limited automatic speech recognition (ASR) training data. A trajectory model is used to characterise seen transitions at the spectral level, and these models are then used to create features for unseen or rare triphones. We find that a fairly restricted model (piece-wise linear with three line segments per channel of a diphone transition) is able to represent training data quite accurately. We report on initial results when creating additional triphones for a single-speaker data set, finding small but significant gains, especially when adding additional samples of rare (rather than unseen) triphones.

Reference:

Badenhorst, J and Davel, MH. 2015. Synthetic triphones from trajectory-based feature distributions. In: Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobTech), Port Elizabeth, South Africa, 25-26 November 2015

Badenhorst, J., & Davel, M. (2015). Synthetic triphones from trajectory-based feature distributions. IEEE. http://hdl.handle.net/10204/8737

Badenhorst, J, and MH Davel. "Synthetic triphones from trajectory-based feature distributions." (2015): http://hdl.handle.net/10204/8737

Badenhorst J, Davel M, Synthetic triphones from trajectory-based feature distributions; IEEE; 2015. http://hdl.handle.net/10204/8737 .

Download RIS

Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobTech), Port Elizabeth, South Africa, 25-26 November 2015

Badenhorst, J
Davel, MH

Nov 2015

Synthetic triphones
Trajectory modelling
Trajectory-based features
Feature distributions
Feature construction

Show full item record

Files in this item

Badenhorst_2015.pdf

This item appears in the following Collection(s)

Conference Publications

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.

Synthetic triphones from trajectory-based feature distributions

Synthetic triphones from trajectory-based feature distributions

This item appears in the following Collection(s)

Browse

All of ResearchSpace

This Collection

Quick Links

Legislation and compliance

General Enquiries

Social Connect