Synthetic triphones from trajectory-based feature distributions

dc.contributor.authorBadenhorst, J
dc.contributor.authorDavel, MH
dc.date.accessioned2016-08-22T11:36:42Z
dc.date.available2016-08-22T11:36:42Z
dc.date.issued2015-11
dc.descriptionPattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobTech), Port Elizabeth, South Africa, 25-26 November 2015en_US
dc.description.abstractWe experiment with a new method to create synthetic models of rare and unseen triphones in order to supplement limited automatic speech recognition (ASR) training data. A trajectory model is used to characterise seen transitions at the spectral level, and these models are then used to create features for unseen or rare triphones. We find that a fairly restricted model (piece-wise linear with three line segments per channel of a diphone transition) is able to represent training data quite accurately. We report on initial results when creating additional triphones for a single-speaker data set, finding small but significant gains, especially when adding additional samples of rare (rather than unseen) triphones.en_US
dc.identifier.apacitationBadenhorst, J., & Davel, M. (2015). Synthetic triphones from trajectory-based feature distributions. IEEE. http://hdl.handle.net/10204/8737en_ZA
dc.identifier.chicagocitationBadenhorst, J, and MH Davel. "Synthetic triphones from trajectory-based feature distributions." (2015): http://hdl.handle.net/10204/8737en_ZA
dc.identifier.citationBadenhorst, J and Davel, MH. 2015. Synthetic triphones from trajectory-based feature distributions. In: Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobTech), Port Elizabeth, South Africa, 25-26 November 2015en_US
dc.identifier.ris TY - Conference Presentation AU - Badenhorst, J AU - Davel, MH AB - We experiment with a new method to create synthetic models of rare and unseen triphones in order to supplement limited automatic speech recognition (ASR) training data. A trajectory model is used to characterise seen transitions at the spectral level, and these models are then used to create features for unseen or rare triphones. We find that a fairly restricted model (piece-wise linear with three line segments per channel of a diphone transition) is able to represent training data quite accurately. We report on initial results when creating additional triphones for a single-speaker data set, finding small but significant gains, especially when adding additional samples of rare (rather than unseen) triphones. DA - 2015-11 DB - ResearchSpace DP - CSIR KW - Synthetic triphones KW - Trajectory modelling KW - Trajectory-based features KW - Feature distributions KW - Feature construction LK - https://researchspace.csir.co.za PY - 2015 T1 - Synthetic triphones from trajectory-based feature distributions TI - Synthetic triphones from trajectory-based feature distributions UR - http://hdl.handle.net/10204/8737 ER - en_ZA
dc.identifier.urihttp://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7359509&tag=1
dc.identifier.urihttp://hdl.handle.net/10204/8737
dc.identifier.vancouvercitationBadenhorst J, Davel M, Synthetic triphones from trajectory-based feature distributions; IEEE; 2015. http://hdl.handle.net/10204/8737 .en_ZA
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.ispartofseriesWorkflow;16011
dc.subjectSynthetic triphonesen_US
dc.subjectTrajectory modellingen_US
dc.subjectTrajectory-based featuresen_US
dc.subjectFeature distributionsen_US
dc.subjectFeature constructionen_US
dc.titleSynthetic triphones from trajectory-based feature distributionsen_US
dc.typeConference Presentationen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Badenhorst_2015.pdf
Size:
136.01 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: