Barnard, EDavel, M2007-07-272007-07-272006-11Barnard, E and Davel, M. 2006. Automatic error detection in alignments for speech synthesis. 17th Annual Symposium of the Pattern Recognition Association of South Africa, Parys, South Africa, 29 Nov - 1 Dec 2006, pp 4978-0-620-37384-5http://hdl.handle.net/10204/1044The phonetic segmentation of recorded speech is a crucial factor in the quality of concatenative systems for speech synthesis. The authors describe a likelihood-based error detection process that can be used to flag possible errors in such a segmentation, with a view towards manual correction. It is shown that this process can be used to assist in the creation of high-accuracy segmentations. In particular, for an isiZulu corpus used in the creation of a unit-selection synthesizer, almost half of the errors that existed in a manual segmentation were detected by this process, while flagging less than a quarter of all segments. Different phoneme classes are handled with differing amounts of success, with vowels being the most troublesomeenSpeech synthesizerPhonetic segmentationError detectionUnit-selection synthesizerAutomatic error detection in alignments for speech synthesisConference PresentationBarnard, E., & Davel, M. (2006). Automatic error detection in alignments for speech synthesis. http://hdl.handle.net/10204/1044Barnard, E, and M Davel. "Automatic error detection in alignments for speech synthesis." (2006): http://hdl.handle.net/10204/1044Barnard E, Davel M, Automatic error detection in alignments for speech synthesis; 2006. http://hdl.handle.net/10204/1044 .TY - Conference Presentation AU - Barnard, E AU - Davel, M AB - The phonetic segmentation of recorded speech is a crucial factor in the quality of concatenative systems for speech synthesis. The authors describe a likelihood-based error detection process that can be used to flag possible errors in such a segmentation, with a view towards manual correction. It is shown that this process can be used to assist in the creation of high-accuracy segmentations. In particular, for an isiZulu corpus used in the creation of a unit-selection synthesizer, almost half of the errors that existed in a manual segmentation were detected by this process, while flagging less than a quarter of all segments. Different phoneme classes are handled with differing amounts of success, with vowels being the most troublesome DA - 2006-11 DB - ResearchSpace DP - CSIR KW - Speech synthesizer KW - Phonetic segmentation KW - Error detection KW - Unit-selection synthesizer LK - https://researchspace.csir.co.za PY - 2006 SM - 978-0-620-37384-5 T1 - Automatic error detection in alignments for speech synthesis TI - Automatic error detection in alignments for speech synthesis UR - http://hdl.handle.net/10204/1044 ER -