QIAN Zhao-peng, XIAO Ke-jing, LIU Chan, et al. Voice Conversion for Enhancing Mandarin Electro-Laryngeal Speech Based on Semantic Information[J]. Acta Electronica Sinica, 2020, 48(5): 840-845.
DOI:
QIAN Zhao-peng, XIAO Ke-jing, LIU Chan, et al. Voice Conversion for Enhancing Mandarin Electro-Laryngeal Speech Based on Semantic Information[J]. Acta Electronica Sinica, 2020, 48(5): 840-845. DOI: 10.3969/j.issn.0372-2112.2020.05.002.
Voice Conversion for Enhancing Mandarin Electro-Laryngeal Speech Based on Semantic Information
The Electro-Laryngeal (EL) speech has some drawbacks such as single fundamental frequency
mechanical sound and large radiation noise. The drawbacks affect the intelligibility and naturalness of the EL speech. Especially
the tonal language such as Mandarin EL speech would be worse understanding. In this paper
the spelling corrector for pinyin and the tone labelling tool are designed to solve the problems that Mandarin EL speech recognition has some errors in consonants and the recognition result has no tone. The result is synthesized into the healthy speech by TTS based on Tacotron-2. The objective evaluation results show that the accuracy of pinyin spelling corrector has been improved; the accuracy of tone labelling under contextual environment is very high. The subjective results shows the proposed method can improve the intelligibility and naturalness of the EL speech a lot. The results illustrate that the proposed method can convert the EL speech without tone into the healthy speech. And the proposed method performs better than the traditional method based on speech signal processing.