1. 南京邮电大学信号处理与传输研究院,江苏,南京,210003
2. 南京邮电大学通信与信息工程学院,江苏,南京,210003
纸质出版:2010
移动端阅览
FONT face, Verdana, 徐 宁, 等. 基于状态空间模型的子频带语音转换算法[J]. 电子学报, 2010,38(3):646-653.
XU Ning, YANG Zhen, ZHANG Ling-hua. Sub-Band Voice Morphing Algorithm Based on State-Space Model[J]. Acta Electronica Sinica, 2010, 38(3): 646-653.
语音转换是一项改变说话人声音特征的技术,该领域主流方法——基于高斯混合模型的全频带参数映射,会导致转换后的语音频谱产生帧间不连续性。本文针对以上问题提出了改进方案:首先引入状态空间模型来模拟语音动态变化特性,其次利用离散小波变换对语音低频和高频部分的参数分为子频带处理。文章最后用主观和客观实验对提出的算法进行的实验仿真和验证。
<FONT face=Verdana><FONT face=Verdana>
Voice morphing is a technique to modify a source speaker’s speech to sound as if it was spoken by some designated target speaker. The Gaussian mixture model (GMM) based transformations combined with full-band extracted feature parameters have been commonly studied. However
these methods often introduce problems such as artifacts and discontinuities. In order to resolve the problem mentioned above
state-space model (SSM) is first used to describe the relationship between the source speech and the target speech in the spectral domain. Then Discrete Wavelet Transform (DWT) is applied to decompose speech signals into sub-bands in order to improve the quality of the converted speech. Finally
experiments using both objective and subjective measurements are conducted to validate the effectiveness of the proposed method..
0
浏览量
1344
下载量
5
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621