Sub-Band Voice Morphing Algorithm Based on State-Space Model

XU Ning; YANG Zhen; ZHANG Ling-hua


Research paper | Updated: 2025-07-16

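- Sub-Band Voice Morphing Algorithm Based on State-Space Model
- Acta Electronica Sinica, 2010, Vol. 38, No. 3, pp. 646-653
- Author affiliations:
  
  1. Institute of Signal Processing and Transmission, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu 210003
  2. College of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu 210003
- CLC number: TN925
- Print publication: 2010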
XU Ning, YANG Zhen, ZHANG Ling-hua. Sub-Band Voice Morphing Algorithm Based on State-Space Model[J]. Acta Electronica Sinica, 2010, 38(3): 646-653.
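Abstract (translated from the Chinese)

Voice conversion is a technique for changing the vocal characteristics of a speaker. The mainstream approach in this field, full-band parameter mapping based on Gaussian mixture models, introduces inter-frame discontinuities in the spectrum of the converted speech. This paper proposes two improvements to address the problem: first, a state-space model is introduced to capture the dynamic behavior of speech; second, the discrete wavelet transform is used to split the speech parameters into low-frequency and high-frequency sub-bands that are processed separately. Finally, the proposed algorithm is simulated and validated with both subjective and objective experiments.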

Abstract

Voice morphing is a technique for modifying a source speaker's speech so that it sounds as if it were spoken by a designated target speaker. Gaussian mixture model (GMM) based transformations combined with feature parameters extracted from the full band have been commonly studied. However, these methods often introduce problems such as artifacts and discontinuities. To resolve these problems, a state-space model (SSM) is first used to describe the relationship between the source speech and the target speech in the spectral domain. Then the discrete wavelet transform (DWT) is applied to decompose speech signals into sub-bands in order to improve the quality of the converted speech. Finally, experiments using both objective and subjective measurements are conducted to validate the effectiveness of the proposed method.
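
The sub-band idea in the abstract can be illustrated with a minimal sketch. The abstract does not say which wavelet, how many decomposition levels, or which features the authors use, so everything below is an assumption: a single-level Haar DWT applied directly to a sample sequence, with two hypothetical placeholder callables (`convert_low`, `convert_high`) standing in for whatever per-band conversion is applied. It shows only the split, process, and recombine pattern, not the paper's SSM-based mapping.

```python
import math

# Assumption: one-level Haar wavelet; the paper's actual wavelet is not stated.
_S = 1.0 / math.sqrt(2.0)


def haar_dwt(signal):
    """Split an even-length sequence into low-band and high-band coefficients."""
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    pairs = range(0, len(signal), 2)
    low = [(signal[i] + signal[i + 1]) * _S for i in pairs]   # approximation
    high = [(signal[i] - signal[i + 1]) * _S for i in pairs]  # detail
    return low, high


def haar_idwt(low, high):
    """Invert haar_dwt: rebuild the original sequence from the two bands."""
    out = []
    for a, d in zip(low, high):
        out.append((a + d) * _S)
        out.append((a - d) * _S)
    return out


def convert_subbands(signal, convert_low, convert_high):
    """Process each band with its own (hypothetical) conversion, then recombine."""
    low, high = haar_dwt(signal)
    return haar_idwt(convert_low(low), convert_high(high))


if __name__ == "__main__":
    x = [1.0, 3.0, -2.0, 4.0, 0.5, 0.5]
    # Identity conversions: the output should match the input up to rounding.
    y = convert_subbands(x, lambda band: band, lambda band: band)
    print(y)
```

With identity functions for both bands the Haar transform reconstructs the input, so any change in the output comes only from the per-band conversions that are plugged in.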

