Sub-Band Voice Morphing Algorithm Based on State-Space Model

XU Ning; YANG Zhen; ZHANG Ling-hua

您当前的位置：

首页 >

文章列表页 >

Sub-Band Voice Morphing Algorithm Based on State-Space Model

更新时间：2025-07-16

- Sub-Band Voice Morphing Algorithm Based on State-Space Model
- Acta Electronica Sinica Vol. 38, Issue 3, Pages: 646-653(2010)
- 作者机构：
  
  1. 南京邮电大学信号处理与传输研究院,江苏,南京,210003
  2. 南京邮电大学通信与信息工程学院,江苏,南京,210003
- 作者简介：
- 基金信息：
- DOI：
  CLC： TN925
- Published：2010
- 稿件说明：
移动端阅览
XU Ning, YANG Zhen, ZHANG Ling-hua. Sub-Band Voice Morphing Algorithm Based on State-Space Model[J]. Acta Electronica Sinica, 2010, 38(3): 646-653.
DOI：

XU Ning, YANG Zhen, ZHANG Ling-hua. Sub-Band Voice Morphing Algorithm Based on State-Space Model[J]. Acta Electronica Sinica, 2010, 38(3): 646-653. DOI：

摘要

语音转换是一项改变说话人声音特征的技术，该领域主流方法——基于高斯混合模型的全频带参数映射，会导致转换后的语音频谱产生帧间不连续性。本文针对以上问题提出了改进方案：首先引入状态空间模型来模拟语音动态变化特性，其次利用离散小波变换对语音低频和高频部分的参数分为子频带处理。文章最后用主观和客观实验对提出的算法进行的实验仿真和验证。

Abstract

Voice morphing is a technique to modify a source speaker’s speech to sound as if it was spoken by some designated target speaker. The Gaussian mixture model (GMM) based transformations combined with full-band extracted feature parameters have been commonly studied. However

these methods often introduce problems such as artifacts and discontinuities. In order to resolve the problem mentioned above

state-space model (SSM) is first used to describe the relationship between the source speech and the target speech in the spectral domain. Then Discrete Wavelet Transform (DWT) is applied to decompose speech signals into sub-bands in order to improve the quality of the converted speech. Finally

experiments using both objective and subjective measurements are conducted to validate the effectiveness of the proposed method..

关键词

Keywords

references

Views

1344

下载量

CSCD

Alert me when the article has been cited

提交

Tools

Publicity Resources

Structural α-Entropy Weighting Gaussian Mixture Model for Subspace Clustering

The Hankel Matrix Decomposition Method of Cross-High-Order Cumulant Based on The Cross-High-Order Cumulant of State Space Model of Harmonic Retrieval

Voice Conversion Technology and Its Development

Blind Estimation of Parameters in Gaussian Noise

Related Author

LI Kai

ZHANG Ke-xin

ZHANG Li-yan

LIU Xin

BAO Chang-chun

ZHANG Xing-tao

ZHANG Li-li

SHI Yao-wu

Related Institution

Hebei Machine Vision Engineering Research Center

School of Cyber Security and Computer, Hebei University

School of Electronic Information and Control Engineering, Beijing University of Technology

College of Commun.Eng.,Jilin University

School of Electronics Information and Control EngineeringBeijing University of TechnologyBeijing 100022China

⁰