基于三对角和共享分块对角转换矩阵的快速说话人自适应方法

丁国宏; 徐 波

您当前的位置：

首页 >

文章列表页 >

基于三对角和共享分块对角转换矩阵的快速说话人自适应方法

论文 | 更新时间：2025-07-16

- 基于三对角和共享分块对角转换矩阵的快速说话人自适应方法
- Fast Speaker Adaptation Based on Triple Diagonal Transform Matrices and Shared Block Matrices
- 电子学报 2004年32卷第10期页码：1709-1712
- 作者机构：
  
  1. 中国科学院自动化研究所高技术创新中心,北京,100080
  2. 中国科学院自动化研究所模式识别国家重点实验室,北京,100080
  3. 中国科学院自动化研究所高技术创新中心北京,100080
  4. 中国科学院自动化研究所模式识别国家重点实验室北京,100080
- 作者简介：
- 基金信息：
- DOI：
  中图分类号： TP301.6
- 纸质出版：2004
- 稿件说明：
移动端阅览
丁国宏, 徐波. 基于三对角和共享分块对角转换矩阵的快速说话人自适应方法[J]. 电子学报, 2004,32(10):1709-1712.

DING Guo-hong, XU Bo. Fast Speaker Adaptation Based on Triple Diagonal Transform Matrices and Shared Block Matrices[J]. Acta Electronica Sinica, 2004, 32(10): 1709-1712.
丁国宏, 徐波. 基于三对角和共享分块对角转换矩阵的快速说话人自适应方法[J]. 电子学报, 2004,32(10):1709-1712. DOI：

DING Guo-hong, XU Bo. Fast Speaker Adaptation Based on Triple Diagonal Transform Matrices and Shared Block Matrices[J]. Acta Electronica Sinica, 2004, 32(10): 1709-1712. DOI：

摘要

本文提出了两种在最大似然线性回归(MLLR)框架下实现快速说话人自适应的方法.这两种方法在本文中分别称为Log-谱域下基于三对角转换矩阵的说话人自适应(SATD)和倒谱域下基于共享分块对角转换矩孟加拉国说话人自适应(SASBD).这两种方法在一定先验知识的基础上采用较少的参数来描述说话人间的差异

因而只需要少量的自适应数据就可以得到参数的鲁棒估计.在以整词建模的孤立词识别系统和以三音子建模的孤立词识别系统上分别进行的测试表明所提出的方法相对传统的MLLR自适应方法有较快的自适应性能.

Abstract

In the Maximum Likelihood Linear Regression (MLLR) framework

this paper proposes two fast speaker adaptation approaches

which are called Speaker Adaptation using Triple Diagonal matrices in the log-spectral domain (SATD) and Speaker Adaptation using Shared Block Diagonal matrices (SASBD) in the cepstral domain

respectively.Based on some prior knowledge

the proposed approaches utilize fewer parameters to describe the variation between speakers

and thus fewer adaptation data are needed to give robust estimation.Experimental results in both the whole-word-modeled isolated word recognition system and the isolated word recognition system using triphones as modeling units show that the proposed approaches can provide faster performance than the traditional MLLR approaches.

关键词

Keywords

references

浏览量

916

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

暂无数据