汉语连续语音识别中上下文相关的识别单元(三音子)的研究

赵庆卫; 王作英; 陆大

您当前的位置：

首页 >

文章列表页 >

汉语连续语音识别中上下文相关的识别单元(三音子)的研究

更新时间：2025-12-08

- 汉语连续语音识别中上下文相关的识别单元(三音子)的研究
- Research on Context-Dependent Acoustical Unit （Triphone） for Mandarin Continuous Speech Recognition
- 电子学报 1999年第6期页码：79-82
- 作者机构：
  
  1. 清华大学电子工程系!北京
  2. 100084
- 作者简介：
- 基金信息：
- DOI：
  中图分类号： TN912.3
- 纸质出版：1999
- 稿件说明：
移动端阅览
[1]赵庆卫,王作英,陆大　.汉语连续语音识别中上下文相关的识别单元(三音子)的研究[J].电子学报,1999(06):79-82+117.

赵庆卫, 王作英, 陆大. Research on Context-Dependent Acoustical Unit （Triphone） for Mandarin Continuous Speech Recognition[J]. Acta Electronica Sinica, 1999, (6): 79-82.
[1]赵庆卫,王作英,陆大　.汉语连续语音识别中上下文相关的识别单元(三音子)的研究[J].电子学报,1999(06):79-82+117. DOI：

赵庆卫, 王作英, 陆大. Research on Context-Dependent Acoustical Unit （Triphone） for Mandarin Continuous Speech Recognition[J]. Acta Electronica Sinica, 1999, (6): 79-82. DOI：

摘要

本文详细研究了汉语语音识别中如何有效地建立上下文相关的识别单元，以解决连续语音之间的协同发音问题．本文首先利用信息论原理，研究了传统的聚类算法的距离测度，分别是模型分布的散度和模型合并或分裂前后熵的变化值．然后本文提出了基于决策树的聚类方法，它的主要优点是充分利用了语音学知识，聚类后得到的模型可推广性好，尤其适用于集外语料中出现大量的未在训练语料中出现的三音子单元的情况．接着介绍了模型聚类和训练的实验步骤最后，非特定人大词汇量连续语音识别的实验表明，基于决策树的聚类方法所得到的识别单元，当识别集外语料时使系统的误识率降低了7．95％，而基于合并的聚类方法所得到的识别单元只降低了2．63％.

Abstract

The problem on building context dependent model in continuous mandarin speech recognition in order to avoid coarticulatory effects is descussed in detail in this paper. On the basis of information theory

the distance metric of the traditional clustering algorithm is first studied

which is the divergence of the model distribution and the difference in entropy result from model merging or splitting. Then the clustering algorithm based on decision tree is presented

which makes full use of the phonological rules. The model obtained from it is easy to be generalized

and this method demonstrates especially better when many triphones emerge that are not covered in the training material. In addition

the clustering and training procedure is discussed. At last

the speaker independent large vocabulary continuous speech recognition experiment shows that

if the recognition material is different from the training material

the recognition model obtained from the decision-tree-based clustering algoyithm reduces the error rate by 7. 95 %. However

the recognition model obtained from the traditional merge algorithm reduces the error rate only by 2. 63 %.

关键词

Keywords

references

浏览量

251

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

区域和邻域级信息相结合的加强型PFCM含噪图像分割算法

一种同型空时分组码的识别算法

基于适应度指导交配限制策略的重组算子与多目标优化研究

基于高斯相似度分析的插值自适应算法

基于聚类算法的最优子阵划分方法研究