1. 哈尔滨工业大学自动化测试与控制系!哈尔滨
2. 150001
纸质出版:1998
移动端阅览
[1]赵以宝,孙圣和.一种基于单字统计二元文法的自组词音字转换算法[J].电子学报,1998(10):55-59.
赵以宝, 孙圣和. A Word-Self-Made Chinese Phonetic-Character Conversion Algorithm Based on Chinese Character Bigram[J]. Acta Electronica Sinica, 1998, (10): 55-59.
音字转换在语音识别和汉字语句键盘输入方面都占有很重要的地位.现在比较流行的方法是基于大语料统计的Markov模型的音字转换方法其中基于单字N元文法的音字转换算法具有数据量少、算法简单的优点.但转换准确率却较低;而基于词N元文法的音字转换算法则正好相反本文在基于单字统计Bigram算法的基础上提出了一种自组词的音字转换方法,不仅具有单字Brgram方法的占空间少的优点.而且又可充分利用基于词Bigram算法的优点,实验表明该方法容易实现而且具有较高的转换准确率.
Chinese Phonetic-Character Conversion(CPCC) is an important issue in speech recognition and Chinese sentence keyboard input system.The appoaches based on large amount of corpus statisties Markov models become more and more popular today
The CPCC based on Chinese character N-gram (C-CPCC) has the advantage Of having a smaller statistics data library and simple algorithm. but has the drawback of lower accuracy of conversion
while that based on Chinese word N-gram(W-CPCC) is on the contrary.This paper presents a word-self-made CPCC algorithm based on the Chinese Character Bigram. which not only has foe C-CPCC’s advantage of having a smaller statistics data library
but also can take advantage of the W-CPCC. The experiment shows it can be easily realized with a higher accuracy of conversion.
0
浏览量
147
下载量
2
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621