School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China
Print publication: 2009
ZHAO Shi-qi, LIU Ting, LI Sheng. Lexical Paraphrasing Based on Automatically Constructed Corpora[J]. Acta Electronica Sinica, 2009, 37(5): 975-980.
This paper presents a new method for lexical paraphrasing. The method first uses a translation engine to automatically convert a bilingual parallel corpus into a monolingual parallel corpus, which serves as a paraphrase corpus from which candidate paraphrases for words are extracted. On this basis, a new statistical model is proposed that selects the most suitable paraphrase for a word given its context sentence. Experimental results show that the automatically constructed paraphrase corpus is effective for extracting lexical paraphrases. In addition, the proposed model significantly outperforms two conventional models, improving precision and recall by about 10% each.