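1. School of Information Science and Engineering, Central South University, Changsha, Hunan 410083
2. School of Software, Hunan University, Changsha, Hunan 410082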
Print publication: 2013
OUYANG Liu-bo, ZOU Bei-ji, LIU Li-jie. A Method of Compound Concept Extraction Based on Hybrid Judgment Model[J]. Acta Electronica Sinica, 2013, 41(3): 488-495. DOI: 10.3969/j.issn.0372-2112.2013.03.012.
Existing methods for extracting domain concepts from large-scale domain corpora cannot effectively identify compound concepts. This paper proposes a compound concept extraction method based on a hybrid judgment model. First, the corpus texts are segmented into words, an entry label is attached to each term, and the resulting entry set is cleaned by removing noise words and merging synonyms. Next, the weighted term frequency of each term is counted, and the location affinity degree and location matching degree are computed from the entry label values; these measures are applied stepwise to judge and select the atomic terms that can be combined into compound concepts. Finally, multiple-compound concepts are extracted by setting different compound depth values. Extraction experiments on corpora of different sizes show that the proposed method achieves higher recall and precision.
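The abstract does not give the formulas for the weighted term frequency, the location affinity degree, or the location matching degree, so the following is only a minimal sketch of the general idea of depth-controlled compounding, not the authors' model. It assumes the text is already segmented into term lists, and it swaps in a simple hypothetical adjacency score (how often two terms appear side by side, relative to the rarer term's frequency) for the paper's measures. The names `merge_pass`, `extract_compounds`, `min_count`, and `min_affinity` are illustrative assumptions.

```python
from collections import Counter


def merge_pass(sentences, min_count=2, min_affinity=0.5):
    """One compounding pass: join adjacent terms that co-occur often enough.

    The score below is a hypothetical stand-in, NOT the paper's location
    affinity or matching degree, which the abstract does not define.
    """
    term_freq = Counter(t for s in sentences for t in s)
    pair_freq = Counter(
        (s[i], s[i + 1]) for s in sentences for i in range(len(s) - 1)
    )
    # Accept a pair when it is frequent and accounts for a large share
    # of the occurrences of its rarer member.
    accepted = {
        pair
        for pair, n in pair_freq.items()
        if n >= min_count
        and n / min(term_freq[pair[0]], term_freq[pair[1]]) >= min_affinity
    }
    merged = []
    for s in sentences:
        out, i = [], 0
        while i < len(s):
            if i + 1 < len(s) and (s[i], s[i + 1]) in accepted:
                # Greedy left-to-right merge of an accepted pair.
                out.append(s[i] + " " + s[i + 1])
                i += 2
            else:
                out.append(s[i])
                i += 1
        merged.append(out)
    return merged


def extract_compounds(sentences, depth=2, **kwargs):
    """Run `depth` merge passes and return the compound terms found.

    Assumes atomic terms contain no spaces, so a space marks a compound.
    """
    current = sentences
    for _ in range(depth):
        current = merge_pass(current, **kwargs)
    return sorted({t for s in current for t in s if " " in t})


if __name__ == "__main__":
    corpus = [
        ["support", "vector", "machine", "model"],
        ["support", "vector", "machine", "training"],
        ["vector", "space"],
    ]
    print(extract_compounds(corpus, depth=1))
    print(extract_compounds(corpus, depth=2))
```

Each pass can only join two neighbouring units, so a larger depth value lets previously merged compounds combine again into longer ones, which mirrors the role the abstract assigns to the compound depth.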