

浏览全部资源
扫码关注微信
上海电机学院电子信息学院,上海,201306
Published:2018
移动端阅览
Mass of Short Texts Topical Hierarchy Mining Integrated Anchor Extraction[J]. Acta Electronica Sinica, 2018, 46(5): 1084-1088.
Mass of Short Texts Topical Hierarchy Mining Integrated Anchor Extraction[J]. Acta Electronica Sinica, 2018, 46(5): 1084-1088. DOI: 10.3969/j.issn.0372-2112.2018.05.009.
从短文本集中挖掘不同粒度的主题、构建主题的层次结构在舆情分析、视觉检测、语义挖掘和图谱构建等方面具有重要应用.围绕如何从短文本集中分层次地挖掘主题,在修改传统短语定义的基础上,提出了融合锚词抽取的海量短文本主题层次挖掘框架.提出的主题层次挖掘框架首先基于词共现图实现主题推断和锚词抽取;然后,应用关联规则挖掘频繁锚词短语;最后,采用排序方法量化锚词短语以寻找最具代表性的主题短语.与已有的基于词共现图构建主题层次的方法相比,融合了锚词抽取的词共现图分析方法更有利于构建层次更高的主题.在2个实际的中文短文本数据集上执行实验,结果表明提出的方法挖掘的短语能较好地解释主题和用于分类预测.
A topical hierarchy at different levels of granularity from short texts has many valuable applications in the areas of opinion analysis
vision detection
semantics mining and graph construction.Aiming at how to mine the hierarchy of topics from short texts
a topical hierarchy mining framework integrated anchor extraction is proposed based on the modification of the tradition phrase definition.Firstly
the topic inference and the anchor extraction are conducted in the proposed framework.Secondly
frequent anchor phrases are found by applying associate rule mining.Finally
a kind of rank method is used to quantity the criterion of anchor phrases in order to find the most representative topical phrase by ranking.Compared to the topic analysis method of the word co-occurrence graph
the word co-occurrence integrated into anchor is more beneficial to build the higher level of topics.Experiments with datasets from the two Chinese short texts are performed
and the results show that the proposed method can generate interpretably phrases and be used for classification prediction.
0
Views
5
下载量
2
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621