1. 吉林大学计算机科学与技术学院,吉林,长春,130012
2. 符号计算与知识工程教育部重点实验室(吉林大学),吉林,长春,130012
3. 吉林大学计算机科学与技术学院,吉林,长春,130012
4. 符号计算与知识工程教育部重点实验室(吉林大学),吉林,长春,130012
网络出版:2018-02-25,
纸质出版:2018
移动端阅览
彭涛, 杨妮亚, 徐原博, 等. 双类型异质网中基于排序和聚类的离群点检测方法[J]. 电子学报, 2018,46(2):281-288.
PENG Tao, YANG Ni-ya, XU Yuan-bo, et al. An Outlier Detection Method Based on Ranking and Clustering in Bi-typed Heterogeneous Network[J]. Acta Electronica Sinica, 2018, 46(2): 281-288.
彭涛, 杨妮亚, 徐原博, 等. 双类型异质网中基于排序和聚类的离群点检测方法[J]. 电子学报, 2018,46(2):281-288. DOI: 10.3969/j.issn.0372-2112.2018.02.004.
PENG Tao, YANG Ni-ya, XU Yuan-bo, et al. An Outlier Detection Method Based on Ranking and Clustering in Bi-typed Heterogeneous Network[J]. Acta Electronica Sinica, 2018, 46(2): 281-288. DOI: 10.3969/j.issn.0372-2112.2018.02.004.
挖掘隐藏在网络中不同于正常数据对象的离群点是数据挖掘的重要任务之一.目前,针对双类型异质信息网络离群点检测的研究工作相对较少,原本适用于同质网络的离群点检测方法将很难适用于双类型异质网络.为此,提出了异质信息网络中基于排序和聚类的离群点检测方法(RKBOutlier).从异质信息网络中抽取两种类型的对象以及链接两种对象的语义信息,将待检测的数据作为属性对象,将另一类型数据作为目标对象,对目标对象进行聚类来检测属性对象在各个聚类中的分布情况,数据分布异常的对象即为离群点.将排序和聚类相结合来显著提高聚类的准确度.实验结果表明,RKBOutlier可以在双类型异质信息网络中有效地检测出离群点.
Mining the outliers that are different from normal data objects in the network is one of the important tasks in data mining. At present
the research aiming at outlier detection in bi-typed heterogeneous information network is relatively small. The methods which are applicable to homogeneous network can not be applied to bi-typed heterogeneous networks. Therefore
we propose a Rank-Kmeans Based Outlier detection method
called RKBOutlier
in heterogeneous information network. The two kinds of the objects and the connected semantic information are extracted from the heterogeneous information network. One type of the objects is regarded as the attribute objects
another type of the objects is regarded as the target objects. We perform cluster partitioning on target objects to detect the distribution of the attribute objects in each cluster. The objects which are abnormal at data distribution are considered to be the outliers. Ranking and clustering are combined to significantly improve the accuracy of clustering. The experimental results show that RKBOutlier can effectively detect outliers in bi-typed heterogeneous information network.
0
浏览量
298
下载量
2
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621