电子学报 ›› 2015, Vol. 43 ›› Issue (8): 1575-1582.DOI: 10.3969/j.issn.0372-2112.2015.08.016

• 学术论文 • 上一篇    下一篇

大规模复杂网络下重叠社区的识别

王诗懿, 董一鸿, 李志超, 陈华辉, 钱江波   

  1. 宁波大学信息科学与工程学院, 浙江宁波 315211
  • 收稿日期:2014-09-25 修回日期:2015-02-25 出版日期:2015-08-25
    • 通讯作者:
    • 董一鸿
    • 作者简介:
    • 王诗懿 女,1988年生于河南周口.现为宁波大学信息科学与工程学院硕士研究生.主要研究方向为大数据、数据挖掘. E-mail:zwb_wsy@126.com
    • 基金资助:
    • 国家自然科学基金 (No.61472194); 浙江省自然科学基金 (No.LY13F020040); 宁波市自然科学基金 (No.2013A610063,No.2014A610023)

The Identification of Overlapping Communities in Large-Scale Complex Networks

WANG Shi-yi, DONG Yi-hong, LI Zhi-chao, CHEN Hua-hui, QIAN Jiang-bo   

  1. Department of Information Science and Engineering, Ningbo University, Ningbo, Zhejiang 315211, China
  • Received:2014-09-25 Revised:2015-02-25 Online:2015-08-25 Published:2015-02-09
    • Supported by:
    • National Natural Science Foundation of China (No.61472194); National Natural Science Foundation of Zhejiang Province,  China (No.LY13F020040); Ningbo Natural Science Fund (No.2013A610063, No.2014A610023)

摘要:

随着网络规模的不断扩大,经典的复杂网络重叠社识别算法已不能高效处理现有的大规模网络图数据.本文在GraphLab并行计算模型上提出了基于重要节点扩展的重叠社区识别算法DOCVN (Detecting the Overlapping Community algorithm based on Vital Node Expanding in GraphLab).算法选取网络中PageRank值大的节点作为重要节点,计算其他节点归属于重要节点的节点归属度,并以重要节点为中心形成核心社区及扩展社区,最后根据重要节点间的连接紧密度合并核心社区及扩展社区,并计算出每个节点在所属社区里的节点重要度,实现了大规模网络的重叠社区识别.实验表明该算法与PD (Propinquity Dynamics)等现有并行算法相比更能有效地识别大规模网络的重叠社区结构.

关键词: 大规模复杂网络, GraphLab, 重叠社区识别, 社会网络, 核心社区

Abstract:

With the unceasing expanding of network scale, many classic detection algorithms of overlapping communities cannot work efficiently in large-scale complex network.Detecting the overlapping community algorithm based on vital node expanding in parallel framework GraphLab (DOCVN) is introduced to identify the overlapping communities.In this algorithm, nodes with high PageRank value are regarded as vital nodes, and then the affiliation degree of other nodes to these vital nodes are computed.After that, kernel communities and expanding communities are identified respectively.Finally, the kernel communities and expanding communities are combined into some overlapping communities by judging whether they connect tightly.And the importance weight of each node in its community is also computed.Experimental results show that the algorithm is more effective than the existing parallel algorithms like PD (Propinquity Dynamics) to identify large-scale overlapping communities.

Key words: large-scale complex network, GraphLab, overlapping community identification, social network, kernel community

中图分类号: