1. 浙江工商大学计算机与信息工程学院,浙江,杭州,310018
2. 武汉大学软件工程国家重点实验室,湖北,武汉,430072
3. 武汉大学计算机学院,湖北,武汉,430072
4. 浙江工商大学计算机与信息工程学院,浙江,杭州,310018
5. 武汉大学软件工程国家重点实验室,湖北,武汉,430072
6. 武汉大学计算机学院,湖北,武汉,430072
纸质出版:2014
移动端阅览
潘伟丰, 李兵, 马于涛, 等. 基于加权PageRank算法的关键包识别方法[J]. 电子学报, 2014,42(11):2174-2183.
PAN Wei-feng, LI Bing, MA Yu-tao, et al. Identifying the Key Packages Using Weighted PageRank Algorithm[J]. Acta Electronica Sinica, 2014, 42(11): 2174-2183.
潘伟丰, 李兵, 马于涛, 等. 基于加权PageRank算法的关键包识别方法[J]. 电子学报, 2014,42(11):2174-2183. DOI: 10.3969/j.issn.0372-2112.2014.11.008.
PAN Wei-feng, LI Bing, MA Yu-tao, et al. Identifying the Key Packages Using Weighted PageRank Algorithm[J]. Acta Electronica Sinica, 2014, 42(11): 2174-2183. DOI: 10.3969/j.issn.0372-2112.2014.11.008.
识别软件中的关键实体对于人们理解软件
控制和降低维护费用具有重要意义.然而现有的工作基本都是针对关键类识别的
针对关键包、方法/属性等的研究甚少;同时现有的工作也未能揭示关键类与软件外部质量属性间的关系.为丰富现有的工作
本文提出了一种基于加权PageRank算法的关键包识别方法.该方法用加权有向软件网络模型抽象包粒度软件系统
提出新度量PR(PackageRank)从结构角度量度节点重要性
并引入加权的PageRank算法计算该度量值.数据实验部分以六个开源Java软件为例
分析了包的PR值与常用复杂网络中心性指标(介数中心性、接近中心性、度数中心性等)间的相关性;使用加权的SIR(Susceptible-Infectious-Recovered)模型分析了PR所识别关键包的传播影响
并与其它相关方法进行比较
验证了本文方法的有效性;最后
以其中两个软件为例
分析了包的PR值与包可理解性间的关系
进一步验证了本文方法的有效性.
Identifying key entities has many implications for software understanding and controlling and reducing maintenance costs.However the existing methods only focus on identifying key classes.Little work has been done on the identification of key entities at the other levels.Further the existing work also failed to reveal the relationships between key classes and external quality attributes.In this paper
we introduce a novel method IDEEP (IDEntifying kEy Packages using weighted PageRank algorithm) to identify the key packages.IDEEP uses a weighted and directed software network to describe packages and their dependencies
proposes a new metric PR (PackageRank) to quantify the package importance
and introduces a weighted PageRank algorithm to compute PR values.Our experiments are carried out on six Java software systems.First we analyze the correlation between PR values and other centrality metrics such as betweenness
closeness and degree.Second we use a weighted version of the susceptible-infectious-recovered model to examine the spreading influence of each node.The results show that our method is better than other six methods.Further
we reveal the relationships between key packages and their understandability and show that the key packages identified by our method are more meaningful from a software engineering perspective.
0
浏览量
3
下载量
5
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621