Construct Knowledge Graph for Exploratory Bug Issue Searching
SUN Xiao-bing1,2, WANG Lu1, WANG Jing-wei1, LI Bin1, LI Yu1
1. School of Information Engineering, Yangzhou University, Yangzhou, Jiangsu 225127, China;
2. State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu 210023, China
Abstract: Bug issues are inevitable in software development and maintenance. However, software repositories do not record explicit correspondences between bugs and commits. Moreover, as the volume of bug reports and commit information grows, searching for bugs in a software repository becomes more difficult and costly. In this paper, we propose an exploratory search approach for bug issues based on a knowledge graph. We build a bug knowledge graph from bug reports, commits, and the related developers (such as bug reporters and committers) and combine it with the idea of exploratory search. The approach not only helps software developers search for bug issues accurately, but also provides the relevant information needed to explore those issues, so that developers can understand and resolve them more effectively.
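To make the idea concrete, the following is a minimal sketch of a bug knowledge graph, not the authors' implementation. It assumes the graph can be modeled as a small in-memory set of (subject, relation, object) triples. The class name `BugKnowledgeGraph`, the relation names (`reports`, `fixes`, `authors`), and the sample entities are all hypothetical and chosen only for illustration. A keyword search finds an entry point, and exploration then follows relations to neighboring entities.

```python
from collections import defaultdict


class BugKnowledgeGraph:
    """Toy triple store linking bug reports, commits, and developers.

    Hypothetical illustration only; the paper's actual schema and
    storage backend are not specified in the abstract.
    """

    def __init__(self):
        self.edges = defaultdict(set)  # entity -> {(relation, neighbor)}
        self.text = {}                 # entity -> lowercased description

    def add_entity(self, entity, description=""):
        self.text[entity] = description.lower()

    def add_relation(self, subject, relation, obj):
        # Store both directions so exploration can start from either end.
        self.edges[subject].add((relation, obj))
        self.edges[obj].add(("inverse:" + relation, subject))

    def search(self, keyword):
        """Return entities whose description contains the keyword."""
        kw = keyword.lower()
        return sorted(e for e, t in self.text.items() if kw in t)

    def explore(self, entity):
        """Return the (relation, neighbor) pairs adjacent to an entity."""
        return sorted(self.edges.get(entity, set()))


def build_example_graph():
    """Build a tiny graph with made-up entities (assumed sample data)."""
    g = BugKnowledgeGraph()
    g.add_entity("bug-101", "NullPointerException when saving file")
    g.add_entity("commit-a1", "Fix null check in save handler")
    g.add_entity("alice", "developer")
    g.add_entity("bob", "developer")
    g.add_relation("alice", "reports", "bug-101")
    g.add_relation("commit-a1", "fixes", "bug-101")
    g.add_relation("bob", "authors", "commit-a1")
    return g


if __name__ == "__main__":
    graph = build_example_graph()
    for hit in graph.search("nullpointer"):
        print(hit, graph.explore(hit))
```

Starting from a search hit such as a bug report, a developer can call `explore` repeatedly on the returned neighbors to move from the bug to its fixing commit and then to the committer, which is the kind of step-by-step navigation that exploratory search is meant to support.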