1.中南大学湘雅三医院,湖南长沙 410013
2.湖南中医药大学信息科学与工程学院,湖南长沙 410208
3.医学信息研究湖南省普通高等学校重点实验室(中南大学),湖南长沙 410006
4.湖南信息职业技术学院软件学院,湖南长沙 410200
[ "李 鹏 男,1983年11月出生,湖南泸溪人.博士、讲师,中南大学公共卫生与预防医学博士后流动站在站博士后.主要研究方向为生物信息学、机器学习、中医药大数据. E-mail:lpchs617@csu.edu.cn" ]
[ "闵 慧 女,1986年12月出生,湖南湘潭人.硕士、讲师,主要研究方向为生物信息学、网络优化. E-mail:mh1220@126.com" ]
[ "罗爱静(通信作者) 女,1962年出生,湖南安乡人.博士、教授、博士生导师,主要研究方向为医药信息管理、卫生信息管理、医药信息检索. E-mail:805372510@qq.com" ]
收稿:2020-04-13,
修回:2020-08-27,
纸质出版:2021-08-25
移动端阅览
李鹏,闵慧,罗爱静.基于动态图的PPI网络构建和复合物挖掘算法研究[J].电子学报,2021,49(08):1489-1497.
LI Peng,MIN Hui,LUO Ai-jing.Research on PPI Network Construction and Complex Mining Algorithm Based on Dynamic Graph[J].ACTA ELECTRONICA SINICA,2021,49(08):1489-1497.
李鹏,闵慧,罗爱静.基于动态图的PPI网络构建和复合物挖掘算法研究[J].电子学报,2021,49(08):1489-1497. DOI: 10.12263/DZXB.20200357.
LI Peng,MIN Hui,LUO Ai-jing.Research on PPI Network Construction and Complex Mining Algorithm Based on Dynamic Graph[J].ACTA ELECTRONICA SINICA,2021,49(08):1489-1497. DOI: 10.12263/DZXB.20200357.
动态蛋白质网络的构建和复合物挖掘问题是目前研究的热点.针对现有的算法在解决前述问题上的不足,文中考虑了蛋白质的活性周期和连接强度,首先提出了一种基于动态图的蛋白质网络构建算法.然后基于密度聚类设计了一种在动态蛋白质网络上挖掘复合物的算法(PCMA).整个挖掘过程包含三个步骤:基于DBSCAN(Density-Based Spatial Clustering of Applications with Noise)算法的蛋白质复合物生成;基于合并增益的蛋白质复合物合并和基于归属度的复合物调整.在多个公开的生物数据集上进行了实验,实验结果表明,所提算法在查全率、查准率和F-measure方面的性能都要优于现有的算法,且对输入参数不敏感.在保证蛋白质复合物挖掘准确性的前提下,算法的时间复杂度处于一个合理的范围之内.
Dynamic protein network construction and complex mining problem is a hot topic. In view of the shortcomings of existing algorithms in solving the above problems
a protein network construction algorithm based on dynamic graph is firstly proposed by considering the active period and the connection strength of proteins in this paper. Then
a protein complex mining algorithm(PCMA) on dynamic protein network is designed based on the density clustering. The whole mining process consists of three steps: the generation of protein complex based on DBSCAN(density-based spatial clustering of applications with noise) algorithm; the combination of protein complex based on the combination gain and the adjustment of protein complex based on the degree of membership. Experiments are carried out on several open biological datasets. The experimental results show that the performance of the proposed algorithm is better than that of the existing algorithms in terms of recall
precision and F-measure
and it is not sensitive to the input parameters. On the premise of ensuring the accuracy of protein complex mining
the time complexity of the proposed algorithm is in a reasonable range.
Larance M , Lamond A I . Multidimensional proteomics for cell biology [J]. Nature Reviews Molecular Cell Biology , 2015 , 16 ( 5 ): 269 - 280 .
Yang X , Coulombe-Huntington J , Kang S , et al . Widespread expansion of protein interaction capabilities by alternative splicing [J]. Cell , 2016 , 164 ( 4 ): 805 - 817 .
Lei X , Wang F , Wu F X , et al . Protein complex identification through Markov clustering with firefly algorithm on dynamic protein-protein interaction networks [J]. Information Sciences , 2016 , 329 : 303 - 316 .
Nepusz T , Yu H , Paccanaro A . Detecting overlapping protein complexes in protein-protein interaction networks [J]. Nature Methods , 2012 , 9 ( 5 ): 471 - 472 .
李敏 , 王晓桐 , 罗慧敏 , 等 . 随机游走技术在网络生物学中的研究进展 [J]. 电子学报 , 2018 , 46 ( 8 ): 2035 - 2048 .
Li M , Wang X T , Luo H M , et al . Progress on random walk and its application in network biology [J]. Acta Electronica Sinica , 2018 , 46 ( 8 ): 2035 - 2048 . (in Chinese)
张媛 , 贾克斌 , 张爱东 . 基于多视图融合的蛋白质功能模块检测方法 [J]. 电子学报 , 2014 , 42 ( 12 ): 2337 - 2344 .
Zhang Y , Jia K B , Zhang A D . Consistent protein functional module detection from multi-view of biological data [J]. Acta Electronica Sinica , 2014 , 42 ( 12 ): 2337 - 2344 . (in Chinese)
Hegele A , Kamburov A , Grossmann A , et al . Dynamic protein-protein interaction wiring of the human spliceosome [J]. Molecular Cell , 2012 , 45 ( 4 ): 567 - 580 .
Tang X W , Wang J X , Liu B B , et al . A comparison of the functional modules identified from time course and static PPI network data [J]. BMC Bioinformatics , 2011 , 12 ( 1 ): 339 .1-339. 15 .
雷秀娟 , 高银 , 郭玲 . 基于拓扑势加权的动态PPI网络复合物挖掘方法 [J]. 电子学报 , 2018 , 46 ( 1 ): 145 - 151 .
Lei X J , Gao Y , Guo L . Mining protein complexes based on topology potential weight in dynamic protein-protein interaction networks [J]. Acta Electronica Sinica , 2018 , 46 ( 1 ): 145 - 151 . (in Chinese)
Shen X , Yi L , Jiang X , et al . Neighbor affinity based algorithm for discovering temporal protein complex from dynamic PPI network [J]. Methods , 2016 , 110 : 90 - 96 .
Lei X J , Fang M , Guo L , et al . Protein complex detection based on flower pollination mechanism in multi-relation reconstructed dynamic protein networks [J]. BMC Bioinformatics , 2019 , 20 ( 3 ): 63 - 74 .
Elfwing S , Uchibe E , Dova K . Sigmoid-weighted linear units for neural network function approximation in reinforcement learning [J]. Neural Networks , 2018 , 107 : 3 - 11 .
Shen J , Hao X , Liang Z , et al . Real-time superpixel segmentation by DBSCAN clustering algorithm [J]. IEEE Transactions on Image Processing , 2016 , 25 ( 12 ): 5933 - 5942 .
乔少杰 , 郭俊 , 韩楠 , 等 . 大规模复杂网络社区并行发现算法 [J]. 计算机学报 , 2017 , 40 ( 3 ): 687 - 700 .
Qiao S J , Guo J , Han N , et al . Parallel algorithm for discovering communities in large-scale complex networks [J]. Chinese Journal of Computers , 2017 , 40 ( 3 ): 687 - 700 . (in Chinese)
Lei H , Wen Y , You Z , et al . Protein-protein interactions prediction via multimodal deep polynomial network and regularized extreme learning machine [J]. IEEE Journal of Biomedical and Health Informatics , 2019 , 23 ( 3 ): 1290 - 1303 .
Ruan P Y , Hayashida M , Akutsu T , et al . Improving prediction of heterodimeric protein complexes using combination with pairwise kernel [J]. BMC Bioinformatics , 2018 , 19 ( 1 ): 73 - 84 .
Pellegrini M , Baglioni M , Geraci F . Protein complex prediction for large protein interaction networks with the Core&Peel method [J]. BMC Bioinformatics , 2016 , 17 ( 12 ): 372 .1-372. 30 .
Lei H , Wen Y , You Z , et al . Protein-protein interactions prediction via multimodal deep polynomial network and regularized extreme learning machine [J]. IEEE Journal of Biomedical and Health Informatics , 2018 , 23 ( 3 ): 1290 - 1303 .
0
浏览量
10
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621