1.哈尔滨工程大学计算机科学与技术学院,黑龙江哈尔滨 150001
2.黑龙江大学计算机科学与技术学院,黑龙江哈尔滨 150080
3.哈尔滨工程大学水声工程学院,黑龙江哈尔滨 150001
[ "初妍 女,1979年出生,黑龙江哈尔滨人.哈尔滨工程大学计算机科学与技术学院副教授.主要研究方向为机器学习、推荐系统.E-mail: chuyan@hrbeu.edu.cn" ]
[ "张薇(通讯作者) 女,1992年出生,黑龙江牡丹江人.黑龙江大学计算机科学技术学院副教授.主要研究方向为网络表征学习、基因预测." ]
收稿:2021-09-01,
修回:2022-08-24,
纸质出版:2023-10-25
移动端阅览
初妍,戚书豪,张薇等.基于网络嵌入的癌症性状hub基因发现[J].电子学报,2023,51(10):2866-2873.
CHU Yan,QI Shu-hao,ZHANG Wei,et al.Hub Gene Discovery of Cancer Traits Based on Network Embedding[J].ACTA ELECTRONICA SINICA,2023,51(10):2866-2873.
初妍,戚书豪,张薇等.基于网络嵌入的癌症性状hub基因发现[J].电子学报,2023,51(10):2866-2873. DOI: 10.12263/DZXB.20211197.
CHU Yan,QI Shu-hao,ZHANG Wei,et al.Hub Gene Discovery of Cancer Traits Based on Network Embedding[J].ACTA ELECTRONICA SINICA,2023,51(10):2866-2873. DOI: 10.12263/DZXB.20211197.
研究影响癌症性状的hub基因时存在如下问题:仅关注强相关性基因进行基因信息处理,缺少对弱相关性基因和不同基因模块间共表达性的研究;仅采用度中心性判断hub基因进行分析基因网络,对蕴含数据挖掘不够全面.本文提出基因模块标签信息游走的图嵌入算法Gene2vec.选取合适软阈值,保留更多弱相关性的基因信息.联合不同种类但与性状高度正相关性的基因模块,构成基因模块共表达网络.针对传统加权基因共表达网络分析方法与图嵌入方法挖掘基因模块网络信息存在的问题,利用标签参数与其他参数调节基因模块网络中的随机游走过程,分析游走生成的节点序列以挖掘基因网络的信息.实验表明,Gene2vec在hub基因的检出率上优于其他算法,得到的hub基因在癌症性状中的基因表达量高于常用生物学方法得到的hub基因.
The research on hub genes affecting cancer traits has such problems: only focusing on strong correlation genes for gene information processing
lack of the co-expression of weak correlation genes and different gene modules; only using degree centrality to judge hub genes to analyze gene network
not comprehensive enough for implicit data mining. This paper proposes the graph embedding algorithm Gene2vec based on information walk with gene module label. The appropriate soft threshold to retain more weakly correlated gene information is selected. The gene module co-expression network is formed by combining different kinds of gene modules with high positive correlation traits. Aiming to solve the problems of mining gene module network information by traditional weighted gene co-expression network analysis method and graph embedding method
the paper adjust the random walk process in the gene module network by label parameters and other parameters and analyze the node sequence generated random walk to mine the gene network information. Experiments show that Gene2vec is better than other algorithms in the hub gene's detection rate
and the hub gene expression in cancer traits is higher than that of the hub gene obtained by common biological methods.
高攀 . 关于乳腺癌的基因共表达网络分析及药物预测 [D]. 大连 : 大连海事大学 , 2018 .
GAO P . Gene Co-Expression Network Analysis and Drug Prediction for Breast Cancer [D]. Dalian : Dalian Maritime University , 2018 . (in Chinese)
YIN X , WANG P , YANG T S , et al . Identification of key modules and genes associated with breast cancer prognosis using WGCNA and ceRNA network analysis [J]. Aging , 2020 , 13 ( 2 ): 2519 - 2538 .
祁志卫 , 王笳辉 , 岳昆 , 等 . 图嵌入方法与应用: 研究综述 [J]. 电子学报 , 2020 , 48 ( 4 ): 808 - 818 .
QI Z W , WANG J H , YUE K , et al . Methods and applications of graph embedding: A survey [J]. Acta Electronica Sinica , 2020 , 48 ( 4 ): 808 - 818 . (in Chinese)
GOYAL P , FERRARA E . Graph embedding techniques, applications, and performance: A survey [J]. Knowledge-Based Systems , 2018 , 151 : 78 - 94 .
MENG L Q , MASUDA N . Analysis of node2vec random walks on networks [J]. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences , 2020 , 476 ( 2243 ): 1 - 25 .
谢勇军 . 水稻非生物逆境差异表达基因分析及共表达网络构建 [D]. 武汉 : 华中农业大学 , 2018 .
XIE Y J . Analysis of Differentially Expressed Genes and Construction Of Gene Co-Expression Network in Rice Under Abiotic Stresses [D]. Wuhan : Huazhong Agricultural University , 2018 . (in Chinese)
YUAN Q H , ZHOU Q , REN J E , et al . WGCNA identification of TLR7 as a novel diagnostic biomarker, progression and prognostic indicator, and immunotherapeutic target for stomach adenocarcinoma [J]. Cancer Medicine , 2021 , 10 ( 12 ): 4004 - 4016 .
0
浏览量
14
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621