1. 合肥工业大学计算机与信息学院,安徽,合肥,230009
2. 佛蒙特大学计算机科学系, 05,伯灵顿,美国,VT054
3. 合肥工业大学计算机与信息学院,安徽,合肥,230009
4. 佛蒙特大学计算机科学系 05,伯灵顿,美国,VT054
网络出版:2016-09-25,
纸质出版:2016
移动端阅览
吴信东, 赵银凤, 李磊. 基于种子节点选择的网络环境下多标签分类算法研究[J]. 电子学报, 2016,44(9):2074-2080.
WU Xin-dong, ZHAO Yin-feng, LI Lei. Multi-label Classification in Network Environments via Seed Node Selection[J]. Acta Electronica Sinica, 2016, 44(9): 2074-2080.
吴信东, 赵银凤, 李磊. 基于种子节点选择的网络环境下多标签分类算法研究[J]. 电子学报, 2016,44(9):2074-2080. DOI: 10.3969/j.issn.0372-2112.2016.09.008.
WU Xin-dong, ZHAO Yin-feng, LI Lei. Multi-label Classification in Network Environments via Seed Node Selection[J]. Acta Electronica Sinica, 2016, 44(9): 2074-2080. DOI: 10.3969/j.issn.0372-2112.2016.09.008.
多标签分类在基因分类,药物发现和文本分类等实际问题中有着广泛的应用.已存在的多标签分类算法,通常都是从网络中随机的选取节点作为训练集.然而,在分类算法执行的过程中,网络中不同节点所起的作用不同.在给定训练集数目的情况下,选择的训练集不同,分类精度也会不同.所以我们引入了种子节点的概念,标签分类从种子节点开始,经过不断推理,得到网络中其他所有节点的标签.本文提出了SHDA(Nodes Selection of High Degree from Each Affiliation)算法,即从网络的每个社团中,按比例的选取度数较大的节点,然后将其合并,处理后得到种子节点.真实数据集上的实验表明,将种子节点用作训练集进行多标签分类,能够提升网络环境下多标签分类的准确率.
Multi-label classification is widely used in genetic classification
drug discovery and text classification.The existing multi-label classification algorithms usually select nodes randomly from the network as their training set.However
during multi-label classification
different nodes have different effects.Given the number of nodes in the training set
a different training sub-set can lead to different classification accuracy.Hence
we introduce the concept of seed nodes
the classification procedure starts from the seed nodes
and after continuous reasoning
the labels of other nodes are inferred in the network.We propose an SHDA algorithm (Nodes Selection of High Degree from Each Affiliation) in which the nodes of high degrees from each affiliation belonging to the network are selected and merged
and after processing
the seed nodes are obtained.Experiments on several real-world datasets demonstrate that taking seed nodes as the training set to classify multi-labeled data can improve the classification performance.
0
浏览量
1063
下载量
3
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621