电子学报 ›› 2017, Vol. 45 ›› Issue (11): 2779-2786.DOI: 10.3969/j.issn.0372-2112.2017.11.027

• 学术论文 • 上一篇    下一篇

带拒绝域的ECOC多类分类

雷蕾1, 王晓丹1, 罗玺2, 王玮3   

  1. 1. 空军工程大学防空反导学院, 陕西西安 710051;
    2. 空军工程大学信息与导航学院陕西西安 710077;
    3. 空军大连通信士官学校基础部, 辽宁大连 116600
  • 收稿日期:2016-05-24 修回日期:2016-10-10 出版日期:2017-11-25
    • 作者简介:
    • 雷蕾,女,1988年生于四川南充,博士生.研究方向为智能信息处理和目标识别.E-mail:wendyandpaopao@163.com;王晓丹,女,1966年生于陕西汉中,教授,博士.研究方向为模式识别,机器学习等.E-mail:afeu_wang@163.com;罗玺,男,1988年生,硕士,讲师.研究方向为智能信息处理.E-mail:luoxi19887302@126.com;王玮,男,1985年,讲师.研究方向为计算机应用.
    • 基金资助:
    • 国家自然科学基金 (No.61273275,No.61503407)

Design of Reject Option for Multi-classification Based on ECOC

LEI Lei1, WANG Xiao-dan1, LUO Xi2, WANG Wei3   

  1. 1. The Air and Missile Defense Institute, Air Force Engineering University, Xi'an, Shaanxi 710051, China;
    2. The Information and Navigation Institute, Air Force Engineering University, Xi'an, Shaanxi 710077, China;
    3. The Fundamental Department, Dalian Air Force Communication NCO Academy, Dalian, Liaoning 116600, China
  • Received:2016-05-24 Revised:2016-10-10 Online:2017-11-25 Published:2017-11-25
    • Supported by:
    • National Natural Science Foundation of China (No.61273275, No.61503407)

摘要: 针对纠错输出编码分解框架的自身特点、从降低误判风险出发,研究了带拒绝域的ECOC多类分类方法.首先在二类划分过程中引入拒绝域,对不属于正负子类的待识别样本进行拒识;其次,在基分类器内部引入拒绝域,以最小化风险贝叶斯决策为目标,利用后验概率输出和代价矩阵寻找拒绝域阈值,对样本输出值落入拒绝域中的样本进行拒识;最后,研究了不同拒绝域输出的解码方法,并讨论了拒识码字个数和矩阵最小Hamming距离之间的关系.实验结果表明基于二类划分构造的拒绝域能够提高分类正确率,而基于基分类器构造的拒绝域能够减小分类代价.

关键词: 多类分类, 纠错输出编码, 拒绝域, 支持向量数据描述, 贝叶斯决策

Abstract: Aiming at reducing misclassification costs,this paper studies the design of reject options for ECOC multi-classification based on its properties.The first level of reject option is constructed in the process of bipartitions to recognize an instance whose real labels does not belong to the meta-subclasses.Meanwhile,the second reject rule is presented in the dichotomizers based on posterior probabilities and cost matrix to make the minimum-risk Bayesian decision.Finally,different decoding strategies are analysed according to different reject output.The relationship between the number of rejected positions and the minimum Hamming distance of matrix is discussed.The two-stage reject rule makes the ECOC multi-classification with rejection come true and reduce the misclassification error and costs.

Key words: multi-classification, error-correcting output codes, rejection option, support vector domain description, Bayesian decision

中图分类号: