1.武汉大学电子信息学院,湖北武汉 430072
2.广东工业大学计算机学院,广东广州 510006
[ "肖进胜 男,1975年7月出生,湖北武汉人.博士,武汉大学电子信息学院副教授,主要研究方向为视频图像处理、计算机视觉.E-mail: xiaojs@whu.edu.cn" ]
[ "张舒豪 男,1995年9月出生,河南项城人.武汉大学电子信息学院硕士生,主要研究方向为遥感影像目标检测.E-mail: zhangsh@whu.edu.cn" ]
[ "陈云华(通讯作者) 女,1977年3月出生,湖北仙桃人.博士,广东工业大学计算机学院副教授,主要研究方向为神经形态类脑计算、计算机视觉.E-mail: yhchen@gdut.edu.cn" ]
收稿:2021-03-14,
修回:2022-01-06,
纸质出版:2022-02-25
移动端阅览
肖进胜,张舒豪,陈云华等.双向特征融合与特征选择的遥感影像目标检测[J].电子学报,2022,50(02):267-272.
XIAO Jin-sheng,ZHANG Shu-hao,CHEN Yun-hua,et al.Remote Sensing Image Object Detection Based on Bidirectional Feature Fusion and Feature Selection[J].ACTA ELECTRONICA SINICA,2022,50(02):267-272.
肖进胜,张舒豪,陈云华等.双向特征融合与特征选择的遥感影像目标检测[J].电子学报,2022,50(02):267-272. DOI: 10.12263/DZXB.20210354.
XIAO Jin-sheng,ZHANG Shu-hao,CHEN Yun-hua,et al.Remote Sensing Image Object Detection Based on Bidirectional Feature Fusion and Feature Selection[J].ACTA ELECTRONICA SINICA,2022,50(02):267-272. DOI: 10.12263/DZXB.20210354.
遥感影像中复杂的背景占据图像的大部分区域,严重影响了目标检测效果.本文提出一种可以对特征图进行多特征选择的目标检测网络.设计了双向多尺度特征融合网络,融合深浅层信息,提高复杂背景下小目标的检测效果,在保留常规特征金字塔自上而下路径的同时,增加一条自下而上的路径,减少浅层特征传递到顶层经历的网络层数,从而控制浅层特征损失.为了降低多尺度特征图中无用信息对后续检测网络的干扰,设计了基于注意力机制的多特征选择模块,网络自适应地专注于有用特征,忽略无用特征.针对传统五参数回归法在预测角度时存在严重的边界不连续问题,不能精确预测长宽比值比较大的目标,将角度预测当作分类任务处理.在DOTA数据集和自制数据集DOTA-GF上进行实验,6类典型目标的mAP分别达到0.651和0.641,与主流目标检测算法的对比实验结果表明提出的方法的有效性.
In remote sensing image object detection
the complex background always occupies a large area of the entire image
which seriously affects the object detection effect. This paper proposes an object detection network that can perform multiple feature fusion and selection on feature maps. A feature fusion network is used to fuse deep and shallow features to improve the detection effect of small objects in complex background. While retaining the up-bottom path of the feature fusion network
it adds a bottom-up path to diminish the number of network layers that the shallow features need to pass on to the top layer
thereby reducing the loss of shallow features. In order to reduce the interference of useless information in the fusion feature maps with detection network
a multiple feature selection module is designed. The attention mechanism in the multiple feature selection module enables the network to adaptively focus on more important features
ignore useless features. Since the conventional five-parameter regression method has serious boundary problems
the angle prediction is often inaccurate for objects with a large aspect ratio
to solve this problem
the proposed method treats angle prediction as a classification task. The mAP of our method on DOTA and self-made dataset DOTA-GF reaches 0.651 and 0.641
and the comparative experiments with mainstream object detection methods demonstrate the effectiveness of the proposed method.
裴伟 , 许晏铭 , 朱永英 , 等 . 改进的SSD航拍目标检测方法 [J]. 软件学报 , 2019 , 30 ( 3 ): 738 - 758 .
PEI Wei , XU Yan-ming , ZHU Yong-ying , et al . The target detection method of aerial photography images with improved SSD [J]. Journal of Software , 2019 , 30 ( 3 ): 738 - 758 . (in Chinese)
刘小波 , 刘鹏 , 等 . 基于深度学习的光学遥感图像目标检测研究进展 [J]. 自动化学报 , 2021 , 47 ( 9 ): 2078 - 2089 .
LIU Xiao-bo , LIU Peng , et al . Research progress of optical remote sensing image object detection based on deep learning [J]. Acta Automatica Sinica , 2021 , 47 ( 9 ): 2078 - 2089 . (in Chinese)
肖进胜 , 周景龙 , 雷俊锋 , 等 . 基于霾层学习的单幅图像去雾算法 [J]. 电子学报 , 2019 , 47 ( 10 ): 2142 - 2148 .
XIAO Jin-sheng , ZHOU Jin-long , LEI Jun-feng , et al . Single image dehazing algorithm based on the learning of hazy layers [J]. Acta Electronica Sinica , 2019 , 47 ( 10 ): 2142 - 2148 . (in Chinese)
XIAO Jin-sheng , ZHANG Shu-hao , DAI Yuan , et al . A multiclass object detection in UAV images based on rotation region network [J]. IEEE Journal on Miniaturization for Air and Space Systems , 2020 , 1 ( 3 ): 188 - 196 .
MA Jian-qi , SHAO Wei-yuan , YE Hao , et al . Arbitrary-oriented scene text detection via rotation proposals [J]. IEEE Transactions on Multimedia , 2018 , 20 ( 11 ): 3111 - 3122 .
LIN T Y , DOLLAR P , GIRSHICK R , et al . Feature pyramid networks for object detection [C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR) . Piscataway : IEEE , 2017 : 2117 - 2125 .
戴媛 , 易本顺 , 肖进胜 , 等 . 基于改进旋转区域生成网络的遥感图像目标检测 [J]. 光学学报 , 2020 , 40 ( 1 ): 270 - 280 .
DAI Yuan , YI Ben-shun , XIAO Jin-sheng , et al . Object detection of remote sensing image based on improved rotation region proposal network [J]. Acta Optica Sinica , 2020 , 40 ( 1 ): 270 - 280 . (in Chinese)
YANG Xue , YAN Chun-li , Feng Zi-ming , et al . R3Det: refined single-stage detector with feature refinement for rotating object [C]// 35th AAAI Conference on Artificial Intelligence . Menlo Park : AAAI , 2021 : 3163 - 3171 .
YANG Xue , YANG Ji-rui , YAN Jun-chi , et al . SCRDet: Towards more robust detection for small, cluttered and rotated objects [C]// 2019 IEEE/CVF International Conference on Computer Vision(ICCV) . Piscataway : IEEE , 2019 .
XIA Gui-song , BAI Xiang , DING Jian , et al . DOTA: A large-scale dataset for object detection in aerial images [C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2018 : 3974 - 3983 .
HE Kai-ming , ZHANG Xiang-yu , REN Shao-qing , et al . Deep residual learning for image recognition [C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2016 : 770 - 778 .
REN Shao-qing , HE Kai-ming , et al . Faster r-cnn: towards real-time object detection with region proposal networks [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2017 , 39 ( 6 ): 1137 - 1149 .
LIN T Y , GOYAL P , GIRSHICK R , et al . Focal loss for dense object detection [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2020 , 42 ( 2 ): 318 - 327 .
LIU S , QI L , QIN H F , et al . Path aggregation network for instance segmentation [C]// Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition . Washington : IEEE Computer Society , 2018 : 8759 - 8768 .
YANG Xue , YAN Junchi . Arbitrary-oriented object detection with circular smooth label [C]// European Conference on Computer Vision(ECCV) . Berlin : Springer , 2020 : 677 - 694 .
0
浏览量
10
下载量
11
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621