

浏览全部资源
扫码关注微信
湖北工业大学计算机学院,湖北武汉 430068
Received:21 August 2020,
Revised:2021-11-05,
Published:25 July 2022
移动端阅览
李超,黄新宇,王凯.基于特征融合和自学习锚框的高分辨率图像小目标检测算法[J].电子学报,2022,50(07):1684-1695.
LI Chao,HUANG Xin-yu,WANG Kai.Small Object Detection of High-Resolution Images Based on Feature Fusion and Learnable Anchor[J].ACTA ELECTRONICA SINICA,2022,50(07):1684-1695.
李超,黄新宇,王凯.基于特征融合和自学习锚框的高分辨率图像小目标检测算法[J].电子学报,2022,50(07):1684-1695. DOI: 10.12263/DZXB.20200917.
LI Chao,HUANG Xin-yu,WANG Kai.Small Object Detection of High-Resolution Images Based on Feature Fusion and Learnable Anchor[J].ACTA ELECTRONICA SINICA,2022,50(07):1684-1695. DOI: 10.12263/DZXB.20200917.
为了提高高分辨率图像中小目标的检测精度,解决高分辨率图像在下采样和局部裁切时由于细节和背景信息丢失造成的漏检和误检问题,本文提出了一种基于特征融合和自学习锚框的小目标检测算法.算法采用多路分支网络对高分辨率图像的全局语义和细节特征平滑后逐层融合,以同时增强特征图上小目标的细节和背景特征.针对训练样本尺寸差异造成不同分支网络上特征表达不一致的问题,本文引入自学习锚框使融合后的特征图能够适应锚框的位置和形状.使用本文算法与目前先进的目标检测算法对下采样图像和切块检测,大量实验结果验证了本文算法对高分辨率图像小目标检测的准确性和有效性.
Small object detection of high-resolution images presents significant challenges. To solve the problem that downsampling and cropping of high-resolution images result in missed detections and false detections due to the loss of fine details and contextual information
an algorithm based on feature fusion and learnable anchor is proposed for small object detection of high-resolution images. Contextual and detailed features are extracted from downsampled images and cropped patches respectively
which are then fused layer-wise. The fused features are further combined with smoothed features to strengthen both fine details and contextual information. To mitigate the feature inconsistency
learnable anchor is applied to make the fused features accommodative to the location and shape of anchors. The proposed method is tested from the perspective of global inference and local inference compared to state-of-the-art detectors. The experimental results show the accuracy and effectiveness of the proposed method.
KISANTAL M , WOJNA Z , MURAWSKI J , et al . Augmentation for small object detection [C]// 9th International Conference on Advances in Computing and Information Technology(ACITY 2019) . Sydney : Aircc Publishing Corporation , 2019 : 119 - 133 .
刘颖 , 刘红燕 , 范九伦 , 等 . 基于深度学习的小目标检测研究与应用综述 [J]. 电子学报 , 2020 , 48 ( 3 ): 590 - 601 .
LIU Y , LIU H Y , FAN J L , et al . A survey of research and application of small object detection based on deep learning [J]. Acta Electronica Sinica , 2020 , 48 ( 3 ): 590 - 601 . (in Chinese)
LIN T Y , MAIRE M , BELONGIE S , et al . Microsoft COCO: Common Objects in Context [C]// European Conference on Computer Vision . Cham : Springer , 2014 : 740 - 755 .
BODLA N , SINGH B , CHELLAPPA R , et al . Soft-NMS- Improving object detection with one line of code [C]// 2017 IEEE International Conference on Computer Vision . Venice : IEEE , 2017 : 5562 - 5570 .
李宝奇 , 贺昱曜 , 强伟 , 等 . 基于并行附加特征提取网络的SSD地面小目标检测模型 [J]. 电子学报 , 2020 , 48 ( 1 ): 84 - 91 .
LI B Q , HE Y Y , QIANG W , et al . SSD with parallel additional feature extraction network for ground small target detection [J]. Acta Electronica Sinica , 2020 , 48 ( 1 ): 84 - 91 . (in Chinese)
HE K M , ZHANG X Y , REN S Q , et al . Deep residual learning for image recognition [C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition . Las Vegas : IEEE , 2016 : 770 - 778 .
LIN T Y , DOLLÁR P , GIRSHICK R , et al . Feature pyramid networks for object detection [C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition . Hawaii : IEEE , 2017 : 936 - 944 .
PANG J M , CHEN K , SHI J P , et al . Libra R-CNN: Towards balanced learning for object detection [C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Long beach : IEEE , 2019 : 821 - 830 .
裴伟 , 许晏铭 , 朱永英 , 等 . 改进的SSD航拍目标检测方法 [J]. 软件学报 , 2019 , 30 ( 3 ): 738 - 758 .
PEI W , XU Y M , ZHU Y Y , et al . The target detection method of aerial photography images with improved SSD [J]. Journal of Software , 2019 , 30 ( 3 ): 738 - 758 . (in Chinese)
黄继鹏 , 史颖欢 , 高阳 . 面向小目标的多尺度Faster-RCNN检测算法 [J]. 计算机研究与发展 , 2019 , 56 ( 2 ): 319 - 327 .
HUANG J P , SHI Y H , GAO Y . Multi-scale faster-RCNN algorithm for small object detection [J]. Journal of Computer Research and Development , 2019 , 56 ( 2 ): 319 - 327 . (in Chinese)
LIU S , QI L , QIN H F , et al . Path aggregation network for instance segmentation [C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Salt Lake City : IEEE , 2018 : 8759 - 8768 .
CHEN W Y , JIANG Z Y , WANG Z Y , et al . Collaborative global-local networks for memory-efficient segmentation of ultra-high resolution images [C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Long beach : IEEE , 2019 : 8916 - 8925 .
SUN K , XIAO B , LIU D , et al . Deep high-resolution representation learning for human pose estimation [C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Long beach : IEEE , 2019 : 5686 - 5696 .
CHEN K , PANG J M , WANG J Q , et al . Hybrid task cascade for instance segmentation [C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Long beach : IEEE , 2019 : 4969 - 4978 .
REN S Q , HE K M , GIRSHICK R , et al . Faster R-CNN: Towards real-time object detection with region proposal networks [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2017 , 39 ( 6 ): 1137 - 1149 .
OKSUZ K , CAM B C , KALKAN S , et al . Imbalance problems in object detection: A review [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2021 , 43 ( 10 ): 3388 - 3415 .
CAI Z W , VASCONCELOS N . Cascade R-CNN: Delving into high quality object detection [C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Salt Lake City : IEEE , 2018 : 6154 - 6162 .
ZHANG H K , CHANG H , MA B P , et al . Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training [M]// Computer Vision-ECCV 2020 . Cham : Springer International Publishing , 2020 : 260 - 275 .
WANG J Q , ZHANG W W , CAO Y H , et al . Side-aware boundary localization for more precise object detection [C]// European Conference on Computer Vision . Cham : Springer , 2020 : 403 - 419 .
WANG J Q , CHEN K , YANG S , et al . Region proposal by guided anchoring [C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Long beach : IEEE , 2019 : 2960 - 2969 .
YANG F , FAN H , CHU P , et al . Clustered object detection in aerial images [C]// 2019 IEEE/CVF International Conference on Computer Vision(ICCV) . Seoul : IEEE , 2019 : 8310 - 8319 .
LIN M , CHEN Q , YAN S . Network In Network [EB/OL]. ( 2014 )[2020]. https://arxiv.org/abs/1312.4400 https://arxiv.org/abs/1312.4400 .
OUYANG W L , WANG X G , ZHANG C , et al . Factors in finetuning deep model for object detection with long-tail distribution [C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition . Las Vegas : IEEE , 2016 : 864 - 873 .
DARIUS L , KUZMA R , MCGEE K , et al . xView: Objects in Context in Overhead Imagery [EB/OL]. ( 2018 )[2020]. https://arxiv.org/abs/1802.07856 https://arxiv.org/abs/1802.07856 .
CHEN K , WANG J , PANG J , et al . MMDetection: Open MMLab Detection Toolbox and Benchmark [EB/OL]. ( 2019 )[2020]. https://arxiv.org/abs/1906.07155v1 https://arxiv.org/abs/1906.07155v1 .
MOLCHANOV P , TYREE S , KARRAS T , et al . Pruning Convolutional Neural Networks for Resource Efficient Inference [C]// Proceedings of the 5th International Conference on Learning Representations(ICLR2017) . Toulon : ICLR , 2017 : 1 - 17 .
OKSUZ K , CAM B C , AKBAS E , et al . Generating positive bounding boxes for balanced training of object detectors [C]// 2020 IEEE Winter Conference on Applications of Computer Vision . Snowmass Village : IEEE , 2020 : 883 - 892 .
FU Z H , CHEN Y W , YONG H W , et al . Foreground gating and background refining network for surveillance object detection [J]. IEEE Transactions on Image Processing , 2019 , 28 ( 12 ): 6077 - 6090 .
HE Y H , ZHU C C , WANG J R , et al . Bounding box regression with uncertainty for accurate object detection [C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Long beach : IEEE , 2019 : 2883 - 2892 .
CHEN K A , LI J G , LIN W Y , et al . Towards accurate one-stage object detection with AP-loss [C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Long beach : IEEE , 2019 : 5114 - 5122 .
HUANG X , GE Z , JIE Z Q , et al . NMS by representative region: Towards crowded pedestrian detection by proposal pairing [C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Seattle : IEEE , 2020 : 10747 - 10756 .
REZATOFIGHI H , TSOI N , GWAK J , et al . Generalized intersection over union: A metric and a loss for bounding box regression [C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Long beach : IEEE , 2019 : 658 - 666 .
0
Views
10
下载量
4
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621