Small Object Detection of High-Resolution Images Based on Feature Fusion and Learnable Anchor

LI Chao; HUANG Xin-yu; WANG Kai

doi:10.12263/DZXB.20200917


PAPERS | Updated: 2025-12-08

- ACTA ELECTRONICA SINICA Vol. 50, Issue 7, Pages: 1684-1695 (2022)
- Author affiliation: School of Computer Science, Hubei University of Technology, Wuhan, Hubei 430068, China
- CLC: TP391
- Received: 21 August 2020; Revised: 5 November 2021; Published: 25 July 2022
LI Chao, HUANG Xin-yu, WANG Kai. Small Object Detection of High-Resolution Images Based on Feature Fusion and Learnable Anchor [J]. ACTA ELECTRONICA SINICA, 2022, 50(07): 1684-1695. DOI: 10.12263/DZXB.20200917.

Abstract

Small object detection in high-resolution images presents significant challenges. Downsampling and cropping such images discard fine details and contextual information, which leads to missed and false detections. To address this problem and improve detection accuracy, an algorithm based on feature fusion and learnable anchors is proposed for small object detection in high-resolution images. A multi-branch network extracts contextual (global semantic) features from downsampled images and detailed features from cropped patches, and fuses them layer by layer. The fused features are further combined with smoothed features to strengthen both fine details and contextual information. To mitigate the inconsistency of feature representations across branches caused by differences in training sample size, learnable anchors are applied so that the fused feature maps adapt to the location and shape of the anchors. The proposed method is compared with state-of-the-art detectors under both global inference (on downsampled images) and local inference (on cropped patches). The experimental results confirm the accuracy and effectiveness of the proposed method.
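The two inference settings named in the abstract, global inference on a downsampled image and local inference on cropped patches, can be made concrete with a small sketch. This is not the authors' implementation: the image side (3000 px), target side (800 px), tile size, and overlap are illustrative assumptions, and `downsampled_size` and `tile_windows` are hypothetical helper names introduced here.

```python
from typing import List, Tuple


def downsampled_size(obj_px: float, image_side: int, target_side: int) -> float:
    """Side length of an object after the image is resized from image_side to target_side."""
    return obj_px * target_side / image_side


def tile_windows(width: int, height: int, tile: int, overlap: int) -> List[Tuple[int, int, int, int]]:
    """Return (x0, y0, x1, y1) crops that cover the image.

    Neighbouring crops share at least `overlap` pixels, so an object cut by
    one crop border is likely to appear whole in the adjacent crop.
    """
    if overlap >= tile:
        raise ValueError("overlap must be smaller than tile")
    stride = tile - overlap

    def starts(length: int) -> List[int]:
        # A single window suffices when the image is not larger than a tile.
        if length <= tile:
            return [0]
        pos = list(range(0, length - tile, stride))
        pos.append(length - tile)  # last window sits flush with the border
        return pos

    return [
        (x, y, min(x + tile, width), min(y + tile, height))
        for y in starts(height)
        for x in starts(width)
    ]


if __name__ == "__main__":
    # Global setting: a 32 px object in a 3000 px image resized to 800 px.
    print(downsampled_size(32, 3000, 800))
    # Local setting: 1000 px tiles with a 200 px overlap over the same image.
    print(len(tile_windows(3000, 3000, 1000, 200)))
```

Under these assumed numbers the object shrinks to under 9 px in the downsampled image, which illustrates the loss of fine detail, while each tile keeps the object at full resolution but sees only part of the scene, which illustrates the loss of context.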


