中国人民解放军陆军炮兵防空兵学院计算机教研室, 安徽合肥 230031
[ "琚长瑞 男,1994年9月出生,安徽桐城人,陆军炮兵防空兵学院硕士研究生,主要研究方向为计算机视觉. E-mail: juchangrui1994@163.com" ]
[ "秦晓燕 女,1980年2月出生,安徽淮北人,副教授.主要研究方向为目标检测、机器学习及应用.E-mail:70853559@qq.com" ]
[ "袁广林() 男,1973年生,博士,副教授,主要从事计算机视觉、机器学习及其应用方面的研究. E-mail: 1183212999@qq.com" ]
收稿:2021-04-25,
修回:2021-08-27,
纸质出版:2022-09-25
移动端阅览
琚长瑞,秦晓燕,袁广林等.尺度敏感损失与特征融合的快速小目标检测方法[J].电子学报,2022,50(09):2119-2126.
JU Chang-rui,QIN Xiao-yan,YUAN Guang-lin,et al.Fast Small Object Detection Method with Scale-Sensitivity Loss and Feature Fusion[J].ACTA ELECTRONICA SINICA,2022,50(09):2119-2126.
琚长瑞,秦晓燕,袁广林等.尺度敏感损失与特征融合的快速小目标检测方法[J].电子学报,2022,50(09):2119-2126. DOI: 10.12263/DZXB.20210530.
JU Chang-rui,QIN Xiao-yan,YUAN Guang-lin,et al.Fast Small Object Detection Method with Scale-Sensitivity Loss and Feature Fusion[J].ACTA ELECTRONICA SINICA,2022,50(09):2119-2126. DOI: 10.12263/DZXB.20210530.
现有通用深度学习目标检测方法对中、大目标有着较好的检测精度,而对小目标检测精度较低,主要原因在于小目标训练数据少以及下采样后特征图分辨率过低.针对上述问题,一方面,提出一种尺度敏感损失函数用于分类热图的训练,使小目标能够对模型更新产生更大的影响;另一方面,利用反卷积与可变形卷积提出一种自上而下的特征融合方法,获得高分辨率、强语义的特征图来检测目标.在上述两个方面的基础上,设计一种尺度敏感与特征融合的小目标检测方法.在PASCAL VOC数据集上,对提出方法进行了实验验证,实验结果表明:相比于现有目标检测方法,本文方法在保持较快检测速度的同时,提升了小目标检测的精度.
The existing general deep learning target detection methods have good detection accuracy for medium and large targets
but the detection accuracy for small targets is low
mainly due to few data for small target training and low resolution of feature map after down sampling. To solve the above problems
on the one hand
a scale sensitive loss function is proposed for the training of classified heatmaps
so that small targets can have a greater impact on model updating; on the other hand
a top-down feature fusion method is proposed by using deconvolution and deformable convolution to obtain high-resolution and strong semantic feature map for target detection. On the basis of the above two aspects
a small target detection method based on scale sensitivity and feature fusion is designed. Experimental results on PASCAL VOC dataset show that compared with the existing target detection methods
the proposed method can maintain a faster detection speed and improve the accuracy of small object detection.
WANG X J , ZHANG S L , YU Z R , et al . Scale-equalizing pyramid convolution for object detection [C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Piscataway : IEEE , 2020 : 13356 - 13365 .
GUO C X , FAN B , ZHANG Q , et al . AugFPN: improving multi-scale feature learning for object detection [C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) . Piscataway : IEEE , 2020 : 12592 - 12601 .
TAN M X , PANG R M , LE Q V . EfficientDet: scalable and efficient object detection [C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) . Piscataway : IEEE , 2020 : 10778 - 10787 .
刘颖 , 刘红燕 , 范九伦 , 等 . 基于深度学习的小目标检测研究与应用综述 [J]. 电子学报 , 2020 , 48 ( 3 ): 590 - 601 .
LIU Y , LIU H Y , FAN J L , et al . A survey of research and application of small object detection based on deep learning [J]. Acta Electronica Sinica , 2020 , 48 ( 3 ): 590 - 601 . (in Chinese)
TONG K , WU Y Q , ZHOU F . Recent advances in small object detection based on deep learning: A review [J]. Image and Vision Computing , 2020 , 97 : 103910 .
YANG F , CHOI W , LIN Y Q . Exploit all the layers: Fast and accurate CNN object detector with scale dependent pooling and cascaded rejection classifiers [C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2016 : 2129 - 2137 .
LIU W , ANGUELOV D , ERHAN D , et al . SSD: Single Shot MultiBox Detector [C]// European Conference on Computer Vision . Cham : Springer , 2016 : 21 - 37 .
CAI Z W , FAN Q F , FERIS R S , et al . A unified multi-scale deep convolutional neural network for fast object detection [C]// European Conference on Computer Vision . Cham : Springer , 2016 : 354 - 370 .
LIN T Y , DOLLÁR P , GIRSHICK R , et al . Feature pyramid networks for object detection [C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2017 : 936 - 944 .
LIANG Z W , SHAO J , ZHANG D Y , et al . Small object detection using deep feature pyramid networks [C]// Pacific Rim Conference on Multimedia . Cham : Springer , 2018 : 554 - 564 .
GHIASI G , LIN T Y , LE Q V . NAS-FPN: Learning scalable feature pyramid architecture for object detection [C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) . Piscataway : IEEE , 2019 : 7029 - 7038 .
WU J X , LIU S T , HUANG D , et al . Multi-Scale Positive Sample Refinement for Few-Shot Object Detection [C]// Computer Vision – ECCV 2020 . Cham : Springer International Publishing , 2020 : 456 - 472 .
李雅倩 , 盖成远 , 肖存军 , 等 . 基于细化多尺度深度特征的目标检测网络 [J]. 电子学报 , 2020 , 48 ( 12 ): 2360 - 2366 .
LI Y Q , GAI C Y , XIAO C J , et al . Object detection networks based on refined multi-scale depth feature [J]. Acta Electronica Sinica , 2020 , 48 ( 12 ): 2360 - 2366 . (in Chinese)
ZHANG S F , ZHU X Y , LEI Z , et al . S3FD : single shot scale-invariant face detector [C]// 2017 IEEE International Conference on Computer Vision . Piscataway : IEEE , 2017: 192 - 201 .
KISANTAL M , WOJNA Z , MURAWSKI J , et al . Augmentation for small object detection [EB/OL]. ( 2019-02-19 ). https://arxiv.org/abs/1902.07296 https://arxiv.org/abs/1902.07296 .
CHEN Y K , ZHANG P Z , LI Z M , et al . Dynamic scale training for object detection [EB/OL]. ( 2020-04-26 ). https://arxiv.org/abs/2004.12432 https://arxiv.org/abs/2004.12432 .
李宝奇 , 贺昱曜 , 强伟 , 等 . 基于并行附加特征提取网络的SSD地面小目标检测模型 [J]. 电子学报 , 2020 , 48 ( 1 ): 84 - 91 .
LI B Q , HE Y Y , QIANG W , et al . SSD with parallel additional feature extraction network for ground small target detection [J]. Acta Electronica Sinica , 2020 , 48 ( 1 ): 84 - 91 . (in Chinese)
ZHOU X Y , WANG D Q , Krähenbühl Philipp , et al . Objects as Points [EB/OL]. ( 2019-04-16 ). https://arxiv.org/abs/1904.07850 https://arxiv.org/abs/1904.07850 .
LIN T Y , GOYAL P , GIRSHICK R , et al . Focal loss for dense object detection [C]// 2017 IEEE International Conference on Computer Vision(ICCV) . Piscataway : IEEE , 2017 : 2999 - 3007 .
REN S Q , HE K M , GIRSHICK R , et al . Faster R-CNN: Towards real-time object detection with region proposal networks [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2017 , 39 ( 6 ): 1137 - 1149 .
CAI Z W , VASCONCELOS N . Cascade R-CNN: Delving into high quality object detection [C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2018 : 6154 - 6162 .
DAI J , LI Y , HE K , et al . R-FCN: object detection via region-based fully convolutional networks [C]// Proceedings of the 30th International Conference on Neural Information Processing Systems . Red Hook : Curran Associates Inc , 2016 : 379 - 387 .
BOCHKOVSKIY A , WANG C Y , LIAO H , et al . YOLOv4: Optimal speed and accuracy of object detection [EB/OL]. ( 2020-04-23 ). https://arxiv.org/abs/2004.10934 https://arxiv.org/abs/2004.10934 .
FU C Y , LIU W , RANGA A , et al . Dssd: Deconvolutional single shot detector [EB/OL]. ( 2017-01-23 ). https://arxiv.org/abs/1701.06659 https://arxiv.org/abs/1701.06659 .
LAW H , DENG J . CornerNet: Detecting objects as paired keypoints [C]// Computer Vision – ECCV 2018 . Cham : Springer International Publishing , 2018 : 765 - 781 .
ZHOU X Y , KOLTUN V , KRÄHENBÜHL P , et al . Probabilistic two-stage detection [EB/OL]. ( 2021-03-12 ). https://arxiv.org/abs/2103.07461 https://arxiv.org/abs/2103.07461 .
0
浏览量
9
下载量
2
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621