

浏览全部资源
扫码关注微信
1.中国科学院声学研究所, 北京 100190
2.中国科学院先进水下信息技术重点实验室, 北京 100190
3.中国科学院大学, 北京 100049
Received:05 August 2022,
Revised:2023-02-27,
Published:25 March 2024
移动端阅览
李宝奇,黄海宁,刘纪元,等.基于改进SSD的合成孔径声纳图像感兴趣小目标检测方法[J].电子学报,2024,52(03):762-771.
LI Bao-qi, HUANG Hai-ning, LIU Ji-yuan, et al.Interested Small Target Detection Method Based on Improved SSD for Synthetic Aperture Sonar Image[J].Acta Electronica Sinica, 2024, 52(03): 762-771.
李宝奇,黄海宁,刘纪元,等.基于改进SSD的合成孔径声纳图像感兴趣小目标检测方法[J].电子学报,2024,52(03):762-771. DOI:10.12263/DZXB.20220925
LI Bao-qi, HUANG Hai-ning, LIU Ji-yuan, et al.Interested Small Target Detection Method Based on Improved SSD for Synthetic Aperture Sonar Image[J].Acta Electronica Sinica, 2024, 52(03): 762-771. DOI:10.12263/DZXB.20220925
轻量化目标检测模型SSD-MV3(Single Shot Detection-MobileNet V3)因输入图像尺寸限制无法直接检测高分辨率大尺寸合成孔径声纳(Synthetic Aperture Sonar, SAS)图像中感兴趣小目标.为此,本文提出了一种新的目标检测方法HRSSD(High Resolution Single Shot Detection),该方法通过冗余切割确保SSD-MV3输入图像尺寸的规范以及感兴趣小目标的完整,并利用二次非极大值抑制保证检测结果的唯一.此外,提出了一种尺度、空间和通道注意力机制联合的特征提取模块,并利用该模块重新设计了SSD-MV3的基础网络和附加特征提取网络,记作SSD-MV3P(Single Shot Detection-MobileNet V3 Pro),使得SSD-MV3P能更有效的感知感兴趣小目标特征信息.实验结果表明,在感兴趣小目标检测数据集SST(Sonar Small Targets)上,SSD-MV3P的平均检测精度(mean Average Precision, mAP)比SSD-MV3提升4.39%.HRSSD实现了高分辨率大尺寸SAS图像感兴趣小目标的检测,并且保证了同一位置上检测结果的完整性和唯一性.
The efficient object detection model SSD-MV3 (Single Shot Detection-MobileNet V3) cannot directly detect the interested small targets in high-resolution SAS (Synthetic Aperture Sonar) images due to the input image size limit. To this end
this paper proposes a novel object detection method
HRSSD (High Resolution Single Shot Detection)
which ensures the specification of SSD-MV3 input image size and the integrity of the interested small targets through redundant cutting algorithm
and guarantees the unique detection result by using secondary non-maximum suppression. Furthermore
an improved feature block with a combination of scale
space and channel attention mechanism is proposed
and the basic network and additional feature network of SSD-MV3 are redesigned as SSD-MV3P (Single Shot Detection-MobileNet V3 Pro). Thus
SSD-MV3P can more effectively perceive the feature information of interested small targets. The experimental results show that the mAP (mean Average Precision) of SSD-MV3P is 4.39% higher than that of SSD-MV3 on the interested small target detection dataset SST (Sonar Small Target). HRSSD realizes the detection of the interested small targets in high-resolution SAS images
and ensures the integrity and uniqueness of the detection result at the same location.
HAYES M P , GOUGH P T . Synthetic aperture sonar: A review of current status [J ] . IEEE Journal of Oceanic Engineering , 2009 , 34 ( 3 ): 207 - 224 .
SYNNES S A V , HANSEN R E , SAEBO T O . Spatial coherence of speckle for repeat-pass synthetic aperture sonar micronavigation [J ] . IEEE Journal of Oceanic Engineering , 2021 , 46 ( 4 ): 1330 - 1345 .
THOMAS B , HUNTER A , DUGELAY S . Phase wrap error correction by random sample consensus with application to synthetic aperture sonar micronavigation [J ] . IEEE Journal of Oceanic Engineering , 2021 , 46 ( 1 ): 221 - 235 .
BROWN D C , GERG I D , BLANFORD T E . Interpolation kernels for synthetic aperture sonar along-track motion estimation [J ] . IEEE Journal of Oceanic Engineering , 2020 , 45 ( 4 ): 1497 - 1505 .
FEI T , KRAUS D , ZOUBIR A M . Contributions to automatic target recognition systems for underwater mine classification [J ] . IEEE Transactions on Geoscience and Remote Sensing , 2015 , 53 ( 1 ): 505 - 518 .
WILLIAMS D P . Fast target detection in synthetic aperture sonar imagery: A new algorithm and large-scale performance analysis [J ] . IEEE Journal of Oceanic Engineering , 2015 , 40 ( 1 ): 71 - 92 .
WANG L , LI M H , YE X F , et al . Specific target recognition and segmentation algorithm for real-time side scan sonar images [C ] // 2015 IEEE International Conference on Mechatronics and Automation (ICMA) . Piscataway : IEEE , 2015 : 2649 - 2653 .
FANDOS R , ZOUBIR A M . Optimal feature set for automatic detection and classification of underwater objects in SAS images [J ] . IEEE Journal of Selected Topics in Signal Processing , 2011 , 5 ( 3 ): 454 - 468 .
LECUN Y , BENGIO Y , HINTON G . Deep learning [J ] . Nature , 2015 , 521 ( 7553 ): 436 - 444 .
KWOK R . Deep learning Powers a motion-tracking revolution [J ] . Nature , 2019 , 574 ( 7776 ): 137 - 138 .
WILLIAMS D P . Underwater target classification in synthetic aperture sonar imagery using deep convolutional neural networks [C ] // 2016 23rd International Conference on Pattern Recognition (ICPR) . Piscataway : IEEE , 2016 : 2497 - 2502 .
MCKAY J , GERG I , MONGA V , et al . What's mine is yours: Pretrained CNNs for limited training sonar ATR [EB/OL ] . ( 2022-06-29 ) [ 2022-06-29 ] . https://arxiv.org/abs/1706.09858.pdf https://arxiv.org/abs/1706.09858.pdf .
WILLIAMS D P . On the use of tiny convolutional neural networks for human-expert-level classification performance in sonar imagery [J ] . IEEE Journal of Oceanic Engineering , 2021 , 46 ( 1 ): 236 - 260 .
GIRSHICK R , DONAHUE J , DARRELL T , et al . Rich feature hierarchies for accurate object detection and semantic segmentation [C ] // 2014 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2014 : 580 - 587 .
GIRSHICK R . Fast R-CNN [EB/OL ] . ( 2015-04-30 )[ 2022-05-27 ] . http://arxiv.org/abs/1504.08083 http://arxiv.org/abs/1504.08083 .
REN S Q , HE K M , GIRSHICK R , et al . Faster R-CNN: Towards real-time object detection with region proposal networks [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2017 , 39 ( 6 ): 1137 - 1149 .
JREDMON J , DIVVALA S , GIRSHICK R , et al . You only look once: Unified, real-time object detection [EB/OL ] . ( 2015-06-08 ) [ 2022-05-09 ] . https://arxiv.org/abs/1506.02640 https://arxiv.org/abs/1506.02640 .
LIU W , ANGUELOV D , ERHAN D , et al . SSD: Single shot MultiBox detector [EB/OL ] . ( 2015-12-08 )[ 2022-05-08 ] . https://arxiv.org/abs/1512.02325 https://arxiv.org/abs/1512.02325 .
Adam V E . You only look twice: rapid multi-scale object detection in satellite imagery [EB/OL ] . ( 2018-05-24 )[ 2022-05-24 ] . http://arxiv.org/abs/1805.09512 http://arxiv.org/abs/1805.09512 .
HOWARD A G , ZHU M L , CHEN B , et al . Mobilenets: Efficient convolutional neural networks for mobile vision applications [EB/OL ] . ( 2017-04-17 )[ 2022-04-17 ] . https://arxiv.org/abs/1704.04861 https://arxiv.org/abs/1704.04861 .
SANDLER M , HOWARD A , ZHU M L , et al . MobileNetV2: inverted residuals and linear bottlenecks [C ] // 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2018 : 4510 - 4520 .
李宝奇 , 黄海宁 , 刘纪元 , 等 . 基于改进SSD的合成孔径声呐图像水下多尺度目标轻量化检测模型 [J ] . 电子与信息学报 , 2021 , 43 ( 10 ): 2854 - 2862
LI B Q , HUANG H N , LIU J Y , et al . Synthetic aperture sonar underwater multi-scale target efficient detection model based on improved single shot detector [J ] . Journal of Electronics & Information Technology , 2021 , 43 ( 10 ): 2854 - 2862 (in Chinese)
HU J , SHEN L , ALBANIE S , et al . Squeeze-and-excitation networks [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2020 , 42 ( 8 ): 2011 - 2023 .
HOWARD A , SANDLER M , CHEN B , et al . Searching for MobileNetV3 [C ] // 2019 IEEE/CVF International Conference on Computer Vision (ICCV) . Piscataway : IEEE , 2019 : 1314 - 1324 .
WOO S , PARK J , LEE J Y , et al . CBAM: Convolutional block attention module [EB/OL ] . ( 2018-07-11 )[ 2022-05-18 ] . https://arxiv.org/abs/1807.06521.pdf https://arxiv.org/abs/1807.06521.pdf .
WANG Q L , WU B G , ZHU P F , et al . ECA-Net: Efficient channel attention for deep convolutional neural networks [OL ] . ( 2019-10-08 )[ 2022-04-07 ] . http://arxiv.org/abs/1910.03151 http://arxiv.org/abs/1910.03151 .
LI X , WANG W H , HU X L , et al . Selective kernel networks [C ] // 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) . Piscataway : IEEE , 2019 : 510 - 519 .
BODLA N , SINGH B , CHELLAPPA R , et al . Soft-NMS—Improving object detection with one line of code [EB/OL ] . ( 2017-04-14 )[ 2022-05-08 ] . http://arxiv.org/abs/1704.04503 http://arxiv.org/abs/1704.04503 .
CHEN L C , PAPANDREOU G , KOKKINOS I , et al . DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2018 , 40 ( 4 ): 834 - 848 .
QIN Y J , LUO H Y , ZHAO F , et al . NDGCN: Network in network, dilate convolution and graph convolutional networks based transportation mode recognition [J ] . IEEE Transactions on Vehicular Technology , 2021 , 70 ( 3 ): 2138 - 2152 .
0
Views
16
下载量
0
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621