LIU Chang-yuan,ZHANG Yu-liang,BI Xiao-jun.Urban Traffic Object Detection Based on Multi-Stage Proposal Sparse R-CNN[J].ACTA ELECTRONICA SINICA,2023,51(01):26-31.
Aiming at the slow speed and low accuracy of multi-object detection algorithms in urban traffic scenes
this paper proposes a multi-stage proposal sparse region-based convolutional neural network algorithm (MPS R-CNN). The algorithm mainly has the following characteristics: a multi-stage proposal box filtering update mechanism is proposed to improve the detection accuracy of the algorithm; a bidirectional parallel feature pyramid network (BPFPN) is proposed to enhance the model feature fusion capability; for the problem of object detection in urban traffic scenes
the Copy-Paste data augmentation method and CIoU loss function are introduced. The experimental results show that the MPS R-CNN algorithm achieves 77% mAP on the urban object dataset
and the algorithm detection speed remains at 37 fps
which is better than other current urban traffic object detection algorithms.
关键词
Keywords
references
LOWE D G . Distinctive image features from scale-invariant keypoints [J]. International Journal of Computer Vision , 2004 , 60 ( 2 ): 91 - 110 .
GIRSHICK R , DONAHUE J , DARRELL T , et al . Rich feature hierarchies for accurate object detection and semantic segmentation [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2014 : 580 - 587 .
REDMON J , FARHADI A . YOLO9000: Better, faster, stronger [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2017 : 7263 - 7271 .
LI Baoqi , HE Yuyao , QIANG Wei , HE Lingjiao . SSD small target detection model based on parallel additional feature extraction network [J]. Acta Electronica Sinica , 2020 , 48 ( 1 ): 84 - 91 . (in Chinese)
HE K , GKIOXARI G , DOLLÁR P , et al . Mask r-cnn [C]// Proceedings of the IEEE International Conference on Computer Vision . Piscataway : IEEE , 2017 : 2961 - 2969 .
LIN T Y , GOYAL P , GIRSHICK R , et al . Focal loss for dense object detection [C]// Proceedings of the IEEE International Conference on Computer Vision . Piscataway : IEEE , 2017 : 2980 - 2988 .
SUN P , ZHANG R , JIANG Y , et al . Sparse r-cnn: End-to-end object detection with learnable proposals [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2021 : 14454 - 14463 .
HE K , ZHANG X , REN S , et al . Deep residual learning for image recognition [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2016 : 770 - 778 .
LIN T Y , DOLLÁR P , GIRSHICK R , et al . Feature pyramid networks for object detection [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2017 : 2117 - 2125 .
HOU Qingshan , XING Jinsheng . SSD target detection algorithm based on Grad-CAM and KL loss [J]. Acta Electronica Sinica , 2020 , 48 ( 12 ): 2409 - 2416 . (in Chinese)
GHIASI G , CUI Y , SRINIVAS A , et al . Simple copy-paste is a strong data augmentation method for instance segmentation [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2021 : 2918 - 2928 .
ZHENG Z , WANG P , REN D , et al . Enhancing geometric factors in model learning and inference for object detection and instance segmentation [J]. IEEE Transactions on Cybernetics , 2021 , 62 ( 01 ): 1 - 13 .
LIU S , QI L , QIN H , et al . Path aggregation network for instance segmentation [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2018 : 8759 - 8768 .
DOMINGUEZ-SANCHEZ A , CAZORLA M , ORTS-ESCOLANO S . A new dataset and performance evaluation of a region-based cnn for urban object detection [J]. Electronics , 2018 , 7 ( 11 ): 301 .
TAN M , PANG R , LE Q V . Efficientdet: Scalable and efficient object detection [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2020 : 10781 - 10790 .
PSALTIS A , DIMOU A , ALVAREZ F , et al . Flow R-CNN: Flow-enhanced object detection [C]// ICPR International Workshops and Challenges . Berlin : Springer , 2021 : 685 - 700 .