Robust Visual Tracking Based on Second Order Pooling Network

PU Lei, FENG Xin-xi, HOU Zhi-qiang, YU Wang-sheng

ACTA ELECTRONICA SINICA ›› 2020, Vol. 48 ›› Issue (8) : 1472-1478.

DOI: 10.3969/j.issn.0372-2112.2020.08.003


Abstract

Targets are easily lost in complex scenes involving low resolution, occlusion, or interference from similar objects. To address this problem, this paper proposes a visual tracking algorithm based on a second-order pooling network. Most existing methods use first-order pooling networks, whose features do not discriminate sufficiently between similar targets. Starting from the VGG16 network structure, the last first-order pooling layer is replaced with a second-order covariance pooling layer, and the network is then retrained on the ImageNet and CUB200-2011 image datasets. To reduce the computational burden, only the fourth convolutional feature of the pretrained network is extracted as the appearance representation of the target. Finally, the extracted features are combined with an existing correlation filtering algorithm. Experimental results show that the algorithm achieves excellent tracking accuracy and success rate.
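
To illustrate the difference between first-order and second-order pooling mentioned above, the following is a minimal NumPy sketch, not the authors' implementation. It assumes a single feature map of shape C x H x W and computes a plain sample covariance. The abstract does not say whether the pooled matrix is normalized or how the layer is trained, so the function names `covariance_pool` and `first_order_pool`, the 8-channel random input, and the 1/(HW) scaling are all illustrative assumptions.

```python
import numpy as np


def covariance_pool(features):
    """Second-order (covariance) pooling of one C x H x W feature map.

    Returns the C x C sample covariance of the channel vectors observed
    at the H*W spatial positions (normalized by H*W, an assumption).
    """
    c, h, w = features.shape
    x = features.reshape(c, h * w)          # each column is one spatial position
    x = x - x.mean(axis=1, keepdims=True)   # center every channel
    return (x @ x.T) / (h * w)              # pairwise channel statistics


def first_order_pool(features):
    """Global average pooling, for comparison: keeps only channel means."""
    return features.mean(axis=(1, 2))


# Hypothetical input: an 8-channel, 5 x 5 feature map filled with random values.
rng = np.random.default_rng(0)
fmap = rng.standard_normal((8, 5, 5))

cov = covariance_pool(fmap)    # shape (8, 8): channel-to-channel correlations
avg = first_order_pool(fmap)   # shape (8,): one mean per channel
```

First-order pooling summarizes each channel by a single mean, whereas the covariance matrix also records how pairs of channels vary together across the spatial positions. That pairwise information is the kind of additional statistic a second-order pooling layer exposes to later stages.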

Key words

visual tracking / second-order pooling network / deep features / correlation filter

Cite this article

PU Lei, FENG Xin-xi, HOU Zhi-qiang, YU Wang-sheng. Robust Visual Tracking Based on Second Order Pooling Network[J]. Acta Electronica Sinica, 2020, 48(8): 1472-1478. https://doi.org/10.3969/j.issn.0372-2112.2020.08.003


Funding

National Natural Science Foundation of China (No.61571458, No.61703423)