1.长春理工大学电子信息工程学院,吉林长春 130022
2.吉林省光电检测与智能信息处理工程技术研究中心,吉林长春 130022
[ "王彩霞 女,1978年2月生,辽宁鞍山人.长春理工大学副教授、硕士生导师.主要从事智能信息处理技术、计算机视觉与目标跟踪、传感与信号处理等方面的研究. E-mail: wcxhao@sina.com" ]
[ "安 琪 女,1996年1月生,吉林松原人.长春理工大学电子信息工程学院硕士研究生.主要研究方向为计算机视觉和目标跟踪. E-mail: 240404484@qq.com" ]
[ "周鸿策 男,2000年6月生,吉林松原人.长春理工大学电子信息工程学院硕士研究生.主要研究方向为计算机视觉和目标跟踪. E-mail: zhccust@163.com" ]
[ "李义鹏 男,2001年8月生,河南新乡人.长春理工大学电子信息工程学院研究生.主要研究方向为计算机视觉和图像超分辨率重构. E-mail: 2337297789@qq.com" ]
收稿:2025-01-11,
录用:2025-06-10,
纸质出版:2025-08-25
移动端阅览
王彩霞, 安琪, 周鸿策, 等. 基于特征自适应选取的视觉目标跟踪算法[J]. 电子学报, 2025, 53(08): 2879-2898.
WANG Cai-xia, AN Qi, ZHOU Hong-ce, et al. Visual Object Tracking Algorithm Based on Adaptive Feature Selection[J]. Acta Electronica Sinica, 2025, 53(08): 2879-2898.
王彩霞, 安琪, 周鸿策, 等. 基于特征自适应选取的视觉目标跟踪算法[J]. 电子学报, 2025, 53(08): 2879-2898. DOI:10.12263/DZXB.20250046
WANG Cai-xia, AN Qi, ZHOU Hong-ce, et al. Visual Object Tracking Algorithm Based on Adaptive Feature Selection[J]. Acta Electronica Sinica, 2025, 53(08): 2879-2898. DOI:10.12263/DZXB.20250046
针对现有视觉目标跟踪算法始终选择所有的历史模板与全部搜索区域进行交互,导致有强背景干扰或者目标发生形变时产生的跟踪失败问题,提出一种基于特征自适应选取的视觉目标跟踪算法.首先,通过模板特征过滤器将传统图像级模板更新优化为特征级动态更新,筛选当前帧的强相关模板特征并压缩弱相关特征,减少冗余信息干扰;其次,采用搜索特征鉴别器自适应划分搜索区域中潜在的目标特征与噪声特征,抑制无关区域的交互;最后,引入时空信息传播令牌,跨帧传递目标外观与位置信息,逐帧修正跟踪响应;设计基于分离注意力机制的特征交互编码器,将自注意力与交叉注意力分离,适配上述模块并增强判别能力.在多种大规模公开数据集上的实验取得了鲁棒结果,在OTB100、LaSOT和UAV123数据集上的精度分别达到93.0%、79.6%和91.2%,且算法能够实现跟踪成功率与跟踪速度的良好平衡,提升了跟踪器在复杂场景下的准确性和鲁棒性.
To address the persistent tracking failures caused by strong background interference or target deformation in existing visual object tracking algorithms that indiscriminately utilize all historical templates and interact with entire search regions
this paper proposes a feature-adaptive selection based visual object tracking algorithm. First
a template feature filter is introduced to optimize traditional image-level template updating into feature-level dynamic updating
which selectively preserves strongly correlated template features while compressing weakly relevant features to reduce redundant information interference. Second
a search feature discriminator is employed to autonomously distinguish potential target features from noise features in search regions
thereby suppressing interactions with irrelevant areas. Furthermore
spatio-temporal information propagation tokens are incorporated to transmit target appearance and positional information across frames for progressive response refinement. A feature interaction encoder based on decoupled attention mechanisms is designed
which separates self-attention and cross-attention operations to better adapt to the proposed modules while enhancing discriminative capabilities. Comprehensive experiments on multiple large-scale public datasets demonstrate robust performance
achieving precision scores of 93.0%
79.6%
and 91.2% on OTB100
LaSOT
and UAV123 datasets respectively. The algorithm maintains an optimal balance between tracking success rate and operational efficiency
significantly improving tracking accuracy and robustness in complex scenarios.
孙家伟 . 基于域不变投影的全天候目标跟踪方法研究 [D ] . 南京 : 南京邮电大学 , 2022 .
SUN J W . Research on All-day Target Tracking Algorithm Based on Domain Invariant Projection [D ] . Nanjing : Nanjing University of Posts and Telecommunications , 2022 . (in Chinese)
李淑慧 , 邓志红 , 冯肖雪 , 等 . 强杂波背景下基于变分贝叶斯推理的机载雷达目标跟踪算法 [J ] . 电子学报 , 2022 , 50 ( 5 ): 1089 - 1097 .
LI S H , DENG Z H , FENG X X , et al . Variational Bayesian inference-based airborne radar target tracking algorithm in strong clutter [J ] . Acta Electronica Sinica , 2022 , 50 ( 5 ): 1089 - 1097 . (in Chinese)
钟钰彬 , 杨鹏 , 窦磊 . 基于纵横比自适应的相关滤波跟踪算法 [J ] . 电子学报 , 2024 , 52 ( 6 ): 2112 - 2122 .
ZHONG Y B , YANG P , DOU L . Correlation filtering tracking algorithm based on adaptive aspect-ratio [J ] . Acta Electronica Sinica , 2024 , 52 ( 6 ): 2112 - 2122 . (in Chinese)
姜珊 , 底晓强 , 韩成 . 融合时空特性的孪生网络视觉跟踪 [J ] . 兵工学报 , 2021 , 42 ( 9 ): 1940 - 1950 .
JIANG S , DI X Q , HAN C . Siamese network for visual tracking with temporal-spatial property [J ] . Acta Armamentarii , 2021 , 42 ( 9 ): 1940 - 1950 . (in Chinese)
才华 , 王学伟 , 付强 , 等 . 基于动态模板更新的孪生网络目标跟踪算法 [J ] . 吉林大学学报(工学版) , 2022 , 52 ( 5 ): 1106 - 1116 .
CAI H , WANG X W , FU Q , et al . Siamese network target tracking algorithm based on dynamic template updating [J ] . Journal of Jilin University (Engineering and Technology Edition) , 2022 , 52 ( 5 ): 1106 - 1116 . (in Chinese)
谢青松 , 刘晓庆 , 安志勇 , 等 . 基于前景优化的视觉目标跟踪算法 [J ] . 电子学报 , 2022 , 50 ( 7 ): 1558 - 1566 .
XIE Q S , LIU X Q , AN Z Y , et al . Visual object tracking algorithm based on foreground optimization [J ] . Acta Electronica Sinica , 2022 , 50 ( 7 ): 1558 - 1566 . (in Chinese)
BERTINETTO L , VALMADRE J , HENRIQUES J F , et al . Fully-convolutional Siamese networks for object tracking [C ] // European Conference on Computer Vision . Cham : Springer , 2016 : 850 - 865 .
LI B , WU W , WANG Q , et al . SiamRPN++: Evolution of Siamese visual tracking with very deep networks [C ] // 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2019 : 4277 - 4286 .
GUO D Y , WANG J , CUI Y , et al . SiamCAR: Siamese fully convolutional classification and regression for visual tracking [C ] // 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2020 : 6268 - 6276 .
CHEN Z D , ZHONG B N , LI G R , et al . Siamese box adaptive network for visual tracking [C ] // 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2020 : 6667 - 6676 .
DANELLJAN M , BHAT G , KHAN F S , et al . ATOM: Accurate tracking by overlap maximization [C ] // 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2019 : 4655 - 4664 .
ZHANG Z P , LIU Y H , WANG X , et al . Learn to match: Automatic matching network design for visual tracking [C ] // 2021 IEEE/CVF International Conference on Computer Vision . Piscataway : IEEE , 2021 : 13319 - 13328 .
CHEN X , YAN B , ZHU J W , et al . Transformer tracking [C ] // 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2021 : 8122 - 8131 .
ZHENG Y Z , ZHONG B N , LIANG Q H , et al . ODTrack: Online dense temporal token learning for visual tracking [J ] . Proceedings of the AAAI Conference on Artificial Intelligence , 38 ( 7 ): 7588 - 7596 .
HUANG L H , ZHAO X , HUANG K Q . GlobalTrack: A simple and strong baseline for long-term tracking [J ] . Proceedings of the AAAI Conference on Artificial Intelligence , 2020 , 34 ( 7 ): 11037 - 11044 .
YAN B , PENG H W , FU J L , et al . Learning spatio-temporal transformer for visual tracking [C ] // 2021 IEEE/CVF International Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2021 : 10428 - 10437 .
CUI Y T , JIANG C , WANG L M , et al . MixFormer: End-to-end tracking with iterative mixed attention [C ] // 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2022 : 13598 - 13608 .
GAO S Y , ZHOU C L , MA C , et al . AIATrack: Attention in attention for transformer visual tracking [C ] // 17th European Conference on Computer Vision . Cham : Springer , 2022 : 146 - 164 .
GUO D Y , SHAO Y Y , CUI Y , et al . Graph attention tracking [C ] // 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2021 : 9538 - 9547 .
YE B T , CHANG H , MA B P , et al . Joint feature learning and relation modeling for tracking: A one-stream framework [C ] // 17th European Conference on Computer Vision . Cham : Springer , 2022 : 341 - 357 .
FU Z H , FU Z H , LIU Q J , et al . SparseTT: Visual tracking with sparse transformers [EB/OL ] . ( 2022-05-08 )[ 2025-01-01 ] . https://arxiv.org/abs/2205.03776 https://arxiv.org/abs/2205.03776 .
GAO S Y , ZHOU C L , ZHANG J . Generalized relation modeling for Transformer tracking [C ] // 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2023 : 18686 - 18695 .
YANG K , ZHANG H J , SHI J Y , et al . BANDT: A border-aware network with deformable transformers for visual tracking [J ] . IEEE Transactions on Consumer Electronics , 2023 , 69 ( 3 ): 377 - 390 .
SONG Z K , YU J Q , CHEN Y P , et al . Transformer tracking with cyclic shifting window attention [C ] // 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2022 : 8781 - 8790 .
WU Y , LIM J , YANG M H . Object tracking benchmark [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2015 , 37 ( 9 ): 1834 - 1848 .
FAN H , LIN L T , YANG F , et al . LaSOT: A high-quality benchmark for large-scale single object tracking [C ] // 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2019 : 5369 - 5378 .
MUELLER M , SMITH N , GHANEM B . A Benchmark and simulator for UAV tracking [M ] // Computer Vision - ECCV 2016 . Cham : Springer International Publishing , 2016 : 445 - 461 .
FU Z H , LIU Q J , FU Z H , et al . STMTrack: Template-free visual tracking with space-time memory networks [C ] // 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2021 : 13769 - 13778 .
刘广文 , 谢欣月 , 付强 , 等 . 基于时空模板焦点注意的Transformer目标跟踪算法 [J ] . 吉林大学学报(工学版) , 2025 , 55 ( 3 ): 1037 - 1049 .
LIU G W , XIE X Y , FU Q , et al . Spatiotemporal Transformer with template attention for target tracking [J ] . Journal of Jilin University (Engineering and Technology Edition) , 2025 , 55 ( 3 ): 1037 - 1049 . (in Chinese)
DANELLJAN M , HÄGER G , KHAN F S , et al . Adaptive decontamination of the training set: A unified formulation for discriminative visual tracking [C ] // 2016 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2016 : 1430 - 1438 .
ZHOU X , GUO P , HONG L , et al . Reading relevant feature from global representation memory for visual object tracking [J ] . Advances in Neural Information Processing Systems , 2023 , 36 : 10814 - 10827 .
HE K M , CHEN X L , XIE S N , et al . Masked autoencoders are scalable vision learners [C ] // 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2022 : 15979 - 15988 .
DOSOVITSKIY A , BEYER L , KOLESNIKOV A , et al . An image is worth 16 x 16 words: Transformers for image recognition at scale[EB/OL ] . ( 2020-12-22 )[ 2024-12-20 ] . https://arxiv.org/pdf/2010.11929/1000 https://arxiv.org/pdf/2010.11929/1000 .
LIN T Y , MAIRE M , BELONGIE S , et al . Microsoft COCO: Common objects in context [M ] // Computer Vision - ECCV 2014 . Cham : Springer International Publishing , 2014 : 740 - 755 .
HUANG L H , ZHAO X , HUANG K Q . GOT-10k: A large high-diversity benchmark for generic object tracking in the wild [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2021 , 43 ( 5 ): 1562 - 1577 .
MÜLLER M , BIBI A , GIANCOLA S , et al . TrackingNet: A Large-scale dataset and benchmark for object tracking in the wild [M ] // Computer Vision-ECCV 2018 . Cham : Springer International Publishing , 2018 : 310 - 327 .
DANELLJAN M , HÄGER G , KHAN F S , et al . Learning spatially regularized correlation filters for visual tracking [C ] // 2015 IEEE International Conference on Computer Vision . Piscataway : IEEE , 2015 : 4310 - 4318 .
HENRIQUES J F , CASEIRO R , MARTINS P , et al . Exploiting the circulant structure of tracking-by-detection with kernels [M ] // Computer Vision-ECCV 2012 . Berlin : Springer , 2012 : 702 - 715 .
HENRIQUES J F , CASEIRO R , MARTINS P , et al . High-speed tracking with kernelized correlation filters [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2015 , 37 ( 3 ): 583 - 596 .
DANELLJAN M , HÄGER G , SHAHBAZ KHAN F , et al . Accurate scale estimation for robust visual tracking [J ] . Advances in Neural Information Processing Systems , 2023 , 36 : 10814 - 10827 .
BERTINETTO L , VALMADRE J , GOLODETZ S , et al . Staple: Complementary learners for real-time tracking [C ] // 2016 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2016 : 1401 - 1409 .
DANELLJAN M , BHAT G , KHAN F S , et al . ECO: Efficient convolution operators for tracking [C ] // 2017 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2017 : 6931 - 6939 .
ZHANG Z P , PENG H W , FU J L , et al . Ocean: Object-aware anchor-free tracking [C ] // 16th European Conference on Computer Vision . Glasgow : Springer , 2020 : 771 - 787 .
BHAT G , DANELLJAN M , VAN GOOL L , et al . Learning discriminative model prediction for tracking [C ] // 2019 IEEE/CVF International Conference on Computer Vision . Piscataway : IEEE , 2019 : 6181 - 6190 .
CHEN X , PENG H W , WANG D , et al . SeqTrack: Sequence to sequence learning for visual object tracking [C ] // 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2023 : 14572 - 14581 .
CAO Z A , HUANG Z Y , PAN L , et al . Towards real-world visual tracking with temporal contexts [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2023 , 45 ( 12 ): 15834 - 15849 .
CAO Z A , FU C H , YE J J , et al . HiFT: Hierarchical feature transformer for aerial tracking [C ] // 2021 IEEE/CVF International Conference on Computer Vision . Piscataway : IEEE , 2021 : 15437 - 15446 .
WEI X , BAI Y F , ZHENG Y C , et al . Autoregressive visual tracking [C ] // 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2023 : 9697 - 9706 .
XIE J X , ZHONG B N , MO Z Y , et al . Autoregressive queries for adaptive tracking with spatio-temporal transformers [C ] // 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2024 : 19300 - 19309 .
KRISTAN W M , LEONARDIS A , MATAS J , et al . The eighth visual object tracking VOT2020 challenge results [C ] // 16th European Conference on Computer Vision . Glasgow : Springer , 2020 : 547 - 601 .
HU W M , WANG Q , ZHANG L , et al . SiamMask: A framework for fast online object tracking and segmentation [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2023 , 45 ( 3 ): 3072 - 3089 .
MA Z A , WANG L Y , ZHANG H T , et al . RPT: Learning point set representation for Siamese visual tracking [C ] // 16th European Conference . on Computer Vision . Glasgow : Springer , 2020 : 653 - 665 .
0
浏览量
9
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621