Fast Video Object Segmentation Based on Siamese Networks

doi:10.3969/j.issn.0372-2112.2020.04.001

您当前的位置：

首页 >

文章列表页 >

Fast Video Object Segmentation Based on Siamese Networks

更新时间：2025-07-08

- Fast Video Object Segmentation Based on Siamese Networks
- Acta Electronica Sinica Vol. 48, Issue 4, Pages: 625-630(2020)
- 作者机构：
  
  北京工业大学信息学部,北京,100124
- 作者简介：
- 基金信息：
- DOI：10.3969/j.issn.0372-2112.2020.04.001
  CLC： TP391.41
- Published Online：25 April 2020，
  
  Published：2020
- 稿件说明：
移动端阅览
Fast Video Object Segmentation Based on Siamese Networks[J]. Acta Electronica Sinica, 2020, 48(4): 625-630.
DOI：

Fast Video Object Segmentation Based on Siamese Networks[J]. Acta Electronica Sinica, 2020, 48(4): 625-630. DOI： 10.3969/j.issn.0372-2112.2020.04.001.

摘要

视频目标分割是计算机视觉领域中的一个研究热点，传统基于深度学习的视频目标分割方法在线微调深度网络，导致分割耗时长，难以满足实时的需求.本文提出一种快速的视频目标分割方法.首先，参数共享的孪生编码器子网将参考流和目标流映射到相同的特征空间，使得相同的目标具有相似的特征.然后，全局特征提取子网在特征空间中匹配给定目标相似的特征，定位目标对象.最后，解码器子网将目标特征还原，并通过连接目标流的低阶特征，提供边缘信息，最终输出目标的分割掩码.在公开基准数据集上的实验表明，本文方法的分割速度有大幅度提升，同时具有较好的分割效果.

Abstract

Video object segmentation (VOS) is a research hotspot in the field of computer vision. Traditional VOS based on deep learning fine-tunes the deep network online

which leads to long time-consuming segmentation and is difficult to meet real-time requirements. Therefore

we propose a fast VOS method. First

the weight-shared siamese encoder subnet maps the reference stream and the target stream to the same feature space; so that the same objects have similar features. Then

the global feature extraction subnet matches the features similar to the given object to locate the object. Finally

the decoder subnet restores the object features and gets edge information by connecting the low-level features of target stream to output the mask. Experiments on public benchmark datasets show that our method improves the speed significantly and achieves good performance.

关键词

Keywords

references

Views

319

下载量

CSCD

Alert me when the article has been cited

提交

Tools

Publicity Resources

Semi-Supervised Video Object Segmentation Based on Foreground Perception Visual Attention

Continual Learning Methods and Applications in Computer Vision

Neural Network Based Image Style Transfer: A Survey

DRHA-UIE: An Underwater Image Enhancement Method Based on Dual Residual Hybrid Attention Block

A Survey on Deep Predictive Learning Based on Unlabeled Videos

Related Author

FU Li-hua

ZHAO Yu

JIANG Han-xu

ZHAO Ru

WU Hui-xian

YAN Shao-xing

FANG Yan

Wei Yun-chao

Related Institution

School of Computer Science and Engineering, Beihang University

School of Computer Science and Technology, Beijing Jiaotong University

School of Control Science and Engineering, Shandong University

School of Computer Science and Technology, Harbin Institute of Technology

School of Computer Science and Technology, Beijing Jiaotong University

⁰