Fusion Target Attention Mask Generation Network For Video Segmentation
Yunyi Li, Fangping Chen, Fan Yang, Yuan Li, Huizhu Jia, Xiaodong Xie
Video segmentation aims to segment target objects in a video sequence, which remains challenging due to the motion and deformation of objects. In this paper, we propose a novel attention-driven hybrid encoder-decoder network that generates object segmentations by fully leveraging spatial and temporal information. First, a multi-branch network is designed to learn feature representations from object appearance, location, and motion. Second, a target attention module is proposed to further exploit context information from the learned representations. In addition, a novel edge loss is designed that constrains the model to generate salient edge features and accurate segmentations. The proposed model has been evaluated on two widely used public benchmarks, and experiments demonstrate its superior robustness and effectiveness compared with state-of-the-art methods.
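The abstract describes the components only at a high level, so the following is a minimal, illustrative PyTorch sketch of two of them: a target attention module that re-weights fused multi-branch features, and an edge loss computed on boundaries extracted from the predicted and ground-truth masks. The specific attention design (SE-style channel gating followed by a spatial gate), the Laplacian boundary extraction, and all layer sizes are assumptions for illustration, not the paper's actual implementation.

```python
# Illustrative sketch only: the paper's real architecture and loss may differ.
import torch
import torch.nn as nn
import torch.nn.functional as F


class TargetAttention(nn.Module):
    """Assumed design: channel gating then spatial gating over features
    fused from the appearance, location, and motion branches."""

    def __init__(self, channels: int):
        super().__init__()
        # Channel attention: squeeze-and-excitation style bottleneck gate.
        self.channel_gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // 4, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // 4, channels, kernel_size=1),
            nn.Sigmoid(),
        )
        # Spatial attention: a single-channel saliency map over locations.
        self.spatial_gate = nn.Sequential(
            nn.Conv2d(channels, 1, kernel_size=7, padding=3),
            nn.Sigmoid(),
        )

    def forward(self, fused: torch.Tensor) -> torch.Tensor:
        x = fused * self.channel_gate(fused)  # emphasize target-relevant channels
        return x * self.spatial_gate(x)       # emphasize target-relevant locations


def edge_loss(pred_mask: torch.Tensor, gt_mask: torch.Tensor) -> torch.Tensor:
    """Assumed formulation: binary cross-entropy between object boundaries,
    where boundaries are approximated with a fixed Laplacian filter."""
    lap = torch.tensor([[0., 1., 0.], [1., -4., 1.], [0., 1., 0.]],
                       device=gt_mask.device).view(1, 1, 3, 3)
    # Clamp predictions away from {0, 1} so BCE stays finite.
    pred_edge = F.conv2d(pred_mask, lap, padding=1).abs().clamp(1e-6, 1 - 1e-6)
    gt_edge = F.conv2d(gt_mask, lap, padding=1).abs().clamp(0., 1.)
    return F.binary_cross_entropy(pred_edge, gt_edge)


if __name__ == "__main__":
    # Hypothetical fused features from the three branches.
    feats = torch.randn(2, 256, 32, 32)
    attended = TargetAttention(256)(feats)
    pred = torch.sigmoid(torch.randn(2, 1, 64, 64))  # predicted mask probabilities
    gt = (torch.rand(2, 1, 64, 64) > 0.5).float()    # binary ground-truth mask
    print(attended.shape, edge_loss(pred, gt).item())
```

In this reading, the attention module sharpens the fused representation around the target before decoding, while the edge term supplements a standard segmentation loss by penalizing boundary disagreement, which is consistent with the abstract's claim that the edge loss drives salient edge features and more accurate segmentations.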