DEEP LEARNING AND INTERACTIVITY FOR VIDEO ROTOSCOPING
Shivam Saboo, Frederic Lefebvre, Vincent Demoulin
SPS
In this work we extend the idea of object co-segmentation to interactive video segmentation. Our framework predicts the coordinates of vertices along the boundary of an object for two frames of a video simultaneously. The predicted vertices are interactive in nature: a user interaction on one frame assists the network in correcting the predictions for both frames. We employ an attention mechanism at the encoder stage and a simple combination network at the decoder stage, which allows the network to perform this simultaneous correction efficiently. The framework is also robust to the temporal gap between the two input frames, handling a distance of up to 50 frames. We train our model on a professional dataset, which consists of pixel-accurate annotations produced by professional roto artists. We test our model on DAVIS and achieve state-of-the-art results in both automatic and interactive modes, surpassing Curve-GCN and PolyRNN++.
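The core idea of the abstract — encoding two frames jointly so each frame's boundary-vertex predictions are informed by the other — can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the feature dimensions, the dot-product cross-attention, and the linear vertex head are all illustrative assumptions standing in for the paper's encoder attention and decoder combination network.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_frame_attention(feat_a, feat_b):
    # Hypothetical cross-attention: each frame's vertex features attend
    # over the other frame's features, so the two encodings share
    # information (the co-segmentation intuition from the abstract).
    scores = feat_a @ feat_b.T / np.sqrt(feat_a.shape[1])
    attended_a = softmax(scores, axis=1) @ feat_b   # frame A informed by B
    attended_b = softmax(scores.T, axis=1) @ feat_a # frame B informed by A
    return attended_a, attended_b

def predict_vertices(feat, w):
    # Illustrative linear head mapping each vertex feature to an (x, y)
    # boundary coordinate; the paper's decoder is more elaborate.
    return feat @ w  # shape (N, 2)

rng = np.random.default_rng(0)
N, D = 40, 64  # assumed: 40 boundary vertices, 64-dim features per vertex
feat_a = rng.normal(size=(N, D))  # encoder features, frame A
feat_b = rng.normal(size=(N, D))  # encoder features, frame B
w = rng.normal(size=(D, 2)) * 0.1

att_a, att_b = cross_frame_attention(feat_a, feat_b)
poly_a = predict_vertices(att_a, w)  # predicted polygon for frame A
poly_b = predict_vertices(att_b, w)  # predicted polygon for frame B
```

In the interactive setting described above, a user's correction of one vertex on frame A would update `feat_a`, and because of the cross-attention both `poly_a` and `poly_b` would change on the next forward pass.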