A FEATURE PAIR FUSION AND HIERARCHICAL LEARNING FRAMEWORK FOR VIDEO RE-LOCALIZATION
Ruolin Wang, Yuan Zhou
SPS
Video re-localization is an emerging research topic, but existing methods still have notable deficiencies, chiefly interference from irrelevant information in the input reference video and neglect of the correlation between query and reference video features. We therefore present a novel framework, the Semantic Relevance Learning Network, to address these shortcomings. First, we extract effective proposals from the reference video and use them as new inputs, reducing interference from irrelevant video frames. Second, two key components of the proposed model, the Attention-based Fusion Tensor and the Semantic Relevance Measurement, jointly explore the intrinsic correlation between video feature pairs and output a relevance score as the measurement. To better evaluate the proposed model, we reorganize Thumos14 into a new dataset for the video re-localization task. On both ActivityNet and Thumos14, our model achieves the best performance reported so far.
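The abstract does not specify the internals of the fusion and measurement components, but the general idea of cross-attending a query clip's frame features to a reference proposal's frame features and scoring their semantic relevance can be sketched as follows. This is a minimal illustration in NumPy, not the authors' implementation; the function name `attention_fusion_score` and the choice of dot-product attention with a mean cosine-similarity score are assumptions for exposition only.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the given axis
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_fusion_score(query_feats, proposal_feats):
    """Hypothetical sketch of attention-based feature-pair fusion.

    query_feats:    (Tq, d) per-frame features of the query video
    proposal_feats: (Tr, d) per-frame features of a reference proposal
    Returns a scalar relevance score in [-1, 1].
    """
    # Pairwise affinities between every query/proposal frame pair
    affinity = query_feats @ proposal_feats.T            # (Tq, Tr)
    # Attend over proposal frames for each query frame
    attn = softmax(affinity, axis=1)                     # (Tq, Tr)
    # Fuse: proposal features re-expressed on the query's timeline
    fused = attn @ proposal_feats                        # (Tq, d)
    # Score: mean cosine similarity between query and fused features
    num = (query_feats * fused).sum(axis=1)
    den = (np.linalg.norm(query_feats, axis=1)
           * np.linalg.norm(fused, axis=1) + 1e-8)
    return float((num / den).mean())
```

Scoring each extracted proposal this way and ranking by the resulting relevance would realize, in simplified form, the two-stage scheme the abstract describes: proposal extraction first, then joint correlation measurement between feature pairs.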