LOGOVIT: LOCAL-GLOBAL VISION TRANSFORMER FOR OBJECT RE-IDENTIFICATION

Phan Nguyen (VinBrain); Ta Duc Huy (Vinbrain); Soan T. M. Duong (Le Quy Don Technical University); Nguyen Hoang Tran (VinBrain); Sam Bao Tran (Vinbrain); Dao Huu Hung (VinBrain); Chanh D Tr Nguyen (VinBrain); Trung Bui (Individual); QUOC HUNG TRUONG (VINBRAIN)

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

04 Jun 2023

Object re-identification (ReID) is prone to errors under variations in scale, illumination, complex background, and object occlusion scenarios. To overcome these challenges, attention mechanisms are employed to focus on the object's characteristics, thereby extracting better discriminative features. This paper introduces a local-global vision transformer (LoGoViT) for object re-identification by learning a hierarchical-level representation from fine-grained (local) to general (global) context features. It comprises two components: (i) shift and shuffle operations to generate robust local features and (ii) local-global module to aggregate the multi-level hierarchy features of an object. Extensive experiments show that our method achieves state-of-the-art on the ReID benchmarks. We further investigate effective augmentation operations and discuss how patch modifications improve the proposed model's generalization under occlusion scenarios. The source code is available at: \url{https://github.com/nguyenphan99/LoGoViT}.

Tags:

Image and video content analysis

LOGOVIT: LOCAL-GLOBAL VISION TRANSFORMER FOR OBJECT RE-IDENTIFICATION

Phan Nguyen (VinBrain); Ta Duc Huy (Vinbrain); Soan T. M. Duong (Le Quy Don Technical University); Nguyen Hoang Tran (VinBrain); Sam Bao Tran (Vinbrain); Dao Huu Hung (VinBrain); Chanh D Tr Nguyen (VinBrain); Trung Bui (Individual); QUOC HUNG TRUONG (VINBRAIN)

Value-Added Bundle(s) Including this Product

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

OPT: One-shot Pose-Controllable Talking Head Generation

ENHANCED GM-PHD FILTER FOR REAL TIME SATELLITE MULTI-TARGET TRACKING

Semi-Federated Learning for Edge Intelligence with Imperfect SIC

Join the IEEE Signal Processing Society