ISDA: POSITION-AWARE INSTANCE SEGMENTATION WITH DEFORMABLE ATTENTION

Kaining Ying, Zhenhua Wang, Cong Bai, Pengfei Zhou

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 00:07:28

12 May 2022

Most instance segmentation models are not end-to-end trainable due to either the incorporation of proposal estimation as a pre-processing or non-maximum suppression (NMS) as a post-processing. Here we propose a novel end-to-end instance segmentation method termed ISDA. It reshapes the task into predicting a set of object masks, which are generated via traditional convolution operation with learned position-aware kernels and features of objects. Such kernels and features are learned by leveraging a deformable attention network with multi-scale representation. Thanks to the introduced set-prediction mechanism, the proposed method is NMS-free. Empirically, ISDA outperforms Mask R-CNN (the strong baseline) by 2.6 points on MS-COCO, and achieves leading performance compared with recent models. Code will be available soon.

Tags:

deformable attention

position-aware kernel

instance segmentation

end-to-end

ISDA: POSITION-AWARE INSTANCE SEGMENTATION WITH DEFORMABLE ATTENTION

Kaining Ying, Zhenhua Wang, Cong Bai, Pengfei Zhou

Value-Added Bundle(s) Including this Product

ICASSP 2022, May 2022 Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

End-to-End Automatic Speech Recognition

RECTANGULAR-OUTPUT IMAGE STITCHING

LEARNABLE SNAKE R-CNN FOR INSTANCE-LEVEL BIOMEDICAL IMAGE SEGMENTATION

Join the IEEE Signal Processing Society