Balanced Mixup Loss for Long-tailed Visual Recognition

Haibo Ye (Nanjing University of Aeronautics and Astronautics ); Fangyu Zhou (Nanjing University of Aeronautics and Astronautics); Xinjie Li (Nanjing University of Aeronautics and Astronautics); Qingheng Zhang (Nanjing University of Aeronautics and Astronautics)

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

06 Jun 2023

In the real world, the data collected naturally are often long-tailed, which inevitably leads to class-imbalanced prediction and performance degradation. As a simple and effective data augmentation method, Mixup has been proven to be beneficial for the tail class in recent long-tail learning studies. However, samples selected by Mixup are still imbalanced, which could exacerbate the imbalance problem. Existing work always considers adjusting the input distribution to alleviate this problem, while this may lead to over-fitting in the tail class. In this paper, we detail the theoretical analysis of the data imbalance caused by Mixup, and propose a novel Balanced Mixup (BaMix) loss function from the output perspective. By adding a balance term, we theoretically prove the BaMix loss can overcome the imbalance caused by Mixup. In the experiments, our solution achieves the state-of-the-art performance on CIFAR-LT, ImageNet-LT, and iNaturalist 2018.

Tags:

Applications of machine learning