Learning Representations for New Sound Classes With Continual Self-Supervised Learning
Zhepei Wang (University of Illinois at Urbana-Champaign); Cem Subakan (Mila); Xilin Jiang (University of Illinois at Urbana-Champaign); Junkai Wu (University of Illinois at Urbana-Champaign); Efthymios Tzinis (University of Illinois at Urbana-Champaign); Mirco Ravanelli (Université de Montréal); Paris Smaragdis (University of Illinois at Urbana-Champaign)
In this article, we study a sound recognition system that continually incorporates new sound classes. Our main goal is to develop a framework in which the model can be updated without relying on labeled data. To this end, we propose adopting representation learning, where an encoder is trained on unlabeled data. This framework enables the study of a practically relevant use case in which only a small amount of labeled data is available in a continual learning context. We also make the empirical observation that, within this framework, a similarity-based representation learning method is robust to forgetting even when no explicit anti-forgetting mechanism is employed. We show that this approach achieves performance comparable to several distillation-based continual learning methods applied on top of self-supervised representation learning.
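To make the training signal concrete, below is a minimal sketch of similarity-based self-supervised representation learning in PyTorch, assuming a SimSiam-style setup (two augmented views, a predictor head, and a negative cosine similarity loss with a stop-gradient). The encoder architecture, augmentation, and hyperparameters here are hypothetical placeholders for illustration, not the paper's exact configuration.

```python
# Hypothetical sketch: similarity-based self-supervised encoder training
# on unlabeled audio features. No labels are used at any point.
import torch
import torch.nn as nn
import torch.nn.functional as F

class Encoder(nn.Module):
    """Toy encoder mapping a (batch, features) input to an embedding."""
    def __init__(self, in_dim=128, emb_dim=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 256), nn.ReLU(),
            nn.Linear(256, emb_dim),
        )

    def forward(self, x):
        return self.net(x)

def similarity_loss(p, z):
    # Negative cosine similarity; z is detached so gradients flow only
    # through the predicted branch (the stop-gradient trick).
    return -F.cosine_similarity(p, z.detach(), dim=-1).mean()

def augment(x):
    # Stand-in for audio augmentations (e.g., added noise, masking).
    return x + 0.1 * torch.randn_like(x)

encoder = Encoder()
predictor = nn.Sequential(nn.Linear(64, 32), nn.ReLU(), nn.Linear(32, 64))
opt = torch.optim.Adam(
    list(encoder.parameters()) + list(predictor.parameters()), lr=1e-3
)

for step in range(100):          # unlabeled batches, e.g., from new sound classes
    x = torch.randn(32, 128)     # dummy batch of audio features
    z1, z2 = encoder(augment(x)), encoder(augment(x))
    p1, p2 = predictor(z1), predictor(z2)
    loss = 0.5 * (similarity_loss(p1, z2) + similarity_loss(p2, z1))
    opt.zero_grad()
    loss.backward()
    opt.step()
```

In the continual setting described above, the same objective would simply be run again on unlabeled data from newly arriving classes, with the small amount of labeled data reserved for fitting a downstream classifier on the frozen or updated embeddings.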