Audio Barlow Twins: Self-Supervised Audio Representation Learning

Jonah Anton (Imperial College London); Harry Coppock (Imperial College London); Pancham Shukla (Imperial College London); Bjoern W. Schuller (Imperial College London)

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

06 Jun 2023

The Barlow Twins self-supervised learning objective requires neither negative samples or asymmetric learning updates, achieving results on a par with the current state-of-the-art within Computer Vision. As such, we present Audio Barlow Twins, a novel self-supervised audio representation learning approach, adapting Barlow Twins to the audio domain. We pre-train on the large-scale audio dataset AudioSet, and evaluate the quality of the learnt representations on 18 tasks from the HEAR 2021 Challenge, achieving results which outperform, or otherwise are on a par with, the current state-of-the-art for instance discrimination self-supervised learning approaches to audio representation learning. Code at https://github.com/jonahanton/SSL_audio.

Tags:

transfer learning

Audio Barlow Twins: Self-Supervised Audio Representation Learning

Jonah Anton (Imperial College London); Harry Coppock (Imperial College London); Pancham Shukla (Imperial College London); Bjoern W. Schuller (Imperial College London)

Value-Added Bundle(s) Including this Product

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

Short Course Bundle: ICIP 2023 COURSE 2: Short Course: Unboxing Advancements in Biomedical Image Processing (Parts 1-4)

PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition

(Slides) Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition

Join the IEEE Signal Processing Society