RANDMASKING AUGMENT: A SIMPLE AND RANDOMIZED DATA AUGMENTATION FOR ACOUSTIC SCENE CLASSIFICATION
JuBum Han (Samsung Research); Mateusz Matuszewski (Samsung R&D Institute Poland); Olaf Sikorski (Samsung R&D Poland); Hosang Sung (Samsung Research); Hoonyoung Cho (Samsung Research)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
In this work, we describe RandMasking Augment as an effective data augmentation method for acoustic scene classification research. We concentrate on both time and frequency domains masking augmentation introduced in SpecAugment, and apply various transformations that can maintain time and frequency information of the original spectrogram to the masking region. Because acoustic feature is transformed into various forms without distortion of frequency and time information, the proposed augmentation can capture unique characteristics of the input audio in detail. Moreover, RandMasking Augment can be extended by mixing other audio samples and applying different weights on frequency bands in the randomized masking region. We evaluate the suggested augmentation on the DCASE 2018 Task1A dataset and the DCASE 2019 Task1A dataset, and it is compared with other augmentation methods. The proposed augmentation shows outstanding performances with various popular convolutional neural networks.