Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks

Yanxiong Li, Mingle Liu, Konstantinos Drossos, Tuomas Virtanen

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 13:49

04 May 2020

Convolutional recurrent neural networks (CRNNs) have achieved state-of-the-art performance for sound event detection (SED). In this paper, we propose to use a dilated CRNN, namely a CRNN with a dilated convolutional kernel, as the classifier for the task of SED. We investigate the effectiveness of dilation operations which provide a CRNN with expanded receptive ?elds to capture long temporal context without increasing the amount of CRNNâs parameters. Compared to the classifier of the baseline CRNN, the classifier of the dilated CRNN obtains a maximum increase of 1.9%, 6.3% and 2.5% at F1 score and a maximum decrease of 1.7%, 4.1% and 3.9% at error rate (ER), on the publicly available audio corpora of the TUT-SED Synthetic 2016, the TUT Sound Event 2016 and the TUT Sound Event 2017, respectively.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks

Yanxiong Li, Mingle Liu, Konstantinos Drossos, Tuomas Virtanen

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society