Unsupervised Neural Mask Estimator For Generalized Eigen-Value Beamforming Based Asr

Rohit Kumar, Anirudh Sreeram, Anurenjan Purushothaman, Sriram Ganapathy

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 13:07

04 May 2020

The state-of-art methods for acoustic beamforming in multi-channel ASR is based on a neural mask estimator that attempts to learn the prediction of speech and noise using a paired corpus of clean and noisy recordings (teacher model). In this paper, we attempt to move away from the requirements of having a supervised clean recordings. The models based on signal enhancement and conventional beamforming methods serves as the required mask estimate. In this way, the model training can also be carried out on real recordings of noisy speech rather than simulated ones alone done in a teacher model. Several experiments performed on noisy and reverberant environments in the CHiME-3 corpus as well as the REVERB challenge corpus highlight the effectiveness of the proposed approach. The ASR results for the proposed approach provide performances that are significantly better than a teacher model trained on an out-of-domain dataset and on par with the oracle mask estimators in the in-domain dataset.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Unsupervised Neural Mask Estimator For Generalized Eigen-Value Beamforming Based Asr

Rohit Kumar, Anirudh Sreeram, Anurenjan Purushothaman, Sriram Ganapathy

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society