Feature Enhancement With Deep Feature Losses For Speaker Verification

Saurabh Kataria, Phani Sankar Nidadavolu, JesÃºs Villalba, Nanxin Chen, Paola Garcia, Najim Dehak

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 10:55

04 May 2020

Speaker Verification still suffers from the challenge of generalization to novel adverse environments. We leverage on the recent advancements made by deep learning based speech enhancement and propose a feature-domain supervised denoising based solution. We propose to use Deep Feature Loss which optimizes the enhancement network in the hidden activation space of a pre-trained auxiliary speaker embedding network. We experimentally verify the approach on simulated and real data. A simulated testing setup is created using various noise types at different SNR levels. For evaluation on real data, we choose BabyTrain corpus which consists of children recordings in uncontrolled environments. We observe consistent gains in every condition over the state-of-the-art augmented Factorized-TDNN x-vector system. On BabyTrain corpus, we observe relative gains of 10.38% and 12.40% in minDCF and EER respectively.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Feature Enhancement With Deep Feature Losses For Speaker Verification

Saurabh Kataria, Phani Sankar Nidadavolu, JesÃºs Villalba, Nanxin Chen, Paola Garcia, Najim Dehak

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society