ON THE IMPORTANCE OF DIFFERENT FREQUENCY BINS FOR SPEAKER VERIFICATION

Aiwen Deng, Wenxiong Kang, Feiqi Deng, Shuai Wang

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 00:06:44

11 May 2022

The majority of modern speaker verification systems take spectral analysis-based features as input, which contains multiple frequency bins. Naturally, there would be a question of whether all different frequency bins contribute equally to the speaker verification system performance? In this paper, we propose the frequency reweighting layer (FRL) to automatically learn and balance the importance of different frequency bins. This new layer can be freely inserted into the original speaker embedding learner once or multiple times at different layers, with an ignorable number of new parameters. Based on the proposed novel architecture, a set of experiments are designed and carried out on the VoxCeleb1 dataset, which not only achieves superior performance but also exhibits an interesting weight distribution -- the lower frequencies matter more.

Tags:

frequency bin

attention

speaker verification

ON THE IMPORTANCE OF DIFFERENT FREQUENCY BINS FOR SPEAKER VERIFICATION

Aiwen Deng, Wenxiong Kang, Feiqi Deng, Shuai Wang

Value-Added Bundle(s) Including this Product

ICASSP 2022, May 2022 Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

ATTEN-ADAPTER: A UNIFIED ATTENTION-BASED ADAPTER FOR EFFICIENT TUNING

Cross-Inferential Networks for Source-free Unsupervised Domain Adaptation

IMPROVEMENT OF IMAGE SEGMENTATION MODEL FOR HANDWRITTEN NOTEBOOK ANALYTICS

Join the IEEE Signal Processing Society