Skip to main content

Optimal Transport with a Diversified Memory Bank for Cross-Domain Speaker Verification

Ruiteng Zhang (Tianjin University); Jianguo Wei (School of Computer Software, Tianjin University, Tianjin, China); Xugang Lu (NICT); Wenhuan Lu (Tianjin University); Di Jin (Tianjin University); Lin Zhang (National Institute of Informatics); Junnhai Xu (Tianjin Key Laboratory of Cognitive Computing and Application, College of Intelligence and Computing, Tianjin University)

  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00
06 Jun 2023

Optimal transport (OT) can be applied to cross-domain adaptation in speaker verification (SV) by converting speakers’ probability distributions from source to target domains. However, in scenarios involving over-massive categories (speakers) or difficult samples in discrimination, OT often has difficulty computing effective transports. To address this challenge, we propose an OT-based unsupervised domain adaptation (UDA) framework for SV, OT with a diversified memory bank, called DMB-OT, which ensures the accuracy of transfers by two strategies: (1) It regularizes the solution space of OT, which attempts to plan transformations between audio samples from the same speaker with high confidence; (2) it integrates a dynamic curriculum learning algorithm, preventing OT from calculating transport couplings based on hard-discriminative samples in the early stage of UDA. Experiments under different target domains showed that our unsupervised DMB-OT could significantly improve the performance of OT-based UDA and could even match the performance of the supervised PLDA-based adaptation.

More Like This

  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00