Adaptive Multi-Corpora Language Model Training for Speech Recognition

Yingyi Ma (Meta); Zhe Liu (Meta); Xuedong Zhang (Meta)

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

06 Jun 2023

Neural network language model (NNLM) plays an essential role in automatic speech recognition (ASR) systems, especially in adaptation tasks when text-only data is available. In practice, an NNLM is typically trained on a combination of data sampled from multiple corpora. Thus, the data sampling strategy is important to the adaptation performance. Most existing works focus on designing static sampling strategies. However, each corpus may show varying impacts at different NNLM training stages. In this paper, we introduce a novel adaptive multi-corpora training algorithm that dynamically learns and adjusts the sampling probability of each corpus along the training process. The algorithm is robust to corpora sizes and domain relevance. Compared with static sampling strategy baselines, the proposed approach yields remarkable improvement by achieving up to relative 7% and 9% word error rate (WER) reductions on in-domain and out-of-domain adaptation tasks, respectively.

Tags:

Machine learning methods for language

Adaptive Multi-Corpora Language Model Training for Speech Recognition

Yingyi Ma (Meta); Zhe Liu (Meta); Xuedong Zhang (Meta)

Value-Added Bundle(s) Including this Product

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

A Sentiment and Syntactic-Aware Graph Convolutional Network for Aspect-level Sentiment Classification

SELF SUPERVISED BERT FOR LEGAL TEXT CLASSIFICATION

Estimating Shapley Values of Training Utterances for Automatic Speech Recognition Models

Join the IEEE Signal Processing Society