CURRICULUM OPTIMIZATION FOR LOW-RESOURCE SPEECH RECOGNITION

Anastasia Kuznetsova, Francis Tyers, Anurag Kumar, Jennifer Drexler Fox

Length: 00:11:57

12 May 2022

Modern end-to-end speech recognition models show astonishing results in transcribing audio signals into written text. However, conventional data-feeding pipelines may be suboptimal for low-resource speech recognition, which remains a challenging task. We propose an automated curriculum learning approach that optimizes the order of training examples using both the model's progress during training and prior knowledge about the difficulty of the examples. We introduce a new difficulty measure, compression ratio, that can serve as a scoring function for raw audio in various noise conditions. The proposed method reduces Word Error Rate (WER) by up to 33% relative to the baseline system.
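The abstract names compression ratio as a difficulty score but does not define it. The sketch below is one possible reading, not the authors' implementation: it assumes the score is the raw size of a 16-bit PCM clip divided by its size after lossless compression, uses zlib as a stand-in compressor, and treats more compressible clips as easier. The function names, the synthetic clips, and the easy-to-hard ordering rule are all hypothetical and serve only to illustrate how such a score could rank training examples.

```python
import math
import random
import struct
import zlib
from typing import Dict, List


def compression_ratio(pcm_bytes: bytes) -> float:
    """Raw size / zlib-compressed size (assumed definition, not from the paper)."""
    if not pcm_bytes:
        raise ValueError("empty audio buffer")
    return len(pcm_bytes) / len(zlib.compress(pcm_bytes, 9))


def order_easy_to_hard(clips: Dict[str, bytes]) -> List[str]:
    """Sort clip names so the most compressible clips come first.

    Assumption: a higher compression ratio means a simpler, cleaner signal.
    """
    return sorted(clips, key=lambda name: compression_ratio(clips[name]), reverse=True)


def demo_clips(n_samples: int = 16000, rate: int = 16000) -> Dict[str, bytes]:
    """Build three synthetic 16-bit mono clips: silence, a 400 Hz tone, white noise."""
    fmt = f"<{n_samples}h"
    tone = [int(12000 * math.sin(2 * math.pi * 400 * t / rate)) for t in range(n_samples)]
    rng = random.Random(0)  # fixed seed so the demo is repeatable
    noise = [rng.randint(-32768, 32767) for _ in range(n_samples)]
    return {
        "silence": bytes(2 * n_samples),
        "tone": struct.pack(fmt, *tone),
        "noise": struct.pack(fmt, *noise),
    }


if __name__ == "__main__":
    clips = demo_clips()
    for name in order_easy_to_hard(clips):
        print(f"{name}: {compression_ratio(clips[name]):.2f}")
```

In this toy setting the white-noise clip is nearly incompressible, so it lands at the hard end of the ordering. Real recordings would need a compressor and a threshold chosen for the data at hand.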

Tags: low-resource languages, speech recognition, curriculum learning

Value-Added Bundle(s) Including this Product

ICASSP 2022, May 2022 Virtual and In-Person Conference - Presentation Videos Product Bundle
