OOV Recovery with Efficient 2nd Pass Decoding and Open-Vocabulary Word-Level RNNLM Rescoring for Hybrid ASR

Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur

Length: 14:44

04 May 2020

In this paper, we investigate out-of-vocabulary (OOV) word recovery in word-based hybrid automatic speech recognition (ASR) systems, with emphasis on dynamic vocabulary expansion for both Weighted Finite State Transducer (WFST)-based decoding and word-level recurrent neural network language model (RNNLM) rescoring. We first describe our OOV candidate generation method, which is based on a hybrid lexical model (HLM) with phoneme-sequence constraints. Next, we introduce a framework for efficient second-pass OOV recovery with a dynamically expanded vocabulary, and show that calibrating the language model (LM) scores of OOV candidates significantly improves OOV recovery and overall decoding performance compared to HLM-based first-pass decoding. Finally, we propose an open-vocabulary word-level RNNLM rescoring framework that makes it possible to rescore ASR hypotheses containing recovered OOVs using a single word-level RNNLM that had no knowledge of those OOVs when it was trained. By evaluating OOV recovery and overall decoding performance on Spanish and English ASR tasks, we show that the proposed OOV recovery pipeline has the potential to serve as an efficient open-vocabulary word-based ASR decoding framework, with minimal extra computation compared to a standard WFST-based decoding and RNNLM rescoring pipeline.
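To make the idea of rescoring hypotheses that contain recovered OOVs more concrete, here is a minimal sketch in Python. It is not the authors' implementation: the abstract does not say how OOVs are scored by the RNNLM or how their LM scores are calibrated. The sketch assumes one generic approach, in which each OOV word is mapped to an `<unk>` token for LM scoring and a fixed log-domain penalty stands in for calibration. All names (`rescore_nbest`, `uniform_lm`, `oov_penalty`, `lm_weight`) are hypothetical, and a toy uniform LM stands in for a real word-level RNNLM.

```python
import math
from typing import Callable, List, Sequence, Set, Tuple

UNK = "<unk>"  # assumed placeholder token for words outside the LM vocabulary

# An LM here is any callable returning log P(word | history).
LogProbFn = Callable[[Tuple[str, ...], str], float]


def uniform_lm(vocab: Set[str]) -> LogProbFn:
    """Toy stand-in for a word-level RNNLM: uniform over vocab plus <unk>."""
    logp = -math.log(len(vocab) + 1)
    return lambda history, word: logp


def rescore_nbest(
    nbest: Sequence[Tuple[List[str], float]],
    lm_logprob: LogProbFn,
    vocab: Set[str],
    lm_weight: float = 0.5,
    oov_penalty: float = -2.0,
) -> List[Tuple[List[str], float]]:
    """Rescore (words, first_pass_score) pairs; return them best-first.

    OOV words are scored as <unk> plus a fixed penalty (an assumed, simple
    form of score calibration), but the original words are kept in the output.
    """
    rescored = []
    for words, first_pass in nbest:
        mapped = [w if w in vocab else UNK for w in words]
        lm_score = 0.0
        for i, w in enumerate(mapped):
            lm_score += lm_logprob(tuple(mapped[:i]), w)
            if w == UNK:
                lm_score += oov_penalty
        total = (1.0 - lm_weight) * first_pass + lm_weight * lm_score
        rescored.append((words, total))
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)


vocab = {"call", "john"}
nbest = [(["call", "zhang"], -4.0), (["call", "john"], -4.5)]
ranked = rescore_nbest(nbest, uniform_lm(vocab), vocab)
```

In this toy setup the penalty decides whether the hypothesis containing the recovered OOV "zhang" wins, which is why the choice of calibration matters; in practice the penalty and interpolation weight would be tuned on held-out data.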

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020
