Fast Lattice-Free Keyword Filtering For Accelerated Spoken Term Detection

Jonathan Wintrode, Jenny Wilkes

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 11:50

04 May 2020

We present a novel set of keyword detection techniques to accelerate spoken term detection for known queries with minimal loss in accuracy. Using only ASR frame-level acoustic posteriors we can train multiple models to effectively detect non-target segments for which we need not perform full lattice decoding. We estimate phone n-gram soft counts for each segment in a single pass over the frame-level output. From this we can efficiently detect a fixed set of keywords with both linear and DNN-based classifiers. Furthermore we can train the linear classifiers on a small number of labeled examples. Experiments on the PSC and VAST English subset of NIST's 2019 OpenSAT evaluation demonstrate we can filter out half of the test audio segments while only increasing the keyword miss rate by under 3%.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Fast Lattice-Free Keyword Filtering For Accelerated Spoken Term Detection

Jonathan Wintrode, Jenny Wilkes

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society