
Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting

Zhenyu Wang (UTD); Li Wan (Meta); Biqiao Zhang (Meta); Yiteng Huang (Meta); Shang-Wen Li (Meta); Ming Sun (Meta); Xin Lei (Meta); Zhaojun Yang (Meta)

06 Jun 2023

A keyword spotting (KWS) engine running continuously on device is exposed to a wide variety of speech signals that are usually unseen during training. Building a small-footprint, high-performing KWS model that remains robust across different acoustic environments is therefore challenging. In this paper, we explore how to apply adversarial examples effectively to improve KWS robustness. We propose datasource-aware disentangled learning with adversarial examples to reduce both the mismatch between original and adversarial data and the mismatch across original training datasources. The KWS model architecture is based on depth-wise separable convolution and a simple attention module. Experimental results demonstrate that the proposed learning strategy reduces the false reject rate by 40.31% at a 1% false accept rate on an internal dataset, compared with the strongest baseline trained without adversarial examples. Our best-performing system achieves 98.06% accuracy on the Google Speech Commands V1 dataset.
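To make the core idea concrete, the minimal sketch below shows one way an FGSM-style adversarial example can be generated from a batch of log-mel features and mixed into a KWS training step. It is an illustration under assumptions, not the authors' recipe: the model (TinyKWSNet), the epsilon and adv_weight values, and the feature shapes are hypothetical placeholders, and the paper's datasource-aware disentangled objective and attention module are not reproduced here.

```python
# Hypothetical sketch (PyTorch): FGSM-style adversarial examples in a KWS training step.
# TinyKWSNet, epsilon, and adv_weight are illustrative placeholders, not the paper's
# actual model or hyperparameters.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyKWSNet(nn.Module):
    """Small keyword classifier over log-mel features using depthwise-separable convs."""
    def __init__(self, n_mels=40, n_classes=12):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1),               # stem
            nn.ReLU(),
            nn.Conv2d(32, 32, kernel_size=3, padding=1, groups=32),   # depthwise
            nn.Conv2d(32, 64, kernel_size=1),                         # pointwise
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(64, n_classes)

    def forward(self, x):  # x: (batch, 1, time, n_mels)
        return self.fc(self.conv(x).flatten(1))

def fgsm_adversarial(model, feats, labels, epsilon=0.01):
    """FGSM perturbation: x_adv = x + epsilon * sign(grad_x loss)."""
    feats = feats.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(feats), labels)
    grad, = torch.autograd.grad(loss, feats)
    return (feats + epsilon * grad.sign()).detach()

def train_step(model, optimizer, feats, labels, adv_weight=0.5):
    """One step combining the clean loss with an adversarial-example loss."""
    adv_feats = fgsm_adversarial(model, feats, labels)
    optimizer.zero_grad()
    loss = (F.cross_entropy(model(feats), labels)
            + adv_weight * F.cross_entropy(model(adv_feats), labels))
    loss.backward()
    optimizer.step()
    return loss.item()

if __name__ == "__main__":
    model = TinyKWSNet()
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    feats = torch.randn(8, 1, 100, 40)        # dummy log-mel batch
    labels = torch.randint(0, 12, (8,))
    print(train_step(model, opt, feats, labels))
```

In the paper's setting, such adversarial examples are additionally treated as a distinct datasource and disentangled from the original training datasources, rather than simply added to the loss as in this simplified sketch.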
