04 May 2020

Transformer neural networks (TNNs) have demonstrated state-of-the-art performance on many natural language processing (NLP) tasks, replacing recurrent neural networks (RNNs) such as LSTMs and GRUs. However, TNNs have not performed well in speech enhancement, whose contextual nature differs from that of NLP tasks such as machine translation. Self-attention is a core building block of the Transformer: it not only enables parallel computation over the sequence, but also provides a constant path length between symbols, which is essential for learning long-range dependencies. In this paper, we propose a Transformer with Gaussian-weighted self-attention (T-GSA), whose attention weights are attenuated according to the distance between the target and context symbols. Experimental results show that the proposed T-GSA significantly improves speech-enhancement performance compared to the Transformer and RNNs.
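The core idea can be viewed as a small modification of standard scaled dot-product attention: the attention weights are attenuated by a Gaussian of the distance between query and key positions, so nearby context contributes more than distant context. Below is a minimal single-head sketch in PyTorch; the function name, the fixed sigma, and the choice to apply the Gaussian weighting after the softmax (with renormalization) are illustrative assumptions, not the exact formulation used in the paper.

    import math
    import torch
    import torch.nn.functional as F

    def gaussian_weighted_self_attention(x, w_q, w_k, w_v, sigma=10.0):
        """Single-head self-attention whose weights are attenuated by a
        Gaussian of the distance |i - j| between query and key positions.

        x: (batch, seq_len, d_model); w_q/w_k/w_v: (d_model, d_k) projections.
        sigma controls how quickly attention decays with distance (fixed here;
        it could also be a learned, per-head or per-layer parameter).
        """
        q, k, v = x @ w_q, x @ w_k, x @ w_v
        d_k = q.size(-1)
        scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)   # (batch, T, T)

        # Gaussian weighting matrix G[i, j] = exp(-(i - j)^2 / sigma^2)
        t = torch.arange(x.size(1), device=x.device, dtype=x.dtype)
        dist = t.unsqueeze(0) - t.unsqueeze(1)
        gauss = torch.exp(-(dist ** 2) / (sigma ** 2))       # (T, T)

        # Attenuate the attention weights by distance, then renormalize.
        weights = F.softmax(scores, dim=-1) * gauss
        weights = weights / weights.sum(dim=-1, keepdim=True)
        return weights @ v

In a full T-GSA encoder, a weighting of this kind would replace the plain scaled dot-product attention inside each multi-head self-attention layer, which is what lets the model emphasize local acoustic context while retaining the Transformer's parallelism.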
