Performance Study Of A Convolutional Time-Domain Audio Separation Network For Real-Time Speech Denoising

Christian Schüldt, Samuel Sonning, Hakan Erdogan, Scott Wisdom

Length: 13:56

04 May 2020

Time-domain audio separation networks based on dilated temporal convolutions have recently been shown to perform very well in speech separation tasks compared to methods based on a time-frequency representation, even outperforming an oracle binary time-frequency mask of the speakers. This paper investigates the performance of such a time-domain network (Conv-TasNet) for speech denoising in a real-time setting, comparing various parameter settings. Most importantly, different amounts of lookahead are evaluated and compared to the baseline of a fully causal model. We show that a large part of the increase in performance between a causal and a non-causal model is achieved with a lookahead of only 20 milliseconds, demonstrating the usefulness of even small lookaheads for many real-time applications.
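To make the notion of lookahead concrete, the following is a minimal sketch and not the authors' implementation. It shows a single 1-D convolution whose output at time t may use a chosen number of future samples, with zero lookahead giving a fully causal filter. The function names, the toy kernel, and the 16 kHz sample rate are assumptions made for illustration; none of them come from the abstract.

```python
import numpy as np

SAMPLE_RATE_HZ = 16_000  # assumed for illustration; not stated in the abstract


def ms_to_samples(ms, sample_rate_hz=SAMPLE_RATE_HZ):
    """Convert a duration in milliseconds to a whole number of samples."""
    return int(round(ms * sample_rate_hz / 1000))


def conv1d_with_lookahead(x, kernel, lookahead):
    """Filter x so that output[t] depends only on
    x[t - (K - 1 - lookahead)] ... x[t + lookahead], where K = len(kernel).

    lookahead = 0 gives a fully causal filter (hypothetical helper).
    """
    k = len(kernel)
    if not 0 <= lookahead <= k - 1:
        raise ValueError("lookahead must be between 0 and len(kernel) - 1")
    past = k - 1 - lookahead
    # Asymmetric zero padding decides how much past vs. future context is used.
    padded = np.concatenate([np.zeros(past), x, np.zeros(lookahead)])
    # 'valid' correlation returns exactly len(x) outputs.
    return np.correlate(padded, kernel, mode="valid")


# A unit impulse at sample 5 shows which output samples "see" the input.
x = np.zeros(10)
x[5] = 1.0
kernel = np.ones(3)

causal = conv1d_with_lookahead(x, kernel, lookahead=0)
one_ahead = conv1d_with_lookahead(x, kernel, lookahead=1)

print(np.flatnonzero(causal))     # outputs affected by the impulse, causal case
print(np.flatnonzero(one_ahead))  # with one sample of lookahead
print(ms_to_samples(20))          # 20 ms expressed in samples at the assumed rate
```

In the causal case no output before sample 5 reacts to the impulse, while one sample of lookahead lets output 4 respond to it. In a real-time system that lookahead translates directly into added latency, which is why the 20 millisecond figure in the abstract matters.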

Tags: sps conference, icassp 2020 virtual conference, May 2020, icassp 2020

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle
