COMPLEX-VALUED SPATIAL AUTOENCODERS FOR MULTICHANNEL SPEECH ENHANCEMENT

Mhd Modar Halimeh, Walter Kellermann

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 00:11:02

09 May 2022

In this contribution, we present a novel online approach to multichannel speech enhancement. The proposed method estimates the enhanced signal through a filter-and-sum framework. More specifically, complex-valued masks are estimated by a deep complex-valued neural network, termed the complex-valued spatial autoencoder. The proposed network is capable of manipulating both the phase and the amplitude of the microphone signals and hence, the network is able to exploit both spatial and spectral characteristics of the desired source signal resulting in a physically plausible spatial selectivity and superior speech quality.

Tags:

complex-valued networks

deep learning

speech enhancement

multichannel signal processing

Value-Added Bundle(s) Including this Product

22 May 2022

ICASSP 2022, May 2022 Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

30 Jan 2025

Short Course Bundle: ICASSP 2022 COURSE 5: Speech Technology for Health: From Technical Foundations to Applications (Parts 1-3)

SPS

Members: $65.00
IEEE Members: $85.00
Non-members: $100.00

11 Jul 2024

Invertible Neural Networks and their Applications

1.00 pdh

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

11 Jul 2024

Slides: Invertible Neural Networks and their Applications

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00