Skip to main content
  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00
    Length: 00:05:32
08 Jun 2021

This paper describes a three-stage acoustic echo cancellation (AEC) and suppression framework for the ICASSP 2021 AEC-Challenge. In the first stage, a partitioned block frequency domain adaptive filtering is implemented to cancel the linear echo components without introducing the near-end speech distortion, where we estimate and compensate the time delay between the far-end reference signal and the microphone signal beforehand. In the second stage, a deep complex U-Net integrated with gated recurrent unit is proposed to further suppress the residual echo components. Finally, an extremely tiny deep complex U-Net is trained to further suppress environmental noise in the last stage, which can also further increase the echo return loss enhancement (ERLR) without increasing the computational complexity dramatically. Experimental results show that the proposed three-stage framework can get the ERLE over 50 dB in both single-talk and double-talk scenarios, and perceptual evaluation of speech quality can be improved about 0.7 in double-talk scenarios. Subjective results show that the proposed framework outperforms the AEC-Challenge baseline ResRNN by 0.12 points in terms of the MOS.

Chairs:
Hannes Gamper

Value-Added Bundle(s) Including this Product

More Like This

  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00