Skip to main content
  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00
06 Jun 2023

Monaural speech enhancement has been widely studied using real networks. However, the input and the target are naturally complex-valued in the TF domain, a fully complex network is highly desirable for effectively modelling the sequence in the complex domain. Moreover, phase has been proved learnable together with magnitude using complex masking or complex spectral mapping. Many recent studies focus only one of them, ignoring their performance boundaries. To address above issues, we propose a fully complex dual-path dual-decoder conformer network (D2Former). In D2Former, we form a dual-path complex TF self-attention architecture for effectively modelling the complex-valued TF sequence and boost the encoder and the decoders using a dual-path learning structure. In addition, we improve the performance boundaries of individual target by a joint-learning framework.

More Like This

  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00
  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00