04 May 2020

This paper describes an end-to-end voice conversion system built on three main ideas: the Transformer architecture, context preservation mechanisms, and model adaptation. Self-attention in the Transformer directly connects all positions in a sequence, making it easier to learn long-range dependencies and improving training efficiency. Context preservation mechanisms accelerate and stabilize training. Adaptation techniques make it possible to train the conversion mapping with limited data. Results show that the proposed method obtains a higher MOS than an LSTM-based baseline system while training 2.72 times faster.
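To illustrate why self-attention connects all positions directly, here is a minimal sketch of scaled dot-product self-attention. This is not the authors' implementation; the function name, dimensions, and random inputs are illustrative assumptions only.

```python
import numpy as np

def scaled_dot_product_self_attention(x, w_q, w_k, w_v):
    """Minimal self-attention: every position attends to every other
    position in one step, so long-range dependencies need no recurrence."""
    q = x @ w_q  # queries, shape (seq_len, d_k)
    k = x @ w_k  # keys,    shape (seq_len, d_k)
    v = x @ w_v  # values,  shape (seq_len, d_v)
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)  # (seq_len, seq_len): all position pairs
    # Softmax over key positions, computed in a numerically stable way.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v  # each output mixes information from every position

# Hypothetical dimensions for illustration only.
rng = np.random.default_rng(0)
seq_len, d_model, d_k = 6, 8, 4
x = rng.normal(size=(seq_len, d_model))  # e.g., a sequence of acoustic frames
out = scaled_dot_product_self_attention(
    x,
    rng.normal(size=(d_model, d_k)),
    rng.normal(size=(d_model, d_k)),
    rng.normal(size=(d_model, d_k)),
)
print(out.shape)  # (6, 4)
```

In contrast, an LSTM must propagate information step by step across the sequence, which is one reason the paper reports faster training with the Transformer.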
