TransPlayer: Timbre Style Transfer with Flexible Timbre Control
Yuxuan Wu (Carnegie Mellon University); Yifan He (Carnegie Mellon University); Xinlu Liu (Carnegie Mellon University); Yi Wang (Carnegie Mellon University); Roger B. Dannenberg (School of Computer Science, Carnegie Mellon University)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
Music timbre style transfer aims at replacing the instrument timbre in a solo recording with another instrument, while preserving the musical content. Existing GAN-based methods can only achieve timbre style transfer between two given timbres. Inspired by the practice in voice conversion, we propose TransPlayer, which uses an autoencoder model with one-hot representations of instruments as the condition, and a Diffwave model trained especially for music synthesis. We evaluate our model in both the one-to-one transfer task and the many-to-many transfer task. The results prove that our method is able to provide one-to-one style transfer outputs comparable with the existing GAN-based method, and can transfer among multiple timbres with only one single model.