Tensorflow Audio Models In Essentia

Pablo Alonso-JimÃ©nez, Dmitry Bogdanov, Jordi Pons, Xavier Serra

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 10:37

04 May 2020

Essentia is a reference open-source C++/Python library for audio and music analysis. In this work, we present a set of algorithms that employ TensorFlow in Essentia, allow predictions with pre-trained deep learning models, and are designed to offer flexibility of use, easy extensibility, and real-time inference. To show the potential of this new interface with TensorFlow, we provide a number of pre-trained state-of-the-art music tagging and classification CNN models. We run an extensive evaluation of the developed models. In particular, we assess the generalization capabilities in a cross-collection evaluation utilizing both external tag datasets as well as manual annotations tailored to the taxonomies of our models.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Tensorflow Audio Models In Essentia

Pablo Alonso-JimÃ©nez, Dmitry Bogdanov, Jordi Pons, Xavier Serra

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society