Is Quality Enough? Integrating Energy Consumption in a Large-Scale Evaluation of Neural Audio Synthesis Models

Constance Douwes (IRCAM); Giovanni Bindi (IRCAM); Antoine CAILLON (IRCAM); Philippe Esling (IRCAM); Jean-Pierre Briot (CNRS)

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

07 Jun 2023

Deep learning models are now core components of modern audio synthesis, and their use has increased significantly in recent years, leading to highly accurate and successful solutions. However, the quest for quality comes at a tremendous computational cost, which incurs vast energy consumption and greenhouse gas emissions. At the heart of this problem are the measures we use as a scientific community to evaluate our work. In this paper, we suggest relying on a multi-objective metric based on Pareto optimality, which considers both the model's quality and energy consumption. By applying our measure to the current state-of-the-art in generative audio models, we show that it can drastically change the significance of the results. We hope to raise awareness of the need to simultaneously investigate energy-efficient models of high perceived quality, thus putting computational cost in the spotlight of deep learning research.

Tags:

Bounds on performance

Is Quality Enough? Integrating Energy Consumption in a Large-Scale Evaluation of Neural Audio Synthesis Models

Constance Douwes (IRCAM); Giovanni Bindi (IRCAM); Antoine CAILLON (IRCAM); Philippe Esling (IRCAM); Jean-Pierre Briot (CNRS)

Value-Added Bundle(s) Including this Product

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

Asymptotic Distribution of Stochastic Mirror Descent Iterates in Average Ensemble Models

On weighted cross-entropy for label-imbalanced separable data: An algorithmic-stability study

Communication-Constrained Exchange of Zeroth-Order Information with Application to Collaborative Target Tracking

Join the IEEE Signal Processing Society