A general framework for ensemble distribution distillation

Jakob Lindqvist,Amanda E. C. Olmin,Fredrik Lindsten,Lennart Svensson

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 13:39

21 Sep 2020

Ensembles of neural networks have shown to give better predictive performance and more reliable uncertainty estimates than individual networks. Additionally, ensembles allow the uncertainty to be decomposed into aleatoric (data) and epistemic (model) components, giving a more complete picture of the predictive uncertainty. Ensemble distillation is the process of compressing an ensemble into a single model, often resulting in a leaner model that still outperforms the individual ensemble members. Unfortunately, standard distillation erases the natural uncertainty decomposition of the ensemble. We present a general framework for distilling both regression and classification ensembles in a way that preserves the decomposition. We demonstrate the desired behaviour of our framework and show that its predictive performance is on par with standard distillation.

Tags:

sps conference

mlsp 2020

virtual workshop

mlsp 2020 workshop

September 2020