Evaluating Prediction Uncertainty Estimates from BatchEnsemble

Morten Blørstad
Herman Jangsett Mostein
Nello Blaser
Pekka Parviainen
Main: 8 pages, Bibliography: 3 pages, Appendix: 6 pages, 19 figures, 4 tables

Abstract

Deep learning models struggle with uncertainty estimation. Many approaches are either computationally infeasible or underestimate uncertainty. We investigate BatchEnsemble as a general and scalable method for uncertainty estimation across both tabular and time series tasks. To extend BatchEnsemble to sequential modeling, we introduce GRUBE, a novel BatchEnsemble GRU cell. We compare BatchEnsemble to Monte Carlo dropout and deep ensemble models. Our results show that BatchEnsemble matches the uncertainty estimation performance of deep ensembles and clearly outperforms Monte Carlo dropout. GRUBE achieves similar or better performance in both prediction and uncertainty estimation. These findings show that BatchEnsemble and GRUBE achieve similar performance with fewer parameters and reduced training and inference time compared to traditional ensembles.
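
For readers unfamiliar with the technique, the following is a minimal sketch of a BatchEnsemble-style dense layer. It is not the authors' implementation and is not GRUBE. It assumes the standard BatchEnsemble formulation, in which all ensemble members share one weight matrix W and member i owns only two vectors, r_i and s_i, so that its effective weights are W scaled elementwise by the outer product of r_i and s_i. The function names, the Gaussian initialisation, and the use of the standard deviation across members as the uncertainty estimate are illustrative assumptions.

```python
import random
import statistics


def make_batchensemble_layer(n_in, n_out, n_members, seed=0):
    """Create a shared weight matrix W and per-member rank-1 factors r_i, s_i.

    Hypothetical helper: the initialisation scheme is an assumption.
    """
    rng = random.Random(seed)
    W = [[rng.gauss(0.0, 1.0) for _ in range(n_out)] for _ in range(n_in)]
    # Per-member vectors drawn around 1 so that members differ from each other.
    r = [[rng.gauss(1.0, 0.5) for _ in range(n_in)] for _ in range(n_members)]
    s = [[rng.gauss(1.0, 0.5) for _ in range(n_out)] for _ in range(n_members)]
    return W, r, s


def forward_member(x, W, r_i, s_i):
    """Compute ((x * r_i) @ W) * s_i for one member.

    This equals x @ (W * outer(r_i, s_i)) without building a per-member matrix.
    """
    xr = [xj * rj for xj, rj in zip(x, r_i)]
    return [
        s_i[k] * sum(xr[j] * W[j][k] for j in range(len(xr)))
        for k in range(len(s_i))
    ]


def predict_with_uncertainty(x, W, r, s):
    """Return the per-output mean and standard deviation across members."""
    outputs = [forward_member(x, W, r_i, s_i) for r_i, s_i in zip(r, s)]
    means = [statistics.fmean(col) for col in zip(*outputs)]
    stds = [statistics.pstdev(col) for col in zip(*outputs)]
    return means, stds


if __name__ == "__main__":
    W, r, s = make_batchensemble_layer(n_in=3, n_out=2, n_members=4)
    means, stds = predict_with_uncertainty([0.5, -1.0, 2.0], W, r, s)
    print("mean prediction:", means)
    print("spread across members:", stds)
```

In this sketch a layer with M members stores n_in * n_out + M * (n_in + n_out) weights, whereas M independent copies of the layer would store M * n_in * n_out, which is the kind of parameter saving the abstract refers to.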
