View article

[HTML] from springer.com

Statistical comparison of classifiers through Bayesian hierarchical modelling

Authors

Giorgio Corani, Alessio Benavoli, Janez Demšar, Francesca Mangili, Marco Zaffalon

Publication date

2017/11

Journal

Machine Learning

Volume

106

Pages

1817-1837

Publisher

Springer US

Description

Usually one compares the accuracy of two competing classifiers using null hypothesis significance tests. Yet such tests suffer from important shortcomings, which can be overcome by switching to Bayesian hypothesis testing. We propose a Bayesian hierarchical model that jointly analyzes the cross-validation results obtained by two classifiers on multiple data sets. The model estimates more accurately the difference between classifiers on the individual data sets than the traditional approach of averaging, independently on each data set, the cross-validation results. It does so by jointly analyzing the results obtained on all data sets, and applying shrinkage to the estimates. The model eventually returns the posterior probability of the accuracies of the two classifiers being practically equivalent or significantly different.

Total citations

Cited by 59

2016201720182019202020212022202320241 3 7 8 4 11 11 10 4

Scholar articles

Statistical comparison of classifiers through Bayesian hierarchical modelling

G Corani, A Benavoli, J Demšar, F Mangili, M Zaffalon - Machine Learning, 2017