View article

OpenML benchmarking suites and the OpenML100

Authors

Bernd Bischl, Giuseppe Casalicchio, Matthias Feurer, Frank Hutter, Michel Lang, Rafael G Mantovani, Jan N Van Rijn, Joaquin Vanschoren

Publication date

2017/8

Journal

stat

Volume

1050

Pages

Description

We advocate the use of curated, comprehensive benchmark suites of machine learning datasets, backed by standardized OpenML-based interfaces and complementary software toolkits written in Python, Java and R. Major distinguishing features of OpenML benchmark suites are (a) ease of use through standardized data formats, APIs, and existing client libraries;(b) machine-readable meta-information regarding the contents of the suite; and (c) online sharing of results, enabling large scale comparisons. As a first such suite, we propose the OpenML100, a machine learning benchmark suite of 100 classification datasets carefully curated from the thousands of datasets available on OpenML. org.

Total citations

Cited by 76

201720182019202020212022202320241 8 12 14 16 7 12 4

Scholar articles

OpenML benchmarking suites and the OpenML100

B Bischl, G Casalicchio, M Feurer, F Hutter, M Lang… - stat, 2017

Cited by 76 Related articles