Authors
Laurens van der Maaten, Minmin Chen, Stephen Tyree, Kilian Weinberger
Publication date
2013/2/13
Conference
International Conference on Machine Learning
Pages
410-418
Publisher
PMLR
Description
The goal of machine learning is to develop predictors that generalize well to test data. Ideally, this is achieved by training on very large (infinite) training data sets that capture all variations in the data distribution. In the case of finite training data, an effective solution is to extend the training set with artificially created examples, which, however, is also computationally costly. We propose to corrupt training examples with noise from known distributions within the exponential family and present a novel learning algorithm, called marginalized corrupted features (MCF), that trains robust predictors by minimizing the expected value of the loss function under the corrupting distribution, essentially learning with infinitely many (corrupted) training examples. We show empirically on a variety of data sets that MCF classifiers can be trained efficiently, may generalize substantially better to test data, and are more robust to feature deletion at test time.
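The sketch below illustrates the marginalization idea described in the abstract for one concrete setting: a quadratic loss with blankout (dropout) corruption, where the expected loss under the corrupting distribution has a closed form in the first and second moments of the corrupted features. Function and variable names, the corruption rate `q`, and the small ridge term `reg` are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def mcf_quadratic_blankout(X, y, q=0.5, reg=1e-6):
    """Minimal sketch of MCF with quadratic loss and blankout corruption.

    Rather than sampling corrupted copies of each training example, the
    expected squared loss under the corrupting distribution is minimized
    in closed form using the moments of the corrupted features.
    """
    n, d = X.shape
    # First moment of a blankout-corrupted example: E[x_tilde] = (1 - q) * x
    EX = (1.0 - q) * X
    # Second moment summed over examples: off-diagonal entries scale by
    # (1 - q)^2, diagonal entries by (1 - q).
    S = (1.0 - q) ** 2 * (X.T @ X)
    S[np.diag_indices(d)] += q * (1.0 - q) * np.sum(X * X, axis=0)
    # Closed-form minimizer of the expected squared loss (with a small
    # ridge term for numerical stability).
    return np.linalg.solve(S + reg * np.eye(d), EX.T @ y)

# Toy usage on synthetic data.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 20))
w_true = rng.normal(size=20)
y = X @ w_true + 0.1 * rng.normal(size=100)
w = mcf_quadratic_blankout(X, y, q=0.3)
print("train MSE:", np.mean((X @ w - y) ** 2))
```

Because the corruption is marginalized analytically, no corrupted copies are ever materialized; other losses and exponential-family corruption models require their own moment computations or bounds.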
Total citations
Per-year citation chart, 2013–2024 (exact counts not recoverable)
Scholar articles
L van der Maaten, M Chen, S Tyree, K Weinberger - International Conference on Machine Learning, 2013