View article

[PDF] from siam.org

Two models of double descent for weak features

Authors

Mikhail Belkin, Daniel Hsu, Ji Xu

Publication date

2020

Journal

SIAM Journal on Mathematics of Data Science

Volume

Issue

Pages

1167-1180

Publisher

Society for Industrial and Applied Mathematics

Description

The “double descent” risk curve was proposed to qualitatively describe the out-of-sample prediction accuracy of variably parameterized machine learning models. This article provides a precise mathematical analysis for the shape of this curve in two simple data models with the least squares/least norm predictor. Specifically, it is shown that the risk peaks when the number of features is close to the sample size but also that the risk sometimes decreases toward its minimum as increases beyond . This behavior parallels some key patterns observed in large models, including modern neural networks, and is contrasted with that of “prescient” models that select features in an a priori optimal order.

Total citations

Cited by 432

20192020202120222023202419 66 84 105 96 59

Scholar articles

Two models of double descent for weak features

M Belkin, D Hsu, J Xu - SIAM Journal on Mathematics of Data Science, 2020