Authors
Mikhail Belkin, Daniel Hsu, Ji Xu
Publication date
2020
Journal
SIAM Journal on Mathematics of Data Science
Volume
2
Issue
4
Pages
1167-1180
Publisher
Society for Industrial and Applied Mathematics
Description
The “double descent” risk curve was proposed to qualitatively describe the out-of-sample prediction accuracy of variably parameterized machine learning models. This article provides a precise mathematical analysis for the shape of this curve in two simple data models with the least squares/least norm predictor. Specifically, it is shown that the risk peaks when the number of features is close to the sample size but also that the risk sometimes decreases toward its minimum as increases beyond . This behavior parallels some key patterns observed in large models, including modern neural networks, and is contrasted with that of “prescient” models that select features in an a priori optimal order.
Total citations
2019202020212022202320241966841059659
Scholar articles
M Belkin, D Hsu, J Xu - SIAM Journal on Mathematics of Data Science, 2020