Authors
Karttikeya Mangalam, Vinay Uday Prabhu
Publication date
2019/5/17
Conference
Understanding Deep Phenomena, International Conference on Machine Learning
Description
In this paper, we empirically investigate the training journey of deep neural networks (DNNs) relative to fully trained shallow machine learning models. We observe that DNNs learn to correctly classify shallow-learnable examples in the early epochs before learning the harder examples. We build on this observation to suggest a way of partitioning the dataset into hard and easy subsets that can be used to improve the overall training process. Incidentally, we also find evidence of a subset of intriguing examples, across all the datasets we considered, that were shallow-learnable but not deep-learnable. To aid reproducibility, we release our code for this work at https://github.com/karttikeya/Shallow_to_Deep/
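The easy/hard partition described above can be illustrated with a minimal sketch. It is not the authors' released code: the nearest-centroid classifier stands in for "a fully trained shallow model" purely as an assumption, the function names (`fit_centroids`, `predict`, `partition_by_shallow_model`) are hypothetical, and scoring the shallow model on the same examples it was fit on is a simplification. An example is marked easy when the shallow model classifies it correctly and hard otherwise.

```python
from collections import defaultdict


def fit_centroids(X, y):
    """Fit a nearest-centroid classifier (assumed stand-in for a shallow model)."""
    sums = {}
    counts = defaultdict(int)
    for x, label in zip(X, y):
        if label not in sums:
            sums[label] = [0.0] * len(x)
        # Accumulate the per-class feature sums.
        sums[label] = [s + v for s, v in zip(sums[label], x)]
        counts[label] += 1
    # One mean vector per class.
    return {c: [s / counts[c] for s in sums[c]] for c in sums}


def predict(centroids, x):
    """Return the class whose centroid is closest to x (squared Euclidean)."""
    def sqdist(c):
        return sum((a - b) ** 2 for a, b in zip(centroids[c], x))
    return min(centroids, key=sqdist)


def partition_by_shallow_model(X, y):
    """Split example indices into (easy, hard) lists.

    easy: indices the shallow model classifies correctly (shallow-learnable).
    hard: indices it misclassifies.
    """
    centroids = fit_centroids(X, y)
    easy, hard = [], []
    for i, (x, label) in enumerate(zip(X, y)):
        (easy if predict(centroids, x) == label else hard).append(i)
    return easy, hard
```

Calling `partition_by_shallow_model(X, y)` on a list of feature vectors and a list of labels returns two index lists, which could then be used to order or weight examples when training a deep network. How the paper itself uses the partition is not specified in this record.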
Total citations
[Citations-per-year bar chart, 2019 to 2024]
Scholar articles
K Mangalam, VU Prabhu - ICML 2019 Workshop on Identifying and …, 2019