Authors
Sven Bambach, David J Crandall, Linda B Smith, Chen Yu
Publication date
2017/9/18
Conference
2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)
Pages
290-295
Publisher
IEEE
Description
Toddlers quickly learn to recognize thousands of everyday objects despite the seemingly suboptimal training conditions of a visually cluttered world. One reason for this success may be that toddlers do not just passively perceive visual information, but actively explore and manipulate objects around them. The work in this paper is based on the idea that active viewing and exploration creates "clean" egocentric scenes that serve as high-quality training data for the visual system. We tested this idea by collecting first-person video data of free toy play between toddler-parent pairs. We use the raw frames from this data, weakly annotated with toy object labels, to train state-of-the-art machine learning models for object recognition (Convolutional Neural Networks, or CNNs). We run several training simulations, varying quantity and quality of the training data. Our results show that scenes captured by parents and toddlers …
Total citations
2017201820192020202120222023202421112332
Scholar articles
S Bambach, DJ Crandall, LB Smith, C Yu - 2017 Joint IEEE International Conference on …, 2017