View article

[PDF] from arxiv.org

The ImageNet Shuffle: Reorganized Pre-training for Video Event Detection

Authors

Pascal Mettes, Dennis C Koelma, Cees G M Snoek

Publication date

2016/2/23

Journal

ICMR

Description

This paper strives for video event detection using a representation learned from deep convolutional neural networks. Different from the leading approaches, who all learn from the 1,000 classes defined in the ImageNet Large Scale Visual Recognition Challenge, we investigate how to leverage the complete ImageNet hierarchy for pre-training deep networks. To deal with the problems of over-specific classes and classes with few images, we introduce a bottom-up and top-down approach for reorganization of the ImageNet hierarchy based on all its 21,814 classes and more than 14 million images. Experiments on the TRECVID Multimedia Event Detection 2013 and 2015 datasets show that video representations derived from the layers of a deep neural network pre-trained with our reorganized hierarchy i) improves over standard pre-training, ii) is complementary among different reorganizations, iii) maintains the …

Total citations

Cited by 140

20152016201720182019202020212022202320241 15 30 22 15 18 15 13 3 3

Scholar articles

The imagenet shuffle: Reorganized pre-training for video event detection

P Mettes, DC Koelma, CGM Snoek - Proceedings of the 2016 ACM on International …, 2016

DC Koelma, and CGM Snoek*

P Mettes - The ImageNet Shuffle: Reorganized Pre-training for …, 2016

Cited by 2 Related articles