View article

Evaluating multimedia features and fusion for example-based event detection

Authors

Gregory K Myers, Cees GM Snoek, Ramakant Nevatia, Ramesh Nallapati, Julien van Hout, Stephanie Pancoast, Chen Sun, Amirhossein Habibian, Dennis C Koelma, Koen EA van de Sande, Arnold WM Smeulders

Publication date

2014

Journal

Fusion in Computer Vision: Understanding Complex Visual Content

Pages

109-133

Publisher

Springer International Publishing

Description

Multimedia event detection (MED) is a challenging problem because of the heterogeneous content and variable quality found in large collections of Internet videos. To study the value of multimedia features and fusion for representing and learning events from a set of example video clips, we created SESAME, a system for video SEarch with Speed and Accuracy for Multimedia Events. SESAME includes multiple bag-of-words event classifiers based on single data types: low-level visual, motion, and audio features; high-level semantic visual concepts; and automatic speech recognition (ASR). Event detection performance was evaluated for each event classifier. The performance of low-level visual and motion features was improved by the use of difference coding. The accuracy of the visual concepts was nearly as strong as that of the low-level visual features. Experiments with a number of fusion methods for …

Total citations

Cited by 45

201320142015201620172018201920202021202220232 11 9 8 1 7 1 3 1 1

Scholar articles

Evaluating multimedia features and fusion for example-based event detection

GK Myers, CGM Snoek, R Nevatia, R Nallapati… - Fusion in Computer Vision: Understanding Complex …, 2014