Authors
Yong Jae Lee, Joydeep Ghosh, Kristen Grauman
Publication date
2012/6/16
Conference
2012 IEEE conference on computer vision and pattern recognition
Pages
1346-1353
Publisher
IEEE
Description
We present a video summarization approach for egocentric or “wearable” camera data. Given hours of video, the proposed method produces a compact storyboard summary of the camera wearer's day. In contrast to traditional keyframe selection techniques, the resulting summary focuses on the most important objects and people with which the camera wearer interacts. To accomplish this, we develop region cues indicative of high-level saliency in egocentric video — such as the nearness to hands, gaze, and frequency of occurrence — and learn a regressor to predict the relative importance of any new region based on these cues. Using these predictions and a simple form of temporal event detection, our method selects frames for the storyboard that reflect the key object-driven happenings. Critically, the approach is neither camera-wearer-specific nor object-specific; that means the learned importance metric need …
Total citations
2012201320142015201620172018201920202021202220232024615479488107941057378645532
Scholar articles
YJ Lee, J Ghosh, K Grauman - 2012 IEEE conference on computer vision and pattern …, 2012