View article

[PDF] from cmu.edu

Spatio-temporal matching for human pose estimation in video

Authors

Feng Zhou, Fernando De la Torre

Publication date

2016/2/4

Journal

IEEE Transactions on Pattern Analysis and Machine Intelligence

Volume

Issue

Pages

1492-1504

Publisher

IEEE

Description

Detection and tracking humans in videos have been long-standing problems in computer vision. Most successful approaches (e.g., deformable parts models) heavily rely on discriminative models to build appearance detectors for body joints and generative models to constrain possible body configurations (e.g., trees). While these 2D models have been successfully applied to images (and with less success to videos), a major challenge is to generalize these models to cope with camera views. In order to achieve view-invariance, these 2D models typically require a large amount of training data across views that is difficult to gather and time-consuming to label. Unlike existing 2D models, this paper formulates the problem of human detection in videos as spatio-temporal matching (STM) between a 3D motion capture model and trajectories in videos. Our algorithm estimates the camera view and selects a subset of …

Total citations

Cited by 30

20162017201820192020202120224 4 6 2 6 3 5

Scholar articles

Spatio-temporal matching for human pose estimation in video

F Zhou, F De la Torre - IEEE Transactions on Pattern Analysis and Machine …, 2016