View article

[PDF] from microsoft.com

Efficient regression of general-activity human poses from depth images

Authors

Ross Girshick, Jamie Shotton, Pushmeet Kohli, Antonio Criminisi, Andrew Fitzgibbon

Publication date

2011/11/6

Conference

2011 International Conference on Computer Vision

Pages

415-422

Publisher

IEEE

Description

We present a new approach to general-activity human pose estimation from depth images, building on Hough forests. We extend existing techniques in several ways: real time prediction of multiple 3D joints, explicit learning of voting weights, vote compression to allow larger training sets, and a comparison of several decision-tree training objectives. Key aspects of our work include: regression directly from the raw depth image, without the use of an arbitrary intermediate representation; applicability to general motions (not constrained to particular activities) and the ability to localize occluded as well as visible body joints. Experimental results demonstrate that our method produces state of the art results on several data sets including the challenging MSRC-5000 pose estimation test set, at a speed of about 200 frames per second. Results on silhouettes suggest broader applicability to other imaging modalities.

Total citations

Cited by 523

201120122013201420152016201720182019202020212022202320242 51 72 75 61 71 43 35 39 25 15 8 7 3

Scholar articles

Efficient regression of general-activity human poses from depth images

R Girshick, J Shotton, P Kohli, A Criminisi, A Fitzgibbon - 2011 International Conference on Computer Vision, 2011