Authors
Ross Girshick, Jamie Shotton, Pushmeet Kohli, Antonio Criminisi, Andrew Fitzgibbon
Publication date
2011/11/6
Conference
2011 International Conference on Computer Vision
Pages
415-422
Publisher
IEEE
Description
We present a new approach to general-activity human pose estimation from depth images, building on Hough forests. We extend existing techniques in several ways: real time prediction of multiple 3D joints, explicit learning of voting weights, vote compression to allow larger training sets, and a comparison of several decision-tree training objectives. Key aspects of our work include: regression directly from the raw depth image, without the use of an arbitrary intermediate representation; applicability to general motions (not constrained to particular activities) and the ability to localize occluded as well as visible body joints. Experimental results demonstrate that our method produces state of the art results on several data sets including the challenging MSRC-5000 pose estimation test set, at a speed of about 200 frames per second. Results on silhouettes suggest broader applicability to other imaging modalities.
Total citations
20112012201320142015201620172018201920202021202220232024251727561714335392515873
Scholar articles
R Girshick, J Shotton, P Kohli, A Criminisi, A Fitzgibbon - 2011 International Conference on Computer Vision, 2011