View article

[PDF] from archive.org

Embedding motion and structure features for action recognition

Authors

Xiantong Zhen, Ling Shao, Dacheng Tao, Xuelong Li

Publication date

2013/1/16

Journal

IEEE Transactions on Circuits and Systems for Video Technology

Volume

Issue

Pages

1182-1190

Publisher

IEEE

Description

We propose a novel method to model human actions by explicitly coding motion and structure features that are separately extracted from video sequences. Firstly, the motion template (one feature map) is applied to encode the motion information and image planes (five feature maps) are extracted from the volume of differences of frames to capture the structure information. The Gaussian pyramid and center-surround operations are performed on each of the six obtained feature maps, decomposing each feature map into a set of subband maps. Biologically inspired features are then extracted by successively applying Gabor filtering and max pooling on each subband map. To make a compact representation, discriminative locality alignment is employed to embed the high-dimensional features into a low-dimensional manifold space. In contrast to sparse representations based on detected interest points, which suffer …

Total citations

Cited by 74

20132014201520162017201820192020202120225 15 12 10 7 8 6 4 2 4

Scholar articles

Embedding motion and structure features for action recognition

X Zhen, L Shao, D Tao, X Li - IEEE Transactions on Circuits and Systems for Video …, 2013