Authors
Fabio Carrara, Petr Elias, Jan Sedmidubsky, Pavel Zezula
Publication date
2019/10/15
Journal
Multimedia Tools and Applications
Volume
78
Pages
27309-27331
Publisher
Springer US
Description
Motion capture data digitally represent human movements by sequences of 3D skeleton configurations. Such spatio-temporal data, often recorded in the stream-based nature, need to be efficiently processed to detect high-interest actions, for example, in human-computer interaction to understand hand gestures in real time. Alternatively, automatically annotated parts of a continuous stream can be persistently stored to become searchable, and thus reusable for future retrieval or pattern mining. In this paper, we focus on multi-label detection of user-specified actions in unsegmented sequences as well as continuous streams. In particular, we utilize the current advances in recurrent neural networks and adopt a unidirectional LSTM model to effectively encode the skeleton frames within the hidden network states. The model learns what subsequences of encoded frames belong to the specified action classes …
Total citations
201920202021202220232024381715167
Scholar articles
F Carrara, P Elias, J Sedmidubsky, P Zezula - Multimedia Tools and Applications, 2019