3D Action Recognition Using Multiscale Energy-Based Global Ternary Image
Keywords
Action recognition; depth sequence; human-computer interaction
Abstract
This paper presents an effective multiscale energy-based global ternary image (E-GTI) representation for action recognition from depth sequences. The unique property of our representation is that it takes the spatiotemporal discrimination and action speed variations into account, intending to solve the problems of distinguishing similar actions and identifying the actions with different speeds in one goal. The entire method is carried out in two stages. In the first stage, consecutive depth frames are used to generate global ternary image (GTI) features, which implicitly capture both inter-frame motion regions and motion directions. Specifically, each pixel in the GTI represents one of three possible states, namely, positive, negative, and neutral, which indicate the increased, decreased, and same depth values, respectively. To cope with speed variations in actions, energy-based sampling method is utilized, leading to multiscale E-GTI features, where the multiscale scheme can efficiently capture the temporal relationships among frames. In the second stage, all the E-GTI features are transformed by Radon transform (RT) as robust descriptors, which are aggregated by the bag-of-visual-words model as a compact representation. Extensive experiments on benchmark data sets show that our representation outperforms state-of-the-art approaches, since it captures discriminating spatiotemporal information of actions. Due to the merits of energy-based sampling and RT methods, our representation shows robustness to speed variations, depth noise, and partial occlusions.
Publication Date
8-1-2018
Publication Title
IEEE Transactions on Circuits and Systems for Video Technology
Volume
28
Issue
8
Number of Pages
1824-1838
Document Type
Article
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/TCSVT.2017.2655521
Copyright Status
Unknown
Socpus ID
85045312364 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/85045312364
STARS Citation
Liu, Mengyuan; Liu, Hong; and Chen, Chen, "3D Action Recognition Using Multiscale Energy-Based Global Ternary Image" (2018). Scopus Export 2015-2019. 9048.
https://stars.library.ucf.edu/scopus2015/9048