Scopus Export 2015-2019

3D Action Recognition Using Multiscale Energy-Based Global Ternary Image

Mengyuan Liu, Peking University
Hong Liu, Peking University
Chen Chen, University of Central Florida

Keywords

Action recognition; depth sequence; human-computer interaction

Abstract

This paper presents an effective multiscale energy-based global ternary image (E-GTI) representation for action recognition from depth sequences. The unique property of our representation is that it takes the spatiotemporal discrimination and action speed variations into account, intending to solve the problems of distinguishing similar actions and identifying the actions with different speeds in one goal. The entire method is carried out in two stages. In the first stage, consecutive depth frames are used to generate global ternary image (GTI) features, which implicitly capture both inter-frame motion regions and motion directions. Specifically, each pixel in the GTI represents one of three possible states, namely, positive, negative, and neutral, which indicate the increased, decreased, and same depth values, respectively. To cope with speed variations in actions, energy-based sampling method is utilized, leading to multiscale E-GTI features, where the multiscale scheme can efficiently capture the temporal relationships among frames. In the second stage, all the E-GTI features are transformed by Radon transform (RT) as robust descriptors, which are aggregated by the bag-of-visual-words model as a compact representation. Extensive experiments on benchmark data sets show that our representation outperforms state-of-the-art approaches, since it captures discriminating spatiotemporal information of actions. Due to the merits of energy-based sampling and RT methods, our representation shows robustness to speed variations, depth noise, and partial occlusions.

Publication Date

8-1-2018

Publication Title

IEEE Transactions on Circuits and Systems for Video Technology

Volume

Issue

Number of Pages

1824-1838

Document Type

Article

Personal Identifier

scopus

DOI Link

https://doi.org/10.1109/TCSVT.2017.2655521

Copyright Status

Unknown

Socpus ID

85045312364 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/85045312364

STARS Citation

Liu, Mengyuan; Liu, Hong; and Chen, Chen, "3D Action Recognition Using Multiscale Energy-Based Global Ternary Image" (2018). Scopus Export 2015-2019. 9048.
https://stars.library.ucf.edu/scopus2015/9048

This document is currently not available here.

COinS

Scopus Export 2015-2019

3D Action Recognition Using Multiscale Energy-Based Global Ternary Image

Keywords

Abstract

Publication Date

Publication Title

Volume

Issue

Number of Pages

Document Type

Personal Identifier

DOI Link

Copyright Status

Socpus ID

Source API URL

STARS Citation

Explore

Connect

Scopus Export 2015-2019

3D Action Recognition Using Multiscale Energy-Based Global Ternary Image

Creator

Keywords

Abstract

Publication Date

Publication Title

Volume

Issue

Number of Pages

Document Type

Personal Identifier

DOI Link

Copyright Status

Socpus ID

Source API URL

STARS Citation

Share

Explore

Connect