Time-Ordered Spatial-Temporal Interest Points For Human Action Classification

Keywords

Human action classification; Spatial-temporal interest point

Abstract

Human action classification, which is vital for content-based video retrieval and human-machine interaction, finds problem in distinguishing similar actions. Previous works typically detect spatial-temporal interest points (STIPs) from action sequences and then adopt bag-of-visual words (BoVW) model to describe actions as numerical statistics of STIPs. Despite the robustness of BoVW, this model ignores the spatial-temporal layout of STIPs, leading to misclassification among different types of actions with similar numerical statistics of STIPs. Motivated by this, a time-ordered feature is designed to describe the temporal distribution of STIPs, which contains complementary structural information to traditional BoVW model. Moreover, a temporal refinement method is used to eliminate intra-variations among time-ordered features caused by performers' habits. Then a time-ordered BoVW model is built to represent actions, which encodes both numerical statistics and temporal distribution of STIPs. Extensive experiments on three challenging datasets, i.e., KTH, Rochster and UT-Interaction, validate the effectiveness of our method in distinguishing similar actions.

Publication Date

8-28-2017

Publication Title

Proceedings - IEEE International Conference on Multimedia and Expo

Number of Pages

655-660

Document Type

Article; Proceedings Paper

Personal Identifier

scopus

DOI Link

https://doi.org/10.1109/ICME.2017.8019477

Socpus ID

85030251297 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/85030251297

This document is currently not available here.

Share

COinS