Title
Content Based Video Matching Using Spatiotemporal Volumes
Keywords
Motion segmentation; Spatiotemporal volumes; Video matching; Video retrieval
Abstract
This paper presents a novel framework for matching video sequences using the spatiotemporal segmentation of videos. Instead of using appearance features for region correspondence across frames, we use interest point trajectories to generate video volumes. Point trajectories, which are generated using the SIFT operator, are clustered to form motion segments by analyzing their motion and spatial properties. The temporal correspondence between the estimated motion segments is then established based on most common SIFT correspondences. A two pass correspondence algorithm is used to handle splitting and merging regions. Spatiotemporal volumes are extracted using the consistently tracked motion segments. Next, a set of features including color, texture, motion, and SIFT descriptors are extracted to represent a volume. We employ an Earth Mover's Distance (EMD) based approach for the comparison of volume features. Given two videos, a bipartite graph is constructed by modeling the volumes as vertices and their similarities as edge weights. Maximum matching of this graph produces volume correspondences between the videos, and these volume matching scores are used to compute the final video matching score. Experiments for video retrieval were performed on a variety of videos obtained from different sources including BBC Motion Gallery and promising results were achieved. We present qualitative and quantitative analysis of retrieval along with a comparison with two baseline methods. © 2007 Elsevier Inc. All rights reserved.
Publication Date
6-1-2008
Publication Title
Computer Vision and Image Understanding
Volume
110
Issue
3
Number of Pages
360-377
Document Type
Article
Personal Identifier
scopus
DOI Link
https://doi.org/10.1016/j.cviu.2007.09.016
Copyright Status
Unknown
Socpus ID
43049162147 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/43049162147
STARS Citation
Basharat, Arslan; Zhai, Yun; and Shah, Mubarak, "Content Based Video Matching Using Spatiotemporal Volumes" (2008). Scopus Export 2000s. 9878.
https://stars.library.ucf.edu/scopus2000/9878