Title

Detection and Representation of Scenes in Videos

Keywords

Graph partitioning; Key-frames; Normalized cuts; Scene; Shot; Video segmentation

Abstract

This paper presents a method to perform a high-level segmentation of videos into scenes. A scene can be defined as a subdivision of a play in which either the setting is fixed or the action is continuous in one place. We exploit this fact and propose a novel approach for clustering shots into scenes by transforming this task into a graph partitioning problem. This is achieved by constructing a weighted undirected graph called a shot similarity graph (SSG), where each node represents a shot and the edges between the shots are weighted by their similarity based on color and motion information. The SSG is then split into subgraphs by applying normalized cuts for graph partitioning. The partitions so obtained represent individual scenes in the video. When clustering the shots, we consider the global similarities of shots rather than individual shot pairs. We also propose a method to describe the content of each scene by selecting one representative image from the video as a scene key-frame. Recently, DVDs have become available with a chapter selection option where each chapter is represented by one image. Our algorithm automates this task, which is useful for applications such as video-on-demand, digital libraries, and the Internet. Experiments are presented with promising results on several Hollywood movies and one sitcom. © 2005 IEEE.
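
The graph-partitioning idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the similarity function, so the Gaussian weight on a generic per-shot feature vector, the parameter `sigma`, and the function names `shot_similarity_graph` and `normalized_cut_bipartition` are assumptions made for illustration. The split uses the standard spectral relaxation of normalized cuts (second-smallest eigenvector of the normalized Laplacian, thresholded at zero) and performs only a single two-way cut.

```python
import numpy as np


def shot_similarity_graph(features, sigma=0.5):
    """Return a symmetric weight matrix W for a toy shot similarity graph.

    Assumption: each shot is summarized by one feature vector (for example
    a color histogram), and edge weights are a Gaussian of the Euclidean
    distance between vectors. The paper's actual color and motion
    similarity is not reproduced here.
    """
    features = np.asarray(features, dtype=float)
    diff = features[:, None, :] - features[None, :, :]
    sq_dist = (diff ** 2).sum(axis=2)
    return np.exp(-sq_dist / (2.0 * sigma ** 2))


def normalized_cut_bipartition(W):
    """Split the graph in two using the relaxed normalized-cut criterion.

    Solves (D - W) y = lambda * D y through the symmetric normalized
    Laplacian and thresholds the second-smallest eigenvector at zero.
    Returns a boolean array; the two truth values mark the two partitions.
    """
    W = np.asarray(W, dtype=float)
    degree = W.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(degree)
    laplacian = np.eye(len(degree)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    # eigh returns eigenvalues in ascending order for symmetric matrices.
    _, eigvecs = np.linalg.eigh(laplacian)
    y = d_inv_sqrt * eigvecs[:, 1]
    return y > 0


# Six hypothetical shots: the first three and last three look alike.
toy_features = [
    [0.10, 0.10], [0.12, 0.09], [0.11, 0.10],
    [0.90, 0.90], [0.88, 0.91], [0.90, 0.89],
]
W = shot_similarity_graph(toy_features)
scene_labels = normalized_cut_bipartition(W)
```

On the toy input the first three shots fall in one partition and the last three in the other; which side is labeled `True` depends on the arbitrary sign of the eigenvector. Segmenting a full video would require applying such a cut repeatedly with a stopping criterion, which this sketch leaves out.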

Publication Date

12-1-2005

Publication Title

IEEE Transactions on Multimedia

Volume

7

Issue

6

Pages

1097-1105

Document Type

Article

Personal Identifier

scopus

DOI Link

https://doi.org/10.1109/TMM.2005.858392

Scopus ID

29044434462 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/29044434462
