Title
Learning, Detection And Representation Of Multi-Agent Events In Videos
Keywords
Edge weighted directed hypergraph; Event detection; Event learning; Event representation; Normalized cut; P-CASE; Temporal logic
Abstract
In this paper, we model multi-agent events in terms of a temporally varying sequence of sub-events, and propose a novel approach for learning, detecting and representing events in videos. The proposed approach has three main steps. First, in order to learn the event structure from training videos, we automatically encode the sub-event dependency graph, which is the learnt event model that depicts the conditional dependency between sub-events. Second, we pose the problem of event detection in novel videos as clustering the maximally correlated sub-events using normalized cuts. The principal assumption made in this work is that the events are composed of a highly correlated chain of sub-events that have high weights (association) within the cluster and relatively low weights (disassociation) between the clusters. The event detection does not require prior knowledge of the number of agents involved in an event and does not make any assumptions about the length of an event. Third, we recognize the fact that any abstract event model should extend to representations related to human understanding of events. Therefore, we propose an extension of CASE representation of natural languages that allows a plausible means of interface between users and the computer. We show results of learning, detection, and representation of events for videos in the meeting, surveillance, and railroad monitoring domains. © 2007 Elsevier B.V. All rights reserved.
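The detection step described in the abstract, grouping sub-events whose mutual weights are high inside a cluster and low across clusters, can be illustrated with a two-way normalized cut. The following is a minimal sketch, not the authors' implementation: it assumes a small, symmetric, pairwise weight matrix `W` with made-up values, whereas the paper works with an edge weighted directed hypergraph. The function names `normalized_cut_value` and `ncut_bipartition` are hypothetical, and the split uses the standard spectral relaxation (second-smallest eigenvector of the normalized Laplacian).

```python
import numpy as np

def normalized_cut_value(W, mask):
    """Ncut(A, B) = cut(A, B) / assoc(A, V) + cut(A, B) / assoc(B, V)."""
    a, b = mask, ~mask
    cut = W[np.ix_(a, b)].sum()   # total weight crossing the partition
    assoc_a = W[a, :].sum()       # total weight from A to all nodes
    assoc_b = W[b, :].sum()       # total weight from B to all nodes
    return cut / assoc_a + cut / assoc_b

def ncut_bipartition(W):
    """Split nodes in two using the spectral relaxation of normalized cut."""
    d = W.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(d)
    # Symmetric normalized Laplacian: I - D^{-1/2} W D^{-1/2}
    lap = np.eye(len(d)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    _, eigvecs = np.linalg.eigh(lap)      # eigenvalues in ascending order
    y = d_inv_sqrt * eigvecs[:, 1]        # generalized second eigenvector
    return y > 0                          # threshold at zero

# Illustrative weights (assumed values): sub-events 0-2 are strongly
# correlated with each other, as are 3-4; links between groups are weak.
W = np.array([
    [0.00, 0.90, 0.80, 0.05, 0.00],
    [0.90, 0.00, 0.90, 0.00, 0.05],
    [0.80, 0.90, 0.00, 0.05, 0.00],
    [0.05, 0.00, 0.05, 0.00, 0.90],
    [0.00, 0.05, 0.00, 0.90, 0.00],
])

mask = ncut_bipartition(W)
clusters = [np.flatnonzero(mask).tolist(), np.flatnonzero(~mask).tolist()]
ncut = normalized_cut_value(W, mask)
```

Here `clusters` holds the two groups of sub-event indices and `ncut` the cost of the split; a low value indicates strong association within each group and weak association between them. Which group ends up first depends on the sign of the eigenvector, so the order of `clusters` is not meaningful.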
Publication Date
6-1-2007
Publication Title
Artificial Intelligence
Volume
171
Issue
8-9
Pages
586-605
Document Type
Article
Personal Identifier
scopus
DOI Link
https://doi.org/10.1016/j.artint.2007.04.002
Copyright Status
Unknown
Scopus ID
34249677392 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/34249677392
STARS Citation
Hakeem, Asaad and Shah, Mubarak, "Learning, Detection And Representation Of Multi-Agent Events In Videos" (2007). Scopus Export 2000s. 6561.
https://stars.library.ucf.edu/scopus2000/6561