Learning A Multi-Concept Video Retrieval Model With Multiple Latent Variables
Keywords
Multi-concept retrieval; Structural learning; Video retrieval
Abstract
Effective and efficient video retrieval has become a pressing need in the "big video" era and how to deal with multi-concept queries is a central component. The objective of this work is to provide a principled model for calculating the ranking scores of video in response to multiple concepts. However, it has been long overlooked and simply implemented by weighted averaging the corresponding concept detectors' scores. Our approach, which can be considered as a latent ranking SVM, integrates the advantages of various recent works on text and image retrieval, such as choosing ranking over structured prediction and modeling inter-dependencies between querying concepts and the others. Videos consist of shots and we use latent variables to account for the mutually complementary cues within and across shots. We introduce a simple and effective way to make our model robust to outliers and scarce data. Our approach gives rise to superior performance when it is tested on not only the queries seen at training, but also novel queries, some of which consist of more concepts than the seen queries used for training.
Publication Date
1-18-2017
Publication Title
Proceedings - 2016 IEEE International Symposium on Multimedia, ISM 2016
Number of Pages
615-620
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/ISM.2016.10
Copyright Status
Unknown
Socpus ID
85015230611 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/85015230611
STARS Citation
Mazaheri, Amir; Gong, Boqing; and Shah, Mubarak, "Learning A Multi-Concept Video Retrieval Model With Multiple Latent Variables" (2017). Scopus Export 2015-2019. 7472.
https://stars.library.ucf.edu/scopus2015/7472