Cross-Modal Retrieval Using Deep De-Correlated Subspace Ranking Hashing
Keywords
Cross-modal hashing; Image and text retrieval; Multimodal retrieval
Abstract
Cross-modal hashing has become a popular research topic in recent years due to the efficiency of storing and retrieving high-dimensional multimodal data represented by compact binary codes. While most cross-modal hash functions use binary space partitioning functions (e.g. the sign function), our method uses ranking-based hashing, which is based on numerically stable and scale-invariant rank correlation measures. In this paper, we propose a novel deep learning architecture called Deep De-correlated Subspace Ranking Hashing (DDSRH) that uses feature-ranking methods to determine the hash codes for the image and text modalities in a common hamming space. Specifically, DDSRH learns a set of de-correlated nonlinear subspaces on which to project the original features, so that the hash code can be determined by the relative ordering of projected feature values in a given optimized subspace. The network relies upon a pre-trained deep feature learning network for each modality, and a hashing network responsible for optimizing the hash codes based on the known similarity of the training image-text pairs. Our proposed method includes both architectural and mathematical techniques designed specifically for ranking-based hashing in order to achieve de-correlation between the bits, bit balancing, and quantization. Finally, through extensive experimental studies on two widely-used multimodal datasets, we show that the combination of these techniques can achieve state-of the-art performance on several benchmarks.
Publication Date
6-5-2018
Publication Title
ICMR 2018 - Proceedings of the 2018 ACM International Conference on Multimedia Retrieval
Number of Pages
55-63
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1145/3206025.3206066
Copyright Status
Unknown
Socpus ID
85053904084 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/85053904084
STARS Citation
Joslyn, Kevin; Li, Kai; and Hua, Kien A., "Cross-Modal Retrieval Using Deep De-Correlated Subspace Ranking Hashing" (2018). Scopus Export 2015-2019. 10131.
https://stars.library.ucf.edu/scopus2015/10131