Simplified Lstm Unit And Search Space Probability Exploration For Image Description
Abstract
We present a novel method for addressing the semantic description of images. Our method offers two main contributions. First we introduce a recurrent unit that we call a simplified long short-term memory (LSTM) unit which, in contrast to traditional LSTM units, couples the functions of the input and forget gates into a single gate; we observed that this simpler unit improves accuracy. We also propose a novel algorithm for exploring the search space of sentences inferred through a joined Convolutional Neural Network (CNN) and our simplified LSTM unit. We explore the search space by reducing it to a search over sequential trees for the combination of sequences that best represent the image to be described. Our results show improvement over the state of the art methods on the COCO [1] and Flickr8K [2] datasets.
Publication Date
4-26-2016
Publication Title
2015 10th International Conference on Information, Communications and Signal Processing, ICICS 2015
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/ICICS.2015.7459976
Copyright Status
Unknown
Socpus ID
84973636447 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/84973636447
STARS Citation
Nina, Oliver and Rodriguez, Andres, "Simplified Lstm Unit And Search Space Probability Exploration For Image Description" (2016). Scopus Export 2015-2019. 4071.
https://stars.library.ucf.edu/scopus2015/4071