Scopus Export 2015-2019

Vqs: Linking Segmentations To Questions And Answers For Supervised Attention In Vqa And Question-Focused Semantic Segmentation

Chuang Gan, Tsinghua University
Yandong Li, University of Central Florida
Haoxiang Li, Adobe Inc.
Chen Sun, Google LLC
Boqing Gong, University of Central Florida

Abstract

Rich and dense human labeled datasets are among the main enabling factors for the recent advance on visionlanguage understanding. Many seemingly distant annotations (e.g., semantic segmentation and visual question answering (VQA)) are inherently connected in that they reveal different levels and perspectives of human understandings about the same visual scenes - and even the same set of images (e.g., of COCO). The popularity of COCO correlates those annotations and tasks. Explicitly linking them up may significantly benefit both individual tasks and the unified vision and language modeling.,,We present the preliminary work of linking the instance segmentations provided by COCO to the questions and answers (QAs) in the VQA dataset, and name the collected links visual questions and segmentation answers (VQS). They transfer human supervision between the previously separate tasks, offer more effective leverage to existing problems, and also open the door for new research problems and models. We study two applications of the VQS data in this paper: supervised attention for VQA and a novel question-focused semantic segmentation task. For the former, we obtain state-of-the-art results on the VQA real multiple-choice task by simply augmenting the multilayer perceptrons with some attention features that are learned using the segmentation-QA links as explicit supervision. To put the latter in perspective, we study two plausible methods and compare them to an oracle method assuming that the instance segmentations are given at the test stage.

Publication Date

12-22-2017

Publication Title

Proceedings of the IEEE International Conference on Computer Vision

Volume

2017-October

Number of Pages

1829-1838

Document Type

Article; Proceedings Paper

Personal Identifier

scopus

DOI Link

https://doi.org/10.1109/ICCV.2017.201

Copyright Status

Unknown

Socpus ID

85041915280 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/85041915280

STARS Citation

Gan, Chuang; Li, Yandong; Li, Haoxiang; Sun, Chen; and Gong, Boqing, "Vqs: Linking Segmentations To Questions And Answers For Supervised Attention In Vqa And Question-Focused Semantic Segmentation" (2017). Scopus Export 2015-2019. 7037.
https://stars.library.ucf.edu/scopus2015/7037

This document is currently not available here.

COinS

Scopus Export 2015-2019

Vqs: Linking Segmentations To Questions And Answers For Supervised Attention In Vqa And Question-Focused Semantic Segmentation

Abstract

Publication Date

Publication Title

Volume

Number of Pages

Document Type

Personal Identifier

DOI Link

Copyright Status

Socpus ID

Source API URL

STARS Citation

Explore

Connect

Scopus Export 2015-2019

Vqs: Linking Segmentations To Questions And Answers For Supervised Attention In Vqa And Question-Focused Semantic Segmentation

Creator

Abstract

Publication Date

Publication Title

Volume

Number of Pages

Document Type

Personal Identifier

DOI Link

Copyright Status

Socpus ID

Source API URL

STARS Citation

Share

Explore

Connect