Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1612.00837
Cited By
v1
v2
v3 (latest)
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
2 December 2016
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering"
50 / 2,037 papers shown
Title
The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers
Dongxiang Zhang
Lei Wang
Nuo Xu
B. Dai
Heng Tao Shen
ReLM
AIMat
103
127
0
22 Aug 2018
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference
Rowan Zellers
Yonatan Bisk
Roy Schwartz
Yejin Choi
162
720
0
16 Aug 2018
How Much Reading Does Reading Comprehension Require? A Critical Investigation of Popular Benchmarks
Divyansh Kaushik
Zachary Chase Lipton
ELM
114
233
0
14 Aug 2018
Community Regularization of Visually-Grounded Dialog
Akshat Agarwal
Swaminathan Gurumurthy
Vasu Sharma
M. Lewis
Katia Sycara
73
10
0
10 Aug 2018
A Joint Sequence Fusion Model for Video Question Answering and Retrieval
Youngjae Yu
Jongseok Kim
Gunhee Kim
113
347
0
07 Aug 2018
Learning Visual Question Answering by Bootstrapping Hard Attention
Mateusz Malinowski
Carl Doersch
Adam Santoro
Peter W. Battaglia
OOD
94
96
0
01 Aug 2018
Interpretable Visual Question Answering by Visual Grounding from Attention Supervision Mining
Yundong Zhang
Juan Carlos Niebles
Á. Soto
87
68
0
01 Aug 2018
Pythia v0.1: the Winning Entry to the VQA Challenge 2018
Yu Jiang
Vivek Natarajan
Xinlei Chen
Marcus Rohrbach
Dhruv Batra
Devi Parikh
VLM
114
203
0
26 Jul 2018
Explainable Neural Computation via Stack Neural Module Networks
Ronghang Hu
Jacob Andreas
Trevor Darrell
Kate Saenko
LRM
OCL
112
199
0
23 Jul 2018
Question Relevance in Visual Question Answering
Prakruthi Prabhakar
Nitish Kulkarni
Linghao Zhang
38
6
0
23 Jul 2018
Dynamic Multimodal Instance Segmentation guided by natural language queries
Edgar Margffoy-Tuay
Juan C. Pérez
Emilio Botero
Pablo Arbelaez
101
176
0
06 Jul 2018
Collaborative Annotation of Semantic Objects in Images with Multi-granularity Supervisions
Lishi Zhang
Chenghan Fu
Jia Li
55
8
0
27 Jun 2018
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features
Chiori Hori
Huda AlAmri
Jue Wang
Gordon Wichern
Takaaki Hori
...
Raphael Gontijo-Lopes
Abhishek Das
Irfan Essa
Dhruv Batra
Devi Parikh
VGen
81
125
0
21 Jun 2018
Learning Conditioned Graph Structures for Interpretable Visual Question Answering
Will Norcliffe-Brown
Efstathios Vafeias
Sarah Parisot
GNN
114
239
0
19 Jun 2018
Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su
Chen Zhu
Yinpeng Dong
Dongqi Cai
Yurong Chen
Jianguo Li
96
62
0
13 Jun 2018
Cross-Dataset Adaptation for Visual Question Answering
Wei-Lun Chao
Hexiang Hu
Fei Sha
OOD
88
49
0
10 Jun 2018
Learning Answer Embeddings for Visual Question Answering
Hexiang Hu
Wei-Lun Chao
Fei Sha
65
33
0
10 Jun 2018
CS-VQA: Visual Question Answering with Compressively Sensed Images
Li-Chi Huang
K. Kulkarni
Anik Jha
Suhas Lohit
Suren Jayasuriya
Pavan Turaga
CoGe
39
8
0
08 Jun 2018
Visual Reasoning by Progressive Module Networks
Seung Wook Kim
Makarand Tapaswi
Sanja Fidler
ReLM
LRM
70
13
0
06 Jun 2018
Focal Visual-Text Attention for Visual Question Answering
Junwei Liang
Lu Jiang
Liangliang Cao
Li Li
Alexander G. Hauptmann
77
112
0
05 Jun 2018
On the Flip Side: Identifying Counterexamples in Visual Question Answering
Gabriel Grand
Aron Szanto
Yoon Kim
Alexander Rush
25
0
0
03 Jun 2018
Visual Referring Expression Recognition: What Do Systems Actually Learn?
Volkan Cirik
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
86
63
0
30 May 2018
Joint Image Captioning and Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
57
13
0
22 May 2018
Reproducibility Report for "Learning To Count Objects In Natural Images For Visual Question Answering"
Shagun Sodhani
Vardaan Pahuja
15
0
0
21 May 2018
A new dataset and model for learning to understand navigational instructions
Ozan Arkan Can
Deniz Yuret
68
1
0
21 May 2018
Bilinear Attention Networks
Jin-Hwa Kim
Jaehyun Jun
Byoung-Tak Zhang
AIMat
159
880
0
21 May 2018
Did the Model Understand the Question?
Pramod Kaushik Mudrakarta
Ankur Taly
Mukund Sundararajan
Kedar Dhamdhere
ELM
OOD
FAtt
90
200
0
14 May 2018
Reciprocal Attention Fusion for Visual Question Answering
M. Farazi
Salman H Khan
72
14
0
11 May 2018
Question Type Guided Attention in Visual Question Answering
Yang Shi
Tommaso Furlanello
Sheng Zha
Anima Anandkumar
72
46
0
06 Apr 2018
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
94
282
0
03 Apr 2018
Differential Attention for Visual Question Answering
Badri N. Patro
Vinay P. Namboodiri
AIMat
94
75
0
01 Apr 2018
Visual Question Reasoning on General Dependency Tree
Qingxing Cao
Xiaodan Liang
Bailin Li
Guanbin Li
Liang Lin
CoGe
94
37
0
31 Mar 2018
Generalized Hadamard-Product Fusion Operators for Visual Question Answering
Brendan Duke
Graham W. Taylor
40
8
0
26 Mar 2018
Attention on Attention: Architectures for Visual Question Answering (VQA)
Jasdeep Singh
Vincent Ying
Alex Nutkiewicz
63
26
0
21 Mar 2018
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Qing Li
Qingyi Tao
Shafiq Joty
Jianfei Cai
Jiebo Luo
109
109
0
20 Mar 2018
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
71
29
0
16 Mar 2018
Annotation Artifacts in Natural Language Inference Data
Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
236
1,181
0
06 Mar 2018
VizWiz Grand Challenge: Answering Visual Questions from Blind People
Danna Gurari
Qing Li
Abigale Stangl
Anhong Guo
Chi Lin
Kristen Grauman
Jiebo Luo
Jeffrey P. Bigham
CoGe
290
866
0
22 Feb 2018
Learning to Count Objects in Natural Images for Visual Question Answering
Yan Zhang
Jonathon S. Hare
Adam Prugel-Bennett
OOD
90
208
0
15 Feb 2018
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
87
425
0
15 Feb 2018
Dual Recurrent Attention Units for Visual Question Answering
Ahmed Osman
Wojciech Samek
64
32
0
01 Feb 2018
Object-based reasoning in VQA
Mikyas T. Desta
Larry Chen
Tomasz Kornuta
67
33
0
29 Jan 2018
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
Qing Li
Jianlong Fu
D. Yu
Tao Mei
Jiebo Luo
FAtt
XAI
CoGe
102
60
0
27 Jan 2018
DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle
Brian L. Price
Scott D. Cohen
Christopher Kanan
AIMat
147
397
0
24 Jan 2018
Structured Triplet Learning with POS-tag Guided Attention for Visual Question Answering
Zhe Wang
Xiaoyi Liu
Liangjian Chen
Limin Wang
Yu Qiao
Xiaohui Xie
Charless C. Fowlkes
63
14
0
24 Jan 2018
What do we need to build explainable AI systems for the medical domain?
Andreas Holzinger
Chris Biemann
C. Pattichis
D. Kell
97
695
0
28 Dec 2017
Interpretable Counting for Visual Question Answering
Alexander R. Trott
Caiming Xiong
R. Socher
111
71
0
23 Dec 2017
CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication
Jin-Hwa Kim
Nikita Kitaev
Xinlei Chen
Marcus Rohrbach
Byoung-Tak Zhang
Yuandong Tian
Dhruv Batra
Devi Parikh
DiffM
VGen
100
25
0
15 Dec 2017
IQA: Visual Question Answering in Interactive Environments
Daniel Gordon
Aniruddha Kembhavi
Mohammad Rastegari
Joseph Redmon
Dieter Fox
Ali Farhadi
LM&Ro
146
391
0
09 Dec 2017
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
OOD
186
587
0
01 Dec 2017
Previous
1
2
3
...
39
40
41
Next