Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1505.00468
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
VQA: Visual Question Answering
3 May 2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"VQA: Visual Question Answering"
50 / 2,957 papers shown
Title
Reciprocal Attention Fusion for Visual Question Answering
M. Farazi
Salman H Khan
72
14
0
11 May 2018
LearningWord Embeddings for Low-resource Languages by PU Learning
Chao Jiang
Hsiang-Fu Yu
Cho-Jui Hsieh
Kai-Wei Chang
46
21
0
09 May 2018
Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog
Jiaping Zhang
Tiancheng Zhao
Zhou Yu
57
40
0
08 May 2018
Hypothesis Only Baselines in Natural Language Inference
Adam Poliak
Jason Naradowsky
Aparajita Haldar
Rachel Rudinger
Benjamin Van Durme
254
580
0
02 May 2018
Dialog-based Interactive Image Retrieval
Xiaoxiao Guo
Hui Wu
Yu Cheng
Steven J. Rennie
Gerald Tesauro
Rogerio Feris
144
209
0
01 May 2018
Learning Cross-Modal Deep Embeddings for Multi-Object Image Retrieval using Text and Sketch
S. Dey
Anjan Dutta
Suman K. Ghosh
Ernest Valveny
Josep Lladós
Umapada Pal
116
24
0
28 Apr 2018
Reward Learning from Narrated Demonstrations
H. Tung
Adam W. Harley
Liang-Kang Huang
Katerina Fragkiadaki
LM&Ro
SSL
88
29
0
27 Apr 2018
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
69
7
0
27 Apr 2018
Interactive Language Acquisition with One-shot Visual Concept Learning through a Conversational Game
Haichao Zhang
Haonan Yu
Wenyuan Xu
LLMAG
79
8
0
26 Apr 2018
Large Scale Scene Text Verification with Guided Attention
Dafang He
Yeqing Li
Alexander N. Gorban
Derrall Heath
Julian Ibarz
Qian Yu
Daniel Kifer
C. Lee Giles
3DV
36
0
0
23 Apr 2018
Multi-task Learning for Universal Sentence Embeddings: A Thorough Evaluation using Transfer and Auxiliary Tasks
Wasi Uddin Ahmad
Xueying Bai
Zhechao Huang
Chao Jiang
Nanyun Peng
Kai-Wei Chang
SSL
65
6
0
21 Apr 2018
Pathologies of Neural Models Make Interpretations Difficult
Shi Feng
Eric Wallace
Alvin Grissom II
Mohit Iyyer
Pedro Rodriguez
Jordan L. Boyd-Graber
AAML
FAtt
106
322
0
20 Apr 2018
Object Ordering with Bidirectional Matchings for Visual Reasoning
Hao Tan
Joey Tianyi Zhou
BDL
CoGe
54
16
0
18 Apr 2018
NTUA-SLP at SemEval-2018 Task 1: Predicting Affective Content in Tweets with Deep Attentive RNNs and Transfer Learning
Christos Baziotis
Athanasiou Nikolaos
Alexandra Chronopoulou
Athanasia Kolovou
Georgios Paraskevopoulos
Nikolaos Ellinas
Shrikanth Narayanan
Alexandros Potamianos
85
127
0
18 Apr 2018
Deep Multimodal Subspace Clustering Networks
Mahdi Abavisani
Vishal M. Patel
84
166
0
17 Apr 2018
Comparatives, Quantifiers, Proportions: A Multi-Task Model for the Learning of Quantities from Vision
Sandro Pezzelle
Ionut-Teodor Sorodoc
Raffaella Bernardi
40
9
0
13 Apr 2018
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech
David Harwath
Galen Chuang
James R. Glass
87
58
0
09 Apr 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
Dima Damen
Hazel Doughty
G. Farinella
Sanja Fidler
Antonino Furnari
...
Davide Moltisanti
Jonathan Munro
Toby Perrett
Will Price
Michael Wray
EgoV
145
1,040
0
08 Apr 2018
Compositional Obverter Communication Learning From Raw Visual Input
Edward Choi
Angeliki Lazaridou
Nando de Freitas
148
76
0
06 Apr 2018
Question Type Guided Attention in Visual Question Answering
Yang Shi
Tommaso Furlanello
Sheng Zha
Anima Anandkumar
72
46
0
06 Apr 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
123
201
0
04 Apr 2018
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
84
282
0
03 Apr 2018
Seeing Voices and Hearing Faces: Cross-modal biometric matching
Arsha Nagrani
Samuel Albanie
Andrew Zisserman
CVBM
107
221
0
01 Apr 2018
Differential Attention for Visual Question Answering
Badri N. Patro
Vinay P. Namboodiri
AIMat
84
75
0
01 Apr 2018
Visual Question Reasoning on General Dependency Tree
Qingxing Cao
Xiaodan Liang
Bailin Li
Guanbin Li
Liang Lin
CoGe
83
37
0
31 Mar 2018
Guide Me: Interacting with Deep Networks
Christian Rupprecht
Iro Laina
Nassir Navab
Gregory Hager
Federico Tombari
HAI
73
38
0
30 Mar 2018
Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering
Unnat Jain
Svetlana Lazebnik
Alex Schwing
MLLM
76
82
0
29 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
151
241
0
29 Mar 2018
Neural Baby Talk
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
VLM
242
436
0
27 Mar 2018
Generalized Hadamard-Product Fusion Operators for Visual Question Answering
Brendan Duke
Graham W. Taylor
40
8
0
26 Mar 2018
Scene Graph Parsing as Dependency Parsing
Yu-Siang Wang
Chenxi Liu
Fangyin Wei
Alan Yuille
GNN
3DV
52
53
0
25 Mar 2018
Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering
Somak Aditya
Yezhou Yang
Chitta Baral
LRM
NAI
ReLM
68
53
0
23 Mar 2018
Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation
Xin Eric Wang
Wenhan Xiong
Hongmin Wang
William Yang Wang
85
202
0
21 Mar 2018
Attention on Attention: Architectures for Visual Question Answering (VQA)
Jasdeep Singh
Vincent Ying
Alex Nutkiewicz
60
26
0
21 Mar 2018
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Qing Li
Qingyi Tao
Shafiq Joty
Jianfei Cai
Jiebo Luo
107
109
0
20 Mar 2018
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
71
29
0
16 Mar 2018
A Dataset and Architecture for Visual Reasoning with a Working Memory
G. R. Yang
Igor Ganichev
Xiao-Jing Wang
Jonathon Shlens
David Sussillo
71
55
0
16 Mar 2018
Compositional Attention Networks for Machine Reasoning
Drew A. Hudson
Christopher D. Manning
BDL
OOD
LRM
205
578
0
08 Mar 2018
Annotation Artifacts in Natural Language Inference Data
Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
225
1,181
0
06 Mar 2018
Totally Looks Like - How Humans Compare, Compared to Machines
Amir Rosenfeld
M. Solbach
John K. Tsotsos
3DH
94
29
0
05 Mar 2018
Deep-neural-network based sinogram synthesis for sparse-view CT image reconstruction
Hoyeon Lee
Jongha Lee
Hyeongseok Kim
B. Cho
Seungryong Cho
74
228
0
02 Mar 2018
VizWiz Grand Challenge: Answering Visual Questions from Blind People
Danna Gurari
Qing Li
Abigale Stangl
Anhong Guo
Chi Lin
Kristen Grauman
Jiebo Luo
Jeffrey P. Bigham
CoGe
171
864
0
22 Feb 2018
Multimodal Named Entity Recognition for Short Social Media Posts
Seungwhan Moon
Leonardo Neves
Vitor R. Carvalho
84
155
0
22 Feb 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan
Boyang Albert Li
Leonid Sigal
Markus Gross
AI4TS
73
20
0
19 Feb 2018
Learning to Count Objects in Natural Images for Visual Question Answering
Yan Zhang
Jonathon S. Hare
Adam Prugel-Bennett
OOD
90
208
0
15 Feb 2018
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
87
425
0
15 Feb 2018
Challenging Images For Minds and Machines
Amir Rosenfeld
John K. Tsotsos
VLM
31
1
0
13 Feb 2018
Answerer in Questioner's Mind: Information Theoretic Approach to Goal-Oriented Visual Dialog
Sang-Woo Lee
Y. Heo
Byoung-Tak Zhang
75
31
0
12 Feb 2018
FlipDial: A Generative Model for Two-Way Visual Dialogue
Daniela Massiceti
N. Siddharth
P. Dokania
Philip Torr
MLLM
92
41
0
11 Feb 2018
Recent Advances in Neural Program Synthesis
Neel Kant
NAI
106
37
0
07 Feb 2018
Previous
1
2
3
...
52
53
54
...
58
59
60
Next