Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1505.00468
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
VQA: Visual Question Answering
3 May 2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"VQA: Visual Question Answering"
50 / 2,957 papers shown
Title
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
106
101
0
09 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking
Xiaoyu Lin
Devi Parikh
CoGe
99
84
0
04 May 2016
Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering
Arun Mallya
Svetlana Lazebnik
101
119
0
16 Apr 2016
Visual Storytelling
Ting-Hao 'Kenneth' Huang
Huang
Francis Ferraro
N. Mostafazadeh
Ishan Misra
...
C. L. Zitnick
Devi Parikh
Lucy Vanderwende
Michel Galley
Margaret Mitchell
VGen
90
480
0
13 Apr 2016
Counting Everyday Objects in Everyday Scenes
Prithvijit Chattopadhyay
Ramakrishna Vedantam
Ramprasaath R. Selvaraju
Dhruv Batra
Devi Parikh
111
157
0
12 Apr 2016
Attributes as Semantic Units between Natural Language and Visual Recognition
Marcus Rohrbach
VLM
40
3
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
83
274
0
10 Apr 2016
Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes
Gordon A. Christie
A. Laddha
Aishwarya Agrawal
Stanislaw Antol
Yash Goyal
K. Kochersberger
Dhruv Batra
81
30
0
07 Apr 2016
A Focused Dynamic Attention Model for Visual Question Answering
Ilija Ilievski
Shuicheng Yan
Jiashi Feng
77
122
0
06 Apr 2016
Deep Image Retrieval: Learning global representations for image search
Albert Gordo
Jon Almazán
Jérôme Revaud
Diane Larlus
84
807
0
05 Apr 2016
Automatic Annotation of Structured Facts in Images
Mohamed Elhoseiny
Scott D. Cohen
W. Chang
Brian L. Price
Ahmed Elgammal
61
9
0
02 Apr 2016
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings
Spandana Gella
Mirella Lapata
Frank Keller
CoGe
82
53
0
30 Mar 2016
A Diagram Is Worth A Dozen Images
Aniruddha Kembhavi
M. Salvato
Eric Kolve
Minjoon Seo
Hannaneh Hajishirzi
Ali Farhadi
3DV
103
505
0
24 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing
Arnau Ramisa
F. Yan
Francesc Moreno-Noguer
K. Mikolajczyk
72
106
0
23 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge
Qi Wu
Chunhua Shen
Anton Van Den Hengel
Peng Wang
A. Dick
89
362
0
09 Mar 2016
Dynamic Memory Networks for Visual and Textual Question Answering
Caiming Xiong
Stephen Merity
R. Socher
88
756
0
04 Mar 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
273
5,779
0
23 Feb 2016
A Taxonomy of Deep Convolutional Neural Nets for Computer Vision
Suraj Srinivas
Ravi Kiran Sarvadevabhatla
Konda Reddy Mopuri
N. Prabhu
S. Kruthiventi
R. Venkatesh Babu
OOD
69
216
0
25 Jan 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures
Raffaella Bernardi
Ruken Cakici
Desmond Elliott
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
Frank Keller
A. Muscat
Barbara Plank
EGVM
VLM
92
365
0
15 Jan 2016
Learning to Compose Neural Networks for Question Answering
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
NAI
KELM
BDL
CoGe
130
568
0
07 Jan 2016
We Are Humor Beings: Understanding and Predicting Visual Humor
Arjun Chandrasekaran
Ashwin K. Vijayakumar
Stanislaw Antol
Joey Tianyi Zhou
Dhruv Batra
C. L. Zitnick
Devi Parikh
113
57
0
14 Dec 2015
Neural Self Talk: Image Understanding via Continuous Questioning and Answering
Yezhou Yang
Yi Li
Cornelia Fermuller
Yiannis Aloimonos
38
24
0
10 Dec 2015
MovieQA: Understanding Stories in Movies through Question-Answering
Makarand Tapaswi
Yukun Zhu
Rainer Stiefelhagen
Antonio Torralba
R. Urtasun
Sanja Fidler
122
752
0
09 Dec 2015
Simple Baseline for Visual Question Answering
Bolei Zhou
Yuandong Tian
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
FAtt
108
324
0
07 Dec 2015
A Restricted Visual Turing Test for Deep Scene and Event Understanding
Qi
Tianfu Wu
M. Lee
Song-Chun Zhu
62
12
0
06 Dec 2015
Natural Language Understanding with Distributed Representation
Kyunghyun Cho
GNN
BDL
86
55
0
24 Nov 2015
Where To Look: Focus Regions for Visual Question Answering
Kevin J. Shih
Saurabh Singh
Derek Hoiem
98
462
0
23 Nov 2015
Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes
Satwik Kottur
Ramakrishna Vedantam
José M. F. Moura
Devi Parikh
VLM
114
85
0
22 Nov 2015
Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources
Qi Wu
Peng Wang
Chunhua Shen
A. Dick
Anton Van Den Hengel
82
372
0
22 Nov 2015
Learning Deep Structure-Preserving Image-Text Embeddings
Liwei Wang
Yin Li
Svetlana Lazebnik
115
784
0
19 Nov 2015
Reducing Overfitting in Deep Networks by Decorrelating Representations
Michael Cogswell
Faruk Ahmed
Ross B. Girshick
C. L. Zitnick
Dhruv Batra
120
416
0
19 Nov 2015
ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering
Kan Chen
Jiang Wang
Liang-Chieh Chen
Haoyuan Gao
Wenyuan Xu
Ram Nevatia
86
288
0
18 Nov 2015
Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction
Hyeonwoo Noh
Paul Hongsuck Seo
Bohyung Han
OOD
78
327
0
18 Nov 2015
Compositional Memory for Visual Question Answering
Aiwen Jiang
Fang Wang
Fatih Porikli
Yi Li
CoGe
59
42
0
18 Nov 2015
Learning Articulated Motion Models from Visual and Lingual Signals
Zhengyang Wu
Joey Tianyi Zhou
Matthew R. Walter
31
0
0
17 Nov 2015
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
Huijuan Xu
Kate Saenko
122
763
0
17 Nov 2015
Yin and Yang: Balancing and Answering Binary Visual Questions
Peng Zhang
Yash Goyal
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
107
352
0
16 Nov 2015
Sherlock: Scalable Fact Learning in Images
Mohamed Elhoseiny
Scott D. Cohen
W. Chang
Brian L. Price
Ahmed Elgammal
59
26
0
16 Nov 2015
Uncovering Temporal Context for Video Question and Answering
Linchao Zhu
Zhongwen Xu
Yi Yang
Alexander G. Hauptmann
BDL
86
45
0
15 Nov 2015
Visual7W: Grounded Question Answering in Images
Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
157
890
0
11 Nov 2015
Neural Module Networks
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
CoGe
177
1,079
0
09 Nov 2015
Explicit Knowledge-based Reasoning for Visual Question Answering
Peng Wang
Qi Wu
Chunhua Shen
Anton Van Den Hengel
A. Dick
91
261
0
09 Nov 2015
Generation and Comprehension of Unambiguous Object Descriptions
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana-Maria Camburu
Alan Yuille
Kevin Patrick Murphy
ObjD
144
1,362
0
07 Nov 2015
Stacked Attention Networks for Image Question Answering
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
BDL
166
1,889
0
07 Nov 2015
Learning Visual Features from Large Weakly Supervised Data
Armand Joulin
Laurens van der Maaten
Allan Jabri
Nicolas Vasilache
SSL
131
410
0
06 Nov 2015
VISALOGY: Answering Visual Analogy Questions
Fereshteh Sadeghi
C. L. Zitnick
Ali Farhadi
83
46
0
30 Oct 2015
Using Thought-Provoking Children's Questions to Drive Artificial Intelligence Research
E. Mueller
H. Minsky
LRM
18
1
0
27 Aug 2015
A Survey of Current Datasets for Vision and Language Research
Francis Ferraro
N. Mostafazadeh
Ting-Hao 'Kenneth' Huang
Huang
Lucy Vanderwende
Jacob Devlin
Michel Galley
Margaret Mitchell
VLM
92
75
0
23 Jun 2015
Describing Common Human Visual Actions in Images
M. R. Ronchi
Pietro Perona
88
64
0
07 Jun 2015
What value do explicit high level concepts have in vision to language problems?
Qi Wu
Chunhua Shen
Lingqiao Liu
A. Dick
Anton Van Den Hengel
89
444
0
03 Jun 2015
Previous
1
2
3
...
58
59
60
Next