1505.00468
Cited By
v7 (latest)
VQA: Visual Question Answering
3 May 2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
Papers citing
"VQA: Visual Question Answering"
50 / 2,957 papers shown
Title
Leveraging Video Descriptions to Learn Video Question Answering
Kuo-Hao Zeng
Tseng-Hung Chen
Ching-Yao Chuang
Yuan-Hong Liao
Juan Carlos Niebles
Min Sun
108
180
0
12 Nov 2016
Crowdsourcing in Computer Vision
Adriana Kovashka
Olga Russakovsky
Li Fei-Fei
Kristen Grauman
HAI
VLM
3DV
61
151
0
07 Nov 2016
Dynamic Coattention Networks For Question Answering
Caiming Xiong
Victor Zhong
R. Socher
AIMat
94
684
0
05 Nov 2016
Bidirectional Attention Flow for Machine Comprehension
Minjoon Seo
Aniruddha Kembhavi
Ali Farhadi
Hannaneh Hajishirzi
214
2,091
0
05 Nov 2016
Dual Attention Networks for Multimodal Reasoning and Matching
Hyeonseob Nam
Jung-Woo Ha
Jeonghee Kim
134
670
0
02 Nov 2016
End-to-end Learning of Deep Visual Representations for Image Retrieval
Albert Gordo
Jon Almazán
Jérôme Revaud
Diane Larlus
VLM
86
542
0
25 Oct 2016
Proposing Plausible Answers for Open-ended Visual Question Answering
Omid Bakhshandeh
Trung Bui
Zhe Lin
W. Chang
31
1
0
20 Oct 2016
Deep Identity-aware Transfer of Facial Attributes
Mu Li
W. Zuo
David C. Zhang
CVBM
103
149
0
18 Oct 2016
Video Fill in the Blank with Merging LSTMs
Amir Mazaheri
Dong Zhang
M. Shah
72
18
0
13 Oct 2016
Open-Ended Visual Question-Answering
Issey Masuda
Santiago Pascual de la Puente
Xavier Giró-i-Nieto
30
9
0
09 Oct 2016
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models
Ashwin K. Vijayakumar
Michael Cogswell
Ramprasaath R. Selvaraju
Q. Sun
Stefan Lee
David J. Crandall
Dhruv Batra
95
555
0
07 Oct 2016
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
FAtt
618
20,227
0
07 Oct 2016
Visual Question Answering: Datasets, Algorithms, and Future Challenges
Kushal Kafle
Christopher Kanan
OOD
101
244
0
05 Oct 2016
A Survey of Multi-View Representation Learning
Yingming Li
Ming Yang
Zhongfei Zhang
AI4TS
3DV
346
517
0
03 Oct 2016
Contextual RNN-GANs for Abstract Reasoning Diagram Generation
Arnab Ghosh
Viveka Kulharia
A. Mukerjee
Vinay P. Namboodiri
Mohit Bansal
GAN
72
37
0
29 Sep 2016
Learning Language-Visual Embedding for Movie Understanding with Natural-Language
Atousa Torabi
Niket Tandon
Leonid Sigal
79
98
0
26 Sep 2016
Visual Fashion-Product Search at SK Planet
Taewan Kim
Seyeong Kim
Sangil Na
Hayoon Kim
Moonki Kim
Beyeongki Jeon
73
6
0
26 Sep 2016
Image-embodied Knowledge Representation Learning
Ruobing Xie
Zhiyuan Liu
Huanbo Luan
Maosong Sun
195
220
0
22 Sep 2016
The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA)
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
83
15
0
21 Sep 2016
Graph-Structured Representations for Visual Question Answering
Damien Teney
Lingqiao Liu
Anton Van Den Hengel
GNN
NAI
123
422
0
19 Sep 2016
The ACRV Picking Benchmark (APB): A Robotic Shelf Picking Benchmark to Foster Reproducible Research
Jurgen Leitner
Adam W. Tow
Jake E. Dean
Niko Sünderhauf
Joseph W. Durham
...
James Sergeant
Liao Wu
Fangyi Zhang
B. Upcroft
Peter Corke
82
79
0
17 Sep 2016
Towards Transparent AI Systems: Interpreting Visual Question Answering Models
Yash Goyal
Akrit Mohapatra
Devi Parikh
Dhruv Batra
69
74
0
31 Aug 2016
Measuring Machine Intelligence Through Visual Question Answering
C. L. Zitnick
Aishwarya Agrawal
Stanislaw Antol
Margaret Mitchell
Dhruv Batra
Devi Parikh
83
37
0
31 Aug 2016
Visual Question: Predicting If a Crowd Will Agree on the Answer
Danna Gurari
Kristen Grauman
HAI
71
2
0
29 Aug 2016
Machine Comprehension Using Match-LSTM and Answer Pointer
Shuohang Wang
Jing Jiang
100
594
0
29 Aug 2016
Convolutional Network for Attribute-driven and Identity-preserving Human Face Generation
Mu Li
W. Zuo
David C. Zhang
CVBM
70
51
0
23 Aug 2016
Solving Visual Madlibs with Multiple Cues
Tatiana Tommasi
Arun Mallya
Bryan A. Plummer
Svetlana Lazebnik
Alexander C. Berg
Tamara L. Berg
85
18
0
11 Aug 2016
Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task
Ashkan Mokarian
Mateusz Malinowski
Mario Fritz
70
5
0
09 Aug 2016
Cognitive Science in the era of Artificial Intelligence: A roadmap for reverse-engineering the infant language-learner
Emmanuel Dupoux
109
158
0
29 Jul 2016
Visual Question Answering: A Survey of Methods and Datasets
Qi Wu
Damien Teney
Peng Wang
Chunhua Shen
A. Dick
Anton Van Den Hengel
111
418
0
20 Jul 2016
Annotation Methodologies for Vision and Language Dataset Creation
Gitit Kehat
James Pustejovsky
30
2
0
10 Jul 2016
Intra-layer Nonuniform Quantization for Deep Convolutional Neural Network
Fangxuan Sun
Jun Lin
Zhongfeng Wang
MQ
34
3
0
10 Jul 2016
Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes
Çağlar Gülçehre
A. Chandar
Kyunghyun Cho
Yoshua Bengio
149
64
0
30 Jun 2016
Revisiting Visual Question Answering Baselines
Allan Jabri
Armand Joulin
Laurens van der Maaten
OOD
67
83
0
27 Jun 2016
Sort Story: Sorting Jumbled Images and Captions into Stories
Harsh Agrawal
Arjun Chandrasekaran
Dhruv Batra
Devi Parikh
Mohit Bansal
110
60
0
23 Jun 2016
CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks
Jindřich Libovický
Jindřich Helcl
Marek Tlustý
Pavel Pecina
Ondřej Bojar
78
68
0
23 Jun 2016
VideoMCC: a New Benchmark for Video Comprehension
Du Tran
Maksim Bolonkin
Manohar Paluri
Lorenzo Torresani
39
1
0
23 Jun 2016
Analyzing the Behavior of Visual Question Answering Models
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
99
315
0
23 Jun 2016
Semantic Parsing to Probabilistic Programs for Situated Question Answering
Jayant Krishnamurthy
Oyvind Tafjord
Aniruddha Kembhavi
77
25
0
22 Jun 2016
Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
Arijit Ray
Gordon A. Christie
Mohit Bansal
Dhruv Batra
Devi Parikh
87
56
0
21 Jun 2016
DualNet: Domain-Invariant Network for Visual Question Answering
Kuniaki Saito
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
81
59
0
20 Jun 2016
FVQA: Fact-based Visual Question Answering
Peng Wang
Qi Wu
Chunhua Shen
Anton van den Hengel
A. Dick
CoGe
115
464
0
17 Jun 2016
Training Recurrent Answering Units with Joint Loss Minimization for VQA
Hyeonwoo Noh
Bohyung Han
101
71
0
12 Jun 2016
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
Abhishek Das
Harsh Agrawal
C. L. Zitnick
Devi Parikh
Dhruv Batra
132
467
0
11 Jun 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
344
1,471
0
06 Jun 2016
Multimodal Residual Learning for Visual QA
Jin-Hwa Kim
Sang-Woo Lee
Donghyun Kwak
Min-Oh Heo
Jeonghee Kim
Jung-Woo Ha
Byoung-Tak Zhang
73
302
0
05 Jun 2016
Adversarial Feature Learning
Jeff Donahue
Philipp Krähenbühl
Trevor Darrell
GAN
148
1,614
0
31 May 2016
End-to-End Instance Segmentation with Recurrent Attention
Mengye Ren
R. Zemel
SSeg
118
62
0
30 May 2016
HARRISON: A Benchmark on HAshtag Recommendation for Real-world Images in Social Networks
Minseok Park
Hanxiang Li
Junmo Kim
3DV
VLM
20
26
0
17 May 2016
Review of state-of-the-arts in artificial intelligence with application to AI safety problem
V. Shakirov
66
10
0
11 May 2016