Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1505.00468
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
VQA: Visual Question Answering
3 May 2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"VQA: Visual Question Answering"
50 / 2,957 papers shown
Title
Small Sample Learning in Big Data Era
Jun Shu
Zongben Xu
Deyu Meng
108
72
0
14 Aug 2018
Large Graph Exploration via Subgraph Discovery and Decomposition
J. Abello
Fred Hohman
Varun Bezzam
Duen Horng Chau
18
2
0
13 Aug 2018
Multimodal Differential Network for Visual Question Generation
Badri N. Patro
Sandeep Kumar
V. Kurmi
Vinay P. Namboodiri
68
40
0
12 Aug 2018
Community Regularization of Visually-Grounded Dialog
Akshat Agarwal
Swaminathan Gurumurthy
Vasu Sharma
M. Lewis
Katia Sycara
68
10
0
10 Aug 2018
Question-Guided Hybrid Convolution for Visual Question Answering
Peng Gao
Pan Lu
Hongsheng Li
Shuang Li
Yikang Li
Guosheng Lin
Xiaogang Wang
152
69
0
08 Aug 2018
A Joint Sequence Fusion Model for Video Question Answering and Retrieval
Youngjae Yu
Jongseok Kim
Gunhee Kim
108
347
0
07 Aug 2018
Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association
Dapeng Chen
Hongsheng Li
Xihui Liu
Yantao Shen
Zejian Yuan
Xiaogang Wang
84
134
0
05 Aug 2018
Visual Reasoning with Multi-hop Feature Modulation
Florian Strub
Mathieu Seurin
Ethan Perez
H. D. Vries
Jérémie Mary
Philippe Preux
Aaron Courville
Olivier Pietquin
95
26
0
03 Aug 2018
Neural Arithmetic Logic Units
Andrew Trask
Felix Hill
Scott E. Reed
Jack W. Rae
Chris Dyer
Phil Blunsom
NAI
96
206
0
01 Aug 2018
Learning Visual Question Answering by Bootstrapping Hard Attention
Mateusz Malinowski
Carl Doersch
Adam Santoro
Peter W. Battaglia
OOD
92
96
0
01 Aug 2018
Graph R-CNN for Scene Graph Generation
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
GNN
145
845
0
01 Aug 2018
Pythia v0.1: the Winning Entry to the VQA Challenge 2018
Yu Jiang
Vivek Natarajan
Xinlei Chen
Marcus Rohrbach
Dhruv Batra
Devi Parikh
VLM
104
203
0
26 Jul 2018
Coreset-Based Neural Network Compression
Abhimanyu Dubey
Moitreya Chatterjee
Narendra Ahuja
61
81
0
25 Jul 2018
Explainable Neural Computation via Stack Neural Module Networks
Ronghang Hu
Jacob Andreas
Trevor Darrell
Kate Saenko
LRM
OCL
108
199
0
23 Jul 2018
Question Relevance in Visual Question Answering
Prakruthi Prabhakar
Nitish Kulkarni
Linghao Zhang
38
6
0
23 Jul 2018
Revisiting Cross Modal Retrieval
Shah Nawaz
Muhammad Kamran Janjua
Alessandro Calefati
I. Gallo
33
6
0
19 Jul 2018
Convolutional Neural Networks for Aerial Multi-Label Pedestrian Detection
Amir Soleimani
Nasser M. Nasrabadi
ObjD
42
17
0
16 Jul 2018
Object Relation Detection Based on One-shot Learning
Li Zhou
Jian-jun Zhao
Jianshu Li
Li-xin Yuan
Jiashi Feng
ObjD
56
23
0
16 Jul 2018
Image Classification for Arabic: Assessing the Accuracy of Direct English to Arabic Translations
Abdulkareem Alsudais
VLM
50
4
0
13 Jul 2018
Towards Understanding End-of-trip Instructions in a Taxi Ride Scenario
Deepthi Karkada
R. Manuvinakurike
Kallirroi Georgila
40
0
0
11 Jul 2018
Talk the Walk: Navigating New York City through Grounded Dialogue
H. D. Vries
Kurt Shuster
Dhruv Batra
Devi Parikh
Jason Weston
Douwe Kiela
100
124
0
09 Jul 2018
Dynamic Multimodal Instance Segmentation guided by natural language queries
Edgar Margffoy-Tuay
Juan C. Pérez
Emilio Botero
Pablo Arbelaez
98
176
0
06 Jul 2018
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features
Chiori Hori
Huda AlAmri
Jue Wang
Gordon Wichern
Takaaki Hori
...
Raphael Gontijo-Lopes
Abhishek Das
Irfan Essa
Dhruv Batra
Devi Parikh
VGen
81
125
0
21 Jun 2018
Modularity Matters: Learning Invariant Relational Reasoning Tasks
Jason Jo
Vikas Verma
Yoshua Bengio
OOD
49
8
0
18 Jun 2018
The Neural Painter: Multi-Turn Image Generation
Ryan Y. Benmalek
Claire Cardie
Serge J. Belongie
Xiaodong He
Jianfeng Gao
MLLM
55
7
0
16 Jun 2018
Grounded Textual Entailment
H. Vu
Claudio Greco
A. Erofeeva
Somayeh Jafaritazehjan
Guido M. Linders
Marc Tanti
A. Testoni
Raffaella Bernardi
Albert Gatt
78
29
0
14 Jun 2018
Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su
Chen Zhu
Yinpeng Dong
Dongqi Cai
Yurong Chen
Jianguo Li
88
62
0
13 Jun 2018
Cross-Dataset Adaptation for Visual Question Answering
Wei-Lun Chao
Hexiang Hu
Fei Sha
OOD
83
49
0
10 Jun 2018
Learning Answer Embeddings for Visual Question Answering
Hexiang Hu
Wei-Lun Chao
Fei Sha
65
33
0
10 Jun 2018
CS-VQA: Visual Question Answering with Compressively Sensed Images
Li-Chi Huang
K. Kulkarni
Anik Jha
Suhas Lohit
Suren Jayasuriya
Pavan Turaga
CoGe
34
8
0
08 Jun 2018
Focal Visual-Text Attention for Visual Question Answering
Junwei Liang
Lu Jiang
Liangliang Cao
Li Li
Alexander G. Hauptmann
68
112
0
05 Jun 2018
On the Flip Side: Identifying Counterexamples in Visual Question Answering
Gabriel Grand
Aron Szanto
Yoon Kim
Alexander Rush
25
0
0
03 Jun 2018
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7
Huda AlAmri
Vincent Cartillier
Raphael Gontijo-Lopes
Abhishek Das
Jue Wang
...
Dhruv Batra
Devi Parikh
A. Cherian
Tim K. Marks
Chiori Hori
60
33
0
01 Jun 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Mian
Wen Liu
Syed Zulqarnain Gilani
Mubarak Shah
124
93
0
01 Jun 2018
Explaining Explanations: An Overview of Interpretability of Machine Learning
Leilani H. Gilpin
David Bau
Ben Z. Yuan
Ayesha Bajwa
Michael A. Specter
Lalana Kagal
XAI
129
1,873
0
31 May 2018
Visual Referring Expression Recognition: What Do Systems Actually Learn?
Volkan Cirik
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
81
63
0
30 May 2018
Multi-turn Dialogue Response Generation in an Adversarial Learning Framework
O. Olabiyi
A. Salimov
Anish Khazane
Erik T. Mueller
GAN
80
32
0
30 May 2018
GLAC Net: GLocal Attention Cascading Networks for Multi-image Cued Story Generation
Taehyeong Kim
Min-Oh Heo
Seonil Son
Kyoung-Wha Park
Byoung-Tak Zhang
64
78
0
28 May 2018
Interactive Text2Pickup Network for Natural Language based Human-Robot Collaboration
Hyemin Ahn
Sungjoon Choi
Nuri Kim
Geonho Cha
Songhwai Oh
56
7
0
28 May 2018
Think Visually: Question Answering through Virtual Imagery
Ankit Goyal
Jian Wang
Jia Deng
49
2
0
25 May 2018
Hyperbolic Attention Networks
Çağlar Gülçehre
Misha Denil
Mateusz Malinowski
Ali Razavi
Razvan Pascanu
...
Peter W. Battaglia
V. Bapst
David Raposo
Adam Santoro
Nando de Freitas
187
224
0
24 May 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
Pan Lu
Lei Ji
Wei Zhang
Nan Duan
M. Zhou
Jianyong Wang
CoGe
61
79
0
24 May 2018
Deep Reinforcement Learning For Sequence to Sequence Models
Yaser Keneshloo
Tian Shi
Naren Ramakrishnan
Chandan K. Reddy
AIMat
3DV
OffRL
92
211
0
24 May 2018
Joint Image Captioning and Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
57
13
0
22 May 2018
Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents
Haonan Yu
Xiaochen Lian
Haichao Zhang
Wenyuan Xu
LM&Ro
58
21
0
22 May 2018
Bilinear Attention Networks
Jin-Hwa Kim
Jaehyun Jun
Byoung-Tak Zhang
AIMat
139
880
0
21 May 2018
Defoiling Foiled Image Captions
Pranava Madhyastha
Josiah Wang
Lucia Specia
65
9
0
16 May 2018
Stories for Images-in-Sequence by using Visual and Narrative Components
Marko Smilevski
Ilija Lalkovski
Gjorgji Madjarov
45
19
0
15 May 2018
Did the Model Understand the Question?
Pramod Kaushik Mudrakarta
Ankur Taly
Mukund Sundararajan
Kedar Dhamdhere
ELM
OOD
FAtt
85
200
0
14 May 2018
Domain Adapted Word Embeddings for Improved Sentiment Classification
P. Sarma
Yingyu Liang
W. Sethares
66
67
0
11 May 2018
Previous
1
2
3
...
51
52
53
...
58
59
60
Next