arXiv:1511.05234
Cited By
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
17 November 2015
Huijuan Xu
Kate Saenko
Papers citing
"Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering"
50 / 141 papers shown
Title
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
Zihao Wang
Xihui Liu
Hongsheng Li
Lu Sheng
Junjie Yan
Xiaogang Wang
Jing Shao
VLM
25
299
0
12 Sep 2019
PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression
Sicheng Zhao
Zizhou Jia
Hui Chen
Leida Li
Guiguang Ding
Kurt Keutzer
36
62
0
11 Sep 2019
Probabilistic framework for solving Visual Dialog
Badri N. Patro
Anupriy
Vinay P. Namboodiri
BDL
30
13
0
11 Sep 2019
A Better Way to Attend: Attention with Trees for Video Question Answering
Hongyang Xue
Wenqing Chu
Zhou Zhao
Deng Cai
25
33
0
05 Sep 2019
U-CAM: Visual Explanation using Uncertainty based Class Activation Maps
Badri N. Patro
Mayank Lunayach
Shivansh Patel
Vinay P. Namboodiri
FAtt
UQCV
27
76
0
17 Aug 2019
HA-CCN: Hierarchical Attention-based Crowd Counting Network
Vishwanath A. Sindagi
Vishal M. Patel
20
187
0
24 Jul 2019
Compact Global Descriptor for Neural Networks
Xiangyu He
Ke Cheng
Qiang Chen
Qinghao Hu
Peisong Wang
Jian Cheng
31
8
0
23 Jul 2019
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments
K. Niu
Y. Huang
Wanli Ouyang
Liang Wang
27
138
0
23 Jun 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
21
37
0
29 May 2019
Multimodal Transformer with Multi-View Visual Representation for Image Captioning
Jun-chen Yu
Jing Li
Zhou Yu
Qingming Huang
ViT
27
377
0
20 May 2019
HAR-Net: Joint Learning of Hybrid Attention for Single-stage Object Detection
Yali Li
Shengjin Wang
22
32
0
25 Apr 2019
MAANet: Multi-view Aware Attention Networks for Image Super-Resolution
Jingcai Guo
Shiheng Ma
Song Guo
SupR
11
5
0
12 Apr 2019
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
A. Schwing
24
110
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
27
69
0
11 Apr 2019
MirrorGAN: Learning Text-to-image Generation by Redescription
Tingting Qiao
Jing Zhang
Duanqing Xu
Dacheng Tao
VLM
GAN
33
538
0
14 Mar 2019
Pyramid Feature Attention Network for Saliency detection
Ting Zhao
Xiangqian Wu
17
607
0
01 Mar 2019
PiCANet: Pixel-wise Contextual Attention Learning for Accurate Saliency Detection
Nian Liu
Junwei Han
Ming-Hsuan Yang
SSeg
36
99
0
15 Dec 2018
Complete the Look: Scene-based Complementary Product Recommendation
Wang-Cheng Kang
Eric Kim
J. Leskovec
Charles R. Rosenberg
Julian McAuley
27
76
0
04 Dec 2018
Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization
Siyuan Qiao
Zhe-nan Lin
Jianming Zhang
Alan Yuille
8
23
0
02 Dec 2018
Traversing the Continuous Spectrum of Image Retrieval with Deep Dynamic Models
Ziad Al-Halah
Andreas M. Lehrmann
Leonid Sigal
21
0
0
01 Dec 2018
Holistic Multi-modal Memory Network for Movie Question Answering
Anran Wang
Anh Tuan Luu
Chuan-Sheng Foo
Erik Cambria
Yi Tay
V. Chandrasekhar
36
20
0
12 Nov 2018
Interpretable Spatio-temporal Attention for Video Action Recognition
Lili Meng
Bo Zhao
B. Chang
Gao Huang
Wei Sun
Fred Tung
Leonid Sigal
33
83
0
01 Oct 2018
Channel-wise and Spatial Feature Modulation Network for Single Image Super-Resolution
Yanting Hu
Jie Li
Yuanfei Huang
Xinbo Gao
SupR
25
256
0
28 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
33
55
0
06 Sep 2018
Diverse and Coherent Paragraph Generation from Images
Moitreya Chatterjee
A. Schwing
19
66
0
03 Sep 2018
Recurrent Fusion Network for Image Captioning
Wenhao Jiang
Lin Ma
Yu-Gang Jiang
Wei Liu
Tong Zhang
ObjD
33
233
0
26 Jul 2018
DeepDiff: Deep-learning for predicting Differential gene expression from histone modifications
Arshdeep Sekhon
Ritambhara Singh
Yanjun Qi
19
52
0
10 Jul 2018
Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Chao Yang
Taehwan Kim
Ruizhe Wang
Hao Peng
C.-C. Jay Kuo
28
51
0
16 Jun 2018
Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su
Chen Zhu
Yinpeng Dong
Dongqi Cai
Yurong Chen
Jianguo Li
34
62
0
13 Jun 2018
RetainVis: Visual Analytics with Interpretable and Interactive Recurrent Neural Networks on Electronic Medical Records
Bum Chul Kwon
Min-Je Choi
J. Kim
Edward Choi
Young Bin Kim
Soonwook Kwon
Jimeng Sun
Jaegul Choo
36
251
0
28 May 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
Pan Lu
Lei Ji
Wei Zhang
Nan Duan
M. Zhou
Jianyong Wang
CoGe
25
79
0
24 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning
YunLong Yu
Zhong Ji
Yanwei Fu
Jichang Guo
Yanwei Pang
Zhongfei Zhang
VLM
24
27
0
21 May 2018
Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation
Jichao Zhang
Yezhi Shu
Songhua Xu
Gongze Cao
Fan Zhong
Meng Liu
Xueying Qin
CVBM
35
35
0
19 May 2018
Attention-Aware Compositional Network for Person Re-identification
Jing Xu
Rui Zhao
Feng Zhu
Huaming Wang
Wanli Ouyang
CVBM
42
443
0
09 May 2018
Deep Ordinal Hashing with Spatial Attention
Lu Jin
Xiangbo Shu
Kai Li
Zechao Li
Guo-Jun Qi
Jinhui Tang
43
78
0
07 May 2018
Learn To Pay Attention
Saumya Jetley
Nicholas A. Lord
Namhoon Lee
Philip Torr
67
437
0
06 Apr 2018
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
24
279
0
03 Apr 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts
Raymond A. Yeh
Minh Do
A. Schwing
22
40
0
29 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
41
240
0
29 Mar 2018
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Qing Li
Qingyi Tao
Chenyu You
Jianfei Cai
Jiebo Luo
34
106
0
20 Mar 2018
Attention-GAN for Object Transfiguration in Wild Images
Xinyuan Chen
Chang Xu
Xiaokang Yang
Dacheng Tao
32
176
0
19 Mar 2018
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
30
29
0
16 Mar 2018
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
David Mascharka
Philip Tran
Ryan Soklaski
Arjun Majumdar
39
207
0
14 Mar 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan
Boyang Albert Li
Leonid Sigal
Markus Gross
AI4TS
30
19
0
19 Feb 2018
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
37
419
0
15 Feb 2018
Object-based reasoning in VQA
Mikyas T. Desta
Larry Chen
Tomasz Kornuta
32
33
0
29 Jan 2018
Game of Sketches: Deep Recurrent Models of Pictionary-style Word Guessing
Ravi Kiran Sarvadevabhatla
Shiv Surya
Trisha Mittal
Venkatesh Babu Radhakrishnan
21
14
0
29 Jan 2018
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
Qing Li
Jianlong Fu
D. Yu
Tao Mei
Jiebo Luo
FAtt
XAI
CoGe
51
60
0
27 Jan 2018
Incorporating External Knowledge to Answer Open-Domain Visual Questions with Dynamic Memory Networks
Guohao Li
Hang Su
Wenwu Zhu
38
46
0
03 Dec 2017
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
OOD
61
582
0
01 Dec 2017