ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05234
  4. Cited By
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for
  Visual Question Answering

Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering

17 November 2015
Huijuan Xu
Kate Saenko
ArXivPDFHTML

Papers citing "Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering"

50 / 141 papers shown
Title
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
Zihao Wang
Xihui Liu
Hongsheng Li
Lu Sheng
Junjie Yan
Xiaogang Wang
Jing Shao
VLM
25
299
0
12 Sep 2019
PDANet: Polarity-consistent Deep Attention Network for Fine-grained
  Visual Emotion Regression
PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression
Sicheng Zhao
Zizhou Jia
Hui Chen
Leida Li
Guiguang Ding
Kurt Keutzer
36
62
0
11 Sep 2019
Probabilistic framework for solving Visual Dialog
Probabilistic framework for solving Visual Dialog
Badri N. Patro
Anupriy
Vinay P. Namboodiri
BDL
30
13
0
11 Sep 2019
A Better Way to Attend: Attention with Trees for Video Question
  Answering
A Better Way to Attend: Attention with Trees for Video Question Answering
Hongyang Xue
Wenqing Chu
Zhou Zhao
Deng Cai
25
33
0
05 Sep 2019
U-CAM: Visual Explanation using Uncertainty based Class Activation Maps
U-CAM: Visual Explanation using Uncertainty based Class Activation Maps
Badri N. Patro
Mayank Lunayach
Shivansh Patel
Vinay P. Namboodiri
FAtt
UQCV
27
76
0
17 Aug 2019
HA-CCN: Hierarchical Attention-based Crowd Counting Network
HA-CCN: Hierarchical Attention-based Crowd Counting Network
Vishwanath A. Sindagi
Vishal M. Patel
20
187
0
24 Jul 2019
Compact Global Descriptor for Neural Networks
Xiangyu He
Ke Cheng
Qiang Chen
Qinghao Hu
Peisong Wang
Jian Cheng
31
8
0
23 Jul 2019
Improving Description-based Person Re-identification by
  Multi-granularity Image-text Alignments
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments
K. Niu
Y. Huang
Wanli Ouyang
Liang Wang
27
138
0
23 Jun 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
21
37
0
29 May 2019
Multimodal Transformer with Multi-View Visual Representation for Image
  Captioning
Multimodal Transformer with Multi-View Visual Representation for Image Captioning
Jun-chen Yu
Jing Li
Zhou Yu
Qingming Huang
ViT
27
377
0
20 May 2019
HAR-Net: Joint Learning of Hybrid Attention for Single-stage Object
  Detection
HAR-Net: Joint Learning of Hybrid Attention for Single-stage Object Detection
Yali Li
Shengjin Wang
22
32
0
25 Apr 2019
MAANet: Multi-view Aware Attention Networks for Image Super-Resolution
MAANet: Multi-view Aware Attention Networks for Image Super-Resolution
Jingcai Guo
Shiheng Ma
Song Guo
SupR
11
5
0
12 Apr 2019
Factor Graph Attention
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
A. Schwing
24
110
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
27
69
0
11 Apr 2019
MirrorGAN: Learning Text-to-image Generation by Redescription
MirrorGAN: Learning Text-to-image Generation by Redescription
Tingting Qiao
Jing Zhang
Duanqing Xu
Dacheng Tao
VLM
GAN
33
538
0
14 Mar 2019
Pyramid Feature Attention Network for Saliency detection
Pyramid Feature Attention Network for Saliency detection
Ting Zhao
Xiangqian Wu
17
607
0
01 Mar 2019
PiCANet: Pixel-wise Contextual Attention Learning for Accurate Saliency
  Detection
PiCANet: Pixel-wise Contextual Attention Learning for Accurate Saliency Detection
Nian Liu
Junwei Han
Ming-Hsuan Yang
SSeg
36
99
0
15 Dec 2018
Complete the Look: Scene-based Complementary Product Recommendation
Complete the Look: Scene-based Complementary Product Recommendation
Wang-Cheng Kang
Eric Kim
J. Leskovec
Charles R. Rosenberg
Julian McAuley
27
76
0
04 Dec 2018
Neural Rejuvenation: Improving Deep Network Training by Enhancing
  Computational Resource Utilization
Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization
Siyuan Qiao
Zhe-nan Lin
Jianming Zhang
Alan Yuille
8
23
0
02 Dec 2018
Traversing the Continuous Spectrum of Image Retrieval with Deep Dynamic
  Models
Traversing the Continuous Spectrum of Image Retrieval with Deep Dynamic Models
Ziad Al-Halah
Andreas M. Lehrmann
Leonid Sigal
21
0
0
01 Dec 2018
Holistic Multi-modal Memory Network for Movie Question Answering
Holistic Multi-modal Memory Network for Movie Question Answering
Anran Wang
Anh Tuan Luu
Chuan-Sheng Foo
Erik Cambria
Yi Tay
V. Chandrasekhar
36
20
0
12 Nov 2018
Interpretable Spatio-temporal Attention for Video Action Recognition
Interpretable Spatio-temporal Attention for Video Action Recognition
Lili Meng
Bo Zhao
B. Chang
Gao Huang
Wei Sun
Fred Tung
Leonid Sigal
33
83
0
01 Oct 2018
Channel-wise and Spatial Feature Modulation Network for Single Image
  Super-Resolution
Channel-wise and Spatial Feature Modulation Network for Single Image Super-Resolution
Yanting Hu
Jie Li
Yuanfei Huang
Xinbo Gao
SupR
25
256
0
28 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
33
55
0
06 Sep 2018
Diverse and Coherent Paragraph Generation from Images
Diverse and Coherent Paragraph Generation from Images
Moitreya Chatterjee
A. Schwing
19
66
0
03 Sep 2018
Recurrent Fusion Network for Image Captioning
Recurrent Fusion Network for Image Captioning
Wenhao Jiang
Lin Ma
Yu-Gang Jiang
Wei Liu
Tong Zhang
ObjD
33
233
0
26 Jul 2018
DeepDiff: Deep-learning for predicting Differential gene expression from
  histone modifications
DeepDiff: Deep-learning for predicting Differential gene expression from histone modifications
Arshdeep Sekhon
Ritambhara Singh
Yanjun Qi
19
52
0
10 Jul 2018
Show, Attend and Translate: Unsupervised Image Translation with
  Self-Regularization and Attention
Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Chao Yang
Taehwan Kim
Ruizhe Wang
Hao Peng
C.-C. Jay Kuo
28
51
0
16 Jun 2018
Learning Visual Knowledge Memory Networks for Visual Question Answering
Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su
Chen Zhu
Yinpeng Dong
Dongqi Cai
Yurong Chen
Jianguo Li
34
62
0
13 Jun 2018
RetainVis: Visual Analytics with Interpretable and Interactive Recurrent
  Neural Networks on Electronic Medical Records
RetainVis: Visual Analytics with Interpretable and Interactive Recurrent Neural Networks on Electronic Medical Records
Bum Chul Kwon
Min-Je Choi
J. Kim
Edward Choi
Young Bin Kim
Soonwook Kwon
Jimeng Sun
Jaegul Choo
36
251
0
28 May 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual
  Question Answering
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
Pan Lu
Lei Ji
Wei Zhang
Nan Duan
M. Zhou
Jianyong Wang
CoGe
25
79
0
24 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot
  Learning
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning
YunLong Yu
Zhong Ji
Yanwei Fu
Jichang Guo
Yanwei Pang
Zhongfei Zhang
VLM
24
27
0
21 May 2018
Sparsely Grouped Multi-task Generative Adversarial Networks for Facial
  Attribute Manipulation
Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation
Jichao Zhang
Yezhi Shu
Songhua Xu
Gongze Cao
Fan Zhong
Meng Liu
Xueying Qin
CVBM
35
35
0
19 May 2018
Attention-Aware Compositional Network for Person Re-identification
Attention-Aware Compositional Network for Person Re-identification
Jing Xu
Rui Zhao
Feng Zhu
Huaming Wang
Wanli Ouyang
CVBM
42
443
0
09 May 2018
Deep Ordinal Hashing with Spatial Attention
Deep Ordinal Hashing with Spatial Attention
Lu Jin
Xiangbo Shu
Kai Li
Zechao Li
Guo-Jun Qi
Jinhui Tang
43
78
0
07 May 2018
Learn To Pay Attention
Learn To Pay Attention
Saumya Jetley
Nicholas A. Lord
Namhoon Lee
Philip Torr
67
437
0
06 Apr 2018
Improved Fusion of Visual and Language Representations by Dense
  Symmetric Co-Attention for Visual Question Answering
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
24
279
0
03 Apr 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts
Unsupervised Textual Grounding: Linking Words to Image Concepts
Raymond A. Yeh
Minh Do
A. Schwing
22
40
0
29 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering
Motion-Appearance Co-Memory Networks for Video Question Answering
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
41
240
0
29 Mar 2018
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual
  Questions
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Qing Li
Qingyi Tao
Chenyu You
Jianfei Cai
Jiebo Luo
34
106
0
20 Mar 2018
Attention-GAN for Object Transfiguration in Wild Images
Attention-GAN for Object Transfiguration in Wild Images
Xinyuan Chen
Chang Xu
Xiaokang Yang
Dacheng Tao
32
176
0
19 Mar 2018
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis
  Tool
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
30
29
0
16 Mar 2018
Transparency by Design: Closing the Gap Between Performance and
  Interpretability in Visual Reasoning
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
David Mascharka
Philip Tran
Ryan Soklaski
Arjun Majumdar
39
207
0
14 Mar 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan
Boyang Albert Li
Leonid Sigal
Markus Gross
AI4TS
30
19
0
19 Feb 2018
Multimodal Explanations: Justifying Decisions and Pointing to the
  Evidence
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
37
419
0
15 Feb 2018
Object-based reasoning in VQA
Object-based reasoning in VQA
Mikyas T. Desta
Larry Chen
Tomasz Kornuta
32
33
0
29 Jan 2018
Game of Sketches: Deep Recurrent Models of Pictionary-style Word
  Guessing
Game of Sketches: Deep Recurrent Models of Pictionary-style Word Guessing
Ravi Kiran Sarvadevabhatla
Shiv Surya
Trisha Mittal
Venkatesh Babu Radhakrishnan
21
14
0
29 Jan 2018
Tell-and-Answer: Towards Explainable Visual Question Answering using
  Attributes and Captions
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
Qing Li
Jianlong Fu
D. Yu
Tao Mei
Jiebo Luo
FAtt
XAI
CoGe
51
60
0
27 Jan 2018
Incorporating External Knowledge to Answer Open-Domain Visual Questions
  with Dynamic Memory Networks
Incorporating External Knowledge to Answer Open-Domain Visual Questions with Dynamic Memory Networks
Guohao Li
Hang Su
Wenwu Zhu
38
46
0
03 Dec 2017
Don't Just Assume; Look and Answer: Overcoming Priors for Visual
  Question Answering
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
OOD
61
582
0
01 Dec 2017
Previous
123
Next