ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.02274
  4. Cited By
Stacked Attention Networks for Image Question Answering

Stacked Attention Networks for Image Question Answering

7 November 2015
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
    BDL
ArXivPDFHTML

Papers citing "Stacked Attention Networks for Image Question Answering"

50 / 277 papers shown
Title
Deep Ordinal Hashing with Spatial Attention
Deep Ordinal Hashing with Spatial Attention
Lu Jin
Xiangbo Shu
Kai Li
Zechao Li
Guo-Jun Qi
Jinhui Tang
43
78
0
07 May 2018
Customized Image Narrative Generation via Interactive Visual Question
  Generation and Answering
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
42
7
0
27 Apr 2018
Learn To Pay Attention
Learn To Pay Attention
Saumya Jetley
Nicholas A. Lord
Namhoon Lee
Philip Torr
67
437
0
06 Apr 2018
Improved Fusion of Visual and Language Representations by Dense
  Symmetric Co-Attention for Visual Question Answering
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
24
279
0
03 Apr 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts
Unsupervised Textual Grounding: Linking Words to Image Concepts
Raymond A. Yeh
Minh Do
A. Schwing
22
40
0
29 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering
Motion-Appearance Co-Memory Networks for Video Question Answering
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
41
240
0
29 Mar 2018
Referring Relationships
Referring Relationships
Ranjay Krishna
Ines Chami
Michael S. Bernstein
Li Fei-Fei
30
94
0
28 Mar 2018
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual
  Questions
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Qing Li
Qingyi Tao
Chenyu You
Jianfei Cai
Jiebo Luo
34
106
0
20 Mar 2018
Transparency by Design: Closing the Gap Between Performance and
  Interpretability in Visual Reasoning
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
David Mascharka
Philip Tran
Ryan Soklaski
Arjun Majumdar
36
207
0
14 Mar 2018
Video Object Segmentation with Joint Re-identification and
  Attention-Aware Mask Propagation
Video Object Segmentation with Joint Re-identification and Attention-Aware Mask Propagation
Xiaoxiao Li
Chen Change Loy
VOS
22
196
0
12 Mar 2018
Compositional Attention Networks for Machine Reasoning
Compositional Attention Networks for Machine Reasoning
Drew A. Hudson
Christopher D. Manning
BDL
OOD
LRM
32
572
0
08 Mar 2018
Multimodal Explanations: Justifying Decisions and Pointing to the
  Evidence
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
37
419
0
15 Feb 2018
Interactive Grounded Language Acquisition and Generalization in a 2D
  World
Interactive Grounded Language Acquisition and Generalization in a 2D World
Haonan Yu
Haichao Zhang
Wenyuan Xu
LLMAG
LM&Ro
30
77
0
31 Jan 2018
Object-based reasoning in VQA
Object-based reasoning in VQA
Mikyas T. Desta
Larry Chen
Tomasz Kornuta
32
33
0
29 Jan 2018
Tell-and-Answer: Towards Explainable Visual Question Answering using
  Attributes and Captions
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
Qing Li
Jianlong Fu
D. Yu
Tao Mei
Jiebo Luo
FAtt
XAI
CoGe
51
60
0
27 Jan 2018
DVQA: Understanding Data Visualizations via Question Answering
DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle
Brian L. Price
Scott D. Cohen
Christopher Kanan
AIMat
33
363
0
24 Jan 2018
Incorporating External Knowledge to Answer Open-Domain Visual Questions
  with Dynamic Memory Networks
Incorporating External Knowledge to Answer Open-Domain Visual Questions with Dynamic Memory Networks
Guohao Li
Hang Su
Wenwu Zhu
38
46
0
03 Dec 2017
Don't Just Assume; Look and Answer: Overcoming Priors for Visual
  Question Answering
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
OOD
58
582
0
01 Dec 2017
HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal
  Retrieval
HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal Retrieval
Xi Zhang
Siyu Zhou
Jiashi Feng
Hanjiang Lai
Bo Li
Yan Pan
Jian Yin
Bo An
GAN
32
55
0
26 Nov 2017
Regularizing Deep Neural Networks by Noise: Its Interpretation and
  Optimization
Regularizing Deep Neural Networks by Noise: Its Interpretation and Optimization
Hyeonwoo Noh
Tackgeun You
Jonghwan Mun
Bohyung Han
NoLa
29
197
0
14 Oct 2017
Survey of Recent Advances in Visual Question Answering
Survey of Recent Advances in Visual Question Answering
Supriya Pandhre
Shagun Sodhani
10
14
0
24 Sep 2017
FiLM: Visual Reasoning with a General Conditioning Layer
FiLM: Visual Reasoning with a General Conditioning Layer
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
110
2,154
0
22 Sep 2017
Exploring Human-like Attention Supervision in Visual Question Answering
Exploring Human-like Attention Supervision in Visual Question Answering
Tingting Qiao
Jianfeng Dong
Duanqing Xu
19
104
0
19 Sep 2017
Multi-scale Deep Learning Architectures for Person Re-identification
Multi-scale Deep Learning Architectures for Person Re-identification
Xuelin Qian
Yanwei Fu
Yu-Gang Jiang
Tao Xiang
Xiangyang Xue
36
276
0
15 Sep 2017
Variational Reasoning for Question Answering with Knowledge Graph
Variational Reasoning for Question Answering with Knowledge Graph
Yuyu Zhang
H. Dai
Zornitsa Kozareva
Alex Smola
Le Song
31
467
0
12 Sep 2017
VQS: Linking Segmentations to Questions and Answers for Supervised
  Attention in VQA and Question-Focused Semantic Segmentation
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
Chuang Gan
Yandong Li
Haoxiang Li
Chen Sun
Boqing Gong
27
126
0
15 Aug 2017
Tips and Tricks for Visual Question Answering: Learnings from the 2017
  Challenge
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge
Damien Teney
Peter Anderson
Xiaodong He
Anton Van Den Hengel
50
380
0
09 Aug 2017
GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled
  Images
GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled Images
Avi Singh
Larry Yang
Sergey Levine
22
23
0
07 Aug 2017
Structured Attentions for Visual Question Answering
Structured Attentions for Visual Question Answering
Chen Zhu
Yanpeng Zhao
Shuaiyi Huang
Kewei Tu
Yi Ma
FAtt
32
106
0
07 Aug 2017
Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for
  Visual Question Answering
Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering
Zhou Yu
Jun-chen Yu
Jianping Fan
Dacheng Tao
41
663
0
04 Aug 2017
Dual-Glance Model for Deciphering Social Relationships
Dual-Glance Model for Deciphering Social Relationships
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
16
78
0
02 Aug 2017
Order-Free RNN with Visual Attention for Multi-Label Classification
Order-Free RNN with Visual Attention for Multi-Label Classification
Shang-Fu Chen
Yi-Chen Chen
Chih-Kuan Yeh
Y. Wang
28
142
0
18 Jul 2017
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis
  Network
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network
Zizhao Zhang
Yuanpu Xie
Fuyong Xing
M. McGough
L. Yang
MedIm
21
301
0
08 Jul 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning
  to a Generative Visual Dialog Model
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
Jiasen Lu
A. Kannan
Jianwei Yang
Devi Parikh
Dhruv Batra
BDL
38
136
0
05 Jun 2017
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Jingkuan Song
Zhao Guo
Lianli Gao
Wu Liu
Dongxiang Zhang
Heng Tao Shen
45
166
0
05 Jun 2017
Multimodal Machine Learning: A Survey and Taxonomy
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
15
2,865
0
26 May 2017
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
H. Ben-younes
Rémi Cadène
Matthieu Cord
Nicolas Thome
67
578
0
18 May 2017
Survey of Visual Question Answering: Datasets and Techniques
Survey of Visual Question Answering: Datasets and Techniques
A. Gupta
18
38
0
10 May 2017
The Forgettable-Watcher Model for Video Question Answering
The Forgettable-Watcher Model for Video Question Answering
Hongyang Xue
Zhou Zhao
Deng Cai
21
9
0
03 May 2017
Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question
  Answering
Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering
V. Kazemi
Ali Elqursh
OOD
28
183
0
11 Apr 2017
AMC: Attention guided Multi-modal Correlation Learning for Image Search
AMC: Attention guided Multi-modal Correlation Learning for Image Search
Kan Chen
Trung Bui
Chen Fang
Zhaowen Wang
Ram Nevatia
35
38
0
03 Apr 2017
It Takes Two to Tango: Towards Theory of AI's Mind
It Takes Two to Tango: Towards Theory of AI's Mind
Arjun Chandrasekaran
Deshraj Yadav
Prithvijit Chattopadhyay
Viraj Prabhu
Devi Parikh
38
54
0
03 Apr 2017
An Analysis of Visual Question Answering Algorithms
An Analysis of Visual Question Answering Algorithms
Kushal Kafle
Christopher Kanan
30
230
0
28 Mar 2017
Recurrent Multimodal Interaction for Referring Image Segmentation
Recurrent Multimodal Interaction for Referring Image Segmentation
Chenxi Liu
Zhe-nan Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Alan Yuille
EgoV
36
234
0
23 Mar 2017
VQABQ: Visual Question Answering by Basic Questions
VQABQ: Visual Question Answering by Basic Questions
Jia-Hong Huang
Modar Alfadly
Guohao Li
27
24
0
19 Mar 2017
Multi-Context Attention for Human Pose Estimation
Multi-Context Attention for Human Pose Estimation
Xiao Chu
Wei Yang
Wanli Ouyang
Cheng Ma
Alan Yuille
Xiaogang Wang
3DH
36
640
0
24 Feb 2017
Task-driven Visual Saliency and Attention-based Visual Question
  Answering
Task-driven Visual Saliency and Attention-based Visual Question Answering
Yuetan Lin
Zhangyang Pang
Donghui Wang
Yueting Zhuang
35
26
0
22 Feb 2017
Learning Spatial Regularization with Image-level Supervisions for
  Multi-label Image Classification
Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification
Feng Zhu
Hongsheng Li
Wanli Ouyang
Nenghai Yu
Xiaogang Wang
30
337
0
20 Feb 2017
Person Search with Natural Language Description
Person Search with Natural Language Description
Shuang Li
Tong Xiao
Hongsheng Li
Bolei Zhou
Dayu Yue
Xiaogang Wang
24
386
0
19 Feb 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,503
0
25 Jan 2017
Previous
123456
Next