ResearchTrend.AI
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Kelvin Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Image-based table recognition: data, model, and evaluation
Xu Zhong
Elaheh Shafieibavani
Antonio Jimeno Yepes
LMTD
155
223
0
25 Nov 2019
Two Causal Principles for Improving Visual Dialog
Jiaxin Qi
Yulei Niu
Jianqiang Huang
Hanwang Zhang
CML
119
149
0
24 Nov 2019
Unsupervised Keyword Extraction for Full-sentence VQA
Kohei Uehara
Tatsuya Harada
32
1
0
23 Nov 2019
CRUR: Coupled-Recurrent Unit for Unification, Conceptualization and Context Capture for Language Representation -- A Generalization of Bi Directional LSTM
C. Sur
BDL
62
6
0
22 Nov 2019
Injecting Prior Knowledge into Image Caption Generation
A. Goel
Basura Fernando
Thanh-Son Nguyen
Hakan Bilen
44
0
0
22 Nov 2019
Orderless Recurrent Models for Multi-label Classification
V. O. Yazici
Abel Gonzalez-Garcia
Arnau Ramisa
Bartlomiej Twardowski
Joost van de Weijer
SSL
117
93
0
22 Nov 2019
Separate and Attend in Personal Email Search
Yu Meng
Maryam Karimzadehgan
Honglei Zhuang
Donald Metzler
FedML
431
2
0
21 Nov 2019
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications
Arda Senocak
Tae-Hyun Oh
Junsik Kim
Ming-Hsuan Yang
In So Kweon
SSL
86
55
0
20 Nov 2019
Controlling Neural Machine Translation Formality with Synthetic Supervision
Xing Niu
Marine Carpuat
83
35
0
20 Nov 2019
Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA
Badri N. Patro
Anupriy
Vinay P. Namboodiri
AAML, FAtt
85
26
0
19 Nov 2019
Influence-aware Memory Architectures for Deep Reinforcement Learning
Miguel Suau
Jinke He
E. Congeduti
Rolf A. N. Starre
A. Czechowski
F. Oliehoek
57
5
0
18 Nov 2019
Co-Attentive Equivariant Neural Networks: Focusing Equivariance On Transformations Co-Occurring In Data
David W. Romero
Mark Hoogendoorn
72
24
0
18 Nov 2019
FFA-Net: Feature Fusion Attention Network for Single Image Dehazing
Xu Qin
Zhiling Wang
Yuanchao Bai
Xiaodong Xie
Huizhu Jia
151
1,369
0
18 Nov 2019
Towards Making Deep Transfer Learning Never Hurt
Ruosi Wan
Haoyi Xiong
Xingjian Li
Zhanxing Zhu
Jun Huan
69
21
0
18 Nov 2019
Deep Verifier Networks: Verification of Deep Discriminative Models with Deep Generative Models
Tong Che
Xiaofeng Liu
Site Li
Yubin Ge
Ruixiang Zhang
Caiming Xiong
Yoshua Bengio
122
52
0
18 Nov 2019
Putting visual object recognition in context
Mengmi Zhang
Claire Tseng
Gabriel Kreiman
103
53
0
17 Nov 2019
A3GAN: An Attribute-aware Attentive Generative Adversarial Network for Face Aging
Yunfan Liu
Qi Li
Zhenan Sun
Tieniu Tan
GAN, CVBM
33
32
0
15 Nov 2019
Sequence-to-Set Semantic Tagging: End-to-End Multi-label Prediction using Neural Attention for Complex Query Reformulation and Automated Text Categorization
Manirupa Das
Juanxi Li
Eric Fosler-Lussier
Simon M. Lin
Soheil Moosavinasab
S. Rust
Yungui Huang
R. Ramnath
20
1
0
11 Nov 2019
Conditionally Learn to Pay Attention for Sequential Visual Task
Jun He
Quan-Jie Cao
Lei Zhang
49
0
0
11 Nov 2019
Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication
Ruize Wang
Zhongyu Wei
Ying Cheng
Piji Li
Haijun Shan
Ji Zhang
Qi Zhang
Xuanjing Huang
VGen, DiffM
86
13
0
11 Nov 2019
Constructing Gradient Controllable Recurrent Neural Networks Using Hamiltonian Dynamics
Konstantin Rusch
J. Pearson
K. Zygalakis
45
0
0
11 Nov 2019
Transfer Value Iteration Networks
Junyi Shen
H. Zhuo
Jin Xu
Bin Zhong
Sinno Jialin Pan
36
7
0
11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI, AI4TS
135
338
0
10 Nov 2019
Can Neural Image Captioning be Controlled via Forced Attention?
P. Sadler
Tatjana Scheffler
David Schlangen
45
3
0
10 Nov 2019
Two-Headed Monster And Crossed Co-Attention Networks
Yaoyiran Li
Jing Jiang
72
0
0
10 Nov 2019
Distilling Knowledge Learned in BERT for Text Generation
Yen-Chun Chen
Zhe Gan
Yu Cheng
Jingzhou Liu
Jingjing Liu
82
28
0
10 Nov 2019
Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries
Fuwen Tan
Paola Cascante-Bonilla
Xiaoxiao Guo
Hui Wu
Song Feng
Vicente Ordonez
68
30
0
10 Nov 2019
On Architectures for Including Visual Information in Neural Language Models for Image Description
Marc Tanti
Albert Gatt
K. Camilleri
VLM
55
2
0
09 Nov 2019
Making the Best Use of Review Summary for Sentiment Analysis
Sen Yang
Leyang Cui
Jun Xie
Yue Zhang
54
0
0
07 Nov 2019
Interpretable Self-Attention Temporal Reasoning for Driving Behavior Understanding
Yi-Chieh Liu
Yung-An Hsieh
Min-Hung Chen
Chao-Han Huck Yang
Jesper N. Tegnér
Y. Tsai
89
19
0
06 Nov 2019
Attributed Sequence Embedding
Zhongfang Zhuang
Xiangnan Kong
Elke A. Rundensteiner
Jihane Zouaoui
Aditya Arora
182
12
0
03 Nov 2019
Sequence Modeling with Unconstrained Generation Order
Dmitrii Emelianenko
Elena Voita
P. Serdyukov
106
18
0
01 Nov 2019
Can adversarial training learn image captioning ?
Jean-Benoit Delbrouck
Bastien Vanderplaetse
Stéphane Dupont
GAN, VLM
62
1
0
31 Oct 2019
Image-Conditioned Graph Generation for Road Network Extraction
Davide Belli
Thomas Kipf
GNN
63
40
0
31 Oct 2019
A Self Validation Network for Object-Level Human Attention Estimation
Zehua Zhang
Chen Yu
David J. Crandall
EgoV
102
10
0
31 Oct 2019
Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder
Jialin Wu
Raymond J. Mooney
59
0
0
31 Oct 2019
Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
Risto Vuorio
Shao-Hua Sun
Hexiang Hu
Joseph J. Lim
108
218
0
30 Oct 2019
Style Mixer: Semantic-aware Multi-Style Transfer Network
Zixuan Huang
Jinghuai Zhang
Jing Liao
127
19
0
29 Oct 2019
On Generalization Bounds of a Family of Recurrent Neural Networks
Minshuo Chen
Xingguo Li
T. Zhao
98
71
0
28 Oct 2019
Few-shot Video-to-Video Synthesis
Ting-Chun Wang
Ming-Yuan Liu
Andrew Tao
Guilin Liu
Jan Kautz
Bryan Catanzaro
DiffM, VGen
169
370
0
28 Oct 2019
Input-Cell Attention Reduces Vanishing Saliency of Recurrent Neural Networks
Aya Abdelsalam Ismail
Mohamed K. Gunady
L. Pessoa
H. C. Bravo
Soheil Feizi
AI4TS
113
50
0
27 Oct 2019
CXPlain: Causal Explanations for Model Interpretation under Uncertainty
Patrick Schwab
W. Karlen
FAtt, CML
157
211
0
27 Oct 2019
Attention for Inference Compilation
William Harvey
Andreas Munk
A. G. Baydin
Alexander Bergholm
Frank Wood
61
9
0
25 Oct 2019
Automatic Reminiscence Therapy for Dementia
Mariona Carós
M. Garolera
Petia Radeva
Xavier Giró-i-Nieto
76
41
0
25 Oct 2019
Attention Optimization for Abstractive Document Summarization
Min Gui
Junfeng Tian
Rui Wang
Zhenglu Yang
71
18
0
25 Oct 2019
Heterogeneous Graph Learning for Visual Commonsense Reasoning
Weijiang Yu
Jingwen Zhou
Weihao Yu
Xiaodan Liang
Nong Xiao
LRM
79
47
0
25 Oct 2019
Controllable Attention for Structured Layered Video Decomposition
Jean-Baptiste Alayrac
João Carreira
Relja Arandjelović
Andrew Zisserman
60
10
0
24 Oct 2019
Assisting human experts in the interpretation of their visual process: A case study on assessing copper surface adhesive potency
T. Hascoet
Xuejiao Deng
Daniela Mihai
Mari Sugiyama
Yuji Adachi
Sachiko Nakamura
Jonathon S. Hare
Tomoko Hayashi
T. Takiguchi
20
1
0
24 Oct 2019
KnowIT VQA: Answering Knowledge-Based Questions about Videos
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
152
80
0
23 Oct 2019
Towards an Intelligent Microscope: adaptively learned illumination for optimal sample classification
A. Chaware
Colin L. Cooke
Kanghyun Kim
R. Horstmeyer
60
11
0
22 Oct 2019