ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
ECG Segmentation by Neural Networks: Errors and Correction
ECG Segmentation by Neural Networks: Errors and Correction
Iana Sereda
Sergey Alekseev
A. Koneva
R. Kataev
Grigory V. Osipov
UQCV
61
27
0
26 Dec 2018
Hierarchical LSTMs with Adaptive Attention for Visual Captioning
Hierarchical LSTMs with Adaptive Attention for Visual Captioning
Jingkuan Song
Xiangpeng Li
Lianli Gao
Heng Tao Shen
111
223
0
26 Dec 2018
3D PersonVLAD: Learning Deep Global Representations for Video-based
  Person Re-identification
3D PersonVLAD: Learning Deep Global Representations for Video-based Person Re-identification
Lin Wu
Yang Wang
Ling Shao
Ming Wang
3DPC
101
94
0
26 Dec 2018
Attention Branch Network: Learning of Attention Mechanism for Visual
  Explanation
Attention Branch Network: Learning of Attention Mechanism for Visual Explanation
Hiroshi Fukui
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
XAIFAtt
95
410
0
25 Dec 2018
TextNet: Irregular Text Reading from Images with an End-to-End Trainable
  Network
TextNet: Irregular Text Reading from Images with an End-to-End Trainable Network
Yipeng Sun
Chengquan Zhang
Zuming Huang
Jiaming Liu
Junyu Han
Errui Ding
67
62
0
24 Dec 2018
AVRA: Automatic Visual Ratings of Atrophy from MRI images using
  Recurrent Convolutional Neural Networks
AVRA: Automatic Visual Ratings of Atrophy from MRI images using Recurrent Convolutional Neural Networks
G. Mårtensson
D. Ferreira
L. Cavallin
J.-Sebastian Muehlboeck
L. Wahlund
Chunliang Wang
E. Westman
82
21
0
23 Dec 2018
End-to-End Classification of Reverberant Rooms using DNNs
End-to-End Classification of Reverberant Rooms using DNNs
C. Papayiannis
C. Evers
Patrick A. Naylor
97
12
0
21 Dec 2018
LEAFAGE: Example-based and Feature importance-based Explanationsfor
  Black-box ML models
LEAFAGE: Example-based and Feature importance-based Explanationsfor Black-box ML models
Ajaya Adhikari
David Tax
R. Satta
M. Faeth
FAtt
122
11
0
21 Dec 2018
A Multi-task Neural Approach for Emotion Attribution, Classification and
  Summarization
A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization
Guoyun Tu
Yanwei Fu
Boyang Albert Li
Jiarui Gao
Yu-Gang Jiang
Xiangyang Xue
36
29
0
21 Dec 2018
nocaps: novel object captioning at scale
nocaps: novel object captioning at scale
Harsh Agrawal
Karan Desai
Yufei Wang
Xinlei Chen
Rishabh Jain
Mark Johnson
Dhruv Batra
Devi Parikh
Stefan Lee
Peter Anderson
VLM
179
488
0
20 Dec 2018
A Comparison of LSTMs and Attention Mechanisms for Forecasting Financial
  Time Series
A Comparison of LSTMs and Attention Mechanisms for Forecasting Financial Time Series
Thomas Hollis
Antoine Viscardi
S. Yi
AI4TS
50
19
0
18 Dec 2018
Toward Multimodal Model-Agnostic Meta-Learning
Toward Multimodal Model-Agnostic Meta-Learning
Risto Vuorio
Shao-Hua Sun
Hexiang Hu
Joseph J. Lim
97
32
0
18 Dec 2018
Attention-based Recurrent Neural Network for Urban Vehicle Trajectory
  Prediction
Attention-based Recurrent Neural Network for Urban Vehicle Trajectory Prediction
Seongjin Choi
Jiwon Kim
H. Yeo
HAI
50
60
0
18 Dec 2018
A Tutorial on Deep Latent Variable Models of Natural Language
A Tutorial on Deep Latent Variable Models of Natural Language
Yoon Kim
Sam Wiseman
Alexander M. Rush
BDLVLM
124
42
0
17 Dec 2018
Attending Category Disentangled Global Context for Image Classification
Keke Tang
Guodong Wei
Runnan Chen
Jie Zhu
Zhaoquan Gu
Wenping Wang
42
0
0
17 Dec 2018
Grounded Video Description
Grounded Video Description
Luowei Zhou
Yannis Kalantidis
Xinlei Chen
Jason J. Corso
Marcus Rohrbach
102
193
0
17 Dec 2018
PiCANet: Pixel-wise Contextual Attention Learning for Accurate Saliency
  Detection
PiCANet: Pixel-wise Contextual Attention Learning for Accurate Saliency Detection
Nian Liu
Junwei Han
Ming-Hsuan Yang
SSeg
99
103
0
15 Dec 2018
Inverse Cooking: Recipe Generation from Food Images
Inverse Cooking: Recipe Generation from Food Images
Amaia Salvador
M. Drozdzal
Xavier Giró-i-Nieto
Adriana Romero
101
148
0
14 Dec 2018
On Attention Modules for Audio-Visual Synchronization
On Attention Modules for Audio-Visual Synchronization
Naji Khosravan
Shervin Ardeshir
R. Puri
73
21
0
14 Dec 2018
Dynamic Graph Modules for Modeling Object-Object Interactions in
  Activity Recognition
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition
Hao Huang
Luowei Zhou
Wei Zhang
Jason J. Corso
Chenliang Xu
59
3
0
13 Dec 2018
Visual Social Relationship Recognition
Visual Social Relationship Recognition
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
59
27
0
13 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual
  Question Answering
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
114
368
0
13 Dec 2018
Coarse-to-fine: A RNN-based hierarchical attention model for vehicle
  re-identification
Coarse-to-fine: A RNN-based hierarchical attention model for vehicle re-identification
Xiu-Shen Wei
Chen-Da Liu-Zhang
Lingqiao Liu
Chunhua Shen
Jianxin Wu
122
43
0
11 Dec 2018
On the Dimensionality of Word Embedding
On the Dimensionality of Word Embedding
Zi Yin
Yuanyuan Shen
83
193
0
11 Dec 2018
Spatial Knowledge Distillation to aid Visual Reasoning
Spatial Knowledge Distillation to aid Visual Reasoning
Somak Aditya
Rudra Saha
Yezhou Yang
Chitta Baral
79
15
0
10 Dec 2018
A Structured Model For Action Detection
A Structured Model For Action Detection
Yubo Zhang
P. Tokmakov
M. Hebert
Cordelia Schmid
122
101
0
09 Dec 2018
Real-Time Referring Expression Comprehension by Single-Stage Grounding
  Network
Real-Time Referring Expression Comprehension by Single-Stage Grounding Network
Xinpeng Chen
Lin Ma
Jingyuan Chen
Zequn Jie
Wen Liu
Jiebo Luo
ObjD
90
113
0
09 Dec 2018
Attend More Times for Image Captioning
Attend More Times for Image Captioning
Jiajun Du
Yu Qin
Hongtao Lu
Yonghua Zhang
VLM
75
5
0
08 Dec 2018
An Attempt towards Interpretable Audio-Visual Video Captioning
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
91
20
0
07 Dec 2018
Verification of deep probabilistic models
Verification of deep probabilistic models
Krishnamurthy Dvijotham
M. Garnelo
Alhussein Fawzi
Pushmeet Kohli
76
23
0
06 Dec 2018
Video Action Transformer Network
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
215
710
0
06 Dec 2018
Recursive Visual Attention in Visual Dialog
Recursive Visual Attention in Visual Dialog
Yulei Niu
Hanwang Zhang
Manli Zhang
Jianhong Zhang
Zhiwu Lu
Ji-Rong Wen
112
119
0
06 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
193
705
0
06 Dec 2018
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Long Chen
Hanwang Zhang
Jun Xiao
Xiangnan He
Shiliang Pu
Shih-Fu Chang
102
159
0
06 Dec 2018
Summarizing Videos with Attention
Summarizing Videos with Attention
Jiri Fajtl
Hajar Sadeghi Sokeh
Vasileios Argyriou
D. Monekosso
Paolo Remagnino
102
191
0
05 Dec 2018
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
140
234
0
05 Dec 2018
Complete the Look: Scene-based Complementary Product Recommendation
Complete the Look: Scene-based Complementary Product Recommendation
Wang-Cheng Kang
Eric Kim
J. Leskovec
Charles R. Rosenberg
Julian McAuley
105
77
0
04 Dec 2018
e-SNLI: Natural Language Inference with Natural Language Explanations
e-SNLI: Natural Language Inference with Natural Language Explanations
Oana-Maria Camburu
Tim Rocktaschel
Thomas Lukasiewicz
Phil Blunsom
LRM
447
643
0
04 Dec 2018
Pre-Defined Sparse Neural Networks with Hardware Acceleration
Pre-Defined Sparse Neural Networks with Hardware Acceleration
Sourya Dey
Kuan-Wen Huang
Peter A. Beerel
K. Chugg
114
25
0
04 Dec 2018
Improving Clinical Predictions through Unsupervised Time Series
  Representation Learning
Improving Clinical Predictions through Unsupervised Time Series Representation Learning
Xinrui Lyu
Matthias Huser
Stephanie L. Hyland
George Zerveas
Gunnar Rätsch
SSLOODAI4TS
76
43
0
02 Dec 2018
Neural Rejuvenation: Improving Deep Network Training by Enhancing
  Computational Resource Utilization
Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization
Siyuan Qiao
Zhe Lin
Jianming Zhang
Alan Yuille
65
23
0
02 Dec 2018
Learning to Caption Images through a Lifetime by Asking Questions
Learning to Caption Images through a Lifetime by Asking Questions
Tingke Shen
Amlan Kar
Sanja Fidler
105
31
0
01 Dec 2018
FineFool: Fine Object Contour Attack via Attention
FineFool: Fine Object Contour Attack via Attention
Jinyin Chen
Haibin Zheng
Hui Xiong
Mengmeng Su
AAML
60
3
0
01 Dec 2018
From Known to the Unknown: Transferring Knowledge to Answer Questions
  about Novel Visual and Semantic Concepts
From Known to the Unknown: Transferring Knowledge to Answer Questions about Novel Visual and Semantic Concepts
M. Farazi
Salman H Khan
Nick Barnes
58
13
0
30 Nov 2018
Deep Multimodal Learning: An Effective Method for Video Classification
Deep Multimodal Learning: An Effective Method for Video Classification
Tianqi Zhao
26
4
0
30 Nov 2018
An Introduction to Deep Reinforcement Learning
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRLAI4CE
179
1,279
0
30 Nov 2018
Generating Easy-to-Understand Referring Expressions for Target
  Identifications
Generating Easy-to-Understand Referring Expressions for Target Identifications
Mikihiro Tanaka
Takayuki Itamochi
Kenichi Narioka
Ikuro Sato
Yoshitaka Ushiku
Tatsuya Harada
74
1
0
29 Nov 2018
MAMNet: Multi-path Adaptive Modulation Network for Image
  Super-Resolution
MAMNet: Multi-path Adaptive Modulation Network for Image Super-Resolution
Jun-Hyuk Kim
Jun-Ho Choi
Manri Cheon
Jong-Seok Lee
SupR
66
49
0
29 Nov 2018
Multi-level Multimodal Common Semantic Space for Image-Phrase Grounding
Multi-level Multimodal Common Semantic Space for Image-Phrase Grounding
Hassan Akbari
Svebor Karaman
Surabhi Bhargava
Brian Chen
Carl Vondrick
Shih-Fu Chang
68
83
0
28 Nov 2018
Neural Sign Language Translation based on Human Keypoint Estimation
Neural Sign Language Translation based on Human Keypoint Estimation
Sang-Ki Ko
Chang Jo Kim
Hyedong Jung
Choongsang Cho
SLR
118
213
0
28 Nov 2018
Previous
123...464748...697071
Next