ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Information Maximizing Visual Question Generation
Information Maximizing Visual Question Generation
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
131
95
0
27 Mar 2019
Improve Diverse Text Generation by Self Labeling Conditional Variational
  Auto Encoder
Improve Diverse Text Generation by Self Labeling Conditional Variational Auto Encoder
Yuchi Zhang
Yongliang Wang
Liping Zhang
Qing Cui
Kun Gai
48
19
0
26 Mar 2019
Attention Based Glaucoma Detection: A Large-scale Database and CNN Model
Attention Based Glaucoma Detection: A Large-scale Database and CNN Model
Liu Li
Mai Xu
Xiaofei Wang
Lai Jiang
Hanruo Liu
101
205
0
26 Mar 2019
SRM : A Style-based Recalibration Module for Convolutional Neural
  Networks
SRM : A Style-based Recalibration Module for Convolutional Neural Networks
HyunJae Lee
Hyo-Eun Kim
Hyeonseob Nam
77
228
0
26 Mar 2019
Learning Where to See: A Novel Attention Model for Automated
  Immunohistochemical Scoring
Learning Where to See: A Novel Attention Model for Automated Immunohistochemical Scoring
Talha Qaiser
Nasir M. Rajpoot
70
71
0
26 Mar 2019
AlphaX: eXploring Neural Architectures with Deep Neural Networks and
  Monte Carlo Tree Search
AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
Linnan Wang
Yiyang Zhao
Yuu Jinnai
Yuandong Tian
Rodrigo Fonseca
BDL
103
96
0
26 Mar 2019
Unpaired Image Captioning via Scene Graph Alignments
Unpaired Image Captioning via Scene Graph Alignments
Jiuxiang Gu
Shafiq Joty
Jianfei Cai
Handong Zhao
Xu Yang
G. Wang
GNN
107
176
0
26 Mar 2019
Knowledge-driven Encode, Retrieve, Paraphrase for Medical Image Report
  Generation
Knowledge-driven Encode, Retrieve, Paraphrase for Medical Image Report Generation
Yuan Li
Xiaodan Liang
Zhiting Hu
Eric Xing
MedIm
81
277
0
25 Mar 2019
End-to-End Learning Using Cycle Consistency for Image-to-Caption
  Transformations
End-to-End Learning Using Cycle Consistency for Image-to-Caption Transformations
Keisuke Hagiwara
Yusuke Mukuta
Tatsuya Harada
35
0
0
25 Mar 2019
Periphery-Fovea Multi-Resolution Driving Model guided by Human Attention
Periphery-Fovea Multi-Resolution Driving Model guided by Human Attention
Ye Xia
Jinkyu Kim
John F. Canny
K. Zipser
D. Whitney
71
51
0
24 Mar 2019
Attention-based Convolutional Neural Network for Weakly Labeled Human
  Activities Recognition with Wearable Sensors
Attention-based Convolutional Neural Network for Weakly Labeled Human Activities Recognition with Wearable Sensors
Kun Wang
Jun He
Lefei Zhang
HAI
69
148
0
24 Mar 2019
Learning with Sets in Multiple Instance Regression Applied to Remote
  Sensing
Learning with Sets in Multiple Instance Regression Applied to Remote Sensing
Thomas Uriot
23
5
0
18 Mar 2019
Neural Sequential Phrase Grounding (SeqGROUND)
Neural Sequential Phrase Grounding (SeqGROUND)
Pelin Dogan
Leonid Sigal
Markus Gross
ObjD
85
52
0
18 Mar 2019
Boosted Attention: Leveraging Human Attention for Image Captioning
Boosted Attention: Leveraging Human Attention for Image Captioning
Shi Chen
Qi Zhao
81
47
0
18 Mar 2019
Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition
Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition
Johannes Michael
R. Labahn
Tobias Grüning
Jochen Zöllner
173
115
0
18 Mar 2019
A Weighted Multi-Criteria Decision Making Approach for Image Captioning
A Weighted Multi-Criteria Decision Making Approach for Image Captioning
Hassan Maleki Galandouz
M. Moghaddam
M. Shamsfard
26
0
0
17 Mar 2019
Dense Relational Captioning: Triple-Stream Networks for
  Relationship-Based Captioning
Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning
Dong-Jin Kim
Jinsoo Choi
Tae-Hyun Oh
In So Kweon
109
84
0
14 Mar 2019
MirrorGAN: Learning Text-to-image Generation by Redescription
MirrorGAN: Learning Text-to-image Generation by Redescription
Tingting Qiao
Jing Zhang
Duanqing Xu
Dacheng Tao
VLMGAN
67
544
0
14 Mar 2019
Learning Parallax Attention for Stereo Image Super-Resolution
Learning Parallax Attention for Stereo Image Super-Resolution
Longguang Wang
Yingqian Wang
Zhengfa Liang
Zaiping Lin
Jungang Yang
W. An
Yulan Guo
SupR
83
251
0
14 Mar 2019
Pragmatic inference and visual abstraction enable contextual flexibility
  during visual communication
Pragmatic inference and visual abstraction enable contextual flexibility during visual communication
Judith W. Fan
Robert D. Hawkins
Mike Wu
Noah D. Goodman
48
42
0
11 Mar 2019
Spatial-Aware Non-Local Attention for Fashion Landmark Detection
Spatial-Aware Non-Local Attention for Fashion Landmark Detection
Yixin Li
Shengqin Tang
Yun Ye
Jinwen Ma
70
23
0
11 Mar 2019
SR-LSTM: State Refinement for LSTM towards Pedestrian Trajectory
  Prediction
SR-LSTM: State Refinement for LSTM towards Pedestrian Trajectory Prediction
Pu Zhang
Wanli Ouyang
Pengfei Zhang
Jianru Xue
Nanning Zheng
86
464
0
07 Mar 2019
A Character-Level Approach to the Text Normalization Problem Based on a
  New Causal Encoder
A Character-Level Approach to the Text Normalization Problem Based on a New Causal Encoder
Adrián Javaloy Bornás
G. García-Mateos
CML
16
3
0
06 Mar 2019
Image captioning with weakly-supervised attention penalty
Image captioning with weakly-supervised attention penalty
Jiayun Li
M. K. Ebrahimpour
Azadeh Moghtaderi
Yen-Yun Yu
30
5
0
06 Mar 2019
Human Attention in Image Captioning: Dataset and Analysis
Human Attention in Image Captioning: Dataset and Analysis
Sen He
Hamed R. Tavakoli
Ali Borji
N. Pugeault
27
5
0
06 Mar 2019
Persona-Aware Tips Generation
Persona-Aware Tips Generation
Piji Li
Zihao Wang
Lidong Bing
Wai Lam
79
41
0
06 Mar 2019
Crowd Counting Using Scale-Aware Attention Networks
Crowd Counting Using Scale-Aware Attention Networks
M. Hossain
M. Hosseinzadeh
Omit Chanda
Yang Wang
73
132
0
05 Mar 2019
Selective Sensor Fusion for Neural Visual-Inertial Odometry
Selective Sensor Fusion for Neural Visual-Inertial Odometry
Changhao Chen
Stefano Rosa
Yishu Miao
Chris Xiaoxuan Lu
Wei Wu
Andrew Markham
A. Trigoni
66
134
0
04 Mar 2019
Deep Learning for Cognitive Neuroscience
Deep Learning for Cognitive Neuroscience
Katherine R. Storrs
N. Kriegeskorte
NAIAI4CE
80
46
0
04 Mar 2019
COMIC: Towards A Compact Image Captioning Model with Attention
COMIC: Towards A Compact Image Captioning Model with Attention
J. Tan
Chee Seng Chan
Joon Huang Chuah
VLM
101
40
0
04 Mar 2019
Spatiotemporal Pyramid Network for Video Action Recognition
Spatiotemporal Pyramid Network for Video Action Recognition
Yunbo Wang
Mingsheng Long
Jianmin Wang
Philip S. Yu
104
229
0
04 Mar 2019
CAD-Net: A Context-Aware Detection Network for Objects in Remote Sensing
  Imagery
CAD-Net: A Context-Aware Detection Network for Objects in Remote Sensing Imagery
Gongjie Zhang
Shijian Lu
Wei Zhang
108
358
0
03 Mar 2019
Improving Referring Expression Grounding with Cross-modal
  Attention-guided Erasing
Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing
Xihui Liu
Zihao Wang
Jing Shao
Xiaogang Wang
Hongsheng Li
ObjD
114
186
0
03 Mar 2019
Weakly Labelled AudioSet Tagging with Attention Neural Networks
Weakly Labelled AudioSet Tagging with Attention Neural Networks
Qiuqiang Kong
Changsong Yu
Turab Iqbal
Yong-mei Xu
Wenwu Wang
Mark D. Plumbley
NoLa
100
78
0
02 Mar 2019
Learning To Follow Directions in Street View
Learning To Follow Directions in Street View
Karl Moritz Hermann
Mateusz Malinowski
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
R. Hadsell
SSL
81
69
0
01 Mar 2019
Pyramid Feature Attention Network for Saliency detection
Pyramid Feature Attention Network for Saliency detection
Ting Zhao
Xiangqian Wu
81
613
0
01 Mar 2019
AFS: An Attention-based mechanism for Supervised Feature Selection
AFS: An Attention-based mechanism for Supervised Feature Selection
Ning Gui
Danni Ge
Ziyin Hu
60
68
0
28 Feb 2019
Financial series prediction using Attention LSTM
Financial series prediction using Attention LSTM
Sangyeon Kim
Myung-joo Kang
AI4TSHAI
75
52
0
28 Feb 2019
Learning Everywhere: Pervasive Machine Learning for Effective
  High-Performance Computation
Learning Everywhere: Pervasive Machine Learning for Effective High-Performance Computation
Geoffrey C. Fox
J. Glazier
J. Kadupitiya
V. Jadhao
Minje Kim
...
Madhav Marathe
Abhijin Adiga
Jiangzhuo Chen
O. Beckstein
S. Jha
59
53
0
27 Feb 2019
Object-driven Text-to-Image Synthesis via Adversarial Training
Object-driven Text-to-Image Synthesis via Adversarial Training
Wenbo Li
Pengchuan Zhang
Lei Zhang
Qiuyuan Huang
Xiaodong He
Siwei Lyu
Jianfeng Gao
GAN
115
302
0
27 Feb 2019
Attention is not Explanation
Attention is not Explanation
Sarthak Jain
Byron C. Wallace
FAtt
187
1,333
0
26 Feb 2019
Unmasking Clever Hans Predictors and Assessing What Machines Really
  Learn
Unmasking Clever Hans Predictors and Assessing What Machines Really Learn
Sebastian Lapuschkin
S. Wäldchen
Alexander Binder
G. Montavon
Wojciech Samek
K. Müller
127
1,023
0
26 Feb 2019
Generative Visual Dialogue System via Adaptive Reasoning and Weighted
  Likelihood Estimation
Generative Visual Dialogue System via Adaptive Reasoning and Weighted Likelihood Estimation
Heming Zhang
Shalini Ghosh
Larry Heck
Stephen Walsh
Junting Zhang
Jie Zhang
C.-C. Jay Kuo
133
7
0
26 Feb 2019
Learning Implicitly Recurrent CNNs Through Parameter Sharing
Learning Implicitly Recurrent CNNs Through Parameter Sharing
Pedro H. P. Savarese
Michael Maire
96
70
0
26 Feb 2019
MUREL: Multimodal Relational Reasoning for Visual Question Answering
MUREL: Multimodal Relational Reasoning for Visual Question Answering
Rémi Cadène
H. Ben-younes
Matthieu Cord
Nicolas Thome
LRM
95
277
0
25 Feb 2019
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Gi-Cheon Kang
Jaeseo Lim
Byoung-Tak Zhang
56
73
0
25 Feb 2019
Audio Caption: Listen and Tell
Audio Caption: Listen and Tell
Mengyue Wu
Heinrich Dinkel
Kai Yu
108
61
0
25 Feb 2019
Field-aware Neural Factorization Machine for Click-Through Rate
  Prediction
Field-aware Neural Factorization Machine for Click-Through Rate Prediction
Li Zhang
Weichen Shen
Shijian Li
Gang Pan
60
34
0
25 Feb 2019
Leveraging Knowledge Bases in LSTMs for Improving Machine Reading
Leveraging Knowledge Bases in LSTMs for Improving Machine Reading
Bishan Yang
Tom Michael Mitchell
90
248
0
25 Feb 2019
Unsupervised Grounding of Plannable First-Order Logic Representation
  from Images
Unsupervised Grounding of Plannable First-Order Logic Representation from Images
Masataro Asai
NAI
85
58
0
21 Feb 2019
Previous
123...444546...697071
Next