ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXivPDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,510 papers shown
Title
Visual Storytelling
Visual Storytelling
Ting-Hao 'Kenneth' Huang
Huang
Francis Ferraro
N. Mostafazadeh
Ishan Misra
...
C. L. Zitnick
Devi Parikh
Lucy Vanderwende
Michel Galley
Margaret Mitchell
VGen
22
464
0
13 Apr 2016
Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with
  MDLSTM Attention
Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention
Théodore Bluche
J. Louradour
Ronaldo O. Messina
VLM
24
170
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
25
270
0
10 Apr 2016
Optimizing Performance of Recurrent Neural Networks on GPUs
Optimizing Performance of Recurrent Neural Networks on GPUs
J. Appleyard
Tomás Kociský
Phil Blunsom
25
91
0
07 Apr 2016
Advances in Very Deep Convolutional Neural Networks for LVCSR
Advances in Very Deep Convolutional Neural Networks for LVCSR
Tom Sercu
Vaibhava Goel
14
44
0
06 Apr 2016
Correlated and Individual Multi-Modal Deep Learning for RGB-D Object
  Recognition
Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition
Ziyan Wang
Jiwen Lu
Ruogu Lin
Jianjiang Feng
Jie zhou
23
29
0
06 Apr 2016
Character-Level Neural Translation for Multilingual Media Monitoring in
  the SUMMA Project
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project
Guntis Barzdins
Steve Renals
D. Gosko
18
5
0
05 Apr 2016
Image Captioning with Deep Bidirectional LSTMs
Image Captioning with Deep Bidirectional LSTMs
Cheng Wang
Haojin Yang
Christian Bartz
Christoph Meinel
VLM
12
278
0
04 Apr 2016
Character-Level Question Answering with Attention
Character-Level Question Answering with Attention
David Golub
Xiaodong He
27
184
0
04 Apr 2016
Reasoning About Pragmatics with Neural Listeners and Speakers
Reasoning About Pragmatics with Neural Listeners and Speakers
Jacob Andreas
Dan Klein
ReLM
LRM
18
174
0
02 Apr 2016
Automatic Annotation of Structured Facts in Images
Automatic Annotation of Structured Facts in Images
Mohamed Elhoseiny
Scott D. Cohen
W. Chang
Brian L. Price
Ahmed Elgammal
21
9
0
02 Apr 2016
AttSum: Joint Learning of Focusing and Summarization with Neural
  Attention
AttSum: Joint Learning of Focusing and Summarization with Neural Attention
Ziqiang Cao
Wenjie Li
Sujian Li
Furu Wei
Yanran Li
16
115
0
01 Apr 2016
Neural Attention Models for Sequence Classification: Analysis and
  Application to Key Term Extraction and Dialogue Act Detection
Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection
Sheng-syun Shen
Hung-yi Lee
14
65
0
31 Mar 2016
Minimal Gated Unit for Recurrent Neural Networks
Minimal Gated Unit for Recurrent Neural Networks
Guoxiang Zhou
Jianxin Wu
Chen-Da Liu-Zhang
Zhi-Hua Zhou
31
326
0
31 Mar 2016
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for
  Locally Robust Captioning
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning
Andrew Shin
Masataka Yamaguchi
Katsunori Ohnishi
Tatsuya Harada
53
8
0
30 Mar 2016
Recurrent Batch Normalization
Recurrent Batch Normalization
Tim Cooijmans
Nicolas Ballas
César Laurent
Çağlar Gülçehre
Aaron Courville
ODL
19
410
0
30 Mar 2016
Rich Image Captioning in the Wild
Rich Image Captioning in the Wild
Kenneth Tran
Xiaodong He
Lei Zhang
Jian Sun
Cornelia Carapcea
Chris Thrasher
Chris Buehler
Chris Sienkiewicz
VLM
19
123
0
30 Mar 2016
Generating Visual Explanations
Generating Visual Explanations
Lisa Anne Hendricks
Zeynep Akata
Marcus Rohrbach
Jeff Donahue
Bernt Schiele
Trevor Darrell
VLM
FAtt
47
618
0
28 Mar 2016
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for
  Automated Image Annotation
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation
Hoo-Chang Shin
Kirk Roberts
Le Lu
Dina Demner-Fushman
Jianhua Yao
Ronald M. Summers
21
347
0
28 Mar 2016
Audio Visual Emotion Recognition with Temporal Alignment and Perception
  Attention
Audio Visual Emotion Recognition with Temporal Alignment and Perception Attention
Linlin Chao
J. Tao
Minghao Yang
Ya Li
Zhengqi Wen
16
30
0
28 Mar 2016
Recurrent Mixture Density Network for Spatiotemporal Visual Attention
Recurrent Mixture Density Network for Spatiotemporal Visual Attention
Loris Bazzani
Hugo Larochelle
Lorenzo Torresani
26
133
0
27 Mar 2016
Neural Text Generation from Structured Data with Application to the
  Biography Domain
Neural Text Generation from Structured Data with Application to the Biography Domain
R. Lebret
David Grangier
Michael Auli
21
45
0
24 Mar 2016
Attentive Contexts for Object Detection
Attentive Contexts for Object Detection
Jianan Li
Yunchao Wei
Xiaodan Liang
Jian Dong
Tingfa Xu
Jiashi Feng
Shuicheng Yan
ObjD
17
221
0
24 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing
BreakingNews: Article Annotation by Image and Text Processing
Arnau Ramisa
F. Yan
Francesc Moreno-Noguer
K. Mikolajczyk
29
105
0
23 Mar 2016
Semantic Object Parsing with Graph LSTM
Semantic Object Parsing with Graph LSTM
Xiaodan Liang
Xiaohui Shen
Jiashi Feng
Liang Lin
Shuicheng Yan
35
354
0
23 Mar 2016
Deep Learning in Bioinformatics
Deep Learning in Bioinformatics
Seonwoo Min
Byunghan Lee
Sungroh Yoon
AI4CE
3DV
36
1,351
0
21 Mar 2016
Segmentation from Natural Language Expressions
Segmentation from Natural Language Expressions
Ronghang Hu
Marcus Rohrbach
Trevor Darrell
VLM
EgoV
19
426
0
20 Mar 2016
One-Shot Generalization in Deep Generative Models
One-Shot Generalization in Deep Generative Models
Danilo Jimenez Rezende
S. Mohamed
Ivo Danihelka
Karol Gregor
Daan Wierstra
BDL
VLM
DRL
LRM
30
254
0
16 Mar 2016
Image Captioning with Semantic Attention
Image Captioning with Semantic Attention
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
VLM
67
1,652
0
12 Mar 2016
Neural Discourse Relation Recognition with Semantic Memory
Neural Discourse Relation Recognition with Semantic Memory
Biao Zhang
Deyi Xiong
Jinsong Su
33
16
0
12 Mar 2016
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild
Chen-Yu Lee
Simon Osindero
VLM
40
458
0
09 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and
  External Knowledge
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge
Qi Wu
Chunhua Shen
Anton Van Den Hengel
Peng Wang
A. Dick
27
360
0
09 Mar 2016
Dynamic Memory Networks for Visual and Textual Question Answering
Dynamic Memory Networks for Visual and Textual Question Answering
Caiming Xiong
Stephen Merity
R. Socher
34
753
0
04 Mar 2016
Noisy Activation Functions
Noisy Activation Functions
Çağlar Gülçehre
Marcin Moczulski
Misha Denil
Yoshua Bengio
9
283
0
01 Mar 2016
Recurrent Neural Network Grammars
Recurrent Neural Network Grammars
Chris Dyer
A. Kuncoro
Miguel Ballesteros
Noah A. Smith
GNN
33
524
0
25 Feb 2016
Learning to Generate with Memory
Learning to Generate with Memory
Chongxuan Li
Jun Zhu
Bo Zhang
BDL
24
42
0
24 Feb 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense
  Image Annotations
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li-Jia Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
99
5,663
0
23 Feb 2016
Contextual LSTM (CLSTM) models for Large scale NLP tasks
Contextual LSTM (CLSTM) models for Large scale NLP tasks
Shalini Ghosh
Oriol Vinyals
B. Strope
Scott Roy
Tom Dean
Larry Heck
25
213
0
19 Feb 2016
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro
Sameer Singh
Carlos Guestrin
FAtt
FaML
43
16,662
0
16 Feb 2016
Look, Listen and Learn - A Multimodal LSTM for Speaker Identification
Look, Listen and Learn - A Multimodal LSTM for Speaker Identification
Jimmy S. J. Ren
Yongtao Hu
Yu-Wing Tai
Chuan Wang
Li Xu
Wenxiu Sun
Qiong Yan
35
108
0
13 Feb 2016
Global Deconvolutional Networks for Semantic Segmentation
Global Deconvolutional Networks for Semantic Segmentation
Vladimir Nekrasov
Janghoon Ju
Jaesik Choi
SSeg
31
12
0
12 Feb 2016
Attentive Pooling Networks
Attentive Pooling Networks
Cicero Nogueira dos Santos
Ming Tan
Bing Xiang
Bowen Zhou
32
346
0
11 Feb 2016
A Convolutional Attention Network for Extreme Summarization of Source
  Code
A Convolutional Attention Network for Extreme Summarization of Source Code
Miltiadis Allamanis
Hao Peng
Charles Sutton
AI4TS
38
581
0
09 Feb 2016
Value Iteration Networks
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
27
649
0
09 Feb 2016
Predicting Clinical Events by Combining Static and Dynamic Information
  Using Recurrent Neural Networks
Predicting Clinical Events by Combining Static and Dynamic Information Using Recurrent Neural Networks
Cristóbal Esteban
O. Staeck
Yinchong Yang
Volker Tresp
16
154
0
08 Feb 2016
From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label
  Classification
From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification
André F. T. Martins
Ramón Fernández Astudillo
13
703
0
05 Feb 2016
Long-term Planning by Short-term Prediction
Long-term Planning by Short-term Prediction
Shai Shalev-Shwartz
Nir Ben-Zrihem
Aviad Cohen
Amnon Shashua
12
61
0
04 Feb 2016
Survey on the attention based RNN model and its applications in computer
  vision
Survey on the attention based RNN model and its applications in computer vision
Feng Wang
David Tax
AI4TS
AIMat
27
113
0
25 Jan 2016
Modeling Coverage for Neural Machine Translation
Modeling Coverage for Neural Machine Translation
Zhaopeng Tu
Zhengdong Lu
Yang Liu
Xiaohua Liu
Hang Li
15
746
0
19 Jan 2016
Multimodal Pivots for Image Caption Translation
Multimodal Pivots for Image Caption Translation
Julian Hitschler
Shigehiko Schamoni
Stefan Riezler
33
97
0
15 Jan 2016
Previous
123...6768697071
Next