ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXivPDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,510 papers shown
Title
Deep Reinforcement Learning for Visual Object Tracking in Videos
Deep Reinforcement Learning for Visual Object Tracking in Videos
Da Zhang
H. Maei
Xin Eric Wang
Yuan-fang Wang
23
115
0
31 Jan 2017
Memory Augmented Neural Networks with Wormhole Connections
Memory Augmented Neural Networks with Wormhole Connections
Çağlar Gülçehre
A. Chandar
Yoshua Bengio
31
63
0
30 Jan 2017
Supervised Deep Sparse Coding Networks
Supervised Deep Sparse Coding Networks
Xiaoxia Sun
Nasser M. Nasrabadi
T. Tran
BDL
27
15
0
29 Jan 2017
Image-Grounded Conversations: Multimodal Context for Natural Question
  and Response Generation
Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation
N. Mostafazadeh
Chris Brockett
W. Dolan
Michel Galley
Jianfeng Gao
Georgios P. Spithourakis
Lucy Vanderwende
26
181
0
28 Jan 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,505
0
25 Jan 2017
Incorporating Global Visual Features into Attention-Based Neural Machine
  Translation
Incorporating Global Visual Features into Attention-Based Neural Machine Translation
Iacer Calixto
Qun Liu
Nick Campbell
32
154
0
23 Jan 2017
Understanding the Effective Receptive Field in Deep Convolutional Neural
  Networks
Understanding the Effective Receptive Field in Deep Convolutional Neural Networks
Wenjie Luo
Yujia Li
R. Urtasun
R. Zemel
HAI
44
1,780
0
15 Jan 2017
Simplified Gating in Long Short-term Memory (LSTM) Recurrent Neural
  Networks
Simplified Gating in Long Short-term Memory (LSTM) Recurrent Neural Networks
Yuzhen Lu
F. Salem
24
39
0
12 Jan 2017
Comprehension-guided referring expressions
Comprehension-guided referring expressions
Ruotian Luo
Gregory Shakhnarovich
ObjD
29
171
0
12 Jan 2017
Attention-Based Multimodal Fusion for Video Description
Attention-Based Multimodal Fusion for Video Description
Chiori Hori
Takaaki Hori
Teng-Yok Lee
Kazuhiro Sumi
J. Hershey
Tim K. Marks
41
359
0
11 Jan 2017
Context-aware Captions from Context-agnostic Supervision
Context-aware Captions from Context-agnostic Supervision
Ramakrishna Vedantam
Samy Bengio
Kevin Patrick Murphy
Devi Parikh
Gal Chechik
22
152
0
11 Jan 2017
Towards Decoding as Continuous Optimization in Neural Machine
  Translation
Towards Decoding as Continuous Optimization in Neural Machine Translation
Cong Duy Vu Hoang
Gholamreza Haffari
Trevor Cohn
AI4CE
30
42
0
11 Jan 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
273
1,896
0
10 Jan 2017
Textual Entailment with Structured Attentions and Composition
Textual Entailment with Structured Attentions and Composition
Kai Zhao
Liang Huang
Mingbo Ma
29
28
0
04 Jan 2017
Dynamic Deep Neural Networks: Optimizing Accuracy-Efficiency Trade-offs
  by Selective Execution
Dynamic Deep Neural Networks: Optimizing Accuracy-Efficiency Trade-offs by Selective Execution
Lanlan Liu
Jia Deng
35
200
0
02 Jan 2017
Aspect-augmented Adversarial Networks for Domain Adaptation
Aspect-augmented Adversarial Networks for Domain Adaptation
Yuan Zhang
Regina Barzilay
Tommi Jaakkola
44
96
0
01 Jan 2017
Feedback Networks
Feedback Networks
Amir Zamir
Te-Lin Wu
Lin Sun
Bokui (William) Shen
Jitendra Malik
Silvio Savarese
18
209
0
30 Dec 2016
FastMask: Segment Multi-scale Object Candidates in One Shot
FastMask: Segment Multi-scale Object Candidates in One Shot
Hexiang Hu
Shiyi Lan
Yuning Jiang
Zhimin Cao
Fei Sha
SSeg
3DPC
16
28
0
28 Dec 2016
Robust LSTM-Autoencoders for Face De-Occlusion in the Wild
Robust LSTM-Autoencoders for Face De-Occlusion in the Wild
F. Zhao
Jiashi Feng
Jian-jun Zhao
Wenhan Yang
Shuicheng Yan
CVBM
19
138
0
27 Dec 2016
Validation, comparison, and combination of algorithms for automatic
  detection of pulmonary nodules in computed tomography images: the LUNA16
  challenge
Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: the LUNA16 challenge
A. Setio
A. Traverso
Thomas de Bel
Moira S. N. Berens
C. V. D. Bogaard
...
Jef Vandemeulebroucke
N. Walasek
G. Zuidhof
Bram van Ginneken
Colin Jacobs
51
1,061
0
23 Dec 2016
Understanding Image and Text Simultaneously: a Dual Vision-Language
  Machine Comprehension Task
Understanding Image and Text Simultaneously: a Dual Vision-Language Machine Comprehension Task
Nan Ding
Sebastian Goodman
Fei Sha
Radu Soricut
VLM
27
9
0
22 Dec 2016
Re-evaluating Automatic Metrics for Image Captioning
Re-evaluating Automatic Metrics for Image Captioning
Mert Kilickaya
Aykut Erdem
Nazli Ikizler-Cinbis
Erkut Erdem
17
180
0
22 Dec 2016
A Context-aware Attention Network for Interactive Question Answering
A Context-aware Attention Network for Interactive Question Answering
Huayu Li
Martin Renqiang Min
Yong Ge
Asim Kadav
21
67
0
22 Dec 2016
Top-down Visual Saliency Guided by Captions
Top-down Visual Saliency Guided by Captions
Vasili Ramanishka
Abir Das
Jianming Zhang
Kate Saenko
21
142
0
21 Dec 2016
Multi-Agent Cooperation and the Emergence of (Natural) Language
Multi-Agent Cooperation and the Emergence of (Natural) Language
Angeliki Lazaridou
A. Peysakhovich
Marco Baroni
LLMAG
60
429
0
21 Dec 2016
An Empirical Study of Language CNN for Image Captioning
An Empirical Study of Language CNN for Image Captioning
Jiuxiang Gu
G. Wang
Jianfei Cai
Tsuhan Chen
31
132
0
21 Dec 2016
Action-Driven Object Detection with Top-Down Visual Attentions
Action-Driven Object Detection with Top-Down Visual Attentions
Donggeun Yoo
Sunggyun Park
K. Paeng
Joon-Young Lee
In So Kweon
ObjD
21
6
0
20 Dec 2016
Automatic Generation of Grounded Visual Questions
Automatic Generation of Grounded Visual Questions
Shijie Zhang
Lizhen Qu
Shaodi You
Zhenglu Yang
Jiawan Zhang
OOD
27
79
0
20 Dec 2016
Large-Scale Image Retrieval with Attentive Deep Local Features
Large-Scale Image Retrieval with Attentive Deep Local Features
Hyeonwoo Noh
A. Araújo
Jack Sim
Tobias Weyand
Bohyung Han
3DV
35
766
0
19 Dec 2016
Few-Shot Object Recognition from Machine-Labeled Web Images
Few-Shot Object Recognition from Machine-Labeled Web Images
Zhongwen Xu
Linchao Zhu
Yi Yang
VLM
18
66
0
19 Dec 2016
Learning to predict where to look in interactive environments using deep
  recurrent q-learning
Learning to predict where to look in interactive environments using deep recurrent q-learning
Seyed Sajad Mousavi
Michael Schukat
Enda Howley
Ali Borji
N. Mozayani
16
30
0
17 Dec 2016
Delta Networks for Optimized Recurrent Network Computation
Delta Networks for Optimized Recurrent Network Computation
Daniel Neil
Junhaeng Lee
T. Delbruck
Shih-Chii Liu
36
66
0
16 Dec 2016
CSVideoNet: A Real-time End-to-end Learning Framework for
  High-frame-rate Video Compressive Sensing
CSVideoNet: A Real-time End-to-end Learning Framework for High-frame-rate Video Compressive Sensing
Kai Xu
Fengbo Ren
27
8
0
15 Dec 2016
Recurrent Image Captioner: Describing Images with Spatial-Invariant
  Transformation and Attention Filtering
Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering
Hao Liu
Yang Yang
Fumin Shen
Lixin Duan
Heng Tao Shen
38
9
0
15 Dec 2016
Single Image Action Recognition using Semantic Body Part Actions
Single Image Action Recognition using Semantic Body Part Actions
Zhichen Zhao
Huimin Ma
Shaodi You
30
74
0
14 Dec 2016
End-to-End Deep Reinforcement Learning for Lane Keeping Assist
End-to-End Deep Reinforcement Learning for Lane Keeping Assist
Ahmad El-Sallab
Mohammed Abdou
E. Perot
S. Yogamani
19
175
0
13 Dec 2016
Paying More Attention to Attention: Improving the Performance of
  Convolutional Neural Networks via Attention Transfer
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer
Sergey Zagoruyko
N. Komodakis
67
2,553
0
12 Dec 2016
Empirical Evaluation of A New Approach to Simplifying Long Short-term
  Memory (LSTM)
Empirical Evaluation of A New Approach to Simplifying Long Short-term Memory (LSTM)
Yuzhen Lu
16
2
0
12 Dec 2016
VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question
  Answering
VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering
Marc Bolaños
Álvaro Peris
F. Casacuberta
Petia Radeva
32
6
0
12 Dec 2016
Text-guided Attention Model for Image Captioning
Text-guided Attention Model for Image Captioning
Jonghwan Mun
Minsu Cho
Bohyung Han
VLM
15
92
0
12 Dec 2016
Improving the Performance of Neural Machine Translation Involving
  Morphologically Rich Languages
Improving the Performance of Neural Machine Translation Involving Morphologically Rich Languages
Hans Krupakar
R. S. Milton
35
15
0
07 Dec 2016
Spatially Adaptive Computation Time for Residual Networks
Spatially Adaptive Computation Time for Residual Networks
Michael Figurnov
Maxwell D. Collins
Yukun Zhu
Li Zhang
Jonathan Huang
Dmitry Vetrov
Ruslan Salakhutdinov
23
346
0
07 Dec 2016
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image
  Captioning
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Jiasen Lu
Caiming Xiong
Devi Parikh
R. Socher
85
1,442
0
06 Dec 2016
Condensed Memory Networks for Clinical Diagnostic Inferencing
Condensed Memory Networks for Clinical Diagnostic Inferencing
Aaditya (Adi) Prakash
Siyuan Zhao
Sadid A. Hasan
Vivek Datla
Kathy Lee
Ashequl Qadir
Joey Liu
Oladimeji Farri
22
102
0
06 Dec 2016
Learning to Detect Multiple Photographic Defects
Learning to Detect Multiple Photographic Defects
Ning Yu
Xiaohui Shen
Zhe Lin
R. Měch
Connelly Barnes
19
14
0
06 Dec 2016
ImageNet pre-trained models with batch normalization
ImageNet pre-trained models with batch normalization
Marcel Simon
E. Rodner
Joachim Denzler
VLM
SSeg
44
165
0
05 Dec 2016
Areas of Attention for Image Captioning
Areas of Attention for Image Captioning
M. Pedersoli
Thomas Lucas
Cordelia Schmid
Jakob Verbeek
33
205
0
03 Dec 2016
Parameter Compression of Recurrent Neural Networks and Degradation of
  Short-term Memory
Parameter Compression of Recurrent Neural Networks and Degradation of Short-term Memory
Jonathan A. Cox
8
5
0
02 Dec 2016
Guided Open Vocabulary Image Captioning with Constrained Beam Search
Guided Open Vocabulary Image Captioning with Constrained Beam Search
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
35
232
0
02 Dec 2016
Self-critical Sequence Training for Image Captioning
Self-critical Sequence Training for Image Captioning
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
37
1,880
0
02 Dec 2016
Previous
123...636465...697071
Next