ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Lessons learned in multilingual grounded language learning
Lessons learned in multilingual grounded language learning
Ákos Kádár
Desmond Elliott
Marc-Alexandre Côté
Grzegorz Chrupała
Afra Alishahi
VLM
119
24
0
20 Sep 2018
C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis
C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis
K. J. Joseph
Arghya Pal
Sailaja Rajanala
V. Balasubramanian
DiffM
87
26
0
20 Sep 2018
Exploring Visual Relationship for Image Captioning
Exploring Visual Relationship for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
147
837
0
19 Sep 2018
Quantum Statistics-Inspired Neural Attention
Quantum Statistics-Inspired Neural Attention
Aristotelis Charalampous
S. Chatzis
35
0
0
17 Sep 2018
Intermediate Deep Feature Compression: the Next Battlefield of
  Intelligent Sensing
Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing
Zhuo Chen
Weisi Lin
Shiqi Wang
Ling-yu Duan
Alex C. Kot
88
17
0
17 Sep 2018
Integrative Analysis of Patient Health Records and Neuroimages via
  Memory-based Graph Convolutional Network
Integrative Analysis of Patient Health Records and Neuroimages via Memory-based Graph Convolutional Network
Xi Sheryl Zhang
Jingyuan Chou
Fei Wang
69
15
0
17 Sep 2018
CADP: A Novel Dataset for CCTV Traffic Camera based Accident Analysis
CADP: A Novel Dataset for CCTV Traffic Camera based Accident Analysis
Ankit Parag Shah
Jean-Baptiste Lamare
Tuan Nguyen-Anh
Alexander G. Hauptmann
79
106
0
16 Sep 2018
Attention as a Perspective for Learning Tempo-invariant Audio Queries
Attention as a Perspective for Learning Tempo-invariant Audio Queries
Matthias Dorfer
Jan Hajic
Gerhard Widmer
25
2
0
15 Sep 2018
A Deep Learning and Gamification Approach to Energy Conservation at
  Nanyang Technological University
A Deep Learning and Gamification Approach to Energy Conservation at Nanyang Technological University
Ioannis C. Konstantakopoulos
Andrew R. Barkan
Shiying He
Tanya Veeravalli
Huihan Liu
C. Spanos
AI4CE
46
7
0
13 Sep 2018
Improving Reinforcement Learning Based Image Captioning with Natural
  Language Prior
Improving Reinforcement Learning Based Image Captioning with Natural Language Prior
Tszhang Guo
Shiyu Chang
Mo Yu
Kun Bai
70
15
0
13 Sep 2018
IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic
  Oracles
IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles
Tianze Shi
Kedar Tatwawadi
K. Chakrabarti
Yi Mao
Oleksandr Polozov
Weizhu Chen
78
64
0
13 Sep 2018
LiveBot: Generating Live Video Comments Based on Visual and Textual
  Contexts
LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts
Shuming Ma
Lei Cui
Damai Dai
Furu Wei
Xu Sun
VGen
81
63
0
13 Sep 2018
Image Captioning based on Deep Reinforcement Learning
Image Captioning based on Deep Reinforcement Learning
Haichao Shi
Peng Li
Bo Wang
Zhenyu Wang
31
25
0
13 Sep 2018
Higher-order Graph Convolutional Networks
Higher-order Graph Convolutional Networks
J. B. Lee
Ryan A. Rossi
Xiangnan Kong
Sungchul Kim
Eunyee Koh
Anup B. Rao
GNN
74
36
0
12 Sep 2018
End-to-end Image Captioning Exploits Multimodal Distributional
  Similarity
End-to-end Image Captioning Exploits Multimodal Distributional Similarity
Pranava Madhyastha
Josiah Wang
Lucia Specia
CoGe
63
7
0
11 Sep 2018
SPASS: Scientific Prominence Active Search System with Deep Image
  Captioning Network
SPASS: Scientific Prominence Active Search System with Deep Image Captioning Network
D. Qiu
26
2
0
10 Sep 2018
Tracking by Animation: Unsupervised Learning of Multi-Object Attentive
  Trackers
Tracking by Animation: Unsupervised Learning of Multi-Object Attentive Trackers
Zhen He
Jian Li
Daxue Liu
Hangen He
David Barber
VOT
77
54
0
10 Sep 2018
Dual Attention Network for Scene Segmentation
Dual Attention Network for Scene Segmentation
J. Fu
Qingbin Liu
Haijie Tian
Yong Li
Yongjun Bao
Zhiwei Fang
Hanqing Lu
SSeg
333
5,134
0
09 Sep 2018
Exploration on Grounded Word Embedding: Matching Words and Images with
  Image-Enhanced Skip-Gram Model
Exploration on Grounded Word Embedding: Matching Words and Images with Image-Enhanced Skip-Gram Model
Ruixuan Luo
36
0
0
08 Sep 2018
Object Hallucination in Image Captioning
Object Hallucination in Image Captioning
Anna Rohrbach
Lisa Anne Hendricks
Kaylee Burns
Trevor Darrell
Kate Saenko
236
445
0
06 Sep 2018
Bimodal network architectures for automatic generation of image
  annotation from text
Bimodal network architectures for automatic generation of image annotation from text
Mehdi Moradi
Ali Madani
Yaniv Gur
Yufan Guo
Tanveer Syeda-Mahmood
50
20
0
05 Sep 2018
Dynamically Context-Sensitive Time-Decay Attention for Dialogue Modeling
Dynamically Context-Sensitive Time-Decay Attention for Dialogue Modeling
Shang-Yu Su
Pei-Chieh Yuan
Yun-Nung Chen
51
7
0
05 Sep 2018
Text2Scene: Generating Compositional Scenes from Textual Descriptions
Text2Scene: Generating Compositional Scenes from Textual Descriptions
Fuwen Tan
Song Feng
Vicente Ordonez
135
18
0
04 Sep 2018
Diverse and Coherent Paragraph Generation from Images
Diverse and Coherent Paragraph Generation from Images
Moitreya Chatterjee
Alex Schwing
78
67
0
03 Sep 2018
VoxSegNet: Volumetric CNNs for Semantic Part Segmentation of 3D Shapes
VoxSegNet: Volumetric CNNs for Semantic Part Segmentation of 3D Shapes
Zongji Wang
Feng Lu
3DPC
68
113
0
01 Sep 2018
Improving Visual Relationship Detection using Semantic Modeling of Scene
  Descriptions
Improving Visual Relationship Detection using Semantic Modeling of Scene Descriptions
S. Baier
Yunpu Ma
Volker Tresp
102
59
0
01 Sep 2018
LIUM-CVC Submissions for WMT18 Multimodal Translation Task
LIUM-CVC Submissions for WMT18 Multimodal Translation Task
Ozan Caglayan
Adrien Bardet
Fethi Bougares
Loïc Barrault
M. García-Martínez
Marc Masana
Luis Herranz
Joost van de Weijer
68
41
0
01 Sep 2018
When to Finish? Optimal Beam Search for Neural Text Generation (modulo
  beam size)
When to Finish? Optimal Beam Search for Neural Text Generation (modulo beam size)
Liang Huang
Kai Zhao
Mingbo Ma
96
54
0
31 Aug 2018
A Deep Neural Network Sentence Level Classification Method with Context
  Information
A Deep Neural Network Sentence Level Classification Method with Context Information
Xingyi Song
Johann Petrak
A. Roberts
49
21
0
31 Aug 2018
An Adaptive Locally Connected Neuron Model: Focusing Neuron
An Adaptive Locally Connected Neuron Model: Focusing Neuron
F. Boray Tek
38
6
0
31 Aug 2018
Ensemble Sequence Level Training for Multimodal MT: OSU-Baidu WMT18
  Multimodal Machine Translation System Report
Ensemble Sequence Level Training for Multimodal MT: OSU-Baidu WMT18 Multimodal Machine Translation System Report
Renjie Zheng
Yilin Yang
Mingbo Ma
Liang Huang
86
8
0
31 Aug 2018
Learning to Describe Differences Between Pairs of Similar Images
Learning to Describe Differences Between Pairs of Similar Images
Harsh Jhamtani
Taylor Berg-Kirkpatrick
92
155
0
31 Aug 2018
LUCSS: Language-based User-customized Colourization of Scene Sketches
LUCSS: Language-based User-customized Colourization of Scene Sketches
C. Zou
Haoran Mo
Ruofei Du
Xing Wu
Chengying Gao
Hongbo Fu
47
8
0
30 Aug 2018
End-to-end Speech Recognition with Adaptive Computation Steps
End-to-end Speech Recognition with Adaptive Computation Steps
Mohan Li
Min Liu
Masanori Hattori
44
34
0
30 Aug 2018
Hard Non-Monotonic Attention for Character-Level Transduction
Hard Non-Monotonic Attention for Character-Level Transduction
Shijie Wu
Pamela Shapiro
Ryan Cotterell
90
42
0
29 Aug 2018
Attention-based Neural Text Segmentation
Attention-based Neural Text Segmentation
Pinkesh Badjatiya
Litton J. Kurisinkel
Manish Gupta
Vasudeva Varma
53
68
0
29 Aug 2018
Top-down Attention Recurrent VLAD Encoding for Action Recognition in
  Videos
Top-down Attention Recurrent VLAD Encoding for Action Recognition in Videos
Swathikiran Sudhakaran
Oswald Lanz
48
6
0
29 Aug 2018
Interact as You Intend: Intention-Driven Human-Object Interaction
  Detection
Interact as You Intend: Intention-Driven Human-Object Interaction Detection
Bingjie Xu
Junnan Li
Yongkang Wong
Mohan Kankanhalli
Qi Zhao
103
100
0
29 Aug 2018
Notes on Deep Learning for NLP
Notes on Deep Learning for NLP
A. Tixier
VLM
66
15
0
29 Aug 2018
Multi-Reference Training with Pseudo-References for Neural Translation
  and Text Generation
Multi-Reference Training with Pseudo-References for Neural Translation and Text Generation
Renjie Zheng
Mingbo Ma
Liang Huang
80
35
0
28 Aug 2018
Natural Language Generation with Neural Variational Models
Natural Language Generation with Neural Variational Models
Hareesh Bahuleyan
DRL
49
6
0
27 Aug 2018
A neural attention model for speech command recognition
A neural attention model for speech command recognition
Douglas Coimbra de Andrade
Sabato Leo
M. Viana
Christoph Bernkopf
57
145
0
27 Aug 2018
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and
  Comprehensive Image Captions
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Houfeng Wang
Xu Sun
132
66
0
27 Aug 2018
Learning End-to-End Goal-Oriented Dialog with Multiple Answers
Learning End-to-End Goal-Oriented Dialog with Multiple Answers
Janarthanan Rajendran
Jatin Ganhotra
Satinder Singh
L. Polymenakos
61
37
0
24 Aug 2018
Approximate Distribution Matching for Sequence-to-Sequence Learning
Approximate Distribution Matching for Sequence-to-Sequence Learning
Wenhu Chen
Guanlin Li
Shujie Liu
Zhirui Zhang
Mu Li
M. Zhou
OODBDL
38
0
0
24 Aug 2018
Sarcasm Analysis using Conversation Context
Sarcasm Analysis using Conversation Context
Debanjan Ghosh
Alexander R. Fabbri
Smaranda Muresan
72
85
0
22 Aug 2018
Attention Gated Networks: Learning to Leverage Salient Regions in
  Medical Images
Attention Gated Networks: Learning to Leverage Salient Regions in Medical Images
Jo Schlemper
Ozan Oktay
M. Schaap
M. Heinrich
Bernhard Kainz
Ben Glocker
Daniel Rueckert
MedIm
142
1,488
0
22 Aug 2018
Hierarchical Neural Network for Extracting Knowledgeable Snippets and
  Documents
Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents
Ganbin Zhou
Rongyu Cao
Xiang Ao
Ping Luo
Fen Lin
Leyu Lin
Qing He
48
0
0
22 Aug 2018
Exploring a Unified Attention-Based Pooling Framework for Speaker
  Verification
Exploring a Unified Attention-Based Pooling Framework for Speaker Verification
Yi Y. Liu
Liang He
Weiwei Liu
Jia-Wei Liu
43
8
0
21 Aug 2018
VERAM: View-Enhanced Recurrent Attention Model for 3D Shape
  Classification
VERAM: View-Enhanced Recurrent Attention Model for 3D Shape Classification
Song-Le Chen
Lintao Zheng
Yan Zhang
Zhixin Sun
Kai Xu
3DPC3DV
79
75
0
20 Aug 2018
Previous
123...495051...697071
Next