Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
Lessons learned in multilingual grounded language learning
Ákos Kádár
Desmond Elliott
Marc-Alexandre Côté
Grzegorz Chrupała
Afra Alishahi
VLM
119
24
0
20 Sep 2018
C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis
K. J. Joseph
Arghya Pal
Sailaja Rajanala
V. Balasubramanian
DiffM
87
26
0
20 Sep 2018
Exploring Visual Relationship for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
147
837
0
19 Sep 2018
Quantum Statistics-Inspired Neural Attention
Aristotelis Charalampous
S. Chatzis
35
0
0
17 Sep 2018
Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing
Zhuo Chen
Weisi Lin
Shiqi Wang
Ling-yu Duan
Alex C. Kot
88
17
0
17 Sep 2018
Integrative Analysis of Patient Health Records and Neuroimages via Memory-based Graph Convolutional Network
Xi Sheryl Zhang
Jingyuan Chou
Fei Wang
69
15
0
17 Sep 2018
CADP: A Novel Dataset for CCTV Traffic Camera based Accident Analysis
Ankit Parag Shah
Jean-Baptiste Lamare
Tuan Nguyen-Anh
Alexander G. Hauptmann
79
106
0
16 Sep 2018
Attention as a Perspective for Learning Tempo-invariant Audio Queries
Matthias Dorfer
Jan Hajic
Gerhard Widmer
25
2
0
15 Sep 2018
A Deep Learning and Gamification Approach to Energy Conservation at Nanyang Technological University
Ioannis C. Konstantakopoulos
Andrew R. Barkan
Shiying He
Tanya Veeravalli
Huihan Liu
C. Spanos
AI4CE
46
7
0
13 Sep 2018
Improving Reinforcement Learning Based Image Captioning with Natural Language Prior
Tszhang Guo
Shiyu Chang
Mo Yu
Kun Bai
70
15
0
13 Sep 2018
IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles
Tianze Shi
Kedar Tatwawadi
K. Chakrabarti
Yi Mao
Oleksandr Polozov
Weizhu Chen
78
64
0
13 Sep 2018
LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts
Shuming Ma
Lei Cui
Damai Dai
Furu Wei
Xu Sun
VGen
81
63
0
13 Sep 2018
Image Captioning based on Deep Reinforcement Learning
Haichao Shi
Peng Li
Bo Wang
Zhenyu Wang
31
25
0
13 Sep 2018
Higher-order Graph Convolutional Networks
J. B. Lee
Ryan A. Rossi
Xiangnan Kong
Sungchul Kim
Eunyee Koh
Anup B. Rao
GNN
74
36
0
12 Sep 2018
End-to-end Image Captioning Exploits Multimodal Distributional Similarity
Pranava Madhyastha
Josiah Wang
Lucia Specia
CoGe
63
7
0
11 Sep 2018
SPASS: Scientific Prominence Active Search System with Deep Image Captioning Network
D. Qiu
26
2
0
10 Sep 2018
Tracking by Animation: Unsupervised Learning of Multi-Object Attentive Trackers
Zhen He
Jian Li
Daxue Liu
Hangen He
David Barber
VOT
77
54
0
10 Sep 2018
Dual Attention Network for Scene Segmentation
J. Fu
Qingbin Liu
Haijie Tian
Yong Li
Yongjun Bao
Zhiwei Fang
Hanqing Lu
SSeg
333
5,134
0
09 Sep 2018
Exploration on Grounded Word Embedding: Matching Words and Images with Image-Enhanced Skip-Gram Model
Ruixuan Luo
36
0
0
08 Sep 2018
Object Hallucination in Image Captioning
Anna Rohrbach
Lisa Anne Hendricks
Kaylee Burns
Trevor Darrell
Kate Saenko
236
445
0
06 Sep 2018
Bimodal network architectures for automatic generation of image annotation from text
Mehdi Moradi
Ali Madani
Yaniv Gur
Yufan Guo
Tanveer Syeda-Mahmood
50
20
0
05 Sep 2018
Dynamically Context-Sensitive Time-Decay Attention for Dialogue Modeling
Shang-Yu Su
Pei-Chieh Yuan
Yun-Nung Chen
51
7
0
05 Sep 2018
Text2Scene: Generating Compositional Scenes from Textual Descriptions
Fuwen Tan
Song Feng
Vicente Ordonez
135
18
0
04 Sep 2018
Diverse and Coherent Paragraph Generation from Images
Moitreya Chatterjee
Alex Schwing
78
67
0
03 Sep 2018
VoxSegNet: Volumetric CNNs for Semantic Part Segmentation of 3D Shapes
Zongji Wang
Feng Lu
3DPC
68
113
0
01 Sep 2018
Improving Visual Relationship Detection using Semantic Modeling of Scene Descriptions
S. Baier
Yunpu Ma
Volker Tresp
102
59
0
01 Sep 2018
LIUM-CVC Submissions for WMT18 Multimodal Translation Task
Ozan Caglayan
Adrien Bardet
Fethi Bougares
Loïc Barrault
M. García-Martínez
Marc Masana
Luis Herranz
Joost van de Weijer
68
41
0
01 Sep 2018
When to Finish? Optimal Beam Search for Neural Text Generation (modulo beam size)
Liang Huang
Kai Zhao
Mingbo Ma
96
54
0
31 Aug 2018
A Deep Neural Network Sentence Level Classification Method with Context Information
Xingyi Song
Johann Petrak
A. Roberts
49
21
0
31 Aug 2018
An Adaptive Locally Connected Neuron Model: Focusing Neuron
F. Boray Tek
38
6
0
31 Aug 2018
Ensemble Sequence Level Training for Multimodal MT: OSU-Baidu WMT18 Multimodal Machine Translation System Report
Renjie Zheng
Yilin Yang
Mingbo Ma
Liang Huang
86
8
0
31 Aug 2018
Learning to Describe Differences Between Pairs of Similar Images
Harsh Jhamtani
Taylor Berg-Kirkpatrick
92
155
0
31 Aug 2018
LUCSS: Language-based User-customized Colourization of Scene Sketches
C. Zou
Haoran Mo
Ruofei Du
Xing Wu
Chengying Gao
Hongbo Fu
47
8
0
30 Aug 2018
End-to-end Speech Recognition with Adaptive Computation Steps
Mohan Li
Min Liu
Masanori Hattori
44
34
0
30 Aug 2018
Hard Non-Monotonic Attention for Character-Level Transduction
Shijie Wu
Pamela Shapiro
Ryan Cotterell
90
42
0
29 Aug 2018
Attention-based Neural Text Segmentation
Pinkesh Badjatiya
Litton J. Kurisinkel
Manish Gupta
Vasudeva Varma
53
68
0
29 Aug 2018
Top-down Attention Recurrent VLAD Encoding for Action Recognition in Videos
Swathikiran Sudhakaran
Oswald Lanz
48
6
0
29 Aug 2018
Interact as You Intend: Intention-Driven Human-Object Interaction Detection
Bingjie Xu
Junnan Li
Yongkang Wong
Mohan Kankanhalli
Qi Zhao
103
100
0
29 Aug 2018
Notes on Deep Learning for NLP
A. Tixier
VLM
66
15
0
29 Aug 2018
Multi-Reference Training with Pseudo-References for Neural Translation and Text Generation
Renjie Zheng
Mingbo Ma
Liang Huang
80
35
0
28 Aug 2018
Natural Language Generation with Neural Variational Models
Hareesh Bahuleyan
DRL
49
6
0
27 Aug 2018
A neural attention model for speech command recognition
Douglas Coimbra de Andrade
Sabato Leo
M. Viana
Christoph Bernkopf
57
145
0
27 Aug 2018
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Houfeng Wang
Xu Sun
132
66
0
27 Aug 2018
Learning End-to-End Goal-Oriented Dialog with Multiple Answers
Janarthanan Rajendran
Jatin Ganhotra
Satinder Singh
L. Polymenakos
61
37
0
24 Aug 2018
Approximate Distribution Matching for Sequence-to-Sequence Learning
Wenhu Chen
Guanlin Li
Shujie Liu
Zhirui Zhang
Mu Li
M. Zhou
OOD
BDL
38
0
0
24 Aug 2018
Sarcasm Analysis using Conversation Context
Debanjan Ghosh
Alexander R. Fabbri
Smaranda Muresan
72
85
0
22 Aug 2018
Attention Gated Networks: Learning to Leverage Salient Regions in Medical Images
Jo Schlemper
Ozan Oktay
M. Schaap
M. Heinrich
Bernhard Kainz
Ben Glocker
Daniel Rueckert
MedIm
142
1,488
0
22 Aug 2018
Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents
Ganbin Zhou
Rongyu Cao
Xiang Ao
Ping Luo
Fen Lin
Leyu Lin
Qing He
48
0
0
22 Aug 2018
Exploring a Unified Attention-Based Pooling Framework for Speaker Verification
Yi Y. Liu
Liang He
Weiwei Liu
Jia-Wei Liu
43
8
0
21 Aug 2018
VERAM: View-Enhanced Recurrent Attention Model for 3D Shape Classification
Song-Le Chen
Lintao Zheng
Yan Zhang
Zhixin Sun
Kai Xu
3DPC
3DV
79
75
0
20 Aug 2018
Previous
1
2
3
...
49
50
51
...
69
70
71
Next