1502.03044
Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Kelvin Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
When Point Process Meets RNNs: Predicting Fine-Grained User Interests with Mutual Behavioral Infectivity
Tongbing Chen
Lin Wu
Yang Wang
Jun Zhang
Hongxu Chen
Xue Li
AI4TS
43
0
0
14 Oct 2017
Convolutional Attention-based Seq2Seq Neural Network for End-to-End ASR
D. Lim
37
2
0
12 Oct 2017
Lung Cancer Screening Using Adaptive Memory-Augmented Recurrent Networks
Aryan Mobiny
S. Moulik
H. Nguyen
53
13
0
11 Oct 2017
Deep learning in remote sensing: a review
Xiaoxiang Zhu
D. Tuia
Lichao Mou
Gui-Song Xia
Liangpei Zhang
Feng Xu
Friedrich Fraundorfer
102
1,627
0
11 Oct 2017
iVQA: Inverse Visual Question Answering
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
66
47
0
10 Oct 2017
Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on Rough Terrain Challenge
Doo Re Song
Chuanyu Yang
C. McGreavy
Zhibin Li
167
30
0
08 Oct 2017
OSU Multimodal Machine Translation System Report
Mingbo Ma
Dapeng Li
Kai Zhao
Liang Huang
83
15
0
07 Oct 2017
Contrastive Learning for Image Captioning
Bo Dai
Dahua Lin
SSL
VLM
105
194
0
06 Oct 2017
Semantic speech retrieval with a visually grounded model of untranscribed speech
Herman Kamper
Gregory Shakhnarovich
Karen Livescu
73
53
0
05 Oct 2017
Person Re-Identification with Vision and Language
F. Yan
K. Mikolajczyk
J. Kittler
VLM
34
11
0
03 Oct 2017
Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken Language Understanding
Po-Chun Chen
Ta-Chung Chi
Shang-Yu Su
Yun-Nung Chen
63
28
0
30 Sep 2017
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis
Xihui Liu
Haiyu Zhao
Maoqing Tian
Lu Sheng
Jing Shao
Shuai Yi
Junjie Yan
Xiaogang Wang
119
517
0
28 Sep 2017
Sentiment Classification with Word Attention based on Weakly Supervised Learning with a Convolutional Neural Network
Gichang Lee
Jaeyun Jeong
Seungwan Seo
Czangyeob Kim
Pilsung Kang
33
49
0
28 Sep 2017
Fooling Vision and Language Models Despite Localization and Attention Mechanism
Xiaojun Xu
Xinyun Chen
Chang-rui Liu
Anna Rohrbach
Trevor Darrell
Basel Alomair
AAML
106
41
0
25 Sep 2017
Muon Trigger for Mobile Phones
M. Borisyak
Mikhail (Misha) Usvyatsov
M. Mulhearn
C. Shimmin
Andrey Ustyuzhanin
21
10
0
25 Sep 2017
The Consciousness Prior
Yoshua Bengio
DRL
AI4CE
91
231
0
25 Sep 2017
Visual Reference Resolution using Attention Memory for Visual Dialog
Paul Hongsuck Seo
Andreas M. Lehrmann
Bohyung Han
Leonid Sigal
107
123
0
23 Sep 2017
Learning to Detect Violent Videos using Convolutional Long Short-Term Memory
Swathikiran Sudhakaran
Oswald Lanz
67
217
0
19 Sep 2017
Learning to update Auto-associative Memory in Recurrent Neural Networks for Improving Sequence Memorization
Wei Zhang
Bowen Zhou
65
13
0
19 Sep 2017
Where to Focus: Deep Attention-based Spatially Recurrent Bilinear Networks for Fine-Grained Visual Recognition
Lin Wu
Yang Wang
53
9
0
18 Sep 2017
Deep Graph Attention Model
J. B. Lee
Ryan Rossi
Xiangnan Kong
GNN
44
13
0
15 Sep 2017
Learning Functional Causal Models with Generative Neural Networks
Hugo Jair Escalante
Sergio Escalera
Xavier Baro
Isabelle M Guyon
Umut Güçlü
Marcel van Gerven
CML
BDL
109
108
0
15 Sep 2017
Self-Guiding Multimodal LSTM - when we do not have a perfect training dataset for image captioning
Yang Xian
Yingli Tian
VLM
59
23
0
15 Sep 2017
Robustness Analysis of Visual QA Models by Basic Questions
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
C. Huck Yang
Guohao Li
OOD
65
24
0
14 Sep 2017
Natural Language Inference over Interaction Space
Yichen Gong
Heng Luo
Jian Zhang
115
265
0
13 Sep 2017
RRA: Recurrent Residual Attention for Sequence Learning
Cheng Wang
35
13
0
12 Sep 2017
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Jiuxiang Gu
Jianfei Cai
G. Wang
Tsuhan Chen
126
181
0
11 Sep 2017
Steering Output Style and Topic in Neural Response Generation
Di Wang
Nebojsa Jojic
Chris Brockett
Eric Nyberg
LLMSV
86
68
0
09 Sep 2017
Cross-Domain Image Retrieval with Attention Modeling
Xin Ji
Wei Wang
Mei-juan Zhang
Yang Yang
109
82
0
06 Sep 2017
Interacting Attention-gated Recurrent Networks for Recommendation
Wenjie Pei
Jie Yang
Zhu Sun
Jie Zhang
A. Bozzon
David Tax
GNN
HAI
56
68
0
05 Sep 2017
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
506
26,777
0
05 Sep 2017
Unsupervised feature learning with discriminative encoder
Gaurav Pandey
Ambedkar Dukkipati
SSL
60
6
0
03 Sep 2017
Learning to Look Around: Intelligently Exploring Unseen Environments for Unknown Tasks
Dinesh Jayaraman
Kristen Grauman
SSL
87
106
0
01 Sep 2017
Query-by-example Spoken Term Detection using Attention-based Multi-hop Networks
Chia-Wei Ao
Hung-yi Lee
60
22
0
01 Sep 2017
Predicting Cardiovascular Risk Factors from Retinal Fundus Photographs using Deep Learning
Ryan Poplin
A. Varadarajan
Katy Blumer
Yun-Hui Liu
M. McConnell
G. Corrado
L. Peng
D. Webster
MedIm
87
1,344
0
31 Aug 2017
Video Captioning with Guidance of Multimodal Latent Topics
Shizhe Chen
Jia Chen
Qin Jin
Alexander G. Hauptmann
125
67
0
31 Aug 2017
Generating Video Descriptions with Topic Guidance
Shizhe Chen
Jia Chen
Qin Jin
88
21
0
31 Aug 2017
Video Summarization with Attention-Based Encoder-Decoder Networks
Zhong Ji
Kailin Xiong
Yanwei Pang
Xuelong Li
94
308
0
31 Aug 2017
Action Classification and Highlighting in Videos
Atousa Torabi
Leonid Sigal
79
5
0
31 Aug 2017
End-to-end Learning for Short Text Expansion
Jian Tang
Yue Wang
Kai Zheng
Qiaozhu Mei
LLMAG
81
19
0
30 Aug 2017
Hierarchical Multi-scale Attention Networks for Action Recognition
Shiyang Yan
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
91
37
0
25 Aug 2017
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks
Victor Campos
Brendan Jou
Xavier Giró-i-Nieto
Jordi Torres
Shih-Fu Chang
95
220
0
22 Aug 2017
Twin Networks: Matching the Future for Sequence Generation
Dmitriy Serdyuk
Nan Rosemary Ke
Alessandro Sordoni
Adam Trischler
C. Pal
Yoshua Bengio
69
12
0
22 Aug 2017
PiCANet: Learning Pixel-wise Contextual Attention for Saliency Detection
Nian Liu
Junwei Han
Ming-Hsuan Yang
SSeg
76
6
0
21 Aug 2017
Attentive Semantic Video Generation using Captions
Tanya Marwah
Gaurav Mittal
V. Balasubramanian
94
72
0
20 Aug 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
185
2,830
0
19 Aug 2017
Image2song: Song Retrieval via Bridging Image Content and Lyric Words
Xuelong Li
Di Hu
Xiaoqiang Lu
56
10
0
19 Aug 2017
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
VLM
82
148
0
17 Aug 2017
Modality-specific Cross-modal Similarity Measurement with Recurrent Attention Network
Yuxin Peng
Jinwei Qi
Yuxin Yuan
48
124
0
16 Aug 2017
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
Chuang Gan
Yandong Li
Haoxiang Li
Chen Sun
Boqing Gong
109
127
0
15 Aug 2017