Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
Beyond Part Models: Person Retrieval with Refined Part Pooling (and a Strong Convolutional Baseline)
Yifan Sun
Liang Zheng
Yi Yang
Q. Tian
Shengjin Wang
184
2,186
0
26 Nov 2017
HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal Retrieval
Xi Zhang
Siyu Zhou
Jiashi Feng
Hanjiang Lai
Bo Li
Yan Pan
Jian Yin
Bo An
GAN
54
55
0
26 Nov 2017
Convolutional Image Captioning
J. Aneja
Aditya Deshpande
Alex Schwing
VLM
142
362
0
24 Nov 2017
Attended End-to-end Architecture for Age Estimation from Facial Expression Videos
Wenjie Pei
H. Dibeklioğlu
T. Baltrušaitis
David Tax
CVBM
49
42
0
23 Nov 2017
Self-view Grounding Given a Narrated 360° Video
Shih-Han Chou
Yi-Chun Chen
Kuo-Hao Zeng
Hou-Ning Hu
Jianlong Fu
Min Sun
31
4
0
23 Nov 2017
Conditional Image-Text Embedding Networks
Bryan A. Plummer
Paige Kordas
M. Kiapour
Shuai Zheng
Robinson Piramuthu
Svetlana Lazebnik
119
118
0
22 Nov 2017
Multi-Level Recurrent Residual Networks for Action Recognition
Zhenxing Zheng
Gaoyun An
Q. Ruan
41
12
0
22 Nov 2017
On the Automatic Generation of Medical Imaging Reports
Baoyu Jing
P. Xie
Eric Xing
MedIm
101
516
0
22 Nov 2017
Identifying Most Walkable Direction for Navigation in an Outdoor Environment
Sachin Mehta
Hannaneh Hajishirzi
Linda G. Shapiro
55
9
0
21 Nov 2017
Using stochastic computation graphs formalism for optimization of sequence-to-sequence model
Eugene Golikov
Vlad Zhukov
M. Kretov
46
0
0
21 Nov 2017
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
Liwei Wang
Alex Schwing
Svetlana Lazebnik
CoGe
116
175
0
19 Nov 2017
Excitation Backprop for RNNs
Sarah Adel Bargal
Andrea Zunino
Donghyun Kim
Jianming Zhang
Vittorio Murino
Stan Sclaroff
175
48
0
18 Nov 2017
ATRank: An Attention-Based User Behavior Modeling Framework for Recommendation
Chang Zhou
Jinze Bai
Junshuai Song
Xiaofei Liu
Zhengchao Zhao
Xiusi Chen
Jun Gao
HAI
101
309
0
17 Nov 2017
Action-Attending Graphic Neural Network
Chaolong Li
Zhen Cui
Wenming Zheng
Chunyan Xu
Rongrong Ji
Jian Yang
76
50
0
17 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
Anton Van Den Hengel
ObjD
87
135
0
17 Nov 2017
Language-Based Image Editing with Recurrent Attentive Models
Jianbo Chen
Yelong Shen
Jianfeng Gao
Jingjing Liu
Xiaodong Liu
99
122
0
16 Nov 2017
A Novel Framework for Robustness Analysis of Visual QA Models
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
Guohao Li
AAML
OOD
82
34
0
16 Nov 2017
Natural Language Guided Visual Relationship Detection
Wentong Liao
Shuai Lin
Bodo Rosenhahn
M. Yang
94
63
0
16 Nov 2017
Towards Interpretable R-CNN by Unfolding Latent Structures
Tianfu Wu
Wei Sun
Xilai Li
Xi Song
Yangqiu Song
ObjD
62
20
0
14 Nov 2017
Saliency-based Sequential Image Attention with Multiset Prediction
Sean Welleck
Jialin Mao
Kyunghyun Cho
Zheng Zhang
52
23
0
14 Nov 2017
Decoupled Weight Decay Regularization
I. Loshchilov
Frank Hutter
OffRL
167
2,166
0
14 Nov 2017
High-Order Attention Models for Visual Question Answering
Idan Schwartz
Alex Schwing
Tamir Hazan
80
103
0
12 Nov 2017
AON: Towards Arbitrarily-Oriented Text Recognition
Zhanzhan Cheng
Xuyang Liu
Fan Bai
Yi Niu
Shiliang Pu
Shuigeng Zhou
71
14
0
12 Nov 2017
Building machines that adapt and compute like brains
Brenden M. Lake
J. Tenenbaum
AI4CE
FedML
NAI
AILaw
331
887
0
11 Nov 2017
End-to-end Video-level Representation Learning for Action Recognition
Jiagang Zhu
Wei Zou
Zheng Zhu
97
89
0
11 Nov 2017
Phrase-based Image Captioning with Hierarchical LSTM Model
Y. Tan
Chee Seng Chan
VLM
31
4
0
11 Nov 2017
Towards Automated ICD Coding Using Deep Learning
Haoran Shi
P. Xie
Zhiting Hu
Ming Zhang
Eric Xing
62
142
0
11 Nov 2017
Attend and Diagnose: Clinical Time Series Analysis using Attention Models
Huan-Zhi Song
Deepta Rajan
Jayaraman J. Thiagarajan
A. Spanias
MLAU
112
456
0
10 Nov 2017
Learning Multi-Modal Word Representation Grounded in Visual Context
Éloi Zablocki
Benjamin Piwowarski
Laure Soulier
Patrick Gallinari
SSL
74
30
0
09 Nov 2017
Learning Markov Chain in Unordered Dataset
Yao-Hung Hubert Tsai
Haiying Zhao
Ruslan Salakhutdinov
Nebojsa Jojic
CML
83
1
0
08 Nov 2017
Multi-label Image Recognition by Recurrently Discovering Attentional Regions
Zhouxia Wang
Tianshui Chen
Guanbin Li
Ruijia Xu
Liang Lin
126
291
0
08 Nov 2017
Image Captioning and Classification of Dangerous Situations
Octavio Arriaga
Paul G. Plöger
Matias Valdenegro-Toro
37
8
0
07 Nov 2017
Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent Networks
Nan Rosemary Ke
Anirudh Goyal
O. Bilaniuk
Jonathan Binas
Laurent Charlin
C. Pal
Yoshua Bengio
78
15
0
07 Nov 2017
Semantic Image Retrieval via Active Grounding of Visual Situations
Max H. Quinn
E. Conser
Jordan M. Witte
Melanie Mitchell
69
9
0
31 Oct 2017
Fraternal Dropout
Konrad Zolna
Devansh Arpit
Dendi Suhubdy
Yoshua Bengio
92
53
0
31 Oct 2017
Whodunnit? Crime Drama as a Case for Natural Language Understanding
Lea Frermann
Shay B. Cohen
Mirella Lapata
67
26
0
31 Oct 2017
Melody Generation for Pop Music via Word Representation of Musical Properties
Andrew Shin
Léopold Crestel
Hiroharu Kato
Kuniaki Saito
Katsunori Ohnishi
Masataka Yamaguchi
Masahiro Nakawaki
Yoshitaka Ushiku
Tatsuya Harada
MGen
72
12
0
31 Oct 2017
How deep learning works --The geometry of deep learning
Xiao Dong
Jiasong Wu
Ling Zhou
GNN
66
8
0
30 Oct 2017
Understanding Hidden Memories of Recurrent Neural Networks
Yao Ming
Shaozu Cao
Ruixiang Zhang
Zerui Li
Yuanzhe Chen
Yangqiu Song
Huamin Qu
HAI
48
201
0
30 Oct 2017
Sequence-to-Sequence ASR Optimization via Reinforcement Learning
Andros Tjandra
S. Sakti
Satoshi Nakamura
AI4TS
96
26
0
30 Oct 2017
Contextual Regression: An Accurate and Conveniently Interpretable Nonlinear Model for Mining Discovery from Scientific Data
Chengyu Liu
Wei Wang
32
7
0
30 Oct 2017
Phase Conductor on Multi-layered Attentions for Machine Comprehension
R. Liu
Wei Wei
Weiguang Mao
M. Chikina
99
22
0
28 Oct 2017
Learning to diagnose from scratch by exploiting dependencies among labels
L. Yao
Eric Poblenz
Dmitry Dagunts
Ben Covington
D. Bernard
Kevin Lyman
83
336
0
28 Oct 2017
Attention-Based Models for Text-Dependent Speaker Verification
F. R. Chowdhury
Quan Wang
Ignacio López Moreno
Li Wan
89
172
0
28 Oct 2017
Few-shot Autoregressive Density Estimation: Towards Learning to Learn Distributions
Scott E. Reed
Yutian Chen
T. Paine
Aaron van den Oord
S. M. Ali Eslami
Danilo Jimenez Rezende
Oriol Vinyals
Nando de Freitas
130
88
0
27 Oct 2017
Human-in-the-loop Artificial Intelligence
Fabio Massimo Zanzotto
88
270
0
23 Oct 2017
FigureQA: An Annotated Figure Dataset for Visual Reasoning
Samira Ebrahimi Kahou
Vincent Michalski
Adam Atkinson
Ákos Kádár
Adam Trischler
Yoshua Bengio
ReLM
AIMat
91
332
0
19 Oct 2017
Learning Social Image Embedding with Deep Multimodal Attention Networks
Feiran Huang
Xiaoming Zhang
Zhoujun Li
Tao Mei
Yueying He
Zhonghua Zhao
59
20
0
18 Oct 2017
Beat by Beat: Classifying Cardiac Arrhythmias with Recurrent Neural Networks
Patrick Schwab
G. Scebba
Jia Zhang
Marco Delai
W. Karlen
87
76
0
17 Oct 2017
Describing Natural Images Containing Novel Objects with Knowledge Guided Assitance
Aditya Mogadala
Umanga Bista
Lexing Xie
Achim Rettinger
60
7
0
17 Oct 2017
Previous
1
2
3
...
57
58
59
...
69
70
71
Next