Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
Mimic and Fool: A Task Agnostic Adversarial Attack
Akshay Chaturvedi
Utpal Garain
AAML
57
27
0
11 Jun 2019
Relationship-Embedded Representation Learning for Grounding Referring Expressions
Sibei Yang
Guanbin Li
Yizhou Yu
ObjD
97
55
0
11 Jun 2019
Bag of Color Features For Color Constancy
Firas Laakom
Nikolaos Passalis
Jenni Raitoharju
Jarno Nikkanen
Anastasios Tefas
Alexandros Iosifidis
Moncef Gabbouj
46
33
0
11 Jun 2019
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
Yale Song
M. Soleymani
84
247
0
11 Jun 2019
Improving Neural Language Modeling via Adversarial Training
Dilin Wang
Chengyue Gong
Qiang Liu
AAML
124
119
0
10 Jun 2019
An Attention-based Recurrent Convolutional Network for Vehicle Taillight Recognition
Kuan-Hui Lee
Takaaki Tagawa
Jia Pan
Adrien Gaidon
B. Douillard
ViT
37
15
0
09 Jun 2019
Attention-based Conditioning Methods for External Knowledge Integration
Katerina Margatina
Christos Baziotis
Alexandros Potamianos
51
30
0
09 Jun 2019
Attending to Discriminative Certainty for Domain Adaptation
V. Kurmi
Shanu Kumar
Vinay P. Namboodiri
OOD
98
108
0
08 Jun 2019
Figure Captioning with Reasoning and Sequence-Level Training
Charles C. Chen
Ruiyi Zhang
Eunyee Koh
Sungchul Kim
Scott D. Cohen
Tong Yu
Ryan Rossi
Razvan Bunescu
AIMat
69
39
0
07 Jun 2019
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video
Zhenfang Chen
Lin Ma
Wenhan Luo
Kwan-Yee K. Wong
105
103
0
06 Jun 2019
Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
Alex Mott
Daniel Zoran
Mike Chrzanowski
Daan Wierstra
Danilo Jimenez Rezende
74
192
0
06 Jun 2019
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering
Zhou Yu
D. Xu
Jun-chen Yu
Ting Yu
Zhou Zhao
Yueting Zhuang
Dacheng Tao
157
478
0
06 Jun 2019
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Zhengjun Zha
Daqing Liu
Hanwang Zhang
Yongdong Zhang
Feng Wu
66
122
0
06 Jun 2019
Neural Legal Judgment Prediction in English
Ilias Chalkidis
Ion Androutsopoulos
Nikolaos Aletras
AILaw
ELM
190
342
0
05 Jun 2019
Large-Scale Multi-Label Text Classification on EU Legislation
Ilias Chalkidis
Manos Fergadiotis
Prodromos Malakasiotis
Ion Androutsopoulos
AILaw
66
217
0
05 Jun 2019
Machine Learning and System Identification for Estimation in Physical Systems
Fredrik Bagge Carlson
OOD
56
5
0
05 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences
William Chan
Nikita Kitaev
Kelvin Guu
Mitchell Stern
Jakob Uszkoreit
VLM
96
65
0
04 Jun 2019
Natural Vocabulary Emerges from Free-Form Annotations
Jordi Pont-Tuset
Michael Gygli
V. Ferrari
VLM
90
3
0
04 Jun 2019
Masked Non-Autoregressive Image Captioning
Junlong Gao
Xi Meng
Shiqi Wang
Xia Li
Shanshe Wang
Siwei Ma
Wen Gao
80
39
0
03 Jun 2019
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic Attention for Neural TTS
Mutian He
Yan Deng
Lei He
97
81
0
03 Jun 2019
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain
Johanes Effendi
Andros Tjandra
S. Sakti
Satoshi Nakamura
68
3
0
03 Jun 2019
A Survey of Natural Language Generation Techniques with a Focus on Dialogue Systems - Past, Present and Future Directions
Sashank Santhanam
Samira Shaikh
3DV
84
52
0
02 Jun 2019
Unsupervised Bilingual Lexicon Induction from Mono-lingual Multimodal Data
Shizhe Chen
Qin Jin
Alexander G. Hauptmann
SSL
46
9
0
02 Jun 2019
Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video
Jian Liu
Naveed Akhtar
Ajmal Mian
3DH
66
10
0
01 Jun 2019
Do Human Rationales Improve Machine Explanations?
Julia Strout
Ye Zhang
Raymond J. Mooney
89
58
0
31 May 2019
Audio Caption in a Car Setting with a Sentence-Level Loss
Xuenan Xu
Heinrich Dinkel
Mengyue Wu
Kai Yu
31
2
0
31 May 2019
Interactive-predictive neural multimodal systems
Álvaro Peris
F. Casacuberta
KELM
HAI
47
2
0
30 May 2019
Meta Dropout: Learning to Perturb Features for Generalization
Haebeom Lee
Taewook Nam
Eunho Yang
Sung Ju Hwang
OOD
68
3
0
30 May 2019
Adversarial Sub-sequence for Text Generation
Xingyuan Chen
Yanzhe Li
Peng Jin
Jiuhua Zhang
Xinyu Dai
Jiajun Chen
Gang Song
GAN
55
5
0
30 May 2019
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback
Hui Wu
Yupeng Gao
Xiaoxiao Guo
Ziad Al-Halah
Steven J. Rennie
Kristen Grauman
Rogerio Feris
EgoV
167
68
0
30 May 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
79
37
0
29 May 2019
Recurrent Existence Determination Through Policy Optimization
Baoxiang Wang
47
1
0
29 May 2019
Semantic Fisher Scores for Task Transfer: Using Objects to Classify Scenes
Mandar Dixit
Yunsheng Li
Nuno Vasconcelos
86
14
0
27 May 2019
Audio2Face: Generating Speech/Face Animation from Single Audio with Attention-Based Bidirectional LSTM Networks
Guanzhong Tian
Yi Yuan
Yang Liu
CVBM
86
45
0
27 May 2019
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient Models
Linfeng Zhang
Zhanhong Tan
Jiebo Song
Jingwei Chen
Chenglong Bao
Kaisheng Ma
55
71
0
27 May 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Jeff Clune
148
122
0
27 May 2019
Transcribing Content from Structural Images with Spotlight Mechanism
Yu Yin
Zhenya Huang
Enhong Chen
Qi Liu
Fuzheng Zhang
Xing Xie
Guoping Hu
48
22
0
27 May 2019
Extreme Multi-Label Legal Text Classification: A case study in EU Legislation
Ilias Chalkidis
Manos Fergadiotis
Prodromos Malakasiotis
Nikolaos Aletras
Ion Androutsopoulos
AILaw
86
75
0
26 May 2019
Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives
Yi Tay
Shuohang Wang
Anh Tuan Luu
Jie Fu
Minh C. Phan
Xingdi Yuan
J. Rao
S. Hui
Aston Zhang
118
110
0
26 May 2019
A Survey on Biomedical Image Captioning
Vasiliki Kougia
John Pavlopoulos
Ion Androutsopoulos
MedIm
94
83
0
26 May 2019
Path Ranking with Attention to Type Hierarchies
Weiyu Liu
A. Daruna
Z. Kira
Sonia Chernova
AIMat
70
13
0
26 May 2019
DIANet: Dense-and-Implicit Attention Network
Zhongzhan Huang
Senwei Liang
Mingfu Liang
Haizhao Yang
CVBM
82
57
0
25 May 2019
Bivariate Beta-LSTM
Kyungwoo Song
Joonho Jang
Seung-Jae Shin
Il-Chul Moon
51
6
0
25 May 2019
Pose-adaptive Hierarchical Attention Network for Facial Expression Recognition
Yuanyuan Liu
Jiyao Peng
Jiabei Zeng
Shiguang Shan
CVBM
69
16
0
24 May 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma
Per-Arne Andersen
Ole-Christoffer Granmo
M. G. Olsen
AI4CE
78
70
0
23 May 2019
AttentionRNN: A Structured Spatial Attention Mechanism
Siddhesh Khandelwal
Leonid Sigal
71
3
0
22 May 2019
What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention
Antonino Furnari
G. Farinella
EgoV
141
175
0
22 May 2019
A Neural, Interactive-predictive System for Multimodal Sequence to Sequence Tasks
Álvaro Peris
F. Casacuberta
48
4
0
20 May 2019
Image Captioning based on Deep Learning Methods: A Survey
Yiyu Wang
Jungang Xu
Yingfei Sun
Xianpei Han
VLM
44
7
0
20 May 2019
Less Memory, Faster Speed: Refining Self-Attention Module for Image Reconstruction
Zheng Wang
Jianwu Li
Ge Song
Tieling Li
28
2
0
20 May 2019
Previous
1
2
3
...
41
42
43
...
69
70
71
Next