Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
FoleyGAN: Visually Guided Generative Adversarial Network-Based Synchronous Sound Generation in Silent Videos
Sanchita Ghose
John J. Prevost
GAN
67
26
0
20 Jul 2021
Class dependency based learning using Bi-LSTM coupled with the transfer learning of VGG16 for the diagnosis of Tuberculosis from chest x-rays
G. Jignesh Chowdary
G. Suganya
M. Premalatha
K. Karunamurthy
61
6
0
19 Jul 2021
Mediated Uncoupled Learning: Learning Functions without Direct Input-output Correspondences
Ikko Yamane
Junya Honda
Florian Yger
Masashi Sugiyama
SSL
FedML
OOD
59
1
0
16 Jul 2021
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning
Paul Pu Liang
Yiwei Lyu
Xiang Fan
Zetian Wu
Yun Cheng
...
Peter Wu
Michelle A. Lee
Yuke Zhu
Ruslan Salakhutdinov
Louis-Philippe Morency
VLM
111
172
0
15 Jul 2021
Variational Topic Inference for Chest X-Ray Report Generation
Ivona Najdenkoska
Xiantong Zhen
M. Worring
Ling Shao
MedIm
88
29
0
15 Jul 2021
An Overview and Experimental Study of Learning-based Optimization Algorithms for Vehicle Routing Problem
Bingjie Li
Guohua Wu
Yongming He
Mingfeng Fan
Witold Pedrycz
114
70
0
15 Jul 2021
Passive Attention in Artificial Neural Networks Predicts Human Visual Selectivity
Thomas A. Langlois
H. C. Zhao
Erin Grant
Ishita Dasgupta
Thomas Griffiths
Nori Jacoby
89
16
0
14 Jul 2021
Surgical Instruction Generation with Transformers
Jinglu Zhang
Y. Nie
Jian Chang
Jiangning Zhang
MedIm
94
13
0
14 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
162
270
0
14 Jul 2021
Multi-Scale Label Relation Learning for Multi-Label Classification Using 1-Dimensional Convolutional Neural Networks
Junhyung Lyle Kim
Byungyoon Park
Charmgil Hong
31
0
0
13 Jul 2021
Human Attention during Goal-directed Reading Comprehension Relies on Task Optimization
Jiajie Zou
Yuran Zhang
Jialu Li
Xing Tian
Nai Ding
AIMat
92
2
0
13 Jul 2021
Split, embed and merge: An accurate table structure recognizer
Zhenrong Zhang
Jianshu Zhang
Jun Du
LMTD
187
62
0
12 Jul 2021
Legal Judgment Prediction with Multi-Stage CaseRepresentation Learning in the Real Court Setting
Luyao Ma
Yating Zhang
Tianyi Wang
Xiaozhong Liu
Wei Ye
Changlong Sun
Shikun Zhang
ELM
AILaw
99
59
0
12 Jul 2021
Levels of explainable artificial intelligence for human-aligned conversational explanations
Richard Dazeley
Peter Vamplew
Cameron Foale
Charlotte Young
Sunil Aryal
F. Cruz
65
93
0
07 Jul 2021
Controlled Caption Generation for Images Through Adversarial Attacks
Nayyer Aafaq
Naveed Akhtar
Wei Liu
M. Shah
Ajmal Mian
AAML
59
10
0
07 Jul 2021
Self-Adversarial Training incorporating Forgery Attention for Image Forgery Localization
Longhao Zhuo
Shunquan Tan
Bin Li
Jiwu Huang
AAML
57
74
0
06 Jul 2021
Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling
Qingyong Hu
Bo Yang
Linhai Xie
Stefano Rosa
Yulan Guo
Zhihua Wang
Niki Trigoni
Andrew Markham
3DPC
88
184
0
06 Jul 2021
RATCHET: Medical Transformer for Chest X-ray Diagnosis and Reporting
Benjamin Hou
Georgios Kaissis
Ronald M. Summers
Bernhard Kainz
ViT
LM&MA
MedIm
93
53
0
05 Jul 2021
Gradient Importance Learning for Incomplete Observations
Qitong Gao
Dong Wang
Joshua D. Amason
Siyang Yuan
Chenyang Tao
Ricardo Henao
M. Hadziahmetovic
Lawrence Carin
Miroslav Pajic
50
10
0
05 Jul 2021
Audio-Oriented Multimodal Machine Comprehension: Task, Dataset and Model
Zhiqi Huang
Fenglin Liu
Xian Wu
Shen Ge
Helin Wang
Wei Fan
Yuexian Zou
AuLLM
57
2
0
04 Jul 2021
Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions
Motonari Kambara
K. Sugiura
ViT
62
6
0
02 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
169
101
0
01 Jul 2021
VideoLightFormer: Lightweight Action Recognition using Transformers
Raivo Koot
Haiping Lu
ViT
135
6
0
01 Jul 2021
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring
Jianing Qiu
Frank P.-W. Lo
Xiao Gu
M. Jobarteh
Wenyan Jia
...
M. McCrory
Edward Sazonov
Mingui Sun
Gary Frost
Benny Lo
EgoV
66
19
0
01 Jul 2021
MissFormer: (In-)attention-based handling of missing observations for trajectory filtering and prediction
S. Becker
Ronny Hug
Wolfgang Hubner
Michael Arens
B. Morris
69
4
0
30 Jun 2021
Attention Aware Wavelet-based Detection of Morphed Face Images
Poorya Aghdaie
Baaria Chaudhary
Sobhan Soleymani
J. Dawson
Nasser M. Nasrabadi
CVBM
76
30
0
29 Jun 2021
Contrastive Semantic Similarity Learning for Image Captioning Evaluation with Intrinsic Auto-encoder
Chao Zeng
Tiesong Zhao
Sam Kwong
94
2
0
29 Jun 2021
SALYPATH: A Deep-Based Architecture for visual attention prediction
M. A. Kerkouri
Marouane Tliba
A. Chetouani
R. Harba
FAtt
MDE
59
9
0
29 Jun 2021
Saying the Unseen: Video Descriptions via Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
73
6
0
26 Jun 2021
Neural Fashion Image Captioning : Accounting for Data Diversity
Gilles Hacheme
Nouréini Sayouti
69
13
0
23 Jun 2021
Probabilistic Attention for Interactive Segmentation
Prasad Gabbur
Manjot Bilkhu
J. Movellan
103
13
0
23 Jun 2021
Interventional Video Grounding with Dual Contrastive Learning
Guoshun Nan
Rui Qiao
Yao Xiao
Jun Liu
Sicong Leng
H. Zhang
Wei Lu
98
145
0
21 Jun 2021
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation
Yixin Wang
Zihao Lin
Zhe Xu
Haoyu Dong
Jiang Tian
Jie Luo
Zhongchao Shi
Yang Zhang
Jianping Fan
Zhiqiang He
UQCV
MedIm
122
12
0
21 Jun 2021
Exploring Semantic Relationships for Unpaired Image Captioning
Fenglin Liu
Meng Gao
Tianhao Zhang
Yuexian Zou
142
7
0
20 Jun 2021
Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?
Prasanna Parthasarathi
J. Pineau
Sarath Chandar
64
2
0
20 Jun 2021
Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering
Ahjeong Seo
Gi-Cheon Kang
J. Park
Byoung-Tak Zhang
82
54
0
19 Jun 2021
Learning to Predict Visual Attributes in the Wild
Khoi Pham
Kushal Kafle
Zhe Lin
Zhi Ding
Scott D. Cohen
Q. Tran
Abhinav Shrivastava
54
114
0
17 Jun 2021
Semi-Autoregressive Transformer for Image Captioning
Yuanen Zhou
Yong Zhang
Zhenzhen Hu
Meng Wang
VLM
78
25
0
17 Jun 2021
Invertible Attention
Jiajun Zha
Yiran Zhong
Jing Zhang
Leonid Sigal
Liang Zheng
82
7
0
16 Jun 2021
Soft Attention: Does it Actually Help to Learn Social Interactions in Pedestrian Trajectory Prediction?
L. Boucaud
Daniel Aloise
Nicolas Saunier
HAI
41
0
0
16 Jun 2021
Kernel Identification Through Transformers
F. Simpson
Ian Davies
V. Lalchand
A. Vullo
N. Durrande
C. Rasmussen
63
11
0
15 Jun 2021
Contrastive Attention for Automatic Chest X-ray Report Generation
Fenglin Liu
Changchang Yin
Xian Wu
Shen Ge
Yuexian Zou
Ping Zhang
Yuexian Zou
Xu Sun
MedIm
137
153
0
13 Jun 2021
Exploring and Distilling Posterior and Prior Knowledge for Radiology Report Generation
Fenglin Liu
Xian Wu
Shen Ge
Wei Fan
Yuexian Zou
MedIm
120
262
0
13 Jun 2021
Bayesian Attention Belief Networks
Shujian Zhang
Xinjie Fan
Bo Chen
Mingyuan Zhou
BDL
114
32
0
09 Jun 2021
Salient Object Ranking with Position-Preserved Attention
Haoyang Fang
Daoxin Zhang
Yi Zhang
Minghao Chen
Jiawei Li
Yao Hu
Deng Cai
Xiaofei He
71
21
0
09 Jun 2021
Object Based Attention Through Internal Gating
Jordan Lei
Ari S. Benjamin
Konrad Paul Kording
OCL
40
4
0
08 Jun 2021
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models
Chenfeng Xu
Shijia Yang
Tomer Galanti
Bichen Wu
Xiangyu Yue
Bohan Zhai
Wei Zhan
Peter Vajda
Kurt Keutzer
Masayoshi Tomizuka
3DPC
62
55
0
08 Jun 2021
Lessons learned developing and using a machine learning model to automatically transcribe 2.3 million handwritten occupation codes
Bjorn-Richard Pedersen
Einar J. Holsbø
Trygve Andersen
N. Shvetsov
Johan Ravn
H. Sommerseth
L. A. Bongo
AI4TS
39
6
0
07 Jun 2021
Relative Importance in Sentence Processing
Nora Hollenstein
Lisa Beinborn
FAtt
82
32
0
07 Jun 2021
Adversarially Regularized Graph Attention Networks for Inductive Learning on Partially Labeled Graphs
Jiaren Xiao
Quanyu Dai
Xiaochen Xie
J. Lam
Ka-Wai Kwok
GNN
78
7
0
07 Jun 2021
Previous
1
2
3
...
20
21
22
...
69
70
71
Next