Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,509 papers shown
Title
Fine-grained Anomaly Detection in Sequential Data via Counterfactual Explanations
He Cheng
Depeng Xu
Shuhan Yuan
Xintao Wu
AI4TS
35
3
0
09 Oct 2022
Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling
Hsin-Ying Lee
Hung-Ting Su
Bing-Chen Tsai
Tsung-Han Wu
Jia-Fong Yeh
Winston H. Hsu
27
2
0
08 Oct 2022
Contextual Modeling for 3D Dense Captioning on Point Clouds
Yufeng Zhong
Longdao Xu
Jiebo Luo
Lin Ma
44
15
0
08 Oct 2022
LOCL: Learning Object-Attribute Composition using Localization
Satish Kumar
A S M Iftekhar
Ekta Prashnani
B.S.Manjunath
19
3
0
07 Oct 2022
Quantitative Metrics for Evaluating Explanations of Video DeepFake Detectors
Federico Baldassarre
Quentin Debard
Gonzalo Fiz Pontiveros
Tri Kurniawan Wijaya
44
4
0
07 Oct 2022
CLEAR: Causal Explanations from Attention in Neural Recommenders
Shami Nisimov
R. Y. Rohekar
Yaniv Gurwicz
G. Koren
Gal Novik
CML
23
6
0
07 Oct 2022
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
Khoa T. Vo
Sang Truong
Kashu Yamazaki
Bhiksha Raj
Minh-Triet Tran
Ngan Le
86
26
0
05 Oct 2022
Improved Anomaly Detection by Using the Attention-Based Isolation Forest
Lev V. Utkin
A. Ageev
A. Konstantinov
41
6
0
05 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
Ye Zhu
Yuehua Wu
N. Sebe
Yan Yan
35
16
0
05 Oct 2022
Affection: Learning Affective Explanations for Real-World Visual Data
Panos Achlioptas
M. Ovsjanikov
Leonidas J. Guibas
Sergey Tulyakov
83
11
0
04 Oct 2022
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Xu Yang
Hanwang Zhang
Chongyang Gao
Jianfei Cai
MLLM
40
10
0
04 Oct 2022
Music-to-Text Synaesthesia: Generating Descriptive Text from Music Recordings
Zhihuan Kuang
Shi Zong
Jianbing Zhang
Jiajun Chen
Hongfu Liu
30
4
0
02 Oct 2022
MaskTune: Mitigating Spurious Correlations by Forcing to Explore
Saeid Asgari Taghanaki
Aliasghar Khani
Fereshte Khani
A. Gholami
Linh-Tam Tran
Ali Mahdavi-Amiri
Ghassan Hamarneh
AAML
43
45
0
30 Sep 2022
Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning
Thi-Thu-Hong Phan
Duc Le
Brijesh Patel
Donald Adjeroh
Jingxian Wu
M. Jensen
Ngan Le
30
11
0
30 Sep 2022
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation
R. Ramos
Bruno Martins
Desmond Elliott
Yova Kementchedjhieva
VLM
30
86
0
30 Sep 2022
Medical Image Captioning via Generative Pretrained Transformers
Alexander Selivanov
Oleg Y. Rogov
Daniil Chesakov
Artem Shelmanov
Irina Fedulova
Dmitry Dylov
MedIm
57
55
0
28 Sep 2022
InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference
Mu Yuan
Lan Zhang
Fengxiang He
Xueting Tong
Miao-Hui Song
Zhengyuan Xu
Xiang-Yang Li
32
2
0
28 Sep 2022
RepsNet: Combining Vision with Language for Automated Medical Reports
A. Tanwani
Joelle Barral
Daniel Freedman
MedIm
44
20
0
27 Sep 2022
STING: Self-attention based Time-series Imputation Networks using GAN
Eunkyu Oh
Taehun Kim
Yunhu Ji
Sushil Khyalia
AI4TS
29
25
0
22 Sep 2022
DRAMA: Joint Risk Localization and Captioning in Driving
Srikanth Malla
Chiho Choi
Isht Dwivedi
Joonhyang Choi
Jiachen Li
107
87
0
22 Sep 2022
Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia
K. Nguyen
Ali Furkan Biten
Andrés Mafla
Lluís Gómez
Dimosthenis Karatzas
36
10
0
21 Sep 2022
Active Particle Filter Networks: Efficient Active Localization in Continuous Action Spaces and Large Maps
Daniel Honerkamp
Suresh Guttikonda
Abhinav Valada
27
2
0
20 Sep 2022
Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the Cloud
Geraldo F. Oliveira
Juan Gómez Luna
Saugata Ghose
Amirali Boroumand
O. Mutlu
29
24
0
19 Sep 2022
Learning Distinct and Representative Styles for Image Captioning
Qi Chen
Chaorui Deng
Qi Wu
VLM
42
23
0
17 Sep 2022
Belief Revision based Caption Re-ranker with Visual Semantic Information
Ahmed Sabir
Francesc Moreno-Noguer
Pranava Madhyastha
Lluís Padró
BDL
32
2
0
16 Sep 2022
M^4I: Multi-modal Models Membership Inference
Pingyi Hu
Zihan Wang
Ruoxi Sun
Hu Wang
Minhui Xue
39
26
0
15 Sep 2022
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition
Kartik Audhkhasi
Yinghui Huang
Bhuvana Ramabhadran
Pedro J. Moreno
24
3
0
13 Sep 2022
Vision Transformers for Action Recognition: A Survey
Anwaar Ulhaq
Naveed Akhtar
Ganna Pogrebna
Ajmal Mian
ViT
28
44
0
13 Sep 2022
Evaluation of Question Answering Systems: Complexity of judging a natural language
Amer Farea
Zhen Yang
Kien Duong
Nadeesha Perera
F. Emmert-Streib
ELM
31
3
0
10 Sep 2022
Foundations and Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions
Paul Pu Liang
Amir Zadeh
Louis-Philippe Morency
18
62
0
07 Sep 2022
RF Fingerprinting Needs Attention: Multi-task Approach for Real-World WiFi and Bluetooth
Anu Jagannath
Zackary Kane
Jithin Jagannath
33
11
0
07 Sep 2022
Parallel and Streaming Wavelet Neural Networks for Classification and Regression under Apache Spark
E Venkatesh
Yelleti Vivek
V. Ravi
Shiva Shankar Orsu
16
6
0
07 Sep 2022
A Weakly Supervised Learning Framework for Salient Object Detection via Hybrid Labels
Runmin Cong
Qi Qin
Chen Zhang
Qiuping Jiang
Shi Wang
Yao-Min Zhao
Sam Kwong
54
52
0
07 Sep 2022
Bridging Music and Text with Crowdsourced Music Comments: A Sequence-to-Sequence Framework for Thematic Music Comments Generation
Peining Zhang
Junliang Guo
Linli Xu
Mu You
Junming Yin
22
0
0
05 Sep 2022
MMKGR: Multi-hop Multi-modal Knowledge Graph Reasoning
Shangfei Zheng
Weiqing Wang
Jianfeng Qu
Hongzhi Yin
Wei Chen
Lei Zhao
LRM
21
22
0
03 Sep 2022
vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM
THANH VAN NGUYEN
Long H. Nguyen
Nhat Truong Pham
Liu Tai Nguyen
Van Huong Do
Hai Nguyen
Ngoc Duy Nguyen
VLM
ViT
20
1
0
03 Sep 2022
EGFR Mutation Prediction of Lung Biopsy Images using Deep Learning
R. Gupta
Shivani Nandgaonkar
Nikhil Cherian Kurian
S. Rane
A. Sethi
MedIm
39
7
0
26 Aug 2022
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window
Mocho Go
Hideyuki Tachibana
ViT
37
9
0
24 Aug 2022
Large-Scale Traffic Congestion Prediction based on Multimodal Fusion and Representation Mapping
Bo Zhou
Jiahui Liu
Songyi Cui
Yaping Zhao
26
5
0
23 Aug 2022
A Medical Semantic-Assisted Transformer for Radiographic Report Generation
Zhanyu Wang
Mingkang Tang
Lei Wang
Xiu Li
Luping Zhou
ViT
MedIm
24
57
0
22 Aug 2022
Mix-Pooling Strategy for Attention Mechanism
Shan Zhong
Wushao Wen
Jinghui Qin
33
3
0
22 Aug 2022
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Haoran Wang
Dongliang He
Wenhao Wu
Boyang Xia
Min Yang
Fu Li
YunLong Yu
Zhong Ji
Errui Ding
Jingdong Wang
30
23
0
21 Aug 2022
Offline Handwritten Mathematical Recognition using Adversarial Learning and Transformers
U. Thakur
Anuj Sharma
OffRL
24
4
0
20 Aug 2022
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview Pedestrian Detection with Attention
Jinwoo Hwang
Philipp Benz
Tae-Hoon Kim
ViT
31
3
0
19 Aug 2022
Sequence Prediction Under Missing Data : An RNN Approach Without Imputation
Soumen Pachal
Avinash Achar
AI4TS
14
4
0
18 Aug 2022
Look in Different Views: Multi-Scheme Regression Guided Cell Instance Segmentation
Menghao Li
W. Feng
Shuchang Lyu
Lijiang Chen
Qi Zhao
27
0
0
17 Aug 2022
Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning
J. Hu
Roberto Cavicchioli
Alessandro Capotondi
29
21
0
13 Aug 2022
A Means-End Account of Explainable Artificial Intelligence
O. Buchholz
XAI
37
12
0
09 Aug 2022
Distinctive Image Captioning via CLIP Guided Group Optimization
Youyuan Zhang
Jiuniu Wang
Hao Wu
Wenjia Xu
VLM
40
8
0
08 Aug 2022
Sparse Attentive Memory Network for Click-through Rate Prediction with Long Sequences
Qianying Lin
Wen-Ji Zhou
Yanshi Wang
Qing Da
Qingguo Chen
Bing Wang
VLM
23
9
0
08 Aug 2022
Previous
1
2
3
...
11
12
13
...
69
70
71
Next