arXiv: 1502.03044
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Kelvin Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,509 papers shown
Fine-grained Anomaly Detection in Sequential Data via Counterfactual Explanations
He Cheng
Depeng Xu
Shuhan Yuan
Xintao Wu
AI4TS
35
3
0
09 Oct 2022
Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling
Hsin-Ying Lee
Hung-Ting Su
Bing-Chen Tsai
Tsung-Han Wu
Jia-Fong Yeh
Winston H. Hsu
27
2
0
08 Oct 2022
Contextual Modeling for 3D Dense Captioning on Point Clouds
Yufeng Zhong
Longdao Xu
Jiebo Luo
Lin Ma
44
15
0
08 Oct 2022
LOCL: Learning Object-Attribute Composition using Localization
Satish Kumar
A S M Iftekhar
Ekta Prashnani
B. S. Manjunath
19
3
0
07 Oct 2022
Quantitative Metrics for Evaluating Explanations of Video DeepFake Detectors
Federico Baldassarre
Quentin Debard
Gonzalo Fiz Pontiveros
Tri Kurniawan Wijaya
44
4
0
07 Oct 2022
CLEAR: Causal Explanations from Attention in Neural Recommenders
Shami Nisimov
R. Y. Rohekar
Yaniv Gurwicz
G. Koren
Gal Novik
CML
23
6
0
07 Oct 2022
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
Khoa T. Vo
Sang Truong
Kashu Yamazaki
Bhiksha Raj
Minh-Triet Tran
Ngan Le
86
26
0
05 Oct 2022
Improved Anomaly Detection by Using the Attention-Based Isolation Forest
Lev V. Utkin
A. Ageev
A. Konstantinov
41
6
0
05 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
Ye Zhu
Yuehua Wu
N. Sebe
Yan Yan
35
16
0
05 Oct 2022
Affection: Learning Affective Explanations for Real-World Visual Data
Panos Achlioptas
M. Ovsjanikov
Leonidas J. Guibas
Sergey Tulyakov
83
11
0
04 Oct 2022
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Xu Yang
Hanwang Zhang
Chongyang Gao
Jianfei Cai
MLLM
40
10
0
04 Oct 2022
Music-to-Text Synaesthesia: Generating Descriptive Text from Music Recordings
Zhihuan Kuang
Shi Zong
Jianbing Zhang
Jiajun Chen
Hongfu Liu
30
4
0
02 Oct 2022
MaskTune: Mitigating Spurious Correlations by Forcing to Explore
Saeid Asgari Taghanaki
Aliasghar Khani
Fereshte Khani
A. Gholami
Linh-Tam Tran
Ali Mahdavi-Amiri
Ghassan Hamarneh
AAML
43
45
0
30 Sep 2022
Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning
Thi-Thu-Hong Phan
Duc Le
Brijesh Patel
Donald Adjeroh
Jingxian Wu
M. Jensen
Ngan Le
30
11
0
30 Sep 2022
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation
R. Ramos
Bruno Martins
Desmond Elliott
Yova Kementchedjhieva
VLM
30
86
0
30 Sep 2022
Medical Image Captioning via Generative Pretrained Transformers
Alexander Selivanov
Oleg Y. Rogov
Daniil Chesakov
Artem Shelmanov
Irina Fedulova
Dmitry Dylov
MedIm
57
55
0
28 Sep 2022
InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference
Mu Yuan
Lan Zhang
Fengxiang He
Xueting Tong
Miao-Hui Song
Zhengyuan Xu
Xiang-Yang Li
32
2
0
28 Sep 2022
RepsNet: Combining Vision with Language for Automated Medical Reports
A. Tanwani
Joelle Barral
Daniel Freedman
MedIm
44
20
0
27 Sep 2022
STING: Self-attention based Time-series Imputation Networks using GAN
Eunkyu Oh
Taehun Kim
Yunhu Ji
Sushil Khyalia
AI4TS
29
25
0
22 Sep 2022
DRAMA: Joint Risk Localization and Captioning in Driving
Srikanth Malla
Chiho Choi
Isht Dwivedi
Joonhyang Choi
Jiachen Li
107
87
0
22 Sep 2022
Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia
K. Nguyen
Ali Furkan Biten
Andrés Mafla
Lluís Gómez
Dimosthenis Karatzas
36
10
0
21 Sep 2022
Active Particle Filter Networks: Efficient Active Localization in Continuous Action Spaces and Large Maps
Daniel Honerkamp
Suresh Guttikonda
Abhinav Valada
27
2
0
20 Sep 2022
Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the Cloud
Geraldo F. Oliveira
Juan Gómez Luna
Saugata Ghose
Amirali Boroumand
O. Mutlu
29
24
0
19 Sep 2022
Learning Distinct and Representative Styles for Image Captioning
Qi Chen
Chaorui Deng
Qi Wu
VLM
42
23
0
17 Sep 2022
Belief Revision based Caption Re-ranker with Visual Semantic Information
Ahmed Sabir
Francesc Moreno-Noguer
Pranava Madhyastha
Lluís Padró
BDL
32
2
0
16 Sep 2022
M^4I: Multi-modal Models Membership Inference
Pingyi Hu
Zihan Wang
Ruoxi Sun
Hu Wang
Minhui Xue
39
26
0
15 Sep 2022
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition
Kartik Audhkhasi
Yinghui Huang
Bhuvana Ramabhadran
Pedro J. Moreno
24
3
0
13 Sep 2022
Vision Transformers for Action Recognition: A Survey
Anwaar Ulhaq
Naveed Akhtar
Ganna Pogrebna
Ajmal Mian
ViT
28
44
0
13 Sep 2022
Evaluation of Question Answering Systems: Complexity of judging a natural language
Amer Farea
Zhen Yang
Kien Duong
Nadeesha Perera
F. Emmert-Streib
ELM
31
3
0
10 Sep 2022
Foundations and Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions
Paul Pu Liang
Amir Zadeh
Louis-Philippe Morency
18
62
0
07 Sep 2022
RF Fingerprinting Needs Attention: Multi-task Approach for Real-World WiFi and Bluetooth
Anu Jagannath
Zackary Kane
Jithin Jagannath
33
11
0
07 Sep 2022
Parallel and Streaming Wavelet Neural Networks for Classification and Regression under Apache Spark
E Venkatesh
Yelleti Vivek
V. Ravi
Shiva Shankar Orsu
16
6
0
07 Sep 2022
A Weakly Supervised Learning Framework for Salient Object Detection via Hybrid Labels
Runmin Cong
Qi Qin
Chen Zhang
Qiuping Jiang
Shi Wang
Yao-Min Zhao
Sam Kwong
54
52
0
07 Sep 2022
Bridging Music and Text with Crowdsourced Music Comments: A Sequence-to-Sequence Framework for Thematic Music Comments Generation
Peining Zhang
Junliang Guo
Linli Xu
Mu You
Junming Yin
22
0
0
05 Sep 2022
MMKGR: Multi-hop Multi-modal Knowledge Graph Reasoning
Shangfei Zheng
Weiqing Wang
Jianfeng Qu
Hongzhi Yin
Wei Chen
Lei Zhao
LRM
21
22
0
03 Sep 2022
vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM
Thanh Van Nguyen
Long H. Nguyen
Nhat Truong Pham
Liu Tai Nguyen
Van Huong Do
Hai Nguyen
Ngoc Duy Nguyen
VLM
ViT
20
1
0
03 Sep 2022
EGFR Mutation Prediction of Lung Biopsy Images using Deep Learning
R. Gupta
Shivani Nandgaonkar
Nikhil Cherian Kurian
S. Rane
A. Sethi
MedIm
39
7
0
26 Aug 2022
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window
Mocho Go
Hideyuki Tachibana
ViT
37
9
0
24 Aug 2022
Large-Scale Traffic Congestion Prediction based on Multimodal Fusion and Representation Mapping
Bo Zhou
Jiahui Liu
Songyi Cui
Yaping Zhao
26
5
0
23 Aug 2022
A Medical Semantic-Assisted Transformer for Radiographic Report Generation
Zhanyu Wang
Mingkang Tang
Lei Wang
Xiu Li
Luping Zhou
ViT
MedIm
24
57
0
22 Aug 2022
Mix-Pooling Strategy for Attention Mechanism
Shan Zhong
Wushao Wen
Jinghui Qin
33
3
0
22 Aug 2022
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Haoran Wang
Dongliang He
Wenhao Wu
Boyang Xia
Min Yang
Fu Li
YunLong Yu
Zhong Ji
Errui Ding
Jingdong Wang
30
23
0
21 Aug 2022
Offline Handwritten Mathematical Recognition using Adversarial Learning and Transformers
U. Thakur
Anuj Sharma
OffRL
24
4
0
20 Aug 2022
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview Pedestrian Detection with Attention
Jinwoo Hwang
Philipp Benz
Tae-Hoon Kim
ViT
31
3
0
19 Aug 2022
Sequence Prediction Under Missing Data : An RNN Approach Without Imputation
Soumen Pachal
Avinash Achar
AI4TS
14
4
0
18 Aug 2022
Look in Different Views: Multi-Scheme Regression Guided Cell Instance Segmentation
Menghao Li
W. Feng
Shuchang Lyu
Lijiang Chen
Qi Zhao
27
0
0
17 Aug 2022
Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning
J. Hu
Roberto Cavicchioli
Alessandro Capotondi
29
21
0
13 Aug 2022
A Means-End Account of Explainable Artificial Intelligence
O. Buchholz
XAI
37
12
0
09 Aug 2022
Distinctive Image Captioning via CLIP Guided Group Optimization
Youyuan Zhang
Jiuniu Wang
Hao Wu
Wenjia Xu
VLM
40
8
0
08 Aug 2022
Sparse Attentive Memory Network for Click-through Rate Prediction with Long Sequences
Qianying Lin
Wen-Ji Zhou
Yanshi Wang
Qing Da
Qingguo Chen
Bing Wang
VLM
23
9
0
08 Aug 2022