ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
FoleyGAN: Visually Guided Generative Adversarial Network-Based
  Synchronous Sound Generation in Silent Videos
FoleyGAN: Visually Guided Generative Adversarial Network-Based Synchronous Sound Generation in Silent Videos
Sanchita Ghose
John J. Prevost
GAN
67
26
0
20 Jul 2021
Class dependency based learning using Bi-LSTM coupled with the transfer
  learning of VGG16 for the diagnosis of Tuberculosis from chest x-rays
Class dependency based learning using Bi-LSTM coupled with the transfer learning of VGG16 for the diagnosis of Tuberculosis from chest x-rays
G. Jignesh Chowdary
G. Suganya
M. Premalatha
K. Karunamurthy
61
6
0
19 Jul 2021
Mediated Uncoupled Learning: Learning Functions without Direct
  Input-output Correspondences
Mediated Uncoupled Learning: Learning Functions without Direct Input-output Correspondences
Ikko Yamane
Junya Honda
Florian Yger
Masashi Sugiyama
SSLFedMLOOD
59
1
0
16 Jul 2021
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning
Paul Pu Liang
Yiwei Lyu
Xiang Fan
Zetian Wu
Yun Cheng
...
Peter Wu
Michelle A. Lee
Yuke Zhu
Ruslan Salakhutdinov
Louis-Philippe Morency
VLM
111
172
0
15 Jul 2021
Variational Topic Inference for Chest X-Ray Report Generation
Variational Topic Inference for Chest X-Ray Report Generation
Ivona Najdenkoska
Xiantong Zhen
M. Worring
Ling Shao
MedIm
88
29
0
15 Jul 2021
An Overview and Experimental Study of Learning-based Optimization
  Algorithms for Vehicle Routing Problem
An Overview and Experimental Study of Learning-based Optimization Algorithms for Vehicle Routing Problem
Bingjie Li
Guohua Wu
Yongming He
Mingfeng Fan
Witold Pedrycz
114
70
0
15 Jul 2021
Passive Attention in Artificial Neural Networks Predicts Human Visual
  Selectivity
Passive Attention in Artificial Neural Networks Predicts Human Visual Selectivity
Thomas A. Langlois
H. C. Zhao
Erin Grant
Ishita Dasgupta
Thomas Griffiths
Nori Jacoby
89
16
0
14 Jul 2021
Surgical Instruction Generation with Transformers
Surgical Instruction Generation with Transformers
Jinglu Zhang
Y. Nie
Jian Chang
Jiangning Zhang
MedIm
94
13
0
14 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DVVLMMLLM
162
270
0
14 Jul 2021
Multi-Scale Label Relation Learning for Multi-Label Classification Using
  1-Dimensional Convolutional Neural Networks
Multi-Scale Label Relation Learning for Multi-Label Classification Using 1-Dimensional Convolutional Neural Networks
Junhyung Lyle Kim
Byungyoon Park
Charmgil Hong
31
0
0
13 Jul 2021
Human Attention during Goal-directed Reading Comprehension Relies on
  Task Optimization
Human Attention during Goal-directed Reading Comprehension Relies on Task Optimization
Jiajie Zou
Yuran Zhang
Jialu Li
Xing Tian
Nai Ding
AIMat
92
2
0
13 Jul 2021
Split, embed and merge: An accurate table structure recognizer
Split, embed and merge: An accurate table structure recognizer
Zhenrong Zhang
Jianshu Zhang
Jun Du
LMTD
187
62
0
12 Jul 2021
Legal Judgment Prediction with Multi-Stage CaseRepresentation Learning
  in the Real Court Setting
Legal Judgment Prediction with Multi-Stage CaseRepresentation Learning in the Real Court Setting
Luyao Ma
Yating Zhang
Tianyi Wang
Xiaozhong Liu
Wei Ye
Changlong Sun
Shikun Zhang
ELMAILaw
99
59
0
12 Jul 2021
Levels of explainable artificial intelligence for human-aligned
  conversational explanations
Levels of explainable artificial intelligence for human-aligned conversational explanations
Richard Dazeley
Peter Vamplew
Cameron Foale
Charlotte Young
Sunil Aryal
F. Cruz
65
93
0
07 Jul 2021
Controlled Caption Generation for Images Through Adversarial Attacks
Controlled Caption Generation for Images Through Adversarial Attacks
Nayyer Aafaq
Naveed Akhtar
Wei Liu
M. Shah
Ajmal Mian
AAML
59
10
0
07 Jul 2021
Self-Adversarial Training incorporating Forgery Attention for Image
  Forgery Localization
Self-Adversarial Training incorporating Forgery Attention for Image Forgery Localization
Longhao Zhuo
Shunquan Tan
Bin Li
Jiwu Huang
AAML
57
74
0
06 Jul 2021
Learning Semantic Segmentation of Large-Scale Point Clouds with Random
  Sampling
Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling
Qingyong Hu
Bo Yang
Linhai Xie
Stefano Rosa
Yulan Guo
Zhihua Wang
Niki Trigoni
Andrew Markham
3DPC
88
184
0
06 Jul 2021
RATCHET: Medical Transformer for Chest X-ray Diagnosis and Reporting
RATCHET: Medical Transformer for Chest X-ray Diagnosis and Reporting
Benjamin Hou
Georgios Kaissis
Ronald M. Summers
Bernhard Kainz
ViTLM&MAMedIm
93
53
0
05 Jul 2021
Gradient Importance Learning for Incomplete Observations
Gradient Importance Learning for Incomplete Observations
Qitong Gao
Dong Wang
Joshua D. Amason
Siyang Yuan
Chenyang Tao
Ricardo Henao
M. Hadziahmetovic
Lawrence Carin
Miroslav Pajic
50
10
0
05 Jul 2021
Audio-Oriented Multimodal Machine Comprehension: Task, Dataset and Model
Audio-Oriented Multimodal Machine Comprehension: Task, Dataset and Model
Zhiqi Huang
Fenglin Liu
Xian Wu
Shen Ge
Helin Wang
Wei Fan
Yuexian Zou
AuLLM
57
2
0
04 Jul 2021
Case Relation Transformer: A Crossmodal Language Generation Model for
  Fetching Instructions
Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions
Motonari Kambara
K. Sugiura
ViT
62
6
0
02 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
169
101
0
01 Jul 2021
VideoLightFormer: Lightweight Action Recognition using Transformers
Raivo Koot
Haiping Lu
ViT
135
6
0
01 Jul 2021
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake
  Monitoring
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring
Jianing Qiu
Frank P.-W. Lo
Xiao Gu
M. Jobarteh
Wenyan Jia
...
M. McCrory
Edward Sazonov
Mingui Sun
Gary Frost
Benny Lo
EgoV
66
19
0
01 Jul 2021
MissFormer: (In-)attention-based handling of missing observations for
  trajectory filtering and prediction
MissFormer: (In-)attention-based handling of missing observations for trajectory filtering and prediction
S. Becker
Ronny Hug
Wolfgang Hubner
Michael Arens
B. Morris
69
4
0
30 Jun 2021
Attention Aware Wavelet-based Detection of Morphed Face Images
Attention Aware Wavelet-based Detection of Morphed Face Images
Poorya Aghdaie
Baaria Chaudhary
Sobhan Soleymani
J. Dawson
Nasser M. Nasrabadi
CVBM
76
30
0
29 Jun 2021
Contrastive Semantic Similarity Learning for Image Captioning Evaluation
  with Intrinsic Auto-encoder
Contrastive Semantic Similarity Learning for Image Captioning Evaluation with Intrinsic Auto-encoder
Chao Zeng
Tiesong Zhao
Sam Kwong
94
2
0
29 Jun 2021
SALYPATH: A Deep-Based Architecture for visual attention prediction
SALYPATH: A Deep-Based Architecture for visual attention prediction
M. A. Kerkouri
Marouane Tliba
A. Chetouani
R. Harba
FAttMDE
59
9
0
29 Jun 2021
Saying the Unseen: Video Descriptions via Dialog Agents
Saying the Unseen: Video Descriptions via Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
73
6
0
26 Jun 2021
Neural Fashion Image Captioning : Accounting for Data Diversity
Neural Fashion Image Captioning : Accounting for Data Diversity
Gilles Hacheme
Nouréini Sayouti
69
13
0
23 Jun 2021
Probabilistic Attention for Interactive Segmentation
Probabilistic Attention for Interactive Segmentation
Prasad Gabbur
Manjot Bilkhu
J. Movellan
103
13
0
23 Jun 2021
Interventional Video Grounding with Dual Contrastive Learning
Interventional Video Grounding with Dual Contrastive Learning
Guoshun Nan
Rui Qiao
Yao Xiao
Jun Liu
Sicong Leng
H. Zhang
Wei Lu
98
145
0
21 Jun 2021
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation
Yixin Wang
Zihao Lin
Zhe Xu
Haoyu Dong
Jiang Tian
Jie Luo
Zhongchao Shi
Yang Zhang
Jianping Fan
Zhiqiang He
UQCVMedIm
122
12
0
21 Jun 2021
Exploring Semantic Relationships for Unpaired Image Captioning
Exploring Semantic Relationships for Unpaired Image Captioning
Fenglin Liu
Meng Gao
Tianhao Zhang
Yuexian Zou
142
7
0
20 Jun 2021
Do Encoder Representations of Generative Dialogue Models Encode
  Sufficient Information about the Task ?
Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?
Prasanna Parthasarathi
J. Pineau
Sarath Chandar
64
2
0
20 Jun 2021
Attend What You Need: Motion-Appearance Synergistic Networks for Video
  Question Answering
Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering
Ahjeong Seo
Gi-Cheon Kang
J. Park
Byoung-Tak Zhang
82
54
0
19 Jun 2021
Learning to Predict Visual Attributes in the Wild
Learning to Predict Visual Attributes in the Wild
Khoi Pham
Kushal Kafle
Zhe Lin
Zhi Ding
Scott D. Cohen
Q. Tran
Abhinav Shrivastava
54
114
0
17 Jun 2021
Semi-Autoregressive Transformer for Image Captioning
Semi-Autoregressive Transformer for Image Captioning
Yuanen Zhou
Yong Zhang
Zhenzhen Hu
Meng Wang
VLM
78
25
0
17 Jun 2021
Invertible Attention
Invertible Attention
Jiajun Zha
Yiran Zhong
Jing Zhang
Leonid Sigal
Liang Zheng
82
7
0
16 Jun 2021
Soft Attention: Does it Actually Help to Learn Social Interactions in
  Pedestrian Trajectory Prediction?
Soft Attention: Does it Actually Help to Learn Social Interactions in Pedestrian Trajectory Prediction?
L. Boucaud
Daniel Aloise
Nicolas Saunier
HAI
41
0
0
16 Jun 2021
Kernel Identification Through Transformers
Kernel Identification Through Transformers
F. Simpson
Ian Davies
V. Lalchand
A. Vullo
N. Durrande
C. Rasmussen
63
11
0
15 Jun 2021
Contrastive Attention for Automatic Chest X-ray Report Generation
Contrastive Attention for Automatic Chest X-ray Report Generation
Fenglin Liu
Changchang Yin
Xian Wu
Shen Ge
Yuexian Zou
Ping Zhang
Yuexian Zou
Xu Sun
MedIm
137
153
0
13 Jun 2021
Exploring and Distilling Posterior and Prior Knowledge for Radiology
  Report Generation
Exploring and Distilling Posterior and Prior Knowledge for Radiology Report Generation
Fenglin Liu
Xian Wu
Shen Ge
Wei Fan
Yuexian Zou
MedIm
120
262
0
13 Jun 2021
Bayesian Attention Belief Networks
Bayesian Attention Belief Networks
Shujian Zhang
Xinjie Fan
Bo Chen
Mingyuan Zhou
BDL
114
32
0
09 Jun 2021
Salient Object Ranking with Position-Preserved Attention
Salient Object Ranking with Position-Preserved Attention
Haoyang Fang
Daoxin Zhang
Yi Zhang
Minghao Chen
Jiawei Li
Yao Hu
Deng Cai
Xiaofei He
71
21
0
09 Jun 2021
Object Based Attention Through Internal Gating
Object Based Attention Through Internal Gating
Jordan Lei
Ari S. Benjamin
Konrad Paul Kording
OCL
40
4
0
08 Jun 2021
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained
  Models
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models
Chenfeng Xu
Shijia Yang
Tomer Galanti
Bichen Wu
Xiangyu Yue
Bohan Zhai
Wei Zhan
Peter Vajda
Kurt Keutzer
Masayoshi Tomizuka
3DPC
62
55
0
08 Jun 2021
Lessons learned developing and using a machine learning model to
  automatically transcribe 2.3 million handwritten occupation codes
Lessons learned developing and using a machine learning model to automatically transcribe 2.3 million handwritten occupation codes
Bjorn-Richard Pedersen
Einar J. Holsbø
Trygve Andersen
N. Shvetsov
Johan Ravn
H. Sommerseth
L. A. Bongo
AI4TS
39
6
0
07 Jun 2021
Relative Importance in Sentence Processing
Relative Importance in Sentence Processing
Nora Hollenstein
Lisa Beinborn
FAtt
82
32
0
07 Jun 2021
Adversarially Regularized Graph Attention Networks for Inductive
  Learning on Partially Labeled Graphs
Adversarially Regularized Graph Attention Networks for Inductive Learning on Partially Labeled Graphs
Jiaren Xiao
Quanyu Dai
Xiaochen Xie
J. Lam
Ka-Wai Kwok
GNN
78
7
0
07 Jun 2021
Previous
123...202122...697071
Next