ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
A Systematic Approach to Featurization for Cancer Drug Sensitivity
  Predictions with Deep Learning
A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep Learning
Austin R. Clyde
Thomas Brettin
A. Partin
Maulik Shaulik
H. Yoo
Yvonne A. Evrard
Yitan Zhu
Fangfang Xia
Rick L. Stevens
124
7
0
30 Apr 2020
Towards Embodied Scene Description
Towards Embodied Scene Description
Sinan Tan
Huaping Liu
Di Guo
Xinyu Zhang
F. Sun
LM&Ro
54
9
0
30 Apr 2020
memeBot: Towards Automatic Image Meme Generation
memeBot: Towards Automatic Image Meme Generation
Aadhavan Sadasivam
K. Gunasekar
H. Davulcu
Yezhou Yang
42
10
0
30 Apr 2020
WT5?! Training Text-to-Text Models to Explain their Predictions
WT5?! Training Text-to-Text Models to Explain their Predictions
Sharan Narang
Colin Raffel
Katherine Lee
Adam Roberts
Noah Fiedel
Karishma Malkan
102
201
0
30 Apr 2020
Explainable Deep Learning: A Field Guide for the Uninitiated
Explainable Deep Learning: A Field Guide for the Uninitiated
Gabrielle Ras
Ning Xie
Marcel van Gerven
Derek Doran
AAMLXAI
122
382
0
30 Apr 2020
Image Captioning through Image Transformer
Image Captioning through Image Transformer
Sen He
Wentong Liao
Hamed R. Tavakoli
M. Yang
Bodo Rosenhahn
N. Pugeault
ViT
97
94
0
29 Apr 2020
Valid Explanations for Learning to Rank Models
Valid Explanations for Learning to Rank Models
Jaspreet Singh
Zhenye Wang
Megha Khosla
Avishek Anand
LRMFAtt
43
8
0
29 Apr 2020
The Explanation Game: Towards Prediction Explainability through Sparse
  Communication
The Explanation Game: Towards Prediction Explainability through Sparse Communication
Marcos Vinícius Treviso
André F. T. Martins
FAtt
72
3
0
28 Apr 2020
Exploring Self-attention for Image Recognition
Exploring Self-attention for Image Recognition
Hengshuang Zhao
Jiaya Jia
V. Koltun
SSL
105
792
0
28 Apr 2020
Local Lipschitz Bounds of Deep Neural Networks
Local Lipschitz Bounds of Deep Neural Networks
Calypso Herrera
Florian Krach
Josef Teichmann
40
3
0
27 Apr 2020
Self-Supervised Attention Learning for Depth and Ego-motion Estimation
Self-Supervised Attention Learning for Depth and Ego-motion Estimation
Assem Sadek
Boris Chidlovskii
MDE
77
6
0
27 Apr 2020
Sequential Interpretability: Methods, Applications, and Future Direction
  for Understanding Deep Learning Models in the Context of Sequential Data
Sequential Interpretability: Methods, Applications, and Future Direction for Understanding Deep Learning Models in the Context of Sequential Data
B. Shickel
Parisa Rashidi
AI4TS
72
18
0
27 Apr 2020
Attention Based Real Image Restoration
Attention Based Real Image Restoration
Saeed Anwar
Nick Barnes
L. Petersson
63
0
0
26 Apr 2020
Show, Describe and Conclude: On Exploiting the Structure Information of
  Chest X-Ray Reports
Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports
Baoyu Jing
Zeya Wang
Eric Xing
107
142
0
26 Apr 2020
Quantifying the Contextualization of Word Representations with Semantic
  Class Probing
Quantifying the Contextualization of Word Representations with Semantic Class Probing
Mengjie Zhao
Philipp Dufter
Yadollah Yaghoobzadeh
Hinrich Schütze
83
27
0
25 Apr 2020
Detective: An Attentive Recurrent Model for Sparse Object Detection
Detective: An Attentive Recurrent Model for Sparse Object Detection
A. Kechaou
Manuel Martínez
Monica Haurilet
Rainer Stiefelhagen
ObjD
39
3
0
25 Apr 2020
Deep Multimodal Neural Architecture Search
Deep Multimodal Neural Architecture Search
Zhou Yu
Yuhao Cui
Jun-chen Yu
Meng Wang
Dacheng Tao
Qi Tian
77
100
0
25 Apr 2020
The Variational Bandwidth Bottleneck: Stochastic Evaluation on an
  Information Budget
The Variational Bandwidth Bottleneck: Stochastic Evaluation on an Information Budget
Anirudh Goyal
Yoshua Bengio
M. Botvinick
Sergey Levine
70
24
0
24 Apr 2020
Survey on Visual Sentiment Analysis
Survey on Visual Sentiment Analysis
A. Ortis
G. Farinella
Sebastiano Battiato
45
77
0
24 Apr 2020
Why an Android App is Classified as Malware? Towards Malware
  Classification Interpretation
Why an Android App is Classified as Malware? Towards Malware Classification Interpretation
Bozhi Wu
Sen Chen
Cuiyun Gao
Lingling Fan
Yang Liu
W. Wen
Michael R. Lyu
107
59
0
24 Apr 2020
Efficient Neural Architecture for Text-to-Image Synthesis
Efficient Neural Architecture for Text-to-Image Synthesis
Douglas M. Souza
Jonatas Wehrmann
D. Ruiz
53
24
0
23 Apr 2020
Visual Question Answering Using Semantic Information from Image
  Descriptions
Visual Question Answering Using Semantic Information from Image Descriptions
Tasmia Tasrin
Md Sultan al Nahian
Brent Harrison
32
0
0
23 Apr 2020
Textual Visual Semantic Dataset for Text Spotting
Textual Visual Semantic Dataset for Text Spotting
Ahmed Sabir
Francesc Moreno-Noguer
Lluís Padró
43
3
0
21 Apr 2020
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual
  CNNs
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs
Shiyang Yan
Yang Hua
N. Robertson
81
7
0
21 Apr 2020
Transform and Tell: Entity-Aware News Image Captioning
Transform and Tell: Entity-Aware News Image Captioning
Alasdair Tran
A. Mathews
Lexing Xie
VLM
60
97
0
17 Apr 2020
Knowledge-Based Visual Question Answering in Videos
Knowledge-Based Visual Question Answering in Videos
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
23
0
0
17 Apr 2020
Multiple Visual-Semantic Embedding for Video Retrieval from Query
  Sentence
Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence
Huy Manh Nguyen
Tomo Miyazaki
Yoshihiro Sugaya
S. Omachi
149
1
0
16 Apr 2020
Top-Down Networks: A coarse-to-fine reimagination of CNNs
Top-Down Networks: A coarse-to-fine reimagination of CNNs
Ioannis Lelekas
Nergis Tomen
S. Pintea
Jan van Gemert
29
6
0
16 Apr 2020
Destination Prediction Based on Partial Trajectory Data
Destination Prediction Based on Partial Trajectory Data
Patrick Ebel
Ibrahim Emre Göl
Christoph Lingenfelder
Andreas Vogelsang
41
33
0
16 Apr 2020
Hybrid Attention Networks for Flow and Pressure Forecasting in Water
  Distribution Systems
Hybrid Attention Networks for Flow and Pressure Forecasting in Water Distribution Systems
Ziqing Ma
Shuming Liu
Guancheng Guo
Xipeng Yu
AI4TS
15
4
0
13 Apr 2020
Sequential Weakly Labeled Multi-Activity Localization and Recognition on
  Wearable Sensors using Recurrent Attention Networks
Sequential Weakly Labeled Multi-Activity Localization and Recognition on Wearable Sensors using Recurrent Attention Networks
Kun Wang
Jun He
Lefei Zhang
HAI
50
39
0
13 Apr 2020
Attend and Decode: 4D fMRI Task State Decoding Using Attention Models
Attend and Decode: 4D fMRI Task State Decoding Using Attention Models
Sam Nguyen
Brenda Ng
Alan Kaplan
Priyadip Ray
63
25
0
10 Apr 2020
S2A: Wasserstein GAN with Spatio-Spectral Laplacian Attention for
  Multi-Spectral Band Synthesis
S2A: Wasserstein GAN with Spatio-Spectral Laplacian Attention for Multi-Spectral Band Synthesis
Litu Rout
Indranil Misra
Manthira Moorthi Subbiah
D. Dhar
53
7
0
08 Apr 2020
Survey for Trust-aware Recommender Systems: A Deep Learning Perspective
Survey for Trust-aware Recommender Systems: A Deep Learning Perspective
Manqing Dong
Feng Yuan
Lina Yao
Xianzhi Wang
Xiwei Xu
Liming Zhu
73
8
0
08 Apr 2020
e-SNLI-VE: Corrected Visual-Textual Entailment with Natural Language
  Explanations
e-SNLI-VE: Corrected Visual-Textual Entailment with Natural Language Explanations
Virginie Do
Oana-Maria Camburu
Zeynep Akata
Thomas Lukasiewicz
LRM
99
30
0
07 Apr 2020
Context-Aware Group Captioning via Self-Attention and Contrastive
  Features
Context-Aware Group Captioning via Self-Attention and Contrastive Features
Zhuowan Li
Quan Hung Tran
Long Mai
Zhe Lin
Alan Yuille
VLM
81
44
0
07 Apr 2020
Hierarchical Opacity Propagation for Image Matting
Hierarchical Opacity Propagation for Image Matting
Yaoyi Li
Qin Xu
Hongtao Lu
71
13
0
07 Apr 2020
Deep Attentive Generative Adversarial Network for Photo-Realistic Image
  De-Quantization
Deep Attentive Generative Adversarial Network for Photo-Realistic Image De-Quantization
Yang Zhang
Changhui Hu
Xiaobo Lu
GAN
52
1
0
07 Apr 2020
Scenario-Transferable Semantic Graph Reasoning for Interaction-Aware
  Probabilistic Prediction
Scenario-Transferable Semantic Graph Reasoning for Interaction-Aware Probabilistic Prediction
Yeping Hu
Wei Zhan
Masayoshi Tomizuka
141
38
0
07 Apr 2020
Character-level Japanese Text Generation with Attention Mechanism for
  Chest Radiography Diagnosis
Character-level Japanese Text Generation with Attention Mechanism for Chest Radiography Diagnosis
Kenya Sakka
Kotaro Nakayama
Nisei Kimura
Taiki Inoue
Yusuke Iwasawa
Ryohei Yamaguchi
Yosimasa Kawazoe
K. Ohe
Y. Matsuo
16
2
0
06 Apr 2020
Guiding Monocular Depth Estimation Using Depth-Attention Volume
Guiding Monocular Depth Estimation Using Depth-Attention Volume
Lam Huynh
Phong Nguyen-Ha
Jirí Matas
Esa Rahtu
J. Heikkilä
MDE
97
156
0
06 Apr 2020
Sub-Instruction Aware Vision-and-Language Navigation
Sub-Instruction Aware Vision-and-Language Navigation
Yicong Hong
Cristian Rodriguez-Opazo
Qi Wu
Stephen Gould
136
72
0
06 Apr 2020
B-SCST: Bayesian Self-Critical Sequence Training for Image Captioning
B-SCST: Bayesian Self-Critical Sequence Training for Image Captioning
Shashank Bujimalla
Mahesh Subedar
Omesh Tickoo
BDLUQCV
37
10
0
06 Apr 2020
Iterative Context-Aware Graph Inference for Visual Dialog
Iterative Context-Aware Graph Inference for Visual Dialog
Dan Guo
Haibo Wang
Hanwang Zhang
Zhengjun Zha
Meng Wang
89
49
0
05 Apr 2020
Towards Relevance and Sequence Modeling in Language Recognition
Towards Relevance and Sequence Modeling in Language Recognition
Bharat Padi
Anand Mohan
Sriram Ganapathy
27
15
0
02 Apr 2020
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal
  Transformers
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers
Zhicheng Huang
Zhaoyang Zeng
Bei Liu
Dongmei Fu
Jianlong Fu
ViT
214
440
0
02 Apr 2020
More Grounded Image Captioning by Distilling Image-Text Matching Model
More Grounded Image Captioning by Distilling Image-Text Matching Model
Yuanen Zhou
Meng Wang
Daqing Liu
Zhenzhen Hu
Hanwang Zhang
101
126
0
01 Apr 2020
X-Linear Attention Networks for Image Captioning
X-Linear Attention Networks for Image Captioning
Yingwei Pan
Ting Yao
Yehao Li
Tao Mei
146
519
0
31 Mar 2020
Modulating Bottom-Up and Top-Down Visual Processing via
  Language-Conditional Filters
Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters
.Ilker Kesen
Ozan Arkan Can
Erkut Erdem
Aykut Erdem
Deniz Yuret
VLM
64
1
0
28 Mar 2020
Actor-Transformers for Group Activity Recognition
Actor-Transformers for Group Activity Recognition
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
73
182
0
28 Mar 2020
Previous
123...323334...697071
Next