ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXivPDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,515 papers shown
Title
EXplainable Neural-Symbolic Learning (X-NeSyL) methodology to fuse deep
  learning representations with expert knowledge graphs: the MonuMAI cultural
  heritage use case
EXplainable Neural-Symbolic Learning (X-NeSyL) methodology to fuse deep learning representations with expert knowledge graphs: the MonuMAI cultural heritage use case
Natalia Díaz Rodríguez
Alberto Lamas
Jules Sanchez
Gianni Franchi
Ivan Donadello
Siham Tabik
David Filliat
P. Cruz
Rosana Montes
Francisco Herrera
72
77
0
24 Apr 2021
AttWalk: Attentive Cross-Walks for Deep Mesh Analysis
AttWalk: Attentive Cross-Walks for Deep Mesh Analysis
Ran Ben Izhak
Alon Lahav
A. Tal
3DV
57
10
0
23 Apr 2021
Towards Accurate Text-based Image Captioning with Content Diversity
  Exploration
Towards Accurate Text-based Image Captioning with Content Diversity Exploration
Guanghui Xu
Shuaicheng Niu
Mingkui Tan
Yucheng Luo
Qing Du
Qi Wu
DiffM
44
56
0
23 Apr 2021
Multi-task Learning with Attention for End-to-end Autonomous Driving
Multi-task Learning with Attention for End-to-end Autonomous Driving
Keishi Ishihara
Anssi Kanervisto
J. Miura
Ville Hautamaki
53
60
0
21 Apr 2021
Discrete-continuous Action Space Policy Gradient-based Attention for
  Image-Text Matching
Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text Matching
Shiyang Yan
Li Yu
Yuan Xie
57
34
0
21 Apr 2021
Improving Weakly-supervised Object Localization via Causal Intervention
Improving Weakly-supervised Object Localization via Causal Intervention
Feifei Shao
Yawei Luo
Li Zhang
Lu Ye
Siliang Tang
Yi Yang
Jun Xiao
WSOL
30
25
0
21 Apr 2021
Review of end-to-end speech synthesis technology based on deep learning
Review of end-to-end speech synthesis technology based on deep learning
Zhaoxi Mu
Xinyu Yang
Yizhuo Dong
AuLLM
ALM
31
24
0
20 Apr 2021
Visual Navigation with Spatial Attention
Visual Navigation with Spatial Attention
Bar Mayo
Tamir Hazan
A. Tal
EgoV
34
74
0
20 Apr 2021
Attention in Attention Network for Image Super-Resolution
Attention in Attention Network for Image Super-Resolution
Haoyu Chen
Jinjin Gu
Zhi-Li Zhang
SupR
41
68
0
19 Apr 2021
Surrogate Gradient Field for Latent Space Manipulation
Surrogate Gradient Field for Latent Space Manipulation
Minjun Li
Yanghua Jin
Huachun Zhu
GAN
19
18
0
19 Apr 2021
Concadia: Towards Image-Based Text Generation with a Purpose
Concadia: Towards Image-Based Text Generation with a Purpose
Elisa Kreiss
Fei Fang
Noah D. Goodman
Christopher Potts
29
23
0
16 Apr 2021
Robust Open-Vocabulary Translation from Visual Text Representations
Robust Open-Vocabulary Translation from Visual Text Representations
Elizabeth Salesky
David Etter
Matt Post
VLM
27
40
0
16 Apr 2021
Pose Recognition with Cascade Transformers
Pose Recognition with Cascade Transformers
Ke Li
Shijie Wang
Xiang Zhang
Yifan Xu
Weijian Xu
Zhuowen Tu
ViT
43
210
0
14 Apr 2021
Autonomous Vehicles Drive into Shared Spaces: eHMI Design Concept
  Focusing on Vulnerable Road Users
Autonomous Vehicles Drive into Shared Spaces: eHMI Design Concept Focusing on Vulnerable Road Users
Yang Li
Hao Cheng
Zhe Zeng
Hailong Liu
Monika Sester
33
28
0
14 Apr 2021
Revisiting the Onsets and Frames Model with Additive Attention
Revisiting the Onsets and Frames Model with Additive Attention
K. Cheuk
Yin-Jyun Luo
Emmanouil Benetos
Dorien Herremans
12
20
0
14 Apr 2021
Co-Scale Conv-Attentional Image Transformers
Co-Scale Conv-Attentional Image Transformers
Weijian Xu
Yifan Xu
Tyler A. Chang
Zhuowen Tu
ViT
33
374
0
13 Apr 2021
A State-of-the-art Survey of Artificial Neural Networks for Whole-slide
  Image Analysis:from Popular Convolutional Neural Networks to Potential Visual
  Transformers
A State-of-the-art Survey of Artificial Neural Networks for Whole-slide Image Analysis:from Popular Convolutional Neural Networks to Potential Visual Transformers
Xintong Li
Xirong Li
Chen Li
M. Rahaman
Jian Wu
Xiaoqi Li
Yudong Yao
M. Grzegorzek
ViT
MedIm
43
44
0
13 Apr 2021
Automatic Generation of Descriptive Titles for Video Clips Using Deep
  Learning
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning
Soheyla Amirian
Khaled Rasheed
T. Taha
H. Arabnia
VLM
VGen
21
23
0
07 Apr 2021
Differentiable Patch Selection for Image Recognition
Differentiable Patch Selection for Image Recognition
Jean-Baptiste Cordonnier
Aravindh Mahendran
Alexey Dosovitskiy
Dirk Weissenborn
Jakob Uszkoreit
Thomas Unterthiner
38
94
0
07 Apr 2021
Multimodal Continuous Visual Attention Mechanisms
Multimodal Continuous Visual Attention Mechanisms
António Farinhas
André F. T. Martins
P. Aguiar
22
7
0
07 Apr 2021
Compressing Visual-linguistic Model via Knowledge Distillation
Compressing Visual-linguistic Model via Knowledge Distillation
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lijuan Wang
Yezhou Yang
Zicheng Liu
VLM
46
99
0
05 Apr 2021
FixMyPose: Pose Correctional Captioning and Retrieval
FixMyPose: Pose Correctional Captioning and Retrieval
Hyounghun Kim
Abhaysinh Zala
Graham Burri
Joey Tianyi Zhou
36
16
0
04 Apr 2021
Influencing Reinforcement Learning through Natural Language Guidance
Influencing Reinforcement Learning through Natural Language Guidance
Tasmia Tasrin
Md Sultan al Nahian
Habarakadage Perera
Brent Harrison
16
6
0
04 Apr 2021
M3L: Language-based Video Editing via Multi-Modal Multi-Level
  Transformers
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers
Tsu-Jui Fu
Xinze Wang
Scott T. Grafton
Miguel P. Eckstein
Wenjie Wang
52
9
0
02 Apr 2021
The Spatially-Correlative Loss for Various Image Translation Tasks
The Spatially-Correlative Loss for Various Image Translation Tasks
Chuanxia Zheng
Tat-Jen Cham
Jianfei Cai
32
117
0
02 Apr 2021
Towards General Purpose Vision Systems
Towards General Purpose Vision Systems
Tanmay Gupta
Amita Kamath
Aniruddha Kembhavi
Derek Hoiem
20
50
0
01 Apr 2021
DF^2AM: Dual-level Feature Fusion and Affinity Modeling for RGB-Infrared
  Cross-modality Person Re-identification
DF^2AM: Dual-level Feature Fusion and Affinity Modeling for RGB-Infrared Cross-modality Person Re-identification
Junhui Yin
Zhanyu Ma
Jiyang Xie
Shibo Nie
Kongming Liang
Jun Guo
38
2
0
01 Apr 2021
Qualitative Planning in Imperfect Information Games with Active Sensing
  and Reactive Sensor Attacks: Cost of Unawareness
Qualitative Planning in Imperfect Information Games with Active Sensing and Reactive Sensor Attacks: Cost of Unawareness
A. Kulkarni
Shuo Han
Nandi O. Leslie
Charles A. Kamhoua
Jie Fu
19
3
0
01 Apr 2021
NetAdaptV2: Efficient Neural Architecture Search with Fast Super-Network
  Training and Architecture Optimization
NetAdaptV2: Efficient Neural Architecture Search with Fast Super-Network Training and Architecture Optimization
Tien-Ju Yang
Yi-Lun Liao
Vivienne Sze
35
55
0
31 Mar 2021
FANet: A Feedback Attention Network for Improved Biomedical Image
  Segmentation
FANet: A Feedback Attention Network for Improved Biomedical Image Segmentation
Nikhil Kumar Tomar
Debesh Jha
Michael A. Riegler
Haavard D. Johansen
Dag Johansen
J. Rittscher
Pål Halvorsen
Sharib Ali
MedIm
25
149
0
31 Mar 2021
Data in context: How digital transformation can support human reasoning
  in cyber-physical production systems
Data in context: How digital transformation can support human reasoning in cyber-physical production systems
Romy Müller
F. Kessler
David W. Humphrey
Julian Rahm
13
7
0
31 Mar 2021
Channel-Based Attention for LCC Using Sentinel-2 Time Series
Channel-Based Attention for LCC Using Sentinel-2 Time Series
Hermann Courteille
A. Benoît
N. Méger
A. Atto
Dino Ienco
AI4TS
19
1
0
31 Mar 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
40
176
0
31 Mar 2021
Dual Contrastive Loss and Attention for GANs
Dual Contrastive Loss and Attention for GANs
Ning Yu
Guilin Liu
Aysegül Dündar
Andrew Tao
Bryan Catanzaro
Larry S. Davis
Mario Fritz
GAN
45
60
0
31 Mar 2021
A study of latent monotonic attention variants
A study of latent monotonic attention variants
Albert Zeyer
Ralf Schluter
Hermann Ney
53
5
0
30 Mar 2021
Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
Mingchen Zhuge
D. Gao
Deng-Ping Fan
Linbo Jin
Ben Chen
Hao Zhou
Minghui Qiu
Ling Shao
VLM
35
120
0
30 Mar 2021
Self-supervised Image-text Pre-training With Mixed Data In Chest X-rays
Self-supervised Image-text Pre-training With Mixed Data In Chest X-rays
Xiaosong Wang
Ziyue Xu
Leo K. Tam
Dong Yang
Daguang Xu
ViT
MedIm
25
23
0
30 Mar 2021
Embedding API Dependency Graph for Neural Code Generation
Embedding API Dependency Graph for Neural Code Generation
Chen Lyu
Ruyun Wang
Hongyu Zhang
Hanwen Zhang
Songlin Hu
GNN
31
20
0
29 Mar 2021
SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D
  Sequences
SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences
Shun-cheng Wu
Johanna Wald
Keisuke Tateno
Nassir Navab
Federico Tombari
3DPC
25
156
0
27 Mar 2021
Dodrio: Exploring Transformer Models with Interactive Visualization
Dodrio: Exploring Transformer Models with Interactive Visualization
Zijie J. Wang
Robert Turko
Duen Horng Chau
45
35
0
26 Mar 2021
Understanding Robustness of Transformers for Image Classification
Understanding Robustness of Transformers for Image Classification
Srinadh Bhojanapalli
Ayan Chakrabarti
Daniel Glasner
Daliang Li
Thomas Unterthiner
Andreas Veit
ViT
25
380
0
26 Mar 2021
Deep EHR Spotlight: a Framework and Mechanism to Highlight Events in
  Electronic Health Records for Explainable Predictions
Deep EHR Spotlight: a Framework and Mechanism to Highlight Events in Electronic Health Records for Explainable Predictions
Thanh Nguyen-Duc
N. Mulligan
G. Mannu
Joao H. Bettencourt-Silva
BDL
14
6
0
25 Mar 2021
Describing and Localizing Multiple Changes with Transformers
Describing and Localizing Multiple Changes with Transformers
Yue Qiu
Shintaro Yamamoto
Kodai Nakashima
Ryota Suzuki
K. Iwata
Hirokatsu Kataoka
Y. Satoh
35
56
0
25 Mar 2021
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent
  Forecasting
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting
Ye Yuan
Xinshuo Weng
Yanglan Ou
Kris Kitani
AI4TS
50
443
0
25 Mar 2021
More Photos are All You Need: Semi-Supervised Learning for Fine-Grained
  Sketch Based Image Retrieval
More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval
A. Bhunia
Pinaki Nath Chowdhury
Aneeshan Sain
Yongxin Yang
Tao Xiang
Yi-Zhe Song
GAN
SSL
31
61
0
25 Mar 2021
VLGrammar: Grounded Grammar Induction of Vision and Language
VLGrammar: Grounded Grammar Induction of Vision and Language
Yining Hong
Qing Li
Song-Chun Zhu
Siyuan Huang
VLM
41
25
0
24 Mar 2021
Scaling Local Self-Attention for Parameter Efficient Visual Backbones
Scaling Local Self-Attention for Parameter Efficient Visual Backbones
Ashish Vaswani
Prajit Ramachandran
A. Srinivas
Niki Parmar
Blake A. Hechtman
Jonathon Shlens
32
396
0
23 Mar 2021
SelfExplain: A Self-Explaining Architecture for Neural Text Classifiers
SelfExplain: A Self-Explaining Architecture for Neural Text Classifiers
Dheeraj Rajagopal
Vidhisha Balachandran
Eduard H. Hovy
Yulia Tsvetkov
MILM
SSL
FAtt
AI4TS
34
66
0
23 Mar 2021
Human-like Controllable Image Captioning with Verb-specific Semantic
  Roles
Human-like Controllable Image Captioning with Verb-specific Semantic Roles
Long Chen
Zhihong Jiang
Jun Xiao
Wei Liu
44
75
0
22 Mar 2021
Handling Missing Observations with an RNN-based Prediction-Update Cycle
Handling Missing Observations with an RNN-based Prediction-Update Cycle
S. Becker
Ronny Hug
Wolfgang Hubner
Michael Arens
B. Morris
28
1
0
22 Mar 2021
Previous
123...222324...697071
Next