ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Multimodal Transformer with Multi-View Visual Representation for Image
  Captioning
Multimodal Transformer with Multi-View Visual Representation for Image Captioning
Jun-chen Yu
Jing Li
Zhou Yu
Qingming Huang
ViT
70
387
0
20 May 2019
Conversion Prediction Using Multi-task Conditional Attention Networks to
  Support the Creation of Effective Ad Creative
Conversion Prediction Using Multi-task Conditional Attention Networks to Support the Creation of Effective Ad Creative
Shunsuke Kitada
Hitoshi Iyatomi
Yoshifumi Seki
26
8
0
17 May 2019
Deep Unified Multimodal Embeddings for Understanding both Content and
  Users in Social Media Networks
Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks
Karan Sikka
Lucas Van Bramer
Ajay Divakaran
94
2
0
17 May 2019
Inductive Guided Filter: Real-time Deep Image Matting with Weakly
  Annotated Masks on Mobile Devices
Inductive Guided Filter: Real-time Deep Image Matting with Weakly Annotated Masks on Mobile Devices
Yaoyi Li
Jianfu Zhang
Weijie Zhao
Hongtao Lu
46
5
0
16 May 2019
Incorporating Sememes into Chinese Definition Modeling
Incorporating Sememes into Chinese Definition Modeling
Liner Yang
Cunliang Kong
Yun Chen
Yang Liu
Qinan Fan
Erhong Yang
61
31
0
16 May 2019
Exact Hard Monotonic Attention for Character-Level Transduction
Exact Hard Monotonic Attention for Character-Level Transduction
Shijie Wu
Ryan Cotterell
71
60
0
15 May 2019
Embeddings and Representation Learning for Structured Data
Embeddings and Representation Learning for Structured Data
Benjamin Paassen
Claudio Gallicchio
Alessio Micheli
A. Sperduti
53
7
0
15 May 2019
Sparse Sequence-to-Sequence Models
Sparse Sequence-to-Sequence Models
Ben Peters
Vlad Niculae
André F. T. Martins
TPM
219
215
0
14 May 2019
A human-inspired recognition system for premodern Japanese historical
  documents
A human-inspired recognition system for premodern Japanese historical documents
A. D. Le
Tarin Clanuwat
A. Kitamoto
AI4TS
106
14
0
14 May 2019
Hierarchically Structured Meta-learning
Hierarchically Structured Meta-learning
Huaxiu Yao
Ying Wei
Junzhou Huang
Z. Li
80
205
0
13 May 2019
Federated Multi-task Hierarchical Attention Model for Sensor Analytics
Federated Multi-task Hierarchical Attention Model for Sensor Analytics
Yujing Chen
Yue Ning
Zheng Chai
Huzefa Rangwala
54
6
0
13 May 2019
What Clinicians Want: Contextualizing Explainable Machine Learning for
  Clinical End Use
What Clinicians Want: Contextualizing Explainable Machine Learning for Clinical End Use
S. Tonekaboni
Shalmali Joshi
M. Mccradden
Anna Goldenberg
106
403
0
13 May 2019
Object Detection in 20 Years: A Survey
Object Detection in 20 Years: A Survey
Zhengxia Zou
Keyan Chen
Zhenwei Shi
Yuhong Guo
Jieping Ye
VLMObjDAI4TS
169
2,418
0
13 May 2019
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards
Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Shangtong Zhang
Andrzej Wojcicki
Mai Xu
LRM
87
15
0
12 May 2019
Follow the Attention: Combining Partial Pose and Object Motion for
  Fine-Grained Action Detection
Follow the Attention: Combining Partial Pose and Object Motion for Fine-Grained Action Detection
M. M. K. Moghaddam
Ehsan Abbasnejad
Javen Qinfeng Shi
51
2
0
11 May 2019
Few-Shot Learning with Embedded Class Models and Shot-Free Meta Training
Few-Shot Learning with Embedded Class Models and Shot-Free Meta Training
Avinash Ravichandran
Rahul Bhotika
Stefano Soatto
81
170
0
10 May 2019
Exact Adversarial Attack to Image Captioning via Structured Output
  Learning with Latent Variables
Exact Adversarial Attack to Image Captioning via Structured Output Learning with Latent Variables
Yan Xu
Baoyuan Wu
Fumin Shen
Yanbo Fan
Yong Zhang
Heng Tao Shen
Wei Liu
AAML
80
56
0
10 May 2019
Memory-Attended Recurrent Network for Video Captioning
Memory-Attended Recurrent Network for Video Captioning
Wenjie Pei
Jiyuan Zhang
Xiangrong Wang
Lei Ke
Xiaoyong Shen
Yu-Wing Tai
111
204
0
10 May 2019
Embedding Human Knowledge into Deep Neural Network via Attention Map
Embedding Human Knowledge into Deep Neural Network via Attention Map
Masahiro Mitsuhara
Hiroshi Fukui
Yusuke Sakashita
Takanori Ogata
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
102
73
0
09 May 2019
Multimodal Semantic Attention Network for Video Captioning
Multimodal Semantic Attention Network for Video Captioning
Liang Sun
Bing Li
Chunfen Yuan
Zhengjun Zha
Weiming Hu
62
11
0
08 May 2019
ShapeGlot: Learning Language for Shape Differentiation
ShapeGlot: Learning Language for Shape Differentiation
Panos Achlioptas
Judy Fan
Robert D. Hawkins
Noah D. Goodman
Leonidas Guibas
132
83
0
08 May 2019
Frame-Recurrent Video Inpainting by Robust Optical Flow Inference
Frame-Recurrent Video Inpainting by Robust Optical Flow Inference
Yifan Ding
Chuan Wang
Haibin Huang
Jiaming Liu
Jue Wang
Liqiang Wang
58
12
0
08 May 2019
Object Exchangeability in Reinforcement Learning: Extended Abstract
Object Exchangeability in Reinforcement Learning: Extended Abstract
John Mern
Dorsa Sadigh
Mykel Kochenderfer
OCL
51
1
0
07 May 2019
Conditional Generative Neural System for Probabilistic Trajectory
  Prediction
Conditional Generative Neural System for Probabilistic Trajectory Prediction
Jiachen Li
Hengbo Ma
Masayoshi Tomizuka
102
176
0
05 May 2019
Face Hallucination by Attentive Sequence Optimization with Reinforcement
  Learning
Face Hallucination by Attentive Sequence Optimization with Reinforcement Learning
Yukai Shi
Guanbin Li
Qingxing Cao
Keze Wang
Liang Lin
CVBMSupR
68
32
0
04 May 2019
DeepSignals: Predicting Intent of Drivers Through Visual Signals
DeepSignals: Predicting Intent of Drivers Through Visual Signals
Davi Frossard
Eric Kee
R. Urtasun
ViT
38
17
0
03 May 2019
Processing Megapixel Images with Deep Attention-Sampling Models
Processing Megapixel Images with Deep Attention-Sampling Models
Angelos Katharopoulos
Franccois Fleuret
87
65
0
03 May 2019
Weight Map Layer for Noise and Adversarial Attack Robustness
Weight Map Layer for Noise and Adversarial Attack Robustness
Mohammed Amer
Tomás Maul
99
4
0
02 May 2019
Signed Distance-based Deep Memory Recommender
Signed Distance-based Deep Memory Recommender
Thanh-Binh Tran
Xinyue Liu
Kyumin Lee
Xiangnan Kong
FedMLHAI
62
20
0
01 May 2019
PR Product: A Substitute for Inner Product in Neural Networks
PR Product: A Substitute for Inner Product in Neural Networks
Zhennan Wang
Wenbin Zou
Chen Xu
50
6
0
30 Apr 2019
A scalable saliency-based Feature selection method with instance level
  information
A scalable saliency-based Feature selection method with instance level information
Brais Cancela
V. Bolón-Canedo
Amparo Alonso-Betanzos
João Gama
FAtt
62
13
0
30 Apr 2019
A self-attention based deep learning method for lesion attribute
  detection from CT reports
A self-attention based deep learning method for lesion attribute detection from CT reports
Yifan Peng
Ke Yan
V. Sandfort
Ronald M. Summers
Zhiyong Lu
MedIm
42
18
0
30 Apr 2019
Relational Collaborative Filtering:Modeling Multiple Item Relations for
  Recommendation
Relational Collaborative Filtering:Modeling Multiple Item Relations for Recommendation
Xin Xin
Xiangnan He
Yongfeng Zhang
Yongdong Zhang
J. Jose
89
167
0
29 Apr 2019
Human-Centered Emotion Recognition in Animated GIFs
Human-Centered Emotion Recognition in Animated GIFs
Zhengyuan Yang
Yixuan Zhang
Jiebo Luo
54
22
0
27 Apr 2019
Using Context Information to Enhance Simple Question Answering
Using Context Information to Enhance Simple Question Answering
Lin Li
Mengjing Zhang
Zhaohui Chao
Jianwen Xiang
33
11
0
27 Apr 2019
Knowing When to Stop: Evaluation and Verification of Conformity to
  Output-size Specifications
Knowing When to Stop: Evaluation and Verification of Conformity to Output-size Specifications
Chenglong Wang
Rudy Bunel
Krishnamurthy Dvijotham
Po-Sen Huang
Edward Grefenstette
Pushmeet Kohli
60
5
0
26 Apr 2019
Evaluating Recurrent Neural Network Explanations
Evaluating Recurrent Neural Network Explanations
L. Arras
Ahmed Osman
K. Müller
Wojciech Samek
XAIFAtt
117
88
0
26 Apr 2019
Box-driven Class-wise Region Masking and Filling Rate Guided Loss for
  Weakly Supervised Semantic Segmentation
Box-driven Class-wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation
Chunfeng Song
Yan Huang
Wanli Ouyang
Liang Wang
118
218
0
26 Apr 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering
TVQA+: Spatio-Temporal Grounding for Video Question Answering
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
83
230
0
25 Apr 2019
Pointing Novel Objects in Image Captioning
Pointing Novel Objects in Image Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
93
70
0
25 Apr 2019
Attention-based Transfer Learning for Brain-computer Interface
Attention-based Transfer Learning for Brain-computer Interface
Chuanqi Tan
F. Sun
Tao Kong
Bin Fang
Wenchang Zhang
OOD
45
9
0
25 Apr 2019
HAR-Net: Joint Learning of Hybrid Attention for Single-stage Object
  Detection
HAR-Net: Joint Learning of Hybrid Attention for Single-stage Object Detection
Yali Li
Shengjin Wang
74
35
0
25 Apr 2019
A Self-Attentive Emotion Recognition Network
A Self-Attentive Emotion Recognition Network
Harris Partaourides
Kostantinos Papadamou
N. Kourtellis
Ilias Leontiadis
S. Chatzis
31
7
0
24 Apr 2019
Generating Token-Level Explanations for Natural Language Inference
Generating Token-Level Explanations for Natural Language Inference
James Thorne
Andreas Vlachos
Christos Christodoulopoulos
Arpit Mittal
LRM
95
57
0
24 Apr 2019
Latent Variable Algorithms for Multimodal Learning and Sensor Fusion
Latent Variable Algorithms for Multimodal Learning and Sensor Fusion
Lijiang Guo
DRL
31
1
0
23 Apr 2019
Interpretable and Generalizable Person Re-Identification with
  Query-Adaptive Convolution and Temporal Lifting
Interpretable and Generalizable Person Re-Identification with Query-Adaptive Convolution and Temporal Lifting
Tianran Ouyang
Ling Shao
OOD
51
8
0
23 Apr 2019
End-to-End Spoken Language Translation
End-to-End Spoken Language Translation
Michelle Guo
Albert Haque
Prateek Verma
58
8
0
23 Apr 2019
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
Rami Al-Rfou
Dustin Zelle
Bryan Perozzi
57
57
0
21 Apr 2019
3G structure for image caption generation
3G structure for image caption generation
Aihong Yuan
Xuelong Li
Xiaoqiang Lu
38
34
0
21 Apr 2019
Compression and Localization in Reinforcement Learning for ATARI Games
Compression and Localization in Reinforcement Learning for ATARI Games
Joel Ruben Antony Moniz
Barun Patra
Sarthak Garg
AI4CE
51
2
0
20 Apr 2019
Previous
123...424344...697071
Next