ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Beyond Part Models: Person Retrieval with Refined Part Pooling (and a
  Strong Convolutional Baseline)
Beyond Part Models: Person Retrieval with Refined Part Pooling (and a Strong Convolutional Baseline)
Yifan Sun
Liang Zheng
Yi Yang
Q. Tian
Shengjin Wang
184
2,186
0
26 Nov 2017
HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal
  Retrieval
HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal Retrieval
Xi Zhang
Siyu Zhou
Jiashi Feng
Hanjiang Lai
Bo Li
Yan Pan
Jian Yin
Bo An
GAN
54
55
0
26 Nov 2017
Convolutional Image Captioning
Convolutional Image Captioning
J. Aneja
Aditya Deshpande
Alex Schwing
VLM
142
362
0
24 Nov 2017
Attended End-to-end Architecture for Age Estimation from Facial
  Expression Videos
Attended End-to-end Architecture for Age Estimation from Facial Expression Videos
Wenjie Pei
H. Dibeklioğlu
T. Baltrušaitis
David Tax
CVBM
49
42
0
23 Nov 2017
Self-view Grounding Given a Narrated 360° Video
Self-view Grounding Given a Narrated 360° Video
Shih-Han Chou
Yi-Chun Chen
Kuo-Hao Zeng
Hou-Ning Hu
Jianlong Fu
Min Sun
31
4
0
23 Nov 2017
Conditional Image-Text Embedding Networks
Conditional Image-Text Embedding Networks
Bryan A. Plummer
Paige Kordas
M. Kiapour
Shuai Zheng
Robinson Piramuthu
Svetlana Lazebnik
119
118
0
22 Nov 2017
Multi-Level Recurrent Residual Networks for Action Recognition
Multi-Level Recurrent Residual Networks for Action Recognition
Zhenxing Zheng
Gaoyun An
Q. Ruan
41
12
0
22 Nov 2017
On the Automatic Generation of Medical Imaging Reports
On the Automatic Generation of Medical Imaging Reports
Baoyu Jing
P. Xie
Eric Xing
MedIm
101
516
0
22 Nov 2017
Identifying Most Walkable Direction for Navigation in an Outdoor
  Environment
Identifying Most Walkable Direction for Navigation in an Outdoor Environment
Sachin Mehta
Hannaneh Hajishirzi
Linda G. Shapiro
55
9
0
21 Nov 2017
Using stochastic computation graphs formalism for optimization of
  sequence-to-sequence model
Using stochastic computation graphs formalism for optimization of sequence-to-sequence model
Eugene Golikov
Vlad Zhukov
M. Kretov
46
0
0
21 Nov 2017
Diverse and Accurate Image Description Using a Variational Auto-Encoder
  with an Additive Gaussian Encoding Space
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
Liwei Wang
Alex Schwing
Svetlana Lazebnik
CoGe
116
175
0
19 Nov 2017
Excitation Backprop for RNNs
Excitation Backprop for RNNs
Sarah Adel Bargal
Andrea Zunino
Donghyun Kim
Jianming Zhang
Vittorio Murino
Stan Sclaroff
175
48
0
18 Nov 2017
ATRank: An Attention-Based User Behavior Modeling Framework for
  Recommendation
ATRank: An Attention-Based User Behavior Modeling Framework for Recommendation
Chang Zhou
Jinze Bai
Junshuai Song
Xiaofei Liu
Zhengchao Zhao
Xiusi Chen
Jun Gao
HAI
101
309
0
17 Nov 2017
Action-Attending Graphic Neural Network
Action-Attending Graphic Neural Network
Chaolong Li
Zhen Cui
Wenming Zheng
Chunyan Xu
Rongrong Ji
Jian Yang
76
50
0
17 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery
  through Dialogs and Queries
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
Anton Van Den Hengel
ObjD
87
135
0
17 Nov 2017
Language-Based Image Editing with Recurrent Attentive Models
Language-Based Image Editing with Recurrent Attentive Models
Jianbo Chen
Yelong Shen
Jianfeng Gao
Jingjing Liu
Xiaodong Liu
99
122
0
16 Nov 2017
A Novel Framework for Robustness Analysis of Visual QA Models
A Novel Framework for Robustness Analysis of Visual QA Models
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
Guohao Li
AAMLOOD
82
34
0
16 Nov 2017
Natural Language Guided Visual Relationship Detection
Natural Language Guided Visual Relationship Detection
Wentong Liao
Shuai Lin
Bodo Rosenhahn
M. Yang
94
63
0
16 Nov 2017
Towards Interpretable R-CNN by Unfolding Latent Structures
Towards Interpretable R-CNN by Unfolding Latent Structures
Tianfu Wu
Wei Sun
Xilai Li
Xi Song
Yangqiu Song
ObjD
62
20
0
14 Nov 2017
Saliency-based Sequential Image Attention with Multiset Prediction
Saliency-based Sequential Image Attention with Multiset Prediction
Sean Welleck
Jialin Mao
Kyunghyun Cho
Zheng Zhang
52
23
0
14 Nov 2017
Decoupled Weight Decay Regularization
Decoupled Weight Decay Regularization
I. Loshchilov
Frank Hutter
OffRL
167
2,166
0
14 Nov 2017
High-Order Attention Models for Visual Question Answering
High-Order Attention Models for Visual Question Answering
Idan Schwartz
Alex Schwing
Tamir Hazan
80
103
0
12 Nov 2017
AON: Towards Arbitrarily-Oriented Text Recognition
AON: Towards Arbitrarily-Oriented Text Recognition
Zhanzhan Cheng
Xuyang Liu
Fan Bai
Yi Niu
Shiliang Pu
Shuigeng Zhou
71
14
0
12 Nov 2017
Building machines that adapt and compute like brains
Building machines that adapt and compute like brains
Brenden M. Lake
J. Tenenbaum
AI4CEFedMLNAIAILaw
331
887
0
11 Nov 2017
End-to-end Video-level Representation Learning for Action Recognition
End-to-end Video-level Representation Learning for Action Recognition
Jiagang Zhu
Wei Zou
Zheng Zhu
97
89
0
11 Nov 2017
Phrase-based Image Captioning with Hierarchical LSTM Model
Phrase-based Image Captioning with Hierarchical LSTM Model
Y. Tan
Chee Seng Chan
VLM
31
4
0
11 Nov 2017
Towards Automated ICD Coding Using Deep Learning
Towards Automated ICD Coding Using Deep Learning
Haoran Shi
P. Xie
Zhiting Hu
Ming Zhang
Eric Xing
62
142
0
11 Nov 2017
Attend and Diagnose: Clinical Time Series Analysis using Attention
  Models
Attend and Diagnose: Clinical Time Series Analysis using Attention Models
Huan-Zhi Song
Deepta Rajan
Jayaraman J. Thiagarajan
A. Spanias
MLAU
112
456
0
10 Nov 2017
Learning Multi-Modal Word Representation Grounded in Visual Context
Learning Multi-Modal Word Representation Grounded in Visual Context
Éloi Zablocki
Benjamin Piwowarski
Laure Soulier
Patrick Gallinari
SSL
74
30
0
09 Nov 2017
Learning Markov Chain in Unordered Dataset
Learning Markov Chain in Unordered Dataset
Yao-Hung Hubert Tsai
Haiying Zhao
Ruslan Salakhutdinov
Nebojsa Jojic
CML
83
1
0
08 Nov 2017
Multi-label Image Recognition by Recurrently Discovering Attentional
  Regions
Multi-label Image Recognition by Recurrently Discovering Attentional Regions
Zhouxia Wang
Tianshui Chen
Guanbin Li
Ruijia Xu
Liang Lin
126
291
0
08 Nov 2017
Image Captioning and Classification of Dangerous Situations
Image Captioning and Classification of Dangerous Situations
Octavio Arriaga
Paul G. Plöger
Matias Valdenegro-Toro
37
8
0
07 Nov 2017
Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent
  Networks
Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent Networks
Nan Rosemary Ke
Anirudh Goyal
O. Bilaniuk
Jonathan Binas
Laurent Charlin
C. Pal
Yoshua Bengio
78
15
0
07 Nov 2017
Semantic Image Retrieval via Active Grounding of Visual Situations
Semantic Image Retrieval via Active Grounding of Visual Situations
Max H. Quinn
E. Conser
Jordan M. Witte
Melanie Mitchell
69
9
0
31 Oct 2017
Fraternal Dropout
Fraternal Dropout
Konrad Zolna
Devansh Arpit
Dendi Suhubdy
Yoshua Bengio
92
53
0
31 Oct 2017
Whodunnit? Crime Drama as a Case for Natural Language Understanding
Whodunnit? Crime Drama as a Case for Natural Language Understanding
Lea Frermann
Shay B. Cohen
Mirella Lapata
67
26
0
31 Oct 2017
Melody Generation for Pop Music via Word Representation of Musical
  Properties
Melody Generation for Pop Music via Word Representation of Musical Properties
Andrew Shin
Léopold Crestel
Hiroharu Kato
Kuniaki Saito
Katsunori Ohnishi
Masataka Yamaguchi
Masahiro Nakawaki
Yoshitaka Ushiku
Tatsuya Harada
MGen
72
12
0
31 Oct 2017
How deep learning works --The geometry of deep learning
How deep learning works --The geometry of deep learning
Xiao Dong
Jiasong Wu
Ling Zhou
GNN
66
8
0
30 Oct 2017
Understanding Hidden Memories of Recurrent Neural Networks
Understanding Hidden Memories of Recurrent Neural Networks
Yao Ming
Shaozu Cao
Ruixiang Zhang
Zerui Li
Yuanzhe Chen
Yangqiu Song
Huamin Qu
HAI
48
201
0
30 Oct 2017
Sequence-to-Sequence ASR Optimization via Reinforcement Learning
Sequence-to-Sequence ASR Optimization via Reinforcement Learning
Andros Tjandra
S. Sakti
Satoshi Nakamura
AI4TS
96
26
0
30 Oct 2017
Contextual Regression: An Accurate and Conveniently Interpretable
  Nonlinear Model for Mining Discovery from Scientific Data
Contextual Regression: An Accurate and Conveniently Interpretable Nonlinear Model for Mining Discovery from Scientific Data
Chengyu Liu
Wei Wang
32
7
0
30 Oct 2017
Phase Conductor on Multi-layered Attentions for Machine Comprehension
Phase Conductor on Multi-layered Attentions for Machine Comprehension
R. Liu
Wei Wei
Weiguang Mao
M. Chikina
99
22
0
28 Oct 2017
Learning to diagnose from scratch by exploiting dependencies among
  labels
Learning to diagnose from scratch by exploiting dependencies among labels
L. Yao
Eric Poblenz
Dmitry Dagunts
Ben Covington
D. Bernard
Kevin Lyman
83
336
0
28 Oct 2017
Attention-Based Models for Text-Dependent Speaker Verification
Attention-Based Models for Text-Dependent Speaker Verification
F. R. Chowdhury
Quan Wang
Ignacio López Moreno
Li Wan
89
172
0
28 Oct 2017
Few-shot Autoregressive Density Estimation: Towards Learning to Learn
  Distributions
Few-shot Autoregressive Density Estimation: Towards Learning to Learn Distributions
Scott E. Reed
Yutian Chen
T. Paine
Aaron van den Oord
S. M. Ali Eslami
Danilo Jimenez Rezende
Oriol Vinyals
Nando de Freitas
130
88
0
27 Oct 2017
Human-in-the-loop Artificial Intelligence
Human-in-the-loop Artificial Intelligence
Fabio Massimo Zanzotto
88
270
0
23 Oct 2017
FigureQA: An Annotated Figure Dataset for Visual Reasoning
FigureQA: An Annotated Figure Dataset for Visual Reasoning
Samira Ebrahimi Kahou
Vincent Michalski
Adam Atkinson
Ákos Kádár
Adam Trischler
Yoshua Bengio
ReLMAIMat
91
332
0
19 Oct 2017
Learning Social Image Embedding with Deep Multimodal Attention Networks
Learning Social Image Embedding with Deep Multimodal Attention Networks
Feiran Huang
Xiaoming Zhang
Zhoujun Li
Tao Mei
Yueying He
Zhonghua Zhao
59
20
0
18 Oct 2017
Beat by Beat: Classifying Cardiac Arrhythmias with Recurrent Neural
  Networks
Beat by Beat: Classifying Cardiac Arrhythmias with Recurrent Neural Networks
Patrick Schwab
G. Scebba
Jia Zhang
Marco Delai
W. Karlen
87
76
0
17 Oct 2017
Describing Natural Images Containing Novel Objects with Knowledge Guided
  Assitance
Describing Natural Images Containing Novel Objects with Knowledge Guided Assitance
Aditya Mogadala
Umanga Bista
Lexing Xie
Achim Rettinger
60
7
0
17 Oct 2017
Previous
123...575859...697071
Next