ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
A Simple Neural Attentive Meta-Learner
A Simple Neural Attentive Meta-Learner
Nikhil Mishra
Mostafa Rohaninejad
Xi Chen
Pieter Abbeel
OOD
109
200
0
11 Jul 2017
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis
  Network
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network
Zizhao Zhang
Yuanpu Xie
Fuyong Xing
M. McGough
Ling Yang
MedIm
74
303
0
08 Jul 2017
A Nested Attention Neural Hybrid Model for Grammatical Error Correction
A Nested Attention Neural Hybrid Model for Grammatical Error Correction
Jianshu Ji
Qinlong Wang
Kristina Toutanova
Yongen Gong
Steven QH Truong
Jianfeng Gao
97
110
0
07 Jul 2017
Multiple Range-Restricted Bidirectional Gated Recurrent Units with
  Attention for Relation Classification
Multiple Range-Restricted Bidirectional Gated Recurrent Units with Attention for Relation Classification
Jonggu Kim
Jong-Hyeok Lee
48
4
0
05 Jul 2017
Visually Grounded Word Embeddings and Richer Visual Features for
  Improving Multimodal Neural Machine Translation
Visually Grounded Word Embeddings and Richer Visual Features for Improving Multimodal Neural Machine Translation
Jean-Benoit Delbrouck
Stéphane Dupont
Omar Seddati
95
8
0
04 Jul 2017
An empirical study on the effectiveness of images in Multimodal Neural
  Machine Translation
An empirical study on the effectiveness of images in Multimodal Neural Machine Translation
Jean-Benoit Delbrouck
Stéphane Dupont
86
39
0
04 Jul 2017
Where to Play: Retrieval of Video Segments using Natural-Language
  Queries
Where to Play: Retrieval of Video Segments using Natural-Language Queries
Sangkuk Lee
Daesik Kim
Myunggi Lee
Jihye Hwang
Nojun Kwak
68
3
0
02 Jul 2017
Modulating early visual processing by language
Modulating early visual processing by language
H. D. Vries
Florian Strub
Jérémie Mary
Hugo Larochelle
Olivier Pietquin
Aaron Courville
200
490
0
02 Jul 2017
Efficient Attention using a Fixed-Size Memory Representation
Efficient Attention using a Fixed-Size Memory Representation
D. Britz
M. Guan
Minh-Thang Luong
3DV
67
32
0
01 Jul 2017
Neural Sequence Model Training via $α$-divergence Minimization
Neural Sequence Model Training via ααα-divergence Minimization
Sotetsu Koyamada
Yuta Kikuchi
Atsunori Kanemura
S. Maeda
S. Ishii
104
0
0
30 Jun 2017
Graph Convolution: A High-Order and Adaptive Approach
Graph Convolution: A High-Order and Adaptive Approach
Zhenpeng Zhou
Xiaocheng Li
GNN
75
23
0
29 Jun 2017
Actor-Critic Sequence Training for Image Captioning
Actor-Critic Sequence Training for Image Captioning
Li Zhang
Flood Sung
Feng Liu
Tao Xiang
S. Gong
Yongxin Yang
Timothy M. Hospedales
86
111
0
29 Jun 2017
The YouTube-8M Kaggle Competition: Challenges and Methods
The YouTube-8M Kaggle Competition: Challenges and Methods
Haosheng Zou
Kun Xu
Jialian Li
Jun Zhu
39
13
0
28 Jun 2017
Generative Bridging Network in Neural Sequence Prediction
Generative Bridging Network in Neural Sequence Prediction
Wenhu Chen
Guanlin Li
Shuo Ren
Shujie Liu
Zhirui Zhang
Mu Li
M. Zhou
93
10
0
28 Jun 2017
Named Entity Disambiguation for Noisy Text
Named Entity Disambiguation for Noisy Text
Yotam Eshel
N. Cohen
Kira Radinsky
Shaul Markovitch
Ikuya Yamada
Omer Levy
49
92
0
28 Jun 2017
Generative Encoder-Decoder Models for Task-Oriented Spoken Dialog
  Systems with Chatting Capability
Generative Encoder-Decoder Models for Task-Oriented Spoken Dialog Systems with Chatting Capability
Tiancheng Zhao
Allen Lu
Kyusong Lee
M. Eskénazi
AuLLM
91
87
0
26 Jun 2017
Paying More Attention to Saliency: Image Captioning with Saliency and
  Context Attention
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention
Marcella Cornia
Lorenzo Baraldi
Giuseppe Serra
Rita Cucchiara
75
80
0
26 Jun 2017
Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network
  with Trust Gates
Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates
Jun Liu
Amir Shahroudy
Dong Xu
Alex C. Kot
G. Wang
89
457
0
26 Jun 2017
Deep Semantics-Aware Photo Adjustment
Deep Semantics-Aware Photo Adjustment
Seonghyeon Nam
Seon Joo Kim
GAN
10
3
0
26 Jun 2017
Methods for Interpreting and Understanding Deep Neural Networks
Methods for Interpreting and Understanding Deep Neural Networks
G. Montavon
Wojciech Samek
K. Müller
FaML
299
2,281
0
24 Jun 2017
Generating Long-term Trajectories Using Deep Hierarchical Networks
Generating Long-term Trajectories Using Deep Hierarchical Networks
Stephan Zheng
Yisong Yue
Jennifer Hobbs
86
104
0
21 Jun 2017
Neural-based Natural Language Generation in Dialogue using RNN
  Encoder-Decoder with Semantic Aggregation
Neural-based Natural Language Generation in Dialogue using RNN Encoder-Decoder with Semantic Aggregation
Van-Khanh Tran
Le-Minh Nguyen
62
33
0
21 Jun 2017
Grounded Language Learning in a Simulated 3D World
Grounded Language Learning in a Simulated 3D World
Karl Moritz Hermann
Felix Hill
Simon Green
Fumin Wang
Ryan Faulkner
...
Denis Teplyashin
Marcus Wainwright
C. Apps
Demis Hassabis
Phil Blunsom
LM&Ro
109
306
0
20 Jun 2017
Dipole: Diagnosis Prediction in Healthcare via Attention-based
  Bidirectional Recurrent Neural Networks
Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks
Fenglong Ma
Radha Chitta
Jing Zhou
Quanzeng You
Tong Sun
Jing Gao
74
559
0
19 Jun 2017
Dex: Incremental Learning for Complex Environments in Deep Reinforcement
  Learning
Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning
Nick Erickson
Qi Zhao
CLLOffRL
422
2
0
19 Jun 2017
An online sequence-to-sequence model for noisy speech recognition
An online sequence-to-sequence model for noisy speech recognition
Chung-Cheng Chiu
Dieterich Lawson
Yuping Luo
George Tucker
Kevin Swersky
Ilya Sutskever
Navdeep Jaitly
57
7
0
16 Jun 2017
FeaStNet: Feature-Steered Graph Convolutions for 3D Shape Analysis
FeaStNet: Feature-Steered Graph Convolutions for 3D Shape Analysis
Nitika Verma
Edmond Boyer
Jakob Verbeek
3DPCGNN
75
26
0
16 Jun 2017
Enriched Deep Recurrent Visual Attention Model for Multiple Object
  Recognition
Enriched Deep Recurrent Visual Attention Model for Multiple Object Recognition
Artsiom Ablavatski
Shijian Lu
Jianfei Cai
51
37
0
12 Jun 2017
Image Captioning with Object Detection and Localization
Image Captioning with Object Detection and Localization
Zhongliang Yang
Yujin Zhang
S. Rehman
Yongfeng Huang
ObjDVLM
50
47
0
08 Jun 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning
  to a Generative Visual Dialog Model
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
Jiasen Lu
A. Kannan
Jianwei Yang
Devi Parikh
Dhruv Batra
BDL
102
137
0
05 Jun 2017
Visual attention models for scene text recognition
Visual attention models for scene text recognition
Suman K. Ghosh
Ernest Valveny
Andrew D. Bagdanov
50
45
0
05 Jun 2017
A simple neural network module for relational reasoning
A simple neural network module for relational reasoning
Adam Santoro
David Raposo
David Barrett
Mateusz Malinowski
Razvan Pascanu
Peter W. Battaglia
Timothy Lillicrap
GNNNAI
191
1,617
0
05 Jun 2017
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Jingkuan Song
Zhao Guo
Lianli Gao
Wu Liu
Dongxiang Zhang
Heng Tao Shen
107
167
0
05 Jun 2017
Order embeddings and character-level convolutions for multimodal
  alignment
Order embeddings and character-level convolutions for multimodal alignment
Jonatas Wehrmann
Anderson Mattjie
Rodrigo C. Barros
37
27
0
03 Jun 2017
Attentive Convolutional Neural Network based Speech Emotion Recognition:
  A Study on the Impact of Input Features, Signal Length, and Acted Speech
Attentive Convolutional Neural Network based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech
Michael Neumann
Ngoc Thang Vu
92
217
0
02 Jun 2017
NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation
  Systems
NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems
Ozan Caglayan
Mercedes García-Martínez
Adrien Bardet
Walid Aransa
Fethi Bougares
Loïc Barrault
98
65
0
01 Jun 2017
Natural Language Generation for Spoken Dialogue System using RNN
  Encoder-Decoder Networks
Natural Language Generation for Spoken Dialogue System using RNN Encoder-Decoder Networks
Van-Khanh Tran
Le-Minh Nguyen
131
41
0
01 Jun 2017
Teaching Machines to Describe Images via Natural Language Feedback
Teaching Machines to Describe Images via Natural Language Feedback
Huan Ling
Sanja Fidler
95
45
0
01 Jun 2017
Reinforcement Learning for Learning Rate Control
Reinforcement Learning for Learning Rate Control
Chang Xu
Tao Qin
G. Wang
Tie-Yan Liu
81
34
0
31 May 2017
Emergent Communication in a Multi-Modal, Multi-Step Referential Game
Emergent Communication in a Multi-Modal, Multi-Step Referential Game
Katrina Evtimova
Andrew Drozdov
Douwe Kiela
Kyunghyun Cho
160
31
0
29 May 2017
Contextual Explanation Networks
Contextual Explanation Networks
Maruan Al-Shedivat
Kumar Avinava Dubey
Eric Xing
CML
114
83
0
29 May 2017
Latent Intention Dialogue Models
Latent Intention Dialogue Models
Tsung-Hsien Wen
Yishu Miao
Phil Blunsom
S. Young
90
146
0
29 May 2017
Deep Learning for User Comment Moderation
Deep Learning for User Comment Moderation
John Pavlopoulos
Prodromos Malakasiotis
Ion Androutsopoulos
70
126
0
28 May 2017
Multimodal Machine Learning: A Survey and Taxonomy
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
190
2,967
0
26 May 2017
Learning Structured Text Representations
Learning Structured Text Representations
Yang Liu
Mirella Lapata
130
152
0
25 May 2017
Attention-based Natural Language Person Retrieval
Attention-based Natural Language Person Retrieval
Tao Zhou
Muhao Chen
Jie Yu
Demetri Terzopoulos
39
14
0
24 May 2017
MMD GAN: Towards Deeper Understanding of Moment Matching Network
MMD GAN: Towards Deeper Understanding of Moment Matching Network
Chun-Liang Li
Wei-Cheng Chang
Yu Cheng
Yiming Yang
Barnabás Póczós
GAN
72
726
0
24 May 2017
Clinical Intervention Prediction and Understanding using Deep Networks
Clinical Intervention Prediction and Understanding using Deep Networks
Harini Suresh
Nathan Hunt
Alistair E. W. Johnson
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OOD
76
135
0
23 May 2017
Efficiently applying attention to sequential data with the Recurrent
  Discounted Attention unit
Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit
B. Maginnis
Pierre Harvey Richemond
AI4TS
32
1
0
23 May 2017
Question-Answering with Grammatically-Interpretable Representations
Question-Answering with Grammatically-Interpretable Representations
Hamid Palangi
P. Smolensky
Xiaodong He
Li Deng
82
54
0
23 May 2017
Previous
123...606162...697071
Next