ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Position Focused Attention Network for Image-Text Matching
Position Focused Attention Network for Image-Text Matching
Yaxiong Wang
Hao-Hsiang Yang
Xueming Qian
Lin Ma
Jing Lu
Biao Li
Xin Fan
58
172
0
23 Jul 2019
MacNet: Transferring Knowledge from Machine Comprehension to
  Sequence-to-Sequence Models
MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models
Boyuan Pan
Yazheng Yang
Hao Li
Zhou Zhao
Yueting Zhuang
Deng Cai
Xiaofei He
64
18
0
23 Jul 2019
Compact Global Descriptor for Neural Networks
Xiangyu He
Ke Cheng
Qiang Chen
Qinghao Hu
Peisong Wang
Jian Cheng
96
8
0
23 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
143
136
0
22 Jul 2019
Deep Learning for Time Series Forecasting: The Electric Load Case
Deep Learning for Time Series Forecasting: The Electric Load Case
Alberto Gasparin
S. Lukovic
Cesare Alippi
AI4TS
80
231
0
22 Jul 2019
Automatic Radiology Report Generation based on Multi-view Image Fusion
  and Medical Concept Enrichment
Automatic Radiology Report Generation based on Multi-view Image Fusion and Medical Concept Enrichment
Jianbo Yuan
Haofu Liao
R. Luo
Jiebo Luo
MedIm
88
196
0
22 Jul 2019
A study on the Interpretability of Neural Retrieval Models using
  DeepSHAP
A study on the Interpretability of Neural Retrieval Models using DeepSHAP
Zeon Trevor Fernando
Jaspreet Singh
Avishek Anand
FAttAAML
65
68
0
15 Jul 2019
Extracting Interpretable Physical Parameters from Spatiotemporal Systems
  using Unsupervised Learning
Extracting Interpretable Physical Parameters from Spatiotemporal Systems using Unsupervised Learning
Peter Y. Lu
Samuel Kim
Marin Soljacic
AI4CE
67
60
0
13 Jul 2019
A Survey of Deep Learning-based Object Detection
A Survey of Deep Learning-based Object Detection
L. Jiao
Fan Zhang
Fang Liu
Shuyuan Yang
Lingling Li
Zhixi Feng
Rong Qu
ObjD
131
974
0
11 Jul 2019
Variational Context: Exploiting Visual and Textual Context for Grounding
  Referring Expressions
Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions
Yulei Niu
Hanwang Zhang
Zhiwu Lu
Shih-Fu Chang
ObjDBDL
101
26
0
08 Jul 2019
Informative Visual Storytelling with Cross-modal Rules
Informative Visual Storytelling with Cross-modal Rules
Jiacheng Li
Haizhou Shi
Siliang Tang
Leilei Gan
Yueting Zhuang
52
24
0
07 Jul 2019
EPNAS: Efficient Progressive Neural Architecture Search
EPNAS: Efficient Progressive Neural Architecture Search
Yanqi Zhou
Peng Wang
Sercan O. Arik
Haonan Yu
Syed Zawad
Feng Yan
G. Diamos
47
5
0
07 Jul 2019
Graph Representation Learning via Hard and Channel-Wise Attention
  Networks
Graph Representation Learning via Hard and Channel-Wise Attention Networks
Hongyang Gao
Shuiwang Ji
GNN
70
57
0
05 Jul 2019
Embodied Vision-and-Language Navigation with Dynamic Convolutional
  Filters
Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters
Federico Landi
Lorenzo Baraldi
M. Corsini
Rita Cucchiara
LM&Ro
98
27
0
05 Jul 2019
Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and
  Graph Attention Networks
Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks
V. Kosaraju
Amir Sadeghian
Roberto Martín-Martín
Ian Reid
S. Hamid Rezatofighi
Silvio Savarese
95
613
0
04 Jul 2019
ACNe: Attentive Context Normalization for Robust Permutation-Equivariant
  Learning
ACNe: Attentive Context Normalization for Robust Permutation-Equivariant Learning
Weiwei Sun
Wei Jiang
Eduard Trulls
Andrea Tagliasacchi
K. M. Yi
3DPC
90
20
0
04 Jul 2019
Learning Blended, Precise Semantic Program Embeddings
Learning Blended, Precise Semantic Program Embeddings
Ke Wang
Z. Su
NAI
62
27
0
03 Jul 2019
Neural Image Captioning
Neural Image Captioning
E. Tan
Lakshay Sharma
VLM
55
3
0
02 Jul 2019
Generative Models for Automatic Chemical Design
Generative Models for Automatic Chemical Design
Daniel Schwalbe-Koda
Rafael Gómez-Bombarelli
MedImAI4CE
89
81
0
02 Jul 2019
Augmenting Self-attention with Persistent Memory
Augmenting Self-attention with Persistent Memory
Sainbayar Sukhbaatar
Edouard Grave
Guillaume Lample
Hervé Jégou
Armand Joulin
RALMKELM
77
139
0
02 Jul 2019
Kite: Automatic speech recognition for unmanned aerial vehicles
Kite: Automatic speech recognition for unmanned aerial vehicles
Dan Oneaţă
H. Cucu
40
13
0
02 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue
  Systems
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
70
112
0
02 Jul 2019
Inter and Intra Document Attention for Depression Risk Assessment
Inter and Intra Document Attention for Depression Risk Assessment
Diego Maupomé
Marc Queudot
Marie-Jean Meurs
14
7
0
30 Jun 2019
Machine Reading Comprehension: a Literature Review
Machine Reading Comprehension: a Literature Review
Xin Zhang
An Yang
Sujian Li
Yizhong Wang
91
33
0
30 Jun 2019
A Neural Attention Model for Adaptive Learning of Social Friends'
  Preferences
A Neural Attention Model for Adaptive Learning of Social Friends' Preferences
Dimitrios Rafailidis
Gerhard Weiss
GNNFedML
50
2
0
29 Jun 2019
A Deep Decoder Structure Based on WordEmbedding Regression for An
  Encoder-Decoder Based Model for Image Captioning
A Deep Decoder Structure Based on WordEmbedding Regression for An Encoder-Decoder Based Model for Image Captioning
A. Asadi
Reza Safabakhsh
26
3
0
26 Jun 2019
Creating A Neural Pedagogical Agent by Jointly Learning to Review and
  Assess
Creating A Neural Pedagogical Agent by Jointly Learning to Review and Assess
Youngnam Lee
Youngduck Choi
Junghyun Cho
Alexander R. Fabbri
Hyunbin Loh
Chanyou Hwang
Yongku Lee
Sang-Wook Kim
Dragomir R. Radev
35
18
0
26 Jun 2019
Deep Modular Co-Attention Networks for Visual Question Answering
Deep Modular Co-Attention Networks for Visual Question Answering
Zhou Yu
Jun Yu
Yuhao Cui
Dacheng Tao
Q. Tian
150
811
0
25 Jun 2019
Is It Worth the Attention? A Comparative Evaluation of Attention Layers
  for Argument Unit Segmentation
Is It Worth the Attention? A Comparative Evaluation of Attention Layers for Argument Unit Segmentation
Maximilian Spliethover
Jonas Klaff
Hendrik Heuer
54
10
0
24 Jun 2019
CORAL8: Concurrent Object Regression for Area Localization in Medical
  Image Panels
CORAL8: Concurrent Object Regression for Area Localization in Medical Image Panels
Sam Maksoud
Arnold Wiliem
Kun-li Zhao
Teng Zhang
Lin Wu
Brian C. Lovell
MedIm
51
11
0
24 Jun 2019
Improving Description-based Person Re-identification by
  Multi-granularity Image-text Alignments
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments
K. Niu
Y. Huang
Wanli Ouyang
Liang Wang
58
144
0
23 Jun 2019
Sequence Generation: From Both Sides to the Middle
Sequence Generation: From Both Sides to the Middle
Long Zhou
Jiajun Zhang
Chengqing Zong
Heng Yu
76
22
0
23 Jun 2019
Informative Image Captioning with External Sources of Information
Informative Image Captioning with External Sources of Information
Sanqiang Zhao
Piyush Sharma
Tomer Levinboim
Radu Soricut
65
46
0
20 Jun 2019
Understanding More about Human and Machine Attention in Deep Neural
  Networks
Understanding More about Human and Machine Attention in Deep Neural Networks
Qiuxia Lai
Salman Khan
Wenguan Wang
Jianbing Shen
Hanqiu Sun
Ling Shao
HAIXAI
52
7
0
20 Jun 2019
SMILES-X: autonomous molecular compounds characterization for small
  datasets without descriptors
SMILES-X: autonomous molecular compounds characterization for small datasets without descriptors
G. Lambard
Ekaterina Gracheva
71
21
0
20 Jun 2019
A simple and effective postprocessing method for image classification
A simple and effective postprocessing method for image classification
Yan Liu
Yun Li
Yunhao Yuan
Jipeng Qiang
21
1
0
19 Jun 2019
VizADS-B: Analyzing Sequences of ADS-B Images Using Explainable
  Convolutional LSTM Encoder-Decoder to Detect Cyber Attacks
VizADS-B: Analyzing Sequences of ADS-B Images Using Explainable Convolutional LSTM Encoder-Decoder to Detect Cyber Attacks
Sefi Akerman
Edan Habler
A. Shabtai
82
18
0
19 Jun 2019
Distilling Translations with Visual Awareness
Distilling Translations with Visual Awareness
Julia Ive
Pranava Madhyastha
Lucia Specia
VLM
154
76
0
18 Jun 2019
Expressing Visual Relationships via Language
Expressing Visual Relationships via Language
Hao Tan
Franck Dernoncourt
Zhe Lin
Trung Bui
Joey Tianyi Zhou
93
68
0
18 Jun 2019
Attention Guided Graph Convolutional Networks for Relation Extraction
Attention Guided Graph Convolutional Networks for Relation Extraction
Zhijiang Guo
Yan Zhang
Wei Lu
GNN
97
413
0
18 Jun 2019
ASAC: Active Sensing using Actor-Critic models
ASAC: Active Sensing using Actor-Critic models
Chang Jo Kim
James Jordon
M. Schaar
CML
63
16
0
16 Jun 2019
Image Captioning with Integrated Bottom-Up and Multi-level Residual
  Top-Down Attention for Game Scene Understanding
Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding
Jian Zheng
S. Krishnamurthy
Ruxin Chen
Min-Hung Chen
Zhenhao Ge
Xiaohua Li
85
4
0
16 Jun 2019
Generating Diverse and Informative Natural Language Fashion Feedback
Generating Diverse and Informative Natural Language Fashion Feedback
Gil Sadeh
L. Fritz
Gabi Shalev
Eduard Oks
58
5
0
15 Jun 2019
Connecting Touch and Vision via Cross-Modal Prediction
Connecting Touch and Vision via Cross-Modal Prediction
Yunzhu Li
Jun-Yan Zhu
Russ Tedrake
Antonio Torralba
80
139
0
14 Jun 2019
Image Captioning: Transforming Objects into Words
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
175
476
0
14 Jun 2019
Multigrid Neural Memory
Multigrid Neural Memory
T. Huynh
Michael Maire
Matthew R. Walter
64
10
0
13 Jun 2019
Stand-Alone Self-Attention in Vision Models
Stand-Alone Self-Attention in Vision Models
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
VLMSLRViT
193
1,218
0
13 Jun 2019
Near-Optimal Glimpse Sequences for Improved Hard Attention Neural
  Network Training
Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training
William Harvey
Michael Teng
Frank Wood
50
4
0
13 Jun 2019
Vispi: Automatic Visual Perception and Interpretation of Chest X-rays
Vispi: Automatic Visual Perception and Interpretation of Chest X-rays
X. Li
Rui Cao
D. Zhu
79
20
0
12 Jun 2019
Pay Attention to Convolution Filters: Towards Fast and Accurate
  Fine-Grained Transfer Learning
Pay Attention to Convolution Filters: Towards Fast and Accurate Fine-Grained Transfer Learning
Xiangxi Mo
Ruizhe Cheng
Tianyi Fang
35
3
0
12 Jun 2019
Previous
123...404142...697071
Next