1411.4555
Show and Tell: A Neural Image Caption Generator
17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
Papers citing
"Show and Tell: A Neural Image Caption Generator"
50 / 2,023 papers shown
Title
Meaning guided video captioning
Rushi J. Babariya
Toru Tamaki
30
3
0
12 Dec 2019
Connecting Vision and Language with Localized Narratives
Jordi Pont-Tuset
J. Uijlings
Soravit Changpinyo
Radu Soricut
V. Ferrari
ObjD
36
244
0
06 Dec 2019
Synchronous Transformers for End-to-End Speech Recognition
Zhengkun Tian
Jiangyan Yi
Ye Bai
J. Tao
Shuai Zhang
Zhengqi Wen
27
72
0
06 Dec 2019
Siamese Natural Language Tracker: Tracking by Natural Language Descriptions with Siamese Trackers
Qi Feng
Vitaly Ablavsky
Qinxun Bai
Stan Sclaroff
39
17
0
04 Dec 2019
Better Understanding Hierarchical Visual Relationship for Image Caption
Z. Fei
31
0
0
04 Dec 2019
Assessing the Robustness of Visual Question Answering Models
Jia-Hong Huang
Modar Alfadly
Guohao Li
Marcel Worring
AAML
OOD
28
23
0
30 Nov 2019
Multimodal Machine Translation through Visuals and Speech
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
63
74
0
28 Nov 2019
Multimodal Attention Networks for Low-Level Vision-and-Language Navigation
Federico Landi
Lorenzo Baraldi
Marcella Cornia
M. Corsini
Rita Cucchiara
LM&Ro
16
27
0
27 Nov 2019
CRUR: Coupled-Recurrent Unit for Unification, Conceptualization and Context Capture for Language Representation -- A Generalization of Bi Directional LSTM
C. Sur
BDL
14
6
0
22 Nov 2019
Injecting Prior Knowledge into Image Caption Generation
A. Goel
Basura Fernando
Thanh-Son Nguyen
Hakan Bilen
23
0
0
22 Nov 2019
Orderless Recurrent Models for Multi-label Classification
V. O. Yazici
Abel Gonzalez-Garcia
Arnau Ramisa
Bartlomiej Twardowski
Joost van de Weijer
SSL
19
92
0
22 Nov 2019
Characterizing the impact of using features extracted from pre-trained models on the quality of video captioning sequence-to-sequence models
Menatallh Hammad
May Hammad
Mohamed Elshenawy
24
2
0
22 Nov 2019
Continual adaptation for efficient machine communication
Robert D. Hawkins
Minae Kwon
Dorsa Sadigh
Noah D. Goodman
CLL
32
33
0
22 Nov 2019
Reinforcing an Image Caption Generator Using Off-Line Human Feedback
Paul Hongsuck Seo
Piyush Sharma
Tomer Levinboim
Bohyung Han
Radu Soricut
OffRL
29
22
0
21 Nov 2019
Inspect Transfer Learning Architecture with Dilated Convolution
Syeda Noor Jaha Azim
Md. Aminur Rab Ratul
24
0
0
20 Nov 2019
Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA
Badri N. Patro
Anupriy
Vinay P. Namboodiri
AAML
FAtt
48
26
0
19 Nov 2019
Conditionally Learn to Pay Attention for Sequential Visual Task
Jun He
Quan-Jie Cao
Lei Zhang
32
0
0
11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
40
326
0
10 Nov 2019
Distilling Knowledge Learned in BERT for Text Generation
Yen-Chun Chen
Zhe Gan
Yu Cheng
Jingzhou Liu
Jingjing Liu
28
28
0
10 Nov 2019
On Architectures for Including Visual Information in Neural Language Models for Image Description
Marc Tanti
Albert Gatt
K. Camilleri
VLM
30
2
0
09 Nov 2019
Early Predictions for Medical Crowdfunding: A Deep Learning Approach Using Diverse Inputs
Tong Wang
Fujie Jin
Y. Hu
Yuan Cheng
OOD
30
5
0
09 Nov 2019
Are we asking the right questions in MovieQA?
Bhavan A. Jasani
Rohit Girdhar
Deva Ramanan
19
15
0
08 Nov 2019
Boosting LSTM Performance Through Dynamic Precision Selection
Franyell Silfa
J. Arnau
Antonio González
MQ
21
5
0
07 Nov 2019
Dancing to Music
Hsin-Ying Lee
Xiaodong Yang
Xuan Li
Ting-Chun Wang
Yu-Ding Lu
Ming-Hsuan Yang
Jan Kautz
27
15
0
05 Nov 2019
SHARP: An Adaptable, Energy-Efficient Accelerator for Recurrent Neural Network
R. Yazdani
Olatunji Ruwase
Minjia Zhang
Yuxiong He
J. Arnau
Antonio González
38
4
0
04 Nov 2019
Predicting the Politics of an Image Using Webly Supervised Data
Christopher Thomas
Adriana Kovashka
SSL
26
21
0
31 Oct 2019
Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder
Jialin Wu
Raymond J. Mooney
26
0
0
31 Oct 2019
XL-Editor: Post-editing Sentences with XLNet
Yong-Siang Shih
Wei-Cheng Chang
Yiming Yang
KELM
28
11
0
19 Oct 2019
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style
Hongwei Ge
Zehang Yan
Kai Zhang
Mingde Zhao
Liang Sun
30
24
0
15 Oct 2019
Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating Referee
Shuangjie Xu
Feng Xu
Yu Cheng
Pan Zhou
21
2
0
14 Oct 2019
Dynamic Attention Networks for Task Oriented Grounding
S. Dasgupta
Badri N. Patro
Vinay P. Namboodiri
36
1
0
14 Oct 2019
Granular Multimodal Attention Networks for Visual Dialog
Badri N. Patro
Shivansh Patel
Vinay P. Namboodiri
33
1
0
13 Oct 2019
Referring Expression Object Segmentation with Caption-Aware Consistency
Yi-Wen Chen
Yi-Hsuan Tsai
Tiantian Wang
Yen-Yu Lin
Ming-Hsuan Yang
EgoV
17
87
0
10 Oct 2019
Text-to-Image Synthesis Based on Machine Generated Captions
Marco Menardi
Alex Falcon
Saida S. Mohamed
Lorenzo Seidenari
G. Serra
A. Bimbo
C. Tasso
33
0
0
09 Oct 2019
Semantic-aware Image Deblurring
Fuhai Chen
Rongrong Ji
Chengpeng Dai
Xiaoshuai Sun
Chia-Wen Lin
Jiayi Ji
Baochang Zhang
Feiyue Huang
Liujuan Cao
BDL
VLM
25
6
0
09 Oct 2019
Prose for a Painting
Prerna Kashyap
Samrat Phatale
Iddo Drori
9
3
0
08 Oct 2019
Score-CAM: Score-Weighted Visual Explanations for Convolutional Neural Networks
Mehdi Neshat
Zifan Wang
Bradley Alexander
Fan Yang
Zijian Zhang
Sirui Ding
Markus Wagner
Xia Hu
FAtt
45
1,053
0
03 Oct 2019
A Hierarchical Approach for Visual Storytelling Using Image Description
Md Sultan al Nahian
Tasmia Tasrin
Sagar Gandhi
Ryan Gaines
Brent Harrison
19
11
0
26 Sep 2019
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators
Kuang-Huei Lee
Hamid Palangi
Xi Chen
Houdong Hu
Jianfeng Gao
VLM
35
37
0
22 Sep 2019
Towards Explainable Neural-Symbolic Visual Reasoning
Adrien Bennetot
J. Laurent
Raja Chatila
Natalia Díaz Rodríguez
XAI
30
1
0
19 Sep 2019
Adaptively Aligned Image Captioning via Adaptive Attention Time
Lun Huang
Wenmin Wang
Yaxian Xia
Jie Chen
8
61
0
19 Sep 2019
Large-scale representation learning from visually grounded untranscribed speech
Gabriel Ilharco
Yuan Zhang
Jason Baldridge
SSL
27
60
0
19 Sep 2019
ContCap: A scalable framework for continual image captioning
Giang Nguyen
Tae Joon Jun
T. Tran
Tolcha Yalew
Daeyoung Kim
VLM
CLL
24
10
0
19 Sep 2019
BSDAR: Beam Search Decoding with Attention Reward in Neural Keyphrase Generation
Iftitahu Ni'mah
Vlado Menkovski
Mykola Pechenizkiy
30
2
0
17 Sep 2019
Semantic Relatedness Based Re-ranker for Text Spotting
Ahmed Sabir
Francesc Moreno-Noguer
Lluís Padró
VLM
30
5
0
17 Sep 2019
Controllable Length Control Neural Encoder-Decoder via Reinforcement Learning
Junyi Bian
Baojun Lin
Kecheng Zhang
Zhaohui Yan
H. Tang
Yonghe Zhang
22
5
0
17 Sep 2019
Inverse Visual Question Answering with Multi-Level Attentions
Yaser Alwatter
Yuhong Guo
BDL
24
1
0
17 Sep 2019
ChOracle: A Unified Statistical Framework for Churn Prediction
Ali Khodadadi
Seyed Abbas Hosseini
Ehsan Pajouheshgar
Farnam Mansouri
Hamid R. Rabiee
AI4TS
16
12
0
15 Sep 2019
Understanding LSTM -- a tutorial into Long Short-Term Memory Recurrent Neural Networks
R. C. Staudemeyer
Eric Rothstein Morris
17
479
0
12 Sep 2019
Speculative Beam Search for Simultaneous Translation
Renjie Zheng
Mingbo Ma
Baigong Zheng
Liang Huang
45
24
0
12 Sep 2019