Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.4555
Cited By
Show and Tell: A Neural Image Caption Generator
17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show and Tell: A Neural Image Caption Generator"
50 / 2,023 papers shown
Title
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation
Vihan Jain
Gabriel Ilharco
Alexander Ku
Ashish Vaswani
Eugene Ie
Jason Baldridge
LM&Ro
16
179
0
29 May 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
21
37
0
29 May 2019
Straight to Shapes++: Real-time Instance Segmentation Made More Accurate
Laurynas Miksys
Saumya Jetley
Michael Sapienza
Stuart Golodetz
Philip Torr
3DV
24
7
0
27 May 2019
Transcribing Content from Structural Images with Spotlight Mechanism
Yu Yin
Zhenya Huang
Enhong Chen
Qi Liu
Fuzheng Zhang
Xing Xie
Guoping Hu
19
22
0
27 May 2019
A Survey on Biomedical Image Captioning
Vasiliki Kougia
John Pavlopoulos
Ion Androutsopoulos
MedIm
22
80
0
26 May 2019
Bivariate Beta-LSTM
Kyungwoo Song
Joonho Jang
Seung-Jae Shin
Il-Chul Moon
20
6
0
25 May 2019
Recent Advances in Neural Question Generation
Liangming Pan
Wenqiang Lei
Tat-Seng Chua
Min-Yen Kan
22
116
0
22 May 2019
Image Captioning based on Deep Learning Methods: A Survey
Yiyu Wang
Jungang Xu
Yingfei Sun
Xianpei Han
VLM
18
7
0
20 May 2019
Multimodal Transformer with Multi-View Visual Representation for Image Captioning
Jun-chen Yu
Jing Li
Zhou Yu
Qingming Huang
ViT
27
377
0
20 May 2019
Exact-K Recommendation via Maximal Clique Optimization
Yu Gong
Yu Zhu
Lu Duan
Qingwen Liu
Ziyu Guan
Fei Sun
Wenwu Ou
Kenny Q. Zhu
OffRL
CML
37
59
0
17 May 2019
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations
Fenglin Liu
Yuanxin Liu
Xuancheng Ren
Xiaodong He
Xu Sun
VLM
34
81
0
15 May 2019
CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features
Sangdoo Yun
Dongyoon Han
Seong Joon Oh
Sanghyuk Chun
Junsuk Choe
Y. Yoo
OOD
439
4,705
0
13 May 2019
Exact Adversarial Attack to Image Captioning via Structured Output Learning with Latent Variables
Yan Xu
Baoyuan Wu
Fumin Shen
Yanbo Fan
Yong Zhang
Heng Tao Shen
Wei Liu
AAML
33
55
0
10 May 2019
Memory-Attended Recurrent Network for Video Captioning
Wenjie Pei
Jiyuan Zhang
Xiangrong Wang
Lei Ke
Xiaoyong Shen
Yu-Wing Tai
22
200
0
10 May 2019
Sketch2code: Generating a website from a paper mockup
Alex Robinson
3DV
30
37
0
09 May 2019
ShapeGlot: Learning Language for Shape Differentiation
Panos Achlioptas
Judy Fan
Robert D. Hawkins
Noah D. Goodman
Leonidas J. Guibas
36
83
0
08 May 2019
Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing
Philipp Harzig
D. Zecha
Rainer Lienhart
Carolin Kaiser
René Schallner
19
2
0
06 May 2019
PAL: A Wearable Platform for Real-time, Personalized and Context-Aware Health and Cognition Support
Mina Khan
Glenn Fernandes
U. Sarawgi
Prudhvi Rampey
Pattie Maes
15
9
0
03 May 2019
Impact of Artificial Intelligence on Businesses: from Research, Innovation, Market Deployment to Future Shifts in Business Models
N. Soni
E. Sharma
Narotam Singh
A. Kapoor
24
68
0
03 May 2019
AI-Powered Text Generation for Harmonious Human-Machine Interaction: Current State and Future Directions
Qiuyun Zhang
Bin Guo
Hao Wang
Yunji Liang
Shaoyang Hao
Zhiwen Yu
20
6
0
01 May 2019
Knowing When to Stop: Evaluation and Verification of Conformity to Output-size Specifications
Chenglong Wang
Rudy Bunel
Krishnamurthy Dvijotham
Po-Sen Huang
Edward Grefenstette
Pushmeet Kohli
30
5
0
26 Apr 2019
Pointing Novel Objects in Image Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
33
69
0
25 Apr 2019
3G structure for image caption generation
Aihong Yuan
Xuelong Li
Xiaoqiang Lu
21
34
0
21 Apr 2019
Multi-modal gated recurrent units for image description
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
GAN
21
26
0
20 Apr 2019
Learning to Collocate Neural Modules for Image Captioning
Xu Yang
Hanwang Zhang
Jianfei Cai
27
77
0
18 Apr 2019
Visual Relationship Detection with Language prior and Softmax
Jaewon Jung
Jongyoul Park
27
9
0
16 Apr 2019
Positional Encoding to Control Output Sequence Length
Sho Takase
Naoaki Okazaki
17
110
0
16 Apr 2019
Self-critical n-step Training for Image Captioning
Junlong Gao
Shiqi Wang
Shanshe Wang
Siwei Ma
Wen Gao
22
55
0
15 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
Alex Schwing
Tamir Hazan
27
69
0
11 Apr 2019
Reasoning Visual Dialogs with Structural and Partial Observations
Zilong Zheng
Wenguan Wang
Siyuan Qi
Song-Chun Zhu
39
117
0
11 Apr 2019
UniVSE: Robust Visual Semantic Embeddings via Structured Semantic Representations
Hao Wu
Jiayuan Mao
Yufeng Zhang
Yuning Jiang
Lei Li
Weiwei Sun
Wei-Ying Ma
22
8
0
11 Apr 2019
Streamlined Dense Video Captioning
Jonghwan Mun
L. Yang
Zhou Ren
N. Xu
Bohyung Han
28
137
0
08 Apr 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
32
539
0
06 Apr 2019
Reducing catastrophic forgetting when evolving neural networks
Joseph Early
19
2
0
05 Apr 2019
Clinically Accurate Chest X-Ray Report Generation
Guanxiong Liu
T. Hsu
Matthew B. A. McDermott
Willie Boag
W. Weng
Peter Szolovits
Marzyeh Ghassemi
MedIm
39
271
0
04 Apr 2019
MMED: A Multi-domain and Multi-modality Event Dataset
Zhenguo Yang
Zehang Lin
Min Cheng
Qing Li
Wenyin Liu
31
9
0
04 Apr 2019
Good News, Everyone! Context driven entity-aware captioning for news images
Ali Furkan Biten
Lluís Gómez
Marçal Rusiñol
Dimosthenis Karatzas
27
139
0
02 Apr 2019
HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models
Sharon Zhou
Mitchell L. Gordon
Ranjay Krishna
Austin Narcomey
Li Fei-Fei
Michael S. Bernstein
VLM
EGVM
6
118
0
01 Apr 2019
Describing like humans: on diversity in image captioning
Qingzhong Wang
Antoni B. Chan
27
98
0
28 Mar 2019
Information Maximizing Visual Question Generation
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
23
95
0
27 Mar 2019
Improve Diverse Text Generation by Self Labeling Conditional Variational Auto Encoder
Yuchi Zhang
Yongliang Wang
Liping Zhang
Qing Cui
Kun Gai
22
19
0
26 Mar 2019
Knowledge-driven Encode, Retrieve, Paraphrase for Medical Image Report Generation
Yuan Li
Xiaodan Liang
Zhiting Hu
Eric Xing
MedIm
25
268
0
25 Mar 2019
End-to-End Learning Using Cycle Consistency for Image-to-Caption Transformations
Keisuke Hagiwara
Yusuke Mukuta
Tatsuya Harada
24
0
0
25 Mar 2019
Comparison of State-of-the-Art Deep Learning APIs for Image Multi-Label Classification using Semantic Metrics
Adam Kubany
Shimon Ben Ishay
Ruben-sacha Ohayon
A. Shmilovici
Lior Rokach
Tomer Doitshman
VLM
11
2
0
21 Mar 2019
Neural Sequential Phrase Grounding (SeqGROUND)
Pelin Dogan
Leonid Sigal
Markus Gross
ObjD
33
52
0
18 Mar 2019
Boosted Attention: Leveraging Human Attention for Image Captioning
Shi Chen
Qi Zhao
30
47
0
18 Mar 2019
Effects of padding on LSTMs and CNNs
Dwarampudi Mahidhar Reddy
Subba Reddy
19
96
0
18 Mar 2019
A Weighted Multi-Criteria Decision Making Approach for Image Captioning
Hassan Maleki Galandouz
M. Moghaddam
M. Shamsfard
24
0
0
17 Mar 2019
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement
W. Kool
H. V. Hoof
Max Welling
71
216
0
14 Mar 2019
Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning
Dong-Jin Kim
Jinsoo Choi
Tae-Hyun Oh
In So Kweon
14
84
0
14 Mar 2019
Previous
1
2
3
...
22
23
24
...
39
40
41
Next