Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.4555
Cited By
Show and Tell: A Neural Image Caption Generator
17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show and Tell: A Neural Image Caption Generator"
50 / 2,023 papers shown
Title
The Role of Syntactic Planning in Compositional Image Captioning
Emanuele Bugliarello
Desmond Elliott
CoGe
36
13
0
28 Jan 2021
Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network
Yehao Li
Yingwei Pan
Ting Yao
Jingwen Chen
Tao Mei
VLM
29
52
0
27 Jan 2021
CPTR: Full Transformer Network for Image Captioning
Wei Liu
Sihan Chen
Longteng Guo
Xinxin Zhu
Jing Liu
ViT
18
141
0
26 Jan 2021
Probability Trajectory: One New Movement Description for Trajectory Prediction
Pei Lv
Hui Wei
Tianxin Gu
Yuzhen Zhang
Xiaoheng Jiang
Bing Zhou
Mingliang Xu
33
0
0
26 Jan 2021
Adversarial Text-to-Image Synthesis: A Review
Stanislav Frolov
Tobias Hinz
Federico Raue
Jörn Hees
Andreas Dengel
EGVM
29
175
0
25 Jan 2021
Macroscopic Control of Text Generation for Image Captioning
Zhangzi Zhu
Tianlei Wang
Hong Qu
31
4
0
20 Jan 2021
ArtEmis: Affective Language for Visual Art
Panos Achlioptas
M. Ovsjanikov
Kilichbek Haydarov
Mohamed Elhoseiny
Leonidas J. Guibas
31
115
0
19 Jan 2021
Diagnostic Captioning: A Survey
John Pavlopoulos
Vasiliki Kougia
Ion Androutsopoulos
D. Papamichail
3DV
MedIm
91
26
0
18 Jan 2021
Understanding in Artificial Intelligence
S. Maetschke
D. M. Iraola
Pieter Barnard
Elaheh Shafieibavani
Peter Zhong
Ying Xu
Antonio Jimeno Yepes
ELM
VLM
24
0
0
17 Jan 2021
Dual-Level Collaborative Transformer for Image Captioning
Yunpeng Luo
Jiayi Ji
Xiaoshuai Sun
Liujuan Cao
Yongjian Wu
Feiyue Huang
Chia-Wen Lin
Rongrong Ji
ViT
19
274
0
16 Jan 2021
CityFlow-NL: Tracking and Retrieval of Vehicles at City Scale by Natural Language Descriptions
Qi Feng
Vitaly Ablavsky
Stan Sclaroff
22
45
0
12 Jan 2021
Comprehensible Convolutional Neural Networks via Guided Concept Learning
Sandareka Wickramanayake
Wynne Hsu
Mong Li Lee
SSL
30
23
0
11 Jan 2021
Unifying Relational Sentence Generation and Retrieval for Medical Image Report Composition
Fuyu Wang
Xiaodan Liang
Lin Xu
Liang Lin
MedIm
34
25
0
09 Jan 2021
Transformers in Vision: A Survey
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
Fahad Shahbaz Khan
M. Shah
ViT
233
2,434
0
04 Jan 2021
Advances in Electron Microscopy with Deep Learning
Jeffrey M. Ede
42
2
0
04 Jan 2021
Searching a Raw Video Database using Natural Language Queries
Sriram Krishna
Siddarth Vinay
S. SrinivasK.
18
0
0
31 Dec 2020
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
37
66
0
31 Dec 2020
Towards Fully Automated Manga Translation
Ryota Hinami
Shonosuke Ishiwatari
K. Yasuda
Yusuke Matsui
51
32
0
28 Dec 2020
Neural Text Generation with Artificial Negative Examples
Keisuke Shirai
Kazuma Hashimoto
Akiko Eriguchi
Takashi Ninomiya
Shinsuke Mori
13
7
0
28 Dec 2020
LCEval: Learned Composite Metric for Caption Evaluation
Naeha Sharif
Lyndon White
Bennamoun
Wei Liu
Syed Afaq Ali Shah
26
8
0
24 Dec 2020
SubICap: Towards Subword-informed Image Captioning
Naeha Sharif
Bennamoun
Wei Liu
Syed Afaq Ali Shah
30
2
0
24 Dec 2020
Image to Bengali Caption Generation Using Deep CNN and Bidirectional Gated Recurrent Unit
Albay Faruk
Hasan Al Faraby
M. M. Azad
Md. Riduyan Fedous
Md. Kishor Morol
17
15
0
22 Dec 2020
Transductive Visual Verb Sense Disambiguation
Sebastiano Vascon
Sinem Aslan
Gianluca Bigaglia
Lorenzo Giudice
Marcello Pelillo
CoGe
9
2
0
20 Dec 2020
AutoCaption: Image Captioning with Neural Architecture Search
Xinxin Zhu
Weining Wang
Longteng Guo
Jing Liu
32
9
0
16 Dec 2020
Intrinsic Image Captioning Evaluation
Chao Zeng
Sam Kwong
21
0
0
14 Dec 2020
Delay Differential Neural Networks
Srinivas Anumasa
P. K. Srijith
27
5
0
12 Dec 2020
Dependency Decomposition and a Reject Option for Explainable Models
Jan Kronenberger
Anselm Haselhoff
FAtt
AAML
32
8
0
11 Dec 2020
A Log-likelihood Regularized KL Divergence for Video Prediction with A 3D Convolutional Variational Recurrent Network
Haziq Razali
Basura Fernando
DRL
24
6
0
11 Dec 2020
Debiased-CAM to mitigate image perturbations with faithful visual explanations of machine learning
Wencan Zhang
Mariella Dimiccoli
Brian Y. Lim
FAtt
34
18
0
10 Dec 2020
Image Captioning with Context-Aware Auxiliary Guidance
Zeliang Song
Xiaofei Zhou
Zhendong Mao
Jianlong Tan
41
31
0
10 Dec 2020
Towards Annotation-Free Evaluation of Cross-Lingual Image Captioning
Aozhu Chen
Xinyi Huang
Hailan Lin
Xirong Li
16
5
0
09 Dec 2020
Robust Image Captioning
Daniel Yarnell
Xian Wang
21
0
0
06 Dec 2020
Understanding Guided Image Captioning Performance across Domains
Edwin G. Ng
Bo Pang
P. Sharma
Radu Soricut
37
24
0
04 Dec 2020
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Dave Zhenyu Chen
A. Gholami
Matthias Nießner
Angel X. Chang
3DPC
23
161
0
03 Dec 2020
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling
Jing Su
Qingyun Dai
Frank Guerin
Mian Zhou
30
24
0
03 Dec 2020
Cross-Modal Retrieval and Synthesis (X-MRS): Closing the Modality Gap in Shared Representation Learning
Ricardo Guerrero
Hai Xuan Pham
Vladimir Pavlovic
14
23
0
02 Dec 2020
Generating Descriptions for Sequential Images with Local-Object Attention and Global Semantic Context Modelling
Jing Su
Chenghua Lin
Mian Zhou
Qingyun Dai
Haoyu Lv
24
2
0
02 Dec 2020
Language-Driven Region Pointer Advancement for Controllable Image Captioning
Annika Lindh
R. Ross
John D. Kelleher
8
13
0
30 Nov 2020
When Machine Learning Meets Privacy: A Survey and Outlook
B. Liu
Ming Ding
Sina shaham
W. Rahayu
F. Farokhi
Zihuai Lin
25
282
0
24 Nov 2020
Watch and Learn: Mapping Language and Noisy Real-world Videos with Self-supervision
Yujie Zhong
Linhai Xie
Sen Wang
Lucia Specia
Yishu Miao
SSL
11
0
0
19 Nov 2020
Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language
Hassan Akbari
Hamid Palangi
Jianwei Yang
Sudha Rao
Asli Celikyilmaz
Roland Fernandez
P. Smolensky
Jianfeng Gao
Shih-Fu Chang
37
3
0
18 Nov 2020
Inspecting state of the art performance and NLP metrics in image-based medical report generation
Pablo Pino
Denis Parra
Pablo Messina
Cecilia Besa
S. Uribe
MedIm
LM&MA
26
8
0
18 Nov 2020
Generating Natural Questions from Images for Multimodal Assistants
Alkesh Patel
Akanksha Bindal
Hadas Kotek
Christopher Klein
Jason D. Williams
VGen
43
7
0
17 Nov 2020
Improving Calibration in Deep Metric Learning With Cross-Example Softmax
Andreas Veit
Kimberly Wilber
20
2
0
17 Nov 2020
Structural and Functional Decomposition for Personality Image Captioning in a Communication Game
Minh-Thu Nguyen
Duy Phung
Minh Hoai
Thien Huu Nguyen
38
4
0
17 Nov 2020
DORB: Dynamically Optimizing Multiple Rewards with Bandits
Ramakanth Pasunuru
Han Guo
Joey Tianyi Zhou
OffRL
32
6
0
15 Nov 2020
Goal-driven Command Recommendations for Analysts
Samarth Aggarwal
Rohin Garg
Abhilasha Sancheti
Bhanu Prakash Reddy Guda
I. Burhanuddin
21
4
0
12 Nov 2020
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze
Ece Takmaz
Sandro Pezzelle
Lisa Beinborn
Raquel Fernández
40
22
0
09 Nov 2020
CapWAP: Captioning with a Purpose
Adam Fisch
Kenton Lee
Ming-Wei Chang
J. Clark
Regina Barzilay
13
11
0
09 Nov 2020
Template Controllable keywords-to-text Generation
Abhijit Mishra
Md. Faisal Mahbub Chowdhury
Sagar Manohar
Dan Gutfreund
Karthik Sankaranarayanan
BDL
38
3
0
07 Nov 2020
Previous
1
2
3
...
14
15
16
...
39
40
41
Next