Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.11872
Cited By
Vision and Language: from Visual Perception to Content Creation
26 December 2019
Tao Mei
Wei Zhang
Ting Yao
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Vision and Language: from Visual Perception to Content Creation"
24 / 24 papers shown
Title
Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation
Jing Wang
Yingwei Pan
Ting Yao
Jinhui Tang
Tao Mei
VLM
BDL
DiffM
46
36
0
01 Aug 2019
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
Jingwen Chen
Yingwei Pan
Yehao Li
Ting Yao
Hongyang Chao
Tao Mei
49
103
0
03 May 2019
Pointing Novel Objects in Image Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
66
69
0
25 Apr 2019
Joint Discriminative and Generative Learning for Person Re-identification
Zhedong Zheng
Xiaodong Yang
Zhiding Yu
Liang Zheng
Yi Yang
Jan Kautz
GAN
41
750
0
15 Apr 2019
Photo-Realistic Facial Details Synthesis from Single Image
Anpei Chen
Zhaoyu Chen
Guli Zhang
Ziheng Zhang
Kenny Mitchell
Jingyi Yu
3DV
3DH
36
103
0
26 Mar 2019
High-Fidelity Image Generation With Fewer Labels
Mario Lucic
Michael Tschannen
Marvin Ritter
Xiaohua Zhai
Olivier Bachem
Sylvain Gelly
GAN
OOD
77
158
0
06 Mar 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
529
10,527
0
12 Dec 2018
Exploring Visual Relationship for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
74
831
0
19 Sep 2018
Recurrent Fusion Network for Image Captioning
Wenhao Jiang
Lin Ma
Yu-Gang Jiang
Wen Liu
Tong Zhang
ObjD
56
234
0
26 Jul 2018
To Create What You Tell: Generating Videos from Captions
Yingwei Pan
Zhaofan Qiu
Ting Yao
Houqiang Li
Tao Mei
GAN
75
154
0
23 Apr 2018
Image Generation from Scene Graphs
Justin Johnson
Agrim Gupta
Li Fei-Fei
GNN
287
820
0
04 Apr 2018
DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks (with Supplementary Materials)
Shuang Ma
Jianlong Fu
Chang Wen Chen
Tao Mei
GAN
49
154
0
18 Feb 2018
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks
Han Zhang
Tao Xu
Hongsheng Li
Shaoting Zhang
Xiaogang Wang
Xiaolei Huang
Dimitris N. Metaxas
GAN
77
1,057
0
19 Oct 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
111
4,208
0
25 Jul 2017
Pose Guided Person Image Generation
Liqian Ma
Xu Jia
Qianru Sun
Bernt Schiele
Tinne Tuytelaars
Luc Van Gool
GAN
72
816
0
25 May 2017
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
134
1,242
0
02 May 2017
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Jiasen Lu
Caiming Xiong
Devi Parikh
R. Socher
115
1,450
0
06 Dec 2016
Improved Image Captioning via Policy Gradient optimization of SPIDEr
Siqi Liu
Zhenhai Zhu
Ning Ye
S. Guadarrama
Kevin Patrick Murphy
120
446
0
01 Dec 2016
Video Captioning with Transferred Semantic Attributes
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
57
328
0
23 Nov 2016
Boosting Image Captioning with Attributes
Ting Yao
Yingwei Pan
Yehao Li
Zhaofan Qiu
Tao Mei
VLM
83
621
0
05 Nov 2016
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
122
1,167
0
24 Nov 2015
Describing Videos by Exploiting Temporal Structure
L. Yao
Atousa Torabi
Kyunghyun Cho
Nicolas Ballas
C. Pal
Hugo Larochelle
Aaron Courville
139
1,063
0
27 Feb 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
310
10,050
0
10 Feb 2015
Show and Tell: A Neural Image Caption Generator
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
211
6,018
0
17 Nov 2014
1