Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.13592
Cited By
Multimodal Image Synthesis and Editing: The Generative AI Era
27 December 2021
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multimodal Image Synthesis and Editing: The Generative AI Era"
50 / 314 papers shown
Title
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
161
4,928
0
02 Nov 2017
Progressive Growing of GANs for Improved Quality, Stability, and Variation
Tero Karras
Timo Aila
S. Laine
J. Lehtinen
GAN
105
7,318
0
27 Oct 2017
VGGFace2: A dataset for recognising faces across pose and age
Qiong Cao
Li Shen
Weidi Xie
Omkar M. Parkhi
Andrew Zisserman
CVBM
72
2,617
0
23 Oct 2017
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks
Han Zhang
Tao Xu
Hongsheng Li
Shaoting Zhang
Xiaogang Wang
Xiaolei Huang
Dimitris N. Metaxas
GAN
72
1,055
0
19 Oct 2017
FiLM: Visual Reasoning with a General Conditioning Layer
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
269
2,178
0
22 Sep 2017
PixelNN: Example-based Image Synthesis
Aayush Bansal
Yaser Sheikh
Deva Ramanan
3DV
GAN
48
47
0
17 Aug 2017
Photographic Image Synthesis with Cascaded Refinement Networks
Qifeng Chen
V. Koltun
58
947
0
28 Jul 2017
Semantic Image Synthesis via Adversarial Learning
Hao Dong
Simiao Yu
Chao Wu
Yike Guo
GAN
38
265
0
21 Jul 2017
Perceptual Adversarial Networks for Image-to-Image Transformation
Chaoyue Wang
Chang Xu
Chaohui Wang
Dacheng Tao
GAN
54
362
0
28 Jun 2017
VoxCeleb: a large-scale speaker identification dataset
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
103
2,263
0
26 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
453
129,831
0
12 Jun 2017
One-Sided Unsupervised Domain Mapping
Sagie Benaim
Lior Wolf
56
314
0
02 Jun 2017
Unsupervised Learning of Disentangled Representations from Video
Emily L. Denton
Vighnesh Birodkar
DRL
CoGe
OOD
71
552
0
31 May 2017
Dilated Residual Networks
Feng Yu
V. Koltun
Thomas Funkhouser
MedIm
105
1,617
0
28 May 2017
Pose Guided Person Image Generation
Liqian Ma
Xu Jia
Qianru Sun
Bernt Schiele
Tinne Tuytelaars
Luc Van Gool
GAN
70
816
0
25 May 2017
You said that?
Joon Son Chung
A. Jamaludin
Andrew Zisserman
CVBM
55
258
0
08 May 2017
Deep Cross-Modal Audio-Visual Generation
Lele Chen
Sudhanshu Srivastava
Z. Duan
Chenliang Xu
76
221
0
26 Apr 2017
TAC-GAN - Text Conditioned Auxiliary Classifier Generative Adversarial Network
Ayushman Dash
J. Gamboa
Sheraz Ahmed
Marcus Liwicki
Muhammad Zeshan Afzal
GAN
39
142
0
19 Mar 2017
A Structured Self-attentive Sentence Embedding
Zhouhan Lin
Minwei Feng
Cicero Nogueira dos Santos
Mo Yu
Bing Xiang
Bowen Zhou
Yoshua Bengio
106
2,132
0
09 Mar 2017
Learning Word-Like Units from Joint Audio-Visual Analysis
David Harwath
James R. Glass
46
106
0
25 Jan 2017
Creating A Multi-track Classical Musical Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications
Bochen Li
Xinzhao Liu
K. Dinesh
Z. Duan
Gaurav Sharma
110
151
0
27 Dec 2016
Learning from Simulated and Unsupervised Images through Adversarial Training
A. Shrivastava
Tomas Pfister
Oncel Tuzel
J. Susskind
Wenda Wang
Russ Webb
GAN
74
1,800
0
22 Dec 2016
COCO-Stuff: Thing and Stuff Classes in Context
Holger Caesar
J. Uijlings
V. Ferrari
112
1,377
0
12 Dec 2016
StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks
Han Zhang
Tao Xu
Hongsheng Li
Shaoting Zhang
Xiaogang Wang
Xiaolei Huang
Dimitris N. Metaxas
GAN
89
2,717
0
10 Dec 2016
Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
SSeg
292
19,560
0
21 Nov 2016
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
235
788
0
16 Nov 2016
Categorical Reparameterization with Gumbel-Softmax
Eric Jang
S. Gu
Ben Poole
BDL
221
5,323
0
03 Nov 2016
SoundNet: Learning Sound Representations from Unlabeled Video
Y. Aytar
Carl Vondrick
Antonio Torralba
SSL
90
1,040
0
27 Oct 2016
Learning What and Where to Draw
Scott E. Reed
Zeynep Akata
S. Mohan
Samuel Tenka
Bernt Schiele
Honglak Lee
DRL
GAN
54
618
0
08 Oct 2016
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
C. Ledig
Lucas Theis
Ferenc Huszár
Jose Caballero
Andrew Cunningham
...
Andrew P. Aitken
Alykhan Tejani
J. Totz
Zehan Wang
Wenzhe Shi
GAN
229
10,646
0
15 Sep 2016
Discrete Variational Autoencoders
J. Rolfe
BDL
DRL
140
255
0
07 Sep 2016
Conditional Image Generation with PixelCNN Decoders
Aaron van den Oord
Nal Kalchbrenner
Oriol Vinyals
L. Espeholt
Alex Graves
Koray Kavukcuoglu
VLM
131
2,495
0
16 Jun 2016
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
Xi Chen
Yan Duan
Rein Houthooft
John Schulman
Ilya Sutskever
Pieter Abbeel
GAN
140
4,224
0
12 Jun 2016
Improved Techniques for Training GANs
Tim Salimans
Ian Goodfellow
Wojciech Zaremba
Vicki Cheung
Alec Radford
Xi Chen
GAN
377
8,999
0
10 Jun 2016
Adversarially Learned Inference
Vincent Dumoulin
Ishmael Belghazi
Ben Poole
Olivier Mastropietro
Alex Lamb
Martín Arjovsky
Aaron Courville
GAN
62
1,312
0
02 Jun 2016
Asynchrony begets Momentum, with an Application to Deep Learning
Jeff Donahue
Philipp Krahenbuhl
Stefan Hadjis
Christopher Ré
82
1,827
0
31 May 2016
Faster Eigenvector Computation via Shift-and-Invert Preconditioning
Dan Garber
Laurent Dinh
Chi Jin
Jascha Narain Sohl-Dickstein
Samy Bengio
Praneeth Netrapalli
Aaron Sidford
185
3,681
0
26 May 2016
Generative Adversarial Text to Image Synthesis
Scott E. Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
GAN
147
3,136
0
17 May 2016
The Cityscapes Dataset for Semantic Urban Scene Understanding
Marius Cordts
Mohamed Omran
Sebastian Ramos
Timo Rehfeld
Markus Enzweiler
Rodrigo Benenson
Uwe Franke
Stefan Roth
Bernt Schiele
691
11,540
0
06 Apr 2016
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Justin Johnson
Alexandre Alahi
Li Fei-Fei
SupR
187
10,202
0
27 Mar 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
170
5,706
0
23 Feb 2016
Discriminative Regularization for Generative Models
Alex Lamb
Vincent Dumoulin
Aaron Courville
DRL
70
65
0
09 Feb 2016
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
399
2,563
0
25 Jan 2016
Autoencoding beyond pixels using a learned similarity metric
Anders Boesen Lindbo Larsen
Søren Kaae Sønderby
Hugo Larochelle
Ole Winther
GAN
136
2,061
0
31 Dec 2015
Visually Indicated Sounds
Andrew Owens
Phillip Isola
Josh H. McDermott
Antonio Torralba
Edward H. Adelson
William T. Freeman
74
382
0
28 Dec 2015
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
497
27,231
0
02 Dec 2015
Generating Images from Captions with Attention
Elman Mansimov
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
VLM
76
453
0
09 Nov 2015
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
471
7,952
0
13 Jun 2015
Variational Inference with Normalizing Flows
Danilo Jimenez Rezende
S. Mohamed
DRL
BDL
258
4,143
0
21 May 2015
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Narain Sohl-Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
SyDa
DiffM
186
6,780
0
12 Mar 2015
Previous
1
2
3
4
5
6
7
Next