Multimodal Image Synthesis and Editing: The Generative AI Era

27 December 2021

Papers citing "Multimodal Image Synthesis and Editing: The Generative AI Era"

50 / 314 papers shown

Title
Neural Discrete Representation Learning Aaron van den Oord Oriol Vinyals Koray Kavukcuoglu BDL SSL OCL 161 4,928 0 02 Nov 2017
Progressive Growing of GANs for Improved Quality, Stability, and Variation Tero Karras Timo Aila S. Laine J. Lehtinen GAN 105 7,318 0 27 Oct 2017
VGGFace2: A dataset for recognising faces across pose and age Qiong Cao Li Shen Weidi Xie Omkar M. Parkhi Andrew Zisserman CVBM 72 2,617 0 23 Oct 2017
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks Han Zhang Tao Xu Hongsheng Li Shaoting Zhang Xiaogang Wang Xiaolei Huang Dimitris N. Metaxas GAN 72 1,055 0 19 Oct 2017
FiLM: Visual Reasoning with a General Conditioning Layer Ethan Perez Florian Strub H. D. Vries Vincent Dumoulin Aaron Courville FAtt AIMat OffRL AI4CE 269 2,178 0 22 Sep 2017
PixelNN: Example-based Image Synthesis Aayush Bansal Yaser Sheikh Deva Ramanan 3DV GAN 48 47 0 17 Aug 2017
Photographic Image Synthesis with Cascaded Refinement Networks Qifeng Chen V. Koltun 58 947 0 28 Jul 2017
Semantic Image Synthesis via Adversarial Learning Hao Dong Simiao Yu Chao Wu Yike Guo GAN 38 265 0 21 Jul 2017
Perceptual Adversarial Networks for Image-to-Image Transformation Chaoyue Wang Chang Xu Chaohui Wang Dacheng Tao GAN 54 362 0 28 Jun 2017
VoxCeleb: a large-scale speaker identification dataset Arsha Nagrani Joon Son Chung Andrew Zisserman 103 2,263 0 26 Jun 2017
Attention Is All You Need Ashish Vaswani Noam M. Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan Gomez Lukasz Kaiser Illia Polosukhin 3DV 453 129,831 0 12 Jun 2017
One-Sided Unsupervised Domain Mapping Sagie Benaim Lior Wolf 56 314 0 02 Jun 2017
Unsupervised Learning of Disentangled Representations from Video Emily L. Denton Vighnesh Birodkar DRL CoGe OOD 71 552 0 31 May 2017
Dilated Residual Networks Feng Yu V. Koltun Thomas Funkhouser MedIm 105 1,617 0 28 May 2017
Pose Guided Person Image Generation Liqian Ma Xu Jia Qianru Sun Bernt Schiele Tinne Tuytelaars Luc Van Gool GAN 70 816 0 25 May 2017
You said that? Joon Son Chung A. Jamaludin Andrew Zisserman CVBM 55 258 0 08 May 2017
Deep Cross-Modal Audio-Visual Generation Lele Chen Sudhanshu Srivastava Z. Duan Chenliang Xu 76 221 0 26 Apr 2017
TAC-GAN - Text Conditioned Auxiliary Classifier Generative Adversarial Network Ayushman Dash J. Gamboa Sheraz Ahmed Marcus Liwicki Muhammad Zeshan Afzal GAN 39 142 0 19 Mar 2017
A Structured Self-attentive Sentence Embedding Zhouhan Lin Minwei Feng Cicero Nogueira dos Santos Mo Yu Bing Xiang Bowen Zhou Yoshua Bengio 106 2,132 0 09 Mar 2017
Learning Word-Like Units from Joint Audio-Visual Analysis David Harwath James R. Glass 46 106 0 25 Jan 2017
Creating A Multi-track Classical Musical Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications Bochen Li Xinzhao Liu K. Dinesh Z. Duan Gaurav Sharma 110 151 0 27 Dec 2016
Learning from Simulated and Unsupervised Images through Adversarial Training A. Shrivastava Tomas Pfister Oncel Tuzel J. Susskind Wenda Wang Russ Webb GAN 74 1,800 0 22 Dec 2016
COCO-Stuff: Thing and Stuff Classes in Context Holger Caesar J. Uijlings V. Ferrari 112 1,377 0 12 Dec 2016
StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks Han Zhang Tao Xu Hongsheng Li Shaoting Zhang Xiaogang Wang Xiaolei Huang Dimitris N. Metaxas GAN 89 2,717 0 10 Dec 2016
Image-to-Image Translation with Conditional Adversarial Networks Phillip Isola Jun-Yan Zhu Tinghui Zhou Alexei A. Efros SSeg 292 19,560 0 21 Nov 2016
Lip Reading Sentences in the Wild Joon Son Chung A. Senior Oriol Vinyals Andrew Zisserman 235 788 0 16 Nov 2016
Categorical Reparameterization with Gumbel-Softmax Eric Jang S. Gu Ben Poole BDL 221 5,323 0 03 Nov 2016
SoundNet: Learning Sound Representations from Unlabeled Video Y. Aytar Carl Vondrick Antonio Torralba SSL 90 1,040 0 27 Oct 2016
Learning What and Where to Draw Scott E. Reed Zeynep Akata S. Mohan Samuel Tenka Bernt Schiele Honglak Lee DRL GAN 54 618 0 08 Oct 2016
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network C. Ledig Lucas Theis Ferenc Huszár Jose Caballero Andrew Cunningham ... Andrew P. Aitken Alykhan Tejani J. Totz Zehan Wang Wenzhe Shi GAN 229 10,646 0 15 Sep 2016
Discrete Variational Autoencoders J. Rolfe BDL DRL 140 255 0 07 Sep 2016
Conditional Image Generation with PixelCNN Decoders Aaron van den Oord Nal Kalchbrenner Oriol Vinyals L. Espeholt Alex Graves Koray Kavukcuoglu VLM 131 2,495 0 16 Jun 2016
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets Xi Chen Yan Duan Rein Houthooft John Schulman Ilya Sutskever Pieter Abbeel GAN 140 4,224 0 12 Jun 2016
Improved Techniques for Training GANs Tim Salimans Ian Goodfellow Wojciech Zaremba Vicki Cheung Alec Radford Xi Chen GAN 377 8,999 0 10 Jun 2016
Adversarially Learned Inference Vincent Dumoulin Ishmael Belghazi Ben Poole Olivier Mastropietro Alex Lamb Martín Arjovsky Aaron Courville GAN 62 1,312 0 02 Jun 2016
Asynchrony begets Momentum, with an Application to Deep Learning Jeff Donahue Philipp Krahenbuhl Stefan Hadjis Christopher Ré 82 1,827 0 31 May 2016
Faster Eigenvector Computation via Shift-and-Invert Preconditioning Dan Garber Laurent Dinh Chi Jin Jascha Narain Sohl-Dickstein Samy Bengio Praneeth Netrapalli Aaron Sidford 185 3,681 0 26 May 2016
Generative Adversarial Text to Image Synthesis Scott E. Reed Zeynep Akata Xinchen Yan Lajanugen Logeswaran Bernt Schiele Honglak Lee GAN 147 3,136 0 17 May 2016
The Cityscapes Dataset for Semantic Urban Scene Understanding Marius Cordts Mohamed Omran Sebastian Ramos Timo Rehfeld Markus Enzweiler Rodrigo Benenson Uwe Franke Stefan Roth Bernt Schiele 691 11,540 0 06 Apr 2016
Perceptual Losses for Real-Time Style Transfer and Super-Resolution Justin Johnson Alexandre Alahi Li Fei-Fei SupR 187 10,202 0 27 Mar 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations Ranjay Krishna Yuke Zhu Oliver Groth Justin Johnson Kenji Hata ... Yannis Kalantidis Li Li David A. Shamma Michael S. Bernstein Fei-Fei Li 170 5,706 0 23 Feb 2016
Discriminative Regularization for Generative Models Alex Lamb Vincent Dumoulin Aaron Courville DRL 70 65 0 09 Feb 2016
Pixel Recurrent Neural Networks Aaron van den Oord Nal Kalchbrenner Koray Kavukcuoglu SSeg GAN 399 2,563 0 25 Jan 2016
Autoencoding beyond pixels using a learned similarity metric Anders Boesen Lindbo Larsen Søren Kaae Sønderby Hugo Larochelle Ole Winther GAN 136 2,061 0 31 Dec 2015
Visually Indicated Sounds Andrew Owens Phillip Isola Josh H. McDermott Antonio Torralba Edward H. Adelson William T. Freeman 74 382 0 28 Dec 2015
Rethinking the Inception Architecture for Computer Vision Christian Szegedy Vincent Vanhoucke Sergey Ioffe Jonathon Shlens Z. Wojna 3DV BDL 497 27,231 0 02 Dec 2015
Generating Images from Captions with Attention Elman Mansimov Emilio Parisotto Jimmy Lei Ba Ruslan Salakhutdinov VLM 76 453 0 09 Nov 2015
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting Xingjian Shi Zhourong Chen Hao Wang Dit-Yan Yeung W. Wong W. Woo 471 7,952 0 13 Jun 2015
Variational Inference with Normalizing Flows Danilo Jimenez Rezende S. Mohamed DRL BDL 258 4,143 0 21 May 2015
Deep Unsupervised Learning using Nonequilibrium Thermodynamics Jascha Narain Sohl-Dickstein Eric A. Weiss Niru Maheswaranathan Surya Ganguli SyDa DiffM 186 6,780 0 12 Mar 2015