Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 1,364 papers shown
Title
Multi-Class Segmentation from Aerial Views using Recursive Noise Diffusion
Benedikt Kolbeinsson
K. Mikolajczyk
DiffM
77
13
0
01 Dec 2022
Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation
Haochen Wang
Xiaodan Du
Jiahao Li
Raymond A. Yeh
Gregory Shakhnarovich
DiffM
191
550
0
01 Dec 2022
Shape-Guided Diffusion with Inside-Outside Attention
Dong Huk Park
Grace Luo
C. Toste
S. Azadi
Xihui Liu
M. Karalashvili
Anna Rohrbach
Trevor Darrell
DiffM
103
44
0
01 Dec 2022
One Artist's Personal Reflections on Methods and Ethics of Creating Mixed Media Artificial Intelligence Art
J. Adams
76
1
0
30 Nov 2022
Multiresolution Textual Inversion
Giannis Daras
A. Dimakis
104
33
0
30 Nov 2022
High-Fidelity Guided Image Synthesis with Latent Diffusion Models
Jaskirat Singh
Stephen Gould
Liang Zheng
DiffM
90
42
0
30 Nov 2022
SinDDM: A Single Image Denoising Diffusion Model
Vladimir Kulikov
Shahar Yadin
Matan Kleiner
T. Michaeli
DiffM
78
80
0
29 Nov 2022
DiffPose: Multi-hypothesis Human Pose Estimation using Diffusion models
Karl Holmquist
Bastian Wandt
DiffM
90
64
0
29 Nov 2022
DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model
Gwanghyun Kim
S. Chun
DiffM
89
40
0
29 Nov 2022
Wavelet Diffusion Models are fast and scalable Image Generators
Hao Phung
Quan Dao
Anh Tran
DiffM
97
97
0
29 Nov 2022
Dimensionality-Varying Diffusion Process
Han Zhang
Ruili Feng
Zhantao Yang
Lianghua Huang
Yu Liu
Yifei Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
Fan Cheng
DiffM
49
10
0
29 Nov 2022
Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models
Dongjun Kim
Yeongmin Kim
Se Jung Kwon
Wanmo Kang
Il-Chul Moon
DiffM
120
89
0
28 Nov 2022
Post-training Quantization on Diffusion Models
Yuzhang Shang
Zhihang Yuan
Bin Xie
Bingzhe Wu
Yan Yan
DiffM
MQ
154
182
0
28 Nov 2022
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
160
408
0
28 Nov 2022
Continuous diffusion for categorical data
Sander Dieleman
Laurent Sartran
Arman Roshannai
Nikolay Savinov
Yaroslav Ganin
...
Conor Durkan
Curtis Hawthorne
Rémi Leblond
Will Grathwohl
J. Adler
DiffM
121
106
0
28 Nov 2022
CLIP2GAN: Towards Bridging Text with the Latent Space of GANs
Yixuan Wang
Wen-gang Zhou
Jianmin Bao
Weilun Wang
Li Li
Houqiang Li
GAN
CLIP
62
6
0
28 Nov 2022
DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
Zhengfu He
Tianxiang Sun
Kuan-Chieh Wang
Xuanjing Huang
Xipeng Qiu
DiffM
VLM
101
131
0
28 Nov 2022
Unified Discrete Diffusion for Simultaneous Vision-Language Generation
Minghui Hu
Chuanxia Zheng
Heliang Zheng
Tat-Jen Cham
Chaoyue Wang
Zuopeng Yang
Dacheng Tao
Ponnuthurai Nagaratnam Suganthan
DiffM
131
26
0
27 Nov 2022
Traditional Classification Neural Networks are Good Generators: They are Competitive with DDPMs and GANs
Guangrun Wang
Philip Torr
83
9
0
27 Nov 2022
3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models
Gang Li
Heliang Zheng
Chaoyue Wang
Chang Li
C. Zheng
Dacheng Tao
DiffM
97
60
0
25 Nov 2022
Expanding Small-Scale Datasets with Guided Imagination
Yifan Zhang
Daquan Zhou
Bryan Hooi
Kaixin Wang
Jiashi Feng
171
48
0
25 Nov 2022
Sketch-Guided Text-to-Image Diffusion Models
A. Voynov
Kfir Aberman
Daniel Cohen-Or
DiffM
100
211
0
24 Nov 2022
Shifted Diffusion for Text-to-image Generation
Yufan Zhou
Bingchen Liu
Yizhe Zhu
Xiao Yang
Changyou Chen
Jinhui Xu
DiffM
133
45
0
24 Nov 2022
Ham2Pose: Animating Sign Language Notation into Pose Sequences
Rotem Shalev-Arkushin
Amit Moryossef
Ohad Fried
SLR
85
19
0
24 Nov 2022
Improving dermatology classifiers across populations using images generated by large diffusion models
Luke Sagers
James A. Diao
Matthew Groh
Pranav Rajpurkar
A. Adamson
Arjun K. Manrai
DiffM
MedIm
70
33
0
23 Nov 2022
Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
R. Burgert
Kanchana Ranasinghe
Xiang Li
Michael S. Ryoo
DiffM
VLM
86
38
0
23 Nov 2022
TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation
Nikolai Kalischek
T. Peters
Jan Dirk Wegner
Konrad Schindler
DiffM
61
12
0
23 Nov 2022
ReCo: Region-Controlled Text-to-Image Generation
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Linjie Li
Kevin Qinghong Lin
...
Nan Duan
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
DiffM
105
150
0
23 Nov 2022
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
Tsu-Jui Fu
Licheng Yu
Ning Zhang
Cheng-Yang Fu
Jong-Chyi Su
William Yang Wang
Sean Bell
VGen
146
38
0
23 Nov 2022
RoentGen: Vision-Language Foundation Model for Chest X-ray Generation
Pierre J. Chambon
Christian Blüthgen
Jean-Benoit Delbrouck
Rogier van der Sluijs
M. Polacin
Juan Manuel Zambrano Chaves
Tanishq Mathew Abraham
Shivanshu Purohit
C. Langlotz
Akshay S. Chaudhari
LM&MA
DiffM
MedIm
92
102
0
23 Nov 2022
Retrieval-Augmented Multimodal Language Modeling
Michihiro Yasunaga
Armen Aghajanyan
Weijia Shi
Rich James
J. Leskovec
Percy Liang
M. Lewis
Luke Zettlemoyer
Wen-tau Yih
RALM
104
108
0
22 Nov 2022
EDICT: Exact Diffusion Inversion via Coupled Transformations
Bram Wallace
Akash Gokul
Nikhil Naik
DiffM
107
188
0
22 Nov 2022
Can denoising diffusion probabilistic models generate realistic astrophysical fields?
N. Mudur
D. Finkbeiner
DiffM
58
15
0
22 Nov 2022
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Wenxuan Zhang
Xiaodong Cun
Xuan Wang
Yong Zhang
Xiaodong Shen
Yu-Xiao Guo
Ying Shan
Fei Wang
VGen
99
256
0
22 Nov 2022
SinFusion: Training Diffusion Models on a Single Image or Video
Yaniv Nikankin
Niv Haim
Michal Irani
VGen
106
71
0
21 Nov 2022
Exploring Discrete Diffusion Models for Image Captioning
Zixin Zhu
Yixuan Wei
Jianfeng Wang
Zhe Gan
Zheng Zhang
Le Wang
G. Hua
Lijuan Wang
Zicheng Liu
Han Hu
DiffM
VLM
100
24
0
21 Nov 2022
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models
Ajay Jain
Amber Xie
Pieter Abbeel
DiffM
87
95
0
21 Nov 2022
Video Background Music Generation: Dataset, Method and Evaluation
Le Zhuo
Zhaokai Wang
Baisen Wang
Yue Liao
Chenxi Bao
Stanley Peng
Miao Lu
Xiaobo Li
Fei Fang
Si Liu
VGen
87
31
0
21 Nov 2022
Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training
Ling Yang
Zhilin Huang
Yang Song
Shenda Hong
Ge Li
Wentao Zhang
Tengjiao Wang
Guohao Li
Ming-Hsuan Yang
104
57
0
21 Nov 2022
MagicVideo: Efficient Video Generation With Latent Diffusion Models
Daquan Zhou
Weimin Wang
Hanshu Yan
Weiwei Lv
Yizhe Zhu
Jiashi Feng
DiffM
VGen
129
390
0
20 Nov 2022
DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization
Nisha Huang
Yuxin Zhang
Fan Tang
Chongyang Ma
Haibin Huang
Yong Zhang
Weiming Dong
Changsheng Xu
DiffM
90
44
0
19 Nov 2022
EDGE: Editable Dance Generation From Music
Jo-Han Tseng
Rodrigo Castellon
Chenxi Liu
111
242
0
19 Nov 2022
Magic3D: High-Resolution Text-to-3D Content Creation
Chen-Hsuan Lin
Jun Gao
Luming Tang
Towaki Takikawa
Fangyin Wei
Xun Huang
Karsten Kreis
Sanja Fidler
Ming-Yuan Liu
Nayeon Lee
238
1,167
0
18 Nov 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
215
1,841
0
17 Nov 2022
Conffusion: Confidence Intervals for Diffusion Models
Eliahu Horwitz
Yedid Hoshen
DiffM
81
28
0
17 Nov 2022
DiffusionDet: Diffusion Model for Object Detection
Shoufa Chen
Pei Sun
Yibing Song
Ping Luo
133
473
0
17 Nov 2022
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models
Simon Alexanderson
Rajmund Nagy
Jonas Beskow
G. Henter
DiffM
VGen
79
174
0
17 Nov 2022
GLAMI-1M: A Multilingual Image-Text Fashion Dataset
Vaclav Kosar
A. Hoskovec
Milan Šulc
Radek Bartyzal
VLM
71
3
0
17 Nov 2022
A Stable, Fast, and Fully Automatic Learning Algorithm for Predictive Coding Networks
Tommaso Salvatori
Yuhang Song
Yordan Yordanov
Beren Millidge
Zheng R. Xu
Lei Sha
Cornelius Emde
Rafal Bogacz
Thomas Lukasiewicz
99
13
0
16 Nov 2022
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
153
198
0
15 Nov 2022
Previous
1
2
3
...
23
24
25
26
27
28
Next