Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 4,340 papers shown
Title
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Lukas Höllein
Ang Cao
Andrew Owens
Justin Johnson
Matthias Nießner
DiffM
38
179
0
21 Mar 2023
3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion
Yu-Jhe Li
Tao Xu
Ji Hou
Bichen Wu
Xiaoliang Dai
Albert Pumarola
Peizhao Zhang
Peter Vajda
Kris Kitani
DiffM
38
6
0
21 Mar 2023
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
Yushi Hu
Benlin Liu
Jungo Kasai
Yizhong Wang
Mari Ostendorf
Ranjay Krishna
Noah A. Smith
EGVM
46
213
0
21 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
85
160
0
21 Mar 2023
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Weijia Wu
Yuzhong Zhao
Mike Zheng Shou
Hong Zhou
Chunhua Shen
50
140
0
21 Mar 2023
LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models
Junyi Zhang
Jiaqi Guo
Shizhao Sun
Jian-Guang Lou
Dongmei Zhang
DiffM
21
33
0
21 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH
LM&MA
42
128
0
21 Mar 2023
Zero-1-to-3: Zero-shot One Image to 3D Object
Ruoshi Liu
Rundi Wu
Basile Van Hoorick
P. Tokmakov
Sergey Zakharov
Carl Vondrick
DiffM
29
1,051
0
20 Mar 2023
Localizing Object-level Shape Variations with Text-to-Image Diffusion Models
Or Patashnik
Daniel Garibi
Idan Azuri
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
37
109
0
20 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
52
270
0
20 Mar 2023
Cascaded Latent Diffusion Models for High-Resolution Chest X-ray Synthesis
Tobias Weber
Michael Ingrisch
Bernd Bischl
David Rügamer
DiffM
MedIm
32
25
0
20 Mar 2023
Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models
René Haas
Inbar Huberman-Spiegelglas
Rotem Mulayoff
Stella Graßhof
Sami S. Brandt
T. Michaeli
DiffM
40
39
0
20 Mar 2023
MXM-CLR: A Unified Framework for Contrastive Learning of Multifold Cross-Modal Representations
Ye Wang
Bo‐Shu Jiang
C. Zou
Rui Ma
32
5
0
20 Mar 2023
Object-Centric Slot Diffusion
Jindong Jiang
Fei Deng
Gautam Singh
S. Ahn
DiffM
BDL
OCL
30
57
0
20 Mar 2023
Deep Image Fingerprint: Towards Low Budget Synthetic Image Detection and Model Lineage Analysis
Sergey Sinitsa
Ohad Fried
21
17
0
19 Mar 2023
SKED: Sketch-guided Text-based 3D Editing
Aryan Mikaeili
Or Perel
Mehdi Safaee
Daniel Cohen-Or
Ali Mahdavi-Amiri
DiffM
41
66
0
19 Mar 2023
DialogPaint: A Dialog-based Image Editing Model
Jingxuan Wei
Shiyu Wu
Xin Jiang
Yequan Wang
KELM
DiffM
26
5
0
17 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
53
20
0
17 Mar 2023
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
Jiwen Yu
Yinhuai Wang
Chen Zhao
Guohao Li
Jian Zhang
DiffM
41
169
0
17 Mar 2023
DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery
Chaofan Ma
Yu-Hao Yang
Chen Ju
Feifan Zhang
Jinxian Liu
Yu Wang
Ya Zhang
Yanfeng Wang
DiffM
37
37
0
17 Mar 2023
Denoising Diffusion Autoencoders are Unified Self-supervised Learners
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
DiffM
35
71
0
17 Mar 2023
Instance-Conditioned GAN Data Augmentation for Representation Learning
Pietro Astolfi
Arantxa Casanova
Jakob Verbeek
Pascal Vincent
Adriana Romero Soriano
M. Drozdzal
26
6
0
16 Mar 2023
HIVE: Harnessing Human Feedback for Instructional Visual Editing
Shu Zhen Zhang
Xinyi Yang
Yihao Feng
Can Qin
Chia-Chih Chen
...
Haiquan Wang
Silvio Savarese
Stefano Ermon
Caiming Xiong
Ran Xu
28
105
0
16 Mar 2023
Efficient Diffusion Training via Min-SNR Weighting Strategy
Tiankai Hang
Shuyang Gu
Chen Li
Jianmin Bao
Dong Chen
Han Hu
Xin Geng
B. Guo
30
150
0
16 Mar 2023
Diffusion-HPC: Synthetic Data Generation for Human Mesh Recovery in Challenging Domains
Zhenzhen Weng
Laura Bravo Sánchez
Serena Yeung-Levy
DiffM
35
0
0
16 Mar 2023
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
Chenyang Qi
Xiaodong Cun
Yong Zhang
Chenyang Lei
Xintao Wang
Ying Shan
Qifeng Chen
VGen
42
331
0
16 Mar 2023
P+: Extended Textual Conditioning in Text-to-Image Generation
A. Voynov
Qinghao Chu
Daniel Cohen-Or
Kfir Aberman
VLM
DiffM
51
176
0
16 Mar 2023
Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation
Yi Ma
Huan Yang
Wenjing Wang
Jianlong Fu
Jiaying Liu
25
65
0
16 Mar 2023
DIRE for Diffusion-Generated Image Detection
Zhendong Wang
Jianmin Bao
Wen-gang Zhou
Weilun Wang
Hezhen Hu
Hong Chen
Houqiang Li
27
196
0
16 Mar 2023
StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Model
Zipeng Xu
E. Sangineto
N. Sebe
DiffM
33
13
0
16 Mar 2023
Identifiability Results for Multimodal Contrastive Learning
Imant Daunhawer
Alice Bizeul
Emanuele Palumbo
Alexander Marx
Julia E. Vogt
42
39
0
16 Mar 2023
Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
Lingting Zhu
Xian Liu
Xuanyu Liu
Rui Qian
Ziwei Liu
Lequan Yu
38
115
0
16 Mar 2023
Mimic3D: Thriving 3D-Aware GANs via 3D-to-2D Imitation
Xingyu Chen
Yu Deng
Baoyuan Wang
37
14
0
16 Mar 2023
Stochastic Segmentation with Conditional Categorical Diffusion Models
L. Zbinden
Lars Doorenbos
Theodoros Pissas
Adrian Thomas Huber
Raphael Sznitman
Pablo Márquez-Neila
DiffM
34
30
0
15 Mar 2023
Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels
J. Cross-Zamirski
P. Anand
Guy B. Williams
E. Mouchet
Yinhai Wang
Carola-Bibiane Schönlieb
VLM
DiffM
MedIm
14
8
0
15 Mar 2023
Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion
Inhwa Han
Serin Yang
Taesung Kwon
Jong Chul Ye
DiffM
31
36
0
15 Mar 2023
ResDiff: Combining CNN and Diffusion Model for Image Super-Resolution
Shuyao Shang
Zhengyang Shan
Guangxing Liu
LunQian Wang
XingHua Wang
Zekai Zhang
Jingling Zhang
DiffM
42
83
0
15 Mar 2023
Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models
Suhyeon Lee
Hyungjin Chung
Minyoung Park
Jonghyuk Park
Wi-Sun Ryu
J. C. Ye
DiffM
MedIm
27
44
0
15 Mar 2023
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation
Zhengxiong Luo
Dayou Chen
Yingya Zhang
Yan Huang
Liangsheng Wang
Yujun Shen
Deli Zhao
Jinren Zhou
Tien-Ping Tan
DiffM
VGen
132
309
0
15 Mar 2023
Diffusion Models for Contrast Harmonization of Magnetic Resonance Images
Alicia Durrer
J. Wolleb
Florentin Bieder
T. Sinnecker
Matthias Weigel
Robin Sandkühler
Cristina Granziera
Ozgur Yaldizli
Philippe C. Cattin
MedIm
41
11
0
14 Mar 2023
Editing Implicit Assumptions in Text-to-Image Diffusion Models
Hadas Orgad
Bahjat Kawar
Yonatan Belinkov
DiffM
53
86
0
14 Mar 2023
Edit-A-Video: Single Video Editing with Object-Aware Consistency
Chaehun Shin
Heeseung Kim
Che Hyun Lee
Sang-gil Lee
Sung-Hoon Yoon
DiffM
VGen
111
51
0
14 Mar 2023
Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation
Junyoung Seo
Wooseok Jang
Minseop Kwak
Ines Hyeonsu Kim
Jaehoon Ko
Junho Kim
Jin-Hwa Kim
Jiyoung Lee
Seung Wook Kim
DiffM
46
136
0
14 Mar 2023
Text-to-image Diffusion Models in Generative AI: A Survey
Chenshuang Zhang
Chaoning Zhang
Mengchun Zhang
In So Kweon
VLM
51
267
0
14 Mar 2023
Diffusion Models in NLP: A Survey
Yuansong Zhu
Yu Zhao
DiffM
VLM
MedIm
29
23
0
14 Mar 2023
Prompting AI Art: An Investigation into the Creative Skill of Prompt Engineering
J. Oppenlaender
Rhema Linder
Johanna M. Silvennoinen
21
73
0
13 Mar 2023
Synthesizing Realistic Image Restoration Training Pairs: A Diffusion Approach
Tao Yang
Peiran Ren
Xuansong Xie
Lei Zhang
DiffM
37
15
0
13 Mar 2023
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Fan Bao
Shen Nie
Kaiwen Xue
Chongxuan Li
Shiliang Pu
Yaole Wang
Gang Yue
Yue Cao
Hang Su
Jun Zhu
DiffM
207
151
0
12 Mar 2023
PARASOL: Parametric Style Control for Diffusion Image Synthesis
Gemma Canet Tarrés
Dan Ruta
Tu Bui
John Collomosse
DiffM
47
6
0
11 Mar 2023
DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation
Yueming Lyu
Tianwei Lin
Fu Li
Dongliang He
Jing Dong
Tien-Ping Tan
41
39
0
11 Mar 2023
Previous
1
2
3
...
75
76
77
...
85
86
87
Next