Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 4,340 papers shown
Title
Text2Performer: Text-Driven Human Video Generation
Yuming Jiang
Shuai Yang
Tong Liang Koh
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
VGen
51
48
0
17 Apr 2023
Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation
Jie An
Songyang Zhang
Harry Yang
Sonal Gupta
Jia-Bin Huang
Jiebo Luo
Xiaoyue Yin
DiffM
VGen
38
108
0
17 Apr 2023
Synthetic Data from Diffusion Models Improves ImageNet Classification
Shekoofeh Azizi
Simon Kornblith
Chitwan Saharia
Mohammad Norouzi
David J. Fleet
VLM
DiffM
45
298
0
17 Apr 2023
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
Ming Cao
Xintao Wang
Zhongang Qi
Ying Shan
Xiaohu Qie
Yinqiang Zheng
DiffM
42
432
0
17 Apr 2023
The Design Space of Generative Models
Meredith Ringel Morris
Carrie J. Cai
J. Holbrook
Chinmay Kulkarni
Michael Terry
3DV
21
28
0
15 Apr 2023
A Comparative Study on Generative Models for High Resolution Solar Observation Imaging
Mehdi Cherti
Alexander Czernik
Stefan Kesselheim
F. Effenberger
J. Jitsev
DiffM
33
0
0
14 Apr 2023
Delta Denoising Score
Amir Hertz
Kfir Aberman
Daniel Cohen-Or
DiffM
40
91
0
14 Apr 2023
One-Shot Stylization for Full-Body Human Images
Aiyu Cui
Svetlana Lazebnik
3DH
35
0
0
14 Apr 2023
AutoSplice: A Text-prompt Manipulated Image Dataset for Media Forensics
Shan Jia
Mingzhen Huang
Zhou Zhou
Yan Ju
Jialing Cai
Siwei Lyu
DiffM
34
29
0
14 Apr 2023
On the Opportunities and Challenges of Foundation Models for Geospatial Artificial Intelligence
Gengchen Mai
Weiming Huang
Jin Sun
Suhang Song
Deepak Mishra
...
Yingjie Hu
Chris Cundy
Ziyuan Li
Rui Zhu
Ni Lao
AI4CE
40
124
0
13 Apr 2023
Inpaint Anything: Segment Anything Meets Image Inpainting
Tao Yu
Runsen Feng
Ruoyu Feng
Jinming Liu
Xin Jin
Wenjun Zeng
Zhibo Chen
DiffM
53
213
0
13 Apr 2023
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
81
79
0
13 Apr 2023
Control3Diff: Learning Controllable 3D Diffusion Models from Single-view Images
Jiatao Gu
Qingzhe Gao
Shuangfei Zhai
Baoquan Chen
Lingjie Liu
J. Susskind
46
29
0
13 Apr 2023
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Jaemin Cho
Linjie Li
Zhengyuan Yang
Zhe Gan
Lijuan Wang
Joey Tianyi Zhou
EGVM
16
5
0
13 Apr 2023
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning
Enze Xie
Lewei Yao
Han Shi
Zhili Liu
Daquan Zhou
Zhaoqiang Liu
Jiawei Li
Zhenguo Li
36
77
0
13 Apr 2023
Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation
Mohit Sharma
Claudio Fantacci
Yuxiang Zhou
Skanda Koppula
N. Heess
Jonathan Scholz
Y. Aytar
VLM
55
29
0
13 Apr 2023
An Edit Friendly DDPM Noise Space: Inversion and Manipulations
Inbar Huberman-Spiegelglas
Vladimir Kulikov
T. Michaeli
DiffM
15
142
0
12 Apr 2023
DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion
J. Karras
Aleksander Holynski
Ting-Chun Wang
Ira Kemelmacher-Shlizerman
DiffM
VGen
35
138
0
12 Apr 2023
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
Jiazheng Xu
Xiao Liu
Yuchen Wu
Yuxuan Tong
Qinkai Li
Ming Ding
Jie Tang
Yuxiao Dong
63
327
0
12 Apr 2023
SpectralDiff: A Generative Framework for Hyperspectral Image Classification with Diffusion Models
Ning Chen
Jun Yue
Leyuan Fang
Shaobo Xia
DiffM
33
58
0
12 Apr 2023
Exploring Diffusion Models for Unsupervised Video Anomaly Detection
Anil Osman Tur
Nicola Dall’Asen
Cigdem Beyan
Elisa Ricci
DiffM
VGen
42
34
0
12 Apr 2023
Gradient-Free Textual Inversion
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
38
31
0
12 Apr 2023
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Jiabao Ji
Guanhua Zhang
Zhaowen Wang
Bairu Hou
Zhifei Zhang
Brian L. Price
Shiyu Chang
DiffM
38
29
0
12 Apr 2023
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
Eslam Mohamed Bakr
Pengzhan Sun
Xiaoqian Shen
Faizan Farooq Khan
Li Erran Li
Mohamed Elhoseiny
VLM
32
77
0
11 Apr 2023
Diffusion Models for Constrained Domains
N. Fishman
Leo Klarner
Valentin De Bortoli
Emile Mathieu
M. Hutchinson
DiffM
33
35
0
11 Apr 2023
Controllable Textual Inversion for Personalized Text-to-Image Generation
Jianan Yang
Haobo Wang
Yanming Zhang
Rui Xiao
Sai Wu
Gang Chen
Jiaqi Zhao
DiffM
32
12
0
11 Apr 2023
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Mohammadreza Armandpour
A. Sadeghian
Huangjie Zheng
Amir Sadeghian
Mingyuan Zhou
DiffM
20
123
0
11 Apr 2023
Binary Latent Diffusion
Ze Wang
Jiang Wang
Zicheng Liu
Qiang Qiu
37
13
0
10 Apr 2023
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen
Aston Zhang
Mu Li
Alexander J. Smola
Diyi Yang
DiffM
32
17
0
10 Apr 2023
Reflected Diffusion Models
Aaron Lou
Stefano Ermon
34
51
0
10 Apr 2023
EKILA: Synthetic Media Provenance and Attribution for Generative Art
Kar Balan
S. Agarwal
Simon Jenni
Andy Parsons
Andrew Gilbert
John Collomosse
30
12
0
10 Apr 2023
DDRF: Denoising Diffusion Model for Remote Sensing Image Fusion
Zihan Cao
Shiqi Cao
Xiao Wu
Junming Hou
Ran Ran
Liang-Jian Deng
DiffM
40
15
0
10 Apr 2023
Defense-Prefix for Preventing Typographic Attacks on CLIP
Hiroki Azuma
Yusuke Matsui
VLM
AAML
20
18
0
10 Apr 2023
BerDiff: Conditional Bernoulli Diffusion Model for Medical Image Segmentation
Tao Chen
Chenhui Wang
Hongming Shan
DiffM
MedIm
23
34
0
10 Apr 2023
Leveraging Neural Representations for Audio Manipulation
Scott H. Hawley
C. Steinmetz
41
2
0
10 Apr 2023
Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models
Nikita Starodubcev
Dmitry Baranchuk
Valentin Khrulkov
Artem Babenko
DiffM
51
4
0
10 Apr 2023
ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes
Ran Gong
Jiangyong Huang
Yizhou Zhao
Haoran Geng
Xiaofeng Gao
...
Ziheng Zhou
D. Terzopoulos
Song-Chun Zhu
Baoxiong Jia
Siyuan Huang
LM&Ro
50
45
0
09 Apr 2023
A Comprehensive Survey on Knowledge Distillation of Diffusion Models
Weijian Luo
DiffM
MedIm
57
33
0
09 Apr 2023
Hi Sheldon! Creating Deep Personalized Characters from TV Shows
Meidai Xuanyuan
Yuwang Wang
Honglei Guo
Xiao Ma
Yuchen Guo
Tao Yu
Qionghai Dai
VGen
33
0
0
09 Apr 2023
Deep Generative Modeling with Backward Stochastic Differential Equations
Xingcheng Xu
PINN
32
0
0
08 Apr 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu
Yujian Liu
Handong Zhao
T. Bui
Zhe Lin
Yang Zhang
Shiyu Chang
DiffM
47
45
0
07 Apr 2023
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Jing Shi
Wei Xiong
Zhe Lin
H. J. Jung
DiffM
133
281
0
06 Apr 2023
RoSteALS: Robust Steganography using Autoencoder Latent Space
Tu Bui
Shrutina Agarwal
Ning Yu
John Collomosse
37
39
0
06 Apr 2023
Training-Free Layout Control with Cross-Attention Guidance
Minghao Chen
Iro Laina
Andrea Vedaldi
DiffM
135
223
0
06 Apr 2023
Diffusion Models as Masked Autoencoders
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffM
SyDa
36
49
0
06 Apr 2023
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
Ahmet Burak Yildirim
Vedat Baday
Erkut Erdem
Aykut Erdem
Aysegül Dündar
DiffM
35
61
0
06 Apr 2023
Advances in Data-Driven Analysis and Synthesis of 3D Indoor Scenes
A. Patil
Supriya Gadi Patil
Manyi Li
Matthew Fisher
Manolis Savva
Haotong Zhang
3DV
37
17
0
06 Apr 2023
Uncurated Image-Text Datasets: Shedding Light on Demographic Bias
Noa Garcia
Yusuke Hirota
Yankun Wu
Yuta Nakashima
EGVM
51
52
0
06 Apr 2023
DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model
H. Seo
Hayeon Kim
Gwanghyun Kim
S. Chun
DiffM
21
40
0
06 Apr 2023
Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models
Xuhui Jia
Yang Zhao
Kelvin C. K. Chan
Yandong Li
Han-Ying Zhang
Boqing Gong
Tingbo Hou
Haoran Wang
Yu-Chuan Su
DiffM
35
100
0
05 Apr 2023
Previous
1
2
3
...
72
73
74
...
85
86
87
Next