Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 1,364 papers shown
Title
A generic diffusion-based approach for 3D human pose prediction in the wild
Saeed Saadatnejad
Ali-Ahmad Rasekh
Mohammadreza Mofayezi
Yasamin Medghalchi
Sara Rajabzadeh
Taylor Mordan
Alexandre Alahi
DiffM
90
36
0
11 Oct 2022
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
Chen Henry Wu
Fernando de la Torre
DiffM
112
69
0
11 Oct 2022
GENIE: Higher-Order Denoising Diffusion Solvers
Tim Dockhorn
Arash Vahdat
Karsten Kreis
DiffM
109
114
0
11 Oct 2022
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models
Matthew Baas
Herman Kamper
DiffM
86
8
0
11 Oct 2022
Markup-to-Image Diffusion Models with Scheduled Sampling
Yuntian Deng
Noriyuki Kojima
Alexander M. Rush
DiffM
86
4
0
11 Oct 2022
f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation
Jiatao Gu
Shuangfei Zhai
Yizhe Zhang
Miguel Angel Bautista
J. Susskind
DiffM
103
27
0
10 Oct 2022
What the DAAM: Interpreting Stable Diffusion Using Cross Attention
Raphael Tang
Linqing Liu
Akshat Pandey
Zhiying Jiang
Gefei Yang
K. Kumar
Pontus Stenetorp
Jimmy J. Lin
Ferhan Ture
175
177
0
10 Oct 2022
CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning
Shi-You Xu
VLM
DiffM
90
14
0
10 Oct 2022
Bridging CLIP and StyleGAN through Latent Alignment for Image Editing
Wanfeng Zheng
Qiang Li
Xiaoyan Guo
Pengfei Wan
Zhong-ming Wang
117
14
0
10 Oct 2022
Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains
Pierre J. Chambon
Christian Blüthgen
C. Langlotz
Akshay S. Chaudhari
DiffM
MedIm
LM&MA
61
117
0
09 Oct 2022
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
Wanrong Zhu
An Yan
Yujie Lu
Wenda Xu
Xinze Wang
Miguel P. Eckstein
William Yang Wang
126
36
0
07 Oct 2022
Trustworthiness of Laser-Induced Breakdown Spectroscopy Predictions via Simulation-based Synthetic Data Augmentation and Multitask Learning
Riccardo Finotello
D. L’hermite
Celine Quéré
Benjamin Rouge
M. Tamaazousti
J. Sirven
62
1
0
07 Oct 2022
Efficient Diffusion Models for Vision: A Survey
Anwaar Ulhaq
Naveed Akhtar
MedIm
155
68
0
07 Oct 2022
On Distillation of Guided Diffusion Models
Chenlin Meng
Robin Rombach
Ruiqi Gao
Diederik P. Kingma
Stefano Ermon
Jonathan Ho
Tim Salimans
VLM
DiffM
89
536
0
06 Oct 2022
Content-Based Search for Deep Generative Models
Daohan Lu
Sheng-Yu Wang
Nupur Kumari
Rohan Agarwal
Mia Tang
David Bau
Jun-Yan Zhu
DiffM
SyDa
101
6
0
06 Oct 2022
Novel View Synthesis with Diffusion Models
Daniel Watson
William Chan
Ricardo Martín Brualla
Jonathan Ho
Andrea Tagliasacchi
Mohammad Norouzi
DiffM
152
273
0
06 Oct 2022
DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics
Ivan Kapelyukh
Vitalis Vosylius
Edward Johns
LM&Ro
DiffM
236
148
0
05 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM
VGen
157
396
0
05 Oct 2022
Bayesian Prompt Learning for Image-Language Model Generalization
Mohammad Mahdi Derakhshani
Enrique Sanchez
Adrian Bulat
Victor G. Turrisi da Costa
Cees G. M. Snoek
Georgios Tzimiropoulos
Brais Martínez
VPVLM
VLM
171
37
0
05 Oct 2022
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
181
1,548
0
05 Oct 2022
Progressive Text-to-Image Generation
Zhengcong Fei
Mingyuan Fan
Li Zhu
Junshi Huang
156
4
0
05 Oct 2022
Contrastive Multimodal Learning for Emergence of Graphical Sensory-Motor Communication
Tristan Karch
Yoann Lemesle
Romain Laroche
Clément Moulin-Frier
Pierre-Yves Oudeyer
61
1
0
03 Oct 2022
Visual Prompt Tuning for Generative Transfer Learning
Kihyuk Sohn
Yuan Hao
José Lezama
Luisa F. Polanía
Huiwen Chang
Han Zhang
Irfan Essa
Lu Jiang
VPVLM
VLM
161
89
0
03 Oct 2022
Red-Teaming the Stable Diffusion Safety Filter
Javier Rando
Daniel Paleka
David Lindner
Lennard Heim
Florian Tramèr
DiffM
231
206
0
03 Oct 2022
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance
Susung Hong
Gyuseong Lee
Wooseok Jang
Seung Wook Kim
DiffM
128
105
0
03 Oct 2022
OCD: Learning to Overfit with Conditional Diffusion Models
Shahar Lutati
Lior Wolf
DiffM
75
8
0
02 Oct 2022
Protein structure generation via folding diffusion
Kevin E. Wu
Kevin Kaichuang Yang
Rianne van den Berg
James Zou
Alex X. Lu
Ava P. Amini
DiffM
123
207
0
30 Sep 2022
AudioGen: Textually Guided Audio Generation
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
127
309
0
30 Sep 2022
Diffusion-based Image Translation using Disentangled Style and Content Representation
Gihyun Kwon
Jong Chul Ye
DiffM
238
160
0
30 Sep 2022
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
183
2,445
0
29 Sep 2022
Human Motion Diffusion Model
Guy Tevet
Sigal Raab
Brian Gordon
Yonatan Shafir
Daniel Cohen-Or
Amit H. Bermano
DiffM
VGen
287
771
0
29 Sep 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffM
VGen
97
1,439
0
29 Sep 2022
Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Huayu Chen
Cheng Lu
Chengyang Ying
Hang Su
Jun Zhu
DiffM
OffRL
192
122
0
29 Sep 2022
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Wenhu Chen
Hexiang Hu
Chitwan Saharia
William W. Cohen
VLM
212
178
0
29 Sep 2022
Compositional Score Modeling for Simulation-based Inference
Tomas Geffner
George Papamakarios
A. Mnih
138
30
0
28 Sep 2022
What Does DALL-E 2 Know About Radiology?
Lisa Christine Adams
Felix Busch
Daniel Truhn
Marcus R. Makowski
Hugo J. W. L. Aerts
Keno K. Bressem
MedIm
66
61
0
27 Sep 2022
Learning to Learn with Generative Models of Neural Network Checkpoints
William S. Peebles
Ilija Radosavovic
Tim Brooks
Alexei A. Efros
Jitendra Malik
UQCV
156
69
0
26 Sep 2022
A Collaborative, Interactive and Context-Aware Drawing Agent for Co-Creative Design
F. Ibarrola
Tomas Lawton
Kazjon Grace
83
16
0
26 Sep 2022
All are Worth Words: A ViT Backbone for Diffusion Models
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
185
365
0
25 Sep 2022
A Case Report On The "A.I. Locked-In Problem": social concerns with modern NLP
Yoshija Walter
LLMAG
50
2
0
22 Sep 2022
Implementing and Experimenting with Diffusion Models for Text-to-Image Generation
Robin Zbinden
42
3
0
22 Sep 2022
Deep Lake: a Lakehouse for Deep Learning
S. Hambardzumyan
Abhina Tuli
Levon Ghukasyan
Fariz Rahman
Hrant Topchyan
...
Mark McQuade
M. Harutyunyan
Tatevik Hakobyan
I. Stranic
Davit Buniatyan
90
20
0
22 Sep 2022
Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis
Lukas Struppek
Dominik Hintersdorf
Felix Friedrich
Manuel Brack
P. Schramowski
Kristian Kersting
121
33
0
19 Sep 2022
Can There be Art Without an Artist?
A. Ghosh
Genoveva Fossas
106
25
0
16 Sep 2022
Brain Imaging Generation with Latent Diffusion Models
W. H. Pinaya
Petru-Daniel Tudosiu
J. Dafflon
P. F. D. Costa
Virginia Fernandez
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
DiffM
MedIm
154
305
0
15 Sep 2022
Soft Diffusion: Score Matching for General Corruptions
Giannis Daras
M. Delbracio
Hossein Talebi
A. Dimakis
P. Milanfar
DiffM
147
111
0
12 Sep 2022
Diffusion Models in Vision: A Survey
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
M. Shah
DiffM
VLM
MedIm
363
1,255
0
10 Sep 2022
TEACH: Temporal Action Composition for 3D Humans
Nikos Athanasiou
Mathis Petrovich
Michael J. Black
Gül Varol
157
147
0
09 Sep 2022
Text-Free Learning of a Natural Language Interface for Pretrained Face Generators
Xiaodan Du
Raymond A. Yeh
Nicholas I. Kolkin
Eli Shechtman
Gregory Shakhnarovich
CLIP
59
1
0
08 Sep 2022
Data Feedback Loops: Model-driven Amplification of Dataset Biases
Rohan Taori
Tatsunori B. Hashimoto
124
48
0
08 Sep 2022
Previous
1
2
3
...
25
26
27
28
Next