2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 4,332 papers shown
Title
Bayesian Prompt Learning for Image-Language Model Generalization
Mohammad Mahdi Derakhshani
Enrique Sanchez
Adrian Bulat
Victor G. Turrisi da Costa
Cees G. M. Snoek
Georgios Tzimiropoulos
Brais Martínez
VPVLM
VLM
97
34
0
05 Oct 2022
clip2latent: Text driven sampling of a pre-trained StyleGAN using denoising diffusion and CLIP
Justin N. M. Pinkney
Chuan Li
CLIP
VLM
54
20
0
05 Oct 2022
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
52
1,479
0
05 Oct 2022
Progressive Text-to-Image Generation
Zhengcong Fei
Mingyuan Fan
Li Zhu
Junshi Huang
89
4
0
05 Oct 2022
When and why vision-language models behave like bags-of-words, and what to do about it?
Mert Yuksekgonul
Federico Bianchi
Pratyusha Kalluri
Dan Jurafsky
James Zou
VLM
CoGe
30
362
0
04 Oct 2022
Contrastive Multimodal Learning for Emergence of Graphical Sensory-Motor Communication
Tristan Karch
Yoann Lemesle
Romain Laroche
Clément Moulin-Frier
Pierre-Yves Oudeyer
37
1
0
03 Oct 2022
Visual Prompt Tuning for Generative Transfer Learning
Kihyuk Sohn
Yuan Hao
José Lezama
Luisa F. Polanía
Huiwen Chang
Han Zhang
Irfan Essa
Lu Jiang
VPVLM
VLM
56
81
0
03 Oct 2022
Membership Inference Attacks Against Text-to-image Generation Models
Yixin Wu
Ning Yu
Zheng Li
Michael Backes
Yang Zhang
DiffM
24
65
0
03 Oct 2022
Red-Teaming the Stable Diffusion Safety Filter
Javier Rando
Daniel Paleka
David Lindner
Lennard Heim
Florian Tramèr
DiffM
132
184
0
03 Oct 2022
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance
Susung Hong
Gyuseong Lee
Wooseok Jang
Seung Wook Kim
DiffM
30
97
0
03 Oct 2022
OCD: Learning to Overfit with Conditional Diffusion Models
Shahar Lutati
Lior Wolf
DiffM
22
8
0
02 Oct 2022
NeRF: Neural Radiance Field in 3D Vision, A Comprehensive Review
K. Gao
Yina Gao
Hongjie He
Dening Lu
Linlin Xu
Jonathan Li
19
47
0
01 Oct 2022
Protein structure generation via folding diffusion
Kevin E. Wu
Kevin Kaichuang Yang
Rianne van den Berg
James Zou
Alex X. Lu
Ava P. Amini
DiffM
35
192
0
30 Sep 2022
TabDDPM: Modelling Tabular Data with Diffusion Models
Akim Kotelnikov
Dmitry Baranchuk
Ivan Rubachev
Artem Babenko
DiffM
75
239
0
30 Sep 2022
AudioGen: Textually Guided Audio Generation
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
27
289
0
30 Sep 2022
Diffusion-based Image Translation using Disentangled Style and Content Representation
Gihyun Kwon
Jong Chul Ye
DiffM
162
155
0
30 Sep 2022
Understanding Pure CLIP Guidance for Voxel Grid NeRF Models
Han-Hung Lee
Angel X. Chang
24
63
0
30 Sep 2022
State-specific protein-ligand complex structure prediction with a multi-scale deep generative model
Zhuoran Qiao
Weili Nie
Arash Vahdat
Thomas F. Miller
Anima Anandkumar
DiffM
36
84
0
30 Sep 2022
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
70
2,319
0
29 Sep 2022
Spotlight: Mobile UI Understanding using Vision-Language Models with a Focus
Gang Li
Yang Li
30
67
0
29 Sep 2022
Human Motion Diffusion Model
Guy Tevet
Sigal Raab
Brian Gordon
Yonatan Shafir
Daniel Cohen-Or
Amit H. Bermano
DiffM
VGen
223
724
0
29 Sep 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffM
VGen
45
1,352
0
29 Sep 2022
Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Huayu Chen
Cheng Lu
Chengyang Ying
Hang Su
Jun Zhu
DiffM
OffRL
105
106
0
29 Sep 2022
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Wenhu Chen
Hexiang Hu
Chitwan Saharia
William W. Cohen
VLM
131
161
0
29 Sep 2022
Compositional Score Modeling for Simulation-based Inference
Tomas Geffner
George Papamakarios
A. Mnih
72
25
0
28 Sep 2022
What Does DALL-E 2 Know About Radiology?
Lisa Christine Adams
Felix Busch
Daniel Truhn
Marcus R. Makowski
Hugo J. W. L. Aerts
Keno K. Bressem
MedIm
39
58
0
27 Sep 2022
Learning to Learn with Generative Models of Neural Network Checkpoints
William S. Peebles
Ilija Radosavovic
Tim Brooks
Alexei A. Efros
Jitendra Malik
UQCV
75
65
0
26 Sep 2022
A Collaborative, Interactive and Context-Aware Drawing Agent for Co-Creative Design
F. Ibarrola
Tomas Lawton
Kazjon Grace
40
13
0
26 Sep 2022
All are Worth Words: A ViT Backbone for Diffusion Models
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
26
318
0
25 Sep 2022
Face Super-Resolution Using Stochastic Differential Equations
Marcelo dos Santos
Rayson Laroca
Rafael O. Ribeiro
João Neves
Hugo Proença
David Menotti
DiffM
26
12
0
24 Sep 2022
A Case Report On The "A.I. Locked-In Problem": social concerns with modern NLP
Yoshija Walter
LLMAG
15
2
0
22 Sep 2022
Implementing and Experimenting with Diffusion Models for Text-to-Image Generation
Robin Zbinden
30
3
0
22 Sep 2022
Deep Lake: a Lakehouse for Deep Learning
S. Hambardzumyan
Abhinav Tuli
Levon Ghukasyan
Fariz Rahman
Hrant Topchyan
...
Mark McQuade
M. Harutyunyan
Tatevik Hakobyan
I. Stranic
Davit Buniatyan
23
17
0
22 Sep 2022
Extremely Simple Activation Shaping for Out-of-Distribution Detection
Andrija Djurisic
Nebojsa Bozanic
Arjun Ashok
Rosanne Liu
OODD
172
151
0
20 Sep 2022
Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis
Lukas Struppek
Dominik Hintersdorf
Felix Friedrich
Manuel Brack
P. Schramowski
Kristian Kersting
76
26
0
19 Sep 2022
Can There be Art Without an Artist?
A. Ghosh
Genoveva Fossas
58
25
0
16 Sep 2022
Does CLIP Know My Face?
Dominik Hintersdorf
Lukas Struppek
Manuel Brack
Felix Friedrich
P. Schramowski
Kristian Kersting
VLM
21
9
0
15 Sep 2022
Brain Imaging Generation with Latent Diffusion Models
W. H. Pinaya
Petru-Daniel Tudosiu
J. Dafflon
P. F. D. Costa
Virginia Fernandez
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
DiffM
MedIm
107
285
0
15 Sep 2022
Soft Diffusion: Score Matching for General Corruptions
Giannis Daras
M. Delbracio
Hossein Talebi
A. Dimakis
P. Milanfar
DiffM
64
106
0
12 Sep 2022
Diffusion Models in Vision: A Survey
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
M. Shah
DiffM
VLM
MedIm
197
1,149
0
10 Sep 2022
ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
Zhengzhe Liu
Peng Dai
Ruihui Li
Xiaojuan Qi
Chi-Wing Fu
DiffM
182
25
0
09 Sep 2022
TEACH: Temporal Action Composition for 3D Humans
Nikos Athanasiou
Mathis Petrovich
Michael J. Black
Gül Varol
89
142
0
09 Sep 2022
Text-Free Learning of a Natural Language Interface for Pretrained Face Generators
Xiaodan Du
Raymond A. Yeh
Nicholas I. Kolkin
Eli Shechtman
Gregory Shakhnarovich
CLIP
29
1
0
08 Sep 2022
Data Feedback Loops: Model-driven Amplification of Dataset Biases
Rohan Taori
Tatsunori B. Hashimoto
74
43
0
08 Sep 2022
FETA: Towards Specializing Foundation Models for Expert Task Applications
Amit Alfassy
Assaf Arbelle
Oshri Halimi
Sivan Harary
Roei Herzig
...
Christoph Auer
Kate Saenko
Peter W. J. Staar
Rogerio Feris
Leonid Karlinsky
23
19
0
08 Sep 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
Xingchao Liu
Chengyue Gong
Qiang Liu
OOD
58
850
0
07 Sep 2022
Statistical Foundation Behind Machine Learning and Its Impact on Computer Vision
Lei Zhang
H. Shum
VLM
SSL
22
2
0
06 Sep 2022
A Survey on Generative Diffusion Model
Hanqun Cao
Cheng Tan
Zhangyang Gao
Yilun Xu
Guangyong Chen
Pheng-Ann Heng
Stan Z. Li
MedIm
37
207
0
06 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Bin Cui
Ming-Hsuan Yang
DiffM
MedIm
224
1,311
0
02 Sep 2022
Zero-Shot Multi-Modal Artist-Controlled Retrieval and Exploration of 3D Object Sets
Kristofer Schlachter
Benjamin Ahlbrand
Zhu Wang
V. Ortenzi
Ken Perlin
DiffM
3DV
14
7
0
01 Sep 2022