Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 4,340 papers shown
Title
Error Bounds for Flow Matching Methods
Joe Benton
George Deligiannidis
Arnaud Doucet
DiffM
46
33
0
26 May 2023
Improved Visual Story Generation with Adaptive Context Modeling
Zhangyin Feng
Yuchen Ren
Xinmiao Yu
Xiaocheng Feng
Duyu Tang
Shuming Shi
Bing Qin
DiffM
57
14
0
26 May 2023
Negative-prompt Inversion: Fast Image Inversion for Editing with Text-guided Diffusion Models
Daiki Miyake
Akihiro Iohara
Yuriko Saito
Toshiyuki Tanaka
DiffM
21
114
0
26 May 2023
Data-Driven Optimization for Deposition with Degradable Tools
Tony Zheng
Monimoy Bujarbaruah
Francesco Borrelli
48
0
0
26 May 2023
LANISTR: Multimodal Learning from Structured and Unstructured Data
Sayna Ebrahimi
Sercan O. Arik
Yihe Dong
Tomas Pfister
25
4
0
26 May 2023
Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability
Haotian Xue
Alexandre Araujo
Bin Hu
Yongxin Chen
DiffM
64
43
0
25 May 2023
ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image
Zhenzhen Weng
Zeyu Wang
S. Yeung
DiffM
33
20
0
25 May 2023
Are Diffusion Models Vision-And-Language Reasoners?
Benno Krojer
Elinor Poole-Dayan
Vikram S. Voleti
Christopher Pal
Siva Reddy
50
14
0
25 May 2023
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Omri Avrahami
Kfir Aberman
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
VLM
DiffM
46
165
0
25 May 2023
Securing Deep Generative Models with Universal Adversarial Signature
Yu Zeng
Mo Zhou
Yuan Xue
Vishal M. Patel
WIGM
26
10
0
25 May 2023
Imitating Task and Motion Planning with Visuomotor Transformers
Murtaza Dalal
Ajay Mandlekar
Caelan Reed Garrett
Ankur Handa
Ruslan Salakhutdinov
Dieter Fox
71
54
0
25 May 2023
CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion
Guangyao Zhai
Evin Pınar Örnek
Shun-cheng Wu
Yan Di
F. Tombari
Nassir Navab
Benjamin Busam
DiffM
40
12
0
25 May 2023
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Ying Fan
Olivia Watkins
Yuqing Du
Hao Liu
Moonkyung Ryu
Craig Boutilier
Pieter Abbeel
Mohammad Ghavamzadeh
Kangwook Lee
Kimin Lee
59
138
0
25 May 2023
Trans-Dimensional Generative Modeling via Jump Diffusion Models
Andrew Campbell
William Harvey
Christian D. Weilbach
Valentin De Bortoli
Tom Rainforth
Arnaud Doucet
DiffM
50
11
0
25 May 2023
ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models
Yuxin Zhang
Weiming Dong
Fan Tang
Nisha Huang
Haibin Huang
Chongyang Ma
Tong-Yee Lee
Oliver Deussen
Changsheng Xu
DiffM
40
75
0
25 May 2023
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models
Xingqian Xu
Jiayi Guo
Zhangyang Wang
Gao Huang
Irfan Essa
Humphrey Shi
VLM
DiffM
45
57
0
25 May 2023
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation
Zhengyi Wang
Cheng Lu
Yikai Wang
Fan Bao
Chongxuan Li
Hang Su
Jun Zhu
DiffM
91
825
0
25 May 2023
Robust Category-Level 3D Pose Estimation from Synthetic Data
Jiahao Yang
Wufei Ma
Angtian Wang
Xiaoding Yuan
Alan Yuille
Adam Kortylewski
29
2
0
25 May 2023
GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes
Ibrahim Ethem Hamamci
Sezgin Er
Anjany Sekuboyina
Enis Simsar
A. Tezcan
...
Hadrien Reynaud
Sarthak Pati
Christian Bluethgen
M. K. Özdemir
Bjoern Menze
DiffM
MedIm
50
17
0
25 May 2023
Detecting Adversarial Data by Probing Multiple Perturbations Using Expected Perturbation Score
Shuhai Zhang
Feng Liu
Jiahao Yang
Yifan Yang
Changsheng Li
Bo Han
Mingkui Tan
DiffM
AAML
42
17
0
25 May 2023
DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
Sitian Shen
Zilin Zhu
Linqian Fan
Harry Zhang
Xinxiao Wu
DiffM
45
27
0
25 May 2023
Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3)
Tsu-Ching Hsiao
Haoming Chen
Hsuan-Kung Yang
Chun-Yi Lee
DiffM
28
7
0
25 May 2023
DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion
Haram Choi
Sang-Hoon Lee
Seong-Whan Lee
DiffM
18
27
0
25 May 2023
Towards Language-guided Interactive 3D Generation: LLMs as Layout Interpreter with Generative Feedback
Yiqi Lin
Hao Wu
Ruichen Wang
H. Lu
Xiaodong Lin
Hui Xiong
Lin Wang
3DV
48
12
0
25 May 2023
Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models
Jooyoung Choi
Yunjey Choi
Yunji Kim
Junho Kim
Sung-Hoon Yoon
DiffM
41
52
0
25 May 2023
Zero-shot Generation of Training Data with Denoising Diffusion Probabilistic Model for Handwritten Chinese Character Recognition
Dongnan Gui
Kai Chen
Haisong Ding
Qiang Huo
VLM
DiffM
40
14
0
25 May 2023
Debias Coarsely, Sample Conditionally: Statistical Downscaling through Optimal Transport and Probabilistic Diffusion Models
Z. Y. Wan
Ricardo Baptista
Yi-fan Chen
John R. Anderson
Anudhyan Boral
Fei Sha
Leonardo Zepeda-Núñez
DiffM
52
24
0
24 May 2023
Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps
Mingxiao Li
Tingyu Qu
Ruicong Yao
Wei Sun
Marie-Francine Moens
DiffM
47
40
0
24 May 2023
Unsupervised Semantic Correspondence Using Stable Diffusion
Eric Hedlin
Gopal Sharma
Shweta Mahajan
Hossam N. Isack
Abhishek Kar
Andrea Tagliasacchi
K. M. Yi
DiffM
52
87
0
24 May 2023
Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets
Brandon Smith
Miguel Farinha
S. Hall
Hannah Rose Kirk
Aleksandar Shtedritski
Max Bain
49
19
0
24 May 2023
Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape
Rundi Wu
Ruoshi Liu
Carl Vondrick
Changxi Zheng
DiffM
50
24
0
24 May 2023
A Neural Space-Time Representation for Text-to-Image Personalization
Yuval Alaluf
Elad Richardson
G. Metzer
Daniel Cohen-Or
DiffM
53
94
0
24 May 2023
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution
Yi Ma
Huan Yang
Wenhan Yang
Jianlong Fu
Jiaying Liu
DiffM
30
7
0
24 May 2023
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence
Junyi Zhang
Charles Herrmann
Junhwa Hur
Luisa Polania Cabrera
Varun Jampani
Deqing Sun
Ming-Hsuan Yang
DiffM
44
172
0
24 May 2023
Visual Programming for Text-to-Image Generation and Evaluation
Jaemin Cho
Abhaysinh Zala
Joey Tianyi Zhou
MLLM
43
50
0
24 May 2023
Training on Thin Air: Improve Image Classification with Generated Data
Yongchao Zhou
Hshmat Sahak
Jimmy Ba
DiffM
24
44
0
24 May 2023
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Marco Bellagente
Manuel Brack
H. Teufel
Felix Friedrich
Bjorn Deiseroth
...
Koen Oostermeijer
Andres Felipe Cruz Salinas
P. Schramowski
Kristian Kersting
Samuel Weinbach
47
16
0
24 May 2023
L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors
Zheng Chang
Shuchen Weng
Pei Zhang
Yu Li
Si Li
Boxin Shi
DiffM
21
7
0
24 May 2023
DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models
Sungnyun Kim
Junsoo Lee
Kibeom Hong
Daesik Kim
Namhyuk Ahn
DiffM
40
14
0
24 May 2023
Transferring Visual Attributes from Natural Language to Verified Image Generation
Rodrigo Valerio
João Bordalo
Michal Yarom
Yonattan Bitton
Idan Szpektor
João Magalhães
41
5
0
24 May 2023
In-Context Impersonation Reveals Large Language Models' Strengths and Biases
Leonard Salewski
Stephan Alaniz
Isabel Rio-Torto
Eric Schulz
Zeynep Akata
49
151
0
24 May 2023
Text encoders bottleneck compositionality in contrastive vision-language models
Amita Kamath
Jack Hessel
Kai-Wei Chang
CoGe
CLIP
VLM
37
19
0
24 May 2023
HARD: Hard Augmentations for Robust Distillation
Arne F. Nix
Max F. Burg
Fabian H. Sinz
AAML
44
1
0
24 May 2023
ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation
Dongxu Yue
Qin Guo
Munan Ning
Jiaxi Cui
Yuesheng Zhu
Liuliang Yuan
DiffM
37
11
0
24 May 2023
I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors
Tuhin Chakrabarty
Arkadiy Saakyan
Olivia Winn
Artemis Panagopoulou
Yue Yang
Marianna Apidianaki
Smaranda Muresan
DiffM
38
41
0
24 May 2023
BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing
Dongxu Li
Junnan Li
Steven C. H. Hoi
42
305
0
24 May 2023
Segmented Recurrent Transformer: An Efficient Sequence-to-Sequence Model
Yinghan Long
Sayeed Shafayet Chowdhury
Kaushik Roy
51
1
0
24 May 2023
Optimal Linear Subspace Search: Learning to Construct Fast and High-Quality Schedulers for Diffusion Models
Zhongjie Duan
Chengyu Wang
Cen Chen
Jun Huang
Weining Qian
DiffM
27
12
0
24 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
46
6
0
24 May 2023
Video Prediction Models as Rewards for Reinforcement Learning
Alejandro Escontrela
Ademi Adeniji
Wilson Yan
Ajay Jain
Xue Bin Peng
Ken Goldberg
Youngwoon Lee
Danijar Hafner
Pieter Abbeel
42
55
0
23 May 2023
Previous
1
2
3
...
68
69
70
...
85
86
87
Next