arXiv: 2205.11487
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 4,340 papers shown
Title
DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic Models
Gyeongnyeon Kim
Wooseok Jang
Gyuseong Lee
Susung Hong
Junyoung Seo
Seung Wook Kim
VLM
DiffM
37
11
0
17 Dec 2022
Point-E: A System for Generating 3D Point Clouds from Complex Prompts
Alex Nichol
Heewoo Jun
Prafulla Dhariwal
Pamela Mishkin
Mark Chen
DiffM
47
587
0
16 Dec 2022
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
Qiucheng Wu
Yujian Liu
Handong Zhao
Ajinkya Kale
T. Bui
Tong Yu
Zhe-nan Lin
Yang Zhang
Shiyu Chang
DiffM
CoGe
30
98
0
16 Dec 2022
Mystique: Enabling Accurate and Scalable Generation of Production AI Benchmarks
Mingyu Liang
Wenyin Fu
Louis Feng
Zhongyi Lin
P. Panakanti
Shengbao Zheng
Srinivas Sridharan
Christina Delimitrou
26
12
0
16 Dec 2022
Unifying Human Motion Synthesis and Style Transfer with Denoising Diffusion Probabilistic Models
Ziyi Chang
Edmund J. C. Findlay
Hao Zhang
Hubert P. H. Shum
DiffM
VGen
26
11
0
16 Dec 2022
CLIPPO: Image-and-Language Understanding from Pixels Only
Michael Tschannen
Basil Mustafa
N. Houlsby
CLIP
VLM
32
48
0
15 Dec 2022
Manifestations of Xenophobia in AI Systems
Nenad Tomašev
J. L. Maynard
Iason Gabriel
24
9
0
15 Dec 2022
TeTIm-Eval: a novel curated evaluation data set for comparing text-to-image models
Federico A. Galatolo
M. G. Cimino
E. Cogotti
32
4
0
15 Dec 2022
Text-Guided Mask-free Local Image Retouching
Zerun Liu
Fan Zhang
Jingxuan He
Jin Wang
Zhangye Wang
Lechao Cheng
DiffM
33
5
0
15 Dec 2022
The Infinite Index: Information Retrieval on Generative Text-To-Image Models
Niklas Deckers
Maik Fröbe
Johannes Kiesel
G. Pandolfo
Christopher Schröder
Benno Stein
Martin Potthast
DiffM
42
16
0
14 Dec 2022
Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Su Wang
Chitwan Saharia
Ceslee Montgomery
Jordi Pont-Tuset
Shai Noy
...
Radu Soricut
Jason Baldridge
Mohammad Norouzi
Peter Anderson
William Chan
35
178
0
13 Dec 2022
CREPE: Can Vision-Language Foundation Models Reason Compositionally?
Zixian Ma
Jerry Hong
Mustafa Omer Gul
Mona Gandhi
Irena Gao
Ranjay Krishna
CoGe
37
125
0
13 Dec 2022
Semantic Brain Decoding: from fMRI to conceptually similar image reconstruction of visual stimuli
Matteo Ferrante
T. Boccato
N. Toschi
DiffM
33
21
0
13 Dec 2022
HS-Diffusion: Semantic-Mixing Diffusion for Head Swapping
Qinghe Wang
Lijie Liu
Miao Hua
Pengfei Zhu
W. Zuo
Qinghua Hu
Huchuan Lu
Bing Cao
DiffM
29
8
0
13 Dec 2022
Rodin: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion
Tengfei Wang
Bo Zhang
Ting Zhang
Shuyang Gu
Jianmin Bao
...
Jingjing Shen
Dong Chen
Fang Wen
Qifeng Chen
B. Guo
40
280
0
12 Dec 2022
The Stable Artist: Steering Semantics in Diffusion Latent Space
Manuel Brack
P. Schramowski
Felix Friedrich
Dominik Hintersdorf
Kristian Kersting
DiffM
19
25
0
12 Dec 2022
RGBD2: Generative Scene Synthesis via Incremental View Inpainting using RGBD Diffusion Models
Jiabao Lei
Jiapeng Tang
Kui Jia
DiffM
32
38
0
12 Dec 2022
Diff-Font: Diffusion Model for Robust One-Shot Font Generation
Haibin He
Xinyuan Chen
Chaoyue Wang
Juhua Liu
Bo Du
Dacheng Tao
Yu Qiao
DiffM
42
36
0
12 Dec 2022
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning
Zhao Mandi
Homanga Bharadhwaj
Vincent Moens
Shuran Song
Aravind Rajeswaran
Vikash Kumar
LM&Ro
30
70
0
12 Dec 2022
How to Backdoor Diffusion Models?
Sheng-Yen Chou
Pin-Yu Chen
Tsung-Yi Ho
DiffM
SILM
25
96
0
11 Dec 2022
SmartBrush: Text and Shape Guided Object Inpainting with Diffusion Model
Shaoan Xie
Zhifei Zhang
Zhe-nan Lin
Tobias Hinz
Kun Zhang
DiffM
33
232
0
09 Dec 2022
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Weixi Feng
Xuehai He
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
Junfeng Fang
William Yang Wang
CoGe
53
300
0
09 Dec 2022
LADIS: Language Disentanglement for 3D Shape Editing
Ian Huang
Panos Achlioptas
Tianyi Zhang
Sergey Tulyakov
Minhyuk Sung
Leonidas J. Guibas
34
10
0
09 Dec 2022
Reminding Forgetful Organic Neuromorphic Device Networks
Daniel Felder
Katerina Muche
J. Linkhorst
Matthias Wessling
49
6
0
09 Dec 2022
MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis
Rishabh Dabral
Muhammad Hamza Mughal
Vladislav Golyanik
Christian Theobalt
DiffM
VGen
37
171
0
08 Dec 2022
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation
Yen-Chi Cheng
Hsin-Ying Lee
Sergey Tulyakov
Alex Schwing
Liangyan Gui
DiffM
35
247
0
08 Dec 2022
Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
67
827
0
08 Dec 2022
Diffusion Guided Domain Adaptation of Image Generators
Kunpeng Song
Ligong Han
Bingchen Liu
Dimitris N. Metaxas
Ahmed Elgammal
DiffM
28
34
0
08 Dec 2022
Executing your Commands via Motion Diffusion in Latent Space
Xin Chen
Biao Jiang
Wen Liu
Zilong Huang
Bin-Bin Fu
Tao Chen
Jingyi Yu
Gang Yu
VGen
DiffM
25
338
0
08 Dec 2022
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
Zhixing Zhang
Ligong Han
Arna Ghosh
Dimitris N. Metaxas
Jian Ren
DiffM
51
155
0
08 Dec 2022
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion
Hanqing Zhao
Dianmo Sheng
Jianmin Bao
Dongdong Chen
Dong Chen
...
Ce Liu
Wenbo Zhou
Qi Chu
Weiming Zhang
Neng H. Yu
VLM
DiffM
38
39
0
07 Dec 2022
Diffusion-SDF: Text-to-Shape via Voxelized Diffusion
Muheng Li
Yueqi Duan
Jie Zhou
Jiwen Lu
DiffM
44
121
0
06 Dec 2022
NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors
Congyue Deng
C. Jiang
C. Qi
Xinchen Yan
Yin Zhou
Leonidas J. Guibas
Drago Anguelov
DiffM
29
161
0
06 Dec 2022
Fine-tuned CLIP Models are Efficient Video Learners
H. Rasheed
Muhammad Uzair Khattak
Muhammad Maaz
Salman Khan
Fahad Shahbaz Khan
CLIP
VLM
34
150
0
06 Dec 2022
ADIR: Adaptive Diffusion for Image Reconstruction
Shady Abu Hussein
Tom Tirer
Raja Giryes
DiffM
21
20
0
06 Dec 2022
Image Inpainting via Iteratively Decoupled Probabilistic Modeling
Wenbo Li
Xin Yu
Kun Zhou
Yibing Song
Zhe-nan Lin
Jiaya Jia
DiffM
42
11
0
06 Dec 2022
M-VADER: A Model for Diffusion with Multimodal Context
Samuel Weinbach
Marco Bellagente
C. Eichenberg
Andrew M. Dai
R. Baldock
Souradeep Nanda
Bjorn Deiseroth
Koen Oostermeijer
H. Teufel
Andres Felipe Cruz Salinas
DiffM
37
11
0
06 Dec 2022
Pretrained Diffusion Models for Unified Human Motion Synthesis
Jianxin Ma
Shuai Bai
Chang Zhou
DiffM
VGen
AI4CE
33
31
0
06 Dec 2022
Adaptive Testing of Computer Vision Models
Irena Gao
Gabriel Ilharco
Scott M. Lundberg
Marco Tulio Ribeiro
VLM
17
42
0
06 Dec 2022
PhysDiff: Physics-Guided Human Motion Diffusion Model
Ye Yuan
Jiaming Song
Umar Iqbal
Arash Vahdat
Jan Kautz
VGen
DiffM
45
237
0
05 Dec 2022
One-shot Implicit Animatable Avatars with Model-based Priors
Yangyi Huang
Hongwei Yi
Weiyang Liu
Haofan Wang
Boxi Wu
Wenxiao Wang
Binbin Lin
Debing Zhang
Deng Cai
3DH
37
32
0
05 Dec 2022
CLIPVG: Text-Guided Image Manipulation Using Differentiable Vector Graphics
Yiren Song
Xuning Shao
Kang Chen
Weidong Zhang
Minzhe Li
Zhongliang Jing
CLIP
VLM
29
22
0
05 Dec 2022
Multiscale Structure Guided Diffusion for Image Deblurring
Mengwei Ren
M. Delbracio
Hossein Talebi
Guido Gerig
P. Milanfar
DiffM
23
59
0
04 Dec 2022
PartSLIP: Low-Shot Part Segmentation for 3D Point Clouds via Pretrained Image-Language Models
Minghua Liu
Yinhao Zhu
H. Cai
Shizhong Han
Z. Ling
Fatih Porikli
Hao Su
3DPC
41
70
0
03 Dec 2022
ObjectStitch: Generative Object Compositing
Yi-Zhe Song
Zhifei Zhang
Zhe-nan Lin
Scott D. Cohen
Brian L. Price
Jianming Zhang
Seunggeun Kim
Daniel G. Aliaga
DiffM
26
31
0
02 Dec 2022
Diffusion Generative Models in Infinite Dimensions
Gavin Kerrigan
Justin Ley
Padhraic Smyth
DiffM
60
28
0
01 Dec 2022
Weakly Supervised Annotations for Multi-modal Greeting Cards Dataset
Sidra Hanif
Longin Jan Latecki
24
0
0
01 Dec 2022
3D-LDM: Neural Implicit 3D Shape Generation with Latent Diffusion Models
Gimin Nam
Mariem Khlifi
Andrew Rodriguez
Alberto Tono
Linqi Zhou
Paul Guerrero
DiffM
37
68
0
01 Dec 2022
Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models
Nithin Gopalakrishnan Nair
W. G. C. Bandara
Vishal M. Patel
DiffM
8
5
0
01 Dec 2022
SparseFusion: Distilling View-conditioned Diffusion for 3D Reconstruction
Zhizhuo Zhou
Shubham Tulsiani
DiffM
38
210
0
01 Dec 2022