2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 4,340 papers shown
Title
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
Pierre Fernandez
Guillaume Couairon
Hervé Jégou
Matthijs Douze
Teddy Furon
WIGM
34
177
0
27 Mar 2023
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis
T. Le
Hao Phung
Thuan Hoang Nguyen
Quan Dao
Ngoc N. Tran
Anh Tran
33
92
0
27 Mar 2023
Debiasing Scores and Prompts of 2D Diffusion for View-consistent Text-to-3D Generation
Susung Hong
Donghoon Ahn
Seungryong Kim
DiffM
33
23
0
27 Mar 2023
Training-free Content Injection using h-space in Diffusion Models
Jaeseok Jeong
Mingi Kwon
Youngjung Uh
DiffM
31
25
0
27 Mar 2023
Exploring Continual Learning of Diffusion Models
Michal Zajac
Kamil Deja
Anna Kuzina
Jakub M. Tomczak
Tomasz Trzciński
Florian Shkurti
Piotr Miłoś
DiffM
35
11
0
27 Mar 2023
Zero-Shot Composed Image Retrieval with Textual Inversion
Alberto Baldrati
Lorenzo Agnolucci
Marco Bertini
A. Bimbo
20
101
0
27 Mar 2023
Text-to-Image Diffusion Models are Zero-Shot Classifiers
Kevin Clark
P. Jaini
DiffM
VLM
38
107
0
27 Mar 2023
Data Augmentation for Environmental Sound Classification Using Diffusion Probabilistic Model with Top-k Selection Discriminator
Yunhao Chen
Yunjie Zhu
Zihui Yan
Jian Shen
Zhen Ren
Yifan Huang
DiffM
44
8
0
27 Mar 2023
Seer: Language Instructed Video Prediction with Latent Diffusion Models
Xianfan Gu
Chuan Wen
Weirui Ye
Jiaming Song
Yang Gao
DiffM
VGen
26
40
0
27 Mar 2023
DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion
Sauradip Nag
Xiatian Zhu
Jiankang Deng
Yi-Zhe Song
Tao Xiang
DiffM
VGen
48
21
0
27 Mar 2023
Equivariant Similarity for Vision-Language Foundation Models
Tan Wang
Kevin Qinghong Lin
Linjie Li
Chung-Ching Lin
Zhengyuan Yang
Hanwang Zhang
Zicheng Liu
Lijuan Wang
CoGe
46
44
0
25 Mar 2023
Human Preference Score: Better Aligning Text-to-Image Models with Human Preference
Xiaoshi Wu
Keqiang Sun
Feng Zhu
Rui Zhao
Hongsheng Li
34
133
0
25 Mar 2023
Freestyle Layout-to-Image Synthesis
Han Xue
Z. Huang
Qianru Sun
Li Song
Wenjun Zhang
DiffM
21
62
0
25 Mar 2023
DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency
Zalan Fabian
Berk Tınaz
Mahdi Soltanolkotabi
DiffM
19
8
0
25 Mar 2023
Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Junshu Tang
Tengfei Wang
Bo Zhang
Ting Zhang
Ran Yi
Lizhuang Ma
Dong Chen
DiffM
192
309
0
24 Mar 2023
CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images
Jordan J. Bird
Ahmad Lotfi
DiffM
30
105
0
24 Mar 2023
Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
Rui Chen
Yingfa Chen
Ningxin Jiao
Kui Jia
DiffM
53
562
0
24 Mar 2023
CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D Scene Layout
Haotian Bai
Yiqi Lin
Hui Xiong
Sijia Li
H. Lu
Xiaodong Lin
Lin Wang
DiffM
45
42
0
24 Mar 2023
DreamStone: Image as Stepping Stone for Text-Guided 3D Shape Generation
Zhengzhe Liu
Peng Dai
Ruihui Li
Xiaojuan Qi
Chi-Wing Fu
DiffM
21
10
0
24 Mar 2023
Conditional Image-to-Video Generation with Latent Flow Diffusion Models
Haomiao Ni
Changhao Shi
Kaican Li
Sharon X. Huang
Martin Renqiang Min
VGen
DiffM
40
165
0
24 Mar 2023
High Fidelity Image Synthesis With Deep VAEs In Latent Space
Troy Luhman
Eric Luhman
DRL
3DV
39
7
0
23 Mar 2023
End-to-End Diffusion Latent Optimization Improves Classifier Guidance
Bram Wallace
Akash Gokul
Stefano Ermon
Nikhil Naik
124
71
0
23 Mar 2023
NOPE: Novel Object Pose Estimation from a Single Image
Van Nguyen Nguyen
Thibault Groueix
Yinlin Hu
Mathieu Salzmann
Vincent Lepetit
48
25
0
23 Mar 2023
Artificial-intelligence-based molecular classification of diffuse gliomas using rapid, label-free optical imaging
Todd C. Hollon
Cheng Jiang
Asadur Chowdury
Mustafa Nasir-Moin
A. Kondepudi
...
M. Snuderl
S. Camelo-Piragua
C. Freudiger
Ho Hin Lee
D. Orringer
40
88
0
23 Mar 2023
Ablating Concepts in Text-to-Image Diffusion Models
Nupur Kumari
Bin Zhang
Sheng-Yu Wang
Eli Shechtman
Richard Y. Zhang
Jun-Yan Zhu
VLM
21
184
0
23 Mar 2023
DreamBooth3D: Subject-Driven Text-to-3D Generation
Amit Raj
S. Kaza
Ben Poole
Michael Niemeyer
Nataniel Ruiz
...
Kfir Aberman
Michael Rubinstein
Jonathan T. Barron
Yuanzhen Li
Varun Jampani
DiffM
24
220
0
23 Mar 2023
ReVersion: Diffusion-Based Relation Inversion from Images
Ziqi Huang
Tianxing Wu
Yuming Jiang
Kelvin C. K. Chan
Ziwei Liu
54
67
0
23 Mar 2023
Promptable Game Models: Text-Guided Game Simulation via Masked Diffusion Models
Willi Menapace
Aliaksandr Siarohin
Stéphane Lathuilière
Panos Achlioptas
Vladislav Golyanik
Sergey Tulyakov
Elisa Ricci
LM&Ro
VGen
DiffM
49
14
0
23 Mar 2023
CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Haoxuan You
Mandy Guo
Zhecan Wang
Kai-Wei Chang
Jason Baldridge
Jiahui Yu
DiffM
54
13
0
23 Mar 2023
Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes
Dana Cohen-Bar
Elad Richardson
G. Metzer
Raja Giryes
Daniel Cohen-Or
79
54
0
23 Mar 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
VGen
29
548
0
23 Mar 2023
Medical diffusion on a budget: textual inversion for medical image generation
B. D. Wilde
A. Saha
R. T. Broek
Henkjan Huisman
DiffM
MedIm
47
15
0
23 Mar 2023
ChatGPT for Shaping the Future of Dentistry: The Potential of Multi-Modal Large Language Model
Hanyao Huang
Ou Zheng
Dongdong Wang
Jiayi Yin
Zijin Wang
...
H. Yin
Chuan Xu
Renjie Yang
Q. Zheng
B. Shi
MedIm
AI4MH
AI4CE
LM&MA
58
176
0
23 Mar 2023
TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision
Jiacheng Wei
Hao Wang
Jiashi Feng
Guosheng Lin
Kim-Hui Yap
24
30
0
23 Mar 2023
Explore the Power of Synthetic Data on Few-shot Object Detection
Shaobo Lin
Kun Wang
Xingyu Zeng
Ruili Zhao
40
32
0
23 Mar 2023
MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
Jing Zhao
Heliang Zheng
Chaoyue Wang
L. Lan
Wenjing Yang
VLM
47
17
0
23 Mar 2023
Controllable Inversion of Black-Box Face Recognition Models via Diffusion
Manuel Kansy
Anton Raël
Graziana Mignone
Jacek Naruniec
Christopher Schroers
Markus Gross
Romann M. Weber
DiffM
79
18
0
23 Mar 2023
Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions
Ayaan Haque
Matthew Tancik
Alexei A. Efros
Aleksander Holynski
Angjoo Kanazawa
VGen
DiffM
56
361
0
22 Mar 2023
LFM-3D: Learnable Feature Matching Across Wide Baselines Using 3D Signals
Arjun Karpur
Guilherme Perrotta
Ricardo Martín Brualla
Howard Zhou
A. Araújo
3DV
43
4
0
22 Mar 2023
Pix2Video: Video Editing using Image Diffusion
Duygu Ceylan
C. Huang
Niloy J. Mitra
DiffM
VGen
46
245
0
22 Mar 2023
A Word is Worth a Thousand Pictures: Prompts as AI Design Material
Chinmay Kulkarni
Stefania Druga
Minsuk Chang
Alexander J. Fiannaca
Carrie J. Cai
Michael Terry
3DV
29
30
0
22 Mar 2023
Feature-Conditioned Cascaded Video Diffusion Models for Precise Echocardiogram Synthesis
Hadrien Reynaud
Mengyun Qiao
Mischa Dombrowski
Thomas Day
Reza Razavi
Alberto Gómez
Paul Leeson
Bernhard Kainz
DiffM
VGen
MedIm
48
22
0
22 Mar 2023
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Sheng-Siang Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
...
Gong Ming
Lijuan Wang
Zicheng Liu
Houqiang Li
Nan Duan
VGen
20
125
0
22 Mar 2023
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
K. Pnvr
Bharat Singh
P. Ghosh
Behjat Siddiquie
David Jacobs
DiffM
40
29
0
22 Mar 2023
The Prompt Artists
Minsuk Chang
Stefania Druga
Alexander J. Fiannaca
P. Vergani
Chinmay Kulkarni
Carrie J. Cai
Michael Terry
21
59
0
22 Mar 2023
Compositional 3D Scene Generation using Locally Conditioned Diffusion
Ryan Po
Gordon Wetzstein
DiffM
32
85
0
21 Mar 2023
MAGVLT: Masked Generative Vision-and-Language Transformer
Sungwoong Kim
DaeJin Jo
Donghoon Lee
Jongmin Kim
VLM
47
12
0
21 Mar 2023
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
Sara Sarto
Manuele Barraco
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
23
55
0
21 Mar 2023
Affordance Diffusion: Synthesizing Hand-Object Interactions
Yufei Ye
Xueting Li
Abhi Gupta
Shalini De Mello
Stan Birchfield
Jiaming Song
Shubham Tulsiani
Sifei Liu
DiffM
43
76
0
21 Mar 2023
Vox-E: Text-guided Voxel Editing of 3D Objects
Etai Sella
Gal Fiebelman
Peter Hedman
Hadar Averbuch-Elor
DiffM
36
74
0
21 Mar 2023