Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.01952
Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
35 / 1,635 papers shown
Title
NExT-GPT: Any-to-Any Multimodal LLM
Shengqiong Wu
Hao Fei
Leigang Qu
Wei Ji
Tat-Seng Chua
MLLM
46
458
0
11 Sep 2023
AdBooster: Personalized Ad Creative Generation using Stable Diffusion Outpainting
Veronika Shilova
Ludovic Dos Santos
Flavian Vasile
Gaetan Racic
Ugo Tanielian
DiffM
11
7
0
08 Sep 2023
Chasing Consistency in Text-to-3D Generation from a Single Image
Yichen Ouyang
Wenhao Chai
Jiayi Ye
Dapeng Tao
Yibing Zhan
Gaoang Wang
DiffM
20
15
0
07 Sep 2023
Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities
Shanyuan Liu
Dawei Leng
Yuhui Yin
DiffM
24
7
0
02 Sep 2023
MVDream: Multi-view Diffusion for 3D Generation
Yichun Shi
Peng Wang
Jianglong Ye
Mai Long
Kejie Li
X. Yang
22
589
0
31 Aug 2023
Manipulating Embeddings of Stable Diffusion Prompts
Niklas Deckers
Julia Peters
Martin Potthast
DiffM
40
9
0
23 Aug 2023
DUAW: Data-free Universal Adversarial Watermark against Stable Diffusion Customization
Xiaoyu Ye
Hao Huang
Jiaqi An
Yongtao Wang
WIGM
26
22
0
19 Aug 2023
Dynamic Attention-Guided Diffusion for Image Super-Resolution
Brian B. Moser
Stanislav Frolov
Federico Raue
Sebastián M. Palacio
Andreas Dengel
DiffM
32
3
0
15 Aug 2023
COMICS: End-to-end Bi-grained Contrastive Learning for Multi-face Forgery Detection
Cong Zhang
H. Qi
Shuhui Wang
Yuezun Li
Siwei Lyu
CVBM
41
6
0
03 Aug 2023
Revisiting DETR Pre-training for Object Detection
Yan Ma
Weicong Liang
Bo-Ying Chen
Yiduo Hao
Bojian Hou
Xiangyu Yue
Chao Zhang
Yuhui Yuan
VLM
ViT
35
4
0
02 Aug 2023
General Purpose Artificial Intelligence Systems (GPAIS): Properties, Definition, Taxonomy, Societal Implications and Responsible Governance
I. Triguero
Daniel Molina
Javier Poyatos
Javier Del Ser
Francisco Herrera
AI4TS
AI4MH
34
5
0
26 Jul 2023
Objaverse-XL: A Universe of 10M+ 3D Objects
Matt Deitke
Ruoshi Liu
Matthew Wallingford
Huong Ngo
Oscar Michel
...
Carl Vondrick
Georgia Gkioxari
Kiana Ehsani
Ludwig Schmidt
Ali Farhadi
25
381
0
11 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
40
782
0
10 Jul 2023
JourneyDB: A Benchmark for Generative Image Understanding
Keqiang Sun
Junting Pan
Yuying Ge
Hao Li
Haodong Duan
...
Yi Wang
Jifeng Dai
Yu Qiao
Limin Wang
Hongsheng Li
54
103
0
03 Jul 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffM
VLM
33
39
0
01 Jun 2023
Differential Diffusion: Giving Each Pixel Its Strength
E. Levin
Ohad Fried
DiffM
37
20
0
01 Jun 2023
Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models
Pablo Pernias
Dominic Rampas
Mats L. Richter
Christopher Pal
Marc Aubreville
DiffM
VLM
26
42
0
01 Jun 2023
Addressing Negative Transfer in Diffusion Models
Hyojun Go
Jinyoung Kim
Yunsung Lee
Seunghyun Lee
Shinhyeok Oh
Hyeongdon Moon
Seungtaek Choi
DiffM
VLM
32
24
0
01 Jun 2023
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images
Viraj Prabhu
Sriram Yenamandra
Prithvijit Chattopadhyay
Judy Hoffman
33
38
0
30 May 2023
Real-World Image Variation by Aligning Diffusion Inversion Chain
Yuechen Zhang
Jinbo Xing
Eric Lo
Jiaya Jia
27
34
0
30 May 2023
GlyphControl: Glyph Conditional Control for Visual Text Generation
Yukang Yang
Dongnan Gui
Yuhui Yuan
Weicong Liang
Haisong Ding
Hang-Rui Hu
Kai Chen
DiffM
38
78
0
29 May 2023
Are Diffusion Models Vision-And-Language Reasoners?
Benno Krojer
Elinor Poole-Dayan
Vikram S. Voleti
Christopher Pal
Siva Reddy
45
13
0
25 May 2023
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence
Junyi Zhang
Charles Herrmann
Junhwa Hur
Luisa Polania Cabrera
Varun Jampani
Deqing Sun
Ming Yang
DiffM
39
171
0
24 May 2023
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
Long Lian
Boyi Li
Adam Yala
Trevor Darrell
43
152
0
23 May 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
Haoyu Lu
Guoxing Yang
Nanyi Fei
Yuqi Huo
Zhiwu Lu
Ping Luo
Mingyu Ding
DiffM
VGen
28
56
0
22 May 2023
LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model
Chenjie Cao
Yunuo Cai
Qiaole Dong
Yikai Wang
Yanwei Fu
DiffM
40
15
0
19 May 2023
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Junchen Zhu
Jianlong Fu
Jiaying Liu
DiffM
VGen
48
114
0
18 May 2023
Exploiting Diffusion Prior for Real-World Image Super-Resolution
Jianyi Wang
Zongsheng Yue
Shangchen Zhou
Kelvin C. K. Chan
Chen Change Loy
46
284
0
11 May 2023
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Chenghao Li
Chaoning Zhang
Atish Waghwase
Lik-Hang Lee
François Rameau
Yang Yang
Sung-Ho Bae
Choong Seon Hong
54
74
0
10 May 2023
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation
Yuval Kirstain
Adam Polyak
Uriel Singer
Shahbuland Matiana
Joe Penna
Omer Levy
EGVM
168
352
0
02 May 2023
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
Ming Cao
Xintao Wang
Zhongang Qi
Ying Shan
Xiaohu Qie
Yinqiang Zheng
DiffM
42
430
0
17 Apr 2023
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Jiabao Ji
Guanhua Zhang
Zhaowen Wang
Bairu Hou
Zhifei Zhang
Brian L. Price
Shiyu Chang
DiffM
35
29
0
12 Apr 2023
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models
Chong Mou
Xintao Wang
Liangbin Xie
Yanze Wu
Jing Zhang
Zhongang Qi
Ying Shan
Xiaohu Qie
DiffM
31
977
0
16 Feb 2023
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models
Rongjie Huang
Jia-Bin Huang
Dongchao Yang
Yi Ren
Luping Liu
Mingze Li
Zhenhui Ye
Jinglin Liu
Xiaoyue Yin
Zhou Zhao
DiffM
151
317
0
30 Jan 2023
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,796
0
24 Feb 2021
Previous
1
2
3
...
31
32
33