ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01952
  4. Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
ArXivPDFHTML

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

35 / 1,635 papers shown
Title
NExT-GPT: Any-to-Any Multimodal LLM
NExT-GPT: Any-to-Any Multimodal LLM
Shengqiong Wu
Hao Fei
Leigang Qu
Wei Ji
Tat-Seng Chua
MLLM
46
458
0
11 Sep 2023
AdBooster: Personalized Ad Creative Generation using Stable Diffusion
  Outpainting
AdBooster: Personalized Ad Creative Generation using Stable Diffusion Outpainting
Veronika Shilova
Ludovic Dos Santos
Flavian Vasile
Gaetan Racic
Ugo Tanielian
DiffM
11
7
0
08 Sep 2023
Chasing Consistency in Text-to-3D Generation from a Single Image
Chasing Consistency in Text-to-3D Generation from a Single Image
Yichen Ouyang
Wenhao Chai
Jiayi Ye
Dapeng Tao
Yibing Zhan
Gaoang Wang
DiffM
20
15
0
07 Sep 2023
Bridge Diffusion Model: bridge non-English language-native text-to-image
  diffusion model with English communities
Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities
Shanyuan Liu
Dawei Leng
Yuhui Yin
DiffM
24
7
0
02 Sep 2023
MVDream: Multi-view Diffusion for 3D Generation
MVDream: Multi-view Diffusion for 3D Generation
Yichun Shi
Peng Wang
Jianglong Ye
Mai Long
Kejie Li
X. Yang
22
589
0
31 Aug 2023
Manipulating Embeddings of Stable Diffusion Prompts
Manipulating Embeddings of Stable Diffusion Prompts
Niklas Deckers
Julia Peters
Martin Potthast
DiffM
40
9
0
23 Aug 2023
DUAW: Data-free Universal Adversarial Watermark against Stable Diffusion
  Customization
DUAW: Data-free Universal Adversarial Watermark against Stable Diffusion Customization
Xiaoyu Ye
Hao Huang
Jiaqi An
Yongtao Wang
WIGM
26
22
0
19 Aug 2023
Dynamic Attention-Guided Diffusion for Image Super-Resolution
Dynamic Attention-Guided Diffusion for Image Super-Resolution
Brian B. Moser
Stanislav Frolov
Federico Raue
Sebastián M. Palacio
Andreas Dengel
DiffM
32
3
0
15 Aug 2023
COMICS: End-to-end Bi-grained Contrastive Learning for Multi-face
  Forgery Detection
COMICS: End-to-end Bi-grained Contrastive Learning for Multi-face Forgery Detection
Cong Zhang
H. Qi
Shuhui Wang
Yuezun Li
Siwei Lyu
CVBM
41
6
0
03 Aug 2023
Revisiting DETR Pre-training for Object Detection
Revisiting DETR Pre-training for Object Detection
Yan Ma
Weicong Liang
Bo-Ying Chen
Yiduo Hao
Bojian Hou
Xiangyu Yue
Chao Zhang
Yuhui Yuan
VLM
ViT
35
4
0
02 Aug 2023
General Purpose Artificial Intelligence Systems (GPAIS): Properties,
  Definition, Taxonomy, Societal Implications and Responsible Governance
General Purpose Artificial Intelligence Systems (GPAIS): Properties, Definition, Taxonomy, Societal Implications and Responsible Governance
I. Triguero
Daniel Molina
Javier Poyatos
Javier Del Ser
Francisco Herrera
AI4TS
AI4MH
34
5
0
26 Jul 2023
Objaverse-XL: A Universe of 10M+ 3D Objects
Objaverse-XL: A Universe of 10M+ 3D Objects
Matt Deitke
Ruoshi Liu
Matthew Wallingford
Huong Ngo
Oscar Michel
...
Carl Vondrick
Georgia Gkioxari
Kiana Ehsani
Ludwig Schmidt
Ali Farhadi
25
381
0
11 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models
  without Specific Tuning
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
40
782
0
10 Jul 2023
JourneyDB: A Benchmark for Generative Image Understanding
JourneyDB: A Benchmark for Generative Image Understanding
Keqiang Sun
Junting Pan
Yuying Ge
Hao Li
Haodong Duan
...
Yi Wang
Jifeng Dai
Yu Qiao
Limin Wang
Hongsheng Li
54
103
0
03 Jul 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion
  Models
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffM
VLM
33
39
0
01 Jun 2023
Differential Diffusion: Giving Each Pixel Its Strength
Differential Diffusion: Giving Each Pixel Its Strength
E. Levin
Ohad Fried
DiffM
37
20
0
01 Jun 2023
Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image
  Diffusion Models
Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models
Pablo Pernias
Dominic Rampas
Mats L. Richter
Christopher Pal
Marc Aubreville
DiffM
VLM
26
42
0
01 Jun 2023
Addressing Negative Transfer in Diffusion Models
Addressing Negative Transfer in Diffusion Models
Hyojun Go
Jinyoung Kim
Yunsung Lee
Seunghyun Lee
Shinhyeok Oh
Hyeongdon Moon
Seungtaek Choi
DiffM
VLM
32
24
0
01 Jun 2023
LANCE: Stress-testing Visual Models by Generating Language-guided
  Counterfactual Images
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images
Viraj Prabhu
Sriram Yenamandra
Prithvijit Chattopadhyay
Judy Hoffman
33
38
0
30 May 2023
Real-World Image Variation by Aligning Diffusion Inversion Chain
Real-World Image Variation by Aligning Diffusion Inversion Chain
Yuechen Zhang
Jinbo Xing
Eric Lo
Jiaya Jia
27
34
0
30 May 2023
GlyphControl: Glyph Conditional Control for Visual Text Generation
GlyphControl: Glyph Conditional Control for Visual Text Generation
Yukang Yang
Dongnan Gui
Yuhui Yuan
Weicong Liang
Haisong Ding
Hang-Rui Hu
Kai Chen
DiffM
38
78
0
29 May 2023
Are Diffusion Models Vision-And-Language Reasoners?
Are Diffusion Models Vision-And-Language Reasoners?
Benno Krojer
Elinor Poole-Dayan
Vikram S. Voleti
Christopher Pal
Siva Reddy
45
13
0
25 May 2023
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot
  Semantic Correspondence
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence
Junyi Zhang
Charles Herrmann
Junhwa Hur
Luisa Polania Cabrera
Varun Jampani
Deqing Sun
Ming Yang
DiffM
39
171
0
24 May 2023
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image
  Diffusion Models with Large Language Models
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
Long Lian
Boyi Li
Adam Yala
Trevor Darrell
43
152
0
23 May 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
Haoyu Lu
Guoxing Yang
Nanyi Fei
Yuqi Huo
Zhiwu Lu
Ping Luo
Mingyu Ding
DiffM
VGen
28
56
0
22 May 2023
LeftRefill: Filling Right Canvas based on Left Reference through
  Generalized Text-to-Image Diffusion Model
LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model
Chenjie Cao
Yunuo Cai
Qiaole Dong
Yikai Wang
Yanwei Fu
DiffM
40
15
0
19 May 2023
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Junchen Zhu
Jianlong Fu
Jiaying Liu
DiffM
VGen
48
114
0
18 May 2023
Exploiting Diffusion Prior for Real-World Image Super-Resolution
Exploiting Diffusion Prior for Real-World Image Super-Resolution
Jianyi Wang
Zongsheng Yue
Shangchen Zhou
Kelvin C. K. Chan
Chen Change Loy
46
284
0
11 May 2023
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Chenghao Li
Chaoning Zhang
Atish Waghwase
Lik-Hang Lee
François Rameau
Yang Yang
Sung-Ho Bae
Choong Seon Hong
54
74
0
10 May 2023
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image
  Generation
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation
Yuval Kirstain
Adam Polyak
Uriel Singer
Shahbuland Matiana
Joe Penna
Omer Levy
EGVM
168
352
0
02 May 2023
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image
  Synthesis and Editing
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
Ming Cao
Xintao Wang
Zhongang Qi
Ying Shan
Xiaohu Qie
Yinqiang Zheng
DiffM
42
430
0
17 Apr 2023
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Jiabao Ji
Guanhua Zhang
Zhaowen Wang
Bairu Hou
Zhifei Zhang
Brian L. Price
Shiyu Chang
DiffM
35
29
0
12 Apr 2023
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for
  Text-to-Image Diffusion Models
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models
Chong Mou
Xintao Wang
Liangbin Xie
Yanze Wu
Jing Zhang
Zhongang Qi
Ying Shan
Xiaohu Qie
DiffM
31
977
0
16 Feb 2023
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion
  Models
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models
Rongjie Huang
Jia-Bin Huang
Dongchao Yang
Yi Ren
Luping Liu
Mingze Li
Zhenhui Ye
Jinglin Liu
Xiaoyue Yin
Zhou Zhao
DiffM
151
317
0
30 Jan 2023
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,796
0
24 Feb 2021
Previous
123...313233