SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Müller
Joe Penna
Robin Rombach

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

50 / 1,635 papers shown
Chain of Images for Intuitively Reasoning
Fanxu Meng
Haotong Yang
Yiding Wang
Muhan Zhang
LRM
36
7
0
09 Nov 2023
A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization
Xingzhe He
Zhiwen Cao
Nicholas I. Kolkin
Lantao Yu
Kun Wan
Helge Rhodin
Ratheesh Kalarot
37
13
0
07 Nov 2023
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Zhanyue Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffM
VGen
43
200
0
07 Nov 2023
SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis
Hanrong Ye
Jason Kuen
Qing Liu
Zhe-nan Lin
Brian L. Price
Dan Xu
VLM
18
11
0
06 Nov 2023
Quantum circuit synthesis with diffusion models
Florian Fürrutter
Gorka Muñoz-Gil
H. Briegel
AI4CE
DiffM
32
20
0
03 Nov 2023
The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing
Shen Nie
Hanzhong Guo
Cheng Lu
Yuhao Zhou
Chenyu Zheng
Chongxuan Li
DiffM
29
38
0
02 Nov 2023
POS: A Prompts Optimization Suite for Augmenting Text-to-Video Generation
Shijie Ma
Huayi Xu
Mengjian Li
Weidong Geng
Yaxiong Wang
Meng Wang
DiffM
VGen
19
0
0
02 Nov 2023
De-Diffusion Makes Text a Strong Cross-Modal Interface
Chen Wei
Chenxi Liu
Siyuan Qiao
Zhishuai Zhang
Alan Yuille
Jiahui Yu
VLM
DiffM
37
10
0
01 Nov 2023
Consistent Video-to-Video Transfer Using Synthetic Dataset
Jiaxin Cheng
Tianjun Xiao
Tong He
VGen
DiffM
33
14
0
01 Nov 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
26
282
0
30 Oct 2023
Noise-Free Score Distillation
Oren Katzir
Or Patashnik
Daniel Cohen-Or
Dani Lischinski
DiffM
21
70
0
26 Oct 2023
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors
You-Ming Chang
Chen Yeh
Wei-Chen Chiu
Ning Yu
VPVLM
VLM
78
23
0
26 Oct 2023
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
DiffM
59
8
0
25 Oct 2023
Integrating View Conditions for Image Synthesis
Jinbin Bai
Zhen Dong
Aosong Feng
Xiao Zhang
Tian-Chun Ye
Kaicheng Zhou
67
13
0
24 Oct 2023
Matryoshka Diffusion Models
Jiatao Gu
Shuangfei Zhai
Yizhen Zhang
Joshua M. Susskind
Navdeep Jaitly
DiffM
21
43
0
23 Oct 2023
Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
Ruoxi Shi
Hansheng Chen
Zhuoyang Zhang
Minghua Liu
Chao Xu
Xinyue Wei
Linghao Chen
Chong Zeng
Hao Su
VLM
36
340
0
23 Oct 2023
Λ-Split: A Privacy-Preserving Split Computing Framework for Cloud-Powered Generative AI
Shoki Ohta
Takayuki Nishio
70
4
0
23 Oct 2023
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
Shawn Shan
Wenxin Ding
Josephine Passananti
Stanley Wu
Haitao Zheng
Ben Y. Zhao
SILM
DiffM
31
44
0
20 Oct 2023
DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation
Bangbang Yang
Wenqi Dong
Lin Ma
Wenbo Hu
Xiao Liu
Zhaopeng Cui
Yuewen Ma
DiffM
33
16
0
19 Oct 2023
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
Zijie Pan
Jiachen Lu
Xiatian Zhu
Li Zhang
DiffM
28
11
0
19 Oct 2023
Object-aware Inversion and Reassembly for Image Editing
Zhen Yang
Dinggang Gui
Wen Wang
Hao Chen
Bohan Zhuang
Chunhua Shen
DiffM
33
15
0
18 Oct 2023
Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
Qichao Wang
Tian Bian
Yian Yin
Tingyang Xu
Hong Cheng
Helen M. Meng
Zibin Zheng
Liang Chen
Bingzhe Wu
VLM
DiffM
33
3
0
18 Oct 2023
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Yaofang Liu
Xiaodong Cun
Xuebo Liu
Xintao Wang
Yong Zhang
Haoxin Chen
Yang Liu
Tieyong Zeng
Raymond H. F. Chan
Ying Shan
VGen
EGVM
24
128
0
17 Oct 2023
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Ruiqi Wu
Liangyu Chen
Tong Yang
Chunle Guo
Chongyi Li
Xiangyu Zhang
DiffM
VGen
89
52
0
16 Oct 2023
LOVECon: Text-driven Training-Free Long Video Editing with ControlNet
Zhenyi Liao
Zhijie Deng
DiffM
38
7
0
15 Oct 2023
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
Xian Liu
Jian Ren
Aliaksandr Siarohin
Ivan Skorokhodov
Yanyu Li
Dahua Lin
Xihui Liu
Ziwei Liu
Sergey Tulyakov
32
57
0
12 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM
MLLM
DiffM
13
22
0
12 Oct 2023
XIMAGENET-12: An Explainable AI Benchmark Dataset for Model Robustness Evaluation
Qiang Li
Dan Zhang
Shengzhao Lei
Xun Zhao
Porawit Kamnoedboon
WeiWei Li
Junhao Dong
Shuyan Li
VLM
30
1
0
12 Oct 2023
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Jie An
Zhengyuan Yang
Linjie Li
Jianfeng Wang
K. Lin
Zicheng Liu
Lijuan Wang
Jiebo Luo
25
11
0
11 Oct 2023
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
Yin-Yin He
Shaoshu Yang
Haoxin Chen
Xiaodong Cun
Menghan Xia
Yong Zhang
Xintao Wang
Ran He
Qifeng Chen
Ying Shan
39
71
0
11 Oct 2023
Mitigating stereotypical biases in text to image generative systems
Piero Esposito
Parmida Atighehchian
Anastasis Germanidis
Deepti Ghadiyaram
33
16
0
10 Oct 2023
JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling
Jingyang Zhang
Shiwei Li
Yuanxun Lu
Tian Fang
David McKinnon
Yanghai Tsin
Long Quan
Yao Yao
25
10
0
10 Oct 2023
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
Bo-Wen Zeng
Shanglin Li
Yutang Feng
Ling Yang
Hong Li
...
Conghui He
Wentao Zhang
Jianzhuang Liu
Baochang Zhang
Shuicheng Yan
DiffM
37
1
0
09 Oct 2023
Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Jianglong Ye
Peng Wang
Kejie Li
Yichun Shi
Heng Wang
DiffM
43
72
0
04 Oct 2023
Predicated Diffusion: Predicate Logic-Based Attention Guidance for Text-to-Image Diffusion Models
Kota Sueyoshi
Takashi Matsubara
DiffM
21
8
0
03 Oct 2023
TWIZ-v2: The Wizard of Multimodal Conversational-Stimulus
Rafael Ferreira
Diogo Tavares
Diogo Glória-Silva
Rodrigo Valerio
João Bordalo
Ines Simoes
Vasco Ramos
David Semedo
João Magalhães
24
4
0
03 Oct 2023
Prompt-tuning latent diffusion models for inverse problems
Hyungjin Chung
Jong Chul Ye
P. Milanfar
M. Delbracio
DiffM
30
41
0
02 Oct 2023
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen
Jincheng Yu
Chongjian Ge
Lewei Yao
Enze Xie
...
Zhongdao Wang
James T. Kwok
Ping Luo
Huchuan Lu
Zhenguo Li
DiffM
39
391
0
30 Sep 2023
Leveraging Optimization for Adaptive Attacks on Image Watermarks
Nils Lukas
Abdulrahman Diaa
L. Fenaux
Florian Kerschbaum
WIGM
22
24
0
29 Sep 2023
Towards Few-Call Model Stealing via Active Self-Paced Knowledge Distillation and Diffusion-Based Image Generation
Vlad Hondru
Radu Tudor Ionescu
DiffM
50
1
0
29 Sep 2023
Text-to-3D using Gaussian Splatting
Manish Sharma
Moitreya Chatterjee
Yikai Wang
Huaping Liu
3DGS
28
225
0
28 Sep 2023
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
Xiaoliang Dai
Ji Hou
Chih-Yao Ma
Sam S. Tsai
Jialiang Wang
...
Roshan Sumbaly
Vignesh Ramanathan
Zijian He
Peter Vajda
Devi Parikh
VLM
36
198
0
27 Sep 2023
DECORAIT -- DECentralized Opt-in/out Registry for AI Training
Karthika Balan
Alexander Black
Simon Jenni
Andrew Gilbert
Andy Parsons
John Collomosse
23
7
0
25 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
72
35
0
22 Sep 2023
DreamLLM: Synergistic Multimodal Comprehension and Creation
Runpei Dong
Chunrui Han
Yuang Peng
Zekun Qi
Zheng Ge
...
Hao-Ran Wei
Xiangwen Kong
Xiangyu Zhang
Kaisheng Ma
Li Yi
MLLM
39
173
0
20 Sep 2023
FreeU: Free Lunch in Diffusion U-Net
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
54
132
0
20 Sep 2023
On Copyright Risks of Text-to-Image Diffusion Models
Yang Zhang
Teoh Tze Tzun
Lim Wei Hern
Haonan Wang
Kenji Kawaguchi
77
9
0
15 Sep 2023
TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild
Huayang Li
Siheng Li
Deng Cai
Longyue Wang
Lemao Liu
Taro Watanabe
Yujiu Yang
Shuming Shi
MLLM
52
17
0
14 Sep 2023
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
Xingchao Liu
Xiwen Zhang
Jianzhu Ma
Jian Peng
Qiang Liu
108
196
0
12 Sep 2023
Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts
Zhi-Yi Chin
Chieh-Ming Jiang
Ching-Chun Huang
Pin-Yu Chen
Wei-Chen Chiu
DiffM
19
67
0
12 Sep 2023