ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01952
  4. Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
ArXivPDFHTML

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

50 / 1,635 papers shown
Title
UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with
  Authenticity Guided Textures
UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures
Mingyuan Zhou
Rakib Hyder
Ziwei Xuan
Guojun Qi
32
6
0
20 Jan 2024
DiffusionGPT: LLM-Driven Text-to-Image Generation System
DiffusionGPT: LLM-Driven Text-to-Image Generation System
Jie Qin
Jie Wu
Weifeng Chen
Yuxi Ren
Huixian Li
Hefeng Wu
Xuefeng Xiao
Rui Wang
S. Wen
DiffM
62
25
0
18 Jan 2024
Vlogger: Make Your Dream A Vlog
Vlogger: Make Your Dream A Vlog
Shaobin Zhuang
Kunchang Li
Xinyuan Chen
Yaohui Wang
Ziwei Liu
Yu Qiao
Yali Wang
VGen
DiffM
43
35
0
17 Jan 2024
UniVG: Towards UNIfied-modal Video Generation
UniVG: Towards UNIfied-modal Video Generation
Ludan Ruan
Lei Tian
Chuanwei Huang
Xu Zhang
Xinyan Xiao
VGen
DiffM
34
3
0
17 Jan 2024
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image
  Synthesis
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Jonghyun Lee
Hansam Cho
Youngjoon Yoo
Seoung Bum Kim
Yonghyun Jeong
DiffM
23
7
0
17 Jan 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video
  Diffusion Models
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
126
280
0
17 Jan 2024
Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive
Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive
Yumeng Li
M. Keuper
Dan Zhang
Anna Khoreva
DiffM
43
10
0
16 Jan 2024
WAVES: Benchmarking the Robustness of Image Watermarks
WAVES: Benchmarking the Robustness of Image Watermarks
Bang An
Mucong Ding
Tahseen Rabbani
Aakriti Agrawal
Yuancheng Xu
...
Sicheng Zhu
Abdirisak Mohamed
Yuxin Wen
Tom Goldstein
Furong Huang
33
40
0
16 Jan 2024
Instilling Multi-round Thinking to Text-guided Image Generation
Instilling Multi-round Thinking to Text-guided Image Generation
Lidong Zeng
Zhedong Zheng
Yinwei Wei
Tat-Seng Chua
34
5
0
16 Jan 2024
HexaGen3D: StableDiffusion is just one step away from Fast and Diverse
  Text-to-3D Generation
HexaGen3D: StableDiffusion is just one step away from Fast and Diverse Text-to-3D Generation
Antoine Mercier
Ramin Nakhli
Mahesh Reddy
R. Yasarla
Hong Cai
Fatih Porikli
Guillaume Berger
DiffM
35
15
0
15 Jan 2024
InstantID: Zero-shot Identity-Preserving Generation in Seconds
InstantID: Zero-shot Identity-Preserving Generation in Seconds
Qixun Wang
Xu Bai
Haofan Wang
Zekui Qin
Anthony Chen
Huaxia Li
Xu Tang
Yao Hu
46
238
0
15 Jan 2024
PIXART-δ: Fast and Controllable Image Generation with Latent
  Consistency Models
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models
Junsong Chen
Yue Wu
Simian Luo
Enze Xie
Sayak Paul
Ping Luo
Hang Zhao
Zhenguo Li
VLM
36
72
0
10 Jan 2024
Memory-Efficient Fine-Tuning for Quantized Diffusion Model
Memory-Efficient Fine-Tuning for Quantized Diffusion Model
Hyogon Ryu
Seohyun Lim
Hyunjung Shim
DiffM
MQ
27
6
0
09 Jan 2024
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
Tong Wu
Guandao Yang
Zhibing Li
Kai Zhang
Ziwei Liu
Leonidas J. Guibas
Dahua Lin
Gordon Wetzstein
EGVM
VGen
35
88
0
08 Jan 2024
Instruct-Imagen: Image Generation with Multi-modal Instruction
Instruct-Imagen: Image Generation with Multi-modal Instruction
Hexiang Hu
Kelvin C. K. Chan
Yu-Chuan Su
Wenhu Chen
Yandong Li
...
Xue Ben
Boqing Gong
William W. Cohen
Ming-Wei Chang
Xuhui Jia
MLLM
46
43
0
03 Jan 2024
Moonshot: Towards Controllable Video Generation and Editing with
  Multimodal Conditions
Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
David Junhao Zhang
Dongxu Li
Hung Le
Mike Zheng Shou
Caiming Xiong
Doyen Sahoo
VGen
27
23
0
03 Jan 2024
aMUSEd: An Open MUSE Reproduction
aMUSEd: An Open MUSE Reproduction
Suraj Patil
William Berman
Robin Rombach
Patrick von Platen
VLM
25
18
0
03 Jan 2024
SIGNeRF: Scene Integrated Generation for Neural Radiance Fields
SIGNeRF: Scene Integrated Generation for Neural Radiance Fields
Jan-Niklas Dihlmann
Andreas Engelhardt
Hendrik P. A. Lensch
DiffM
VGen
24
4
0
03 Jan 2024
ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image
  and Text
ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text
Dingkun Yan
Liang Yuan
Erwin Wu
Yuma Nishioka
I. Fujishiro
Suguru Saito
DiffM
21
5
0
02 Jan 2024
Image Sculpting: Precise Object Editing with 3D Geometry Control
Image Sculpting: Precise Object Editing with 3D Geometry Control
Jiraphon Yenphraphai
Xichen Pan
Sainan Liu
Daniele Panozzo
Saining Xie
34
19
0
02 Jan 2024
Towards a Simultaneous and Granular Identity-Expression Control in
  Personalized Face Generation
Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation
Renshuai Liu
Bowen Ma
Wei Zhang
Zhipeng Hu
Changjie Fan
Tangjie Lv
Yu-qiong Ding
Xuan Cheng
DiffM
27
20
0
02 Jan 2024
New Job, New Gender? Measuring the Social Bias in Image Generation
  Models
New Job, New Gender? Measuring the Social Bias in Image Generation Models
Wenxuan Wang
Haonan Bai
Jen-tse Huang
Yuxuan Wan
Youliang Yuan
Haoyi Qiu
Nanyun Peng
Michael R. Lyu
47
20
0
01 Jan 2024
DiffMorph: Text-less Image Morphing with Diffusion Models
DiffMorph: Text-less Image Morphing with Diffusion Models
Shounak Chatterjee
DiffM
15
0
0
01 Jan 2024
Diffusion Model with Perceptual Loss
Diffusion Model with Perceptual Loss
Shanchuan Lin
Xiao Yang
DiffM
30
15
0
30 Dec 2023
4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency
4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency
Yuyang Yin
Dejia Xu
Zhangyang Wang
Yao-Min Zhao
Yunchao Wei
3DGS
57
72
0
28 Dec 2023
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with
  Time-Decoupled Training and Reusable Coop-Diffusion
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Guansong Lu
Yuanfan Guo
Jianhua Han
Minzhe Niu
Yihan Zeng
Songcen Xu
Zeyi Huang
Zhao Zhong
Wei Zhang
Hang Xu
39
4
0
27 Dec 2023
One-Dimensional Adapter to Rule Them All: Concepts, Diffusion Models and
  Erasing Applications
One-Dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications
Mengyao Lyu
Yuhong Yang
Haiwen Hong
Hui Chen
Xuan Jin
Yuan He
Hui Xue
Jungong Han
Guiguang Ding
DiffM
29
58
0
26 Dec 2023
SSR-Encoder: Encoding Selective Subject Representation for
  Subject-Driven Generation
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Yuxuan Zhang
Yiren Song
Jiaming Liu
Rui Wang
Jinpeng Yu
...
Huaxia Li
Xu Tang
Yao Hu
Han Pan
Zhongliang Jing
49
58
0
26 Dec 2023
SAiD: Speech-driven Blendshape Facial Animation with Diffusion
SAiD: Speech-driven Blendshape Facial Animation with Diffusion
Inkyu Park
Jaewoong Cho
34
4
0
25 Dec 2023
Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data
  Generation Framework using Foundational Models
Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data Generation Framework using Foundational Models
Gurusha Juneja
Sukrit Kumar
DiffM
19
0
0
23 Dec 2023
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Xinyan Chen
Jiaxin Ge
Tianjun Zhang
Jiaming Liu
Shanghang Zhang
VLM
EGVM
42
0
0
23 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
20
241
0
21 Dec 2023
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image
  Inpainting with Diffusion Models
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan
Andranik Sargsyan
Barsegh Atanyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
35
28
0
21 Dec 2023
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion
  Models with RL Finetuning
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
Desai Xie
Jiahao Li
Hao Tan
Xin Sun
Zhixin Shu
Yi Zhou
Sai Bi
Soren Pirk
Arie E. Kaufman
37
8
0
21 Dec 2023
PIA: Your Personalized Image Animator via Plug-and-Play Modules in
  Text-to-Image Models
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
Yiming Zhang
Zhening Xing
Yanhong Zeng
Youqing Fang
Kai Chen
VGen
36
27
0
21 Dec 2023
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
Xianfang Zeng
Xin Chen
Zhongqi Qi
Wen Liu
Zibo Zhao
Zhibin Wang
Bin-Bin Fu
Yong-jin Liu
Gang Yu
DiffM
18
67
0
21 Dec 2023
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed
  Diffusion Models
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling
Seung Wook Kim
Antonio Torralba
Sanja Fidler
Karsten Kreis
DiffM
3DGS
45
113
0
21 Dec 2023
Generative Multimodal Models are In-Context Learners
Generative Multimodal Models are In-Context Learners
Quan-Sen Sun
Yufeng Cui
Xiaosong Zhang
Fan Zhang
Qiying Yu
...
Yueze Wang
Yongming Rao
Jingjing Liu
Tiejun Huang
Xinlong Wang
MLLM
LRM
45
247
0
20 Dec 2023
ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors
ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors
Weijia Mao
Yan-Pei Cao
Jia-Wei Liu
Zhongcong Xu
Mike Zheng Shou
DiffM
51
5
0
20 Dec 2023
RadEdit: stress-testing biomedical vision models via diffusion image
  editing
RadEdit: stress-testing biomedical vision models via diffusion image editing
Fernando Pérez-García
Sam Bond-Taylor
Pedro P. Sanchez
B. V. Breugel
Daniel Coelho De Castro
...
M. Lungren
A. Nori
Javier Alvarez-Valle
Ozan Oktay
Maximilian Ilse
MedIm
43
8
0
20 Dec 2023
Your Student is Better Than Expected: Adaptive Teacher-Student
  Collaboration for Text-Conditional Diffusion Models
Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models
Nikita Starodubcev
Artem Fedorov
Artem Babenko
Dmitry Baranchuk
DiffM
50
3
0
17 Dec 2023
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Mingsheng Li
Xin Chen
C. Zhang
Sijin Chen
Erik Cambria
Fukun Yin
Gang Yu
Tao Chen
36
24
0
17 Dec 2023
M^2ConceptBase: A Fine-Grained Aligned Concept-Centric Multimodal Knowledge Base
M^2ConceptBase: A Fine-Grained Aligned Concept-Centric Multimodal Knowledge Base
Zhiwei Zha
Jiaan Wang
Zhixu Li
Xiangru Zhu
Wei Song
Yanghua Xiao
VLM
45
2
0
16 Dec 2023
Latent Diffusion Models with Image-Derived Annotations for Enhanced
  AI-Assisted Cancer Diagnosis in Histopathology
Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology
Pedro Osório
Guillermo Jiménez-Pérez
Javier Montalt-Tordera
Jens Hooge
Guillem Duran Ballester
...
Sabrina Schroeder
K. Siudak
Julia Vienenkoetter
Bettina Lawrenz
Sadegh Mohammadi
MedIm
30
8
0
15 Dec 2023
Focus on Your Instruction: Fine-grained and Multi-instruction Image
  Editing by Attention Modulation
Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Qin Guo
Tianwei Lin
DiffM
22
31
0
15 Dec 2023
ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
Ruoxi Shi
Xinyue Wei
Cheng Wang
Hao Su
41
16
0
14 Dec 2023
Reliability in Semantic Segmentation: Can We Use Synthetic Data?
Reliability in Semantic Segmentation: Can We Use Synthetic Data?
Thibaut Loiseau
Tuan-Hung Vu
Mickaël Chen
Patrick Pérez
Matthieu Cord
UQCV
34
12
0
14 Dec 2023
DiffusionLight: Light Probes for Free by Painting a Chrome Ball
DiffusionLight: Light Probes for Free by Painting a Chrome Ball
Pakkapon Phongthawee
Worameth Chinchuthakun
Nontaphat Sinsunthithet
Amit Raj
Varun Jampani
Pramook Khungurn
Supasorn Suwajanakorn
DiffM
35
23
0
14 Dec 2023
Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and
  Multi-Source Supervision
Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision
Shengguang Wu
Zhenglun Chen
Qi Su
DiffM
30
0
0
13 Dec 2023
FreeInit: Bridging Initialization Gap in Video Diffusion Models
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Tianxing Wu
Chenyang Si
Yuming Jiang
Ziqi Huang
Ziwei Liu
DiffM
VGen
38
45
0
12 Dec 2023
Previous
123...282930313233
Next