ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01952
  4. Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
ArXivPDFHTML

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

50 / 1,635 papers shown
Title
Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation
Amir Mohammad Izadi
Seyed Mohsen Hosseini
Soroush Vafaie Tabar
Ali Abdollahi
Armin Saghafian
M. Baghshah
EGVM
48
0
0
09 Mar 2025
SemHiTok: A Unified Image Tokenizer via Semantic-Guided Hierarchical Codebook for Multimodal Understanding and Generation
SemHiTok: A Unified Image Tokenizer via Semantic-Guided Hierarchical Codebook for Multimodal Understanding and Generation
Zhenpeng Chen
Chunwei Wang
Xiuwei Chen
Hang Xu
J. Han
Xiandan Liang
VLM
71
1
0
09 Mar 2025
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
Xiang Gao
Shuai Yang
Jiaying Liu
DiffM
51
0
0
08 Mar 2025
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
Jian Ma
Qirong Peng
Xu Guo
Chen Chen
H. Lu
Zhenyu Yang
VLM
72
1
0
08 Mar 2025
Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
Seil Kang
Jinyeong Kim
Junhyeok Kim
Seong Jae Hwang
VLM
93
2
0
08 Mar 2025
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice
Hongwei Yi
Tian Ye
Shitong Shao
Xuancheng Yang
Jiantong Zhao
...
Zeke Xie
Lei Zhu
Wei Li
Michael Lingelbach
Daquan Zhou
VGen
55
1
0
07 Mar 2025
DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction
Miaowei Wang
Yibo Zhang
R. Ma
Weiwei Xu
C. Zou
Daniel Morris
3DV
48
1
0
07 Mar 2025
Unified Reward Model for Multimodal Understanding and Generation
Yibin Wang
Yuhang Zang
Hao Li
Cheng Jin
Jie Wang
EGVM
73
4
0
07 Mar 2025
THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks
Chaoran Xiong
Litao Wei
Kehui Ma
Zhen Sun
Yan Xiang
Zihan Nan
Trieu-Kien Truong
Ling Pei
41
0
0
07 Mar 2025
Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details
Yifei Gao
Jun Huang
Lei Wang
Ruiting Dai
Jun Cheng
3DGS
61
0
0
06 Mar 2025
ControlFill: Spatially Adjustable Image Inpainting from Prompt Learning
Boseong Jeon
55
0
0
06 Mar 2025
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion
Ziyi Yang
Fanqi Wan
Longguang Zhong
Canbin Huang
Guosheng Liang
Xiaojun Quan
MoMe
98
0
0
06 Mar 2025
CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models
Shengzhuang Chen
Yikai Liao
Xiaoxiao Sun
Kede Ma
Ying Wei
70
0
0
06 Mar 2025
SpinML: Customized Synthetic Data Generation for Private Training of Specialized ML Models
SpinML: Customized Synthetic Data Generation for Private Training of Specialized ML Models
Jiang Zhang
Rohan Sequeira
Konstantinos Psounis
SyDa
78
0
0
05 Mar 2025
From Architectural Sketch to Conceptual Representation: Using Structure-Aware Diffusion Model to Generate Renderings of School Buildings
Zhengyang Wang
H. Jin
Xusheng Du
Yuxiao Ren
Ye Zhang
H. Xie
DiffM
52
0
0
05 Mar 2025
SPG: Improving Motion Diffusion by Smooth Perturbation Guidance
Boseong Jeon
DiffM
50
0
0
04 Mar 2025
VisAgent: Narrative-Preserving Story Visualization Framework
Seungkwon Kim
GyuTae Park
Sangyeon Kim
Seung-Hun Nam
45
0
0
04 Mar 2025
Smoothing the Shift: Towards Stable Test-Time Adaptation under Complex Multimodal Noises
Zirun Guo
Tao Jin
TTA
92
1
0
04 Mar 2025
RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification
Zhen Yang
Guibao Shen
Liang Hou
Mushui Liu
Luozhou Wang
Xin Tao
Pengfei Wan
Di Zhang
Ying-cong Chen
DiffM
79
0
0
04 Mar 2025
Jailbreaking Safeguarded Text-to-Image Models via Large Language Models
Zhengyuan Jiang
Yuepeng Hu
Yuqing Yang
Yinzhi Cao
Neil Zhenqiang Gong
72
0
0
03 Mar 2025
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
Zhendong Wang
Jianmin Bao
Shuyang Gu
Dong Chen
Wengang Zhou
Yiming Li
DiffM
53
0
0
03 Mar 2025
ACCORD: Alleviating Concept Coupling through Dependence Regularization for Text-to-Image Diffusion Personalization
Shizhan Liu
Hao Zheng
Hang Yu
Jianguo Li
DiffM
71
0
0
03 Mar 2025
WeGen: A Unified Model for Interactive Multimodal Generation as We Chat
Zhipeng Huang
Shaobin Zhuang
Canmiao Fu
Binxin Yang
Ying Zhang
Chong Sun
Zhizheng Zhang
Yali Wang
Chen Li
Zheng-Jun Zha
DiffM
69
2
0
03 Mar 2025
MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation
Yi Wang
Mushui Liu
Wanggui He
Longxiang Zhang
Z. Huang
...
Yiming Li
Weilong Dai
Mingli Song
Jie Song
Hao Jiang
MLLM
MoE
LRM
86
1
0
03 Mar 2025
Zero-Shot Head Swapping in Real-World Scenarios
Zero-Shot Head Swapping in Real-World Scenarios
S. Jeong
Taewoong Kang
Hyojin Jang
Jaegul Choo
39
0
0
02 Mar 2025
Enhancing Monocular 3D Scene Completion with Diffusion Model
Changlin Song
Jiaqi Wang
Liyun Zhu
He Weng
3DGS
38
0
0
02 Mar 2025
GenVDM: Generating Vector Displacement Maps From a Single Image
GenVDM: Generating Vector Displacement Maps From a Single Image
Yuezhi Yang
Qimin Chen
Vladimir G. Kim
S. Chaudhuri
Qixing Huang
Z. Chen
3DGS
VGen
29
1
0
01 Mar 2025
DiffBrush:Just Painting the Art by Your Hands
DiffBrush:Just Painting the Art by Your Hands
Jiaming Chu
Lei Jin
Tao Wang
Junliang Xing
Jian-jun Zhao
DiffM
42
0
0
28 Feb 2025
SafeText: Safe Text-to-image Models via Aligning the Text Encoder
SafeText: Safe Text-to-image Models via Aligning the Text Encoder
Yuepeng Hu
Zhengyuan Jiang
Neil Zhenqiang Gong
69
1
0
28 Feb 2025
Diffusion Restoration Adapter for Real-World Image Restoration
Diffusion Restoration Adapter for Real-World Image Restoration
Hanbang Liang
Zhen Wang
Weihui Deng
DiffM
45
0
0
28 Feb 2025
Interpreting CLIP with Hierarchical Sparse Autoencoders
Interpreting CLIP with Hierarchical Sparse Autoencoders
Vladimir Zaigrajew
Hubert Baniecki
P. Biecek
56
0
0
27 Feb 2025
LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces
LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces
Rashid Mushkani
Shravan Nayak
Hugo Berard
Allison Cohen
Shin Koseki
Hadrien Bertrand
54
2
0
27 Feb 2025
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think
L. Chen
S. Bai
Wenhao Chai
Weichu Xie
Haozhe Zhao
Leon Vinci
Junyang Lin
Baobao Chang
DiffM
92
4
0
27 Feb 2025
QPM: Discrete Optimization for Globally Interpretable Image Classification
QPM: Discrete Optimization for Globally Interpretable Image Classification
Thomas Norrenbrock
Timo Kaiser
Sovan Biswas
R. Manuvinakurike
Bodo Rosenhahn
62
0
0
27 Feb 2025
Knowledge Bridger: Towards Training-free Missing Multi-modality Completion
Knowledge Bridger: Towards Training-free Missing Multi-modality Completion
Guanzhou Ke
Shengfeng He
Xueliang Wang
Bo Wang
Guoqing Chao
Yujie Zhang
Yi Xie
HeXing Su
68
0
0
27 Feb 2025
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation
Reza Abbasi
Ali Nazari
Aminreza Sefid
Mohammadali Banayeeanzade
M. Rohban
M. Baghshah
VLM
89
1
0
27 Feb 2025
Image Referenced Sketch Colorization Based on Animation Creation Workflow
Image Referenced Sketch Colorization Based on Animation Creation Workflow
Dingkun Yan
Xinrui Wang
Zhuoru Li
Suguru Saito
Yusuke Iwasawa
Y. Matsuo
Jiaxian Guo
DiffM
64
0
0
27 Feb 2025
Tight Inversion: Image-Conditioned Inversion for Real Image Editing
Tight Inversion: Image-Conditioned Inversion for Real Image Editing
Edo Kadosh
Nir Goren
Or Patashnik
Daniel Garibi
Daniel Cohen-Or
DiffM
74
0
0
27 Feb 2025
Analyzing CLIP's Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study
Analyzing CLIP's Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study
Reza Abbasi
Ali Nazari
Aminreza Sefid
Mohammadali Banayeeanzade
M. Rohban
M. Baghshah
VLM
64
1
0
27 Feb 2025
SubZero: Composing Subject, Style, and Action via Zero-Shot Personalization
SubZero: Composing Subject, Style, and Action via Zero-Shot Personalization
Shubhankar Borse
K. Bhardwaj
Mohammad Reza Karimi Dastjerdi
Hyojin Park
Shreya Kadambi
...
Prathamesh Mandke
Ankita Nayak
Harris Teague
Munawar Hayat
Fatih Porikli
DiffM
84
1
0
27 Feb 2025
Attention Distillation: A Unified Approach to Visual Characteristics Transfer
Attention Distillation: A Unified Approach to Visual Characteristics Transfer
Yang Zhou
Xu Gao
Zichong Chen
Hui Huang
DiffM
73
5
0
27 Feb 2025
SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation
SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation
Junlong Ren
Hao Wu
Hui Xiong
Haoran Wang
68
0
0
26 Feb 2025
HRR: Hierarchical Retrospection Refinement for Generated Image Detection
HRR: Hierarchical Retrospection Refinement for Generated Image Detection
Peipei Yuan
Zijing Xie
Shuo Ye
Hong Chen
Yulong Wang
DiffM
154
1
0
25 Feb 2025
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
Yifan Pu
Yiming Zhao
Zhicong Tang
Ruihong Yin
Haoxing Ye
...
Ji Li
Xiu Li
Zheng Lian
Gao Huang
Baining Guo
DiffM
64
2
0
25 Feb 2025
FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance
Mintong Kang
Vinayshekhar Bannihatti Kumar
Shamik Roy
Abhishek Kumar
Sopan Khosla
Balakrishnan Narayanaswamy
Rashmi Gangadharaiah
50
0
0
25 Feb 2025
LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation
LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation
Pengzhi Li
Pengfei Yu
Zide Liu
Wei He
Xuhao Pan
Xudong Rao
Tao Wei
Wei Chen
VLM
60
0
0
25 Feb 2025
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Shuai Yang
Jing Tan
Mengchen Zhang
Tong Wu
Yong Li
Gordon Wetzstein
Ziwei Liu
Dahua Lin
MDE
VGen
59
6
0
24 Feb 2025
CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models
CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models
Shunchang Liu
Zhuan Shi
Lingjuan Lyu
Yaochu Jin
Boi Faltings
66
2
0
24 Feb 2025
Aligning Compound AI Systems via System-level DPO
Aligning Compound AI Systems via System-level DPO
Xiangwen Wang
Yibo Jacky Zhang
Zhoujie Ding
Katherine Tsai
Sanmi Koyejo
38
0
0
24 Feb 2025
Compact Latent Representation for Image Compression (CLRIC)
Compact Latent Representation for Image Compression (CLRIC)
Ayman A. Ameen
Thomas Richter
André Kaup
63
0
0
24 Feb 2025
Previous
123...789...313233
Next