ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01952
  4. Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
ArXivPDFHTML

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

50 / 1,630 papers shown
Title
TeLL Me what you cant see
TeLL Me what you cant see
Saverio Cavasin
Pietro Biasetton
Mattia Tamiazzo
Mauro Conti
Simone Milani
DiffM
45
0
0
25 Mar 2025
Reverse Prompt: Cracking the Recipe Inside Text-to-Image Generation
Reverse Prompt: Cracking the Recipe Inside Text-to-Image Generation
Zhiyao Ren
Yibing Zhan
B. Yu
Dacheng Tao
DiffM
74
0
0
25 Mar 2025
IPGO: Indirect Prompt Gradient Optimization for Parameter-Efficient Prompt-level Fine-Tuning on Text-to-Image Models
IPGO: Indirect Prompt Gradient Optimization for Parameter-Efficient Prompt-level Fine-Tuning on Text-to-Image Models
Jianping Ye
Michel Wedel
Kunpeng Zhang
44
0
0
25 Mar 2025
PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models
PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models
Junhyuk So
Jiwoong Shin
Chaeyeon Jang
Eunhyeok Park
DiffM
53
0
0
25 Mar 2025
AvatarArtist: Open-Domain 4D Avatarization
AvatarArtist: Open-Domain 4D Avatarization
Hongyu Liu
Xuan Wang
Bo Liu
Yue Ma
Jingye Chen
Yanbo Fan
Yujun Shen
Yibing Song
Qifeng Chen
41
0
0
25 Mar 2025
LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration
LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration
Yuyao Zhang
Jinghao Li
Yu-Wing Tai
DiffM
64
0
0
25 Mar 2025
A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design
A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design
Jie Tian
Martin Taylor Sobczak
Dhanush Patil
Jixin Hou
Lin Pang
...
Yuval Golan
Xiaoming Zhai
Hongyue Sun
Kenan Song
Xiaobei Wang
LLMAG
AI4CE
55
0
0
25 Mar 2025
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model
Zhiqiang Zhang
J. Li
Zunnan Xu
Hanhui Li
Yiji Cheng
Fa-Ting Hong
Qin Lin
Qinglin Lu
Xiaodan Liang
DiffM
73
1
0
25 Mar 2025
ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning
ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning
Jiaqi Liao
Zhengyuan Yang
Linjie Li
Dianqi Li
Kevin Qinghong Lin
Yu-Xi Cheng
Lijuan Wang
MLLM
LRM
62
0
0
25 Mar 2025
MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Agentic Post-Processing
MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Agentic Post-Processing
Lingting Zhu
Jingrui Ye
Runze Zhang
Zeyu Hu
Yingda Yin
...
Jinnan Chen
Shengju Qian
Xin Wang
Qingmin Liao
L. Yu
60
2
0
24 Mar 2025
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Jinho Jeong
Sangmin Han
Jinwoo Kim
Seon Joo Kim
42
0
0
24 Mar 2025
RomanTex: Decoupling 3D-aware Rotary Positional Embedded Multi-Attention Network for Texture Synthesis
RomanTex: Decoupling 3D-aware Rotary Positional Embedded Multi-Attention Network for Texture Synthesis
Yifei Feng
M. Yang
Steve Yang
Sheng Zhang
Jianwei Yu
Zibo Zhao
Yuhong Liu
Jie Jiang
Chunchao Guo
DiffM
61
0
0
24 Mar 2025
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
Jinjin Zhang
Qiuyu Huang
Junjie Liu
Xiefan Guo
Di Huang
62
2
0
24 Mar 2025
Hiding Images in Diffusion Models by Editing Learned Score Functions
Hiding Images in Diffusion Models by Editing Learned Score Functions
Haoyu Chen
Yunqiao Yang
Nan Zhong
Kede Ma
DiffM
68
0
0
24 Mar 2025
Coeff-Tuning: A Graph Filter Subspace View for Tuning Attention-Based Large Models
Coeff-Tuning: A Graph Filter Subspace View for Tuning Attention-Based Large Models
Zichen Miao
Wei Chen
Qiang Qiu
92
1
0
24 Mar 2025
Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning
Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning
Sherry X. Chen
Misha Sra
Pradeep Sen
55
0
0
24 Mar 2025
Human Motion Unlearning
Human Motion Unlearning
Edoardo De Matteis
Matteo Migliarini
Alessio Sampieri
Indro Spinelli
Fabio Galasso
MU
60
0
0
24 Mar 2025
ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation
ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation
Guosheng Zhao
Xiaofeng Wang
Chaojun Ni
Zheng Zhu
Wenkang Qin
Guan Huang
Xingang Wang
76
1
0
24 Mar 2025
Panorama Generation From NFoV Image Done Right
Panorama Generation From NFoV Image Done Right
Dian Zheng
Cheng Zhang
Xiao-Ming Wu
Cao Li
Chengfei Lv
Jian-Fang Hu
Wei-Shi Zheng
DiffM
81
0
0
24 Mar 2025
Color Conditional Generation with Sliced Wasserstein Guidance
Color Conditional Generation with Sliced Wasserstein Guidance
Alexander Lobashev
Maria Larchenko
Dmitry Guskov
DiffM
48
0
0
24 Mar 2025
InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment
InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment
Yunfan LU
Qichao Wang
H. Cao
Xierui Wang
Xiaoyin Xu
Min Zhang
64
0
0
24 Mar 2025
Training-free Diffusion Acceleration with Bottleneck Sampling
Training-free Diffusion Acceleration with Bottleneck Sampling
Ye Tian
Xin Xia
Yuxi Ren
Shanchuan Lin
Xing Wang
Xuefeng Xiao
Yunhai Tong
L. Yang
Bin Cui
60
0
0
24 Mar 2025
TCFG: Tangential Damping Classifier-free Guidance
TCFG: Tangential Damping Classifier-free Guidance
Mingi Kwon
Shin seong Kim
Jaeseok Jeong. Yi Ting Hsiao
Youngjung Uh
DiffM
65
0
0
23 Mar 2025
CODA: Repurposing Continuous VAEs for Discrete Tokenization
CODA: Repurposing Continuous VAEs for Discrete Tokenization
Zeyu Liu
Zanlin Ni
Yeguo Hua
Xin Deng
Xiao Ma
Cheng Zhong
Gao Huang
47
0
0
22 Mar 2025
Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models
Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models
Ketan Suhaas Saichandran
Xavier Thomas
Prakhar Kaushik
Deepti Ghadiyaram
DiffM
78
0
0
22 Mar 2025
RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation
RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation
Zhiqiang Yuan
Ting Zhang
Ying Deng
Jiapei Zhang
Yeshuang Zhu
Zexi Jia
Jie Zhou
Jinchao Zhang
VGen
41
0
0
22 Mar 2025
FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy
FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy
Xingchao Yang
Takafumi Taketomi
Yuki Endo
Yoshihiro Kanamori
DiffM
46
0
0
21 Mar 2025
Real-Time Diffusion Policies for Games: Enhancing Consistency Policies with Q-Ensembles
Real-Time Diffusion Policies for Games: Enhancing Consistency Policies with Q-Ensembles
Ruoqi Zhang
Ziwei Luo
Jens Sjölund
Per Mattsson
Linus Gisslén
Alessandro Sestini
44
1
0
21 Mar 2025
D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens
D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens
Panpan Wang
Liqiang Niu
Fandong Meng
Jinan Xu
Yufeng Chen
Jie Zhou
DiffM
50
0
0
21 Mar 2025
HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis
HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis
Mengtian Li
Jinshu Chen
Wanquan Feng
Bingchuan Li
Fei Dai
Mingcong Liu
Qian He
3DH
52
0
0
21 Mar 2025
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Qingyu Shi
Jianzong Wu
Jinbin Bai
Jun Zhang
Lu Qi
Xiaomeng Li
Yunhai Tong
48
0
0
21 Mar 2025
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
Fanghua Yu
Jinjin Gu
Jinfan Hu
Zheyuan Li
Chao Dong
DiffM
55
0
0
21 Mar 2025
Scale-wise Distillation of Diffusion Models
Scale-wise Distillation of Diffusion Models
Nikita Starodubcev
Denis Kuznedelev
Artem Babenko
Dmitry Baranchuk
DiffM
53
0
0
20 Mar 2025
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Liming Jiang
Qing Yan
Yumin Jia
Zichuan Liu
Hao Kang
Xin Lu
49
1
0
20 Mar 2025
World Knowledge from AI Image Generation for Robot Control
World Knowledge from AI Image Generation for Robot Control
Jonas Krumme
C. Zetzsche
LM&Ro
55
0
0
20 Mar 2025
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Yanjie Wang
Zhijie Lin
Yao Teng
Yuanzhi Zhu
Shuhuai Ren
Jiashi Feng
Xihui Liu
53
0
0
20 Mar 2025
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
Philipp Becker
Abhinav Mehrotra
Ruchika Chavhan
Malcolm Chadwick
Luca Morreale
Mehdi Noroozi
Alberto Gil C. P. Ramos
Sourav Bhattacharya
51
0
0
20 Mar 2025
Do Visual Imaginations Improve Vision-and-Language Navigation Agents?
Do Visual Imaginations Improve Vision-and-Language Navigation Agents?
Akhil Perincherry
Jacob Krantz
Stefan Lee
LM&Ro
41
1
0
20 Mar 2025
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
Hui Zhang
Tingwei Gao
Jie Shao
Zuxuan Wu
69
0
0
20 Mar 2025
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
Tianyi Wei
Yifan Zhou
Dongdong Chen
Xingang Pan
77
0
0
20 Mar 2025
Single Image Iterative Subject-driven Generation and Editing
Single Image Iterative Subject-driven Generation and Editing
Yair Shpitzer
Gal Chechik
Idan Schwartz
53
0
0
20 Mar 2025
A Recipe for Generating 3D Worlds From a Single Image
A Recipe for Generating 3D Worlds From a Single Image
Katja Schwarz
Denys Rozumnyi
Samuel Rota Buló
Lorenzo Porzi
Peter Kontschieder
VGen
79
1
0
20 Mar 2025
Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
Jin Wang
Chenghui Lv
Xian Li
Shichao Dong
Huadong Li
Kelu Yao
Chao Li
Wenqi Shao
Ping Luo
62
1
0
19 Mar 2025
Efficient Personalization of Quantized Diffusion Model without Backpropagation
Efficient Personalization of Quantized Diffusion Model without Backpropagation
H. Seo
Wongi Jeong
Kyungryeol Lee
Se Young Chun
DiffM
MQ
78
0
0
19 Mar 2025
LEGION: Learning to Ground and Explain for Synthetic Image Detection
LEGION: Learning to Ground and Explain for Synthetic Image Detection
Hengrui Kang
Siwei Wen
Zichen Wen
Junyan Ye
Weijia Li
...
Baichuan Zhou
Bin Wang
Dahua Lin
Linfeng Zhang
Conghui He
49
0
0
19 Mar 2025
Visual Persona: Foundation Model for Full-Body Human Customization
Visual Persona: Foundation Model for Full-Body Human Customization
Jisu Nam
Soowon Son
Zhan Xu
Jing Shi
Difan Liu
Feng Liu
Aashish Misraa
Seungryong Kim
Yang Zhou
DiffM
51
0
0
19 Mar 2025
TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models
TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models
Teng-Fang Hsiao
Bo-Kai Ruan
Yi-Lun Wu
Tzu-Ling Lin
Hong-Han Shuai
VLM
53
0
0
19 Mar 2025
POSTA: A Go-to Framework for Customized Artistic Poster Generation
POSTA: A Go-to Framework for Customized Artistic Poster Generation
Haoyu Chen
Xiaojie Xu
Wenbo Li
Jingjing Ren
Tian Ye
Songhua Liu
Ying Chen
Lei Zhu
Xinchao Wang
DiffM
57
1
0
19 Mar 2025
FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers
FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers
Ruichen Chen
Keith G. Mills
Di Niu
MQ
59
0
0
19 Mar 2025
The Power of Context: How Multimodality Improves Image Super-Resolution
The Power of Context: How Multimodality Improves Image Super-Resolution
Kangfu Mei
Hossein Talebi
Mojtaba Ardakani
Vishal M. Patel
P. Milanfar
M. Delbracio
DiffM
85
1
0
18 Mar 2025
Previous
123456...313233
Next