2307.01952
Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Müller
Joe Penna
Robin Rombach
Github (25942★)
Papers citing
"SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
50 / 607 papers shown
ISAC: Training-Free Instance-to-Semantic Attention Control for Improving Multi-Instance Generation
Sanghyun Jo
Wooyeol Lee
Ziseok Lee
Kyungsu Kim
801
0
0
27 May 2025
A Stereotype Content Analysis on Color-related Social Bias in Large Vision Language Models
Junhyuk Choi
Minju Kim
Yeseon Hong
Bugeun Kim
64
0
0
27 May 2025
ConsiStyle: Style Diversity in Training-Free Consistent T2I Generation
Yohai Mazuz
Janna Bruner
Lior Wolf
DiffM
66
0
0
27 May 2025
Sci-Fi: Symmetric Constraint for Frame Inbetweening
Liuhan Chen
Xiaodong Cun
Xiaoyu Li
Xianyi He
Shenghai Yuan
Jie Chen
Ying Shan
Li Yuan
VGen
81
0
0
27 May 2025
Conditional Diffusion Models with Classifier-Free Gibbs-like Guidance
Badr Moufad
Yazid Janati
Alain Durmus
Ahmed Ghorbel
Eric Moulines
Jimmy Olsson
DiffM
74
0
0
27 May 2025
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Models
Dar-Yen Chen
Hmrishav Bandyopadhyay
Kai Zou
Yi-Zhe Song
54
0
0
27 May 2025
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
Jin Wang
Yao Lai
Aoxue Li
Shifeng Zhang
Jiacheng Sun
Ning Kang
Chengyue Wu
Zhenguo Li
Ping Luo
74
2
0
26 May 2025
What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
Lorenzo Baraldi
Davide Bucciarelli
Federico Betti
Marcella Cornia
Lorenzo Baraldi
N. Sebe
Rita Cucchiara
231
0
0
26 May 2025
Regularized Personalization of Text-to-Image Diffusion Models without Distributional Drift
Gihoon Kim
Hyungjin Park
Taesup Kim
DiffM
VLM
197
0
0
26 May 2025
MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models
Hang Hua
Ziyun Zeng
Yizhi Song
Yunlong Tang
Liu He
Daniel G. Aliaga
Wei Xiong
Jiebo Luo
EGVM
88
0
0
26 May 2025
In-Context Brush: Zero-shot Customized Subject Insertion with Context-Aware Latent Space Manipulation
Yu Xu
Fan Tang
You Wu
Lin Gao
Oliver Deussen
Hongbin Yan
Jintao Li
Juan Cao
Tong-Yee Lee
DiffM
49
0
0
26 May 2025
Adaptive Diffusion Guidance via Stochastic Optimal Control
Iskander Azangulov
Peter Potaptchik
Qinyu Li
Eddie Aamari
George Deligiannidis
Judith Rousseau
25
0
0
25 May 2025
Training-free Stylized Text-to-Image Generation with Fast Inference
X. Ma
Yaohui Wang
Xinyuan Chen
Tien-Tsin Wong
C. L. P. Chen
813
0
0
25 May 2025
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
H. Zhang
Dexiang Hong
Maoke Yang
Yutao Chen
Zhao Zhang
Jie Shao
Xinglong Wu
Zuxuan Wu
Yu Jiang
DiffM
AI4CE
174
0
0
25 May 2025
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation
Wenchao Zhang
Jiahe Tian
Runze He
Jizhong Han
Jiao Dai
Miaomiao Feng
Wei Mi
Xiaodan Zhang
113
0
0
24 May 2025
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
Yiren Song
Cheng Liu
Mike Zheng Shou
DiffM
180
2
0
24 May 2025
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
Jiayu Wang
Yang Jiao
Yue Yu
Tianwen Qian
Shaoxiang Chen
Jingjing Chen
Yu Jiang
MLLM
LM&MA
ELM
112
0
0
24 May 2025
Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation
Alexander Shabalin
Viacheslav Meshchaninov
Dmitry Vetrov
46
0
0
24 May 2025
Affective Image Editing: Shaping Emotional Factors via Text Descriptions
Peixuan Zhang
Shuchen Weng
Chengxuan Zhu
Binghao Tang
Zijian Jia
Si Li
Boxin Shi
DiffM
31
0
0
24 May 2025
A Minimalist Method for Fine-tuning Text-to-Image Diffusion Models
Yanting Miao
William Loh
Suraj Kothawade
Pascal Poupart
45
0
0
23 May 2025
SpikeGen: Generative Framework for Visual Spike Stream Processing
Gaole Dai
Menghang Dong
Rongyu Zhang
Ruichuan An
Shanghang Zhang
Tiejun Huang
DiffM
3DGS
48
0
0
23 May 2025
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
Jingjing Jiang
Chongjie Si
Jun Luo
Hanwang Zhang
Chao Ma
194
0
0
23 May 2025
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
Litao Guo
Xinli Xu
Luozhou Wang
Jiantao Lin
Jinsong Zhou
Zixin Zhang
Bolan Su
Ying-Cong Chen
LLMAG
LRM
86
1
0
23 May 2025
Creatively Upscaling Images with Global-Regional Priors
Yurui Qian
Qi Cai
Yingwei Pan
Ting Yao
Tao Mei
DiffM
204
0
0
22 May 2025
CDST: Color Disentangled Style Transfer for Universal Style Reference Customization
Shiwen Zhang
Zhuowei Chen
Lang Chen
Yanze Wu
23
0
0
22 May 2025
From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Reasoning-Driven Pedagogical Visualization
Haonian Ji
Shi Qiu
Siyang Xin
Siwei Han
Zhaorun Chen
Hongyi Wang
Dake Zhang
Huaxiu Yao
90
0
0
22 May 2025
My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping
Hon Ming Yam
Zhongliang Guo
Chun Pong Lau
DiffM
AAML
62
0
0
21 May 2025
LTDA-Drive: LLMs-guided Generative Models based Long-tail Data Augmentation for Autonomous Driving
Mahmut Yurt
Xin Ye
Yunsheng Ma
Jingru Luo
Abhirup Mallik
John Pauly
Burhaneddin Yaman
Liu Ren
46
0
0
21 May 2025
Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation
Xinran Wang
Muxi Diao
Yuanzhi Liu
Chunyu Wang
Kongming Liang
Zhanyu Ma
Jun Guo
94
0
0
21 May 2025
MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding
Yuxiang Wei
Yanteng Zhang
Xi Xiao
Tianyang Wang
Xiao Wang
Vince D. Calhoun
MoE
222
0
0
21 May 2025
MVPainter: Accurate and Detailed 3D Texture Generation via Multi-View Diffusion with Geometric Control
Mingqi Shao
Feng Xiong
Zhaoxu Sun
Mu Xu
DiffM
89
0
0
19 May 2025
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
Yicheng Xiao
Lin Song
Yukang Chen
Yingmin Luo
Yuxin Chen
Yukang Gan
Wei Huang
Xiu Li
Xiaojuan Qi
Ying Shan
LRM
107
5
0
19 May 2025
Improving Compositional Generation with Diffusion Models Using Lift Scores
Chenning Yu
Sicun Gao
DiffM
843
0
0
19 May 2025
Accelerate TarFlow Sampling with GS-Jacobi Iteration
Ben Liu
Zhen Qin
87
0
0
19 May 2025
Training Latent Diffusion Models with Interacting Particle Algorithms
Tim Y. J. Wang
Juan Kuntz
O. Deniz Akyildiz
118
0
0
18 May 2025
Video-GPT via Next Clip Diffusion
Shaobin Zhuang
Zhipeng Huang
Ying Zhang
Fangyikang Wang
Canmiao Fu
Binxin Yang
Chong Sun
Chen Li
Yali Wang
DiffM
VGen
243
0
0
18 May 2025
Accelerating Diffusion-based Super-Resolution with Dynamic Time-Spatial Sampling
Rui Qin
Qijie Wang
Ming Sun
Haowei Zhu
Chao Zhou
Bin Wang
141
0
0
17 May 2025
Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models
Fu-Yun Wang
Yunhao Shui
Jingtan Piao
Keqiang Sun
Hongsheng Li
97
4
0
16 May 2025
Diverging Towards Hallucination: Detection of Failures in Vision-Language Models via Multi-token Aggregation
Geigh Zollicoffer
Minh Vu
Manish Bhattarai
VLM
92
0
0
16 May 2025
DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models
Giulia Bertazzini
Daniele Baracchi
Dasara Shullani
Isao Echizen
Alessandro Piva
129
0
0
16 May 2025
Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration
Haipeng Fang
Sheng Tang
Juan Cao
Enshuo Zhang
Fan Tang
Tong-Yee Lee
98
0
0
16 May 2025
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
Bingda Tang
Boyang Zheng
Xichen Pan
Sayak Paul
Saining Xie
81
0
0
15 May 2025
Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field
Jinlong Fan
Xuepu Zeng
Jing Zhang
Mingming Gong
Yuxiang Yang
Dacheng Tao
3DGS
AI4CE
147
0
0
15 May 2025
InstanceGen: Image Generation with Instance-level Instructions
Etai Sella
Yanir Kleiman
Hadar Averbuch-Elor
95
0
0
08 May 2025
ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis
Onkar Susladkar
Gayatri Deshmukh
Yalcin Tur
Gorkem Durak
Ulas Bagci
MedIm
248
0
0
08 May 2025
Flow-GRPO: Training Flow Matching Models via Online RL
Jie Liu
Gongye Liu
Jiajun Liang
Yongqian Li
Jiaheng Liu
Xinyu Wang
Pengfei Wan
Di Zhang
Wanli Ouyang
AI4CE
223
5
0
08 May 2025
PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with Auto-Regressive Transformer
Jingwen Ye
Yuze He
Yanning Zhou
Yiqin Zhu
Kaiwen Xiao
Yong-Jin Liu
Wei Yang
Xiao Han
104
1
0
07 May 2025
Generating Synthetic Data via Augmentations for Improved Facial Resemblance in DreamBooth and InstantID
Koray Ulusan
Benjamin Kiefer
DiffM
102
0
0
06 May 2025
Using Knowledge Graphs to harvest datasets for efficient CLIP model training
Simon Ging
Sebastian Walter
Jelena Bratulić
Johannes Dienert
Hannah Bast
Thomas Brox
CLIP
63
0
0
05 May 2025
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Wei Wei
Jintao Guo
Shanshan Zhao
Minghao Fu
Lunhao Duan
...
Guo-Hua Wang
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
311
1
0
05 May 2025