arXiv:2307.01952
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Müller
Joe Penna
Robin Rombach
Papers citing
"SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
50 / 1,630 papers shown
Title
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
D. Jiang
Ziyu Guo
Renrui Zhang
Zhuofan Zong
Hao Li
Le Zhuo
Shilin Yan
Pheng-Ann Heng
Yiming Li
LRM
72
2
0
01 May 2025
Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation
Vaidehi Patil
Yi-Lin Sung
Peter Hase
Jie Peng
Jen-tse Huang
Joey Tianyi Zhou
AAML
MU
83
3
0
01 May 2025
GarmentDiffusion: 3D Garment Sewing Pattern Generation with Multimodal Diffusion Transformers
Xinyu Li
Qi Yao
Yanjie Wang
DiffM
48
0
0
30 Apr 2025
Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing
Hong Zhang
Zhongjie Duan
Xingjun Wang
Yuze Zhao
Weiyi Lu
Zhipeng Di
Yongjun Xu
Yingda Chen
Yu Zhang
MLLM
94
1
0
30 Apr 2025
Can We Achieve Efficient Diffusion without Self-Attention? Distilling Self-Attention into Convolutions
Ziyi Dong
Chengxing Zhou
Weijian Deng
Pengxu Wei
Xiangyang Ji
Liang Lin
MQ
53
0
0
30 Apr 2025
AGHI-QA: A Subjective-Aligned Dataset and Metric for AI-Generated Human Images
Yunhao Li
Sijing Wu
Wei Sun
Zhichao Zhang
Yucheng Zhu
Zicheng Zhang
Huiyu Duan
Xiongkuo Min
Guangtao Zhai
EGVM
93
0
0
30 Apr 2025
YoChameleon: Personalized Vision and Language Generation
Thao Nguyen
Krishna Kumar Singh
Jing Shi
Trung H. Bui
Yong Jae Lee
Yuheng Li
MLLM
82
0
0
29 Apr 2025
T2ID-CAS: Diffusion Model and Class Aware Sampling to Mitigate Class Imbalance in Neck Ultrasound Anatomical Landmark Detection
Manikanta Varaganti
Amulya Vankayalapati
Nour Awad
Gregory R. Dion
Laura J. Brattain
DiffM
MedIm
67
0
0
29 Apr 2025
TrueFake: A Real World Case Dataset of Last Generation Fake Images also Shared on Social Networks
S. Dell’Anna
Andrea Montibeller
Giulia Boato
62
0
0
29 Apr 2025
Image Generation Method Based on Heat Diffusion Models
Pengfei Zhang
Shouqing Jia
DiffM
VLM
47
0
0
28 Apr 2025
DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer
Junpeng Jiang
Gangyi Hong
Miao Zhang
Hengtong Hu
Kun Zhan
Rui Shao
Liqiang Nie
VGen
51
0
0
28 Apr 2025
RepText: Rendering Visual Text via Replicating
Haozhao Wang
Yongjun Xu
Yong Li
Jiajun Li
Chaowei Zhang
Jingchao Wang
Kejia Yang
Z. Chen
VLM
66
0
0
28 Apr 2025
ShowMak3r: Compositional TV Show Reconstruction
S. Kim
Seunguk Do
Jaesik Park
VGen
43
0
0
28 Apr 2025
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Haoran Geng
Feishi Wang
Songlin Wei
Y. Li
Bangjun Wang
...
Hao Dong
Siyuan Huang
Yue Wang
Jitendra Malik
Pieter Abbeel
85
4
0
26 Apr 2025
REED-VAE: RE-Encode Decode Training for Iterative Image Editing with Diffusion Models
Gal Almog
Ariel Shamir
Ohad Fried
DiffM
63
0
0
26 Apr 2025
Dual Prompting Image Restoration with Diffusion Transformers
Dehong Kong
Fan Li
Zhixin Wang
Jiaqi Xu
Renjing Pei
W. J. Li
Wenqi Ren
DiffM
69
0
0
24 Apr 2025
AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models
Mohammad Zarei
Melanie A Jutras
Eliana Evans
Mike Tan
Omid Aaramoon
AAML
DiffM
55
0
0
24 Apr 2025
Text-to-Image Alignment in Denoising-Based Models through Step Selection
P. Grimal
Hervé Le Borgne
Olivier Ferret
DiffM
EGVM
48
0
0
24 Apr 2025
Step1X-Edit: A Practical Framework for General Image Editing
Shixuan Liu
Yucheng Han
Peng Xing
Fukun Yin
Rui Wang
...
Yibo Zhu
Binxing Jiao
Xuzhi Zhang
Gang Yu
Daxin Jiang
DiffM
111
4
0
24 Apr 2025
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models
Xu Ma
Peize Sun
Haoyu Ma
Hao Tang
Chih-Yao Ma
...
Matt Feiszli
Peizhao Zhang
Peter Vajda
Sam S. Tsai
Y. Fu
73
1
0
24 Apr 2025
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
Le Zhuo
Liangbing Zhao
Sayak Paul
Yue Liao
Renrui Zhang
Yi Xin
Peng Gao
Mohamed Elhoseiny
Yiming Li
VLM
75
0
0
22 Apr 2025
Structure-Preserving Zero-Shot Image Editing via Stage-Wise Latent Injection in Diffusion Models
Dasol Jeong
Donggoo Kang
Jiwon Park
Hyebean Lee
Joonki Paik
DiffM
27
0
0
22 Apr 2025
Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model
Ahmed Sobhi Saleh
Kristof Croes
Hajdin Ceric
Ingrid De Wolf
Houman Zahedmanesh
DiffM
29
0
0
21 Apr 2025
A Controllable Appearance Representation for Flexible Transfer and Editing
Santiago Jimenez-Navarro
Julia Guerrero-Viu
B. Masiá
DiffM
33
0
0
21 Apr 2025
MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World
Ankit Dhiman
Manan Shah
R. V. Babu
31
0
0
21 Apr 2025
Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration
Junyuan Deng
Xinyi Wu
Yongxing Yang
Congchao Zhu
Song Wang
Zhenyao Wu
38
0
0
21 Apr 2025
"I Know It When I See It": Mood Spaces for Connecting and Expressing Visual Concepts
Huzheng Yang
Katherine Xu
Michael D. Grossberg
Yutong Bai
Jianbo Shi
39
0
0
21 Apr 2025
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens
Kaihang Pan
Wang Lin
Zhongqi Yue
Tenglong Ao
Liyu Jia
Wei Zhao
Juncheng Billy Li
Siliang Tang
Hanwang Zhang
52
2
0
20 Apr 2025
SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization
Liang Peng
Boxi Wu
Haoran Cheng
Yibo Zhao
Xiaofei He
36
0
0
20 Apr 2025
REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models
Chongye Guo
Jinhu Fu
Junfeng Fang
Kun Wang
Guorui Feng
39
0
0
20 Apr 2025
NTIRE 2025 Challenge on Real-World Face Restoration: Methods and Results
Zheng Chen
Jun Wang
Kai Liu
Jue Gong
Lei Sun
...
J. Lee
C. Lee
Chih-Chung Hsu
Hu Peng
Chunming He
61
3
0
20 Apr 2025
Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis
Zichuan Liu
Liming Jiang
Qing Yan
Yumin Jia
Hao Kang
Xin Lu
DiffM
31
0
0
19 Apr 2025
Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation
Fulvio Sanguigni
Davide Morelli
Marcella Cornia
Rita Cucchiara
DiffM
40
0
0
18 Apr 2025
ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis
Andrea Rigo
Luca Stornaiuolo
Mauro Martino
Bruno Lepri
N. Sebe
48
0
0
18 Apr 2025
Image Editing with Diffusion Models: A Survey
Jia Wang
Jie Hu
Xiaoqi Ma
Hanghang Ma
Xiaoming Wei
Enhua Wu
74
0
0
17 Apr 2025
Science-T2I: Addressing Scientific Illusions in Image Synthesis
Jialuo Li
Wenhao Chai
Xingyu Fu
Haiyang Xu
Saining Xie
MedIm
43
0
0
17 Apr 2025
HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation
Wenqi Dong
Bangbang Yang
Zesong Yang
Yuan Li
Tao Hu
Ruixing Wang
Yuewen Ma
Zhaopeng Cui
65
0
0
17 Apr 2025
Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off
Riza Velioglu
Petra Bevandic
Robin Chan
Barbara Hammer
DiffM
36
0
0
17 Apr 2025
ICAS: IP Adapter and ControlNet-based Attention Structure for Multi-Subject Style Transfer Optimization
Fuwei Liu
DiffM
43
0
0
17 Apr 2025
A Survey on Cross-Modal Interaction Between Music and Multimodal Data
Sifei Li
Mining Tan
Feier Shen
Minyan Luo
Zijiao Yin
Fan Tang
W. Dong
Changsheng Xu
69
0
0
17 Apr 2025
UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models
Guanlong Jiao
Biqing Huang
Kuan-Chieh Wang
Renjie Liao
DiffM
85
0
0
17 Apr 2025
ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models
Linkang Du
Zheng Zhu
M. Chen
Zhou Su
S. Ji
Peng Cheng
Jiming Chen
Zhikun Zhang
DiffM
WIGM
MLAU
71
0
0
17 Apr 2025
TwoSquared: 4D Generation from 2D Image Pairs
Lu Sang
Zehranaz Canfes
Dongliang Cao
Riccardo Marin
Florian Bernard
Daniel Cremers
149
0
0
17 Apr 2025
Learning What NOT to Count
Adriano D'Alessandro
Ali Mahdavi-Amiri
Ghassan Hamarneh
32
0
0
16 Apr 2025
VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Zhihang Yuan
Rui Xie
Yuzhang Shang
Hao Zhang
Siyuan Wang
Shengen Yan
Guohao Dai
Yu Wang
DiffM
VGen
42
0
0
16 Apr 2025
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework
Jiale Tao
Yanbing Zhang
Qixun Wang
Yiji Cheng
Haofan Wang
...
Ruihuang Li
Linqing Wang
Chunyu Wang
Qin Lin
Qinglin Lu
DiffM
47
1
0
16 Apr 2025
Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach
Lvpan Cai
Haowei Wang
Jiayi Ji
YanShu ZhouMen
Yiwei Ma
Xiaoshuai Sun
Liujuan Cao
Rongrong Ji
ViT
39
0
0
16 Apr 2025
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
Tianhui Song
Weixin Feng
Shuai Wang
Xinfeng Li
Tiezheng Ge
Bo Zheng
Limin Wang
MoMe
62
0
0
16 Apr 2025
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
Bingjie Gao
Xinyu Gao
Xiaoxue Wu
Yujie Zhou
Yu Qiao
Li Niu
Xinyuan Chen
Yaohui Wang
76
0
0
16 Apr 2025
Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis
Songping Wang
Yueming Lyu
Shiqi Liu
Ning Li
Tong Tong
Hao Sun
Caifeng Shan
PICV
72
0
0
16 Apr 2025