2307.01952
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
Papers citing
"SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
50 / 1,635 papers shown
UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures
Mingyuan Zhou
Rakib Hyder
Ziwei Xuan
Guojun Qi
32
6
0
20 Jan 2024
DiffusionGPT: LLM-Driven Text-to-Image Generation System
Jie Qin
Jie Wu
Weifeng Chen
Yuxi Ren
Huixian Li
Hefeng Wu
Xuefeng Xiao
Rui Wang
S. Wen
DiffM
62
25
0
18 Jan 2024
Vlogger: Make Your Dream A Vlog
Shaobin Zhuang
Kunchang Li
Xinyuan Chen
Yaohui Wang
Ziwei Liu
Yu Qiao
Yali Wang
VGen
DiffM
43
35
0
17 Jan 2024
UniVG: Towards UNIfied-modal Video Generation
Ludan Ruan
Lei Tian
Chuanwei Huang
Xu Zhang
Xinyan Xiao
VGen
DiffM
34
3
0
17 Jan 2024
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Jonghyun Lee
Hansam Cho
Youngjoon Yoo
Seoung Bum Kim
Yonghyun Jeong
DiffM
23
7
0
17 Jan 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
126
280
0
17 Jan 2024
Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive
Yumeng Li
M. Keuper
Dan Zhang
Anna Khoreva
DiffM
43
10
0
16 Jan 2024
WAVES: Benchmarking the Robustness of Image Watermarks
Bang An
Mucong Ding
Tahseen Rabbani
Aakriti Agrawal
Yuancheng Xu
...
Sicheng Zhu
Abdirisak Mohamed
Yuxin Wen
Tom Goldstein
Furong Huang
33
40
0
16 Jan 2024
Instilling Multi-round Thinking to Text-guided Image Generation
Lidong Zeng
Zhedong Zheng
Yinwei Wei
Tat-Seng Chua
34
5
0
16 Jan 2024
HexaGen3D: StableDiffusion is just one step away from Fast and Diverse Text-to-3D Generation
Antoine Mercier
Ramin Nakhli
Mahesh Reddy
R. Yasarla
Hong Cai
Fatih Porikli
Guillaume Berger
DiffM
35
15
0
15 Jan 2024
InstantID: Zero-shot Identity-Preserving Generation in Seconds
Qixun Wang
Xu Bai
Haofan Wang
Zekui Qin
Anthony Chen
Huaxia Li
Xu Tang
Yao Hu
46
238
0
15 Jan 2024
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models
Junsong Chen
Yue Wu
Simian Luo
Enze Xie
Sayak Paul
Ping Luo
Hang Zhao
Zhenguo Li
VLM
36
72
0
10 Jan 2024
Memory-Efficient Fine-Tuning for Quantized Diffusion Model
Hyogon Ryu
Seohyun Lim
Hyunjung Shim
DiffM
MQ
27
6
0
09 Jan 2024
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
Tong Wu
Guandao Yang
Zhibing Li
Kai Zhang
Ziwei Liu
Leonidas J. Guibas
Dahua Lin
Gordon Wetzstein
EGVM
VGen
35
88
0
08 Jan 2024
Instruct-Imagen: Image Generation with Multi-modal Instruction
Hexiang Hu
Kelvin C. K. Chan
Yu-Chuan Su
Wenhu Chen
Yandong Li
...
Xue Ben
Boqing Gong
William W. Cohen
Ming-Wei Chang
Xuhui Jia
MLLM
46
43
0
03 Jan 2024
Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
David Junhao Zhang
Dongxu Li
Hung Le
Mike Zheng Shou
Caiming Xiong
Doyen Sahoo
VGen
27
23
0
03 Jan 2024
aMUSEd: An Open MUSE Reproduction
Suraj Patil
William Berman
Robin Rombach
Patrick von Platen
VLM
25
18
0
03 Jan 2024
SIGNeRF: Scene Integrated Generation for Neural Radiance Fields
Jan-Niklas Dihlmann
Andreas Engelhardt
Hendrik P. A. Lensch
DiffM
VGen
24
4
0
03 Jan 2024
ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text
Dingkun Yan
Liang Yuan
Erwin Wu
Yuma Nishioka
I. Fujishiro
Suguru Saito
DiffM
21
5
0
02 Jan 2024
Image Sculpting: Precise Object Editing with 3D Geometry Control
Jiraphon Yenphraphai
Xichen Pan
Sainan Liu
Daniele Panozzo
Saining Xie
34
19
0
02 Jan 2024
Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation
Renshuai Liu
Bowen Ma
Wei Zhang
Zhipeng Hu
Changjie Fan
Tangjie Lv
Yu-qiong Ding
Xuan Cheng
DiffM
27
20
0
02 Jan 2024
New Job, New Gender? Measuring the Social Bias in Image Generation Models
Wenxuan Wang
Haonan Bai
Jen-tse Huang
Yuxuan Wan
Youliang Yuan
Haoyi Qiu
Nanyun Peng
Michael R. Lyu
47
20
0
01 Jan 2024
DiffMorph: Text-less Image Morphing with Diffusion Models
Shounak Chatterjee
DiffM
15
0
0
01 Jan 2024
Diffusion Model with Perceptual Loss
Shanchuan Lin
Xiao Yang
DiffM
30
15
0
30 Dec 2023
4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency
Yuyang Yin
Dejia Xu
Zhangyang Wang
Yao-Min Zhao
Yunchao Wei
3DGS
57
72
0
28 Dec 2023
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Guansong Lu
Yuanfan Guo
Jianhua Han
Minzhe Niu
Yihan Zeng
Songcen Xu
Zeyi Huang
Zhao Zhong
Wei Zhang
Hang Xu
39
4
0
27 Dec 2023
One-Dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications
Mengyao Lyu
Yuhong Yang
Haiwen Hong
Hui Chen
Xuan Jin
Yuan He
Hui Xue
Jungong Han
Guiguang Ding
DiffM
29
58
0
26 Dec 2023
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Yuxuan Zhang
Yiren Song
Jiaming Liu
Rui Wang
Jinpeng Yu
...
Huaxia Li
Xu Tang
Yao Hu
Han Pan
Zhongliang Jing
49
58
0
26 Dec 2023
SAiD: Speech-driven Blendshape Facial Animation with Diffusion
Inkyu Park
Jaewoong Cho
34
4
0
25 Dec 2023
Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data Generation Framework using Foundational Models
Gurusha Juneja
Sukrit Kumar
DiffM
19
0
0
23 Dec 2023
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Xinyan Chen
Jiaxin Ge
Tianjun Zhang
Jiaming Liu
Shanghang Zhang
VLM
EGVM
42
0
0
23 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
20
241
0
21 Dec 2023
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan
Andranik Sargsyan
Barsegh Atanyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
35
28
0
21 Dec 2023
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
Desai Xie
Jiahao Li
Hao Tan
Xin Sun
Zhixin Shu
Yi Zhou
Sai Bi
Soren Pirk
Arie E. Kaufman
37
8
0
21 Dec 2023
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
Yiming Zhang
Zhening Xing
Yanhong Zeng
Youqing Fang
Kai Chen
VGen
36
27
0
21 Dec 2023
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
Xianfang Zeng
Xin Chen
Zhongqi Qi
Wen Liu
Zibo Zhao
Zhibin Wang
Bin-Bin Fu
Yong-jin Liu
Gang Yu
DiffM
18
67
0
21 Dec 2023
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling
Seung Wook Kim
Antonio Torralba
Sanja Fidler
Karsten Kreis
DiffM
3DGS
45
113
0
21 Dec 2023
Generative Multimodal Models are In-Context Learners
Quan-Sen Sun
Yufeng Cui
Xiaosong Zhang
Fan Zhang
Qiying Yu
...
Yueze Wang
Yongming Rao
Jingjing Liu
Tiejun Huang
Xinlong Wang
MLLM
LRM
45
247
0
20 Dec 2023
ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors
Weijia Mao
Yan-Pei Cao
Jia-Wei Liu
Zhongcong Xu
Mike Zheng Shou
DiffM
51
5
0
20 Dec 2023
RadEdit: stress-testing biomedical vision models via diffusion image editing
Fernando Pérez-García
Sam Bond-Taylor
Pedro P. Sanchez
B. V. Breugel
Daniel Coelho De Castro
...
M. Lungren
A. Nori
Javier Alvarez-Valle
Ozan Oktay
Maximilian Ilse
MedIm
43
8
0
20 Dec 2023
Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models
Nikita Starodubcev
Artem Fedorov
Artem Babenko
Dmitry Baranchuk
DiffM
50
3
0
17 Dec 2023
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Mingsheng Li
Xin Chen
C. Zhang
Sijin Chen
Erik Cambria
Fukun Yin
Gang Yu
Tao Chen
36
24
0
17 Dec 2023
M^2ConceptBase: A Fine-Grained Aligned Concept-Centric Multimodal Knowledge Base
Zhiwei Zha
Jiaan Wang
Zhixu Li
Xiangru Zhu
Wei Song
Yanghua Xiao
VLM
45
2
0
16 Dec 2023
Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology
Pedro Osório
Guillermo Jiménez-Pérez
Javier Montalt-Tordera
Jens Hooge
Guillem Duran Ballester
...
Sabrina Schroeder
K. Siudak
Julia Vienenkoetter
Bettina Lawrenz
Sadegh Mohammadi
MedIm
30
8
0
15 Dec 2023
Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Qin Guo
Tianwei Lin
DiffM
22
31
0
15 Dec 2023
ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
Ruoxi Shi
Xinyue Wei
Cheng Wang
Hao Su
41
16
0
14 Dec 2023
Reliability in Semantic Segmentation: Can We Use Synthetic Data?
Thibaut Loiseau
Tuan-Hung Vu
Mickaël Chen
Patrick Pérez
Matthieu Cord
UQCV
34
12
0
14 Dec 2023
DiffusionLight: Light Probes for Free by Painting a Chrome Ball
Pakkapon Phongthawee
Worameth Chinchuthakun
Nontaphat Sinsunthithet
Amit Raj
Varun Jampani
Pramook Khungurn
Supasorn Suwajanakorn
DiffM
35
23
0
14 Dec 2023
Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision
Shengguang Wu
Zhenglun Chen
Qi Su
DiffM
30
0
0
13 Dec 2023
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Tianxing Wu
Chenyang Si
Yuming Jiang
Ziqi Huang
Ziwei Liu
DiffM
VGen
38
45
0
12 Dec 2023