2302.05543
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization
Jiafeng Mao
Xueting Wang
Kiyoharu Aizawa
DiffM
105
5
0
13 Dec 2023
Diffusion Models Enable Zero-Shot Pose Estimation for Lower-Limb Prosthetic Users
Tianxun Zhou
Muhammad Nur Shahril Iskandar
K. Chiam
DiffM
59
0
0
13 Dec 2023
HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation
Hongyu Liu
Xuan Wang
Bo Liu
Yujun Shen
Yibing Song
Jing Liao
Qifeng Chen
DiffM
102
17
0
12 Dec 2023
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Sicheng Mo
Fangzhou Mu
Kuan Heng Lin
Yanli Liu
Bochen Guan
Yin Li
Bolei Zhou
DiffM
105
67
0
12 Dec 2023
EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection
Xuanyu Zhang
Runyi Li
Jiwen Yu
You-song Xu
Weiqi Li
Jian Zhang
WIGM
110
51
0
12 Dec 2023
Boosting Latent Diffusion with Flow Matching
Johannes S. Fischer
Ming Gui
Pingchuan Ma
Nick Stracke
S. A. Baumann
Björn Ommer
104
24
0
12 Dec 2023
GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
Tomáš Souček
Dima Damen
Michael Wray
Ivan Laptev
Josef Sivic
VGen
88
21
0
12 Dec 2023
NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image
Yoonwoo Jeong
Jinwoo Lee
Chiheon Kim
Minsu Cho
Doyup Lee
59
4
0
12 Dec 2023
LatentMan: Generating Consistent Animated Characters using Image Diffusion Models
Abdelrahman Eldesokey
Peter Wonka
39
4
0
12 Dec 2023
Open-Pose 3D Zero-Shot Learning: Benchmark and Challenges
Weiguang Zhao
Guanyu Yang
Rui Zhang
Chenru Jiang
Chaolong Yang
Yuyao Yan
Amir Hussain
Kaizhu Huang
VLM
72
5
0
12 Dec 2023
CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Jie Xiao
Kai Zhu
Han Zhang
Zhiheng Liu
Yujun Shen
Yu Liu
Xueyang Fu
Zheng-Jun Zha
DiffM
74
11
0
12 Dec 2023
MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing
Kangneng Zhou
Daiheng Gao
Xuan Wang
Jie Zhang
Peng Zhang
...
Shiqi Yang
Bang Zhang
Liefeng Bo
Yaxing Wang
Ming-Ming Cheng
DiffM
112
4
0
12 Dec 2023
Relightful Harmonization: Lighting-aware Portrait Background Replacement
Mengwei Ren
Wei Xiong
Jae Shin Yoon
Zhixin Shu
Jianming Zhang
HyunJoon Jung
Guido Gerig
He Zhang
DiffM
102
24
0
11 Dec 2023
CAD: Photorealistic 3D Generation via Adversarial Distillation
Bo Liu
Despoina Paschalidou
Ian Huang
Hongyu Liu
Bokui Shen
Xiaoyu Xiang
Jing Liao
Leonidas Guibas
DiffM
146
11
0
11 Dec 2023
UpFusion: Novel View Diffusion from Unposed Sparse View Observations
Bharath Raj Nagoor Kani
Hsin-Ying Lee
Sergey Tulyakov
Shubham Tulsiani
84
6
0
11 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Oğuzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
99
74
0
11 Dec 2023
Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
Shangchen Zhou
Peiqing Yang
Jianyi Wang
Yihang Luo
Chen Change Loy
VGen
171
48
0
11 Dec 2023
DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection
Haoyang He
Jiangning Zhang
Hongxu Chen
Xuhai Chen
Zhishan Li
Xu Chen
Yabiao Wang
Chengjie Wang
Lei Xie
DiffM
75
32
0
11 Dec 2023
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models
Denis Zavadski
Johann-Friedrich Feiden
Carsten Rother
DiffM
81
10
0
11 Dec 2023
InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following
Shufan Li
Harkanwar Singh
Aditya Grover
DiffM
93
10
0
11 Dec 2023
DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior
Tianyu Huang
Yihan Zeng
Zhilu Zhang
Wan Xu
Hang Xu
Songcen Xu
Rynson W. H. Lau
Wangmeng Zuo
46
26
0
11 Dec 2023
Offloading and Quality Control for AI Generated Content Services in 6G Mobile Edge Computing Networks
Yi-Ting Wang
Chang Liu
Jun Zhao
49
1
0
11 Dec 2023
DisControlFace: Disentangled Control for Personalized Facial Image Editing
Haozhe Jia
Yan Li
Hengfei Cui
Di Xu
Changpeng Yang
Yuwang Wang
Tao Yu
DiffM
60
1
0
11 Dec 2023
Learning Naturally Aggregated Appearance for Efficient 3D Editing
Ka Leong Cheng
Qiuyu Wang
Zifan Shi
Kecheng Zheng
Yinghao Xu
Ouyang Hao
Qifeng Chen
Yujun Shen
3DH
137
4
0
11 Dec 2023
Correcting Diffusion Generation through Resampling
Yujian Liu
Yang Zhang
Tommi Jaakkola
Shiyu Chang
103
8
0
10 Dec 2023
PSCR: Patches Sampling-based Contrastive Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Linjing Cao
Jinlong Lin
Xixin Cao
EGVM
74
11
0
10 Dec 2023
A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
Maomao Li
Yu Li
Tianyu Yang
Yunfei Liu
Dongxu Yue
Zhihui Lin
Dong Xu
VGen
36
9
0
10 Dec 2023
InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models
Jiun Tian Hoe
Xudong Jiang
Chee Seng Chan
Yap-Peng Tan
Weipeng Hu
85
14
0
10 Dec 2023
NLLG Quarterly arXiv Report 09/23: What are the most influential current AI Papers?
Ran Zhang
Aida Kostikova
Christoph Leiter
Jonas Belouadi
Daniil Larionov
Yanran Chen
Vivian Fresen
Steffen Eger
81
0
0
09 Dec 2023
BARET: Balanced Attention based Real image Editing driven by Target-text Inversion
Yuming Qiao
Fanyi Wang
Jingwen Su
Yanhao Zhang
Yunjie Yu
Siyu Wu
Guo-Jun Qi
DiffM
49
4
0
09 Dec 2023
Exploring the Naturalness of AI-Generated Images
Zijian Chen
Wei Sun
Haoning Wu
Zicheng Zhang
Jun Jia
...
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guangtao Zhai
Wenjun Zhang
109
22
0
09 Dec 2023
Efficient Quantization Strategies for Latent Diffusion Models
Yuewei Yang
Xiaoliang Dai
Jialiang Wang
Peizhao Zhang
Hongbo Zhang
DiffM
MQ
88
14
0
09 Dec 2023
NoiseCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models
Yusuf Dalva
Pinar Yanardag
DiffM
86
20
0
08 Dec 2023
Cross Domain Generative Augmentation: Domain Generalization with Latent Diffusion Models
S. Hemati
Mahdi Beitollahi
A. Estiri
Bassel Al Omari
Xi Chen
Guojun Zhang
69
7
0
08 Dec 2023
SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation
Thuan Hoang Nguyen
Anh Tran
DiffM
78
66
0
08 Dec 2023
Disentangled Clothed Avatar Generation from Text Descriptions
Jiong-Qi Wang
Yuan Liu
Zhiyang Dou
Zhengming Yu
Yongqing Liang
Xin Li
Wenping Wang
Rong Xie
Li Song
80
23
0
08 Dec 2023
ControlRoom3D: Room Generation using Semantic Proxy Rooms
Jonas Schult
Sam S. Tsai
Lukas Höllein
Bichen Wu
Jialiang Wang
...
Zijian He
Peizhao Zhang
Bastian Leibe
Peter Vajda
Ji Hou
80
34
0
08 Dec 2023
DreaMoving: A Human Video Generation Framework based on Diffusion Models
Mengyang Feng
Jinlin Liu
Kai Yu
Yuan Yao
Zheng Hui
...
Xiaoyang Kang
Biwen Lei
Miaomiao Cui
Peiran Ren
Xuansong Xie
VGen
64
29
0
08 Dec 2023
SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control
Jaskirat Singh
Jianming Zhang
Qing Liu
Cameron Smith
Zhe Lin
Liang Zheng
DiffM
80
11
0
08 Dec 2023
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao
Zhouhui Lian
118
30
0
08 Dec 2023
Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting
Xiaofeng Yang
Yiwen Chen
Cheng Chen
Chi Zhang
Yi Tian Xu
Xulei Yang
Fayao Liu
Guosheng Lin
3DGS
DiffM
70
18
0
08 Dec 2023
RS-Corrector: Correcting the Racial Stereotypes in Latent Diffusion Models
Yue Jiang
Yueming Lyu
Tianxiang Ma
Bo Peng
Jing Dong
119
4
0
08 Dec 2023
Reality's Canvas, Language's Brush: Crafting 3D Avatars from Monocular Video
Yuchen Rao
Eduardo Pérez-Pellitero
Benjamin Busam
Yiren Zhou
Jifei Song
86
0
0
08 Dec 2023
NeuSD: Surface Completion with Multi-View Text-to-Image Diffusion
Savva Ignatyev
Daniil Selikhanovych
Oleg Voynov
Yiqun Wang
Peter Wonka
Stamatios Lefkimmiatis
Evgeny Burnaev
62
0
0
07 Dec 2023
Gen2Det: Generate to Detect
Saksham Suri
Fanyi Xiao
Animesh Sinha
Sean Culatana
Raghuraman Krishnamoorthi
Chenchen Zhu
Abhinav Shrivastava
VLM
DiffM
93
10
0
07 Dec 2023
GenDeF: Learning Generative Deformation Field for Video Generation
Wen Wang
Kecheng Zheng
Qiuyu Wang
Hao Chen
Zifan Shi
Ceyuan Yang
Yujun Shen
Chunhua Shen
VGen
DiffM
76
2
0
07 Dec 2023
Inversion-Free Image Editing with Natural Language
Sihan Xu
Yidong Huang
Jiayi Pan
Ziqiao Ma
Joyce Chai
DiffM
94
66
0
07 Dec 2023
PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns
Shuliang Ning
Duomin Wang
Yipeng Qin
Zirong Jin
Baoyuan Wang
Xiaoguang Han
DiffM
75
12
0
07 Dec 2023
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models
Ozgur Kara
Barışcan Kurtkaya
Hidir Yesiltepe
James M. Rehg
Pinar Yanardag
VGen
DiffM
102
55
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
64
43
0
07 Dec 2023