2302.05543
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image
Dimitrije Antić
Sai Kumar Dwivedi
Shashank Tripathi
Theo Gevers
Dimitrios Tzionas
174
2
0
24 Sep 2024
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Yifang Men
Yuan Yao
Miaomiao Cui
Liefeng Bo
DiffM
138
30
0
24 Sep 2024
Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
Alireza Ganjdanesh
Yan Kang
Yuchen Liu
Richard Y. Zhang
Zhe Lin
Heng Huang
DiffM
105
5
0
23 Sep 2024
ControlEdit: A MultiModal Local Clothing Image Editing Method
Di Cheng
YingJie Shi
ShiXin Sun
JiaFu Zhang
WeiJing Wang
Yu Liu
DiffM
59
0
0
23 Sep 2024
Multi-modal Generative AI: Multi-modal LLMs, Diffusions and the Unification
X. Wang
Yuwei Zhou
Bin Huang
Hong Chen
Wenwu Zhu
DiffM
151
9
0
23 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
192
14
0
23 Sep 2024
GroupDiff: Diffusion-based Group Portrait Editing
Yuming Jiang
Nanxuan Zhao
Qing Liu
Krishna Kumar Singh
Shuai Yang
Chen Change Loy
Ziwei Liu
DiffM
72
1
0
22 Sep 2024
Dormant: Defending against Pose-driven Human Image Animation
Jiachen Zhou
Mingsi Wang
Tianlin Li
Guozhu Meng
Kai Chen
160
5
0
22 Sep 2024
DilateQuant: Accurate and Efficient Diffusion Quantization via Weight Dilation
Xuewen Liu
Zhikai Li
Minhao Jiang
Mengjuan Chen
Jianquan Li
Qingyi Gu
MQ
95
5
0
22 Sep 2024
JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation
Hadrien Reynaud
Matthew Baugh
Mischa Dombrowski
Sarah Cechnicka
Qingjie Meng
Bernhard Kainz
VLM
66
0
0
21 Sep 2024
KALIE: Fine-Tuning Vision-Language Models for Open-World Manipulation without Robot Data
Grace Tang
Swetha Rajkumar
Yifei Zhou
Homer Walke
Sergey Levine
Kuan Fang
LM&Ro
VLM
67
8
0
21 Sep 2024
BrainDreamer: Reasoning-Coherent and Controllable Image Generation from EEG Brain Signals via Language Guidance
Ling Wang
Chen Wu
Lin Wang
DiffM
66
0
0
21 Sep 2024
Beauty Beyond Words: Explainable Beauty Product Recommendations Using Ingredient-Based Product Attributes
Siliang Liu
Rahul Suresh
Amin Banitalebi-Dehkordi
66
2
0
20 Sep 2024
Imagine yourself: Tuning-Free Personalized Image Generation
Zecheng He
Bo Sun
Felix Juefei-Xu
Haoyu Ma
Ankit Ramchandani
...
Ning Zhang
Peizhao Zhang
Roshan Sumbaly
Peter Vajda
Animesh Sinha
DiffM
100
19
0
20 Sep 2024
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
Zhengguang Zhou
Jing Li
Huaxia Li
Nemo Chen
Xu Tang
DiffM
VGen
82
11
0
19 Sep 2024
AudioComposer: Towards Fine-grained Audio Generation with Natural Language Descriptions
Yun Wang
Hangting Chen
Dongchao Yang
Zhiyong Wu
Xixin Wu
DiffM
97
2
0
19 Sep 2024
LEMON: Localized Editing with Mesh Optimization and Neural Shaders
Furkan Mert Algan
Umut Yazgan
Driton Salihu
Cem Eteke
Eckehard G. Steinbach
DiffM
26
0
0
18 Sep 2024
GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation
Shuowen Liang
Sisi Li
Qingyun Wang
Cen Zhang
Kaiquan Zhu
Tian Yang
DiffM
54
0
0
18 Sep 2024
Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models
Eleonora Lopez
Luigi Sigillo
Federica Colonnese
Massimo Panella
Danilo Comminiello
DiffM
185
2
0
17 Sep 2024
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Zhenwei Wang
Tengfei Wang
Zexin He
Gerhard Hancke
Ziwei Liu
Rynson W. H. Lau
DiffM
97
5
0
17 Sep 2024
OmniGen: Unified Image Generation
Shitao Xiao
Yueze Wang
Huaying Yuan
Xingrun Xing
Ruiran Yan
Shuting Wang
Tiejun Huang
Zheng Liu
DiffM
VLM
SyDa
133
88
0
17 Sep 2024
MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
Debin Meng
Christos Tzelepis
Ioannis Patras
Georgios Tzimiropoulos
DiffM
91
0
0
17 Sep 2024
Edge-based Denoising Image Compression
Ryugo Morita
Hitoshi Nishimura
Ko Watanabe
Andreas Dengel
Jinjia Zhou
71
0
0
17 Sep 2024
Taming Diffusion Models for Image Restoration: A Review
Ziwei Luo
Fredrik K. Gustafsson
Zheng Zhao
Jens Sjölund
Thomas B. Schön
99
6
0
16 Sep 2024
SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps
Jakub Gregorek
Lazaros Nalpantidis
3DGS
133
4
0
16 Sep 2024
GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Vitor Campagnolo Guizilini
P. Tokmakov
Achal Dave
Rares Andrei Ambrus
DiffM
75
2
0
15 Sep 2024
Latent Diffusion Models for Controllable RNA Sequence Generation
Kaixuan Huang
Yukang Yang
Kaidi Fu
Yanyi Chu
Le Cong
Mengdi Wang
89
2
0
15 Sep 2024
Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through f-divergence Minimization
Haoyuan Sun
Bo Xia
Yongzhe Chang
Xueqian Wang
EGVM
64
6
0
15 Sep 2024
E-Commerce Inpainting with Mask Guidance in Controlnet for Reducing Overcompletion
Guandong Li
DiffM
60
1
0
15 Sep 2024
One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild
Dongqi Fan
Tao Chen
Mingjie Wang
Rui Ma
Qiang Tang
Zili Yi
Qian Wang
Liang Chang
72
0
0
15 Sep 2024
Adaptive Multi-Modal Control of Digital Human Hand Synthesis Using a Region-Aware Cycle Loss
Qifan Fu
Xiaohang Yang
Muhammad Asad
Changjae Oh
Shanxin Yuan
Gregory Slabaugh
77
3
0
13 Sep 2024
MAISI: Medical AI for Synthetic Imaging
Pengfei Guo
Can Zhao
Dong Yang
Ziyue Xu
Vishwesh Nath
...
Benjamin D. Simon
Mason J Belue
Stephanie Harmon
Baris Turkbey
Daguang Xu
DiffM
MedIm
79
25
0
13 Sep 2024
A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis
Yohan Poirier-Ginter
Alban Gauthier
Julien Philip
Jean-François Lalonde
G. Drettakis
98
13
0
13 Sep 2024
Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection
Dixi Yao
FedML
76
1
0
13 Sep 2024
Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis
Zhiqi Huang
Dan Luo
Jun Wang
Huan Liao
Zhiheng Li
Zhiyong Wu
VGen
88
4
0
13 Sep 2024
DiffFAS: Face Anti-Spoofing via Generative Diffusion Models
Xinxu Ge
Xin Liu
Zitong Yu
Jingang Shi
Chun Qi
Jie Li
Heikki Kälviäinen
AAML
DiffM
62
5
0
13 Sep 2024
Enhancing Privacy in ControlNet and Stable Diffusion via Split Learning
Dixi Yao
63
0
0
13 Sep 2024
GroundingBooth: Grounding Text-to-Image Customization
Zhexiao Xiong
Wei Xiong
Jing Shi
He Zhang
Yizhi Song
Nathan Jacobs
DiffM
158
9
0
13 Sep 2024
STA-V2A: Video-to-Audio Generation with Semantic and Temporal Alignment
Yong Ren
Chenxing Li
Manjie Xu
Wei Liang
Yu Gu
Rilin Chen
Dong Yu
VGen
DiffM
101
9
0
13 Sep 2024
SIG: A Synthetic Identity Generation Pipeline for Generating Evaluation Datasets for Face Recognition
Kassi Nzalasse
Rishav Raj
Eli J. Laird
Corey Clark
66
0
0
12 Sep 2024
Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation
Samanta Rodriguez
Yiming Dou
Miquel Oller
Andrew Owens
Nima Fazeli
DiffM
93
8
0
12 Sep 2024
Improving Text-guided Object Inpainting with Semantic Pre-inpainting
Yifu Chen
Jingwen Chen
Yingwei Pan
Yehao Li
Ting Yao
Zhineng Chen
Tao Mei
DiffM
77
7
0
12 Sep 2024
MagicStyle: Portrait Stylization Based on Reference Image
Zhaoli Deng
Kaibin Zhou
Fanyi Wang
Zhenpeng Mi
DiffM
103
1
0
12 Sep 2024
SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality
Chenyang Lei
Liyi Chen
Jun Cen
Xiao Chen
Zhen Lei
Felix Heide
Ziwei Liu
Qifeng Chen
Zhaoxiang Zhang
97
0
0
12 Sep 2024
Data Augmentation via Latent Diffusion for Saliency Prediction
Bahar Aydemir
Deblina Bhattacharjee
Tong Zhang
Mathieu Salzmann
Sabine Süsstrunk
110
1
0
11 Sep 2024
Phy124: Fast Physics-Driven 4D Content Generation from a Single Image
Jiajing Lin
Zhenzhong Wang
Yongjie Hou
Yuzhou Tang
Min Jiang
VGen
70
6
0
11 Sep 2024
ODYSSEE: Oyster Detection Yielded by Sensor Systems on Edge Electronics
Xiaomin Lin
Vivek Mange
Arjun Suresh
Bernhard Neuberger
Aadi Palnitkar
...
Alhim Vera
Markus Vincze
Ioannis Rekleitis
Herbert G. Tanner
Yiannis Aloimonos
134
3
0
11 Sep 2024
RealisDance: Equip controllable character animation with realistic hands
Jingkai Zhou
Benzhi Wang
Weihua Chen
Jingqi Bai
Dongyang Li
Aixi Zhang
Hao Xu
Mingyang Yang
F. Wang
53
17
0
10 Sep 2024
DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement
Jia-Wei Liao
Winston Wang
Tzu-Sian Wang
Li-Xuan Peng
Ju-Hsuan Weng
Cheng-Fu Chou
Jun-Cheng Chen
DiffM
119
2
0
10 Sep 2024
MemoVis: A GenAI-Powered Tool for Creating Companion Reference Images for 3D Design Feedback
Chen Chen
Cuong Nguyen
Thibault Groueix
Vladimir G. Kim
Nadir Weibel
DiffM
67
4
0
09 Sep 2024