2302.05543
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
Focused ReAct: Improving ReAct through Reiterate and Early Stop
Shuoqiu Li
Han Xu
Haipeng Chen
ReLM
LRM
106
7
0
14 Oct 2024
Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning
Etai Littwin
Vimal Thilak
Anand Gopalakrishnan
134
8
0
14 Oct 2024
Saliency Guided Optimization of Diffusion Latents
Xiwen Wang
Jizhe Zhou
Xuekang Zhu
Cheng Li
Mao Li
EGVM
33
0
0
14 Oct 2024
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Weichao Zeng
Yan Shu
Zhenhang Li
Dongbao Yang
Yu Zhou
DiffM
71
11
0
14 Oct 2024
Learning to Customize Text-to-Image Diffusion In Diverse Context
Taewook Kim
Wei Chen
Qiang Qiu
DiffM
60
2
0
14 Oct 2024
REPeat: A Real2Sim2Real Approach for Pre-acquisition of Soft Food Items in Robot-assisted Feeding
Nayoung Ha
Ruolin Ye
Ziang Liu
Shubhangi Sinha
Tapomayukh Bhattacharjee
61
5
0
13 Oct 2024
AuthFace: Towards Authentic Blind Face Restoration with Face-oriented Generative Diffusion Prior
Guoqiang Liang
Qingnan Fan
Bingtao Fu
Jinwei Chen
Hong Gu
Lin Wang
DiffM
68
1
0
13 Oct 2024
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Eungbean Lee
Somi Jeong
Kwanghoon Sohn
DiffM
63
1
0
13 Oct 2024
Generating Intermediate Representations for Compositional Text-To-Image Generation
Ran Galun
Sagie Benaim
49
0
0
13 Oct 2024
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation
Yifeng Xu
Zhenliang He
Shiguang Shan
Xilin Chen
DiffM
69
6
0
12 Oct 2024
Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
Xuan Huang
Hanhui Li
Wanquan Liu
Xiaodan Liang
Yiqiang Yan
Yuhao Cheng
Chengqiang Gao
3DGS
82
1
0
11 Oct 2024
Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models
Pascal Zwick
Kevin Roesch
Marvin Klemp
Oliver Bringmann
DiffM
63
1
0
11 Oct 2024
TD-Paint: Faster Diffusion Inpainting Through Time Aware Pixel Conditioning
Tsiry Mayet
Pourya Shamsolmoali
Simon Bernard
Eric Granger
Romain Hérault
Clément Chatelain
DiffM
82
1
0
11 Oct 2024
SceneCraft: Layout-Guided 3D Scene Generation
Xiuyu Yang
Yunze Man
Jun-Kun Chen
Yu-Xiong Wang
3DV
178
9
0
11 Oct 2024
RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image
Xiaoxue Chen
Jv Zheng
Hao Huang
Haoran Xu
Weihao Gu
...
He xiang
Huan-ang Gao
Hao Zhao
Guyue Zhou
Yaqin Zhang
3DGS
63
2
0
10 Oct 2024
ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion
Zitian Zhang
Frédéric Fortier-Chouinard
Mathieu Garon
Anand Bhattad
Jean-François Lalonde
DiffM
134
4
0
10 Oct 2024
HARIVO: Harnessing Text-to-Image Models for Video Generation
Mingi Kwon
Seoung Wug Oh
Yang Zhou
Difan Liu
Joon-Young Lee
Haoran Cai
Baqiao Liu
Feng Liu
Youngjung Uh
VGen
65
4
0
10 Oct 2024
DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models
Xiaoxiao He
Ligong Han
Quan Dao
Song Wen
Minhao Bai
...
Hongdong Li
Junzhou Huang
Faez Ahmed
Akash Srivastava
Dimitris Metaxas
DiffM
SyDa
159
5
0
10 Oct 2024
AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
Yukang Cao
Liang Pan
Kai Han
Kwan-Yee K. Wong
Ziwei Liu
VGen
129
6
0
09 Oct 2024
InstructG2I: Synthesizing Images from Multimodal Attributed Graphs
Bowen Jin
Ziqi Pang
Bingjun Guo
Yu-Xiong Wang
Jiaxuan You
Jiawei Han
DiffM
94
2
0
09 Oct 2024
Thing2Reality: Transforming 2D Content into Conditioned Multiviews and 3D Gaussian Objects for XR Communication
Erzhen Hu
Mingyi Li
Jungtaek Hong
Xun Qian
A. Olwal
David Kim
Seongkook Heo
Ruofei Du
54
1
0
09 Oct 2024
VehicleSDF: A 3D generative model for constrained engineering design via surrogate modeling
Hayata Morita
Kohei Shintani
Chenyang Yuan
Frank Permenter
AI4CE
143
2
0
09 Oct 2024
Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis
Ahmed Abdullah
Nikolas Ebert
Oliver Wasenmüller
ObjD
60
1
0
09 Oct 2024
Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques
Benyuan Meng
Qianqian Xu
Zitai Wang
Zhiyong Yang
Xiaochun Cao
Qingming Huang
96
0
0
09 Oct 2024
InstantIR: Blind Image Restoration with Instant Generative Reference
Jen-Yuan Huang
Haofan Wang
Qixun Wang
Xu Bai
Hao Ai
Peng-Fei Xing
Jen-tse Huang
45
1
0
09 Oct 2024
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Xinchen Zhang
Ling Yang
Ge Li
Yaqi Cai
Jiake Xie
Yong Tang
Yujiu Yang
Mengdi Wang
Bin Cui
EGVM
CoGe
113
11
0
09 Oct 2024
Chemistry-Inspired Diffusion with Non-Differentiable Guidance
Yuchen Shen
Chenhao Zhang
Sijie Fu
Chenghui Zhou
Newell Washburn
Barnabás Póczos
112
2
0
09 Oct 2024
Unsupervised Model Diagnosis
Yinong Wang
Eileen Li
Jinqi Luo
Zhaoning Wang
Fernando de la Torre
AAML
70
1
0
08 Oct 2024
RelitLRM: Generative Relightable Radiance for Large Reconstruction Models
Tianyuan Zhang
Zhengfei Kuang
Haian Jin
Zexiang Xu
Sai Bi
...
Yiwei Hu
Jian Yang
William T. Freeman
Kai Zhang
Fujun Luan
3DGS
120
3
0
08 Oct 2024
AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation
Boyuan Cao
Jiaxin Ye
Yujie Wei
Hongming Shan
80
4
0
08 Oct 2024
PixLens: A Novel Framework for Disentangled Evaluation in Diffusion-Based Image Editing with Object Detection + SAM
Stefan Stefanache
Lluís Pastor Pérez
Julen Costa Watanabe
Ernesto Sanchez Tejedor
Thomas Hofmann
Enis Simsar
EGVM
38
0
0
08 Oct 2024
ReFIR: Grounding Large Restoration Models with Retrieval Augmentation
Hang Guo
Tao Dai
Zhihao Ouyang
Taolin Zhang
Yaohua Zha
Bin Chen
Shu-Tao Xia
DiffM
83
6
0
08 Oct 2024
Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning
Saemi Moon
M. Lee
Sangdon Park
Dongwoo Kim
94
3
0
08 Oct 2024
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way
Jiazi Bu
Pengyang Ling
Pan Zhang
Tong Wu
Xiaoyi Dong
Yuhang Zang
Yuhang Cao
Dahua Lin
Jiaqi Wang
DiffM
VGen
44
0
0
08 Oct 2024
GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting
Yukang Cao
Masoud Hadi
Liang Pan
Ziwei Liu
3DGS
DiffM
102
5
0
07 Oct 2024
L-C4: Language-Based Video Colorization for Creative and Consistent Color
Zheng Chang
Shuchen Weng
Huan Ouyang
Yu Li
Si Li
Boxin Shi
DiffM
VGen
VLM
62
0
0
07 Oct 2024
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction
Leheng Li
Weichao Qiu
Xu Yan
Jing He
Kaiqiang Zhou
Yingjie Cai
Qing Lian
Bingbing Liu
Ying-Cong Chen
SyDa
DiffM
85
1
0
07 Oct 2024
ACDC: Autoregressive Coherent Multimodal Generation using Diffusion Correction
Hyungjin Chung
Dohun Lee
Jong Chul Ye
VGen
DiffM
68
2
0
07 Oct 2024
CAR: Controllable Autoregressive Modeling for Visual Generation
Ziyu Yao
Jialin Li
Yifeng Zhou
Yong Liu
Xi Jiang
Chengjie Wang
Feng Zheng
Yuexian Zou
Lei Li
DiffM
146
15
0
07 Oct 2024
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
Ayano Hiranaka
Shang-Fu Chen
Chieh-Hsin Lai
Dongjun Kim
Naoki Murata
Takashi Shibuya
Wei-Hsiang Liao
Shao-Hua Sun
Yuki Mitsufuji
126
2
0
07 Oct 2024
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise
Yepeng Liu
Yiren Song
Hai Ci
Yu Zhang
Haofan Wang
Mike Zheng Shou
Yuheng Bu
WIGM
118
7
0
07 Oct 2024
Presto! Distilling Steps and Layers for Accelerating Music Generation
Cheng-i Wang
Ge Zhu
Jonah Casebeer
Julian McAuley
Taylor Berg-Kirkpatrick
Nicholas J. Bryan
128
7
0
07 Oct 2024
Diffusion Models in 3D Vision: A Survey
Zhen Wang
Dongyuan Li
Xue Liu
Tianyu He
Jiang Bian
Renhe Jiang
MedIm
254
4
0
07 Oct 2024
TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio Motion Embedding and Diffusion Interpolation
Haiyang Liu
Xingchao Yang
Tomoya Akiyama
Yuantian Huang
Qiaoge Li
Shigeru Kuriyama
Takafumi Taketomi
VGen
SLR
75
10
0
05 Oct 2024
IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
Shitong Shao
Zikai Zhou
Lichen Bai
Haoyi Xiong
Zeke Xie
VGen
97
2
0
05 Oct 2024
Noise Crystallization and Liquid Noise: Zero-shot Video Generation using Image Diffusion Models
Muhammad Haaris Khan
Hadrien Reynaud
Bernhard Kainz
VGen
DiffM
61
0
0
05 Oct 2024
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher
Yong Guo
Shulian Zhang
Haolin Pan
Jing Liu
Yulun Zhang
Jian Chen
87
0
0
05 Oct 2024
Beyond Imperfections: A Conditional Inpainting Approach for End-to-End Artifact Removal in VTON and Pose Transfer
Aref Tabatabaei
Zahra Dehghanian
M. Amirmazlaghani
DiffM
99
0
0
05 Oct 2024
Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model
Keda Tao
Jinjin Gu
Yulun Zhang
Xiucheng Wang
Nan Cheng
DiffM
124
4
0
05 Oct 2024
ShieldDiff: Suppressing Sexual Content Generation from Diffusion Models through Reinforcement Learning
Dong Han
Salaheldin Mohamed
Yong Li
55
2
0
04 Oct 2024