Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models
Shuhong Zheng
Zhipeng Bao
Ruoyu Zhao
Martial Hebert
Yu-Xiong Wang
DiffM
162
0
0
07 Nov 2024
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Wenqiang Sun
Shuo Chen
Fan Liu
Zilong Chen
Yueqi Duan
Jun Zhang
Yikai Wang
VGen
124
41
0
07 Nov 2024
Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation
Benito Buchheim
M. Reimann
Jürgen Döllner
63
0
0
07 Nov 2024
MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation
Han Yang
Sotiris Anagnostidis
Enis Simsar
Thomas Hofmann
DiffM
33
0
0
07 Nov 2024
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
Koichi Namekata
Sherwin Bahmani
Ziyi Wu
Yash Kant
Igor Gilitschenski
David B. Lindell
VGen
171
16
0
07 Nov 2024
HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images
Zhenyue Qin
Yiqun Zhang
Yang Liu
Dylan Campbell
DiffM
97
3
0
07 Nov 2024
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
Ashutosh Srivastava
Tarun Ram Menta
Abhinav Java
Avadhoot Jadhav
Silky Singh
Surgan Jandial
Balaji Krishnamurthy
DiffM
74
1
0
06 Nov 2024
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
Zhitong Gao
Bingnan Li
Mathieu Salzmann
Xuming He
OOD
VLM
176
1
0
06 Nov 2024
ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization
Huayang Huang
Yu Wu
Qian Wang
DiffM
WIGM
108
7
0
06 Nov 2024
SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture
Andrew Heschl
Mauricio Murillo
Keyhan Najafian
F. Maleki
DiffM
56
2
0
05 Nov 2024
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
Tariq Berrada Ifriqi
Pietro Astolfi
Melissa Hall
Reyhane Askari Hemmat
Yohann Benchetrit
...
Matthew Muckley
Karteek Alahari
Adriana Romero Soriano
Jakob Verbeek
M. Drozdzal
AI4CE
VLM
139
4
0
05 Nov 2024
Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Ao Fu
Yi Zhou
Tao Zhou
Yue Yang
Bojun Gao
Qun Li
Guobin Wu
Ling Shao
VGen
100
3
0
05 Nov 2024
LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior
Xingjian Tang
Jingwei Guan
Linge Li
Ran Shi
Youmei Zhang
Mengye Lyu
Li Yan
MedIm
DiffM
171
0
0
05 Nov 2024
AutoVFX: Physically Realistic Video Editing from Natural Language Instructions
Hao-Yu Hsu
Zhi-Hao Lin
Albert Zhai
Hongchi Xia
Shenlong Wang
VGen
105
11
0
04 Nov 2024
Training-free Regional Prompting for Diffusion Transformers
Anthony Chen
Jianjin Xu
Wenzhao Zheng
Gaole Dai
Yun Wang
Renrui Zhang
Haofan Wang
Shanghang Zhang
VLM
102
5
0
04 Nov 2024
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Wei Cheng
Juncheng Mu
Xianfang Zeng
Xin Chen
Anqi Pang
...
Zhibin Wang
Bin-Bin Fu
Gang Yu
Ziwei Liu
Liang Pan
119
12
0
04 Nov 2024
Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition
Xinkai Liu
Mahdi Boloursaz Mashhadi
Li Qiao
Yi Ma
Rahim Tafazolli
Mehdi Bennis
DiffM
100
2
0
04 Nov 2024
KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension
Jie Yang
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chen Qian
Ruimao Zhang
MLLM
133
3
0
04 Nov 2024
DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Yean Cheng
Ziqi Cai
Ming Ding
Wendi Zheng
Shiyu Huang
Yuxiao Dong
J. Tang
Boxin Shi
DiffM
59
0
0
03 Nov 2024
Infinite-Resolution Integral Noise Warping for Diffusion Models
Yitong Deng
Winnie Lin
Lingxiao Li
Dmitriy Smirnov
Ryan Burgert
Ning Yu
Vincent Dedun
Mohammad H. Taghavi
65
3
0
02 Nov 2024
Pin-Tuning: Parameter-Efficient In-Context Tuning for Few-Shot Molecular Property Prediction
Liang Wang
Qiang Liu
Shaozhen Liu
Xin Sun
Shu Wu
Liang Wang
106
2
0
02 Nov 2024
X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios
Yichen Xie
Chenfeng Xu
C-T.John Peng
Shuqi Zhao
Nhat Ho
Alexander T. Pham
Mingyu Ding
Masayoshi Tomizuka
Weidong Zhan
DiffM
92
3
0
02 Nov 2024
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing
Naufal Suryanto
Andro Aprila Adiputra
Ahmada Yusril Kadiptya
Thi-Thu-Huong Le
Derry Pratama
Yongsu Kim
Howon Kim
DiffM
128
0
0
01 Nov 2024
TextDestroyer: A Training- and Annotation-Free Diffusion Method for Destroying Anomal Text from Images
Mengcheng Li
Mingbao Lin
Yong Li
Chia-Wen Lin
DiffM
94
0
0
01 Nov 2024
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images
Timing Yang
Yuanliang Ju
Li Yi
3DPC
95
4
0
31 Oct 2024
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts
Xiang Deng
Youxin Pang
Xiaochen Zhao
Chao Xu
Lizhen Wang
Hongjiang Xiao
Shi Yan
Hongwen Zhang
Yebin Liu
DiffM
VGen
68
1
0
31 Oct 2024
Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models
Youngjun Jun
Jiwoo Park
Kyobin Choo
Tae Eun Choi
Seong Jae Hwang
CoGe
119
0
0
31 Oct 2024
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
Fu Feng
Yucheng Xie
Xu Yang
Jing Wang
Xin Geng
DiffM
80
0
0
31 Oct 2024
TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation
Sunjae Yoon
Gwanhyeong Koo
Younghwan Lee
Chang D. Yoo
VGen
149
5
0
31 Oct 2024
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
Shengkai Zhang
Nianhong Jiao
Tian Li
Chaojie Yang
Chenhui Xue
Boya Niu
Jun Gao
VGen
VLM
DiffM
62
3
0
30 Oct 2024
Prune and Repaint: Content-Aware Image Retargeting for any Ratio
Feihong Shen
Chong Li
Yifeng Geng
Yongjian Deng
Hao Chen
58
1
0
30 Oct 2024
One Prompt to Verify Your Models: Black-Box Text-to-Image Models Verification via Non-Transferable Adversarial Attacks
Ji Guo
Wenbo Jiang
Rui Zhang
Guoming Lu
Hongwei Li
AAML
160
0
0
30 Oct 2024
FairSkin: Fair Diffusion for Skin Disease Image Generation
Ruichen Zhang
Yuguang Yao
Zhen Tan
Zechao Li
Pan Wang
Huan Liu
Jingtong Hu
Sijia Liu
Tianlong Chen
MedIm
65
0
0
29 Oct 2024
PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement
Shutong Jin
Ruiyu Wang
Kuangyi Chen
Florian T. Pokorny
76
0
0
29 Oct 2024
Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images
Suhyun Ahn
Wonjung Park
Jihoon Cho
Seunghyuck Park
Jinah Park
MedIm
85
0
0
29 Oct 2024
HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion
Yu Zeng
Zhiyuan Liu
Jiachen Liu
Linlin Shen
Kaijun Deng
Weizhao He
Jinbao Wang
DiffM
52
0
0
29 Oct 2024
DiffSTR: Controlled Diffusion Models for Scene Text Removal
Sanhita Pathak
V. Kaushik
Brejesh Lall
DiffM
70
0
0
29 Oct 2024
Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis
Deepak Sridhar
Abhishek Peri
Rohith Rachala
Nuno Vasconcelos
DiffM
64
1
0
29 Oct 2024
IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models
Hang Guo
Yawei Li
Tao Dai
Shu-Tao Xia
Luca Benini
MQ
131
2
0
29 Oct 2024
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
Yaopei Zeng
Yuanpu Cao
Bochuan Cao
Yurui Chang
Jinghui Chen
Lu Lin
DiffM
92
3
0
28 Oct 2024
MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis
Di Qiu
Zheng Chen
Rui Wang
Mingyuan Fan
Changqian Yu
Junshi Huan
Xiang Wen
VGen
96
9
0
28 Oct 2024
FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space
Yiyang Guo
Ruizhe Li
Mude Hui
Hanzhong Guo
Chen Zhang
Chuangjian Cai
Le Wan
Shangfei Wang
92
0
0
28 Oct 2024
Novel Object Synthesis via Adaptive Text-Image Harmony
Zeren Xiong
Zedong Zhang
Zikun Chen
Shuo Chen
Xianrui Li
Gan Sun
Jian Yang
Jun Li
DiffM
95
4
0
28 Oct 2024
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation
Phillip Y. Lee
Taehoon Yoon
Minhyuk Sung
138
7
1
27 Oct 2024
Diff-CXR: Report-to-CXR generation through a disease-knowledge enhanced diffusion model
Peng Huang
Bowen Guo
Shuyu Liang
Junhu Fu
Yuanyuan Wang
Yi Guo
DiffM
MedIm
68
1
0
26 Oct 2024
Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models
Liulei Li
Wenguan Wang
Yue Yang
98
8
0
26 Oct 2024
MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation
Jialin Luo
Yuanzhi Wang
Ziqi Gu
Yide Qiu
Shuaizhen Yao
Fuyun Wang
Chunyan Xu
Wenhua Zhang
Dan Wang
Zhen Cui
DiffM
54
2
0
26 Oct 2024
Semantic Feature Decomposition based Semantic Communication System of Images with Large-scale Visual Generation Models
Senran Fan
Zhicheng Bao
Chen Dong
Haotai Liang
Xiaodong Xu
Ping Zhang
DiffM
48
3
0
26 Oct 2024
ArCSEM: Artistic Colorization of SEM Images via Gaussian Splatting
Takuma Nishimura
Andreea Dogaru
Martin Oeggerli
Bernhard Egger
70
0
0
25 Oct 2024
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
Z. Gong
Guangyin Bao
Qi Zhang
Zhongwei Wan
Duoqian Miao
...
Changwei Wang
Rongtao Xu
Liang Hu
Ke Liu
Yu Zhang
DiffM
VGen
124
10
0
25 Oct 2024
Previous
1
2
3
...
18
19
20
...
60
61
62
Next