Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation
Wei Wu
Xi Guo
Weixuan Tang
Tingxuan Huang
Chiyu Wang
Dongyue Chen
C. Ding
VGen
80
8
0
09 Sep 2024
Prim2Room: Layout-Controllable Room Mesh Generation from Primitives
Chengzeng Feng
Jiacheng Wei
Cheng Chen
Yang Li
Pan Ji
Fayao Liu
Hongdong Li
Guosheng Lin
90
1
0
09 Sep 2024
Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance
Quang-Huy Che
Duc-Tri Le
Vinh-Tiep Nguyen
D. Lam
Vinh-Tiep Nguyen
DiffM
253
1
0
09 Sep 2024
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Jiaxin Cheng
Zixu Zhao
Tong He
Tianjun Xiao
Yicong Zhou
Zheng Zhang
DiffM
148
0
0
07 Sep 2024
Training-Free Style Consistent Image Synthesis with Condition and Mask Guidance in E-Commerce
Guandong Li
DiffM
63
2
0
07 Sep 2024
One-Shot Diffusion Mimicker for Handwritten Text Generation
Gang Dai
Yifan Zhang
Quhui Ke
Qiangya Guo
Shuangping Huang
DiffM
107
8
0
06 Sep 2024
An overview of domain-specific foundation model: key technologies, applications and challenges
Haolong Chen
Hanzhi Chen
Zijian Zhao
Kaifeng Han
Guangxu Zhu
Yichen Zhao
Ying Du
Wei Xu
Qingjiang Shi
ALM
VLM
129
5
0
06 Sep 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
T. Hu
Xuemeng Yang
Licheng Wen
Yu Yang
Tiantian Wei
Yukai Ma
Min Dou
Botian Shi
Yong Liu
VGen
DiffM
170
6
0
06 Sep 2024
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation
Wenliang Zhao
Haolin Wang
Jie Zhou
Jiwen Lu
DiffM
56
3
0
05 Sep 2024
LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors
Hanyang Yu
Xiaoxiao Long
Ping Tan
3DGS
90
6
0
05 Sep 2024
SketcherX: AI-Driven Interactive Robotic drawing with Diffusion model and Vectorization Techniques
Jookyung Song
Mookyoung Kang
Nojun Kwak
37
1
0
04 Sep 2024
Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors
Haiyu Wu
Jaskirat Singh
Sicong Tian
Liang Zheng
Kevin W. Bowyer
CVBM
145
4
0
04 Sep 2024
LinFusion: 1 GPU, 1 Minute, 16K Image
Songhua Liu
Weihao Yu
Zhenxiong Tan
Xinchao Wang
123
16
0
03 Sep 2024
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
Wangbo Yu
Jinbo Xing
Li Yuan
Wenbo Hu
Xiaoyu Li
Zhipeng Huang
Xiangjun Gao
T. Wong
Ying Shan
Yonghong Tian
VGen
DiffM
116
97
0
03 Sep 2024
EarthGen: Generating the World from Top-Down Views
Ansh Sharma
Albert Xiao
Praneet Rathi
Rohit Kundu
Albert Zhai
Yuan Shen
Shenlong Wang
123
1
0
02 Sep 2024
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
Qihua Chen
Yi Ma
Haobo Wang
Junkun Yuan
Wenzhe Zhao
Q. Tian
Hongmei Wang
Shaobo Min
Qifeng Chen
Wen Liu
DiffM
110
21
0
02 Sep 2024
From Bird's-Eye to Street View: Crafting Diverse and Condition-Aligned Images with Latent Diffusion Model
Xiaojie Xu
Tianshuo Xu
Fulong Ma
Yingcong Chen
102
0
0
02 Sep 2024
SPDiffusion: Semantic Protection Diffusion Models for Multi-concept Text-to-image Generation
Yang Zhang
Rui Zhang
Xuecheng Nie
Haochen Li
Jikun Chen
Yifan Hao
Xin Zhang
Luoqi Liu
Ling Li
122
0
0
02 Sep 2024
COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
Shaorong Sun
Shuchao Pang
Yazhou Yao
Xiaoshui Huang
71
1
0
01 Sep 2024
FLUX that Plays Music
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Junshi Huang
141
9
0
01 Sep 2024
Compositional 3D-aware Video Generation with LLM Director
Hanxin Zhu
Tianyu He
Anni Tang
Junliang Guo
Zhibo Chen
Jiang Bian
DiffM
VGen
108
7
0
31 Aug 2024
Training-Free Sketch-Guided Diffusion with Latent Optimization
Sandra Zhang Ding
Jiafeng Mao
Kiyoharu Aizawa
DiffM
184
3
0
31 Aug 2024
Efficient Image Restoration through Low-Rank Adaptation and Stable Diffusion XL
Haiyang Zhao
DiffM
71
0
0
30 Aug 2024
GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content
Lebin Zhou
Kun Han
Nam Ling
Wei Wang
Wei Jiang
3DGS
89
0
0
29 Aug 2024
GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models
Moreno DÍncà
E. Peruzzo
Massimiliano Mancini
Xingqian Xu
Humphrey Shi
N. Sebe
103
0
0
29 Aug 2024
COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation
Jiefeng Li
Ye Yuan
Davis Rempe
Haotian Zhang
Pavlo Molchanov
Cewu Lu
Jan Kautz
Umar Iqbal
DiffM
VGen
104
2
0
29 Aug 2024
Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
Xiaoyu Jin
Zunnan Xu
Mingwen Ou
Wenming Yang
DiffM
89
7
0
29 Aug 2024
TEDRA: Text-based Editing of Dynamic and Photoreal Actors
Basavaraj Sunagad
Heming Zhu
Mohit Mendiratta
Adam Kortylewski
Christian Theobalt
Marc Habermann
DiffM
101
1
0
28 Aug 2024
DiffAge3D: Diffusion-based 3D-aware Face Aging
Junaid Wahid
Fangneng Zhan
Pramod Rao
Christian Theobalt
DiffM
79
1
0
28 Aug 2024
GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model
Yongjie Fu
Yunlong Li
Xuan Di
VGen
127
3
0
28 Aug 2024
Are Pose Estimators Ready for the Open World? STAGE: Synthetic Data Generation Toolkit for Auditing 3D Human Pose Estimators
Nikita Kister
István Sárándi
Anna Khoreva
Gerard Pons-Moll
152
0
0
28 Aug 2024
Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas
Fabio Quattrini
Vittorio Pippi
Silvia Cascianelli
Rita Cucchiara
72
3
0
28 Aug 2024
Alfie: Democratising RGBA Image Generation With No
Fabio Quattrini
Vittorio Pippi
Silvia Cascianelli
Rita Cucchiara
DiffM
93
6
0
27 Aug 2024
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation
Abdelrahman Eldesokey
Peter Wonka
DiffM
127
4
0
27 Aug 2024
CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis
Weijia Li
Jun He
Junyan Ye
Huaping Zhong
Zhimeng Zheng
Zilong Huang
Dahua Lin
Conghui He
88
7
0
27 Aug 2024
MeshUp: Multi-Target Mesh Deformation via Blended Score Distillation
Hyunwoo Kim
Itai Lang
Noam Aigerman
Thibault Groueix
Vladimir G. Kim
Rana Hanocka
AI4CE
127
3
0
27 Aug 2024
Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models
Chaohua Shi
Xuan Wang
Si Shi
Xule Wang
Mingrui Zhu
Nannan Wang
X. Gao
CoGe
93
2
0
26 Aug 2024
Avatar Concept Slider: Controllable Editing of Concepts in 3D Human Avatars
Yixuan He
Lin Geng Foo
Ajmal Mian
Hossein Rahmani
Jun Liu
Christian Theobalt
77
1
0
26 Aug 2024
Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching
Minghao Liu
Le Zhang
Yingjie Tian
Xiaochao Qu
Luoqi Liu
Ting Liu
DiffM
CoGe
71
4
0
25 Aug 2024
TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation
Jack D. Saunders
Vinay P. Namboodiri
95
2
0
25 Aug 2024
SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting
Wenrui Li
Yapeng Mi
Fucheng Cai
Zhe Yang
Wangmeng Zuo
Xingtao Wang
Xiaopeng Fan
3DGS
100
9
0
25 Aug 2024
EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation
Cong Wang
Jiaxi Gu
Panwen Hu
Haoyu Zhao
Yuanfan Guo
J. N. Han
Hang Xu
Xiaodan Liang
VGen
DiffM
99
7
0
23 Aug 2024
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey
Qika Lin
Yifan Zhu
Xin Mei
Ling Huang
Jingying Ma
Kai He
Zhen Peng
Min Zhang
Mengling Feng
111
23
0
23 Aug 2024
Abstract Art Interpretation Using ControlNet
Rishabh Srivastava
Addrish Roy
26
0
0
23 Aug 2024
Atlas Gaussians Diffusion for 3D Generation
Haitao Yang
Yuan Dong
Hanwen Jiang
Dejia Xu
Georgios Pavlakos
Qixing Huang
3DGS
191
3
0
23 Aug 2024
CatFree3D: Category-agnostic 3D Object Detection with Diffusion
Wenjing Bian
Zirui Wang
Andrea Vedaldi
96
1
0
22 Aug 2024
Sapiens: Foundation for Human Vision Models
Rawal Khirodkar
Timur M. Bagautdinov
Julieta Martinez
Su Zhaoen
Austin James
Peter Selednik
Stuart Anderson
Shunsuke Saito
VLM
145
82
0
22 Aug 2024
JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet
Yujia Gu
Haofeng Li
Xinyu Fang
Zihan Peng
Yinan Peng
DiffM
47
0
0
21 Aug 2024
Evolution of Detection Performance throughout the Online Lifespan of Synthetic Images
Dimitrios Karageorgiou
Quentin Bammey
Valentin Porcellini
Bertrand Goupil
Denis Teyssou
Symeon Papadopoulos
98
2
0
21 Aug 2024
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
Junwon Lee
Jaekwon Im
Dabin Kim
Juhan Nam
VGen
140
10
0
21 Aug 2024
Previous
1
2
3
...
23
24
25
...
60
61
62
Next