Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation
Wei Wu
Xi Guo
Weixuan Tang
Tingxuan Huang
Chiyu Wang
Dongyue Chen
C. Ding
VGen
80
8
0
09 Sep 2024
Prim2Room: Layout-Controllable Room Mesh Generation from Primitives
Chengzeng Feng
Jiacheng Wei
Cheng Chen
Yang Li
Pan Ji
Fayao Liu
Hongdong Li
Guosheng Lin
90
1
0
09 Sep 2024
Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance
Quang-Huy Che
Duc-Tri Le
Vinh-Tiep Nguyen
D. Lam
Vinh-Tiep Nguyen
DiffM
253
1
0
09 Sep 2024
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Jiaxin Cheng
Zixu Zhao
Tong He
Tianjun Xiao
Yicong Zhou
Zheng Zhang
DiffM
148
0
0
07 Sep 2024
Training-Free Style Consistent Image Synthesis with Condition and Mask Guidance in E-Commerce
Guandong Li
DiffM
63
2
0
07 Sep 2024
One-Shot Diffusion Mimicker for Handwritten Text Generation
Gang Dai
Yifan Zhang
Quhui Ke
Qiangya Guo
Shuangping Huang
DiffM
107
8
0
06 Sep 2024
An overview of domain-specific foundation model: key technologies, applications and challenges
Haolong Chen
Hanzhi Chen
Zijian Zhao
Kaifeng Han
Guangxu Zhu
Yichen Zhao
Ying Du
Wei Xu
Qingjiang Shi
ALM VLM
129
5
0
06 Sep 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
T. Hu
Xuemeng Yang
Licheng Wen
Yu Yang
Tiantian Wei
Yukai Ma
Min Dou
Botian Shi
Yong Liu
VGen DiffM
170
6
0
06 Sep 2024
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation
Wenliang Zhao
Haolin Wang
Jie Zhou
Jiwen Lu
DiffM
56
3
0
05 Sep 2024
LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors
Hanyang Yu
Xiaoxiao Long
Ping Tan
3DGS
90
6
0
05 Sep 2024
SketcherX: AI-Driven Interactive Robotic drawing with Diffusion model and Vectorization Techniques
Jookyung Song
Mookyoung Kang
Nojun Kwak
37
1
0
04 Sep 2024
Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors
Haiyu Wu
Jaskirat Singh
Sicong Tian
Liang Zheng
Kevin W. Bowyer
CVBM
145
4
0
04 Sep 2024
LinFusion: 1 GPU, 1 Minute, 16K Image
Songhua Liu
Weihao Yu
Zhenxiong Tan
Xinchao Wang
123
16
0
03 Sep 2024
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
Wangbo Yu
Jinbo Xing
Li Yuan
Wenbo Hu
Xiaoyu Li
Zhipeng Huang
Xiangjun Gao
T. Wong
Ying Shan
Yonghong Tian
VGen DiffM
116
97
0
03 Sep 2024
EarthGen: Generating the World from Top-Down Views
Ansh Sharma
Albert Xiao
Praneet Rathi
Rohit Kundu
Albert Zhai
Yuan Shen
Shenlong Wang
123
1
0
02 Sep 2024
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
Qihua Chen
Yi Ma
Haobo Wang
Junkun Yuan
Wenzhe Zhao
Q. Tian
Hongmei Wang
Shaobo Min
Qifeng Chen
Wen Liu
DiffM
110
21
0
02 Sep 2024
From Bird's-Eye to Street View: Crafting Diverse and Condition-Aligned Images with Latent Diffusion Model
Xiaojie Xu
Tianshuo Xu
Fulong Ma
Yingcong Chen
102
0
0
02 Sep 2024
SPDiffusion: Semantic Protection Diffusion Models for Multi-concept Text-to-image Generation
Yang Zhang
Rui Zhang
Xuecheng Nie
Haochen Li
Jikun Chen
Yifan Hao
Xin Zhang
Luoqi Liu
Ling Li
122
0
0
02 Sep 2024
COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
Shaorong Sun
Shuchao Pang
Yazhou Yao
Xiaoshui Huang
71
1
0
01 Sep 2024
FLUX that Plays Music
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Junshi Huang
141
9
0
01 Sep 2024
Compositional 3D-aware Video Generation with LLM Director
Hanxin Zhu
Tianyu He
Anni Tang
Junliang Guo
Zhibo Chen
Jiang Bian
DiffM VGen
108
7
0
31 Aug 2024
Training-Free Sketch-Guided Diffusion with Latent Optimization
Sandra Zhang Ding
Jiafeng Mao
Kiyoharu Aizawa
DiffM
184
3
0
31 Aug 2024
Efficient Image Restoration through Low-Rank Adaptation and Stable Diffusion XL
Haiyang Zhao
DiffM
71
0
0
30 Aug 2024
GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content
Lebin Zhou
Kun Han
Nam Ling
Wei Wang
Wei Jiang
3DGS
89
0
0
29 Aug 2024
GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models
Moreno D'Incà
E. Peruzzo
Massimiliano Mancini
Xingqian Xu
Humphrey Shi
N. Sebe
103
0
0
29 Aug 2024
COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation
Jiefeng Li
Ye Yuan
Davis Rempe
Haotian Zhang
Pavlo Molchanov
Cewu Lu
Jan Kautz
Umar Iqbal
DiffM VGen
104
2
0
29 Aug 2024
Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
Xiaoyu Jin
Zunnan Xu
Mingwen Ou
Wenming Yang
DiffM
89
7
0
29 Aug 2024
TEDRA: Text-based Editing of Dynamic and Photoreal Actors
Basavaraj Sunagad
Heming Zhu
Mohit Mendiratta
Adam Kortylewski
Christian Theobalt
Marc Habermann
DiffM
101
1
0
28 Aug 2024
DiffAge3D: Diffusion-based 3D-aware Face Aging
Junaid Wahid
Fangneng Zhan
Pramod Rao
Christian Theobalt
DiffM
79
1
0
28 Aug 2024
GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model
Yongjie Fu
Yunlong Li
Xuan Di
VGen
127
3
0
28 Aug 2024
Are Pose Estimators Ready for the Open World? STAGE: Synthetic Data Generation Toolkit for Auditing 3D Human Pose Estimators
Nikita Kister
István Sárándi
Anna Khoreva
Gerard Pons-Moll
152
0
0
28 Aug 2024
Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas
Fabio Quattrini
Vittorio Pippi
Silvia Cascianelli
Rita Cucchiara
72
3
0
28 Aug 2024
Alfie: Democratising RGBA Image Generation With No $$$
Fabio Quattrini
Vittorio Pippi
Silvia Cascianelli
Rita Cucchiara
DiffM
93
6
0
27 Aug 2024
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation
Abdelrahman Eldesokey
Peter Wonka
DiffM
127
4
0
27 Aug 2024
CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis
Weijia Li
Jun He
Junyan Ye
Huaping Zhong
Zhimeng Zheng
Zilong Huang
Dahua Lin
Conghui He
88
7
0
27 Aug 2024
MeshUp: Multi-Target Mesh Deformation via Blended Score Distillation
Hyunwoo Kim
Itai Lang
Noam Aigerman
Thibault Groueix
Vladimir G. Kim
Rana Hanocka
AI4CE
127
3
0
27 Aug 2024
Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models
Chaohua Shi
Xuan Wang
Si Shi
Xule Wang
Mingrui Zhu
Nannan Wang
X. Gao
CoGe
93
2
0
26 Aug 2024
Avatar Concept Slider: Controllable Editing of Concepts in 3D Human Avatars
Yixuan He
Lin Geng Foo
Ajmal Mian
Hossein Rahmani
Jun Liu
Christian Theobalt
77
1
0
26 Aug 2024
Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching
Minghao Liu
Le Zhang
Yingjie Tian
Xiaochao Qu
Luoqi Liu
Ting Liu
DiffM CoGe
71
4
0
25 Aug 2024
TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation
Jack D. Saunders
Vinay P. Namboodiri
95
2
0
25 Aug 2024
SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting
Wenrui Li
Yapeng Mi
Fucheng Cai
Zhe Yang
Wangmeng Zuo
Xingtao Wang
Xiaopeng Fan
3DGS
100
9
0
25 Aug 2024
EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation
Cong Wang
Jiaxi Gu
Panwen Hu
Haoyu Zhao
Yuanfan Guo
J. N. Han
Hang Xu
Xiaodan Liang
VGen DiffM
99
7
0
23 Aug 2024
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey
Qika Lin
Yifan Zhu
Xin Mei
Ling Huang
Jingying Ma
Kai He
Zhen Peng
Min Zhang
Mengling Feng
111
23
0
23 Aug 2024
Abstract Art Interpretation Using ControlNet
Rishabh Srivastava
Addrish Roy
26
0
0
23 Aug 2024
Atlas Gaussians Diffusion for 3D Generation
Haitao Yang
Yuan Dong
Hanwen Jiang
Dejia Xu
Georgios Pavlakos
Qixing Huang
3DGS
191
3
0
23 Aug 2024
CatFree3D: Category-agnostic 3D Object Detection with Diffusion
Wenjing Bian
Zirui Wang
Andrea Vedaldi
96
1
0
22 Aug 2024
Sapiens: Foundation for Human Vision Models
Rawal Khirodkar
Timur M. Bagautdinov
Julieta Martinez
Su Zhaoen
Austin James
Peter Selednik
Stuart Anderson
Shunsuke Saito
VLM
145
82
0
22 Aug 2024
JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet
Yujia Gu
Haofeng Li
Xinyu Fang
Zihan Peng
Yinan Peng
DiffM
47
0
0
21 Aug 2024
Evolution of Detection Performance throughout the Online Lifespan of Synthetic Images
Dimitrios Karageorgiou
Quentin Bammey
Valentin Porcellini
Bertrand Goupil
Denis Teyssou
Symeon Papadopoulos
98
2
0
21 Aug 2024
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
Junwon Lee
Jaekwon Im
Dabin Kim
Juhan Nam
VGen
140
10
0
21 Aug 2024