arXiv:2302.05543
Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Flow-Guided Diffusion for Video Inpainting
Bohai Gu
Yongsheng Yu
Hengrui Fan
Libo Zhang
VGen, DiffM
102
12
0
26 Nov 2023
Animate124: Animating One Image to 4D Dynamic Scene
Yuyang Zhao
Zhiwen Yan
Enze Xie
Lanqing Hong
Zhenguo Li
Gim Hee Lee
VGen
106
66
0
24 Nov 2023
MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation
Zhiqi Li
Yiming Chen
Lingzhe Zhao
Peidong Liu
85
8
0
24 Nov 2023
Highly Detailed and Temporal Consistent Video Stylization via Synchronized Multi-Frame Diffusion
M. Xie
Hanyuan Liu
Chengze Li
Tien-Tsin Wong
VGen, DiffM
113
0
0
24 Nov 2023
DemoFusion: Democratising High-Resolution Image Generation With No $$$
Ruoyi Du
Dongliang Chang
Timothy M. Hospedales
Yi-Zhe Song
Zhanyu Ma
127
56
0
24 Nov 2023
Paragraph-to-Image Generation with Information-Enriched Diffusion Model
Weijia Wu
Zhuang Li
Yefei He
Mike Zheng Shou
Chunhua Shen
Lele Cheng
Yan Li
Yan Li
Di Zhang
VLM
230
25
0
24 Nov 2023
Image Super-Resolution with Text Prompt Diffusion
Zheng Chen
Yulun Zhang
Jinjin Gu
Xin Yuan
Linghe Kong
Guihai Chen
Xiaokang Yang
DiffM
152
21
0
24 Nov 2023
T-Rex: Counting by Visual Prompting
Qing Jiang
Feng Li
Tianhe Ren
Shilong Liu
Zhaoyang Zeng
Kent Yu
Lei Zhang
100
14
0
22 Nov 2023
Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
Junhao Chen
Peng Rong
Jingbo Sun
Chao Li
Xiang Li
Hongwu Lv
VLM
59
2
0
22 Nov 2023
DiffusionMat: Alpha Matting as Sequential Refinement Learning
Yangyang Xu
Shengfeng He
Wenqi Shao
Kwan-Yee K. Wong
Yu Qiao
Ping Luo
DiffM
72
3
0
22 Nov 2023
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
Kai Yang
Jian Tao
Jiafei Lyu
Chunjiang Ge
Jiaxin Chen
Qimai Li
Weihan Shen
Xiaolong Zhu
Xiu Li
EGVM
126
109
0
22 Nov 2023
Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
Mengyang Feng
Jinlin Liu
Miaomiao Cui
Xuansong Xie
65
22
0
22 Nov 2023
Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning
Xin Zhang
Jiawei Du
Yunsong Li
Weiying Xie
Qiufeng Wang
89
14
0
22 Nov 2023
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
Jiaxi Lv
Yi Huang
Mingfu Yan
Jiancheng Huang
Jianzhuang Liu
Yifan Liu
Yafei Wen
Xiaoxin Chen
Shifeng Chen
VGen, DiffM
119
25
0
21 Nov 2023
LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis
Peiang Zhao
Han Li
Ruiyang Jin
S. Kevin Zhou
DiffM
143
13
0
21 Nov 2023
AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance
Zuozhuo Dai
Zhenghao Zhang
Yao Yao
Bingxue Qiu
Siyu Zhu
Long Qin
Weizhi Wang
VGen
100
47
0
21 Nov 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
169
290
0
21 Nov 2023
Text-Guided Texturing by Synchronized Multi-View Diffusion
Yuxin Liu
M. Xie
Hanyuan Liu
Tien-Tsin Wong
DiffM
132
59
0
21 Nov 2023
Applications of Large Scale Foundation Models for Autonomous Driving
Yu Huang
Yue Chen
Zhu Li
ELM, AI4CE, LRM, ALM, LM&Ro
151
16
0
20 Nov 2023
An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis
Aishwarya Agarwal
Srikrishna Karanam
Tripti Shukla
Balaji Vasan Srinivasan
DiffM
145
19
0
20 Nov 2023
Cut-and-Paste: Subject-Driven Video Editing with Attention Control
Zhichao Zuo
Zhao Zhang
Yan Luo
Yang Zhao
Haijun Zhang
Yi Yang
Meng Wang
DiffM, VGen
61
7
0
20 Nov 2023
What's left can't be right -- The remaining positional incompetence of contrastive vision-language models
Nils Hoehing
Ellen Rushe
Anthony Ventresque
VLM
77
3
0
20 Nov 2023
LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Yixun Liang
Xin Yang
Jiantao Lin
Haodong Li
Xiaogang Xu
Yingcong Chen
3DGS
83
200
0
19 Nov 2023
AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
Wen Wang
Canyu Zhao
Hao Chen
Zhekai Chen
Kecheng Zheng
Chunhua Shen
DiffM
99
24
0
19 Nov 2023
GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise
Xinhai Li
Huaibin Wang
Kuo-Kun Tseng
3DGS
121
29
0
19 Nov 2023
MagicPose: Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
Di Chang
Yichun Shi
Quankai Gao
Jessica Fu
Hongyi Xu
Guoxian Song
Qing Yan
Yizhe Zhu
Xiao Yang
Mohammad Soleymani
DiffM, VGen
106
59
0
18 Nov 2023
Behavior Optimized Image Generation
Varun Khurana
Yaman Kumar Singla
J. Subramanian
R. Shah
Changyou Chen
Zhiqiang Xu
Balaji Krishnamurthy
EGVM
59
4
0
18 Nov 2023
Efficient Domain Adaptation via Generative Prior for 3D Infant Pose Estimation
Zhuoran Zhou
Zhongyu Jiang
Wenhao Chai
Cheng-Yen Yang
Lei Li
Lei Li
84
7
0
17 Nov 2023
SelfEval: Leveraging the discriminative nature of generative models for evaluation
Sai Saketh Rambhatla
Ishan Misra
EGVM
90
5
0
17 Nov 2023
Enhancing Object Coherence in Layout-to-Image Synthesis
Yibin Wang
Weizhong Zhang
Jianwei Zheng
Cheng Jin
DiffM
116
3
0
17 Nov 2023
Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Animesh Sinha
Bo Sun
Anmol Kalia
Arantxa Casanova
Elliot Blanchard
...
Ankit Ramchandani
Maziar Sanjabi
Sonal Gupta
Amy Bearman
Dhruv Mahajan
DiffM
69
4
0
17 Nov 2023
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Omri Avrahami
Amir Hertz
Yael Vinker
Moab Arar
Shlomi Fruchter
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
DiffM
125
36
0
16 Nov 2023
Emu Edit: Precise Image Editing via Recognition and Generation Tasks
Shelly Sheynin
Adam Polyak
Uriel Singer
Yuval Kirstain
Amit Zohar
Oron Ashual
Devi Parikh
Yaniv Taigman
87
153
0
16 Nov 2023
Single-Image 3D Human Digitization with Shape-Guided Diffusion
Badour Albahar
Shunsuke Saito
Hung-Yu Tseng
Changil Kim
Johannes Kopf
Jia-Bin Huang
DiffM
84
33
0
15 Nov 2023
FastBlend: a Powerful Model-Free Toolkit Making Video Stylization Easier
Zhongjie Duan
Chengyu Wang
Cen Chen
Weining Qian
Jun Huang
Mingyi Jin
DiffM
33
2
0
15 Nov 2023
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Yanwu Xu
Yang Zhao
Zhisheng Xiao
Tingbo Hou
217
121
0
14 Nov 2023
FIRST: A Million-Entry Dataset for Text-Driven Fashion Synthesis and Design
Zhen Huang
Yihao Li
Dong Pei
Jiapeng Zhou
Xuliang Ning
Jianlin Han
Xiaoguang Han
Xuejun Chen
93
3
0
13 Nov 2023
Music ControlNet: Multiple Time-varying Controls for Music Generation
Shih-Lun Wu
Chris Donahue
Shinji Watanabe
Nicholas J. Bryan
DiffM, MGen
104
61
0
13 Nov 2023
IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models
Zhaoyuan Yang
Zhengyang Yu
Zhiwei Xu
Jaskirat Singh
Jing Zhang
Dylan Campbell
Peter Tu
Richard Hartley
99
11
0
12 Nov 2023
ChatAnything: Facetime Chat with LLM-Enhanced Personas
Yilin Zhao
Xinbin Yuan
Shanghua Gao
Zhijie Lin
Qibin Hou
Jiashi Feng
Daquan Zhou
58
2
0
12 Nov 2023
Finetuning Text-to-Image Diffusion Models for Fairness
Xudong Shen
Chao Du
Tianyu Pang
Min Lin
Yongkang Wong
Mohan S. Kankanhalli
117
57
0
11 Nov 2023
Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
Weiyang Liu
Zeju Qiu
Yao Feng
Yuliang Xiu
Yuxuan Xue
...
Songyou Peng
Yandong Wen
Michael J. Black
Adrian Weller
Bernhard Schölkopf
104
72
0
10 Nov 2023
3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models
Haibo Yang
Yang Chen
Yingwei Pan
Ting Yao
Zhineng Chen
Tao Mei
76
20
0
09 Nov 2023
ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Jingwen Chen
Yingwei Pan
Ting Yao
Tao Mei
DiffM
105
42
0
09 Nov 2023
Control3D: Towards Controllable Text-to-3D Generation
Yang Chen
Yingwei Pan
Yehao Li
Ting Yao
Tao Mei
DiffM
97
49
0
09 Nov 2023
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Shilong Liu
Hao Cheng
Haotian Liu
Hao Zhang
Feng Li
...
Hang Su
Jun Zhu
Lei Zhang
Jianfeng Gao
Chun-yue Li
MLLM, VLM
113
126
0
09 Nov 2023
Image-Based Virtual Try-On: A Survey
Dan Song
Xuanpu Zhang
Juan Zhou
Weizhi Nie
Ruofeng Tong
Mohan Kankanhalli
Anan Liu
129
16
0
08 Nov 2023
Weakly-supervised deepfake localization in diffusion-generated images
Dragos Tantaru
Elisabeta Oneata
Dan Oneaţă
DiffM
68
25
0
08 Nov 2023
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features
Chenfeng Xu
Huan Ling
Sanja Fidler
Or Litany
97
15
0
07 Nov 2023
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Zhan Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffM, VGen
135
231
0
07 Nov 2023