ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
Long Context Tuning for Video Generation
Yuwei Guo
Ceyuan Yang
Ziyan Yang
Zhibei Ma
Zhijie Lin
Zhenheng Yang
Dahua Lin
Lu Jiang
DiffMVGen
163
17
0
13 Mar 2025
Distilling Diversity and Control in Diffusion Models
Rohit Gandikota
David Bau
98
4
0
13 Mar 2025
Learning a Unified Degradation-aware Representation Model for Multi-modal Image Fusion
Haolong Ma
Hui Li
Chunyang Cheng
Zeyang Zhang
Xiaoning Song
Xiao Wu
125
1
0
13 Mar 2025
Adding Additional Control to One-Step Diffusion with Joint Distribution Matching
Yihong Luo
Tianyang Hu
Yifan Song
Jiacheng Sun
Zechao Li
Jing Tang
DiffM
146
1
0
13 Mar 2025
PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation
Sen Wang
Dongliang Zhou
Liang Xie
Chao Xu
Ye Yan
Erwei Yin
DiffM
154
3
0
13 Mar 2025
A Self-supervised Motion Representation for Portrait Video Generation
A Self-supervised Motion Representation for Portrait Video Generation
Qiyuan Zhang
Chenyu Wu
Wenzhang Sun
Huaize Liu
Donglin Di
Wei Chen
Changqing Zou
VGen
111
0
0
13 Mar 2025
RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors
Avinash Paliwal
Xilong Zhou
Wei Ye
J. Xiong
Rakesh Ranjan
N. Kalantari
DiffM3DGS
61
0
0
13 Mar 2025
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
Rongyao Fang
Chengqi Duan
Kun Wang
Linjiang Huang
Hao Li
...
Xingyu Zeng
R. Zhao
Jifeng Dai
Xihui Liu
Hongsheng Li
MLLMReLMLRM
165
23
0
13 Mar 2025
Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Hanyang Zhao
Haoxian Chen
Yucheng Guo
Genta Indra Winata
Tingting Ou
Ziyu Huang
D. Yao
Wenpin Tang
130
0
0
13 Mar 2025
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
Yijing Lin
Mengqi Huang
Shuhan Zhuang
Zhendong Mao
VGen
99
3
0
13 Mar 2025
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Hyeonho Jeong
Suhyeon Lee
Jong Chul Ye
VGen
490
2
0
12 Mar 2025
Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models
Sangwon Jang
June Suk Choi
Jaehyeong Jo
Kimin Lee
Sung Ju Hwang
DiffMWIGM
116
1
0
12 Mar 2025
Monte Carlo Diffusion for Generalizable Learning-Based RANSAC
Jiadong Wang
Chen Zhao
Wei Ke
Tong Zhang
DiffM
94
0
0
12 Mar 2025
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Junsong Chen
Shuchen Xue
Yuyang Zhao
Jincheng Yu
Sayak Paul
Junyu Chen
Han Cai
Enze Xie
Enze Xie
VLM
177
10
0
12 Mar 2025
CoRe^2: Collect, Reflect and Refine to Generate Better and Faster
Shitong Shao
Zikai Zhou
Dian Xie
Yuetong Fang
Tian Ye
Lichen Bai
Zeke Xie
DiffMVLM
140
0
0
12 Mar 2025
On the Limitations of Vision-Language Models in Understanding Image Transforms
Ahmad Mustafa Anis
Hasnain Ali
Saquib Sarfraz
VLM
Presented at ResearchTrend Connect | VLM on 28 Mar 2025
208
0
0
12 Mar 2025
Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows
Chengyue Gong
Xiaoyu Li
Yingyu Liang
Jiangxuan Long
Zhenmei Shi
Zhao Song
Yu Tian
138
3
0
12 Mar 2025
PerCoV2: Improved Ultra-Low Bit-Rate Perceptual Image Compression with Implicit Hierarchical Masked Image Modeling
Nikolai Korber
Eduard Kromer
Andreas Siebert
S. Hauke
Daniel Mueller-Gritschneder
Björn Schuller
94
0
0
12 Mar 2025
Close-up-GS: Enhancing Close-Up View Synthesis in 3D Gaussian Splatting with Progressive Self-Training
Jiatong Xia
Lingqiao Liu
3DGS
113
0
0
12 Mar 2025
Unified Dense Prediction of Video Diffusion
Lehan Yang
Lu Qi
Xianrui Li
Sheng Li
Varun Jampani
Ming-Hsuan Yang
MDEVOSVGen
135
0
0
12 Mar 2025
Zero-Shot Subject-Centric Generation for Creative Application Using Entropy Fusion
Kaifeng Zou
Xiaoyi Feng
Peng Wang
Tao Huang
Zizhou Huang
Zhang Haihang
Yuntao Zou
Dagang Li
DiffM
110
0
0
12 Mar 2025
Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space
Yifan Zhou
Zeqi Xiao
Shuai Yang
Xingang Pan
137
3
0
12 Mar 2025
FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model
Jiahao Xia
Yutao Hu
Yaolei Qi
Zechao Li
Wenqi Shao
Junjun He
Ying Fu
Longjiang Zhang
Guanyu Yang
DiffMMedIm
73
0
0
12 Mar 2025
Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets
H. Kniesel
Pedro Hermosilla
Timo Ropinski
120
0
0
12 Mar 2025
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Haoxuan Wang
Jinlong Peng
Qu He
Hao Yang
Ying Jin
...
Yanjie Pan
Zhenye Gan
M. Chi
Bo Peng
Yun Wang
DiffM
103
2
0
12 Mar 2025
Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
Héctor Laria
Alexandra Gomez-Villa
Jiang Qin
Muhammad Atif Butt
Bogdan Raducanu
Javier Vázquez-Corral
Joost van de Weijer
Kai Wang
DiffM
106
1
0
12 Mar 2025
InteractEdit: Zero-Shot Editing of Human-Object Interactions in Images
Jiun Tian Hoe
Weipeng Hu
Wei Zhou
Chao Xie
Ziwei Wang
Chee Seng Chan
Xudong Jiang
Y. Tan
121
0
0
12 Mar 2025
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
Yucheng Suo
Fan Ma
Kaixin Shen
Linchao Zhu
Yi Yang
VLM
86
0
0
12 Mar 2025
I2V3D: Controllable image-to-video generation with 3D guidance
Zhiyuan Zhang
DongDong Chen
J. Liao
VGen
113
1
0
12 Mar 2025
VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion
Lehan Yang
Jincen Song
Tianlong Wang
Daiqing Qi
Weili Shi
Yuheng Liu
Sheng Li
DiffMVOSVGen
131
0
0
11 Mar 2025
OminiControl2: Efficient Conditioning for Diffusion Transformers
Zhenxiong Tan
Qiaochu Xue
Xingyi Yang
Songhua Liu
Xinchao Wang
DiffM
103
4
0
11 Mar 2025
Learning to Match Unpaired Data with Minimum Entropy Coupling
Mustapha Bounoua
Giulio Franzese
Pietro Michiardi
120
0
0
11 Mar 2025
Multimodal Generation of Animatable 3D Human Models with AvatarForge
Xinhang Liu
Yu-Wing Tai
Chi-Keung Tang
99
0
0
11 Mar 2025
TSCnet: A Text-driven Semantic-level Controllable Framework for Customized Low-Light Image Enhancement
Miao Zhang
Jun Yin
Pengyu Zeng
Yiqing Shen
Shuai Lu
Xueqian Wang
DiffM
167
14
0
11 Mar 2025
Preference-Based Alignment of Discrete Diffusion Models
Preference-Based Alignment of Discrete Diffusion Models
Umberto Borso
Davide Paglieri
Jude Wells
Tim Rocktaschel
108
3
0
11 Mar 2025
Identity Preserving Latent Diffusion for Brain Aging Modeling
Gexin Huang
Zhangsihao Yang
Yalin Wang
Guido Gerig
Mengwei Ren
Xiaoxiao Li
MedImDiffM
152
0
0
11 Mar 2025
MGHanD: Multi-modal Guidance for authentic Hand Diffusion
Taehyeon Eum
Jieun Choi
Tae-Kyun Kim
85
1
0
11 Mar 2025
MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution
Xiaochen Li
Jianlong Wu
Xinchuan Huang
C. L. Philip Chen
Weili Guan
Xian-Sheng Hua
Liqiang Nie
DiffM
81
0
0
11 Mar 2025
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention
Yuhan Wang
Fangzhou Hong
Shuai Yang
Liming Jiang
Wayne Wu
Chen Change Loy
VGen
128
1
0
11 Mar 2025
Adv-CPG: A Customized Portrait Generation Framework with Facial Adversarial Attacks
Junying Wang
Hongyuan Zhang
Yuan Yuan
AAMLPICV
137
2
0
11 Mar 2025
Efficient Distillation of Classifier-Free Guidance using Adapters
Cristian Perez Jensen
Seyedmorteza Sadat
96
1
0
10 Mar 2025
Inversion-Free Video Style Transfer with Trajectory Reset Attention Control and Content-Style Bridging
Jiang Lin
Zili Yi
DiffMVGen
73
0
0
10 Mar 2025
DreamRelation: Relation-Centric Video Customization
Yujie Wei
Shiwei Zhang
Hangjie Yuan
Biao Gong
Longxiang Tang
...
Haonan Qiu
Hengjia Li
Shuai Tan
Yize Zhang
Hongming Shan
VGen
120
1
0
10 Mar 2025
AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models
Bo Huang
Wenlun Xu
Qizhuo Han
Haodong Jing
Ying Li
DiffM
94
0
0
10 Mar 2025
Synchronized Video-to-Audio Generation via Mel Quantization-Continuum Decomposition
Juncheng Wang
Chao Xu
Cheng Yu
Lei Shang
Zhe Hu
Shujun Wang
Liefeng Bo
DiffMVGen
94
0
0
10 Mar 2025
Keeping Representation Similarity in Finetuning for Medical Image Analysis
Wenqiang Zu
Shenghao Xie
Hao Chen
Yiming Liang
Lei Ma
MedImOOD
139
0
0
10 Mar 2025
Balanced Image Stylization with Style Matching Score
Yuxin Jiang
Liming Jiang
Shuai Yang
Jia-Wei Liu
Ivor Tsang
Mike Zheng Shou
DiffM
130
0
0
10 Mar 2025
Interactive Tumor Progression Modeling via Sketch-Based Image Editing
Gexin Huang
Ruinan Jin
Yucheng Tang
Can Zhao
Tatsuya Harada
Xiaoxiao Li
Gu Lin
MedIm
88
2
0
10 Mar 2025
Temporal Triplane Transformers as Occupancy World Models
Temporal Triplane Transformers as Occupancy World Models
Haoran Xu
Peixi Peng
Guang Tan
Yiqian Chang
Yisen Zhao
Yonghong Tian
182
2
0
10 Mar 2025
TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision
Shaobin Zhuang
Yiwei Guo
Yanbo Ding
Kunchang Li
Xinyuan Chen
Yaohui Wang
Fangyikang Wang
Ying Zhang
Chen Li
Yijiao Wang
84
1
0
10 Mar 2025
Previous
123...101112...606162
Next