ResearchTrend.AI
Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Enhancing Image Layout Control with Loss-Guided Diffusion Models
Zakaria Patel
Kirill Serkh
DiffM
78
3
0
23 May 2024
Text-to-Model: Text-Conditioned Neural Network Diffusion for Train-Once-for-All Personalization
Zexi Li
Lingzhi Gao
Chao Wu
AI4CE, DiffM
131
4
0
23 May 2024
Text Prompting for Multi-Concept Video Customization by Autoregressive Generation
D. Kothandaraman
Kihyuk Sohn
Ruben Villegas
P. Voigtlaender
Dinesh Manocha
Mohammad Babaeizadeh
VGen, DiffM
59
2
0
22 May 2024
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
Ganggui Ding
Canyu Zhao
Wen Wang
Zhen Yang
Zide Liu
Hao Chen
Chunhua Shen
DiffM
84
26
0
22 May 2024
MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation
Zhiping Yu
Chenyang Liu
Liqin Liu
Z. Shi
Zhengxia Zou
VGen
73
16
0
22 May 2024
MotionCraft: Physics-based Zero-Shot Video Generation
L. S. Aira
Antonio Montanaro
Emanuele Aiello
D. Valsesia
E. Magli
DiffM, VGen
76
14
0
22 May 2024
Enhanced Creativity and Ideation through Stable Video Synthesis
Elijah Miller
Thomas Dupont
Mingming Wang
VGen
61
1
0
22 May 2024
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
N. Sebe
Mubarak Shah
EGVM
206
7
0
22 May 2024
TauAD: MRI-free Tau Anomaly Detection in PET Imaging via Conditioned Diffusion Models
Lujia Zhong
Shuo Huang
Jiaxin Yue
Jianwei Zhang
Zhiwei Deng
Wenhao Chi
Yonggang Shi
MedIm
77
0
0
21 May 2024
Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
Yue Han
Junwei Zhu
Keke He
Xu Chen
Yanhao Ge
Wei Li
Xiangtai Li
Jiangning Zhang
Chengjie Wang
Yong Liu
DiffM
115
29
0
21 May 2024
Can We Treat Noisy Labels as Accurate?
Yuxiang Zheng
Zhongyi Han
Yilong Yin
Xin Gao
Tongliang Liu
68
1
0
21 May 2024
An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation
Zhiyu Tan
Mengping Yang
Luozheng Qin
Hao Yang
Ye Qian
Qiang-feng Zhou
Cheng Zhang
Hao Li
110
6
0
21 May 2024
Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images
Xiaofei Yu
Yitong Li
Jie Ma
DiffM
88
0
0
21 May 2024
Generalize Polyp Segmentation via Inpainting across Diverse Backgrounds and Pseudo-Mask Refinement
Jiajian Ma
Fangqi Lu
Silin Huang
Song Wu
Zhen Li
DiffM, MedIm
95
0
0
21 May 2024
Multi-Subject Personalization
Arushi Jain
Shubham Paliwal
Monika Sharma
Vikram Jamwal
Lovekesh Vig
DiffM
42
1
0
21 May 2024
Bridging the Intent Gap: Knowledge-Enhanced Visual Generation
Yi Cheng
Ziwei Xu
Dongyun Lin
Harry Cheng
Yongkang Wong
Ying Sun
Joo Hwee Lim
Mohan Kankanhalli
85
0
0
21 May 2024
CustomText: Customized Textual Image Generation using Diffusion Models
Shubham Paliwal
Arushi Jain
Monika Sharma
Vikram Jamwal
Lovekesh Vig
60
1
0
21 May 2024
LAGA: Layered 3D Avatar Generation and Customization via Gaussian Splatting
Jia Gong
Shenyu Ji
Lin Geng Foo
Kang Chen
Hossein Rahmani
Jun Liu
3DGS
120
6
0
21 May 2024
EmoEdit: Evoking Emotions through Image Manipulation
Jingyuan Yang
Jiawei Feng
Weibin Luo
Dani Lischinski
Daniel Cohen-Or
Hui Huang
DiffM
84
2
0
21 May 2024
Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices
Nathaniel Cohen
Vladimir Kulikov
Matan Kleiner
Inbar Huberman-Spiegelglas
T. Michaeli
VGen, DiffM
71
17
0
20 May 2024
ViViD: Video Virtual Try-on using Diffusion Models
Zixun Fang
Wei Zhai
Aimin Su
Hongliang Song
Kai Zhu
Mao Wang
Yu Chen
Zhiheng Liu
Yang Cao
Zheng-jun Zha
DiffM, VGen
102
14
0
20 May 2024
URDFormer: A Pipeline for Constructing Articulated Simulation Environments from Real-World Images
Zoey Chen
Aaron Walsman
Marius Memmel
Kaichun Mo
Alex Fang
Karthikeya Vemuri
Alan Wu
Dieter Fox
Abhishek Gupta
AI4CE, VGen
138
32
0
19 May 2024
Diffusion-Based Hierarchical Image Steganography
You-song Xu
Xuanyu Zhang
Jiwen Yu
Chong Mou
Xiandong Meng
Jian Zhang
DiffM
71
2
0
19 May 2024
CoLay: Controllable Layout Generation through Multi-conditional Latent Diffusion
Chin-Yi Cheng
Ruiqi Gao
Forrest Huang
Yang Li
DiffM
72
2
0
18 May 2024
Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching
Xingyu Miao
Haoran Duan
Varun Ojha
Jun Song
Tejal Shah
Yang Long
R. Ranjan
131
4
0
18 May 2024
AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA
Weitao Feng
Wenbo Zhou
Jiyan He
Jie Zhang
Tianyi Wei
Guanlin Li
Tianwei Zhang
Weiming Zhang
Neng H. Yu
96
21
0
18 May 2024
Generative AI for 2D Character Animation
Jaime Guajardo
Ozgun Y. Bursalioglu
Dan B. Goldman
VGen
49
3
0
17 May 2024
From Sora What We Can See: A Survey of Text-to-Video Generation
Rui Sun
Yumin Zhang
Tejal Shah
Jiahao Sun
Shuoying Zhang
Wenqi Li
Haoran Duan
Bo Wei
R. Ranjan
EGVM
136
20
0
17 May 2024
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers
Shengyuan Yang
Jiawang Bai
Kuofeng Gao
Yong-Liang Yang
Yiming Li
Shu-Tao Xia
AAML, SILM
111
5
0
17 May 2024
LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion
Tong Chen
Qingcheng Lyu
Long Bai
Erjian Guo
Huxin Gao
Xiaoxiao Yang
Hongliang Ren
Luping Zhou
MedIm
86
5
0
17 May 2024
ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation
Pengzhi Li
Chengshuai Tang
Qinxuan Huang
Zhiheng Li
3DGS
78
12
0
17 May 2024
VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing
Binghui Chen
Chongyang Zhong
Wangmeng Xiang
Yifeng Geng
Xuansong Xie
DiffM
54
9
0
16 May 2024
NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge
Jie Liang
Radu Timofte
Qiaosi Yi
Shuai Liu
Lingchen Sun
Rongyuan Wu
Xindong Zhang
Huiyu Zeng
Lei Zhang
111
19
0
16 May 2024
Naturalistic Music Decoding from EEG Data via Latent Diffusion Models
Emilian Postolache
Natalia Polouliakh
Hiroaki Kitano
Akima Connelly
Emanuele Rodolà
Luca Cosmo
Taketo Akama
MedIm, DiffM
102
4
0
15 May 2024
Compositional Text-to-Image Generation with Dense Blob Representations
Weili Nie
Sifei Liu
Morteza Mardani
Chao Liu
Benjamin Eckart
Arash Vahdat
DiffM
129
20
0
14 May 2024
Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis
Yifan Wang
Aleksander Holynski
Brian L. Curless
Steven M. Seitz
63
2
0
13 May 2024
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator
Hanshu Yan
Xingchao Liu
Jiachun Pan
Jun Hao Liew
Qiang Liu
Jiashi Feng
138
48
0
13 May 2024
MAxPrototyper: A Multi-Agent Generation System for Interactive User Interface Prototyping
Mingyue Yuan
Jieshan Chen
Aaron Quigley
LLMAG
78
6
0
12 May 2024
Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior
Ce Wang
Wanjie Sun
DiffM
86
6
0
11 May 2024
Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation
Shengyuan Liu
Bo Wang
Ye Ma
Te Yang
Xipeng Cao
Quan Chen
Han Li
Di Dong
Peng Jiang
EGVM
80
2
0
11 May 2024
Prompt-guided Precise Audio Editing with Diffusion Models
Manjie Xu
Chenxing Li
Duzhen Zhang
Dan Su
Weihan Liang
Dong Yu
DiffM
71
7
0
11 May 2024
Non-confusing Generation of Customized Concepts in Diffusion Models
Wang Lin
Jingyuan Chen
Jiaxin Shi
Yichen Zhu
Chen Liang
...
Tao Jin
Zhou Zhao
Fei Wu
Shuicheng Yan
Hanwang Zhang
DiffM
77
14
0
11 May 2024
Distilling Diffusion Models into Conditional GANs
Minguk Kang
Richard Zhang
Connelly Barnes
Sylvain Paris
Suha Kwak
Jaesik Park
Eli Shechtman
Jun-Yan Zhu
Taesung Park
124
45
0
09 May 2024
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Peng Gao
Le Zhuo
Ziyi Lin
Ruoyi Du
Xu Luo
...
Weicai Ye
He Tong
Jingwen He
Yu Qiao
Hongsheng Li
VGen
103
91
0
09 May 2024
MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation
Yuxiang Wei
Zhilong Ji
Jinfeng Bai
Hongzhi Zhang
Lei Zhang
W. Zuo
DiffM
80
0
0
09 May 2024
FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting
Yikun Ma
Dandan Zhan
Zhi Jin
3DGS
84
10
0
09 May 2024
AI in Your Toolbox: A Plugin for Generating Renderings from 3D Models
Mingming Wang
52
1
0
09 May 2024
Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft
Debabrata Pal
Anvita Singh
Saumya Saumya
Shouvik Das
54
0
0
09 May 2024
A Survey on Personalized Content Synthesis with Diffusion Models
Xu-Lu Zhang
Xiao Wei
Wengyu Zhang
Jinlin Wu
Jiaxin Wu
Zhen Lei
Zhaoxiang Zhang
Zhen Lei
Qing Li
EGVM
236
22
0
09 May 2024
FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation
Xuehai He
Jian Zheng
Jacob Zhiyuan Fang
Robinson Piramuthu
Mohit Bansal
Vicente Ordonez
Gunnar Sigurdsson
Nanyun Peng
Xin Eric Wang
DiffM
98
1
0
08 May 2024