ResearchTrend.AI
Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
CIA: Controllable Image Augmentation Framework Based on Stable Diffusion
Mohamed Benkedadra
Dany Rimez
Tiffanie Godelaine
Natarajan Chidambaram
Hamed Razavi Khosroshahi
Horacio Tellez
Matei Mancas
Benoît Macq
Sidi Ahmed Mahmoudi
DiffM
111
2
0
25 Nov 2024
AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity
Jili Xia
Lihuo He
Fei Gao
Peng Sun
Leida Li
Xinbo Gao
EGVM
142
1
0
25 Nov 2024
One Diffusion to Generate Them All
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
VLM
162
9
0
25 Nov 2024
Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing
Kaifeng Gao
Jiaxin Shi
Hanwang Zhang
Chunping Wang
Jun Xiao
Long Chen
VGen, DiffM
211
4
0
25 Nov 2024
Phys4DGen: Physics-Compliant 4D Generation with Multi-Material Composition Perception
Jiajing Lin
Zhenzhong Wang
Shu Jiang
Yongjie Hou
Min Jiang
VGen
153
0
0
25 Nov 2024
Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching
Yujing Sun
Caiyi Sun
Yuan Liu
Yuexin Ma
Siu-Ming Yiu
133
1
0
24 Nov 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
Hao Zhang
Yueting Zhuang
DiffM
236
29
0
24 Nov 2024
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
P. Xu
Boyuan Jiang
Xiaobin Hu
Donghao Luo
Qu He
Jing Zhang
Chengjie Wang
Yunsheng Wu
Charles Ling
Boyu Wang
227
3
0
24 Nov 2024
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
Ryugo Morita
Stanislav Frolov
Brian B. Moser
Takahiro Shirakawa
Ko Watanabe
Andreas Dengel
Jinjia Zhou
DiffM
156
0
0
23 Nov 2024
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Chaehun Shin
Jooyoung Choi
Heeseung Kim
Sungroh Yoon
DiffM
171
13
0
23 Nov 2024
Exploiting Watermark-Based Defense Mechanisms in Text-to-Image Diffusion Models for Unauthorized Data Usage
Soumil Datta
Shih-Chieh Dai
Leo Yu
Guanhong Tao
WIGM
119
0
0
22 Nov 2024
LocRef-Diffusion:Tuning-Free Layout and Appearance-Guided Generation
Fan Deng
Yaguang Wu
Xinyang Yu
Xiangjun Huang
Jian Yang
Guangyu Yan
Qiang Xu
DiffM
142
0
0
22 Nov 2024
AnyText2: Visual Text Generation and Editing With Customizable Attributes
Yuxiang Tuo
Yifeng Geng
Liefeng Bo
VLM
145
10
0
22 Nov 2024
Exploratory Study Of Human-AI Interaction For Hindustani Music
N. Shikarpur
Cheng-Zhi Anna Huang
162
0
0
21 Nov 2024
GalaxyEdit: Large-Scale Image Editing Dataset with Enhanced Diffusion Adapter
Aniruddha Bala
Rohan Jaiswal
Loay Rashid
Siddharth Roheda
120
0
0
21 Nov 2024
Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation
Seokil Ham
H. Kim
Sangmin Woo
Changick Kim
Mamba
510
0
0
21 Nov 2024
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
Ziyi Wang
Yijiao Wang
Xumin Yu
Jie Zhou
Jiwen Lu
100
0
0
20 Nov 2024
RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation
Christoph Reinders
Radu Berdan
Beril Besbinar
Junji Otsuka
Daisuke Iso
121
2
0
20 Nov 2024
Identity Preserving 3D Head Stylization with Multiview Score Distillation
Bahri Batuhan Bilecen
Ahmet Berke Gokmen
Furkan Guzelant
Aysegül Dündar
196
0
0
20 Nov 2024
Sketch-guided Cage-based 3D Gaussian Splatting Deformation
Tianhao Xie
Noam Aigerman
Eugene Belilovsky
Tiberiu Popa
3DGS
151
3
0
19 Nov 2024
Decoupling Training-Free Guided Diffusion by ADMM
Youyuan Zhang
Zehua Liu
Zenan Li
Zhaoyu Li
James J. Clark
X. Si
104
0
0
18 Nov 2024
GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts
Junwen He
Yifan Wang
Lijun Wang
Huchuan Lu
Jun-Yan He
Chong Li
Hanyuan Chen
Jin-Peng Lan
Bin Luo
Yifeng Geng
117
1
0
18 Nov 2024
Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge
Qinglong Cao
Ding Wang
Xirui Li
Yuntian Chen
Chao Ma
Xiaokang Yang
DiffM, VGen
148
2
0
18 Nov 2024
DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation
Tianyi Yan
Dongming Wu
Wencheng Han
Junpeng Jiang
Xia Zhou
Kun Zhan
Cheng-Zhong Xu
Jianbing Shen
131
7
0
18 Nov 2024
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing
Chang-Shu Liu
Rui Li
Kaidong Zhang
Yunwei Lan
Dong Liu
DiffM, VGen
87
7
0
17 Nov 2024
Constrained Diffusion with Trust Sampling
William Huang
Yifeng Jiang
Tom Van Wouwe
Chenxi Liu
90
4
0
17 Nov 2024
Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Alessandro Fontanella
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Sarah Parisot
98
2
0
16 Nov 2024
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
Hmrishav Bandyopadhyay
Yi-Zhe Song
DiffM, VGen
81
3
0
16 Nov 2024
C-DiffSET: Leveraging Latent Diffusion for SAR-to-EO Image Translation with Confidence-Guided Reliable Object Generation
Jeonghyeok Do
Jaehyup Lee
Munchurl Kim
DiffM
151
2
0
16 Nov 2024
Learning Generalizable 3D Manipulation With 10 Demonstrations
Yu Ren
Yang Cong
Ronghan Chen
Jiahao Long
SSL
108
1
0
15 Nov 2024
OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models
Mathis Koroglu
Hugo Caselles-Dupré
Guillaume Jeanneret Sanmiguel
Matthieu Cord
VGen, DiffM
52
2
0
15 Nov 2024
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Rang Meng
Xingyu Zhang
Yuming Li
Chenguang Ma
124
13
0
15 Nov 2024
Boundary Attention Constrained Zero-Shot Layout-To-Image Generation
Huancheng Chen
Jingtao Li
Weiming Zhuang
H. Vikalo
Lingjuan Lyu
DiffM
131
2
0
15 Nov 2024
Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting
Ziqi Xie
Xiao Lai
Weidong Zhao
Xianhui Liu
Wenlong Hou
163
0
0
15 Nov 2024
Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey
Xuannan Liu
Xing Cui
Peipei Li
Zekun Li
Huaibo Huang
Shuhan Xia
Miaoxuan Zhang
Yueying Zou
Ran He
AAML
154
11
0
14 Nov 2024
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Xiaofeng Wang
Kang Zhao
Fan Liu
Jiayu Wang
Guosheng Zhao
Xiaoyi Bao
Zheng Hua Zhu
Yingya Zhang
Xingang Wang
VGen
119
10
0
13 Nov 2024
Unraveling the Connections between Flow Matching and Diffusion Probabilistic Models in Training-free Conditional Generation
Kaiyu Song
Hanjiang Lai
DiffM
52
1
0
12 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
177
0
0
12 Nov 2024
All-in-one Weather-degraded Image Restoration via Adaptive Degradation-aware Self-prompting Model
Yuanbo Wen
Tao Gao
Ziqi Li
Jing Zhang
Kaihao Zhang
Ting Chen
VLM, DiffM
102
1
0
12 Nov 2024
GaussianAnything: Interactive Point Cloud Flow Matching For 3D Object Generation
Yushi Lan
Shangchen Zhou
Zhaoyang Lyu
Fangzhou Hong
Shuai Yang
Bo Dai
Xingang Pan
Chen Change Loy
3DGS
144
0
0
12 Nov 2024
Edify 3D: Scalable High-Quality 3D Asset Generation
Nvidia
:
Maciej Bala
Huayu Chen
Yin Cui
...
Shuran Song
Donglai Xiang
Lyne Tchapmi
Fangyin Wei
Qinsheng Zhang
102
7
0
11 Nov 2024
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis
Taihang Hu
Linxuan Li
Joost van de Weijer
Hongcheng Gao
Fahad Shahbaz Khan
Jian Yang
Ming-Ming Cheng
Kai Wang
Yaxing Wang
DiffM
132
9
0
11 Nov 2024
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
Nvidia
:
Yuval Atzmon
Maciej Bala
Yogesh Balaji
...
Ting-Chun Wang
Shuran Song
Fangyin Wei
Yu Zeng
Qinsheng Zhang
87
9
0
11 Nov 2024
Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantification
Jannik Franzen
Claudia Winklmayr
Vanessa Emanuela Guarino
Christoph Karg
Xiaoyan Yu
Nora Koreuber
Jan P. Albrecht
Philip Bischoff
Dagmar Kainmueller
130
0
0
11 Nov 2024
Layout Control and Semantic Guidance with Attention Loss Backward for T2I Diffusion Model
Guandong Li
DiffM
60
0
0
11 Nov 2024
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
Cong Wei
Zheyang Xiong
Weiming Ren
Xinrun Du
Ge Zhang
Wenhu Chen
176
28
0
11 Nov 2024
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Zhennan Chen
Yajie Li
Haofan Wang
Zheyu Chen
Zhengkai Jiang
Jun Yu Li
Qian Wang
Jian Yang
Ying Tai
DiffM
108
9
0
10 Nov 2024
Improving image synthesis with diffusion-negative sampling
Alakh Desai
Nuno Vasconcelos
DiffM
42
2
0
08 Nov 2024
Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet
Boxiao Yu
Kuang Gong
MedIm, AI4CE, DiffM
56
1
0
08 Nov 2024
Generalizable Single-Source Cross-modality Medical Image Segmentation via Invariant Causal Mechanisms
Boqi Chen
Yuanzhi Zhu
Yunke Ao
Sebastiano Caprara
Reto Sutter
Gunnar Rätsch
E. Konukoglu
A. Susmelj
MedIm, DiffM, OOD
96
1
0
07 Nov 2024