ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep
  Approach
Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach
Yaofang Liu
Y. Ren
Xiaodong Cun
Aitor Artola
Yang Liu
Tieyong Zeng
Raymond H. Chan
Jean-Michel Morel
VGenDiffM
122
3
0
04 Oct 2024
SteerDiff: Steering towards Safe Text-to-Image Diffusion Models
SteerDiff: Steering towards Safe Text-to-Image Diffusion Models
Hongxiang Zhang
Yifeng He
Hao Chen
84
5
0
03 Oct 2024
Event-Customized Image Generation
Event-Customized Image Generation
Zhen Wang
Yilei Jiang
Dong Zheng
Jun Xiao
Long Chen
DiffM
53
1
0
03 Oct 2024
Computer-aided Colorization State-of-the-science: A Survey
Computer-aided Colorization State-of-the-science: A Survey
Yu Cao
Xin Duan
Xiangqiao Meng
P. Y. Mok
Ping Li
Tong-Yee Lee
71
0
0
03 Oct 2024
ControlAR: Controllable Image Generation with Autoregressive Models
ControlAR: Controllable Image Generation with Autoregressive Models
Zongming Li
Tianheng Cheng
Shoufa Chen
Peize Sun
Haocheng Shen
Longjin Ran
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
DiffM
248
19
0
03 Oct 2024
FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models
FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models
Zhipei Xu
Xuanyu Zhang
Runyi Li
Zecheng Tang
Qing Huang
Jian Zhang
AAML
136
24
0
03 Oct 2024
MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation
MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation
T. Pham
Tri Ton
Chang D. Yoo
105
3
0
03 Oct 2024
RDEIC: Accelerating Diffusion-Based Extreme Image Compression with Relay Residual Diffusion
RDEIC: Accelerating Diffusion-Based Extreme Image Compression with Relay Residual Diffusion
Zhiyuan Li
Yanhui Zhou
Hao Wei
Chenyang Ge
Ajmal Mian
DiffM
100
0
0
03 Oct 2024
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Yuqing Wang
Tianwei Xiong
Daquan Zhou
Zhijie Lin
Yang Zhao
Bingyi Kang
Jiashi Feng
Xihui Liu
VGen
163
35
0
03 Oct 2024
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
Rinon Gal
Adi Haviv
Yuval Alaluf
Amit H. Bermano
Daniel Cohen-Or
Gal Chechik
DiffM
56
6
0
02 Oct 2024
Harnessing the Latent Diffusion Model for Training-Free Image Style
  Transfer
Harnessing the Latent Diffusion Model for Training-Free Image Style Transfer
Kento Masui
Mayu Otani
Masahiro Nomura
Hideki Nakayama
DiffM
53
1
0
02 Oct 2024
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models
Pouyan Navard
Amin Karimi Monsefi
Mengxi Zhou
Wei-Lun Chao
Alper Yilmaz
R. Ramnath
DiffM
133
3
0
02 Oct 2024
Khattat: Enhancing Readability and Concept Representation of Semantic
  Typography
Khattat: Enhancing Readability and Concept Representation of Semantic Typography
Ahmed Hussein
Alaa Elsetohy
Sama Hadhoud
Tameem Bakr
Yasser Rohaim
Badr AlKhamissi
VLM
83
0
0
01 Oct 2024
AVID: Adapting Video Diffusion Models to World Models
AVID: Adapting Video Diffusion Models to World Models
Marc Rigter
Tarun Gupta
Agrin Hilmkil
Chao Ma
VGen
75
8
0
01 Oct 2024
Scene Graph Disentanglement and Composition for Generalizable Complex
  Image Generation
Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation
Yunnan Wang
Ziqiang Li
Zequn Zhang
Wenyao Zhang
Baao Xie
Xihui Liu
Wenjun Zeng
Xin Jin
CoGeDiffM
68
3
0
01 Oct 2024
AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures
  in Robotic Manipulation
AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation
Jiafei Duan
Wilbert Pumacay
Nishanth Kumar
Yi Ru Wang
Shulin Tian
Wentao Yuan
Ranjay Krishna
Dieter Fox
Ajay Mandlekar
Yijie Guo
VLMLRM
117
29
0
01 Oct 2024
TFCT-I2P: Three stream fusion network with color aware transformer for
  image-to-point cloud registration
TFCT-I2P: Three stream fusion network with color aware transformer for image-to-point cloud registration
Muyao Peng
Pei An
Zichen Wan
You Yang
Qiong Liu
3DPC
96
0
0
01 Oct 2024
SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D
  Semantic MPIs
SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs
Leheng Li
Weichao Qiu
Yingjie Cai
Xu Yan
Qing Lian
Bingbing Liu
Ying-Cong Chen
DiffM
83
4
0
01 Oct 2024
RadGazeGen: Radiomics and Gaze-guided Medical Image Generation using
  Diffusion Models
RadGazeGen: Radiomics and Gaze-guided Medical Image Generation using Diffusion Models
Moinak Bhattacharya
Gagandeep Singh
Shubham Jain
Prateek Prasanna
MedImDiffM
112
2
0
01 Oct 2024
ACE: All-round Creator and Editor Following Instructions via Diffusion
  Transformer
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer
Zhen Han
Zeyinzi Jiang
Yulin Pan
Jingfeng Zhang
Chaojie Mao
Chenwei Xie
Yu Liu
Jingren Zhou
DiffM
98
21
0
30 Sep 2024
PerCo (SD): Open Perceptual Compression
PerCo (SD): Open Perceptual Compression
Nikolai Korber
Eduard Kromer
Andreas Siebert
S. Hauke
Daniel Mueller-Gritschneder
Björn Schuller
71
5
0
30 Sep 2024
UIR-LoRA: Achieving Universal Image Restoration through Multiple
  Low-Rank Adaptation
UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation
Cheng Zhang
Dong Gong
Jiumei He
Yu Zhu
Jinqiu Sun
Yanning Zhang
AI4CE
74
0
0
30 Sep 2024
Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We
  Learn How Vision-Language Models Function
Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function
Chenyi Zhuang
Ying Hu
Pan Gao
DiffMVLM
110
11
0
30 Sep 2024
Illustrious: an Open Advanced Illustration Model
Illustrious: an Open Advanced Illustration Model
Sang Hyun Park
Jun Young Koh
Junha Lee
Joy Song
Dongha Kim
Hoyeon Moon
Hyunju Lee
Min Song
VLM
51
1
0
30 Sep 2024
Replace Anyone in Videos
Replace Anyone in Videos
Xiang Wang
Shiwei Zhang
Haonan Qiu
Ruihang Chu
Zekun Li
Yuanxing Zhang
Changxin Gao
Yuehuan Wang
Chunhua Shen
Nong Sang
VGenDiffM
123
1
0
30 Sep 2024
Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model
Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model
Fulong Ma
Weiqing Qi
Guoyang Zhao
Ming Liu
Jun Ma
DiffM
123
0
0
30 Sep 2024
High Quality Human Image Animation using Regional Supervision and Motion
  Blur Condition
High Quality Human Image Animation using Regional Supervision and Motion Blur Condition
Zhongcong Xu
Chaoyue Song
Guoxian Song
Jianfeng Zhang
Jun Hao Liew
...
You Xie
Linjie Luo
Guosheng Lin
Jiashi Feng
Mike Zheng Shou
DiffM3DHVGen
114
3
0
29 Sep 2024
Conditional Image Synthesis with Diffusion Models: A Survey
Conditional Image Synthesis with Diffusion Models: A Survey
Zheyuan Zhan
Defang Chen
Jian-Ping Mei
Zhenghe Zhao
Jiawei Chen
Chun-Yen Chen
Siwei Lyu
Can Wang
VLM
109
10
0
28 Sep 2024
Pruning then Reweighting: Towards Data-Efficient Training of Diffusion
  Models
Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models
Yize Li
Yihua Zhang
Sijia Liu
Xue Lin
96
5
0
27 Sep 2024
Fusion is all you need: Face Fusion for Customized Identity-Preserving
  Image Synthesis
Fusion is all you need: Face Fusion for Customized Identity-Preserving Image Synthesis
Salaheldin Mohamed
Dong Han
Yong Li
57
1
0
27 Sep 2024
Learning from Pattern Completion: Self-supervised Controllable
  Generation
Learning from Pattern Completion: Self-supervised Controllable Generation
Zhiqiang Chen
Guofan Fan
Jinying Gao
Lei Ma
Bo Lei
Tiejun Huang
Shan Yu
52
0
0
27 Sep 2024
Simulating Dynamic Tumor Contrast Enhancement in Breast MRI using Conditional Generative Adversarial Networks
Simulating Dynamic Tumor Contrast Enhancement in Breast MRI using Conditional Generative Adversarial Networks
Richard Osuala
Smriti Joshi
Apostolia Tsirikoglou
Lidia Garrucho
Walter H. L. Pinaya
Daniel M. Lang
Julia A. Schnabel
Oliver Díaz
Karim Lekadir
MedIm
138
0
0
27 Sep 2024
DeBaRA: Denoising-Based 3D Room Arrangement Generation
DeBaRA: Denoising-Based 3D Room Arrangement Generation
Léopold Maillard
Nicolas Sereyjol-Garros
Tom Durand
Maks Ovsjanikov
DiffM3DV
92
5
0
26 Sep 2024
Trustworthy Text-to-Image Diffusion Models: A Timely and Focused Survey
Trustworthy Text-to-Image Diffusion Models: A Timely and Focused Survey
Yi Zhang
Zhen Chen
Chih-Hong Cheng
Wenjie Ruan
Xiaowei Huang
Dezong Zhao
David Flynn
Siddartha Khastgir
Xingyu Zhao
MedIm
97
4
0
26 Sep 2024
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity
  Refiner
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner
Wenliang Zhao
Minglei Shi
Xumin Yu
Jie Zhou
Jiwen Lu
53
2
0
26 Sep 2024
Stable Video Portraits
Stable Video Portraits
Mirela Ostrek
Justus Thies
VGenDiffM
80
1
0
26 Sep 2024
FreeEdit: Mask-free Reference-based Image Editing with Multi-modal
  Instruction
FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction
Runze He
Kai Ma
Linjiang Huang
Shaofei Huang
Jialin Gao
Xiaoming Wei
Jiao Dai
Jizhong Han
Si Liu
DiffM
78
9
0
26 Sep 2024
PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless
  Imaging
PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging
Xin Cai
Zhiyuan You
Hailong Zhang
Wentao Liu
Liang Feng
Tianfan Xue
DiffM
79
5
0
26 Sep 2024
Self-Distilled Depth Refinement with Noisy Poisson Fusion
Self-Distilled Depth Refinement with Noisy Poisson Fusion
Jiaqi Li
Yiran Wang
Jinghong Zheng
Zihao Huang
Ke Xian
Zhiguo Cao
Jianming Zhang
110
2
0
26 Sep 2024
Physics-aligned Schrödinger bridge
Physics-aligned Schrödinger bridge
Zeyu Li
Hongkun Dou
Shen Fang
Wang Han
Yue Deng
Lijun Yang
AI4CEDiffM
52
0
0
26 Sep 2024
Appearance Blur-driven AutoEncoder and Motion-guided Memory Module for
  Video Anomaly Detection
Appearance Blur-driven AutoEncoder and Motion-guided Memory Module for Video Anomaly Detection
Jiahao Lyu
Minghua Zhao
Jing Hu
Xuewen Huang
Shuangli Du
Cheng Shi
Zhiyong Lv
44
1
0
26 Sep 2024
Pixel-Space Post-Training of Latent Diffusion Models
Pixel-Space Post-Training of Latent Diffusion Models
Christina Zhang
Simran Motwani
Matthew Yu
Ji Hou
Felix Juefei-Xu
Sam S. Tsai
Peter Vajda
Zijian He
Jialiang Wang
51
2
0
26 Sep 2024
StackGen: Generating Stable Structures from Silhouettes via Diffusion
StackGen: Generating Stable Structures from Silhouettes via Diffusion
Luzhe Sun
Takuma Yoneda
Samuel Wheeler
Tianchong Jiang
Matthew R. Walter
DiffM
180
1
0
26 Sep 2024
DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D
  Diffusion
DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion
Yukun Huang
Jianan Wang
Ailing Zeng
Zheng-Jun Zha
Lei Zhang
Xihui Liu
3DGS
89
7
0
25 Sep 2024
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with
  Adaptive Guidance Scaling
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Kyuheon Jung
Yongdeuk Seo
Seongwoo Cho
Jaeyoung Kim
Hyun-seok Min
Sungchul Choi
33
1
0
25 Sep 2024
Skyeyes: Ground Roaming using Aerial View Images
Skyeyes: Ground Roaming using Aerial View Images
Zhiyuan Gao
Wenbin Teng
Gonglin Chen
Jinsen Wu
Ningli Xu
R. Qin
Andrew Feng
Yajie Zhao
VGen
90
2
0
25 Sep 2024
GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design
GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design
Phillip Mueller
Sebastian Mueller
Lars Mikelsons
123
2
0
25 Sep 2024
Compressed Depth Map Super-Resolution and Restoration: AIM 2024
  Challenge Results
Compressed Depth Map Super-Resolution and Restoration: AIM 2024 Challenge Results
Marcos V. Conde
Florin-Alexandru Vasluianu
Jinhui Xiong
Wei Ye
Rakesh Ranjan
Radu Timofte
SupRMDE
65
7
0
24 Sep 2024
The Roles of Generative Artificial Intelligence in Internet of Electric
  Vehicles
The Roles of Generative Artificial Intelligence in Internet of Electric Vehicles
Hanwen Zhang
Dusit Niyato
Wei Zhang
Changyuan Zhao
Hongyang Du
Abbas Jamalipour
Sumei Sun
Yiyang Pei
AI4CE
70
2
0
24 Sep 2024
ImPoster: Text and Frequency Guidance for Subject Driven Action
  Personalization using Diffusion Models
ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models
D. Kothandaraman
Kuldeep Kulkarni
Sumit Shekhar
Balaji Vasan Srinivasan
Dinesh Manocha
DiffM
95
1
0
24 Sep 2024
Previous
123...212223...606162
Next