ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion
  Models
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models
Shuhong Zheng
Zhipeng Bao
Ruoyu Zhao
Martial Hebert
Yu-Xiong Wang
DiffM
162
0
0
07 Nov 2024
DimensionX: Create Any 3D and 4D Scenes from a Single Image with
  Controllable Video Diffusion
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Wenqiang Sun
Shuo Chen
Fan Liu
Zilong Chen
Yueqi Duan
Jun Zhang
Yikai Wang
VGen
124
41
0
07 Nov 2024
Controlling Human Shape and Pose in Text-to-Image Diffusion Models via
  Domain Adaptation
Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation
Benito Buchheim
M. Reimann
Jürgen Döllner
63
0
0
07 Nov 2024
MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait
  Generation
MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation
Han Yang
Sotiris Anagnostidis
Enis Simsar
Thomas Hofmann
DiffM
33
0
0
07 Nov 2024
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
Koichi Namekata
Sherwin Bahmani
Ziyi Wu
Yash Kant
Igor Gilitschenski
David B. Lindell
VGen
171
16
0
07 Nov 2024
HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images
HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images
Zhenyue Qin
Yiqun Zhang
Yang Liu
Dylan Campbell
DiffM
97
3
0
07 Nov 2024
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
Ashutosh Srivastava
Tarun Ram Menta
Abhinav Java
Avadhoot Jadhav
Silky Singh
Surgan Jandial
Balaji Krishnamurthy
DiffM
74
1
0
06 Nov 2024
Generalize or Detect? Towards Robust Semantic Segmentation Under
  Multiple Distribution Shifts
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
Zhitong Gao
Bingnan Li
Mathieu Salzmann
Xuming He
OODVLM
176
1
0
06 Nov 2024
ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization
ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization
Huayang Huang
Yu Wu
Qian Wang
DiffMWIGM
108
7
0
06 Nov 2024
SynthSet: Generative Diffusion Model for Semantic Segmentation in
  Precision Agriculture
SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture
Andrew Heschl
Mauricio Murillo
Keyhan Najafian
F. Maleki
DiffM
56
2
0
05 Nov 2024
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
Tariq Berrada Ifriqi
Pietro Astolfi
Melissa Hall
Reyhane Askari Hemmat
Yohann Benchetrit
...
Matthew Muckley
Karteek Alahari
Adriana Romero Soriano
Jakob Verbeek
M. Drozdzal
AI4CEVLM
139
4
0
05 Nov 2024
Exploring the Interplay Between Video Generation and World Models in
  Autonomous Driving: A Survey
Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Ao Fu
Yi Zhou
Tao Zhou
Yue Yang
Bojun Gao
Qun Li
Guobin Wu
Ling Shao
VGen
100
3
0
05 Nov 2024
LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior
LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior
Xingjian Tang
Jingwei Guan
Linge Li
Ran Shi
Youmei Zhang
Mengye Lyu
Li Yan
MedImDiffM
171
0
0
05 Nov 2024
AutoVFX: Physically Realistic Video Editing from Natural Language
  Instructions
AutoVFX: Physically Realistic Video Editing from Natural Language Instructions
Hao-Yu Hsu
Zhi-Hao Lin
Albert Zhai
Hongchi Xia
Shenlong Wang
VGen
105
11
0
04 Nov 2024
Training-free Regional Prompting for Diffusion Transformers
Training-free Regional Prompting for Diffusion Transformers
Anthony Chen
Jianjin Xu
Wenzhao Zheng
Gaole Dai
Yun Wang
Renrui Zhang
Haofan Wang
Shanghang Zhang
VLM
102
5
0
04 Nov 2024
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Wei Cheng
Juncheng Mu
Xianfang Zeng
Xin Chen
Anqi Pang
...
Zhibin Wang
Bin-Bin Fu
Gang Yu
Ziwei Liu
Liang Pan
119
12
0
04 Nov 2024
Diffusion-based Generative Multicasting with Intent-aware Semantic
  Decomposition
Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition
Xinkai Liu
Mahdi Boloursaz Mashhadi
Li Qiao
Yi Ma
Rahim Tafazolli
Mehdi Bennis
DiffM
100
2
0
04 Nov 2024
KptLLM: Unveiling the Power of Large Language Model for Keypoint
  Comprehension
KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension
Jie Yang
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chen Qian
Ruimao Zhang
MLLM
133
3
0
04 Nov 2024
DreamPolish: Domain Score Distillation With Progressive Geometry
  Generation
DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Yean Cheng
Ziqi Cai
Ming Ding
Wendi Zheng
Shiyu Huang
Yuxiao Dong
J. Tang
Boxin Shi
DiffM
59
0
0
03 Nov 2024
Infinite-Resolution Integral Noise Warping for Diffusion Models
Infinite-Resolution Integral Noise Warping for Diffusion Models
Yitong Deng
Winnie Lin
Lingxiao Li
Dmitriy Smirnov
Ryan Burgert
Ning Yu
Vincent Dedun
Mohammad H. Taghavi
65
3
0
02 Nov 2024
Pin-Tuning: Parameter-Efficient In-Context Tuning for Few-Shot Molecular
  Property Prediction
Pin-Tuning: Parameter-Efficient In-Context Tuning for Few-Shot Molecular Property Prediction
Liang Wang
Qiang Liu
Shaozhen Liu
Xin Sun
Shu Wu
Liang Wang
106
2
0
02 Nov 2024
X-Drive: Cross-modality consistent multi-sensor data synthesis for
  driving scenarios
X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios
Yichen Xie
Chenfeng Xu
C-T.John Peng
Shuqi Zhao
Nhat Ho
Alexander T. Pham
Mingyu Ding
Masayoshi Tomizuka
Weidong Zhan
DiffM
92
3
0
02 Nov 2024
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with
  Realistic Scene Modifications via Diffusion-Based Image Editing
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing
Naufal Suryanto
Andro Aprila Adiputra
Ahmada Yusril Kadiptya
Thi-Thu-Huong Le
Derry Pratama
Yongsu Kim
Howon Kim
DiffM
128
0
0
01 Nov 2024
TextDestroyer: A Training- and Annotation-Free Diffusion Method for Destroying Anomal Text from Images
TextDestroyer: A Training- and Annotation-Free Diffusion Method for Destroying Anomal Text from Images
Mengcheng Li
Mingbao Lin
Yong Li
Chia-Wen Lin
DiffM
94
0
0
01 Nov 2024
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from
  Only 2D Images
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images
Timing Yang
Yuanliang Ju
Li Yi
3DPC
95
4
0
31 Oct 2024
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided
  Mixture-of-Experts
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts
Xiang Deng
Youxin Pang
Xiaochen Zhao
Chao Xu
Lizhen Wang
Hongjiang Xiao
Shi Yan
Hongwen Zhang
Yebin Liu
DiffMVGen
68
1
0
31 Oct 2024
Disentangling Disentangled Representations: Towards Improved Latent
  Units via Diffusion Models
Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models
Youngjun Jun
Jiwoo Park
Kyobin Choo
Tae Eun Choi
Seong Jae Hwang
CoGe
119
0
0
31 Oct 2024
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
Fu Feng
Yucheng Xie
Xu Yang
Jing Wang
Xin Geng
DiffM
80
0
0
31 Oct 2024
TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation
TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation
Sunjae Yoon
Gwanhyeong Koo
Younghwan Lee
Chang D. Yoo
VGen
149
5
0
31 Oct 2024
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level
  and Fidelity-Rich Conditions in Diffusion Models
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
Shengkai Zhang
Nianhong Jiao
Tian Li
Chaojie Yang
Chenhui Xue
Boya Niu
Jun Gao
VGenVLMDiffM
62
3
0
30 Oct 2024
Prune and Repaint: Content-Aware Image Retargeting for any Ratio
Prune and Repaint: Content-Aware Image Retargeting for any Ratio
Feihong Shen
Chong Li
Yifeng Geng
Yongjian Deng
Hao Chen
58
1
0
30 Oct 2024
One Prompt to Verify Your Models: Black-Box Text-to-Image Models Verification via Non-Transferable Adversarial Attacks
One Prompt to Verify Your Models: Black-Box Text-to-Image Models Verification via Non-Transferable Adversarial Attacks
Ji Guo
Wenbo Jiang
Rui Zhang
Guoming Lu
Hongwei Li
AAML
160
0
0
30 Oct 2024
FairSkin: Fair Diffusion for Skin Disease Image Generation
FairSkin: Fair Diffusion for Skin Disease Image Generation
Ruichen Zhang
Yuguang Yao
Zhen Tan
Zechao Li
Pan Wang
Huan Liu
Jingtong Hu
Sijia Liu
Tianlong Chen
MedIm
65
0
0
29 Oct 2024
PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot
  Scene Rearrangement
PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement
Shutong Jin
Ruiyu Wang
Kuangyi Chen
Florian T. Pokorny
76
0
0
29 Oct 2024
Volumetric Conditioning Module to Control Pretrained Diffusion Models
  for 3D Medical Images
Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images
Suhyun Ahn
Wonjung Park
Jihoon Cho
Seunghyuck Park
Jinah Park
MedIm
85
0
0
29 Oct 2024
HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion
HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion
Yu Zeng
Zhiyuan Liu
Jiachen Liu
Linlin Shen
Kaijun Deng
Weizhao He
Jinbao Wang
DiffM
52
0
0
29 Oct 2024
DiffSTR: Controlled Diffusion Models for Scene Text Removal
DiffSTR: Controlled Diffusion Models for Scene Text Removal
Sanhita Pathak
V. Kaushik
Brejesh Lall
DiffM
70
0
0
29 Oct 2024
Adapting Diffusion Models for Improved Prompt Compliance and
  Controllable Image Synthesis
Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis
Deepak Sridhar
Abhishek Peri
Rohith Rachala
Nuno Vasconcelos
DiffM
64
1
0
29 Oct 2024
IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models
IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models
Hang Guo
Yawei Li
Tao Dai
Shu-Tao Xia
Luca Benini
MQ
131
2
0
29 Oct 2024
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
Yaopei Zeng
Yuanpu Cao
Bochuan Cao
Yurui Chang
Jinghui Chen
Lu Lin
DiffM
92
3
0
28 Oct 2024
MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis
MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis
Di Qiu
Zheng Chen
Rui Wang
Mingyuan Fan
Changqian Yu
Junshi Huan
Xiang Wen
VGen
96
9
0
28 Oct 2024
FreqMark: Invisible Image Watermarking via Frequency Based Optimization
  in Latent Space
FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space
Yiyang Guo
Ruizhe Li
Mude Hui
Hanzhong Guo
Chen Zhang
Chuangjian Cai
Le Wan
Shangfei Wang
92
0
0
28 Oct 2024
Novel Object Synthesis via Adaptive Text-Image Harmony
Novel Object Synthesis via Adaptive Text-Image Harmony
Zeren Xiong
Zedong Zhang
Zikun Chen
Shuo Chen
Xianrui Li
Gan Sun
Jian Yang
Jun Li
DiffM
95
4
0
28 Oct 2024
GrounDiT: Grounding Diffusion Transformers via Noisy Patch
  Transplantation
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation
Phillip Y. Lee
Taehoon Yoon
Minhyuk Sung
138
7
1
27 Oct 2024
Diff-CXR: Report-to-CXR generation through a disease-knowledge enhanced
  diffusion model
Diff-CXR: Report-to-CXR generation through a disease-knowledge enhanced diffusion model
Peng Huang
Bowen Guo
Shuyu Liang
Junhu Fu
Yuanyuan Wang
Yi Guo
DiffMMedIm
68
1
0
26 Oct 2024
Human-Object Interaction Detection Collaborated with Large
  Relation-driven Diffusion Models
Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models
Liulei Li
Wenguan Wang
Yue Yang
98
8
0
26 Oct 2024
MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and
  Benchmark for Text-to-Image Generation
MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation
Jialin Luo
Yuanzhi Wang
Ziqi Gu
Yide Qiu
Shuaizhen Yao
Fuyun Wang
Chunyan Xu
Wenhua Zhang
Dan Wang
Zhen Cui
DiffM
54
2
0
26 Oct 2024
Semantic Feature Decomposition based Semantic Communication System of
  Images with Large-scale Visual Generation Models
Semantic Feature Decomposition based Semantic Communication System of Images with Large-scale Visual Generation Models
Senran Fan
Zhicheng Bao
Chen Dong
Haotai Liang
Xiaodong Xu
Ping Zhang
DiffM
48
3
0
26 Oct 2024
ArCSEM: Artistic Colorization of SEM Images via Gaussian Splatting
ArCSEM: Artistic Colorization of SEM Images via Gaussian Splatting
Takuma Nishimura
Andreea Dogaru
Martin Oeggerli
Bernhard Egger
70
0
0
25 Oct 2024
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video
  Reconstruction
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
Z. Gong
Guangyin Bao
Qi Zhang
Zhongwei Wan
Duoqian Miao
...
Changwei Wang
Rongtao Xu
Liang Hu
Ke Liu
Yu Zhang
DiffMVGen
124
10
0
25 Oct 2024
Previous
123...181920...606162
Next