ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
Pose-Diversified Augmentation with Diffusion Model for Person
  Re-Identification
Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification
Ines Hyeonsu Kim
Joungbin Lee
Soowon Son
Woojeong Jin
Kyusun Cho
...
Min-Seop Kwak
Seokju Cho
Jeongyeol Baek
Byeongwon Lee
Seungryong Kim
62
1
0
23 Jun 2024
Identifying and Solving Conditional Image Leakage in Image-to-Video
  Diffusion Model
Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model
Min Zhao
Hongzhou Zhu
Chendong Xiang
Kaiwen Zheng
Chongxuan Li
Jun Zhu
122
11
0
22 Jun 2024
Image Conductor: Precision Control for Interactive Video Synthesis
Image Conductor: Precision Control for Interactive Video Synthesis
Yaowei Li
Xintao Wang
Zhaoyang Zhang
Zhouxia Wang
Ziyang Yuan
Liangbin Xie
Yuexian Zou
Ying Shan
VGen
117
27
0
21 Jun 2024
VividDreamer: Towards High-Fidelity and Efficient Text-to-3D Generation
VividDreamer: Towards High-Fidelity and Efficient Text-to-3D Generation
Zixuan Chen
Ruijie Su
Jiahao Zhu
Lingxiao Yang
Jian-Huang Lai
Xiaohua Xie
DiffM
74
1
0
21 Jun 2024
Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models
Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models
Matthew Zheng
Enis Simsar
Hidir Yesiltepe
Federico Tombari
Joel Simon
Pinar Yanardag
138
4
0
20 Jun 2024
Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation
Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation
Eyal Michaeli
Ohad Fried
119
1
0
20 Jun 2024
Splatter a Video: Video Gaussian Representation for Versatile Processing
Splatter a Video: Video Gaussian Representation for Versatile Processing
Yang-tian Sun
Yi-Hua Huang
Lin Ma
Xiaoyang Lyu
Yan-Pei Cao
Xiaojuan Qi
3DGS
82
9
0
19 Jun 2024
4K4DGen: Panoramic 4D Generation at 4K Resolution
4K4DGen: Panoramic 4D Generation at 4K Resolution
Renjie Li
Panwang Pan
Bangbang Yang
Dejia Xu
Shijie Zhou
Xuanyang Zhang
Zeming Li
A. Kadambi
Zhangyang Wang
Zhiwen Fan
VGen
129
21
0
19 Jun 2024
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
Haruo Fujiwara
Yusuke Mukuta
Tatsuya Harada
100
5
0
19 Jun 2024
AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric
  Conditioned Diffusion Models
AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric Conditioned Diffusion Models
Ken Chen
Sachith Seneviratne
Wei Wang
Dongting Hu
Sanjay Saha
Md. Tarek Hasan
Sanka Rasnayaka
T. Malepathirana
Mingming Gong
Saman K. Halgamuge
DiffM
40
2
0
19 Jun 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation
Neural Residual Diffusion Models for Deep Scalable Vision Generation
Zhiyuan Ma
Liangliang Zhao
Biqing Qi
Bowen Zhou
DiffM
125
4
0
19 Jun 2024
Extracting Training Data from Unconditional Diffusion Models
Extracting Training Data from Unconditional Diffusion Models
Yunhao Chen
Xingjun Ma
Difan Zou
Yu-Gang Jiang
104
5
0
18 Jun 2024
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation
  Models
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models
Yongtao Ge
Guangkai Xu
Zhiyue Zhao
Libo Sun
Zheng Huang
Yanlong Sun
Hao Chen
Chunhua Shen
MDE
80
3
0
18 Jun 2024
Unmasking the Veil: An Investigation into Concept Ablation for Privacy
  and Copyright Protection in Images
Unmasking the Veil: An Investigation into Concept Ablation for Privacy and Copyright Protection in Images
Shivank Garg
Manyana Tiwari
57
2
0
18 Jun 2024
Neural Approximate Mirror Maps for Constrained Diffusion Models
Neural Approximate Mirror Maps for Constrained Diffusion Models
Berthy Feng
Ricardo Baptista
Katherine Bouman
MedImDiffM
136
4
0
18 Jun 2024
ARTIST: Improving the Generation of Text-rich Images by Disentanglement
ARTIST: Improving the Generation of Text-rich Images by Disentanglement
Jianyi Zhang
Yufan Zhou
Jiuxiang Gu
Curtis Wigington
Tong Yu
Yiran Chen
Tong Sun
Ruiyi Zhang
130
0
0
17 Jun 2024
AnyMaker: Zero-shot General Object Customization via Decoupled
  Dual-Level ID Injection
AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection
Lingjie Kong
Kai WU
Xiaobin Hu
Wenhui Han
Jinlong Peng
Chengming Xu
Donghao Luo
Jiangning Zhang
Chengjie Wang
Yanwei Fu
DiffM
74
0
0
17 Jun 2024
ChildDiffusion: Unlocking the Potential of Generative AI and
  Controllable Augmentations for Child Facial Data using Stable Diffusion and
  Large Language Models
ChildDiffusion: Unlocking the Potential of Generative AI and Controllable Augmentations for Child Facial Data using Stable Diffusion and Large Language Models
Muhammad Ali Farooq
Wang Yao
Peter Corcoran
71
1
0
17 Jun 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models
AnyTrans: Translate AnyText in the Image with Large Scale Models
Zhipeng Qian
Pei Zhang
Baosong Yang
Kai Fan
Yiwei Ma
Derek F. Wong
Xiaoshuai Sun
Rongrong Ji
VLM
87
2
0
17 Jun 2024
Consistency^2: Consistent and Fast 3D Painting with Latent Consistency
  Models
Consistency^2: Consistent and Fast 3D Painting with Latent Consistency Models
Tianfu Wang
Anton Obukhov
Konrad Schindler
DiffM
48
1
0
17 Jun 2024
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh
Reza Shirkavand
Shangqian Gao
Heng Huang
DiffMVLM
151
5
0
17 Jun 2024
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Yulai Zhao
Masatoshi Uehara
Gabriele Scalia
Tommaso Biancalani
Sergey Levine
Ehsan Hajiramezanali
Ehsan Hajiramezanali
AI4CE
181
7
0
17 Jun 2024
Exploiting Diffusion Prior for Out-of-Distribution Detection
Exploiting Diffusion Prior for Out-of-Distribution Detection
Armando Zhu
Jiabei Liu
Keqin Li
Shuying Dai
Bo Hong
Peng Zhao
Changsong Wei
114
9
0
16 Jun 2024
An Analysis on Quantizing Diffusion Transformers
An Analysis on Quantizing Diffusion Transformers
Yuewei Yang
Jialiang Wang
Xiaoliang Dai
Peizhao Zhang
Hongbo Zhang
MQ
111
1
0
16 Jun 2024
ViD-GPT: Introducing GPT-style Autoregressive Generation in Video
  Diffusion Models
ViD-GPT: Introducing GPT-style Autoregressive Generation in Video Diffusion Models
Kaifeng Gao
Jiaxin Shi
Hanwang Zhang
Chunping Wang
Jun Xiao
DiffMVGen
131
15
0
16 Jun 2024
Joint Audio and Symbolic Conditioning for Temporally Controlled
  Text-to-Music Generation
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation
Or Tal
Alon Ziv
Itai Gat
Felix Kreuk
Yossi Adi
88
17
0
16 Jun 2024
Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions
  with HSI-Diffusion for the FINCH Spacecraft
Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft
Ian Vyse
Rishit Dagli
Dav Vrat Chadha
John P. Ma
Hector Chen
...
Iliya Shofman
Coby Silayan
Reid Sox-Harris
Shuhan Zheng
Khang Nguyen
75
0
0
15 Jun 2024
GenMM: Geometrically and Temporally Consistent Multimodal Data
  Generation for Video and LiDAR
GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR
Bharat Singh
Viveka Kulharia
Luyu Yang
Avinash Ravichandran
Ambrish Tyagi
Ashish Shrivastava
VGen
94
2
0
15 Jun 2024
SatDiffMoE: A Mixture of Estimation Method for Satellite Image
  Super-resolution with Latent Diffusion Models
SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-resolution with Latent Diffusion Models
Zhaoxu Luo
Bowen Song
Liyue Shen
73
1
0
14 Jun 2024
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Xiang Li
Kai Qiu
Hao Chen
Jason Kuen
Zhe Lin
Rita Singh
Bhiksha Raj
DiffM
93
27
0
14 Jun 2024
Composing Parts for Expressive Object Generation
Composing Parts for Expressive Object Generation
Harsh Rangwani
Aishwarya Agarwal
Kuldeep Kulkarni
R. Venkatesh Babu
Srikrishna Karanam
DiffM
110
2
0
14 Jun 2024
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models
  in Decision Making
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yi Ma
Pengyi Li
Yan Zheng
DiffM
99
17
0
13 Jun 2024
Depth Anything V2
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
DiffMVLMMDE
131
437
0
13 Jun 2024
Interpreting the Weight Space of Customized Diffusion Models
Interpreting the Weight Space of Customized Diffusion Models
Amil Dravid
Yossi Gandelsman
Kuan-Chieh Wang
Rameen Abdal
Gordon Wetzstein
Alexei A. Efros
Kfir Aberman
98
12
0
13 Jun 2024
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
Roman Bachmann
Oğuzhan Fatih Kar
David Mizrahi
Ali Garjani
Mingfei Gao
David Griffiths
Jiaming Hu
Afshin Dehghan
Amir Zamir
MoEVLMMLLM
113
17
0
13 Jun 2024
ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene
  Editing
ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing
Jun-Kun Chen
Samuel Rota Buló
Norman Muller
Lorenzo Porzi
Peter Kontschieder
Yu-Xiong Wang
DiffM
89
9
0
13 Jun 2024
Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D
  Diffusion
Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion
Linzhan Mou
Jun-Kun Chen
Yu-Xiong Wang
VGenDiffM
133
11
0
13 Jun 2024
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
Junke Wang
Yi Jiang
Zehuan Yuan
Binyue Peng
Zuxuan Wu
Yu-Gang Jiang
ViTVGen
124
46
0
13 Jun 2024
Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion
  Prior
Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior
Baiang Li
Sizhuo Ma
Yanhong Zeng
Xiaogang Xu
Youqing Fang
Zhao Zhang
Jian Wang
Kai Chen
DiffM
60
1
0
13 Jun 2024
SimGen: Simulator-conditioned Driving Scene Generation
SimGen: Simulator-conditioned Driving Scene Generation
Yunsong Zhou
Michael Simon
Zhenghao Peng
Sicheng Mo
Hongzi Zhu
Minyi Guo
Bolei Zhou
VGen
93
17
0
13 Jun 2024
CLIPAway: Harmonizing Focused Embeddings for Removing Objects via
  Diffusion Models
CLIPAway: Harmonizing Focused Embeddings for Removing Objects via Diffusion Models
Yigit Ekin
Ahmet Burak Yildirim
Erdem Çağlar
Aykut Erdem
Erkut Erdem
Aysegül Dündar
DiffM
90
10
0
13 Jun 2024
CMC-Bench: Towards a New Paradigm of Visual Signal Compression
CMC-Bench: Towards a New Paradigm of Visual Signal Compression
Chunyi Li
Xiele Wu
H. Wu
Donghui Feng
Zicheng Zhang
Guo Lu
Xiongkuo Min
Xiaohong Liu
Guangtao Zhai
Weisi Lin
VLM
78
5
0
13 Jun 2024
Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven
  Text-to-Image Generation
Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation
Yufan Zhou
Ruiyi Zhang
Kaizhi Zheng
Nanxuan Zhao
Jiuxiang Gu
Zichao Wang
Xin Eric Wang
Tong Sun
DiffM
50
2
0
13 Jun 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image
  Diffusion Models
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models
Ziyi Wu
Yulia Rubanova
Rishabh Kabra
Drew A. Hudson
Igor Gilitschenski
Yusuf Aytar
Sjoerd van Steenkiste
Kelsey R. Allen
Thomas Kipf
VGenDiffM
107
9
0
13 Jun 2024
MirrorCheck: Efficient Adversarial Defense for Vision-Language Models
MirrorCheck: Efficient Adversarial Defense for Vision-Language Models
Samar Fares
Klea Ziu
Toluwani Aremu
Nikita Durasov
Martin Takáč
Pascal Fua
Karthik Nandakumar
Ivan Laptev
VLMAAML
101
5
0
13 Jun 2024
Complex Image-Generative Diffusion Transformer for Audio Denoising
Complex Image-Generative Diffusion Transformer for Audio Denoising
Junhui Li
Pu Wang
Jialu Li
Youshan Zhang
DiffM
67
1
0
13 Jun 2024
Preserving Identity with Variational Score for General-purpose 3D
  Editing
Preserving Identity with Variational Score for General-purpose 3D Editing
Duong H. Le
Tuan Pham
Aniruddha Kembhavi
Stephan Mandt
Wei-Chiu Ma
Jiasen Lu
84
0
0
13 Jun 2024
COVE: Unleashing the Diffusion Feature Correspondence for Consistent
  Video Editing
COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing
Jiangshan Wang
Yue Ma
Jiayi Guo
Yicheng Xiao
Gao Huang
Xiu Li
DiffM
110
24
0
13 Jun 2024
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
Bing Li
Cheng Zheng
Wenxuan Zhu
Jinjie Mai
Biao Zhang
Peter Wonka
Bernard Ghanem
108
17
0
12 Jun 2024
FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent
  Font Effect Generation
FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation
Xinzhi Mu
Li Chen
Bohan Chen
Shuyang Gu
Jianmin Bao
Dong Chen
Ji Li
Yuhui Yuan
DiffM
56
3
0
12 Jun 2024
Previous
123...282930...606162
Next