ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
MegaFusion: Extend Diffusion Models towards Higher-resolution Image
  Generation without Further Tuning
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
Haoning Wu
Shaocheng Shen
Qiang Hu
Xiaoyun Zhang
Ya Zhang
Yanfeng Wang
114
11
0
20 Aug 2024
Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models
Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models
Cong Wan
Yuhang He
Xiang Song
Yihong Gong
DiffMAAML
100
7
0
20 Aug 2024
Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds
Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds
Kai Liu
Kang-Soo You
Pan Gao
DiffM
67
1
0
20 Aug 2024
MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction
  Model
MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model
Minghua Liu
Chong Zeng
Xinyue Wei
Ruoxi Shi
Linghao Chen
...
Zhaoning Wang
Xiaoshuai Zhang
Isabella Liu
Hongzhi Wu
Hao Su
162
26
0
19 Aug 2024
Factorized-Dreamer: Training A High-Quality Video Generator with Limited
  and Low-Quality Data
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data
Tao Yang
Yangming Shi
Yunwen Huang
Feng Chen
Yin Zheng
Lei Zhang
DiffMVGen
87
0
0
19 Aug 2024
SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation
  with Latent Consistency Diffusion Models
SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models
Danush Kumar Venkatesh
Dominik Rivoir
Micha Pfeiffer
Stefanie Speidel
MedIm
123
2
0
19 Aug 2024
Latent Diffusion for Guided Document Table Generation
Latent Diffusion for Guided Document Table Generation
Syed Jawwad Haider Hamdani
S. Saifullah
S. Agne
Andreas Dengel
Sheraz Ahmed
75
0
0
19 Aug 2024
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation
Liu He
Yizhi Song
Hejun Huang
Pinxin Liu
Yunlong Tang
Daniel G. Aliaga
Xin Zhou
DiffMVGen
148
6
0
19 Aug 2024
Combo: Co-speech holistic 3D human motion generation and efficient
  customizable adaptation in harmony
Combo: Co-speech holistic 3D human motion generation and efficient customizable adaptation in harmony
Chao Xu
Mingze Sun
Zhi-Qi Cheng
Fei Wang
Yang Liu
Baigui Sun
Ruqi Huang
Alexander G. Hauptmann
VGen
97
3
0
18 Aug 2024
RepControlNet: ControlNet Reparameterization
RepControlNet: ControlNet Reparameterization
Zhaoli Deng
Kaibin Zhou
Fanyi Wang
Zhenpeng Mi
DiffM
74
3
0
17 Aug 2024
Thin-Plate Spline-based Interpolation for Animation Line Inbetweening
Thin-Plate Spline-based Interpolation for Animation Line Inbetweening
Tianyi Zhu
Wei Shang
Dongwei Ren
W. Zuo
97
3
0
17 Aug 2024
Barbie: Text to Barbie-Style 3D Avatars
Barbie: Text to Barbie-Style 3D Avatars
Xiaokun Sun
Zhenyu Zhang
Ying Tai
Qian Wang
Hao Tang
Zili Yi
Jian Yang
LM&Ro
118
2
0
17 Aug 2024
MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and
  3D Editing
MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing
Chenjie Cao
Chaohui Yu
Yanwei Fu
Fan Wang
Xiangyang Xue
VGen
100
10
0
15 Aug 2024
DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion
  Consistency
DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency
Xiaojing Zhong
Xinyi Huang
Xiaofeng Yang
Guosheng Lin
Qingyao Wu
DiffM
77
4
0
14 Aug 2024
Connecting Dreams with Visual Brainstorming Instruction
Connecting Dreams with Visual Brainstorming Instruction
Yasheng Sun
Bohan Li
Mingchen Zhuge
Deng-Ping Fan
Salman Khan
Fahad Shahbaz Khan
Hideki Koike
DiffM
66
0
0
14 Aug 2024
Dual-Domain CLIP-Assisted Residual Optimization Perception Model for
  Metal Artifact Reduction
Dual-Domain CLIP-Assisted Residual Optimization Perception Model for Metal Artifact Reduction
Xinrui Zhang
Ailong Cai
Shaoyu Wang
Linyuan Wang
Zhizhong Zheng
Lei Li
Bin Yan
MedIm
76
0
0
14 Aug 2024
GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models
GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models
Lei Kang
Fei Yang
Kai Wang
Mohamed Ali Souibgui
Lluís Gómez
Alicia Fornés
Ernest Valveny
Dimosthenis Karatzas
DiffM
78
0
0
14 Aug 2024
Controlling the World by Sleight of Hand
Controlling the World by Sleight of Hand
Sruthi Sudhakar
Ruoshi Liu
Basile Van Hoorick
Carl Vondrick
Richard Zemel
98
4
0
13 Aug 2024
ViMo: Generating Motions from Casual Videos
ViMo: Generating Motions from Casual Videos
Liangdong Qiu
Chengxing Yu
Yanran Li
Zhao Wang
Haibin Huang
Chongyang Ma
Di Zhang
Pengfei Wan
Xiaoguang Han
VGen
123
2
0
13 Aug 2024
3D Reconstruction of Protein Structures from Multi-view AFM Images using
  Neural Radiance Fields (NeRFs)
3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs)
Jaydeep Rade
Ethan Herron
Soumik Sarkar
Anwesha Sarkar
A. Krishnamurthy
AI4CE
70
0
0
12 Aug 2024
UniPortrait: A Unified Framework for Identity-Preserving Single- and
  Multi-Human Image Personalization
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization
Junjie He
Yifeng Geng
Liefeng Bo
DiffM
117
23
0
12 Aug 2024
Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
Utkarsh Nath
Rajeev Goel
Eun Som Jeon
Changhoon Kim
Kyle Min
Yezhou Yang
Yingzhen Yang
Pavan Turaga
167
1
0
12 Aug 2024
Egocentric Vision Language Planning
Egocentric Vision Language Planning
Zhirui Fang
Ming Yang
Weishuai Zeng
Boyu Li
Junpeng Yue
Ziluo Ding
Xiu Li
Zongqing Lu
LM&Ro
69
1
0
11 Aug 2024
LaWa: Using Latent Space for In-Generation Image Watermarking
LaWa: Using Latent Space for In-Generation Image Watermarking
Ahmad Rezaei
Mohammad Akbari
Saeed Ranjbar Alvar
Arezou Fatemi
Yong Zhang
WIGM
100
17
0
11 Aug 2024
High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based
  Diffusion Model
High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model
Weizhi Zhong
Junfan Lin
Peixin Chen
Liang Lin
Guanbin Li
69
1
0
10 Aug 2024
TEAdapter: Supply abundant guidance for controllable text-to-music
  generation
TEAdapter: Supply abundant guidance for controllable text-to-music generation
Jialing Zou
Jiahao Mei
Xudong Nan
Jinghua Li
Daoguo Dong
Liang He
60
0
0
09 Aug 2024
Puppet-Master: Scaling Interactive Video Generation as a Motion Prior
  for Part-Level Dynamics
Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics
Ruining Li
Chuanxia Zheng
Christian Rupprecht
Andrea Vedaldi
DiffMVGen
110
10
0
08 Aug 2024
Semantic Communication based on Large Language Model for Underwater
  Image Transmission
Semantic Communication based on Large Language Model for Underwater Image Transmission
Weilong Chen
Wenxuan Xu
Haoran Chen
Xinran Zhang
Zhijin Qin
Yanru Zhang
Zhu Han
69
5
0
08 Aug 2024
Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from
  User's Casual Sketches
Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches
Yongzhi Xu
Yonhon Ng
Yifu Wang
Inkyu Sa
Yunfei Duan
Yang Li
Pan Ji
Hongdong Li
VGen3DV
89
7
0
08 Aug 2024
Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and
  Text Guidance
Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance
Ahmad Arrabi
Xiaohan Zhang
Waqas Sultani
Chong Chen
S. Wshah
DiffM
79
5
0
08 Aug 2024
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning
  using Instruct Prompts
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts
Ciara Rowles
Shimon Vainer
Dante De Nigris
Slava Elizarov
Konstantin Kutsy
Simon Donné
DiffM
92
10
0
06 Aug 2024
Diverse Generation while Maintaining Semantic Coordination: A
  Diffusion-Based Data Augmentation Method for Object Detection
Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection
Sen Nie
Zhuo Wang
Xinxin Wang
Kun He
DiffM
156
0
0
06 Aug 2024
Fairness and Bias Mitigation in Computer Vision: A Survey
Fairness and Bias Mitigation in Computer Vision: A Survey
Sepehr Dehdashtian
Ruozhen He
Yi Li
Guha Balakrishnan
Nuno Vasconcelos
Vicente Ordonez
Vishnu Boddeti
141
5
0
05 Aug 2024
RCDM: Enabling Robustness for Conditional Diffusion Model
RCDM: Enabling Robustness for Conditional Diffusion Model
Weifeng Xu
Xiang Zhu
Xiaoyong Li
AAML
75
0
0
05 Aug 2024
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language
  Models
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models
Agneet Chatterjee
Yiran Luo
Tejas Gokhale
Yezhou Yang
Chitta Baral
LRM
101
5
0
05 Aug 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu
Shitian Zhao
Le Zhuo
Weifeng Lin
Ping Luo
Xinyue Li
Qi Qin
Yu Qiao
Hongsheng Li
Peng Gao
MLLM
168
59
0
05 Aug 2024
PanoFree: Tuning-Free Holistic Multi-view Image Generation with
  Cross-view Self-Guidance
PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance
Aoming Liu
Zhong Li
Zhang Chen
Nannan Li
Yinghao Xu
Bryan A. Plummer
91
7
0
04 Aug 2024
Stimulating Imagination: Towards General-purpose Object Rearrangement
Stimulating Imagination: Towards General-purpose Object Rearrangement
Jianyang Wu
Jie Gu
Xiaokang Ma
Chu Tang
Jingmin Chen
DiffMLM&RoOCL
57
0
0
03 Aug 2024
Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis
Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis
Junyan Ye
Jun He
Weijia Li
Zhutao Lv
Yi Lin
Haote Yang
Haote Yang
Conghui He
95
0
0
03 Aug 2024
Conditional LoRA Parameter Generation
Conditional LoRA Parameter Generation
Aaron Mueller
Millicent Li
Koyena Pal
Wangbo Zhao
Yukun Zhou
Jiuding Sun
Yonatan Belinkov
DiffM
91
6
0
02 Aug 2024
TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and
  Resampling
TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling
Dong Huo
Zixin Guo
Wei Ji
Zhihao Shi
Juwei Lu
Peng Dai
Songcen Xu
Li Cheng
Yee-Hong Yang
DiffM
136
13
0
02 Aug 2024
CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset
  Augmentation using Diffusion Models
CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models
Kushal Kumar Jain
Steven A. Grosz
A. Namboodiri
Anil K. Jain
DiffM
79
2
0
02 Aug 2024
VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling
VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling
Qian Zhang
Xiangzi Dai
Ninghua Yang
Xiang An
Ziyong Feng
Xingyu Ren
VLMCLIP
122
22
0
02 Aug 2024
Contribution-based Low-Rank Adaptation with Pre-training Model for Real
  Image Restoration
Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration
Donwon Park
Leixian Shen
Se Young Chun
96
2
0
02 Aug 2024
FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features
  for Highly Controllable Text-Driven Image Translation
FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation
Xiang Gao
Jiaying Liu
118
2
0
02 Aug 2024
Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy
  Curvature of Attention
Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention
Mengkang Hu
DiffM
115
10
0
01 Aug 2024
Towards Reliable Advertising Image Generation Using Human Feedback
Towards Reliable Advertising Image Generation Using Human Feedback
Thorben Werner
Wei Feng
Haohan Wang
Yaoyu Li
Jingsen Wang
...
Maximilian Stubbemann
Junsheng Jin
Lars Schmidt-Thieme
Zhangang Lin
Jingping Shao
129
3
0
01 Aug 2024
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous
  Driving
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
Xuemeng Yang
Licheng Wen
Yukai Ma
Jianbiao Mei
Xin Li
...
Min Dou
Botian Shi
Liang He
Yong-Jin Liu
Yu Qiao
VGen
107
25
0
01 Aug 2024
Few-shot Defect Image Generation based on Consistency Modeling
Few-shot Defect Image Generation based on Consistency Modeling
Qingfeng Shi
Jing Wei
Fei Shen
Zheng Zhang
72
2
0
01 Aug 2024
Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for
  Studying Species Evolution
Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution
Yuanqing Wang
Arka Daw
M. Maruf
Josef C. Uyeda
Wasila Dahdul
...
James P. Balhoff
Kyunghyun Cho
Charles V. Stewart
Tanya Berger-Wolf
Anuj Karpatne
AI4CE
69
2
0
31 Jul 2024
Previous
123...242526...606162
Next