ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.05543
  4. Cited By
Adding Conditional Control to Text-to-Image Diffusion Models
v1v2v3 (latest)

Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Title
HandRefiner: Refining Malformed Hands in Generated Images by
  Diffusion-based Conditional Inpainting
HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting
Wenquan Lu
Yufei Xu
Jing Zhang
Chaoyue Wang
Dacheng Tao
DiffM
73
28
0
29 Nov 2023
Rethinking Image Editing Detection in the Era of Generative AI
  Revolution
Rethinking Image Editing Detection in the Era of Generative AI Revolution
Zhihao Sun
Haipeng Fang
Xinying Zhao
Danding Wang
Juan Cao
91
10
0
29 Nov 2023
C3Net: Compound Conditioned ControlNet for Multimodal Content Generation
C3Net: Compound Conditioned ControlNet for Multimodal Content Generation
Juntao Zhang
Yuehuai Liu
Yu-Wing Tai
Chi-Keung Tang
DiffM
76
5
0
29 Nov 2023
Efficient Stitchable Task Adaptation
Efficient Stitchable Task Adaptation
Haoyu He
Zizheng Pan
Jing Liu
Jianfei Cai
Bohan Zhuang
126
3
0
29 Nov 2023
SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion
  Priors
SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors
Dave Zhenyu Chen
Haoxuan Li
Hsin-Ying Lee
Sergey Tulyakov
Matthias Nießner
DiffM
76
29
0
28 Nov 2023
Self-Discovering Interpretable Diffusion Latent Directions for
  Responsible Text-to-Image Generation
Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation
Hang Li
Chengzhi Shen
Philip Torr
Volker Tresp
Jindong Gu
132
37
0
28 Nov 2023
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
Xian Liu
Xiaohang Zhan
Jiaxiang Tang
Ying Shan
Gang Zeng
Dahua Lin
Xihui Liu
Ziwei Liu
3DGS
128
77
0
28 Nov 2023
Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with
  Distilled Semantic Features
Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features
Niladri Shekhar Dutt
Sanjeev Muralikrishnan
Niloy J. Mitra
112
22
0
28 Nov 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
119
50
0
28 Nov 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffMVGen
97
125
0
28 Nov 2023
UGG: Unified Generative Grasping
UGG: Unified Generative Grasping
Jiaxin Lu
Hao Kang
Haoxiang Li
Bo Liu
Yiding Yang
Qixing Huang
Gang Hua
102
25
0
28 Nov 2023
Reason out Your Layout: Evoking the Layout Master from Large Language
  Models for Text-to-Image Synthesis
Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis
Xiaohui Chen
Yongfei Liu
Yingxiang Yang
Jianbo Yuan
Quanzeng You
Liping Liu
Hongxia Yang
DiffM
86
13
0
28 Nov 2023
Panacea: Panoramic and Controllable Video Generation for Autonomous
  Driving
Panacea: Panoramic and Controllable Video Generation for Autonomous Driving
Yuqing Wen
Yucheng Zhao
Yingfei Liu
Fan Jia
Yanhui Wang
Chong Luo
Chi Zhang
Tiancai Wang
Xiaoyan Sun
Xiangyu Zhang
157
72
0
28 Nov 2023
ScribbleGen: Generative Data Augmentation Improves Scribble-supervised
  Semantic Segmentation
ScribbleGen: Generative Data Augmentation Improves Scribble-supervised Semantic Segmentation
Jacob Schnell
Jieke Wang
Lu Qi
Vincent Tao Hu
Meng Tang
DiffM
97
3
0
28 Nov 2023
As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D
  Diffusion Priors
As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors
Seungwoo Yoo
Kunho Kim
Vladimir G. Kim
Minhyuk Sung
DiffM
97
14
0
28 Nov 2023
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for
  Character Animation
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Liucheng Hu
Xin Gao
Peng Zhang
Ke Sun
Bang Zhang
Liefeng Bo
DiffMVGen
139
397
0
28 Nov 2023
CADTalk: An Algorithm and Benchmark for Semantic Commenting of CAD
  Programs
CADTalk: An Algorithm and Benchmark for Semantic Commenting of CAD Programs
Haocheng Yuan
Jing Xu
Hao Pan
Adrien Bousseau
Niloy J. Mitra
Changjian Li
90
10
0
28 Nov 2023
MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video
  Generation
MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation
Jingkuan Song
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
VGen
73
4
0
28 Nov 2023
On the Calibration of Human Pose Estimation
On the Calibration of Human Pose Estimation
Kerui Gu
Rongyu Chen
Angela Yao
180
8
0
28 Nov 2023
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices
Yang Zhao
Yanwu Xu
Zhisheng Xiao
Haolin Jia
Tingbo Hou
VLM
95
13
0
28 Nov 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text
  Rendering
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
128
70
0
28 Nov 2023
Text-Driven Image Editing via Learnable Regions
Text-Driven Image Editing via Learnable Regions
Yuanze Lin
Yi-Wen Chen
Yi-Hsuan Tsai
Lu Jiang
Ming-Hsuan Yang
DiffM
103
20
0
28 Nov 2023
Manifold Preserving Guided Diffusion
Manifold Preserving Guided Diffusion
Yutong He
Naoki Murata
Chieh-Hsin Lai
Yuhta Takida
Toshimitsu Uesaka
...
Wei-Hsiang Liao
Yuki Mitsufuji
J. Zico Kolter
Ruslan Salakhutdinov
Stefano Ermon
DiffM
175
81
0
28 Nov 2023
DreamPropeller: Supercharge Text-to-3D Generation with Parallel Sampling
DreamPropeller: Supercharge Text-to-3D Generation with Parallel Sampling
Linqi Zhou
Andy Shih
Minh-Tuan Tran
Dinh Q. Phung
DiffM
101
14
0
28 Nov 2023
CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts
CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts
Yichao Cai
Yuhang Liu
Zhen Zhang
Javen Qinfeng Shi
CLIPVLM
152
8
0
28 Nov 2023
Diffusion-TTA: Test-time Adaptation of Discriminative Models via
  Generative Feedback
Diffusion-TTA: Test-time Adaptation of Discriminative Models via Generative Feedback
Mihir Prabhudesai
Tsung-Wei Ke
Alexander C. Li
Deepak Pathak
Katerina Fragkiadaki
TTA
84
15
0
27 Nov 2023
Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person
  Images
Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images
Aiyu Cui
Jay Mahajan
Viraj Shah
Preeti Gomathinayagam
Chang Liu
Svetlana Lazebnik
OODDiffM
59
16
0
27 Nov 2023
Self-correcting LLM-controlled Diffusion Models
Self-correcting LLM-controlled Diffusion Models
Tsung-Han Wu
Long Lian
Joseph E. Gonzalez
Boyi Li
Trevor Darrell
127
67
0
27 Nov 2023
MagicAnimate: Temporally Consistent Human Image Animation using
  Diffusion Model
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Zhongcong Xu
Jianfeng Zhang
Jun Hao Liew
Hanshu Yan
Jia-Wei Liu
Chenxu Zhang
Jiashi Feng
Mike Zheng Shou
VGenDiffM
127
205
0
27 Nov 2023
DiffSLVA: Harnessing Diffusion Models for Sign Language Video
  Anonymization
DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization
Zhaoyang Xia
C. Neidle
Dimitris N. Metaxas
DiffM
72
4
0
27 Nov 2023
SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
Rongyuan Wu
Tao Yang
Lingchen Sun
Zhengqiang Zhang
Shuai Li
Lei Zhang
DiffMSupR
100
146
0
27 Nov 2023
CoSeR: Bridging Image and Language for Cognitive Super-Resolution
CoSeR: Bridging Image and Language for Cognitive Super-Resolution
Haoze Sun
Wenbo Li
Jianzhuang Liu
Haoyu Chen
Renjing Pei
X. Zou
Youliang Yan
Yujiu Yang
SupR
152
47
0
27 Nov 2023
Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent
  Synthetic Images
Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent Synthetic Images
Shiu-hong Kao
Xinhang Liu
Yu-Wing Tai
Chi-Keung Tang
100
0
0
27 Nov 2023
Enhancing Perceptual Quality in Video Super-Resolution through
  Temporally-Consistent Detail Synthesis using Diffusion Models
Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models
C. Rota
M. Buzzelli
Joost van de Weijer
DiffM
104
3
0
27 Nov 2023
InterControl: Zero-shot Human Interaction Generation by Controlling
  Every Joint
InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint
Zhenzhi Wang
Jingbo Wang
Yixuan Li
Dahua Lin
Bo Dai
79
2
0
27 Nov 2023
SiTH: Single-view Textured Human Reconstruction with Image-Conditioned
  Diffusion
SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion
Hsuan-I Ho
Mingli Song
Otmar Hilliges
DiffM
83
36
0
27 Nov 2023
Learning Disentangled Identifiers for Action-Customized Text-to-Image
  Generation
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
Siteng Huang
Biao Gong
Yutong Feng
Xi Chen
Yu Fu
Yu Liu
Donglin Wang
DiffM
68
14
0
27 Nov 2023
FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic
  Scene Syntax
FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax
Yu Lu
Linchao Zhu
Hehe Fan
Yi Yang
VGenDiffM
83
13
0
27 Nov 2023
LLMGA: Multimodal Large Language Model based Generation Assistant
LLMGA: Multimodal Large Language Model based Generation Assistant
Bin Xia
Shiyin Wang
Yingfan Tao
Yitong Wang
Jiaya Jia
MLLM
95
12
0
27 Nov 2023
Check, Locate, Rectify: A Training-Free Layout Calibration System for
  Text-to-Image Generation
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
Biao Gong
Siteng Huang
Yutong Feng
Shiwei Zhang
Yuyuan Li
Yu Liu
DiffM
110
13
0
27 Nov 2023
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion
  Schedule Flaws and Enhancing Low-Frequency Controls
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
Minghui Hu
Jianbin Zheng
Chuanxia Zheng
Chaoyue Wang
Dacheng Tao
Tat-Jen Cham
DiffM
99
5
0
27 Nov 2023
Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Chaofeng Chen
Annan Wang
Haoning Wu
Liang Liao
Wenxiu Sun
Qiong Yan
Weisi Lin
65
15
0
27 Nov 2023
Reinforcement Learning from Diffusion Feedback: Q* for Image Search
Reinforcement Learning from Diffusion Feedback: Q* for Image Search
Aboli Rajan Marathe
VLM
93
0
0
27 Nov 2023
ChatTraffic: Text-to-Traffic Generation via Diffusion Model
ChatTraffic: Text-to-Traffic Generation via Diffusion Model
Chengyang Zhang
Yong Zhang
Qitan Shao
Bo Li
Yisheng Lv
Xinglin Piao
Baocai Yin
84
7
0
27 Nov 2023
EucliDreamer: Fast and High-Quality Texturing for 3D Models with Stable
  Diffusion Depth
EucliDreamer: Fast and High-Quality Texturing for 3D Models with Stable Diffusion Depth
Cindy X. Le
Congrui Hetang
Chendi Lin
Ang Cao
Yihui He
68
7
0
27 Nov 2023
ET3D: Efficient Text-to-3D Generation via Multi-View Distillation
ET3D: Efficient Text-to-3D Generation via Multi-View Distillation
Yiming Chen
Zhiqi Li
Peidong Liu
73
6
0
27 Nov 2023
PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI
  Generated Images
PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images
Jiquan Yuan
Xinyan Cao
Changjin Li
Fanyi Yang
Jinlong Lin
Xixin Cao
EGVM
90
18
0
27 Nov 2023
HawkI: Homography & Mutual Information Guidance for 3D-free Single Image
  to Aerial View
HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View
D. Kothandaraman
Dinesh Manocha
Ming C. Lin
Dinesh Manocha
DiffM
78
2
0
27 Nov 2023
FLAIR: A Conditional Diffusion Framework with Applications to Face Video
  Restoration
FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration
Zihao Zou
Jiaming Liu
Shirin Shoushtari
Yubo Wang
Weijie Gan
Ulugbek S. Kamilov
VGenDiffM
85
2
0
26 Nov 2023
Wired Perspectives: Multi-View Wire Art Embraces Generative AI
Wired Perspectives: Multi-View Wire Art Embraces Generative AI
Zhiyu Qu
Lan Yang
Honggang Zhang
Tao Xiang
Kaiyue Pang
Yi-Zhe Song
AI4CE
69
12
0
26 Nov 2023
Previous
123...495051...606162
Next