Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting
Wenquan Lu
Yufei Xu
Jing Zhang
Chaoyue Wang
Dacheng Tao
DiffM
73
28
0
29 Nov 2023
Rethinking Image Editing Detection in the Era of Generative AI Revolution
Zhihao Sun
Haipeng Fang
Xinying Zhao
Danding Wang
Juan Cao
91
10
0
29 Nov 2023
C3Net: Compound Conditioned ControlNet for Multimodal Content Generation
Juntao Zhang
Yuehuai Liu
Yu-Wing Tai
Chi-Keung Tang
DiffM
76
5
0
29 Nov 2023
Efficient Stitchable Task Adaptation
Haoyu He
Zizheng Pan
Jing Liu
Jianfei Cai
Bohan Zhuang
126
3
0
29 Nov 2023
SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors
Dave Zhenyu Chen
Haoxuan Li
Hsin-Ying Lee
Sergey Tulyakov
Matthias Nießner
DiffM
76
29
0
28 Nov 2023
Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation
Hang Li
Chengzhi Shen
Philip Torr
Volker Tresp
Jindong Gu
132
37
0
28 Nov 2023
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
Xian Liu
Xiaohang Zhan
Jiaxiang Tang
Ying Shan
Gang Zeng
Dahua Lin
Xihui Liu
Ziwei Liu
3DGS
128
77
0
28 Nov 2023
Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features
Niladri Shekhar Dutt
Sanjeev Muralikrishnan
Niloy J. Mitra
112
22
0
28 Nov 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
119
50
0
28 Nov 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffM
VGen
97
125
0
28 Nov 2023
UGG: Unified Generative Grasping
Jiaxin Lu
Hao Kang
Haoxiang Li
Bo Liu
Yiding Yang
Qixing Huang
Gang Hua
102
25
0
28 Nov 2023
Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis
Xiaohui Chen
Yongfei Liu
Yingxiang Yang
Jianbo Yuan
Quanzeng You
Liping Liu
Hongxia Yang
DiffM
86
13
0
28 Nov 2023
Panacea: Panoramic and Controllable Video Generation for Autonomous Driving
Yuqing Wen
Yucheng Zhao
Yingfei Liu
Fan Jia
Yanhui Wang
Chong Luo
Chi Zhang
Tiancai Wang
Xiaoyan Sun
Xiangyu Zhang
157
72
0
28 Nov 2023
ScribbleGen: Generative Data Augmentation Improves Scribble-supervised Semantic Segmentation
Jacob Schnell
Jieke Wang
Lu Qi
Vincent Tao Hu
Meng Tang
DiffM
95
3
0
28 Nov 2023
As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors
Seungwoo Yoo
Kunho Kim
Vladimir G. Kim
Minhyuk Sung
DiffM
97
14
0
28 Nov 2023
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Liucheng Hu
Xin Gao
Peng Zhang
Ke Sun
Bang Zhang
Liefeng Bo
DiffM
VGen
139
397
0
28 Nov 2023
CADTalk: An Algorithm and Benchmark for Semantic Commenting of CAD Programs
Haocheng Yuan
Jing Xu
Hao Pan
Adrien Bousseau
Niloy J. Mitra
Changjian Li
90
10
0
28 Nov 2023
MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation
Jingkuan Song
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
VGen
73
4
0
28 Nov 2023
On the Calibration of Human Pose Estimation
Kerui Gu
Rongyu Chen
Angela Yao
180
8
0
28 Nov 2023
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices
Yang Zhao
Yanwu Xu
Zhisheng Xiao
Haolin Jia
Tingbo Hou
VLM
95
13
0
28 Nov 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
128
70
0
28 Nov 2023
Text-Driven Image Editing via Learnable Regions
Yuanze Lin
Yi-Wen Chen
Yi-Hsuan Tsai
Lu Jiang
Ming-Hsuan Yang
DiffM
103
20
0
28 Nov 2023
Manifold Preserving Guided Diffusion
Yutong He
Naoki Murata
Chieh-Hsin Lai
Yuhta Takida
Toshimitsu Uesaka
...
Wei-Hsiang Liao
Yuki Mitsufuji
J. Zico Kolter
Ruslan Salakhutdinov
Stefano Ermon
DiffM
175
81
0
28 Nov 2023
DreamPropeller: Supercharge Text-to-3D Generation with Parallel Sampling
Linqi Zhou
Andy Shih
Minh-Tuan Tran
Dinh Q. Phung
DiffM
101
14
0
28 Nov 2023
CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts
Yichao Cai
Yuhang Liu
Zhen Zhang
Javen Qinfeng Shi
CLIP
VLM
152
8
0
28 Nov 2023
Diffusion-TTA: Test-time Adaptation of Discriminative Models via Generative Feedback
Mihir Prabhudesai
Tsung-Wei Ke
Alexander C. Li
Deepak Pathak
Katerina Fragkiadaki
TTA
84
15
0
27 Nov 2023
Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images
Aiyu Cui
Jay Mahajan
Viraj Shah
Preeti Gomathinayagam
Chang Liu
Svetlana Lazebnik
OOD
DiffM
59
16
0
27 Nov 2023
Self-correcting LLM-controlled Diffusion Models
Tsung-Han Wu
Long Lian
Joseph E. Gonzalez
Boyi Li
Trevor Darrell
127
67
0
27 Nov 2023
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Zhongcong Xu
Jianfeng Zhang
Jun Hao Liew
Hanshu Yan
Jia-Wei Liu
Chenxu Zhang
Jiashi Feng
Mike Zheng Shou
VGen
DiffM
127
205
0
27 Nov 2023
DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization
Zhaoyang Xia
C. Neidle
Dimitris N. Metaxas
DiffM
72
4
0
27 Nov 2023
SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
Rongyuan Wu
Tao Yang
Lingchen Sun
Zhengqiang Zhang
Shuai Li
Lei Zhang
DiffM
SupR
100
146
0
27 Nov 2023
CoSeR: Bridging Image and Language for Cognitive Super-Resolution
Haoze Sun
Wenbo Li
Jianzhuang Liu
Haoyu Chen
Renjing Pei
X. Zou
Youliang Yan
Yujiu Yang
SupR
152
47
0
27 Nov 2023
Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent Synthetic Images
Shiu-hong Kao
Xinhang Liu
Yu-Wing Tai
Chi-Keung Tang
100
0
0
27 Nov 2023
Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models
C. Rota
M. Buzzelli
Joost van de Weijer
DiffM
104
3
0
27 Nov 2023
InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint
Zhenzhi Wang
Jingbo Wang
Yixuan Li
Dahua Lin
Bo Dai
79
2
0
27 Nov 2023
SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion
Hsuan-I Ho
Mingli Song
Otmar Hilliges
DiffM
83
36
0
27 Nov 2023
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
Siteng Huang
Biao Gong
Yutong Feng
Xi Chen
Yu Fu
Yu Liu
Donglin Wang
DiffM
68
14
0
27 Nov 2023
FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax
Yu Lu
Linchao Zhu
Hehe Fan
Yi Yang
VGen
DiffM
83
13
0
27 Nov 2023
LLMGA: Multimodal Large Language Model based Generation Assistant
Bin Xia
Shiyin Wang
Yingfan Tao
Yitong Wang
Jiaya Jia
MLLM
95
12
0
27 Nov 2023
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
Biao Gong
Siteng Huang
Yutong Feng
Shiwei Zhang
Yuyuan Li
Yu Liu
DiffM
110
13
0
27 Nov 2023
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
Minghui Hu
Jianbin Zheng
Chuanxia Zheng
Chaoyue Wang
Dacheng Tao
Tat-Jen Cham
DiffM
99
5
0
27 Nov 2023
Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Chaofeng Chen
Annan Wang
Haoning Wu
Liang Liao
Wenxiu Sun
Qiong Yan
Weisi Lin
65
15
0
27 Nov 2023
Reinforcement Learning from Diffusion Feedback: Q* for Image Search
Aboli Rajan Marathe
VLM
93
0
0
27 Nov 2023
ChatTraffic: Text-to-Traffic Generation via Diffusion Model
Chengyang Zhang
Yong Zhang
Qitan Shao
Bo Li
Yisheng Lv
Xinglin Piao
Baocai Yin
84
7
0
27 Nov 2023
EucliDreamer: Fast and High-Quality Texturing for 3D Models with Stable Diffusion Depth
Cindy X. Le
Congrui Hetang
Chendi Lin
Ang Cao
Yihui He
68
7
0
27 Nov 2023
ET3D: Efficient Text-to-3D Generation via Multi-View Distillation
Yiming Chen
Zhiqi Li
Peidong Liu
73
6
0
27 Nov 2023
PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images
Jiquan Yuan
Xinyan Cao
Changjin Li
Fanyi Yang
Jinlong Lin
Xixin Cao
EGVM
90
18
0
27 Nov 2023
HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View
D. Kothandaraman
Dinesh Manocha
Ming C. Lin
Dinesh Manocha
DiffM
78
2
0
27 Nov 2023
FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration
Zihao Zou
Jiaming Liu
Shirin Shoushtari
Yubo Wang
Weijie Gan
Ulugbek S. Kamilov
VGen
DiffM
85
2
0
26 Nov 2023
Wired Perspectives: Multi-View Wire Art Embraces Generative AI
Zhiyu Qu
Lan Yang
Honggang Zhang
Tao Xiang
Kaiyue Pang
Yi-Zhe Song
AI4CE
69
12
0
26 Nov 2023
Previous
1
2
3
...
49
50
51
...
60
61
62
Next