Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
Prompting for Discovery: Flexible Sense-Making for AI Art-Making with Dreamsheets
Shm Garanganao
J.D. Zamfirescu-Pereira
Kyu Won Kim
Mani Rathnam
Bjoern Hartmann
67
31
0
15 Oct 2023
ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context
Binglun Wang
Niladri Shekhar Dutt
Niloy J. Mitra
88
11
0
15 Oct 2023
Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models
Zijian Zhang
Luping Liu
Zhijie Lin
Yichen Zhu
Zhou Zhao
DiffM
68
6
0
15 Oct 2023
GPT-Prompt Controlled Diffusion for Weakly-Supervised Semantic Segmentation
Wangyu Wu
Tianhong Dai
Xiaowei Huang
Fei Ma
Jimin Xiao
DiffM
94
1
0
15 Oct 2023
LOVECon: Text-driven Training-Free Long Video Editing with ControlNet
Zhenyi Liao
Zhijie Deng
DiffM
59
7
0
15 Oct 2023
Integrating Symbolic Reasoning into Neural Generative Models for Design Generation
Maxwell J. Jacobson
Yexiang Xue
NAI
69
3
0
13 Oct 2023
CopyScope: Model-level Copyright Infringement Quantification in the Diffusion Workflow
Junlei Zhou
Jiashi Gao
Ziwei Wang
Xuetao Wei
47
2
0
13 Oct 2023
R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image Generation
Jiayu Xiao
Henglei Lv
Liang Li
Shuhui Wang
Qingming Huang
DiffM
116
23
0
13 Oct 2023
OmniControl: Control Any Joint at Any Time for Human Motion Generation
Yiming Xie
Varun Jampani
Lei Zhong
Deqing Sun
Huaizu Jiang
DiffM
79
120
0
12 Oct 2023
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
Xian Liu
Jian Ren
Aliaksandr Siarohin
Ivan Skorokhodov
Yanyu Li
Dahua Lin
Xihui Liu
Ziwei Liu
Sergey Tulyakov
86
61
0
12 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM
MLLM
DiffM
35
25
0
12 Oct 2023
MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao
Yuchao Gu
Jay Zhangjie Wu
David Junhao Zhang
Jia-Wei Liu
Weijia Wu
Jussi Keppo
Mike Zheng Shou
DiffM
VGen
110
118
0
12 Oct 2023
Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting
Zijie Chen
Lichao Zhang
Fangsheng Weng
Lili Pan
Zhenzhong Lan
90
10
0
12 Oct 2023
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model
Xiaofan Li
Yifu Zhang
Xiaoqing Ye
VGen
125
78
0
11 Oct 2023
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Jie An
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Kevin Qinghong Lin
Zicheng Liu
Lijuan Wang
Jiebo Luo
75
10
0
11 Oct 2023
ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation
Bo Peng
Xinyuan Chen
Yaohui Wang
Chaochao Lu
Yu Qiao
DiffM
VGen
46
7
0
11 Oct 2023
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
Zeqiang Lai
Xizhou Zhu
Jifeng Dai
Yu Qiao
Wenhai Wang
MLLM
DiffM
105
24
0
11 Oct 2023
GraphControl: Adding Conditional Control to Universal Graph Pre-trained Models for Graph Domain Transfer Learning
Yun Zhu
Yaoke Wang
Haizhou Shi
Zhenshuo Zhang
Dian Jiao
Siliang Tang
AI4CE
122
28
0
11 Oct 2023
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
130
111
0
11 Oct 2023
HiFi-123: Towards High-fidelity One Image to 3D Content Generation
Wangbo Yu
Li-ming Yuan
Yan-Pei Cao
Xiangjun Gao
Xiaoyu Li
Wenbo Hu
Long Quan
Ying Shan
Yonghong Tian
DiffM
96
31
0
10 Oct 2023
JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling
Jingyang Zhang
Shiwei Li
Yuanxun Lu
Tian Fang
David McKinnon
Yanghai Tsin
Long Quan
Yao Yao
73
11
0
10 Oct 2023
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
Fei Shen
Hu Ye
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
DiffM
138
64
0
10 Oct 2023
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Yuren Cong
Mengmeng Xu
Christian Simon
Shoufa Chen
Jiawei Ren
Yanping Xie
Juan-Manuel Perez-Rua
Bodo Rosenhahn
Tao Xiang
Sen He
DiffM
VGen
122
87
0
09 Oct 2023
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
Bo-Wen Zeng
Shanglin Li
Yutang Feng
Ling Yang
Hong Li
...
Conghui He
Wentao Zhang
Jianzhuang Liu
Baochang Zhang
Shuicheng Yan
DiffM
97
2
0
09 Oct 2023
EasyPhoto: Your Smart AI Photo Generator
Ziheng Wu
Jiaqi Xu
Xinyi Zou
Kunzhe Huang
Xing Shi
Jun Huang
34
4
0
07 Oct 2023
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Simian Luo
Yiqin Tan
Longbo Huang
Jian Li
Hang Zhao
DiffM
117
479
0
06 Oct 2023
MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT Images
Yanwu Xu
Li Sun
Wei Peng
Shyam Visweswaran
Kayhan Batmanghelich
MedIm
DiffM
111
23
0
05 Oct 2023
FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators
Haiping Wang
Yuan Liu
Bing Wang
Yujing Sun
Zhenchao Dong
Wenping Wang
Bisheng Yang
DiffM
82
12
0
05 Oct 2023
Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
Chuan Fang
Yuan Dong
Kunming Luo
Xiaotao Hu
Rakesh Shrestha
Ping Tan
DiffM
152
37
0
05 Oct 2023
Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day
Yi Ding
Hao Tang
Jen-Hao Rick Chang
Liangchen Song
Zhangyang Wang
Liangliang Cao
DiffM
106
11
0
04 Oct 2023
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Xichen Pan
Li Dong
Shaohan Huang
Zhiliang Peng
Wenhu Chen
Furu Wei
VLM
152
68
0
04 Oct 2023
Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts
Shiyi Du
Xiaosong Wang
Yongyi Lu
Yuyin Zhou
Shaoting Zhang
Alan Yuille
Kang Li
Zongwei Zhou
MedIm
DiffM
39
11
0
04 Oct 2023
MagicDrive: Street View Generation with Diverse 3D Geometry Control
Ruiyuan Gao
Kai Chen
Enze Xie
Lanqing Hong
Zhenguo Li
Dit-Yan Yeung
Qiang Xu
DiffM
92
122
0
04 Oct 2023
DREAM: Visual Decoding from Reversing Human Visual System
Weihao Xia
Raoul de Charette
Cengiz Öztireli
Jing-Hao Xue
78
38
0
03 Oct 2023
AI-Generated Images as Data Source: The Dawn of Synthetic Era
Zuhao Yang
Fangneng Zhan
Kunhao Liu
Muyu Xu
Shijian Lu
EGVM
107
20
0
03 Oct 2023
Transcending Domains through Text-to-Image Diffusion: A Source-Free Approach to Domain Adaptation
Shivang Chopra
Suraj Kothawade
Houda Aynaou
Aman Chadha
DiffM
67
0
0
02 Oct 2023
ImagenHub: Standardizing the evaluation of conditional image generation models
Max Ku
Tianle Li
Kai Zhang
Yujie Lu
Xingyu Fu
Wenwen Zhuang
Wenhu Chen
EGVM
132
48
0
02 Oct 2023
CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation
Kangfu Mei
M. Delbracio
Hossein Talebi
Zhengzhong Tu
Vishal M. Patel
P. Milanfar
VLM
DiffM
97
15
0
02 Oct 2023
HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation
Xin Huang
Ruizhi Shao
Qi Zhang
Hongwen Zhang
Yingfa Feng
Yebin Liu
Qing Wang
DiffM
62
73
0
02 Oct 2023
GenQuery: Supporting Expressive Visual Search with Generative Models
Kihoon Son
DaEun Choi
Tae Soo Kim
Young‐Ho Kim
Juho Kim
67
28
0
02 Oct 2023
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models
Hyeonho Jeong
Jong Chul Ye
DiffM
VGen
95
43
0
02 Oct 2023
Controlling Vision-Language Models for Multi-Task Image Restoration
Ziwei Luo
Fredrik K. Gustafsson
Zheng Zhao
Jens Sjölund
Thomas B. Schon
VLM
146
41
0
02 Oct 2023
Completing Visual Objects via Bridging Generation and Segmentation
Xiang Li
Yinpeng Chen
Chung-Ching Lin
Hao Chen
Kai Hu
Rita Singh
Bhiksha Raj
Lijuan Wang
Zicheng Liu
DiffM
106
3
0
01 Oct 2023
PixArt-
α
α
α
: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen
Jincheng Yu
Chongjian Ge
Lewei Yao
Enze Xie
...
Zhongdao Wang
James T. Kwok
Ping Luo
Huchuan Lu
Zhenguo Li
DiffM
117
461
0
30 Sep 2023
AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ
Jonas Belouadi
Anne Lauscher
Steffen Eger
75
31
0
30 Sep 2023
Decoding Realistic Images from Brain Activity with Contrastive Self-supervision and Latent Diffusion
Jingyuan Sun
Mingxiao Li
Marie-Francine Moens
DiffM
102
7
0
30 Sep 2023
AdaptNet: Policy Adaptation for Physics-Based Character Control
Pei Xu
Kaixiang Xie
Sheldon Andrews
P. Kry
Michael Neff
Morgan McGuire
Ioannis Karamouzas
Victor Zordan
TTA
119
19
0
30 Sep 2023
LLM-grounded Video Diffusion Models
Long Lian
Baifeng Shi
Semih Yavuz
Ye Liu
Boyi Li
DiffM
103
55
0
29 Sep 2023
SatDM: Synthesizing Realistic Satellite Image with Semantic Layout Conditioning using Diffusion Models
Ethan Brewer
Hamid Askarov
Imran Ibrahimli
Ismat Bakhishov
N. Nabiyev
DiffM
99
5
0
28 Sep 2023
RealFill: Reference-Driven Generation for Authentic Image Completion
Luming Tang
Nataniel Ruiz
Qinghao Chu
Yuanzhen Li
Aleksander Holynski
...
Bharath Hariharan
Yael Pritch
Neal Wadhwa
Kfir Aberman
Michael Rubinstein
DiffM
89
45
0
28 Sep 2023
Previous
1
2
3
...
52
53
54
...
60
61
62
Next