Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
Deep Geometrized Cartoon Line Inbetweening
Lian Siyao
Tianpei Gu
Weiye Xiao
Henghui Ding
Ziwei Liu
Chen Change Loy
61
12
0
28 Sep 2023
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing
Jiarui Yao
Yifan Liu
Simon S. Du
Shifeng Chen
DiffM
64
24
0
28 Sep 2023
CCEdit: Creative and Controllable Video Editing via Diffusion Models
Danfeng Hong
Wenming Weng
Hao Li
Yuhui Yuan
Jing Yao
Chong Luo
Zhibo Chen
Baining Guo
DiffM VGen
93
49
0
28 Sep 2023
Cloth2Body: Generating 3D Human Body Mesh from 2D Clothing
Lu Dai
Liqian Ma
Shenhan Qian
Hao Liu
Ziwei Liu
Hui Xiong
3DH
94
4
0
28 Sep 2023
Compositional Sculpting of Iterative Generative Processes
Yixuan Wang
Sebastiaan De Peuter
Mingtong Zhang
Vikas Garg
Samuel Kaski
Tommi Jaakkola
DiffM
121
15
0
28 Sep 2023
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
Xiaoliang Dai
Ji Hou
Chih-Yao Ma
Sam S. Tsai
Jialiang Wang
...
Roshan Sumbaly
Vignesh Ramanathan
Zijian He
Peter Vajda
Devi Parikh
VLM
91
216
0
27 Sep 2023
P2I-NET: Mapping Camera Pose to Image via Adversarial Learning for New View Synthesis in Real Indoor Environments
Xujie Kang
Kanglin Liu
Jiang Duan
Yuanhao Gong
Guoping Qiu
GAN
53
3
0
27 Sep 2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
Yaohui Wang
Xinyuan Chen
Xin Ma
Shangchen Zhou
Ziqi Huang
...
Chen Change Loy
Bo Dai
Dahua Lin
Yu Qiao
Ziwei Liu
VGen DiffM
112
231
0
26 Sep 2023
Directional Texture Editing for 3D Models
Shengqi Liu
Zhuo Chen
Jin Gao
Yichao Yan
Wenhan Zhu
Jia-Ming Lyu
Xiaokang Yang
DiffM
96
0
0
26 Sep 2023
Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator
Hanzhuo Huang
Yufan Feng
Cheng Shi
Lan Xu
Jingyi Yu
Sibei Yang
DiffM VGen
97
66
0
25 Sep 2023
Chop & Learn: Recognizing and Generating Object-State Compositions
Nirat Saini
Hanyu Wang
Archana Swaminathan
Vinoj Jayasundara
Bo He
Kamal Gupta
Abhinav Shrivastava
CoGe
81
12
0
25 Sep 2023
Innovative Digital Storytelling with AIGC: Exploration and Discussion of Recent Advances
Rongzhang Gu
Hui Li
Chang Su
Wenyan Wu
49
6
0
25 Sep 2023
ID.8: Co-Creating Visual Stories with Generative AI
Victor Nikhil Antony
Chien-Ming Huang
107
27
0
25 Sep 2023
Multiple Noises in Diffusion Model for Semi-Supervised Multi-Domain Translation
Tsiry Mayet
Simon Bernard
Clément Chatelain
Romain Hérault
DiffM
89
0
0
25 Sep 2023
Diverse Semantic Image Editing with Style Codes
Hakan Sivuk
Aysegül Dündar
47
0
0
25 Sep 2023
Identifying Systematic Errors in Object Detectors with the SCROD Pipeline
Valentyn Boreiko
Matthias Hein
J. H. Metzen
71
6
0
23 Sep 2023
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER
Mingzhen Sun
Weining Wang
Zihan Qin
Jiahui Sun
Si-Qing Chen
Qingbin Liu
DiffM
59
3
0
23 Sep 2023
AntiBARTy Diffusion for Property Guided Antibody Design
Jordan Venderley
DiffM
47
1
0
22 Sep 2023
Diffusion Augmentation for Sequential Recommendation
Qidong Liu
Fan Yan
Xiangyu Zhao
Zhaochen Du
Huifeng Guo
Ruiming Tang
Feng Tian
78
42
0
22 Sep 2023
LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
Fernanda De La Torre
Cathy Mengying Fang
Han Huang
Andrzej Banburski-Fahey
Judith Amores Fernandez
Jaron Lanier
154
50
0
21 Sep 2023
DreamLLM: Synergistic Multimodal Comprehension and Creation
Runpei Dong
Chunrui Han
Yuang Peng
Zekun Qi
Zheng Ge
...
Hao-Ran Wei
Xiangwen Kong
Xiangyu Zhang
Kaisheng Ma
Li Yi
MLLM
109
199
0
20 Sep 2023
FreeU: Free Lunch in Diffusion U-Net
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
116
147
0
20 Sep 2023
Kosmos-2.5: A Multimodal Literate Model
Tengchao Lv
Yupan Huang
Jingye Chen
Lei Cui
Shuming Ma
...
Weiyao Luo
Shaoxiang Wu
Guoxin Wang
Cha Zhang
Furu Wei
VLM MLLM
114
66
0
20 Sep 2023
Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates
Kashun Shum
Jaeyeon Kim
Binh-Son Hua
Duc Thanh Nguyen
Sai-Kit Yeung
3DH AI4CE
74
8
0
20 Sep 2023
Forgedit: Text Guided Image Editing via Learning and Forgetting
Shiwen Zhang
Shuai Xiao
Weilin Huang
DiffM
76
21
0
19 Sep 2023
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Xiaofeng Wang
Zheng Hua Zhu
Guan Huang
Xinze Chen
Jiagang Zhu
Jiwen Lu
VGen
116
167
0
18 Sep 2023
LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation
Yihao Zhi
Xiaodong Cun
Xuelin Chen
Xi Shen
Wen Guo
Shaoli Huang
Shenghua Gao
68
28
0
17 Sep 2023
Viewpoint Textual Inversion: Unleashing Novel View Synthesis with Pretrained 2D Diffusion Models
James Burgess
Kuan-Chieh Wang
Serena Yeung-Levy
DiffM
81
6
0
14 Sep 2023
Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context
Haochong Xia
Shuo Sun
Xinrun Wang
Bo An
AIFin
98
9
0
14 Sep 2023
Text-to-Image Models for Counterfactual Explanations: a Black-Box Approach
Guillaume Jeanneret
Loïc Simon
Frédéric Jurie
DiffM
95
13
0
14 Sep 2023
DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks
Zipeng Qi
Xulong Zhang
Ning Cheng
Jing Xiao
Jianzong Wang
96
7
0
14 Sep 2023
Mitigate Replication and Copying in Diffusion Models with Generalized Caption and Dual Fusion Enhancement
Chenghao Li
Dake Chen
Yuke Zhang
Peter A. Beerel
DiffM
79
8
0
13 Sep 2023
DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models
Namhyuk Ahn
Junsoo Lee
Chunggi Lee
Kunhee Kim
Daesik Kim
Seung-Hun Nam
Kibeom Hong
DiffM
89
24
0
13 Sep 2023
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
Xingchao Liu
Xiwen Zhang
Jianzhu Ma
Jian Peng
Qiang Liu
191
223
0
12 Sep 2023
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Li Chen
Mengyi Zhao
Yiheng Liu
Mingxu Ding
Yangyang Song
...
Xu Wang
Hao Yang
Jing Liu
Kang Du
Min Zheng
DiffM
75
55
0
11 Sep 2023
Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips
Yufei Ye
Poorvi Hebbar
Abhinav Gupta
Shubham Tulsiani
DiffM
105
46
0
11 Sep 2023
ITI-GEN: Inclusive Text-to-Image Generation
Cheng Zhang
Xuanbai Chen
Siqi Chai
Chen Henry Wu
Dmitry Lagun
Thabo Beeler
Fernando de la Torre
VLM
122
58
0
11 Sep 2023
PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud
Chengyu Wang
Zhongjie Duan
Bingyan Liu
Xinyi Zou
Cen Chen
Kui Jia
Jun Huang
DiffM
65
4
0
11 Sep 2023
FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions
Jiong Wang
Fengyu Yang
Wenbo Gou
Bingliang Li
Danqi Yan
Ailing Zeng
Yijun Gao
Junle Wang
Yanqing Jing
Ruimao Zhang
91
1
0
10 Sep 2023
MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask
Yupeng Zhou
Daquan Zhou
Zuo-Liang Zhu
Yaxing Wang
Qibin Hou
Jiashi Feng
75
12
0
08 Sep 2023
MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers
Sijia Li
Chen Chen
H. Lu
DiffM
81
10
0
08 Sep 2023
AdBooster: Personalized Ad Creative Generation using Stable Diffusion Outpainting
Veronika Shilova
Ludovic Dos Santos
Flavian Vasile
Gaetan Racic
Ugo Tanielian
DiffM
56
7
0
08 Sep 2023
Chasing Consistency in Text-to-3D Generation from a Single Image
Yichen Ouyang
Wenhao Chai
Jiayi Ye
Dapeng Tao
Yibing Zhan
Gaoang Wang
DiffM
80
15
0
07 Sep 2023
Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model
Sungwon Hwang
J. Hyung
Jaegul Choo
DiffM
58
4
0
07 Sep 2023
My Art My Choice: Adversarial Protection Against Unruly AI
Anthony Rhodes
Ram Bhagat
U. Ciftci
Ilke Demir
DiffM
98
4
0
06 Sep 2023
MCM: Multi-condition Motion Synthesis Framework for Multi-scenario
Zeyu Ling
Bo Han
Yongkang Wong
Mohan Kankanhalli
Weidong Geng
DiffM
59
6
0
06 Sep 2023
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
L. Yu
Bowen Shi
Ramakanth Pasunuru
Benjamin Muller
O. Yu. Golovneva
...
Yaniv Taigman
Maryam Fazel-Zarandi
Asli Celikyilmaz
Luke Zettlemoyer
Armen Aghajanyan
MLLM
101
142
0
05 Sep 2023
NICE: CVPR 2023 Challenge on Zero-shot Image Captioning
Taehoon Kim
Pyunghwan Ahn
Sangyun Kim
Sihaeng Lee
Mark A Marsden
...
Yujin Wang
Yimu Wang
Tiancheng Gu
Xingchang Lv
Mingmao Sun
VLM
132
6
0
05 Sep 2023
StyleAdapter: A Single-Pass LoRA-Free Model for Stylized Image Generation
Zhouxia Wang
Xintao Wang
Liangbin Xie
Zhongang Qi
Ying Shan
Wenping Wang
Ping Luo
DiffM
53
12
0
04 Sep 2023
ControlMat: A Controlled Generative Approach to Material Capture
Giuseppe Vecchio
Rosalie Martin
Arthur Roullier
Adrien Kaiser
Romain Rouffet
Valentin Deschaintre
T. Boubekeur
DiffM
80
40
0
04 Sep 2023