arXiv:2302.05543
Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 3,090 papers shown
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Haozhe Zhao
Xiaojian Ma
Liang Chen
Shuzheng Si
Rujie Wu
Kaikai An
Peiyu Yu
Minjia Zhang
Qing Li
Baobao Chang
108
63
0
07 Jul 2024
PartCraft: Crafting Creative Objects by Parts
Kam Woh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
104
8
0
05 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
97
8
0
05 Jul 2024
VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing
Shang Liu
Chaohui Yu
Chenjie Cao
Wen Qian
Fan Wang
DiffM
117
6
0
05 Jul 2024
Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect Detection
Federico Girella
Ziyue Liu
Franco Fummi
Francesco Setti
Marco Cristani
Luigi Capogrosso
98
4
0
04 Jul 2024
Timestep-Aware Correction for Quantized Diffusion Models
Yuzhe Yao
Feng Tian
Jun Chen
Haonan Lin
Guang Dai
Yong Liu
Jingdong Wang
DiffM, MQ
97
5
0
04 Jul 2024
Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration
Yuhong Zhang
Hengsheng Zhang
Xinning Chai
Zhengxue Cheng
Rong Xie
Li Song
Wenjun Zhang
DiffM
76
5
0
04 Jul 2024
FDS: Feedback-guided Domain Synthesis with Multi-Source Conditional Diffusion Models for Domain Generalization
Mehrdad Noori
Milad Cheraghalikhani
Ali Bahri
G. A. V. Hakim
David Osowiechi
Moslem Yazdanpanah
Ismail Ben Ayed
Christian Desrosiers
92
1
0
04 Jul 2024
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations
Benno Krojer
Dheeraj Vattikonda
Luis Lara
Varun Jampani
Eva Portelance
Christopher Pal
Siva Reddy
EGVM, VGen
111
7
0
03 Jul 2024
Magic Insert: Style-Aware Drag-and-Drop
Nataniel Ruiz
Yuanzhen Li
Neal Wadhwa
Yael Pritch
Michael Rubinstein
David E. Jacobs
Shlomi Fruchter
DiffM
96
8
0
02 Jul 2024
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
Fei Shen
Hu Ye
Sibo Liu
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
140
40
0
02 Jul 2024
UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
Jingjing Ren
Wenbo Li
Haoyu Chen
Renjing Pei
Bin Shao
Yong Guo
Long Peng
Fenglong Song
Lei Zhu
110
22
0
02 Jul 2024
TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation
Chaofan Luo
Donglin Di
Xun Yang
Yongjia Ma
Zhou Xue
Chen Wei
Yebin Liu
3DGS
90
9
0
02 Jul 2024
SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules
Suyi Li
Lingyun Yang
Xiaoxiao Jiang
Hanfeng Lu
Zhipeng Di
...
Tao Lan
Guodong Yang
Lin Qu
Liping Zhang
Wei Wang
68
4
0
02 Jul 2024
GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Huanzhang Dou
Ruixiang Li
Wei Su
Xi Li
DiffM
92
1
0
02 Jul 2024
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models
Seyedmorteza Sadat
Manuel Kansy
Otmar Hilliges
Romann M. Weber
98
14
0
02 Jul 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Yuchen Li
Fan Ma
Zongxin Yang
Yue Yang
173
11
0
02 Jul 2024
Label-free Neural Semantic Image Synthesis
Jiayi Wang
Kevin Laube
Yumeng Li
J. H. Metzen
Shin-I Cheng
Julio Borges
Anna Khoreva
DiffM
150
0
0
01 Jul 2024
Pictures Of MIDI: Controlled Music Generation via Graphical Prompts for Image-Based Diffusion Inpainting
Scott H. Hawley
83
2
0
01 Jul 2024
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
Yiming Zhang
Yicheng Gu
Yanhong Zeng
Zhening Xing
Yuancheng Wang
Zhizheng Wu
Kai Chen
VGen
105
41
0
01 Jul 2024
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
Chang-Han Yeh
Chin-Yang Lin
Zhixiang Wang
Chi-Wei Hsiao
Ting-Hsuan Chen
Hau-Shiang Shiu
Yu-Lun Liu
VGen, DiffM
198
6
0
01 Jul 2024
InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation
Haofan Wang
Peng-Fei Xing
Renyuan Huang
Hao Ai
Qixun Wang
Xu Bai
DiffM
110
25
0
30 Jun 2024
Diffusion Models and Representation Learning: A Survey
Michael Fuest
Pingchuan Ma
Ming Gui
Johannes S. Fischer
Vincent Tao Hu
Bjorn Ommer
DiffM
104
24
0
30 Jun 2024
Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation
Yuchuan Tian
Jianhong Han
Hanting Chen
Yuanyuan Xi
Guoyang Zhang
Jie Hu
Chao Xu
Yunhe Wang
ViT, VLM
82
8
0
30 Jun 2024
Diffusion-BBO: Diffusion-Based Inverse Modeling for Online Black-Box Optimization
D. Wu
Nikki Lijing Kuang
Ruijia Niu
Yi-An Ma
Rose Yu
105
1
0
30 Jun 2024
OccFusion: Rendering Occluded Humans with Generative Diffusion Priors
Adam Sun
Tiange Xiang
Scott Delp
Li Fei-Fei
Ehsan Adeli
94
2
0
29 Jun 2024
SemUV: Deep Learning based semantic manipulation over UV texture map of virtual human heads
Anirban Mukherjee
Venkat Suprabath Bitra
Vignesh Bondugula
Tarun Reddy Tallapureddy
D. Jayagopi
3DH
97
0
0
28 Jun 2024
MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Yuang Zhang
Jiaxi Gu
L. Wang
Han Wang
Junqi Cheng
Yuefeng Zhu
Fangyuan Zou
VGen
166
85
0
28 Jun 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM, DiffM
162
3
0
28 Jun 2024
Dataset Size Recovery from LoRA Weights
Mohammad Salama
Jonathan Kahana
Eliahu Horwitz
Yedid Hoshen
86
5
0
27 Jun 2024
Subtractive Training for Music Stem Insertion using Latent Diffusion Models
Ivan Villa-Renteria
Mason L. Wang
Zachary Shah
Zhe Li
Soohyun Kim
Neelesh Ramachandran
Mert Pilanci
181
0
0
27 Jun 2024
AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation
Yanan Sun
Yanchen Liu
Yinhao Tang
Wenjie Pei
Kai Chen
DiffM
114
11
0
27 Jun 2024
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data
William Berman
A. Peysakhovich
91
4
0
26 Jun 2024
MultiDiff: Consistent Novel View Synthesis from a Single Image
Norman Muller
Katja Schwarz
Barbara Roessle
Lorenzo Porzi
Samuel Rota Buló
Matthias Nießner
Peter Kontschieder
DiffM
154
27
0
26 Jun 2024
DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance
Younghyun Kim
Geunmin Hwang
Junyu Zhang
Eunbyung Park
154
10
0
26 Jun 2024
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
118
25
0
26 Jun 2024
Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model
Zhuo Zheng
Stefano Ermon
Dongjun Kim
Liangpei Zhang
Yanfei Zhong
DiffM
87
20
0
26 Jun 2024
Towards Synchronous Memorizability and Generalizability with Site-Modulated Diffusion Replay for Cross-Site Continual Segmentation
Dunyuan Xu
Xi Wang
Jingyang Zhang
Pheng-Ann Heng
MedIm, CLL
156
0
0
26 Jun 2024
Text-Animator: Controllable Visual Text Video Generation
Lin Liu
Quande Liu
Shengju Qian
Yuan Zhou
Wengang Zhou
Houqiang Li
Lingxi Xie
Qi Tian
VGen
99
1
0
25 Jun 2024
Test-Time Generative Augmentation for Medical Image Segmentation
Xiao Ma
Yuhui Tao
Yuhan Zhang
Zexuan Ji
Yizhe Zhang
Qiang Chen
MedIm
85
1
0
25 Jun 2024
Semantic Deep Hiding for Robust Unlearnable Examples
Ruohan Meng
Chenyu Yi
Yi Yu
Siyuan Yang
Bingquan Shen
Alex C. Kot
133
5
0
25 Jun 2024
Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image
Jinkun Hao
Junshu Tang
Jiangning Zhang
Ran Yi
Yijia Hong
Moran Li
Weijian Cao
Yating Wang
Lizhuang Ma
DiffM
80
0
0
24 Jun 2024
Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment
Jun Fu
Wei Zhou
Qiuping Jiang
Hantao Liu
Guangtao Zhai
VLM, CLIP
79
8
0
24 Jun 2024
EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models
Zhiyu Tan
Xiaomeng Yang
Luozheng Qin
Mengping Yang
Cheng Zhang
Hao Li
106
8
0
24 Jun 2024
Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization
Yuhang Ma
Wenting Xu
Jiji Tang
Qinfeng Jin
Rongsheng Zhang
Zeng Zhao
Changjie Fan
Zhipeng Hu
83
6
0
24 Jun 2024
DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution
Aiwen Jiang
Zhi Wei
Long Peng
Feiqiang Liu
Wenbo Li
Mingwen Wang
DiffM
82
2
0
24 Jun 2024
Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models
Yichen Sun
Zhixuan Chu
Zhan Qin
Kui Ren
DiffM
86
1
0
24 Jun 2024
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
Sandeep Mishra
Oindrila Saha
A. Bovik
102
0
0
24 Jun 2024
MLPHand: Real Time Multi-View 3D Hand Mesh Reconstruction via MLP Modeling
Jian Yang
Jiakun Li
Guoming Li
Zhen Shen
Huai-Yu Wu
Zhaoxin Fan
Heng Huang
3DH
90
1
0
23 Jun 2024
X-ray2CTPA: Generating 3D CTPA scans from 2D X-ray conditioning
Noa Cahan
Eyal Klang
Galit Aviram
Y. Barash
Eli Konen
Raja Giryes
H. Greenspan
MedIm
75
0
0
23 Jun 2024