Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.09481
Cited By
AnyDoor: Zero-shot Object-level Image Customization
18 July 2023
Xi Chen
Lianghua Huang
Yu Liu
Yujun Shen
Deli Zhao
Hengshuang Zhao
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"AnyDoor: Zero-shot Object-level Image Customization"
50 / 219 papers shown
Title
SOEDiff: Efficient Distillation for Small Object Editing
Yiming Wu
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Ronghua Liang
DiffM
68
0
0
03 Jan 2025
Edicho: Consistent Image Editing in the Wild
Qingyan Bai
Hao Ouyang
Yinghao Xu
Qiuyu Wang
Ceyuan Yang
Ka Leong Cheng
Yujun Shen
Qifeng Chen
DiffM
74
1
0
30 Dec 2024
DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder
Ente Lin
Xujie Zhang
Fuwei Zhao
Yuxuan Luo
Xin Dong
Long Zeng
Xiaodan Liang
VLM
DiffM
71
2
0
23 Dec 2024
FashionComposer: Compositional Fashion Image Generation
S. Ji
Yiyang Wang
Xi Chen
Xiaogang Xu
Hao Luo
Hengshuang Zhao
78
0
0
18 Dec 2024
Urban Air Temperature Prediction using Conditional Diffusion Models
Siyang Dai
Jun Liu
Ngai-Man Cheung
82
0
0
18 Dec 2024
OmniPrism: Learning Disentangled Visual Concept for Image Generation
Yangyang Li
Daqing Liu
Wu Liu
Allen He
Xinchen Liu
Yongdong Zhang
Guoqing Jin
DiffM
CoGe
83
0
0
16 Dec 2024
SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models
Zhaoyang Sun
Shengwu Xiong
Yaxiong Chen
Fei Du
Weihua Chen
Fan Wang
Yi Rong
DiffM
74
1
0
15 Dec 2024
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism
Jun Zheng
Jing Wang
Fuwei Zhao
Xujie Zhang
Xiaodan Liang
DiffM
VGen
73
0
0
13 Dec 2024
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Xi Chen
Zhifei Zhang
He Zhang
Yuqian Zhou
S. Kim
...
Nanxuan Zhao
Yilin Wang
Hui Ding
Zhe Lin
Hengshuang Zhao
VGen
DiffM
123
21
0
10 Dec 2024
StyleMaster: Stylize Your Video with Artistic Generation and Translation
Zixuan Ye
Huijuan Huang
Xintao Wang
Pengfei Wan
Di Zhang
Wenhan Luo
DiffM
VGen
98
4
0
10 Dec 2024
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Tong Wu
Yinghao Xu
Ryan Po
Mengchen Zhang
Guandao Yang
Jiaqi Wang
Ziqiang Liu
Dahua Lin
Gordon Wetzstein
76
0
0
10 Dec 2024
Pinco: Position-induced Consistent Adapter for Diffusion Transformer in Foreground-conditioned Inpainting
Guangben Lu
Yuzhen Du
Zhimin Sun
Ran Yi
Yifan Qi
Yizhe Tang
Tianyi Wang
Lizhuang Ma
Fangyuan Zou
DiffM
80
1
0
05 Dec 2024
AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?
Shouwei Ruan
Hanqin Liu
Yao Huang
Xiaoqi Wang
Caixin Kang
Hang Su
Yinpeng Dong
Xingxing Wei
VGen
93
0
0
04 Dec 2024
MoTrans: Customized Motion Transfer with Text-driven Video Diffusion Models
Xiaomin Li
Xu Jia
Qinghe Wang
Haiwen Diao
Mengmeng Ge
Pengxiang Li
You He
Huchuan Lu
VGen
DiffM
68
3
0
02 Dec 2024
Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise
Yeonguk Yu
Minhwan Ko
Sungho Shin
Kangmin Kim
K. Lee
NoLa
82
1
0
29 Nov 2024
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models
Shwetha Ram
T. Neiman
Qianli Feng
Andrew Stuart
S. D. Tran
Trishul Chilimbi
77
1
0
28 Nov 2024
Diffusion Autoencoders for Few-shot Image Generation in Hyperbolic Space
Lingxiao Li
Kaixuan Fan
Boqing Gong
Xiangyu Yue
DiffM
75
0
0
27 Nov 2024
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
Y. Li
Fan Ma
Yi Yang
140
2
0
24 Nov 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
H. Zhang
Yueting Zhuang
DiffM
106
15
0
24 Nov 2024
Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Alessandro Fontanella
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Sarah Parisot
38
2
0
16 Nov 2024
ColorEdit: Training-free Image-Guided Color editing with diffusion model
Xingxi Yin
Zhi Li
Jingfeng Zhang
Chenglin Li
Yin Zhang
DiffM
51
0
0
15 Nov 2024
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
Panwen Hu
Jin Jiang
Jianqi Chen
Mingfei Han
Shengcai Liao
Xiaojun Chang
Xiaodan Liang
VGen
DiffM
41
5
0
07 Nov 2024
BIFRÖST: 3D-Aware Image compositing with Language Instructions
Lingxiao Li
Kaixiong Gong
Weihong Li
Xili Dai
Tao Chen
Xiaojun Yuan
Xiangyu Yue
29
2
0
24 Oct 2024
Unbounded: A Generative Infinite Game of Character Life Simulation
Jialu Li
Yuanzhen Li
Neal Wadhwa
Yael Pritch
David E. Jacobs
Michael Rubinstein
Joey Tianyi Zhou
Nataniel Ruiz
VGen
AI4CE
36
4
0
24 Oct 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
97
10
0
24 Oct 2024
How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?
Jiahua Dong
Wenqi Liang
Hongliu Li
Duzhen Zhang
Meng Cao
Henghui Ding
Salman Khan
F. Khan
DiffM
65
9
0
23 Oct 2024
DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization
Haowei Zhu
Dehua Tang
Ji Liu
Mingjie Lu
Jintu Zheng
...
Spandan Tiwari
Ashish Sirasao
Jun-Hai Yong
Bin Wang
E. Barsoum
DiffM
32
5
0
22 Oct 2024
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Yujie Wei
Shiwei Zhang
Hangjie Yuan
Xiang Wang
Haonan Qiu
...
F. Liu
Zhizhong Huang
Jiaxin Ye
Yingya Zhang
Hongming Shan
DiffM
VGen
72
14
0
17 Oct 2024
AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing
DuoSheng Chen
Binghui Chen
Yifeng Geng
Liefeng Bo
DiffM
30
1
0
16 Oct 2024
ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion
Zitian Zhang
Frédéric Fortier-Chouinard
Mathieu Garon
Anand Bhattad
Jean-François Lalonde
DiffM
38
4
0
10 Oct 2024
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction
Leheng Li
Weichao Qiu
Xu Yan
Jing He
Kaiqiang Zhou
Yingjie Cai
Qing Lian
Bingbing Liu
Ying-Cong Chen
SyDa
DiffM
47
1
0
07 Oct 2024
Improving Image Clustering with Artifacts Attenuation via Inference-Time Attention Engineering
Kazumoto Nakamura
Yuji Nozawa
Yu-Chieh Lin
K. Nakata
Youyang Ng
ViT
35
1
0
07 Oct 2024
Event-Customized Image Generation
Zhen Wang
Yilei Jiang
Dong Zheng
Jun Xiao
Long Chen
DiffM
26
1
0
03 Oct 2024
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer
Zhen Han
Zeyinzi Jiang
Yulin Pan
Jingfeng Zhang
Chaojie Mao
Chenwei Xie
Yu Liu
Jingren Zhou
DiffM
32
17
0
30 Sep 2024
Conditional Image Synthesis with Diffusion Models: A Survey
Zheyuan Zhan
Defang Chen
Jian-Ping Mei
Zhenghe Zhao
Jiawei Chen
Chun Chen
Siwei Lyu
Can Wang
VLM
45
5
0
28 Sep 2024
FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction
Runze He
Kai Ma
Linjiang Huang
Shaofei Huang
Jialin Gao
Xiaoming Wei
Jiao Dai
Jizhong Han
Si Liu
DiffM
49
7
0
26 Sep 2024
AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status
Jinghao Zhang
Wen Qian
Hao Luo
Fan Wang
Feng Zhao
DiffM
28
0
0
26 Sep 2024
GroupDiff: Diffusion-based Group Portrait Editing
Yuming Jiang
Nanxuan Zhao
Qing Liu
Krishna Kumar Singh
Shuai Yang
Chen Change Loy
Ziwei Liu
DiffM
41
1
0
22 Sep 2024
Portrait Video Editing Empowered by Multimodal Generative Priors
Xuan Gao
Haiyao Xiao
Chenglai Zhong
Shimin Hu
Yudong Guo
Juyong Zhang
VGen
3DGS
42
4
0
20 Sep 2024
MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior
Weijing Tao
Xiaofeng Yang
Miaomiao Cui
Guosheng Lin
DiffM
26
1
0
16 Sep 2024
GroundingBooth: Grounding Text-to-Image Customization
Zhexiao Xiong
Wei Xiong
Jing Shi
He Zhang
Yizhi Song
Nathan Jacobs
DiffM
59
6
0
13 Sep 2024
CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization
Nan Chen
Mengqi Huang
Zhuowei Chen
Yang Zheng
Lei Zhang
Zhendong Mao
DiffM
49
5
0
09 Sep 2024
Thinking Outside the BBox: Unconstrained Generative Object Compositing
Gemma Canet Tarrés
Zhe Lin
Zhifei Zhang
Jianming Zhang
Yizhi Song
Dan Ruta
Andrew Gilbert
John Collomosse
Soo Ye Kim
DiffM
35
9
0
06 Sep 2024
What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer
Chaeyeon Chung
Sunghyun Park
J. Kim
Jaegul Choo
DiffM
36
0
0
29 Aug 2024
Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing
Yitong Yang
Yinglin Wang
Jing Wang
Tian Zhang
DiffM
40
1
0
24 Aug 2024
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
Tao Wu
Yong Zhang
Xintao Wang
Xianpan Zhou
Guangcong Zheng
Zhongang Qi
Ying Shan
Xi Li
VGen
DiffM
24
26
0
23 Aug 2024
Combo: Co-speech holistic 3D human motion generation and efficient customizable adaptation in harmony
Chao Xu
Mingze Sun
Zhi-Qi Cheng
Fei-Yue Wang
Yang Liu
Baigui Sun
Ruqi Huang
Alexander G. Hauptmann
VGen
45
2
0
18 Aug 2024
MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing
Chenjie Cao
Chaohui Yu
Yanwei Fu
Fan Wang
Xiangyang Xue
VGen
45
7
0
15 Aug 2024
BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation
Hee Suk Yoon
Eunseop Yoon
Joshua Tian Jin Tee
Kang Zhang
Yu-Jung Heo
Du-Seong Chang
Chang D. Yoo
36
3
0
12 Aug 2024
TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization
K. T. Pham
Jingye Chen
Qifeng Chen
DiffM
42
0
0
07 Aug 2024
Previous
1
2
3
4
5
Next