Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.06721
Cited By
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
13 August 2023
Hu Ye
Jun Zhang
Siyi Liu
Xiao Han
Wei Yang
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models"
29 / 579 papers shown
Title
ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation
Dar-Yen Chen
Hamish Tennent
Ching-Wen Hsu
DiffM
16
24
0
04 Dec 2023
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
Yuchao Gu
Yipin Zhou
Bichen Wu
Licheng Yu
Jia-Wei Liu
Rui Zhao
Jay Zhangjie Wu
David Junhao Zhang
Mike Zheng Shou
Kevin Tang
DiffM
VGen
67
37
0
04 Dec 2023
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
L. Ran
Xiaodong Cun
Jia-Wei Liu
Rui Zhao
Song Zijie
Xintao Wang
Jussi Keppo
Mike Zheng Shou
37
11
0
04 Dec 2023
ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation
Peng Wang
Yichun Shi
16
165
0
02 Dec 2023
VideoBooth: Diffusion-based Video Generation with Image Prompts
Yuming Jiang
Tianxing Wu
Shuai Yang
Chenyang Si
Dahua Lin
Yu Qiao
Chen Change Loy
Ziwei Liu
DiffM
VGen
40
65
0
01 Dec 2023
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter
Gongye Liu
Menghan Xia
Yong Zhang
Haoxin Chen
Jinbo Xing
Xintao Wang
Yujiu Yang
Ying Shan
DiffM
VGen
141
0
0
01 Dec 2023
DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars
Tobias Kirschstein
Simon Giebenhain
Matthias Nießner
34
27
0
30 Nov 2023
CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model
Jianhao Zeng
Dan Song
Weizhi Nie
Hongshuo Tian
Tongtong Wang
Anan Liu
DiffM
28
20
0
30 Nov 2023
AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text
Jianfeng Zhang
Xuanmeng Zhang
Huichao Zhang
Jun Hao Liew
Chenxu Zhang
Yi Yang
Jiashi Feng
DiffM
58
15
0
29 Nov 2023
Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
Chi-Pin Huang
Kai-Po Chang
Chung-Ting Tsai
Yung-Hsuan Lai
Fu-En Yang
Yu-Chiang Frank Wang
DiffM
13
48
0
29 Nov 2023
When StyleGAN Meets Stable Diffusion: a
W
+
\mathscr{W}_+
W
+
Adapter for Personalized Image Generation
Xiaoming Li
Xinyu Hou
Chen Change Loy
36
11
0
29 Nov 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffM
VGen
28
114
0
28 Nov 2023
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Liucheng Hu
Xin Gao
Peng Zhang
Ke Sun
Bang Zhang
Liefeng Bo
DiffM
VGen
44
342
0
28 Nov 2023
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Zhongcong Xu
Jianfeng Zhang
Jun Hao Liew
Hanshu Yan
Jia-Wei Liu
Chenxu Zhang
Jiashi Feng
Mike Zheng Shou
VGen
DiffM
33
184
0
27 Nov 2023
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Omri Avrahami
Amir Hertz
Yael Vinker
Moab Arar
Shlomi Fruchter
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
DiffM
60
32
0
16 Nov 2023
Stable Diffusion Reference Only: Image Prompt and Blueprint Jointly Guided Multi-Condition Diffusion Model for Secondary Painting
Hao Ai
Lu Sheng
DiffM
16
3
0
04 Nov 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
26
280
0
30 Oct 2023
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Jinbo Xing
Menghan Xia
Yong Zhang
Haoxin Chen
Wangbo Yu
Hanyuan Liu
Xintao Wang
Tien-Tsin Wong
Ying Shan
VGen
47
224
0
18 Oct 2023
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
Zeqiang Lai
Xizhou Zhu
Jifeng Dai
Yu Qiao
Wenhai Wang
MLLM
DiffM
54
22
0
11 Oct 2023
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
Bo-Wen Zeng
Shanglin Li
Yutang Feng
Ling Yang
Hong Li
...
Conghui He
Wentao Zhang
Jianzhuang Liu
Baochang Zhang
Shuicheng Yan
DiffM
32
1
0
09 Oct 2023
AnyDoor: Zero-shot Object-level Image Customization
Xi Chen
Lianghua Huang
Yu Liu
Yujun Shen
Deli Zhao
Hengshuang Zhao
DiffM
46
256
0
18 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
28
782
0
10 Jul 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffM
VLM
28
39
0
01 Jun 2023
Versatile Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers
Ke Xu
Hongrui Chen
Zihao Zhu
Li Liu
Baoyuan Wu
DiffM
30
11
0
01 Jun 2023
Real-World Image Variation by Aligning Diffusion Inversion Chain
Yuechen Zhang
Jinbo Xing
Eric Lo
Jiaya Jia
27
34
0
30 May 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
287
4,261
0
30 Jan 2023
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Wenhu Chen
Hexiang Hu
Chitwan Saharia
William W. Cohen
VLM
125
161
0
29 Sep 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLM
BDL
VLM
CLIP
392
4,154
0
28 Jan 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,796
0
24 Feb 2021
Previous
1
2
3
...
10
11
12