Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.06721
Cited By
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
13 August 2023
Hu Ye
Jun Zhang
Siyi Liu
Xiao Han
Wei Yang
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models"
50 / 580 papers shown
Title
Beyond Imperfections: A Conditional Inpainting Approach for End-to-End Artifact Removal in VTON and Pose Transfer
Aref Tabatabaei
Zahra Dehghanian
M. Amirmazlaghani
DiffM
37
0
0
05 Oct 2024
Audio-Agent: Leveraging LLMs For Audio Generation, Editing and Composition
Zixuan Wang
Chi-Keung Tang
Chi-Keung Tang
DiffM
VGen
LLMAG
49
4
0
04 Oct 2024
Event-Customized Image Generation
Zhen Wang
Yilei Jiang
Dong Zheng
Jun Xiao
Long Chen
DiffM
26
1
0
03 Oct 2024
DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation
Jing He
Haodong Li
Yongzhe Hu
Guibao Shen
Yingjie Cai
Weichao Qiu
Ying-Cong Chen
DiffM
32
2
0
02 Oct 2024
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
Rinon Gal
Adi Haviv
Yuval Alaluf
Amit H. Bermano
Daniel Cohen-Or
Gal Chechik
DiffM
34
3
0
02 Oct 2024
Causal Representation Learning with Generative Artificial Intelligence: Application to Texts as Treatments
Kosuke Imai
Kentaro Nakamura
CML
28
4
0
01 Oct 2024
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer
Zhen Han
Zeyinzi Jiang
Yulin Pan
Jingfeng Zhang
Chaojie Mao
Chenwei Xie
Yu Liu
Jingren Zhou
DiffM
35
17
0
30 Sep 2024
Illustrious: an Open Advanced Illustration Model
Sang Hyun Park
Jun Young Koh
Junha Lee
Joy Song
Dongha Kim
Hoyeon Moon
Hyunju Lee
Min Song
VLM
46
1
0
30 Sep 2024
Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection
Yuhang Ma
Wenting Xu
Chaoyi Zhao
Keqiang Sun
Qinfeng Jin
Zeng Zhao
Changjie Fan
Zhipeng Hu
VGen
DiffM
32
1
0
29 Sep 2024
High Quality Human Image Animation using Regional Supervision and Motion Blur Condition
Zhongcong Xu
Chaoyue Song
Guoxian Song
Jianfeng Zhang
Jun Hao Liew
...
You Xie
Linjie Luo
Guosheng Lin
Jiashi Feng
Mike Zheng Shou
DiffM
3DH
VGen
30
3
0
29 Sep 2024
Conditional Image Synthesis with Diffusion Models: A Survey
Zheyuan Zhan
Defang Chen
Jian-Ping Mei
Zhenghe Zhao
Jiawei Chen
Chun Chen
Siwei Lyu
Can Wang
VLM
48
5
0
28 Sep 2024
Fusion is all you need: Face Fusion for Customized Identity-Preserving Image Synthesis
Salaheldin Mohamed
Dong Han
Yong Li
23
1
0
27 Sep 2024
FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction
Runze He
Kai Ma
Linjiang Huang
Shaofei Huang
Jialin Gao
Xiaoming Wei
Jiao Dai
Jizhong Han
Si Liu
DiffM
52
7
0
26 Sep 2024
Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation
Qihan Huang
Siming Fu
Jinlong Liu
Hao Jiang
Yipeng Yu
Jie Song
39
5
0
26 Sep 2024
AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status
Jinghao Zhang
Wen Qian
Hao Luo
Fan Wang
Feng Zhao
DiffM
28
0
0
26 Sep 2024
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Kyuheon Jung
Yongdeuk Seo
Seongwoo Cho
Jaeyoung Kim
Hyun-seok Min
Sungchul Choi
26
0
0
25 Sep 2024
Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification
Xinrui Zhou
Yuhao Huang
Haoran Dou
Shijing Chen
Ao Chang
...
Jie Jessie Ren
Ruobing Huang
Jun Cheng
Wufeng Xue
Dong Ni
MedIm
180
0
0
25 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
63
10
0
23 Sep 2024
BrainDreamer: Reasoning-Coherent and Controllable Image Generation from EEG Brain Signals via Language Guidance
Ling Wang
Chen Wu
Lin Wang
DiffM
36
0
0
21 Sep 2024
Imagine yourself: Tuning-Free Personalized Image Generation
Zecheng He
Bo Sun
Felix Juefei-Xu
Haoyu Ma
Ankit Ramchandani
...
Ning Zhang
Peizhao Zhang
Roshan Sumbaly
Peter Vajda
Animesh Sinha
DiffM
37
16
0
20 Sep 2024
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
Zhengguang Zhou
Jing Li
Huaxia Li
Nemo Chen
Xu Tang
DiffM
VGen
44
8
0
19 Sep 2024
GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation
Shuowen Liang
Sisi Li
Qingyun Wang
Cen Zhang
Kaiquan Zhu
Tian Yang
DiffM
28
0
0
18 Sep 2024
OmniGen: Unified Image Generation
Shitao Xiao
Yueze Wang
Yueze Wang
Huaying Yuan
Xingrun Xing
Ruiran Yan
Shuting Wang
Tiejun Huang
Zheng Liu
DiffM
VLM
SyDa
62
65
0
17 Sep 2024
One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild
Dongqi Fan
Tao Chen
Mingjie Wang
Rui Ma
Qiang Tang
Zili Yi
Qian Wang
Liang Chang
27
0
0
15 Sep 2024
MagicStyle: Portrait Stylization Based on Reference Image
Zhaoli Deng
Kaibin Zhou
Fanyi Wang
Zhenpeng Mi
DiffM
49
1
0
12 Sep 2024
BrainDecoder: Style-Based Visual Decoding of EEG Signals
Minsuk Choi
Hiroshi Ishikawa
25
0
0
09 Sep 2024
CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization
Nan Chen
Mengqi Huang
Zhuowei Chen
Yang Zheng
Lei Zhang
Zhendong Mao
DiffM
52
5
0
09 Sep 2024
Plug-and-Hide: Provable and Adjustable Diffusion Generative Steganography
Jiahao Zhu
Zixuan Chen
Lingxiao Yang
Xiaohua Xie
Yi Zhou
DiffM
23
0
0
07 Sep 2024
Training-Free Style Consistent Image Synthesis with Condition and Mask Guidance in E-Commerce
Guandong Li
DiffM
40
2
0
07 Sep 2024
Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors
Haiyu Wu
Jaskirat Singh
Sicong Tian
Liang Zheng
Kevin W. Bowyer
CVBM
44
3
0
04 Sep 2024
LinFusion: 1 GPU, 1 Minute, 16K Image
Songhua Liu
Weihao Yu
Zhenxiong Tan
Xinchao Wang
48
13
0
03 Sep 2024
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
Qihua Chen
Yi Ma
Haozhao Wang
Junkun Yuan
Wenzhe Zhao
Q. Tian
Hongmei Wang
Shaobo Min
Qifeng Chen
Wei Liu
DiffM
45
16
0
02 Sep 2024
PS-StyleGAN: Illustrative Portrait Sketching using Attention-Based Style Adaptation
Kushal Kumar Jain
Ankith Varun J
A. Namboodiri
33
0
0
31 Aug 2024
Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
Xiaoyu Jin
Zunnan Xu
Mingwen Ou
Wenming Yang
DiffM
46
7
0
29 Aug 2024
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation
Abdelrahman Eldesokey
Peter Wonka
DiffM
46
4
0
27 Aug 2024
Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models
Chaohua Shi
Xuan Wang
Si Shi
Xule Wang
Mingrui Zhu
Nannan Wang
X. Gao
CoGe
43
1
0
26 Aug 2024
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
Junwon Lee
Jaekwon Im
Dabin Kim
Juhan Nam
VGen
40
9
0
21 Aug 2024
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
Haoning Wu
Shaocheng Shen
Qiang Hu
Xiaoyun Zhang
Ya Zhang
Yanfeng Wang
40
10
0
20 Aug 2024
Combo: Co-speech holistic 3D human motion generation and efficient customizable adaptation in harmony
Chao Xu
Mingze Sun
Zhi-Qi Cheng
Fei Wang
Yang Liu
Baigui Sun
Ruqi Huang
Alexander G. Hauptmann
VGen
45
2
0
18 Aug 2024
RepControlNet: ControlNet Reparameterization
Zhaoli Deng
Kaibin Zhou
Fanyi Wang
Zhenpeng Mi
DiffM
48
2
0
17 Aug 2024
MagicFace: Training-free Universal-Style Human Image Customized Synthesis
Yibin Wang
Weizhong Zhang
Cheng Jin
DiffM
39
3
0
14 Aug 2024
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization
Junjie He
Yifeng Geng
Liefeng Bo
DiffM
54
20
0
12 Aug 2024
BRAT: Bonus oRthogonAl Token for Architecture Agnostic Textual Inversion
James Baker
42
1
0
08 Aug 2024
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts
Ciara Rowles
Shimon Vainer
Dante De Nigris
Slava Elizarov
Konstantin Kutsy
Simon Donné
DiffM
46
9
0
06 Aug 2024
Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection
Sen Nie
Zhuo Wang
Xinxin Wang
Kun He
DiffM
73
0
0
06 Aug 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu
Shitian Zhao
Le Zhuo
Weifeng Lin
Ping Luo
Xinyue Li
Qi Qin
Yu Qiao
Hongsheng Li
Peng Gao
MLLM
76
48
0
05 Aug 2024
SiCo: An Interactive Size-Controllable Virtual Try-On Approach for Informed Decision-Making
Sherry X. Chen
Alex Christopher Lim
Yimeng Liu
Pradeep Sen
Misha Sra
32
0
0
05 Aug 2024
CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models
Kushal Kumar Jain
Steven A. Grosz
A. Namboodiri
Anil K. Jain
DiffM
43
2
0
02 Aug 2024
Towards Localized Fine-Grained Control for Facial Expression Generation
Tuomas Varanka
Huai-Qian Khor
Yante Li
Mengting Wei
Hanwei Kung
N. Sebe
Guoying Zhao
43
4
0
25 Jul 2024
PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
Rishubh Parihar
VS Sachidanand
Sabariswaran Mani
Tejan Karmali
R. V. Babu
DiffM
39
13
0
24 Jul 2024
Previous
1
2
3
...
6
7
8
...
10
11
12
Next