2308.06721
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
13 August 2023
Hu Ye
Jun Zhang
Siyi Liu
Xiao Han
Wei Yang
DiffM
Papers citing
"IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models"
50 / 579 papers shown
Title
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
Jiaqi Liu
Jichao Zhang
Paolo Rota
N. Sebe
DiffM
53
0
0
19 Mar 2025
Diffusion-based Facial Aesthetics Enhancement with 3D Structure Guidance
Lisha Li
Jingwen Hou
Weide Liu
Yuming Fang
Jiebin Yan
DiffM
56
1
0
18 Mar 2025
Concat-ID: Towards Universal Identity-Preserving Video Synthesis
Yong Zhong
Zhuoyi Yang
Jiayan Teng
Xiaotao Gu
Chongxuan Li
VGen
63
0
0
18 Mar 2025
The Power of Context: How Multimodality Improves Image Super-Resolution
Kangfu Mei
Hossein Talebi
Mojtaba Ardakani
Vishal M. Patel
P. Milanfar
M. Delbracio
DiffM
85
1
0
18 Mar 2025
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
Yulin Pan
Xiangteng He
Chaojie Mao
Zhen Han
Zeyinzi Jiang
J. Zhang
Yu Liu
EGVM
VLM
78
1
0
18 Mar 2025
DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode
Junjia Huang
Pengxiang Yan
Jinhang Cai
Jiyang Liu
Zhao Wang
Yitong Wang
Xinglong Wu
Guanbin Li
DiffM
72
0
0
17 Mar 2025
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
Dewei Zhou
Mingwei Li
Zongxin Yang
Yi Yang
94
0
0
17 Mar 2025
A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models
Ziqiang Li
Jun Li
Lizhi Xiong
Zhangjie Fu
Zechao Li
VLM
59
0
0
17 Mar 2025
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Yaowei Li
Lingen Li
Zhaoyang Zhang
Xiaoyu Li
Guangzhi Wang
Hongxiang Li
Xiaodong Cun
Ying Shan
Yuexian Zou
DiffM
67
1
0
17 Mar 2025
EditID: Training-Free Editable ID Customization for Text-to-Image Generation
Guandong Li
Zhaobin Chu
DiffM
67
0
0
16 Mar 2025
Personalize Anything for Free with Diffusion Transformer
Haoran Feng
Zehuan Huang
Lin Li
Hairong Lv
Lu Sheng
DiffM
87
1
0
16 Mar 2025
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Tsu-jui Fu
Yusu Qian
Chen Chen
Wenze Hu
Zhe Gan
Yuqing Yang
106
1
0
16 Mar 2025
MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization
Hengjia Li
Lifan Jiang
Xi Xiao
Tianyang Wang
Hongwei Yi
Boxi Wu
D. Cai
VGen
53
0
0
16 Mar 2025
Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars
Eric M. Chen
Di Liu
Sizhuo Ma
Michael Vasilkovsky
Bing Zhou
...
Wei Wang
Jiahao Luo
Dimitris N. Metaxas
Vincent Sitzmann
Jian Wang
3DGS
57
0
0
15 Mar 2025
Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder
Wonwoong Cho
Yan-Ying Chen
M. Klenk
David I. Inouye
Yanxia Zhang
DiffM
189
0
0
15 Mar 2025
ACMo: Attribute Controllable Motion Generation
Mingjie Wei
Xuemei Xie
G. Shi
58
0
0
14 Mar 2025
GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior
Zichen Tang
Yuan Yao
Miaomiao Cui
Liefeng Bo
Hongyu Yang
3DGS
DiffM
60
0
0
14 Mar 2025
Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models
Hao-Ran Cheng
Erjia Xiao
Yichi Wang
Kaidi Xu
Mengshu Sun
Jindong Gu
Renjing Xu
41
0
0
14 Mar 2025
Long Context Tuning for Video Generation
Yuwei Guo
Ceyuan Yang
Ziyan Yang
Zhibei Ma
Zhijie Lin
Zhenheng Yang
Dahua Lin
Lu Jiang
DiffM
VGen
76
3
0
13 Mar 2025
CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance
Yufan Deng
Xun Guo
Yanjie Wang
Jacob Zhiyuan Fang
Angtian Wang
Shenghai Yuan
Yiding Yang
Bo Liu
Haibin Huang
Chongyang Ma
DiffM
VGen
72
0
0
13 Mar 2025
Piece it Together: Part-Based Concepting with IP-Priors
Elad Richardson
Kfir Goldberg
Yuval Alaluf
Daniel Cohen-Or
DiffM
66
0
0
13 Mar 2025
MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment
Hao Zhou
Xiaobao Guo
Yuzhe Zhu
A. Kong
DiffM
63
1
0
13 Mar 2025
Distilling Diversity and Control in Diffusion Models
Rohit Gandikota
David Bau
58
2
0
13 Mar 2025
Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation
Yi Wu
Lingting Zhu
Lei Liu
Wandi Qiao
Ziqiang Li
Lequan Yu
Bin Li
DiffM
52
0
0
13 Mar 2025
MoEdit: On Learning Quantity Perception for Multi-object Image Editing
Yanfeng Li
Kahou Chan
Yue Sun
C. Lam
Tong Tong
Zitong Yu
Keren Fu
Xiaohong Liu
Tao Tan
DiffM
41
0
0
13 Mar 2025
ConsisLoRA: Enhancing Content and Style Consistency for LoRA-based Style Transfer
Bolin Chen
Baoquan Zhao
H. Xie
Yi Cai
Qing Li
Xudong Mao
DiffM
59
0
0
13 Mar 2025
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
Yijing Lin
Mengqi Huang
Shuhan Zhuang
Zhendong Mao
VGen
51
0
0
13 Mar 2025
DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image
Qi Zhao
Zhan Ma
Pan Zhou
VGen
75
0
0
13 Mar 2025
Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
Héctor Laria
Alexandra Gomez-Villa
Jiang Qin
Muhammad Atif Butt
Bogdan Raducanu
Javier Vázquez-Corral
Joost van de Weijer
Kai Wang
DiffM
65
0
0
12 Mar 2025
On the Limitations of Vision-Language Models in Understanding Image Transforms
Ahmad Mustafa Anis
Hasnain Ali
Saquib Sarfraz
VLM
Presented at ResearchTrend Connect | VLM on 28 Mar 2025
151
0
0
12 Mar 2025
InteractEdit: Zero-Shot Editing of Human-Object Interactions in Images
Jiun Tian Hoe
Weipeng Hu
Wei Zhou
Chao Xie
Ziwei Wang
Chee Seng Chan
Xudong Jiang
Y. Tan
61
0
0
12 Mar 2025
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Haoxuan Wang
Jinlong Peng
Q. He
Hao Yang
Ying Jin
...
Yanjie Pan
Zhenye Gan
M. Chi
Bo Peng
Yishuo Wang
DiffM
60
1
0
12 Mar 2025
Adv-CPG: A Customized Portrait Generation Framework with Facial Adversarial Attacks
Junying Wang
Hongyuan Zhang
Yuan Yuan
AAML
PICV
80
0
0
11 Mar 2025
OminiControl2: Efficient Conditioning for Diffusion Transformers
Zhenxiong Tan
Qiaochu Xue
Xingyi Yang
Songhua Liu
Xinchao Wang
DiffM
50
0
0
11 Mar 2025
NullFace: Training-Free Localized Face Anonymization
Han-Wei Kung
Tuomas Varanka
Terence Sim
N. Sebe
DiffM
PICV
68
0
0
11 Mar 2025
MF-VITON: High-Fidelity Mask-Free Virtual Try-On with Minimal Input
Zhenchen Wan
Yanwu Xu
Dongting Hu
Weilun Cheng
Tianxi Chen
Zihan Wang
Feng Liu
Tongliang Liu
Mingming Gong
DiffM
61
1
0
11 Mar 2025
TSCnet: A Text-driven Semantic-level Controllable Framework for Customized Low-Light Image Enhancement
Miao Zhang
Jun Yin
Pengyu Zeng
Yiqing Shen
Shuai Lu
Xueqian Wang
DiffM
68
7
0
11 Mar 2025
U-StyDiT: Ultra-high Quality Artistic Style Transfer Using Diffusion Transformers
Zhanjie Zhang
Ao Ma
Ke Cao
Jing Wang
Shanyuan Liu
Yuhang Ma
Bo Cheng
Dawei Leng
Yuhui Yin
67
0
0
11 Mar 2025
FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset
Shuhe Wang
Xiaoya Li
Jiwei Li
G. Wang
Xiaofei Sun
...
Han Qiu
Mo Yu
Shengjie Shen
Tianwei Zhang
Eduard H. Hovy
VLM
63
0
0
10 Mar 2025
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model
Lixue Gong
Xiaoxia Hou
Fanshi Li
Liang Li
Xiaochen Lian
...
Qi Zhang
Yuwei Zhang
Shijia Zhao
Jianchao Yang
Weilin Huang
DiffM
VLM
63
6
0
10 Mar 2025
Inversion-Free Video Style Transfer with Trajectory Reset Attention Control and Content-Style Bridging
Jiang Lin
Zili Yi
DiffM
VGen
39
0
0
10 Mar 2025
ReelWave: A Multi-Agent Framework Toward Professional Movie Sound Generation
Zixuan Wang
Chi-Keung Tang
Yu-Wing Tai
DiffM
VGen
63
0
0
10 Mar 2025
Automated Movie Generation via Multi-Agent CoT Planning
Weijia Wu
Zeyu Zhu
Mike Zheng Shou
VGen
80
2
0
10 Mar 2025
AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models
Bo Huang
Wenlun Xu
Qizhuo Han
Haodong Jing
Ying Li
DiffM
36
0
0
10 Mar 2025
Efficient Distillation of Classifier-Free Guidance using Adapters
Cristian Perez Jensen
Seyedmorteza Sadat
53
1
0
10 Mar 2025
AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis
Zhangyu Lai
Yilin Lu
Xinyang Li
Jianghang Lin
Yansong Qu
Liujuan Cao
Ming Li
Rongrong Ji
DiffM
170
0
0
10 Mar 2025
TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision
Shaobin Zhuang
Yiwei Guo
Yanbo Ding
Kunchang Li
Xinyuan Chen
Yaohui Wang
Fangyikang Wang
Ying Zhang
Chen Li
Yijiao Wang
45
0
0
10 Mar 2025
Conceptrol: Concept Control of Zero-shot Personalized Image Generation
Qiyuan He
Angela Yao
DiffM
41
0
0
09 Mar 2025
M³amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification
Mingxiang Cao
Weiying Xie
Xin Zhang
Jiaqing Zhang
Kai Jiang
Jie Lei
Yunsong Li
Mamba
50
0
0
09 Mar 2025
Color Alignment in Diffusion
Ka Chun Shum
Binh-Son Hua
Duc Thanh Nguyen
Sai-Kit Yeung
65
0
0
09 Mar 2025