ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.06721
  4. Cited By
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image
  Diffusion Models

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

13 August 2023
Hu Ye
Jun Zhang
Siyi Liu
Xiao Han
Wei Yang
    DiffM
ArXivPDFHTML

Papers citing "IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models"

50 / 579 papers shown
Title
DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion Model
DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion Model
Siwei Xia
Li Sun
Tiantian Sun
Qingli Li
DiffM
6
0
0
18 May 2025
Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data
Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data
Yiwen Liu
Jessica Bader
Jae Myung Kim
DiffM
16
0
0
15 May 2025
Generative Pre-trained Autoregressive Diffusion Transformer
Generative Pre-trained Autoregressive Diffusion Transformer
Yuan Zhang
Jiacheng Jiang
Guoqing Ma
Zhiying Lu
Haoyang Huang
Jianlong Yuan
Nan Duan
VGen
43
1
0
12 May 2025
ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models
ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models
Ozgur Kara
Krishna Kumar Singh
Feng Liu
Duygu Ceylan
James M. Rehg
Tobias Hinz
DiffM
VGen
41
0
0
12 May 2025
Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets
Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets
Weiyu Li
Xuanyang Zhang
Zheng Sun
Di Qi
Yiming Li
...
Zeming Li
Gang Yu
Xiangyu Zhang
Daxin Jiang
Ping Tan
43
0
0
12 May 2025
ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images
ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images
Xianghao Kong
Qiaosong Qi
Yuanbin Wang
Anyi Rao
Biaolong Chen
Aixi Zhang
Si Liu
Hao Jiang
DiffM
VGen
25
0
0
10 May 2025
Photovoltaic Defect Image Generator with Boundary Alignment Smoothing Constraint for Domain Shift Mitigation
Photovoltaic Defect Image Generator with Boundary Alignment Smoothing Constraint for Domain Shift Mitigation
Dongying Li
Binyi Su
Hua Zhang
Yong Li
Haiyong Chen
146
0
0
09 May 2025
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
Kunpeng Qiu
Zhiqiang Gao
Zhiying Zhou
Mingjie Sun
Yongxin Guo
MedIm
34
0
0
09 May 2025
EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution
EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution
Haizhen Xie
Kunpeng Du
Qiangyu Yan
Sen Lu
Jianhong Han
Hanting Chen
Hailin Hu
Jie Hu
53
0
0
08 May 2025
Multi-turn Consistent Image Editing
Multi-turn Consistent Image Editing
Zijun Zhou
Yingying Deng
Xiangyu He
Weiming Dong
Fan Tang
53
0
0
07 May 2025
MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation
MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation
Zilong Chen
Yikai Wang
Wenqiang Sun
Feng Wang
Yiwen Chen
Huaping Liu
34
0
0
07 May 2025
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
L. Wang
Senmao Li
Fei Yang
Jianye Wang
Ziheng Zhang
Yong Liu
Y. Wang
Jian Yang
DiffM
61
0
0
06 May 2025
Distribution-Conditional Generation: From Class Distribution to Creative Generation
Distribution-Conditional Generation: From Class Distribution to Creative Generation
Fu Feng
Yucheng Xie
Xu Yang
Jing Wang
Xin Geng
DiffM
31
0
0
06 May 2025
Multimodal Benchmarking and Recommendation of Text-to-Image Generation Models
Multimodal Benchmarking and Recommendation of Text-to-Image Generation Models
Kapil Wanaskar
Gaytri Jena
Magdalini Eirinaki
EGVM
36
0
0
06 May 2025
SynSHRP2: A Synthetic Multimodal Benchmark for Driving Safety-critical Events Derived from Real-world Driving Data
SynSHRP2: A Synthetic Multimodal Benchmark for Driving Safety-critical Events Derived from Real-world Driving Data
Liang Shi
Boyu Jiang
Zhenyuan Yuan
Miguel A. Perez
Feng Guo
34
0
0
06 May 2025
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
Zinan Guo
Pengze Zhang
Yanze Wu
Chong Mou
Mingcong Liu
Qian He
33
0
0
05 May 2025
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Xuzhi Zhang
Jintao Guo
Shanshan Zhao
Minghao Fu
Lunhao Duan
Guo-Hua Wang
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
74
0
0
05 May 2025
GarmentDiffusion: 3D Garment Sewing Pattern Generation with Multimodal Diffusion Transformers
GarmentDiffusion: 3D Garment Sewing Pattern Generation with Multimodal Diffusion Transformers
Xinyu Li
Qi Yao
Yin Wang
DiffM
48
0
0
30 Apr 2025
Visual Text Processing: A Comprehensive Review and Unified Evaluation
Visual Text Processing: A Comprehensive Review and Unified Evaluation
Yan Shu
Weichao Zeng
Fangmin Zhao
Zeyu Chen
Zeju Li
...
Paolo Rota
Xiang Bai
Lianwen Jin
Xu-Cheng Yin
N. Sebe
CoGe
64
0
0
30 Apr 2025
YoChameleon: Personalized Vision and Language Generation
YoChameleon: Personalized Vision and Language Generation
Thao Nguyen
Krishna Kumar Singh
Jing Shi
Trung H. Bui
Yong Jae Lee
Yuheng Li
MLLM
82
0
0
29 Apr 2025
RepText: Rendering Visual Text via Replicating
RepText: Rendering Visual Text via Replicating
Haozhao Wang
Yongjun Xu
Yongqian Li
Jiajun Li
Chaowei Zhang
J. Wang
Kejia Yang
Z. Chen
VLM
66
0
0
28 Apr 2025
CompleteMe: Reference-based Human Image Completion
CompleteMe: Reference-based Human Image Completion
Yu-Ju Tsai
Brian L. Price
Qing Liu
Luis Figueroa
D. Pakhomov
Zhihong Ding
Scott D. Cohen
Ming Yang
3DH
52
0
0
28 Apr 2025
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation
Aviv Slobodkin
Hagai Taitelbaum
Yonatan Bitton
Brian Gordon
Michal Sokolik
...
Almog Gueta
Royi Rassin
Itay Laish
Dani Lischinski
Idan Szpektor
EGVM
VGen
43
0
0
24 Apr 2025
DreamO: A Unified Framework for Image Customization
DreamO: A Unified Framework for Image Customization
Chong Mou
Yanze Wu
Wenxu Wu
Zinan Guo
Pengze Zhang
...
Shaojin Wu
Songtao Zhao
Jian Zhang
Qian He
Xinglong Wu
49
0
0
23 Apr 2025
Subject-driven Video Generation via Disentangled Identity and Motion
Subject-driven Video Generation via Disentangled Identity and Motion
Daneul Kim
Jingxu Zhang
W. Jin
Sunghyun Cho
Qi Dai
Jaesik Park
Chong Luo
DiffM
VGen
115
0
0
23 Apr 2025
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
Le Zhuo
Liangbing Zhao
Sayak Paul
Yue Liao
Renrui Zhang
Yi Xin
Peng Gao
Mohamed Elhoseiny
Yiming Li
VLM
75
0
0
22 Apr 2025
"I Know It When I See It": Mood Spaces for Connecting and Expressing Visual Concepts
"I Know It When I See It": Mood Spaces for Connecting and Expressing Visual Concepts
Huzheng Yang
Katherine Xu
Michael D. Grossberg
Yutong Bai
Jianbo Shi
36
0
0
21 Apr 2025
MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World
MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World
Ankit Dhiman
Manan Shah
R. V. Babu
31
0
0
21 Apr 2025
A Controllable Appearance Representation for Flexible Transfer and Editing
A Controllable Appearance Representation for Flexible Transfer and Editing
Santiago Jimenez-Navarro
Julia Guerrero-Viu
B. Masiá
DiffM
33
0
0
21 Apr 2025
Insert Anything: Image Insertion via In-Context Editing in DiT
Insert Anything: Image Insertion via In-Context Editing in DiT
Wensong Song
Hong Jiang
Zongxing Yang
Ruijie Quan
Yi Yang
DiffM
45
0
0
21 Apr 2025
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning
Fulong Ye
Miao Hua
Pengze Zhang
Xinghui Li
Qichao Sun
Mingcong Liu
Qian He
Xinglong Wu
66
0
0
20 Apr 2025
Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis
Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis
Zichuan Liu
Liming Jiang
Qing Yan
Yumin Jia
Hao Kang
Xin Lu
DiffM
31
0
0
19 Apr 2025
Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation
Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation
Fulvio Sanguigni
Davide Morelli
Marcella Cornia
Rita Cucchiara
DiffM
40
0
0
18 Apr 2025
ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis
ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis
Andrea Rigo
Luca Stornaiuolo
Mauro Martino
Bruno Lepri
N. Sebe
48
0
0
18 Apr 2025
Personalized Text-to-Image Generation with Auto-Regressive Models
Personalized Text-to-Image Generation with Auto-Regressive Models
Kaiyue Sun
Xian Liu
Yao Teng
Xihui Liu
38
0
0
17 Apr 2025
UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models
UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models
Guanlong Jiao
Biqing Huang
Kuan-Chieh Wang
Renjie Liao
DiffM
82
0
0
17 Apr 2025
Image-Editing Specialists: An RLAIF Approach for Diffusion Models
Image-Editing Specialists: An RLAIF Approach for Diffusion Models
Elior Benarous
Yilun Du
Heng Yang
22
0
0
17 Apr 2025
Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off
Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off
Riza Velioglu
Petra Bevandic
Robin Chan
Barbara Hammer
DiffM
36
0
0
17 Apr 2025
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework
Jiale Tao
Yanbing Zhang
Qixun Wang
Yiji Cheng
Haofan Wang
...
Ruihuang Li
Linqing Wang
Chunyu Wang
Qin Lin
Qinglin Lu
DiffM
47
1
0
16 Apr 2025
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
Tianhui Song
Weixin Feng
Shuai Wang
Xinfeng Li
Tiezheng Ge
Bo Zheng
Limin Wang
MoMe
62
0
0
16 Apr 2025
WORLDMEM: Long-term Consistent World Simulation with Memory
WORLDMEM: Long-term Consistent World Simulation with Memory
Zeqi Xiao
Yushi Lan
Yifan Zhou
Wenqi Ouyang
Shuai Yang
Yanhong Zeng
Xingang Pan
78
0
0
16 Apr 2025
DuoLoRA : Cycle-consistent and Rank-disentangled Content-Style Personalization
DuoLoRA : Cycle-consistent and Rank-disentangled Content-Style Personalization
Aniket Roy
Shubhankar Borse
Shreya Kadambi
Debasmit Das
Shweta Mahajan
...
Hyojin Park
Ankita Nayak
Rama Chellappa
Munawar Hayat
Fatih Porikli
MoMe
50
0
0
15 Apr 2025
VideoPanda: Video Panoramic Diffusion with Multi-view Attention
VideoPanda: Video Panoramic Diffusion with Multi-view Attention
Kevin Xie
Amirmojtaba Sabour
Jiahui Huang
Despoina Paschalidou
G. Klár
Umar Iqbal
Sanja Fidler
Fangyin Wei
VGen
MDE
39
0
0
15 Apr 2025
IlluSign: Illustrating Sign Language Videos by Leveraging the Attention Mechanism
IlluSign: Illustrating Sign Language Videos by Leveraging the Attention Mechanism
Janna Bruner
Amit Moryossef
Lior Wolf
DiffM
SLR
50
0
0
15 Apr 2025
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing
Taihang Hu
Linxuan Li
Kai Wang
Yaxing Wang
Jian Yang
Ming-Ming Cheng
DiffM
VGen
23
0
0
14 Apr 2025
Flux Already Knows -- Activating Subject-Driven Image Generation without Training
Flux Already Knows -- Activating Subject-Driven Image Generation without Training
Hao Kang
Stathi Fotiadis
Liming Jiang
Qing Yan
Yumin Jia
Zichuan Liu
Min Jin Chong
Xin Lu
40
0
0
12 Apr 2025
CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model
CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model
Ruohao Zhan
Yijin Li
Yisheng He
Shuo Chen
Yichen Shen
Xinyu Chen
Zilong Dong
Zhaoyang Huang
Guofeng Zhang
DiffM
36
0
0
11 Apr 2025
DreamFuse: Adaptive Image Fusion with Diffusion Transformer
DreamFuse: Adaptive Image Fusion with Diffusion Transformer
Junjia Huang
Pengxiang Yan
Jiyang Liu
Jie Wu
Zhao Wang
Yitong Wang
Liang Lin
G. Li
37
0
0
11 Apr 2025
Teaching Humans Subtle Differences with DIFFusion
Teaching Humans Subtle Differences with DIFFusion
Mia Chiquier
Orr Avrech
Yossi Gandelsman
Berthy Feng
Katherine Bouman
Carl Vondrick
DiffM
51
0
0
10 Apr 2025
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
Zhong-Yu Li
Ruoyi Du
Juncheng Yan
Le Zhuo
Zhen Li
Peng Gao
Zhanyu Ma
Ming-Ming Cheng
VLM
72
2
0
10 Apr 2025
1234...101112
Next