ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.06721
  4. Cited By
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image
  Diffusion Models

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

13 August 2023
Hu Ye
Jun Zhang
Siyi Liu
Xiao Han
Wei Yang
    DiffM
ArXivPDFHTML

Papers citing "IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models"

50 / 582 papers shown
Title
Towards Localized Fine-Grained Control for Facial Expression Generation
Towards Localized Fine-Grained Control for Facial Expression Generation
Tuomas Varanka
Huai-Qian Khor
Yante Li
Mengting Wei
Hanwei Kung
N. Sebe
Guoying Zhao
43
4
0
25 Jul 2024
PreciseControl: Enhancing Text-To-Image Diffusion Models with
  Fine-Grained Attribute Control
PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
Rishubh Parihar
VS Sachidanand
Sabariswaran Mani
Tejan Karmali
R. V. Babu
DiffM
44
13
0
24 Jul 2024
Text2Place: Affordance-aware Text Guided Human Placement
Text2Place: Affordance-aware Text Guided Human Placement
Rishubh Parihar
Harsh Gupta
VS Sachidanand
R. V. Babu
DiffM
45
5
0
22 Jul 2024
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
Bowen Zhang
Cheng Yang
Xuanhui Liu
DiffM
42
0
0
21 Jul 2024
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
Zheng Chong
Xiao Dong
Haoxiang Li
Shiyue Zhang
Wenqing Zhang
Xujie Zhang
Hanqing Zhao
D. Jiang
Xiaodan Liang
DiffM
60
18
0
21 Jul 2024
CoCoG-2: Controllable generation of visual stimuli for understanding
  human concept representation
CoCoG-2: Controllable generation of visual stimuli for understanding human concept representation
Chen Wei
Jiachen Zou
Dietmar Heinke
Quanying Liu
38
0
0
20 Jul 2024
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Sherwin Bahmani
Ivan Skorokhodov
Aliaksandr Siarohin
Willi Menapace
Guocheng Qian
...
Chaoyang Wang
Jiaxu Zou
Andrea Tagliasacchi
David B. Lindell
Sergey Tulyakov
VGen
DiffM
88
42
0
17 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
67
12
0
17 Jul 2024
Subject-driven Text-to-Image Generation via Preference-based
  Reinforcement Learning
Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
Yanting Miao
William Loh
Suraj Kothawade
Pascal Poupart
Abdullah Rashwan
Yeqing Li
EGVM
55
1
0
16 Jul 2024
UrbanWorld: An Urban World Model for 3D City Generation
UrbanWorld: An Urban World Model for 3D City Generation
Yu Shang
Jiansheng Chen
Hangyu Fan
Jingtao Ding
J. Feng
Yong Li
61
6
0
16 Jul 2024
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Yanqin Jiang
Chaohui Yu
Chenjie Cao
Fan Wang
Weiming Hu
Jin Gao
VGen
54
16
0
16 Jul 2024
MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed
  Image Restoration
MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration
Yulin Ren
Xin Li
Bingchen Li
Xingrui Wang
Mengxi Guo
Shijie Zhao
Li Zhang
Zhibo Chen
DiffM
43
7
0
15 Jul 2024
Addressing Image Hallucination in Text-to-Image Generation through
  Factual Image Retrieval
Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval
Youngsun Lim
Hyunjung Shim
DiffM
HILM
MQ
48
3
0
15 Jul 2024
DiffStega: Towards Universal Training-Free Coverless Image Steganography
  with Diffusion Models
DiffStega: Towards Universal Training-Free Coverless Image Steganography with Diffusion Models
Yiwei Yang
Zheyuan Liu
Jun Jia
Zhongpai Gao
Yunhao Li
Wei Sun
Xiaohong Liu
Guangtao Zhai
DiffM
24
6
0
15 Jul 2024
TCAN: Animating Human Images with Temporally Consistent Pose Guidance
  using Diffusion Models
TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
J. Kim
Min-Jung Kim
Junsoo Lee
Jaegul Choo
DiffM
39
5
0
12 Jul 2024
Still-Moving: Customized Video Generation without Customized Video Data
Still-Moving: Customized Video Generation without Customized Video Data
Hila Chefer
Shiran Zada
Roni Paiss
Ariel Ephrat
Omer Tov
Michael Rubinstein
Lior Wolf
Tali Dekel
T. Michaeli
Inbar Mosseri
DiffM
VGen
34
20
0
11 Jul 2024
Coherent and Multi-modality Image Inpainting via Latent Space
  Optimization
Coherent and Multi-modality Image Inpainting via Latent Space Optimization
Lingzhi Pan
Tong Zhang
Bingyuan Chen
Qi Zhou
Wei Ke
Sabine Süsstrunk
Mathieu Salzmann
DiffM
40
2
0
10 Jul 2024
Generative Image as Action Models
Generative Image as Action Models
Mohit Shridhar
Yat Long Lo
Stephen James
45
9
0
10 Jul 2024
Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for
  Text-to-Video Generation Task
Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task
Yiran Yang
Jinchao Zhang
Ying Deng
Jie Zhou
DiffM
31
0
0
09 Jul 2024
Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with
  Pre-trained Image Encoder
Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder
Jia Liu
Changlin Li
Qirui Sun
Jiahui Ming
Chen Fang
Jue Wang
Bing Zeng
Shuaicheng Liu
DiffM
37
3
0
08 Jul 2024
Magic Insert: Style-Aware Drag-and-Drop
Magic Insert: Style-Aware Drag-and-Drop
Nataniel Ruiz
Yuanzhen Li
Neal Wadhwa
Yael Pritch
Michael Rubinstein
David E. Jacobs
Shlomi Fruchter
DiffM
41
7
0
02 Jul 2024
Boosting Consistency in Story Visualization with Rich-Contextual
  Conditional Diffusion Models
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
Fei Shen
Hu Ye
Sibo Liu
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
92
35
0
02 Jul 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Y. Li
Fan Ma
Zongxin Yang
Yuqing Yang
101
11
0
02 Jul 2024
Label-free Neural Semantic Image Synthesis
Label-free Neural Semantic Image Synthesis
Jiayi Wang
Kevin Laube
Yumeng Li
J. H. Metzen
Shin-I Cheng
Julio Borges
Anna Khoreva
DiffM
36
0
0
01 Jul 2024
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized
  Sounds
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
Yiming Zhang
Yicheng Gu
Yanhong Zeng
Zhening Xing
Yuancheng Wang
Zhizheng Wu
Kai Chen
VGen
37
37
0
01 Jul 2024
StyleShot: A Snapshot on Any Style
StyleShot: A Snapshot on Any Style
Junyao Gao
Yanchen Liu
Yanan Sun
Yinhao Tang
Yanhong Zeng
Kai Chen
Cairong Zhao
TTA
3DH
VLM
82
15
0
01 Jul 2024
InstantStyle-Plus: Style Transfer with Content-Preserving in
  Text-to-Image Generation
InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation
Haofan Wang
Peng-Fei Xing
Renyuan Huang
Hao Ai
Qixun Wang
Xu Bai
DiffM
55
23
0
30 Jun 2024
AnyControl: Create Your Artwork with Versatile Control on Text-to-Image
  Generation
AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation
Yanan Sun
Yanchen Liu
Yinhao Tang
Wenjie Pei
Kai Chen
DiffM
42
9
0
27 Jun 2024
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data
William Berman
A. Peysakhovich
39
4
0
26 Jun 2024
Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image
Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image
Jinkun Hao
Junshu Tang
Jiangning Zhang
Ran Yi
Yijia Hong
Moran Li
Weijian Cao
Yating Wang
Lizhuang Ma
DiffM
51
0
0
24 Jun 2024
Character-Adapter: Prompt-Guided Region Control for High-Fidelity
  Character Customization
Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization
Yuhang Ma
Wenting Xu
Jiji Tang
Qinfeng Jin
Rongsheng Zhang
Zeng Zhao
Changjie Fan
Zhipeng Hu
48
6
0
24 Jun 2024
ResMaster: Mastering High-Resolution Image Generation via Structural and
  Fine-Grained Guidance
ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance
Shuwei Shi
Wenbo Li
Yuechen Zhang
Jingwen He
Biao Gong
Yinqiang Zheng
57
10
0
24 Jun 2024
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng
Yuxin Cui
Haomiao Tang
Zekun Qi
Runpei Dong
Jing Bai
Chunrui Han
Zheng Ge
Xiangyu Zhang
Shu-Tao Xia
EGVM
83
31
0
24 Jun 2024
Neural Pose Representation Learning for Generating and Transferring
  Non-Rigid Object Poses
Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses
Seungwoo Yoo
Juil Koo
Kyeongmin Yeo
Minhyuk Sung
3DH
DRL
37
0
0
14 Jun 2024
Interpreting the Weight Space of Customized Diffusion Models
Interpreting the Weight Space of Customized Diffusion Models
Amil Dravid
Yossi Gandelsman
Kuan-Chieh Jackson Wang
Rameen Abdal
Gordon Wetzstein
Alexei A. Efros
Kfir Aberman
34
10
0
13 Jun 2024
CLIPAway: Harmonizing Focused Embeddings for Removing Objects via
  Diffusion Models
CLIPAway: Harmonizing Focused Embeddings for Removing Objects via Diffusion Models
Yigit Ekin
Ahmet Burak Yildirim
Erdem Çağlar
Aykut Erdem
Erkut Erdem
Aysegül Dündar
DiffM
39
8
0
13 Jun 2024
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal
  Prompts
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts
Yucheng Han
Rui Wang
Chi Zhang
Juntao Hu
Pei Cheng
Bin-Bin Fu
Hanwang Zhang
77
6
0
13 Jun 2024
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
Bing Li
Cheng Zheng
Wenxuan Zhu
Jinjie Mai
Biao Zhang
Peter Wonka
Bernard Ghanem
48
16
0
12 Jun 2024
WMAdapter: Adding WaterMark Control to Latent Diffusion Models
WMAdapter: Adding WaterMark Control to Latent Diffusion Models
Hai Ci
Yiren Song
Pei Yang
Jinheng Xie
Mike Zheng Shou
WIGM
40
13
0
12 Jun 2024
Zero-shot Image Editing with Reference Imitation
Zero-shot Image Editing with Reference Imitation
Xi Chen
Yutong Feng
Mengting Chen
Yiyang Wang
Shilong Zhang
Yu Liu
Yujun Shen
Hengshuang Zhao
DiffM
37
21
0
11 Jun 2024
Ctrl-X: Controlling Structure and Appearance for Text-To-Image
  Generation Without Guidance
Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance
Kuan Heng Lin
Sicheng Mo
Ben Klingher
Fangzhou Mu
Bolei Zhou
DiffM
41
15
0
11 Jun 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
50
41
0
11 Jun 2024
Margin-aware Preference Optimization for Aligning Diffusion Models
  without Reference
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference
Jiwoo Hong
Sayak Paul
Noah Lee
Kashif Rasul
James Thorne
Jongheon Jeong
43
13
0
10 Jun 2024
ProcessPainter: Learn Painting Process from Sequence Data
ProcessPainter: Learn Painting Process from Sequence Data
Yiren Song
Shijie Huang
Chen Yao
Xiaojun Ye
Hai Ci
Jiaming Liu
Yuxuan Zhang
Mike Zheng Shou
DiffM
37
6
0
10 Jun 2024
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image
  Generation
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation
Lianyu Pang
Jian Yin
Baoquan Zhao
Feize Wu
Fu Lee Wang
Qing Li
Xudong Mao
DiffM
51
1
0
07 Jun 2024
VideoTetris: Towards Compositional Text-to-Video Generation
VideoTetris: Towards Compositional Text-to-Video Generation
Ye Tian
Ling Yang
Haotian Yang
Yuan Gao
Yufan Deng
...
Zhaochen Yu
Xin Tao
Pengfei Wan
Di Zhang
Bin Cui
DiffM
VGen
95
17
0
06 Jun 2024
Inv-Adapter: ID Customization Generation via Image Inversion and
  Lightweight Adapter
Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter
Peng-Fei Xing
Ning Wang
Jianbo Ouyang
Zechao Li
DiffM
44
1
0
05 Jun 2024
ORACLE: Leveraging Mutual Information for Consistent Character
  Generation with LoRAs in Diffusion Models
ORACLE: Leveraging Mutual Information for Consistent Character Generation with LoRAs in Diffusion Models
Kiymet Akdemir
Pinar Yanardag
DiffM
41
1
0
04 Jun 2024
V-Express: Conditional Dropout for Progressive Training of Portrait
  Video Generation
V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation
Cong Wang
Kuan Tian
Jun Zhang
Yonghang Guan
Feng Luo
Fei Shen
Zhiwei Jiang
Qing Gu
Xiao Han
Wei Yang
55
38
0
04 Jun 2024
GraVITON: Graph based garment warping with attention guided inversion
  for Virtual-tryon
GraVITON: Graph based garment warping with attention guided inversion for Virtual-tryon
Sanhita Pathak
V. Kaushik
Brejesh Lall
DiffM
48
0
0
04 Jun 2024
Previous
123...101112789
Next