2308.06721
Cited By
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
13 August 2023
Hu Ye
Jun Zhang
Siyi Liu
Xiao Han
Wei Yang
DiffM
Papers citing
"IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models"
50 / 579 papers shown
Title
LocRef-Diffusion: Tuning-Free Layout and Appearance-Guided Generation
Fan Deng
Yaguang Wu
Xinyang Yu
Xiangjun Huang
Jian Yang
Guangyu Yan
Qiang Xu
DiffM
94
0
0
22 Nov 2024
AnyText2: Visual Text Generation and Editing With Customizable Attributes
Yuxiang Tuo
Yifeng Geng
Liefeng Bo
VLM
93
6
0
22 Nov 2024
Foundation Cures Personalization: Improving Personalized Models' Prompt Consistency via Hidden Foundation Knowledge
Yiyang Cai
Zhengkai Jiang
Yong-Jin Liu
Chunyang Jiang
Wei Xue
Wenhan Luo
Yike Guo
98
0
0
22 Nov 2024
GalaxyEdit: Large-Scale Image Editing Dataset with Enhanced Diffusion Adapter
Aniruddha Bala
Rohan Jaiswal
Loay Rashid
Siddharth Roheda
77
0
0
21 Nov 2024
Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images
Xuechao Zou
Shun Zhang
Kai Li
Shiying Wang
Junliang Xing
Lei Jin
Congyan Lang
Pin Tao
66
1
0
20 Nov 2024
From Text to Pose to Image: Improving Diffusion Model Control and Quality
Clément Bonnet
Ariel N. Lee
Franck Wertel
Antoine Tamano
Tanguy Cizain
Pablo Ducru
DiffM
71
0
0
19 Nov 2024
Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method
Yan Zheng
Zhenxiao Liang
Xiaoyan Cong
Lanqing Guo
Yuehao Wang
Peihao Wang
Zihan Wang
DiffM
35
2
0
17 Nov 2024
OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models
Mathis Koroglu
Hugo Caselles-Dupré
Guillaume Jeanneret Sanmiguel
Matthieu Cord
VGen
DiffM
20
1
0
15 Nov 2024
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Zhennan Chen
Yajie Li
Haofan Wang
Z. Chen
Zhengkai Jiang
Jun Yu Li
Qian Wang
Jian Yang
Ying Tai
DiffM
52
8
0
10 Nov 2024
I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength
Wanquan Feng
Jiawei Liu
Pengqi Tu
Tianhao Qi
Mingzhen Sun
Tianxiang Ma
Mingcong Liu
Siyu Zhou
Qian He
VGen
55
7
0
10 Nov 2024
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning
David Junhao Zhang
Roni Paiss
Shiran Zada
Nikhil Karnad
David E. Jacobs
Yael Pritch
Inbar Mosseri
Mike Zheng Shou
Neal Wadhwa
Nataniel Ruiz
DiffM
VGen
73
15
0
07 Nov 2024
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
Panwen Hu
Jin Jiang
Jianqi Chen
Mingfei Han
Shengcai Liao
Xiaojun Chang
Xiaodan Liang
VGen
DiffM
43
5
0
07 Nov 2024
MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation
Han Yang
Sotiris Anagnostidis
Enis Simsar
Thomas Hofmann
DiffM
28
0
0
07 Nov 2024
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
Koichi Namekata
Sherwin Bahmani
Ziyi Wu
Yash Kant
Igor Gilitschenski
David B. Lindell
VGen
65
13
0
07 Nov 2024
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
Ashutosh Srivastava
Tarun Ram Menta
Abhinav Java
Avadhoot Jadhav
Silky Singh
Surgan Jandial
Balaji Krishnamurthy
DiffM
38
1
0
06 Nov 2024
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Wei Cheng
Juncheng Mu
Xianfang Zeng
Xin Chen
Anqi Pang
...
Zhibin Wang
Bin-Bin Fu
Gang Yu
Z. Liu
Liang Pan
44
9
0
04 Nov 2024
Controlling Language and Diffusion Models by Transporting Activations
P. Rodríguez
Arno Blaas
Michal Klein
Luca Zappella
N. Apostoloff
Marco Cuturi
Xavier Suau
LLMSV
40
4
0
30 Oct 2024
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
Shengkai Zhang
Nianhong Jiao
Tian Li
Chaojie Yang
Chenhui Xue
Boya Niu
Jun Gao
VGen
VLM
DiffM
34
1
0
30 Oct 2024
Prune and Repaint: Content-Aware Image Retargeting for any Ratio
Feihong Shen
Chong Li
Yifeng Geng
Yongjian Deng
Hao Chen
29
1
0
30 Oct 2024
FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference Images
Zheng Yu
Yaohua Wang
Siying Cui
Aixi Zhang
Wei-Long Zheng
Senzhang Wang
36
0
0
30 Oct 2024
Paint Bucket Colorization Using Anime Character Color Design Sheets
Yuekun Dai
Qinyue Li
Shangchen Zhou
Yihang Luo
Chongyi Li
Chen Change Loy
32
0
0
25 Oct 2024
Unbounded: A Generative Infinite Game of Character Life Simulation
Jialu Li
Yuanzhen Li
Neal Wadhwa
Yael Pritch
David E. Jacobs
Michael Rubinstein
Joey Tianyi Zhou
Nataniel Ruiz
VGen
AI4CE
36
4
0
24 Oct 2024
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Yuang Ai
Xiaoqiang Zhou
Huaibo Huang
Xiaotian Han
Zhengyu Chen
Quanzeng You
Hongxia Yang
50
9
0
24 Oct 2024
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
Liwen Wang
Sheng Chen
Linnan Jiang
Shu Pan
Runze Cai
Sen Yang
Fei Yang
49
3
0
24 Oct 2024
Group Diffusion Transformers are Unsupervised Multitask Learners
Lianghua Huang
Wei Wang
Zhi-Fan Wu
Huanzhang Dou
Yupeng Shi
Yutong Feng
C. Liang
Yu Liu
Jingren Zhou
VLM
49
12
0
19 Oct 2024
HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation
Bo Cheng
Yuhang Ma
Liebucha Wu
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Dawei Leng
Yuhui Yin
DiffM
30
8
0
18 Oct 2024
Assessing Open-world Forgetting in Generative Image Model Customization
Héctor Laria
Alex Gomez-Villa
Imad Eddine Marouf
Bogdan Raducanu
VLM
DiffM
37
0
0
18 Oct 2024
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Yujie Wei
Shiwei Zhang
Hangjie Yuan
Xiang Wang
Haonan Qiu
...
F. Liu
Zhizhong Huang
Jiaxin Ye
Yingya Zhang
Hongming Shan
DiffM
VGen
72
14
0
17 Oct 2024
RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models
Haoran Hao
Jiaming Han
Changsheng Li
Yu-Feng Li
Xiangyu Yue
RALM
56
1
0
17 Oct 2024
DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model
Jingxiang Sun
Cheng Peng
Ruizhi Shao
Y. Guo
Xiaochen Zhao
Yangguang Li
Yanpei Cao
Bo Zhang
Yebin Liu
46
2
0
16 Oct 2024
3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation
Dewei Zhou
Ji Xie
Zongxin Yang
Yi Yang
DiffM
70
7
0
16 Oct 2024
DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning
Jiabao Wei
Zhiyuan Ma
DiffM
43
0
0
16 Oct 2024
FaceChain-FACT: Face Adapter with Decoupled Training for Identity-preserved Personalization
Cheng Yu
Haoyu Xie
Lei Shang
Yong-Jin Liu
Jun Dan
Liefeng Bo
Baigui Sun
24
2
0
16 Oct 2024
TV-3DG: Mastering Text-to-3D Customized Generation with Visual Prompt
Jiahui Yang
Donglin Di
Baorui Ma
Xun Yang
Yongjia Ma
...
Wei Chen
Jianxun Cui
Zhou Xue
Meng Wang
Yebin Liu
DiffM
45
1
0
16 Oct 2024
Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling
Guiyu Zhang
Huan-ang Gao
Zijian Jiang
Hao Zhao
Zhedong Zheng
EGVM
52
6
0
15 Oct 2024
A Simple Approach to Unifying Diffusion-based Conditional Generation
Xirui Li
Charles Herrmann
Kelvin C.K. Chan
Yinxiao Li
Deqing Sun
Chao Ma
Ming Yang
DiffM
VLM
43
1
0
15 Oct 2024
SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Xilin He
Cheng Luo
Xiaole Xian
Bing Li
Siyang Song
Muhammad Haris Khan
Weicheng Xie
L. Shen
Zongyuan Ge
41
3
0
13 Oct 2024
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Eungbean Lee
Somi Jeong
Kwanghoon Sohn
DiffM
30
1
0
13 Oct 2024
RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image
Xiaoxue Chen
Jv Zheng
Hao Huang
Haoran Xu
Weihao Gu
...
He Xiang
Huan-ang Gao
Hao Zhao
Guyue Zhou
Yaqin Zhang
3DGS
48
2
0
10 Oct 2024
ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion
Zitian Zhang
Frédéric Fortier-Chouinard
Mathieu Garon
Anand Bhattad
Jean-François Lalonde
DiffM
44
4
0
10 Oct 2024
HARIVO: Harnessing Text-to-Image Models for Video Generation
Mingi Kwon
Seoung Wug Oh
Yang Zhou
Difan Liu
Joon-Young Lee
Haoran Cai
Baqiao Liu
Feng Liu
Youngjung Uh
VGen
43
1
0
10 Oct 2024
InstructG2I: Synthesizing Images from Multimodal Attributed Graphs
Bowen Jin
Ziqi Pang
Bingjun Guo
Yu-Xiong Wang
Jiaxuan You
Jiawei Han
DiffM
47
1
0
09 Oct 2024
Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis
Bohan Zeng
Ling Yang
Siyu Li
Jiaming Liu
Zixiang Zhang
...
Yongzhen Guo
Fu-Yun Wang
Minkai Xu
Stefano Ermon
Wentao Zhang
VGen
AI4CE
31
7
0
09 Oct 2024
Personalized Visual Instruction Tuning
Renjie Pi
Jianshu Zhang
Tianyang Han
Jipeng Zhang
Rui Pan
Tong Zhang
MLLM
39
6
0
09 Oct 2024
Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques
Benyuan Meng
Qianqian Xu
Zitai Wang
Zhiyong Yang
Xiaochun Cao
Qingming Huang
23
0
0
09 Oct 2024
InstantIR: Blind Image Restoration with Instant Generative Reference
Jen-Yuan Huang
Haofan Wang
Qixun Wang
Xu Bai
Hao Ai
Peng-Fei Xing
Jen-tse Huang
30
1
0
09 Oct 2024
Story-Adapter: A Training-free Iterative Framework for Long Story Visualization
Jiawei Mao
Xiaoke Huang
Yunfei Xie
Yuanqi Chang
Mude Hui
Bingjie Xu
Yuyin Zhou
VGen
DiffM
43
0
0
08 Oct 2024
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction
Leheng Li
Weichao Qiu
Xu Yan
Jing He
Kaiqiang Zhou
Yingjie Cai
Qing Lian
Bingbing Liu
Ying-Cong Chen
SyDa
DiffM
47
1
0
07 Oct 2024
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise
Yepeng Liu
Yiren Song
Hai Ci
Yu Zhang
Haofan Wang
Mike Zheng Shou
Yuheng Bu
WIGM
58
3
0
07 Oct 2024
Beyond Imperfections: A Conditional Inpainting Approach for End-to-End Artifact Removal in VTON and Pose Transfer
Aref Tabatabaei
Zahra Dehghanian
M. Amirmazlaghani
DiffM
37
0
0
05 Oct 2024