ResearchTrend.AI

Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models

5 April 2023
Xuhui Jia
Yang Zhao
Kelvin C. K. Chan
Yandong Li
Han-Ying Zhang
Boqing Gong
Tingbo Hou
Haoran Wang
Yu-Chuan Su
    DiffM

Papers citing "Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models"

45 / 95 papers shown
Title
Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
Junjie Shentu
Matthew Watson
Noura Al Moubayed
25
0
0
15 Feb 2024
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Jisu Nam
Heesu Kim
Dongjae Lee
Siyoon Jin
Seungryong Kim
Seunggyu Chang
DiffM
32
40
0
15 Feb 2024
$λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Maitreya Patel
Sangmin Jung
Chitta Baral
Yezhou Yang
VLM
31
29
0
07 Feb 2024
CapHuman: Capture Your Moments in Parallel Universes
Chao Liang
Fan Ma
Linchao Zhu
Yingying Deng
Yi Yang
DiffM
31
23
0
01 Feb 2024
Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization
Henglei Lv
Jiayu Xiao
Liang Li
Qingming Huang
DiffM
33
5
0
30 Jan 2024
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Senthil Purushwalkam
Akash Gokul
Shafiq Joty
Nikhil Naik
DiffM
46
17
0
25 Jan 2024
360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
Qian Wang
Weiqi Li
Chong Mou
Xinhua Cheng
Jian Zhang
VGen
58
17
0
12 Jan 2024
Instruct-Imagen: Image Generation with Multi-modal Instruction
Hexiang Hu
Kelvin C. K. Chan
Yu-Chuan Su
Wenhu Chen
Yandong Li
...
Xue Ben
Boqing Gong
William W. Cohen
Ming-Wei Chang
Xuhui Jia
MLLM
48
43
0
03 Jan 2024
Restoration by Generation with Constrained Priors
Zheng Ding
Xuaner Zhang
Zhuowen Tu
Zhihao Xia
DiffM
31
3
0
28 Dec 2023
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Yuxuan Zhang
Yiren Song
Jiaming Liu
Rui Wang
Jinpeng Yu
...
Huaxia Li
Xu Tang
Yao Hu
Han Pan
Zhongliang Jing
51
59
0
26 Dec 2023
Cross Initialization for Personalized Text-to-Image Generation
Lianyu Pang
Jian Yin
Haoran Xie
Qiping Wang
Qing Li
Xudong Mao
DiffM
41
7
0
26 Dec 2023
DreamTuner: Single Image is Enough for Subject-Driven Generation
Miao Hua
Jiawei Liu
Fei Ding
Wei Liu
Jie Wu
Qian He
28
28
0
21 Dec 2023
Stellar: Systematic Evaluation of Human-Centric Personalized Text-to-Image Methods
Panos Achlioptas
Alexandros Benetatos
Iordanis Fostiropoulos
Dimitris Skourtis
34
8
0
11 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
67
190
0
07 Dec 2023
ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet
Soon Yau Cheong
Armin Mustafa
Andrew Gilbert
DiffM
29
5
0
05 Dec 2023
FaceStudio: Put Your Face Everywhere in Seconds
Yuxuan Yan
C. Zhang
Rui Wang
Yichao Zhou
Gege Zhang
Pei Cheng
Gang Yu
Bin-Bin Fu
DiffM
43
41
0
05 Dec 2023
Orthogonal Adaptation for Modular Customization of Diffusion Models
Ryan Po
Guandao Yang
Kfir Aberman
Gordon Wetzstein
DiffM
33
26
0
05 Dec 2023
ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation
Dar-Yen Chen
Hamish Tennent
Ching-Wen Hsu
DiffM
24
23
0
04 Dec 2023
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
Yuchao Gu
Yipin Zhou
Bichen Wu
Licheng Yu
Jia-Wei Liu
Rui Zhao
Jay Zhangjie Wu
David Junhao Zhang
Mike Zheng Shou
Kevin Tang
DiffM
VGen
70
37
0
04 Dec 2023
VideoBooth: Diffusion-based Video Generation with Image Prompts
Yuming Jiang
Tianxing Wu
Shuai Yang
Chenyang Si
Dahua Lin
Yu Qiao
Chen Change Loy
Ziwei Liu
DiffM
VGen
48
66
0
01 Dec 2023
HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models
Zhonghao Wang
Wei Wei
Yang Zhao
Zhisheng Xiao
M. Hasegawa-Johnson
Humphrey Shi
Tingbo Hou
DiffM
41
11
0
30 Nov 2023
When StyleGAN Meets Stable Diffusion: a $\mathscr{W}_+$ Adapter for Personalized Image Generation
Xiaoming Li
Xinyu Hou
Chen Change Loy
38
11
0
29 Nov 2023
CLiC: Concept Learning in Context
Mehdi Safaee
Aryan Mikaeili
Or Patashnik
Daniel Cohen-Or
Ali Mahdavi-Amiri
36
11
0
28 Nov 2023
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Omri Avrahami
Amir Hertz
Yael Vinker
Moab Arar
Shlomi Fruchter
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
DiffM
60
32
0
16 Nov 2023
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
38
103
0
11 Oct 2023
Diffusion Model for Camouflaged Object Detection
Zhe Chen
Rongrong Gao
Tian-Zhu Xiang
Fanzhao Lin
DiffM
45
19
0
01 Aug 2023
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
Jiancang Ma
Junhao Liang
Chen Chen
H. Lu
31
138
0
21 Jul 2023
AnyDoor: Zero-shot Object-level Image Customization
Xi Chen
Lianghua Huang
Yu Liu
Yujun Shen
Deli Zhao
Hengshuang Zhao
DiffM
51
256
0
18 Jul 2023
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Wei Wei
Tingbo Hou
Yael Pritch
Neal Wadhwa
Michael Rubinstein
Kfir Aberman
DiffM
31
173
0
13 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
40
788
0
10 Jul 2023
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation
Shaozhe Hao
Kai Han
Shihao Zhao
Kwan-Yee K. Wong
34
10
0
01 Jun 2023
Inserting Anybody in Diffusion Models via Celeb Basis
Genlan Yuan
Xiaodong Cun
Yong Zhang
Maomao Li
Chenyang Qi
Xintao Wang
Ying Shan
Huicheng Zheng
DiffM
25
52
0
01 Jun 2023
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
Yuchao Gu
Xintao Wang
Jay Zhangjie Wu
Yujun Shi
Yunpeng Chen
...
Shuning Chang
Wei Wu
Yixiao Ge
Ying Shan
Mike Zheng Shou
DiffM
54
167
0
29 May 2023
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Omri Avrahami
Kfir Aberman
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
VLM
DiffM
43
165
0
25 May 2023
BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing
Dongxu Li
Junnan Li
Steven C. H. Hoi
42
303
0
24 May 2023
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation
Hong Chen
Yipeng Zhang
Simin Wu
Xin Eric Wang
Xuguang Duan
Yuwei Zhou
Wenwu Zhu
DiffM
28
47
0
05 May 2023
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
79
79
0
13 Apr 2023
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Jing Shi
Wei Xiong
Zhe Lin
H. J. Jung
DiffM
133
281
0
06 Apr 2023
Subject-driven Text-to-Image Generation via Apprenticeship Learning
Wenhu Chen
Hexiang Hu
Yandong Li
Nataniel Ruiz
Xuhui Jia
Ming-Wei Chang
William W. Cohen
DiffM
33
187
0
01 Apr 2023
StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing
Senmao Li
Joost van de Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
48
52
0
28 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
52
270
0
20 Mar 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
197
526
0
02 Jan 2023
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Wenhu Chen
Hexiang Hu
Chitwan Saharia
William W. Cohen
VLM
131
164
0
29 Sep 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
257
4,816
0
24 Feb 2021
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
333
10,391
0
12 Dec 2018