
MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models

17 October 2024
Donghao Zhou
Jiancheng Huang
Jinbin Bai
Jiaze Wang
Hao Chen
Guangyong Chen
Xiaowei Hu
Pheng Ann Heng
Main: 7 Pages
18 Figures
Bibliography: 3 Pages
7 Tables
Appendix: 6 Pages
Abstract

Recent advancements in text-to-image (T2I) diffusion models have enabled the creation of high-quality images from text prompts, but they still struggle to generate images with precise control over specific visual concepts. Existing approaches can replicate a given concept by learning from reference images, yet they lack the flexibility for fine-grained customization of individual components within a concept. In this paper, we introduce component-controllable personalization, a novel task that pushes the boundaries of T2I models by allowing users to reconfigure specific components when personalizing visual concepts. This task is particularly challenging due to two primary obstacles: semantic pollution, where unwanted visual elements corrupt the personalized concept, and semantic imbalance, which causes disproportionate learning of the concept and the component. To overcome these challenges, we design MagicTailor, an innovative framework that leverages Dynamic Masked Degradation (DM-Deg) to dynamically perturb undesired visual semantics and Dual-Stream Balancing (DS-Bal) to establish a balanced learning paradigm for desired visual semantics. Extensive comparisons, ablations, and analyses demonstrate that MagicTailor not only excels in this challenging task but also holds significant promise for practical applications, paving the way for more nuanced and creative image generation.

@article{zhou2025_2410.13370,
  title={MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models},
  author={Donghao Zhou and Jiancheng Huang and Jinbin Bai and Jiaze Wang and Hao Chen and Guangyong Chen and Xiaowei Hu and Pheng-Ann Heng},
  journal={arXiv preprint arXiv:2410.13370},
  year={2025}
}