ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.22869
7
0

CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models

28 May 2025
Junbo Yin
Chao Zha
Wenjia He
Chencheng Xu
Xin Gao
    DiffM
ArXivPDFHTML
Abstract

Existing PLMs generate protein sequences based on a single-condition constraint from a specific modality, struggling to simultaneously satisfy multiple constraints across different modalities. In this work, we introduce CFP-Gen, a novel diffusion language model for Combinatorial Functional Protein GENeration. CFP-Gen facilitates the de novo protein design by integrating multimodal conditions with functional, sequence, and structural constraints. Specifically, an Annotation-Guided Feature Modulation (AGFM) module is introduced to dynamically adjust the protein feature distribution based on composable functional annotations, e.g., GO terms, IPR domains and EC numbers. Meanwhile, the Residue-Controlled Functional Encoding (RCFE) module captures residue-wise interaction to ensure more precise control. Additionally, off-the-shelf 3D structure encoders can be seamlessly integrated to impose geometric constraints. We demonstrate that CFP-Gen enables high-throughput generation of novel proteins with functionality comparable to natural proteins, while achieving a high success rate in designing multifunctional proteins. Code and data available atthis https URL.

View on arXiv
@article{yin2025_2505.22869,
  title={ CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models },
  author={ Junbo Yin and Chao Zha and Wenjia He and Chencheng Xu and Xin Gao },
  journal={arXiv preprint arXiv:2505.22869},
  year={ 2025 }
}
Comments on this paper