DisProtEdit: Exploring Disentangled Representations for Multi-Attribute Protein Editing

17 June 2025

Max Ku

Sun Sun

Hongyu Guo

Wenhu Chen

ArXiv (abs)PDF HTML

Main:9 Pages

5 Figures

Bibliography:3 Pages

8 Tables

Appendix:3 Pages

Abstract

We introduce DisProtEdit, a controllable protein editing framework that leverages dual-channel natural language supervision to learn disentangled representations of structural and functional properties. Unlike prior approaches that rely on joint holistic embeddings, DisProtEdit explicitly separates semantic factors, enabling modular and interpretable control. To support this, we construct SwissProtDis, a large-scale multimodal dataset where each protein sequence is paired with two textual descriptions, one for structure and one for function, automatically decomposed using a large language model. DisProtEdit aligns protein and text embeddings using alignment and uniformity objectives, while a disentanglement loss promotes independence between structural and functional semantics. At inference time, protein editing is performed by modifying one or both text inputs and decoding from the updated latent representation. Experiments on protein editing and representation learning benchmarks demonstrate that DisProtEdit performs competitively with existing methods while providing improved interpretability and controllability. On a newly constructed multi-attribute editing benchmark, the model achieves a both-hit success rate of up to 61.7%, highlighting its effectiveness in coordinating simultaneous structural and functional edits.

View on arXiv

@article{ku2025_2506.14853,
  title={ DisProtEdit: Exploring Disentangled Representations for Multi-Attribute Protein Editing },
  author={ Max Ku and Sun Sun and Hongyu Guo and Wenhu Chen },
  journal={arXiv preprint arXiv:2506.14853},
  year={ 2025 }
}

Comments on this paper