ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.15438
50
1

VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning

19 March 2025
Y. Tan
Chen Liu
Jingyuan Gao
Banghao Wu
Mingchen Li
Ruilin Wang
Lingrong Zhang
Huiqun Yu
Guisheng Fan
Liang Hong
Bingxin Zhou
ArXivPDFHTML
Abstract

Natural language processing (NLP) has significantly influenced scientific domains beyond human language, including protein engineering, where pre-trained protein language models (PLMs) have demonstrated remarkable success. However, interdisciplinary adoption remains limited due to challenges in data collection, task benchmarking, and application. This work presents VenusFactory, a versatile engine that integrates biological data retrieval, standardized task benchmarking, and modular fine-tuning of PLMs. VenusFactory supports both computer science and biology communities with choices of both a command-line execution and a Gradio-based no-code interface, integrating 40+40+40+ protein-related datasets and 40+40+40+ popular PLMs. All implementations are open-sourced onthis https URL.

View on arXiv
@article{tan2025_2503.15438,
  title={ VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning },
  author={ Yang Tan and Chen Liu and Jingyuan Gao and Banghao Wu and Mingchen Li and Ruilin Wang and Lingrong Zhang and Huiqun Yu and Guisheng Fan and Liang Hong and Bingxin Zhou },
  journal={arXiv preprint arXiv:2503.15438},
  year={ 2025 }
}
Comments on this paper